/*****************************************************************************/
/* Document     : Simplified descriptions and examples of some well known    */
/*                technical Architectures, or methods, in IT.                */
/* Doc. Versie  : 25                                                         */
/* File         : architectures.txt                                          */
/* Date         : 05-04-2009                                                 */
/* Content      : a few (hopefully) interesting views on some architectures. */  
/*                But it's geared somewhat towards popular platforms.        */
/* Compiled by  : Albert van der Sel                                         */                                   
/* Note         : The independed sections are separated by                   */
/*                three "###" lines, for easy identification.                */
/* Best usage   : Use find/search in your editor to search for a             */
/*                frase, identifier, command etc.., or on the literal        */
/*                text mentioned in the Contents (Section number + text).    */
/*****************************************************************************/


Contents:

Section 1.  A Scetch of the CORBA architecture: (this is a very lightweight discussion)
Section 2.  (Traditional) Client connections to SQL Server
Section 3.  N-tier architectures: (1) Browser based Client connections to a Server
Section 4.  IPC: Named pipes, Sockets, and Multiprotocol in Windows
Section 5.  IPC in UNIX
Section 6.  "Traditional" Cluster system in Redhat Linux
Section 7.  Oracle 10g RAC example on Redhat Linux
Section 8.  CLUSTERS ON AIX: GPFS  (also repeated in section 18)
Section 9.  CLUSTERS ON AIX: HACMP (also repeated in section 18)
Section 10. Cisco IOS version 10.x, 11.x, 12.x router commands:
Section 11. Basic VMS commands and Operations
Section 12. NT/200x/XP CMD shell script examples
Section 13. OO and C# elementary code fragments and basic DOT.NET theory
Section 14. Basic PHP theory and examples
Section 15. Extended PL/SQL examples and code snippets
Section 16. Basic VB and VBscript code snippets
Section 17. SQL Server 7 and 2000 system queries
Section 18. BIG SECTION: UNIX: AIX, HP, Solaris, Linux commands and architecture
SECTION 19. BIG SECTION: Oracle RDBMS 8,8i,9i,10g system queries and architecture
section 20. How to trace in Unix
Section 21. How to undelete a file in UNIX.
Section 22. Oracle 10g/11g RAC
Section 23. MQ errors and messages
Section 24. A collection of Unix errorcodes


#############################################################################################
#############################################################################################
#############################################################################################


=====================================================================================
Secton 1. A scetch of the CORBA architecture: (this is a very lightweight discussion)
=====================================================================================

   ----------               -------------------------
   | CLIENT |               | Object Implementation | 
   ----------               -------------------------
     | IDL |                   | IDL       |
     |STUB |                   | Skeleton  |
  ----------------------------------------------------
  |    |       -------------        |                 |
  |    --->--> | REQUEST   | ---->--                  |
  |            -------------                          |
  |    OBJECT REQUEST BROKER (ORB)                    |
  -----------------------------------------------------
Fig 1: local connection


   ----------     ---------                   ---------   ----------
   | Client |     |Object |                   |Client |   |Object  |
   ----------     ---------                   ---------   ----------
     |STUB|         |SKEL|                     |STUB|       |SKEL|
  --------------------------                 ------------------------
  |     |             ^     |                |                ^     |
  |     |-> ORB 1-->--|     |--->------------|---->-- ORB 2---|     |
  |                         |    IIOP        |                      |
  --------------------------    protocol     ------------------------
Fig 2: remote invocation


CORBA is about distributed (networked) computing.
You create "objects" and "Interface Definitions" (through IDL: Interface Definition Language),
using a well defined infrastructure.
(Theoretically) It is platform and OS independent.

The Common Object Request Broker Architecture (CORBA) [OMG:95a] is an emerging open distributed object 
computing infrastructure, (being) standardized by the Object Management Group (OMG). CORBA automates many 
common network programming tasks such as object registration, location, and activation; request demultiplexing; 
framing and error-handling; parameter marshalling and demarshalling; and operation dispatching. 

At least theoratically, using the standard protocol IIOP, a CORBA-based program from any vendor,  
on almost any computer, operating system, programming language, and network, can interoperate with a 
CORBA-based program from the same or another vendor, on almost any other computer, operating system, 
programming language, and network. 

CORBA applications are composed of objects, individual units of running software that combine functionality 
and data, and that frequently (but not always) represent something in the real world. Typically, there 
are many instances of an object of a single type - for example, an e-commerce website would have many 
shopping cart object instances, all identical in functionality but differing in that each is assigned 
to a different customer, and contains data representing the merchandise that its particular customer 
has selected. For other types, there may be only one instance. When a legacy application, such as an 
accounting system, is wrapped in code with CORBA interfaces and opened up to clients on the network, 
there is usually only one instance. 

For each object type, such as the shopping cart that we just mentioned, you define an interface 
in OMG IDL. The interface is the syntax part of "the contract" that the server object offers to the 
clients that invoke it. Any client that wants to invoke an operation on the object must use this IDL 
interface to specify the operation it wants to perform, and to marshal the arguments that it sends. 
When the invocation reaches the target object, the same interface definition is used there to 
unmarshal the arguments so that the object can perform the requested operation with them. 
The interface definition is then used to marshal the results for their trip back, and to unmarshal 
them when they reach their destination. 

The IDL interface definition is independent of programming language, but maps to all of the 
popular programming languages via OMG standards: OMG has standardized mappings from IDL to C, C++, 
Java, COBOL, Smalltalk, Ada, Lisp, Python, and IDLscript. 

 
The interface to each object is defined very strictly. In contrast, the implementation of an object - 
its running code, and its data - is hidden from the rest of the system (that is, encapsulated) 
behind a boundary that the client may not cross. Clients access objects only through their advertised 
interface, invoking only those operations that that the object exposes through its IDL interface, 
with only those parameters (input and output) that are included in the invocation.

Figure 1 shows how everything fits together, at least within a single process: You compile your IDL 
into client stubs and object skeletons, and write your object (shown on the right) and a client 
for it (on the left). Stubs and skeletons serve as proxies for clients and servers, respectively.  
Because IDL defines interfaces so strictly, the stub on the client side has no trouble meshing perfectly 
with the skeleton on the server side, even if the two are compiled into different programming languages, 
or even running on different ORBs from different vendors. 

How do remote invocations work?
Figure 2 diagrams a remote invocation. In order to invoke the remote object instance, the client first 
obtains its object reference. (There are many ways to do this, but we won't detail any of them here. 
Easy ways include the Naming Service and the Trader Service.) To make the remote invocation, the client 
uses the same code that it used in the local invocation we just described, substituting the object reference 
for the remote instance. When the ORB examines the object reference and discovers that the target object 
is remote, it routes the invocation out over the network to the remote object's ORB. (Again we point out: 
for load balanced servers, this is an oversimplification.)

(Note the similarity with OO "early binding": that is: at compile time, all options are then binded.)

How does this work? OMG has standardized this process at two key levels: First, the client knows 
the type of object it's invoking (that it's a shopping cart object, for instance), and the client stub 
and object skeleton are generated from the same IDL. This means that the client knows exactly which 
operations it may invoke, what the input parameters are, and where they have to go in the invocation; 
when the invocation reaches the target, everything is there and in the right place. We've already seen 
how OMG IDL accomplishes this. Second, the client's ORB and object's ORB must agree on a common 
protocol - that is, a representation to specify the target object, operation, all parameters 
(input and output) of every type that they may use, and how all of this is represented over the wire. 
OMG has defined this also - it's the standard protocol IIOP. (ORBs may use other protocols besides 
IIOP, and many do for various reasons. But virtually all speak the standard protocol IIOP for 
reasons of interoperability, and because it's required by OMG for compliance.)

Some examples of typical code:
------------------------------


Hello.idl, the interface definition
The following file, Hello.idl, is written in the OMG Interface Definition Language, and describes a CORBA object 
whose sayHello() operation returns a string and whose shutdown() method shuts down the ORB. 
OMG IDL is a purely declarative language designed for specifying programming-language-independent 
operational interfaces for distributed applications. The IDL can be mapped to a variety of programming languages. 
The IDL mapping for Java is summarized in "IDL to Java Language Mapping Summary". 

Hello.idl 


module HelloApp
{
  interface Hello
  {
  string sayHello();
  oneway void shutdown();
  };
};


#############################################################################################
#############################################################################################
#############################################################################################


=============================================================================
Section 2. (Traditional) Client connections to SQL Server:
=============================================================================


 -------   -------     -------  -------
 |App 1|   |App 2|     |App 3|  |App 4|
 -------   -------     -------  -------
     |        |           |        |
     |     -------     -------     |
     |     |ADO  |     |RDO  |     |
     |     -------     -------     |
     |        |            |       |
   -----------------   ---------------
   |OLE DB         |   |ODBC         |   (TabularDataStream TDS)
   -----------------   ---------------
           |                   |
   -----------------------------------
   |Client Network library api       |
   |- named pipes                    |
   |- tcpip sockets                  |
   |- multiprotocol                  |
   -----------------------------------
                |
                | 
                \ network stack: tcp/ip, spx/ipx etc..
----------------_\--------------------------------------
                \
                 \
                 |
   ----------------------------------- 
   |SQL Server network library       |
   -----------------------------------
                |
   -----------------
   |SQL Server     |  (TDS)
   -----------------


#############################################################################################
#############################################################################################
#############################################################################################


==================================================================================
Section 3. N-tier architectures: (1) Browser based Client connections to a Server:
==================================================================================


Example 1: ASP
--------------


Client                           WebServer + ASP engine
                                ------------------------------
 -------   request test.asp     |  <% if Hour(Now)< 12 then %>|
|       |---------------------> |      Good Morning.          |
|Web    |                       |  <% else %>                 |
|browser|  test.htm returned    |      Good Day.              |
|       | <------------------   |  <% end if %>               |
 ------                      |  ------------------------------
                             |                | 
                             |                | if its before 12.00 o'clock      
                             |                V
                             -----<------ Good Morning

<SCRIPT LANGUAGE=VBScript RUNAT=SERVER>
Function ComputeAMPM()
  If Hour(Now) < 12 Then
     ComputeAMPM="morning"
  Else
     ComputeAMPM="afternoon"
  End If
End Function
</SCRIPT>


Example 2: jsp, servlets, java
------------------------------


                                      ....................................
                                      .                                  .    optional backend DB
                http://.../x.jsp      . -----------------    ----------  .      ----------
   ------------                       . |  jsp page      |   |BEANS or|  .     |          |
   | BROWSER  |----------------->     . |                |<->|EJB     |<------>|DataBase  |
   |          |                       . |Web Server      |   |        |        |          |
   |          |<-----------------     . |JSP Engine      |   ----------  .     -----------
   ------------                       . |                |               .
                                      . ------------------               .
                                      .                                  .
                                      ....................................


Example 3: N-tier architecture jsp, servlets, java, middleware
--------------------------------------------------------------


                                     ....................................
                                     .                                  .                  optional backend DB
               http://.../x.jsp      . -----------------    ----------  .  ------------      ----------
  ------------                       . |  jsp page      |   |BEANS or|  .  |            |    |        |
  | BROWSER  |----------------->     . |                |<->|EJB     |<--->|- Jolt/     |<-->|DataBase|
  |          |                       . |Web Server      |   ----------  .  |  Tuxedo    |    |        |
  |          |<-----------------     . |iAS, Websphere  |               .  |  middleware|    ----------
  ------------                       . |                |<---------------->|- Cobol obj |
                                     . ------------------               .  --------------
                                     .                                  .
                                     ....................................


#############################################################################################
#############################################################################################
#############################################################################################


=============================================================================
Section 4. IPC: Named pipes, Sockets, and Multiprotocol in Windows:
=============================================================================


This section is Windows orientated. 
See Section 5 for a Unix or more general interpretation.


4.1 TCP/IP Sockets:
-------------------

Suppose the Server "10.10.10.1", has multiple Server programs running. 
How does a client differentiate between the multiple Server programs?

The usual way with tcpip is the use of sockets. A socket is an "identifier" completely
identifying the location of a Server on the network, as well as the "port" the server service is listening on,
like for example:

10.10.10.1 : 1521  or for example
10.10.10.1 : 1433

The client should have knowledge of the "port" of the desired Host program or the host service is listening on.
For example it could come from a local services file, or some registry.

The client constructs a tcp header, while in the destination port, the port is listed where the Host Server service
or deamon is listening on.


                          Server, IP=10.10.10.1
  
                         |------------------------------------------------
                         |                                               |
                         |      ----------        ---------------        |
                         |      | Oracle |        | SQL server  |        |
                         |      ----------        ---------------        |
                         |           |                 |                 |
                         |   ------------------   ---------------------  |
                         |  |Oracle listener   |  |SQL Server listener | |
                         |  |listening on port |  |listening on port   | |
                         |  |1521              |  | 1433               | |
                         |   ------------------    --------------------  |
                         |           ^                 ^                 |
                         |           |                 |                 |
 client request for      |       1521|             1433|                 |
 connection to Oracle    |           |                 |                 |
 10.10.10.1:1521         |   -------------------------------             |
 ----------------------> |   |Portmapper / Netlib router   |             |
                         |   |or other "service", handling |             |
 Client request for      |   |requests to the desired host |             |
 connection to SQL Server|   |program                      |             |
 10.10.10.1:1433         |   |                             |             |
 ----------------------> |   |                             |             |
                         |   ------------------------------              |
                         |                                               |
                         |------------------------------------------------


4.2 Named pipes:
----------------

A high level process, like a client program, can open and write to a "special file", the "named pipe".
The named pipe can be considered to be at the OSI layer 7, and is an IPC mechanism for process to process
communication, locally or across a network.


In Windows, the design of named pipes is biased towards client-server communication, and they work much like sockets: 
other than the usual read and write operations, Windows named pipes also support an explicit "passive" mode 
for server applications (compare: UNIX domain sockets).

Named pipes aren't permanent and can't be created as special files on any writable filesystem, unlike in UNIX, 
but are volatile names (freed after the last reference to them is closed) allocated in the root directory of 
the named pipe filesystem (NPFS), mounted under the special path \\.\pipe\ (that is, a pipe named "foo" would 
have a full path name of \\.\pipe\foo). Anonymous pipes used in pipelining actually are named pipes with a random name.

In "constructing" the client program (VB, C++, VB.NET, C# etc...) there is some sort of mechanisme to create
a named pipe, for example:

Public Declare Function CallNamedPipe Lib "kernel32" Alias "CallNamedPipeA" _
(ByVal lpNamedPipeName As String, etc......


The pipe is an IPC construct above any network protocol as sockets/tcp/ip, 
or nwlink spx/ipx etc..
It uses the IPC$ share of the remote system, just like a filesystemshare.

\\computername\pipe\MSSQL$instancename\sql\query


CLIENT:
----------------------------------------       rw to and from pipe
named pipe \\.\sql\query,                  <-------------------------> Server named pipe
functions like a sort of URL or share
----------------------------------------
session management, sockets, netbios
----------------------------------------
TCP   SPX   
----------------------------------------
IP    IPX
----------------------------------------
Datalink
----------------------------------------
physiscal network
----------------------------------------


4.3 Multiprotocol:
------------------

It's a protocol that layers over named pipes, tcpip sockets, or nwlink spx/ipx sockets.
So, just MUST have one of the above IPC mechanismens available.

The Multiprotocol selection has two key features: 

Automatic selection of an available network protocol to communicate with an instance of Microsoft� SQL Server�. 
This is convenient when you want to connect to multiple servers running different network protocols 
but do not want to reconfigure the client connection for each server. If the client and server Net-Libraries 
for TCP/IP Sockets, NWLink IPX/SPX, or Named Pipes are installed on the client and server, 
the Multiprotocol Net-Library will automatically choose the first available network protocol 
to establish a connection.

Client encryption. 
You can enforce encryption over the Multiprotocol Net-Library on clients running on the Microsoft 
Windows NT� 4.0, Windows� 2000, Windows 95, or Windows 98 operating system to prevent others from intercepting 
and viewing sensitive data.

The Multiprotocol Net-Library takes advantage of the remote procedure call (RPC) facility of 
Windows NT 4.0 and Windows 2000, which provides Windows Authentication. For the Multiprotocol Net-Library, 
clients determine the server address using the server name.

Usage Considerations
Before using the Multiprotocol Net-Library, consider the following: 

The Multiprotocol Net-Library does not support named instances of SQL Server 2000. You can use the 
Multiprotocol Net-Library to connect to the default instance of SQL Server on a computer, but you cannot connect 
to any named instances.

The Multiprotocol Net-Library does not support server enumeration. From applications that can list servers 
by calling dbserverenum, you cannot identify servers running an instance of SQL Server and listening 
on the Multiprotocol Net-Library. 


#############################################################################################
#############################################################################################
#############################################################################################


=============================================================================
5. Section IPC in UNIX:
=============================================================================


#############################################################################################
#############################################################################################
#############################################################################################


=============================================================================
Section 6. "Traditional" Cluster system in Redhat Linux
=============================================================================

The Red Hat "Cluster Manager" software was originally based on the open source Kimberlite
http://oss.missioncriticallinux.com/kimberlite/ cluster project which was developed by Mission
Critical Linux, Inc.
Subsequent to its inception based on Kimberlite, developers at Red Hat have made a large number
of enhancements and modifications.


6.1 Cluster Overview:

To set up a cluster, an administrator must connect the cluster systems (often referred to as member
systems) to the cluster hardware, and configure the systems into the cluster environment. The foundation
of a cluster is an advanced host membership algorithm. This algorithm ensures that the cluster
maintains complete data integrity at all times by using the following methods of inter-node communication:

� Quorum partitions on shared disk storage to hold system status
� Ethernet and serial connections between the cluster systems for heartbeat channels

To make an application and data highly available in a cluster, the administrator must configure a 
"cluster service" � a discrete group of service properties and resources, such as an application and shared
disk storage. A service can be assigned an IP address to provide transparent client access to the service.
For example, an administrator can set up a cluster service that provides clients with access to
highly-available database application data.
Both cluster systems can run any service and access the service data on shared disk storage. 

However, each service can run on only one cluster system at a time, in order to maintain data integrity. 
Administrators can set up

- an "active-active" configuration in which both cluster systems run different services,
or 
- an "active-passive" (hot-standby) configuration in which a primary cluster system runs all the services, 
  and a backupcluster system takes over only if the primary system fails.

  NOTE:
  So this is actually a difference from Oracle 10g Real Application Cluster (RAC), where both instances,
  or multiple instances (from 2 - 100), accesses the single database on shared storage, at the same time !


Scetch of a 2-node Linux cluster


         ------------------------------------------ public network
             |                              |
             |                              |
        ------------                    -------------
        |cluster   |                    |cluster    |
        |system    |Ethernet            |system     |
        |          |--------------------|           |
        |          |heartbeat           |           |
        |          |                    |           |
        |          |____________        |           |
        |ServiceA  |  -----    -|---    |           |
        |ServiceB  |--|PWR|    |PWR|----|ServiceC   |
        |          |  -----    -----    |           |
        |          |    |_______________|           |
        |          |                    |           |
        ------------                    -------------
             | SCSI bus or Fible Channel      |
             ------------------  --------------
               Interconnect   |  |
                              |  |
Fig 6.1                   -----------
                          |Shared   |  - has Quorum partition
                          |Disk     |  - has partitions for ServiceA, B, C
                          |Storage  |
                          ----------- 


Figure 6�1, shows an example of a cluster in an active-active configuration.
If a hardware or software failure occurs, the cluster will automatically restart the failed system�s services
on the functional cluster system. This service failover capability ensures that no data is lost,
and there is little disruption to users. When the failed system recovers, the cluster can re-balance the
services across the two systems.
In addition, a cluster administrator can cleanly stop the services running on a cluster system and then
restart them on the other system. This service relocation capability enables the administrator to maintain
application and data availability when a cluster system requires maintenance.

-- Service configuration framework:

Clusters enable an administrator to easily configure individual services to make data and applications
highly available. To create a service, an administrator specifies the resources used in the
service and properties for the service, including the service name, application start and stop script,
disk partitions, mount points, and the cluster system on which an administrator prefers to run the
service. After the administrator adds a service, the cluster enters the information into the cluster
database on shared storage, where it can be accessed by both cluster systems.
The cluster provides an easy-to-use framework for database applications. For example, a database
service serves highly-available data to a database application. The application running on a cluster
system provides network access to database client systems, such as Web servers. If the service
fails over to another cluster system, the application can still access the shared database data. A
network-accessible database service is usually assigned an IP address, which is failed over along
with the service to maintain transparent access for clients.
The cluster service framework can be easily extended to other applications, as well.

-- Multiple cluster communication methods:

To monitor the health of the other cluster system, each cluster system monitors the health of the
remote power switch, if any, and issues heartbeat pings over network and serial channels to monitor
the health of the other cluster system. In addition, each cluster system periodically writes a
timestamp and cluster state information to two quorum partitions located on shared disk storage.
System state information includes whether the system is an active cluster member. Service state
information includes whether the service is running and which cluster system is running the service.
Each cluster system checks to ensure that the other system�s status is up to date.
To ensure correct cluster operation, if a system is unable to write to both quorum partitions at
startup time, it will not be allowed to join the cluster. In addition, if a cluster system is not updating
its timestamp, and if heartbeats to the system fail, the cluster system will be removed from the
cluster.

If a hardware or software failure occurs, the cluster will take the appropriate action to maintain application
availability and data integrity. For example, if a cluster system completely fails, the other
cluster system will restart its services. Services already running on this system are not disrupted.
When the failed system reboots and is able to write to the quorum partitions, it can rejoin the
cluster and run services. Depending on how the services are configured, the cluster can re-balance
the services across the two cluster systems.

-- Manual service relocation capability:

In addition to automatic service failover, a cluster enables administrators to cleanly stop services
on one cluster system and restart them on the other system. This allows administrators to perform
planned maintenance on a cluster system, while providing application and data availability.

-- Event logging facility:

To ensure that problems are detected and resolved before they affect service availability, the cluster
daemons log messages by using the conventional Linux syslog subsystem. Administrators can
customize the severity level of the logged messages.

-- Application Monitoring:
The cluster services infrastructure can optionally monitor the state and health of an application. In
this manner, should an application-specific failure occur, the cluster will automatically restart the
application. In response to the application failure, the application will attempt to be restarted on
the member it was initially running on; failing that, it will restart on the other cluster member.

-- Status Monitoring Agent:

A cluster status monitoring agent is used to gather vital cluster and application state information.
This information is then accessible both locally on the cluster member as well as remotely. A
graphical user interface can then display status information from multiple clusters in a manner
which does not degrade system performance.


6.2 Notes about Shared Storage:

The operation of the cluster depends on reliable, coordinated access to shared storage. In the event of
hardware failure, it is desirable to be able to disconnect one member from the shared storage for repair
without disrupting the other member. Shared storage is truly vital to the cluster configuration.
Testing has shown that it is difficult, if not impossible, to configure reliable multi-initiator parallel
SCSI configurations at data rates above 80 MBytes/sec. using standard SCSI adapters. Further tests
have shown that these configurations can not support online repair because the bus does not work
reliably when the HBA terminators are disabled, and external terminators are used. For these reasons,
multi-initiator SCSI configurations using standard adapters are not supported. Single-initiator parallel
SCSI buses, connected to multi-ported storage devices, or Fibre Channel, are required.
The Red Hat Cluster Manager requires that both cluster members have simultaneous access to the
shared storage. Certain host RAID adapters are capable of providing this type of access to shared
RAID units. These products require extensive testing to ensure reliable operation, especially if the
shared RAID units are based on parallel SCSI buses. These products typically do not allow for online
repair of a failed system. No host RAID adapters are currently certified with Red Hat Cluster Manager.
Refer to the Red Hat web site at http://www.redhat.com for the most up-to-date supported hardware
matrix.
The use of software RAID, or software Logical Volume Management (LVM), is not supported on
shared storage. This is because these products do not coordinate access from multiple hosts to shared
storage. Software RAID or LVM may be used on non-shared storage on cluster members (for example,
boot and system partitions and other filesysytems which are not associated with any cluster services).


6.3 More detailed view of an almost "No single point of failure" 2-Node Clustered System:

                            ----------
                            |NETWORK |
         -------------------|SWITCH  |-----------------------
         |                  ----------                      |
         |                      |		            |               
 ---------------------          |		    ---------------------   
 |network interface  |      ----------		    |network interface  |   
 |--------------------      |terminal|		    |--------------------   
 |serial port        |------|server  |--------------|serial port        |
 |--------------------      ----------		    |--------------------   
 |CLUSTER            | 				    |CLUSTER            | 
 |SYSTEM             |				    |SYSTEM             |
 |--------------------				    |--------------------
 |network interface  |------------------------------|network interface  | 
 |--------------------				    |--------------------
 |serial port        |------------------------------|serial port        |
 |--------------------				    |--------------------
 |serial port        |-----------------\	    |                   |
 |--------------------   -----       -----	    |--------------------   
 |power plug         |---|PWR|       |PWR|----------|power plug         |
 |--------------------   -----       -----	    |--------------------   
 |                   |	   |			    |-------------------|
 |                   |	   -------------------------|serial port        |
 |--------------------				    |--------------------
 |SCSI adapter (T)   |				    |SCSI adapter (T)   |
 ---------------------				    ---------------------
       |                                                       |
       |                                                       |
       -----------           -----------------------------------
                 |           |
                 |           |                 (T)         (T)
             -------------------------------------------------------
             | Port A/in | Port B/in |    | Port A/Out| Port B/Out |
             |------------------------------------------------------
             |     |           |                |          |       |
             |  -------------------         --------------------   |
             |  |controller 1     |         |controller 2      |   |
             |  -------------------         --------------------   |
             |          |                            |             | 
             |          |                            |             |
  RAID       |         ( )                          ( )            |
             |          |                            |             |
             |         ( )                          ( )            |
             |                                                     |
             |    mirrored shared disks                            |
             -------------------------------------------------------


single-initiator
----------------

A single-initiator SCSI bus has only one node connected to it, and provides host isolation and better 
performance than a multi-initiator bus. Single-initiator buses ensure that each node is protected 
from disruptions due to the workload, initialization, or repair of the other nodes.

When using a single- or dual-controller RAID array that has multiple host ports and provides 
simultaneous access to all the shared logical units from the host ports on the storage enclosure, 
the setup of the single-initiator SCSI buses to connect each cluster node to the RAID array is possible. 
If a logical unit can fail over from one controller to the other, the process must be transparent 
to the operating system. Note that some RAID controllers restrict a set of disks to a specific 
controller or port. In this case, single-initiator bus setups are not possible.


To set up a single-initiator SCSI bus configuration, perform the following steps:

Enable the onboard termination for each host bus adapter.
Enable the termination for each RAID controller. 
Use the appropriate SCSI cable to connect each host bus adapter to the storage enclosure.

Setting host bus adapter termination is done in the adapter BIOS utility during system boot. 
To set RAID controller termination, refer to the vendor documentation. 


  ---------   SI SCSI bus                   --------------
  |      T|---------------                  |  HBA        |
  |HBA    |               |       ----------|T            |
  |       |               |       |         --------------
  ---------               |       |
                          |       |
                     -------------------
                     |    T       T    |
                     |Storage Enclosure|
                     -------------------

In general, recommended in Linux an Sun clusters.


Multi Initiator SCSI
--------------------


Multi Initiator SCSI configurations are configurations with two SCSI host adapter boards connect 
to a single SCSI bus like in the following example: 

   --------   SI SCSI bus                   --------------
  |      T|-------------------------------- |T            |
  |       |                  |              |             |
  |HBA    |                  |              |HBA          |
  |       |                  |              |             |
  ---------                  |              ---------------           
                     -------------------
                     |       T         |
                     |Storage Enclosure|
                     -------------------


In general, not recommended for Linux or Solaris clusters.


#############################################################################################
#############################################################################################
############################################################################################# 


=============================================================================
7. Oracle 10g RAC example on Redhat Linux:
=============================================================================


7.1 Overview:
-------------


- RAC Architecture Overview

Let's begin with a brief overview of RAC architecture.

A cluster is a set of 2 or more machines (nodes) that share or coordinate resources to perform the same task. 
A RAC database is 2 or more instances running on a set of clustered nodes, with all instances accessing 
a shared set of database files. 
Depending on the O/S platform, a RAC database may be deployed on a cluster that uses vendor clusterware 
plus Oracle's own clusterware (Cluster Ready Services), or on a cluster that solely uses 
Oracle's own clusterware.
Thus, every RAC sits on a cluster that is running Cluster Ready Services. srvctl is the primary tool DBAs use 
to configure CRS for their RAC database and processes.


- Cluster Ready Services and the OCR

Cluster Ready Services, or CRS, is a new feature for 10g RAC. Essentially, it is Oracle's own clusterware. 
On most platforms, Oracle supports vendor clusterware; in these cases, CRS interoperates with the vendor 
clusterware, providing high availability support and service and workload management. On Linux and Windows clusters, 
CRS serves as the sole clusterware. In all cases, CRS provides a standard cluster interface that is consistent 
across all platforms.

CRS consists of four processes (crsd, occsd, evmd, and evmlogger) and two disks: 
the Oracle Cluster Registry (OCR), and the voting disk. 

CRS manages the following resources: 

. The ASM instances on each node 
. Databases 
. The instances on each node 
. Oracle Services on each node 
. The cluster nodes themselves, including the following processes, or "nodeapps":
  . VIP 
  . GSD 
  . The listener 
  . The ONS daemon

CRS stores information about these resources in the OCR. If the information in the OCR for one of these 
resources becomes damaged or inconsistent, then CRS is no longer able to manage that resource. 
Fortunately, the OCR automatically backs itself up regularly and frequently.


10g RAC (10.2) uses, or depends on,:

- Oracle Clusterware (10.2), formerly referred to as CRS "Cluster Ready Services" (10.1).
- Oracle's optional Cluster File System OCFS (This is optional), or use ASM and RAW.
- Oracle Database extensions

RAC is "scale out" technology: just add commodity nodes to the system.
The key component is "cache fusion". Data are transferred from one node
to another via very fast interconnects. 
Essential to 10g RAC is a "Shared Cache" technology.

Automatic Workload Repository (AWR) plays a role also.  The Fast Application Notification (FAN) mechanism
that is part of RAC, publishes events that describe the current service level being provided
by each instance, to AWR. The load balancing advisory information is then used to determine
the best instance to serve the new request.

. With RAC, ALL Instances of ALL nodes in a cluster, access a SINGLE database.
. But every instance has it's own UNDO tablespace, and REDO logs.

The Oracle Clusterware comprise several background processes that facilitate cluster operations.
The Cluster Synchronization Service CSS, Event Management EVM, and Oracle Cluster components
communicate with other cluster components layers in the other instances within the same 
cluster database environment.


Questions per implementation arise in the following points:
. Storage
. Computer Systems/Storage-Interconnect
. Datbase
. Application Server
. Public and Private networks
. Application Control & Display

On the Storage level, it can be said that 10g RAC supports
- Automatic Storage Management (ASM)
- Oracle Cluster File System (OCFS)
- ??? Network File System (NFS) - limited (only theoretical actually)
- Disk raw partitions
- Third party cluster file systems

For application control and tools, it can be said that 10g RAC supports
- OEM Grid Control     http://hostname:5500/em
  OEM Database Control http://hostname:1158/em
- "svrctl" is a command line interface to manage the cluster configuration,
   for example, starting and stopping all nodes in one command.
- Cluster Verification Utility (cluvfy) can be used for an installation and sanity check.

Failure in Client connections:

Depending on the Net configuration, type of connection, type of transaction etc.., 
Oracle Net services provides a feature called "Transparant Application Failover" 
which can fail over a client session to another backup connection.

About HA and DR:

- RAC is HA       , High Availability, that will keep things Up and Running in one site.
- Data Guard is DR, Disaster Recovery, and is able to mirror one site to another remote site.


7.2 Prepare your nodes:
-----------------------


7.2.1 Scetch of a 2-node Linux cluster

			192.168.2.0
         ------------------------------------------ public network 
             |                              |
             |                              |
        ------------                    -------------
        |InstanceA |Private network     |InstanceB  |
        |          |Ethernet            |           |
        |          |--------------------|           |
        |          |192.168.1.0         |           |
        |          |                    |           |
        |          |____________        |           |
        |          |  -----    -|---    |           |
        |          |--|PWR|    |PWR|----|           |
        |          |  -----    -----    |           |
        |          |    |_______________|           |
        |          |                    |           |
        ------------                    -------------
             | SCSI bus or Fible Channel      |
             ------------------  --------------
               Interconnect   |  |
                              |  |
Fig 7.1                   -----------
                          |Shared   |  - has Single DB on ASM or OCFS or RAW
                          |Disk     |  - has OCR and Voting disk on OCFS or RAW
                          |Storage  |
                          ----------- 


7.2.2 Storage Options

Storage					Oracle Clusterware	Database	Recovery area
--------------				------------------	--------	-------------
Automatic Storage Management 		No 			Yes 		Yes 
Cluster file system (OCFS) 		Yes 			Yes 		Yes 
Shared raw storage 			Yes 			Yes 		No 


In the following, we will do an example installation on 3 nodes.


7.2.3 Install Redhat on all nodes with all options.

7.2.4 create oracle user and groups dba, oinstall on all nodes.
      Make sure they all have the same UID and GUI.

7.2.5 Make sure the user oracle has an appropriate .profile or .bash_profile

7.2.6 Every node needs a private network connection and a public network connection (at least
      two networkcards).

7.2.7 Linux kernel parameters:

Most out of the box kernel parameters (of RHELS 3,4,5) are set correctly for Oracle
except a few.

You should have the following minimal configuration:

net.ipv4.ip_local_port_range	1024  65000
kernel.sem			250  32000  100  128
kernel.shmmni			4096
kernel.shmall			2097152
kernel.shmmax			2147483648
fs.file-max			65536


You can check the most important parameters using the following command:

# /sbin/sysctl -a | egrep 'sem|shm|file-max|ip_local'

net.ipv4.ip_local_port_range = 1024  65000
kernel.sem = 250  32000  100  128
kernel.shmmni = 4096
kernel.shmall = 2097152
kernel.shmmax = 2147483648
fs.file-max = 65536

If some value should be changed, you can change the "/etc/sysctl.conf" file and run the "/sbin/sysctl -p" command
to change the value immediately.
Every time the system boots, the init program runs the /etc/rc.d/rc.sysinit script. This script contains 
a command to execute sysctl using /etc/sysctl.conf to dictate the values passed to the kernel. 
Any values added to /etc/sysctl.conf will take effect each time the system boots. 
 

7.2.8 make sure ssh and scp are working on all nodes without asking for a password.
      Use shh-keygen to arrange that.


7.2.9 Example "/etc/host" on the nodes:

Suppose you have the following 3 hosts, with their associated public and private names:

public  private
oc1	poc1
oc2	poc2
oc3	poc3

Then this could be a valid host file on the nodes: 

127.0.0.1	localhost.localdomain	localhost

192.168.2.99	rhes30
192.168.2.166	oltp
192.168.2.167	mw

192.168.2.101	oc1	#public1
192.168.1.101	poc1	#private1
192.168.2.19	voc1	#virtual1

192.168.2.102	oc2	#public2
192.168.1.102	poc2	#private2
192.168.2.177	voc2	#virtual2

192.168.2.103	oc3	#public3
192.168.1.103	poc3	#private3
192.168.2.178	voc3	#virtual3


7.2.10 Example disk devices

On all nodes, the shared disk devices should be accessible through the same devices names.

Raw Device Name		Physical Device Name	Purpose
/dev/raw/raw1		/dev/sda1		ASM Disk 1: +DATA1
/dev/raw/raw2		/dev/sdb1		ASM Disk 1: +DATA1
/dev/raw/raw3		/dev/sdc1		ASM Disk 2: +RECOV1
/dev/raw/raw4		/dev/sdd1		ASM Disk 2: +RECOV1
/dev/raw/raw5		/dev/sde1		OCR Disk (on RAW device)
/dev/raw/raw6		/dev/sdf1		Voting Disk (on RAW device)


7.3 CRS installation:
---------------------

7.3.1 First install CRS in its own home directory

First install CRS in its own home directory, e.g. CRS10gHome, apart from the Oracle home dir.

As Oracle user:

./runInstaller

 ---------------------------------------------------
 |                                                 |  Screen 1
 |Specify File LOcations                           |
 |                                                 |
 |Source                                           |
 |Path: /install/crs10g/Disk1/stage/products.xml   |
 |                                                 |
 |Destination                                      |
 |Name: CRS10gHome                                 |
 |Path: /u01/app/oracle/product/10.1.0/CRS10gHome  |
 |                                                 |
 ---------------------------------------------------


 ---------------------------------------------------
 |                                                 |  Screen 2
 |Cluster Configuration                            |
 |                                                 |
 |Cluster Name: lec1                               |
 |                                                 |
 | Public Node Name            Private Node Name   |
 | ---------------------------------------------   |
 | |oc1                 | p0c1                  |  |
 | |--------------------------------------------   |
 | |oc2                 | p0c2                  |  |
 | |--------------------------------------------   |
 | |oc3                 | poc3                  |  |
 | |--------------------------------------------   |
 ---------------------------------------------------

In the next screen, you specify which of your networks is to be used as
the public interface (to connect to the public network) and which will be used
for the private interconnect to support cache fushion and the cluster heartbeat.

 ---------------------------------------------------
 |                                                 |  Screen 3
 |Private Interconnect Enforcement                 |
 |                                                 |
 |                                                 |
 |                                                 |
 | Interface Name   Subnet          Interface type |
 | ---------------------------------------------   |
 | |eth0           |192.168.2.0   |Public      |   |
 | |--------------------------------------------   |
 | |eth1           |192.168.1.0   |Private     |   |
 | |--------------------------------------------   |
 |                                                 |
 ---------------------------------------------------

In the next screen, you specify /dev/raw/raw5 as the raw disk for the Oracle Cluster Registry.

 ---------------------------------------------------
 |                                                 |  Screen 4
 |Oracle Cluster Registry                          |
 |                                                 |
 |Specify OCR Location: /dev/raw/raw5              |
 |                                                 |
 ---------------------------------------------------

In a similar fashion you specify the location of the Voting Disk.

 ---------------------------------------------------
 |                                                 |  Screen 5
 |Voting Disk                                      |
 |                                                 |
 |Specify Voting Disk: /dev/raw/raw6               |
 |                                                 |
 ---------------------------------------------------

You now have to execute the /u01/app/oracle/orainventory/orainstRoot.sh script
on all Cluster Nodes as the root user.

After this, you can continue with the other window, and see an "Install Summary" screen.
No you click "Install" and the installation begins.
Apart from the node you work on, the software will also be copied to the other nodes as well.

After the installation is complete, you are once again prompted to run a script as root
on each node of the Cluster.
This is the script "/u01/app/oracle/product/10.1.0/CRS10gHome/root.sh".

-- The olsnodes command.

After finishing the CSR installation, you can verify that the installation completed successfully
by running on any node the following command:

# cd /u01/app/oracle/product/10.1.0/CRS10gHome/bin
# olsnodes -n
oc1   1
oc2   2
oc3   3


7.4 Database software installation:
-----------------------------------

You can install the database software into the same directory in each node.
With OCFS2, you might do one install in a common shared directory for all nodes.

Because CSR is already running, the OUI detects that, and because its cluster aware, it
provides you with the options to install a clustered implementation.

You start the installation by running ./runInstaller as the oracle user on one node.
For most part, it looks the same as a single-instance installation.

After the file location screen, that is source and destination, you will see this screen:

 ---------------------------------------------------
 |                                                 |  
 |Specify Hardware Cluster Installation Mode       |
 |                                                 |
 | o Cluster installation mode                     |
 |                                                 |
 |  Node name                                      |
 |  ---------------------------------------------  |
 |  | [] oc1                                    |  |
 |  | [] oc2                                    |  |
 |  | [] oc3                                    |  |
 |  ---------------------------------------------  |
 |                                                 |
 | o Local installation (non cluster)              |
 |                                                 |
 |-------------------------------------------------|

Most of the time, you will do a "software only" installation, and create the database later
with the DBCA.

For the first node only, after some time, the Virtual IP Configuration Assistant, VIPCA, will start.
Here you can configure the Virtual IP adresses you will use for application failover
and the Enterprise Manager Agent.
Here you will select the Virtual IP's for all nodes.
VIPCA only needs to run once per Cluster.


7.5 Creating the RAC database with DBCA:
----------------------------------------

Launching the DBCA for installing a RAC database is much the same as launching DBCA for a single instance.
If DBCA detects cluster software installed, it gives you the option to install a RAC database 
or a single instance.

as oracle user:

% dbca &

 ---------------------------------------------------
 |                                                 |  
 |Welcome to the database configuration assistant  |
 |                                                 |
 |                                                 |
 |                                                 |
 | o Oracle Real Application Cluster database      |
 |                                                 |
 | o Oracle single instance database               |
 |                                                 |
 |-------------------------------------------------|

After selecting RAC, the next screen gives you the option to select nodes:

 ---------------------------------------------------
 |                                                 |  
 |Select the nodes on which you want to create     |
 |the cluster database. The local node oc1 will    |
 |always be used whether or not it is selected.    |
 |                                                 |
 |  Node name                                      |
 |  ---------------------------------------------  |
 |  | [] oc1                                    |  |
 |  | [] oc2                                    |  |
 |  | [] oc3                                    |  |
 |  ---------------------------------------------  |
 |                                                 |
 |                                                 |
 |-------------------------------------------------|
 
In the next screens, you can choose the type of database (oltp, dw etc..), and all
other items, just like a single instance install.
At a cetain point, you can choose to use ASM diskgroups, flash-recovery area etc..


7.5 Example tnsnames.ora and listener.ora:
------------------------------------------


7.6 RAC utilities:
------------------

Some examples will illustrate the use of some important utilities.


Example 1: removing and adding a failed node
--------------------------------------------

Suppose, using above example, that instance rac3 on node oc3, fails. Suppose that you need to repair
the node (e.g. harddisk crash).

-- Remove the instance:

% srvctl remove instance -d rac -i rac3
Remove instance rac3 for the database rac (y/n)? y

-- Remove the node from the cluster:

# cd /u01/app/oracle/product/10.1.0/CRS10gHome/bin
# ./olsnode -n
oc1   1
oc2   2
oc3   3
# cd ../install
# ./rootdeletenode.sh oc3,3
# cd ../bin
# ./olsnode -n
oc1   1
oc2   2
#

Suppose that you have repared host oc3. We now want to add it back into the cluster.
Host oc3 has the OS newly installed, and its /etc/host file is just like it is on the other nodes.

-- Add the node at the clusterware layer:

From oc1 or oc2, go to the $CRS_Home/oui/bin directory, and run

# ./addNode.sh

A graphical screen pops up, and you are able to add oc3 to the cluster.
Al CRS files are copied to the new node.

To start the services on the new node, you are then prompted to run "rootaddnode.sh" on the active node
and "root.sh" on the new node.

# ./rootaddnode.sh

# ssh oc3
# cd /u01/app/oracle product/10.1.0/CRS10gHome
# ./root.sh

-- Install the Oracle software on the new node:


Example 2: showing all nodes from a node
----------------------------------------

# lsnodes -v

# cd /u01/app/oracle/product/10.1.0/CRS10gHome/bin
# ./olsnode -n
oc1   1
oc2   2
oc3   3


Example 3: using svrctl
-----------------------

The Server Control SVRCTL utility is installed on each node by default. 
You can use SRVCTL to start and stop the database and instances, manage configuration information,
and to move or remove instances and services.

Some SVRCTL operations store configuration information in the OCR. 
SVRCTL performs other operations, such as starting and stopping instances, by sending request
to the Oracle Clusterware process CSRD, which then starts or stops the Oracle Clusterware resources.

srvctl must be run from the $ORACLE_HOME of the RAC you are administering. 
The basic format of a srvctl command is 

srvctl <command> <target> [options]

where command is one of

enable|disable|start|stop|relocate|status|add|remove|modify|getenv|setenv|unsetenv|config

and the target, or object, can be a database, instance, service, ASM instance, or the nodeapps.


-- Example 1: To view help:

% svrctl -h
% svrctl command -h

-- Example 2: To see the SRVCTL version number, enter

% svrctl -V

-- Example 3. Bring up the MYSID1 instance of the MYSID database.

% srvctl start instance -d MYSID -i MYSID1

-- Example 4. Stop the MYSID database: all its instances and all its services, on all nodes.

% srvctl stop database -d MYSID

-- Example 5. Stop the nodeapps on the myserver node. NB: Instances and services also stop.

% srvctl stop nodeapps -n myserver

-- Example 6. Add the MYSID3 instance, which runs on the myserver node, to the MYSID clustered database.

% srvctl add instance -d MYSID -i MYSID3 -n myserver

-- Example 7. Add a new node, the mynewserver node, to a cluster.

% srvctl add nodeapps -n mynewserver -o $ORACLE_HOME -A 149.181.201.1/255.255.255.0/eth1
(The -A flag precedes an address specification.)

-- Example 8. To change the VIP (virtual IP) on a RAC node, use the command

% srvctl modify nodeapps -A new_address

-- Example 9. Find out whether the nodeapps on mynewserver are up.

% srvctl status nodeapps -n mynewserver

VIP is running on node: mynewserver
GSD is running on node: mynewserver
Listener is not running on node: mynewserver
ONS daemon is running on node: mynewserver

-- Example 10. The following command and output show the expected configuration for a three node 
               database called ORCL.

% srvctl config database -d ORCL

server01 ORCL1 /u01/app/oracle/product/10.1.0/db_1
server02 ORCL2 /u01/app/oracle/product/10.1.0/db_1
server03 ORCL3 /u01/app/oracle/product/10.1.0/db_1


-- Example 11. Disable the ASM instance on myserver for maintenance.

% srvctl disable asm -n myserver


-- Example 12. Debugging srvctl

Debugging srvctl in 10g couldn't be easier. Simply set the SRVM_TRACE environment variable.

% export SRVM_TRACE=true


-- Example 13. Question Version 10G RAC

Q: how to add a listener to the nodeapps using the srvctl command ??
or even if it can be added using srvctl ??

A: just edit listener.ora on all concerned nodes and add entries ( the usual way).
srvctl will automatically make use of it.
For example

% srvctl start database -d SAMPLE

will start database SAMPLE and its associated listener LSNR_SAMPLE. 


-- Example 14. Adding services.

% srvctl add database -d ORCL -o /u01/app/oracle/product/10.1.0/db_1
% srvctl add instance -d ORCL -i ORCL1 -n server01
% srvctl add instance -d ORCL -i ORCL2 -n server02
% srvctl add instance -d ORCL -i ORCL3 -n server03


-- More examples

% srvctl remove instance -d rac -i rac3
% srvctl disable instance -d orcl -i orcl2
% srvctl enable instance -d orcl -i orcl2 


#############################################################################################
#############################################################################################
#############################################################################################


==============================================
Sections 8 and 9: CLUSTERS ON AIX:
==============================================


Section 8: GPFS

========================================
8.1. General Parallel File System (GPFS):
========================================


Only AIX and Linux (pSeries) related.

General Parallel File System (GPFS) is a high performance "shared-disk file system" that can provide data access 
from nodes in a cluster environment. Parallel and serial applications can readily access shared files 
using standard UNIX� file system interfaces, and the same file can be accessed concurrently from multiple nodes. 
GPFS is designed to provide high availability through logging and replication, and can be configured for failover 
from both disk and server malfunctions.

GPFS operates often within the context of a HACMP cluster, but you can build just GPFS "clusters" as well.


8.2. Creating a 2 node GPFS Cluster:
====================================

Suppose we have two nodes named node2 and node3. Our goal is to create a single GPFS filesystem,
named "/my_gpfs", consisting of 2 disks used for data and metadata. These disks are housed by two
DS4300 storage subsystems. A tiebreaker disk, in a seperate DS4100, will be used to maintain node quorom
during single nodes failures. Additionally, a "filesystem descriptor" disk for /my_gpfs is located
at the same site.

Servers: 2 Nodes= 2 x lpar; per lpar 1 cpu, 2GB RAM, 2 x FC adapter, 2 x Ethernet adapter
Storage: 2 x DS4300 for GPFS and data, 1 x DS4100 for tiebreaker disk 

Suppose further that the nodes uses the following IP addresses:
Node2: 10.1.1.32
Node3: 10.1.1.33

The Ethernet adapters per Server, are Aggregated, or configured in NIB (backup standby mode).


  Note : What are Tiebreaker disks?

  GPFS can use two types op quorum mechanisms in order to determine service availability:
  - Disk quorom
  - Node quorom

  In case availability of either of these resources is less or equal to 50%, GPFS file system services are
  automatically stopped.

  When node quorom is not met, GPFS stops its cluster-wide services and access to all filesystems
  within the cluster is no longer possible. If less than 50% of disks serving a GPFS file system fail,
  disk quorom, that is the number of "filesystem descriptors" for that particular file system, 
  is no longer met and the filesystem will be unmounted.

  To eliminate the need of a tiebreaker node, as from GPFS 2.3, a new node quorom mechanism was introduced
  for a two node cluster. Its called a tiebreaker disk. 
  If one of the two nodes goes down, we still have "enough" node qourom to keep the GPFS system running.
  Basically, a tiebreaker disk replaces a "tiebreaker node".


-- Preparations:
-- -------------

1. The systems have AIX >= 5.3ML2 installed, and gpfs.base.xxxx installed
2. Make sure names resolution is ok, either by DNS or by /etc/hosts
3. Sync the system clocks, for example by NTP
4. Make sure rcp, ssh, scp is working (via ./rhosts etc.. or ssh protocols)
5. A distributed shell (DSH) is installed on each node.
6. During cluster setup some configuration files may be created and used with GPFS commands.
   These files reside in a user created directory called /var/mfs/conf.


--  Creating the GPFS cluster:
-- ---------------------------

The first step is to create a GPFS cluster named TbrCl using the command:

# mmcrcluster -n /var/mmfs/conf/nodefile -p node2 -s node3 -C TbrCl -A

A file called "nodefile" contains the cluster node information, describing the function of each node:

  # Node2 can be a file system manager and is relevant for GPFS quorum
  node2:manager-quorom 
  # Node3 can be a file system manager and is relevant for GPFS quorum
  node3:manager-quorom

Each node can fullfill the function of a file system manager and is relevant for maintaining node quorom.
A GPFS cluster designates a primary cluster manager (node2) and appoints a backup (node3) in case the
primary fails. Cluster services will be started automatically during node boot (-A). After successfully
creating the cluster, you can verify your setup:

# mmlscluster 

  GPFS cluster information
  ========================

  GPFS cluster name:		TbrCl.node2
  GPFS cluster id:		720858653441148399
  GPFS UID domain:		TbrCl.node2
  Remote shell command:		/usr/bin/rsh
  Remote file copy command:	/usr/bin/rcp

  GPFS cluster configuration servers:
  -----------------------------------
  Primary server:		node2
  Secondary server: 		node3

  Node number Node name IP address    Full node name    Remarks
  -------------------------------------------------------------
  1           node2     10.1.1.32     node2              quorom node
  2           node3     10.1.1.33     node3              quorom node


The GPFS daemon has to be started on all nodes:

# mmstartup -a

With GPFS you can administer the whole cluster from any cluster node. After starting GPFS services you
should examine the state of the cluster:

# mmgetstate -aL

  Node number Node name Quorom    Nodes up  Total nodes GPFS state
  -------------------------------------------------------------
  1           node2     2         2         2           active    
  2           node3     2         2         2           active


At this point, the cluster software is running, but you haven't done anything yet on the filesystems.


-- Configuring GPFS disks
-- ----------------------

Before starting with the configuration of GPFS disks, you have to make sure that each cluster node has
access to each SAN attached disk when running in a shared disk environment. With AIX 5L, you can use
the lspv command to verify your disks (hdisk) are properly configured:

# lspv

hdisk2   none     none
hdisk3   none     none
hdisk4   none     none
hdisk5   none     none

If you look for LUN related information (e.g. volume names) issue the following command against a
dedicated hdisk:

# lsattr -El hdisk2

..
.... (in the output, you will also see SAN stuff)
..


Its very important to keep a well balanced disk configuration when using GPFS because this makes sure
you get optimal performance by distributing I/O requests evenly among storage subsystems and attached
data disks. Keep in mind that all GPFS disks belonging to a particular file system should be of same size.


GPFS uses a mechanism called Network Shared Disk (NSD) to provide file system access to cluster nodes,
which do not have direct physical access to file system disks. A diskless node accesses an NSD via the
cluster network and I/O operations are handled as if they run against a directly attached disk from
an operating systems perspective. A special device driver handles data shipping using the cluster network.
NSDs can also be used in a purely SAN based GPFS configuration where each node can directly access
any disk. In case a node looses direct disk access, it automatically switches to NSD-mode, sending I/O
requests via network to other direct direct disk attached nodes. This mechanism increases file system
availability, and should normally be used.

When using NSD, a primary and a backup server are assigned to each NSD. In case a node looses its
direct disk attachment, it contacts the primary NSD server, or backup server in case the primary
is not available.

In order to establish NSD you need to create "descriptor files" in order to describe each 
disk functionality. In our example, we will use the following file:

 /var/mmfs/conf/diskfile

  #Description of disk attributes
  #<disk name>:<primary NSD server>:<2ndary NSD server>:<disk usage>:<failure group>:<NSD name>

  #Data and metadata disk for /my_gpfs, site A, DS4300_1
  hdisk2:node2:node3:dataAndMetadata:1:

  #Data and metadata disk for /my_gpfs, site B, DS4300_2
  hdisk3:node3:node2:dataAndMetadata:2:

  #File system descriptor disk for /my_gpfs, site C, DS4100
  hdisk4:::descOnly:3:

  #Tiebreaker disk, site C, DS4100
  hdisk5:::descOnly:-1:

Here, our cluster uses 4 disks with GPFS. Filesystem "/my_gpfs" uses hdisk2 and hdisk3 for data and metadata.
Therefore these disks will use the NSD mechanism to provide file system data access in case direct disk access
fails on one of the cluster nodes.
Node2 is the primary NSD server for hdisk2 with node3 being its backup. The same is true for hdisk3, but then
the other way around.
Each of these disks belongs to a different "failure group" (1=site A, 2=site B) which basically enables
replication of file system data and metadata between the two sites.

After successfully creating the "disk descriptor file", the following command is used to define the NSDs:


# mmcrnsd -F /var/mmfs/conf/diskfile -v yes


GPFS assigns a Physical Volume ID PVID to each of the disks. This information is written to sector 2
on the AIX5L hdisk. Since GPFS uses its own PVIDs, do not confuse them with AIX5L PVIDs.

After a successful creation of the NSDs, you can verify your setup using the mmlsnsd command:


# mmlsnsd -aL

File system    Disk name     NSD Volume ID     Primary node         Backup node
-------------------------------------------------------------------------------
(free disk)    gpfs1nsd      099CAF2043A04625  node2                node3
(free disk)    gpfs2nsd      099CAF2043A04627  node3                node2
(free disk)    gpfs3nsd      099CAF2043A04628  (directly attached)
(free disk)    gpfs4nsd      099CAF2043A04629  (directly attached)

During NSD creation, the diskfile was rewritten. Each hdisk stanza is commented out, and a
equivalent NSD stanza is inserted.

  #<disk name>:<primary NSD server>:<2ndary NSD server>:<disk usage>:<failure group>:<NSD name>

  #Data and metadata disk for /my_gpfs, site A, DS4300_1
  #hdisk2:node2:node3:dataAndMetadata:1:
  gpfs1nsd:::dataAndMetadata:1

  #Data and metadata disk for /my_gpfs, site B, DS4300_2
  #hdisk3:node3:node2:dataAndMetadata:2:
  gpfs2nsd:::dataAndMetadata:2

  #File system descriptor disk for /my_gpfs, site C, DS4100
  #hdisk4:::descOnly:3:
  gpfs3nsd:::descOnly:3

  #Tiebreaker disk, site C, DS4100
  #hdisk5:::descOnly:-1:
  gpfs4nsd:::descOnly:-1


`
-- Activating tiebreaker mode
-- --------------------------

When using a two node cluster with tiebraker disks, the cluster configuration must be switched
to tiebreaker mode. Ofcourse you need to know which disks are being used as tiebreaker disks.
Up to 3 disks are allowed. In our example, gpfs4nsd (that is hdisk5) is the only tiebreaker disk.
With the following command sequence, tiebreaker mode is turned on:

# mmshutdown -a
# mmstartup -a

A 2 node cluster running in tiebreaker mode can easily be identified by running the following command:

# mmgetstate -aL


  Node number Node name   Quorom    Nodes up  Total nodes GPFS state
  ---------------------------------------------------------------
  1           node2       1*        2         2           active    
  2           node3       1*        2         2           active


If the quorum information is displayed as "1*", this is a 2 node tiebreaker disk cluster.
Another nice command to check the status of the cluster is "mmlsconfig".

# mmlsconfig

  Configuration data for cluster TbrCl.node2:
  -------------------------------------------
  ClusterName TbrCl.node2
  ClusterId 8262362723390
  ClusterType 1c
  Multinode yes
  autoload yes
  useDiskLease yes
  MaxFeatureLevelAllowed 809
  tiebreakerDisks gpfs4nsd


-- Creating a GPFS Filesystem
-- --------------------------

GPFS generally maintains at least 3 filesystem descriptors, or quorum, per filesystem.
Best would be, to have the descriptors distributed over many disks. But you might have
only 2 disks, resulting in 2 copies on one disk, and 1 copy on the other disk.
That would be an unbalanced situation. GPFS always verifies if more than 50% of the
filesystem disks are available, and if not, it will unmount the filesystem.

Before we can create the /my_gpfs filesystem we need to prepare a file named "fsdisks_mygpfs"
describing all disks belonging to the filesystem.
In our example, we use only 2 disks for the filesystem, but we like to have a balanced situation
with at least 3 descriptor area's. For this, we can use "#hdisk4:::descOnly:3:"
as shown before as an entry in the "nsd diskfile".

Our "fdisk_mygpfs" looks like this:

  #<disk name>:<primary NSD server>:<2ndary NSD server>:<disk usage>:<failure group>:<NSD name>

  #Data and metadata disk for /my_gpfs, site A, DS4300_1
  gpfs1nsd:::dataAndMetadata:1

  #Data and metadata disk for /my_gpfs, site B, DS4300_2
  gpfs2nsd:::dataAndMetadata:2

  #File system descriptor disk for /my_gpfs, site C, DS4100
  gpfs3nsd:::descOnly:3


The next step is to create the file system:

# mmcrfs /my_gpfs /dev/my_gpfs -F /var/mmfs/conf/fdisk_mygpfs -A yes -m2 -M2 -r2 -R2 -v yes


The mountpoint is /my_gpfs and a device called /dev/my_gpfs is created. The option -F is used to specify
a configuration file describing the filesystem's NSDs. We want this filesystem to be mounted automatically
during startup (-A yes). When designing our cluster, we decided to use data and metadata replication (-r2,-m2)
to provide high availability.

If you intend to create several filesystems within your cluster, repeat all the steps as shown above.


-- mounting a GPFS Filesystem
-- --------------------------

Filesystem "/my_gpfs" will be mounted on each of the cluster nodes using the command:

# dsh -a mount -t mmfs

The command dsh is the Distributed Shell, wich should be available on your AIX53 systems.
Your GPFS filesystem is also registered in /etc/filesystems. Also, standard AIX commands can be used against
the GPFS filesystems, like for example:

# dsh -w node2,node3 df -k /my_gpfs

Filesystem /my_gpfs is now available to both nodes with all three file system descripters being well
balanced across failure groups and disks.

# mmlsdisk my_gpfs

disk            driver     sector   failure   holds    holds 
name            type       size     group     metadata data  status    availability  disk id  remarks
-----------------------------------------------------------------------------------------------------
gpfs1nsd        nsd        512      1         yes      yes   ready     up             1       desc
gpfs2nsd        nsd        512      2         yes      yes   ready     up             2       desc
gpfs3nsd        nsd        512      3         no       no    ready     up             3       desc


root@zd111l13.nl.eu.abnamro.com:/data/documentum/dmadmin#mmlsdisk /dev/gpfsfs0

disk         driver   sector failure holds    holds                            storage
name         type       size   group metadata data  status        availability pool
------------ -------- ------ ------- -------- ----- ------------- ------------ ------------
gpfs3nsd     nsd         512       1 yes      yes   ready         up           system
gpfs4nsd     nsd         512       2 yes      yes   ready         up           system


Notes:
------

Note 1: SDD driver

Subsystem Device Driver, SDD, is a pseudo driver designed to support the multipath configuration environments
in the IBM Totalstorage Enterprise Storage Server, the IBM TotalStorage DS family, and the IBM System Storage
SAN Volume Controller.  
You can see this driver installed, for example, in HACMP and GPFS systems.
 
At this time, SSD version 1.6.1.0 is not supported by VIOS. Ofcourse, this might change later.

Note 2: pv listing:

In a gpfs cluster, a lspv might show output like the following example:

root@zd110l13:/root# lspv
hdisk0          00cb61fe0b562af0                    rootvg          active
hdisk1          00cb61fe0fb40619                    rootvg          active
hdisk2          00cb61fe33429fa6                    vge0corddap01   active
hdisk3          00cb61fe3342a096                    vge0corddap01   active
hdisk4          00cb61fe3342a175                    gpfs3nsd
hdisk5          00cb61fe33536125                    gpfs4nsd

root@zd110l13:/root# mmlsnsd -aL

 File system   Disk name    NSD volume ID      Primary node             Backup node
---------------------------------------------------------------------------------------------
 gpfsfs0       gpfs3nsd     0A208FB64650A409   zd110l13                 zd110l14.nl.eu.abnamro.com
 gpfsfs0       gpfs4nsd     0A208FB64650A40D   zd110l13                 zd110l14.nl.eu.abnamro.com


8.3 GPFS commands:
===================


8.3.1 The mmcrcluster Command:
--------------------------------

Name
mmcrcluster - Creates a GPFS cluster from a set of nodes.

Synopsis
mmcrcluster -n NodeFile -p PrimaryServer [-s SecondaryServer] [-r RemoteShellCommand] 
               [-R RemoteFileCopyCommand] [-C ClusterName] [-U DomainName] [-A] [-c ConfigFile]

Description
Use the mmcrcluster command to create a GPFS cluster.

Upon successful completion of the mmcrcluster command, the /var/mmfs/gen/mmsdrfs and the /var/mmfs/gen/mmfsNodeData 
files are created on each of the nodes in the cluster. Do not delete these files under any circumstances. 
For further information, see the General Parallel File System: Concepts, Planning, and Installation Guide.

You must follow these rules when creating your GPFS cluster:

While a node may mount file systems from multiple clusters, the node itself may only be added to a single cluster 
using the mmcrcluster or mmaddnode command. 
The nodes must be available for the command to be successful. If any of the nodes listed are not available 
when the command is issued, a message listing those nodes is displayed. You must correct the problem on each node 
and issue the mmaddnode command to add those nodes. 
You must designate at least one node as a quorum node. You are strongly advised to designate the cluster 
configuration servers as quorum nodes. How many quorum nodes altogether you will have depends on whether 
you intend to use the node quorum with tiebreaker algorithm. or the regular node based quorum algorithm. 
For more details, see the General Parallel File System: Concepts, Planning, and Installation Guide and 
search for designating quorum nodes.

Parameters
-A 
Specifies that GPFS daemons are to be automatically started when nodes come up. The default is not to start 
daemons automatically. 
-C ClusterName 
Specifies a name for the cluster. If the user-provided name contains dots, it is assumed to be a fully 
qualified domain name. Otherwise, to make the cluster name unique, the domain of the primary configuration 
server will be appended to the user-provided name. 
If the -C flag is omitted, the cluster name defaults to the name of the primary GPFS cluster configuration server.

-c ConfigFile 
Specifies a file containing GPFS configuration parameters with values different than the documented defaults. 
A sample file can be found in /usr/lpp/mmfs/samples/mmfs.cfg.sample. See the mmchconfig command for a detailed 
description of the different configuration parameters. 
The -c ConfigFile parameter should only be used by experienced administrators. Use this file to only set up 
parameters that appear in the mmfs.cfg.sample |file. Changes to any other values may be ignored by GFPS. 
When in doubt, use the mmchconfig command instead.

-n NodeFile 
NodeFile consists of a list of node descriptors, one per line, to be included in the GPFS cluster. 
Node descriptors are defined as: 

NodeName:NodeDesignationswhere: 

NodeName is the hostname or IP address to be used by GPFS for node to node communication. 
The hostname or IP address must refer to the communications adapter over which the GPFS daemons communicate. 
Alias interfaces are not allowed. Use the original address or a name that is resolved by the host command 
to that original address. You may specify a node using any of these forms:

Format Example 
Short hostname   k145n01 
Long hostname    k145n01.kgn.ibm.com 
IP address       9.119.19.102 

NodeDesignations is an optional, '-' separated list of node roles. 
manager | client   - Indicates whether a node is part of the pool of nodes from which configuration and 
                     file system managers are selected. The default is client. 
quorum | nonquorum - Indicates whether a node is to be counted as a quorum node. The default is nonquorum.

You must provide a descriptor for each node to be added to the GPFS cluster.

-p PrimaryServer 
Specifies the primary GPFS cluster configuration server node used to store the GPFS configuration data. 
This node must be a member of the GPFS cluster. 
-R RemoteFileCopy 
Specifies the fully-qualified path name for the remote file copy program to be used by GPFS. The default value is 
/usr/bin/rcp. 
The remote copy command must adhere to the same syntax format as the rcp command, but may implement an 
alternate authentication mechanism.

-r RemoteShellCommand 
Specifies the fully-qualified path name for the remote shell program to be used by GPFS. The default value is 
/usr/bin/rsh. 
The remote shell command must adhere to the same syntax format as the rsh command, but may implement an 
alternate authentication mechanism.

-s SecondaryServer 
Specifies the secondary GPFS cluster configuration server node used to store the GPFS cluster data. 
This node must be a member of the GPFS cluster. 
It is suggested that you specify a secondary GPFS cluster configuration server to prevent the loss of 
configuration data in the event your primary GPFS cluster configuration server goes down. When the GPFS daemon 
starts up, at least one of the two GPFS cluster configuration servers must be accessible.

If your primary GPFS cluster configuration server fails and you have not designated a secondary server, 
the GPFS cluster configuration files are inaccessible, and any GPFS administrative commands that are issued fail. 
File system mounts or daemon startups also fail if no GPFS cluster configuration server is available.

-U DomainName 
Specifies the UID domain name for the cluster. 
A detailed description of the GPFS user ID remapping convention is contained in UID Mapping for GPFS In a 
Multi-Cluster Environment at www.ibm.com/servers/eserver/clusters/library/wp_aix_lit.html.

Exit status

0 
Successful completion. 
1 
A failure has occurred. 

Security
You must have root authority to run the mmcrcluster command.

You may issue the mmcrcluster command from any node in the GPFS cluster.

A properly configured .rhosts file must exist in the root user's home directory on each node in the GPFS cluster. 
If you have designated the use of a different remote communication program on either the mmcrcluster or the 
mmchcluster command, you must ensure:

Proper authorization is granted to all nodes in the GPFS cluster. 
The nodes in the GPFS cluster can communicate without the use of a password, and without any extraneous messages.


Example 1:
----------

To create a GPFS cluster made of all of the nodes listed in the file /u/admin/nodelist, using node k164n05 
as the primary server, and node k164n04 as the secondary server, issue:

# mmcrcluster  -n /u/admin/nodelist -p k164n05 -s k164n04

where /u/admin/nodelist has the these contents:

k164n04.kgn.ibm.com:quorum
k164n05.kgn.ibm.com:quorum
k164n06.kgn.ibm.com

The output of the command is similar to:

Mon Aug  9 22:14:34 EDT 2004: 6027-1664 mmcrcluster: Processing node
                              k164n04.kgn.ibm.com
Mon Aug  9 22:14:38 EDT 2004: 6027-1664 mmcrcluster: Processing node 
                              k164n05.kgn.ibm.com
Mon Aug  9 22:14:42 EDT 2004: 6027-1664 mmcrcluster: Processing node 
                              k164n06.kgn.ibm.com
mmcrcluster: Command successfully completed
mmcrcluster: 6027-1371 Propagating the changes to all affected.
                       nodes. This is an asynchronous process.

To confirm the creation, enter: 

# mmlscluster

The system displays information similar to:

GPFS cluster information
========================
  GPFS cluster name:         k164n05.kgn.ibm.com
  GPFS cluster id:           680681562214606028
  GPFS UID domain:           k164n05.kgn.ibm.com
  Remote shell command:      /usr/bin/rsh
  Remote file copy command:  /usr/bin/rcp

GPFS cluster configuration servers:
-------------------------------------
  Primary server:    k164n05.kgn.ibm.com
  Secondary server:  k164n04.kgn.ibm.com

 Node number  Node name  IP address      Full node name       Remarks

--------------------------------------------------------------------------
       1      k164n04    198.117.68.68   k164n04.kgn.ibm.com  quorum node
       2      k164n05    198.117.68.69   k164n05.kgn.ibm.com  quorum node
       3      k164n06    198.117.68.70   k164n06.kgn.ibm.com  


Example 2:
----------

# mmcrcluster  -n /home/root/nodelist -p zcnodeb -s n5nodea -r /usr/bin/rsh 
  -R /usr/bin/rcp -C MDLPR -A

Where the -C option determines the clustername.

You can start the cluster (GPFS daemon) by using

# mmstartup -a

Check if all nodes are registered in the cluster

# mmlscluster


8.3.2 Other GPFS commands:
---------------------------

The most common gpfs commands, will be illustrated by examples.


-- List cluster info: mmlscluster
-- ------------------------------

# mmlscluster

The system displays information similar to:

GPFS cluster information
========================
  GPFS cluster name:         k164n05.kgn.ibm.com
  GPFS cluster id:           680681562214606028
  GPFS UID domain:           k164n05.kgn.ibm.com
  Remote shell command:      /usr/bin/rsh
  Remote file copy command:  /usr/bin/rcp

GPFS cluster configuration servers:
-------------------------------------
  Primary server:    k164n05.kgn.ibm.com
  Secondary server:  k164n04.kgn.ibm.com

 Node number  Node name  IP address      Full node name       Remarks

--------------------------------------------------------------------------
       1      k164n04    198.117.68.68   k164n04.kgn.ibm.com  quorum node
       2      k164n05    198.117.68.69   k164n05.kgn.ibm.com  quorum node
       3      k164n06    198.117.68.70   k164n06.kgn.ibm.com  


-- Retrieving the Cluster status:
-- ------------------------------

# mmgetstate -aL

  Node number Node name Quorom    Nodes up  Total nodes GPFS state
  -------------------------------------------------------------
  1           node2     2         2         2           active    
  2           node3     2         2         2           active


-- Retreiving config data of the Cluster:
-- --------------------------------------

# mmlsconfig

  Configuration data for cluster TbrCl.node2:
  -------------------------------------------
  ClusterName TbrCl.node2
  ClusterId 8262362723390
  ClusterType 1c
  Multinode yes
  autoload yes
  useDiskLease yes
  MaxFeatureLevelAllowed 809
  tiebreakerDisks gpfs4nsd


root@zd110l13:/root#mmlsconfig
Configuration data for cluster cluster_name.zd110l13:
-----------------------------------------------------
clusterName cluster_name.zd110l13
clusterId 729741152660153204
clusterType lc
autoload no
useDiskLease yes
maxFeatureLevelAllowed 912
tiebreakerDisks gpfs3nsd;gpfs4nsd
[zd110l13]
takeOverSdrServ yes

File systems in cluster cluster_name.zd110l13:
----------------------------------------------
/dev/gpfsfs0


root@zd110l13:/var/adm/ras#df -k | grep /dev/gpfsfs0
/dev/gpfsfs0   2097152000 2009668608    5%   101193     5% /data/documentum/dmadmin


-- Change the status of a disk, and listing status: mmchdisk and mmlsdisk
-- ----------------------------------------------------------------------

You can even simulate the loss of a NSD disk from a Cluster, for example

# mmchdisk my_gpfs stop -d "gpfs1nsd"
# mmlsdisk my_gpfs -L

disk            driver     sector   failure   holds    holds 
name            type       size     group     metadata data  status    availability  disk id  remarks
-----------------------------------------------------------------------------------------------------
gpfs1nsd        nsd        512      1         yes      yes   ready     down           1       desc
gpfs2nsd        nsd        512      2         yes      yes   ready     up             2       desc
gpfs3nsd        nsd        512      3         no       no    ready     up             3       desc

We have used the example of the 2 node cluster of section 74.1 here. Since the quorom is still met,
even with one disk "down", the service is still working.


-- Changes GPFS cluster configuration data. 
-- ----------------------------------------

The mmchcluster command serves different purposes: 

Change the primary or secondary GPFS cluster data server. 
Synchronize the primary GPFS cluster data server. 
Change the remote shell and remote file copy programs to be used by the nodes in the cluster. 

To change the primary GPFS server for the cluster, enter: 

# mmchcluster -p k145n03

 
-- Changes the attributes of a GPFS file system
-- --------------------------------------------

Use the mmchfs command to change the attributes of a GPFS file system.

To change the default replicas for metadata to 2 and the default replicas for data to 2 for new files 
created in the fs0 file system, enter:

# mmchfs fs0 -m 2 -r 2

To confirm the change, enter:

# mmlsfs fs0 -m -r

The system displays information similar to:

flag value          description
---- -------------- -----------------------------------
 -m  2              Default number of metadata replicas
 -r  2              Default number of data replicas

More examples:


-- Add a node to the cluster
-- -------------------------

The mmaddnode command adds nodes to a GPFS cluster.
Use the mmaddnode command to add nodes to an existing GPFS cluster. On each new node a mount point directory
and character mode device is created for each GPFS filesystem.

Example:
To add the nodes "k164n06" and "k164n07" as quorom nodes, designating "k164n06" to be available as 
manager node, use the following command:

# mmaddnode -N k164n06:quorom-manager,k164n07:quorom


-- Mounting and unmounting GPFS file
-- ----------------------------------

Use the mmmount and mmumount to mount or unmount GPFS filesystem on one or more nodes in the cluster.

Examples:

- To mount all GPFS filesystems on all of the nodes in the cluster:

# mmmount all -a

- To mount filesystem "fs2" read-only on the local node, use

# mmmount fs2 -o ro

- To mount fs1 on all NSD server nodes, use

# mmmount fs1 -N nsdnodes  

- To unmount fs1 on all nodes of the cluster, use

# mmumount fs1 -a


-- Creates cluster-wide names for Network Shared Disks (NSDs) used by GPFS
-- -----------------------------------------------------------------------

mmcrnsd -F DescFile [-v {yes |no}]

The mmcrnsd command is used to create cluster-wide names for NSDs used by GPFS.

This is the first GPFS step in preparing a disk for use by a GPFS file system. A disk descriptor file supplied 
to this command is rewritten with the new NSD names and that rewritten disk descriptor file can then be supplied 
as input to the mmcrfs command.

The name created by the mmcrnsd command is necessary since disks connected at multiple nodes may have differing 
disk device names in /dev on each node. The name uniquely identifies the disk. This command must be run 
for all disks that are to be used in GPFS file systems. The mmcrnsd command is also used to assign a 
primary and backup NSD server that can be used for I/O operations on behalf of nodes that do not have 
direct access to the disk.

To identify that the disk has been processed by the mmcrnsd command, a unique NSD volume ID is written on 
sector 2 of the disk. All of the NSD commands (mmcrnsd, mmlsnsd, and mmdelnsd) use this unique 
NSD volume ID to identify and process NSDs.

After the NSDs are created, the GPFS cluster data is updated and they are available for use by GPFS.

Examples:

To create your NSDs from the descriptor file nsdesc containing: 

 sdav1:k145n05:k145n06:dataOnly:4
 sdav2:k145n04::dataAndMetadata:5:ABC

enter:

# mmcrnsd -F nsdesc 


8.4 Installing GPFS:
=====================

Installing GPFS V. 2.3 or v. 3.1


Installing GPFS on AIX 5L nodes
It is suggested you read Planning for GPFS and the GPFS FAQs at 
publib.boulder.ibm.com/infocenter/clresctr/topic/com.ibm.cluster.gpfs.doc/gpfs_faqs/gpfsclustersfaq.html.

Do not attempt to install GPFS if you do not have the prerequisites listed in Hardware requirements 
and Software requirements.

Ensure that the PATH environment variable on each node includes /usr/lpp/mmfs/bin.

The installation process includes:

-Files to ease the installation process 
-Verifying the level of prerequisite software 
-Installation procedures

>> Files to ease the installation process

Creation of a file that contains all of the nodes in your GPFS cluster prior to the installation of GPFS, 
will be useful during the installation process. Using either host names or IP addresses when constructing 
the file will allow you to use this information when creating your cluster through the mmcrcluster command.

For example, create the file /tmp/gpfs.allnodes, listing the nodes one per line: 

k145n01.dpd.ibm.com 
k145n02.dpd.ibm.com 
k145n03.dpd.ibm.com 
k145n04.dpd.ibm.com 
k145n05.dpd.ibm.com 
k145n06.dpd.ibm.com 
k145n07.dpd.ibm.com 
k145n08.dpd.ibm.com 


>> Verifying the level of prerequisite software

It is necessary to verify you have the correct levels of the prerequisite software installed. If the correct level 
of prerequisite software is not installed, see the appropriate installation manual before proceeding with your 
GPFS installation: 

1. AIX 5L Version 5 Release 2 with the latest level of service available 

   # WCOLL=/tmp/gpfs.allnodes dsh "oslevel"

   Output similar to this should be displayed: 
   5.2.0.10

2. AIX 5L Version 5 Release 3 with the latest level of service available 

   # WCOLL=/tmp/gpfs.allnodes dsh "oslevel"

   Output similar to this should be displayed: 
   5.3.0.0
   If you are utilizing NFS V4, at a minimum your output should include: 
   5.3.0.10


>>Installation procedures

The installation procedures are generalized for all levels of GPFS. Ensure you substitute the correct 
numeric value for the modification (m) and fix (f) levels, where applicable. The modification and fix 
level are dependent upon the level of PTF support.

Follow these steps to install the GPFS software using the installp command:

1. Electronic license agreement 
2. Creating the GPFS directory 
3. Creating the GPFS installation table of contents file 
4. Installing the GPFS man pages 
5. Installing GPFS on your network 
6. Existing GPFS files 
7. Verifying the GPFS installation


--1. Electronic license agreement

The GPFS software license agreements is shipped and viewable electronically. The electronic license agreement 
must be accepted before software installation can continue.

For additional software package installations, the installation cannot occur unless the appropriate 
license agreements are accepted. When using the installp command, use the -Y flag to accept licenses 
and the -E flag to view license agreement files on the media.

--2. Creating the GPFS directory

To create the GPFS directory:

On any node create a temporary subdirectory where GPFS installation images will be extracted. For example: 

# mkdir  /tmp/gpfslpp

Copy the installation images from the CD-ROM to the new directory, by issuing: 

# bffcreate -qvX -t /tmp/gpfslpp -d /dev/cd0 all

This will place the following GPFS images in the image directory :

gpfs.base 
gpfs.docs 
gpfs.msg.en_US


--3. Creating the GPFS installation table of contents file

Make the new image directory the current directory: 

# cd /tmp/gpfslpp

Use the inutoc command to create a .toc file. The .toc file is used by the installp command. 

# inutoc .

--4. Installing the GPFS man pages

In order to use the GPFS man pages you must install the gpfs.docs image. The GPFS manual pages will be 
located at /usr/share/man/.

Installation consideration:
The gpfs.docs image need not be installed on all nodes if man pages are not desired or local file system space 
on the node is minimal.

--5. Installing GPFS on your network

Install GPFS according to these directions, where localNode is the name of the node on which you are running:

If you are installing on a shared file system network, ensure the directory where the GPFS images can be found 
is NFS exported to all of the nodes planned for your GPFS cluster (/tmp/gpfs.allnodes). 

Ensure an acceptable directory or mountpoint is available on each target node, such as /tmp/gpfslpp. 
If there is not, create one: 

# WCOLL=/tmp/gpfs.allnodes dsh "mkdir /tmp/gpfslpp"

If you are installing on a shared file system network, to place the GPFS images on each node in your network, 
issue: 

# WCOLL=/tmp/gpfs.allnodes dsh "mount localNode:/tmp/gpfslpp /tmp/gpfslpp"

Otherwise, issue: 

# WCOLL=/tmp/gpfs.allnodes dsh "rcp localNode:/tmp/gpfslpp/gpfs* /tmp/gpfslpp"
# WCOLL=/tmp/gpfs.allnodes dsh "rcp localNode:/tmp/gpfslpp/.toc /tmp/gpfslpp"

Install GPFS on each node: 

# WCOLL=/tmp/gpfs.allnodes dsh "installp -agXYd /tmp/gpfslpp gpfs" 

--6. Existing GPFS files

If you have previously installed GPFS on your system, during the install process you may see 
messages similar to:

Some configuration files could not be automatically merged into the
system during the installation.  The previous versions of these files
have been saved in a configuration directory as listed below.  Compare
the saved files and the newly installed files to determine if you need
to recover configuration data.  Consult product documentation to
determine how to merge the data.

Configuration files which were saved in /lpp/save.config:
  /var/mmfs/etc/gpfsready
  /var/mmfs/etc/gpfsrecover.src
  /var/mmfs/etc/mmfsdown.scr
  /var/mmfs/etc/mmfsup.scr

If you have made changes to any of these files, you will have to reconcile the differences with the 
new versions of the files in directory /var/mmfs/etc. This does not apply to file /var/mmfs/etc/mmfs.cfg 
which is automatically maintained by GPFS.

--7. Verifying the GPFS installation

Use the lslpp command to verify the installation of GPFS file sets on each node:

lslpp -l gpfs\* 

Output similar to the following should be returned:

  Fileset                      Level  State      Description         
  ----------------------------------------------------------------------------
Path: /usr/lib/objrepos
gpfs.base              2.3.0.0  COMMITTED  GPFS File Manager
gpfs.docs.data         2.3.0.0  COMMITTED  GPFS Server Manpages
gpfs.msg.en_US         2.3.0.0  COMMITTED  GPFS Server Messages - U.S. English
Path: /etc/objrepos
gpfs.base              2.3.0.0  COMMITTED  GPFS File Manager


Example:

root@zd110l14:/root#lslpp -L "*gpfs*"
  Fileset                      Level  State  Type  Description (Uninstaller)
  ----------------------------------------------------------------------------
  gpfs.base                 3.1.0.11    C     F    GPFS File Manager
  gpfs.docs.data             3.1.0.4    C     F    GPFS Server Manpages and
                                                   Documentation
  gpfs.msg.en_US            3.1.0.10    C     F    GPFS Server Messages - U.S.
                                                   English


State codes:
 A -- Applied.
 B -- Broken.
 C -- Committed.
 E -- EFIX Locked.
 O -- Obsolete.  (partially migrated to newer version)
 ? -- Inconsistent State...Run lppchk -v.

Type codes:
 F -- Installp Fileset
 P -- Product
 C -- Component
 T -- Feature
 R -- RPM Package
 E -- Interim Fix


root@zd110l14:/root#lslpp -l gpfs\*
  Fileset                      Level  State      Description
  ----------------------------------------------------------------------------
Path: /usr/lib/objrepos
  gpfs.base                 3.1.0.11  COMMITTED  GPFS File Manager
  gpfs.msg.en_US            3.1.0.10  COMMITTED  GPFS Server Messages - U.S.
                                                 English

Path: /etc/objrepos
  gpfs.base                 3.1.0.11  COMMITTED  GPFS File Manager

Path: /usr/share/lib/objrepos
  gpfs.docs.data             3.1.0.4  COMMITTED  GPFS Server Manpages and
                                                 Documentation


8.5 GPFS error messages:
=========================


The MMFS log
GPFS writes both operational messages and error data to the MMFS log file. The MMFS log can be found 
in the /var/adm/ras directory on each node. The MMFS log file is named mmfs.log.date.nodeName, where date 
is the time stamp when the instance of GPFS started on the node and nodeName is the name of the node. 
The latest mmfs log file can be found by using the symbolic file name /var/adm/ras/mmfs.log.latest. 
The MMFS log from the previous instance of GPFS can be found by using the symbolic file name 
/var/adm/ras/mmfs.log.previous. All other files have a timestamp and node name appended to the file name.

Example:

root@zd110l13:/var/adm/ras#cat mmfs.log.latest
Sun May 20 22:10:37 DFT 2007 runmmfs starting
Removing old /var/adm/ras/mmfs.log.* files:
Loading kernel extension from /usr/lpp/mmfs/bin . . .
GPFS: 6027-500 /usr/lpp/mmfs/bin/aix64/mmfs64 loaded and configured.
Sun May 20 22:10:39 2007: GPFS: 6027-310 mmfsd64 initializing. {Version: 3.1.0.11   Built: Apr  6 2007 09:38:56} ...
Sun May 20 22:10:44 2007: GPFS: 6027-1710 Connecting to 10.32.143.184 zd110l14.nl.eu.abnamro.com
Sun May 20 22:10:44 2007: GPFS: 6027-1711 Connected to 10.32.143.184 zd110l14.nl.eu.abnamro.com
Sun May 20 22:10:44 2007: GPFS: 6027-300 mmfsd ready
Sun May 20 22:10:44 DFT 2007: mmcommon mmfsup invoked
Sun May 20 22:10:44 DFT 2007: mounting /dev/gpfsfs0
Sun May 20 22:10:44 2007: Command: mount gpfsfs0 323816
Sun May 20 22:10:46 2007: Command: err 0: mount gpfsfs0 323816
Sun May 20 22:10:46 DFT 2007: finished mounting /dev/gpfsfs0


At GPFS startup, files that have not been accessed during the last ten days are deleted. 
If you want to save old files, copy them elsewhere.

This example shows normal operational messages that appear in the MMFS log file:

Tue Aug 31 16:02:43 edt 2004 runmmfs starting
Removing old /var/adm/ras/mmfs.log.* files:
mv: 0653-401 Cannot rename /var/adm/ras/mmfs.log.previous to /var/adm/ras/mmfs.log.previous.save:
             A file or directory in the path name does not exist.
Loading kernel extension from /usr/lpp/mmfs/bin . . .
/usr/lpp/mmfs/bin/vcmdummy64 loaded and configured
/usr/lpp/mmfs/bin/aix64/mmfs64 loaded and configured.
Tue Aug 31 16:02:44 2004: GPFS: 6027-310 mmfsd64 initializing. {Version: 3.7.0.0 
    Built: Aug 30 2004 17:10:20} ...
Tue Aug 31 16:02:54 2004: GPFS: 6027-1710 Connecting to 198.16.0.9 k154gn09
Tue Aug 31 16:02:55 2004: GPFS: 6027-1711 Connected to 198.16.0.9 k154gn09
Tue Aug 31 16:02:55 2004: GPFS: 6027-1709 Accepted and connected to 198.16.0.2 k154gn02
Tue Aug 31 16:02:55 2004: GPFS: 6027-1709 Accepted and connected to 198.16.0.18 k155gn02
Tue Aug 31 16:02:55 2004: GPFS: 6027-1709 Accepted and connected to 198.16.0.49 kolt1g_r1b32
Tue Aug 31 16:02:55 2004: GPFS: 6027-1709 Accepted and connected to 198.16.0.17 k155gn01
Tue Aug 31 16:02:55 2004: GPFS: 6027-1710 Connecting to 198.16.0.10 k154gn10
Tue Aug 31 16:02:55 2004: GPFS: 6027-1709 Accepted and connected to 198.16.0.35
Tue Aug 31 16:02:55 2004: GPFS: 6027-1709 Accepted and connected to 198.16.0.5
Tue Aug 31 16:02:57 2004: GPFS: 6027-1709 Accepted and connected to 198.16.0.23
Tue Aug 31 16:02:57 2004: GPFS: 6027-1709 Accepted and connected to 198.16.0.6
Tue Aug 31 16:02:57 2004: GPFS: 6027-1709 Accepted and connected to 198.16.0.21
Tue Aug 31 16:03:00 edt 2004 /var/mmfs/etc/gpfsready invoked
Tue Aug 31 16:03:00 2004: GPFS: 6027-300 mmfsd ready
Tue Aug 31 16:03:00 2004: GPFS: 6027-1709 Accepted and connected to 198.16.0.10 k154gn10
Tue Aug 31 16:03:00 edt 2004: mounting /dev/fs3
Tue Aug 31 16:03:00 2004: Command: mount fs3 594128 

Depending on the size and complexity of your system configuration, the amount of time to start GPFS varies. 
Taking your system configuration into consideration, after a reasonable amount of time if you cannot access 
the file system look in the log file for error messages.

The GPFS log is a repository of error conditions that have been detected on each node, as well as 
operational events such as file system mounts. The GPFS log is the first place to look when attempting 
to debug abnormal events. Since GPFS is a cluster file system, events that occur on one node may affect 
system behavior on other nodes, and all GPFS logs may have relevant data.


GPFS for AIX 5L V2.2 in an HACMP Cluster
Problem Determination Guide

The operating system error log facility
GPFS records file system or disk failures using the error logging facility provided by the 
operating system: syslog facility on Linux and errpt facility on AIX. For the remainder of this book, 
the error logging facility will be referred to as 'the error log'.

These failures can be viewed by issuing this command: 

errpt -a
The error log contains information about several classes of events or errors. These classes are:

MMFS_ABNORMAL_SHUTDOWN 
MMFS_DISKFAIL 
MMFS_ENVIRON 
MMFS_FSSTRUCT 
MMFS_GENERIC 
MMFS_LONGDISKIO 
MMFS_PHOENIX 
MMFS_QUOTA 
MMFS_SYSTEM_UNMOUNT 
MMFS_SYSTEM_WARNING
MMFS_ABNORMAL_SHUTDOWN

The MMFS_ABNORMAL_SHUTDOWN error log entry means that GPFS has determined that it must shutdown all operations 
on this node because of a problem. This is most likely caused by some interaction with the Group Services component. 
Group services failures may result in abnormal shutdown, as well as possible loss of quorum. 
Insufficient memory on the node to handle critical recovery situations can also cause this error. 
In general there will be other error log entries from GPFS or some other component associated with this error log entry.

MMFS_DISKFAIL
The MMFS_DISKFAIL error log entry indicates that GPFS has detected the failure of a disk and forced the disk 
to the stopped state. Unable to access disks describes the actions taken in response to this error. 
This is ordinarily not a GPFS error but a failure in the disk subsystem or the path to the disk subsystem. 
the book AIX 5L System Management Guide: Operating System and Devices and search on logical volume. 
Follow the problem determination and repair actions specified.

MMFS_ENVIRON
MMFS_ENVIRON error log entry records are associated with other records of the MMFS_GENERIC or MMFS_SYSTEM_UNMOUNT types. 
They indicate that the root cause of the error is external to GPFS and usually in the network that supports GPFS. 
Check the network and its physical connections. The data portion of this record supplies the return code provided 
by the communications code.

MMFS_FSSTRUCT
The MMFS_FSSTRUCT error log entry indicates that GPFS has detected a problem with the on-disk structure of 
the file system. The severity of these errors depends on the exact nature of the inconsistent data structure. 
If it is limited to a single file, EIO errors will be reported to the application and operation will continue. 
If the inconsistency affects vital metadata structures, operation will cease on this file system. 
These errors are often associated with an MMFS_SYSTEM_UNMOUNT error log entry and will probably occur on all nodes. 
If the error occurs on all nodes, some critical piece of the file system is inconsistent. This may occur as a 
result of a GPFS error or an error in the disk system. Issuing the mmfsck command may repair the error:

Issue the mmfsck -n command to collect data. 
Issue the mmfsck -y command off-line to repair the file system.
If the file system is not repaired after issuing the mmfsck command, contact the IBM Support Center.

MMFS_GENERIC
The MMFS_GENERIC error log entry means that GPFS self diagnostics have detected an internal error, or that 
additional information is being provided with an MMFS_SYSTEM_UNMOUNT report. If the record is associated with an 
MMFS_SYSTEM_UNMOUNT report, the event code fields in the records will be the same. The error code and return code 
fields may describe the error. See Messages for a listing of codes generated by GPFS.

If the error is generated by the self diagnostic routines, service personnel should interpret the return and error 
code fields since the use of these fields varies by the specific error. Errors caused by the self checking logic 
will result in the shutdown of GPFS on this node.

MMFS_GENERIC errors may result from an inability to reach a critical disk resource. These errors may look different 
depending on the specific disk resource that has become unavailable, like logs and allocation maps. 
This type of error will usually be associated with other error indications. Other errors generated by disk subsystems, 
high availability components, and communications components at the same time as, or immediately preceding, 
the GPFS error should be pursued first because they may be the cause of these errors. MMFS_GENERIC error indications 
without an associated error of those types represent a GPFS problem that requires the IBM Support Center. 
See Information to collect before contacting the IBM Support Center.

MMFS_LONGDISKIO
The MMFS_LONGDISKIO error log entry indicates that GPFS is experiencing very long response time for disk requests. 
This is a warning message and may indicate that your disk system is overloaded or that a failing disk is requiring 
many I/O retries. Follow your operating system's instructions for monitoring the performance of your I/O subsystem 
on this node. The data portion of this error record specifies the disk involved. 
There may be related error log entries from the disk subsystems that will pinpoint the actual cause of the problem. 
See the book AIX 5L Performance Management Guide.

MMFS_PHOENIX
MMFS_PHOENIX error log entries reflect a failure in GPFS interaction with Group Services. Go to the book 
Reliable Scalable Cluster Technology: Administration Guide. Search for diagnosing group services problems. 
Follow the problem determination and repair action specified. These errors are usually not GPFS problems, 
although they will disrupt GPFS operation.

MMFS_QUOTA
The MMFS_QUOTA error log entry is used when GPFS detects a problem in the handling of quota information. 
This entry is created when the quota manager has a problem reading or writing the quota file. If the quota manager 
cannot read all entries in the quota file when mounting a file system with quotas enabled, the quota manager 
shuts down, but file system manager initialization continues. Client mounts will not succeed and will return 
an appropriate error message.

In order for GPFS quota accounting to work properly, the system administrator should ensure that the user and group 
information is consistent throughout the nodeset, such as the /etc/passwd and /etc/group files are identical across 
the nodeset. Otherwise, unpredictable and erroneous quota accounting will occur.

It may be necessary to run an off-line quota check (mmcheckquota) to repair or recreate the quota file. 
If the quota file is corrupted, mmcheckquota will not restore it. The file must be restored from the backup copy. 
If there is no backup copy, an empty file may be set as the new quota file. This is equivalent to recreating 
the quota file. To set an empty file or use the backup file, issue the mmcheckquota command with the 
appropriate operand:

-u UserQuotaFilename for the user quota file 
-g GroupQuotaFilename for the group quota file
Reissue the mmcheckquota command to check the file system inode and space usage.

MMFS_SYSTEM_UNMOUNT
The MMFS_SYSTEM_UNMOUNT error log entry means that GPFS has discovered a condition which may result in 
data corruption if operation with this file system continues from this node. GPFS has marked the file system 
as disconnected and applications accessing files within the file system will receive ESTALE errors. 
This may be the result of:

The loss of a path to all disks containing a critical data structure. 
An internal processing error within the file system.
See File system forced unmount. Follow the problem determination and repair actions specified.

MMFS_SYSTEM_WARNING
The MMFS_SYSTEM_WARNING error log entry means that GPFS has detected a system level value approaching its 
maximum limit. This may occur as a result of the number of inodes (files) reaching its limit. Issue the mmchfs 
command to increase the number of inodes for the file system so there is at least a minimum of 5% free.

Error log entry example
This is an example of an error log entry which indicates loss of the Group Services subsystem:

LABEL:          MMFS_ABNORMAL_SHUTD
IDENTIFIER:     1FB9260D

Date/Time:       Thu May 16 14:39:07 EDT 
Sequence Number: 759
Machine Id:      000196364C00
Node Id:         k145n01
Class:           S
Type:            PERM
Resource Name:   mmfs            

Description
SOFTWARE PROGRAM ABNORMALLY TERMINATED

Probable Causes
SOFTWARE PROGRAM

Failure Causes
SOFTWARE PROGRAM

        Recommended Actions
        CONTACT APPROPRIATE SERVICE REPRESENTATIVE

Detail Data
COMPONENT ID
595B9500 
PROGRAM
mmfsd64 
DETECTING MODULE
/fs/mmfs/ts/phoenix/PhoenixInt.C
MAINTENANCE LEVEL
2.2.0.0 
LINE
        4409
RETURN CODE
         668
REASON CODE
0000 0000 
EVENT CODE
           0


===============================
9. HACMP
===============================


Section 9: HACMP


9.1: Overview Cluster solutions and terminology on AIX:
========================================================


-- CSM: (Management of Cluster)
-- ----------------------------

What is Cluster Systems Management (CSM)?
Cluster Systems Management (CSM) software provides a distributed system management solution that allows 
a system administrator to set up and maintain a cluster of nodes that run the AIX� or Linux� operating system. 
CSM simplifies cluster administration tasks by providing management from a single point-of-control. 
CSM can be used to manage homogeneous clusters of servers that run Linux, homogeneous servers that run AIX, 
or mixed clusters which include both AIX and Linux.

You can use the following hardware for your CSM management server, install server, and nodes:

IBM System x: System x, IBM xSeries�, IBM BladeCenter�*, and IBM eServer 325, |326, and 326m hardware |
IBM System p: System p, IBM pSeries, IBM BladeCenter*, System p5, IBM eServer OpenPower
*The BladeCenter JS models use the POWER architecture common to all System p servers.

The management server is the machine that is designated to operate, monitor, and maintain the rest of the cluster. 
Install servers are the machines that are used to install the nodes. By default, the management server 
is the install server. Managed nodes are instances of the operating system that you can manage in the cluster. 
Managed devices are the non-node devices for which CSM supports power control and remote console access. 
For hardware and software support information, see Planning for CSM software.

Communicating with CSM:
CSM offers you several options for issuing commands to the cluster:

-Command line interface 
-Distributed Command Execution Manager (DCEM) 
-IBM� Web-based System Manager 
-SMIT


-- GPFS:
-- -----

Introducing General Parallel File System

GPFS is a high-performance cluster file system for AIX 5L, Linux and mixed clusters that provides users 
with shared access to files spanning multiple disk drives. By dividing individual files into blocks 
and reading/writing these blocks in parallel across multiple disks, GPFS provides very high bandwidth; 
in fact, GPFS has won awards and set world records for performance. In addition, GPFS's multiple data paths 
can also eliminate single points of failure, making GPFS extremely reliable. GPFS currently powers many of 
the world�s largest scientific supercomputers and is increasingly used in commercial applications requiring 
high-speed access to large volumes of data such as digital media, engineering design, business intelligence, 
financial analysis and geographic information systems. GPFS is based on a shared disk model, providing lower 
overhead access to disks not directly attached to the application nodes, and using a distributed protocol 
to provide data coherence for access from any node. 

IBM's General Parallel File System (GPFS) provides file system services to parallel and serial applications. 
GPFS allows parallel applications simultaneous access to the same files, or different files, from any node 
which has the GPFS file system mounted while managing a high level of control over all file system operations. 
GPFS is particularly appropriate in an environment where the aggregate peak need for data bandwidth exceeds 
the capability of a distributed file system server.

GPFS allows users shared file access within a single GPFS cluster and across multiple GPFS clusters. 
A GPFS cluster consists of: 

AIX 5L� nodes, Linux� nodes, or a combination thereof (see GPFS cluster configurations). A node may be: 
An individual operating system image on a single computer within a cluster. 
A system partition containing an operating system. Some System p5� and pSeries� machines allow multiple 
system partitions, each of which is considered to be a node within the GPFS cluster.

Network shared disks (NSDs) created and maintained by the NSD component of GPFS 
All disks utilized by GPFS must first be given a globally accessible NSD name. 
The GPFS NSD component provides a method for cluster-wide disk naming and access. 

On Linux machines running GPFS, you may give an NSD name to: 
 Physical disks 
 Logical partitions of a disk 
 Representations of physical disks (such as LUNs)

On AIX� machines running GPFS, you may give an NSD name to: 
 Physical disks 
 Virtual shared disks 
 Representations of physical disks (such as LUNs)

A shared network for GPFS communications allowing a single network view of the configuration. 
A single network, a LAN or a switch, is used for GPFS communication, including the NSD communication.


-- PSSP: (predecessor to Cluster Systems Management (CSM))
-- -------------------------------------------------------

Parallel System Support Programs (PSSP)

The PSSP 3.5 software is a comprehensive suite of applications to manage a system as a full-function 
parallel processing system. It provides administrative tasks that help increase productivity by enabling 
administrators to view, monitor, and operate the system from the control workstation, a single point of control. 
The PSSP software is discussed in terms of functional entities called components of PSSP. Most functions 
are base components of PSSP while others are optional; they come with the PSSP software, but you can choose 
whether to install and use them.

With PSSP 3.5, AIX 5L 5.1 or 5.2 must be on the control workstation. Note that your control workstation 
must be at the highest AIX level in the system. If you have any HMC-controlled servers in your system, 
AIX 5L 5.1 or 5.2 must be on each HMC-controlled server node. Other nodes can have AIX 5L 5.1 and PSSP 3.4, 
or AIX 4.3.3 with PSSP 3.4 or PSSP 3.2. However, you can only run with the 64-bit AIX kernel and switch 
between 64-bit and 32-bit AIX kernel mode on nodes with PSSP 3.5.

Parallel System Support Programs (PSSP) for AIX�
PSSP is the systems management predecessor to Cluster Systems Management (CSM) and does not support 
IBM System p servers or AIX 5L V5.3. New cluster deployments should use CSM and existing PSSP customers 
with software maintenance will be transitioned to CSM at no charge. 


-- Tivoli Workload Scheduler LoadLeveler
-- -------------------------------------

Used for dynamic workload scheduling, Tivoli Workload Scheduler LoadLeveler is a distributed network-wide 
job management facility designed to dynamically schedule work such as maximize resource utilization 
and minimize job completion time. Jobs are scheduled based on job priority, job requirements, 
resource availability and user-defined rules to match processing needs with resources. LoadLeveler provides 
consolidated accounting and reporting and supports IBM servers including IBM System p and System x environments. 


-- Engineering Scientific Subroutine Library (ESSL) and Parallel ESSL 
-- ------------------------------------------------------------------

ESSL is a collection of state�of�the�art mathematical subroutines specifically tuned to IBM hardware 
and offering significant performance improvement to any math�intensive scientific or engineering applications. 
Parallel ESSL extends the function of ESSL to support parallel applications that use the Message Passing 
Interface included in IBM Parallel Environment. ESSL and Parallel ESSL support C, C++ and Fortran applications. 


-- Parallel Environment (PE)
-- -------------------------

Parallel Environment for AIX 5L is a comprehensive development and execution environment for parallel 
applications (distributed-memory, message-passing applications running across multiple nodes). 
It is designed to help organizations develop, test, debug, tune and run high-performance parallel 
applications in C, C++ and Fortran on IBM System p and System x clusters. Parallel Environment runs 
on AIX 5L V5.2 and V5.3.  

-- HACMP:
-- ------

HACMP is designed to provide high availability for critical business applications and data through 
system redundancy and failover. HACMP constantly monitors the status of servers, networks and applications 
to detect failures or performance degradation and can respond by automatically restarting a troubled 
application on designated backup hardware, taking care of all network or storage connections in the process. 
With HACMP, clients can scale up to 32 nodes and mix and match system sizes and performance levels as well 
as network adapters and disk subsystems to satisfy specific application, network and disk performance needs. 

HACMP/XD extends HACMP�s high availability capabilities across geographic sites with remote data 
mirroring (replication) and failover using this mirrored data; this combination can maintain application 
and data availability even if an entire site is disabled by a disaster. HACMP/XD provides IP-based data 
mirroring and also supports hardware-based mirroring products such as 
IBM Enterprise Storage Systems Metro-Mirror (formerly PPRC). 

-- RSCT:
-- -----

Reliable Scalable Cluster Technology. Since HACMP 5.1, HACMP relies on RSCT. So, in modern HACMP, RSCT is
a neccessary component or subsystem. For example, HACMP uses the heartbeat facility of RSCT.
RSCT is a standard component in AIX5L.

Reliable Scalable Cluster Technology, or RSCT, is a set of software components that together provide a 
comprehensive clustering environment for AIX� and Linux�. RSCT is the infrastructure used by a variety 
of IBM� products to provide clusters with improved system availability, scalability, and ease of use. 
RSCT includes the following components: 

- Resource Monitoring and Control (RMC) subsystem. This is the scalable, reliable backbone of RSCT. 
  It runs on a single machine or on each node (operating system image) of a cluster and provides a common 
  abstraction for the resources of the individual system or the cluster of nodes. You can use RMC for 
  single system monitoring or for monitoring nodes in a cluster. In a cluster, however, RMC provides global 
  access to subsystems and resources throughout the cluster, thus providing a single monitoring and management 
  infrastructure for clusters. 
- RSCT core resource managers. A resource manager is a software layer between a resource 
  (a hardware or software entity that provides services to some other component) and RMC. A resource manager 
  maps programmatic abstractions in RMC into the actual calls and commands of a resource. 
- RSCT cluster security services, which provide the security infrastructure that enables RSCT components 
  to authenticate the identity of other parties. 
- Topology Services subsystem, which, on some cluster configurations, provides node and network failure detection. 
  Group Services subsystem, which, on some cluster configurations, provides cross-node/process coordination.


RSCT is the �glue� that holds the nodes together in a cluster. It is a group of low-level components 
that allow clustering technologies, such as High-Availability Cluster Multiprocessing (HACMP) and 
General Parallel File System (GPFS), to be built easily. 

RSCT technology was originally developed by IBM for RS/6000 SP systems (Scalable POWERparallel). 
As time passed, it became apparent that these capabilities could be used on a growing number of general 
computing applications, so they were moved into components closer to the operating system (OS), such as 
Resource Monitoring and Control (RMC), Group Services, and Topology Services. 

The components were originally packaged as part of the RS/6000 SP Parallel System Support Program (PSSP) 
and called RSCT. RSCT is now packaged as part of AIX 5L Version 5.1 and later. 

RSCT is also included in Cluster Systems Management (CSM) for Linux. Now, Linux nodes (with appropriate 
hardware and software levels) running CSM 1.3 for Linux can be part of the management domain cluster 1600, 
and RSCT (with RMC) is the common interface for clustering. For more information about this heterogeneous 
cluster, see An Introduction to CSM 1.3 for AIX 5L, SG24-6859. 

RSCT includes these components: 

-Resource Monitoring and Control (RMC) 
-Resource managers (RM) 
-Cluster Security Services (CtSec) 
-Group Services 
-Topology Services

Group Services and Topology Services

Group Services and Topology Services, although included in RSCT, are not used in the management 
domain structure of CSM. These two components are used in peer domain clusters for applications, 
such as High-Availability Cluster Multiprocessing (HACMP) and General Parallel File System (GPFS), 
providing node and process coordination and node and network failure detection. Therefore, for these 
applications, a .rhosts file may be needed (for example, for HACMP configuration synchronization). 

These services are often referred to as hats and hags: 
high availability Group Services daemon (hagsd) 
and high availability Topology Services daemon (hatsd). 

- What are management domains and peer domains?
In order to understand how the various RSCT components are used in a cluster, you should be aware 
that nodes of a cluster can be configured for either manageability or high availability.

>> You configure a set of nodes for manageability using the Clusters Systems Management (CSM) product as 
described in IBM� Cluster Systems Management: Administration Guide. The set of nodes configured for manageability 
is called a management domain of your cluster.

>>You configure a set of nodes for high availability using RSCT's Configuration resource manager. 
The set of nodes configured for high availability is called an RSCT peer domain of your cluster. 
For more information, refer to Creating and administering an RSCT peer domain.


-- HPSS:	 
-- -----

High Performance Storage System
What is High Performance Storage System? HPSS is software that manages petabytes of data on disk and robotic tape 
libraries. HPSS provides highly flexible and scalable hierarchical storage management that keeps recently 
used data on disk and less recently used data on tape. HPSS uses cluster, LAN and/or SAN technology to aggregate 
the capacity and performance of many computers, disks, and tape drives into a single virtual file system 
of exceptional size and versatility. This approach enables HPSS to easily meet otherwise unachievable demands 
of total storage capacity, file sizes, data rates, and number of objects stored. HPSS provides a variety of user 
and filesystem interfaces ranging from the ubiquitous vfs, ftp, samba and nfs to higher performance pftp, 
client API, local file mover and third party SAN (SAN3P). HPSS also provides hierarchical storage management 
(HSM) services for IBM General Parallel File System (GPFS). 


-- C-SPOC:
-- -------

The Cluster Single Point of Control (C-SPOC) utility lets system administrators perform administrative tasks 
on all cluster nodes from any node in the cluster.


-- HA Network Server:
-- ------------------

The High Availability Network Server (HA Network Server) is a complete solution that quickly and automatically 
configures certain network services in a high availability environment. HA Network Server solution is designed 
to enhance the HACMP product by offering a set of scripts that set up highly available network services 
such as Domain Name System (DNS), Dynamic Host Configuration Protocol (DHCP), Network File System (NFS), 
and printing services. This is possible by using the framework offered in HACMP to monitor and act upon 
potential problems with network services in order to extend high availability beyond just hardware recovery. 
Making these services highly available means there is no down time in services that are critical to running 
a business. This solution is now available by download.

HA Network Server components
The HA Network Server solution is comprised of three network service plug-ins providing for DNS, DHCP, 
and print services (HACMP already contains integrated support for high availability NFS (HANFS)). 
Each of these plug-ins is available on this Web site as a downloadable tar file. These example scripts start 
and stop the network service processes, verify that configuration files are present and stored in a 
shared filesystem, and assist the HACMP monitoring functions that check on the health of the network service process. 
These scripts are provided as examples that may be customized for your environment.

A setup program is also provided with each of these plug-ins to assist with the setup after downloading the plug-in. 
Since several prerequisites must be completed by the user before setup begins, please read the README file that is 
included within the plug-in tar file. After download and tar file expansion, the README will be located in 
/usr/es/sbin/cluster/plug-ins/<network_service>, where <network_service> will be dns, dhcp, or printserver 
depending on which plug-in was downloaded.


9.2: Items in HACMP:
=====================


Application Servers:
-------------------- 
To put the application under HACMP control, you create an application server resource that associates 
a user-defined name with the names of specially written scripts to start and stop the application. 
By defining an application server, HACMP can start another instance of the application on the takeover node 
when a fallover occurs. This protects your application so that it does not become a single point of failure. 
An application server can also be monitored with the application monitoring feature and the Application 
Availability Analysis tool. 

After you define the application server, you can add it to a resource group. A resource group is a set of 
resources that you define so that the HACMP software can treat them as a single unit.


Application Monitoring:
----------------------- 
HACMP can monitor applications that are defined to application servers, in one of two ways: 

-Process monitoring detects the termination of a process, using RSCT Resource Monitoring and Control (RMC) capability. 
-Custom monitoring monitors the health of an application based on a monitor method that you define. 


Daemons:
--------

Cluster Services 
Notice that if you list the daemons in the AIX System Resource Controller (SRC), you will see ES appended 
to their names. The actual executables do not have the ES appended; the process table shows the executable 
by path (/usr/es/sbin/cluster...). 

The following lists the required and optional HACMP/ES daemons: 

- Cluster Manager daemon (clstrmgr):
This daemon monitors the status of the nodes and their interfaces, and invokes the appropriate scripts 
in response to node or network events. It also centralizes the storage of and publishes updated information 
about HACMP-defined resource groups. The Cluster Manager on each node coordinates information gathered from 
the HACMP global ODM, and other Cluster Managers in the cluster to maintain updated information about the content, 
location, and status of all HACMP resource groups. This information is updated and synchronized among all nodes 
whenever an event occurs that affects resource group configuration, status, or location.
All cluster nodes must run the clstrmgr daemon.

- Cluster SMUX Peer daemon (clsmuxpd):
This daemon maintains status information about cluster objects. This daemon works in conjunction with 
the Simple Network Management Protocol (snmpd) daemon. All cluster nodes must run the clsmuxpd daemon.
Note: The clsmuxpd daemon cannot be started unless the snmpd daemon is running.

- Cluster Information Program daemon (clinfo):
This daemon provides status information about the cluster to cluster nodes and clients and invokes 
the /usr/es/sbin/cluster/etc/clinfo.rc script in response to a cluster event. The clinfo daemon is optional 
on cluster nodes and clients.

- Cluster Lock Manager daemon (cllockd):
This daemon provides advisory locking services. The cllockd daemon is required on cluster nodes only if 
those nodes are part of a concurrent access configuration.

- Cluster Topology Services daemon (topsvcsd):
This daemon monitors the status of network adapters in the cluster. 
All cluster nodes must run the topsvcsd daemon.

- Cluster Event Management daemon (emsvcsd):
This daemon matches information about the state of system resources with information about resource conditions 
of interest to client programs (applications, subsystems, and other programs).The emsvcsd daemon runs on each node 
of a domain.

- Event Management AIX Operating System Resource Monitor (emaixos):
This daemon acts as a resource monitor for the event management subsystem and provides information about 
the operating system characteristics and utilization. The emaixos daemon is started automatically by Event Management

- Cluster Group Services daemon (grpsvcsd):
This daemon manages all of the distributed protocols required for cluster operation. 
All cluster nodes must run the grpsvcsd daemon.

- Cluster Globalized Server Daemon daemon (grpglsmd):
This daemon operates as a grpsvcs client; its function is to make switch adapter membership global across 
all cluster nodes. All cluster nodes must run the grpglsmd daemon. 

- Group Services Concurrent Logical Volume Manager (gsclvmd).
When extended concurrent Volume Groups are used, this process manages concurrent Volumes.


The AIX System Resource Controller (SRC) controls the HACMP/ES daemons (except for cllockd, which is a 
kernel extension). It provides a consistent interface for starting, stopping, and monitoring processes 
by grouping sets of related programs into subsystems and groups. In addition, it provides facilities for 
logging of abnormal terminations of subsystems or groups and for tracing of one or more subsystems. 
 

The HACMP/ES daemons are collected into the following SRC subsystems and groups: 

Daemon 				Subsystem	Group 
/usr/es/sbin/cluster/clstrmgr	clstrmgrES	cluster 
/usr/es/sbin/cluster/clinfo	clinfoES	cluster 
/usr/es/sbin/cluster/clsmuxpd	clsmuxpdES	cluster 
/usr/es/sbin/cluster/cllockd	cllockdES	lock 
/usr/sbin/rsct/bin/emsvcs	emsvcs		emsvcs 
/usr/sbin/rsct/bin/topsvcs	topsvcs		topsvcs 
/usr/sbin/rsct/bin/hagsglsmd	grpglsm		grpsvcs 
/usr/sbin/rsct/bin/emaixos	emsvcs		emsvcs 
/usr/es/sbin/cluster/clcomd	clcomdES	clcomd

When using the SRC commands, you can control the clstrmgr, clinfo, and clsmuxpd daemons by specifying 
the SRC cluster group. 

The required and optional HACMP and RSCT daemons are:

- clcomdES	Cluster communication daemon
- clstrmgrES	Cluster manager
- clinfoES	Cluster information daemon
- rmcd		RSCT resource Monitoring and Control daemon 
- hatsd		RSCT Topology Services subsystem (includes hats_nim* which send and receives heartbeats)
- hagsd		RSCT group services subsystem
- grpglsmd	main function is to make switch adapter membership global accross all cluster nodes.

Starting with hacmp 5.3, the cluster manager process is always running. It can be in one of two states,
as displayed by the command

# lssrc -ls clstrmgrES

ST_INIT (start event has executed)
ST_NOTCONFIGURED (start event has not executed)


Understanding Cluster Service Startup:
--------------------------------------
 
You start cluster services on a node by executing the HACMP/ES /usr/es/sbin/cluster/etc/rc.cluster script. 
Use the Start Cluster Services SMIT screen, described in the section Starting Cluster Services, 
to build and execute this command. The rc.cluster script initializes the environment required for HACMP/ES 
by setting environment variables and then calls the /usr/es/sbin/cluster/utilities/clstart script 
to start the HACMP/ES daemons. The clstart script is the HACMP/ES script that starts all the cluster services. 
The clstart script calls the SRC startsrc command to start the specified subsystem or group. 
The following figure illustrates the major commands and scripts called at cluster startup: 

rc.cluster -> clstart -> startsrc

The HACMP/ES daemons are started in the following order: 

-RSCT daemons (Group Services, Topology Services, then Event Management) 
-Cluster Manager 
-Cluster SMUX daemon 
-Cluster Information Program daemon (optional) 

Using the C-SPOC utility, you can start cluster services on any node (or on all nodes) in a cluster 
by executing the C-SPOC /usr/es/sbin/cluster/sbin/cl_rc.cluster command on a single cluster node. 
The C-SPOC cl_rc.cluster command calls the rc.cluster command to start cluster services on the nodes specified 
from the one node. The nodes are started in sequential order, not in parallel. The output of the command 
run on the remote node is returned to the originating node. Because the command is executed remotely, 
there can be a delay before the command output is returned. 

The following example shows the major commands and scripts executed on all cluster nodes when cluster 
services are started in clusters using the C-SPOC utility. 


        NODE A           NODE B  
        cl_rc.cluster
             |        \rsh
             |         \
           rc.cluster    rc.cluster 
             |             | 
             |             |
           clstart        clstart
             |             |
             |             |
           startsrc       startsrc


-- Automatically Restarting Cluster Services 
You can optionally have cluster services start whenever the system is rebooted. If you specify the -R flag 
to the rc.cluster command, or specify "restart or both" in the Start Cluster Services SMIT screen, 
the rc.cluster script adds the following line to the /etc/inittab file. 

hacmp:2:wait:/usr/es/sbin/cluster/etc/rc.cluster -boot> /dev/console 2>&1 
# Bring up Cluster 

At system boot, this entry causes AIX to execute the /usr/es/sbin/cluster/etc/rc.cluster script to start HACMP/ES. 

WARNING: Be aware that if the cluster services are set to restart automatically at boot time, you may face 
problems with node integration after a power failure and restoration, or you may want to test a node after 
doing maintenance work before having it rejoin the cluster. 

-- Starting Cluster Services with IP Address Takeover Enabled 
If IP address takeover is enabled, the /usr/es/sbin/cluster/etc/rc.cluster script calls the /etc/rc.net script 
to configure and start the TCP/IP interfaces and to set the required network options. 

-- Editing the rc.cluster File to Turn Deadman Switch Off 
In HACMP/ES, the Deadman Switch (DMS) is controlled by RSCT Topology Services. If, in a rare case, you want 
to turn the DMS off, you must edit the rc.cluster file as follows: 

There is a -D flag in clstart, located in /usr/es/sbin/cluster/utilities 
In the /usr/es/sbin/cluster/etc/rc.cluster file, find a call to "clstart" at about line #486. 
Edit this call to include the -D flag. 


Understanding Stopping Cluster Services:
----------------------------------------
 
You stop cluster services on a node by executing the HACMP/ES /usr/es/sbin/cluster/utilities/clstop script. 
Use the HACMP for AIX Stop Cluster Services SMIT screen, described in the section Stopping Cluster Services 
to build and execute this command. The clstop script stops an HACMP/ES daemon or daemons. The clstop script 
starts all the cluster services or individual cluster services by calling the SRC command stopsrc. 

The following figure illustrates the major commands and scripts called at cluster shutdown: 

clstop -> stopsrc

Using the C-SPOC utility, you can stop cluster services on a single node or on all nodes in a cluster 
by executing the C-SPOC /usr/es/sbin/cluster/sbin/cl_clstop command on a single node. The C-SPOC cl_clstop 
command performs some cluster-wide verification and then calls the clstop command to stop cluster services 
on the specified nodes. The nodes are stopped in sequential order, not in parallel. The output of the command 
run on the remote node is returned to the originating node. Because the command is executed remotely, 
there can be a delay before the command output is returned. 

        NODE A           NODE B  
        cl_clstop
             |       \rsh
             |        \
           clstop       clstop
             |             | 
             |             |
           stopsrc      stopsrc


Starting and stopping using smitty:

To start cluster services, use

smit cl_admin -> Manage HACMP Services -> Start Cluster Services

To stop cluster services, use

smit cl_admin -> Manage HACMP Services -> Stop Cluster Services


9.3: Most important commands in HACMP:
=======================================


9.4 Other notes on HACMP:
==========================


Filesets and compatibility list HACMP versions - AIX versions:

Note 1:
-------

HACMP Version Compatibility Matrix 

http://www-03.ibm.com/support/techdocs/atsmastr.nsf/WebIndex/TD101347

Document Author:  
Shawn Bodily

Document ID: 
TD101347 

Doc. Organization: 
Advanced Technical Support 
 
Document Revised: 
03/06/2007 

Product(s) covered: 
HACMP 
 

Abstract: This document provides a HACMP Version Compatibility Matrix. 


HACMP 	Version Supported? 	AIX Level(s) MISC 
1.2 	NO 			3.2.5   
2.1 	NO 			3.2.5   
3.1.0 	NO 			3.2.5   
3.1.1 	NO 			3.2.5  
4.1.0 	NO 			4.1.X   
4.1.1 	NO 			4.1.X  
4.2 	NO 			4.1.4, 4.2.X  
4.2.1 	NO 			4.1.5, 4.2.X  
4.2.2 	NO 			4.1.5, 4.2.1, 4.3.X  
4.3 	NO 			4.3.2, 4.3.3  
4.3.1 	NO 			4.3.2, 4.3.3  
4.4 	NO 			4.3.3  
4.4.1 	NO 			4.3.3, 5.1  
4.5 	NO 			5.1, 5.2  
5.1 	NO-09/01/2006 		5.1, 5.2,5.3  
5.2 	Y-9/30/2007 		5.1, 5.2,5.3  
5.3 	Y-9/30/2008 		5.2(ML4), 5.3(ML2) AIX 5.2 RSCT 2.3.6 or higher AIX 5.3 RSCT 2.4.2 or higher  
5.4 	Yes 			5.2 (TL8), 5.3(TL4) AIX 5.2 RSCT 2.3.9 or higher AIX 5.3 RSCT 2.4.5. or higher 
 
 
Cross Reference Chart 

		AIX 4.3.3 AIX 5.1 AIX 5.1(64-bit) AIX 5.2 AIX 5.3 
HACMP 4.5 	No Yes No Yes No 
HACMP/ES 4.5 	No Yes Yes Yes No 
HACMP/ES 5.1 	No Yes Yes Yes Yes 
HACMP/ES 5.2 	No Yes Yes Yes Yes 
HACMP/ES 5.3 	No No No Yes Yes 
HACMP/ES 5.4 	No No No Yes Yes 
 

Note 2:
-------

HACMP 5.1 requires:
- AIX 5L v5.1 ML5 with RSCT v2.2.1.30 or higher
- AIX 5L v5.2 ML2 with RSCT v2.3.1.0 or higher
- c-spoc vpath support requires SDD 1.3.1.3 or higher

HACMP 5.2:
AIX 
Each cluster node must have one of the following installed: 
AIX 5L v5.1 plus the most recent maintenance level (minimum ML 5) 
AIX 5L v5.2 plus the most recent maintenance level (minimum ML 2) 

HACMP 5.3 is supported on AIX 5.2 and 5.3
- AIX 5.2 ML06 or later with RSCT 2.3.6 or later
- AIX 5.3 ML02 or later with RSCT 2.4.2 or later


Note 3: HACMP FAQ:
------------------


I have installed HACMP, now what? 
 
Why does HACMP require so many subnets for IP address takeover? 
 
Does HACMP have any limits? 
 
How can I avoid the nameserver as a single point-of-failure? 
 
What is a config_too_long event? 
 
Do all cluster nodes need to be at the same version of HACMP and AIX 5L operating system? 
 
Why do I need a non-IP heartbeat network? 
 
Can I put different types of processors, communications adapters, or disk subsystems in the same cluster? 
 
What kinds of applications are best suited for a high availability environment? 
 
Can I use Etherchannel with HACMP? 
 
Can I use an existing Enhanced Concurrent Mode volume group for disk heartbeat? Or do I need to define a new one? 
 
 
Question: I have installed HACMP, now what?

Answer: Before HACMP can manage and keep your application highly available, you need to tell HACMP about 
your cluster and the application. There are 4 steps:

Step 1) Define the nodes that will keep your application highly available

The local node (the one where you are configuring HACMP) is assumed to be one of the cluster nodes 
and you must give HACMP the name of the other nodes that make up the cluster. Just enter a hostname or IP address 
for each node. 

Step 2) Define the application you want to keep highly available 
There are 3 things you need to tell HACMP about the application: 
name�provide a name 
start script�specify a script for HACMP to use to start the application 
stop script�specify a script for HACMP to use to stop the application 

Step 3) Verify and synchronize the cluster 
HACMP will discover all the networks and disks connected to the nodes. A verification step will ensure 
that the cluster configuration will be able to keep the application highly available. When successful the 
configuration will be copied to the rest of the nodes in the cluster. 

Step 4) Manage the application 
When you start HACMP it will begin managing the application and keeping it highly available. You can also use 
the maintenance facilities provided by HACMP to move the application between nodes for maintenance purposes. 

To see just how easy it is to configure HACMP, look for Using the SMIT Assistant in Chapter 11 of the 
Installation Guide. View the online documentation for HACMP. HACMP for Linux does not include the advanced 
discovery and verification features available on AIX 5L. When configuring HACMP for Linux you must manually 
define the cluster, networks and network interfaces. Any changes to the configuration require HACMP for Linux 
to be restarted on all nodes. 


Question: Why does HACMP require so many subnets for IP address takeover?

Answer: HACMP (using RSCT) determines adapter state by sending heartbeats across a specific network interface
�as long as heartbeat messages can be sent through an interface, the interface is considered alive. 
Prior to AIX 5L V5, AIX did not allow more than one interface to own a subnet route but in AIX 5L V5.1 multiple 
interfaces can have a route to the same subnet. This is sometimes referred to as multipath routing or 
route striping and when this situation exists, AIX 5L will multiplex outgoing packets destined for a particular 
subnet across all interfaces with a route to that subnet. This interferes with RSCT's ability to reliably 
send heartbeats to a specific interface. Therefore the subnetting rules for boot, service and persistent labels 
are such that there will never be a duplicate subnet route created by the placement of these addresses.

HACMP V5 includes a new feature whereby you may be able to avoid some of the subnet requirements 
by configuring HACMP to use a different set of IP alias addresses for heartbeat. With this feature you provide 
a base or starting address and HACMP calculates a set of addresses in proper subnets�when cluster services 
are active, HACMP adds these addresses as IP alias addresses to the interfaces and then uses these alias 
addresses exclusively for heartbeat traffic. You can then assign your "regular" boot, service and persistent 
labels in any subnet, but be careful: although this feature avoids multipath routing for heartbeat, 
multipath routing may adversely affect your application. Heartbeat via IP Aliasing is discussed in Chapter 2 
of the Concepts and Facilities Guide and Chapter 3 of the Administration and Troubleshooting Guide. 
View the online documentation for HACMP.


Question: Does HACMP have any limits?

Answer: The functional limits for HACMP (e.g. number of nodes and networks) can be found in Chapter 1 
of the Planning and Installation Guide. View the online documentation for HACMP.


Question: How can I avoid the nameserver as a single point-of-failure?

Answer: 1) Make the nodes look at /etc/hosts first before the nameserver by creating a 
/etc/netsvc.conf file with the following entry:

hosts=local,bind 

where local tells it to look at /etc/hosts first and then the nameserver

2) Remove /etc/resolv.conf (or modify name to save it for later use) so it looks for name resolution 
in /etc/hosts first.

For information on updating the /etc/hosts file and nameserver configuration, Installation Guide. 
View the online documentation for HACMP. 


Question: What is a config_too_long event?

Answer: The config_too_long event is an informational event run by HACMP whenever a cluster event runs 
for longer that a preset time. This can occur when:

an AIX 5L command (e.g. fsck) is taking a long time to complete, or has hung 
there was an un-recoverable error encountered � in this case there will be an "EVENT FAILED" indication 
in hacmp.out 

If the config_too_long event is run, you should check the hacmp.out file to determine the cause and if manual 
intervention is required. For more information on recovery after an event failure, refer to Recover from HACMP 
Script Failure in Chapter 18 of the Administration and Troubleshooting Guide. 


Question: Do all cluster nodes need to be at the same version of HACMP and AIX 5L operating system?

Answer: No, though there are some restrictions when running mixed mode clusters.

Mixed levels of AIX 5L on cluster nodes do not cause problems for HACMP as long as the level of AIX 5L 
is adequate to support the level of HACMP being run on that node. All cluster operations are supported 
in such an environment. The HACMP install and update packaging will enforce the minimum level of AIX 5L 
required on each system.

Similarly for Linux on POWER, different levels of the operating system should not cause problems as long as 
the minimum supported version is installed. Mixing different platforms�AIX 5L, RedHat and SUSE�within the 
same cluster is not supported.

As a matter of practicality, it is recommended that all nodes be at the same levels of operating system 
and HACMP whenever possible. Keeping, the operating system, HACMP and the application at the same level 
on all nodes will make the administration of the cluster easier and less error prone, and will go a long way 
towards reducing the frustration of the administrators. The Planning Guide has advice for effectively managing 
different installation and migration scenarios.


Question: Why do I need a non-IP heart beat network?

Answer: The purpose of the non-IP heartbeat link is often misunderstood. The requirement comes from the following: 
HACMP heartbeats on IP networks are sent as UDP datagrams. This means that if a node or network is congested, 
the heartbeats can be discarded. If there were only IP networks, and if this congestion went on long enough, 
the node would be seen as having failed and HACMP would initiate a takeover. Since the node is still alive, 
HACMP takeover can cause both nodes to have the same IP address, and can cause the nodes to both try to own 
and access the shared disks. This situation is sometimes referred to as "split brain" or "partitioned cluster". 
Data corruption is all but inevitable in this circumstance.

HACMP therefore strongly recommends that there be at least one non-IP network connecting a node to at least one 
other node. For clusters with more than two nodes, the most reliable configuration includes two non-IP networks 
on each node. The distance limitations on non-IP links�particularly RS-232�has often made this requirement 
difficult to meet. For such clusters, HACMP disk heartbeating should be strongly considered. Disk heartbeating 
enables the easy creation of multiple non-IP networks without requiring additional hardware or software.


Question: Can I put different types of processors, communications adapters, or disk subsystems in the same cluster?

Answer: In general, yes, as long as the individual components are supported by HACMP. Note that there are some 
combinations which may not be reasonable or desirable. For example, putting two Ethernet adapters that run at 
different speeds on the same network will generally force all adapters on the network to run at the speed of 
the slower one. Likewise, having a low powered processor back up a high-powered processor may result in 
unacceptable performance should HACMP have to run the application on the lower powered one. (But see the 
questions on dynamic LPAR and CUoD for a way of dealing with this). As long as AIX 5L and the hardware support 
the interconnections, HACMP will support them as well.


Question: What kinds of applications are best suited for a high availability environment?

Answer: HACMP detects failures in the cluster then moves or restarts resources in order to keep the application 
highly available. For an application to work well in a high availability environment, the application itself 
must be capable of being managed (start, stop, restart) programmatically (no user intervention required) and must 
have no "hard coded" dependencies on specific resources. For example, if the application relies on the hostname 
of the server (and cannot dynamically accept a change in hostname), then it is practically impossible to 
restart the application on a backup server after a failure.

Question: Can I use Etherchannel with HACMP?

Answer: See Using Etherchannel with HACMP.


Question: Can I use an existing Enhanced Concurrent Mode volume group for disk heartbeat? 
Or do I need to define a new one?

Answer: To achieve the highest levels of availability under the widest range of failure scenarios, the best practice 
would be to configure one disk heartbeat connection per physical disk enclosure (or LUN).

The heartbeat operation itself involves reading and writing messages from a non-data area of the shared disk. 
Although the space used for heartbeat messages does not decrease the space available for the application 
(it is in the reserved area of the disk) there is some overhead when the disk seeks back and forth between 
the reserved area and the application data area.

If you configure the disk heartbeat path using the same disk and vg as is used by the application, the best practice 
is to select a disk which does not have frequently accessed or performance critical application data: 
although the disk heartbeat overhead is small (2-4 seeks/sec), it could potentially impact application performance or,
conversely, excess application access could cause the disk hb connection to appear to go up and down.

Ultimately the decision of which disk and volume group to use for heartbeat depends on what makes sense for 
your shared disk environment and management procedures. For example, using a separate vg just for heartbeat 
isolates the heartbeat from the application data, but adds another volume group that has to be maintained 
(during upgrades, changes, etc) and consumes another LUN.

If you decide on a separate vg for heartbeat, it does not need to be included in an HACMP resource group, 
however, the CSPOC utilities use a resource group node list as the set of nodes to perform operations: 
including the vg in a resource group with just the (sub)set of nodes connected to the disk will let you take 
advantage of the CSPOC functions. You can also define and use a disk which is not part of any volume group, 
though such a setup would have to be manually configured and maintained.

   
Note 4: Cluster logfiles:
-------------------------

Cluster log files
HACMP for AIX scripts, daemons, and utilities write messages to the log files shown below.

HACMP log files Log file name Description 

/var/adm/cluster.log 	Contains time-stamped, formatted messages generated by HACMP for AIX scripts and daemons. 
			In this log file, there is one line written for the start of each event, and one line written 
			for the completion. 
/tmp/hacmp.out 		Contains time-stamped, formatted messages generated by the HACMP for AIX scripts. 
			In verbose mode, this log file contains a line-by-line record of each command executed 
			in the scripts, including the values of the arguments passed to the commands. By default, 
			the HACMP for AIX software writes verbose information to this log file; however, you can 
			change this default. Verbose mode is recommended. 
system error log 	Contains time-stamped, formatted messages from all AIX subsystems, including the HACMP 
			for AIX scripts and daemons. 

/usr/sbin/cluster/
history/cluster.mmdd 	Contains time-stamped, formatted messages generated by the HACMP for AIX scripts. 
			The system creates a new cluster history log file every day that has a cluster event 
			occurring. It identifies each day's file by the file name extension, where mm indicates 
			the month and dd indicates the day. 
/tmp/cm.log 		Contains time-stamped, formatted messages generated by HACMP for AIX clstrmgr activity. 
			Information in this file is used by IBM Support personnel when the clstrmgr is in debug mode. 
			Note that this file is overwritten every time cluster services are started; 
			so, you should be careful to make a copy of it before restarting cluster services on a 
			failed node. 
/tmp/cspoc.log 		Contains time-stamped, formatted messages generated by HACMP for AIX C-SPOC commands. 
			Because the C-SPOC utility lets you start or stop the cluster from a single cluster node, 
			the /tmp/cspoc.log is stored on the node that initiates a C-SPOC command. 
/tmp/dms_logs.out 	Stores log messages every time HACMP for AIX triggers the deadman switch. 
/tmp/emuhacmp.out 	Contains time-stamped, formatted messages generated by the HACMP for AIX Event Emulator. 
			The messages are collected from output files on each node of the cluster, and cataloged 
			together into the /tmp/emuhacmp.out log file. In verbose mode (recommended), this log file 
			contains a line-by-line record of every event emulated. Customized scripts within the event 
			are displayed, but commands within those scripts are not executed. 

/var/hacmp/clverify
/clverify.log		Contains messages when the cluster verification has run.


#############################################################################################
#############################################################################################
#############################################################################################


===============================================================
Section 10. Cisco IOS version 10.x, 11.x, 12.x router commands:
===============================================================


PART 1: Basic IOS commands:
=========================== 


1. Entering user mode, or privileged mode, or configuration mode:
-----------------------------------------------------------------

- user mode
-----------

When you access a router through console, aux, or remote terminal,
you first enter the router in "user exec mode" (user mode).
Here you can see all settings but you can not change anything.

login to IOS via console, aux, or via a terminal via network -> you enter 
user exec mode first.

- privileged mode
-----------------

Via the "enable" command you can enter "privileged mode"
whereby you can enter configuration mode and change settings of the router

router>enable
pasword: xxxx
router#

goiing back to user mode

router#disable
router>

logout
router>logout

- configuration mode
--------------------

When you are in privileged mode, you can enter the "configuration mode":

- change running config
router# configure terminal  (or just config t)
router(config)#

- change startup config in NVRAM
router# configure memory  (or just config mem)
router(config)#

so,

user mode -> via 'enable' -> privileged mode -> via 'config t' ->configuration mode

Getting out from configuration mode can be done with "exit" or "Ctrl-Z"
- exit brings you 'one level higher'
- Ctrl-Z gets you out configuration mode

  examples:

  -- first logon to router

password: xxxx
router>enable
password: yyyy
router#configure terminal
router(config)#enable password abcd
router(config)#enable secret abcd

router(config)#line console 0
router(config-line)#login
router(config-line)#password cisco

router(config-line)#line vty 0 4
router(config-line)#login
router(config-line)#password cisco

router(config)#service password-encryption
router(config)#no service password-encryption

router(config-line)#hostname critter
critter(config)#prompt emma
emma(config)#interface serial 1
emma(config-if)#exit
emma(config)#exit
emma#

router(config)#interface fastethernet0/0
router(config-if)#
  
router(config)#int f0/0.1
router(config-subif)#

router#config t
router(config)#router rip
router(config-router)#

clock: if the router must provide clocksignal

router(config)#interface serial 0
router(config-line)#clock rate 64000

banners: exec, incoming,login, motd

router(config)#banner motd #
... enter the banner text.... end with #   
  

Prompts:

ROMMON 1>                   Monitor mode
ROUTER>                     user mode
ROUTER#                     privileged mode
router(config)#             global configuration mode
router(config-if)#          interface configuration mode
router(config-subif)#       Sub-interface configuration mode
router(config-line)#        line configuration mode
router(config-router)#      router configuration mode
router(config-ipx-router)#  ipx router configuration mode


Router>enable
Router#config t
Enter configuration commands, one per line.  End with CNTL/Z.
Router(config)#exit
Router#exit  -- ends the session

2. Logging and debugging commands:
----------------------------------

IOS creates (syslog) messages and by default, sends them to the console.
But when you have a telnet session for example, no syslog messages are seen.

router>terminal monitor

means that this terminal is monitoring syslog messages

or

router>logging buffered

means let the router buffer the messages

router>show logging

is the command to display the messages to your terminal session


3. Memory types and configuration types in Cisco routers:
---------------------------------------------------------

When the router boots, it loads it's IOS from FLASH memory, which is 
some sort of PCMCIA card or EEPROM.
The configuration of the router (address lists, ip addresses on interfaces etc..)
is stored as the "startup configuration" in NVRAM which will be loaded into RAM
as the "working configuration".

RAM:	working memory, with loaded IOS from FLASH, 
        and running configuration initally loaded from NVRAM
ROM:	basic IOS software, should not be used normally
FLASH:	IOS software                                    (=rewriteable permanent memory)
NVRAM:	contains startup, and saved, configuration      (=Non Volatile RAM)

You can display the "startup configuration" in NVRAM, 
and the "running configuration" in RAM with the following commands:

router#show running-config

router#show startup-config


4. copy of configuration files:
-------------------------------

You can copy the running configuration to the startup configuration,
and the other way around. 
You can also store the configuration to an ascii file via TFTP

router#copy running-config startup-config
router#copy startup-config running-config
router#copy tftp startup-config
router#copy startup-config tftp

erase the startup-config:
router#erase startup-config

If you have an new IOS and want to load it into the router:

router#copy tftp flash

And you must reload or reboot the router.


5. BOOT procedure router:
-------------------------

1. power on self test
2. router loads bootstrap code from ROM
3. router finds IOS from flash and loads it
4. router finds startup configuration file and loads it as running configuration


If no configuration is found in NVRAM, the router goes
to setup mode
Here will be asked to go choose from basic or extended
setup mode

The "config register" command:

You can change the normal sequence by setting the "configuration register"
to some other value. This register is a 16 bit register in the router which
can be set by the "config register" command.

The bootfield of the register are the first 4 bits.
If the bootfield in hex is
- 0: 2100 - load ROMMON; is used for lowlevel debugging or password recovery
- 1: 2101 - RXBOOT; is used to load the limited function IOS from ROM
- 2: 2102 - load normal IOS

example: 

config-register 0x2101

bit 6 can be used to ignore the NVRAM, for recovering password
put the config-register at 0x2141

6. CDP protocol:
----------------

CDP is enabled by default.

S#no cdp run    -- global command, disabling cdp
S#cdp run       -- enabling cdp

S#(config-if)#no cdp enable -- disabling cdp for this interface
S#(config-if)#cdp enable    -- enabling cdp for this interface

S#show cdp neighbour
S#show cdp neighbour detail
S#show cdp entry yosemite
S#show cdp entry yosemite protocol
S#show cdp interface
S#show cdp traffic 


7. Configuration interfaces example:
------------------------------------

hostname Gorno
enable password cisco

interface Serial0
ip address 134.141.12.1 255.255.255.0

interface Serial1
ip address 134.141.13.1 255.255.255.0

interface Ethernet0
ip address 134.141.1.1 255.255.255.0

-- to enable rip (classfull)
RouterA(config)#router rip 
RouterA(config-router)network 134.141.0.0

-- to disable rip
no router rip

-- to disable rip on 1 interface
RouterA(config)#router rip
RouterA(config-router)#passive-interface serial 0

- Add a route:

ip route network-number network-mask ip-address
ip name-server server-address1 serveraddress-2...
ip domain-lookup

ip route 10.1.2.0 255.255.255.0 10.1.128.252


ip address 10.1.7.252 255.255.255.0 seconday
ip address 10.1.2.252 255.255.255.0 

default route example:
----------------------

R1(config)# ip route 0.0.0.0 0.0.0.0 168.13.1.101


PART 2. NETWORK CONFIGURATIONS:
===============================

8. IP/IPX configuration on point-to-point
------------------------------------------

8.1 IP configuration on point-to-point serial links:
----------------------------------------------------

LAPB, HDLC, and PPP are used for
a single point-to-point serial link. See section 10.


        -----
         |
         A
        / \
       Y---S   
       |   |
      ---  ---

Albequerque#
A#configure terminal
A(config)#   interface serial 0
A(config-if)#ip address 10.1.128.251 255.255.255.0
A(config)#   interface serial 1
A(config-if)#ip address 10.1.130.251 255.255.255.0
A(config)#   interface ethernet 0
A(config-if)#ip address 10.1.1.251 255.255.255.0
A#show running-config

A#show ip route

10.0.0.0/24 is subnetted, 3 subnets
C    10.1.1.0 is directly connected, Ethernet0
C    10.1.130.0 is directly connected, Serial1 
C    10.1.128.0 is directly connected, Serial0

A#terminal ip netmask-format decimal  -- used to go from /24 notation
                                      -- to 255.255.255.0
A#show ip route

Yosemite#
Y#show ip interface brief

Interface   IP-Address   OK?  Method   Status    Protocol
Serial0     10.1.128.252 YES  Manual   up        up
Serial1     10.1.129.252 YES  Manual   up        up
Ethernet0   10.1.2.252   YES  Manual   up        up

Seville#
S#show ip route
S#show ip interface serial 1
S#show ip interface serial 0
S#show ip arp
S#debug ip packet
IP packet debugging is on
S#ping 10.1.130.251

Add static routes:
A#ip route 10.1.2.0 255.255.255.0 10.1.128.252
A#ip route 10.1.3.0 255.255.255.0 10.1.130.253

A#show ip route

10.0.0.0/24 is subnetted, 5 subnets
S    10.1.3.0 [1/0] via 10.1.130.253
S    10.1.2.0 [1/0] via 10.1.128.252
C    10.1.1.0 is directly connected, Ethernet0
C 10.1.130.0 is directly connected, Serial1 
C 10.1.128.0 is directly connected, Serial0

Set a default route:
R1(config)#ip route 0.0.0.0 0.0.0.0 10.1.17.251

If you use a default route, you should use the command
router(config)#ip classless


8.2 IPX configuration on point-to-point serial links:
-----------------------------------------------------

       -----
         |
         A
        / \
       Y---S   
       |   |
      ---  ---

=Router Alburquerque:

ipx routing 0200.aaaa.aaaa (mac address lan)

interface serial0
ip address 10.1.12.1 255.255.255.0
ipx network 1012
bandwith 56

interface serial1
ip address 10.1.13.1 255.255.255.0
ipx network 1013

interface ethernet 0
ip address 10.1.1.1 255.255.255.0
ipx network 1

=Router Yosemite:

ipx routing 0200.bbbb.bbbb

interface serial0
ip address 10.1.12.2 255.255.255.0
ipx network 1012
bandwith 56

interface serial1
ip address 10.1.23.1 255.255.255.0
ipx network 1023

interface ethernet 0
ip address 10.1.2.2 255.255.255.0
ipx network 2

------------------------

A#show interface serial 0
A#show interface Ethernet0   
A#sh int e0
A#show ipx interface serial0
A#show ip interface serial 0
A#show ip interface brief
A#show ip route
A#show ipx route

A#show ipx servers
A#debug ipx routing activity  (IPXRIP activity)
A#debug ipx routing events    (IPXRIP events)
A#debug ipx sap activity      (IPXSAP activity)

A#undebug all
A#no debug all


9. Configuring RIP and IGRP:
----------------------------

Each network command enables RIP or IGRP on a
set of interfaces.

RIP:

interface ethernet 0
ip address 10.1.2.3 255.255.255.0
interface ethernet 1
ip address 172.16.1.1 255.255.255.0
interface tokenring 0
ip address 10.1.3.3 255.255.255.0
interface serial 0
ip address 199.1.1.1 255.255.255.0
interface serial 1
ip address 199.1.2.1 255.255.255.0

R1#configure terminal
R1(config)#router rip
R1(config-router)#network 199.1.1.0
R1(config-router)#network 10.0.0.0

-- Ethernet0, Tokenring0, Serial0 have rip enabled

IGRP:

R1#configure terminal
R1(config)#router igrp 1  -- autonomous system id
R1(config-router)#network 199.1.1.0
R1(config-router)#network 10.0.0.0
R1(config-router)#network 199.1.2.0
R1(config-router)#network 172.16.0.0

-- all interfaces have now igrp enabled

EIGRP:

router eigrp (autonomous system id)
network command

for example

router eigrp 10
network 10.0.0.0
network 172.16.0.0

DEBUGGING:

R1#debug ip rip
R1#debug ip igrp transactions
R1#debug ip igrp events
R1#no debug all
R1#undebug all

DISABLE RIP:

R1(config)#no router rip


10. Serial links:
-----------------

LAPB, HDLC, and PPP are used for
a single point-to-point serial link.

     Error detection  Protocol type field
SDLC Yes              None
LAPB Yes              None
LAPD No               None
HDLC Yes              None / Yes Cisco proprierty
PPP  Yes              Yes


-- encapsulation hdlc | ppp | lapb
   hdlc is default

R1(config)#interface serial 0
R1(config-if)encapsulation ppp

R1(config)#interface serial 0
R1(config-if)encapsulation hdlc

-- compress predictor | stac | mppc

R1(config)#interface serial 0
R!(config-if)ip address 10.1.11.253 255.255.255.0
R1(config-if)encapsulation ppp
R1(config-if)compress stac

R1#show compress
R1#show process

- ppp: LCP control protocols like IPCP, LQM, looped link detection, Authentication
      compression, mulitlink support
- ppp, lapb, hdlc all support compression
       ppp : stac, predictor, mppc
       lapb: stac, predictor
       hdlc: stac

- synchronous serial interface 60 pin D
   V.35, X.21, EIA/TIA-232, EIA/TIA-449, EIA/TIA-530
     
11. Frame Relay:
----------------

key terms:
DTE, DCE, VC, DLCI, LMI, DE, FECN, BECN, LAPF, ITU Q.9xx/ANSI T1.6xx

encapsulation frame-relay  (ietf|cisco)
frame-relay lmi-type       (cisco|ansi|q933a)
frame-relay map            (ip nr - dlci nr)
frame-relay interface-dlci (dlci-nr)
bandwith num
keepalive sec

show ip route
show frame-relay pvc
show frame-relay map
show frame-relay lmi
show interfaces
show interface s0
debug frame-relay lmi

11.1 One IP subnet/IPX network:
-------------------------------

            -----
             |
             A dlci 51 199.1.1.1
            / \
dlci 52    B---C dlci 53 199.1.1.3   
199.1.1.2  |   |
          ---  ---


example 1: lmi automatical, cisco instead ietf etc..

Router A: 

ipx routing 0200.aaaa.aaaa
interface serial 0
encapsulation frame-relay
ip address 199.1.1.1 255.255.255.0
ipx network 199

interface ethernet 0
ip address 199.1.10.1 255.255.255.0
ipx network 1

router igrp 1
network 199.1.1.0
network 199.1.10.0

Similar for routers B and C....

example 2: lmi is ansi:

Router A:

ipx routing 0200.aaaa.aaaa
interface serial 0
encapsulation frame-relay
frame-relay lmi-type ansi
ip address 199.1.1.1 255.255.255.0
ipx network 199
...
Mayberry#show ip route
Mayberry#show frame-relay pvc
Mayberry#show frame-relay map
...

DLCI - IP mapping is here automatically done by Inverse ARP

example 3: same network, no Inverse ARP

Now we must make mappings

Router A:

interface serial 0
frame-relay map ip 199.1.1.2 52 broadcast
frame-relay map ipx 199.0200.bbbb.bbbb 52 broadcast
frame-relay map ip 199.1.1.3 53 broadcast
frame-relay map ipx 199.0200.cccc.cccc 53 broadcast

similar for routers B and C


11.2 One IP subnet/IPX network per VC:
--------------------------------------

            -----
             |
             A dlci 51 
  140.1.1.0=/ \=140.1.2.0
dlci 52    B   C dlci 53   
           |   |
          ---  ---


Router A:
A(config)#ipx routing 0200.aaaa.aaaa
A(config)#interface serial 0
A(config-if)#encapsulation frame-relay

A(config-if)#interface serial 0.1 point-to-point
A(config-subif)#ip address 140.1.1.1 255.255.255.0
A(config-subif)#ipx network 1
A(config-subif)#frame-relay interface-dlci 52

A(config-fr-dlci)#interface serial 0.2 point-to-point
A(config-subif)#ip address 140.1.2.1 255.255.255.0
A(config-subif)#ipx network 2
A(config-subif)#frame-relay interface-dlci 53

A(config-fr-dlci)#interface ethernet 0
A(config-if)#ip address 140.1.11.1 255.255.255.0
A(config-if)#ipx network 11

Router B:
B(config)#ipx routing 0200.bbbb.bbbb
B(config)#interface serial 0
B(config-if)#encapsulation frame-relay

B(config-if)#interface serial 0.1 point-to-point
B(config-subif)#ip address 140.1.1.2 255.255.255.0
B(config-subif)#ipx network 1
B(config-if)#frame-relay interface-dlci 51

interface ethernet 0
ip address 140.1.12.2 255.255.255.0
ipx network 13


The 'ipx routing' command enables SAP and RIP.
The 'ipx network' command per interface allows 
to use SAP and RIP on that interface.


11.3 Different frametypes with IPX:
----------------------------------

Novell:         Cisco:
Ethernet_II     ARPA
Ethernet_802.3  Novell-ether   this is the default
Ethernet_802.2  SAP
Ethernet_SNAP   SNAP

Suppose on the Ethernet of Router B, 2 frametypes are used:
Ethernet_802.3 and Ethernet_802.2

Router B:
B(config)#ipx routing 0200.bbbb.bbbb
B(config)#interface serial 0
B(config-if)#encapsulation frame-relay

B(config-if)#interface serial 0.1 point-to-point
B(config-subif)#ip address 140.1.1.2 255.255.255.0
B(config-subif)#ipx network 1
B(config-if)#frame-relay interface-dlci 51

interface ethernet 0
ip address 140.1.12.2 255.255.255.0
ipx network 13 encapsulation novell-ether
ipx network 23 encapsulation sap secondary
 
or use

interface ethernet 0.1
ipx network 13 encapsulation novell-ether

interface ethernet 0.2
ipx network 23 encapsulation 23


12. Access lists:
=================


ip packet -> inbound ACL ->ROUTING -> outbound ACL ->

- packets can be filtered as they enter an interface, before routing decision
- packets can be filtered before they exit an interface, after routing decision

12.1 Standard IP access list:
-----------------------------

Logic:

1. compare matching of the first access-list statement to packet
2. If a match is made, perform permit or deny
3. Or, repeat matching next sequential access-list statements
4. no match, perform deny

The standard access list only use the source ip address, or part of the address,
to filter traffic.

commands:

ip access-group 'number' : to bind to an interface
access-list 'number'     : define the access-list
access-class

show access-list              : shows all access lists
show ip access-list           : shows the ip access lists
show ipx access-list          : shows the ipx access lists

show ip interface             : shows all acl's and interfaces
show ipx interface            : shows all acl's and interfaces
show ip interface ethernet 0  :show all acl's attached to this interface
show ipx interface ethernet 0 :show all acl's attached to this interface

access-list 'number', where number is 1-100

Wildcards in access-list commands:

0.0.0.0         = complete match ip address
0.0.0.255       = match the first 24 bits
0.0.255.255     = match the first 16 bits
0.255.255.255   = match the first 8 bits
255.255.255.255 = always a match

example 1:
----------

RouterA(config)#interface Ethernet0
RouterA(config-if)#ip address 172.16.1.1 255.255.255.0
RouterA(config-if)ip access-group 1 out

RouterA(config)#access-list 1 deny 172.16.3.10 0.0.0.0
RouterA(config)#access-list 1 permit 0.0.0.0 255.255.255.255

or the modern equivalent:

interface Ethernet0
ip address 172.16.1.1 255.255.255.0
ip access-group 1 

access-list 1 deny host 172.16.3.10
access-list 1 permit any

example 2:
----------

             ----- 10.1.1.0
              |
              A                   s0 A s1
  10.1.128.0=/ \=10.1.130.0       s0/ \s0
            B---C                  B---C
            |129|                  s1  s1
10.1.2.0   ---  --- 10.1.3.0
           x=    
      10.1.2.1

Suppose: 
- x not allowed access to 10.1.1.0
- all hosts on 10.1.3.0 not allowed access to 10.1.2.0
- all other combinations are allowed

On Router B:

interface serial 0
ip access-group 1

access-list 1 deny host 10.1.2.1
access-list 1 permit any

On Router C:

interface serial 1
ip access-group 1

access-list 1 deny 10.1.3.0 0.0.0.255
access-list 1 permit any


12.2 Extended IP access list:
-----------------------------

- access-list "number" where number must be in 100-199
- here you can match on ports, protocols, and other fields in
  the ip and tcp/udp headers

General syntax:

ip access-group 'number' : to bind to an interface
access-list 'number'     : define the access-list
access-list number deny|permit protocol source destination

RouterA(config)#access-list 101 deny tcp any host 10.1.1.1 eq 23
RouterA(config)#access-list 101 deny tcp any host 10.1.1.1 eq telnet
RouterA(config)#access-list 101 deny udp 1.0.0.0 0.255.255.255 lt 1023 any
RouterA(config)#access-list 101 deny upd 1.0.0.0 0.255.255.255 lt 1023 44.1.2.3 0.0.255.255
RouterA(config)#access-list 101 deny ip 33.1.2.0 0.0.0.255 44.1.2.3 0.0.255.255
RouterA(config)#access-list 101 deny icmp 33.1.2.0 0.0.0.255 44.1.2.3 0.0.255.255 echo
RouterA(config)#access-list 101 deny tcp any host 172.16.30.2 eq 23 log

RouterA(config)#access-list 128 deny tcp any 10.55.66.0 0.0.0.255 eq 23

You should follow this with
RouterA(config)#access-list 101 permit ip any any

12.3 Named IP access list:
--------------------------

numbered: access-list 1-99 permit|deny
named:    ip access-list standard 'name' permit|deny

numbered: access-list 100-199 permit|deny
named:    ip access-list extended 'name' permit|deny

numbered: ip access-group 1-99 in|out
named:    ip access-group 'name' in|out

numbered: ip access-group 100-199 in|out
named:    ip access-group 'name' in|out


Using access-list with vty telnet access

config t
line vty 0 4
login
password cisco
access-class 3 in

access-list 3 permit 10.1.1.0 0.0.0.255


12.4 IPX standard and extend access lists:
------------------------------------------

Similar to IP access lists IPX has two types of access lists: 
Standard IPX Access Lists and Extended IPX Access lists.

Standard:
---------

Standard IPX access lists allow or deny packets based on 
source and destination IPX addresses. 
Template to enter standard IPX access lists is as follows:

Access-list (number from 800 to 899) (permit or deny) 
            (source network IPX number) (destination network IPX number)

Following example will show how the access list will permit or deny access to IPX packets.

Router#config t
Router(config)#access-list 810 permit 30 10
Router(config)#int e0
Router(config-if)#ipx access-group 810 out

Router#config t
Router(config)#access-list 810 deny 50 10
Router(config)#int e0
Router(config-if)#ipx access-group 810 out

Extended:
---------

Extended IPX access lists can filter based on the following: 
Source network, source node, destination network, destination node, 
IPX protocol (SAP, SPX etc) and IPX sockets.

Template to enter the extended IPX access list is as follows:

access-list (number, 900 to 999) permit or deny (protocol) 
            (source IPX network number) (source socket) 
            (destination IPX network number) (destination socket)

Following example will show how the extended access list will permit or deny 
IPX network access using extended access lists

Router(config)#access-list 910 deny �1 50 0 10 0

This means that the access is denied to any IPX protocol type from 
IPX network 50 on all sockets 
to enter IPX network 10 on all sockets.

Access lists:
-------------

ipx access-group 'number'|'name' in|out : to bind to an interface
ipx input-sap-filter number             : to bind a sap filter to an interface
ipx output-sap-filter number            : to bind a sap filter to an interface

access-list 800-899 permit|deny   : numbered IPX standard
access-list 900-999 permit|deny   : numbered IPX extended
access-list 1000-1099 permit|deny : numbered SAP access-list

ipx access-list standard|extended|sap 'name': named access-list

Example 1:
----------
           102
           ----- 
          eth1|  eth0
              R2---|101
             /1001 |
            /s0
        |--R1
        |   \s1
        |    \1002
       200    R3---|
               eth0|302


At R1:
ipx routing 0200.1111.1111

interface serial 0
ip address 10.1.1.1 255.255.255.0
ipx network 1001
ipx access-group 820 in

interface serial 1
ip address 10.1.2.1 255.255.255.0
ipx network 1002

interface ethernet 0
ip address 10.1.200.1 255.255.255.0
ipx network 200
ipx access-group 810

access-list 820 deny 101
access-list 820 permit -1

access-list 810 permit 302

Example 2: network wildcard mask
--------------------------------

interface serial0
ip address 10.1.1.2 255.255.255.0
ipx network 200
ipx access-group 910

access-list 910 deny any 1000 0000000F
access-list 910 permit any -1


13. Cisco switch configuration:
===============================


Cisco switch IOS is a bit different compared to the regular router IOS,
ofcourse due to the different functions.
But for most configuration syntax, they are pretty much alike.
Sometimes, a port is called 'interface', but it's really a port.

A crossover utp cable must be used to connect a switch to another
switch or hub:

pin 1 - pin 3
pin 2 - pin 6

example Catalyst 1912 with 12 10BaseT ports: e0/1 - e0/12
2 fastethernet ports fa0/26, fa0/27

s#show running_config
s#show spantree
s#show vlan_membership
s#show vlan
s#show vlan 3
s#show ip
s#show interfaces
s#show mac-address-table
s#show mac-address-table security
s#show version

s#ip address  (for inband management, global for switch)
s#ip default-gateway
s#mac-address-table permanent mac-address port
s#mac-address-table restricted static port src-list
s#port secure max-mac-count number

s#copy nvram tftp://
S#copy tftp:// nvram
s#address-violation (suspend|ignore|disable)
s#no address-violation

  s#delete nvram
  note that with a router, it is
  R1#erase startup-config

nvram is automatically updated when running-config is changed, so
there is no 'copy run start' command

sample session: to configure a port
-----------------------------------

s#config terminal
s(config)#ip address 10.5.5.11 255.255.255.0
s(config)#ip default-gateway 10.5.5.3
s(config)#interface e0/1
s(config-if)#duplex half (full, auto, half, full-flow-control)
s(config-if)#end
s#

sample session: to configure restrictions
-----------------------------------------

In this example, a server is always on port e0/3 (permanent) and
another server is on port e0/4 and only devices on port e0/1 are
allowed to send frames to it.

s(config)#mac-address-table permanent 0200.2222.2222 e0/3
s(config)#mac-address-table restricted static 0200.1111.1111 e0/4 e0/1
s(config)#end
s#show mac-address-table

sample session: port security
-----------------------------

Port security limits the number of mac addresses associated with a port
in the mac address table. Port security can be used to restrict port e0/1
so that only 3 mac addresses can source frames that enter port e0/1

s(config)#mac-address-table permanent 0200.2222.2222 e0/3
s(config)#mac-address-table restricted static 0200.1111.1111 e0/4 e0/1
s(config)#interface ethernet 0/1
s(config-if)#port secure max-mac-count 3
s(config-if)#end
s#show mac-address-table security

VLAN:
-----

A switch creates 1 broadcast domain, but every port is
its own collision domain.
This is an implicit VLAN 1.

VLAN's:
- can create n broadcast domains = n VLAN's = n layer 3 groupings
- routing is needed between VLAN's
- switch let devices in 1 VLAN communicate,
  but do not forward a frame entering 1 vlan to go to different vlan 
- seperate address table for each VLAN
- interswitch communication between members of the same vlan is done via
  tagging the frame with an 26 byte ISL header = trunking
- trunking with ISL = Cisco, alternative is IEEE 802.1Q

Trunking is used between 2 switches, but also between a switch and arouter,
if the router supports 'ISL' routing. Then tagged frames can go to and from the router.
The router is connected with a trunk link to the sdwitch.

How does the router use this. It sees the vlan-id and layer 3 address in the frame.
And the router should be configured as in this example:

#interface fastethernet 0.1
#ip address 10.1.1.1 255.255.255.0
#encapsulation isl 1

#interface fastethernet 0.2
#ip address 10.1.2.1 255.255.255.0
#encapsulation isl 2

But you can also have multiple router intefaces connect to
multiple normal accesslinks on the switches which are in the corresponding VLANS.

sample session: creating VLANS
------------------------------

s(config)#vlan 2 name VLAN2
s(config)#vlan 3 name VLAN3
s(config)#interface e 05
s(config-if)#vlan-membership static 2
s(config-if)#interface e 0/6
s(config-if)#vlan-membership static 2
s(config-if)#interface e 0/7
s(config-if)#vlan-membership static 2
..
..
s#show vlan 2
..

To let a VLAN span multiple switches, connect them
via fast ethernet ports, and put 'trunking' on.

s1(config)#vlan 2 name VLAN2
s1(config)#vlan 3 name VLAN3
s1(config)#interface e 05
s1(config-if)#vlan-membership static 2
s1(config-if)#interface e 0/6
s1(config-if)#vlan-membership static 2
s1(config-if)#interface e 0/7
s1(config-if)#vlan-membership static 2
s1(config-if)#interface e 0/8
s1(config-if)#vlan-membership static 3
s1(config-if)#interface e 0/9
s1(config-if)#vlan-membership static 3
s1(config-if)#interface fa 0/26
s1(config-if)#trunk on 
s1(config-if)#vlan-membership static 1
s1(config-if)#vlan-membership static 2
s1(config-if)#vlan-membership static 3

s1#show trunk a | b

VTP:
----

VLAN trunking protocol:
- 1 Domain, 1 VTP Server with VTP clients.
- configure VTP Server and clients:

s1(config)#vtp server domain abc pruning enable
s2(config)#vtp client  
s1#show vtp


14. Some Examples:
==================

Example 1:
----------

Starboss# show running-config

Current configuration:
!
version 12.0
service timestamps debug uptime
service timestamps log uptime
no service password-encryption
!
hostname Starboss
!
enable password cwc
!
!
!
!
!
memory-size iomem 15
ip subnet-zero
!
frame-relay switching
isdn switch-type basic-net3
!
!
process-max-time 200
!
interface FastEthernet0/0
 description Starboss RUK LAN
 ip address 172.17.35.70 255.255.255.0
 no ip directed-broadcast
 ip accounting output-packets
 speed 100
 full-duplex
!
interface Serial0/0
 bandwidth 128
 no ip address
 no ip directed-broadcast
 encapsulation frame-relay IETF
 no ip mroute-cache
 priority-group 1
 cdp enable
!
interface Serial0/0.1 point-to-point
 description 32k PVC to Titan ref:NXPC203765
 bandwidth 32
 ip address 10.10.35.2 255.255.255.0
 no ip directed-broadcast
 no arp frame-relay
 frame-relay interface-dlci 100
!
interface BRI0/0
 no ip address
 no ip directed-broadcast
 encapsulation ppp
 shutdown
 dialer map ip 172.17.34.1 02082614099
 dialer-group 1
 isdn switch-type basic-net3
!
interface Ethernet1/0
 description Starboss RPL LAN
 ip address 172.29.31.30 255.255.255.0
 no ip directed-broadcast
!
ip classless
ip route 0.0.0.0 0.0.0.0 Serial0/0.1
no ip http server
!
priority-list 1 protocol ip high tcp telnet
 description Starboss RPL LAN
 ip address 172.29.31.30 255.255.255.0
 no ip directed-broadcast
!
ip classless
ip route 0.0.0.0 0.0.0.0 Serial0/0.1
no ip http server
!
priority-list 1 protocol ip high tcp telnet
dialer-list 1 protocol ip permit
snmp-server engineID local 000000090200003094017780
snmp-server community ricoh RO
!
line con 0
 password cwc
 transport input none
line aux 0
line vty 0 4
 password cwc
 login
!
end

Starboss#


Example 2:
----------

Titan#show running-config

Current configuration:
!
version 12.0
service timestamps debug uptime
service timestamps log uptime
no service password-encryption
!
hostname Titan
!
enable password cwc
!
ip subnet-zero
!
frame-relay switching
!
!
!
interface FastEthernet0/0
 description connected to Titan LAN
 ip address 172.17.30.33 255.255.255.0
 no ip directed-broadcast
 ip accounting output-packets
!
interface Serial0/0
 description *** LMI to C&W Node HRW/EM1 Fruni 4320 Cct M1181933 NXUK271094 ***
 bandwidth 256
 no ip address
 no ip directed-broadcast
 encapsulation frame-relay IETF
 no ip mroute-cache
 priority-group 1
!
interface Serial0/0.1 point-to-point
 description **** 32k Pvc to Starboss S0/0.1 ****
 bandwidth 32
 ip address 10.10.35.1 255.255.255.0
 no ip directed-broadcast
 frame-relay interface-dlci 101
!
interface Serial0/0.2 point-to-point
 description **** 32k Pvc to Hatton S0/0.1 ****
 bandwidth 32
 ip address 10.10.33.1 255.255.255.0
 no ip directed-broadcast
 frame-relay interface-dlci 102
!
interface Serial0/0.3 point-to-point
 description **** 32k Pvc to Cornhill S0/0.1 ****
 bandwidth 32
 ip address 10.10.37.1 255.255.255.0
 no ip directed-broadcast
 frame-relay interface-dlci 103
!
ip classless
ip route 10.1.7.1 255.255.255.255 172.17.30.22 permanent
ip route 133.139.117.53 255.255.255.255 172.17.30.1 permanent
ip route 133.139.157.51 255.255.255.255 172.17.30.1
ip route 172.17.0.0 255.255.0.0 172.17.30.1
ip route 172.17.2.209 255.255.255.255 172.17.30.1 permanent
ip route 172.17.31.0 255.255.255.0 172.17.30.1
ip route 172.17.32.0 255.255.255.0 172.17.30.1
ip route 172.17.33.0 255.255.255.0 Serial0/0.2
ip route 172.17.35.0 255.255.255.0 Serial0/0.1
ip route 172.17.36.0 255.255.255.0 172.17.30.1
ip route 172.17.37.0 255.255.255.0 Serial0/0.3
ip route 172.17.38.0 255.255.255.0 Null0
ip route 172.29.31.0 255.255.255.0 172.17.35.70 permanent
ip route 192.168.174.6 255.255.255.255 172.17.30.1 permanent
ip route 172.17.33.0 255.255.255.0 Serial0/0.2
ip route 172.17.35.0 255.255.255.0 Serial0/0.1
ip route 172.17.36.0 255.255.255.0 172.17.30.1
ip route 172.17.37.0 255.255.255.0 Serial0/0.3
ip route 172.17.38.0 255.255.255.0 Null0
ip route 172.29.31.0 255.255.255.0 172.17.35.70 permanent
ip route 192.168.174.6 255.255.255.255 172.17.30.1 permanent
no ip http server
!
priority-list 1 protocol ip high tcp telnet
snmp-server engineID local 000000090200003094C14FA0
snmp-server community ricoh RO
!
line con 0
 password cwc
 transport input none
line aux 0
line vty 0 4
 password cwc
 login
!
end

Titan#


PART 3: OTHER STUFF:
====================

1. Subnetting ip network:
-------------------------

Traditional Classes:

A: 1-126	0xxxxxxx.yyyyyyyy.yyyyyyy.yyyyyyyy
B: 128-191	10xxxxxx.xxxxxxxx.yyyyyyy.yyyyyyyy
C: 192-223	110xxxxx.xxxxxxxx.xxxxxxx.yyyyyyyy
D: 224		1110----.--------.-------.--------


Class C subnetting:
		    subnets   hosts	subnetbits hostbits
-----------------------------------------------------------
 *255.255.255.128        NA      NA           1        7 not valid
  255.255.255.192         2      62           2        6
  255.255.255.224         6      30           3        5
  255.255.255.240        14      14           4        4
  255.255.255.248        30       6           5        3
  255.255.255.252        62       2           6        2


Class B subnetting:
		    subnets   hosts	subnetbits hostbits
-----------------------------------------------------------
255.255.128.0            NA      NA           1       15
255.255.192.0             2   16382           2       14
255.255.224.0             6    8190           3       13
255.255.240.0            14    4094           4       12
255.255.248.0            30    2046           5       11
255.255.252.0            62    1022           6       10
255.255.254.0           126     510           7        9
255.255.255.0           254     254           8        8
255.255.255.128         510     126           9        7
255.255.255.192        1022      62          10        6
255.255.255.224        2046      30          11        5
255.255.255.240        4094      14          12        4
255.255.255.248        8190       6          13        3
255.255.255.252       16382       2          14        2
     

PART 4: ISDN:
=============

- Reference points

        -------NT1---- Carrier/ISDN switch
            T       U


-------NT2-----NT1---- Carrier/ISDN switch
    S   |   T       U
        |
-------TA
    R

R1---U---------Provider	  Router with ISDN card with U interface (NT1) - bri0

R1--S/T--NT1---U---Provider   Router with ISDN card with S/T interface (TE1) -bri0

R1--R----TA--S--NT2--T--NT1--U--Provider  Router no isdn hardware (TE2) - serial0


- Channels:
BRI: 2B+1D, 
PRI: 23B+1D (US), 30B+1D (Europe)

- Standards

Telephone network and ISDN  - E series example E.163, E.164
ISDN conceps, interfaces    - I series example I.100, I.400
Switching and signaling     - Q series example Q.921, Q.931

- Signalling, Call setup
LAPD is used on D channel between router - ISDN switch
HDLC or PPP is used on B channel from end to end, but PPP support
control protocols as well as PAP and CHAP
Call setup messages refers to both the called and calling SPIDs

- router setup for PPP and CHAP

Router Fred:
username Barney password xyz
interface bri 0
ip address 10.3.3.1 255.255.255.0
encapsulation ppp
ppp authentication chap        

Router Barney:
username Fred password xyz
interface bri 0
ip address 10.3.3.2 255.255.255.0
encapsulation ppp
ppp authentication chap
ppp multilink

-- ppp multilink
dialer load-threshold 25 either  (in|out|either)
ppp multilink

-- Configuration Router
RouterA#config t
RouterA(config)#int bri0
RouterA(config-if)#encapsulation ppp
RouterA(config-if)#isdn switch-type 'type' --remote switch type
RouterA(config-if)#isdn spid1 086506610100 8650661
RouterA(config-if)#isdn spid2 086506620100 8650662

-- DDR
1. define static routes on the routers involved

RouterA(config)#int bri0
RouterA(config-if)#ip address 172.16.60.1 255.255.255.0
RouterA(config-if)#encap ppp

RouterA(config)#ip route 172.16.50.0 255.255.255.0 172.16.60.2
RouterA(config)#ip route 172.16.60.2 255.255.255.255 bri0

2. define interesting traffic, or what brings up the isdn line

RouterA(config)#dialer-list 1 protocol ip permit
RouterA(config)#int bri0
RouterA(config-if)#dialer-group 1        -- binds the access list to bri0

3. define the dialer information, or who must be dialed

RouterA(config-if)#dialer-group 1
RouterA(config-if)#dialer string 8350661  

or use
RouterA(config-if)#dialer map ip 172.16.60.2 name 804B 8350661
This associates an isdn phone number to a next hop router ip address

And now define an idle time-out to terminate the connection, and
allocate multiple channels at a certain threshold.

RouterA(config-if)#dialer load-threshold 125 either
RouterA(config-if)#dialer idle-timeout 180
RouterA(config-if)#dialer fast-idle 120  (if more B channels active)

5. Access lists

You can limit possible traffic by using an extended access list.
For example, permit only email cross the isdn link

RouterA(config)#dialer-list 1 list 110
RouterA(config)#access-list 110 permit tcp any any eq smtp
RouterA(config)#int bri0
RouterA(config-if)#dialer-group 1


#show interfaces bri 0:1
#show dialer interface bri 0
#show isdn active
#show isdn status
#debug isdn q921
#debug isdn q931
#debug dialer events
#debug dialer packets


============
PART 5: NAT:
============


CISCO NAT:
==========


The translation done by NAT can be either static or dynamic. Static translation is where 
we specify a lookup table, and one inside address is turned into one pre-specified outside address. 
Dynamic is where we tell the NAT router what inside addresses need to be translated, 
and what pool of addresses may be used for the outside addresses. 
There can be multiple pools of outside addresses. 
ICMP host unreachable messages are used when addresses run out. 

With NAT, multiple internal hosts can also share a single outside IP address, 
which conserves address space. This is done by port multiplexing: changing the source port 
on the outbound packet so that replies can be directed back to the appropriate machine. 

Address translation is not practical for large numbers of internal hosts all talking 
at the same time to the outside world. NAT just won't work well at a large scale. 

Performance may be a consideration. Currently, NAT causes process switching on NAT interfaces 
on a Cisco 7000. You can think of this as: the CPU has to look at every packet, to decide whether or not 
to translate it, and to alter the IP header, possibly the TCP header. 
One doubts that this will be easily cache-able. 


Configuring NAT:
----------------

Static:
-------

Here's a minimal sample configuration for static address translation. We assume Ethernet 0 is "inside" 
and Serial 0 is "outside". 

Private network 10.0.0.0 is used inside, and 192.1.1.0 is used outside.
We'll translate "10.1.2.3" to "192.1.1.2" (and vice versa). 
The words "inside source" emphasize that the inside source address is what's getting changed. 


10.0.0.0                                           192.1.1.0

                            |----------------|
                    --------|                |-----------------------------
                    |    e0 |----------------|s0
                    |
------------------------
         |
         |
      10.1.2.3


ip nat inside source static 10.1.2.3 192.1.1.2
interface ethernet 0
ip address 10.1.2.1 255.255.255.0
ip nat inside
interface serial 0
ip address 192.1.1.1 255.255.255.0
ip nat outside

You may add address mappings or inside or outside interfaces as necessary. 

Dynamic:
--------

Let's look at dynamic (pooled) translation. Same network and addresses as before. We'll set up 
a pool of addresses, translating sources in the range 10.1.2.0 through 10.1.2.255 to the range 
192.1.1.10 through 20. The access list indicates what source addresses can be translated. 
The idea of the third line is that inside source addresses matching list 20 get translated 
to addresses from the pool named LegalPool. It pretty much says that, doesn't it! 

ip nat pool LegalPool 192.1.1.10 192.1.1.20
access-list 20 permit 10.1.2.0 0.0.0.255
ip nat inside source list 20 pool LegalPool
interface ethernet 0
ip address 10.1.2.1 255.255.255.0
ip nat inside
interface serial 0
ip address 192.1.1.1 255.255.255.0
ip nat outside


You can configure outside source address translation similarly, changing "inside source" to "outside source" 
in the above examples. 
Let's look at how to do static outside address translation, supposing subnet 10.1.5.0 occurs 
both inside and outside (we're connecting to another company here). We only need to talk to the 
outside machine 10.1.5.3, and we'll readdress it as private address 192.168.1.1 on the inside 
(if we use 10.1.5.x, we have more complex routing issues to think about). 
This might call for something like the following. 

ip nat outside source static 10.1.5.3 192.168.1.1
interface ethernet 0
ip address 10.1.2.1 255.255.255.0
ip nat inside
interface serial 0
ip address 10.1.3.1 255.255.255.0
ip nat outside


Examples:
---------


Example 1:
==========


Define Inside Local and Inside Global Addresses:
------------------------------------------------


            A= 10.10.10.1                         171.16.68.1
                   |                                 |
                   |        |----------------|       |
              --------------|                |----------------------------- >>>
                         e0 |----------------|s0

In the configuration shown, when the NAT router receives a packet on its inside interface 
with a source address of 10.10.10.1, the source address is translated to 171.16.68.5. 
This also means that when the NAT router receives a packet on its outside interface 
with a destination address of 171.16.68.5, the destination address is translated to 10.10.10.1.


ip nat inside source static 10.10.10.1 171.16.68.5 

!--- Inside device A is known by the outside cloud as 171.16.68.5.

interface s 0
ip nat inside

interface s 1
ip nat outside

Because of the way NAT is configured, the inside addresses are the only addresses that are translated; 
therefore, the "inside local" address is different from the "inside global" address, while the 
"outside local" address is the same and the "outside global" address.


Define Outside Local and Outside Global Addresses:
--------------------------------------------------

In the next configuration, when the NAT router receives a packet on its outside interface with a source address 
of 171.16.68.1, the source address is translated to 10.10.10.5. This also means that if the NAT router receives 
a packet on its inside interface with a destination address of 10.10.10.5, the destination address is translated 
to 171.16.68.1.


ip nat outside source static 171.16.68.1 10.10.10.5

!--- Outside device A is known to the inside cloud as 10.10.10.5.


interface s 0
ip nat inside

interface s 1
ip nat outside

In this example, because of the way NAT is configured, only the outside addresses get translated; 
therefore, the "outside local" address is different from the "outside global" address, while the 
"inside local" address is the same and the "inside global" address.


Define All Local and Global Addresses:
--------------------------------------

In the final configuration, when the NAT router receives a packet on its inside interface with a source address 
of 10.10.10.1, the source address is translated to 171.16.68.5. When the NAT router receives a packet on its 
outside interface with a source address of 171.16.68.1, the source address is translated to 10.10.10.5.

This also means that when the NAT router receives a packet on its outside interface with a destination address of 
171.16.68.5, the destination address is translated to 10.10.10.1. Also, when the NAT router receives a packet on 
its inside interface with a destination address of 10.10.10.5, the destination address is translated to 171.16.68.1.


ip nat inside source static 10.10.10.1 171.16.68.5

!--- Inside device A is known to the outside cloud as 171.16.68.5.


ip nat outside source static 171.16.68.1 10.10.10.5 

!--- device A is known to the inside cloud as 10.10.10.5.


interface s 0
ip nat inside

interface s 1
ip nat outside


Example 2:
==========


    internal Device A
                 |           NAT
 10.10.10.1/24 --|       e0 ------
                 |----------|    |
               --|          ------
                 |            |s0 172.16.130.2/24
                              |
                              |
                              |
                              |
                              |172.16.130.1/24
                           -------
                           |     | OutSide Device A
                           -------
                              |192.168.1.1/24
                              |
                              |         |   |   |
                         -------------------------


These commands are configured on the NAT router shown above:

ip nat pool test 172.16.131.2 172.16.131.10 netmask 255.255.255.0
ip nat inside source list 7 pool test  
ip nat inside source static 10.10.10.1 172.16.131.1
interface e 0
ip address 10.10.10.254 255.255.255.0
ip nat inside
interface s 0
ip address 172.16.130.2 255.255.255.0
ip nat outside
ip route 192.168.1.0 255.255.255.0 172.16.130.1
access-list 7 permit 10.10.10.0 0.0.0.255


The configuration on the OutsideA device is:

interface Serial1/0
ip address 172.16.130.1 255.255.255.0
serial restart-delay 0
clockrate 64000

!
interface FastEthernet2/0
ip address 192.168.1.1 255.255.255.0
speed auto
half-duplex
ip route 172.16.131.0 255.255.255.0 172.16.130.2


The configuration on the InsideA device is:

interface Ethernet1/0
ip address 10.10.10.1 255.255.255.0
half-duplex
!
ip route 0.0.0.0 0.0.0.0 10.10.10.254


Using the show ip nat translations command, you can see the contents of the translation table:

NATrouter#show ip nat translations
Pro Inside global    Inside local    Outside local    Outside global
--- 172.16.131.1     10.10.10.1      ---              ---


Example 3:
==========


      internal Device A
                    |                         NAT
 145.21.32.150/22 --|     145.21.32.89/22 e0  ------
                    |-------------------------|    |
                  --|                         ------
                    |                           |e1 10.x.y.z/24
                                                |
                                                |
                                                |
                                                |
                                                |10.x.y.w/24
                                             -------
                                             |     | OutSide Device A
                                             -------
                                                |10.
                                                |
                                                |         |   |   |
                                           -------------------------


These commands are configured on the NAT router shown above:

ip nat pool test 10.x.w.n 10.x.w.m netmask 255.255.255.0
ip nat inside source list 1 pool miskm  
ip nat inside source static 145.21.32.150 10.x.w.n
interface e 0
ip address 145.21.32.89 255.255.248.0
ip nat inside
interface  e 1
ip address 10.x.y.z 255.255.255.0
ip nat outside
ip route 192.168.1.0 255.255.255.0 172.16.130.1
access-list 7 permit 145.21.32.0 0.0.0.255


CISCO PIX NAT:
==============


Example 1:
----------

In this tip, administrators can learn how to configure a new PIX firewall, out of the box. 
You will configure passwords, IP addresses, network address translation (NAT) and basic firewall rules. 
Let's say that your boss hands you a new PIX firewall. It has never been configured. 
He says that it needs to be configured with some basic IP addresses, security and a couple of basic 
firewall rules. You have never used a PIX firewall before. How will you be able to perform this configuration? 
After reading this article, it should be easy. Let's find out how. 

-- The basics of a Cisco PIX firewall

A Cisco PIX firewall is meant to protect one network from another. There are PIX firewalls for small home 
networks and PIX firewalls for huge campus or corporate networks. In this example, we will be configuring 
a PIX 501 firewall. The 501 model is meant for a small home network or a small business. 

PIX firewalls have the concept of inside and outside interfaces. The inside interface is the internal, 
usually private, network. The outside interface is the external, usually public, network. You are trying to protect 
the inside network from the outside network. 

PIX firewalls also use the adaptive security algorithm (ASA). This algorithm assigns security levels to 
interfaces and says that no traffic can flow from a lower-level interface (like the outside interface) 
to a higher-level interface (like the inside interface) without a rule allowing it. 
The outside interface has a security level of zero and the inside interface has a security level of 100. 

Here is what the output of the show nameif command looks like: 

pixfirewall# show nameif
nameif ethernet0 outside security0
nameif ethernet1 inside security100
pixfirewall#


Notice the ethernet0 interface is the outside interface (its default name) and the security level is 0. 
On the other hand, the ethernet1 interface is named inside (the default) and has a security level of 100. 

-- Guidelines:
-- ----------- 

Before beginning the configuration, your boss has given you some guidelines that you need to follow. Here they are: 

-All passwords should be set to "cisco" 
 (in reality, you make these whatever you want, but not "cisco"). 

-The inside network is 10.0.0.0 with a 255.0.0.0 subnet mask. 
 The inside IP address for this PIX should be 10.1.1.1. 

-The outside network is 1.1.1.0 with a 255.255.255.0 subnet mask. 
 The outside IP address for this PIX should be 1.1.1.1. 

-You want to create a rule to allow all inside clients on the 10.0.0.0 network to do port address translation 
 and connect to the outside network. They will all share the global IP address 1.1.1.2. 

-However, clients should only have access to port 80 (Web browsing). 

-The default route for the outside (Internet) network will be 1.1.1.254.

            10.0.0.0 / 8                      1.1.1.0 / 24
                   |                                 |
                   |        |----------------|       |                  |---------------|
              --------------| PIX            |--------------------------| Router        |--------
                         e1 |----------------|e0	    1.1.1.254   |---------------|
                    10.1.1.1                 1.1.1.1
                                               1.1.1.2

-- The configuration:
-- ------------------ 

When you boot up your PIX firewall for the first time, you should see a screen like this: 


Cannot be shown in a text document, but looks a bit like:

  *************************************
 Copyright (c) 1996-2003 Cisco Systems, Inc.

             Restricted Rights Legend

 Use, duplication >>>>>>>>>>>>>>>
 >>>>>>>> more stuff >>>>>>>>>>>>
 >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>

 Cryptochecksum(changed): d41424 gs6266 e373738 ec52525

 Pre-configure PIX Firewall now through interactive prompts [yes]?


You will be prompted to answer YES or NO as to whether or not you want to configure the PIX 
through interactive prompts. Answer NO to this question because you want to learn how to really configure 
the PIX firewall, not just answer a series of questions. 
After that, you will be sent to a prompt that looks like this:

pixfirewall> 

With the "greater than" symbol at the end of the prompt, you are in the PIX user mode. 
Change to privileged mode with the en or enable command. Press "enter" at the Password prompt. Here is an example: 

pixfirewall> en 
Password: 
pixfirewall#

You now have administrative mode to show things but would have to go into global configuration mode 
to configure the PIX. 

Now, let's move on to basic configuration of the PIX: 


-- Basic PIX configuration :
-- ------------------------

What I am calling basic configuration is made up of three things: 

Set the hostname 
Set passwords (login and enable) 
Configure IP addresses on interfaces 
Enable interfaces 
Configure a default route

Before you can do any of these things, you need to go into global configuration mode. 
To do this, type: 

pixfirewall# config t
pixfirewall(config)# 

To set the hostname, use the hostname command, like this: 

pixfirewall(config)# hostname PIX1
PIX1(config)# 

Notice that the prompt changed to the name that you set. 

Next, set the login password to cisco, like this: 

PIX1(config)# password cisco
PIX1(config)#

This is the password required to gain any access to the PIX except administrative access. 

Now, configure the enable mode password, used to gain administrative mode access. 

PIX1(config)# enable password cisco
PIX1(config)#


Now we need to configure IP addresses on interfaces and enable those interfaces. The PIX, unlike a router, 
has no concept of interface configuration mode. To configure the IP address on the inside interface, 
use this command: 

PIX1(config)# ip address inside 10.1.1.1 255.0.0.0
PIX1(config)#


Now, configure the outside interface IP address: 
PIX1(config)# ip address outside 1.1.1.1 255.255.255.0
PIX1(config)#


Next, enable both the inside and outside interfaces. Make sure that the Ethernet cable, on each interface, 
is connected to a switch. Note that the ethernet0 interface is the outside interface, and it is only 
a 10base-T interface on a PIX 501. The ethernet1 interface is the inside interface, and it is a 
100Base-T interface. Here is how you enable these interfaces: 

PIX1(config)# interface ethernet0 10baset
PIX1(config)# interface ethernet1 100full 
PIX1(config)# 

Note that you can do a show interfaces command, right from the global configuration prompt line. 

Finally, let's configure a default route so that all traffic sent to the PIX will flow to the next upstream router (the 1.1.1.254 IP address that we were given). Here is how you do this: 

PIX1(config)# route outside 0 0 1.1.1.254
PIX1(config)#


The PIX firewall can, of course, support dynamic routing protocols as well (such as RIP and OSPF). 

Now, let's move on to some more advanced configuration. 


-- Network Address Translation:
-- ----------------------------

Now that we have IP address connectivity, we need to use Network Address Translation (NAT) 
to allow inside users to connect to the outside. We will use a type of NAT, called PAT or NAT Overload, 
so that all inside devices can share one public IP address (the outside IP address of the PIX firewall). 
To do this, enter these commands: 

PIX1(config)# nat (inside) 1 10.0.0.0 255.0.0.0
PIX1(config)# global (outside) 1 1.1.1.2 
Global 1.1.1.2 will be Port Address Translated
PIX1(config)#

With this, all inside clients are able to connect to devices on the public network and share 
IP address 1.1.1.2. However, clients don't yet have any rule allowing them to do this. 


-- Firewall rules:
-- ---------------

These clients on the inside network have a NAT translation, but that doesn't necessarily mean 
that they are allowed access. They now need a rule to allow them to access the outside network (the Internet). 
That rule will also allow the return traffic to come back in. 

To make a rule to allow these clients port 80 (Web browsing), you would type this: 

PIX1(config)# access-list outbound permit tcp 10.0.0.0 255.0.0.0 any eq 80
PIX1(config)# access-group outbound in interface inside 
PIX1(config)# 

Note that PIX access lists, unlike router access lists, use a normal subnet mask, not a wildcard mask. 

With this access list, you have restricted the inside hosts to accessing Web servers only on 
the outside network (routers). 


-- Showing and saving the configuration:
-- ------------------------------------- 

Now that you have configured the PIX firewall, you can show your configuration with the show run command. 

Make sure that you save your configuration with the write memory or wr m command. If you don't, 
your configuration will be lost when the PIX is powered off. 


Example 2:
----------


!--- Sets the outside address of the PIX Firewall:

ip address outside 131.1.23.2 

!--- Sets the inside address of the PIX Firewall:

ip address inside 10.10.254.1

!--- Sets the global pool for hosts inside the firewall:

global (outside) 1 131.1.23.12-131.1.23.254

!--- Allows hosts in the 10.0.0.0 network to be
!--- translated through the PIX:

nat (inside) 1 10.0.0.0 

!--- Configures a static translation for an admin workstation 
!--- with local address 10.14.8.50:

static (inside,outside) 131.1.23.11 10.14.8.50

!--- Allows syslog packets to pass through the PIX from RTRA.
!--- You can use conduits OR access-lists to permit traffic.
!--- Conduits has been added to show the use of the command,
!--- however they are commented in the document, since the 
!--- recommendation is to use access-list.
!--- To the admin workstation (syslog server):
!--- Using conduit: 
!--- conduit permit udp host 131.1.23.11 eq 514 host 131.1.23.1 


!--- Using access-list:

Access-list 101 permit udp host 131.1.23.1 host 131.1.23.11 255.255.255.0 eq 514
Access-group 101 in interface outside

!--- Permits incoming mail connections to 131.1.23.10:

static (inside, outside) 131.1.23.10 10.10.254.3

!--- Using conduits
!--- conduit permit TCP host 131.1.23.10 eq smtp any
!--- Using Access-lists, we use access-list 101
!--- which is already applied to interface outside.

Access-list 101 permit tcp any host 131.1.23.10 eq smtp

!--- PIX needs static routes or the use of routing protocols
!--- to know about networks not directly connected.
!--- Add a route to network 10.14.8.x/24.

route inside 10.14.8.0 255.255.255.0 10.10.254.2

!--- Add a default route to the rest of the traffic 
!--- that goes to the internet.

Route outside 0.0.0.0 0.0.0.0 131.1.23.1

!--- Enables the Mail Guard feature 
!--- to accept only seven SMTP commands 
!--- HELO, MAIL, RCPT, DATA, RSET, NOOP, and QUIT:
!--- (This can be turned off to permit ESMTP by negating with 
!--- the no fixup protocol smtp 25 command):

fixup protocol smtp 25

!--- Allows Telnet from the inside workstation at 10.14.8.50 
!--- into the inside interface of the PIX:

telnet 10.14.8.50

!--- Turns on logging:

logging on

!--- Turns on the logging facility 20:

logging facility 20

!--- Turns on logging level 7:

logging history 7

!--- Turns on the logging on the inside interface:

logging host inside 10.14.8.50


Example 3:
----------

pix outside:		195.73.20.75 / 255.255.255.248							
Device A in inside is:	192.168.1.2 / 255.255.255.0	


            A= 192.168.1.2                       195.73.20.75
                   |                                 |
                   |        |----------------|       |                  |---------------|
              --------------| PIX            |--------------------------| ADSL or Cable |--------
                         e0 |----------------|e1	   195.73.20.73 |---------------|


ip address outside 195.73.20.75 255.255.255.248
ip address inside 192.168.1.1 255.255.255.0
nat (inside) 1 192.168.1.0
static (inside, outside) 192.168.1.2 195.73.20.75
route outside 0 0 195.73.20.73 1

					
Example 4:
----------

ip address inside 10.1.1.1 255.255.255.0

ip address outside 209.165.201.1 255.255.255.224

nat (inside) 1 10.1.1.0 255.255.255.0

global (outside) 1 209.165.201.2 netmask 255.255.255.224

static (inside,outside) 209.165.201.3 10.1.1.3 netmask 255.255.255.255

access-list acl_out permit tcp any host 209.165.201.3 eq www

aaa authentication include http outside 209.165.201.3 255.255.255.255 0 0 TACACS+

route outside 0 0 209.165.201.4 1

telnet 10.1.1.2 255.255.255.255


In these examples, the ip address commands specify addresses for the inside and outside network interfaces. 
The ip address command only uses network masks. The inside interface is a Class A address, but only the l
ast octet is used in the example network and therefore has a Class C mask. The outside interface i
s part of a subnet so the mask reflects the .224 subnet value.

The nat command lets users start connections from the inside network. Because a network address is specified, 
the class mask specified by the ip address inside command is used.

The global command provides a PAT (Port Address Translation) address to handle the translated connections 
from the inside. The global address is also part of the subnet and contains the same mask specified 
in the ip address outside command.

The static command maps an inside host to a global address for access by outside users. Host masks are always 
specified as 255.255.255.255.

The access-list command permits any outside host to access the global address specified by the static command. 
The host parameter is the same as if you specified 209.165.201.3 255.255.255.255.

The aaa command indicates that any users wishing to access the global address must be authenticated. 
Because authentication only occurs when users access the specified global which is mapped to a host, 
the mask is for a host. The "0 0" entry indicates any host and its respective mask.

The route statement specifies the address of the default router. The "0 0" entry indicates any host and 
its respective mask. 

The telnet command specifies a host that can access the PIX Firewall unit's console using Telnet. Because it is 
a single host, a host mask is used. 


2. About the Global command:
----------------------------

[no] global [(if_name)] nat_id {global_ip [-global_ip] [netmask global_mask]} | interface 

clear global 

show global 


The global command defines a pool of global addresses. The global addresses in the pool provide 
an IP address for each outbound connection, and for those inbound connections resulting from outbound 
connections. Ensure that associated nat and global command statements have the same nat_id. 

When used on a PPPoE interface, the global command should explicitly include a netmask. Otherwise, 
the 255.255.255.255 netmask, assigned to the interface by PPPoE, is used as the broadcast mask. 
In that case, all addresses in the global pool may become broadcast addresses and will become unusable 
for address translation. 

Use caution with names that contain a "-" (dash) character because the global command interprets 
the last (or only) "-" character in the name as a range specifier instead of as part of the name. 
For example, the global command treats the name "host-net2" as a range from "host" to "net2". 
If the name is "host-net2-section3" then it is interpreted as a range from "host-net2" to "section3". 

The following command form is used for Port Address Translation (PAT) only:
global [(if_name)] nat_id {{global_ip} [netmask global_mask] | interface} 

After changing or removing a global command statement, use the clear xlate command. 

Use the no global command to remove access to a nat_id, or to a Port Address Translation (PAT) address, 
or address range within a nat_id. 

The "show global" command displays the global command statements in the configuration. 

Examples:
global (outside) 1 209.165.201.2 netmask 255.255.255.224
global (outside) 1 209.165.201.1-209.165.201.10 netmask 255.255.255.224
global (outside) 1 interface
global (inside) 1 209.165.202.128 netmask 255.255.255.224


PAT 

You can enable the Port Address Translation (PAT) feature by entering a single IP address with 
the global command. PAT lets multiple outbound sessions appear to originate from a single IP address. 
With PAT enabled, the PIX Firewall chooses a unique port number from the PAT IP address for each outbound 
xlate (translation slot). This feature is valuable when an Internet service provider cannot allocate enough 
unique IP addresses for your outbound connections. An IP address you specify for a PAT cannot be used in 
another global address pool. 

When a PAT augments a pool of global addresses, first the addresses from the global pool are used, 
then the next connection is taken from the PAT address. If a global pool address is available, the next 
connection takes that address. The global pool addresses always come first, before a PAT address is used. 
Augment a pool of global addresses with a PAT by using the same nat_id in the global command statements 
that create the global pools and the PAT. 

For example: 

global (outside) 1 209.165.201.1-209.165.201.10 netmask 255.255.255.224

global (outside) 1 209.165.201.22 netmask 255.255.255.224


More examples:
--------------

1.
==

Cisco PIX: Allow traffic to an internal host

Permit selected traffic to an internal host:

First, a static mapping must be made for the host. There is another recipe for this configuration. 

static (inside,outside) 1.1.1.1 192.168.0.100 netmask 255.255.255.255

then: 

To allow traffic, a conduit must be constructed. For example, to allow ICMP (ping) traffic to all hosts 
from anywhere (bad idea): 

conduit permit icmp any any


To allow SSH to a specific host from anywhere: 

conduit permit tcp host 1.1.1.1 eq 22 any 


or 

With ACLs: 

access-list 100 permit tcp any host 1.1.1.1 22 
access-group 100 in interface outside

2.
==

How to add a static map through a PIX to a device on the inside of your network. A one to one translation.
static (inside,outside) (outside IP) (inside IP) netmask 255.255.255.255 

Example: 

static (inside,outside) x.x.x.x x.x.x.x netmask 255.255.255.255 

Now you have a static nat to a specific device on the inside of your PIX. You can now write an Access List 
to specify what services to allow to this device.

3:
==

Load a new Cisco PIX software image from a TFTP server:

TFTP (trivial file transfer protocol) provides a convenient means of quickly transferring a Cisco 
IOS image to a firewall over an ethernet interface. This procedure is substantially faster than 
transferring over a serial port.

Step 1: Copy the IOS binary file to the TFTP directory. 

By default on most UNIX systems, the default data directory for the TFTP server is /tftpboot 
Copy the IOS image file to this directory and make sure it is world readable (i.e., chmod 544 
/tftpboot/filename.bin). The first time you try this procedure, or anytime you experience troubles, 
test the TFTP server configuration with the tftp client: 

cd /tmp 
tftp localhost 
get filename.bin

You can change directory to /tmp or any other directory that does not contain the image file. 
You must use the exact name of your binary file. 
If there are no error messages, proceed; otherwise troubleshoot based on the error message. 

Step 2: Configure an ethernet interface on the firewall if not already configured. 

Test the configuration by pinging the ip address of the TFTP server from the firewall. 

Step 3: Load the IOS image 

From enable mode on the firewall, the following command will load the IOS image in filename.bin from 
the TFTP server at IP address 192.168.200.15: 

copy tftp://192.168.200.15/filename.bin flash

You will be asked to confirm this procedure. Press ENTER to confirm. 

Step 4: Restart the firewall 

From enable mode, use the 'reload' command to restart the firewall.


#############################################################################################
#############################################################################################
#############################################################################################


============================================================
Section 11. Basic VMS commands and Operations:
============================================================


1. The Platform:
================

VMS stand for Virtual Memory System. OpenVMS is not much different to VMS. 
It was just a marketing name change to reflect the Posix support in VMS.

In Alpha AXP the AXP does not stand for anything. It's just that you can't copyright 
a Greek letter, so DEC added AXP. 

VMS and/or OPenVMS commonly runs on DEC VAX hardware and Alpha machines.
VAX stands for Virtual Address eXtension. 
The follwing hardware can run VMS:

- VAX workstations 
- small VAXes (MicroVAX I, II, 3000, 4000) 
- medium VAXes (VAX 6000, VAX 7000, VAX 8000) 
- big VAXes (VAX 9000, VAX 10000) 
- ft VAXes 
- DEC ALPHAs (DEC 2000,3000,4000,7000,10000) 
- AlphaStation ALPHAs 
- AlphaServer ALPHAs 
- AlphaStation XP/DS/ES 
- AlphaServer DS/ES/GS 

HP says that OpenVMS will be ported to the Intel Itanium platform,
which could have important consequences about the lifetime/lifecycle of OpenVMS.


2. VMS Files and directories:
=============================

A VMS file specification consists of three parts: 

1 physical or logical device name, like PDS$DISK: 
2 directory or sub-directory, like [USER], or [USER.TEX] 
3 the file name itself, which has the form: name.type;version 

where name is an alphanumeric string of up to 40 characters, type is the file type 
(up to 40 characters), and v is the version number between 1 and 32767, e.g. 

PROG.EXE;17, or TEST.TXT;1

So, a file completely qualified could be written like

PDS$DISK:[USER.TEX]PROG.EXE;17

Thus, complete file specification of a file stored on a disk has the following format: 
                                            
device:[directory.subdirectory]filename.type;version

The device and directory part are known as the pathname, and may be prefixed 
by 'nodename::' if they are on a different computer.

Each user has defaults for the device name, and the directory. 
You may find out your current defaults by typing 

$ SHOW DEFAULT 

This shows you the device and directory which VMS will assume if you specify 
a file name only. You may change the default, separately for the device name and the directory, using 

$ SET DEF new_default 

If you enter a file name in any command without items 1 and 2, e.g. simply TEST.TXT , 
the system will precede the name by the default internally, and take the highest version number of the file.

Some default file types:

Type  Default for         Contents  
COM   DCL                 DCL command file (like Unix script) 
EXE   VMS                 Executable program image 
C     C Compiler          C source  
CXX   CXX Compiler        C++ source  
FOR   Fortran Compiler    FORTRAN source  
MAR   Macro Assembler     VAX Macro Assembler  
DAT   Many things         Data file, e.g. program input/output 
LIS   Compilers etc.      Informational listing (e.g. compiler output) 
LOG   VMS batch jobs      Batch job output log file 
MAP   LINKer              Map created by object linker 
OBJ   LINKer              Object file (= relocatable binary) 
OLB   LIBRARY Librarian   Binary object library 
PS    TEX, DECwrite etc.  PostScript laser printer commands 
TEX   TEX                 Text to be processed by TeX  
DVI   TEX                 Device-independent output from TeX  


If you conform to the standard extensions, compiling, linking and running
could be as easy as:

$ FORTRAN TUT 
$ LINK TUT 
$ RUN TUT 

$ cc program
$ link program
$ run program


3. ASSIGN command for substituting a logical name for physical name:
====================================================================

You can use a logical device name in a file specifications instead of a physical device, for example like

$ ASSIGN/NOLOG DEV$DISK:[HORACE.MCARLO] MONTE$DIR:

Files in the DEV$DISK:[HORACE.MCARLO] directory could then be referred to as MONTE$DIR:filename.type;v . 
This is shorter, and has the advantage that if you move your files for any reason, 
you need only change one logical name and everything will work in the new directory.

Any logical names which you ASSIGN or DEFINE are placed in a separate name table for your process only. 
These will disappear once you log off, so frequently used logical names should be assigned in your 
LOGIN.COM command file.

To see what logical names are defined for you enter 

$ SHOW LOGICAL 
or 
$ SHOW LOGICAL SY* 

to see only the logical names beginning with th letters "SY", for example.

More examples:

$ ASSIGN $DISK1:[CREMERS.MEMOS] MEMOSD
      
The ASSIGN command in this example equates the partial file specification $DISK1:[CREMERS.MEMOS] 
to the logical name MEMOSD. 

$ ASSIGN/USER_MODE $DISK1:[FODDY.MEMOS]WATER.TXT TM1
      
The ASSIGN command in this example equates the logical name TM1 to a file specification. 
After the next image runs, the logical name is deassigned automatically. 


4. MANIPULATING FILES:
======================


Directories:
------------

A directory is a special type of file that contains information about
the other files contained within it.  The directory part of a file
specification is delimited by [] brackets.  SET DEFAULT commands
specify the "default directory", where files will be read from or
written to, unless the filename explicitly specifies a different
directory. 

  Nomenclature

    []            The current directory.
    [-]           One level up.
    [-.-]         Two levels up.
    [--]          Ditto.
    [...]         Everything below the current level.
    [.*]          All subdirectories one level down.
    SYS$LOGIN     The user's login directory.
    SYS$SCRATCH   The user's scratch directory (for large operations). 

Actions

  $ CREATE/DIR  [.SUBDIR] Create a subdirectory.
  $ SET DEFAULT [.SUBDIR] Move to this new subdirectory.
  $ SET DEFAULT PRGDISK:[SHARED.PROGRAMS] 
                          Move to this location.
  $ SET DEFAULT [-]       Move to one directory level up.
  $ SET DEFAULT SYS$LOGIN Move to the user's home directory.

  To delete a directory, first make all files in it deletable, then
  remove them:

  $ SET FILE/PROT=O:RWED [.SUBDIR...]*.*;*
  $ DELETE [.SUBDIR...]*.*;* Issue this command until no error messages appear.

  then do:

  $ SET FILE/PROT=O:RWED SUBDIR.DIR
  $ DELETE SUBDIR.DIR;


DEL, DIR, PURGE, RENAME, SEARCH and file management:
----------------------------------------------------

DIR command:
============

In general, it is useful to look at all your files now and then, using the command 

$ DIR         List everything in the current directory.
$ DIR DISK:[DIR1.SUBDIR1]
                      List everything in the specified directory.
$ DIR/SIZE/OWNER/PROT FRED*.*
                      List all files beginnig with "FRED" and show
                      their size, who owns them, and what their protections are.
$ DIR/SIZ=ALL    or    DIR/SIZ=ALL filename
$ DIRECTORY/SINCE=TODAY/SIZE=ALL
$ HELP DIR    More information on the DIRECTORY command.


DEL command:
============

You can delete a file by typing (after having run the above example) 

$ DEL TUT.EXE;1 

(the ; is always necessary, but the version number may be omitted if 
you mean the highest version number) which will do it, and tell you so if 
your LOGIN.COM file defaults are set correctly. 
You can also use the confirm switch.

$ DELETE/CONFIRM PROTO.*;*
|     USER$DISK:[FAXYZ]PROTO.DAT;2, DELETE? [N]:


PURGE command:
==============

A useful command is PURGE, e.g. 

$ PURGE TUT.OBJ 

(without version number or ; ) which will delete all files TUT.OBJ except 
the highest version number. A qualifier allows you to specify how 
many versions to keep, counting from the top, e.g. 

$ PURGE/KEEP=3 name.type 

will keep the three highest version numbers of the file specified. 

RENAME command:
===============

Sometimes you want to RENAME a file, like in the following example:

$ RENAME VITALFILE.FOR;2 VITALFILE.BAC 

As you may have guessed by now, no file is ever deleted or replaced by the system, 
but a higher version is created instead. This makes the PURGE command so necessary 
if you are to avoid using up all of your disk space allocation, or `disk quota'. 

SEARCH command:
===============

Just like 'find' in DOS or 'grep' in UNIX, DCL allows to find stringvalues in files
with the SEARCH command.
Just a few examples will provide the general idea:

$ SEARCH *.* Fred       
              means find Fred, FRED, fred, etc. in the latest version of any file

$ SEARCH/MATCH=EXACT *.*;* "Fred"
              means find only Fred in any version of any file

$ SEARCH *.* search_string
              means search all files for search_string           
  
$ SEARCH *.FOR string 

which (in the above case) will copy all lines containing "string" from all files of type .FOR 
to the screen, or onto an output file if you specify the /OUTPUT=filename option after SEARCH. 


USE OF WILDCARDS:
-----------------

Just like in DOS or UNIX, you can use wildcards in filemanagement or listings.
Use asterisks "*" and percent signs "%" in connection with file names, 
where an asterisk stands for "any alphanumeric string" and a "%" for "any alphanumeric character". 
A command containing this type of name specification is called a WILD card. 

Examples:

$ PURGE *.* 
or just 
$ PUR 
will delete all but the highest version number of all your files (it is good practise to do this from time to time). 

The command

$ DIR *.COM 

will list all your files of file type COM, 

$ DIR MY*.%%A 

will list all your files starting with `MY' and having a file type 
of three characters, the last one being `A'. 


Copy command and filetransfer:
------------------------------

When VAX and Alphas are clustered, transfer is trivial. 
Certain disks on the different machines are available to all members of the cluster, 
and they have names of the form "node_name$disk", eg. YR9$DKA300, YRL$DKA100, YRE$DKA400. 
You can see which disks are available using the SHOW DEVICE D command. 

Example of filetransfer:

To copy a file from user MORRISSEY's directory on DEV$DISK to use MARR's directory 
on PDS$DISK without changing the filename you would enter 

$  COPY DEV$DISK:[MORRISSEY]TCM.TEX PDS$DISK:[MARR] 

There is usually no need for this sort of transfer, since you can access all the main disks 
from every cluster member anyway, though you might want your own copy of a file to modify. 

When VAX or Alphas are not clustered, but are linked by DECnet as is often the case, 
file transfer over DECnet is achieved by a simple copy command, which looks like this: 

$ COPY node_name1::from_file node_name2::to_file 

Note that the remote file specification must contain the DECnet node name in addition 
to the disk and directory name. The node name is followed by ::. 

Example:

To copy a file called COMMAND.COM from directory DISK$USERS:[TEST] on a VAX or Alpha called VMS1, 
to your current default directory you would enter 

$ COPY VMS1::DISK$USERS:[TEST]COMMAND.COM []  

Usually the system managers of the machines you use will have set up what is know 
as a DECnet proxy entry to allow you to copy easily, as in the above example. 
If this isn't the case, then you may have to specify your remote username and password when you do the copy. 
Imagine you were copying a local file called REPORT.TXT to a remote machine, DAKOTA, 
on which you had an account with username FARGO and password UBETCHAYAH. 
You place the username and password in quotes, between the username and the :: . 

$ COPY REPORT.TXT DAKOTA"FARGO
UBETCHAYAH"::USER$DISK:[REPORTS] 

If you were to miss out the USER$DISK:[REPORTS] part of the destination specification, 
then the file would end up in FARGO's default login directory. 
It's quite easy to lose track of where files are going during a COPY, so it's a good idea to add 
the /LOG qualifier to the copy command so that you are told exactly where the file ended up ! 


File protection or permissions:
-------------------------------

Four forms of access to your files may be allowed or denied to four classes of users. 

- The four access classes are: 

  Read access, Write access, Delete access and Execute access. 

- The four classes of users are: 

  System manager, Owner, Group members and World (everyone). 


You may show the current protection of a file by entering a command like the following: 

$ DIRECTORY/PROTECTION  to see something like:

Directory USER$DISK:[FAXYZ]

DRAFT.TXT;1                124  OCT-31-1988      (RE,RWED,RW,R)

The notation in parentheses shows that the system manager is given read and execute access 
to the file DRAFT.TXT, while the owner retains all four forms of access, 
members of the same group have read and write access while "world" users (anyone else) 
has only read access. 
You may alter the access to the file with the command SET PROTECTION by specifying 
the access and the filename: 

$ SET PROTECTION=(S:RW,O:RWE,G:R,W:) DRAFT.TXT;1


5. VMS-UNIX command conversion chart:
=====================================

Sometimes it is helpfull to compare some unix commands to VMS commands:

UNIX			VMS
----			----

help			HELP
man command		HELP COMMAND
ls			DIR{ECTORY}
ls -ls			DIR /OWNER /DATE /SIZE /PROT{ECTION}
ls ..			DIR [-]
ls subdirectory		DIR [.SUBDIRECTORY]
ls subdir1/subdir2	DIR [.SUBDIR1.SUBDIR2]
mkdir subdir		CREATE/DIR [.SUBDIR]
cd			SET DEFAULT SYS$LOGIN
cd subdir		SET DEF [.SUBDIR]
cd ../subdir		SET DEF [-.SUBDIR]
cp file1 file2		COPY FILE1 FILE2
cp file subdir		COPY FILE [.SUBDIR]
mv file1 file2 		RENAME FILE1 FILE2
rm file			DELETE FILE;
rmdir subdir		SET FILE/PROTECTION=(OWNER:RWED)
			DEL SUBDIR.DIR;

chmod ... filenm 	SET FILE/PROT=(...) FILENM
 u			O{WNER}:
 g			G{ROUP}:
 o			W{ORLD}:
 r			R
 w			W
 x			E
 			D (DELETE)
chmod 755 filenm	SET FILE/PROT=(O:RWED,G:RE,W:RE) FILENM
command > file		command/OUTPUT=FILE
command < file		command/INPUT=FILE
rlogin machine		SET HOST MACHINE
script {scriptfile}	SET HOST MACHINE /LOG{=SCRIPTFILE}
 (default typescript)	 (default SETHOST.LOG)

vi/edit file		ADAM FILE (EDT default editor)
			EDIT/TPU FILE (programmable editor w/windows)
			LS{EDIT} FILE (language sensitive editor)

cat file		TYPE FILE
more file		TYPE/PAGE FILE
cat file1 ... filen > newfile	COPY FILE1,...,FILEN NEWFILE
cat file1 ... filen >> newfile	APPEND FILE1,...,FILEN NEWFILE
lno file		SEARCH/NUMBER FILE ""	
lpr file		HOTPRINT FILE -or- PRINT FILE
lpq			SHOW QUEUE
grep string file	SEARCH FILE STRING
sort file > outfile	SORT FILE OUTFILE
sort file		SORT FILE SYS$OUPTUT
write user		PHONE USER
mail user		MAIL
			SEND

ps			SHOW SYSTEM
date			SHOW TIME -or- SHO DATE	
scriptfile		@DCLSCRIPT.COM
. scriptfile
sh < scriptfile
source scriptfile

alias command 'string' (csh)	COMMAND :== STRING (see consultant)
alias			SHOW SYMBOL/GLOBAL/ALL
 .login -or- .profile	LOGIN.COM
stdin			SYS$INPUT (VMS logicals)
stdout			SYS$OUTPUT
stderr			SYS$ERROR
whoami			SHOW PROCESS
dc -or- bc		CALC
^D (logout)		LO{GOUT}
^D (EOF)		^Z
netcp (unix to unix)	NETCOPY (vms to vms)
transfer		TRANSFER
nroff			RUNOFF
passwd			SET PASSWORD


6. DCL Commands, common ones in alphabetical order:
===================================================

Common commands:
----------------

Overview

  This is a list of the commands most likely to be used by nonprivileged
  users.

Actions

  $ _numeric == 20
                     Define a symbol that contains a numeric value.
  $ _symbol :== a string
                     Define a symbol that contains a string.
  $ append           Append one or more files to one file.
  $ assign           Define a logical name.
  $ attach           Transfer control of terminal to a different process.
  $ backup           Make copies of files, directories, disks.
  $ continue         After a {ctrl Y}, let program continue.
  $ convert          Change the format or contents of a file.
  $ copy             Copy a file or files.
  $ create           Create a file.
  $ create/dir       Create a directory.
  $ deassign         Cancel a logical name assignment.
  $ define           Define a logical name.
  $ delete           Delete a file, queue entry, or symbol.
  $ differences      Compare two files, show the differences.
  $ directory        List a directory's contents.
  $ edit             Edit a file.  Many editors available.
  $ ftp              Transfer files to/from another computer.
  $ help             Get help on a topic.
  $ mail             Start the MAIL utility, send/read/print/delete mail.
  $ merge            Merge up to 10 presorted files into one.
  $ monitor          Check on disk, processor, etc. usage.
  $ multinet ping    Check the route to another computer.
  $ phone            Interactive conversation with another user.
  $ posix            Enter the POSIX shell (like Unix).
  $ print            Print a file.
  $ purge            Delete lower numbered versions of a file.
  $ rcp              Copy files to/from another computer.
  $ read             Read information from the screen or a file.
  $ recall           Recall previous commands.
  $ rename           Change the name of a file or files.
  $ rshell            Execute commands on another computer.
  $ run              Run a program.
  $ search           Search file(s) for one or more strings.
  $ set              Set many things: terminal, queue entry, priority,etc.
  $ show             Show whatever SET can set.
  $ sort             Sort the contents of a file.
  $ spawn            Create a subprocess.
  $ stop             Stop a process or queue.
  $ submit           Start a batch job.
  $ talk             Interactive conversation with user on another computer.
  $ telnet           Interactive session on another computer.
  $ type             Type a file to the terminal.
  $ write            Send information to the screen or a file.


Commands related to devices:
----------------------------

Devices

Overview

  There are many types of devices available on OpenVMS systems, such 
  as disks, tapes, terminals, printers, and so forth.  The operating
  system will assign each of them a name like "DKA100:" (note the trailing
  colon).  One special device is NLA0:, which is the null device.  Output
  directed there disappears - convenient for disposing of status messages
  and such. 

Actions

  $ SHOW DEVICE [/FULL] [device_name]
                     Display information on one or more devices.

  Disk and tape commands, usually issued by privileged users.

  $ INIT device_name volume_label 
                     Initialize the device.
  $ MOUNT device_name volume_label 
                     Mount the volume in the device on the system.
  $ DISMOUNT device_name
                     Dismount the volume in the device.
  
  Terminal commands, typically anybody can use these.

  $ SHOW TERM        Show terminal characteristics.

  $ SET TERM [/WIDTH=132] [/PAGE=50] [/SPEED=9600]
                     Change terminal characteristics.


7. COMMAND FILES:
=================

Just as in UNIX or DOS, you can create scripts, or batchfiles that run a series
of statements.
Command files are files containing a series of DCL commands, just as you would enter them from a terminal. 
They can either be executed interactively, or submitted for batch execution 
- see the later section on batch jobs. A command file which you have executed already, 
perhaps without realising it, is your LOGIN.COM . This is executed automatically every time you log in, 
although you can stop it from being executed (if you have made 
some sort of mistake in it that causes a loop, say) by adding /NOCOM after your user name when you log in. 

The default file type for command files is .COM , so if you just type @MYCOMMANDS then VMS 
will assume that you mean MYCOMMANDS.COM. 

Each command must be preceeded by a $ sign; lines without this are interpreted 
as input to procedures called from the command file, and are otherwise skipped, with an error message. 
Continuation lines are indicated by a "-" (hyphen = minus sign) at the end of the line, 
and simple continuation in the next line, e.g. 


$ SHOW -
    SYSTEM  ! This would be a silly place to split a line, but you
get the idea

Example:
Try this ! 

Create the file TESTCOM.COM and try to predict its action. 
The exclamation mark is a comment character in DCL, like * in FORTRAN, // in C++. 

$ CREATE TESTCOM.COM ! Whatever you type now goes into the file
TESTCOM.COM 
$ WRITE SYS$OUTPUT "Hello World" ! Ever original 
$ EXIT
'Ctrl-Z'

The 'Ctrl-Z' terminates the input to the CREATE command. If you 

$ TYPE TESTCOM.COM 

It should look like this: 

$ WRITE SYS$OUTPUT "Hello World" ! Ever original
$ EXIT

To execute the file, you have to enter 

$ @TESTCOM 


8. DCL SYMBOLS OR VARIABLES:
============================

Symbols are useful for defining shorthand for frequently used commands, 
and for use as "variables" in DCL command procedures. 
Using the single "=" sign you can define symbols which are local to a command file, 
ie. they disappear at exit from it, or you can define global symbols 
which remain valid until you logoff, using "==". 

Examples:

 three = 3
 file := SYS$EXAMPLES:TUT.FOR

can be used inside the command file, e.g. setting up the files for a batch job. 
Please note that "=" assigns a value, ":=" assigns a string to a symbol. 
Placing the string in double quotes " is also acceptable. 

 three == 3
 file :== SYS$EXAMPLES:TUT.FOR
 string1 :==The whole of this line will end up in STRING1
 string2 == "This is a string too"

All these symbols will however remain valid even after the execution of the command file, 
because the "==" was used 

DCL is case-insensitive for the most part, so it doesn't matter whether your symbols 
are uppercase or lowercase. Having said that I tend to use lowercase for my own symbols, 
and uppercase for built in DCL commands, just to make it easier to read and tell them apart. 
To invoke a symbol, put it between quotes, e.g. 'file', as in 

$ file := MYDATA.DAT  ! Local symbol
$ COPY 'file' FARVAX::SCRATCH$DEVICE:

Note that it is the right-hand single quote ' both before and after the symbol. 
If you use the symbol within a quoted string, you need two quotes before it and one after, like this: 

$ file :== TESTCOM.COM ! Global symbol
$ WRITE SYS$OUTPUT "Copying ''file' to the remote system." ! Two '
$ COPY 'file' FARVAX::SCRATCH$DEVICE:                      ! One '

Since symbols can be defined directly, without command files, 
try the above definition of file followed by the command 

$ TYPE 'file'

and you will understand. 

Symbols are often used to provide a shorthand way of specifying a frequently used command 
with several qualifiers. For example, instead of having to type 

$ DIRECTORY/SINCE=TODAY/SIZE=ALL ! Get all files created
today, show their size

you could define a symbol in your LOGIN.COM like this: 

$! Get all files created today, show their size
$ SDIR:==DIRECTORY/SINCE=TODAY/SIZE=ALL

then you need only type 

$ SDIR

to get the same information. It is considered bad practise to define symbols that clash 
with built-in DCL commands, because it can lead to all sorts of confusion regarding 
the expected behaviour of commands. To see what symbols you already have defined you can type 

$ SHOW SYMBOL *

This assumes that someone hasn't defined a symbol called SHOW to do something else ! 
If you suspect that they have, you can get rid of symbols by typing 

$ DELETE/SYMBOL symbol_name

You could guarantee that DELETE would give you the DCL DELETE functionality by doing 

$ DELETE:=DELETE

and indeed you will occasionally see this done in command files, to insulate them from 
the effects of any users who have been foolish enough to define symbols that clash with DCL commands. 


9. Processes:
=============

SHOW PROCESSES:
---------------

$ SHOW SYSTEM
$ SHOW PROCESS/ALL 
$ SHOW PROCESS /id=process_id
$ SHOW PROCESS process_name

which gives information about your process. Normal user priority is 4, 
but certain system tasks have higher priority, and user batch jobs 
always have lower priority (1, 2 or 3 for long, medium or normal batch jobs) 
so that they use up spare CPU time with very little inconvenience to interactive users. 
Normal users can only set their priority, or that of their batch jobs, 
up to the base limit of 4. VMS also manages the batch job queues, 
allows the different VAX and Alphas in the cluster to talk to each other, 
and many other tasks of this nature.

You can tell which processes are hogging which resources using variants
of the MONITOR command:

 $ MONITOR process/topcpu   Who's using all the CPU?
 $ MONITOR process/topfault Who's page faulting so much?
 $ MONITOR disk             What's going on on the disks?


CREATE A SUBPROCESS:
--------------------

Use the SPAWN command.  Here is an example of interrupting a program,
creating a subprocess, doing some stuff in it interactively, and then
returning to the program running in the main process:

$ run myprog
^Y
$ spawn
$ dir *.dat  Do a couple of commands, this is just an example
$ logout
$ continue   The program completes normally.

Note that giving a command other than spawn or attach would have
killed the halted program "myprog".

You can also use Spawn to get a subprocess running at the same time
as the main process.  For instance, the following will start the program
XV (an interactive graphics program for DECwindows) and then let you 
continue with the current session:

  $ spawn/nowait xv
  $

Note that a ^Y or ^C at the top session will kill the subprocess.

STOP A PROCESS:
---------------

If you know the name or process ID, and it belongs to you, or you have
sufficient privileges: 

$ stop process_name

           or

$ stop/id=process_number   a typical number: 20200242

You can get the process_name or _number from:

$ show system

If the process you want to stop is your current session, or the program
you are running, use:

{^Y}   Control key and Y, stop the current program.
$ logout


BATCHJOBS:
----------

How do I start a batch job?
---------------------------

First you need to put a series of DCL commands into a file, because
batch jobs require DCL procedure files to tell them what to do.  (They
aren't interactive, so you can't do so from your terminal.)  Here is 
a simple procedure file that sorts a couple of files and then merges them.
Generally, you would use an editor to create this file.

-- just an example 'test.com' file:

$! first line of "TEST.COM", note no error checking!!!
$ sort file1.txt file1.txt_sorted
$ sort file2.txt file2.txt_sorted
$ sort file3.txt file3.txt_sorted
$ merge file1.txt_sorted,file2,file3 file4.txt
$ delete file%.txt.
$ write sys$Output "All done"
$!last line of file

This is one command you might use to start it on a batch queue:

 $ SUBMIT/NOTIFY/NOPRINT/LOG=SYS$LOGIN: [QUEUE=queue_name] test.com

This says:
  Put it on the batch queue named "queue_name"
  Notify my terminal when it finishes (only works if you are still
     logged in!) 
  Keep a log file, it will be SYS$LOGIN:TEST.LOG
  Don't print the log file

It will tell you the entry number when it is placed on the queue.


How do I stop a batch job?
--------------------------

First, figure out the entry number, if you didn't write it down when
you issued the SUBMIT command to place it on the QUEUE.

$ SHOW ENTRY    Show all entries that you own in any queue.

Figure out which one is yours.  Then do:

$ DELETE/ENTRY=entry_number


10. BOOTPROCEDURE OpenVMS:
==========================

Generic description of the bootprocedure:
-----------------------------------------

Together, the booting and startup processes comprise the following steps: 

BOOT command 
  |
loads primary bootstrap
On VAX,  this is VMB.EXE
On Alpha this is APB.EXE
  |
loads secondary bootstrap: SYS$SYSTEM:SYSBOOT.EXE
  |
SYSBOOT.EXE loads parameters from default parameter file
  |
loads Executive
  |
loads SWAPPER
  |
loads SYSINIT
  |
starts STARTUP process and executes STARTUP.COM
  |
This will also execute SYSTARTUP_VMS.COM
  

You enter the BOOT command. The boot block, a fixed location on disk, 
points to the primary bootstrap image, which is loaded from disk into main memory.
 
- On VAX systems, the primary bootstrap image is VMB.EXE. 
- On Alpha systems, the primary bootstrap image is APB.EXE. 

The primary bootstrap image allows access to the system disk by finding 
the secondary bootstrap image, 

SYS$SYSTEM:SYSBOOT.EXE, and loading it into memory. 

SYSBOOT.EXE loads the system parameters stored in the default parameter file into memory. 

If you are performing a conversational boot, the procedure stops and displays 
the SYSBOOT> prompt. 

Otherwise, SYSBOOT.EXE loads the operating system executive into memory 
and transfers control to the executive. 
When the executive finishes, it executes the SWAPPER process. 
The SWAPPER creates the SYSINIT process. 
Among other actions it performs, SYSINIT creates the STARTUP process. 
STARTUP executes SYS$SYSTEM:STARTUP.COM (unless you indicated another file using 
SYSMAN, SYSGEN, or conversational boot). 

STARTUP.COM executes a series of other startup command procedures, including SYSTARTUP_VMS.COM. 
The current values of system parameters are written back to the default parameter file. 
The boot process finishes, and you can log in to the operating system. 

To start an Oracle database automatically when the VMS system is rebooted,
place the following commands in a command procedure, e.g. DUA0:[ORACLE7]START_SALES.COM. 

$ @ORA_ROOT:[DB_SALES]ORAUSER_SALES
$ INSORACLE
$ @ORA_ROOT:[DB_SALES]STARTUP_EXCLUSIVE_SALES


Then edit the system startup file SYS$MANAGER:SYSTARTUP_VMS.COM and add 
the following command at the end of the file: 
$ SUBMIT/USER=ORACLE7 DUA0:[ORACLE7]START_SALES


11. ORACLE STUFF ON VMS:
========================

Paragraphs 11.1 & 11.2 deals with the support of
Oracle Releases on VAX OpenVMS and Alpha OpenVMS.

In short, these are the conclusions:

VAX/OpenVMS: max Oracle Server 7.3.x
Alpha/OpenVMS: Oracle 7,8,8i,9i


11.1 Supported releases on HP VAX VMS:
===============================================================================

Note:52574.1 Subject: HP VAX OpenVMS Certification Matrix Type: FAQ Status: PUBLISHED


ORACLE Server 
------------- 
HP VAX OPENVMS 
-------------- 
CERTIFICATION MATRIX 
-------------------- 
 
This article lists the historic certification matrix for HP VAX OpenVMS. 
 
		Version support - Certification Matrix 
		====================================== 
 
 
As of 1st January 2001, the Oracle Server is no longer fully supported on  
HP VAX OpenVMS. 
 
The upgrade path is to any other supported platform (ie HP ALPHA OpenVMS) 
via full export/import 
 
Oracle Server release 7.3.2.3.1 is the terminal release on VAX. 
It will remain under Extended Assistance Support until 1st January 2004 
(ie for a period of 3 years) 
 

Support Matrix for HP VAX OpenVMS versions and Oracle Server releases. 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
 
Last Update: 30-AUG-2002 
Updated by:  Grant Hayden 
 
            VAX/OPENVMS 
-------------------------------------------------------------------------- 
   5.5          6.0          6.1          6.2         7.0         7.1 
-------------------------------------------------------------------------- 
 6.0.37.6   | 6.0.37.6   | 6.0.37.6   |           |           | 
 7.0.13.1   | 7.0.13.1   | 7.0.13.1   |           |           | 
 7.0.15.4   | 7.0.15.4   | 7.0.15.4   |           |           | 
 7.0.16.6.0 | 7.0.16.6.0 | 7.0.16.6.0 |           |           | 
 7.0.16.6.2 | 7.0.16.6.2 | 7.0.16.6.2 |           |           | 
 7.1.3.2    | 7.1.3.2    | 7.1.3.2    | 7.1.3.2   |           | 
 7.1.3.4    | 7.1.3.4    | 7.1.3.4    | 7.1.3.4   |           | 
 7.1.5.2.4  | 7.1.5.2.4  | 7.1.5.2.4  | 7.1.5.2.4 |           | 
            |            |            |           | 7.3.2.3.1 | 7.3.2.3.1 
========================================================================== 
(For ALPHA OpenVMS Certification Matrix see [NOTE:62150.1] 
<ml2_documents.showDocument?p_id=62150.1&p_database_id=NOT>) 
 
NOTES: 
 
A.	Oracle versions prior to 7.3 are *NOT* supported on OpenVMS 7.x 
 
B.	Oracle 7.1.5.2.4 will be the LAST release of Oracle which supports 
	V5.5 of VAX/VMS due to the move from VAX 'C' to DEC 'C'. 
 
C.	Oracle 8 will not be shipped to HP VAX hardware. Hence Oracle  
	Server release 7.3 will be the terminal release of Oracle on the  
	VAX port. 
 
D.	Oracle 7.3.2.3.1 is the terminal Oracle release on VAX OpenVMS. 
	It will remain fully supported until 01-JAN-2001. 


11.2 Supported releases on Alpha Open VMS:
===============================================================================

Note:62150.1 
Subject:  HP Alpha OpenVMS Certification Matrix 
Type:  FAQ 
Status:  PUBLISHED 


ORACLE Server 
------------- 
HP ALPHA OPENVMS 
---------------- 
CERTIFICATION MATRIX 
-------------------- 
 
This article lists the current certification matrix for HP ALPHA OpenVMS. 
 
In Metalink, this note is best viewed using the 'default font'. To change 
to the 'default font', click on the 'fixed font' text at the top of the 
screen. 
 
                Version support - Certification Matrix 
                ====================================== 
 
 
Support Matrix for HP ALPHA OpenVMS versions and Oracle Server releases 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
 
Last Update: 03-JAN-2003 
Updated by:  Grant Hayden 
 
ALPHA/OPENVMS  
--------------------------------------------------------------------------- 
              |              |              |              |              | 
     6.2      |     7.0      |     7.1      |     7.2      |     7.3      | 
              |              |              |              |** see note J | 
--------------------------------------------------------------------------- 
 7.1.5.2.3    |              |              |              |              | 
              | 7.3.2.2.0    |              |              |              | 
              | 7.3.2.3.0    | 7.3.2.3.0    |              |              | 
              |              | 7.3.2.3.2    |              |              | 
              |              | 7.3.3.4      |              |              | 
              |              | 7.3.3.6      | 7.3.3.6      |              | 
              |              | 7.3.4.3      | 7.3.4.3      |              | 
              |              | 7.3.4.4      | 7.3.4.4      |              | 
--------------------------------------------------------------------------- 
              |              | 8.0.3.2.0    |              |              | 
              |              | 8.0.5.0.0    | 8.0.5.0.0    |              | 
              |              | 8.0.5.0.1    | 8.0.5.0.1    |              | 
              |              | 8.0.5.1.0    | 8.0.5.1.0    |              | 
              |              |              | 8.1.6.0.0    |              | 
============================================ ** see note E ================ 
              |              |              | 8.1.7.0.0    | 8.1.7.0.0    | 
              |              |              |       ** see note E         | 
              |              |              | 8.1.7.1b     | 8.1.7.1b     | 
              |              |              | 8.1.7.3.0    | 8.1.7.3.0    | 
              |              |              | 8.1.7.4.0    | 8.1.7.4.0    | 
              |              |              |       ** see note D         | 
              |              |              | 9.0.1.0.0    |              | 
              |              |              |** see note C |              | 
              |              |              | 9.0.1.3.0    | 9.0.1.3.0    | 
              |              |              |** see note B |              | 
              |              |              |              | 9.2.0.2.0    | 
              |              |              |              |** see Note A | 
=========================================================================== 
(For VAX OpenVMS Certification matrix - see 
[NOTE:52574.1] <ml2_documents.showDocument?p_id=52574.1&p_database_id=NOT>) 
 
A.      Oracle 9 Server Release 9.2.0.2 was announced on 21-DEC-02 
 
        Additional information on this release is available under 
        [NOTE:222553.1] <ml2_documents.showDocument?p_id=222553.1&p_database_id=NOT> 
FAQ for Oracle RDBMS Release 9.2.0.2.0 for Alpha OpenVMS 
 
B.      Patch set 9.0.1.3.0 is now available from the Metalink patch 
        download area. 
 
        Use patch number 2271678 and platform 'HP Alpha OpenVMS' to  
        locate the download. 
 
        The patch set, like all recent one-off patches, is supplied as 
        a zip file. 
 
        Please FTP the downloaded file in BINARY mode to your VMS system. 
 
        The zip file should not be expanded locally on your PC 
        as the subsequent FTP transfer to VMS of the expanded file 
        set will corrupt some of the supplied files and hence make 
        it impossible to apply the patch set correctly. 
 
        If you have not got the UNZIP utility on VMS, it can be obtained 
        from one of the following locations :- 
 
   ftp://oracle-ftp.oracle.com/server/patchsets/midrange/alpha/zip/ 
 
   http://www.info-zip.org/pub/infozip/Zip.html#VMS 
 
   http://www.openvms.compaq.com/  Then search for UNZIP 
 
        *IMPORTANT NOTE* - Please review the PATCH_NOTE.HTM file 
        provided with this patch set prior to installation. 
 
        9i RAC is only certified against OpenVMS 7.2-1H1 and above and 
        patch 2267002 is required for this certification. This patch is 
        available from the Metalink download area. Use patch number 2267002  
        and platform 'HP Alpha OpenVMS' to locate the download. 
         
C.      Oracle 9 Server Release 9.0.1.0.0 is now orderable on OpenVMS. 
 
        The Server CD kit is shipping under part number A91377-01 
 
        This release includes RAC (Real Application Clusters) which is  
        the replacement for the Oracle Parallel Server product under  
        Oracle 8 and earlier releases. 
 
        Note that this release is only certified against OpenVMS 7.2-1  
        and 7.2-1H1. 
 
        Please apply the 9.0.1.3.0 patch set for certification against  
        OpenVMS 7.2-2 and 7.3. (See note B) 
 
D.      Patch set 8.1.7.4.0 is now available from the Metalink patch  
        download area. 
 
        Use patch number 2376472, platform 'HP Alpha OpenVMS' to  
        locate the download. 
 
        The patch set, like all recent one-off patches, is supplied as 
        a zip file. 
 
        Please FTP the downloaded file in BINARY mode to your VMS system. 
 
        The zip file should not be expanded locally on your PC as the  
        subsequent FTP transfer to VMS of the expanded file set will corrupt  
        some of the supplied files and hence make it impossible to apply the  
        patch set correctly. 
 
        If you have not got the UNZIP utility on VMS, it can be obtained 
        from one of the following locations :- 
 
   ftp://oracle-ftp.oracle.com/server/patchsets/midrange/alpha/zip/ 
 
   http://www.info-zip.org/pub/infozip/Zip.html#VMS 
 
   http://www.openvms.compaq.com/  Then search for UNZIP 
 
        *IMPORTANT NOTE* - Please review the PATCH_NOTE.HTM file 
        provided with this patch set prior to installation. 
 
        For information: 
 
        Patch set 8.1.7.3 is still available from the Metalink patch  
        download area. Use patch number 2189751, platform 'HP Alpha OpenVMS'  
        to locate the 8173 download. 
 
        Patch set 8.1.7.1(b)is still available from the Metalink patch  
        download area. Use patch number 1746764, platform 'HP Alpha OpenVMS'  
        to locate the 8171b download. 
 
E.      Oracle 8 Server Release 8.1.7.0.0 is now orderable. 
 
        Note that a minimum of OpenVMS 7.2-1 is required for this 
        release. 
 
        Current part number  A87888-02 includes TG4RDB 
        Previous part number A87888-01 does not include TG4RDB 
 
        This is the terminal Oracle 8i release. 
 
F.      Oracle Server releases prior to Oracle Server release 8.1.7 
        are no longer fully supported. 
 
        Oracle Server releases 7.3.4, 8.0.5 and 8.1.6 are  
        currently under the Extended Assistance Support (EAS) program. 
        See [NOTE:66697.1] <ml2_documents.showDocument?p_id=66697.1&p_database_id=NOT> for a definition of this program. 
 
        EAS ends on 31-DEC-2003 for Oracle Release 7.3.4 ([NOTE:66409.1] <ml2_documents.showDocument?p_id=66409.1&p_database_id=NOT>) 
        EAS ends on 30-JUN-2003 for Oracle Release 8.0.5 ([NOTE:72533.1] <ml2_documents.showDocument?p_id=72533.1&p_database_id=NOT>) 
        EAS ends on 31-OCT-2004 for Oracle Release 8.1.6 ([NOTE:123178.1] <ml2_documents.showDocument?p_id=123178.1&p_database_id=NOT>) 
          
        Oracle Server releases prior to 8.1.7.0 are not certified to run 
        against OpenVMS 7.3 
 
G.      Patch sets and one-off patches are available for download from 
        Metalink. Some historic patches are also available from the 
        following URL. 
 
   ftp://oracle-ftp.oracle.com/server/patchsets/midrange/alpha/ 
 
        Please note that patch sets are cumulative and can be applied, 
        unless otherwise stated in the patch set documentation, directly 
        against the Oracle base Release or any intervening patch set 
        version. 
 
H.      Please note Oracle Server Release 7.3.3.4 and beyond only supports 
        Developer 2000 version 1.6.1. 
 
        Developer 2000 version 1.3.2 which is available with server  
        release 7.3.2.3.2 can be used against Oracle 7.3.3.x but only 
        when installed under a separate code tree.(ie a different ORA_ROOT). 
 
        Similarly, Developer 2000 1.6.1 can be used against Oracle 7.3.4.x 
        and Oracle 8 but only when installed under a separate code tree. 
 
I.      ALPHA OpenVMS Desupport notice for EV5 or earlier systems 
 
        Please review [NOTE:181307.1] for information on what the minimum  
        hardware requirements will be for running Oracle9i Release 2 
 
J.      Oracle releases 8.1.7 and 9.0.1 are now certified against OpenVMS 7.3-1 
 
 
        **Warning** 
 
        VMS 7.3 Extended File Cache (XFC) may cause data corruption. 
 
        There is a bug in the VMS 7.3 Extended File Cache (XFC) software  
        that may cause data corruption. XFC is controlled by the sysgen  
        parameter vcc_flags. 
 
        By default, XFC is enabled in VMS 7.3. The workaround is to set 
        vcc_flags to 1. 
 
        The OpenVMS ECO VMS73_XFC-V0200 (or later) should be applied to  
        resolve this issue. 
 
        Contact HP for more information. 
 
 
11.3 Notes on Oracle Server 7 on OpenVMS:
===============================================================================

- LOGICALS:

Instead of UNIX or Windows 'ORACLE_SID' environment variable, VMS uses a logical name
and the equivalent of the ORACLE_SID is

ORA_SID

- ORA_ROOT:

When Oracle is installed a root directory is chosen which is pointed to by the logical name ORA_ROOT. 
This directory can be placed anywhere on the VMS system. The majority of code, 
configuration files and command procedures are found below this root directory. 

When a new database is created a new directory is created in the root directory 
to store database specific configuration files. This directory is called [.DB_dbname]. 
This directory will normally hold the system tablespace data file 
as well as the database specific startup, shutdown and orauser files. 

- SYSTEM TABLESPACE:

The SYSTEM tablespace will be installed in the
ORA_ROOT:[DB_<dbname>] directory.

- USERS ENVIRONMENT:

The Oracle environment for a VMS user is set up by running the appropriate ORAUSER_dbname.COM file. 
This sets up the necessary command symbols and logical names to access the various ORACLE utilities. 
Each database created on a VMS system will have an ORAUSER file in it's home directory 
and will be named ORAUSER_dbname.COM, e.g. for a database SALES the file specification could be: 

ORA_ROOT:[DB_SALES]ORAUSER_SALES.COM

To have the environment set up automatically on login, run this command file in your login.com file. 
Now a user have easy access to for example SQLPLUS using the following command:

$ SQLPLUS username/password

- END A USER SESSION:

You can forcefully end a user session in Oracle in one of two ways:

ALTER SYSTEM KILL SESSION from within an Oracle tool
or
$STOP/ID=<process_id>

- STARTING AND STOPPING A DATABASE:

There are several methods available for database startup and shutdown. 
ORACLEINS (the Oracle install program) and SQLDBA both have menu driven methods 
to start or stop a database. 

Alternatively use command files. The following commands will start a database called SALES 
(the command INSORACLE will install various shared images which improve Oracle performance): 

$ @ORA_ROOT:[DB_SALES]ORAUSER_SALES
$ INSORACLE
$ @ORA_ROOT:[DB_SALES]STARTUP_EXCLUSIVE_SALES

To start this database automatically when the VMS system is rebooted place these commands 
in a command procedure, e.g. DUA0:[ORACLE7]START_SALES.COM. 

Then edit the system startup file SYS$MANAGER:SYSTARTUP_VMS.COM and add the following command at the end of the file: 
$ SUBMIT/USER=ORACLE7 DUA0:[ORACLE7]START_SALES

This will start a batch job running under the Oracle7 user account which will start up 
the database instance SALES. 

A database can be shut down by running the command procedure SHUTDOWN_dbname.COM 
which is found in the database's home directory. 


11.4 Global overview installation Oracle 9.2.x on Alpha OpenVMS:
===============================================================================

1. Check memory first:

$ SHOW MEMORY
$ SHOW MEMORY/RESERVED

2. Check the following:

Do you have:
3 GB free diskspace
HP OpenVMS 7.3
TCPIP UCX
X-windows, needed for running the OUI

Check OS version with:
$ SHOW SYSTEM/NOPROCESS/FULL

Check X-Windows with for example
$ RUN SYS$SYSTEM:DECW$CLOCK

3. Check the filesystem:

The disk containing the Oracle code tree must use ODS-2 (data) or ODS-5 (software).
The logicals ORA_ROOT, ORAROOT_DIR, ORACLE_HOME will point to locations
on this disk.

Check with:
$ SHOW DEVICE/FULL <device_name>

Change structure of disk example:
$ SET VOLUME/STRUCTURE_LEVEL=5 $2$DCK100:

Format disk example:
$ INITIALIZE/STRUCTURE=5 $2$DCK100: TESTVOL

4. Create the Oracle OpenVMS account:

$ SET DEFAULT SYS$SYSTEM
$ RUN AUTHORIZE

UAF>ADD Oracle9 /PASSWORD=ORACLE/UIC=[277,100] -
/DEVICE=<device>/DIRECTORY=[Oracle9i]/OWNER="ORACLE DBA"

5. Privileges:

A number of privileges needs to be granted to Oracle9

UAF>MODIFY Oracle9 -
/PRIVILEGE=(.,.,.,..)  -- see manual


Install Oracle 9.2.0.2 on OpenVMS:
=====================================


Simple example of using the OUI to install Oracle9i Release 2 on an OpenVMS System:
===================================================================================


We have a PC running Xcursion and a 16 Processor GS1280 with the 2 built-in disks
In the examples we booted on disk DKA0:
Oracle account is on disk DKA100. Oracle and the database will be installed on DKA100.
Install disk MUST be ODS-5.

Installation uses the 9.2 downloaded from the Oracle website. It comes in a Java JAR file.
Oracle ships a JRE with its product. However, you will have to install Java on OpenVMS so you can unpack 
the 9.2 JAR file that comes from the Oracle website
Unpack the JAR file as described on the Oracle website. This will create two .BCK files.

Follow the instructions in the VMS_9202_README.txt file on how to restore the 2 backup save sets.

When the two backup save sets files are restored, you should end up with two directories:

[disk1] directory 
[disk2] directory

These directories will be in the root of a disk. In this example they are in the root of DKA100.
The OUI requires X-Windows. If the Alpha system you are using does not have a graphic head, 
use a PC with an X-Windows terminal such as Xcursion.

During this install we discovered a problem:
Instructions tell you to run 

@DKA100:[disk1]runinstaller.

This will not work because the RUNINSTALLER.COM file is not in the root of DKA100:[disk1]. 
You must first copy RUNINSTALLER.COM from the dka100:[disk1.000000] directory into dka100:[disk1]:

$ Copy dka100:[disk1.000000]runinstaller.com dka100:[disk1]

From a terminal window execute:

@DKA100:[disk1]runinstaller

- Oracle Installer starts
  Start the installation
  Click Next to start the installation.

- Assign name and directory structure for the Oracle Home ORACLE_HOME

  Assign a name for your Oracle home.
  Assign the directory structure for the home, for example

  Ora_home
  Dka100:[oracle.oracle9]

  This is where the OUI will install Oracle.
  The OUI will create the directories as necessary

- Select product to install
  Select Database.
  Click Next.
- Select type of installation
  Select Enterprise Edition (or Standard Edition or Custom).
  Click Next.
- Enable RAC
  Select No.
  Click Next.
- Database summary
  View list of products that will be installed.
  Click Install.
- Installation begins
  Installation takes from 45 minutes to an hour.
  Installation ends
  Click Exit.

Oracle is now installed in DKA100:[oracle.oracle9]. 
To create the first database, you must first set up Oracle logicals. 
To do this use a terminal and execute 

@[.oracle9]orauser .

The tool to create and manage databases is DBCA.
On the terminal, type DBCA to launch the Database Assistant.
Welcome to Database Configuration Assistant
DBCA starts.
Click Next.
Select an operation
Select Create a Database.
Click Next.
Select a template
Select New Database.
Click Next.
Enter database name and SID
Enter the name of the database and Oracle System Identifier (SID):
In this example, the database name is DB9I.
The SID is DB9I1.
Click Next.
Select database features
Select which demo databases are installed.
In the example, we selected all possible databases.
Click Next.
Select default node
Select the node in which you want your database to operate by default.
In the example, we selected Shared Server Mode.
Click Next.
Select memory
In the example, we selected the default.
Click Next.
Specify database storage parameters
Select the device and directory.
Use the UNIX device syntax I.E.
For example, DKA100:[oracle.oracle9.database] would be:

	/DKA100/oracle/oracle9/database/

In the example, we kept the default settings.
Click Next.

Select database creation options
Creating a template saves time when creating a database.
Click Finish.
Create a template
Click OK.
Creating and starting Oracle Instance
The database builds.
If it completes successfully, click Exit.
If it does not complete successfully, build it again.
Running the database
Enter �show system� to see the Oracle database up and running.
Set up some files to start and stop the database.
Example of a start file
This command sets the logicals to manage the database:

$ @dka100:[oracle.oracle9]orauser db9i1

The next line starts the Listener (needed for client connects).
The final lines start the database.
Stop database example
Example of how to stop the database.
Test database server
Use the Enterprise Manager console to test the database server.
Oracle Enterprise Manager
Enter address of server and SID.
Name the server.
Click OK.
Databases connect information
Select database.
Enter system account and password.
Change connection box to �AS SYSDBA.�
Click OK.
Open database
Database is opened and exposed.
Listener
Listener automatically picks up the SID from the database.
Start Listener before database and the SID will display in the Listener.
If you start the database before the Listener, the SID may not appear immediately.
To see if the SID is registered in the Listener, enter:

$lsnrctl stat

Alter a user
User is altered:

SQL> alter user oe identified by oe account unlock;
SQL> exit

Preferred method is to use the Enterprise Manager Console.


12. OpenVMS File systems and Diskstructures:
============================================

On-Disk Structure (ODS) refers to a logical structure given to information stored on a disk or CD-ROM. 
It is a hierarchical organization of files, their data, and the directories needed to gain access to them. 
The OpenVMS file system implements the On-Disk Structure and provides access control to the files 
located on the disk. 

 
OpenVMS File Structure Options 

On-Disk Structures include Levels 1, 2, and 5. (Levels 3 and 4 are internal names for ISO 9660 
and High Sierra CD formats.) ODS-1 and ODS-2 structures have been available on OpenVMS systems for some time. 
With OpenVMS Version 7.2 on Alpha systems, you can now specify ODS-5 to format disks as well. 

ODS-1  Both  VAX only; use for RSX compatibility: RSX--11M, RSX--11D, RSX--11M--PLUS, 
             and IAS operating systems.  
ODS-2  Both  Use to share data between VAX and Alpha with full compatibility; default disk structure of the OpenVMS 
             operating system.  
ODS-5  Both  Superset of ODS-2; use on Alpha systems when working with systems like NT that need expanded character 
             sets or deeper directories than ODS-2.  


#############################################################################################
#############################################################################################
#############################################################################################


========================================================
Section 12: NT/200x/XP CMD shell script examples:
========================================================


#############################################################################
#############################################################################
Part 1: Traditional old cmd/dos batch command examples DOS/Win9x/NT/200x/XP/Vista
#############################################################################
#############################################################################


1. Put day, month, year into variables:
=======================================

@echo off
for /f "tokens=2-4 delims=/ " %%a in ('date /t') do (
set mm=%%a
set dd=%%b
set yyyy=%%c)

REM to show these variables

echo %mm%
echo %dd%
echo %yyyy%

Or put in a logfile:

echo ============== >> c:\temp\report.log
echo START RUNTIME: >> c:\temp\report.log
echo ============== >> c:\temp\report.log

date /T >> c:\temp\report.log
time /T >> c:\temp\report.log


2. Some copy and xcopy command examples:
========================================

-- If you want to xcopy files from a certain date:

xcopy *.* /D:01-13-2002 f:\backup

xcopy *.* /D:%datum%    f:\backup

-- Some examples of copy commands using variables:

copy %NTResKit%\perfmib.dll %systemroot%\system32\perfmib.dll
copy %NTResKit%\perfmib.ini %systemroot%\system32\perfmib.ini

If you want to use xcopy for backup purposes in Win2Kx / Vista / XP, please see Part 6.


3. The use of "FOR" example:
============================


Example: print all .txt files in 1 command
------------------------------------------

for %f in (*.doc *.txt) do type %f > prn

in a batchfile, just use: %%f

Example: register some dll's in 1 command
------------------------------------------

for %f in (*.dll) do regsrv32 %f

Example: copy tekst into a file a number of times
-------------------------------------------------

for /L %%f in (1,1,1000) do echo Albert >> c:\test\test.txt

(1,1,1000) means (start,step,end)


Example:
--------
Or look at this example:
FOR /L %variable IN (start,step,end) DO command [command-parameters]

To see this in action, at a command prompt, type 

FOR /L %i in (1,1,5) do @echo %i 

and you should see:

1
2
3
4
5

Example: sort of unix cut functionality with the use of for:
------------------------------------------------------------

suppose you have the following file "myfile.txt":

a,b,c
d,e,f
g,h,i

FOR /F "tokens=2,3* delims=, " %i in (myfile.txt) do @echo %i %j >> myfile2.txt

will create the following file "myfile2.txt":

b,c
e,f
h,i

Example:

@ECHO OFF
  IF (%1)==() FOR %%v in (GOTO:END ECHO.(%%1):(%1)) do %%v
  ECHO Got a value
  :END
  ECHO The end


4. "If.. Then..Else" test and the use of Labels:
================================================

Example 1:
----------

@echo off
setlocal

if (%2)==() goto usage
sqlplus %1/%2 @%ORACLE_HOME%\sqlplus\demo\demobld.sql
goto exit

:usage
echo Usage: demobld userid passwd

:exit
endlocal


Example 2:
----------

@echo off
set test=q
if %test%==%1 goto lab2

:lab1
echo not_equal
goto end

:lab2
echo equal
goto end

:end


if exist c:\temp goto lab2

:lab1
echo bestaat niet
goto end

:lab2
echo bestaat wel
goto end

:end

Example 3:
----------

Some loose statements:

if '%1' == '' goto ERR0 

if not exist %SYSTEMROOT%\SYSTEM32\SQRDB3.DLL goto ERR1

if errorlevel 1 set DRV=C:
if errorlevel 2 set DRV=D:

if %errorlevel% EQU 0 goto GO12
if %errorlevel% GTR 0 goto ERR6

find "not exist" c:\temp\report.log > nul
if %errorlevel% EQU 0 goto ERRNAME

if "%OS%" == "Windows_NT" goto NT_BIN

if exist _runscr.log del _runscr.log > nul


Example 4:
----------

if "%OS%" == "Windows_NT" goto NT_OS
CALL other.bat
EXIT
:NT_OS
CALL ntlogon.bat
EXIT


Example 5:
==========

IF NOT EXIST TypeFinder\BUILDALL.BAT GOTO TYPEFINDEREND
  CD TypeFinder
  CALL BUILDALL.BAT %1
  CD ..
:TYPEFINDEREND

IF NOT EXIST Wintalk\BUILDALL.BAT GOTO WINTALKEND
  CD Wintalk
  CALL BUILDALL.BAT %1
  CD ..
:WINTALKEND

IF NOT EXIST WordCount\BUILDALL.BAT GOTO WORDCOUNTEND
  CD WordCount
  CALL BUILDALL.BAT %1
  CD ..
:WORDCOUNTEND


Example 6:
==========

@echo off
csc /t:module CountDownSecondsLabel.cs /r:System.dll /r:System.Windows.Forms.dll /r:System.Drawing.dll
rem if C++ is specified, create C++ DLL, otherwise create C# DLL
if "C++"=="%1" goto CPP
if "c++"=="%1" goto CPP
if "%1"=="" goto CS
goto ERROR

:CS
csc /t:module CountDownErrorLabel.cs /r:System.dll /r:System.Windows.Forms.dll /r:System.Drawing.dll
goto Continue

:CPP
cl /clr /LD CountDownErrorLabel.cpp /link /OUT:CountDownErrorLabel.netmodule

:Continue
ilasm Counter.il /dll
ilasm CountDownComponents.il /dll
ilasm CountDown.il
goto END

:ERROR
echo Invalid command line argument '%1'
echo.

:END


5. The use of "Choice":
=======================

Choice is an external "cmd" or "DOS box" executable you for example can find in
MS Resource kits of Win9x, NT, 2000.

Use it as in the following example:

echo Please enter the drive letter ( c/d/e/f/g )
choice /c:cdefg

if errorlevel 1 set DRV=C:
if errorlevel 2 set DRV=D:
etc..

echo Is this correct (y/n) ?
choice /c:yn /n > nul
if errorlevel 2 goto


6. Pipelining examples:
=======================

SET | FIND "windir" | IF errorlevel=1 ECHO Windows not running

In order to see if Oracle services are running on this machine:

net start | FIND "Ora"


7. Creating sub-routines in CMD files without creating new files:
=================================================================

With NT/2000 CMD files it's possible to call sub-routines without creating a new CMD file. 
This gives you, the programmer/scripter, the possibility to keep your scripts 
in one file and maintain an overview of scripts in use.

How does it work then? Well, for those who know the DOS BATCH files (.bat),
will remember the LABELS and GOTO commands. 
Within NT, Microsoft made an addition to this functionality so that you 
can go to a label, and at the end of youre sub-routine, it will jump back
to the point where you have called the label.

Just look at the following example:

@echo off
ECHO Start of part 1
CALL :part2
ECHO End of part 1
goto end

:part2
ECHO Start of part 2
ECHO (Some things you want to do)
ECHO End of part 2
goto :EOF

:end
ECHO Finished script

The EOF is a hidden label which jumps to the end of the "subroutine", and so 
returns to its previous caller.


8. Oracle backup scripts partial code:
======================================


Example 1: archivelog backups
-----------------------------

@echo off
for /f "tokens=2-4 delims=/ " %%a in ('date /t') do (
set mm=%%a
set dd=%%b
set yyyy=%%c)

REM month/day/year mm/dd/yyyy

echo %mm%
echo %dd%
echo %yyyy%

set /A lastday=%dd%-1
echo %newday%

set copydate=%mm%/%lastday%/%yyyy%
echo %copydate%

g:
cd\archives
xcopy *.* /D:%copydate% f:\backup


Example 2: maintenance exportfiles
----------------------------------

move /Y d:\backups\pegacc\2dayago\*.Z d:\backups\pegacc\3dayago
move /Y d:\backups\pegacc\1dayago\*.Z d:\backups\pegacc\2dayago
move /Y d:\backups\pegacc\*.Z d:\backups\pegacc\1dayago

move /Y d:\backups\pegtst\2dayago\*.Z d:\backups\pegtst\3dayago
move /Y d:\backups\pegtst\1dayago\*.Z d:\backups\pegtst\2dayago
move /Y d:\backups\pegtst\*.Z d:\backups\pegtst\1dayago


9. Append date and time to filename:
====================================

Q. How can I append the date and time to a file?

A. You can use the batch file below which will rename a file to filename_YYYYMMDDHHMM.

@Echo OFF
TITLE DateName
REM DateName.CMD
REM takes a filename as %1 and renames as %1_YYMMDDHHMM
REM
REM -------------------------------------------------------------
IF %1.==. GoTo USAGE
Set CURRDATE=%TEMP%\CURRDATE.TMP
Set CURRTIME=%TEMP%\CURRTIME.TMP

DATE /T > %CURRDATE%
TIME /T > %CURRTIME%

Set PARSEARG="eol=; tokens=1,2,3,4* delims=/, "
For /F %PARSEARG% %%i in (%CURRDATE%) Do SET YYYYMMDD=%%l%%k%%j

Set PARSEARG="eol=; tokens=1,2,3* delims=:, "
For /F %PARSEARG% %%i in (%CURRTIME%) Do Set HHMM=%%i%%j%%k

Echo RENAME %1 %1_%YYYYMMDD%%HHMM%
RENAME %1 %1_%YYYYMMDD%%HHMM%
GoTo END

:USAGE
Echo Usage: DateName filename
Echo Renames filename to filename_YYYYMMDDHHMM
GoTo END

:END
REM
TITLE Command Prompt

Example:

D:\Exchange> datetype logfile.log
RENAME logfile.log logfile.log_199809281630


10. Output of a program into an environment variable:
===================================================== 

Q. How can I force the output of a program into an environment variable?

A. Some programs return values to the command line and it may be you want these 
   in a variable so they can be viewed/queried by other processes.

The easiest way to put the result into an environment variable is to trap 
it in a FOR statement.

For /f "Tokens=*" %i in ('command') do set variable="%i"

For example:

C:\>For /f "Tokens=*" %i in ('ver') do set NTVersion="%i"

C:\>set NTVersion="Windows NT Version 4.0 "

C:\>echo %NTVersion%
"Windows NT Version 4.0 "

If you place the command in a batch file you require two % in front of i, e.g.

For /f "Tokens=*" %%i in ('ver') do set NTVersion="%%i"

 
11. Get m columns from n in a text file:
========================================

Use the unix port freeware program cut.exe.

Suppose you have a file x.txt similar to

a b c d
e f g h
i j k l

etc..

Now you only want certain columns in a new file.

type x.txt | cut 1 3 > y.txt

y.txt:

a b
e f
i j 


12. SCHEDULING:
===============

Example 1:
----------

How to use the "at" command, please see the help given by:

C:\> at /?


>>> Example of the use of the at command:

at 23:00 /every:M,T,W,Th,F backup.cmd   

That commands schedules the backup.cmd script on your local Server, to be executed at 23:00h
at Monday, Tuesday, Wednesday, Thursday and Friday. 


>>> Other example

@echo off
rem
rem   NAME
rem      setat.cmd - NT command script
rem
at %1 /every:M,T,W,Th,F,S,Su %COMSPEC% /c "r:\ifa\bin\ifa.cmd"


See also Part 3, section 5


13. Delete all files without prompting:
=======================================

>> Best solution on NT, 2Kx, XP:
---------------------------------

Delete of files, silently, in subdirs, also readonly ones

(This is like a "rm -rf" on UNIX)

cd %1
del /F /Q /S *.*


>> Alternatives on all WinOS:
-----------------------------

One of the most Frequently Asked Questions (FAQs) about batches is
how to suppress the "Are you sure (Y/N)?" confirmation requirement

for del *.*.  Use the following:
 echo y| del *.*

If you wish to suppress the message too, use
 echo y| del *.* > nul

There is also another alternative for doing this. It has the
advantange of being MS-DOS language version independent.
 for %%f in (*.*) do del %%f

If the directory is empty you can avoid the "File not found" message
by applying
 if exist *.* echo y| del *.* > nul
A better, obvious alternative by Rik D'haveloose:
  if exist *.* for %%f in (*.*) do del %%f


14. Is there an easy way to append a new directory to the path?
===============================================================

This often needed trick is basically very simple. For example
to add directory %1 to path use
 path=%path%;%1

Note that you can only use this trick in a batch. It will not work
at the MS-DOS prompt because the environment variables are expanded
(%path%) only within batches. 

It also is typical to need a fuller path only for the duration of
executing some particular program, and to restore the original after
that:
  @echo off
  set path_=%path%
  path=%path_%;f:\ftools
  ::
  call whatever
  ::
  path=%path_%
  set path_=


15. Start an installation, tool etc..
=====================================

Example 1:
----------

@echo off
REM Oracle Migration Workbench startup script for Windows NT

set PATH=E:\Program Files\Oracle\jre\1.1.7\bin\;E:\oracle\ora81\bin;E:\oracle\ora81\Omwb\olite;%PATH%
SET JRE=jrew -nojit -mx128m
SET NT_START=start 

REM Starting Oracle Migration Workbench on Windows NT
%NT_START% %JRE% -classpath "E:\oracle\ora81\Omwb\olite\Oljdk11.jar;E:\oracle\ora81\Omwb\olite\Olite40.jar;E:\Program Files\Oracle\jre\1.1.7\lib\rt.jar;E:\Program Files\Oracle\jre\1.1.7\lib\i18n.jar;E:\oracle\ora81\Omwb\jlib;E:\oracle\ora81\Omwb\plugins\SQLServer6.jar;E:\oracle\ora81\Omwb\plugins\Sybase.jar;E:\oracle\ora81\Omwb\plugins\MSAccess.jar;E:\oracle\ora81\Omwb\plugins\SQLAnywhere.jar;E:\oracle\ora81\Omwb\plugins\SQLServer7.jar;E:\oracle\ora81\Omwb\jlib\omwb-1_3_0_0_0.jar;E:\oracle\ora81\jdbc\lib\classes111.zip;E:\oracle\ora81\lib\vbjorb.jar;E:\oracle\ora81\jlib\ewt-swingaccess-1_1_1.jar;E:\oracle\ora81\jlib\ewt-3_3_6.jar;E:\oracle\ora81\jlib\ewtcompat-opt-3_3_6.zip;E:\oracle\ora81\jlib\share-1_0_8.jar;E:\oracle\ora81\jlib\help-3_1_8.jar;E:\oracle\ora81\jlib\ice-4_06_6.jar;E:\oracle\ora81\jlib\kodiak-1_1_3.jar" -DORACLE_HOME=E:\oracle\ora81 oracle.mtg.migrationUI.MigrationApp oracle.mtg.migrationUI.MigrationApp

Example 2:
----------

set OSQLPATH="c:\Program Files\Microsoft SQL Server\80\Tools\Binn
set DBNAME=%2

%OSQLPATH%\osql.exe" -n -S%1 -d %DBNAME% -E -i%3.sql >> _runscr.log


16. Get rid of Carriage return ^M in files:
===========================================

How do I eliminate carriage returns (^M) in my files?

In unix its simple:
-------------------

If you transfer text files from a DOS machine to a UNIX machine, you might see a ^M 
before the end of each line. This character corresponds to a carriage return.

In DOS a newline is represented by the character sequence \r\n, where \r is the carriage return 
and \n is newline. In UNIX a newline is represented by \n. When text files created on a 
DOS system are viewed on UNIX, the \r is displayed as ^M.

You can strip these carriage returns out by using the tr command as follows:

tr -d '\r' < file > newfile
or on some unixes:

tr -d '\015' < file > newfile

Here file is the name of the file that contains the carriage returns, and newfile is the name you want to give 
the file after the carriage returns have been deleted.

Here you are using the octal representation \015 for carriage return, 
because the escape sequence \r will not be correctly interpreted by all versions of tr.

Or you can use sed in the following way:

move from unix to dos:
$ sed -e 's/$/\r/' myunix.txt > mydos.txt

move from dos to unix:
$ sed -e 's/.$//' mydos.txt > myunix.txt

So, install a unix shell on your PC, like Cygwin


But now in dos/nt/2000/xp:
--------------------------

(1) get for example Gygwin or other 'unix' emulator engine
for nt/2000/xp where you can run tr and sed like commands.

(2) with nt/2000/xp tools only:


17: start a file minimised window:
==================================

Example:

start /min notepad c:\core\cmdshell.txt


18: COMM ports in DOS (dos, NT, 2000, 2003, XP):
================================================

Examples to test a port:

1.
echo AT&F>com1

2.

C:\>mode com3

Status for device COM3:
-----------------------
    Baud:            115200
    Parity:          None
    Data Bits:       8
    Stop Bits:       1
    Timeout:         OFF
    XON/XOFF:        OFF
    CTS handshaking: OFF
    DSR handshaking: OFF
    DSR sensitivity: OFF
    DTR circuit:     ON
    RTS circuit:     OFF


Examples to assign a port:

voorbeelden
Als u COM12 wilt toewijzen aan COM1, zodat deze kan worden gebruikt door een MS-DOS-toepassing, typt u:

change port com12=com1

Met de volgende opdracht geeft u de huidige poorttoewijzingen weer:

change port /query


19. Special File and Volume commands in XP:
===========================================


fsutil:
-------


Fsutil is a command-line utility that you can use to perform many FAT and NTFS file system 
related tasks, such as managing reparse points, managing sparse files, dismounting a volume, 
or extending a volume. Because fsutil is quite powerful, it should only be used by advanced users 
who have a thorough knowledge of Windows XP. In addition, you must be logged on as an administrator 
or a member of the Administrators group in order to use fsutil.

Fsutil: dirty Queries
--------------------- 

Use this to see whether a volume's dirty bit is set, or use it to sets a volume's dirty bit. 
When a volume's dirty bit is set, autochk automatically checks the volume for errors the next time 
the computer is restarted.

Syntax
fsutil dirty {query|set} PathName

Parameters
-query 
Queries the dirty bit. 

-set 
Sets a volume's dirty bit. 

-PathName 
Specifies the drive letter (followed by a colon), mount point, or volume name. 

Examples

- To query the dirty bit on drive C, type:

  fsutil dirty query C:

  Sample output:

  Volume C: is dirty

  or

  Volume C: is not dirty

- To set the dirty bit on drive C, type:

  fsutil dirty set C:


Fsutil: volume
--------------

Us this to manage a volume. Dismounts a volume or queries to see how much free space is available on a disk.

Syntax

fsutil volume [diskfree] drivename

fsutil volume [dismount] VolumePathname

Parameters
-diskfree 
Queries the free space of a volume. 
-drivename 
Specifies the drive letter (followed by a colon). 
-dismount 
Dismounts a volume. 
-VolumePathname 
Specifies the drive letter (followed by a colon), mount point, or volume name. 

Examples

- To dismount a volume on drive C, type: 

  fsutil volume dismount C:

- To query the free space of a volume on drive C, type: 

  fsutil volume diskfree C:


20. Start a program like a DB sql prompt util and run a script:
===============================================================


example.cmd
-----------

cls
echo off
c:\oracle\ora92\bin\sqlplus /nolog @c:\logging\example.sql > z:\its\oc\databases\oracle_logging\example.log


So the .cmd file calls a program sqlplus which will run a .sql script, while the output
will be placed in a designated logfile.


example.sql
-----------

The .sql file might contain something like the following:

connect system/arcturus81@ECM_172.17.203.162    REM Logon to DB

alter system checkpoint                         REM true DB commands
/
SELECT * FROM v$sgastat 
WHERE name = 'free memory'
/
alter system flush shared_pool
/
SELECT * FROM v$sgastat 
WHERE name = 'free memory'
/


21. Remote terminal Services:
=============================

If your XP, or Server has the terminal services client, or RDP, installed, you can run it via

C:\>mstsc


A dialog box will show, where you can enter the name or IP of the target system.

C:\>mstsc /?

Will show all switches you can use.


22. Run a script, or program with elevated credentials:
=======================================================

In XP, Vista, Win2Kx you can, as an ordinary user, run a script, or program,
with elevated credentials, that is, using another account, using the "runas" utility.

- From the prompt, use runas:

Syntax
      RUNAS [/profile] [/env] [/netonly] /user:user Program

Key
   /profile   Option to load the user's profile (registry)
   /env       Use current environment instead of user's.
   /netonly   Use if the credentials specified are for RAS only.
   /user      Username in form USER@DOMAIN or DOMAIN\USER
              (USER@DOMAIN is not compatible with /netonly)
   Program    The command to execute

Examples:
   runas /profile /user:mymachine\administrator CMD
   runas /profile /env /user:SCOT_DOMAIN\administrator NOTEPAD
   runas /env /user:jDoe@swest.ss64.com "NOTEPAD \"my file.txt\""
Enter the password when prompted. 

- From the Windows explorer GUI
Select an executable file, Right-click and select Run As..
This option can be hidden by setting
HKLM\Software\Microsoft\Windows\CurrentVersion\Policies\Explorer 


Examples:

C:\> runas /user:Administrator@afa.com "mycommand.exe"

Where you run "mycommand.exe" as the Adminstrator from the Domain "afa".

So runas works quite like the unix sudo tool.


But the tool will ask for the password of the user listed in the command.

If you need encryption of command files, and many other options, checkout the great
"runasspc" tool.

Just google on runasspc to find more on this usefull tool.


23. Show running services:
==========================

C:\> net start

Shows all running services on your machine

C:\> net start | find "Part of Service name"

Shows all services with a name like "Part of Service name"

To show all services related to Oracle: 
C:\> net start | find "Ora"

Or use this to show services:

C:> cmd /C SC Query>C:\temp\services.txt 

Lists all your running services to the file services.txt


24. Some systemtools for Windows:
=================================

We are not going to differentiate between all possible Windows versions here (like XP,Vista, Win2K3 etc..)
but there might be a few additional tools that can be of interest.

Ofcourse, everybody knows regedit or regedt32, for viewing or editing the Registry.
And, likewise, everybody knows that the Resource Kits deliver you many additional tools for you platform.

Besides all that, most Windows versions also have:

-- sysedit.exe:

It shows you win.ini, system.ini, config.sys and autoexec.bat.
The configfiles win.ini and system.ini might still be important for older win applications.

-- systeminfo.exe:

Its shows you many hardware and system related information. It might also present you a nice list
of all the patches and hotfixes that were applied on your system.


#######################################################################
#######################################################################
Part 2: Profiles and Loginscripts for clients on Win2Kx Servers.
#######################################################################
#######################################################################


1. Logon scripts in 2000/ 2003:
===============================


Note 1:
-------

When a client logs on from a Domain member machine, such as a win 2000 professional
workstation, or an XP workstation, or a Vista machine, a logon script can be excuted for this user.

It can be a kixstart script, or a shell .cmd file, .vbs file, or something else.

The use of .cmd files is generally considered as "old fashion", and many people use
GPO and even .vbs files. But there is no serious reason not to use a plain old .cmd file.

Just put the logon script on the nearest Domain Controller in the following
location:

%systemroot%\sysvol\sysvol\<domain>\SCRIPTS

This schould be replicated to other Domain controllers in the tree.


>>>> Typical commands in a cmd batch loginscript could be: <<<<

-- General drive mappings for all users, like for example:

net use u: \\starboss\public
net use v: \\starboss\software

-- Per user settings, like for example:

if "%username%"=="John" <command>
if "%username%"=="Mark" <command>
etc..

Example:

IF /I "%USERNAME%" == "TESTUSR" goto Test

:test
Here are your commands under the lable test


Note 2:
-------

Extended note for loginscripts:

Creating logon scripts
You can use logon scripts to assign tasks that will be performed when a user logs on to a particular computer. 
The scripts can carry out operating system commands, set system environment variables, and call other scripts 
or executable programs. The Windows Server 2003 family supports two scripting environments: 
the command processor runs files containing batch language commands, and Windows Script Host (WSH) 
runs files containing Microsoft Visual Basic Scripting Edition (VBScript) or Jscript commands. 
You can use a text editor to create logon scripts. Some tasks commonly performed by logon scripts include:

-Mapping network drives.
-Installing and setting a user's default printer.
-Collecting computer system information.
-Updating virus signatures.
-Updating software.

The following example logon script contains VBScript commands that use Active Directory Service Interfaces (ADSI) 
to perform three common tasks based on a user's group membership:

It maps the H: drive to the home directory of the user by calling the WSH Network object's 
MapNetworkDrive method in combination with the WSH Network object's UserName property.

It uses the ADSI IADsADSystemInfo object to obtain the current user's distinguished name, 
which in turn is used to connect to the corresponding user object in Active Directory. 
Once the connection is established, the list of groups the user is a member of is retrieved 
by using the user's memberOf attribute. The multivalued list of group names is joined 
into a single string by using VBScript's Join function to make it easier to search for target group names.

If the current user is a member of one of the three groups defined at the top of the script, 
then the script maps the user's G: drive to the group shared drive, and sets the user's 
default printer to be the group printer.

To create an example logon script

Open Notepad or other ascii text editor.

Copy and paste, or type, the following:

Const ENGINEERING_GROUP     = "cn=engineering"
Const FINANCE_GROUP         = "cn=finance"
Const HUMAN_RESOURCES_GROUP = "cn=human resources"

Set wshNetwork = CreateObject("WScript.Network")
wshNetwork.MapNetworkDrive "h:",
"\\FileServer\Users\" & wshNetwork.UserName

Set ADSysInfo = CreateObject("ADSystemInfo")
Set CurrentUser = GetObject("LDAP://" &
ADSysInfo.UserName)
strGroups = LCase(Join(CurrentUser.MemberOf))

If InStr(strGroups, ENGINEERING_GROUP) Then

    wshNetwork.MapNetworkDrive "g:",
    "\\FileServer\Engineering\"
    wshNetwork.AddWindowsPrinterConnection
    "\\PrintServer\EngLaser"
    wshNetwork.AddWindowsPrinterConnection
    "\\PrintServer\Plotter"
    wshNetWork.SetDefaultPrinter
    "\\PrintServer\EngLaser"

ElseIf InStr(strGroups, FINANCE_GROUP) Then

    wshNetwork.MapNetworkDrive "g:",
    "\\FileServer\Finance\"
    wshNetwork.AddWindowsPrinterConnection
    "\\PrintServer\FinLaser"
    wshNetWork.SetDefaultPrinter
    "\\PrintServer\FinLaser"

ElseIf InStr(strGroups, HUMAN_RESOURCES_GROUP) Then

    wshNetwork.MapNetworkDrive "g:",
    "\\FileServer\Human Resources\"
    wshNetwork.AddWindowsPrinterConnection
    "\\PrintServer\HrLaser"
    wshNetWork.SetDefaultPrinter
    "\\PrintServer\HrLaser"

End If

On the File menu, click Save As.

In Save in, click the directory that corresponds to the domain controller's Netlogon shared folder 
(usually SystemRoot\SYSVOL\Sysvol\DomainName\Scripts where DomainName is the domain's fully qualified domain name).


Note 3:
-------

If you want to assign login scripts through a Group Policy Object, go to
Active Directory Users and Computers tool, and use GPO, and navigate to

"User Config>Windows Settings >Scripts (Logon/Logoff)"


Note 4:
-------

Simple login script, login.cmd, example for Vista or XP clients on Win2K3 Server:


net use u: \\sonne\public
net use v: \\sonne\data
net use w: \\sonne\software
net use t: \\sonne\buro
net use s: \\sonne\Backups_Netz_PCs

regedit /s pol11.reg
regedit /s pol12.reg

copy \\sonne\netlogon\crpol.bat c:\temp /Y
call c:\temp\crpol.bat

regedit /s c:\temp\crpol.reg

copy \\sonne\netlogon\message.vbs c:\temp /Y
REM copy \\sonne\netlogon\Paths.xcu c:\users\%username%\AppData\Roaming\OpenOffice.org2\user\registry\data\org\openoffice\office /Y > nul

if %username%==Absolutus copy \\sonne\netlogon\Absolutus.xcu c:\users\Absolutus\AppData\Roaming\OpenOffice.org2\user\registry\data\org\openoffice\office /Y > nul
if %username%==Alkoholix copy \\sonne\netlogon\Alkoholix.xcu c:\users\Alkoholix\AppData\Roaming\OpenOffice.org2\user\registry\data\org\openoffice\office /Y > nul
if %username%==Ammoniake copy \\sonne\netlogon\Ammoniake.xcu c:\users\Ammoniake\AppData\Roaming\OpenOffice.org2\user\registry\data\org\openoffice\office /Y > nul
if %username%==Appelmus copy \\sonne\netlogon\Appelmus.xcu c:\users\Appelmus\AppData\Roaming\OpenOffice.org2\user\registry\data\org\openoffice\office /Y > nul
if %username%==Avantipopulus copy \\sonne\netlogon\Avantipopulus.xcu c:\users\Avantipopulus\AppData\Roaming\OpenOffice.org2\user\registry\data\org\openoffice\office /Y > nul
if %username%==Bossix copy \\sonne\netlogon\Bossix.xcu c:\users\Bossix\AppData\Roaming\OpenOffice.org2\user\registry\data\org\openoffice\office /Y > nul
if %username%==Cleopatra copy \\sonne\netlogon\Cleopatra.xcu c:\users\Cleopatra\AppData\Roaming\OpenOffice.org2\user\registry\data\org\openoffice\office /Y > nul
if %username%==Crazfus copy \\sonne\netlogon\Crazfus.xcu c:\users\Crazfus\AppData\Roaming\OpenOffice.org2\user\registry\data\org\openoffice\office /Y > nul
if %username%==Gutzufus copy \\sonne\netlogon\Gutzufus.xcu c:\users\Gutzufus\AppData\Roaming\OpenOffice.org2\user\registry\data\org\openoffice\office /Y > nul
if %username%==Kontrabas copy \\sonne\netlogon\Kontrabas.xcu c:\users\Kontrabas\AppData\Roaming\OpenOffice.org2\user\registry\data\org\openoffice\office /Y > nul
if %username%==Ofenaus copy \\sonne\netlogon\Ofenaus.xcu c:\users\Ofenaus\AppData\Roaming\OpenOffice.org2\user\registry\data\org\openoffice\office /Y > nul
if %username%==Stenograf copy \\sonne\netlogon\Stenograf.xcu c:\users\Stenograf\AppData\Roaming\OpenOffice.org2\user\registry\data\org\openoffice\office /Y > nul
if %username%==Wachtelchen copy \\sonne\netlogon\Wachtelchen.xcu c:\users\Wachtelchen\AppData\Roaming\OpenOffice.org2\user\registry\data\org\openoffice\office /Y > nul
if %username%==Prognostix copy \\sonne\netlogon\Prognostix.xcu c:\users\Prognostix\AppData\Roaming\OpenOffice.org2\user\registry\data\org\openoffice\office /Y > nul

c:\temp\message.vbs


2. The ifmember utility:
========================


This is also batch .cmd related, which is a quite old technique, but it's still possible
to use it in Win2Kx login scripts.
Although advisable is the use of GPO, that is the Group Policy Editor.

IfMember is often used in Windows logon scripts and other batch files. In the following example, 
the batch file containing IfMember maps a network drive based on group membership. If the user logs on 
to a computer using this batch file and is a member of the HR group, the batch file maps the following 
share for the user:

\\server1\hr_share$
If the user is a member of the Marketing group, the batch file maps the following share for the user:

\\server1\marketing_share$
If the user is a member of the Administrtors group, the batch file maps the following share for the user:

\\server1\admin_share$
The share being mapped for the user appears as a network connection in Windows Explorer on the user's computer. 
The mapped share is assigned the next available drive letter.

Batch File
echo off

ifmember hr
if errorlevel 1 goto hr

ifmember marketing
if errorlevel 1 goto marketing

ifmember administrators
if errorlevel 1 goto administrators

goto end

:hr
net use * \\server1\hr_share$
goto end

:marketing
net use * \\server1\marketing_share$
goto end

:administrators
net use * \\server1\admin_share$
goto end

:end
Exit


In the following example, the batch file containing IfMember queries multiple groups simultaneously. 
If the user logs on to a computer using this batch file and is a member of the HR or Marketing group, 
the batch file maps the following share for the user:

echo off

ifmember hr marketing
if errorlevel 2 goto hrANDmarketing
if errorlevel 1 goto hrORmarketing

ifmember administrators
if errorlevel 1 goto administrators

goto end

:hrANDmarketing
net use * \\server1\marketinghr_share$
goto end

:hrORmarketing
net use * \\server1\standard_share$
goto end

:administrators
net use * \\server1\admin_share$
goto end

:end
Exit


3. The use of Runas in a login script:
======================================

See also Part 1, section 22.

Ofcourse you can use "runas" in a loginscript (or other batch file), but
the standard "runas" utility asks for the password of the user you want "to run as".
You can pass the password on the commandline, but in a script this will be cleartext, which
might present a security problem.

One of the better options here is to take a look at the "runasspc" tool.

http://www.robotronic.de/runasspc/

Please take a look at that tool. Its really good, and you are able to store the password 
in an encrypted file.


4. Some remarks on Vista profiles on Windows Server 2003/2008:
==============================================================


Note 1: If you have a corrupt profile, or it does not load correctly from the Server:
=====================================================================================

If you have a corrupt Vista or XP profile on a client station, or the userprofile 
somehow does not load anymore from the Server, you might consider
the following:

Suppose the profile of Domain User "Alkoholix" does not load correctly to this station,
while on another station it works OK.

-- On that particular client Workstation, Login as the local administrator, or Domain Admin.

-- Optional: If the former local userprofile, might contain data, you should save that first.
   That might be done like this:
   >> Now run 'takeown /r /a /d y /f %systemdrive%\users\Alkoholix'
   >> move Alokoholix to Alkoholix.save

-- run regedit and take a look in:
   HKLM\SOFTWARE\Microsoft\Windows NT\CurrentVersion\ProfileList
   Remove the suspect corrupt SID

-- Login as the Domain User "Alkoholix"

Hopefully the userprofile loads correctly now.


You might also check the following in the registry of that client:

HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Windows NT\CurrentVersion\ProfileList


There is 1 line for each profile. If you suspect a profile is bad, you can check the following records:
-- Ensure the key name doesn't end in ".bad"
-- Ensure the RefCount value is 0
-- Ensure the State value is 0


Note 2: XP and Vista clients in the same Domain with roaming profiles:
======================================================================

This can lead to unplesant surprises. Server stored XP and Vista profiles, do not match enough
to take for granted that if a user logs on to a Domain from XP, and get's a userprofile loaded, that if 
he or she goes to a Vista client, that then the same userprofile is used.
That is in a standard setup, not the case.

If you look at the Server in the directory where the profiles are stored, you might observe the
following for, for example, the user "Alkoholix":

\\starboss\PROFILES\Alkoholix.V2         (Vista profile)
\\starboss\PROFILES\Alkoholix            (XP profile)


There are many differences between XP and Vista profiles.

So what now?

This subject is "large" "enough" to redirect you to other places for better information.
Some suggestions are:

http://4sysops.com/archives/windows-vista-and-windows-xp-roaming-user-profiles-interoperability-folder-redirection-is-the-only-way/
http://technet.microsoft.com/en-us/library/cc766489.aspx


###############################################################################################################
###############################################################################################################
Part 3: Some commands in Vista and Windows 2008 Server (some commands were available in older versions as well)
###############################################################################################################
###############################################################################################################


1. Showing and altering settings on network interfaces: The "netsh" command:
============================================================================

Runs on 2000, XP, 2003, 2008

Let's first show some examples on usage:

-- Show settings on network interfaces

C:\>netsh interface ipv4 show interfaces

Idx  Met   MTU    Status       Naam
---  ---  -----  -----------  -------------------
  1   50 4294967295  connected    Loopback Pseudo-Interface 1
 10   40   1500  connected    Draadloze netwerkverbinding
  9   30   1500  disconnected  LAN-verbinding
 12   50   1500  disconnected  Bluetooth-netwerkverbinding


The "Idx" number identifies your interface if you want to change settings.

-- Example of Altering settings 


C:\> netsh interface ipv4 set address name=10 source=static
address=192.168.100.75 mask=255.255.255.0 gateway=192.168.100.1


C:\>netsh interface ipv4 add dnsserver name=2 address=192.168.100.40


-- More on netsh:

Netsh.exe is a command-line scripting utility that allows you to, either locally or remotely, display or modify 
the network configuration of a computer that is currently running. Netsh.exe also provides a scripting feature 
that allows you to run a group of commands in batch mode against a specified computer. Netsh.exe can also save 
a configuration script in a text file for archival purposes or to help you configure other servers.

Netsh.exe is available on Windows 2000, Windows XP, Windows Server 2003, Vista, Windows Server 2008 .


2. Activate your Windows 2008 Server:
=====================================

C:\> slmgr.vbs -ato

This is indeed a windows scripting host file, executed by Windows Scripting Host.


Activating the remote Windows 2008 Server called "starboss":

C:\> slmgr.vbs starboss Administrator <password_of_that_Administrator> -ato


-- More on slmgr:

SLMGR stands for Software License ManaGeR, or its full name, Windows Software Licensing Management Tool. SLMgr is the main component 
in Windows Vista (and 2003, 2008) that manages system activation and product key, the license to use Windows.

All functions of SLMgr is provided by slmgr.vbs, a command line utility based on VBScript. Most activation related commands available 
in graphics user interface such as System Properties will call slmgr.vbs VBS script to perform the licensing operation. 
And even if you trigger or run SLMgr commands in command line, the results or any error details will display in pop-up dialog window 
in Vista explorer. Here�s some hack and usage guide for slmgr in Vista,a useful reference when you facing activation or not activated problem, 
or when you have been force into Reduced Functionality Mode.

Where and How to Use SLMgr.vbs

There are several ways actually to access and run SLMgr.vbs commands.

Command prompt window. This is the way to to run SLMgr with options which requires elevated administrator privileges. 
Run command (Guide: Display Run command in Vista). 
Start Search box integrated in the Start Menu. Using this method will require user to type in full script name - SLMgr.vbs into the search box 
so that the command looks like �slmgr.vbs -ato� and etc. 
The most famous and common use of SLMgr is to perform a �slmgr.vbs -rearm� to extend the trial period of Vista for another 30 days. 
However, other than this popular switch, SLMgr.vbs actually supports a list of other options, which you can also view by using �SLMgr.vbs -?� command. 
You will see a result window displayed at below.

SLMgr Usage

Syntax

slmgr.vbs [MachineName [User Password]] [<Option>]

* MachineName: Name of remote machine (default is local machine)
* User: Account with required privilege on remote machine
* Password: password for the previous account

Global Options

-ipk <product key>
Install product key (replaces existing key)

-upk
Uninstall product key

-ato
Activate Windows

-dli [Activation ID | All]
Display license information (default: current license)

-dlv [Activation ID | All]
Display detailed license information (default: current license)

-xpr
Expiration date for current license state

Advanced Options

-cpky
Clear product key from the registry (prevents disclosure attacks)

-ilc <License file>
Install license

-rilc
Re-install system license files

-rearm
Reset the licensing status of the machine

-dti
Display Installation ID for offline activation

-atp <Confirmation ID>
Activate product with user-provided Confirmation ID 

KMS Options

-skms <KMS activation server name>
Set KMS server name

-skms <KMS activation server port number>
Set KMS server port number

-skms <KMS activation server name:port number>
Set KMS server name and port number in single command

-ckms
Clear KMS server name and port number to default


3. Install or remove or alter Active Directory: dcpromo tool:
=============================================================


Installs and removes Active Directory Domain Services (AD DS).


The following example supplies an answer file named NewForestInstallation:

dcpromo /answer:NewForestInstallation

The following example creates the first domain controller in a new child domain where you expect to install at least 
some Windows Server 2003 domain controllers:

dcpromo /unattend /InstallDns:yes /ParentDomainDNSName:contoso.com /replicaOrNewDomain:domain /newDomain:child /newDomainDnsName:east.contoso.com 
        /childName:east /DomainNetbiosName:east /databasePath:"e:\ntds" /logPath:"e:\ntdslogs" /sysvolpath:"g:\sysvol" 
        /safeModeAdminPassword:FH#3573.cK /forestLevel:2 /domainLevel:2 /rebootOnCompletion:yes


The following example creates an additional domain controller with the global catalog, and it installs and configures the DNS Server service:


dcpromo /unattend /InstallDns:yes /confirmGC:yes /replicaOrNewDomain:replica /databasePath:"e:\ntds" /logPath:"e:\ntdslogs" 
       /sysvolpath:"g:\sysvol" /safeModeAdminPassword:M6$,U8Gvx4 /rebootOnCompletion:yes


4. The wmic command: and interface on WMI:
========================================== 


WMIC.exe

Windows Management Instrumentation Command. 
Read a huge range of information about local or remote computers. Also provides a way to make configuration changes to multiple remote machines.

Syntax
   Retrieve information about <Alias>:
      WMIC [global_switches] [/locale:ms_409] <alias> [options] [format]

   Interactive mode:
      WMIC

Aliases:
 ALIAS               - Access local system aliases [CALL]

 BASEBOARD           - Base board management (motherboard or system board) 
 BIOS                - BIOS management (Basic input/output services) 
 BOOTCONFIG          - Boot configuration

 CDROM               - CD-ROM
 COMPUTERSYSTEM      - Computer system [CALL/SET]
 CPU                 - CPU
 CSPRODUCT           - Computer system product information from SMBIOS. 

 DATAFILE            - DataFiles [CALL]
 DCOMAPP             - DCOM Applications.
 DESKTOP             - User's Desktop
 DESKTOPMONITOR      - Desktop Monitor
 DEVICEMEMORYADDRESS - Device memory addresses
 DISKDRIVE           - Physical disk drive
 DISKQUOTA           - Disk space usage for NTFS volumes.[SET]
 DMACHANNEL          - Direct memory access (DMA) channel

 ENVIRONMENT         - System environment settings [SET]
 FSDIR               - Filesystem directory entry [CALL]

 GROUP               - Group account [CALL]

 IDECONTROLLER       - IDE Controller
 IRQ                 - Interrupt request line

 JOB                 - Jobs scheduled using the schedule service.[CALL]

 LOADORDER           - System services that define execution dependencies. 
 LOGICALDISK         - Local storage devices [CALL/SET]
 LOGON               - LOGON Sessions.

 MEMCACHE            - Cache memory
 MEMLOGICAL          - System memory, layout and availability
 MEMPHYSICAL         - Physical memory management

 NETCLIENT           - Network Client management.
 NETLOGIN            - Network login information for a particular user. 
 NETPROTOCOL         - Protocols (and their network characteristics).
 NETUSE              - Active network connection.
 NIC                 - Network Interface Controller (NIC)
 NICCONFIG           - Network adapter. [CALL] 
 NTDOMAIN            - NT Domain. [SET]  
 NTEVENT             - NT Event Log.  
 NTEVENTLOG          - NT eventlog file [CALL/SET]

 ONBOARDDEVICE       - Common adapter devices built into the motherboard.
 OS                  - Operating System/s [CALL/SET]

 PAGEFILE            - Virtual memory file swapping
 PAGEFILESET         - Page file settings [SET]
 PARTITION           - Partitioned areas of a physical disk.
 PORT                - I/O ports
 PORTCONNECTOR       - Physical connection ports
 PRINTER             - Printer device [CALL/SET]
 PRINTERCONFIG       - Printer device configuration  
 PRINTJOB            - Print job [CALL]
 PROCESS             - Processes [CALL]*
 PRODUCT             - Windows Installer [CALL]

 QFE                 - Quick Fix Engineering (patches)
 QUOTASETTING        - Setting information for disk quotas on a volume. [SET]

 REGISTRY            - Computer system registry [SET]

 SCSICONTROLLER      - SCSI Controller [CALL]
 SERVER              - Server information 
 SERVICE             - Service application [CALL]
 SHARE               - Shared resourcees [CALL]
 SOFTWAREELEMENT     - Elements of a software product*
 SOFTWAREFEATURE     - Subsets of SoftwareElement. [CALL]*
 SOUNDDEV            - Sound Devices 
 STARTUP             - Commands that run automatically when users logon
 SYSACCOUNT          - System account  
 SYSDRIVER           - System driver for a base service. [CALL]
 SYSTEMENCLOSURE     - Physical system enclosure
 SYSTEMSLOT          - Physical connection points including ports,
                       slots and peripherals, and proprietary connections points.

 TAPEDRIVE           - Tape drives  
 TEMPERATURE         - Temperature sensor (electronic thermometer).
 TIMEZONE            - Time zone data 

 UPS                 - Uninterruptible power supply (UPS) 
 USERACCOUNT         - User accounts [CALL/SET]

 VOLTAGE             - Voltage sensor (electronic voltmeter) data
 VOLUME              - Local storage volume [CALL/SET]
 VOLUMEQUOTASETTING  - Associates the disk quota setting with a specific disk volume. [SET]

 WMISET              - WMI service operational parameters [SET]

New aliases in Windows 2003: 
 MEMORYCHIP          - Memory chip information.
 RDACCOUNT           - Remote Desktop connection permission [CALL]
 RDNIC               - Remote Desktop connection on a specific network adapter [CALL/SET]
 RDPERMISSIONS       - Permissions to a specific Remote Desktop connection [CALL]
 RDTOGGLE            - Turn Remote Desktop listener on or off remotely[CALL]
 RECOVEROS           - Blue Screen Information [SET]
 SHADOWCOPY          - Shadow copy management [CALL]
 SHADOWSTORAGE       - Shadow copy storage areas [CALL/SET]
 VOLUMEUSERQUOTA     - Per user storage volume quotas  [SET]
Options 

By default an alias will return a standard LIST of information, you can also choose to GET one or more specific properties.

Configuration changes can be made, where indicated above with: [CALL or SET ]

The CREATE and DELETE options allow you to change the WMI schema itself.

   alias 
   alias LIST [BRIEF | FULL | INSTANCE | STATUS |SYSTEM | WRITEABLE]
                [/TRANSLATE:BasicXml|NoComma ]
                   [/EVERY:no_secs] [/FORMAT:format]
   alias GET [property list]
                [/VALUE ] [/ALL ] [/TRANSLATE:BasicXml|NoComma ]
                   [/EVERY:no_secs] [/FORMAT:format]
   alias CALL method_name [parameters]
   alias SET [assignments]
   alias CREATE 
   alias DELETE
   alias ASSOC [/RESULTCLASS:classname] [/RESULTROLE:rolename][/ASSOCCLASS:assocclass]

For more help
   WMIC /locale:ms_409 /alias /?
   WMIC /locale:ms_409 /alias option /?
   e.g.
   WMIC /locale:ms_409 /BIOS /CALL /?
   WMIC /locale:ms_409 /MEMLOGICAL /SET /?The order of the /FORMAT and /TRANSLATE switches is significant: 
if /TRANSLATE follows /FORMAT, the output is formatted first and then translated. 

All the options above can be extended with a WHERE clause, best shown by the examples below:

Format:
  Format defines the layout of the information:
    csv.xsl, hform.xsl, htable-sortby.xsl, htable.xsl    texttable.xsl, textvaluelist.xsl, xml.xsl

  All output files are unicode text (convert to ASCII with TYPE)
  Tab Separated Values (.tsv) can be opened in excel Examples

WMIC /locale:ms_409 OS 

WMIC OS LIST BRIEF

WMIC OS GET csname, locale, bootdevice

WMIC /locale:ms_409 NTEVENT where LogFile='system'

WMIC NTEVENT where "LogFile='system' and Type>'0'" 

WMIC SERVICE where (state=�running�) GET caption, name, state > services.tsv

WMIC SERVICE where caption='TELNET' CALL STARTSERVICE

WMIC PRINTER LIST STATUS

WMIC PRINTER where PortName="LPT1:" GET PortName, Name, ShareName
 
WMIC /INTERACTIVE:ON PRINTER where PortName="LPT1:" DELETE

WMIC PROCESS where name='evil.exe' delete

WMIC /output:"%computername%.txt" MEMORYCHIP where "memorytype=17" get Capacity

Interactive mode:
 C:>START "Windows Management" WMIC
 wmic:root\cli>/locale:ms_409
 wmic:root\cli>OS get csname
 wmic:root\cli>quit
Notes

WMIC is available on Windows XP Professional and Windows 2003. To retrieve WMI information from older remote machines 
download & install: WMI core for Win 9x / WMI core for Win NT 4

The availability of WMI information does vary across different versions of Windows
e.g. ODBC, SNMP, Windows Installer.

To run WMIC requires administrator rights.

In Windows 2000, around 4,000 properties can be monitored, and around 40 can be configured.
In Windows XP around 6,000 properties can be monitored, and around 140 can be configured.

Windows 2003 offers a few improvements and bug fixes: the global option /locale:ms_409 is not required (it defaults to English US.) 

When you type WMIC for the first time in Windows 2003 all the aliases are compiled. The second, and subsequent times you run WMIC, 
it will start immediately. Under XP WMIC is slower to initialise, therefore to run several WMI queries it can be quicker to use interactive mode.


5. Scheduled task management in Win2Kx Server:
==============================================


>>> Graphical Interface:
------------------------

Everybody knows this, but to make sure, From "Control Panel" -> "Scheduled Tasks" you open
a graphical interface for creating and managing scheduled tasks in Win2Kx Server.
Its really easy to deploy and manage tasks from the Scheduled Tasks interface.

But here we explore some options that are available from prompt or scripting level.


>>> at command:
---------------

Ofcourse, to start with, we have the at command from which you can create, schedule and manage
tasks. You can schedule e.g. batchscripts, wsh, and also executables from the at interface.

If you do not know the at command, see the help of this tool by using:

C:\>at /?

Also take a look at:

http://support.microsoft.com/kb/313565/en-us

In short, the at command has the following syntax:

You can use the at command to schedule a command, a script, or a program to run at a specified 
date and time. You can also use this command to view existing scheduled tasks. 

To use the at command, the Task Scheduler service must be running, and you must be logged on as a member 
of the local Administrators group. When you use the at command to create tasks, 
you must configure the tasks so that they run in the same user account. 

The at command uses the following syntax: 

at \\computername time /interactive | /every:date,... /next:date,... command

at \\computername id /delete | /delete/yes

The following list describes the parameters that you can use with the at command: 
� \\computername: Use this parameter to specify a remote computer. If you omit this parameter, 
  tasks are scheduled to run on the local computer.  
� time: Use this parameter to specify the time when the task is to run. Time is specified as hours:minutes 
  based on the 24-hour clock. For example, 0:00 represents midnight and 20:30 represents 8:30 P.M. 
� /interactive: Use this parameter to allow the task to interact with the desktop of the user 
  who is logged on at the time the task runs. 
� /every:date,...: Use this parameter to schedule the task to run on the specified day or days of the week or month, 
  for example, every Friday or the eighth day of every month. Specify date as one or more days of the week 
  (use the following abbreviations: M,T,W,Th,F,S,Su) or one or more days of the month 
  (use the numbers 1 through 31). Make sure that you use commas to separate multiple date entries. 
  If you omit this parameter, the task is scheduled to run on the current day. 
� /next:date,...: Use this parameter to schedule the task to run on the next occurrence of the day 
  (for example, next Monday). Specify date as one or more days of the week (use the following abbreviations: M,T,W,Th,F,S,Su) or one or more days of the month (use the numbers 1 through 31). Make sure that you use commas to separate multiple date entries. If you omit this parameter, the task is scheduled to run on the current day.  
� command: Use this parameter to specify the Windows 2000 command, the program (.exe or .com file), 
  or the batch program (.bat or .cmd file) that you want to run. If the command requires a path as an argument, 
  use the absolute path name (the entire path beginning with the drive letter). If the command is on 
  a remote computer, use the Uniform Naming Convention (UNC) path name (\\ServerName\ShareName). 
  If the command is not an executable (.exe) file, you must precede the command with cmd /c, for example, 
  cmd /c copy C:\*.* C:\temp. 
� id: Use this parameter to specify the identification number that is assigned to a scheduled task.  
� /delete: Use this parameter to cancel a scheduled task. If you omit the id parameter, all scheduled tasks 
  on the computer are canceled. 
� /yes: Use this parameter to force a yes answer to all queries from the system when you cancel scheduled tasks. 
  If you omit this parameter, you are prompted to confirm the cancellation of a task. 

Example:

at \\starboss 23:00 /every:M,T,W,Th,F backup.cmd


>>> Scheduled tasks return codes:
---------------------------------

Especially from the graphical interface, you can view the result codes from the tasks that have run.
These are the most common ones:

� 0x0: The operation completed successfully. 
� 0x1: An incorrect function was called or an unknown function was called. 
� 0xa: The environment is incorrect. 

If the result code has the "C0000XXX" format, the task did not complete successfully 
(the "C" indicates an error condition). The most common "C" error code is "0xC000013A: The application terminated 
as a result of a CTRL+C".


>>> The taskkill command:
-------------------------

In the latest Windows versions, per default, a couple of unix type "kill" commands are available.
For example, in XP we have the "tskill" and "taskkill" commands, with which you can kill/stop processes.
The taskkill command is more sophisticated than tskill.

Remark: For older Windows versions, from the Resource Kits, the "kill" command could be obtained.


Syntax:

taskkill [/s Computer] [/u Domain\User [/p Password]]] [/fi FilterName] [/pid ProcessID]|[/im ImageName] [/f][/t]


/s  computer Specifies the name or IP address of a remote computer (do not use backslashes). 
    The default is the local computer. 
/u  domain\user Runs the command with the account permissions of the user specified by User or Domain\User. 
    The default is the permissions of the current logged on user on the computer issuing the command. 
/p  password Specifies the password of the user account that is specified in the /u parameter. 
/fi FilterName Specifies the types of process(es) to include in or exclude from termination. 
    The following are valid filter names, operators, and values. 
    Name 	Operators 		Value 
    Hostname 	eq, ne 			Any valid string. 
    Status 	eq, ne 			RUNNING|NOT RESPONDING 
    Imagename 	eq, ne 			Any valid string. 
    PID 	eq, ne, gt, lt, ge, le 	Any valid positive integer. 
    Session 	eq, ne, gt, lt, ge, le 	Any valid session number. 
    CPUTime 	eq, ne, gt, lt, ge, le 	Valid time in the format of hh:mm:ss. 
                                        The mm and ss parameters should be between 0 and 59 and 
                                        hh can be any valid unsigned numeric value. 
    Memusage 	eq, ne, gt, lt, ge, le 	Any valid integer. 
    Username 	eq, ne 			Any valid user name ([Domain\]User). 
    Services 	eq, ne 			Any valid string. 
    Windowtitle eq, ne 			Any valid string. 
 
/pid processID Specifies the process ID of the process to be terminated. 
/im  ImageName Specifies the image name of the process to be terminated. 
     Use the wildcard (*) to specify all image names. 
/f   Specifies that process(es) be forcefully terminated. This parameter is ignored for 
     remote processes; all remote processes are forcefully terminated. 
/t   Specifies to terminate all child processes along with the parent process, commonly known as a tree kill. 


Examples:

taskkill /f /im insman.exe      (kills the isman.exe process)
taskkill /f /fi "status eq not responding" 


Needless to say that you should be very carefull with those commands in production environments.


#############################################################################
#############################################################################
Part 4: Other notes related to profiles and roaming mail:
#############################################################################
#############################################################################


The purpose here is that we want to have roaming email profiles/pst files
of XP or Vista clients, on Win2Kx Servers, without using a backend Server service.

Note: It's probably best to have a central backend email facility installed, like Exchange Server,
but this section only deals with roaming email profiles and email related folders.
This has it's limitations ofcourse, if you consider very large email folders.
In that case, you should really consider using a backend email store.


Note 1: Notes related to Location of the outlook .pst file: roaming pst
=======================================================================

================
>> Article 1: <<
================

The use of the registry key "ForcePSTPath" that points to a networkshare, so users will
save there .pst files on that share.

See some notes in:

http://support.microsoft.com/kb/896591

That info is repeated here:

You cannot specify a separate folder to store the .ost file when you use the ForcePSTPath value in Outlook 2003.

View products that this article applies to.
Article ID : 896591 
Last Review : October 28, 2005 
Revision : 2.4 
On This Page

SYMPTOMS

RESOLUTION

Service pack information

How to obtain the hotfix

STATUS

MORE INFORMATION

REFERENCES
SYMPTOMS
When you add the ForcePSTPath value to the registry to change the location of the personal folders (.pst) 
file in Microsoft Office Outlook 2003, the offline folder (.ost) file is also changed to the same folder. 
You cannot specify a separate folder to store the .ost file.

To add the ForcePSTPath value to the registry, follow these steps.


1. Quit Outlook 2003. 
2. Click Start, click Run, type regedit in the Open box, and then click OK. 
3. Locate and then select the following registry subkey: 
   HKEY_CURRENT_USER\Software\Microsoft\Office\11.0\Outlook 
4. After you select the subkey that is specified in step 3, point to New on the Edit menu, and then click 
   Expandable String Value. 
5. Type ForcePSTPath, and then press ENTER. 
6. Right-click ForcePSTPath, and then click Modify. 
7. In the Value data box, type the full path of where you want to store the .pst file, and then click OK. 
8. On the File menu, click Exit to quit Registry Editor. 


================
>> Article 2: <<
================

In relation to the information of Article 1:

Changing the Default Location for .OST and .PST Files (All Windows)

By default, Outlook places each Offline Folders (.ost) file and Personal Folders (.pst) file 
that it creates in the %userprofile%\Local Settings\Application Data\Microsoft\Outlook folder. 
This setting allows you to change the default behavior and store the files in any folder.

Open your registry and find the appropriate key below. 


Outlook 2003 - [HKEY_CURRENT_USER\Software\Microsoft\Office\11.0\Outlook] 
Outlook XP - [HKEY_CURRENT_USER\Software\Microsoft\Office\10.0\Outlook] 
Outlook 2000 - [HKEY_CURRENT_USER\Software\Microsoft\Office\9.0\Outlook] 
Create a new REG_EXPAND_SZ (Expandable String) value called 'ForcePSTPath' and set it to equal the full path 
of the required personal folder directory (e.g. "C:\Mailbox", or "\\starboss\profiles\%username%"). 

Restart Outlook for the change to take effect.

Note: This is a expandable string value so you can use environment variables, such as %userprofile%, 
to specify the path. When the value does not exist the default value of 
"%userprofile%\Local Settings\Application Data\Microsoft\Outlook" is used.


================
>> Article 3: <<
================

Disable UAC (User Access Control) in Vista:

Method #2 - Using Regedit
Open Registry Editor. 

In Registry Editor, navigate to the following registry key: 

HKEY_LOCAL_MACHINE\Software\Microsoft\Windows\CurrentVersion\Policies\System

Locate the following value (DWORD): 

EnableLUA

and give it a value of 0.

Close Registry Editor. You need to reboot the computer for changes to apply. 
In order to re-enable UAC just change the above value to 1.


================
>> Article 4: <<
================

>>>>>>>> Search XP registry on "ProfileDIr"


To stop certain folders being replicated as part of the user profile, 
explore how to stop certain areas of a profile from replicating when you use roaming profiles. 
A problem in Windows 2000 causes the ExcludeProfileDirs entry to be set to null if the entry is longer 
than 260 characters. To address this problem, check the value at 
record:
HKEY_CURRENT_USER\Software\Microsoft\WindowsNT\CurrentVersion\Winlogon\ExcludeProfileDirs 
ExcludeProfileDirs
Value:
Local Settings;Temporary Internet Files;History;Temp;Local Settings\Application Data\Microsoft\Outlook


>>>>>>>> Search XP registry on "outlook.pst"


HKEY_CURRENT_USER\Software\Microsoft\Windows NT\CurrentVersion\Windows Messaging Subsystem\Profiles\"

record:
001e6700
C:\Documents and Settings\root\Local Settings\Application Data\Microsoft\Outlook\outlook.pst

Outlook 2003 uses a binary value in "001f6700" (instead of "001e6700"). 

Note 1:
=======

>> thread from internet:


Q:

Folks, 
I hope you can help me or point me in the right direction, I'm in the process of migrating peoples 
PST Files to a Network Share. 
My problem is that the registry Key that holds the file path information varies on machine. 
So I have no definite key to navigate to too achieve this. 

Below is the Key where the PST File is registered. {Username} can be resolved with objNet.UserName 
but the {xxxxxxxxxxxxxxxxxxxxx} is the problem. 

HKEY_Current_User\Software\Microsoft\Windows NT\CurrentVersion\Windows Messaging Subsystem\
Profiles\{Username}\{xxxxxxxxxxxxxxxxxxxxx} 

I have found that all PST Files that are attached in Outlook have a Reg_SZ Value of 001e6700. 
So what I need to accomplish is to 
search every key under "HKEy_Current_User\Software\Microsoft\Windows NT\CurrentVersion\Windows Messaging 
Subsystem\Profiles\{Username}\" for 001e6700 so I can locate the PST Files and copy them to a 
Network Share and change the 
Reg_SZ Value to reflect their new network path. 

Below is code that will list all the {xxxxxxxxxxxxxxxxxxxxx} variable keys 

----

Set objNet = CreateObject("WScript.NetWork") 
Const HKCU = &H80000001  
  
REGKEY = "Software\Microsoft\Windows NT\CurrentVersion\Windows Messaging Subsystem\Profiles\" & objNet.username & ""  
strComputer = "."  
Set wbemServices = GetObject("winmgmts:\\" & strComputer)  
Set objReg=GetObject("winmgmts:\\" & strComputer & "\root\default:StdRegProv")  
GetSubKeys REGKEY  
Sub GetSubKeys(parmKey)  
objReg.EnumKey HKCU, parmKey, collSubKeys  
If Not IsNull(collSubKeys) Then  

For each subKey in collSubKeys  
fullKey = parmKey & "\" & subKey  
Wscript.echo fullKey  
GetSubKeys fullKey  
Next  

End If  
End Sub

----

What would i need to add to it to make it search inside each one of them and return the String Value 
for Reg_SZ 001e6700? 

Thanks Very Much for you help with ths 


A:

Option Explicit 
Dim oWS : Set oWS = CreateObject("WScript.Shell") 
Dim oFSO : Set oFSO = CreateObject("Scripting.FileSystemObject") 
Dim sSearchFor 
sSearchFor = InputBox("This script will search your Registry and find all " & _ 
           "instances of the search string you input."  & vbcrlf & vbcrlf & _ 
           "This search could take several minutes, so please be patient." & _ 
           vbcrlf & vbcrlf & "Enter search string (case insensitive) and " & _ 
           "click OK...") 
If sSearchFor = "" Then Cleanup() 
Dim StartTime : StartTime = Timer 
Dim sRegTmp, sOutTmp, eRegLine, iCnt, sRegKey, aRegFileLines 
sRegTmp = oWS.Environment("Process")("Temp") & "\RegTmp.tmp " 
sOutTmp = oWS.Environment("Process")("Temp") & "\sOutTmp" & _ 
        Hour(Now) & Minute(Now) & Second(Now) & ".tmp " 
oWS.Run "regedit /e /a " & sRegTmp, , True '/a enables export as Ansi for WinXP 
With oFSO.OpenTextFile(sOutTmp, 8, True) 
.WriteLine("REGEDIT4" & vbcrlf & vbcrlf & "; Registry search " & _ 
  "results for string " & Chr(34) & sSearchFor & Chr(34) & " " & Now & _ 
  vbcrlf & vbcrlf & "; NOTE: This file will be deleted when you close " & _ 
  "WordPad." & vbcrlf & "; You must manually save this file to a new " & _ 
  "location if you want to refer to it again later." & vbcrlf & "; (If " & _ 
  "you save the file with a .reg extension, you can use it to restore " & _ 
  "any Registry changes you make to these values.)" & vbcrlf) 
With oFSO.GetFile(sRegTmp) 
  aRegFileLines = Split(.OpenAsTextStream(1, 0).Read(.Size), vbcrlf) 
End With 
oFSO.DeleteFile(sRegTmp) 
For Each eRegLine in aRegFileLines 
  If InStr(1, eRegLine, "[", 1) > 0 Then sRegKey = eRegLine 
  If InStr(1, eRegLine, sSearchFor, 1) >  0 Then 
    If sRegKey <> eRegLine Then 
      .WriteLine(vbcrlf & sRegKey) & vbcrlf & eRegLine 
    Else 
      .WriteLine(vbcrlf & sRegKey) 
    End If 
    iCnt = iCnt + 1 
  End If 
Next 
Erase aRegFileLines 
If iCnt < 1 Then 
  oWS.Popup "Search completed in " & FormatNumber(Timer - StartTime, 0) & " seconds." & _ 
            vbcrlf & vbcrlf & "No instances of " & chr(34) & sSearchFor & chr(34) & _ 
            " found.",,, 4096 
  .Close 
  oFSO.DeleteFile(sOutTmp) 
  Cleanup() 
End If 
.Close 
End With 
oWS.Popup "Search completed in " & FormatNumber(Timer - StartTime, 0) & " seconds." & _ 
        vbcrlf & vbcrlf & iCnt & " instances of " & chr(34) & sSearchFor & chr(34) & _ 
        " found." & vbcrlf & vbcrlf & "Click OK to open Results in WordPad.",,, 4096 
oWS.Run "WordPad " & sOutTmp, 3, True 
oFSO.DeleteFile(sOutTmp) 
Cleanup() 
Sub Cleanup() 
Set oWS = Nothing 
Set oFSO = Nothing 
WScript.Quit 
End Sub 


Note 2:
=======

Locate PST Files on Remote Workstations

If you have Microsoft Outlook deployed in your organization or are planning a deployment, 
I am sure there have been conversations about PST (Personal Folder Storage) files: 

Should we allow creation of Outlook PST files? 
If so, how do we control the type (Unicode vs. ANSI), size, or default location 
What do we do with existing PST files? 
There are a number of ways PST files can be controlled or managed. This includes: 

-Group Policy Templates  
-Custom Installation Wizard (CIW) or Custom Maintenance Wizard (CMW) from the Office Resource Kit (ORK)  
-Registry Modification using a logon script perhaps 

A challenge with methods 1 and 2 listed above is that most group policies and CIW settings only apply 
to New Outlook Profiles. 

So the question arises: How do I know the name, location, and size of Existing PST files stored 
by information workers on local workstations? If you have already invested in migration tools, 
some can be used to 'crawl' workstations to gather this information. However, if you do not have 
the luxury of using such tools or are looking for a simpler solution, here is one which 
can be implemented via a login script: 

The solution described in this blog is only a sample and will require additional coding to fully 
automate the process. 

ON CLIENT WORKSTATION 

Navigate the registry to: HKEY_CURRENT_USER\ Software\Microsoft\Windows NT\CurrentVersion\
Windows Messaging Subsystem\Profiles 

If multiple profiles have been configured on the workstation, you will find a key representing 
each Outlook Profile. 

Select an Outlook profile. For purposes of this blog, let's say my Outlook Profile Name is 
"Outlook". So I select: HKCU\ Software\Microsoft\Windows NT\CurrentVersion\Windows Messaging Subsystem
\Profiles\Outlook

Now search for a REG_BINARY value:

001f6700 for Outlook 2003 - 2008 
001e6700 for Outlook 98 - 2000 

If multiple PST files exist on the workstation, your search will find multiple instances of the 
REG_BINARY value. Notice that the path to the REG_BINARY value changes for each instance of PST file. 
Example � If I have two Outlook PST files:

-- The reference to my first PST file: "HLCU\ Software\Microsoft\Windows NT\CurrentVersion
\Windows Messaging Subsystem\Profiles\Outlook\627a9d56b540e64abb70db3817bd5793\001f6700" 

-- The reference to my second PST file: "HLCU\ Software\Microsoft\Windows NT\CurrentVersion
\Windows Messaging Subsystem\Profiles\Outlook\763cdcca473fe843ad8fd406044dfc95\001f6700"


Read the Name and Path to the PST file. Oops, the data is in Binary. How do you read it? 
A Million Dollar question  The Answer at no charge J 
If this was just one workstation, you could of-course read the value thru the Registry UI,
somewhat cryptic, but can be ascertained. 

But what if we need to read this value programmatically? Here is a sample code in vbscript 
which shows how you can read this value programmatically (and the PST file size) using an explicit path 
to the Binary Key in the registry. Once you are familiar with the concept, you can easily write a wrapper to: 

-Enumerate Outlook Profiles 
-For Each Outlook Profile, find instances of the Binary Key 
-Read and/or Modify the Binary Key to manipulate the PST filename or path 

Sample Script to read a single PST File Name, PST Path/Location, and PST File Size: 

Const HKEY_CURRENT_USER = &H80000001 
strComputer = "." 
Set oReg=GetObject("winmgmts:{impersonationLevel=impersonate}!\\" &_ 
strComputer & "\root\default:StdRegProv") 
strKeyPath = "Software\Microsoft\Windows NT\CurrentVersion\Windows Messaging Subsystem\Profiles
\Outlook\627a9d56b540e64abb70db3817bd5793" 

'Change the last value in the above key (right of Outlook) to reflect the path in the 'registry where you can 'find Binary values "001f6700" or "001e6700" 

binValueName = "001f6700" 

'Change the above value to "001e6700" depending on version of Outlook

oReg.GetBinaryValue HKEY_CURRENT_USER,strKeyPath,binValueName,binValue 

strPath = "" 

For i = lBound(binValue) to uBound(binValue) 
     If binValue(i) <> 0 then 
        strPath = strPath & Chr(binValue(i)) 
     End If 
Next 
strPSTFileName = Trim(strPath) 
Set filesys = CreateObject("Scripting.FileSystemObject") 
Set PSTFile = filesys.GetFile(strPSTFileName) 
PSTFileSize = (PSTFile.Size/1024000) 
WScript.echo "PST File Name: " & strPSTFileName & vbCrLf & vbCrLf & _ 
                   "PST File Size: " & PSTFileSize & " MB" 

Once you have located the name and path to Outlook PST Files on workstations throughout your organization, 
you are now ready to do one or more of the following: 

-Enforce policies around existing PST files 
-Develop a plan to backup the Outlook PST files 
-Develop a plan to Archive e-mail using Microsoft or Third-Party archiving solutions 
-Develop a plan to Consolidate Outlook PST files to a network file share 

Hints: 

Use object.Move method to move the PST file 
Use object.SetBinaryValue method to write or modify a binary key in the registry 


Note 3:
=======

To find where in the registry outlook pst files are located, you might use:

reg query "HKEY_CURRENT_USER\Software\Microsoft\Windows NT\CurrentVersion\Windows Messaging Subsystem\Profiles" /s|find ".pst"


=================
>> Article 5: <<
=================

This article demonstrates the implementation of roaming mail (Windows Mail) of Vista clients, that store
their messages and folders on a share on a Win2K, Win2K3, or Win2K8 Servers.

This means that the user can sit on any workstation, and will find and save his or her mail
on the same central Server location.

This is the trick:

1. First, disable UAC on the Vista clients, or we cannot write silently to the registry
in HKEY_CURRENT_USER

2. Create a login script on the Server in netlogon that resembles something like this:


   copy \\server\netlogon\crpol.bat c:\temp /Y
   call c:\temp\crpol.bat

   regedit /s c:\temp\crpol.reg

Now observe that we copy a batchfile to c:\temp of the Workstation (or choose another local directory).

The batchfile, that will be called from the login script, runs on the local Workstation,
and has the following source

   crpol.bat:
   ----------

   echo Windows Registry Editor Version 5.00 > c:\temp\crpol.reg

   echo [HKEY_CURRENT_USER\Software\Microsoft\Windows Mail] >> c:\temp\crpol.reg
   echo "Store Root"="\\\\sonne\\home\\%USERNAME%" >> c:\temp\crpol.reg

Now here you see that in c:\temp, a regfile is created (through the use of the echo commands).
This reg file get's imported silently into the registry, (through the use of "regedit /s crpol.reg") in
HKEY_CURRENT_USER\Software\Microsoft\Windows Mail
and affects the value of "Store Root" that will now be "\\server\home\%username%", where
the variable %username% is resolved to the useraccount name, like e.g. johndoe.

This instructs Windows Mail to store and retrieve all mail from this central Server location.


#############################################################################
#############################################################################
Part 5: Backup and Recovery commands in XP/Vista/Win2k
#############################################################################
#############################################################################


!!!!!!!!! VERY IMPORTANT MESSAGE:


!!!!!!!!! This section is a incomplete description of Win2K3 backups  !!!!!!!!
!!!!!!!!! It just describe some pointers on which you may decide your !!!!!!!!
!!!!!!!!! backup policies. Please investigate ALL options             !!!!!!!!
!!!!!!!!! on Win2K3 Server backup / restore.                          !!!!!!!!
!!!!!!!!! Especially take "care" and investigate "system state"       !!!!!!!!
!!!!!!!!! backups, which enables you to restore a Server to a well    !!!!!!!!
!!!!!!!!! known state.                                                !!!!!!!!
!!!!!!!!! This section only describes user data backups and only      !!!!!!!!
!!!!!!!!! touches the subject of a good "OS" System State backup.     !!!!!!!!


Obviously, when you work at a larger organization, you probably will use a professional
backup suite, like TSM.

In this section, you will ONLY find just a series of prompt commands and scripts (and really, nothing more 
than that), and a description of of the standard graphical interface.


1. The regular xcopy command to create backups of user data:
============================================================

One advantage of the regular xcopy command is, thats its available on all Windows versions.
Actually, its a very powerful command, and you can use many switches in order to create good user data backups.
Please take notice of the fact that we distinquish here between user data backups, and
backups of the Operating System (with all fancy stuff like Active Directory etc..)

Suppose on a Win2Kx Server, you have a directory "C:\data" which has under it a big tree of subdirectories
with other data directories (like c:\data\user1, c:\data\user2 etc..).

Now you want to copy this complete tree to another drive, say drive M:, included with all subdirectories, all files, 
and information of the Owner and ACL�s.

The following example shows how you might do this:

C:\data> xcopy *.* m:\data /S /C /O /H /Y

/S: copies subdirectories also
/C: copy the files even if an error shows up (like the file is in use)
/O: includes also ownership and Access Control Lists information.
/H: includes also all hidden and system files
/Y: suppresses the confirmation if you are about to overwrite files

Ofcourse, the command xcopy /?  will show you all switches.


2. The robocopy command:
========================

The robocopy command is included with all latest Windows versions. But on the older versions, you needed
to get the command from the "Resource Kit". So on Vista and Win2008 its present per default, but for example
on Windows 2003 Server, you must get it from a support CD or the Resource Kit.

Its a very robust command, and its really your perfect partner for creating command line (or scheduled batch)
backup scripts.
If you like the relatively simple xcopy command, using the right switches, you can keep using that ofcourse,
but robocopy is really more advanced, and I recommend using it for creating backups.

Robocopy is designed for reliable mirroring of directories or directory trees. It has features to ensure all NTFS attributes 
and properties are copied, and includes additional restart code for network connections subject to disruption.

Remark: robocopy is a directory/folder copier, and its not directly good usable for just individual files.

Here we will not list all options and possibilities of robocopy, but we will show some good examples.


-- Example 1:

Copy directory recursively (/E), and copy all file information (/COPYALL, 
                                                                equivalent to /COPY:DATSOU, D=Data, A=Attributes, 
                                                                                            T=Timestamps, S=Security=NTFS ACLs, 
                                                                                            O=Owner info, U=aUditing info), 
                                                                                            do not retry locked files (/R:0):

C:\> robocopy C:\foo C:\bar /COPYALL /E /R:0


-- Example 2:

Mirror foo to bar, destroying any files in bar that are not present in foo (/MIR), copy files in restartable mode (/Z) 
in case network connection is lost:

C:\> robocopy C:\foo \\backupserver\bar /MIR /Z


3. ntbackup utility, regular data backups & System State Backups:
=================================================================


For Win2K3, the well known "ntbackup" utility is still in place.
You can create adhoc backups, as well as scheduled backups, with this utility.

There are several ways to start it. From the prompt, just call ntbackup, like:

C:\> ntbackup

A graphical interface will show up. You can do a lot of actions from here.
We will not go into all details of "normal" backup operations, like creating a normal backup, 
an incremental backup, an differential backup etc..
That theory is quite the same on all playforms.

What we will distinguish here is this:


-----------------------------------------------
>>>> 1. regular file backups (user data) <<<<<
-----------------------------------------------

On your Server, you might have many true user data directories, like for example "D:\DATA" or "C:\SALES",
which users might access from the network. This is VERY different of creating a good OS backup.

To backup user data, you might use the methods listed in sections 1 and 2, that is, using
xcopy with the right switches, or using the robocopy utility.

But creating good backups (or backup jobs) of user data can also be done with a graphical interface, that is, ntbackup.
Also take notice that ntbackup can also be called from scripts, like batch files.
 
If you want to backup user data now, or create a scheduled job to do that regularly (e.g. once a day),
just follow the wizard, and point and click on what you want to have backupped, and if you want, create
a scheduled job.
That is really easy to do.


------------------------------------------------------------------------------------------
>>>> 2.OS backups (active directory, sysvol, registry, components, Server OS etc..) <<<<<
------------------------------------------------------------------------------------------

You have read the "IMPORTANT MESSAGE" above. In order to create a good Operating System backup, including
all Domain Controller "stuff" like Active Directory etc.., you really need to know what needs to be done.
This document will NOT describe that.

But please investigate the options of "System State" backups of the ntbackup utility, or other tools,
or third party software.

These are a few quite good references:

http://www.ilopia.com/Articles/WindowsServer2003/Backup.aspx
http://www.petri.co.il/backup-windows-server-2003-active-directory.htm


4. Creating a Win2Kx bootdisk (dvd etc..):
==========================================


>>>> Note 1: floppy disk NT4, XP, Win200, Win2003:
--------------------------------------------------

How to create a Windows NT 4.0, 2000, XP or Server 2003 boot floppy disk

The NT BOOT diskette allows you to boot from a floppy using the NT OS Loader menu to select the NT partition to load 
the kernel from. This can be very handy if you have lost your NT boot sector by installing another OS 
which copies over the partition boot record.

Supports up to 2 harddisks (any partition).

The steps are:


Format a floppy disk using a Windows NT 4.0, 2000, XP or Server 2003 machine (not windows 9x!)
format a: /u


Copy NTDETECT.COM and NTLDR onto the floppy disk

Download this BOOT.INI file and put it onto the floppy disk

BOOT.INI file 
[boot loader]
timeout=-1
default=multi(0)disk(0)rdisk(0)partition(1)\WINDOWS
[operating systems]
multi(0)disk(0)rdisk(0)partition(1)\WINDOWS="First harddisk, first partition" /sos
multi(0)disk(0)rdisk(0)partition(2)\WINDOWS="First harddisk, second partition" /sos
multi(0)disk(0)rdisk(0)partition(3)\WINDOWS="First harddisk, third partition" /sos
multi(0)disk(0)rdisk(0)partition(4)\WINDOWS="First harddisk, fourth partition" /sos
multi(0)disk(0)rdisk(1)partition(1)\WINDOWS="Second harddisk, first partition" /sos
multi(0)disk(0)rdisk(1)partition(2)\WINDOWS="Second harddisk, second partition" /sos
multi(0)disk(0)rdisk(1)partition(3)\WINDOWS="Second harddisk, third partition" /sos
multi(0)disk(0)rdisk(1)partition(4)\WINDOWS="Second harddisk, fourth partition" /sos
C:\="Previous Operating System on C:\" 


This boot.ini assumes that windows is installed in the "WINDOWS" folder, for terminal server you must edit the boot.ini 
and replace all "WINDOWS" into "WTSRV", for Windows NT 4.0 you must edit the boot.ini and replace all "WINDOWS" into "WINNT".

This boot.ini will not work for SCSI Controllers without a SCSI BIOS (need NTBOOTDD.SYS on the diskette)

For more info on boot.ini switches look at http://www.sysinternals.com/ntw2k/info/bootini.shtml. 
Done, try and boot it!


>>>>> Note 2: create a bootable Windows 2003 + SP1 DVD or CDR:
--------------------------------------------------------------

To create a bootable Windows 2003 CD, you first need to extract the boot sector of an existing 
Windows 2003 installation CD-ROM. (This procedure should also work to create a Windows XP bootable CD-ROM; 
simply capture the boot sector of an XP CD-ROM.) To extract the boot sector, I used the IsoBuster CD-ROM 
and DVD data-recovery tool, which you can download at http://www.smart-projects.net/isobuster . 
After you install IsoBuster, perform these steps: 

Insert the Windows 2003 CD-ROM that you want to integrate with SP1. 
Open IsoBuster and select Bootable CD from the left pane, right-click the Microsoft Corporation.img file, 
and select Extract Microsoft Corporation.img from the context menu. 
Enter a name for the boot sector you're extracting and click Save. 
Exit IsoBuster. 
Alternatively, you can use a pre-extracted Windows 2003 boot sector file called Windows2003StdCDBootSector.img , which you can download here .


Next, you'll create the new structure for the Windows 2003 with integrated SP1 CD-ROM by performing these steps: 

Create a new folder on a local file system, and name the folder windows2003sp1. 
Copy the contents of the existing Windows 2003 CD-ROM to the new folder. 
Create an extracted version of the service pack that you want to slipstream (in this example, SP1). To do so, download the service pack, 
then execute it with the /x switch, as in the following example: 
 /x

Open the extracted service pack, navigate to the "update" subfolder, and run this command: 
update /integrate: 

as in this example 
update /integrate:D:\temp\windows2003stdsp1


You can also choose to not extract the service pack first and instead simply add the /integrate switch to the 
downloaded SP1 file, as in this example: 

  /integrate: . 

The integrate switch tells the update command to integrate the service pack files into an existing Windows 2003 
installation source. You can also update the support tools and deployment tools with their SP1 versions. (For download information, 
see the FAQ "Where can I get the updated support tools and deployment tools for Windows Server 2003 Service Pack 1 (SP1)?" 
at http://www.windowsitpro.com/articles/index.cfm?articleid=46056 .) Rename the downloaded deployment tools .cab file 
to deploy.cab and place the file in the \support\tools subfolder of the Windows 2003 CD-ROM folder that has the slipstreamed SP1 
(replacing the existing deploy.cab file). To update the SP1 support tools, extract them to a new folder using the command 
\c \t 

as in this example: 
D:\temp\windowsserver2003-kb892777-supporttools-x86-enu.exe /c /t:d:\temp\2003sp1suptools 

Copy the four extracted files (sup_pro.cab, sup_srv.cab, support.cab, and suptools.msi) to the 
\support\tools folder of the Windows 2003 folder.

You're now ready to burn this new structure and the boot sector you extracted earlier to a CD-ROM to make a bootable 
Windows 2003 CD-ROM that has SP1 slipstreamed into it. For this example, I used the Nero 6.6 CD-ROM burning software, 
but you can use any CD-ROM burner software that lets you create a bootable CD-ROM. T
o create the Windows 2003 CD-ROM, perform these steps: 


-Start the Nero or other CD-ROM burning application. 
-From the File menu, select New. 
-From the list of CD type options, select CD-ROM (Boot). 
-Select the Boot tab, then select "Image file" and enter the location of your boot sector image file. 
 Check the "Enable expert settings" and set the emulation to "No Emulation." Set the load segment to 07C0 
 and the number of sectors to 4, as the figure shows. 
-Select the Label tab and enter the volume label of the original CD-ROM (e.g., NRMSFPP_EN for Windows 2003 Standard Server). 
-Under Burn CD, select the "Finalize CD (No further writing possible!)" option. 
-Click New. 
-Drag all the files from the Windows 2003 with slipstreamed SP1 folder to the CD project, as the figure shows.
-From the Recorder menu, select Burn Compilation. Click Burn. 

The application then creates your SP1-integrated bootable Windows 2003 CD-ROM.


>>>>> Note 3: create a bootable Windows 2003 + SP2 DVD or CDR:
--------------------------------------------------------------


How to create a Windows Server 2003 with Service Pack 2 bootable installation disc.
This guide will walk you through the slipstreaming process for creating a bootable Microsoft Windows Server 2003 
installation disc that includes Service Pack 2, and should work for any version of Windows Server 2003.

This how-to requires the following:

Windows Server 2003 CD-ROM 
1.2 GB of free drive space for temporary storage 
Windows Server 2003 Service Pack 2 (free download) 
ISO Buster (free download) 
Nero 
1 Blank CD-R 

You can use these instructions on any computer running any version of Windows 2000, Windows XP, Windows 2003 Server, or Windows Vista. 
The directory names used here are not obligatory.

First, create the following directory:

c:\win2003

Next, insert your Windows Server 2003 CD-ROM into your PC and copy the contents to to c:\win2003. If you choose a different location, 
make sure you substitute the correct location in later steps.

Next, launch ISO Buster. ISO Buster is a product for data recovery from optical media such as CDs and DVDs. 
It can be used for rescuing data off bad discs, or extracting elements off a disc that are normally not 
user accessible, such as boot data. It is found at ISOBuster.com

In the left pane of ISO Buster, select Bootable CD from under CD\Session 1\Track 01. In the right pane, right click 
the item named Microsoft Corporation.img and select "Extract Microsoft Corporation.img". 
Save the file to your c:\ drive. Note: If your Windows 2003 Server CD is an OEM CD from Dell, HP or other company, 
the boot image may have a slightly different name (ex: BootImage.img)

Put your CD-ROM away, we are done with it.
---------------------

Move the WindowsServer2003-KB914961-SP2-x86-ENU.exe Service Pack 2 file to your C: drive. 
Then, from your start menu, select "Run" and type:

c:\WindowsServer2003-KB914961-SP2-x86-ENU.exe /x

If your Service Pack 2 file is in a different location, substitute the correct path to this file.

You will be prompted for a directory to extract to. Type:

c:\win2k3_sp2

Windows will extract the contents of Service Pack 2 to a directory so you can apply them to the files 
you copied off your CD-ROM. This process will take a few minutes and will vary depending on the speed of your computer.

When extraction is complete, go to your start menu, select "Run" and type:

c:\win2k3_sp2\i386\update\update.exe -s:c:\win2003

The slipstreaming process will begin. Your Windows Server 2003 installation files are being updated with the new components 
of Service Pack 2. This process can take a few minutes and will vary depending on the speed of your computer. 
When integration has been finished, you will receive a "Integrated install has been completed successfully" message.
---------------------

When the slipstreaming process is complete, launch Nero Burning ROM. Create a new Bootable CD-ROM compilation.

Here there are some important settings you must make, or your CD will not boot. Under "Source of boot image data", 
check Image file. Click the "Browse" button and select the Microsoft Corporation.img file you extracted with ISO Buster.

Next, check "Enable expert settings". Select No Emulation from the "Kind of Emulation" option. 
Then change the "Number of loaded sectors" to 4. Once you have done this, click the "New" button.

Select the contents of your c:\win2003\ folder and add them to the new CD compilation.

From the "Recorder" menu, select "Choose Recorder". Select Image Recorder. This will allow us to save 
an ISO image of the disc before burning it. If you do not wish to have an ISO file of your updated disc, 
you can skip this step.

From the "Recorder" menu, select Burn Compilation. In the burn settings window, ensure both Write and Finalize CD are selected. 
Next select Disc-at-once from the "Write Method" option. Once you have done this, click the burn option.

If you selected the "Image Recorder", you will be prompted to save a disc image. Select ISO Image Files (*.iso) from 
the "Save as type:" prompt. Save the file as c:\win2k3_sp2.iso. Nero will create your disc image. 
This process will take a few minutes.

You have now created a bootable CD-ROM image! Use Nero to burn the ISO disc image to make your actual CD-ROM. 
Store the slipstreamed ISO somewhere safe in case you need to burn a new copy. Now you can use this disc to install 
a fresh copy of Windows Server 2003 with Service Pack 2.


#######################################################################
#######################################################################
Part 6: Some wsh examples:
#######################################################################
#######################################################################


Intro:

A Windows script is a text file. You can create a script with any text editor as long 
as you save your script with a WSH-compatible script extension (.js, vbs, or .wsf).

The most commonly available text editor is already installed on your computer: Notepad. 
You can also use your favorite HTML editor, Textpad, Microsoft Visual C++, or Visual InterDev.

To create a simple script with Notepad or better, with TextPad :

-Start Notepad or TextPad. 
-Write your script. For example purposes, type 

 WScript.Echo("Hello World!"); 

-Save this text file with a .js extension (instead of the default .txt extension). For example, Hello.js. 
-Navigate to the file you just saved, and double-click it. 
-Windows Script Host invokes the JScript engine and runs your script. In the example, 
 a message box is displayed with the message "Hello World!" 

To simplify your script writing, you can divide a script into more than one part. With this approach, 
you would create a .wsf file and use it as the starting point of execution. 
The other parts could be .js or .vbs files. You would reference these files from the .wsf file.

This approach makes your code more robust because it isolates pieces of it, 
allowing you to debug one piece at a time. It also makes your code reusable because 
it allows you to create functions that can be called again and again.

Examples:

-----------------------------------------------------

Dim WSHShell
Set WSHShell = WScript.CreateObject("WScript.Shell")
WshShell.Run ("MSACCESS.EXE " & " " &"g:\tests\test_linked_server.mdb")

-----------------------------------------------------


Connect to SQL Server via ADO: create a .vbs file as follows
============================================================

Dim oConn
  Dim oCmd
  Dim oRs
  Dim tmpBody
  Set oConn = CreateObject("ADODB.Connection")
  oConn.Open ("DATABASE=aida;DSN=MDB;UID=karel;Password=karel;")
  Set oCmd = CreateObject("ADODB.Command")
  oCmd.ActiveConnection = oConn
  oCmd.CommandText = "exec fill_x"
  oCmd.CommandType = 1
  oCmd.Prepared = True
  Set oRs = oCmd.Execute
  
  Set oRs = Nothing
  Set oCmd = Nothing
  Set oConn = Nothing
  

-----------------------------------------------------
On SQL Server, start an asp on remote machine:\

make a vbscript job in SQLServer,
Step:

Dim WshShell
Set WshShell =CreateObject("WScript.Shell")
WshShell.Run ("http://yourserver/yourfolder/YourASPPage.asp?yourparameter=<YourXMLDocumentRoot><Yourtag>Whatever</YourTag></YourXMLDocumentRoot>")
Set WsShell = Nothing

-----------------------------------------------------

<%
dim WshShell, strRunCommand, intReturn

set WshShell = server.createobject("WScript.shell")

'start access and load file
strRunCommand = "MSACCESS.EXE " & " " &"g:\tests\test_linked_server.mdb /X macro1"
intReturn = wshshell.run(strRunCommand, 1, TRUE)

Response.Write "<P>Return code = " & intReturn
set WshShell = nothing
%>

-----------------------------------------------------

I am trying to run WinZip Self-Extractor on my website's remote web server.  
I need to package various files into self extracting archives for distribution 
to users in real-time based on user actions.  Creating these self-extracting 
archives is a two-step process.
1) Use WinZip (version 8.1) to create a .zip file
2) Use WinZip Self-Extractor (version 2.2) to create an .exe file from the .zip file

Step 1 works fine using the following ASP and WSH code:

<%
dim WshShell, strRunCommand, intReturn

set WshShell = server.createobject("WScript.shell")

'Create zip file
strRunCommand =     "d:\inetpub\wwwroot\my-domain\software\WinZip\wzzip -a -yb d:\inetpub\wwwroot\my-domain\vbmdata\test.zip d:\inetpub\wwwroot\my-domain\vbmdata\ziptest\*.*"
intReturn = wshshell.run(strRunCommand, 1, TRUE)

Response.Write "<P>Return code = " & intReturn
set WshShell = nothing
%>

-----------------------------------------------------
GetRegValue.vbs:
const HKEY_CURRENT_USER = &H80000001
const HKEY_LOCAL_MACHINE = &H80000002
strComputer = "."

Set oReg=GetObject("winmgmts:{impersonationLevel=impersonate}!\\" &_
strComputer & "\root\default:StdRegProv")

'Get a DWORD value
strKeyPath = "Software\TEST"
strValueName = "TestDWord"
oReg.GetDWORDValue HKEY_CURRENT_USER,strKeyPath,strValueName,dwValue
Wscript.Echo "Current User: " & strvaluename & " - " & dwValue 

'Get a string value
strKeyPath = "SOFTWARE\TEST"
strValueName = "TestString"
oReg.GetStringValue HKEY_LOCAL_MACHINE,strKeyPath,strValueName,strValue
Wscript.Echo "Local Machine: " & strvaluename & " - " & strvalue

-----------------------------------------------------
CantRun.vbs:
strComputer = "."
Set objWMIService = GetObject("winmgmts:" _
& "{impersonationLevel=impersonate}!\\" & strComputer & "\root\cimv2")
Set colMonitoredProcesses = objWMIService. _ 
ExecNotificationQuery("select * from __instancecreationevent " _ 
& " within 1 where TargetInstance isa 'Win32_Process'")
i = 0
Do While i = 0
Set objLatestProcess = colMonitoredProcesses.NextEvent
If objLatestProcess.TargetInstance.Name = "IEXPLORE.EXE" Then
objLatestProcess.TargetInstance.Terminate
End If
Loop

-----------------------------------------------------
GeneralBiosInfo.vbs:
CRLF = Chr(13) & Chr(10) ' carriage return, line feed

strmsg = ""
strComputer = "."
Set objWMIService = GetObject("winmgmts:" _
& "{impersonationLevel=impersonate}!\\" & strComputer & "\root\cimv2")
Set colBIOS = objWMIService.ExecQuery _
("Select * from Win32_BIOS")
For each objBIOS in colBIOS
strmsg=strmsg & "Serial Number: " & objBIOS.SerialNumber & CRLF
strmsg=strmsg & "SMBIOS Version: " & objBIOS.SMBIOSBIOSVersion & CRLF
strmsg=strmsg & "Version: " & objBIOS.Version & CRLF
Next


strComputer = "."
Set objWMIService = GetObject("winmgmts:" _
& "{impersonationLevel=impersonate}!\\" & strComputer & "\root\cimv2")
Set colSettings = objWMIService.ExecQuery _
("Select * from Win32_OperatingSystem")
For Each objOperatingSystem in colSettings 
strmsg=strmsg & "Windows Directory: " & _
objOperatingSystem.WindowsDirectory & CRLF
strmsg=strmsg & "Locale: " & objOperatingSystem.Locale & CRLF
strmsg=strmsg & "Available Physical Memory: " & _
objOperatingSystem.FreePhysicalMemory & CRLF
strmsg=strmsg & "Total Virtual Memory: " & _
objOperatingSystem.TotalVirtualMemorySize & CRLF
strmsg=strmsg & "Available Virtual Memory: " & _
objOperatingSystem.FreeVirtualMemory & CRLF
Next
Set colSettings = objWMIService.ExecQuery _
("Select * from Win32_ComputerSystem")
For Each objComputer in colSettings 
strmsg=strmsg & "System Name: " & objComputer.Name & CRLF
strmsg=strmsg & "System Manufacturer: " & objComputer.Manufacturer & CRLF
strmsg=strmsg & "System Model: " & objComputer.Model & CRLF
strmsg=strmsg & "Total Physical Memory: " & _
objComputer.TotalPhysicalMemory & CRLF
Next
Set colSettings = objWMIService.ExecQuery _
("Select * from Win32_Processor")
For Each objProcessor in colSettings 
strmsg=strmsg & "Processor: " & objProcessor.Description & CRLF
Next
Set colSettings = objWMIService.ExecQuery _
("Select * from Win32_BIOS")
For Each objBIOS in colSettings 
strmsg=strmsg & "BIOS Version: " & objBIOS.Version & CRLF
Next

'Set dtmConvertedDate = CreateObject("WbemScripting.SWbemDateTime")
strComputer = "."
Set objWMIService = GetObject("winmgmts:" _
& "{impersonationLevel=impersonate}!\\" & strComputer & "\root\cimv2")
Set colOperatingSystems = objWMIService.ExecQuery _
("Select * from Win32_OperatingSystem")
For Each objOperatingSystem in colOperatingSystems
strmsg=strmsg & "Organization: " & objOperatingSystem.Organization & CRLF
strmsg=strmsg & "Registered User: " & objOperatingSystem.RegisteredUser & CRLF
Next

Wscript.Echo strmsg

-----------------------------------------------------
CreateShortCut.vbs:
set WshShell = WScript.CreateObject("WScript.Shell")
strDesktop = WshShell.SpecialFolders("Desktop")
set oShellLink = WshShell.CreateShortcut(strDesktop & "\Shortcut Script.lnk")
oShellLink.TargetPath = WScript.ScriptFullName
oShellLink.WindowStyle = 1
oShellLink.Hotkey = "CTRL+SHIFT+F"
oShellLink.IconLocation = "notepad.exe, 0"
oShellLink.Description = "Shortcut Script"
oShellLink.WorkingDirectory = strDesktop
oShellLink.Save
set oUrlLink = WshShell.CreateShortcut(strDesktop & "\Microsoft Web Site.url")
oUrlLink.TargetPath = "http://www.microsoft.com"
oUrlLink.Save

-----------------------------------------------------
LaunchIE.vbs:
Private myIE

Dim WSHShell


Set myIE = CreateObject("InternetExplorer.Application")
myIE.Navigate "http://intranet.company.com/index.htm"
'myIE.ToolBar = True
'myIE.StatusBar = False
myIE.AddressBar = False
myIE.MenuBar = False
myIE.Resizable = False
myIE.TheaterMode = False


Do
Loop While myIE.Busy

myIE.Width = 1024
myIE.Height = 740
myIE.Left = 0
myIE.Top = 0
myIE.Visible = True

Set WSHShell = WScript.CreateObject("WScript.Shell")
WshShell.AppActivate("Microsoft Internet Explorer")
Set WSHShell = Nothing

-----------------------------------------------------
ListLocalUsers.vbs:
On Error Resume Next
strComputer = "."
Set objWMIService = GetObject("winmgmts:\\" & strComputer & "\root\cimv2")
Set colItems = objWMIService.ExecQuery("Select * from Win32_Account Where LocalAccount = True")
For Each objItem in colItems
' Wscript.Echo "Description: " & objItem.Description
' Wscript.Echo "Domain: " & objItem.Domain
' Wscript.Echo "Install Date: " & objItem.InstallDate
' Wscript.Echo "Local Account: " & objItem.LocalAccount
' Wscript.Echo "Name: " & objItem.Name
' Wscript.Echo "SID: " & objItem.SID
' Wscript.Echo "SID Type: " & objItem.SIDType
' Wscript.Echo "Status: " & objItem.Status
if objItem.SIDType = 1 THEN

Wscript.Echo "Status: " & objItem.GetObjectText_
end if
Next

-----------------------------------------------------
ReadTextFile.vbs:
Const ForReading = 1
strComputer = "."

strfind=inputbox("What text do you want to find?","Find text in document")
strfile=inputbox("What file do you want to look in?","Find text in document")

Set objFSO = CreateObject("Scripting.FileSystemObject")
strmsg=""

on error resume next
if objFSO.fileexists (strfile) then
Set objTextFile = objFSO.OpenTextFile (strFile, ForReading)
Do Until objTextFile.AtEndOfStream 
strTextLine = objTextFile.Readline
stroffset=instr(1,strtextline,strfind,1)
if stroffset>0 then 
strmsg=strmsg & strtextline & vbcrlf
end if
Loop
objTextFile.Close
end if
if len(strmsg)>0 then 
wscript.echo strmsg
else
wscript.echo "No match found"
end if

-----------------------------------------------------
SetRegValue.vbs:
const HKEY_CURRENT_USER = &H80000001
const HKEY_LOCAL_MACHINE = &H80000002
strComputer = "."

Set oReg=GetObject("winmgmts:{impersonationLevel=impersonate}!\\" &_
strComputer & "\root\default:StdRegProv")

'Get a DWORD value
strKeyPath = "Software\TEST"
strValueName = "TestDWord"
dwvalue = 100
oReg.SetDWORDValue HKEY_CURRENT_USER,strKeyPath,strValueName,dwValue

'Get a string value
strKeyPath = "SOFTWARE\TEST"
strValueName = "TestString"
strvalue = "Now is the time"
oReg.SetStringValue HKEY_LOCAL_MACHINE,strKeyPath,strValueName,strValue

-----------------------------------------------------
GetRegValue.vbs:
const HKEY_CURRENT_USER = &H80000001
const HKEY_LOCAL_MACHINE = &H80000002
strComputer = "."

Set oReg=GetObject("winmgmts:{impersonationLevel=impersonate}!\\" &_
strComputer & "\root\default:StdRegProv")

'Get a DWORD value
strKeyPath = "Software\TEST"
strValueName = "TestDWord"
oReg.GetDWORDValue HKEY_CURRENT_USER,strKeyPath,strValueName,dwValue
Wscript.Echo "Current User: " & strvaluename & " - " & dwValue 

'Get a string value
strKeyPath = "SOFTWARE\TEST"
strValueName = "TestString"
oReg.GetStringValue HKEY_LOCAL_MACHINE,strKeyPath,strValueName,strValue
Wscript.Echo "Local Machine: " & strvaluename & " - " & strvalue
 
-----------------------------------------------------
Windows Script, as we have recently seen, can be used destructively, and that�s unfortunate.
We simply have to guard against potential abuses and take the appropriate advance
 steps to make sure that we are safe.   However, as developers, I am sure you will 
agree that the advantages that scripting provides far outweigh the potential dangers.

Here is a real � world example of how to use the Windows Scripting 
Host with VBScript in an ASP Page that can be run on a remote web server
 by simply �loading the URL� into your browser. I hope it provides �food for 
thought� and a basis for you to be able to solve some of your own problems.

Recently, one of the web sites I run on a friend�s machine got messed up. 
Somebody removed an old web site folder. Unfortunately, the folder also 
contained an Access database that was being used by one of my active 
sites in some Cold Fusion pages. I FTP�ed a new copy of the MDB file but 
soon discovered that somehow the ODBC System DSN had also been removed.  
My friend was away and I didn�t know when he would return, and I had no 
�desktop� access to the web server, only FTP access.

Rather than being faced with the embarrassment of having a web site whose
 �classifieds� section would show all kinds of errors, here�s how I solved the problem:

First, I created a new ODBC System DSN on my own machine so I could export 
all the registry entries.  To export registry entries, run REGEDIT, 
highlight the key you want to export, and choose �Registry/ Export Registry File�
 from the Main Menu.  The result is a text file with a �.reg� extension.  
You can execute this by double clicking on it to enter the items contained therein 
into the registry. And, you can also copy the contents into an ASP page and have 
the Windows Script Host enter them remotely!

Second, I created the following ASP page, and filled in all the registry keys
 and values that I needed to recreate:

<%

Dim WSHShell

� Create an instance of the Windows Script Host �Shell� Object

Set WSHShell = CreateObject("WScript.Shell")

' IMPORTANT! When first string has a "\" at end, this creates the KEY itself, 
not the key or subkey value!

' Remove the "\" to make it write the actual subkey name and its value (second string)

� Create the �classifieds� main key

WSHShell.RegWrite "HKLM\Software\ODBC\ODBC.INI\classifieds\", "Default"

� Begin to write all the subkeys and their corresponding values
WSHShell.RegWrite "HKLM\Software\ODBC\ODBC.INI\classifieds\Driver", "C:\WINDOWS\SYSTEM\odbcjt32.dll"
WSHShell.RegWrite "HKLM\Software\ODBC\ODBC.INI\classifieds\DBQ", "C:\CFUSION\database\classifieds.mdb"
WSHShell.RegWrite "HKLM\Software\ODBC\ODBC.INI\classifieds\DriverId", 00000019, "REG_DWORD"
WSHShell.RegWrite "HKLM\Software\ODBC\ODBC.INI\classifieds\FIL" ,"MS Access;"
WSHShell.RegWrite "HKLM\Software\ODBC\ODBC.INI\classifieds\SafeTransactions" ,00000000, "REG_DWORD"
WSHShell.RegWrite "HKLM\Software\ODBC\ODBC.INI\classifieds\UID" ,""
WSHShell.RegWrite "HKLM\Software\ODBC\ODBC.INI\ODBC Data Sources\Classifieds", "Microsoft Access Driver (*.mdb)"
WSHShell.RegWrite "HKLM\Software\ODBC\ODBC.INI\classifieds\Engines\" , ""
WSHShell.regWrite "HKLM\Software\ODBC\ODBC.INI\classifieds\Engines\Jet\", ""
WSHShell.regWrite "HKLM\Software\ODBC\ODBC.INI\classifieds\Engines\Jet\ImplicitCommitSync", ""
WSHShell.regWrite "HKLM\Software\ODBC\ODBC.INI\classifieds\Engines\Jet\MaxBufferSize", 00000800, "REG_DWORD"
WSHShell.regWrite "HKLM\Software\ODBC\ODBC.INI\classifieds\Engines\Jet\PageTimeout", 00000005, "REG_DWORD"
WSHShell.regWrite "HKLM\Software\ODBC\ODBC.INI\classifieds\Engines\Jet\Threads", 00000003, "REG_DWORD"
WSHShell.regWrite "HKLM\Software\ODBC\ODBC.INI\classifieds\Engines\Jet\UserCommitSync" ,"Yes"
Response.write "Registry Modifications Complete!"

%>

When I FTP�ed the above �RegistryEdit.asp� page to my webspace on my friend�s machine, 
and opened my browser with http://www.myserver.com/RegistryEdit.asp, Voila!

All the correct ODBC entries were restored on the web server! My classifieds section worked great!
The key here is to understand the syntax involved in getting the difference between 
creating a registry �KEY�, and entering a key or subkey �VALUE�. The documentation (and example)  from Microsoft on the Wscript.Shell  �Regwrite�  syntax is relatively poor. Here�s the catch:

When you want to CREATE a new key (or just make sure that it�s already there), 
you place a backslash (�\�) at the end of the first string in the �Regwrite� 
method line.  So if you look at my script, the first thing I needed to do was to recreate the �classifieds� KEY. I did this with :

WSHShell.RegWrite "HKLM\Software\ODBC\ODBC.INI\classifieds\", "Default"

I put �Default� in there; you can put an empty string (��) if you like. 
The Important thing to remember is, the first string after the WSHShell.RegWrite 
directive is terminated in a backslash when you want to create a key instead of 
write a key value.

Now that I have made sure the �HKLM\Software\ODBC\ODBC.INI\classifieds\� key is there, 
I am ready to write the subkeys and their values there, like this:

WSHShell.RegWrite "HKLM\Software\ODBC\ODBC.INI\classifieds\Driver", "C:\WINDOWS\SYSTEM\odbcjt32.dll"

The syntax is:

object.RegWrite strName, anyValue [,strType] 

object
 WshShell object.
 
strName
 Key or value name to write.
 
anyValue
 The value to write into the key or registry value.
 
strType
 Optional. The data type for the value being stored in the registry.
 
If strName ends with the backslash character (\), this method returns (creates) the 
key instead of the value. StrName must begin with one of following root key names: 

Short
 Long
 
HKCU
 HKEY_CURRENT_USER
 
HKLM
 HKEY_LOCAL_MACHINE
 
HKCR
 HKEY_CLASSES_ROOT
 
   HKEY_USERS
 
   HKEY_CURRENT_CONFIG
 

RegWrite supports strType as REG_SZ, REG_EXPAND_SZ, REG_DWORD, and REG_BINARY.
 If another data type is passed as strType, RegWrite returns E_INVALIDARG


Of course, you can also read registry values from within as ASP script 
or instantiated COM component by using the corresponding �RegRead� method. 
I have found this process to be reasonably fast, and that means we have 
an alternative method of storing state in the registry, rather than relying 
on cookies, ASP session variables, the querystring, hidden form fields or a database. 
 Of course, if you have a lot of users and many keys and values to write, you could 
create a really big registry, so you have to think things through. 
But it�s a nice tool to know about.

This should give you the basics of what it takes to write to the Windows registry 
remotely using ASP with VBScript and the Windows Scripting Host Shell Object. 

-----------------------------------------------------

This script launches the Manage Computer MMC, compmgmt.msc, after prompting for local or remote system name. 

Script: 

'====================
' NAME: LaunchManage.vbs
'
' AUTHOR: Alan Kaplan , MSD
' DATE : 6/19/2003
'
' COMMENT: Launches Management MMC
'====================

dim wshShell,command,strComputer
Set wshShell = WScript.CreateObject("WScript.Shell")
dim quote
quote=chr(34)

'Cosmetic only...Make sure running Wscript.
If IsCScript() Then         'If CScript, re-run with Wscript...to remove ugly box
    WshShell.Run "WScript.exe " & quote & WScript.ScriptFullName & quote, 1, true
WScript.Quit     '...and stop running as cscript
End If


'not checking for /help or /? ....
'but allows you to run as batch

If WScript.Arguments.count = 1 Then 
    strComputer = WScript.Arguments(0)
else
    strComputer = wshShell.ExpandEnvironmentStrings("%Computername%")
    strcomputer=InputBox("Run Management MMC for what Computer?","Launch Management MMC",strComputer)
    If strComputer = "" Then WScript.Quit
End If 

'the heart of the matter.... works for local PC, too.
command = "mmc %windir%\system32\compmgmt.msc -s /computer:\\"&strcomputer
wshShell.Run command,1,False

Function IsCScript()
If (InStr(UCase(WScript.FullName), "CSCRIPT") <> 0) Then
IsCScript = True
Else
IsCScript = False
End If
End Function


-----------------------------------------------------

Start MSAccess and macro

Option Explicit
const QUOTE =3D """"
const ACCESSPATH =3D "C:\Program Files\Microsoft
Office\Office\MSACCESS.EXE"
const ACCESSDATABASE =3D "C:\My Documents\My Database.mdb"
const ACCESSMACRO =3D "MyMacro"
Public Sub Main()
  dim wshShell, command
  set wshShell =3D WScript.CreateObject("WScript.Shell")
  command =3D QUOTE & ACCESSPATH & QUOTE & " " & QUOTE & ACCESSDATABASE
& QUOTE & " /x " & ACCESSMACRO
  wshShell.Run command,2,true
  set wshShell =3D nothing
End Sub
Call Main()
' *************************************

-----------------------------------------------------

Motivating the Component Object Model: Examples

Before we get too mired in the details of COM programming. let's examine some examples which illustrate the advantages of utilizing COM.

The entire office suite of Microsoft is written using COM. Therefore, various pieces of EXCEL. WORD, ACCESS, etc can be used in your program by accessing its COM interface.

Here are some examples of what you can do easily with the COM interfaces into OFFICE Suite.

Accessing COM objects
// Windows Script Host Sample Script
//
// ------------------------------------------------------------------------
//               Copyright (C) 1996 Microsoft Corporation
//
// You have a royalty-free right to use, modify, reproduce and distribute
// the Sample Application Files (and/or any modified version) in any way
// you find useful, provided that you agree that Microsoft has no warranty,
// obligations or liability for any Sample Application Files.
// ------------------------------------------------------------------------

// This sample will display Windows Scripting Host properties in Excel.


var vbOKCancel = 1;
var vbInformation = 64;
var vbCancel = 2;

var L_Welcome_MsgBox_Message_Text    = "This script will display Windows Scripting Host properties in Excel.";
var L_Welcome_MsgBox_Title_Text      = "Windows Scripting Host Sample";
Welcome();
    

//////////////////////////////////////////////////////////////////////////////////
//
// Excel Sample
//
var objXL = WScript.CreateObject("Excel.Application");

objXL.Visible = true;

objXL.WorkBooks.Add;

objXL.Columns(1).ColumnWidth = 20;
objXL.Columns(2).ColumnWidth = 30;
objXL.Columns(3).ColumnWidth = 40;

objXL.Cells(1, 1).Value = "Property Name";
objXL.Cells(1, 2).Value = "Value";
objXL.Cells(1, 3).Value = "Description";

objXL.Range("A1:C1").Select; 
objXL.Selection.Font.Bold = true;
objXL.Selection.Interior.ColorIndex = 1;
objXL.Selection.Interior.Pattern = 1; //xlSolid
objXL.Selection.Font.ColorIndex = 2;

objXL.Columns("B:B").Select;
objXL.Selection.HorizontalAlignment = -4131; // xlLeft

var intIndex = 2;

function Show(strName, strValue, strDesc) {
    objXL.Cells(intIndex, 1).Value = strName;
    objXL.Cells(intIndex, 2).Value = strValue;
    objXL.Cells(intIndex, 3).Value = strDesc;
    intIndex++; 
    objXL.Cells(intIndex, 1).Select;
}

//
// Show WScript properties
//
Show("Name",           WScript.Name,           "Application Friendly Name");
Show("Version",        WScript.Version,        "Application Version");
Show("FullName",       WScript.FullName,       "Application Context: Fully Qualified Name");
Show("Path",           WScript.Path,           "Application Context: Path Only");
Show("Interactive",    WScript.Interactive,    "State of Interactive Mode");


//
// Show command line arguments.
//
var colArgs = WScript.Arguments
Show("Arguments.Count", colArgs.length, "Number of command line arguments");

for (i = 0; i < colArgs.length; i++) {
    objXL.Cells(intIndex, 1).Value = "Arguments(" + i + ")";
    objXL.Cells(intIndex, 2).Value = colArgs(i);
    intIndex++;
    objXL.Cells(intIndex, 1).Select;
}


//////////////////////////////////////////////////////////////////////////////////
//
// Welcome
//
function Welcome() {
    var WSHShell = WScript.CreateObject("WScript.Shell");
    var intDoIt;

    intDoIt =  WSHShell.Popup(L_Welcome_MsgBox_Message_Text,
                              0,
                              L_Welcome_MsgBox_Title_Text,
                              vbOKCancel + vbInformation );
    if (intDoIt == vbCancel) {
        WScript.Quit();
    }
}

Set format for selected region 
treat set of cells as an array for interating 

--------------------------------------------------------------------------------

 
// Windows Script Host Sample Script (In JScript)
//
// ------------------------------------------------------------------------
//               Copyright (C) 1996-1997 Microsoft Corporation
//
// You have a royalty-free right to use, modify, reproduce and distribute
// the Sample Application Files (and/or any modified version) in any way
// you find useful, provided that you agree that Microsoft has no warranty,
// obligations or liability for any Sample Application Files.
// ------------------------------------------------------------------------


// This sample demonstrates how to access Microsoft Excel using the Windows Scripting Host.

var vbOKCancel = 1;
var vbInformation = 64;
var vbCancel = 2;

var L_Welcome_MsgBox_Message_Text    = "This script demonstrates how to access Excel using the Windows Scripting Host.";
var L_Welcome_MsgBox_Title_Text      = "Windows Scripting Host Sample";
Welcome();
    
//////////////////////////////////////////////////////////////////////////////////
//
// Excel Sample
//

var objXL;

objXL = WScript.CreateObject("Excel.Application");
objXL.Workbooks.Add;
objXL.Cells(1,1).Value = 5;
objXL.Cells(1,2).Value = 10;
objXL.Cells(1,3).Value = 15
objXL.Range("A1:C1").Select;

var objXLchart = objXL.Charts.Add();
objXL.Visible = true;
objXLchart.Type = -4100;

var intRotate;
for(intRotate = 5; intRotate <= 180; intRotate += 5) {
    objXLchart.Rotation = intRotate;
}

for (intRotate = 175; intRotate >= 0; intRotate -= 5) {
    objXLchart.Rotation = intRotate;
}

//////////////////////////////////////////////////////////////////////////////////
//
// Welcome
//
function Welcome() {
    var WSHShell = WScript.CreateObject("WScript.Shell");
    var intDoIt;

    intDoIt =  WSHShell.Popup(L_Welcome_MsgBox_Message_Text,
                              0,
                              L_Welcome_MsgBox_Title_Text,
                              vbOKCancel + vbInformation );
    if (intDoIt == vbCancel) {
        WScript.Quit();
    }
}

--------------------------------------------------------------------------------

To demonstrate language independence - here is the same program in Visual Basic Script

// Windows Script Host Sample Script (In Visual Basic Script)

' Windows Script Host Sample Script
'
' ------------------------------------------------------------------------
'               Copyright (C) 1996-1997 Microsoft Corporation
'
' You have a royalty-free right to use, modify, reproduce and distribute
' the Sample Application Files (and/or any modified version) in any way
' you find useful, provided that you agree that Microsoft has no warranty,
' obligations or liability for any Sample Application Files.
' ------------------------------------------------------------------------

' This sample demonstrates how to access Microsoft Excel using the Windows Scripting Host.

L_Welcome_MsgBox_Message_Text    = "This script demonstrates how to access Excel using the Windows Scripting Host."
L_Welcome_MsgBox_Title_Text      = "Windows Scripting Host Sample"
Call Welcome()
    
' ********************************************************************************
' *
' * Excel Sample
' *

Dim objXL
Dim objXLchart
Dim intRotate

Set objXL = WScript.CreateObject("Excel.Application")
objXL.Workbooks.Add
objXL.Cells(1,1).Value = 5
objXL.Cells(1,2).Value = 10
objXL.Cells(1,3).Value = 15
objXL.Range("A1:C1").Select

Set objXLchart = objXL.Charts.Add()
objXL.Visible = True
objXLchart.Type = -4100     

For intRotate = 5 To 180 Step 5
    objXLchart.Rotation = intRotate
Next

For intRotate = 175 To 0 Step -5
    objXLchart.Rotation = intRotate
Next

' ********************************************************************************
' *
' * Welcome
' *
Sub Welcome()
    Dim intDoIt

    intDoIt =  MsgBox(L_Welcome_MsgBox_Message_Text, _
                      vbOKCancel + vbInformation,    _
                      L_Welcome_MsgBox_Title_Text )
    If intDoIt = vbCancel Then
        WScript.Quit
    End If
End Sub


-----------------------------------------------------


Remote WSH, which is a new technology included in WSH 5.6, provides the ability to run a script on a remote machine or machines. With Remote WSH, the script is physically copied from the local machine to the remote machine before executing. In order to enable Remote WSH functionality, you must first set up the remote machine with the proper security settings. The steps below perform the tasks that enable Remote WSH.

Note   Both the remote and local machines must be running Windows NT 4 SP3 or greater in order to use Remote WSH.
To enable a machine to run remote scripts 

Install WSH V5.6 on the machine. If you are using Windows XP or have installed Internet Explorer 6 or greater, WSH 5.6 has already been installed. 
Note   WSH 5.6 is available for download from the Web at http://msdn.microsoft.com/scripting
Add yourself to the remote machine's Local Administrators group. 
To enable Remote WSH, use the System Policy Editor (Poledit.exe) on the server. 
Note   An administrator who wants to enable Remote WSH should add a subkey entry named Remote of type REG_SZ to the registry key HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Windows Script Host\Settings. To enable Remote WSH, set the value of Remote to 1; to disable Remote WSH, set the value to 0. If the value of the Remote value is not set, by default Remote WSH is disabled.
Note   For more information on the System Policy Editor, see the Microsoft Windows online help system.
WSH is enabled on the machine. To test it, see Running Scripts Remotely. 
See Also

-----------------------------------------------------

' VBScript.
RemoteTest.WSF
------------------------------- 
<package>
<job>
<script language="VBScript">
set oController = CreateObject("WSHController")
set oProcess = oController.CreateScript("c:\wsh5.6\beenhere.wsf", "remmachine")
oProcess.Execute
While oProcess.Status <> 2
   WScript.Sleep 100
WEnd
WScript.Echo "Done"
</script>
</job>
</package>


-----------------------------------------------------

' VBScript.
Dim net
Set net = CreateObject("WScript.Network")    
net.MapNetworkDrive "I:", "\\computer2\public","True"

-----------------------------------------------------

REF:

Here is a quick reference to some of the more common ProgIDs 


Application  Prog ID  Control  
    
  
Database  
  
ADO Command ADODB.Command msdado15.dll 
ADO Connection ADODB.Connection msdado15.dll 
ADO Error ADODB.Error msdado15.dll 
ADO Parameter ADODB.Parameter msdado15.dll 
ADO Recordset ADODB.Recordset msdado15.dll 

--------------------------------------------------------------------------------
 
MS ADO Data Control MSAdodcLib.Adodc.6 msadodc.ocx 
  
Internet/Networking  
  
MS ActiveX Upload Control V1.5 MSFlUpl.Flupl flupl.ocx 

--------------------------------------------------------------------------------
 
MS Internet Explorer InternetExplorer.Application shdocvw.dll/iexplore.exe 
MS Web Browser Shell.Explorer shdocvw.dll 

--------------------------------------------------------------------------------
 
MS Internet Transfer Control InetCtls.Inet msinet.ocx 
Shell Automation Service Shell.Application shdocvw.dll 
Timer Object Internet.Timer ietimer.ocx 
MS WinSock Control V6.0 MSWinsock.Winsock mswinsck.ocx 

--------------------------------------------------------------------------------
 
MS Communications Control V6.0 MSCOMMLib.MSComm mscom32.ocx 
  
Microsoft Office  
  
MS Access Access.Application msaccess.exe 

--------------------------------------------------------------------------------
 
MS Chart Control V5.0 MSChartLib.MSChart mschart.ocx 

--------------------------------------------------------------------------------
 
MS Excel Application Excel.Application excel.exe 
MS Excel Chart Excel.Chart excel.exe 
MS Excel WorkSheet Excel.Sheet excel.exe 

--------------------------------------------------------------------------------
 
MS Graph 97 Application MSGraph.Application graph8.exe 
MS Graph 97 Chart MSGraph.Chart graph8.exe 

--------------------------------------------------------------------------------
 
MS Organization Chart 2.0 OrgPlusWOPX.4  orgchart.exe 

--------------------------------------------------------------------------------
 
MS PowerPoint Presentation PowerPoint.Show powerpnt.exe 
MS PowerPoint Slide PowerPoint.Slide powerpnt.exe 

--------------------------------------------------------------------------------
 
MS Word6.0 - 7.0 Document  Word.Document winword.exe 
MS Word6.0 - 7.0 Picture  Word.Picture winword.exe 
MS Word Application Word.Application winword.exe 
MS Word Basic Word.Basic winword.exe 
  
Messaging  
  
MS MAPI Messages Control V6.0 MSMAPI.MAPIMessages msmapi32.ocx 
MS MAPI Session Control V6.0 MS MAPISession msmapi32.ocx 

--------------------------------------------------------------------------------
 
MS Outlook 98 Object Library Outlook.Application outlook.exe 
  
Misc  
  
MS Animation Control V5(sp2) ComCtl2.Animation comct232.ocx 
MS Animation Control V6.0 MSComCtl2.Animation mscomct2.ocx 

--------------------------------------------------------------------------------
 
MS Common Dialog Control MSComDlg.CommonDialog comdlg32.ocx 
Media Clip MPlayer mplay32.exe 
MS Picture Clip Control V6.0 PicClip.PictureClip picclp32.ocx 
MS ProgressBar Control V5.0 COMCTL.ProgCtrl comctl32.ocx 
Wang Image Admin Control WangImage.AdminCtrl imgadmin.ocx 
Wang Image Viewer 1.0  WangImage.Application wangimg.exe 
WordPad Document  WordPad.Document wordpad.exe 
  
WSH  
  
File System Object Scripting.FileSystemObject scrrun.dll 
Scripting.Dictionary Scripting.Dictionary scrrun.dll 
WSH Network Object WScript.Network wshom.ocx 
WSH Shell Object WScript.Shell wshom.ocx 
   

-----------------------------------------------------

Dim RutaAgenda 'Path to the DataBase.
Dim Engine 'Compactation var.
Dim Sys	'FileSystemObject.

RutaAgenda = "D:\Inetpub\wwwroot\Agenda\bbdd\"		'Path a la BBDD.
AgendaFile = "Agenda.mdb"				'BBDD real.
AgendaFile1 = "Agenda1.mdb"				'BBDD temporal.

Set Engine = CreateObject("DAO.DBEngine.35")
Set Sys = CreateObject( "Scripting.FileSystemObject" )

'Repair DataBase
   Engine.RepairDatabase RutaAgenda & AgendaFile

'Delete Temporal Database.
If Sys.FileExists( RutaAgenda & AgendaFile1 ) Then
   Sys.DeleteFile( RutaAgenda & AgendaFile1 )
Else
    'Not exist temporal DataBase. 
End If

'DataBase Compactation.
   Engine.CompactDatabase RutaAgenda & AgendaFile, RutaAgenda & AgendaFile1

'Delete old DataBase
If Sys.FileExists( RutaAgenda & AgendaFile ) Then
   Sys.DeleteFile( RutaAgenda & AgendaFile )
Else
    'Not exist old DataBase
End If

'Copy temporal DataBase into the real DataBase
If Sys.FileExists( RutaAgenda & AgendaFile1 ) Then
   Sys.CopyFile RutaAgenda & AgendaFile1, RutaAgenda & AgendaFile, TRUE
Else
   'Error: Temporal DataBase not exist
End If

'Delete temporal DataBase.
If Sys.FileExists( RutaAgenda & AgendaFile1 ) Then
  Sys.DeleteFile( RutaAgenda & AgendaFile1 )
Else
  'Not exist DataBase
End If

Set Sys = Nothing

-----------------------------------------------------

<% 

Const Jet_Conn_Partial = "Provider=Microsoft.Jet.OLEDB.4.0; Data source="
Dim strDatabase, strFolder, strFileName

'################################################# 
'# Edit the following two lines
'# Define the full path to where your database is
strFolder = "F:\InetPub\wwwroot\_db\" 
'# Enter the name of the database
strDatabase = "YourAccessDatabase.mdb"
'# Stop editing here
'##################################################

Private Sub dbCompact(strDBFileName)
Dim SourceConn
Dim DestConn
Dim oJetEngine
Dim oFSO

SourceConn = Jet_Conn_Partial & strFolder & strDatabase
DestConn = Jet_Conn_Partial & strFolder & "Temp" & strDatabase

Set oFSO = Server.CreateObject("Scripting.FileSystemObject")
Set oJetEngine = Server.CreateObject("JRO.JetEngine")

With oFSO

       If Not .FileExists(strFolder & strDatabase) Then
           Response.Write ("Not Found: " & strFolder & strDatabase)
           Stop
       Else
                 If .FileExists(strFolder & "Temp" & strDatabase) Then
                       Response.Write ("Something went wrong last time " _
                       & "Deleting old database... Please try again")
                      .DeleteFile (strFolder & "Temp" & strDatabase)
                 End If
      End If
End With

With oJetEngine
.CompactDatabase SourceConn, DestConn
End With

oFSO.DeleteFile strFolder & strDatabase
oFSO.MoveFile strFolder & "Temp" _
& strDatabase, strFolder& strDatabase

Set oFSO = Nothing
Set oJetEngine = Nothing
End Sub

Private Sub dbList()
Dim oFolders
Set oFolders = Server.CreateObject("Scripting.FileSystemObject")
   Response.Write ("<Select Name=""DBFileName"">")
   For Each Item In oFolders.GetFolder(strFolder).Files
   If LCase(Right(Item, 4)) = ".mdb" Then
       Response.Write ("<Option Value=""" & Replace(Item, strFolder, "") _
       & """>" & Replace(Item, strFolder, "") & "</Option>")
   End If
Next
Response.Write ("</Select>")

Set oFolders = Nothing
End Sub


%>
<%
' Compact database and tell the user the database is optimized
Select Case Request.form("cmd")
Case "Compact"
dbCompact Request.form("DBFileName")
Response.Write ("Database " & Request.form("DBFileName") & " is optimized.")
End Select
%>

<p><font size="4">Compact and repair database</font></p>
<form method="POST" action="">
<p><%dbList%><input type="submit" value="Compact" name="cmd"></p>
</form>

----------------------------------------------------------------


Following SQL Query Can be used to get
the information from MS Access

select b.pid,b.ProductName,a.prize from 
productdescription a inner join
(SELECT pid,ProductName FROM
OPENROWSET('Microsoft.Jet.OLEDB.4.0',_
'c:\example.mdb';'admin';'',[productm
aster])) as b on
a.pid=b.pid


-----------------------------------------------------------------

!/usr/bin/cscript
' progid4:TLI.TLIApplication

set oTLI = WScript.CreateObject("TLI.TLIApplication")

' get interface info from your qry object
set iInfo = oTLI.InterfaceInfoFromObject( qry )

' enumerate the members collection
for each prop in iInfo.members
sout = sout & prop.name & "=" & prop.value & vbCrLf
next

wscript.ech sout

------------------------------------------------------------------

'acc.vbs
'launches Access
set oAccess = CreateObject("Access.Application")
oAccess.Visible = True
oAccess.UserControl = True

------------------------------------------------------------------

Connect to Access:
==================

Set oConn = CreateObject("ADODB.Connection")
oConn.Open("Provider=Microsoft.Jet.OLEDB.4.0;Data Source=C:\MyDB.mdb;Persist
Security Info=False")
'PROVIDER=SQLOLEDB;DATA SOURCE=;UID=;PWD=;DATABASE=SQLDB")

' Define and open the recordset
Set oRS = oConn.OpenSchema(23) ' adSchemaViews

sStr = "" & vbCrLf & " "

' The VB GetString function makes this too easy & fast!
sStr = sStr + oRS.GetString( , , " ", " 
" & vbCrLf &
" ", " ")

sStr = sStr + " 
" & vbCrLF & " "

' Close the recordset
oRS.Close
Set oRS = Nothing
oConn.Close
Set oConn = Nothing

' Display it
WScript.echo(sStr)

-----------------------------------------------------------

Connect to SQL Server:
======================

Dim oConn
  Dim oCmd
  Dim oRs
  Dim tmpBody
  Set oConn = CreateObject("ADODB.Connection")
  oConn.Open ("DATABASE=aida;DSN=MDB;UID=karel;Password=karel;")
  Set oCmd = CreateObject("ADODB.Command")
  oCmd.ActiveConnection = oConn
  oCmd.CommandText = "exec fill_x"
  oCmd.CommandType = 1
  oCmd.Prepared = True
  Set oRs = oCmd.Execute
  
  Set oRs = Nothing
  Set oCmd = Nothing
  Set oConn = Nothing
  

-----------------------------------------------------------

Dim objSendMail
Dim strTo, strFrom
Dim strSubject, strBody
Dim shipUic

' mail constants
Const CdoBodyFormatType = 0	               ' Body property is HTML
Const CdoMailFormatType = 0	               ' NewMail object is in MIME format

Const CdoNormal = 1		               ' Normal importance (default)

strFrom = "admin@northwind.com"               ' System administrator or DBA mail account
strTo =" manager@northwind.com"               ' Recipient mail account - i.e. Sales Manager
strSubject = "Sales over $10,000"                  ' Mail subject

' Call function to build the HTML mail body
strBody = MailBody()               

' The following section creates the E-mail object and sends the mail
Set objSendMail = CreateObject("CDONTS.NewMail")

objSendMail.From       = strFrom
objSendMail.To         = strTo
objSendMail.Subject    = strSubject
objSendMail.Body       = strBody
objSendMail.BodyFormat = CdoBodyFormatType
objSendMail.MailFormat = CdoMailFormatType
objSendMail.Importance = CdoNormal

objSendMail.Send

Set  objSendMail = Nothing

' **********************************************************************************

Function MailBody()

  Dim oConn
  Dim oCmd
  Dim oRs
  Dim tmpBody
  set oConn = CreateObject("ADODB.Connection")			
  oConn.Open("DATABASE=Northwind;DSN=Northwind;UID=sa;Password=;")
  set oCmd = CreateObject("ADODB.Command")
  oCmd.ActiveConnection = oConn
  oCmd.CommandText = "select * from Northwind.dbo.[10k_order_qry] order by subtotal desc"
  oCmd.CommandType = 1
  oCmd.Prepared = True
  set oRs = oCmd.Execute
  
  oRs.moveFirst
  tmpBody = "<H2><FONT COLOR=Red>10K Customer Report</FONT></H2>"
  tmpBody = tmpBody & "<B><FONT COLOR=Blue>As of " & Date() & "</FONT><B><BR><BR>"
  tmpBody = tmpBody & "<TABLE BORDER=2><TR BGCOLOR=Skyblue ALIGN=Middle>"
  tmpBody = tmpBody & "<TH><B>ORDER ID</B></TH>"
  tmpBody = tmpBody & "<TH><B>SUBTOTAL</B></TH>"
  tmpBody = tmpBody & "<TH><B>COMPANY</B></TH>"
  tmpBody = tmpBody & "<TH><B>CONTACT</B></TH>"
  tmpBody = tmpBody & "<TH><B>COUNTRY</B></TH>"
  tmpBody = tmpBody & "<TH><B>PHONE</B></TH>"

  while not oRs.EOF		
    tmpBody = tmpBody & "<TR BGCOLOR=Cornsilk><TD>" & oRs.Fields("OrderID") & "</TD>"
    tmpBody = tmpBody & "<TD>" & "$" & oRs.Fields("Subtotal") & "</TD>"
    tmpBody = tmpBody & "<TD>" & oRs.Fields("CompanyName") & "</TD>"
    tmpBody = tmpBody & "<TD>" & oRs.Fields("ContactName") & "</TD>"
    tmpBody = tmpBody & "<TD>" & oRs.Fields("Country") & "</TD>"
    tmpBody = tmpBody & "<TD>" & oRs.Fields("Phone") & "</TD>"
    oRs.moveNext
  wend

  tmpBody = tmpBody & "</TR></TABLE>"
 
  MailBody = tmpBody

  set oRs   = nothing
  set oCmd  = nothing
  set oConn = nothing
  
End Function


---------------------------

StrComputer = "R3328"
Set fso = CreateObject("Scripting.FileSystemObject")
Set WshShell = WScript.CreateObject("WScript.Shell")
Set WshNetwork = WScript.CreateObject("WScript.Network")
sqlServer = strComputer & "RCS0001\PRDRCS" 
sqlProvider = "SQLOLEDB"
sqlUserName = ""
sqlPassword = ""
AppName = "JesseHarris" 
sSQL = "Provider=" & sqlProvider & ";Data Source=" & 
sqlServer & ";User Id=" & sqlUserName & ";Password=" & 
sqlPassword & ";" & "Application Name=" & AppName & ";"

set cn = CreateObject("ADODB.Connection")

cn.Open sSQL
Set rs = CreateObject("ADODB.Recordset")
sSQL = "exec sp_db_status"
rs.Open sSQL,cn,1,1
wscript.echo rs.eof
rs.Close
set rs = nothing
cn.Close
set cn = nothing
quit

--------------------------------

Remote scripting:
=================

Remote WSH, which is a new technology included in WSH 5.6, provides the ability 
to run a script on a remote machine or machines. 
With Remote WSH, the script is physically copied from the local machine to the remote machine 
before executing. In order to enable Remote WSH functionality, you must first set up 
the remote machine with the proper security settings. T
he steps below perform the tasks that enable Remote WSH.

Note   Both the remote and local machines must be running Windows NT 4 SP3 or greater 
in order to use Remote WSH.

To enable a machine to run remote scripts 

- Install WSH V5.6 on the machine. If you are using Windows XP or have installed 
  Internet Explorer 6 or greater, WSH 5.6 has already been installed. 
   Note   WSH 5.6 is available for download from the Web at http://msdn.microsoft.com/scripting
- Add yourself to the remote machine's Local Administrators group. 
- To enable Remote WSH, use the System Policy Editor (Poledit.exe) on the server. 
  Note   An administrator who wants to enable Remote WSH should add a subkey entry 
  named Remote of type REG_SZ to the registry key 
  HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Windows Script Host\Settings. 
  To enable Remote WSH, set the value of Remote to 1; 
  to disable Remote WSH, set the value to 0. If the value of the Remote value is not set, 
  by default Remote WSH is disabled.
  Note   For more information on the System Policy Editor, see the Microsoft Windows online help system.
  WSH is enabled on the machine. To test it, see Running Scripts Remotely. 


Remote WSH, which is a new technology included in WSH 5.6, provides the ability to 
run a script on a remote machine or machines. With Remote WSH, the script is 
physically copied from the local machine to the remote machine before executing. 
In order to enable Remote WSH functionality, you must first set up the remote machine 

with the proper security settings. The steps below perform the tasks that enable Remote WSH.

Note   Both the remote and local machines must be running Windows NT 4 SP3 or greater in order to use Remote WSH.
To enable a machine to run remote scripts 

Install WSH V5.6 on the machine. If you are using Windows 2001 or have installed Internet Explorer 6 
or greater, WSH 5.6 has already been installed. 
Note   WSH 5.6 is available for download from the web at http://msdn.microsoft.com/scripting
Add yourself to the remote machine's Local Administrators group. 
To enable Remote WSH, use Poledit.exe on the server. 
Note   An administrator who wants to enable Remote WSH must either acquire the Windows 2000 resource kit, 
or use http://msdn.microsoft.com/scripting to acquire the necessary windowsscript.adm file that 
contains the WSH settings. The windowsscript.adm file must be copied to the server that sets 
the gapplicabel group's policies. Although it is not necessary to copy the file to the server's \WINNT\INF directory, 
this is nonetheless where the default adm files are located.
Note   For more information on Poledit.exe, see the Poledit.exe's online help system.
WSH should now be enabled on the machine. To test it, see Running Scripts Remotely. 
See Also
Security and Windows Script Host | Running Scripts Remotely


WSH 5.6 can run scripts that reside on remote systems. The following scripts demonstrate this capability. 
These scripts make the assumption that the files are located on a local machine directory called 
"c:\wsh5.6"; change the local path and the remote machine name as necessary.

After initially running RemoteTest.WSF on the local machine, there may be a small pause 
as DCOM verifies your identity. After you see the "Done" message, a file named "c:\beenhere.txt"
on the remote machine indicates the time that you executed the command (from the remote computer's clock).


' VBScript.
RemoteTest.WSF
------------------------------- 
<package>
<job>
<script language="VBScript">
set oController = CreateObject("WSHController")
set oProcess = oController.CreateScript("c:\wsh5.6\beenhere.wsf", "remmachine")
oProcess.Execute
While oProcess.Status <> 2
   WScript.Sleep 100
WEnd
WScript.Echo "Done"
</script>
</job>
</package>
------------------------------- 

BeenHere.WSF
------------------------------- 
<package>
<job>
<script language="VBScript">
set fso = CreateObject("Scripting.FileSystemObject")
set fout = fso.CreateTextFile("c:\beenhere.txt", true)
fout.WriteLine Now
fout.Close
</script>
</job>
</package>


#############################################################################
#############################################################################
Part 7:  Listing Active Directory Objects and Properties
#############################################################################
#############################################################################


Note 1:
=======

The comments list the elements to change if you would a specific organizational unit, 
specific file name, or date format. The HTMLReport object takes the passed string elements 
for generating the report so you can add whatever visual customizations you want (CSS etc.). 
You can grab whatever user attributes you want by utilizing the object attributes (link only works in IE) 
available within Active Directory.

Highlight and copy the following and save it in a file with the extension ".wsf". Make sure you run it 
on a computer in the domain, and the user has adequate credentials to grab this information.

-- Start highlight below this link --
<package>
<job id="User-List">
<script language="vbscript">

Option Explicit
On Error Resume Next

' Declare public variables.
Dim FSO, adsRootDSE, strDomainPath, adsDefaultDomain, strDate, adsUsers, HTMLReport, intCounter

Const ForReading = 1, ForWriting = 2, ForAppending = 8, E_ADS_PROPERTY_NOT_FOUND = &h8000500D

Set FSO = CreateObject("Scripting.FileSystemObject")
Set adsRootDSE = GetObject("LDAP://RootDSE")
strDomainPath = adsRootDSE.Get("DefaultNamingContext")
Set adsDefaultDomain = GetObject("LDAP://" & strDomainPath)
Set adsRootDSE = Nothing

' ** Change these values to reflect your environment.
' This is the current date. Change it to meet whatever date format you would like to use.
strDate = Month(Date) & "-" & Day(Date) & "-" & Year(Date) ' Creates a date string delimited by hyphens.

' Location of user accounts. Default value: ("LDAP://" & strDomainPath) searches the entire domain.
Set adsUsers = GetObject("LDAP://" & strDomainPath)

' The file name for the report.
Set HTMLReport = FSO.OpenTextFile(strDate & ".User-List.html", ForWriting, true)

adsUsers.Filter = Array("organizationalUnit")

' Initialize counter.
intCounter = 0

' Generate HTML headers.
HTMLReport.WriteLine ( "<html><head>")
HTMLReport.WriteLine ( "<title>" & strDate & " - AD User List</title>")
HTMLReport.WriteLine ( "</head><body>")
HTMLReport.WriteLine ( "<h2>AD User List - Generated " & strDate & "</h2>")
HTMLReport.WriteLine ( "<table><tr><td>Common name</td><td>Given name</td><td>Surname</td></tr>")

Call EnumOUs(adsUsers)

' Close out the report.
HTMLReport.WriteLine ( "<h3>" & intCounter & " users.</h3></body></html>")
HTMLReport.Close

' Close out objects and quit the script.
Set HTMLReport = Nothing
Set adsUsers = Nothing
Set adsDefaultDomain = Nothing
Set FSO = Nothing
Wscript.Quit

Sub EnumOUs(objParent)

On Error Resume Next

Dim objUser, cn, givenName, surname, objChild

' Recursive subroutine to enumerate all OU's.
objParent.Filter = Array("User")
For Each objUser in objParent
If objUser.Class = "user" Then
' Expand on this if you would to grab other attributes.
cn = objUser.cn
givenName = objUser.givenName
surname = objUser.sn

' Generate unique row for user.
HTMLReport.WriteLine ( "<tr><td>" & cn & "</td><td>" & givenName & "</td><td>" & surname & "</td><tr>")
End If
Next

objParent.Filter = Array("organizationalUnit")

For Each objChild In objParent
Call EnumOUs(objChild)
Next
End Sub

</script>
</job>
</package>
-- End highlight above this line --


Note 2:
=======


#############################################################################################
#############################################################################################
#############################################################################################


==================================================================
Section 13: OO and C# elementary code fragments and basic theory :
==================================================================


1. First Some stuff About Classes and Objects in other Environments:
====================================================================


This section should explain the basic idea about classes en objects, as it is used
in "older" environments.


1.1 Objects in a language that's often considered "Traditional" (pl/sql):
=========================================================================

Even in what is considered traditional programming enviroments, "object orienting programming"
is now possible. Look for example at the following PL/SQL code in Oracle ( > version Oracle 8):

In PL/SQL, object-oriented programming is based on object types. An object type encapsulates a data structure along 
with the functions and procedures needed to manipulate the data.
The variables that form the data structure are called attributes. 
The functions and procedures that characterize the behavior of the object type are called methods. 

Object types reduce complexity by breaking down a large system into logical entities. 
This allows you to create software components that are modular, maintainable, and reusable. 

When you define an object type using the CREATE TYPE statement (in SQL*Plus for example), 
you create an abstract template for some real-world object. As the following example of a bank account shows, 
the template specifies only those attributes and behaviors the object will need in the application environment: 

Example objecttype:
-------------------

CREATE TYPE Bank_Account AS OBJECT ( 
   acct_number INTEGER(5),
   balance     REAL,
   status      VARCHAR2(10),
   MEMBER PROCEDURE open (amount IN REAL),
   MEMBER PROCEDURE verify_acct (num IN INTEGER),
   MEMBER PROCEDURE close (num IN INTEGER, amount OUT REAL),
   MEMBER PROCEDURE deposit (num IN INTEGER, amount IN REAL),
   MEMBER PROCEDURE withdraw (num IN INTEGER, amount IN REAL),
   MEMBER FUNCTION curr_bal (num IN INTEGER) RETURN REAL 
);

At run time, when the data structure is filled with values, you have created an instance of an abstract bank account. 
You can create as many instances (called objects) as you need. 
Each object has the number, balance, and status of an actual bank account. 

The important thing to notice here, is that Bank_account has internal data, and some type
of internal memberfunctions declared. This is quite typical for OO objects.


1.2 Objects in Traditional C++:
===============================

In C++, classes and objects have the following meaning.

A class is a sort of a template, for creating real instantiated objects.
Look at the class "Book":
It�s a class for someone who shelves books: but a real copy is an instance 
It�s an object for the person does order processing: an instance of class book. 

A class has it's private and public memberfunctions and data.

Put in another way:

A class is a fundamental building block of OO software. 
A class defines a data type, much like a struct would be in C. In a computer science sense, a type consists 
of both a set of states and a set of operations which transition between those states. 
Thus int is a type because it has both a set of states and it has operations like i + j or i++, etc. 
In exactly the same way, a class provides a set of (usually public) operations, and a set of (usually non-public) 
data bits representing the abstract values that instances of the type can have. 

Its also an important feature of OO, that the "internal data" can only get a value, or the values of the
data can only be retrieved, by the member functions of the object. 

You can imagine that int is a class that has member functions called operator++, etc. (int isn't really a class, 
but the basic analogy is this: a class is a type, much like int is a type.) 

After the declaration int i; we say that "i is an object of type int." 
In OO/C++, "object" usually means "an instance of a class." 
Thus a class defines the behavior of possibly many objects (instances). 


-- Example 1. 
-- ----------

// objpart.cpp
// widget part as an object

#include <iostream.h>

class part                     // specify an object
   {

   private:                                         // private data, only visible to the object itself
      int modelnumber;        // ID of widget
      int partnumber;         // ID of widget part
      float cost;             // cost of part

   public:
      void setpart(int mn, int pn, float c)        // memberfunction to set the data. it's public so anyone can call it.
         {
         modelnumber=mn;
         partnumber=pn;
         cost=c;
         }
      void showpart()                             // memberfunction to show the data
         {
         cout << "\nModel "  << modelnumber;      // cout is a standard operator which prints to standard output,
         cout << ", part "   << partnumber;       // which is your screen.
         cout << ", costs $" << cost;
         }
    };


void main()
   {
   part part1;

   part1.setpart(6244,373,217.55);
   part1.showpart();
   }


-- Example 2:
-- ----------

class Animal { 
  public: 
   void Eat( Food* ); 
  protected: 
   float m_fWeight; 
}; 


class Dog : public Animal 
{ public: 
   void Bark( void ); 
  protected: 
   string m_strName; 
}; 


-- instatiate object's

animal elephant;
animal lion;

dog barky;

In example 2, you see the effect of "inheritance": the subclass Dog, inherets the member
function (or method) "Eat" from the parentclass Animal.

Now that an elementary idea of classses and objects is established, We return to C#:


2. BASIC ELEMENTS OF C# PROGRAMS:
=====================================


2.1 CLASSES AND METHODS IN C# IN GENERAL:
========================================


2.1.1. Class Declaration:
==========================

As C# is an object-oriented language, C# programs must be placed in classes.
To illustrate a Class declaration in general, look at the following example:

Example 1:
----------

public class Cat
{
    public Cat(string aName)             // memberfunction or method
      {
          _name = aName;
      }
    
    public void sleep()                 // memberfunction or method
      {
          _sleeping = true;
      }
    
    public void wake()                  // memberfunction or method
      {
          _sleeping = false;
      } 

    // Member variables
 
    protected bool   _sleeping;
    protected string _name;
}

To instantiate an object of class Cat:

  Cat anastasia = new Cat();

To call a method, or memberfunction, of anastasia:

  anastasia.Sleep();


- "Public" means that an external entity, can call the function.
- "Private" means that it's for internal use only. The object does not expose the member.
- "Protected" means that it's available for components in the Assembly (see a later section).


2.1.2 METHODS:
==============

In C++, you can write functions which have nothing to do with a class.
But also in C++, most of the time, memberfunctions of a class, are defined inside the class specifier.
But this need not always be the case: it's possible to declare the memberfunction
inside the class, but the body of the function is listed elsewhere,

In C# all methods exist as class members because C# does not support standalone functions.
Methods are always declared as part of the class declaration. Unlike C++, there is no way
to declare a C# method implementation seperately from the class declaration.

A method declaration consists of a
accessibility level, a return type, a name and a list of zero or more parameters

Example 1:
----------

public int GetNextRecordId(string TableName)
{
 ...
}


If the accessibility level is omitted, the level is private by default.

Methods can be declared as static, and in this case the method must be called
using the class name rather than using an object reference.

Example 2:
----------

// define the class Cat

class Cat: Animal
{
   public static void ChaseMouse()
   {
    ...
   }
}

// define the class CatApp

class CatApp
{
  static void Main()
  {
     Cat c = new Cat();

     // This is not OK:
     c.ChaseMouse();

     // This is OK:
     Cat.ChaseMouse();
  }
  ...
}


2.1.3 Inheritance:
==================

Class Animal
{
   public void Eat()
     {
      ..
     }
   public void Sleep()
     {
     ..
     }
}
  

class Cat: Animal
{
    public static void ChaseMouse()
    {
    ..
    }
}

Here, Cat inherits all members from the Animal class. Inheritance enables you to look for similarities 
between classes and factor those similarities out into base classes that are shared by descendent classes. 
For example, all of our animal classes like Cat or Dog, share the common characteristics Eat and Sleep. 

All types in C# are ultimately derived from the object class. Of course, not all classes derive directly 
from object, but if you follow the inheritance hierarchy for any type, you�ll eventually come to object. 
In fact, deriving from object is such a fundamental aspect of programming in C# that the compiler will 
automatically generate the necessary code to inherit from object if you don�t specify any inheritance 
for your class. This means that the following two class definitions are identical:

class Cat
{    
..
}

class Cat: object
{
..    
} 


Declaring a Class as abstract or sealed:
----------------------------------------

When you�re declaring a hierarchy of classes such as the animal classes, the base classes are often incomplete 
and shouldn�t be directly instantiated. Instead of directly creating an object from the Animal class, 
it�s more appropriate to create an instance of the Cat or Dog class.

In C#, base classes that aren�t to be directly instantiated are tagged with the abstract keyword, as follows:

abstract class Animal
{
..
}

Using the abstract keyword with your base classes allows the compiler to enforce proper use of your 
class hierarchy by preventing base classes from being directly instantiated.

If a method signature is defined in an abstract base class but not implemented in the class, 
it is marked as abstract, as shown here:

abstract class Horse
{
    abstract public void ChaseAfterBadGuys();
}


Communicating with the base class:
----------------------------------

Like C++, C# allows you to access the current object through the this keyword. In addition, C# allows 
you to access the members of the immediate base class through the base keyword. 
The following example calls the PreDraw method of the CommandWindow class:

class MyCommandWindow: CommandWindow
{
    public void DrawShape()
    {
        base.PreDraw();
        Draw();
    }

}


2.2 Main method:
================

A Console mode application (dos box), has a Main method that serves as the entrypoint for your application.
Take a look at the following "HelloWorld" application: 


using System;

namespace HelloWorld

{
   class HelloWorldApp
   {
       static void Main(string[] args)
       {
          Console.WriteLine("Hello World!");
       }

   }
}

The first thing to note about C# is that it is case-sensitive. You will therefore get compiler errors if, 
for instance, you write 'console' rather than 'Console'. 

The second thing to note is that every statement finishes with a semicolon (;) or else takes a 
code block within curly braces. 

Line 1 of the code declares we are using the System namespace (namespaces are also covered later). 
The point of this declaration is mostly to save ourselves time typing. Because the 'Console' object 
used in line 10 of the code actually belongs to the 'System' namespace, its fully qualified name is 
'System.Console'. However, because in line 1 we declare that the code is using the System namespace, 
we can then leave off the 'System.' part of its name within the code. 

The namespace "HelloWorld" helps to insulate any types you create from types that might exist
elswhere in the .NET framework.

When compiled and run, the program above will automatically run the 'Main' method declared 
and begun in line 6. Note again C#'s case-sensitivity - the method is 'Main' rather than 'main'. 

In order to run it, the program above must first be saved in a file. Unlike in Java, the name 
of the class and the name of the file in which it is saved do not need to match up, although it does 
make things easier if you use this convention. In addition, you are free to choose any extension 
for the file, but it is usual to use the extension '.cs'. 

Suppose that you have saved the file as 'HelloWorld.cs'. Then to compile the program from a command line, 
you would use the command 

csc HelloWorld.cs

This command would generate the executable HelloWorld.exe, which could be run in the usual way, by entering its name: 

HelloWorld

You can use the following "flavors" which explain the "void" and "args" elements:


static void Main(string[] args)
{
   // No return values (void); accepts command-line parameters
}


static int Main(string[] args)
{
   // Returns integer value; accepts command-line parameters
}

static void Main()
{
   // No return values (void); no command-line parameters
}

static int Main()
{
   // Returns integer value; no command-line parameters
}


Before the word Main is a static modifier. The static modifier explains that this method works in this 
specific class only, rather than an instance of the class.  
This is necessary, because when a program begins, no object instances exist. 


2.3 Pointers:
=============

Maybe it's strange to start with an element that's relatively not much encouraged in C#.

But pointers are well known from languages as C,C++ etc.. 
In C#, also because of the "Garbage collection process", the use of pointers
is limited to "unsafe code", that is, code that is "marked" as unsafe.
So it's understandable that pointers are avoided in C# code.

However, we still need to know what a pointer is, and how it can be used.

A pointer is a variable that holds the memory address of another type.
Pointers are declared implicitly, using the 'dereferencer' symbol *, as in the following example: 

int *p; 

What does it mean? Look at the following examples:

Example 1:
----------

int i = 5;
int *p;
p = &i; 


Here you should know that the symbol & is the operator which in this context returns the memory address 
of the variable it prefixes. 

Given the above, the code 

*p = 10; 

changes the value of i to 10, since '*p' can be read as 'the integer located at the memory value held by p'. 

Example 2:
----------

A pointer can be declared in relation to an array, as in the following: 

int[] a = {4, 5};
int *b = a; 

What happens in this case is that the memory location held by b is the location of the first type held by a. 
This first type must, as before, be a value type.


2.4 Introduction to arrays 
==========================


An array is a reference type that contains a sequence of variables of a specific type (value types or reference types). 
An array is declared by including index brackets between the type and the name of the array variable, as shown here:

int [] ages;

This example declares a variable named ages that�s an array of int, but it doesn�t attach that reference variable 
to an actual array object. To do so requires that the array be initialized, as shown here:

int [] ages = {5, 8, 39};

Arrays are reference types that the Visual C# .NET compiler automatically subclasses from the System.Array class. 
When an array contains value types, the space for the types is allocated as part of the array. When an array 
contains �reference elements, the array contains only references�the objects are allocated elsewhere on the managed heap,

The individual elements of an array are accessed through an index, with 0 always referring to the first element 
in the array, as follows:

int currentAge = ages[0];

Some interresting properties:

-- You can determine the number of elements in an array by using the Length property:

   int elements = nameArray.Length;

-- An array can be cloned with the Clone method, which returns a new copy of the array. Because Clone is declared as 
   returning an array of object, you must explicitly state the type of the new array, as follows:

   string [] secondArray = (string[])nameArray.Clone();


-- Clear is a static method in the Array class that removes one or more of the array elements by setting 
   the removed array elements to 0 (for value types) or null (for reference types). The array to be cleared is passed 
   as the first parameter, along with the index of the first element to clear and the number of elements be removed. 
   To eliminate all the elements of the array, pass 0 as the start element and the array length as the third parameter, 
   as shown here:

   Array.Clear(nameArray, 0, nameArray.Length);


-- Reverse is a static method in the Array class that reverses the order of array elements, operating on either 
   the complete array or just a subset of elements. To reverse an entire array, simply pass the array to the 
   static method, as shown here:

   Array.Reverse(nameArray);

   To reverse a range within the array, pass the array along with the start element and the number of items 
   to be reversed.

   Array.Reverse(nameArray, 0, nameArray.Length);

-- Sort is a static method that sorts an array. There are several versions of Sort; the simplest version 
   accepts an array as its only parameter and sorts the elements in ascending order.

   Array.Sort(nameArray);


The following example manipulates an array containing the names of the month. 
The array is examined, reversed, sorted, cloned, and finally cleared.


using System;
namespace MSPress.CSharpCoreRef.ArrayExample
{
    class ArrayExampleApp
    {
        static void Main(string[] args)
        {
            string [] months = { "January", "February", "March",
                                 "April", "May", "June", "July",
                                 "August", "September", "October",
                                 "November", "December"};
            Console.WriteLine("The array has a rank of {0}.",
                              months.Rank);
            int elements = months.Length;
            Console.WriteLine("There are {0} elements in the array.",
                              elements);

            Console.WriteLine("Reversing...");
            Array.Reverse(months);
            PrintArray(months);

            Console.WriteLine("Sorting...");
            Array.Sort(months);
            PrintArray(months);

            string [] secondArray = (string[])months.Clone();
            Console.WriteLine("Cloned Array...");
            PrintArray(months);
            
            Console.WriteLine("Clearing...");
            Array.Clear(months, 0, months.Length);
            PrintArray(months);
        }
        /// <summary>
        /// Print each element in the names array.
        /// </summary>
        static void PrintArray(string[] names)
        {
            foreach(string name in names)
            {
                Console.WriteLine(name);
            }
        }
    }
}


2.5 Loops and flow control:
===========================

C# provides a number of the common loop, and flow statements: 

- while
- do-while
- for
- foreach 
- break
- continue
- goto
- return 
- throw 

Here we will illustrate them with some simple examples: 


WHILE LOOPS:
============

syntax: while (expression) statement[s] 

A 'while' loop executes a statement, or a block of statements wrapped in curly braces, 
repeatedly until the condition specified by the boolean expression returns false. 

Example 1:
----------

int a = 0;
while (a < 3)
 
{
 
 System.Console.WriteLine(a);
 a++;
 
}
 
It produces the following output: 

0
1
2 

Example 2:
----------

using System;

class WhileLoop
{
    public static void Main()
    {
        int myInt = 0;

        while (myInt < 10)
        {
            Console.Write("{0} ", myInt);
            myInt++;
        }
        Console.WriteLine();
    }
} 


DO-WHILE LOOPS:
===============

syntax: do statement[s] while (expression) 

do-while' loop is just like a 'while' loop except that the condition is evaluated after the block of code 
specified in the 'do' clause has been run. So even where the condition is initially false, 
the block runs once. For instance, the following code outputs '4': 

Example 1:
----------

int a = 4;
do
 
{
 
 System.Console.WriteLine(a);
 a++;
 
} while (a < 3);
 

Example 2:
----------

using System;

class DoLoop
{
    public static void Main()
    {
        string myChoice;

        do
       {
            // Print A Menu
            Console.WriteLine("My Address Book\n");

            Console.WriteLine("A - Add New Address");
            Console.WriteLine("D - Delete Address");
            Console.WriteLine("M - Modify Address");
            Console.WriteLine("V - View Addresses");
            Console.WriteLine("Q - Quit\n");

            Console.WriteLine("Choice (A,D,M,V,or Q): ");

            // Retrieve the user's choice
            myChoice = Console.ReadLine();

            // Make a decision based on the user's choice
            switch(myChoice)
            {
                case "A":
                case "a":
                    Console.WriteLine("You wish to add an address.");
                    break;
                case "D":
                case "d":
                    Console.WriteLine("You wish to delete an address.");
                    break;
                case "M":
                case "m":
                    Console.WriteLine("You wish to modify an address.");
                    break;
                case "V":
                case "v":
                    Console.WriteLine("You wish to view the address list.");
                    break;
                case "Q":
                case "q":
                    Console.WriteLine("Bye.");
                    break;
                default:
                    Console.WriteLine("{0} is not a valid choice", myChoice);
                    break;
            }

            // Pause to allow the user to see the results
            Console.Write("Press Enter key to continue...");
            Console.ReadLine();
            Console.WriteLine();
        } while (myChoice != "Q" && myChoice != "q"); // Keep going until the user wants to quit
    }
} 


FOR LOOPS:
========== 

syntax: for (statement1; expression; statement2) statement[s]3 

The 'for' clause contains three parts. Statement1 is executed before the loop is entered. 
'For' loops tend to be used when one needs to maintain an iterator value. 
Usually, as in the following example, the first statement initialises the iterator, 
the condition evaluates it against an end value, and the second statement changes the iterator value. 

Example 1:
----------

for (int a =0; a<5; a++)

{
 
 System.Console.WriteLine(a);
 
}

Example 2:
----------

using System;

class ForLoop
{
    public static void Main()
    {
        for (int i=0; i < 20; i++)
        {
            if (i == 10)
                break;

            if (i % 2 == 0)
                continue;

            Console.Write("{0} ", i);
        }
        Console.WriteLine();
    }
} 
 
 
FOREACH LOOPS:
==============

syntax: foreach (variable1 in variable2) statement[s] 

The 'foreach' loop is used to iterate through the values contained by any object which implements 
the IEnumerable interface. When a 'foreach' loop runs, the given variable1 is set in turn to each value 
exposed by the object named by variable2. As we have seen previously, such loops can be used to access 
array values. So, we could loop through the values of an array in the following way: 

Example 1:
----------

int[] a = new int[]{1,2,3};
foreach (int b in a) 
  System.Console.WriteLine(b);
 

The main drawback of 'foreach' loops is that each value extracted (held in the given example 
by the variable 'b') is read-only. 


Example 2:
----------

using System;

class ForEachLoop
{
    public static void Main()
    {
        string[] names = {"Cheryl", "Joe", "Matt", "Robert"};

        foreach (string person in names)
        {
            Console.WriteLine("{0} ", person);
        }
    }
}

The foreach loop allows you to iterate through a collection. 
An array, is one such collection 
 
 
BREAK:
======

The 'break' statement breaks out of the 'while' and 'for' loops, and the 'switch' statements 
covered later in this lesson. The following code gives an example - albeit a very inefficient one - 
of how it could be used. The output of the loop is the numbers from 0 to 4. 

int a = 0;
while (true)
{
  System.Console.WriteLine(a);
  a++;
  if (a == 5)
  break;
}

 
CONTINUE:
=========

The 'continue' statement can be placed in any loop structure. When it executes, 
it moves the program counter immediately to the next iteration of the loop. 
The following code example uses the 'continue' statement to count the number of 
between 1 and 100 inclusive that are not multiples of seven. At the end of the loop 
the variable y holds the required value. 

int y = 0;
for (int x=1; x<101; x++)
{
 if ((x % 7) == 0)
 continue;
 y++;
}
 
GOTO:
===== 
 
The 'goto' statement is used to make a jump to a particular labelled part of the program code.
 It is also used in the 'switch' statement described below. We can use a 'goto' statement to construct 
a loop, as in the following example (but again, this usage is not recommended): 

int a = 0;
start:
System.Console.WriteLine(a);
a++;
if (a < 5)
goto start;
 

IF ELSE:
======== 
 
'If-else' statements are used to run blocks of code conditionally upon a boolean expression 
evaluating to true. The 'else' clause, present in the following example, is optional. 

if (a == 5)
   System.Console.WriteLine("A is 5");
else
   System.Console.WriteLine("A is not 5");
 
 
If statements can also be emulated by using the conditional operator "?". 
The conditional operator returns one of two values, depending upon the value of a boolean expression. 
To take a simple example, the line of code 

int i = (myBoolean) ? 1 : 0 ; 

sets i to 1 if myBoolean is true, and sets i to 0 if myBoolean is false. 
The 'if' statement in the previous code example could therefore be written like this: 

System.Console.WriteLine( a==5 ? "A is 5" : "A is not 5");
 
 
SWITCH:
=======

This is a sort "case" statement, known from other environments.
'Switch' statements provide a clean way of writing multiple if - else statements. In the following example, 
the variable whose value is in question, is 'a'. If a equals 1, then the output is 'a>0'; 
if a equals 2, then the output is 'a>1 and a>0'. Otherwise, it is reported that the variable is not set. 

switch(a)
{
  case 2:
  Console.WriteLine("a>1 and ");
  goto case 1;

  case 1:
  Console.WriteLine("a>0");
  break;

  default:
  Console.WriteLine("a is not set");
  break;
 }
 
 
Each case (where this is taken to include the 'default' case) will either have code specifying a conditional action, 
or no such code. Where a case does have such code, the code must (unless the case is the last one 
in the switch statement) end with one of the following statements: 

break;
goto case k; (where k is one of the cases specified)
goto default; 

From the above it can be seen that C# 'switch' statements lack the default 'fall through' 
behaviour found in C++ and Java. 


2.6 How to write some of those handy shortcuts:
===============================================

Some shorthand notations, like for example "a++", needs to be explained:

n++ or ++n :
------------

The ++ and -- operators work with a single operand, incrementing or decrementing the value of the operand. 
You can use either the prefix or postfix versions of these operators. The difference is subtle but important 
if you�re testing for the resulting value of the expression. A prefix increment or decrement operation 
increments the value of the operand, and the resulting expression is the changed value of the operand, 
as shown here:

int n = 41;
int answer = ++n; // answer = 42, n = 42;

Although a postfix increment or decrement operation increments the value of the operand, 
the resulting expression has the value of the operand before the operator was applied, as shown here:

int n = 41;
int answer = n++; // answer = 41, n = 42;

+=:
---

The addition assignment operator (+=) combines addition and assignment, adding the first operand to the second 
and then storing the result in the first operand, as shown here:

y = 40;
y += 2;

This example is equivalent to the following code:

y = 40;
y = y + 2; 


2.7 Value types and reference types:
====================================

Here's a quick recap of the difference between value types and reference types. 

- where a variable v contains a value type, it directly contains an object with some value. 
  No other variable v' can directly contain the object contained by v (although v' might contain an object 
  with the same value). 

Value types in Visual C# include the primitive types such as int, float, and decimal. 
Value types also include enum and struct types (discussed in Chapter 2). Other types in the .NET Framework 
are also explicitly declared to be value types, 
such as the Rectangle, Point, and Size structures found in the System.Drawing namespace

When a value type variable is declared in Visual C#, the variable contains the actual type instance. 
For example, an integer named Age is declared as follows:

int Age = 42;

The Visual C# .NET compiler will allocate 4 bytes of stack area for the Age variable, making it available 
for direct access without any indirection to the managed heap. For consistency, value type variables can also 
be declared using the new syntax, as shown here:

int Age = new int(42);

In the following lines of code, two variables are declared and set with integer values. 

int x = 10;
int y = x;
y = 20; // after this statement x holds value 10 and y holds value 20 


- where a variable v contains a reference type, what it directly contains is something which refers to an object. 
  Another variable v' can contain a reference to the same object refered to by v. 

Reference types actually hold the value of a memory address occupied by the object they reference. Consider the following piece of code, in which two variables are given a reference to the same object (for the sake of the example, this object is taken to contain the numeric property 'myValue'). 

object x = new object();
x.myValue = 10;
object y = x;
y.myValue = 20; // after this statement both x.myValue and y.myValue equal 20 

If you take a look at the following code:

Shape rect=new Shape();
Shape tempRect=rect;

Then both variables point to the same object.

Boxing:
-------

C# allows you convert any value type to a corresponding reference type, and to convert the resultant 
'boxed' type back again. The following piece of code demonstrates boxing. When the second line executes, 
an object is initiated as the value of 'box', and the value held by i is copied across to this object. 
It is interesting to note that the runtime type of box is returned as the boxed value type; the 'is' operator 
thus returns the type of box below as 'int'. 

int i = 123;
object box = i;
if (box is int) 
{Console.Write("Box contains an int");} // this line is printed 


2.8 Exception handling Try.. Catch.. Finally:
=============================================

Exceptions are unforeseen errors that happen in your programs.  Most of the time, you can, and should, 
detect and handle program errors in your code.  For example, validating user input, checking for null objects, 
and verifying the values returned from methods are what you expect, are all examples of good standard error handling 
that you should be doing all the time.  

However, there are times when you don't know if an error will occur.  For example, you can't predict when 
you'll receive a file I/O error, run out of system memory, or encounter a database error.  

When exceptions occur, they are said to be "thrown".  What is actually thrown is an object that is derived 
from the System.Exception class.  In the next section, I'll be explaining how thrown exceptions are handled 
with try/catch blocks.  

The System.Exception class provides several methods and properties for obtaining information on what went wrong.  
For example, the Message property provides summary information about what the error was, the StackTrace property 
provides information from the stack for where the problem occurred, and the ToString() method is overridden 
to reveal a verbose description of the entire exception.

Identifying the exceptions you'll need to handle depends on the routine you're writing.  
For example, if the routine opened a file with the "System.IO.File.OpenRead()" method, it could throw any of 
the following exceptions:

SecurityException 
ArgumentException 
ArgumentNullException 
PathTooLongException 
DirectoryNotFoundException 
UnauthorizedAccessException 
FileNotFoundException 
NotSupportedException

It's easy to find out what exceptions a method can raise by looking in the .NET Frameworks SDK Documentation.  
Just go to the Reference/Class Library section and look in the Namespace/Class/Method documentation 
for the methods you use.  The exception in the list above were found by looking at the OpenRead() method definition 
of the File class in the System.IO namespace.  Each exception identified has a hyperlink to its class definition 
that you can use to find out what that exception is about.  Once you've figured out what exceptions can be generated 
in your code, you need to put the mechanisms in place to handle the exceptions, should they occur.

try/catch Blocks
----------------

When exceptions are thrown, you need to be able to handle them. This is done by implementing a try/catch block.  
Code that could throw an exception is put in the try block an exception handling code goes in the catch block.  
The following listing shows how to implement a try/catch block.  Since an OpenRead() method could throw one of 
several exceptions, it is placed in the try block.  If an exception is thrown, it will be caught in the catch block.  
The code will print message and stack trace information out to the console if an exception is raised.


using System;
using System.IO;

class TryCatchDemo
{
    static void Main(string[] args)
    {
        try
        {
            File.OpenRead("NonExistentFile");  // on purpose a non-existent file
        }
        catch(Exception ex)
        {
            Console.WriteLine(ex.ToString());
        }
    }
}


Although the code above only has a single catch block, all exceptions will be caught there because the type 
is of the base exception type "Exception".  In exception handling, more specific exceptions will be caught 
before their more general parent exceptions.  For example, the following snippet shows how to place multiple catch blocks:

        catch(FileNotFoundException fnfex)
        {
            Console.WriteLine(fnfex.ToString());
        }
        catch(Exception ex)
        {
            Console.WriteLine(ex.ToString());
        }


If the file doesn't exist, a FileNotFoundException exception will be thrown and caught by the first catch block.  
However, if a PathTooLongException exception was raised, the second catch part would catch the exception.  
This is because there isn't a catch block for the PathTooLongException exception and the generic Exception type 
catch block is the only option available to catch the exception.

Exceptions that are not handled will normally bubble up the stack until a calling routine in the call chain 
handles them.  If you forget to include try/catch blocks in a part of your code and there aren't any try/catch 
blocks earlier in the call chain, your program will abort with a message describing the exception.  
To your users this would be very cryptic and uncomfortable.  It is good practice to provide exception handling 
in your programs.

Finally Blocks:
---------------

An exception can leave your program in an inconsistent state by not releasing resources or doing some other 
type of cleanup.  A catch block is a good place to figure out what may have went wrong and try to recover, 
however it can't account for all scenarios.  Sometimes you need to perform clean up actions whether or not 
your program succeeds.  These situations are good candidates for using a finally block.
In C#, a finally block is implemented as "garanteed to run" at all times.

The following code illustrates the usefulness of a finally block.  As you know, a file stream must be closed 
when your done with it.  In this case, the file stream is the resource that needs to be cleaned up.  
In the below code, outStream is opened successfully, meaning the program now has a handle to an open file resource.  
When trying to open the inStream, a FileNotFoundException exception is raised, causing control to go immediately 
to the catch block.

It's possible to close the outStream in the catch block, but what if the algorithm executed successfully without 
an exception?  On success, the file would never be closed.  Fortunately, we've included a finally block 
which will always be executed.  That's right, regardless of whether the algorithm in the try block raises 
an exception or not, the code in the finally block will be executed before control leaves the method.

using System;
using System.IO;

class FinallyDemo
{
    static void Main(string[] args)
    {
        FileStream outStream = null;
        FileStream inStream = null;

        try
        {
            outStream = File.OpenWrite("DestinationFile.txt");
            inStream = File.OpenRead("BogusInputFile.txt");
        }
        catch(Exception ex)
        {
            Console.WriteLine(ex.ToString());
        }
        finally
        {
            if (outStream != null)
            {
                outStream.Close();
                Console.WriteLine("outStream closed.");
            }
            if (inStream != null)
            {
                inStream.Close();
                Console.WriteLine("inStream closed.");
            }
        }
    }
}

  
A finally block is not required and you may ask what happens if you just put code after the catch block.  
True, under normal circumstances, if the exception is caught, all code following the catch will be executed.  
However, try/catch/finally is for exceptional circumstances and it is better to plan for the worst to make your 
program more robust.  For example, if one of the catch handlers rethrew and exception or caused another exception, 
the code following the catch block (not in a finally block) would never be executed.  Also, if you don't catch 
the exception at all, program flow would immediately do a stack walk looking for an exception handler that fits 
and the code following the catch blocks would not be executed.  Since there is too much potential for code in an 
algorithm to not be executed, a finally block is your insurance for executing those critical actions you need.


Before we go deeper into C#, we will talk a bit about the .NET architecture and program structures.


3. About the .NET Architecture:
==================================
 
The .NET platform consists of the CLR (common language runtime) that is responsible for managing
and executing code written for the .NET framework, and a large set of Class libraries.

The .NET Framework has two main components: 

The CLR (common language runtime) and the .NET Framework class library. 

CLR:
====

The common language runtime is the foundation of the .NET Framework. You can think of the runtime as an agent 
that manages code at execution time, providing core services such as memory management, thread management,
and remoting, while also enforcing strict type safety and other forms of code accuracy that ensure security 
and robustness. 

In fact, the concept of code management is a fundamental principle of the runtime. 
Code that targets the runtime is known as managed code, while code that does not target the runtime is known 
as unmanaged code. The class library, the other main component of the .NET Framework, is a comprehensive, 
object-oriented collection of reusable types (classes) that you can use to develop applications ranging from 
traditional command-line or graphical user interface (GUI) applications to applications based on the latest 
innovations provided by ASP.NET, such as Web Forms and XML Web services.

You can make some comparison to a virtual machine, on which the executing code runs, but it's not
the same all the way. 
 
The runtime is designed to enhance performance. Although the common language runtime provides many 
standard runtime services, managed code is never interpreted. A feature called just-in-time (JIT) compiling 
enables all managed code to run in the native machine language of the system on which it is executing. 
Meanwhile, the memory manager removes the possibilities of fragmented memory and increases memory 
locality-of-reference to further increase performance.

.NET Framework Class Library
============================

The .NET Framework class library is a collection of reusable types that tightly integrate with the 
common language runtime. The class library is object oriented, providing types from which your own managed code 
can derive functionality. This not only makes the .NET Framework types easy to use, but also reduces 
the time associated with learning new features of the .NET Framework. In addition, third-party components 
can integrate seamlessly with classes in the .NET Framework.

For example, the .NET Framework collection classes implement a set of interfaces that you can use to develop 
your own collection classes. Your collection classes will blend seamlessly with the classes in the .NET Framework.

As you would expect from an object-oriented class library, the .NET Framework types enable you to accomplish a 
range of common programming tasks, including tasks such as string management, data collection, 
database connectivity, and file access. In addition to these common tasks, the class library includes types 
that support a variety of specialized development scenarios. For example, you can use the .NET Framework 
to develop the following types of applications and services: 

- Console applications. 
- Scripted or hosted applications. 
- Windows GUI applications (Windows Forms). 
- ASP.NET applications (Web forms). 
- XML Web services. 
- Windows services. 

For example, the Windows Forms classes are a comprehensive set of reusable types that vastly simplify 
Windows GUI development. If you write an ASP.NET Web Form application, you can use the Web Forms classes.

 
.NET program architecture:
==========================

The Visual C# compiler (or other tool, such as "csc" of the SDK), does not generate machine code that can
be executed directly on your computer. Instead your project's source code is compiled into an ASSEMBLY,
as we have tried to visualize in the underneath figure.

Assemblies are the building blocks of .NET Framework applications; they form the fundamental unit 
of deployment, version control, reuse, activation scoping, and security permissions. 
An assembly is a collection of types and resources that are built to work together and form a logical unit
of functionality. An assembly provides the common language runtime with the information it needs 
to be aware of type implementations. To the runtime, a type does not exist outside the context of an assembly.

  --------------------
  |assembly          |  
  | ---------------- | 
  | | METADATA     | |
  | |              | |
  | ---------------- |
  | | Intermediate | |
  | | Language IL  | |
  | ---------------- |
  --------------------

An Assembly has two parts: MSIL and Metadata. 
The IL contains the executable portion of the program. But it cannot be be executed directly on you computer
because it hasn't been translated into the binary format. Instead, it must undergo a final compilation
pass by a compiler that's part of the .NET framework.

IL:
---

When the code in an assembly must be executed, the runtime compiles the assembly into machine code.
However, the entire assembly isn't compiled in one step. Instead, each method in the assembly is compiled
as it is needed in a process known as "just-in-time compilation" or jitting.
As an option, you may choose to compile your assembly into processor specific code. This is not done
by the Visual C# .NET compiler. The tool you can use for this purpose is "ngen.exe".

Metadata:
---------

This describes completely the assembly contents. It makes an assembly completely "self describing"
and eliminates the need for component registration.
Since metadata is stored in a programming-language-independent fashion
with the code, not in a central store such as the Windows Registry, it makes
.NET applications self-describing. The metadata can be queried at runtime to
get information about the code.

Types of Assemblies:
--------------------

Private assemblies: 
Used by a single application, typically located in the same directory as
the application that uses them. Because a private assembly isn't shared with other applications,
they can be easily updated or replaced, with no impact on other applications.

Shared assemblies: 
Intended for use by multiple applications. A shared assembly has restrictions
placed on it by the runtime and must adhere to naming and versioning rules.

An aplication that depends on private assemblies can just be copied or moved
to another machine (which has the .NET framework also) and it can be run straight away.

An assembly in it's most common form "looks" like any other .EXE or .DLL.


Other features of assemblies:
-----------------------------

It contains code that the common language runtime executes. Microsoft intermediate language (MSIL) code 
in a portable executable (PE) file will not be executed if it does not have an associated assembly manifest. 
Note that each assembly can have only one entry point (that is, DllMain, WinMain, or Main). 

It forms a security boundary. An assembly is the unit at which permissions are requested and granted. 

It forms a type boundary. Every type's identity includes the name of the assembly in which it resides. 
A type called MyType loaded in the scope of one assembly is not the same as a type called MyType loaded 
in the scope of another assembly. 

It forms a reference scope boundary. The assembly's manifest contains assembly metadata that is used 
for resolving types and satisfying resource requests. It specifies the types and resources that are exposed 
outside the assembly. The manifest also enumerates other assemblies on which it depends.
 
It forms a version boundary. The assembly is the smallest versionable unit in the common language runtime; 
all types and resources in the same assembly are versioned as a unit. The assembly's manifest describes 
the version dependencies you specify for any dependent assemblies. 

It forms a deployment unit. When an application starts, only the assemblies that the application initially 
calls must be present. Other assemblies, such as localization resources or assemblies containing 
utility classes, can be retrieved on demand. This allows applications to be kept simple and thin 
when first downloaded. 

Assemblies can be static or dynamic. Static assemblies can include .NET Framework types (interfaces and classes), 
as well as resources for the assembly (bitmaps, JPEG files, resource files, and so on). 
Static assemblies are stored on disk in PE files. 
You can also use the .NET Framework to create dynamic assemblies, which are run directly from memory 
and are not saved to disk before execution. You can save dynamic assemblies to disk after they have executed.

There are several ways to create assemblies. You can use development tools, such as Visual Studio .NET, 
that you have used in the past to create .dll or .exe files. You can use tools provided in the 
.NET Framework SDK to create assemblies with modules created in other development environments. 
You can also use common language runtime APIs, such as Reflection.Emit, to create dynamic assemblies.


4. EXAMPLES OF SIMPLE C# PROGRAMS:
==================================

In the first sections, we will stick to CONSOLE programs. In section 6, we will turn our attention
to Windows applications.

Console programs are programs without a GUI. On windows it means running a program
from a DOS box.
Ofcourse, also with console programs, everything is coded in classes.
But console programs always must have a Main() method.

You can write the source with any text editor (notepad, textpad etc..).
Save the file with an appropriate name. Recommended is to let the name be the same name
as you have named the class.

From the DOt NET Framework SDK, you have the csc.exe (c sharp compile) utility, with which
you can compile your code to an executable.
Ofcourse, if you have Visual Studio .NET, that would be mutch better.


4.1 Hello example with argument(s):
===================================

// Namespace Declaration
using System;

// Program start class
class NamedWelcome
{
    // Main begins program execution.
    public static void Main(string[] args)
    {
        // Write to console
        Console.WriteLine("Hello, {0}!", args[0]);
        Console.WriteLine("Welcome to the C# Station Tutorial!"); 
    }
} 


Note that identifiers are case sensitive. Also note that you do not need 
to close a block with ";".


Compile:
--------

E:\data\C#\projects\p1>csc NamedWelcome.cs
Microsoft (R) Visual C# .NET Compiler version 7.00.9466
for Microsoft (R) .NET Framework version 1.0.3705
Copyright (C) Microsoft Corporation 2001. All rights reserved.

Run:
----

E:\data\C#\projects\p1>NamedWelcome appie
Hello, appie!
Welcome to the C# Station Tutorial!

E:\data\C#\projects\p1>NamedWelcome appie,piet
Hello, appie,piet!
Welcome to the C# Station Tutorial!

E:\data\C#\projects\p1>NamedWelcome appie and piet
Hello, appie!
Welcome to the C# Station Tutorial!


Explanation:
------------

In Listing 5.1, you'll notice an entry in the Main method's parameter list. The parameter name is args.  
It is what you use to refer to the parameter later in your program. The string[] expression defines 
the type of parameter that args is. The string type holds characters. These characters could form 
a single word, or multiple words.  The "[]", square brackets denote an Array, which is like a list.  
Therefore, the type of the args parameter, is a list of words from the command-line.

You'll also notice an additional Console.WriteLine(...) statement within the Main method.  
It has a formatted string with a "{0}" parameter embedded in it. The first parameter in a formatted string 
begins at number 0, the second is 1, and so on.  The "{0}" parameter means that the next argument following 
the end quote will determine what goes in that position.  Hold that thought, and now we'll look at the next 
argument following the end quote.  
This is the args[0] argument, which refers to the first string in the args array.  The first element of 
an Array is number 0, the second is number 1, and so on.  For example, if I wrote "NamedWelcome Joe" 
on the command-line, the value of args[0] would be "Joe".

Now we'll get back to the embedded "{0}" parameter in the formatted string.  Since args[0] is the 
first argument, after the formatted string, of the Console.WriteLine() statement, its value will be placed 
into the first embedded parameter of the formatted string.  When this command is executed, the value of args[0],
which is "Joe" will replace "{0}" in the formatted string.  Upon execution of the command-line with "NamedWelcome Joe",
the output will be as follows:

>Hello, Joe! 
>Welcome to the C# Station Tutorial! 


4.2 Getting Interactive Input:  
------------------------------

// Namespace Declaration
using System;

// Program start class
class InteractiveWelcome
{
    // Main begins program execution.
    public static void Main()
    {
        // Write to console/get input
        Console.Write("What is your name?: ");
        Console.Write("Hello, {0}! ", Console.ReadLine());
        Console.WriteLine("Welcome to the C# Station Tutorial!"); 
    }
} 

Another way to provide input to a program is via the console.  Listing 5.2 shows how to obtain 
interactive input from the user.

This time, the Main method doesn't have any parameters.  However, there are now three statements 
and the first two are different from the third.  They are Console.Write(...) instead of Console.WriteLine(...).  
The difference is that the Console.Write(...) statement writes to the console and stops 
on the same line, but the Console.WriteLine(...) goes to the next line after writing to the console. 

string name = Console.ReadLine(); 
Console.Write("Hello, {0}! ", name);

  
4.3 Variable declaration, 1:
============================

// Listing 4.3.  Displaying Boolean Values:  Boolean.cs 

using System;

class Booleans
{
    public static void Main()
    {
        bool content = true;
        bool noContent = false; 

        Console.WriteLine("It is {0} that C# Station provides C# programming language content.", content);
        Console.WriteLine("The statement above is not {0}.", noContent);
    }
} 


4.4 Variable declaration, 2:
============================

using System;

class Unary
{
    public static void Main()
    {
        int unary = 0;
        int preIncrement;
        int preDecrement;
        int postIncrement;
        int postDecrement;
        int positive;
        int negative;
        sbyte bitNot;
        bool logNot;

        preIncrement = ++unary;
        Console.WriteLine("Pre-Increment: {0}", preIncrement);

        preDecrement = --unary;
        Console.WriteLine("Pre-Decrement: {0}", preDecrement);

        postDecrement = unary--;
        Console.WriteLine("Post-Decrement: {0}", postDecrement);

        postIncrement = unary++;
        Console.WriteLine("Post-Increment: {0}", postIncrement);

        Console.WriteLine("Final Value of Unary: {0}", unary);

        positive = -postIncrement;
        Console.WriteLine("Positive: {0}", positive);

        negative = +postIncrement;
        Console.WriteLine("Negative: {0}", negative);

        bitNot = 0;
        bitNot = (sbyte)(~bitNot);
        Console.WriteLine("Bitwise Not: {0}", bitNot);

        logNot = false;
        logNot = !logNot;
        Console.WriteLine("Logical Not: {0}", logNot);
    }
} 

You can expect the following output from the above program.

Pre-Increment:  1
Pre-Decrement  0
Post-Decrement:  0
Post-Increment  -1
Final Value of Unary:  0
Positive:  1
Negative:  -1
Bitwise Not:  -1
Logical Not:  True 

Another example:

using System;

class Binary
{
    public static void Main()
    {
        int x, y, result;
        float floatResult;

        x = 7;
        y = 5;

        result = x+y;
        Console.WriteLine("x+y: {0}", result);

        result = x-y;
        Console.WriteLine("x-y: {0}", result);

        result = x*y;
        Console.WriteLine("x*y: {0}", result);

        result = x/y;
        Console.WriteLine("x/y: {0}", result);

        floatResult = (float)x/(float)y;
        Console.WriteLine("x/y: {0}", floatResult);

        result = x%y;
        Console.WriteLine("x%y: {0}", result);

        result += x;
        Console.WriteLine("result+=x: {0}", result);
    }
} 

And here's the output:

x+y: 12 
x-y: 2 
x*y: 35 
x/y: 1 
x/y: 1.4 
x%y: 2 
result+=x: 9


5. SOME MORE INFO ON ARRAYS:
============================


Another data type is the Array, which can be thought of as a container that has a list of storage locations 
for a specified type.  When declaring an Array, specify the type, name, dimensions, and size.

// Listing 6.1 Array Operations:  Array.cs 

using System;

class Array
{
    public static void Main()
    {
        int[] myInts = { 5, 10, 15 };
        bool[][] myBools = new bool[2][];
        myBools[0] = new bool[2];
        myBools[1] = new bool[1];
        double[,] myDoubles = new double[2, 2];
        string[] myStrings = new string[3];

        Console.WriteLine("myInts[0]: {0}, myInts[1]: {1}, myInts[2]: {2}", myInts[0], myInts[1], myInts[2]);

        myBools[0][0] = true;
        myBools[0][1] = false;
        myBools[1][0] = true;
        Console.WriteLine("myBools[0][0]: {0}, myBools[1][0]: {1}", myBools[0][0], myBools[1][0]);

        myDoubles[0, 0] = 3.147;
        myDoubles[0, 1] = 7.157;
        myDoubles[1, 1] = 2.117;
        myDoubles[1, 0] = 56.00138917;
        Console.WriteLine("myDoubles[0, 0]: {0}, myDoubles[1, 0]: {1}", myDoubles[0, 0], myDoubles[1, 0]);

        myStrings[0] = "Joe";
        myStrings[1] = "Matt";
        myStrings[2] = "Robert";
        Console.WriteLine("myStrings[0]: {0}, myStrings[1]: {1}, myStrings[2]: {2}", myStrings[0], myStrings[1], myStrings[2]);

    }
} 


And here's the output:

myInts[0]: 5, 
myInts[1]: 10, 
myInts[2]: 15
myBools[0][0]: True, 
myBools[1][0]: True
myDoubles[0, 0]: 3.147, 
myDoubles[1, 0]: 56.00138917
myStrings[0]: Joe, 
myStrings[1]: Matt, 
myStrings[2]: Robert


Listing 2-4 shows different implementations of Arrays.  The first example is the myInts Array.  
It is initialized at declaration time with explicit values.

Next is a jagged array.  It is essentially an array of arrays.  We needed to use the new operator to instantiate 
the size of the primary array and then use the new operator again for each sub-array.

The third example is a two dimensional array.  Arrays can be multi-dimensional, with each dimension 
separated by a comma.  it must also be instantiated with the new operator.

Finally, we have the one dimensional array of string types.

In each case, you can see that array elements are accessed by identifying the integer index for the item you wish 
to refer to.  Arrays sizes can be any int type value.  Their indexes begin at 0. 


Single-Dimensional Arrays

The type of each array declared is given firstly by the type of basic elements it can hold, and secondly 
by the number of dimensions it has. Single-dimensional arrays have a single dimension (ie, are of rank 1). 
They are declared using square brackets, eg: 

int[] i = new int[100]; 

This line of code declares variable i to be an integer array of size 100. 
It contains space for 100 integer elements, ranging from i[0] to i[99]. 

To populate an array one can simply specify values for each element, as in the following code: 

int[] i = new int[2]; 
i[0] = 1;
i[1] = 2; 

One can also run together the array declaration with the assignment of values to elements using 

int[] i = new int[] {1,2}; 
or the even shorter version of this: 

int[] i = {1,2}; 
By default, as we have seen, all arrays start with their lower bound as 0 
(and we would recommend that you stick with this default). 
However, using the .NET framework's System.Array class it is possible to create and manipulate 
arrays with an alternative initial lower bound. 

The (read-only) Length property of an array holds the total number of its elements across all of its dimensions. 
As single-dimensional arrays have just one dimension, this property will hold the length of the single dimension.
For instance, given the definition of array i above, i.Length is 2. 

Rectangular Arrays

C# supports two types of multidimensional arrays: rectangular and jagged. 
A rectangular array is a single array with more than one dimension, with the dimensions' sizes 
fixed in the array's declaration. The following code creates a 2 by 3 multi-dimensional array: 

int[,] squareArray = new int[2,3]; 
As with single-dimensional arrays, rectangular arrays can be filled at the time they are declared. 
For instance, the code 

int[,] squareArray = {{1, 2, 3}, {4, 5, 6}}; 
creates a 2 by 3 array with the given values. It is, of course, important that the given values 
do fill out exactly a rectangular array. 

The System.Array class includes a number of methods for determining the size and bounds of arrays. 
These include the methods GetUpperBound(int i) and GetLowerBound(int i), which return, respectively, 
the upper and lower subscripts of dimension i of the array (note that i is zero based, so the first 
array is actually array 0). 

For instance, since the length of the second dimension of squareArray is 3, the expression 

"squareArray.GetLowerBound(1)" returns 0, and the expression 

"squareArray.GetUpperBound(1)" 
returns 2. 

System.Array also includes the method GetLength(int i), which returns the number of elements in the 
ith dimension (again, zero based). 

The following piece of code loops through squareArray and writes out the value of its elements 

for(int i = 0; i < squareArray.GetLength(0); i++)
 
   for (int j = 0; j < squareArray.GetLength(1); j++)
 
      Console.WriteLine(squareArray[i,j]);
 

A foreach loop can also be used to access each of the elements of an array in turn, but using this construction 
one doesn't have the same control over the order in which the elements are accessed. 


6. WINDOWS FORMS OR .NET WINAPPS: 
=================================

The key class in the System.Windows.Forms namespace is "Form", which is the base class for all top-level
windows, including the application's main windows, view windows and any dialog boxes that you create.

Example 1:
==========

Suppose we create a windows application, which has a mainform, which shows a number of buttons which
can open other forms.

You actually do not need to "program" the majority of the code, as shown in this example. The "Forms Designer"
inject these lines into your source code, as you add controls and set properties.

// start code of the form

using System;
using System.Drawing;
using System.Collections;
using System.ComponentModel;
using System.Windows.Forms;
using System.Data;

namespace MSPress.CSharpCoreRef.Controls
{
	/// <summary>
	/// Summary description for Form1.
	/// </summary>
	public class mainForm : System.Windows.Forms.Form
	{
        private System.Windows.Forms.Button buttonsButton;
        private System.Windows.Forms.Button listBoxesButton;
        private System.Windows.Forms.Button textBoxesButton;
        private System.Windows.Forms.Button combosButton;
        private System.Windows.Forms.Button scrollbarsButton;
        private System.Windows.Forms.Button containersButton;
        private System.Windows.Forms.Button webLookButton;
		/// <summary>
		/// Required designer variable.
		/// </summary>
		private System.ComponentModel.Container components = null;

		public mainForm()
		{
			//
			// Required for Windows Form Designer support
			//
			InitializeComponent();

			//
			// TODO: Add any constructor code after InitializeComponent call
			//
		}

		/// <summary>
		/// Clean up any resources being used.
		/// </summary>
		protected override void Dispose( bool disposing )
		{
			if( disposing )
			{
				if (components != null) 
				{
					components.Dispose();
				}
			}
			base.Dispose( disposing );
		}

		#region Windows Form Designer generated code
		/// <summary>
		/// Required method for Designer support - do not modify
		/// the contents of this method with the code editor.
		/// </summary>
		private void InitializeComponent()
		{
            this.buttonsButton = new System.Windows.Forms.Button();
            this.listBoxesButton = new System.Windows.Forms.Button();
            this.textBoxesButton = new System.Windows.Forms.Button();
            this.combosButton = new System.Windows.Forms.Button();
            this.scrollbarsButton = new System.Windows.Forms.Button();
            this.containersButton = new System.Windows.Forms.Button();
            this.webLookButton = new System.Windows.Forms.Button();
            this.SuspendLayout();
            // 
            // buttonsButton
            // 
            this.buttonsButton.Location = new System.Drawing.Point(64, 32);
            this.buttonsButton.Name = "buttonsButton";
            this.buttonsButton.TabIndex = 0;
            this.buttonsButton.Text = "Buttons";
            this.buttonsButton.Click += new System.EventHandler(this.buttonsButton_Click);
            // 
            // listBoxesButton
            // 
            this.listBoxesButton.Location = new System.Drawing.Point(152, 32);
            this.listBoxesButton.Name = "listBoxesButton";
            this.listBoxesButton.TabIndex = 1;
            this.listBoxesButton.Text = "List Boxes";
            this.listBoxesButton.Click += new System.EventHandler(this.ListBoxesButton_Click);
            // 
            // textBoxesButton
            // 
            this.textBoxesButton.Location = new System.Drawing.Point(64, 64);
            this.textBoxesButton.Name = "textBoxesButton";
            this.textBoxesButton.TabIndex = 2;
            this.textBoxesButton.Text = "Text Boxes";
            this.textBoxesButton.Click += new System.EventHandler(this.TextBoxesButton_Click);
            // 
            // combosButton
            // 
            this.combosButton.Location = new System.Drawing.Point(64, 96);
            this.combosButton.Name = "combosButton";
            this.combosButton.TabIndex = 4;
            this.combosButton.Text = "Combos";
            this.combosButton.Click += new System.EventHandler(this.CombosButton_Click);
            // 
            // scrollbarsButton
            // 
            this.scrollbarsButton.Location = new System.Drawing.Point(152, 96);
            this.scrollbarsButton.Name = "scrollbarsButton";
            this.scrollbarsButton.TabIndex = 5;
            this.scrollbarsButton.Text = "Scroll Bars";
            this.scrollbarsButton.Click += new System.EventHandler(this.scrollbarsButton_Click);
            // 
            // containersButton
            // 
            this.containersButton.Location = new System.Drawing.Point(152, 64);
            this.containersButton.Name = "containersButton";
            this.containersButton.TabIndex = 6;
            this.containersButton.Text = "Containers";
            this.containersButton.Click += new System.EventHandler(this.containersButton_Click);
            // 
            // webLookButton
            // 
            this.webLookButton.Location = new System.Drawing.Point(104, 136);
            this.webLookButton.Name = "webLookButton";
            this.webLookButton.TabIndex = 7;
            this.webLookButton.Text = "Web Look";
            this.webLookButton.Click += new System.EventHandler(this.webLookButton_Click);
            // 
            // mainForm
            // 
            this.AutoScaleBaseSize = new System.Drawing.Size(5, 13);
            this.ClientSize = new System.Drawing.Size(292, 198);
            this.Controls.AddRange(new System.Windows.Forms.Control[] {
                                                                          this.webLookButton,
                                                                          this.containersButton,
                                                                          this.scrollbarsButton,
                                                                          this.combosButton,
                                                                          this.textBoxesButton,
                                                                          this.listBoxesButton,
                                                                          this.buttonsButton});
            this.FormBorderStyle = System.Windows.Forms.FormBorderStyle.FixedDialog;
            this.Name = "mainForm";
            this.Text = "Basic Controls Demo";
            this.ResumeLayout(false);

        }
		#endregion

		/// <summary>
		/// The main entry point for the application.
		/// </summary>
		[STAThread]
		static void Main() 
		{
			Application.Run(new mainForm());
		}

        private void ListBoxesButton_Click(object sender, System.EventArgs e)
        {
            Form f = new ListBoxForm();
            f.Show();
        }

        private void CombosButton_Click(object sender, System.EventArgs e)
        {
            Form f = new ComobBoxForm();
            f.ShowDialog();
        }

        private void TextBoxesButton_Click(object sender, System.EventArgs e)
        {
            Form f = new TextBoxForm();
            f.ShowDialog();
        }

        private void scrollbarsButton_Click(object sender, System.EventArgs e)
        {
            Form f = new ScrollbarsForm();
            f.ShowDialog();
        }

        private void containersButton_Click(object sender, System.EventArgs e)
        {
            Form f = new ContainersForm();
            f.ShowDialog();
        }

        private void buttonsButton_Click(object sender, System.EventArgs e)
        {
            Form f = new ButtonForm();
            f.Show();
        }

        private void webLookButton_Click(object sender, System.EventArgs e)
        {
            Form f = new WebLookForm();
            f.Show();
        }
	}
}


// end code


Let's take a look at some noticable points: 

- The Main() method in a Windows Forms application includes a call to Application.Run

		static void Main() 
		{
			Application.Run(new mainForm());
		}

- Display a form:

A modal form prevents access to any other portions of the application while the form is displayed.
A modeless form allows other parts of the application be used.

- To display a modeless form, us the Show method:

Form addressForm=new AddressForm();
addressForm.Show();

To destroy the form, use the Close() method:

addressForm.Close();

- To display a modal form, use the ShowDialog() method, as shown here:

addressForm.ShowDialog();


CALL A STORED PROCEDURE:
========================


Example 1.
---------- 

using(MyDatabase db = new MyDatabase())
{
    // Call the GetUsersByZip stored procedure and return data in a DataTable object 
    DataTable users  = db.StoredProcedures.GetUsersByZipDataTable(77057);

    // You do not need to close the DB connection manually.
    // The using statement do it automatically. 
} 

or 
MyDatabase db = new MyDatabase();
try 
{
    // Call the GetUsersByZip stored procedure and return data in a DataTable object
    DataTable users = db.StoredProcedures.GetUsersByZipDataTable(77057);
}
finally 
{
    // Do not forget to close the DB connection 
    db.Close();
} 


Example 2.
----------

Oracle is one of the most popular databases around. So, there is a good enough chance that you will end up working 
with Oracle and .NET. 
Stored Procedures is one of the ways that you can shift some of the business logic to the database server. 
Here , we will see how to execute a stored procedure which takes some input parameters.


Here is the code for the Stored Procedure:  

CREATE OR REPLACE PROCEDURE updateBalance(accNo number, amount number) 
IS 
BEGIN 
             UPDATE customer_account 
             SET balance = balance + amount 
             WHERE  accountNo = accNo ;
END;
/
 

Here our procedure is a very simple one which has only one update statement and which takes two input parameters 
1. Account Number (AccNo) and
2. Amount (amount) 


You can run the above code in SQL*PLUS and the procedure is created. Of course, you should have the table 
Customer_Account with at least the required fields i.e

1. AccountNo  which is a number and
2. Balance      also a number.

Let's see the how can we execute it using .NET. Here, we will be a using a simple console application 
to execute the stored procedure. The language used is C#.

Here is the  C# code (console.cs) 

// import the required assemblies
using System; 
using System.Data; 
using System.Data.OleDb;

class StoredProc
      { 
            public static void Main() 
                {
 
                        string DBstr = "" ;  // your Database connection string

                         // create an instance of the connection object
                         OleDbConnection oCn = new OleDbConnection(DBstr) ;

        //create an instance of the command object giving the procedure name 
              OleDbCommand oCm = new OleDbCommand("updateBalance",oCn) ;

        // Define the command type u r executing as a Stored Procedure.                       
                        oCm.CommandType = CommandType.StoredProcedure ;

        //Add the parameter "accNo" giving it's value and defining it as a Input parameter
                        oCm.Parameters.Add("accNo",OleDbType.Integer,16);  
                        oCm.Parameters["accNo"].Value = 1 ;                      
                        oCm.Parameters["accNo"].Direction = ParameterDirection.Input ;

       //Add the parameter "amount" giving it's value & defining it as Input parameter 
                oCm.Parameters.Add("amount",OleDbType.Integer,16);
                oCm.Parameters["amount"].Value = 200 ;  
                oCm.Parameters["amount"].Direction = ParameterDirection.Input ;

                        // using the Try Catch Finally Block.
                        try 
                          { 
                                // Open the connection
                                oCn.Open(); 

                                // giving screen output
                                Console.WriteLine("created connection") ; 

                                // execute the stored procedure
                                oCm.ExecuteNonQuery() ; 

                                // get the confirmation on the screen
                                Console.WriteLine("Procedure Completed") ; 

                          }
                       catch(Exception ex) 
                          { 

                               // catch the error message and put it in the string "msg"
                               string msg = ex.Message ; 

                               //show the error message on the screen
                               Console.WriteLine(msg) ; 

                          } 
                       finally 
                          { 

                              // Destroy the command object
                              oCm.Dispose() ; 

                              // Destroy the connection object
                              oCn.Dispose() ;  

                          }
                } 
       } 


You will have to compile this separately with the following command 

csc /r:System.Data.dll console.cs

This creates console.exe which you can run at command prompt.


#############################################################################################
#############################################################################################
#############################################################################################


=====================================================
Section 14: Basic PHP and examples:
=====================================================


=======================
1. Basic PHP and MySQL:
=======================


1.1 Connecting to a MySQL Database in PHP:
==========================================

-- mysql_connect()

Before you can access and work with data in a database, you must create a connection 
to the database.

In PHP, this is done with the mysql_connect() function.

Syntax:
mysql_connect(servername,username,password); 

Parameter Description 
servername Optional. Specifies the server to connect to. Default value is "localhost:3306" 
username Optional.   Specifies the username to log in with. Default value is the name of the user 
                     that owns the server process 
password Optional.   Specifies the password to log in with. Default is "" 

Note: There are more available parameters, but the ones listed above are the most important. 
Visit our full PHP MySQL Reference for more details.

Example
In the following example we store the connection in a variable ($con) for later use in the script. 
The "die" part will be executed if the connection fails:

<?php
$con = mysql_connect("localhost","peter","abc123");
if (!$con)
  {
  die('Could not connect: ' . mysql_error());
  }// some code
?>


1.2 Closing a Connection:
=========================

-- mysql_close()

The connection will be closed as soon as the script ends. To close the connection before, 
use the mysql_close() function.

<?php
$con = mysql_connect("localhost","peter","abc123");
if (!$con)
  {
  die('Could not connect: ' . mysql_error());
  }// some code
mysql_close($con);
?> 


1.3 Create a database:
======================

-- mysql_query()

Create a Database
The CREATE DATABASE statement is used to create a database in MySQL.

Syntax
CREATE DATABASE database_name 

Example:

CREATE DATABASE IF NOT EXISTS spldev1;
USE spldev1;


To get PHP to execute the statement above we must use the 

mysql_query() 

function. This function is used to send a query or command to a MySQL connection. 

Example:
In the following example we create a database called "my_db":

<?php
$con = mysql_connect("localhost","peter","abc123");
if (!$con)
  {
  die('Could not connect: ' . mysql_error());
  }
  if (mysql_query("CREATE DATABASE my_db",$con))
  {
  echo "Database created";
  }
  else
  {
  echo "Error creating database: " . mysql_error();
  }mysql_close($con);
?>


Example 2:

<?php

// set your infomation.
$dbhost='localhost';
$dbusername='david';
$dbuserpass='mypassword';
$dbname='test';

// connect to the mysql database server.
$link_id = mysql_connect ($dbhost, $dbusername, $dbuserpass);
echo "success in database connection.";

// create the database.
$dbname=$dbusername."_".$dbname;
if (!mysql_query("CREATE DATABASE $dbname")) die(mysql_error());
echo "success in database creation.";

?> 


1.4 Create a table:
===================

-- mysql_query()

Create a Table
The CREATE TABLE statement is used to create a database table in MySQL.

Syntax
CREATE TABLE table_name
(
column_name1 data_type,
column_name2 data_type,
column_name3 data_type,
.......
) 

Ofcourse, creating a table from a query promt tool, or some graphical Admin tool,
is very straightforward.

But if you want to create a table from PHP code,
we must add the CREATE TABLE statement to the mysql_query() function to execute the command.

Example 1:

The following example shows how you can create a table named "Person", with three columns. 
The column names will be "FirstName", "LastName" and "Age":

<?php
$con = mysql_connect("localhost","peter","abc123");
if (!$con)
  {
  die('Could not connect: ' . mysql_error());
  }// Create database
if (mysql_query("CREATE DATABASE my_db",$con))
  {
  echo "Database created";
  }
else
  {
  echo "Error creating database: " . mysql_error();
  }// Create table in my_db database
mysql_select_db("my_db", $con);
$sql = "CREATE TABLE Person 
(
FirstName varchar(15),
LastName varchar(15),
Age int
)";
mysql_query($sql,$con);
mysql_close($con);
?>

Example 2:

<?php
$user="username";
$password="password";
$database="database";
mysql_connect(localhost,$user,$password);
@mysql_select_db($database) or die( "Unable to select database");
$query="CREATE TABLE contacts (id int(6) NOT NULL auto_increment,first varchar(15) NOT NULL,
last varchar(15) NOT NULL,phone varchar(20) NOT NULL,mobile varchar(20) NOT NULL,
fax varchar(20) NOT NULL,email varchar(30) NOT NULL,web varchar(30) NOT NULL,
PRIMARY KEY (id),UNIQUE id (id),KEY id_2 (id))";
mysql_query($query);
mysql_close();
?>


Important: A database must be selected before a table can be created. The database is selected with 
the mysql_select_db() function.

Note: When you create a database field of type varchar, you must specify the maximum length of the field, 
e.g. varchar(15).


1.5 MySQL Data Types:
=====================

Below is the different MySQL data types that can be used:

-- Numeric Data Types:

int(size)
smallint(size)
tinyint(size)
mediumint(size)
bigint(size) 
decimal(size,d)
double(size,d)
float(size,d) 

-- Textual Data Types:

char(size)                 Holds a fixed length string (can contain letters, numbers, 
                           and special characters). The fixed size is specified in parenthesis 
varchar(size)              Holds a variable length string (can contain letters, numbers, 
                           and special characters). The maximum size is specified in parenthesis 
tinytext                   Holds a variable string with a maximum length of 255 characters 

text
blob                       Holds a variable string with a maximum length of 65535 characters 

mediumtext
mediumblob                 Holds a variable string with a maximum length of 16777215 characters 

longtext
longblob                   Holds a variable string with a maximum length of 4294967295 characters 

-- Date time datatypes:

date(yyyy-mm-dd)
datetime(yyyy-mm-dd hh:mm:ss)
timestamp(yyyymmddhhmmss)
time(hh:mm:ss) 

-- Other datatypes:

enum(value1,value2,ect)    ENUM is short for ENUMERATED list. 
                           Can store one of up to 65535 values listed within the ( ) brackets. 
                           If a value is inserted that is not in the list, 
                           a blank value will be inserted 

set                        SET is similar to ENUM. However, SET can have up to 64 list items 
                           and can store more than one choice 


1.6 Primary Keys and Auto Increment Fields:
===========================================

Each table should have a primary key field.

A primary key is used to uniquely identify the rows in a table. Each primary key value must 
be unique within the table. Furthermore, the primary key field cannot be null because the database engine 
requires a value to locate the record.

The primary key field is always indexed. There is no exception to this rule! 
You must index the primary key field so the database engine can quickly locate rows based on the key's value.

The following example sets the personID field as the primary key field. 
The primary key field is often an ID number, and is often used with the AUTO_INCREMENT setting. 
AUTO_INCREMENT automatically increases the value of the field by 1 each time a new record is added. 
To ensure that the primary key field cannot be null, we must add the NOT NULL setting to the field.

Example:


$sql = "CREATE TABLE Person 
(
personID int NOT NULL AUTO_INCREMENT, 
PRIMARY KEY(personID),
FirstName varchar(15),
LastName varchar(15),
Age int
)";

mysql_query($sql,$con);


1.7: Get the Results of a SELECT Statement in PHP:
==================================================

-- mysql_query()
-- mysql_fetch_row()


Example 1:

$sql="SELECT * FROM EMPLOYEE";

$result=mysql_query($sql)
       or die("Error executing Query");

print "<table border='1'>\n";

while ($line=mysql_fetch_row($result)) {
       print "\t<tr>\n";
       foreach ($line as $col_value) {
               print "\t\t<td>$col_value</td>\n";
       }
       print "\t<tr>\n";
}
print "</table>\n";


The resultset of the query is stored in the MySQL resource variable "$result".
By using the mysql_fetch_row function, we can use the rows one by one,
and use them in an array like fashion.


1.8 Insert data through PHP:
============================

The INSERT INTO statement is used to add new records to a database table.

Syntax
INSERT INTO table_name
VALUES (value1, value2,....) 

You can also specify the columns where you want to insert the data:

INSERT INTO table_name (column1, column2,...)
VALUES (value1, value2,....) 

To get PHP to execute the statements above we must use the mysql_query() function. 
This function is used to send a query or command to a MySQL connection.

Example 1:

<?php
$con = mysql_connect("localhost","peter","abc123");
if (!$con)
  {
  die('Could not connect: ' . mysql_error());
  }mysql_select_db("my_db", $con);

mysql_query("INSERT INTO person (FirstName, LastName, Age) 
VALUES ('Peter', 'Griffin', '35')");

mysql_query("INSERT INTO person (FirstName, LastName, Age) 
VALUES ('Glenn', 'Quagmire', '33')");

mysql_close($con);
?>


Insert Data From a Form Into a Database:

Now we will create an HTML form that can be used to add new records to the "Person" table.

Here is the HTML form:

<html>
<body><form action="insert.php" method="post">
Firstname: <input type="text" name="firstname" />
Lastname: <input type="text" name="lastname" />
Age: <input type="text" name="age" />
<input type="submit" />
</form></body>
</html>

When a user clicks the submit button in the HTML form in the example above, the form data is sent 
to "insert.php". The "insert.php" file connects to a database, and retrieves the values 
from the form with the PHP $_POST variables. Then, the mysql_query() function executes 
the INSERT INTO statement, and a new record will be added to the database table.

Below is the code in the "insert.php" page:

<?php
$con = mysql_connect("localhost","peter","abc123");
if (!$con)
  {
  die('Could not connect: ' . mysql_error());
  }mysql_select_db("my_db", $con);

$sql="INSERT INTO person (FirstName, LastName, Age)
VALUES
('$_POST[firstname]','$_POST[lastname]','$_POST[age]')";

if (!mysql_query($sql,$con))
  {
  die('Error: ' . mysql_error());
  }
echo "1 record added";mysql_close($con)
?>


Example 2:

<?php
// Make a MySQL Connection
mysql_connect("localhost", "admin", "1admin") or die(mysql_error());
mysql_select_db("test") or die(mysql_error());

// Insert a row of information into the table "example"
mysql_query("INSERT INTO example 
(name, age) VALUES('Timmy Mellowman', '23' ) ") 
or die(mysql_error());  

mysql_query("INSERT INTO example 
(name, age) VALUES('Sandy Smith', '21' ) ") 
or die(mysql_error());  

mysql_query("INSERT INTO example 
(name, age) VALUES('Bobby Wallace', '15' ) ") 
or die(mysql_error());  

echo "Data Inserted!";

?>


1.9 Arrays in PHP:
==================

An array is a sort variable that actually is a numbererd list of similar values.
In most cases, its a one dimensional list, but multiple dimensions
are also possible.
An array has an associated "index", that identifies a certain value
in the list.

You can construct an array with:

array
(
[index]=>value,
[index]=>value,
..
)

Examples:

- Simple array:

$months=array("januari","februari","march","april","may","june",
"july","august","september","october","november","december");

- Associative array:

$inv1=array('forks'=>6,'knifes'=>5,'spoons'=>7);

or similar

$inv2['forks']=6;
$inv2['knifes']=5;
$inv2['spoons']=7;

print an array:

In order to print the values of an array, you can use the print_r() function:

echo '$inv1: '; print_r($inv1);

the "foreach" instruction:

You can test or walk through all elements of an array, using the foreach loop:

Example:

foreach ($month as $value) {
    echo "maand: $value<br>\n";
}

result:
maand: januari
maand: febrari
maand: maart
etc..


Additional notes about array:
-----------------------------

-- 1. From http://nl3.php.net/types.array:
-- ---------------------------------------

Arrays
An array in PHP is actually an ordered map. A map is a type that maps values to keys. 
This type is optimized in several ways, so you can use it as a real array, or a list (vector), 
hashtable (which is an implementation of a map), dictionary, collection, stack, 
queue and probably more. Because you can have another PHP array as a value, you can also 
quite easily simulate trees. 

Explanation of those data structures is beyond the scope of this manual, but you'll find at least 
one example for each of them. For more information we refer you to external literature about this broad topic. 

Syntax
Specifying with array()
An array can be created by the array() language-construct. It takes a certain number 
of comma-separated key => value pairs. 

array( [key =>] value
     , ...
     )
// key may be an integer or string
// value may be any value 

<?php
$arr = array("foo" => "bar", 12 => true);

echo $arr["foo"]; // bar
echo $arr[12];    // 1
?>  


A key may be either an integer or a string. If a key is the standard representation 
of an integer, it will be interpreted as such (i.e. "8" will be interpreted as 8, while "08" 
will be interpreted as "08"). Floats in key are truncated to integer. There are no different 
indexed and associative array types in PHP; there is only one array type, which can both contain 
integer and string indices. 

A value can be of any PHP type. 

<?php
$arr = array("somearray" => array(6 => 5, 13 => 9, "a" => 42));

echo $arr["somearray"][6];    // 5
echo $arr["somearray"][13];  // 9
echo $arr["somearray"]["a"];  // 42
?> 


-- 2. An example from http://www.phpfreaks.com/phpmanual/page/function.array.html
-- ------------------------------------------------------------------------------

Automatic index with array()

<?php
$array = array(1, 1, 1, 1,  1, 8 => 1,  4 => 1, 19, 3 => 13);
print_r($array);
?>  

The above example will output:

Array
(
    [0] => 1
    [1] => 1
    [2] => 1
    [3] => 13
    [4] => 1
    [8] => 1
    [9] => 19
)  


1.10 $_POST and $_GET variables:
================================

The $_POST variable is a global array that will contain all values
submitted to the called script (for example, when the user clicked
the submit button in a form).
The indexes are the names of the inputfields.

The same is true for the $_GET variable.

There is a difference in the location of the values send to the server
in the datastream, In effect, the $_POST variable will not show the values
in the URL, while the $_GET does.

Example:

Suppose you have a form with a number of inputfields (or buttons), like

<td><input type='submit' name='b1' values='1'/></td>
<td><input type='submit' name='b2' values='2'/></td>
<td><input type='submit' name='b3' values='3'/></td>
<td><input type='submit' name='b4' values='4'/></td>
<td><input type='submit' name='b5' values='5'/></td>
<td><input type='submit' name='b6' values='6'/></td>
<td><input type='submit' name='b7' values='7'/></td>
..
(like the buttons of a calculator..)

Now, you might want to walk through the elements of the array in the following fashion:

for ($i=0; $i<10; $i ++) {
    if (isset($_POST["b$i"])) {
       echo "button $i, value: ".$_POST["b$i"];
    }
}


1.11 MySQL query prompt tool:
=============================

Connecting to and Disconnecting from the Server
To connect to the server, you will usually need to provide a MySQL user name when you 
invoke mysql and, most likely, a password. If the server runs on a machine other than the one 
where you log in, you will also need to specify a host name. Contact your administrator to 
find out what connection parameters you should use to connect (that is, what host, 
user name, and password to use). Once you know the proper parameters, 
you should be able to connect like this: 

shell> mysql -h host -u user -p
Enter password: ********

or as an example of a local connection:

mysql -u root -pvga88nt

host and user represent the host name where your MySQL server is running and the user name 
of your MySQL account. Substitute appropriate values for your setup. 
The ******** represents your password; enter it when mysql displays the Enter password: prompt. 

shell> mysql -h host -u user -p
Enter password: ********
Welcome to the MySQL monitor.  Commands end with ; or \g.
Your MySQL connection id is 25338 to server version: 5.0.30-standard

Type 'help;' or '\h' for help. Type '\c' to clear the buffer.

mysql>


================================
2. Loading data in MySQL tables:
================================


2.1 From existing table to a new table:
=======================================

You can create a new table, with all rows included, from any existing table.

2.1.1. CREATE TABLE <New_Table> AS SELECT FROM <Existing_Table>

Syntax:

create <new_table> as select * from <existing_table>;

Example:

create table EMPLOYEE2 as select * from EMPLOYEE;

MySQL also offers the following easy way to create a table on basis
of an existing table:

2.1.2 CREATE TABLE <New_Table> LIKE <Existing_Table>;

Example:

CREATE TABLE t2 LIKE t1;


2.2 Load data from a text file into a table:
============================================

From the "mysql>" prompt, you can load data from a textfile
to a table in one step.

Suppose you have the pet table, and a file pet.txt.
To load the text file pet.txt into the pet table, use this command: 

mysql> LOAD DATA LOCAL INFILE '/path/pet.txt' INTO TABLE pet;


Note that if you created the file on Windows with an editor that uses \r\n as a line terminator, 
you should use: 

mysql> LOAD DATA LOCAL INFILE '/path/pet.txt' INTO TABLE pet
    -> LINES TERMINATED BY '\r\n';


===============================================
3. An example of a simple Relational Datamodel:
===============================================

3.1 The model:
==============

(PK): Primary Key; 
(FK): Foreign Key

               --------------------
               |TABLE CUSTOMERS:  |
               |------------------|
     ----------|Cust_ID (PK)      |
     |         |Cust_name         |
     |         |Address           |
     |         |Postal+code       |
     |         |City              |
     |         |Country           |
     |         --------------------
     |
     |     
     |          -----------------           --------------------
     | 1:n     |TABLE ORDERS    |           |TABLE ORDERDETAIL |
     |         |----------------|    1:n    |------------------|
     |         |Order_id  (PK)  |------<<<--|Order_id   (PK/FK)|
     |---<<<---|Cust_id   (FK)  |           |Product_id (PK/FK)|--<<>>---
               |Order_date      |           |Quantity          |        |
               |Emp_id    (FK)  |-->>>-     |Discount          |        |
               ------------------     |     |                  |        |
                                      |     --------------------        |
                                 1:n  |                             1:1 |
                                      |                                 |
            ------------------        |     --------------------        |
            |TABLE EMPLOYEES |        |     |TABLE PRODUCTS    |        |
            |----------------|        |     |------------------|        |
            |Emp_id (PK)     |--------|     |Product_id (PK)   |---------
            |Name            |              |Product_name      |
            |Lastname        |              |No_In_Stock       |
            ------------------              |To_Order          |
                                            |Price             |
                                            --------------------

3.2 The DDL:
============

USE SALES  -- or other database of your choice

-- Here is the DDL:

CREATE TABLE Customers
(
Cust_id     int NOT NULL,
Cust_name   varchar(20) NOT NULL,
Address     varchar(30),
City        varchar(20),
Country     varchar(20),
CONSTRAINT  pk_cust PRIMARY KEY (cust_id)
); 

CREATE TABLE Employees
(
Emp_id      int NOT NULL,
Name        varchar(20) NOT NULL,
LastName    varchar(30) NOT NULL,
CONSTRAINT  pk_emp PRIMARY KEY (emp_id)
);

CREATE TABLE Products
(
Product_id      int NOT NULL,
Product_Name    varchar(20) NOT NULL,
Unit_price      decimal(7,2) NOT NULL,
No_In_Stock     int NOT NULL,
To_Order        char(1) NOT NULL,  
CONSTRAINT      pk_product PRIMARY KEY (product_id)
); 

CREATE TABLE Orders
(
Order_id      int NOT NULL,
Cust_id       int NOT NULL,
Emp_id        int NOT NULL,
Order_date    datetime NOT NULL,
CONSTRAINT    pk_order PRIMARY KEY (order_id),
CONSTRAINT    fk_cust_id FOREIGN KEY (cust_id) REFERENCES customers (cust_id),
CONSTRAINT    fk_emp_id FOREIGN KEY (emp_id) REFERENCES employees (emp_id)
); 

CREATE TABLE OrderDetail
(
Order_id      int NOT NULL,
Product_id    int NOT NULL,
Quantity      int NOT NULL,
CONSTRAINT pk_detail PRIMARY KEY (order_id,product_id),
CONSTRAINT fk_product FOREIGN KEY (product_id) REFERENCES products (product_id),
CONSTRAINT fk_order FOREIGN KEY (order_id) REFERENCES orders (order_id)
);


3.3 Some sample data:
=====================

-------------------------------------------------------------------------
insert into customers
values
(1,'AKZO','Piersonlaan 100','Heerlen','NL');

insert into customers
values
(2,'GM','1 Road Avenue','New York','US');

insert into customers
values
(3,'McPhierson','Blackburry 7','Oxford','GB');

insert into customers
values
(4,'DSM','Hoeplaweg 7','Amsterdam','NL');
-------------------------------------------------------------------------
insert into orders
values
(1,3,1,'2007-01-01');

insert into orders
values
(2,2,3,'2007-01-01');

insert into orders
values
(3,1,1,'2007-01-01');

insert into orders
values
(4,1,1,'2007-01-01');
-------------------------------------------------------------------------
insert into orderdetail
values
(1,1,4);

insert into orderdetail
values
(1,2,3);

insert into orderdetail
values
(2,1,100);

insert into orderdetail
values
(3,4,40);

insert into orderdetail
values
(3,2,44);

insert into orderdetail
values
(4,2,90);
-------------------------------------------------------------------------
insert into Employees
values
(1,'Harry','Ekelson');

insert into Employees
values
(2,'Marie','Zwoels');

insert into Employees
values
(3,'Gerrit','Bruinsma');
-------------------------------------------------------------------------
insert into Products
values
(1,'Hamer',100.00,10,'n');

insert into Products
values
(2,'Zaag',25.50,10,'n');

insert into Products
values
(3,'Beitel',10.75,10,'n');

insert into Products
values
(4,'Schroef',5.15,10,'n');

insert into Products
values
(5,'Schop',70.45,10,'n');


3.4 Cascaded deletes:
=====================

A table with a FK points to another table with a PK.
Normally, when you try to delete a row in the table with the PK, "child records"
may exist in the table with the FK.

This ofcourse could mean trouble.

Example:

create table customers2
(
custid int not null,
custname varchar(10),
CONSTRAINT pk_cust PRIMARY KEY (custid) 
);


create table contacts2
( 
contactid int not null,
custid int not null,
contactname varchar(10),
CONSTRAINT pk_contactid PRIMARY KEY (contactid),
CONSTRAINT fk_cust FOREIGN KEY (custid) REFERENCES customers2(custid) 
);

In this case, the table contacts is "linked" to the table customers.
You cannot just delete a row from customers, if there are corresponding
rows in contacts having the same custid as the row you are trying to delete
from customers.

-- Lets try it:
---------------

insert into customers2
values
(1,'AKZO');

insert into contacts2
values
(1,1,'Harry');


mysql> delete from customers2 where custid=1;

ERROR 1451 (23000): Cannot delete or update a parent row: a foreign key constraint fails (`spldev1/c
ontacts2`, CONSTRAINT `fk_cust` FOREIGN KEY (`custid`) REFERENCES `customers2` (`custid`))
mysql>


-- Now lets try this:
-- ------------------

create table customers3
(
custid int not null,
custname varchar(10),
CONSTRAINT pk_cust3 PRIMARY KEY (custid) 
);


create table contacts3
( 
contactid int not null,
custid int not null,
contactname varchar(10),
CONSTRAINT pk_contactid3 PRIMARY KEY (contactid),
CONSTRAINT fk_cust3 FOREIGN KEY (custid) REFERENCES customers3(custid) ON DELETE CASCADE 
);


This time we have used the ON DELETE CASCADE clause.

insert into customers3
values
(1,'AKZO');

insert into contacts3
values
(1,1,'Harry');

mysql> delete from customers3 where custid=1;
Query OK, 1 row affected (0.03 sec)

mysql> select * from contacts3;
Empty set (0.00 sec)


3.5 Triggers:
=============

Just to compare the different RDBMSses, let look at some examples from

-- -----------------------
-- 1. SQL Server examples:
-- -----------------------


Example 1:
----------

CREATE TRIGGER orders_INSERT ON orders 
FOR INSERT, UPDATE
AS
UPDATE stock
SET in_stock=in_stock-INSERTed.amount
FROM stock, INSERTed
WHERE stock.item_id=INSERTed.item_id


Example 2:
----------

CREATE TRIGGER employee_insupd
ON employee
FOR INSERT, UPDATE
AS
--Get the range of level for this job type FROM the jobs table.
declare @min_lvl tinyint,
   @max_lvl tinyint,
   @emp_lvl tinyint,
   @job_id smallint
SELECT @min_lvl = min_lvl,
   @max_lvl = max_lvl,
   @emp_lvl = i.job_lvl,
   @job_id = i.job_id
FROM employee e, jobs j, INSERTed i
WHERE e.emp_id = i.emp_id AND i.job_id = j.job_id
IF (@job_id = 1) AND (@emp_lvl <> 10)
BEGIN
   raiserror ('Job id 1 expects the default level of 10.',16,1)
   ROLLBACK TRANSACTION
END
ELSE
IF NOT (@emp_lvl BETWEEN @min_lvl AND @max_lvl)
BEGIN
   raiserror ('The level for job_id:%d should be between %d AND %d.',
      16, 1, @job_id, @min_lvl, @max_lvl)
   ROLLBACK TRANSACTION
END


-- ----------------
-- Oracle examples:
-- ----------------

Example 1:
----------

CREATE OR REPLACE TRIGGER tr_CUSTOMER_ins
BEFORE INSERT ON CUSTOMER FOR EACH ROW
BEGIN
	SELECT seq_customer.NEXTVAL INTO :NEW.CUSTOMER_ID FROM dual;
END;

Example 2:
----------

CREATE OR REPLACE TRIGGER MYTRIG2 
AFTER DELETE OR INSERT OR UPDATE ON JD11.BOOK
FOR EACH ROW
BEGIN
   IF DELETING THEN
      INSERT INTO JD11.XBOOK (PREVISBN, TITLE, DELDATE) VALUES (:OLD.ISBN, :OLD.TITLE, SYSDATE); 
   ELSIF INSERTING THEN
      INSERT INTO JD11.NBOOK (ISBN, TITLE, ADDDATE) VALUES (:NEW.ISBN, :NEW.TITLE, SYSDATE); 
   ELSIF UPDATING ('ISBN) THEN
      INSERT INTO JD11.CBOOK (OLDISBN, NEWISBN, TITLE, UP_DATE) VALUES (:OLD.ISBN :NEW.ISBN, :NEW.TITLE, SYSDATE);
   ELSE /* UPDATE TO ANYTHING ELSE THAN ISBN */
      INSERT INTO JD11.UBOOK (ISBN, TITLE, UP_DATE) VALUES (:OLD.ISBN :NEW.TITLE, SYSDATE); 
   END IF
END;


Lets now take a look at triggers in MySQL:

-- ---------------
-- MySQL examples:
-- ---------------

Support for triggers is included beginning with MySQL 5.0.2. A trigger is a named database object 
that is associated with a table and that is activated when a particular event occurs for the table. 


Example 1:
----------

For example, the following statements create a table and an INSERT trigger. 
The trigger sums the values inserted into one of the table's columns: 

CREATE TABLE account (acct_num INT, amount DECIMAL(10,2));

CREATE TRIGGER ins_sum BEFORE INSERT ON account
FOR EACH ROW SET @sum = @sum + NEW.amount;


Example 2:
----------

create table debit
(
account int not null,
saldo decimal(10,2) not null
);

create table credit
(
account int not null,
saldo decimal(10,2) not null
);

insert into debit
values
(1,100);

insert into credit
values
(1,100);


CREATE TRIGGER test BEFORE UPDATE ON credit
FOR EACH ROW
UPDATE debit
set saldo=saldo-NEW.saldo
where debit.account=NEW.account;

Here, when you update the amount in table "credit", the amount will be substracted from "debit".

Notes:
------

Note 1: From http://www.onlamp.com/pub/a/onlamp/2005/02/03/triggers.html
------------------------------------------------------------------------

MySQL 5.0, the alpha version of MySQL that's available for testing new features, has trigger support. 
This is no surprise, as triggers were promised in the MySQL Development Roadmap, but it's a novel experience 
to work with one of the big "MySQL can't do that" features and watch MySQL doing it.

For these tests I downloaded the most recent MySQL 5.0 source as described in the MySQL Reference Manual 
section Installing from the Development Source Tree. Material downloaded from the source tree is generally 
much newer--and less tested--than what you find on the MySQL 5.0 Downloads page.

Test-Driving Triggers

I start the mysql client program from a Linux shell, and with my first statement 
I make sure that I have version 5:


mysql> SELECT version();
+-------------------+
| version()         |
+-------------------+
| 5.0.2-alpha-debug |
+-------------------+
1 row in set (0.00 sec)

Then I create a table in a test database, create a trigger, and run an INSERT statement 
to test the trigger.

mysql> CREATE DATABASE test_db;
Query OK, 1 row affected (0.27 sec)

mysql> USE test_db;
Database changed

mysql> CREATE TABLE t (column1 TINYINT);
Query OK, 0 rows affected (0.28 sec)

mysql> CREATE TRIGGER t_bi              /* line 1 */
    -> BEFORE INSERT ON t               /* line 2 */
    -> FOR EACH ROW                     /* line 3 */
    -> SET @x = @x + 1;                 /* line 4 */
Query OK, 0 rows affected (0.00 sec)

mysql> SET @x = 0;                      /* line 5 */
Query OK, 0 rows affected (0.00 sec)

mysql> INSERT INTO t VALUES (1),(NULL); /* line 6 */
Query OK, 2 rows affected (0.00 sec)
Records: 2  Duplicates: 0  Warnings: 0

mysql> SELECT @x;                       /* line 7 */
+------+
| @x   |
+------+
| 2    |
+------+
1 row in set (0.01 sec)


To begin with the conclusion: the above exercise proves that triggers work with MySQL. 
To demonstrate why, I'll have to go through the CREATE TRIGGER statement one line at a time.

Explaining Triggers

CREATE TRIGGER trigger_name            /* line 1 */

Naturally, the first part is CREATE TRIGGER and the name of the new trigger.
I tend to use a convention: I start with the name of the table, then an underscore, 
then one of these six codes: bi, ai, bu, au, bd, or ad. Those codes stand for, respectively:

BEFORE INSERT ON table_name            /* line 2 */
or AFTER INSERT ON table_name
or BEFORE UPDATE ON table_name
or AFTER UPDATE ON table_name
or BEFORE DELETE ON table_name
or AFTER DELETE ON table_name

Those are the six possible times that a trigger might be activated. A trigger is always 
associated with a data-change statement on a single base table. My trigger, which has the 
clause BEFORE INSERT ON t, will be activated when I do INSERTs on table t.

FOR EACH ROW                           /* line 3 */

Specifically, the activation will happen for each row that I insert. 
If I INSERT zero rows, which is possible with INSERT ... SELECT statements, then zero activations 
take place. If I INSERT 1,000 rows, then 1,000 activations take place. 
Standard SQL allows you to say FOR EACH STATEMENT instead, which would mean that the activation 
happens once, no matter how many rows there are.


SET @x = @x + 1;                       /* line 4 */

Finally, there is the "body" of the trigger. When a trigger is activated, the statement 
in the trigger's body is executed. In my trigger the statement is SET @x = @x + 1, which 
increments the variable @x each time activation happens.

In other words, @x is a counter. Whenever an INSERT for a row happens, @x goes up. Of course, 
if @x starts with a NULL value then nothing will happen, and that's why I start by initializing the counter:

SET @x = 0;                            /* line 5 */

The big test moment comes when I INSERT:

INSERT INTO t VALUES (1),(NULL);       /* line 6 */

Every time I insert a row in t, the value of @x should rise, because there is a FOR EACH ROW trigger 
on t that says that's what should happen. Thus when I SELECT

SELECT @x;                              /* line 7 */

the result will be 2, because there were two rows.


======================================
4. Using PHP and Microsoft SQL Server:
======================================


Note 1:
=======


Requirements for Win32 platforms. 

The extension requires the MS SQL Client Tools to be installed on the system where PHP is installed. 
The Client Tools can be installed from the MS SQL Server CD or by copying ntwdblib.dll 
from \winnt\system32 on the server to \winnt\system32 on the PHP box. 
Copying ntwdblib.dll will only provide access. Configuration of the client will 
require installation of all the tools. 

Requirements for Unix/Linux platforms. 

To use the MSSQL extension on Unix/Linux, you first need to build and install the FreeTDS library. 
Source code and installation instructions are available at the 
FreeTDS home page: http://www.freetds.org/ 

Note: In Windows, the DBLIB from Microsoft is used. Functions that return a column name are based 
on the dbcolname() function in DBLIB. DBLIB was developed for SQL Server 6.x where the max identifier 
length is 30. For this reason, the maximum column length is 30 characters. On platforms where 
FreeTDS is used (Linux), this is not a problem. 

Installation
The MSSQL extension is enabled by adding extension=php_mssql.dll to php.ini. 

To get these functions to work, you have to compile PHP with --with-mssql[=DIR], where DIR is 
the FreeTDS install prefix. And FreeTDS should be compiled using --enable-msdblib. 

Runtime Configuration
The behaviour of these functions is affected by settings in php.ini. 


Table 1. MS SQL Server configuration options


Name 				Default Changeable 	Changelog 
mssql.allow_persistent		 "1" PHP_INI_SYSTEM   
mssql.max_persistent		 "-1" PHP_INI_SYSTEM   
mssql.max_links			 "-1" PHP_INI_SYSTEM   
mssql.min_error_severity	 "10" PHP_INI_ALL   
mssql.min_message_severity	 "10" PHP_INI_ALL   
mssql.compatability_mode	 "0" PHP_INI_ALL   
mssql.connect_timeout		 "5" PHP_INI_ALL   
mssql.timeout			 "60" PHP_INI_ALL 	Available since PHP 4.1.0. 
mssql.textsize			 "-1" PHP_INI_ALL   
mssql.textlimit			 "-1" PHP_INI_ALL   
mssql.batchsize			 "0" PHP_INI_ALL 	Available since PHP 4.0.4. 
mssql.datetimeconvert		 "1" PHP_INI_ALL 	Available since PHP 4.2.0. 
mssql.secure_connection		 "0" PHP_INI_SYSTEM 	Available since PHP 4.3.0. 
mssql.max_procs			 "-1" PHP_INI_ALL 	Available since PHP 4.3.0 

Functions that you can use:

mssql_bind 	--  Adds a parameter to a stored procedure or a remote stored procedure 
mssql_close 	-- Close MS SQL Server connection
mssql_connect 	-- Open MS SQL server connection
mssql_data_seek -- Moves internal row pointer
mssql_execute 	--  Executes a stored procedure on a MS SQL server database 
mssql_fetch_array	 --  Fetch a result row as an associative array, a numeric array, or both 
mssql_fetch_assoc	 --  Returns an associative array of the current row in the result set specified by result_id 
mssql_fetch_batch	 --  Returns the next batch of records 
mssql_fetch_field	 -- Get field information
mssql_fetch_object	 -- Fetch row as object
mssql_fetch_row		 -- Get row as enumerated array
mssql_field_length	 -- Get the length of a field
mssql_field_name	 -- Get the name of a field
mssql_field_seek	 -- Seeks to the specified field offset
mssql_field_type	 -- Gets the type of a field
mssql_free_result	 -- Free result memory
mssql_free_statement	 -- Free statement memory
mssql_get_last_message	 --  Returns the last message from the server 
mssql_guid_string	 --  Converts a 16 byte binary GUID to a string 
mssql_init	 	 --  Initializes a stored procedure or a remote stored procedure 
mssql_min_error_severity	 -- Sets the lower error severity
mssql_min_message_severity	 -- Sets the lower message severity
mssql_next_result	 -- Move the internal result pointer to the next result
mssql_num_fields	 -- Gets the number of fields in result
mssql_num_rows	 -- Gets the number of rows in result
mssql_pconnect	 -- Open persistent MS SQL connection
mssql_query	 -- Send MS SQL query
mssql_result	 -- Get result data
mssql_rows_affected --  Returns the number of records affected by the query 
mssql_select_db	 -- Select MS SQL database


Note 2:
=======

Stored Procedures on PHP and Microsoft SQL Server


Though it's not as common a combination as PHP and MySQL, PHP and Microsoft SQL Server 
can be a powerful team. You can query SQL Server databases easily and effectively using 
the PEAR database abstraction layer, just as you would a MySQL database. 
But once you start trying to use one of the primary benefits of SQL Server over MySQL 
-- namely, stored procedures -- a few problems quickly become apparent:

First, your PHP code is often nearly as messy as if you were dynamically building SQL statements 
for execution. Take the following stored procedure definition:

GetCustomerList @StoreId int, 
@CustomerType varchar(50)

and consider the PHP code needed to build the SQL statement that will execute this procedure 
from some page submission:


$sql = "EXEC GetCustomerList @StoreId="; 
$sql .= intval($_GET['StoreId']); 
$sql .= ', @CustomerType='; 
if ($_GET['CustomerType'] == '') { 
 $sql .= 'NULL'; 
} 
else { 
 $sql .= "'" . $_GET['CustomerType'] . "'" ; 
} 

// Assume you have an open PEAR database connection ($pearDB) 
$arrResults = $pearDB->getAll($sql);

Not exactly the most readable or aesthetically pleasing chunk of code, is it?

Second, what about when you want to do something slightly more advanced than call a 
stored procedure that simply queries for a list of results? Say, for instance that you'd like 
to retrieve return values or use output parameters in your stored procedures? There�s nothing 
built directly into the PEAR database library that will allow for this.

Finally, and most importantly, consider security. The code listed above, which produces the 
SQL string necessary to call the GetCustomerList procedure, is not at all secure. Because the 
value of $_GET['CustomerType'], which is assumed to come from user input, is used directly 
in the SQL string, with no checks for unsafe content or escaping of quotes, the SQL string that 
is generated could easily produce unexpected and undesired results. Most of us have read about 
SQL Injection Attacks far too often to take them for granted (if not, I strongly suggest 
you read up on them now).


Happily, there are some features built into PHP that can help minimize the likelihood 
of these attacks happening -- �magic quotes� and the associated �stripslashes� function, 
for instance. This PHP functionality can be used to �automagically� escape single quotes 
in all strings input through GET or POST values, which makes these strings safe for use in 
database queries. If you�re at all like me, though, you may find the magic quotes option a 
bit cumbersome to work with after a while. Also, I personally believe that the fewer global settings 
that I depend on the better -- I�ve moved my code to new machines too many times to depend 
on identical server configurations ever being anything but the exception to the rule.


Enter: The SqlCommand Class

The SqlCommand class is an object that was designed to try to minimize each of these problems, 
and help you produce more readable (debuggable), powerful, and secure code. 
The basic usage is fairly simple, containing only 6 commonly-used public methods 
(optional parameters are shown in square brackets):

1. -- SqlCommand([$sCommandText], [$bGetReturnValue])

Class instantiation, normally used to define the stored procedure name.

2. -- addParam($sParamName, [$oValue], [$sType], [$iLength], [$bForOutput])

Configure a parameter that must be passed to the stored procedure. 
The $sType option shown here is the exact SQL Server name of the variable type. 
Supported values currently include: bit, int, smallint, tinyint, real, float, money, text, 
char, varchar, datetime, and smalldatetime.

3. -- execute($oDB)

Execute without obtaining a resultset (such as for insert/update/deletes).

4. -- getAll($oDB, [$mode])

Execute and obtain a resultset (such as for select statements).

5. -- getReturnValue()

Retrieve the return value of the stored procedure.

6. -- getOutputValue($sParamName)

Retrieve the value of any output parameter defined in the stored procedure.


To actually use the SqlCommand class, you must first instantiate a new object of SqlCommand type, 
configure the object with the name of the stored procedure you want to execute, and set any parameters 
that are required. Then you can execute your stored procedure with the option of returning 
a resultset or not (getAll() vs. execute()). Along the way, the SqlCommand object will 
validate parameter values to ensure they're safe to use (which includes escaping single quotes 
in string values), and gives you methods by which to easily retrieve return and output parameter 
values from your procedure. 


Note 3:
=======

Database abstraction layers.

PearDB:

What is PEAR?
PEAR is short for "PHP Extension and Application Repository" and is pronounced just like the fruit. 
The purpose of PEAR is to provide: 


A structured library of open-sourced code for PHP users 

A system for code distribution and package maintenance 

A standard style for code written in PHP, specified here 

The PHP Extension Community Library (PECL), see more below 

A web site, mailing lists and download mirrors to support the PHP/PEAR community 


DB is a database abstraction layer providing:
* an OO-style query API
* portability features that make programs written for one DBMS work with other DBMS's
* a DSN (data source name) format for specifying database servers
* prepare/execute (bind) emulation for databases that don't support it natively
* a result object for each query response
* portable error codes
* sequence emulation
* sequential and non-sequential row fetching as well as bulk fetching
* formats fetched rows as associative arrays, ordered arrays or objects
* row limit support
* transactions support
* table information interface
* DocBook and phpDocumentor API documentation

DB layers itself on top of PHP's existing database extensions.

Drivers for the following extensions pass the complete test suite and provide
interchangeability when all of DB's portability options are enabled:

fbsql, ibase, informix, msql, mssql,
mysql, mysqli, oci8, odbc, pgsql,
sqlite and sybase.

There is also a driver for the dbase extension, but it can't be used
interchangeably because dbase doesn't support many standard DBMS features.

DB is compatible with both PHP 4 and PHP 5.

File: dbs/PearDB.php 
Role: Class source 
Content type: text/plain 
Description: Implementation for PEAR 
Class: anyDB
DB class for MYSQL, POSTGRES, SQLITE, PHPLIB, ODBC 

As a web programming language, one of PHP's strengths traditionally has been to make it easy 
to write scripts that access databases so that you can create dynamic web pages that incorporate 
database content. This is important when you want to provide visitors with information that is 
always up-to-date, without hand tweaking a lot of static HTML pages. However, although PHP is easy to use, 
it includes no general-purpose database access interface. Instead it has a number of specialized ones 
that take the form of separate sets of functions for each database system. There is one set for MySQL, 
another for InterBase, and another for PostgreSQL--and others as well.

This wide range of support for different database engines help make PHP popular because it means e
ssentially that no matter which database you use, PHP probably supports it. On the other hand, having 
a different set of functions for each database also makes PHP scripts non-portable at the lexical 
(source code) level. For example, the function for issuing a SQL statement is named mysql_query(), 
ibase_query(), or pg_exec(), depending on whether you are using MySQL, InterBase, or PostgreSQL. 
This necessitates a round of messy script editing to change function names if you want to use your 
scripts with a different database engine, or if you obtain scripts from someone who 
doesn't use the same engine you do.

In PHP 4 and up, this problem is addressed by means of a database module included in PEAR 
(the PHP Extension and Add-on Repository). The PEAR DB module supports database access based 
on a two-level architecture:

- The top level provides an abstract interface that hides database-specific details and thus is the same 
  for all databases supported by PEAR DB. Script writers need not think about which set of functions to use.

- The lower level consists of individual drivers. Each driver supports a particular database engine 
  and translates between the abstract interface seen by script writers and the database-specific interface 
  required by the engine. This provides you the flexibility of using any database for which a driver exists, 
  without having to consider driver-specific details.


Note 4:
=======

Dit heb ik gedaan heb om MSSQL functies in PHP(in Apache/Linux) te kunnen gebruiken.


1. Installatie freetds (free tabular datastream)

Hier heb ik freetds gedownload.
http://www.freetds.org/

Ik kreeg dit bestand van iets meer dan 1MB:
freetds-0.62.3.tar.gz

kheb dan dit gedaan:
tar -xzvf freetds-0.62.3.tar.gz

vervolgens heb ik freetds gecompileerd
cd freetds-0.62.3
./configure --enable-msdblib
make
make install


2. Compilatie PHP

Ik heb eerst PHP gedownload, ik kreeg dit bestand van ongeveer 5MB
php-4.3.6.tar.gz

cd php-4.3.6

Dan heb ik deze fantastische configure uitgevoerd...
'./configure' '--host=i386-redhat-linux' '--build=i386-redhat-linux' '--target=i386-redhat-linux-gnu' '--program-prefix=' '--prefix=/usr' '--exec-prefix=/usr' '--bindir=/usr/bin' '--sbindir=/usr/sbin' '--sysconfdir=/etc' '--datadir=/usr/share' '--includedir=/usr/include' '--libdir=/usr/lib' '--libexecdir=/usr/libexec' '--localstatedir=/var' '--sharedstatedir=/usr/com' '--mandir=/usr/share/man' '--infodir=/usr/share/info' '--cache-file=../config.cache' '--with-config-file-path=/etc' '--with-config-file-scan-dir=/etc/php.d' '--enable-force-cgi-redirect' '--disable-debug' '--enable-pic' '--disable-rpath' '--enable-inline-optimization' '--with-bz2' '--with-db4=/usr' '--with-curl' '--with-exec-dir=/usr/bin' '--with-freetype-dir=/usr' '--with-png-dir=/usr' '--with-gd' '--enable-gd-native-ttf' '--with-gdbm' '--with-gettext' '--with-ncurses' '--with-gmp' '--with-iconv' '--with-jpeg-dir=/usr' '--with-openssl' '--with-png' '--with-regex=system' '--with-xml' '--with-expat-dir=/usr' '--with-dom=shared,/usr' '--with-dom-xslt=/usr' '--with-dom-exslt=/usr' '--with-xmlrpc=shared' '--with-pcre=/usr' '--with-zlib' '--with-layout=GNU' '--enable-bcmath' '--enable-exif' '--enable-ftp' '--enable-magic-quotes' '--enable-safe-mode' '--enable-sockets' '--enable-sysvsem' '--enable-sysvshm' '--enable-discard-path' '--enable-track-vars' '--enable-trans-sid' '--enable-yp' '--enable-wddx' '--without-oci8' '--with-pear=/usr/share/pear' '--with-kerberos' '--with-ldap=shared' '--with-mysql=shared,/usr' '--enable-memory-limit' '--enable-bcmath' '--enable-shmop' '--enable-calendar' '--enable-dbx' '--enable-dio' '--enable-mcal' '--enable-mbstring' '--enable-mbstr-enc-trans' '--enable-mbregex' '--with-apxs2=/usr/sbin/apxs' '--with-mssql=/usr/local'
make
make install
Helaas, na het herstarten van httpd (=Apache) werkte er niks meer van PHP...


3. Wanhoopspoging

En toen deed ik hetvolgende, en ik geloofde nooit dat het zou werken...
apt-get install php-mysql
make install
httpd restart
...en alles werkte.


4. Hier een PHP die via MySQL en via MSSQL een select uitvoert.

echo 'verbind met MySQL Server...';
$mydb=mysql_connect("192.168.1.5", "polleke", "paswoord");
echo 'verbind met polDB via MySQL...';
$mydb_selected = mysql_select_db("poldb",$mydb);
echo 'tabel Persoon van polDB via MySQL...';
$result = mysql_query("SELECT Naam, Mail, Web FROM Persoon ORDER BY Naam");
while ($myrow = mysql_fetch_row($result))
{
echo '$myrow[0] $myrow[1] $myrow[2]';
}
echo 'verbind met MSSQL Server...';
$msdb=mssql_connect("192.168.1.204:1433","polleke","paswoord");
echo 'verbind met polDB via MSSQL...';
$msdb_selected = mssql_select_db("poldb",$msdb);
echo 'tabel Persoon van polDB via MSSQL...';
$result = mssql_query("SELECT Naam, Mail, Web FROM Persoon ORDER BY Naam");
while ($msrow = mssql_fetch_row($result))
{
echo '$msrow[0] $msrow[1] $msrow[2]';
}


Note 5:
=======

Executing SQL Server Stored Procedures With PHP 
(Page 1 of 6 )

In this article Joe looks at how to connect to an SQL Server 2000 database using PHP's set of 
mssql_xxx functions, and also how to execute commands and stored procedures against that 
database.

In my time as a web developer and database administrator, I've come to love the combination of 
PHP and Microsoft SQL Server to deliver dynamic content to my clients. I know that MySQL is the most 
popular database to use with PHP, but PHP includes support for many other databases, and I just happen 
to love the power and flexibility of SQL Server 2000.

Just like ASP and ASP.NET, PHP can also connect to an SQL Server database and execute queries 
and stored procedures. If you're not familiar with SQL Server, then you'd be unfamiliar with 
stored procedures. Basically, a stored procedure is a bunch of SQL queries grouped together that can 
be called kind-of like a function with input and output parameters. 
Stored procedures can return complete recordsets, just values, or both recordsets and values.

In this article we're going to look at how to equip your installation of PHP with SQL Server support, 
how to connect to an SQL Server 2000 database using PHP's set of mssql_xxx functions and also 
how to execute commands and stored procedures against that database, capturing the output parameters 
and recordsets returned from those stored procedures.

To get the most benefit from this article, you should be running PHP on a Windows server as well as 
SQL Server 2000 on either the same or a different PC. I will assume that the PC running SQL Server 2000 
is the same one that you have installed PHP onto.

-- Support for the mssql_xxx functions (like the native mysql_xxx functions)

As you may or may not know, PHP supports SQL Server with its set of mssql_xxx functions, 
which are similar to its mysql_xxx functions. On Windows, it's extremely easy to interface 
with SQL Server because of PHP's support for these functions.

If you're running Unix/Linux then unfortunately the mssql_xxx functions aren't available, and 
you'll need to use either ODBC or PHP's Sybase set of functions (sybase_xxx) as well as a 
tabular data stream protocol such as FreeTDS to work with SQL Server. In this article I'll 
be focusing on running PHP running on a Windows PC to connect to SQL Server running on a Windows PC, 
so click here if you need to learn how to setup SQL Server support for Linux/Unix.

Ok, first step is to download the support material for this article and copy php_mssql.dll 
to your PHP extensions directory, which you can find under the paths and directories header of 
your php.ini file (open it in notepad and search for "extension_dir ="). My extension directory 
is the same as my PHP directory, however yours might be [your php directory]\extensions.

Next, open your php.ini file (which should be c:\windows\php.ini for Win9x users and c:\winnt\php.ini 
for everyone else) and look for the line starting ;extension=php_mssql.dll. The semi-colon is a comment, 
so removing it will effectively uncomment that line. The extension=php_mssql.dll line 
tells PHP to load the php_mssql.dll extension library into memory, which makes the mssql_xxx set 
of functions available to us.

You'll also need the SQL client tools installed on the same PC as where you're running PHP. 
You only need one file, which is ntwdblib.dll. You should be able to find this file in the 
\winnt\system32 directory on your SQL Server or on your SQL Server installation CD. 
Copy it to \winnt\system32 on your PHP server.

Lastly, restart your web server. For IIS users, jump to a DOS window and run iisreset. 
For Apache users, jump to a DOS window and run apache �k restart.

Hopefully at this point you've restarted your web server software and are ready to continue. 
Before we move on, create the script below and run it through your web browser, 
just to make sure that the extensions were registered and loaded correctly:

<?php

$myServer = "localhost"; 

$myUser = "sa"; 

$myPass = ""; 

$myDB = "Northwind";

$s = @mssql_connect($myServer, $myUser, $myPass) 

or die("Couldn't connect to SQL Server on $myServer");

$d = @mssql_select_db($myDB, $s) 

or die("Couldn't open database $myDB");

?>

Replace the value of the variables shown above to match the details of your SQL Server installation. 
If everything is working fine then when you run the script you should see nothing but a blank page. 
On the other hand, if you see any messages about mssql_connect and mssql_select_db not existing, 
then you haven't installed the extensions properly. Double check that you've put the right path 
to your extensions folder in the php.ini file and that you've copied the DLL into the right folder 
as well. On some occasions a reboot of Windows will fix the problem also.

Now that we can connect to SQL Server correctly, let's look at executing some TSQL commands as well 
as some stored procedures.

As I mentioned earlier, PHP supports the mssql_xxx set of functions that offer the same kind 
of functionality and syntax that we've come to enjoy for MySQL. Because SQL Server has features 
that MySQL does not (such as stored procedures and triggers), the mssql_xxx set of functions 
includes a couple of unique functions. A list of these functions is shown below: 

mssql_get_last_message: Returns the last message that was generated by the server. 
mssql_min_error_severity: Sets the lower error severity at which an error will be raised. 
mssql_min_message_severity: Sets the lower error message severity at which an error will be raised. 
mssql_init: Used to initialize a stored procedure. 
mssql_execute: Executes a stored procedure against an SQL Server database. 
mssql_bind: Adds a parameter to a stored procedure. 
mssql_fetch_batch: Returns subsequent batches of records from SQL Server (if any). 
mssql_rows_affected: Returns the number of rows affected by the last query against the database


If you've come from an ASP background like I have, then I'm sure you�ll agree that the way 
in which PHP supports SQL Server is excellent.

-- mssql_connect()
-- mssql_select_db
-- mssql_query
-- mssql_fetch_array()

Let's start with a basic query that returns a list of employees from the Northwind table. 
Create a new PHP script called employee.php and enter the following code into it, substituting 
database connection variables where necessary:

<?php
$myServer = "localhost"; 
$myUser = "sa"; 
$myPass = ""; 
$myDB = "Northwind";
$s = @mssql_connect($myServer, $myUser, $myPass) 
or die("Couldn't connect to SQL Server on $myServer");
$d = @mssql_select_db($myDB, $s) 
or die("Couldn't open database $myDB");
$query = "SELECT TitleOfCourtesy+' '+FirstName+' '+LastName AS Employee "; 
$query .= "FROM Employees "; 
$query .= "WHERE Country='USA' AND Left(HomePhone, 5) = '(206)'";
$result = mssql_query($query); 
$numRows = mssql_num_rows($result);
echo "<h1>" . $numRows . " Row" . ($numRows == 1 ? "" : "s") . " Returned </h1>";
while($row = mssql_fetch_array($result)) 
{ 
echo "<li>" . $row["Employee"] . "</li>"; 
}
?>

If you've worked with PHP and MySQL before, then much of the code shown above won't be anything new. 
However, for those who haven't, let's just quickly run through the code from our example:

  $s = @mssql_connect($myServer, $myUser, $myPass) 

  or die("Couldn't connect to SQL Server on $myServer");

  $d = @mssql_select_db($myDB, $s) 

  or die("Couldn't open database $myDB");

Firstly we use the mssql_connect and mssql_select_db functions to connect to our SQL Server 
and select a database. If either of these actions fails, then the die function terminates 
our script, outputting the appropriate message to the browser.

  $query = "SELECT TitleOfCourtesy+' '+FirstName+' '+LastName AS Employee "; 

  $query .= "FROM Employees "; 

  $query .= "WHERE Country='USA' AND Left(HomePhone, 5) = '(206)'";

Next, we build a basic SQL query that uses field merging and the where clause to return all employees 
who live in USA or whose phone number starts with (206).

  $result = mssql_query($query); 

  $numRows = mssql_num_rows($result);

  echo "<h1>" . $numRows . " Row" . ($numRows == 1 ? "" : "s") . " Returned </h1>";

We then execute our query using mssql_query, capturing the result into the $result variable. 
We pass $result to mssql_num_rows to check how many rows were returned from the query and output 
that number to the browser.

  while($row = mssql_fetch_array($result)) 

  { 

  echo "<li>" . $row["Employee"] . "</li>"; 

  }

Lastly, we loop through the recordset using mssql_fetch_array so that we can refer to the employee 
field by its name.


Executing a stored procedure with PHP and the mssql_xxx set of functions is easy. 
To execute a stored procdure that accepts no parameters and doesn't return a value, 
we only need to make use of the mssql_init and mssql_execute functions.

Open query analyzer on your SQL Server and run the following code:

USE Northwind 

GO

CREATE PROC sp_AddNewShipper 

AS

-- Add a record to the shippers table 

INSERT INTO Shippers(CompanyName, Phone) 

VALUES('Johns Shipping', '(555) 555-0493')


We've just created a new stored procedure that's attached to the Northwind database. 
It's called sp_AddNewShipper. When it is executed, it will add a new record to the shippers table 
and won't return a value.

To execute sp_AddNewShipper from PHP, create a new file called shipper.php and enter 
the following code into it:

<?php
$myServer = "localhost"; 
$myUser = "sa"; 
$myPass = ""; 
$myDB = "Northwind";
$s = @mssql_connect($myServer, $myUser, $myPass) 
or die("Couldn't connect to SQL Server on $myServer");
$d = @mssql_select_db($myDB, $s) 
or die("Couldn't open database $myDB");
$query = mssql_init("sp_AddNewShipper", $s); 
$result = mssql_execute($query);
?>

When you run the script in your web browser and then check the shippers table of your Northwind database, 
you'll see that we've just added a new record.


The only new parts of our code are the calls to mssql_init and mssql_execute:

  $query = mssql_init("sp_AddNewShipper", $s); 

  $result = mssql_execute($query);

The call to mssql_init initializes our call to the sp_AddNewShipper stored procedure. 
It's main use is to facilitate the addition of output parameters for stored procedure calls 
(which we will look at next), and its signature looks like this:

  int mssql_init ( string sp_name [, int conn_id])

It accepts the name of the stored procedure as well as an optional connection identifier, 
which we've passed in as $s. It returns a numerical identifier that we then pass to mssql_execute. 
Mssql_execute actually creates all of the necessary SQL plumbing and executes our stored procedure 
on the SQL Server. Its signature looks like this:

  int mssql_execute ( int stmt)

Nothing to really explain about the mssql_execute function except that it takes the identifier 
of the statement to execute (which is returned by a call to mssql_init) and returns a result 
that can optionally include one/more records.

It's good to be able to execute stored procedures that don't accept or return any parameters, 
but in the real world stored procedures usually accept and/or return both parameters and values.

-- Stored Procedure with input parameters:

Clear your query analyzer window and enter the following batch of TSQL code:

USE Northwind 
GO

CREATE PROC sp_GetProductsBySupplier 
@supplierId TINYINT 
AS

-- Return products whose supplierId field is @supplierId 

SELECT ProductName 
FROM Products 
WHERE SupplierID = @supplierId 
ORDER BY ProductName ASC

We've just created a stored procedure called sp_GetProductsBySupplier. It accepts one integer 
input parameter called SupplerID, which will be used to return all products from the Northwind 
products table based on that products SupplierID field.

We can now use PHP and the mssql_xxx functions to execute our stored procedure, passing in an 
input parameter that contains an integer value (which will be used to filter the products based 
on their SupplierID field). Create a new file called products.php and enter the following code into it:

<?php

$myServer = "localhost"; 

$myUser = "sa"; 

$myPass = ""; 

$myDB = "Northwind";

$s = @mssql_connect($myServer, $myUser, $myPass) 

or die("Couldn't connect to SQL Server on $myServer");

$d = @mssql_select_db($myDB, $s) 

or die("Couldn't open database $myDB");

$query = mssql_init("sp_GetProductsBySupplier", $s); 

$supId = 2;

mssql_bind($query, "@supplierId", &$supId, SQLINT2); 

$result = mssql_execute($query);

$numProds = mssql_num_rows($result); 

echo "<h1>" . $numProds . " Product" . ($numProds == 0 ? "" : "s") . " Found: </h1>";

while($row = mssql_fetch_row($result)) 

{ 

echo "<li>" . $row[0] . "</li>"; 

}

?>


We can also run our stored procedure using query analyzer with the following code:

USE Northwind 
GO

EXEC sp_GetProductsBySupplier 2

The only new mssql function that we're calling in products.php is mssql_bind, which we can use 
to create a new input/output parameter for our stored procedure. Its signature looks like this:

  int mssql_bind ( int stmt, string param_name, mixed var, int type [, int is_output [, int is_null [, int maxlen]]])

In our example we want to pass in one numerical input parameter, so we use mssql_bind like this:

  mssql_bind($query, "@supplierId", $supId, SQLINT2);

For the type parameter, I've specified SQLINT2, which is defined by PHP internally to represent 
an integer of two bytes. Other possible values for the type parameter include 
SQLTEXT, SQLVARCHAR, SQLCHAR, SQLINT1, SQLINT2, SQLINT4, SQLBIT and SQLFLT8.

I haven't specified a value for the is_output, is_null and maxlen parameters. If I wanted to, 
I could've specified FALSE for is_output, meaning that our parameter is an input parameter and 
not an output parameter (is_output defaults to false anyway, so I chose not to specify it).

Once we've bound our parameter to our stored procedure, we execute the stored procedure using mssql_execute, 
capturing its recordset into a variable called $result:

  $result = mssql_execute($query); 


-- Stored rocedure with output parameters:

It's just as easy to use output parameters and capture return values from stored procedures. 
Run the following code in query analyzer to create a stored procedure that accepts both 
input and output values and also returns a result:

USE Northwind 
GO

CREATE PROC sp_GetNumProdsByPrice 
@minPrice MONEY, 
@maxPrice MONEY, 
@lowestPricedProduct VARCHAR(40) OUTPUT, 
@highestPricedProduct VARCHAR(40) OUTPUT 
AS
SELECT @lowestPricedProduct = (SELECT TOP 1 ProductName 
FROM Products 
WHERE UnitPrice >= @minPrice 
ORDER BY UnitPrice ASC)
SELECT @highestPricedProduct = (SELECT TOP 1 ProductName 
FROM Products 
WHERE UnitPrice <= @maxPrice 
ORDER BY UnitPrice DESC)
RETURN (SELECT COUNT(*) FROM Products WHERE UnitPrice >= @minPrice AND UnitPrice <= @maxPrice)

Our stored procedure works with the products table of the Northwind database and is called 
sp_GetNumProdsByPrice. It accepts two input and two output parameters. The input parameters are 
the minimum and maximum price of the item (UnitPrice) to match. The output parameters will contain 
the names of the products whose UnitPrice field was close to @minPrice and @maxPrice respectively.

Now, onto the PHP code. Create a new file called prodsbyprice.php and enter the following code into it:

<?php

$myServer = "localhost"; 

$myUser = "sa"; 

$myPass = ""; 

$myDB = "Northwind";

$s = @mssql_connect($myServer, $myUser, $myPass) 

or die("Couldn't connect to SQL Server on $myServer");

$d = @mssql_select_db($myDB, $s) 

or die("Couldn't open database $myDB");

$query = mssql_init("sp_GetNumProdsByPrice", $s);

$minPrice = 0.00; 

$maxPrice = 35.00; 

$lowProd = ""; 

$highProd = ""; 

$numProds = 0;

// Bind the parameters 

mssql_bind($query, "@minPrice", $minPrice, SQLFLT8); 

mssql_bind($query, "@maxPrice", $maxPrice, SQLFLT8); 

mssql_bind($query, "@lowestPricedProduct", &$lowProd, SQLVARCHAR, TRUE, FALSE, 40); 

mssql_bind($query, "@highestPricedProduct", &$highProd, SQLVARCHAR, TRUE, FALSE, 40);

// Bind the return value 

mssql_bind($query, "RETVAL", &$numProds, SQLINT2); 

mssql_execute($query);

echo "<h2>There were $numProds products returned.</h2>"; 

echo "The lowest priced product was $lowProd.<br>"; 

echo "The highest priced product was $highProd.";

?>

We start of by defining a number of variables that will be used with our calls to mssql_bind:

$minPrice = 0.00; 
$maxPrice = 35.00; 
$lowProd = ""; 
$highProd = ""; 
$numProds = 0;

[Note] When you are using mssql_bind to setup parameters for stored procedures, 
you cannot explicitly specify the value for that parameter. You must pass in a variable that 
contains the value instead. [End Note]

// Bind the parameters 

mssql_bind($query, "@minPrice", $minPrice, SQLFLT8); 

mssql_bind($query, "@maxPrice", $maxPrice, SQLFLT8);

We then create the input parameters @minPrice and @maxPrice with values taken from the 
$minPrice amd $maxPrice variables.

mssql_bind($query, "@lowestPricedProduct", &$lowProd, SQLVARCHAR, TRUE, FALSE, 40); 

mssql_bind($query, "@highestPricedProduct", &$highProd, SQLVARCHAR, TRUE, FALSE, 40);

For our output parameters, we have specified values for every parameter that the mssql_bind 
function accepts. Each of our output parameters needs to accept a value back from the stored procedure, 
so we pass $lowProd and $highProd by reference not value (the ampersand signifies the reference). 
They will contain the values of the output parameters from the stored procedure after it's executed.

Our parameters will hold the names of products, so we declare them as SQLVARCHAR types. 
They are output parameters, so we pass in TRUE for the is_output parameter, FALSE for the 
is_null parameter, and 40 for the maxlen parameter.

In our stored procedure, we return the total number of products whose UnitPrice field is between 
the values of the @minPrice and @maxPrice input parameters:

RETURN (SELECT COUNT(*) FROM Products WHERE UnitPrice >= @minPrice AND UnitPrice <= @maxPrice)

We access this return value as "RETVAL", returning its value to the $numProds variable, 
which we pass to mssql_bind by reference:

// Bind the return value 

mssql_bind($query, "RETVAL", &$numProds, SQLINT2);

When mssql_execute is called, the stored procedure executes and the values for the output parameters 
are returned as well as the RETVAL return value. We then output the results to the browser with the echo command:

mssql_execute($query);

echo "<h2>There were $numProds products returned.</h2>"; 

echo "The lowest priced product was $lowProd.<br>"; 

echo "The highest priced product was $highProd.";

Here's how prodsbyprice.php looks in my browser:


======================================
5. Using PHP and Oracle:
======================================


Note 1:
=======


Oracle/ PHP FAQ
$Date: 26-Jul-2003 $
$Revision: 1.01 $
$Author: Frank Naud� $


Topics
What is PHP and what's it got to do with Oracle? 
What is the difference between the OCI and ORA extension modules? 
How does one configure PHP to use Oracle? 
How does one connect to Oracle? 
Why do we get error "Call to undefined function: ora_logon()/ ocilogon()"? 
How does one SELECT, INSERT, UPDATE and DELETE data from PHP? 
How are database transactions handled in PHP? 
How are database errors handled in PHP? 
How does one call stored procedures from PHP? 
Does PHP offer Oracle connection pooling? 
Where can one find more info about PHP and Oracle? 

--------------------------------------------------------------------------------
Back to Oracle FAQ Index 
--------------------------------------------------------------------------------


What is PHP and what's it got to do with Oracle?
PHP is a recursive acronym for "PHP Hypertext Preprocessor". It is an open source, interpretive, HTML centric, server side scripting language. PHP is especially suited for Web development and can be embedded into HTML pages. PHP is comparable to languages such as JSP (Java Server Pages) and Oracle's PSP (PL/SQL Server Pages). 
This FAQ describes how PHP interacts with the Oracle Database. It assumes that the reader has PHP installed and working. To test if PHP is working, create a simple PHP document, say hello.php: 

<html>
<p>If PHP is working, you will see "Hello World" below:<hr>
<?php
   echo "Hello world";
   phpinfo();  // Print PHP version and config info
?>
</html>

Execute hello.php from command line (php hello.php) or open it from a web browser (http://localhost/hello.php) to see the output. If it's not working, PHP is not correctly installed and this FAQ will not help you. 

Note that current versions of Oracle's HTTP Server (Apache) does not ship with PHP (mod_php) pre-installed and that Oracle does not support mod_php or the language itself; however, it may support configurations that include mod_php. Future releases of Oracle iAS (Internet Application Server) will support PHP and include instructions on how to install and use PHP. 


What is the difference between the OCI and ORA extension modules?
PHP offers two extension modules that can be used to connect to Oracle: 
The normal Oracle functions (ORA); and 
the Oracle Call-Interface functions (OCI). 
OCI should be used whenever possible since it is optimised and provides more options. For example, ORA doesn't include support for CLOBs, BLOBs, BFILEs, ROWIDs, etc. 


How does one configure PHP to use Oracle?
Follow these steps to prepare your PHP installation for connecting to Oracle databases: 
Download PHP from www.php.net, install as per the install.txt file, and test if everything is working. 

Install the Oracle Client (or Server) software on your machine and configure SQL*Net to connect to your database(s). See the SQL*Net FAQ for details. 

Edit your php.ini file and uncomment the following two lines (only if your version shipped with pre-compiled extension modules): 
  ;extension = php_oci8.dll
  ;extension = php_oracle.dll

... otherwise, compile PHP with the following options: 
  --with-oracle=/path/to/oracle/home/dir
  --with-oci8=/path/to/oracle/home/dir


Ensure that your "extension_dir" parameter (in php.ini) points to the location where the above extension files reside. 

Write a small program to test connectivity - see the next question. 


How does one connect to Oracle?
Using the OCI Extension Module - 
<?php
if ($c=OCILogon("scott", "tiger", "orcl")) {
  echo "Successfully connected to Oracle.\n";
  OCILogoff($c);
} else {
  $err = OCIError();
  echo "Oracle Connect Error " . $err[text];
}
?>

Using the ORA Extension Module - 
<?php
if ($c=ora_logon("scott@orcl","tiger")) {
  echo "Successfully connected to Oracle.\n";
  ora_commitoff($c);
  ora_logoff($c);
} else {
  echo "Oracle Connect Error " . ora_error();
}
?>

NOTE: You might want to set your Oracle environment from within PHP before connecting, look at this example: 
<?php
  PutEnv("ORACLE_SID=ORCL");
  PutEnv("ORACLE_HOME=/app/oracle/product/9.2.0");
  PutEnv("TNS_ADMIN=/var/opt/oracle");
...

Please note that PHP will share/re-use connections if the same userid/password combination is used (more than once) on a particular "page" or httpd server session. One can use the OCINLogon() function to ensure one gets a new session. Use the OCIPLogon() function to make persistent connections. 


Why do we get error "Call to undefined function: ora_logon()/ ocilogon()"?
PHP is not using the correct extension module. Try compiling PHP with the following options: 
  --with-oracle=/path/to/oracle/home/dir
  --with-oci8=/path/to/oracle/home/dir

On Windows systems one can just uncomment the following lines in the php.ini file: 
  ;extension = php_oci8.dll
  ;extension = php_oracle.dll


How does one SELECT, INSERT, UPDATE and DELETE data from PHP?
The following example demonstrates how data can be SELECTed and manipulated via INSERT, UPDATE and DELETE statements: 
<?php
  $c=OCILogon("scott", "tiger", "orcl");
  if ( ! $c ) {
    echo "Unable to connect: " . var_dump( OCIError() );
    die();
  }

  // Drop old table...
  $s = OCIParse($c, "drop table tab1");
  OCIExecute($s, OCI_DEFAULT);

  // Create new table...
  $s = OCIParse($c, "create table tab1 (col1 number, col2 varchar2(30))");
  OCIExecute($s, OCI_DEFAULT);

  // Insert data into table...
  $s = OCIParse($c, "insert into tab1 values (1, 'Frank')");
  OCIExecute($s, OCI_DEFAULT);

  // Insert data using bind variables...
  $var1 = 2;
  $var2 = "Scott";
  $s = OCIParse($c, "insert into tab1 values (:bind1, :bind2)");
  OCIBindByName($s, ":bind1", $var1);
  OCIBindByName($s, ":bind2", $var2);
  OCIExecute($s, OCI_DEFAULT);

  // Select Data...
  $s = OCIParse($c, "select * from tab1");
  OCIExecute($s, OCI_DEFAULT);
  while (OCIFetch($s)) {
    echo "COL1=" . ociresult($s, "COL1") .
       ", COL2=" . ociresult($s, "COL2") . "\n";
  }

  // Commit to save changes...
  OCICommit($c);

  // Logoff from Oracle...
  OCILogoff($c);
?>


How are database transactions handled in PHP?
When using the OCI Extension Module, PHP will commit whenever ociexecute() returns successfully. One can control this behaviour by specifying OCI_COMMIT_ON_SUCCESS (the default) or OCI_DEFAULT as the second parameter to the ociexecute() function call. OCI_DEFAULT can be used to prevent statements from being auto-committed. The OCICommit() and OCIRollback() functions can then be used to control the transaction. 
Note that when OCI_DEFAULT is used on any statement handle, it is inherited by the other statement handles for the connection. You cannot use a mix of autocommit/explicit commit on the same connection handle. If you want to do that you need to use ociNLogon() to get a separate handle. 

The ORA Extension Module supports an autocommit mode. Use the ORA_CommitOn() and ORA_CommitOff() functions to toggle between autocommit mode and normal mode. When in normal mode (ORA_CommitOff), one can use the ORA_Commit() and ORA_Rollback() functions to control transactions. 

If one doesn't commit or rollback at the end of a script, PHP will do an implicit commit. This is consistent with the way SQL*Plus works. 


How are database errors handled in PHP?
When using the OCI extension Module, the OCIError() function can be used to obtain an array with error code, message, offset and SQL text. One can also obtain the error for a specific session or cursor by supplying the appropriate handle as an argument to OCIError(). Without any arguments, OCIError() will return the last encountered error. 
<?php
  $err = OCIError();
  var_dump($err);

  print "\nError code = "     . $err[code];
  print "\nError message = "  . $err[message];
  print "\nError position = " . $err[offset];
  print "\nSQL Statement = "  . $err[sqltext];
?>

When using the ORA Extension Module, one can use the ora_error() and ora_errorcode() functions to report errors: 
<?php
  print "\nError code = "    . ora_errorcode();
  print "\nError message = " . ora_error();
?>


How does one call stored procedures from PHP?
The following example creates a procedure with IN and OUT parameters. The procedure is then executed and the results printed out. 
<?php
  // Connect to database...
  $c=OCILogon("scott", "tiger", "orcl");
  if ( ! $c ) {
     echo "Unable to connect: " . var_dump( OCIError() );
     die();
  }

  // Create database procedure...
  $s = OCIParse($c, "create procedure proc1(p1 IN number, p2 OUT number) as " .
                    "begin" .
                    "  p2 := p1 + 10;" .
                    "end;");
  OCIExecute($s, OCI_DEFAULT);

  // Call database procedure...
  $in_var = 10;
  $s = OCIParse($c, "begin proc1(:bind1, :bind2); end;");
  OCIBindByName($s, ":bind1", $in_var);
  OCIBindByName($s, ":bind2", $out_var, 32); // 32 is the return length
  OCIExecute($s, OCI_DEFAULT);
  echo "Procedure returned value: " . $out_var;

  // Logoff from Oracle...
  OCILogoff($c);
?>


Does PHP offer Oracle connection pooling?
Unfortunately PHP does not offer connection pooling. One can open "persistent" Oracle connections 
with the ora_plogon() and OCIPLogon() function calls. Nevertheless, persistent connections do not 
scale as well as connection pooling. A persistent connection will be kept open for a process, 
but it will not allow connections to be shared between different processes. 
Third party tools like SQL Relay (http://sqlrelay.sourceforge.net/) can be used to enable 
connection pooling for Oracle and other databases. 


Note 2:
=======

Example 1. Basic query

<?php

  $conn = oci_connect('hr', 'hr', 'orcl');
  if (!$conn) {
   $e = oci_error();
   print htmlentities($e['message']);
   exit;
  }

  $query = 'SELECT * FROM DEPARTMENTS';

  $stid = oci_parse($conn, $query);
  if (!$stid) {
   $e = oci_error($conn);
   print htmlentities($e['message']);
   exit;
  }

  $r = oci_execute($stid, OCI_DEFAULT);
  if (!$r) {
   $e = oci_error($stid);
   echo htmlentities($e['message']);
   exit;
  }

  print '<table border="1">';
  while ($row = oci_fetch_array($stid, OCI_RETURN_NULLS)) {
   print '<tr>';
       foreach ($row as $item) {
         print '<td>'.($item?htmlentities($item):'&nbsp;').'</td>';
       }
       print '</tr>';
  }
  print '</table>';

  oci_close($conn);
?>  
 

Example 2. Insert with bind variables

<?php

  // Before running, create the table:
  //  CREATE TABLE MYTABLE (mid NUMBER, myd VARCHAR2(20));

  $conn = oci_connect('scott', 'tiger', 'orcl');

  $query = 'INSERT INTO MYTABLE VALUES(:myid, :mydata)';

  $stid = oci_parse($conn, $query);

  $id = 60;
  $data = 'Some data';

  oci_bind_by_name($stid, ':myid', $id);
  oci_bind_by_name($stid, ':mydata', $data);

  $r = oci_execute($stid);

  if ($r)
   print "One row inserted";

  oci_close($conn);

?>  
 

Example 3. Inserting data into a CLOB column

<?php

// Before running, create the table:
//    CREATE TABLE MYTABLE (mykey NUMBER, myclob CLOB);

$conn = oci_connect('scott', 'tiger', 'orcl');

$mykey = 12343;  // arbitrary key for this example;

$sql = "INSERT INTO mytable (mykey, myclob)
       VALUES (:mykey, EMPTY_CLOB())
       RETURNING myclob INTO :myclob";

$stid = oci_parse($conn, $sql);
$clob = oci_new_descriptor($conn, OCI_D_LOB);
oci_bind_by_name($stid, ":mykey", $mykey, 5);
oci_bind_by_name($stid, ":myclob", $clob, -1, OCI_B_CLOB);
oci_execute($stid, OCI_DEFAULT);
$clob->save("A very long string");

oci_commit($conn);

// Fetching CLOB data

$query = 'SELECT myclob FROM mytable WHERE mykey = :mykey';

$stid = oci_parse ($conn, $query);
oci_bind_by_name($stid, ":mykey", $mykey, 5);
oci_execute($stid, OCI_DEFAULT);

print '<table border="1">';
while ($row = oci_fetch_array($stid, OCI_ASSOC)) {
  $result = $row['MYCLOB']->load();
  print '<tr><td>'.$result.'</td></tr>';
}
print '</table>';

?>  
 

You can easily access stored procedures in the same way as you would from the command line. 

Example 4. Using Stored Procedures

<?php
// by webmaster at remoterealty dot com
$sth = oci_parse($dbh, "begin sp_newaddress( :address_id, '$firstname',
 '$lastname', '$company', '$address1', '$address2', '$city', '$state',
 '$postalcode', '$country', :error_code );end;");

// This calls stored procedure sp_newaddress, with :address_id being an
// in/out variable and :error_code being an out variable.
// Then you do the binding:

   oci_bind_by_name($sth, ":address_id", $addr_id, 10);
   oci_bind_by_name($sth, ":error_code", $errorcode, 10);
   oci_execute($sth);

?>  

Note 3:
=======

Get Started with Oracle and PHP 

By Sean Hull 

As this introduction explains, Oracle and PHP are a powerful combination for building fast, 
scalable web-based applications 

Oracle is a powerful database for building web-based applications. PHP is famous for being quick 
and efficient; it can help you build fast applications that are not going to weigh down your database. 
Pair Oracle up with PHP, and you get a powerhouse combination. 

PHP (an acronym for PHP: Hypertext Preprocessor) has grown by leaps and bounds into one of the most 
popular web programming languages around. (According to netcraft.com, Apache commands 55% of the 
entire web server market, and PHP claims 38% of all those Apache servers.) It has done so by allowing you 
to quickly and efficiently get the job done while providing sophisticated features for more 
complex applications. With efficiency and low overhead serving as its prime directives, 
it helps produce some of the fasted web-based applications around. PHP is also open source�you don't 
have to wait for the vendor to fix bugs, and plenty of peer review uncovers and irons out the ones there. 

PHP is basically a set of scripts�much like those you might write in Perl or Python�that you can directly 
embed in your HTML pages. In this approach, HTML serves as the basic framework for a page, while 
dynamic PHP code draws content and information from your Oracle database. 

PHP's compatability with Oracle is nothing new; in fact, Oracle was one of the first databases other 
than MySQL to which PHP could connect. Programmers have been building PHP applications for Oracle for 
years, usually by building Apache with PHP + Oracle support. What is new, however, is Oracle support 
for this combination and for users building PHP-based applications�including documentation on OTN, 
as well as Metalink support for installing mod_php with Oracle Application Server. 

In upcoming releases of Oracle Application Server 10g, Oracle plans to bump up this support further, 
including PHP on the release CDs and integrating it further with other Oracle products. 

Time to Get Started 

Before you can do anything with PHP, of course, you have to ensure that it is installed and working on 
your system. One quick and painless approach is to edit the file test.php, put this line in it: 


<?php phpinfo()?>


...and then point your browser to that page. For instance, if your server is called learningphp.com, 
your URL might be http://learningphp.com/test.php. If you get a "Not Found" error, you may have 
incorrectly specified the filename, its path, or something else in the URL. If you get a "Forbidden" error, 
the permissions on the file are probably not readable by the web server. (These are basic Apache 
configuration issues; see the "Next Steps" box for more information about Apache configuration.) 

Another serious problem you may encounter is that you see only the above three lines of code 
displayed in your browser. If that happens, either Apache does not have PHP loaded or your 
Apache configuration didn't specify to load it. In that case, take a look at your httpd.conf file and 
confirm that these lines are present: 


AddModule mod_php4.c

<IfModule mod_php4.c>
    AddType application/x-httpd-php4 .php4
    AddType application/x-httpd-php4-source .phps

</IfModule>


If after making these changes and restarting Apache you still can't load the test.php file, you probably 
need to consider compiling Apache and PHP, a task that is beyond the scope of this article. 

When you have this file loading properly, your browser will display all sorts of useful PHP + 
Apache configuration information. Refer to the php3.ini file for further fine-tuning of your PHP configuration. 

A Simple Example 

For our first example, I'll show you the traditional hello world program, and also ask the server 
to print the current time. You will quickly see how a piece of PHP code looks embedded within 
an HTML page. You will also see how a couple of simple PHP functions perform. 

First, edit a file in your document root called hello_world.php3, and put the following lines in that file: 


<HTML>
<TITLE>hello world</TITLE>
<BODY>

<!�Add comments here �>

<H1>Hello World</H1>  <p>

<H2>
<?php
$currtime = time ();
$currtimestr = strftime ("%H:%M:%S", $currtime);
echo "The current time is: $currtimestr";

?>
</H2>

</BODY>
</HTML>


When you're done, save the file and load it in your browser. Click Reload a few times until you're convinced that the time is dynamically determined. 

This little example also illustrates how variables are used in PHP: they can internally hold values during computations or strings while you're putting together an array of information. Variables can also get values into your script through the URL. 

Here's another example; save this code sample to variables.php3: 


<HTML>
<TITLE>First Variables Example</TITLE>
<BODY>

<?php echo "VALUE IS: $zipcode";?>

</BODY>

</HTML>


Now call the page with a URL such as http://myserver.com/variables.php3?zipcode=10003. 

A very simple example, but as you can see, variables used in a URL automatically become variables inside your PHP script. Later we'll see how to use this feature in PHP to our advantage. 

How To Use A Database 

Anyone developing a moderately complex web application quickly finds that persistent variables between pages won't hold information for any length of time. That's where the database comes in; Oracle is supremely well positioned to fit this need. 

So how do we tie PHP to Oracle? In keeping with the PHP tradition, the process is very simple: Provide username and password and database connection details, and you can quickly connect to Oracle to store and retrieve data. 

For example, assume we've created a table called "example" with two columns, first and last name respectively. (Be sure that ORACLE_HOME + ORACLE_SID are set for Apache user.) Connecting to an Oracle database is a simple matter: 


<HTML>
<TITLE>First Variables Example</TITLE>
<BODY>

<?php
$tns = 'sean';

$user = "scott@$tns";
$pass = 'tiger';
$q1 = 'SELECT * FROM example';
$conn = ora_logon($user, $pass);

$mycursor = ora_open ($conn);
ora_parse ($mycursor, $q1, 0);
ora_exec ($mycursor);

while (ora_fetch($mycursor)) {
  echo "RESULT: " . ora_getcolumn ($mycursor, 0) . ", " . ora_getcolumn ($mycursor, 1) . "<br>";
}
ora_close($mycursor);
?>

</BODY>
</HTML>


If you're planning to use the PEAR library of extensions with PHP (more on that later), you should take a look at PEAR DB. PEAR DB provides a database abstraction layer, which allows you to write one application and just change the conntion descriptor in order to connect to a different type of database (for example, to prototype on MySQL and use Oracle for production). Database abstraction means your application isn't tied to one database, but can work on numerous ones. However, it also means you're stuck with the lowest common denominator of features. So your mileage may vary. 

Overview Of Features 

Conditions, loops and operators 

Like all iterative programming languages, PHP supports conditions via if/then statements. Here's an example of syntax: 


if ($a > 100) {

   do this...
} elseif ($a > 50) {
   do that...
} else {
   do some other thing...
} 


It also supports similar functionality through switch, otherwise known as case statements. Then, of course, there are loops, supported by WHILE where the condition is specified first, DO...WHILE where the condition is specified last, and FOR where the number of iterations is specified at the outset. For example: 


while ($notdone == 0) {
   some code here...
   if (some condition) {
      $notdone = 1;
   }

}


As for operators, you'll find everything you would expect, including +, -, *, / for basic mathematical operations, % for modulus, and >, >=, <=, !=, and <> for comparison operators. Also included are logical operators provided by the keywords AND (&&), OR (||), and XOR,! for NOT. Note that = is assignment, while == is comparison for equality. You can also concatenate strings using the '.' operator. (Much of this functionality is similar to that of Perl and other scripting languages. See your PHP Manual for details.) 

Supported data types 

PHP supports constants via the DEFINE statement, integers, floating point values, strings, arrays of various types, and abstract object types used in object-oriented programming. When you create a variable, PHP sets the type based on its initial value. If you give a value of 2, for example, PHP will create an integer (2.0), a float ("2.0"), and a string. 

Regular Expressions 

One of PHP's best features is its support for regular expressions. Regular expressions are used in Unix via grep, egrep, awk, and scripting in languages such as sh, csh, and Perl. Basically, regular expressions specify a pattern against which the engine will compare to find sections of strings or replace them (or both). 

For example, suppose you have a web form where the user specifies an email address and you want to confirm that it is valid. (See the Security section to learn why it's so important to check input values in forms.) This little bit of code will do that for you: 


if (ereg ("^.+@.+\\..+$", $inEmail)) {
   do something with your email address...
} else {
   address is invalid, discard and/or return error
}


Oracle API 

Because so many web-based applications need to connect to Oracle on the back end, PHP has provided such support from its inception. There is also an Oracle8-specific API that allows you to use some of Oracle's more specialized features. 

If the application you are building really needs these features, go ahead and use them, but keep in mind that using this library will tie your application to this platform. You can always hold off and use these features as your application matures and demand more sophisticated functionality. 

XML API 

XML is essentially an abstraction of a markup language. If you think of HTML as a particular instance of one markup language, XML is a meta-HTML, if you will. It allows arbitrary definitions to be created to specify and breakup your particular content. For example, if you had an e-commerce site that lists products in a catalog, you could use tags like this: 


<products>
<item>
<prod_name>Blue Widget</prod_name>
<price>$29.95</price>
<description>
This widget is very blue in color, and good for many things.
</description>
</item>
<item>
<prod_name>Green Widget</prod_name>
<price>$24.95</price>
<description>
This widget is very green in color, and good for many things.
</description>

</item>
</products>


PHP provides sophisticated support for XML so you can put this great technology to use in your applications � such as to build a datafeed for your web site, or to create a format with which external information can be easily incorporated into your site. 

Image Manipulation 

Do your application needs include displaying or manipulating images of products, logos, maps, and people, or drawing graphs from information stored in your database? For all these purposes, PHP's image manipulation API provides support. Some of the features include managing Postscript files, manipulating fonts and colors, drawing and plotting points, and image sizing and modification. 

PHP and Security 

Numerous considerations exist when addressing security in a PHP + Oracle environment. Let's look at each one in turn. 

First and foremost, your network needs to be secure�perhaps via a firewall that permits traffic only through port 80 or port 443, those used for HTTP and HTTPS respectively. Securing your network might also involve setting up a VPN for remote users. 

In addition, however, you should also secure each of the machines behind that firewall by using good passwords, disabling unused ports and services, and keeping up with the latest patches for vulnerabilities. A good place to start is http://securityfocus.com, a clearinghouse of security information for all sorts of platforms. 

Of course, no web-based application is worth much without an Oracle database on the back end. In addition to all the standard OS-level security concerns, your database should make use of good passwords, and set file permissions so users other than 'oracle' cannot modify or mess around with the database files themselves. (Review http://metalink.oracle.com regularly for vulnerabilities, and related patches, and apply them as soon as possible.) But don't get too carried away: an Oracle database is only as secure as your UNIX-level permissions. Users, for instance, can view datafiles with a hex editor if they have read permissions to see them. 

In addition to these concerns, be sure to keep in mind those related to schemas, roles, and grants. As a rule of thumb, grant a user no permissions, and then gradually add only those that are necessary for them to do their job. The same goes for system privs, creating objects, and which objects the user can see. 

Finally, keep application-level security in mind. In fact, when building a web-based application, try to be paranoid: Keep brainstorming about what malicious things a hacker might try to do. Pete Finnigan has an excellent web site devoted to Oracle security, and one of his papers, "SQL Injection and Oracle," speaks directly to these concerns. 

Modular Programming With Classes and Objects 

Before object-oriented programming languages came along, the library was the preferred vehicle for reusing components. Along came C++, Java, and numerous other object-oriented languages, which provided a more sophisticated, less error-prone mechanism for code reuse. PHP provides this functionality as well, and that's a good thing �because any moderately complex web site has a need for it. 

For example, if you're building a financial application, you want to encapsulate all your financial formulas and routines into classes or libraries that can be called repeatedly. If you're building an e-commerce site, you'll want to encapsulate calculations for tax, credit card handling, and so on into separate libraries that can be included whenever needed. 

Let's examine a very simple example to demonstrate the concept in PHP. Here's a class I put together called someMath: 


<?

class someMath {

  var $defaultPower = 2;

  function someMath ($inPower) {
    $this->defaultPower = $inPower;
  }

  function defPower ($inNum) {
    return $this->doPower ($inNum, $this->defaultPower);
  }

  function doPower ($inNum, $inPower) {
    return pow ($inNum, $inPower);
  }

  function setPower ($inNum) {
    $this->defaultPower = $inNum;
  }

  function getPower () {
    return $this->defaultPower;

  }

}

?>


As you can see, PHP already provides a function to compute powers, so this class doesn't do much beyond demonstrating, in a very simple way, how you can encapsulate functionality into classes that you use throughout your code. 

The first function is the constructor, traditionally used to initialize variables in your class. The constructor always bears the same name as the class itself. Next you see two similar functions: one computes powers using the default power set in the class, and the other requires you to tell it what power you want each time it is called. Notice that defPower simply calls doPower with the classes $defaultPower variable included. Finally, setPower and getPower allow you to interact with variables that would otherwise be hidden from you outside of the class. 

Now we make use of our class with a bit more code: 


<HTML>
<TITLE>my class test</TITLE>
<BODY>

<H1>Testing someMath class</H1><p>


<H2>
<?php

require ("somemath.php3");
$mytest = new someMath (2);

$ansA = $mytest->defPower (10);
$ansB = $mytest->doPower (10,3);
$ansC = $mytest->doPower (2,8);
$ansD = $mytest->doPower (2,32);

echo "10 to the power of 2 is: $ansA<br>";
echo "10 to the power of 3 is: $ansB<br>";
echo "2 to the power of 8 is: $ansC<br>";
echo "2 to the power of 32 is: $ansD<p>";

?>
</H2>

</BODY>
</HTML>


#############################################################################################
#############################################################################################
#############################################################################################


=======================================================
Section 15: Extended PL/SQL examples and code snippets:
=======================================================


=================================
1. CHARACTER FUNCTIONS:
=================================


1.1 Basic character functions:
===============================


A character function is a function that takes one or more character values as parameters and returns 
either a character value or a number value.
Here some functions along with examples.


ASCII() AND CHR(): 
------------------

ASCII() Returns the ASCII code of a character.

  select ASCII('b') from dual;

  ASCII('B')
  ----------
          98


The "inverse" function, chr(), is very usefull, like in this example:

  translate(l_output , chr(13)||chr(10), 'XY' )

  to have carriage returns (chr(13)) turned into X (or whatever of course) and 
  linefeeds (chr(10)=newline) turned into Y (or whatever) 


CONCAT: 
-------

Concatenates two strings into one.

  SELECT ename||', who is the '||concat(job,' for our company')
  as "Name and role" FROM emp;

  Name and role
  ---------------------------------------------
  SMITH, who is the CLERK for our company
  ALLEN, who is the SALESMAN for our company
  WARD, who is the SALESMAN for our company
  etc..

  CONCAT ('abc', 'defg') ==> 'abcdefg'
  CONCAT (NULL, 'def') ==> 'def'

INITCAP:
--------

Sets the first letter of each word to uppercase. All other letters are set to lowercase. 

  INITCAP ('this is lower') ==> 'This Is Lower'
  INITCAP ('wHatISthis_MESS?') ==> 'Whatisthis_Mess?'
 
INSTR: 
------

The INSTR function searches a string to find a match for the substring and, if found, 
returns the position, in the source string, of the first character of that substring. 
If there is no match, then INSTR returns 0.

Use INSTR like INSTR(string1,string2,start_position,nth_appearance)
 
  INSTR ('bug-or-tv-character?archie', 'archie') ==> 21
  INSTR ('bug-or-tv-character?archie', 'ar', 14) ==> 21
  INSTR ('bug-or-tv-character?archie', 'archie', 1, 2) ==> 0


LENGTH: 
-------

Returns the length of a string.

  LENGTH (NULL) ==> NULL
  LENGTH ('') ==> NULL -- Same as a NULL string.
  LENGTH ('abcd') ==> 4
  LENGTH ('abcd ') ==> 5

 
LOWER:
------ 

Converts all letters to lowercase.

  LOWER ('BIG FAT LETTERS') ==> 'big fat letters'
  LOWER ('BIG fat LETters') ==> 'big fat letters'
 
LPAD:
-----

Pads a string on the left with the specified characters.

  LPAD ('55', 10, '0') ==> '0000000055'
  LPAD ('HITOP TIES', 45, 'sell!') ==>  'sell!sell!sell!sell!sell!sell!sell!HITOP TIES'

  SQL> select ename, lpad(ename,20,'-') from emp;

  ENAME      LPAD(ENAME,20,'-')
  ---------- ---------------------------------------
  SMITH      ---------------SMITH
  ALLEN      ---------------ALLEN
  WARD       ----------------WARD
  JONES      ---------------JONES
  MARTIN     --------------MARTIN

 
LTRIM:
-----

Trims the left side of a string of all specified characters.

  LTRIM ('    Way Out in Right Field') ==> 'Way Out in Right Field'

  my_string := '123123123LotsaLuck123';
  LTRIM (my_string, '123') ==> 'LotsaLuck123'

  my_string := '70756234LotsaLuck';               -- pay attention to this one !
  LTRIM (my_string, '0987612345') ==> 'LotsaLuck'

  LTRIM ('abcabcccccI LOVE CHILI', 'abc') ==> 'I LOVE CHILI'


You can also use it in DML queries, like

INSERT INTO IOB_KITAP_STAGING_HND
  (Volgnummer,RecordId,DatumTijd,Uitgevernummer,Automaatnummer,Afnemersupervisornummer,
   Pasvolgnummersupervisor,Landcodepas,Landcodeautomaat,Systeemcode,
   Tanknummer15,Produktnummer15,Litertotaal15,
   Tanknummer26,Produktnummer26,Litertotaal26,
   Tanknummer37,Produktnummer37,Litertotaal37,
   Tanknummer48,Produktnummer48,Litertotaal48)
  SELECT 
   Volgnummer,
   RecordId,
   to_date(DatumTijd,'DD-MM-YYYY;HH24:MI:SS'),
   Uitgevernummer,
   Automaatnummer,
   Afnemersupervisornummer,
   Pasvolgnummersupervisor,
   Landcodepas,
   Landcodeautomaat,
   Systeemcode,
   Tanknummer15,
   Produktnummer15,
   to_number(LTRIM(Litertotaal15,'-')),
   Tanknummer26,
   Produktnummer26,
   to_number(LTRIM(Litertotaal26,'-')),
   Tanknummer37,
   Produktnummer37,
   to_number(LTRIM(Litertotaal37,'-')),
   Tanknummer48,
   Produktnummer48,
   to_number(LTRIM(Litertotaal48,'-'))
   FROM IOB_KITAP_IMPORT_HND
   WHERE AUTOMAATNUMMER IS NOT NULL;

 
REPLACE:
-------- 
Replaces a character sequence in a string with a different set of characters. 

Use as REPLACE (string1 IN VARCHAR2, match_string IN VARCHAR2 [, replace_string IN VARCHAR2])

  REPLACE ('CAT CALL','C','K') ==> 'KAT KALL'
  REPLACE ('CAT CALL', 'C') ==> 'AT ALL'

  REPLACE (INITCAP ('ALMOST_UNREADABLE_VAR_NAME'), '_', NULL) ==> 'AlmostUnreadableVarName'

You can also use it in DML queries like

UPDATE CI_MD_CTL_L  
SET DESCR=REPLACE(DESCR,'''',CHR(7));

 
RPAD:
-----
Pads a string on the right with the specified characters. Similar to LPAD.

  RPAD ('55', 10, '0') ==> '5500000000'
  RPAD ('-', 60, '-')==>'------------------------------------------------------------'

RPAD (null, 1, '0') 

 
RTRIM:
------
Trims the right side of a string of all specified characters. Similar to LTRIM.

  RTRIM (`Way Out in Right Field                  ')==> 'Way Out in Right Field'

  my_string := 'Sound effects: BAM!ARGH!BAM!HAM';
  RTRIM (my_string, 'BAM! ARGH!') ==> 'Sound effects:'


SOUNDEX:
--------
Returns the "soundex" of a string.

 
SUBSTR:
------
Returns the specified portion of a string.

  SUBSTR ('now_or_never', 0, 3) ==> 'now'

Can also be used in DML queries like:

INSERT INTO IOB_COLOR_STAGING
  SELECT substr(importrecord,1,3), 
         substr(importrecord,4,8), 
         substr(importrecord,32,8)
  FROM IOB_COLOR_IMPORT
  WHERE substr(importrecord,1,3) IN ('101','104')
  AND importrecord IS NOT NULL;


select FLD_NAME from CI_MD_FLD_L where  substr(FLD_NAME,1,1)=''''


TRANSLATE:
----------
Translates single characters in a string to different characters.

TRANSLATE ('abcd', 'ab', '12') ==> '12cd'
TRANSLATE ('12345', '15', 'xx') ==> 'x234x'
TRANSLATE ('grumpy old possum', 'uot', '%$*') ==>   'gr%mpy $ld p$ss%m'
TRANSLATE ('my language needs the letter e', 'egms', 'X') ==>   'y lanuaX nXXd thX lXttXr X';
TRANSLATE ('please go away', 'a', NULL) ==> NULL


declare
x varchar2(64);
y varchar2(64);
 begin
 x:='hallo klote   zeg';
 y:=translate(x,' ','$');
 dbms_output.put_line(x);
 dbms_output.put_line(y);
 end;
/
hallo klote   zeg
hallo$klote$$$zeg

PL/SQL procedure successfully completed.

declare
x varchar2(64);
y varchar2(64);
 begin
 x:='hallo klote   zeg';
 y:=replace(translate(x,' ','$'),'$',' ');
 dbms_output.put_line(x);
 dbms_output.put_line(y);
 end;
/

hallo klote   zeg
hallo klote   zeg

PL/SQL procedure successfully completed.


declare
x varchar2(64);
y varchar2(64);
z varchar2(64);
 begin
 x:='hallo klote   zeg';
 y:=translate(x,' ','$');
 z:=replace(translate(x,' ','$'),'$');
 dbms_output.put_line(x);
 dbms_output.put_line(y);
 dbms_output.put_line(z);
 end;
/

hallo klote   zeg
hallo$klote$$$zeg
halloklotezeg

PL/SQL procedure successfully completed.

 
UPPER:
------
Converts all letters in the string to uppercase.

  UPPER ('short little letters no more') ==> 'SHORT LITTLE LETTERS NO MORE'
  UPPER ('123abc') ==> '123ABC'


1.2 Other functions:
====================

nvl function:
-------------

Substitutes a value for null in a column

Example:

  SELECT empno, ename, nvl(mgr,0) as MGR
  FROM emp;


DECODE function:
----------------

Example:

  SELECT ename ||' does the ' || decode(JOB, 'ANALYST', 'analyzing', 'CLERK', 'filing', 'goofing off')
  FROM emp;


ARITHMETIC functions:
---------------------

Some arithmetric functions

abs(x)
round(x,y)
ceil(x)
floor(x)
mod(x,y)   
sign(x)
sqrt(x)
trunc(x,y)
vsize(x)

  trunc(123.232, 2)= 123.23
  mod(10,2)=0
  mod(55,4)=3


======================
2. DATE functions:
=====================


2.1 Over NLS Settings:
======================

Bij Server: 

1. De database characterset wordt gespecificeerd bij CREATE DATABASE, maar:
2. De Sever kan wel meerdere locale in runtime laden uit files gespecificeerd in
   $ export ORA_NLSxx=$ORACLE_HOME/ocommon/nls/admin/data
3. Set de default NLS init.ora parameters t.b.v. de USER SESSIONS.


The database has a set of session-independent NLS parameters that are specified when the database is created. 
Two of the parameters specify the database character set and the national character set, 
that is an alternate Unicode character set that can be specified for NCHAR, NVARCHAR2, and NCLOB data. 
The parameters specify the character set that is used to store text data in the database. 
Other parameters, like language and territory, are used to evaluate check constraints.

If the client session and the database server specify different character sets, 
then the Oracle9i database converts character set strings automatically.

From a globalization support perspective, all applications are considered to be clients, 
even if they run on the same physical machine as the Oracle instance. For example, when SQL*Plus 
is started by the UNIX user who owns the Oracle software from the Oracle home in which the RDBMS software 
is installed, and SQL*Plus connects to the database through an adapter by specifying the 
ORACLE_SID parameter, SQL*Plus is considered a client. Its behavior is ruled by client-side NLS parameters.


client:

1. client heeft lokaal een NLS environment setting
2. client connect naar database, een session wordt gevormd, en de NLS enviroment wordt gemaakt
   aan de hAND van de NLS init.ora parameters.
   Is bij de clent de NLS_LANG environment variable gezet, dan communiceerd
   de client dat naar de server session. Hierdoor zijn beide hetzelfde.
   Is er geen NLS_LANG, dan gelden de init.ora NLS parameters voor de server session
3. De session NLS kan worden verANDert via ALTER SESSION. Dit heeft alleen effect
   op de PL/SQL en SQL statements executed op de server


init.ora parameters bij server    : invloed op sessions op server
environment variables bij client  : locale bij client, overrides session
alter session statement           : verANDert de session, overides init.ora
expliciet in SQL statement        : overides alles

Voorbeeld van override:

in init.ora:   NLS_SORT=ENGLISH
bij client:    ALTER SESSION SET NLS_SORT=FRENCH;

ALTER SESSION SET nls_date_format = 'dd/mm/yy'
ALTER SESSION SET NLS_DATE_FORMAT = 'DD-MON-YYYY'
ALTER SESSION SET NLS_LANGUAGE='ENGLISH';


priority:

1. expliciet in SQL
2. ALTER SESSION
3. environment variable
4. init.ora


NLS parameters, te zetten via:

NLS_CALENDAR             init.ora, env, alter session
NLS_COMP                 init.ora, env, alter session
NLS_CREDIT                -        env  -
NLS_CURRENCY             init.ora, env, alter session
NLS_DATE_FORMAT          init.ora, env, alter session
NLS_DATE_LANGUAGE        init.ora, env, alter session
NLS_DEBIT                 -        env  -
NLS_ISO_CURRENCY         init.ora, env, alter session
NLS_LANG                  -        env  -
NLS_LANGUAGE             init.ora, -  , alter session
NLS_LIST_SEPERATOR        -        env  -   
NLS_MONETARY_CHARACTERS   -        env  -
NLS_NCHAR                 -        env  -
NLS_NUMMERIC_CHARACTERS  init.ora, env, alter session
NLS_SORT                 init.ora, env, alter session
NLS_TERRITORY            init.ora, -  , alter session
NLS_DUAL_CURRENCY        init.ora, env, alter session

 
SELECT sysdate FROM dual;
15-MAR-01

 
2.2 Change Date format:
=======================
 
ALTER SESSION SET NLS_DATE_FORMAT='DD-MON-YYYY HH24:MI:SS';
ALTER SESSION SET NLS_DATE_FORMAT='dd/mm/yyyy';
ALTER SESSION SET NLS_DATE_FORMAT = 'dd/mm/yy'
ALTER SESSION SET NLS_DATE_FORMAT = 'dd-mm-yyyy'
ALTER SESSION SET NLS_DATE_FORMAT = 'DD-MON-YYYY'
ALTER SESSION SET NLS_LANGUAGE='ENGLISH';


Example 1:
---------

  SQL> select verbruikdatum from verbruik where verbruikid=859624;

  VERBRUIKD
  ---------
  11-JUN-02

  SQL> ALTER SESSION SET NLS_DATE_FORMAT = 'dd-mm-yyyy';
  
  Session altered.

  SQL> select verbruikdatum from verbruik where verbruikid=859624;

  VERBRUIKDA
  ----------
  11-06-2002

  SQL> ALTER SESSION SET NLS_DATE_FORMAT ='dd-MON-yyyy';

  Session altered.

  SQL> select verbruikdatum from verbruik where verbruikid=859624;

  VERBRUIKDAT
  -----------
  11-JUN-2002
 

Example 2:
----------

  ALTER SESSION SET NLS_LANGUAGE=Italian;

  Enter a SELECT statement:

  SQL> SELECT last_name, hire_date, ROUND(salary/8,2) salary FROM employees;

  You should see results similar to the following:

  LAST_NAME                 HIRE_DATE     SALARY
  ------------------------- --------- ----------
  Sciarra                   30-SET-97      962.5
  Urman                     07-MAR-98        975
  Popp                      07-DIC-99      862.5


Note that the month name abbreviations are in Italian.

Immediately after the connection has been established, if the NLS_LANG environment setting 
is defined on the client side, then an implicit ALTER SESSION statement synchronizes 
the client and session NLS environments.


2.3. Language- Territory support, and Date time formats:
========================================================

Language Support
The Oracle9i database enables you to store, process, and retrieve data in native languages. 
The languages that can be stored in an Oracle9i database are all languages written in scripts 
that are encoded by Oracle-supported character sets. Through the use of Unicode databases and datatypes, 
Oracle9i supports most contemporary languages.

Territory Support
The Oracle9i database supports cultural conventions that are specific to geographical locations. 
The default local time format, date format, and numeric and monetary conventions depend on the 
local territory setting. By setting different NLS parameters, the database session can use different 
cultural settings. For example, you can set British pound sterling (GBP) as the primary currency 
and the Japanese yen (JPY) as the secondary currency for a given database session even when 
the territory is defined as AMERICA.

Date and Time Formats
Different conventions for displaying the hour, day, month, and year can be handled 
in local formats. For example, in the United Kingdom, the date is displayed using 
the DD-MON-YYYY format, while Japan commonly uses the YYYY-MM-DD format.


2.4 ABOUT DATE FUNCTIONS:
=========================

All SQL functions whose behavior depends on globalization support conventions allow NLS parameters 
to be specified. These functions are:

TO_CHAR 
TO_DATE 
TO_NUMBER 
NLS_UPPER 
NLS_LOWER 
NLS_INITCAP 
NLSSORT 

Explicitly specifying the optional NLS parameters for these functions enables the functions to be 
evaluated independently of the session's NLS parameters. This feature can be important for 
SQL statements that contain numbers and dates as string literals.

You can only get into trouble when using literals like '01-JAN-1990'.

Take a look at the following examples and situations:

--  ----------------------------------------------------------------------------
  Example:
 
  SQL> alter session set nls_date_language='ITALIAN';

  Session altered.

  SQL> SELECT ename, hiredate FROM scott.emp WHERE hiredate > '01-JAN-1981';
  SELECT ename, hiredate FROM scott.emp WHERE hiredate > '01-JAN-1981'
                                                       *
  ERROR at line 1:
  ORA-01858: a non-numeric character was found where a numeric was expected
--  ----------------------------------------------------------------------------
  Example:

  SQL> alter session set nls_date_language='AMERICAN';

  Session altered.

  SQL> SELECT ename, hiredate FROM scott.emp WHERE hiredate > '01-JAN-1982';

  ENAME      HIREDATE
  ---------- ----------
  ALLEN      20-02-1981
  WARD       22-02-1981
  JONES      02-04-1981
  etc..
--  ----------------------------------------------------------------------------
  Example:
  SQL> alter session set nls_date_language='ITALIAN';

  Session altered.

  SQL> SELECT ename, hiredate FROM scott.emp WHERE hiredate > '01-SET-81';

  ENAME      HIREDATE
  ---------- ----------
  SMITH      17-12-1980
  ALLEN      20-02-1981
  WARD       22-02-1981
  etc..
--  ----------------------------------------------------------------------------
  Example:
  But queries can be made independent of the current date language by using a statement 
  similar to the following:

  SQL> alter session set nls_date_language='ITALIAN';

  Session altered.

  SELECT ename, hiredate FROM scott.emp WHERE hiredate > 
  TO_DATE('01-JAN-1982','DD-MON-YYYY', 'NLS_DATE_LANGUAGE = AMERICAN');

  ENAME      HIREDATE
  ---------- ----------
  SCOTT      19-04-1987
  ADAMS      23-05-1987
  MILLER     23-01-1982

The following NLS parameters can be specified in SQL functions:

NLS_DATE_LANGUAGE 
NLS_NUMERIC_CHARACTERS 
NLS_CURRENCY 
NLS_ISO_CURRENCY 
NLS_SORT 

--  ----------------------------------------------------------------------------
  Example:
  SQL> alter session set nls_date_language='ITALIAN';

  Session altered.

  SQL> SELECT ename, to_char(hiredate, 'DD-MM-YYYY;HH24:MI') 
    2  FROM scott.emp WHERE hiredate > '01-SET-81';

  ENAME      TO_CHAR(HIREDATE
  ---------- ----------------
  SMITH      17-12-1980;00:00
  ALLEN      20-02-1981;00:00
  WARD       22-02-1981;00:00
  etc..

  SQL> SELECT ename, to_char(hiredate, 'DD-MM-YYYY;HH24:MI') 
    2  FROM scott.emp WHERE hiredate > '01-JAN-81';
  FROM scott.emp WHERE hiredate > '01-JAN-81'
                                *
  ERROR at line 2:
  ORA-01858: a non-numeric character was found where a numeric was expected
--  ----------------------------------------------------------------------------
  Example:
  SQL> alter session set nls_date_language='ITALIAN';

  Session altered.

  SQL> select ADD_MONTHS ('28-JAN-1994', 2) from dual;
  select ADD_MONTHS ('28-JAN-1994', 2) from dual
                   *
  ERROR at line 1:
  ORA-01843: not a valid month

  SQL> alter session set nls_date_language='AMERICAN';

  Session altered.

  SQL> select ADD_MONTHS ('28-JAN-1994', 2) from dual;

  ADD_MONTH
  ---------
  28-MAR-94
-- -----------------------------------------------------------------------------
  Example:
  connect scott/tiger

  SQL> CREATE TABLE EMP2
    2  (
    3    EMPNO     NUMBER(4)                               NULL,
    4    ENAME     VARCHAR2(10 BYTE)                       NULL,
    5    JOB       VARCHAR2(9 BYTE)                        NULL,
    6    MGR       NUMBER(4)                               NULL,
    7    HIREDATE  DATE                                    NULL,
    8    HIREDATE2 DATE                                    NULL
    9  );

  Table created.

  SQL> insert into EMP2
    2  select empno, ename, job, mgr, hiredate, hiredate
    3  from emp;

  14 rows created.

  SQL> alter session set nls_date_language='ITALIAN';

  Session altered.

  SQL> select * from emp2 where hiredate2 > '01-JAN-1981';
  select * from emp2 where hiredate2 > '01-JAN-1981'
                                     *
  ERROR at line 1:
  ORA-01843: not a valid month

  SQL> select * from emp2 where hiredate2>hiredate;

  no rows selected

  SQL> alter session set nls_date_language='AMERICAN';

  Session altered.

  SQL> select * from emp2 where hiredate2 > '01-JAN-1981';

       EMPNO ENAME      JOB              MGR HIREDATE  HIREDATE2
  ---------- ---------- --------- ---------- --------- ---------
        7499 ALLEN      SALESMAN        7698 20-FEB-81 20-FEB-81
        7521 WARD       SALESMAN        7698 22-FEB-81 22-FEB-81
        7566 JONES      MANAGER         7839 02-APR-81 02-APR-81
        etc..

--  ----------------------------------------------------------------------------
  Example:
create or replace procedure datetest(datum IN date)
as
employee emp2%rowtype;
begin
  select * into employee from emp where hiredate=datum;
 dbms_output.put_line(employee.hiredate);
end;
/

  SQL> alter session set nls_date_language='AMERICAN';

  Session altered.

  SQL> exec datetest('17-DEC-80');
  17-DEC-80

  PL/SQL procedure successfully completed.


  SQL> alter session set nls_date_language='ITALIAN';

  Session altered.

  SQL> exec datetest('17-DEC-80');
  BEGIN datetest('17-DEC-80'); END;
                                     *
  ERROR at line 1:
  ORA-01843: not a valid month
  ORA-06512: at line 1


Explicit Conversion
To convert values from one datatype to another, you use built-in functions. 
For example, to convert a CHAR value to a DATE or NUMBER value, you use the function 
TO_DATE or TO_NUMBER, respectively. Conversely, to convert a DATE or NUMBER value to 
a CHAR value, you use the function TO_CHAR. For more information about these functions, 
see Oracle9i SQL Reference. 

Implicit Conversion
When it makes sense, PL/SQL can convert the datatype of a value implicitly. This lets you use 
literals, variables, and parameters of one type where another type is expected. 
In the example below, the CHAR variables start_time and finish_time hold string values 
representing the number of seconds past midnight. The difference between those values 
must be assigned to the NUMBER variable elapsed_time. So, PL/SQL converts the CHAR 
values to NUMBER values automatically. 

For instance, PL/SQL can convert the CHAR value '02-JUN-92' to a DATE value but cannot 
convert the CHAR value 'YESTERDAY' to a DATE value. 
Similarly, PL/SQL cannot convert a VARCHAR2 value containing alphabetic characters to a NUMBER value. 


2.5 The TO_CHAR and TO_DATE functions and examples:
===================================================


Several of the conversion functions (TO_CHAR, TO_DATE, and TO_NUMBER) use format models 
to determine the format of the converted data. Format models convert between strings and dates, 
and strings and numbers. This section discusses these format models, which are then put to use 
in the function descriptions. 


The to_char function converts a number or date to a string.
The syntax for the to_char function is:

to_char (value, [format_mask], [nls_language] )

value can either be a number or date that will be converted to a string. 
The format_mask is optional.  This is the format that will be used to convert value to a string.
The nls_language is optional.  This is the nls language used to convert value to a string. 

Examples:
---------

  to_char (1210.73, '9999.9') would return '1210.7' 
  to_char (1210.73, '9,999.99') would return '1,210.73' 
  to_char (1210.73, '$9,999.00') would return '$1,210.73' 
  to_char (21, '000099') would return '000021' 

  to_char (sysdate, 'yyyy/mm/dd'); would return '2003/07/09' 
  to_char (sysdate, 'Month DD, YYYY'); would return 'July 09, 2003' 
  to_char (sysdate, 'FMMonth DD, YYYY'); would return 'July 9, 2003' 
  to_char (sysdate, 'MON DDth, YYYY'); would return 'JUL 09TH, 2003' 
  to_char (sysdate, 'FMMON DDth, YYYY'); would return 'JUL 9TH, 2003' 
  to_char (sysdate, 'FMMon ddth, YYYY'); would return 'Jul 9th, 2003' 


  to_char (SYSDATE,'HH:MM')      ; would return '02:07'

  create or replace procedure test1 is
  begin
      if TO_CHAR(SYSDATE, 'DAY')='WEDNESDAY' then
         dbms_output.put_line('Today is wednesday');
      else
         dbms_output.put_line('other day');
      end if;
  end;
  /

  SQL> select to_char(SYSDATE,'HH:MM') from dual;

  TO_CH
  -----
  01:07

  SQL> select replace(to_char(SYSDATE,'HH:MM'),':',null) from dual;

  REPLA
  -----
  0107


You can use the TO_CHAR function with a 'date_format_mask': 
TO_CHAR(SYSDATE, 'x') 

DD
DAY
MON
MONTH
YY
YYYY
RR
RRRR
HH (am/pm)
HH24
etc..

It tells the TO_CHAR function *HOW* to display the string. 
       

Examples with the TO_DATE function:
-----------------------------------

Example 1:
----------

SQL> select TO_DATE ('123188', 'MMDDYY') from dual;

TO_DATE('
---------
31-DEC-88


Example 2:
----------

SQL> select TO_DATE ('123188', 'DDMMYY') from dual;
select TO_DATE ('123188', 'DDMMYY') from dual
                *
ERROR at line 1:
ORA-01843: not a valid month

So you see, the mask tells TO_DATE *HOW* to make a DATE from the string.
Here, the mask is obviously wrong.


Example 3:
----------

select TO_DATE ('123188', 'DDMMYY') from dual;

insert 
into   MKM_KPI_WAARDEN
(	   day_code,
	   kpi,
	   kpi_id,
	   kleur,
	   ool_id,
	   trend,
	   kleur_trend,
	   vorige_kleur,
	   vorige_kpi)
(	   select day_code,
	   		  0.880,
	   		  1081,
	   		  'GROEN',
	   		  901,
	   		  'STABIEL',
	   		  'GROEN',
	   		  'GROEN',
	   		  1
	   from   mkm_days 
	   where  day_code >= TO_DATE('01-07-2004', 'dd-mm-yyyy') 
	   AND 	  day_code <= TO_DATE('31-07-2004', 'dd-mm-yyyy'));


Example 4:
----------

INSERT INTO IOB_KITAP_STAGING_AUT 
  (Volgnummer,RecordId,DatumTijd,Landcodepas,automaatnummer,
   Tanknummer1,Produktnummer1,Litertotaal1,
   Tanknummer2,Produktnummer2,Litertotaal2,
   Tanknummer3,Produktnummer3,Litertotaal3,
   Tanknummer4,Produktnummer4,Litertotaal4,
   Tanknummer5,Produktnummer5,Litertotaal5)
  SELECT 
   Volgnummer,
   RecordId,
   to_date(DatumTijd,'DD-MM-YYYY;HH24:MI:SS'),
   Landcodepas,
   automaatnummer,
   Tanknummer1,
   Produktnummer1,
   to_number(LTRIM(Litertotaal1,'-')),
   Tanknummer2,
   Produktnummer2,
   to_number(LTRIM(Litertotaal2,'-')),
   Tanknummer3,
   Produktnummer3,
   to_number(LTRIM(Litertotaal3,'-')),
   Tanknummer4,
   Produktnummer4,
   to_number(LTRIM(Litertotaal4,'-')),
   Tanknummer5,
   Produktnummer5,
   to_number(LTRIM(Litertotaal5,'-'))
  FROM IOB_KITAP_IMPORT_AUT
  WHERE AUTOMAATNUMMER IS NOT NULL;


2.6 Other DATE functions:
=========================


add_months(x,y)       -- levert een datum op al x + y maanden
last_day(x)           -- levert de laatste dag van de maand 
months_between(x,y)   -- aantal maanden tussen x en y
new_time(x,y,z)       -- date-time x in zone y for zone z
next_day(x)           -- naam van de volgende dag bij datum x

SQL> select LAST_DAY(ADD_MONTHS(LAST_DAY(SYSDATE+1),1)) from dual;

LAST_DAY(
---------
31-AUG-04

SQL> select LAST_DAY(ADD_MONTHS(LAST_DAY(SYSDATE+1),1))+1 from dual;

LAST_DAY(
---------
01-SEP-04


2.7 TRUNC Function:
===================

If you are not sure about the time components of your date fields and variables and 
want to make sure that your operations on dates disregard the time component, TRUNCate them: 

IF TRUNC (request_date) BETWEEN TRUNC (start_date) AND TRUNC (end_date)
THEN
  ..


2.8 Some handy functions:
=========================

CREATE OR REPLACE FUNCTION WB_Cal_Yr ( v_date IN DATE ) RETURN NUMBER
IS
BEGIN
RETURN ( TO_NUMBER( TO_CHAR( v_date, 'YYYY' ) ) );
END WB_Cal_Yr;
/

CREATE OR REPLACE FUNCTION time24HMMSS  ( v_date IN DATE   ) RETURN VARCHAR2
IS
BEGIN
RETURN ( TO_CHAR( v_date, 'HH24:MI' ) );
END timepart;
/


CREATE OR REPLACE FUNCTION time24HHMM  ( v_date IN DATE   ) RETURN VARCHAR2
IS
BEGIN
RETURN ( TO_CHAR( v_date, 'HH24:MI' ) );
END time24HHMM;
/

select timepart(verbruiktijd) from verbruik where verbruikid=859624;
TIMEPART(VERBRUIKTIJD)
----------------------
15:22


CREATE OR REPLACE FUNCTION WB_Day_of_Month       ( v_date IN DATE   ) RETURN NUMBER
IS
BEGIN
RETURN ( TO_NUMBER( TO_CHAR( v_date, 'DD' ) ) );
END WB_Day_of_Month;
/

CREATE OR REPLACE FUNCTION WB_Day_of_Week        ( v_date IN DATE   ) RETURN NUMBER
IS
BEGIN
RETURN ( TO_NUMBER( TO_CHAR( v_date, 'D' ) ) );
END WB_Day_of_Week;
/

create or replace procedure a
is
x number;
begin
x:=WB_CAL_Yr(01/01/2004);
end;
/

create or replace procedure no_op
is
myvar number;
begin
myvar:=WB_Cal_Yr('14-FEB-2004');
dbms_output.put_line(TO_CHAR(myvar));
end;
/

create or replace procedure no_op (v_date IN DATE)
is
myvar number;
begin
myvar:=WB_Cal_Yr(v_date);
dbms_output.put_line(myvar);
end;
/

  How to use? for example:
  ------------------------

  SQL> exec no_op('05-05-2003');
  2003

  PL/SQL procedure successfully completed.


FUNCTION WB_Cal_Year_Name ( v_date IN DATE ) RETURN VARCHAR2
IS
BEGIN
RETURN ( TO_CHAR( v_date, 'fmYear' ) );
END WB_Cal_Year_Name;
/

FUNCTION WB_Day_of_Month ( v_date IN DATE ) RETURN NUMBER
IS
BEGIN
RETURN ( TO_NUMBER( TO_CHAR( v_date, 'DD' ) ) );
END WB_Day_of_Month;
/

FUNCTION WB_Day_of_Week ( v_date IN DATE   ) RETURN NUMBER
IS
BEGIN
RETURN ( TO_NUMBER( TO_CHAR( v_date, 'D' ) ) );
END WB_Day_of_Week;
/

FUNCTION WB_Day_of_Year ( v_date IN DATE   ) RETURN NUMBER
IS
BEGIN
RETURN ( TO_NUMBER( TO_CHAR( v_date, 'DDD' ) ) );
END WB_Day_of_Year;
/

FUNCTION WB_Hour12 ( v_date IN DATE   ) RETURN NUMBER
IS
BEGIN
RETURN ( TO_NUMBER( TO_CHAR( v_date, 'HH12' ) ) );
END WB_Hour12;
/ 


-- ----------------------------------------------------------------------

2.9 Weird example of filling a table with dates:
------------------------------------------------

declare
  i number := 1;
  j date;
  k varchar2(30);
  l varchar2(30);
  m varchar2(30);
  n number(9,2);
  o number;
  p varchar2(30);
begin
  while i<100000 loop
       j:=sysdate+i;
       k:=TO_CHAR(j,'DAY');
       o:=sin(i);
       n:=abs(round(o,4)*100);
         begin
          if TO_CHAR(j,'DAY') LIKE 'WED%' then
                 l:='DSM';
                 p:='Utrecht';
         else 
             if TO_CHAR(j,'DAY') LIKE 'THURS%' then
                 l:='AKZO';
                 p:='Alkmaar';
                 else 
                   if TO_CHAR(j,'DAY') LIKE 'FRI%' then
                   l:='McDonalds';
                   p:='Den Haag';
             else
                 l:='MACRO';
                 p:='Amsterdam';
             end if;
            end if;
            end if;
         end;

       insert into CUSTOMER
       (CUST_ID,CUST_NAME,CUST_CITY,ORDER_DATE,DAY,AMOUNT)
        values (i,l,p,j,k,n);

      i := i + 1;
         commit;
         
  end loop;
  commit;
end;
/


Or use something much better in unix and perl, like for example:

#!/usr/bin/perl

for ($i=1; $i < 10000; $i++ ) {

    print "insert into access_log values ( '$i', 'yada $i','klsdfkjl');\n"

}


=========
3. JOINS:
=========

3.1. Create three sample tables:
--------------------------------

In order to demonstrate the joins and summarizations in the next sections, 
let us first create some example tables.


create table LOC            -- table of locations
(
LOCID      int,
CITY       varchar2(16),
constraint pk_loc primary key (locid)
);


create table DEPT           -- table of departments
(
DEPID      int,
DEPTNAME   varchar2(16),
LOCID      int,
constraint pk_dept     primary key (depid),
constraint fk_dept_loc foreign key (locid) references loc(locid)
);


create table EMP            -- table of employees
(
EMPID      int,
EMPNAME    varchar2(16),
DEPID      int,
constraint pk_emp      primary key (empid),
constraint fk_emp_dept foreign key (depid) references dept(depid)
);


-- ---------------------------------------------------------------------

3.2. Now insert some sample records:
------------------------------------

INSERT INTO LOC VALUES (1,'Amsterdam');
INSERT INTO LOC VALUES (2,'Haarlem');
INSERT INTO LOC VALUES (3,null);
INSERT INTO LOC VALUES (4,'Utrecht');


INSERT INTO DEPT VALUES (1,'Sales',1);
INSERT INTO DEPT VALUES (2,'PZ',1);
INSERT INTO DEPT VALUES (3,'Management',2);
INSERT INTO DEPT VALUES (4,'RD',3);
INSERT INTO DEPT VALUES (5,'IT',4);

INSERT INTO EMP VALUES (1,'Joop',1);
INSERT INTO EMP VALUES (2,'Gerrit',2);
INSERT INTO EMP VALUES (3,'Harry',2);
INSERT INTO EMP VALUES (4,'Christa',3);
INSERT INTO EMP VALUES (5,null,4);
INSERT INTO EMP VALUES (6,'Nina',5);
INSERT INTO EMP VALUES (7,'Nadia',5);

-- ----------------------------------------------------------------------

3.3. Show whats in these tables:
--------------------------------

SELECT * FROM emp;
SELECT * FROM dept;
SELECT * FROM loc;

empid       empname          depid       
----------- ---------------- ----------- 
1           Joop             1
2           Gerrit           2
3           Harry            2
4           Christa          3
5           NULL             4
6           Nina             5
7           Nadia            5

(7 row(s) affected)

depid       deptname         locid       
----------- ---------------- ----------- 
1           Sales            1
2           PZ               1
3           Management       2
4           RD               3
5           IT               4

(5 row(s) affected)

locid       city             
----------- ---------------- 
1           Amsterdam
2           Haarlem
3           NULL
4           Utrecht

(4 row(s) affected)

-- ----------------------------------------------------------------------

3.4. Let's try some join statements:
------------------------------------

Query 1:
--------

SELECT deptname, city
FROM   dept, loc
WHERE  dept.locid=loc.locid;      -- or next, what is essentially the same query:

SELECT deptname, city FROM dept INNER JOIN loc
ON dept.locid=loc.locid;

Result 1:
--------

deptname         city             
---------------- ---------------- 
Sales            Amsterdam
PZ               Amsterdam
Management       Haarlem
RD               NULL
IT               Utrecht


Query 2:
--------

SELECT e.empid, e.empname, d.depid, d.deptname
FROM   emp e, dept d
WHERE  e.depid=d.depid;

SELECT e.empid, e.empname, d.depid, d.deptname 
FROM emp e INNER JOIN dept d ON e.depid=d.depid;

Result 2:
---------

empid       empname          depid       deptname         
----------- ---------------- ----------- ---------------- 
1           Joop             1           Sales
2           Gerrit           2           PZ
3           Harry            2           PZ
4           Christa          3           Management
5           NULL             4           RD
6           Nina             5           IT
7           Nadia            5           IT


So for example, Nina and Nadia are both in the IT department.

Query 3:
--------

SELECT e.empid, e.empname, d.depid, d.deptname, l.locid, l.city
FROM emp e INNER JOIN dept d ON e.depid=d.depid INNER JOIN loc l  ON d.locid=l.locid;

or the same query with parenthesis:

SELECT e.empid, e.empname, d.depid, d.deptname, l.locid, l.city
FROM ((emp e INNER JOIN dept d ON e.depid=d.depid) INNER JOIN loc l  ON d.locid=l.locid);


Result 3:
---------

empid       empname          depid       deptname         locid       city             
----------- ---------------- ----------- ---------------- ----------- ---------------- 
1           Joop             1           Sales            1           Amsterdam
2           Gerrit           2           PZ               1           Amsterdam
3           Harry            2           PZ               1           Amsterdam
4           Christa          3           Management       2           Haarlem
5           NULL             4           RD               3           NULL
6           Nina             5           IT               4           Utrecht
7           Nadia            5           IT               4           Utrecht

So both Nina and Nadia are in the IT department in Utrecht


Most time, inner joins deliver the correct answer in business problems.

An outer join, which can be a 'left outer join' or a 'right outer join',
accommodates the situation in which you want to display data
in a join statement, WHERE records FROM one table don't necessarilly all have
corresponding records in the other.

For example, table EMP contains 7 records. If we put the departmentid depid
of Christa to NULL, the following effect is visible:

UPDATE EMP
SET depid=null
WHERE empname='Christa';

SQL> select * from EMP;

     EMPID EMPNAME               DEPID
---------- ---------------- ----------
         1 Joop                      1
         2 Gerrit                    2
         3 Harry                     2
         4 Christa
         5                           4
         6 Nina                      5
         7 Nadia                     5

7 rows selected.

>> Now watch this INNER JOIN:

SQL> SELECT e.empid, e.empname, d.depid, d.deptname 
  2  FROM emp e INNER JOIN dept d ON e.depid=d.depid;

     EMPID EMPNAME               DEPID DEPTNAME
---------- ---------------- ---------- ----------------
         1 Joop                      1 Sales
         2 Gerrit                    2 PZ
         3 Harry                     2 PZ
         5                           4 RD
         6 Nina                      5 IT
         7 Nadia                     5 IT

6 rows selected.

The data of Christa is completely left out from the resultset !

>> Now watch this LEFT JOIN:

Query 4:
--------

SQL> SELECT e.empid, e.empname, d.depid, d.deptname
  2  FROM emp e LEFT JOIN dept d ON e.depid=d.depid;

     EMPID EMPNAME               DEPID DEPTNAME
---------- ---------------- ---------- ----------------
         1 Joop                      1 Sales
         3 Harry                     2 PZ
         2 Gerrit                    2 PZ
         5                           4 RD
         7 Nadia                     5 IT
         6 Nina                      5 IT
         4 Christa

7 rows selected.

So the LEFT JOIN says that we also want to see all records from the "left table"
which do not neccesarily have a matching id.

Some more examples:

Query 5:
--------

SELECT e.empid, e.empname, d.depid, d.deptname, l.locid, l.city
FROM emp e 
LEFT JOIN dept d ON e.depid=d.depid LEFT JOIN loc l  ON d.locid=l.locid;

Result 5:
---------

     EMPID EMPNAME               DEPID DEPTNAME              LOCID CITY
---------- ---------------- ---------- ---------------- ---------- ----------------
         2 Gerrit                    2 PZ                        1 Amsterdam
         3 Harry                     2 PZ                        1 Amsterdam
         1 Joop                      1 Sales                     1 Amsterdam
         5                           4 RD                        3
         6 Nina                      5 IT                        4 Utrecht
         7 Nadia                     5 IT                        4 Utrecht
         4 Christa

7 rows selected.


In general, use a LEFT JOINT to see all values FROM the "left" table 
even if it has possible NULL values in the common key. 
In general, use a RIGHT JOINT to see all values FROM the "right" table 
even if it has possible NULL values in the common key. 


We can also extent such queries easily with other clausules, like

- IN, NOT IN, or subqueries, or WHERE EXISTS:

Query 5. (use of IN or NOT IN)
------------------------------

SQL> SELECT e.empid, e.empname, d.depid, d.deptname
  2  FROM   emp e, dept d
  3  WHERE  e.depid=d.depid
  4  and    d.depid IN (1,5);

     EMPID EMPNAME               DEPID DEPTNAME
---------- ---------------- ---------- ----------------
         6 Nina                      5 IT
         7 Nadia                     5 IT
         1 Joop                      1 Sales


Query 6 (use of subquery)
-------------------------

select e.ename, d.location, d.depid
from   emp e, dept d
where  e.depid=d.depid
and    d.depid IN (SELECT depid from location where city='Amsterdam');

ename      salary      dname                location             deptno      
---------- ----------- -------------------- -------------------- ----------- 
Joop       2000        sales                Amsterdam            1
Klaas      1500        management           Amsterdam            2
Miranda    7000        management           Amsterdam            2
Nadia      1000        sales                Amsterdam            1

(4 records)


Query 7 (use of WHERE EXISTS)
-----------------------------

SELECT e.ename, e.salary
FROM employee e
WHERE exists (SELECT d.deptno FROM department d
              WHERE d.location='Amsterdam' and e.deptno=d.deptno); 

ename      salary      
---------- ----------- 
Joop       2000
Klaas      1500
Miranda    7000
Nadia      1000


3.3 SELFJOIN:
=============

This is a join using the same table, with 2 aliases in the query.
You can use this when there is a possibility that some slight differences
exists between some rows that would otherwise be duplicate records.

SELECT e.empno, e.ename, e.job
FROM emp e, emp e2
WHERE e.empno=e2.empno

SELECT e.empno, e.ename, e.job
FROM emp e, emp e2
WHERE e.empno<>e2.empno
and e.ename=e2.ename;


3.4 Tree query:
===============


CREATE TABLE STUDENTS
(     
StudentID        NUMBER(5,0)       NOT NULL,
Name             VARCHAR2(25),
Major            VARCHAR2(15),
GPA              NUMBER(6,3),
tutorid          NUMBER(5,0),
CONSTRAINT pk_studentid primary key (studentid)
);


INSERT INTO students VALUES (101, 'Bill', 'CIS', 3.45,  102);
INSERT INTO students VALUES (102, 'Mary', 'CIS', 3.10,  NULL);
INSERT INTO students VALUES (103, 'Sue',  'Marketing', 2.95, 102);
INSERT INTO students VALUES (104, 'Tom',  'Finance', 3.5, 106);
INSERT INTO students VALUES (105, 'Alex', 'CIS', 2.75, 106);
INSERT INTO students VALUES (106, 'Sam',  'Marketing', 3.25, 103);
INSERT INTO students VALUES (107, 'Jane', 'Finance', 2.90, 102);


CREATE TABLE COURSES
(
StudentID        NUMBER(5,0)       NOT NULL,
CourseNumber     VARCHAR2(15)      NOT NULL,
CourseName       VARCHAR2(25),
Semester         VARCHAR2(10),
Year             NUMBER(4,0),
Grade            VARCHAR2(2),
CONSTRAINT FK_STUDENTID foreign key (studentid) references students(studentid) );


INSERT INTO courses VALUES (101, 'CIS3400', 'DBMS I', 'FALL', 1997, 'B+');
INSERT INTO courses VALUES (101, 'CIS3100', 'OOP I', 'SPRING', 1999, 'A-');
INSERT INTO courses VALUES (101, 'MKT3000', 'Marketing', 'FALL', 1997, 'A');
INSERT INTO courses VALUES (102, 'CIS3400', 'DBMS I', 'SPRING', 1997, 'A-');
INSERT INTO courses VALUES (102, 'CIS3500', 'Network I', 'SUMMER', 1997, 'B');
INSERT INTO courses VALUES (102, 'CIS4500', 'Network II', 'FALL', 1997, 'B+');
INSERT INTO courses VALUES (103, 'MKT3100', 'Advertizing', 'SPRING', 1998, 'A');
INSERT INTO courses VALUES (103, 'MKT3000', 'Marketing', 'FALL', 1997, 'A');
INSERT INTO courses VALUES (103, 'MKT4100', 'Marketing II', 'SUMMER', 1998, 'A-');


Another form of recursive query is the tree query. A tree query decomposes the table 
such that each row is a node the tree and nodes are related in levels. 
Consider the Students table defined above. 

Bill tutors Alex, Mary and Sue. 
Mary tutors Liz and Ed 
Sue tutors Petra 

Using the SQL SELECT statements CONNECT BY and START WITH clauses, 
we can form a set of relationships between the rows of the table that form 
a tree structure. 

START WITH - indicates which row the tree should start with. 
CONNECT BY - indicates how successive related rows are to be identified and included in the result. 
LEVEL - a pseudo-column that indicates which level of the tree the current row is assigned to. 
The following example prints a tree structure modeled after the tutoring relationships 
in the Students table. We will start with Mary's student id (102) since no one tutors her. 

SELECT            LPAD(' ',2*(LEVEL-1)) || students.name
                  As TutorTree
FROM              students
START WITH        studentid = '102'
CONNECT BY PRIOR  studentid = tutorid;

TUTORTREE
--------------------------------------------------------------------------------
Mary
  Bill
  Sue
    Sam
      Tom
      Alex
  Jane

7 rows SELECTed.

FROM the tree we can see that Mary tutors Bill, Sue and Jane. 
In turn, Sue tutors Sam. Finally, Sam tutors both Tom and Alex. 


3.5 Some extended UPDATE examples:
==================================

In SQL Server you can have really easy UPDATE statements where
2 tables are involved. 
Suppose you have the following:


create table a
(
id   int not null,
name varchar(10))

create table b
(
id   int not null,
name varchar(10))

alter table a add constraint pk_a primary key (id)

alter table b add constraint pk_b primary key (id)

insert into a values (1, 'joop');
insert into a values (2, 'joop');

insert into b values (1, 'karel');
insert into b values (2, 'karel');

Now it's possible to update a from b, with a very easy statement:

update a
set name=b.name
from a,b
where a.id=b.id

In Oracle it's a little different.

Example 2:
----------

I have 2 tables and i need to update data in the first table with data from the second table.  
The following statement will not work but it will give you an idea of what i am trying to do.

UPDATE table1
   SET table1.code_id = table2.system_id
 WHERE table1.code_id = table2.code_id

The statement does not work.

Now try:

UPDATE 
(SELECT table1.code_id t1_code, table2.system_id t2_sys
 FROM   table1, table2
 WHERE  table1.code_id = table2.code_id)
SET t1_code = t2_sys;

This seems to be heading in the right direction.  
However now i'm getting error "ORA-01779 cannot modify a cloumn which maps 
to a non key-preserved table"

Answer:

You need a primary key/unique constraint on system_id in table2 to ensure that 
each row in table1 joins to AT MOST 1 row in table2

Try:

update table1
set code_id = (select system_id from table2 
               where table2.code_id = table1.code_id)
where exists (select system_id 
              from table2 
              where table2.code_id = table1.code_id)

This works.


Example 2:
----------

You want this:

update a
set name=b.name
from a,b
where a.id=b.id

In Oracle you use a statement like this:

UPDATE a
SET a.name = (SELECT b.name FROM b WHERE a.id = b.id );


Example 3:
----------

update (select x1.name as old_name,
                     y1.name as new_name
              from x1 inner join y1 on x1.name=y1.name)
set old_name=new_name;

Example 4:
----------

UPDATE (SELECT x.details AS old_details,
                     y.details AS new_details
              FROM x INNER JOIN y ON x.NAME=y.NAME and y.dept_id = 112)
SET old_details=new_details;

Example 5:
----------

UPDATE emp e 
  SET (ename, job, mgr, hiredate, sal, comm, deptno) =
    (SELECT ename, job, mgr, hiredate, sal, comm, deptno
     FROM emp_load el
     WHERE e.empno = el.empno)
  WHERE e.empno IN (SELECT empno FROM emp_load);


UPDATE /*+ USE_NL(e) INDEX(e) */ emp
     SET (ename, job, mgr, hiredate, sal, comm, deptno) = 
          (SELECT ename, job, mgr, hiredate, sal, comm, deptno
           FROM emp_load el
           WHERE e.empno = el.empno )
     WHERE e.empno IN ( SELECT empno FROM emp_load)

UPDATE /*+ USE_NL(e) INDEX(e) */ emp
    SET (ename, job, mgr, hiredate, sal, comm, deptno) =
        (SELECT ename, job, mgr, hiredate, sal, comm, deptno
         FROM emp_load el
         WHERE e.empno = el.empno )
    WHERE e.empno IN ( SELECT empno FROM emp_load)

Example 6:
----------

update verbruik
set verbruiktijd=null;

update VERBRUIK
set verbruiktijd = (select verbruiktijd from RESERVE_VERBRUIK
               where RESERVE_VERBRUIK.verbruikid = VERBRUIK.verbruikid)
where exists (select verbruiktijd 
              from RESERVE_VERBRUIK 
              where RESERVE_VERBRUIK.verbruikid = VERBRUIK.verbruikid);


3.6 IN, EXISTS, AND NOT IN, NOT EXISTS:
=======================================

You Asked (Jump to Tom's latest followup)

Hi Tom,

     Can you pls explain the diff between IN and EXISTS and NOT IN 
and NOT EXISTS. Because I have read that EXISTS will work better than
IN and NOT EXISTS will work better than NOT IN (read this is Oracle 
server tunning). 


and we said...

see

http://asktom.oracle.com/pls/ask/f?p=4950:8:::::F4950_P8_DISPLAYID:953229842074


It truly depends on the query and the data as to which is BEST.

Note that in general, NOT IN and NOT EXISTS are NOT the same!!!


select count(*) from emp where empno not in ( select mgr from emp );

  COUNT(*)
----------
         0

apparently there are NO rows such that an employee is not a mgr -- everyone is 
a mgr (or are they)


select count(*) from emp T1
where not exists ( select null from emp T2 where t2.mgr = t1.empno );

  COUNT(*)
----------
         9


Ahh, but now there are 9 people who are not managers.  Beware the NULL value and 
NOT IN!!  (also the reason why NOT IN is sometimes avoided).  


NOT IN can be just as efficient as NOT EXISTS -- many orders of magnitude BETTER 
even -- if an "anti-join" can be used (if the subquery is known to not return 
nulls)


Tom,

Instead of

select count(*) from emp T1
where not exists ( select null from emp T2 where t2.mgr = t1.empno );

you could have used 

select count(*) from emp T1
where not exists ( select mgr  from emp T2 where t2.mgr = t1.empno );

Could you tell what circumstances do we use "select null"
instead of "select <value>". Are there any advantages
 

Followup:  
why select mgr?

I find select null to be semantically more meaningful.  You are NOT selecting 
anything really -- so say that. 
 

Hi Tom,
       Your answer is superb. Can you tell us why there is no record selected 
for NOT IN when there is NULL?

 
Followup:  
Because NULL means -- gee, I don't know.  (litterally, null means Unknown)


So, the predicate

where x not in ( NULL )

evaluates to neither TRUE, nor FALSE


ops$tkyte@ORA817DEV.US.ORACLE.COM> select * from dual where dummy not in ( NULL 
);

no rows selected

ops$tkyte@ORA817DEV.US.ORACLE.COM> select * from dual where NOT( dummy not in 
(NULL) );

no rows selected


(you would think one of the two queries would return a row -- but there is a 
third state for a boolean expression in sql -- "I don't know what the answer 
is") 

 
Other example of IN, NOT IN, EXISTS, NOT EXISTS:
================================================


create table A
(id int,
name varchar2(10),
datum date);

create table B
(id int,
name varchar2(10),
datum date);


insert into A values (1,'piet',sysdate);
insert into A values (2,'piet',sysdate-1);
insert into A values (3,'klaas',sysdate-2);
insert into A values (4,'gerrit',sysdate-3);

insert into B values (1,'piet',sysdate);
insert into B values (2,'piet',sysdate-1);
insert into B values (3,'snoopy',sysdate-2);
insert into B values (4,'gerrit',sysdate-3);

SQL> select * from A;

        ID NAME       DATUM
---------- ---------- ---------
         1 piet       14-SEP-04
         2 piet       13-SEP-04
         3 klaas      12-SEP-04
         4 gerrit     11-SEP-04

SQL> select * from B;

        ID NAME       DATUM
---------- ---------- ---------
         1 piet       14-SEP-04
         2 piet       13-SEP-04
         3 snoopy     12-SEP-04
         4 gerrit     11-SEP-04


select * from A where
id NOT IN (select id from B where id is not null) AND
name NOT IN (select name from B where name is not null) ;

no rows selected 

select * from A where
id NOT IN (select id from B where id is not null) OR
name NOT IN (select name from B where name is not null) ;

        ID NAME       DATUM
---------- ---------- ---------
         3 klaas      29-MAR-05

select * from B where
id NOT IN (select id from A where id is not null) AND
name NOT IN (select name from A where name is not null) ;

no rows selected

select * from B where
id NOT IN (select id from A where id is not null) OR
name NOT IN (select name from A where name is not null) ;

        ID NAME       DATUM
---------- ---------- ---------
         3 snoopy     29-MAR-05

update A set ID=5 where name='klaas';

SQL> select * from A;

        ID NAME       DATUM
---------- ---------- ---------
         1 piet       14-SEP-04
         2 piet       13-SEP-04
         5 klaas      12-SEP-04
         4 gerrit     11-SEP-04


select * from A where
id NOT IN (select id from B where id is not null) AND
name NOT IN (select name from B where name is not null) ;

        ID NAME       DATUM
---------- ---------- ---------
         5 klaas      12-SEP-04


select * from A where
id NOT IN (select id from B where id is not null) AND
name NOT IN (select name from B where name is not null) AND
trunc(datum) NOT IN (select trunc(datum) from B where datum is not null) ;

no rows selected

select * from A where
id IN (select id from B where id is not null) AND
name IN (select name from B where name is not null) ;


        ID NAME       DATUM
---------- ---------- ---------
         4 gerrit     11-SEP-04
         1 piet       14-SEP-04
         2 piet       13-SEP-04

-------------------------------------------------------

select * from A where
id NOT IN (select id from B where id is not null) AND
name NOT IN (select name from B where name is not null) ;

        ID NAME       DATUM
---------- ---------- ---------
         5 klaas      12-SEP-04


select * from A where
NOT EXISTS (select id from B where a.id=b.id) AND
NOT EXISTS (select name from B where a.name=b.name) ;

        ID NAME       DATUM
---------- ---------- ---------
         5 klaas      12-SEP-04


=================================
4. Group and aggregate functions:
=================================


4.1 simple use:
===============

avg(x)
count(x)
max(x)
min(x)
stddev(x)
sum(x)
variance(x)
etc..

SELECT avg(sal) FROM emp;
SELECT count(*) FROM emp;

Those functions do not include NULL columns values.


4.2 Using the "GROUP BY" clause:
================================

SELECT deptno, avg(sal)
FROM emp
GROUP BY deptno;

SELECT deptno, job, avg(sal)
FROM emp
GROUP BY deptno, job
ORDER BY job;

SELECT deptno, job, avg(sal)
FROM emp
GROUP BY deptno, job
ORDER BY avg(sal);

SELECT deptno, sum(sal)
FROM emp
GROUP BY deptno;

SELECT deptno, job, sum(sal)
FROM emp
GROUP BY deptno, job;


4.3 Rollup and cube:
====================

Rollup: (1 dimension)

xx j1   --
xx j2   --
-- tot  xx
yy j1   --
yy j2   --
yy j3   --
-- tot  yy
   tot  xy


This is OK:

SELECT deptno, job, sum(sal)
FROM emp
GROUP BY rollup(deptno, job);

This is wrong:

SELECT deptno, job, sum(sal)
FROM emp
GROUP BY rollup(deptno);


Cube: (n dimensions)

xx  j1    --
xx  j2    --
--  tot   xx
yy  j1    --
yy  j2    --
yy  j3    --
--  tot   yy
    totj1 aa
    totj2 bb
    totj3 cc

SELECT deptno, job, sum(sal)
FROM emp
GROUP BY cube(deptno, job);


4.4 Having clause:
==================

Once the data is grouped using the "GROUP BY" statement, it is sometimes
usefull to weed out unwanted data.

The HAVING clause, acts for the GROUP BY clause, as the WHERE clause.


SELECT deptno, avg(sal)
FROM emp
GROUP BY deptno
HAVING deptno>10;

SELECT deptno, avg(sal)
FROM emp
GROUP BY deptno
HAVING avg(sal)>2000;

    Notice the difference with the following:
    
    SELECT deptno, avg(sal)
    FROM emp
    WHERE sal>2000    -- WHERE avg(sal)>2000: not allowed
    GROUP BY deptno;


==============
5. Subqueries:
==============

5.1 General:
============

This is a SELECT statement within a SELECT statement, designed to limit
the resultset. In most cases you can find the second query in the WHERE clause of the
parent query. But also in the FROM clause is possible.

SELECT ename, deptno, sal
FROM emp
WHERE deptno=(SELECT deptno FROM dept WHERE loc='NEW YORK');


    This query resolves to a query like

    SELECT ename, deptno, sal
    FROM emp
    WHERE deptno=10;


SELECT ename, deptno, sal
FROM emp
WHERE deptno in (SELECT deptno FROM dept WHERE loc='NEW YORK');


5.2 With the "WHERE exists" clause:
===================================

These type of subqueries resolve INTO the statement:

SELECT .. FROM .. WHERE exists (is TRUE)

   SELECT ename, deptno, sal
   FROM emp
   WHERE exists (SELECT deptno FROM dept WHERE loc='NEW YORK');

   This query will NOT deliver the correct result, because it actually
   resolves in

   SELECT ename, deptno, sal
   FROM emp
   WHERE exists (is TRUE);

   which will turn up much more records than we are interrested in.

This one works:

SELECT e.ename, e.job, e.sal
FROM emp e
WHERE exists (SELECT d.deptno FROM dept d
              WHERE d.loc='NEW YORK' and e.deptno=d.deptno); 


5.3 Types of subqueries:
========================


5.3.1 Single row subqueries:
----------------------------

The main query expects the subquery to return only one values.
We have seen these before.

SELECT ename, deptno, sal
FROM emp
WHERE deptno=(SELECT deptno FROM dept WHERE loc='NEW YORK');

We have use the "=' operator and therefore the subquery MUST return ONLY 1 value.


5.3.2  Multi row subqueries:
----------------------------

In this type of query, the parent query can expect more than one values.

SELECT ename, job, sal
FROM emp
WHERE deptno in (SELECT deptno FROM dept
                 WHERE dname in ('ACCOUNTING','SALES'));

SELECT deptno, job, avg(sal)
FROM emp
GROUP BY deptno, job
HAVING avg(sal)>(SELECT sal FROM emp WHERE ename='MARTIN');


5.3.3 Null values:
------------------

Suppose ename='KING' has a deptno=null

SELECT deptno, ename, job, sal
FROM emp
WHERE (deptno, sal) in (SELECT deptno, max(sal) FROM emp
                        GROUP BY deptno);

Een subquery geeft geen null terug. Daarom zal KING in de resultset
niet te zien zijn.

Herschrijf de query naar een correlated subquery:

select e.deptno, e.ename, e.job, e.sal
from emp e
where e.sal=(select max(e2.sal) from emp e2 where e.deptno=e2.deptno)


==========
6. PL/SQL:
==========


6.1 SIMPLE FUNCTIONS AND PROCEDURES:
====================================

6.1.1 Identifiers:
------------------

You use identifiers to name PL/SQL program items and units, which include constants, 
variables, exceptions, cursors, cursor variables, subprograms, and packages. 
Some examples of identifiers follow: 

X
t2
phone#
credit_limit
LastName
oracle$number
program5


An identifier consists of a letter optionally followed by more letters, numerals, 
dollar signs, underscores, and number signs. Other characters such as hyphens, slashes, 
and spaces are illegal, as the following examples show: 


this is wrong:

              mine&yours    -- illegal ampersand
              debit-amount  -- illegal hyphen
              on/off        -- illegal slash
              user id       -- illegal space

this is OK:
             money$$$tree
             SN## 
             try_again_


6.1.2 VARIABLE DECLARATION:
---------------------------


Variable declaration:
---------------------

part_no  NUMBER(4) ;
in_stock BOOLEAN   ;


constrained: 
               itty_bitty_# NUMBER(1);

unconstrained:
               no_limits_here NUMBER;


When you declare a scalar variable (a variable with a scalar or noncomposite datatype), 
you can provide a default or initial value for that variable. In the following example, 
I declare the total_sales variable and initialize it to zero using both the DEFAULT syntax 
and the assignment operator: 

total_sales NUMBER (15,2) := 0;


Constant declaration:
---------------------

pi constant number:=3.14;

next_tax_filing_date CONSTANT DATE := '15-APR-96';


6.1.3 ASSIGNING VALUES TO VARIABLES:
------------------------------------

- Via assignment operator := 

  tax := price * tax_rate;
  bonus := current_salary * 0.10;
  amount := TO_NUMBER(SUBSTR('750 dollars', 1, 3));
  valid := FALSE;

- via SELECT 

SELECT sal * 0.10 INTO bonus FROM emp WHERE empno = emp_id;


Default value taken from Oracle Forms bind variable:

  call_topic VARCHAR2 (100) DEFAULT :call.description;

Default value is the result of the expression:

  order_overdue CONSTANT BOOLEAN :=
   ship_date > ADD_MONTHS (order_date, 3) OR
   priority_level (company_id) = 'HIGH';


not null clause:

  company_name VARCHAR2(60) NOT NULL DEFAULT 'PCS R US';


6.1.4 SIMPLE FUNCTIONS:
-----------------------

Example 1:
----------

create or replace function area_of_circle(p_radius in number) return number
as

my_area number default 0;
pi constant number:=3.14;
begin
 my_area:=p_radius*p_radius*pi;
 return my_area;
end;
/

Output:

  SELECT area_of_circle(5) FROM dual;

  It's NOT !!! possible to execute area_of_circle like this:

  SQL>execute area_of_circle(5);


NOTE 1: So, functions are most of the time specialized code units, to be used from within
        other procedures.

Example 2:
----------

create or replace procedure area_proc(p_radius in number) 
is

my_area number default 0;
begin
 my_area:=area_of_circle(p_radius);
 dbms_output.put_line('Dit is het resultaat: '||my_area);
end;
/


  SQL> exec area_proc(7);
  Dit is het resultaat: 153,86

  PL/SQL-procedure is geslaagd.


6.1.5 SIMPLE PROCEDUREs:
------------------------

create or replace procedure no_op(p_var in number, p_var2 out number) is
begin
p_var2:=p_var;
dbms_output.put_line(TO_CHAR(p_var2));
end;
/


Now try this:

SQL> exec no_op(1);
BEGIN no_op(1); END;

      *
ERROR at line 1:
ORA-06550: line 1, column 7:
PLS-00306: wrong number or types of arguments in call to 'NO_OP'
ORA-06550: line 1, column 7:
PL/SQL: Statement ignored


!!! procedures die GEEN return values geven kun je simpleweg zo starten:

  SQL>execute no_op;

!!! procedures DIE WEL return values (<variable_name> out <type>) terug geven:

  declareer altijd een variabele om daarin de return value van de procedure
  te plaatsen.

-- --------------------------------------------------------

create or replace procedure no_op (p_var1 in number, p_var2 out number) is
begin
p_var2:=2 * p_var1;
end;
/

declare
myvar1 number;
myvar2 number;
begin
  no_op(5, myvar2);
  dbms_output.put_line(TO_CHAR(myvar2));
end;
/

-- --------------------------------------------------------

create or replace procedure no_op (p_var1 in number, p_var2 out number, p_var3 out number) is
begin
p_var2:=2 * p_var1;
p_var3:=3 * p_var1;
end;
/

declare
myvar2 number;
myvar3 number;
begin
  no_op(5, myvar2, myvar3);
  dbms_output.put_line(TO_CHAR(myvar2));
  dbms_output.put_line(TO_CHAR(myvar3));
end;
/

-- --------------------------------------------------------


create or replace procedure no_op (p_var1 in number, p_var2 out number, p_var3 out number) is
begin
p_var2:=2 * p_var1;
p_var3:=3 * p_var1;
end;
/

declare
myvar2 number;
myvar3 number;
begin
  no_op(5, myvar2, myvar3);
  dbms_output.put_line(TO_CHAR(myvar2));
  dbms_output.put_line(TO_CHAR(myvar3));
end;
/


Example:
--------

create or replace procedure test6 (p_var1 in number) is
p_var2 number;
begin
  p_var2:=10 * p_var1;
  dbms_output.put_line(TO_CHAR(p_var2));
end;
/

This you can execute right away:

SQL> exec test6(4);
40

PL/SQL procedure successfully completed.

Example:
--------

create or replace procedure test8 is
p_var2 number;
begin
  p_var2:=area_of_circle(10);
  dbms_output.put_line(TO_CHAR(p_var2));
end;
/

Example:
--------

create or replace procedure ins_sales_parm(id in number, name in varchar2)
as
begin
insert into sales values (id,name);
end;
/

Procedure is aangemaakt.

SQL> exec ins_sales_parm(5,'piet');

Example:
--------

FUNCTION clearImportTable
	    RETURN boolean
IS
BEGIN
  BEGIN
    DELETE FROM STG_IMPORT;
    EXCEPTION
      WHEN OTHERS THEN
        return false;
  END;
  return true;
END clearImportTable;


6.2 IF THEN ELSE:
=================


Example 1:
----------

IF caller_type = 'VIP'
THEN
   generate_response ('EXPRESS');

ELSIF caller_type = 'BILL_COLLECTOR'
THEN
   generate_response ('THROUGH_CHICAGO');

ELSIF caller_type = 'INTERNATIONAL'
THEN
   generate_response ('AIR');

ELSE
   generate_response ('NORMAL');
END IF;

Example 3:
----------

IF <condition1>
THEN
   IF <condition2>
   THEN
      <statements2>
   ELSE
      IF <condition3>
      THEN
         <statements3>
      ELSIF <condition4>
      THEN
         <statements4>
      END IF;
   END IF;
END IF;

Here inside checking will not be done if the outer test is not true.


Example 4:
----------
set serveroutput on

begin
  if TO_CHAR(SYSDATE, 'DAY')='WEDNESDAY' then
     dbms_output.put_line('Today is wednesday');
  else 
     if TO_CHAR(SYSDATE, 'DAY')='VRIJDAG' then
     dbms_output.put_line('Today is friday');
  else
     if TO_CHAR(SYSDATE, 'DAY')='MONDAY' then
     dbms_output.put_line('Today is saturday');
  end if;
 end if;
end if;
end;

Example 5:
----------

begin
for mynum in 0..4 loop

   if     mynum=1 then my_team(mynum):='SMITH';
   elsif  mynum=2 then my_team(mynum):='JONES';
   elsif  mynum=3 then my_team(mynum):='TURNER';
   elsif  mynum=4 then my_team(mynum):='KING'; 
   end if;

end loop;


Also inserts, updates, deletes to tables are possible

DECLARE
TEMP_COST NUMBER(10,2);
BEGIN
   SELECT COST FROM JD11.BOOK INTO TEMP_COST WHERE ISBN = 21;
   IF TEMP_COST > 0 THEN
      UPDATE JD11.BOOK SET COST = (TEMP_COST*1.175) WHERE ISBN = 21;
   ELSE 
      UPDATE JD11.BOOK SET COST = 21.32 WHERE ISBN = 21;
   END IF; 
COMMIT;
EXCEPTION
   WHEN NO_DATA_FOUND THEN
      INSERT INTO JD11.ERRORS (CODE, MESSAGE) VALUES(99, 'ISBN 21 NOT FOUND');
END;


6.3 LOOPS:
==========

loop
         statements
end loop;


6.3.1 loop - if condition then exit
-----------------------------------

create or replace procedure test_1 AS

x number:=5;

begin
  loop
  dbms_output.put_line('Ik heb dit '||TO_CHAR(x)||' gedaan.');
  x:=x-1;
  if x=0 then exit;
  end if;
  end loop;
end;
/


6.3.2: loop - exit when condition
---------------------------------

declare 
   x number:=5;
begin

loop
  dbms_output.put_line('Ik heb dit '||TO_CHAR(x)||' gedaan.');
  x:=x-1;
  exit when x=0;
end loop;
end;
/


6.3.3: while condition loop
---------------------------

declare 
   x number:=5;
begin

  while x>0 loop
  dbms_output.put_line('Ik heb dit '||TO_CHAR(x)||' gedaan.');
  x:=x-1;
  end loop;
end;
/


declare
  i number := 1000000;
begin

  while i>1 loop
       insert INTO customers
        values (1, 'joop');

      i := i - 1;
         commit;
         
  end loop;
  commit;
end;
/


6.3.4: for condition loop
-------------------------

declare
     -- nothing to declare
begin
  for x in 0..4 loop
  dbms_output.put_line('Ik heb dit '||TO_CHAR(x)||' gedaan.');
  end loop;
end;
/

-- Note that you do not need to declare x

NESTED LOOP:

create or replace procedure loop4 is
begin
     for x in 0..3 loop
         for y in 0..2 loop
         dbms_output.put_line(TO_CHAR(x)||','||TO_CHAR(y));
         end loop;
    end loop;
end;
/


6.3.5: Goto statements
----------------------

Statements..

if condition then
goto label

statements..

<<label>>
Statements..


6.3.6: Execute immediate:
-------------------------

DROP SEQUENCE "ALBERT"."SEQ_UITGEVER";

DECLARE
maxID  NUMBER;
BEGIN
SELECT MAX(UitgeverID) into maxID from Uitgever;
execute immediate('CREATE SEQUENCE "ALBERT"."SEQ_UITGEVER" MINVALUE '|| (maxID+1));
END;
/


6.3.7: CASE Statement:
----------------------

A CASE expression selects a result from one or more alternatives, and returns the result. 
The CASE expression uses a selector, 
an expression whose value determines which alternative to return.

The selector is followed by one or more WHEN clauses, which are checked sequentially. 
The value of the selector determines which clause is executed. 
The first WHEN clause that matches the value of the selector determines the result value, 
and subsequent WHEN clauses are not evaluated. An example follows:


DECLARE
   grade CHAR(1) := 'B';
   appraisal VARCHAR2(20);
BEGIN
   appraisal := 
      CASE grade
         WHEN 'A' THEN 'Excellent'
         WHEN 'B' THEN 'Very Good'
         WHEN 'C' THEN 'Good'
         WHEN 'D' THEN 'Fair'
         WHEN 'F' THEN 'Poor'
         ELSE 'No such grade'
      END;
END;


DECLARE
   grade CHAR(1);
   appraisal VARCHAR2(20);
BEGIN
   ...
   appraisal := 
      CASE
         WHEN grade = 'A' THEN 'Excellent'
         WHEN grade = 'B' THEN 'Very Good'
         WHEN grade = 'C' THEN 'Good'
         WHEN grade = 'D' THEN 'Fair'
         WHEN grade = 'F' THEN 'Poor'
         ELSE 'No such grade'
      END;
   ...
END;


6.3.8. Some more examples:
--------------------------

-- --------------------------------------------------------

create or replace procedure ttn (x IN varchar2)
AS
y number;
z varchar2(32);
begin
y:=to_number(ltrim(x,'0'));
dbms_output.put_line(x);
dbms_output.put_line(y);
end;
/


-- --------------------------------------------------------

create table iftest
(
id int,
name varchar2(20));


SQL> insert into iftest values (1,'Jaap');

SQL> insert into iftest values (2,'Joop');

SQL> insert into iftest values (3,'Gerrit');

SQL> insert into iftest values (4,'Jannie');

SQL> insert into iftest values (5,'Marie');

SQL> insert into iftest values (6,'Klasie');

SQL> insert into iftest values (7,'Nadia');

SQL> insert into iftest values (8,'Miranda');


create or replace procedure abc
AS
i number;

cursor CUR IS
SELECT * from iftest;

cur_rec cur%rowtype;

begin

  for cur_rec IN cur loop

  dbms_output.put_line(cur_rec.id);

  if cur_rec.id in (3,4) then
     dbms_output.put_line('Speciaal !! :'||to_char(cur_rec.id));
  else
     exit;                                                        !! exit van de loop
  end if;

  end loop;
end;
/


Procedure created.

SQL> exec abc;
1

PL/SQL procedure successfully completed.

-- --------------------------------------------------------

create or replace procedure abc
AS
i number;

cursor CUR IS
SELECT * from iftest;

cur_rec cur%rowtype;

begin

  for cur_rec IN cur loop

  dbms_output.put_line(cur_rec.id);

  if cur_rec.id in (3,4) then
     dbms_output.put_line('Speciaal !! :'||to_char(cur_rec.id));
  else
     null;                                                        
  end if;

  end loop;
end;
/

Procedure created.

SQL> exec abc;
1
2
3
Speciaal !! :3
4
Speciaal !! :4
5
6
7
8

PL/SQL procedure successfully completed.

-------------------------------------------------------------

create or replace procedure ap
as

x varchar2(10);

cursor CUR_STR IS
SELECT 
 DATUMTIJD,UITGEVERNUMMER,AUTOMAATNUMMER,AFNEMERNUMMER,PASNUMMER,CHAUFFEURNUMMER,        
 ISOPAS,ISOAUTOMAAT,SYSTEEMKODE,Tanknummer,PRODUKTNUMMER,LITERS,AFLEVERBON,ORDERNUMMER,TRANSPORTNUMMER        
FROM BRAINS.IOB_KITAP_STAGING_STR;

cur_rec CUR_STR%rowtype;

begin

x:=ltrim(rtrim(cur_rec.PRODUKTNUMMER));

for cur_rec IN CUR_STR loop

dbms_output.put_line(cur_rec.datumtijd);
dbms_output.put_line(cur_rec.AUTOMAATNUMMER);
dbms_output.put_line('x :'||to_char(x));
dbms_output.put_line(cur_rec.produktnummer);


end loop;

end;
/

-------------------------------------------------------------


6.3.9. Handling null:
---------------------


declare
x number;
y number;

begin
x := 5;
y := NULL;

IF y is not null THEN
  IF x != y THEN  
    dbms_output.put_line(' x ! y ');
  END IF;
ELSE
    dbms_output.put_line(' y is null ');
END IF;
end;
/

y is null

PL/SQL procedure successfully completed.


declare
x number;
y number;

begin
x := 5;
y := NULL;

IF y is null THEN
  IF x ! y THEN
    dbms_output.put_line(' y is null ');
  END IF;
ELSE
    dbms_output.put_line(' y is not null ');
END IF;
end;
/

PL/SQL procedure successfully completed.


declare
x number;
y number;

begin
x := 5;
y := NULL;

IF y is null THEN
 
    dbms_output.put_line(' y is null ');

ELSE
    dbms_output.put_line(' y is not null ');
END IF;
end;
/

y is null

PL/SQL procedure successfully completed.


declare
x number;
y number;

begin
x := 5;
y := NULL;

IF y is not null THEN
 
    dbms_output.put_line(' y is not null ');

ELSE
    dbms_output.put_line(' y is null ');
END IF;
end;
/

y is null

PL/SQL procedure successfully completed.


x := 5;
y := NULL;
...
IF x != y THEN  -- yields NULL, not TRUE
   sequence_of_statements;  -- not executed
END IF;


so the IF condition yields NULL, the sequence of statements is bypassed.


=============================
7. Interaction with Database:
=============================


7.1: SQL in PL/SQL
==================

This is wrong:

SQL> declare
  2     -- nothing to declare
  3  begin
  4    SELECT empno, ename, job, sal
  5    FROM scott.emp;
  6  end;
  7  /
  SELECT empno, ename, job, sal
  *
ERROR at line 4:
ORA-06550: line 4, column 3:
PLS-00428: an INTO clause is expected in this SELECT statement


You need to follow these rules:

1. Declare Variables for output
2. SELECT .. INTO variables
2. WHERE clause


This is OK:

declare
my_empno number(4);
my_ename varchar2(10);
my_job   varchar2(9);
my_sal   number(7,2);

begin

  SELECT empno, ename, job, sal
  INTO my_empno, my_ename, my_job, my_sal
  FROM scott.emp WHERE empno=7844;

dbms_output.put_line(TO_CHAR(my_empno)||' '||my_ename);

end;
/


7.2 Declaring dynamic variables with %type
==========================================

in plaats van rekening te houden met datatype en formaat
van de columns, kunnen we dat dynamisch laten bepalen.

declare
my_empno scott.emp.empno%type;
my_ename scott.emp.ename%type;
my_job   scott.emp.job%type;
my_sal   scott.emp.sal%type;

begin
  SELECT empno, ename, job, sal
  INTO my_empno, my_ename, my_job, my_sal
  FROM scott.emp WHERE empno=7844;
  dbms_output.put_line(TO_CHAR(my_empno)||' '||my_ename);
end;
/


7.3 DML in plsql:
=================

insert, update en delete kunnen ook in plsql code worden opgenomen.

CREATE OR REPLACE PROCEDURE pro10 
IS
  my_empno scott.emp.empno%type;
  my_comm  scott.emp.comm%type;

begin
  SELECT empno, comm INTO my_empno, my_comm 
  FROM scott.emp WHERE empno=7844;
    dbms_output.put_line('Old commission'||TO_CHAR(my_comm));
  
  my_comm:=my_comm+1000;

  update scott.emp set comm=my_comm
  WHERE empno=my_empno;
    dbms_output.put_line('New commission'||TO_CHAR(my_comm));
end;
/

Take notice that there is no "declare" in the procedure definition, as is always true.

Some more examples:
-------------------

CREATE OR REPLACE PROCEDURE giveraise (dept_in IN NUMBER, raise_in IN NUMBER) IS
BEGIN
   UPDATE scott.emp
      SET sal = sal + raise_in
    WHERE deptno = dept_in;
END;
/

-- use of %type in parameter list:

CREATE OR REPLACE PROCEDURE add_job_history
(    p_emp_id          job_history.employee_id%type
   , p_start_date      job_history.start_date%type
   , p_end_date        job_history.end_date%type
   , p_job_id          job_history.job_id%type
   , p_department_id   job_history.department_id%type
   )
IS
BEGIN
  INSERT INTO job_history (employee_id, start_date, end_date,
                           job_id, department_id)
    VALUES(p_emp_id, p_start_date, p_end_date, p_job_id, p_department_id);
END add_job_history;


7.4 Transaction processing:
===========================

- SET TRANSACTION kan niet worden gebruikt in plsql.

  set transaction use rollback segment SEGMENT_NAME

- commit, rollback, savepoint kunnen wel worden gebruikt

- DBMS_TRANSACTION kan worden gebruikt


7.5 Composite Datataypes: records en tables:
============================================

We hebben als datatypes:

		- scalar datatypes (single value, geen interne componenenten: nummeric, varchar etc..)
                - composite types  (samengestelde typen zoals records, tables)
                - reference types


7.5.1 Records:
--------------

Tot nu toe hebben we vrij krampachtig eerst een aantal variabelen moeten 
declareren die row informatie opslaan, zoals in het volgende voorbeeld.

  declare
    my_empno emp.empno%type;
    my_ename emp.ename%type;
    my_job   emp.job%type;
    my_sal   emp.sal%type;

  begin
    SELECT empno, ename, job, sal
    INTO my_empno, my_ename, my_job, my_sal
    FROM emp WHERE empno=7844;
    dbms_output.put_line(TO_CHAR(my_empno)||' '||my_ename);
  end;
  /


Het is in plsql mogelijk om een NIEUW samengesteld DATATYPE te maken,
en op basis hiervan VARIABELEN te declareren.

declare
        -- EERST HET NIEUWE DATATYPE DECLAREREN

type t_emp is record (
     my_empno scott.emp.empno%type,
     my_ename scott.emp.ename%type,
     my_job   scott.emp.job%type,
     my_sal   scott.emp.sal%type );

        -- NU EEN VARIABELE VAN HET NIEUWE DATATYPE DECLAREREN

employee t_emp;

        -- EEN ASSIGNMENT VAN VALUES AAN t_emp

begin

  SELECT empno, ename, job, sal 
  INTO
       employee.my_empno, employee.my_ename, employee.my_job, employee.my_sal
  FROM scott.emp WHERE empno=7844;
    dbms_output.put_line(TO_CHAR(employee.my_empno)||' '||employee.my_ename);

end;
/


Wat betreft de assignment had het volgende ook gekunt:

  begin
    employee.my_empno:=7844;
    employee.my_ename:='TURNER';
    employee.my_job  :='SALESMAN';
    employee.my_sal  :=1500;
  end;


Nog een voorbeeld van records:

  DECLARE
     
     TYPE TimeRec IS RECORD (hours SMALLINT, minutes SMALLINT);
     
     TYPE MeetingTyp IS RECORD (
        date_held DATE,
        duration  TimeRec,  -- nested record
        location  VARCHAR2(20),
        purpose   VARCHAR2(50));


7.5.2 %rowtype:
---------------

Veelal is de assignment van values aan een variable van type record
een hoop werk.
Er bestaat voor bepaalde toepassingen een shortcut m.b.v. %rowtype
om een record variabele snel te vullen.

Dus als een record variabele alle colums moet hebben van een table row,
kun je de "tablename%rowtype" declaratie gebruiken.

Syntax:

<variable_name> <table_name>%rowtype;

Example 1:
----------

declare
employee scott.emp%rowtype;

begin
      SELECT * INTO employee FROM scott.emp WHERE empno=7844;
      dbms_output.put_line(TO_CHAR(employee.empno)||' '||employee.ename);
end;
/

Example 2:
----------

DECLARE
REC1 JD11.BOOK%ROWTYPE;
REC4 JD11.BOOK%ROWTYPE;

BEGIN
   SELECT * FROM JD11.BOOK INTO REC1 WHERE ISBN = 21;
END;


BEGIN
   REC4 := REC1;
   IF REC4.COST > 0 THEN
      REC4.SECTION_ID := 10;
   ELSE
      REC4.SECTION_ID := 7;
   END IF; 
END;


7.6 Procedures and in, out, in out parameters:
==============================================

You Asked (Jump to Tom's latest followup)

hello tom,

can you explain me what is the diference between variables in and out in pl/sql?

thanks a lot
razvan 

 
and we said...

An IN parameter can be read but not written to in plsql.  If I attempt to modify 
an IN parameter -- it will fail at compile time.  For example:

create or replace procedure p( x in number )
as
begin
   dbms_output.put_line( 'x = ' || x );
   x := 55;
end;
/

Warning: Procedure created with compilation errors.
ops$tkyte@8i> show err
Errors for PROCEDURE P:

LINE/COL ERROR
-------- -----------------------------------------------------
5/2      PLS-00363: expression 'X' cannot be used as an 
         assignment target
5/2      PL/SQL: Statement ignored


An IN OUT parameter can not only be READ but it can be WRITTEN to as well.  It 
retains whatever value it had before coming into the subroutine.  Consider:


create or replace procedure p( x in OUT number )
as
begin
  dbms_output.put_line( 'x = ' || x );
  x := 55;
  dbms_output.put_line( 'x = ' || x );
end;
/

Procedure created.

declare
y       number default 10;
begin
  p(y);
  dbms_output.put_line( 'y = ' || y );
end;
/

x = 10
x = 55
y = 55


PL/SQL procedure successfully completed.


So, the value 10 was passed in and the value 55 was written to it on the way 
out.  

An OUT parameter can be read and written however an OUT only parameter is 
always assigned NULL on the way into the routine. Consider:


create or replace procedure p( x OUT number )
as
begin
   dbms_output.put_line( 'x = ' || x );
   x := 55;
end;
/

Procedure created.

declare
y number default 10;
begin
    p(y);
    dbms_output.put_line( 'y = ' || y );
end;
/

x =
y = 55


PL/SQL procedure successfully completed.
on the way in, NULL -- not 10 -- was passed.  OUT parameters always default to 
NULL.


Here is another example showing that an OUT parameter is *always* modified -- 
even if we don't directly modify it in the routine:

ops$tkyte@8i> create or replace procedure p2( x OUT number )
  2  as
  3  begin
  4          dbms_output.put_line( 'x = ' || x );
  5          -- x := 55;  we do not assign 55
  6  end;
  7  /

Procedure created.

ops$tkyte@8i> declare
  2          y       number default 10;
  3  begin
  4          p2(y);
  5          dbms_output.put_line( 'y = ' || y );
  6  end;
  7  /
x =
y =

PL/SQL procedure successfully completed.

Notice how y is set to NULL, even though we made no assignments to it


It is interesting to note that the ability to READ an OUT parameter is new with 
7.3.  In prior releases you would have gotten the error:

LINE/COL ERROR
-------- ----------------------------------------------------
4/2      PL/SQL: Statement ignored
4/34     PLS-00365: 'X' is an OUT parameter and cannot be read


Please give more examples using copy and nocopy.
and also the example for inout.

Thx

 
Followup:  
Good point, COPY and NOCOPY (new with 8i) modify this behaviour somewhat.

Normally paramters are copied to the OUT values AFTER the succesful execution of 
a procedure, so for example when we compare the behaviour of a COPY and NOCOPY 
routine:

ops$tkyte@ORA8I.WORLD> create or replace procedure p1( x OUT number, y IN OUT 
number )
  2  as
  3  begin
  4          x := 55;
  5          y := 55;
  6          raise program_error;
  7  end;
  8  /

Procedure created.

ops$tkyte@ORA8I.WORLD> create or replace procedure p2( x OUT nocopy number, y IN 
OUT nocopy number )
  2  as
  3  begin
  4          x := 55;
  5          y := 55;
  6          raise program_error;
  7  end;
  8  /

Procedure created.

ops$tkyte@ORA8I.WORLD> 
ops$tkyte@ORA8I.WORLD> declare
  2          l_x number default 0;
  3          l_y number default 0;
  4  begin
  5          p1( l_x, l_y );
  6  exception
  7          when others then
  8                  dbms_output.put_line( 'x = ' || l_x );
  9                  dbms_output.put_line( 'y = ' || l_y );
 10  end;
 11  /
x = 0
y = 0

PL/SQL procedure successfully completed.

ops$tkyte@ORA8I.WORLD> 
ops$tkyte@ORA8I.WORLD> declare
  2          l_x number default 0;
  3          l_y number default 0;
  4  begin
  5          p2( l_x, l_y );
  6  exception
  7          when others then
  8                  dbms_output.put_line( 'x = ' || l_x );
  9                  dbms_output.put_line( 'y = ' || l_y );
 10  end;
 11  /
x = 55
y = 55

PL/SQL procedure successfully completed.

we see that x and y's values are different.  In the COPY routine -- p1 -- the 
values are COPIED to the out parameters upon successful completion.  In the 
nocopy routine, PLSQL is in effect sending a pointer to X and Y -- as soon as we 
modify them in the subroutine, their values are changed in the calling routine.  


So, that begs the question, why the heck would you want to do this?  The side 
effect seems to be not nice, whats the benefit?  Performance:


ops$tkyte@ORA8I.WORLD> create or replace procedure p3( x OUT dbms_sql.varchar2s 
)
  2  as
  3  begin
  4          for i in 1 .. 20000 loop
  5                  x(i) := rpad( '*', 255, '*' );
  6          end loop;
  7  end;
  8  /

Procedure created.

ops$tkyte@ORA8I.WORLD> create or replace procedure p4( x OUT NOCOPY 
dbms_sql.varchar2s )
  2  as
  3  begin
  4          for i in 1 .. 20000 loop
  5                  x(i) := rpad( '*', 255, '*' );
  6          end loop;
  7  end;
  8  /

Procedure created.

ops$tkyte@ORA8I.WORLD> 
ops$tkyte@ORA8I.WORLD> set timing on
ops$tkyte@ORA8I.WORLD> declare
  2          l_x dbms_sql.varchar2s;
  3  begin
  4          p3(l_x);
  5  end;
  6  /

PL/SQL procedure successfully completed.

Elapsed: 00:00:00.47
ops$tkyte@ORA8I.WORLD> declare
  2          l_x dbms_sql.varchar2s;
  3  begin
  4          p4(l_x);
  5  end;
  6  /

PL/SQL procedure successfully completed.
Elapsed: 00:00:00.35


as you can see, avoiding the copy of that much data can shave some runtime off 
our execution. You should consider NOCOPY for all large variables (tables of 
anything) if it doesn't hurt your logic to do so.
 

7.7 Use of a REF CURSOR in a procedure:
=======================================

create or replace package pkg_dept
AS
  type rc_dept is ref cursor;
end;
/

create or replace
    procedure sp_dept( t_deptno IN NUMBER,t_designation  IN NUMBER,
                    dept_cur in out pkg_dept.rc_dept )
   is
   begin
              
       OPEN dept_cur for
        select ename,salary,join_date
        from emp
        where deptno=t_deptno
        and designation=t_designation;
             
 end;
   /

-- From Sql*plus we can use it as

var c refcursor;
execute sp_dept(100,'CLERK',:c);
print c;

-- If you need to access the values, you would have to explicitly 
   fetch and print the results:

declare
   c pkg_dept.rc_dept;
   l_ename emp.ename%type;
   l_sal   emp.salary%type;
   l_join_date emp.join_date%type;
begin
   sp_dept( 100, 'CLERK', c );
   loop
       fetch c into l_ename, l_sal, l_join_date;
       exit when c%notfound;
        ....
   end loop;
   close c;


7.8 NO DATA FOUND en EXACT FETCH:
=================================

Let op de juiste vorm van select statements om een variabele te vullen:

Example:
--------

SQL> select * from AP;

        ID NAME
---------- ----------
         1 KLM
         2 KLM
         3 DSM
         4
         5
           gerrit
           Mira


  declare 
  x number;
  y varchar2(64);
  begin
    select id into x from ap where name='gerrit';
      if x=0 then
         dbms_output.put_line('niet gevonden'||to_char(x));
      else
         dbms_output.put_line('gevonden'||to_char(x));
      end if;
  end;
  /

gevonden

PL/SQL procedure successfully completed.

gerrit is er wel, maar "gerrit" heeft geen id, het is null.
Neem wel notitie van het feit dat x=0 is !!, en daar kun je verder
mee testen.


  declare 
  x number;
  y varchar2(64);
  begin
    select id into x from ap where name='GERRIT';
      if x=0 then
         dbms_output.put_line('niet gevonden'||to_char(x));
      else
         dbms_output.put_line('gevonden'||to_char(x));
      end if;
  end;
  /


*
ERROR at line 1:
ORA-01403: no data found
ORA-06512: at line 5

Dit is dus een error. 


How to deal with:

7.9 ORA-01422 exact fetch returns more than requested number of rows:
====================================================================


Hi Tom

I'm trying to execute SQL statment but the result is :
  
 ORA-01422       exact fetch returns more than requested number of rows  
  
 Cause:       More rows were returned from an exact fetch than specified.  
  Action:       Rewrite the query to return fewer rows or specify more rows in 
the exact fetch.  
  
  So How can I handle this proplem ?

Thank you 

 
and we said...

If you EXPECT the query to return more then one row, you would code:


   for x in ( select * from t where ... ) 
   loop
      -- process the X record here
   end loop;


If you expect the query to return AT LEAST one record and AT MOST one record, 
you would code:


   begin
       select * into ....
         from t where ....

       process....
   exception
       when NO_DATA_FOUND then
          error handling code when no record is found
       when TOO_MANY_ROWS then
          error handling code when too many records are found
   end;


7.10 Unknown number of arguments in a procedure:
===============================================

You Asked (Jump to Tom's latest followup)

In PL/SQL can you pass a unknown number of arguments to a procedure? 

 
and we said...

You would use a collection type -- either a VARRAY, NESTED TABLE or even a PLSQL 
Table type.  For example:


create type myArgType as table of varchar2(4000)
/

Type created.


create or replace procedure flexible( x in myArgType )
as
begin
       for i in 1 .. x.count loop
                dbms_output.put_line( x(i) );
       end loop;
end;
/


ops$tkyte@ORA8I.WORLD> exec flexible( myArgType( 'a', 'b', 'c' ) );
a
b
c

PL/SQL procedure successfully completed.


One  other 'trick' that works very well on Oracle8.0 and up is to use an object type 
that is a nested table type and a simple parse routine that returns this nested 
table type given a string input.  What I mean is best explained via an example:


create or replace type myTableType as table of number;
/
Type created.


ops$tkyte@8i> create or replace function str2tbl( p_str in varchar2 ) return 
myTableType
  2  as
  3      l_str   long default p_str || ',';
  4      l_n        number;
  5      l_data    myTableType := myTabletype();
  6  begin
  7      loop
  8          l_n := instr( l_str, ',' );
  9          exit when (nvl(l_n,0) = 0);
 10          l_data.extend;
 11          l_data( l_data.count ) := ltrim(rtrim(substr(l_str,1,l_n-1)));
 12          l_str := substr( l_str, l_n+1 );
 13      end loop;
 14      return l_data;
 15  end;
 16  /

Function created.

ops$tkyte@8i> 
ops$tkyte@8i> select * from all_users
  2  where user_id in ( select *
  3    from THE ( select cast( str2tbl( '1, 3, 5, 7, 99' ) as mytableType ) from 
dual ) )
  4  /

USERNAME                          USER_ID CREATED
------------------------------ ---------- ---------
SYSTEM                                  5 20-APR-99


===========
8. CURSORS:
===========


8.1 Implicit cursor:
====================

Alle SQL statements hebben een implicit cursor, wat een naam is
voor het memory address waar de results staan.

Implicit cursor naam: 

  sql

Implicit cursor attributes: 

  %notfound     : heeft het sql statement data verandert, of zijn we bij de laatste?
  %found        : heeft het sql statement data verandert
  %rowcount     : hoeveel rows zijn geprocessed door het sql statement
  %isopen       : is de cursor open


Example 1:
----------

declare 
  -- nothing to declare

begin

  delete FROM scott.emp WHERE ename='SEL';
  if sql%notfound then
     dbms_output.put_line('niet gevonden');
  end if;

end;
/


Example 2:
----------

CREATE TABLE EMP2
AS SELECT * FROM EMP;

DECLARE
ROW_DEL_NO NUMBER;
BEGIN 
   DELETE FROM EMP2;
   ROW_DEL_NO := SQL%ROWCOUNT;
   dbms_output.put_line(ROW_DEL_NO);
END; 

14
PL/SQL procedure successfully completed.


8.2 Explicit cursor:
====================


Twee varianten op de control van een cursor:

1. CURSOR FOR LOOP     struktuur (Gebruikt geen FETCH, dit zit impliciet in for..loop)
2. OPEN, FETCH, CLOSE  struktuur


Example opbouw 1:
-----------------

De onderstaande cursor wordt gebruikt in een CURSOR.. FOR .. LOOP constructie.


DECLARE cursor CUR IS
        SELECT empno, ename FROM SCOTT.EMP;

cur_rec cur%rowtype;

begin
  for cur_rec IN cur loop
  dbms_output.put_line(TO_CHAR(cur_rec.empno)||' '||cur_rec.ename);
  end loop;
end;
/

Bij deze constructie, de FOR.. CURSOR.. LOOP, wordt de cursor NIET EXPLICIET
geopend en gesloten.


TRIAL 1:

DECLARE cursor CUR IS
        SELECT id, longrecord FROM IMPSTATEMENTS2;

myvar1 number;
myvar2 varchar(254);

cur_rec cur%rowtype;

begin
  for cur_rec IN cur loop
  if cur_rec.longrecord like 'CI_%' then
  dbms_output.put_line(cur_rec.longrecord);
  end if;
  end loop;
end;
/


TRIAL 2:

DECLARE cursor CUR IS
        SELECT longrecord FROM z1;

myvar1 number;
myvar2 varchar(254);

cur_rec cur%rowtype;

begin
  for cur_rec IN cur loop
  insert into z3 values (cur_rec.longrecord);
  insert into z3 select * from z2;
  end loop;
end;
/


Example Opbouw 2:
-----------------

DECLARE cursor cur_emp IS
        SELECT empno, job, sal FROM scott.emp;

my_empno emp.empno%type;
my_job   emp.job%type;
my_sal   emp.sal%type;

begin

open cur_emp;

  loop
  fetch cur_emp into my_empno, my_job, my_sal;
  
  exit when cur_emp%notfound;
    
        if    my_job='CLERK' then my_sal:=mY_sal*1.2;
        elsif my_job='SALESMAN' then my_sal:=my_sal*1.5;
        elsif my_job='PRESIDENT' then my_sal:=my_sal*1.7;
        end if;

        update emp set sal=my_sal
        WHERE empno=my_empno;

  end loop;
  commit;

close cur_emp;
end;
/


Of beter als volgt m.b.v. %rowtype in plaats van aparte variabelen:


DECLARE cursor cur_emp IS
        SELECT * FROM emp; -- cur_emp is a sort of virtual table

employee cur_emp%rowtype;

begin

open cur_emp;

  loop
  fetch cur_emp into employee;
  
  exit when cur_emp%notfound;
    
        if    employee.job='CLERK' then employee.sal:=employee.sal*1.2;
        elsif employee.job='SALESMAN' then employee.sal:=employee.sal*1.5;
        elsif employee.job='PRESIDENT' then employee.sal:=employee.sal*1.7;
        end if;

        update emp set sal=employee.sal
        WHERE empno=employee.empno;

  end loop;
  commit;

close cur_emp;
end;
/


Example:

DECLARE cursor cur_emp IS
        SELECT table_name, num_rows from user_tables;

my_var cur_emp%rowtype;
i number;

begin
open cur_emp;

  loop
  fetch cur_emp into my_var;
  
  exit when cur_emp%notfound;

  dbms_output.put_line(to_char(my_var.table_name)||'  Number of rows:'||to_char(my_var.num_rows));
  end loop;
  
close cur_emp;
end;
/

Example:

DECLARE cursor cur_emp IS
        SELECT table_name, num_rows from user_tables;

my_var cur_emp%rowtype;
i number;

begin
open cur_emp;

  loop
  fetch cur_emp into my_var;
  
  exit when cur_emp%notfound;

  dbms_output.put_line(my_var.table_name);
  end loop;
  
close cur_emp;
end;
/


8.3 Passing parameters to an explicit cursor:
=============================================

DECLARE cursor cur_emp(low_empno in number, high_empno in number) IS
        SELECT * FROM emp WHERE empno>low_empno and empno<high_empno;

employee cur_emp%rowtype;

begin

open cur_emp(7600,7700);

  loop
  fetch cur_emp into employee;
  
  exit when cur_emp%notfound;
    
        if    employee.job='CLERK' then employee.sal:=employee.sal*1.2;
        elsif employee.job='SALESMAN' then employee.sal:=employee.sal*1.5;
        elsif employee.job='PRESIDENT' then employee.sal:=employee.sal*1.7;
        end if;

        update emp set sal=employee.sal
        WHERE empno=employee.empno;

  end loop;
  commit;

close cur_emp;
end;
/


8.4 For update and WHERE current of clauses:
============================================


For update:
-----------

We can expand the cursor declaration with the "for update" clause.
The associated rows will then be locked in advance.


declare cursor cur is
         SELECT * FROM emp for update;

my_emp cur%rowtype;

begin

    for my_emp in cur loop

        if    my_emp.job='CLERK' then my_emp.sal:=my_emp.sal*1.2;
        elsif my_emp.job='SALESMAN' then my_emp.sal:=my_emp.sal*1.5;
        elsif my_emp.job='PRESIDENT' then my_emp.sal:=my_emp.sal*1.7;
        end if;

        update emp set sal=my_emp.sal
        WHERE empno=my_emp.empno;
    
    end loop;
end;
/


WHERE current of:
-----------------

Wanneer de "for update" clause in de cursor declaratie is toegepast,
is ook de "WHERE current of" clause mogelijk.
Je kunt dan cursor elements direkt benaderen.


declare cursor cur is
         SELECT * FROM emp for update;

my_emp cur%rowtype;

begin

    for my_emp in cur loop

        if    my_emp.job='CLERK' then my_emp.sal:=my_emp.sal*1.2;
        elsif my_emp.job='SALESMAN' then my_emp.sal:=my_emp.sal*1.5;
        elsif my_emp.job='PRESIDENT' then my_emp.sal:=my_emp.sal*1.7;
        end if;

        update emp set sal=my_emp.sal
        WHERE current of cur;           -- dit is het direkt benaderen
    
    end loop;
end;
/


8.5 Subqueries in cursor declaratie:
====================================

Je kunt ook subqueries gebruiken in de cursor declaratie.

declare
        cursor cur is
        SELECT empno, ename FROM emp
        WHERE deptno in (SELECT deptno FROM dept WHERE loc<>'CHICAGO');


declare 
        cursor cur is
        SELECT t1.deptno, dname, "STAFF"
        FROM dept t1, (SELECT deptno, count(*) "STAFF"
                       FROM emp group by deptno) t2
        WHERE t1.deptno=t2.deptno and "STAFF">=5;


8.6 Cursor within a cursor:
===========================

SQL> select * from emp2;

     EMPNO ENAME      JOB              MGR HIREDATE         SAL       COMM     DEPTNO
---------- ---------- --------- ---------- --------- ---------- ---------- ----------
      7369 SMITH      CLERK           7902 17-DEC-80        800        100         20
      7499 ALLEN      SALESMAN        7698 20-FEB-81       1600        300         30
      7521 WARD       SALESMAN        7698 22-FEB-81       1250        500         30
      7566 JONES      MANAGER         7839 02-APR-81       2975                    20
      7654 MARTIN     SALESMAN        7698 28-SEP-81       1250       1400         30
      7698 BLAKE      MANAGER         7839 01-MAY-81       2850                    30
      7782 CLARK      MANAGER         7839 09-JUN-81       2450                    10
      7788 SCOTT      ANALYST         7566 19-APR-87       3000                    20
      7839 KING       PRESIDENT            17-NOV-81       5000                    10
      7844 TURNER     SALESMAN        7698 08-SEP-81       1500          0         30
      7876 ADAMS      CLERK           7788 23-MAY-87       1100                    20
      7900 JAMES      CLERK           7698 03-DEC-81        950                    30
      7902 FORD       ANALYST         7566 03-DEC-81       3000                    20
      7934 MILLER     CLERK           7782 23-JAN-82       1300                    10

14 rows selected.

SQL> select * from dept2;

    DEPTNO DNAME          LOC
---------- -------------- -------------
        10 ACCOUNTING     NEW YORK
        20 RESEARCH       DALLAS
        30 SALES          CHICAGO
        40 OPERATIONS     BOSTON


CREATE OR REPLACE PROCEDURE doublecursor
as

dept_no number;

cursor CUR1 IS
      SELECT DEPTNO, DNAME, LOC
FROM DEPT2;

cur_rec1 CUR1%rowtype;

begin
  
  for cur_rec1 IN CUR1 loop

    DBMS_OUTPUT.PUT_LINE('HET DEPTNO IN LOOP1= '||cur_rec1.deptno);
 
    declare
    cursor CUR2 IS
       select ename, job, deptno from emp2
       where deptno=cur_rec1.deptno;
    
    cur_rec2 CUR2%rowtype;
    begin

     for cur_rec2 in CUR2 loop

        dbms_output.put_line(cur_rec2.ename||' '||cur_rec2.job||' '||cur_rec2.deptno);

     end loop;
   end;

end loop;

end;
/

SQL> exec curdouble;

HET DEPTNO IN LOOP1= 10

CLARK MANAGER 10
KING PRESIDENT 10
MILLER CLERK 10

HET DEPTNO IN LOOP1= 20

SMITH CLERK 20
JONES MANAGER 20
SCOTT ANALYST 20
ADAMS CLERK 20
FORD ANALYST 20

HET DEPTNO IN LOOP1= 30

ALLEN SALESMAN 30
WARD SALESMAN 30
MARTIN SALESMAN 30
BLAKE MANAGER 30
TURNER SALESMAN 30
JAMES CLERK 30

HET DEPTNO IN LOOP1= 40


8.7 Cursors and performance:
============================

Another cause of poor performance is inefficient SQL statements. Because SQL is so flexible, 
you can get the same result with two different statements, but one statement might be less efficient. 
For example, the following two SELECT statements return the same rows 
(the name and number of every department having at least one employee): 

EXEC SQL SELECT dname, deptno 
    FROM scott.dept 
    WHERE deptno IN (SELECT deptno FROM scott.emp); 

EXEC SQL SELECT dname, deptno 
    FROM scott.dept 
    WHERE EXISTS 
    (SELECT deptno FROM scott.emp WHERE dept.deptno = emp.deptno); 

However, the first statement is slower because it does a time-consuming full scan of the EMP table 
for every department number in the DEPT table. Even if the DEPTNO column in EMP is indexed, 
the index is not used because the subquery lacks a WHERE clause naming DEPTNO. 


8.7 Bind Variables:
===================

Bind variables are used in SQL and PL/SQL statements for holding data or result sets. They are commonly 
used in SQL statements to optimize statement performance. A statement with a bind variable 
may be re-executed multiple times without needing to be re-parsed. Their values can be set and referenced 
in PL/SQL blocks. They can be referenced in SQL statements e.g. SELECT. 
Except in the VARIABLE and PRINT commands, bind variable references should be prefixed with a colon.

Bind variables are created with the VARIABLE command. The following PL/SQL block sets a bind variable:

variable bv number
begin
  :bv := 8;
end;
/

    PL/SQL procedure successfully completed.

Once a value is set, you can show it with the PRINT command.

print bv

       BV
 ----------
       8

Numeric bind variables can be used in the EXIT command to return a value to the operating system:

    SQL> EXIT :bv

Other SQL*Plus commands do not recognize bind variables.
There is no way to undefine or delete a bind variable in a SQL*Plus session. 
However, bind variables are not remembered when you exit SQL*Plus.


============================
9. USE OF DBMS_SQL in plsql:
============================


The DBMS_SQL package offers access to dynamic SQL and dynamic PL/SQL from within PL/SQL programs. 
"Dynamic" means that the SQL statements you execute with this package are not 
prewritten into your programs. They are, instead, constructed at runtime 
as character strings and then passed to the SQL engine for execution

Truly dynamic SQL occurs when you literally construct the SQL statement from 
runtime variable values. This is shown in the next example. 
The create_index procedure creates an index where the name of the index, the name of the table, 
and the column on which the index is to be created are passed as parameters to the procedure. 
This action would be impossible without DBMS_SQL for two reasons: this is a DDL call and the 
SQL statement isn't known until the procedure is called.

Example 1:
----------


CREATE OR REPLACE PROCEDURE create_index 
   (index_in IN VARCHAR2, table_in IN VARCHAR2, column_in IN VARCHAR2)
IS
   cursor_handle INTEGER;
   feedback      INTEGER;
BEGIN
   /* Create a cursor to use for the dynamic SQL */
   cursor_handle := DBMS_SQL.OPEN_CURSOR;            -- there is allways a cursor involved with a SQL statement 

   /* Construct the SQL statement and parse it in native mode. */
   DBMS_SQL.PARSE 
      (cursor_handle,
       'CREATE INDEX ' || index_in || ' ON ' || table_in ||
          '( ' || column_in || ')',
       DBMS_SQL.NATIVE);

   /* You should always execute your DDL! */
   feedback := DBMS_SQL.EXECUTE (cursor_handle);

   DBMS_SQL.CLOSE_CURSOR (cursor_handle);
END create_index;
/


Example 2:
----------

PROCEDURE giveraise (dept_in IN INTEGER, raise_in IN NUMBER) 
IS
   cursor_handle INTEGER;
   emps_updated  INTEGER;
BEGIN
   /* Create a cursor to use for the dynamic SQL */
   cursor_handle := DBMS_SQL.OPEN_CURSOR;
   /* 
   || Construct the SQL statement and parse it in Version 7 mode.
   || Notice that the statement includes two bind variables; these
   || are "placeholders" in the SQL statement.
   */
   DBMS_SQL.PARSE 
      (cursor_handle,
       'UPDATE employee SET salary = salary + :raise_amount ' ||
          'WHERE department_id = :dept', 
       DBMS_SQL.V7);

   /* Now I must supply values for the bind variables */
   DBMS_SQL.BIND_VARIABLE (cursor_handle, 'raise_amount', raise_in);
   DBMS_SQL.BIND_VARIABLE (cursor_handle, 'dept', dept_in);

   /* Execute the SQL statement */
   emps_updated := DBMS_SQL.EXECUTE (cursor_handle);

   /* Close the cursor */
   DBMS_SQL.CLOSE_CURSOR (cursor_handle);
EXCEPTION
   WHEN OTHERS 
   THEN
      /* Clean up on failure too. */
      DBMS_SQL.CLOSE_CURSOR (cursor_handle);
END;


The following procedures are defined in the DBMS_SQL Package. 

BIND_ARRAY Binds a specific value to a host array (PL/SQL8 only).
BIND_VARIABLE Binds a specific value to a host variable.
CLOSE_CURSOR Closes the cursor.
COLUMN_VALUE Retrieves a value from the cursor into a local variable.
COLUMN_VALUE_LONG Retrieves a selected part of a LONG value from a cursor's column defined with DEFINE_COLUMN_LONG.
DEFINE_ARRAY Defines an array to be selected from the specified cursor (PL/SQL8 only).
DEFINE_COLUMN Defines a column to be selected from the specified cursor.
DEFINE_COLUMN_LONG Defines a LONG column to be selected from the specified cursor.
DESCRIBE_COLUMNS Describes the columns for a dynamic cursor (PL/SQL8 only).
EXECUTE Executes the cursor.
EXECUTE_AND_FETCH Executes the cursor and fetches its row(s).
FETCH_ROWS Fetches the row(s) from the cursor.
IS_OPEN Returns TRUE if the cursor is open.
LAST_ERROR_POSITION Returns the byte offset in the SQL statement where the error occurred.
LAST_ROW_COUNT Returns the total number of rows fetched from the cursor.
LAST_ROW_ID Returns the ROWID of the last row fetched from the cursor.
LAST_SQL_FUNCTION_CODE Returns the SQL function code for the SQL statement.
OPEN_CURSOR Opens the cursor.
PARSE Parses the specified SQL statement. If the statement is a DDL statement, then the parse also executes the statement.
VARIABLE_VALUE Gets a value of a variable in a cursor.


===================
10. Error handling:
===================


predefined exceptions  :  exception only
internal exception     :  declaration and exception
user defined exception :  declaration, code, and exception


10.1 predefined execptions:
===========================

Deze vorm van error handling associeert een beperkt aantal ORA error's
met een beperkte lijst van named exceptions.


declare 
        my_var number;
begin
  SELECT empno into my_var
  FROM scott.emp WHERE ename='SMITH';
    dbms_output.put_line('We have smith');
    dbms_output.put_line(TO_CHAR(my_var));
                                           -- test of tweede block wordt uitgevoerd
      begin
         dbms_output.put_line('tweede block');
      end;
end;
/

Maar als SMITH niet gevonden wordt: ORA-ERROR message no data found
en het tweede block wordt NIET uitgevoerd.

ERROR at line 1:
ORA-01403: no data found
ORA-06512: at line 4


declare 
        my_var number;
begin
  SELECT empno into my_var
  FROM scott.emp WHERE ename='AALBERG';
    dbms_output.put_line('We have AALBERG');
    dbms_output.put_line(TO_CHAR(my_var));
                                           -- test of tweede block wordt uitgevoerd                          
      begin
         dbms_output.put_line('tweede block');
      end;

exception

  when no_data_found 
  then                  -- de predefined exception
  dbms_output.put_line('record not found');

end;
/

record not found
PL/SQL procedure successfully completed.

The error is dealt with, but the second block is not executed.


The following predefined exceptions exists:

  no_data_found
  too_many_rows
  dup_val_on_index
  row_type_mismatch
  others  etc..


10.2 Internal exceptions:
=========================

Je kunt iedere ORA error afvangen met de internal exception methode.
Stel we inserten een record in emp waarbij de primary key al voorkomt


declare
  cons_violate exception;
  pragma exception_init(cons_violate, -0001);
begin
  insert into scott.emp (empno, ename, job)
  values (7844, 'TURNER', 'SALESMAN');

exception
  when cons_violate then
  dbms_output.put_line('record reeds aanwezig');
end;
/


10.3 User defined exceptions:
=============================

Je kunt iedere error afvangen. Dit hoeft niet overeen te komen met
een ORA error, zoals bij de vorige twee exceptions wel het geval was.

Bij de voorgaande twee exceptions deed Oracle het RAISE werk. 
Nu moeten we zelf voorzieningen treffen.

exception declaration
exception testing
exception handling


DECLARE cursor cur_emp IS
        SELECT * FROM emp WHERE job='SALESMAN';

bad_ename emp.ename%type;

employee cur_emp%rowtype;

comm_is_null exception;

begin
    for employee in cur_emp loop
    bad_ename:=employee.ename;
    if employee.comm<>0 then 
       dbms_output.put_line(employee.ename||' is ok');
    else
       raise comm_is_null; 
    end if;
    end loop;

exception
   when comm_is_null then
   dbms_output.put_line('comm is nul for salesman '||bad_ename);   
   when others then
   dbms_output.put_line('other exception occurred');   
end;
/


Example on scope of an exception:
---------------------------------


When you declare an exception in a block, it is local to that block, but global to all the blocks 
which are enclosed by that block (nested blocks). 
In the version of check_account shown in the following example, 
the procedure contains an anonymous subblock which also raises the overdue_balance. 
Because the subblock is enclosed by the procedure block, PL/SQL can resolve the reference to that exception: 

PROCEDURE check_account (company_id_in IN NUMBER)
IS
   overdue_balance EXCEPTION;
BEGIN
   ... executable statements ...

   -- Start of sub-block inside check_account
   BEGIN
      ... statements within sub-block ...
      RAISE overdue_balance;  -- Exception raised in sub-block.
   END;
   -- End of sub-block inside check_account

   LOOP
      ...
      IF ... THEN
         RAISE overdue_balance; -- Exception raised in main block.
      END IF;
   END LOOP;

EXCEPTION
   WHEN overdue_balance THEN ... -- Exception handled in main block.
END;


10.4 PK, FK violations and TOO_MANY_ROWS :
==========================================


you can also insert your errors into some log error table calling SQLCODE and SQLERRM, or print it
DBMS_OUTPUT.PUT_LINE (SQLCODE||','||SQLERRM)

Example:
--------

CREATE OR REPLACE PROCEDURE test
AS


tankid               number;
tanknummer            number;
naam             varchar(10);
cnt_tank_nummer       number;
soort_id                 number;
cnt_tran_succes       number;
cnt_tran_error        number;

-- exceptions

pk_cons_violate exception;
pragma exception_init(pk_cons_violate, -00001);

fk_cons_violate exception;
pragma exception_init(fk_cons_violate, -02291);

too_many exception;
pragma exception_init(too_many, -01422);


cursor CUR_STR IS
SELECT * from tankje;
 

cur_rec CUR_STR%rowtype;

BEGIN

cnt_tran_succes :=0;
cnt_tran_error  :=0;

for cur_rec IN CUR_STR loop

  BEGIN


      SELECT soortbrandstofid INTO soort_id FROM soort
      WHERE tankid=cur_rec.tankid;

      dbms_output.put_line('de soort is: '||to_char(soort_id));

           cnt_tran_succes:=cnt_tran_succes+1;

  EXCEPTION
    WHEN pk_cons_violate THEN
         cnt_tran_error:=cnt_tran_error+1;
         dbms_output.put_line('PK constraint violation.');
    
    WHEN fk_cons_violate THEN
         cnt_tran_error:=cnt_tran_error+1;
         dbms_output.put_line('FK constraint violation.');
    
    WHEN too_many THEN
         dbms_output.put_line('Meer dan 1 waarde');
         cnt_tran_error:=cnt_tran_error+1;
         
    WHEN no_data_found THEN
         dbms_output.put_line('Geen waarde gevonden.');
         cnt_tran_error:=cnt_tran_error+1;
    
    WHEN OTHERS THEN  
         cnt_tran_error:=cnt_tran_error+1;
         
  
    END;

end loop;


dbms_output.put_line('cnt_tran_error :'||to_char(cnt_tran_error));
dbms_output.put_line('cnt_tran_succes :'||to_char(cnt_tran_succes));

EXCEPTION
 
      WHEN OTHERS THEN  
           RAISE;
END;
/


Example 2:
----------

I have written a cursor as below. Each time I go through a row I want to find a phone number 
based on a name column in the table from the cursor. 
I think there is a problem with using SELECT .. INTO

I am getting "ERROR at line 1: 
ORA-01422: exact fetch returns more than requested number of rows 
ORA-06512: at line 14 
"

Any helps appreciated!

set serveroutput on size 100000;

DECLARE 
  phoneNum VARCHAR2(40);
  CURSOR td_cur IS
    SELECT *
    FROM data;
  
BEGIN
  FOR td_rec IN td_cur
  LOOP
    
    SELECT phone INTO phoneNum
    FROM tb_phone
    WHERE locality = td_rec.name;
  
  END LOOP;
END;

Answer:

You have more than one records for some name.  If you want to avoid duplicates, try this

SELECT phone INTO phoneNum
    FROM tb_phone
    WHERE locality = td_rec.name
      and rownum < 2;

SELECT phone INTO phoneNum
    FROM tb_phone
    WHERE locality = td_rec.name and rownum=1;

BEGIN
  FOR td_rec IN td_cur
  LOOP
   SELECT phone INTO phoneNum
    FROM tb_phone
    WHERE 
    rowid in  (select min(rowid) from tb_phone group by locality)
    AND locality=td_rec.name; 
  END LOOP;
END;


10.5 further tests on execptions:
=================================

test 1:
-------

-- hoe gaat plsql om met het feit dat er wel een exception
-- is gedeclareerd maar er is geen exception op de raise
-- levert dus gewoon een fout op: 
-- ORA-06510: PL/SQL: unhandled user-defined exception


DECLARE cursor cur_emp IS
        SELECT * FROM scott.emp WHERE job='SALESMAN';

bad_ename scott.emp.ename%type;

employee cur_emp%rowtype;

comm_is_null exception;

begin
    for employee in cur_emp loop
    bad_ename:=employee.ename;
    if employee.comm<>0 then 
       dbms_output.put_line(employee.ename||' is ok');
    else
       raise comm_is_null; 
    end if;
    end loop;

--exception
--   when comm_is_null then
--   dbms_output.put_line('comm is nul for salesman '||bad_ename);   
--   when others then
--   dbms_output.put_line('other exception occurred');   
end;
/

ERROR at line 1:
ORA-06510: PL/SQL: unhandled user-defined exception
ORA-06512: at line 16


test 2:
-------

-- hoe gaat plsql om met het feit dat er wel een exception
-- is gedeclareerd maar er is geen raise
-- dit loopt gewoon door


DECLARE cursor cur_emp IS
        SELECT * FROM emp WHERE job='SALESMAN';

bad_ename emp.ename%type;

employee cur_emp%rowtype;

comm_is_null exception;

begin
    for employee in cur_emp loop
    bad_ename:=employee.ename;
    if employee.comm<>0 then 
       dbms_output.put_line(employee.ename||' is ok');
    -- else
    --   raise comm_is_null; 
    end if;
    end loop;

exception
   when comm_is_null then
   dbms_output.put_line('comm is nul for salesman '||bad_ename);   
   when others then
   dbms_output.put_line('other exception occurred');   
end;
/

ALLEN is ok
WARD is ok
MARTIN is ok

PL/SQL procedure successfully completed.

test 3:
-------

-- zelfde procedure, de raise is nu niet uitgecommentarieerd
-- de loop wordt geheel doorgewerkt.

DECLARE cursor cur_emp IS
        SELECT * FROM emp WHERE job='SALESMAN';

bad_ename emp.ename%type;

employee cur_emp%rowtype;

comm_is_null exception;

begin
    for employee in cur_emp loop
    bad_ename:=employee.ename;
    if employee.comm<>0 then 
       dbms_output.put_line(employee.ename||' is ok');
    else
       raise comm_is_null; 
    end if;
    end loop;

exception
   when comm_is_null then
   dbms_output.put_line('comm is nul for salesman '||bad_ename);   
   when others then
   dbms_output.put_line('other exception occurred');   
end;
/

ALLEN is ok
WARD is ok
MARTIN is ok
comm is nul for salesman TURNER

PL/SQL procedure successfully completed.

     EMPNO ENAME      JOB              MGR HIREDATE         SAL       COMM     DEPTNO
---------- ---------- --------- ---------- --------- ---------- ---------- ----------
      7369 SMITH      CLERK           7902 17-DEC-80        800        100         20
      7499 ALLEN      SALESMAN        7698 20-FEB-81       1600        300         30
      7521 WARD       SALESMAN        7698 22-FEB-81       1250        500         30
      7566 JONES      MANAGER         7839 02-APR-81       2975                    20
      7654 MARTIN     SALESMAN        7698 28-SEP-81       1250       1400         30
      7698 BLAKE      MANAGER         7839 01-MAY-81       2850                    30
      7782 CLARK      MANAGER         7839 09-JUN-81       2450                    10
      7788 SCOTT      ANALYST         7566 19-APR-87       3000                    20
      7839 KING       PRESIDENT            17-NOV-81       5000                    10
      7844 TURNER     SALESMAN        7698 08-SEP-81       1500          0         30
      7876 ADAMS      CLERK           7788 23-MAY-87       1100                    20
      7900 JAMES      CLERK           7698 03-DEC-81        950                    30
      7902 FORD       ANALYST         7566 03-DEC-81       3000                    20
      7934 MILLER     CLERK           7782 23-JAN-82       1300                    10

14 rows selected.

Je ziet dat de loop geheel is doorgewerkt (14 keer)


test 3:
-------


-- test of we uit de loop springen als de exception waar is
-- de loop wordt inderdaad afgebroken

declare cursor cur is 
        select * from emp;

employee cur%rowtype;
badrec varchar2(20);

begin
      for employee in cur loop
          dbms_output.put_line(employee.ename);

      end loop;
end;
/

SMITH
ALLEN
WARD
JONES
MARTIN
BLAKE
CLARK
SCOTT
KING
TURNER
ADAMS
JAMES
FORD
MILLER

PL/SQL procedure successfully completed.


declare cursor cur is 
        select * from emp;

employee cur%rowtype;
badrec varchar2(20);
comm_is_null exception;

begin
      for employee in cur loop
          dbms_output.put_line(employee.ename);
          if employee.comm is null then
             raise comm_is_null;
          end if;
      end loop;

exception
      when comm_is_null then
      dbms_output.put_line(employee.ename||' heeft null commssion.');


end;
/

SMITH
ALLEN
WARD
JONES
heeft null commssion.

PL/SQL procedure successfully completed.


============
11. Packages:
============


11.1 Structure package definition en package body:
==================================================

Example 1:
----------


CREATE OR REPLACE PACKAGE cust_actions AS  -- package specification

   PROCEDURE newcustomer (custid in NUMBER, custname in VARCHAR);
   PROCEDURE delcustomer (cust in NUMBER) ;

END cust_actions;
/


CREATE OR REPLACE PACKAGE BODY cust_actions AS  -- package body

PROCEDURE newcustomer (custid IN NUMBER, custname IN VARCHAR) 
IS
BEGIN
  INSERT INTO customers values (custid,custname);
  commit;
END;

PROCEDURE delcustomer (cust IN NUMBER)
IS
BEGIN
  delete from customers where custid=cust;
  commit;
END;

END cust_actions;
/


Example 2:
----------

CREATE OR REPLACE PACKAGE cust_actions AS  -- package specification

   PROCEDURE newcustomer (custid in NUMBER, custname in VARCHAR);
   PROCEDURE delcustomer (cust in NUMBER) ;
   PROCEDURE insert_customer (id in NUMBER, name in VARCHAR);

END cust_actions;
/


CREATE OR REPLACE PACKAGE BODY cust_actions AS  -- package body

PROCEDURE newcustomer (custid IN NUMBER, custname IN VARCHAR) 
IS
BEGIN
cust_actions.insert_customer(id => custid, name => custname);
END;

PROCEDURE delcustomer (cust IN NUMBER)
IS
BEGIN
  delete from customers where custid=cust;
  commit;
END;

PROCEDURE insert_customer(id IN NUMBER, name IN VARCHAR) 
IS
BEGIN
  INSERT INTO customers values (id,name);
  commit;
END;

END cust_actions;
/


Example 3:
----------

A package is a set of related functions and / or routines. 
Packages are used to group together PL/SQL code blocks which make up a common application 
or are attached to a single business function. Packages consist of a specification and a body. 

The package specification lists the public interfaces to the blocks within the package body. 

The package body contains the public and private PL/SQL blocks which make up the application, 
private blocks are not defined in the package specification and cannot be called by any routine
other than one defined within the package body. 
The benefits of packages are that they improve the organisation of procedure 
and function blocks, allow you to update the blocks that make up the package body 
without affecting the specification (which is the object that users have rights to) 
and allow you to grant execute rights once instead of for each and every block.

To create a package specification we use a variation on the CREATE command, 
all we need put in the specification is each PL/SQL block header that will 
be public within the package. An example follows :-


CREATE OR REPLACE PACKAGE MYPACK1 AS
PROCEDURE MYPROC1 (REQISBN IN NUMBER, MYVAR1 IN OUT CHAR,TCOST OUT NUMBER);
FUNCTION MYFUNC1;
END MYPACK1;


To create a package body we now specify each PL/SQL block that makes up the package, 
note that we are not creating these blocks separately (no CREATE OR REPLACE is 
required for the procedure and function definitions). An example follows :-

CREATE OR REPLACE PACKAGE BODY MYPACK1 AS

PROCEDURE MYPROC1
(REQISBN IN NUMBER,MYVAR1 IN OUT CHAR,TCOST OUT NUMBER)

TEMP_COST NUMBER(10,2))
IS BEGIN
   SELECT COST FROM JD11.BOOK INTO TEMP_COST WHERE ISBN = REQISBN;
   IF TEMP_COST > 0 THEN
      UPDATE JD11.BOOK SET COST = (TEMP_COST*1.175) WHERE ISBN = REQISBN;
   ELSE 
      UPDATE JD11.BOOK SET COST = 21.32 WHERE ISBN = REQISBN;
   END IF; 
   TCOST := TEMP_COST;
   COMMIT;
EXCEPTION
   WHEN NO_DATA_FOUND THEN
      INSERT INTO JD11.ERRORS (CODE, MESSAGE) VALUES(99, 'ISBN NOT FOUND');
END MYPROC1;

FUNCTION MYFUNC1
RETURN NUMBER
IS 
RCOST NUMBER(10,2); 
BEGIN
   SELECT COST FROM JD11.BOOK INTO RCOST WHERE ISBN = 21;
   RETURN (RCOST);
END MYFUNC1;
END MYPACK1;

You can execute a public package block like this :-
EXECUTE :PCOST := JD11.MYPACK1.MYFUNC1 

WHERE JD11 is the schema name that owns the package. 
You can use DROP PACKAGE and DROP PACKAGE BODY to remove the package objects FROM the database.

Example 4:
----------

CREATE OR REPLACE PACKAGE schema.package

CREATE PACKAGE emp_mgmt AS
   FUNCTION hire (last_name VARCHAR2, job_id VARCHAR2,
 manager_id NUMBER, salary NUMBER, 
 commission_pct NUMBER, department_id NUMBER)
 RETURN NUMBER;
 FUNCTION create_dept(department_id NUMBER, location NUMBER)
 RETURN NUMBER;
 PROCEDURE remove_emp(employee_id NUMBER);
 PROCEDURE remove_dept(department_id NUMBER);
 PROCEDURE increase_sal(employee_id NUMBER, salary_incr NUMBER);
 PROCEDURE increase_comm(employee_id NUMBER, comm_incr NUMBER);
 no_comm EXCEPTION;
 no_sal EXCEPTION;
END emp_mgmt;
/

Before you can call this package's procedures and functions, 
you must define these procedures and functions in the package body. 

Example 5:
----------

CREATE PACKAGE employee_management AS
   FUNCTION hire_emp (name VARCHAR2, job VARCHAR2,
      mgr NUMBER, hiredate DATE, sal NUMBER, comm NUMBER,
      deptno NUMBER) RETURN NUMBER;
   PROCEDURE fire_emp (emp_id NUMBER);
   PROCEDURE sal_raise (emp_id NUMBER, sal_incr NUMBER);
END employee_management;


CREATE PACKAGE BODY employee_management AS

   FUNCTION hire_emp (name VARCHAR2, job VARCHAR2,
      mgr NUMBER, hiredate DATE, sal NUMBER, comm NUMBER,
      deptno NUMBER) RETURN NUMBER IS
 
      new_empno    NUMBER(10);

   BEGIN
      SELECT emp_sequence.NEXTVAL INTO new_empno FROM dual;
      INSERT INTO emp VALUES (new_empno, name, job, mgr,
         hiredate, sal, comm, deptno);
      RETURN (new_empno);
   END hire_emp;


   PROCEDURE fire_emp(emp_id IN NUMBER) AS
   BEGIN
      DELETE FROM emp WHERE empno = emp_id;
      IF SQL%NOTFOUND THEN
      raise_application_error(-20011, 'Invalid Employee
         Number: ' || TO_CHAR(emp_id));
   END IF;
   END fire_emp;

 
  PROCEDURE sal_raise (emp_id IN NUMBER, sal_incr IN NUMBER) AS
   BEGIN

   -- If employee exists, then update salary with increase.
   
      UPDATE emp
         SET sal = sal + sal_incr
         WHERE empno = emp_id;
      IF SQL%NOTFOUND THEN
         raise_application_error(-20011, 'Invalid Employee
            Number: ' || TO_CHAR(emp_id));
      END IF;
   END sal_raise;
   END employee_management;


NOTE: SUBPROCEDURES AND FUNCTION MAY BE DECLARED WITHOUT "CREATE OR REPLACE"


==================================
12. tests on scopes of variables :
==================================

For nested blocks an object defined in a parent block is available within all its child (nested blocks).
The reverse is not true, objects defined in a child block are not visible to the parent

-- ---------------------------------------------------------------------

test 1:
-------

declare

x       number;
V_sal   number;
V_found varchar2(10):='TRUE';

begin

x:=1;
V_sal:=1500;

  declare

  y number;

   begin
     if (V_sal>400) then V_found:='YES';
     end if;
     dbms_output.put_line('2. The value V_found is '||V_found);
     dbms_output.put_line('2. The value V_sal is '||V_sal);
     y:=25;
   end;

 dbms_output.put_line('1. The value V_found is '||V_found);
 -- dbms_output.put_line('1. The value y is '||TO_CHAR(y));

end;
/


2. The value V_found is YES
2. The value V_sal is 1500
1. The value V_found is YES

PL/SQL procedure successfully completed.

-- ---------------------------------------------------------------------

test 2:
-------

-- let op de scope van de variabelen

declare

x number;
V_sal number;
V_found varchar2(10):='TRUE';

begin

x:=1;
V_sal:=1500;

  declare

  V_found varchar2(10);  -- !!! again declared
  y number;

   begin
     if (V_sal>400) then V_found:='YES';
     end if;
     dbms_output.put_line('2. The value V_found is '||V_found);
     dbms_output.put_line('2. The value V_sal is '||V_sal);
     y:=25;
   end;

 dbms_output.put_line('1. The value V_found is '||V_found);
 -- dbms_output.put_line('1. The value y is '||TO_CHAR(y));

end;
/

2. The value V_found is YES
2. The value V_sal is 1500
1. The value V_found is TRUE

PL/SQL procedure successfully completed.

Maar V_found is wel 2x gedeclareerd, in een inner block, alswel in het outer block.


-- ---------------------------------------------------------------------

test 3:
-------

declare

x number;
V_sal number;
V_found varchar2(10):='TRUE';

begin

x:=1;
V_sal:=1500;

  declare

  V_found varchar2(10);
  y number;

   begin
     if (V_sal>400) then V_found:='YES';
     end if;
     dbms_output.put_line('The value V_found is '||V_found);
     dbms_output.put_line('The value V_sal is '||V_sal);
     y:=25;
   end;

 dbms_output.put_line('The value V_found is '||V_found);
 dbms_output.put_line('The value y is '||TO_CHAR(y));

end;
/

ERROR at line 26:
ORA-06550: line 26, column 50:
PLS-00201: identifier 'Y' must be declared
ORA-06550: line 26, column 2:
PL/SQL: Statement ignored


-------------------------------------------------------------------

test 4:
-------

-- THIS WORKS:

declare
my_emp scott.emp%rowtype;
my_var1 number;

begin
  my_var1:=1;
  dbms_output.put_line('level 1 en variabele :'||TO_CHAR(my_var1));
  declare
  my_var2 number;
         begin
           my_var2:=2;
           dbms_output.put_line('level 2 en variable :'||TO_CHAR(my_var2));
           declare my_var3 number;
              begin
                my_var3:=3;
                dbms_output.put_line('level 3 en variable :'||TO_CHAR(my_var3));
              end;
         end;
end;
/

level 1 en variabele :1
level 2 en variable :2
level 3 en variable :3

PL/SQL procedure successfully completed.


----------------------------------------------------------------------

test 5:

-- THIS IS WRONG:

declare
my_emp scott.emp%rowtype;
my_var1 number;

begin
  my_var1:=1;
  dbms_output.put_line('level 1 en variabele :'||TO_CHAR(my_var1));
  declare
  my_var2 number;
         begin
           my_var2:=2;
           dbms_output.put_line('level 2 en variable :'||TO_CHAR(my_var3));
           declare my_var3 number;
              begin
                my_var3:=3;
                dbms_output.put_line('level 3 en variable :'||TO_CHAR(my_var3));
              end;
         end;
end;
/

ERROR at line 12:
ORA-06550: line 12, column 66:
PLS-00201: identifier 'MY_VAR3' must be declared
ORA-06550: line 12, column 12:
PL/SQL: Statement ignored

------------------------------------------------------------------------------

test 6:
-------

-- THIS IS WRONG:

declare
my_emp emp%rowtype;
my_var1 number;

begin
  my_var1:=1;
  dbms_output.put_line('level 1 en variabele :'||TO_CHAR(my_var1));
  declare
  my_var2 number;
         begin
           my_var2:=2;
           dbms_output.put_line('level 2 en variable :'||TO_CHAR(my_var2));
           declare my_var3 number;
              begin
                my_var3:=3;
                dbms_output.put_line('level 3 en variable :'||TO_CHAR(my_var3));
              end;
         end;
dbms_output.put_line(TO_CHAR(my_var3));
end;
/


ERROR at line 19:
ORA-06550: line 19, column 30:
PLS-00201: identifier 'MY_VAR3' must be declared
ORA-06550: line 19, column 1:
PL/SQL: Statement ignored

------------------------------------------------------------------------------

test 7:
-------


DECLARE
   acct_balance NUMBER(11,2);
   acct         CONSTANT NUMBER(4) := 3;
   debit_amt    CONSTANT NUMBER(5,2) := 500.00;
BEGIN
   SELECT bal INTO acct_balance FROM accounts
      WHERE account_id = acct
      FOR UPDATE OF bal;
   IF acct_balance >= debit_amt THEN
      UPDATE accounts SET bal = bal - debit_amt
         WHERE account_id = acct;
   ELSE
      INSERT INTO temp VALUES
         (acct, acct_balance, 'Insufficient funds');
            -- insert account, current balance, and message
   END IF;
   COMMIT;
END;

-----------------------------------------------------------------------------

test 8: HOW TO DEAL WITH QUOTES:
--------------------------------

declare
OUTP VARCHAR(32);
BEGIN
OUTP:='bra''ins';
DBMS_OUTPUT.PUT_LINE(OUTP);
END;
/

bra'ins

PL/SQL procedure successfully completed.


declare
OUTP VARCHAR(32);
BEGIN
OUTP:='''brains';
DBMS_OUTPUT.PUT_LINE(OUTP);
END;
/

'brains

PL/SQL procedure successfully completed.


declare
OUTP VARCHAR(32);
BEGIN
OUTP:='''brains''';
DBMS_OUTPUT.PUT_LINE(OUTP);
END;
/

'brains'

PL/SQL procedure successfully completed.


===========================================
13.Composite Datataypes: records en tables:
===========================================


We hebben als datatypes:

- scalar datatypes (single value, geen interne componenenten: nummeric, varchar etc..)
        
declare
 my_empno number(4);
 my_ename varchar2(10);
 my_job varchar2(9);

- composite types  (samengestelde typen zoals records, tables)


13.1 Records:
=============

13.1.1 Introduction:
--------------------

We have already seen the record datatype:
A PL/SQL record is a variable that contains a collection of separate fields. 
Each field is individually addressable. You can reference the field names in both 
assignments and expressions. The fields within a record may have different 
datatypes and sizes, like the columns of a database table. Records are a convenient way 
of storing a complete fetched row from a database table.

Use the %ROWTYPE attribute to declare a record based upon a collection of database columns 
from a table or view. The fields within the record take their names and datatypes from 
the columns of the table or view.

Declare the record in the DECLARE section along with any other required variables 
and constants. An example follows :-


CREATE TABLE BOOK
(
ISBN    integer,
TITLE   varchar2(64),
Author  varchar2(64),
cost    number(7,2)
);

insert into book values (100, 'PLSQL Programming','Piet',12.75);
insert into book values (101,'C++ Programming', 'Klaas', 32.80);
insert into book values (102,'Java Programming', 'Miranda', 17.55);
insert into book values (104, '.NET Programming', 'Bridget',22.00);
COMMIT;


DECLARE
REC1 BOOK%ROWTYPE;
REC4 BOOK%ROWTYPE;

BEGIN
   SELECT * INTO REC1 FROM BOOK WHERE ISBN = 101;
   dbms_output.put_line('Title: '||REC1.Title||' Author: '||REC1.Author);
END;

The above declaration sets the object REC1 to be a record object holding fields 
that match the columns in the BOOK table. It doesn't hold any values until it is populated. 

Assign values into a PL/SQL record by naming the record after the INTO keyword 
of a SELECT statement. The INTO keyword defines the name specification for the 
storage area(s) of queried value(s). 

BEGIN
   SELECT * FROM SCOTT.BOOK INTO REC1 WHERE ISBN = 21;
END;


If the record is not a "copy" of an existing table, the proceed as follows:

DECLARE
         
TYPE TimeRec IS RECORD (hours SMALLINT, minutes SMALLINT);
     
TYPE MeetingTyp IS RECORD (
     date_held DATE,
     duration  TimeRec,  -- nested record
     location  VARCHAR2(20),
     purpose   VARCHAR2(50));

As another example:

declare
-- FIRST DECLARE THE NEW DATATYPE

type t_emp is record (
     my_empno scott.emp.empno%type,
     my_ename scott.emp.ename%type,
     my_job   scott.emp.job%type,
     my_sal   scott.emp.sal%type );

-- NOW INSTANTIATE A VARIABLE OF THIS DATATYPE 

employee t_emp;


13.1.2 Kinds of Records:
------------------------

PL/SQL supports three different kinds of records: table-based, cursor-based, and programmer-defined. 
These different types of records are used in different ways and for different purposes, 
but all three share the same internal structure: every record is composed of one or more fields. 
However, the way these fields are defined in the record depend on the record type. 


Table-based: (no declare needed)
------------

A record based on a table's column structure.
Each field corresponds to -- and has the same name as -- a column in a table
 
REC1 SCOTT.BOOK%ROWTYPE;

Cursor-based:
-------------

A record based on the cursor's SELECT statement.
Each field corresponds to a column or expression in the cursor SELECT statement. 

DECLARE
   /*
   || Create a cursor and rename the columns to give them a more
   || specific meaning for this particular cursor and block of code.
   */
   CURSOR high_losses_cur IS
      SELECT country_code  dying_country_cd,
             size_in_acres shrinking_plot,
             species_lost  above_avg_loss
        FROM rain_forest_history
       WHERE species_lost >
               (SELECT AVG (species_lost)
                  FROM rain_forest_history
                 WHERE TO_CHAR (analysis_date, 'YYYY') = '1994');

   /* Define the record for this cursor */

   high_losses_rec high_losses_cur%ROWTYPE;

BEGIN
   OPEN high_losses_cur;
   LOOP
      FETCH high_losses_cur INTO high_losses_rec;
      EXIT WHEN high_losses_cur%NOTFOUND;
      /*
      || Now when I reference one of the record's fields, I use the
      || name I gave that field in the cursor, not the original column
      || name from the table.
      */
      publicize_loss (high_losses_rec.dying_country_cd);
      project_further_damage (high_losses_rec.shrinking_plot);
   END LOOP;
   CLOSE high_losses_cur;
END;

 
Programmer-defined: (declare needed)
-------------------

A record whose structure you, the programmer, get to define with a declaration statement. 
Each field is defined explicitly (its name and datatype) in the TYPE statement for that record; 
a field in a programmer-defined record can even be another record. 

Contrary to the former record type, here you have two steps:

- declare the TYPE
- declare the variable of that TYPE

 
TYPE TimeRec IS RECORD (
hours SMALLINT, 
minutes SMALLINT);

TimeRec_rec Timerec;

Notice that I do not need the %ROWTYPE attribute, or any other kind of keyword, to denote this 
as a record declaration. The %ROWTYPE attribute is only needed for table and cursor records. 


13.1.3 Assigning Values to and from Records:
--------------------------------------------

You can modify the values in a record in the following ways: 

- Direct field assignment with the assignment operator

DECLARE
   rain_forest_rec rain_forest_history%ROWTYPE;

BEGIN
   /* Set values for the record */
   rain_forest_rec.country_code  := 1005;
   rain_forest_rec.analysis_date := SYSDATE;
   rain_forest_rec.size_in_acres := 32;
   rain_forest_rec.species_lost  := 425;

   /* Insert a row in the table using the record values */
   INSERT INTO rain_forest_history VALUES
      (rain_forest_rec.country_code,
       rain_forest_rec.analysis_date,
       rain_forest_rec.size_in_acres,
       rain_forest_rec.species_lost);
   ...
END;


- SELECT INTO from an implicit cursor

DECLARE
   TYPE customer_sales_rectype IS RECORD
      (customer_id   NUMBER (5),
       customer_name customer.name%TYPE,
       total_sales   NUMBER (15,2)
       );
   top_customer_rec  customer_sales_rectype;
BEGIN
   /* Move values directly into the record: */
   SELECT customer_id, name, SUM (sales)
     INTO top_customer_rec
     FROM customer
    WHERE sold_on BETWEEN < ADD_MONTHS (SYSDATE, -3);

   /* or list the individual fields: */
   SELECT customer_id, name, SUM (sales)
     INTO top_customer_rec.customer_id, top_customer_rec.customer_name,
          top_customer_rec.total_sales
     FROM customer
    WHERE sold_on BETWEEN < ADD_MONTHS (SYSDATE, -3);


- FETCH INTO from an explicit cursor

DECLARE
   /*
   || Declare a cursor and then define a record based on that cursor
   || with the %ROWTYPE attribute.
   */
   CURSOR cust_sales_cur IS
      SELECT customer_id, name, SUM (sales) tot_sales
        FROM customer
       WHERE sold_on BETWEEN < ADD_MONTHS (SYSDATE, -3);
   cust_sales_rec cust_sales_cur%ROWTYPE;

BEGIN
   /* Move values directly into record by fetching from cursor */

   OPEN cust_sales_cur;
   FETCH cust_sales_cur INTO cust_sales_rec;

   /* or fetch values from the select list into individual fields. */

   OPEN cust_sales_cur;
   FETCH cust_sales_cur
      INTO cust_sales_rec.customer_id,
           cust_sales_rec.customer_name,
           cust_sales_rec.total_sales;


- Aggregate assignment

DECLARE
REC1 BOOK%ROWTYPE;
REC4 BOOK%ROWTYPE;
..

BEGIN

  REC1:=REC4;


Pass as a parameter:
--------------------

You can also pass a record as a parameter to a procedure: 

DECLARE
   TYPE customer_sales_rectype IS RECORD (...);
   customer_rec customer_sales_rectype;
BEGIN
   display_sales_data (customer_rec);
END; 


PROCEDURE compare_companies
   (prev_company_rec IN company%ROWTYPE)
IS
   curr_company_rec company%ROWTYPE := prev_company_rec;
BEGIN
   ...
END;

--------------------------------------------------------


type t_emp is record (
     my_empno scott.emp.empno%type,
     my_ename scott.emp.ename%type,
     my_job   scott.emp.job%type,
     my_sal   scott.emp.sal%type );

declare

type rr_reden is record (
              redenid number,
              statusid number);

r_reden rr_reden;

r_b rr_b;

create or replace function f(a in number, b in number)

return rr_b

x rr_b;

is

begin

x.redenid:=a;
x.statusid:=b;

return x

end;
/

13.1.4 Nested Records:
-----------------------

You can include a record as a field within another record. This is called a nested record. 
The record that contains the nested record as a field is called the enclosing record. 


DECLARE
     
 TYPE TimeRec IS RECORD (hours SMALLINT, minutes SMALLINT);
     
 TYPE MeetingTyp IS RECORD (
      date_held DATE,
      duration  TimeRec,  -- nested record
      location  VARCHAR2(20),
      purpose   VARCHAR2(50));


In the following example I declare a record type for all the elements of a phone number (phone_rectype), 
and then declare a record type which collects all the phone numbers for a person together 
in a single structure (contact_set_rectype). 

DECLARE
   TYPE phone_rectype IS RECORD
      (intl_prefix   VARCHAR2(2),
       area_code     VARCHAR2(3),
       exchange      VARCHAR2(3),
       phn_number    VARCHAR2(4),
       extension     VARCHAR2(4)
      );
   TYPE contact_set_rectype IS RECORD
      (day_phone#    phone_rectype, /* Nested record */
       eve_phone#    phone_rectype, /* Nested record */
       fax_phone#    phone_rectype, /* Nested record */
       cell_phone#   phone_rectype  /* Nested record */
      );

   auth_rep_info_rec contact_set_rectype;

BEGIN

   auth_rep_info_rec.fax_phone#.area_code :=   auth_rep_info_rec.home_phone#.area_code;


13.2 plsql tables:
==================


13.2.1 Introduction:
--------------------

A records contains one set of values at a time. You can use a record
in a loop, and fetch table records into it, but it will contain 1 record at any time.
A record is not an indexed structure, like an array in other languages.

A "traditional" PL/SQL table is a one-dimensional, unbounded, sparse collection of homogeneous elements, 
indexed by integers. In technical terms, it is like an array; 
it is like a SQL table; yet it is not precisely the same as either of those data structures. 

There are 3 Table types:

- plsql tables
- nested tables
- VARRAYS or Variable Arrays

Some books speak of the following types:

1 dimensionale array of scalar datatypes:

  - index_by tables
  - nested tables.

2 dimensionale array van scalar datatypes:

  - tables of records


But a definition worth repeating: A "traditional" PL/SQL table is a one-dimensional, unbounded, 
sparse collection of homogenous elements, indexed by integers. 
A PL/SQL table can have only one column. It is, in this way, similar to a one-dimensional array.
There is no predefined limit to the number of rows in a PL/SQL table. The PL/SQL table grows 
dynamically as you add more rows to the table. The PL/SQL table is, in this way, 
very different from an array. 

You can consider a plsql table as a structure of 1 column and many rows.

Because a PL/SQL table can have only a single column, all rows in a PL/SQL table 
contain values of the same datatype. It is, therefore, homogeneous. 

With PL/SQL Release 2.3, you can have PL/SQL tables of records. The resulting table is still, 
however, homogeneous. Each row simply contains the same set of columns

You cannot SELECT from PL/SQL tables. There is no way to perform set-at-a-time processing 
to retrieve data from a PL/SQL table. This is a programmatic construct in a programmatic language. 
Instead you can use PL/SQL loops to move through the contents of a PL/SQL table, one row at a time. 

You cannot issue DML statements (INSERTs, UPDATEs, and DELETEs) against PL/SQL tables 
(though PL/SQL Release 2.3 does offer a DELETE operator). 

Usage:

- Define a particular PL/SQL table structure (made up of strings, dates, etc.) 
  using the table TYPE statement.
- Declare the actual table based on that table type. The declaration of a PL/SQL table 
  is a specific instance of a generic datatype. 


13.2.2 Example declarations of INDEX BY Tables:
-----------------------------------------------

Let's take a look at an example and then explore the characteristics of a table. 
The following procedure accepts a name and a row and assigns that name to the 
corresponding row in the PL/SQL table: 


Example 1:
----------

TYPE company_keys_tabtype IS TABLE OF company.company_id%TYPE NOT NULL
   INDEX BY BINARY_INTEGER;

TYPE reports_requested_tabtype IS TABLE OF VARCHAR2 (100)
   INDEX BY BINARY_INTEGER;

To create the actual table:

<table_name> <table_type>;


Example 2:
----------

CREATE OR REPLACE PROCEDURE set_name(name_in IN VARCHAR2, row_in IN INTEGER)
IS

TYPE s_tabletype IS
       TABLE OF VARCHAR2(30) INDEX BY BINARY_INTEGER;

company_name_table   s_tabletype;

BEGIN
    company_name_table (row_in) := name_in;
END;


Example 3.
----------

   -- First declare a few types:

declare

TYPE names IS TABLE OF varchar2(20)
INDEX BY BINARY_INTEGER;                     -- index_by table type

type team_type IS TABLE OF varchar2(20)      -- index_by table type
INDEX BY BINARY_INTEGER;


  -- Now declare variables of those types:

customer_names    names      ;
my_team           team_type  ;

  -- Assignment of values:

begin
for mynum in 0..4 loop

   if     mynum=1 then my_team(mynum):='SMITH';
   elsif  mynum=2 then my_team(mynum):='JONES';
   elsif  mynum=3 then my_team(mynum):='TURNER';
   elsif  mynum=4 then my_team(mynum):='KING'; 
   end if;

end loop;

dbms_output.put_line(my_team(1));
dbms_output.put_line(my_team(2));
end;


Example 4:
----------

company_names_tab (15) := 'Fabricators Anonymous';

company_keys_tab (-2000) := new_company_id;

header_string := 'Sales for ' || company_names_tab (25);


13.2.3 Assignment of values in INDEX BY Tables:
-----------------------------------------------


Direct Assignment:
------------------

As shown in previous examples, you can simply assign a value to a row with the assignment operator: 

countdown_test_list (43)            := 'Internal pressure';
company_names_table (last_name_row) := 'Johnstone Clingers';


Iterative Assignment:
---------------------
 
In order to fill up multiple rows of a table, I recommend taking advantage of a PL/SQL loop. 
Within the loop you will still perform direct assignments to set the values of each row, 
but the primary key value will be set by the loop rather than hardcoded into the assignment itself. 

In the following example, I use a WHILE loop to fill and then display a PL/SQL date table 
with the next set of business days, as specified by the ndays_in parameter: 

/* Filename on companion disk: bizdays.sp */

CREATE OR REPLACE PROCEDURE show_bizdays
   (start_date_in IN DATE := SYSDATE, ndays_in IN INTEGER := 30)
IS
   TYPE date_tabtype IS TABLE OF DATE INDEX BY BINARY_INTEGER;
   bizdays date_tabtype;

   /* The row in the table containing the nth_day */

   nth_day  BINARY_INTEGER := 1;
   v_date DATE := start_date_in;
BEGIN

   /* Loop through the calendar until enough biz days are found */

   WHILE nth_day <= ndays_in
   LOOP

      /* If the day is not on the weekend, add to the table. */

      IF TO_CHAR (v_date, 'DAY') NOT IN ('SAT', 'SUN')
      THEN
         bizdays (nth_day) := v_date;
         DBMS_OUTPUT.PUT_LINE (v_date);
         nth_day := nth_day + 1;
      END IF;
      v_date := v_date + 1;
   END LOOP;
END show_bizdays;
/


Aggregate Assignment:
---------------------

DECLARE
   TYPE name_table IS TABLE OF VARCHAR2(100) INDEX BY BINARY_INTEGER;
   old_names name_table;
   new_names name_table;
BEGIN
   /* Assign values to old_names table */
   old_names(1) := 'Smith';
   old_names(2) := 'Harrison';

   /* Assign values to new_names table */
   new_names(111) := 'Hanrahan';
   new_names(342) := 'Blimey';

   /* Transfer values from new to old */
   old_names := new_names;

   /* This assignment will raise NO_DATA_FOUND */
   DBMS_OUTPUT.PUT_LINE (old_names (1));
END;


13.2.4 Example of a table of records:
-------------------------------------


declare
         -- DECLAREER EERST HET RECORD TYPE

type t_emp is record (
           my_empno emp.empno%type,
           my_ename emp.ename%type,
           my_job   emp.job%type,
           my_sal   emp.sal%type );

         -- DECLAREER NU DE TYPE TABLE OF RECORDS

type t_table_emp is table of t_emp;

         -- DECLAREER NU EEN RECORD EN TABLE VARIABELE

employee t_emp;

my_emp t_table_emp;

         -- ASSIGNMENT OF VALUES, EERST HET RECORD VARIABELE DAN DE TABLE VARIABELE

begin

  SELECT empno, ename, job, sal INTO
         employee.my_empno, employee.my_ename, employee.my_job, employee.my_sal
  FROM emp WHERE empno=7844;

my_emp:=t_table_emp(employee);

dbms_output.put_line(my_emp(1).my_ename);


end;
/


13.3. Object types:
===================

In PL/SQL, object-oriented programming is based on object types. 
An object type encapsulates a data structure along with the 
functions and procedures needed to manipulate the data.
The variables that form the data structure are called attributes. 
The functions and procedures that characterize the behavior 
of the object type are called methods. 

Object types reduce complexity by breaking down a large system 
into logical entities. 
This allows you to create software components that are 
modular, maintainable, and reusable. 

When you define an object type using the CREATE TYPE statement 
(in SQL*Plus for example), you create an abstract template for some 
real-world object. As the following example of a bank account shows, 
the template specifies only those attributes and behaviors the object 
will need in the application environment: 

Example objecttype:
-------------------

CREATE TYPE Bank_Account AS OBJECT ( 
   acct_number INTEGER(5),
   balance     REAL,
   status      VARCHAR2(10),
   MEMBER PROCEDURE open (amount IN REAL),
   MEMBER PROCEDURE verify_acct (num IN INTEGER),
   MEMBER PROCEDURE close (num IN INTEGER, amount OUT REAL),
   MEMBER PROCEDURE deposit (num IN INTEGER, amount IN REAL),
   MEMBER PROCEDURE withdraw (num IN INTEGER, amount IN REAL),
   MEMBER FUNCTION curr_bal (num IN INTEGER) RETURN REAL 
);


At run time, when the data structure is filled with values, 
you have created an instance of an abstract bank account. 
You can create as many instances (called objects) as you need. 
Each object has the number, balance, and status of an actual bank account. 


13.4 Metalink Article:
======================

Oracle8 provides two collection types: nested tables and varying  arrays or VARRAYS. 
A collection is an ordered group of elements of the same type.  
Each element from the group can be accessed  using a unique subscript.  The element types of a collection can  
be either built-in datatypes, user-defined types or references (REFs) to object types.   

Nested Tables 
------------- 
An ordered group of items of type TABLE are called nested tables. 
Nested tables can contain multiple columns and can be used as 
variables, parameters, results, attributes, and columns. They 
can be thought of as one column database tables. Rows of a 
nested table are not stored in any particular order. 
The size of a nested table can increase dynamically, i.e., nested 
tables are unbounded. Elements in a nested table initially have 
consecutive subscripts, but as elements are deleted, they can have 
non-consecutive subscripts. 
Nested tables can be fully manipulated using SQL, Pro*C, OCI, and 
PL/SQL. The range of values for nested table subscripts is 
1..2147483647. To extend a nested table, the built-in procedure 
EXTEND must be used. To delete elements, the built-in procedure 
DELETE must be used. 
An uninitialized nested table is atomically null, so the IS NULL 
comparison operator can be used to see if a nested table is null. 
Oracle8 provides new operators such as CAST, THE, and MULTISET for 
manipulating nested tables. 

Examples for Nested Tables 
-------------------------- 

Example 1: 
---------- 
The following example illustrates how a simple nested table is created. 

a) First, define a Object type as follows: 

SQL> CREATE TYPE ELEMENTS AS OBJECT ( 
2> ELEM_ID NUMBER(6), 
3> PRICE NUMBER(7,2)); 
4> / 

(This looks like a record).

b) Next, create a table type ELEMENTS_TAB which stores ELEMENTS objects: 
SQL> CREATE TYPE ELEMENTS_TAB AS TABLE OF ELEMENTS; 
2> / 

(This looks like a table of records).

c) Finally, create a database table STORAGE having type ELEMENTS_TAB as 
one of its columns: 
SQL> CREATE TABLE STORAGE ( 
2> SALESMAN NUMBER(4), 
3) ELEM_ID  NUMBER(6), 
4) ORDERED  DATE, 
5) ITEMS    ELEMENTS_TAB) 
6) NESTED TABLE ITEMS STORE AS ITEMS_TAB; 

Example 2: 
---------- 
This example demonstrates how to populate the STORAGE table with a single 
row: 
SQL> INSERT INTO STORAGE 
2> VALUES (100,123456,SYSDATE, 
3> ELEMENTS_TAB(ELEMENTS(175692,120.12), 
4> ELEMENTS(167295,130.45), 
5> ELEMENTS(127569,99.99))); 

Example 3: 
---------- 
The following example demonstrates how to use the operator THE which is 
used in a SELECT statement to identify a nested table: 
SQL> INSERT INTO 
2> THE 
3> (SELECT ITEMS FROM STORAGE WHERE ELEM_ID = 123456) 
4> VALUES (125762, 101.99); 

Example 4: 
---------- 
The following example shows how to update the STORAGE table row where 
salesman column has value 100: 
SQL> UPDATE STORAGE 
2> SET ITEMS = ELEMENTS_TAB(ELEMENTS(192512, 199.99)) 
3> WHERE SALESMAN = 100; 

Varrays 
------- 

Varrays are ordered groups of items of type VARRAY. Varrays can be used 
to associate a single identifier with an entire collection. This allows 
manipulation of the collection as a whole and easy reference of 
individual elements. 
The maximum size of a varray needs to be specified in its type definition. 
The range of values for the index of a varray is from 1 to the maximum 
specified in its type definition. If no elements are in the array, then 
the array is atomically null. The main use of a varray is to group 
small or uniform-sized collections of objects. 
Elements of a varray cannot be accessed individually through SQL, although 
they can be accessed in PL/SQL, OCI, or Pro*C using the array style 
subscript. The type of the element of a VARRAY can be any PL/SQL type 
except the following: 
BOOLEAN 
TABLE 
VARRAY 
object types with TABLE or VARRAY attributes 
REF CURSOR 
NCHAR 
NCLOB 
NVARCHAR2 

Varrays can be used to retrieve an entire collection as a value. Varray 
data is stored in-line, in the same tablespace as the other data in its row. 
When a varray is declared, a constructor with the same name as the varray is 
implicitly defined. The constructor creates a varray from the elements 
passed to it. You can use a constructor wherever you can use a function 
call, including the SELECT, VALUES, and SET clauses. 
A varray can be assigned to another varray, provided the datatypes are the 
exact same type. For example, suppose you declared two PL/SQL types: 
TYPE My_Varray1 IS VARRAY(10) OF My_Type; 
TYPE My_Varray2 IS VARRAY(10) OF My_Type; 
An object of type My_Varray1 can be assigned to another object of type 
My_Varray1 because they are the exact same type. However, an object of 
type My_Varray2 cannot be assigned to an object of type My_Varray1 because 
they are not the exact same type, even though they have the same element type. 
Varrays can be atomically null, so the IS NULL comparison operator can be 
used to see if a varray is null. Varrays cannot be compared for equality 
or inequality. 

Examples for Varrays 
-------------------- 

Example 5: 
--------- 
The following shows how to create a simple VARRAY: 

a) First, define a object type ELEMENTS as follows: 
SQL> CREATE TYPE MEDICINES AS OBJECT ( 
2> MED_ID NUMBER(6), 
3> MED_NAME VARCHAR2(14), 
4> MANF_DATE DATE); 
5> / 

b) Next, define a VARRAY type MEDICINE_ARR which stores MEDICINES objects: 
SQL> CREATE TYPE MEDICINE_ARR AS VARRAY(40) OF MEDICINES; 
2> / 
c) Finally, create a relational table MED_STORE which has MEDICINE_ARR as a 
column type: 
SQL> CREATE TABLE MED_STORE ( 
2> LOCATION VARCHAR2(15), 
3> STORE_SIZE NUMBER(7), 
4> EMPLOYEES NUMBER(6), 
5> MED_ITEMS MEDICINE_ARR); 

Example 6: 
---------- 
The following example shows how to insert two rows into the MED_STORE table: 

SQL> INSERT INTO MED_STORE 
2> VALUES ('BELMONT',1000,10, 
3> MEDICINE_ARR(MEDICINES(11111,'STOPACHE',SYSDATE))); 
SQL> INSERT INTO MED_STORE 
2> VALUES ('REDWOOD CITY',700,5, 
3> MEDICINE_ARR(MEDICINES(12345,'STRESS_BUST',SYSDATE))); 

Example 7: 
---------- 
The following example shows how to delete the second row we have inserted in 
example 6 above: 

SQL> DELETE FROM MED_STORE 
2> WHERE LOCATION = 'REDWOOD CITY'; 

Example 8: 
---------- 
The following example shows how to update the MED_STORE table and add more 
medicines to the Belmont store: 

SQL> UPDATE MED_STORE 
2> SET MED_ITEMS = MEDICINE_ARR ( 
3> MEDICINES(12346,'BUGKILL',SYSDATE), 
4> MEDICINES(12347,'INHALER',SYSDATE), 
5> MEDICINES(12348,'PAINKILL',SYSDATE)); 

Differences Between Nested Tables and Varrays 
--------------------------------------------- 
* Nested tables are unbounded, whereas varrays have a maximum size. 
* Individual elements can be deleted from a nested table, but not from 
a varray. Therefore, nested tables can be sparse, whereas varrays are 
always dense. 
* Varrays are stored by Oracle in-line (in the same tablespace), whereas 
nested table data is stored out-of-line in a store table, which is a 
system-generated database table associated with the nested table. 
* When stored in the database, nested tables do not retain their ordering 
and subscripts, whereas varrays do. 
* Nested tables support indexes while varrays do not. 


Example of Pipeline function:
-----------------------------

But -- just creating a pipelined in a package is easy:

tkyte@TKYTE9I.US.ORACLE.COM> create or replace type myScalarType as object
  2  ( a   int,
  3    b   date,
  4    c   varchar2(25)
  5  )
  6  /

Type created.

tkyte@TKYTE9I.US.ORACLE.COM>
tkyte@TKYTE9I.US.ORACLE.COM> create or replace type myTableType as table of 
myScalarType
  2  /

Type created.

tkyte@TKYTE9I.US.ORACLE.COM>
tkyte@TKYTE9I.US.ORACLE.COM>
tkyte@TKYTE9I.US.ORACLE.COM> create or replace package my_pkg
  2  as
  3          function f return myTableType PIPELINED;
  4  end;
  5  /

Package created.

tkyte@TKYTE9I.US.ORACLE.COM> create or replace package body my_pkg
  2  as
  3          function f return myTableType
  4          PIPELINED
  5          is
  6          begin
  7                  for i in 1 .. 5
  8                  loop
  9                          pipe row ( myScalarType( i, sysdate+i, 'row ' || i 
) );
 10                  end loop;
 11                  return;
 12          end;
 13  end;
 14  /

Package body created.

tkyte@TKYTE9I.US.ORACLE.COM>
tkyte@TKYTE9I.US.ORACLE.COM> select * from table( my_pkg.f() );

         A B         C
---------- --------- -------------------------
         1 29-JUN-02 row 1
         2 30-JUN-02 row 2
         3 01-JUL-02 row 3
         4 02-JUL-02 row 4
         5 03-JUL-02 row 5


select * 
  from ( select a.*, rownum rnum
           from ( YOUR_QUERY_GOES_HERE -- including the order by ) a
          where rownum <= MAX_ROWS )
 where rnum >= MIN_ROWS
/


======================================
14. READ AND WRITE TO FILES
======================================


14.1 SQL*Loader:
================


SQL*Loader is used for loading data from text files into Oracle tables. 
The text file can have fixed column positions or columns separated by a special character, 
for example an ",".

to call sqlloader

sqlldr system/manager control=smssoft.ctl
sqlldr parfile=bonus.par


Example 1:
----------

-- example records in file mtc.pmp

T8;0010003;BP Schimmel Reek;Mgr Borretstraat;000065;;REEK;;
T8;0010011;Esso Tramdijk;Tramdijk;000005;;SPIJKENISSE;;
T8;0010015;Esso Matlingeweg;Matlingeweg;000011;;ROTTERDAM;;
T8;0010018;BP Pr. Beatrixlaan;Prinses Beatrixlaan;000028;;'S-GRAVENHAGE;;
T8;0010019;Total Vollenhoven Nuenen;Collse Hoefdijk;000009;a;NUENEN;;
T8;0010021;Swallow Retail Operations B.V.;Brinklaan;000166;;BUSSUM;;

-- create staging table

SQL> create table STG_POMP
  2  (
  3  type varchar2(2),
  4  nummer varchar2(10),
  5  naam varchar2(64),
  6  straat varchar2(64),
  7  huisnummer varchar2(16),
  8  toevoeging varchar2(16),
  9  plaats varchar2(64),
 10  landcode varchar2(16));

-- get mtc .pmp data into STG_POMP


e:\test\sqlldr parfile=pomp.par

-- pomp.par:

userid=brains
control=pomp.ctl
bad=pomp.bad
log=pomp.log
discard=pomp.dis

-- pomp.ctl:

LOAD DATA
INFILE 'e:\test\MTC01.pmp'
TRUNCATE
INTO TABLE STG_POMP
FIELDS TERMINATED BY ';' OPTIONALLY ENCLOSED BY '"'
(type,nummer,naam,straat,huisnummer,toevoeging,plaats,landcode)


Example 2:
----------

BONUS.PAR:

userid=brains
control=import_tra.ctl
bad=import_tra.bad
log=import_tra.log
discard=import_tra.dis
rows=2
errors=2
skip=0

BONUS.CTL:

LOAD DATA
INFILE bonus.dat
APPEND
INTO TABLE BONUS
(name position(01:08) char,
city position(09:19) char,
salary position(20:22) integer external)

Now you can use the command: 
$ sqlldr parfile=bonus.par

Example 3:
----------
LOAD1.CTL:

LOAD DATA
INFILE 'PLAYER.TXT'
INTO TABLE BASEBALL_PLAYER
FIELDS TERMINATED BY ',' OPTIONALLY ENCLOSED BY '"'
  (player_id,last_name,first_name,middle_initial,start_date)

SQLLDR system/manager CONTROL=LOAD1.CTL LOG=LOAD1.LOG
 BAD=LOAD1.BAD DISCARD=LOAD1.DSC


- Convential path load:
When the DIRECT=Y parameter is not used, the convential path is used.
This means that essentially INSERT statements are used, 
triggers and referential integrety are in normal use, and that
the buffer cache is used.

- Direct path load:
Buffer cache is not used. Existing used blocks are not used.
New blocks are written as needed.
Referential integrety and triggers are disabled during the load.


Special options:
----------------

to skip the first record, simply use skip=


sqlldr userid=x/y control=yourfile.ctl skip=1


In order to skip that first field, use FILLER, see:


14.2 UTL_FILE Package:
======================

You can create, read, and write to and from a OS file from pl/sql.
The system package UTL_FILE has a number of functions that makes this possible.

Example 1:
----------

create or replace procedure fwrite
is 
v_output_file1 utl_file.file_type; 
begin 
v_output_file1 := utl_file.fopen('LOG_DIR', 'NEW2.txt', 'a'); 
utl_file .put_line(v_output_file1, 'NATURE and grietjes'); 
utl_file.fclose_all; 
end; 
/ 
 
put_line appends an operating system-specific line terminator.


Example 2:
----------

create or replace procedure fwrite2 
is 

v_output_file1 utl_file.file_type; 

cursor cur_emp IS
        SELECT * FROM EMP;

my_var cur_emp%rowtype;

BEGIN
  v_output_file1 := utl_file.fopen('BRAINS_EXPORTS', 'employee.txt', 'a'); 

  open cur_emp;

  loop
    fetch cur_emp into my_var;
    utl_file.put_line(v_output_file1,my_var.empno||','||my_var.ename); 
    exit when cur_emp%notfound;
  end loop;

utl_file.fclose_all; 
END;
/


Example 3: universal function to output ANY query to ANY file (!!!):
--------------------------------------------------------------------

We want to dump the output of any SELECT to a file from our choice.
This can be done with the following:

create or replace function  BRAINS_EXPORT
                    ( p_query     in varchar2,
                      p_separator in varchar2 default ';',
                      p_dir       in varchar2 ,
                      p_filename  in varchar2 )
return number
is
      l_output        utl_file.file_type;
      l_theCursor     integer default dbms_sql.open_cursor;
      l_columnValue   varchar2(4096);
      l_status        integer;
      l_colCnt        number default 0;
      l_separator     varchar2(10) default '';
      l_cnt           number default 0;
begin
      l_output := utl_file.fopen( p_dir, p_filename, 'w' );
  
      dbms_sql.parse(  l_theCursor,  p_query,
                                           dbms_sql.native );
  
      for i in 1 .. 255 loop
          begin
              dbms_sql.define_column( l_theCursor, i,
                                      l_columnValue, 2000 );
              l_colCnt := i;
          exception
              when others then
                  if ( sqlcode = -1007 ) then exit;
                  else
                      raise;
                  end if;
          end;
      end loop;
  
      dbms_sql.define_column( l_theCursor, 1,
                              l_columnValue, 2000 );
  
      l_status := dbms_sql.execute(l_theCursor);
  
      loop
          exit when ( dbms_sql.fetch_rows(l_theCursor) <= 0 );
          l_separator := '';
          for i in 1 .. l_colCnt loop
              dbms_sql.column_value( l_theCursor, i,
                                     l_columnValue );
              utl_file.put( l_output,
                            l_separator || l_columnValue );
              l_separator := p_separator;
          end loop;
          utl_file.new_line( l_output );
          l_cnt := l_cnt+1;
      end loop;
      dbms_sql.close_cursor(l_theCursor);
  
      utl_file.fclose( l_output );
      return l_cnt;
  end BRAINS_EXPORT;
/

How can this be used?

declare
l_rows    number;
begin
l_rows := export( 'select 9010310||pasnummer||1200000 from pas','','IMP_MTC','geblokkeerd.txt' );
dbms_output.put_line( to_char(l_rows) ||' rows extracted to ascii file' );
end;
/


declare
     l_rows    number;
begin
      l_rows := dump_csv( 'select *
                             from all_users
                            where rownum < 5',
                          ',',
                         'e:\io',
                          'test.dat' );
      dbms_output.put_line( to_char(l_rows) ||
                            ' rows extracted to ascii file' );
end;
/


If you need to translate anything:

You can use translate(l_output , chr(13)||chr(10), 'XY' )

to have carriage returns (chr(13)) turned into X (or whatever of course) and 
linefeeds (newlines) turned into Y (or whatever) 


Example 4: Univeral function to read ANY ascii file into a table (!!!).
-----------------------------------------------------------------------

create table iob_implog( errm varchar2(4000), data varchar2(4000) );


create or replace function FNC_BRAINS_IMPORT
                     ( p_table     in varchar2,
                       p_cnames    in varchar2,
                       p_dir       in varchar2,
                       p_filename  in varchar2,
                       p_delimiter in varchar2 default '|' )
return number
is
      l_input         utl_file.file_type;
      l_theCursor     integer default dbms_sql.open_cursor;
      l_buffer        varchar2(4000);
      l_lastLine      varchar2(4000);
      l_status        integer;
      l_colCnt        number default 0;
      l_cnt           number default 0;
      l_sep           char(1) default NULL;
      l_errmsg        varchar2(4000);
begin
   l_input := utl_file.fopen( p_dir, p_filename, 'r' );
  
   l_buffer := 'insert into ' || p_table || ' values ( ';
      l_colCnt := length(p_cnames)-
                  length(replace(p_cnames,',',''))+1;
  
      for i in 1 .. l_colCnt
      loop
          l_buffer := l_buffer || l_sep || ':b'||i;
          l_sep    := ',';
      end loop;
      l_buffer := l_buffer || ')';
   
       dbms_sql.parse(l_theCursor, l_buffer, dbms_sql.native);
  
      loop
          begin
               utl_file.get_line( l_input, l_lastLine );
          exception
               when NO_DATA_FOUND then
                  exit;
          end;
          l_buffer := l_lastLine || p_delimiter;
   
   
           for i in 1 .. l_colCnt
           loop
               dbms_sql.bind_variable
                ( l_theCursor, ':b'||i,
                  substr( l_buffer, 1,
                  instr(l_buffer,p_delimiter)-1 ) ) ;
               l_buffer := substr( l_buffer,
                              instr(l_buffer,p_delimiter)+1 );
           end loop;
   
           begin
               l_status := dbms_sql.execute(l_theCursor);
               l_cnt := l_cnt + 1;
           exception
               when others then
                   l_errmsg := sqlerrm;
                   insert into badlog ( errm, data )
                   values ( l_errmsg, l_lastLine );
           end;
       end loop;
   
       dbms_sql.close_cursor(l_theCursor);
       utl_file.fclose( l_input );
       commit;
   
       return l_cnt;
end FNC_BRAINS_IMPORT;
/


usage:

begin
  2     dbms_output.put_line(
  3         load_data( 'T1',
  4                    'x,y,z',
  5                    '/tmp',
  6                    't1.dat',
  7                    ',' ) || ' rows loaded' );
  8  end;
  9  /
3 rows loaded

PL/SQL procedure successfully completed.


You asked.....

I want to UTL_FILE (from unix) and output an ascii text file to an NT machine 
directly, no ftp, no nothing, using nfs. 
I already tried this, no pbm.
The file need to be viewed as any normal text file,
on a notepad, or opened in Excel, or any other application.

The pbm is, there is this special character at the end of each line (I think it 
is a ^M).
I know I can use unix2dos to change/modify the file to remove the special 
character, so that everything look OK on a notepad.

But, is there anyway that I can achive the same by doing it in one step, by 
using 'UTL_FILE' alone, without using "unix2dos" or any other additiional steps 
??

The text output program is to be a monthly automatic process. I do not wish to 
call an external program to perform the stripping job.

thanks 


Followup:  


try

utl_file.put_line( your_text || chr(13) );

We'll put the linefeed (char(10)),  bill gates long ago decided we needed two 
characters CARRIAGE_RETURN/LINE_FEED  (chr(13)||chr(10)) to terminate a line.  
Add the chr(13) and they windows people may be apeased. 
 

14.3 Move, rename or delete of a file:
======================================

Most important procedures

utl_file.fcopy (
location   IN VARCHAR2,
filename   IN VARCHAR2,
dest_dir   IN VARCHAR2,
dest_file  IN VARCHAR2,
start_line IN PLS_INTEGER DEFAULT 1,
end_line   IN PLS_INTEGER DEFAULT NULL);

utl_file.frename (
location  IN VARCHAR2,
filename  IN VARCHAR2, 
dest_dir  IN VARCHAR2,
dest_file IN VARCHAR2,
overwrite IN BOOLEAN DEFAULT FALSE);

utl_file.fgetattr(
location    IN  VARCHAR2, 
filename    IN  VARCHAR2, 
exists      OUT BOOLEAN, 
file_length OUT NUMBER, 
blocksize   OUT NUMBER);


Example 1
---------

UTL_FILE.fcopy (
   src_location      => 'WINNERS_DIR',
   src_filename      => 'names.txt',
   dest_location     => 'OLD_NEWS_DIR',
   dest_filename     => 'prevnames.txt',
   start_line        => 1,
   end_line          => 6
   );

Example 2:
----------

UTL_FILE.frename('IMP_DIR','bios.txt','IMP_ARCHIVE','bios_old.txt');

Example 3:
----------

BEGIN
  utl_file.frename('ORALOAD', 'test.txt', 'ORALOAD', 'x.txt', TRUE);
END frename;
/

Example 4:
----------

BEGIN
  utl_file.fremove('ORALOAD', 'dump.txt');
END fremove;
/

Example 5:
----------

PROCEDURE archive IS
BEGIN

  UTL_FILE.FRENAME ('BDUMP_DIR',
                    alertfile,
                    'BDUMP_DIR',
                    TO_CHAR(SYSDATE,'YYYYMMDD')||
		    '_'||alertfile);
END archive;

Example 6:
----------

set serveroutput on

DECLARE

ex         BOOLEAN;
flen       NUMBER;
bsize      NUMBER;

BEGIN
  utl_file.fgetattr('ORALOAD', 'test.txt', ex, flen, bsize);

  IF ex THEN
    dbms_output.put_line('File Exists');
  ELSE
    dbms_output.put_line('File Does Not Exist');
  END IF;
  dbms_output.put_line('File Length: ' || TO_CHAR(flen));
  dbms_output.put_line('Block Size: ' || TO_CHAR(bsize));
END fgetattr;
/


Example 7:
----------

l_file         UTL_FILE.file_type         ;
l_location     VARCHAR2(100) := 'IMP_BIOS';
l_filename     VARCHAR2(100) := 'bios.txt';
l_text         VARCHAR2(178)              ;

BEGIN

  l_file := UTL_FILE.fopen(l_location, l_filename, 'r', 32767);
  

  UTL_FILE.get_line(l_file, l_text, 32767);


  BEGIN
    LOOP
      UTL_FILE.get_line(l_file, l_text, 32767);

      INSERT INTO IOB_BIOS_IMPORT
      (longrecord)
      VALUES
      (l_text);

    END LOOP;
  EXCEPTION
    WHEN NO_DATA_FOUND THEN
      NULL;
  END;
  
  -- Close the file.
  UTL_FILE.fclose(l_file);

END;


14.4 Create a new table on basis of existing table:
===================================================


CREATE TABLE EMPLOYEE_2
AS SELECT * FROM EMPLOYEE

insert into t SELECT * FROM t2;

insert into DSA_IMPORT    
SELECT * FROM MDB_DW_COMPONENTEN@SALES


14.5 Send a file with mail from Oracle:
=======================================

Here a procedure is described which sends up to 3 files to an emailaddress.

Parameters

from_name (varchar2, mandatory) 
to_name (varchar2, mandatory) 
subject (varchar2, mandatory) 
message (varchar2, mandatory) 
max_size (number, optional) 
filename1 (varchar2, optional) 
filename2 (varchar2, optional) 
filename3 (varchar2, optional) 
debug (number, optional) 
  

Example

    mail_files( from_name => 'oracle' ,
                to_name   => 'someone@somewhere.com' ,
                subject   => 'A test',
                message   => 'A test message',
                filename1 => '/data/oracle/dave_test1.txt',
                filename2 => '/data/oracle/dave_test2.txt');

source:


14.6 Organization External tables:
==================================


Example 1:
----------

If you have a file "x.txt"like:

B000Albert
B001Basil
B002Caesar
B003Darius


create table ext_table (
   field_1 char(4),
   field_2 char(30)
 )
 organization external (
   type       oracle_loader
   default directory ext_dir
   access parameters (
   records delimited by newline
   fields (
   field_1 position(1:4) char(4),
   field_2 position(5:30) char(30)
     )
   )
   location ('x.txt')
 )
 reject limit unlimited;


If you have a file "y.txt" like:

1,one,first
2,two,second
3,three,third
4,four,fourth


create table ext_table (
  i   Number,
  n   Varchar2(20),
  m   Varchar2(20)
)
organization external (
  type              oracle_loader
  default directory ext_dir
  access parameters (
    records delimited  by newline
    fields  terminated by ','
    missing field values are null
  )
  location ('y.txt')
)
reject limit unlimited;

Example 2:
----------

CREATE TABLE IOB_MTC_IMPORT_PMP
(
type         varchar2(2),
nummer       varchar2(10),
naam         varchar2(64),
straat       varchar2(64),
huisnummer   varchar2(16),
toevoeging   varchar2(16),
plaats       varchar2(64),
landcode     varchar2(16)
)
organization external
(
type oracle_loader
default directory IMP_MTC
access parameters (
records delimited by newline
fields terminated by ';'
missing field values are null
)
location ('MTC.pmp')
)
reject limit unlimited;

Example 3:
----------

create table external_emp (
EMPNO NUMBER(4),
ENAME VARCHAR2(10),
JOB VARCHAR2(9),
MGR NUMBER(4),
HIREDATE DATE,
SAL NUMBER(7,2),
COMM NUMBER(7,2),
DEPTNO NUMBER(2))
Organization external
(type oracle_loader
default directory BLAH
access parameters (records delimited by newline
fields terminated by �,�)
location (�extemp.txt�))
reject limit 1000;

Example 4:
----------

create table IOB_BIOS_IMPORT
(
eenveld varchar(178)
)
organization external
(type oracle_loader
default directory IMP_BIOS
access parameters (
records delimited by newline
fields terminated by '*'
missing field values are null)
location ('bios.txt')
)
reject limit unlimited;	


Example 5:
----------

CREATE TABLE emp_external (
   employee_id    NUMBER(6),
   last_name      VARCHAR2(20),
   email          VARCHAR2(25),
   hire_date      DATE,
   job_id         VARCHAR2(10),
   salary         NUMBER(8,2)
)
ORGANIZATION EXTERNAL
(TYPE oracle_loader
 DEFAULT DIRECTORY admin
 ACCESS PARAMETERS
 (
  RECORDS DELIMITED BY newline
  BADFILE 'ulcase1.bad'
  DISCARDFILE 'ulcase1.dis'
  LOGFILE 'ulcase1.log'
  SKIP 20
  FIELDS TERMINATED BY ","  OPTIONALLY ENCLOSED BY '"'
  (
   deptno     INTEGER EXTERNAL,
   dname      CHAR,
   loc        CHAR
  )
 )
 LOCATION ('ulcase1.dat')
)
REJECT LIMIT UNLIMITED;


REMARKS:
--------

REJECT LIMIT:

reject limit specifies the number of rows that can be rejected before 
the command returns an error. If this threshold is reached, the following error 
appears when trying to access the table: 

ERROR at line 1:
ORA-29913: error in executing ODCIEXTTABLEFETCH callout
ORA-30653: reject limit reached
ORA-06512: at "SYS.ORACLE_LOADER", line 14
ORA-06512: at line 1

LOGFILE | NOLOGFILE:

The LOGFILE clause names the file that contains messages generated by the external tables utility 
while it was accessing data in the datafile. If a log file already exists by the same name, 
the access driver reopens that log file and appends new log information to the end. 
This is different from bad files and discard files, which overwrite any existing file. 
NOLOGFILE is used to prevent creation of a log file. 

The DELIMITED BY 

The DELIMITED BY clause is used to indicate the characters that identify the end of a record. 
If DELIMITED BY NEWLINE is specified, then the actual value used is platform-specific. On UNIX platforms, 
NEWLINE is assumed to be "\n". On Windows NT, NEWLINE is assumed to be "\r\n". 
If DELIMITED BY string is specified, string can either be text or a series of hexadecimal digits.
 If it is text, then the text is converted to the character set of the datafile and the result is used 
for identifying record boundaries. See string. 

BADFILE | NOBADFILE

The BADFILE clause names the file to which records are written when they cannot be loaded because of errors. 
For example, a record was written to the bad file because a field in the data source could not be converted 
to the datatype of a column in the external table. Records that fail the LOAD WHEN clause are not written to the 
bad file but are written to the discard file instead. Also, any errors in using a record from an external table 
(such as a constraint violation when using INSERT INTO...AS SELECT... from an external table) will not cause 
the record to be written into the bad file. 
The purpose of the bad file is to have one file where all rejected data can be examined and fixed 
so that it can be loaded. If you do not intend to fix the data, then you can use the NOBADFILE option to prevent 
creation of a bad file, even if there are bad records. 


14.7 SPECIAL CASES, ERRORS:
===========================

Example 1:
----------

You Asked (Jump to Tom's latest followup)

In my database table, there is one field for the employee name in the order of 
(First_name,Last_name,Middle_initial).
Example (Brian,Robbin,D).
I want to separate each of these words and put it into a new table 
with three fields (First_name, Last_name, Middle_name).

I want to write a PL/SQL , so for all the employees, it would be done at a time. 

 
and we said...


instr and substr is all you need.  don't use PLSQL:

variable x varchar2(25)

exec :x := 'Brian,Robbin,D'
select substr( :x||',', 1, instr(:x,',')-1 ) first_name,
       substr( :x||',,', instr( :x||',,', ',') +1, 
                         instr( :x||',,', ',', 1, 2 )-instr(:x||',,',',')-1 ) 
last_name,
       rtrim(substr( :x||',,', instr( :x||',,',',',1,2)+1),',') middle_init
 from dual
/


External table errors:
----------------------

1.

CREATE TABLE temp_student
 (
   STU_KEY NUMBER(10) ,
   NAME   VARCHAR2(32 ),
   STU_ID CHAR(9 ), 
   BIRTH_DATE DATE,
   SEX CHAR(1 )  
  )  
  ORGANIZATION EXTERNAL
     (TYPE oracle_loader
      DEFAULT DIRECTORY student_database
      ACCESS PARAMETERS
       (
        RECORDS DELIMITED BY newline
        BADFILE 'd:\oracle\migration\student_bad.txt'
        DISCARDFILE 'd:\oracle\migration\student_discard.txt'
        LOGFILE 'd:\oracle\migration\student_log.txt'
       )
   LOCATION ('d:\oracle\student_database\dwstubio')   
     );


SQL> select count(*) from temp_student;
select count(*) from temp_student
*
ERROR at line 1:
ORA-29913: error in executing ODCIEXTTABLEOPEN callout
ORA-29400: data cartridge error
KUP-00554: error encountered while parsing input commands
KUP-01006: error signalled during parse
KUP-00562: unknown escape sequence
ORA-06512: at "SYS.ORACLE_LOADER", line 14
ORA-06512: at line 1


From Metalink :

    * symptom: SELECT from an EXTERNAL TABLE fails
    * symptom: ORA-29913: error in executing ODCIEXTTABLEOPEN callout
    * symptom: ORA-29400: data cartridge error
    * symptom: KUP-00554: error encountered while parsing access parameters
    * symptom: KUP-01006: error signalled during parse of access parameters
    * symptom: KUP-00562: unknown escape sequence
    * symptom: ORA-06512: at "SYS.ORACLE_LOADER", line 14
    * symptom: ORA-06512: at line 1
    * cause: The location of the badfile or logfile or discardfile contains 
    the full path and therefore the \ character, considered as an escape character. 

fix:

Do not enter in the ACCESS PARAMETERS clause, the full path of the file names 
in the CREATE TABLE ORGANIZATION EXTERNAL statement.

2.


fact: Oracle Server - Enterprise Edition 9.0.1
symptom: Select from external table fails after creation
symptom: ORA-29913: error in executing ODCIEXTTABLEFETCH callout
symptom: ORA-30653: reject limit reached
symptom: ORA-06512: at "SYS.ORACLE_LOADER", line 14
symptom: ORA-06512: at line 1
cause: Wrong data in the flat file of the External Table or invalid 
delimiter
specification in the create table statement


fix:


A. Check the end of the file to be 'loaded' specified by 'LOCATION' 
-------------------------------------------------------------------
 This file should not contain an extra 'empty line'

3.

Hi,

I have a PL/SQL script which outputs a data file with the following format:

0000011920200404201 <<<header record>>>
0000511920005038775000210000000500000000000000000000000000000000000000000000000000000000000GBPGB<<<data>>>
.etc
.etc
.etc
000099000004737000000000000000000000000000000000000000000000000000000000000000000000000000000000<<<trailer_record>>>
<<<newline>>>

using UTL_FILE.PUT_LINE to write each record to the output file and UTL_FILE.FCLOSE to close it. 
What i need to know is is there a way using PL/SQL to either:
1. stop the newline from being created at the end of the file, or 
2. to remove the newline once it has been created?

There is a checksum in the trailer record which is thrown out by the inclusion in the data file of the newline, so i have to get rid of it.

Any help on this matter would be greatly appreciated.

thanks in advance!


4.

> I have finally competed setting up the samba server and setup the share
> between NT and Samba server.
> 
> However, when I open a unix text file in Windows NT using notepad, i see
> many funny characters and the text file is not in order (Just like when I
> ftp the unix text file out into NT in binary format) ...I think this has to
> be something to do with whether the file transfer is in Binary format or
> ASCII ... Is there a parameter to set for this ? I have checked the
> documents ... but couldn't find anything on this ...
> 

This is a FAQ, but it brief, it's like this. Unix uses a single newline
character to end a line ("\n"), while DOS/Win/NT use a
carriage-return/newline pair ("\r\n"). FTP in ASCII mode translates
these for you. FTP in binary mode, or other forms of file transfer, such
as Samba, leave the file unaltered. Doing so would be extremely
dangerous, as there's no clear way to isolate which files should be
translated

You can get Windows editors that understand Unix line-end conventions
(Ultra Edit is one), or you can use DOS line endings on the files, which
will then look odd from the Unix side. You can stop using notepad, and
use Wordpad instead, which will deal appropriately with Unix line
endings.

You can convert a DOS format text file to Unix with this:-

tr -d '\r' < dosfile.txt > unixfile.txt

The best solution to this seems to be using a Windows editor that can
handle working with Unix line endings.

HTH

Mike.

Note:

There are two ways of moving to a new line...carriage return, which is chr(13), 
and new line which is chr(10).  In windows you're supposed to use a sequence 
of a carriage return followed by a new line.  
For example, in VB you can use Wrap$=Chr$(13)+Chr$(10)  which creates a wrap character.


5.

ORA-29913: error in executing ODCIEXTTABLEOPEN callout
ORA-29400: data cartridge error
KUP-00554: error encountered while parsing access parameters
KUP-01005: syntax error: found "badfile": expecting one of: "enclosed, exit, 
(, ltrim, lrtrim, ldrtrim, missing, notrim, optionally, rtrim, reject"
KUP-01007: at line 3 column 1
ORA-06512: at "SYS.ORACLE_LOADER", line 14
ORA-06512: at line 1

6.

CREATE TABLE temp_student
 (
   STU_KEY NUMBER(10) ,
   NAME   VARCHAR2(32 ),
   STU_ID CHAR(9 ), 
   BIRTH_DATE DATE,
   SEX CHAR(1 )  
  )  
  ORGANIZATION EXTERNAL
     (TYPE oracle_loader
      DEFAULT DIRECTORY student_database
      ACCESS PARAMETERS
       (
        RECORDS DELIMITED BY newline
        BADFILE 'd:\oracle\migration\student_bad.txt'
        DISCARDFILE 'd:\oracle\migration\student_discard.txt'
        LOGFILE 'd:\oracle\migration\student_log.txt'
       )
   LOCATION ('d:\oracle\student_database\dwstubio')   
     );

CREATE table employees_ext (
employee_id NUMBER(5),
first_name varchar2(30),
last_name varchar2(30),
email varchar2(30)
)
ORGANIZATION EXTERNAL -- external table
(
TYPE oracle_loader -- dit is de Access Driver
DEFAULT DIRECTORY demo_dir -- Files Directory
ACCESS PARAMETERS -- Lijkt op SQL*Loader
(
RECORDS DELIMITED BY NEWLINE
BADFILE 'bad_sample'
LOGFILE 'log_sample'
FIELDS TERMINATED BY ','
MISSING FIELD VALUES ARE NULL
)
LOCATION ('Employees.csv')
)
PARALLEL 1 -- Onafhankelijk van een aantal files
REJECT LIMIT UNLIMITED;


Difficult files to load:
========================

Example 1:
----------

You Asked (Jump to Tom's latest followup)

Dear Tom,
Hope you are fine.
I am facing a problem using SQL Loader.
I have a text file and I have to take thses value into table.
I have also a table consists of 2 fields name and details.
Here I give some content of that text file:
LANGUILLI
     Come on, girl-san.  Tell me how you say that
     one more time.

BIENSTOCK
     Hey, why don't you ask her how to say
     'jailbreak'...in Vietnamese?

                     
 and so many lines like these...
Here I want to insert these text into that table.
As example LANGUILLI will insert into 'name' field and
the remain text which is not fixed length will insert into 'details' field.

There is no terminated symbol here except 'Enter' and 'Double Enter'. How I will 
set terminated point in sql loader?


and we said...

You will load into a "temp" staging table and use a procedure to reformat the 
data.

Suppose you have:

drop table t;
create table t ( name varchar2(30), text clob );

drop table temp;
create table temp ( seqno int primary key, text varchar2(4000) )
organization index overflow tablespace users;

T is where you ultimately want the data. You can write a simple procedure:

create or replace procedure reformat
as
    l_name varchar2(80);
    l_text varchar2(32000);
begin
    for x in ( select * from temp order by seqno )
    loop
        if ( x.text = chr(10) )
        then
            null; -- skip it
        elsif ( substr( x.text, 1, 1 ) <> ' ' )
        then
            if ( l_name is not null )
            then
                insert into t values ( l_name, l_text );
            end if;
            l_name := x.text;
            l_text := null;
        else
            l_text := l_text || x.text || chr(10);
        end if;
    end loop;
    insert into t values ( l_name, l_text );
end reformat;
/


Example 2:
----------

You Asked (Jump to Tom's latest followup)


Hi Tom,
What i am trying to do is load in bank transactions ( downloaded in a comma 
delimited format from the bank ) into my database.  My approach is to create an 
external table from the file and then create a regular table from the external 
one. then the data can be manipulated etc.
the problem i am having is that the .csv file has a date format that the insert 
( from external_table to table t ) is failing on.
  
an example line from my .csv file looks like:
2/24/2003,-40,*,,"ATM WITHDRAWAL "

my external table definition looks like:
create table external_table ( 
    trandate date,
    amount number(10,2),
    ignore1 char(1),
    ignore2 char(1) null,
    descr varchar2(4000) 
)
organization external (
    type oracle_loader
    default directory filebackup
    access parameters
        (     fields terminated by ','
            optionally enclosed by '"'
            missing field values are null )
    location ('Checking.csv') 
) ;

When i load i get this error:
ERROR at line 1:
ORA-29913: error in executing ODCIEXTTABLEFETCH callout
ORA-30653: reject limit reached
ORA-06512: at "SYS.ORACLE_LOADER", line 14
ORA-06512: at line 1

error processing column TRANDATE in row 1 for datafile D:\sql\Checking.csv
ORA-01843: not a valid month

So it appears it is chocking on the date format. I tried to_date on the insert 
but that didn't work. That date in the .csv file can either be in M:DD:YYYY or 
MM:DD:YYYY format. 
How best is this solved ?


and we said...

Use a date format in the CREATE TABLE.  Here is one I've used:

create table big_table_external
( OWNER              VARCHAR2(30),
  OBJECT_NAME        VARCHAR2(30),
  SUBOBJECT_NAME     VARCHAR2(30),
  OBJECT_ID          NUMBER,
  DATA_OBJECT_ID     NUMBER,
  OBJECT_TYPE        VARCHAR2(18),
  CREATED            DATE,
  LAST_DDL_TIME      DATE,
  TIMESTAMP          VARCHAR2(19),
  STATUS             VARCHAR2(7),
  TEMPORARY          VARCHAR2(1),
  GENERATED          VARCHAR2(1),
  SECONDARY          VARCHAR2(1)
)
ORGANIZATION EXTERNAL
( type oracle_loader
  default directory data_dir
  access parameters
  (
    records delimited by newline skip 21
    fields terminated by '|'
    missing field values are null
    ( owner ,object_name ,subobject_name ,object_id ,data_object_id ,object_type
      ,created date 'dd-mon-yy' ,last_ddl_time date 'dd-mon-yy'
      ,"TIMESTAMP" ,status ,temporary ,generated ,secondary
    )
  )
    location ('big_table.dat')
  )
/

for example -- your date format mask would be mm:dd:yyyy which would handle 
both.

ALTERNATIVELY

you could have trandate be a VARCHAR -- just a string.  And then you can use 
to_date() on the string in your SQL.


I would use the create table approach however, it'll put bad records to the .bad 
file and not fail your SQL statement when you encounter a bad date (if you set 
rejects on the create table that is...)


Example 3:
----------

Suppose we have a table of longstring (continous characters) in 1 column.
It must be converted to a table with 3 columns.
The hidden columns in the longstring are separated with 1 or more spaces.


CREATE SEQUENCE SEQ_SOURCE
  INCREMENT BY 1
  START WITH 1
  MAXVALUE 9999999
  NOCYCLE;

create table SOURCE
(
id number(10) not null,
longrecord varchar2(128));

CREATE OR REPLACE TRIGGER tr_source
BEFORE INSERT ON SOURCE FOR EACH ROW
BEGIN
	SELECT seq_source.NEXTVAL INTO :NEW.id FROM dual;
END;
/

insert into SOURCE (longrecord) values ('ikgadeze   week   naarhuis hoera hoera');
insert into SOURCE (longrecord) values ('aaa   bbbb cc ddd eeee');
insert into SOURCE (longrecord) values ('ddddd   eee   ff gggg');
insert into SOURCE (longrecord) values ('ggggg hh ii jjjjj');
insert into SOURCE (longrecord) values ('a b c d e');


create table DEST
(
col1 varchar2(64),
col2 varchar2(64),
col3 varchar2(64),
col4 varchar2(64),
col5 varchar2(64));

-- -----------------------------------------------------
--create or replace procedure welltryit
--as
--Use INSTR like INSTR(string1,string2,start_position,nth_appearance)
declare

pos1  number;
pos2  number;
pos3  number;
pos4  number;
pos5  number;

str1  varchar2(256);
str2  varchar2(256);
str3  varchar2(256);
str4  varchar2(256);
str5  varchar2(256);

col1  varchar2(64);
col2  varchar2(64);
col3  varchar2(64);
col4  varchar2(64);
col5  varchar2(64);

lr1   number;
lr2   number;
lr3   number;
lr4   number;
lr5   number;

cursor CUR IS
SELECT * FROM SOURCE;

cur_rec cur%rowtype;


begin

for cur_rec IN cur loop

col1:=null;
col2:=null;
col3:=null;
col4:=null;
col5:=null;


  str1:=cur_rec.longrecord;
  lr1 :=length(str1);
  pos1:=instr(str1,' ',1);

  if pos1 =0 then
     col1:=str1;
  else
    col1:=substr(str1,1,pos1);
    str2:=LTRIM(substr(str1,pos1,lr1));
    lr2:=length(str2);
    pos2:=instr(str2,' ',1);
  
    if pos2=0 then
       col2:=str2;
    else
       col2:=substr(str2,1,pos2);
       str3:=LTRIM(substr(str2,pos2,lr1));
       lr3:=length(str3);
       pos3:=instr(str3,' ',1);

       if pos3=0 then
          col3:=str3;
       else
          col3:=substr(str3,1,pos3);
          str4:=LTRIM(substr(str3,pos3,lr1));
          lr4:=length(str4);
          pos4:=instr(str4,' ',1);
 
          if pos4=0 then
             col4:=str4;
          else
             col4:=substr(str4,1,pos4);
             str5:=LTRIM(substr(str4,pos4,lr1));
             lr5:=length(str5);
             pos5:=instr(str5,' ',1);
             col5:=str5;
          end if;

       end if;
    end if;
  end if;


  insert into DEST
  values
  (col1,col2,col3,col4,col5);

end loop;
end;
/


-- -----------------------------------------------------

Other stuff:
------------

Some special uses: To show all fields with an unprintable character:

select distinct ascii(NOTES_ACCOUNT_NME)
   from CIN1_PDF_EDS where length(NOTES_ACCOUNT_NME) = 1

SELECT NAME, NOTES_ACCOUNT_NME FROM CIN1_PDF_EDS 
where NOTES_ACCOUNT_NME = chr(13) or NOTES_ACCOUNT_NME = chr(10)
ORDER BY NAME

You can use translate function to remove/replace multiply occurences of 'undesired' symbols like

SELECT NAME, NOTES_ACCOUNT_NME
  FROM CIN1_PDF_EDS
where NOTES_ACCOUNT_NME is not Null
    and length(trim(translate(NOTES_ACCOUNT_NME,chr(10)||chr(13),' ')))>0
ORDER BY NAME;


========================
15. REF CURSOR EXAMPLES:
========================

By showing some instructive examples, the use of REF CURSOR should become clear.


Example 1:
----------

variable result_set refcursor
 
begin
open :result_set for
    select a.ename, b.dname
    from ( select * from emp ) a,
    ( select * from dept ) b
     where a.deptno = b.deptno;
end;
/

PL/SQL procedure successfully completed.

print result_set

ENAME      DNAME
---------- --------------
SMITH      RESEARCH
ALLEN      SALES
WARD       SALES
JONES      RESEARCH
MARTIN     SALES
BLAKE      SALES
CLARK      ACCOUNTING
SCOTT      RESEARCH
KING       ACCOUNTING
TURNER     SALES
ADAMS      RESEARCH
JAMES      SALES
FORD       RESEARCH
miller     ACCOUNTING


Example 2:
----------

(from asktom.oracle.com)

You Asked ................

Tom,

I 've written a stored procedure which uses ref cursor. Following is how I 've 
written it :-

create or replace package pkg_dept
AS
  type rc_dept is ref cursor;
end;
/

create or replace
    procedure sp_dept( t_deptno IN NUMBER, t_job  IN varchar,
                    dept_cur in out pkg_dept.rc_dept )
is
begin
              
    OPEN dept_cur for
    select ename,sal,hiredate
    from emp
    where deptno=t_deptno
    and job=t_job;
             
end;
/

From Sql*plus I can call the stored procedure and get results using :-

var c refcursor;
execute sp_dept(10,'CLERK',:c);
print c

1. But how do I do it using a small pl/sql block ie. how do I put 
var c refcursor (Step 1)
execute sp_dept(100,'CLERK',:c); (Step 2)
print c  (Step 3)
the above 3 steps into a pl/sql block and get the output, I tried a couple of 
ways but it didn't work.

2. Is the above method of opening a ref cursor and executing the results a good 
method ? In my application I will be calling the stored procedure from a jsp 
page, I ran the jsp page and it works fine in retrieving the results, But I am  
wondering if the method I 've adopted in opening the cursor a good one,because I 
dont want to end up into any problems in the future. Also could you mention some 
EXCEPTIONS that I could use in the stored procedure. 

3. I would like to know if I am using bind variables in the stored procedure. I 
am rewriting a sql query in to stored procedure for it to use bind variables to 
improve performance. So I wanted to confirm that this procedure uses bind 
variables.


Thanks in Advance,
TS
----------------------------------------------------------------


and we said.....................

1) you would have to explicitly fetch and print the results:

declare
   c pkg_dept.rc_dept;
   l_ename emp.ename%type;
   l_sal   emp.salary%type;
   l_join_date emp.join_date%type;
begin
   sp_dept( 100, 'CLERK', c );
   loop
       fetch c into l_ename, l_sal, l_join_date;
       exit when c%notfound;
        ....
   end loop;
   close c;


2) I firmly believe the best java programs are those that have ZERO 
"selects/inserts/updates/deletes" in them.  Hence, using ref cursors is the way 
to go.  Lets you tune without bothering those java programmers.

You use the exception you need to -- there is no "list" of ones you should use.  
You use what you need?

3) yes you are.  T_DEPTNO and T_DESIGNATION are bind variables.  Now, just make 
sure the CALLER (java programmer) uses bind variables when calling your 
procedure!!!!


Example 3:
----------

TYPE var_cur_type IS REF CURSOR;

You can pass a cursor variable to PL/SQL by calling a stored procedure that declares 
a cursor variable as one of its formal parameters. To centralize data retrieval, you can group 
type-compatible queries in a packaged procedure, as the following example shows:


CREATE PACKAGE emp_data AS

   TYPE EmpCurTyp IS REF CURSOR RETURN emp%ROWTYPE;
   PROCEDURE open_emp_cv (emp_cv IN OUT EmpCurTyp, choice IN NUMBER);

END emp_data;


CREATE PACKAGE BODY emp_data AS

PROCEDURE open_emp_cv (emp_cv IN OUT EmpCurTyp, 
                          choice IN NUMBER) IS
BEGIN
      IF choice = 1 THEN
         OPEN emp_cv FOR SELECT * FROM emp WHERE comm IS NOT NULL;
      ELSIF choice = 2 THEN
         OPEN emp_cv FOR SELECT * FROM emp WHERE sal > 2500;
      ELSIF choice = 3 THEN
         OPEN emp_cv FOR SELECT * FROM emp WHERE deptno = 20;
      END IF;
END open_emp_cv;

END emp_data;


To run this from SQL*Plus:

var c refcursor;
execute emp_data.open_emp_cv(:c,1);
print c;

SQL> var c refcursor;
SQL> execute emp_data.open_emp_cv(:c,1);

PL/SQL procedure successfully completed.

SQL> print c;

     EMPNO ENAME      JOB              MGR HIREDATE         SAL       COMM     DEPTNO
---------- ---------- --------- ---------- --------- ---------- ---------- ----------
      7369 SMITH      CLERK           7902 17-DEC-80        800        100         20
      7499 ALLEN      SALESMAN        7698 20-FEB-81       1600        300         30
      7521 WARD       SALESMAN        7698 22-FEB-81       1250        500         30
      7654 MARTIN     SALESMAN        7698 28-SEP-81       1250       1400         30
      7844 TURNER     SALESMAN        7698 08-SEP-81       1500          0         30


Alternatively, you can use a standalone procedure to open the cursor variable. 
Simply define the REF CURSOR type in a separate package, then reference that type 
in the standalone procedure. 
For instance, if you create the following (bodiless) package, you can create standalone procedures 
that reference the types it defines:

CREATE PACKAGE cv_types AS
   TYPE EmpCurTyp IS REF CURSOR RETURN emp%ROWTYPE;
   TYPE DeptCurTyp IS REF CURSOR RETURN dept%ROWTYPE;
   TYPE BonusCurTyp IS REF CURSOR RETURN bonus%ROWTYPE;
   ...
END cv_types;


Example 3:
----------

(from asktom.oracle.com)


You Asked.................

How do I return the values from a PL/SQL table
(indexed by BINARY_INTEGER) into a ref cursor?

The contents of the PL/SQL table are NOT returnable  by a single 
SQL statement. AS it is a PL/SQL table ; I can't do a standard 
select (This doesn't work -open TunnelCrs for 
                           'select gw1, gw2 from a') 

Currently, to create the ref. cursor am currently doing thefollowing: 
   open TunnelCrs for 
   'select '||to_char(a(1).gw1)||','|| 
   to_char(a(1).gw1)||' from dual';
 
If there are multiple rows ; I am using an 'union all' . 


The following is my type and PL/SQL table definitions:

declare
TYPE gw_ttn is record (
        gw_id1 INTEGER,
        gw_id2 INTEGER
);

TYPE gw_tn is table of gw_ttn index by binary_integer;
TYPE TunnelCursor IS REF CURSOR;

a       gw_tn;


Is there a more elegant solution?  


and we said................

the proper way to do this is to NOT use a PLSQL table type but to use a SQL 
Object Type instead.  It would look like this:

-- no declare

create or replace type myType as object
(   x    int,
    y    date,
    z    varchar2(25)
);
/

Type created.


create or replace type myTableType as table of myType;
/

Type created.

create or replace
    function demo_proc( p_start_row in number,
                        p_end_row in number )
    return myTableType
as
l_data             myTableType := myTableType();
l_cnt              number default 0;

begin
      for x in ( select * from emp order by sal desc )
      loop
          l_cnt := l_cnt + 1;
          if ( l_cnt >= p_start_row )
          then
              l_data.extend;
              l_data(l_data.count) :=
                      myType( x.empno,
                              x.hiredate,
                              x.ename );
          end if;
          exit when l_cnt = p_end_row;
      end loop;
  
      return l_data;
end;
/

Function created.

select *  from the ( select cast( demo_proc(2,6) as mytableType )
from dual ) a
/

         X Y         Z
---------- --------- -------------------------
      7788 09-DEC-82 SCOTT
      7902 03-DEC-81 FORD
      7566 02-APR-81 JONES
      7698 01-MAY-81 BLAKE
      7782 09-JUN-81 CLARK

tkyte@OSI1.WORLD> 


So, I am recommending you use a SQL type -- not a plsql table type (they work 
very much the same with the notable exception that the SQL Nested table demands 
you use .EXTEND to allocate space whereas the plsql table type just "makes room" 
as needed.

By using the SQL Type, you can select from the table easily.  Your ref cursor 
example would be:


create or replace package my_pkg
as
    type rc is ref cursor;

    procedure p( p_cursor in out rc );
end;
/


create or replace package BrainsCursor
as
    type Brains_cur is ref cursor;

end;
/


create or replace package body my_pkg
as
 
procedure p( P_cursor in out rc )
is
      l_data  myTableType := myTableType();
begin
      for i in 1 .. 3 loop
          l_data.extend;
          l_data(i) :=
             myScalarType( i, sysdate+i, i || ' data');
      end loop;
  
      open p_cursor for
      select *
        from TABLE ( cast ( l_data as myTableType) );
end;
 
end;
/

Package body created.

tkyte@OSI1.WORLD> set autoprint on
tkyte@OSI1.WORLD> variable x refcursor
tkyte@OSI1.WORLD> exec my_pkg.p(:x)

PL/SQL procedure successfully completed.


         X Y         Z
---------- --------- -------------------------
         1 27-MAY-00 1 data
         2 28-MAY-00 2 data
         3 29-MAY-00 3 data


Example 4:
----------

Step 1 - Table Definition
First, we need a table created in Oracle called "wine".  Below is the create statement for the wine table.

create table wine
( col1 varchar2(40),
  col2 varchar2(40),
  col3 varchar2(40)
);

We've made this table definition very simple, for demonstration purposes.
 

Step 2 - Create package
Next, we've created a package called "winepkg" that contains our cursor definition.  
This needs to be done so that we can use a cursor as an output parameter in our stored procedure.

create or replace PACKAGE winepkg
IS
   /* Define the REF CURSOR type. */
   TYPE wine_type IS REF CURSOR RETURN wine%ROWTYPE;
END winepkg;

This cursor will accept all fields from the "wine" table.

Step 3 - Create stored procedure
Our final step is to create a stored procedure to return the cursor.  
It accepts three parameters (entered by the user on the HTML Form) and returns 
a cursor (c1) of type "wine_type" which was declared in Step 2.

The procedure will determine the appropriate cursor to return, based on the value(s) that 
have been entered by the user (input parameters).

create or replace procedure find_wine2
  (col1_in in varchar2,
   col2_in in varchar2,
   col3_in in varchar2,
   c1 out winepkg.wine_type)
as

BEGIN

   /* all columns were entered */
   IF (length(col1_in) > 0) and (length(col2_in) > 0) and (length(col3_in) > 0)
   THEN
      OPEN c1 FOR
      select *
      from wine
      where wine.col1 = col1_in
      and  wine.col2 = col2_in
      and  wine.col3 = col3_in;

   /* col1 and col2 were entered */
   ELSIF (length(col1_in) > 0) and (length(col2_in) > 0) and (length(col3_in) = 0)
   THEN
      OPEN c1 FOR
      select *
      from wine
      where wine.col1 = col1_in
      and  wine.col2 = col2_in;


   /* col1 and col3 were entered */
   ELSIF (length(col1_in) > 0) and (length(col2_in) = 0) and (length(col3_in) > 0)
   THEN
      OPEN c1 FOR
      select *
      from wine
      where wine.col1 = col1_in
      and  wine.col3 = col3_in;

   /* col2 and col3 where entered */
   ELSIF (length(col1_in) = 0) and (length(col2_in) > 0) and (length(col3_in) > 0)
   THEN
      OPEN c1 FOR
      select *
      from wine
      where wine.col2 = col2_in
      and  wine.col3 = col3_in;


   /* col1 was entered */
   ELSIF (length(col1_in) > 0) and (length(col2_in) = 0) and (length(col3_in) = 0)
   THEN
      OPEN c1 FOR
      select *
      from wine
      where wine.col1 = col1_in;


   /* col2 was entered */
   ELSIF (length(col1_in) = 0) and (length(col2_in) > 0) and (length(col3_in) = 0)
   THEN
      OPEN c1 FOR
      select *
      from wine
      where wine.col2 = col2_in;


   /* col3 was entered */
   ELSIF (length(col1_in) = 0) and (length(col2_in) = 0) and (length(col3_in) > 0)
   THEN
      OPEN c1 FOR
      select *
      from wine
      where wine.col3 = col3_in;

   END IF;

END find_wine2
 

Example 5:
----------

FETCH a REF cursor:

  FETCH {cursor_name | :host_cursor_variable_name}
     INTO {variable1[, variable2,...] | record_name};

The variables must match (both in number and positionally) the 
columns listed in the REF cursor OPEN statement.
Also the data types must either match or be compatible.

A fetch statement retrieves rows one at a time from
the result set of a multi-row query - in other words it
advances the cursor to the next row.

CLOSE a REF cursor:

  CLOSE {cursor_name | :host_cursor_variable_name};

Closing a cursor releases the context area. 

REF Cursor Attributes:

cursor%ROWCOUNT  - int - number of rows affected by last SQL statement
cursor%FOUND    - bool - TRUE if >1 row returned
cursor%NOTFOUND - bool - TRUE if 0 rows returned
cursor%ISOPEN   - bool - TRUE if cursor still open 

Typically the REF CURSOR definition and the
OPEN FOR SELECT will be in a packaged procedure on the server

A client-side application will then call the procedure
- thus obtaining a valid open cursor with the correct SQL
The client-side application will then perform further
processing.. FETCH into variables etc

Note that the cursor variable must be the same TYPE
for both the packaged procedure on the server and
in the DECLARE section of the client-side application.

The way to be sure of this is to declare the TYPE in a
PACKAGE

Example:

CREATE PACKAGE my_cursor_types AS

   TYPE MyCursor IS REF CURSOR;
   ...
END my_cursor_types;


CREATE PROCEDURE GetCarter ( proc_cv IN OUT my_cursor_types.MyCursor,
                             emp_name VARCHAR2(50) )
   ...

Then the client-side application code would start like

DECLARE
    local_cv        my_cursor_types.MyCursor;
    carter_record   carter%ROWTYPE
BEGIN
   GetCarter(local_cv,:employee)    -- employee is a host variable
   FETCH local_cv INTO carter_record;
   ...

Example 6:
----------

This example shows how to use JDBC with VARRAYS and REF CURSORs. 
It also shows use of the PreparedStatement and CallableStatement methods.
The example does the follows:

1. selects from a table of VARRAYs
2. inserts into a table of VARRAYs
3. selects from a table of VARRAYs
4. calls stored procedure -- parameters <ref cursor, varray>

In order to test it, you will need to do two things first:
1) Create related tables and types first. The screipt is given below.
2) Create a package that gets called from JAVA code. The script is given below.

======================= Step 1 create tables etc. cute here ==================
-- Run this through SQL*PLUS

drop TABLE varray_table;
drop TYPE num_varray;
drop TABLE sec;
-- create the type
create TYPE num_varray as VARRAY(10) OF NUMBER(12, 2);
-- create the table
create TABLE varray_table (col1 num_varray);
-- create the sec table
create table sec (sec_id number(8) not null, sec_grp_id number(8) not null,
company_id number(8) not null);
insert into sec values (1,200,11);
insert into sec values (2,1100,22);
insert into sec values (3,1300,33);
insert into sec values (4,1800,44);

==================== End of step 1===========================================

================== Step 2 create package ====================================

-- Run it through sql*plus

CREATE OR REPLACE PACKAGE packageA AS
type sctype is ref cursor return SEC%ROWTYPE;
procedure get_port_consensus(sc IN OUT sctype, arr IN num_varray);
procedure test_port_consensus(sc IN OUT sctype);
END packageA;
/

CREATE OR REPLACE PACKAGE BODY packageA AS
procedure test_port_consensus(sc IN OUT sctype)
IS
testArr num_varray := num_varray(200, 1100, 1300, 1800);
BEGIN
get_port_consensus(sc, testArr);
END test_port_consensus;

procedure get_port_consensus(sc IN OUT sctype, arr IN num_varray)
IS
BEGIN
open sc for select * from sec
where sec_grp_id = arr(1) 
or sec_grp_id = arr(2) 
or sec_grp_id = arr(3) 
or sec_grp_id = arr(4);
END get_port_consensus;
END packageA;
/

===================== End of step 2 ===================================

============ JAVA code to test the whole thing ========================

import java.sql.*;
import oracle.sql.*;
import oracle.jdbc.oracore.Util;
import oracle.jdbc.driver.*;
import java.math.BigDecimal;

public class ArrayExample
{
public static void main (String args<>)
DriverManager.registerDriver(new oracle.jdbc.driver.OracleDriver());

// Connect to the database
// You need to put your database name after the @ sign in
// the connection URL.
//
// The example retrieves an varray of type "NUM_VARRAY",
// materializes the object as an object of type ARRAY.
// A new ARRAY is then inserted into the database.

Connection conn =
DriverManager.getConnection ("jdbcracleci8:@v81",
"scott", "tiger");

// It's faster when auto commit is off
conn.setAutoCommit (false);

// Create a Statement
Statement stmt = conn.createStatement ();

System.out.println("Querying varray_table");
ResultSet rs = stmt.executeQuery("SELECT * FROM varray_table");
showResultSet (rs);

// now insert a new row
// create a new ARRAY object
int elements<> = { 200, 1100, 1300, 1800 };
ArrayDescriptor desc = ArrayDescriptor.createDescriptor("NUM_VARRAY",conn);
ARRAY newArray = new ARRAY(desc, conn, elements);

// prepare statement to be inserted and bind the num_varray type
System.out.println("PreparedStatement: Inserting into varray_table");
PreparedStatement ps =
conn.prepareStatement ("insert into varray_table values (?)");
((OraclePreparedStatement)ps).setARRAY (1, newArray);
ps.execute ();

// query to view our newly inserted row
System.out.println("Querying varray_table again");
rs = stmt.executeQuery("SELECT * FROM varray_table");
showResultSet (rs);

// prepare a callable statement -- call the stored procedure
// passing <ref cursor in out, varray in>
System.out.println("CallableStatement: Calling Stored Procedure");
OracleCallableStatement oraStmt1 =
(OracleCallableStatement)conn.prepareCall("{ call 
packageA.get_port_consensus(?, ?) }");
oraStmt1.registerOutParameter(1, OracleTypes.CURSOR);
oraStmt1.setARRAY(2, newArray);
oraStmt1.execute();
rs = (ResultSet)oraStmt1.getObject(1);

// loop through the result set of the ref cursor and display
while (rs.next()) {
System.out.println(rs.getString("sec_grp_id"));
}

// Close all the resources
rs.close();
ps.close();
stmt.close();
oraStmt1.close();
conn.close();
}

public static void showResultSet (ResultSet rs)
throws SQLException
{
int line = 0;
while (rs.next())
{
line++;
System.out.println("Row "+line+" : ");
ARRAY array = ((OracleResultSet)rs).getARRAY (1);

System.out.println ("Array is of type "+array.getSQLTypeName());
System.out.println ("Array element is of type code "+array.getBaseType());

System.out.println ("Array is of length "+array.length());

// get Array elements
BigDecimal<> values = (BigDecimal<>) array.getArray();

for (int i=0; i<values.length; i++)
{
BigDecimal value = (BigDecimal) values;
System.out.println(">> index "+i+" = "+value.intValue());
}
}
}
}


=====================
16. Temporary tables:
=====================

==============================================================================


Question 1:
===========


How do I read files from a certain directory with PL/SQL, without
knowing the exact name ?

My program must interface with another system which puts files in 
a directory on the server. UTL_FILE only reads a file when you 
know the name of the file, but I don't know the name in advance.
Is it possible to use wildcards (eg. '*') in the name of the file?

Answer:
=======

We cannot do this with PLSQL directly however, using Java (or a C extproc) we 
can do this pretty easily.

The interface I came up with uses a global temporary table which will "lose" its 
rows every time you commit.  You'll call a stored procedure providing a 
DIRECTORY to scan and I'll put a list of all of the files that are in that 
directory into this temp table.  If you want to "filter" the files (eg: only 
interested in *.txt files), you'll use SQL "select * from dir_list where 
filename like '%.txt'" to do so.

The implementation is:

ops$tkyte@8i> GRANT JAVAUSERPRIV to ops$tkyte
  2  /

Grant succeeded.

That grant must be given to the owner of the procedure..  Allows them to read 
directories.

ops$tkyte@8i> create global temporary table DIR_LIST
  2  ( filename varchar2(255) )
  3  on commit delete rows
  4  /

Table created.


ops$tkyte@8i> create or replace
  2     and compile java source named "DirList"
  3  as
  4  import java.io.*;
  5  import java.sql.*;
  6  
  7  public class DirList
  8  {
  9  public static void getList(String directory)
 10                     throws SQLException
 11  {
 12      File path = new File( directory );
 13      String[] list = path.list();
 14      String element;
 15  
 16      for(int i = 0; i < list.length; i++)
 17      {
 18          element = list[i];
 19          #sql { INSERT INTO DIR_LIST (FILENAME)
 20                 VALUES (:element) };
 21      }
 22  }
 23  
 24  }
 25  /


Java created.


ops$tkyte@8i> 
ops$tkyte@8i> create or replace
  2  procedure get_dir_list( p_directory in varchar2 )
  3  as language java
  4  name 'DirList.getList( java.lang.String )';
  5  /

Procedure created.

ops$tkyte@8i> 
ops$tkyte@8i> exec get_dir_list( '/tmp' );

PL/SQL procedure successfully completed.

ops$tkyte@8i> select * from dir_list where rownum < 5;

FILENAME
------------------------------------------------------
data.dat
.rpc_door
.pcmcia
ps_data

And thats it... 


More pointers on java:
-----------------------

Exec.execWait("/usr/local/bin/java Foo");

e89. Executing a Command
See also e90 Reading Output from a Command. 
    try {
        // Execute a command without arguments
        String command = "ls";
        Process child = Runtime.getRuntime().exec(command);
    
        // Execute a command with an argument
        command = "ls /tmp";
        child = Runtime.getRuntime().exec(command);
    } catch (IOException e) {
    }

If an argument contain spaces, it is necessary to use the overload that requires the command and its arguments to be supplied in an array: 
    try {
        // Execute a command with an argument that contains a space
        String[] commands = new String[]{"grep", "hello world", "/tmp/f.txt"};
        commands = new String[]{"grep", "hello world", "c:\\Documents and Settings\\f.txt"};
        Process child = Runtime.getRuntime().exec(commands);
    } catch (IOException e) {
    }


=============================================================================

In Oracle, you should create the global temporary table ONCE.  DDL is hugely 
expensive, it commits any outstanding work, you have to drop the table yourself 
(which again commits), and unless you name them uniquely -- it will cause you to 
be able to run this stored procedure serially (only ONE person at a time).

The correct way to code the above will be:

create global temporary table tempslot ( iddd raw(16), nnname varchar2(255) );

create or replace procedure temp_table
as
begin
   insert into tempslot select id, name from temptest;
end;
/

and that is it.

=============================================================================

create global temporary table gtd_tab1 (col1 varchar2(30)) on commit delete 
rows;

=============================================================================

scott@ORA8I.WORLD> variable result_set refcursor
scott@ORA8I.WORLD> 
scott@ORA8I.WORLD> begin
  2    open :result_set for
  3          select a.ename, b.dname
  4            from ( select * from emp ) a,
  5                     ( select * from dept ) b
  6           where a.deptno = b.deptno;
  7  end;
  8  /

PL/SQL procedure successfully completed.

scott@ORA8I.WORLD> 
scott@ORA8I.WORLD> print result_set

ENAME      DNAME
---------- --------------
SMITH      RESEARCH
ALLEN      SALES
WARD       SALES
JONES      RESEARCH
MARTIN     SALES
BLAKE      SALES
CLARK      ACCOUNTING
SCOTT      RESEARCH
KING       ACCOUNTING
TURNER     SALES
ADAMS      RESEARCH
JAMES      SALES
FORD       RESEARCH
miller     ACCOUNTING

14 rows selected.

=============================================================================

Question:

I create a procedure with a creation of a temporary table, but when I execute it,
the following errors comes up:

ERROR at line 1:
ORA-01031: insufficient privileges
ORA-06512: at "BRIOADMIN.TEST604", line 3
ORA-06512: at line 1

Answer:

PLSQL stored procedures execute with the base privs 
of the definer (owner) of the routine meaning that ROLES are not enabled.  

You have the create table privelege via a role, you need to have it granted 
directly to you.


=============================================================================

I believe you mean "temporary tables" -- temporal tables in a database are 
another thing entirely (there is actually such a thing -- temporal tables are 
tables that can return the answer that existed at a point in time -- you can ask 
the table to return the answer that existed at midnight last night, instead of 
the answer that exists right now)...

Oracle's temporary tables are similar to temp tables in those 
other databases the main exception being that they are 'statically' defined.  
You create them once per database, not once per stored procedure in the 
database.  They always exist but appear empty until you put data in them.  They 
may be SESSION based (data survives a commit but not a disconnect/reconnect).  
They may be TRANSACTION based (data disappears after a commit).  Here is an 
example showing the behaviour of both.  I used the scott.emp table as a 
template:

SQL> create global temporary table temp_table_session
  2  on commit preserve rows
  3  as
  4  select * from scott.emp where 1=0
  5  /
Table created.


the ON COMMIT PRESERVE ROWS makes this a session based temporary table.  rows 
will stay in this table until a logoff.  Only I can see them though, no other 
session will ever see 'my' rows even after I commit

SQL> 
SQL> 
SQL> create global temporary table temp_table_transaction
  2  on commit delete rows
  3  as
  4  select * from scott.emp where 1=0
  5  /
Table created.


the ON COMMIT DELETE ROWS makes this a transaction based temp table.  when you 
commit -- the rows disappear.


SQL> insert into temp_table_session select * from scott.emp;
14 rows created.

SQL> insert into temp_table_transaction select * from temp_table_session;
14 rows created.


we've just put 14 rows into each temp table and this shows we can 'see' them:

SQL> select count(*) from temp_table_session
  2  /

  COUNT(*)
----------
        14

SQL> select count(*) from temp_table_transaction
  2  /

  COUNT(*)
----------
        14

SQL> commit;
Commit complete.


since we've committed, we'll see the session based rows but not the transaction 
based rows:


SQL> 
SQL> select count(*) from temp_table_session
  2  /

  COUNT(*)
----------
        14

SQL> select count(*) from temp_table_transaction
  2  /

  COUNT(*)
----------
         0

SQL> 


SQL> connect tkyte/tkyte
Connected.
SQL> 

since we've started a new session, we'll see no rows now:


SQL> 
SQL> select count(*) from temp_table_session
  2  /

  COUNT(*)
----------
         0

SQL> select count(*) from temp_table_transaction
  2  /

  COUNT(*)
----------
         0

SQL> 


Instead of executing "select x, y, z into #temp from some_table" you would:

o once per database create "TEMP" as a global temporary table.

o then in your procedures you would simply "insert into temp (x,y,z) select 
x,y,y from some_table"


=============================================================================


If you really need the temp table to be created in the procedure itself, 
Oracle8i release 8.1 makes this much easier to do as well.  Consider the 
following example which uses plsql to create, insert into, fetch from and drop a 
temporary table -- whose name is not known until run time.  Its almost as easy 
as static sql is:

declare
      type mycur is ref cursor;
  
      l_tname     varchar2(30) default 'temp_table_' || userenv('sessionid');
      l_cursor    mycur;
      l_ename     scott.emp.ename%type;
  begin
      execute immediate 'create global temporary table ' ||
                         l_tname || ' on commit delete rows
                         as
                         select * from scott.emp where 1=0 ';
  
      execute immediate 'insert into ' || l_tname ||
                        ' select * from scott.emp';
  
      open l_cursor for
          'select ename from ' || l_tname || ' order by ename';
  
      loop
          fetch l_cursor into l_ename;
          exit when l_cursor%notfound;
          dbms_output.put_line( l_ename );
      end loop;
  
      close l_cursor;
      execute immediate 'drop table ' || l_tname;
  end;
  /


=============================================================================

var c refcursor;
exec STP_BRAINS_GETEENHEID(3301,:c);
print c

var c refcursor;
exec STP_BRAINS_GETEENHEIDPARENTVRB(3307,'01-JUL-01',sysdate,:c);
print c

var c refcursor;
exec STP_BRAINS_GETEENHEIDVERBRUIK(3307,'01-JUL-01',sysdate,:c);

var c refcursor;
exec STP_BRAINS_GETEENVERBR(3307,'01-JUL-01',sysdate,:c);

var c refcursor;
exec STP_BRAINS_GETEENHEIDPARENTVRB(3307,'01-JUL-01',sysdate,:c);

var c refcursor;
exec STP_BRAINS_GETEENHEIDPARENTVRB(3326,'01-JUL-01',sysdate,:c);

exec export_bios(3326,'01-JUL-01',sysdate);

var c refcursor;
exec STP_BRAINS_GETTANKSBYLOCATIE(651,:c);
print c

var c refcursor;
exec STP_BRAINS_GETALLVRDTANKBULK(154,'01-JAN-2004','01-JUL-2004',:c);
print c

var c refcursor;
exec STP_BRAINS_GETVRBBULKHIER(154,'01-JAN-2004','01-JUL-2004',:c);


SQL> var c refcursor;
SQL> exec STP_BRAINS_GETEENHEIDPARENTVRB(3307,'01-JUL-01',sysdate,:c);

PL/SQL procedure successfully completed.

SQL> print c

HIERARCHIE EENHEID                                            ELCO                 LOCATIE                                            SOORTLOCATIE                                       KENTEKEN            KMSTAND   VERBRUIK     LITERS DATUM     TIJD       BRANDSTOF                                           PRIJSEURO TOTAALEURODISPLAY
---------- -------------------------------------------------- -------------------- -------------------------------------------------- -------------------------------------------------- ---------------- ---------- ---------- ---------- --------- ---------- -------------------------------------------------- ---------- -----------------
         0 KABINET BEVELHEBBER KLU                            8103B1K000           SHELL Kok Ruygenhoek-West                          Geautomatiseerd civiel tankstation                 LM-50-80             129941       20.8       61.9 27-MAY-02 12:00      MTC DIESEL (30)                                     .80242326        49.6699998
         0 KABINET BEVELHEBBER KLU                            8103B1K000           B. Kerkhof en Zn Bv (993910)                       Geautomatiseerd civiel tankstation                 LM-50-80             130724       13.1         60 12-JUN-02 12:00      MTC DIESEL (30)                                          .778             46.68
         0 KABINET BEVELHEBBER KLU                            8103B1K000           Ss De Thij (993949)                                Geautomatiseerd civiel tankstation                 LM-50-80             131262       13.8         39 17-JUN-02 12:00      MTC DIESEL (30)                                     .74692307        29.1299997
         0 KABINET BEVELHEBBER KLU                            8103B1K000           B. Kerkhof en Zn Bv (993910)                       Geautomatiseerd civiel tankstation                 LM-50-80             131774       12.8         40 21-JUN-02 12:00      MTC DIESEL (30)                                          .783             31.32
         0 KABINET BEVELHEBBER KLU                            8103B1K000           B. Kerkhof en Zn Bv (993910)                       Geautomatiseerd civiel tankstation                 LM-50-80             132404       33.6         50 05-JUL-02 12:00      MTC DIESEL (30)                                          .783             39.15
         0 KABINET BEVELHEBBER KLU                            8103B1K000           B. Kerkhof en Zn Bv (993910)                       Geautomatiseerd civiel tankstation                 LM-50-80                                        1 12-JUL-02 12:00      MTC SMEERMIDDELEN (57)                                     25                25
         0 KABINET BEVELHEBBER KLU                            8103B1K000           B. Kerkhof en Zn Bv (993910)                       Geautomatiseerd civiel tankstation                 LM-50-80             133192                    59 24-JUL-02 12:00      MTC DIESEL (30)                                     .77898305             45.96

7 rows selected.

SQL> spool  off

=============================================================================


=============================================================================


=======================
17. Iets over SQL*Plus:
=======================


15.1 SQL*Plus commands:
----------------------

SQL>1                 -- maakt regel 1 current
SQL>c
SQL>edit              -- roept de editor op met inhoud uit afiedt.buf
SQL>get filename
SQL>run
SQL>@SELECT_emp.sql
SQL>@c:\scripts\create_users.sql   -- runs dit scriptfile
SQL>desc
SQL>list
SQL>del number        -- verwijdert regel

SQL>append string     -- toevoegen aan cuurent line

SQL>clear buffer

SQL>input             -- toevoegen new line aan einde buffer

SQL>number string     -- vervangt die line

SQL>spool (to filename)

SQL>save filename

set some_configuration 
                       - set linesize 120
                       - set pagesize 66


15.2 Entering variables:
-----------------------


1. Substitution variable &variable:

Laat SQL*Plus vragen om waarde via &variable:

  select ename, job, deptno, sal
  from emp
  where ename='&ename';

  select empno, ename, job, deptno, sal
  from emp
  where empno=&empno;  


2. Set define ?

  select empno, ename, job, deptno, sal
  from emp
  where empno=?empno;


3. Accept

In een script kan het accept commando handig zijn:

  accept var_empno prompt 'Enter empno: '
  select ename, job
  from emp
  where empno=?var_empno;


15.3 SQL*Plus environment:
-------------------------

Hier gaat het om de SQL*Plus systemvariables die met het
set commando zijn in te stellen:

SET ARRAYSIZE

Hiermee bepaal je het aantal rows dat SQL*Plus in 1 batch ophaalt
vanuit de database. 

SET COLSEP

Dit is de column seperator, kop en data

set colsep '-'
set colsep ' '

SET FEEDBACK

Deze setting bepaald of je de "n rows selected" 
bij de query output /
te zien krijgt.

set feedback on/off

SET HEADING

Dit bepaald of je de kolomkoppen te zien krijgt
in de query output.
set heading on/off

SET LINESIZE

Totaal aantal karakters wat op een line getoont wordt
voordat de volgende regel begint.

set linesize 120

SET PAGESIZE

Sets het aantal lines per page

SET LONG

Zet de maximum breedte in bytes by displaying
long, clob, nclob values

SET NUMFORMAT

Sets de default format voor het tonen getallen.

SET NUMWIDTH

Sets de default breedte voor het tonen getallen.

SET PAUSE

SET TERMOUT

Dit bepaald of je de output ziet op scherm van commands
die uitgevoerd worden per script

set termout on/off

SET TRIMSPOOL ON/OFF

------------------------------------------------------------


============
OTHER STUFF:
============


1. Generate Insert statements from an Oracle table:
---------------------------------------------------

Example 1:
----------

Build Insert Statements for the Existing Data in Tables 

by Ashish Kumar <kumara@jagat.com> 

This script builds insert statements for the existing data in the tables. One can run the generated script 
to repopulate the data. 

-- By: Ashish Kumar
-- Date Created: 10/01/2001
-- EMail: kumara@jagat.com
-- Code Version: 1.0.1

-- Objective:
-- You can use the following code to extract the existing data from tables in the form
-- of insert statements.  The generated script can be run at a later time to re-create your data.
-- This code is no match for EXPORT and IMPORT utilities.
-- Use it for *quick and dirty* situations.
-- The code handles only date, char, varchar2, and numeric data types.

-- Change History:

-- The example used in the code uses scott schema.

-- AUTHOR MAKES NO WARRANTIES FOR THIS CODE.

-- Step 1: Create this procedure:
create or replace Function ExtractData(v_table_name varchar2) return varchar2 As
    b_found boolean:=false;
    v_tempa varchar2(8000);
    v_tempb varchar2(8000);
    v_tempc varchar2(255);
begin
    for tab_rec in (select table_name from user_tables where table_name=upper(v_table_name))
    loop
        b_found:=true;
        v_tempa:='select ''insert into '||tab_rec.table_name||' (';
        for col_rec in (select * from user_tab_columns
                            where
                                table_name=tab_rec.table_name
                            order by
                                column_id)
        loop
            if  col_rec.column_id=1 then
                v_tempa:=v_tempa||'''||chr(10)||''';
            else
                v_tempa:=v_tempa||',''||chr(10)||''';
                v_tempb:=v_tempb||',''||chr(10)||''';
            end if;
            v_tempa:=v_tempa||col_rec.column_name;
            if  instr(col_rec.data_type,'CHAR') > 0 then
                v_tempc:='''''''''||'||col_rec.column_name||'||''''''''';
            elsif instr(col_rec.data_type,'DATE') > 0 then
                v_tempc:='''to_date(''''''||to_char('||col_rec.column_name||',''mm/dd/yyyy hh24:mi'')||'''''',''''mm/dd/yyyy hh24:mi'''')''';
            else
                v_tempc:=col_rec.column_name;
            end if;
            v_tempb:=v_tempb||'''||decode('||col_rec.column_name||',Null,''Null'','||v_tempc||')||''';
        end loop;
        v_tempa:=v_tempa||') values ('||v_tempb||');'' from '||tab_rec.table_name||';';
    end loop;
    if  Not b_found then
        v_tempa:='-- Table '||v_table_name||' not found';
    else
        v_tempa:=v_tempa||chr(10)||'select ''-- commit;'' from dual;';
    end if;
    return v_tempa;
end;
/
show errors

-- STEP 2: Run the following code to extract the data.
set head off
set pages 0
set trims on
set lines 2000
set feed off
set echo off
var retline varchar2(4000)
spool c:\t1.sql
select 'set echo off' from dual;
select 'spool c:\CI_MD_AT_DTL.sql' from dual;
select 'select ''-- This data was extracted on ''||to_char(sysdate,''mm/dd/yyyy hh24:mi'') from dual;' from dual;

-- Repeat the following two lines as many times as tables you want to extract
exec :retline:=ExtractData('CI_MD_AT_DTL');
print :retline;

select 'spool off' from dual;
spool off
@c:\t1

-- STEP3: Run the spooled output c:\recreatedata.sql to recreate data.


Example 2:
----------


Try the following, in sqlplus 

SQL> set trimspool on
SQL> set pagesize 0
SQL> set heading off
SQL> set feedback off
SQL> set termout off
SQL> spool file.txt
SQL> SELECT 'INSERT INTO TABLE_B (foo, bar) VALUES (' || foo || ', ' || bar || ');' FROM TABLE_A;
SQL> spool off
SQL> exit
It should result in a file called file.txt with one row for each row returned by the select statement.


If your rows contain strings, you'll need to quote them, like...

	... VALUES (''', || foo || ''', ''' || bar || ''');' FROM ...
If your strings contain single quotes, you'll also need to pass them through the REPLACE function to escape them, like...

	... || REPLACE(foo, ''', '''') || ...
.


Example 3:
----------

                       Retrieving Data from the Database
 This bulletin covers two methods for retrieving data in the database for
reloading.  The first, is a script that will generate insert statements for
an existing table. The second, is a script that once executed will
generate a control file that can be used with sql*loader as well as spools
the data to a file to be used by the control file.  Both scripts do all the
work automatically, you simply need to provide a table name.  This
information has been provided to Oracle Worldwide Support by a customer,
Ramesh K Meda.
 Preliminatry testing was done on these scripts, but we urge all users to conduct
tests in their environment.  The end user is solely responsible for results of
the execution of these scripts.
 SCRIPT TO GENERATE INSERT STATEMENTS:
------------------------------------
 /*
|| File:
|| Dump.SQL
|| Description:
|| Creates insert statements for existing data
|| Author:
|| Ramesh K Meda.
||
*/
set pages 0
set lines 132
set verify off
set feedback off
 accept sTab prompt 'Enter table name: '
 column MaxColId noprint new_val sMaxColId
 select max(column_id) MaxColId
from   user_tab_columns
where  table_name = upper('&sTab')
/
 spool junk.sql
prompt Select
select decode(column_id, 1, '''insert into &sTab. Values ('' || chr(10) || ')
    || decode(column_id, 1, '', ' || ')
    || decode(column_id, 1, '', ''','' ||')
    || 'decode(' || column_name || ', null, ''Null'',' || '''''''''|| ' ||
       column_name || ' || '''''''''  || ')'
    || '|| chr(10)'
    || decode(column_id, &sMaxColId, ' || '')'' || chr(10) || ''/'' ', null)
from   user_tab_columns
where  table_name = upper('&sTab')
order  by column_id
/
Prompt from &sTab.;
Prompt /
spool off
 spool &sTab..dat
@junk.sql
spool off
Prompt Output spooled to &sTab..dat


Example 4:
----------

Hi

I need to write a package or procedure to generate insert statements, for 
example generate insert statements for EMP table I would use this

select 'insert into emp values ('
       || empno
       ||', '
       || '''' || ename || ''''
       || ', '
       || '''' || job || ''''
       || ', '
       || mgr
       || ', '
       || '''' || hiredate || ''''
       || ', '
       || sal 
       || ', '
       || '''' || nvl(comm, '') || ''''
       ||', '
       || deptno
       || ');'
from emp;

and this generates

insert into emp values (7369, 'SMITH', 'CLERK', 7902, '19801217 00:00:00', 800, 
'', 20);
insert into emp values (7499, 'ALLEN', 'SALESMAN', 7698, '19810220 00:00:00', 
1600, '300', 30);
insert into emp values (7521, 'WARD', 'SALESMAN', 7698, '19810222 00:00:00', 
1250, '500', 30);

I am trying to do this using a pl/sql which takes a parameter, the table name 
and by querying the data dictionary this will generate the insert statements. I 
can generate 

select 'insert into ci_md_ctl_l values ('
       || plug_in_name
       ||', '
       || '''' || language_cd || ''''
       || ', '
       || '''' || descr || ''''
       || ', '
       || version
       || ', '
       || '''' || seq_num 
              || ');'
from ci_md_ctl_l;


-- END OF FILE


#############################################################################################
#############################################################################################
#############################################################################################


=====================================================
Section 16: Basic VB and VBscript code snippets:
=====================================================


Part I  deals with VBSCRIPT
Part II deals with VB6


####################################################################
PART I: VBSCRIPT:
####################################################################


1. Introduction:
================

What is VBScript?

-VBScript is a scripting language 
-A scripting language is a lightweight programming language 
-VBScript is a light version of Microsoft's programming language Visual Basic 

How Does it Work?

When a VBScript is inserted into a HTML document, the Internet browser will read the HTML 
and interpret the VBScript. The VBScript can be executed immediately, or at an event that occurs later.
Scripting languages, like JavaScript and VBScript, are designed as an extension to HTML. 
The web browser receives scripts along with the rest of the web document. 
It is the browser's responsibility to parse and process the scripts. 
HTML was extended to include a tag that is used to incorporate scripts into HTML-the <SCRIPT> tag. 

One of the main purposes of adding script to a web page is to create event procedures
for objects on a page, such as ActiveX controls and standard HTML controls.


2. Old and new ways to place in HTML:
=====================================

example 1: old style with older browsers:
-----------------------------------------

<HTML>
<HEAD>
<TITLE>VBScript Test</TITLE>
<SCRIPT LANGUAGE="VBS">
<!--
Sub Button1_OnClick
        MsgBox "Hello World!"
End Sub
-->
</SCRIPT>
</HEAD>
<BODY>
<H1>VBScript Test</H1>
<HR>
<FORM>
<INPUT NAME="Button1" TYPE="BUTTON" VALUE="Press here to see the message">
</FORM>
</BODY>
</HTML>

Comment:

You see that in order to support the really old browsers
we have put the statements between comment tags <!--    -->.


example 2: new style:
---------------------

<html>                                          <html>
<head>                                          <head>
                                                </head>
<script type="text/vbscript">                   <body>
sub mySub()                                     <SCRIPT LANGUAGE="VBSCRIPT">
  msgbox("This is a sub procedure")                MsgBox "Welcome to my web page!"
end sub
</script>                                       </SCRIPT>
                                                </body>
</head>                                         </html>

<body>
<script type="text/vbscript">
call mySub()
</script>
<p>A sub procedure does not return a result.</p>

</body>
</html>


  To insert a script in an HTML document, use the <script> tag. 
  Use the type attribute to define the scripting language.

  <script type="text/vbscript">
 
  And end the script with

  </script>


Some info about HTML Forms:
===========================

Study the example below to see some fundamental HTML form controls:

HEAD>
<TITLE>Intranet Time Away From Work</TITLE>
</HEAD>
<BODY>
<CENTER>
<H3>Intranet Time-Away-From-Work Form</H3>
</CENTER>
<FORM METHOD=POST ACTION="savedata.exe">

<PRE>
<BR>Your Name:      <INPUT NAME="name" TYPE=text SIZE=50 MAXSIZE=50>
<BR>E-mail address: <INPUT NAME="email" TYPE=text SIZE=50 MAXSIZE=50>
<BR>Dates Absent:   <INPUT NAME="item" TYPE=text SIZE=50 MAXSIZE=50>
<BR>Special Notes:  <TEXTAREA NAME="reason" ROWS=2 COLS=55 MAXLENGTH=150></TEXTAREA>
</PRE>

<P>Type of Absence:
<INPUT TYPE=radio NAME="holiday" VALUE="holiday" Checked>Holiday
<INPUT TYPE=radio NAME="vacation" VALUE="vacation">Vacation
<INPUT TYPE=radio NAME="sick" VALUE="sick">Sick
<INPUT TYPE=radio NAME="leave" VALUE="leave">Leave of Absence
<INPUT TYPE=checkbox NAME="pay" VALUE="pay" CHECKED>With Pay
</P>

<P>Your Department:
<SELECT NAME="department">
<OPTION>Accounting
<OPTION>Administration
<OPTION>Engineering
<OPTION>Marketing
<OPTION SELECTED>Sales
<OPTION>Support
</SELECT>

<INPUT TYPE=submit VALUE="I'm Outta Here!">
</P>
</FORM>

<HR>
<ADDRESS>
This document last modified: April 20, 1996<BR>
By Scott Zimmerman<BR>
e-mail: <A HREF="mailto:scottz@sd.znet.com">scottz@sd.znet.com</A>
</ADDRESS>
</BODY>
</HTML>


3. SIMPLE FORMS AND VBSCRIPT:
=============================

example 1:
----------

<HTML>
<HEAD>
<TITLE>Working With VBScript: Exercise 1</TITLE>
</HEAD>

<BODY>

  <H1>Your First VBScript Exercise</H1>
  <P> By utilizing VBScript you can give your Web pages actions. 
  Click on the button below to see what we mean. </P>

  <FORM NAME="frmExercise1">
    <INPUT TYPE="Button" NAME="cmdClickMe" VALUE="Click Me">
    <SCRIPT FOR="cmdClickMe" EVENT="onClick" LANGUAGE="VBScript">
      MsgBox "A simple example of VBScript in action."
    </SCRIPT>

  </FORM>
</BODY>
</HTML>

example 2: A better approach for example 1:
-------------------------------------------

<HTML>
<HEAD>
<TITLE>Working With VBScript: Exercise 1</TITLE>

<SCRIPT LANGUAGE="VBScript">

<!-- Instruct non-IE browsers to skip over VBScript modules.
  Sub cmdClickMe_OnClick
    MsgBox "A simple example of VBScript in action."
  End Sub
-->

</SCRIPT>

</HEAD>

<BODY>

  <H1>Your First VBScript Exercise</H1>
  <P> By utilizing VBScript you can give your Web pages actions. 
  Click on the button below to see what we mean. </P>

  <FORM NAME="frmExercise1">
    <INPUT TYPE="Button" NAME="cmdClickMe" VALUE="Click Me">
  </FORM>
</BODY>
</HTML>

Now we have used a sub-procedure called cmdClickMe_OnClick. 
This will be executed any time that the control cmdClickMe is clicked. 
This type of procedure is referred to as an event procedure. 
The event is the user clicking the button. 


4. USING VARIABLES:
===================

A variable is a named location in computer memory that you can use 
for storage of data during the execution of your scripts. You can use variables to:

-Store input from the user gathered via your web page
-Save data returned from functions
-Hold results from calculations

- Declare a variable:

Dim
Name

- Assigning values:

Variable_name = value

Name = "Larry Roof"
HoursWorked = 50
Overtime = True

The VBScript language provides support for arrays. You declare an array using the Dim statement, 
just as you did with variables:

Dim States(50)

The statement above creates an array with 51 elements. 
Why 51? Because VBScript arrays are zero-based, meaning that the first 
array element is indexed 0 and the last is the number specified when declaring the array.


Example 1:
----------

<HTML>
<HEAD>
<TITLE>Working With VBScript: Exercise 1</TITLE>

<SCRIPT LANGUAGE="VBScript">
<!-- Instruct non-IE browsers to skip over VBScript modules.

Sub cmdClickMe_OnClick
  Dim Name
  Name = InputBox("Enter your name: ")
  MsgBox "The name you entered was " & Name
End Sub

-->
</SCRIPT>

</HEAD>
<BODY>

  <H1>Your First VBScript Exercise</H1>
  <P> By utilizing VBScript you can give your Web pages actions. 
  Click on the button below to see what we mean. </P>
  <FORM NAME="frmExercise1">

    <INPUT TYPE="Button" NAME="cmdClickMe" VALUE="Click Me">

  </FORM>
</BODY>
</HTML>


Example 2:
----------

<HTML>
<HEAD>
<TITLE>Working With VBScript: Exercise 2</TITLE>
<SCRIPT LANGUAGE="VBScript">
<!-- Add this to instruct non-IE browsers to skip over VBScript modules.

Option Explicit

Sub cmdCalculate_OnClick
  Dim AmountofTax
  Dim CRLF
  Dim Message
  Dim Subtotal
  Dim TABSPACE
  Dim TAX_RATE
  Dim TotalCost

' Define our constant values.
  TAX_RATE = 0.06
  CRLF = Chr(13) & Chr(10)
  TABSPACE = Chr(9)

' Perform order calculations.
  Subtotal = document.frmExercise2.txtQuantity.value _
           * document.frmExercise2.txtUnitPrice.value

  AmountofTax = Subtotal * TAX_RATE
  TotalCost = Subtotal + AmountofTax

' Display the results.
  Message = "The total for your order is:"
  Message = Message & CRLF & CRLF
  Message = Message & "Subtotal:" & TABSPACE & "$" & Subtotal & CRLF
  Message = Message & "Tax:" & TABSPACE & "$" & AmountofTax & CRLF
  Message = Message & "Total:" & TABSPACE & "$" & TotalCost

  MsgBox Message,,"Your Total"
End Sub
-->
</SCRIPT>
</HEAD>

<BODY>
<H1>Your Second VBScript Exercise</H1>
<P> Variables can be used to store and manipulate values. To 
see a demonstration of this enter a quantity and unit price 
in the fields below and click the "Calculate Cost" button.</P>

<FORM NAME="frmExercise2">
  <TABLE>
    <TR>
      <TD><B>Quantity:</B></TD>
      <TD><INPUT TYPE="Text" NAME="txtQuantity" SIZE=5></TD>
    </TR>
    <TR>
      <TD><B>Unit price:</B></TD>
      <TD><INPUT TYPE="Text" NAME="txtUnitPrice" SIZE=5></TD>
    </TR>
  </TABLE>
  <BR>
  <INPUT TYPE="Button" NAME="cmdCalculate" VALUE="Calculate Cost">
</FORM>
</BODY>
</HTML>

Comments:

1. Chr() is a VBScript function that returns the character associated 
with a specified ASCII code. ASCII codes 13, 10 and 9 are carriage return, 
line feed and tab, respectively. 

CRLF = Chr(13) & Chr(10)
TABSPACE = Chr(9)

2. The form was named frmExercise2. Here we are referencing our web document, 
then the form, then the input field and finally the value of that field. 
The value associated with each field contains what the user entered into 
that field on the web page. The * says to multiply the value of the first 
field, txtQuantity, by the second field, txtUnitPrice.


5. SCRIPT FLOW AND LOOPS:
=========================

Simple examples IF:
-------------------

if i=10 Then msgbox "Hello"

if i=10 Then
   msgbox "Hello"
   i = i+1
end If

if i=10 then
   msgbox "Hello"
else
   msgbox "Goodbye"
end If

if payment="Cash" then
   msgbox "You are going to pay cash!"
 elseif payment="Visa" then
   msgbox "You are going to pay with visa."
 elseif payment="AmEx" then
   msgbox "You are going to pay with American Express."
 else
   msgbox "Unknown method of payment."
end If


Simple examples SELECT CASE:
----------------------------

select case payment
 case "Cash"
   msgbox "You are going to pay cash"
 case "Visa"
   msgbox "You are going to pay with visa"
 case "AmEx"
   msgbox "You are going to pay with American Express"
 case Else
   msgbox "Unknown method of payment"
end select


Simple examples FOR..NEXT or Do While:
--------------------------------------

dim names(2)
names(0)="Tove"
names(1)="Jani"
names(2)="Hege"

For Each x in names
  document.write(x & "<br>")
Next

Dim x(10)
For i=1 to 10
    x(i)=i*10
Next

Do While i>10
  some code
Loop


6. OBJECTS:
===========

- standard - intrinsic html: controls such as buttons, textboxes on forms
- ActiveX controls
- Java applets


6.1 Assigning names to controls:
--------------------------------

To use an object in client-side script, you must first create the object and then
assign a name to it. You use this object name to create event procedures 
and to access the properties and methods of the object. The syntax for assigning
names varies slightly for different types of objects.

Standard HTML Controls:
-----------------------

To assign a name to a standard HTML control, you set the NAME attribute.
example:

<INPUT TYPE="BUTTON" NAME="cmdValidateOrder" VALUE="Validate Order">

Now you can write eventprocedures for this control like 

Sub cmdValidateOrder_OnClick
  -- code
End Sub

ActiveX Controls:
-----------------

To assign a name to an ActiveX control, you set the ID attribute
of the <OBJECT> tag.
example:


<OBJECT
  classid="clsid:99B42120-6EC7-11CF-A6C7-00AA00A47DD2"
  id=lblOccupation
>


Java Applets:
-------------

To assign a name to a Java applet control, you set the NAME attribute
of the <APPLET> tag.

<APPLET
  CODE=Outline.class
  NAME=myoutline
  HEIGHT=150
  WIDTH=200>
</APPLET>


7. VBScript and DTS:
====================

Suppose we need to load data from one table, or other source like a textfile,
into a table in a SQL Server database, with DTS.

Suppose a limited form of data transformation is needed. You might
use VBScript to accomplish this, as in the following simple example:

Function Main()
Dim strFullName
Dim intLoc

' Copy most of the fields directly:

DTSDestination("customer-id")=DTSSource("Cust-Num")
DTSDestination("Country")=DTSSource("Country")
DTSDestination("Name")=DTSSource("Name")
DTSDestination("City")=DTSSource("City")
'etc
' Now split the ContactName into firstname and lastname fields:
strFullName=DTSSource("Contact")
intLoc=InstrRev(strFullName," ")  'looking for a space
If intLoc <> 0 Then
   DTSDestination("ContactFname")=Left(strFullName, intLoc)
   DTSDestination("ContactLname")=Mid(strFullName, intLoc+1)
End If

Main=1


DTSDestination("customer-id")=DTSSource("Cust-Num")
DTSDestination("customer-id")=DTSSource("Cust-Num")


8. FORM VALIDATION:
===================

Form validation involves checking if the required information is provided by the user. 
We can make sure to see if a field is empty, figure out type of value provided, 
count number of characters or value, can check if special character(s) is present and more. 
Here is a syntax for checking a field value.

if form.name.value="" then msgbox"Please enter name". 

This checks if the name field is empty and informs the user.

if len(form.name.value) < 2 or len(form.txtname.value)>50 then msgbox "Too short or too long". 

This checks if the lenth of the value provided is less than 2 characters or more than 50 characters 
and if so prompts message.

if instr(form.txtemail.value,"@") = 0 then msgbox("Invalid e-mail"). 

This checks if @ is present and prompts message if not.

if (Not (IsNumeric(form.txtage.value)) then msgbox"Invalid age". 

This check if the value is not numeric and prompts message if so. To check if it's numeric, 
do not specify NOT before IsNumeric.

form.txtage.focus
. This put focus in the text box, age.
form.txtage.select
. This highlights the text in the text box.


9. SOME MORE EXAMPLES:
======================


Example 1: replace text
-----------------------

<html>
<body>

<script type="text/vbscript">
sometext="Welcome to this Web!!"
document.write(Replace(sometext, "Web", "Page"))
</script>

</body>
</html>

Example 2: trim spaces
----------------------

<html>
<body>

<script type="text/vbscript">
fname=" Bill "
document.write("Hello" & trim(fname) & "Gates<br />")
document.write("Hello" & rtrim(fname) & "Gates<br />")
document.write("Hello" & ltrim(fname) & "Gates<br />")
</script>

</body>
</html>

Example 3: display date and time
--------------------------------

<html>
<body>

<script type="text/vbscript">
document.write("Today's date is " & date())
document.write("<br />")
document.write("The time is " & time())
</script>

</body>
</html>

Example 4: format the date
--------------------------

<html>
<body>

<script type="text/vbscript">
document.write(FormatDateTime(date(),vbgeneraldate))
document.write("<br />")
document.write(FormatDateTime(date(),vblongdate))
document.write("<br />")
document.write(FormatDateTime(date(),vbshortdate))
document.write("<br />")
document.write(FormatDateTime(now(),vblongtime))
document.write("<br />")
document.write(FormatDateTime(now(),vbshorttime))
</script>

<p>Syntax for FormatDateTime: FormatDateTime(date,namedformat).</p>

</body>
</html>


Example 5:
----------

<html>
<body>

<script type="text/vbscript">
for i=1 to 6
 document.write("<h" & i & ">This is header " & i & "</h" & i & ">")
next
</script>

</body>
</html>


Example 6: Sort a 100 random numbers:
-------------------------------------

<HTML><HEAD><TITLE>Sorting an Array</TITLE>
<SCRIPT LANGUAGE="VBScript">
Dim Data(100)
Sub Generate_OnClick()
Randomize()
'Generate numbers
Numbers = ""
For i = 1 to 100
Data(i)=Int(Rnd()*100 + 1)
Numbers=Numbers & Data(i) & chr(13) & chr(10)
Next
Form1.Input.Value=Numbers
End Sub
Sub Sort_OnClick()
'Sort the array
For i = 1 To Ubound(Data) -1 
For j = i + 1 to Ubound(Data)
If Data(i) > Data(j) Then
Temp = Data(j)
Data(j) = Data(i)
Data(i) = Temp
End if
Next
Next
Numbers =""
For i = 1 to UBound(Data)
Numbers = Numbers & Data(i) & Chr(13) & Chr(10)
Next 
Form1.Output.Value = Numbers
End Sub
</SCRIPT></HEAD>
<BODY BGCOLOR=lightblue>
<H1>Sorting an Array</H2><P>
Click the Generate button to generate 100 random numbers in the text box on the left.
Then click the Sort button to sort these numbers and put them in the text box on the right.
<P><INPUT TYPE=Button Name=Generate VALUE="Generate">
<INPUT TYPE=Button Name=Sort VALUE="Sort">
<FORM NAME=Form1><P>
<TEXTAREA NAME=Input ROWS=12 COLS=15></TEXTAREA>
<TEXTAREA NAME=Output ROWS=12 COLS=15></TEXTAREA>
</FORM>
<div style="display: block; font-family: Verdana, Geneva, Arial; font-size: 10px">
The University of Southern California does not screen or control the content on this website and thus does not guarantee the accuracy, integrity, or quality of such content.  All content on this website is provided by and is the sole responsibility of the person from which such content originated, and such content does not necessarily reflect the opinions of the University administration or the Board of Trustees
</div>


Example 7: date function
------------------------

DatePart function is a very useful function to get the a part of a date. You may get year, month, day of year .. etc. 
of a specific date. 


An Example :

Function GetYear(strDate)
   GetYear = DatePart("yyyy", strDate)
End Function


Some of settings can be used with DatePart function :

yyyy : Year 
q : Quarter 
m : Month 
y : Day of year 
d : Day 
w : Weekday 
ww : Week of year 
h : Hour 
n : Minute 
s : Second 


10 WRITE TO FILES:
==================

Dim objFile, strGuyFile, strFilePath

strFilePath = "e:\ezine\strGuyFile.txt"
Set objFile = CreateObject("Scripting.FileSystemObject")
Set strGuyFile = objFile.CreateTextFile(strFilePath, True)
strGuyFile.WriteLine("This was made using VBScript.")
strGuyFile.Close


Creating Files
There are three ways to create an empty text file (sometimes referred to as a "text stream").

The first way is to use the CreateTextFile method. The following example demonstrates how to create a text file using the CreateTextFileMethod method.

[VBScript]
Dim fso, f1
Set fso = CreateObject("Scripting.FileSystemObject")
Set f1 = fso.CreateTextFile("c:\testfile.txt", True)
[JScript]
var fso, f1;
fso = new ActiveXObject("Scripting.FileSystemObject");
f1 = fso.CreateTextFile("c:\\testfile.txt", true);
The second way to create a text file is to use the OpenTextFile method of the FileSystemObject object with the ForWriting flag set.

[VBScript]
Dim fso, ts
Const ForWriting = 2
Set fso = CreateObject("Scripting. FileSystemObject")
Set ts = fso.OpenTextFile("c:\test.txt", ForWriting, True)
[JScript]
var fso, ts;
var ForWriting= 2;
fso = new ActiveXObject("Scripting.FileSystemObject");
ts = fso.OpenTextFile("c:\\test.txt", ForWriting, true);
A third way to create a text file is to use the OpenAsTextStream method with the ForWriting flag set.

[VBScript]
Dim fso, f1, ts
Const ForWriting = 2
Set fso = CreateObject("Scripting.FileSystemObject")
fso.CreateTextFile ("c:\test1.txt")
Set f1 = fso.GetFile("c:\test1.txt")
Set ts = f1.OpenAsTextStream(ForWriting, True)
[JScript]
var fso, f1, ts;
var ForWriting = 2;
fso = new ActiveXObject("Scripting.FileSystemObject");
fso.CreateTextFile ("c:\\test1.txt");
f1 = fso.GetFile("c:\\test1.txt");
ts = f1.OpenAsTextStream(ForWriting, true);
Adding Data to the File
Once the text file is created, add data to the file using the following three steps:

Open the text file. 

Write the data. 

Close the file. 

To open an existing file, use either the OpenTextFile method of the FileSystemObject object or the OpenAsTextStream method of the File object.

To write data to the open text file, use the Write, WriteLine, or WriteBlankLines methods of the TextStream object, according to the tasks outlined in the following table.

Task Method 
Write data to an open text file without a trailing newline character. Write 
Write data to an open text file with a trailing newline character. WriteLine 
Write one or more blank lines to an open text file. WriteBlankLines 

To close an open file, use the Close method of the TextStream object.

Note   The newline character contains a character or characters (depending on the operating system) to advance the cursor to the beginning of the next line (carriage return/line feed). Be aware that the end of some strings may already have such nonprinting characters.
The following example demonstrates how to open a file, use all three write methods to add data to the file, and then close the file:

[VBScript]
Sub CreateFile()
   Dim fso, tf
   Set fso = CreateObject("Scripting.FileSystemObject")
   Set tf = fso.CreateTextFile("c:\testfile.txt", True)
   ' Write a line with a newline character.
   tf.WriteLine("Testing 1, 2, 3.") 
   ' Write three newline characters to the file.        
   tf.WriteBlankLines(3) 
   ' Write a line.
   tf.Write ("This is a test.") 
   tf.Close
End Sub
[JScript]
function CreateFile()
{
   var fso, tf;
   fso = new ActiveXObject("Scripting.FileSystemObject");
   tf = fso.CreateTextFile("c:\\testfile.txt", true);
   // Write a line with a newline character.
   tf.WriteLine("Testing 1, 2, 3.") ;
   // Write three newline characters to the file.
   tf.WriteBlankLines(3) ;
   // Write a line.
   tf.Write ("This is a test.");
   tf.Close();
}
Reading Files
To read data from a text file, use the Read, ReadLine, or ReadAll method of the TextStream object. The following 
table describes which method to use for various tasks.

Task Method 
Read a specified number of characters from a file. Read 
Read an entire line (up to, but not including, the newline character). ReadLine 
Read the entire contents of a text file. ReadAll 

If you use the Read or ReadLine method and want to skip to a particular portion of data, use the Skip or SkipLine method. 
The resulting text of the read methods is stored in a string which can be displayed in a control, 
parsed by string functions (such as Left, Right, and Mid), concatenated, and so forth.

The following example demonstrates how to open a file, write to it, and then read from it:

[VBScript]
Sub ReadFiles
   Dim fso, f1, ts, s
   Const ForReading = 1
   Set fso = CreateObject("Scripting.FileSystemObject")
   Set f1 = fso.CreateTextFile("c:\testfile.txt", True)
   ' Write a line.
   Response.Write "Writing file <br>"
   f1.WriteLine "Hello World"
   f1.WriteBlankLines(1)
   f1.Close
   ' Read the contents of the file.
   Response.Write "Reading file <br>"
   Set ts = fso.OpenTextFile("c:\testfile.txt", ForReading)
   s = ts.ReadLine
   Response.Write "File contents = '" & s & "'"
   ts.Close
End Sub
[JScript]
function ReadFiles()
{
   var fso, f1, ts, s;
   var ForReading = 1;
   fso = new ActiveXObject("Scripting.FileSystemObject");
   f1 = fso.CreateTextFile("c:\\testfile.txt", true);
   // Write a line.
   Response.Write("Writing file <br>");
   f1.WriteLine("Hello World");
   f1.WriteBlankLines(1);
   f1.Close();
   // Read the contents of the file.
   Response.Write("Reading file <br>");
   ts = fso.OpenTextFile("c:\\testfile.txt", ForReading);
   s = ts.ReadLine();
   Response.Write("File contents = '" + s + "'");
   ts.Close();
}
Moving, Copying, and Deleting Files
The FSO object model has two methods each for moving, copying, and deleting files, as described in the following table.

Task Method 
Move a file File.Move or FileSystemObject.MoveFile 
Copy a file File.Copy or FileSystemObject.CopyFile 
Delete a file File.Delete or FileSystemObject.DeleteFile 

The following example creates a text file in the root directory of drive C, writes some information to it, moves it 
to a directory called \tmp, makes a copy of it in a directory called \temp, then deletes the copies from both directories.

To run the following example, create directories named \tmp and \temp in the root directory of drive C:

[VBScript]
Sub ManipFiles
   Dim fso, f1, f2, s
   Set fso = CreateObject("Scripting.FileSystemObject")
   Set f1 = fso.CreateTextFile("c:\testfile.txt", True)
   Response.Write "Writing file <br>"
   ' Write a line.
   f1.Write ("This is a test.")
   ' Close the file to writing.
   f1.Close
   Response.Write "Moving file to c:\tmp <br>"
   ' Get a handle to the file in root of C:\.
   Set f2 = fso.GetFile("c:\testfile.txt")
   ' Move the file to \tmp directory.
   f2.Move ("c:\tmp\testfile.txt")
   Response.Write "Copying file to c:\temp <br>"
   ' Copy the file to \temp.
   f2.Copy ("c:\temp\testfile.txt")
   Response.Write "Deleting files <br>"
   ' Get handles to files' current location.
   Set f2 = fso.GetFile("c:\tmp\testfile.txt")
   Set f3 = fso.GetFile("c:\temp\testfile.txt")
   ' Delete the files.
   f2.Delete
   f3.Delete
   Response.Write "All done!"
End Sub
[JScript]
function ManipFiles()
{
   var fso, f1, f2, s;
   fso = new ActiveXObject("Scripting.FileSystemObject");
   f1 = fso.CreateTextFile("c:\\testfile.txt", true);
   Response.Write("Writing file <br>");
   // Write a line.
   f1.Write("This is a test.");
   // Close the file to writing.
   f1.Close();
   Response.Write("Moving file to c:\\tmp <br>");
   // Get a handle to the file in root of C:\.
   f2 = fso.GetFile("c:\\testfile.txt");
   // Move the file to \tmp directory.
   f2.Move ("c:\\tmp\\testfile.txt");
   Response.Write("Copying file to c:\\temp <br>");
   // Copy the file to \temp.
   f2.Copy ("c:\\temp\\testfile.txt");
   Response.Write("Deleting files <br>");
   // Get handles to files' current location.
   f2 = fso.GetFile("c:\\tmp\\testfile.txt");
   f3 = fso.GetFile("c:\\temp\\testfile.txt");
   // Delete the files.
   f2.Delete();
   f3.Delete();
   Response.Write("All done!");
}


####################################################################
PART II: VB6
####################################################################


Remark:

Most of the time you will create a form associated with your program.
To close you program: make a Quit or Exit button on your form
with the following eventcode:

Private Sub btnQuit_Click()
  End
End Sub


================================================
1. Call a  SQL Server stored procedure with ADO:
================================================


Example 1: call a stored procedure, no parameters
-------------------------------------------------

To call a SQL Server stored procedure, for example "set_bezig", from VB, use code like:

Private Sub Command1_Click()
Dim oConn
  Dim oCmd
  Dim oRs
  Dim tmpBody
  Set oConn = CreateObject("ADODB.Connection")
  oConn.Open ("DATABASE=aida;DSN=MDB;UID=karel;Password=karel;")
  Set oCmd = CreateObject("ADODB.Command")
  oCmd.ActiveConnection = oConn
  oCmd.CommandText = "exec set_bezig"
  oCmd.CommandType = 1
  oCmd.Prepared = True
  Set oRs = oCmd.Execute
  
  Set oRs = Nothing
  Set oCmd = Nothing
  Set oConn = Nothing
  
End Sub

The procedure "set_bezig" in the above example, does something, and in this case
we do not need to pass parameter.

So "set_bezig" could for example be as simple as

  create procedure set_bezig
  as 
  update CLR_ADMIN
  set BEZIG='J'
  GO


Example 2: call a stored procedure with parameters
--------------------------------------------------

Suppose you have a form, with a Textbox and a Command button.
In the textbox, a name can be filled in, and this must go
to a table in SQL server.
So now we use a stored procedure "fill_y", which takes a parameter.

Private Sub Command1_Click()
Dim oConn
  Dim oCmd
  Dim oRs
  Dim tmpBody
  Dim name As String
  name = txtInput.Text
  Set oConn = CreateObject("ADODB.Connection")
  oConn.Open ("DATABASE=test;DSN=MDB;UID=klaas;Password=klaas;")
  Set oCmd = CreateObject("ADODB.Command")
  oCmd.ActiveConnection = oConn
  oCmd.CommandText = "exec fill_y " & name
  oCmd.CommandType = 1
  oCmd.Prepared = True
  Set oRs = oCmd.Execute
  
  Set oRs = Nothing
  Set oCmd = Nothing
  Set oConn = Nothing
  
End Sub


  create procedure fill_y @name varchar(10)
  as
  insert sales
  (name)
  values
  (@name)
  go


================================================
2. Call a  SQL Server stored procedure with RDO:
================================================

Private Sub Command1_Click()

Dim rs As rdoResultset
Dim cn As New rdoConnection
Dim qd As New rdoQuery
Dim cl As rdoColumn
	
cn.Connect = "uid=sa;pwd=;server=MyServer;" _
    & "driver={SQL Server};database=pubs;" _
    & "DSN='ABC';"
cn.CursorDriver = rdUseOdbc
cn.EstablishConnection rdDriverNoPrompt

Set qd.ActiveConnection = cn
qd.SQL = "{ ? = call dbo.ByRoyalty (?) }"
    
qd(0).Direction = rdParamReturnValue
qd(1).Direction = rdParamInput

qd.rdoParameters(1) = 100

Set rs = qd.OpenResultset(rdOpenForwardOnly, rdConcurReadOnly)

For Each cl In rs.rdoColumns
    Debug.Print cl.Name,
Next
    Debug.Print

   Do Until rs.EOF
     For Each cl In rs.rdoColumns
        Debug.Print cl.Value,
	    Next
	    rs.MoveNext
	    Debug.Print
        Loop

	rs.Close
	qd.Close
	cn.Close

End Sub


============================
3. Call an Oracle procedure:
============================

Suppose we have the following simple tables:

create table sales1
(
cust_name varchar2(10)
);

create table sales2
(
cust_id number,
cust_name varchar2(10)
);

Suppose we have the following simple procedures, which fills a table:


create or replace procedure ins_sales2
as
begin
insert into sales2 values (1,'Joop');
end;
/

create or replace procedure ins_sales1_parm (cust_name in varchar2)
as
begin
insert into sales1 values (cust_name);
end;
/


Example: Call a Oracle procedure without parameters with ADO:
-------------------------------------------------------------

Suppose we create a DSN with the Oracle ODBC driver:

Private Sub Command1_Click()
Dim oConn
  Dim oCmd
  Dim oRs
  Dim tmpBody
  Set oConn = CreateObject("ADODB.Connection")
  oConn.Open ("DATABASE=o901;DSN=MISKM;UID=mis_owner;Password=mis_owner;")
  Set oCmd = CreateObject("ADODB.Command")
  oCmd.ActiveConnection = oConn
  oCmd.CommandText = "ins_sales2"
  'oCmd.CommandType = 1
  'oCmd.Prepared = True
  Set oRs = oCmd.Execute
  
  Set oRs = Nothing
  Set oCmd = Nothing
  Set oConn = Nothing
  
End Sub


Example: Call an Oracle procedure with parameters with ADO:
----------------------------------------------------------

Suppose we have a simple procedure, which fills a table:


Private Sub Command1_Click()
Dim oConn
  Dim oCmd
  Dim oRs
  Dim tmpBody
  Dim cust_name As String
  Set oConn = CreateObject("ADODB.Connection")
  oConn.Open ("DATABASE=o901;DSN=MISKM;UID=mis_owner;Password=mis_owner;")
  Set oCmd = CreateObject("ADODB.Command")
  oCmd.ActiveConnection = oConn
  oCmd.CommandText = "ins_sales_parm"
  'objCmd.CommandText = "{ CALL Employees.GetEmpRecords(?,?) }"

  'oCmd.CommandType = 1
  'oCmd.Prepared = True
  Set oRs = oCmd.Execute
  
  Set oRs = Nothing
  Set oCmd = Nothing
  Set oConn = Nothing
  
End Sub


Example: Call an Oracle procedure with parameters with RDO:
-----------------------------------------------------------


CREATE TABLE rdooracle 
(
item_number    NUMBER(3) PRIMARY KEY,
depot_number   NUMBER(3)
);

CREATE OR REPLACE PROCEDURE rdoinsert (insnum IN NUMBER, outnum OUT NUMBER)
IS
BEGIN
  INSERT INTO rdooracle
  (Item_Number, Depot_Number)
   VALUES
  (insnum, 16);
  outnum := insnum/2;
END;


-- add a reference to Microsoft Remote Dataobjects msrdo20.dll

-- create a form like:

      Control     Name     Text/Caption
      ---------------------------------
      Button      cmdCheck  Check
      Button      cmdSend   Send
      Text Box    txtInput
      Label       lblInput  Input:

-- Code:

Option Explicit
      Dim Cn As rdoConnection
      Dim En As rdoEnvironment
      Dim CPw As rdoQuery
      Dim Rs As rdoResultset
      Dim Conn As String
      Dim QSQL As String
      Dim Response As String
      Dim Prompt As String

      Private Sub cmdCheck_Click()

          QSQL = "Select Item_Number, Depot_Number From rdooracle Where " _
          & "item_number =" & txtInput.Text
          Set Rs = Cn.OpenResultset(QSQL, rdOpenStatic, , rdExecDirect)

          Prompt = "Item_Number = " & Rs(0) & ".  Depot_Number = " _
          & Rs(1) & "."

          Response = MsgBox(Prompt, , "Query Results")

          Rs.Close

      End Sub

      Private Sub cmdSend_Click()

          CPw(0) = Val(txtInput.Text)
          CPw.Execute

          Prompt = "Return value from stored procedure is " & CPw(1) & "."
          Response = MsgBox(Prompt, , "Stored Procedure Result")

      End Sub

      Private Sub Form_Load()
    
          Conn = "UID=mis_owner;PWD=mis_owner;driver={Microsoft ODBC voor Oracle};" _
               & "CONNECTSTRING=o901;"

          Set En = rdoEnvironments(0)
          Set Cn = En.OpenConnection("", rdDriverPrompt, False, Conn)
          QSQL = "{call rdoinsert(?,?)}"
          Set CPw = Cn.CreateQuery("", QSQL)

      End Sub

      Private Sub Form_Unload(Cancel As Integer)

          En.Close

      End Sub


======================
4. VB AND COM or DCOM:
======================

Creating Objects, Local and Remote.
-----------------------------------

One of the most basic requirements of a distributed system is the ability to create components. 
In the COM world, object classes are named with globally unique identifiers, or GUIDs. 
When GUIDs are used to refer to particular classes of objects, they are called Class IDs. These Class IDs are nothing more 
than fairly large integers (128 bits) that provide a collision free, decentralized namespace for object classes. 

If a COM programmer wants to create a new object, he or she calls one of several functions in the COM libraries. 


-CoCreateInstance(Ex) (<CLSID>) 
 Creates an interface pointer to an uninitialized instance of the object class<CLSID>. 

-CoGetInstanceFromFile 
 Creates a new instance and initializes it from a file. 

-CoGetInstanceFromIStorage 
 Creates a new instance and initializes it from storage. 

-CoGetClassObject (<CLSID>) 
 Returns an interface pointer to a "class factory object" that can be used to 
 create one or more  uninitialized instances of the object class <CLSID>. 

-CoGetClassObjectFromURL Returns an interface pointer to a "class factory object" for a given class. 
 If no class is specified, this function will choose the appropriate class for a specified MIME type. 
 If the desired object is installed on the system, it is instantiated. 
 Otherwise, the necessary code is downloaded and installed from a specified URL. 


The COM libraries look up the appropriate binary code (dynamic-link library or executable) 
in the system registry, create the object, and return an interface pointer to the caller. 

For DCOM, the object creation mechanism in the COM libraries is enhanced to allow object creation 
on other machines. In order to be able to create a remote object, the COM libraries need to know the network name 
of the server. Once the server name and the CLSID are known, a portion of the COM libraries called the 
Service Control Manager, or SCM, on the client machine connects to the SCM on the server machine 
and requests creation of the object. 

DCOM provides two fundamental mechanisms for allowing clients to indicate the remote server name when 
an object is created. The remote server name can be indicated:

- As a fixed configuration in the system registry or in the DCOM Class Store.
- As an explicit parameter to CoCreateInstanceEx, CoGetInstanceFromFile, CoGetInstanceFromStorage, or CoGetClassObject.

External RemoteServerName configuration
---------------------------------------

The first mechanism, indicating the remote server name as a fixed configuration, is extremely useful for 
maintaining location transparency: 
clients need not know whether a component is running locally or remotely. When the remote server name 
is made part of the server component's configuration information on the client machine, clients do not 
have to maintain or obtain the server location. 
All a client ever needs to know is the CLSID of the component. 

It simply calls CoCreateInstance (or CreateObject in Visual Basic�, or "new" in Java), 
and the COM libraries transparently create the correct component on the preconfigured server. 
Even existing COM clients that were designed before the availability of DCOM can transparently use 
remote components using this mechanism. 

So the familiar "CreateObject" statement in VB can be associated to a more fundamental COM or DCOM functions 
in the COM libraries.

OLE Servers (VB4), ActiveX (VB5) en COM:
----------------------------------------

COM components were known (to some extend) as OLE Servers in VB4 and ActiveX in VB5 environments.
Loosely speaking, ActiveX and COM are the same thing.


===============================
5. Examples with queries in VB:
===============================

Example 1:
----------

Dim Mydb As Database
Dim strSQL As String

Set Mydb = CurrentDb

strSQL = "UPDATE tblRefuel SET odometer = " & Me!ComboBox & " WHERE VehID = " & Me!VehID
Mydb.Execute strSQL

Mydb.Close

Example 2:
----------

DoCmd.RunSQL is used like this:

Dim strSQL As String

strSQL = "UPDATE TableName SET FieldName = " & Me.TextBox & " WHERE IDField = " & Me.ControlWithIDValue

DoCmd.RunSQL strSQL

The DoCmd.RunSQL can ONLY be used with action queries such as INSERT, UPDATE, DELETE, 
CREATE TABLE, DROP TABLE, TRUNCATE TABLE, etc.  You CANNOT use it to return a recordset (i.e. SELECT statement).

Example 3:
----------

DoCmd.RunSQL "DELETE FROM tblTest"

CurrentDB.Execute "DELETE FROM tblTest"

DoCmd.RunSQL "DELETE * FROM Artikel WHERE [Artikel-Nr] = Forms!frmArtikel![Artikel-Nr]"

CurrentDB.Execute "DELETE * FROM Artikel WHERE [Artikel-Nr] = " & Forms!frmArtikel![Artikel-Nr]

Public Function Beispiel()

    Dim db As Database
    Dim qdf As QueryDef
    Set db = CurrentDb
    Set qdf = db.QueryDefs("qryAnf�geabfrageMitParameter")
    qdf.Parameters("Kategorie") = "Neue Kategorie"
    On Error Resume Next
    qdf.Execute dbFailOnError
    If Not Err = 0 Then
        MsgBox "Fehler: " & Err.Number & vbCrLf & "Fehlerbeschreibung: " & Err.Description
    End If
End Function


Example 4:
----------

There are a number of ways to execute a SQL Data Manipulation Language (DML) statement from Microsoft Access, 
besides the obvious process of creating an Action Query and double-clicking its icon. Understanding your options 
with respect to executing SQL will make your code cleaner, give you more flexibility and we'll even throw in a 
great debugging feature as an added bonus. 

The following article, while not exploring every facet and option, will demonstrate how to execute SQL using 
the following methods: 

DoCmd.RunSQL 
DoCmd.OpenQuery 
[Querydef].Execute 
[Database].Execute 
dbFailOnError 
Saved Queries verses Embedded SQL

For the sake of this discussion, a differentiation will be made between a saved query object in Microsoft Access 
and a raw SQL statement.  When you read the word query in the text below, understand it to be a prepared, 
saved and tested Querydef object in Microsoft Access.  Read SQL to be raw, embedded SQL in the VBA code itself.  

This becomes important for the following reasons:

    The RunSQL object cannot execute a saved query
    The OpenQuery object cannot execute raw SQL
    The Querydef object requires a saved query

The demo download code for this includes a simple form that displays both the actual SQL statements and the 
VBA code to execute them.  The application is designed to stop in debug mode so you may follow the execution 
in the code module itself.  Saved queries are used where necessary but I have used embedded SQL everywhere possible. 


RunSQL
RunSQL is a method of the DoCmd object in Microsoft Access.  It is designed for DML SQL, such as 
UPDATE, INSERT and DELETE statements.  You cannot "execute" a SELECT query so the RunSQL method will fail if you 
attempt to pass a select statement to it.

As mentioned above, RunSQL requires the actual SQL statement, not the name of a saved query.  
The SQL string may be passed as a literal or through a variable, as follows:

      DoCmd.RunSQL "UPDATE titles SET price = price * 1.10

or ...

     sSQL = "UPDATE titles SET price = price * 1.10
     DoCmd.RunSQL sSQL 

The effect to the user is the same as if a query object had been double-clicked.  If warnings are enabled, 
the user will be informed of how many records will be affected and given the standard error report in the case 
of failures.  We will discuss errors in more detail shortly.

One advantage of this method is that it is a quick, simple way to execute simple SQL updates and deletes. 
The down side is that some SQL statements, especially inserts, can get very complicated very quickly 
so the sSQL variable becomes difficult to manage and debug.  In addition, if you do not want users to be bothered
with the standard Access warning messages, you will have to toggle them off and back on after the procedure.

OpenQuery  
The OpenQuery method solves the first of the above-mentioned problems: knarly SQL statements.  
It is very easy to create complex INSERT, UPDATE and DELETE queries from the Microsoft Query By Example (QBE) grid 
and save them as a Querydef object.  Once saved, they may be executed using the OpenQuery command of the DoCmd object.

  DoCmd.OpenQuery "qMkTbl_sales_bkup"

This does not, however, address the issue of warnings that require user intervention to complete 
the query transaction.  If you want to be sure that the query runs without the user knowing or being able 
to terminate, you need to turn off the warnings, like this ...

  DoCmd.SetWarnings False
  DoCmd.OpenQuery "qMkTbl_sales_bkup"
  DoCmd.SetWarnings True 

Now, there's a slight issue with this approach as well.  It assumes that warnings are enabled.  
What if the user already has them turned off?  Well, the above code will turn them on, which could 
irritate the user.  I once wrote some code to determine whether or not warnings were enabled and return 
the setting to the previous state after executing, but that is extra code, and there's an easier way to 
handle this issue.


Example 5:
----------

Access 2000 and higher:

Sub PickRandom()
   Dim db As DAO.Database
   Dim tdf As DAO.TableDef
   Dim fld As DAO.Field
   Dim rst As DAO.Recordset
   Dim strSQL As String
   Dim strTableName As String

Access 97:

Sub PickRandom()
    Dim db As Database
    Dim tdf As TableDef
    Dim fld As Field
    Dim rst As Recordset
    Dim strSQL As String
    Dim strTableName As String

Code

' 1: Create a new temporary table containing the required fields
    strSQL = "SELECT tblStaff.Firstname, tblStaff.Lastname " & _
             "INTO tblTemp " & _
             "FROM tblStaff;"
    DoCmd.SetWarnings False
    DoCmd.RunSQL strSQL
    DoCmd.SetWarnings True
    
' 2: Add a new field to the new table
    Set db = CurrentDb()
    Set tdf = db.TableDefs("tblTemp")
    Set fld = tdf.CreateField("RandomNumber", dbSingle)    
    tdf.Fields.Append fld

' 3: Place a random number in the new field for each record
    Set rst = db.OpenRecordset("tblTemp", dbOpenTable)    
    rst.MoveFirst
    Do
        Randomize
        rst.Edit
            rst![RandomNumber] = Rnd()
        rst.Update
        rst.MoveNext
    Loop Until rst.EOF    
    rst.Close
    Set rst = Nothing
    
' 4: Sort the data by the random number and move the top 25 into a new table
    strTableName = "tblRandom_" & Format(Date, "ddmmmyyyy")
    strSQL = "SELECT TOP 25 tblTemp.Firstname, tblTemp.Lastname " & _
             "INTO " & strTableName & " " & _
             "FROM tblTemp " & _
             "ORDER BY tblTemp.RandomNumber;"
    DoCmd.SetWarnings False
    DoCmd.RunSQL strSQL
    DoCmd.SetWarnings True

' 5: Delete the temporary table
    db.TableDefs.Delete ("tblTemp")
End Sub


Example 6:
----------

Discussion from a forum:

--
Dim Mydb As Database
Dim strSQL As String

Set Mydb = CurrentDb

strSQL = "UPDATE tblRefuel SET odometer = " & Me!ComboBox & " WHERE VehID = " & Me!VehID
Mydb.Execute strSQL

Mydb.Close
--


-- 
Accepted Answer from morpheus30 
Date: 11/20/2003 07:31PM PST
Grade: B
 Accepted Answer  


DoCmd.RunSQL is used like this:
Dim strSQL As String

strSQL = "UPDATE TableName SET FieldName = " & Me.TextBox & " WHERE IDField = " & Me.ControlWithIDValue

DoCmd.RunSQL strSQL

The DoCmd.RunSQL can ONLY be used with action queries such as INSERT, UPDATE, DELETE, CREATE TABLE, DROP TABLE, 
TRUNCATE TABLE, etc.  You CANNOT use it to return a recordset (i.e. SELECT statement).
 
Assisted Answer from thorkyl 
Date: 11/21/2003 10:27AM PST
Grade: B
 Assisted Answer  

--
currentdb.execute ("UPDATE TableName SET FieldName = " & Me.TextBox & " WHERE IDField = " & Me.ControlWithIDValue)

and you dont get any warnings

or

as everyone above

--
Just add

docmd.setwarnings=false
DoCmd.RunSQL strSQL
docmd.setwarnings=true


DoCmd.RunSQL "CREATE TABLE tblTest ([StaffID] COUNTER 
CONSTRAINT ndxStaffID PRIMARY KEY, [FirstName] TEXT(25), 
[LastName] TEXT(30), [BirthDate] DATETIME);�


SELECT * INTO x
FROM lpars;


Example 7:
----------

MS Access 2000 and MS Access 2002 (part of MS Office XP pro) do not allow the MS Access 97 code to work. 
Specifically, CurrentDb and some of the other commands won't work. Instead, you need something like this. 

Dim cnn As ADODB.Connection
Dim rst1 As New ADODB.Recordset

 Set cnn = CurrentProject.Connection
 Query = "Select * from Alias_Details_tbl where Purpose='" & Alias_UICombo & "'"
 rst1.Open Query, cnn, adOpenKeyset, adLockOptimistic, adCmdTableDirect

 Do Until rst1.EOF
 
    ' Place your code here
   str1 = rst1!FieldName
 Loop

 rst1.Close


This syntax works for queries that don't return a record set. 
DoCmd.RunSQL "Delete * from Aliases_tbl"


==============================
6.
==============================


==================
7. CODE FRAGMENTS:
==================


Fragment 1. Use of Internet Control:
-------------------------------------

Navigate to a site:
-------------------

Public Explorer As SHDocVw.InternetExplorer

Private Sub Command1_Click()
  On Error GoTo errorhandler
  Set Explorer=New SHDocVw.InternetExplorer
  Explorer.Visible=True
  Explorer.Navigate Combo1.Text
  Exit Sub
errorhandler:
  MsgBox "Error displaying file", Err.Description
End Sub

Private Sub Form_Load()
  Combo1.AddItem "http:/www.antapex.org"
  Combo1.AddItem "http:/www.abc.com"
  Combo1.AddItem "http:/www.xyz.com"
End Sub


Download of a file:
-------------------

The two most important things to know about the ITC is that there are two methods 
of downloading files from a web site - the OpenURL method and the Execute method. 
Both support the FTP and HTTP file transfer protocols. 

The OpenURL method is very simple. You put in a file name to download and tell 
the program whether the file is all text or binary. The code looks for 
an HTTP transfer of a text file looks like this:

text1.text = inet1.OpenURL ("http://www.vbinformation.com/badclick.htm", icString)

The code for an HTTP transfer of a binary file looks like this:

Dim bData() as Byte
bData() = inet1.OpenURL ("http://www.vbinformation.com/badclick.htm", icByteArray)

Since all files (text or binary) can be transferred as a binary file, I used the 
same file name in both examples. Note that in the first case, the downloaded file content 
is placed in a textbox named 'text1'. In the second case, the downloaded file content is saved i
n a Byte array whose upper bound is set by the number of bytes downloaded by the OpenURL method. 
Also, note that both examples use HTTP URLs, but FTP URLs could have been used just as readily. 

In case you don't remember, an easy way to save the bData byte array is:


Open "filename" for Binary as #1
Put #1, , bData()
Close #1

This is really all there is to successfully downloading a file by using the OpenURL method.

The second method for downloading a file is the Execute method.
inet1.Execute ("ftp://www.microsoft.com", "DIR")

This command transfers the directory listing of the Microsoft ftp site. 
Note than while the OpenURL method returns data to a variable or an array, 
the Execute method does not! The data returned by the Execute method will either be kept 
within the ITC's buffer, or be directed to a file according to the specifics of the command it is given. 
The Execute method actually supports 14 FTP commands (which are placed in the 'operation' argument), 
but there are primarily three (CD, GET, and PUT) which you will use most often:


inet1.Execute ("ftp://www.microsoft.com", "CD newdirectory" 
inet1.Execute ("ftp://www.microsoft.com", "GET remotefile localfile" 
inet1.Execute ("ftp://www.microsoft.com", "PUT localfile remotefile" 


Fragment 2: use of WinApi:
--------------------------

AddIn Manager: Api Viewer

Public Declare Sub GlobalMemoryStatus Lib "kernel32" _
(lpBuffer As MEMORYSTATUS)

Public Declare Function SetEnvironmentVariable Lib "kernel32" _
Alias "SetEnvironmentVariableA" (ByVal lpName As String, ByVal lpValue As String) As Long

Declare Function ShellExecute Lib "shell32.dll" Alias "ShellExecuteA" _
      (ByVal hwnd As Long, ByVal lpOperation As String, ByVal lpFile As String, _
      ByVal lpParameters As String, ByVal lpDirectory As String, _
      ByVal nShowCmd As Long) As Long

 Const SW_SHOWNORMAL = 1 

Private Sub mnuHomepage_Click()
     Dim rc As Long
        rc = ShellExecute(Me.hwnd, "Open", "http://www.abc-ware.de/",_
         "", "", SW_SHOWNORMAL)
     End Sub

     Private Sub mnuEMail_Click()
     Dim rc As Long
        rc = ShellExecute(Me.hwnd, "Open", _
         "mailto:Name@domain.de?Subject=Mathe-Max", "", "", SW_SHOWNORMAL)
     End Sub
 
Question
 
What is a hWnd and what can it be used for?
Code examples for different kinds of usage are welcome.

Answer
 
An hWnd is a Handle to a Window if you will.  A handle is a 
long integer generated by the operating system so it can 
keep track of the all the objects (a form, a command button 
etc.)  

You can't set a hwnd at design or runtime, and the value of 
the handle changes each time the form is opened. Handles 
are used when you make calls to API functions, the function 
needs to know the handle of the window, plus other 
arguements depending on the what
the API does.

The GetWindowsText API frx:

Declare Function GetWindowText Lib "user32" alias _ 
"GetWindowTextA" (byval Hwnd as long, byval lpstring as _ 
string, byval cch as long) as Long

Assuming the object in question is a form, passing the hwnd 
of the form to the api will cause a search to performed in 
windows internal data structures looking for that handle, 
and then return what text is in the forms title bar
or caption.
 

Fragment 3: How to use shell:
-----------------------------

Examples:
---------

Shell "C:\Program Files\Microsoft Office\Office\Winword.exe " & _
  Chr$(34) & "C:\My Documents\Mydoc.doc" & Chr$(34)

Shell "c:\project\create_db.bat"


Fragment 4: Shell and wait:
---------------------------

The ShellAndWait subroutine uses the Shell function to start the other program. 
It calls the OpenProcess API function to connect to the new process and then uses 
WaitForSingleObject to wait until the other process terminates. Note that neither the program 
nor the development environment can take action during this wait. 
After WaitForSingleObject returns, the ShellAndWait subroutine calls CloseHandle 
to close the process handle opened by OpenProcess and then exits 
at which point the program resumes normal execution. 
 
 
' Start the indicated program and wait for it
' to finish, hiding while we wait.
Private Sub ShellAndWait(ByVal program_name As String, _
    ByVal window_style As VbAppWinStyle)
Dim process_id As Long
Dim process_handle As Long

    ' Start the program.
    On Error GoTo ShellError
    process_id = Shell(program_name, window_style)
    On Error GoTo 0

    ' Hide.
    Me.Visible = False
    DoEvents

    ' Wait for the program to finish.
    ' Get the process handle.
    process_handle = OpenProcess(SYNCHRONIZE, 0, process_id)
    If process_handle <> 0 Then
        WaitForSingleObject process_handle, INFINITE
        CloseHandle process_handle
    End If

    ' Reappear.
    Me.Visible = True
    Exit Sub

ShellError:
    MsgBox "Error starting task " & _
        txtProgram.Text & vbCrLf & _
        Err.Description, vbOKOnly Or vbExclamation, _
        "Error"
End Sub
 

Or use this:
------------

Private Declare Function OpenProcess Lib "Kernel32" (ByVal dwDesiredAccess As Long, ByVal bInheritHandle As Long, ByVal dwProcessId As Long) As Long
Private Declare Function GetExitCodeProcess Lib "Kernel32" (ByVal hProcess As Long, lpExitCode As Long) As Long
Private Declare Sub Sleep Lib "Kernel32" (ByVal dwMilliseconds As Long)
Const STILL_ACTIVE = &H103
Const PROCESS_QUERY_INFORMATION = &H400

Private Sub Shell32Bit(ByVal JobToDo As String)
        Dim hProcess As Long
        Dim RetVal As Long
        hProcess = OpenProcess(PROCESS_QUERY_INFORMATION, False, Shell(JobToDo, vbHide))
        Do
            GetExitCodeProcess hProcess, RetVal
            DoEvents: Sleep 100
        Loop While RetVal = STILL_ACTIVE
End Sub

Private Sub Command1_Click()
Shell32Bit "command.com /c ipconfig > C:\tmpp"
MsgBox "Complete"
End Sub

Or use this:
------------

'
'  Runs a command as the Shell command does but waits for the command
'  to finish before returning.  Note: The full path and filename extention
'  is required.
'  You might want to use Environ$("COMSPEC") & " /c " & command
'  if you wish to run it under the command shell (and thus it)
'  will search the path etc...
'
'  returns false if the shell failed
'
Public Function ShellWait(ByVal cCommandLine As String) As Boolean
   Dim NameOfProc As PROCESS_INFORMATION
   Dim NameStart As STARTUPINFO
   Dim i As Long

   NameStart.cb = Len(NameStart)
   i = CreateProcessA(0&, cCommandLine, 0&, 0&, 1&, _
       NORMAL_PRIORITY_CLASS, 0&, 0&, NameStart, NameOfProc)
  
   If i <> 0 Then
      Call WaitForSingleObject(NameOfProc.hProcess, INFINITE)
      Call CloseHandle(NameOfProc.hProcess)
      ShellWait = True
   Else
      ShellWait = False
   End If
   
End Function


Fragment 4. Multiple forms:
---------------------------

Private Sub List1_Click()
If List1.ListIndex=0 Then
   Form1.WindowState = 2 ' Maximize
Elseif List1.ListIndex =1 Then 
   Load Form2
   Form2.Show
Elseif List1.Listindex =2 Then
   Load MDIForm1
   MDIForm1.Show
Else
   Form1.WindowState = 0 ' Normal
End If

End Sub


Sub Main()
    Load Form1
    Form1.Show
End Sub

Private Sub Command1_Click()
frmChangeDir.Hide
End Sub

Private Sub Command2_Click()
   Load frmCreateDir
   frmCreateDir.Show
End Sub


Fragment 5. textFile:
---------------------

Private Sub mnuItemOpen_Click()
  Wrap$=Chr$(13)+Chr$(10)  'creates a wrap character
  CommonDialog1.Filter="Text files (*.TXT) | *.TXT"
  CommonDialog1.ShowOpen
  If CommonDialog1.Filename<> "" Then
     Form1.MousePointer=11  'display hourglass
     Open CommonDialog1.FileName For Input As #1
     On Error GoTo TooBig:
     Do Until EOF(1) 'then read lines from file
        Line Input #1, LineOfText$
        AllText$ = AllText$ & LineOfText$ & Wrap$
     Loop
     lblFile.Caption=CommonDialog1.FileNmae
     txtFile.Text=AllText$  'display the file
     txtFile.Enabled=True
     mnuItemOpen.Enabled=False
Cleanup:
     Form1.MousePointer=0
     Close #1
End If
Exit Sub
TooBig:
   MsgBox ("The file is too big.")
   Resume Cleanup:  'jumps to Cleanup routine
End Sub

Private Sub mnuItemExit_Click()
  End
End Sub

Private Sub mnuItemSave_Click()
' the entire file is stored in a string
CommonDialog1.Filter = "Text files (*.TXT)|*.TXT"
CommonDialog1.ShowSave  'display Save Dialog
If CommonDialog1.FileName<>"" Then
   Open CommonDialog1.FileName For Output As #1
   Print #1, txtNote.Text  - name of TextBox
   Close #1
End If
End Sub

Private mnuItemClose_Click()
  txtFile.Text=""
  lblFile.Caption="Load a text file with the Open command"
  mnuItemClose.Enabled=False
  mnuItemOpen.Enabled=True
  txtFile.Enabled=False
End Sub


Fragment 6: write to file:
--------------------------

Dim iFileNum As Integer

'Get a free file handle
iFileNum = FreeFile

'If the file is not there, one will be created
'If the file does exist, this one will overwrite it.
Open App.Path & "\MyFile.txt" For Output As iFileNum

Print #iFileNum, Text1.Text

Close iFileNum
'--end code block

Fragment 7: write to file:
--------------------------

Open "Result.txt" For output as #1
print #1, ResultVariable
close #1
' Then in your other VB Prog read it in

Open "Result.txt" For Input As #1
Line Input #1, ResultVaraiable
Close #1

Fragment 8. write to file:
--------------------------

Private Sub Command1_Click()
      Dim fnum As Integer
      Dim s As String
      Dim fname As String
      Dim winPath As String
      On error goto ErrReadTextFile
      fnum = FreeFile
      ' get the windows folder name
      winPath = Environ$("SystemRoot")
      If winPath = "" Then
         MsgBox "Unable to retrieve the Windows path.", _
            vbInformation, "Error Reading Windows Path"
         Exit Sub
      End If
      ' create a file name
      fname = winPath & "\win.ini"
      ' ensure the file exists
      If Dir$(fname) <> "" Then
         ' open the file
         Open fname For Binary As #fnum
         If Err.Number = 0 Then
            s = Space$(LOF(fnum))
            ' read the file
            Get #fnum, 1, s
            Close #fnum
            Text1.Text = s
         Else
            Text1 = "Unable to read the file " & _
            fname & "."
         End If
      Else
         Text1 = "The file " & fname & " does not exist."
      End If
   ExitReadTextFile:
      Exit Sub
   ErrReadTextFile:
      MsgBox Err.Number & ": " & Err.Description
      Exit Sub
   End SubEnd Sub


Fragment 9. Write to file:
--------------------------

   Dim fno As Integer, s As String
   ' calculate string to return
   s = "Fred"
   fno = FreeFile
   Open "c:\tmp.bat" For Binary As #fno
   Put #fno, , "SET vbvar=""" & s & """"
   Close #fno

This will write a small .bat file to set the environment variable
then your .bat file should look like :

 c:\myprog.exe
 call c:\tmp.bat
 echo %vbvar%

Fragment 10. Write to file:
---------------------------

Dim BatchFile As String
BatchFile = "C:\tmpbatch.bat"
Open BatchFile For Output As #1
Print #1, "start C:\count.cmd"
Close #1
Shell BatchFile, vbMinimizedNoFocus 

Fragment 11. Write to file:
---------------------------

   Dim EnvString, Indx
   Indx = 1
   Do
       EnvString = Environ(Indx)
       If ucase(EnvString) = "DEBUG=ON" Then
           IsDebugMode = True
           Exit Do
       End If
       Indx = Indx + 1   ' Not PATH entry,
   Loop Until EnvString = 


Fragment 12. CommonDialog:
--------------------------

Common Dialog/Direct is a new DLL or class library which shows how 
to completely replace COMDLG32.OCX through Visual Basic code. 
The main advantage of this is you no longer need to put a control on a form t
o use common dialogs - just declare an instance of the class and you have 
a straight replacement. You can also incorporate the Common Dialog/Direct code 
straight into your own project if you want to reduce dependency files 
when you ship your project. 

    Dim c As New cCommonDialog 
    With c 
        .DialogTitle = "Choose Text FIle" 
        .CancelError = True 
        .hWnd = Me.hWnd 
        .flags = OFN_FILEMUSTEXIST Or OFN_PATHMUSTEXIST 
        .InitDir = "C:\STEVEMAC" 
        .Filter = "Internet documents (*.HTM)|*.HTM|Text files (*.TXT)|*.TXT|All Files (*.*)|*.*" 
        .FilterIndex = 1 
        .ShowOpen 
        
        txtFileName = .filename 
        txtFilter = .Filter 
        txtContents = GetFileText(.filename) 
        
    End With 


Fragment 13. Complete program example:
--------------------------------------

Private Sub btnInstall_Click()
If SequenceStatus <> 1 Then
   MsgBox ("You must first perform Step 1 (download the create scripts), before you can begin installing.")
Else
   
'STEP 1. First create setup.ini for unattended msde install
'--------------------------------------------------
iFileNum = FreeFile

Open TmpPath & "\setup.ini" For Output As iFileNum

Print #iFileNum, "[OPTIONS]"
Print #iFileNum, "[TARGETDIR]=" & TARGETDIR
Print #iFileNum, "[DATADIR]=" & DATADIR

Close iFileNum
InstallStatus = " Setup.ini created.."
txtStatus.Text = txtStatus.Text & InstallStatus & WrapChar
  

'STEP 2. Now create create_db.sql
'--------------------------------------------------
SetEnvironmentVariable "DATADIR", DATATDIR
SetEnvironmentVariable "TARGETDIR", TARGETDIR
Dim BatchFile As String
Dim CmdStr As String
BatchFile = TmpPath & "\create_db.cmd"
Open BatchFile For Output As #1
Print #1, "SET DATADIR=" & DATADIR
Print #1, "SET TARGETDIR=" & TARGETDIR
Print #1, "type c:\download\create_db.txt >> "; TmpPath & "\create_db.cmd"
Close #1
CmdStr = TmpPath & "\create_db.cmd"
Shell (CmdStr)

'STEP 3. Secondly we create the msde install string like
'<path>\Setup.exe /i <path>\SqlRun01.msi /settings <path>\setup.ini /qr
'--------------------------------------------------
InstallMSDEStr = "D:\PERSONAL\MSDE\setup.exe /i " _
& "D:\PERSONAL\MSDE\SETUP\SQLRUN01.msi /settings " & Chr(34) & TmpPath & "\setup.ini" & Chr(34) & " /qr"
iFileNum = FreeFile
Open TmpPath & "\setup.cmd" For Output As iFileNum
Print #iFileNum, InstallMSDEStr
Close iFileNum
' Shell (InstallMSDEStr)
InstallStatus = " Setup commandline created.."
txtStatus.Text = txtStatus.Text & InstallStatus & WrapChar


'STEP 4. Install MSDE
'--------------------------------------------------

CmdStr = TmpPath & "\setup.cmd"
Shell (CmdStr)
InstallStatus = " SQLServer MSDE installed"
txtStatus.Text = txtStatus.Text & InstallStatus & WrapChar

'STEP 5. Startup MSSQLServer
'--------------------------------------------------

Shell ("net start MSSQLServer")
InstallStatus = " Starting up SQLServer service.."
txtStatus.Text = txtStatus.Text & InstallStatus & WrapChar

Form1.Show
End If
End Sub

Private Sub btnCreateDatabase_Click()
'STEP 6. Create ELICSYR Database
'--------------------------------------------------
InstallOSQLStr = "osql -E -i " & TmpPath & "\create_db.sql"
Shell (InstallOSQLStr)
InstallStatus = " ELICSYR Database installed"
txtStatus.Text = txtStatus.Text & InstallStatus & WrapChar
Form1.Show
End If

End Sub

Private Sub btnQuit_Click()
End
End Sub

Private Sub Form_Load()
Init
ComboBox1.AddItem ("http//www.tollogic.nl/site1")
ComboBox1.AddItem ("ftp//ftp.tollogic.nl/site2")
End Sub


----------------------------------------------------------------------------

Private Declare Function GetEnvironmentStrings Lib "kernel32" Alias "GetEnvironmentStringsA" () As Long
Private Declare Function FreeEnvironmentStrings Lib "kernel32" Alias "FreeEnvironmentStringsA" (ByVal lpsz As String) As Long
Private Declare Function lstrlen Lib "kernel32" Alias "lstrlenA" (ByVal lpString As Long) As Long
Private Declare Sub CopyMemory Lib "kernel32" Alias "RtlMoveMemory" (lpvDest As Any, lpvSource As Any, ByVal cbCopy As Long)
Private Sub Form_Load()
   'The KPD-Team 2001
   'URL: http://www.allapi.net/
   'E-Mail: KPDTeam@Allapi.net
   Dim lngRet As Long, strDest As String, lLen As Long
   'set the graphics mode to persistent
   Me.AutoRedraw = True
   'retrieve the initial pointer to the environment strings
   lngRet = GetEnvironmentStrings
   Do
       'get the length of the following string
       lLen = lstrlen(lngRet)
       'if the length equals 0, we've reached the end
       If lLen = 0 Then Exit Do
       'create a buffer string
       strDest = Space$(lLen)
       'copy the text from the environment block
       CopyMemory ByVal strDest, ByVal lngRet, lLen
       'show the text
       Me.Print GetFilePathName(strDest, LongFile)
       'move the pointer
       lngRet = lngRet + lstrlen(lngRet) + 1
   Loop
   'clean up
   FreeEnvironmentStrings lngRet
End Sub

----------------------------------------------------------------------------

I've done something similar where I kick off many DOS commands in a loop 
and the DOS window closes and the program in the batch file actually runs.

The DOS program also redirects it's output to a file.

This is the code what I used:
for i = 1 to 10
  cmdline = App.Path & "\doit.bat 123.123.123.123"
  aa = Shell(cmdline, vbHide)
next i

My doit.bat file contains something like this:
nbtstat -A %1 >>C:\DATA\%1.TXT

----------------------------------------------------------------------------

Shell "command.com /c C:\Program.exe > outfile.dat" 

----------------------------------------------------------------------------

Private Sub Command1_Click()

' connect to library
Set PDF = CreateObject("PDFCreatorPilot.piPDFDocument")
' initialize PDF Engine
PDF.StartEngine "demo@demo", "demo"
' set PDF ouput filename
PDF.FileName = "HelloPDF_VB.pdf"
PDF.AutoLaunch = True ' auto-open generated pdf document
' start document generation
PDF.BeginDoc
' draw "HELLO, PDF" message on the current PDF page
PDF.PDFPAGE_BeginText
PDF.PDFPAGE_SetActiveFont "Verdana", True, False, False, False, 14, 0
PDF.PDFPAGE_TextOut 10, 20, 0, "HELLO, PDF!"
PDF.PDFPAGE_EndText
' finalize document generation
PDF.EndDoc
' disconnect from library
Set PDF = Nothing

End Sub

This function will generate PDF document and save it as "HelloPDF_VB.PDF" file in the application's folder. 

----------------------------------------------------------------------------

Private Sub Form_Load()
btnInstall.Enabled = False
Dim Drive As String
Dim Path As String
Dim TARGETDIR As String
Dim DataDir As String
Dim TmpPath As String
Dim iFileNum As Integer
End Sub

Private Sub btnInstall_Click()
fnum = FreeFile
      ' get the TMP folder name
TmpPath = Environ$("TMP")
lblCheck.Caption = TmpPath
'Get a free file handle
iFileNum = FreeFile

Open App.Path & "\create_db.bat" For Output As iFileNum
Print #iFileNum, "SET DataDir=" & lblPath.Caption

Close iFileNum
'Shell "copy var1 + create_db > create_db.bat"
'Shell "c:\project\create_db.bat"
End Sub

Private Sub btnQuit_Click()
End
End Sub

Private Sub Drive1_Change()
Dir1.Path = Drive1.Drive
End Sub

Private Sub Dir1_Change()
lblPath.Caption = Dir1.Path
DataDir = Dir1.Path
btnInstall.Enabled = True
End Sub

Private Sub Form_Load()
btnInstall.Enabled = False
Dim Drive As String
Dim Path As String
Dim TARGETDIR As String
Dim DataDir As String
Dim TmpPath As String
Dim iFileNum As Integer
End Sub
----------------------------------------------------------------------------

Create a unique temporary file:
-------------------------------

Place all this code in a module and then just call the GetNewTempFile function and 
pass it a string to prefix the temporary filename with. (can be anything you want.), 
and it will pass back the fullpath of the temporary file created. 
It`s up to you to kill the file when you are though with it. 

Private Declare Function GetTempPath Lib "kernel32" _
            Alias "GetTempPathA" (ByVal nBufferLength As Long, _
            ByVal lpBuffer As String) As Long
 
Private Declare Function GetTempFileName Lib "kernel32" _
   Alias "GetTempFileNameA" (ByVal lpszPath As String, _
   ByVal lpPrefixString As String, ByVal wUnique As Long, _
   ByVal lpTempFileName As String) As Long

Public Function GetNewTempFile(strPrefix As String) As String
   Dim strPath As String * 512
   Dim strName As String * 576
   Dim lngRetVal As Long

   lngRetVal = GetTempPath(512, strPath)
   If (lngRetVal > 0 And lngRetVal < 512) Then
      lngRetVal = GetTempFileName(strPath, strPrefix, 0, strName)
      If lngRetVal <> 0 Then
         GetNewTempFile = Left$(strName, _
            InStr(strName, vbNullChar) - 1)
      End If
   End If
End Function

----------------------------------------------------------------------------

The fastest way to read a text file is using the Input$ function, 
as shown in this reusable procedure:


Function FileText (filename$) As String
    Dim handle As Integer
    handle = FreeFile
    Open filename$ For Input As #handle
    FileText = Input$(LOF(handle), handle)
    Close #handle
End Function

This method is much faster than reading each single line of the file using 
a Line Input statements. Here's how you can load a multiline textbox control 
with the contents of Autoexec.bat: 
Text1.Text = FileText("c:\autoexec.bat")

UPDATE: Andrew Marshall wrote us to point out that the above routine fails when 
the file includes a Ctrl-Z (EOF) character, so we prepared a better version 
that works around that problem: 

Function FileText(ByVal filename As String) As String
    Dim handle As Integer
    
    ' ensure that the file exists
    If Len(Dir$(filename)) = 0 Then
        Err.Raise 53   ' File not found
    End If
    
    ' open in binary mode
    handle = FreeFile
    Open filename$ For Binary As #handle
    ' read the string and close the file
    FileText = Space$(LOF(handle))
    Get #handle, , FileText
    Close #handle
End Function


--------------------------------------------------------------------------------
Fragment  Errorhandling:
------------------------

public sub whatever()
on error goto err_handler

.......
code here
.......


err_exit:
  Exit Sub
Err_Handler:
  MsgBox "An error occurred while loading the clinic view form, " & Err.Description & "(" & Err.Number & ")." & _
    vbCrLf & vbCrLf & "Source: form1:Load"
  Module1.LogError "form1:whatever", Err.Description, Err.Number
  Resume err_exit
End Sub 
 

----------------------------------------------------------------------------
Public Drive As String
Public Path As String
Public TARGETDIR As String
Public DATADIR As String
Public TmpPath As String
Public ChooseState As Integer
Public iFileNum As Integer
Public WrapChar As String
Public TEMP As String
Public InstallMSDEStr As String
Public InstallMSDE As String
Public InstallOSQLStr As String
Public CmdStr As String
Public StartMSSqlserver As String
Public InstallStatus As String
Public SequenceStatus As Integer
Public txtLine As String
Public ChooseDir As Integer
Public sDirectory As String
Public Tempstring As Long
Public Length As Long
Public Location As String
Public ResultInstall As Integer

Public Declare Function SetEnvironmentVariable Lib "kernel32" Alias "SetEnvironmentVariableA" (ByVal lpName As String, ByVal lpValue As String) As Long
Public Declare Function GetTempPath Lib "kernel32" Alias "GetTempPathA" (ByVal nBufferLength As Long, ByVal lpBuffer As String) As Long

Sub Main()
    Load Form1
    Form1.Show
End Sub

Sub GetTempDir()
Length = 99
Location = String$(100, 0)
Tempstring = GetTempPath(Length, Location)
End Sub

Public Sub Init()
GetTempDir
Dim Explorer As SHDocVw.InternetExplorer
Dim GetEnvironmentVar As String
TARGETDIR = "C:\MSSQL2K"
DATADIR = "C:\MSSQL2K"
TEMP = "C:\TEMP"
SetEnvironmentVariable "TARGETDIR", TARGETDIR
SetEnvironmentVariable "DATADIR", DATATDIR
WrapChar = Chr(13) & Chr(10)
TmpPath = Environ$("TMP")
Form1.lblTargetDir.Caption = TARGETDIR
Form1.lblDataDir.Caption = DATADIR
Form1.lblTmpPath.Caption = TmpPath
Form1.lblSysTmpPath.Caption = Location
End Sub
Function FileText(filename$) As String
    Dim handle As Integer
    handle = FreeFile
    Open filename$ For Input As #handle
    FileText = Input$(LOF(handle), handle)
    Close #handle
End Function

Sub Create_Directory()


Dim strPath As String       'The directory which will be created...
Dim intOffset As Integer    'Searches for a "\" so it can create the dirs...
Dim intAnchor As Integer    'Equal to the above variable...
Dim strOldPath As String    'Returns the CurDir to the old path(the dir
                            'the setup file is in)...

On Error Resume Next        'Error handling...

strOldPath = CurDir$        'Find the current Directory...
intAnchor = 0               'Reset intAnchor...

'Searches for the "\" to create the dirs properly...
intOffset = InStr(intAnchor + 1, sDirectory, "\")
intAnchor = intOffset   'Equal to the above...
Do
    intOffset = InStr(intAnchor + 1, sDirectory, "\")
    intAnchor = intOffset
    
    If intAnchor > 0 Then   'If there is 1 or more "\" then...
        
        'Create the directory using the text before the "\"...
        strPath = Left$(sDirectory, intOffset - 1)
        
        ' Determine if this directory already exists...
        Err = 0
        ChDir strPath   'If it does, change to that directory...
        
        If Err Then     'If it doesn't exist...
            
            ' We must create this directory...
            Err = 0
            MkDir strPath   'Make the Directory...
        End If
    End If
Loop Until intAnchor = 0    'Loop until all directories have been made
                            'I.e C:\Prog\David\Cowan is 3 directories...

Done:
    ChDir strOldPath        'Change back to the the 'old' current directory...

Err = 0                     'Reset the error number...

End Sub


Private Sub Command1_Click()
End Sub


Private Sub btnProgDir_Click()
ChooseDir = 1
   Load frmChangeDir
   frmChangeDir.Show
   lblTargetDir.Caption = TARGETDIR
End Sub
Private Sub btnDataDir_Click()
ChooseDir = 2
   Load frmChangeDir
   frmChangeDir.Show
   lblDataDir.Caption = DATADIR
End Sub

Private Sub btnDownLoad_Click()

  SequenceStatus = 0
  'Method 1: just for local test
     ' Inet1.Execute txtURL.Text, _
     ' "GET C:\temp\create_db.cmd"
     ' Inet1.Execute txtURL.Text, _
     ' "SEND c:\download\create_db.txt c:\temp\create_db.cmd"

  'Method 2: this works for real if we use a remote machine
    ' Dim bData() As Byte
    ' bData() = Inet1.OpenURL("file://c:\download\create_db.txt", icByteArray)
    ' Open TmpPath & "\create_db.cmd" For Binary As #1
    ' Put #1, , bData()
    ' Close #1
  MsgBox "File downloaded"

  SequenceStatus = 1
  InstallStatus = "Create scripts downloaded.."
  txtStatus.Text = InstallStatus & WrapChar
  
End Sub


Private Sub btnInstall_Click()
If SequenceStatus <> 1 Then
   MsgBox ("You must first perform Step 1 (download the create scripts), before you can begin installing.")
Else
   
'STEP 1. First create setup.ini for unattended msde install
'--------------------------------------------------
iFileNum = FreeFile

Open TmpPath & "\setup.ini" For Output As iFileNum

Print #iFileNum, "[OPTIONS]"
Print #iFileNum, "[TARGETDIR]=" & TARGETDIR
Print #iFileNum, "[DATADIR]=" & DATADIR

Close iFileNum
InstallStatus = " Setup.ini created.."
txtStatus.Text = txtStatus.Text & InstallStatus & WrapChar
  

'STEP 2. Now create create_db.sql
'--------------------------------------------------
SetEnvironmentVariable "DATADIR", DATATDIR
SetEnvironmentVariable "TARGETDIR", TARGETDIR
Dim BatchFile As String
Dim CmdStr As String
BatchFile = TmpPath & "\create_db.cmd"
Open BatchFile For Output As #1
Print #1, "SET DATADIR=" & DATADIR
Print #1, "SET TARGETDIR=" & TARGETDIR
Print #1, "type c:\download\create_db.txt >> "; TmpPath & "\create_db.cmd"
Close #1
CmdStr = TmpPath & "\create_db.cmd"
Shell (CmdStr)

'STEP 3. Secondly we create the msde install string like
'<path>\Setup.exe /i <path>\SqlRun01.msi /settings <path>\setup.ini /qr
'--------------------------------------------------
InstallMSDEStr = "D:\PERSONAL\MSDE\setup.exe /i " _
& "D:\PERSONAL\MSDE\SETUP\SQLRUN01.msi /settings " & Chr(34) & TmpPath & "\setup.ini" & Chr(34) & " /qr"
iFileNum = FreeFile
Open TmpPath & "\setup.cmd" For Output As iFileNum
Print #iFileNum, InstallMSDEStr
Close iFileNum
' Shell (InstallMSDEStr)
InstallStatus = " Setup commandline created.."
txtStatus.Text = txtStatus.Text & InstallStatus & WrapChar


'STEP 4. Install MSDE
'--------------------------------------------------

CmdStr = TmpPath & "\setup.cmd"
Shell (CmdStr)
InstallStatus = " SQLServer MSDE installed"
txtStatus.Text = txtStatus.Text & InstallStatus & WrapChar

'STEP 5. Startup MSSQLServer
'--------------------------------------------------

Shell ("net start MSSQLServer")
InstallStatus = " Starting up SQLServer service.."
txtStatus.Text = txtStatus.Text & InstallStatus & WrapChar

Form1.Show
End If
End Sub

Private Sub btnCreateDatabase_Click()
'STEP 6. Create ELICSYR Database
'--------------------------------------------------
InstallOSQLStr = "osql -E -i " & TmpPath & "\create_db.sql"
Shell (InstallOSQLStr)
InstallStatus = " ELICSYR Database installed"
txtStatus.Text = txtStatus.Text & InstallStatus & WrapChar
Form1.Show
End If

End Sub

Private Sub btnQuit_Click()
End
End Sub

Private Sub Form_Load()
Init
ComboBox1.AddItem ("http//www.tollogic.nl/site1")
ComboBox1.AddItem ("ftp//ftp.tollogic.nl/site2")
End Sub


---------------------------------------------------


Simple Examples:
================

Example 1:
----------

Private Sub cmdBepaal_Click()

' Variabelen declaratie
Dim testdeler As Integer
Dim uitkomst As String
Dim g As Integer

' Toekenning
g = CInt(txtInput.Text)
lblOutput.Caption = ""
uitkomst = "Ja"

' Bepaling of het ingevoerde getal > 2 is
If Round(g, 0) <= 2 Then
   MsgBox ("Het getal is kleiner dan of gelijk aan 2.")
Else
' Bepaling of de invoer een priemgetal is
For testdeler = 2 To Sqr(g)
    If g Mod testdeler = 0 Then 'deler gevonden
        uitkomst = "Nee"
        Exit For
    End If
Next testdeler
lblOutput.Caption = uitkomst
End If
End Sub

Private Sub cmdExit_Click()
End
End Sub

Private Sub Form_Load()

End Sub

Private Sub lblAant100_Click()

End Sub


Example 2:
----------

Private Sub cmdBepaal_Click()

' Variabelen declaratie
Dim AantalBoeken As Integer
Dim AantalDagen As Integer
Dim BoeteGeld As Double
Dim boete As String
Dim WDatum As Date
Dim UDatum As Date
Dim Vandaag As Date

' Toekenning waarden aan variabelen
' m.b.v. functie CInt kunnen we van string naar integer converteren
' m.b.v. functie CDate kunnen we van string naar date converteren

BoeteGeld = 0.75
AantalBoeken = CInt(txtAantalBoeken.Text)
UDatum = CDate(txtUitDatum.Text)
WDatum = CDate(txtInlDatum.Text)
Vandaag = Now()
boete = "0"

' Bepaling of het aantal ingevoerde Aantal Boeken > 0 is
If Round(AantalBoeken, 0) <= 0 Then
   MsgBox ("Het aantal boeken is kleiner dan of gelijk aan 0.")
Else
  ' Bepaling of het inleveren te laat is.
  ' De functie Datediff bepaald het aantal dagen tussen WDatum en UDatum
  ' Als de inleverdatum nog onder de uiterste inleverdatum zit,
  ' is er geen boete.
    If DateDiff("d", UDatum, WDatum) < 0 Then
       MsgBox ("Boeken zijn niet te laat.")
       lblOutput.Caption = boete
    Else
    ' De werkelijke inleverdatum is voorbij de uiterste
    ' inleverdatum, en dus is er sprake van een boete.
    ' Bepaling van de boete:
    boete = BoeteGeld * AantalBoeken * DateDiff("d", UDatum, WDatum)
    lblOutput.Caption = boete
    End If
End If
End Sub

Private Sub cmdExit_Click()
End
End Sub

Private Sub Label1_Click()

End Sub

Private Sub lblOut_Click()

End Sub

Private Sub txtUitInlDatum_Change()

End Sub


Example 3:
----------

Private Sub cmdBepaal_Click()

' Variabelen declaratie
Dim Invoer As String        ' De ingevoerde waarde
Dim Bedrag  As Double       ' Ingevoerde waarde geconverteerd naar bedrag
Dim Afgerond As Integer     ' Het ingevoerde bedrag afgerond naar geheel getal
Dim Nieuwbedrag As Integer  ' hulpvariabele
Dim B100 As Integer  ' Aantal briefjes van f. 100
Dim B10  As Integer  ' Aantal briefjes van f. 10
Dim M5   As Integer  ' Aantal munten van f. 5
Dim M1   As Integer  ' Aantal munten van f. 1
Dim Rest As Double   ' Het restbedrag


' Assignments
B100 = 0
B10 = 0
M5 = 0
M1 = 0

' Bepaling of de ingevoerde waarde een getal is
If IsNumeric(txtInput.Text) Then
   Bedrag = Val(txtInput.Text)
Else
   MsgBox ("De ingevoerde waarde is geen getal")
End If
   
' Bepaling of het ingevoerde bedrag < 5000
If Bedrag >= 5000 Or Bedrag < 0 Then
   MsgBox ("Het ingevoerde bedrag is groter dan 5000 of kleiner dan 0")
   
Else
' De bepaling minimale geldeenheden'
' Eerst het bedrag naar beneden afronden
' en het restbedrag bepalen
Afgerond = Int(Bedrag)
Rest = Round(Bedrag - Afgerond, 2)
' Aantal briefjes van f. 100 bepalen
B100 = Int(Afgerond / 100)
' Aantal briefjes van f. 10 bepalen
If B100 >= 1 Then
   Nieuw_Bedrag = Afgerond - (B100 * 100)
   B10 = Int(Nieuw_Bedrag / 10)
   If B10 >= 1 Then
   ' Aantal munten van f. 5 bepalen
    Nieuw_Bedrag = Nieuw_Bedrag - (B10 * 10)
    M5 = Int(Nieuw_Bedrag / 5)
    If M5 = 0 Then
    ' Aantal munten van f. 1 bepalen
        M1 = Int(Nieuw_Bedrag)
        Else
         If M5 >= 1 Then
            Nieuw_Bedrag = Nieuw_Bedrag - 5
            M1 = Int(Nieuw_Bedrag)
           
         End If
      End If
    End If
   End If
lblAant100.Caption = B100
lblAant10.Caption = B10
lblAant5.Caption = M5
lblAant1.Caption = M1
lblAantrest.Caption = Rest

End If

End Sub

Private Sub cmdExit_Click()
End
End Sub


Example 4:
----------

Dim rstRecordset As ADODB.Recordset
Dim cnnConnection As ADODB.Connection
Dim strStream As ADODB.Stream
Dim imgname As String

Private Sub cmdLoad_Click()
    Set cnnConnection = New ADODB.Connection
    Set rstRecordset = New ADODB.Recordset
    imgname = GiveId.Text

    cnnConnection.Open ("Provider=SQLOLEDB; " & _
    "data Source=xpora;" & _
    "Initial Catalog=pubs; " & _
    "User Id=karel;Password=karel")
    
    rstRecordset.Open "Select * from docs where id=" & imgname, cnnConnection, _
    adOpenKeyset, adLockOptimistic
         
    Set strStream = New ADODB.Stream
    strStream.Type = adTypeBinary
    strStream.Open
    
    strStream.Write rstRecordset.Fields("Doc").Value
    strStream.SaveToFile "C:\Temp.doc", adSaveCreateOverWrite
 
    Shell "E:\Program Files\Microsoft Office\Office\Winword.exe " & _
  Chr$(34) & "C:\temp.doc", 1
 End Sub

Private Sub cmdLoad2_Click()
    Set cnnConnection = New ADODB.Connection
    Set rstRecordset = New ADODB.Recordset
    imgname = GiveId.Text

    cnnConnection.Open ("Provider=SQLOLEDB; " & _
    "data Source=xpora;" & _
    "Initial Catalog=pubs; " & _
    "User Id=karel;Password=karel")
    
    rstRecordset.Open "Select * from docs where id=" & imgname, cnnConnection, _
    adOpenKeyset, adLockOptimistic
         
    Set strStream = New ADODB.Stream
    strStream.Type = adTypeBinary
    strStream.Open
    
    strStream.Write rstRecordset.Fields("Doc").Value
    strStream.SaveToFile "C:\Temp.doc", adSaveCreateOverWrite
    
    Dim wsApp As Word.Application
    'Set wsApp = GetObject(, "Word.Application")
    Set wsApp = CreateObject("Word.Application")
        wsApp.Visible = True
        wsApp.Documents.Open ("c:\temp.doc")
    
End Sub

Private Sub cmdQuit_Click()
  End
End Sub

Private Sub cmdSelectSave_Click()

'Shell "C:\Program Files\Microsoft SQL Server\mssql\binn\textcopy.exe " & _
'"-I -S xpora -D pubs -T docs -C doc -U karel -P karel -W where id=" & imgname & " -F c:\temp.doc"

    Set cnnConnection = New ADODB.Connection
    Set rstRecordset = New ADODB.Recordset
    imgname = GiveId.Text

    cnnConnection.Open ("Provider=SQLOLEDB; " & _
    "data Source=xpora;" & _
    "Initial Catalog=pubs; " & _
    "User Id=karel;Password=karel")
    
    rstRecordset.Open "Select * from docs where id=" & imgname, cnnConnection, _
    adOpenKeyset, adLockOptimistic
         
    Set mstream = New ADODB.Stream
    mstream.Type = adTypeBinary
    mstream.Open
    mstream.LoadFromFile "c:\temp.doc"
    rstRecordset.Fields("doc").Value = mstream.Read
    rstRecordset.Update

    rstRecordset.Close
    cnnConnection.Close

End Sub


Example 5:
----------

Dim rstRecordset As ADODB.Recordset
Dim cnnConnection As ADODB.Connection
Dim strStream As ADODB.Stream


Private Sub cmdClear_Click()
Image1.Picture = Nothing
End Sub

Private Sub cmdLoad_Click()
    If Not LoadPictureFromDB(rstRecordset) Then
        MsgBox "Invalid Data Or No Picture In DB"
    End If
End Sub

Private Sub cmdSelectSave_Click()
    'Open Dialog Box
    With Dialog
        .DialogTitle = "Open Image File...."
        .Filter = "Image Files (*.gif; *.bmp)| *.gif;*.bmp"
        .CancelError = True
procReOpen:
         .ShowOpen
         
        If .FileName = "" Then
            MsgBox "Invalid filename or file not found.", _
                vbOKOnly + vbExclamation, "Oops!"
            GoTo procReOpen
        Else
            If Not SavePictureToDB(rstRecordset, .FileName) Then
                MsgBox "Save was unsuccessful :(", vbOKOnly + _
                        vbExclamation, "Oops!"
                Exit Sub
            End If
        End If
            
    End With
End Sub

Private Sub Form_Load()
    Set cnnConnection = New ADODB.Connection
    Set rstRecordset = New ADODB.Recordset

    cnnConnection.Open ("Provider=SQLOLEDB; " & _
        "data Source=xpora;" & _
        "Initial Catalog=pubs; " & _
        "User Id=karel;Password=karel")
    rstRecordset.Open "Select * from doctest", cnnConnection, _
         adOpenKeyset, adLockOptimistic
    

End Sub

Public Function LoadPictureFromDB(RS As ADODB.Recordset)

    On Error GoTo procNoPicture
    
    'If Recordset is Empty, Then Exit
    If RS Is Nothing Then
        GoTo procNoPicture
    End If
    
    Set strStream = New ADODB.Stream
    strStream.Type = adTypeBinary
    strStream.Open
    
    strStream.Write RS.Fields("Doc").Value

    
    strStream.SaveToFile "C:\Temp.doc", adSaveCreateOverWrite
    'Image1.Picture = LoadPicture("C:\Temp.doc")
    'Kill ("C:\Temp.doc")
    Shell "E:\Program Files\Microsoft Office\Office\Winword.exe " & _
  Chr$(34) & "C:\temp.doc" & Chr$(34)
    LoadPictureFromDB = True

procExitFunction:
    Exit Function
procNoPicture:
    LoadPictureFromDB = False
    GoTo procExitFunction
End Function

Public Function SavePictureToDB(RS As ADODB.Recordset, _
    sFileName As String)

    On Error GoTo procNoPicture
    Dim oPict As StdPicture
    
    Set oPict = LoadPicture(sFileName)
    
    'Exit Function if this is NOT a picture file
    If oPict Is Nothing Then
        MsgBox "Invalid Picture File!", vbOKOnly, "Oops!"
        SavePictureToDB = False
        GoTo procExitSub
    End If
    
    RS.AddNew
    

    Set strStream = New ADODB.Stream
    strStream.Type = adTypeBinary
    strStream.Open
    strStream.LoadFromFile sFileName
    RS.Fields("Pic").Value = strStream.Read
    
    Image1.Picture = LoadPicture(sFileName)
    SavePictureToDB = True
    
    
procExitSub:
    Exit Function
procNoPicture:
    SavePictureToDB = False
    GoTo procExitSub
End Function


Example 6:
----------

How to generate SQL insert statements:


'
'GenerateSqlInserts.vbs
'
'Version 2005.11.26
'
'Copyright (c) 2004-2005 CodeHQ.net - Free to use anywhere as long as this
'message is retained intact.
'
'Based on sp_generate_inserts stored procedure by Narayana Vyas Kondreddi
'(http://vyaskn.tripod.com/).
'
'Requires:
'
'   1.  VBScript version 5.
'
'   2.  ADO 2.5 or higher.
'
'   3.  SQL Server 2000 or higher.
'
'Features:
'
'   1.  Unicode output file. Ensures no character mistranslations occur.
'
'   2.  Correct handling of long text and ntext columns (greater than 4000
'       characters).
'
'   3.  Handles binary, varbinary and image types. The output code is a bit
'       slow at the moment but works correctly.
'
'   4.  Multiple table names can be specified. Inserts are generated in the
'       same table order.
'
'   5.  Any existing records in the named tables can be preserved or deleted
'       (default is to delete). The records can be deleted in the specified
'       table order or the reverse (default is reverse). Normally the table
'       inserts are ordered to satisfy foreign key constraints. Deleting in
'       this order (forward) may fail if a table has not enabled cascaded
'       deletes.
'
'Current limitations:
'
'   1.  The "INSERT INTO", column list and "VALUES" always appear on a single
'       line (before the line break tests start). This is not usually an
'       issue if the line width is wide enough.
'
'   2.  Only a limited number of data types have been coded (it is quite
'       simple to add others). The GetFieldValue function will raise an error
'       when an unhandled type is encountered.
'
'   3.  Doesn't support the sp_generate_inserts advanced options such as the
'       renamed output table and disabling constraints.
'
'   4.  The script doesn't determine inter-table foreign key dependencies.
'
Option Explicit

'============================================================================
'
'GLOBALS
'

Dim sServer, sDatabase, sOwner, sTableList, sTableName, sColumnList
Dim sIdentityColumn, sColumnName, sql, insPrefix, insSuffix, insField, ins
Dim rhs, sOutFile, sQuotedTableName, sForcedIdentity
Dim oConn, oCmd, oRS, oArgs, oFSO, oFile
Dim i, cnt, lenIns, lenField, lenPrefix, idx, maxLineLength, listIndex
Dim bReverseDelete, bNoDelete, bNoCreateTime, bVerbose
Dim aTableNames(), cTables


'============================================================================
'
'CONSTANTS
'

Const kVersionString        = "2005.11.26"

Const msecsPerDay           = 86400000 '(24hrs X 60mins X 60secs X 1000msecs)

'Selected FieldStatusEnum values:
Const adFieldIsNull         = 3

'Selected DataTypeEnum values:
Const adSmallInt            = 2
Const adInteger             = 3
Const adSingle              = 4
Const adDouble              = 5
Const adCurrency            = 6
Const adBoolean             = 11
Const adDecimal             = 14
Const adTinyInt             = 16
Const adBigInt              = 20
Const adGUID                = 72
Const adBinary              = 128
Const adChar                = 129
Const adWChar               = 130
Const adNumeric             = 131
Const adDBTimeStamp         = 135
Const adVarChar             = 200
Const adLongVarChar         = 201
Const adVarWChar            = 202
Const adLongVarWChar        = 203
Const adVarBinary           = 204
Const adLongVarBinary       = 205

'Selected CursorTypeEnum values:
Const adOpenStatic          = 3

'Selected LockTypeEnum values:
Const adLockOptimistic      = 3

'Selected CommandTypeEnum values:
Const adCmdText             = &H0001


'============================================================================


'
'Process command line.
'
Set oArgs= WScript.Arguments.Named
cnt= (oArgs.Count - 1)
If (Not oArgs.Exists("database") Or _
        Not oArgs.Exists("tables") Or _
        Not oArgs.Exists("out") Or _
        (WScript.Arguments.Unnamed.Count > 0)) Then
    Call Usage("Invalid command line.")
End If

If (oArgs.Exists("server")) Then
    sServer= oArgs("server")
Else
    sServer= "(local)"
End If

sDatabase= oArgs("database")
If (oArgs.Exists("owner")) Then
    sOwner= oArgs("owner")
Else
    sOwner= "dbo"
End If

sTableList= oArgs("tables")

maxLineLength = 500 'Default width
If (oArgs.Exists("width")) Then
    maxLineLength= CLng(oArgs("width"))
    If (maxLineLength < 80) Then
        maxLineLength= 80
    ElseIf (maxLineLength > 2000) Then
        maxLineLength= 2000
    End If
End If

maxLineLength= maxLineLength - 4    'Fixup for trailing line characters

sOutFile= oArgs("out")

bReverseDelete= Not oArgs.Exists("forwardDelete")
bNoDelete= oArgs.Exists("noDelete")
bNoCreateTime= oArgs.Exists("noCreateTime")
bVerbose= oArgs.Exists("verbose")

'
' Create ADO objects
'
Set oConn= CreateObject("ADODB.Connection")
Set oCmd= CreateObject("ADODB.Command")

oConn.Open "Provider=SQLOLEDB;Data Source=" & sServer & ";Initial Catalog=" & _
        sDatabase & ";Integrated Security=SSPI"


'
'Parse and validate the table list
'
cTables= 0
While (sTableList <> "")
    listIndex= InStr(sTableList, ",")
    If (listIndex > 0) Then
        sTableName= Left(sTableList, listIndex - 1)
        sTableList= Mid(sTableList, listIndex + 1)
    Else
        sTableName= sTableList
        sTableList= ""
    End If

    idx= InStrRev(sTableName, ":")
    If (idx > 0) Then
        sForcedIdentity= Mid(sTableName, idx + 1)
        sTableName= Left(sTableName, idx - 1)
    Else
        sForcedIdentity= ""
    End If

    sql= "SELECT 1 FROM INFORMATION_SCHEMA.TABLES WHERE (TABLE_NAME = '" & sTableName & _
            "') AND (TABLE_TYPE = 'BASE TABLE' OR TABLE_TYPE = 'VIEW') AND (TABLE_SCHEMA = '" & _
            sOwner & "')"

    Set oRS= oConn.Execute(sql)
    If (oRS.EOF) Then
        Set oRS= Nothing
        Set oCmd= Nothing
        Set oConn= Nothing
        Call Usage("Table or view (" & sTableName & ") does not exist.")
    End If

    ReDim Preserve aTableNames(2, cTables + 1)
    aTableNames(0, cTables)= sTableName
    aTableNames(1, cTables)= sForcedIdentity
    cTables= cTables + 1
WEnd

If (cTables = 0) Then
    Call Usage("No tables or views specified.")
End If

'
'Create file, write header.
'
Set oFSO= CreateObject("Scripting.FileSystemObject")
Set oFile= oFSO.CreateTextFile(sOutFile, oArgs.Exists("overwrite"), True)

oFile.WriteLine "/*"
oFile.WriteLine " * " & oFSO.GetFile(sOutFile).Name
oFile.WriteLine " *"
If (bNoCreateTime) Then
    oFile.WriteLine " * Automatically generated."
Else
    oFile.WriteLine " * Automatically generated on " & GetDateString(Now)
End If
oFile.WriteLine " *"
oFile.WriteLine " * Created by GenerateSqlInserts, version " & kVersionString
oFile.WriteLine " * From CodeHQ.net, http://codehq.net/"
oFile.WriteLine " *"
oFile.WriteLine " */"
oFile.WriteLine
oFile.WriteLine "USE " & sDatabase & ";"
oFile.WriteLine "GO"
oFile.WriteLine

oFile.WriteLine "EXEC sp_dboption '" & sDatabase & "', 'select into/bulkcopy', 'true';"
oFile.WriteLine "GO"
oFile.WriteLine

'
'Process each named table.
'
If (Not bNoDelete) Then
    WriteSectionHeader()
    If (bReverseDelete) Then
        For idx = (cTables - 1) To 0 Step -1
            sTableName= aTableNames(0, idx)
            sForcedIdentity= aTableNames(1, idx)
            Call WriteDelete(GetQuotedTableName(sOwner, sTableName))
        Next
    Else
        For idx = 0 To (cTables - 1)
            sTableName= aTableNames(0, idx)
            sForcedIdentity= aTableNames(1, idx)
            Call WriteDelete(GetQuotedTableName(sOwner, sTableName))
        Next
    End If
End If

For idx = 0 To (cTables - 1)
    sTableName= aTableNames(0, idx)
    sForcedIdentity= aTableNames(1, idx)
    Call ProcessTable()
Next


'
'Write footer, cleanup.
'
WriteSectionHeader()
oFile.WriteLine "EXEC sp_dboption '" & sDatabase & "', 'select into/bulkcopy', 'false';"
oFile.WriteLine "GO"
oFile.WriteLine
oFile.WriteLine "PRINT N'Done.';"
oFile.WriteLine

oFile.Close
Set oFile= Nothing
Set oFSO= Nothing

Set oRS= Nothing
Set oCmd= Nothing
Set oConn= Nothing

WScript.Echo "OK."
Call WScript.Quit(0)


'============================================================================


Sub ProcessTable()
    Dim sBinaryColumns, sNtextColumns, sTextColumns, sFieldName
    Dim ofs, chunkSize, binSize
    Dim identityValue
    Dim bHasRealIdentityColumn
    Dim oFld

    sQuotedTableName= GetQuotedTableName(sOwner, sTableName)

    If (bVerbose) Then
        WScript.Echo sQuotedTableName
    End If

    '
    'Create column list, determine identity column (if defined).
    '
    sColumnList= ""
    sIdentityColumn= ""
    bHasRealIdentityColumn= False
    sql= "SELECT ORDINAL_POSITION AS Ordinal, " & _
            "COLUMN_NAME AS ColumnName, " & _
            "DATA_TYPE AS DataType, " & _
            "COLUMNPROPERTY(OBJECT_ID('" & sQuotedTableName & _
                    "'), COLUMN_NAME, 'IsIdentity') AS IsIdentity, " & _
            "COLUMNPROPERTY(OBJECT_ID('" & sQuotedTableName & _
                    "'), COLUMN_NAME, 'IsComputed') AS IsComputed " & _
            "FROM INFORMATION_SCHEMA.COLUMNS (NOLOCK) " & _
            "WHERE (TABLE_NAME = '" & sTableName & "') AND (TABLE_SCHEMA = '" & sOwner & "');"
    Set oRS= oConn.Execute(sql)
    While (Not oRS.EOF)
        sColumnName= oRS.Fields("ColumnName")
        sColumnList= sColumnList & "[" & sColumnName & "]"
        If (oRS.Fields("IsIdentity") = 1) Then
            sIdentityColumn= sColumnName
            bHasRealIdentityColumn= True
        End If
        oRS.MoveNext
        If (Not oRS.EOF) Then
            sColumnList= sColumnList & ","
        End If
    WEnd

    If (sForcedIdentity <> "") Then
        sIdentityColumn= sForcedIdentity
    End If

    sql= "SELECT " & sColumnList & "FROM " & sQuotedTableName
    'Set oRS= oConn.Execute(sql)
    Set oRS= CreateObject("ADODB.Recordset")
    oRS.Open sql, oConn, adOpenStatic, adLockOptimistic, adCmdText
    If (oRS.EOF) Then
        WScript.Echo "WARNING: " & sQuotedTableName & " has no rows!"
        Exit Sub
    End If

    insPrefix= "INSERT INTO " & sQuotedTableName & " (" & sColumnList & ") VALUES ("
    insSuffix= ");"
    lenPrefix= Len(insPrefix)

    WriteSectionHeader()

    If (oRS.RecordCount > 0) Then
        If (bVerbose) Then
            WScript.Echo "  Rows=" & oRS.RecordCount
        End If

        oFile.WriteLine "PRINT N'Inserting " & oRS.RecordCount & " row(s) into " & sQuotedTableName & "';"
    Else
        oFile.WriteLine "PRINT N'Inserting into " & sQuotedTableName & "';"
    End If

    oFile.WriteLine

    If (bHasRealIdentityColumn) Then
        oFile.WriteLine "SET IDENTITY_INSERT " & sQuotedTableName & " ON;"
        oFile.WriteLine "GO"
        oFile.WriteLine
    End If


    While (Not oRS.EOF)
        ins= insPrefix
        lenIns= lenPrefix
        cnt= oRS.Fields.Count - 1
        sBinaryColumns= ""
        sNtextColumns= ""
        sTextColumns= ""
        For i= 0 To cnt
            Set oFld= oRS.Fields(i)
            insField= GetFieldValue(oFld, False) 'Suppress real value for binary or long text fields
            lenField= Len(insField)
            If (IsBinary(oFld) And (insField <> "NULL")) Then
                'Deferred binary insertion
                sBinaryColumns= sBinaryColumns & oFld.Name & "|"
                ins= ins & "0x00"
                lenIns= lenIns + 4
            ElseIf (IsNtext(oFld) And (insField <> "NULL")) Then
                'Deferred NTEXT insertion
                sNtextColumns= sNtextColumns & oFld.Name & "|"
                ins= ins & "N''"
                lenIns= lenIns + 3
            ElseIf (IsText(oFld) And (insField <> "NULL")) Then
                'Deferred TEXT insertion
                sTextColumns= sTextColumns & oFld.Name & "|"
                ins= ins & "''"
                lenIns= lenIns + 2
            ElseIf (IsTextField(oFld) And (lenIns + lenField >= maxLineLength) And (insField <> "NULL")) Then
                Do While (lenIns + lenField >= maxLineLength)
                    idx= maxLineLength - lenIns
                    If (idx < 3) Then
                        If (lenField < 3) Then
                            idx= lenField
                        Else
                            idx= 3
                        End If
                    ElseIf (idx >= lenField) Then
                        idx= lenField - 1
                    End If

                    While ((idx > 0) And (Mid(insField, idx, 1) = "'"))
                        idx= idx - 1
                    WEnd

                    rhs= Mid(insField, idx + 1)
                    If (rhs = "'") Then
                        'Include single trailing quote.
                        idx= idx + 1
                        rhs= ""
                    End If

                    If (rhs = "") Then
                        'EOL
                        ins= ins & insField
                        lenIns= lenIns + lenField
                        If (i < cnt) Then
                            'More columns
                            ins= ins & ","
                            lenIns= lenIns + 1
                        End if

                        oFile.WriteLine ins
                        ins= "    "
                        lenIns= 4
                        insField= ""
                        lenField= 0
                        Exit Do
                    End If

                    ins= ins & Left(insField, idx) & "' +"

                    oFile.WriteLine ins

                    If (IsWideText(oFld)) Then
                        ins= "    N'"
                    Else
                        ins= "    '"
                    End If

                    lenIns= Len(ins)
                    insField= Mid(insField, idx + 1)
                    lenField= lenField - idx
                Loop

                If (insField <> "") Then
                    ins= ins & insField
                    lenIns= lenIns + lenField
                End If
            Else
                ins= ins & insField
                lenIns= lenIns + lenField
            End If

            If ((i < cnt) And (lenIns > 4)) Then
                ins= ins & ","
                lenIns= lenIns + 1
                If (lenIns >= maxLineLength) Then
                    oFile.WriteLine ins
                    ins= "    "
                    lenIns= 4
                End If
            End if
        Next

        ins= ins & insSuffix
        oFile.WriteLine ins

        If ((sBinaryColumns <> "") And (sIdentityColumn = "")) Then
            WScript.Echo "WARNING: Cannot insert binary columns into " & _
                    sQuotedTableName & vbCrLf & " - it has no identity column!"
            Exit Sub
        ElseIf ((sNtextColumns <> "") And (sIdentityColumn = "")) Then
            WScript.Echo "WARNING: Cannot insert NTEXT columns into " & _
                    sQuotedTableName & vbCrLf & " - it has no identity column!"
            Exit Sub
        ElseIf ((sTextColumns <> "") And (sIdentityColumn = "")) Then
            WScript.Echo "WARNING: Cannot insert TEXT columns into " & _
                    sQuotedTableName & vbCrLf & " - it has no identity column!"
            Exit Sub
        End If

        '
        'Write out BINARY/VARBINARY/LONGVARBINARY column data.
        '
        While (sBinaryColumns <> "")
            idx= InStr(sBinaryColumns, "|")
            sFieldName= Left(sBinaryColumns, idx - 1)
            sBinaryColumns= Mid(sBinaryColumns, idx + 1)
            WScript.Echo "Deferred binary processing for " & sQuotedTableName & ".[" & sFieldName & "]"

            insField= GetFieldValue(oRS.Fields(sFieldName), True) 'Get real binary value
            lenField= Len(insField)
            binSize= oRS.Fields(sFieldName).ActualSize
            identityValue= oRS.Fields(sIdentityColumn).Value
            oFile.WriteLine "DECLARE @ptrval binary(16);"
            oFile.WriteLine "SELECT @ptrval= TEXTPTR([" & sFieldName & _
                    "]) FROM " & sQuotedTableName & " WHERE ([" & _
                    sIdentityColumn & "] = " & identityValue & ");"
            ofs= 0
            chunkSize= (maxLineLength - 60) \ 2
            While (ofs < binSize)
                If (ofs + chunkSize > binSize) Then
                    chunkSize= binSize- ofs
                End If

                If (ofs = 0) Then
                    oFile.WriteLine "WRITETEXT  " & sQuotedTableName & ".[" & _
                            sFieldName & "] @ptrval           0x" & _
                            Mid(insField, (ofs * 2) + 1, chunkSize * 2) & ";"
                Else
                    oFile.WriteLine "UPDATETEXT " & sQuotedTableName & ".[" & _
                            sFieldName & "] @ptrval " & Right(Space(6) & ofs, 7) & " 0 0x" & _
                            Mid(insField, (ofs * 2) + 1, chunkSize * 2) & ";"
                End If

                ofs= ofs + chunkSize
            WEnd

            oFile.WriteLine "GO"
        WEnd

        '
        'Write out NTEXT column data.
        '
        While (sNtextColumns <> "")
            idx= InStr(sNtextColumns, "|")
            sFieldName= Left(sNtextColumns, idx - 1)
            sNtextColumns= Mid(sNtextColumns, idx + 1)
            WScript.Echo "Deferred NTEXT processing for " & sQuotedTableName & ".[" & sFieldName & "]"

            insField= GetFieldValue(oRS.Fields(sFieldName), True) 'Get real field value
            lenField= Len(insField)
            binSize= oRS.Fields(sFieldName).ActualSize / 2
            identityValue= oRS.Fields(sIdentityColumn).Value
            oFile.WriteLine "DECLARE @ptrval binary(16);"
            oFile.WriteLine "SELECT @ptrval= TEXTPTR([" & sFieldName & _
                    "]) FROM " & sQuotedTableName & " WHERE ([" & _
                    sIdentityColumn & "] = " & identityValue & ");"
            ofs= 0
            chunkSize= (maxLineLength - 60)
            While (ofs < binSize)
                If (ofs + chunkSize > binSize) Then
                    chunkSize= binSize- ofs
                End If

                If (ofs = 0) Then
                    oFile.WriteLine "WRITETEXT  " & sQuotedTableName & ".[" & _
                            sFieldName & "] @ptrval           N'" & _
                            FixQuotes(Mid(insField, ofs + 1, chunkSize)) & "';"
                Else
                    oFile.WriteLine "UPDATETEXT " & sQuotedTableName & ".[" & _
                            sFieldName & "] @ptrval " & Right(Space(6) & ofs, 7) & " 0 N'" & _
                            FixQuotes(Mid(insField, ofs + 1, chunkSize)) & "';"
                End If

                ofs= ofs + chunkSize
            WEnd

            oFile.WriteLine "GO"
        WEnd

        '
        'Write out TEXT column data.
        '
        While (sTextColumns <> "")
            idx= InStr(sTextColumns, "|")
            sFieldName= Left(sTextColumns, idx - 1)
            sTextColumns= Mid(sTextColumns, idx + 1)
            WScript.Echo "Deferred TEXT processing for " & sQuotedTableName & ".[" & sFieldName & "]"

            insField= GetFieldValue(oRS.Fields(sFieldName), True) 'Get real field value
            lenField= Len(insField)
            binSize= oRS.Fields(sFieldName).ActualSize
            identityValue= oRS.Fields(sIdentityColumn).Value
            oFile.WriteLine "DECLARE @ptrval binary(16);"
            oFile.WriteLine "SELECT @ptrval= TEXTPTR([" & sFieldName & _
                    "]) FROM " & sQuotedTableName & " WHERE ([" & _
                    sIdentityColumn & "] = " & identityValue & ");"
            ofs= 0
            chunkSize= (maxLineLength - 60)
            While (ofs < binSize)
                If (ofs + chunkSize > binSize) Then
                    chunkSize= binSize- ofs
                End If

                If (ofs = 0) Then
                    oFile.WriteLine "WRITETEXT  " & sQuotedTableName & ".[" & _
                            sFieldName & "] @ptrval           '" & _
                            FixQuotes(Mid(insField, ofs + 1, chunkSize)) & "';"
                Else
                    oFile.WriteLine "UPDATETEXT " & sQuotedTableName & ".[" & _
                            sFieldName & "] @ptrval " & Right(Space(6) & ofs, 7) & " 0 '" & _
                            FixQuotes(Mid(insField, ofs + 1, chunkSize)) & "';"
                End If

                ofs= ofs + chunkSize
            WEnd

            oFile.WriteLine "GO"
        WEnd

        oRS.MoveNext
    WEnd

    oFile.WriteLine "GO"
    oFile.WriteLine

    If (bHasRealIdentityColumn) Then
        oFile.WriteLine "SET IDENTITY_INSERT " & sQuotedTableName & " OFF;"
        oFile.WriteLine "GO"
        oFile.WriteLine
    End If
End Sub


Function GetQuotedTableName(own, tab)
    If (own <> "") Then
        GetQuotedTableName= "[" & own & "].[" & tab & "]"
    Else
        GetQuotedTableName= "[" & tab & "]"
    End If
End Function


Function GetFieldValue(fld, realValue)
    If ((fld.Status = adFieldIsNull) Or (IsNull(fld.Value))) Then
        GetFieldValue= "NULL"
        Exit Function
    End If

    Select Case fld.Type

        Case adSmallInt
            GetFieldValue= fld.Value

        Case adInteger
            GetFieldValue= fld.Value

        Case adSingle
            GetFieldValue= fld.Value

        Case adDouble
            GetFieldValue= fld.Value

        Case adCurrency
            GetFieldValue= fld.Value

        Case adBoolean
            GetFieldValue= fld.Value

        Case adDecimal:
            GetFieldValue= fld.Value

        Case adTinyInt
            GetFieldValue= fld.Value

        Case adBigInt
            GetFieldValue= fld.Value

        Case adGUID
            GetFieldValue= "'" & CStr(fld.Value) & "'"

        Case adChar
            If (realValue And (fld.ActualSize >= 2000)) Then
                GetFieldValue= fld.Value
            Else
                GetFieldValue= "'" & FixQuotes(fld.Value) & "'"
            End If

        Case adWChar
            If (realValue And (fld.ActualSize >= 2000)) Then
                GetFieldValue= fld.Value
            Else
                GetFieldValue= "N'" & FixQuotes(fld.Value) & "'"
            End If

        Case adNumeric
            GetFieldValue= fld.Value

        Case adDBTimeStamp
            GetFieldValue= "'" & GetDateString(fld.Value) & "'"

        Case adVarChar
            If (realValue And (fld.ActualSize >= 2000)) Then
                GetFieldValue= fld.Value
            Else
                GetFieldValue= "'" & FixQuotes(fld.Value) & "'"
            End If

        Case adLongVarChar
            If (realValue And (fld.ActualSize >= 2000)) Then
                GetFieldValue= fld.Value
            Else
                GetFieldValue= "'" & FixQuotes(fld.Value) & "'"
            End If

        Case adVarWChar
            If (realValue And (fld.ActualSize >= 2000)) Then
                GetFieldValue= fld.Value
            Else
                GetFieldValue= "N'" & FixQuotes(fld.Value) & "'"
            End If

        Case adLongVarWChar
            If (realValue And (fld.ActualSize >= 2000)) Then
                GetFieldValue= fld.Value
            Else
                GetFieldValue= "N'" & FixQuotes(fld.Value) & "'"
            End If

        Case adBinary
            If (realValue) Then
                GetFieldValue= GetBinaryString(fld)
            Else
                GetFieldValue= "00"
            End If

        Case adVarBinary
            If (realValue) Then
                GetFieldValue= GetBinaryString(fld)
            Else
                GetFieldValue= "00"
            End If

        Case adLongVarBinary
            If (realValue) Then
                GetFieldValue= GetBinaryString(fld)
            Else
                GetFieldValue= "00"
            End If

        Case Else
            Call Usage("Encountered unsupported field type (" & fld.Type & ")")

    End Select
End Function


Function GetBinaryString(fld)
    Dim x, sb, sx, sc, byt, hc, bin
    sb= ""  '"0x"
    sx= ""
    bin= fld.GetChunk(fld.ActualSize)
    If (IsNull(bin)) Then
        bin= fld.GetChunk(fld.ActualSize)
    End If

    For x= 1 To fld.ActualSize
        sc= Right("0" & Hex(AscB(MidB(bin, x, 1))), 2)
        If (Len(sc) <> 2) Then
            WScript.Echo "ERROR: sc=[" & sc & "]"
        End If

        sx= sx & sc
        If ((x \ 300) = 0) Then
            sb= sb & sx
            sx= ""
        End If
    Next
    'WScript.Echo "Bin length=" & Len(sb & sx)
    GetBinaryString= sb & sx
End Function


Function IsTextField(fld)
    If (fld.Status = adFieldIsNull) Then
        IsTextField= False
    ElseIf ((fld.Type = adWChar) Or _
            (fld.Type = adVarWChar) Or _
            (fld.Type = adLongVarWChar)) Then
        IsTextField= True
    Else
        IsTextField= False
    End If
End Function


Function IsWideText(fld)
    If (fld.Status = adFieldIsNull) Then
        IsWideText= False
    ElseIf ((fld.Type = adWChar) Or _
            (fld.Type = adVarWChar) Or _
            (fld.Type = adLongVarWChar)) Then
        IsWideText= True
    Else
        IsWideText= False
    End If
End Function


Function IsBinary(fld)
    If (fld.Status = adFieldIsNull) Then
        IsBinary= False
    ElseIf ((fld.Type = adBinary) Or _
            (fld.Type = adVarBinary) Or _
            (fld.Type = adLongVarBinary)) Then
        IsBinary= True
    Else
        IsBinary= False
    End If
End Function


Function IsNTEXT(fld)
    If (fld.Status = adFieldIsNull) Then
        IsNTEXT= False
    ElseIf (((fld.Type = adWChar) Or _
             (fld.Type = adVarWChar) Or _
             (fld.Type = adLongVarWChar)) And _
            (fld.ActualSize >= 2000)) Then
        IsNTEXT= True
    Else
        IsNTEXT= False
    End If
End Function


Function IsTEXT(fld)
    If (fld.Status = adFieldIsNull) Then
        IsTEXT= False
    ElseIf (((fld.Type = adChar) Or _
             (fld.Type = adVarChar) Or _
             (fld.Type = adLongVarChar)) And _
            (fld.ActualSize >= 2000)) Then
        IsTEXT= True
    Else
        IsTEXT= False
    End If
End Function


Function FixQuotes(val)
    FixQuotes= Replace(val, "'", "''")
End Function


Function GetDateString(val)
    Dim ds, msecs

    '
    'NOTE:  VBScript doesn't have a DatePart parameter for the milliseconds
    '       in a datetime (adDBTimeStamp) value.
    '
    ds= DatePart("yyyy", val) & "-" & _
        Right("0" & DatePart("m", val), 2) & "-" & _
        Right("0" & DatePart("d", val), 2) & " " & _
        Right("0" & DatePart("h", val), 2) & ":" & _
        Right("0" & DatePart("n", val), 2) & ":" & _
        Right("0" & DatePart("s", val), 2)

    '
    'Now take the difference between the to-the-second date string
    'and the actual field value to obtain the milliseconds value;
    'multiply it by the number of millseconds in a day (86400000).
    'Get the rightmost 3 zero-filled digits as a string.
    '
    msecs= CLng(CDbl(val) - CDbl(CDate(ds))) * msecsPerDay
    GetDateString= ds & "." & Right("00" & msecs, 3)
End Function


Sub WriteDelete(sTab)
    oFile.WriteLine "PRINT N'Deleting existing values from " & sTab & "';"
    oFile.WriteLine "DELETE FROM " & sTab & ";"
    oFile.WriteLine "GO"
    oFile.WriteLine
End Sub


Sub WriteSectionHeader()
    oFile.WriteLine "/* ======================================================================= */"
    oFile.WriteLine
End Sub


Sub Usage(reason)
    WScript.Echo
    WScript.Echo reason
    WScript.Echo
    WScript.Echo "GenerateSqlInserts version " & kVersionString
    WScript.Echo "From CodeHQ.net, http://codehq.net/"
    WScript.Echo
    WScript.Echo "Usage: GenerateSqlInserts.vbs"
    WScript.Echo "            [/server:{s}] /database:{d} [/owner:{o}] /tables:{t}"
    WScript.Echo "            [/width:{w}] /out:{f} [/overwrite] [/noCreateTime]"
    WScript.Echo "            [/forwardDelete | /noDelete]"
    WScript.Echo
    WScript.Echo "    {s}  SQL Server instance name; default: (local)."
    WScript.Echo "    {d}  Database name; no default."
    WScript.Echo "    {o}  Object owner name; default: dbo."
    WScript.Echo "    {t}  Comma-separated list of table names; no default."
    WScript.Echo "    {w}  Output width in characters; default: 500."
    WScript.Echo "    {f}  Output file name; no default."
    WScript.Echo
    WScript.Echo "    Each table name in the list can be optionally followed by a colon"
    WScript.Echo "    and the name of a forced identity column (if the table does not"
    WScript.Echo "    have one)."
    WScript.Echo
    WScript.Echo "    The script will NOT overwrite an existing output file unless the"
    WScript.Echo "    /overwrite switch is specified."
    Call WScript.Quit(1)
End Sub


#######################################################################################
#######################################################################################
#######################################################################################


=======================================================================
Section 17: SQL Server 7 and 2000 Queries and examples:
=======================================================================


======================================================================
1. SCRIPTS FOR FINDING BLOCKED PROCESSES AND LOCKS IN SQL Server 2000:
======================================================================


-- 1.1 SHOW ACTIVE PROCESSES IN SQL SERVER
--     IF for a process the blocked column is not zero, it is blocked

SELECT spid, cpu, physical_io, blocked, cmd, waittime, 
       substring(convert(varchar(20),last_batch), 1, 20) as "LASTBATCH",
       substring(nt_username, 1, 15)  AS "USERNAME",
       substring(loginame, 1, 20)     AS "LOGINNAME",
       substring(hostname, 1, 15)     AS "HOSTNAME",
       substring(program_name, 1, 40) AS "PROGRAM"  
FROM   master.dbo.sysprocesses
WHERE  cmd NOT LIKE 'AWAIT%'      

-- 1.2 SHOW THE MOST INTERRESTING FIELDS FROM SYSPROCESSES

USE master
SELECT spid, cpu, physical_io, blocked, cmd, waittime, 
       substring(convert(varchar(20),last_batch), 1, 20) as "LASTBATCH",
       substring(nt_username, 1, 15)  AS "USERNAME",
       substring(loginame, 1, 20)     AS "LOGINNAME",
       substring(hostname, 1, 15)     AS "HOSTNAME",
       substring(program_name, 1, 40) AS "PROGRAM"  
FROM   master.dbo.sysprocesses
-- WHERE  cpu>50 AND physical_io>50


-- 1.3 LOCKS IN THE SYSTEM

The Enterprise Manager can show you all locks in the system
via a graphical interface. 
But you can also USE queries lauched FROM the Query Analyzer
such as the following query on 'master.dbo.syslockinfo'.
 
/* Some information about locks in the SYSTEM                                                 */
/* This query complements the EM "Current Activity Window                                     */
/* rsc_type: 1=nothing, 2=database, 3=file, 4=index, 5=table, 6=page, 7=key, 8=extent, 9=RID  */

USE master

SELECT s.spid,l.req_spid, s.cpu, s.physical_io, s.blocked, s.cmd, s.waittime, 
       substring(convert(varchar(20),s.last_batch), 1, 20) as "LASTBATCH",
       substring(s.nt_username, 1, 15) AS "USERNAME",
       substring(s.loginame, 1, 20) AS "LOGINNAME",
       substring(s.hostname, 1, 15)    AS "HOSTNAME",
       substring(s.program_name, 1, 40) AS "PROGRAM",
       l.req_status AS "Status_of_Lock", 
       l.rsc_type AS "Resource_type", 
       l.req_mode AS "Lock_request_mode"
FROM   sysprocesses s, syslockinfo l
WHERE  s.spid=l.req_spid

SELECT DISTINCT s.spid,l.req_spid, s.cpu, s.physical_io, s.blocked, s.cmd, s.waittime, 
       substring(convert(varchar(20),s.last_batch), 1, 20) as "LASTBATCH",
       substring(s.nt_username, 1, 15) AS "USERNAME",
       substring(s.loginame, 1, 20) AS "LOGINNAME",
       substring(s.hostname, 1, 15)    AS "HOSTNAME",
       substring(s.program_name, 1, 40) AS "PROGRAM",
       l.req_status AS "Status_of_Lock", 
       l.rsc_type AS "Resource_type", 
       l.req_mode AS "Lock_request_mode"
FROM   sysprocesses s, syslockinfo l
WHERE  s.spid=l.req_spid
AND    l.rsc_type in (4,5,6,7,8,9,10)


==============================================
2. Examples of conversion AND stringfunctions:
==============================================


Example 1: DATEDIFF: difference of dates
----------------------------------------

Example:
--------

USE msdb
DECLARE   @v1 VARCHAR(30)
DECLARE   @v2 VARCHAR(30)
SELECT    @v1=max(backup_finish_date) FROM backupset
SELECT    @v2=getdate()

IF (SELECT DATEDIFF(day, @v1, @v2)) in (0, 1)
  BEGIN
    BACKUP LOG sales TO sales_log_dump WITH INIT
  END


Example 2: PATINDEX() string function
-------------------------------------

IF you want to find the starting position of WHERE a string starts FROM
in a column or expression, you can USE the PATINDEX("pattern", column) function.

This example finds the position at which the pattern �wonderful� BEGINs in a 
specIFic row of the notes column in the titles table.

USE pubs
GO
SELECT PATINDEX('%wonderful%', notes)
FROM titles
WHERE title_id = 'TC3218'


Example 3: The CHARINDEX() string function
------------------------------------------

Returns the starting position of the specIFied expression in a character string. 
CHARINDEX ( expression1 , expression2 [ , start_location ] ) 
IF expression1 is not found within expression2, CHARINDEX returns 0.

USE PUBS
GO
SELECT CHARINDEX('wonderful', notes)
FROM titles
WHERE title_id = 'TC3218'
GO

IF (charindex('\', @loginame) = 0)
BEGIN
raiserror(15407, -1, -1, @loginame)
return (1)
END


IF @importpath IS NULL or (charindex('\', @importpath) = 0)
BEGIN
   PRINT 'whatever you wanted to print here.. '
   RETURN
END


Example 4: The REPLACE() string function
----------------------------------------

This is very usefull IF you need to replace a string, or part of a string,
in a field in a table:

REPLACE(field, 'string_to_be_replaced', 'replacement')

SELECT cust_name, city, replace(city,'Den','De')
FROM customers

To replace funny characters from a field:

UPDATE TABLE
SET FIELD=REPLACE(FIELD,CHAR(13),'')

Control         character Value 
-------         ---------------
Tab             CHAR(9) 
Line feed       CHAR(10) 
carriage return CHAR(13)


Example 5:  convert datetime to char, and/or present it in a smaller format
---------------------------------------------------------------------------

Example:
--------

declare @x datetime
declare @y varchar(20)

SELECT @x=GETDATE()  -- thus @x is of datatype "datetime"
SELECT @x

SELECT @y=convert(varchar(10),@x,20) 
SELECT @y


  Result:

  2004-02-27 17:02:51.390  (this is @x, in DATETIME datatype)
                     
  2004-02-27               (this is @y, in VARCHAR(10) datatype)


Example:
--------

Ofcourse, GETDATE() returns datetime. So if you need to convert datetime to
another format:

SELECT CONVERT(varchar(30), GETDATE(), 113)   -- returns 25 Jan 1999 19:29:44:893
SELECT CONVERT(varchar(30), GETDATE(), 111)   -- returns 1999/01/25
SELECT CONVERT(varchar(30), GETDATE(), 102)   -- returns 1999.01.25
SELECT CONVERT(varchar(30), GETDATE(), 20)    -- returns 1999-04-24 15:43:07
SELECT CONVERT(varchar(10), GETDATE(), 20)    -- returns 1999-04-24

Example:
--------

DECLARE @myval decimal (5, 2)
SET @myval = 193.57
SELECT CONVERT(decimal(10,5), CONVERT(varbinary(20), @myval))

You can have the same conversion with:

SELECT CAST(CAST(@myval AS varbinary(20)) AS decimal(10,5))

Example:
-------

Suppose ART_PR_IN is a float or an int.

Converting FROM float or int to money: convert(money, @ART_PR_IN)

Converting FROM integer to char:       convert(varchar(10),@COUNT_BEFORE)

Example:
--------

declare @x smalldatetime
declare @y varchar(20)
declare @z smalldatetime

SELECT @x=(select in_date from Orders where order_nr='AC95')  -- thus @x is of datatype "smalldatetime"
SELECT @x

SELECT @y=convert(varchar(10),@x,20) 
SELECT @y

SELECT @z=convert(smalldatetime,@y) 
SELECT @z

Output:

1998-06-23 00:00:00

(1 row(s) affected)

                     
1998-06-23

(1 row(s) affected)

                                                        
1998-06-23 00:00:00

(1 row(s) affected)


Example:
--------

SELECT @x=CONVERT(float,LTRIM(RTRIM(SUBSTRING(EENH_PRG_VRL,2,LEN(EENH_PRG_VRL)-2))))
SELECT @y=CONVERT(smalldatetime,LTRIM(RTRIM(SUBSTRING(EENH_STER_DT,2,LEN(EENH_STER_DT)-2))))


Example:
--------

USE pubs
SELECT 'The order date is ' + CAST(ord_date AS varchar(30))
FROM sales
WHERE ord_num = 'A2976'
ORDER BY ord_num


Example 6: function ISNULL, Replace null value
----------------------------------------------

The function ISNULL:
Replaces NULL with the specIFied replacement value.

Syntax
ISNULL ( check_expression , replacement_value ) 

USE pubs
GO
SELECT AVG(ISNULL(price, $10.00))
FROM titles
GO

So, if a value in the price column is null,
the IsNull function substitutes the value of $10.00.


Example 7: function ISDATE, Check date
--------------------------------------

The function ISDATE:
Determines whether an input expression is a valid date.

Syntax:
ISDATE ( expression )

DECLARE @datestring varchar(8)
SET     @datestring = '12/21/98'
SELECT  ISDATE(@datestring)

This will return 1

IF ISDATE(@BEGINdatum) <> 1 or ISDATE(@einddatum) <> 1
BEGIN
   print 'invalid date, please use the format YYYY-MM-DD '
   goto error_section
END   


Example 8: function LTRIM, Removing leading spaces
--------------------------------------------------

The function LTRIM:
Returns a character expression after removing leading blanks.

Syntax:
LTRIM ( character_expression ) 

DECLARE @string_to_trim varchar(60)
SET @string_to_trim = '     Five spaces are at the BEGINning of this
   string.'
SELECT 'Here is the string without the leading spaces: ' + 
   LTRIM(@string_to_trim)


Example 9: LEFT function, used to return 'n' leftmost characters
-----------------------------------------------------------------

Syntax:
LEFT(character_expression,n) 

SELECT LEFT(title, 5) 
FROM titles
ORDER BY title_id
GO

Here is the example result set:

----- 
The B 
Cooki 
You C 


Example 10: RIGHT function, used to return 'n' rightmost characters
-------------------------------------------------------------------

Syntax:
RIGHT(character_expression,n) 

SELECT RIGHT(au_fname, 5) 
FROM authors
ORDER BY au_fname
GO

Here is the result set:

------------------
raham 
Akiko 
lbert 
Ann   
Anne  


Example 11: function LEN, number of characters
----------------------------------------------

The function LEN:
Returns the number of characters, rather than the number of bytes, 
of the given string expression, excluding trailing blanks.

Syntax
LEN ( string_expression ) 

USE Northwind
GO
SELECT LEN(CompanyName) AS 'Length', CompanyName
FROM Customers
WHERE Country = 'FinlAND'


Example 12: remove '' FROM fields:
----------------------------------

Suppose you have a table like

id   name    city
'1' 'Harry'  'Boston'
'2' 'Miriam' 'Seattle'
etc..

WHERE all fields are enclosed by ''.
To create a new table without these quotes, you can use
the following example.

DECLARE @id          varchar(15)
DECLARE @name        varchar(15)
DECLARE @city        varchar(15)
DECLARE @length_id   INT
DECLARE @length_name INT
DECLARE @length_city INT
DECLARE @id2         varchar(15)
DECLARE @name2       varchar(15)
DECLARE @city2       varchar(15)

DECLARE cur1 CURSOR FOR
SELECT id, name, city FROM tab1

OPEN cur1
FETCH NEXT FROM cur1 INTO @id, @name, @city

WHILE (@@fetch_status<>-1)
BEGIN

SELECT @length_id   =LEN(@id)
SELECT @length_name =LEN(@name)
SELECT @length_city =LEN(@city)

SELECT @id2   =substring(@id,2,@length_id-2)
SELECT @name2 =substring(@name,2,@length_name-2)
SELECT @city2 =substring(@city,2,@length_city-2)

INSERT INTO tab2
values
(@id2,@name2,@city2)

FETCH NEXT FROM cur1 INTO @id, @name, @city

END

CLOSE cur1
DEALLOCATE cur1


Example 13: How to use ' in char or varchar:
--------------------------------------------

DECLARE @VAR1 VARCHAR(64)
SET @VAR1=' Appie''s   '
SELECT @var1

Result:

Appie's


Example 14: Find datatypes of table columns:
--------------------------------------------

SELECT substring(c.name, 1, 30) as "ColumName",
       c.xtype, 
       substring(object_name(c.id),1,30) as "TableName", 
       substring(t.name,1,30) as "DataType"
FROM   syscolumns c, systypes t
WHERE  c.xtype=t.xtype
AND    object_name(c.id)='Orders'  -- fill in the tablename


Example 15: Find names of table columns like '%x%:
--------------------------------------------------

SELECT substring(c.name, 1, 30) as "ColumName",
       c.xtype, 
       substring(object_name(c.id),1,30) as "TableName", 
       substring(t.name,1,30) as "DataType"
FROM   syscolumns c, systypes t
WHERE  c.xtype=t.xtype
AND    c.name like '%x%' -- fill in the fieldname
AND    object_name(c.id) not like 'stp_%'


Example 16: Remove characters FROM string or field:
---------------------------------------------------

16.1 ThE following sample code can be used to remove or save any range of characters 
     from a string or field.

declare @s varchar(100), @i int

SELECT @s = 'asd i/.,<>as>[{}]vnbv'
SELECT @s

	SELECT @i = patindex('%[^a-z^A-Z^0-9^ ]%', @s)
	while @i > 0
	BEGIN
		SELECT @s = replace(@s, substring(@s, @i, 1), '')
	SELECT @i = patindex('%[^a-z^A-Z^0-9^ ]%', @s)
	END

SELECT @s


gives
before
	asd i/.,<>as>[{}]vnbv
after
	asd iasvnbv


16.2 Removing the characters FROM a field in a table


create table #a (s varchar(100))

INSERT #a (s) SELECT 'asd i/.,<>as>[{}]vnbv'
INSERT #a (s) SELECT 'aaa'
INSERT #a (s) SELECT '123 ''h 9)'

SELECT * FROM #a

	while @@rowcount > 0
		update 	#a
		set	s = replace(s, substring(s, patindex('%[^a-z^A-Z^0-9^ ]%', s), 1), '')
		WHERE	patindex('%[^a-z^A-Z^0-9^ ]%', s) <> 0

SELECT * FROM #a

Gives

before
	asd i/.,<>as>[{}]vnbv
	aaa
	123 'h 9)
after
	asd iasvnbv
	aaa
	123 h 9


Example 17: use of logical NOT operator:
----------------------------------------

Suppose we have some sort of inventory table:

CREATE TABLE INVENTORY_TABLE
(
DEVICE_ID    INT,
NAME         VARCHAR(64)
)

Now put some sample records in this table:

insert into INVENTORY_TABLE values (1, 'NL_AMSTERDAM_LAP_07')
insert into INVENTORY_TABLE values (2, 'PLUTO')
insert into INVENTORY_TABLE values (3, 'NL_ALKMAAR-LAP_21')
insert into INVENTORY_TABLE values (4, 'STARBOSS')
insert into INVENTORY_TABLE values (5, 'US_NY_DSK007')

select * from INVENTORY_TABLE

  DEVICE_ID   NAME                                                             
  ----------- ------------------- 
  1           NL_AMSTERDAM_LAP_07
  2           PLUTO
  3           NL_ALKMAAR-LAP_21
  4           STARBOSS
  5           US_NY_DSK007

  (5 row(s) affected)

Now try this:

SELECT COUNT(*) 
FROM INVENTORY_TABLE WHERE NOT (NAME LIKE '%LAP%' OR NAME LIKE '%DSK%')

Result=2, this is what we want.

Or try this:

SELECT COUNT(*) 
FROM INVENTORY_TABLE WHERE NAME NOT LIKE '%LAP%' AND NAME NOT LIKE '%DSK%'

Result=2, so this is also OK.

Now try this:

SELECT COUNT(*) 
FROM INVENTORY_TABLE WHERE NAME NOT LIKE '%LAP%' OR NAME NOT LIKE '%DSK%'

Result=5, this is probably NOT what we wanted to have as a result !


===========================================
3. EXAMPLES OF JOINS AND SUMMARIZING DATA:
===========================================


3.1. Create three sample tables:
--------------------------------

In order to demonstrate the joins and summarizations in the next sections, 
let us first create some example tables.


create table LOC            
(
LOCID      int,
CITY       varchar(16),
constraint pk_loc primary key (locid)
)


create table DEPT           
(
DEPID      int,
DEPTNAME   varchar(16),
LOCID      int,
constraint pk_dept     primary key (depid),
constraint fk_dept_loc foreign key (locid) references loc(locid)
)


create table EMP    
(        
EMPID      int,
EMPNAME    varchar(16),
DEPID      int,
SAL        int,
constraint pk_emp      primary key (empid),
constraint fk_emp_dept foreign key (depid) references dept(depid)
)


-- ---------------------------------------------------------------------

3.2. Now insert some sample records:
------------------------------------

INSERT INTO LOC VALUES (1,'Amsterdam');
INSERT INTO LOC VALUES (2,'Haarlem');
INSERT INTO LOC VALUES (3,null);
INSERT INTO LOC VALUES (4,'Utrecht');

INSERT INTO DEPT VALUES (1,'Sales',1);
INSERT INTO DEPT VALUES (2,'PZ',1);
INSERT INTO DEPT VALUES (3,'Management',2);
INSERT INTO DEPT VALUES (4,'RD',3);
INSERT INTO DEPT VALUES (5,'IT',4);

INSERT INTO EMP VALUES (1,'Joop',1,1000);
INSERT INTO EMP VALUES (2,'Gerrit',2,500);
INSERT INTO EMP VALUES (3,'Harry',2,2000);
INSERT INTO EMP VALUES (4,'Christa',3,900);
INSERT INTO EMP VALUES (5,null,4,3000);
INSERT INTO EMP VALUES (6,'Nina',5,5000);
INSERT INTO EMP VALUES (7,'Nadia',5,4000);

-- ----------------------------------------------------------------------

3.3. Show whats in these tables:
--------------------------------

SELECT * FROM emp
SELECT * FROM dept
SELECT * FROM loc

empid       empname          depid       
----------- ---------------- ----------- 
1           Joop             1
2           Gerrit           2
3           Harry            2
4           Christa          3
5           NULL             4
6           Nina             5
7           Nadia            5

(7 row(s) affected)

depid       deptname         locid       
----------- ---------------- ----------- 
1           Sales            1
2           PZ               1
3           Management       2
4           RD               3
5           IT               4

(5 row(s) affected)

locid       city             
----------- ---------------- 
1           Amsterdam
2           Haarlem
3           NULL
4           Utrecht

(4 row(s) affected)

-- ----------------------------------------------------------------------

3.4. Let's try some join statements:
------------------------------------

Query 1:
--------

SELECT deptname, city
FROM   dept, loc
WHERE  dept.locid=loc.locid      

SELECT deptname, city FROM dept INNER JOIN loc
ON dept.locid=loc.locid

Result 1:
--------

deptname         city             
---------------- ---------------- 
Sales            Amsterdam
PZ               Amsterdam
Management       Haarlem
RD               NULL
IT               Utrecht


Query 2:
--------

SELECT e.empid, e.empname, d.depid, d.deptname
FROM   emp e, dept d
WHERE  e.depid=d.depid

SELECT e.empid, e.empname, d.depid, d.deptname 
FROM emp e INNER JOIN dept d
ON e.depid=d.depid

Result 2:
---------

empid       empname          depid       deptname         
----------- ---------------- ----------- ---------------- 
1           Joop             1           Sales
2           Gerrit           2           PZ
3           Harry            2           PZ
4           Christa          3           Management
5           NULL             4           RD
6           Nina             5           IT
7           Nadia            5           IT


So Nina and Nadia are both in the IT department.

Query 3:
--------

SELECT e.empid, e.empname, d.depid, d.deptname, l.locid, l.city
FROM emp e 
INNER JOIN dept d ON e.depid=d.depid 
INNER JOIN loc l  ON d.locid=l.locid

Result 3:
---------

empid       empname          depid       deptname         locid       city             
----------- ---------------- ----------- ---------------- ----------- ---------------- 
1           Joop             1           Sales            1           Amsterdam
2           Gerrit           2           PZ               1           Amsterdam
3           Harry            2           PZ               1           Amsterdam
4           Christa          3           Management       2           Haarlem
5           NULL             4           RD               3           NULL
6           Nina             5           IT               4           Utrecht
7           Nadia            5           IT               4           Utrecht

So both Nina and Nadia are in the IT department in Utrecht

Query 4:
--------

SELECT e.empid, e.empname, d.depid, d.deptname, l.locid, l.city
FROM emp e 
LEFT JOIN dept d ON e.depid=d.depid 
LEFT JOIN loc l  ON d.locid=l.locid

Result 4:
---------

empid       empname          depid       deptname         locid       city             
----------- ---------------- ----------- ---------------- ----------- ---------------- 
1           Joop             1           Sales            1           Amsterdam
2           Gerrit           2           PZ               1           Amsterdam
3           Harry            2           PZ               1           Amsterdam
4           Christa          3           Management       2           Haarlem
5           NULL             4           RD               3           NULL
6           Nina             5           IT               4           Utrecht
7           Nadia            5           IT               4           Utrecht

In this case, there is no difference between the INNER JOINS in Query 3
and the LEFT JOINS in Query 4.
This is so because there are no NULL values in the common key. But in general
it could be different.
In general, use a LEFT JOINT to see all values FROM the "left" table 
even if it has possible NULL values in the common key. 
In general, use a RIGHT JOINT to see all values FROM the "right" table 
even if it has possible NULL values in the common key. 

Koppeltabel:
------------

select * from functies

functie_id  functie_omschrijving           lpar_id     
----------- ------------------------------ ----------- 
1           Web server                     NULL
2           Applicatie server              NULL
3           Database server                NULL

select * from softwarecomponenten

software_id softwareom_schrijving  
----------- -----------------------
1           http                                                                                                                             basis                                              2
2           Oracle                                                                                                                           basis                                              3
3           Jolt                                                                                                                             basis                                              3
4           Tuxedo                                                                                                                           basis                                              3
5           WAS        


select * from funct_soft

software_id functie_id  
----------- ----------- 
1           1
2           3
3           2
4           2
5           2

Query to get results:

SELECT functies.functie_omschrijving,softwarecomponenten.softwareom_schrijving 
FROM functies,softwarecomponenten,funct_soft 
WHERE functies.functie_id=funct_soft.functie_id 
AND softwarecomponenten.software_id=funct_soft.software_id


-- --------------------------------------------------------------------

3.5 Use of ROLLUP and CUBE:
---------------------------


Suppose we have the following simple table:

create table emp2
(
name   varchar(10),
city   varchar(10),
salary decimal(7,2)
)

Let's put some example values into emp2:

insert into emp2 values ('joop','amsterdam',2000.00)
insert into emp2 values ('klaas','Haarlem',3000.00)
insert into emp2 values ('marie','amsterdam',1000.00)
insert into emp2 values ('nadia','alkmaar',4000.00)
insert into emp2 values ('miranda','alkmaar',1500.00)
insert into emp2 values ('maarten','haarlem',7000.00)
insert into emp2 values ('nina','haarlem',6000.00)

Now show all records in emp2: 

SELECT * FROM emp2

  name       city       salary    
  ---------- ---------- --------- 
  joop       amsterdam  2000.00
  klaas      Haarlem    3000.00
  marie      amsterdam  1000.00
  nadia      alkmaar    4000.00
  miranda    alkmaar    1500.00
  maarten    haarlem    7000.00
  nina       haarlem    6000.00


EXAMPLE QUERY WITH A "GROUP BY" CLAUSE:
---------------------------------------

Let's try a "GROUP BY":

SELECT city, sum(salary)
FROM emp2
GROUP BY city

  city                                                
  ---------- ---------
  alkmaar    5500.00
  amsterdam  3000.00
  Haarlem    16000.00

Thus use the GROUP BY in conjunction with an aggegate function like SUM(), AVG() etc..
Also, you MUST mention in the GROUP BY list, the other fields not used in the function.


EXAMPLE QUERY WITH A ROLLUP CLAUSE:
-----------------------------------

The ROLLUP operator is useful in generating reports 
that contain subtotals and totals.

SELECT   city, sum(salary)
FROM     emp2
GROUP BY city WITH ROLLUP

Result:

city                                                
---------- ---------
alkmaar    5500.00

amsterdam  3000.00
Haarlem    16000.00
NULL       24500.00


SELECT   name, city, sum(salary)
FROM     emp2
GROUP BY city, name WITH ROLLUP

Result:

  name       city                                                
  ---------- ---------- ---------
  miranda    alkmaar    1500.00
  nadia      alkmaar    4000.00
  NULL       alkmaar    5500.00
  joop       amsterdam  2000.00
  marie      amsterdam  1000.00
  NULL       amsterdam  3000.00
  klaas      Haarlem    3000.00
  maarten    haarlem    7000.00
  nina       haarlem    6000.00
  NULL       Haarlem    16000.00
  NULL       NULL       24500.00


COMPUTE BY example:
-------------------

An alternative for ROLLUP is the COMPUTE and COMPUTE BY clause.

SELECT   name, salary
FROM     emp2
ORDER BY name
compute sum(salary)

Result:

  name       salary    
  ---------- --------- 
  joop       2000.00
  klaas      3000.00
  maarten    7000.00
  marie      1000.00
  miranda    1500.00
  nadia      4000.00
  nina       6000.00

             sum
             ========
             24500.00


3.6. An example of a very simple Relation Database:
---------------------------------------------------


(PK): Primary Key; 
(FK): Foreign Key

               --------------------
               |TABLE CUSTOMERS:  |
               |------------------|
     ----------|Cust_ID (PK)      |
     |         |Cust_name         |
     |         |Address           |
     |         |Postal+code       |
     |         |City              |
     |         |Country           |
     |         --------------------
     |
     |     
     |          -----------------           --------------------
     | 1:n     |TABLE ORDERS    |           |TABLE ORDERDETAIL |
     |         |----------------|    1:n    |------------------|
     |         |Order_id (PK)   |-------<<--|Order_id (PK/FK)  |
     |---<<----|Cust_id         |           |Product_id (PK/FK |---<>----
               |Order_date      |           |Quantity          |        |
               |Emp_id          |-->>--     |Discount          |        |
               ------------------     |     |                  |        |
                                      |     --------------------        |
                                 1:n  |                             1:1 |
                                      |                                 |
            ------------------        |     --------------------        |
            |TABLE EMPLOYEES |        |     |TABLE PRODUCTS    |        |
            |----------------|        |     |------------------|        |
            |Emp_id (PK)     |--------|     |Product_id        |---------
            |Name            |              |Product_name      |
            |Lastname        |              |No_In_Stock       |
            ------------------              |To_Order          |
                                            |Price             |
                                            --------------------


Here is the DDL:

-- this is comment
-- dbo is database owner, but could be other owner or schema

USE SALES  -- or other database of your choice
GO

CREATE TABLE [dbo].[Customers] 
(
Cust_id     int NOT NULL,
Cust_name   varchar(20) NOT NULL,
Address     varchar(30),
City        varchar(20),
Country     varchar(20),
CONSTRAINT pk_cust PRIMARY KEY (cust_id)
) ON DATA  -- The data filegroup  of the sales database
GO

CREATE TABLE [dbo].[Employees] 
(
Emp_id      int NOT NULL,
Name        varchar(20) NOT NULL,
LastName    varchar(30) NOT NULL,
CONSTRAINT pk_emp PRIMARY KEY (emp_id)
) ON DATA  -- The data filegroup  of the sales database
GO

CREATE TABLE [dbo].[Products] 
(
Product_id      int NOT NULL,
Product_Name    varchar(20) NOT NULL,
Unit_price      decimal(7,2) NOT NULL,
No_In_Stock     int NOT NULL,
To_Order        char(1) NOT NULL,  -- boolean field: y or n
CONSTRAINT pk_product PRIMARY KEY (product_id)
) ON DATA  -- The data filegroup  of the sales database
GO

CREATE TABLE [dbo].[Orders] 
(
Order_id      int IDENTITY(1,1) NOT NULL,
Cust_id       int NOT NULL,
Emp_id        int NOT NULL,
Order_date    datetime NOT NULL default getdate(),
CONSTRAINT pk_order PRIMARY KEY (order_id),
CONSTRAINT fk_cust_id FOREIGN KEY (cust_id) REFERENCES dbo.customers (cust_id),
CONSTRAINT fk_emp_id FOREIGN KEY (emp_id) REFERENCES dbo.employees (emp_id)
) ON DATA  -- The data filegroup  of the sales database
GO

CREATE TABLE [dbo].[OrderDetail] 
(
Order_id      int NOT NULL,
Product_id    int NOT NULL,
Quantity      int NOT NULL,
CONSTRAINT pk_detail PRIMARY KEY (order_id,product_id),
CONSTRAINT fk_product FOREIGN KEY (product_id) REFERENCES dbo.products (product_id),
CONSTRAINT fk_order FOREIGN KEY (order_id) REFERENCES dbo.orders (order_id)
) ON DATA  -- The data filegroup  of the sales database
GO


=========================================
4. Database settable options: sp_dboption
=========================================

To show the properties of your databases 
(like for example, 'Truncate log on checkpoint' etc..)
you can USE the stored procedure 'sp_dboption'.

You can also alter any property by this stored procedure.

Example: to show the properties for the PUBS database only, you can execute
the following in the Query Analyzer

exec sp_dboption pubs

IF you want to document the properties for all database, you can USE
the following script:


-- BEGIN SCRIPT

-- You should probably set the "Results in text"
-- option on in the Query Analyzer


DECLARE @dbname   VARCHAR(30)

DECLARE cur1 CURSOR FOR
SELECT name
FROM master.dbo.sysdatabases

OPEN cur1
FETCH NEXT FROM cur1 INTO @dbname

WHILE (@@fetch_status<>-1)
BEGIN
PRINT 'DATABASE OPTION SET FOR: ' +@dbname
PRINT '  '
EXEC ('sp_dboption '+@dbname)
FETCH NEXT FROM cur1 INTO @dbname
END

CLOSE cur1
DEALLOCATE cur1

-- END OF SCRIPT


============================================================================
5. SCRIPTS FOR DOCUMENTING PHYSICAL FILES, FILEGROUPS, AND THEIR PROPERTIES:
============================================================================


5.1 Document the files and filegroups of a database
---------------------------------------------------

You are probably interested in the physical files
AND their properties, of your databases, like
filenames, paths, size etc..

Every database has the system table 'sysfiles',
which registers all important data about files.
So when you're in a database, you can query sysfiles 
like in the following way:

SELECT fileid,  
       size, 
       (size * 8 /1024)           AS SIZE_IN_MB,
       substring(name, 1, 30)     AS NAME,
       substring(filename, 1, 50) AS FILENAME 
FROM   sysfiles

The following query might also be USEfull.
Every database has the system table 'sysfiles', but
also supplementary information can be retrieved FROM
the system table 'sysfilegroups'.
So we can join both tables, as follows:

SELECT sysfiles.fileid, sysfiles.groupid, sysfiles.size,
       (sysfiles.size * 8 / 1024)                AS "SIZE_IN_MB",
       substring(sysfiles.name, 1, 15)           AS NAME, 
       substring(sysfiles.filename, 1, 30)       AS FILENAME, 
       substring(sysfilegroups.groupname, 1, 40) AS GROUPNAME
FROM   sysfiles, sysfilegroups
WHERE  sysfiles.groupid=sysfilegroups.groupid


IF you want to show the fileproperties of all
databases, you can USE the following script:

-- BEGIN SCRIPT

-- You should probably set the "Results in text"
-- option on in the Query Analyzer


DECLARE @dbname   VARCHAR(30)

DECLARE cur1 CURSOR FOR
SELECT name
FROM master.dbo.sysdatabases

OPEN cur1
FETCH NEXT FROM cur1 INTO @dbname

WHILE (@@fetch_status<>-1)
BEGIN
PRINT 'DATABASE FILES FOR: ' +@dbname
PRINT '  '
EXEC ('SELECT 
fileid, 
(size * 8 /1024)           AS SIZE_IN_MB,
substring(name, 1, 20)     AS NAME,
substring(filename, 1, 35) AS FILENAME 
FROM '+@dbname+'.dbo.sysfiles')

FETCH NEXT FROM cur1 INTO @dbname
END

CLOSE cur1
DEALLOCATE cur1

-- END OF SCRIPT


5.2 Document the "devices" in SQL Server 2000
---------------------------------------------

Althoug databases consist of physical files
in the operating system, the concept of
"device" still exist in SQL Server 7 & 2000.
But not anymore as the building block of a database
as was the case in SQL server 6.x

You can create in SQL Server 7 & 200, for example, 
"backup-dump devices" which are os files ofcourse, 
someWHERE on the filesystem, but are
registered in SQL Server as devices in master.dbo.sysdevices.
Thes files have nothing to do with database files, except that
it may contain backup(s) of normal databases.

You want to document the information in master.dbo.sysdevices


SELECT   substring(name, 1, 30) AS NAME, size, status, cntrltype,
         substring(phyname, 1, 50) AS FILE_NAME
FROM     master.dbo.sysdevices
WHERE    cntrltype=2 AND status=16


Name		:   logical name of the device
Phyname	        :   path AND physical name of the file
Status		:   16 = Backup file 
Cntrltype	:    2=  Disk backup file

There are several cntrltypes, such as 2 which is a disk backup file
but a registered device could also be a tapedevice.


==========================================================================
6. SCRIPTS FOR DOCUMENTING ALL YOUR LOGINS, USERS, ROLES, AND ROLEMEMBERS:   
==========================================================================


-- 6.1 SHOW ALL LOGINS (NT Authentication, SQL Server Mixed)

SELECT substring(loginname, 1, 30) as LOGINNAME, isntname, isntgroup, 
       substring(dbname, 1, 30) as DEFAULT_DB, createdate
FROM   master.dbo.syslogins

-- 6.2 SHOW USERS OF A DATABASE

SELECT l.suid, substring(l.name, 1, 20) as "LOGINNAME", u.uid, 
       substring(u.name,1,20) as "DBNAME", u.suid, u.isntuser, u.issqluser
FROM   master.dbo.syslogins l, sysusers u
WHERE  l.suid=u.suid
ORDER BY u.name


-- 6.3 SHOW ALL USERS OF ALL DATABASES


-- BEGIN SCRIPT

-- You should probably set the "Results in text"
-- option on in the Query Analyzer


DECLARE @db_name    VARCHAR(30)

DECLARE cur1 CURSOR FOR
SELECT  name FROM master.dbo.sysdatabases

OPEN cur1
FETCH NEXT FROM cur1 INTO @db_name

WHILE (@@fetch_status<>-1)
BEGIN
PRINT 'DATABASE USERS PLUS ROLES FOR DATABASE: '+@db_name
EXEC ('SELECT substring(name, 1, 20) AS NAME,
uid AS USER_ID
FROM '+@db_name+'.dbo.sysUSErs')

FETCH NEXT FROM cur1 INTO @db_name

END

CLOSE cur1
DEALLOCATE cur1

-- END OF SCRIPT


-- 6.4 SHOW ALL ROLEMEMBERS OF ALL ROLES OF ALL DATABASES

-- BEGIN SCRIPT

-- You should probably set the "Results in text"
-- option on in the Query Analyzer

DECLARE @dbname   VARCHAR(30)

DECLARE cur1 CURSOR FOR
SELECT name
FROM master.dbo.sysdatabases

OPEN cur1
FETCH NEXT FROM cur1 INTO @dbname

WHILE (@@fetch_status<>-1)
BEGIN
PRINT 'DATABASE ROLES AND MEMBERS FOR: ' +@dbname
PRINT '  '

EXEC('SELECT 
DbRole = substring(g.name, 1, 30), 
MemberName = substring(u.name, 1, 30)
FROM '+@dbname+'.dbo.sysUSErs'+' u'+','
+@dbname+'.dbo.sysUSErs'+' g'+','
+@dbname+'.dbo.sysmembers'+' m '
+'WHERE   g.uid = m.groupuid
AND g.issqlrole = 1
AND u.uid = m.memberuid
ORDER BY dbrole')


FETCH NEXT FROM cur1 INTO @dbname
END

CLOSE cur1
DEALLOCATE cur1

-- END OF SCRIPT


======================================================
7. Change the owner of a set of objects in a database:   
======================================================


7.1 Change the owner of one object, like a table:
-------------------------------------------------


Suppose charlie is the owner of the table 'orders' in the database 'sales'.
Suppose that the ownership of orders must be changed to the new owner harry.

You can USE the system stored procedure 'sp_changeobjectowner' in order
to change the ownership of a object.

In our example, you should USE a commAND similar to the following:

USE sales
exec sp_changeobjectowner 'db1.charlie.orders', 'harry'


7.2 Script for changing the owner of a set of objects, in one run:
------------------------------------------------------------------

Sometimes it might be needed to change the ownership
for a large set of objects, in one automated run.

Suppose in database DB1, the USEr charlie is the current owner of 
a large set of tables. 
Suppose further that the USEr john should be the owner
of this set of tables. So, how can we automate the
change of ownership?

The following script will serve as an example of how to change the ownership
for a set of tables. Run this script FROM the Query Analyzer. 
First change to the database WHERE the owners must be changed.
Next, set the variables @old_owner AND @new_owner accordingly.

-- BEGIN OF SCRIPT

-- first allow updates to the system tables; the default is false

exec sp_configure 'allow updates', 1
reconfigure with override
go
-- declaring some variables

DECLARE @old_owner     varchar(50)
DECLARE @new_owner     varchar(50)
DECLARE @oldownerid    int
DECLARE @newownerid    int
DECLARE @id            int
DECLARE @tabname       varchar(50)

-- Here is WHERE you set the OLD_OWNER AND NEW_OWNER:

SELECT @old_owner='charlie'
SELECT @new_owner='john'

SELECT @oldownerid=(SELECT uid FROM sysUSErs WHERE name=@old_owner)
SELECT @newownerid=(SELECT uid FROM sysUSErs WHERE name=@new_owner)

DECLARE cur1 cursor
FOR
SELECT name FROM sysobjects WHERE type='U' AND uid=@oldownerid
OPEN cur1                          
FETCH NEXT FROM cur1 INTO @tabname

WHILE (@@fetch_status<>-1)
BEGIN
  UPDATE sysobjects
  SET uid=@newownerid
  WHERE name=@tabname
FETCH NEXT FROM cur1 INTO @tabname
END

-- remove cursor FROM memory 
CLOSE cur1
DEALLOCATE cur1

-- setting allowing updates to system tables back to false
exec sp_configure 'allow updates', 0
reconfigure with override
go

-- END OF SCRIPT    
  

7.3 Change the owner of a database:
-----------------------------------

Most of the time, your production databases will be owned (created) by the
system administrator, or the NT/2000 Administrator (who is mapped to the
sysadmins serverrole).

But suppose the USEr harry 'owns' the database 'db1' (probably harry is member of the
serverrole 'database creators', or has inherited the permission in another way).

IF you want to change the owner to another USEr, 
you can USE the system stored procedure 'sp_changedbowner'

You can do this in a way similar to the following example:

First make sure the new owner is not already a 
database USEr of our example database db1.

Secondly, go to the database db1, using the Query Analyzer, AND execute
the commAND:

sp_changedbowner �new_owner�


7.4 Renaming a database:
------------------------

Suppose you want to change the name of a database, say 'dev1' into 'dev2'.

It's best to change the database in single USEr mode first.
Now you can change the database name using the Query Analyzer, by executing
the commAND

sp_renamedb �dev1�, �dev2�


======================================================================
8. SCRIPTS FOR RETRIEVING PROPERTIES OF INDEXES AND PK-FK CONSTRAINTS:   
======================================================================


---------------------------------------------------------
In this section, we might demonstrate some object properties. 
So, for illustration purposes, consider the following: 

suppose you have a database called 'db1', with
the database USEr 'charlie', who has enough permissions
to create tables AND indexes.

Suppose further that charlie logs on, AND creates the following tables

create table customers
(
custid int not null,
custname varchar(10),
CONSTRAINT pk_cust PRIMARY KEY (custid) 
)


create table contacts
( 
contactid int not null,
custid int,
contactname varchar(10),
CONSTRAINT pk_contactid PRIMARY KEY (contactid),
CONSTRAINT fk_cust FOREIGN KEY (custid) REFERENCES customers(custid) 
)

--------------------------------------------------------------------------


8.1 Overview Foreign Keys AND Referring- AND Referenced tables Query on sysreferences
-------------------------------------------------------------------------------------


Many tables in a database will be "linked" by Primary Key AND Foreign Key
relationships. Maybe you think it's hANDy to get a list of all
Foreign Keys AND the associated referring tables (with the FK), pointing to
the referred tables (with the PK).
Every USEr database contains the system table 'sysreferences', on which
we can USE the following query:


SELECT substring(name, 1, 60) as "ForeignKey", 
       substring(object_name(parent_obj), 1, 40) as "TableWithFK"
FROM   sysobjects o, sysreferences r
WHERE  o.type='F'
AND    o.name=object_name(r.constid)

SELECT substring(object_name(constid), 1, 40) AS FK,
       substring(object_name(fkeyid), 1, 40)  AS "Referencing Table",
       substring(object_name(rkeyid), 1, 40)  AS "Referenced Table"
FROM   sysreferences
ORDER BY object_name(rkeyid)


These queries shows all FK's in your database, plus the associated PK-FK 
linked tables.
So, IF you would USE this query in database db1, you would see the following
resultset:

FK            Referencing Table         Referenced Table                         
------------- ----------------          ---------------
fk_cust       contacts                  customers


8.2 To see all Tables with a PK:
--------------------------------

SELECT substring(name,1,30) AS "PrimaryKey", 
       id, xtype, object_name(parent_obj) AS "Parent_table" 
FROM   sysobjects
WHERE  xtype='PK'
ORDER BY object_name(parent_obj)


8.3 example of adding AND dropping pk:
--------------------------------------


ALTER TABLE ASSET_SMS_EXT
DROP CONSTRAINT ASSET_PK

ALTER TABLE ASSET_SMS_EXT
ADD CONSTRAINT ASSET_PK PRIMARY KEY (DWMACHINEID) 


8.4 To DISABLE Foreign Keys in tables that point to a PK in another table:
--------------------------------------------------------------------------

With the following query, you can delete one or more rows
FROM the table with the PK, without error messages that other
tables point with a FK to that PK.
This is so because you have DISABLED te FK constraint.
You can also enable the FK again after you have finished.


DECLARE @FK         VARCHAR(128)
DECLARE @REFERENCED VARCHAR(128)

DECLARE cur1 CURSOR 
FOR
SELECT name, object_name(parent_obj) 
FROM sysobjects o, sysreferences r
WHERE o.type='F'
AND o.name=object_name(r.constid)

OPEN cur1
FETCH NEXT FROM cur1 INTO @FK, @REFERENCED

WHILE (@@fetch_status<>-1)
  BEGIN
    EXEC('ALTER TABLE '+@REFERENCED+' NOCHECK CONSTRAINT '+@FK)
--  EXEC('ALTER TABLE '+@REFERENCED+' DROP CONSTRAINT '+@FK)
    FETCH NEXT FROM cur1 INTO @FK, @REFERENCED
  END

CLOSE cur1
DEALLOCATE cur1


8.5 Overview clustered AND nonclustered indexes. Query on sysindexes AND sysobjects.
------------------------------------------------------------------------------------

Every USEr database contains the system tables 
'sysindexes', 'sysconstraints' AND 'sysobjects'

Sysconstraints registers all defined constraints like PK, FK, CHECK constraints
in your database, AND the internal id's of the tables involved.

Sysobjects registers all objects like tables, views, procedures, with their 
names, internal id's, owner etc..

Sysindexes describes all indexes in the database, with their index-id,
associated table id, type of index etc..


-Some of the most interesting columns of sysindexes:
----------------------------------------------------

Column name 	Data type 	Description 

id 		int 		ID of table to which the index belongs. 
indid 		smallint 	ID of index: 
				1 = Clustered index
				>1 = Nonclustered
groupid 	smallint 	Filegroup ID on which the object was created. 
dpages 		int 		For indid = 0 or indid = 1, dpages is the count of data pages USEd. 
                                For indid > 1 it is the count of index pages USEd. 
rows 		int 		rowcount based on indid = 0 AND indid = 1, 
				AND the value is repeated for indid >1.
name		  

 
-Some of the most interesting columns of sysobjects:
----------------------------------------------------

Column name 	Data type 	Description 

name 		sysname 	Object name.
Id 		int 		Object identIFication number
type 		char(2) 	Object type. Can be one of these object types: 
				C = CHECK constraint
				D = Default or DEFAULT constraint
				F = FOREIGN KEY constraint
				L = Log
				FN = Scalar function
				IF = Inlined table-function
				P = Stored procedure
				PK = PRIMARY KEY constraint (type is K)
				RF = Replication filter stored procedure 
				S = System table
				TF = Table function
				TR = Trigger
				U = USEr table
				UQ = UNIQUE constraint (type is K)
				V = View
				X = ExtENDed stored procedure

uid 		smallint 	USEr ID of owner object.

- Some of the most interesting columns of sysconstraints:
---------------------------------------------------------

constid     int Constraint number. 
id          int ID of the table that owns the constraint. 
colid       smallint ID of the column on which the constraint is defined, 0 IF a table constraint. 
spare1      tinyint Reserved. 
status      int Bitmap indicating the status. Possible values include: 
1 = PRIMARY KEY constraint.
2 = UNIQUE KEY constraint.
3 = FOREIGN KEY constraint.
4 = CHECK constraint.
5 = DEFAULT constraint.
16 = Column-level constraint.
32 = Table-level constraint.
 
 
Now let's see IF we can build a query, that joins sysindexes
AND sysobjects, in order to retreive some interesting information:

  This query shows all indexes AND tables in a database:

-- indid 1     = clustered, indid>1 nonclustered
-- indid 0     = table itself

SELECT  id, substring(object_name(id),1,30), rows, indid, substring(name,1,30) 
FROM    sysindexes
WHERE   name not like '_WA%'

SELECT   substring(sysobjects.name,1,30) AS TABLENAME, 
         substring(sysindexes.name,1,30) AS INDEXNAME,
         sysobjects.id, sysindexes.indid, sysindexes.groupid, 
         sysindexes.rows
FROM     sysobjects, sysindexes
WHERE    sysobjects.id=sysindexes.id
ORDER BY sysindexes.rows desc


  This query shows all indexes AND tables in a database WHERE
  sysobjects.type=U, meaning all normal USEr tables AND indexes:

SELECT   substring(sysobjects.name,1,30) AS TABLENAME, 
         substring(sysindexes.name,1,30) AS INDEXNAME,
         sysobjects.id, sysindexes.indid, sysobjects.xtype, 
         sysindexes.rows, (dpages*8192/1024/1024) AS SIZE
FROM     sysobjects, sysindexes
WHERE    sysobjects.id=sysindexes.id AND sysindexes.indid >= 1
AND      sysobjects.type='U'
ORDER BY sysindexes.rows desc


8.6 SCRIPT TO CREATE PK'S FROM UNIQUE INDEXES:
----------------------------------------------

-- SCRIPT PART 1.
-- DYNAMICALLY CREATES 'ALTER TABLE .. ADD CONSTRAINT..' STATEMENTS

SET NOCOUNT ON

DECLARE @TABLENAME VARCHAR(64)
DECLARE @INDEXNAME VARCHAR(64)
DECLARE @KEYSET    VARCHAR(64)
DECLARE @I         INT
DECLARE @J         INT

create table ##ui_keys
(
id                int identity(1,1),
table_name        varchar(128),
index_name        varchar(128),
index_description varchar(128),
index_keys        varchar(128)
)

DECLARE c1 CURSOR FOR
SELECT   sysobjects.name AS TABLENAME, 
         sysindexes.name AS INDEXNAME
FROM     sysobjects, sysindexes
WHERE    sysobjects.id=sysindexes.id AND sysindexes.indid=1
AND      sysobjects.xtype='U'
AND      sysindexes.name NOT LIKE 'PK_%'
ORDER BY sysobjects.name

OPEN c1
FETCH NEXT FROM c1 INTO @TABLENAME,@INDEXNAME

WHILE (@@fetch_status<>-1)
BEGIN

  INSERT ##ui_keys
  (index_name, index_description, index_keys)
  exec ('sp_helpindex '+@TABLENAME)
  UPDATE ##ui_keys
  SET table_name=@TABLENAME

FETCH NEXT FROM c1 INTO @TABLENAME,@INDEXNAME
END

CLOSE c1
DEALLOCATE c1

DELETE FROM ##ui_keys
WHERE index_description LIKE 'nonclustered%'

SELECT @J=(SELECT MAX(id) FROM ##ui_keys)
SELECT @I=1

WHILE @I <= @J
  BEGIN
  SELECT @TABLENAME=(SELECT table_name FROM ##ui_keys WHERE id=@I)
  SELECT @INDEXNAME=(SELECT index_name FROM ##ui_keys WHERE id=@I)
  SELECT @KEYSET   =(SELECT index_keys FROM ##ui_keys WHERE id=@I)

  PRINT 'ALTER TABLE '+@TABLENAME+' ADD CONSTRAINT '+@INDEXNAME+' PRIMARY KEY '+'('+@KEYSET+')'
  PRINT 'GO'
  SELECT @I=@I+1
  END


8.7 Script to show dependent tables of a parent table:
-------------------------------------------------------

create procedure show_dependents @tabname varchar(64)
AS

DECLARE @I     INT
DECLARE @J     INT
DECLARE @CHILD VARCHAR(64)
 
BEGIN

if exists (select * from dbo.sysobjects where id = object_id(N'[dbo].[CHILDS]') and OBJECTPROPERTY(id, N'IsUserTable') = 1)
drop table [dbo].[CHILDS]

CREATE TABLE CHILDS
(
FK_NAME     VARCHAR(64),
TABPARENT   VARCHAR(64),
TABCHILD    VARCHAR(64),
L_DISTANCE  CHAR(1)
)

INSERT  CHILDS
(FK_NAME,TABPARENT,TABCHILD,L_DISTANCE)
SELECT
object_name(constid), 
object_name(rkeyid),
object_name(fkeyid),'1'
FROM sysreferences
WHERE object_name(rkeyid)=@tabname

SELECT @I=(SELECT COUNT(*) FROM CHILDS)

DECLARE cur1 CURSOR STATIC
FOR
SELECT TABCHILD FROM CHILDS 

OPEN cur1                          
FETCH NEXT FROM cur1 INTO @CHILD

WHILE (@@fetch_status<>-1)

  BEGIN

    IF @CHILD<>@tabname

       BEGIN
         INSERT  CHILDS
         (FK_NAME,TABPARENT,TABCHILD,L_DISTANCE )
         SELECT
         object_name(constid), 
         object_name(rkeyid),
         object_name(fkeyid),'2'
         FROM sysreferences
         WHERE object_name(rkeyid)=@CHILD

       END
    FETCH NEXT FROM cur1 INTO @CHILD

  END

  CLOSE cur1
  DEALLOCATE cur1

SELECT * FROM CHILDS

END


==============================
9. Other USEfull dba queries:   
==============================

Example 9.1:
------------

--     The system table msdb.dbo.backupset, registers all backups of your databases.
--     So, you can get a list of backups FROM a certain date as
--     shown in the following example


USE msdb
SELECT backup_start_date, backup_finish_date, media_set_id,
       type, database_name
FROM   backupset
WHERE  backup_start_date>'2003-06-01'


USE msdb
SELECT backup_set_id, media_set_id, catalog_family_number,
       software_vENDor_id, USEr_name,
       backup_start_date, backup_finish_date,
       type, database_name
FROM   backupset
/* WHERE backup_start_date > "your_date_of_choice" */


SELECT substring(s.database_name,1,20) as "database", (s.backup_size/1024/1024) as "Size_in_MB", s.type,
       s.backup_start_date, s.backup_finish_date, substring(f.physical_device_name,1,30)
FROM   backupset s, backupmediafamily f
WHERE  s.media_set_id=f.media_set_id
AND    s.backup_start_date > '2003-07-01'
ORDER BY s.backup_start_date


SELECT substring(s.database_name,1,20) as "database", 
       (s.backup_size/1024/1024)       as "Size_in_MB", 
       s.type, s.backup_start_date, s.backup_finish_date, 
       substring(f.physical_device_name,1,30) as "Device",
       s.first_lsn, s.last_lsn, database_backup_lsn
FROM   backupset s, backupmediafamily f
WHERE  s.media_set_id=f.media_set_id
AND    s.backup_start_date > '2003-07-15'
ORDER BY s.backup_start_date


-- Just get the latest backup date as recorded in the msdb database

SELECT max(backup_finish_date) FROM backupset


Example 9.2:
------------

--     IF you have created jobs, or have installed replication,
--     the tables msdb.dbo.sysjobs AND msdb.dbo.sysjobsteps
--     describes interesting properties about your jobs.


USE msdb
SELECT sysjobs.job_id, substring(sysjobs.name, 1, 30) AS JOBNAME,
       sysjobs.enabled, sysjobsteps.step_id, 
       substring(sysjobsteps.step_name, 1, 20) AS STEPNAME,
       substring(sysjobsteps.commAND, 1, 20) AS COMMAND
FROM   sysjobs, sysjobsteps
WHERE  sysjobs.job_id=sysjobsteps.job_id

SELECT i.job_id, substring(i.name,1,40) as jobname, step_id, substring(step_name, 1, 40) as stepname
FROM sysjobs i, sysjobsteps j 
WHERE i.job_id=j.job_id


Example 9.3:
------------

--     Automated Reindexing of all USEr tables in a database
--     Here, DBCC DBREINDEX() is USEd for reindexing
--     You should only using this method when your database
--     is free FROM USEr activity, AND no batch processing should run
--     Be carefull using the following script in a very large database


-- BEGIN OF SCRIPT
-- Go to the database of interest

DECLARE @tabname    VARCHAR(40)

DECLARE cur1 CURSOR FOR
SELECT 
sysobjects.name
FROM sysobjects, sysindexes
WHERE sysobjects.id=sysindexes.id
AND sysobjects.type='U'

OPEN cur1
FETCH NEXT FROM cur1 INTO @tabname

WHILE (@@fetch_status<>-1)
BEGIN
EXEC ('DBCC DBREINDEX('+@tabname+')')
PRINT 'REINDEXING '+@tabname
FETCH NEXT FROM cur1 INTO @tabname
END

CLOSE cur1
DEALLOCATE cur1

-- END OF SCRIPT


Example 9.4:
------------

-- Automated UPDATE STATISTICS commAND for all your USEr tables


-- BEGIN OF SCRIPT

DECLARE @tabname    VARCHAR(40)

DECLARE cur1 CURSOR FOR
SELECT name FROM sysobjects WHERE type='U'   

OPEN cur1
FETCH NEXT FROM cur1 INTO @tabname

WHILE (@@fetch_status<>-1)
BEGIN
EXEC ('UPDATE STATISTICS '+@tabname)
FETCH NEXT FROM cur1 INTO @tabname
END

CLOSE cur1
DEALLOCATE cur1

-- END OF SCRIPT


Example 9.5:
------------

-- Running automated scripts:

You can run scripts FROM the commAND line as follows:


E:\MSSQL7\BINN>osql �E �i show_dboptions.sql > dboptions.txt

-E				: use a trusted connection
-Uusername �Ppassword		: Or use an internal sqlserver account
-i				: switch to specIFy the inputscript


Example 9.6:
-------------

Generating backup commands:

-- SCRIPT FOR GENERATING CORRECT BACKUP COMMANDS
-- FOR DATABASES ON SGHDRC12

-- VERSION 2.0
-- DATE 27-01-2004


SET NOCOUNT ON

DECLARE @NAME         VARCHAR(128)
DECLARE @DATUM        DATETIME
DECLARE @BACKUP_DATUM VARCHAR(128)

SELECT @DATUM=GETDATE()
SELECT @BACKUP_DATUM=CONVERT(VARCHAR(10),@DATUM,20)

-- NU DE DATABASENAMEN OPHALEN UIT DE DICTIONARY

DECLARE c1 CURSOR FOR
SELECT   name FROM master.dbo.sysdatabases WHERE name not like '%AIDA%'

OPEN c1

FETCH NEXT FROM c1 INTO @NAME

WHILE (@@fetch_status<>-1)
BEGIN

  PRINT 'BACKUP DATABASE '+@NAME+' TO DISK=''d:\backup\local_dbs\'+@NAME+'_'+@BACKUP_DATUM+'.dmp'
  PRINT 'GO'

FETCH NEXT FROM c1 INTO @NAME
END

CLOSE c1
DEALLOCATE c1

-- END OF FILE


==============================
9. Full-Text Catalogs:   
==============================


1. First, enable a database for Full-Text catalogs:
---------------------------------------------------

Syntax:
sp_fulltext_database [@action =] 'action' 

action=enable:

enable Enables full-text indexing within the current database. 

Important  Use carefully. If full-text catalogs already exist, this procedure drops all full-text catalogs, 
re-creates any full-text indexing indicated in the system tables, and marks the database as full-text enabled.
This action does not cause index population to begin; an explicit start_full or start_incremental on each 
catalog must be issued using sp_fulltext_catalog to populate or repopulate the full-text index.
 
action=disable:
disable Removes all full-text catalogs in the file system for the current database and marks the database 
as being disabled for full-text indexing. This action does not change any full-text index metadata at 
the full-text catalog or table level. 

Example:

This example enables full-text indexing for the Northwind database.

USE Northwind
EXEC sp_fulltext_database 'enable'


2. Enable one or more tables to be full-text indexed:
-----------------------------------------------------

This means that for a table a PK or Unique key is needed. Secondly, you have a characterbased column
on which you want the full-text search to apply on.
The following procedure marks or unmarks a table for full-text indexing.

Syntax:
sp_fulltext_table [ @tabname = ] 'qualified_table_name' 
    , [ @action = ] 'action' 
    [ , [ @ftcat = ] 'fulltext_catalog_name' 
    , [ @keyname = ] 'unique_index_name' ]

action can be Create, Drop, Activate etc..

Example:

USE Northwind
EXEC sp_fulltext_table 'Categories', 'create', 'Cat_Desc', 'PK_Categories'
.. Add some columns
EXEC sp_fulltext_column 'Categories','Description','add'
.. Activate the index
EXEC sp_fulltext_table 'Categories','activate'


3. Enable a column to participate in full-text indexing:
--------------------------------------------------------

The following procedure specifies whether or not a particular column of a table participates in full-text indexing.

sp_fulltext_column [ @tabname = ] 'qualified_table_name' , 
    [ @colname = ] 'column_name' , 
    [ @action = ] 'action' 
    [ , [ @language = ] 'language' ] 
    [ , [ @type_colname = ] 'type_column_name' ]


Example:

This example adds the Description column from the Categories table to the table's full-text index. 
USE Northwind
EXEC sp_fulltext_column Categories, Description, 'add'

4. Now start populating the catalog:
------------------------------------

sp_fulltext_catalog
Creates and drops a full-text catalog, and starts and stops the indexing action for a catalog. 
Multiple full-text catalogs can be created for each database. 

Syntax
sp_fulltext_catalog [ @ftcat = ] 'fulltext_catalog_name' , 
    [ @action = ] 'action' 
    [ , [ @path = ] 'root_directory' ] 


Examples:

A. Create a full-text catalog
This example creates an empty full-text catalog, Cat_Desc, in the Northwind database.
USE Northwind
EXEC sp_fulltext_catalog 'Cat_Desc', 'create'

B. To rebuild a full-text catalog
This example rebuilds an existing full-text catalog, Cat_Desc, in the Northwind database.
USE Northwind
EXEC sp_fulltext_catalog 'Cat_Desc', 'rebuild'

C. Start the population of a full-text catalog
This example begins a full population of the Cat_Desc catalog.
USE Northwind
EXEC sp_fulltext_catalog 'Cat_Desc', 'start_full'

D. Stop the population of a full-text catalog
This example stops the population of the Cat_Desc catalog.
USE Northwind
EXEC sp_fulltext_catalog 'Cat_Desc', 'stop'

E. To remove a full-text catalog
This example removes the Cat_Desc catalog.
USE Northwind
EXEC sp_fulltext_catalog 'Cat_Desc', 'drop'


=====================================================================
11. Stored procedure examples, and code examples you can use in sp's:
=====================================================================


11.1 Typical stored procedure with input parameters:
----------------------------------------------------

Typical sp, that might be used at some form of orderentry,
here shown as a typical example:


CREATE procedure orderentry 

@order_id   int, 
@cust_name  varchar(20), 
@product    varchar(20), 
@quantity   int

AS

BEGIN TRAN orders

INSERT INTO orders
(order_id, cust_name)
values
(@order_id, @cust_name)

IF @@error=0
BEGIN
  INSERT INTO orderdetail
  (order_id, product, quantity)
  values
  (@order_id, @product, @quantity) 

  COMMIT TRAN orders
END

ELSE
BEGIN
  ROLLBACK TRAN orders
  RAISERROR ('Orderentry did not succeed.', 16, 1) WITH LOG
  PRINT 'Orderentry did not succeed due to errors.' 
END
GO

You can execute the sp in the following way:

exec orderentry 5, 'Intel', 'chips', 10


11.2 Stored procedure with input AND output parameters:
-------------------------------------------------------

CREATE PROCEDURE mathtutor

@m1 int,
@m2 int,
@result int OUTPUT

AS

SET @result=@m1 * @m2

----

Now execute the sp:

You may not just do
exec mathtutor 2, 5

The right way to use the sp is as follows:

DECLARE @answer int
EXECUTE mathtutor 2, 5, @answer OUTPUT
SELECT 'the result is: ', @answer


11.3 Example of a system stored procedure:
------------------------------------------

You can use a sp to add a job to SQL Server:

DECLARE @JobID BINARY(16)  
DECLARE @ReturnCode INT    
SELECT @ReturnCode = 0   
  
EXECUTE @ReturnCode = msdb.dbo.sp_add_job @job_id = @JobID OUTPUT , 
        @job_name = N'STEP1_input_data', 
        @owner_login_name = N'W2KSQL\Administrator', 
        @description = N'No description available.', 
        @category_name = N'[Uncategorized (Local)]', 
        @enabled = 0, 
        @notIFy_level_email = 0, 
        @notIFy_level_page = 0, 
        @notIFy_level_netsEND = 0, 
        @notIFy_level_eventlog = 0, 
        @DELETE_level= 0


Add a jobstep to the job:

EXECUTE @ReturnCode = msdb.dbo.sp_add_jobstep @job_id = @JobID, 
        @step_id = 1, 
        @step_name = N'Step1', 
        @commAND = N'e:\synchro\bin\input.bat', 
        @database_name = N'', 
        @server = N'', 
        @database_user_name = N'', 
        @subsystem = N'CmdExec', 
        @cmdexec_success_code = 0, 
        @flags = 0, 
        @retry_attempts = 0, 
        @retry_interval = 1, 
        @output_file_name = N'', 
        @on_success_step_id = 0, 
        @on_success_action = 1, 
        @on_fail_step_id = 0, 
        @on_fail_action = 2


11.5: get the username or applicationname
-----------------------------------------

SELECT SUSER_NAME()  -- is sql 7 syntax, in sql2k use SUSER_SNAME
SELECT SUSER_SNAME() -- returns nt login IF nt, sql login IF sql
SELECT USER_NAME()   -- returns sql login / database user
SELECT SESSION_USER  -- returns sql login
SELECT HOST_ID()
SELECT HOST_NAME()   -- return machinename
SELECT CURRENT_USER  -- returns sql login / database user
SELECT APP_NAME()    -- returns application
SELECT SUSER_SID()   -- returns SID
SELECT SUSER_SNAME() -- returns nt IF nt, return sql IF sql
SELECT USER_ID()     -- Returns a user's database identIFication number.
SELECT @@spid        -- returns spid
SELECT @@servername
SELECT @@servicename

-- sql login / database user
DECLARE @session_usr char(30)
SET @session_usr = SESSION_USER
SELECT 'This session''s current user is: '+ @session_usr
GO

-- returns nt login IF nt, or sql login IF sql
DECLARE @sys_usr char(30)
SET @sys_usr = SYSTEM_USER
SELECT 'The current system user is: '+ @sys_usr
GO

-- database user
DECLARE @usr char(30)
SET @usr = user
SELECT 'The current user''s database username is: '+ @usr
GO

SELECT substring(nt_username, 1, 15)  AS "USERNAME (NT or NULL)",
       substring(loginame, 1, 20)     AS "LOGINNAME (NT or SQL)",
       substring(hostname, 1, 15)     AS "HOSTNAME",
       substring(program_name, 1, 40) AS "PROGRAM"  
FROM   master.dbo.sysprocesses

SELECT spid AS "SQL process ID", 
       kpid AS "NT thread ID", 
       ecid AS "Execution context ID", 
       nt_username, loginame 
FROM   master.dbo.sysprocesses


11.6: can be used at errorhANDling
----------------------------------

IF (@@TRANCOUNT > 0)
BEGIN
  RAISERROR()
END


IF (@@ROWCOUNT = 0)
BEGIN
  RAISERROR()
END

IF @@error > 0
BEGIN
   GOTO error_section
END
.
.
error_section:
PRINT @MESSAGE
RETURN
GO


11.7: set a value IF it's null
------------------------------

IF (@Volgnr IS NULL ) 
BEGIN
  SET @Volgnr = 1
END
ELSE 
BEGIN
  SET @Volgnr = @Volgnr + 1
END


11.8: INSERT variable values into a table
-----------------------------------------

INSERT AfschrIFt 
(AfschrIFtId, RekeningId, VolgNr, Type, EindDatum) 
VALUES
(@R, @RekeningId, @VolgNr, 'TR', CONVERT(smalldatetime, CONVERT(varchar(10), GetDate(), 112), 112))


11.9:  IF ON, "x" IF x is an object, it cannot be used in DDL
-------------------------------------------------------------
SET QUOTED_IDENTIFIER  OFF    SET ANSI_NULLS  ON 
GO


11.10: Using xp_cmdshell AND bcp or other external cmd:
-------------------------------------------------------

Example 1:
----------

SELECT @LOGSTRING=@ART_NR+' :'+@ACTION+' '+CONVERT(VARCHAR(64),@ACTION_DATE)
SELECT @log_cmd='echo'+' '+@LOGSTRING+' >> C:\TEMP\LOAD_FILE.LOG'

EXEC master.dbo.xp_cmdshell @log_cmd

Example 2:
----------

SELECT @totalcommAND='bcp ##BCP_LOAD in '+@importpath+' -c -F2 -T'

EXEC @RESULT = master.dbo.xp_cmdshell @totalcommAND
IF (@RESULT <> 0)
   BEGIN
   SET @MESSAGE='Error loading data in temporary table. Possibly wrong path or file not found.'
   GOTO error_section
   END

Example 3:
----------

DECLARE @cmd sysname, @var sysname
SET @var = 'Hello world'
SET @cmd = 'echo ' + @var + ' > var_out.txt'
EXEC master..xp_cmdshell @cmd

DECLARE @cmd sysname, @var sysname
SET @var = 'dir/p'
SET @cmd = @var + ' > dir_out.txt'
EXEC master..xp_cmdshell @cmd

Example 4:
----------

WHILE (@@FETCH_STATUS<>-1)
  BEGIN
    SELECT @length_table   =LEN(@TEMPTAB)
    SELECT @TEMPTAB2=substring(@TEMPTAB,3,@length_table-2)
    SELECT @TEMPTAB2
    SELECT @totalcmd='c:\exp_solid\solexp -o c:\exp_solid\'+@TEMPTAB2+'.txt '+' "shm zg39bbv"'+' zg39bbv  zg39bbv '+@TEMPTAB2
    EXEC master..xp_cmdshell @totalcmd

    FETCH NEXT FROM cur1 INTO @TEMPTAB
  END

Example 5:
----------

DECLARE @FileName varchar(50),
        @bcpCommand varchar(2000)

SET @FileName = REPLACE('c:\authors_'+CONVERT(char(8),GETDATE(),1)+'.txt','/','-')

SET @bcpCommand = 'bcp "SELECT * FROM pubs..authors ORDER BY au_lname" queryout "'
SET @bcpCommand = @bcpCommand + @FileName + '" -U garth -P pw -c'

EXEC master..xp_cmdshell @bcpCommand


Other notes:
------------

By default, xp_cmdshell can only be used by logins in the sysadmin server role.
When a login that's in the sysadmin role executes xp_cmdshell, it runs under 
the windows account that SQL Server is running under. When you grant the right 
to run xp_cmdshell to a login that is not in the sysadmin role, you must set the account 
that is used to run xp_cmdshell and any programs that it invokes. 
This is done with the extended stored procedure xp_sqlagent_proxy_account.

Suppose that you want to grant the right to execute xp_cmdshell to the SQL login LimitedUser. 
You'll need an NT account to execute the program. Here's the script:

use master
go

xp_sqlagent_proxy_account N'SET'
                        , N'<mydomain>'
                        , N'<ntuser>'
                        , N'<ntuser's password>'
go

-- retrieve the proxy account to check that it's correct.
xp_sqlagent_proxy_account N'GET'
go

-- grant database access in master
sp_grantdbaccess 'LimitedUser'
go

grant exec on xp_cmdshell to LimitedUser
go

Also, you may go into Enterprise Manager and bring up the property page for SQL Agent. 
Click on the "Job System" tab. In the Non-SysAdmin job step proxy account area of the page, 
clear the check box Only users with SysAdmin privileges can execute CmdExec and ActiveScripting job steps.
You can fill in an account here.

--Corresponds to the Enterprise Manager SQL Agent property page
-- Job System tab.  Sets the value of "Only users with SysAdmin
-- privileges can execute CmdExec and ActiveScripting job steps"
--  1 Turns on the restriction
--  0 Turns off the restriction and allows non sysadmin logins
--               to do this and to run xp_cmdshell
EXECUTE msdb..sp_set_sqlagent_properties @sysadmin_only = 0
go

Be aware that Service Pack 3 changes the behavior of SQL Server by making the 
permission for non-sysadmin accounts to use xp_cmdshell conditional on the value 
of the Only users with SysAdmin privileges can execute 
CmdExec and ActiveScripting job steps flag. 
 

Custom xp_cmdshell:
-------------------

CREATE PROCEDURE xp_cmdshell(@cmd varchar(255), @Wait int = 0) AS
  --Create WScript.Shell object
  DECLARE @result int, @OLEResult int, @RunResult int
  DECLARE @ShellID int

  EXECUTE @OLEResult = sp_OACreate 'WScript.Shell', @ShellID OUT
  IF @OLEResult <> 0 SELECT @result = @OLEResult
  IF @OLEResult <> 0 RAISERROR ('CreateObject %0X', 14, 1, @OLEResult)


  EXECUTE @OLEResult = sp_OAMethod @ShellID, 'Run', Null, @cmd, 0, @Wait
  IF @OLEResult <> 0 SELECT @result = @OLEResult
  IF @OLEResult <> 0 RAISERROR ('Run %0X', 14, 1, @OLEResult)
  --If @OLEResult <> 0 EXEC sp_displayoaerrorinfo @ShellID, @OLEResult 


  EXECUTE @OLEResult = sp_OADestroy @ShellID

  return @result


11.11: check for a precondition value before some action takes place
--------------------------------------------------------------------

DECLARE @message 	VARCHAR(100)

IF (SELECT BLOK_CODE FROM DSA_IMPORT..BLOKKADE) = 'Y'
BEGIN
  SELECT @message = 'error: blocking flag is Y'
  goto error_section
END

11.12: days of week
-------------------

SET DATEFIRST 5
SELECT @@DATEFIRST AS '1st Day', DATEPART(dw, GETDATE()) AS 'Today'

Here is the result set. Counting FROM Friday, today (Saturday) is day 2.

1st Day           Today
----------------  --------------
5                 2

Or see this sample:

SELECT @datumset = @@DATEFIRST
set DATEFIRST 1
SELECT @startdatum = @nextdat - (DATEPART(dw, @nextdat)-1)
SELECT @einddatum =  @nextdat + (7 - DATEPART(dw,@nextdat))
SET DATEFIRST  @datumset

INSERT INTO DESTINATION_TABLE    
SELECT * FROM SOURCE_TABLE
WHERE DATE_CRITERIUM between @nextdat AND @einddatum

Or see this sample:

set datefirst 1
declare @dag varchar(10)
set @dag = (SELECT dag = case 
                        when datepart(dw,getdate()) = 1 then 'MA'
                        when datepart(dw,getdate()) = 2 then 'DI'
                        when datepart(dw,getdate()) = 3 then 'WOE'
                        when datepart(dw,getdate()) = 4 then 'DO'
                        when datepart(dw,getdate()) = 5 then 'VR'
else 'onbekEND'
END)
SELECT @dag

11.13: keeping the no of transactions manageble
-----------------------------------------------

DECLARE @i INT
DECLARE @j INT
DECLARE @k INT

SET @i=0
SET @j=0

WHILE (@i<10000)
BEGIN
  SET @J=0
    BEGIN TRAN
      WHILE (@j<100)
       BEGIN
         INSERT INTO y
         values
         (@i,'joop')
         SELECT @i=@i+1
         SELECT @j=@j+1
       END
    COMMIT
-- just inspect those integers
 SELECT @k=(SELECT COUNT(*) FROM y)
 SELECT @k
 SELECT @i
-- END inspection
END

11.14: Use of a user defined function
-------------------------------------

User defined functions can accept parameters AND return either
a scalar value or a table.

CREATE FUNCTION udfEmpByCity(@city varchar(20))
RETURNS TABLE
AS
RETURN (
        SELECT EmployeeID, LastName, FirstName
        FROM Employees
        WHERE (City=@City)
)

To access the resultset of this function you can use the next example:

SELECT * FROM udfEmpByCity('Seattle')

11.15: a compare option
-----------------------

DECLARE @I        INT
DECLARE @J        INT
DECLARE @NEXTID   VARCHAR(10)
DECLARE @ID       VARCHAR(20)
DECLARE @SMS      VARCHAR(64)


SELECT @J = (SELECT COUNT(*) FROM SOFTCODE) 
SELECT @I = 0

WHILE @I < @J

BEGIN
  
 SELECT @I=@I+1
 SELECT @ID   =(SELECT ID FROM SOFTCODE WHERE IDENT=@I)
 SELECT @SMS  =(SELECT DISTINCT packagename FROM packages 
 WHERE packagename like '%'+'ID='+'%'+@ID+'%') 

 UPDATE SOFTCODE 
 SET SOFTCODE.SMSSOFT=@SMS
 FROM SOFTCODE
 WHERE SOFTCODE.ID=@ID

END

11.16: remove duplicate records
-------------------------------

Example 1:
----------

DECLARE @MACHINE VARCHAR(64)

DECLARE CUR1 CURSOR
FOR
SELECT NAME0 FROM VIDENTIFICATION GROUP BY NAME0 HAVING COUNT(*) > 1
OPEN CUR1

FETCH NEXT FROM CUR1 INTO @MACHINE
WHILE (@@FETCH_STATUS <> -1)

BEGIN

  DELETE FROM ASSET_SMS_EXT
  WHERE NAME=@MACHINE

  FETCH NEXT FROM CUR1 INTO @MACHINE

END

Example 2:
----------

Suppose we have this table:

CREATE TABLE [dbo].[tbl_EventMatrix] 
(
	[SessionID]    [int]            NOT NULL ,
	[DaNr]         [int]            NOT NULL ,
	[HSS Ident]    [nvarchar] (15)  NOT NULL ,
	[TaTl]         [datetime]       NOT NULL ,
	[TaTlms]       [smallint]       NOT NULL ,
	[Duration]     [real]           NULL ,
	[H/LVaTl]      [datetime]       NULL ,
	[H/LVaTlms]    [smallint]       NULL ,
	[H/LValue]     [real]           NULL ,
	[Value1]       [nvarchar] (50)  NULL ,
	[Value2]       [nvarchar] (50)  NULL ,
	[Value3]       [nvarchar] (50)  NULL ,
	[False]        [bit] NOT NULL 
) ON [PRIMARY]
GO

AND suppose you want this PK (or Unique Index) defined:

ALTER TABLE [dbo].[tbl_EventMatrix] WITH NOCHECK ADD 
	CONSTRAINT [PK_tbl_EventMatrix] PRIMARY KEY  NONCLUSTERED 
	(
         [SessionID],[DaNr],[HSS Ident],[TaTl],[TaTlms]
        )  
ON [PRIMARY] 
GO

But there are some duplicate records, so the creation of the PK fails.
The following script deals with this problem.

-- BEGIN SCRIPT

Create table ##dup_keys
(
	[SessionID]    [int]            NOT NULL ,
	[DaNr]         [int]            NOT NULL ,
	[HSS Ident]    [nvarchar] (15)  NOT NULL ,
	[TaTl]         [datetime]       NOT NULL ,
	[TaTlms]       [smallint]       NOT NULL 
)

INSERT INTO ##dup_keys
([SessionID],[DaNr],[HSS Ident],[TaTl],[TaTlms])
SELECT 	[SessionID],[DaNr],[HSS Ident],[TaTl],[TaTlms]
FROM tbl_EventMatrix GROUP BY [SessionID],[DaNr],[HSS Ident],[TaTl],[TaTlms]
HAVING COUNT(*) > 1


CREATE TABLE [dbo].[tbl_EventMatrix_TMP] 
(
        [ID]           [int]            NOT NULL identity(1,1),
	[SessionID]    [int]            NOT NULL ,
	[DaNr]         [int]            NOT NULL ,
	[HSS Ident]    [nvarchar] (15)  NOT NULL ,
	[TaTl]         [datetime]       NOT NULL ,
	[TaTlms]       [smallint]       NOT NULL ,
	[Duration]     [real]           NULL ,
	[H/LVaTl]      [datetime]       NULL ,
	[H/LVaTlms]    [smallint]       NULL ,
	[H/LValue]     [real]           NULL ,
	[Value1]       [nvarchar] (50)  NULL ,
	[Value2]       [nvarchar] (50)  NULL ,
	[Value3]       [nvarchar] (50)  NULL ,
	[False]        [bit] NOT NULL 
) ON [PRIMARY]
GO

INSERT INTO tbl_EventMatrix_TMP
([SessionID],[DaNr],[HSS Ident],[TaTl],[TaTlms],[Duration],[H/LVaTl],[H/LVaTlms],[H/LValue],[Value1],[Value2],[Value3],[False])
SELECT * FROM  tbl_EventMatrix

truncate table tbl_EventMatrix

-- CREATE LOOP
DECLARE @SessionID      int          
DECLARE @DaNr           int           
DECLARE @HSS            nvarchar(15)  
DECLARE @TaTl           datetime      
DECLARE @TaTlms         smallint  
DECLARE @MAXID          int

DECLARE cur1 CURSOR FOR
SELECT [SessionID],[DaNr],[HSS Ident],[TaTl],[TaTlms]
FROM ##dup_keys

OPEN cur1
FETCH NEXT FROM cur1 INTO @SessionID,@DaNr,@HSS,@TaTl,@TaTlms

WHILE (@@fetch_status<>-1)
BEGIN

SELECT @MAXID=(SELECT MAX(ID) FROM tbl_EventMatrix_TMP
               WHERE [SessionID]= @SessionID
                 AND [DaNr]     = @DaNr
                 AND [HSS Ident]= @HSS
                 AND [TaTl]     = @TaTl
                 AND [TaTlms]   = @TaTlms)

DELETE FROM tbl_EventMatrix_TMP
WHERE                ([SessionID]= @SessionID
                 AND [DaNr]     = @DaNr
                 AND [HSS Ident]= @HSS
                 AND [TaTl]     = @TaTl
                 AND [TaTlms]   = @TaTlms
                 AND ID < @MAXID)


FETCH NEXT FROM cur1 INTO @SessionID,@DaNr,@HSS,@TaTl,@TaTlms
END

CLOSE cur1
DEALLOCATE cur1


INSERT INTO tbl_EventMatrix
SELECT [SessionID],[DaNr],[HSS Ident],[TaTl],[TaTlms],[Duration],[H/LVaTl],[H/LVaTlms],[H/LValue],[Value1],[Value2],[Value3],[False]
FROM  tbl_EventMatrix_TMP

DROP TABLE tbl_EventMatrix_TMP

-- END SCRIPT


11.17: remove AND add primary key
---------------------------------

ALTER TABLE ASSET_SMS_EXT
DROP CONSTRAINT ASSET_PK

ALTER TABLE ASSET_SMS_EXT
ADD CONSTRAINT ASSET_PK PRIMARY KEY (DWMACHINEID) 

ALTER TABLE SMSMACHINES
ADD CONSTRAINT SMSMACHINES_PK PRIMARY KEY (DWMACHINEID)

ALTER TABLE SMSSW
ADD CONSTRAINT SMSSW_FK FOREIGN KEY (DWMACHINEID) 
REFERENCES SMSMACHINES(DWMACHINEID


BEGIN
  EXEC('ALTER TABLE '+@tab_name+' NOCHECK CONSTRAINT '+@fk_name)
  EXEC('ALTER TABLE '+@tab_name+' DROP CONSTRAINT '+@fk_name)
  FETCH NEXT FROM cur1 INTO @fk_name, @tab_name
END

11.18: tricky procedure to create a dIFference table between 2 tables
---------------------------------------------------------------------

CREATE PROCEDURE DELTA_SMS_CMDB @MYVAR VARCHAR(20)
AS
/* CLEANUP TEMPORY TABLES */
TRUNCATE TABLE SMSSOFT
TRUNCATE TABLE CMDBSOFT
TRUNCATE TABLE DELTA_SMS

/* PUT DATA INTO TABLES */
INSERT INTO SMSSOFT
(NAME, SERIALNUMBER, SOFTWARE)
SELECT DISTINCT SHORTNAME, SERIALNUMBER, SOFTWARE
FROM SMSSOFTWARE WHERE SOFTWARE LIKE '%'+@MYVAR+'%' 
AND SERIALNUMBER IS NOT NULL 
ORDER BY SHORTNAME

INSERT INTO CMDBSOFT
(NAME, SERIALNUMBER, SOFTWARE)
SELECT DISTINCT NAME, SERIALNUMBER, SOFTWARE
FROM CMDBSOFTWARE WHERE SOFTWARE LIKE '%'+@MYVAR+'%'
AND SERIALNUMBER IS NOT NULL
ORDER BY NAME

/* DECLARE NEEDED VARIABLES IN OUR CODE */
DECLARE @i    INT 
DECLARE @j    INT
DECLARE @p    INT
DECLARE @t    INT
DECLARE @nameA VARCHAR(20)
DECLARE @nameB VARCHAR(20)

SELECT @p=(SELECT COUNT(*) FROM CMDBSOFT)

IF @p=0 GOTO EINDE

DECLARE CUR1 CURSOR 
FOR
SELECT SMSSOFT.SERIALNUMBER FROM SMSSOFT
OPEN CUR1
FETCH NEXT FROM CUR1 INTO @nameA

WHILE (@@FETCH_STATUS <> -1)
BEGIN
   INSERT INTO DELTA_SMS
   (NAME, SERIALNUMBER, SOFTWARE)
   SELECT DISTINCT SMSSOFT.NAME, SMSSOFT.SERIALNUMBER, SMSSOFT.SOFTWARE
   FROM SMSSOFT, CMDBSOFT
   WHERE 
   (SMSSOFT.SERIALNUMBER=@nameA) AND (CMDBSOFT.SERIALNUMBER<>@nameA)

FETCH NEXT FROM CUR1 INTO @NAMEA
END

CLOSE CUR1
DEALLOCATE CUR1

/* NOW MAKE THE DIFFERENCE WITH CMDBSOFT, THAT IS CMDBSOFT WITH ONLY @MYVAR */
DECLARE CUR2 CURSOR 
FOR
SELECT CMDBSOFT.SERIALNUMBER FROM CMDBSOFT
OPEN CUR2
FETCH NEXT FROM CUR2 INTO @nameB

WHILE (@@FETCH_STATUS <> -1)
BEGIN
   DELETE FROM DELTA_SMS
   WHERE SERIALNUMBER=@nameB
FETCH NEXT FROM CUR2 INTO @NAMEB
END

CLOSE CUR2
DEALLOCATE CUR2

/* NOW CREATE NEW TABLE WITH @MYVAR DATA */

SELECT @t=(SELECT ID FROM SYSOBJECTS WHERE NAME="DELTA_SMS_'+@MYVAR+'")
IF @t=0
GOTO PROC1

ELSE 
GOTO PROC2 


PROC1:
EXECUTE 
('CREATE TABLE DELTA_SMS_'+@MYVAR+ '(
NAME         VARCHAR(20) NOT NULL,
SERIALNUMBER VARCHAR(30) NULL,
SOFTWARE     VARCHAR(64) NULL,
DEEP         INT IDENTITY(1,1) NOT NULL)')
EXECUTE 
('INSERT INTO DELTA_SMS_'+@MYVAR+ ' (NAME, SERIALNUMBER, SOFTWARE)
SELECT DISTINCT NAME, SERIALNUMBER, SOFTWARE FROM DELTA_SMS')
GOTO EINDE2


PROC2:
EXECUTE ('DROP TABLE DELTA_SMS_'+@MYVAR+'')
EXECUTE 
('CREATE TABLE DELTA_SMS_'+@MYVAR+ '(
NAME         VARCHAR(20) NOT NULL,
SERIALNUMBER VARCHAR(30) NULL,
SOFTWARE     VARCHAR(64) NULL,
DEEP         INT IDENTITY(1,1) NOT NULL)')
EXECUTE 
('INSERT INTO DELTA_SMS_'+@MYVAR+ ' (NAME, SERIALNUMBER, SOFTWARE)
SELECT DISTINCT NAME, SERIALNUMBER, SOFTWARE FROM DELTA_SMS')
GOTO EINDE2


EINDE:

SELECT "TABLE CMDBSOFTWARE DOES NOT CONTAIN THIS PARTICULAR SOFTWARE: "
SELECT @myvar
SELECT "THIS MEANS THAT TABLE SMSSOFTWARE QUERIED ON THIS PARTICULAR SOFTWARE " 
SELECT "IS THE DIFFERENCE LIST."
RETURN


EINDE2:
RETURN


11.19: Some CASE WHEN examples:
-------------------------------

WHILE (SELECT STA_VERWERKING FROM STATUS_CSO) in ('Y','J')
  BEGIN

  END

-----
declare @v1 varchar(10)
 
SELECT @v1=(SELECT name1 FROM t1 WHERE id=1)
SELECT
      CASE @v1
         WHEN 'appie1' THEN 'dit is appie1'
         WHEN 'appie2' THEN 'dit is appie2'
         ELSE 'Gerrit'
      END
-----

CREATE procedure SessionConfig1
AS
SELECT tbl_NetworkSessionTable.[HSS Ident], tbl_NetworkSessionTable.Identnr, 
       tbl_NetworkSessionTable.Value, tbl_NetworkSessionTable.SessionId, 
       tbl_SerialnumberReference.Serialnumber, 
       tbl_SerialnumberReference.SerialnumbersettingID, tbl_SerialnumberReference.type, 
       tbl_SerialnumberReference.[Offset value1], 
       tbl_SerialnumberReference.[Offset value2], 
(SELECT CASE substring([HSS ident],1,3)
         WHEN 'ECU' THEN substring([HSS ident],4,1)
         ELSE ''
         END)  AS Position
FROM tbl_NetworkSessionTable LEFT JOIN tbl_SerialnumberReference 
ON tbl_NetworkSessionTable.Value=tbl_SerialnumberReference.Serialnumber

-----
UPDATE KP_VPT_RELATIES_SBL
   SET VRSB_SOORT_MUTATIE =
       CASE
          WHEN RSBL_BRONSYSTEEM IS NULL THEN 'I'
          WHEN RSBL_BRONSYSTEEM IS NOT NULL THEN 'U'
       END
      FROM
         KP_VPT_RELATIES_SBL
         LEFT JOIN DW_VPT_RELATIES_SBL ON
             RSBL_BRONSYSTEEM                       = VRSB_BRONSYSTEEM
         AND RSBL_CLIENTNUMMER                      = VRSB_CLIENTNUMMER
         AND RSBL_RELATIEROL_CODE                   = VRSB_RELATIEROL_CODE
      WHERE    
          UPPER(VRSB_STATUS_CODE) = 'N'
      AND UPPER(VRSB_SOORT_MUTATIE) in ('I','U')   
-----

11.20: Wait in procedure:
-------------------------

SELECT @l_tijdlus = '000:00:05'

WAITFOR DELAY @l_tijdlus

11.21: Use of sENDmail:
-----------------------

Example 1
---------

declare @rc int

exec xp_cmdshell 'osql -SServername -E -q pkinfo.dbo.csp_pktrap_s -o f:\Reports\report.txt' 

exec @rc = master.dbo.xp_smtp_sENDmail
	@FROM			= N'ServerX',
	@FROM_NAME		= N'DBA OPERATIONS',
	@TO			    = N'A.Brown@xyzcompany.com',
 	@CC			    = N'B.Blackj@xyzcompany.com',
	@BCC			= N'',
	@priority		= N'NORMAL',
	@subject		= N'Weekly report about...',
	@message		= N'The content of: csp_pktrap_s',
	@messagefile	= N'',
	@type			= N'text/plain',
	@attachment		= N'f:\Reports\report.txt',
	@attachments	= N'',
	@codepage		= 0,
	@server 		= N'SVR'
SELECT RC = @rc 
go


Example 2
---------

Use xp_sENDmail with no variables
This example sENDs a message to user Robert King (e-mail is robertk) 
that the master database is full.

EXEC xp_sENDmail 'robertk', 'The master database is full.'

Example 3
---------

Use xp_sENDmail with variables 
This example sENDs the message to users Robert King AND Laura Callahan (e-mail is laurac), 
with copies sent to Anne Dodsworth (e-mail is anned) AND Michael Suyama (e-mail is michaels). 
It also specIFies a subject line for the message.

EXEC xp_sENDmail @recipients = 'robertk;laurac', 
   @message = 'The master database is full.',
   @copy_recipients = 'anned;michaels',
   @subject = 'Master Database Status'


11.22 Transactions:
-------------------

BEGIN TRAN xyx

DELETE oas_grplist
WHERE  grpcode = '1RFC22642'

SELECT
    @l_rowcount       	= @@rowcount
   ,@l_error          	= @@error

IF @l_error  <> 0
BEGIN
   PRINT 'FOUT  OPGETREDEN BIJ UITVOERING - GEEN DELETE UITGEVOERD'
   ROLLBACK TRAN xyz
   GOTO END_section
END

IF @l_rowcount <> @p_maxaantal
BEGIN
   PRINT 'VERKEERDE AANTAL RIJEN GESELECTEERD - ' + CAST(@l_rowcount AS varchar) + ' - GEEN DELETE UITGEVOERD'
   ROLLBACK TRAN grootboeksysteem_DELETE_001
   GOTO END_section
END

COMMIT TRAN xyz
PRINT 'DELETE GROOTBOEKSYSTEEM UITGEVOERD VOOR ' + CAST(@l_rowcount AS varchar) + ' RIJEN.'

END_section:
   RETURN
END

GO

11.23 Assign permission to an object:
-------------------------------------

example:

grant SELECT,INSERT,update on table1 to user_test


11.24 Linked Server:
--------------------

sp_addlinkedserver [ @server = ] 'server'                    -- local name
                   [ , [ @srvproduct = ] 'product_name' ] 
                   [ , [ @provider   = ] 'provider_name' ] 
                   [ , [ @datasrc    = ] 'data_source' ] 
                   [ , [ @location   = ] 'location' ] 
                   [ , [ @provstr    = ] 'provider_string' ] 
                   [ , [ @catalog    = ] 'catalog' ] 


Example 1:
----------

This example creates a linked server named SEATTLESales that uses the 
Microsoft OLE DB Provider for SQL Server.

USE master
GO
EXEC sp_addlinkedserver 
    'SEATTLESales',
    N'SQL Server'
GO

Example 2:
----------

This example creates a linked server named 'TESTsrv' that uses the 
Microsoft OLE DB Provider for Oracle AND assumes that the SQL*Net alias 
for the Oracle database is 'airm'.

USE master
GO

EXEC sp_addlinkedserver
   @server = 'TESTsrv',
   @srvproduct = 'Oracle',
   @provider = 'MSDAORA',  -- sql*net alias
   @datasrc = 'airm'
GO

EXEC sp_addlinkedsrvlogin 'TESTsrv', false, 'w2ksql\Administrator', 'piet', 'piet'
GO

Example 3:
----------

This example creates a linked server named 'TestAccess' to MS Access.

EXEC sp_addlinkedserver 
   @server = 'TestAccess', 
   @provider = 'Microsoft.Jet.OLEDB.4.0', 
   @srvproduct = 'OLE DB Provider for Jet',
   @datasrc = 'C:\MSOffice\Access\Samples\Northwind.mdb'
GO


To query tables of piet:
------------------------

SELECT * FROM TESTsrv..PIET.ABC

INSERT INTO TESTsrv..PIET.ABC
values
(10,'lukt het')

Drop the linked server:
-----------------------

  drop the linked server:
  sp_droplinkedsrvlogin 'TESTsrv', 'w2ksql\Administrator'
  sp_dropserver 'TESTsrv'


Example 4:
----------

Execute sp_addlinkedserver to create the linked server, specIFying MSDAORA 
as provider_name, AND the SQL*Net alias name for the Oracle database instance as data_ source. 
This example assumes that an SQL*Net alias name has been defined as OracleDB.

sp_addlinkedserver 'OrclDB', 'Oracle', 'MSDAORA', 'OracleDB'

Use sp_addlinkedsrvlogin to create login mappings FROM SQL Server logins 
to Oracle logins. 
This example maps the SQL Server login Joe to the linked server 
defined in the former step using the Oracle login AND password OrclUsr AND OrclPwd:

sp_addlinkedsrvlogin 'OrclDB', false, 'Joe', 'OrclUsr', 'OrclPwd'

Example 5:
----------

CREATE PROCEDURE set_link @alias nvarchar(50)
AS
/*
Creates a linked server TO ORACLE database

*/
SET NOCOUNT ON

-- IF exists must drop
IF EXISTS (SELECT * FROM master.dbo.sysservers WHERE srvname = 'OPC_TEMP_LINK_SVR')
BEGIN
EXEC sp_droplinkedsrvlogin 'OPC_TEMP_LINK_SVR', NULL

EXEC sp_dropserver 'OPC_TEMP_LINK_SVR'
END

-- create
EXEC sp_addlinkedserver
@server = 'OPC_TEMP_LINK_SVR', -- used by all dm stored procs
@srvproduct = 'Oracle',
@provider = 'MSDAORA',
@datasrc = @alias --SQL*Net alias for the Oracle database 

EXEC sp_addlinkedsrvlogin 'OPC_TEMP_LINK_SVR', 'false', NULL, 'Oracle_User, 'Oracle_Password'

-- done

GO

11.25 DB options:
-----------------

Example:

EXEC sp_dboption 'sales', 'offline', 'TRUE' 


11.26 fill a table FROM other table:
------------------------------------

Example 1:
----------

Two options to realize this:

SELECT * INTO table_b
FROM table_a

INSERT INTO table_b
SELECT * FROM table_a

Example 2:
----------

INSERT INTO FILELIST
(SERIALNUMBER, FILENAME, FILESIZE, FILEDATE)
SELECT DISTINCT SERIALNUMBER, FILENAME, FILESIZE, FILEDATE
FROM VPFILELIST
WHERE SERIALNUMBER=@SERIALNR

DROP INDEX FILELIST.indxfilelist1
CREATE INDEX indxfilelist1 on FILELIST(SERIALNUMBER) with fillfactor=70

Example 3:
----------

INSERT INTO ##STAGE_LOAD
SELECT ART_NR, LTRIM(ART_OMS), LTRIM(ART_VRP_EENH),  convert(money,ART_PR_EX), convert(money, ART_PR_IN)
FROM ##BCP_LOAD

Example 4:
----------

INSERT INTO NG39AFGP
SELECT
        CONVERT(INT,AFG_PNT_NR),
                U_VERSION,
                AFG_PNT_OMS,
        CONVERT(bit,AFG_PNT_VAST)
FROM ##NG39AFGP


11.27 Use of triggers:
----------------------

A trigger is like a stored procedure, but it is an object that's bound
to a table. It will only run (fire) when an INSERT, UPDATE or DELETE
Statement is issued to this table.
When an INSERT an/or UPDATE and/or DELETE is done on a table, the 
trigger will fire and the statements in the trigger will execute.
A common type of application is that another table will be updated
when the Trigger table (the table WHERE the trigger is defined on) is modified, 
like for example a STOCK table that automatically get's updated when 
an ORDER is placed in the ORDER table.

Special temporary "tables" DELETED and INSERTED can be used with triggers.
They have the same datamodel as the table the trigger is defined on.

DELETED : stores copy / copies of the row(s) that is affected by the 
          DELETE or UPDATE statement.

INSERTED: stores copy / copies of the row(s) that is affected by the 
          INSERT or UPDATE statement.


Example 1:
----------

CREATE TRIGGER orders_INSERT ON orders 
FOR INSERT, UPDATE
AS
UPDATE stock
SET in_stock=in_stock-INSERTed.amount
FROM stock, INSERTed
WHERE stock.item_id=INSERTed.item_id


Example 2:
----------

CREATE TRIGGER employee_insupd
ON employee
FOR INSERT, UPDATE
AS
--Get the range of level for this job type FROM the jobs table.
declare @min_lvl tinyint,
   @max_lvl tinyint,
   @emp_lvl tinyint,
   @job_id smallint
SELECT @min_lvl = min_lvl,
   @max_lvl = max_lvl,
   @emp_lvl = i.job_lvl,
   @job_id = i.job_id
FROM employee e, jobs j, INSERTed i
WHERE e.emp_id = i.emp_id AND i.job_id = j.job_id
IF (@job_id = 1) AND (@emp_lvl <> 10)
BEGIN
   raiserror ('Job id 1 expects the default level of 10.',16,1)
   ROLLBACK TRANSACTION
END
ELSE
IF NOT (@emp_lvl BETWEEN @min_lvl AND @max_lvl)
BEGIN
   raiserror ('The level for job_id:%d should be between %d AND %d.',
      16, 1, @job_id, @min_lvl, @max_lvl)
   ROLLBACK TRANSACTION
END

Example 3:
----------

CREATE TRIGGER TR_NG10QGRP_INS on RES_NG10QGRP FOR INSERT
AS

DECLARE	@ERR_MESSAGE    VARCHAR(128)
DECLARE @FIE_FUNKKODE   VARCHAR(10)
DECLARE @GRP_GRPKODE    VARCHAR(6)

SELECT  @GRP_GRPKODE = i.GRP_GRPKODE FROM inserted i, RES_NG10QGRP p WHERE i.GRP_GRPKODE=p.GRP_GRPKODE    

IF EXISTS (SELECT FUG_GRPKODE FROM RES_NG10QFUG WHERE FUG_GRPKODE=@GRP_GRPKODE)
BEGIN 
  PRINT 'FUG_GRPKODE ALREADY EXIST IN NG10QFUG'
  RAISERROR ('FUG_GRPKODE ALREADY EXIST IN NG10QFUG', 16, 1) WITH LOG
  RETURN
END

ELSE -- vul blok in QFUG voor deze nieuwe GRP, even dynamisch ophalen uit QFIE
 BEGIN  
   DECLARE C_1 CURSOR 
   FOR
   SELECT DISTINCT FIE_FUNKKODE FROM RES_NG10QFIE
   OPEN C_1
   FETCH NEXT FROM C_1 INTO @FIE_FUNKKODE 

    WHILE (@@FETCH_STATUS <> -1)
      BEGIN
      INSERT INTO RES_NG10QFUG
      (FUG_FUNKKODE, FUG_GRPKODE)
      VALUES
      (@FIE_FUNKKODE, @GRP_GRPKODE)

      FETCH NEXT FROM C_1 INTO @FIE_FUNKKODE 
      END

   CLOSE C_1
   DEALLOCATE C_1
 END

Example 4:
----------

CREATE TRIGGER tr_ng10mod_ins on NG10MOD FOR INSERT
as
DECLARE	@UserName     VARCHAR(128)
DECLARE @cinfo        VARBINARY(128)
DECLARE @MODNR        NVARCHAR(16)


SELECT  @cinfo=(SELECT context_info FROM master.dbo.sysprocesses 
               WHERE spid=@@spid)

if @cinfo=0x0                   -- meaning context_info could not be set
   begin
   SET @username=SUSER_SNAME()  -- So, take NT/2000 name as alternative
   end
else
 begin
 SET @username=convert(VARCHAR(128),@cinfo)	
 end

SELECT  @MODNR=i.MODNR FROM inserted i, NG10MOD p WHERE i.MODNR=p.MODNR            

INSERT INTO NG10AUDIT
(TABEL,MODNR,MUTATIE,GEBRUIKER)
VALUES 
(NG10MOD,@MODNR,'T',@username)

GO

Example 5:
----------

View enabled and disabled triggers:

SELECT LEFT(sop.name,36) AS 'Table', LEFT(so.name,36) AS 'Trigger',
CASE WHEN OBJECTPROPERTY(so.id, 'ExecIsTriggerDisabled') = 1 
THEN 'Disabled' ELSE 'Enabled' END AS 'Trigger Status'
FROM sysobjects so
INNER JOIN sysobjects sop ON so.parent_obj = sop.id
WHERE so.xtype = 'TR'
ORDER BY 1, 2

Example 6: Some mixed examples
---------

CREATE TRIGGER updEmployeeData 
ON employeeData 
FOR update AS
/*Check whether columns 2, 3 or 4 has been updated. If any or all of columns 2, 3 or 4 
have been changed, create an audit record. The bitmask is: power(2,(2-1))+power(2,(3-1))+power(2,(4-1)) = 14. 
To check if all columns 2, 3, and 4 are updated, use = 14 in place of >0 (below).*/

IF (COLUMNS_UPDATED() & 14) > 0
/*Use IF (COLUMNS_UPDATED() & 14) = 14 to see if all of columns 2, 3, and 4 are updated.*/
BEGIN
/*DO SOMETHING HERE */
END
GO

There also is an example of how to detect a change in all columns is there are more than 8. 
That example is below. 

CREATE TRIGGER tr1 ON Customers
FOR UPDATE AS
IF ( (SUBSTRING(COLUMNS_UPDATED(),1,1)=power(2,(3-1))
+ power(2,(5-1))) 
AND (SUBSTRING(COLUMNS_UPDATED(),2,1)=power(2,(1-1)))
) 
PRINT 'Columns 3, 5 and 9 updated'
GO

CREATE TRIGGER trg_tableName_upd
ON tableName
AFTER UPDATE
AS
IF UPDATE(columnToCheckValueOn)
BEGIN
INSERT INTO otherTable
SELECT i.* --[or] column1, column2, ...
FROM inserted i
INNER JOIN deleted d ON i.keyCol = d.keyCol --AND i.keyCol2 = d.keyCol2
WHERE i.columnToCheckValueOn = 'valueToCheckFor'
AND i.columnToCheckValueOn <> d.columnToCheckValueOn
END --IF 

CREATE TRIGGER trg_functies
ON functies
AFTER insert, update
AS
DECLARE @x INT
SELECT @x=(SELECT functie_id from INSERTED)
BEGIN
INSERT INTO funct_soft
(functie_id)
values
(@x)
END


11.28 Ways to run DTS:
----------------------

1. call a DTS FROM a stored procedure:

- via
sp_Oacreate

- via job
cREATE PROCEDURE dbo.mydtsjob AS 
exec msdb.dbo.sp_start_job @job_name = 'MyDTSJob'

- via xp_cmdshell

@Result int
SET @Result=EXEC xp_cmdshell 'dtsrun 'Mypackage' '
IF @result=0 
print 'OK'
else
print 'Failure'

use master
exec xp_cmdshell "DTSRun /S servername /U username /P password /N packagename"

create proc newprocedurename as
exec xp_cmdshell "DTSRun /S servername /U username /P password /N packagename"


11.29 Check on Server AND database:
-----------------------------------

DECLARE @db       VARCHAR(128)
DECLARE @srv      VARCHAR(128)

SELECT @db=DB_NAME()

IF @db in ('master', 'model', 'msdb')
BEGIN
PRINT 'PROCEDURE MAY NOT RUN IN THE MASTER, MODEL OR MSDB DATABASE.'
PRINT 'PROCEDURE TERMINATED.'
RETURN
END

SELECT @srv=@@SERVERNAME

IF @srv='CODWDB035P'
BEGIN
PRINT 'PROCEDURE MAY NOT RUN ON CODA PRODUCTION SERVER.'
PRINT 'PROCEDURE TERMINATED.'
RETURN
END


11.30 Use of sp_executesql:
---------------------------

Example 1:
----------

exec sp_executesql N'CREATE TABLE X (ID INT NULL)'

Example 2:
----------

declare @Statement NVARCHAR(1024)
SET @Statement = N'CREATE TABLE Y (ID INT NULL)'
exec sp_executesql @statement


11.32 Use of sp_OaCreate:
-------------------------

Example 1:
----------

CREATE PROCEDURE sp_AppendToFile(@FileName varchar(255), @Text1 varchar(255)) AS
DECLARE @FS int, @OLEResult int, @FileID int


EXECUTE @OLEResult = sp_OACreate 'Scripting.FileSystemObject', @FS OUT
IF @OLEResult <> 0 PRINT 'Scripting.FileSystemObject'

--Open a file
execute @OLEResult = sp_OAMethod @FS, 'OpenTextFile', @FileID OUT, @FileName, 8, 1
IF @OLEResult <> 0 PRINT 'OpenTextFile'

--Write Text1
execute @OLEResult = sp_OAMethod @FileID, 'WriteLine', Null, @Text1
IF @OLEResult <> 0 PRINT 'WriteLine'

EXECUTE @OLEResult = sp_OADestroy @FileID
EXECUTE @OLEResult = sp_OADestroy @FS

Example 2:
----------

CREATE PROCEDURE xp_cmdshell(@cmd varchar(255), @Wait int = 0) AS
  --Create WScript.Shell object
  DECLARE @result int, @OLEResult int, @RunResult int
  DECLARE @ShellID int

  EXECUTE @OLEResult = sp_OACreate 'WScript.Shell', @ShellID OUT
  IF @OLEResult <> 0 SELECT @result = @OLEResult
  IF @OLEResult <> 0 RAISERROR ('CreateObject %0X', 14, 1, @OLEResult)


  EXECUTE @OLEResult = sp_OAMethod @ShellID, 'Run', Null, @cmd, 0, @Wait
  IF @OLEResult <> 0 SELECT @result = @OLEResult
  IF @OLEResult <> 0 RAISERROR ('Run %0X', 14, 1, @OLEResult)
  --If @OLEResult <> 0 EXEC sp_displayoaerrorinfo @ShellID, @OLEResult 


  EXECUTE @OLEResult = sp_OADestroy @ShellID

  return @result


11.31 Use of WHERE CURRENT OF cursor:
-------------------------------------

DECLARE @dossier_id         INT
DECLARE @status_id_dossier  INT
DECLARE @status_id_stam     INT
DECLARE @status             INT

DECLARE cur1 CURSOR FOR
SELECT dossier_id, status_id from G0Q_Dossier

OPEN cur1
FETCH NEXT FROM cur1 INTO @dossier_id, @status_id_dossier

WHILE (@@fetch_status<>-1)
BEGIN

IF @status_id_dossier IS NOT NULL

BEGIN

SELECT @status_id_stam=(select status_id from G0Q_Dossier_status
                        where status=@status_id_dossier)

UPDATE G0Q_Dossier
set status_id=@status_id_stam
where current of cur1

END

FETCH NEXT FROM cur1 INTO @dossier_id, @status_id_dossier
END

CLOSE cur1
DEALLOCATE cur1
GO


11.32 Generate Insert Statements from a table:
----------------------------------------------

This script will generate insert statements for the given tables. You can pass the tables names, 
separated by commas, into sp_DataAsInsCommand stored procedure as in the example below: 

EXEC sp_DataAsInsCommand 'employee,titleauthor,pub_info' 


CREATE PROC sp_DataAsInsCommand (
  @TableList varchar (8000))
AS
SET NOCOUNT ON
DECLARE @position int, @exec_str varchar (2000), @TableName varchar (50)
DECLARE @name varchar(128), @xtype int, @status tinyint, @IsIdentity tinyint
SELECT @TableList = @TableList + ','
SELECT @IsIdentity = 0
SELECT @position = PATINDEX('%,%', @TableList)
WHILE (@position <> 0)
  BEGIN

    SELECT @TableName = SUBSTRING(@TableList, 1, @position - 1)
    SELECT @TableList = STUFF(@TableList, 1, PATINDEX('%,%', @TableList),'')
    SELECT @position = PATINDEX('%,%', @TableList)

    SELECT @exec_str = 'DECLARE fetch_cursor CURSOR FOR '  + 'SELECT a.name, a.xtype, a.status FROM syscolumns a, sysobjects b WHERE a.id = b.id and b.name = ''' + @TableName + ''''
    EXEC (@exec_str)
    OPEN fetch_cursor
    FETCH fetch_cursor INTO @name, @xtype, @status
    IF (@status & 0x80) <> 0
      BEGIN
        SELECT @IsIdentity = 1
        SELECT 'SET IDENTITY_INSERT ' + @TableName + ' ON'
        SELECT 'GO'
      END
    SELECT @exec_str = "SELECT 'INSERT INTO " + @TableName + " VALUES (' + "
    Select ' -- The table name is: ' + @TableName
    --text or ntext
    IF (@xtype = 35) OR (@xtype = 99)
        SELECT @exec_str = @exec_str + '''"None yet"'''
    ELSE

    --image
    IF (@xtype = 34)
        SELECT @exec_str = @exec_str + '"' + '0xFFFFFFFF' + '"'
    ELSE

    --smalldatetime or datetime
    IF (@xtype = 58) OR (@xtype = 61)
        SELECT @exec_str = @exec_str + 'Coalesce(' + ' + ''"'' + ' + ' + CONVERT(varchar,' + @name + ',101)' + ' + ''"''' + ',"null")'
    ELSE

    --varchar or char or nvarchar or nchar
    IF (@xtype = 167) OR (@xtype = 175) OR (@xtype = 231) OR (@xtype = 239)
        SELECT @exec_str = @exec_str + 'Coalesce(' + '''"'' + ' + @name + ' + ''"''' + ',"null")'
    ELSE

    --uniqueidentifier
    IF (@xtype = 36)
        SELECT @exec_str = @exec_str + ' + Coalesce(''"'' + ' + ' + CONVERT(varchar(255),' + @name + ')' + ' + ''"''' + ',"null")'
    ELSE

    --binary or varbinary
    IF (@xtype = 173) OR (@xtype = 165)
        SELECT @exec_str = @exec_str + '"' + '0x0' + '"'
    ELSE

        SELECT @exec_str = @exec_str + 'Coalesce(CONVERT(varchar,' + @name + '), "null")'

    WHILE @@FETCH_STATUS <> -1
      BEGIN
        FETCH fetch_cursor INTO @name, @xtype, @status
        IF (@@FETCH_STATUS = -1) BREAK
        IF (@status & 0x80) <> 0
          BEGIN
            SELECT @IsIdentity = 1
            SELECT 'SET IDENTITY_INSERT ' + @TableName + ' ON'
            SELECT 'GO'
          END

        --text or ntext
        IF (@xtype = 35) OR (@xtype = 99)
           SELECT @exec_str = @exec_str + ' + ","' + ' + ''"None yet"'''
        ELSE

        --image
        IF (@xtype = 34)
           SELECT @exec_str = @exec_str + ' + "," + ' + '"' + '0xFFFFFFFF' + '"'
        ELSE

        --smalldatetime or datetime
        IF (@xtype = 58) OR (@xtype = 61)
           SELECT @exec_str = @exec_str + ' + ","' + ' + Coalesce(''"'' + ' + ' + CONVERT(varchar,' + @name + ',101)' + ' + ''"''' + ',"null")'
        ELSE

        --varchar or char or nvarchar or nchar
        IF (@xtype = 167) OR (@xtype = 175) OR (@xtype = 231) OR (@xtype = 239)
           SELECT @exec_str = @exec_str + ' + ","' + ' + Coalesce(''"'' + ' + @name + ' + ''"''' + ',"null")'
        ELSE

        --uniqueidentifier
        IF (@xtype = 36)
           SELECT @exec_str = @exec_str + ' + ","' + ' + Coalesce(''"'' + ' + ' + CONVERT(varchar(255),' + @name + ')' + ' + ''"''' + ',"null")'
        ELSE

        --binary or varbinary
        IF (@xtype = 173) OR (@xtype = 165)
           SELECT @exec_str = @exec_str + ' + "," + ' + '"' + '0x0' + '"'
        ELSE

           SELECT @exec_str = @exec_str + ' + ","' + ' + Coalesce(CONVERT(varchar,' + @name + '), "null")'
      END

    CLOSE fetch_cursor
    DEALLOCATE fetch_cursor

    SELECT @exec_str = @exec_str + '+ ")" FROM ' + @TableName
    EXEC(@exec_str)
-- print (@exec_str)
    SELECT 'GO'

    IF @IsIdentity = 1
       BEGIN
         SELECT @IsIdentity = 0
         SELECT 'SET IDENTITY_INSERT ' + @TableName + ' OFF'
         SELECT 'GO'
       END
  END


======================================
12. SOME DISASTER RECOVERY SCENARIO'S:  
======================================

IF you encounter a severe error in a SQL Server database,
like corruption, or some sort of serious crash, you should ofcourse
first rely on your Backup-Recovery procedures.

There is no substitute for good regular backups of your
master, msdb, AND your production databases !!!!
But that's a trivial statement.

On the other hAND, there could be situations WHERE it might not be
easy to apply a backup. Or it might turn out that you do
not have a good recent backup.

There are some situations WHERE the following tricks might
help you revive your database, without restoring a backup.

Again, please note that this section may only be regarded
as complementary to good backup/restore procedures.


-- 12.1 'suspect' database.
-- ------------------------

IF you have a suspect database, that type of status will be 
shown in the Enterprise manager.

No USEr can access the database.
Thus, the database is not accessible.


Possible solution 1:
--------------------

IF the suspect status is due to the fact that one more files really are missing,
or corrupt, you should restore a backup.

But... The suspect status could also be due to a situation WHERE one or more files
are, for example, full (AND/or are not allowed to grow further). 
You can imagine a situation WHERE SQL Server needs to recover, but cannot apply the
entries FROM the transaction log, becaUSE the database files are "full" or 
the disks are full.
You might find evidence for this by inspection the SQL Server log.

Suppose that you have, in one way or the other, increased free diskspace.
Now SQL Server has, in principle at least, a way to recover. 

Suppose your suspect database is called 'SALES'.
Now execute sp_resetstatus.

This procedure modIFies the system tables, so the system administrator must enable updates 
to the system tables. In order to make updates to the system catalog possible, 
we "tell" SQL Server that it's allowed to make updates:

USE master
GO
sp_configure 'allow updates', 1
GO
RECONFIGURE WITH OVERRIDE
GO

Now USE sp_resetstatus:

For example:

exec sp_resetstatus 'SALES'

AND restart SQL server (stop AND start MSSQLServer service)

IF this have not helped, maybe SQL server needs another new
datafile or logfile to complete recovery. 

You could add a datafile or a logfile to your database, via special
stored procedures

- to add a datafile (.ndf) AND revive the database, USE:

sp_add_data_file_recover_suspect_db

Adds a data file to a filegroup when recovery cannot complete on a database due to an 
"insufficient space" (1105) error on the filegroup. After the file is added, this stored
procedure turns off the suspect setting AND completes the recovery of the database

- to add a transaction log file AND revive the database, USE:

sp_add_log_file_recover_suspect_db


This procedure adds a log file to a filegroup when recovery cannot complete on a database due to an 
"insufficient log space" (9002) error. After the file is added, this stored 
procedure turns off the suspect setting AND completes the recovery of the database. 


Example syntax:

sp_add_log_file_recover_suspect_db 
[@dbName =] database',
[@name =] 'logical_file_name',
[@filename =] 'os_file_name',
[@size =] 'size',
[@maxsize =] 'max_size',
[@filegrowth =] 'growth_increment'


Example:

Suppose you suspect that you need to add another datafile. Suppose the
database in question is called 'SALES'. Suppose you have space free on H:
Then you could try a statement like the following:

sp_add_log_file_recover_suspect_db SALES, sales_data_005,
�h:\mssql7\data\sales_data_005.ndf', 50

The stored procedure will add the file to your database AND will 'reset'
your database status to a normal value.


Possible solution 2:
--------------------

IF nothing helps, AND filesizes AND diskspaces seems ok, AND you cannot
find a real reason why you have the suspect status, 
AND you do NOT (!) have good backups, AND you feel totally lost by now,
maybe you can USE this procedure.

A suspect database can be placed in 'emercency mode'. IF you have
placed a database in emercency mode, it might be possible
to salvage data via bcp, DTS, SELECT queries etc..
You must consider this option as a last resort.

To place the database 'SALES' in emercency mode, you must
manually update the system table master.dbo.sysdatabases
AND change the status field of database 'SALES'.

First, make it possible to manually update the system tables
using the following statements:

USE master
exec sp_configure 'allow updates', 1
reconfigure with override
go


Update sysdatabases
Set status= 32768
WHERE name=�SALES�


-- 12.2 Re-attach of a database.
--------------------------------


Suppose you have a Server with multiple disks.
Suppose there is a crash in such a way, that
NT/Win2K AND/or SQL Server must be reinstalled (for example,
a disk crash WHERE the program files resides on, or disk crash
of the boot/system disk of NT/2000).

Suppose further, that you have a database 'sales',
which files are on disks unaffected by any crash.

So, here is a situation WHERE NT/2000 or SQL Server
may be unusable, but you have your database files intact.

Ok, so you reinstall NT/2000 AND/or SQL Server, possibly on new disks.
How to 're-attach', or 'register' your sales database
in the fresh SQL Server installation? In other words, how
do you update your master database with existence of 
the marketing database.


There are at least 2 ways to do that en get back into business:


Solution 1: CREATE DATABASE ... FOR ATTATCH
-------------------------------------------

After the fresh SQL Server installation is complete,
you can 'register' your sales database with the rather
special commAND "CREATE DATABASE .. FOR ATTACH".

You only need to mention your .mdf file location in this
commAND, becaUSE actually the primary .mdf file contains
information about the paths AND names of any other database file.

In the example of the sales database, you would enter a commAND
similar to the following:

CREATE DATABASE sales
ON PRIMARY
(
name='sales�,
filename=�f:\mssql\sales.mdf�
)
FOR ATTACH

Your database is back online AND the data is accessible.
But you might have problems with the defined logins AND
database USErs.


Solution 2: USE sp_attachdb
---------------------------

As an example, suppose your 'sales' database files
resides on the following disks with the following layout:

F:\mssql\sales.mdf
G:\mssql\sales_data_001.ndf		
H:\mssql\sales_data_002.ndf
I:\mssql\sales_data_003.ndf
J:\mssql\sales_log_001.ldf
K:\mssql\sales_log_002.ldf

After you have re-installed NT/2000 AND SQL Server. you
can then USE the system stored procedure sp_attachdb
to register your sales database:

Sp_attach_db �sales�,
@filename1=�F:\mssql\sales.mdf�,
@filename2=�G:\mssql\sales_data_001.ndf�,
@filename3=�H:\mssql\sales_data_002.ndf�,	
@filename4=�I:\mssql\sales_data_003.ndf�,
@filename5=�J:\mssql\sales_log_001.ldf�,
@filename6=�K:\mssql\sales_log_002.ldf�

Now your sales database is back online. Still you
will probably have some troubles with login's AND database USErs
of the sales database.
So, having a backup of the master database AND the sales database
is better ofcourse. Anyway, the solution is at least a way to get
all your data back.


Please note: 

- You can also USE sp_attachdb to move your
database to dIFferent disks in your same SQL Server. In this case,
there are no problems at all with defined database USErs.
You should then first USE 'sp_detachdb', then move the files
to their new location, AND execute 'sp_attachdb' to register 
your database again.

- You can also USE sp_detachdb AND sp_attachdb to move or copy
a database across Servers (IF both USE the same character set)


-- 12.3: Backup & Restore:
==========================


1. Type of backup or recovery modes of a database:
--------------------------------------------------

Simple Recovery
---------------

Simple Recovery requires the least administration. In the Simple Recovery model, 
data is recoverable only to the most recent full database or differential backup. 
Transaction log backups are not used, and minimal transaction log space is used. 
After the log space is no longer needed for recovery from server failure, it is reused.

With the Simple Recovery model, the database can be recovered to the point of the last backup. 
However, you cannot restore the database to the point of failure or to a specific point in time. 
To do that, choose either the Full Recovery or Bulk-Logged Recovery model.

The backup strategy for simple recovery consists of: 

-- Database backups.

-- Differential backups (optional). 


Note:  This model is similar to setting the trunc. log on chkpt. database option 
in Microsoft� SQL Server� version 7.0 or earlier.


Full and Bulk-Logged Recovery
-----------------------------

Full Recovery and Bulk-Logged Recovery models provide the greatest protection for data. 
These models rely on the transaction log to provide full recoverability and to prevent work loss 
in the broadest range of failure scenarios. 

The Full Recovery model provides the most flexibility for recovering databases to an earlier point in time. 
For more information, see Full Recovery.

The Bulk-Logged model provides higher performance and lower log space consumption for certain 
large-scale operations (for example, create index or bulk copy). It does this at the expense of some 
flexibility of point-in-time recovery. For more information, see Bulk-Logged Recovery.

The backup strategy for full recovery consists of: 

-- Database backups.

-- Differential backups (optional). 

-- Transaction log backups.


Example of creating a differential backup:
------------------------------------------


It is not possible to create a differential database backup unless 
the database has been backed up first.

-- Create a full database backup first.
BACKUP DATABASE IAMV 
   TO IAMVDUMP 
   WITH INIT
GO
-- Time elapses.
-- Create a differential database backup, appending the backup
-- to the backup device containing the database backup.
BACKUP DATABASE MyNwind
   TO IAMVDUMP
   WITH DIFFERENTIAL
GO


Example of restore a full backup and a differential backup:
-----------------------------------------------------------

Execute the RESTORE DATABASE statement, specifying the NORECOVERY clause, to restore the database backup 
preceding the differential database backup. 

Execute the RESTORE DATABASE statement to restore the differential database backup, specifying: 

--The name of the database to which the differential database backup will be applied.
--The backup device where the differential database backup will be restored from.
--The NORECOVERY clause if you have transaction log backups to apply after the differential database backup
  is restored, otherwise specify the RECOVERY clause. 

-- Assume the database is lost at this point. Now restore the full 
-- database. Specify the original full backup and NORECOVERY.
-- NORECOVERY allows subsequent restore operations to proceed.

RESTORE DATABASE IAMV
   FROM IAMVDUMP
   WITH NORECOVERY
GO

-- Now restore the differential database backup, the second backup on 
-- the IAMVDUMP backup device.

RESTORE DATABASE IAMV
   FROM IAMVDUMP
   WITH FILE = 2,
      RECOVERY
GO


Example of restore a full backup, differental backup and transactionlog backup:
-------------------------------------------------------------------------------

This example restores a database, differential database, and transaction log backup of the IAMV database.

-- Assume the database is lost at this point. Now restore the full 
-- database. Specify the original full backup and NORECOVERY.
-- NORECOVERY allows subsequent restore operations to proceed.

RESTORE DATABASE IAMV
   FROM MyNwind_1
   WITH NORECOVERY
GO

-- Now restore the differential database backup, the second backup on 
-- the MyNwind_1 backup device.

RESTORE DATABASE IAMV
   FROM MyNwind_1
   WITH FILE = 2,
      NORECOVERY
GO

-- Now restore each transaction log backup created after
-- the differential database backup.

RESTORE LOG IAMV
   FROM iamv_log1
   WITH NORECOVERY
GO
RESTORE LOG IAMV
   FROM iamv_log2
   WITH RECOVERY
GO


Example: Restore of full and diff backups:
------------------------------------------

Stel we hebben een database SharePointPortal_Sites",
met een voorbeeld tabel XYZ. XYZ bevat nu alleen nog 1 record.

insert into XYZ
values
(1,'waarde 1')


SELECT * from XYZ

id          name                 
----------- -------------------- 
1           waarde 1

(1) Stel we maken nu een FULL backup of database "SharePointPortal_Sites":

backup database SharePointPortal_Sites to FULLBACKUP_SPPS with init

(2) We voeren een record in in XYZ

insert into XYZ
values
(2,'waarde 2')

SELECT * FROM XYZ

id          name                 
----------- -------------------- 
1           waarde 1
2           waarde 2

Dus dit "tweede" record is niet opgenomen in de full backup.

(3) We maken nu een eerste DIFF backup:

backup database SharePointPortal_Sites to DIFFBACKUP_SPPS with differential, init

(4) We voeren nu een derde record in in XYZ

insert into XYZ
values
(3,'waarde 3')

SELECT * FROM XYZ

id          name                 
----------- -------------------- 
1           waarde 1
2           waarde 2
3           waarde 3

Dus dit "derde" record is niet opgenomen in de full en 1ste DIFF backup.

(5) We maken nu een tweede DIFF backup:

backup database SharePointPortal_Sites to DIFFBACKUP_SPPS with differential, noinit


(6) We doen nu een vierde record in XYZ

insert into XYZ
values
(4,'waarde 4')

SELECT * FROM XYZ

id          name                 
----------- -------------------- 
1           waarde 1
2           waarde 2
3           waarde 3
4           waarde 4


(7) We maken nu een derde DIFF backup:

backup database SharePointPortal_Sites to DIFFBACKUP_SPPS with differential, noinit

(8) NU HEBBEN WE EEN CRASH VAN SharePointPortal_Sites

Wat moeten we restoren?

We hebben 1 FULL backup
We hebben 3 DIFF backups.

Experiment 1:
-------------

RESTORE DATABASE SharePointPortal_Sites
   FROM FULLBACKUP_SPPS
   WITH NORECOVERY
GO

RESTORE DATABASE SharePointPortal_Sites
   FROM DIFFBACKUP_SPPS
   WITH RECOVERY
GO

Database is weer open:

SELECT * FROM XYZ

id          name                 
----------- -------------------- 
1           waarde 1
2           waarde 2

Dit correspondeerd precies met de FULL backup met de 1ste DIFF backup.


More Backup and Restore Examples:
---------------------------------

sp_addumpdevice 'disk', 'bbv_full', 'g:\bbv\bbv_full.bak'
sp_addumpdevice 'disk', 'bbv_log',  'g:\bbv\bbv_log.bak'

backup database bbv to bbv_full with init
backup log bbv to bbv_log with init

Restore example 1:
------------------

RESTORE DATABASE MyNwind
   FROM MyNwind_1, MyNwind_2
   WITH NORECOVERY
RESTORE LOG MyNwind
   FROM MyNwindLog1
   WITH NORECOVERY
RESTORE LOG MyNwind
   FROM MyNwindLog2
   WITH RECOVERY, STOPAT = 'Apr 15, 1998 12:00 AM'

Restore example 2:
------------------

RESTORE DATABASE TestDB 
   FROM DISK = 'c:\Northwind.bak'
   WITH MOVE 'Northwind' TO 'c:\test\testdb.mdf',
   MOVE 'Northwind_log' TO 'c:\test\testdb.ldf'

Restore example 3:
------------------

RESTORE DATABASE bbv FROM DISK = 'g:\bbv\bbv.bak' WITH 
MOVE 'ZOG39BBV_Data' TO 'G:\BBV\bbv_data.mdf',
MOVE 'ZOG39BBV_Log' TO 'G:\BBV\bbv_log.ldf', REPLACE

Restore example 4:
------------------

restore database bbv with recovery
 

-- 12.4: Restore master AND msdb:
=================================

- To start the default instance of SQL Server in single-user mode FROM a commAND prompt 

SQL2000:
--------

FROM a commAND prompt, enter: 
sqlservr.exe -c -m

SQL 7:
------

FROM a commAND prompt, enter: 
sqlservr.exe  -m

- To start a named instance of SQL Server in single-user mode FROM a commAND prompt 

FROM a commAND prompt, enter: 
sqlservr.exe -c - m -s {instancename}

- To restore the master database 

Start Microsoft� SQL Server� in single-user mode.

Execute the RESTORE DATABASE statement to restore the master database backup, specIFying: 
The backup device FROM WHERE the master database backup will be restored. 
Examples
This example restores the master database backup FROM tape without using a permanent (named) backup device.

USE master
GO
RESTORE DATABASE master
   FROM TAPE = '\\.\Tape0'
GO

RESTORE DATABASE master
   FROM DISK = 'c:\master.bak'


Sequence:

1. restore master
2. restore msdb
3. restore user databases

The msdb databse might hold the backup information, 
which you might need to restore the application DBs. 

In SQL 200:

Also run 

sp_dropserver <old_name>
sp_addserver <new_name>


In SQL 7:

run setup again to change the servername


-- 12.5: Repair corrupt systemtable:
------------------------------------

This stored procedure can be used to fix a corruption in a system table
by recreate the index.

Syntax

sp_fixindex  database, systemcatalog, ind_id
 

WHERE

database      - is the database name. database is sysname.
systemcatalog - is the system table name. systemcatalog is sysname.
ind_id        - is the index id value. ind_id is int

Note. Before using this stored procedure the database has to be
      in single user mode.

Example:

USE pubs
GO
EXEC sp_fixindex pubs, sysindexes, 2
GO


-- 12.6: Alter a Database:
--------------------------


ALTER DATABASE database
{    ADD FILE <filespec> [,...n] [TO FILEGROUP filegroup_name]
    | ADD LOG FILE <filespec> [,...n]
    | REMOVE FILE logical_file_name 
    | ADD FILEGROUP filegroup_name
    | REMOVE FILEGROUP filegroup_name
    | MODIFY FILE <filespec>
    | MODIFY FILEGROUP filegroup_name filegroup_property
}


<filespec> ::=
(NAME = logical_file_name
  [, FILENAME = 'os_file_name' ]
  [, SIZE = size]
  [, MAXSIZE = { max_size | UNLIMITED } ]
  [, FILEGROWTH = growth_increment] )


Example 1: Add a file:
----------------------

ALTER DATABASE Test1 
ADD FILE 
(
 NAME = Test1dat2,
 FILENAME = 'c:\mssql7\data\t1dat2.ndf',
 SIZE = 5MB,
 MAXSIZE = 100MB,
 FILEGROWTH = 5MB
)
TO FILEGROUP DATA
GO

Example 2: Shrink a file:
-------------------------

DBCC SHRINKFILE (DataFile1, 700)


======================================
13. Database Creation script examples:
======================================

13.1. Simple example:
---------------------

/* Create Database SALES:                                 */
/* Just 1 system file, 1 datafile, 1 indexfile, 1 logfile */

create database SALES
on PRIMARY
(
name='SALES',
filename='f:\mssql7\SALES.mdf',
size=400MB,
filegrowth= 0MB,
maxsize= 400MB
),
FILEGROUP SALES_DATA
(
name='SALES_DATA_001',
filename='g:\mssql7\SALES_DATA_001.ndf',
size= 4000MB,
filegrowth= 0MB,
maxsize= 4000MB
),
FILEGROUP SALES_INDEX
(
name='SALES_INDEX_001',
filename='h:\mssql7\SALES_INDEX_001.ndf',
size= 4000MB,
filegrowth= 0MB,
maxsize= 4000MB
)
LOG ON
(
name='SALES_LOG_001',
filename='i:\mssql7\SALES_LOG_001.ldf',
size= 3000MB,
filegrowth= 0MB,
maxsize= 3000MB
)

ALTER DATABASE SALES
MODIFY FILEGROUP SALES_DATA DEFAULT
GO


13.2. Extensive example:
------------------------

/* Create Database UMDB                                     */
/* Creates one system file, 16 datafiles, AND 4 log files   */


create database UMDB
on PRIMARY
(
name='UMDB',
filename='f:\mssql7\UMDB.mdf',
size=400MB,
filegrowth= 0MB,
maxsize= 400MB
),
FILEGROUP UMDB_DATA
(
name='UMDB_DATA_001',
filename='g:\mssql7\UMDB_DATA_001.ndf',
size= 4000MB,
filegrowth= 0MB,
maxsize= 4000MB
),
(
name='UMDB_DATA_002',
filename='h:\mssql7\UMDB_DATA_002.ndf',
size= 4000MB,
filegrowth= 0MB,
maxsize= 4000MB
),
(
name='UMDB_DATA_003',
filename='i:\mssql7\UMDB_DATA_003.ndf',
size= 4000MB,
filegrowth= 0MB,
maxsize= 4000MB
),
(
name='UMDB_DATA_004',
filename='j:\mssql7\UMDB_DATA_004.ndf',
size= 4000MB,
filegrowth= 0MB,
maxsize= 4000MB
),
(
name='UMDB_DATA_005',
filename='k:\mssql7\UMDB_DATA_005.ndf',
size= 4000MB,
filegrowth= 0MB,
maxsize= 4000MB
),
(
name='UMDB_DATA_006',
filename='l:\mssql7\UMDB_DATA_006.ndf',
size= 4000MB,
filegrowth= 0MB,
maxsize= 4000MB
),
(
name='UMDB_DATA_007',
filename='m:\mssql7\UMDB_DATA_007.ndf',
size= 4000MB,
filegrowth= 0MB,
maxsize= 4000MB
),
(
name='UMDB_DATA_008',
filename='n:\mssql7\UMDB_DATA_008.ndf',
size= 4000MB,
filegrowth= 0MB,
maxsize= 4000MB
),
FILEGROUP UMDB_EXT
(
name='UMDB_EXT_001',
filename='t:\mssql7\UMDB_EXT_001.ndf',
size= 2000MB,
filegrowth= 0MB,
maxsize= 2000MB
),
FILEGROUP UMDB_SIEBEL
(
name='UMDB_SIEBEL_001',
filename='u:\mssql7\UMDB_SIEBEL_001.ndf',
size= 2000MB,
filegrowth= 0MB,
maxsize= 2000MB
),
FILEGROUP UMDB_IND
(
name='UMDB_IND_001',
filename='v:\mssql7\UMDB_IND_001.ndf',
size= 2000MB,
filegrowth= 0MB,
maxsize= 2000MB
)
LOG ON
(
name='UMDB_LOG_001',
filename='o:\mssql7\UMDB_LOG_001.ldf',
size= 3000MB,
filegrowth= 0MB,
maxsize= 3000MB
),
(
name='UMDB_LOG_002',
filename='p:\mssql7\UMDB_LOG_002.ldf',
size= 3000MB,
filegrowth= 0MB,
maxsize= 3000MB
),
(
name='UMDB_LOG_003',
filename='q:\mssql7\UMDB_LOG_003.ldf',
size= 3000MB,
filegrowth= 0MB,
maxsize= 3000MB
),
(
name='UMDB_LOG_004',
filename='r:\mssql7\UMDB_LOG_004.ldf',
size= 3000MB,
filegrowth= 0MB,
maxsize= 3000MB
)

/* Change the default filegroup FROM primary to VPT_DATA */
ALTER DATABASE UMBD
MODIFY FILEGROUP UMDB_DATA DEFAULT
GO


======================================
14. OTHER STUFF:
======================================


----------------------------
14.1 XML primer: Tutorial 1:
----------------------------

XML:
====

XML was designed to describe data and focus on what data is.

HTML was designed to display data and focus on how data looks.

Extensible Markup Language (XML) is a meta-markup language that provides a format for describing 
structured data. This facilitates more precise declarations of content and more meaningful 
search results across multiple platforms. In addition, XML is enabling a new generation of Web-based 
data viewing and manipulation applications.

An xml file looks like:

<?xml version="1.0" encoding="ISO-8859-1"?>
<note>
<to>Tove</to>
<from>Jani</from>
<heading>Reminder</heading>
<body>Don't forget me this weekEND!</body>
</note>

It has a root element AND child elements. An element can have attributes.

All XML documents must have a root element. All XML documents must contain a single tag pair 
to define a root element. All other elements must be within this root element.
All elements can have sub elements (child elements). Sub elements must be correctly nested 
within their parent element:

<root>
  <child>
    <subchild>.....</subchild>
  </child>
</root>  


- There may be a first line in the document - the XML declaration - which defines 
  the XML version AND the character encoding used in the document.
- There may also be a DTD, which is used is to define the legal building blocks of 
  an XML document. It defines the document structure with a list of legal elements.
  A better alternative to DTD is the use of XSD xml Schema Definition.

<?xml version="1.0" encoding="ISO-8859-1"?>
<!DOCTYPE note SYSTEM "InternalNote.dtd">

XML is a meta-markup language, a set of rules for creating semantic tags used to describe data. 
An XML element is made up of a start tag, an end tag, and data in between. 
The start and end tags describe the data within the tags, which is considered 
the value of the element. For example, the following XML element is a <director> element 
with the value "Ed Wood."

<director>Ed Wood</director>

The element name "director" allows you to mark up the value "Ed Wood" semantically, 
so you can differentiate that particular bit of data from another, similar bit of data. 
For example, there might be another element with the value "Ed Wood."

<actor>Ed Wood</actor>

Because each element has a different tag name, you can easily tell that one element refers to Ed Wood, 
the director of Jail Bait, while the other refers to Ed Wood, the lead actor in Glen or Glenda. 
If there were no way to mark up the data semantically, having two elements with the same value might 
cause some confusion.

In addition, XML tags are case-sensitive, so the following are each a different element.

<City> <CITY> <city>

Attributes:
An element can optionally contain one or more attributes. An attribute is a name-value pair 
separated by an equal sign (=).

<CITY ZIP="01085">Westfield</CITY>

In this example above, ZIP="01085" is an attribute of the <CITY> element. Attributes are used 
to attach additional, secondary information to an element, usually meta-information. 
Attributes can also accept default values, while elements cannot. Each attribute of an element 
can be specified only once, but in any order.

From HTML you will remember this: <IMG SRC="computer.gif">. The SRC attribute provides additional 
information about the IMG element.

In HTML (and in XML), attributes provide additional information about elements, for example:

<img src="computer.gif">
<a href="demo.asp"> 

Attributes often provide information that is not a part of the data. In the example below, 
the file type is irrelevant to the data, but important to the software that wants to manipulate the element:

<file type="gif">computer.gif</file> 


HTML is all about how to show data, and XML decribes what the data means.


XML in HTML:
------------

What is an XML data island?
A data island is an XML document that exists within an HTML page. It allows you to script against 
the XML document without having to load it through script or through the <OBJECT> tag. 
Almost anything that can be in a well-formed XML document can be inside a data island.

The <XML> element marks the beginning of the data island, and its ID attribute provides a name 
that you can use to reference the data island.

The XML for a data island can be either inline:

<XML ID="XMLID">
  <customer>
    <name>Mark Hanson</name>
    <custID>81422</custID>
  </customer>
</XML>

or referenced through a SRC attribute on the <XML> tag:

<XML ID="XMLID" SRC="customer.xml"></XML>

You can also use the <SCRIPT> tag to create a data island:

<SCRIPT LANGUAGE="xml" ID="XMLID">
  <customer>
    <name>Mark Hanson</name>
    <custID>81422</custID>
  </customer>
</SCRIPT>


Show XML (in a browser):
------------------------

Raw XML files can be viewed in IE 5.0 (and higher) and in Netscape 6, but to make it display 
like a web page, you have to add some display information.
XML documents do not carry information about how to display the data.
Since XML tags are "invented" by the author of the XML document, browsers do not know if a tag 
like <table> describes an HTML table or a dining table.

Without any information about how to display the data, most browsers will just 
display the XML document as it is.

To "add" information about HOW to display the data contained in xml,
a number of methods exists:

- css: 
(old method, not much used)
<?xml-stylesheet type="text/css" href="cd_catalog.css"?>, 
links the XML file to the CSS file:

- XSL:
<?xml-stylesheet type="text/xsl" href="simple.xsl"?>
also links the XML file to information about how to display the data


XSL:
----

XSL - The Style Sheet of XML

Because XML does not use predefined tags (we can use any tags we want), 
the meanings of these tags are not understood: <table> could mean an HTML table, 
a piece of furniture, or something else. A browser does not know how to display an XML document.
IF you only have an xml file, IF loaded in the browser, it will show the root AND
child elements in a hierarchical way.

Therefore there must be something in addition to the XML document that describes 
how the document should be displayed; AND that is XSL!

XSLT - XSL Transformations
XSLT is the most important part of the XSL StANDard. 
It is the part of XSL that is used to transform an XML document into another XML document, 
or another type of document that is recognized by a browser. One such format is XHTML. 
Normally XSLT does this by transforming each XML element into an XHTML element.

XSLT can also add new elements into the output file, or remove elements. It can rearrange 
AND sort elements, AND test AND make decisions about which elements to display, AND a lot more.

A common way to describe the transformation process is to say that XSL uses XSLT 
to transform an XML source tree into an XML result tree.

The correct way to declare an XSL style sheet according to the W3C XSL RecommENDation is:

<xsl:stylesheet version="1.0" xmlns:xsl="http://www.w3.org/1999/XSL/Transform"> 

or:

<xsl:transform version="1.0" xmlns:xsl="http://www.w3.org/1999/XSL/Transform"> 

EXAMPLE:
========

Start with your XML Document.

1. Suppose we want to transform the following XML document ("cdcatalog.xml") into XHTML:

<?xml version="1.0" encoding="ISO-8859-1"?>
<catalog>
  <cd>
    <title>Empire Burlesque</title>
    <artist>Bob Dylan</artist>
    <country>USA</country>
    <company>Columbia</company>
    <price>10.90</price>
    <year>1985</year>
  </cd>
.
.
.
</catalog> 

2. Then you create an XSL Style Sheet ("cdcatalog.xsl") with a transformation template: 

<?xml version="1.0" encoding="ISO-8859-1"?>
<xsl:stylesheet version="1.0" xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
<xsl:template match="/">
  <html>
  <body>
    <h2>My CD Collection</h2>
    <table border="1">
    <tr bgcolor="#9acd32">
      <th align="left">Title</th>
      <th align="left">Artist</th>
    </tr>
    <xsl:for-each SELECT="catalog/cd">
    <tr>
      <td><xsl:value-of SELECT="title"/></td>
      <td><xsl:value-of SELECT="artist"/></td>
    </tr>
    </xsl:for-each>
    </table>
  </body>
  </html>
</xsl:template></xsl:stylesheet> 

3. Link the XSL Style Sheet to the XML Document.

Finally, add an XSL Style Sheet reference to your XML document ("cdcatalog.xml"):

<?xml version="1.0" encoding="ISO-8859-1"?>
<?xml-stylesheet type="text/xsl" href="cdcatalog.xsl"?>
<catalog>
  <cd>
    <title>Empire Burlesque</title>
    <artist>Bob Dylan</artist>
    <country>USA</country>
    <company>Columbia</company>
    <price>10.90</price>
    <year>1985</year>
  </cd>
.
.
.
</catalog> 

The result, viewed in a browser, is a table
listing all titles AND artists in a grid.


XSD: XML Schema Definition
==========================

XML Schema is an XML based alternative to DTD.
An XML schema describes the structure of an XML document.
The XML Schema language is also referred to as XML Schema Definition (XSD).

An XML Schema:

defines elements that can appear in a document 
defines attributes that can appear in a document 
defines which elements are child elements 
defines the order of child elements 
defines the number of child elements 
defines whether an element is empty or can include text 
defines data types for elements AND attributes 
defines default AND fixed values for elements AND attributes 

When data is sent FROM a sENDer to a receiver it is essential that both parts have 
the same "expectations" about the content.
With XML Schemas, the sENDer can describe the data in a way 
that the receiver will understAND.

EXAMPLE:
========

1. A Simple XML Document

Look at this simple XML document called "note.xml":

<?xml version="1.0"?>
<note>
<to>Tove</to>
<FROM>Jani</FROM>
<heading>Reminder</heading>
<body>Don't forget me this weekEND!</body>
</note> 


2. A Simple DTD

This is a simple DTD file called "note.dtd" that defines the elements 
of the XML document above ("note.xml"):

<!ELEMENT note (to, FROM, heading, body)>
<!ELEMENT to (#PCDATA)>
<!ELEMENT FROM (#PCDATA)>
<!ELEMENT heading (#PCDATA)>
<!ELEMENT body (#PCDATA)> 

Line 1 defines the note element to have four elements: "to, FROM, heading, body". 
Line 2-5 defines the to element to be of the type "#PCDATA", the FROM element 
to be of the type "#PCDATA", AND so on... 

3. A Simple XML Schema 

This is a simple XML Schema file called "note.xsd" that defines the elements 
of the XML document above ("note.xml"): 

<?xml version="1.0"?>
<xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema"
targetNamespace="http://www.w3schools.com"
xmlns="http://www.w3schools.com"
elementFormDefault="qualIFied">

<xs:element name="note">
    <xs:complexType>
      <xs:sequence>
	<xs:element name="to" type="xs:string"/>
	<xs:element name="FROM" type="xs:string"/>
	<xs:element name="heading" type="xs:string"/>
	<xs:element name="body" type="xs:string"/>
      </xs:sequence>
    </xs:complexType>
</xs:element>
</xs:schema> 


The note element is said to be of a complex type because it contains other elements. 
The other elements (to, FROM, heading, body) are said to be simple types because 
they do not contain other elements.


Referencing a Schema in an XML Document:

This XML document has a reference to an XML Schema:

<?xml version="1.0"?>
<note xmlns="http://www.w3schools.com"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://www.w3schools.com note.xsd">

<to>Tove</to>
<FROM>Jani</FROM>
<heading>Reminder</heading>
<body>Don't forget me this weekEND!</body>
</note> 


14.2 Example of XML formatted data as a result set of a query:
--------------------------------------------------------------

IF you have a virtual directory configured in IIS
AND have configured SQL XML Support in IIS.
Suppose this directory is "c:\reskit\xmltest".


in browser enter as url:

http://w2ksql/xmltest?sql=SELECT+customers.customerid,orderid,orderdate+
FROM+CUSTOMERS+INNER+JOIN+ORDERS+ON+customers.customerid=orders.customerid+FOR+XML+AUTO&root=root

Output:

  <?xml version="1.0" encoding="utf-8" ?> 
- <root>
+ <CUSTOMERS customerid="VINET">
  <ORDERS orderid="10248" orderdate="1996-07-04T00:00:00" /> 
  </CUSTOMERS>
+ <CUSTOMERS customerid="TOMSP">
  <ORDERS orderid="10249" orderdate="1996-07-05T00:00:00" /> 
  </CUSTOMERS>
+ <CUSTOMERS customerid="HANAR">
  <ORDERS orderid="10250" orderdate="1996-07-08T00:00:00" /> 
  </CUSTOMERS>
- <CUSTOMERS customerid="VICTE">
  <ORDERS orderid="10251" orderdate="1996-07-08T00:00:00" /> 
  </CUSTOMERS>
- <CUSTOMERS customerid="SUPRD">
  <ORDERS orderid="10252" orderdate="1996-07-09T00:00:00" /> 
  </CUSTOMERS>
- <CUSTOMERS customerid="HANAR">
  <ORDERS orderid="10253" orderdate="1996-07-10T00:00:00" /> 
  </CUSTOMERS>
etc..

You can also call a stored procedure:
sql=EXECUTE+stored_procedure&root=root


14.3 Example of XML Template:
-----------------------------

IF you have the virtual directory configured to
allow the use of template queries, AND have given a Virtual name
of for example "templates", then create the following
sample file AND save it as suppliers.xml in the virtual directory:

<ROOT xmlns:sql="urn:schemas-microsoft-com:xml-sql">
<sql:query>
SELECT
SupplierID,
CompanyName,
ContactName,
Phone
FROM
suppliers
ORDER BY
CompanyName
FOR XML AUTO
</sql:query>
</ROOT>

FROM a browser a client can now enter

http://localhost/xmltest/templates/suppliers.xml

Output:

- <ROOT xmlns:sql="urn:schemas-microsoft-com:xml-sql">
    <suppliers SupplierID="18" CompanyName="Aux joyeux eccl�siastiques" ContactName="Guyl�ne Nodier" Phone="(1) 03.83.00.68" /> 
    <suppliers SupplierID="16" CompanyName="Bigfoot Breweries" ContactName="Cheryl Saylor" Phone="(503) 555-9931" /> 
    <suppliers SupplierID="5" CompanyName="Cooperativa de Quesos 'Las Cabras'" ContactName="Antonio del Valle Saavedra" Phone="(98) 598 76 54" /> 
    <suppliers SupplierID="27" CompanyName="Escargots Nouveaux" ContactName="Marie Delamare" Phone="85.57.00.07" /> 
 </ROOT>


14.4 Using XLS in a template:
-----------------------------

IF you use a template such as example 14.3, an Extensible Stylesheet Language (XSL) style sheet 
can be applied to the query results. When you execute a template using HTTP, 
you can specIFy an XSL file in these ways: 

-Use the sql:xsl attribute in the template.
-Use the xsl keyword as part of the URL to specIFy the XSL file that will be used to process 
 the resulting XML data. 

In this example, a template includes a simple SELECT statement. 
The query result is processed according to the instructions in the XSL file 
specIFied using sql:xsl.

<?xml version ='1.0' encoding='UTF-8'?>                      
 <root xmlns:sql='urn:schemas-microsoft-com:xml-sql'          
       sql:xsl='MyXSL.xsl'>                              
   <sql:query>                                                
      SELECT FirstName, LastName FROM Employees FOR XML AUTO  
   </sql:query>                                               
</root> 


In this example we use the xsl keyword in the url:

http://IISServer/nwind/template/templateFile.xml?xsl=MyXSL.xsl


14.5 XDR:
---------

You can create XML views of relational data using XDR (XML-Data Reduced) schemas. 
These views can then be queried using XPath queries. This is similar to creating views 
using CREATE VIEW statements AND specIFying SQL queries against the view.

In an XDR schema, the <Schema> element encloses the entire schema. As properties 
of the <Schema> element, you can describe attributes that define the schema name 
AND the namespaces in which the schema reside. In the XDR language, 
all element declarations must be contained within the <Schema> element.

The minimum XDR schema is:

<?xml version="1.0" ?>
<Schema xmlns="urn:schemas-microsoft-com:xml-data">
   ...
</Schema>


Example of an XDR Schema
This example shows how annotations are added to the XDR schema. 
This XDR schema consists of an <Employee> element AND the EmpID, Fname, AND Lname attributes.

<?xml version="1.0" ?>
<Schema xmlns="urn:schemas-microsoft-com:xml-data"
        xmlns:dt="urn:schemas-microsoft-com:datatypes"
        xmlns:sql="urn:schemas-microsoft-com:xml-sql">

<ElementType name="Employee" >
    <AttributeType name="EmpID" />
    <AttributeType name="FName" />
    <AttributeType name="LName" />

    <attribute type="EmpID" />
    <attribute type="FName" />
    <attribute type="LName" />
</ElementType>
</Schema>


Now, annotations are added to this XDR schema to map its elements AND attributes 
to the database tables AND columns. This is the annotated XDR schema:  

<?xml version="1.0" ?>
<Schema xmlns="urn:schemas-microsoft-com:xml-data"
        xmlns:dt="urn:schemas-microsoft-com:datatypes"
        xmlns:sql="urn:schemas-microsoft-com:xml-sql">

<ElementType name="Employee" sql:relation="Employees" >
    <AttributeType name="EmpID" />
    <AttributeType name="FName" />
    <AttributeType name="LName" />

    <attribute type="EmpID" sql:field="EmployeeID" />
    <attribute type="FName" sql:field="FirstName" />
    <attribute type="LName" sql:field="LastName" />
</ElementType>
</Schema>

In the mapping schema, the <Employee> element is mapped to the Employees table 
using sql:relation annotation. The attributes EmpID, Fname, AND Lname are mapped to 
the EmployeeID, FirstName, AND LastName columns in the Employees table using the sql:field annotations.

This annotated XDR schema provides the XML view of the relational data. 
This XML view can be queried using the XPath (XML Path) language. 
The query returns an XML document as a result, instead of the rowset returned by the SQL queries.
----


14.6 OPENXML:
-------------

Up to this point we have been dealing with how to get data from SQL Server and display it in an XML format.  
What about if we want to get data from an XML document into SQL Server?  
This is possible using OPENXML.  

The syntax for OPENXML is like this:

         OPENXML(iDoc, RowPattern, [Flags], 

[WITH (SchemaDeclaration | TableName)]

Let�s talk about those parameters.

iDoc .  We get this by calling a stored procedure called sp_xml_preparedocument.  
        We�ll talk more about this stored procedure in a moment. 
The RowPattern parameter specified which nodes we want OPENXML to process using XPath. 
The Flags parameter specifies the format of our results.  The following values can be used: 
0 � Default value.  Attribute centric mapping. 
1 � Use Attribute centric mapping. 
2 � Use element centric mapping. 
8 � Only unconsumed data should be copied to the overflow property @mp;xmltext. 


So what is attribute and element centric mapping?  Attribute centric grabs the data from 
specific elements whereas element centric grabs data from specific sub elements.  
This will all make sense when we do a couple of examples.  It is possible to enter a value of 3 
for the Flag parameter which indicates that both attribute and element are to be used together.

The WITH clause can be left out completely.  However, if it is used there are two options 
that can be used with it.  The first is by associating the results with an existing database table.  
This comes in handy when the XML document is formatted to fit the structure of table.  
The other is to specify specific columns and their data types.

It�s time to show how all of this works.  
Here is the XML document we are going to use in the following examples.

      
<?xml version=�1.0� encoding=�UTF-8�?>
<ROOT>
    <Team Sponsor = �Yamaha�>
    <Class CC = �125�>
       <Rider>Ivan Tedesco</Rider>
    </Class>
    <Class CC=�250�>
      <Rider>Jeremy McGrath</Rider>
      <Rider>David Vuillemin</Rider>
      <Rider>Damon Bradshaw</Rider>
    </Class>
    </Team>
    <Team Sponsor = �Honda�>
    <Class CC=�125�>
      <Rider>Travis Preston</Rider>
    </Class>
    <Class CC=�250�>
      <Rider>Ricky Carmichael</Rider>
      <Rider>Sebastien Tortelli</Rider>
    </Class>
    </Team>
</ROOT>


Now apply OPENXML to this XML document and here is what it looks like using Attribute-Centric mapping:

Declare @rDoc int, @sDoc varchar(4000)

Set @sDoc = '<ROOT>
    <Team Sponsor = "Yamaha">
    <Class CC = "125">
       <Rider>Ivan Tedesco</Rider>
    </Class>
    <Class CC="250">
      <Rider>Jeremy McGrath</Rider>
      <Rider>David Vuillemin</Rider>
      <Rider>Damon Bradshaw</Rider>
    </Class>
    </Team>
    <Team Sponsor = "Honda">
    <Class CC="125">
      <Rider>Travis Preston</Rider>
    </Class>
    <Class CC="250">
      <Rider>Ricky Carmichael</Rider>
      <Rider>Sebastien Tortelli</Rider>
    </Class>
    </Team>
</ROOT>'

EXEC sp_xml_preparedocument @rDoc OUTPUT, @sDoc

SELECT Sponsor
FROM OPENXML (@rDoc, �/ROOT/Team�, 1)
With Team

EXEC sp_xml_removedocument @rDoc


Let�s take a minute and examine each piece of this.  The first thing we do is load our XML document 
into the @sDoc variable and pass that into the sp_xml_preparedocument.  This procedure takes the 
incoming XML and gets it ready to be used by the OPENXML statement.  It does this by parsing 
the XML and then creating an internal representation of the document in memory.

In our OPENXML statement is an XPath statement �/ROOT/Team� which tells OPENXML to loop through 
the Team nodes and then return all the values of the Sponsor column.

We then use the sp_xml_removedocument to remove the XML document from memory and any references to it.

In this Attribute-Centric example, an attribute name maps directly to a column name of the same name 
in the result set.


We can also insert records using OPENXML and it is not that difficult.   Our source XML stays the same but 
our SQL Statement would look like this:

INSERT Team (Sponsor)
SELECT DISTINCT Sponsor
FROM OPENXML (@rDoc, �ROOT/Team�, 1)
WITH (Sponsor varchar(50) �@Sponsor�)

Now when go look in the Team table we should see a few records inserted into the table 
that look like the following:

         TeamID      Sponsor

1                    Yamaha
2                    Honda
3                    Kawasaki
4                    Suzuki

We could continue this and create follow-up INSERT statements to insert into Class and Rider tables also.  
But we�ll save this for the Worksheet.

If we can insert records we should be able to update records also, true?  Not only is it true, but it is quite easy.  
First, go into the Rider table and add a RiderNumber column.  You can either do this manually thru Enterprise Manager 
or run the following code in SQL Analyzer:

ALTER TABLE Rider ADD RiderNumber int NULL

Now, let�s modify the source XML.  Add a Number attribute to each Rider element as follows:

<Rider Number=�45�>Ivan Tedesco</Rider>
<Rider Number=�2�>Jeremy McGrath</Rider>

It doesn�t matter what the actual numbers are just as long each Rider element has a Number attribute.

Now we modify our SQL to look like the following:

UPDATE Rider
SET RiderNumber = r.RiderNumber
FROM (
SELECT Number, 
OPENXML (@rDoc, �/ROOT/Team/Class/Rider�, 1)
WITH (Number                 int         �@Number�,
Class)) r,
        Team t,
        Rider rt
WHERE t.TeamID = r.TeamID
AND r.Rider = rt.RiderName

Now when we look in the Rider table we should see the new RiderNumber field and the new values in the new column.


==============================================
15. SQL Server buildnumbers AND service packs:
==============================================


15.1 SQL Server 7 buildnumbers and service packs:
-------------------------------------------------

SQL Server 7.0/MSDE 1
 
Main Releases
 
7.00.1063 Service Pack 4 
7.00.961  Service Pack 3 
7.00.842  Service Pack 2 
7.00.699  Service Pack 1 
7.00.623   RTM 

All Releases
 
7.00.1078 SP 4 + Q327068 
7.00.1077 SP 4 + Q327068 
7.00.1076 SP 4 + Q327068 
7.00.1063 Service Pack 4 
7.00.1030 SP 3 + Q318268 
7.00.1004 SP 3 + Q304851 
7.00.996 SP 3 + 
7.00.978 SP 3 + Q285870 
7.00.977 SP 3 + Q284351 
7.00.970 SP 3 + Q283837/Q282243 
7.00.961 Service Pack 3 
7.00.921 SP 2 + Q283837 
7.00.919 SP 2 + Q282243 
7.00.918 SP 2 + Q280380 
7.00.917 SP 2 + Q279180 
7.00.910 SP 2 + Q275901 
7.00.905 SP 2 + Q274266 
7.00.889 SP 2 + Q243741 
7.00.879 SP 2 + Q281185 
7.00.857 SP 2 + Q260346 
7.00.843 SP 2 + 
7.00.842 Service Pack 2 
7.00.835 Service Pack 2 Beta 
7.00.776 SP 1 + Q258087 
7.00.770 SP 1 + Q252905 
7.00.745 SP 1 + Q253738 
7.00.722 SP 1 + Q239458 
7.00.699 Service Pack 1 
7.00.689 Service Pack 1 Beta 
7.00.677 MSDE in Office 2K  Dev 
7.00.662 Q232707 
7.00.658 Q244763 
7.00.657 Q229875 
7.00.643 Q220156 
7.00.623 RTM 


15.2 SQL Server 2000 buildnumbers and service packs:
----------------------------------------------------

SQL Server 2000/MSDE 2
 
Main Releases
 
8.00.760 Service Pack 3 
8.00.534 Service Pack 2 
8.00.384 Service Pack 1 
8.00.194 RTM 

All Releases
 
8.00.760 Service Pack 3  Also SP3a reports the 8.00.760 build number. 
8.00.686 SP 2 + Q316333 
8.00.679 SP 2 + Q316333 
8.00.665 SP 2 + Q316333 
8.00.655 SP 2 + Q316333 
8.00.650 SP 2 + Q316333 
8.00.644 SP 2 + Q324186 
8.00.608 SP 2 + 
Q316333/Q356326/Q356938 
8.00.578 SP 2 + Q316333 
8.00.534 Service Pack 2 
8.00.532 Service Pack 2 Beta 
8.00.452 SP 1 + Q308547 
8.00.444 SP 1 + Q307540/Q307655 
8.00.443 SP 1 + Q307538 
8.00.428 SP 1 + Q304850 
8.00.384 Service Pack 1 
8.00.287 Q297209 
8.00.251 Q300194 
8.00.250 Q291683 
8.00.249 Q288122 
8.00.239 Q285290 
8.00.233 Q282416 
8.00.231 Q282279 
8.00.226 Q278239 
8.00.225 Q281663 
8.00.223 Q280380 
8.00.222 Q281769 
8.00.218 Q279183 
8.00.217 Q279293/Q279296 
8.00.211 Q276329 
8.00.210 Q275900 
8.00.205 Q274330 
8.00.204 Q274329 
8.00.194 RTM 


15.3 Distinguishing Between SP3 and SP3a:
-----------------------------------------

When you for instance run

SELECT @@version

from the QA, both sp3 and sp3a reports the same buildnumber 8.00.760

To determine whether you have SP3 or SP3a installed, look at the version number of the 
Net-Library file, Ssnetlib.dll. If the version number of this file is 2000.80.760.0, you have SP3; 
if the version number of this file is 2000.80.766.0, you have SP3a.


======================================================
16. List of Character sets, sort order AND collations:
======================================================

SQL Server 2000 replaces code pages AND sort orders with collations. 
SQL Server 2000 includes support for most collations supported in earlier versions of SQL Server, 
AND introduces a new set of collations based on Windows collations. You can now specIFy collations 
at the database level or at the column level. Previously, code pages AND sort orders 
could be specIFied only at the server level AND applied to all databases on a server.


Microsoft� SQL Server� 2000 supports several collations. A collation encodes the rules governing 
the proper use of characters for either a language, such as Macedonian or Polish, or an alphabet, 
such as Latin1_General (the Latin alphabet used by western European languages).

Each SQL Server collation specIFies three properties: 

- The sort order to use for Unicode data types (nchar, nvarchar, AND ntext). 
  A sort order defines the sequence in which characters are sorted, AND the way characters 
  are evaluated in comparison operations.

- The sort order to use for non-Unicode character data types (char, varchar, AND text).

- The code page used to store non-Unicode character data. 

The characterset Latin 1 (or ANSI), for example, which is referred to as ISO 8859-1 under SQL Server 7.0, 
is called ISO 1252 in SQL Server 2000's default settings.

For SQL Server installations in use by western European languages 
uses one of the Latin1_General* collations. These collations are split into
collations which are yes or no Case Sensitive (CS/CI), and yes or no Accent Sensitive (AS/AI)

Example 1:
----------

-- Stel we maken twee databases aan, A and B, met verschillende collations:

-- Database A: Latin1_general_CI_AS
-- Database B: SQL_Latin1_General_CP1_CI_AS

-- In A maken we een simple tabelletje aan met de naam "a":

USE A
GO

create table a
(
cust_id   int,
cust_name varchar(20)
)
GO

-- Even in a een testrecord invoeren:

insert into a
values
(1,'Piet')
GO

-- In B maken we een overeenkomend simple tabelletje met de naam "b":

USE B
GO

create table b
(
cust_id   int,
cust_name varchar(20)
)
GO

-- Even in b een testrecord invoeren:

insert into b
values
(1,'Piet')
GO

-- Query 1: 
-- Doe nu de volgende test query:

select a.cust_name,b.cust_name from A.dbo.a a, B.dbo.b b
where a.cust_name=b.cust_name

-- SQL Server komt terug met een error:

-- Server: Msg 446, Level 16, State 9, Line 1
-- Cannot resolve collation conflict for equal to operation.

-- Query 2:
-- Doe nu de volgende test query:

select a.cust_name,b.cust_name from A.dbo.a a, B.dbo.b b
where a.cust_id=b.cust_id

cust_name            cust_name            
-------------------- -------------------- 
Piet                 Piet

(1 row(s) affected)

-- Deze keer is er geen error

--

Het is zo dat bij JOINS, of bij "Where" clauses met comparison operators als =, >, < etc...,
dat deze collation gevoelig zijn als het gaat om char, varchar, nchar, nvarchar, text en ntext velden.
Het is niet gevoelig voor getal gebaseerde columns (int etc..), zoals te zien
is in het bovenstaande voorbeeld.

Database B (of A) zou ook de (standaard aanwezige) SQLServer TEMPDB database kunnen zijn. 
Dus zelfs bij een database waarbij
de queries (of stored procedures) alleen de eigen tabellen en kolommen gebruikt, ogenschijnlijk
in dezelfde database, kunnen mogelijk collation errors optreden. Dit kan gebeuren als
een stored procedure een temporary table (of table datatype) gebruikt 
(via create #tablename, create ##tablename, declare @table_type_name)
welke een construct is in de TEMPDB database, die mogelijk een andere collation gebruikt, en dan
kunnen dus collation errors optreden.

Indien een "gevulde" database reeds een bepaalde collation heeft, dan is deze NIET
gemakkelijk te veranderen naar een andere collation.
Het komt er dan op neer om data uit te pompen, nieuwe db aanmaken (of de nu lege db te wijzigen),
en de data weer inpompen (via bijv. bcp, dts etc..).
Opmerking: Het statement "ALTER DATABASE .. COLLATE collation_name" werkt alleen bij een nog lege database.

De problematische stored procedures en queries (zijn dus niet echt problematisch)
kunnen echter omgebouwd worden zodat er geen errors optreden. 
We kunnen de Server namelijk de "regel / handleiding" meegeven om
een van de set van tablekolommen te herleiden naar de andere collation.

We kunnen Query 1 dan bijvoorbeeld als volgt herbouwen:

select a.cust_name,b.cust_name from A.dbo.a a, B.dbo.b b
where a.cust_name COLLATE SQL_Latin1_General_CP1_CI_AS =b.cust_name

en er treden geen collation errors meer op.

Indien er dus slechts enkele sp's zijn die goochelen met temporary tables,
dan is de ombouw operatie best wel te overzien.
Anderzijds kan mogelijk ook gekozen worden om CREATE #TEMPTABLE te vervangen door
het juiste SELECT INTO.. statement daar deze de properties van source table overneemt.


Tabellen met collation informatie:
----------------------------------

TABLE 1:
--------

Sort order ID - SQL collation name 
30 SQL_Latin1_General_Cp437_BIN 
31 SQL_Latin1_General_Cp437_CS_AS 
32 SQL_Latin1_General_Cp437_CI_AS 
33 SQL_Latin1_General_Pref_CP437_CI_AS 
34 SQL_Latin1_General_Cp437_CI_AI 
40 SQL_Latin1_General_Cp850_BIN 
41 SQL_Latin1_General_Cp850_CS_AS 
42 SQL_Latin1_General_Cp850_CI_AS 
43 SQL_Latin1_General_Pref_CP850_CI_AS 
44 SQL_Latin1_General_Cp850_CI_AI 
49 SQL_1Xcompat_CP850_CI_AS 
50 Latin1_General_BIN 
51 SQL_Latin1_General_Cp1_CS_AS 
52 SQL_Latin1_General_Cp1_CI_AS 
53 SQL_Latin1_General_Pref_CP1_CI_AS 
54 SQL_Latin1_General_Cp1_CI_AI 
55 SQL_AltDiction_Cp850_CS_AS 
56 SQL_AltDiction_Pref_CP850_CI_AS 
57 SQL_AltDiction_Cp850_CI_AI 
58 SQL_ScANDinavian_Pref_Cp850_CI_AS 
59 SQL_ScANDinavian_Cp850_CS_AS 
60 SQL_ScANDinavian_Cp850_CI_AS 
61 SQL_AltDiction_Cp850_CI_AS 
71  Latin1_General_CS_AS 
72  Latin1_General_CI_AS 
73 Danish_Norwegian_CS_AS 
74 Finnish_Swedish_CS_AS 
75 IcelANDic_CS_AS 
80 Hungarian_BIN (or Albanian_BIN, Czech_BIN, AND so on)1 
81 SQL_Latin1_General_Cp1250_CS_AS 
82 SQL_Latin1_General_Cp1250_CI_AS 
83 SQL_Czech_Cp1250_CS_AS 
84 SQL_Czech_Cp1250_CI_AS 
85 SQL_Hungarian_Cp1250_CS_AS 
86 SQL_Hungarian_Cp1250_CI_AS 
87 SQL_Polish_Cp1250_CS_AS 
88 SQL_Polish_Cp1250_CI_AS 
89 SQL_Romanian_Cp1250_CS_AS 
90 SQL_Romanian_Cp1250_CI_AS 
91 SQL_Croatian_Cp1250_CS_AS 
92 SQL_Croatian_Cp1250_CI_AS 
93 SQL_Slovak_Cp1250_CS_AS 
94 SQL_Slovak_Cp1250_CI_AS 
95 SQL_Slovenian_Cp1250_CS_AS 
96 SQL_Slovenian_Cp1250_CI_AS 
104 Cyrillic_General_BIN (or Ukrainian_BIN, Macedonian_BIN) 
105 SQL_Latin1_General_Cp1251_CS_AS 
106 SQL_Latin1_General_Cp1251_CI_AS 
107 SQL_Ukrainian_Cp1251_CS_AS 
108 SQL_Ukrainian_Cp1251_CI_AS 
112 Greek_BIN 
113 SQL_Latin1_General_Cp1253_CS_AS 
114 SQL_Latin1_General_Cp1253_CI_AS 
120 SQL_MixDiction_Cp1253_CS_AS 
121 SQL_AltDiction_Cp1253_CS_AS 
124 SQL_Latin1_General_Cp1253_CI_AI 
128 Turkish_BIN 
129 SQL_Latin1_General_Cp1254_CS_AS 
130 SQL_Latin1_General_Cp1254_CI_AS 
136 Hebrew_BIN 
137 SQL_Latin1_General_Cp1255_CS_AS 
138 SQL_Latin1_General_Cp1255_CI_AS 
144 Arabic_BIN 
145 SQL_Latin1_General_Cp1256_CS_AS 
146 SQL_Latin1_General_Cp1256_CI_AS 
153 SQL_Latin1_General_Cp1257_CS_AS 
154 SQL_Latin1_General_Cp1257_CI_AS 
155 SQL_Estonian_Cp1257_CS_AS 
156 SQL_Estonian_Cp1257_CI_AS 
157 SQL_Latvian_Cp1257_CS_AS 
158 SQL_Latvian_Cp1257_CI_AS 
159 SQL_Lithuanian_Cp1257_CS_AS 
160 SQL_Lithuanian_Cp1257_CI_AS 
183 SQL_Danish_Pref_Cp1_CI_AS 
184 SQL_SwedishPhone_Pref_Cp1_CI_AS 
185 SQL_SwedishStd_Pref_Cp1_CI_AS 
186 SQL_IcelANDic_Pref_Cp1_CI_AS 
192 Japanese_BIN 
193 Japanese_CI_AS 
194 Korean_Wansung_BIN 
195 Korean_Wansung_CI_AS 
196 Chinese_Taiwan_Stroke_BIN 
197 Chinese_Taiwan_Stroke_CI_AS 
198  Chinese_PRC_BIN 
199 Chinese_PRC_CI_AS 
200 Japanese_CS_AS 
201 Korean_Wansung_CS_AS 
202 Chinese_Taiwan_Stroke_CS_AS 
203 Chinese_PRC_CS_AS 
204 Thai_BIN 
205 Thai_CI_AS 
206 Thai_CS_AS 
210 SQL_EBCDIC037_CP1_CS_AS 
211 SQL_EBCDIC273_CP1_CS_AS 
212 SQL_EBCDIC277_CP1_CS_AS 
213 SQL_EBCDIC278_CP1_CS_AS 
214 SQL_EBCDIC280_CP1_CS_AS 
215 SQL_EBCDIC284_CP1_CS_AS 
216 SQL_EBCDIC285_CP1_CS_AS 
217 SQL_EBCDIC297_CP1_CS_AS 

TABLE 2:
--------

SQL7 sort order versus  SQL2000 collation name

30 Binary order, for use with the 437 (U.S. English) character set. 
31 Dictionary order, case-sensitive, for use with the 437 (U.S. English) character set. 
32 Dictionary order, case-insensitive, for use with the 437 (U.S. English) character set. 
33 Dictionary order, case-insensitive, uppercase preference, for use with the 437 (U.S. English) character set. 
34 Dictionary order, case-insensitive, accent-insensitive, for use with the 437 (U.S. English) character set. 
40 Binary order, for use with the 850 (Multilingual) character set. 
41 Dictionary order, case-sensitive, for use with the 850 (Multilingual) character set. 
42 Dictionary order, case-insensitive, for use with the 850 (Multilingual) character set. 
43 Dictionary order, case-insensitive, uppercase preference, for use with the 850 (Multilingual) character set. 
44 Dictionary order, case-insensitive, accent-insensitive, for use with the 850 (Multilingual) character set. 
49 Strict compatibility with version 1.x case-insensitive databases, for use with the 850 (Multilingual) character set. 
50 Binary order for use with 1252 character set. 
51 Dictionary order, case-sensitive, for use with 1252 character set. 
52 Dictionary order, case-insensitive, for use with 1252 character set. 
53 Dictionary order, case-insensitive, uppercase preference, for use with 1252 character set. 
54 Dictionary order, case-insensitive, accent-insensitive, for use with 1252 character set. 
55 Alternate dictionary order, case-sensitive, for use with the 850 (Multilingual) character set. 
56 Alternate dictionary order, case-insensitive, uppercase preference, for use with the 850 (Multilingual) character set. 
57 Alternate dictionary order, case-insensitive, accent-insensitive, for use with the 850 (Multilingual) character set. 
58 ScANDinavian dictionary order, case-insensitive, uppercase preference, for use with the 850 (Multilingual) character set. 
59 ScANDinavian dictionary order, case-sensitive, for use with the 850 (Multilingual) character set. 
60 ScANDinavian dictionary order, case-insensitive, for use with the 850 (Multilingual) character set. 
61 Alternate dictionary order, case-insensitive, for use with the 850 (Multilingual) character set. 
71  Latin-1 case-sensitive, for use with 1252 character set. 
72  Latin-1 case-insensitive, for use with 1252 character set. 
73 Danish/Norwegian case-sensitive sort order for code page 1252. 
74 Finnish/Swedish case-sensitive sort order for code page 1252. 
75 IcelANDic case-sensitive sort order for code page 1252. 
80 Binary order, for use with the 1250 (Central European) character set. 
81 Dictionary order, case-sensitive, for use with the 1250 (Central European) character set. 
82 Dictionary order, case-insensitive, for use with the 1250 (Central European) character set. 
83 Czech dictionary order, case-sensitive, for use with the 1250 (Central European) character set. 
84 Czech dictionary order, case-insensitive, for use with the 1250 (Central European) character set. 
85 Hungarian dictionary order, case-sensitive, for use with the 1250 (Central European) character set. 
86 Hungarian dictionary order, case-insensitive, for use with the 1250 (Central European) character set. 
87 Polish dictionary order, case-sensitive, for use with the 1250 (Central European) character set. 
88 Polish dictionary order, case-insensitive, for use with the 1250 (Central European) character set. 
89 Romanian dictionary order, case-sensitive, for use with the 1250 (Central European) character set. 
90 Romanian dictionary order, case-insensitive, for use with the 1250 (Central European) character set. 
91 Croatian dictionary order, case-sensitive, for use with the 1250 (Central European) character set. 
92 Croatian dictionary order, case-insensitive, for use with the 1250 (Central European) character set. 
93 Slovak dictionary order, case-sensitive, for use with the 1250 (Central European) character set. 
94 Slovak dictionary order, case-insensitive, for use with the 1250 (Central European) character set. 
95 Slovenian dictionary order, case-sensitive, for use with the 1250 (Central European) character set. 
96 Slovenian dictionary order, case-insensitive, for use with the 1250 (Central European) character set. 
97 Windows Polish case-sensitive sort order for code page 1250. 
98 Windows Polish case-insensitive sort order for code page 1250. 
104 Binary order, for use with the 1251 (Cyrillic) character set. 
105 Dictionary order, case-sensitive, for use with the 1251 (Cyrillic) character set. 
106 Dictionary order, case-insensitive, for use with the 1251 (Cyrillic) character set. 
107 Ukrainian dictionary order, case-sensitive, for use with the 1251 (Cyrillic) character set. 
108 Ukrainian dictionary order, case-insensitive, for use with the 1251 (Cyrillic) character set. 
112 Binary order, for use with the 1253 (Greek) character set. 
113 Dictionary order, case-sensitive, for use with the 1253 (Greek) character set. 
114 Dictionary order, case-insensitive, for use with the 1253 (Greek) character set. 
120 Mixed dictionary order, for use with the 1253 (Greek) character set. 
121 Dictionary order, case-sensitive, accent-sensitive, for use with the 1253 (Greek) character set. 
124 Dictionary order, case-insensitive, accent-insensitive, for use with the 1253 (Greek) character set. 
128 Binary order, for use with the 1254 (Turkish) character set. 
129 Dictionary order, case-sensitive, for use with the 1254 (Turkish) character set. 
130 Dictionary order, case-insensitive, for use with the 1254 (Turkish) character set. 
136 Binary order, for use with the 1255 (Hebrew) character set. 
137 Dictionary order, case-sensitive, for use with the 1255 (Hebrew) character set. 
138 Dictionary order, case-insensitive, for use with the 1255 (Hebrew) character set. 
144 Binary order, for use with the 1256 (Arabic) character set. 
145 Dictionary order, case-sensitive, for use with the 1256 (Arabic) character set. 
146 Dictionary order, case-insensitive, for use with the 1256 (Arabic) character set. 
152 Binary order, for use with the 1257 (Baltic) character set. 
153 Dictionary order, case-sensitive, for use with the 1257 (Baltic) character set. 
154 Dictionary order, case-insensitive, for use with the 1257 (Baltic) character set. 
155 Estonian dictionary order, case-sensitive, for use with the 1257 (Baltic) character set. 
156 Estonian dictionary order, case-insensitive, for use with the 1257 (Baltic) character set. 
157 Latvian dictionary order, case-sensitive, for use with the 1257 (Baltic) character set. 
158 Latvian dictionary order, case-insensitive, for use with the 1257 (Baltic) character set. 
159 Lithuanian dictionary order, case-sensitive, for use with the 1257 (Baltic) character set. 
160 Lithuanian dictionary order, case-insensitive, for use with the 1257 (Baltic) character set. 
183 Danish/Norwegian dictionary order, case-insensitive, uppercase preference, for use with 1252 character set. 
184 Swedish/Finnish (StANDard) dictionary order, case-insensitive, uppercase preference, for use with 1252 character set. 
185 Swedish/Finnish (Phone) dictionary order, case-insensitive, uppercase preference, for use with 1252 character set. 
186 IcelANDic dictionary order, case-insensitive, uppercase preference, for use with 1252 character set. 
192 Binary order, for use with the 932 (Japanese) character set. 
193 Dictionary order, case-insensitive, for use with the 932 (Japanese) character set 
194 Binary order, for use with the 949 (Korean) character set. 
195 Dictionary order, case-insensitive, for use with the 949 (Korean) character set. 
196 Binary order, for use with the 950 (Traditional Chinese) character set. 
197 Dictionary order, case-insensitive, for use with the 950 (Traditional Chinese) character set. 
198  Binary order, for use with the 936 (SimplIFied Chinese) character set. 
199 Dictionary order, case-insensitive, for use with the 936 (SimplIFied Chinese) character set. 
200 Dictionary order, case-sensitive, for use with the 932 (Japanese) character set. 
201 Dictionary order, case-sensitive, for use with the 949 (Korean) character set. 
202 Dictionary order, case-sensitive, for use with the 950 (Traditional Chinese) character set. 
203 Dictionary order, case-sensitive, for use with the 936 (SimplIFied Chinese) character set. 
204 Binary order, for use with the 874 (Thai) character set. 
205 Dictionary order, case-insensitive, for use with the 874 (Thai) character set. 
206 Dictionary order, case-sensitive, for use with the 874 (Thai) character set. 


TABLE 3:
--------

Locale_ID Name 
1033 General Unicode 
33280 Binary Order 
1027 Catalan 
197636 Chinese Bopomofo (Taiwan Region) 
2052 Chinese Punctuation 
133124 Chinese Stroke Count 
1028 Chinese Stroke Count (Taiwan Region) 
1050 Croatian 
1029 Czech 
1043 Dutch 
1061  Estonian 
1036 French 
66615 Georgian Modern 
1031 German 
66567 German Phone Book 
1038 Hungarian 
66574 Hungarian Technical 
1039 IcelANDic 
1040 Italian 
1041 Japanese 
66577 Japanese Unicode 
1042 Korean 
66578 Korean Unicode 
1062 Latvian 
1063 Lithuanian 
1071  Macedonian 
1044 Norwegian/Danish 
1045 Polish 
1046 Portuguese 
1048 Romanian 
1051 Slovak 
1060 Slovenian 
1034 Spanish Traditional 
3082 Spanish Modern 
1053 Swedish/Finnish 
1054 Thai 
2057 UK English 
1058  Ukrainian 
1066 Vietnamese 


=================
17. Trace flags:
=================

Trace flags can either be set upon startup of SQL Server by using the -Ttrace# option upon 
SQL Server startup or by using the DBCC TRACEON console commAND. 
Either way the trace flag will be active until SQL Server is restarted or you use 
the DBCC TRACEOFF console commAND to turn the trace flag off. 

1 Sets trace flags for all client connections, rather than for a single client connection. Because trace flags set using the -T commAND-line option automatically apply to all connections, this trace flag is used only when setting trace flags using DBCC TRACEON AND DBCC TRACEOFF.  
106 Disables line number information for syntax errors.  
107 Interprets numbers with a decimal point as float instead of decimal.  
205 Report when a statistics-depENDent stored procedure is being recompiled as a result of AutoStat.  
206 Provides backward compatibility for the setuser statement.  
208 SET QUOTED IDENTIFIER ON.  
242 Provides backward compatibility for correlated subqueries WHERE non-ANSI-stANDard results are desired.  
243 The behavior of SQL Server is now more consistent because nullability checks are made at run time AND a nullability violation results in the commAND terminating AND the batch or transaction process continuing.  
244 Disables checking for allowed interim constraint violations. By default, SQL Server checks for AND allows interim constraint violations. An interim constraint violation is caused by a change that removes the violation such that the constraint is met, all within a single statement AND transaction. SQL Server checks for interim constraint violations for self-referencing DELETE statements, INSERT, AND multirow UPDATE statements. This checking requires more work tables. With this trace flag you can disallow interim constraint violations, thus requiring fewer work tables.  
257 Will invoke a print algorithm on the XML output before returning it to make the XML result more readable.  
260 Prints the versioning information about extENDed stored procedure dlls.  
302 Prints information about whether the statistics page is used, the actual SELECTivity (IF available), AND what SQL Server estimated the physical AND logical I/O would be for the indexes. Trace flag 302 should be used with trace flag 310 to show the actual join ordering.  
310 Prints information about join order. Index SELECTion information is also available in a more readable format using SET SHOWPLAN_ALL, as described in the SET statement.  
325 Prints information about the cost of using a nonclustered index or a sort to process an ORDER BY clause.  
326  Prints information about the estimated AND actual cost of sorts.  
330 Enables full output when using the SET SHOWPLAN_ALL option, which gives detailed information about joins.  
506 Enforces SQL-92 stANDards regarding null values for comparisons between variables AND parameters. Any comparison of variables AND parameters that contain a NULL always results in a NULL.  
652 Disables read ahead for the server.  
653  Disables read ahead for the current connection.  
809  Limits the amount of Lazy Write activity in SQL Server 2000.  
1180  Forces allocation to use free pages for text or image data AND maintain efficiency of storage.  
1200 Prints lock information (the process ID AND type of lock requested).  
1204 Returns the type of lock participating in the deadlock AND the current commAND affect by the deadlock.  
1205 Returns more detailed information about the commAND being executed at the time of a deadlock.  
1206 Used to complement flag 1204 by displaying other locks held by deadlock parties  
1609 Turns on the unpacking AND checking of remote procedure call (RPC) information in Open Data Services. Used only when applications depEND on the old behavior.  
1704  Prints information when a temporary table is created or dropped.  
1807  Allows you to configure SQL Server with network-based database files.  
2505 Prevents DBCC TRACEON 208, SPID 10 errors FROM appearing in the error log.  
2508 Disables parallel non-clustered index checking for DBCC CHECKTABLE.  
2509 Used with DBCC CHECKTABLE.html to see the total count of ghost records in a table  
2528 Disables parallel checking of objects by DBCC commANDs.  
2701 Sets the @@ERROR system function to 50000 for RAISERROR messages with severity levels of 10 or less. When disabled, sets the @@ERROR system function to 0 for RAISERROR messages with severity levels of 10 or less.  
3104 Causes SQL Server to bypass checking for free space.  
3111 Cause LogMgr::ValidateBackedupBlock to be skipped during backup AND restore operations.  
3205 Disables hardware compression for tape drivers.  
3222 Disables the read ahead that is used by the recovery operation during roll forward operations.  
3502  Prints a message to the log at the start AND END of each checkpoint.  
3503 Indicates whether the checkpoint at the END of automatic recovery was skipped for a database (this applies only to read-only databases).  
3602 Records all error AND warning messages sent to the client.  
3604 SENDs trace output to the client. Used only when setting trace flags with DBCC TRACEON AND DBCC TRACEOFF.  
3605 SENDs trace output to the error log. (IF you start SQL Server FROM the commAND prompt, the output also appears on the screen.)  
3607 Skips automatic recovery (at startup) for all databases.  
3608 Skips automatic recovery (at startup) for all databases except the master database.  
3609 Skips the creation of the tempdb database at startup. Use this trace flag IF the device or devices on which tempdb resides are problematic or problems exist in the model database.  
3626  Turns on tracking of the CPU data for the sysprocesses table.  
3640 Eliminates the sENDing of DONE_IN_PROC messages to the client for each statement in a stored procedure. This is similar to the session setting of SET NOCOUNT ON, but when set as a trace flag, every client session is hANDled this way.  
4022  Bypasses automatically started procedures.  
4030 Prints both a byte AND ASCII representation of the receive buffer. Used when you want to see what queries a client is sENDing to SQL Server. You can use this trace flag IF you experience a protection violation AND want to determine which statement caused it. Typically, you can set this flag globally or use SQL Server Enterprise Manager. You can also use DBCC INPUTBUFFER.  
4031 Prints both a byte AND ASCII representation of the sEND buffers (what SQL Server sENDs back to the client). You can also use DBCC OUTPUTBUFFER.  
4032 Traces the SQL commANDs coming in FROM the client. The output destination of the trace flag is controlled with the 3605/3604 trace flags.  
7300 Retrieves extENDed information about any error you encounter when you execute a distributed query.  
7501 Dynamic cursors are used by default on forward-only cursors. Dynamic cursors are faster than in earlier versions AND no longer require unique indexes. This flag disables the dynamic cursor enhancements AND reverts to version 6.0 behavior.  
7502  Disables the caching of cursor plans for extENDed stored procedures.  
7505 Enables version 6.x hANDling of return codes when calling dbcursorfetchex AND the resulting cursor position follows the END of the cursor result set.  
7525 Reverts to the SQL Server 7.0 behavior of closing nonstatic cursors regardless of the SET CURSOR_CLOSE_ON_COMMIT state in SQL Server 2000.  
8202 Replicates all UPDATE commANDs as DELETE/INSERT pairs at the publisher.  
8206 Supports stored procedure execution with a user specIFied owner name for SQL Server subscribers or without owner qualIFication for heterogeneous subscribers in SQL Server 2000.  
8207 Enables singleton updates for Transactional Replication, released with SQL Server 2000 Service Pack 1.  
8599 Allows you to use a savepoint within a distributed transaction.  
8679 Prevents the SQL Server optimizer FROM using a Hash Match Team operator.  
8687 Used to disable query parallelism.  
8721  Dumps information into the error log when AutoStat has been run.  
8783 Allows DELETE, INSERT, AND UPDATE statements to honor the SET ROWCOUNT ON setting when enabled.  
8816  Logs every two-digit year conversion to a four-digit year.  


=======================
18. Auditing examples:
=======================


TEST 1:
=======

-- TEST TABLE TO AUDIT:

CREATE TABLE TEST_PRODUCTS
(
prod_id int not null primary key,
prod_name varchar(20)
)

INSERT INTO TEST_PRODUCTS
values
(1,'auto')


-- TEST AUDIT TRAIL TABLE:

CREATE TABLE TEST_AUDIT
  (
  AuditTrailID Int IDENTITY (1, 1) NOT NULL,
  TableName VarChar (50) NULL, 
  ActionDate  DateTime   NULL,
  type varchar(1)        NULL,
  name VarChar (128)     NULL,
  spid int               NULL,
  cinfo varbinary(128)   NULL
  ) 

-- PROCEDURE TO GET USERNAME:

create procedure set_c1 @name varchar(128)
as
declare @x varbinary(128)
SELECT @x=convert(varbinary(128),@name)
set context_info @x


-- TRIGGERS ON TEST_PRODUCTS:

create trigger tr_audit_ins on TEST_PRODUCTS for INSERT
as
declare	@UserName varchar(128)
declare @cinfo varbinary(128)
SELECT @cinfo=(SELECT context_info FROM master.dbo.sysprocesses 
               WHERE spid=@@spid)
set @username=convert(varchar(128),@cinfo)	
INSERT INTO TEST_AUDIT
(TableName,ActionDate,type,name,spid,cinfo)
VALUES 
('Products',GetDate(),'I',@username,@@spid,@cinfo)
	
create trigger tr_audit_del on TEST_PRODUCTS for DELETE
as
declare	@UserName varchar(128)
declare @cinfo varbinary(128)
SELECT @cinfo=(SELECT context_info FROM master.dbo.sysprocesses 
               WHERE spid=@@spid)
set @username=convert(varchar(128),@cinfo)	
INSERT INTO TEST_AUDIT
(TableName,ActionDate,type,name,spid,cinfo)
VALUES 
('Products',GetDate(),'D',@username,@@spid,@cinfo)

create trigger tr_audit_up on TEST_PRODUCTS for update
as
declare	@UserName varchar(128)
declare @cinfo varbinary(128)
SELECT @cinfo=(SELECT context_info FROM master.dbo.sysprocesses 
               WHERE spid=@@spid)
set @username=convert(varchar(128),@cinfo)	
INSERT INTO TEST_AUDIT
(TableName,ActionDate,type,name,spid,cinfo)
VALUES 
('Products',GetDate(),'U',@username,@@spid,@cinfo)


--------------------------------------------------------------

TEST 2:
=======

CREATE TABLE dbo.AuditTrail 
  (
  AuditTrailID Int IDENTITY (1, 1) NOT NULL,
  TableName VarChar (50) NOT NULL, 
  ActionTaken Char (1) NOT NULL, 
  ActionUser VarChar (50) NOT NULL, 
  ActionDate  DateTime NOT NULL 
  ) 
ON [PRIMARY]
GO


CREATE TRIGGER [AuditINSERTUpdate] ON dbo.Products
  FOR INSERT, UPDATE
  AS
  INSERT INTO AuditTrail (TableName, ActionTaken, ActionUser, ActionDate)
    VALUES ('Products', 'I', User_Name(), GetDate())

UPDATE dbo.Products
SET UnitPrice = 1
WHERE ProductID = 1
 
--------------------------------------------------------------

TEST 3:
=======

CREATE TRIGGER "audit_trigger_Orders" ON "Orders"
FOR INSERT, UPDATE, DELETE
NOT FOR REPLICATION
AS
DECLARE 
@TrigTime DateTime
set @TrigTime = getDate()

UPDATE 
audit_Orders
SET 
audit_ENDdatetime = (@TrigTime), 
audit_ENDappname = (APP_Name()), 
audit_ENDusername = (USER_Name()), 
audit_ENDhostname = (HOST_NAME()) 
FROM 
DELETEd,audit_Orders 
WHERE 
audit_Orders.OrderID = DELETEd.OrderID 
AND 
audit_ENDdatetime = '9/9/9999'

INSERT INTO 
audit_Orders (
OrderID,
CustomerID,
EmployeeID,
OrderDate,
RequiredDate,
ShippedDate,
ShipVia,
Freight,
ShipName,
ShipAddress,
ShipCity,
ShipRegion,
ShipPostalCode,
ShipCountry,
audit_startdatetime,
audit_ENDdatetime,
audit_startusername,
audit_startappname,
audit_starthostname
)
SELECT 
OrderID,
CustomerID,
EmployeeID,
OrderDate,
RequiredDate,
ShippedDate,
ShipVia,
Freight,
ShipName,
ShipAddress,
ShipCity,
ShipRegion,
ShipPostalCode,
ShipCountry,
@TrigTime,
'9/9/9999',
user_name(),
app_name(),
host_name()
FROM
INSERTed
		
--------------------------------------------------------------

TEST 4:
=======

create table trigtest 
(
i_int_key int not null, 
j_int_key int not null, 
s_varchar varchar(10), 
t_char varchar(10), 
d_date datetime
)
go
alter table trigtest 
add constraint pk primary key (i_int_key, j_int_key)
go

create table trigtest_au 
(
i_int_key int not null, 
j_int_key int not null, 
s_varchar varchar(10), 
t_char varchar(10), 
d_date datetime, 
UpdateDate datetime, 
UserName varchar(128), 
type varchar(10)
)
go

create trigger tr_au_trigtest on trigtest for update, DELETE
as
declare	@type varchar(1) ,
	@UpdateDate datetime ,
	@UserName varchar(128)
	IF exists (SELECT * FROM INSERTed)
		SELECT @type = 'U'
	else
		SELECT @type = 'D'

	SELECT 	@UpdateDate = getdate() ,
		@UserName = system_user
	
	INSERT	trigtest_au (i_int_key, j_int_key, s_varchar, t_char, d_date, UpdateDate, UserName, type)
	SELECT	i_int_key, j_int_key, s_varchar, t_char, d_date, @UpdateDate, @UserName, @type + '_old'
	FROM DELETEd
go

create trigger tr_au_trigtest on trigtest for INSERT, update, DELETE
as
declare	@type varchar(1) ,
	@UpdateDate datetime ,
	@UserName varchar(128)
	IF exists (SELECT * FROM INSERTed) AND exists (SELECT * FROM DELETEd)
		SELECT @type = 'U'
	else IF exists (SELECT * FROM INSERTed)
		SELECT @type = 'I'
	else
		SELECT @type = 'D'

	SELECT 	@UpdateDate = getdate() ,
		@UserName = system_user
	
	INSERT	trigtest_au (i_int_key, j_int_key, s_varchar, t_char, d_date, UpdateDate, UserName, type)
	SELECT	i_int_key, j_int_key, s_varchar, t_char, d_date, @UpdateDate, @UserName, @type + '_old'
	FROM DELETEd
	INSERT	trigtest_au (i_int_key, j_int_key, s_varchar, t_char, d_date, UpdateDate, UserName, type)
	SELECT	i_int_key, j_int_key, s_varchar, t_char, d_date, @UpdateDate, @UserName, @type + '_new'
	FROM INSERTed
go


--------------------------------------------------------------

TEST 5:
=======


CREATE TABLE dbo.CUSTOMER_AUDIT 
( 
customerid int not null, 
action char not null, 
modIFieddate datetime not null, 
modIFiedby sysname not null 
) 

CREATE CLUSTERED INDEX IE_CUSTOMER_AUDIT_00 ON dbo.CUSTOMER_AUDIT ( customerid ) 

We can now create a trigger that audits each DML statement, regardless 
of whether it originated FROM a stored procedure or someWHERE else. 

CREATE TRIGGER dbo.CUSTOMER_AUDIT_IUD ON dbo.CUSTOMER FOR INSERT, UPDATE, DELETE 
AS 

DECLARE @i int, @d int, @action char 

IF (@@ROWCOUNT = 0) RETURN 

SELECT @i = COUNT(*) FROM INSERTed 
SELECT @d = COUNT(*) FROM DELETEd 

SELECT @action = CASE 
WHEN (@i != 0) AND (@d = 0) THEN 'I' 
WHEN (@i != 0) AND (@d != 0) THEN 'U' 
WHEN (@i = 0) AND (@d != 0) THEN 'D' 
END 

IF (@action IN ( 'I', 'U' )) 

INSERT INTO dbo.CUSTOMER_AUDIT ( customerid, action, modIFieddate, modIFiedby ) 
SELECT customerid, @action, current_timestamp, suser_sname() 
FROM INSERTed 

ELSE 

INSERT INTO dbo.CUSTOMER_AUDIT ( customerid, action, modIFieddate, modIFiedby ) 
SELECT customerid, @action, current_timestamp, suser_sname() 
FROM DELETEd 

RETURN 


================================================
19. Impersonation and Authentication Delegation:
================================================


19.1. What is Authentication Delegation:
----------------------------------------

Impersonation.
--------------

This mechanism allows a server process to run using the security credentials of the client. 
When the server is impersonating the client, any operations performed by the server are performed 
using the client's credentials. Impersonation does not allow the server to access remote resources 
on behalf of the client. This requires delegation. 

Delegation. 
-----------

Like impersonation, delegation allows a server process to run using the security credentials 
of the client. However, delegation is more powerful and allows the server process to make calls 
to other computers while acting as the client. 


Security account delegation is the ability to connect to multiple servers and, 
with each server change, to retain the authentication credentials 
of the original client. 
For example, if a user (LONDON\joetuck) connects to ServerA, which then connects 
to ServerB, ServerB knows that the connection security identity is LONDON\joetuck.

Delegation is the act of allowing a service to impersonate a user account or a 
computer account to access resources throughout the network. In an N-tier program, 
the user authenticates to a middle-tier service. The middle-tier service authenticates 
to a back-end data server on behalf of the user. 

Delegation depends on the middle-tier service that is being trusted for delegation. 
If this server is set to "Trusted for delegation", the service can impersonate a user to use 
other network services. For example, a user runs a Web program and that Web program 
uses several different SQL databases that exist on different servers. 
When the user authenticates to a server (the front-end server) that is trusted for delegation, 
the server can access the SQL database on the other servers as the user. 

Because the server that is "trusted for delegation" has the user's ticket-granting ticket (TGT), 
it can authenticate to any service on the network. 
In Windows Server 2003, you can control the services that can 
impersonate the user by using constrained delegation. 


19.2. Use of IIS (ASP and ASP.NET) and SQLServer:
-------------------------------------------------

If the SQL and IIS server are separate boxes, you *can't* authenticate a browser client 
against an SQL backend without Kerberos delegation setup correctly, 
NT does not support delegation of access tokens. 

This requires an W2K/W2K3 AD realm , and IE 5.x or higher clients. 
The client machines must be members of the W2K/W2K3 domain. 

The domain "accounts" must have "Delegation" enabled, and the IIS server "machine" account 
must be "trusted for delegation". 

You also have to register SQL server in the AD (see Books online). 

Consider also that: 
- IIS may not run on a DC, as a DC cannot be trusted for delegation. 
- you throw away SQL connection pooling, as connections must carry the same credentials for pooling to work. 


19.3. More information:
------------------------

1. Delegated Authentication

Windows services impersonate clients when accessing resources on their behalf. 
In many cases, a service can complete its work for the client by accessing resources 
on the local computer. Both NTLM and Kerberos provide the information that a service needs 
to impersonate its client locally. However, some distributed applications are designed 
so that a front-end service must impersonate clients when connecting to back-end services 
on other computers. The Kerberos protocol has a proxy mechanism that allows a service 
to impersonate its client when connecting to other services. 

No equivalent is available with NTLM.

Kerberos authentication generates a delegate-level token, as long as the following 
two conditions are met: 

-> The account that you are trying to delegate is not marked 
   "sensitive and cannot be delegated" in the Active Directory.

-> The principal account against which you are authenticating 
   (the user account under which the server process is running) is marked 
   "Trusted for delegation" in the Active Directory.

A typical scenario in which you may want to delegate user credentials is 
if a computer (Computer A) that has Microsoft Internet Explorer installed 
requests Active Server Pages (ASP) pages FROM a Microsoft Internet Information Server (IIS)
Web server on a second computer (Computer B), and the ASP pages invoke 
Component Object Model (COM)/COM+ components on a third computer (Computer C). 
You want the COM/COM+ application to see the identity of the user that is logged on 
to the first computer.

Computer A 		Computer B 			Computer C 
Internet Explorer 	Internet Information Server 	COM/COM+ components 
User A 			User B 				User C 

For delegation to work in this scenario, clear the 
"Account is sensitive and cannot be delegated" check box for User A, 
and SELECT the "Trusted for delegation" check box for Computer B. 
After you configure these settings for User A and Computer B, 
the COM/COM+ application on Computer C can see the identity of the user 
who is logged on to Computer A.


=================================
20. Transaction Isolation levels:
=================================


SET TRANSACTION ISOLATION LEVEL
Controls the default transaction locking behavior for all SELECT statements issued by a connection.

Syntax
SET TRANSACTION ISOLATION LEVEL 
    { READ COMMITTED 
        | READ UNCOMMITTED 
        | REPEATABLE READ 
        | SERIALIZABLE 
    }


The isolation property is one of the four ACID properties a logical unit of work must display 
to qualify as a transaction. It is the ability to shield transactions from the effects 
of updates performed by other concurrent transactions. The level of isolation is actually 
customizable for each transaction.

Microsoft� SQL Server� supports the transaction isolation levels defined in SQL-92. 
Setting transaction isolation levels allows programmers to trade off increased risk 
of certain integrity problems with support for greater concurrent access to data. 
Each isolation level offers more isolation than the previous level, but does so 
by holding more restrictive locks for longer periods. The transaction isolation levels are: 

READ UNCOMMITTED
READ COMMITTED
REPEATABLE READ
SERIALIZABLE 

Transaction isolation levels can be set using Transact-SQL or through a database API:


READ COMMITTED

Specifies that shared locks are held while the data is being read 
to avoid dirty reads, but the data can be changed before the end of the transaction, 
resulting in nonrepeatable reads or phantom data. This option is the SQL Server default.

READ UNCOMMITTED

Implements dirty read, or isolation level 0 locking, which means that no shared locks are issued 
and no exclusive locks are honored. When this option is set, it is possible to read uncommitted or dirty data; 
values in the data can be changed and rows can appear or disappear in the data set 
before the end of the transaction. This option has the same effect as setting NOLOCK on all tables 
in all SELECT statements in a transaction. This is the least restrictive of the four isolation levels.

REPEATABLE READ

Locks are placed on all data that is used in a query, preventing other users from updating the data, 
but new phantom rows can be inserted into the data set by another user and are included in later 
reads in the current transaction. Because concurrency is lower than the default isolation level, 
use this option only when necessary.

SERIALIZABLE

Places a range lock on the data set, preventing other users from updating or inserting rows into the data set 
until the transaction is complete. This is the most restrictive of the four isolation levels. 
Because concurrency is lower, use this option only when necessary. This option has the same effect 
as setting HOLDLOCK on all tables in all SELECT statements in a transaction.

Remarks
Only one of the options can be set at a time, and it remains set for that connection until 
it is explicitly changed. This becomes the default behavior unless an optimization option 
is specified at the table level in the FROM clause of the statement.

The setting of SET TRANSACTION ISOLATION LEVEL is set at execute or run time and not at parse time.

Examples
This example sets the TRANSACTION ISOLATION LEVEL for the session. 
For each Transact-SQL statement that follows, SQL Server holds all of the shared locks 
until the end of the transaction.

SET TRANSACTION ISOLATION LEVEL REPEATABLE READ
GO
BEGIN TRANSACTION
SELECT * FROM publishers
SELECT * FROM authors
...
COMMIT TRANSACTION


Transact-SQL

Transact-SQL scripts and DB-Library applications use the SET TRANSACTION ISOLATION LEVEL statement.

ADO

ADO applications set the IsolationLevel property of the Connection object to 
adXactReadUncommitted, adXactReadCommitted, adXactRepeatableRead, or adXactReadSerializable.

OLE DB

OLE DB applications call ITransactionLocal::StartTransaction with isoLevel set 
to ISOLATIONLEVEL_READUNCOMMITTED, ISOLATIONLEVEL_READCOMMITTED, ISOLATIONLEVEL_REPEATABLEREAD, 
or ISOLATIONLEVEL_SERIALIZABLE

ODBC

ODBC applications call SQLSetConnectAttr with Attribute set to SQL_ATTR_TXN_ISOLATION 
and ValuePtr set to SQL_TXN_READ_UNCOMMITTED, SQL_TXN_READ_COMMITTED, SQL_TXN_REPEATABLE_READ, 
or SQL_TXN_SERIALIZABLE.


Over de mogelijke "locking mechanismen/isolatin levels" in SQL2000, 
en het mogelijk toepasbaar zijn in de iamv applicatie, lees dan aub het volgende:

Er zijn 4 mogelijke transaction isolation levels:

READ UNCOMMITTED (minst restrictieve: bestaat eigenlijk alleen omdat het in het ansi protocol voorkomt) 
READ COMMITTED   (default)
REPEATABLE READ
SERIALIZABLE     (meest restrictieve level)


Stel we hebben joop en klaas
en we maken nu table SALES:

create table sales
(
id      int,
product varchar(10)
)

insert into sales values (1,'Boeken')
insert into sales values (2,'Bier')
insert into sales values (3,'koekjes')
etc..


Stel joop en piet zijn bezig, en onafhankelijk doen ze select, insert en update statements
Je ziet dan echt bijna nooit een locking effect, zelfs al zou bijvoorbeeld piet hebben gedaan

set transaction isolation level SERIALIZABLE

update sales
set product='kaakjes'
where id=3

Meestal is de "vlotheid" van de atomische statements (impliciete transactions) zo, 
dat locks bestaan voor de duration van milliseconds ofzo.

Maar nu doet piet dit:


set transaction isolation level SERIALIZABLE

BEGIN TRANSACTION XYZ
update sales
set product='snoepies'
where id=4


Wat Karel nu ook probeert te doen, bijv. een insert van een nieuwe rij of een update
van een andere rij, het wordt nimmer doorgevoerd totdat 
piet een COMMIT uitvoert op transactie XYZ.

sp_lock laat dan bijvoorbeeld zien (vooraf piet's commit):


spid      dbid   ObjId       IndId  Type Resource         Mode     Status 
------    ------ ----------- ------ ---- ---------------- -------- ------ 
52        7      0           0      DB                    S        GRANT
53        7      0           0      DB                    S        GRANT
53(piet)  7      1977058079  0      TAB                   X        GRANT
54(karel) 7      1977058079  0      TAB                   IX       WAIT
54        7      0           0      DB                    S        GRANT
55        1      85575343    0      TAB                   IS       GRANT


=============================
21. CREATE A DATABASE REPORT:
=============================


DECLARE @NO_OF_OBJ INT

PRINT '-- -----------------------------------------------------'
PRINT '-- CHECK 1: NUMBER OF DIFFERENT OBJECTS IN DATABASE:'
PRINT '-- -----------------------------------------------------'


select @NO_OF_OBJ=(select count(*) from sysobjects where xtype='U')
PRINT 'NO OF TABLES: '+convert(varchar(32),@NO_OF_OBJ)

select @NO_OF_OBJ=(select count(*) from sysobjects where xtype='V')
PRINT 'NO OF VIEWS: '+convert(varchar(32),@NO_OF_OBJ)

select @NO_OF_OBJ=(select count(*) from sysobjects where xtype='P')
PRINT 'NO OF STORED PROCEDURES: '+convert(varchar(32),@NO_OF_OBJ)

select @NO_OF_OBJ=(select count(*) from sysobjects where xtype='TR')
PRINT 'NO OF TRIGGERS: '+convert(varchar(32),@NO_OF_OBJ)

select @NO_OF_OBJ=(select count(*) from sysobjects where xtype='C')
PRINT 'NO OF CHECK CONSTRAINTS: '+convert(varchar(32),@NO_OF_OBJ)

select @NO_OF_OBJ=(select count(*) from sysobjects where xtype='D')
PRINT 'NO OF DEFAULT CONSTRAINTS: '+convert(varchar(32),@NO_OF_OBJ)

select @NO_OF_OBJ=(select count(*) from sysobjects where xtype='R')
PRINT 'NO OF RULES: '+convert(varchar(32),@NO_OF_OBJ)

PRINT '-- -----------------------------------------------------'
PRINT '-- CHECK 2: COUNT OF NO OF RECORDS IN ALL TABLES OF THE DATABASE:'
PRINT '-- -----------------------------------------------------'

SET NOCOUNT ON

declare @TABLE VARCHAR(64)
declare @num   INT

declare c1 cursor for
select name from sysobjects where xtype='U' order by name
open c1
fetch next from c1 into @table

while (@@fetch_status<>-1)
begin
print 'count records for: '+@table
exec('select count(*) from '+@table)
fetch next from c1 into @table
end

close c1
deallocate c1 


PRINT '-- -----------------------------------------------------'
PRINT '-- CHECK 3: LIST OF ALL PRIMARY KEYS:'
PRINT '-- -----------------------------------------------------'


SELECT substring(name,1,30) AS "PrimaryKey", 
       id, xtype, object_name(parent_obj) AS "Parent_table" 
FROM   sysobjects
WHERE  xtype='PK'
ORDER BY object_name(parent_obj)


PRINT '-- -----------------------------------------------------'
PRINT '-- CHECK 4: LIST OF ALL FOREIGN KEYS:'
PRINT '-- -----------------------------------------------------'


PRINT 'LIST 1:'
PRINT '-------'

SELECT substring(name, 1, 40) as "ForeignKey", 
       substring(object_name(parent_obj), 1, 30) as "TableWithFK"
FROM   sysobjects o, sysreferences r
WHERE  o.type='F'
AND    o.name=object_name(r.constid)
ORDER BY name

PRINT 'LIST 2:'
PRINT '-------'

SELECT substring(object_name(constid), 1, 40) AS FK,
       substring(object_name(fkeyid), 1, 30)  AS "Referencing Table",
       substring(object_name(rkeyid), 1, 30)  AS "Referenced Table"
FROM   sysreferences
ORDER BY object_name(fkeyid)

PRINT '-- -----------------------------------------------------'
PRINT '-- CHECK 5: LIST OF COLUMNS AND DATATYPES:'
PRINT '-- -----------------------------------------------------'


PRINT 'LIST OF ALL COLUMNS AND DATATYPES:'
PRINT '----------------------------------'


SELECT substring(c.name, 1, 30) as "ColumName",
       c.xtype, 
       substring(object_name(c.id),1,30) as "TableName", 
       substring(t.name,1,30) as "DataType"
FROM   syscolumns c, systypes t
WHERE  c.xtype=t.xtype
AND    object_name(c.id) in (SELECT name FROM sysobjects WHERE xtype='U')
AND    t.name not like '%sysname%'
ORDER BY object_name(c.id)


PRINT '-- -----------------------------------------------------'
PRINT '-- CHECK 6: LIST OF DATABASE OPTIONS AND COLLATION:'
PRINT '-- -----------------------------------------------------'

PRINT 'DATABASE OPTIONS:'
PRINT '-----------------'

exec sp_dboption

PRINT 'DATABASE COLLATION:'
PRINT '-------------------'

exec sp_helpsort

PRINT 'END OF LISTING.'


-- END OF FILE


======================================================================
22. SOME BACKUP SCRIPTS
======================================================================


SCRIPT 1:
=========


SET QUOTED_IDENTIFIER OFF 
GO
SET ANSI_NULLS ON 
GO


create procedure usp_build_restore_script
as
--
-- This stored procedure was written by Greg Larsen for Washington State Department of Health.
-- Date: 12/16/2001
--
-- Description:
--  This stored procedure generates TSQL script that will restore all the databases 
--  on the current SQL Server.  This stored procedure takes into account when the last 
--  full and differential backups where taken, and how many transaction log backups 
--  have been taken since the last database backup, based on the information in
--  the msdb database. 
--
-- Modified:
--
--
-- Declare variables used in SP
declare @cmd nvarchar (1000) 
declare @cmd1 nvarchar (1000) 
declare @db nvarchar(128)
declare @filename nvarchar(128)
declare @cnt int
declare @num_processed int
declare @name nvarchar(128) 
declare @physical_device_name nvarchar(128) 
declare @backup_start_date datetime
declare @type char(1) 
-- Turn off the row number message
set nocount on
 
-- SECTION 1 ----------------------------------------------
-- Define cursor to hold all the different databases for the restore script will be built
declare db cursor for 
select name from master..sysdatabases
where name not in ('tempdb') 
 
-- Create a global temporary table that will hold the name of the backup, the database name, and the type of database backup.
create table ##backupnames (
name nvarchar(100), 
database_name nvarchar(100), 
type char(1) )
 
-- Open cursor containing list of database names.
open db
fetch next from db into @db
 
-- Process until no more databases are left
WHILE @@FETCH_STATUS = 0
BEGIN
-- Subsection 1A --------------------------------------------
-- initialize the physical device name
 set @physical_device_name = ''
-- get the name of the last full database backup
 select @physical_device_name = physical_device_name , @backup_start_date = backup_start_date
 from  msdb..backupset a join msdb..backupmediaset b on a.media_set_id = b.media_set_id
      join msdb..backupmediafamily c on a.media_set_id = c.media_set_id 
       where type='d' and backup_start_date = 
        (select top 1 backup_start_date from msdb..backupset 
             where @db = database_name and type = 'd'
              order by backup_start_date desc)  
-- Did a full database backup name get found 
if @physical_device_name <> '' 
begin
-- Build command to place a record in table that holds backup names
  select @cmd = 'insert into ##backupnames values (' + char(39) + 
              @physical_device_name + char(39) + ',' + char(39) + @db + char(39) + ',' +
              char(39) + 'd' + char(39)+ ')'     
-- Execute command to place a record in table that holds backup names      
  exec sp_executesql @cmd
end
-- Subsection 1B --------------------------------------------
-- Reset the physical device name 
set @physical_device_name = ''
-- Find the last differential database backup
 select @physical_device_name = physical_device_name, @backup_start_date = backup_start_date 
 from  msdb..backupset a join msdb..backupmediaset b on a.media_set_id = b.media_set_id
      join msdb..backupmediafamily c on a.media_set_id = c.media_set_id 
       where type='i' and backup_start_date = 
        (select top 1 backup_start_date from msdb..backupset 
             where @db = database_name and type = 'I' and backup_start_date > @backup_start_date 
              order by backup_start_date desc) 
-- Did a differential backup name get found
if @physical_device_name <> ''
begin
 
-- Build command to place a record in table that holds backup names
  select @cmd = 'insert into ##backupnames values (' + char(39) + 
              @physical_device_name + char(39) + ',' + char(39) + @db + char(39) + ',' +
              char(39) + 'i' + char(39)+ ')'     
-- Execute command to place a record in table that holds backup names        
  exec sp_executesql @cmd
end
-- Subsection 1C --------------------------------------------
-- Build command to place records in table to hold backup names for all 
-- transaction log backups from the last database backup
set @CMD = 'insert into ##backupnames select physical_device_name,' + char(39) + @db + char(39) + 
 ',' + char(39) + 'l' + char(39) +   
 'from  msdb..backupset a join msdb..backupmediaset b on a.media_set_id = b.media_set_id join ' + 
 'msdb..backupmediafamily c on a.media_set_id = c.media_set_id ' +  
       'where type=' + char(39) + 'l' + char(39) + 'and backup_start_date >  @backup_start_dat and' + 
 char(39) + @db + char(39) + ' = database_name order by backup_start_date'
-- Execute command to place records in table to hold backup names 
--  for all transaction log backups from the last database backup
exec sp_executesql @cmd,@params=N'@backup_start_dat datetime', @backup_start_dat = @backup_start_date
-- get next database to process 
fetch next from db into @db
end
-- close 
close db
-- Section B ----------------------------------------------
open db
-- Get first recod from database list cursor
fetch next from db into @db
-- Generate Heading in Restore script
print '-- Restore All databases'
 
-- Process all databases
WHILE @@FETCH_STATUS = 0
BEGIN
-- define cursor for all database and log backups for specific database being processed
  declare backup_name cursor for 
     select name,type from ##backupnames where database_name = @DB
-- Open cursor containing list of database backups for specific database being processed  
  open backup_name
-- Determine the number of different backups available for specific database being processed
  select @CNT = count(*) from ##backupnames where database_name = @DB 
-- Get first database backup for specific database being processed
  fetch next from backup_name into @physical_device_name, @type
-- Set counter to track the number of backups processed
  set @NUM_PROCESSED = 0
-- Process until no more database backups exist for specific database being processed
  WHILE @@FETCH_STATUS = 0
  BEGIN
-- Increment the counter to track the number of backups processed
  set @NUM_PROCESSED = @NUM_PROCESSED + 1
-- Is the number of database backup processed the same as the number of different backups 
-- available for specific database being processed?
  if @CNT = @NUM_PROCESSED
-- If so, is the type of backup currently being processed a transaction log backup?
    if @TYPE = 'l'
-- build restore command to restore the last transaction log
      select @cmd = 'restore log ' + rtrim(@db) + char(13) +
              ' from disk = ' + char(39) +  
               rtrim(substring(@physical_device_name,1,len(@physical_device_name))) + 
                 char(39) + char(13) + ' with replace'
    else
-- Last backup was not a transaction log backup
-- Build restore command to restore the last database backup 
      select @cmd = 'restore database ' + rtrim(@db) + char(13) +
            ' from disk = ' + char(39) +  
             rtrim(substring(@physical_device_name,1,len(@physical_device_name))) + 
               char(39) + char(13) + ' with replace'
  else 
-- Current backup is not the last backup
-- Is the current backup being processed a transaction log backup?
    if @TYPE = 'l'
-- Build restore command to restore the current transaction backup, with no recovery
      select @cmd = 'restore log ' + rtrim(@db) + char(13) +
              ' from disk = ' + char(39) +  
               rtrim(substring(@physical_device_name,1,len(@physical_device_name))) + 
                 char(39) + char(13) + ' with replace, norecovery'
    else
-- Current backup being processed is not a transaction log backup
-- Build restore command to restore the currrent database backup, with no recovery
      select @cmd = 'restore database ' + rtrim(@db) + char(13) +
           ' from disk = ' + char(39) +  
            rtrim(substring(@physical_device_name,1,len(@physical_device_name))) + 
              char(39) + char(13) + ' with replace, norecovery'

-- if it is master comment line out
   if @db = 'master' 
      set @cmd = '/* ' + char(13) + @cmd + char(13) + '*/'
-- Generate the restore command and other commands for restore script
   print @cmd
   print 'go'
   print ' '
    
-- Get next database backup to process
  fetch next from backup_name into @physical_device_name, @type
end 
-- Close and deallocate database backup name cursor for current database being processed
close backup_name
deallocate backup_name
-- Get next database to process
  fetch next from db into @db
end
-- Close and deallocate cursor containing list of databases to process
close db
deallocate db
-- Drop global temporary table 
drop table ##backupnames


GO
SET QUOTED_IDENTIFIER OFF 
GO
SET ANSI_NULLS ON 
GO


SCRIPT 2:
=========

SET NOCOUNT ON

DECLARE @NAME         VARCHAR(128)
DECLARE @DATUM        DATETIME
DECLARE @BACKUP_DATUM VARCHAR(128)

SELECT @DATUM=GETDATE()
SELECT @BACKUP_DATUM=CONVERT(VARCHAR(10),@DATUM,20)

-- NU DE DATABASENAMEN OPHALEN UIT DE DICTIONARY

DECLARE c1 CURSOR FOR
SELECT   name FROM master.dbo.sysdatabases WHERE name not like '%AIDA%'

OPEN c1

FETCH NEXT FROM c1 INTO @NAME

WHILE (@@fetch_status<>-1)
BEGIN

  PRINT 'BACKUP DATABASE '+@NAME+' TO DISK=''d:\backup\local_dbs\'+@NAME+'_'+@BACKUP_DATUM+'.dmp'
  PRINT 'GO'

FETCH NEXT FROM c1 INTO @NAME
END

CLOSE c1
DEALLOCATE c1


SCRIPT 3:
=========


USE msdb
DECLARE   @v1 VARCHAR(30)
DECLARE   @v2 VARCHAR(30)
SELECT    @v1=max(backup_finish_date) FROM backupset
SELECT    @v2=getdate()

IF (SELECT DATEDIFF(day, @v1, @v2)) in (0, 1)
  BEGIN
    BACKUP LOG sales TO sales_log_dump WITH INIT
  END

select MAX(backup_finish_date) from msdb.dbo.backupset
where database_name like 'SharePoint%'


SCRIPT 4:
=========


-- ---------------------------------------
-- JOB: BACKUP_ALL_DATABASES (Once a week)
--
-- Version : 1.2
-- Date    : 27-12-2004
-- ---------------------------------------


-- Step 1: removal of backup files older than 60 days
-- --------------------------------------------------

DECLARE @fname     VARCHAR(128)
DECLARE @delstring VARCHAR(128)
DECLARE @checkfile VARCHAR(128)
DECLARE @result    INT
DECLARE @ftrue     INT

DECLARE c_1 CURSOR FOR
SELECT f.physical_device_name
FROM   msdb.dbo.backupset s, msdb.dbo.backupmediafamily f
WHERE  s.media_set_id=f.media_set_id
AND    s.backup_start_date >getdate()-90
AND    s.backup_start_date <getdate()-60
AND    f.physical_device_name NOT LIKE 'master%'
AND    f.physical_device_name NOT LIKE 'msdb%'
AND    f.physical_device_name NOT LIKE 'model%'
AND    f.physical_device_name NOT LIKE 'full%'
AND    f.physical_device_name NOT LIKE 'diff%'

OPEN c_1

FETCH NEXT FROM c_1 INTO @fname

WHILE (@@fetch_status<>-1)
BEGIN
  SELECT @delstring='del '+@fname
  SELECT @checkfile='dir '+@fname

  EXEC @ftrue=master.dbo.xp_cmdshell @checkfile 

  IF (@ftrue=0)

     EXEC @RESULT = master.dbo.xp_cmdshell @delstring
     IF (@RESULT <> 0)
        BEGIN
          RAISERROR ('Backup_all_databases: One or more old backupdumps not found.', 16, 1) WITH LOG
        END

FETCH NEXT FROM c_1 INTO @fname
END

CLOSE c_1
DEALLOCATE c_1


-- Step 2: backup databases
-- ------------------------

SET NOCOUNT ON

DECLARE @NAME         VARCHAR(128)
DECLARE @DATUM        DATETIME
DECLARE @BACKUP_DATUM VARCHAR(128)

SELECT @DATUM=GETDATE()
SELECT @BACKUP_DATUM=CONVERT(VARCHAR(10),@DATUM,20)

-- NU DE DATABASENAMEN OPHALEN UIT DE DICTIONARY

DECLARE c1 CURSOR FOR
SELECT   name from master.dbo.sysdatabases where name not like '%AIDA%'
AND name not in ('master','model','tempdb','pubs','northwind','msdb')
AND name not like 'SharePointPortal_Site%'
AND name not like 'ProjectServer%'
AND status < 100

OPEN c1

FETCH NEXT FROM c1 INTO @NAME

WHILE (@@fetch_status<>-1)
BEGIN

EXEC('BACKUP DATABASE '+@NAME+' TO DISK=''d:\backup\'+@NAME+'_'+@BACKUP_DATUM+'.dmp'''+' WITH INIT')

FETCH NEXT FROM c1 INTO @NAME
END

CLOSE c1
DEALLOCATE c1


===========================
23. DBCC COMMANDS:
===========================


1. DBCC SHOWCONTIG
==================

Understanding SQL Server's DBCC SHOWCONTIG  


Probably one of the most significant performance problems found in databases is centered around table data fragmentation. 
One situation that may be analogous to table fragmentation might be an index at the end of a large book. 
A single index entry in such a book might point to several pages scattered throughout the book. You must then scan each page 
for the specific information you require. This differs significantly from the index of the phone book which stores its 
data in sorted order. A typical query for the name "Jones" might span multiple consecutive pages, but are always held 
in a sorted order.

In the case of a database, we start out with the data looking more like a phone book, and end with the data looking 
more like a history book. Therefore, we need to occasionally resort the data in an effort to recreate the phone book 
order. Below, you will see a graphical presentation of how SQL Server lays out the data so that we can discuss the actual 
findings more clearly.

   |====|====|====..     ..|====|    the whole picture is one 64KB extent
   |====|====|====..     ..|====|
   |====|====|====..     ..|====|
   |====|====|====..     ..|====|
   |====|====|====..     ..|====|
    page


A Quick SQL Server Internals Discussion

We are most familiar with the data row. The row size is set only by the definition of the table that holds it 
(e.g. A table of addresses require more data per row then a table of class names). In SQL Server, a table may 
define a row as storing as little as 4 bytes to as much as 8060.This limit is set by the size of the data page, 
which stores up to 8,192 bytes (8 KB). The remaining 132 bytes are used by SQL Server to track other information 
under the covers. Although SQL Server is designed around 8 KB pages, the smallest unit of data that SQL Server can 
allocate is 64 KB. This is called an extent. 

To store the data in a sorted order, as in a phone book, SQL Server uses something called a clustered index. 
When a typical database is created, clustered indexes exist on nearly all tables. However, just because the data exists 
in sorted order within the page does not mean that it exists as such within an extent. The reason for this derives from 
situations in which there is no more room on a given page in which it can insert a row. SQL Server then removes approximately 
half the page and moves it to another page, which is called a Page Split (Page Splits will not occur with clustered indexes 
on IDENTITY based columns, but hotspotting may). In some cases, it may move that data to another extent altogether, 
possibly even allocating a new extent to do so. So, while we start off with names beginning with A and ending with H on 
one page, and names beginning with I and ending with Z on the next page, through usage, we may see that names 
A through C are now located on one page in one extent, D through E on another extent and S through Z back on the fifth
 page of the first extent, etc. It is because of the page split that there are times in which we may prefer to use 
tables with no clustered indexes at all. However, these tables are usually scratch tables which are highly volatile. 
In those situations, we desire the quicker write times at the cost of slower reads.

Calling DBCC SHOWCONTIG

Using Query Analyzer, connect to the database you wish to view. Next, you will need to get the object id of the table(s) 
you wish to examine. I have simplified this task to retrieve the top 10 tables by size using the following script.

SELECT TOP 10 
'DBCC SHOWCONTIG(' + CAST(id AS NVARCHAR(20)) + ')' 
+ CHAR(10) + 
'PRINT '' ''' + CHAR(10) 
FROM 
sysindexes 
WHERE 
indid = 1 or 
indid = 0 
ORDER BY rows DESC

Execute this script in the database that you wish to check, and you will get an output resembling (repeated 10 times, 
once for each of the 10 largest tables):

DBCC SHOWCONTIG(123456789)
PRINT ''

Copy and paste the complete resultset into your query window and execute it.


The Results Explained

The results from the previous command will look something like the following:

DBCC SHOWCONTIG scanning 'MyTable1' table...
Table: 'MyTable1' (1556968673); index ID: 1, database ID: 16
TABLE level scan performed.
- Pages Scanned................................: 18986
- Extents Scanned..............................: 2443
- Extent Switches..............................: 9238
- Avg. Pages per Extent........................: 7.8
- Scan Density [Best Count:Actual Count].......: 25.70% [2374:9239]
- Logical Scan Fragmentation ..................: 44.58%
- Extent Scan Fragmentation ...................: 87.07%
- Avg. Bytes Free per Page.....................: 1658.7
- Avg. Page Density (full).....................: 79.51%
DBCC execution completed. If DBCC printed error messages, 
contact your system administrator.

DBCC SHOWCONTIG scanning 'MyTable2' table...
Table: 'MyTable2' (183984032); index ID: 1, database ID: 16
TABLE level scan performed.
- Pages Scanned................................: 28980
- Extents Scanned..............................: 3687
- Extent Switches..............................: 22565
- Avg. Pages per Extent........................: 7.9
- Scan Density [Best Count:Actual Count].......: 16.06% [3623:22566]
- Logical Scan Fragmentation ..................: 83.05%
- Extent Scan Fragmentation ...................: 87.44%
- Avg. Bytes Free per Page.....................: 3151.1
- Avg. Page Density (full).....................: 61.07%
DBCC execution completed. If DBCC printed error messages,
contact your system administrator.

In the first table, MyTable1, we see that there were 18,986 pages examined to create the report. Those pages 
existed within 2,443 extents, indicating that the table consumed approximately 97% (7.8 pages per extent on average) 
of the extents allocated for it. We then see that while examining the pages for fragmentation, the server had to 
switch extent locations 9, 238 times. The Scan Density restates this by indicating the percentage of all pages within 
all extents were contiguous. In an ideal environment, the density displayed would be close to 100. The Logical 
Scan Fragmentation and Extent Scan Fragmentation are indications of how well the indexes are stored within the system 
when a clustered index is present (and should be ignored for tables that do not have a clustered index). 
In both cases, a number close to 0 is preferable. There is another anomaly being displayed here that is a little 
difficult to explain, but it is that SQL Server allows multiple tables to exist within a single extent, which further 
explains the 7.8 pages per extent (multiple tables may not however exist within a page).

The next items discuss a somewhat more mundane but important issue of page utilization. Again using the first table as 
the example, there are an average of 1659 bytes free per page, or that each page is 79.51% utilized. The closer 
that number gets to 100, the faster the database is able to read in records, since more records exist on a single page. 
However, this must be balanced with the cost of writing to the table. Since a page split will occur if a write is 
required on a page that is full, the overhead can be tremendous. This is exaggerated when using RAID 5 disk subsystems, 
since RAID 5 has a considerably slower write time compared to its read time. To account for this, we have the ability 
of telling SQL Server to leave each page a certain percentage full. 

DBCC REINDEX is a related tool that will reorganize your database information in much the same way Norton Defrag 
will work on your hard drive (see Books Online for information on how to use DBCC REINDEX). The following report 
displays the differences in the data after we defragmented the data using DBCC DBREINDEX. 

DBCC SHOWCONTIG scanning 'MyTable1' table...
Table: 'MyTable1' (1556968673); index ID: 1, database ID: 16
TABLE level scan performed.
- Pages Scanned................................: 15492
- Extents Scanned..............................: 1945
- Extent Switches..............................: 2363
- Avg. Pages per Extent........................: 8.0
- Scan Density [Best Count:Actual Count].......: 81.94% [1937:2364]
- Logical Scan Fragmentation ..................: 15.43%
- Extent Scan Fragmentation ...................: 20.15%
- Avg. Bytes Free per Page.....................: 159.8
- Avg. Page Density (full).....................: 98.03%
DBCC execution completed. If DBCC printed error messages, 
contact your system administrator.
 
DBCC SHOWCONTIG scanning 'MyTable2' table...
Table: 'MyTable2' (183984032); index ID: 1, database ID: 16
TABLE level scan performed.
- Pages Scanned................................: 35270
- Extents Scanned..............................: 4415
- Extent Switches..............................: 4437
- Avg. Pages per Extent........................: 8.0
- Scan Density [Best Count:Actual Count].......: 99.35% [4409:4438]
- Logical Scan Fragmentation ..................: 0.11%
- Extent Scan Fragmentation ...................: 0.66%
- Avg. Bytes Free per Page.....................: 3940.1
- Avg. Page Density (full).....................: 51.32%
DBCC execution completed. If DBCC printed error messages, 
contact your system administrator.

Here, we can see several key improvements and some examples of how proper indexing can be very important. The most glaring 
items for us are how well we were able to increase the scan density. Again, using the MyTable1 table as a reference, 
we can see that out of 1,945 extents, there were only 2363 extent switches. Notice that the number of extent switches 
is now a lower number than the original number of extents. This is due to the more efficient allocation of the data.
And, since there is a significant reduction of the number of extent switches, searches for large quantities of contiguous 
data will be fulfilled much more quickly.

These reports were taken after only a small amount of processing had occurred on this system, yet already we can see 
that there has been a fair amount of fragmentation of the data. The table MyTable1 has already begun to show signs of 
performance degradation. When there is an unusually large amount of new data being inserted into the tables, these numbers 
will quickly begin to resemble the those that we see in the previous report.

In the table MyTable2, we see a stark difference from MyTable1.This is because of some index tuning that I had done on 
that table. As I said earlier, SQL Server uses the clustered indexes in order to understand how data should be ordered. 
To prevent page splits, I had SQL Server leave each page only 50% full. This allows for multiple inserts to occur without 
generating page splits, allowing our scan density to remain high for a longer period of time. But this also comes at 
the cost of reducing the quantity of contiguous records on each page and doubles the amount of space consumed by the table, 
hence the now much larger number of pages and extents scanned. 

Conclusion

From examining the output of DBCC SHOWCONTIG, we were able to locate several key issues. First, we saw that our 
database was heavily fragmented, and required defragmentation using DBCC DBREINDEX. Next, we were able to tell 
what percentage of the allocated pages were actually being used by SQL Server. Finally, we saw that by modifying the 
fillfactor on an index, we had a tremendous affect on page splitting at the cost of more page I/O for each read.


The following SQL Server DBCC commands, some documented and some not documented, can come in handy when you are trying 
to optimize your SQL Servers. 
 

DBCC CACHESTATS: Displays information about the object currently in the buffer cache, such as hit rates, 
compiled objects and plans, etc. 
Note in the sample results below that each of these SQL Server objects can be cached in the buffer cache of SQL Server.

Example:

DBCC CACHESTATS

Sample Results (abbreviated):

Object Name       Hit Ratio
------------      -------------

Proc              0.86420054765378507
Prepared          0.99988494930394334
Adhoc             0.93237136647793051
ReplProc          0.0
Trigger           0.99843452831887947
Cursor            0.42319205924058612
Exec Cxt          0.65279111666076906
View              0.95740334726893905
Default           0.60895011346896522
UsrTab            0.94985969576133511
SysTab            0.0
Check             0.67021276595744683
Rule              0.0
Summary           0.80056155581812771
 

Here's what some of the key statistics from this command mean: 

Hit Ratio: Displays the percentage of time that this particular object was found in SQL Server's cache. 
The bigger this number, the better.
Object Count: Displays the total number of objects of the specified type that are cached.
Avg. Cost: A value used by SQL Server that measures how long it takes to compile a plan, along with the amount 
of memory needed by the plan. This value is used by SQL Server to determine if the plan should be cached or not.  
Avg. Pages: Measures the total number of 8K pages used, on average, for cached objects.

  
LW Object Count, LW Avg Cost, WL Avg Stay, LW Ave Use: All these columns indicate how many of the specified objects 
have been removed from the cache by the Lazy Writer. The lower the figure, the better. 

* * * * *

DBCC DROPCLEANBUFFERS: Use this command to remove all the test data from SQL Server's data cache (buffer) between 
tests to ensure fair testing. Keep in mind that this command only removes clean buffers, not dirty buffers. 
Because of this, before running the DBCC DROPCLEANBUFFERS command, you may first want to run the CHECKPOINT command first. 
Running CHECKPOINT will write all dirty buffers to disk. And then when you run DBCC DROPCLEANBUFFERS, you can be 
assured that all data buffers are cleaned out, not just the clean ones.

Example:

DBCC DROPCLEANBUFFERS


* * * * *

DBCC ERRORLOG: If you rarely restart the mssqlserver service, you may find that your server log gets very large 
and takes a long time to load and view. You can truncate (essentially create a new log) the Current Server log by 
running DBCC ERRORLOG. You might want to consider scheduling a regular job that runs this command once a week to 
automatically truncate the server log. As a rule, I do this for all of my SQL Servers. Also, you can accomplish 
the same thing using this stored procedure: sp_cycle_errorlog.

Example:

DBCC ERRORLOG

* * * * *

DBCC FLUSHPROCINDB: Used to clear out the stored procedure cache for a specific database on a SQL Server, 
not the entire SQL Server. The database ID number to be affected must be entered as part of the command.

You may want to use this command before testing to ensure that previous stored procedure plans won't negatively 
affect testing results.

Example:

DECLARE @intDBID INTEGER SET @intDBID = (SELECT dbid FROM master.dbo.sysdatabases WHERE name = 'database_name')
DBCC FLUSHPROCINDB (@intDBID)


* * * * *

DBCC INDEXDEFRAG: In SQL Server 2000, Microsoft introduced DBCC INDEXDEFRAG to help reduce logical disk fragmentation.
When this command runs, it reduces fragmentation and does not lock the table, allowing other users to access the table 
when the defragmentation process is running. Unfortunately, this command doesn't do a great job of logical defragmentation.

The only way to truly reduce logical fragmentation is to rebuild your table's indexes. While this will reduce all 
fragmentation, unfortunately it will lock the table, preventing users from accessing it during this process. 
This means that you will need to find a time when this will not present a problem to your users.

Of course, if you are unable to find a time to reindex your indexes, then running DBCC INDEXDEFRAG is better than doing 
nothing.

Example:

DBCC INDEXDEFRAG (Database_Name, Table_Name, Index_Name)

* * * * *

DBCC FREEPROCCACHE: Used to clear out the stored procedure cache for all SQL Server databases. You may want to 
use this command before testing to ensure that previous stored procedure plans won't negatively affect testing results.

Example:

DBCC FREEPROCCACHE


* * * * *

DBCC MEMORYSTATUS: Lists a breakdown of how the SQL Server buffer cache is divided up, including buffer activity. 
Undocumented command, and one that may be dropped in future versions of SQL Server.

Example:

DBCC MEMORYSTATUS


* * * * *

DBCC OPENTRAN: An open transaction can leave locks open, preventing others from accessing the data they need in a database. 
This command is used to identify the oldest open transaction in a specific database.

Example:

DBCC OPENTRAN('database_name')


* * * * *

DBCC PAGE: Use this command to look at contents of a data page stored in SQL Server.

Example:

DBCC PAGE ({dbid|dbname}, pagenum [,print option] [,cache] [,logical])

where:

Dbid or dbname: Enter either the dbid or the name of the database in question.

Pagenum: Enter the page number of the SQL Server page that is to be examined.

Print option: (Optional) Print option can be either 0, 1, or 2. 0 - (Default) This option causes 
DBCC PAGE to print out only the page header information. 1 - This option causes DBCC PAGE to print out the page 
header information, each row of information from the page, and the page's offset table. Each of the rows printed out 
will be separated from each other. 2 - This option is the same as option 1, except it prints the page rows as a 
single block of information rather than separating the individual rows. The offset and header will also be displayed.

Cache: (Optional) This parameter allows either a 1 or a 0 to be entered. 0 - This option causes DBCC PAGE to 
retrieve the page number from disk rather than checking to see if it is in cache. 1 - (Default) This option takes 
the page from cache if it is in cache rather than getting it from disk only.

Logical: (Optional) This parameter is for use if the page number that is to be retrieved is a virtual page rather 
then a logical page. It can be either 0 or 1. 0 - If the page is to be a virtual page number. 1 - (Default) If the page is the logical page number.


* * * * *

DBCC PINTABLE & DBCC UNPINTABLE: By default, SQL Server automatically brings into its data cache the pages it needs 
to work with. These data pages will stay in the data cache until there is no room for them, and assuming they are not 
needed, these pages will be flushed out of the data cache onto disk. At some point in the future when SQL Server needs 
these data pages again, it will have to go to disk in order to read them again into the data cache for use. If SQL Server 
somehow had the ability to keep the data pages in the data cache all the time, then SQL Server's performance would be 
increased because I/O could be significantly reduced on the server.

The process of "pinning a table" is a way to tell SQL Server that we don't want it to flush out data pages for specific 
named tables once they are read in in the first place. This in effect keeps these database pages in the data cache all 
the time, which eliminates the process of SQL Server from having to read the data pages, flush them out, and reread them 
again when the time arrives. As you can imagine, this can reduce I/O for these pinned tables, boosting SQL Server's 
performance.

To pin a table, the command DBCC PINTABLE is used. For example, the script below can be run to pin a 
table in SQL Server:

DECLARE @db_id int, @tbl_id int
USE Northwind
SET @db_id = DB_ID('Northwind')
SET @tbl_id = OBJECT_ID('Northwind..categories')
DBCC PINTABLE (@db_id, @tbl_id)

While you can use the DBCC PINTABLE directly, without the rest of the above script, you will find the script 
handy because the DBCC PINTABLE's parameters refer to the database and table ID that you want to pin, not by their 
database and table name. This script makes it a little easier to pin a table. You must run this command for every 
table you want to pin.

Once a table is pinned in the data cache, this does not mean that the entire table is automatically loaded into the 
data cache. It only means that as data pages from that table are needed by SQL Server, they are loaded into the data cache, 
and then stay there, not ever being flushed out to disk until you give the command to unpin the table using the 
DBCC UNPINTABLE. It is possible that part of a table, and not all of it, will be all that is pinned.

When you are done with a table and you no longer want it pinned, you will want to unpin your table. To do so, run this 
example code:

DECLARE @db_id int, @tbl_id int
USE Northwind
SET @db_id = DB_ID('Northwind')
SET @tbl_id = OBJECT_ID('Northwind..categories')
DBCC UNPINTABLE (@db_id, @tbl_id)


* * * * *

DBCC PROCCACHE: Displays information about how the stored procedure cache is being used.

Example:

DBCC PROCCACHE


* * * * *

DBCC REINDEX: Periodically (weekly or monthly) perform a database reorganization on all the indexes on all the tables 
in your database. This will rebuild the indexes so that the data is no longer fragmented. Fragmented data can cause 
SQL Server to perform unnecessary data reads, slowing down SQL Server's performance.

If you do a reorganization on a table with a clustered index, any non-clustered indexes on that same table will 
automatically be rebuilt.

Database reorganizations can be done scheduling SQLMAINT.EXE to run using the SQL Server Agent, or if by running
 your own custom script via the SQL Server Agent (see below).

Unfortunately, the DBCC DBREINDEX command will not automatically rebuild all of the indexes on all the tables in a 
database, it can only work on one table at a time. But if you run the following script, you can index all the tables 
in a database with ease.

Example:

DBCC DBREINDEX('table_name', fillfactor)

or

--Script to automatically reindex all tables in a database

USE DatabaseName --Enter the name of the database you want to reindex

DECLARE @TableName varchar(255)

DECLARE TableCursor CURSOR FOR
SELECT table_name FROM information_schema.tables
WHERE table_type = 'base table'

OPEN TableCursor

FETCH NEXT FROM TableCursor INTO @TableName
WHILE @@FETCH_STATUS = 0
BEGIN 
PRINT "Reindexing " + @TableName
DBCC DBREINDEX(@TableName,' ',90)
FETCH NEXT FROM TableCursor INTO @TableName
END

CLOSE TableCursor

DEALLOCATE TableCursor

The script will automatically reindex every index in every table of any database you select, and provide a 
fillfactor of 90%. You can substitute any number you want for the 90 in the above script. 

When DBCC DBREINDEX is used to rebuild indexes, keep in mind that as the indexes on a table are being rebuilt, 
that the table becomes unavailable for use by your users. For example, when a non-clustered index is rebuilt, a 
shared table lock is put on the table, preventing all but SELECT operations to be performed on it. When a clustered 
index is rebuilt, an exclusive table lock is put on the table, preventing any table access by your users. 
Because of this, you should only run this command when users don't need access to the tables being reorganized. 

* * * * *

DBCC SHOWCONTIG: Used to show how fragmented data and indexes are in a specified table. If data pages storing data 
or index information becomes fragmented, it takes more disk I/O to find and move the data to the SQL Server cache buffer, 
hurting performance. This command tells you how fragmented these data pages are. If you find that fragmentation is 
a problem, you can reindex the tables to eliminate the fragmentation. Note, this fragmentation is fragmentation of 
data pages within the SQL Server MDB file, not of the physical file itself.

Since this command requires you to know the ID of both the table and index being analyzed, you may want to run the 
following script so you don't have to manually look up the table name ID number and the index ID number.

Example:

DBCC SHOWCONTIG (Table_id, IndexID)

or

--Script to identify table fragmentation

--Declare variables
DECLARE
@ID int,
@IndexID int,
@IndexName varchar(128)

--Set the table and index to be examined
SELECT @IndexName = 'index_name'           --enter name of index
SET @ID = OBJECT_ID('table_name')          --enter name of table

--Get the Index Values
SELECT @IndexID = IndID
FROM sysindexes
WHERE id = @ID AND name = @IndexName

--Display the fragmentation
DBCC SHOWCONTIG (@id, @IndexID)

While the DBCC SHOWCONTIG command provides several measurements, the key one is Scan Density. This figure should be as 
close to 100% as possible. If the scan density is less than 75%, then you may want to reindex the tables in your database. 

* * * * *

DBCC SHOW_STATISTICS: Used to find out the selectivity of an index. Generally speaking, the higher higher the 
selectivity of an index, the greater the likelihood it will be used by the query optimizer. You have to specify 
both the table name and the index name you want to find the statistics on.

Example:

DBCC SHOW_STATISTICS (table_name, index_name)


* * * * *

DBCC SQLMGRSTATS: Used to produce three different values that can sometimes be useful when you want to find out how well 
caching is being performed on ad-hoc and prepared Transact-SQL statements.

Example:

DBCC SQLMGRSTATS

Sample Results:

Item                      Status 
------------------------- ----------- 
Memory Used (8k Pages)    5446
Number CSql Objects       29098
Number False Hits         425490

Here's what the above means:

Memory Used (8k Pages): If the amount of memory pages is very large, this may be an indication that some user connection 
is preparing many Transact-SQL statements, but it not un-preparing them.

Number CSql Objects: Measures the total number of cached Transact-SQL statements.

Number False Hits: Sometimes, false hits occur when SQL Server goes to match pre-existing cached Transact-SQL statements. 
Ideally, this figure should be as low as possible.


* * * * *

DBCC SQLPERF(): This command includes both documented and undocumented options. Let's take a look at all of them and 
see what they do.

DBCC SQLPERF (LOGSPACE)

This option (documented) returns data about the transaction log for all of the databases on the SQL Server, 
including Database Name, Log Size (MB), Log Space Used (%), and Status.

DBCC SQLPERF (UMSSTATS)

This option (undocumented) returns data about SQL Server thread management.

DBCC SQLPERF (WAITSTATS)

This option (undocumented) returns data about wait types for SQL Server resources.

DBCC SQLPERF (IOSTATS)

This option (undocumented) returns data about outstanding SQL Server reads and writes.

DBCC SQLPERF (RASTATS)

This option (undocumented) returns data about SQL Server read-ahead activity.

DBCC SQLPERF (THREADS)

This option (undocumented) returns data about I/O, CPU, and memory usage per SQL Server thread. 

* * * * *

DBCC SQLPERF (UMSSTATS): When you run this command, you get output like this. 
(Note, this example was run on a 4 CPU server. There is 1 Scheduler ID per available CPU.)

Statistic                        Value 
-------------------------------- ------------------------ 
Scheduler ID                     0.0
num users                        18.0
num runnable                     0.0
num workers                      13.0
idle workers                     11.0
work queued                      0.0
cntxt switches                   2.2994396E+7
cntxt switches(idle)             1.7793976E+7
Scheduler ID                     1.0
num users                        15.0
num runnable                     0.0
num workers                      13.0
idle workers                     10.0
work queued                      0.0
cntxt switches                   2.4836728E+7
cntxt switches(idle)             1.6275707E+7
Scheduler ID                     2.0
num users                        17.0
num runnable                     0.0
num workers                      12.0
idle workers                     11.0
work queued                      0.0
cntxt switches                   1.1331447E+7
cntxt switches(idle)             1.6273097E+7
Scheduler ID                     3.0
num users                        16.0
num runnable                     0.0
num workers                      12.0
idle workers                     11.0
work queued                      0.0
cntxt switches                   1.1110251E+7
cntxt switches(idle)             1.624729E+7
Scheduler Switches               0.0
Total Work                       3.1632352E+7

Below is an explanation of some of the key statistics above:

num users: This is the number of SQL Server threads currently in the scheduler.
num runnable: This is the number of actual SQL Server threads that are runnable
num workers: This is the actual number of worker there are to process threads. This is the size of the thread pool.
idle workers: The number of workers that are currently idle.
cntxt switches: The number of context switches between runnable threads.
cntxt switches (idle): The number of context switches to the idle thread.


* * * * *

DBCC TRACEON & DBCC TRACEOFF: Used to turn on and off trace flags. Trace flags are often used to temporarily 
turn on and off specific server behavior or server characteristics. In rare occasions, they can be useful to 
troubleshooting SQL Server performance problems.

Example:

To use the DBCC TRACEON command to turn on a specified trace flag, use this syntax:

DBCC TRACEON (trace# [,...n])

To use the DBCC TRACEON command to turn off a specified trace flag, use this syntax:

DBCC TRACEOFF (trace# [,...n])

You can also use the DBCC TRACESTATUS command to find out which trace flags are currently turned on in your server 
using this syntax:

DBCC TRACESTATUS (trace# [,...n])

For specific information on the different kinds of trace flags available, search this website or look them up 
in Books Online. [6.5, 7.0, 2000] More information on SQL Server 7.0 trace flags, and more 
info on SQL Server 2000 trace flags.

* * * * *

DBCC UPDATEUSAGE: The official use for this command is to report and correct inaccuracies in the sysindexes table, 
which may result in incorrect space usage reports. Apparently, it can also fix the problem of unreclaimed data pages in SQL Server. You may want to consider running this command periodically to clean up potential problems. This command can take some time to run, and you want to run it during off times because it will negatively affect SQL Server's performance when running. When you run this command, you must specify the name of the database that you want affected.

Example:

DBCC UPDATEUSAGE ('databasename')


=========================== 
24. DOCUMENTS AND GRAPHICS:
===========================


Usually, text, ntext, or image strings are large (a maximum of 2GB) character or binary strings stored outside 
a data row. The data row contains only a 16-byte text pointer that points to the root node of a tree built 
of internal pointers that map the pages in which the string fragments are stored.


24.1 Pointer 1:
===============

Insert Image 
------------ 

Using TextCopy utility , you can import image data into SQLServer. 

Syntax as follow : 
---------------------------------------------------------------------------- 
TEXTCOPY Version 1.0 
DB-Library version 8.00.100 

Copies a single text or image value into or out of SQL Server. The value 
is a specified text or image 'column' of a single row (specified by the 
"where clause") of the specified 'table'. 

If the direction is IN (/I) then the data from the specified 'file' is 
copied into SQL Server, replacing the existing text or image value. If the 
direction is OUT (/O) then the text or image value is copied from 
SQL Server into the specified 'file', replacing any existing file. 

TEXTCOPY [/S [sqlserver]] [/U [login]] [/P [password]] 
[/D [database]] [/T table] [/C column] [/W"where clause"] 
[/F file] [{/I | /O}] [/K chunksize] [/Z] [/?] 

/S sqlserver The SQL Server to connect to. If 'sqlserver' is not 
specified, the local SQL Server is used. 
/U login The login to connect with. If 'login' is not specified, 
a trusted connection will be used. 
/P password The password for 'login'. If 'password' is not 
specified, a NULL password will be used. 
/D database The database that contains the table with the text or 
image data. If 'database' is not specified, the default 
database of 'login' is used. 
/T table The table that contains the text or image value. 
/C column The text or image column of 'table'. 
/W "where clause" A complete where clause (including the WHERE keyword) 
that specifies a single row of 'table'. 
/F file The file name. 
/I Copy text or image value into SQL Server from 'file'. 
/O Copy text or image value out of SQL Server into 'file'. 
/K chunksize Size of the data transfer buffer in bytes. Minimum 
value is 1024 bytes, default value is 4096 bytes. 
/Z Display debug information while running. 
/? Display this usage information and exit. 

------------------------------------------------------------------------------------ 

-- create the table with image column.... insert the some value... 
then import image into the table using textcopy... 
then only you get the text or image pointer to update the image value..... 

Example : 

-- import table stru 

create table EmployeeImage 
( EmployeeID int, 
Pic image default "image") 

insert EmployeeImage(EmployeeID) values (1) 
insert EmployeeImage(EmployeeID) values (2) 

You should also have some value in the Pic column. e.g. 0x0 is OK, otherwise
textcopy gives an error.

UPDATE EmployeeImage SET Pic=0x0 where employeeid=2

-- Fill the UserName,Password,ServerName and the Image file with full path like c:\employee1.bmp... 

textcopy -I -U -P -S -D Pubs -TEmployeeImage -C pic -W"where EmployeeID = 2" -F c:\Employee1.bmp 
textcopy -I -U -P -S -D Pubs -TEmployeeImage -C pic -W"where EmployeeID = 2" -F c:\Employee2.bmp 

-- Get Image Data from SQL Server and Displayed through Visual Basic 


The following example, uses Visual Basic 6.0 and ADO 2.5, to retreive Employee image from the table EmployeeImage. 


Assume that, Employee Image info is available for EmployeeID as 1.[using TextCopy] 

EmployeeID, Pic 
--------- ------------------------------- 
1         ox1234.......................... 


create proc EmployeeSel 
@EmployeeID int 
as 
select Pic from EmployeeImage where EmployeeID = @EmployeeID 


Sample VB code 
-------------- 

In this code , you have to fill the values of Data Source,Initial Catalog,UserID,Password in the sServerDBConn string. 
Open the standard EXE project, add a Form and a Image control and name it as ImgEmployee. Use the code and you 
will the Picture. 

You cannot directly store the image column. Open a Binary file, write the image info and then load the image. 
you can retrieve the info through direct select/view/Stored Procedure. 
Here the Stored Procedure "EmployeeSel", retrive the Image of Employee [whose ID is 1]. 

-------------------------------------------------------------------------------------------- 
Dim objConn As New ADODB.Connection 
Dim objcmd As New ADODB.Command 
Dim objRec As ADODB.Recordset 
Dim bytImage() As Byte 
Dim sServerDBConn As String 


sServerDBConn = "Provider=SQLOLEDB; Data Source =SQLServerName; Initial Catalog = DatabaseName; User ID=UserName;Password= " 
objConn.Open sServerDBConn 

With objcmd 
.ActiveConnection = objConn 
.CommandText = "EmployeeSel" 
.CommandType = adCmdStoredProc 
.Parameters.Append .CreateParameter("EmployeeID", adInteger, adParamInput, 0, 1) 
Set objRec = objcmd.Execute 
End With 

bytImage = objRec("Pic") 
Open "c:\temp\EmpImage.bmp" For Binary As #1 
Put #1, , bytImage() 
Close #1 
imgEmployee.Picture = LoadPicture("c:\temp\EmpImage.bmp") 
------------------------------------------------------------------------------------------------- 

It is a Simple one. For more info , refer SQL Server BOL,Visual Basic Documentation and ActiveX Data Objects Documentation. 
[GetChunk/AppendChunk methods]. 

For SQLServer 2000, more options available. Refer BOL. 


24.2 Pointer 2:
===============


'***************************************************************
'* Save/Retrieve Image Data From SQL Server DataBase Using
'* ADOStream Objects.
'*************************************************************** 
'* Code By: Michael P. Gerety
'***************************************************************

'Make sure you have a reference to ADODB v. 2.5 or later

Dim rstRecordset As ADODB.Recordset
Dim cnnConnection As ADODB.Connection
Dim strStream As ADODB.Stream


'*Setup:
'*Create a form and place 3 command buttons named:
'*cmdLoad, cmdSelectSave, and cmdClear
'*Place a CommonDialog Control Named Dialog
'*Place an ImageBox (or PictureBox) named Image1


'** The field type in Sql Server must be "Image"
'** Everywhere you see "***" in the code is where you must enter 
'** your own data.

Private Sub cmdClear_Click()
    Image1.Picture = Nothing
    
End Sub

Private Sub cmdLoad_Click()
    If Not LoadPictureFromDB(rstRecordset) Then
        MsgBox "Invalid Data Or No Picture In DB"
    End If
End Sub

Private Sub cmdSelectSave_Click()
    'Open Dialog Box
    With dlgDialog
        .DialogTitle = "Open Image File...."
        .Filter = "Image Files (*.gif; *.bmp)| *.gif;*.bmp"
        .CancelError = True
procReOpen:
         .ShowOpen
         
        If .FileName = "" Then
            MsgBox "Invalid filename or file not found.", _
                vbOKOnly + vbExclamation, "Oops!"
            GoTo procReOpen
        Else
            If Not SavePictureToDB(rstRecordset, .FileName) Then
                MsgBox "Save was unsuccessful :(", vbOKOnly + _
                        vbExclamation, "Oops!"
                Exit Sub
            End If
        End If
            
    End With
End Sub

Private Sub Form_Load()
    Set cnnConnection = New ADODB.Connection
    Set rstRecordset = New ADODB.Recordset

    cnnConnection.Open ("Provider=SQLOLEDB; " & _
        "data Source=**YourServer**;" & _
        "Initial Catalog=**YourDatabase**; " & _
        "User Id=**YourUID**;Password=***YourPass***")
    rstRecordset.Open "Select * from YourTable", cnnConnection, _
         adOpenKeyset, adLockOptimistic
    

End Sub


Public Function LoadPictureFromDB(RS As ADODB.Recordset)

    On Error GoTo procNoPicture
    
    'If Recordset is Empty, Then Exit
    If RS Is Nothing Then
        GoTo procNoPicture
    End If
    
    Set strStream = New ADODB.Stream
    strStream.Type = adTypeBinary
    strStream.Open
    
    strStream.Write RS.Fields("**YourImageField**").Value

    
    strStream.SaveToFile "C:\Temp.bmp", adSaveCreateOverWrite
    Image1.Picture = LoadPicture("C:\Temp.bmp")
    Kill ("C:\Temp.bmp")
    LoadPictureFromDB = True

procExitFunction:
    Exit Function
procNoPicture:
    LoadPictureFromDB = False
    GoTo procExitFunction
End Function

Public Function SavePictureToDB(RS As ADODB.Recordset, _
    sFileName As String)

    On Error GoTo procNoPicture
    Dim oPict As StdPicture
    
    Set oPict = LoadPicture(sFileName)
    
    'Exit Function if this is NOT a picture file
    If oPict Is Nothing Then
        MsgBox "Invalid Picture File!", vbOKOnly, "Oops!"
        SavePictureToDB = False
        GoTo procExitSub
    End If
    
    RS.AddNew
    

    Set strStream = New ADODB.Stream
    strStream.Type = adTypeBinary
    strStream.Open
    strStream.LoadFromFile sFileName
    RS.Fields("***YourImageField***").Value = strStream.Read
    
    Image1.Picture = LoadPicture(sFileName)
    SavePictureToDB = True
    
    
procExitSub:
    Exit Function
procNoPicture:
    SavePictureToDB = False
    GoTo procExitSub
End Function


24.3 Pointer 3:
===============

SET QUOTED_IDENTIFIER ON 
GO
SET ANSI_NULLS ON 
GO

CREATE PROCEDURE audit_dealer
AS

DECLARE @ptrfromA varbinary(16), 
@ptrtoA varbinary(16),
@ptrfromB varbinary(16),
@ptrtoB varbinary(16),
@dealer_zip_id int

INSERT INTO dealer_zip(create_date) VALUES(getdate())

SELECT @dealer_zip_id = MAX(dealer_zip_id)
FROM dealer_zip

INSERT INTO dealer_audit(dealer_zip_id, dealer_univ_nbr, boxa_upload_user, 
boxa_upload_date, boxa_approve_user,
boxa_approve_date, boxb_upload_user, boxb_upload_date, 
boxb_approve_user, boxb_approve_date, boxb_text_id, boxb_image_id)
SELECT @dealer_zip_id, dealer_imageset.dealer_univ_nbr, boxa_upload_user,
boxa_upload_date, boxa_approve_user,
boxa_approve_date, boxa_upload_user, boxb_upload_date,
boxb_approve_user, boxb_approve_date, boxb_text_id, boxb_image_id
FROM dealer_imageset, dealer
WHERE dealer_imageset.dealer_univ_nbr = dealer.dealer_univ_nbr
AND boxa_approved_flag = 'y'
AND (boxb_approved_flag = 'y' or boxb_permission_flag <> 'y')
AND dealer_imageset.dealer_univ_nbr in (
SELECT DISTINCT dealer_univ_nbr
FROM dealer_info
WHERE a_graphic IS NOT NULL
OR b_graphic IS NOT NULL)

DECLARE audit_cursor CURSOR FOR
SELECT TEXTPTR(di.boxa_image) from_boxa, TEXTPTR(di.boxb_image) from_boxb,
TEXTPTR(a.boxa_image) to_boxa, TEXTPTR(a.boxb_image) to_boxb
FROM dealer_imageset di, dealer_audit a, dealer d
WHERE a.dealer_zip_id = @dealer_zip_id
AND d.dealer_univ_nbr = a.dealer_univ_nbr
AND di.dealer_univ_nbr = a.dealer_univ_nbr
AND (di.boxb_text = NULL OR di.boxb_text = '')
AND di.boxa_approved_flag = 'y'
AND (di.boxb_approved_flag = 'y' or d.boxb_permission_flag <> 'y')

OPEN audit_cursor

FETCH NEXT FROM audit_cursor
INTO @ptrfromA, @ptrfromB, @ptrtoA, @ptrtoB

WHILE @@FETCH_STATUS = 0
BEGIN
IF @ptrfromA is not null 
BEGIN
UPDATETEXT dealer_audit.boxa_image @ptrtoA 0 null dealer_imageset.boxa_image @ptrfromA
END

IF @ptrfromB is not null 
BEGIN
UPDATETEXT dealer_audit.boxa_image @ptrtoB 0 null dealer_imageset.boxa_image @ptrfromB
END

FETCH NEXT FROM audit_cursor
INTO @ptrfromA, @ptrfromB, @ptrtoA, @ptrtoB
END

CLOSE audit_cursor
DEALLOCATE audit_cursor

-- finally, delete all blobs that have been in the audit table for more than two months
UPDATE dealer_audit
SET boxa_image = null, boxb_image = null
WHERE ISNULL(boxa_approve_date, getdate()) < getdate() - 60
AND ISNULL(boxb_approve_date, getdate()) < getdate() - 60

GO
SET QUOTED_IDENTIFIER OFF 
GO
SET ANSI_NULLS ON 
GO


24.4 Pointer 4:
===============

Formatted text strings, such as Microsoft� Word� document files or HTML files, cannot be stored in character string or 
Unicode columns because many of the bytes in these files contain data structures that do not form valid characters. 
Database applications may still have a need to access this data and apply full-text searches to it. Many sites store 
this type of data in image columns, because image columns do not require that each byte form a valid character. 
SQL Server 2000 introduces the ability to perform full-text searches against these types of data stored in image columns. 
SQL Server 2000 supplies filters that allow it to extract the textual data from Microsoft Office� files 
(.doc, .xls, and .ppt files), text files (.txt files), and HTML files (.htm files). When you design the table, 
in addition to the image column that holds the data, you include a binding column to hold the file extension for the 
format of data stored in the image column. You can create a full-text index that references both the image column and 
the binding column to enable full-text searches on the textual information stored in the image column. 
The SQL Server 2000 full-text search engine uses the file extension information from the binding column to select 
the proper filter to extract the textual data from the column.


24.5 Pointer 5:
===============

Const ForReading = 1, ForWriting = 2, ForAppending = 8

Dim buf, rs, cn, ConnectionString, Sql, JpegFileName
Dim fso, f


JpegFileName = "C:\FullPath\FileName.JPG"

ConnectionString = "Provider=SQLOLEDB;Server=(local);" & _ 
       "Database=MyPictures;Trusted_Connection=Yes;"

set cn = Server.CreateObject("ADODB.Connection")
set rs = Server.CreateObject("ADODB.Recordset")

cn.Open ConnectionString
Set fso = CreateObject("Scripting.FileSystemObject")
Set f = fso.OpenTextFile(JpegFileName, ForReading, False)
buf = f.ReadAll
f.Close

Sql = "SELECT ImageData FROM Pix"

rs.open Sql, cn, adOpenKeyset, adLockOptimistic
rs.AddNew 
rs(0).AppendChunk buf
rs.Update
rs.Close

Set rs.ActiveConnection = Nothing
Set rs = Nothing
cn.Close
Set cn = Nothing


24.6 Pointer 6:
===============

Maximum bytes per row is 8060. Binary data like images has a 16 byte pointer on the page & the data is stored on 
separate pages [small amounts of binary data <8K can be stored on the page in SQL 2K] 


To store/retrieve this sort of data within TSQL scripts you have to use the WRITETEXT and READTEXT commands rather 
than standard INSERT/SELECT statements. These are documented, with examples, in the books-online but are basically 
a real pain to use. 

There are more manageable commands available from within the relevant programming languages - e.g. RDO and ADO 
from VB/C can use GetChunk and AppendChunk commands - but you still have to manage the image/text chunks/blocks 
of data at a time. About the only upside of storing this sort of data within SQL Server is that it can be kept 
transactionally consistent with the other data. For sample code see Q194975 - "Sample Functions Demonstrating 
GetChunk and AppendChunk".


Private Sub Upload()


Dim rs As ADODB.Recordset
Dim stm As ADODB.Stream
Dim SQL As String

'instantiating the objects
Set rs = New ADODB.Recordset
Set stm = New ADODB.Stream

'getting the image from the file
With stm
stm.Type = adTypeBinary
stm.Open
stm.LoadFromFile PICName 'This file is passed from user click control's click event.
End With

'establishing the SQL statement
SQL = "SELECT Pic, pic_location FROM Picture"

'storing the file into the database
With rs
.CursorType = adOpenKeyset
.LockType = adLockOptimistic
.Open SQL, DatabaseConnection(ServerName, DatabaseName, UserId, Password)
.AddNew
.Fields("pic") = stm.Read
.Fields("Pic_Location") = PICName
.Update
.Close
End With

'prompting the user
MsgBox "Image File : " & PICName & " has been succesfully uploaded to the database.", vbInformation

'clean up
Set rs.ActiveConnection = Nothing
stm.Close
Set rs = Nothing
Set stm = Nothing

End Sub

24.7 Pointer 7:
===============

I am having trouble with creating a format file to bcp in a jpeg. The table has one column which has image as the datatype. When I use the following bcp command and manually provide the listed responses the jpeg goes in fine. When I then try and use the format file the system generates nothing happens no error messages at all.

Here is the bcp command and the responses to the four prompts.

bcp ecsm..image_test in manchesterbw.jpg -T -SDPDDDDS001

Enter the file storage type of field picture [image]: I
Enter prefix-length of field picture [4]: 0
Enter length of field picture [0]: 54026
Enter field terminator [none]:


This is the format file that is generated by the system:

6.0
1
1 SQLBINARY 0 54026 "" 1 picture


This the bcp statement I am using to utilize the format file:

bcp ecsm..image_test in manchesterbw.jpg -f russell.fmt -T -SDPDDDDS001 

--

 think the formal file should be as follows :

6.0
1
1 SQLIMAGE 0 54026 "" 1 data 

--

I changed the SQLBINARY to SQLIMAGE. All is now working thanks for the tip. 

--


24.8 Pointer 8:
===============


Text, ntext, and image data have been around a long time, but their nuances can be easy to overlook. This tutorial, 
provides a quick overview of the implementation and usage of these special data types. 

Databases are growing in size and complexity, in part because today's hardware and software allow us to store mind-boggling 
amounts of data�including multimedia and document data. JPG, PNG, MP3, DOC/RTF, HTML, Unicode, and XML data can all 
be stored as image, text, or ntext in SQL Server databases.

Generally speaking, you use text to store huge ASCII character strings, ntext for Unicode character strings, and image 
for binary image data. Worried about size? Text gives you up to 2^31 - 1 (2,147,483,647) variable-length non-Unicode 
characters, ntext up to 2^30 - 1 (1,073,741,823) characters, and image up to 2^31 - 1 (2,147,483,647) bytes. 
The actual storage size, in bytes, for ntext is two times the number of characters entered. The SQL-92 synonym for ntext 
is national text.

So how do they work? They use pointers to reference the data. Special functions allow the pointer to add, extract, 
or remove data from them. 

Text and image functions
Here I'll describe a number of text and image functions, showing the syntax and an example with output for each.

TEXTPTR
TEXTPTR returns a varbinary of length 16 bytes, which is the text-pointer value that references a text, ntext, or 
image column.

Syntax: TEXTPTR ( column )

--TEXTPTR sample, create a text-pointer, see its value
create table #t (n ntext)
insert #t values('abcdef')
DECLARE @ptrval binary(16)
SELECT @ptrval = TEXTPTR(n) FROM #t 
print @ptrval 
drop table #t

Output:

0xFFFF6900000000004D00000001000000

TEXTVALID
TEXTVALID returns an int, which will be of value 1 if the text-pointer is valid, 0 otherwise.

Syntax: TEXTVALID ( 'table.column' , text_ptr )

--TEXTVALID sample, creates a text-pointer, tests it
create table #t (n ntext)
insert #t values('abxyef')
DECLARE @ptrval binary(16), @ptrval2 binary(16)
SELECT @ptrval = TEXTPTR(n) FROM #t
if TEXTVALID('#t.n',@ptrval)=1
  print '@ptrval has a valid text pointer.'
else	
  print '@ptrval has an invalid text pointer.'
if TEXTVALID('#t.n',@ptrval2)=1	
  print '@ptrval2 has a valid text pointer.'
else  print '@ptrval2 has an invalid text pointer.'
drop table #t

Output:

@ptrval has a valid text pointer.
@ptrval2 has an invalid text pointer.

SET TEXTSIZE
SET TEXTSIZE sets the size, an int value, of text and ntext data to be returned when using a SELECT statement.

Syntax: SET TEXTSIZE { number }

--SET TEXTSIZE sample
create table #t (n ntext)
insert #t values('abcdefghijk')
SET TEXTSIZE 10--ntext is unicode, 2 bytes/character
select * from #t
SET TEXTSIZE 20--ntext is unicode, 2 bytes/character
select * from #t
drop table #t

Output:

abcde
abcdefghij

@@TEXTSIZE
@@TEXTSIZE returns the size, an int value, of text and ntext data to be returned when using a SELECT statement. 
This value is set with SET TEXTSIZE.

Syntax: @@TEXTSIZE

--@@TEXTSIZE sample
SET TEXTSIZE 10--ntext is unicode, 2 bytes/character
print @@TEXTSIZE
SET TEXTSIZE 20--ntext is unicode, 2 bytes/character
print @@TEXTSIZE

Output:

10
20

WRITETEXT
WRITETEXT overwrites the data from a text, ntext, or image column.

Syntax: WRITETEXT { table.column text_ptr } [ WITH LOG ] { data }

--WRITETEXT sample
create table #t (n ntext)
insert #t values('abc')
DECLARE @ptrval binary(16)
SELECT @ptrval = TEXTPTR(n) 
FROM #t
WRITETEXT #t.n @ptrval 'def'
select * from #t
drop table #t

Output:

def

UPDATETEXT
UPDATETEXT changes the data from an existing text, ntext, or image column.

Syntax: UPDATETEXT { table_name.dest_column_name dest_text_ptr } { NULL | insert_offset } { NULL | delete_length } 
[ WITH LOG ] [ inserted_data | { table_name.src_column_name src_text_ptr } ]

--UPDATETEXT sample insertion only
create table #t (n ntext)
insert #t values('bd')
DECLARE @ptrval binary(16), @i int
SELECT @ptrval = TEXTPTR(n) 
FROM #t
UPDATETEXT #t.n @ptrval 0 0 'a'--insert at beginning
select * from #t
UPDATETEXT #t.n @ptrval 2 0 'c'--insert in the middle
select * from #t
set @i=(select DATALENGTH(n) from #t)/2
--/2 only if ntext, 2 bytes/character
print @i
UPDATETEXT #t.n @ptrval @i 0 'e'--insert at the end
select * from #t
drop table #t

Output:

abd
abcd
abcde

Sample deletion and insertion:

--UPDATETEXT sample deletion+insertion
create table #t (n ntext)
insert #t values('abxyef')
DECLARE @ptrval binary(16), @i int
SELECT @ptrval = TEXTPTR(n) 
FROM #t
UPDATETEXT #t.n @ptrval 2 2 'cd'--insert 2, delete 2 
--chars starting at position 2
select * from #t
drop table #t

Output:

abcdef

READTEXT
READTEXT reads a certain amount of data from a text, ntext, or image column.

Syntax: READTEXT { table.column text_ptr offset size } [ HOLDLOCK ]

--READTEXT sample
create table #t (n ntext)
insert #t values('abcdefghijk')
DECLARE @ptrval binary(16)
SELECT @ptrval = TEXTPTR(n) FROM #t
READTEXT #t.n @ptrval 3 8
--read 8 characters starting at position 3
drop table #t

Output:

defghijk

DATALENGTH
DATALENGTH returns the size (number of bytes) of a text, ntext, or image column.

Syntax: DATALENGTH ( expression )

--DATALENGTH sample
create table #t (n ntext)
insert #t values('1234567890')
DECLARE @i int
set @i=(select DATALENGTH(n) from #t)
--it should return the length in bytes=2*UNICODE length
PRINT @i
drop table #t

Output:

20

PATINDEX
PATINDEX returns the location, an int value, of the first occurrence of a pattern in a text, ntext, or image column, 
or 0 if the pattern wasn't found.

Syntax: PATINDEX ( '%pattern%' , expression )

--PATINDEX sample
create table #t (n ntext)
insert #t values('Hello Tim, long time no see!')
SELECT PATINDEX('%tim%', n) FROM #t
SELECT PATINDEX('%time%', n) FROM #t
drop table #t

Output:

7
17

CONVERT
CONVERT returns an expression converted from one data type to another.

Syntax: CONVERT ( data_type [ ( length ) ] , expression [ , style ] )

--CONVERT sample
create table #t (n ntext)
insert #t values('Hello Tim, long time no see!')
DECLARE @c nvarchar(5)
SET @c=(select convert(nvarchar(5),n) from #t)
print @c
drop table #t

Output:

Hello

CAST
CAST returns an expression casted (converted) from one data type to another.

Syntax: CAST ( expression AS data_type )

--CAST sample
create table #t (n ntext)
insert #t values('Hello Tim, long time no see!')
DECLARE @c nvarchar(5)
SET @c=(select CAST ( n  AS nvarchar(5) ) from #t)
print @c
drop table #t

Output:

Hello


Saving a text, ntext, or image column to a file
I created three separate stored procs that show you how you can save one column of type text, ntext, or image to a file. 
You'll find the code for these and all other examples in the accompanying Download file.

--saveText2file sample
create table ##t (n text)
insert ##t values('Hello Tim, long time no see!')
EXEC saveText2file 'c:\test.txt', '##t','n', ''
drop table ##t

--saveNtext2file sample
create table ##t (n ntext)
insert ##t values('Hello Tim, long time no see!')
EXEC saveNtext2file 'c:\test.txt', '##t','n', ''
drop table ##t

--saveImage2file sample
exec saveImage2file 'c:\Category1.bak', 
'Northwind..Categories', 'Picture', 
'where categoryid=1'

Updating a text, ntext, or image column from a file
Because TEXTPTR, WRITETEXT, and UPDATETEXT don't allow variable names to define the table or column parameters, 
reading the contents of a file into a column requires you to use dynamic SQL. Stored proc readImageFromfile can handle 
both image and varchar data types because it reads the data as binary and writes it without using temporary tables. 
Ntext can be read using readNtextFromfile.

--readImageFromfile sample 
--reading a text column from a file
create table ##t (n text)
insert ##t values('Hi Tim, long time no see!')
EXEC readImageFromfile 'c:\hello.txt', '##t','n', ''
select * from ##t
drop table ##t

Output:

Hello


24.9 Pointer 9:
===============

Save Data in a SQL Server Image Column with VB6

I needed to retrieve image fields on SQL Server 7.0 with VB6, and I couldn't find any article about it. 
So, I assume others have had the same problem. I've since found a method for doing it. 
You must use Microsoft ADO 2.5 and set it into the following project reference: 

dim rst as new adodb.recordset
dim adoConn as new adodb.Connection
You also have to open the connection with the database. 
 'Open recordset....
rst.Open "Select * from <TABLE> where <CONDITION>", adoConn, adOpenKeyset,
adLockOptimistic


'THIS FUNCTION SAVES AN IMAGE INTO AN IMAGE DATATYPE FIELD
Private Function SaveImage()
  Dim mStream As New ADODB.Stream

  With mStream
    .Type = adTypeBinary
    .Open
    .LoadFromFile "<IMAGE FILE NAME>"
    rst("<IMAGE FIELD NAME>"). Value = .Read
    rst.Update
  End With
  Set mStream = Nothing
End Function

'THIS FUNCTION LOAD IMAGE FROM IMAGE DATATYPE FIELD AND SAVE IT INTO A
FILE.....
Private Function LoadImage()
  Dim mStream As New ADODB.Stream

  With mStream
    .Type = adTypeBinary
    .Open
    .Write rst("<IMAGE FIELD NAME>")
    .SaveToFile "<DESTINATION FILE NAME>", adSaveCreateOverWrite
  End With

  Set mStream = Nothing

End Function

Aside from this method, you can use a picture control to store an image, put a picture control into a form, 
and call it PictureTemp. 

 PictureTemp.DataField = "Immagine"			'Set DataField....
Set PictureTemp.DataSource = rst			'Set DataSource
You can use the PictureTemp.Picture property to get your image. 
 Private Function LoadImage()
  Dim mStream As New ADODB.Stream

  With mStream
    .Type = adTypeBinary
    .Open
    PictureTemp.DataField = "Immagine"			'Set DataField....
    Set PictureTemp.DataSource = rst			'Set DataSource
    Set MSFGRID.CellPicture = PictureTemp.Picture	'Show image into a cell of
Microsoft FlexGrid
  End With

  Set mStream = Nothing

End
Function

24.10 Pointer 10:
=================

An application that reads and writes Word docs to and from SQL server.


Dim rstRecordset As ADODB.Recordset
Dim cnnConnection As ADODB.Connection
Dim strStream As ADODB.Stream
Dim imgname As String


Private Sub cmdLoad_Click()
    Set cnnConnection = New ADODB.Connection
    Set rstRecordset = New ADODB.Recordset
    imgname = GiveId.Text

    cnnConnection.Open ("Provider=SQLOLEDB; " & _
    "data Source=xpora;" & _
    "Initial Catalog=pubs; " & _
    "User Id=karel;Password=karel")
    
    rstRecordset.Open "Select * from docs where id=" & imgname, cnnConnection, _
    adOpenKeyset, adLockOptimistic
         
    Set strStream = New ADODB.Stream
    strStream.Type = adTypeBinary
    strStream.Open
    
    strStream.Write rstRecordset.Fields("Doc").Value
    strStream.SaveToFile "C:\Temp.doc", adSaveCreateOverWrite
 
    Shell "E:\Program Files\Microsoft Office\Office\Winword.exe " & _
  Chr$(34) & "C:\temp.doc", 1
 End Sub

'Or as an alternative to the Shell command:

Private Sub cmdLoad2_Click()
    Set cnnConnection = New ADODB.Connection
    Set rstRecordset = New ADODB.Recordset
    imgname = GiveId.Text

    cnnConnection.Open ("Provider=SQLOLEDB; " & _
    "data Source=xpora;" & _
    "Initial Catalog=pubs; " & _
    "User Id=karel;Password=karel")
    
    rstRecordset.Open "Select * from docs where id=" & imgname, cnnConnection, _
    adOpenKeyset, adLockOptimistic
         
    Set strStream = New ADODB.Stream
    strStream.Type = adTypeBinary
    strStream.Open
    
    strStream.Write rstRecordset.Fields("Doc").Value
    strStream.SaveToFile "C:\Temp.doc", adSaveCreateOverWrite
    
    Dim wsApp As Word.Application
    'Set wsApp = GetObject(, "Word.Application")
    Set wsApp = CreateObject("Word.Application")
        wsApp.Visible = True
        wsApp.Documents.Open ("c:\temp.doc")
    
End Sub

Private Sub cmdQuit_Click()
  End
End Sub

Private Sub cmdSelectSave_Click()

'Shell "C:\Program Files\Microsoft SQL Server\mssql\binn\textcopy.exe " & _
'"-I -S xpora -D pubs -T docs -C doc -U karel -P karel -W where id=" & imgname & " -F c:\temp.doc"

    Set cnnConnection = New ADODB.Connection
    Set rstRecordset = New ADODB.Recordset
    imgname = GiveId.Text

    cnnConnection.Open ("Provider=SQLOLEDB; " & _
    "data Source=xpora;" & _
    "Initial Catalog=pubs; " & _
    "User Id=karel;Password=karel")
    
    rstRecordset.Open "Select * from docs where id=" & imgname, cnnConnection, _
    adOpenKeyset, adLockOptimistic
         
    Set mstream = New ADODB.Stream
    mstream.Type = adTypeBinary
    mstream.Open
    mstream.LoadFromFile "c:\temp.doc"
    rstRecordset.Fields("doc").Value = mstream.Read
    rstRecordset.Update

    rstRecordset.Close
    cnnConnection.Close

End Sub


============================================
25. Named pipes, Sockets, and Multiprotocol:
============================================


25.1 TCP/IP Sockets:
--------------------

Suppose the Server 10.10.10.1 has multiple Server programs running. 
How does a client differentiate between the multiple Server programs?

The usual way with tcpip is the use of sockets. A socket is an "identifier" completely
identifying the location of a Server on the network, as well as the "port" the server service is listening on,
like for example:

10.10.10.1 : 1521  or for example
10.10.10.1 : 1433

The client should have knowledge of the "port" of the desired Host program or the host service is listening on.
For example it could come from a local services file, or a registry.

The client constructs a tcp header, while in the destination port, the port is listed where the Host Server service
or deamon is listening on.


                          Server, IP=10.10.10.1
  
                         |------------------------------------------------
                         |                                               |
                         |   ------------------   ---------------------  |
                         |  |Oracle listener   |  |SQL Server listener | |
                         |  |listening on port |  |listening on port   | |
                         |  |1521              |  | 1433               | |
                         |   ------------------    --------------------  |
                         |           ^                 ^                 |
                         |           |                 |                 |
 client request for      |           |                 |                 |
 connection to Oracle    |           |                 |                 |
 10.10.10.1:1521         |   -------------------------------             |
 ----------------------> |   |Portmapper / Netlib router   |             |
                         |   |handling                     |             |
 Client request for      |   |requests to the desired host |             |
 connection to SQL Server|   |program                      |             |
 10.10.10.1:1433         |   |                             |             |
 ----------------------> |   |                             |             |
                         |   ------------------------------              |
                         |                                               |
                         |------------------------------------------------


25.2 Named pipes:
-----------------

A high level process, like a client program, can open and write to a "special file", the "named pipe".
The named pipe can be considered to be at the OSI layer 7, and is an IPC mechanism for process to process
communication, locally or across a network.


In Windows, the design of named pipes is biased towards client-server communication, and they work much like sockets: 
other than the usual read and write operations, Windows named pipes also support an explicit "passive" mode 
for server applications (compare: UNIX domain sockets).

Named pipes aren't permanent and can't be created as special files on any writable filesystem, unlike in UNIX, 
but are volatile names (freed after the last reference to them is closed) allocated in the root directory of 
the named pipe filesystem (NPFS), mounted under the special path \\.\pipe\ (that is, a pipe named "foo" would 
have a full path name of \\.\pipe\foo). Anonymous pipes used in pipelining actually are named pipes with a random name.

In "constructing" the client program (VB, C++, VB.NET, C# etc...) there is some sort of mechanisme to create
a named pipe, for example:

Public Declare Function CallNamedPipe Lib "kernel32" Alias "CallNamedPipeA" _
(ByVal lpNamedPipeName As String, etc......


The pipe is an IPC construct above any network protocol as sockets/tcp/ip, 
or nwlink spx/ipx etc..
It uses the IPC$ share of the remote system, just like a filesystemshare.

\\computername\pipe\MSSQL$instancename\sql\query


CLIENT:
----------------------------------------       rw to and from pipe
named pipe \\.\sql\query,                  <-------------------------> Server named pipe
functions like a sort of URL or share
----------------------------------------
session management, sockets, netbios
----------------------------------------
TCP   SPX   
----------------------------------------
IP    IPX
----------------------------------------
Datalink
----------------------------------------
physiscal network
----------------------------------------


25.3 Multiprotocol:
-------------------

It's a protocol that layers over named pipes, tcpip sockets, or nwlink spx/ipx sockets.
So, just MUST have one of the above IPC mechanismens available.

The Multiprotocol selection has two key features: 

Automatic selection of an available network protocol to communicate with an instance of Microsoft� SQL Server�. 
This is convenient when you want to connect to multiple servers running different network protocols 
but do not want to reconfigure the client connection for each server. If the client and server Net-Libraries 
for TCP/IP Sockets, NWLink IPX/SPX, or Named Pipes are installed on the client and server, 
the Multiprotocol Net-Library will automatically choose the first available network protocol 
to establish a connection.

Client encryption. 
You can enforce encryption over the Multiprotocol Net-Library on clients running on the Microsoft 
Windows NT� 4.0, Windows� 2000, Windows 95, or Windows 98 operating system to prevent others from intercepting 
and viewing sensitive data.

The Multiprotocol Net-Library takes advantage of the remote procedure call (RPC) facility of 
Windows NT 4.0 and Windows 2000, which provides Windows Authentication. For the Multiprotocol Net-Library, 
clients determine the server address using the server name.

Usage Considerations
Before using the Multiprotocol Net-Library, consider the following: 

The Multiprotocol Net-Library does not support named instances of SQL Server 2000. You can use the 
Multiprotocol Net-Library to connect to the default instance of SQL Server on a computer, but you cannot connect 
to any named instances.

The Multiprotocol Net-Library does not support server enumeration. From applications that can list servers 
by calling dbserverenum, you cannot identify servers running an instance of SQL Server and listening 
on the Multiprotocol Net-Library. 


====================================================
26. (Traditional) Client connections to SQL Server:
====================================================

 -------   -------     -------  -------
 |App 1|   |App 2|     |App 3|  |App 4|
 -------   -------     -------  -------
     |        |           |        |
     |     -------     -------     |
     |     |ADO  |     |RDO  |     |
     |     -------     -------     |
     |        |            |       |
   -----------------   ---------------
   |OLE DB         |   |ODBC         |   (TabularDataStream TDS)
   -----------------   ---------------
           |                   |
   -----------------------------------
   |Client Network library api       |
   |- named pipes                    |
   |- tcpip sockets                  |
   |- multiprotocol                  |
   -----------------------------------
                |
network         | tcp/ip, spx/ipx etc..
----------------------------------------------------------
                |
                |
   ----------------------------------- 
   |SQL Server network library       |
   -----------------------------------
                |
   -----------------
   |SQL Server     |  (TDS)
   -----------------


=======================================================
27. Example of a complete program, implemented as a SP:
=======================================================


CREATE PROCEDURE stp_Mig_Sol_Revised
AS


/**********************************************************************/
/* PURPOSE:  MIGRATION OF SOLID TO SQLServer. Revised Procedure.      */
/* STATUS :  READY FOR PRODUCTION.                                    */
/* VERSION:  1.0                                                      */
/*                                                                    */
/* DATE   :  21-04-2004                                               */
/* AUTHOR :  AvdS                                                     */
/* --------------------------------------------------                 */
/*                                                                    */
/* COMPLETELY REVISED PROCEDURE. NOW WE USE DTS TO LOAD DIRECTLY      */
/* FROM SOLID TABLES TO THE MSSQL TABLES, INSTEAD OF OUTPUTTING FROM  */
/* SOLID WITH SOLEXP TO TXTFILES, THEN USING bcp FOR IMPORT, SCANNING */
/* FIELDS ON UNWANTED CHARACTERS, AND ENDLESS SCRUBBING AND           */
/* TRANSFORMING FIELDS.                                               */
/*                                                                    */
/* THE BUSINESS LOGIC IS STILL THE SAME.                              */
/*                                                                    */
/* You may name this procedure anything you like.                     */
/*                                                                    */
/**********************************************************************/


-- BEFORE RUNNING THIS PROCEDURE CONSIDER THE FOLLOWING:
-- -----------------------------------------------------

-- 1. THE DTS PACKAGES (9 PACKAGES) SHOULD BE LOCATED IN C:\EXP_SOLID
--    IF THE PACKEGES ARE LOCATED ELSEWHERE, YOU MUST FIND/REPLACE THE PATH 
      C:\EXP_SOLID IN THIS PROCEDURE, WITH THE CORRECT PATH. 
-- 2. THE STANDARD MSSQL TOOL "DTSRUN.EXE" EXECUTABLE MUST BE PRESENT ON THE TARGET SYSTEM
-- 3. THE SOLID SERVER 2.x SHOULD BE UP AND RUNNING. 
-- 4. THE SOLID ODBC DRIVER SHOULD BE PRESENT ON THE TARGET SYSTEM.


SET ANSI_WARNINGS OFF
SET NOCOUNT ON


/*********************************************************/
/*STEP 1. VARIABLE DECLARATIONS.                         */
/*********************************************************/

-- VARIABLES FOR PROCESSING

DECLARE @TEMPTAB               VARCHAR(256)
DECLARE @TEMPTAB2              VARCHAR(256)
DECLARE @length_table          VARCHAR(256)
DECLARE @FK                    VARCHAR(128)
DECLARE @REFERENCED            VARCHAR(128)
DECLARE @ART_NR                VARCHAR(128)
DECLARE @EENH_NM               VARCHAR(128)
DECLARE @COUNT_ARTICLES        INT

-- VARIABLES FOR ERROR AND REPORTING:

DECLARE @ERR_MESSAGE           VARCHAR(256)
DECLARE @LOGSTRING             VARCHAR(256)
DECLARE @LOGFILE               VARCHAR(256)
DECLARE @DTS_PRESENT           INT
DECLARE @ODBC_PRESENT          INT

DECLARE @_NG39AFGP_B           INT
DECLARE @_NG39LEV_B            INT
DECLARE @_NG39ART_B            INT
DECLARE @_NG39TOES_B           INT
DECLARE @_NG39BUDG_B           INT
DECLARE @_NG39MLT_B            INT
DECLARE @_NG39USER_B           INT
DECLARE @_NG39EENH_B           INT
DECLARE @_NG39SYST_B           INT

DECLARE @_NG39AFGP_A           INT
DECLARE @_NG39LEV_A            INT
DECLARE @_NG39ART_A            INT
DECLARE @_NG39TOES_A           INT
DECLARE @_NG39BUDG_A           INT
DECLARE @_NG39MLT_A            INT
DECLARE @_NG39USER_A           INT
DECLARE @_NG39EENH_A           INT
DECLARE @_NG39SYST_A           INT

DECLARE @DIFF_NG39AFGP         INT
DECLARE @DIFF_NG39LEV          INT
DECLARE @DIFF_NG39ART          INT
DECLARE @DIFF_NG39TOES         INT
DECLARE @DIFF_NG39BUDG         INT
DECLARE @DIFF_NG39MLT          INT
DECLARE @DIFF_NG39USER         INT
DECLARE @DIFF_NG39EENH         INT
DECLARE @DIFF_NG39SYST         INT


/*********************************************************/
/*STEP 2. SIMPLE CHECKs ON SOME PRELIMININARIES.         */
/*********************************************************/

-- IF YOU DO NOT LIKE STEP 2, YOU CAN WIPE IT OUT ENTIRELY.


-- CHECK ON C:\EXP_SOLID
-- ---------------------

EXEC @DTS_PRESENT = master.dbo.xp_cmdshell 'dir c:\exp_solid\*.dts'
IF (@DTS_PRESENT <> 0)
BEGIN
   SET @ERR_MESSAGE='DTS packages not found in c:\exp_solid. Procedure aborted.'
   GOTO error_section
END

-- CHECK ON SOLID 2.x ODBC DRIVER
-- ------------------------------

EXEC @ODBC_PRESENT = master.dbo.xp_cmdshell 'dir %SYSTEMROOT%\system32\sosw*.*'
IF (@ODBC_PRESENT <> 0)
BEGIN
   SET @ERR_MESSAGE='SOLID ODBC DRIVER PROBABLY NOT INSTALLED. Procedure aborted.'
   GOTO error_section
END


/************************************************************/
/*STEP 3. REMOVE POSSIBLY EXISTING STAGING TEMP TABLES.     */
/************************************************************/

DECLARE cur1 CURSOR FOR
SELECT name FROM sysobjects
WHERE name like 'ST%' AND xtype='U'

OPEN cur1
FETCH NEXT FROM cur1 INTO @TEMPTAB

WHILE (@@FETCH_STATUS<>-1)
  BEGIN
    exec ('DROP TABLE '+@TEMPTAB)
    FETCH NEXT FROM cur1 INTO @TEMPTAB
  END
CLOSE cur1
DEALLOCATE cur1

IF @@error > 0
BEGIN
   SET @ERR_MESSAGE='Error: error dropping temporary staging tables.'
   GOTO error_section
END


/************************************************************/
/*STEP 4. DROP FOREIGN KEY CONSTRAINTS.                     */
/************************************************************/


if exists (select * from dbo.sysobjects where id = object_id(N'[dbo].[FK_NG39AFGR_NG39AFG]') and OBJECTPROPERTY(id, N'IsForeignKey') = 1)
ALTER TABLE [dbo].[NG39AFGR] DROP CONSTRAINT FK_NG39AFGR_NG39AFG


if exists (select * from dbo.sysobjects where id = object_id(N'[dbo].[FK_NG39AFG_NG39AFGP]') and OBJECTPROPERTY(id, N'IsForeignKey') = 1)
ALTER TABLE [dbo].[NG39AFG] DROP CONSTRAINT FK_NG39AFG_NG39AFGP


if exists (select * from dbo.sysobjects where id = object_id(N'[dbo].[FK_NG39AFGR_NG39ART]') and OBJECTPROPERTY(id, N'IsForeignKey') = 1)
ALTER TABLE [dbo].[NG39AFGR] DROP CONSTRAINT FK_NG39AFGR_NG39ART


if exists (select * from dbo.sysobjects where id = object_id(N'[dbo].[FK_NG39ONTR_NG39ART]') and OBJECTPROPERTY(id, N'IsForeignKey') = 1)
ALTER TABLE [dbo].[NG39ONTR] DROP CONSTRAINT FK_NG39ONTR_NG39ART


if exists (select * from dbo.sysobjects where id = object_id(N'[dbo].[FK_NG39DGLB_NG39BUDG]') and OBJECTPROPERTY(id, N'IsForeignKey') = 1)
ALTER TABLE [dbo].[NG39DGLB] DROP CONSTRAINT FK_NG39DGLB_NG39BUDG


if exists (select * from dbo.sysobjects where id = object_id(N'[dbo].[FK_NG39AFG_NG39DGLB]') and OBJECTPROPERTY(id, N'IsForeignKey') = 1)
ALTER TABLE [dbo].[NG39AFG] DROP CONSTRAINT FK_NG39AFG_NG39DGLB


if exists (select * from dbo.sysobjects where id = object_id(N'[dbo].[FK_NG39DSTK_NG39DGLB]') and OBJECTPROPERTY(id, N'IsForeignKey') = 1)
ALTER TABLE [dbo].[NG39DSTK] DROP CONSTRAINT FK_NG39DSTK_NG39DGLB


if exists (select * from dbo.sysobjects where id = object_id(N'[dbo].[FK_NG39DTOE_NG39DGLB]') and OBJECTPROPERTY(id, N'IsForeignKey') = 1)
ALTER TABLE [dbo].[NG39DTOE] DROP CONSTRAINT FK_NG39DTOE_NG39DGLB


if exists (select * from dbo.sysobjects where id = object_id(N'[dbo].[FK_NG39SCHF_NG39DGLB]') and OBJECTPROPERTY(id, N'IsForeignKey') = 1)
ALTER TABLE [dbo].[NG39SCHF] DROP CONSTRAINT FK_NG39SCHF_NG39DGLB


if exists (select * from dbo.sysobjects where id = object_id(N'[dbo].[FK_NG39SCHF_NG39DSTK]') and OBJECTPROPERTY(id, N'IsForeignKey') = 1)
ALTER TABLE [dbo].[NG39SCHF] DROP CONSTRAINT FK_NG39SCHF_NG39DSTK


if exists (select * from dbo.sysobjects where id = object_id(N'[dbo].[FK_NG39ONT_NG39LEV]') and OBJECTPROPERTY(id, N'IsForeignKey') = 1)
ALTER TABLE [dbo].[NG39ONT] DROP CONSTRAINT FK_NG39ONT_NG39LEV


if exists (select * from dbo.sysobjects where id = object_id(N'[dbo].[FK_NG39DSTK_NG39MLT]') and OBJECTPROPERTY(id, N'IsForeignKey') = 1)
ALTER TABLE [dbo].[NG39DSTK] DROP CONSTRAINT FK_NG39DSTK_NG39MLT


if exists (select * from dbo.sysobjects where id = object_id(N'[dbo].[FK_NG39ONTR_NG39ONT]') and OBJECTPROPERTY(id, N'IsForeignKey') = 1)
ALTER TABLE [dbo].[NG39ONTR] DROP CONSTRAINT FK_NG39ONTR_NG39ONT


if exists (select * from dbo.sysobjects where id = object_id(N'[dbo].[FK_NG39DTOE_NG39TOES]') and OBJECTPROPERTY(id, N'IsForeignKey') = 1)
ALTER TABLE [dbo].[NG39DTOE] DROP CONSTRAINT FK_NG39DTOE_NG39TOES


/************************************************************/
/*STEP 5. CREATE STAGING TABLES.                            */
/************************************************************/

-- WE DO NOT USE #tablename or ##tablename TEMPOPARY TABLES, BUT
-- TRUE DATABASE TABLES. IN THIS CASE, COLLATION CONFLICTS ARE IMPOSSIBLE.
-- WE DO THIS TO AVOID ANY PROBLEM.
-- AFTERWARDS, ALL TEMPOPARY TABLES ARE DROPPED.

-- TEMP STAGING TABLES:
-- --------------------

CREATE TABLE [dbo].[STNG39AFGP] (
	[AFG_PNT_NR]   [int]            NOT NULL ,
	[U_VERSION]    [nvarchar] (1)   NULL ,
	[AFG_PNT_OMS]  [nvarchar] (40)  NULL ,
	[AFG_PNT_VAST] [char] (1)       NULL    -- will be converted to bit
) ON [PRIMARY]


IF @@error > 0
BEGIN
   SET @ERR_MESSAGE='Error in creating temp table STNG39AFGP. Procedure aborted.'
   GOTO error_section
END


CREATE TABLE [dbo].[STNG39EENH] (
	[EENH_NM]      [nvarchar] (92)   NOT NULL ,
	[U_VERSION]    [nvarchar] (1)    NULL ,
	[VLG_OFF_NM]   [nvarchar] (92)   NULL ,
	[VLG_OFF_RNG]  [nvarchar] (15)   NULL ,
	[EENH_CMD_NM]  [nvarchar] (92)   NULL ,
	[EENH_CMD_RNG] [nvarchar] (15)   NULL ,
	[EENH_HLD_NM]  [nvarchar] (92)   NULL ,
	[EENH_HLD_RNG] [nvarchar] (15)   NULL ,
	[OFF_GNKD_NM]  [nvarchar] (92)   NULL ,
	[OFF_GNKD_RNG] [nvarchar] (15)   NULL ,
	[EENH_PRG_VRL] [float]           NULL ,
	[EENH_BTW_H]   [float]           NOT NULL ,
	[EENH_BTW_L]   [float]           NOT NULL ,
	[EENH_VERW_DT] [smalldatetime]   NULL ,
	[EENH_STER_DT] [smalldatetime]   NULL 
) ON [PRIMARY]


IF @@error > 0
BEGIN
   SET @ERR_MESSAGE='Error in creating temp table STNG39EENH. Procedure aborted.'
   GOTO error_section
END

CREATE TABLE [dbo].[STNG39LEV] (
	[LEV_NR]      [int]           NOT NULL , -- IN BBV WE HAVE THE IDENTITY (1,1) PROPERTY BOUND TO THIS FIELD
	[U_VERSION]   [nvarchar] (1)  NULL ,
	[LEV_NM]      [nvarchar] (30) NOT NULL ,
	[LEV_ADR]     [nvarchar] (30) NULL ,
	[LEV_PC]      [nvarchar] (6)  NULL ,
	[LEV_PLTS]    [nvarchar] (42) NULL ,
	[LEV_REGMAG]  [char](1)       NOT NULL  -- will be converted to bit
     -- [LEV_BTLND]   [bit]           NOT NULL             -- COLUMN NOT PRESENT IN SOLID
) ON [PRIMARY]

IF @@error > 0
BEGIN
   SET @ERR_MESSAGE='Error in creating temp table STNG39LEV. Procedure aborted.'
   GOTO error_section
END


CREATE TABLE [dbo].[STNG39MLT] (
	[MLT_NR]     [int]            NOT NULL ,
	[U_VERSION]  [nvarchar] (1)   NULL ,
	[MLT_NM]     [nvarchar] (30)  NOT NULL ,
	[MLT_PR]     [money]          NOT NULL 
) ON [PRIMARY]

IF @@error > 0
BEGIN
   SET @ERR_MESSAGE='Error in creating temp table STNG39MLT. Procedure aborted.'
   GOTO error_section
END


CREATE TABLE [dbo].[STNG39TOES] (
	[TOES_NR]   [int]            NOT NULL ,
	[U_VERSION] [nvarchar] (1)   NULL ,
	[TOES_BDR]  [money]          NOT NULL 
     -- [TOES_OMS]  [varchar] (500)  NULL   -- COLUMN IN SOLID NOT PRESENT
     -- [TOES_URL]  [varchar] (100)  NULL   -- COLUMN IN SOLID NOT PRESENT
) ON [PRIMARY]


IF @@error > 0
BEGIN
   SET @ERR_MESSAGE='Error in creating temp table STNG39TOES. Procedure aborted.'
   GOTO error_section
END

CREATE TABLE [dbo].[STNG39USER] (
	[USR_ID]       [nvarchar] (12)  NOT NULL ,
	[USR_GROEP]    [nvarchar] (6)   NULL ,
	[USR_NAAM]     [nvarchar] (60)  NULL ,
	[USR_PASSWORD] [nvarchar] (32)  NOT NULL ,
	[USR_PASS_DT]  [smalldatetime]  NULL ,
	[USR_PASS_PER] [int]            NOT NULL 
) ON [PRIMARY]

IF @@error > 0
BEGIN
   SET @ERR_MESSAGE='Error in creating temp table STNG39USER. Procedure aborted.'
   GOTO error_section
END

CREATE TABLE [dbo].[STNG39ART] (
	[ART_NR]       [nvarchar] (7)   NOT NULL ,
	[U_VERSION]    [nvarchar] (1)   NULL ,
	[ART_OMS]      [nvarchar] (55)  NULL ,
	[ART_VRP_EENH] [nvarchar] (3)   NOT NULL ,
	[ART_AANT]     [decimal](8, 2)  NOT NULL ,
	[ART_PR]       [money]          NOT NULL ,
	[ART_BTW]      [nvarchar] (1)   NULL ,
	[ART_STATUS]   [nvarchar] (1)   NULL 
) ON [PRIMARY]

IF @@error > 0
BEGIN
   SET @ERR_MESSAGE='Error in creating temp table STNG39ART. Procedure aborted.'
   GOTO error_section
END

CREATE TABLE [dbo].[STNG39BUDG] (
	[BUD_PER_NR]  [int]          NOT NULL ,
	[U_VERSION]   [nvarchar] (1) NULL ,
	[BUD_BEG_SAL] [money]        NULL ,
	[BUD_END_SAL] [money]        NULL ,
	[BUD_ASL_IND] [nvarchar] (1) NULL 
) ON [PRIMARY]

IF @@error > 0
BEGIN
   SET @ERR_MESSAGE='Error in creating temp table STNG39BUDG. Procedure aborted.'
   GOTO error_section
END

CREATE TABLE [dbo].[STNG39SYST] (
	[ID]        [nvarchar] (10)   NOT NULL ,
	[U_VERSION] [nvarchar] (1)    NULL ,
	[WAARDE]    [nvarchar] (40)   NULL 
) ON [PRIMARY]

IF @@error > 0
BEGIN
   SET @ERR_MESSAGE='Error in creating temp table STNG39SYST. Procedure aborted.'
   GOTO error_section
END


/************************************************************/
/*STEP 6. LOAD DATA FROM SOLID TO STAGING TABLES WITH DTS.  */
/************************************************************/

-- INFO: HOW TO RUN DTS PACKAGES:

-- FROM SQL:
-- ---------

-- exec xp_cmdshell "DTSRun /S servername /U username /P password /Fpackagename"

-- Or:

-- exec xp_cmdshell "DTSRun /S servername /E  /Fpackagename"

-- /E : trusted connection, or use /U username /P password

exec master.dbo.xp_cmdshell "DTSRun  /E /FC:\exp_solid\INIT_STNG39AFGP.dts"
exec master.dbo.xp_cmdshell "DTSRun  /E /FC:\exp_solid\INIT_STNG39ART.dts"
exec master.dbo.xp_cmdshell "DTSRun  /E /FC:\exp_solid\INIT_STNG39BUDG.dts"
exec master.dbo.xp_cmdshell "DTSRun  /E /FC:\exp_solid\INIT_STNG39EENH.dts"
exec master.dbo.xp_cmdshell "DTSRun  /E /FC:\exp_solid\INIT_STNG39LEV.dts"
exec master.dbo.xp_cmdshell "DTSRun  /E /FC:\exp_solid\INIT_STNG39MLT.dts"
exec master.dbo.xp_cmdshell "DTSRun  /E /FC:\exp_solid\INIT_STNG39SYST.dts"
exec master.dbo.xp_cmdshell "DTSRun  /E /FC:\exp_solid\INIT_STNG39TOES.dts"
exec master.dbo.xp_cmdshell "DTSRun  /E /FC:\exp_solid\INIT_STNG39USER.dts"

IF (SELECT COUNT(*) FROM STNG39ART) = 0
BEGIN
   SET @ERR_MESSAGE='Error in running DTS datatransfer. Procedure aborted.'
   GOTO error_section
END


/************************************************************/
/*STEP 7. CHECK ON DUPLICATE ARTICLES.                      */
/************************************************************/

-- LET'S FIRST CHECK ON DUPLICATE ARTICLES IN STAGING TABLE STNG39ART.
-- IF DUPLICATE ARTICLES ARE FOUND, WE ABORT THIS PROCEDURE
-- AND NOTHING WILL BE CHANGED.


IF EXISTS (SELECT COUNT(ART_NR) FROM STNG39ART GROUP BY ART_NR HAVING COUNT(*) > 1)

   BEGIN
   PRINT 'DUPLICATE ARTICLES FOUND:'
   SELECT ART_NR FROM STNG39ART GROUP BY ART_NR HAVING COUNT(*) > 1
   SET @ERR_MESSAGE='Duplicate Article_numbers found in Staging Artikelentabel. Procedure aborted.'
   GOTO error_section
   END


/************************************************************/
/*STEP 8. TRANSFORMATIONS                                   */
/************************************************************/


-- SOME TRANSFORMATIONS ARE STILL NEEDED.


-- 8.1. SPECIAL CASES OF BIT VALUES "T" AND "F".
-- ---------------------------------------------

-- STAGING TABLE STNG39AFGP HAS THE COLUMN [AFG_PNT_VAST] (on purpose) SET TO [char] (1) 
-- BECAUSE THAT IS THE SAME AS THE CORRESPONDING TABLE IN SOLID (NG39AFGP).
-- THAT COLUMN CONTAINS VALUES AS "T"AND "F".
-- SO HERE WE REBUILD THOSE VALUES TO "1" AND "0".
-- A SIMILAR ARGUMENT IS TRUE FOR STNG39LEV.

--  STNG39AFGP:
-- ------------

IF (SELECT COUNT(*) FROM STNG39AFGP WHERE AFG_PNT_VAST='T') > 0
    BEGIN
      UPDATE STNG39AFGP
      SET AFG_PNT_VAST ='1' WHERE AFG_PNT_VAST='T'
    END 

IF (SELECT COUNT(*) FROM STNG39AFGP WHERE AFG_PNT_VAST='F') > 0
    BEGIN
      UPDATE STNG39AFGP
      SET AFG_PNT_VAST ='0' WHERE AFG_PNT_VAST='F'
    END

-- STNG39LEV:
-- ----------

IF (SELECT COUNT(*) FROM STNG39LEV WHERE LEV_REGMAG='T') > 0
    BEGIN
      UPDATE STNG39LEV
      SET LEV_REGMAG ='1' WHERE LEV_REGMAG='T'
    END 

IF (SELECT COUNT(*) FROM STNG39LEV WHERE LEV_REGMAG='F') > 0
    BEGIN
      UPDATE STNG39LEV
      SET LEV_REGMAG ='0' WHERE LEV_REGMAG='F'
    END


/****************************************************************************/
/*STEP 9. EMPTY THE INVOLVED PRODUCTION TABLES IN SQL Server.               */
/****************************************************************************/


-- 9.1 FIRST LET'S COUNT THE ORIGINAL NO OF RECORDS IN PRODUCTION TABLES FOR REPORTING PURPOSES,
--     BEFORE IMPORT TAKES PLACE.
-- ---------------------------------------------------------------------------------------------

SELECT @_NG39AFGP_B    = (SELECT COUNT(*) FROM NG39AFGP)
SELECT @_NG39LEV_B     = (SELECT COUNT(*) FROM NG39LEV)
SELECT @_NG39ART_B     = (SELECT COUNT(*) FROM NG39ART)
SELECT @_NG39TOES_B    = (SELECT COUNT(*) FROM NG39TOES)
SELECT @_NG39BUDG_B    = (SELECT COUNT(*) FROM NG39BUDG)
SELECT @_NG39MLT_B     = (SELECT COUNT(*) FROM NG39MLT)
SELECT @_NG39USER_B    = (SELECT COUNT(*) FROM NG39USER)
SELECT @_NG39EENH_B    = (SELECT COUNT(*) FROM NG39EENH)
SELECT @_NG39SYST_B    = (SELECT COUNT(*) FROM NG39SYST)


-- 9.2 EMPTY TABLES (MOST LIKELY THEY ARE ALREADY EMPTY)
-- -----------------------------------------------------

-- IF YOU HAVE CONCERNS ABOUT THE SIZE OF THE TRANSACTION LOG, YOU COULD INSTEAD USE TRUNCATE STATEMENTS.
-- HERE WE PUT THE DELETES INTO A TRANSACTION.
-- PRODUCTION TABLES ARE PROBABLY EMPTY TO START WITH.
-- WE ONLY EMPTY THE TABLES THAT WILL RECEIVE THE MIGRATED RECORDS FROM SOLID.


BEGIN TRAN EMPTY

DELETE FROM NG39AFGP
DELETE FROM NG39LEV
DELETE FROM NG39ART
DELETE FROM NG39TOES
DELETE FROM NG39BUDG
DELETE FROM NG39MLT
DELETE FROM NG39USER
DELETE FROM NG39EENH
DELETE FROM NG39SYST
DELETE FROM NG39DGLB

IF @@error=0
BEGIN
COMMIT TRAN EMPTY
END

ELSE
BEGIN
ROLLBACK TRAN EMPTY
SET @ERR_MESSAGE='Removing old data from tables did not succeed.' 
GOTO error_section
END


/**************************************************************************/
/*STEP 10. NOW COPY THE DATA FROM STAGING TABLES INTO PRODUCTION TABLES.  */
/**************************************************************************/

-- ---------------------------------------
INSERT INTO NG39AFGP
SELECT
        CONVERT(INT,AFG_PNT_NR),
                U_VERSION,
                AFG_PNT_OMS,
        CONVERT(bit,AFG_PNT_VAST)
FROM STNG39AFGP

IF @@error > 0
BEGIN
   SET @ERR_MESSAGE='Error loading NG39AFGP'
   GOTO error_section
END

-- ---------------------------------------
SET IDENTITY_INSERT NG39LEV ON

INSERT INTO NG39LEV
(LEV_NR, U_VERSION,LEV_NM,LEV_ADR,LEV_PC,LEV_PLTS,LEV_REGMAG,LEV_BTLND)
SELECT
         CONVERT(int,LEV_NR),
                U_VERSION,
                LEV_NM,
                LEV_ADR,
                LEV_PC,
                LEV_PLTS,
        CONVERT(bit,LEV_REGMAG),
				0
FROM STNG39LEV
SET IDENTITY_INSERT NG39LEV OFF

IF @@error > 0
BEGIN
   SET @ERR_MESSAGE='Error loading NG39LEV'
   GOTO error_section
END

-- ---------------------------------------
INSERT INTO NG39ART
SELECT
                ART_NR,
                U_VERSION,
                ART_OMS,
                ART_VRP_EENH,
        CONVERT(decimal (8,2),ART_AANT),
        CONVERT(money,CONVERT(float,ART_PR)),
                ART_BTW,
                ART_STATUS
FROM STNG39ART

UPDATE NG39ART
SET ART_PR=(ART_PR/100)


IF @@error > 0
BEGIN
   SET @ERR_MESSAGE='Error loading NG39ART'
   GOTO error_section
END

-- ---------------------------------------
INSERT INTO NG39TOES
(TOES_NR,U_VERSION,TOES_BDR)
SELECT
        CONVERT(int,TOES_NR),
                U_VERSION,
        CONVERT(money,CONVERT(float,TOES_BDR)) --,
                -- TOES_OMS,
                -- TOES_URL
FROM STNG39TOES

UPDATE NG39TOES
SET TOES_BDR=(TOES_BDR/100)

IF @@error > 0
BEGIN
   SET @ERR_MESSAGE='Error loading NG39TOES'
   GOTO error_section
END

-- ---------------------------------------
INSERT INTO NG39BUDG
SELECT
        CONVERT(int,BUD_PER_NR),
                U_VERSION,
        CONVERT(money,CONVERT(float,BUD_BEG_SAL)),
        CONVERT(money,CONVERT(float,BUD_END_SAL)),
                BUD_ASL_IND
FROM STNG39BUDG

IF @@error > 0
BEGIN
   SET @ERR_MESSAGE='Error loading NG39BUDG'
   GOTO error_section
END

-- ---------------------------------------
INSERT INTO NG39MLT
SELECT
        CONVERT(int,MLT_NR),
                U_VERSION,
                MLT_NM,
        CONVERT(money,CONVERT(float,MLT_PR))
FROM STNG39MLT

UPDATE NG39MLT
SET MLT_PR=(MLT_PR/100)


/*
UPDATE NG39MLT
SET MLT_PR=(MLT_PR/(100+(EENH_BTW_H/100)) * 100) where ART_BTW = "H";
UPDATE NG39MLT
SET MLT_PR=(MLT_PR/(100+(EENH_BTW_L/100)) * 100) where ART_BTW = "L";
*/

IF @@error > 0
BEGIN
   SET @ERR_MESSAGE='Error loading NG39MLT'
   GOTO error_section
END


-- ---------------------------------------
INSERT INTO NG39EENH
SELECT 
              EENH_NM,
              U_VERSION,
              VLG_OFF_NM,
              VLG_OFF_RNG,
              EENH_CMD_NM,
              EENH_CMD_RNG,
              EENH_HLD_NM,
              EENH_HLD_RNG,
              OFF_GNKD_NM,
              OFF_GNKD_RNG,
      CONVERT(float,EENH_PRG_VRL),
      CONVERT(float,EENH_BTW_H),
      CONVERT(float,EENH_BTW_L),
      CONVERT(smalldatetime,EENH_VERW_DT),
      CONVERT(smalldatetime,EENH_STER_DT)
FROM STNG39EENH

UPDATE NG39EENH
SET EENH_PRG_VRL=(EENH_PRG_VRL/10),
    EENH_BTW_H=(EENH_BTW_H/10),
    EENH_BTW_L=(EENH_BTW_L/10)


UPDATE NG39EENH
SET VLG_OFF_NM=''
WHERE VLG_OFF_NM='UL'

UPDATE NG39EENH
SET VLG_OFF_RNG=''
WHERE VLG_OFF_RNG='UL'


IF @@error > 0
BEGIN
   SET @ERR_MESSAGE='Error loading NG39EENH'
   GOTO error_section
END

-- Updaten van Prijzen. Was Incl. BTW nu ex BTW maken. 
DECLARE @EENH_BTW_H float
DECLARE @EENH_BTW_L float
SELECT @EENH_BTW_H = (Select EENH_BTW_H from NG39EENH)
SELECT @EENH_BTW_L = (Select EENH_BTW_L from NG39EENH)

UPDATE    NG39ART
SET              ART_PR = (ART_PR / (100 + @EENH_BTW_H)) * 100
WHERE     (ART_BTW = N'H')

UPDATE    NG39ART
SET              ART_PR = (ART_PR / (100 + @EENH_BTW_L)) * 100
WHERE     (ART_BTW = N'L')


-- ---------------------------------------

INSERT INTO NG39USER
SELECT
               USR_ID,
               USR_GROEP,
               USR_NAAM,
               'Y2djWZxBe9M=', --USR_PASSWORD,
               CONVERT(smalldatetime,'01-01-2002'),
               1 -- CONVERT(int,USR_PASS_PER)
FROM STNG39USER


IF ((SELECT COUNT(usr_id) FROM NG39user where usr_id = 'fsb') = 1)

   BEGIN
   PRINT 'FSB user found'
   END
ELSE
   begin
   PRINT 'FSB user NOT found'
   INSERT INTO NG39USER (USR_ID, USR_PASSWORD, USR_PASS_DT, USR_PASS_PER)
             VALUES     (N'FSB', 'i03E6cJh5hmEkqBhqiRl9g==', '1-1-2050', 4)
   END
   
   
IF @@error > 0
BEGIN
   SET @ERR_MESSAGE='Error loading NG39USER'
   GOTO error_section
END

-- ---------------------------------------

INSERT INTO NG39SYST
SELECT * FROM STNG39SYST


IF @@error > 0
BEGIN
   SET @ERR_MESSAGE='Error loading NG39SYST'
   GOTO error_section
END


-- NOW LET'S COUNT THE NO OF RECORDS IN PRODUCTION TABLES AFTER IMPORT.
-- --------------------------------------------------------------------

SELECT @_NG39AFGP_A    = (SELECT COUNT(*) FROM NG39AFGP)
SELECT @_NG39LEV_A     = (SELECT COUNT(*) FROM NG39LEV)
SELECT @_NG39ART_A     = (SELECT COUNT(*) FROM NG39ART)
SELECT @_NG39TOES_A    = (SELECT COUNT(*) FROM NG39TOES)
SELECT @_NG39BUDG_A    = (SELECT COUNT(*) FROM NG39BUDG)
SELECT @_NG39MLT_A     = (SELECT COUNT(*) FROM NG39MLT)
SELECT @_NG39USER_A    = (SELECT COUNT(*) FROM NG39USER)
SELECT @_NG39EENH_A    = (SELECT COUNT(*) FROM NG39EENH)
SELECT @_NG39SYST_A    = (SELECT COUNT(*) FROM NG39SYST)


-- IF WE DO NOT WANT THE U_VERSION DATA, WE CAN DO THE FOLLOWING:

-- UPDATE  NG39AFGP  SET U_VERSION='NULL'   
-- UPDATE  NG39LEV   SET U_VERSION='NULL'     
-- UPDATE  NG39ART   SET U_VERSION='NULL'      
-- UPDATE  NG39TOES  SET U_VERSION='NULL'      
-- UPDATE  NG39BUDG  SET U_VERSION='NULL'      
-- UPDATE  NG39MLT   SET U_VERSION='NULL'      
-- UPDATE  NG39USER  SET U_VERSION='NULL'      
-- UPDATE  NG39EENH  SET U_VERSION='NULL'      


/***************************************************************************/
/*STEP 12. ENABLING FOREIGN KEYS AGAIN.                                    */
/***************************************************************************/

DECLARE @FK_NG39AFGR_NG39AFG      INT     
DECLARE @FK_NG39AFG_NG39AFGP      INT   
DECLARE @FK_NG39AFGR_NG39ART      INT   
DECLARE @FK_NG39ONTR_NG39ART      INT  
DECLARE @FK_NG39DGLB_NG39BUDG     INT  
DECLARE @FK_NG39SCHF_NG39DGLB     INT 
DECLARE @FK_NG39AFG_NG39DGLB      INT   
DECLARE @FK_NG39DSTK_NG39DGLB     INT    
DECLARE @FK_NG39DTOE_NG39DGLB     INT
DECLARE @FK_NG39SCHF_NG39DSTK     INT
DECLARE @FK_NG39ONT_NG39LEV       INT
DECLARE @FK_NG39DSTK_NG39MLT      INT
DECLARE @FK_NG39ONTR_NG39ONT      INT
DECLARE @FK_NG39DTOE_NG39TOES     INT


-- -----------------------------

-- CREATE FKs IN BBV:

-- -----------------------------

ALTER TABLE [dbo].[NG39ONTR] ADD 
CONSTRAINT [FK_NG39ONTR_NG39ART] FOREIGN KEY ([ART_NR]) REFERENCES [dbo].[NG39ART] ([ART_NR])

ALTER TABLE [dbo].[NG39ONTR] ADD CONSTRAINT [FK_NG39ONTR_NG39ONT] FOREIGN KEY ([ONT_BON_NR]) 
REFERENCES [dbo].[NG39ONT] ([ONT_BON_NR])

-- -----------------------------

ALTER TABLE [dbo].[NG39AFG] ADD 
CONSTRAINT [FK_NG39AFG_NG39AFGP] FOREIGN KEY ([AFG_PNT_NR]) REFERENCES [dbo].[NG39AFGP] ([AFG_PNT_NR])

ALTER TABLE [dbo].[NG39AFG] ADD 
CONSTRAINT [FK_NG39AFG_NG39DGLB] FOREIGN KEY ([DGLB_STER_DT]) REFERENCES [dbo].[NG39DGLB] ([DGLB_STER_DT])

-- -----------------------------

ALTER TABLE [dbo].[NG39AFGR] ADD 
CONSTRAINT [FK_NG39AFGR_NG39AFG] FOREIGN KEY ([AFG_PNT_NR],[AFG_BON_NR]) 
REFERENCES [dbo].[NG39AFG] ([AFG_PNT_NR],[AFG_BON_NR])

ALTER TABLE [dbo].[NG39AFGR] ADD 
CONSTRAINT [FK_NG39AFGR_NG39ART] FOREIGN KEY ([ART_NR]) REFERENCES [dbo].[NG39ART] ([ART_NR])

-- -----------------------------

ALTER TABLE [dbo].[NG39DGLB] ADD 
CONSTRAINT [FK_NG39DGLB_NG39BUDG] FOREIGN KEY ([BUD_PER_NR]) REFERENCES [dbo].[NG39BUDG] ([BUD_PER_NR])

-- -----------------------------

ALTER TABLE [dbo].[NG39DSTK] ADD 
CONSTRAINT [FK_NG39DSTK_NG39DGLB] FOREIGN KEY ([DGLB_STER_DT]) REFERENCES [dbo].[NG39DGLB] ([DGLB_STER_DT])

ALTER TABLE [dbo].[NG39DSTK] ADD 
CONSTRAINT [FK_NG39DSTK_NG39MLT] FOREIGN KEY ([MLT_NR]) REFERENCES [dbo].[NG39MLT] ([MLT_NR])

-- -----------------------------

ALTER TABLE [dbo].[NG39DTOE] 
ADD CONSTRAINT [FK_NG39DTOE_NG39DGLB] FOREIGN KEY ([DGLB_STER_DT]) REFERENCES [dbo].[NG39DGLB] ([DGLB_STER_DT])

ALTER TABLE [dbo].[NG39DTOE] ADD
CONSTRAINT [FK_NG39DTOE_NG39TOES] FOREIGN KEY ([TOES_NR]) REFERENCES [dbo].[NG39TOES] ([TOES_NR])

-- -----------------------------

ALTER TABLE [dbo].[NG39ONT] ADD 
CONSTRAINT [FK_NG39ONT_NG39LEV] FOREIGN KEY ([LEV_NR]) REFERENCES [dbo].[NG39LEV] ([LEV_NR])

-- -----------------------------

ALTER TABLE [dbo].[NG39SCHF] ADD 
CONSTRAINT [FK_NG39SCHF_NG39DGLB] FOREIGN KEY ([DGLB_STER_DT]) REFERENCES [dbo].[NG39DGLB] ([DGLB_STER_DT])

ALTER TABLE [dbo].[NG39SCHF] ADD 
CONSTRAINT [FK_NG39SCHF_NG39DSTK] FOREIGN KEY ([DGLB_STER_DT],[MLT_NR]) 
REFERENCES [dbo].[NG39DSTK] ([DGLB_STER_DT],[MLT_NR])

-- -----------------------------


/****************************************************************************/
/*STEP 13. REPORT OF THE CONVERSION.                                        */
/****************************************************************************/


SELECT @DIFF_NG39AFGP   = (SELECT COUNT(*) FROM NG39AFGP)-(SELECT COUNT(*) FROM STNG39AFGP)
SELECT @DIFF_NG39LEV    = (SELECT COUNT(*) FROM NG39LEV)-(SELECT COUNT(*) FROM STNG39LEV)
SELECT @DIFF_NG39ART    = (SELECT COUNT(*) FROM NG39ART)-(SELECT COUNT(*) FROM STNG39ART)
SELECT @DIFF_NG39TOES   = (SELECT COUNT(*) FROM NG39TOES)-(SELECT COUNT(*) FROM STNG39TOES)
SELECT @DIFF_NG39BUDG   = (SELECT COUNT(*) FROM NG39BUDG)-(SELECT COUNT(*) FROM STNG39BUDG)
SELECT @DIFF_NG39MLT    = (SELECT COUNT(*) FROM NG39MLT)-(SELECT COUNT(*) FROM STNG39MLT)
SELECT @DIFF_NG39USER   = (SELECT COUNT(*) FROM NG39USER)-(SELECT COUNT(*) FROM STNG39USER)
SELECT @DIFF_NG39EENH   = (SELECT COUNT(*) FROM NG39EENH)-(SELECT COUNT(*) FROM STNG39EENH)
SELECT @DIFF_NG39SYST   = (SELECT COUNT(*) FROM NG39SYST)-(SELECT COUNT(*) FROM STNG39SYST)


PRINT '************* REPORT OF CONVERSION *****************'
PRINT '  '
PRINT '  '
PRINT 'NO OF RECORDS IN PRODUCTION TABLES BEFORE IMPORT            :'
PRINT '-------------------------------------------------------------'
PRINT 'NO OF RECORDS IN NG39AFGP     :'+ convert(varchar(10),@_NG39AFGP_B)
PRINT 'NO OF RECORDS IN NG39LEV      :'+ convert(varchar(10),@_NG39LEV_B)
PRINT 'NO OF RECORDS IN NG39ART      :'+ convert(varchar(10),@_NG39ART_B)
PRINT 'NO OF RECORDS IN NG39TOES     :'+ convert(varchar(10),@_NG39TOES_B)
PRINT 'NO OF RECORDS IN NG39BUDG     :'+ convert(varchar(10),@_NG39BUDG_B)
PRINT 'NO OF RECORDS IN NG39MLT      :'+ convert(varchar(10),@_NG39MLT_B)
PRINT 'NO OF RECORDS IN NG39USER     :'+ convert(varchar(10),@_NG39USER_B)
PRINT 'NO OF RECORDS IN NG39EENH     :'+ convert(varchar(10),@_NG39EENH_B)
PRINT 'NO OF RECORDS IN NG39SYST     :'+ convert(varchar(10),@_NG39SYST_B)


PRINT '  '
PRINT 'NO OF RECORDS IN PRODUCTION TABLES AFTER IMPORT            :'
PRINT '-------------------------------------------------------------'
PRINT 'NO OF RECORDS IN NG39AFGP     :'+ convert(varchar(10),@_NG39AFGP_A)
PRINT 'NO OF RECORDS IN NG39LEV      :'+ convert(varchar(10),@_NG39LEV_A)
PRINT 'NO OF RECORDS IN NG39ART      :'+ convert(varchar(10),@_NG39ART_A)
PRINT 'NO OF RECORDS IN NG39TOES     :'+ convert(varchar(10),@_NG39TOES_A)
PRINT 'NO OF RECORDS IN NG39BUDG     :'+ convert(varchar(10),@_NG39BUDG_A)
PRINT 'NO OF RECORDS IN NG39MLT      :'+ convert(varchar(10),@_NG39MLT_A)
PRINT 'NO OF RECORDS IN NG39USER     :'+ convert(varchar(10),@_NG39USER_A)
PRINT 'NO OF RECORDS IN NG39EENH     :'+ convert(varchar(10),@_NG39EENH_A)
PRINT 'NO OF RECORDS IN NG39SYST     :'+ convert(varchar(10),@_NG39SYST_A)


PRINT '  '
PRINT 'NO OF RECORDS IN PRODUCTION TABLES AFTER IMPORT COMPARED TO STAGING TABLES:'
PRINT '---------------------------------------------------------------------------'
PRINT 'DIFF RECORDS NG39AFGP     :'+ convert(varchar(10),@DIFF_NG39AFGP)
PRINT 'DIFF RECORDS NG39LEV      :'+ convert(varchar(10),@DIFF_NG39LEV)
PRINT 'DIFF RECORDS NG39ART      :'+ convert(varchar(10),@DIFF_NG39ART)
PRINT 'DIFF RECORDS NG39TOES     :'+ convert(varchar(10),@DIFF_NG39TOES)
PRINT 'DIFF RECORDS NG39BUDG     :'+ convert(varchar(10),@DIFF_NG39BUDG)
PRINT 'DIFF RECORDS NG39MLT      :'+ convert(varchar(10),@DIFF_NG39MLT)
PRINT 'DIFF RECORDS NG39USER     :'+ convert(varchar(10),@DIFF_NG39USER)
PRINT 'DIFF RECORDS NG39EENH     :'+ convert(varchar(10),@DIFF_NG39EENH)
PRINT 'DIFF RECORDS NG39SYST     :'+ convert(varchar(10),@DIFF_NG39SYST)


/***************************************************************************/
/*STEP 14. DROP THE STAGING TABLES.                                        */
/***************************************************************************/


DECLARE cur1 CURSOR FOR
SELECT name FROM sysobjects
WHERE name like 'ST%' AND xtype='U'

OPEN cur1
FETCH NEXT FROM cur1 INTO @TEMPTAB

WHILE (@@FETCH_STATUS<>-1)
  BEGIN
    exec ('DROP TABLE '+@TEMPTAB)
    FETCH NEXT FROM cur1 INTO @TEMPTAB
  END
CLOSE cur1
DEALLOCATE cur1

IF @@error > 0
BEGIN
   SET @ERR_MESSAGE='Error: error dropping temporary staging tables.'
   GOTO error_section
END


RETURN

--

error_section:
PRINT @ERR_MESSAGE
RETURN

-- END OF PROCEDURE
GO


#############################################################################################
#############################################################################################
#############################################################################################

ABCDEFGH

==================================================================================
SECTION 18: BIG SECTION: Unix command examples and architecture:
==================================================================================


############################################
SECTION 1. COMMANDS TO RETREIVE SYSTEM INFO:
############################################


==========================
1. HOW TO GET SYSTEM INFO:
==========================


1.1 Short version:
==================

See section 1.2 for more detailed commands and options.

Memory:
-------
AIX:     bootinfo -r
         lsattr -E -l mem0
         lsattr -E -l sys0 -a realmem
         svmon -G
         vmstat -v
         or use a tool as "topas" or "nmon" (these are utilities)

Linux:   cat /proc/meminfo
         /usr/sbin/dmesg | grep "Physical"
         free   (the free command)
HP:      /usr/sam/lbin/getmem
         grep MemTotal /proc/meminfo
         /etc/dmesg | grep -i phys      
         wc -c /dev/mem
         or us a tool as "glance", like entering "glance -m" from prompt (is a utility)
Solaris: /usr/sbin/prtconf | grep "Memory size"
Tru64:   /bin/vmstat -P | grep "Total Physical Memory"


Swap:
-----

AIX:           lsps -a  (or lsps -s)
               pstat -s
               
HP:            /usr/sbin/swapinfo -a
Solaris:       /usr/sbin/swap -l
Linux:         /sbin/swapon -s
               cat /proc/swaps
               cat /proc/meminfo


cpu:
----

HP:       ioscan -kfnC processor		
	  getconf CPU_VERSION		
	  getconf CPU_CHIP_TYPE		
	  model	

AIX:      lparstat (-i)       
          prtconf | grep proc
          pmcycles -m
          lsattr -El procx (x is 0,2, etc..)
          lscfg | grep proc
          pstat -S

Linux:    cat /proc/cpuinfo

Solaris:  psrinfo -v
          prtconf
          psrset -p 
          prtdiag


OS version:
-----------

HP:      uname -a

Linux:   cat /proc/version 
      
Solaris: uname -a
         cat /etc/release   (or other way to view that file, like "more /etc/release")
Tru64:   /usr/sbin/sizer -v

AIX:     oslevel -r
         lslpp -h bos.rte

AIX firmware:
lsmcode -c               display the system firmware level and service processor
lsmcode -r -d scraid0    display the adapter microcode levels for a RAID adapter scraid0
lsmcode -A               display the microcode level for all supported devices
prtconf                  shows many setting including memory, firmware, serial# etc..


  Notes about Power 4 or 5 lpars: 
  -------------------------------

  For AIX: The uname -L command identifies a partition on a system with multiple LPARS. The LPAR id  
  can be useful for writing shell scripts that customize system settings such as IP address or hostname. 

  The output of the command looks like: 

  # uname -L
  1 lpar01 

  The output of uname -L varies by maintenance level. For consistent output across maintenance levels,  
  add a -s flag. For illustrate, the following command assigns the partition number to the variable 
  "lpar_number" and partiton name to "lpar_name". 

  For HP-UX:
  Use commands like "parstatus" or "getconf PARTITION_IDENT" to get npar information.


patches:
--------

AIX:     Is a certain fix (APAR) installed?
         instfix -ik APAR_number
         instfix -a -ivk APAR_number
         
         To determine your platform firmware level, at the command prompt, type:

         lscfg -vp | grep -p Platform

         The last six digits of the ROM level represent the platform firmware date in the format, YYMMDD.


HP:      /usr/sbin/swlist -l patch
         swlist | grep patch
Linux:   rpm -qa
Solaris: showrev -p
         pkginfo -i package_name
Tru64:   /usr/sbin/dupatch -track -type kit


Netcards:
---------

AIX:	 lsdev -Cc adapter
         lsdev -Cc adapter | grep ent
	 lsdev -Cc if
         lsattr -E -l ent1
         ifconfig -a
Solaris: prtconf -D    /    prtconf -pv   /     prtconf | grep "card"
         prtdiag | grep "card"
         svcs -x
         ifconfig -a (up plumb)


Network sniffing:
-----------------

Here are a few short descriptions, and examples, of usefull network trace / dump commands.


-- Solaris: 

snoop command examples:

For example, if we want to observe traffic between systems alpha and beta  we can use the following command: 
# snoop alpha,beta
To enable data captures from the snoop output without losing packets while writing to the screen, send the snoop output to a file. For example:
# snoop -o /tmp/snooper -V 128.50.1.250
To snoop a specific port:
# snoop -o port xxx 


-- AIX:

tcpdump command examples:

# tcpdump port 23
# tcpdump -i en0 
A good way to use tcpdump is to save the network trace to a file with the -w flag and then analyze the trace by using different
filtering options together with the -r flag. The following example show how to run a basic tcpdump network trace, 
saving the output in a file with the -w flag (on a Ethernet network interface):
# tcpdump -w /tmp/tcpdump.en0 -i en0

To limit the number of traced packets, use the -c flag and specify the number, such as in the following example
that traces the first 128 packets (on a token-ring network interface):
# tcpdump -c 128 -w /tmp/tcpdump.tr0 -i tr0

iptrace command examples:

To start the iptrace daemon with the System Resource Controller (SRC),
# startsrc -s iptrace -a "/tmp/nettrace"

To stop the iptrace daemon with SRC enter the following:
# stopsrc -s iptrace

To record packets coming in and going out to any host on every interface, enter the command in the following format:
# iptrace /tmp/nettrace

The recorded packets are received on and sent from the local host. All
packet flow between the local host and all other hosts on any interface is
recorded. The trace information is placed into the /tmp/nettrace file.

To record packets received on an interface from a specific remote host,
enter the command in the following format:
# iptrace - i en0 -p telnet -s airmail /tmp/telnet.trace

The packets to be recorded are received on the en0 interface, from remote
hostairmail, over the telnet port. The trace information is placed into the
/tmp/telnet.trace file.

To record packets coming in and going out from a specific remote host,
enter the command in the following format:
# iptrace -i en0 -s airmail -b /tmp/telnet.trace

The packets to be recorded are received on the en0 interface, from remote
host airmail. The trace information is placed into the /tmp/telnet.trace file.


-- HPUX:

nettl command:

Initialize the tracing/logging facility:
# nettl -start
Logging is enabled for all subsystems as determined by the /etc/nettlgen.conf file. Log messages are sent 
to a log file whose name is determined by adding the suffix .LOG000 to the log file name specified
in the /etc/nettlgen.conf configuration file. 

To stop the tracing facility:
# nettl -stop

Turn on inbound and outbound PDU tracing for the transport and session (OTS/9000) subsystems
and send binary trace messages to file /var/adm/trace.TRC000. 
# nettl -traceon pduin pduout -entity transport session \ 
     -file /var/adm/trace 

Session using nettl and the formatter netfmt:
1. Capture packets
nettl -tn all -e ns_ls_ip -tm 99999 -size 1024 -f some-raw-capture-file

2. Reproduce problem.

3. Turn off trace: nettl -tf -e all

4. Create formatter filter file. Example:
filter tcp_sport 6699
filter tcp_dport 6699

5. Filter the packets:
5.1 "Long" display
netfmt -Nlnc filter-file -f some-raw.capture > formatted.out
5.2 "One-liner" display
netfmt -Nln1Tc filter-file -f some-raw.capture > one-liner.out


-- Restart inetd, nfs:
-- -------------------

Starting and stopping NFS:			
--------------------------
			
On all unixes, a number of daemons should be running in order for NFS to be functional, like for example			
the rpc.* processes, biod, nfsd and others.			
			
Once nfs is running, and in order to actually "share" or "export" your filesystem on your server, so remote clients 			
are able to mount the nfs mount, in most cases you should edit the "/etc/exports" file.			
			
-- AIX:			
The following subsystems are part of the nfs group: nfsd, biod, rpc.lockd, rpc.statd, and rpc.mountd. 			
The nfs subsystem (group) is under control of the "resource controller", so starting and stopping nfs			
is actually easy			
			
# startsrc -g nfs			
# stopsrc -g nfs			
			
Or use smitty.			
			
-- Redhat Linux:			
# /sbin/service nfs restart			
# /sbin/service nfs start			
# /sbin/service nfs stop			
			
-- On some other Linux distros			
# /etc/init.d/nfs start 			
# /etc/init.d/nfs stop			
# /etc/init.d/nfs restart			
			
-- Solaris:			
If the nfs daemons aren't running, then you will need to run:			
# /etc/init.d/nfs.server start 			
			
-- HP-UX:			
Issue the following command on the NFS server to start all the necessary NFS processes (HP): 			
# /sbin/init.d/nfs.server start 			
 			
Or if your machine is only a client:			
# cd /sbin/init.d			
# ./nfs.client start			
			
			
Restart or refresh inetd after you have edited "inetd.conf":			
------------------------------------------------------------
			
After you have edited "/etc/inetd.conf", for example, to enable or disable some service,			
you need to restart, or refresh inetd, to read the new configuration information.			
To let inetd to reread the configfile:			
			
-- AIX:			
# refresh -s inetd			
			
-- HPUX:			
# /usr/sbin/inetd -c 			
			
-- Solaris:			
# /etc/init.d/inetd stop			
# /etc/init.d/inetd start			
# pkill -HUP inetd		# The command will restart the inetd and reread the configuration.	
			
-- RedHat / Linux			
# service xinetd restart			
or			
# /etc/init.d/inetd restart			


1.2 More Detail:
================

1.2.1 Show memory in Solaris:
=============================

prtconf:
--------
Use this command to obtain detailed system information about your Sun Solaris installation
# /usr/sbin/prtconf

# prtconf -v 
Displays the size of the system memory and reports information about peripheral devices 

Use this command to see the amount of memory:
# /usr/sbin/prtconf | grep "Mem" 

sysdef -i reports on several system resource limits. Other parameters can be checked on a running system 
using adb -k :

# adb -k /dev/ksyms /dev/mem
parameter-name/D
^D (to exit) 


1.2.2 Show memory in AIX: 
=========================

>> Show Total memory:
--------=====--------

# bootinfo -r
# lsattr -El sys0 -a realmem 
# prtconf   (you can grep it on memory)


>> Show Details of memory:
--------------------------

You can have a more detailed and comprehensive look at AIX memory by using "vmstat -v" and "vmo -L" or "vmo -a":

For example:

# vmstat -v
               524288 memory pages
               493252 lruable pages
                67384 free pages
                    7 memory pools
               131820 pinned pages
                 80.0 maxpin percentage
                 20.0 minperm percentage
                 80.0 maxperm percentage
                 25.4 numperm percentage
               125727 file pages
                  0.0 compressed percentage
                    0 compressed pages
                 25.4 numclient percentage
                 80.0 maxclient percentage
               125575 client pages
                    0 remote pageouts scheduled
                14557 pending disk I/Os blocked with no pbuf
              6526890 paging space I/Os blocked with no psbuf
                18631 filesystem I/Os blocked with no fsbuf
                    0 client filesystem I/Os blocked with no fsbuf
                49038 external pager filesystem I/Os blocked with no fsbuf
                    0 Virtualized Partition Memory Page Faults
                 0.00 Time resolving virtualized partition memory page faults


The vmo command really gives lots of output. In the following example only a small fraction of the output is shown:

# vmo -L

..
lrubucket                 128K   128K   128K   64K           4KB pages         D
--------------------------------------------------------------------------------
maxclient%                80     80     80     1      100    % memory          D
     maxperm%
     minperm%
--------------------------------------------------------------------------------
maxfree                   1088   1088   1088   8      200K   4KB pages         D
     minfree
     memory_frames
--------------------------------------------------------------------------------
maxperm                   394596        394596                                 S
--------------------------------------------------------------------------------
maxperm%                  80     80     80     1      100    % memory          D
     minperm%
     maxclient%
--------------------------------------------------------------------------------
maxpin                    424179        424179                                 S
..
..


>> To further look at your virtual memory and its causes, you can use a combination of: 
---------------------------------------------------------------------------------------
  
# ipcs -bm               (shared memory) 
# lsps -a                (paging) 
# vmo -a  or vmo -L      (virtual memory options) 
# svmon -G               (basic memory allocations) 
# svmon -U               (virtual memory usage by user)
# svmon -P
# vmstat -v 

     To print out the memory usage statistics for the users root and steve
     taking into account only working segments, type:

     svmon -U root steve -w

     To print out the top 10 users of the paging space, type:

     svmon -U -g -t 10

     To print out the memory usage statistics for the user steve, including the
     list of the process identifiers, type:

     svmon -U steve -l
     svmon -U emcdm -l

# vmo -o npswarn=value
# schedo -o pacefork=15

Note: sysdumpdev -e
Although the sysdumpdev command is used to show or alter the dumpdevice for a system dump,
you can also use it to show how much real memory is used.

The command
# sysdumpdev -e
provides an estimated dump size taking into account the current memory (not pagingspace) currently 
in use by the system.

Note: the rmss command:

The rmss (Reduced-Memory System Simulator) command is used to ascertain the effects of reducing the amount 
of available memory on a system without the need to physically remove memory from the system. It is useful 
for system sizing, as you can install more memory than is required and then use rmss to reduce it. 
Using other performance tools, the effects of the reduced memory can be monitored. The rmss command has 
the ability to run a command multiple times using different simulated memory sizes and produce statistics 
for all of those memory sizes.

The rmss command resides in /usr/bin and is part of the bos.perf.tools fileset, which is installable 
from the AIX base installation media.

Syntax rmss -p -c <MB> -r 
Options 
  -p  Print the current value 
  -c MB Change to M size (in Mbytes) 
  -r  Restore all memory to use 
  -p  Print the current value 

Example: find out how much memory you have online
rmss -p  
Example: Change available memory to 256 Mbytes
rmss -c 256  
Example: Undo the above 
rmss -r 

Warning:

rmss can damage performance very seriously 
Don't go below 25% of the machines memory 
Never forget to finish with rmss -r 

The pstat command:
------------------

The pstat command, which displays many system tables such as a process table, inode table, or processor status table, 
The pstat command interprets the contents of the various system tables and writes it to standard output. 

Use the pstat command from the AIX 5.2 command prompt. See the command reference for details and examples, 
or use the syntax summary in the table below. 

Flags 
-a		Displays entries in the process table  
-A		Displays all entries in the kernel thread table  
-f		Displays the file table  
-i		Displays the i-node table and the i-node data block addresses 
-p		Displays the process table 
-P		Displays runnable kernel thread table entries only 
-s		Displays information about the swap or paging space usage 
-S		Displays the status of the processors 
-t		Displays the tty structures 
-u ProcSlot	Displays the user structure of the process in the designated slot of the process table. An error message is generated if you attempt to display a swapped out process. 
-T		Displays the system variables. These variables are briefly described in var.h 
-U ThreadSlot	Displays the user structure of the kernel thread in the designated slot of the kernel thread table. An error message is generated if you attempt to display a swapped out kernel thread. 


Note: How to get a "reaonable" view on memory consumption of a process in UNIX:
-------------------------------------------------------------------------------

With using just the command line, or some free utils.


In general not so easy to answer, because of the "sub components" you might distinguish
in memory occupation. For example, do you mean RSS, real, shared, virtual, paging, including all libraries loaded, etc..?

-- Some people like to use the ps command with some special flags, like
   ps -vg
   ps auxw   # or  ps auxw | sort -r +3 |head -10 (top users)

   But those commands seems not so very satisfactory, and not "complete" in their output.

-- There are some great common utilities like topas, nmon, top etc.., or tools specific to a certain Unix, like SMC for Solaris.
   No bad word on those tools, because they are great. But some people think that they are not satisfactory 
   on the subject of memory consumption of a process (although they show a lot of other interesting information).

-- Some other ways might be:

# procmap pid      (in e.g. AIX)
# pmap -x pid      (in e.g. Solaris)

Those tools also show a "total" memory usage, which is a good indicator.

For example:
   
# pmap -x $$

492328: -ksh
 Address  Kbytes     RSS    Anon  Locked Mode   Mapped File
00010000     192     192       -       - r-x--  ksh
00040000       8       8       8       - rwx--  ksh
00042000      40      40       8       - rwx--    [ heap ]
FF180000     680     680       -       - r-x--  libc.so.1
FF23A000      24      24       -       - rwx--  libc.so.1
FF240000       8       8       8       - rwx--  libc.so.1
FF280000     576     576       -       - r-x--  libnsl.so.1
FF310000      40      40       -       - rwx--  libnsl.so.1
FF31A000      24      16       -       - rwx--  libnsl.so.1
FF350000      16      16       -       - r-x--  libmp.so.2
FF364000       8       8       -       - rwx--  libmp.so.2
FF380000      40      40       -       - r-x--  libsocket.so.1
FF39A000       8       8       -       - rwx--  libsocket.so.1
FF3A0000       8       8       -       - r-x--  libdl.so.1
FF3B0000       8       8       8       - rwx--    [ anon ]
FF3C0000     152     152       -       - r-x--  ld.so.1
FF3F6000       8       8       8       - rwx--  ld.so.1
FFBFC000      16      16       8       - rw---    [ stack ]
-------- ------- ------- ------- -------
total Kb    1856    1848      48       -

This gives you a reasonable idea on memory consumption of a pid.

You can also try:

# svmon -G
# svmon -U
# svmon -P -t 10     (top 10 users)
# svmon -U steve -l  (memory stats for user steve)

But svmon is not available on all unixes.

The following might also be helpfull (not on all unixes):

# ls -l /proc/{pid}/as
# prstat -a -s rss


1.2.3 Show memory in Linux:
===========================

# /usr/sbin/dmesg | grep "Physical:"
# cat /proc/meminfo
# free -m

The ipcs, vmstat, iostat and that type of commands, are ofcourse more or less the same
in Linux as they are in Solaris or AIX.


1.2.4 Show aioservers in AIX:
=============================

# lsattr -El aio0
autoconfig available STATE to be configured at system restart True
fastpath   enable    State of fast path                       True
kprocprio  39        Server PRIORITY                          True
maxreqs    4096      Maximum number of REQUESTS               True
maxservers 10        MAXIMUM number of servers per cpu        True
minservers 1         MINIMUM number of servers                True

# pstat -a | grep -c aios
20

# ps -k | grep aioserver
  331962      -  0:15 aioserver
  352478      -  0:14 aioserver
  450644      -  0:12 aioserver
  454908      -  0:10 aioserver
  565292      -  0:11 aioserver
  569378      -  0:10 aioserver
  581660      -  0:11 aioserver
  585758      -  0:17 aioserver
  589856      -  0:12 aioserver
  593954      -  0:15 aioserver
  598052      -  0:17 aioserver
  602150      -  0:12 aioserver
  606248      -  0:13 aioserver
  827642      -  0:14 aioserver
  991288      -  0:14 aioserver
  995388      -  0:11 aioserver
 1007616      -  0:12 aioserver
 1011766      -  0:13 aioserver
 1028096      -  0:13 aioserver
 1032212      -  0:13 aioserver

What are aioservers in AIX5?:

With IO on filesystems, for example if a database is involved, you may try to tune the number
of aioservers (asynchronous IO)

AIX 5L supports asynchronous I/O (AIO) for database files created both on file system partitions and on raw devices. 
AIO on raw devices is implemented fully into the AIX kernel, and does not require database processes 
to service the AIO requests. When using AIO on file systems, the kernel database processes (aioserver) 
control each request from the time a request is taken off the queue until it completes. The kernel database 
processes are also used with I/O with virtual shared disks (VSDs) and HSDs with FastPath disabled. By default, 
FastPath is enabled. The number of aioserver servers determines the number of AIO requests that can be executed 
in the system concurrently, so it is important to tune the number of aioserver processes when using file systems 
to store Oracle Database data files. 

- Use one of the following commands to set the number of servers. This applies only when using asynchronous I/O 
on file systems rather than raw devices: 

# smit aio 

# chdev -P -l aio0 -a maxservers='128' -a minservers='20' 

- To set asynchronous IO to `Available':
# chdev -l aio0 -P -a autoconfig=available

You need to restart the Server:
# shutdown -Fr


1.2.5 aio on Linux distro's:
============================

On some Linux distro's, Oracle 9i/10g supports asynchronous I/O but it is disabled by default because 
some Linux distributions do not have libaio by default. For Solaris, the following configuration is not required 
- skip down to the section on enabling asynchronous I/O.

On Linux, the Oracle binary needs to be relinked to enable asynchronous I/O. The first thing to do is shutdown 
the Oracle server. After Oracle has shutdown, do the following steps to relink the binary:

su - oracle
cd $ORACLE_HOME/rdbms/lib
make -f ins_rdbms.mk async_on
make -f ins_rdbms.mk ioracle


1.2.6 The ipcs and ipcrm commands:
==================================

The "ipcs" command is really a "listing" command. But if you need to intervene
in memory structures, like for example if you need to "clear" or remove a shared memory segment, 
because a faulty or crashed
application left semaphores, memory identifiers, or queues in place,
you can use to "ipcrm" command to remove those structures.

Example ipcrm command usage:
----------------------------

Suppose an application crashed, but it cannot be started again. The following might help,
if you happened to know which IPC identifier it used.
Suppose the app used 47500 as the IPC key. Calcultate this decimal number to hex
which is, in this example, B98C.

No do the following:

# ipcs -bm | grep B89C

This might give you, for example, the shared memory identifier "50855977".
Now clear the segment: 

# ipcrm -m 50855977

It might also be, that still a semaphore and/or queue is still "left over".
In that case you might also try commands like the following example:

ipcs -q
ipcs -s

# ipcrm -s 2228248    (remove semaphore)
# ipcrm -q 5111883    (remove queue)


Note: in some cases the "slibclean" command can be used to clear unused modules in kernel and library memory.
Just give as root the command:

# slibclean

Other Example:
--------------

If you run the following command to remove a shared memory segment and you get this error:

# ipcrm -m 65537
ipcrm: 0515-020 shmid(65537) was not found.

However, if you run the ipcs command, you still see the segment there:

# ipcs | grep 65537
m 65537 0x00000000 DCrw------- root system

If you look carefully, you will notice the "D" in the forth column. The "D" means:

D If the associated shared memory segment has been removed. It disappears when the last process attached 
to the segment detaches it.

So, to clear the shared memory segment, find the process which is still associated with the segment:

# ps -ef | grep process_owner

where process_owner is the name of the owner using the shared segment 

Now kill the process found from the ps command above

# kill -9 pid

Running another ipcs command will show the shared memory segment no longer exists:

# ipcs | grep 65537 
Example

ipcrm -m 65537 


1.2.7 Show patches, version, systeminfo:
========================================

Solaris:
========

showrev:
--------

#showrev
Displays system summary information.

#showrev -p
Reports which patches are installed 

sysdef and dmesg:
-----------------

The follwing commands also displays configuration information
# sysdef
# dmesg


versions:
---------

==> To check your Solaris version:
# uname -a or uname -m
# cat /etc/release 
# isainfo -v

==> To check your AIX version:

# oslevel
# oslevel -r    tells you which maintenance level you have.

>> To find the known recommended maintenance levels:
# oslevel -rq

>> To find all filesets lower than a certain maintenance level:
# oslevel -rl 5200-06

>> To find all filesets higher than a certain maintenance level:
# oslevel -rg 5200-05

>> To list all known recommended maintenance and technology levels on the system, type:

# oslevel -q -s
Known Service Packs
-------------------
5300-05-04
5300-05-03
5300-05-02
5300-05-01
5300-05-00
5300-04-CSP
5300-04-03
5300-04-02
5300-04-01
5300-03-CSP

>> How can I determine which fileset updates are missing from a particular AIX level?
To determine which fileset updates are missing from 5300-04, for example, run the following command:

# oslevel -rl 5300-04 

>> What SP (Service Pack) is installed on my system?
To see which SP is currently installed on the system, run the oslevel -s command. Sample output for an 
AIX 5L Version 5.3 system, with TL4, and SP2 installed would be:

# oslevel -s
5300-04-02
			 
>> Is a CSP (Concluding Service Pack) installed on my system?
To see if a CSP is currently installed on the system, run the oslevel -s command. 
Sample output for an AIX 5L Version 5.3 system, with TL3, and CSP installed would be:

# oslevel -s
5300-03-CSP
 

==> To check your HP machine:

# model
9000/800/rp7410


: machine info on AIX

How do I find out the Chip type, System name, Node name, Model Number etc.? 

The uname command provides details about your system. uname -p  Displays the chip type of the system. 
For example, powerpc. 

uname -r  Displays the release number of the operating system. 
uname -s  Displays the system name. For example, AIX. 
uname -n  Displays the name of the node.  
uname -a  Displays the system name, nodename,Version, Machine id. 
uname -M  Displays the system model name. For example, IBM, 7046-B50. 
uname -v  Displays the operating system version 
uname -m  Displays the machine ID number of the hardware running the system. 
uname -u  Displays the system ID number.  

Architecture:
-------------

To see if you have a CHRP machine, log into the machine as the root user, and run the following command:

# lscfg | grep Architecture               or use:
# lscfg -pl sysplanar0 | more

The bootinfo -p command also shows the architecture of the pSeries, RS/6000

# bootinfo -p
chrp


1.2.8 Check whether you have a 32 bit or 64 bit version:
========================================================

- Solaris:

# iasinfo -vk

If /usr/bin/isainfo cannot be found, then the OS only 
supports 32-bit process address spaces. (Solaris 7 
was the first version that could run 64-bit binaries 
on certain SPARC-based systems.) 
So a ksh-based test might look something like

if [ -x /usr/bin/isainfo ]; then
bits=`/usr/bin/isainfo -b`
else
bits=32
fi

- AIX:

Command:        /bin/lslpp -l bos.64bit     ...to see if bos.64bit is installed & committed.           
        -or-    /bin/locale64               ...error message if on 32bit machine such as:           
                                               Could not load program /bin/locale64: 
                                               Cannot run a 64-bit program on a 32-bit machine.      

Or use:

# bootinfo -K         displays the current kernel wordsize of "32" or "64"
# bootinfo -y         tells if hardware is 64-bit capable
# bootinfo -p         If it returns the string 32 it is only capable of running the 
                      32-bit kernel. If it returns the string chrp the machine is 
                      capable of running the 64-bit kernel or the 32-bit kernel.
Or use:

# /usr/bin/getconf HARDWARE_BITMODE

This command should return the following output:

64


Note:
-----

  HOW TO CHANGE KERNEL MODE OF IBM AIX 5L (5.1)
  ---------------------------------------------
 
  The AIX 5L has pre-configured kernels. These are listed below for Power 
  processors:

     /usr/lib/boot/unix_up    32 bit uni-processor
     /usr/lib/boot/unix_mp    32 bit multi-processor kernel 
     /usr/lib/boot/unix_64    64 bit multi-processor kernel

  Switching between kernel modes means using different kernels. This is simply
  done by pointing the location that is referenced by the system to these kernels.
  Use symbolic links for this purpose. During boot AIX system runs the kernel
  in the following locations:

     /unix
     /usr/lib/boot/unix

  The base operating system 64-bit runtime fileset is bos.64bit. Installing bos.64bit also installs 
  the /etc/methods/cfg64 file. The /etc/methods/cfg64 file provides the option of enabling or disabling 
  the 64-bit environment via SMIT, which updates the /etc/inittab file with the load64bit line. 
  (Simply adding the load64bit line does not enable the 64-bit environment).

  The command lslpp -l bos.64bit reveals if this fileset is installed. The bos.64bit fileset 
  is on the AIX media; however, installing the bos.64bit fileset does not ensure that you will be able 
  to run 64-bit software. If the bos.64bit fileset is installed on 32-bit hardware, you should be able 
  to compile 64-bit software, but you cannot run 64-bit programs on 32-bit hardware.

  The syscalls64 extension must be loaded in order to run a 64-bit executable. This is done from 
  the load64bit entry in the inittab file. You must load the syscalls64 extension even when running 
  a 64-bit kernel on 64-bit hardware.

  To determine if the 64-bit kernel extension is loaded, at the command line, enter genkex |grep 64.
  Information similar to the following displays: 
  149bf58 a3ec /usr/lib/drivers/syscalls64.ext


  To change the kernel mode follow steps below:

     1. Create symbolic link from /unix and /usr/lib/boot/unix to the location 
        of the desired kernel.
     2. Create boot image.
     3. Reboot AIX.

  Below lists the detailed actions to change kernel mode:

  To change to 32 bit uni-processor mode:

     # ln -sf /usr/lib/boot/unix_up  /unix
     # ln -sf /usr/lib/boot/unix_up  /usr/lib/boot/unix
     # bosboot -ad /dev/ipldevice
     # shutdown -r

  To change to 32 bit multi-processor mode:
  
     # ln -sf /usr/lib/boot/unix_mp  /unix
     # ln -sf /usr/lib/boot/unix_mp  /usr/lib/boot/unix
     # bosboot -ad /dev/ipldevice
     # shutdown -r

  To change to 64 bit multi-processor mode:

     # ln -sf /usr/lib/boot/unix_64  /unix
     # ln -sf /usr/lib/boot/unix_64  /usr/lib/boot/unix
     # bosboot -ad /dev/ipldevice
     # shutdown -r

  IMPORTANT NOTE: If you are changing the kernel mode to 32-bit and you will run 
  9.2 on this server, the following line should be included in /etc/inittab:

     load64bit:2:wait:/etc/methods/cfg64 >/dev/console 2>&1 # Enable 64-bit execs

  This allows 64-bit applications to run on the 32-bit kernel. Note that this 
  line is also mandatory if you are using the 64-bit kernel.


In AIX 5.2, the 32-bit kernel is installed by default. The 64-bit kernel, along with JFS2 
(enhanced journaled file system), can be enabled at installation time.


Checking if other unixes are in 32 or 64 mode:
----------------------------------------------

- Digital UNIX/Tru64:    This OS is only available in 64bit form.    

- HP-UX(Available in 64bit starting with HP-UX 11.0):  
  Command: /bin/getconf KERNEL_BITS    ...returns either 32 or 64   

- SGI:  This OS is only available in 64bit form.  

- The remaining supported UNIX platforms are only available in 32bit form.   


scinstall:
----------

# scinstall -pv 
Displays Sun Cluster software release and package version information 


1.2.9 Info about CPUs:
======================

Solaris:
--------

# psrinfo -v
Shows the number of processors and their status.

# psrinfo -v|grep "Status of processor"|wc -l
Shows number of cpu's

Linux:
------

# cat /proc/cpuinfo
# cat /proc/cpuinfo | grep processor|wc -l

Especially with Linux, the /proc directory contains special "files" that either extract information from 
or send information to the kernel

HP-UX:
------

# ioscan -kfnC processor
# /usr/sbin/ioscan -kf | grep processor
# grep processor /var/adm/syslog/syslog.log
# /usr/contrib/bin/machinfo   (Itanium)

Several ways as,

1. sam -> performance monitor -> processor
2. print_manifest (if ignite-ux installed)
3. machinfo (11.23 HP versions)
4. ioscan -fnC processor
5. echo "processor_count/D" | adb /stand/vmunix /dev/kmem
6. top command to get cpu count

The "getconf" command can give you a lot of interesting info. The parameters are:

          ARG_MAX                _BC_BASE_MAX              BC_DIM_MAX
           BS_SCALE_MAX          BC_STRING_MAX             CHARCLASS_NAME_MAX
           CHAR_BIT              CHAR_MAX                  CHAR_MIN
           CHILD_MAX             CLK_TCK                   COLL_WEIGHTS_MAX
           CPU_CHIP_TYPE         CS_MACHINE_IDENT          CS_PARTITION_IDENT
           CS_PATH               CS_MACHINE_SERIAL         EXPR_NEST_MAX
           HW_CPU_SUPP_BITS      HW_32_64_CAPABLE          INT_MAX
           INT_MIN               KERNEL_BITS               LINE_MAX
           LONG_BIT              LONG_MAX                  LONG_MIN
           MACHINE_IDENT         MACHINE_MODEL             MACHINE_SERIAL
           MB_LEN_MAX            NGROUPS_MAX               NL_ARGMAX
           NL_LANGMAX            NL_MSGMAX                 NL_NMAX
           NL_SETMAX             NL_TEXTMAX                NZERO
           OPEN_MAX              PARTITION_IDENT           PATH
           _POSIX_ARG_MAX        _POSIX_JOB_CONTROL        _POSIX_NGROUPS_MAX
           _POSIX_OPEN_MAX       _POSIX_SAVED_IDS          _POSIX_SSIZE_MAX
           _POSIX_STREAM_MAX     _POSIX_TZNAME_MAX         _POSIX_VERSION
           POSIX_ARG_MAX         POSIX_CHILD_MAX           POSIX_JOB_CONTROL
           POSIX_LINK_MAX        POSIX_MAX_CANON           POSIX_MAX_INPUT
           POSIX_NAME_MAX        POSIX_NGROUPS_MAX         POSIX_OPEN_MAX
           POSIX_PATH_MAX        POSIX_PIPE_BUF            POSIX_SAVED_IDS
           POSIX_SSIZE_MAX       POSIX_STREAM_MAX          POSIX_TZNAME_MAX
           POSIX_VERSION         POSIX2_BC_BASE_MAX        POSIX2_BC_DIM_MAX
           POSIX2_BC_SCALE_MAX   POSIX2_BC_STRING_MAX      POSIX2_C_BIND
           POSIX2_C_DEV          POSIX2_C_VERSION          POSIX2_CHAR_TERM
           POSIX_CHILD_MAX       POSIX2_COLL_WEIGHTS_MAX   POSIX2_EXPR_NEST_MAX
           POSIX2_FORT_DEV       POSIX2_FORT_RUN           POSIX2_LINE_MAX
           POSIX2_LOCALEDEF      POSIX2_RE_DUP_MAX         POSIX2_SW_DEV
           POSIX2_UPE            POSIX2_VERSION            SC_PASS_MAX
           SC_XOPEN_VERSION      SCHAR_MAX                 SCHAR_MIN
           SHRT_MAX              SHRT_MIN                  SSIZE_MAX

Example:

# getconf CPU_VERSION


sample function in shell script:

get_cpu_version() 
{

   case `getconf CPU_VERSION` in
      # ???) echo "Itanium[TM] 2" ;;
      768) echo "Itanium[TM] 1" ;;
      532) echo "PA-RISC 2.0" ;;
      529) echo "PA-RISC 1.2" ;;
      528) echo "PA-RISC 1.1" ;;
      523) echo "PA-RISC 1.0" ;;
        *) return 1 ;;
   esac
   return 0


AIX:
----

# pmcycles -m
Cpu 0 runs at 1656 MHz
Cpu 1 runs at 1656 MHz
Cpu 2 runs at 1656 MHz
Cpu 3 runs at 1656 MHz


# lscfg | grep proc

More cpu information on AIX:

# lsattr -El procx        (where x is the number of the cpu)
type powerPC_POWER5     Processor type     False
frequency 165600000     Processor speed    False
..
..
where False means that the value cannot be changed through an AIX command.


# lparstat              (only for latest AIX versions)
# lparstat -i


To view CPU scheduler tunable parameters, use the schedo command:

# schedo -a

In AIX 5L on Power5, you can switch from Simultaneous Multithreading SMT, or Single Threading ST, as follows
(smtcl)
# smtctl -m off		will set SMT mode to disabled
# smtctl -m on		will set SMT mode to enabled
# smtctl -W boot	makes SMT effective on next boot
# smtctl -W now		effects SMT now, but will not persist across reboots

When you want to keep the setting across reboots, you must use the bosboot command
in order to create a new boot image.


1.2.10 Other stuff:
===================

runlevel:
---------
To show the init runlevel:
# who -r 


Top users:
----------

To get a quick impression about the top 10 users in the system at this time:

ps auxw | sort -r +3 |head -10    -Shows top 10 memory usage by process
ps auxw | sort -r +2 |head -10    -Shows top 10 CPU usage by process


More accuracy in memory usage with the ps command: ps -vg

ps -vg:
-------

Using "ps vg" gives a per process tally of memory usage for each running process. Several fields give memory usage 
in different units, but these numbers do not tell the whole story on where all the memory goes. 

First of all, the man page for ps does not give an accurate description of the memory related fields. 
Here is a better description: 

RSS - This tells how much RAM resident memory is currently being used for the text and data segments 
for a particular process in units of kilobytes. (this value will always be a multiple of 4 since memory is allocated in 4 KB pages). 

%MEM - This is the fraction of RSS divided by the total size of RAM for a particular process. 
Since RSS is some subset of the total resident memory usage for a process, the %MEM value will also be lower than actual. 

TRS - This tells how much RAM resident memory is currently being used for the text segment for a particular process 
in units of kilobytes. This will always be less than or equal to RSS. 

SIZE - This tells how much paging space is allocated for this process for the text and data segments in units 
of kilobytes. If the executable file is on a local filesystem, the page space usage for text is zero. 
If the executable is on an NFS filesystem, the page space usage will be nonzero. This number may be greater 
than RSS, or it may not, depending on how much of the process is paged in. The reason RSS can be larger is that 
RSS counts text whereas SIZE does not. 

TSIZ - This field is absolutely bogus because it is not a multiple of 4 and does not correlate to any of the other fields. 

These fields only report on a process text and data segments. Segment size which cannot be interrogated at this time are: 

Text portion of shared libraries (segment 13)
Files that are in use. Open files are cached in memory as individual segments. 


Shared data segments created with shmat. 
Kernel segments such as kernel segment 0, kernel extension segments, 
and virtual memory management segments. 

In summary, ps is not a very good tool to measure system memory usage. It can give you some idea where some 
of the memory goes, but it leaves too many questions unanswered about the total usage. 


shared memory:
--------------
To check shared memory segment, semaphore array, and message queue limits, issue the ipcs -l command. 
# ipcs

The following tools are available for monitoring the performance of your UNIX-based system. 

pfiles:
-------
/usr/proc/bin/pfiles
This shows the open files for this process, which helps you diagnose whether you are having problems 
caused by files not getting closed.

lsof:
-----

This utility lists open files for running UNIX processes, like pfiles. However, lsof gives more 
useful information than pfiles. You can find lsof at ftp://vic.cc.purdue.edu/pub/tools/unix/lsof/.

Example of lsof usage:

You can see CIO (concurrent IO) in the FILE-FLAG column if you run lsof +fg, e.g.:
 
tarunx01:/home/abielewi:# /p570build/LSOF/lsof-4.76/usr/local/bin/lsof +fg /baanprd/oradat

COMMAND     PID     USER   FD   TYPE              FILE-FLAG DEVICE
SIZE/OFF NODE NAME
oracle   434222   oracle   16u  VREG     R,W,CIO,DSYN,LG;CX   39,1
6701056  866 /baanprd/oradat (/dev/bprdoradat)
oracle   434222   oracle   17u  VREG     R,W,CIO,DSYN,LG;CX   39,1
6701056  867 /baanprd/oradat (/dev/bprdoradat)
oracle   442384   oracle   15u  VREG     R,W,CIO,DSYN,LG;CX   39,1
1174413312  875 /baanprd/oradat (/dev/bprdoradat)
oracle   442384   oracle   16u  VREG     R,W,CIO,DSYN,LG;CX   39,1
734011392  877 /baanprd/oradat (/dev/bprdoradat)
oracle   450814   oracle   15u  VREG     R,W,CIO,DSYN,LG;CX   39,1
1174413312  875 /baanprd/oradat (/dev/bprdoradat)
oracle   450814   oracle   16u  VREG     R,W,CIO,DSYN,LG;CX   39,1
1814044672  876 /baanprd/oradat (/dev/bprdoradat)
oracle   487666   oracle   15u  VREG     R,W,CIO,DSYN,LG;CX   39,1
1174413312  875 /baanprd/oradat (/dev/bprdoradat
 
You should also see O_CIO in your file open calls if you run truss,
e.g.:
 
open("/opt/oracle/rcat/oradat/redo01.log",
O_RDWR|O_CIO|O_DSYNC|O_LARGEFILE) = 18
 

VMSTAT SOLARIS:
---------------
# vmstat 
This command is ideal for monitoring paging rate, which can be found under the page in (pi) and page out (po) columns. 
Other important columns are the amount of allocated virtual storage (avm) and free virtual storage (fre). 
This command is useful for determining if something is suspended or just taking a long time.

Example:

 kthr      memory            page            disk          faults      cpu
 r b w   swap  free  re  mf pi po fr de sr m0 m1 m3 m4   in   sy   cs us sy id
 0 0 0 2163152 1716720 157 141 1179 1 1 0 0 0  0  0  0  680 1737  855 10  3 87
 0 0 0 2119080 1729352 0  1  0  0  0  0  0  0  0  1  0  345  658  346  1  1 98
 0 0 0 2118960 1729232 0 167 0  0  0  0  0  0  0  0  0  402 1710  812  4  2 94
 0 0 0 2112992 1723264 0 1261 0 0  0  0  0  0  0  0  0 1026 5253 1848 10  5 85
 0 0 0 2112088 1722352 0 248 0  0  0  0  0  0  0  0  0  505 2822 1177  5  2 92
 0 0 0 2116288 1726544 4 80  0  0  0  0  0  0  0  0  0  817 4015 1530  6  4 90
 0 0 0 2117744 1727960 4  2 30  0  0  0  0  0  0  0  0  473 1421  640  2  2 97


procs/r: Run queue length. 
procs/b: Processes blocked while waiting for I/O. 
procs/w: Idle processes which have been swapped. 
memory/swap: Free, unreserved swap space (Kb). 
memory/free: Free memory (Kb). (Note that this will grow until it reaches lotsfree, at which point 
            the page scanner is started. See "Paging" for more details.) 
page/re: Pages reclaimed from the free list. (If a page on the free list still contains data needed 
         for a new request, it can be remapped.) 
page/mf: Minor faults (page in memory, but not mapped). (If the page is still in memory, a minor fault 
         remaps the page. It is comparable to the vflts value reported by sar -p.) 
page/pi: Paged in from swap (Kb/s). (When a page is brought back from the swap device, the process 
         will stop execution and wait. This may affect performance.) 
page/po: Paged out to swap (Kb/s). (The page has been written and freed. This can be the result of 
         activity by the pageout scanner, a file close, or fsflush.) 
page/fr: Freed or destroyed (Kb/s). (This column reports the activity of the page scanner.) 
page/de: Freed after writes (Kb/s). (These pages have been freed due to a pageout.) 
page/sr: Scan rate (pages). Note that this number is not reported as a "rate," but as a total number of pages scanned. 
disk/s#: Disk activity for disk # (I/O's per second). 
faults/in: Interrupts (per second). 
faults/sy: System calls (per second). 
faults/cs: Context switches (per second). 
cpu/us: User CPU time (%). 
cpu/sy: Kernel CPU time (%). 
cpu/id: Idle + I/O wait CPU time (%). 

When analyzing vmstat output, there are several metrics to which you should pay attention. For example, 
keep an eye on the CPU run queue column. The run queue should never exceed the number of CPUs on the server. 
If you do notice the run queue exceeding the amount of CPUs, it's a good indication that your server 
has a CPU bottleneck.
To get an idea of the RAM usage on your server, watch the page in (pi) and page out (po) columns 
of vmstat's output. By tracking common virtual memory operations such as page outs, you can infer 
the times that the Oracle database is performing a lot of work. Even though UNIX page ins must correlate 
with the vmstat's refresh rate to accurately predict RAM swapping, plotting page ins can tell you 
when the server is having spikes of RAM usage.

Once captured, it's very easy to take the information about server performance directly from the 
Oracle tables and plot them in a trend graph. Rather than using an expensive statistical package 
such as SAS, you can use Microsoft Excel. Copy and paste the data from the tables into Excel. 
After that, you can use the Chart Wizard to create a line chart that will help you view server 
usage information and discover trends.


# VMSTAT AIX:
-------------

This is virtually equal to the usage of vmstat under solaris.

vmstat can be used to give multiple statistics on the system. For CPU-specific work, try the following command:

# vmstat -t 1 3 

This will take 3 samples, 1 second apart, with timestamps (-t). You can, of course, change the parameters 
as you like. The output is shown below. 

      kthr     memory             page              faults        cpu        time
      ----- ----------- ------------------------ ------------ ----------- --------
       r  b   avm   fre  re  pi  po  fr   sr  cy  in   sy  cs us sy id wa hr mi se
       0  0 45483   221   0   0   0   0    1   0 224  326 362 24  7 69  0 15:10:22
       0  0 45483   220   0   0   0   0    0   0 159   83  53  1  1 98  0 15:10:23
       2  0 45483   220   0   0   0   0    0   0 145  115  46  0  9 90  1 15:10:24


In this output some of the things to watch for are: 

"avm", which is Active Virtual Memory.
Ideally, under normal conditions, the largest avm value should in general be smaller than the amount of RAM.
If avm is smaller than RAM, and still exessive paging occurs, that could be due to RAM being filled
with file pages.

avm x 4K = number of bytes


Columns r (run queue) and b (blocked) start going up, especially above 10. This usually is an indication 
that you have too many processes competing for CPU. 

If cs (contact switches) go very high compared to the number of processes, then you may need to tune 
the system with vmtune. 

In the cpu section, us (user time) indicates the time is being spent in programs. Assuming Java is 
at the top of the list in tprof, then you need to tune the Java application). 

In the cpu section, if sys (system time) is higher than expected, and you still have id (idle) time left, 
this may indicate lock contention. Check the tprof for lock related calls in the kernel time. You may want 
to try multiple instances of the JVM. It may also be possible to find deadlocks in a javacore file. 

In the cpu section, if wa (I/O wait) is high, this may indicate a disk bottleneck, and you should use 
iostat and other tools to look at the disk usage. 

Values in the pi, po (page in/out) columns are non-zero may indicate that you are paging and need more memory. 
It may be possible that you have the stack size set too high for some of your JVM instances. 
It could also mean that you have allocated a heap larger than the amount of memory on the system. Of course, 
you may also have other applications using memory, or that file pages may be taking up too much of the memory


Other example:
--------------

# vmstat 1

System configuration: lcpu=2 mem=3920MB

kthr    memory                page              faults          cpu    
-----  -----------    ------------------------ ------------  -----------
r  b    avm   fre    re  pi  po  fr   sr  cy  in   sy  cs   us sy id wa
0  0  229367 332745   0   0   0   0    0   0   3  198  69    0  0 99  0
0  0  229367 332745   0   0   0   0    0   0   3   33  66    0  0 99  0
0  0  229367 332745   0   0   0   0    0   0   2   33  68    0  0 99  0
0  0  229367 332745   0   0   0   0    0   0  80  306 100    0  1 97  1
0  0  229367 332745   0   0   0   0    0   0   1   20  68    0  0 99  0
0  0  229367 332745   0   0   0   0    0   0   2   36  64    0  0 99  0
0  0  229367 332745   0   0   0   0    0   0   2   33  66    0  0 99  0
0  0  229367 332745   0   0   0   0    0   0   2   21  66    0  0 99  0
0  0  229367 332745   0   0   0   0    0   0   1  237  64    0  0 99  0
0  0  229367 332745   0   0   0   0    0   0   2   19  66    0  0 99  0
0  0  229367 332745   0   0   0   0    0   0   6   37  76    0  0 99  0
 

The most important fields to look at here are:

r -- The average number of runnable kernel threads over whatever sampling interval you have chosen. 
b -- The average number of kernel threads that are in the virtual memory waiting queue over your sampling interval. r should always be higher than b; if it is not, it usually means you have a CPU bottleneck. 
fre -- The size of your memory free list. Do not worry so much if the amount is really small. More importantly, determine if there is any paging going on if this amount is small. 
pi -- Pages paged in from paging space. 
po -- Pages paged out to paging space. 
CPU section:
us 
sy 
id 
wa 

Let's look at the last section, which also comes up in most other CPU monitoring tools, albeit with different headings:

us -- user time 
sy -- system time 
id -- idle time 
wa -- waiting on I/O 


# IOSTAT:
---------
This command is useful for monitoring I/O activities. You can use the read and write rate to estimate the 
amount of time required for certain SQL operations (if they are the only activity on the system). 
This command is also useful for determining if something is suspended or just taking a long time. 

Basic synctax is iostat  <options>   interval  count

option - let you specify the device for which information is needed like disk , 
         cpu or terminal. (-d , -c , -t  or -tdc ) .  x options gives the extended statistics .

interval -  is time period in seconds between two samples . iostat  4  will give data at each 4 seconds interval.

count  - is the number of times the data is needed .  iostat 4 5 will give data at 4 seconds interval 5 times.

Example:

$ iostat -xtc 5 2
                          extended disk statistics       tty         cpu
     disk r/s  w/s Kr/s Kw/s wait actv svc_t  %w  %b  tin tout us sy wt id
     sd0   2.6 3.0 20.7 22.7 0.1  0.2  59.2   6   19   0   84  3  85 11 0
     sd1   4.2 1.0 33.5  8.0 0.0  0.2  47.2   2   23
     sd2   0.0 0.0  0.0  0.0 0.0  0.0   0.0   0    0
     sd3  10.2 1.6 51.4 12.8 0.1  0.3  31.2   3   31
 
disk    name of the disk
r/s     reads per second
w/s     writes per second
Kr/s    kilobytes read per second
Kw/s    kilobytes written per second
wait    average number of transactions waiting for service (Q length)
actv    average number of transactions  actively  
        being serviced (removed  from  the queue but not yet completed)
%w      percent of time there are transactions  waiting for service (queue non-empty)
%b      percent of time the disk is busy  (transactions in progress)

The values to look from the iostat output  are:

Reads/writes  per second (r/s , w/s) 
Percentage busy (%b) 
Service time (svc_t) 
If a disk shows consistently high reads/writes along with , the percentage busy (%b) of the disks 
is greater than 5 percent, and the average service time  (svc_t) is greater than 30 milliseconds, 
then action needs to be taken.


# netstat 
This command lets you know the network traffic on each node, and the number of error packets encountered. 
It is useful for isolating network problems. 

Example:

To find out all listening services, you can use the command

# netstat -a -f inet


1.2.11 Some other utilities for Solaris:
========================================

# top
For example:

load averages:  0.66,  0.54,  0.56   11:14:48
187 processes: 185 sleeping, 2 on cpu
CPU states:     % idle,     % user,     % kernel,     % iowait,     % swap
Memory: 4096M real, 1984M free, 1902M swap in use, 2038M swap free

  PID USERNAME THR PRI NICE  SIZE   RES STATE   TIME    CPU COMMAND
 2795 oraclown   1  59    0  265M  226M sleep   0:13  4.38% oracle
 2294 root      11  59    0 8616K 7672K sleep  10:54  3.94% bpbkar
13907 oraclown  11  59    0  271M  218M cpu2    4:02  2.23% oracle
14138 oraclown  12  59    0  270M  230M sleep   9:03  1.76% oracle
 2797 oraclown   1  59    0  189M  151M sleep   0:01  0.96% oracle
 2787 oraclown  11  59    0  191M  153M sleep   0:06  0.69% oracle
 2799 oraclown   1  59    0  190M  151M sleep   0:02  0.45% oracle
 2743 oraclown  11  59    0  191M  155M sleep   0:25  0.35% oracle
 2011 oraclown  11  59    0  191M  149M sleep   2:50  0.27% oracle
 2007 oraclown  11  59    0  191M  149M sleep   2:22  0.26% oracle
 2009 oraclown  11  59    0  191M  149M sleep   1:54  0.20% oracle
 2804 oraclown   1  51    0 1760K 1296K cpu2    0:00  0.19% top
 2013 oraclown  11  59    0  191M  148M sleep   0:36  0.14% oracle
 2035 oraclown  11  59    0  191M  149M sleep   2:44  0.13% oracle
  114 root      10  59    0 5016K 4176K sleep  23:34  0.05% picld

Process ID
This column shows the process ID (pid) of each process. The process ID is a positive number, 
usually less than 65536. It is used for identification during the life of the process. 
Once a process has exited or been killed, the process ID can be reused. 

Username
This column shows the name of the user who owns the process. The kernel stores this information 
as a uid, and top uses an appropriate table (/etc/passwd, NIS, or NIS+) to translate this uid in to a name. 

Threads
This column displays the number of threads for the current process. This column is present only 
in the Solaris 2 port of top.
For Solaris, this number is actually the number of lightweight processes (lwps) created by the 
threads package to handle the threads. Depending on current resource utilization, there may not 
be one lwp for every thread. Thus this number is actually less than or equal to the total number 
of threads created by the process. 

Nice
This column reflects the "nice" setting of each process. A process's nice is inhereted from its parent. 
Most user processes run at a nice of 0, indicating normal priority. Users have the option of starting 
a process with a positive nice value to allow the system to reduce the priority given to that process. 
This is normally done for long-running cpu-bound jobs to keep them from interfering with 
interactive processes. The Unix command "nice" controls setting this value. Only root can set 
a nice value lower than the current value. Nice values can be negative. On most systems they range from -20 to 20.
The nice value influences the priority value calculated by the Unix scheduler. 

Size
This column shows the total amount of memory allocated by each process. This is virtual memory 
and is the sum total of the process's text area (program space), data area, and dynamically 
allocated area (or "break"). When a process allocates additional memory with the system call "brk", 
this value will increase. This is done indirectly by the C library function "malloc". 
The number in this column does not reflect the amount of physical memory currently in use by the process. 

Resident Memory
This column reflects the amount of physical memory currently allocated to each process. 
This is also known as the "resident set size" or RSS. A process can have a large amount 
of virtual memory allocated (as indicated by the SIZE column) but still be using very little physical memory. 

Process State
This column reflects the last observed state of each process. State names vary from system to system. 
These states are analagous to those that appear in the process states line: the second line of the display. 
The more common state names are listed below.
cpu   - Assigned to a CPU and currently running 
run   - Currently able to run 
sleep - Awaiting an external event, such as input from a device 
stop  - Stopped by a signal, as with control Z 
swap  - Virtual address space swapped out to disk 
zomb  - Exited, but parent has not called "wait" to receive the exit status 

CPU Time
This column displayes the accumulated CPU time for each process. This is the amount of time 
that any cpu in the system has spent actually running this process. The standard format shows 
two digits indicating minutes, a colon, then two digits indicating seconds. 
For example, the display "15:32" indicates fifteen minutes and thirty-two seconds. 
When a time value is greater than or equal to 1000 minutes, it is displayed as hours with the suffix H. 
For example, the display "127.4H" indicates 127 hours plus four tenths of an hour (24 minutes). 
When the number of hours exceeds 999.9, the "H" suffix is dropped so that the display 
continues to fit in the column. 

CPU Percentage
This column shows the percentage of the cpu that each process is currently consuming. 
By default, top will sort this column of the output.
Some versions of Unix will track cpu percentages in the kernel, as the figure is used in the calculation 
of a process's priority. On those versions, top will use the figure as calculated by the kernel. 
Other versions of Unix do not perform this calculation, and top must determine the percentage explicity 
by monitoring the changes in cpu time.
On most multiprocessor machines, the number displayed in this column is a percentage of the total 
available cpu capacity. Therefore, a single threaded process running on a four processor system will never 
use more than 25% of the available cpu cycles. 

Command
This column displays the name of the executable image that each process is running. 
In most cases this is the base name of the file that was invoked with the most recent kernel "exec" call. 
On most systems, this name is maintained separately from the zeroth argument. A program that changes 
its zeroth argument will not affect the output of this column. 


# modinfo
The modinfo command provides information about the modules currently loaded by the kernel.

The /etc/system  file:
Available for Solaris Operating Environment, the /etc/system file contains definitions for kernel configuration limits 
such as the maximum number of users allowed on the system at a time, the maximum number of processes per user, 
and the inter-process communication (IPC) limits on size and number of resources. These limits are important because 
they affect DB2 performance on a Solaris Operating Environment machine. See the Quick Beginnings information 
for further details. 

# more /etc/path_to_inst
To see the mapping between the kernel abbreviated instance name for physical device names,
view the /etc/path_to_inst file.

# uptime
uptime - show how long the system has been up

/export/home/oraclown>uptime
 11:32am  up  4:19,  1 user,  load average: 0.40, 1.17, 0.90


1.2.12 proc toos for Solaris:
=============================

The proc tools are called that way, because the retreive information fromn the /proc virtual filesystem
They are:

/usr/proc/bin/pflags  [-r] pid...
/usr/proc/bin/pcred   pid...
/usr/proc/bin/pmap    [-rxlF] pid...
/usr/proc/bin/pldd    [-F] pid...
/usr/proc/bin/psig    pid...
/usr/proc/bin/pstack  [-F] pid...
/usr/proc/bin/pfiles  [-F] pid...
/usr/proc/bin/pwdx    [-F] pid...
/usr/proc/bin/pstop   pid...
/usr/proc/bin/prun    pid...
/usr/proc/bin/pwait   [-v] pid...
/usr/proc/bin/ptree   [-a] [[pid| user]...]
/usr/proc/bin/ptime   command [arg...]
/usr/proc/bin/pattr   [-x ] [pid...]
/usr/proc/bin/pclear  [pid...]
/usr/proc/bin/plabel  [pid...]
/usr/proc/bin/ppriv   [-a] [pid...]


-- pfiles:
reports all the files which are opened by a given pid

-- pldd 
lists all the dynamic libraries linked to the process

-- pwdx 
gives the directory from which the process is running

-- ptree
The ptree utility prints the process trees containing the specified pids or users, with child processes 
indented from their respective parent processes. An argument of all digits is taken to be a process-ID, 
otherwise it is assumed to be a user login name. The default is all processes.


Use it like 


# ptree <PID>


Or use it with params, which enables you to produce different listings

The following example prints the process tree (including children of process 0) for processes which match the command name ssh: 

$ ptree -a `pgrep ssh`
        1     /sbin/init
          100909 /usr/lib/ssh/sshd
            569150 /usr/lib/ssh/sshd
              569157 /usr/lib/ssh/sshd
                569159 -ksh
                  569171 bash
                    569173 /bin/ksh
                      569193 bash 

  ----------------------------------------------------------------------
  Remark: many Linux distros adopted the ptree command, as the "pstree" command.
  As in

  ubuntu$ pstree -pl
  init(1)---NetworkManager(5427)
          +-NetworkManagerD(5441)
          +-acpid(5210)
          +-apache2(6966)---apache2(2890)
          �               +-apache2(2893)
          �               +-apache2(7163)
          �               +-apache2(7165)
          �               +-apache2(7166)
          �               +-apache2(7167)
          �               +-apache2(7168)
          +-atd(6369)
          +-avahi-daemon(5658)---avahi-daemon(5659)
          +-bonobo-activati(7816)---{bonobo-activati}(7817)
         etc..
         ..

  ------------------------------------------------------------------------

Back to Solaris again:

Suppose you did a pfiles on an Apache process:

# pfiles 13789

13789: /apps11i/erpdev/10GAS/Apache/Apache/bin/httpd -d /apps11i/erpdev/10G
Current rlimit: 1024 file descriptors
0: S_IFIFO mode:0000 dev:350,0 ino:114723 uid:65060 gid:54032 size:301
O_RDWR
1: S_IFREG mode:0640 dev:307,28001 ino:612208 uid:65060 gid:54032 size:386
O_WRONLY|O_APPEND|O_CREAT
/apps11i/erpdev/10GAS/opmn/logs/HTTP_Server~1
2: S_IFIFO mode:0000 dev:350,0 ino:143956 uid:65060 gid:54032 size:0
O_RDWR
3: S_IFREG mode:0600 dev:307,28001 ino:606387 uid:65060 gid:54032 size:1056768
O_RDWR|O_CREAT
/apps11i/erpdev/10GAS/Apache/Apache/logs/mm.19389.mem
4: S_IFREG mode:0600 dev:307,28001 ino:606383 uid:65060 gid:54032 size:0
O_RDWR|O_CREAT
5: S_IFREG mode:0600 dev:307,28001 ino:621827 uid:65060 gid:54032 size:1056768
O_RDWR|O_CREAT
6: S_IFDOOR mode:0444 dev:351,0 ino:58 uid:0 gid:0 size:0
O_RDONLY|O_LARGEFILE FD_CLOEXEC door to nscd[421]
/var/run/name_service_door
7: S_IFIFO mode:0000 dev:350,0 ino:143956 uid:65060 gid:54032 size:0
O_RDWR
8: S_IFCHR mode:0666 dev:342,0 ino:47185924 uid:0 gid:3 rdev:90,0
O_RDONLY
/devices/pseudo/kstat@0:kstat
etc..
..
..
O_RDWR|O_CREAT
/apps11i/erpdev/10GAS/Apache/Apache/logs/dms_metrics.19389.shm.sem
21: S_IFREG mode:0600 dev:307,28001 ino:603445 uid:65060 gid:54032 size:17408
O_RDONLY FD_CLOEXEC
/apps11i/erpdev/10GAS/rdbms/mesg/ocius.msb
23: S_IFSOCK mode:0666 dev:348,0 ino:60339 uid:0 gid:0 size:0
O_RDWR
SOCK_STREAM
SO_SNDBUF(49152),SO_RCVBUF(49152),IP_NEXTHOP(0.0.192.0)
sockname: AF_INET 3.56.189.4 port: 45395
peername: AF_INET 3.56.189.4 port: 12501
256: S_IFREG mode:0444 dev:85,0 ino:234504 uid:0 gid:3 size:1616
O_RDONLY|O_LARGEFILE
/etc/inet/hosts


Suppose you tried pldd on the same process gave this result:

# pldd 13789

13789: /apps11i/erp
dev/10GAS/Apache/Apache/bin/httpd -d /apps11i/erpdev/10G
/apps11i/erpdev/10GAS/lib32/libdms2.so
/lib/libpthread.so.1
/lib/libsocket.so.1
/lib/libnsl.so.1
/lib/libdl.so.1
/lib/libc.so.1
/platform/sun4u-us3/lib/libc_psr.so.1
/lib/libmd5.so.1
/platform/sun4u/lib/libmd5_psr.so.1
/lib/libscf.so.1
/lib/libdoor.so.1
/lib/libuutil.so.1
/lib/libgen.so.1
/lib/libmp.so.2
/lib/libm.so.2
/lib/libresolv.so.2
/apps11i/erpdev/10GAS/Apache/Apache/libexec/mod_onsint.so
/lib/librt.so.1
/apps11i/erpdev/10GAS/lib32/libons.so
/lib/libkstat.so.1
/lib/libaio.so.1
/apps11i/erpdev/10GAS/Apache/Apache/libexec/mod_mmap_static.so
/apps11i/erpdev/10GAS/Apache/Apache/libexec/mod_vhost_alias.so
/apps11i/erpdev/10GAS/Apache/Apache/libexec/mod_env.so
..
..
etc

/usr/lib/libsched.so.1
/apps11i/erpdev/10GAS/lib32/libclntsh.so.10.1
/apps11i/erpdev/10GAS/lib32/libnnz10.so
/apps11i/erpdev/10GAS/Apache/Apache/libexec/mod_wchandshake.so
/apps11i/erpdev/10GAS/Apache/Apache/libexec/mod_oc4j.so
/apps11i/erpdev/10GAS/Apache/Apache/libexec/mod_dms.so
/apps11i/erpdev/10GAS/Apache/Apache/libexec/mod_rewrite.so
/apps11i/erpdev/10GAS/Apache/oradav/lib/mod_oradav.so
/apps11i/erpdev/10GAS/Apache/modplsql/bin/modplsql.so 


# pmap -x $$

492328: -ksh
 Address  Kbytes     RSS    Anon  Locked Mode   Mapped File
00010000     192     192       -       - r-x--  ksh
00040000       8       8       8       - rwx--  ksh
00042000      40      40       8       - rwx--    [ heap ]
FF180000     680     680       -       - r-x--  libc.so.1
FF23A000      24      24       -       - rwx--  libc.so.1
FF240000       8       8       8       - rwx--  libc.so.1
FF280000     576     576       -       - r-x--  libnsl.so.1
FF310000      40      40       -       - rwx--  libnsl.so.1
FF31A000      24      16       -       - rwx--  libnsl.so.1
FF350000      16      16       -       - r-x--  libmp.so.2
FF364000       8       8       -       - rwx--  libmp.so.2
FF380000      40      40       -       - r-x--  libsocket.so.1
FF39A000       8       8       -       - rwx--  libsocket.so.1
FF3A0000       8       8       -       - r-x--  libdl.so.1
FF3B0000       8       8       8       - rwx--    [ anon ]
FF3C0000     152     152       -       - r-x--  ld.so.1
FF3F6000       8       8       8       - rwx--  ld.so.1
FFBFC000      16      16       8       - rw---    [ stack ]
-------- ------- ------- ------- -------
total Kb    1856    1848      48       -


1.2.13 Wellknown tools for AIX:
===============================

1. commands:
------------

CPU		Memory Subsystem	I/O Subsystem		Network Subsystem
---------------------------------------------------------------------------------
vmstat		vmstat			iostat			netstat
iostat		lsps			vmstat			ifconfig
ps		svmon			lsps			tcpdump
sar		filemon			filemon
tprof		ipcs			lvmstat

nmon and topas can be used to monitor those subsystems in general.

2. topas:
---------

topas is a useful graphical interface that will give you immediate results of what is going on in the system. 
When you run it without any command-line arguments, the screen looks like this: 


Topas Monitor for host:    aix4prt              EVENTS/QUEUES    FILE/TTY
Mon Apr 16 16:16:50 2001   Interval:  2         Cswitch    5984  Readch     4864
                                                Syscall   15776  Writech   34280
Kernel   63.1   |##################          |  Reads         8  Rawin         0
User     36.8   |##########                  |  Writes     2469  Ttyout        0
Wait      0.0   |                            |  Forks         0  Igets         0
Idle      0.0   |                            |  Execs         0  Namei         4
                                                Runqueue   11.5  Dirblk        0
Network  KBPS   I-Pack  O-Pack   KB-In  KB-Out  Waitqueue   0.0
lo0     213.9   2154.2  2153.7   107.0   106.9
tr0      34.7     16.9    34.4     0.9    33.8  PAGING           MEMORY
                                                Faults     3862  Real,MB    1023
Disk    Busy%     KBPS     TPS KB-Read KB-Writ  Steals     1580  % Comp     27.0
hdisk0    0.0      0.0     0.0     0.0     0.0  PgspIn        0  % Noncomp  73.9
                                                PgspOut       0  % Client    0.5
Name         PID CPU% PgSp Owner                PageIn        0
java       16684 83.6 35.1 root                 PageOut       0  PAGING SPACE
java       12192 12.7 86.2 root                 Sios          0  Size,MB     512
lrud        1032  2.7  0.0 root                                  % Used      1.2
aixterm    19502  0.5  0.7 root                 NFS (calls/sec)  % Free     98.7
topas       6908  0.5  0.8 root                 ServerV2       0
ksh        18148  0.0  0.7 root                 ClientV2       0   Press:
gil         1806  0.0  0.0 root                 ServerV3       0   "h" for help
 

The information on the bottom left side shows the most active processes; here, java is consuming 83.6% of CPU. 
The middle right area shows the total physical memory (1 GB in this case) and Paging space (512 MB), 
as well as the amount being used. So you get an excellent overview of what the system is doing 
in a single screen, and then you can select the areas to concentrate based on the information being shown here.


Note: about waits:
------------------

Don't get caught up in this whole wait i/o thing. a single cpu system 
with 1 i/o outstanding and no other runable threads (i.e. idle) will 
have 100% wait i/o. There was a big discussion a couple of years ago on 
removing the kernel tick as it has confused many many many techs. 

So, if you have only 1 or few cpu, then you are going to have high wait i.o 
figures, it does not neccessarily mean your disk subsystem is slow. 


3. trace:
---------

trace captures a sequential flow of time-stamped system events. The trace is a valuable tool for observing 
system and application execution. While many of the other tools provide high level statistics such as 
CPU and I/O utilization, the trace facility helps expand the information as to where the events happened, 
which process is responsible, when the events took place, and how they are affecting the system. 
Two post processing tools that can extract information from the trace are utld (in AIX 4) and curt 
(in AIX 5). These provide statistics on CPU utilization and process/thread activity. The third post 
processing tool is splat which stands for Simple Performance Lock Analysis Tool. This tool is used to analyze 
lock activity in the AIX kernel and kernel extension for simple locks.

4. nmon:
--------

nmon is a free software tool that gives much of the same information as topas, but saves the information 
to a file in Lotus 123 and Excel format. The download site is 
http://www.ibm.com/developerworks/eserver/articles/analyze_aix/. 
The information that is collected included CPU, disk, network, adapter statistics, kernel counters, 
memory and the "top" process information. 

5. tprof:
---------

tprof is one of the AIX legacy tools that provides a detailed profile of CPU usage for every 
AIX process ID and name. It has been completely rewritten for AIX 5.2, and the example below uses 
the AIX 5.1 syntax. You should refer to AIX 5.2 Performance Tools update: Part 3 for the new syntax. 

The simplest way to invoke this command is to use:  

# tprof -kse -x "sleep 10" 
# tprof -ske -x "sleep 30"


At the end of ten seconds, or 30 seconds, a new file __prof.all, or sleep.prof, is generated that contains 
information about what commands are using CPU on the system. Searching for FREQ, the information looks something 
like the example below:


              Process   FREQ  Total Kernel   User Shared  Other
              =======    ===  ===== ======   ==== ======  =====
               oracle    244  10635   3515   6897    223      0
                 java    247   3970    617      0   2062   1291
                 wait     16   1515   1515      0      0      0
    ...
              =======    ===  ===== ======   ==== ======  =====
                Total   1060  19577   7947   7252   3087   1291

 
This example shows that over half the CPU time is associated with the oracle application and that Java 
is using about 3970/19577 or 1/5 of the CPU. The wait usually means idle time, but can also include 
the I/O wait portion of the CPU usage.


svmon:
------

The svmon command captures a snapshot of the current state om memory.
use it with the -G switch to get global statistics for the whole system.

svmon is the most useful tool at your disposal when monitoring a Java process, especially native heap. 
The article "When segments collide" gives examples of how to use svmon -P <pid> -m to monitor the 
native heap of a Java process on AIX. But there is another variation, svmon -P <pid> -m -r, that is very 
effective in identifying native heap fragmentation. The -r switch prints the address range in use, so it gives 
a more accurate view of how much of each segment is in use. 
As an example, look at the partially edited output below: 

   Pid Command          Inuse      Pin     Pgsp  Virtual 64-bit Mthrd LPage
   10556 java            681613     2316     2461   501080      N     Y     N

    Vsid      Esid Type Description              LPage  Inuse   Pin Pgsp Virtual
   22ac4         9 mmap mapped to sid b1475          -      0     0    -     - 
   21047         8 mmap mapped to sid 30fe5          -      0     0    -     - 
   126a2         a mmap mapped to sid 91072          -      0     0    -     - 
   7908c         7 mmap mapped to sid 6bced          -      0     0    -     - 
   b2ad6         b mmap mapped to sid b1035          -      0     0    -     - 
   b1475         - work                              -  65536     0  282 65536 
   30fe5         - work                              -  65536     0  285 65536 
   91072         - work                              -  65536     0   54 65536 
   6bced         - work                              -  65536     0  261 65536 
   b1035         - work                              -  45054     0    0 45054 
                   Addr Range: 0..45055
   e0f9f         5 work shmat/mmap                   -  48284     0    3 48284 
   19100         3 work shmat/mmap                   -  46997     0  463 47210 
   c965a         4 work shmat/mmap                   -  46835     0  281 46953 
   7910c         6 work shmat/mmap                   -  37070     0    0 37070 
                   Addr Range: 0..50453
   e801d         d work shared library text          -   9172     0    0  9220 
                   Addr Range: 0..30861
   a0fb7         f work shared library data          -    105     0    1   106 
                   Addr Range: 0..2521
   21127         2 work process private              -     50     2    1    51 
                   Addr Range: 65300..65535
   a8535         1 pers code,/dev/q109waslv:81938    -     11     0    -     - 
                   Addr Range: 0..11


Other example:

# svmon -G -i 2 5    # sample five times at two second intervals

memory                 in use                     pin         pg space
size  inuse free pin   work  pers   clnt   work   pers  clnt  size  inuse
16384 16250 134  2006  10675 2939   2636   2006   0     0     40960  12674
16384 16250 134  2006  10675 2939   2636   2006   0     0     40960  12674
16384 16250 134  2006  10675 2939   2636   2006   0     0     40960  12674
16384 16250 134  2006  10675 2939   2636   2006   0     0     40960  12674
16384 16250 134  2006  10675 2939   2636   2006   0     0     40960  12674

In this example, there are 16384 pages of total size of memory. Multuply this number by 4096
to see the total real memory size. In this case the total memory is 64 MB.


filemon:
--------

filemon can be used to identify the files that are being used most actively. This tool gives a very 
comprehensive view of file access, and can be useful for drilling down once vmstat/iostat confirm disk 
to be a bottleneck.

Example:

# filemon -o /tmp/filemon.log; sleep 60; trcstop

The generated log file is quite large. Some sections that may be useful are:

Most Active Files
    ------------------------------------------------------------------------
      #MBs  #opns   #rds   #wrs  file                 volume:inode
    ------------------------------------------------------------------------

      25.7     83   6589      0  unix                 /dev/hd2:147514
      16.3      1   4175      0  vxe102               /dev/mailv1:581
      16.3      1      0   4173  .vxe102.pop          /dev/poboxv:62
      15.8      1      1   4044  tst1                 /dev/mailt1:904
       8.3   2117   2327      0  passwd               /dev/hd4:8205
       3.2    182    810      1  services             /dev/hd4:8652
    ...
    ------------------------------------------------------------------------
    Detailed File Stats
    ------------------------------------------------------------------------

    FILE: /var/spool/mail/v/vxe102  volume: /dev/mailv1 (/var/spool2/mail/v)  inode: 581
    opens:                  1
    total bytes xfrd:       17100800
    reads:                  4175    (0 errs)
      read sizes (bytes):   avg  4096.0 min    4096 max    4096 sdev     0.0
      read times (msec):    avg   0.543 min   0.011 max  78.060 sdev   2.753
    ...

curt:
-----

curt Command
Purpose
The CPU Utilization Reporting Tool (curt) command converts an AIX trace file into a number of statistics related 
to CPU utilization and either process, thread or pthread activity. These statistics ease the tracking of 
specific application activity. curt works with both uniprocessor and multiprocessor AIX Version 4 and AIX Version 5 
traces.

Syntax
curt -i inputfile [-o outputfile] [-n gennamesfile] [-m trcnmfile] [-a pidnamefile] [-f timestamp] 
                  [-l timestamp] [-ehpstP]

Description
The curt command takes an AIX trace file as input and produces a number of statistics related to 
processor (CPU) utilization and process/thread/pthread activity. It will work with both uniprocessor and 
multiprocessor AIX traces if the processor clocks are properly synchronized.


1.2.14 Not so well known tools for AIX: the proc tools:
=======================================================


--proctree 
Displays the process tree containing the specified process IDs or users. To display the ancestors 
and all the children of process 12312, enter: 

# proctree 21166
11238    /usr/sbin/srcmstr
  21166    /usr/sbin/rsct/bin/IBM.AuditRMd 


To display the ancestors and children of process 21166, including children of process 0, enter: 

#proctree -a 21166 
1    /etc/init
   11238    /usr/sbin/srcmstr
      21166    /usr/sbin/rsct/bin/IBM.AuditRMd 


-- procstack 
Displays the hexadecimal addresses and symbolic names for each of the stack frames of the current thread 
in processes. To display the current stack of process 15052, enter: 

# procstack 15052
15052 : /usr/sbin/snmpd
d025ab80  select   (?, ?, ?, ?, ?) + 90
100015f4  main   (?, ?, ?) + 1814
10000128  __start   () + 8c
 
Currently, procstack displays garbage or wrong information for the top stack frame, and possibly for the 
second top stack frame. Sometimes it will erroneously display "No frames found on the stack," and sometimes 
it will display: deadbeef ???????? (?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ...) The fix for this problem had not 
been released at the writing of this article. When the fix becomes available, you need to download the 
APAR IY48543 for 5.2. For AIX 5.3 it all should work OK.

-- procmap 
Displays a process address map. To display the address space of process 13204, enter: 

# procmap  13204 
13204 : /usr/sbin/biod 6
10000000	  3K	read/exec	biod
20000910	  0K	read/write	biod
d0083100	 79K	read/exec	/usr/lib/libiconv.a
20013bf0	 41K	read/write	/usr/lib/libiconv.a
d007a100	 34K	read/exec	/usr/lib/libi18n.a
20011378	  4K	read/write	/usr/lib/libi18n.a
d0074000	 11K	read/exec	/usr/lib/nls/loc/en_US
d0077130	  8K	read/write	/usr/lib/nls/loc/en_US
d00730f8	  2K	read/exec	/usr/lib/libcrypt.a
f03c7508	  0K	read/write	/usr/lib/libcrypt.a
d01d4e20    1997K	read/exec 	/usr/lib/libc.a
f0337e90     570K	read/write	/usr/lib/libc.a 


-- procldd 
Displays a list of libraries loaded by a process. To display the list of dynamic libraries loaded by 
process 11928, enter 

# procldd 11928. T 
 11928 : -sh
 /usr/lib/nls/loc/en_US
 /usr/lib/libcrypt.a
 /usr/lib/libc.a 


-- procflags 
Displays a process tracing flags, and the pending and holding signals. To display the tracing flags of 
process 28138, enter: 

# procflags 28138
28138 : /usr/sbin/rsct/bin/IBM.HostRMd
data model = _ILP32 flags = PR_FORK
/64763: flags = PR_ASLEEP | PR_NOREGS
/66315: flags = PR_ASLEEP | PR_NOREGS
/60641: flags = PR_ASLEEP | PR_NOREGS
/66827: flags = PR_ASLEEP | PR_NOREGS
/7515: flags = PR_ASLEEP | PR_NOREGS
/70439: flags = PR_ASLEEP | PR_NOREGS
/66061: flags = PR_ASLEEP | PR_NOREGS
/69149: flags = PR_ASLEEP | PR_NOREGS 


-- procsig 
Lists the signal actions for a process. To list all the signal actions defined for process 30552, enter: 

# procsig 30552
30552 : -ksh
HUP caught
INT caught
QUIT caught
ILL caught
TRAP caught
ABRT caught
EMT caught
FPE caught
KILL default RESTART BUS caught 


-- proccred 
Prints a process' credentials. To display the credentials of process 25632, enter: 

# proccred  25632
25632: e/r/suid=0  e/r/sgid=0 


-- procfiles 
Prints a list of open file descriptors. To display status and control information on the file descriptors 
opened by process 20138, enter: 

# procfiles -n 20138
20138 : /usr/sbin/rsct/bin/IBM.CSMAgentRMd
  Current rlimit: 2147483647 file descriptors
   0: S_IFCHR mode:00 dev:10,4 ino:4178 uid:0 gid:0 rdev:2,2
      O_RDWR  name:/dev/null
   2: S_IFREG mode:0311 dev:10,6 ino:250 uid:0 gid:0 rdev:0,0
      O_RDWR size:0   name:/var/ct/IBM.CSMAgentRM.stderr
   4: S_IFREG mode:0200 dev:10,6 ino:255 uid:0 gid:0 rdev:0,0 


-- procwdx 
Prints the current working directory for a process. To display the current working directory 
of process 11928, enter: 

# procwdx 11928
11928 :  /home/guest 


-- procstop 
Stops a process. To stop process 7500 on the PR_REQUESTED event, enter:

# procstop 7500 . 

-- procrun 
Restart a process. To restart process 30192 that was stopped on the PR_REQUESTED event, enter:

# procrun 30192 . 

-- procwait 
Waits for all of the specified processes to terminate. To wait for process 12942 to exit and display 
the status, enter 

# procwait -v 12942 .  
12942 : terminated, exit status 0 


1.2.15 Other monitoring:
========================


Nagios: open source Monitoring for most unix systems:
-----------------------------------------------------

Nagios is an open source host, service and network monitoring program. 

Latest versions: 2.5 (stable) 

Overview 
 
Nagios is a host and service monitor designed to inform you of network problems before your clients, 
end-users or managers do. It has been designed to run under the Linux operating system, but works fine 
under most *NIX variants as well. The monitoring daemon runs intermittent checks on hosts and services you specify 
using external "plugins" which return status information to Nagios. When problems are encountered, 
the daemon can send notifications out to administrative contacts in a variety of different ways 
(email, instant message, SMS, etc.). Current status information, historical logs, and reports can all 
be accessed via a web browser. 
 
System Requirements 

The only requirement of running Nagios is a machine running Linux (or UNIX variant) and a C compiler. 
You will probably also want to have TCP/IP configured, as most service checks will be performed over the network. 

You are not required to use the CGIs included with Nagios. However, if you do decide to use them, 
you will need to have the following software installed... 


- A web server (preferrably Apache) 
- Thomas Boutell's gd library version 1.6.3 or higher (required by the statusmap and trends CGIs) 


rstat: Monitoring Machine Utilization with rstat:
-------------------------------------------------

rstat stands for Remote System Statistics service

Ports exist for most unixes, like Linux, Solaris, AIX etc..

-- rstat on Linux, Solaris:

rstat is an RPC client program to get and print statistics from any machine running the rpc.rstatd daemon, 
its server-side counterpart. The rpc.rstad daemon has been used for many years by tools such as Sun's perfmeter 
and the rup command. The rstat program is simply a new client for an old daemon. The fact that the rpc.rstatd daemon 
is already installed and running on most Solaris and Linux machines is a huge advantage over other tools 
that require the installation of custom agents. 

The rstat client compiles and runs on Solaris and Linux as well and can get statistics from any machine running 
a current rpc.rstatd daemon, such as Solaris, Linux, AIX, and OpenBSD. The rpc.rstatd daemon is started 
from /etc/inetd.conf on Solaris. It is similar to vmstat, but has some advantages over vmstat:

You can get statistics without logging in to the remote machine, including over the Internet. 

It includes a timestamp. 

The output can be plotted directly by gnuplot. 

The fact that it runs remotely means that you can use a single central machine to monitor the performance 
of many remote machines. It also has a disadvantage in that it does not give the useful scan rate measurement 
of memory shortage, the sr column in vmstat. rstat will not work across most firewalls because it relies on 
port 111, the RPC port, which is usually blocked by firewalls.

To use rstat, simply give it the name or IP address of the machine you wish to monitor. Remember that rpc.rstatd 
must be running on that machine. The rup command is extremely useful here because with no arguments, 
it simply prints out a list of all machines on the local network that are running the rstatd demon. 
If a machine is not listed, you may have to start rstatd manually. 

To start rpc.rstatd under Red Hat Linux, run 

# /etc/rc.d/init.d/rstatd start     as root. 

On Solaris, first try running the rstat client because inetd is often already configured to automatically 
start rpc.rstatd on request. If it the client fails with the error "RPC: Program not registered," 
make sure you have this line in your /etc/inet/inetd.conf and kill -HUP your inetd process to get it to 
re-read inetd.conf, as follows:

rstatd/2-4 tli rpc/datagram_v wait root /usr/lib/netsvc/rstat/rpc.rstatd rpc.rstatd

Then you can monitor that machine like this: 

% rstat enkidu 
2001 07 10 10 36 08  0   0   0 100    0    27   54     1     0    0   12  0.1 

This command will give you a one-second average and then it will exit. If you want to continuously monitor, 
give an interval in seconds on the command line. Here's an example of one line of output every two seconds: 

% rstat enkidu 2 
2001 07 10 10 36 28  0   0   1  98    0     0    7     2     0    0   61  0.0 
2001 07 10 10 36 30  0   0   0 100    0     0    0     2     0    0   15  0.0 
2001 07 10 10 36 32  0   0   0 100    0     0    0     2     0    0   15  0.0 
2001 07 10 10 36 34  0   0   0 100    0     5   10     2     0    0   19  0.0 
2001 07 10 10 36 36  0   0   0 100    0     0   46     2     0    0  108  0.0 
^C 

To get a usage message, the output format, the version number, and where to go for updates, just type rstat 
with no parameters:

% rstat
usage: rstat machine [interval]
output:
yyyy mm dd hh mm ss usr wio sys idl pgin pgout intr ipkts opkts coll  cs load
docs and src at http://patrick.net/software/rstat/rstat.html

Notice that the column headings line up with the output data.


-- AIX:

In order to get rstat working on AIX, you may need to configure rstatd.

As root 

1. Edit /etc/inetd.conf
Uncomment or add entry for rstatd
Eg
rstatd sunrpc_udp udp wait root /usr/sbin/rpc.rstatd rstatd 100001 1-3

2. Edit /etc/services
Uncomment or add entry for rstatd
Eg
rstatd 100001/udp

3. Refresh services
refresh -s inetd

4. Start rstatd
/usr/sbin/rpc.rstatd


1.2.16 UNIX ERROR CODES:
========================

It's always "handy" to have a list of errcodes from the errno.h headerfile.
It should be reasonable the same accross the unix versions.

Actually, this is only a very small list of errors and code. 
It is ONLY associated with the interaction of a process with the system. 

For example, the errors can be seen at boottime of a system, or what an 
error logging daemon might write in a logfile, is a very different story.


from the errno.h file:


>>> Errcodes Linux (generic):


#define EPERM            1      /* Operation not permitted */
#define ENOENT           2      /* No such file or directory */
#define ESRCH            3      /* No such process */
#define EINTR            4      /* Interrupted system call */
#define EIO              5      /* I/O error */
#define ENXIO            6      /* No such device or address */
#define E2BIG            7      /* Arg list too long */
#define ENOEXEC          8      /* Exec format error */
#define EBADF            9      /* Bad file number */
#define ECHILD          10      /* No child processes */
#define EAGAIN          11      /* Try again */
#define ENOMEM          12      /* Out of memory */
#define EACCES          13      /* Permission denied */
#define EFAULT          14      /* Bad address */
#define ENOTBLK         15      /* Block device required */
#define EBUSY           16      /* Device or resource busy */
#define EEXIST          17      /* File exists */
#define EXDEV           18      /* Cross-device link */
#define ENODEV          19      /* No such device */
#define ENOTDIR         20      /* Not a directory */
#define EISDIR          21      /* Is a directory */
#define EINVAL          22      /* Invalid argument */
#define ENFILE          23      /* File table overflow */
#define EMFILE          24      /* Too many open files */
#define ENOTTY          25      /* Not a typewriter */
#define ETXTBSY         26      /* Text file busy */
#define EFBIG           27      /* File too large */
#define ENOSPC          28      /* No space left on device */
#define ESPIPE          29      /* Illegal seek */
#define EROFS           30      /* Read-only file system */
#define EMLINK          31      /* Too many links */
#define EPIPE           32      /* Broken pipe */
#define EDOM            33      /* Math argument out of domain of func */
#define ERANGE          34      /* Math result not representable */
#define EDEADLK         35      /* Resource deadlock would occur */
#define ENAMETOOLONG    36      /* File name too long */
#define ENOLCK          37      /* No record locks available */
#define ENOSYS          38      /* Function not implemented */
#define ENOTEMPTY       39      /* Directory not empty */
#define ELOOP           40      /* Too many symbolic links encountered */
#define EWOULDBLOCK     EAGAIN  /* Operation would block */
#define ENOMSG          42      /* No message of desired type */
#define EIDRM           43      /* Identifier removed */
#define ECHRNG          44      /* Channel number out of range */
#define EL2NSYNC        45      /* Level 2 not synchronized */
#define EL3HLT          46      /* Level 3 halted */
#define EL3RST          47      /* Level 3 reset */
#define ELNRNG          48      /* Link number out of range */
#define EUNATCH         49      /* Protocol driver not attached */
#define ENOCSI          50      /* No CSI structure available */
#define EL2HLT          51      /* Level 2 halted */
#define EBADE           52      /* Invalid exchange */
#define EBADR           53      /* Invalid request descriptor */
#define EXFULL          54      /* Exchange full */
#define ENOANO          55      /* No anode */
#define EBADRQC         56      /* Invalid request code */
#define EBADSLT         57      /* Invalid slot */
#define EDEADLOCK       EDEADLK
#define EBFONT          59      /* Bad font file format */
#define ENOSTR          60      /* Device not a stream */
#define ENODATA         61      /* No data available */
#define ETIME           62      /* Timer expired */
#define ENOSR           63      /* Out of streams resources */
#define ENONET          64      /* Machine is not on the network */
#define ENOPKG          65      /* Package not installed */
#define EREMOTE         66      /* Object is remote */
#define ENOLINK         67      /* Link has been severed */
#define EADV            68      /* Advertise error */
#define ESRMNT          69      /* Srmount error */
#define ECOMM           70      /* Communication error on send */
#define EPROTO          71      /* Protocol error */
#define EMULTIHOP       72      /* Multihop attempted */
#define EDOTDOT         73      /* RFS specific error */
#define EBADMSG         74      /* Not a data message */
#define EOVERFLOW       75      /* Value too large for defined data type */
#define ENOTUNIQ        76      /* Name not unique on network */
#define EBADFD          77      /* File descriptor in bad state */
#define EREMCHG         78      /* Remote address changed */
#define ELIBACC         79      /* Can not access a needed shared library */
#define ELIBBAD         80      /* Accessing a corrupted shared library */
#define ELIBSCN         81      /* .lib section in a.out corrupted */
#define ELIBMAX         82      /* Attempting to link in too many shared libraries */
#define ELIBEXEC        83      /* Cannot exec a shared library directly */
#define EILSEQ          84      /* Illegal byte sequence */
#define ERESTART        85      /* Interrupted system call should be restarted */
#define ESTRPIPE        86      /* Streams pipe error */
#define EUSERS          87      /* Too many users */
#define ENOTSOCK        88      /* Socket operation on non-socket */
#define EDESTADDRREQ    89      /* Destination address required */
#define EMSGSIZE        90      /* Message too long */
#define EPROTOTYPE      91      /* Protocol wrong type for socket */
#define ENOPROTOOPT     92      /* Protocol not available */
#define EPROTONOSUPPORT 93      /* Protocol not supported */
#define ESOCKTNOSUPPORT 94      /* Socket type not supported */
#define EOPNOTSUPP      95      /* Operation not supported on transport endpoint */
#define EPFNOSUPPORT    96      /* Protocol family not supported */
#define EAFNOSUPPORT    97      /* Address family not supported by protocol */
#define EADDRINUSE      98      /* Address already in use */
#define EADDRNOTAVAIL   99      /* Cannot assign requested address */
#define ENETDOWN        100     /* Network is down */
#define ENETUNREACH     101     /* Network is unreachable */
#define ENETRESET       102     /* Network dropped connection because of reset */
#define ECONNABORTED    103     /* Software caused connection abort */
#define ECONNRESET      104     /* Connection reset by peer */
#define ENOBUFS         105     /* No buffer space available */
#define EISCONN         106     /* Transport endpoint is already connected */
#define ENOTCONN        107     /* Transport endpoint is not connected */
#define ESHUTDOWN       108     /* Cannot send after transport endpoint shutdown */
#define ETOOMANYREFS    109     /* Too many references: cannot splice */
#define ETIMEDOUT       110     /* Connection timed out */
#define ECONNREFUSED    111     /* Connection refused */
#define EHOSTDOWN       112     /* Host is down */
#define EHOSTUNREACH    113     /* No route to host */
#define EALREADY        114     /* Operation already in progress */
#define EINPROGRESS     115     /* Operation now in progress */
#define ESTALE          116     /* Stale NFS file handle */
#define EUCLEAN         117     /* Structure needs cleaning */
#define ENOTNAM         118     /* Not a XENIX named type file */
#define ENAVAIL         119     /* No XENIX semaphores available */
#define EISNAM          120     /* Is a named type file */
#define EREMOTEIO       121     /* Remote I/O error */
#define EDQUOT          122     /* Quota exceeded */
#define ENOMEDIUM       123     /* No medium found */
#define EMEDIUMTYPE     124     /* Wrong medium type */


The list above should actually be enough, but we shall list the same for AIX:


>>> errcodes AIX:


#define EPERM   1       /* Operation not permitted              */
#define ENOENT  2       /* No such file or directory            */
#define ESRCH   3       /* No such process                      */
#define EINTR   4       /* interrupted system call              */
#define EIO     5       /* I/O error                            */
#define ENXIO   6       /* No such device or address            */
#define E2BIG   7       /* Arg list too long                    */
#define ENOEXEC 8       /* Exec format error                    */
#define EBADF   9       /* Bad file descriptor                  */
#define ECHILD  10      /* No child processes                   */
#define EAGAIN  11      /* Resource temporarily unavailable     */
#define ENOMEM  12      /* Not enough space                     */
#define EACCES  13      /* Permission denied                    */
#define EFAULT  14      /* Bad address                          */
#define ENOTBLK 15      /* Block device required                */
#define EBUSY   16      /* Resource busy                        */
#define EEXIST  17      /* File exists                          */
#define EXDEV   18      /* Improper link                        */
#define ENODEV  19      /* No such device                       */
#define ENOTDIR 20      /* Not a directory                      */
#define EISDIR  21      /* Is a directory                       */
#define EINVAL  22      /* Invalid argument                     */
#define ENFILE  23      /* Too many open files in system        */
#define EMFILE  24      /* Too many open files                  */
#define ENOTTY  25      /* Inappropriate I/O control operation  */
#define ETXTBSY 26      /* Text file busy                       */
#define EFBIG   27      /* File too large                       */
#define ENOSPC  28      /* No space left on device              */
#define ESPIPE  29      /* Invalid seek                         */
#define EROFS   30      /* Read only file system                */
#define EMLINK  31      /* Too many links                       */
#define EPIPE   32      /* Broken pipe                          */
#define EDOM    33      /* Domain error within math function    */
#define ERANGE  34      /* Result too large                     */
#define ENOMSG  35      /* No message of desired type           */
#define EIDRM   36      /* Identifier removed                   */
#define ECHRNG  37      /* Channel number out of range          */
#define EL2NSYNC 38     /* Level 2 not synchronized             */
#define EL3HLT  39      /* Level 3 halted                       */
#define EL3RST  40      /* Level 3 reset                        */
#define ELNRNG  41      /* Link number out of range             */
#define EUNATCH 42      /* Protocol driver not attached         */
#define ENOCSI  43      /* No CSI structure available           */
#define EL2HLT  44      /* Level 2 halted                       */
#define EDEADLK 45      /* Resource deadlock avoided            */
#define ENOTREADY       46      /* Device not ready             */
#define EWRPROTECT      47      /* Write-protected media        */
#define EFORMAT         48      /* Unformatted media            */
#define ENOLCK          49      /* No locks available           */
#define ENOCONNECT      50      /* no connection                */
#define ESTALE          52      /* no filesystem                */
#define EDIST           53      /* old, currently unused AIX errno*/
#define EINPROGRESS     55      /* Operation now in progress */
#define EALREADY        56      /* Operation already in progress */
#define ENOTSOCK        57      /* Socket operation on non-socket */
#define EDESTADDRREQ    58      /* Destination address required */
#define EDESTADDREQ     EDESTADDRREQ /* Destination address required */
#define EMSGSIZE        59      /* Message too long */
#define EPROTOTYPE      60      /* Protocol wrong type for socket */
#define ENOPROTOOPT     61      /* Protocol not available */
#define EPROTONOSUPPORT 62      /* Protocol not supported */
#define ESOCKTNOSUPPORT 63      /* Socket type not supported */
#define EOPNOTSUPP      64      /* Operation not supported on socket */
#define EPFNOSUPPORT    65      /* Protocol family not supported */
#define EAFNOSUPPORT    66      /* Address family not supported by protocol family */
#define EADDRINUSE      67      /* Address already in use */
#define EADDRNOTAVAIL   68      /* Can't assign requested address */
#define ENETDOWN        69      /* Network is down */
#define ENETUNREACH     70      /* Network is unreachable */
#define ENETRESET       71      /* Network dropped connection on reset */
#define ECONNABORTED    72      /* Software caused connection abort */
#define ECONNRESET      73      /* Connection reset by peer */
#define ENOBUFS         74      /* No buffer space available */
#define EISCONN         75      /* Socket is already connected */
#define ENOTCONN        76      /* Socket is not connected */
#define ESHUTDOWN       77      /* Can't send after socket shutdown */
#define ETIMEDOUT       78      /* Connection timed out */
#define ECONNREFUSED    79      /* Connection refused */
#define EHOSTDOWN       80      /* Host is down */
#define EHOSTUNREACH    81      /* No route to host */
#define ERESTART        82      /* restart the system call */
#define EPROCLIM        83      /* Too many processes */
#define EUSERS          84      /* Too many users */
#define ELOOP           85      /* Too many levels of symbolic links      */
#define ENAMETOOLONG    86      /* File name too long                     */
#define EDQUOT          88      /* Disc quota exceeded */
#define ECORRUPT        89      /* Invalid file system control data */
#define EREMOTE         93      /* Item is not local to host */
#define ENOSYS          109     /* Function not implemented  POSIX */
#define EMEDIA          110     /* media surface error */
#define ESOFT           111     /* I/O completed, but needs relocation */
#define ENOATTR         112     /* no attribute found */
#define ESAD            113     /* security authentication denied */
#define ENOTRUST        114     /* not a trusted program */
#define ETOOMANYREFS    115     /* Too many references: can't splice */
#define EILSEQ          116     /* Invalid wide character */
#define ECANCELED       117     /* asynchronous i/o cancelled */
#define ENOSR           118     /* temp out of streams resources */
#define ETIME           119     /* I_STR ioctl timed out */
#define EBADMSG         120     /* wrong message type at stream head */
#define EPROTO          121     /* STREAMS protocol error */
#define ENODATA         122     /* no message ready at stream head */
#define ENOSTR          123     /* fd is not a stream */
#define ECLONEME        ERESTART /* this is the way we clone a stream ... */
#define ENOTSUP         124     /* POSIX threads unsupported value */
#define EMULTIHOP       125     /* multihop is not allowed */
#define ENOLINK         126     /* the link has been severed */
#define EOVERFLOW       127     /* value too large to be stored in data type */


==================================
2. NFS and Mount command examples:
==================================


Let's start with something that might be of interrest right now:


Examples of mounting a DVD or CDROM:
===================================

AIX:
----
# mount -r -v cdrfs /dev/cd0 /cdrom


Solaris:
--------
# mount -r -F hsfs /dev/dsk/c0t6d0s2 /cdrom


HPUX:
-----

mount -F cdfs -o rr /dev/dsk/c1t2d0 /cdrom


SuSE Linux:
-----------
# mount -t iso9660 /dev/cdrom /cdrom
# mount -t iso9660 /dev/cdrom /media/cdrom


Redhat Linux:
-------------
# mount -t iso9660 /dev/cdrom /media/cdrom

Other commands on Linux:
------------------------

Sometimes on some Linux, and some scsi CDROM devices, you might try

# mount /dev/sr0 /mount_point
# mount -t iso9660 /dev/sr0 /mount_point


Now we return to a discussion of "mounting" and NFS.


2.1 NFS:
========

We will discuss the most important feaures of NFS, by showing how its implemented on 
Solaris, Redhat and SuSE Linux. Most of this applies to HP-UX and AIX as well.


2.1.1 NFS and Redhat Linux:
---------------------------

Linux uses a combination of kernel-level support and continuously running daemon processes to provide 
NFS file sharing, however, NFS support must be enabled in the Linux kernel to function. 
NFS uses Remote Procedure Calls (RPC) to route requests between clients and servers, meaning that the 
portmap service must be enabled and active at the proper runlevels for NFS communication to occur. 
Working with portmap, various other processes ensure that a particular NFS connection is allowed and may 
proceed without error: 

rpc.mountd  - The running process that receives the mount request from an NFS client and checks to see 
              if it matches with a currently exported file system. 
rpc.nfsd    - The process that implements the user-level part of the NFS service. It works with the Linux kernel 
              to meet the dynamic demands of NFS clients, such as providing additional server threads for 
              NFS clients to uses. 
rpc.lockd   - A daemon that is not necessary with modern kernels. NFS file locking is now done by the kernel. 
              It is included with the nfs-utils package for users of older kernels that do not include this 
              functionality by default. 
rpc.statd   - Implements the Network Status Monitor (NSM) RPC protocol. This provides reboot notification 
              when an NFS server is restarted without being gracefully brought down. 
rpc.rquotad - An RPC server that provides user quota information for remote users. 

Not all of these programs are required for NFS service. The only services that must be enabled are rpc.mountd, 
rpc.nfsd, and portmap. The other daemons provide additional functionality and should only be used if your server 
environment requires them. 


NFS version 2 uses the User Datagram Protocol (UDP) to provide a stateless network connection between 
the client and server. NFS version 3 can use UDP or TCP running over an IP. The stateless UDP connection 
minimizes network traffic, as the NFS server sends the client a cookie after the client is authorized 
to access the shared volume. This cookie is a random value stored on the server's side and is passed 
with along with RPC requests from the client. The NFS server can be restarted without affecting the clients 
and the cookie will remain intact. 

NFS only performs authentication when a client system attempts to mount a remote file system. To limit access, 
the NFS server first employs TCP wrappers. TCP wrappers reads the /etc/hosts.allow and /etc/hosts.deny files 
to determine if a particular client should be permitted or prevented access to the NFS server.  
After the client is allowed past TCP wrappers, the NFS server refers to its configuration file, 
"/etc/exports", to determine whether the client has enough privileges to mount any of the exported file systems. 
After granting access, any file and directory operations are sent to the server using remote procedure calls. 

 Warning 
  NFS mount privileges are granted specifically to a client, not a user. If you grant a client machine access 
  to an exported file system, any users of that machine will have access to the data. 

When configuring the /etc/exports file, be extremely careful about granting read-write permissions 
(rw) to a remote host. 
 
-- NFS and portmap
NFS relies upon remote procedure calls (RPC) to function. portmap is required to map RPC requests to the 
correct services. RPC processes notify portmap when they start, revealing the port number they are monitoring 
and the RPC program numbers they expect to serve. The client system then contacts portmap on the server with 
a particular RPC program number. portmap then redirects the client to the proper port number to communicate 
with its intended service. 

Because RPC-based services rely on portmap to make all connections with incoming client requests, 
portmap must be available before any of these services start. If, for some reason, the portmap service 
unexpectedly quits, restart portmap and any services running when it was started. 

The portmap service can be used with the host access files (/etc/hosts.allow and /etc/hosts.deny) to control 
which remote systems are permitted to use RPC-based services on your machine. Access control rules for portmap 
will affect all RPC-based services. Alternatively, you can specify each of the NFS RPC daemons to be affected 
by a particular access control rule. The man pages for rpc.mountd and rpc.statd contain information regarding 
the precise syntax of these rules. 

-- portmap Status
As portmap provides the coordination between RPC services and the port numbers used to communicate with them, 
it is useful to be able to get a picture of the current RPC services using portmap when troubleshooting. 
The rpcinfo command shows each RPC-based service with its port number, RPC program number, version, 
and IP protocol type (TCP or UDP). 
To make sure the proper NFS RPC-based services are enabled for portmap, rpcinfo -p can be useful: 

# rpcinfo -p

   program vers proto   port
    100000    2   tcp    111  portmapper
    100000    2   udp    111  portmapper
    100024    1   udp   1024  status
    100024    1   tcp   1024  status
    100011    1   udp    819  rquotad
    100011    2   udp    819  rquotad
    100005    1   udp   1027  mountd
    100005    1   tcp   1106  mountd
    100005    2   udp   1027  mountd
    100005    2   tcp   1106  mountd
    100005    3   udp   1027  mountd
    100005    3   tcp   1106  mountd
    100003    2   udp   2049  nfs
    100003    3   udp   2049  nfs
    100021    1   udp   1028  nlockmgr
    100021    3   udp   1028  nlockmgr
    100021    4   udp   1028  nlockmgr
 

The -p option probes the portmapper on the specified host or defaults to localhost if no specific host is listed. 
Other options are available from the rpcinfo man page. 
From the output above, various NFS services can be seen running. If one of the NFS services does not start up 
correctly, portmap will be unable to map RPC requests from clients for that service to the correct port. 
In many cases, restarting NFS as root (/sbin/service nfs restart) will cause those service to correctly 
register with portmap and begin working. 

# /sbin/service nfs restart

-- NFS Server Configuration Files
Configuring a system to share files and directories using NFS is straightforward. Every file system being 
exported to remote users via NFS, as well as the access rights relating to those file systems, 
is located in the /etc/exports file. This file is read by the exportfs command to give rpc.mountd and rpc.nfsd 
the information necessary to allow the remote mounting of a file system by an authorized host. 

The exportfs command allows you to selectively export or unexport directories without restarting the various 
NFS services. When exportfs is passed the proper options, the file systems to be exported are written to 
/var/lib/nfs/xtab. Since rpc.mountd refers to the xtab file when deciding access privileges to a file system, 
changes to the list of exported file systems take effect immediately. 

Various options are available when using exportfs: 


-r - Causes all directories listed in /etc/exports to be exported by constructing a new export list in 
     /etc/lib/nfs/xtab. This option effectively refreshes the export list with any changes that have been 
     made to /etc/exports. 

-a - Causes all directories to be exported or unexported, depending on the other options passed to exportfs. 

-o   options - Allows the user to specify directories to be exported that are not listed in /etc/exports. 
     These additional file system shares must be written in the same way they are specified in /etc/exports. 
     This option is used to test an exported file system before adding it permanently to the list of file systems 
     to be exported. 

-i - Tells exportfs to ignore /etc/exports; only options given from the command line are used to define 
     exported file systems. 

-u - Unexports directories from being mounted by remote users. The command exportfs -ua effectively suspends 
     NFS file sharing while keeping the various NFS daemons up. To allow NFS sharing to continue, type exportfs -r. 

-v - Verbose operation, where the file systems being exported or unexported are displayed in greater detail 
     when the exportfs command is executed. 

If no options are passed to the exportfs command, it displays a list of currently exported file systems. 

Changes to /etc/exports can also be read by reloading the NFS service with the service nfs reload command. 
This keeps the NFS daemons running while re-exporting the /etc/exports file. 

-- /etc/exports
The /etc/exports file is the standard for controlling which file systems are exported to which hosts, 
as well as specifying particular options that control everything. Blank lines are ignored, comments can be made 
using #, and long lines can be wrapped with a backslash (\). Each exported file system should be on its own line. 
Lists of authorized hosts placed after an exported file system must be separated by space characters. 
Options for each of the hosts must be placed in parentheses directly after the host identifier, without any spaces 
separating the host and the first parenthesis. 

In its simplest form, /etc/exports only needs to know the directory to be exported and the hosts 
permitted to use it: 

/some/directory bob.domain.com
/another/exported/directory 192.168.0.3
 
n5111sviob

After re-exporting /etc/exports with the "/sbin/service nfs reload" command, the bob.domain.com host will be 
able to mount /some/directory and 192.168.0.3 can mount /another/exported/directory. Because no options 
are specified in this example, several default NFS preferences take effect.

In order to override these defaults, you must specify an option that takes its place. For example, if you do 
not specify rw, then that export will only be shared read-only. Each default for every exported file system 
must be explicitly overridden. Additionally, other options are available where no default value is in place. 
These include the ability to disable sub-tree checking, allow access from insecure ports, and allow insecure 
file locks (necessary for certain early NFS client implementations). See the exports man page for details 
on these lesser used options. 

When specifying hostnames, you can use the following methods: 

single host - Where one particular host is specified with a fully qualified domain name, hostname, or IP address. 

wildcards   - Where a * or ? character is used to take into account a grouping of fully qualified domain names 
              that match a particular string of letters. Wildcards are not to be used with IP addresses; however, 
              they may accidently work if reverse DNS lookups fail. 

However, be careful when using wildcards with fully qualified domain names, as they tend to be more exact 
than you would expect. For example, the use of *.domain.com as wildcard will allow sales.domain.com to access 
the exported file system, but not bob.sales.domain.com. To match both possibilities, as well as 
sam.corp.domain.com, you would have to provide *.domain.com *.*.domain.com. 

IP networks - Allows the matching of hosts based on their IP addresses within a larger network. For example, 
              192.168.0.0/28 will allow the first 16 IP addresses, from 192.168.0.0 to 192.168.0.15, 
              to access the exported file system but not 192.168.0.16 and higher. 

netgroups   - Permits an NIS netgroup name, written as @<group-name>, to be used. This effectively puts the 
              NIS server in charge of access control for this exported file system, where users can be added 
              and removed from an NIS group without affecting /etc/exports. 


Warning 
  The way in which the /etc/exports file is formatted is very important, particularly concerning the use of 
  space characters. Remember to always separate exported file systems from hosts and hosts from one another 
  with a space character. However, there should be no other space characters in the file unless they are used 
  in comment lines. 

  For example, the following two lines do not mean the same thing: 

 /home bob.domain.com(rw)
 /home bob.domain.com (rw)
 

  The first line allows only users from bob.domain.com read-write access to the /home directory. 
  The second line allows users from bob.domain.com to mount the directory read-only (the default), but the rest 
  of the world can mount it read-write. Be careful where space characters are used in /etc/exports. 
 

-- NFS Client Configuration Files - What to do on a client?

Any NFS share made available by a server can be mounted using various methods. Of course, the share can be 
manually mounted, using the mount command, to acquire the exported file system at a particular mount point. 
However, this requires that the root user type the mount command every time the system restarts. 
In addition, the root user must remember to unmount the file system when shutting down the machine. 
Two methods of configuring NFS mounts include modifying the /etc/fstab or using the autofs service. 

> /etc/fstab
Placing a properly formatted line in the /etc/fstab file has the same effect as manually mounting the 
exported file system. The /etc/fstab file is read by the /etc/rc.d/init.d/netfs script at system startup. 
The proper file system mounts, including NFS, are put into place. 

A sample /etc/fstab line to mount an NFS export looks like the following: 

<server>:</path/of/dir> </local/mnt/point> nfs <options> 0 0
 
The <server-host> relates to the hostname, IP address, or fully qualified domain name of the server exporting 
the file system. The </path/to/shared/directory> tells the server what export to mount. 
The </local/mount/point> specifies where on the local file system to mount the exported directory. 
This mount point must exist before /etc/fstab is read or the mount will fail. The nfs option specifies 
the type of file system being mounted. 

The <options> area specifies how the file system is to be mounted. For example, if the options 
area states rw,suid on a particular mount, the exported file system will be mounted read-write and the 
user and group ID set by the server will be used. Note, parentheses are not to be used here.  


2.1.2 NFS and SuSE Linux:
-------------------------

-- Importing File Systems with YaST

Any user authorized to do so can mount NFS directories from an NFS server into his own file tree. 
This can be achieved most easily using the YaST module `NFS Client'. Just enter the host name of the NFS server, 
the directory to import, and the mount point at which to mount this directory locally. 
All this is done after clicking `Add' in the first dialog.


-- Importing File Systems Manually

File systems can easily be imported manually from an NFS server. The only prerequisite is a running 
RPC port mapper, which can be started by entering the command 
# rcportmap start 

as root. Once this prerequisite is met, remote file systems exported on the respective machines 
can be mounted in the file system just like local hard disks using the command mount with the following syntax: 

# mount host:remote-path local-path

If user directories from the machine sun, for example, should be imported, the following command can be used: 

# mount sun:/home /home
 

-- Exporting File Systems with YaST

With YaST, turn a host in your network into an NFS server - a server that exports directories and files 
to all hosts granted access to it. This could be done to provide applications to all coworkers of a group 
without installing them locally on each and every host. To install such a server, start YaST and select 
`Network Services' -> `NFS Server' 

Next, activate `Start NFS Server' and click `Next'. In the upper text field, enter the directories to export. 
Below, enter the hosts that should have access to them. 
There are four options that can be set for each host: single host, netgroups, wildcards, and IP networks. 
A more thorough explanation of these options is provided by man exports. `Exit' completes the configuration. 


-- Exporting File Systems Manually

If you do not want to use YaST, make sure the following systems run on the NFS server: 

RPC portmapper (portmap) 
RPC mount daemon (rpc.mountd) 
RPC NFS daemon (rpc.nfsd) 

For these services to be started by the scripts "/etc/init.d/portmap" and "/etc/init.d/nfsserver" 
when the system is booted, enter the commands 

# insserv /etc/init.d/nfsserver    and 
# insserv /etc/init.d/portmap. 

Also define which file systems should be exported to which host in the configuration file "/etc/exports". 

For each directory to export, one line is needed to set which machines may access that directory 
with what permissions. All subdirectories of this directory are automatically exported as well. 
Authorized machines are usually specified with their full names (including domain name), but it is possible 
to use wild cards like * or ? (which expand the same way as in the Bash shell). If no machine is specified here, 
any machine is allowed to import this file system with the given permissions. 

Set permissions for the file system to export in brackets after the machine name. The most important options are: 

ro 		File system is exported with read-only permission (default).  
rw 		File system is exported with read-write permission.  
root_squash 	This makes sure the user root of the given machine does not have root permissions 
                on this file system. This is achieved by assigning user ID 65534 to users with user ID 0 (root). 
                This user ID should be set to nobody (which is the default).  
no_root_squash 	Does not assign user ID 0 to user ID 65534, keeping the root permissions valid.  
link_relative	Converts absolute links (those beginning with /) to a sequence of ../. 
                This is only useful if the entire file system of a machine is mounted (default).  
link_absolute	Symbolic links remain untouched.  
map_identity	User IDs are exactly the same on both client and server (default).  
map_daemon	Client and server do not have matching user IDs. This tells nfsd to create a conversion table 
                for user IDs. The ugidd daemon is required for this to work.  

/etc/exports is read by mountd and nfsd. If you change anything in this file, restart mountd and nfsd 
for your changes to take effect. This can easily be done with "rcnfsserver restart". 


Example SuSE /etc/exports

#
# /etc/exports
#
/home            sun(rw)   venus(rw)
/usr/X11         sun(ro)   venus(ro)
/usr/lib/texmf   sun(ro)   venus(rw)
/                earth(ro,root_squash)
/home/ftp        (ro)
# End of exports


2.2 Mount command:
==================

The standard form of the mount command, is 

mount -F typefs device mountdir (solaris, HP-UX)
mount -t typefs device mountdir (many other unix's)

This tells the kernel to attach the file system found on "device" (which is of type type) 
at the directory "dir". 
The previous contents (if any) and owner and mode of dir become invisible, 
and as long as this file system remains mounted, 
the pathname dir refers to the root of the file system on device. 

The syntax is:
mount [options] [type] [device] [mountpoint]


-- mounting a remote filesystem:

syntax: mount -F nfs <options> <-o specific options> -O  <server>:<filesystem> <local_mount_point>

# mount -F nfs hpsrv:/data /data
# mount -F nfs -o hard,intr thor:/data  /data


- standard mounts are determined by files like  /etc/fstab (HP-UX) or /etc/filesystems (AIX) or /etc/vfstab etc..


2.2.1 Where are the standard mounts defined?
============================================

In Solaris:
===========

- standard mounts are determined by /etc/vfstab etc..
- NFS mounts are determined by the file /etc/dfs/dfstab. Here you will find share commands. 
- currently mounted filesystems are listed in /etc/mnttab

In Linux:
=========

- standard mounts are determined by most Linux distros by "/etc/fstab".

In AIX:
=======

- standard mounts and properties are determined by the file "/etc/filesystems".

In HP-UX:
=========

There is a /etc/fstab which contains all of the filesystems are mounted at boot time.
The filesystems that are OS related are / , /var, /opt , /tmp, /usr , /stand

The filesystem that is special is /stand, this is where your kernel is built and resides. 
Notice that the filesystem type is "hfs". HPUX kernels MUST reside on an hfs filesystem


An example of /etc/vfstab:
--------------------------

starboss:/etc $ more vfstab
#device         device          mount           FS      fsck    mount   mount
#to mount       to fsck         point           type    pass    at boot options
#
fd      -       /dev/fd fd      -       no      -
/proc   -       /proc   proc    -       no      -
/dev/md/dsk/d1  -       -       swap    -       no      -
/dev/md/dsk/d0  /dev/md/rdsk/d0 /       ufs     1       no      logging
/dev/md/dsk/d4  /dev/md/rdsk/d4 /usr    ufs     1       no      logging
/dev/md/dsk/d3  /dev/md/rdsk/d3 /var    ufs     1       no      logging
/dev/md/dsk/d7  /dev/md/rdsk/d7 /export ufs     2       yes     logging
/dev/md/dsk/d5  /dev/md/rdsk/d5 /usr/local      ufs     2       yes     logging
/dev/dsk/c2t0d0s0 /dev/rdsk/c2t0d0s0    /export2        ufs     2       yes     logging
swap - /tmp tmpfs - yes size=512m


mount adds an entry, umount deletes an entry.
mounting applies to local filesystemes, or remote filesystems via NFS


Local mount example:


mount -F ufs -o logging /dev/dsk/c0t0d0s3 /mnt

At Remote server: 
share, shareall, or add entry in /etc/dfs/dfstab
# share -F nfs /var/mail  

Unmount a mounted FS

First check who is using it
# fuser -c mountpoint
# umount mointpoint


2.2.2 Mounting a NFS filesystem in HP-UX:
=========================================

Mounting Remote File Systems 
You can use either SAM or the mount command to mount file systems located on a remote system.

Before you can mount file systems located on a remote system, NFS software must be installed and 
configured on both local and remote systems. Refer to Installing and Administering NFS for information.

For information on mounting NFS file systems using SAM, see SAM's online help.

To mount a remote file system using HP-UX commands,

You must know the name of the host machine and the file system's directory on the remote machine.
Establish communication over a network between the local system (that is, the "client") and the 
remote system. (The local system must be able to reach the remote system via whatever hosts database is in use.) 
(See named(1M) and hosts(4).) If necessary, test the connection with /usr/sbin/ping; see ping(1M).

Make sure the file /etc/exports on the remote system lists the file systems that you wish to make available 
to clients (that is, to "export") and the local systems that you wish to mount the file systems.

For example, to allow machines called rolf and egbert to remotely mount the /usr file system, edit the file 
/etc/exports on the remote machine and include the line:
 
/usr rolf egbert 
 
Execute /usr/sbin/exportfs -a on the remote system to export all directories in /etc/exports to clients.

For more information, see exportfs(1M).
 
 NOTE: If you wish to invoke exportfs -a at boot time, make sure the NFS configuration file /etc/rc.config.d/nfsconf 
 on the remote system contains the following settings: NFS_SERVER=1 and START_MOUNTD=1. 
 The client's /etc/rc.config.d/nfsconf file must contain NFS_CLIENT=1. Then issue the following command 
 to run the script: 
 /sbin/init.d/nfs.server start  
 
Mount the file system on the local system, as in:
 
# mount -F nfs remotehost:/remote_dir /local_dir 


Just a bunch of mount command examples:
---------------------------------------

# mount
# mount -a
# mountall -l
# mount -t type device dir                  
# mount -F pcfs /dev/dsk/c0t0d0p0:c /pcfs/c 
# mount /dev/md/dsk/d7 /u01
# mount sun:/home /home
# mount -t nfs 137.82.51.1:/share/sunos/local /usr/local
# mount /dev/fd0 /mnt/floppy
# mount -o ro /dev/dsk/c0t6d0s1 /mnt/cdrom
# mount -V cdrfs -o ro /dev/cd0  /cdrom


2.2.3 Solaris mount command:
============================

The unix mount command is used to mount a filesystem, and it attaches disks, and directories logically 
rather than physically. It takes a minimum of two arguments:

1) the name of the special device which contains the filesystem
2) the name of an existing directory on which to mount the file system

Once the file system is mounted, the directory becomes the mount point. All the file systems will now be usable 
as if they were subdirectories of the file system they were mounted on. The table of currently mounted file systems 
can be found by examining the mounted file system information file. This is provided by a file system that is usually 
mounted on /etc/mnttab.


Mounting a file system causes three actions to occur:

1. The superblock for the mounted file system is read into memory
2. An entry is made in the /etc/mnttab file
3. An entry is made in the inode for the directory on which the file system is mounted which marks the directory 
as a mount point

The /etc/mountall command mounts all filesystems as described in the /etc/vfstab file.
Note that /etc/mount and /etc/mountall commands can only be executed by the superuser.

OPTIONS

-F FSType
   Used to specify the FSType on which to operate. The FSType must be specified or must be determinable from
   /etc/vfstab, or by consulting /etc/default/fs or /etc/dfs/fstypes.

-a [ mount_points. . . ]
   Perform mount or umount operations in parallel, when possible.

If mount points are not specified, mount will mount all file systems whose /etc/vfstab "mount at boot"
field is "yes". If mount points are specified, then /etc/vfstab "mount at boot" field will be ignored.

If mount points are specified, umount will only umount those mount points. If none is specified, then umount
will attempt to unmount all file systems in /etc/mnttab, with the exception of certain system
required file systems: /, /usr, /var, /var/adm, /var/run, /proc, /dev/fd and /tmp.

-f Forcibly unmount a file system.
   Without this option, umount does not allow a file system to be unmounted if a file on the file system is
   busy. Using this option can cause data loss for open files; programs which access files after the file sys-
   tem has been unmounted will get an error (EIO).

-p Print the list of mounted file systems in the /etc/vfstab format. Must be the only option specified.

-v Print the list of mounted file systems in verbose format. Must be the only option specified.

-V Echo the complete command line, but do not execute the command. umount generates a command line by using the
   options and arguments provided by the user and adding to them information derived from /etc/mnttab. This
   option should be used to verify and validate the command line.

generic_options
Options that are commonly supported by most FSType-specific command modules. The following options are
available:

-m Mount the file system without making an entry in /etc/mnttab.

-g Globally mount the file system. On a clustered system, this globally mounts the file system on
   all nodes of the cluster. On a non-clustered system this has no effect.

-o Specify FSType-specific options in a comma separated (without spaces) list of suboptions
   and keyword-attribute pairs for interpretation by the FSType-specific module of the command.
   (See mount_ufs(1M))

-O Overlay mount. Allow the file system to be mounted over an existing mount point, making
   the underlying file system inaccessible. If a mount is attempted on a pre-existing mount point
   without setting this flag, the mount will fail, producing the error "device busy".

-r Mount the file system read-only.


Example mount:

mount -F ufs -o logging /dev/dsk/c0t0d0s3 /mnt


Example mountpoints and disks:
------------------------------

Mountpunt	Device	        Omvang 	Doel
/	       /dev/md/dsk/d1   100	Unix Root-filesysteem
/usr	       /dev/md/dsk/d3	1200	Unix usr-filesysteem
/var	       /dev/md/dsk/d4	200	Unix var-filesysteem
/home	       /dev/md/dsk/d5	200	Unix opt-filesysteem
/opt	       /dev/md/dsk/d6	4700	Oracle_Home
/u01	       /dev/md/dsk/d7	8700	Oracle datafiles	
/u02	       /dev/md/dsk/d8	8700	Oracle datafiles	
/u03	       /dev/md/dsk/d9	8700	Oracle datafiles	
/u04	       /dev/md/dsk/d10	8700	Oracle datafiles	
/u05	       /dev/md/dsk/d110	8700	Oracle datafiles	
/u06	       /dev/md/dsk/d120	8700	Oracle datafiles	
/u07	       /dev/md/dsk/d123	8650 	Oracle datafiles	

Suppose you have only 1 disk of about 72GB, 2GB RAM:

Entire disk= Slice 2

/        Slice 0, partition  about 2G
swap     Slice 1, partition  about 4G
/export  Slice 3, partition  about 50G, maybe you link it to /u01
/var     Slice 4, partition  about 2G
/opt     Slice 5, partition  about 10G if you plan to install apps here
/usr     Slice 6, partition  about 2G
/u01     Slice 7, partition  optional, standard it's /home
         Depending on how you configure /export, size could be around 20G


find . -name dfctowdk\*.zip | while read file; do pkzip25 -extract -translate=unix ->


2.2.4 mount command on AIX:
===========================

Typical examples:

# mount -o soft 10.32.66.75:/data/nim /mnt
# mount -o soft abcsrv:/data/nim /mnt
# mount -o soft n580l03:/data/nim /mnt


Note 1:
-------

mount [ -f ] [ -n Node ] [ -o Options ] [ -p ] [ -r ] [ -v VfsName ] [ -t Type | [ Device | Node:Directory ] 
      Directory | all | -a ] [-V [generic_options] special_mount_points 

If you specify only the Directory parameter, the mount command takes it to be the name of the directory or file on which 
a file system, directory, or file is usually mounted (as defined in the /etc/filesystems file). The mount command looks up 
the associated device, directory, or file and mounts it. This is the most convenient way of using the mount command, 
because it does not require you to remember what is normally mounted on a directory or file. You can also specify only 
the device. In this case, the command obtains the mount point from the /etc/filesystems file.

The /etc/filesystems file should include a stanza for each mountable file system, directory, or file. This stanza should 
specify at least the name of the file system and either the device on which it resides or the directory name. 
If the stanza includes a mount attribute, the mount command uses the associated values. It recognizes five values 
for the mount attributes: automatic, true, false, removable, and readonly. 

The mount all command causes all file systems with the mount=true attribute to be mounted in their normal places. 
This command is typically used during system initialization, and the corresponding mounts are referred to as 
automatic mounts. 

Example mount command on AIX:
-----------------------------

$ mount

  node       mounted        mounted over    vfs       date        options
-------- ---------------  ---------------  ------ ------------ ---------------
         /dev/hd4         /                jfs2   Jun 06 17:15 rw,log=/dev/hd8
         /dev/hd2         /usr             jfs2   Jun 06 17:15 rw,log=/dev/hd8
         /dev/hd9var      /var             jfs2   Jun 06 17:15 rw,log=/dev/hd8
         /dev/hd3         /tmp             jfs2   Jun 06 17:15 rw,log=/dev/hd8
         /dev/hd1         /home            jfs2   Jun 06 17:16 rw,log=/dev/hd8
         /proc            /proc            procfs Jun 06 17:16 rw
         /dev/hd10opt     /opt             jfs2   Jun 06 17:16 rw,log=/dev/hd8
         /dev/fslv00      /XmRec           jfs2   Jun 06 17:16 rw,log=/dev/hd8
         /dev/fslv01      /tmp/m2          jfs2   Jun 06 17:16 rw,log=/dev/hd8
         /dev/fslv02      /software        jfs2   Jun 06 17:16 rw,log=/dev/hd8
         /dev/oralv       /opt/app/oracle  jfs2   Jun 06 17:25 rw,log=/dev/hd8
         /dev/db2lv       /db2_database    jfs2   Jun 06 19:54 rw,log=/dev/loglv00
         /dev/fslv03      /bmc_home        jfs2   Jun 07 12:11 rw,log=/dev/hd8
         /dev/homepeter   /home/peter      jfs2   Jun 13 18:42 rw,log=/dev/hd8
         /dev/bmclv       /bcict/stage     jfs2   Jun 15 15:21 rw,log=/dev/hd8
         /dev/u01         /u01             jfs2   Jun 22 00:22 rw,log=/dev/loglv01
         /dev/u02         /u02             jfs2   Jun 22 00:22 rw,log=/dev/loglv01
         /dev/u05         /u05             jfs2   Jun 22 00:22 rw,log=/dev/loglv01
         /dev/u03         /u03             jfs2   Jun 22 00:22 rw,log=/dev/loglv01
         /dev/backuo      /backup_ora      jfs2   Jun 22 00:22 rw,log=/dev/loglv02
         /dev/u02back     /u02back         jfs2   Jun 22 00:22 rw,log=/dev/loglv03
         /dev/u01back     /u01back         jfs2   Jun 22 00:22 rw,log=/dev/loglv03
         /dev/u05back     /u05back         jfs2   Jun 22 00:22 rw,log=/dev/loglv03
         /dev/u04back     /u04back         jfs2   Jun 22 00:22 rw,log=/dev/loglv03
         /dev/u03back     /u03back         jfs2   Jun 22 00:22 rw,log=/dev/loglv03
         /dev/u04         /u04             jfs2   Jun 22 10:25 rw,log=/dev/loglv01


Example /etc/filesystems file:

/var:
        dev             = /dev/hd9var
        vfs             = jfs2
        log             = /dev/hd8
        mount           = automatic
        check           = false
        type            = bootfs
        vol             = /var
        free            = false

/tmp:
        dev             = /dev/hd3
        vfs             = jfs2
        log             = /dev/hd8
        mount           = automatic
        check           = false
        vol             = /tmp
        free            = false


/opt:
        dev             = /dev/hd10opt
        vfs             = jfs2
        log             = /dev/hd8
        mount           = true
        check           = true
        vol             = /opt
        free            = false

Example of the relation of Logigal Volumes and mountpoints:

/dev/lv01 = /u01
/dev/lv02 = /u02
/dev/lv03 = /u03
/dev/lv04 = /data
/dev/lv00 = /spl


2.2.5 Some other commands related to mounts:
===========================================

fsstat command:
---------------

On some unixes, the fsstat command is available. It provides filesystem statitstics.
It can take a lot of switches, thus be sure to check the man pages.

On Solaris, the following example shows the statistics for each file operation for "/" (using the -f option):

$ fsstat -f /
Mountpoint: /
 operation  #ops  bytes
      open 8.54K
     close  9.8K
      read 43.6K  65.9M
     write 1.57K  2.99M
     ioctl 2.06K
     setfl     4
   getattr 40.3K
   setattr    38
    access 9.19K
    lookup  203K
    create   595
    remove    56
      link     0
    rename     9
     mkdir    19
     rmdir     0
   readdir 2.02K  2.27M
   symlink     4
  readlink 8.31K
     fsync   199
  inactive 2.96K
       fid     0
    rwlock 47.2K
  rwunlock 47.2K
      seek 29.1K
       cmp 42.9K
    frlock 4.45K
     space     8
    realvp 3.25K
   getpage  104K
   putpage 2.69K
       map 13.2K
    addmap 34.4K
    delmap 33.4K
      poll   287
      dump     0
  pathconf    54
    pageio     0
   dumpctl     0
   dispose 23.8K
getsecattr   697
setsecattr     0
   shrlock     0
   vnevent     0


fuser command:
--------------

AIX:

Purpose
Identifies processes using a file or file structure. 

Syntax
fuser [ -c | -d | -f ] [ -k ] [ -u ] [ -x ] [ -V ]File ... 


Description
The fuser command lists the process numbers of local processes that use the local or remote files 
specified by the File parameter. For block special devices, the command lists the processes that use 
any file on that device.


Flags

-c Reports on any open files in the file system containing File. 
-d Implies the use of the -c and -x flags. Reports on any open files which haved been unlinked from the file system 
   (deleted from the parent directory). When used in conjunction with the -V flag, it also reports the inode number 
   and size of the deleted file.  
-f Reports on open instances of File only. 
-k Sends the SIGKILL signal to each local process. Only the root user can kill a process of another user.  
-u Provides the login name for local processes in parentheses after the process number. 
-V Provides verbose output. 
-x Used in conjunction with -c or -f, reports on executable and loadable objects in addition to the standard fuser output. 


To list the process numbers of local processes using the /etc/passwd file, enter: 
# fuser /etc/passwd

To list the process numbers and user login names of processes using the /etc/filesystems file, enter: 
# fuser -u /etc/filesystems

To terminate all of the processes using a given file system, enter: 
#fuser -k -x -u /dev/hd1 -OR-
#fuser -kxuc /home

Either command lists the process number and user name, and then terminates each process that is using 
the /dev/hd1 (/home) file system. Only the root user can terminate processes that belong to another user. 
You might want to use this command if you are trying to unmount the /dev/hd1 file system and a process 
that is accessing the /dev/hd1 file system prevents this.

To list all processes that are using a file which has been deleted from a given file system, enter: 
# fuser -d /usr


Examples on linux distro's:

- To kill all processes accessing the file system /home in any way.
# fuser  -km /home 

- invokes something if no other process is using /dev/ttyS1.       
if fuser -s /dev/ttyS1; then :; else something; fi 

- shows all processes at the (local) TELNET port.       
# fuser telnet/tcp 

A similar command is the lsof command.


2.2.6 Starting and stopping NFS:
================================

Short note on stopping and starting NFS. See other sections for more detail.

On all unixes, a number of daemons should be running in order for NFS to be functional, like for example
the rpc.* processes, biod, nfsd and others.

Once nfs is running, and in order to actually "share" or "export" your filesystem on your server, so remote clients 
are able to mount the nfs mount, in most cases you should edit the "/etc/exports" file.
See other sections in this document (search on exportfs) on how to accomplish this.

-- AIX:

The following subsystems are part of the nfs group: nfsd, biod, rpc.lockd, rpc.statd, and rpc.mountd. 
The nfs subsystem (group) is under control of the "resource controller", so starting and stopping nfs
is actually easy

# startsrc -g nfs
# stopsrc -g nfs

Or use smitty.


-- Redhat Linux:
# /sbin/service nfs restart
# /sbin/service nfs start
# /sbin/service nfs stop

-- On some other Linux distros
# /etc/init.d/nfs start 
# /etc/init.d/nfs stop
# /etc/init.d/nfs restart


-- Solaris:
If the nfs daemons aren't running, then you will need to run:
# /etc/init.d/nfs.server start 


-- HP-UX:
Issue the following command on the NFS server to start all the necessary NFS processes (HP): 
# /sbin/init.d/nfs.server start 
 
Or if your machine is only a client:

# cd /sbin/init.d
# ./nfs.client start


===========================================
3. Change ownership file/dir, adding users:
===========================================

3.1 Changing ownership:
-----------------------

chown -R user[:group] file/dir        (SVR4)
chown -R user[.group] file/dir        (bsd)

(-R recursive dirs)

Examples:
chown -R oracle:oinstall /opt/u01
chown -R oracle:oinstall /opt/u02
chown -R oracle:oinstall /opt/u03
chown -R oracle:oinstall /opt/u04

-R means all subdirs also.

chown rjanssen file.txt             - Give permissions as owner to user rjanssen. 


#  groupadd dba
#  useradd oracle
#  mkdir /usr/oracle
#  mkdir /usr/oracle/9.0
#  chown -R oracle:dba /usr/oracle
#  touch /etc/oratab
#  chown oracle:dba /etc/oratab


Note: Not owner message:
------------------------

>>> Solaris:

it is possible to turn the chown command on or off (i.e., allow it to be used or disallow its use) on a system by 
altering the /etc/system file. The /etc/system file, along with the files in /etc/default should be thought of a 
"system policy files" -- files that allow the systems administrator to determine such things as whether 
root can login over the network, whether su commands are logged, and whether a regular user can change ownership of his own files. 

On a system disallowing a user to change ownership of his files (this is now the default), the value of rstchown is set to 1. 
Think of this as saying "restrict chown is set to TRUE". You might see a line like this in /etc/system (or no rstchown value at all): 

set rstchown=1 

On a system allowing chown by regular users, this value will be set to 0 as shown here: 

set rstchown=0 

Whenever the /etc/system file is changed, the system will have to be rebooted for the changes to take effect. 
Since there is no daemon process associated with commands such a chown, there is no process that one could send 
a hangup (HUP) to effect the change in policy "on the fly". 

Why might system administrators restrict access to the chown command? For a system on which disk quotas are enforced,
 they might not want to allow files to be "assigned" by one user to another user's quota. More importantly, 
for a system on which accountability is deemed important, system administrators will want to know who 
created each file on a system - whether to track down a potential system abuse or simply to ask if a file that is 
occupying space in a shared directory or in /tmp can be removed. 

When a system disallows use of the chown command, you can expect to see dialog like this: 

% chown wallace myfile
chown: xyz: Not owner 

Though it would be possible to disallow "chowning" of files by changing permissions on /usr/bin/chown, 
such a change would not slow down most Unix users. They would simple copy the /usr/bin/chown file to their own directory 
and make their copy executable. Designed to be extensible, Unix will happily comply. Making the change in the /etc/system 
file blocks any chown operation from taking effect, regardless of where the executable is stored, who owns it, 
and what it is called. If usage of chown is restricted in /etc/system, only the superuser can change ownership of files. 


3.2 Add a user in Solaris:
--------------------------

Examples:

# useradd -u 3000 -g other -d /export/home/tempusr -m -s /bin/ksh -c "temporary user" tempusr
# useradd -u 1002 -g dba -d /export/home/avdsel -m -s /bin/ksh -c "Albert van der Sel" avdsel
# useradd -u 1001 -g oinstall -G dba -d /export/home/oraclown -m -s /bin/ksh -c "Oracle owner" oraclown
# useradd -u 1005 -g oinstall -G dba -d /export/home/brighta -m -s /bin/ksh -c "Bright Alley" brighta

useradd -u 300 -g staff -G staff -d /home/emc -m -s /usr/bin/ksh -c "EMC user" emc

a password cannot be specified using the useradd command. 
Use passwd to give the user a password:

# passwd tempusr

UID must be unique and is typically a number between 100 and 60002
GID is a number between 0 and 60002

Or use the graphical "admintool" or smc, the solaris management console.


-- Profiles a user can use to set the environment:

1. Korn Shell ksh:
------------------

When the POSIX or Korn Shell is your login shell, it looks for these following files and executes them, if they exist:

/etc/profile
This default system file is executed by the shell program and sets up default environment variables.

.profile
If this file exists in your home directory, it is executed next at login.

At any time-this includes login time-the POSIX or Korn Shell is invoked, it looks for the file referenced by the following shell variable, 
and executes it, if it exists:

ENV
When you invoke the shell, it looks for a shell variable called ENV which is usually set in your .profile. ENV is evaluated and if it is set 
to an existing file, that file is executed. By convention, ENV is usually set to .kshrc but may be set to any file name.

These files provide the means for customizing the shell environment to fit your needs.


2. Bourne Shell sh:
-------------------

it looks for these following files and executes them, if they exist:

/etc/profile

.profle in the home directory, for example "/home/user1/.profile"


3.3 Add a user in AIX:
----------------------

You can also use the useradd command, just as in Solaris.
Or use the native "mkuser" command.

# mkuser albert

The mkuser command does not create password information for a user. It initializes the password field 
with an * (asterisk). Later, this field is set with the passwd or pwdadm command. 
New accounts are disabled until the passwd or pwdadm commands are used to add authentication 
information to the /etc/security/passwd file.

You can use the Users application in Web-based System Manager to change user characteristics. You could also 
use the System Management Interface Tool (SMIT) "smit mkuser" fast path to run this command.

The /usr/lib/security/mkuser.default file contains the default attributes for new users. 
This file is an ASCII file that contains user stanzas. These stanzas have attribute default values 
for users created by the mkuser command. Each attribute has the Attribute=Value form. If an attribute 
has a value of $USER, the mkuser command substitutes the name of the user. The end of each attribute pair 
and stanza is marked by a new-line character.

There are two stanzas, user and admin, that can contain all defined attributes except the id and admin attributes. 
The mkuser command generates a unique id attribute. The admin attribute depends on whether the -a flag is used with 
the mkuser command.

A typical user stanza looks like the following:

user:
   pgroup = staff
   groups = staff
   shell = /usr/bin/ksh
   home = /home/$USER
   auth1 = SYSTEM

# mkuser [ -de | -sr ] [-attr Attributes=Value [ Attribute=Value... ] ] Name
# mkuser [ -R load_module ] [ -a ] [ Attribute=Value ... ] Name


To create the davis user account with the default values in the /usr/lib/security/mkuser.default file, type: 
# mkuser davis

To create the davis account with davis as an administrator, type: 
# mkuser -a davis

Only the root user or users with the UserAdmin authorization can create davis as an administrative user.

To create the davis user account and set the su attribute to a value of false, type: 
# mkuser su=false davis

To create the davis user account that is identified and authenticated through the LDAP load module, type: 
# mkuser -R LDAP davis


To add davis to the groups finance and accounting, enter: 
chuser groups=finance,accounting davis 

-- Add a user with the smit utility:
-- ---------------------------------
Start SMIT by entering

smit <Enter>

  From the Main Menu, make the following selections:

  -Security and Users 
    -Users 
      -Add a User to the System

The utility displays a form for adding new user information. Use the <Up-arrow> and <Down-arrow> keys to move through 
the form. Do not use <Enter> until you are finished and ready to exit the screen.
Fill in the appropriate fields of the Create User form (as listed in Create User Form) and press <Enter>.
The utility exits the form and creates the new user.


-- Using SMIT to Create a Group:
-- -----------------------------
Use the following procedure to create a group.

Start SMIT by entering the following command:

smit <Enter>

The utility displays the Main Menu.

  From the Main Menu, make the following selections:

  -Security and Users 
    -Users 
      -Add a Group to the System

The utility displays a form for adding new group information. 
Type the group name in the Group Name field and press <Enter>.
The group name must be eight characters or less.
The utility creates the new group, automatically assigns the next available GID, and exits the form

Primary Authentication method of system:
----------------------------------------

To check whether root has a primary authentication method of SYSTEM, use the following command:
# lsuser -a auth1 root

If needed, change the value by using
# chuser auth1=SYSTEM root


3.4 Add a user in HP-UX:
------------------------

-- Example 1:

Add user john to the system with all of the default attributes.

# useradd john

Add the user john to the system with a UID of 222 and a primary group
of staff.

# useradd -u 222 -g staff john

-- Example 2:

=> Add a user called guestuser as per following requirements
=> Primary group member of guests 
=> Secondary group member of www and accounting
=> Shell must be /usr/bin/bash3
=> Home directory must be /home/guestuser

# useradd -g guests -G www,accounting -d /home/guests -s /home/guestuser/ -m guestuser
# passwd guestuser


3.5 Add a user in Linux Redhat:
-------------------------------

You can use tools like useradd or groupadd to create new users and groups from the shell prompt. 
But an easier way to manage users and groups is through the graphical application, User Manager. 

Users are described in the /etc/passwd file
Groups are stored on Red Hat Linux in the /etc/group file. 

Or invoke the Gnome Linuxconf GUI Tool by typing "linuxconf". In Red Hat Linux, linuxconf is found in the 
/bin directory.


================================
4. Change filemode, permissions:
================================

Permissions are given to:
u = user
g = group
o = other/world
a = all

file/directory permissions (or also called "filemodes") are:
r = read
w = write
x = execute

special modes are:
X = sets execute if already set (this one is particularly sexy, look below)
s = set setuid/setgid bit
t = set sticky bit


Examples:
---------

readable by all, everyone
% chmod a+r essay.001

to remove read write and execute permissions on the file biglist for the group and others
% chmod go-rwx biglist 

make executable:
% chmod +x mycommand

set mode:
% chmod 644 filename

    rwxrwxrwx=777
    rw-rw-rw-=666
    rw-r--r--=644 corresponds to umask 022
    r-xr-xr-x=555
    rwxrwxr-x=775

1 = execute
2 = write
4 = read 

note that the total is 7
execute and read are: 1+4=5
read and write are: 2+4=6
read, write and exec: 1+2+4=7
and so on 

directories must always be executable... 

so a file with, say 640, means, the owner can read and write (4+2=6), the group can read (4) 
and everyone else has no permission to use the file (0). 

chmod -R a+X .
This command would set the executable bit (for all users) of all directories and executables 
below the current directory that presently have an execute bit set. Very helpful when you want to set 
all your binary files executable for everyone other than you without having to set the executable bit 
of all your conf files, for instance. *wink* 

chmod -R g+w .
This command would set all the contents below the current directory writable by your current group. 

chmod -R go-rwx
This command would remove permissions for group and world users without changing the bits for the file owner. 
Now you don't have to worry that 'find . -type f -exec chmod 600 {}\;' will change your binary files 
non-executable. Further, you don't need to run an additional command to chmod your directories. 

chmod u+s /usr/bin/run_me_setuid
This command would set the setuid bit of the file. It's simply easier than remembering which number to use 
when wanting to setuid/setgid, IMHO. 


========================
5. About the sticky bit:
========================


- This info is valid for most Unix OS including Solaris and AIX:
----------------------------------------------------------------

A 't' or 'T' as the last character of the "ls -l" mode characters
indicates that the "sticky" (save text image) bit is set. See ls(1) for
an explanation the distinction between 't' and 'T'.

The sticky bit has a different meaning, depending on the type of file it
is set on...

sticky bit on directories
-------------------------
[From chmod(2)]
If the mode bit S_ISVTX (sticky bit) is set on a directory, files
inside the directory may be renamed or removed only by the owner of
the file, the owner of the directory, or the superuser (even if the
modes of the directory would otherwise allow such an operation).

[Example]
drwxrwxrwt  104 bin        bin          14336 Jun  7 00:59 /tmp

Only root is permitted to turn the sticky bit on or off. In addition the sticky bit applies to anyone 
who accesses the file. The syntax for setting the sticky bit on a dir /foo directory is as follows: 

chmod +t /foo 


sticky bit on regular files
---------------------------
[From chmod(2)]
If an executable file is prepared for sharing, mode bit S_ISVTX prevents
the system from abandoning the swap-space image of the program-text
portion of the file when its last user terminates.  Then, when the next
user of the file executes it, the text need not be read from the file
system but can simply be swapped in, thus saving time.

[From HP-UX Kernel Tuning and Performance Guide]
Local paging. When applications are located remotely, set the "sticky
bit"
on the applications binaries, using the chmod +t command. This tells the
system to page the text to the local disk. Otherwise, it is "retrieved"
across the network. Of course, this would only apply when there is actual
paging occurring. More recently, there is a kernel parameter,
page_text_to_local, which when set to 1, will tell the kernel to page all
NFS executable text pages to local swap space.

[Example]
-r-xr-xr-t   6 bin        bin         24111111111664 Nov 14  2000
/usr/bin/vi


Solaris:
--------

The sticky bit on a directory is a permission bit that protects files within that directory. 
If the directory has the sticky bit set, only the owner of the file, the owner of the directory, 
or root can delete the file. The sticky bit prevents a user from deleting other users' files from 
public directories, such as uucppublic:

castle% ls -l /var/spool/uucppublic
drwxrwxrwt   2 uucp     uucp         512 Sep 10 18:06 uucppublic
castle%

When you set up a public directory on a TMPFS temporary file system, make sure that you set the sticky bit manually. 

You can set sticky bit permissions by using the chmod command to assign the octal value 1 as the first number 
in a series of four octal values. Use the following steps to set the sticky bit on a directory:

1.  If you are not the owner of the file or directory, become superuser. 
2.  Type chmod <1nnn> <filename> and press Return. 
3.  Type ls -l <filename> and press Return to verify that the permissions of the file have changed. 
The following example sets the sticky bit permission on the pubdir directory:

castle% chmod 1777 pubdir
castle% ls -l pubdir
drwxrwxrwt   2 winsor    staff    512 Jul 15 21:23 pubdir
castle%


================
6. About SETUID:
================

Each process has three user ID's: 
the real user ID (ruid)
the effective user ID (euid) and
the saved user ID (suid)

The real user ID identifies the owner of the process, the effective uid is used in most
access control decisions, and the saved uid stores a previous user ID so that it
can be restored later.
Similar, a process has three group ID's.

When a process is created by fork, it inherits the three uid's from the parent process.
When a process executes a new file by exec..., it keeps its three uid's unless the
set-user-ID bit of the new file is set, in which case the effective uid and saved uid
are assigned the user ID of the owner of the new file.


When setuid (set-user identification) permission is set on an executable file, a process that runs this file 
is granted access based on the owner of the file (usually root), rather than the user who created the process. 
This permission enables a user to access files and directories that are normally available only to the owner.

The setuid permission is shown as an s in the file permissions. 
For example, the setuid permission on the passwd command enables a user to change passwords, 
assuming the permissions of the root ID are the following:

castle% ls -l /usr/bin/passwd
-r-sr-sr-x   3 root     sys        96796 Jul 15 21:23 /usr/bin/passwd
castle%

You setuid permissions by using the chmod command to assign the octal value 4 as the first number 
in a series of four octal values. Use the following steps to setuid permissions:

1.  If you are not the owner of the file or directory, become superuser. 
2.  Type chmod <4nnn> <filename> and press Return. 
3.  Type ls -l <filename> and press Return to verify that the permissions of the file have changed. 

The following example sets setuid permission on the myprog file:

#chmod 4555 myprog
-r-sr-xr-x   1 winsor    staff    12796 Jul 15 21:23 myprog
#


The setgid (set-group identification) permission is similar to setuid, except that the effective group ID 
for the process is changed to the group owner of the file and a user is granted access based on permissions 
granted to that group. The /usr/bin/mail program has setgid permissions:

castle% ls -l /usr/bin/mail
-r-x-s-x   1 bin      mail       64376 Jul 15 21:27 /usr/bin/mail
castle%

When setgid permission is applied to a directory, files subsequently created in the directory belong to the group 
the directory belongs to, not to the group the creating process belongs to. Any user who has write permission 
in the directory can create a file there; however, the file does not belong to the group of the user, 
but instead belongs to the group of the directory.

You can set setgid permissions by using the chmod command to assign the octal value 2 as the first number 
in a series of four octal values. Use the following steps to set setgid permissions:

1.  If you are not the owner of the file or directory, become superuser. 
2.  Type chmod <2nnn> <filename> and press Return. 
3.  Type ls -l <filename> and press Return to verify that the permissions of the file have changed. 
The following example sets setuid permission on the myprog2 file:

#chmod 2551 myprog2
#ls -l myprog2
-r-xr-s-x   1 winsor    staff  26876 Jul 15 21:23 myprog2
#


=========================
7. Find command examples:
=========================

Introduction 
The find command allows the Unix user to process a set of files and/or directories in a file subtree. 

You can specify the following: 

where to search (pathname) 
what type of file to search for (-type: directories, data files, links) 
how to process the files        (-exec: run a process against a selected file) 
the name of the file(s)         (-name) 
perform logical operations on selections (-o and -a) 
Search for file with a specific name in a set of files (-name) 


EXAMPLES
--------

# find . -name "rc.conf" -print 

This command will search in the current directory and all sub directories for a file named rc.conf. 
Note: The -print option will print out the path of any file that is found with that name. In general -print wil 
print out the path of any file that meets the find criteria. 

# find . -name "rc.conf" -exec chmod o+r '{}' \; 

This command will search in the current directory and all sub directories. All files named rc.conf will be processed 
by the chmod -o+r command. The argument '{}' inserts each found file into the chmod command line. 
The \; argument indicates the exec command line has ended. 
The end results of this command is all rc.conf files have the other permissions set to read access
(if the operator is the owner of the file). 

# find . -exec grep "www.athabasca" '{}' \; -print 

This command will search in the current directory and all sub directories. 
All files that contain the string will have their path printed to standard output. 

# find / -xdev -size +2048 -ls | sort -r +6 

This command will find all files in the root directory larger than 1 MB.

# find .  -exec grep "CI_ADJ_TYPE" {} \; -print

This command search all subdirs all files to find text CI_ADJ_TYPE


Other examples:
---------------
# find . -name file -print
# find / -name $1 -exec ls -l {} \;

# find / -user nep -exec ls -l {} \; >nepfiles.txt
In English: search from the root directory for any files owned by nep 
and execute an ls -l on the file when any are found. 
Capture all output in nepfiles.txt.

# find $HOME -name \*.txt -print
In order to protect the asterisk from being expanded by the shell, 
it is necessary to use a backslash to escape the asterisk as in:

# find / -atime +30 -print
This prints files that have not been accessed in the last 30 days

# find / -atime +100 -size +500000c -print
The find search criteria can be combined. This command will locate and list all files 
that were last accessed more than 100 days ago, and whose size exceeds 500,000 bytes.

# find /opt/bene/process/logs -name 'ALBRACHT*'  -mtime +90 -exec rm {} \;

# find /example /new/example -exec grep -l 'Where are you' {} \;
# find / \( -name a.out -o -name '*.o' \) -atime +7 -exec rm {} \;
# find . -name '*.trc' -mtime +3 -exec rm {} \;
# find / -fsonly hfs -print
# cd /; find . ! -path ./Disk -only -print | cpio -pdxm /Disk
# cd /; find . -path ./Disk -prune -o -print | cpio -pdxm /Disk
# cd /;  find . -xdev -print | cpio -pdm /Disk
# find  -type f -print | xargs chmod 444
# find  -type d -print | xargs chmod 555
# find . -atime +1 -name '*' -exec rm -f {} \; 
# find /tmp -atime +1 -name '*' -exec rm -f {} \; 
# find /usr/tmp -atime +1 -name '*' -exec rm -f {} \; 
# find / -name core -exec rm -f {} \; 
# find . -name "*.dbf" -mtime -2 -exec ls {} \;


* Search and list all files from current directory and down for the string ABC:
find ./ -name "*" -exec grep -H ABC {} \;
find ./ -type f -print | xargs grep -H "ABC" /dev/null
egrep -r ABC *
* Find all files of a given type from current directory on down:
find ./ -name "*.conf" -print
* Find all user files larger than 5Mb:
find /home -size +5000000c -print
* Find all files owned by a user (defined by user id number. see /etc/passwd) on the system: (could take a very long time)
find / -user 501 -print
* Find all files created or updated in the last five minutes: (Great for finding effects of make install)
find / -cmin -5
* Find all users in group 20 and change them to group 102: (execute as root)
find / -group 20 -exec chown :102 {} \;
* Find all suid and setgid executables:
find / \( -perm -4000 -o -perm -2000 \) -type f -exec ls -ldb {} \;
find / -type f -perm +6000 -ls


Example:
--------

cd /database/oradata/pegacc/archive
archdir=`pwd`
if [ $archdir=="/database/oradata/pegacc/archive" ]
   then
      find . -name "*.dbf" -mtime +5 -exec rm {} \;
   else
      echo "error in onderhoud PEGACC archives" >> /opt/app/oracle/admin/log/archmaint.log
fi


Example:
--------

The following example shows how to find files larger than 400 blocks in the current directory:

# find . -size +400 -print


REAL COOL EXAMPLE:
------------------

This example could even help in recovery of a file:

In some rare cases a strangely-named file will show itself in your directory and appear to be 
un-removable with the rm command. Here is will the use of ls -li and find with its -inum [inode] 
primary does the job. 
Let's say that ls -l shows your irremovable as 

-rw-------  1 smith  smith  0 Feb  1 09:22 ?*?*P

Type: 

ls -li

to get the index node, or inode. 

153805 -rw-------  1 smith  smith  0 Feb  1 09:22 ?*?^P

The inode for this file is 153805. Use find -inum [inode] to make sure that the file is correctly identified. 


%  find -inum 153805 -print
./?*?*P

Here, we see that it is. Then used the -exec functionality to do the remove. . 
  
% find . -inum 153805 -print -exec /bin/rm {} \;

Note that if this strangely named file were not of zero-length, it might contain accidentally misplaced 
and wanted data. Then you might want to determine what kind of data the file contains and move the file 
to some temporary directory for further investigation, for example: 

% find . -inum 153805 -print -exec /bin/mv {} unknown.file \;

Will rename the file to unknown.file, so you can easily inspect it. 


COOL EXAMPLE: Using find and cpio to create really good backups:
----------------------------------------------------------------

Suppose you have a lot of subdirs and files in "/dir1/dira"
Now you want to copy, or backup, this to "/dir2/dirb"
And not only just the files and subdirs, BUT ALSO all filemodes (permissions), ownership information, acl's etc..

Then DO NOT USE "cp -R" or something similar. Instead use "find" in combination with the "cpio" backup command.

# cd /dir1/dira
# find . | cpio -pvdm /dir2/dirb


Note: difference betweeen mtime and atime:
------------------------------------------

In using the find command where you want to delete files older than a certain date, you can use
commands like
find . -name "*.log" -mtime +30 -exec rm {} \;   or
find . -name "*.dbf" -atime +30 -exec rm {} \;

Why should you choose, or not choose, between atime and mtime?

It is important to distinguish between a file or directory's change time (ctime), access time (atime), 
and modify time (mtime).

ctime -- In UNIX, it is not possible to tell the actual creation time of a file. The ctime--change time--
         is the time when changes were made to the file or directory's inode (owner, permissions, etc.). 
         The ctime is also updated when the contents of a file change. It is needed by the dump command 
         to determine if the file needs to be backed up. You can view the ctime with the ls -lc command.

atime -- The atime--access time--is the time when the data of a file was last accessed. Displaying the contents 
         of a file or executing a shell script will update a file's atime, for example. 

mtime -- The mtime--modify time--is the time when the actual contents of a file was last modified. 
         This is the time displayed in a long directoring listing (ls -l).

Thats why backup utilities use the mtime when performing incremental backups:
When the utility reads the data for a file that is to be included in a backup, it does not 
affect the file's modification time, but it does affect the file's access time. 

So for most practical reasons, if you want to delete logfiles (or other files) older than a certain
date, its best to use the mtime attribute.

How to make those times visible?

"ls -l"   shows atime
"ls -lc"  shows ctime
"ls -lm"  shows mtime

"istat filename" will show all three.

pago-am1:/usr/local/bb>istat bb18b3.tar.gz
Inode 20 on device 10/9 File
Protection: rw-r--r--   
Owner: 100(bb)          Group: 100(bb)
Link count:   1         Length 427247 bytes

Last updated:   Tue Aug 14 11:01:46 2001
Last modified:  Thu Jun 21 07:36:32 2001
Last accessed:  Thu Nov 01 20:38:46 2001


===================
7. Crontab command:
===================

Cron is uded to schedule or run periodically all sorts of executable programs or shell scripts,
like backupruns, housekeeping jobs etc..
The crond daemon makes it all happen.

Who has access to cron, is on most unixes determined by the "cron.allow" and "cron.deny" files.
Every allowed user, can have it's own "crontab" file.
The crontab of root, is typically used for system administrative jobs.

On most unixes the relevant files can be found in:
/var/spool/cron/crontabs     or 
/var/adm/cron                or 
/etc/cron.d

For example, on Solaris the /var/adm/cron/cron.allow and /var/adm/cron/cron.deny files control 
which users can use the crontab command. 

Most common usage:

- if you just want a listing:     crontab -l
- if you want to edit and change: crontab -e

crontab [ -e | -l | -r | -v | File ]
 
-e: edit, submit  -r remove, -l list

A crontab file contains entries for each cron job. Entries are separated by newline characters. 
Each crontab file entry contains six fields separated by spaces or tabs in the following form:

 
  minute  hour  day_of_month  month  weekday  command

  0       0     *             8       *       /u/harry/bin/maintenance


Notes:
------

Note 1: start and stop cron:
----------------------------

-- Solaris and some other unixes:

The proper way to stop and restart cron are:

# /etc/init.d/cron stop
# /etc/init.d/cron start

In Solaris 10 you could use the following command as well:
# svcadm refresh cron
# svcadm restart cron

-- Other way to restart cron:

In most unixes, cron is started by init and there is a record in the /etc/initab file
which makes that happen. Check if your system has indeed a record of cron in the inittab file.
The type of start should be "respawn", which means that should the
superuser do a "kill -9 crond", the cron daemon is simply restarted again.
Again, preferrably, there should be a stop and start script to restart cron.

Especially on AIX, there is no true way to restart cron in a neat way. Not via the Recourse Control startscr command, 
or script, a standard method is available. Just kill crond and it will be restarted.

-- On many linux distros:
 
to restart the cron daemon, you could do either a "service crond restart" or a "service 
crond reload". 


Note 2:
-------

Create a cronjobs file
You can do this on your local computer in Notepad or you can create the file directly on 
your Virtual Server using your favorite UNIX text editor (pico, vi, etc). 
Your file should contain the following entries: 

    MAILTO="USER@YOUR-DOMAIN.NAME"
    0 1 1 1-12/3 *   /usr/local/bin/vnukelog


This will run the command "/usr/local/bin/vnukelog" (which clears all of your log files) at 
1 AM on the first day of the first month of every quarter, or January, April, July, and October (1-12/3). 
Obviously, you will need to substitute a valid e-mail address in the place of "USER@YOUR-DOMAIN.NAME". 

If you have created this file on your local computer, 
FTP the file up to your Virtual Server and store it in your home directory under the name 
"cronjobs" (you can actually use any name you would like). 


Register your cronjobs file with the system
After you have created your cronjobs file (and have uploaded it to your Virtual Server if applicable), 
you need to Telnet to your server and register the file with the cron system daemon. To do this, simply type: 
    crontab cronjobs 

Or if you used a name other than "cronjobs", substitute the name you selected for the occurrence of "cronjobs" above. 


Note 3:
-------
# use /bin/sh to run commands, no matter what /etc/passwd says
SHELL=/bin/sh
# mail any output to `paul', no matter whose crontab this is
MAILTO=paul
#
# run five minutes after midnight, every day
5 6-18 * * *       /opt/app/oracle/admin/scripts/grepora.sh
# run at 2:15pm on the first of every month -- output mailed to paul
15 14 1 * *     $HOME/bin/monthly
# run at 10 pm on weekdays, annoy Joe
0 22 * * 1-5   mail -s "It's 10pm" joe%Joe,%%Where are your kids?%
23 0-23/2 * * * echo "run 23 minutes after midn, 2am, 4am ..., everyday"
5 4 * * sun     echo "run at 5 after 4 every sunday"

2>&1 means:

It means that standard error is redirected along with standard output. Standard error
could be redirected to a different file, like
ls > toto.txt 2> error.txt If your shell is csh or tcsh, you would redirect standard
output and standard error like this
lt >& toto.txt Csh or tcsh cannot redirect standard error separately.

Note 4:
-------

thread

Q:

> Isn't there a way to refresh cron to pick up changes made using 
> crontab -e? I made the changes but the specified jobs did not run. 
> I'm thinking I need to refresh cron to pick up the changes. Is this 
> true? Thanks. 

A:

Crontab -e should do that for you, that's the whole point of using 
it rather than editing the file yourself. 
Why do you think the job didn't run? 
Post the crontab entry and the script. Give details of the version of 
Tru64 and the patch level. 
Then perhaps we can help you to figure out the real cause of the problem. 
Hope this helps 

A:

I have seen the following problem when editing the cron file for another 
user: 

crontab -e idxxxxxx 

This changed the control file, 
when I verified with crontab -l the contents was correctly shown, 
but the cron daemon did not execute the new contents. 

To solve the problem, I needed to follow the following commands: 

su - idxxxxxx 
crontab -l |crontab 

This seems to work ... since then I prefer the following 

su - idxxxxxx 
crontab -e 

which seems to work also ... 


Note 5:
-------

On AIX it is observed, that if the "daemon=" attribute of a user is set to be false,
this user cannot use crontab, even if the account is placed in cron.allow.

You need to set the attribute to "daemon=true".

* daemon        Defines whether the user can execute programs using the system
*               resource controller (SRC). Possible values: true or false.

Note 6:
-------

If you want to quick test the crontab of a user:

su - user
and put the following in the crontab of that user:

* * * * *  date >/tmp/elog

After checking the /tmp/elog file, which will rapidly fills with dates, don't forget
to remove the crontab entry shown above.


Note 7: the at and atq commands:
--------------------------------

On many unix systems the scheduling "at" command and "atq" commands are available.
With "at", you can schedule commands, and with "atq" you can view all your, or other users, scheduled tasks.

atq- Display the jobs queued to run at specified times


For example, on Solaris:

The at command is used to schedule jobs for execution at a later time. Unlike crontab, which schedules a job to happen at regular intervals, 
a job submitted with at executes once, at the designated time.

To submit an at job, type at followed by the time that you would like the program to execute. You'll see the at> prompt displayed and it's here 
that you enter the at commands. When you are finished entering the at command, press control-d to exit the at prompt 
and submit the job as shown in the following example:

# at 07:45am today
at> who > /tmp/log
at> <Press Control-d>

job 912687240.a at Thu Jun 6 07:14:00

When you submit an at job, it is assigned a job identification number, which becomes its filename along with the .a extension. 
The file is stored in the /var/spool/cron/atjobs directory. In much the same way as it schedules crontab jobs, 
the cron daemon controls the scheduling of at files.


===========================
8. Job control, background:
===========================

To put a sort job (or other job) in background:
# sort < foo > bar &

To show jobs:
# jobs

To show processes:
# ps
# ps -ef | grep ora

Job in foreground -> background:
Ctrl-Z (suspend)
#bg  or bg jobID

Job in background -> foreground:
# fg %jobid

Stop a process:
# kill -9 3535   (3535 is the pid, process id)

Stop a background process you may try this:
# kill -QUIT 3421


-- Kill all processes of a specific users:
-- --------------------------------------- 

To kill all processes of a specific user, enter: 
# ps -u [user-id] -o pid | grep -v PID | xargs kill -9 

Another way: 
Use who to check out your current users and their terminals. Kill all processes related to a specific terminal:
# fuser -k /dev/pts[#] 

Yet another method: 
Su to the user-id you wish to kill all processes of and enter:
# su - [user-id] -c kill -9 -1 

Or su - to that userid, and use the killall command, which is available on most unix'es, like for example AIX.
# killall


So in order to kill all processes of a user:

# kill -9 -1            # not on all unixes

or

# killall               # not on all unixes


The nohup command:
------------------

When working with the UNIX operating system, there will be times when you will want to run commands that are immune 
to log outs or unplanned login session terminations.  This is especially true for UNIX system administrators.  
The UNIX command for handling this job is the nohup (no hangup) command.

Normally when you log out, or your session terminates unexpectedly, the system will kill all processes you have started.  
Starting a command with nohup counters this by arranging for all stopped, running, and background jobs to ignore 
the SIGHUP signal.

The syntax for nohup is:
nohup command [arguments]
 
You may optionally add an ampersand to the end of the command line to run the job in the background:
nohup command [arguments] &

If you do not redirect output from a process kicked off with nohup, both standard output (stdout) and 
standard error (stderr) are sent to a file named nohup.out.  This file will be created in $HOME (your home directory) 
if it cannot be created in the working directory.  Real-time monitoring of what is being written to nohup.out 
can be accomplished with the "tail -f nohup.out" command.

Although the nohup command is extremely valuable to UNIX system administrators, it is also a must-know tool 
for others who run lengthy or critical processes on UNIX systems 

The nohup command runs the command specified by the Command parameter and any related Arg parameters, 
ignoring all hangup (SIGHUP) signals. Use the nohup command to run programs in the background after logging off. 
To run a nohup command in the background, add an & (ampersand) to the end of the command.

Whether or not the nohup command output is redirected to a terminal, the output is appended to the nohup.out file 
in the current directory. If the nohup.out file is not writable in the current directory, the output is redirected 
to the $HOME/nohup.out file. If neither file can be created nor opened for appending, the command specified 
by the Command parameter is not invoked. If the standard error is a terminal, all output written by the 
named command to its standard error is redirected to the same file descriptor as the standard output.

To run a command in the background after you log off, enter: 
$ nohup find / -print &

After you enter this command, the following is displayed: 
670
$ Sending output to nohup.out
The process ID number changes to that of the background process started by & (ampersand). The message Sending 
output to nohup.out informs you that the output from the find / -print command is in the nohup.out file. 
You can log off after you see these messages, even if the find command is still running. 

Example of ps -ef on a AIX5 system:

[LP 1]root@ol16u209:ps -ef
     UID   PID  PPID   C    STIME    TTY  TIME CMD
    root     1     0   0   Oct 17      -  0:00 /etc/init
    root  4198     1   0   Oct 17      -  0:00 /usr/lib/errdemon
    root  5808     1   0   Oct 17      -  1:15 /usr/sbin/syncd 60
  oracle  6880     1   0 10:27:26      -  0:00 ora_lgwr_SPLDEV1
    root  6966     1   0   Oct 17      -  0:00 /usr/ccs/bin/shlap
    root  7942 43364   0   Oct 17      -  0:00 sendmail: accepting connections
 alberts  9036  9864   0 20:41:49      -  0:00 sshd: alberts@pts/0
    root  9864 44426   0 20:40:21      -  0:00 sshd: alberts [priv]
    root 27272 36280   1 20:48:03  pts/0  0:00 ps -ef
  oracle 27856     1   0 10:27:26      -  0:01 ora_smon_SPLDEV1
  oracle 31738     1   0 10:27:26      -  0:00 ora_dbw0_SPLDEV1
  oracle 31756     1   0 10:27:26      -  0:00 ora_reco_SPLDEV1
 alberts 32542  9036   0 20:41:49  pts/0  0:00 -ksh
 maestro 33480 34394   0 05:59:45      -  0:00 /prj/maestro/maestro/bin/batchman -parm 32000
    root 34232 33480   0 05:59:45      -  0:00 /prj/maestro/maestro/bin/jobman
 maestro 34394 45436   0 05:59:45      -  0:00 /prj/maestro/maestro/bin/mailman -parm 32000 -- 2002 OL16U209 CONMAN UNIX 6.
    root 34708     1   0 13:55:51   lft0  0:00 /usr/sbin/getty /dev/console
  oracle 35364     1   0 10:27:26      -  0:01 ora_cjq0_SPLDEV1
  oracle 35660     1   0 10:27:26      -  0:04 ora_pmon_SPLDEV1
    root 36280 32542   0 20:45:06  pts/0  0:00 -ksh
    root 36382 43364   0   Oct 17      -  0:00 /usr/sbin/rsct/bin/IBM.ServiceRMd
    root 36642 43364   0   Oct 17      -  0:00 /usr/sbin/rsct/bin/IBM.CSMAgentRMd
    root 36912 43364   0   Oct 17      -  0:03 /usr/opt/ifor/bin/i4lmd -l /var/ifor/logdb -n clwts
    root 37186 43364   0   Oct 17      -  0:00 /etc/ncs/llbd
    root 37434 43364   0   Oct 17      -  0:17 /usr/opt/ifor/bin/i4llmd -b -n wcclwts -l /var/ifor/llmlg
    root 37738 37434   0   Oct 17      -  0:00 /usr/opt/ifor/bin/i4llmd -b -n wcclwts -l /var/ifor/llmlg
    root 37946     1   0   Oct 17      -  0:00 /opt/hitachi/HNTRLib2/bin/hntr2mon -d
  oracle 38194     1   0   Oct 17      -  0:00 /prj/oracle/product/9.2.0.3/bin/tnslsnr LISTENER -inherit
    root 38468 43364   0   Oct 17      -  0:00 /usr/sbin/rsct/bin/IBM.AuditRMd
    root 38716     1   0   Oct 17      -  0:00 /usr/bin/itesmdem itesrv.ini /etc/IMNSearch/search/
  imnadm 39220     1   0   Oct 17      -  0:00 /usr/IMNSearch/httpdlite/httpdlite -r /etc/IMNSearch/httpdlite/httpdlite.con
    root 39504 36912   0   Oct 17      -  0:00 /usr/opt/ifor/bin/i4lmd -l /var/ifor/logdb -n clwts
    root 39738 43364   0   Oct 17      -  0:01 /usr/DynamicLinkManager/bin/dlmmgr
    root 40512 43364   0   Oct 17      -  0:01 /usr/sbin/rsct/bin/rmcd -r
    root 40784 43364   0   Oct 17      -  0:00 /usr/sbin/rsct/bin/IBM.ERrmd
    root 41062     1   0   Oct 17      -  0:00 /usr/sbin/cron
     was 41306     1   0   Oct 17      -  2:10 /prj/was/java/bin/java -Xmx256m -Dwas.status.socket=32776 -Xms50m -Xbootclas
  oracle 42400     1   0 10:27:26      -  0:02 ora_ckpt_SPLDEV1
    root 42838     1   0   Oct 17      -  0:00 /usr/sbin/uprintfd
    root 43226 43364   0   Oct 17      -  0:00 /usr/sbin/nfsd 3891
    root 43364     1   0   Oct 17      -  0:00 /usr/sbin/srcmstr
    root 43920 43364   0   Oct 17      -  0:00 /usr/sbin/aixmibd
    root 44426 43364   0   Oct 17      -  0:00 /usr/sbin/sshd -D
    root 44668 43364   0   Oct 17      -  0:00 /usr/sbin/portmap
    root 44942 43364   0   Oct 17      -  0:00 /usr/sbin/snmpd
    root 45176 43364   0   Oct 17      -  0:00 /usr/sbin/snmpmibd
 maestro 45436     1   0   Oct 17      -  0:00 /prj/maestro/maestro/bin/netman
    root 45722 43364   0   Oct 17      -  0:00 /usr/sbin/inetd
    root 45940 43364   0   Oct 17      -  0:00 /usr/sbin/muxatmd
    root 46472 43364   0   Oct 17      -  0:00 /usr/sbin/hostmibd
    root 46780 43364   0   Oct 17      -  0:00 /etc/ncs/glbd
    root 46980 43364   0   Oct 17      -  0:00 /usr/sbin/qdaemon
    root 47294     1   0   Oct 17      -  0:00 /usr/local/sbin/syslog-ng -f /usr/local/etc/syslog-ng.conf
    root 47484 43364   0   Oct 17      -  0:00 /usr/sbin/rpc.lockd
  daemon 48014 43364   0   Oct 17      -  0:00 /usr/sbin/rpc.statd
    root 48256 43364   0   Oct 17      -  0:00 /usr/sbin/rpc.mountd
    root 48774 43364   0   Oct 17      -  0:00 /usr/sbin/biod 6
    root 49058 43364   0   Oct 17      -  0:00 /usr/sbin/writesrv
[LP 1]root@ol16u209:


Another example of ps -ef on a AIX5 system:
# ps -ef

     UID     PID    PPID   C    STIME    TTY  TIME CMD
    root       1       0   0   Jan 23      -  0:33 /etc/init
    root   69706       1   0   Jan 23      -  0:00 /usr/lib/errdemon
    root   81940       1   0   Jan 23      -  0:00 /usr/sbin/srcmstr
    root   86120       1   2   Jan 23      - 236:39 /usr/sbin/syncd 60
    root   98414       1   0   Jan 23      -  0:00 /usr/ccs/bin/shlap64
    root  114802   81940   0   Jan 23      -  0:32 /usr/sbin/rsct/bin/IBM.CSMAgentRMd
    root  135366   81940   0   Jan 23      -  0:00 /usr/sbin/sshd -D
    root  139446   81940   0   Jan 23      -  0:07 /usr/sbin/rsct/bin/rmcd -r
    root  143438       1   0   Jan 23      -  0:00 /usr/sbin/uprintfd
    root  147694       1   0   Jan 23      -  0:26 /usr/sbin/cron
    root  155736       1   0   Jan 23      -  0:00 /usr/local/sbin/syslog-ng -f /usr/local/etc/syslog-ng.conf
    root  163996   81940   0   Jan 23      -  0:00 /usr/sbin/rsct/bin/IBM.ERrmd
    root  180226   81940   0   Jan 23      -  0:00 /usr/sbin/rsct/bin/IBM.ServiceRMd
    root  184406   81940   0   Jan 23      -  0:00 /usr/sbin/qdaemon
    root  200806       1   0   Jan 23      -  0:08 /opt/hitachi/HNTRLib2/bin/hntr2mon -d
    root  204906   81940   0   Jan 23      -  0:00 /usr/sbin/rsct/bin/IBM.AuditRMd
    root  217200       1   0   Jan 23      -  0:00 ./mflm_manager
    root  221298   81940   0   Jan 23      -  1:41 /usr/DynamicLinkManager/bin/dlmmgr
    root  614618       1   0   Apr 03   lft0  0:00 -ksh
 reserve 1364024 1548410   0 07:10:10  pts/0  0:00 -ksh
    root 1405140 1626318   1 08:01:38  pts/0  0:00 ps -ef
    root 1511556  614618   2 07:45:52   lft0  0:41 tar -cf /dev/rmt1.1 /spl
 reserve 1548410 1613896   0 07:10:10      -  0:00 sshd: reserve@pts/0
    root 1613896  135366   0 07:10:01      -  0:00 sshd: reserve [priv]
    root 1626318 1364024   1 07:19:13  pts/0  0:00 -ksh


Some more examples:

# nohup somecommand & sleep 1; tail -f preferred-name

# nohup make bzImage & 
# tail -f nohup.out

# nohup make modules 1> modules.out 2> modules.err & 
# tail -f modules.out 


==========================================
9. Backup commands, TAR, and Zipped files:
==========================================


For SOLARIS as well as AIX, and many other unix'es, the following commands can be used:
tar, cpio, dd, gzip/gunzip, compress/uncompress, backup and restore.


Very important:
If you will backup to tape, make sure you know what is your "rewinding" class and "nonrewinding" class
of your tapedevice.


9.1 tar: Short for "Tape Archiver":
===================================

Some examples should explain the usage of "tar" to create backups, or to create 
easy to transport .tar files.

Create a backup to tape device 0hc of file sys01.dbf
# tar -cvf /dev/rmt/0hc /u01/oradata/sys01.dbf
# tar -rvf /dev/rmt/0hc /u02/oradata/data_01.dbf

-c create 
-r append 
-x extract
-v verbose
-t list

Extract the contents of example.tar and display the files as they are extracted.
# tar -xvf example.tar  

Create a tar file named backup.tar from the contents of the directory /home/ftp/pub
# tar -cf backup.tar /home/ftp/pub  

list contents of example.tar to the screen
# tar -tvf example.tar  

to restore the file /home/bcalkins/.profile from the archive:
- First we do a backup: 
# tar -cvf /dev/rmt/0 /home/bcalkins
- And later we do a restore:
# tar -xcf /dev/rmt/0 /home/bcalkins/.profile

If you use an absolute path, you can only restore in "a like" destination directory.
If you use a relative path, you can restore in any directory.
In this case, use tar with a relative pathname, for example if you want to backup /home/bcalkins
change to that directory and use

# tar -cvf backup_oracle_201105.tar ./*


To extract the directory conv:

# tar -xvf /dev/rmt0 /u02/oradata/conv

Example:
--------

mt -f /dev/rmt1  rewind
mt -f /dev/rmt1.1 fsf 6
tar -xvf /dev/rmt1.1 /data/download/expdemo.zip


Most common errors messages with tar:
-------------------------------------

-- 0511-169: A directory checksum error on media: MediaName not equal to Number

Possible Causes
From the command line, you issued the tar command to extract files from an archive that was not created 
with the tar command.

-- 0511-193: An error occurred while reading from the media

Possible Causes
You issued the tar command to read an archive from a tape device that has a different block size 
than when the archive was created.

Solution:

# chdev -l rmt0 -a block_size=0

-- File too large:


Extra note of tar command on AIX:
---------------------------------

If you need to backup multiple large mountpoints to a large tape, you might think you
can use something like:

tar -cvf /dev/rmt1 /spl
tar -rvf /dev/rmt1 /prj
tar -rvf /dev/rmt1 /opt
tar -rvf /dev/rmt1 /usr
tar -rvf /dev/rmt1 /data
tar -rvf /dev/rmt1 /backups
tar -rvf /dev/rmt1 /u01/oradata
tar -rvf /dev/rmt1 /u02/oradata
tar -rvf /dev/rmt1 /u03/oradata
tar -rvf /dev/rmt1 /u04/oradata
tar -rvf /dev/rmt1 /u05/oradata

Actually on AIX this is not OK. The tape will rewind after each tar command, effectively
you will end up with ONLY the last backupstatement.

You should use the non-rewinding class instead, like for example:

tar -cf /dev/rmt1.1 /spl
tar -cf /dev/rmt1.1 /apps
tar -cf /dev/rmt1.1 /prj
tar -cf /dev/rmt1.1 /software
tar -cf /dev/rmt1.1 /opt
tar -cf /dev/rmt1.1 /usr
tar -cf /dev/rmt1.1 /data
tar -cf /dev/rmt1.1 /backups
#tar -cf /dev/rmt1.1 /u01/oradata
#tar -cf /dev/rmt1.1 /u02/oradata
#tar -cf /dev/rmt1.1 /u03/oradata
#tar -cf /dev/rmt1.1 /u04/oradata
#tar -cf /dev/rmt1.1 /u05/oradata

Use this table to decide on which class to use:

The following table shows the names of the rmt special files and their characteristics.

Special File Rewind on Close Retension on Open Density Setting 
/dev/rmt*    Yes             No                #1 
/dev/rmt*.1  No              No                #1 
/dev/rmt*.2  Yes             Yes               #1 
/dev/rmt*.3  No              Yes               #1 
/dev/rmt*.4  Yes             No                #2 
/dev/rmt*.5  No              No                #2 
/dev/rmt*.6  Yes             Yes               #2 
/dev/rmt*.7  No              Yes               #2 


To restore an item from a logical tape, use commands as in the following example:

mt -f /dev/rmt1  rewind
mt -f /dev/rmt1.1 fsf 2  in order to put the pointer to the beginning of block 3.

mt -f /dev/rmt1.1 fsf 7  in order to put the pointer to the beginning of block 8.

Now you can use a command like for example:

tar -xvf /dev/rmt1.1 /backups/oradb/sqlnet.log

Another example:

mt -f /dev/rmt1  rewind
mt -f /dev/rmt1.1 fsf 8
tar -xvf /dev/rmt1.1 /u01/oradata/spltrain/temp01.dbf


Tapedrives on Solaris:
----------------------

Tape dvices on Solaris are named like /dev/rmt/0 or /dev/rmt/1
The default is /dev/rmt0. This also configured in the "/kernel/drv/st.conf" file.
If you need to add support for a tape device, you need to modify this file.

First tape device name: /dev/rmt/0
Second tape device name: /dev/rmt/1

You can also add special character letter to specify density using following format
/dev/rmt/ZX

Z is tape drive number such as 0,1..n 
X can be any one of following (as supported by your device, read the manual of your tape device & controller to see if all of them supported or not): 
l - Low density 
m - Medium density 
h - High density 
u - Ultra density 
c - Compressed density 
n - No rewinding 
For example to specify the first, drive with high-density with no rewinding use device /dev/rmt/0hn.


First drive, rewinding 
 /dev/rmt/0 
 
First drive, nonrewinding 
 /dev/rmt/0n 
 
Second drive, rewinding 
 /dev/rmt/1 
 
Second drive, nonrewinding 
 /dev/rmt/1n 
 

Example Backupscript on AIX:
----------------------------

#!/usr/bin/ksh

# BACKUP-SCRIPT SPL SERVER PSERIES 550
# DIT IS DE PRIMAIRE BACKUP, NAAR DE TAPEROBOT RMT1.
# OPMERKING: ER LOOPT NAAST DEZE BACKUP, OOK NOG EEN BACKUP VAN DE
# /backup DISK NAAR DE INTERNE TAPEDRIVE RMT0.

# OMDAT WE NOG NIET GEHEEL IN BEELD HEBBEN OF WE VOORAF DE BACKUP APPLICATIES MOETEN
# STOPZETTEN, IS DIT SCRIPT NOG IN REVISIE.

# VERSIE: 0.1
# DATUM : 27-12-2005
# DOEL VAN HET SCRIPT:
#   - STOPPEN VAN DE APPLICATIES
#   - VERVOLGENS BACKUP NAAR TAPE
#   - STARTEN VAN DE APPLICATIES

# CONTROLEER VOORAF OF DE TAPELIBRARY GELADEN IS VIA "/opt/backupscripts/load_lib.sh"

BACKUPLOG=/opt/backupscripts/backup_to_rmt1.log
export BACKUPLOG

DAYNAME=`date +%a`;export DAYNAME
DAYNO=`date +%d`;export DAYNO


########################################
# 1. REGISTRATIE STARTTIJD IN EEN LOG  #
########################################

echo "-----------------" >> ${BACKUPLOG}
echo "Start Backup 550:" >> ${BACKUPLOG}
date >> ${BACKUPLOG}


########################################
# 2. STOPPEN APPLICATIES               #
########################################


#STOPPEN VAN ALLE ORACLE DATABASES
su - oracle -c "/opt/backupscripts/stop_oracle.sh"
sleep 30 

#STOPPEN VAN WEBSPHERE
cd /prj/was/bin
./stopServer.sh server1 -username admin01 -password vga88nt
sleep 30 

#SHUTDOWN ETM instances:
su - cissys -c '/spl/SPLDEV1/bin/splenviron.sh -e SPLDEV1 -c "spl.sh -t stop"'
sleep 2
su - cissys -c '/spl/SPLDEV2/bin/splenviron.sh -e SPLDEV2 -c "spl.sh -t stop"'
sleep 2
su - cissys -c '/spl/SPLCONF/bin/splenviron.sh -e SPLCONF -c "spl.sh -t stop"'
sleep 2
su - cissys -c '/spl/SPLPLAY/bin/splenviron.sh -e SPLPLAY -c "spl.sh -t stop"'
sleep 2
su - cissys -c '/spl/SPLTST3/bin/splenviron.sh -e SPLTST3 -c "spl.sh -t stop"'
sleep 2
su - cissys -c '/spl/SPLTST1/bin/splenviron.sh -e SPLTST1 -c "spl.sh -t stop"'
sleep 2
su - cissys -c '/spl/SPLTST2/bin/splenviron.sh -e SPLTST2 -c "spl.sh -t stop"'
sleep 2
su - cissys -c '/spl/SPLDEVP/bin/splenviron.sh -e SPLDEVP -c "spl.sh -t stop"'
sleep 2
su - cissys -c '/spl/SPLPACK/bin/splenviron.sh -e SPLPACK -c "spl.sh -t stop"'
sleep 2
su - cissys -c '/spl/SPLDEVT/bin/splenviron.sh -e SPLDEVT -c "spl.sh -t stop"'
sleep 2


#STOPPEN SSH DEMON
stopsrc -s sshd
sleep 2

date >> /opt/backupscripts/running.log
who >> /opt/backupscripts/running.log

########################################
# 3. BACKUP COMMANDS                   #
########################################


case $DAYNAME in
Tue) tapeutil -f /dev/smc0 move 256 4116
tapeutil -f /dev/smc0 move 4101 256     
;;
Wed) tapeutil -f /dev/smc0 move 256 4117
tapeutil -f /dev/smc0 move 4100 256    
;;
Thu) tapeutil -f /dev/smc0 move 256 4118
tapeutil -f /dev/smc0 move 4099 256      
;;
Fri) tapeutil -f /dev/smc0 move 256 4119
tapeutil -f /dev/smc0 move 4098 256      
;;
Sat) tapeutil -f /dev/smc0 move 256 4120
tapeutil -f /dev/smc0 move 4097 256    
;;
Mon) tapeutil -f /dev/smc0 move 256 4121
tapeutil -f /dev/smc0 move 4096 256
;;
esac

sleep 50

 
echo "Starten van de backup zelf" >> ${BACKUPLOG}
mt -f /dev/rmt1 rewind
tar -cf /dev/rmt1.1 /spl
tar -cf /dev/rmt1.1 /apps
tar -cf /dev/rmt1.1 /prj
tar -cf /dev/rmt1.1 /software
tar -cf /dev/rmt1.1 /opt
tar -cf /dev/rmt1.1 /usr
tar -cf /dev/rmt1.1 /data
tar -cf /dev/rmt1.1 /backups
tar -cf /dev/rmt1.1 /u01/oradata
tar -cf /dev/rmt1.1 /u02/oradata
tar -cf /dev/rmt1.1 /u03/oradata
tar -cf /dev/rmt1.1 /u04/oradata
tar -cf /dev/rmt1.1 /u05/oradata
tar -cf /dev/rmt1.1 /u06/oradata
tar -cf /dev/rmt1.1 /u07/oradata
tar -cf /dev/rmt1.1 /u08/oradata
tar -cf /dev/rmt1.1 /home
tar -cf /dev/rmt1.1 /backups3

sleep 10

# TIJDELIJKE ACTIE
date >> /opt/backupscripts/running.log
ps -ef | grep pmon >> /opt/backupscripts/running.log
ps -ef | grep BBL >> /opt/backupscripts/running.log
ps -ef | grep was >> /opt/backupscripts/running.log
who >> /opt/backupscripts/running.log
defragfs /prj

# EIND TIJDELIJKE ACTIE


########################################
# 4. STARTEN APPLICATIES               #
########################################

#STARTEN SSH DEMON
startsrc -s sshd
sleep 2

#STARTEN VAN ALLE ORACLE DATABASES
su - oracle -c "/opt/backupscripts/start_oracle.sh"
sleep 30

#STARTEN ETM instances:
su - cissys -c '/spl/SPLDEV1/bin/splenviron.sh -e SPLDEV1 -c "spl.sh -t start"'
sleep 2
su - cissys -c '/spl/SPLDEV2/bin/splenviron.sh -e SPLDEV2 -c "spl.sh -t start"'
sleep 2
su - cissys -c '/spl/SPLCONF/bin/splenviron.sh -e SPLCONF -c "spl.sh -t start"'
sleep 2
su - cissys -c '/spl/SPLPLAY/bin/splenviron.sh -e SPLPLAY -c "spl.sh -t start"'
sleep 2
su - cissys -c '/spl/SPLTST3/bin/splenviron.sh -e SPLTST3 -c "spl.sh -t start"'
sleep 2
su - cissys -c '/spl/SPLTST1/bin/splenviron.sh -e SPLTST1 -c "spl.sh -t start"'
sleep 2
su - cissys -c '/spl/SPLTST2/bin/splenviron.sh -e SPLTST2 -c "spl.sh -t start"'
sleep 2
su - cissys -c '/spl/SPLDEVP/bin/splenviron.sh -e SPLDEVP -c "spl.sh -t start"'
sleep 2
su - cissys -c '/spl/SPLPACK/bin/splenviron.sh -e SPLPACK -c "spl.sh -t start"'
sleep 2
su - cissys -c '/spl/SPLDEVT/bin/splenviron.sh -e SPLDEVT -c "spl.sh -t start"'
sleep 2


#STARTEN VAN WEBSPHERE
cd /prj/was/bin
./startServer.sh server1 -username admin01 -password vga88nt

sleep 30


########################################
# 5. REGISTRATIE EINDTIJD IN EEN LOG   #
########################################

#Laten we het tapenummer en einddtijd registreren in de log:

tapeutil -f /dev/smc0 inventory | head -88 | tail -2  >> ${BACKUPLOG}

echo "Einde backup 550:" >> ${BACKUPLOG}
date >> ${BACKUPLOG}


Some examples about day vars:
-----------------------------

DAYNAME=`date +%a`;export DAYNAME
echo $DAYNAME
Thu


DAYNO=`date +%d`;export DAYNO
echo $DAYNO
29

weekday=`date +%a%A`; export weekday
echo $weekday
ThuThursday

weekday=`date +%a-%A`
echo $weekday
Thu-Thursday

       %a
            Displays the locale's abbreviated weekday name.
       %A
            Displays the locale's full weekday name.
       %b
            Displays the locale's abbreviated month name.
       %B
            Displays the locale's full month name.
       %c
            Displays the locale's appropriate date and time representation. This is the default.
       %C
            Displays the first two digits of the four-digit year as a decimal number (00-99). A year is divided by 100 and truncated to an integer.
       %d
            Displays the day of the month as a decimal number (01-31). In a two-digit field, a 0 is used as leading space fill.
       %D
            Displays the date in the format equivalent to %m/%d/%y.
       %e
            Displays the day of the month as a decimal number (1-31). In a two-digit field, a blank space is used as leading space fill.


9.2 compress and uncompress:
============================

# compress -v bigfile.exe
Would compress bigfile.exe and rename that file to bigfile.exe.Z.

# uncompress *.Z            
would uncompress the files *.Z


9.3 gzip:
=========

To compress a file using gzip, execute the following command: 

# gzip filename.tar 

This will become filename.tar.gz

To decompress:

# gzip -d filename.tar.gz
# gunzip filename.tar.gz
# gzip -d users.dbf.gz


9.4 bzip2:
==========

#bzip2 filename.tar
This will become filename.tar.bz2


9.5 dd:
=======

Solaris:
--------

# dd if=<input file> of=<output file> <option=value>

to duplicate a tape:
# dd if=/dev/rmt/0 of=/dev/rmt/1

to clone a disk with the same geometry:
# dd if=/dev/rdsk/c0t1d0s2  of=/dev/rdsk/c0t4d0s2 bs=128

AIX:
----

same command syntax apply to IBM AIX. Here is an AIX pSeries machine with floppydrive example:

clone a diskette:

# dd if=/dev/fd0 of=/tmp/ddcopy
# dd if=/tmp/ddcopy of=/dev/fd0

Note: 

On Linux distros the device associated to the floppy drive is also /dev/fd0
 

9.6 cpio:
=========

solaris:
--------

cpio <mode><option>
copy-out: cpio -o
copy_in : cpio -i
pass    : cpio -p


#  cd /var/bigspace
#  cpio -idmv Linux9i_Disk1.cpio.gz
#  cpio -idmv Linux9i_Disk2.cpio.gz
#  cpio -idmv Linux9i_Disk3.cpio.gz

#  cpio -idmv < 9204_solaris_release.cpio

# cd /work
# ls -R | cpio -ocB > /dev/rmt/0

# cd /work
# cpio -icvdB < /dev/rmt/0     

d will create directories as needed
c will create header information in ascii format for portability
v verbose
c character heading in file

AIX:
----

AIX uses the same syntax. Usually, you should use the following command:

# cpio -idmv < filename.cpio


Copying directories with cpio:
------------------------------

cpio is very good in cloning directories, or making backups, because it copies files and directories
inclusive their ownership and permissions.

Example:
--------

Just cd to the directory that you want to clone and use a command similar to the following examples.

# find . -print | cpio -pdl /u/disk11/jdoe/fiber

# find . -print | cpio -pdm /a/dev

# find . -print | cpio -pdl /home/jim/newdir

# find . -print | cpio -pdmv /backups2/CONV2-0212

# find . -print | cpio -pdmv /backups2/SPLcobAS40

# find . -print | cpio -pdmv /backups2/SPLcobAS40sp2

# find . -print | cpio -pdmv /backups2/runtime/SPLTST2

The p in the flags, stands for pass-through

cd /spl/SPLDEV1
find . -print | cpio -pdmv /spl/SPLDEVT
find . -print | cpio -pdmv /backups2/data

# find . -print | cpio -pdmv /data/documentum/dmadmin/backup_1008/dba_cluster
# find . -print | cpio -pdmv /data/documentum/dmadmin/backup_1008/dmw_et3
# find . -print | cpio -pdmv /data/documentum/dmadmin/backup_1008/dmw_et
# find . -print | cpio -pdmv /data/documentum/dmadmin/backup_1508/dmw_eu
find . -print | cpio -pdmv /data/emcdctm/home2

find . -print | cpio -pdmv /data/documentum/dmadmin/backup_1809/dmw_et
find . -print | cpio -pdmv /data/documentum/dmadmin/backup_1809/dmw_et3


find . -print | cpio -pdmv /data/documentum/dmadmin/appl/l13appl
find . -print | cpio -pdmv /data/documentum/dmadmin/appl/l14appl
find . -print | cpio -pdmv /data/documentum/dmadmin/backup_3110/dmw_et
find . -print | cpio -pdmv /appl/emcdctm/dba_save_311007


Example:
--------

Use cpio copy-pass to copy a directory structure to another location:

# find path -depth -print | cpio -pamVd /new/parent/dir


Example:
--------

Become superuser or assume an equivalent role.
Change to the appropriate directory.

# cd filesystem1

Copy the directory tree from filesystem1 to filesystem2 by using a combination of the find and cpio commands. 

# find . -print -depth | cpio -pdm filesystem2


Example:
--------

Copying directories
Both cpio and tar may be used to copy directories while preserving ownership, permissions, and directory structure.

cpio example:
cd fromdir
find . | cpio -pdumv todir

tar example:
cd fromdir; tar cf - . | (cd todir; tar xfp -)

tar example over a compressed ssh tunnel:
tar cvf - fromdir | gzip -9c | ssh user@host 'cd todir; gzip -cd | tar xpf -'


Errors:
-------

Errors sometimes found with cpio:

cpio: 0511-903
cpio: 0511-904

1.Try using with -c option: cpio -imdcv < filename.cpio
 

9.7 the pax command:
====================

Same for AIX and SOLARIS.

The pax utility supports several archive formats, including tar and cpio.

The syntax for the pax command is as follows:

pax <mode> <options>

-r: Read mode .when -r is specified, pax extracts the filenames and directories found in the archive.
    The archive is read from disk or tape. If an extracted file is a directory, the hierarchy
    is extracted as well. The extracted files are created relative to the current directory.

None: List mode. When neither -r or -w is specified, pax displays the filenames and directories
      found in the archive file. The list is written to standard output.

-w: Write mode. If you want to create an archive, you use -w.
    Pax writes the contents of the file to the standard output in an archive format specified
    by the -x option.

-rw: Copy mode. When both -r and -w are specified, pax copies the specified files to
     the destination directory.


most important options:
-a = append to the end of an existing archive
-b = block size, multiple of 512 bytes
-c = you can specify filepatterns
-f = specifies the pathname of the input or output archive
-p <string> = aemo
              a does not preserve file access time
              e preserve everything: user id, group id, filemode bits, etc..
              m does not preserve file modification times
              o preserve uid and gid
              P preserve filemode bits
-x <format> = specifies the archive format. 
              
Examples:

To copy current directory contents to tape, use -w mode and -f
# pax -w -f /dev/rmt0

To list a verbose table of contents stored on tape rmt0, use None mode and f
# pax -v -f /dev/rmt0


9.8 pkzip25:
============

PKZIP Usage: 

Usage: pkzip25 [command] [options] zipfile [@list] [files...] 

Examples: 

     View .ZIP file contents: pkzip25 zipfile 

     Create a .ZIP file: pkzip25 -add zipfile file(s)... 

     Extract files from .ZIP: pkzip25 -extract zipfile 

These are only basic examples of PKZIP's capability 

About "-extract" switch:

extract  
extract files from a .ZIP file. Its a configurable switch.

-- all - all files in .ZIP file  
-- freshen - only files in the .ZIP file that exist in the target directory and that are "newer" than those files 
   will be extracted  
-- update - files in the .ZIP file which already exist in the target directory and that are "newer" than those files 
   as well as files that are "not" in the target directory will be extracted  

default = all

Example:

# pkzip25 -ext=up save.zip


9.9 SOLARIS: ufsdump and ufsrestore:
====================================

level 0 is an full backup, 1-9 are incremental backups

Examples:
---------

# ufsdump 0ucf /dev/rmt/0 /users
# ufsdump 0ucf sparc1:/dev/rmt/0 /export/home

# ufsrestore f /dev/rmt/0 filename
# ufsrestore rf sparc1:/dev/rmt/0 filename


9.10 AIX: mksysb:
================


The mksysb command creates an installable image of the rootvg. This is synonym to say that mksysb creates
a backup of the operating system (that is, the root volume group). 
You can use this backup to reinstall a system to its original state after it has been corrupted. 
If you create the backup on tape, the tape is bootable and includes the installation programs 
needed to install from the backup.

To generate a system backup and create an /image.data file (generated by the mkszfile command) to a tape device 
named /dev/rmt0, type: 
# mksysb -i /dev/rmt0

To generate a system backup and create an /image.data file with map files (generated by the mkszfile command) 
to a tape device named /dev/rmt1, type: 
# mksysb -m /dev/rmt1


To generate a system backup with a new /image.data file, but exclude the files in directory /home/user1/tmp, 
create the file "/etc/exclude.rootvg" containing the line /home/user1/tmp/, and type: 
# mksysb -i -e /dev/rmt1

This command will backup the /home/user1/tmp directory but not the files it contains.

To generate a system backup file named /mksysb_images/node1 and a new /image.data file for that image, type: 
# mksysb -i /userimage/node1

There will be four images on the mksysb tape, and the fourth image will contain ONLY rootvg JFS or JFS2
mounted file systems. The target tape drive must be local to create a bootable tape. 

The following is a description of mksysb's four images. 

  +---------------------------------------------------------+
  |  Bosboot  |  Mkinsttape  |  Dummy TOC  |    rootvg      |
  |   Image   |     Image    |    Image    |     data       |
  |-----------+--------------+-------------+----------------|
  |<----------- Block size 512 ----------->| Blksz defined  |
  |                                        | by the device  |
  +---------------------------------------------------------+ 


Special notes:
--------------

Note 1: mksysb problem
----------------------

Question:
I'm attempting to restore a mksysb tape to a system that only has 18GB of drive space available for the Rootvg. 
Does the mksysb try to restore these mirrored LVs, or does it just make one copy? 
If it is trying to rebuild the mirror, is there a way that I can get around that? 

Answer:
I had this same problem and received a successful resolution. I place those same tasks here:
1) Create a new image.data file, run mkszfile file.
2) Change the image.data as follows:
a) cd /
b) vi image.data
c) In each lv_data stanza of this file, change the values of the copies
line by one-half (i.e. copies = 2, change to copies = 1)
Also, change the number of Physical Volumes "hdisk0 hdisk1" to "hdisk0".
d) Save this file.
3) Create another mksysb from the command line that will utilize the newly edited image.data file by the command:
mksysb /dev/rmt0 (Do not use smit and do not run with the -i flag,
both will generate a new image.data file
4) Use this new mksysb to restore your system on other box without mirroring. 


Note 2: How to restore specific files from a mksysb tape:								
---------------------------------------------------------
							
$ tctl fsf 3								
$ restore -xvf /dev/rmt0.1 ./your/file/name								
								
For example, if you need to get the vi command back, put the mksysb tape in the tape drive 
(in this case, /dev/rmt0) and do the following:								
								
cd /                         # get to the root directory								
tctl -f /dev/rmt0 rewind     # rewind the tape								
tctl -f /dev/rmt0.1 fsf 3    # move the tape to the third file, no rewind								
restore -xqf /dev/rmt0.1 -s 1 ./usr/bin/vi    # extract the vi binary, no rewind								
								
Further explanation why you must use the fsf 3 (fast forward skip file 3):								
The format of the tape is as follows:								
1. A BOS boot image								
2. A BOS install image								
3. A dummy Table Of Contents								
4. The system backup of the rootvg								
								
So if you just need to restore some files, first forward the tape pointer to position 3, counting from 0.				


Note 3: How to restore specific files from a mksysb FILE
--------------------------------------------------------

See also note 2

view: restore -Tvqf [mksysb file] 
To restore: restore -xvqf [mksysb file] [file name] 


Note 4: How to restore a directory from a mksysb FILE
------------------------------------------------------


Simply using the restore command. 


restore -xvdf <mksysb.image> ./your/directory 


The dot at the front of the path is important. 
The "-d" flag indicates that this is a directory and everything in it should 
be restored. If you omit that, you'll restore an empty directory. 


The directory will be restored underneath whatever directory you're in. So 
if you're in your home directory it might create: 
/home/azhou/your/directory. 


With a mksysb image on disk you don't have any positioning to do, like with 
a tape. 


Note 5: Performing a mksysb migration with CD installation
----------------------------------------------------------

You can perform a mksysb migration with a CD installation of AIXr 5.3

Step 1. Prepare your system for installation:


Prepare for migrating to the AIX 5.3 BOS by completing the following steps:

- Insert the AIX Volume 1 CD into the CD-ROM device. 
- Shut down the target system. If your machine is currently running, power it off by following these steps:
    Log in as the root user. 
    Type shutdown -F. 
  If your system does not automatically power off, place the power switch in the Off (0) position. 
  Attention: You must not turn on the system unit until instructed to do so.

- Turn on all attached external devices. External devices include the following:
   Terminals 
   CD-ROM drives 
   DVD-ROM drives 
   Tape drives 
   Monitors 
   External disk drives 

Turning on the external devices first is necessary so that the system unit can identify each peripheral device 
during the startup (boot) process. 

- If your MKSYSB_MIGRATION_DEVICE is a tape, insert the tape for the mksysb in the tape drive. 
If your MKSYSB_MIGRATION_DEVICE is a CD or DVD, and there is an additional CD or DVD drive on the system 
(other than the one being used to boot AIX), insert the mksysb CD or DVD in the drive to avoid being 
prompted to swap medias. 

- Insert your customized bosinst.data supplemental diskette in the diskette drive. If the system does not 
have a diskette drive, use the network installation method for mksysb migration. 


Step 2. Boot from your installation media:


The following steps migrate your current version of the operating system to AIX 5.3. 
If you are using an ASCII console that was not defined in your previous system, you must define it. 
For more information about defining ASCII consoles, see Step 3. Setting up an ASCII terminal.

Turn the system unit power switch from Off (0) to On (|). 

When the system beeps twice, press F5 on the keyboard (or 5 on an ASCII terminal). If you have a graphics display, 
you will see the keyboard icon on the screen when the beeps occur. If you have an ASCII terminal 
(also called a tty terminal), you will see the word "keyboard" when the beeps occur. 
Note: If your system does not boot using the F5 key (or the 5 key on an ASCII terminal), refer to your 
hardware documentation for information about how to boot your system from an AIX product CD.

The system begins booting from the installation media. The mksysb migration installation proceeds 
as an unattended installation (non-prompted) unless the MKSYSB_MIGRATION_DEVICE is the same CD or DVD drive 
as the one being used to boot and install the system. In this case, the user is prompted to switch 
the product CD for the mksysb CD or DVD(s) to restore the image.data and the /etc/filesystems file. 
After this happens the user is prompted to reinsert the product media and the installation continues. 
When it is time to restore the mksysb image, the same procedure repeats. 

The BOS menus do not currently support mksysb migration, so they cannot be loaded. In a traditional migration, 
if there are errors that can be fixed by prompting the user for information through the menus, 
the BOS menus are loaded. If such errors or problems are encountered during mksysb migration, 
the installation asserts and an error stating that the migration cannot continue displays. 
Depending on the error that caused the assertion, information specific to the error might be displayed. 
If the installation asserts, the LED shows "088".


Note 6: create a mksysb tape MANUALLY
-------------------------------------


THIS NOTE DESCRIBES NOT A SUPPORTED METHOD, AND IS NOT CHECKED..

Here we do not mean the "mksysb -i /dev/rmtx" method, but...:

Question:
I have to clone a standalone 6H1 equipped with a 4mm tape, from
another 6H1 which is node of an SP and which does not own a tape !
The consequence is that my source mksysb is a file that is recorded in
/spdata/sys1/install/aixxxx/images

How will I copy this file to a tape to create the correct mksysb tape
that could be used to restore on my target machine ?

Answer:
using the following method in the case the two server are in the same
AIX level and kernel type (32/64 bits, jfs or jfs2)

- the both servers must communicate over an IP network and have .rhosts
file documented (for using rsh)

cp /var/adm/ras/bosinst.data /bosinst.data
mkszfile

copy these files (bosinst.data and image.data) under "/" on the remote
system

on the server:

tctl -f /dev/rmt0 status
if the block size is not 512:

# chdev -l /dev/rmt0 -a block_size=512
tctl -f /dev/rmt0 rewind
bosboot -a -d /dev/rmt0.1 

(create the boot image on the first file of mksysb)

mkinsttape /dev/rmt0.1 (create the second file on the
mksysb with image.data, bosinst.data, and oher files like drivers and
commands)

echo " Dummy tape TOC" | dd of=/dev/rmt0.1 conv=sync bs=512 > /dev/null
2>&1 (create the third file "dummy toc")


create a named pipe:

mknod /tmp/pipe p

and run the mksysb as this:

dd if=/tmp/pipe | rsh "server_hostname" dd of=/dev/rmt0.1 &
mksysb /tmp/pipe

this last command create the fourth file with "rootvg" in backup/restore
format


Note 7: Creating a root volume group backup on CD or DVD with the ISO9660 format
--------------------------------------------------------------------------------

Follow this procedure to create a root volume group backup on CD or DVD with the ISO9660 format.

You can use Web-based System Manager or SMIT to create a root volume group backup on CD or DVD with the 
ISO9660 format, as follows:

Use the Web-based System Manager Backup and Restore application and select System backup wizard method. 
This method lets you create bootable or non-bootable backups on CD-R, DVD-R, or DVD-RAM media. 
OR

To create a backup to CD, use the smit mkcd fast path. 
To create a backup to DVD, use the smit mkdvd fast path and select ISO9660 (CD format). 

The following procedure shows you how to use SMIT to create a system backup to CD. 
(The SMIT procedure for creating a system backup to an ISO9660 DVD is similar to the CD procedure.) 
Type the smit mkcd fast path. The system asks whether you are using an existing mksysb image. 
Type the name of the CD-R device. (This can be left blank if the Create the CD now? field is set to no.) 
If you are creating a mksysb image, select yes or no for the mksysb creation options, Create map files? 
and Exclude files?. Verify the selections, or change as appropriate. 
The mkcd command always calls the mksysb command with the flags to extend /tmp.

You can specify an existing image.data file or supply a user-defined image.data file. See step 16.

Enter the file system in which to store the mksysb image. This can be a file system that you created in the rootvg, 
in another volume group, or in NFS-mounted file systems with read-write access. If this field is left blank, 
the mkcd command creates the file system, if the file system does not exist, and removes it when the command completes. 

Enter the file systems in which to store the CD or DVD file structure and final CD or DVD images. These can be 
file systems you created in the rootvg, in another volume group, or in NFS-mounted file systems. If these fields 
are left blank, the mkcd command creates these file systems, and removes them when the command completes, 
unless you specify differently in later steps in this procedure. 

If you did not enter any information in the file systems' fields, you can select to have the mkcd command either 
create these file systems in the rootvg, or in another volume group. If the default of rootvg is chosen 
and a mksysb image is being created, the mkcd command adds the file systems to the exclude file and calls 
the mksysb command with the -e exclude files option. 

In the Do you want the CD or DVD to be bootable? field, select yes to have a boot image created on the 
CD or DVD. If you select no, you must boot from a product CD at the same version.release.maintenance level, 
and then select to install the system backup from the system backup CD. 

If you change the Remove final images after creating CD? field to no, the file system for the CD images 
(that you specified earlier in this procedure) remains after the CD has been recorded. 

If you change the Create the CD now? field to no, the file system for the CD images (that you specified earlier 
in this procedure) remains. The settings that you selected in this procedure remain valid, but the CD is not 
created at this time. 

If you intend to use an Install bundle file, type the full path name to the bundle file. The mkcd command copies 
the file into the CD file system. You must have the bundle file already specified in the BUNDLES field, 
either in the bosinst.data file of the mksysb image or in a user-specified bosinst.data file. When this 
option is used to have the bundle file placed on the CD, the location in the BUNDLES field of the bosinst.data 
file must be as follows: 
/../usr/sys/inst.data/user_bundles/bundle_file_name

To place additional packages on the CD or DVD, enter the name of the file that contains the packages list 
in the File with list of packages to copy to CD field. The format of this file is one package name per line. 
If you are planning to install one or more bundles after the mksysb image is restored, follow the directions 
in the previous step to specify the bundle file. You can then use this option to have packages listed 
in the bundle available on the CD. If this option is used, you must also specify the location of installation 
images in the next step.

Enter the location of installation images that are to be copied to the CD file system (if any) in the Location 
of packages to copy to CD field. This field is required if additional packages are to be placed on the CD 
(see the previous step). The location can be a directory or CD device. 

You can specify the full path name to a customization script in the Customization script field. If given, 
the mkcd command copies the script to the CD file system. You must have the CUSTOMIZATION_FILE field already set 
in the bosinst.data file in the mksysb image or else use a user-specified bosinst.data file with the CUSTOMIZATION_FILE field set. The mkcd command copies this file to the RAM file system. Therefore, the path in the CUSTOMIZATION_FILE field must be as follows: 
/../filename

You can use your own bosinst.data file, rather than the one in the mksysb image, by typing the full path name 
of your bosinst.data file in the User supplied bosinst.data file field. 
To turn on debugging for the mkcd command, set Debug output? to yes. The debug output goes to the smit.log. 
You can use your own image.data file, rather than the image.data file in the mksysb image, by typing the 
full path name of your image.data file for the User supplied image.data file field. 


Note 8: 0301-150 bosboot: Invalid or no boot device specified!
--------------------------------------------------------------


== Technote:

APAR status
Closed as program error.

Error description 

On a system, that does not have tape support
installed, running mkszfile will show the
following error:
0301-150 bosboot: Invalid or no boot device
specified.

Local fix 
Install device support for scsi tape devices.

Problem summary 
Error message when creating backup if devices.scsi.tape.rte
not installed even if the system does not have a tape drive.

Problem conclusion 
Redirect message to /dev/null.

Temporary fix 
Ignore message.

Comments 
APAR information 
APAR number IY52551 IY95261
Reported component name AIX 5L POWER V5 
Reported component ID 5765E6200 
Reported release 520 
Status CLOSED PER 
PE NoPE 
HIPER NoHIPER 
Submitted date 2004-01-12 
Closed date 2004-01-12 
Last modified date 2004-02-27 


== Technote:

APAR status
Closed as program error.

Error description 
If /dev/ipldevice is missing, mksfile will show the
bosboot usage statement.

  0301-150 bosboot: Invalid or no boot device
           specified!
Local fix 
Problem summary 
If /dev/ipldevice is missing, mksfile will show the
bosboot usage statement.

  0301-150 bosboot: Invalid or no boot device
           specified!

Problem conclusion 
Do not run bosboot against /dev/ipldevice.

Temporary fix 
Comments 

APAR information 
APAR number IY95261 
Reported component name AIX 5.3 
Reported component ID 5765G0300 
Reported release 530 
Status CLOSED PER 
PE NoPE 
HIPER NoHIPER 
Submitted date 2007-02-22 
Closed date 2007-02-22 
Last modified date 2007-06-06 

APAR is sysrouted FROM one or more of the following:

APAR is sysrouted TO one or more of the following:

Publications Referenced


Fix information 
Fixed component name AIX 5.3 
Fixed component ID 5765G0300 


== thread:

Q:

> 
> Someone out there knows the fix for this one; if you get a moment, would you 
> mind giving me the fix? 
> 
> 
> # mksysb -i /dev/rmt0 
> 
> /dev/ipldevice not found 
> 

A:

The ipldevice file is probably deleted from your /dev directory, or 
point to wrong 
entry. The '/dev/ipldevice' file is (re)created in boot time 2nd 
phase. For additional 
information look into /sbin/rc.boot script... The ipldevice entry 
type is hardlink. Usually point to /dev/rhdiskN, assuming that boot 
device is hdiskN. 
Check your system and you should got similar ... 
find /dev -links 2 -ls 
.... 
8305 0 crw------- 2 root system 14, 1 Feb 20 2005 /dev/rhdisk0 
8305 0 crw------- 2 root system 14, 1 Feb 20 2005 /dev/ipldevice 
... 
(The first cloumn of the output is the inode number) 

So, you can recreate the wrong, or missing ipdevice file. 
'bootinfo -b' says the physical boot device name. 
For exapmle: 
ln -f /dev/rhdisk0 /dev/ipldevice 

I hope this will solve your bosboot problem. 


Q:

I was installing Atape driver and noticed bosboot failure when installp 
calls bosboot with /dev/ipldevice. Messages below: 

0503-409 installp: bosboot verification starting... 
0503-497 installp: An error occurred during bosboot verification 
processing. 

Inspection of /dev showed no ipldevice file 

I was able to easily recreate the /dev/ipldevice using 

ln /dev/rhdisk0 /dev/ipldevice 

then successfully install the Atape driver software. 

After reboot /dev/ipldevice is missing again???. 

Environment is p5 520 AIX 5.3 ML1 
mirrored internal drives hdisk0 and hdisk1 in rootvg 

I have 5.3 ML2 (but have not applied yet) 
I don't see any APAR's in ML2 regarding /dev/ipldevice problems.

A:

Are you using EMC disk? There is a known problem with the later 
Powerpath versions where the powerpath startup script removes the 
/dev/ipldevice file if there is more than one device listed in the 
bootlist. 

A:

Yes, running EMC PowerPath 4.3 for AIX, with EMC Clariion CX600 Fibre 
disks attached to SAN. I always boot from, and mirror the OS on IBM 
internal disks. We order 4 internal IBM drives. Two for primary OS and 
mirror, the other two for alt_disk and mirrors. 

Thanks for the tip. I will investigate at EMC Powerlink site for fix. I 
know PowerPath 4.4 for AIX is out, but still pretty new.


A:

ipldevice is a link to the rawdevice (rhdisk0 , not hdisk0) 


-----Original Message----- 
From: IBM AIX Discussion List [mailto:aix-l@Princeton.EDU] On Behalf Of 
Robert Miller 
Sent: Wednesday, April 07, 2004 6:13 PM 
To: aix-l@Princeton.EDU 
Subject: Re: 64 Bit Kernel 


It may be one of those odd IBMisms where they want to call something a 
certain name so they put it in as a link to the actual critter... 

Looking on my box, the /dev/ipldevice has the same device major and 
minor numbers as hdisk0 - tho it is interesting that ipldevice is a 
character device, where a drive is usually a block device: 


mybox:rmiller$ ls -l /dev/ipl* 
crw------- 2 root system 23, 0 Jan 15 2002 /dev/ipldevice 
mybox:rmiller$ ls -l /dev/hdisk0 
brw------- 1 root system 23, 0 Sep 13 2002 /dev/hdisk0 


A:

> Hi, 

> AIX 5.3 
> I have a machine where /dev/ipldevice doesn't exit 
> I can reboot it safely ? 
> How I can I re-create it ? 

> Thanks in advance 

I did this today, and there is probably a more accepted way. 
I made a hard link from my rhdiskX device to /dev/ipldevice. 

If your boot device is /dev/hdisk0, then the command line would be as 
follows: 

ln /dev/rhdisk0 /dev/ipldevice 

Again, there is probably a more acceptable way to achieve this, but it 
worked for me. 


== thread:

how to recover from an invalid or no boot device error in AIX 
Description

When running the command "bosboot -ad /dev/ipldevice" in IBM AIX, you get the following error:

0301-150 bosboot: Invalid or no boot device specified!

A device specified with the bosboot -d command is not valid. The bosboot command was unable to finish processing 
because it could not locate the required boot device. The installp command calls the bosboot command 
with /dev/ipldevice. If this error does occur, it is probably because /dev/ipldevice does not exist. 
/dev/ipldevice is a link to the boot disk. 

To determine if the link to the boot device is missing or incorrect :

1) Verify the link exists:

# ls -l /dev/ipldevice
ls: 0653-341 The file /dev/ipldevice does not exist.

2) In this case, it does not exist. To identify the boot disk, enter "lslv -m hd5". The boot disk name displays. 

# lslv -m hd5
hd5:N/A
LP PP1 PV1 PP2 PV2 PP3 PV3
0001 0001 hdisk4 0001 hdisk1 

In this example the boot disk name is hdisk4 and hdisk1.

3) Create a link between the boot device indicated and the /dev/ipldevice file. Enter: 

# ln /dev/boot_device_name /dev/ipldevice
(An example of boot_device_name is rhdisk0.)

In my case, I ran:

# ln /dev/rhdisk4 /dev/ipldevice

4) Now run the bosboot command again:

# bosboot -ad /dev/ipldevice 
Example

lslv -m hd5; ln /dev/rhdisk4 /dev/ipldevice; bosboot -ad /dev/ipldevice 


Note 9: Other mksysb errors on AIX 5.3:
---------------------------------------

It turns out, that on AIX 5.3, on certain ML/TL levels (below TL 6), an mksysb error turns up,
if you have other volume groups defined other than rootvg, while there is NO filesystem created on
those Volume groups.

Solution: create a filesystem, even only a "test" or "dummy" filesystem, on those VG's.


>> thread 1:

Q:

Hi 

can't find any information about "backup structure of volume group, vios". included service: 
"savevgstruct vgname" working with errors: 
# lsvg 
rootvg 
vg_dev 
datavg_dbs 
# /usr/ios/cli/ioscli savevgstruct vg_dev 

Creating information file for volume group vg_dev.. 

Some error messages may contain invalid information 
for the Virtual I/O Server environment. 

cat: 0652-050 Cannot open /tmp/vgdata/vg_dev/fs_data_tmp. 

# ls -al /tmp/vgdata/vg_dev/ 
total 16 
drwxr-xr-x 2 root staff 256 Apr 02 08:38 . 
drwxrwxr-x 5 root system 256 Apr 02 08:20 .. 
-rw-r--r-- 1 root staff 2002 Apr 02 08:35 filesystems 
-rw-r--r-- 1 root staff 1537 Apr 02 08:35 vg_dev.data 
# oslevel -r 
5300-05 
# df -k | grep tmp 
/dev/hd3 1310720 1309000 1% 42 1% /tmp 


A:

I had this issue as well with VIO 1.3. I called IBM support 
about it and it is a known issue. The APAR is IY87935. The fix 
will not be released until AIX 5.3 TL 6, which is due out in 
June. It occurs when you run savevgstruct on a user defined 
volume group that contains volumes where at least one does not 
have a filesystem defined on it. The workaround is to define a 
filesystem on every volume in the user defined volume group.


>> thread 2:

IBM APAR Note:

http://www-1.ibm.com/support/docview.wss?uid=isg1IY87935

IY87935: MKVGDATA/SAVEVG CAN FAIL


APAR status
Closed as program error.

Error description 
The mkvgdata command when executed on a volume group that does
not have any mounted filesystems:

  # savevg -f /home/vgbackup -i vg00

  Creating information file for volume group vg00..cat:
  0652-050 Cannot open /tmp/vgdata/vg00/fs_data_tmp.

  /usr/bin/savevg 33 :  BACKUPSHRINKSIZE = 16 + FSSHRINKSIZE :
  0403-009 The specified number is not valid for this command.

Local fix 

Problem summary 
The mkvgdata command when executed on a volume group that does
not have any mounted filesystems:

  # savevg -f /home/vgbackup -i vg00

  Creating information file for volume group vg00..cat:
  0652-050 Cannot open /tmp/vgdata/vg00/fs_data_tmp.

  /usr/bin/savevg 33 :  BACKUPSHRINKSIZE = 16 + FSSHRINKSIZE :
  0403-009 The specified number is not valid for this command.

Problem conclusion 
Check variable.

Temporary fix 

Comments 

APAR information 
APAR number IY87935 
Reported component name AIX 5.3 
Reported component ID 5765G0300 
Reported release 530 
Status CLOSED PER 
PE NoPE 
HIPER NoHIPER 
Submitted date 2006-08-09 
Closed date 2006-08-09 
Last modified date 2006-08-09 


9.11 AIX: the backup and restore commands:
------------------------------------------

The backup command creates copies of your files on a backup medium, such as a magnetic tape or diskette. 
The copies are in one of the two backup formats:

- Specific files and directories, backed up by name using the -i flag. 
- Entire file system backed up by i-node, not using the -i flag, 
  but instead using the Level and FileSystem parameters.

Unless you specify another backupmedia with the -f parameter, the backup command automatically
writes its output to /dev/rfd0 which is the diskette drive.

(1) Backing up the user directory "userdirectory":
# cd /userdirectory
# find . -depth | backup -i -f /dev/rmt0            # or use find . -print

(2) Incremental backups:
You can create full and incremental backups of filesystems as well, as shown in the following example.
When the -u flag is used with the backup command, the system will do an incremental backup 
according to the -level number specified. For example, a level 5 backup will only back up the
data that has changed after the level 4 was made.
Levels can range from 0 to 9.

Example;

On Sunday:
# backup -0 -uf /dev/rmt0 /data
On Monday:
# backup -1 -uf /dev/rmt0 /data
..
..
On Saturday:
# backup -6 -uf /dev/rmt0 /data

Due to the -u parameter, information about the backups is written to the /etc/dumpdates file.

To backup the / (root) file system, enter: 
# backup  -0 -u -f /dev/rmt0 /

Note that we do noy use the -i flag, but instead backup an entire fs "/". 

Other examples:
---------------

To backup all the files and subdirectories in current directory using relative pathnames, use
# find . -print | backup -if /dev/rmt0

To backup the files /bosinst.data and /signature to the diskette, use
# ls ./bosinst.dat ./signature | backup -iqv

How to restore a file:
----------------------

Suppose we want to restore the /etc/host file, because its missing.

# tctl -f /dev/rmt0 rewind                  # - rewind tape
# restore -x -d -v -q -s4 -f /dev/rmt0.1 ./etc/hosts

Another example:

# restore -qvxf /dev/rmt0.1 "./etc/passwd"     Restore /etc/passwd file 
# restore -s4 -qTvf /dev/rmt0.1                Lists contents of a mksysb tape 


9.12 AIX: savevg and restvg:
----------------------------

To backup, or clone, a VG, you can use the 

- mksysb command for the rootvg
- savevg command for other user VG's

To backup a user Volume Group (VG, see also sections 30 and 31) you can use savevg to backup a VG
and restvg to restore a VG.

# lsvg                                # - shows a list of online VG's
rootvg
uservg

# savevg -if /dev/rmt0 uservg         # - now backup the uservg


9.13 AIX: tctl:
---------------

Purpose
Gives subcommands to a streaming tape device.

Syntax
tctl [  -f Device ] [  eof | weof | fsf | bsf | fsr | bsr | rewind | offline |  rewoffl | erase | retension | reset | status ] [ Count ]

tctl [  -b BlockSize ] [  -f Device ] [  -p BufferSize ] [  -v ] [  -n ] [  -B ] {  read | write }

Description
The tctl command gives subcommands to a streaming tape device. If you do not specify the Device variable 
with the -f flag, the TAPE environment variable is used. If the environment variable does not exist, 
the tctl command uses the /dev/rmt0.1 device. (When the tctl command gives the status subcommand, 
the default device is /dev/rmt0.) The Device variable must specify a raw (not block) tape device. 
The Count parameter specifies the number of end-of-file markers, number of file marks, or number of records. 
If the Count parameter is not specified, the default count is 1.

Examples
To rewind the rmt1 tape device, enter: 
tctl  -f /dev/rmt1  rewind

To move forward two file marks on the default tape device, enter: 
tctl  fsf 2

To write two end-of-file markers on the tape in /dev/rmt0.6, enter: 
tctl  -f /dev/rmt0.6  weof 2

To read a tape device formatted in 80-byte blocks and put the result in a file, enter: 
tctl  -b 80  read > file

To read variable-length records from a tape device formatted in 80-byte blocks and put the result in a file, enter: 
tctl  -b 80  -n  read > file

To write variable-length records to a tape device using a buffer size of 1024 byes, enter: 
cat file | tctl  -b 1024  -n  -f/dev/rmt1  write

To write to a tape device in 512-byte blocks and use a 5120-byte buffer for standard input, enter: 
cat file | tctl  -v  -f /dev/rmt1  -p 5120  -b 512  write


Note: The only valid block sizes for quarter-inch (QIC) tape drives are 0 and 512.
To write over one of several backups on an 8 mm tape, position the tape at the start of the backup file 
and issue these commands: 
tctl  bsf 1

tctl  eof 1


9.14 AIX mt command:
--------------------


Purpose
Gives subcommands to streaming tape device.

Syntax
mt [  -f TapeName ] Subcommand [ Count ]

Description
The mt command gives subcommands to a streaming tape device. If you do not specify the -f flag 
with the TapeName parameter, the TAPE environment variable is used. If the environment variable 
does not exist, the mt command uses the /dev/rmt0.1 device. The TapeName parameter must be a raw (not block) 
tape device. You can specify more than one operation with the Count parameter.


Subcommands

eof, weof Writes the number of end-of-file markers specified by the Count parameter at the 
          current position on the tape. 
fsf       Moves the tape forward the number of files specified by the Count parameter and positions 
          it to the beginning of the next file. 
bsf       Moves the tape backwards the number of files specified by the Count parameter and positions 
          it to the beginning of the last file skipped. If using the bsf subcommand would cause the tape head 
          to move back past the beginning of the tape, then the tape will be rewound, and the mt command will return EIO. 
fsr       Moves the tape forward the number of records specified by the Count parameter. 
bsr       Moves the tape backwards the number of records specified by the Count parameter. 
rewoff1, rewind Rewinds the tape. The Count parameter is ignored. 
status    Prints status information about the specified tape device. The output of the status command 
          may change in future implementations 

Examples
To rewind the rmt1 tape device, enter: 

mt -f /dev/rmt1 rewind
To move forward two files on the default tape device, enter: 

mt fsf 2
To write two end-of-file markers on the tape in the /dev/rmt0.6 file, enter: 

mt -f /dev/rmt0.6 weof 2


9.14 AIX tapeutil command:
--------------------------

tapeutil -f <devicename> <commands>
- A program which came with the tape library to control it's working. Called without arguments gives a menu. 
Is useful for doing things like moving tapes from the slot to the drive. e.g.

$ tapeutil -f /dev/smc0 move -s 10 -d 23 

which moves the tape in slot 10 to the drive (obviously, this will depend on your own individual tape library, 
may I suggest the manual?). 

The fileset you need to install for 'tapeutil' command is:
Atape.driver 7.1.5.0.

Example:
--------

We are using 3583 automated tape library for backups.for tapeutil command u need to have a file atape.sys 
on ur system.to identify the positioning of tape drives and source just type tapeutil it will give 
u a number of options.choose element information to identify the source and tape drive numbers.
In our case the tape drives numbers are 256 and 257 and the source number to insert the tape is 16.
we usually give the following commands to load and move the tape.

Loading Tape:-
tapeutil -f /dev/smc0 move -s 16 -d 256
(to insert the tape in tapedrive 1,where 16 is source and 256 is destination)
to take the backup:-

find filesystem1 filesystem2 | backup -iqvf /dev/rmt1

((filessystem name without mount point slash))

after taking the backup and unloading tape:-

tapeutil -f /dev/rmt1 unload

tapeutil -f /dev/smc0 move -s 256 -d 16

(first unload the tape then move it to source destination)

this might help u to use the taputil command in taking backup.

Example:
--------

In order to move tapes in and out of the Library here is what I do.

First  I unload the tape with the command  #tapeutil -f /dev/rmtx unload
Where x is 0,1,2,3...
then I move the tape from external slot (16) using the media changer, not the tape drive.

#tapeutil -f /dev/smcx move 256 16
The above command moves the tape in your first tape drive (256) to the external slot.
Note that you can also move from the internal slots to the external slot or the tape drive.
To move the tape back from the external slot, I just switch 256 and 16 parameters.


Example:
--------

The code I use to list the I/O station slots is:

/usr/bin/tapeutil -f /dev/smc0 inventory | grep -p Station | egrep
'Station|Volume' | awk '{
if($1 =3D=3D "Import/Export") ioslot=3D$4;
if($1 =3D=3D "Volume") {
      if(NF =3D=3D 4) volser=3D$4;
      else volser=3D"-open-";
      print ioslot, volser;
}}'

The tapeutil command to move a tape is:

/usr/bin/tapeutil -f /dev/smc0 move <fromslot> <toslot>

For example:  /usr/bin/tapeutil -f /dev/smc0 move 773 1037

You can get the slot numbers, and volsers in them, with the command:
/usr/bin/tapeutil -f /dev/smc0 inventory

To find an open slot just look for a slot with a blank "Volume Tag".

One little hitch, however.  If a tape is currently mounted, the "tapeut=il inventory" command will show a
slot as open ("Volume Tag" is blank), but TSM will have it reserved for=
 the
mounted tape.  So what I did
in my script is to check the TSM device configuration file for each ope=
n
slot that I find and if that slot number
appears in it then I skip that slot and go on to the next one.


Example:
--------

#!/bin/ksh
DEVICE=$1
HOST=$2
TAPE=$3
case $TAPE in
2) tapeutil -f /dev/smc0 move 23 10
      tapeutil -f /dev/smc0 move 11 23
;;
3) tapeutil -f /dev/smc0 move 23 11
      tapeutil -f /dev/smc0 move 12 23
;;
4) tapeutil -f /dev/smc0 move 23 12
      tapeutil -f /dev/smc0 move 13 23
;;
5) tapeutil -f /dev/smc0 move 23 13
      tapeutil -f /dev/smc0 move 14 23
;;
esac

Example:
--------

tapeutil -f /dev/rmt1 unload 
tapeutil -f /dev/smc0 move 257 16 
tapeutil -f /dev/smc0 move -s 256 -d 16
tapeutil -f /dev/smc0 move 257 1025 
tapeutil -f /dev/smc0 move 16 257 

tapeutil -f /dev/smc0 exchange 34 16 40
tapeutil -f /dev/smc0 inventory | more
tctl -f/dev/rmt0 rewoffl
tapeutil -f/dev/smc0 elementinfo
tapeutil -f /dev/scm0 inventory


Example:
--------

tapeutil -f /dev/rmt1 unload 
sleep 20

DAYNO=`date +%d`;export DAYNO

case $DAYNO in
01) tapeutil -f /dev/smc0 move 23 10
      tapeutil -f /dev/smc0 move 11 23
;;
02) tapeutil -f /dev/smc0 move 23 10
      tapeutil -f /dev/smc0 move 11 23
;;
03) tapeutil -f /dev/smc0 move 23 10
      tapeutil -f /dev/smc0 move 11 23
;;
04) tapeutil -f /dev/smc0 move 23 10
      tapeutil -f /dev/smc0 move 11 23
;;
05) tapeutil -f /dev/smc0 move 23 10
      tapeutil -f /dev/smc0 move 11 23
;;
06) tapeutil -f /dev/smc0 move 23 10
      tapeutil -f /dev/smc0 move 11 23
;;
07) tapeutil -f /dev/smc0 move 23 10
      tapeutil -f /dev/smc0 move 11 23
;;
esac

Example:
--------

tapeutil -f /dev/rmt1 unload 
sleep 20

DAYNAME=`date +%a`;export DAYNAME

case $DAYNAME in
Sun) tapeutil -f /dev/smc0 move 256 4098
tapeutil -f /dev/smc0 move 4099 256      
;;
Mon) tapeutil -f /dev/smc0 move 256 4099
tapeutil -f /dev/smc0 move 4100 256    
;;
Tue) tapeutil -f /dev/smc0 move 256 4100
tapeutil -f /dev/smc0 move 4113 256      
;;
Wed) tapeutil -f /dev/smc0 move 256 4113
tapeutil -f /dev/smc0 move 4114 256     
;;
Thu) tapeutil -f /dev/smc0 move 256 4114
tapeutil -f /dev/smc0 move 4109 256    
;;
Fri) tapeutil -f /dev/smc0 move 256 4109
tapeutil -f /dev/smc0 move 4124 256      
;;
Sat) tapeutil -f /dev/smc0 move 256 4124
tapeutil -f /dev/smc0 move 4110 256      
;;
esac

tapeutil -f /dev/smc0 move 256 4098
tapeutil -f /dev/smc0 move 4099 256

Example:
--------


tapeutil -f /dev/smc0 move 16 4096
sleep 10
tapeutil -f /dev/smc0 move 17 4097
sleep 10
tapeutil -f /dev/smc0 move 18 4098
sleep 10
tapeutil -f /dev/smc0 move 19 4099
sleep 10
tapeutil -f /dev/smc0 move 20 4100
sleep 10
tapeutil -f /dev/smc0 move 21 4101
sleep 10

Example:
--------

mt -f /dev/rmt1  rewind
mt -f /dev/rmt1.1 fsf 6
tar -xvf /dev/rmt1.1 /data/download/expdemo.zip
SPL bld


About Ts3310:
-------------

Abstract 
Configuration Information for IBM TS3310 (IBM TotalStorage 3576)  
  
Content 

IBM TS3310 (IBM TotalStorage 3576)

Drive Addresses Storage Slot Addresses Changer Address Entry/Exit Slot Address 
256-261         4096-4223              1               16-21 

Notes:

1. Barcodes are required. Without a barcode label, a volume will show as unknown media.

2. ELEMent=AUTODetect in the DEFINE/UPDATE DRIVE command is supported.

3. Device identification and firmware used during validation 
Library ID: IBM 3576-MTL --- Firmware: 0.62

4. The IBM device driver is required. The IBM device drivers are available at ftp://ftp.software.ibm.com/storage/devdrvr.

5. The library is available with IBM LTO Generation 3 drives.

6. For more information on IBM TS3310, see TS3310 Tape Library.

 
Example:
--------

First, list the tape device names: 
lsdev -Cc tape
Assume it returns smc0 for the library, and rmt0 and rmt1 for the tape drives, and all devices are Available. 

Next, take an inventory of the library. 
tapeutil -f /dev/smc0 inventory | more
Assume the inventory returns two drives with element numbers 256 and 257 and shows a tape stored in slot 1025. 

Then, start moving the tape to each drive in turn, and verify which device name it is associated with 
by running tctl or mt rewoffl. If it returns without error, the device name matches the element number. 

Move the tape from the tape slot to the first drive: 
tapeutil -f /dev/smc0 move 1025 256
tctl -f/dev/rmt0 rewoffl
If the command returns with no errors, then element # 256 matches device name /dev/rmt0. 

Move the tape to the next drive 
tapeutil -f /dev/smc0 move 256 257
tctl -f/dev/rmt1 rewoffl
If the command returns with no errors, then element # 257 matches device name /dev/rmt1

Move the tape back to the storage slot it came from: 
tapeutil -f /dev/smc0 move 257 1025 

If at any point, the tctl command returns with errors, then try another device name until it returns without errors. 

NOTE: the 'rewoffl' flag on tctl simply rewinds and ejects the tape from the drive. 


9.15 Recover from AIX OS failure:
---------------------------------

Recover from OS failure.								
								
Contents:								
1. How to view the bootlist:								
2. How to change the bootlist:								
3. How to make a device bootable:								
4. How to make a backup of the OS:								
5. Shutdown a pSeries AIX system in the most secure way:								
6. How to restore specific files from a mksysb tape:								
7. Recovery of rootvg								
								

1. How to view the bootlist:								
								
At boottime, once the POST is completed, the system will search the boot list for a								
bootable image. The system will attempt to boot from the first entry in the bootlist.								
Its always a good idea to see what the OS thinks are the bootable devices and the order of what the OS 								
thinks it should use. Use the bootlist command to view the order:								
								
# bootlist -m normal -o								
								
As the first item returned, you will see hdisk0, the bootable harddisk.								
								
If you need to check the bootlist in "service mode", for example if you want to boot from tape to restore the rootvg, use								
								
# bootlist -m service -o								
								
								
2. How to change the bootlist:								
								
The bootlist, in normal operations, can be changed using the same command as used in section 1, for example								
								
# bootlist -m normal hdisk0 cd0								
								
This command makes sure the hdisk0 is the first device used to boot the system.								
								
If you want to change the bootlist for the system in service mode, you can change the list in order to use rmt0								
if you need to restore the rootvg.								
								
# bootlist -m service rmt0								
								
								
3. How to make a device bootable:								
								
To make a device bootable, use the bosboot command:								
								
# bosboot -ad /dev/ipldevice								
								
So, if hdisk0 must be bootable, or you want to be sure its bootable, use								
								
# bosboot -ad /dev/hdisk0								
								
								
4. How to make a backup of the OS:								
								
The mksysb command creates an installable image of the rootvg. This is synonym to say that mksysb creates								
a backup of the operating system (that is, the root volume group). 								
								
You can use this backup to reinstall a system to its original state after it has been corrupted. 								
If you create the backup on tape, the tape is bootable and includes the installation programs 								
needed to install from the backup.								
								
To generate a system backup and create an /image.data file (generated by the mkszfile command) to a tape device 								
named /dev/rmt0, type: 								
								
# mksysb -i /dev/rmt0								
								
If a backup tape was created with the -e switch, like in:								
								
# mksysb -i -e /dev/rmt0								
								
then a number of directories are NOT included in the backup. These exclusions are listed in the "/etc/exclude.rootvg" file.								
								
The mksysb command should be used regularly. It must certainly be done after installing apps or devices.								
In normal conditions, the OS does not change, and a bootable tape should be created at some frequency.								
								
								
5. Shutdown a pSeries AIX system in the most secure way:								
								
1. Shut down all applications in a controlled way.								
2. Make sure no users are on the system.								
3. Use the shutdown command:								
								
shutdown -r		to reboot the system						
shutdown -m		to reboot in maintenance mode						
								
								
6. How to restore specific files from a mksysb tape:								
								
$ tctl fsf 3								
$ restore -xvf /dev/rmt0.1 ./your/file/name								
								
For example, if you need to get the vi command back, put the mksysb tape in the tape drive (in this case, /dev/rmt0) 								
and do the following:								
								
cd /                         # get to the root directory								
tctl -f /dev/rmt0 rewind     # rewind the tape								
tctl -f /dev/rmt0.1 fsf 3    # move the tape to the third file, no rewind								
restore -xqf /dev/rmt0.1 -s 1 ./usr/bin/vi    # extract the vi binary, no rewind								
								
Further explanation why you must use the fsf 3 (fast forward skip file 3):								
The format of the tape is as follows:								
1. A BOS boot image								
2. A BOS install image								
3. A dummy Table Of Contents								
4. The system backup of the rootvg								
								
So if you just need to restore some files, first forward the tape pointer to position 3, counting from 0.								
								
								
7. Recovery of rootvg								
								
7.1 Check if the system can boot from tape:
# bootinfo -e

If a 1 is returned, the system can boot from tape, if a 0 is returned a boot from tape is not supported.

7.2 Recover the rootvg:

One possible method is the following:
1. Check whether the tape is in front of the disk with the bootlist command:
   # bootlist -m normal -o
2. Insert the mksysb tape
3. Power on the machine. The system will boot from the tape.
4. The Installation and Maintenance Menu will be displayed.


                      Welcome to Base Operating System
                      Installation and Maintenance

Type the number of your choice and press Enter.  Choice is indicated by >>>.

>>> 1 Start Install Now with Default Settings

    2 Change/Show Installation Settings and Install

    3 Start Maintenance Mode for System Recovery


Type 3 and press enter to start maintenance mode.
   The next screen you should see is :-

                    Maintenance 

Type the number of your choice and press Enter.

>>> 1 Access a Root Volume Group 
    2 Copy a System Dump to Removable Media
    3 Access Advanced Maintenance Functions
    4 Install from a System Backup

>>> Choice [1]: 

Type 4 and press enter to install from a system backup.
   The next screen you should see is :-

                    Choose Tape Drive

Type the number of the tape drive containing the system backup to be
installed and press Enter.

      Tape Drive                     Path Name

>>> 1 tape/scsi/ost                  /dev/rmt0

>>> Choice [1]:  

Type the number that corresponds to the tape drive that the mysysb tape 
   is in and press enter.
   The next screen you should see is :-

                      Welcome to Base Operating System
                      Installation and Maintenance

Type the number of your choice and press Enter.  Choice is indicated by >>>.

>>> 1 Start Install Now with Default Settings

    2 Change/Show Installation Settings and Install

    3 Start Maintenance Mode for System Recovery


                       +-----------------------------------------------------
    88  Help ?         |Select 1 or 2 to install from tape device /dev/rmt0
    99  Previous Menu  |
                       | 
>>> Choice [1]: 

You can now follow your normal mksysb restore procedures.


9.16 HP-UX make_net_recovery:
----------------------------- 


There are two ways you can recover from a tape with make_net_recovery. The method you choose depends on your needs.

- Use make_medialif
This method is useful when you want to create a totally self-contained recovery tape. The tape will be bootable 
and will contain everything needed to recover your system, including the archive of your system. During recovery, 
no access to an Ignite-UX server is needed. Using make_medialif is described beginning on 
"Create a Bootable Archive Tape via the Network" and also on the Ignite-UX server in the file: 
/opt/ignite/share/doc/makenetrec.txt

- Use make_boot_tape
This method is useful when you do not have the ability to boot the target machine via the network, but are still 
able to access the Ignite-UX server via the network for your archive and configuration data. This could happen 
if your machine does not support network boot or if the target machine is not on the same subnet as the 
Ignite-UX server. In these cases, use make_boot_tape to create a bootable tape with just enough information 
to boot and connect with the Ignite-UX server. The configuration files and archive are then retrieved from the 
Ignite-UX server. See the make_boot_tape(1M) manpage for details. 


-- make_boot_tape:

make_boot_tape(1M)                                       make_boot_tape(1M)

 NAME
      make_boot_tape - make a bootable tape to connect to an Ignite-UX
      server

 SYNOPSIS
      /opt/ignite/bin/make_boot_tape [-d device-file-for-tape] [-f config-
           file] [-t tmpdir] [-v]

      /opt/ignite/bin/make_boot_tape [-d device-file-for-tape] [-g gateway]
           [-m netmask] [-t tmpdir] [-v]

 DESCRIPTION
      The tape created by make_boot_tape is a bootable tape that contains
      just enough information to boot the system and then connect to the
      Ignite-UX server where the tape was created.  Once the target system
      has connected with the Ignite-UX server, it can be installed or
      recovered using Ignite-UX.  The tape is not a fully self-contained
      install tape; an Ignite-UX server must also be present.  The
      configuration information and software to be installed on the target
      machine reside on the Ignite-UX server, not on the tape.  If you need
      to build a fully self-contained recovery tape, see make_recovery(1m)
      or make_media_lif(1m).

      make_boot_tape is used in situations when you have target machines
      that cannot boot via the network from the Ignite-UX server.  This
      happens either because the machine does not support booting from the
      network or because it is not on the same subnet as the Ignite-UX
      server.  In this case, booting from a tape generated by make_boot_tape
      means you do not need to set up a boot helper system.  A tape created
      by make_boot_tape can be used to kick off a normal Ignite-UX
      installation.  It can also be used to recover from recovery
      configurations saved on the Ignite-UX server.

      There is no "target-specific" information on the boot tape.  Only
      information about the Ignite-UX server is placed on the tape.  Thus,
      it is possible to initiate an installation of any target machine from
      the same boot tape provided that the same Ignite-UX server is used.
      Likewise, the target machine can be installed with any operating
      system configuration that is available on the Ignite-UX server.

      Typically, the make_boot_tape command is run from the Ignite-UX server
      that you wish to connect with when booting from the tape later on.

      A key file that contains configuration information is called
      INSTALLFS. This file exists on the Ignite-UX server at
      /opt/ignite/boot/INSTALLFS and is also present on the tape created by
      make_boot_tape. See instl_adm(4) for details on the configuration file
      syntax.  Unless the -f option is used, the configuration information
      already present in the INSTALLFS file is used on the tape as well.
      The make_boot_tape command will never alter the INSTALLFS file on the
      Ignite-UX server; it will only change the copy that is placed on the
      tape.

Examples:
---------

      Create a boot tape on the default tape drive (/dev/rmt/0m).

          # make_boot_tape

      Create a boot tape on a specified (non-default) tape drive. Create a
      DDS1 device file for the tape drive first.  Show as much information
      about the tape creation as is possible.

           ioscan -fC tape     # to get the hardware path
           mksf -v -H <hardware path> -b DDS1 -n -a
           make_boot_tape -d /dev/<devfile created by mksf> -v

      Create a boot tape and replace the configuration information contained
      in the INSTALLFS file.  Use the /tmp directory for all temporary files
      instead of the default /var/tmp.

          # instl_adm -d > tmp_config_file
           ## edit tmp_config_file as appropriate
          # make_boot_tape -f tmp_config_file -t /tmp

      Create a boot tape and specify a different gateway IP address.  Set
      the netmask value as well. All other configuration information is from
      what is already in /opt/ignite/boot/INSTALLFS.

          # make_boot_tape -g 15.23.34.123 -m 255.255.248.0


9.17 /etc/dumpdates
-------------------

On some unixes the /etc/dumpdates file exists, for example, Solaris.

Purpose of the /etc/dumpdates File
The ufsdump command, when used with the -u option, maintains and updates the /etc/dumpdates file. 
Each line in the /etc/dumpdates file shows the following information:

The file system backed up
The dump level of the last backup
The day, date, and time of the backup

For example:

# cat /etc/dumpdates
/dev/rdsk/c0t0d0s0               0 Wed Jul 28 16:13:52 2004
/dev/rdsk/c0t0d0s7               0 Thu Jul 29 10:36:13 2004
/dev/rdsk/c0t0d0s7               9 Thu Jul 29 10:37:12 2004 


When you do an incremental backup, the ufsdump command checks the /etc/dumpdates file to find the date 
of the most recent backup of the next lower dump level. Then, this command copies to the media all files that were modified 
since the date of that lower-level backup. After the backup is complete, a new information line, which describes the backup 
you just completed, replaces the information line for the previous backup at that level. 

Use the /etc/dumpdates file to verify that backups are being done. This verification is particularly important 
if you are having equipment problems. If a backup cannot be completed because of equipment failure, the backup 
is not recorded in the /etc/dumpdates file.

If you need to restore an entire disk, check the /etc/dumpdates file for a list of the most recent dates and levels 
of backups so that you can determine which tapes you need to restore the entire file system.


9.18 UFS snapshot on Solaris
----------------------------

UFS Snapshots Overview
The Solaris release includes the fssnap command for backing up file systems while the file system is mounted. You can use 
the fssnap command to create a read-only snapshot of a file system. A snapshot is a file system's temporary image that is 
intended for backup operations.

When the fssnap command is run, it creates a virtual device and a backing-store file. You can back up the virtual device, 
which looks and acts like a real device, with any of the existing Solaris backup commands. The backing-store file is a bitmap file 
that contains copies of presnapshot data that has been modified since the snapshot was taken.

Why Use UFS Snapshots?
The UFS snapshots feature enables you to keep the file system mounted and the system in multiuser mode during backups. 
Previously, you were advised to bring the system to single-user mode to keep the file system inactive when you used 
the ufsdump command to perform backups. You can also use additional Solaris backup commands, such as tar and cpio, 
to back up a UFS snapshot for more reliable backups.

The fssnap command gives administrators of nonenterprise-level systems the power of enterprise-level tools, 
such as Sun StorEdgeT Instant Image, without the large storage demands.

The UFS snapshots feature is similar to the Instant Image product. Although UFS snapshots can make copies of large file systems, 
Instant Image is better suited for enterprise-level systems. UFS snapshots is better suited for smaller systems. Instant Image allocates 
space equal to the size of the entire file system that is being captured. However, the backing-store file that is created by UFS snapshots 
occupies only as much disk space as needed.

Example of how to use it:

# fssnap -F ufs -o bs=/backing-store-file /file-system

Obviously, the backing-store file must reside on a different file system than the file system that is being captured 
using UFS snapshots.

The following example shows how to create a snapshot of the /usr file system. 
The backing-store file is /scratch/usr.back.file. The virtual device is /dev/fssnap/1.

# fssnap -F ufs -o bs=/scratch/usr.back.file /usr
/dev/fssnap/1
 
You can display the current snapshots on the system by using the fssnap -i option. If you specify a file system, 
you see detailed information about that snapshot. If you don't specify a file system, you see information about all 
of the current UFS snapshots and their corresponding virtual devices.

List all current snapshots:

For example:

# /usr/lib/fs/ufs/fssnap -i
Snapshot number               : 0
Block Device                  : /dev/fssnap/0
Raw Device                    : /dev/rfssnap/0
Mount point                   : /usr
Device state                  : idle
Backing store path            : /var/tmp/snapshot3
Backing store size            : 256 KB
Maximum backing store size    : Unlimited
Snapshot create time          : Wed Oct 08 10:38:25 2003
Copy-on-write granularity     : 32 KB
Snapshot number               : 1
Block Device                  : /dev/fssnap/1
Raw Device                    : /dev/rfssnap/1
Mount point                   : /
Device state                  : idle
Backing store path            : /tmp/bs.home
Backing store size            : 448 KB
Maximum backing store size    : Unlimited
Snapshot create time          : Wed Oct 08 10:39:29 2003
Copy-on-write granularity     : 32 KB

 
19.19 Recovery of the root filesystem on Solaris:
=================================================

Note 1:
------


Restoring the root (/) File System

-- To restore the / (root) file system, boot from the Solaris CD-ROM and then run ufsrestore.

If / (root), /usr, or the /var file system is unusable because of some type of corruption the system will not boot.

The following procedure demonstrates how to restore the / (root) file system which is assumed to be on boot disk c0t0d0s0.

1. Insert the Solaris 8 Software CD 1, and boot the CD-ROM with the single-user mode option. 

ok boot cdrom -s

2. Create the new file system structure.

# newfs /dev/rdsk/c0t0d0s0

3. Mount the file system to an empty mount point directory, /a and change to that directory.

# mount /dev/dsk/c0t0d0s0 /a
# cd /a

4. Restore the / (root) file system from its backup tape.

# ufsrestore rf /dev/rmt/0

Note - Remember to always restore a file system starting with the level 0 backup tape and continuing with the next lowest level 
tape up through the highest level tape.

5. Remove the restoresymtable file.

# rm restoresymtable

6. Install the bootblk in sectors 1-15 of the boot disk. Change to the directory containing the bootblk, and run the installboot command.

# cd /usr/platform/`uname -m`/lib/fs/ufs
# installboot bootblk /dev/rdsk/c0t0d0s0 


7. Unmount the new file system.

# cd /
# umount /a

8. Use the fsck command to check the restored file system.

# fsck /dev/rdsk/c0t0d0s0

9. Reboot the system.

# init 6

10. Perform a full backup of the file system. For example:

# ufsdump 0uf /dev/rmt/0 /dev/rdsk/c0t0d0s0

Note - Always back up the newly created file system, as ufsrestore repositions the files and changes the inode allocation. 

Restoring the /usr and /var File Systems 


-- To restore the /usr and /var file systems repeat the steps described above, except step 6. 
This step is required only when restoring the (/) root file system.

To restore a regular file system, (for example, /export/home, or /opt) back to disk, repeat the steps described above, except steps 1, 6, and 9.

Example

# newfs /dev/rdsk/c#t#d#s#
# mount /dev/dsk/c#t#d#s# /mnt
# cd /mnt
# ufsrestore rf /dev/rmt/#
# rm restoresymtable
# cd /
# umount /mnt
# fsck /dev/rdsk/c#t#d#s#
# ufsdump 0uf /dev/rmt/# /dev/rdsk/c#t#d#s#

 
Note 2:
-------


=============
10. uuencode:
=============

Unix to Unix Encoding. A method for converting files from Binary to ASCII so that they can be sent across 
the Internet via e-mail. 

Encode binary file (to uuencoded ASCII file) 

uuencode file remotefile 
uudecode file 

Example: 

Encode binary file
uuencode example example.en 

Decode encoded file
uudecode example.en 
 

uuencode converts a binary file into an encoded representation that can be sent using mail(1) . 
It encodes the contents of source-file, or the standard input if no source-file argument is given. 
The decode_pathname argument is required. The decode_pathname is included in the encoded file's header 
as the name of the file into which uudecode is to place the binary (decoded) data. 
uuencode also includes the permission modes of source-file, (except setuid , setgid, and sticky-bits), 
so that decode_pathname is recreated with those same permission modes. 

example:
The following example packages up a source tree, compresses it, uuencodes it and mails it to 
a user on another system. When uudecode is run on the target system, the file ``src_tree.tar.Z'' 
will be created which may then be uncompressed and extracted into the original tree. 

# tar cf - src_tree | compress | uuencode src_tree.tar.Z | mail sys1!sys2!user 

example:
uuencode <file_a> <file_b> > <uufile>                                  |
| note: here, file_a is encoded and a new file named uufile is produced  |
|       when you decode file uufile a file named file_b is produced      |

# uuencode dipl.doc dipl.doc >dipl.uu
Hier wird die Datei dipl.doc (z.B. ein WinWord-Dokument) in die Datei dipl.uu umgewandelt. Dabei legen wir fest, 
dasz die Datei nach dem Decodieren wieder dipl.doc heiszen soll. 

example:
uuencode long_name.tar.Z arc.trz > arc.uue


11. grep command:
=================

# grep Sally people
# grep "Sally Smith" people
# grep -v "^$" people.old > people
# grep -v "^ *$" people.old > people    # deletes all blank lines
# grep "S.* D.*" people.old > people


12. sort command:
=================

sort files by size, largest first...
# ls -al | sort +4 -r | more   

# sort +1 -2 people
# sort +2b people
# sort +2n +1 people
# sort +1 -2 *people > everybody
# sort -u +1 hardpeople softpeople > everybody  # -u=unique
# sort -t: +5 /etc/passw                        # -t field sep.

cp /etc/hosts /etc/hosts.`date +%o%b%d`


13. SED:
========

Can be used to replace a character sting with a different string.

# sed s/string/newstring file

#sed s/Smith/White/ people.old > people
#sed "s/Sally Smith/Sally White/" people.old > people

Note: depending on your shell and system, in most cases, you might need to enclose s/string/newstring by a " or a '.


you can also use a regular expression, for instance we can put a left margin of 5
spaces on the people file

# sed "s/^/     /" people.old > people
# sed "s/[0-9]*$//" people.old > people        (remove numbers)
# sed -e "s/^V^M//" filename > outputfilename 

The character after the s is the delimiter. It is conventionally a slash, because this is what ed, more, and vi use. 
It can be anything you want, however. If you want to change a pathname that contains a slash - say /usr/local/bin to /common/bin - 
you could use the backslash to quote the slash: 

sed 's/\/usr\/local\/bin/\/common\/bin/' <old >new

or use _ as a delimter

sed 's_/usr/local/bin_/common/bin_' <old >new


Example:
--------

Suppose the file cdc_LEG.sql contains the following:

    spool Publisher.06.PublisherDefineChangeTable.tdba_cdc.cdc_LEG.log ;

    connect / as sysdba ;

    grant all on rm_live.LEG to tdba_cdc ;

    prompt   User: tdba_cdc ;
    connect tdba_cdc ;

    begin
      dbms_cdc_publish.create_change_table
      ( owner             => 'tdba_cdc'
      , change_table_name => 'cdc_LEG'
      , change_set_name   => 'BODI_CDC_SET'
      , source_schema     => 'rm_live'
      , source_table      => 'LEG'
      , column_type_list  => '  IDFLT NUMBER(9)   ,  IDLEG NUMBER(9)   ,  LEGDATE DATE   ,  IDLEGDATA NUMBER(9)   ,  CANCELLED CHAR(1)   ,  IDWORKSET NUMBER(9)   ,  IDTEXTTTS NUMBER(9)   ,  IDSEGMENTDATACOMBINE NUMBER(9) '
      , capture_values    => 'both'
      , source_colmap     => 'y'
      , target_colmap     => 'y'
      , options_string    => 'tablespace  tdba_cdc'
      ) ;
    end ;
    /

    grant select on tdba_cdc.cdc_LEG to bodi_cdc ;


Now we want to replace the "connect tdba_cdc" by "connect tdba_cdc/tdba_cdc"

Try:

#sed 's!connect tdba_cdc!connect tdba_cdc/tdba_cdc!' cdc_LEG.sql > cdc_LEG.txt
#sed 's/playroca/accproca!' cdc_LEG.sql > cdc_LEG.txt

gives:

    spool Publisher.06.PublisherDefineChangeTable.tdba_cdc.cdc_LEG.log ;

    connect / as sysdba ;

    grant all on rm_live.LEG to tdba_cdc ;

    prompt   User: tdba_cdc ;
    connect tdba_cdc/tdba_cdc ;

    begin
      dbms_cdc_publish.create_change_table
      ( owner             => 'tdba_cdc'
      , change_table_name => 'cdc_LEG'
      , change_set_name   => 'BODI_CDC_SET'
      , source_schema     => 'rm_live'
      , source_table      => 'LEG'
      , column_type_list  => '  IDFLT NUMBER(9)   ,  IDLEG NUMBER(9)   ,  LEGDATE DATE   ,  IDLEGDATA NUMBER(9)   ,  CANCELLED CHAR(1)   ,  IDWORKSET NUMBER(9)   ,  IDTEXTTTS NUMBER(9)   ,  IDSEGMENTDATACOMBINE NUMBER(9) '
      , capture_values    => 'both'
      , source_colmap     => 'y'
      , target_colmap     => 'y'
      , options_string    => 'tablespace  tdba_cdc'
      ) ;
    end ;
    /

    grant select on tdba_cdc.cdc_LEG to bodi_cdc


If you have a lot of those files, use something like

for file in `ls`
do
   sed 's!connect tdba_cdc!connect tdba_cdc/tdba_cdc!' $file > $file.sql
done


for file in `ls`
do
   echo $file
done


for file in `ls`
do
 echo "connect / as sysdba;" >> $file
done

for file in `ls`
do
   sed 's!quit!;!' $file > $file.sql
done

Other example:
--------------

If you want sed to remove a space at either side of a field, like  Albert van der Sel , Antapex.org, 5 , 20
you could use:

sed 's/[ ]*,[ ]*/,/g'
or
sed -e 's/[ ]*,[ ]*/,/g' -e 's/^[ ]*//' -e 's/[ ]*$//' file1 > file2


sed -e 's#\(00/00/0000\)[, ][, ]*$#\1,,,,,,,,,,,,,,,,,#g' file


Most common error:

Message sed: 0602-404 Function __ cannot be parsed. 
If you were trying to use the sed "substitute" command, e.g. s/a/b/, you may have forgotton the trailing delimiter. 


14. AWK:
========

When lines containing `foo' are found, they are printed, because `print $0' means print the current line:
  # awk '/foo/ { print $0 }' BBS-list

looks for all files in the ls listing that matches Nov and it prints the total of bytes:
  # ls -l | awk '$5 == "Nov" { sum += $4 }
               END { print sum }'

only print the lines containing Smith from file people:
  # awk /Smith/ people                                   

# awk '/gold/' coins.txt
# awk '/gold/ {print $0}' coins.txt
# awk '/gold/ {print $5,$6,$7,$8}' coins.txt
# awk '{if ($3 < 1980) print $3, "    ",$5,$6,$7,$8}' coins.txt


# awk '/Smith/ {print $1 "-" $3}' people
# ls -l /home | awk '{total += $5}; END {print total}'
# ls -lR /home | awk '{total += $5}; END {print total}'


Example:
--------

Suppose you have a text file with lines much longer than, for example, 72 characters,
and you want to have a file with lines with a maximum of 72 chars, then you might use awk
in the following way:

-- Shell file r13.sh:

#!/bin/bash

DIR=/cygdrive/c/exports
FILE=result24.txt

awk -f r13.awk ${DIR}/${FILE} > ${DIR}/${FILE}.new

-- r13.awk

BEGIN { maxlength=72 }
{ 
  l=length();
  if (l > 72) { 
    i=(l/72)
    for (j=0; j<i; j++) {
      printf "%s\r\n",substr($0, (j*72)+1, maxlength)
    }
  } else { 
    printf "%s\r\n",$0
  }
}  


15. tr command:
===============

Used for translating characters in a file. tr works on standard input, so if you want
to take input from a file you have to redirect standard input so that it comes from that file.

Suppose we want to replace all characters in the 
range a-z by the characters A-Z

# tr "[a-z]" "[A-Z]" < people

squeeze  muliple occurences osf a character (e.g. a space) in one
# tr -s " " people.old > people

remove blank lines:
# tr -s "\012" < people.old > people


to remove the evil microsoft carriage return.
# tr -d '\015' < original.file > new.file

# cat filename1 | tr -d "^V^M" > newfile 

#! /bin/sh
#           
#  recursive dark side repair technique
#   eliminates spaces in file names from current directory down
#    useful for supporting systems where clueless vendors promote NT
#
for name in `find . -depth -print`
do
	na=`echo "$name" | tr ' ' '_'`
	if [ "$na" != "$name" ]
	then
		echo "$name" 
	fi
done

note:

> I have finally competed setting up the samba server and setup the share
> between NT and Samba server.
> 
> However, when I open a unix text file in Windows NT using notepad, i see
> many funny characters and the text file is not in order (Just like when I
> ftp the unix text file out into NT in binary format) ...I think this has to
> be something to do with whether the file transfer is in Binary format or
> ASCII ... Is there a parameter to set for this ? I have checked the
> documents ... but couldn't find anything on this ...
> 

This is a FAQ, but it brief, it's like this. Unix uses a single newline
character to end a line ("\n"), while DOS/Win/NT use a
carriage-return/newline pair ("\r\n"). FTP in ASCII mode translates
these for you. FTP in binary mode, or other forms of file transfer, such
as Samba, leave the file unaltered. Doing so would be extremely
dangerous, as there's no clear way to isolate which files should be
translated

You can get Windows editors that understand Unix line-end conventions
(Ultra Edit is one), or you can use DOS line endings on the files, which
will then look odd from the Unix side. You can stop using notepad, and
use Wordpad instead, which will deal appropriately with Unix line
endings.

You can convert a DOS format text file to Unix with this:-

tr -d '\r' < dosfile.txt > unixfile.txt

The best solution to this seems to be using a Windows editor that can
handle working with Unix line endings.

HTH

Mike.

Note:

There are two ways of moving to a new line...carriage return, which is chr(13), 
and new line which is chr(10).  In windows you're supposed to use a sequence 
of a carriage return followed by a new line.  
For example, in VB you can use Wrap$=Chr$(13)+Chr$(10)  which creates a wrap character.


16. cut and paste:
==================

cutting columns:

# cut -c17, 18, 19 people
# cut -c17- people > phones
# cut -c1-16 people > names

cutting fields:

#cut -d" " -f1,2 people > names            # -d field seperator

paste:

# paste -d" " firstname lastname phones > people


17. mknod:
==========

mknod creates a FIFO (named pipe), character special file, or block special file with the specified name. 
A special file is a triple (boolean, integer, integer) stored in the filesystem. 
The boolean chooses between character special file and block special file. 
The two integers are the major and minor device number.

Thus, a special file takes almost no place on disk, and is used only for communication 
with the operating system, not for data storage. Often special files refer to hardware devices 
(disk, tape, tty, printer) or to operating system services (/dev/null, /dev/random).

Block special files usually are disk-like devices 
(where data can be accessed given a block number, and e.g. it is meaningful to have a block cache). 
All other devices are character special files. 
(Long ago the distinction was a different one: 
I/O to a character special file would be unbuffered, to a block special file buffered.)

The mknod command is what creates files of this type.

The argument following name specifies the type of file to make:

p   for a FIFO 
b   for a block (buffered) special file 
c   for a character (unbuffered) special file 

When making a block or character special file, the major and minor device numbers must be given 
after the file type (in decimal, or in octal with leading 0; the GNU version also allows hexadecimal 
with leading 0x). By default, the mode of created files is 0666 (`a/rw') minus the bits set in the umask.  


In /dev we find logical devices, created by the mknod command.
# mknod /dev/kbd c 11 0
# mknod /dev/sunmouse c 10 6
# mknod /dev/fb0 c 29 0


create a pipe in /dev called 'rworldlp'

# mknod /dev/rworldlp p; chmod a+rw /dev/rworldlp


If one cannot afford to buy extra disk space one can run the export and compress 
utilities simultaneously. 
This will prevent the need to get enough space for both the export file AND the 
compressed export file. Eg: 

	# Make a pipe
	mknod expdat.dmp p            # or mkfifo pipe
	# Start compress sucking on the pipe in background
	compress < expdat.dmp > expdat.dmp.Z &
	# Wait a second or two before kicking off the export
	sleep 5
	# Start the export
	exp scott/tiger file=expdat.dmp


Create a compressed export on the fly. 

        # create a named pipe
        mknod exp.pipe p
        # read the pipe - output to zip file in the background
        gzip < exp.pipe > scott.exp.gz &
        # feed the pipe
        exp userid=scott/tiger file=exp.pipe ...


Extended Example:
-----------------


# Load the cron environment
. ~/cronjobs/.profile.cron
##################################################################
compareVersionDBMS 10.2.0.1.0 10.2.0.2.0 10.2.0.3.0
##################################################################
wantedSchemas=""
wantedDatabase=""
wantedInputDir=""
##################################################################
if [ $# -ne 0 ]
then
  function showSyntaxParam
  { ( Comment  "\t[-db=<Database>] [-dir=<InputDir>] [-schema=<Schema,schema,schema>]"
    )
  }
  for param in $*
  do
    echo ${param} \
      | awk 'BEGIN {FS="="}{print $1,$2}' \
      | read flag value
    if [ "${flag}" != "-h" ] && [ "${value}" = "" ]
    then
      Error "Empty value for ${flag}"
      showSyntaxSystemParam
      showSyntaxParam
    else
      case ${flag} in
        -schema)     wantedSchemas=${value}                       ;;
        -dir)        wantedInputDir=${value}                     ;;
        *)           checkSystemParam ${param} || showSyntaxParam ;;
      esac
    fi
  done
fi
##################################################################
BlankLine
Comment "Selected options are:"
Comment "  Database         :   -db=${wantedDatabase}"
Comment "  Schema's         :   -schema=${wantedSchemas}"
Comment "  Input directory  :   -dir=${wantedInputDir}"
Line
##################################################################
if [ ${continue} = true ]
then
  if [ "${wantedDatabase}" = "" ]
  then
    BlankLine
    Error "No database specified"
    BlankLine
  else
    echo ${wantedDatabase} \
    | grep -i prod \
    | wc -l \
    | read prod
    if [ ${prod} -ne 0 ]
    then
      BlankLine
      Error "Production environment not allowed!!"
      BlankLine
    else
      moveLogFile ${wantedDatabase}
    fi
  fi
  #
  if [ "${wantedInputDir}" = "" ]
  then
    BlankLine
    Error "No input directory specified"
    BlankLine
  else
    if [ ! -d ${wantedInputDir} ]
    then
      BlankLine
      Error "Input directory ${wantedInputDir} doesn't exists"
      BlankLine
    fi
  fi
  #
  if [ "${wantedSchemas}" = "" ]
  then
    BlankLine
    Error "No schema's to load"
    BlankLine
  fi
fi
##################################################################
wantedSchemas=`echo ${wantedSchemas} | sed 's/,/ /g'`
if [ ${continue} = true ]
then
  for schema in ${wantedSchemas}
  do
    impPipeFile=${wantedInputDir}/${schema}.${currentUser}.load.pipe
    impLogFile=${wantedInputDir}/${schema}.${currentUser}.load.log
    impCompressFile=${wantedInputDir}/${schema}.data.Z
    #
    if [ ${continue} = true ]
    then
      Message "Check file permissions"
      if [ ! -w ${wantedInputDir} ]
      then
        Error "Unable to write in ${wantedInputDir}"
      fi
    fi
    #
    if [ ${continue} = true ]
    then
      rm ${impPipeFile}     2> /dev/null
      rm ${impLogFile}      2> /dev/null
    fi
    #
    if [ ${continue} = true ]
    then
      Message "Load schema ${schema} into database ${wantedDatabase}"
    fi
    #
    if [ ${continue} = true ]
    then
      Message "Create pipe for load"
      CmdCapture "mknod ${impPipeFile} p"
    fi
    #
    if [ ${continue} = true ]
    then
      if [ ! -f ${impCompressFile} ]
      then
        BlankLine
        Error "File not found: ${impCompressFile}"
        BlankLine
      fi
    fi
    #
    if [ ${continue} = true ]
    then
      Message "Start uncompression into background"
      uncompress -c < ${impCompressFile} > ${impPipeFile} &
      #
      Message "Start import"
      imp \"sys/change_on_install as sysdba\" file=${impPipeFile} log=${impLogFile} full=y statistics=always >/dev/null 2>/dev/null
      #
      Message     "Output of import"
      CmdCapture  "cat ${impLogFile}"
      #
      Message "Allowed warnings are:"
      Comment "  IMP-00017 IMP-00041 IMP-00003 ORA-14063 ORA-14048 ORA-02270"
      cat ${impLogFile} \
      | egrep '^ORA-|^ERROR|^IMP-' \
      | egrep -v 'IMP-00017|IMP-00041|IMP-00003|ORA-14063|ORA-14048|ORA-02270' \
      | wc -l \
      | read count
      if [ ${count} -ne 0 ]
      then
        Error "Problem with import !!"
      else
        Message "Import succesful"
      fi
    fi
    #
    rm ${impPipeFile} 2> /dev/null
    if [ ${continue} = true ]
    then
      Line
    fi
  done
fi

##################################################################
finish
##################################################################


18. Links:
==========

A symbolic link is a pointer or an alias to another file. The command 

# ln -s fromfile /other/directory/tolink


makes the file fromfile appear to exist at /other/directory/tolink simultaneously. 
The file is not copied, it merely appears to be a part of the file tree in two places. 
Symbolic links can be made to both files and directories. 

The usage of the link command is. 

%ln -s ActualFilename LinkFileName

Where -s indicates a symbolic link. ActualFilename is the name of the file which is to be linked to, 
and LinkFileName is the name by which the file should be known. 

You should use full paths in the command.


This example shows copying three files from a directory into the current working directory. 

    [2]%cp ~team/IntroProgs/MoreUltimateAnswer/more*
    [3]%ls -l more*
    -rw-rw-r--   1 mrblobby  mrblobby    632 Sep 21 18:12 moreultimateanswer.adb
    -rw-rw-r--   1 mrblobby  mrblobby   1218 Sep 21 18:19 moreultimatepack.adb
    -rw-rw-r--   1 mrblobby  mrblobby    784 Sep 21 18:16 moreultimatepack.ads

The three files take a total of 2634 bytes. The equivalent ln commands would be: 


    [2]%ln -s ~team/IntroProgs/MoreUltimateAnswer/moreultimateanswer.adb .
    [3]%ln -s ~team/IntroProgs/MoreUltimateAnswer/moreultimatepack.adb .
    [4]%ln -s ~team/IntroProgs/MoreUltimateAnswer/moreultimatepack.adb .
    [5]%ls -l
    lrwxrwxrwx   1  mrblobby  mrblobby     35 Sep 22 08:50 moreultimateanswer.adb ->
                     /users/team/IntroProgs/MorUltimateAnswer/moreultimateanswer.adb
    lrwxrwxrwx   1  mrblobby  mrblobby     37 Sep 22 08:49 moreultimatepack.adb ->                       
                     /users/team/IntroProgs/MorUltimateAnswer/moreultimatepack.adb
    lrwxrwxrwx   1   mrblobby  mrblobby    37 Sep 22 08:50 moreultimatepack.ads ->
                     /users/team/IntroProgs/MorUltimateAnswer/moreultimatepack.ads


     The ln utility creates a new directory entry (linked file) which has the
     same modes as the original file.  It is useful for maintaining multiple
     copies of a file in many places at once without using up storage for the
     copies; instead, a link ``points'' to the original copy.  There are two
     types of links; hard links and symbolic links.  How a link points to a
     file is one of the differences between a hard and symbolic link.

     By default, ln makes ``hard'' links.  A hard link to a file is indistin-
     guishable from the original directory entry; any changes to a file are
     effectively independent of the name used to reference the file.  Hard
     links may not normally refer to directories and may not span file sys-
     tems.

     A symbolic link contains the name of the file to which it is linked.  The
     referenced file is used when an open(2) operation is performed on the
     link.  A stat(2) on a symbolic link will return the linked-to file; an
     lstat(2) must be done to obtain information about the link.  The
     readlink(2) call may be used to read the contents of a symbolic link.
     Symbolic links may span file systems, refer to directories, and refer to
     non-existent files.


19. Relink van Oracle:
======================

info:

  showrev -p
  pkginfo -i

relink:

  mk -f $ORACLE_HOME/rdbms/lib/ins_rdbms.mk install
  mk -f $ORACLE_HOME/svrmgr/lib/ins_svrmgr.mk install
  mk -f $ORACLE_HOME/network/lib/ins_network.mk install


20. trace:
==========

20.1 truss on Solaris:
----------------------

  truss -aef -o /tmp/trace svrmgrl

To trace what a Unix process is doing enter: 

  truss -rall -wall -p <PID>
  truss -p $ lsnrctl dbsnmp_start

NOTE: The "truss" command works on SUN and Sequent. Use "tusc" on HP-UX, "strace" on Linux, 
"trace" on SCO Unix or call your system administrator to find the equivalent command on your system. 
Monitor your Unix system: 

Solaris:

Truss is used to trace the system/library calls (not user calls) and signals made/received 
by a new or existing process. It sends the output to stderr. 


NOTE: Trussing a process throttles that process to your display speed. Use -wall and -rall sparingly. 
Truss usage 

    truss  -a  -e  -f  -rall  -wall  -p  
    truss  -a  -e  -f  -rall  -wall  

    -a        Show arguments passed to the exec system calls
    -e        Show environment variables passed to the exec system calls
    -f        Show forked processes 
                (they will have a different pid: in column 1)
    -rall     Show all read data (default is 32 bytes)
    -wall     Show all written data (default is 32 bytes)
    -p        Hook to an existing process (must be owner or root)
    <program> Specify a program to run
  
Truss examples 
  # truss -rall -wall -f -p <PID>
  # truss -rall -wall lsnrctl start
  # truss -aef lsnrctl dbsnmp_start


20.2 syscalls command on AIX:
-----------------------------

1. syscalls Command 

Purpose 
Provides system call tracing and counting for specific processes and the system. 

Syntax 
To Create or Destroy Buffer: 
syscalls [ [ -enable  bytes ]| -disable  ] 

To Print System Call Counts: 
syscalls -c 

To Print System Call Events or Start Tracing: 
syscalls [ -o  filename ] [ -t  ] { [ [ -p pid ] -start | -stop  ] | -x  program } 

Description 
The syscalls (system call tracing) command, captures system call entry and exit events by individual processes 
or all processes on the system. The syscalls command can also maintain counts for all system calls 
made over long periods of time. 

Notes: 
System call events are logged in a shared-memory trace buffer. The same shared memory identifier may be used 
by other processes resulting in a collision. In such circumstances, the -enable flag needs to be issued. 
The syscalls command does not use the trace daemon. 
The system crashes if ipcrm -M sharedmemid is run after syscalls has been run. 
Run stem -shmkill instead of running ipcrm -M to remove the shared memory segment.

Flags 
-c  Prints a summary of system call counts for all processes. The counters are not reset.  

-disable  Destroys the system call buffer and disables system call tracing and counting.  

-enable bytes  Creates the system call trace buffer. If this flag is not used, the syscalls command 
 creates a buffer of the default size of 819,200 bytes. Use this flag if events are not being logged 
 in the buffer. This is the result of a collision with another process using the same shared memory buffer ID.  

-o filename  Prints output to filename rather than standard out.  

-p pid  When used with the -start flag, only events for processes with this pid will be logged 
   in the syscalls buffer. When used with the -stop option, syscalls filters the data in the buffer 
   and only prints output for this pid.  

-start  Resets the trace buffer pointer. This option enables the buffer if it does not exist and resets 
        the counters to zero.  

-stop  Stops the logging of system call events and prints the contents of the buffer.  

-t  Prints the time associated with each system call event alongside the event.  

-x program  Runs program while logging events for only that process. The buffer is enabled if needed.  


Security 
Access Control: You must be root or a member of the perf group to run this command. 

Examples 
To collect system calls for a particular program, enter: 
syscalls -x /bin/ps
Output similar to the following appears: 
   PID    TTY  TIME CMD
 19841  pts/4  0:01 /bin/ksh 
 23715  pts/4  0:00 syscalls -x /bin/ps 
 30720  pts/4  0:00 /bin/ps 
 34972  pts/4  0:01 ksh
   PID   System Call          
 30720           .kfork  Exit , return=0  Call preceded tracing.
 30720          .getpid  () = 30720
 30720       .sigaction  (2, 2ff7eba8, 2ff7ebbc) = 0
 30720       .sigaction  (3, 2ff7eba8, 2ff7ebcc) = 0
 30720     .sigprocmask  (0, 2ff7ebac, 2ff7ebdc) = 0
 30720       .sigaction  (20, 2ff7eba8, 2ff7ebe8) = 0
 30720           .kfork  () = 31233
 30720        .kwaitpid  (2ff7ebfc, 31233, 0, 0) = 31233
 30720       .sigaction  (2, 2ff7ebbc, 0) = 0
 30720       .sigaction  (3, 2ff7ebcc, 0) = 0
 30720       .sigaction  (20, 2ff7ebe8, 0) = 0
 30720     .sigprocmask  (2, 2ff7ebdc, 0) = 0
 30720         .getuidx  (4) = 0
 30720         .getuidx  (2) = 0
 30720         .getuidx  (1) = 0
 30720         .getgidx  (4) = 0
 30720         .getgidx  (2) = 0
 30720         .getgidx  (1) = 0
 30720           ._load  NoFormat, (0x2ff7ef54, 0x0, 0x0, 0x2ff7ff58) = 537227760
 30720            .sbrk  (65536) = 537235456
 30720          .getpid  () = 30720

To produce a count of system calls made by all processes, enter: 
syscalls -start
followed by entering: 
syscalls -c
Output similar to the following appears: 
 System Call Counts for all processes
       5041      .lseek
       4950      .kreadv
        744      .sigaction
        366      .close
        338      .sbrk
        190      .kioctl
        120      .getuidx
        116      .kwritev
        108      .kfcntl
        105      .getgidx
         95      .kwaitpid
         92      .gettimer
         92      .select
         70      .getpid
         70      .sigprocmask
         52      .execve
         51      ._exit
         51      .kfork
         35      .open
         35      ._load
         33      .pipe
         33      .incinterval
         28      .sigreturn
         27      .access
         16      .brk 
         15      .times
         15      .privcheck
         15      .gettimerid
         10      .statx
          9      .STEM_R10string
          4      .sysconfig
          3      .P2counters_accum
          3      .shmget
          3      .shmat
          2      .setpgid
          2      .shmctl
          2      .kioctl
          1      .Patch_Demux_Addr_2
          1      .Patch_Demux_Addr_High
          1      .STEM_R3R4string
          1      .shmdt
          1      .Stem_KEX_copy_demux_entry
          1      .STEM_R3R4string
          1      .Patch_Demux_Addr_1
          1      .pause
          1      .accessx
Files 
/usr/bin/syscalls  Contains the syscalls command.  


20.3 truss command on AIX:
--------------------------

AIX 5.1,5.2,5.3


The truss command is also available for SVR4 UNIX-based environments. This command is useful for tracing 
system calls in one or more processes. In AIX 5.2, all base system call parameter types are now recognized. 
In AIX 5.1, only about 40 system calls were recognized. 

Truss is a /proc based debugging tool that executes and traces a command, or traces an existing process. 
It prints names of all system calls made with their arguments and return code. System call parameters are 
displayed symbolically. It prints information about all signals received by a process. The AIX 5.2 version 
supports library calls tracing. For each call, it prints parameters and return codes. 
It can also trace a subset of libraries and a subset of routines in a given library. The timestamps on each line 
are also supported.

In AIX 5.2, truss is packaged with bos.sysmgt.serv_aid, which is installable from the AIX base installation media. 
See the command reference for details and examples, or use the information below. 

-a Displays the parameter strings that are passed in each executed system call. 

# truss -a  sleep

execve("/usr/bin/sleep", 0x2FF22980, 0x2FF22988)  argc: 1
argv: sleep
sbrk(0x00000000)                                = 0x200007A4
sbrk(0x00010010)                                = 0x200007B0
getuidx(4)                                             = 0
.
.
__loadx(0x01000080, 0x2FF1E790, 0x00003E80, 0x2FF22720, 0x00000000) = 
   0xD0077130 access("/usr/lib/nls/msg/en_US/sleep.cat", 0)   = 0
_getpid()                                       = 31196
open("/usr/lib/nls/msg/en_US/sleep.cat", O_RDONLY) = 3
kioctl(3, 22528, 0x00000000, 0x00000000)        Err#25 ENOTTY
kfcntl(3, F_SETFD, 0x00000001)                  = 0
kioctl(3, 22528, 0x00000000, 0x00000000)        Err#25 ENOTTY
kread(3, "\0\001 �\001\001 I S O 8".., 4096)    = 123
lseek(3, 0, 1)                                  = 123
lseek(3, 0, 1)                                  = 123
lseek(3, 0, 1)                     	             = 123
_getpid()                         	             = 31196
lseek(3, 0, 1)                     	             = 123
Usage: sleep Seconds
kwrite(2, " U s a g e :   s l e e p".., 21) 	    = 21
kfcntl(1, F_GETFL, 0x00000000)           	    = 2
kfcntl(2, F_GETFL, 0x00000000)           	    = 2
_exit(2)


-c Counts traced system calls, faults, and signals rather than displaying trace results line by line. 
A summary report is produced after the traced command terminates or when truss is interrupted. 
If the -f flag is also used, the counts include all traced Syscalls, Faults, and Signals for child processes. 
 
# truss -c ls

	syscall			seconds   	calls  errors
execve				.00		1
__loadx			.00	      17
_exit				.00		1
close				.00		2
kwrite				.00		5
lseek                     	.00		1
setpid                   	.00	       1
getuidx                   	.00	      19
getdirent                 	.00	       3
kioctl                    	.00	       3
open                      	.00	       1
statx                     	.00	       2
getgidx                   	.00	      18
sbrk                      	.00	       4
access                    	.00	       1
kfcntl                    	.00	       6
                         	----	     ---    ---
sys totals:               	.01	      85      0
usr time:                 	.00
elapsed:                  	.01


More truss examples:
--------------------

truss -o /tmp/tst -p 307214

root@zd93l14:/tmp#cat tst
                                                = 0
_nsleep(0x4128B8E0, 0x4128B958)                 = 0
_nsleep(0x4128B8E0, 0x4128B958)                 = 0
_nsleep(0x4128B8E0, 0x4128B958)                 = 0
_nsleep(0x4128B8E0, 0x4128B958)                 = 0
thread_tsleep(0, 0xF033159C, 0x00000000, 0x43548E38) = 0
thread_tsleep(0, 0xF0331594, 0x00000000, 0x434C3E38) = 0
thread_tsleep(0, 0xF033158C, 0x00000000, 0x4343FE38) = 0
thread_tsleep(0, 0xF0331584, 0x00000000, 0x433BBE38) = 0
thread_tsleep(0, 0xF0331574, 0x00000000, 0x432B2E38) = 0
thread_tsleep(0, 0xF033156C, 0x00000000, 0x4322EE38) = 0
thread_tsleep(0, 0xF0331564, 0x00000000, 0x431AAE38) = 0
thread_tsleep(0, 0xF0331554, 0x00000000, 0x42F99E38) = 0
thread_tsleep(0, 0xF033154C, 0x00000000, 0x4301DE38) = 0
thread_tsleep(0, 0xF0331534, 0x00000000, 0x42E90E38) = 0
thread_tsleep(0, 0xF033152C, 0x00000000, 0x42E0CE38) = 0
thread_tsleep(0, 0xF033157C, 0x00000000, 0x43337E38) = 0
thread_tsleep(0, 0xF0331544, 0x00000000, 0x42F14E38) = 0
                                                = 0
thread_tsleep(0, 0xF033153C, 0x00000000, 0x42D03E38) = 0
_nsleep(0x4128B8E0, 0x4128B958)                 = 0


20.4 man pages for truss AIX:
-----------------------------

Purpose

Traces a process's system calls, dynamically loaded user level function calls,
received signals, and incurred machine faults.

Syntax

truss [ -f] [ -c] [ -a] [ -l ] [ -d ] [ -D ] [ -e] [ -i] [ { -t | -x} [!]
Syscall [...] ] [ -s [!] Signal [...] ] [ { -m }[!] Fault [...]] [ { -r | -w}
[!] FileDescriptor [...] ] [ { -u } [!]LibraryName [...]:: [!]FunctionName [ ...
] ] [ -o Outfile] {Command| -p pid [. . .]}

Description

The truss command executes a specified command, or attaches to listed process
IDs, and produces a trace of the system calls, received signals, and machine
faults a process incurs. Each line of the trace output reports either the Fault
or Signal name, or the Syscall name with parameters and return values. The
subroutines defined in system libraries are not necessarily the exact system
calls made to the kernel. The truss command does not report these subroutines,
but rather, the underlying system calls they make. When possible, system call
parameters are displayed symbolically using definitions from relevant system
header files. For path name pointer parameters, truss displays the string being
pointed to. By default, undefined system calls are displayed with their name,
all eight possible argments and the return value in hexadecimal format.

When the -o flag is used with truss, or if standard error is redirected to a
non-terminal file, truss ignores the hangup, interrupt, and signals processes.
This facilitates the tracing of interactive programs which catch interrupt and
quit signals from the terminal.

If the trace output remains directed to the terminal, or if existing processes
are traced (using the -p flag), then truss responds to hangup, interrupt, and
quit signals by releasing all traced processes and exiting. This enables the
user to terminate excessive trace output and to release previously existing
processes. Released processes continue to function normally.

Flags

-a Displays the parameter strings which are passed in each executed system call.

-c Counts traced system calls, faults, and signals rather than displaying trace
results line by line. A summary report is produced after the traced command
terminates or when truss is interrupted. If the -f flag is also used, the counts
include all traced Syscalls, Faults, and Signals for child processes.

-d A timestamp will be included with each line of output. Time displayed is in
seconds relative to the beginning of the trace. The first line of the trace
output will show the base time from which the individual time stamps are
measured. By default timestamps are not displayed.

-D Delta time is displayed on each line of output. The delta time represents the
elapsed time for the LWP that incurred the event since the last reported event
incurred by that thread. By default delta times are not displayed.

-e Displays the environment strings which are passed in each executed system
call.

-f Follows all children created by the fork system call and includes their
signals, faults, and system calls in the trace output. Normally, only the
first-level command or process is traced. When the -f flag is specified, the
process id is included with each line of trace output to show which process
executed the system call or received the signal.

-i Keeps interruptible sleeping system calls from being displayed. Certain
system calls on terminal devices or pipes, such as open and kread, can sleep for
indefinite periods and are interruptible. Normally, truss reports such sleeping
system calls if they remain asleep for more than one second. The system call is
then reported a second time when it completes. The -i flag causes such system
calls to be reported only once, upon completion.

-l Display the id (thread id) of the responsible LWP process along with truss
output. By default LWP id is not displayed in the output.

-m [!]Fault Traces the machine faults in the process. Machine faults to trace
must be separated from each other by a comma. Faults may be specified by name or
number (see the sys/procfs.h header file). If the list begins with the "!"
symbol, the specified faults are excluded from being traced and are not
displayed with the trace output. The default is -mall -m!fltpage.

-o Outfile Designates the file to be used for the trace output. By default, the
output goes to standard error.

-p Interprets the parameters to truss as a list of process ids for an existing
process rather than as a command to be executed. truss takes control of each
process and begins tracing it, provided that the user id and group id of the
process match those of the user or that the user is a privileged user.

-r [!] FileDescriptor Displays the full contents of the I/O buffer for each read
on any of the specified file descriptors. The output is formatted 32 bytes per
line and shows each byte either as an ASCII character (preceded by one blank) or
as a two-character C language escape sequence for control characters, such as
horizontal tab (\t) and newline (\n). If ASCII interpretation is not possible,
the byte is shown in two-character hexadecimal representation. The first 16
bytes of the I/O buffer for each traced read are shown, even in the absence of
the -r flag. The default is -r!all.

-s [!] Signal Permits listing Signals to trace or exclude. Those signals
specified in a list (separated by a comma) are traced. The trace output reports
the receipt of each specified signal even if the signal is being ignored, but
not blocked, by the process. Blocked signals are not received until the process
releases them. Signals may be specified by name or number (see sys/signal.h). If
the list begins with the "!" symbol, the listed signals are excluded from being
displayed with the trace output. The default is -s all.

-t [!] Syscall Includes or excludes system calls from the trace process. System
calls to be traced must be specified in a list and separated by commas. If the
list begins with an "!" symbol, the specified system calls are excluded from the
trace output. The default is -tall.

-u [!] [LibraryName [...]::[!]FunctionName [...] ]

Traces dynamically loaded user level function calls from user libraries. The
LibraryName is a comma-separated list of library names. The FunctionName is a
comma-separated list of function names. In both cases the names can include
name-matching metacharacters *, ?, [] with the same meanings as interpreted by
the shell but as applied to the library/function name spaces, and not to files.

A leading ! on either list specifies an exclusion list of names of libraries or
functions not to be traced. Excluding a library excludes all functions in that
library. Any function list following a library exclusion list is ignored.
Multiple -u options may be specified and they are honored left-to-right. By
default no library/function calls are traced.

-w [!] FileDescriptor Displays the contents of the I/O buffer for each write on
any of the listed file descriptors (see -r). The default is -w!all.

-x [!] Syscall Displays data from the specified parameters of traced sytem calls
in raw format, usually hexadecimal, rather than symbolically. The default is
-x!all.

Examples

  1. To produce a trace of the find command on the terminal, type:

     truss find . -print >find.out

  2. To trace the lseek, close, statx, and open system calls, type:

     truss -t lseek,close,statx,open find . -print > find.out

  3. To display thread id along with regular output for find command, enter:
     truss -l find . -print >find.out

  4. To display timestamps along with regular output for find command, enter:
     truss -d find . -print >find.out

  5. To display delta times along with regular output for find command, enter:
     truss -D find . -print >find.out

  6. To trace the malloc() function call and exclude the strlen() function call
     in the libc.a library while running the ls command, enter:
     truss -u libc.a::malloc,!strlen ls

  7. To trace all function calls in the libc.a library with names starting with
     "m" while running the ls command, enter:
     truss -u libc.a::m*,!strlen ls

  8. To trace all function calls from the library libcurses.a and exclude calls
     from libc.a while running executable foo, enter:
     truss -u libcurses.a,!libc.a::* foo

  9. To trace the refresh() function call from libcurses.a and the malloc()
     function call from libc.a while running the executable foo, enter:
      truss -u libc.a::malloc -u libcurses.a::refresh foo


20.5 Note: How to trace an AIX machine: 
---------------------------------------

The trace facility and commands are provided as part of the Software Trace Service Aids fileset
named bos.sysmgt.trace.

To see if this fileset is installed, use the following command:

# lslpp -l | grep bos.sysmgt.trace


Taking a trace:
---------------

The events traced are referenced by hook identifiers.
Each hook ID uniquely refers to a particular activity that can be traced.

When tracing, you can select the hook IDs of interest and exclude others that are
not relevant to your problem. A trace hook ID is a 3 digit hexidecimal number
that identifies an event being traced. 
Trace hook IDs are defined in the "/usr/include/sys/trchkid.h" file.

The currently defined trace hook IDs can be listed using the trcrpt command:

# trcrpt -j | sort | pg

001 TRACE ON
002 TRACE OFF
003 TRACE HEADER
004 TRACEID IS ZERO
005 LOGFILE WRAPAROUND
006 TRACEBUFFER WRAPAROUND
..
..

The trace daemon configures a trace session and starts the collection of system events. 
The data collected by the trace function is recorded in the trace log. A report from the trace log 
can be generated with the trcrpt command.

When invoked with the  -a, -x, or -X flags, the trace daemon is run asynchronously (i.e. as a background task).
Otherwise, it is run interactively and prompts you for subcommands.


Some trace examples:

# trace -adf -C all -r PURR -o trace.raw
# trace -Jfop fact proc procd filephys filepfsv filepvl filepvld locks -A786578 -Pp -a
# trace -Jfop fact proc procd filephys filepfsv filepvl filepvld locks -Pp -a
# trace -Jfop fact proc procd filephys filepfsv filepvl filepvld locks -Pp -a


Some trcrpt examples:

Examples
       1    To format the trace log file and print the result, enter:

            trcrpt | qprt
       2    To send a trace report to the /tmp/newfile file, enter:

            trcrpt -o /tmp/newfile
       3    To display process IDs and exec path names in the trace report, enter:

            trcrpt pid=on,exec=on -O /tmp/newfile 
       4    To create trace ID histogram data, enter:

            trcrpt -O hist=on
       5    To produce a list of all event groups, enter:

            trcrpt -G
            The format of this report is shown under the trcevgrp command.
       6    To generate back-to-back LMT reports from the common and rare buffers, specify:

            trcrpt -M all
       7    If, in the above example, the LMT files reside at /tmp/mydir, and we want the LMT traces to be merged, 
            specify:

            trcrpt -m -M all:/tmp/mydir
       8    To merge the system trace with the scdisk.hdisk0 component trace, specify:

            trcrpt -m -l scdisk.hdisk0 /var/adm/ras/trcfile
       9    To merge LMT with the system trace while not eliminating duplicate events, specify:

            trcrpt -O removedups=off -m -M all /var/adm/ras/trcfile
       10   To merge all component traces in /tmp/mydir with the LMT traces in the default LMT directory 
            while showing the source file for each trace event, specify:

            trcrpt -O filename=on -m -M all /tmp/mydir
            Note: This is equivalent to:

            trcrpt -O filename=on -m -M all -l all:/tmp/mydir

            Note: If the traces are from a 64-bit kernel, duplicate entries will be removed. However, 
            on the 32-bit kernel,
            duplicate entries will not be removed since we do not know the CPU IDs of the entries in the 
            components traces.


Another example of the usage of trace:
--------------------------------------


>> Obtaining a Sample Trace File

Trace data accumulates rapidly. We want to bracket the data collection as closely around the area of interest 
as possible. One technique for doing this is to issue several commands on the same command line. For example:

$ trace -a -k "20e,20f" -o ./trcraw ; cp ../bin/track /tmp/junk ; trcstop

captures the execution of the cp command. We have used two features of the trace command. The -k "20e,20f" option 
suppresses the collection of events from the lockl and unlockl functions. These calls are numerous and add volume 
to the report without adding understanding at the level we're interested in. The -o ./trc_raw option causes the 
raw trace output file to be written in our local directory.

Note: This example is more educational if the input file is not already cached in system memory. Choose as the source 
file any file that is about 50KB and has not been touched recently.


>> Formatting the Sample Trace

We use the following form of the trcrpt command for our report:

$ trcrpt -O "exec=on,pid=on" trcraw > /tmp/cp.rpt

This reports both the fully qualified name of the file that is execed and the process ID that is assigned to it.

A quick look at the report file shows us that there are numerous VMM page assign and delete events in the trace, 
like the following sequence: 

1B1 ksh            8525          0.003109888       0.162816                   VMM page delete:      V.S=00
00.150E ppage=1F7F
                                                                               delete_in_progress proce
ss_private working_storage

1B0 ksh            8525          0.003141376       0.031488                   VMM page assign:      V.S=00
00.2F33 ppage=1F7F                                                           delete_in_progress process_private working_
storage

We are not interested in this level of VMM activity detail at the moment, so we reformat the trace with:

$ trcrpt -k "1b0,1b1" -O "exec=on,pid=on" trcraw > cp.rpt2

The -k "1b0,1b1" option suppresses the unwanted VMM events in the formatted output. It saves us from having 
to retrace the workload to suppress unwanted events. We could have used the -k function of trcrpt instead of 
that of the trace command to suppress the lockl and unlockl events, if we had believed that we might need 
to look at the lock activity at some point. If we had been interested in only a small set of events, 
we could have specified -d "hookid1,hookid2" to produce a report with only those events. Since the hook ID 
is the left-most column of the report, you can quickly compile a list of hooks to include or exclude.

A comprehensive list of Trace hook IDs is defined in /usr/include/sys/trchkid.h.

>> Reading a Trace Report

The header of the trace report tells you when and where the trace was taken, as well as the command that was 
used to produce it:

Fri Nov 19 12:12:49 1993
System: AIX ptool Node: 3
Machine: 000168281000
Internet Address: 00000000 0.0.0.0
trace -ak 20e 20f -o -o ./trc_raw

The body of the report, if displayed in a small enough font, looks as follows:

ID  PROCESS NAME   PID           ELAPSED_SEC     DELTA_MSEC   APPL    SYSCALL KERNEL  INTERRUPT
101 ksh            8525          0.005833472       0.107008           kfork
101 ksh            7214          0.012820224       0.031744           execve
134 cp             7214          0.014451456       0.030464           exec cp ../bin/trk/junk

In cp.rpt you can see the following phenomena:

The fork, exec, and page fault activities of the cp process 
The opening of the input file for reading and the creation of the /tmp/junk file 
The successive read/write system calls to accomplish the copy 
The process cp becoming blocked while waiting for I/O completion, and the wait process being dispatched 
How logical-volume requests are translated to physical-volume requests 
The files are mapped rather than buffered in traditional kernel buffers, and the read accesses cause page faults that must be resolved by the Virtual Memory Manager. 
The Virtual Memory Manager senses sequential access and begins to prefetch the file pages. 
The size of the prefetch becomes larger as sequential access continues. 

When possible, the disk device driver coalesces multiple file requests into one I/O request to the drive.
The trace output looks a little overwhelming at first. This is a good example to use as a learning aid. 
If you can discern the activities described, you are well on your way to being able to use the trace facility 
to diagnose system-performance problems.

>> Filtering of the Trace Report

The full detail of the trace data may not be required. You can choose specific events of interest to be shown. 
For example, it is sometimes useful to find the number of times a certain event occurred. To answer the question 
"How many opens occurred in the copy example?" first find the event ID for the open system call. 
This can be done as follows:

$ trcrpt -j | grep -i open

You should be able to see that event ID 15b is the open event. Now, process the data from the copy example as follows:

$ trcrpt -d 15b -O "exec=on" trc_raw

The report is written to standard output, and you can determine the number of open subroutines that occurred. 
If you want to see only the open subroutines that were performed by the cp process, run the report command 
again using the following:

$ trcrpt -d 15b -p cp -O "exec=on" trc_raw

$ trcrpt -o /tmp/newfile


A Wrapper around trace:
-----------------------

Simple instructions for using the AIX trace facility

>> Five aix commands are used: 

-trace 
-trcon 
-trcoff 
-trcstop 
-trcrpt 

These are described in AIX Commands Reference, Volume 5, but hopefully you won't have to dig into that. 
Scripts to download
I've provided wrappers for the trace and trcrpt commands since there are various command-line parameters to specify. 

-atrace 
-atrcrpt 

>> Contents atrace:

# To change from the default trace file, set TRCFILE to
# the name of the raw trace file name here; this should 
# match the name of the raw trace file in atrcrpt.
# Don't do this on AIX 4.3.3 ML 10, where you'll need
# to use the default trace file, /usr/adm/ras/trcfile
#TRCFILE="-o /tmp/raw"

# trace categories not to collect
IGNORE_VMM="1b0,1b1,1b2,1b3,1b5,1b7,1b8,1b9,1ba,1bb,1bc,1bd,1be"
IGNORE_LOCK=20e,20f
IGNORE_PCI=2e6,2e7,2e8
IGNORE_SCSI=221,223
IGNORE_OTHER=100,10b,116,119,11f,180,234,254,2dc,402,405,469,7ff

IGNORE="$IGNORE_VMM,$IGNORE_LOCK,$IGNORE_PCI,$IGNORE_SCSI,$IGNORE_LVM,$IGNORE_OTHER"

trace -a -d -k $IGNORE $TRCFILE

>> Contents atrcrpt:

# To change from the default trace file, set TRCFILE to
# the name of the raw trace file name here; this should 
# match the name of the raw trace file in atrace.
# Don't do this on AIX 4.3.3 ML 10, where you'll need
# to use the default trace file, /usr/adm/ras/trcfile
# TRCFILE=/tmp/raw

# edit formatted trace file name here
FMTFILE=/tmp/fmt

trcrpt -O pid=on,tid=on,timestamp=1 $TRCFILE >$FMTFILE


Setup instructions

edit atrace and atrcrpt and ensure that names of files for raw and formatted trace are appropriate 
Please see the comments in the scripts about 4.3.3 ML 10 being broken for trcrpt, such that the default file name 
needs to be used. You may find that specifying non-default filenames does not have the desired effect. 
make atrace and atrcrpt executable via chmod 

Data collection

./atrace                 (this is my wrapper for the trace command)
trcon
(at this point we're collecting the trace; wait for a bit of time to
trace whatever the failure is)
trcoff
trcstop
./atrcrpt                (this is my wrapper for formatting the report)

After running atrcrpt, the formatted report will be in file /tmp/fmt. 

Sample section of formatted trace
Note that failing system calls generally show "error Esomething" in the race, as highlighted below. 
The second column is the process id and the third column is the thread id. Once you see something of interest 
in the trace, you may want to use grep to pull out all records for that process id, since in general the trace 
is interleaved with the activity of all the processes in the system. 

101 14690    19239              statx LR = D0174110
107 14690    19239                      lookuppn: /usr/HTTPServer/htdocs/en_US/manual/ibm/index.htmlxxxxxxxxxxx
107 14690    19239                      lookuppn: file not found
104 14690    19239              return from statx. error ENOENT [79 usec]
101 14690    19239              statx LR = D0174110
107 14690    19239                      lookuppn: /usr/HTTPServer/htdocs/en_US/manual/ibm
104 14690    19239              return from statx [36 usec]


Note about an AIX trace on Websphere:
-------------------------------------

In addition to the WebSpherer MQ trace, WebSphere MQ for AIXr users can use the standard AIX system trace. 
AIX system tracing is a two-step process: 

>> Gathering the data 
>> Formatting the results 

WebSphere MQ uses two trace hook identifiers: 

X'30D' 
This event is recorded by WebSphere MQ on entry to or exit from a subroutine. 
X'30E' 
This event is recorded by WebSphere MQ to trace data such as that being sent or received across a 
communications network. Trace provides detailed execution tracing to help you to analyze problems. 
IBMr service support personnel might ask for a problem to be re-created with trace enabled. The files produced 
by trace can be very large so it is important to qualify a trace, where possible. For example, you can optionally 
qualify a trace by time and by component.

There are two ways to run trace: 

>> Interactively. 

The following sequence of commands runs an interactive trace on the program myprog and ends the trace. 

trace -j30D,30E -o trace.file
->!myprog
->q

>> Asynchronously. 

The following sequence of commands runs an asynchronous trace on the program myprog and ends the trace. 
trace -a -j30D,30E -o trace.file
myprog
trcstop

You can format the trace file with the command: 
trcrpt -t /usr/mqm/lib/amqtrc.fmt trace.file > report.file
report.file is the name of the file where you want to put the formatted trace output.


20.6 Nice example: Tracing with truss on AIX:
---------------------------------------------

Application tracing displays the calls that an application makes to external libraries and the kernel. 
These calls give the application access to the network, the file system, and the display. By watching 
the calls and their results, you can get some idea of what the application "expects", 
which can lead to a solution.

Each UNIXr system provides its own commands for tracing. This article introduces you to truss, which Solaris 
and AIXr support. On Linuxr, you perform tracing with the strace command. Although the command-line parameters 
might be slightly different, application tracing on other UNIX flavors might go by the names ptrace, 
ktrace, trace, and tusc.

>> A classic file permissions problem

One class of problems that plagues systems administrators is file permissions. An application likely has to open 
certain files to do its work. If the open operation fails, the application should let the administrator know. 
However, developers often forget to check the result of functions or, to add to the confusion, perform the check, 
but don't adequately handle the error. For example, here's the output of an application that's failing to open:

$ ./openapp
This should never happen!


After running the fictitious openapp application, I received the unhelpful (and false) error message, 
This should never happen!. This is a perfect time to introduce truss. Listing 1 shows the same application 
run under the truss command, which shows all the function calls that this program made to outside libraries.


Listing 1. Openapp run under truss

$ truss ./openapp
execve("openapp", 0xFFBFFDEC, 0xFFBFFDF4)  argc = 1
getcwd("/export/home/sean", 1015)               = 0
stat("/export/home/sean/openapp", 0xFFBFFBC8)   = 0
open("/var/ld/ld.config", O_RDONLY)             Err#2 ENOENT
stat("/opt/csw/lib/libc.so.1", 0xFFBFF6F8)      Err#2 ENOENT
stat("/lib/libc.so.1", 0xFFBFF6F8)              = 0
resolvepath("/lib/libc.so.1", "/lib/libc.so.1", 1023) = 14
open("/lib/libc.so.1", O_RDONLY)                = 3
memcntl(0xFF280000, 139692, MC_ADVISE, MADV_WILLNEED, 0, 0) = 0
close(3)                                        = 0
getcontext(0xFFBFF8C0)
getrlimit(RLIMIT_STACK, 0xFFBFF8A0)             = 0
getpid()                                        = 7895 [7894]
setustack(0xFF3A2088)
open("/etc/configfile", O_RDONLY)               Err#13 EACCES [file_dac_read]
ioctl(1, TCGETA, 0xFFBFEF14)                    = 0


fstat64(1, 0xFFBFEE30)                          = 0
stat("/platform/SUNW,Sun-Blade-100/lib/libc_psr.so.1", 0xFFBFEAB0) = 0
open("/platform/SUNW,Sun-Blade-100/lib/libc_psr.so.1", O_RDONLY) = 3
close(3)                                        = 0
This should never happen!
write(1, " T h i s   s h o u l d  ".., 26)      = 26
_exit(3)
 

Each line of the output represents a function call that the application made along with the return value, 
if applicable. (You don't need to know each function call, but for more information, you can call up the 
man page for the function, such as with the command man open.) To find the call that is potentially 
causing the problem, it's often easiest to start at the end (or as close as possible to where 
the problems start). For example, you know that the application outputs This should never happen!, 
which appears near the end of the output. Chances are that if you find this message and work your way up 
through the truss command output, you'll come across the problem.

Scrolling up from the error message, notice the line beginning with open("/etc/configfile"..., 
which not only looks relevant but also seems to return an error of Err#13 EACCES. Looking at the man page 
for the open() function (with man open), it's evident that the purpose of the function is to open a file 
-- in this case, /etc/configfile -- and that a return value of EACCES means that the problem is related 
to permissions. Sure enough, a look at /etc/configfile shows that the user doesn't have permissions to read 
the file. A quick chmod later, and the application is running properly.

The output of Listing 1 shows two other calls, open() and stat(), that return an error. Many of the calls 
toward the beginning of the application, including the other two errors, are added by the operating system 
as it runs the application. Only experience will tell when the errors are benign and when they aren't. 
In this case, the two errors and the three lines that follow them are trying to find the location of libc.so.1, 
which they eventually do. You'll see more about shared library problems later.


>> The application doesn't start

Sometimes, an application fails to start properly; but rather than exiting, it just hangs. This behavior is often 
a symptom of contention for a resource (such as two processes competing for a file lock), or the application 
is looking for something that is not coming back. This latter class of problems could be almost anything, 
such as a name lookup that's taking a long time to resolve, or a file that should be found in a certain spot but 
isn't there. In any case, watching the application under truss should reveal the culprit.

While the first code example showed an obvious link between the system call causing the problem and the file, 
the example you're about to see requires a bit more sleuthing. Listing 2 shows a misbehaving application 
called Getlock run under truss.


Listing 2. Getlock run under truss

$ truss ./getlock
execve("getlock", 0xFFBFFDFC, 0xFFBFFE04)  argc = 1
getcwd("/export/home/sean", 1015)               = 0
resolvepath("/export/home/sean/getlock", "/export/home/sean/getlock", 1023) = 25
resolvepath("/usr/lib/ld.so.1", "/lib/ld.so.1", 1023) = 12
stat("/export/home/sean/getlock", 0xFFBFFBD8)   = 0
open("/var/ld/ld.config", O_RDONLY)             Err#2 ENOENT
stat("/opt/csw/lib/libc.so.1", 0xFFBFF708)      Err#2 ENOENT
stat("/lib/libc.so.1", 0xFFBFF708)              = 0
resolvepath("/lib/libc.so.1", "/lib/libc.so.1", 1023) = 14
open("/lib/libc.so.1", O_RDONLY)                = 3
close(3)                                        = 0
getcontext(0xFFBFF8D0)
getrlimit(RLIMIT_STACK, 0xFFBFF8B0)             = 0
getpid()                                        = 10715 [10714]
setustack(0xFF3A2088)
open("/tmp/lockfile", O_WRONLY|O_CREAT, 0755)   = 3
getpid()                                        = 10715 [10714]
fcntl(3, F_SETLKW, 0xFFBFFD60)  (sleeping...)
 

The final call, fcntl(), is marked as sleeping, because the function is blocking. This means that the function 
is waiting for something to happen, and the kernel has put the process to sleep until the event occurs. To determine 
what the event is, you must look at fcntl().

The man page for fcntl() (man fcntl) describes the function simply as "file control" on Solaris and 
"manipulate file descriptor" on Linux. In all cases, fcntl() requires a file descriptor, which is an integer 
describing a file the process has opened, a command that specifies the action to be taken on the file descriptor, 
and finally any arguments required for the specific function. In the example in Listing 2, the file descriptor is 3, 
and the command is F_SETLKW. (The 0xFFBFFD60 is a pointer to a data structure, which doesn't concern us now.) 
Digging further, the man page states that F_SETLKW opens a lock on the file and waits until the lock can be obtained.

From the first example involving the open() system call, you saw that a successful call returns a file descriptor. 
In the truss output of Listing 2, there are two cases in which the result of open() returns 3. 
Because file descriptors are reused after they are closed, the relevant open() is the one just above fcntl(), 
which is for /tmp/lockfile. A utility like lsof lists any processes holding open a file. Failing that, 
you could trace through /proc to find the process with the open file. However, as is usually the case, 
a file is locked for a good reason, such as limiting the number of instances of the application or configuring 
the application to run in a user-specific directory.


>> Attaching to a running process

Sometimes, an application is already running when a problem occurs. Being able to run an already-running process 
under truss would be helpful. For example, notice that in the output of the Top application, a certain process 
has been consuming 95 percent of the CPU for quite some time, as shown in Listing 3.


Listing 3. Top output showing a CPU-intensive process

   PID USERNAME LWP PRI NICE  SIZE   RES STATE    TIME    CPU COMMAND
 11063 sean       1   0    0 1872K  952K run     87.9H 94.68% udpsend
 

The -p option to truss allows the owner of the process, or root, to attach to a running process and view 
the system call activity. The process id (PID) is required. In the example shown in Listing 3, the PID is 11063. 
Listing 4 shows the system call activity of the application in question.


Listing 4. truss output after attaching to a running process

$ truss -p 11063:

sendto(3, " a b c", 3, 0, 0xFFBFFD58, 16)       = 3
sendto(3, " a b c", 3, 0, 0xFFBFFD58, 16)       = 3
sendto(3, " a b c", 3, 0, 0xFFBFFD58, 16)       = 3
sendto(3, " a b c", 3, 0, 0xFFBFFD58, 16)       = 3
sendto(3, " a b c", 3, 0, 0xFFBFFD58, 16)       = 3
sendto(3, " a b c", 3, 0, 0xFFBFFD58, 16)       = 3
sendto(3, " a b c", 3, 0, 0xFFBFFD58, 16)       = 3
sendto(3, " a b c", 3, 0, 0xFFBFFD58, 16)       = 3
. repeats ...
 

The sendto() function's man page (man sendto) shows that this function is used to send a message from a socket 
-- typically, a network connection. The output of truss shows the file descriptor (the first 3) and the data 
being sent (abc). Indeed, capturing a sample of network traffic with the snoop or tcpdump tool shows a large amount 
of traffic being directed to a particular host, which is likely not the result of a properly behaving application.

Note that truss was not able to show the creation of file descriptor 3, because you had attached after the descriptor 
was created. This is one limitation of attaching to a running process and the reason why you should gather 
other information using a tool, such as a packet analyzer before jumping to conclusions.

This example might seem somewhat contrived (and technically it was, because I wrote the udpsend application 
to demonstrate how to use truss), but it is based on a real situation. I was investigating a process running 
on a UNIX-based appliance that had a CPU-bound process. Tracing the application showed the same packet activity. 
Tracing with a network analyzer showed the packets were being directed to a host on the Internet. After escalating 
with the vendor, I determined that the problem was their application failing to perform proper error checking 
on a binary configuration file. The file had somehow become corrupted. As a result, the application interpreted 
the file incorrectly and repeatedly hammered a random IP address with User Datagram Protocol (UDP) datagrams. 
After I replaced the file, the process behaved as expected.


>> Filtering output


After a while, you'll get the knack of what to look for. While it's possible to use the grep command to go through 
the output, it's easier to configure truss to focus only on certain calls. This practice is common if you're trying 
to determine how an application works, such as which configuration files the application is using. In this case, 
the open() and stat() system calls point to any files the application is trying to open.

You use open() to open a file, but you use stat() to find information about a file. Often, an application looks for 
a file with a series of stat() calls, and then opens the file it wants.

For truss, you add filtering system calls with the -t option. For strace under Linux, you use -e. In either case, 
you pass a comma-separated list of system calls to be shown on the command line. By prefixing the list with the 
exclamation mark (!), the given calls are filtered out of the output. Listing 5 shows a fictitious application 
looking for a configuration file.


Listing 5. truss output filtered to show only stat() and open() functions

$ truss -tstat,open ./app
stat("/export/home/sean/app", 0xFFBFFBD0)   = 0
open("/var/ld/ld.config", O_RDONLY)             Err#2 ENOENT
stat("/opt/csw/lib/libc.so.1", 0xFFBFF700)      Err#2 ENOENT
stat("/lib/libc.so.1", 0xFFBFF700)              = 0
open("/lib/libc.so.1", O_RDONLY)                = 3
stat("/export/home/sean/.config", 0xFFBFFCF0)   Err#2 ENOENT
stat("/etc/app/configfile", 0xFFBFFCF0)         Err#2 ENOENT
stat("/etc/configfile", 0xFFBFFCF0)             = 0
open("/etc/configfile", O_RDONLY)               = 3
 

The final four lines are the key here. The stat() function for /export/home/sean/.config results in ENOENT, 
which means that the file wasn't found. The code then tries /etc/app/configfile before it finds the correct 
information in /etc/configfile. The significance of first checking in the user's home directory is that you 
can override the configuration by user.


>> Final thoughts

Whether your operating system uses truss, strace, trace, or something else, the ability to peer into an application's 
behavior is a powerful tool for problem solving. The methodology can be summed up as follows:

Describe the problem. 
Trace the application. 
Start at the spot at which the problem occurs and work backward through the system calls to identify the problem. 
Use the man pages for help on interpreting the system calls. 
Correct the behavior and test. 
Tracing application behavior is a powerful troubleshooting tool, because you're observing the system calls 
that the application makes to the operating system. When the usual problem-solving methods fail, turn to 
application tracing.


20.7. snap command on AIX:
--------------------------

The snap command gathers system configuration information and compresses the information into a pax file. 
The information gathered with the snap command may be required to identify and resolve system problems.

In normal conditions, the command "snap -gc" should be sufficient. The pax file will be stored in /tmp/ibmsupt

# snap -gc 

create the following file:

/tmp/ibmsupt/snap.pax.Z


Further info:

snap Command

Purpose

       Gathers system configuration information.

Syntax

       snap [ -a ] [ -A ] [ -b ] [ -B ] [ -c ] [ -C ] [ -D ] [ -f ] [ -g ] [ -G ] [ -i ] [ -k ] [ -l ] [ -L ][ -n ] [ -N ] 
       [ -p ] [ -r ] [ -R  ] [ -s ] [ -S ] [ -t ] [ -T  Filename ] [ -w  ] [ -o OutputDevice ] [ -d Dir ] [ -v Component ] 
       [ -O FileSplitSize ] [ -P Files ]
       [ script1 script2 ... | All | file:filepath ]

       snap [ -a ] [ -A ] [ -b ] [ -B ] [ -c ] [ -C ] [ -D ] [ -f ] [ -g ] [ -G ] [ -i ] [ -k ] [ -l ] [ -L ][ -n ] [ -N ] 
       [ -p ] [ -r ] [ -R  ] [ -s ] [ -S ] [ -t ] [ -T  Filename ] [ -o OutputDevice ] [ -d Dir ] [ -v Component ] 
        [ -O FileSplitSize ] [ -P Files ] [
       script1 script2 ... | All | file:filepath ]

       snap -e [ -m Nodelist ] [ -d Dir ]

Description

       The snap command gathers system configuration information and compresses the information into a pax file. The file may then be
       written to a device such as tape or DVD, or transmitted to a remote system. The information gathered with the snap command might be
       required to identify and resolve system problems. Note: Root user authority is required to execute the snap command. Use the snap -o
       /dev/cd0 command to copy the compressed image to DVD. Use the snap -o /dev/rmt0 command to copy the image to tape.

       Use the snap -o /dev/rfd0 command to copy the compressed image to diskette. Use the snap -o /dev/rmt0 command to copy the image to
       tape.

       Approximately 8MB of temporary disk space is required to collect all system information, including contents of the error log. If you
       do not gather all system information with the snap -a command, less disk space may be required (depending on the options selected).
       Note: If you intend to use a tape to send a snap image to IBM(R) for software support, the tape must be one of the following formats:
       *    8mm, 2.3 Gb capacity
       *    8mm, 5.0 Gb capacity
       *    4mm, 4.0 Gb capacity

       Using other formats prevents or delays IBM software support from being able to examine the contents.

       The snap -g command gathers general system information, including the following:
       *    Error report
       *    Copy of the customized Object Data Manager (ODM) database
       *    Trace file
       *    User environment
       *    Amount of physical memory and paging space
       *    Device and attribute information
       *    Security user information

       The output of the snap -g command is written to the /tmp/ibmsupt/general/general.snap file.

       The snap command checks for available space in the /tmp/ibmsupt directory, the default directory for snap command output. You can
       write the output to another directory by using the -d flag. If there is not enough space to hold the snap command output, you must
       expand the file system.

       Each execution of the snap command appends information to previously created files. Use the -r flag to remove previously gathered and
       saved information.

       Flags:

       -a
            Gathers all system configuration information. This option requires approximately 8MB of temporary disk space.
       -A
            Gathers asynchronous (TTY) information.
       -b
            Gathers SSA information.
       -B
            Bypasses collection of SSA adapter dumps. The -B flag only works when the -b flag is also specified; otherwise, the -B flag is
            ignored.
       -c
            Creates a compressed pax image (snap.pax.Z file) of all files in the /tmp/ibmsupt directory tree or other named output
            directory. Note: Information not gathered with this option should be copied to the snap directory tree before using the -c flag.
            If a test case is needed to demonstrate the system problem, copy the test case to the /tmp/ibmsupt/testcase directory before
            compressing the pax file.
       -C
            Retrieves all the files in the fwdump_dir directory. The files are placed in the "general" subdirectory. The -C snap option
            behaves the same as -P*.
       -D
            Gathers dump and /unix information. The primary dump device is used. Notes:
              1    If bosboot -k was used to specify the running kernel to be other than /unix, the incorrect kernel is gathered. Make sure
                   that /unix is , or is linked to, the kernel in use when the dump was taken.
              2    If the dump file is copied to the host machine, the snap command does not collect the dump image in the /tmp/ibmsupt/dump
                   directory. Instead, it creates a link in the dump directory to the actual dump image.
       -d AbsolutePath
            Identifies the optional snap command output directory (/tmp/ibmsupt is the default). You must specify the absolute path.
       -e
            Gathers HACMP(TM) specific information. Note: HACMP specific data is collected from all nodes belonging to the cluster . This
            flag cannot be used with any other flags except -m and -d.
       -f
            Gathers file system information.
       -g
            Gathers the output of the lslpp -hac command, which is required to recreate exact operating system environments. Writes output
            to the /tmp/ibmsupt/general/lslpp.hBc file. Also collects general system information and writes the output to the
            /tmp/ibmsupt/general/general.snap file.
       -G
            Includes predefined Object Data Manager (ODM) files in general information collected with the -g flag.
       -i
            Gathers installation debug vital product data (VPD) information.


strace example on Linux:
------------------------

One main trace utility on most Linux distro's, is the "strace" command.
You can use it with many parameters, but the "-o outputfile" is very important, in order to save the output to a file.

Use it like:

# strace -o logfile <command_or_program_you_want_to_trace> 

Because strace will show you the systemcalls and signals, you can use it to reveal whether a program cannot
find a file, or does not have permissions to read (or write to) a file. In such a case, a program might fail.

Example:

Suppose we have a file called "/etc/security.conf". Now we run a utility to read the file (like cat, pg, more, less etc..)
as a normal user, which user does not have permissions to read the file. Let's trace that event to a logfile, and see
what we can discover.

$ strace -o strace_example.log less /etc/security.conf

A trace file can get pretty long, but you should just browse it and be alert on what seems to be an error reported.
So, if we take a look in the logfile "strace_example.log"

..
..
open("/etc/security.conf", O_RDONLY|O_LARGEFILE) = -1 EACCES (Permission denied)
write(2, "/etc/security.conf: Permission denied\n", 32) = 32
..
..

We can clearly see, that our program failed due to lack of read permission.


=============
21. Logfiles:
=============


21.1 Solaris:
=============

Unix message files record all system problems like disk errors, swap errors, NFS problems, etc. 
Monitor the following files on your system to detect system problems: 

  tail -f /var/adm/syslog
  tail -f /var/adm/messages
  tail -f /var/log/syslog

You can also use the dmesg command.
Messages are recorded by the syslogd demon.

Diagnostics can be done from the OK prompt after a reboot, like probe-scsci, show-devs, show-disks, test memory etc..
You can also use SunVTS tool to run diagnostics. SunVTS is Suns's Validation Test package.

System dumps:
You can manage system dumps by using the dumpadm command.


Userlogins are recorded in /var/adm/utmpx
Solaris 8,9 does not use wtmp or utmp


Logfiles:
---------

/var/adm/messages
The syslogd daemon logs its findings into this file

/var/adm/lastlog
This file holds the most recent login time for each user of the system

/var/adm/utmpx
This database file contains user access and accounting information for commands such as
who, write, login. The utmpx file is where information such as the terminal and login time
are stored, and if you use the who command, it will retrieve that information.

/var/adm/wtmpx
This file contains the history of user access and accounting information, for the utmpx database.
The "last" command will use this file, to show you the historical login and logout info, since the last reboot.

/var/adm/sulog
This file shows you which users has used the su command, to switch to another user.

/var/adm/acct
If accounting is enabled, accounting information is recorded in that file.

/var/adm/loginlog
If it is important for you to track whether users are trying to log in to your user accounts, 
you can create a /var/adm/loginlog file with read and write permissions for root only. After you create the loginlog file, 
all failed login activity is written to this file automatically after five failed attempts. The five-try limit avoids recording 
failed attempts that are the result of typographical errors.

The loginlog file contains one entry for each failed attempt. Each entry contains the user's login name, 
tty device, and time of the attempt.


AIX:
----

Periodical the following files have to be decreased in size. You can use cat /dev/null command

Example: cat /dev/null >/var/adm/sulog

/var/adm/sulog 
/var/adm/cron/log 
/var/adm/wtmp 
/etc/security/failedlogin 

Notes about the errorlog, thats the file /var/adm/ras/errlog.

Do NOT use cat /dev/null to clear the errorlog. 
Use instead the following procedure:

# /usr/lib/errstop   (stop the error daemon)
move the errlog file
# /usr/lib/errstart  (start the error daemon)


errdemon:
---------

On most UNIX systems, information and errors from system events and processes are managed by the 
syslog daemon (syslogd); depending on settings in the configuration file /etc/syslog.conf, messages are passed 
from the operating system, daemons, and applications to the console, to log files, or to nowhere at all. 
AIX includes the syslog daemon, and it is used in the same way that other UNIX-based operating systems use it. 
In addition to syslog, though, AIX also contains another facility for the management of hardware, operating system, 
and application messages and errors. This facility, while simple in its operation, provides unique and valuable 
insight into the health and happiness of an AIX system.

The AIX error logging facility components are part of the bos.rte and the bos.sysmgt.serv_aid packages, 
both of which are automatically placed on the system as part of the base operating system installation. 

Unlike the syslog daemon, which performs no logging at all in its default configuration as shipped, 
the error logging facility requires no configuration before it can provide useful information about the system. 
The errdemon is started during system initialization and continuously monitors the special file /dev/error 
for new entries sent by either the kernel or by applications. The label of each new entry is checked 
against the contents of the Error Record Template Repository, and if a match is found, additional information 
about the system environment or hardware status is added, before the entry is posted to the error log.

The actual file in which error entries are stored is configurable; the default is /var/adm/ras/errlog. 
That file is in a binary format and so should never be truncated or zeroed out manually. The errlog file 
is a circular log, storing as many entries as can fit within its defined size. A memory buffer is set 
by the errdemon process, and newly arrived entries are put into the buffer before they are written to the log 
to minimize the possibility of a lost entry. The name and size of the error log file and the size of the memory buffer 
may be viewed with the errdemon command:


[aixhost:root:/] # /usr/lib/errdemon -l
Error Log Attributes
--------------------------------------------
Log File                /var/adm/ras/errlog
Log Size                1048576 bytes
Memory Buffer Size      8192 bytes

The parameters displayed may be changed by running the errdemon command with other flags, documented 
in the errdemon man page. The default sizes and values have always been sufficient on our systems, 
so I've never had reason to change them.

Due to use of a circular log file, it is not necessary (or even possible) to rotate the error log. 
Without intervention, errors will remain in the log indefinitely, or until the log fills up with new entries. 
As shipped, however, the crontab for the root user contains two entries that are executed daily, 
removing hardware errors that are older than 90 days, and all other errors that are older than 30 days.


0 11  *  *  * /usr/bin/errclear -d S,O 30
0 12  *  *  * /usr/bin/errclear -d H 90


The errdemon deamon constantly checks the /dev/error special file, and when new data
is written, the deamon conducts a series of operations.

- To determine the path to your system's error logfile, run the command:
# /usr/lib/errdemon -l
Error Log Attributes
Log File          /var/adm/ras/errlog
Log Size          1048576 bytes
Memory            8192 bytes

- To change the maximum size of the error log file, enter:
# /usr/lib/errdemon -s 200000


You can generate the error reports using smitty or through the errpt command.

# smitty errpt       gives you a dialog screen where you can select types of information.

# errpt -a
# errpt - d H

# errpt -a|pg      Produces a detailed report for each entry in the error log 
# errpt -aN hdisk1 Displays an error log for ALL errors occurred on this drive. If more than a few errors 
                   occur within a 24 hour period, execute the CERTIFY process under DIAGNOSTICS to determine 
                   if a PV is becoming marginal. 
 

If you use the errpt without any options, it generates a summary report. 
If used with the -a option, a detailed report is created.
You can also display errors of a particular class, for example for the Hardware class.

Examples using errpt:
---------------------

To display a complete summary report, enter: 

errpt
To display a complete detailed report, enter: 
errpt  -a

To display a detailed report of all errors logged for the error identifier E19E094F, enter: 
errpt  -a  -j E19E094F

To display a detailed report of all errors logged in the past 24 hours, enter: 
errpt  -a  -s mmddhhmmyy

where the mmddhhmmyy string equals the current month, day, hour, minute, and year, minus 24 hours. 
To list error-record templates for which logging is turned off for any error-log entries, enter: 
errpt  -t  -F log=0

To view all entries from the alternate error-log file /var/adm/ras/errlog.alternate, enter: 
errpt  -i /var/adm/ras/errlog.alternate

To view all hardware entries from the alternate error-log file /var/adm/ras/errlog.alternate, enter: 
errpt  -i /var/adm/ras/errlog.alternate -d H

To display a detailed report of all errors logged for the error label ERRLOG_ON, enter: 
errpt  -a  -J ERRLOG_ON

To display a detailed report of all errors and group duplicate errors, enter: 

errpt -aD
To display a detailed report of all errors logged for the error labels DISK_ERR1 and DISK_ERR2 during 
the month of August, enter: 
errpt -a -J DISK_ERR1,DISK_ERR2 -s 0801000004 -e 0831235904"

errclear:

Deletes entries in the error log

Example: errclear 0 (Truncates the errlog to 0 bytes)


Example errorreport:
--------------------

Example 1:
----------

P550:/home/reserve $ errpt

IDENTIFIER TIMESTAMP  T C RESOURCE_NAME  DESCRIPTION
0EC00096   0130224507 P U SYSPFS         STORAGE SUBSYSTEM FAILURE
0EC00096   0130224007 P U SYSPFS         STORAGE SUBSYSTEM FAILURE
0EC00096   0130224007 P U SYSPFS         STORAGE SUBSYSTEM FAILURE
0EC00096   0130223507 P U SYSPFS         STORAGE SUBSYSTEM FAILURE
F7DDA124   0130223507 U H LVDD           PHYSICAL VOLUME DECLARED MISSING
52715FA5   0130223507 U H LVDD           FAILED TO WRITE VOLUME GROUP STATUS AREA
CAD234BE   0130223507 U H LVDD           QUORUM LOST, VOLUME GROUP CLOSING
613E5F38   0130223507 P H LVDD           I/O ERROR DETECTED BY LVM
613E5F38   0130223507 P H LVDD           I/O ERROR DETECTED BY LVM
613E5F38   0130223507 P H LVDD           I/O ERROR DETECTED BY LVM
0873CF9F   0130191907 T S pts/4          TTYHOG OVER-RUN
0EC00096   0130162407 P U SYSPFS         STORAGE SUBSYSTEM FAILURE
51E537B5   0130161807 P H sysplanar0     platform_dump saved to file
291D64C3   0130161807 I H sysplanar0     platform_dump indicator event
291D64C3   0130161807 I H sysplanar0     platform_dump indicator event
BFE4C025   0130161807 P H sysplanar0     UNDETERMINED ERROR
51E537B5   0130161707 P H sysplanar0     platform_dump saved to file
291D64C3   0130161707 I H sysplanar0     platform_dump indicator event
291D64C3   0130161707 I H sysplanar0     platform_dump indicator event
51E537B5   0130161707 P H sysplanar0     platform_dump saved to file
291D64C3   0130161707 I H sysplanar0     platform_dump indicator event
291D64C3   0130161707 I H sysplanar0     platform_dump indicator event
BFE4C025   0130161607 P H sysplanar0     UNDETERMINED ERROR
BFE4C025   0130161407 P H sysplanar0     UNDETERMINED ERROR
BFE4C025   0130161307 P H sysplanar0     UNDETERMINED ERROR
BFE4C025   0130161307 P H sysplanar0     UNDETERMINED ERROR
BFE4C025   0130161207 P H sysplanar0     UNDETERMINED ERROR
BFE4C025   0130161207 P H sysplanar0     UNDETERMINED ERROR
0EC00096   0130161207 P U SYSPFS         STORAGE SUBSYSTEM FAILURE
BFE4C025   0130161107 P H sysplanar0     UNDETERMINED ERROR
D2A1B43E   0130161107 P U SYSPFS         FILE SYSTEM CORRUPTION
D2A1B43E   0130161107 P U SYSPFS         FILE SYSTEM CORRUPTION
CD546B25   0130161107 I O SYSPFS         FILE SYSTEM RECOVERY REQUIRED
CD546B25   0130161107 I O SYSPFS         FILE SYSTEM RECOVERY REQUIRED
1ED0A744   0130161107 P U SYSPFS         FILE SYSTEM LOGGING SUSPENDED
CD546B25   0130161107 I O SYSPFS         FILE SYSTEM RECOVERY REQUIRED
D2A1B43E   0130161107 P U SYSPFS         FILE SYSTEM CORRUPTION
1ED0A744   0130161107 P U SYSPFS         FILE SYSTEM LOGGING SUSPENDED
F7DDA124   0130161107 U H LVDD           PHYSICAL VOLUME DECLARED MISSING
52715FA5   0130161107 U H LVDD           FAILED TO WRITE VOLUME GROUP STATUS AREA
CAD234BE   0130161107 U H LVDD           QUORUM LOST, VOLUME GROUP CLOSING
613E5F38   0130161107 P H LVDD           I/O ERROR DETECTED BY LVM
EAA3D429   0130161107 U S LVDD           PHYSICAL PARTITION MARKED STALE
613E5F38   0130161107 P H LVDD           I/O ERROR DETECTED BY LVM
613E5F38   0130161107 P H LVDD           I/O ERROR DETECTED BY LVM
41BF2110   0130161107 U H LVDD           MIRROR WRITE CACHE WRITE FAILED
613E5F38   0130161107 P H LVDD           I/O ERROR DETECTED BY LVM
CAD234BE   0130161107 U H LVDD           QUORUM LOST, VOLUME GROUP CLOSING
F7DDA124   0130161107 U H LVDD           PHYSICAL VOLUME DECLARED MISSING
41BF2110   0130161107 U H LVDD           MIRROR WRITE CACHE WRITE FAILED
613E5F38   0130161107 P H LVDD           I/O ERROR DETECTED BY LVM
6472E03B   0130161107 P H sysplanar0     EEH permanent error for adapter
FEC31570   0130161107 P H sisscsia2      UNDETERMINED ERROR
C14C511C   0130161107 T H scsi5          ADAPTER ERROR
BFE4C025   0130161107 P H sysplanar0     UNDETERMINED ERROR
FE2DEE00   0130144307 P S SYSXAIXIF      DUPLICATE IP ADDRESS DETECTED IN THE NET
FE2DEE00   0130143207 P S SYSXAIXIF      DUPLICATE IP ADDRESS DETECTED IN THE NET
B6048838   0129100507 P S SYSPROC        SOFTWARE PROGRAM ABNORMALLY TERMINATED
B6048838   0129100307 P S SYSPROC        SOFTWARE PROGRAM ABNORMALLY TERMINATED


You might create a script called alert.sh and call it from your .profile

#!/usr/bin/ksh
cd ~
rm -rf /root/alert.log
echo "Important alerts in errorlog: " >> /root/alert.log
errpt | grep -i STORAGE >> /root/alert.log
errpt | grep -i QUORUM >> /root/alert.log
errpt | grep -i ADAPTER >> /root/alert.log
errpt | grep -i VOLUME >> /root/alert.log
errpt | grep -i PHYSICAL >> /root/alert.log
errpt | grep -i STALE >> /root/alert.log
errpt | grep -i DISK >> /root/alert.log
errpt | grep -i LVM >> /root/alert.log
errpt | grep -i LVD >> /root/alert.log
errpt | grep -i UNABLE >> /root/alert.log
errpt | grep -i USER >> /root/alert.log
errpt | grep -i CORRUPT >> /root/alert.log
cat /root/alert.log


if [ `cat alert.log|wc -l` -eq 1 ]
then
   echo "No critical errors found."
fi

echo " "
echo "Filesystems that might need attention, e.g. %used:"
df -k |awk '{print $4,$7}' |grep -v "Filesystem"|grep -v tmp  > /tmp/tmp.txt
cat /tmp/tmp.txt | sort -n | tail -3


Example 2:
----------

IDENTIFIER TIMESTAMP  T C RESOURCE_NAME  DESCRIPTION
173C787F   0710072007 I S topsvcs        Possible malfunction on local adapter
90D3329C   0710072007 P S topsvcs        NIM read/write error
AE3E3FAD   0710064907 I O SYSJ2          FSCK FOUND ERRORS
AE3E3FAD   0710064907 I O SYSJ2          FSCK FOUND ERRORS
AE3E3FAD   0710064907 I O SYSJ2          FSCK FOUND ERRORS
AE3E3FAD   0710064907 I O SYSJ2          FSCK FOUND ERRORS
AE3E3FAD   0710064907 I O SYSJ2          FSCK FOUND ERRORS
AE3E3FAD   0710064907 I O SYSJ2          FSCK FOUND ERRORS
AE3E3FAD   0710064907 I O SYSJ2          FSCK FOUND ERRORS
C1348779   0710061107 I O SYSJ2          LOG I/O ERROR
C1348779   0710061107 I O SYSJ2          LOG I/O ERROR
C1348779   0710061107 I O SYSJ2          LOG I/O ERROR
EAA3D429   0710061007 U S LVDD           PHYSICAL PARTITION MARKED STALE


IDENTIFIER TIMESTAMP  T C RESOURCE_NAME  DESCRIPTION
12337A8D   0723152107 T S DR_KER_MEM     Affected memory not available for DR rem


Some notes on disk related errors:
----------------------------------

DISK_ERR4 is bad block relocation. Not a serious error. 
DISK_ERR2 is a hardware error as opposed to a media or corrected read error on disk. This could be serious.


EAA3D429   0121151108 U S LVDD           PHYSICAL PARTITION MARKED STALE


Note 1:
-------

thread 1:

Q:

Has anyone seen these errors before? We're running 6239 fc cards on a 
CX600. AIX level is 52-03 with the latest patches for devices.pci.df1000f7 
as well. 


I didn't know that these adapters still used devices.pci.df1000f7 as part 
of their device driver set, but aparently they do. We're mostly seeing 
ERR4s on bootup and occassionaly throughout the day. They're TEMP but 
should I be concerned about this? Any help would be greatly appreciated! 

LABEL: SC_DISK_ERR4 
IDENTIFIER: DCB47997 

A:

DISK_ERR_4 are simply bad-block relocation errors. They are quite normal. 
However, I heard that if you get more than 8 in an 8-hour period, you 
should get the disk replaced as it is showing signs of impending failure. 


thread 2:

Q:

> Has anyone corrected this issue? SC_DISK_ERR2 with EMC Powerpath = 
> filesets listed below? I am using a CX-500.=20 
> 


A:

 got those errors before using a CX700 and it turned out to be a 
firmware problem on the fibre adapter, model 6259. EMC recommended the 
92X1 firmware and to find out IBM found problems with timeouts to the 
drives and recommended going back a level to 81X1. 

A:

We have the same problem as well. EMC say its a firmware error on the 
FC adapters

A:

This is how to fix these errors, downgrading firware is not recommended. 

Correcting SCSI_DISK_ERR2's in the AIX Errpt Log - Navisphere Failover 
Wizard 

1. In the Navisphere main screen, select tools and then click the 
Failover Setup Wizard. Click next to continue. 

2. From the drop-down list select the host server you wish to 
modify and click next 

3. Highlight the CX-500 and click next 

4. Under the specify settings box be sure to select 1 for the 
failover setting and disable for array commpath. Click next to process. 
5. The next screen is the opportunity to review your selections 
(host, failover mode and array commpath); click next to commit 
6. The following screen displays a warning message to alert you are 
committing these changes. Click yes to process. 

7. Next login to the AIX command prompt as root and perform the 
following commands to complete stopping the SCSI_DISK_ERR2. 
a. lsdev -Cc disk | grep LUNZ 

(Filter for disks with LUNZ in the description) 
b. rmdev -dl hdisk(#)'s 

(Note the disks and remove them from the ODM) 
c. errclear 0 
(Clear the AIX system error log) 
d. cfgmgr -v 
(Attempt to re-add the LUNZ disks) 
e. lsdev -Cc disk | grep LUNZ 
(Double check to make sure the LUNZ disk does not add itself back to the 
system after the cfgmgr command) 
f. errpt -a 

(Monitor the AIX error log to insure the SCSI_DISK_ERR2's are gone) 
Task Complete... 


E87EF1BE   0512150008 P O dumpcheck      The largest dump device is too small.
------------------------------------------------------------------------------


Problems with errpt:
--------------------

Invalid log, or other problems

thread 1:

Q:

Hello ...

the 'errpt' Command tells me:

0315-180 logread: UNEXPECTED EOF 0315-171 Unable to process the error log file
/var/adm/ras/errlog. 0315-132 The supplied error log is not valid:
/var/adm/ras/errlog.

# ls -l /var/adm/ras/errlog
-rw-r--r-- 1 root system 0 Jun 14 17:31 /var/adm/ras/errlog

How can I fix this problem?

A:

/usr/lib/errstop           # stop logging

rm /var/adm/ras/errlog     # get rid of that log.

/usr/lib/errdemon          # restart the daemon, creating a new error log.


Some err identifiers that can sometimes be hard to trace to their true sources:
===============================================================================

Take a look at those errpt entries:


--------------------------------------------------------------------------


ERRPT ENTRY 1:
--------------

LABEL:          CORE_DUMP 
IDENTIFIER:     C69F5C9B 

Date/Time:       Thu Jan 15 02:00:45 MET 2009 
Sequence Number: 999 
Machine Id:      00CC94EE4C00 
Node Id:         srv1 
Class:           S 
Type:            PERM 
Resource Name:   SYSPROC 

Description 
SOFTWARE PROGRAM ABNORMALLY TERMINATED 

Probable Causes 
SOFTWARE PROGRAM 

User Causes 
USER GENERATED SIGNAL 

        Recommended Actions 
        CORRECT THEN RETRY 

Failure Causes 
SOFTWARE PROGRAM 

        Recommended Actions 
        RERUN THE APPLICATION PROGRAM 
        IF PROBLEM PERSISTS THEN DO THE FOLLOWING 
        CONTACT APPROPRIATE SERVICE REPRESENTATIVE 

Detail Data 
SIGNAL NUMBER 
          11 
USER'S PROCESS ID: 
               1298680 
FILE SYSTEM SERIAL NUMBER 
          57 
INODE NUMBER 
       37134 
CORE FILE NAME 
/var/core/core.1298680.15010044 
PROGRAM NAME 
BS_sear 
STACK EXECUTION DISABLED 
           0 
COME FROM ADDRESS REGISTER 

PROCESSOR ID 
  hw_fru_id: 1 
  hw_cpu_id: 9 

ADDITIONAL INFORMATION 
?? 
?? 
Unable to generate symptom string. 


  (or as another example of the last lines, where you can see the "program name")

  PROGRAM NAME 
  opmn 
  STACK EXECUTION DISABLED 
           0 
  COME FROM ADDRESS REGISTER 

  PROCESSOR ID 
    hw_fru_id: 0 
    hw_cpu_id: 2 

  ADDITIONAL INFORMATION 
  strlen 0 
  pmStrdup 14 

  Symptom Data 
  REPORTABLE 
  1 
  INTERNAL ERROR   
  0 
  SYMPTOM CODE 
  PCSS/SPI2 FLDS/opmn SIG/11 FLDS/strlen VALU/0 FLDS/pmStrdup 
  

--------------------------------------------------------------------------

POSSIBLE EXPLANATION:
=====================

http://publib.boulder.ibm.com/infocenter/systems/index.jsp?topic=/com.ibm.aix.security/doc/security/stack_exec_disable.htm

AIXr has enabled the stack execution disable (SED) mechanism to disable the execution of code on a stack 
and select data areas of a process.

By disabling the execution and then terminating, an infringing program, the attacker is prevented 
from gaining root user privileges through a buffer overflow attack. While this feature does not stop 
buffer overflows, it provides protection by disabling the execution of attacks on buffers that have been overflowed.

Beginning with the POWER4T family of processors, you can use a page-level execution enable and/or disable feature 
for the memory. The AIX SED mechanism uses this underlying hardware support for implementing a 
no-execution feature on select memory areas. Once this feature is enabled, the operating system checks 
and flags various files during the executable programs. It then alerts the operating system memory manager 
and the process managers that the SED is enabled for the process being created. The select memory areas 
are marked for no-execution. If any execution occurs on these marked areas, the hardware raises 
an exception flag and the operating system stops the corresponding process. The exception and application 
termination details are captured through the AIX error log events.

SED is implemented mainly through the sedmgr command. The sedmgr command permits control 
of the systemwide SED mode of operation as well as setting the executable file based SED flags.

SED modes and monitoring
The stack execution disable (SED) mechanism in AIXr is implemented through systemwide mode flags, 
as well as individual executable file-based header flags.

While systemwide flags control the systemwide operation of the SED, file level flags indicate 
how files should be treated in SED. The buffer overflow protection (BOP) mechanism provides 
for four systemwide modes of operation:

-- off 
The SED mechanism is turned off and no process is marked for SED protection. 
--select 
Only a select set of files are enabled and monitored for SED protection. The select set of files 
are chosen by reviewing the SED related flags in the executable program binary headers. 
The executable program header enables SED related flags to request to be included in the select mode. 
-- setidfiles 
Permits you to enable SED, not only for the files requesting such a mechanism, but all the important 
setuid and setgid system files. In this mode, the operating system not only provides SED for the files 
with the request SED flag set, but also enables SED for the executable files with the following 
characteristics (except the files marked for exempt in their file headers):
 .SETUID files owned by root 
 .SETGID files with primary group as system or security 
-- all 
All executable programs loaded on the system are SED protected except for the files requesting 
an exemption from SED mode. Exemption related flags are part of the executable program headers. 
The SED feature on AIX also provides the ability to monitor instead of stopping the process when 
an exception happens. This systemwide control permits a system administrator to check for breakdowns 
and issues in the system environment by monitoring it before the SED is deployed in the production systems. 

The sedmgr command provides an option that permits you to enable SED to monitor files instead 
of stopping the processes when exceptions occur. The system administrator can evaluate whether 
an executable program is doing any legitimate stack execution. This setting works in conjunction 
with the systemwide mode set using the -c option. When the monitor mode is turned on, the system permits 
the process to continue operating even if an SED-related exception occurs. Instead of stopping the process, 
the operating system logs the exception in the AIX error log. If SED monitoring is off, 
the operating system stops any process that violates and raises an exception per SED facility.

Any changes to the SED mode systemwide flags requires that you restart the system for the changes 
to take effect. All of these types of events are audited.


--------------------------------------------------------------------------

ERRPT ENTRY 2:
--------------

LABEL:          SRC 
IDENTIFIER:     E18E984F 

Date/Time:       Fri Jan 16 09:31:33 MET 2009 
Sequence Number: 1513 
Machine Id:      00C503AC4C00 
Node Id:         heilbot 
Class:           S 
Type:            PERM 
Resource Name:   SRC 

Description 
SOFTWARE PROGRAM ERROR 

Probable Causes 
APPLICATION PROGRAM 

Failure Causes 
SOFTWARE PROGRAM 

        Recommended Actions 
        PERFORM PROBLEM RECOVERY PROCEDURES 

Detail Data 
SYMPTOM CODE 
           0 
SOFTWARE ERROR CODE 
       -9053 
ERROR CODE 
           2 
DETECTING MODULE 
'tellsrc.c'@line:'87' 
FAILING MODULE 

Duplicates 
Number of duplicates 
           3 
Time of first duplicate 
Fri Jan 16 09:31:18 MET 2009 
Time of last duplicate 
Fri Jan 16 09:31:33 MET 2009 


POSSIBLE EXPLANATIONS:
======================

In entry 2, we see the identifier E18E984F, and "SOFTWARE ERROR CODE -9053", and "Detecting module tellsrc.c@line:87".
tellsrc.c'@line:'87'


http://www-01.ibm.com/support/docview.wss?uid=isg1IZ03064

IZ03064: VARYONVG -C FAILS WITH "GSCHILD:CANNOT REGISTER WITH DRIVER APPLIES TO AIX 5300-07


APAR status
Closed as program error.

Error description 
"varyonvg -c" fails to varyon concurrent volume group and
reports the following error message:

tellclvmd: request failed rc = -9014 [UNKNOWN rc]
0516-1334 varyonvg: The command /usr/sbin/tellclvmd
   returned an error.


errpt logs following entry:

LABEL:          SRC
IDENTIFIER:     E18E984F
Class:           S
Type:            PERM
Resource Name:   SRC

Description
SOFTWARE PROGRAM ERROR

Probable Causes
APPLICATION PROGRAM

Failure Causes
SOFTWARE PROGRAM

        Recommended Actions
        PERFORM PROBLEM RECOVERY PROCEDURES

Detail Data
SYMPTOM CODE
           0
SOFTWARE ERROR CODE
       -9053
ERROR CODE
          74
DETECTING MODULE
'srcmstr.c'@line:'529'
FAILING MODULE
Local fix 
This problem occurs when multiple "varyonvg -nc"
commands are performed together. By serializing
these commands, this can be avoided.
Problem summary 
Multiple varyonvg -c processes will all create threads in
the gsclvmd daemon.  With certain timing, these threads can
interfere with eachothers global variables and possibly cause
varyonvg to fail.
Problem conclusion 
Privatize variables so mutliple vgs coming online can't
interfere with eachother.
Temporary fix 
Comments 
5200-10 - use AIX APAR IZ05735
5300-06 - use AIX APAR IZ02334
5300-07 - use AIX APAR IZ03064
APAR information 
APAR number IZ03064 
Reported component name AIX 5.3 
Reported component ID 5765G0300 
Reported release 530 
Status CLOSED PER 
PE NoPE 
HIPER NoHIPER 
Submitted date 2007-08-14 
Closed date 2007-09-04 
Last modified date 2007-12-06 

APAR is sysrouted FROM one or more of the following:

APAR is sysrouted TO one or more of the following:

Publications Referenced


Fix information 
Fixed component name AIX 5.3 
Fixed component ID 5765G0300 


--------------------------------------------------------------------------


diag command:
-------------

Whenever a hardware problem occurs in AIX, use the diag command to diagnose the problem.

The diag command is the starting point to run a wide choice of tasks and service aids. 
Most of the tasks/service aids are platform specific. 

To run diagnostics on the scdisk0 device, without questions, enter:

# diag -d scdisk0 -c


System dumps:
-------------

A system dump is created when the system has an unexpected system halt or system failure.
In AIX 5L the default dump device is /dev/hd6, which is also the default paging device.
You can use the sysdumpdev command to manage system crash dumps.

The sysdumpdev command changes the primary or secondary dump device designation in a system that is running. 
The primary and secondary dump devices are designated in a system configuration object. 
The new device designations are in effect until the sysdumpdev command is run again, or the system is restarted.

If no flags are used with the sysdumpdev command, the dump devices defined in the SWservAt 
ODM object class are used. The default primary dump device is /dev/hd6. The default secondary dump device is 
/dev/sysdumpnull.


Examples
To display current dump device settings, enter: 
sysdumpdev  -l

To designate logical volume hd7 as the primary dump device, enter: 
sysdumpdev  -p /dev/hd7

To designate tape device rmt0 as the secondary dump device, enter: 
sysdumpdev  -s /dev/rmt0

To display information from the previous dump invocation, enter: 
sysdumpdev  -L

To permanently change the database object for the primary dump device to /dev/newdisk1, enter: 
sysdumpdev  -P  -p /dev/newdisk1

To determine if a new system dump exists, enter: 
sysdumpdev  -z

If a system dump has occurred recently, output similar to the following will appear: 

4537344 /dev/hd7
To designate remote dump file /var/adm/ras/systemdump on host mercury for a primary dump device, enter: 
sysdumpdev  -p mercury:/var/adm/ras/systemdump

A : (colon) must be inserted between the host name and the file name. 
To specify the directory that a dump is copied to after a system crash, if the dump device is /dev/hd6, enter: 
sysdumpdev  -d /tmp/dump

This attempts to copy the dump from /dev/hd6 to /tmp/dump after a system crash. If there is an error during the copy, 
the system continues to boot and the dump is lost. 
To specify the directory that a dump is copied to after a system crash, if the dump device is /dev/hd6, enter: 
sysdumpdev  -D /tmp/dump

This attempts to copy the dump from /dev/hd6 to the /tmp/dump directory after a crash. If the copy fails, 
you are prompted with a menu that allows you to copy the dump manually to some external media.


Starting a system dump:
-----------------------

If you have the Software Service Aids Package installed, you have access to the sysdumpstart command.
You can start the system dump by entering:
# sysdumpstart -p

You can also use:
# smit dump

Notes regarding system dumps:
-----------------------------

note 1:
-------

The_Nail <tomapam@gmail.com> wrote: 
> I handle several AIX 5.1 servers and some of them warns me (via errpt) 
> about a lack of disk space for the dumpcheck ressource. 
> Here is a copy of the message : 

> 
> Description 
> The copy directory is too small. 
> 
> Recommended Actions 
> Increase the size of that file system. 
> 
> Detail Data 
> File system name 
> /var/adm/ras 
> 
> Current free space in kb 
> 7636 
> Current estimated dump size in kb 
> 207872 


> I guess /dev/hd6 is not big enough to contain a system dump. So how 
> can i change that? 


The error message tells you something else. 
Read it, and you will understand! 


> How can i configure a secondary susdump space in case the primary 
> would be unavailable? 


sysdumpdev -s /dev/whatever 


> What does "copy directory /var/adm/ras" mean? 


That's where the crash dump will be put when you reboot after the crash. 
/dev/hd6 will be needed for other purposes (paging space), so you cannot 
keep your system dump there. 


And that file system is too small to contain the dump, that's the meaning 
of the error message. 


You have two options: 


- increase the /var file system (it should have ample free space anyway). 
- change the dump directory to something where you have more space: 
  sysdumpdev -D /something/in/rootvg/with/free/space 


Yours, 
Laurenz Albe 


Note 2:
-------

Suppose you find the following error:

$ errpt
IDENTIFIER TIMESTAMP  T C RESOURCE_NAME  DESCRIPTION
F89FB899   0822150005 P O dumpcheck      The copy directory is too small

This message is the result of a dump device check. You can fix this by 
increasing the size of your dump device. If you are using the default 
dump device (/dev/hd6) then increase your paging size or go to smit dump 
and "select System Dump Compression". Myself, I don't like to use the 
default dump device so I create a sysdumplv and make sure I have enough 
space. To check space needed go to smit dump and select "Show Estimated 
Dump Size" this will give you an idea about the size needed.

The copy directory is whatever sysdumpdev says it is.
Run sysdumpdev and you will get something like
#sysdumpdev
primary              /dev/hd6
secondary            /dev/sysdumpnull
copy directory       /var/adm/ras
forced copy flag     TRUE
always allow dump    FALSE
dump compression     ON

# sysdumpdev -e
0453-041 Estimated dump size in bytes: 57881395
Divide this number by 1024.  This is the free space that is needed in 
your copy directory.  Compare it to a df -k or divide this number by 
512.  This is the free space that is needed in your copy directory.  
Compare it to a df

HP:
---


22. Diagnostic output:
======================

0:Standard input  1: Standard output  2: Diagnostic output

redirect diag. outp. to file
# cat somefile nofile 2>errfile              
# cat somefile nofile > outfile 2>errfile

redirect diag. outp. to same place as standard outp.
# cat firsthalf secondhalf > composite 2>1&  


23. DOS2UNIX:
=============

If you want to convert a ascii PC file to unix, you can use many tools like tr etc..

# tr -d '\r' < original.file > new.file

# tr -d '\015' < original.file > new.file

Or scripts like:

#!/bin/sh
perl -p -i -e 'BEGIN { print "Converting DOS to UNIX.\n" ; } END { print "Done.\n" ; } s/\r\n$/\n/' $*

perl -p -i.bak -e 's/^\r+//;s/\r+$//;s/\r/\n/gs' file

Or, on many unixes You can use the utility "  dos2unix " to remove the ^M
Just type:  dos2unix <filename1> <filename2>  [RETURN]


dos2unix [ -ascii ] [ -iso ] [ -7 ] originalfile convertedfile 

-ascii 
Removes extra carriage returns and converts end of file characters in DOS format text files to conform to SunOS requirements. 
-iso 
This is the default. It converts characters in the DOS extended character set to the corresponding ISO standard characters. 
-7 
Convert 8 bit DOS graphics characters to 7 bit space characters so that SunOS can read the file. 


#!/bin/sh
# a script to strip carriage returns from DOS text files
if test -f $1
then
	tr -d '\r' <$1 >$.tmp
	rm $1
	mv $.tmp $1
fi

# tr -d '\015' < original.file > new.file

Note: Other formats on AIX:
---------------------------

1. nvdmetoa command:

How to convert EBCDIC files to ASCII:

On your AIX system, the tool nvdmetoa might be present.

Examples:
 
nvdmetoa <AS400.dat  >AIXver3.dat 

Converts an EBCDIC file taken off an AS400 and converts to an ASCII file for the pSeries or RS/6000 

nvdmetoa 132 <AS400.txt  >AIXver3.txt 

Converts an EBCDIC file with a record length of 132 characters to an ASCII file with 132 bytes per line 
PLUS 1 byte for the linefeed character. 


2. od command:

The od command translate a file into other formats, like for example hexadecimal format.
To translate a file into several formats at once, enter: 

# od -t cx a.out > a.xcd

This command writes the contents of the a.out file, in hexadecimal format (x) and character format (c), 
into the a.xcd file. 


24. Secure shell connections:
=============================

ssh:
====


What is Open Secure Shell?

Open Secure Shell (OpenSSH) is an open source version of the SSH protocol suite of network connectivity tools. 
The tools provide shell functions that are authenticated and encrypted. A shell is a command language interpreter 
that reads input from a command line string, stdin or a file. Why use OpenSSH? When you're running over 
unsecure public networks like the Internet, you can use the SSH command suite instead of the unsecure commands telnet, 
ftp, and r-commands.

OpenSSH delivers code that communicates using SSH1 and SSH2 protocols. What's the difference? The SSH2 protocol 
is a re-write of SSH1. SSH2 contains separate, layered protocols, but SSH1 is one large set of code. SSH2 supports 
both RSA & DSA keys, but SSH1 supports only RSA, and SSH2 uses a strong crypto integrity check, where SSH1 uses 
a CRC-32 check. The Internet Engineering Task Force (IETF) maintains the secure shell standards.


Example 1:
----------

Go to a terminal on your local Unix system (Solaris, Linux, Mac OS X, etc.) and type the following command:

ssh -l username acme.gatech.edu

Replace "username" with your Prism ID. If this is your first time connecting to acme, you will see 
a warning similar to this:

  The authenticity of host 'acme.gatech.edu (130.207.165.23)' can't be established.
  DSA key fingerprint is 72:ce:63:c5:86:3a:cb:8c:cb:43:6c:da:00:0d:4c:1f.
  Are you sure you want to continue connecting (yes/no)?

Type the word "yes" and hit <ENTER>. You should see the following warning:

  Warning: Permanently added 'acme.gatech.edu,130.207.165.23' (DSA) to the list of
  known hosts.
  
Next, you will be prompted for your password. Type your password and hit <ENTER>. 


Example 2:
----------

A secure shell 'terminal':

# ssh -l oracle 193.172.126.193
# ssh oracle@193.172.126.193


pscp:
=====

Example to Copy a file to a remote unix server:

# pscp c:\documents\foo.txt fred@example.com:/tmp/foo

To receive (a) file(s) from a remote server: 

pscp [options] [user@]host:source target
So to copy the file /etc/hosts from the server example.com as user fred to the file c:\temp\example-hosts.txt, 
you would type: 

pscp fred@example.com:/etc/hosts c:\temp\example-hosts.txt

To send (a) file(s) to a remote server: 

pscp [options] source [source...] [user@]host:target
So to copy the local file c:\documents\foo.txt to the server example.com as user 
fred to the file /tmp/foo you would type: 

pscp c:\documents\foo.txt fred@example.com:/tmp/foo

You can use wildcards to transfer multiple files in either direction, like this: 

pscp c:\documents\*.doc fred@example.com:docfiles
pscp fred@example.com:source/*.c c:\source


  Example of scripts using pscp with parameters;

  ------------------------------------
  @echo off

  REM Script om via pscp.exe een bestand van een UNIX systeem te copi%ren naar het werkstation.
  
  Echo Copy bestand van unix naar werkstation 

  SET /P systemname=Geef volledige systeemnaam:
  SET /P remotefile=Geef UNIX path+filename:
  SET /P localfile=Geef local filename:
  SET /P username=Geef username:

  echo pscp.exe %username%@%systemname%:%remotefile% %localfile%

  pscp.exe %username%@%systemname%:%remotefile% %localfile%

  echo bestand %remotefile% gecopieerd naar %localfile%
  pause

  ------------------------------------

  @echo off

  REM Script om via pscp.exe een bestand naar een UNIX systeem te copi%ren van het werkstation.
  
  Echo Copy bestand van werkstation naar unix

  SET /P systemname=Geef volledige systeemnaam:
  SET /P localfile=Geef local filename:
  SET /P remotefile=Geef UNIX path+filename:
  SET /P username=Geef username:

  echo pscp.exe %localfile% %username%@%systemname%:%remotefile% 
  pscp.exe %localfile% %username%@%systemname%:%remotefile% 
  echo bestand %localfile% gecopieerd naar %remotefile%
  pause
  ------------------------------------

scp:
====

Scp is a utility which allows files to be copied between machines. Scp is an updated version of an 
older utility named Rcp. It works the same, except that information (including the password used to log in) 
is encrypted. Also, if you have set up your .shosts file to allow you to ssh between machines 
without using a password as described in help on setting up your .shosts file, you will be able to scp 
files between machines without entering your password. 

Either the source or the destination may be on the remote machine; i.e., you may copy files or directories 
into the account on the remote system OR copy them from the account on the remote system into the account 
you are logged into. 

Example:
# scp conv1.tar.gz bu520@192.168.2.2:/backups/520backups/splenvs
# scp conv2.tar.gz bu520@192.168.2.2:/backups/520backups/splenvs


Example: 
# scp myfile xyz@sdcc7:myfile

Example: 	
To copy a directory, use the -r (recursive) option. 
# scp -r mydir xyz@sdcc7:mydir

Example: 
cd /oradata/arc
/usr/local/bin/scp *.arc  SPRAT:/oradata/arc

Example:
While logged into xyz on sdcc7, copy file "letter" into file "application" in remote account abc on sdcc3: 
% scp letter abc@sdcc3:application

While logged into abc on sdcc3, copy file "foo" from remote account xyz on sdcc7 into filename "bar" in abc: 
% scp xyz@sdcc7:foo bar


To permit a connection (ssh or scp) from a local machine to a remote machine without always 
typing a password, on the remote machine, create the file ".shosts" in your home that contains 
the name of the local machine. The permissions on "e;.shosts"e; should be rw for the user and
--- for everyone else (The command chmod 600 .shosts will set the permissions correctly). If you have 
the file ".rhosts", please delete it. 
SSH and SCP will use the ssh_know_hosts file. If the local machine is correctly entered in the user's 
.ssh/known_hosts file, then the connection will be permitted with out a password. 

To make this work, you may need to log back in from the remote machine to your local machine. 
For example, if your local machine is i7.msi.umn.edu and you want to connect to origin.msi.umn.edu, 
use the following procedure to set up connecting from i7 to origin without a password: 

Estiblish an ssh connection to origin: 
ssh -X origin.msi.umn.edu 

After typing a password and establishing a connection, Add i7.msi.umn.edu to the file "e;.shosts"e; 
in your home directory. 
Extablish an ssh connection back to i7.msi.umn.edu. 
ssh -X i7.msi.umn.edu 

After typing a password on i7, you can exit from i7. 


ssh on AIX:
===========

After you download the OpenSSL package, you can install OpenSSL and OpenSSH.

Install the OpenSSL RPM package using the geninstall command: 

# geninstall -d/dev/cd0 R:openssl-0.9.6m

Output similar to the following displays: 
SUCCESSES
---------
openssl-0.9.6m-3

Install the OpenSSH installp packages using the geninstall command: 
# geninstall -I"Y" -d/dev/cd0 I:openssh.base

Use the Y flag to accept the OpenSSH license agreement after you have reviewed the license agreement. 
(Note: we have seen this line as well: 
# geninstall -Y -d/dev/cd0 I:openssh.base)

Output similar to the following displays: 

Installation Summary                                                           
--------------------                                                           
Name                        Level           Part        Event       Result     
-------------------------------------------------------------------------------
openssh.base.client         3.8.0.5200      USR         APPLY       SUCCESS    
openssh.base.server         3.8.0.5200      USR         APPLY       SUCCESS    
openssh.base.client         3.8.0.5200      ROOT        APPLY       SUCCESS    
openssh.base.server         3.8.0.5200      ROOT        APPLY       SUCCESS     

You can also use the SMIT install_software fast path to install OpenSSL and OpenSSH.

The following OpenSSH binary files are installed as a result of the preceding procedure:

scp File copy program similar to rcp 
sftp Program similar to FTP that works over SSH1 and SSH2 protocol 
sftp-server SFTP server subsystem (started automatically by sshd daemon) 
ssh Similar to the rlogin and rsh client programs 
ssh-add Tool that adds keys to ssh-agent 
ssh-agent An agent that can store private keys 
ssh-keygen Key generation tool 
ssh-keyscan Utility for gathering public host keys from a number of hosts 
ssh-keysign Utility for host-based authentication 
ssh-rand-helper A program used by OpenSSH to gather random numbers. It is used only on AIX 5.1 installations. 
sshd Daemon that permits you to log in 

The following general information covers OpenSSH: 
The /etc/ssh directory contains the sshd daemon and the configuration files for the ssh client command. 
The /usr/openssh directory contains the readme file and the original OpenSSH open-source license text file. 
This directory also contains the ssh protocol and Kerberos license text. 

The sshd daemon is under AIX SRC control. You can start, stop, and view the status of the daemon 
by issuing the following commands: 

startsrc -s sshd   OR startsrc -g ssh  (group)
stopsrc -s sshd    OR stopsrc -g ssh
lssrc -s sshd      OR lssrc -s ssh


More on ssh-keygen:
===================


ssh-keygen: password-less SSH login 
SSH is often used to login from one system to another without requiring passwords. 
A number of methods may be used for that to work properly, one of which is to setup a 
.rhosts file (permission 600) with its content being the name of the remote system you trust, 
followed by the username your trust: 

nickel.sao.nrc.ca cantin 

would mean you trust user cantin from nickel.sao.nrc.ca to connect to your account, 
without requiring a password. 
But for that to work, SSH itself must be configured to trust .rhosts files (which it does not 
for most OpenSSH installations - but we do on most systems RCSG maintains), and the private/public key pair 
of each system must be properly set in the system-wide ssh_known_hosts public key file. 

This, of course, requires help from the local systems administrator. 

The second method does not require any help from the systems administrator. And it does not require modifications 
to the .rhosts file. Instead, it requires you generate your own personal set of private/public pair. 

ssh-keygen is used to generate that key pair for you. Here is a session where your own personal 
private/public key pair is created: 

cantin@sodium:~> ssh-keygen -t rsa
Generating public/private rsa key pair.
Enter file in which to save the key (/home/cantin/.ssh/id_rsa):
Enter passphrase (empty for no passphrase):
Enter same passphrase again:
Your identification has been saved in /home/cantin/.ssh/id_rsa.
Your public key has been saved in /home/cantin/.ssh/id_rsa.pub.
The key fingerprint is:
f6:61:a8:27:35:cf:4c:6d:13:22:70:cf:4c:c8:a0:23 cantin@sodium

The command ssh-keygen -t rsa initiated the creation of the key pair. 

No passphrase was entered (Enter key was pressed instead). 

The private key was saved in .ssh/id_rsa. This file is read-only and only for you. No one else must 
see the content of that file, as it is used to decrypt all correspondence encrypted with the public key. 

The public key is save in .ssh/id_rsa.pub. 

In this case, the content of file id_rsa.pub is 

ssh-rsa AAAAB3NzaC1yc2EAAAABIwAAAIEArkwv9X8eTVK4F7pMlSt45pWoiakFkZMw
G9BjydOJPGH0RFNAy1QqIWBGWv7vS5K2tr+EEO+F8WL2Y/jK4ZkUoQgoi+n7DWQVOHsR
ijcS3LvtO+50Np4yjXYWJKh29JL6GHcp8o7+YKEyVUMB2CSDOP99eF9g5Q0d+1U2WVdB
WQM= cantin@sodium

It is one line in length. 

Its content is then copied in file .ssh/authorized_keys of the system you wish to SSH to without 
being prompted for a password. 

The example shown here generated keys on sodium by user cantin. If the public key generated, 
file .ssh/id_rsa.pub, was copied to your account, file .ssh/authorized_keys on nickel.sao.nrc.ca, 
then user cantin@sodium is allowed to SSH into your own account on nickel.sao.nrc.ca without 
the use of a password. 

To summarize, a personal private/public key pair is generated using the ssh-keygen command. 
The public key is then copied onto a remote systems' .ssh/authorized_keys file. 
And you can now SSH to the remote systems's account without the use of a password. 

Example: 
--------

The backup user bu520 on a p520, needs to copy backupfiles to a p550.
The process is a cronjob which uses scp. The user should not be
confronted with a pasword entry.

On p520:

/home/bu520/.ssh:>ls -al
total 7
drwx------   2 bu520    staff           512 Apr 24 2006  .
drwxr-xr-x   3 bu520    staff           512 Apr 24 2006  ..
-rw-------   1 bu520    staff           883 Apr 24 2006  id_rsa
-rw-r--r--   1 bu520    staff           225 Apr 24 2006  id_rsa.pub
-rw-r--r--   1 bu520    staff           663 Jun 01 2006  known_hosts

/home/bu520/.ssh:>cat id_rsa
-----BEGIN RSA PRIVATE KEY-----
MIICWgIBAAKBgQCq901MXZ+l+QFUkyLUgPskqEYz11eGR0nFr0ydVsUDrAnAQngE
BGNyrURqGxC+vA2dhU1kdeDLa6PlrxrQ9j02hpcG4mSO369BzJ3QEg9C4yPnHxfJ
L9/GauVRzgY3WjmCzwAm51GOsW6S/1s9SQWDG4uepvuUTasIZgf3fktcKQIBIwKB
gQCNqFX9ciUxv7ClKXShci8tAHSunHu4ZvP7kT97DWFpcUnocZakPiaDluDqM67J
7EXLqPb7d50AUd+SbIPu9+mSOTrkXSBII+eVzMIM8yJKgy8+nrsctDE3vw/ZGb+l
Gf8R6zwd2YR0Y2LBS0RSP5DNgf4B6FZO9o+VGTjMlvYkiwJBANfwcJL5G9EQmQkO
zzVhkX4N/oXN3LmmbI9+QMPHhbXiXj2J0sqchx/gir+hcPo9PsRq5gHgtO2Hr+qS
sAFWAMkCQQDKrvV1GFnIzcfVQ7Nwnso5hJ0F2tt5cLV5OXTz/x9Y09n5+M77tBEr
QvunF+Sg9jHUuTHtzTCgfuJUMLqAJJBhAkB1OWGu3wB4zn72Sd4y69Kjg/CRx4Zz
aPkaskBqR72dQF8LdrRCGnU9MMBZZkSlGe7fp76wj+0wfNvXHG4snGbTAkAXKfAq
o7J9WViqqKbLCtVIZu1fwT2nephloCqfit8C1mIN8IyvDUPKbg4huZZ4y63sbO/D
Z+hM200Q76BJKMALAkB/ocrU8gkAiTBqanu0HR8bsLpIQRM+bAohXc2+wGSOFeZG
ZijMWsvl+FDtLWcFgEi3fB6dR86YSax5VFLhsLIL
-----END RSA PRIVATE KEY-----

/home/bu520/.ssh:>cat id_rsa.pub
ssh-rsa AAAAB3NzaC1yc2EAAAABIwAAAIEAqvdNTF2fpfkBVJMi1ID7JKhGM9dXhkdJxa9MnVbFA6wJwEJ4BARjcq1EahsQvrwNnYVNZHXgy2uj5a8a0PY9NoaXBuJkjt+vQcyd0BIPQuMj5x8XyS/fxmrlUc4GN1o5gs8AJudRjrFukv9bPUkFgxuLnqb7lE2rCGYH935LXCk= bu520@ol116u209

/home/bu520/.ssh:>cat id_rsa.pub
ssh-rsa AAAAB3NzaC1yc2EAAAABIwAAAIEAqvdNTF2fpfkBVJMi1ID7JKhGM9dXhkdJxa9MnVbFA6wJwEJ4BARjcq1EahsQvrwNnYVNZHXgy2uj5a8a0PY9NoaXBuJkjt+vQcyd0BIPQuMj5x8XyS/fxmrlUc4GN1o5gs8AJudRjrFukv9bPUkFgxuLnqb7lE2rCGYH935LXCk= bu520@ol116u209


/home/bu520/.ssh:>cat known_hosts
192.168.2.2 ssh-rsa AAAAB3NzaC1yc2EAAAABIwAAAIEAx16h52LfGNbf5VIn4zDsIWSnFm668YZ3k2immcyA+ih5RRohh9f+Z8lS9EFDvnNQsTLMwduPBpjXPZY3mZXOVDtpsu6rnKCWKNx9DFaxsLtBSk+1tV4Yr1u7nO6hxs/2vE5xwWys5qQP0XABJ/m0+eY8IYMkE/LeXXw0to8iz7c=
192.168.2.3 ssh-rsa AAAAB3NzaC1yc2EAAAABIwAAAIEAzSFdlVb+RyI5k3pWcpsP0oMcAhMgmb7g/GKLfOyAtf1+c+MeVADz3jJzZywDKvzAJ+o409nhDSIuqvuoRQ2wva08jrPh16ewnSfGzjWY0n9aAMztMwWIvEXodowBNJVSBGV4SZdgtzqauQ06H22dl0vORdie0/4M5OHYYbV2lxE=
192.168.1.2 ssh-rsa AAAAB3NzaC1yc2EAAAABIwAAAIEAx16h52LfGNbf5VIn4zDsIWSnFm668YZ3k2immcyA+ih5RRohh9f+Z8lS9EFDvnNQsTLMwduPBpjXPZY3mZXOVDtpsu6rnKCWKNx9DFaxsLtBSk+1tV4Yr1u7nO6hxs/2vE5xwWys5qQP0XABJ/m0+eY8IYMkE/LeXXw0to8iz7c=


Automatic startup of sshd on boot:
----------------------------------

For example, on AIX create the following script "Sssh" in /etc/rc.d/rc2.d

root@zd110l14:/etc/rc.d/rc2.d#cat Ssshd
#!/bin/ksh

##################################################
# name: Ssshd
# purpose: script that will start or stop the sshd daemon.
##################################################

case "$1" in
start )
        startsrc -g ssh
        ;;
stop )
        stopsrc -g ssh
        ;;
* )
        echo "Usage: $0 (start | stop)"
        exit 1
esac


25. Pipelining and Redirecting:
===============================

CONCEPT: UNIX allows you to connect processes, by letting the standard output of one process feed into the 
standard input of another process. That mechanism is called a pipe. 
Connecting simple processes in a pipeline allows you to perform complex tasks without writing complex programs. 

EXAMPLE: Using the more command, and a pipe, send the contents of your .profile and .shrc files to the 
screen by typing 

cat .profile .shrc | more
to the shell. 

EXERCISE: How could you use head and tail in a pipeline to display lines 25 through 75 of a file? 

ANSWER: The command 

cat file | head -75 | tail -50

would work. The cat command feeds the file into the pipeline. The head command gets the first 75 lines 
of the file, and passes them down the pipeline to tail. The tail command then filters out all but the last 
50 lines of the input it received from head. It is important to note that in the above example, tail never 
sees the original file, but only sees the part of the file that was passed to it by the head command. 
It is easy for beginners to confuse the usage of the input/output redirection symbols < and >, with the 
usage of the pipe. Remember that input/output redirection connects processes with files, while the pipe connects 
processes with other processes. 

Grep
The grep utility is one of the most useful filters in UNIX. Grep searches line-by-line for a specified pattern, 
and outputs any line that matches the pattern. The basic syntax for the grep command is 
grep [-options] pattern [file]. If the file argument is omitted, grep will read from standard input.
 It is always best to enclose the pattern within single quotes, to prevent the shell 
from misinterpreting the command. 

The grep utility recognizes a variety of patterns, and the pattern specification syntax was taken from the 
vi editor. Here are some of the characters you can use to build grep expressions: 

The carat (^) matches the beginning of a line. 
The dollar sign ($) matches the end of a line. 
The period (.) matches any single character. 
The asterisk (*) matches zero or more occurrences of the previous character. 
The expression [a-b] matches any characters that are lexically between a and b. 

EXAMPLE: Type the command 

grep 'jon' /etc/passwd

to search the /etc/passwd file for any lines containing the string "jon". 

EXAMPLE: Type the command 

grep '^jon' /etc/passwd
to see the lines in /etc/passwd that begin with the character string "jon". 

EXERCISE:List all the files in the /tmp directory owned by the user root. 

EXPLANATION: The command 

ls -l /tmp | grep 'root'
would show all processes with the word "root" somewhere in the line. That doesn't necessarily mean that 
all the process would be owned by root, but using the grep filter can cut the down the number of processes 
you will have to look at. 


Redirecting:
------------

CONCEPT: Every program you run from the shell opens three files: Standard input, standard output, 
and standard error. The files provide the primary means of communications between the programs, 
and exist for as long as the process runs. 

The standard input file provides a way to send data to a process. As a default, the standard input is read 
from the terminal keyboard. 

The standard output provides a means for the program to output data. As a default, the standard output 
goes to the terminal display screen. 

The standard error is where the program reports any errors encountered during execution. 
By default, the standard error goes to the terminal display. 

CONCEPT: A program can be told where to look for input and where to send output, using input/output 
redirection. UNIX uses the "less than" and "greater than" special characters (< and >) to signify input 
and output redirection, respectively. 


Redirecting input
Using the "less-than" sign with a file name like this: 
< file1 

in a shell command instructs the shell to read input from a file called "file1" instead of from the keyboard. 

EXAMPLE:Use standard input redirection to send the contents of the file /etc/passwd to the more command: 

more < /etc/passwd 

Many UNIX commands that will accept a file name as a command line argument, will also accept input from 
standard input if no file is given on the command line. 

EXAMPLE: To see the first ten lines of the /etc/passwd file, the command: 

head /etc/passwd 
will work just the same as the command: 
head < /etc/passwd 

Redirecting output
Using the "greater-than" sign with a file name like this: 
> file2 
causes the shell to place the output from the command in a file called "file2" instead of on the screen. 
If the file "file2" already exists, the old version will be overwritten. 

EXAMPLE: Type the command 

ls /tmp > ~/ls.out

to redirect the output of the ls command into a file called "ls.out" in your home directory. 
Remember that the tilde (~) is UNIX shorthand for your home directory. In this command, the ls command 
will list the contents of the /tmp directory. 
Use two "greater-than" signs to append to an existing file. For example: 

>> file2 

causes the shell to append the output from a command to the end of a file called "file2". If the file 
"file2" does not already exist, it will be created. 

EXAMPLE: In this example, I list the contents of the /tmp directory, and put it in a file called myls. 
Then, I list the contents of the /etc directory, and append it to the file myls: 

ls /tmp > myls 
ls /etc >> myls 

Redirecting error
Redirecting standard error is a bit trickier, depending on the kind of shell you're using 
(there's more than one flavor of shell program!). In the POSIX shell and ksh, redirect the standard error 
with the symbol "2>". 

EXAMPLE: Sort the /etc/passwd file, place the results in a file called foo, and trap any errors in a file 
called err with the command: 

sort < /etc/passwd > foo 2> err 


===========================
27. UNIX DEVICES and mknod:
===========================


27.1 Note 1:
============

the files in the /dev directory are a little different from anything you may be used to in 
other operating systems. 
The very first thing to understand is that these files are NOT the drivers for the devices. Drivers are in 
the kernel itself (/unix etc..), and the files in /dev do not actually contain anything at all: 
they are just pointers to where the driver code can be found in the kernel. There is nothing more to it 
than that. These aren't programs, they aren't drivers, they are just pointers. 

That also means that if the device file points at code that isn't in the kernel, it obviously is not 
going to work. Existence of a device file does not necessarily mean that the device code is in the kernel, 
and creating a device file (with mknod) does NOT create kernel code. 

Unix actually even shows you what the pointer is. When you do a long listing of a file in /dev, 
you may have noticed that there are two numbers where the file size should be: 


brw-rw-rw-   2 bin      bin        2, 64 Dec  8 20:41 fd0

That "2,64" is a pointer into the kernel. I'll explain more about this in a minute, 
but first look at some more files: 

brw-rw-rw-   2 bin      bin        2, 64 Dec  8 20:41 fd0
brw-rw-rw-   2 bin      bin        2, 48 Sep 15 16:13 fd0135ds15
brw-rw-rw-   2 bin      bin        2, 60 Feb 12 10:45 fd0135ds18
brw-rw-rw-   1 bin      bin        2, 16 Sep 15 16:13 fd0135ds21
brw-rw-rw-   2 bin      bin        2, 44 Sep 15 16:13 fd0135ds36
brw-rw-rw-   3 bin      bin        2, 36 Sep 15 16:13 fd0135ds9

A different kind of device would have a different major number. For example, here are the serial com ports: 

crw-rw-rw-   1 bin      bin        5,128 Feb 14 05:35 tty1A
crw-rw-rw-   1 root     root       5,  0 Dec  9 13:13 tty1a
crw-rw-rw-   1 root     sys        5,136 Nov 25 07:28 tty2A
crw-r--r--   1 uucp     sys        5,  8 Nov 25 07:16 tty2a

Notice the "b" and the "c" as the first characters in the mode of the file. It designates whether
we have a block "b", or a character "c" device.

Notice that each of these files shares the "5" part of the pointer, but that the other number is different. 
The "5" means that the device is a serial port, and the other number tells exactly which com port you are 
referring to. In Unix parlance, the 5 is the "major number" and the other is the "minor number". 

These numbers get created with a "mknod" command. For example, you could type "mknod /dev/myfloppy b 2 60" and 
then "/dev/myfloppy" would point to the same driver code that /dev/fd0135ds18 points to, and it would 
work exactly the same. 

This also means that if you accidentally removed /dev/fd0135ds18, you could instantly recreate it with "mknod". 

But if you didn't know that the magic numbers were "2,60", how could you find out? 

It turns out that it's not hard. 

First, have a look at "man idmknod". The idmknod command wipes out all non-required devices, and then recreates them. 
Sounds scary, but this gets called every time you answer "Y" to that "Rebuild Kernel environment?" question that 
follows relinking. Actually, on 5.0.4 and on, the existing /dev files don't get wiped out; the command simply 
recreates whatever it has to. 

idmknod requires several arguments, and you'd need to get them right to have success. You could make it easier 
by simply relinking a new kernel and answering "Y" to the "Rebuild" question, but that's using a fire hose to 
put out a candle. 

A less dramatic method would be to look at the files that idmknod uses to recreate the device nodes. These are found 
in /etc/conf/node.d 

In this case, the file you want would be "fd". A quick look at part of that shows: 

fd	fd0		b	64	bin	bin	666
fd	fd0135ds36	b	44	bin	bin	666
fd	fd0135ds21	b	16	bin	bin	666
fd	fd0135ds18	b	60	bin	bin	666
fd	fd0135ds15	b	48	bin	bin	666
fd	fd0135ds9	b	36	bin	bin	666
fd	fd048		b	4	bin	bin	666

This gives you *almost* everything you need to know about the device nodes in the "fd" class. The only thing it 
doesn't tell you is the major number, but you can get that just by doing an "l" of any other fd entry: 

brw-rw-rw-   1 bin      bin        2, 60 Feb  5 09:45 fd096ds18

this shows you that the major number is "2". 

Armed with these two pieces of information, you can now do 

mknod /dev/fd0135ds18 b 2 60
chown bin /dev/fd0135ds18
chgrp bin /dev/fd0135ds18
chmod 666 /dev/fd0135ds18

If you examined the node file closely, you would also notice that /dev/rfd0135ds18 and /dev/fd0135ds18 differ only 
in that the "r" version is a "c" or character device and the other is "b" or block. If you had already known that, 
you wouldn't have even had to look at the node file; you'd simply have looked at an "l" of the /dev/rfd0135ds18 and 
recreated the block version appropriately. 

There are other fascinating things that can be learned from the node files. For example, fd096ds18 is also minor number 60, 
and can be used in the same way with identical results. In other words, if you z'd out (were momentarily innattentive, 
not CTRL-Z in a job control shell) and dd'd an image to /dev/fd096ds18, it would write to your hd floppy without incident. 

If you have a SCSI tape drive, notice what happens when you set it to be the "default" tape drive. 
It creates device files that have different names (rct0, etc.) but that have the same major and minor numbers. 

Knowing that it's easy to recreate missing device files also means that you can sometimes capture the output 
of programs that write directly to a device. For example, suppose some application prints directly to /dev/lp 
but you need to capture this to a file. In most situations, you can simply "rm /dev/lp" (after carefully noting 
its current ownership, permissions and, of course, major/minor numbers), and then "touch /dev/lp" to create an 
ordinary file. You'll need to chmod it for appropriate permissions, and then run your app. Unless the app has 
tried to do ioctl calls on the device, the output will be there for your use. This can be particularly useful 
for examining control characters that the app is sending. 

What's the Difference?
One question that comes up fairly often is "what's the difference between a block and a character device and when 
should I use one rather than the other?". To answer that question fully is hard, but I'm going to try to at least 
get you started here. 

The real difference lies in what the kernel does when a device file is accessed for reading or writing. If the device 
is a block device, the kernel gives the driver the address of a kernel buffer that the driver will use as the source 
or destination for data. Note that the address is a "kernel" address; that's important because that buffer will be 
cached by the kernel. If the device is raw , then the address it will use is in the user space of the process that is 
using the device. A block device is something you could make a filesystem on (a disk). You can move forward and backward, 
from the beginning of a block device to its end, and then back to the beginning again. If you ask to read a block that 
the kernel has buffered, then you get data from the buffer. If you ask for a block that has not yet been buffered, 
the kernel reads that block (and probably a few more following it) into the buffer cache. If you write to a block device, 
it goes to the buffer cache (eventually to the device, of course). A raw (or character) device is often something that 
doesn't have a beginning or end; it just gives a stream of characters that you read. A serial port is an excellent 
example- however, it is not at all unusual to have character (raw) drivers for things that do have a beginning 
and an end- a tape drive, for example. And many times there are BOTH character and block devices for the same 
physical device- disks, for example. Nor does using a raw device absolutely mean that you can't move forward and back, 
from beginning to end- you can move wherever you want with a tape or /dev/rfd0. 

And that's where the differences get confusing. It seems pretty reasonable that you'd use the block device to mount 
a disk. But which do you use for format? For fsck? For mkfs? 

Well, if you try to format /dev/fd0135ds18, you'll be told that it is not a formattable device. 
Does that make any sense? Well, the format process involves sequential access- it starts at the beginning and just 
keeps on going, so it seems to make sense that it wouldn't use the block device. But you can run "mkfs" on either 
the block or character device; it doesn't seem to care. The same is true for fsck. But although that's true for those 
programs on SCO OSR5, it isn't necessarily going to be true on some other UNIX, and the "required" device may make sense 
to whover wrote the program, but it may not make sense to you. 

You'd use a block device when you want to take advantage of the caching provided by the kernel. You'd use the raw device 
when you don't, or for ioctl operations like "tape status" or "stty -a". 


27.2 Note 2:
============


One of the unique things about Unix as an operating system is that regards everything as a file. Files can be divided into 
three categories; ordinary or plain files, directories, and special or device files.

Directories in Unix are properly known as directory files. They are a special type of file that holds a list of the 
other files they contain. 

Ordinary or plain files in Unix are not all text files. They may also contain ASCII text, binary data, and program input 
or output. Executable binaries (programs) are also files, as are commands. When a user enters a command, the associated 
file is retrieved and executed. This is an important feature and contributes to the flexibility of Unix.

Special files are also known as device files. In Unix all physical devices are accessed via device files; they are 
what programs use to communicate with hardware. Files hold information on location, type, and access mode for a 
specific device. There are two types of device files; character and block, as well as two modes of access.

- Block device files are used to access block device I/O. Block devices do buffered I/O, meaning that the the data is 
  collected in a buffer until a full block can be transfered.

- Character device files are associated with character or raw device access. They are used for unbuffered data transfers 
  to and from a device. Rather than transferring data in blocks the data is transfered character by character. 
  One transfer can consist of multiple characters.

So what about a device that could be accessed in character or block mode? How many device files would it have? 

One. 
Two. 
There are no such devices. 

Some devices, such as disk partitions, may be accessed in block or character mode. Because each device file corresponds 
to a single access mode, physical devices that have more than one access mode will have more than one device file.

Device files are found in the /dev directory. Each device is assigned a major and minor device number. The major 
device number identifies the type of device, i.e. all SCSI devices would have the same number as would all the keyboards. 
The minor device number identifies a specific device, i.e. the keyboard attached to this workstation.

Device files are created using the mknod command. The form for this command is:

mknod device-name type major minor 

device-name is the name of the device file 
type is either "c" for character or "b" for block 
major is the major device number 
minor is the minor device number 
The major and minor device numbers are indexed to device switches. There are two types of device switches; c
devsw for character devices and bdevsw for block devices. These switches are kernel structures that hold the names 
of all the control routines for a device and tell the kernel which driver module to execute. Device switches are 
actually tables that look something like this:

0 keyboard 
1 SCSIbus 
2 tty 
3 disk 
Using the ls command in the /dev directory will show entries that look like:

brw-r----- 1 root sys 1, 0 Aug 31 16:01 /dev/sd1a 

The "b" before the permissions indicates that this is a block device file. When a user enters /dev/sd1a the kernel sees 
the file opening, realizes that it's major device number 1, and calls up the SCSIbus function to handle it.


====================
28. Solaris devices:
====================

Devices are described in three ways in the Solaris environment, using three distinct naming
conventions: the physical device name, the instance name, and the logical device name.

Solaris stores the entries for physical devices under the /devices directory, 
and the logical device entries behind the /dev directory.


- A "physical device name" represents the full pathname of the device. 
  Physical device files are found in the /devices directory and have a
  naming convention like the following example:

  /devices/sbus@1,f8000000/esp@0,40000/sd@3,0:a

  Each device has a unique name representing both the type of device and the location of that device
  in the system-addressing structure called the "device tree". The OpenBoot firmware builds the 
  device tree for all devices from information gathered at POST. The device tree is loaded in memory
  and is used by the kernel during boot to identify all configured devices.
  A device pathname is a series of node names separated by slashes. 
  Each device has the following form: 
  
  driver-name@unit-address:device-arguments


- The "instance name" represents the kernel's abbreviated name for every possible device
  on the system. For example, sd0 and sd1 represents the instance names of two SCSI disk devices.
  Instance names are mapped in the /etc/path_to_inst file, and are displayed by using the
  commands dmesg, sysdef, and prtconf

- The "Logical device names" are used with most Solaris file system commands to refer to devices.
  Logical device files in the /dev directory are symbolically linked to physical device files
  in the /devices directory. Logical device names are used to access disk devices in the
  following circumstances:
  - adding a new disk to the system and partitioning the disk
  - moving a disk from one system to another
  - accessing or mounting a file system residing on a local disk
  - backing up a local file system
  - repairing a file system

  Logical devices are organized in subdirs under the /dev directory by their device types
  /dev/dsk    block interface to disk devices
  /dev/rdsk   raw or character interface to disk devices. 
              In commands, you mostly use raw logical devices, like for example # newfs /dev/rdsk/c0t3d0s7
  /dev/rmt    tape devices
  /dev/term   serial line devices 
  etc..

  Logical device files have a major and minor number that indicate device drivers, 
  hardware addresses, and other characteristics.
  Furthermore, a device filename must follow a specific naming convention.
  A logical device name for a disk drive has the following format:

  /dev/[r]dsk/cxtxdxsx

  where cx refers to the SCSI controller number, tx to the SCSI bus target number,
  dx to the disk number (always 0 except on storage arrays)
  and sx to the slice or partition number.

  
===========================
29. filesystems in Solaris:
===========================


29.1 A few traditional filesystem commands:
===========================================

The UFS filesystem has always been the most popular fs on Solaris.
Ofcourse, when the newer ZFS filesystem became available, it has been rapidly adopted.

We will frst take a look at a few classical commands, that you would typically use on a UFS filesystem.
Ofcourse, many "listing commands" like for example, df (to show what's used and what is free space), 
can be used on ZFS as well. But creating an fs on ZFS goes absolutly different from what you can find in section 29.1


Checks on the filesystems in Solaris:
-------------------------------------

1. used space etc.. 
#  df -k, df -h etc..

# du -ks /home/fred 

Shows only a summary of the disk usage of the /home/fred subdirectory (measured in kilobytes).

# du -ks /home/fred/* 

Shows a summary of the disk usage of each subdirectory of /home/fred (measured in kilobytes).

# du -s /home/fred

Shows a total summary of /home/fred

# du -sg /data

Shows a total summary of /data in GB


This command shows the diskusage of /dirname in GB
# du -g /dirname

2. examining the disklabel
#  prtvtoc /dev/rdisk/c0t3d0s2

3. format just by itself shows the disks
#  format

#  format -> specify disk -> choose partition -> choose print to get the partition table

4. Display information about SCSI devices

# cfgadm -al

or, from the PROM, commands like probe-scsi


What is the CDROM device in Solaris:
------------------------------------

-- pointer 1.

If you have a CD put in the drive, and it was automounted, simply use the "df" command to view your filesystems:

# df -k    or df -h

-- pointer 2.

From the output of the command

# iostat -En

you could figure out what logical device name your CDROM has.

-- pointer 3.

Solaris uses the same naming conventions as used with hardisks, for example the CDROM in the following command

# mount -r -F hsfs /dev/dsk/c0t6d0s2 /cdrom

means that in this case, the CDROM device is "/dev/dsk/c0t6d0s2"
Normally, a CD is automounted on "/cdrom" or "/cdrom/cdrom0"

The simplest way to mount CDROM on Solaris is use vold daemon.  The vold daemon in Solaris manages the CD-ROM device 
and automatically performs the mounting similar to how Windows manages CDROMs (but not as transparent or reliable). 
If CD is detected in drive its should be  automatically mounted to the /cdrom/cdrom0 directory. 


Recovering disk partition information in Solaris:
-------------------------------------------------

Use the fmthard command to write the backup VTOC information back to the disk.
The following example uses the fmthard command to recover a corrupt label on a disk
named /dev/rdisk/c0t3d0s1. The backup VTOC information is in a file named c0t3d0
in the /vtoc directory.

# fmthard -s /vtoc/c0t3d0s0 /dev/rdsk/c0t3d0s2

Remember that the format of /dev/(r)dsk/cWtXdYsZ means:

W is the controller number,
X is the SCSI target number,
Y is the logical unit number (LUN, almost always 0),
Z is the slice or partition number

Make a new filesystem in Solaris:
---------------------------------

To create a UFS filesystem on a formatted disk that already has been divided into slices
you need to know the raw device filename of the slice that will contain the filesystem.
Example:

# newfs /dev/rdsk/c0t3d0s7

defaults on UFS on Solaris: 
blocksize 8192
fragmentsize 1024
one inode for each 2K of diskspace

FSCK in Solaris:
----------------

If you just want to determine the state of a filesystem, whether it needs checking, 
you can use the fsck command while the fs is mounted.
Example:

# fsck -m /dev/rdsk/c0t0d0s6

The state flag in the superblock of the filesystem you specify is checked to see
whether the filesystem is clean or requires checking.

If you ommit the device argument, all the filesystems listed in /etc/vfstab  with a fsck 
pass value greater than 0 are checked.


Adding a disk in Solaris 2.6, 2.7, 8, 9:
----------------------------------------

In case you have just build in a new disk,
its probably best, to first use the probe-scsi command from the OK prompt:

ok probe-scsi
..
Target 3
 Unit 0  Disk   Seagate ST446452W   0001
..

Next, do a reconfiguration reboot, with the "boot -r" command:

ok boot -r

Specifying the -r flag when booting, tells Solaris to reconfigure itself by scanning
for new hardware.
Once the system is up, check the output for "dmesg" to find kernel messages relating
to the new disk.
You probably find complaints telling you stuff as "corrupt label - wrong magic number" etc..
That's good, because we now know that the kernel is aware of this new disk.

In this example, our disk is SCSI target 3, so we can refer to the whole disks as
/dev/rdsk/c0t3d0s2           # slice 2, or partition 2, s2 refers to the whole disk


Remember that the format of /dev/(r)dsk/cWtXdYsZ means:

W is the controller number,
X is the SCSI target number,
Y is the logical unit number (LUN, almost always 0),
Z is the slice or partition number


We now use the format program to partition the disk, and afterwards create filesystems.

# format /dev/rdsk/c0t3d0s2
(.. output..)
FORMAT MENU:

format>label
Ready to label disk, continue? y

format>partition 
PARTITION MENU:

partition>

Once you have created and sized the partitions, you can get a list with the "partition>print" command.

Now, for example, you can create a filesystem like in the following command:

# newfs /dev/rdsk/c0t3d0s0


devfsadm:
---------

As from Solaris 8:

devfsadm(1M) maintains the /dev and /devices namespaces. It replaces the previous suite of devfs administration tools 
including drvconfig(1M) , disks(1M) , tapes(1M) , ports(1M) , audlinks(1M) , and devlinks(1M) .

The default operation is to attempt to load every driver in the system and attach to all possible device instances. devfsadm then creates 
device special files in /devices and logical links in /dev .

In other words, the devfsadm command is used to dynamically reconfigure system device tables
without having to reboot the system.

Examples:

# devfsadm -i sd
# devfsadm -c tape

In the first example, devfsadm configures only those devices supported by the
sd driver. In the second example, devfsadm configures only tape devices.


29.2 Notes on filesystems on Solaris:
=====================================

There are at least 4 different types of filesystems you can use with Solaris 10 (except for zfs, 
for the older Solaris 8 and 9 versions).
These are:

-- UFS
The traditional filesystem for Solaris systems. UFS is old technology but it is a stable and fast filesystem. 
Sun has continuously tuned and improved the code over the years.
Solaris 10 (and older ofcouse) can only boot from a UFS root filesystem. In the future, 
ZFS boot will be available, as it already is in OpenSolaris. But for now, every Solaris system must have 
at least one UFS filesystem.
Note: This "boot-statement" was true at the time of writing. Maybe you read this way after that time, and maybe
Solaris can now boot from zfs or other filesystem.

-- ZFS
We will talk a bit on ZFS in section 29.3

-- VxFS
The Veritas filesystem and volume manager have their roots in a fault-tolerant proprietary minicomputer 
built by Veritas in the 1980s. They have been available for Solaris since at least 1993 and have been 
ported to AIX and Linux. They are integrated into HP-UX and SCO UNIX, and Veritas Volume Manager code 
has been used (and extensively modified) in Tru64 UNIX and even in Windows. 
VxFS has never been part of Solaris but, when UFS was the only option, it was a popular addition. 
VxVM and VxFS are tightly integrated. Through vxassist, one may shrink and grow filesystems and their 
underlying volumes with minimal trouble. 

VxFS can run in single instance mode or in a parallel access/cluster file system mode. 
This latter mode allows for multiple servers (also known as cluster nodes) to simultaneously access 
the same file system. When run in this mode, VxFS is referred to as VERITAS Cluster File System. 
Cluster File System provides cache coherency and POSIX compliance across nodes, so that data changes 
are atomically seen by all cluster nodes simultaneously. Because Cluster File System shares the same 
binaries and same on-disk layout as single instance VxFS, moving between cluster and single instance mode 
is straightforward.


-- SAM and QFS
QFS is Sun's cluster filesystem, meaning that the same filesystem may be simultaneously mounted 
by multiple systems. SAM is a hierarchical storage manager; it allows a set of disks to be used 
as a cache for a tape library. SAM and QFS are designed to work together, but each may be used separately. 

-- PCFS
It's even possible to use the DOS FAT filesystem.

-- HSFS
Ofcourse, the CDROM HSFS can be used.

Maybe the following list will show you what can be used in Solaris:

Filesystem 	Type 	Device 		Description 
UFS 		Regular Disk 		Unix Fast filesystem; default in Solaris
ZFS 		Regular	Disk		The new Regular FS in Solaris 10 
VxFS 		Regular Disk 		Veritas filesystem 
QFS 		Regular Disk 		QFS filesystem from LSC Inc. 
pcfs 		Regular Disk 		MSDOS FAT and FAT32 filesystem 
hsfs 		Regular Disk 		High Sierra filesystem (CDROM) 
tmpfs 		Regular Memory 		Uses memory and swap 
nfs 		Pseudo 	Network 	Network filesystem 
cachefs 	Pseudo 	filesystem 	Uses a local disk as cache for another NFS filesystem 
autofs 		Pseudo 	filesystem 	Uses a dynamic layout to mount other filesystems 
specfs 		Pseudo 	Device drivers 	filesystem for the /dev devices 
procfs 		Pseudo 	Kernel 		/proc filesystem representing processes 
sockfs 		Pseudo 	Network		Filesystem of socket connections 
fifofs 		Pseudo 	Files 		FIFO filesystem 

If we look at the regular disk based filesystems, the following can be said on the "allocation format":

Filesystem 	Allocation format 
UFS 		Block, allocator tries to allocate sequential blocks 
VxFS 		Extent based 
QFS 		Extent based 
ZFS		Extent based


29.3 Some notes on the ZFS filesystem. Solaris 10 
=================================================


>>> ZFS Pooled Storage:
-----------------------

ZFS uses the concept of storage pools to manage physical storage. Historically, file systems were constructed on top of a single physical device. 
To address multiple devices and provide for data redundancy, the concept of a "logical volume manager", LVM, was introduced to provide for Volume Groups,
and Logical Volumes (which could span multiple disks), and then add a filesystem on such a Logical Volume. This design added another layer 
of complexity and ultimately prevented certain file system advances, because the file system had no control over the physical placement 
of data on the virtualized volumes. 

ZFS eliminates the volume management altogether. Instead of forcing you to create virtualized volumes, ZFS aggregates devices into a storage pool. 
The storage pool describes the physical characteristics of the storage (device layout, data redundancy, and so on,) and acts as an arbitrary data store 
from which file systems can be created. File systems are no longer constrained to individual devices, allowing them to share space with all file systems 
in the pool. You no longer need to predetermine the size of a file system, as file systems grow automatically within the space allocated to the storage pool. 
When new storage is added, all file systems within the pool can immediately use the additional space without additional work. In many ways, 
the storage pool acts as a virtual memory system. When a memory DIMM is added to a system, the operating system doesn't force you to invoke some commands 
to configure the memory and assign it to individual processes. All processes on the system automatically use the additional memory.

Everything you hate about managing file systems and volumes is gone: you don't have to use format, and create slices/partitions, use newfs, mount, edit /etc/vfstab, 
fsck, growfs, metadb, metainit, etc.

Meet your new best friends: zpool and zfs.

ZFS is easy, so let's get on with it! It's time to create your first pool: 

# zpool create tank c1t2d0

You now have a single-disk storage pool named tank, with a single file system mounted at /tank. There is nothing else to do.
Yes, its really true: 
The new ZFS file system, tank, can use as much of the disk space as needed, and is automatically mounted at /tank.

You can determine if your pool was successfully created by using the zpool list command. 

# zpool list
NAME                    SIZE    USED   AVAIL    CAP  HEALTH     ALTROOT
tank                     80G    137K     80G     0%  ONLINE     - 


Suppose we create a file in /tank and want to see how things looks like:
# mkfile 100m /tank/foo
# df -h /tank
Filesystem             size   used  avail capacity  Mounted on
tank                   80G   100M    80G     1%    /tank


If you want mirrored storage for mail and home directories, that's easy too:

Create the pool:

# zpool create tank mirror c1t2d0 c2t2d0

Now lets try to create the "/var/mail" file system:

# zfs create tank/mail
# zfs set mountpoint=/var/mail tank/mail

Create home directories, and mount them all in /export/home/<username>:

# zfs create tank/home
# zfs set mountpoint=/export/home tank/home


At this point, we have "/export/home" present.
Now you could even do this:

# zfs create tank/home/ahrens

ZFS file systems are hierarchical: each one inherits properties from above. In this example, the mountpoint property is inherited 
as a pathname prefix. That is, tank/home/ahrens is automatically mounted at /export/home/ahrens because tank/home is mounted at /export/home. 
You don't have to specify the mountpoint for each individual user - you just tell ZFS the pattern.


>>> Commit and Rollback semantics:
----------------------------------

ZFS uses a commit and rollback mechanism, to ensure that all data is written completely, and if not, everything is rolled back.
You probably know that with former filesystems, that you could choose 
- for a filesystem without journaling (logging)
- or indeed use journaling (or logging).

Now you have a third option: using a transactional filesystem, like zfs.

ZFS is a transactional file system, which means that the file system state is always consistent on disk. Traditional file systems (with no logging) 
overwrite data in place, which means that if the machine loses power, for example, between the time a data block is allocated and 
when it is linked into a directory, the file system will be left in an inconsistent state. Historically, this problem was solved through the use 
of the fsck command. This command was responsible for going through and verifying file system state, making an attempt to repair any inconsistencies 
in the process. This problem sometimes caused great pain to administrators and was never guaranteed to fix all possible problems. 

More recently, file systems have introduced the concept of journaling. The journaling process records action in a separate journal, 
which can then be replayed safely if a system crash occurs. This process introduces unnecessary overhead, because the data needs 
to be written twice, and often results in a new set of problems, such as when the journal can't be replayed properly. 

With a transactional file system, data is managed using copy on write semantics. Data is never overwritten, and any sequence of operations 
is either entirely committed or entirely ignored. This mechanism means that the file system can never be corrupted through accidental 
loss of power or a system crash. So, no need for a fsck equivalent exists. While the most recently written pieces of data might be lost, 
the file system itself will always be consistent. In addition, synchronous data (written using the O_DSYNC flag) is always guaranteed 
to be written before returning, so it is never lost.


>>> Unparalleled Scalability:
-----------------------------

ZFS has been designed from the ground up to be a very scalable file system. The file system itself is 128-bit, allowing for 256 quadrillion zettabytes 
of storage. All metadata is allocated dynamically, so no need exists to pre-allocate inodes or otherwise limit the scalability 
of the file system when it is first created. All the algorithms have been written with scalability in mind. 
Directories can have up to 248 (256 trillion) entries, and no limit exists on the number of file systems or number of files 
that can be contained within a file system.


>>> Some more examples:
-----------------------

-- To give user ahrens a 10G quota:

# zfs set quota=10g tank/home/ahrens

-- To give user bonwick a 100G reservation (membership has its privileges):

# zfs set reservation=100g tank/home/bonwick

-- To automatically NFS-export all home directories read/write:

# zfs set sharenfs=rw tank/home

-- To scrub all disks and verify the integrity of all data in the pool:

# zpool scrub tank

-- To replace a flaky disk:

# zpool replace tank c2t2d0 c4t1d0

-- To add more space:

# zpool add tank mirror c5t1d0 c6t1d0

-- To move your pool from SPARC machine 'sparky' to AMD machine 'amdy':

[on sparky]
    # zpool export tank

Physically move your disks from sparky to amdy.

[on amdy]
    # zpool import tank


-- Determining if Problems Exist in a ZFS Storage Pool

The easiest way to determine if any known problems exist on the system is to use the "zpool status x" command. 
This command describes only pools exhibiting problems. If no bad pools exist on the system, 
then the command displays a simple message, as follows:

# zpool status -x

all pools are healthy

Without the x flag, the command displays the complete status for all pools (or the requested pool, if specified on the command line), 
even if the pools are otherwise healthy. 


-- Understanding zpool status Output
The complete zpool status output looks similar to the following:

# zpool status tank
  pool: tank
 state: DEGRADED
status: One or more devices has been taken offline by the administrator.
        Sufficient replicas exist for the pool to continue functioning in a
        degraded state.
action: Online the device using 'zpool online' or replace the device with
        'zpool replace'.
 scrub: none requested
 config:

        NAME         STATE     READ WRITE CKSUM
        tank         DEGRADED     0     0     0
          mirror     DEGRADED     0     0     0
            c1t0d0   ONLINE       0     0     0
            c1t1d0   OFFLINE      0     0     0

errors: No known data errors


29.4 Some examples on VxFS:
===========================

See section 29.2 for a general description about the filesystems you can use on Solaris.

Example 1:
----------

# mkfs -F vxfs /dev/vx/rdsk/testdg/msvol1 200m
version 4 layout
409600 sectors, 204800 blocks of size 1024, log size 1024 blocks
unlimited inodes, largefiles not supported
204800 data blocks, 203656 free data blocks
7 allocation units of 32768 blocks, 32768 data blocks
last allocation unit has 8192 data blocks

Example 2:
----------

We are going to show how to create a mirroring volume and a stripping volume on Veritas Storage Foundation.
on Solaris 10.

The first step is to check quantity of disks you have available on the server. 
A simple way to check this on solaris is using format utility:

bash-3.00# format

Searching for disks.done

AVAILABLE DISK SELECTIONS:

0. c1t0d0 <DEFAULT cyl 4092 alt 2 hd 128 sec 32>
/pci@0,0/pci15ad,1976@10/sd@0,0

1. c1t1d0 <DEFAULT cyl 7 alt 2 hd 64 sec 32>
/pci@0,0/pci15ad,1976@10/sd@1,0

2. c1t2d0 <DEFAULT cyl 7 alt 2 hd 64 sec 32>
/pci@0,0/pci15ad,1976@10/sd@2,0

3. c1t3d0 <DEFAULT cyl 2 alt 2 hd 64 sec 32>
/pci@0,0/pci15ad,1976@10/sd@3,0

Also, you can check disks available to Veritas Storage Foundation using vxdisk command:

bash-3.00# vxdisk -o alldgs list

DEVICE TYPE DISK GROUP STATUS

c1t0d0s2 auto:none - - online invalid
c1t1d0s2 auto:none - - online invalid
c1t2d0s2 auto:none - - online invalid
c1t3d0s2 auto:none - - online invalid

You can see above that there are 4 disks on the server that are available to Veritas but they have not yet 
been initialized by Veritas (invalid status). To use a disk on Veritas SF you need to initialize this 
using Veritas utilities.

NOTE: If you are going to use a disk on Veritas, pay attention that you should give this whole disk to Veritas. 
Disk will be formatted and you will lose all data in the disk when you are allocating a disk to Veritas Storage.

In this example the only disk that is in use for O.S Solaris is the first one. (c1t0d0s2).

We can use those 3 others disks to add on Veritas Storage.

Caution: If for a mistake we add the first disk (c1t0d0s2) to Veritas Storage, it will format 
the disk and erase Solaris info. We need to pay attention to get the right disks.

Let's start allocating (initializing) those 3 disks to solaris:

# vxdisksetup -i c1t1d0
#
# vxdisksetup -i c1t2d0

# vxdisksetup -i c1t3d0

We have those 3 disks initialized on Veritas, then the next step is to create a Disk Group.

>>> Disk Group

Disk Group is a collection of disks. Disk Group is very useful for management and isolation purpose.
Lets create a DG using only the fist disk initialized on Veritas (c1t1d0). 
We are using DG1 for the name of Disk Group.

# vxdg init DG1 c1t1d0

Check if  DG1 was created successfully:

# vxdg list

NAME STATE ID

DG1 enabled,cds 1218633322.13.vrt2

Also, check if the disk is properly assigned to DG1:

# vxdisk -o alldgs list

DEVICE TYPE DISK GROUP STATUS

c1t0d0s2 auto:none - - online invalid
c1t1d0s2 auto:cdsdisk c1t1d0 DG1 online
c1t2d0s2 auto:cdsdisk - - online
c1t3d0s2 auto:cdsdisk - - online

Let's add more 2 disks to DG1:

# vxdg -g DG1 adddisk c1t2d0s2 c1t3d0s2

Check if the disks are properly assigned to DG1:

# vxdisk -o alldgs list

DEVICE TYPE DISK GROUP STATUS

c1t0d0s2 auto:none - - online invalid
c1t1d0s2 auto:cdsdisk c1t1d0 DG1 online
c1t2d0s2 auto:cdsdisk c1t2d0 DG1 online
c1t3d0s2 auto:cdsdisk c1t3d0 DG1 online

At this point we have added 3 disks into Disk Group DG1. 

Next step we will create 2 different volumes in the DG1.

>>> Volumes

A volume is a virtual storage that is used as an physical disk. Volume can be composed by many disks 
and have many layouts.

In this example, we are going to create two Volumes:

Volume VolS - Stripping layout using c1t1d0 and c1t2d0 disks (RAID 0).
Volume VolM - Mirroring layout using c1t2d0 and c1t3d0 (RAID 1).

-- To create a Stripping Volume VolS (Size=10m):

# vxassist -g DG1 make VolS 10m layout=stripe c1t1d0s2 c1t2d0s2

To check if volume VolS was created successfully:

# vxprint -g DG1

TY NAME ASSOC KSTATE LENGTH PLOFFS STATE TUTIL0 PUTIL0

dg DG1 DG1 - - - - - -

dm c1t1d0 c1t1d0s2 - 159488 - - - -
dm c1t2d0s2 c1t2d0s2 - 159488 - - - -
dm c1t3d0s2 c1t3d0s2 - 159488 - - - -


v VolS fsgen ENABLED 20480 - ACTIVE - -
pl VolS-01 VolS ENABLED 20480 - ACTIVE - -
sd c1t1d0-01 VolS-01 ENABLED 10240 0 - - -
sd c1t2d0s2-01 VolS-01 ENABLED 10240 0 - - -


-- To create a Mirroring Volume VolM (Size=10m):

# vxassist -g DG1 make VolM 10m layout=mirror c1t2d0s2 c1t3d0s2

To check if Volume VolM was created successfully:

# vxprint -g DG1

TY NAME ASSOC KSTATE LENGTH PLOFFS STATE TUTIL0 PUTIL0

dg DG1 DG1 - - - - - -

dm c1t1d0 c1t1d0s2 - 159488 - - - -
dm c1t2d0s2 c1t2d0s2 - 159488 - - - -
dm c1t3d0s2 c1t3d0s2 - 159488 - - - -

v VolM fsgen ENABLED 20480 - ACTIVE - -
pl VolM-01 VolM ENABLED 20480 - ACTIVE - -
sd c1t3d0s2-01 VolM-01 ENABLED 20480 0 - - -

pl VolM-02 VolM ENABLED 20480 - ACTIVE - -
sd c1t2d0s2-02 VolM-02 ENABLED 20480 0 - - -

v VolS fsgen ENABLED 20480 - ACTIVE - -
pl VolS-01 VolS ENABLED 20480 - ACTIVE - -
sd c1t1d0-01 VolS-01 ENABLED 10240 0 - - -
sd c1t2d0s2-01 VolS-01 ENABLED 10240 0 - - -

Note: You can see above that both Volumes were created successfully. Also, you can note the difference 
between stripping and mirroring volume layouts. 

VolM is using two different Plex in differente disks. This means that if you lose one disk (Plex) 
you still have the data in the other disk (other Plex). It is the main configuration of Mirroring Volumes.

VolS is using only one Plex divided in 2 disks. This means that the data will be split in those 2 disks. 
If you lose one disk you would lose the whole Plex, therefore you would lose the data. 
This is the main configuration of Stripping Volumes. It does not provide data protection but it is very useful 
for performance for purpose.

Also, you can add those 2 layouts in only one layout that provide data protection and better performance. 
It is the case of RAID 0 + 1 or RAID 1 + 0.

In the next step we will create 2 different Filesystem using those 2 Volumes.

>>> Filesystem

In this example we will create two filesystem:

- Filesystem fsS will use VolS. It will be mounted at /stripe mount point.
- Filesystem fsM will use VolM. It will be mounted at /mirror mount point.

To create a VxFS filesystem:

# mkfs -F vxfs /dev/vx/rdsk/DG1/VolS

version 7 layout

20480 sectors, 10240 blocks of size 1024, log size 1024 blocks
largefiles supported

# mkfs -F vxfs /dev/vx/rdsk/DG1/VolM

version 7 layout

20480 sectors, 10240 blocks of size 1024, log size 1024 blocks

largefiles supported

To mount a VxFS filesystem:

# mount -F vxfs /dev/vx/dsk/DG1/VolS /stripe/
# mount -F vxfs /dev/vx/dsk/DG1/VolM /mirror/

Now there are 2 filesystems configured and you can use it at Solaris Mount Point level.

Any data written in /stripe directory will be written in the stripping VolS volume.
Any data written in /mirror directory will be written in the mirroring VolM volume.


Example 3:
----------

Rather than mess with vxmake  you can employ vxassist to do all the dirty work. If you have any amount of experience with vxassist 
you'll know that the more information you can supply to vxassist the better the end product will be. 

I'm going to use vxassist to build a stripe-pro volume from four disks and I want the volume to be 1G in size:

# vxassist -g testdg make stripeprovol 1g  layout=stripe-mirror \
			testdg01 testdg02 testdg03 testdg04


Pretty kool, huh? Quick, efficient, and poorly named; everything you love about vxassist. I can then go a bit further 
and explore my sizing options to see how much I can grow my new volume if I need to:

# vxassist -g testdg maxgrow stripeprovol

Volume stripeprovol can be extended by 282050560 to 284147712 (138744Mb)

See? Just like a normal volume. Now comes the beauty part. When you look at that seemingly unmanageable mess of objects above 
does it really make you want to tear it apart and work on it like you might other "normal" volumes? Probably not. And you'd be wise 
to feel that way, there are just too many places to get confused or make a mistake when real data is involved. What if you could get back 
to a more normal point of view? Luckily you can, check this out:

# vxassist -g testdg convert stripeprovol layout=mirror-stripe


Veritas terminology:

In a "typical" RAID0+1 volume configuration, we take several disks and then create a stripe across thoughs disks (the RAID0 part). 
Then once complete we do this again on a separate set of disks, and then attach that new stripe to the first creating a mirror (the +1 part). 
We then have a RAID0+1 volume thats ready to have a filesystem put on it. The point of interest with this setup is that we're actually 
mirroring a complete stripe (and therefore ALL the disks in that stripe) to another stripe (and therefore ALL of it's disks). 
The problem here is that if for some reason we need to re-sync the volume we'd need to re-sync a full stripe to a full stripe (very timely) 
which is a nearly tragic proposition if your talking about 50G+. A far more efficient setup would be to mirror each disk to each disk... 
in other words, to mirror a bunch of disks on a one-to-one basis, and then build a stripe on top of these mirrors. In this case if we need 
to re-sync due to a disk failure we can simply sync the failed disk to its mirror, instead of the full stripe. This is the power of RAID1+0; 
the difference between mirroring the stripes (0+1) and stripping the mirrors (1+0).

If the terms seem to confuse you, try this for size:

RAID0	Striping (VxVM says: stripe)
RAID1	Mirroring (VxVM says: mirror)
RAID0+1 Striping plus Mirroring (VxVM says: mirror-stripe)
	Think this: Striped disks, then mirror the stripes
RAID1+0 Mirroring plus Striping (VxVM says: stripe-mirror) 
	(Veritas Marketing Dept says: StripePro
	Think this: Mirrored disks, then stripe on top of the mirrors
Concat+Mirror	Concatenation plus Mirroring (VxVM says: mirror)
		Same as RAID1
Mirror+Concat	Mirroring plus Concatenation (VxVM says: concat-mirror)
		(Veritas Marketing Dept says: ConcatPro)
		Think this: Concatenation on top of mirrored disks.


Veritas Default diskgroup: rootdg

Default rootdg disk group. 
 Block Device Node /dev/vx/dsk/volume_name 
 Raw Device Node /dev/vx/rdsk/volume_name 
Other DiskGroups 
 Block Device Node /dev/vx/dsk/diskgroup_name/volume_name 
 Raw Device Node /dev/vx/rdsk/diskgroup_name/volume_name 
 

Example 4:
----------

Some more examples:

Create Veritas layout on a disk:
	vxdisksetup -i c1t10d0

Create a disk group on a new disk:
	vxdg init <dg name> <media name>=c1t10d0

Add disk to an existing disk group:
	vxdg -g <dg name> adddisk <media name>=c2t0d0
 	replace addisk with rmdisk to remove a disk

Set up a preferred reading plex, this can be useful if we have a sparse plex (plex in RAM):
	vxvol -g <group> rdpol prefer <volname> <plexname>
	instead of prefer we can have round or sdeet

View configuration:
	vxprint -th
List disks:
	vxdisk list
	vxdisk -o alldgs list (shows deported disks)

Adding disks while solaris is running:
	drvconfig	(This probes scsi - Solaris)
	disks		(Creates links in /dev - Solaris)
	prtvtoc		(View the vtoc - Solaris)
	vxdctl enable	(Rescan for disks - Veritas)
	vxdisk list	(Shows the disk in error as they are not initalized jet)
	vxdisksetup  	(init the disks)

To encapsulate use:
 	vxencap -g <discgroup> <devicename>

Export a disk group:
	vxdg deport <dg name>
	vxdg -h <hostame> deport <dgname> to export to another host

Import a disk group:
	vxdg import <dg name>
	vxdg -C to clear hostid of old host (When failing over in DR situation)
	vxdg -fC to clear hostid of old host and forcing diskgroup online

Destroy a disk group:
	vxdg destroy <disk group>

Evacuate data from a disk:
	vxevac -g <dg name> <from disk> <to disks>

Create a volume on a diskgroup:
	vxassist -g <dg name> make <volname> <size> layou=stripe
	ncols=number of colums stripeunit=size

Create a veritas filesystem on this volume:
        mkfs -F vxfs /dev/vx/rdsk/<disk group>/<volume> <size>

Delete a volume	same as creatiuon but replace make with remove

Resize a filesystem:
        vxresize -g <disk group> -F <fstype> <volume> <size>

If Veritas is ever causing you problems, do the following:
	Touch /etc/vx/reconfig.d/state.d/install-db
	edit /etc/system and modify /etc/vfstab 
	to disable VRTS to start up and access the old root
	partitions


vxassist make martin 100m
makes a volume called martin using any disk

vxassist make martin 100m disk10
makes a volume called martin using disk10

vxassist make martin 100m layout=stripe disk07 disk08
creates a 100mb striped volume called martin using disks7 and 8

vxassist mirror martin disk05 disk06
uses disks5 and 6 ro make a mirror on volume called martin

vxassist make martin 50m layout=mirror
makes a 50Mb mirror using any 2 disks

vxassist make martin 50m layout=mirror disk05 disk06
makes a 50mb mirror using disks 5 and 6

vxassist make martin 50m layout=mirror,stripe disk05 disk06 disk07 
disk08
makes a 50Mb stripe using disks5 and 6 mirrored across 7 and 8

vxassist make martin 50m layout=mirror,stripe,log disk05 disk06 disk07 
disk08
makes a 50Mb stripe using disks5 and 6 mirrored across 7 and 8 and uses 
a 
log subdisk

vxassist make martin 100m layout=raid5
makes a 100m raid5 volume

/usr/sbin/vxedit -g rootdg rename disk12 disk09 
to rename disk12 to disk09 in the rootdg

vxedit rm disk10 
to remove a greyed out or obsolete disk in this case disk10
or to remove a disk from a diskgroup

vxdisk list - to list all disks under vmcontrol 

vxdisk clearimport c#t#d#s#
to allow a disk to be imported after a server crash

vxdg -g razadg rmdisk test
to remove a disk called test from a dg called razadg

vxdg -g razadg adddisk test=c1t3d3  
to add disk c1t3d3 to a dg called razadg calling the disk test, use 
vxdisk list
to determine what disks are free :)

vxedit -g rootdg set spare=on disk09
sets disk09 in the rootdg as a hotspare.


vxmirror rootdisk disk01
mirrors all the volumes on the root disk to disk01

vxassist -g rootdg mirror vol01 disk03
mirrors vol01 (in rootdg) to disk03


vxassist mirror martin

will mirror the volume martin


to make a mirror manually try

 /usr/sbin/vxmake -g rootdg sd disk03-01 dm_name=disk03 dm_offset=0 
 len=81920 
 to create a subdisk on disk03 callin the subdisk disk03-01 the len 
 81920 is
 81920sectors x 512bytes =40M 

 vxmake plex martin-02 sd=disk03-01
 creates a plex called martin-02 using subdisk disk03-01

 vxplex att martin martin-02
 attaches the plex martin-02 to volume martin

 to list all volumes on your primary boot disk enter
 vxprint -t -v -e 'aslist.aslist.sd_disk="boot_disk_name"'


 vxsd mv disk03-01 disk05-01
 moves the contents of subdisk disk03-01 to disk05-01
 then moves  subdisk disk05-01 into the plex where subdisk disk03-01
 once lived, leaving disk03-01 to your mercy :)


 to make a subdisk

 vxmake sd disk02-02 disk02,0,8000
 this would create a subdisk called disk02-02 at the start of disk02
 and would be 8000blocks (4000k) long.
 if you wanted to create another subdisk on this disk the offset would 
 be
 8000 as this is where the next free space would be onthe disk so...
 vxmake sd disk02-02 disk02,8000,8000 would create another 8000block
 subdisk.


 vxdisk rm c#t#d#s2
 to remove a disk so it's out of vm control

 vxdiskadd c#t#d#
 to add bring a new disk under vm control

 or you can try...
 vxdisksetup -i c#t#d#  

 vxvol -g dg volname stop
 this stops a volume

 vxedit -rf rm martin
 removes a volume called martin and plex(es) and subdisks though

 vxprint -ht volume


================
30. AIX devices:
================

In AIX 5.x, the device configuration information is stored in the ODM repository. The corresponding files
are in 

/etc/objrepos
/usr/lib/objrepos
/usr/share/lib/objrepos


There are 2 sections in ODM:
- predefined: all of the devices in principle supported by the OS
- customized: all devices already configured in the system

Every device in ODM has a unique definition that is provided by 3 attributes:

1. Type
2. Class
3. Subclass


Information thats stored in the ODM:

- PdDv,PdAt, PdCn   :  Predefined device information
- CuDv, CuAt, CuDep :  Customized device information
- lpp, inventory    :  Software vital product data
- smit menu's
- Error log, alog, and dump information
- System Resource Controller: SRCsubsys, SRCsubsrv
- NIM: nim_attr, nim_object, nim_pdattr


There are commands, representing an interface to ODM, so you can add, retrieve, drop and change objects.
The following commands can be used with ODM:

odmadd, 
odmdrop, 
odmshow, 
odmdelete, 
odmcreate, 
odmchange

Examples:

# odmget -q "type LIKE lv*" PdDv
# odmget -q name=hdisk0 CuAt


Logical devices and physical devices:
-------------------------------------

AIX includes both logical devices and physical devices in the ODM device configuration database.
Logical devices include Volume Groups, Logical Volumes, network interfaces and so on.
Physical devices are adapters, modems etc..


Most devices are selfconfiguring devices, only serial devices (modems, printers) are not selfconfigurable.

The command that configures devices is "cfgmgr", the "configuration manager".
When run, it compares the information from the device with the predefined section in ODM.
If it finds a match, then it creates the entries in the customized section in ODM.

The configuration manager runs every time the system is restarted.

If you have installed an adapter for example, and you have put the software in a directory
like /usr/sys/inst.images, you can call cfgmgr to install device drivers as well with

# cfgmgr -i /usr/sys/inst.images

$$
09-08-00-1,0
u5971-t1-l1-l0


Device information:
-------------------

The most important AIX command to show device info is "lsdev". This command queries the ODM, so we can use
it to locate the customized or the predifined devices.

The main commands in AIX to get device information are:
- lsdev  : queries ODM
- lsattr : gets specific configuration attributes of a device
- lscfg  : gets vendor name, serial number, type, model etc.. of the device

lsdev also shows the status of a device as Available (that is configured) or as Defined (that is predefined).


lsdev examples:
---------------

If you need to see disk or other devices, defined or available, you can use the lsdev command
as in the following examples:

# lsdev -Cc tape
rmt0  Available  10-60-00-5,0  SCSI 8mm Tape Drive

# lsdev -Cc disk
hdisk0 Available 20-60-00-8,0    16 Bit LVD SCSI Disk Drive
hdisk1 Available 20-60-00-9,0    16 Bit LVD SCSI Disk Drive
hdisk2 Available 20-60-00-10,0   16 Bit LVD SCSI Disk Drive
hdisk3 Available 20-60-00-11,0   16 Bit LVD SCSI Disk Drive
hdisk4 Available 20-60-00-13,0   16 Bit LVD SCSI Disk Drive

Note: -C queries the Customized section of ODM, -P queries the Predefined section of ODM.

Example if some of the disks are on a SAN (through FC adapters):

# lsdev -Cc disk
hdisk0 Available          Virtual SCSI Disk Drive
hdisk1 Available          Virtual SCSI Disk Drive
hdisk2 Available 02-08-02 SAN Volume Controller MPIO Device  (through FC adapter)
hdisk3 Available 02-08-02 SAN Volume Controller MPIO Device  (through FC adapter)

# lsattr -El hdisk2
PCM             PCM/friend/sddpcm                                   PCM                                     True
PR_key_value    none                                                Reserve Key                             True
algorithm       load_balance                                        Algorithm                               True
dist_err_pcnt   0                                                   Distributed Error Percentage            True
dist_tw_width   50                                                  Distributed Error Sample Time           True
hcheck_interval 20                                                  Health Check Interval                   True
hcheck_mode     nonactive                                           Health Check Mode                       True
location                                                            Location Label                          True
lun_id          0x0                                                 Logical Unit Number ID                  False
lun_reset_spt   yes                                                 Support SCSI LUN reset                  True
max_transfer    0x40000                                             Maximum TRANSFER Size                   True
node_name       0x50050768010029c8                                  FC Node Name                            False
pvid            00cb5b9e66cc16470000000000000000                    Physical volume identifier              False
q_type          simple                                              Queuing TYPE                            True
qfull_dly       20                                                  delay in seconds for SCSI TASK SET FULL True
queue_depth     20                                                  Queue DEPTH                             True
reserve_policy  no_reserve                                          Reserve Policy                          True
rw_timeout      60                                                  READ/WRITE time out value               True
scbsy_dly       20                                                  delay in seconds for SCSI BUSY          True
scsi_id         0x611013                                            SCSI ID                                 False
start_timeout   180                                                 START unit time out value               True
unique_id       33213600507680190014E30000000000001E204214503IBMfcp Device Unique Identification            False
ww_name         0x50050768014029c8                                  FC World Wide Name                      False


lsdev [ -C ][ -c Class ] [ -s Subclass ] [ -t Type ] [ -f File ] [ -F Format |
-r ColumnName ] [ -h ] [ -H ] [ -l { Name | - } ] [ -p Parent ] [ -S State ]

lsdev -P [ -c Class ] [ -s Subclass ] [ -t Type ] [ -f File ] [ -F Format | -r
ColumnName ] [ -h ] [ -H ]

Remark:

For local attached SCSI devices, the general format of the LOCATION code "AB-CD-EF-GH" is actually "AB-CD-EF-G,H" , 
the first three sections are the same and for the GH section, the G is de SCSI ID and the H is the LUN. 
For adapters, only the AB-CD is mentioned in the location code.

A location code is a representation of the path to the device, from drawer, slot, connector and port.

- For an adapter it is sufficient to have the codes of the drawer and slot to identify
  the adapter. The location code of an adapter takes the form of AB-CD.

- Other devices needs more specification, like a specific disk on a specific SCSI bus.
  For other devices the format is AB-CD-EF-GH. 
  The AB-CD part then indicates the adapter the device is connected on.

- For SCSI devices we have a location code like AB-CD-EF-S,L where the S,L fields identifies
  the SCSI ID and LUN of the device.


To lists all devices in the Predefined object class with column headers, use
# lsdev -P -H

To list the adapters that are in the Available state in the Customized Devices object class, use
# lsdev -C -c adapter -S 


lsattr examples:
----------------

This command gets the current attributes (-E flag) for a tape drive: 

# lsattr -El rmt0
mode           yes     Use DEVICE BUFFERS during writes    True
block_size     1024    Block size (0=variable length)      True
extfm          no      Use EXTENDED file marks             True
ret            no      RETENSION on tape change or reset   True
..
..

(Ofcourse, the equivalent for the above command is for example # lsattr -l rmt0 -E )

To list the default values for that tape device (-D flag), use
# lsattr -l -D rmt0


This command gets the attributes for a network adapter:

# lsattr -E -l ent1
busmem     0x3cfec00     Bus memory address     False
busintr    7             Bus interrupt level    False
..
..

To list only a certain attribute (-a flag), use the command as in the following example:

# lsattr -l -E scsi0 -a bus_intr_lvl 
bus_intr_lvl 14 Bus interrupt level False

# lsattr -El tty0 -a speed
speed 9600 BAUD rate true


You must specify one of the following flags with the lsattr command: 
-D  Displays default values.  
-E  Displays effective values (valid only for customized devices specified with the -l flag).  
-F  Format  Specifies the user-defined format.  
-R  Displays the range of legal values.  
-a  Displays for that attribute


lscfg examples:
---------------

Example 1:

This command gets the Vital Product Data for the tape drive rmt0:

# lscfg -vl rmt0
Manufacturer...............EXABYTE
Machine Type and Model.....IBM-20GB
Device Specific(Z1)........38zA
Serial Number..............60089837
..
..

-l Name Displays device information for the named device.

-p Displays the platform-specific device information. This flag only applies to
   AIX 4.2.1 or later.

-v Displays the VPD found in the Customized VPD object class. Also, on AIX 4.2.1
   or later, displays platform specific VPD when used with the -p flag.

-s Displays the device description on a separate line from the name and
   location.


# lscfg -vp | grep -p 'Platform Firmware:'

# lscfg -vp | grep -p Platform

sample output:

Platform Firmware:
ROM Level.(alterable).......3R040602
Version.....................RS6K
System Info Specific.(YL)...U1.18-P1-H2/Y2
Physical Location: U1.18-P1-H2/Y2
The ROM Level denotes the firmware/microcode level
Platform Firmware:
ROM Level ............. RH020930
Version ................RS6K
.. 


Example 2:

The following command shows details about the Fiber Channel cards:

# lscfg -vl fcs*          (fcs0 for example, is the parent of fsci0)


Adding a device:
----------------

Adding a device with cfmgr:
---------------------------

To add a device you can run cfgmgr, or shutdown the system, attach the new device and boot the system.
There are also many smitty screens to accomplish the task of adding a new device.


Adding a device with mkdev:
---------------------------

Also the mkdev command can be used as in the following example:

# mkdev -c tape -s scsi -t scsd -p scsi0 -w 5,0

where

-c    Class of the device
-s    Subclass of the device
-t    Type of the device. This is a specific attribute for the device 
-p    The parent adapter of the device. You have to specify the logical name.
-w    You have to know the SCSI ID that you are goiing to assign to the new device.
      If it's non SCSI, you have to know the port number on the adapter.
-a    Specifies the device attribute-value pair


The mkdev command also creates the ODM entries for the device and loads the device driver.

The following command configures a new disk and ensures that it is available as a physical volume.
This example adds a 2.2GB disk with a scsi ID of 6 and a LUN of 0 to the scsi3 SCSI bus.

# mkdev -c disk -s scsi -t 2200mb -p scsi3 -w 6,0 -a pv=yes

This example adds a terminal:

# mkdev -c tty -t tty -s rd232 -p sa1 -w 0 -a login=enable -a term=ibm3151
tty0 Available


Changing a device with chdev:
-----------------------------

Suppose you have just added a new disk. Suppose the cfgmgr has run and detected the disk.

Now you run
# lspv
hdisk1    none                 none
OR
hdisk1    0005264d2            none

The first field identifies the system-assigned name of the disk. The second field displays the
"physical volume id" PVID. If that is not shown, you can use chdev:

# chdev -l hdisk2 -a pv=yes


Removing a device with rmdev:
-----------------------------

Examples:

# lsdev -Cc tape
rmt0  Available  10-60-00-5,0  SCSI 8mm Tape Drive

# rmdev -l rmt0               # -l indicates using the logical device name
rmt0 Defined

The status have shifted from Available to Defined.

# lsdev -Cc tape
rmt0  Defined  10-60-00-5,0  SCSI 8mm Tape Drive

If you really want to remove it from the system, use the -d flag as well

# rmdev -l rmt0 -d

To unconfigure the childeren of PCI bus pci1 and all devices under them, while retaining their
device definition in the Customized Devices Object Class. 

# rmdev -p pci1
rmt0 Defined
hdisk1 Defined
scsi1 Defined
ent0 Defined


The special device sys0:
------------------------

In AIX 5.x we have a special device named sys0 that is used to manage some kernel parameters.
The way to change these values is by using smitty, the chdev command or WSM.

Example.

To change the maxusersprocesses parameter, you can for example use the Web-based System Manager.
You can also use the chdev command:

#chdev -l sys0 -a maxuproc=50
sys0 changed

Note: In Solaris, to change kernel parameters, you have to edit /etc/system.

Device drivers:
---------------

Device drivers are located in /usr/lib/drivers directory.


============================
31. filesystem commands AIX:
============================


31.1 The Logical Volume Manager LVM:
====================================

In AIX, it's common to use a Logical Volume Manager LVM to cross the boundaries posed by
traditional disk management.
Traditionally, a filesystem was on a single disk or on a single partition.
Changing a partionion size was a difficult task. With a LVM, we can create logical volumes
which can span several disks.

The LVM has been a feature of the AIX operating system since version 3, and it is installed 
automatically with the Operating System.

LVM commands in AIX:
--------------------

mkvg  (or the mkvg4vp command in case of SAN vpath disks. See section 31.3)
cplv
rmlv
mklvcopy
extendvg
reducevg
getlvcb
lspv
lslv
lsvg
mirrorvg
chpv
migratepv
exportvg, importvg
varyonvg, varyoffvg

And related commands:
mkdev
chdev
rmdev
lsdev

Volume group:
-------------

What a physical disk is, or a physical volume is, is evident. When you add a physical volume to a volume group,
the physical volume is partitioned into contiguous equal-sized units of space called "physical partitions".
A physical partition is the smallest unit of storage space allocation and is a contiguous space
on a physical volume.
The physical volume must now become part of a volume group. The disk must be in a available state
and must have a "physical volume id" assigned to it.

A volume group (VG) is an entity consisting of 1 to 32 physical volumes (of varying sizes and types). 
A "Big volume group" kan scale up to 128 devices.

You create a volume group with the "mkvg" command. You add a physical volume to an existing volume group with
the "extendvg" command, you make use of the changed size of a physical volume with the "chvg" command,
and remove a physical volume from a volume group with the "reducevg" command.
Some of the other commands that you use on volume groups include:
list (lsvg), remove (exportvg), install (importvg), reorganize (reorgvg), synchronize (syncvg),
make available for use (varyonvg), and make unavailable for use (varyoffvg).

To create a VG, using local disks, use the "mkvg" command:

mkvg -y <name_of_volume_group> -s <partition_size> <list_of_hard_disks>

Typical example:

mkvg -y oravg -s 64 hdisk3 hdisk4

mkvg -y appsvg -s 32 hdisk2
mkvg -y datavg -s 64 hdisk3

mkvg -y appsvg -s 32 hdisk3
mkvg -y datavg -s 32 hdisk2
mkvg -y vge1corrap01 -s 64 hdisk2


In case you use the socalled SDD subsystem with vpath SAN storage, you should use the "mkvg4vp" command,
which works similar (same flags) as the mkvg command.


Types of VG's:
==============

There are 3 kinds of VG's:

- Normal VG (AIX 5L)
- Big VG (AIX 5L)
- Scalable VG (as from AIX 5.3)

Normal VG:
----------

Number of disks		Max number of partitions/disk
1			32512
2			16256
4			8128
8			4064
16			2032
32			1016

Big VG:
-------
Number of disks		Max number of partitions/disk
1			130048
2			65024
4			32512
8			16256
16			8128
32			4064
64			2032
128			1016


VG Type		Max PV's	Max LV's	Max PP's per VG
---------------------------------------------------------------
Normal		32		256		32512
Big		128		512		130048
Scalable	1024		4096		2097152


Physical Partition:
===================

You can change the NUMBER of PPs in a VG, but you cannot change the SIZE of PPs afterwards.
Defaults:
- 4 MB partition size. It can be a multiple of that amount. The Max size is 1024 MB
- The default is 1016 PPs per disk. You can increase the number of PPs in powers of 2 per PV, but the number
  of maximum disks per VG is decreased. 

#disks   max # of PPs / disk
32       1016
16       2032
8        4064
4        8128
2       16256
1       32512


In the case of a set of "normal" internal disks of, for example, 30G or 70G or so,
common partition sizes are 64M or 128M.


Logical Partition:
------------------

A LP maps to (at least) one PP, and is actually the smallest unit of allocatable space.


Logical Volume:
---------------

Consists of LPs in a VG. A LV consists of LPs from actual PPs from one or more disks.


   |-----|               | ----|
   |LP1  |      --->     | PP1 | 
   |-----|               | ----|
   |LP2  |      --->     | PP2 |
   |-----|               | ----|
   |..   |                hdisk 1 (Physical Volume 1)
   |..   |
   |..   |
   |-----|               |---- |
   |LPn  |      --->     |PPn  |
   |-----|               |---- |
   |LPn+1|      --->     |PPn+1|
   |-----|               |---- |
   Logical Volume      hdisk2 (Physical Volume 2)


So, a VG is a collection of related PVs, but you know that actually LVs are created in the VG.
For the applications, the LVs are the entities they work with.
In AIX, a filesystem like "/data", corresponds to a LV.


lspv Command
------------

Purpose: Displays information about a physical volume within a volume group.

lspv [ -L ] [ -l | -p | -M ] [ -n DescriptorPhysicalVolume] [ -v VolumeGroupID] PhysicalVolume

-p: lists range, state, region, LV names, type and mount points


# lspv
# lspv hdisk3
# lspv -p hdisk3


# lspv
hdisk0   00453267554   rootvg
hdisk1   00465249766   rootvg

# lspv hdisk23
PHYSICAL VOLUME:    hdisk23                  VOLUME GROUP:     oravg
PV IDENTIFIER:      00ccf45d564cfec0 VG IDENTIFIER     00ccf45d00004c0000000104564d2386
PV STATE:           active
STALE PARTITIONS:   0                        ALLOCATABLE:      yes
PP SIZE:            256 megabyte(s)          LOGICAL VOLUMES:  3
TOTAL PPs:          947 (242432 megabytes)   VG DESCRIPTORS:   1
FREE PPs:           247 (63232 megabytes)    HOT SPARE:        no
USED PPs:           700 (179200 megabytes)
FREE DISTRIBUTION:  00..00..00..57..190
USED DISTRIBUTION:  190..189..189..132..00


# lspv -p hdisk23
hdisk23:
PP RANGE  STATE   REGION        LV NAME             TYPE       MOUNT POINT
  1-22    used    outer edge    u01                 jfs2       /u01
 23-190   used    outer edge    u02                 jfs2       /u02
191-379   used    outer middle  u01                 jfs2       /u01
380-568   used    center        u01                 jfs2       /u01
569-600   used    inner middle  u02                 jfs2       /u02
601-700   used    inner middle  u03                 jfs2       /u03
701-757   free    inner middle
758-947   free    inner edge

# lspv -p hdisk0
hdisk0:
PP RANGE  STATE   REGION        LV NAME             TYPE       MOUNT POINT
1-1       used    outer edge    hd5                 boot       N/A
2-48      free    outer edge       
49-51     used    outer edge    hd9var              jfs        /var
52-52     used    outer edge    hd2                 jfs        /usr
53-108    used    outer edge    hd6                 paging     N/A
109-116   used    outer middle  hd6                 paging     N/A
117-215   used    outer middel  hd2                 jfs        /usr
216-216   used    center        hd8                 jfslog     N/A
217-217   used    center        hd4                 jfs        /
218-222   used    center        hd2                 jfs        /usr
223-320   used    center        hd4                 jfs        /
..
..

Note that in this example the Logical Volumes corresponds to the filesystems in the
following way: 
hd4= /, hd5=boot, hd6=paging, hd2=/usr, hd3=/tmp, hd9var=/var


lslv Command
------------
Purpose: Displays information about a logical volume.


To Display Logical Volume Information
lslv [ -L ] [ -l| -m ] [ -nPhysicalVolume ] LogicalVolume

To Display Logical Volume Allocation Map
lslv [ -L ] [ -nPhysicalVolume ] -pPhysicalVolume [ LogicalVolume ]


# lslv -l lv06
lv06:/backups
PV                COPIES        IN BAND       DISTRIBUTION
hdisk3            512:000:000   100%          000:218:218:076:000


# lslv lv06
LOGICAL VOLUME:     lv06                   VOLUME GROUP:   backupvg
LV IDENTIFIER:      00c8132e00004c0000000106ef70cec2.2 PERMISSION:     read/write
VG STATE:           active/complete        LV STATE:       opened/syncd
TYPE:               jfs                    WRITE VERIFY:   off
MAX LPs:            512                    PP SIZE:        64 megabyte(s)
COPIES:             1                      SCHED POLICY:   parallel
LPs:                512                    PPs:            512
STALE PPs:          0                      BB POLICY:      relocatable
INTER-POLICY:       minimum                RELOCATABLE:    yes
INTRA-POLICY:       middle                 UPPER BOUND:    32
MOUNT POINT:        /backups               LABEL:          /backups
MIRROR WRITE CONSISTENCY: on/ACTIVE
EACH LP COPY ON A SEPARATE PV ?: yes
Serialize IO ?:     NO

# lslv -p hdisk3
FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE       1-10
FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE      11-20
FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE      21-30
FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE      31-40
FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE      41-50
FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE      51-60
FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE      61-70
FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE      71-80
FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE      81-90
..
..


Also, you can list LVs per VG by running, for example:

# lsvg -l backupvg
backupvg:
LV NAME             TYPE       LPs   PPs   PVs  LV STATE      MOUNT POINT
loglv02             jfslog     1     1     1    open/syncd    N/A
lv06                jfs        512   512   1    open/syncd    /backups

# lsvg -l splvg
splvg:
LV NAME             TYPE       LPs   PPs   PVs  LV STATE      MOUNT POINT
loglv01             jfslog     1     1     1    open/syncd    N/A
lv04                jfs        240   240   1    open/syncd    /data
lv00                jfs        384   384   1    open/syncd    /spl
lv07                jfs        256   256   1    open/syncd    /apps

For a complete storage system, this could yield in for example:

-redovg:
LV NAME             TYPE       LPs   PPs   PVs  LV STATE      MOUNT POINT
redo1lv             jfs2       42    42    3    open/syncd    /u05
redo2lv             jfs2       1401  1401  3    open/syncd    /u04
loglv03             jfs2log    1     1     1    open/syncd    N/A
-db2vg:
LV NAME             TYPE       LPs   PPs   PVs  LV STATE      MOUNT POINT
db2lv               jfs2       600   600   2    open/syncd    /db2_database
loglv00             jfs2log    1     1     1    open/syncd    N/A
-oravg:
LV NAME             TYPE       LPs   PPs   PVs  LV STATE      MOUNT POINT
u01                 jfs2       800   800   2    open/syncd    /u01
u02                 jfs2       400   400   2    open/syncd    /u02
u03                 jfs2       200   200   2    open/syncd    /u03
logfs               jfs2log    2     2     1    open/syncd    N/A
-rootvg:
LV NAME             TYPE       LPs   PPs   PVs  LV STATE      MOUNT POINT
hd5                 boot       1     2     2    closed/syncd  N/A
hd6                 paging     36    72    2    open/syncd    N/A
hd8                 jfs2log    1     2     2    open/syncd    N/A
hd4                 jfs2       8     16    3    open/syncd    /
hd2                 jfs2       24    48    2    open/syncd    /usr
hd9var              jfs2       9     18    3    open/syncd    /var
hd3                 jfs2       11    22    3    open/syncd    /tmp
hd1                 jfs2       10    20    2    open/syncd    /home
hd10opt             jfs2       2     4     2    open/syncd    /opt
fslv00              jfs2       1     2     2    open/syncd    /XmRec
fslv01              jfs2       2     4     3    open/syncd    /tmp/m2
paging00            paging     32    32    1    open/syncd    N/A
sysdump1            sysdump    80    80    1    open/syncd    N/A
oralv               jfs2       100   100   1    open/syncd    /opt/app/oracle
fslv03              jfs2       63    63    2    open/syncd    /bmc_home


And you can list the LVs by PV by running
# lspv -l hdiskn


lsvg Command:
-------------

-o          Shows only the active volume groups.
-p VG_name  Shows all the PVs that belong to the vg_name
-l VG_name  Shows all the LVs that belong to the vg_name


Examples:

# lsvg
rootvg
informixvg
oravg

# lsvg -o
rootvg
oravg

# lsvg oravg
VOLUME GROUP:   oravg                    VG IDENTIFIER:  00ccf45d00004c0000000104564d2386
VG STATE:       active                   PP SIZE:        256 megabyte(s)
VG PERMISSION:  read/write               TOTAL PPs:      1894 (484864 megabytes)
MAX LVs:        256                      FREE PPs:       492 (125952 megabytes)
LVs:            4                        USED PPs:       1402 (358912 megabytes)
OPEN LVs:       4                        QUORUM:         2
TOTAL PVs:      2                        VG DESCRIPTORS: 3
STALE PVs:      0                        STALE PPs:      0
ACTIVE PVs:     2                        AUTO ON:        yes
MAX PPs per PV: 1016                     MAX PVs:        32
LTG size:       128 kilobyte(s)          AUTO SYNC:      no
HOT SPARE:      no                       BB POLICY:      relocatable

# lsvg -p informixvg
informixvg
PV_NAME       PV STATE     TOTAL PPs     FREE PPs     FREE DISTRIBUTION
hdisk3        active       542           462          109..28..108..108..109
hdisk4        active       542           447          109..13..108..108..109

# lsvg -l rootvg
LV NAME       TYPE         LPs    PPs    PVs     LV STATE      MOUNT POINT
hd5           boot         1      1      1       closed/syncd  N/A
hd6           paging       24     24     1       open/syncd    N/A
hd8           jfslog       1      1      1       open/syncd    N/A
hd4           jfs          4      4      1       open/synced   /
hd2           jfs          76     76     1       open/synced   /usr
hd9var        jfs          4      4      1       open/synced   /var
hd3           jfs          6      6      1       open/synced   /tmp
paging00      paging       20     20     1       open/synced   N/A
..
..

Suppose we have 70GB disk=70000MB
1016 partitions=> 63 MB per PP


extendvg command:
-----------------

extendvg VGName hdiskNumber

# extendvg newvg hdisk23

How to Add a Disk to a Volume Group? 

extendvg   VolumeGroupName   hdisk0 hdisk1 ... hdiskn 


reducevg command:
-----------------

To remove a PV from a VG:

# reducevg myvg hdisk23

To remove a VG:

Suppose we have a VG informixvg with 2 PV, hdisk3 and hdisk4:

# reducevg -d informixvg hdisk4

When you delete the last disk from the VG, the VG is also removed.

# reducevg -d informix hdisk3


varyonvg and varyoffvg commands:
--------------------------------

When you activate a VG for use, all its resident filesystems are mounted by default if they have
the flag mount=true in the /etc/filesystems file.

# varyonvg apachevg

# varyoffvg apachevg

To use this command, you must be sure that none of the logical volumes are opened, that is, in use.


mkvg command:
-------------

You can create a new VG by using "smitty mkvg" or by using the mkvg command.

Use the following command, where s "partition_size" sets the number of megabytes in each physical partition 
where the partition_size is expressed in units of megabytes from 1 through 1024. The size variable must 
be equal to a power of 2 (for example 1, 2, 4, 8). The default value is 4.

mkvg -y <name_of_volume_group> -s <partition_size> <list_of_hard_disks>

As with physical volumes, volume groups can be created and removed and their characteristics
can be modified.

Before a new volume group can be added to the system, one or more physical volumes not used
in other volume groups, and in an available state, must exist on the system.

The following example shows the use of the mkvg command to create a volume group myvg
using the physical volumes hdisk1 and hdisk5.

# mkvg -y myvg -d 10 -s 8 hdisk1 hdisk5

# mkvg -y oravg -d 10 -s 64 hdisk1


mklv command:
-------------

To create a LV, you can use the smitty command "smitty mklv" or just use the mklv command
by itself.

The mklv command creates a new logical volume within the VolumeGroup. For example, all file systems 
must be on separate logical volumes. The mklv command allocates the number of logical partitions 
to the new logical volume. If you specify one or more physical volumes with the PhysicalVolume parameter, 
only those physical volumes are available for allocating physical partitions; otherwise, all the 
physical volumes within the volume group are available. 

The default settings provide the most commonly used characteristics, but use flags to tailor the logical volume 
to the requirements of your system. Once a logical volume is created, its characteristics can be changed 
with the chlv command. 

When you create a LV, you also specify the number of LP's, and how a LP maps to PP's. 
Later, you can create one filesystem per LV.

Examples

The following example creates a LV "lv05" on the VG "splvg", with two copies (2 PPs) of each LP.
In this case, we are mirroring a LP to two PP's.
Also, 200 PP's are specified. If a PP is 128 MB is size, the total amount of space of one "mirror" is 25600 MB.

# mklv -y lv05 -c 2 splvg 200

The following example shows the use of mklv command to create a new LV newlv in the rootvg
and it will have 10 LP's and each LP consists of 2 physical partitions.

# mklv -y newlv -c 2 rootvg 10

To make a logical volume in volume group vg02 with one logical partition and a total of two copies of the data, enter: 

# mklv -c 2 vg02 1

To make a logical volume in volume group vg03 with nine logical partitions and a total of three copies 
spread across a maximum of two physical volumes, and whose allocation policy is not strict, enter: 

# mklv -c 3 -u 2 -s n vg03 9

To make a logical volume in vg04 with five logical partitions allocated across the center sections of the 
physical volumes when possible, with no bad-block relocation, and whose type is paging, enter: 

# mklv -a c -t paging -b n vg04 5

To make a logical volume in vg03 with 15 logical partitions chosen from physical volumes hdisk5, hdisk6, and hdisk9, 
enter: 

# mklv vg03 15 hdisk5 hdisk6 hdisk9

To make a striped logical volume in vg05 with a stripe size of 64K across 3 physical volumes and 12 
logical partitions, enter: 

# mklv -u 3 -S 64K vg05 12

To make a striped logical volume in vg05 with a stripe size of 8K across hdisk1, hdisk2, and hdisk3 and 
12 logical partitions, enter: 

# mklv -S 8K vg05 12 hdisk1 hdisk2 hdisk3

The following example uses a "map file /tmp/mymap1" which list which PPs are to be used in creating a LV:

# mklv -t jfs -y lv06 -m /tmp/mymap1 rootvg 10


The setting Strict=y means that each copy of the LP is placed on a different PV. The setting Strict=n means
that copies are not restricted to different PVs. 
The default is strict.


# mklv -y lv13 -c 2 failovervg 150
# crfs -v jfs -d lv13 -m /backups2 -a bf=true

Another simple example using local disks:

# mkvg -y appsvg -s 32 hdisk2
# mkvg -y datavg -s 32 hdisk3

# mklv -y testlv -c 1 appsvg 10
# mklv -y backuplv -c 1 datavg 10

# crfs -v jfs -d testlv -m /test -a bf=true
# crfs -v jfs -d backuplv -m /backup -a bf=true

mklv -y testlv1 -c 1 appsvg 10
mklv -y testlv2 -c 1 datavg 10
crfs -v jfs -d testlv1 -m /test1 -a bf=true
crfs -v jfs -d testlv2 -m /test2 -a bf=true


mklv -y testlv1 -c 1 vgp0corddap01 10
mklv -y testlv2 -c 1 vgp0corddad01 10
crfs -v jfs -d testlv1 -m /test1 -a bf=true
crfs -v jfs -d testlv2 -m /test2 -a bf=true

rmlv command:
-------------

# rmlv newlv
Warning, all data on logical volume newlv will be destroyed.
rmlv: Do you wish to continue? y(es) n(o) y
#

extendlv command:
-----------------

The following example shows the use of the extentlv command to add 3 more LP's to the LP newlv:

# extendlv newlv 3

cplv command:
-------------

The following command copies the contents of LV oldlv to a new LV called newlv:
# cplv -v myvg -y newlv oldlv

To copy to an existing LV:
# cplv -e existinglv oldlv

Purpose
Copies the contents of a logical volume to a new logical volume.

Syntax
To Copy to a New Logical Volume

cplv [ -vg VolumeGroup ] [ -lv NewLogicalVolume | -prefix Prefix ] SourceLogicalVolume

To Copy to an Existing Logical Volume

cplv [ -f ] SourceLogicalVolume DestinationLogicalVolume

cplv -e DestinationLogicalVolume [-f] SourceLogicalVolume

-e: specifies that the DestinationLogicalVolume already exists.
-f: no user confirmation
-y: specifies the name to use for the NewLogicalVolume, instead of a system generated name.

Description
Attention: Do not copy from a larger logical volume containing data to a smaller one. Doing so results 
in a corrupted file system because some data is not copied.
The cplv command copies the contents of SourceLogicalVolume to a new or existing logical volume. 
The SourceLogicalVolume parameter can be a logical volume name or a logical volume ID. 
The cplv command creates a new logical volume with a system-generated name by using the default syntax. 
The system-generated name is displayed. 

Note:
The cplv command can not copy logical volumes which are in the open state, 
including logical volumes 
that are being used as backing devices for virtual storage.
Flags
-f Copies to an existing logical volume without requesting user confirmation. 
-lv NewLogicalVolume Specifies the name to use, in place of a system-generated name, 
 for the new logical volume. Logical volume names must be unique systemwide names, and can range 
 from 1 to 15 characters. 
-prefix Prefix Specifies a prefix to use in building a system-generated name for the new logical volume. 
 The prefix must be less than or equal to 13 characters. A name cannot be a name already used by another device. 
-vg VolumeGroup Specifies the volume group where the new logical volume resides. If this is not specified, 
 the new logical volume resides in the same volume group as the SourceLogicalVolume. 

Examples
To copy the contents of logical volume fslv03 to a new logical volume, type: 

# cplv fslv03
The new logical volume is created, placed in the same volume group as fslv03, 
and named by the system. 

To copy the contents of logical volume fslv03 to a new logical volume in volume group vg02, 
type: 
#cplv  -vg vg02 fslv03
The new logical volume is created, named, and added to volume group vg02. 

#To copy the contents of logical volume lv02 to a smaller, existing logical volume, 
lvtest, without requiring user confirmation, type: 
cplv -f lv02 lvtest


Errors:
-------

0516-746 cplv: Destination logical volume must have 
         type set to copy 

chlv -t copy lvprj


==========================================================================
CASES of usage of cplv command:

CASE 1:
-------

TITLE    : Procedure for moving a filesystem between disks that are in
           different volume groups using the cplv command.
OS LEVEL : AIX 4.x
DATE     : 25/11/99
VERSION  : 1.0

----------------------------------------------------------------------------

In the following example, an RS6000 has 1 one disk with rootvg on, and has
just had a second disk installed. The second disk needs a volume group
creating on it and a data filesystem transferring to the new disk. Ensure
that you have a full system backup befor you start.


lspv

hdisk0         00009922faf79f0d    rootvg         
hdisk1         None                None           

df -k

Filesystem    1024-blocks      Free %Used    Iused %Iused Mounted on
/dev/hd4             8192      1228   86%     1647    41% /
/dev/hd2           380928     40984   90%    11014    12% /usr
/dev/hd9var         32768     20952   37%      236     3% /var
/dev/hd3            28672      1644   95%      166     3% /tmp
/dev/hd1            53248     51284    4%       95     1% /home
/dev/lv00          200704    110324   46%     1869     4% /home/john
/dev/ftplv         102400     94528    8%       32     1% /home/ftp
/dev/lv01          114688     58240   50%       59     1% /usr2

In this example the /usr2 filesystem needs to be moved to the new disk 
drive, freeing up space in the root volume group. 


1, Create a data volume group on the new disk (hdisk1), the command below
   will create a volume group called datavg on hdisk1 with a PP size of 
   32 Meg:-

   mkvg -s 32 -y datavg hdisk1

2, Create a jfslog logical volume on the new volume group :-

   mklv -y datalog -t jfslog datavg 1

3, Initialise the jfslog :-

   logform /dev/datalog

   logform: destroy /dev/datalog (y)?y

4, Umount the filesystem that is being copied :-

   umount /usr2

5, Copy the /usr2 logical volume (lv01) to a new logical volume (lv11) on 
   the new volume group :-

   cplv -y lv11 -v datavg lv01

   cplv: Logical volume lv01 successfully copied to lv11 .

6, Change the /usr2 filesystem to use the new (/dev/lv11) logical volume 
   and not the old (/dev/lv01) logical volume :-

   chfs -a dev=/dev/lv11 /usr2

7, Change the /usr2 filesystem to use the jfslog on the new volume group 
   (/dev/datalog) :- 

   chfs -a log=/dev/datalog /usr2

8, Mount the filesystem :-

   mount /usr2

   df -k

   Filesystem    1024-blocks      Free %Used    Iused %Iused Mounted on
   /dev/hd4             8192      1220   86%     1649    41% /
   /dev/hd2           380928     40984   90%    11014    12% /usr
   /dev/hd9var         32768     20952   37%      236     3% /var
   /dev/hd3            28672      1644   95%      166     3% /tmp
   /dev/hd1            53248     51284    4%       95     1% /home
   /dev/lv00          200704    110324   46%     1869     4% /home/john
   /dev/ftplv         102400     94528    8%       32     1% /home/ftp
   /dev/lv11          114688     58240   50%       59     1% /usr2

9, Once the filesystem has been checked out, the old logical volume can
   be removed :-

   rmfs /dev/lv01

   Warning, all data contained on logical volume lv01 will be destroyed.
   rmlv: Do you wish to continue? y(es) n(o)? y
   rmlv: Logical volume lv01 is removed. 


If you wish to copy further filesystems repeat parts 4 to 9.

==========================================================================

CASE 2:
-------

Doel:
-----

Een "move" van het /prj filesystem (met Websphere in /prj/was) op rootvg,
naar een nieuw (groter en beter) volume group "wasvg".
Het huidige /prj op rootvg, correspondeerd met de LV "prjlv".
De nieuw te maken /prj op wasvg, correspondeerd met de LV "lvprj".

  ROOTVG                     WASVG
  --------------            --------------
  |/usr  (hd2) |            |             |
  |..          |            |             |
  |/prj (prjlv)|----------->|/prj (lvprj) | 
  |..          |            |             |
  --------------             -------------
  hdisk0,hdisk1              hdisk12,hdisk13

opm: /prj bevat "/prj/was", en dat is Websphere.

Hier maken we geen gebruik van een backup tape.

Gebruik het cplv command

  umount /prj
  chfs -m /prj_old /prj

 + mkvg -y wasvg -d 10 -s 128 hdisk12 hdisk13   -- maak VG aan

 + mklv -y lvprj -c 2 wasvg 400                 -- maak LV aan

 + mklv -y waslog -t jfslog wasvg 1             -- maak een jfslog

 + logform /dev/waslog                          -- init de log


  cplv -e lvprj prjlv

  chfs -a dev=/dev/lvprj /prj_old                   --
 
  chfs -a log=/dev/waslog /prj_old

  chfs -m /prj /prj_old
 
  mount /prj

==========================================================================


migratepv command:
------------------

Use the following command to move PPs from hdisk1 to hdisk6 and hdisk7 (all PVs must be in 1 VG)
# migratepv hdisk1 hdisk6 hdisk7

Use the following command to move PPs in LV lv02 from hdisk1 to hdisk6 
# migratepv -l lv02 hdisk1 hdisk6


chvg command:
-------------

This example multiplies by 2 the number of PPs:
# chvg -t2 datavg
 

chpv command:
-------------

The chpv command changes the state of the physical volume in a volume group by setting allocation 
permission to either allow or not allow allocation and by setting the availability to either 
available or removed. This command can also be used to clear the boot record for the given physical volume. 
Characteristics for a physical volume remain in effect unless explicitly changed with the corresponding flag.

Examples

To close physical volume hdisk03, enter: 
# chpv -v r hdisk03

The physical volume is closed to logical input and output until the -v a flag is used. 

To open physical volume hdisk03, enter: 
# chpv -v a hdisk03

The physical volume is now open for logical input and output. 

To stop the allocation of physical partitions to physical volume hdisk03, enter: 
# chpv -a n hdisk03

No physical partitions can be allocated until the -a y flag is used. 

To clear the boot record of a physical volume hdisk3, enter: 
# chpv -c hdisk3


How to synchronize stale partitions in a VG?:
---------------------------------------------

the syncvg command:

syncvg Command

Purpose
Synchronizes logical volume copies that are not current.

Syntax
syncvg [ -f ] [ -i ] [ -H ] [ -P NumParallelLps ] { -l | -p | -v } Name ...

Description
The syncvg command synchronizes the physical partitions, which are copies of the original physical partition, 
that are not current. The syncvg command can be used with logical volumes, physical volumes, 
or volume groups, with the Name parameter representing the logical volume name, physical volume name, 
or volume group name. The synchronization process can be time consuming, depending on the 
hardware characteristics and the amount of data.

When the -f flag is used, a good physical copy is chosen and propagated to all other copies 
of the logical partition, whether or not they are stale. Using this flag is necessary 
in cases where the logical volume does not have the mirror write consistency recovery.

Unless disabled, the copies within a volume group are synchronized automatically when the volume group is 
activated by the varyonvg command. 

Note:
For the sycnvg command to be successful, at least one good copy of the logical volume should 
be accessible, and the physical volumes that contains this copy should be in ACTIVE state. 
If the -f option is used, the above condition applies to all mirror copies.
If the -P option is not specified, syncvg will check for the NUM_PARALLEL_LPS environment variable. 
The value of NUM_PARALLEL_LPS will be used to set the number of logical partitions to be synchronized in parallel.

Examples
To synchronize the copies on physical volumes hdisk04 and hdisk05, enter: 
# syncvg  -p hdisk04 hdisk05

To synchronize the copies on volume groups vg04 and vg05, enter: 
# syncvg  -v vg04 vg05


How to Mirror a Logical Volume? :
--------------------------------

mklvcopy LogicalVolumeName Numberofcopies 
syncvg VolumeGroupName 

To add a copy for LV lv01 on disk hdisk7:

# mklvcopy lv01 2 hdisk7


Identifying hotspots: lvmstat command:
--------------------------------------

The lvmstat command display statistics values since the previous lvmstat command.
# lvmstat -v rootvg -e
# lvmstat -v rootvg -C
# lvmstat -v rootvg

Logical Volume       iocnt    KB_read   KB_wrtn   Kbps
hd8                   4        0        0         0.00
paging01              0        0        0         0.00
..
..


31.2 Mirroring a VG:
====================

LVM provide a disk mirroring facility at the LV level. 
Mirroring is the association of 2 or 3 PP's with each LP in a LV.

Use the "mklv", or the "mklvcopy", or the "mirrorvg" command.

The mklv command allows you to select one or two additional copies for each logical volume.

example:

To make a logical volume in volume group vg03 with nine logical partitions and a total of three copies 
spread across a maximum of two physical volumes, and whose allocation policy is not strict, enter: 

mklv -c 3 -u 2 -s n vg03 9

Mirroring can also be added to an existing LV using the mklvcopy command.

The mirrorvg command mirrors all the LV's on a given VG.
Examples:

- To triply mirror a VG, run
# mirrorvg -c 3 myvg

- To get default mirroring of the rootvg, run
# mirrorvg rootvg

- To replace a failed disk in a mirrored VG, run
# unmirrorvg workvg hdisk7
# reducevg workvg hdisk7
# rmdev -l hdisk7 -d

Now replace the failed disk with a new one and name it hdisk7
# extendvg workvg hdisk7
# mirrorvg workvg


mirrorvg command:
-----------------

mirrorvg Command


Purpose
Mirrors all the logical volumes that exist on a given volume group. 
This command only applies to AIX 4.2.1 or later. 


Syntax
mirrorvg [ -S | -s ] [ -Q ] [ -c Copies] [ -m ] VolumeGroup [ PhysicalVolume ... ] 


Description
The mirrorvg command takes all the logical volumes on a given volume group and mirrors 
those logical volumes. This same functionality may also be accomplished manually if you execute 
the mklvcopy command for each individual logical volume in a volume group. As with mklvcopy, 
the target physical drives to be mirrored with data must already be members of the volume group. 
To add disks to a volume group, run the extendvg command. 

By default, mirrorvg attempts to mirror the logical volumes onto any of the disks in a volume group. 
If you wish to control which drives are used for mirroring, you must include the list of disks in the 
input parameters, PhysicalVolume. Mirror strictness is enforced. Additionally, mirrorvg mirrors 
the logical volumes, using the default settings of the logical volume being mirrored. 
If you wish to violate mirror strictness or affect the policy by which the mirror is created, 
you must execute the mirroring of all logical volumes manually with the mklvcopy command. 

When mirrorvg is executed, the default behavior of the command requires that the synchronization 
of the mirrors must complete before the command returns to the user. If you wish to avoid the delay, 
use the -S or -s option. Additionally, the default value of 2 copies is always used. To specify a value 
other than 2, use the -c option. 


Note: To use this command, you must either have root user authority or be a member of the system group. 

Attention: The mirrorvg command may take a significant amount of time before completing because 
of complex error checking, the amount of logical volumes to mirror in a volume group, and the time 
is takes to synchronize the new mirrored logical volumes. 
You can use the Volumes application in Web-based System Manager (wsm) to change volume characteristics. 
You could also use the System Management Interface Tool (SMIT) smit mirrorvg fast path to run this command. 


Flags

-c Copies  Specifies the minimum number of copies that each logical volume must have after 
   the mirrorvg command has finished executing. It may be possible, through the independent use 
   of mklvcopy, that some logical volumes may have more than the minimum number specified after 
   the mirrorvg command has executed. Minimum value is 2 and 3 is the maximum value. 
   A value of 1 is ignored.  
-m exact map  Allows mirroring of logical volumes in the exact physical partition order that 
   the original copy is ordered. This option requires you to specify a PhysicalVolume(s) where the exact map 
   copy should be placed. If the space is insufficient for an exact mapping, then the command will fail. 
   You should add new drives or pick a different set of drives that will satisfy an exact 
   logical volume mapping of the entire volume group. The designated disks must be equal to or exceed 
   the size of the drives which are to be exactly mirrored, regardless of if the entire disk is used. 
   Also, if any logical volume to be mirrored is already mirrored, this command will fail.  
-Q Quorum Keep  By default in mirrorvg, when a volume group's contents becomes mirrored, volume group 
   quorum is disabled. If the user wishes to keep the volume group quorum requirement after mirroring 
   is complete, this option should be used in the command. For later quorum changes, refer to the chvg command.  
-S Background Sync  Returns the mirrorvg command immediately and starts a background syncvg of the volume group. 
   With this option, it is not obvious when the mirrors have completely finished their synchronization. 
   However, as portions of the mirrors become synchronized, they are immediately used by the operating system 
   in mirror usage.  
-s Disable Sync  Returns the mirrorvg command immediately without performing any type of 
   mirror synchronization. If this option is used, the mirror may exist for a logical volume but 
   is not used by the operating system until it has been synchronized with the syncvg command.  


The following is a description of rootvg: 

- rootvg mirroring  When the rootvg mirroring has completed, you must perform three additional tasks: 
bosboot, bootlist, and reboot. 
The bosboot command is required to customize the bootrec of the newly mirrored drive. 
The bootlist command needs to be performed to instruct the system which disk and order you prefer 
the mirrored boot process to start. 

Finally, the default of this command is for Quorum to be turned off. For this to take effect 
on a rootvg volume group, the system must be rebooted. 
 
- non-rootvg mirroring  When this volume group has been mirrored, the default command causes Quorum 
to deactivated. The user must close all open logical volumes, execute varyoffvg and then varyonvg on 
the volume group for the system to understand that quorum is or is not needed for the volume group. 
If you do not revaryon the volume group, mirror will still work correctly. However, any quorum changes 
will not have taken effect.  
rootvg and non-rootvg mirroring  The system dump devices, primary and secondary, should not be mirrored. 
In some systems, the paging device and the dump device are the same device. However, most users want 
the paging device mirrored. When mirrorvg detects that a dump device and the paging device are the same, 
the logical volume will be mirrored automatically. 
If mirrorvg detects that the dump and paging device are different logical volumes, the paging device 
is automatically mirrored, but the dump logical volume is not. The dump device can be queried and modified 
with the sysdumpdev command. 

 
Remark:
-------
Run bosboot to initialize all boot records and devices by executing the 
following command:
bosboot -a -d /dev/hdisk?
hdisk? is the first hdisk listed under the PV heading after the command 
lslv -l hd5 has executed.

Secondary, you need to understant that the mirroring under AIX it's at 
the logical volume level. The mirrorvg command is a hight level command 
that use "mklvcopy" command.
So, all LV created before runing the mirrorvg command are keep 
synchronised, but if you add a new LV after runing mirrorvg, you need to 
mirror it manualy using "mklvcopy" .

Remark:
-------

lresynclv


Mirroring the rootvg:
---------------------

Method 1:
---------

Howto mirror an AIX rootvg
The following steps will guide you trough the mirroring of an AIX rootvg.
This info is valid for AIX 4.3.3, AIX 5.1, AIX 5.2 and AIX 5.3.

Make sure you have an empty disk, in this example its hdisk1 
Add the disk to the vg via 

# extendvg rootvg hdisk1 

Mirror the vg via: 

# mirrorvg -s rootvg

Now synchronize the new copies you created:

# syncvg -v rootvg

As we want to be able to boot from different disks, we need to use bosboot:

# bosboot -a

As hd5 is mirrored there is no need to do it for each disk.

Now, update the bootlist:

# bootlist -m normal hdisk1 hdisk0
# bootlist -m service hdisk1 hdisk0


When mirrorvg is executed, the default behavior of the command requires that the synchronization of the mirrors 
must complete before the command returns to the user. If you wish to avoid the delay, use the -S or -s option. 
Additionally, the default value of 2 copies is always used. To specify a value other than 2, use the -c option.


Method 2:
---------

-------------------------------------------------------------------------------
# Add the new disk, say its hdisk5, to rootvg

extendvg rootvg hdisk5

# If you use one mirror disk, be sure that a quorum is not required for varyon:

chvg -Qn rootvg

# Add the mirrors for all rootvg LV's:

mklvcopy hd1 2 hdisk5
mklvcopy hd2 2 hdisk5
mklvcopy hd3 2 hdisk5
mklvcopy hd4 2 hdisk5
mklvcopy hd5 2 hdisk5
mklvcopy hd6 2 hdisk5
mklvcopy hd8 2 hdisk5
mklvcopy hd9var 2 hdisk5
mklvcopy hd10opt 2 hdisk5
mklvcopy prjlv 2 hdisk5

#If you have other LV's in your rootvg, be sure to create copies for them as well !!
------------------------------------------------------------------------------

# lspv -l hdisk0
hd5                   1     1     01..00..00..00..00    N/A
prjlv                 256   256   108..44..38..50..16   /prj
hd6                   59    59    00..59..00..00..00    N/A
fwdump                5     5     00..05..00..00..00    /var/adm/ras/platform
hd8                   1     1     00..00..01..00..00    N/A
hd4                   26    26    00..00..02..24..00    /
hd2                   45    45    00..00..37..08..00    /usr
hd9var                10    10    00..00..02..08..00    /var
hd3                   22    22    00..00..04..10..08    /tmp
hd1                   8     8     00..00..08..00..00    /home
hd10opt               24    24    00..00..16..08..00    /opt


Method 3:
---------

In the following example, an RS6000 has 3 disks, 2 of which have the AIX
filesystems mirrored on. The boolist contains both hdisk0 and hdisk1. 
There are no other logical volumes in rootvg other than the AIX system 
logical volumes. hdisk0 has failed and need replacing, both hdisk0 and hdisk1
are in "Hot Swap" carriers and therefore the machine does not need shutting 
down. 

lspv

hdisk0         00522d5f22e3b29d    rootvg
hdisk1         00522d5f90e66fd2    rootvg 
hdisk2         00522df586d454c3    datavg                                     

lsvg -l rootvg

rootvg:
LV NAME             TYPE       LPs   PPs   PVs  LV STATE      MOUNT POINT
hd6                 paging     4     8     2    open/syncd    N/A
hd5                 boot       1     2     2    closed/syncd  N/A
hd8                 jfslog     1     2     2    open/syncd    N/A
hd4                 jfs        1     2     2    open/syncd    /
hd2                 jfs        12    24    2    open/syncd    /usr
hd9var              jfs        1     2     2    open/syncd    /var
hd3                 jfs        2     4     2    open/syncd    /tmp
hd1                 jfs        1     2     2    open/syncd    /home


1, Reduce the logical volume copies from both disks to hdisk1 only :-

   rmlvcopy hd6 1 hdisk0
   rmlvcopy hd5 1 hdisk0
   rmlvcopy hd8 1 hdisk0
   rmlvcopy hd4 1 hdisk0
   rmlvcopy hd2 1 hdisk0
   rmlvcopy hd9var 1 hdisk0
   rmlvcopy hd3 1 hdisk0
   rmlvcopy hd1 1 hdisk0
   
2, Check that no logical volumes are left on hdisk0 :-

   lspv -p hdisk0

   hdisk0:
   PP RANGE  STATE   REGION        LV ID          TYPE       MOUNT POINT
     1-101   free    outer edge
   102-201   free    outer middle
   202-301   free    center
   302-401   free    inner middle
   402-501   free    inner edge     

3, Remove the volume group from hdisk0

   reducevg -df rootvg hdisk0

4, Recreate the boot logical volume on hdisk1, and reset bootlist:-

   bosboot -a -d /dev/hdisk1
   bootlist -m normal rmt0 cd0 hdisk1

5, Check that everything has been removed from hdisk0 :-

   lspv

   hdisk0         00522d5f22e3b29d    None
   hdisk1         00522d5f90e66fd2    rootvg
   hdisk2         00522df586d454c3    datavg          

6, Delete hdisk0 :-

   rmdev -l hdisk0 -d

7, Remove the failed hard drive and replace with a new hard drive.

8, Configure the new disk drive :-

   cfgmgr

9, Check new hard drive is present :-

   lspv

10, Include the new hdisk in root volume group :-

    extendvg rootvg hdisk?  (where hdisk? is the new hard disk)

11, Re-create the mirror :-

    mirrorvg rootvg hdisk?  (where hdisk? is the new hard disk)

12, Syncronise the mirror :-

    syncvg -v rootvg

13, Reset the bootlist :-

    bootlist -m normal rmt0 cd0 hdisk0 hdisk1

14, Turn off Quorum checking on rootvg :-

    chvg -Q n rootvg


Method 4:
---------

Howto mirror an AIX rootvg
The following steps will guide you trough the mirroring of an AIX rootvg.
This info is valid for AIX 4.3.3, AIX 5.1, AIX 5.2 and AIX 5.3.

Make sure you have an empty disk, in this example its hdisk1 
Add the disk to the vg via "extendvg rootvg hdisk1 
Mirror the vg via: "mirrorvg rootvg" 
Adapt the bootlist to add the current disk, the system will then fail to hdisk1 is hdisk0 fails during startup 
do bootlist -o -m normal 
this will list currently 1 disk, in this exmaple hdisk0 
do bootlist -m normal hdisk0 hdisk1 
Run a bosboot on both new disks, this will install all software needed for boot on the disk 
bosboot -ad hdisk0 
bosboot -ad hdisk1 


Method 5:
---------

Although the steps to mirror volume groups between HP and AIX are incredibly similar, 
there are enough differences to send me through hoops if/when I ever have to do that. 
Therefore, the following checklist: 

1. Mirror the logical volumes: 
If you don't care what disks the lvs get mirrored to, execute

mirrorvg rootvg


Otherwise: 

for lv in $(lsvg -l rootvg | grep -i open/syncd | \
	grep -v dumplv | awk '{print $1}')
do
	mklvcopy ${lv} 1 ${disk}
done

2. Change the quorum checking if you did not use mirrorvg:

chvg -Q n rootvg


3. Run bosboot on the new drive to copy boot files to it:

bosboot ${disk}


4. Update the bootlist with the new drive:

bootlist -m normal hdisk0 hdisk1


5. Reboot the system to enable the new quorum checking parameter 


Method 6:
---------

Audience: System Administrators 
Date: September 25, 2002 


Mirroring "rootvg" protects the operating system from a disk failure. Mirroring "rootvg" 
requires a couple extra steps compared to other volume groups. The mirrored rootvg disk must be bootable 
*and* in the bootlist. Otherwise, if the primary disk fails, you'll continue to run, 
but you won't be able to reboot. 

In brief, the procedure to mirror rootvg on hdisk0 to hdisk1 is 

1. Add hdisk1 to rootvg:
extendvg rootvg hdisk1 

2. Mirror rootvg to hdisk1:
mirrorvg rootvg hdisk1 (or smitty mirrorvg) 

3. Create boot images on hdisk1:
bosboot -ad /dev/hdisk1 

4. Add hdisk1 to the bootlist:
bootlist -m normal hdisk0 hdisk1 

5. Reboot to disable quorum checking on rootvg. The mirrorvg turns off quorum by default, 
but the system needs to be rebooted for it to take effect. 

For more information, and a comprehensive procedure see the man page for mirrorvg and 


Example using mklvcopy:
-----------------------

mklvcopy [ -a Position ] [ -e Range ] [ -k ] [ -m MapFile ] [ -s Strict ] [ -u UpperBound ] LogicalVolume 
         Copies [ PhysicalVolume... ] 


Add a copy of LV "lv01" on disk hdisk7:

# mklvcopy lv01 2 hdisk7

The mklvcopy command increases the number of copies in each logical partition in LogicalVolume. 
This is accomplished by increasing the total number of physical partitions for each logical partition 
to the number represented by Copies. The LogicalVolume parameter can be a logical volume name or 
logical volume ID. You can request that the physical partitions for the new copies be allocated 
on specific physical volumes (within the volume group) with the PhysicalVolume parameter; 
otherwise, all the physical volumes within the volume group are available for allocation.

The logical volume modified with this command uses the Copies parameter as its new copy characteristic. 
The data in the new copies are not synchronized until one of the following occurs: 
the -k option is used, the volume group is activated by the varyonvg command, or the volume group 
or logical volume is synchronized explicitly by the syncvg command. Individual logical partitions 
are always updated as they are written to.

The default allocation policy is to use minimum numbering of physical volumes per logical volume copy, 
to place the physical partitions belong to a copy as contiguously as possible, and then to place 
the physical partitions in the desired region specified by the -a flag. Also, by default, each copy 
of a logical partition is placed on a separate physical volume.


Using smitty:
-------------

# smit mklv 

or 

# smit mklvcopy

Using "smit mklv" you can create a new LV and at the same time tell the system to create a mirror
(2 or 3 copies) of each LP and which PV's are involved.

Using "smit mklvcopy" you can add mirrors to an existing LV.


31.3 Filesystems in AIX:
========================

After a VG is created, you can create filesystems. You can use smitty or the crfs and mkfs command.
File systems are confined to a single logical volume.

The journaled file system (JFS) and the enhanced journaled file system (JFS2) are built into the 
base operating system. Both file system types link their file and directory data to the structure 
used by the AIX Logical Volume Manager for storage and retrieval. A difference is that JFS2 is designed to accommodate 
a 64-bit kernel and larger files.

Run lsfs -v jfs2 to determine if your system uses JFS2 file systems. 
This command returns no output if it finds only standard file systems. 


crfs:
-----

crfs -v VfsType { -g VolumeGroup | -d Device } [ -l LogPartitions ]
     -m MountPoint [ -n NodeName ] [ -u MountGroup ] [ -A { yes | no } ] [ -p {ro | rw } ] 
     [ -a Attribute= Value ... ] [ -t { yes | no } ]


The crfs command creates a file system on a logical volume within a previously created volume group. 
A new logical volume is created for the file system unless the name of an existing logical volume is 
specified using the -d. An entry for the file system is put into the /etc/filesystems file.

crfs -v jfs -g(vg) -m(mount point) -a size=(size of fs) -A yes 
Will create a logical volume on the volume group and create the file system on 
the logical volume. All at the size stated. Will add entry into 
/etc/filesystems and will create the mount point directory if it does not exist. 

- To make a JFS on the rootvg volume group with nondefault fragment size and nondefault nbpi, enter:
# crfs  -v jfs  -g  rootvg  -m /test -a size=32768 -a frag=512 -a nbpi=1024

This command creates the /test file system on the rootvg volume group with a fragment size of 512 bytes, 
a number of bytes per i-node (nbpi) ratio of 1024, and an initial size of 16MB (512 * 32768).

- To make a JFS on the rootvg volume group with nondefault fragment size and nondefault nbpi, enter: 
# crfs -v jfs -g rootvg -m /test -a size=16M -a frag=512 -a nbpi=1024

This command creates the /test file system on the rootvg volume group with a fragment size of 512 bytes, 
a number of bytes per i-node (nbpi) ratio of 1024, and an initial size of 16MB. 

- To create a JFS2 file system which can support NFS4 ACLs, type: 
# crfs -v jfs2 -g rootvg -m /test -a size=1G -a ea=v2

- This command creates the /test JFS2 file system on the rootvg volume group with an initial size of 1 gigabyte. 
The file system will store extended attributes using the v2 format.
# crfs -v jfs -g backupvg -m /backups -a size=32G -a bf=true

# crfs -v jfs -g oravg -m /filetransfer -a size=4G -a bf=true


Extended example:
-----------------

The following command creates a JFS filesystem on a previously created LV "lv05".
In this example, suppose the LV was created in the following way:

# mklv -y lv05 -c 2 splvg 200

In this case, it is clear that we mirror each LP to 2 PP's (because of the -c 2).

Now to create a filesystem on lv05, we can use the command
# crfs -v jfs -d lv05 -m /spl -a bf=true

Note that we did not mentioned the size of the filesystem. This is because we use a previously defined LV
with a known size. 
 

Notes:

1. The option -a bf=true allows large files [ > 2Gb]; 

2. Specifying -m /<name> (like for example "/data") will create the entry in /etc/filesystems for you


Some more examples:
-------------------

Commands to create VG's:
mkvg oravg -d 10 -s 128 hdisk2 hdisk4
mkvg splvg -d 10 -s 128 hdisk3 hdisk5
mkvg softwvg -d 10 -s 128 hdisk6
mkvg backupvg -d 10 -s 128 hdisk7

Set of Create Logical Volume and Filesystem commands:	

# crfs -v jfs -g <Vgname> -m <Mountpoint> -a size=xG -a bf=true
or
# mklv -y <LV_name> -c 2 <VG_name> No_Of_PPs
# crfs -v jfs -d <LV_name> -m <MountPoint> -a bf=true

		
# mklv -y lv05 -c 2 splvg 300			
# crfs -v jfs -d lv05 -m /spl -a bf=true			
# mklv -y lv06 -c 2 splvg 100			
# crfs -v jfs -d lv06 -m /u04 -a bf=true			
			
# mklv -y lv02 -c 2 oravg 200			
# mklv -y lv03 -c 2 oravg 200			
# mklv -y lv04 -c 2 oravg 200			
# crfs -v jfs -d lv02 -m /u01 -a bf=true			
# crfs -v jfs -d lv03 -m /u02 -a bf=true			
# crfs -v jfs -d lv04 -m /u03 -a bf=true			
			
# crfs -v jfs -g backupvg -m /backups -a size=33G -a bf=true			
# crfs -v jfs -g backupvg -m /data -a size=33G -a bf=true			
# crfs -v jfs -g softwvg -m /apps -a size=16G -a bf=true			
# crfs -v jfs -g softwvg -m /software -a size=33G -a bf=true			
# crfs -v jfs -g softwvg -m /u05 -a size=12G -a bf=true			


mkfs:
-----

The mkfs command makes a new file system on a specified device. The mkfs command initializes the volume label, 
file system label, and startup block.

The Device parameter specifies a block device name, raw device name, or file system name. If the parameter 
specifies a file system name, the mkfs command uses this name to obtain the following parameters from the 
applicable stanza in the /etc/filesystems file, unless these parameters are entered with the mkfs command.

- To specify the volume and file system name for a new file system, type: 
# mkfs  -lworks  -vvol001 /dev/hd3

This command creates an empty file system on the /dev/hd3 device, giving it the volume serial number vol001 
and file system name works. The new file system occupies the entire device. 
The file system has a default fragment size (4096 bytes) and a default nbpi ratio (4096). 

- To create a file system with nondefault attributes, type: 
# mkfs  -s 8192  -o nbpi=2048,frag=512 /dev/lv01

This command creates an empty 4 MB file system on the /dev/lv01 device with 512-byte fragments and 
1 i-node for each 2048 bytes. 

-To create a large file enabled file system, type: 
# mkfs -V jfs -o nbpi=131072,bf=true,ag=64 /dev/lv01

This creates a large file enabled JFS file system with an allocation group size of 64 megabytes and 1 inode 
for every 131072 bytes of disk. The size of the file system will be the size of the logical volume lv01.

- To create a file system with nondefault attributes, type: 
# mkfs -s 4M -o nbpi=2048, frag=512 /dev/lv01

This command creates an empty 4 MB file system on the /dev/lv01 device with 512-byte fragments and one i-node 
for each 2048 bytes. 

- To create a JFS2 file system which can support NFS4 ACLs, type: 
# mkfs -V jfs2 -o ea=v2 /dev/lv01

This command creates an empty file system on the /dev/lv01 device with v2 format for extended attributes.


chfs command:
-------------

- Example 1:

How do I change the size of a filesystem? 

To increase /usr filesystem size by 1000000 512-byte blocks, type:
# chfs -a size=+1000000 /usr
- Example 2:

To split off a copy of a mirrored file system and mount it read-only for use as an online backup, enter: 
# chfs -a splitcopy=/backup -a copy=2 /testfs
This mount a read-only copy of /testfs at /backup.

- Example 3:

To change the mount point of a file system, enter: 
# chfs  -m /test2 /test
This command changes the mount point of a file system from /test to /test2. 

- Eaxample 4:

# chfs -a size=+20G /data/udb/eidwha2/eddwha2/DATA03

- Example 5:

chfs -a size=+5M /opt


 would do it this way:

1) chfs -m old_filename new_filename

2) umount old_filename

3) mount new_filename

To stop or kill access to a fs, use:
fuser -xuc /scratch


lsfs command:
-------------

Displays the characteristics of file systems.

Syntax
lsfs [ -q ] [ -c | -l ] [ -a | -v VfsType | -u MountGroup| [FileSystem...] ]

Description
The lsfs command displays characteristics of file systems, such as mount points, automatic mounts, permissions, 
and file system size. The FileSystem parameter reports on a specific file system. 
The following subsets can be queried for a listing of characteristics:

All file systems 
All file systems of a certain mount group 
All file systems of a certain virtual file system type 
One or more individual file systems

The lsfs command displays additional Journaled File System (JFS) or Enhanced Journaled File System (JFS2) 
characteristics if the -q flag is specified.

To show all file systems in the /etc/filesystems file, enter: 
#lsfs

To show all file systems of vfs type jfs, enter: 
#lsfs  -v jfs

To show the file system size, the fragment size, the compression algorithm (if any), and the 
number of bytes per i-node as recorded in the superblock of the root file system, enter: 
#lsfs  -q /


31.4 SAN connection via SDD, and related commands:
==================================================

If you use advanced storage on AIX, the workings on disks and volume groups are a bit different
from the traditional ways, using local disks, as described above. 

You can use SDD or SDDPCM Multipath IO. This section describes SDD. See section 31.5 for SDDPCM.


Overview of the Subsystem device driver:
----------------------------------------

The IBM System Storage Multipath Device Driver SDD provides multipath configuration environment support
for a host system that is attached to storage devices. It provides:

-Enhanced data availability 
-Automatic path failover and recovery to an alternate path 
-Dynamic load balancing of multiple paths 
-Concurrent microcode upgrade.

The IBM System Storage Multipath Subsystem Device Driver Path Control Module SDDPCM provides
AIX MPIO support. Its a loadable module. During the configuration of supported devices, SDDPCM is loaded
and becomes part of the AIX MPIO Fibre Channel protocol device driver. The AIX MPIO-capable device driver
with the SDDPCM module provides the same functions that SDD provides.

Note that before attempting to exploit the Virtual shared disk support for the Subsystem device driver, 
you must read IBM Subsystem Device Driver Installation and User's Guide.

An SDD implementation is available for AIX, Solaris, HP-UX, some Linux distro's, Windows 200x.

An impression about the architecture on AIX can be seen in the following figure:


               -------------------------------
               | Host System                 |
               | -------             ------- |
               | |FC 0 |             | FC 1| |
               | -------             ------- |
               -------------------------------
                    |                   |
                    |                   |
              ----------------------------------
          ESS |  --------         --------    |
              |  |port 0|         |port 1|    |
              |  -------- \      /--------    |
              |      |      \   /      |      | 
              |      |        \/       |      |
              |      |        / \      |      |
              |   -----------/    \---------- |
              |   |Cluster 1|      |Cluster 2||
              |   -----------      -----------|
              |    |  |  |  |       | | |  |  |
              |    |  |  |  |       | | |  |  |
              |    O--|--|--|-------| | |  |  |           
              |   lun0|  |  |         | |  |  |
              |       O--|--|---------| |  |  |
              |      lun1|  |           |  |  |
              |          O--|-----------|  |  |
              |         lun2|              |  |
              |             O--------------|  |
              |            lun3               |
              ---------------------------------


DPO (Data Path Optimizer) was renamed by IBM a couple years ago- and became SDD (Subsystem Device Driver). 
When redundant paths are configured to ESS logical units, and the SDD is installed and configured, 
the AIX(R) lspv command shows multiple hdisks as well as a new construct called a vpath. The hdisks and vpaths 
represent the same logical unit. You will need to use the lsvpcfg command to get more information. 

Each SDD vpath device represents a unique physical device on the storage server.
Each physical device is presented to the operating system as an operating system disk device.
So, essentially, a vpath device acts like a disk.

You will see later on that a hdisk is actually a "path" to a LUN, that can be reached either by fscsi0 or fscsi1.
Also you will see that a vpath represents the LUN.

SDD does not support multipathing to a bootdevice.

Support for VIO:
----------------

Starting from SDD version 1.6.2.0, a unique ID attribute is added to SDD vpath devices, in order to 
support AIX5.3 VIO future features. AIX device configure methods have been changed in both AIX52 TL8 and 
AIX53 TL4 for this support.


Examples:
---------

For example, after issuing lspv, you see output similar to this:

# lspv
hdisk0          000047690001d59d      rootvg
hdisk1          000047694d8ce8b6      None
hdisk18         000047694caaba22      None
hdisk19         000047694caadf9a      None
hdisk20         none                  None
hdisk21         none                  None
hdisk22         000047694cab2963      None
hdisk23         none                  None
hdisk24         none                  None
vpath0          none                  None
vpath1          none                  None
vpath2          000047694cab0b35      gpfs1scsivg
vpath3          000047694cab1d27      gpfs1scsivg


After issuing lsvpcfg, you see output similar to this:

# lsvpcfg
vpath0 (Avail ) 502FCA01 = hdisk18 (Avail pv )
vpath1 (Avail ) 503FCA01 = hdisk19 (Avail pv )
vpath2 (Avail pv gpfs1scsivg) 407FCA01 = hdisk20 (Avail ) hdisk24 (Avail )


The examples above illustrate some important points:

- vpath0 consists of a single path (hdisk18) and therefore will not provide failover protection. 
Also, hdisk18 is defined to AIX as a physical volume (pv flag) and has a PVID, as you can see from the output 
of the lspv command. Likewise for vpath1.

- vpath2 has two paths (hdisk20 and hdisk24) and has a volume group defined on it. Notice that with the 
lspv command, hdisk20 and hdisk24 look like newly installed disks with no PVIDs. The lsvpcfg command had 
to be used to determine that hdisk20 and hdisk24 make up vpath2, which has a PVID.

Warning: so be very carefull not to use a hdisk for a "local" VG, if its already used for a vpath.


Other Example:
--------------

# lspv
 hdisk0          00c49e8c8053fe86                    rootvg          active
 hdisk1          00c49e8c841a74d5                    rootvg          active
-hdisk2          none                                None
-hdisk3          none                                None
 vpath0          00c49e8c94c02c15                    datavg          active
 vpath1          00c49e8c94c050d4                    appsvg          active
-hdisk4          none                                None
 vpath2          00c49e8c2806dc22                    appsvg          active
-hdisk5          none                                None
-hdisk6          none                                None
-hdisk7          none                                None


# lsvpcfg

vpath0 (Avail pv datavg) 75BAFX1006C = hdisk2 (Avail ) hdisk5 (Avail )
vpath1 (Avail pv appsvg) 75BAFX1017B = hdisk3 (Avail ) hdisk6 (Avail )
vpath2 (Avail pv appsvg) 75BAFX10329 = hdisk4 (Avail ) hdisk7 (Avail )


# datapath query adapter

Active Adapters :2

Adpt#     Name   State     Mode             Select     Errors  Paths  Active
    0   fscsi0  NORMAL   ACTIVE           12611291          0      3       3
    1   fscsi1  NORMAL   ACTIVE           13375287          0      3       3


# datapath query device

Total Devices : 3


DEV#:   0  DEVICE NAME: vpath0  TYPE: 2107900         POLICY:    Optimized  # this is vpath0
SERIAL: 75BAFX1006C
==========================================================================
Path#      Adapter/Hard Disk          State     Mode     Select     Errors
    0          fscsi0/hdisk2           OPEN   NORMAL   12561763          0
    1          fscsi1/hdisk5           OPEN   NORMAL   13324883          0

DEV#:   1  DEVICE NAME: vpath1  TYPE: 2107900         POLICY:    Optimized
SERIAL: 75BAFX1017B
==========================================================================
Path#      Adapter/Hard Disk          State     Mode     Select     Errors
    0          fscsi0/hdisk3           OPEN   NORMAL      28024          0
    1          fscsi1/hdisk6           OPEN   NORMAL      28847          0

DEV#:   2  DEVICE NAME: vpath2  TYPE: 2107900         POLICY:    Optimized
SERIAL: 75BAFX10329
==========================================================================
Path#      Adapter/Hard Disk          State     Mode     Select     Errors
    0          fscsi0/hdisk4           OPEN   NORMAL      21672          0
    1          fscsi1/hdisk7           OPEN   NORMAL      21712          0


# lsattr -El vpath0
active_hdisk  hdisk2/75BAFX1006C/fscsi0        Active hdisk               False
active_hdisk  hdisk5/75BAFX1006C/fscsi1        Active hdisk               False
policy        df                               Scheduling Policy          True
pvid          00c49e8c94c02c150000000000000000 Physical volume identifier False
serial_number 75BAFX1006C                      LUN serial number          False


# lsdev -Cc adapter
ent0      Available 04-08 10/100/1000 Base-TX PCI-X Adapter (14106902)
ent1      Available 06-08 10/100/1000 Base-TX PCI-X Adapter (14106902)
fcs0      Available 05-08 FC Adapter
fcs1      Available 07-08 FC Adapter
sa0       Available       LPAR Virtual Serial Adapter
sisscsia0 Available 03-08 PCI-X Ultra320 SCSI Adapter


# lsattr -El fcs0
bus_intr_lvl  131193     Bus interrupt level                                False
bus_io_addr   0xcfc00    Bus I/O address                                    False
bus_mem_addr  0xc0040000 Bus memory address                                 False
init_link     al         INIT Link flags                                    True
intr_priority 3          Interrupt priority                                 False
lg_term_dma   0x800000   Long term DMA                                      True
max_xfer_size 0x100000   Maximum Transfer Size                              True
num_cmd_elems 200        Maximum number of COMMANDS to queue to the adapter True
pref_alpa     0x1        Preferred AL_PA                                    True
sw_fc_class   2          FC Class for Fabric                                True


# lscfg -lv fcs0
  fcs0             U7879.001.DQDKCPR-P1-C2-T1  FC Adapter

        Part Number.................03N6441
        EC Level....................A
        Serial Number...............1D54508045
        Manufacturer................001D
        Feature Code................280B
        FRU Number.................. 03N6441
        Device Specific.(ZM)........3
        Network Address.............10000000C94F91CD
        ROS Level and ID............0288193D
        Device Specific.(Z0)........1001206D
        Device Specific.(Z1)........00000000
        Device Specific.(Z2)........00000000
        Device Specific.(Z3)........03000909
        Device Specific.(Z4)........FF801412
        Device Specific.(Z5)........0288193D
        Device Specific.(Z6)........0683193D
        Device Specific.(Z7)........0783193D
        Device Specific.(Z8)........20000000C94F91CD
        Device Specific.(Z9)........TS1.90X13
        Device Specific.(ZA)........T1D1.90X13
        Device Specific.(ZB)........T2D1.90X13
        Device Specific.(YL)........U7879.001.DQDKCPR-P1-C2-T1


# lsdev -Cc adapter -F 'name parent'
ent0      pci4
ent1      pci6
fcs0      pci5
fcs1      pci7
sa0
sisscsia0 pci3


# lsdev -Cc disk -F 'name location'
hdisk0 03-08-00-3,0
hdisk1 03-08-00-5,0
hdisk2 05-08-01 ------------------------>|
hdisk3 05-08-01 ------------------------>|
hdisk4 05-08-01 ------------------------>|
hdisk5 07-08-01                          |
hdisk6 07-08-01                          |
hdisk7 07-08-01                          |
vpath0                                   |
vpath1                                   |
vpath2                                   |
                                         |
                                         |
# lsdev -Cc driver -F 'name location'    |
dpo                                      |
fcnet0 05-08-02                          |
fcnet1 07-08-02                          |
fscsi0 05-08-01 <-------------------------
fscsi1 07-08-01
iscsi0
scsi0  03-08-00

Please note that, for example, from the above output, that fsci0 can be "linked" to hdisk2, hdisk3 and hdisk4,
due to the location code.
You can compare that to the output of "datapath query device".
Also interesting can be the following:

# lsdev -C | grep fc
fcnet0      Defined   05-08-02      Fibre Channel Network Protocol Device
fcnet1      Defined   07-08-02      Fibre Channel Network Protocol Device
fcs0        Available 05-08         FC Adapter
fcs1        Available 07-08         FC Adapter

# lsdev -C | grep fsc
fscsi0      Available 05-08-01      FC SCSI I/O Controller Protocol Device
fscsi1      Available 07-08-01      FC SCSI I/O Controller Protocol Device

From this, you can see that fcs0 is the "parent" of the child "fsci0".


# lsattr -D -l fscsi0
attach       none         How this adapter is CONNECTED         False
dyntrk       no           Dynamic Tracking of FC Devices        True
fc_err_recov delayed_fail FC Fabric Event Error RECOVERY Policy True
scsi_id                   Adapter SCSI ID                       False
sw_fc_class  3            FC Class for Fabric                   True

# lsattr -D -l fcs0
bus_intr_lvl             Bus interrupt level                                Fals                                              e
bus_io_addr   0x00010000 Bus I/O address                                    Fals                                              e
bus_mem_addr  0x01000000 Bus memory address                                 Fals                                              e
init_link     al         INIT Link flags                                    True
intr_priority 3          Interrupt priority                                 Fals                                              e
lg_term_dma   0x800000   Long term DMA                                      True
max_xfer_size 0x100000   Maximum Transfer Size                              True
num_cmd_elems 200        Maximum number of COMMANDS to queue to the adapter True
pref_alpa     0x1        Preferred AL_PA                                    True
sw_fc_class   2          FC Class for Fabric                                True


# datapath query essmap
 Disk          Path  P     Location   adapter    LUN SN       Type           Size   LSS     Vol  Rank  C/A   S   Connection  port RaidMode
-------       -----  -   -----------  ------   -----------  ------------     ----   ----    ---  ----- ----  -   ----------- ---- --------
vpath0        hdisk2     05-08-01[FC] fscsi0   75BAFX1006C  IBM 2107-900  107.5GB     0    108   fff2   02   Y   R1-B3-H3-ZC  232 RAID5
vpath0        hdisk5     07-08-01[FC] fscsi1   75BAFX1006C  IBM 2107-900  107.5GB     0    108   fff2   02   Y   R1-B3-H3-ZA  230 RAID5
vpath1        hdisk3     05-08-01[FC] fscsi0   75BAFX1017B  IBM 2107-900   14.3GB     1    123   fff1   0b   Y   R1-B3-H3-ZC  232 RAID5
vpath1        hdisk6     07-08-01[FC] fscsi1   75BAFX1017B  IBM 2107-900   14.3GB     1    123   fff1   0b   Y   R1-B3-H3-ZA  230 RAID5
vpath2        hdisk4     05-08-01[FC] fscsi0   75BAFX10329  IBM 2107-900   14.3GB     3     41   ffe1   08   Y   R1-B3-H3-ZC  232 RAID5
vpath2        hdisk7     07-08-01[FC] fscsi1   75BAFX10329  IBM 2107-900   14.3GB     3     41   ffe1   08   Y   R1-B3-H3-ZA  230 RAID5

From this you can see that a hdisk is actually a "path" to a LUN, that can be reached either by fscsi0 or fscsi1.
Also you can see that a vpath represents the LUN.
 

# datapath query adaptstats

Adapter #:  0
=============
                Total Read  Total Write  Active Read  Active Write   Maximum
I/O:               9595892      4371836            0             0        23
SECTOR:          176489389    138699019            0             0      5128

Adapter #:  1
=============
                Total Read  Total Write  Active Read  Active Write   Maximum
I/O:              10238891      4523508            0             0        24
SECTOR:          188677891    143739157            0             0      5128


# datapath query portmap
                          BAY-1(B1)                BAY-2(B2)                BAY-3(B3)                BAY-4(B4)
   ESSID    DISK      H1   H2   H3   H4        H1   H2   H3   H4        H1   H2   H3   H4        H1   H2   H3   H4
                     ABCD ABCD ABCD ABCD      ABCD ABCD ABCD ABCD      ABCD ABCD ABCD ABCD      ABCD ABCD ABCD ABCD
                          BAY-5(B5)                BAY-6(B6)                BAY-7(B7)                BAY-8(B8)
                      H1   H2   H3   H4        H1   H2   H3   H4        H1   H2   H3   H4        H1   H2   H3   H4
                     ABCD ABCD ABCD ABCD      ABCD ABCD ABCD ABCD      ABCD ABCD ABCD ABCD      ABCD ABCD ABCD ABCD
 75BAFX1    vpath0   ---- ---- ---- ----      ---- ---- ---- ----      ---- ---- Y-Y- ----      ---- ---- ---- ----
 75BAFX1    vpath1   ---- ---- ---- ----      ---- ---- ---- ----      ---- ---- Y-Y- ----      ---- ---- ---- ----
 75BAFX1    vpath2   ---- ---- ---- ----      ---- ---- ---- ----      ---- ---- Y-Y- ----      ---- ---- ---- ----

Y  =  online/open               y = (alternate path) online/open
O  =  online/closed             o = (alternate path) online/closed
N  =  offline                   n = (alternate path) offline
-  =  path not configured
PD =  path down

Note: 2105 devices' essid has 5 digits, while 1750/2107 device's essid has 7 digits.


# datapath query wwpn
Adapter Name    PortWWN
fscsi0          10000000C94F91CD
fscsi1          10000000C94F9923


If you need to force the Subsystem Device Driver (SDD), or equivalent driver, to rescan and map the new devices,
use the following command at the system prompt: 

# /usr/sbin/cfgvpath

Procedure to make a new lun available to AIX:
---------------------------------------------

-Allocate the new lun on the SAN 
-Run "cfgmgr" 
-Verify the new vpath/hdisk by running "lsvpcfg" 

There should be a new vpath and it should be available with no volume group - if not, rerun cfgmgr


Create Volume groups with vpaths:
---------------------------------

You should use the mkvg4vp command to create Volume Groups.

Example:

# mkvg4vp -B -t 32 -s 4 -y DB01_RECOV_VG1 vpath4 vpath10

By default, VG's can accommodate up to 255 LV's and 32 PV's. If the -B flag is used on the mkvg or mkvg4vp
command, the resulting VG will support up to 512 LV's and 128 PV's.
The -s flag, as usual, designates the Partition size.


SDD software on AIX:
--------------------

Starting with SDD 1.6.1.0, the SDD package for AIX53 is devices.sdd.53.rte and requires AIX53E 
with APAR IY76997.

Starting with SDD 1.6.2.0, the SDD package for AIX52 is devices.sdd.52.rte and requires AIX52M
with APAR IY76997.

See also in this document:
IBM Flash Alert: SDD 1.6.2.0 requires minimum AIX code levels; possible 0514-035 error

The SDD installation package installs a number of new commands, like datapath, chgvpath, lsvpcfg etc..

Before installing SDD, you should check firmware levels, and AIX APAR requirements. See the following sites: 

-- scsi and ESS, and Fiber:
www-1.ibm.com/servers/storage/support/
www-1.ibm.com/servers/eserver/support/unixservers/index.html 

-- AIX APAR:
www-03.ibm.com/servers/eserver/support/unixservers/aixfixes.html            or,
www.ibm.com/servers/eserver/support/pseries/aixfixes.html                   or,
www14.software.ibm.com/webapp/set2/sas/f/genunix3/aixfixes.html


31.5 SAN connections with SDDPCM MPIO:
======================================

We have seen the SDD connections in section 31.4.

This section covers some of the SDDPCM MPIO SAN connections. 
There are some different commands with this type
of connections to SAN storage.

The use of SDD or SDDPCM gives the AIX host the ability to access multiple paths to a single LUN 
within an ESS or SAN. This ability to access a single LUN on multiple paths allows for a higher degree of 
data availability in the event of a path failure. Data can continue to be accessed within the ESS 
as long as there is at least one available path. Without one of these installed, you will lose access 
to the LUN in the event of a path failure. 

If you have "sdd" installed use the datapath command, and with sddpcm use the pcmpath command.

Just as the commands shown in section 31.4, just replace datapath with pcmpath, like


# pcmpath query device

DEV#:   2  DEVICE NAME: hdisk2  TYPE: 2107900  ALGORITHM:  Load Balance
SERIAL: 75065711100
==========================================================================
Path#      Adapter/Path Name          State     Mode     Select     Errors
    0           fscsi0/path0           OPEN   NORMAL       1240          0
    1           fscsi0/path1           OPEN   NORMAL       1313          0
    2           fscsi0/path2           OPEN   NORMAL       1297          0
    3           fscsi0/path3           OPEN   NORMAL       1294          0

DEV#:   3  DEVICE NAME: hdisk3  TYPE: 2107900  ALGORITHM:  Load Balance
SERIAL: 75065711101
==========================================================================
Path#      Adapter/Path Name          State     Mode     Select     Errors
    0           fscsi0/path0          CLOSE   NORMAL          0          0
    1           fscsi0/path1          CLOSE   NORMAL          0          0
    2           fscsi0/path2          CLOSE   NORMAL          0          0
    3           fscsi0/path3          CLOSE   NORMAL          0          0

DEV#:   4  DEVICE NAME: hdisk4  TYPE: 1750500  ALGORITHM:  Load Balance
SERIAL: 13AAGXA1101
==========================================================================
Path#      Adapter/Path Name          State     Mode     Select     Errors
    0*          fscsi0/path0           OPEN   NORMAL         12          0
    1           fscsi0/path1           OPEN   NORMAL       3787          0
    2*          fscsi1/path2           OPEN   NORMAL         17          0
    3           fscsi1/path3           OPEN   NORMAL       3822          0


# pcmpath query essmap


Some possible errors with pcmpath:

root@zd110l04:/root#pcmpath query device

Kernel extension sdduserke was not loaded. Errno=8.
Please verify SDDPCM device configuration.


On a system with SDDPCM, you will see the SDDPCM server daemon, "pcmsrv", running. 
This process checks available paths and does other checks and monitoring.

The process is under control of the resource controller, like for example starting and stopping it goes with

# stopsrc -s pcmsrv
# startsrc -s pcmsrv

The process is started on boot from inittab:

# cat /etc/inittab | grep pcmsrv
srv:2:wait:/usr/bin/startsrc -s pcmsrv > /dev/null 2>&1


Notes on SDD and SDDPCM:
========================

Note 1:
-------

thread

Q +A:

> I've been reading IBM web sites and PDF manuals and still can't decide
> on exactly how to upgrade my AIX 4.3.3 machine to AIX 5.2 and have my
> ESS SDD vpath disks visible and working when I'm done.
>
> Has someone done this? Can you comment on my proposed method here?

Yes, I've done this.


> What I think I need to do is this:
>
> 1. Do the migration installation from 4.3.3 to 5. Question: Do I need to
> do anything to my ESS disks BEFORE migrating? Unmount? Vary off volume
> groups? Export volume groups?

Yes to all of the above, prior to upgrade. Uninstall SDD software.


> 2. After the migration, and reboot, I understand that the ESS disks will
> not "be there", since the migration does not upgrade the SDD (subsystem
> device driver) does NOT get upgraded. Question: Is this true?

Yes, the datapath devices will be gone because you deleted the SDD
software; IIRC, that is part of the un-install process. After your
upgrade, install SDD just like the first time. This will get you your
hdisks and vpaths back, though not necessarily with the same numbers; have
a 'lsvpcfg' from before your upgrade to cross-reference your new setup to.
'importvg' the VG(s) one at a time, using one of the hdisk's which
constitute the vpath, then run 'hd2vp' on the VG. That will convert the
VG back to using the vpath's.

Note: IIRC, If I Recall/Remember Correctly

>
> 3. Vary off all ESS volume groups, if I shouldn't have done this back in
> step 1.
>
> 4. Remove all the "datapath devices", via: rmdev -dl dpo -R
>
> 5. Uninstall the 4.3 version of the SDD.
>
> 6. Install the 5.2 version of the SDD.
>
> 7. Install the latest PTF of the 5.2 SDD, that they call version
> 1.5.1.3.
>
> 8. Reboot.
>
>
> If you can tell me how to make this procedure more nearly correct, I'd
> greatly appreciate it.


Note 2:
-------

thread

Q + A:

>
> I need a quick refresher here. I've got a HACMP (4.4) cluster with SAN- attached
> ESS storage. SDD is installed. Can I add volumes to one of these volume groups on
> the fly, or does HA need to be down? It's been awhile since I have done this and I
> can't quite remember if I have to jump through any hoops. Thanks for the help.

Should be relatively easy with no downtime required.
1) acquire the new disks on primary node (where the VG is in service) with: 

cfgmgr -Svl fcs0 
- repeat this for all fcs adapters in system
2) convert hdisks to vpaths, note use the smit screens for this because the commands
have changed from version to version.
3) add vpaths to VG with: extendvg4vp vgname vpath#
4) create LVs/filesystems on the vpaths.
5) break VG/scsi locks so that other systems can see the disks with: varyonvg
-b -u vgname
6) perform steps 1 & 2 for all failover nodes in the cluster.
7) refresh the VG definitions on all the failover nodes with: importvg -L
vgname vpath#
8) reestablish disk locks on service node with: varyonvg vgname
9) add new filesystems to HA configuration.
10) synchronise HA resources to the cluster.


Note 3:
-------

From IBM Doc SC30-4131-00:


hd2vp and vp2hd 

SDD provides two conversion scripts, hd2vp and vp2hd. 

The hd2vp script converts a volume group from supported storage device
hdisks to SDD vpath devices, and the vp2hd script converts a volume
group from SDD vpath devices to supported storage device hdisks. 

Use the vp2hd program when you want to configure your applications back
to original supported storage device hdisks, or when you want to remove
SDD from your AIX host system. 

The syntax for these conversion scripts is as follows:
hd2vp vgname 
vp2hd vgname 

vgname Specifies the volume group name to be converted.


Note 4:
-------

thread

Q:

Hi There, 
I want to add a vpath to running hacmp cluster with HACMP 5.1 on AIX 5.2 with Rotating Resource Group. 
If anyone has done it before then can provide a step by step procedure for this. Do i need to stop and start 
HACMP for this? 


A:

On Vg active node : 
#extendvg4vp vg00 vpath10 vpath11 
#smitty chfs ( Increase the f/s as required ) 
#varyonvg -bu vg00 ( this is to un-lock the vg) 

On Secondary node where vg is not active : 
# cfgmgr -vl fscsi0 ( fscsi1 and fcs0 and fcs1 ) 
Found new vpaths 
# chdev -l vpath10 -a pv=yes ( for vpath11 also ) 
# lsvg vg00|grep path ( just note down any one vpath which is from this o/p-for e.g vpath0 ) 
# importvg vg00 vpath0 

Once its fine...go to Primary Node 

# varyonvg vg00 ( Locking the VG ) 

Regards

Note 5:
-------

> HI,

> Is there a way to know dependencies between devices.
> For example,
> hdisk2 is attached to fscsi0 which in turn is attached to fcs0

> I have found nothing in lsdev's man
> Do I have to look in the odm directly

> I need this in order to improve a script

This is a good question and the lsdev man
page should be burned in front of the building
where they develop and document AIX in
Austin, TX, for not answering it for you.
After all, you bothered to read the damn
thing; why didn't it tell you?

$ /usr/sbin/lsdev -Cc adapter -F 'name parent'
ppa0 isa0
sa0 isa0
sa1 isa0
sa2 isa0
siokma0 isa0
fda0 isa0
scsi0 pci0
ent0 pci0
cxpa0 pci0
ent1 pci0
mga0 pci1
ent2 pci1
scsi1 pci2
sioka0 siokma0
sioma0 siokma0
ent3 pci0

There's also the lsparent command.

Regards,

Actually, I have the same question as Frederic and you have not
quite answered it. Sure, lsdev can tell you that "hdisk5" is
matched to "fcs0" . . . but what tells you that "fcs0" in turn
matches to "fscsi0"? And if "hdisk126" matches to adapter "fchan1",
how do I determine what that matches to? I've checked all of the
various lsxxxx commands but can't find this bit of info.

ONCE AGAIN the answer pops up just moments after announcing
to the world that "there's no way to do that" and "I've looked
everywhere and tried everything". Herewith the output from the
necessary commands, with extraneous lines removed:

# lsdev -C -c disk -F 'name location'
hdisk0 11-08-00-2,0
hdisk1 11-08-00-4,0
hdisk2 3A-08-01
hdisk3 3A-08-01
hdisk4 27-08-01
hdisk5 27-08-01


# lsdev -C -c driver -F 'name location'
fscsi0 27-08-01
fscsi1 3A-08-01

# lsdev -C -c adapter -F 'name location'
scsi0 11-08
scsi1 11-09
fcs0 27-08
mg20 2D-08
fcs1 3A-08
#

Obviously it is a simply matter to match disk to adapter to driver
by the location of each object. After that I can easily

sprintf(pathname, "/dev/%s", driver);
fp = open(pathname, O_RDONLY | O_NDELAY);
ioctl(fp, SCIOINQU, &info);

to get the scsi inquiry buffer.


Note 6:
-------

thread

Q:

where to fidnd a guide for the adapter (described  all its states, LED blinkging/lighting)

Adapter is cabled by SAN guys, they double checked it and when I run:

rmdev -Rl fcs0
cfgmgr -l fcs0
lsattr -El fscsi0 -l attach

I don't see "switch" but "none".


thx in advance.

A:

Did you check SAN Switch Zoning?

Regards,

Do something like:

rmdev -Rdl fscsi0
rmdev -dl fcnet0
rmdev -l fcs0
cfgmgr -l fcs0

rmdev -Rdl fscsi0

rmdev -Rdl fscsi1
rmdev -l fcs1

This way, the FC adapter re-negociates an FC fabric logon.

HTH,

I had already done something similiar but it didn't helped:

# lsslot -c slot|grep fcs0
U787B.001.DNWFFM5-P1-C4   Logical I/O Slot  pci4 fcs0
# rmdev -dl pci4 -R
fcnet0 deleted
fscsi0 deleted
fcs0 deleted
pci4 deleted
# cfgmgr
Method error (/usr/lib/methods/cfgefscsi -l fscsi0 ):
        0514-061 Cannot find a child device.
# lsattr -El fscsi0 -a attach
attach none How this adapter is CONNECTED False

the second FC is connected ok:
# lsattr -El fscsi1 -a attach
attach switch How this adapter is CONNECTED False
#

thx anyway,
I will ask my SAN team to check cables once more.
 

Note 7:
-------

thread

hdisk and vpath correspondance for IBM SAN (shark) 
Description

Correspondance between phsical disks:

4 hdisk = 1 vpath = 1 physical disk

To remove all vpaths run the command:

# rmdev -dl dpo -R

To remove all fibre channel disks (2 cards in this example):

# rmdev -dl fscsi0 -R
# rmdev -dl fscsi1 -R

To recreate the hdisks run the command:
# cfgmgr -vl fcs0
# cfgmgr -vl fcs1

To recreate the vpaths run the command:

# cfallvpath

To delete a device run this command:

# rmdev -l fcs1 -d 
Example

rmdev -dl dpo -R ; rmdev -dl fscsi0 -R ; cfgmgr -vl fcs0 ; cfallvpath 


Note 8:
-------

Technote (FAQ) 
  
Problem 
When non-root AIX users issue SDD datapath commands, the "No device file found" message results.  
  
Cause 
AIX SDD does not distinguish between file not found and invalid permissions.  
  
Solution 
Login as the root user or "su" to root user and re-execute command in order to obtain the desired SDD datapath 
command output.  


Note 9: 
-------

(thread ibm site)

Question:

Hi,

I have an AIX 5.3 server running with 2 FCs. One on a DS8300 and one on a DS4300.
On the server, i have a filesystems that is mounted and active (hdisks are from the DS8300). 
I can access it fine, write, delete etc...

Yet, when i do a "datapath query adapter" i get the following :

# datapath query adapter
Active Adapters :1
Adpt# Name State Mode Select Errors Paths Active
0 fscsi0 NORMAL ACTIVE 4111177 0 32 0

I would expect to see my 32 paths Active. I checked another server that has a similar configuration 
(though it only has 1 FC) and i can see 32 Paths, 32 Active...

Is it because of the other FC being connected to a DS4300?

Answer:

Hi.

The reason is that the vpaths are not part of a varied on volume group.
If you do a 'datapath query device' you should find all the paths will be 
state=closed.
If the vpaths are being used by a volume group, do a varyonvg xxxx.
Then display the datapath and the paths should be active.

Question:

Hi.

THanks, but as i mentionned in my original post, the VG is varied on and the FS is mounted. I ran the 
datapath command after i i varyonvg bkpvg and mount /backup. 
I then dumped a DB within the FS, deleted and everything else works...yet datapath query adapter shows 
no Active paths...weird...

Question:

Hi.

What version of SDD?
What does 'datapath query device' say?

Answer:

Version of SDD is 1.6.0.5
And a datapath query device shows :

...

DEV#: 14 DEVICE NAME: vpath14 TYPE: 2107900 POLICY: Optimized
SERIAL: 75AYYV111B7
===========================================================================
Path# Adapter/Hard Disk State Mode Select Errors
0 fscsi0/hdisk40 CLOSE NORMAL 147989 0
1 fscsi0/hdisk23 CLOSE NORMAL 0 0

DEV#: 15 DEVICE NAME: vpath15 TYPE: 2107900 POLICY: Optimized
SERIAL: 75AYYV111B8
===========================================================================
Path# Adapter/Hard Disk State Mode Select Errors
0 fscsi0/hdisk41 CLOSE NORMAL 155256 0
1 fscsi0/hdisk24 CLOSE NORMAL 0 0


yet, as i mentionned, my FS /backup is mounted and accessible... 


Note 10:
--------

thread

Q:

Hi All, 

I am having problems on a p570 on which there are 3 HBA cards. 
2 of the HBAs are connected via a SAN switch to an ESS 800. 
It appears only one of the "paths" to the ESS 800 is working 
As I only have one set of view of the disks on the ESS. 

Running cfgmgr on the adapter gives the following error. 

I have tried removing fscsi0 then unconfiguring fcs0, 
Then reconfiguring fcs0 but I still get the same error. 
Any ideas? Is there some command/utility I can run to verify 
The state of ths HBA? Thank you. 

bash-3.00# cfgmgr -l fcs0 
Method error (/usr/lib/methods/cfgefscsi -l fscsi0 ): 
0514-061 Cannot find a child device. 
bash-3.00# 

0514-061 Cannot find a child device 

A:

HI 

I have had the same problem using HDS SAN devices. 

AT that time I did not have the corect version off the device driver for the fiber cards in P570. 

For aix 5.2 
devices.pci.df1000fa >= 5.2.0.40 
For aix 5.3 
devices.pci.df1000f7 >= 5.3.0.10 

/HGA


Note 11:
--------

Greetings: 

The "0514-061 Cannot find a child device" is common when the FC card is either 
not attached to a FC device, or if it is attached, then I would look at the 
polarity of the cable 
ie. (tx -> rx and rx -> tx) NOT (tx -> tx and rx -> rx) 

cfgmgr is attempting to configure the FC device it is connected to (child 
device) but is unable to see it. 

In this context, device would be some sort of FC endpoint, not just a switch or 
director. 

I would make sure the FC card has connectivity to a FC device, not just the 
fabric and re-run cfgmgr. 


-=Patrick=- 


"Vincent D'Antonio, III" <dantoniov@COMCAST.NET> on 02/19/2003 01:51:24 PM 

Please respond to IBM AIX Discussion List <aix-l@Princeton.EDU> 

  To: aix-l@Princeton.EDU 
  cc: (bcc: Patrick Bigelbach/DSS) 
  Subject Re: Cannot cfgmgr on a new FC 

Put in your OS cd in the cdrom drive and run: 

cfgmgr -vi /dev/cd0 

this should load any filesets you need for the adapter if they are not 
already there. You should the adapter in lsdev -Cc adapter | grep fs. 

HTH 
Vince 

-----Original Message----- 
From: IBM AIX Discussion List [mailto:aix-l@Princeton.EDU] On Behalf Of 
Calderon, Linda 
Sent: Wednesday, February 19, 2003 10:12 AM 
To: aix-l@Princeton.EDU 
Subject: Cannot cfgmgr on a new FC 

I am trying to connect a new HBA on a P660 to a switch for a SAN. This HBA 
has not been used previously, newly cabled etc. I issued the following 
commands and receive the following errors: 

* rmdev -Rdl fsc1 

0514-519 The following device was not found in the customized device 
configuration database: name 'fcs1' 

* cfgmgr 

0514-061 Cannot find a child device 

Looking for ideas as to root cause. 


Note 12:
--------

thread

Q:

Hi All AIXers,
I am trying to add some vpath to Current Volume Group (which is on vpath)and i
am getting this error


Method Error (/usr/lib/methods/chgvpath):
0514-047 Cannot access a device

0516-1182 extendvg open failure on vpath3

0516-792 extendvg: Unable to estend a Volume Group

Do anybody have any idea about this error. I never seen this error before.
Thanks


A:

James,

If you're adding a vpath to a volume group that has other vpaths, you
will need to use extendvg4vp instead of extendvg.

Hope this helps!


Note 13:
--------

On Vg active node : 
#extendvg4vp vg00 vpath10 vpath11 
#smitty chfs ( Increase the f/s as required ) 
#varyonvg -bu vg00 ( this is to un-lock the vg) 

On Secondary node where vg is not active : 
# cfgmgr -vl fscsi0 ( fscsi1 and fcs0 and fcs1 ) 
Found new vpaths 
# chdev -l vpath10 -a pv=yes ( for vpath11 also ) 
# lsvg vg00|grep path ( just note down any one vpath which is from this o/p-for e.g vpath0 ) 
# importvg vg00 vpath0 

Once its fine...go to Primary Node 

# varyonvg vg00 ( Locking the VG ) 

Regards


Note 14:
--------

thread

How to add a a new PV into an existing concurrent mounted VG.

The PMR action plan suggests:

- stop of the resource group
- varyoffvg dummyvg
- varyonvg -nc dummyvg
- extendvg4vp dummyvg vpath0
- start of the resource group

as a backup action

- restart of the cluster
- extendvg4vp dummyvg vpath0
- start of the resource group

After a spech with the Country IBM referent we modify the action plan
in:

- stop of the cluster
- varyoffvg dummyvg
- varyonvg dummyvg
dummyvg should remain Enhanced Concurrent Capable, but I mount
it in normal mode to do the extentions
- extendvg4vp dummyvg vpath0
- importvg -L dummyvg disk on the other node of the cluster
- varyoffvg dummyvg
- cluster verification & syncro
- start of the cluster

Anyway before applying the modified action plan I try to follow the
original one, but with unpredictable return codes. With some vpaths
works, with someothers halfworks (update the VGDA, but not the odm),
with others return the original error.

In my opinion there is an high probability that the cause is in
gsclvmd...

So, a bit disappointed, I applied the modified plan.
All works and the extendvg4vp enlarged the dummyvg...
My machines are too downlevel and very full of lacks :-(

After that my curiosity pulls me to try the next step:

mirrorvg -s -c 2 dummyvg vpath0 vpath1
0516-1509 : VGDA corruption: physical partition info for this LV
is invalid.
0516-842 : Unable to make logical partition copies for logical
volume.
0516-1199 mirrorvg: Failed to create logical partition copies for
logical volume dummylv.
0516-1200 mirrorvg: Failed to mirror the volume group

Now, IBM support is working for analyze this new issue......

Regards.


Note 15: cfgmgr method errors:
------------------------------

1:
==

APAR status
Closed as program error.

Error description 
Users of the 64bit kernel may observe an error when cfgmgr is
invoked at runtime in the cfgsisscsi or cfgsisioa config
methods. Following is an example:
# cfgmgr
Method error (/usr/lib/methods/cfgsisscsi -l sisscsia0 ):
        0514-061 Cannot find a child device.

The error occurs in the cfgsisscsi or cfgsisioa routines
which automatically update the microcode on the adapter if
it is found to be at a level lower than the minimum supported
microcode level.

If the adapter was previously unconfigured, the adapter will
remain in the Defined state. A system reboot should make it
Available.

APAR information 
APAR number IY48873 
Reported component name AIX 5L POWER V5 
Reported component ID 5765E6200 
Reported release 520 
Status CLOSED PER 
PE NoPE 
HIPER NoHIPER 
Submitted date 2003-09-19 
Closed date 2003-09-19 
Last modified date 2003-10-24 


Note 16: cfgmgr method errors:
------------------------------

Q:

cfgmgr error-- devices are reported twice
Asked by kuntal_acharyy... on 11/28/2005 6:15:00 AM  

I have an IBM DS4400 with two EXP 700s expansion units connected to a pSeries 650 with AIX 5.1.I have 
created two logical drives in the storage unit.When i run "cfgmgr" to recognise the new raw physical volume 
each disk is reported twice. 

hdisk4 Available 1n-08-01 1742 (700) Disk Array Device 
hdisk5 Available 1n-08-01 1742 (700) Disk Array Device 
hdisk6 Available 11-08-01 1742 (700) Disk Array Device 
hdisk7 Available 11-08-01 1742 (700) Disk Array Device 

There is an error message while running cfgmgr: 

Method error (/etc/methods/cfgfdar -l dar0 ): 
0514-002 Cannot initialize the ODM. 
cfgmgr: 0514-621 WARNING: The following device packages are required for 
device support but are not currently installed. 
devices.scsi 

What may have cause the problem ? 
How ca I solve this problem? 
Any advice is truly welcome. 

A:

hi, I had met the same problem just as 
yours. 3 LPARs(AIX 5300-02) on a p570 
connect FastT600(Ds4300) with 2 HBA cards each, using SAN fibre switch. 2 of the 
LPARs reported hdisk twice, and 1 of them 
reported normally. And I found that the HBA cards on the normal one are in the PCI 
Slots belong to different BUSs, and the HBA cards on unnormal ones are in the same 
BUSs. Then I changed HBA cards to different BUSs' slots, deleted all the dar 
dac and HBA cards in the system, and cfgmgr at last. The problem got solved. I guess there must be some thing wrong with 
the BUS design. Some one told me that he solved the problem by install the last 
patch (AIX 5300-03). So my advice is that 
you should chang the HBA cards to differet 
slots, clear the system and cfgmgr. Or 
maybe update your AIX with the last patch. 
Just try and tell me the result. Good luck!


Note 17: cfgmgr method errors:
------------------------------

ed.malina@uvm.edu (Ed) wrote in message news:<bb30127.0311120759.171bdc46@posting.google.com>... 
> I deleted a scsi device from my 4.3.3 configuration with the following 
> command: 
> rmdev -l scsi2 -dR 
> 
> The device is a dual channel ultra scsi 3 card. I deleted it to try 
> to resolve some performance problems with a drawer connected to the 
> device. Incidentally, scsi3 which is the other side of the dual 
> channel card, is working fine. 
> 
> When I try to reconfigure the device with: 
> cfgmgr -v -lscsi2 
> 
> I get the following error: 
> 
> Method error (/usr/lib/methods/cfgncr_scsi -l scsi2 ): 
> 0514-034 The following attributes do not have valid values: 
> 
> Any thoughts on how to fix it? For the timebeing I can't reboot the 
> machine. Would a reboot be able to resolve the problem if there is no 
> other solution? 
> 
> Thanks! 
> -- Ed 

#>> Ed, 


what you probably should do is run the cfgmgr comand without the 
device name behind it. Because you deleted the scsi device with the 
options -dR you also removed any child devices. 


try this: cfgmgr -v 


Note 18: cfgmgr method errors:
------------------------------

Q:

Hi... 

Does someone know what to do with an SDD driver which can't detect vpaths 
from an ESS F20 but hdisks are already available on AIX? 

showvpath, cfgvpath, datapath query commands don't display or found anything 

By the way, rebooting the system didn't help 

I accept any suggestions. 

Regards 

Luis A. Rojas

A:

Thank you all for your suggestions 

I solve the problem using the hd2vp command which converts the logical 
hdisk 
to its related vpath. And Wal? !.. vpaths suddenly were recognized by 
cfgvpath command. 

I don't know why this happened, but, everything is OK now. 

To those people with similar problems, please check these following 
commands: dpovgfix, hd2vp, vp2hd 

Best Regards 


Note 19: fget_config:
---------------------

how to show the current state and volume (hdisk) ownership in a IBM DS4000 
Description

The fget_config command shows the current state and volume (hdisk) ownership.

To display controllers and hdisks that are associated with a specified DS4000 (dar):

# fget_config

To display the state of each controller in a DS4000 array, and the current path that is being used 
for I/O for each hdisk:

# fget_config -A 
Example

fget_config -A 


Note 20:
--------

Q:

dpovgfix, hd2vp, vp2hd
Asked by RandallGoff on 1/23/2007 9:38:00 AM  

What filesets do dpovgfix, hd2vp and vp2hd belong to. I installed my sdd 
driver and can see everything but can't find these commands. 

A:

They are part of your SDD drivers. You probably installed the devices.xxx filesets. Did you also 
install the host attachment script... the ibm2105 filesets?


Note 21:
--------

thread

Q:

Hi 

I have several AIX LPARS running on SVC controlled disks. Right now i have SDD SW 1.6.1.2. After configuration 
i have some vpath devices that can be managed using the datapath command. 
Now in a recent training of SVC i was asked to install the new SDDPCM driver in order to get some of the benefits 
of this SW driver. 

SDDPCM does not use the concept of vpath anymore, instead a hdisk device object is created. 
This object has definitions and attributes in ODM files. 

Recently i had to change a faulty HBA under SDD drivers. I was able to: 

1- datapath query device: in order to check hdisk devices belonging to the faulty adaptr. 
2- datapath query adapter: in order to check the faulty adapter. 
3- datapath set adapter XX offline: in order to put the faulty HAB offline. 
4- datapath remove adapter XX 
5- Used the diag Hot Plug option to remove the PCI-x HBA and install a new one. 
   Configured the system and modified the corresponden zone. 

How to do the same with SDDPCM even when there's no concept of vpath anymore. 

Thanks in advanced

A:

Hello , 
You can do the same with sddpcm , either using the MPIO commands or smitty screens , smitty devices ---> MPIO devices 
there you can list paths , remove paths , adapters. 
IN the SDD user guide there is a complete section describing what you can do , but same functions you use 
for the vpath , you can use for sddpcm. 
Here is the link for the latest user guide 
http://www-1.ibm.com/support/docview.wss?rsP3&con text=ST52G7&dc=DA490&dc=DA4A30&dc=DA480&dc=D700&dc =DA410&dc=DA4A20&dc=DA460&dc=DA470&dc=DA400&uid=ss g1 S7000303&loc=en_US&cs=utf-8&lang=en


Note 22:
--------

thread

Q:

Greetings: 

Has anyone encountered the 0516-1182 ( mkvg: Open Failure on vpath ) or 
0516-826 ( mkvg: Unable to create volume group ) 
errors while trying to create a new volume group ? 

I attempted to create a new volume group using a couple of newly added 
vpath devices and received 
those errors. 

Any help will be greatly appreciated. 

Thanks in advance. 

Jay. 

A:

Hi 

If using vpath devices then you can confirm that you can open any given device by running: 

datapath query device 

and confirm there's no error in the HBA communications. 

Also you can review the errpt reports in order to look for VPATH OPEN messages. You can also use 
the lquerypr command in order to check for SCSI reservations in the SAN box previously set 
by another host (in case of a cluster). 

Hope this helps


Example lquerypr output

# lquerypr -Vh /dev/hdisk12
connection type: fscsi1
open dev: /dev/hdisk12

Attempt to read reservation key...

Attempt to read registration keys...
Read Keys parameter
        Generation :  52
        Additional Length:  32
        Key0 :  c8ca9d09
        Key1 :  c8ca9d09
        Key2 :  c8cabd09
        Key3 :  c8cabd09
Reserve Key provided by current host = c8cabd09
Not reserved.


Note 23:
--------

thread

Q:

All, 

I'm in the process of preparing for our upcoming disaster recovery exercise 
which is happening in a few weeks. Our plan is to create one big volume 
group, instead of a bunch of little ones like we have in our production 
environment, to try and save some time. 

My question is, is there a way to script using a for/next loop to assign 
each hdisk/vpath when creating a new volume group instead of going into smit 
and assigning them one by one by hand? The hdisks will be sequential and 
will probably be over a hundred in number so you can imagine how tedious 
this will be. Also, this will need to be bigvg enabled. 

Any of you scripters out there have any suggestions? Thanks for your help in 
advance!


A:

Create the VG 
>mkvg -B -y datavg vpathN 

Extend it 
for i in `lspv | grep vpath | grep None | awk '{print #1}'` 
do 
extendvg datavg $i 
done 

That would assign all unused vpaths to the VG. BTW Use the vpath and 
not the hdisk. You could add a count into it to limit the number of 
disks you assign.


Note 24:
--------

thread

Q:

Is anyone aware of a problem if i do a

cfgmgr -vl dp0
and once the vpaths are made
it shows as
vpathxx none None

and then i add the vpath to VG

#extendvg VGname vpathxx

Does this create a problem ?

A:

it sound like the vpath is showing correctly after cfgmgr so thats OK.
But you need to use extendvg4vp and not just extendvg
Do a 'smitty vg' and choose
'Add a Data Path Volume to a Volume Group'

Once its added to a VG then it will show more info in lspv


Note 25: cfgmgr Method error (/usr/sbin/fcppcmmap > /etc/essmap.out):
---------------------------------------------------------------------

Method error (/usr/sbin/fcppcmmap > /etc/essmap.out):
        0514-001 System error:


Note 26: mkpath, lspath commands:
---------------------------------

Examples mkpath:

--To define and configure an already defined path between scsi0 and the hdisk1 device at SCSI ID 5 
and LUN 0 (i.e., connection 5,0), enter: 
# mkpath -l hdisk1 -p scsi0 -w 5,0

The system displays a message similar to the following: 
path available

--To configure an already defined path from 'fscsi0' to fiber channel disk 'hdisk1', the command would be: 
# mkpath -l hdisk1 -p fscsi0

The message would look similar to: 
path available

--To only add to the Customized Paths object class a path definition between scsi0 and the hdisk1 disk device 
at SCSI ID 5 and LUN 0, enter: 
# mkpath -d -l hdisk1 -p scsi0 -w 5,0

The system displays a message similar to the following: 
path defined


Examples lspath:

lspath displays information about paths to an MultiPath I/O (MPIO) capable device.

Examples of displaying path status:

-- To display the status of all paths to hdisk1 with column headers, enter: 
# lspath -H -l hdisk1

The system will display a message similar to the following: 
status    device   parent
enabled   hdisk1   scsi0
disabled  hdisk1   scsi1
missing   hdisk1   scsi2

-- To display, without column headers, the set of paths whose operational status is disabled, enter: 
# lspath -s disabled

The system will display a message similar to the following: 
disabled  hdisk1   scsi1
disabled  hdisk2   scsi1
disabled  hdisk23  scsi8
disabled  hdisk25  scsi8

--To display the set of paths whose operational status is failed, enter: 
# lspath -s failed

The system will display a message similar to the following: 
failed  hdisk1   scsi1
failed  hdisk2   scsi1
failed  hdisk23  scsi8
failed  hdisk25  scsi8

-- To display in a user-specified format, without column headers, the set of paths to hdisk1 whose path status 
is available enter: 
# lspath -l hdisk1 -s available -F"connection:parent:path_status:status"

The system will display a message similar to the following: 
5,0:scsi0:available:enabled
6,0:scsi1:available:disabled

Note that this output shows both the path status and the operational status of the device. 
The path status simply indicates whether the path is configured or not. The operational status indicates 
how the path is being used with respect to path selection processing in the device driver. 
Only paths with a path status of available also have an operational status. If a path is not currently configured 
into the device driver, it does not have an operational status.
Examples of displaying path attributes:

--If the target device is a SCSI disk, to display all attributes for the path to parent scsi0 at connection 5,0, 
use the command: 
# lspath -AHE -l hdisk10 -p scsi0 -w "5,0"
The system will display a message similar to the following: 
attribute  value  description                       user_settable
weight     1      Order of path failover selection  true


Note 26: About FastT and DS Storage:
------------------------------------

IBM TotalStorager FAStT has been renamed IBM TotalStorage DS4000 series 

DS4100 formerly FAStT100

DS4300 formerly FAStT600

DS4300 Turbo formerly FAStT600 Turbo

DS4400 formerly FAStT700

DS4500 formerly FAStT900


Note 27: from GPFS FAQ: 
-----------------------

Q20:

What's the difference between using an ESS with or without SDD or SDDPCM installed on the host? 

A20: 
The use of SDD or SDDPCM gives the AIX host the ability to access multiple paths to a single LUN 
within an ESS. This ability to access a single LUN on multiple paths allows for a higher degree of 
data availability in the event of a path failure. Data can continue to be accessed within the ESS 
as long as there is at least one available path. Without one of these installed, you will lose access 
to the LUN in the event of a path failure. 
However, your choice of whether to use SDD or SDDPCM impacts your ability to use single-node quourm:

Single-node quorum is not supported if SDD is installed. 
Single-node quorum is support if SDDPCM is installed.
To determine the GPFS disk support guidelines for SDD and SDDPCM for your cluster type, see

Q3: What disk support guidelines must be followed when running GPFS in an sp cluster type? 
Q6: What disk support guidelines must be followed when running GPFS in an rpd cluster type? 
Q9:What are the disk support guidelines that must be followed when running GPFS in an hacmp cluster type


Note 28: changing attributes of a fcs0 device:
----------------------------------------------

Examples:

# chdev -l fscsi0 -a fc_err_recov=fast_fail
# chdev -l fscsi0 -a dyntrk=yes

Display attributes:

# lsattr -El fscsi0

attach       switch       How this adapter is CONNECTED         False
dyntrk       no           Dynamic Tracking of FC Devices        True
fc_err_recov fast_fail    FC Fabric Event Error RECOVERY Policy True
scsi_id      0x741113     Adapter SCSI ID                       False
sw_fc_class  3            FC Class for Fabric                   True


Note 29: Flash alerts:
----------------------


IBM Flash Alert on AIX migration with vpaths:
---------------------------------------------

http://www-1.ibm.com/support/docview.wss?rs=540&context=ST52G7&uid=ssg1S1002295&loc=en_US&cs=utf-8&lang=en

All hdisks and vpath devices must be removed from host system before upgrading to SDD host attachment script 
32.6.100.21 and above. All MPIO hdisks must be removed from host system before upgrading to SDDPCM host attachment 
script 33.6.100.9. 
 Flash (Alert) 
  
Abstract 
When upgrading from SDDPCM host attachment script devices.fcp.disk.ibm2105.mpio.rte version 33.6.100.8 or below 
to 33.6.100.9, all SDDPCM MPIO hdisks must be removed from the AIX host system before the upgrade. 
When upgrading from SDD host attachment script ibm2105.rte version 32.6.100.18 or below to 32.6.100.21 or later, 
all AIX hdisks and SDD vpath devices must be removed from the AIX host system before the upgrade.  
  
Content 
Please note that this document contains the following sections:


Problem description, symptoms, and information 
SDD/host attachment upgrade procedures 
Recovery procedures should the ODM become corrupted 
Recovery procedures should the associations become corrupted 
Procedures for upgrading if rootvg is on an ESS disk

- Problem description, symptoms, and information:

Starting with SDDPCM host attachment script devices.fcp.disk.ibm2105.mpio.rte version 33.6.100.9 and 
SDD host attachment script ibm2105.rte version 32.6.100.21, ESS FCP devices are configured as "IBM MPIO FC 2105" 
for MPIO devices, and "IBM FC 2105" for ESS devices. This information can be seen in the "lsdev -Cc disk" output. 
Prior to these host attachment script versions, ESS FCP devices were configured as "IBM MPIO FC 2105XXX" for 
MPIO devices and "IBM FC 2105XXX" for ESS devices, where 'XXX' is the ESS device module, such as F20 or 800. 

If a host system is upgraded without removing all of the hdisks first, then the AIX host system ODM will 
be corrupted. Additionally, if all he hdisks are removed without removing all SDD vpath devices, 
then the associations between an SDD vpath device and its hdisks may be corrupted because the hdisk's device 
minor number may change after reconfiguration. The ODM corruption may look something like the following in the 
"lsdev -Cc disk" output:

# lsdev -Cc disk
lsdev: 0514-521 Cannot find information in the predefined device
configuration database for the customized device hdisk1.
lsdev: 0514-521 Cannot find information in the predefined device
configuration database for the customized device hdisk2.
lsdev: 0514-521 Cannot find information in the predefined device
configuration database for the customized device hdisk3.
lsdev: 0514-521 Cannot find information in the predefined device
configuration database for the customized device hdisk4.
lsdev: 0514-521 Cannot find information in the predefined device
configuration database for the customized device hdisk5.
lsdev: 0514-521 Cannot find information in the predefined device
configuration database for the customized device hdisk6.
lsdev: 0514-521 Cannot find information in the predefined device
configuration database for the customized device hdisk7.
lsdev: 0514-521 Cannot find information in the predefined device
configuration database for the customized device hdisk8.
hdisk0 Available 10-60-00-8,0 16 Bit SCSI Disk Drive
hdisk1 Available 20-60-01 N/A
hdisk2 Available 20-60-01 N/A
hdisk3 Available 20-60-01 N/A
hdisk4 Available 20-60-01 N/A
hdisk5 Available 20-60-01 N/A
hdisk6 Available 20-60-01 N/A
hdisk7 Available 20-60-01 N/A
hdisk8 Available 20-60-01 N/A

- SDD/host attachment upgrade procedures:

In order to prevent ODM corruption and vpath/hdisk association corruption, all hdisks and SDD vpath devices 
must be removed prior to the upgrade. The following procedure should be used when you want to upgrade:

- AIX OS only*
- Host attachment + AIX OS*
- SDD + AIX OS*
- Host attachment + SDD
- Host attachment only
- SDD + Host attachment + AIX OS*

* Upgrading the AIX OS will always require you to install the SDD which corresponds to the new AIX OS level.

To upgrade SDD only, follow the procedure in the SDD User's Guide.

1. Ensure rootvg is on local scsi disks. If this is not possible, see "Procedures for upgrading if rootvg is on 
   an ESS disk" below.
2. Stop all applications running on SDD Volume Groups/File Systems.
3. Unmount all File Systems of SDD volume group.
4. Varyoff all SDD volume groups.
5. If upgrading OS, save output of lspv command to remember pvids of VGs.
6. If upgrading OS, export volume groups with exportvg.
7. Remove SDD vpath devices with rmdev command.
8. Remove 2105 hdisk devices with rmdev command.
9. If upgrading OS, run 'stopsrc -s sddsrv' to stop sddsrv daemon.
10. If upgrading OS, uninstall SDD.
11. If required, upgrade ibm2105.rte. The recommended version is 32.6.100.18 if support for ESS model 750 is 
    not needed. Version 32.6.100.21 is required to support ESS model 750.
12. If upgrading OS, migrate AIX OS level.
13. If OS upgraded, boot to new AIX level with no disk groups online except rootvg, which is on local scsi disks. 
    /* reboot will automatically start at the end of migration */
14. If OS upgraded, install SDD for the new OS level. Otherwise, if required, upgrade SDD.
15. If OS not upgraded, configure hdisks with the 'cfgmgr -vl fcsX' command.
16. Configure SDD vpath devices by running 'cfallvpath'.
17. If OS upgraded, use lspv command to find out one physical volume which has a pvid matching the previous 
    SDD VG's pv.

Example:
===================================================
Previous lspv output (from step 4):
hdisk0 000bc67da3945d3c None 
hdisk1 000bc67d531c699f rootvg active
hdisk2 none None 
hdisk3 none None 
hdisk4 none None 
hdisk5 none None 
hdisk6 none None 
hdisk7 none None 
hdisk8 none None 
hdisk9 none None 
hdisk10 none None 
hdisk11 none None 
hdisk12 none None 
hdisk13 none None 
hdisk14 none None 
hdisk15 none None 
hdisk16 none None 
hdisk17 none None 
hdisk18 none None 
hdisk19 none None 
hdisk20 none None 
hdisk21 none None 
vpath0 000bc67d318fb8ea SDDVG0 
vpath1 000bc67d318fde50 SDDVG1 
vpath2 000bc67d318ffbb0 SDDVG2 
vpath3 000bc67d319018f3 SDDVG3 
vpath4 000bc67d319035b2 SDDVG4
Current lspv output (from this step):
hdisk0 000bc67da3945d3c None 
hdisk1 000bc67d531c699f rootvg active
hdisk2 000bc67d318fb8ea None 
hdisk3 000bc67d318fde50 None 
hdisk4 000bc67d318ffbb0 None 
hdisk5 000bc67d319018f3 None 
hdisk6 000bc67d319035b2 None 
hdisk7 000bc67d318fb8ea None 
hdisk8 000bc67d318fde50 None 
hdisk9 000bc67d318ffbb0 None 
hdisk10 000bc67d319018f3 None 
hdisk11 000bc67d319035b2 None 
hdisk12 000bc67d318fb8ea None 
hdisk13 000bc67d318fde50 None 
hdisk14 000bc67d318ffbb0 None 
hdisk15 000bc67d319018f3 None 
hdisk16 000bc67d319035b2 None 
hdisk17 000bc67d318fb8ea None 
hdisk18 000bc67d318fde50 None 
hdisk19 000bc67d318ffbb0 None 
hdisk20 000bc67d319018f3 None 
hdisk21 000bc67d319035b2 None 
vpath0 none None 
vpath1 none None 
vpath2 none None 
vpath3 none None 
vpath4 none None 

In this case, hdisk2, hdisk7, hdisk12, and hdisk17 from the current lspv output
has the pvid which matches the pvid of SDDVG0 from the previous lspv output. 
So, use either hdisk2, hdisk7, hdisk12, or hdisk17 to import the volume group 
with the name SDDVG0

18. Run hd2vp on all SDD volume groups.
19. Vary on all SDD volume groups.
20. Mount all file system back.

- Recovery procedures should the ODM become corrupted:

If the host system's ODM is already corrupted as a result of upgrading without removing the hdisks, 
please contact IBM Customer Support at 1-800-IBM-SERV to request a script to fix the corrupted ODM. 

- Recovery procedures should the associations become corrupted:

If vpath/hdisk association corruption has occurred because hdisks were removed without removing SDD vpath devices, 
all SDD vpath devices must be removed and reconfigured in order to correct this corrupted association.

- Procedures for upgrading if rootvg is on an ESS disk:

If rootvg is on an ESS device and cannot be moved to local scsi disks, all hdisks cannot be removed prior 
to the upgrade. In this case, the following procedure should be used to upgrade the SDD host attachment script 
to version 32.6.100.21 or later:

. Contact IBM Customer Support at 1-800-IBM-SERV to request a script to fix the corrupted ODM referenced above. 
. Without removing ESS hdisks, use smitty to upgrade the SDD host attachment script on the host system. 
. Immediately run the script to fix the corrupted ODM on the host system. 
. Run bosboot on the host system. 
. Reboot the host system so that the hdisks can be configured with the new ODM attributes. 
. Return to the "SDD/host attachment upgrade procedures" above and follow the appropriate upgrade steps now that 
  the SDD host attachment script upgrade is complete. 

This issue only occurs when upgrading to devices.fcp.disk.ibm2105.mpio.rte version 33.6.100.9 and SDD host 
attachment script ibm2105.rte version 32.6.100.21 and above.  
  
 
IBM Flash Alert: SDD 1.6.2.0 requires minimum AIX code levels; possible 0514-035 error:
---------------------------------------------------------------------------------------
 Flash (Alert) 
  
Abstract 
SDD 1.6.2.0 requires minimum AIX code levels. Not upgrading to correct AIX version and level can result in 
0514-035 error when attempting removal of dpo or vpath device  
  
Content 
Starting from SDD version 1.6.2.0, a unique ID attribute is added to SDD vpath devices, in order to 
support AIX5.3 VIO future features. AIX device configure methods have been changed in both AIX52 TL8 and 
AIX53 TL4 for this support.

Following are the requirements for this version of SDD with:

AIX5.2 and AIX5.3:  
AIX52 TL8 & above with PTF U804193 (IY76991)
AIX53 TL4 & above with PTF U804397 (IY76997)

Please view 1.6.2.0 readme for further details

If upgraded to SDD 1.6.2.0 and above without first upgrading AIX to the levels listed above the following error 
will be experienced when attempting to remove any vpath devices using the:

# rmdev -dl dpo -R

or the 

# rmdev -dl vpathX command.
                                                   
Method error (/usr/lib/methods/ucfgdevice):                           
0514-035 Cannot perform the requested function because of missing predefined information in the device 
configuration database. 

Solution:
1) Upgrade AIX to correct level and ptf, or
2) Contact SDD support at 1-800-IBM-SERV for steps to clean up ODM to allow for downgrading the SDD level 
   from 1.6.2.0, if unable to upgrade AIX to a newer technology level.  
 

Note 30:
--------

Suppose the following happens:

# rmdev -dRl fcs0

fcnet0 deleted
fscsi0 deleted
fcs0 deleted

# cfgmgr

Method error (/usr/lib/methods/cfgefscsi -l fscsi0 ):
        0514-061 Cannot find a child device.

root@n5114l02:/root#
adapter checked with several commands
connection with san seems impossible.
root@n5114l02:/root#lsattr -El fscsi0
attach       none         How this adapter is CONNECTED         False
dyntrk       no           Dynamic Tracking of FC Devices        True
fc_err_recov delayed_fail FC Fabric Event Error RECOVERY Policy True
scsi_id                   Adapter SCSI ID                       False
sw_fc_class  3            FC Class for Fabric                   True


Note 31:
--------

IY83872: AFTER CHVG -T, VG IS IN INCONSISTENT STATE 

 A fix is available 
Obtain fix for this APAR
 

APAR status
Closed as program error.

Error description 
#---------------------------------------------------
chvg -t renumber pvs that have pv numbers greater than
maxpvs with the new factor. chvg -t is only updating the
new pv_num in lvmrec and not updating the VGDA.
chvg -t leaves the vg is inconsistent state and any changes to
vg may get unpredictable results like a system crash.
Local fix 
Problem summary 
#---------------------------------------------------
chvg -t renumber pvs that have pv numbers greater than
maxpvs with the new factor. chvg -t is only updating the
new pv_num in lvmrec and not updating the VGDA.
chvg -t leaves the vg is inconsistent state and any changes to
vg may get unpredictable results like a system crash.
Problem conclusion 
Fix chvg -t to update the VGDA with the new pv number.
Add a check in hd_kextendlv to make sure that the pvol we
are trying to access is not null.
Temporary fix 
Comments 
APAR information 
APAR number IY83872 
Reported component name AIX 5.3 
Reported component ID 5765G0300 
Reported release 530 
Status CLOSED PER 
PE NoPE 
HIPER NoHIPER 
Submitted date 2006-04-11 
Closed date 2006-04-11 
Last modified date 2006-05-03 

APAR is sysrouted FROM one or more of the following:

APAR is sysrouted TO one or more of the following:

Publications Referenced


Fix information 
Fixed component name AIX 5.3 
Fixed component ID 5765G0300 

Applicable component levels 
R530 PSY U805071    UP06/05/03 I 1000 
 

Note 32:
========


ESB-2008.0267 -- [AIX] -- AIX Logical Volume Manager buffer overflow 

--------------------------------------------------------------------------------
 
Date: 14 March 2008 
AusCERT Reference #: ESB-2008.0267

Click here for printable version 
Click here for PGP verifiable version

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

===========================================================================
             AUSCERT External Security Bulletin Redistribution

                          ESB-2008.0267 -- [AIX]
                AIX Logical Volume Manager buffer overflow
                               14 March 2008

===========================================================================

        AusCERT Security Bulletin Summary
        ---------------------------------

Product:              AIX 5.2
                      AIX 5.3
Publisher:            IBM
Operating System:     AIX
Impact:               Root Compromise
Access:               Existing Account

Original Bulletin:    
http://www14.software.ibm.com/webapp/set2/subscriptions/pqvcmjd?mode=18&ID=4169

- --------------------------BEGIN INCLUDED TEXT--------------------

- -----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

IBM SECURITY ADVISORY

First Issued: Tue Jan 22 14:02:18 CST 2008
| Updated: Tue Mar 11 12:55:14 CDT 2008
| IZ10828 availablity updated
===============================================================================
                           VULNERABILITY SUMMARY

VULNERABILITY:   AIX Logical Volume Manager buffer overflow

PLATFORMS:       AIX 5.2, 5.3

SOLUTION:        Apply the fix or workaround as described below.

THREAT:          A local attacker may execute arbitrary code with root
                 privileges.

CERT VU Number:  n/a
CVE Number:      n/a
===============================================================================
                           DETAILED INFORMATION

I. OVERVIEW

    The AIX Logical Volume Manager provides a suite of utilities for
    AIX logical volume management features and functions. The primary
    fileset for the AIX Logical Volume Manager is 'bos.rte.lvm'. In
    addition, AIX provides another suite of utilities for concurrent
    logical volume management across multiple hosts.  The primary
    fileset for the AIX Concurrent Logical Volume Manager is
    'bos.clvm.enh'. Several imporant commands provided by these
    filesets for performing various logical volume management tasks
    have been identified as containing buffer overflow
    vulnerabilities.

II. DESCRIPTION

    Buffer overflow vulnerabilities exist in the 'bos.rte.lvm' and
    'bos.clvm.enh' fileset commands listed below.  A local attacker
    may execute arbitrary code with root privileges because the
    commands are setuid root.  The local attacker must be a member of
    the 'system' group to execute these commands.

    The following 'bos.rte.lvm' commands are vulnerable:

        /usr/sbin/lchangevg
        /usr/sbin/ldeletepv
        /usr/sbin/putlvodm
        /usr/sbin/lvaryoffvg
        /usr/sbin/lvgenminor

    The following 'bos.clvm.enh' command is vulnerable:

        /usr/sbin/tellclvmd

III. IMPACT

    The successful exploitation of this vulnerability allows a
    non-privileged user to execute code with root privileges.

IV. PLATFORM VULNERABILITY ASSESSMENT

    To determine if your system is vulnerable, execute the following
    command:

    lslpp -L bos.rte.lvm bos.clvm.enh

    The following fileset levels are vulnerable:

    AIX Fileset        Lower Level       Upper Level
    ------------------------------------------------
    bos.rte.lvm        5.2.0.0           5.2.0.107
    bos.rte.lvm        5.3.0.0           5.3.0.61
    bos.clvm.enh       5.2.0.0           5.2.0.105
    bos.clvm.enh       5.3.0.0           5.3.0.60

V. SOLUTIONS

    A. APARS

        IBM provides the following fixes:

        AIX Level           APAR number        Availability
        -----------------------------------------------------
        5.2.0               IZ00559            (available now)
|       5.2.0               IZ10828            05/07/2008
        5.3.0               IY98331            (available now)
        5.3.0               IY98340            (available now)
        5.3.0               IY99537            (available now)

        Subscribe to the APARs here:

        http://www.ibm.com/support/docview.wss?uid=isg1IZ00559
        http://www.ibm.com/support/docview.wss?uid=isg1IZ10828
        http://www.ibm.com/support/docview.wss?uid=isg1IY98331
        http://www.ibm.com/support/docview.wss?uid=isg1IY98340
        http://www.ibm.com/support/docview.wss?uid=isg1IY99537

        By subscribing, you will receive periodic email alerting you
        to the status of the APAR, and a link to download the fix once
        it becomes available.

    B. FIXES

        Fixes are available.  The fixes can be downloaded via ftp
        from:

        ftp://aix.software.ibm.com/aix/efixes/security/lvm_ifix.tar

        The link above is to a tar file containing this signed
        advisory, fix packages, and PGP signatures for each package.
        The fixes below include prerequisite checking. This will
        enforce the correct mapping between the fixes and AIX
        Technology Levels.

        AIX Fileset         AIX Level            Fix and Interim Fix
        -----------------------------------------------------------------
        bos.lvm.rte         5200-08              IZ10828_08.071212.epkg.Z
        bos.lvm.rte         5200-08              IZ00559_8a.071212.epkg.Z
        bos.clvm.enh        5200-08              IZ00559_8b.071212.epkg.Z

        bos.lvm.rte         5200-09              IZ10828_09.071212.epkg.Z
        bos.lvm.rte         5200-09              IZ00559_9a.071211.epkg.Z
        bos.clvm.enh        5200-09              IZ00559_9b.071211.epkg.Z

        bos.lvm.rte         5200-10              IZ10828_10.071212.epkg.Z
        bos.lvm.rte         5200-10              bos.rte.lvm.5.2.0.107.U
        bos.clvm.enh        5200-10              bos.clvm.enh.5.2.0.107.U

        bos.lvm.rte         5300-05              IY98331_05.071212.epkg.Z
        bos.lvm.rte         5300-05              IY99537_05.071212.epkg.Z
        bos.lvm.rte         5300-05              IY98340_5a.071211.epkg.Z
        bos.clvm.enh        5300-05              IY98340_5b.071211.epkg.Z

        bos.lvm.rte         5300-06              bos.rte.lvm.5.3.0.63.U
        bos.clvm.enh        5300-06              bos.clvm.enh.5.3.0.61.U

        To extract the fixes from the tar file:

        tar xvf lvm_ifix.tar
        cd lvm_ifix

        Verify you have retrieved the fixes intact:

        The checksums below were generated using the "sum", "cksum",
        "csum -h MD5" (md5sum), and "csum -h SHA1" (sha1sum) commands
        and are as follows:

        sum         filename
        ------------------------------------
        14660    17 IY98331_05.071212.epkg.Z
        26095     9 IY98340_5a.071211.epkg.Z
        40761     8 IY98340_5b.071211.epkg.Z
        10885    16 IY99537_05.071212.epkg.Z
        24909    10 IZ00559_8a.071212.epkg.Z
        64769     9 IZ00559_8b.071212.epkg.Z
        65110    10 IZ00559_9a.071211.epkg.Z
        25389     9 IZ00559_9b.071211.epkg.Z
        26812    26 IZ10828_08.071212.epkg.Z
        55064    26 IZ10828_09.071212.epkg.Z
        55484    26 IZ10828_10.071212.epkg.Z
        03885   157 bos.clvm.enh.5.2.0.107.U
        30581   128 bos.clvm.enh.5.3.0.61.U
        48971  1989 bos.rte.lvm.5.2.0.107.U
        64179  2603 bos.rte.lvm.5.3.0.63.U

        cksum              filename
        -------------------------------------------
        3121912357 16875   IY98331_05.071212.epkg.Z
        107751313  9190    IY98340_5a.071211.epkg.Z
        1129637178 7735    IY98340_5b.071211.epkg.Z
        4019303479 16201   IY99537_05.071212.epkg.Z
        1791374386 9289    IZ00559_8a.071212.epkg.Z
        3287090389 8299    IZ00559_8b.071212.epkg.Z
        565672617  9294    IZ00559_9a.071211.epkg.Z
        257555679  8302    IZ00559_9b.071211.epkg.Z
        3930477686 26525   IZ10828_08.071212.epkg.Z
        1199269029 26533   IZ10828_09.071212.epkg.Z
        358657844  26480   IZ10828_10.071212.epkg.Z
        3753492719 160768  bos.clvm.enh.5.2.0.107.U
        4180839749 131072  bos.clvm.enh.5.3.0.61.U
        3765659627 2036736 bos.rte.lvm.5.2.0.107.U
        3338925192 2665472 bos.rte.lvm.5.3.0.63.U

        csum -h MD5 (md5sum)              filename
        ----------------------------------------------------------
        73bcf7604dd13f26a7500e45468ff5f7  IY98331_05.071212.epkg.Z
        5f32179fc2156bb6e29e775aa7bff623  IY98340_5a.071211.epkg.Z
        7c47e56cadabcba0a105ffa7fc1d40fc  IY98340_5b.071211.epkg.Z
        ef3e4512c3b55091893ce733c707e1a2  IY99537_05.071212.epkg.Z
        db04be33e56169b6a8e8fd747e6948da  IZ00559_8a.071212.epkg.Z
        553f31ccf6a265333938d81eeae6dabc  IZ00559_8b.071212.epkg.Z
        2921b9d2a3dbd84591d60fddf0663798  IZ00559_9a.071211.epkg.Z
        93ce34dec8f4fa9681a2c7c86be065fc  IZ00559_9b.071211.epkg.Z
        e6b0a4a91ba197de0005bd800f06ba4e  IZ10828_08.071212.epkg.Z
        602a8c777cc27e51c3d3dbfa8ebd69be  IZ10828_09.071212.epkg.Z
        b84a5cae03921d30675e522da29da1aa  IZ10828_10.071212.epkg.Z
        2aa4b9b43ca55f74b0fac6be7bc48b66  bos.clvm.enh.5.2.0.107.U
        844e1f2ef9d388d2ddd8cf3ef6251f06  bos.clvm.enh.5.3.0.61.U
        0c73aa8f0211c400455feaa6fb8a95c4  bos.rte.lvm.5.2.0.107.U
        1b5a08eabe984d957db9a145e2a4fd06  bos.rte.lvm.5.3.0.63.U

        csum -h SHA1 (sha1sum)                    filename
        ------------------------------------------------------------------
        d9929214a4d85b986fb2e06c9b265c768c7178a9  IY98331_05.071212.epkg.Z
        0f5fbcdfbbbf505366dad160c8dec1c1ce75285e  IY98340_5a.071211.epkg.Z
        cf2cda3b8d19b73d06b69eeec7e4bae192bec689  IY98340_5b.071211.epkg.Z
        9d8727b5733bc34b8daba267b82864ef17b7156f  IY99537_05.071212.epkg.Z
        e7a366956ae7a08deb93cbd52bbbbf451d0f5565  IZ00559_8a.071212.epkg.Z
        1898733cdf6098e4f54ec36132a03ebbe0682a7e  IZ00559_8b.071212.epkg.Z
        f68c458c817f99730b193ecbd02ae24b9e51cc67  IZ00559_9a.071211.epkg.Z
        185954838c439a3c7f8e5b769aa6cc7d31123b59  IZ00559_9b.071211.epkg.Z
        6244138dc98f3fd16928b2bbcba3c5b4734e9942  IZ10828_08.071212.epkg.Z
        98bfaf44ba4bc6eba452ea074e276b8e87b41c9d  IZ10828_09.071212.epkg.Z
        2a9c0dd75bc79eba153d0a4e966d930151121d45  IZ10828_10.071212.epkg.Z
        96706ec5afd792852350d433d1bf8d8981b67336  bos.clvm.enh.5.2.0.107.U
        91f6d3a4d9ffd15d258f4bda51594dbce7011d8a  bos.clvm.enh.5.3.0.61.U
        4589a5bca998f437aac5c3bc2c222eaa51490dab  bos.rte.lvm.5.2.0.107.U
        3449afd795c24594c7a0c496f225c7148b4071ab  bos.rte.lvm.5.3.0.63.U

        To verify the sums, use the text of this advisory as input to
        csum, md5sum, or sha1sum. For example:

        csum -h SHA1 -i Advisory.asc
        md5sum -c Advisory.asc
        sha1sum -c Advisory.asc

        These sums should match exactly. The PGP signatures in the tar
        file and on this advisory can also be used to verify the
        integrity of the fixes.  If the sums or signatures cannot be
        confirmed, contact IBM AIX Security at
        security-alert@austin.ibm.com and describe the discrepancy.

     C. FIX AND INTERIM FIX INSTALLATION

        IMPORTANT: If possible, it is recommended that a mksysb backup
        of the system be created.  Verify it is both bootable and
        readable before proceeding.

        To preview a fix installation:

        installp -a -d . -p all

        To install a fix package:

        installp -a -d . -X all

        Interim fixes have had limited functional and regression
        testing but not the full regression testing that takes place
        for Service Packs; thus, IBM does not warrant the fully
        correct functionality of an interim fix.

        Interim fix management documentation can be found at:

        http://www14.software.ibm.com/webapp/set2/sas/f/aix.efixmgmt/home.html

        To preview an interim fix installation:

        emgr -e ipkg_name -p         # where ipkg_name is the name of the
                                     # interim fix package being previewed.

        To install an interim fix package:

        emgr -e ipkg_name -X         # where ipkg_name is the name of the
                                     # interim fix package being installed.

VI. WORKAROUNDS

    There are two workarounds available.

    A. OPTION 1

        Change the permissions of these commands to remove the setuid
        bit using the following commands:

        chmod 500 /usr/sbin/lchangevg
        chmod 500 /usr/sbin/ldeletepv
        chmod 500 /usr/sbin/putlvodm
        chmod 500 /usr/sbin/lvaryoffvg
        chmod 500 /usr/sbin/lvgenminor
        chmod 500 /usr/sbin/tellclvmd

        NOTE: chmod will disable functionality of these commands for
        all users except root.

    B. OPTION 2 (AIX 6.1, AIX 5.3 TL6 and TL7)

        Use the File Permissions Manager (fpm) command to manage
        setuid and setgid programs.

        fpm documentation can be found in the AIX 6 Security Redbook
        at:

        http://www.redbooks.ibm.com/abstracts/sg247430.html

        An fpm level of high will remove the setuid bit from the
        affected commands.  For example:

        fpm -l high -p    # to preview changes
        fpm -l high       # to execute changes

        NOTE: Please review the documentation before execution.  fpm
        will disable functionality of multiple commands for all users
        except root.

VII. OBTAINING FIXES

    AIX security related fixes can be downloaded from:

        ftp://aix.software.ibm.com/aix/efixes/security

    AIX fixes can be downloaded from:

        http://www.ibm.com/eserver/support/fixes/fixcentral/main/pseries/aix

    NOTE: Affected customers are urged to upgrade to the latest
    applicable Technology Level and Service Pack.

VIII. CONTACT INFORMATION

    If you would like to receive AIX Security Advisories via email,
    please visit:

        http://www14.software.ibm.com/webapp/set2/subscriptions/pqvcmjd
 
    Comments regarding the content of this announcement can be
    directed to:

        security-alert@austin.ibm.com

    To request the PGP public key that can be used to communicate
    securely with the AIX Security Team you can either:

        A. Send an email with "get key" in the subject line to:

            security-alert@austin.ibm.com

        B. Download the key from a PGP Public Key Server. The key ID is:

            0xA6A36CCC

    Please contact your local IBM AIX support center for any
    assistance.

    eServer is a trademark of International Business Machines
    Corporation.  IBM, AIX and pSeries are registered trademarks of
    International Business Machines Corporation.  All other trademarks
    are property of their respective holders.

IX. ACKNOWLEDGMENTS

    IBM discovered and fixed this vulnerability as part of its
    commitment to secure the AIX operating system.


31.6 Other filesystem commands:
===============================


df command:
-----------

df Command

Purpose
Reports information about space on file systems. This document describes the AIXr df command as well as 
the System V version of df.

Syntax
df [ [ -P ] | [  -I | -M | -i | -t | -v ] ] [ -k ] [ -m ] [ -g ] [ -s ] [FileSystem ... | File... ]

Description
The df command displays information about total space and available space on a file system. 
The FileSystem parameter specifies the name of the device on which the file system resides, the directory 
on which the file system is mounted, or the relative path name of a file system. The File parameter specifies 
a file or a directory that is not a mount point. 
If the File parameter is specified, the df command displays information for the file system on which the file 
or directory resides. 
If you do not specify the FileSystem or File parameter, the df command displays information for all 
currently mounted file systems. 
File system statistics are displayed in units of 512-byte blocks by default.

The df command gets file system space statistics from the statfs system call. However, specifying the -s flag 
gets the statistics from the virtual file system (VFS) specific file system helper. If you do not specify 
arguments with the -s flag and the helper fails to get the statistics, the statfs system call statistics 
are used. Under certain exceptional conditions, such as when a file system is being modified while 
the df command is running, the statistics displayed by the df command might not be accurate.

Note:
Some remote file systems, such as the Network File System (NFS), do not provide all the information 
that the df command needs. The df command prints blanks for statistics that the server does not provide.

flags:

-g Displays statistics in units of GB blocks. The output values for the file system statistics would be in floating point numbers 
  as value of each unit in bytes is significantly high. 
-i Displays the number of free and used i-nodes for the file system; this output is the default when the specified file system is mounted. 
-I Displays information on the total number of blocks, the used space, the free space, the percentage of used space, and the mount point for the file system. 
-k Displays statistics in units of 1024-byte blocks. 
-m Displays statistics in units of MB blocks. The output values for the file system statistics would be in floating point numbers 
   as value of each unit in bytes is significantly high. 
-M Displays the mount point information for the file system in the second column. 
-P Displays information on the file system in POSIX portable format.  
-s Gets file system statistics from the VFS specific file system helper instead of the statfs system call.
   Any arguments given when using the -s flag must be a JFS or Enhanced JFS filesystem mount point or device. 
   The filesystem must also be listed in /etc/filesystems. 
-t Includes figures for total allocated space in the output. 
-v Displays all information for the specified file system. 

examples:

To display information about all mounted file systems, enter: 

df
If your system has the /, /usr, /site, and /usr/venus file systems mounted, the output from the df command 
resembles the following: 

Filesystem 512-blocks Free   %Used   Iused  %Iused  Mounted on
/dev/hd0    19368     9976    48%     4714    5%     /
/dev/hd1    24212     4808    80%     5031   19%     /usr
/dev/hd2     9744     9352     4%     1900    4%     /site
/dev/hd3     3868     3856     0%      986    0%     /usr/venus 


To display information about /test file system in 1024-byte blocks, enter: 
df -k /test
Filesystem    1024 blocks    Free    %Used   Iused  %Iused  Mounted on 
/dev/lv11         16384     15824       4%      18      1%  /tmp/ravi1
This displays the file system statistics in 1024-byte disk blocks. 


To display information about /test file system in MB blocks, enter: 
df -m /test
Filesystem    MB blocks    Free    %Used    Iused  %Iused  Mounted on 
/dev/lv11       16.00     15.46       4%       18      1%  /tmp/ravi1
This displays file system statistics in MB disk blocks rounded off to nearest 2nd decimal digit. 


To display information about the /test file system in GB blocks, enter: 
df -g /test
Filesystem    GB blocks   Free     %Used    Iused  %Iused  Mounted on 
/dev/lv11          0.02   0.02        0%       18      1%  /tmp/ravi1
This displays file system statistics in GB disk blocks rounded off to nearest 2nd decimal digit. 


To display available space on the file system in which your current directory resides, enter: 

cd/
df .
The output from this command resembles the following: 

Device   512-blocks  free   %used   iused   %iused  Mounted on
/dev/hd4    19368    9976    48%     4714    5%     / 


The defragfs command:
---------------------

defragfs Command

Purpose
Increases a file system's contiguous free space.

Syntax
defragfs [ -q | -r | -s] { Device | FileSystem }

Description
The defragfs command increases a file system's contiguous free space by reorganizing allocations to be 
contiguous rather than scattered across the disk. The file system to be defragmented can be specified 
with the Device variable, which is the path name of the logical volume (for example, /dev/hd4). 
It can also be specified with the FileSystem variable, which is the mount point in the /etc/filesystems file.

The defragfs command is intended for fragmented and compressed file systems. However, you can use 
the defragfs command to increase contiguous free space in nonfragmented file systems.

You must mount the file system read-write for this command to run successfully. Using the -q flag, 
the -r flag or the -s flag generates a fragmentation report. These flags do not alter the file system.

The defragfs command is slow against a JFS2 file system with a snapshot due to the amount of data 
that must be copied into snapshot storage object. The defragfs command issues a warning message 
if there are snapshots. The snapshot command can be used to delete the snapshots and then used again 
to create a new snapshot after the defragfs command completes.

Flags

-q Reports the current state of the file system. 
-r Reports the current state of the file system and the state that would result if 
   the defragfs command is run without either the -q, -r or -s flag. 
-s Reports the fragmentation in the file system. This option causes defragfs to pass through 
   meta data in the file system which may result in degraded performance. 

Output
On a JFS filesystem, the definitions for the messages reported by the defragfs command are as follows:

Number of free fragments 
The number of free fragments in the file system. 
Number of allocated fragments 
The number of allocated fragments in the file system. 
Number of free spaces shorter than a block 
The number of free spaces within the file system that are shorter than a block. A free space is a set of contiguous fragments that are not allocated. 
Number of free fragments in short free spaces 
The total number of fragments in all the short free spaces. A short free space is one that is shorter than a block. 
Number of fragments moved 
The total number of fragments moved. 
Number of logical blocks moved 
The total number of logical blocks moved. 
Number of allocation attempts 
The number of times free fragments were reallocated. 
Number of exact matches 
The number of times the fragments that are moved would fit exactly in some free space. 
Total number of fragments 
The total number of fragments in the file system. 
Number of fragments that may be migrated 
The number of fragments that may be moved during defragmentation. 
FileSystem filesystem is n percent fragmented 
Shows to what extent the file system is fragmented in percentage. 
On a JFS2 filesystem the definitions for the messages reported by the defragfs command are as follows:

Total allocation groups 
The number of allocation groups in the file system. Allocation groups divide the space on a file system into chunks. Allocation groups allow JFS2 resource allocation policies to use well known methods for achieving good I/O performance. 
Allocation groups defragmented 
The number of allocation groups that were defragmented. 
Allocation groups skipped - entirely free 
The number of allocation groups that were skipped because they were entirely free. 
Allocation groups skipped - too few free blocks 
The number of allocation groups that were skipped because there were too few free blocks in them for reallocation. 
Allocation groups skipped - contains a large contiguous free space 
The number of allocation groups that were skipped because they contained a large contiguous free space which is not worth defragmenting. 
Allocation groups are candidates for defragmenting 
The number of allocation groups that are fit for defragmenting. 
Average number of free runs in candidate allocation groups 
The average number of free runs per allocation group, for allocation groups that are found fit for defragmentation. A free run is a contiguous set of blocks which are not allocated. 
Total number of blocks 
The total number of blocks in the file system. 
Number of blocks that may be migrated 
The number of blocks that may be moved during defragmentation. 
FileSystem filesystem is n percent fragmented 
Shows to what extent the file system is fragmented in percentage. 


Examples:
To defragment the /data1 file system located on the /dev/lv00 logical volume, enter: 
defragfs /data1

To defragment the /data1 file system by specifying its mount point, enter: 
defragfs /data1

To generate a report on the /data1 file system that indicates its current status as well as its status 
after being defragmented, enter: 
defragfs  -r /data1

To generate a report on the fragmentation in the /data1 file system, enter: 
defragfs -s /data1


The fsck command:
-----------------

Purpose
Checks file system consistency and interactively repairs the file system.

Syntax
fsck [ -n ] [ -p ] [ -y ] [ -dBlockNumber ] [ -f ] [ -ii-NodeNumber ] [ -o Options ] [ -tFile ] 
     [ -V VfsName ] [ FileSystem1 - FileSystem2 ... ]

Description
Attention: Always run the fsck command on file systems after a system malfunction. Corrective actions 
may result in some loss of data. The default action for each consistency correction is to wait for the operator 
to enter yes or no. If you do not have write permission for an affected file system, the fsck command defaults 
to a no response in spite of your actual response.

Notes:
The fsck command does not make corrections to a mounted file system. 
The fsck command can be run on a mounted file system for reasons other than repairs. 
However, inaccurate error messages may be returned when the file system is mounted. 
The fsck command checks and interactively repairs inconsistent file systems. You should run this command 
before mounting any file system. You must be able to read the device file on which the file system resides 
(for example, the /dev/hd0 device). Normally, the file system is consistent, and the fsck command merely reports 
on the number of files, used blocks, and free blocks in the file system. If the file system is inconsistent, 
the fsck command displays information about the inconsistencies found and prompts you for permission to repair them.

The fsck command is conservative in its repair efforts and tries to avoid actions that might result in the 
loss of valid data. In certain cases, however, the fsck command recommends the destruction of a damaged file. 
If you do not allow the fsck command to perform the necessary repairs, an inconsistent file system may result. 
Mounting an inconsistent file system may result in a system crash.

If a JFS2 file system has snapshots, the fsck command will attempt to preserve them. If this action fails, 
the snapshots cannot be guaranteed to contain all of the before-images from the snapped file system. 
The fsck command will delete the snapshots and the snapshot logical volumes.

If you do not specify a file system with the FileSystem parameter, the fsck command checks all file systems 
listed in the /etc/filesystems file for which the check attribute is set to True. You can enable this type of 
checking by adding a line in the stanza, as follows:

check=true
You can also perform checks on multiple file systems by grouping the file systems in the /etc/filesystems file. 
To do so, change the check attribute in the /etc/filesystems file as follows:

check=Number
The Number parameter tells the fsck command which group contains a particular file system. 
File systems that use a common log device should be placed in the same group. File systems are checked, 
one at a time, in group order, and then in the order that they are listed in the /etc/filesystems file. 
All check=true file systems are in group 1. The fsck command attempts to check the root file system before 
any other file system regardless of the order specified on the command line or in the /etc/filesystems file.

The fsck command checks for the following inconsistencies:

-Blocks or fragments allocated to multiple files. 
-i-nodes containing block or fragment numbers that overlap. 
-i-nodes containing block or fragment numbers out of range. 
-Discrepancies between the number of directory references to a file and the link count of the file. 
-Illegally allocated blocks or fragments. 
-i-nodes containing block or fragment numbers that are marked free in the disk map. 
-i-nodes containing corrupt block or fragment numbers. 
-A fragment that is not the last disk address in an i-node. This check does not apply to compressed file systems. 
-Files larger than 32KB containing a fragment. This check does not apply to compressed file systems. 
-Size checks: 
 Incorrect number of blocks. 
 Directory size not a multiple of 512 bytes.
 These checks do not apply to compressed file systems. 
-Directory checks: 
 Directory entry containing an i-node number marked free in the i-node map. 
 i-node number out of range. 
 Dot (.) link missing or not pointing to itself. 
 Dot dot (..) link missing or not pointing to the parent directory. 
 Files that are not referenced or directories that are not reachable.
-Inconsistent disk map. 
-Inconsistent i-node map.
-Orphaned files and directories (those that cannot be reached) are, if you allow it, reconnected by placing them 
 in the lost+found subdirectory in the root directory of the file system. The name assigned is the i-node number. 
 If you do not allow the fsck command to reattach an orphaned file, it requests permission to destroy the file.

In addition to its messages, the fsck command records the outcome of its checks and repairs through its exit value. 
This exit value can be any sum of the following conditions:

0 All checked file systems are now okay. 
2 The fsck command was interrupted before it could complete checks or repairs. 
4 The fsck command changed the file system; the user must restart the system immediately. 
8 The file system contains unrepaired damage. 

When the system is booted from a disk, the boot process explicitly runs the fsck command, 
specified with the -f and -p flags on the /, /usr, /var, and /tmp file systems. If the fsck command 
is unsuccessful on any of these file systems, the system does not boot. Booting from removable media and 
performing maintenance work will then be required before such a system will boot.

If the fsck command successfully runs on /, /usr, /var, and /tmp, normal system initialization continues. 
During normal system initialization, the fsck command specified with the -f and -p flags runs from the 
/etc/rc file. This command sequence checks all file systems in which the check attribute is set to True (check=true). 
If the fsck command executed from the /etc/rc file is unable to guarantee the consistency of any file system, 
system initialization continues. However, the mount of any inconsistent file systems may fail. 
A mount failure may cause incomplete system initialization.

Note:
By default, the /, /usr, /var, and /tmp file systems have the check attribute set to False (check=false) 
in their /etc/filesystem stanzas. The attribute is set to False for the following reasons: 
The boot process explicitly runs the fsck command on the /, /usr, /var, and /tmp file systems. 
The /, /usr, /var, and /tmp file systems are mounted when the /etc/rc file is executed. The fsck command 
will not modify a mounted file system. Furthermore, the fsck command run on a mounted file system produces 
unreliable results.
You can use the File Systems application in Web-based System Manager (wsm) to change file system characteristics. 
You could also use the System Management Interface Tool (SMIT) smit fsck fast path to run this command.

Flags

-dBlockNumber Searches for references to a specified disk block. Whenever the fsck command encounters a file that 
contains a specified block, it displays the i-node number and all path names that refer to it. 
For JFS2 filesystems, the i-node numbers referencing the specified block will be displayed but not 
their path names." 
-f Performs a fast check. Under normal circumstances, the only file systems likely to be affected by halting 
the system without shutting down properly are those that are mounted when the system stops. The -f flag prompts 
the fsck command not to check file systems that were unmounted successfully. The fsck command determines this 
by inspecting the s_fmod flag in the file system superblock. 
This flag is set whenever a file system is mounted and cleared when it is unmounted successfully. 
If a file system is unmounted successfully, it is unlikely to have any problems. Because most file systems 
are unmounted successfully, not checking those file systems can reduce the checking time.
 
-ii-NodeNumber Searches for references to a specified i-node. Whenever the fsck command encounters a directory 
 reference to a specified i-node, it displays the full path name of the reference. 
-n Assumes a no response to all questions asked by the fsck command; does not open the specified file system 
 for writing. 
-o Options Passes comma-separated options to the fsck command. The following options are currently supported 
 for JFS (these options are obsolete for newer file systems and can be ignored): 
mountable 
Causes the fsck command to exit with success, returning a value of 0, if the file system in question is mountable (clean). 
If the file system is not mountable, the fsck command exits returning with a value of 8. 
mytype 
Causes the fsck command to exit with success (0) if the file system in question is of the same type as either specified in the 
/etc/filesystems file or by the -V flag on the command line. Otherwise, 8 is returned. For example, 
fsck -o mytype -V jfs / exits with a value of 0 if / (the root file system) is a journaled file system.  
-p Does not display messages about minor problems but fixes them automatically. This flag does not grant the wholesale license that the -y flag does and is useful for performing automatic checks when the system is started normally. You should use this flag as part of the system startup procedures, whenever the system is being run automatically. 
If the primary superblock is corrupt, the secondary superblock is verified and copied to the primary superblock. 
-tFile Specifies a File parameter as a scratch file on a file system other than the one being checked, if the fsck command cannot obtain enough memory to keep its tables. If you do not specify the -t flag and the fsck command needs a scratch file, it prompts you for the name of the scratch file. However, if you have specified the -p flag, the fsck command is unsuccessful. If the scratch file is not a special file, it is removed when the fsck command ends. 
-V VfsName Uses the description of the virtual file system specified by the VFSName variable for the file system instead of using the /etc/filesystems file to determine the description. If the -V VfsName flag is not specified on the command line, the /etc/filesystems file is checked and the vfs=Attribute of the matching stanza is assumed to be the correct file system type. 
-y Assumes a yes response to all questions asked by the fsck command. This flag lets the fsck command take any action it considers necessary. Use this flag only on severely damaged file systems. 

Examples
To check all the default file systems, enter: 

fsck
This command checks all the file systems marked check=true in the /etc/filesystems file. 
This form of the fsck command asks you for permission before making any changes to a file system.

To fix minor problems with the default file systems automatically, enter: 

fsck -p
To check a specific file system, enter: 

fsck /dev/hd1
This command checks the unmounted file system located on the /dev/hd1 device.


31.6 DESCRIPTOR AREA'S:
-----------------------

- 1. VOLUME GROUP DESCRIPTOR AREA, VGDA 

Global to the VG:
The VGDA, located at the beginning of each physical volume, contains information that describes all
the LV's and all the PV's that belong to the VG of which that PV is a member.
The VGDA makes a VG selfdescribing. An AIX System can read the VGDA on a disk, and from that, can
determine what PV's and LV's are part of this VG.
There are one or two copies per disk.

- 2. VOLUME GROUP STATUS AREA, VGSA

Tracks the state of mirrorred copies.
The VGSA contains state information about physical partitions and physical volumes.
For example, the VGSA knows if one PV in a VG is unavailable.

Each PV has at least one VGDA/VGSA. The number of VGDA's contained on a single disk
varies according to the number of disks in the VG.

- 3. LOGICAL VOLUME CONTROL BLOCK, LVCB

Contains LV attributes (policies, number of copies).
The LVCB is located at the start of every LV. It contains information about the logical volume. 
You can however, use the mklv command with the -T option, to request that the LVCB will not
be stored in the beginning of the LV. 

With Scalable VG's, LVCM info is no longer stored in the first user block of any LV.
All relevant LVCM info is kept in the VGDA.


31.7 The lqueryvg command:
--------------------------

The lqueryvg command reads the VGDA from a specified disk in a VG.

Example:

# lqueryvg -p hdisk1 -At
# lqueryvg -Atp hdisk0

-p: which PV
-A: show all available information
-t: show descriptive tags

Example:

#lqueryvg -Atp hdisk0
Max LVs:        256
PP Size:        25
Free PPs:       468
LV count:       20
PV count:       2
Total VGDAs:    3
Conc Allowed:   0
MAX PPs per PV  1016
MAX PVs:        32
Conc Autovaryo  0
Varied on Conc  0
Logical:        00c665ed00004c0000000112b7408848.1   hd5 1
                00c665ed00004c0000000112b7408848.2   hd6 1
                00c665ed00004c0000000112b7408848.3   hd8 1
                00c665ed00004c0000000112b7408848.4   hd4 1
                00c665ed00004c0000000112b7408848.5   hd2 1
                00c665ed00004c0000000112b7408848.6   hd9var 1
                00c665ed00004c0000000112b7408848.7   hd3 1
                00c665ed00004c0000000112b7408848.8   hd1 1
                00c665ed00004c0000000112b7408848.9   hd10opt 1
                00c665ed00004c0000000112b7408848.10  hd7 1
                00c665ed00004c0000000112b7408848.11  hd7x 1
                00c665ed00004c0000000112b7408848.12  beheerlv 1
                00c665ed00004c0000000112b7408848.13  varperflv 1
                00c665ed00004c0000000112b7408848.14  loglv00 1
                00c665ed00004c0000000112b7408848.15  db2_server_v8 1
                00c665ed00004c0000000112b7408848.16  db2_var_v8 1
                00c665ed00004c0000000112b7408848.17  db2_admin_v8 1
                00c665ed00004c0000000112b7408848.18  db2_adminlog_v8 1
                00c665ed00004c0000000112b7408848.19  db2_dasscr_v8 1
                00c665ed00004c0000000112b7408848.20  db2_Fixpak10 1
Physical:       00c665edb74079bc                2   0
                00c665edb7f2987a                1   0
Total PPs:      1022
LTG size:       128
HOT SPARE:      0
AUTO SYNC:      0
VG PERMISSION:  0
SNAPSHOT VG:    0
IS_PRIMARY VG:  0
PSNFSTPP:       4352
VARYON MODE:    0
VG Type:        0
Max PPs:        32512


31.8 The lquerypv command:
--------------------------

-------
How do I find out what the maximum supported logical track group (LTG) size of my hard disk? 

You can use the lquerypv command with the -M flag. The output gives the LTG size in KB. For instance, 
the LTG size for hdisk0 in the following example is 256 KB.

/usr/sbin/lquerypv -M hdisk0
256
------ 

run 

lquerypv -h core 6b0 

to find the executable (probably man, but man may have called 
something else in the background) 

then run 

dbx path_/to_/executable core 

and run the subcommand 


dbx> where 

and paste the stack output, should be able to find it from there. also 
paste the level of fileset you are on for the executable 


lslpp -w /path_/to_/executable -> this will give fileset_name 
lslpp -l fileset_name 

-------

Wie l,sst sich ein Storage Lock auf einer SAN-Disk brechen?
Endlich die ersehnte SAN-Disk bekommen und dann das, es l,sst sich keine Volume Group darauf anlegen. 

# mkvg -f vpath100 

gibt einen I/O Error. Was tun? 
H"chstwahrscheinlich befindet sich noch ein Lock auf der SAN-Disk. Dies l,sst sich mit dem Befehl 

# lquerypv -ch /dev/vpath100

aufbrechen und die Volume Group kann angelegt werden. 


-------

# lquerypv -h /dev/hdisk9 80 10
  00000080   00001155 583CD4B0 00000000 00000000  |...UX<..........|


# lquerypv -h /dev/hdisk1
00000000   C9C2D4C1 00000000 00000000 00000000  |................|
00000010   00000000 00000000 00000000 00000000  |................|
00000020   00000000 00000000 00000000 00000000  |................|
00000030   00000000 00000000 00000000 00000000  |................|
00000040   00000000 00000000 00000000 00000000  |................|
00000050   00000000 00000000 00000000 00000000  |................|
00000060   00000000 00000000 00000000 00000000  |................|
00000070   00000000 00000000 00000000 00000000  |................|
00000080   00C665ED B7F2987A 00000000 00000000  |..e....z........|
00000090   00000000 00000000 00000000 00000000  |................|
000000A0   00000000 00000000 00000000 00000000  |................|
000000B0   00000000 00000000 00000000 00000000  |................|
000000C0   00000000 00000000 00000000 00000000  |................|
000000D0   00000000 00000000 00000000 00000000  |................|
000000E0   00000000 00000000 00000000 00000000  |................|
000000F0   00000000 00000000 00000000 00000000  |................|

# lquerypv -h /dev/hdisk0 80 10

root@zd93l12:/root#lquerypv -h /dev/hdisk0 80 10
00000080   00C665ED B74079BC 00000000 00000000  |..e..@y.........|


31.9 The getlvcb command:
-------------------------

The LVCB stores attributes of a LV. The getlvcb command reads the LVCB of a specified LV.
Displays a formatted output of the data in the LVCB of a LV.

Example:

# getlvcb -At hd2

# getlvcb -TA hd3 
Displays the information held in the LVCB of LV hd3. 


31.10 The putlvcb command:
--------------------------

Writes the control block information (only the specified fields) into block 0 of a logical volume (LVCB).


# putlvcb -t jfs lvdata
writes the LV type jfs to the LVCB of LV lvdata. 


32. Some Filesystem related errors in AIX:
==========================================


32.1 The root / Filesystem is full:
===================================


Dealing with a 100% full root (/) filesystem in AIX

Number one - DON'T Re-boot.
 Do a chfs -a size=+1 /  (enter).  The root filesystem will be increased by one
physical partition.

If the box is re-booted, shutdown, or crashes do the following:

Load the AIX Installation CD #1 and type shutdown -Fr.
Upon re-boot press F1 to enter the Systems Management Services (SMS) Menu.
Click on the Multi-Boot icon.


The bootlist needs to be changed so that CD0 is the first boot device.
Shutdown and re-boot.

Press F1 and enter.
Press 1 and enter.
Select Maintenance Mode option (3?).
Select Access a Root Volume Group.
Select  the option that does NOT mount the filesystems.
At the prompt, type mount /dev/hd4 (this is where the root filesystem lives)
/mnt
At the prompt type mount /dev/hd2 /usr

Type df and enter.  Note filesystem sizes.

Now, chfs -a size=+1 /

Type:  df and enter.  Note that the filesystem / is larger.
Type:  sync

You need to change your bootlist to boot off of hdisk0:
 Type:  bootlist -m normal hdisk0 hdisk1 rmt0 cd0 and  enter.
Type:  shutdown -Fr.

the system will re-boot and should come back online in it's proper state.


32.2 Fixing ODM problems on a VG which is not the rootvg:
=========================================================

In the following examle, the VG is called "myvg" consisting of the Physical Volume hdisk3.

1. Unmount all filesystems in that VG first, otherwise you cannot varyoff the VG.
Then varyoff the VG.

# varyoffvg myvg

2. Now remove the complete information of that VG from ODM. The VGDA and LVCB
on the actual disks are NOT touched by the exportvg command.

# exportvg myvg

3. Now import the VG and create new ODM objects associated with that VG:

# importvg -y myvg hdisk3

You only need to specify one intact PV of the VG in the above command. Any disk in the VG
will have a VGDA which contains all neccessary information.
The importvg command reads the VGDA and LVCB on that disk and creates completely new ODM entries.


32.3 Fixing ODM problems on the rootvg:
=======================================

rvgrecover:
-----------

You can try to use the "rvgrecover" shell script.
The rootvg cannot be varied off, like an ordinary VG, so the solution from the
former section cannot be used.
But the script "rvgrecover" issues a series of odmdelete statements, just like exportvg does.
At the end of the script, an importvg is done.
The importvg command, reads the VGDA and LVCB from the boot disk, resulting in new ODM entries.

The rvgrecover script has the following contents:

Reinitializing the rootvg Volume Group 
To reinitialize the rootvg volume group, copy the shell script to /bin/rvgrecover and run 
the following to make that file executable: 

chmod +x /bin/rvgrecover 
Then run: 

/bin/rvgrecover
Use the following shell script to reinitialize the ODM entries for the rootvg volume group: 

PV=/dev/ipldevice  # PV=hdisk0
VG=rootvg
    cp /etc/objrepos/CuAt /etc/objrepos/CuAt.$$
    cp /etc/objrepos/CuDep /etc/objrepos/CuDep.$$
    cp /etc/objrepos/CuDv /etc/objrepos/CuDv.$$
    cp /etc/objrepos/CuDvDr /etc/objrepos/CuDvDr.$$
    lqueryvg -Lp $PV | awk '{ print $2 }' | while read LVname; do
        odmdelete -q "name = $LVname" -o CuAt
        odmdelete -q "name = $LVname" -o CuDv
        odmdelete -q "value3 = $LVname" -o CuDvDr
    done
    odmdelete -q "name = $VG" -o CuAt
    odmdelete -q "parent = $VG" -o CuDv
    odmdelete -q "name = $VG" -o CuDv
    odmdelete -q "name = $VG" -o CuDep
    odmdelete -q "dependency = $VG" -o CuDep
    odmdelete -q "value1 = 10" -o CuDvDr
    odmdelete -q "value3 = $VG" -o CuDvDr
    importvg -y $VG $PV      # ignore lvaryoffvg errors
    varyonvg $VG


redefinevg:
-----------

redefinevg Command

Purpose
Redefines the set of physical volumes of the given volume group in the device configuration database. 

Syntax
redefinevg { -d Device | -i Vgid } VolumeGroup

Description
During normal operations the device configuration database remains consistent with the 
Logical Volume Manager (LVM) information in the reserved area on the physical volumes. 
If inconsistencies occur between the device configuration database and the LVM, the redefinevg command 
determines which physical volumes belong to the specified volume group and re-enters this information 
in the device configuration database. The redefinevg command checks for inconsistencies by reading 
the reserved areas of all the configured physical volumes attached to the system.


Note: To use this command, you must either have root user authority or be a member of the system group.

Flags

-d Device The volume group ID, Vgid, is read from the specified physical volume device. 
   You can specify the Vgid of any physical volume belonging to the volume group that you are redefining. 
-i Vgid The volume group identification number of the volume group to be redefined. 

Example

To redefine rootvg physical volumes in the Device Configuration Database, enter a command similar to the following:

# redefinevg -d hdisk0 rootvg


synclvodm:
----------

synclvodm Command 
Purpose
Synchronizes or rebuilds the logical volume control block, the device configuration database, 
and the volume group descriptor areas on the physical volumes. 

Syntax
synclvodm [ -v ] VolumeGroup [ LogicalVolume ... ] 


Description
During normal operations, the device configuration database remains consistent with the 
logical volume manager information in the logical volume control blocks and the volume group descriptor 
areas on the physical volumes. If for some reason the device configuration database is not consistent 
with Logical Volume Manager information, the synclvodm command can be used to resynchronize the database. 
The volume group must be active for the resynchronization to occur (see varyonvg). 
If logical volume names are specified, only the information related to those logical volumes is updated. 

Attention: Do not remove the /dev entries for volume groups or logical volumes. Do not change the 
device configuration database entries for volume groups or logical volumes using the object data manager. 
Note: To use this command, you must either have root user authority or be a member of the system group.
Flags
-v verbose 

Example

To synchronize the device configuration database with the logical volume manager information for rootvg, 
enter the following: 

synclvodm rootvg


32.4 How to Replace a Disk?: 
============================

1. Short version for normal VG (not rootvg) and the disk is working:
--------------------------------------------------------------------

extendvg VolumeGroupName hdiskY
migratepv hdiskX hdiskY
reducevg -d VolumeGroupName hdiskX


2. More Detail:
---------------

2.1 The disk is mirrored:
-------------------------

1. Remove all copies from the disk:
   # unmirrorvg vg_name hdiskX

2. Remove disk from VG:
   # reducevg vg_name hdiskX

3. Remove disk from ODM:
   # rmdev -l hdiskX -d

4. Add new disk to the system.

5. Add the new disk to the VG:
   # extendvg vg_name hdiskY

6. Create new copies:
   # mirrorvg vg_name 
   # syncvg vg_name


2.2 The disk was not mirrored, or you want to replace a working disk:
---------------------------------------------------------------------

1. Add the new disk to the system.

2. Add the disk to the VG:
   # extendvg vg_name hdiskY

3. Migrate old disk to new disk:
   # migratepv hdiskX hdiskY

4. Remove old disk from VG:
   # reducevg vg_name hdiskX

5. Remove old disk from ODM:
   # rmdev -l hdiskX -d


2.3 Replace the disk in the rootvg:
-----------------------------------

1. Add the new disk to the system.

2. Add the disk to the VG:
   # extendvg rootvg hdiskY

3. The diskX contains hd5? If so:

   # migratepv -l hd5 hdiskX hdiskY
   # bosboot -ad /dev/hdiskY
   # chpv -c hdiskX
   # bootlist -m normal hdiskY

   If hdiskX contains the primary dump device, you must deactivate it:
   # sysdumpdev -p /dev/sysdumpnull

4. Migrate old disk to new disk:
   # migratepv hdiskX hdiskY

   If the primary dump device has been deactivated, activate it again
   # sysdumpdev -p /dev/hdX

5. Remove old disk from VG:
   # reducevg rootvg hdiskX

6. Remove old disk from ODM:
   # rmdev -l hdiskX -d


32.5 Filesystem errors:
=======================


32.5.1 ksh: Invalid file system control data detected:
======================================================

Note 1:
-------

Q:

Anybody recognize this? This directory seems to be missing the ".", I can't 
umount, can't remove the directory, can't copy a good directory over it, 
etc. 

spiderman# cd probes 
spiderman# pwd 
/opt/diagnostics/probes 
spiderman# ls -la 
ls: 0653-341 The file . does not exist. 
spiderman# cd .. 
spiderman# ls -la probes 
ls: probes: Invalid file system control data detected. 
total 0 
spiderman# 

spiderman# fuser /opt 
/opt: 
spiderman# umount /opt 
umount: 0506-349 Cannot unmount /dev/hd10opt: The requested resource is 
busy. 
spiderman# umount /dev/hd10opt 
umount: 0506-349 Cannot unmount /dev/hd10opt: The requested resource is 
busy. 

spiderman# fsck /opt 

** Checking /dev/hd10opt (/opt) MOUNTED FILE SYSTEM; WRITING SUPPRESSED; 
Checking a mounted filesystem does not produce dependable results. 
** Phase 1 - Check Blocks and Sizes 
** Phase 2 - Check Pathnames 
DIRECTORY CORRUPTED (NOT FIXED) 
DIRECTORY CORRUPTED (NOT FIXED) 
Directory /diagnostics/probes, '.' entry is missing. (NOT FIXED) 
Directory /diagnostics/probes, '..' entry is missing. (NOT FIXED) 
** Phase 3 - Check Connectivity 
** Phase 4 - Check Reference Counts 
link count directory I@98 owner=bin mode$0755 
sizeQ2 mtime=May 13 14:54 2005 
count 3 should be 2 (NOT ADJUSTED) 
link count directory I@99 owner=bin mode$0755 
size24 mtime=Jan 10 13:45 2005 
count 2 should be 1 (NOT ADJUSTED) 
Unreferenced file IA06 owner=bin mode0555 
sizee56 mtime=Jul 07 14:25 2004 (NOT RECONNECTED) 
Unreferenced file IA06 (NOT CLEARED) 
Unreferenced file IA07 owner=bin mode0555 
size)12 mtime=Jul 07 14:25 2004 (NOT RECONNECTED) 
etc....


A:

Some good news here. Yes, your directory is hosed, but the important 
things is that all a directory is a repository for storing inode numbers 
and associated (human readable) file names. Since fsck is so nicely 
generating all of those now currently inaccessible inode numbers, a find 
command can be used to move them into a new directory. Once the old 
directory is empty, you can (hopefully) rm -r it. 

Here's what you need to do. 

a) Get all the inode numbers generated from your fsck 
b) put them into a variable (e.g. lost_inodes="4099 4106....etc." 
c) Make a target directory for the lost inodes to be moved into: 
mkdir /tmp/recovery 
d) cd into your problem File System: 
cd /opt 
d) Run a loop using find: 
for i in ${lost_inodes} 
do 
find . -inum ${i} mv * /tmp/recovery \; 
echo "Moved and recovered inode # ${i}" 
done 

That should do it. Let me know if it works ok! BTW, the new "file 
name" should be the inode number of the file. You will have to rename 
the files as needed. 


Note 2: IY94101: J2_DMAP_CORRUPT ERROR REPORT AFTER SHRINKING JFS2 FILESYSTEM
-----------------------------------------------------------------------------

http://www-1.ibm.com/support/docview.wss?uid=isg1IY94101

IY94101: J2_DMAP_CORRUPT ERROR REPORT AFTER SHRINKING JFS2 FILESYSTEM

APAR status
Closed as program error.

Error description 
After shrinking a filesystem, J2_DMAP_CORRUPT reports
appear in the error report and some file creates/writes
fail with "Invalid file system control data detected".
Local fix 
Problem summary 
Problem conclusion 
Temporary fix 
Comments 
APAR information 
APAR number IY94101 
Reported component name AIX 5.3 
Reported component ID 5765G0300 
Reported release 530 
Status CLOSED PER 
PE NoPE 
HIPER NoHIPER 
Submitted date 2007-01-26 
Closed date 2007-01-29 
Last modified date 2007-05-25 

APAR is sysrouted FROM one or more of the following:

APAR is sysrouted TO one or more of the following:

Publications Referenced


Fix information 
Fixed component name AIX 5.3 
Fixed component ID 5765G0300 


Note 3:
-------

Q:

Since applying ML7 for AIX 5.1 I have been getting file corruption error 
messages on a particular filesystem and the only way to fix it is to umount 
the filesystem and fsck it. I thought it might be a hardware problem but 
now it is also happening on another machine I put the ML7 on and it is 
happening to the same filesystem (one machine is a test server of the 
other). The only unique thing about the filesystem is that it is not in 
rootvg and it is large -1281228 1024-blocks. Has anyone heard of this? 
Below is the error I am getting: 
LABEL: JFS_META_CORRUPTION 
IDENTIFIER: 684A365B 


Date/Time: Tue Apr 26 13:45:26 EDT 
Sequence Number: 2023 
Machine Id: 0000F11F4C00 
Node Id: XX00 
Class: U 
Type: UNKN 
Resource Name: SYSPFS 
Resource Class: NONE 
Resource Type: NONE 
Location: NONE 
VPD: 


Description 
FILE SYSTEM CORRUPTION 


Probable Causes 
INVALID FILE SYSTEM CONTROL DATA 


        Recommended Actions 
        PERFORM FULL FILE SYSTEM RECOVERY USING FSCK UTILITY OBTAIN 
DUMP 
        CHECK ERROR LOG FOR ADDITIONAL RELATED ENTRIES 


Failure Causes 
ADAPTER HARDWARE OR MICROCODE 
DISK DRIVE HARDWARE OR MICROCODE 
SOFTWARE PROGRAM 
STORAGE CABLE LOOSE, DEFECTIVE, OR UNTERMINATED 


        Recommended Actions 
        CHECK CABLES AND THEIR CONNECTIONS 
        INSTALL LATEST ADAPTER AND DRIVE MICROCODE 
        INSTALL LATEST STORAGE DEVICE DRIVERS 
        IF PROBLEM PERSISTS, CONTACT APPROPRIATE SERVICE REPRESENTATIVE 


Detail Data 
FILE NAME 
xix_lookup.c 
LINE NO. 
         300 
MAJOR/MINOR DEVICE NUMBER 
0026 0006 
ADDITIONAL INFORMATION 
4A46 5345 426E 8C46 0000 000E 0000 001D 0003 0610 0000 0000 0000 0000 0000 
0002 
164D A330 0001 86D3 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 
0000 
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 
0000 
--------------------------------------------------------------------------- 
LABEL: JFS_FSCK_REQUIRED 
IDENTIFIER: CD546B25 


Date/Time: Tue Apr 26 13:45:26 EDT 
Sequence Number: 2022 
Machine Id: 0000F11F4C00 
Node Id: XX00 
Class: O 
Type: INFO 
Resource Name: SYSPFS 


Description 
FILE SYSTEM RECOVERY REQUIRED 


        Recommended Actions 
        PERFORM FULL FILE SYSTEM RECOVERY USING FSCK UTILITY 


Detail Data 
MAJOR/MINOR DEVICE NUMBER 
0026 0006 
FILE SYSTEM DEVICE AND MOUNT POINT 
/dev/lv04, /opt/egate 


Note 3:
-------

Q: 

How can I remove a bizarre, irremovable file from a directory? I've tried every way of using 
/bin/rm and nothing works." 

A: 

In some rare cases a strangely-named file will show itself in your directory and appear to be 
un-removable with the rm command. Here is will the use of ls -li and find with its -inum [inode] 
primary does the job. 
Let's say that ls -l shows your irremovable as 

-rw-------  1 smith  smith  0 Feb  1 09:22 ?*?*P

Type: 

ls -li

to get the index node, or inode. 

153805 -rw-------  1 smith  smith  0 Feb  1 09:22 ?*?^P

The inode for this file is 153805. Use find -inum [inode] to make sure that the file is correctly identified. 


%  find -inum 153805 -print
./?*?*P

Here, we see that it is. Then used the -exec functionality to do the remove. . 
  
% find . -inum 153805 -print -exec /bin/rm {} \;

Note that if this strangely named file were not of zero-length, it might contain accidentally misplaced 
and wanted data. Then you might want to determine what kind of data the file contains and move the file 
to some temporary directory for further investigation, for example: 

% find . -inum 153805 -print -exec /bin/mv {} unknown.file \;

Will rename the file to unknown.file, so you can easily inspect it. 

Another way to remove strangely-named files is to use "ls -q" or "cat -v" to show the special characters, 
and then use shell's globbing mechanism to delete the file. 

$ ls
-????*'?
$ ls | cat -v
-^B^C?^?*'

$ rm ./-'^B'*           -- achieved by typing control-V control-B
$ ls


the argument given to rm is a judicious selection of glob wildcards (*'s) and sufficient control characters 
to uniquely identify the file. The leading "./" is useful when the file begins with a hyphen. 
These binary name files are caused by: 

* accidental cut-and-pastes to shell prompts - especially when you paste something of the form: "junk > garbage" 
because the shell creates the file "garbage" before trying to execute the command "junk" 

* filesystem corruption (in which case touching the filesystem any more can really stuff things up) 
If you discover that you have two files of the same name, one of the files probably has a bizarre 
(and unprintable) character in its name. Most probably, this unprintable character is a backspace. 

For example: 


    $ ls
    filename filename
    $ ls -q
    filename fl?ilename
    $ ls | cat -v
    filename
    fl^Hilename


32.5.2 More on Filesystem errors (1):
=====================================

Note 1:
-------

Q:

Hi all, 

I have a error message complaining about filesystem being full. 
but df does not sure any filesystem being full. 
The error report gives me the major/minor number: 0027/0004 
I went to /dev dir, and searched for the numbers, but it turns out to be ptyp4. 
Why is that? What does this mean? 

Any suggestion? 

A:

Those numbers are reported in hex, the actual major/minor #'s 
are 39 and 4

A:

Convert the errpt #'s to hex. The use ls -l to find them. 


Note 2:
-------

Q:

Hi, 
I get a error concerning a filesystem. 
Now I have 2 questions: 


- What is the way to find out which filesystems is concerned? 
- What can I do? Because all fs have unused space. I cannot find any fs 
with 100% in use. 

LABEL:            J2_FS_FULL
IDENTIFIER: CED6B4B5
Date/Time:       Mon Dec 27 12:49:35 NFT
Sequence Number: 3420
Machine Id:      00599DDD4C00
Node Id:         srvdms0
Class:           O
Type:            INFO
Resource Name:   SYSJ2
Description
UNABLE TO ALLOCATE SPACE IN FILE SYSTEM
Probable Causes
FILE SYSTEM FULL
 Recommended Actions
 INCREASE THE SIZE OF THE ASSOCIATED FILE SYSTEM  REMOVE UNNECESSARY
DATA FROM FILE SYSTEM  USE FUSER UTILITY TO LOCATE UNLINKED FILES STILL
REFERENCED
Detail Data
JFS2 MAJOR/MINOR DEVICE NUMBER
 002B 000B
 

A:

002b is 2*16+11 -->43 
ls -l /dev|grep 43, 
000b is 11 --> look for 43, 11 

Date:         Wed, 29 Dec 2004 11:06:27 +0000
To: aix-l@Princeton.EDU

Q:

Subject 
Re: error concerning filesystem [Virus checked] 

Hi Holger, 

A small query...how did you arrive at this figure of 43 from the error 
code. 
The decimal value of B is 11 but I could not understand the 2*16.. 

can you please exp this.... 

A:

The major/minor numbers (002B 000B) are in hex: hex abcd = 
a*16^3+b*16^2+c*16^1+d therefore hex 002B=0*16^3+0*16^2+2*16^1+11=2*16+11 


Note 3: AIX superblock issues:
------------------------------

-- Hint 1 for AIX:
-- ---------------

thread:

Use this command in case the superblock is corrupted. This will restore the BACKUP COPY of the superblock 
to the CURRENT copy.

# dd count=1 bs=4k skip=31 seek=1 if=/dev/hd4 of=/dev/hd4

# fsck /dev/hd4 2>&1 | tee /tmp/fsck.errors


Note:

fuser
Identifies processes using a file or file system

# fuser -u /dev/hd3
Sample output: /dev/hd3: 2964(root) 6615c(root) 8465(casado) 11290(bonner)


-- Hint 2 for AIX:
-- ---------------

http://publib.boulder.ibm.com/infocenter/pseries/v5r3/index.jsp?topic=/com.ibm.aix.howtos/doc/howto/HT_baseadmn_badmagnumber.htm


Fixing a corrupted magic number in the file system superblock
If the superblock of a file system is damaged, the file system cannot be accessed. You can fix a 
corrupted magic number in the file system superblock.

Most damage to the superblock cannot be repaired. The following procedure describes how to repair a superblock 
in a JFS file system when the problem is caused by a corrupted magic number. If the primary superblock is corrupted 
in a JFS2 file system, use the fsck command to automatically copy the secondary superblock and repair the primary 
superblock.

In the following scenario, assume /home/myfs is a JFS file system on the physical volume /dev/lv02.

The information in this how-to was tested using AIXr 5.2. If you are using a different version or level of AIX, 
the results you obtain might vary significantly. 

1. Unmount the /home/myfs file system, which you suspect might be damaged, using the following command: 

# umount /home/myfs

2. To confirm damage to the file system, run the fsck command against the file system. For example: 

# fsck -p /dev/lv02

If the problem is damage to the superblock, the fsck command returns one of the following messages: 

fsck: Not an AIXV5 file system
OR 
Not a recognized filesystem type

3. With root authority, use the od command to display the superblock for the file system, 
as shown in the following example: 

# od -x -N 64 /dev/lv02 +0x1000

Where the -x flag displays output in hexadecimal format and the -N flag instructs the system to format 
no more than 64 input bytes from the offset parameter (+), which specifies the point in the file where 
the file output begins. The following is an example output: 

0001000  1234 0234 0000 0000 0000 4000 0000 000a
0001010  0001 8000 1000 0000 2f6c 7633 0000 6c76
0001020  3300 0000 000a 0003 0100 0000 2f28 0383
0001030  0000 0001 0000 0200 0000 2000 0000 0000
0001040

In the preceding output, note the corrupted magic value at 0x1000 (1234 0234). If all defaults were taken 
when the file system was created, the magic number should be 0x43218765. If any defaults were overridden, 
the magic number should be 0x65872143. 

4. Use the od command to check the secondary superblock for a correct magic number. An example command 
and its output follows: 

# od -x -N 64 /dev/lv02 +0x1f000

001f000  6587 2143 0000 0000 0000 4000 0000 000a
001f010  0001 8000 1000 0000 2f6c 7633 0000 6c76
001f020  3300 0000 000a 0003 0100 0000 2f28 0383
001f030  0000 0001 0000 0200 0000 2000 0000 0000
001f040

Note the correct magic value at 0x1f000. 

5. Copy the secondary superblock to the primary superblock. An example command and output follows: 

# dd count=1 bs=4k skip=31 seek=1 if=/dev/lv02 of=/dev/lv02

dd: 1+0 records in.
dd: 1+0 records out.

Use the fsck command to clean up inconsistent files caused by using the secondary superblock. For example: 

# fsck /dev/lv02 2>&1 | tee /tmp/fsck.errs

For more information

The fsck and od command descriptions in AIX 5L Version 5.3 Commands Reference, Volume 4 
AIX Logical Volume Manager from A to Z: Introduction and Concepts, an IBM Redbook 
AIX Logical Volume Manager from A to Z: Troubleshooting and Commands, an IBM Redbook 
"Boot Problems" in Problem Solving and Troubleshooting in AIX 5L, an IBM Redbook 


Note 4: Linux superblock issues:
--------------------------------

1.

DAMAGED SUPERBLOCK


If a filesystem check fails and returns the error message "Damaged Superblock" you're lost . . . . . . . 
or not ?
Well, not really, the damaged "superblock" can be restored from a backup. There are several backups stored 
on the harddisk. But let me first have a go at explaining what a "superblock"is.

A superblock is located at position 0 of every partition, contains vital information about the filesystem 
and is needed at a fielsystem check.

The information stored in the superblock are about what sort of fiesystem is used, the I-Node counts, 
block counts, free blocks and I-Nodes, the numer of times the filesystem was mounted, date of the 
last filesystem check and the first I-Node where / is located.

Thus, a damaged superblock means that the filesystem check will fail. 

Our luck is that there are backups of the superblock located on several positions and we can restore 
them with a simple command.

The usual ( and only ) positions are: 8193, 32768, 98304, 163840, 229376 and 294912. ( 8193 in many cases 
only on older systems, 32768 is the most current position for the first backup )
You can check this out and have a lot more info about a particular partition you have on your HD by:


CODE  
# dumpe2fs /dev/hda5 

You will see that the primary superblock is located at position 0, and the first backup on position 32768.
O.K. let's get serious now, suppose you get a "Damaged Superblock" error message at filesystem check 
( after a power failure ) and you get a root-prompt in a recovery console, then you give the command:

CODE  
# e2fsck -b 32768 /dev/hda5 


don't try this on a mounted filesystem

It will then check the filesystem with the information stored in that backup superblock and if the check 
was successful it will restore the backup to position 0.
Now imagine the backup at position 32768 was damaged too . . . then you just try again with the backup 
stored at position 98304, and 163840, and 229376 etc. etc. until you find an undamaged backup  
( there are five backups so if at least one of those five is okay it's bingo ! )

So next time don't panic . . just get the paper where you printed out this Tip and give the magic command
 
CODE  
# e2fsck -b 32768 /dev/hda5  


32.6 Undelete programs:
=======================

Note 1: AIX and JFS
-------------------

/*****************************************************************************
 * rsb.c - Read Super Block. Allows a jfs superblock to be dumped, inode
 * table to be listed or specific inodes data pointers to be chased and
 * dumped to standard out (undelete).
 *
 * Phil Gibbs - Trinem Consulting (pgibbs@trinem.co.uk)
 ****************************************************************************/
#include <stdio.h>
#include <jfs/filsys.h>
#include <jfs/ino.h>
#include <sys/types.h>
#include <pwd.h>
#include <grp.h>
#include <unistd.h>
#include <time.h>

#define FOUR_MB		(1024*1024*4)
#define THIRTY_TWO_KB	(1024*32)

extern int optind;
extern int Optopt;
extern int Opterr;
extern char *optarg;

void PrintSep()
{
	int k=80;

	while (k)
	{
		putchar('-');
		k--;
	}
	putchar('\n');
}

char *UserName(uid_t uid)
33333{
char replystr[10];
struct passwd *res;

res=getpwuid(uid);
if (res->pw_name[0])
{
	return res->pw_name;
}
else
{
	sprintf(replystr,"%d",uid);
	return replystr;
}
}

char *GroupName(gid_t gid)
{
struct group *res;
res=getgrgid(gid);
return res->gr_name;
}


ulong NumberOfInodes(struct superblock *sb)
{
	ulong MaxInodes;
	ulong TotalFrags;

	if (sb->s_version==fsv3pvers)
	{
		TotalFrags=(sb->s_fsize*512)/sb->s_fragsize;
		MaxInodes=(TotalFrags/sb->s_agsize)*sb->s_iagsize;
	}
	else
	{
		MaxInodes=(sb->s_fsize*512)/sb->s_bsize;
	}
	return MaxInodes;
}


void AnalyseSuperBlock(struct superblock *sb)
{
	ulong TotalFrags;

	PrintSep();
	printf("SuperBlock Details:\n-------------------\n");
	printf("File system size:  %ld x 512 bytes (%ld Mb)\n",
				sb->s_fsize,
				(sb->s_fsize*512)/(1024*1024));
	printf("Block size:        %d bytes\n",sb->s_bsize);
	printf("Flags:             ");
	switch (sb->s_fmod)
	{
		case (char)FM_CLEAN:
			break;
		case (char)FM_MOUNT:
			printf("mounted ");
			break;
		case (char)FM_MDIRTY:
			printf("mounted dirty ");
			break;
		case (char)FM_LOGREDO:
			printf("log redo failed ");
			break;
		default:
			printf("Unknown flag ");
			break;
	}
	if (sb->s_ronly) printf("(read-only)");
	printf("\n");
	printf("Last SB update at: %s",ctime(&(sb->s_time)));
	printf("Version:           %s\n",
	sb->s_version?"1 - fsv3pvers":"0 - fsv3vers");
	printf("\n");
	if (sb->s_version==fsv3pvers)
	{
		TotalFrags=(sb->s_fsize*512)/sb->s_fragsize;
		printf("Fragment size:     %5d         ",sb->s_fragsize);
		printf("inodes per alloc:  %8d\n",sb->s_iagsize);
		printf("Frags per alloc:   %5d         ",sb->s_agsize);
		printf("Total Fragments:   %8d\n",TotalFrags);
		printf("Total Alloc Grps:  %5d         ",
						TotalFrags/sb->s_agsize);
		printf("Max inodes:        %8ld\n",NumberOfInodes(sb));
	}
	else
	{
		printf("Total Alloc Grps:  %5d         ",
				(sb->s_fsize*512)/sb->s_agsize);
		printf("inodes per alloc:  %8d\n",sb->s_agsize);
		printf("Max inodes:      %8ld\n",NumberOfInodes(sb));
	}
	PrintSep();
}

void ReadInode(	FILE *in,
		ulong StartInum,
		struct dinode *inode,
		ulong InodesPerAllocBlock,
		ulong AllocBlockSize)
{
	off_t			SeekPoint;
	long			BlockNumber;
	int			OffsetInBlock;
	static struct dinode	I_NODES[PAGESIZE/DILENGTH];
	ulong			AllocBlock;
	ulong			inum;
	static off_t		LastSeekPoint=-1;

	AllocBlock=(StartInum/InodesPerAllocBlock);
	BlockNumber=(StartInum-(AllocBlock*InodesPerAllocBlock))/
			(PAGESIZE/DILENGTH);
	OffsetInBlock=(StartInum-(AllocBlock*InodesPerAllocBlock))-
			(BlockNumber*(PAGESIZE/DILENGTH));
	SeekPoint=(AllocBlock)?
		(BlockNumber*PAGESIZE)+(AllocBlock*AllocBlockSize):
		(BlockNumber*PAGESIZE)+(INODES_B*PAGESIZE);
	if (SeekPoint!=LastSeekPoint)
	{
		sync();
		fseek(in,SeekPoint,SEEK_SET);
		fread(I_NODES,PAGESIZE,1,in);
		LastSeekPoint=SeekPoint;
	}
	*inode=I_NODES[OffsetInBlock];
}

void DumpInodeContents(	long	inode,
			FILE	*in,
			ulong	InodesPerAllocBlock,
			ulong	AllocBlockSize,
			ulong	Mask,
			ulong	Multiplier)
{
	struct dinode		DiskInode;
	ulong			SeekPoint;
	char			Buffer[4096];
	ulong			FileSize;
	int			k;
	int			BytesToRead;
	ulong			*DiskPointers;
	int			NumPtrs;

	ReadInode(	in,
			inode,
			&DiskInode,
			InodesPerAllocBlock,
			AllocBlockSize);
	FileSize=DiskInode.di_size;

	if (FileSize>FOUR_MB)
	{
		/* Double indirect mapping */
	}
	else
	if (FileSize>THIRTY_TWO_KB)
	{
		/* Indirect mapping */
		SeekPoint=DiskInode.di_rindirect & Mask;
		SeekPoint=SeekPoint*Multiplier;
		DiskPointers=(ulong *)malloc(1024*sizeof(ulong));
		fseek(in,SeekPoint,SEEK_SET);
		fread(DiskPointers,1024*sizeof(ulong),1,in);
		NumPtrs=1024;
	}
	else
	{
		/* Direct Mapping */
		DiskPointers=&(DiskInode.di_rdaddr[0]);
		NumPtrs=8;
	}

	for (k=0;k<=NumPtrs && FileSize;k++)
	{
		SeekPoint=(DiskPointers[k] & Mask);
		SeekPoint=SeekPoint*Multiplier;

		BytesToRead=(FileSize>sizeof(Buffer))?sizeof(Buffer):FileSize;
		fseek(in,SeekPoint,SEEK_SET);
		fread(Buffer,BytesToRead,1,in);
		FileSize=FileSize-BytesToRead;
		write(1,Buffer,BytesToRead);
	}
}

void DumpInodeList(	FILE	*in,
			ulong	MaxInodes,
			ulong	InodesPerAllocBlock,
			ulong	AllocBlockSize)
{
	long			inode;
	struct dinode		DiskInode;
	struct tm		*TimeStruct;

	printf("   Inode Links     User    Group     Size    ModDate\n");
	printf("-------- ----- -------- -------- --------    -------\n");
	for (inode=0;inode<=MaxInodes;inode++)
	{
		ReadInode(	in,
				inode,
				&DiskInode,
				InodesPerAllocBlock,
				AllocBlockSize);
		if (DiskInode.di_mtime)
		{
			TimeStruct=localtime((long *)&DiskInode.di_mtime);
			printf("%8d %5d %8s %8s %8d %02d/%02d/%4d\n",
				inode,
				DiskInode.di_nlink,
				UserName(DiskInode.di_uid),
				GroupName(DiskInode.di_gid),
				DiskInode.di_size,
				TimeStruct->tm_mday,
				TimeStruct->tm_mon,
				TimeStruct->tm_year+1900);
		}
	}
}

void ExitWithUsageMessage()
{
	fprintf(stderr,"USAGE: rsb [-i inode] [-d] [-s] <block_device>\n");
	exit(1);
}

main(int argc,char **argv)
{
	FILE			*in;
	struct superblock	SuperBlock;
	short			Valid;
	long			inode=0;
	struct dinode		DiskInode;
	ulong			AllocBlockSize;
	ulong			InodesPerAllocBlock;
	ulong			MaxInodes;
	ulong			Mask;
	ulong			Multiplier;
	int			option;
	int			DumpSuperBlockFlag=0;
	int			DumpFlag=0;

	while ((option=getopt(argc,argv,"i:ds")) != EOF)
	{
		switch(option)
		{
			case 'i':
				/* Inode specified */
				inode=atol(optarg);
				break;
			case 'd':
				/* Dump flag */
				DumpFlag=1;
				break;
			case 's':
				/* List Superblock flag */
				DumpSuperBlockFlag=1;
				break;
			default:
				break;
		}
	}

	if (strlen(argv[optind])) in=fopen(argv[optind],"r");
	else ExitWithUsageMessage();

	if (in)
	{
		fseek(in,SUPER_B*PAGESIZE,SEEK_SET);
		fread(&SuperBlock,sizeof(SuperBlock),1,in);
		switch (SuperBlock.s_version)
		{
			case fsv3pvers:
				Valid=!strncmp(SuperBlock.s_magic,fsv3pmagic,4);
				InodesPerAllocBlock=SuperBlock.s_iagsize;
				AllocBlockSize=
				SuperBlock.s_fragsize*SuperBlock.s_agsize;
				Multiplier=SuperBlock.s_fragsize;
				Mask=0x3ffffff;
				break;
			case fsv3vers:
				Valid=!strncmp(SuperBlock.s_magic,fsv3magic,4);
				InodesPerAllocBlock=SuperBlock.s_agsize;
				AllocBlockSize=SuperBlock.s_agsize*PAGESIZE;
				Multiplier=SuperBlock.s_bsize;
				Mask=0xfffffff;
				break;
			default:
				Valid=0;
				break;
		}
		if (Valid)
		{
			if (DumpSuperBlockFlag==1)
			{
				AnalyseSuperBlock(&SuperBlock);
			}
			MaxInodes=NumberOfInodes(&SuperBlock);
			if (DumpFlag==1)
			{
				if (inode)
				DumpInodeContents(inode,in,InodesPerAllocBlock,AllocBlockSize,Mask,Multiplier);
				else
				DumpInodeList(in,MaxInodes,InodesPerAllocBlock,AllocBlockSize);
			}
		}
		else
		{
			fprintf(stderr,"Superblock - bad magic number\n");
			exit(1);
		}
	}
	else
	{
		fprintf(stderr,"couldn't open ");
		perror(argv[optind]);
		exit(1);
	}
}


Note 2: Undelete a text file on most unixes (no garantee):
----------------------------------------------------------

Works mainly on Linux Distro's

Using grep (traditional UNIX way) to recover files
Use following grep syntax:

# grep -b 'search-text' /dev/partition > file.txt
OR
# grep -a -B[size before] -A[size after] `text' /dev/[your_partition] > file.txt

Where,

-i : Ignore case distinctions in both the PATTERN and the input files i.e. match both uppercase and lowercase character. 
-a : Process a binary file as if it were text 
-B Print number lines/size of leading context before matching lines. 
-A: Print number lines/size of trailing context after matching lines. 
To recover text file starting with "nixCraft" word on /dev/sda1 you can try following command:
# grep -i -a -B10 -A100 'nixCraft' /dev/sda1 > file.txt
Next use vi to see file.txt. This method is ONLY useful if deleted file is text file. 
If you are using ext2 file system, try out recover command. .


Note 3:
-------

For AIX there are undelete tools: http://www.compunix.com/


Note 4: lsof and Linux:
-----------------------

Bring back deleted files with lsof
By Michael Stutz on November 16, 2006 (8:00:00 AM) 

Briefly, a file as it appears somewhere on a Linux filesystem is actually just a link to an inode, 
which contains all of the file's properties, such as permissions and ownership, as well as the addresses 
of the data blocks where the file's content is stored on disk. When you rm a file, you're removing the link 
that points to its inode, but not the inode itself; other processes (such as your audio player) might still 
have it open. It's only after they're through and all links are removed that an inode and the data blocks 
it pointed to are made available for writing.

This delay is your key to a quick and happy recovery: if a process still has the file open, the data's there 
somewhere, even though according to the directory listing the file already appears to be gone.

This is where the Linux process pseudo-filesystem, the /proc directory, comes into play. Every process on 
the system has a directory here with its name on it, inside of which lies many things -- 
including an fd ("file descriptor") subdirectory containing links to all files that the process has open. 
Even if a file has been removed from the filesystem, a copy of the data will be right here:

/proc/process id/fd/file descriptor 

To know where to go, you need to get the id of the process that has the file open, and the file descriptor. 
These you get with lsof, whose name means "list open files." (It actually does a whole lot more than this 
and is so useful that almost every system has it installed. If yours isn't one of them, you can grab the latest 
version straight from its author.)

Once you get that information from lsof, you can just copy the data out of /proc and call it a day.

This whole thing is best demonstrated with a live example. First, create a text file that you can delete 
and then bring back:

$ man lsof | col -b > myfile 

Then have a look at the contents of the file that you just created:

$ less myfile 

You should see a plaintext version of lsof's huge man page looking out at you, courtesy of less.

Now press Ctrl-Z to suspend less. Back at a shell prompt make sure your file is still there:

$ ls -l myfile
-rw-r--r--  1 jimbo jimbo 114383 Oct 31 16:14 myfile
$ stat myfile
  File: `myfile'
  Size: 114383          Blocks: 232        IO Block: 4096   regular file
Device: 341h/833d       Inode: 1276722     Links: 1
Access: (0644/-rw-r--r--)  Uid: ( 1010/    jimbo)   Gid: ( 1010/    jimbo)
Access: 2006-10-31 16:15:08.423715488 -0400
Modify: 2006-10-31 16:14:52.684417746 -0400
Change: 2006-10-31 16:14:52.684417746 -0400
Yup, it's there all right. OK, go ahead and oops it:

$ rm myfile
$ ls -l myfile
ls: myfile: No such file or directory
$ stat myfile
stat: cannot stat `myfile': No such file or directory
$
It's gone.

At this point, you must not allow the process still using the file to exit, because once that happens, 
the file will really be gone and your troubles will intensify. Your background less process in this walkthrough 
isn't going anywhere (unless you kill the process or exit the shell), but if this were a video or sound file that 
you were playing, the first thing to do at the point where you realize you deleted the file would be to 
immediately pause the application playback, or otherwise freeze the process, so that it doesn't eventually 
stop playing the file and exit. 

Now to bring the file back. First see what lsof has to say about it:

$ lsof | grep myfile
less      4158    jimbo    4r      REG       3,65   114383   1276722 /home/jimbo/myfile (deleted)
The first column gives you the name of the command associated with the process, the second column is the 
process id, and the number in the fourth column is the file descriptor (the "r" means that it's a regular file). 
Now you know that process 4158 still has the file open, and you know the file descriptor, 4. That's everything 
you have to know to copy it out of /proc.

You might think that using the -a flag with cp is the right thing to do here, since you're restoring the file -- 
but it's actually important that you don't do that. Otherwise, instead of copying the literal data contained 
in the file, you'll be copying a now-broken symbolic link to the file as it once was listed in its original directory:

$ ls -l /proc/4158/fd/4
lr-x------  1 jimbo jimbo 64 Oct 31 16:18 /proc/4158/fd/4 -> /home/jimbo/myfile (deleted)
$ cp -a /proc/4158/fd/4 myfile.wrong
$ ls -l myfile.wrong
lrwxr-xr-x  1 jimbo jimbo 24 Oct 31 16:22 myfile.wrong -> /home/jimbo/myfile (deleted)
$ file myfile.wrong
myfile.wrong: broken symbolic link to `/home/jimbo/myfile (deleted)'
$ file /proc/4158/fd/4
/proc/4158/fd/4: broken symbolic link to `/home/jimbo/myfile (deleted)'
So instead of all that, just a plain old cp will do the trick:

$ cp /proc/4158/fd/4 myfile.saved 

And finally, verify that you've done good:

$ ls -l myfile.saved
-rw-r--r--  1 jimbo jimbo 114383 Oct 31 16:25 myfile.saved
$ man lsof | col -b > myfile.new
$ cmp myfile.saved myfile.new
No complaints from cmp -- your restoration is the real deal.

Incidentally, there are a lot of useful things you can do with lsof in addition to rescuing lost files.


32.7 Some notes about disks on x86 systems: MBR and Partition Bootsector:
=========================================================================

The following applies to PC's and x86 based Servers.

There are two sectors on the disk that are critical to starting the computer:

- Master Boot Record
- Partition Boot Sector

The MBR is created when you create the first partition on the harddisk.
The location is always cylinder 0, head 0 and sector 1.

The MBR contains the Partition Table for the disk and a small amount of executable code.
On x86 machines, this executable code examines the Partition Table and identifies
the system partition. The code then finds the system partition's starting location on the disk,
and loads an copy of its Partition Boot Sector into memory.

If you would take a look at the MBR, you would find:

The first 446 bytes in the sector is the MBR.
After that, you would see the Partition Table, a 64 byte structure. Each table entry is 16 bytes long,
the first byte being the Boot Indicator field. This tells the code which partition is bootable.

The Partition Boot Sector, has its own "layout" depending on the type of system.


32.8 How to get LUN ID's:
=========================

# lscfg -vl hdiskx
# lsattr -El hdiskx

ZD110L05
600507680190014DC000000000000304

ZD110L08
600507680190014DC000000000000305

ZD111L05
600507680190014DC000000000000306

ZD111L08
600507680190014DC000000000000307


#############################
33. Filesystems in Linux:
#############################


33.1 Disks:
===========

Linux on x86 systems, have the following (storage) devices:

-- Entire harddisks are listed as devices without numbers, such as "/dev/hda" or "/dev/sda".

- IDE:

/dev/hda    is the primary IDE master drive,
/dev/hdb    is the primary IDE slave drive,
/dev/hdc    is the secondary IDE master,
/dev/hdd    is the secondary IDE slave,

- SCSI:
/dev/sda   is the first SCSI interface and 1st device id number
etc..

-- Partitions on a disk are referred to with a number such as

/dev/hda1


Floppydrive:

/dev/fd0
# mount -t auto /dev/fd0 /mnt/floppy
# mount -t vfat /dev/fd0 /mnt/floppy
# mount /dev/fd0 /mnt/floppy

Zipdrive:

# insmod ppa       # load the module
# mount -t vfat /dev/sda /mnt/zip


33.2 Filesystems:
=================

Linux supports a huge number of filesystems, including FAT, JFS, NTFS etc.. But the most common are ext2 and ext3.
For the "native" filesystems, we take a look at the following FS's:

- ReiserFS   
A journaled filesystem

- Ext2
The most popular filesystem for years. But it does not use a log/jounal,
so gradually it becomes less important.

- Ext3
Very related to Ext2, but this one supports journaling.
An Ext2 filesystem can easily be upgraded to Ext3.


33.3 Adding a disk in Linux:
============================

Suppose you have SCSI card on with a disk is attached.  
The disk as a whole would be refferred to as "/dev/sda" and the
first partition would be referred to as "/dev/sda1".

But we have a new disk here.
If you cannot find the device files /dev/sda in /dev, you might
create it with the /dev/MAKEDEV script:

# cd /dev
# ./MAKEDEV sda

The disk is now ready to be partitioned. In this example, we plan
to create 3 partitions, including a swap partition.

# fdisk /dev/sda
The number of cylinders for this disk is set to ..
(.. more output..)
Command:

The fdisk program is interactive; pressing m displays a list of all its commands.

Command: new
Command action
  e extended
  p primary partition (1-4): 1
(.. more output..)

Command: print

Device           Boot    Start   End   Blocks   Id   System
/dev/sda1                1       255   2048256  83   Linux

So we have created our first partition.
We now create the swap partition:

Command: new
Command action
  e extended
  p primary partition (1-4): 2
(.. more output..)

Command: type
Partition number (1-4): 2
Hex code: 82              # which is a Linix swap partition
Changed system type of partition 2 to 82 (Linux swap)

The third partition can be created in a similar way.
We now would like to see a listing of our partitions

Command: print

Device           Boot    Start   End   Blocks   Id   System
/dev/sda1                1       255   2048256  83   Linux
/dev/sda2                256     511   2056320  82   Swap
/dev/sda3                512    5721  41849325  83   Linux


Now, save the label to the disk:

Command: write
(.. more output..)

Ofcourse, we now would like to create the filesystems and the swap.

If you want to use the Ext2 filesystem on partition one, use the following command:

# mke2fs /dev/sda1 2048256       ( or # mkfs -t ext2 -b 4096 /dev/sda1 )

Lets check the filesystem with fsck:
# fsck -f /dev/sda1

A new filesystem can be mounted as soon as the mount point is created.

# mkdir /bkroot
# mount /dev/sda1 /bkroot

Lets now create the swap space:
# mkswap -c /dev/sda2 2056320

and activate it using the command:

# swapon /dev/sda2

See also section 34.3 for administering swap space on Linux.


33.4 Notes about Linux and LVM:
==============================


Note 1:
=======


-What is RAID and LVM 
-Initial setup of a RAID-5 array 
-Initial setup of LVM on top of RAID 
-Handling a Drive Failure 
-Common Glitches 
-Other Useful Resources 
-Expanding an Array/Filesytem 

--------------------------------------------------------------------------------

-What is RAID and LVM
RAID is usually defined as Redundant Array of Inexpensive disks. It is normally used to spread data among several 
physical hard drives with enough redundancy that should any drive fail the data will still be intact. 
Once created a RAID array appears to be one device which can be used pretty much like a regular partition. 
There are several kinds of RAID but I will only refer to the two most common here. 
The first is RAID-1 which is also known as mirroring. With RAID-1 it's basically done with two essentially 
identical drives, each with a complete set of data. The second, the one I will mostly refer to in this guide 
is RAID-5 which is set up using three or more drives with the data spread in a way that any one drive failing 
will not result in data loss. The Red Hat website has a great overview of the RAID Levels. 

There is one limitation with Linux Software RAID that a /boot parition can only reside on a RAID-1 array. 

Linux supports both several hardware RAID devices but also software RAID which allows you to use any IDE or 
SCSI drives as the physical devices. In all cases I'll refer to software RAID. 

LVM stands for Logical Volume Manager and is a way of grouping drives and/or partition in a way where instead 
of dealing with hard and fast physical partitions the data is managed in a virtual basis where the virtual 
partitions can be resized. The Red Hat website has a great overview of the Logical Volume Manager. 

There is one limitation that a LVM cannot be used for the /boot. 


--------------------------------------------------------------------------------

Initial set of a RAID-5 array
I recommend you experiment with setting up and managing RAID and LVM systems before using it on an 
important filesystem. One way I was able to do it was to take old hard drive and create a bunch of 
partitions on it (8 or so should be enough) and try combining them into RAID arrays. 
In my testing I created two RAID-5 arrays each with 3 partitions. You can then manually fail and hot remove 
the partitions from the array and then add them back to see how the recovery process works. You'll get a warning 
about the partitions sharing a physical disc but you can ignore that since it's only for experimentation. 
In my case I have two systems with RAID arrays, one with two 73G SCSI drives running RAID-1 (mirroring) and my other 
test system is configured with three 120G IDE drives running RAID-5. In most cases I will refer to my RAID-5 
configuration as that will be more typical. 

I have an extra IDE controller in my system to allow me to support the use of more than 4 IDE devices which caused a very odd drive assignment. 
The order doesn't seem to bother the Linux kernel so it doesn't bother me. My basic configuration is as follows: 

hda 120G drive
hdb 120G drive
hde 60G boot drive not on RAID array
hdf 120G drive
hdg CD-ROM drive

The first step is to create the physical partitions on each drive that will be part of the RAID array. 
In my case I want to use each 120G drive in the array in it's entirety. All the drives are partitioned identically 
so for example, this is how hda is partitioned: 

Disk /dev/hda: 120.0 GB, 120034123776 bytes
16 heads, 63 sectors/track, 232581 cylinders
Units = cylinders of 1008 * 512 = 516096 bytes

   Device Boot      Start         End      Blocks   Id  System
/dev/hda1   *           1      232581   117220792+  fd  Linux raid autodetect

So now with all three drives with a partitioned with id fd Linux raid autodetect you can go ahead and combine 
the paritions into a RAID array: 

# /sbin/mdadm --create --verbose /dev/md0 --level=5 --raid-devices=3 \
	/dev/hdb1 /dev/hda1 /dev/hdf1

Wow, that was easy. That created a special device /dev/md0 which can be used instead of a physical parition. 
You can check on the status of that RAID array with the mdadm command: 

# /sbin/mdadm --detail /dev/md0
        Version : 00.90.01
  Creation Time : Wed May 11 20:00:18 2005
     Raid Level : raid5
     Array Size : 234436352 (223.58 GiB 240.06 GB)
    Device Size : 117218176 (111.79 GiB 120.03 GB)
   Raid Devices : 3
  Total Devices : 3
Preferred Minor : 0
    Persistence : Superblock is persistent

    Update Time : Fri Jun 10 04:13:11 2005
          State : clean
 Active Devices : 3
Working Devices : 3
 Failed Devices : 0
  Spare Devices : 0

         Layout : left-symmetric
     Chunk Size : 64K

           UUID : 36161bdd:a9018a79:60e0757a:e27bb7ca
         Events : 0.10670

    Number   Major   Minor   RaidDevice State
       0       3        1        0      active sync   /dev/hda1
       1       3       65        1      active sync   /dev/hdb1
       2      33       65        2      active sync   /dev/hdf1

The important lines to see are the State line which should say clean otherwise there might be a problem. 
At the bottom you should make sure that the State column always says active sync which says each device 
is actively in the array. You could potentially have a spare device that's on-hand should any drive should fail. 
If you have a spare you'll see it listed as such here. 
One thing you'll see above if you're paying attention is the fact that the size of the array is 240G but I 
have three 120G drives as part of the array. That's because the extra space is used as extra parity data that is 
needed to survive the failure of one of the drives. 


--------------------------------------------------------------------------------

- Initial set of LVM on top of RAID
Now that we have /dev/md0 device you can create a Logical Volume on top of it. Why would you want to do that? 
If I were to build an ext3 filesystem on top of the RAID device and someday wanted to increase it's capacity 
I wouldn't be able to do that without backing up the data, building a new RAID array and restoring my data. 
Using LVM allows me to expand (or contract) the size of the filesystem without disturbing the existing data. 
Anyway, here are the steps to then add this RAID array to the LVM system. The first command pvcreate will 
"initialize a disk or parition for use by LVM". The second command vgcreate will then create the Volume Group, 
in my case I called it lvm-raid: 

# pvcreate /dev/md0
# vgcreate lvm-raid /dev/md0

The default value for the physical extent size can be too low for a large RAID array. In those cases you'll need 
to specify the -s option with a larger than default physical extent size. The default is only 4MB as of the 
version in Fedora Core 5. For example, to successfully create a 550G RAID array a size of 2G works well: 

# vgcreate -s 2G <volume group name>

Ok, you've created a blank receptacle but now you have to tell how many Physical Extents from the 
physical device (/dev/md0 in this case) will be allocated to this Volume Group. In my case I wanted all the data 
from /dev/md0 to be allocated to this Volume Group. If later I wanted to add additional space I would create 
a new RAID array and add that physical device to this Volume Group. 
To find out how many PEs are available to me use the vgdisplay command to find out how many are available 
and now I can create a Logical Volume using all (or some) of the space in the Volume Group. 
In my case I call the Logical Volume lvm0. 

# vgdisplay lvm-raid
	.
	.
   Free  PE / Size       57235 / 223.57 GB

# lvcreate -l 57235 lvm-raid -n lvm0

In the end you will have a device you can use very much like a plain 'ol parition called /dev/lvm-raid/lvm0. 
You can now check on the status of the Logical Volume with the lvdisplay command. The device can then be used to to create a filesystem on. 

# lvdisplay /dev/lvm-raid/lvm0 
  --- Logical volume ---
  LV Name                /dev/lvm-raid/lvm0
  VG Name                lvm-raid
  LV UUID                FFX673-dGlX-tsEL-6UXl-1hLs-6b3Y-rkO9O2
  LV Write Access        read/write
  LV Status              available
  # open                 1
  LV Size                223.57 GB
  Current LE             57235
  Segments               1
  Allocation             inherit
  Read ahead sectors     0
  Block device           253:2

# mkfs.ext3 /dev/lvm-raid/lvm0
	.
	.
# mount /dev/lvm-raid/lvm0 /mnt

# df -h /mnt
Filesystem            Size  Used Avail Use% Mounted on
/dev/mapper/lvm--raid-lvm0
                       224G   93M  224G   1% /mnt


--------------------------------------------------------------------------------

- Handling a Drive Failure
As everything eventually does break (some sooner than others) a drive in the array will fail. It is a very good idea 
to run smartd on all drives in your array (and probably ALL drives period) to be notified of a failure 
or a pending failure as soon as possible. You can also manually fail a partition, meaning to take it out 
of the RAID array, with the following command: 

# /sbin/mdadm /dev/md0 -f /dev/hdb1
mdadm: set /dev/hdb1 faulty in /dev/md0

Once the system has determined a drive has failed or is otherwise missing (you can shut down and pull out a drive 
and reboot to similate a drive failure or use the command to manually fail a drive above it will show something 
like this in mdadm: 

# /sbin/mdadm --detail /dev/md0
     Update Time : Wed Jun 15 11:30:59 2005
           State : clean, degraded
  Active Devices : 2
 Working Devices : 2
  Failed Devices : 1
   Spare Devices : 0
	.
	.
     Number   Major   Minor   RaidDevice State
        0       3        1        0      active sync   /dev/hda1
        1       0        0        -      removed
        2      33       65        2      active sync   /dev/hdf1

You'll notice in this case I had /dev/hdb fail. I replaced it with a new drive with the same capacity and was able 
to add it back to the array. The first step is to partition the new drive just like when first creating the array. 
Then you can simply add the partition back to the array and watch the status as the data is rebuilt onto the newly replace drive. 

# /sbin/mdadm /dev/md0 -a /dev/hdb1
# /sbin/mdadm --detail /dev/md0
     Update Time : Wed Jun 15 12:11:23 2005
           State : clean, degraded, recovering
  Active Devices : 2
 Working Devices : 3
  Failed Devices : 0
   Spare Devices : 1

          Layout : left-symmetric
      Chunk Size : 64K

  Rebuild Status : 2% complete
	.
	.

During the rebuild process the system performance may be somewhat impacted but the data should remain in-tact. 
--------------------------------------------------------------------------------

- Expanding an Array/Filesytem
The answer to how to expand a RAID-5 array is very simple: You can't. 
I'm used to working with a NetApp Filer where you plug in a drive, type a simple command and that drive was added 
to the existing RAID array, no muss, no fuss. While you can't add space to a RAID-5 array directly in Linux you CAN 
add space to an existing Logical Volume and then expand the ext3  filesytem on top of it. That's the main reason you 
want to run LVM on top of RAID. 

Before you start it's probably a good idea to back up your data just in case something goes wrong. 

Assuming you want your data to be protected from a drive failing you'll need to create another RAID array 
per the instructions above. In my case I called it /dev/md1  so after partitioning I can create the array: 

# /sbin/mdadm --create --verbose /dev/md1 --level=5 --raid-devices=3 \
	/dev/hde1 /dev/hdg1 /dev/hdh1
# /sbin/mdadm --detail /dev/md1

The next couple steps will add the space from the new RAID array to the space available to be used by Logical Volumes. 
You then check to see how many Physical Extents you have and add them to the Logical Volume you're using. 
Remember that since you can have multiple Logical Volumes on top of a physical RAID array you need to do this extra step. 

# vgextend lvm-raid /dev/md1
# vgdisplay lvm-raid
	.
	.
	.
  Alloc PE / Size       57235 / 223.57 GB
  Free  PE / Size       57235 / 223.57 GB
# lvextend -l 57235 lvm-raid -n lvm0

There, you now have a much larger Logical Volume which is using space on two separate RAID arrays. 
You're not done yet, you now have to extend your filesystem to make use of all that new space. Fortunately this 
is easy on FC4 and RHEL4 since there is a command to expand a ext3  filesytem without even unmounting it! 
Be patient, expanding the file system takes a while. 

# lvdisplay /dev/lvm-raid/lvm0
	.
	.
  LV Size                447.14 GB
	.
# df /raid-array
Filesystem           1K-blocks      Used Available Use% Mounted on
/dev/mapper/lvm--raid-lvm0
                     230755476  40901348 178132400  19% /raid-array
# ext2online /dev/lvm-raid1/lvm0 447g
Get yourself a sandwich
# df /raid-array
Filesystem           1K-blocks      Used Available Use% Mounted on
/dev/mapper/lvm--raid-lvm0
                     461510952  40901348 40887876   9% /raid-array

Congrats, you now have more space. Now go fill it with something. 


Note 2:
=======

Creating a LVM in Linux
 

I am sure anybody who have used windows (2000 and above) have come across the term dynamic disks. 
Linux/Unix also have its own dynamic disk management called LVM.

What is an LVM ?

LVM stands for Logical Disk Manager which is the fundamental way to manage UNIX/Linux storage systems 
in a scalable manner. An LVM abstracts disk devices into pools of storage space called Volume Groups. 
These volume groups are in turn subdivided into virtual disks called Logical Volumes. The logical volumes 
may be used just like regular disks with filesystem created on them and mounted in the Unix/Linux 
filesystem tree. The logical volumes can span multiple disks. Even though a lot of companies have implemented 
their own LVM's for *nixes, the one created by Open Software Foundation (OSF) was integrated into many 
Unix systems which serves as a base for the Linux implementation of LVM.

Note: Sun Solaris ships with LVM from Veritas which is substantially different from the OSF implementation.

Benefits of Logical Volume Management

LVM created in conjunction with RAID can provide fault tolerance coupled with scalability and easy disk management. 
Create a logical volume and filesystem which spans multiple disks.

By creating virtual pools of space, an administrator can create dozens of small filesystems for different projects 
and add space to them as needed without (much) disruption. When a project ends, he can remove the space a
nd put it back into the pool of free space.

Note : Before you move to implement LVM's in linux, make sure your kernel is 2.4 and above. Or else you will have 
to recompile your kernel from source to include support for LVM.

LVM Creation
To create a LVM, we follow a three step process.

Step One : We need to select the physical storage resources that are going to be used for LVM. Typically, these 
are standard partitions but can also be Linux software RAID volumes that we've created. In LVM terminology, 
these storage resources are called "physical volumes" (eg: /dev/hda1, /dev/hda2 ... etc).

Our first step in setting up LVM involves properly initializing these partitions so that they can be recognized 
by the LVM system. This involves setting the correct partition type (usually using the fdisk command, and entering 
the type of partition as 'Linux LVM' - 0x8e ) if we're adding a physical partition; and then running 
the pvcreate command.

# pvcreate /dev/hda1 /dev/hda2 /dev/hda3
# pvscan

The above step creates a physical volume from 3 partitions which I want to initialize for inclusion 
in a volume group.

Step Two : Creating a volume group. You can think of a volume group as a pool of storage that consists of one 
or more physical volumes. While LVM is running, we can add physical volumes to the volume group or even remove them.

First initialize the /etc/lvmtab and /etc/lvmtab.d files by running the following command:

# vgscan

Now you can create a volume group and assign one or more physical volumes to the volume group.

# vgcreate my_vol_grp /dev/hda1 /dev/hda2

Behind the scenes, the LVM system allocates storage in equal-sized "chunks", called extents. 
We can specify the particular extent size to use at volume group creation time. The size of an extent 
defaults to 4Mb, which is perfect for most uses.You can use the -s flag to change the size of the extent. 
The extent affects the minimum size of changes which can be made to a logical volume in the volume group, 
and the maximum size of logical and physical volumes in the volume group. A logical volume can contain at most 
65534 extents, so the default extent size (4 MB) limits the volume to about 256 GB; a size of 1 TB would require 
extents of atleast 16 MB. So to accomodate a 1 TB size, the above command can be rewriten as :

# vgcreate -s 16M my_vol_grp /dev/hda1 /dev/hda2

You can check the result of your work at this stage by entering the command:

# vgdisplay

This command displays the total physical extends in a volume group, size of each extent, 
the allocated size and so on.

Step Three : This step involves the creation of one or more "logical volumes" using our volume group storage pool. 
The logical volumes are created from volume groups, and may have arbitary names. The size of the new volume 
may be requested in either extents (-l switch) or in KB, MB, GB or TB ( -L switch) rounding up to whole extents.

# lvcreate -l 50 -n my_logical_vol my_vol_grp

The above command allocates 50 extents of space in my_vol_grp to the newly created my_logical_vol. 
The -n switch specifies the name of the logical volume we are creating.

Now you can check if you got the desired results by using the command :

# lvdisplay

which shows the information of your newly created logical volume.

Once a logical volume is created, we can go ahead and put a filesystem on it, mount it, and start using 
the volume to store our files. For creating a filesystem, we do the following:

# mke2fs -j /dev/my_vol_grp/my_logical_vol

The -j signifies journaling support for the ext3 filesystem we are creating.
Mount the newly created file system :

# mount /dev/my_vol_grp/my_logical_vol /data
Also do not forget to append the corresponding line in the /etc/fstab file:

#File: /etc/fstab
/dev/my_vol_grp/my_logical_vol /data ext3 defaults 0 0
Now you can start using the newly created logical volume accessable at /data mount point.
Next : Resizing Logical Volumes


Some more on Linux LVM commands:


Linux vgcreate command:
=======================

Linux / Unix Command: vgcreate 
 
 Command Library  

NAME
vgcreate - create a volume group   
SYNOPSIS
vgcreate [-A|--autobackup {y|n}] [-d|--debug] [-h|--help] [-l|--maxlogicalvolumes MaxLogicalVolumes] 
[-p|--maxphysicalvolumes MaxPhysicalVolumes] [-s|--physicalextentsize PhysicalExtentSize[kKmMgGtT]] 
[-v|--verbose] [--version] VolumeGroupName PhysicalVolumePath [PhysicalVolumePath...]   

DESCRIPTION
vgcreate creates a new volume group called VolumeGroupName using the block special device 
PhysicalVolumePath previously configured for LVM with pvcreate(8).   

OPTIONS
-A, --autobackup {y|n} 
      Controls automatic backup of VG metadata after the change (see vgcfgbackup(8)). Default is yes. 
-d, --debug 
      Enables additional debugging output (if compiled with DEBUG). 
-h, --help 
      Print a usage message on standard output and exit successfully. 
-l, --maxlogicalvolumes MaxLogicalVolumes 
      Sets the maximum possible logical volume count. More logical volumes can't be created in this volume group. 
      Absolute maximum is 256. 
-p, --maxphysicalvolumes MaxPhysicalVolumes 
      Sets the maximum possible physical volume count. More physical volumes can't be included in this volume group. Absolute maximum is 256. 
-s, --physicalextentsize PhysicalExtentSize[kKmMgGtT] 
      Sets the physical extent size on physical volumes of this volume group. A size suffix 
      (k for kilobytes up to t for terabytes) is optional, megabytes is the default if no suffix is present. 
      Values can be from 8 KB to 16 GB in powers of 2. The default of 4 MB causes maximum LV sizes of ~256GB 
      because as many as ~64k extents are supported per LV. In case larger maximum LV sizes are needed (later), 
      you need to set the PE size to a larger value as well. Later changes of the PE size in an existing VG are 
      not supported. 
-v, --verbose 
      Display verbose runtime information about vgcreate's activities. 
--version 
      Display tool and IOP version and exit successfully. 
  
EXAMPLES
To create a volume group named test_vg using physical volumes /dev/hdk1, /dev/hdl1, and /dev/hdm1 
with default physical extent size of 4MB: 

# vgcreate test_vg /dev/sd[k-m]1

To create a volume group named test_vg using physical volumes /dev/hdk1, and /dev/hdl1 with default 
physical extent size of 4MB:

# vgcreate test_vg /dev/sdk1 /dev/sdl1

NOTE: If you are using devfs it is essential to use the full devfs name of the device rather than the 
symlinked name in /dev. so: the above could be 

# vgcreate test_vg /dev/scsi/host1/bus0/target[1-3]/lun0/part1


Linux vgextend command:
=======================


Linux / Unix Command: vgextend 
 
 Command Library  

NAME
vgextend - add physical volumes to a volume group   

SYNOPSIS
vgextend [-A|--autobackup{y|n}] [-d|--debug] [-h|--help] [-v|--verbose] VolumeGroupName 
         PhysicalVolumePath [PhysicalVolumePath...]   

DESCRIPTION
vgextend allows you to add one or more initialized physical volumes ( see pvcreate(8) ) to an existing 
volume group to extend it in size.   

OPTIONS
-A, --autobackup y/n 
Controls automatic backup of VG metadata after the change ( see vgcfgbackup(8) ). Default is yes. 
-d, --debug 
Enables additional debugging output (if compiled with DEBUG). 
-h, --help 
Print a usage message on standard output and exit successfully. 
-v, --verbose 
Gives verbose runtime information about lvextend's activities. 
  
Examples

# vgextend vg00 /dev/sda4 /dev/sdn1

tries to extend the existing volume group "vg00" by the new physical volumes (see pvcreate(8) ) 
"/dev/sdn1" and /dev/sda4".   


Linux pvcreate command:
=======================

Linux / Unix Command: pvcreate 
 
 Command Library  

NAME
pvcreate - initialize a disk or partition for use by LVM   

SYNOPSIS
pvcreate [-d|--debug] [-f[f]|--force [--force]] [-y|--yes] [-h|--help] [-v|--verbose] [-V|--version] 
         PhysicalVolume [PhysicalVolume...]   

DESCRIPTION
pvcreate initializes PhysicalVolume for later use by the Logical Volume Manager (LVM). Each PhysicalVolume 
can be a disk partition, whole disk, meta device, or loopback file. For DOS disk partitions, 
the partition id must be set to 0x8e using fdisk(8), cfdisk(8), or a equivalent. For whole disk devices 
only the partition table must be erased, which will effectively destroy all data on that disk. This can be done 
by zeroing the first sector with: 

# dd if=/dev/zero of=PhysicalVolume bs=512 count=1 

Continue with vgcreate(8) to create a new volume group on PhysicalVolume, or vgextend(8) to add PhysicalVolume 
to an existing volume group.   

OPTIONS
-d, --debug 
      Enables additional debugging output (if compiled with DEBUG). 
-f, --force 
      Force the creation without any confirmation. You can not recreate (reinitialize) a physical volume belonging 
      to an existing volume group. In an emergency you can override this behaviour with -ff. In no case case can you 
      initialize an active physical volume with this command. 
-s, --size 
      Overrides the size of the physical volume which is normally retrieved. Useful in rare case where this value 
      is wrong. More useful to fake large physical volumes of up to 2 Terabyes - 1 Kilobyte on smaller devices 
      for testing purposes only where no real access to data in created logical volumes is needed. If you wish 
      to create the supported maximum, use "pvcreate -s 2147483647k PhysicalVolume [PhysicalVolume ...]". 
      All other LVM tools will use this size with the exception of lvmdiskscan(8) 
-y, --yes 
      Answer yes to all questions. 
-h, --help 
      Print a usage message on standard output and exit successfully. 
-v, --verbose 
      Gives verbose runtime information about pvcreate's activities. 
-V, --version 
      Print the version number on standard output and exit successfully. 
  
Example

Initialize partition #4 on the third SCSI disk and the entire fifth SCSI disk for later use by LVM: 

# pvcreate /dev/sdc4 /dev/sde 


33.5 Installing a Cluster filesystem on Linux:
==============================================

Suppose, in this example, we have 2 Linux nodes, and we want to create a scsi attached shared disksystem.
We plan to use OCFS2 as the Clustered FileSystem.

First, we partition the disks to raw volumes.

This example uses /dev/sdb (an empty SCSI disk with no existing partitions) to create a single partition for the entire disk (36 GB). 
We will do this for all disks.


Ex:
# fdisk /dev/sdb
Device contains neither a valid DOS partition table, nor Sun, SGI or OSF disklabel
Building a new DOS disklabel. Changes will remain in memory only,
until you decide to write them. After that, of course, the previous
content won't be recoverable.


The number of cylinders for this disk is set to 4427.
There is nothing wrong with that, but this is larger than 1024,
and could in certain setups cause problems with:
1) software that runs at boot time (e.g., old versions of LILO)
2) booting and partitioning software from other OSs
 (e.g., DOS FDISK, OS/2 FDISK)

Command (m for help): p

Disk /dev/sdb: 255 heads, 63 sectors, 4427 cylinders
Units = cylinders of 16065 * 512 bytes

 Device Boot Start End Blocks Id System

Command (m for help): n
Command action
 e extended
 p primary partition (1-4)
p
Partition number (1-4): 1
First cylinder (1-4427, default 1):
Using default value 1
Last cylinder or +size or +sizeM or +sizeK (1-4427, default 4427):
Using default value 4427

Command (m for help): w
The partition table has been altered!

Calling ioctl() to re-read partition table.

WARNING: If you have created or modified any DOS 6.x
partitions, please see the fdisk manual page for additional
information.
Syncing disks.


Now verify the new partition: 
Ex:
# fdisk -l /dev/sdb

Disk /dev/sdb: 36.4 GB, 36420075008 bytes
255 heads, 63 sectors/track, 4427 cylinders
Units = cylinders of 16065 * 512 = 8225280 bytes

   Device Boot      Start         End      Blocks   Id  System
/dev/sdb1   *           1        4427    35559846   83  Linux

Repeat the above steps for each disk to be partitioned.   Disk partitioning should be done from one node only.  
When finished partitioning, run the 'partprobe' command as root on each of the remaining cluster nodes in order to assure 
that the new partitions are configured.

Ex:
# partprobe

  
Oracle Cluster File System (OCFS) Release 2
-------------------------------------------

OCFS2 is a general-purpose cluster file system that can be used to store Oracle Clusterware files, Oracle RAC database files,
 Oracle software, or any other types of files normally stored on a standard filesystem such as ext3.  
This is a significant change from OCFS Release 1, which only supported Oracle Clusterware files and Oracle RAC database files.   

Obtain OCFS2

OCFS2 is available free of charge from Oracle as a set of three RPMs:  a kernel module, support tools, and a console.  
There are different kernel module RPMs for each supported Linux kernel so be sure to get the OCFS2 kernel module for your Linux kernel.  
OCFS2 kernel modules may be downloaded from http://oss.oracle.com/projects/ocfs2/files/ and the tools and console may be downloaded from 
http://oss.oracle.com/projects/ocfs2-tools/files/.  

To determine the kernel-specific module that you need, use uname -r. 

# uname -r
2.6.9-22.ELsmp

For this example I downloaded:
ocfs2console-1.0.3-1.i386.rpm
ocfs2-tools-1.0.3-1.i386.rpm
ocfs2-2.6.9-22.ELsmp-1.0.7-1.i686.rpm 

>>> Install OCFS2 as root on each cluster node 

# rpm -ivh ocfs2console-1.0.3-1.i386.rpm \
ocfs2-tools-1.0.3-1.i386.rpm \
ocfs2-2.6.9-22.ELsmp-1.0.7-1.i686.rpm

Preparing...                ########################################### [100%]
   1:ocfs2-tools            ########################################### [ 33%]
   2:ocfs2console           ########################################### [ 67%]
   3:ocfs2-2.6.9-22.ELsmp   ########################################### [100%]
Configure OCFS2 

Run ocfs2console as root: 
# ocfs2console


Now a Graphical interface will appear:

Select Cluster ? Configure Nodes
Click on Add and enter the Name and IP Address of each node in the cluster

Once all of the nodes have been added, click on Cluster --> Propagate Configuration.  This will copy the OCFS2 configuration file 
to each node in the cluster.  You may be prompted for root passwords as ocfs2console uses ssh to propagate the configuration file.  
Leave the OCFS2 console by clicking on File --> Quit.  It is possible to format and mount the OCFS2 partitions using the ocfs2console GUI; however, 
this guide will use the command line utilities. 


>>> Enable OCFS2 to start at system boot: 

As root, execute the following command on each cluster node to allow the OCFS2 cluster stack to load at boot time:
/etc/init.d/o2cb enable
Ex:
# /etc/init.d/o2cb enable


Writing O2CB configuration: OK
Loading module "configfs": OK
Mounting configfs filesystem at /config: OK
Loading module "ocfs2_nodemanager": OK
Loading module "ocfs2_dlm": OK
Loading module "ocfs2_dlmfs": OK
Mounting ocfs2_dlmfs filesystem at /dlm: OK


 Starting cluster ocfs2: OK


>>> Create a mount point for the OCFS filesystem 

As root on each of the cluster nodes, create the mount point directory for the OCFS2 filesystem
Ex:
# mkdir /u03


>>> Create the OCFS2 filesystem on the unused disk partition:

The example below creates an OCFS2 filesystem on the unused /dev/sdc1 partition with a volume label of "/u03" (-L /u03), a block size of 4K (-b 4K) 
and a cluster size of 32K (-C 32K) with 4 node slots (-N 4).  See the OCFS2 Users Guide for more information on mkfs.ocfs2 command line options.

Ex:
# mkfs.ocfs2 -b 4K -C 32K -N 4 -L /u03 /dev/sdc1

mkfs.ocfs2 1.0.3
Filesystem label=/u03
Block size=4096 (bits=12)
Cluster size=32768 (bits=15)
Volume size=36413280256 (1111245 clusters) (8889960 blocks)
35 cluster groups (tail covers 14541 clusters, rest cover 32256 clusters)
Journal size=33554432
Initial number of node slots: 4
Creating bitmaps: done
Initializing superblock: done
Writing system files: done
Writing superblock: done
Writing lost+found: done
mkfs.ocfs2 successful


>>> Mount the OCFS2 filesystem:

Since this filesystem will contain the Oracle Clusterware files and Oracle RAC database files, we must ensure that all I/O 
to these files uses direct I/O (O_DIRECT).  Use the "datavolume" option whenever mounting the OCFS2 filesystem to enable direct I/O.  
Failure to do this can lead to data loss in the event of system failure.

Ex:
# mount -t ocfs2 -L /u03 -o datavolume /u03

Notice that the mount command uses the filesystem label (-L  u03) used during the creation of the filesystem. This is a handy way to refer 
to the filesystem without having to remember the device name. 

To verify that the OCFS2 filesystem is mounted, issue the mount command or run df: 

# mount -t ocfs2
/dev/sdc1 on /u03 type ocfs2 (rw,_netdev,datavolume)

# df /u03
Filesystem           1K-blocks      Used Available Use% Mounted on
/dev/sdc1             35559840    138432  35421408   1% /u03

The OCFS2 filesystem can now be mounted on the other cluster nodes. 

To automatically mount the OCFS2 filesystem at system boot, add a line similar to the one below to /etc/fstab on each cluster node: 
LABEL=/u03   /u03    ocfs2   _netdev,datavolume,nointr 0 0


Create the directories for shared files 
CRS files
mkdir /u03/oracrs
chown oracle:oinstall /u03/oracrs
chmod 775 /u03/oracrs

Database files
mkdir /u03/oradata
chown oracle:oinstall /u03/oradata
chmod 775 /u03/oradata


34. SWAP space:
===============

34.1 Solaris:
-------------

-- View swap space:
-- ----------------

The /usr/sbib/swap utility provides a method of adding, deleting, and monitoring the system swap areas
used by the memory manager.

# swap -l

The -l option can be used to list swap space. The system displays information like:
swapfile           dev      swaplo    blocks    free
/dev/dsk/c0t0d0s3  136,3        16    302384    302384

path  : the pathname for the swaparea. In this example the pathname is swapfile.
dev   : the major/minor device number is in decimal if it's a block special device; zeroes otherwise
swaplo: the offset in 512 byte blocks where usable swapspace begins
blocks: size in 512 byte blocks. The swaplen value can be adjusted as a kernel parameter.
free  : free 512 byte blocks.
The swap -l command does not include physical memory in it's calculation of swap space.

# swap -s

The -s option can be used to list a summary of the system's virtual swap space.
total: 31760k bytes allocated + 5952k reserved = 37712k used, 202928k available

These numbers are in 1024 byte blocks.

-- Add swap area's:
-- ----------------

There are 2 methods available for adding more swap to your system.

(1) create a secondary swap partition:
(2) create a swapfile in an existing UFS file system

(1) Creating a secondary swap partition requires additional unused diskspace. You must use the format coommand
to create a new partition and filesystem on a disk.
Suppose we have the /data directory currently on slice 5 and is 200MB in size.
- free up the /data directory (save the contents to another location )
- unmount /dev/dsk/c0t0d0s5
- use format:
  Enter partition id tag (unassigned): swap
  Enter partition permission flags (wm): wu
  Enter new starting cil(3400): return
  Enter partition size: return
  Then label the disk as follows
  Partition> la
  Ready to label disk? y

- Run the newfs command on that partition to create a fresh filesystem on slice 5
  newfs /dev/rdsk/c0t0d0s5
- Make an entry to the /etc/vfstab file
- Run the swapadd script to add the swap to your system as follows:
  /sbin/swapadd
- verify that the swap has been added with swap -l


(2) The other method to add more swap space is to use the mkfile and swap commands
to designate a part of an existing UFS filesystem as a supplementary swap area.
You can use it as a temporary solution, or as a solution for longer duration as well,
but a swap file is just another file in the filesystem, so you cannot unmount that
filesystem while the swapfile is in use.
The following steps enable you to add more swap space without repartitioning a disk.
- As root, use df -k to locate a suitable filesystem. Suppose /data looks allright
  for this purpose
- Use the mkfile command to add a 50MB swapfile named swapfile in the /data partition.

  mkfile 50m /data/swapfile

- use ls -l /data to verify that the file has been created.
  Notice that the sticky bit has automatically been set.
- Activate the swaparea with the swap command as follows:

  /usr/sbin/swap -a /data/swapfile

- verify that the swap has been added with swap -l
  The system responds something like this:
  
swapfile           dev      swaplo    blocks    free
/dev/dsk/c0t0d0s3  136,3        16    302384    302384
/data/swapfile       -          16    102384    102384

If this will be a permanent swaparea, add an entry for the swapfile in the vfstab file.
/data/swapfile - - swap - no -

-- Removing a swapfile:
-- --------------------

As root use the swap -d command to remove a swaparea is follows

swap -d /dev/dsk/c0t0d0s5  for a swap partition
swap -d /data/swapfile     for a swapfile

Use the swap -l command to verify that the swaparea is gone.
Edit the /etc/vfstab file and delete the entry for the swapfile if neccessary.

In case of a swapfile, just remove the file with rm /data/swapfile

-- Creating a Temporary File System:
-- ---------------------------------

Create a directory which will serve as the mount point for the TMPFS file system.
There is no command such as newfs to create a TMPFS file system before mounting it.
The TMPFS file system actually gets created in RAM when you execute the mount command
and specify a filesystem type of TMPFS. The following example creates a new directory
/export/data and mounts a TMPFS filesystem, limiting it to 25MB.

mount -F tmpfs -o size=25m swap /export/data 


34.2 AIX:
---------

The installation creates a default paging logical volume, hd6, on drive hdisk0,
also referred as primary paging space.

The reports from the "vmstat" and "topas" commands indicate the amount of paging space I/O that is
taking place. 

Showing paging space:
---------------------

The lsps -a command provides a snapshot of the current utilization of each of the paging spaces
on the system, while the lsps -s command provides a summary of the total active paging space
and its current utilization.

# lsps -a
Page Space    Physical Volume    Volume Group     Size    %Used    Active    Auto  Type
paging00      hdisk1             rootvg           80MB    1        yes       yes   lv
hd6           hdisk1             rootvg          256MB    1        yes       yes   lv

The /etc/swapspaces file specifies the paging-space devices that are activated by the swapon -a command.
A pagingspace is added to this file when its created by the mkps -a command, and removed from
the file when rmps is used. 

You can also try:

# pstat -s

Managing Paging space:
----------------------

The following commands are used to manage paging space:

chps      : changes the attributes of a paging space
lsps      : displays the characteristics of a paging space
pstat -s  : displays the characteristics of a paging space
mkps      : creates an additional paging space
rmps      : removes an inactive paging space
swapon    : activates a paging space
swapoff   : deactivates one or more paging spaces


Managing Paging behaviour:
--------------------------

There are several page space allocation policies available in AIXr.

- Deferred Page Space Allocation (DPSA) 
- Late Page Space Allocation (LPSA) 
- Early Page Space Allocation (EPSA) 
- Deferred page space allocation

The deferred page space allocation policy is the default policy in AIX. 

Late page space allocation LPSA
The AIX operating system provides a way to enable the late page space allocation policy, which means that the disk block 
for a paging space page is only allocated when the corresponding in-memory page is touched. 

Early page space allocation EPSA
If you want to ensure that a process will not be killed due to low paging conditions, this process can 
preallocate paging space by using the early page space allocation policy. 

Choosing between LPSA and DPSA with the vmo command:

Using the "vmo -o defps" command enables turning the deferred page space allocation, or DPSA, 
on or off in order to preserve the late page space allocation policy, or LPSA. 

Paging space and virtual memory
The vmstat command (avm column), ps command (SIZE, SZ), and other utilities report the amount 
of virtual memory actually accessed because with DPSA, the paging space might not get touched. 


Show paging space usage:

# lsps -a

Increase paging space:

# chps -s 32 hd6   32x32MB

where we increased the size of hd6 with 30 LP's.

Reducing paging space:

# chps -d 1 hd6

where we decreased the size of hd6 with 1 LP.


mkps:
-----

To Add a Logical Volume for Additional Paging Space
mkps [ -a ] [ -n ] [ -t lv ] -s LogicalPartitions VolumeGroup [ PhysicalVolume ]

To create a paging space in volume group myvg that has four logical partitions and is activated immediately 
and at all subsequent system restarts, enter: 

# mkps  -a  -n  -s 4 myvg

To create a paging space in rootvg on hdisk0

# mkps -a -n -s 30 rootvg hdisk0

rmps:
-----

Before AIX 5L:
Active paging spaces cannot be removed. It must first be made inactive.
Use the chps command so the paging space is not used on the next restart.
After reboot, the paging space is inactive and can be removed with the rmps command.

AIX 51 or later:
Use the swapoff command to dynamically deactive the paging space, then use the rmps command.
# swapoff /dev/paging03
# rmps paging03

chps:
-----

As from AIX 5L you can use the chps -d command, to decrease the size of a paging space, 
without having to deactive it, then reboot, then remove, and then recreate it with a smaller size.
Decrease it with a number of LP's like:
# chps -d 2 paging03

chps -a {y|n} paging00 : specifies that the paging space paging00 is active (y) or inactive (n) at subsequent system restarts.
chps -s 10 paging02 : adds ten LPs to paging02 without rebooting.
chps -d 5 paging01 : removes five LPs from paging01 without rebooting.
chps -d 50 hd6 : removes fifty LPs from hd6 without rebooting.


List the active paging spaces:
------------------------------

# lsps -a     or lsps -s

# pg /etc/swapspaces
hd6:
         dev=/dev/hd6

paging00
         dev=/dev/paging00


Note on paging on AIX:
----------------------

If the amount of paging space is less than the amount of real memory in the system, it's possible the system 
will run out of paging space before real memory. This is because AIX performs early allocation of page space. 
When a page is referenced, real memory and paging space blocks are allocated. If there are less paging space blocks 
then real memory pages, paging space will be exhaused before all of real memory is consumed.

Early allocation algorithm
The second operating system's paging-space-slot-allocation method is intended for use in installations 
where this situation is likely, or where the cost of failure to complete is intolerably high. Aptly called early allocation, 
this algorithm causes the appropriate number of paging-space slots to be allocated at the time the 
virtual-memory address range is allocated, for example, with the malloc() subroutine. If there are not 
enough paging-space slots to support the malloc() subroutine, an error code is set. 
The early-allocation algorithm is invoked as follows:

# export PSALLOC=early
This example causes all future programs to be executed in the environment to use early allocation. 
The currently executing shell is not affected.

Early allocation is of interest to the performance analyst mainly because of its paging-space size implications. 
If early allocation is turned on for those programs, paging-space requirements can increase many times. 
Whereas the normal recommendation for paging-space size is at least twice the size of the system's real memory, 
the recommendation for systems that use PSALLOC=early is at least four times the real memory size. 
Actually, this is just a starting point. Analyze the virtual storage requirements of your workload and 
allocate paging spaces to accommodate them. As an example, at one time, the AIXwindows server required 250 MB of paging space 
when run with early allocation.

When using PSALLOC=early, the user should set a handler for the following SIGSEGV signal by pre-allocating and setting 
the memory as a stack using the sigaltstack function. Even though PSALLOC=early is specified, when there 
is not enough paging space and a program attempts to expand the stack, the program may receive the SIGSEGV signal.

Deferred allocation algorithm
The third operating system's paging-space-slot-allocation method is the default beginning with AIX 4.3.2 
Deferred Page Space Allocation (DPSA) policy delays allocation of paging space until it is necessary to page out the page, 
which results in no wasted paging space allocation. This method can save huge amounts of paging space, which means disk space.
Best to use Deffered.

On some systems, paging space might not ever be needed even if all the pages accessed have been touched. 
This situation is most common on systems with very large amount of RAM. However, this may result in overcommitment 
of paging space in cases where more virtual memory than available RAM is accessed.

To disable DPSA and preserve the Late Page Space Allocation policy, run the following command:

# vmo -o defps=0

To activate DPSA, run the following command:

# vmo -o defps=1

In general, system performance can be improved by DPSA, because the overhead of allocating page space after 
page faults is avoided the. Paging space devices need less disk space if DPSA is used


34.3 Linux:
-----------


-- Check the swapspace:

# cat /proc/meminfo 
# cat /proc/swaps
# /sbin/swapon -s

-- Creating swap space using a partition

Create a partition of the proper size using fdisk.
Format the partition, for example

# mkswap -c /dev/hda4

Enable the swap, for example

# swapon /dev/hd4

If you want the swap space enabled after boot, include the appropriate entry into /etc/fstab, for example
/dev/hda4  swap swap defaults 0 0

If you need to disable the swap, you can do it with
# swapoff /dev/hda4


-- Creating swap space using a swapfile

Create a file with the size of your swapfile
# dd if=/dev/zero of=/swapfile bs=1024 count=8192

Setup the file with the command
# mkswap /swapfile 8192

Enable the swap with the command
# swapon /swapfile

When you are done using the swapfile, you can turn it off and remove with
# swapoff /swapfile
# rm /swapfile


34.4: Note about swap:
----------------------

Page replacement in Linux 2.4 memory management
Rik van Riel 
Conectiva Inc. 
riel@conectiva.com.br, http://www.surriel.com/ 


Abstract 
While the virtual memory management in Linux 2.2 has decent performance for many workloads, it suffers from a number of problems. 
The first part of this paper contains a description of how the Linux 2.2 VMM works and an analysis of why 
it has bad behaviour in some situations. 
The way in which a lot of this behaviour has been fixed in the Linux 2.4 kernel is described in the second part of the paper. 
Due to Linux 2.4 being in a code freeze period while these improvements were implemented, only known-good solutions 
have been integrated. A lot of the ideas used are derived from principles used in other operating systems, 
mostly because we have certainty that they work and a good understanding of why, making them suitable for integration 
into the Linux codebase during a code freeze. 


--Linux 2.2 memory management 
The memory management in the Linux 2.2 kernel seems to be focussed on simplicity and low overhead. While this works pretty well in practice for most systems, it has some weak points left and simply falls apart under some scenarios. 

Memory in Linux is unified, that is all the physical memory is on the same free list and can be allocated to any of the following memory pools on demand. Most of these pools can grow and shrink on demand. Typically most of a system's memory will be allocated to the data pages of processes and the page and buffer caches. 


The slab cache: this is the kernel's dynamically allocated heap storage. This memory is unswappable, but once all objects within one (usually page-sized) area are unused, that area can be reclaimed. 

The page cache: this cache is used to cache file data for both mmap() and read() and is indexed by (inode, index) pairs. No dirty data exists in this cache; whenever a program writes to a page, the dirty data is copied to the buffer cache, from where the data is written back to disk. 

The buffer cache: this cache is indexed by (block device, block number) tuples and is used to cache raw disk devices, inodes, directories and other filesystem metadata. It is also used to perform disk IO on behalf of the page cache and the other caches. For disk reads the pagecache bypasses this cache and for network filesystems it isn't used at all. 

The inode cache: this cache resides in the slab cache and contains information about cached files in the system. Linux 2.2 cannot shrink this cache, but because of its limited size it does need to reclaim individual entries. 

The dentry cache: this cache contains directory and name information in a filesystem-independent way and is used to lookup files and directories. This cache is dynamically grown and shrunk on demand. 

SYSV shared memory: the memory pool containing the SYSV shared memory segments is managed pretty much like the page cache, but has its own infrastructure for doing things. 

Process mapped virtual memory: this memory is administrated in the process page tables. Processes can have page cache or SYSV shared memory segments mapped, in which case those pages are managed in both the page tables and the data structures used for respectively the page cache or the shared memory code. 


--Linux 2.2 page replacement 
The page replacement of Linux 2.2 works as follows. When free memory drops below a certain threshold, the pageout daemon (kswapd) is woken up. The pageout daemon should usually be able to keep enough free memory, but if it isn't, user programs will end up calling the pageout code itself. 

The main pageout loop is in the function try_to_free_pages, which starts by freeing unused slabs from the kernel memory pool. After that, it calls the following functions in a loop, asking each of them to scan a small part of their part of memory until enough memory has been freed. 


shrink_mmap is a classical clock algorithm, which loops over all physical pages, clearing referenced bits, queueing old dirty pages pages for IO and freeing old clean pages. The main disadvantage it has compared to a clock algorithm, however, is that it isn't able to free pages which are in use by a program or a shared memory segment. Those pages need to be unmapped by swap_out first. 

shm_swap scans the SYSV shared memory segments, swapping out those pages that haven't been referenced recently and which aren't mapped into any process. 

swap_out scans the virtual memory of all processes in the system, unmapping pages which haven't been referenced recently, starting swapout IO and placing those pages in the page cache. 

shrink_dcache_memory recaims entries from the VFS name cache. This is not directly reusable memory, but as soon as a whole page of these entries gets unused we can reclaim that page. 
Some balancing between these memory freeing function is achieved by calling them in a loop, starting of by asking each of these functions to scan a little bit of their memory, as each of these funnctions accepts a priority argument which tells them how big a percentage of their memory to scan. If not enough memory is freed in the first loop, the priority is increased and the functions are called again. The idea behind this scheme is that when one memory pool is heavily used, it will not give up its resources lightly and we'll automatically fall through to one of the other memory pools. However, this scheme relies on each of the memory pools to react in a similar way to the priority argument under different load conditions. This doesn't work out in practice because the memory pools just have fundamentally different properties to begin with. 


--Problems with the Linux 2.2 page replacement 

Balancing between evicting pages from the file cache, evicting unused process pages and evicting pages from shm segments. If memory pressure is "just right" shrink_mmap is always successful in freeing cache pages and a process which has been idle for a day is still in memory. This can even happen on a system with a fairly busy filesystem cache, but only with the right phase of moon. 

Simple NRU[Note] replacement cannot accurately identify the working set versus incidentally accessed pages and can lead to extra page faults. This doesn't hurt noticably for most workloads, but it makes a big difference in some workloads and can be fixed easily, mostly since the LFU replacement used in older Linux kernels is known to work. 

Due to the simple clock algorithm in shrink_mmap, sometimes clean, accessed pages can get evicted before dirty, old pages. With a relatively small file cache that mostly consists of dirty data, eg unpacking a tarball, it is possible for the dirty pages to evict the (clean) metadata buffers that are needed to write the dirty data to disk. A few other corner cases with amusing variations on this theme are bound to exist. 

The system reacts badly to variable VM load or to load spikes after a period of no VM activity. Since kswapd, the pageout daemon, only scans when the system is low on memory, the system can end up in a state where some pages have referenced bits from the last 5 seconds, while other pages have referenced bits from 20 minutes ago. This means that on a load spike the system has no clue which are the right pages to evict from memory, this can lead to a swapping storm, where the wrong pages are evicted and almost immediately afterwards faulted back in, leading to the pageout of another random page, etc... 

Under very heavy loads, NRU replacement of pages simply doesn't cut it. More careful and better balanced pageout eviction and flushing is called for. With the fragility of the Linux 2.2 pageout framework this goal doesn't really seem achievable. 
The facts that shrink_mmap is a simple clock algorithm and relies on other functions to make process-mapped pages freeable makes it fairly unpredictable. Add to that the balancing loop in try_to_free_pages and you get a VM subsystem which is extremely sensitive to minute changes in the code and a fragile beast at its best when it comes to maintenance or (shudder) tweaking. 


--Changes in Linux 2.4 
For Linux 2.4 a substantial development effort has gone into things like making the VM subsystem fully fine-grained for SMP systems and supporting machines with more than 1GB of RAM. Changes to the pageout code were done only in the last phase of development and are, because of that, somewhat conservative in nature and only employ known-good methods to deal with the problems that happened in the page replacement of the Linux 2.2 kernel. Before we get to the page replacement changes, however, first a short overview of the other changes in the 2.4 VM: 


More fine-grained SMP locking. The scalability of the VM subsystem has improved a lot for workloads where multiple CPUs are reading or writing the same file simultaneously; for example web or ftp server workloads. This has no real influence on the page replacement code. 

Unification of the buffer cache and the page cache. While in Linux 2.2 the page cache used the buffer cache to write back its data, needing an extra copy of the data and doubling memory requirements for some write loads, in Linux 2.4 dirty page cache pages are simply added in both the buffer and the page cache. The system does disk IO directly to and from the page cache page. That the buffer cache is still maintained separately for filesystem metadata and the caching of raw block devices. Note that the cache was already unified for reads in Linux 2.2, Linux 2.4 just completes the unification. 

Support for systems with up to 64GB of RAM (on x86). The Linux kernel previously had all physical memory directly mapped in the kernel's virtual address space, which limited the amount of supported memory to slightly under 1GB. For Linux 2.4 the kernel also supports additional memory (so called "high memory" or highmem), which can not be used for kernel data structures but only for page cache and user process memory. To do IO on these pages they are temporarily mapped into kernel virtual memory and the data is copied to or from a bounce buffer in "low memory". 
At the same time the memory zone for ISA DMA (0 - 16 MB physical address range) has also been split out into a separate page zone. This means larger x86 systems end up with 3 memory zones, which all need their free memory balanced so we can continue allocating kernel data structures and ISA DMA buffers. The memory zones logic is generalised enough to also work for NUMA systems. 


The SYSV shared memory code has been removed and replaced with a simple memory filesystem which uses the page cache for all its functions. It supports both POSIX SHM and SYSV SHM semantics and can also be used as a swappable memory filesystem (tmpfs). 
Since the changes to the page replacement code took place after all these changes and in the (one and a half year long) code freeze period of the Linux 2.4 kernel, the changes have been kept fairly conservative. On the other hand, we have tried to fix as many of the Linux 2.2 page replacement problems as possible. Here is a short overview of the page replacement changes: they'll be described in more detail below. 


Page aging, which was present in the Linux 1.2 and 2.0 kernels and in FreeBSD has been reintroduced into the VM. However, a few small changes have been made to avoid some artifacts of virtual page based aging. 

To avoid the eviction of "wrong" pages due to interactions from page aging and page flushing, the page aging and flushing has been separated. There are active and inactive page lists. 

Page flushing has been optimised to avoid too much interference by writeout IO on the more time-critical disk read IO. 

Controlled background page aging during periods of little or no VM activity in order to keep the system in a state where it can easily deal with load spikes. 

Streaming IO is detected; we do early eviction on the pages that have already been used and reward the IO stream with more agressive readahead. 

--Linux 2.4 page replacement changes in detail 
The development of the page replacement changes in Linux 2.4 has been influenced by two main factors. Firstly the bad behaviours of Linux 2.2 page replacement had to be fixed, using only known-good strategies because the development of Linux 2.4 had already entered the "code freeze" state. Secondly the page replacement had to be more predictable and easier to understand than Linux 2.2 because tuning the page replacement in Linux 2.2 was deserving of the proverbial label "subtle and quick to upset". This means that only VM ideas that are well understood and have little interactions with the rest of the system were integrated. Lots of ideas were taken from other freely available operating systems and literature. 


--Page aging 
Page aging was the first easy step in making the bad border-case behaviour from Linux 2.2 go away, it works reasonably well in Linux 1.2, Linux 2.0 and FreeBSD. Page aging allows us to make a much finer distinction between pages we want to keep in memory and pages we want to swap out than the NRU aging in Linux 2.2. 
Page aging in these OSes works as follows: for each physical page we keep a counter (called age in Linux, or act_count in FreeBSD) that indicates how desirable it is to keep this page in memory. When scanning through memory for pages to evict, we increase the page age (adding a constant) whenever we find that the page was accessed and we decrease the page age (substracting a constant) whenever we find that the page wasn't accessed. When the page age (or act_count) reaches zero, the page is a candidate for eviction. 

However, in some situations the LFU[Note] page aging of Linux 2.0 is known to have too much CPU overhead and adjust to changes in system load too slowly. Furthermore, research[Smaragdis, Kaplan, Wilson] has shown that recency of access is a more important criteria for page replacement than frequency. 

These two problems are solved by doing exponential decline of the page age (divide by two instead of substracting a constant) whenever we find a page that wasn't accessed, resulting in page replacement which is closer to LRU[Note] than LFU. This reduces the CPU overhead of page aging drastically in some cases; however, no noticable change in swap behaviour has been observed. 

Another artifact comes from the virtual address scanning. In Linux 1.2 and 2.0 the system reduces the page age of a page whenever it sees that the page hasn't been accessed from the page table which it is currently scanning, completely ignoring the fact that the page could have been accessed from other page tables. This can put a severe penalty on heavily shared pages, for example the C library. 

This problem is fixed by simply not doing "downwards" aging from the virtual page scans, but only from the physical-page based scanning of the active list. If we encounter pages which are not referenced, present in the page tables but not on the active list, we simply follow the swapout path to add this page to the swap cache and the active list so we'll be able to lower the page age of this page and swap it out as soon as the page age reaches zero. 


--Multiple page lists 
The bad interactions between page aging and page flushing, where referenced clean pages were freed before old dirty pages, is fixed by keeping the pages which are candidates for eviction separated from the pages we want to keep in memory (page age zero vs. nonzero). We separate the pages out by putting them on various page lists and having separate algorithms deal with each list. 
Pages which are not (yet) candidate for eviction are in process page tables, on the active list or both. Page aging as described above happens on these pages, with the function refill_inactive() balancing between scanning the page tables and scanning the active list. 

When the page age on a page reaches zero, due to a combination of pageout scanning and the page not being actively used, the page is moved to the inactive_dirty list. Pages on this list are not mapped in the page tables of any process and are, or can become, reclaimable. Pages on this list are handled by the function page_launder(), which flushes the dirty pages to disk and moves the clean pages to the inactive_clean list. 

Unlike the active and inactive_dirty lists, the inactive_clean list isn't global but per memory zone. The pages on these lists can be immediately reused by the page allocation code and count as free pages. These pages can also still be faulted back into where it came from, since the data is still there. In BSD this would be called the "cache" queue. 


--Dynamically sized inactive list 
Since we do page aging to select which pages to evict, having a very large statically sized inactive list (like FreeBSD has) doesn't seem to make much sense. In fact, it would cancel out some of the effects of doing the page aging in the first place: why spend much effort selecting which pages to evict[Dillon] when you keep as much as 33% of your swappable pages on the inactive list? Why do careful page aging when 33% of your pages end up as candidates for eviction at the same priority and you've effectively undone the aging for those 33% of pages which are candidates for eviction? 
On the other hand, having lots of inactive pages to choose from when doing page eviction means you have more chances of avoiding writeout IO or doing better IO clustering. It also gives you more of a "buffer" to deal with allocations due to page faults, etc. 

Both a large and a small target size for the inactive page list have their benefits. In Linux 2.4 we have chosen for a middle ground by letting the system dynamically vary the size of the inactive list depending on VM activity, with an artificial upper limit to make sure the system always preserves some aging information. 

Linux 2.4 keeps a floating average of the amount of pages evicted per second and sets the target for the inactive list and the free list combined to the free target plus this average number of page steals per second. Not only does this second give us enough time to do all kinds of page flushing optimisations, it also is small enough to keep page age distribution within the system intact, allowing us to make good choices on which pages to evict and which pages to keep. 


--Optimised page flushing 
Writing out pages from the inactive_dirty list as we encounter them can cause a system to totally destroy read performance because of the extra disk seeks done. A better solution is to delay writeout of dirty pages and let these dirty pages accumulate until we can do better IO clustering so that these pages can be written out to disk with less disk seeks and less interference with read performance. 
Due to the development of the page replacement changes happening in the code freeze, the system currently has a rather simple implementation of what's present in FreeBSD 4.2. As long as there are enough clean inactive pages around, we keep moving those to the inactive_clean list and never bother with syncing out the dirty pages. Note that this catches both clean pages and pages which have been written to disk by the update daemon (which commits filesystem data to disk periodically). 

This means that under loads where data is seldom written we can avoid writing out dirty inactive pages most of the time, giving us much better latencies in freeing pages and letting streaming reads continue without the disk head moving away to write out data all the time. Only under loads where lots of pages are being dirtied quickly does the system suffer a bit from syncing out dirty data irregularly. 

Another alternative would have been the strategy used in FreeBSD 4.3, where dirty pages get to stay in the inactive list longer than clean pages but are synced out before the clean pages are exhausted. This strategy gives more consistent pageout IO in FreeBSD during heavy write loads. However, a big factor causing the irregularities in pageout writes using the simpler strategy above may well be caused because of the huge inactive list target in FreeBSD (33It is not at all clear what this more complicated strategy would do when used on the dynamically sized inactive list on Linux 2.4, because of this Linux 2.4 uses the better understood strategy of evicting clean inactive pages first and only after those are gone start syncing the dirty ones. 


--Background page aging 
On many systems the normal operating mode is that after a period of relative activity a sudden load spike comes in and the system has to deal with that as gracefully as possible. Linux 2.2 has the problem that, with the lack of an inactive page list, it is not clear at all which pages should be evicted when a sudden demand for memory kicks in. 
Linux 2.4 is better in this respect, with the reclaim candidates neatly separated out on the inactive list. However, the inactive list could have any random size the moment VM pressure drops off. We'd like get the system in a more predictable state while the VM pressure is low. In order to achieve this, Linux 2.4 does background scanning of the pages, trying to get a sane amount of pages on the inactive list, but without scanning agressively so only truly idle pages will end up on the inactive list and the scanning overhead stays small. 


--Drop behind 
Streaming IO doesn't just have readahead, but also its natural complement: drop behind. After the program doing the streaming IO is done with a page, we depress its priority heavily so it will be a prime candidate for eviction. Not only does this protect the working set of running processes from being quickly evicted by streaming IO, but it also prevents the streaming IO from competing with the pageouts and pageins of the other running processes, which reduces the number of disk seeks and allows the streaming IO to proceed at a faster speed. Currently readahead and drop-behind only work for read() and write(); mmap()ed files and swap-backed anonymous memory aren't supported yet. 

--Conclusions 
Since the Linux 2.4 kernel's VM subsystem is still being tuned heavily, it is too early to come with conclusive figures on performance. However, initial results seem to indicate that Linux 2.4 generally has better performance than Linux 2.2 on the same hardware. 

Reports from users indicate that performance on typical desktop machines has improved a lot, even though the tuning of the new VM has only just begun. Throughput figures for server machines seem to be better too, but that could also be attributed to the fact that the unification of the page cache and the buffer cache is complete. 

One big difference between the VM in Linux 2.4 and the VM in Linux 2.2 is that the new VM is far less sensitive to subtle changes. While in Linux 2.2 a subtle change in the page flushing logic could upset page replacement, in Linux 2.4 it is possible to tweak the various aspects of the VM with predictable results and little to no side-effects in the rest of the VM. 

The solid performance and relative insensitivity to subtle changes in the environment can be taken as a sign that the Linux 2.4 VM is not just a set of simple fixes for the problems experienced in Linux 2.2, but also a good base for future development. 


Remaining issues 
The Linux 2.4 VM mainly contains easy to implement and obvious to verify solutions for some of the known problems Linux 2.2 suffers from. A number of issues are either too subtle to implement during the code freeze or will have too much impact on the code. The complete list of TODO items can be found on the Linux-MM page[Linux-MM]; here are the most important ones: 


Low memory deadlock prevention: with the arrival of journaling and delayed-allocation filesystems it is possible that the system will need to allocate memory in order to free memory; more precisely, to write out data so memory can become freeable. To remove the possibility for deadlock, we need to limit the number of outstanding transactions to a safe number, possibly letting each of the page flushing functions indicate how much memory it may need and doing bookkeeping of these values. Note that the same problem occurs with swap over network. 

Load control: no matter how good we can get the page replacement code, there will always be a point where the system ends up thrashing to death. Implementing a simple load control system, where processes get suspended in round-robin fashion when the paging load gets too high, can keep the system alive under heavy overload and allow the system to get enough work done to bring itself back to a sane state. 

RSS limits and guarantees: in some situations it is desirable to control the amount of physical memory a process can consume 
(the resident set size, or RSS). With the virtual address based page scanning of Linux' VM subsystem it is trivial to implement 
RSS ulimits and minimal RSS guarantees. Both help to protect processes under heavy load and allow the system administrator 
to better control the use of memory resources. 

VM balancing: in Linux 2.4, the balancing between the eviction of cache pages, swap-backed anonymous memory and the inode and dentry caches is essentially the same as in Linux 2.2. While this seems to work well for most cases there are some possible scenarios where a few of the caches push the other users out of memory, leading to suboptimal system performance. It may be worthwhile to look into improving the balancing algorithm to achieve better performance in "non-standard" situations. 

Unified readahead: currently readahead and drop-behind only works for read() and write(). Ideally they should work for mmap()ed files and anonymous memory too. Having the same set of algorithms for both read()/write(), mmap() and swap-backed anonymous memory will simplify the code and make performance improvements in the readahead and drop-behind code immediately available to all of the system. 


AIX swap notes:
---------------

Note 1:
-------

Q:

Hi All,

I'm seeing an interesting paging behavior (paging out to paging space when I don't think it should) on our AIX 5.3 TL3CSP system. 
First the system particulars:

AIX 5.3 TL3 with CSP
HACMP v5.2
Oracle 10g
28GB memory
8GB paging space
EMC LUNs for Oracle data.
CIO used for Oracle data.

Virtual memory tuned as such
vmo -p -o maxclient%=50 
vmo -p -o maxperm%=50 
vmo -p -o 'lru_file_repage=0' 
vmo -p -o 'minperm%=3' 

So, given that configuration, it is my understanding that AIX, when under memory pressure, will steal memory from the file cache 
instead of paging process memory out to the paging space (lru_file_repage = 0).

Now, this system works for the most part like I understand it should. Via nmon, I can watch it stealing memory from the FileSystemCache 
(numclient values decrease) when the box gets under memory pressure. However, every once in a while when under memory pressure, 
I can see that the system starts writing to the paging space when there is plenty of FileSystemCache available to steal from.

Below is a snapshot from the nmon 'm'emory switch:
nmon.jpg
You can see here that I've got 1.7GB paged out, while numclient is at 21%.

So, my question is, why does AIX page out when under memory pressure instead of stealing from the FileSystemCache memory like I want it to?


A:

Look at the Paging to/from the Paging Space - its zero. Once info is in the paging space its left there until the space is needed 
for something else. So at this point the server isn't actually paging. 

It Has paged in the past however.


Note 2:
------

AIX will always try to use 100% of real memory--> AIX will use the amount of 
memory solicited by your processes. The remaining capacity will be used as 
filesystem cache. 


You can change the minimum and maximum amounts of memory used to cache files 
with vmtune (vmo for 5.2+), and it is advised to do so if your're running 
databases with data on raw devices (since the db engine usually has its own 
cache algorithm, and AIX can't cache data on raw devices). The values to 
modify are minperm, maxperm, minclient and maxpin (use at you own risk!!!). 


Paging space use will be very low: 5% is about right--> A paging space so 
little used seems to be oversized. In general, the paging space should be 
under 40%, and the size must be determined accordingly to the application 
running (i.e. 4X the physical memory size for oracle). In AIX 5L a paging 
space can be reduced without rebooting. Anyway, AIX always uses some paging 
space, even keeping copies of the data on memory and on disk, as a 
"predictive" paging. 

Look in topas for the values "comp mem" (proceses) and "non comp mem" 
(filesystem cache) to see the distribution of the memory usage. Nmon can 
show you the top proceses by memory usage, along with many other statistics. 


There are several tools which can give you a more detailed picture of how 
memory is being used. "svmon" is very comprehensive. Tools such as topas 
and nmon will also give you a bit more information. 

Note 3:
-------

Memory utilization on AIX systems typically runs around 100%. This is often a source of concern. However, high memory utilization 
in AIX does not imply the system is out of memory. By design, AIX leaves files it has accessed in memory. 
This significantly improves performance when AIX reaccesses these files because they can be reread directly from memory, not disk*. 
When AIX needs memory, it discards files using a "least used" algorithm. This generates no I/O and has almost no performance impact 
under normal circumstances. 

Sustained paging activity is the best indication of low memory. Paging activity can be monitored using the "vmstat" command. 
If the "page-in" (PI) and "page-out" (PO) columns show non-zero values over "long" periods of time, then the system is short on memory. 
(All systems will show occasional paging, which is not a concern.) 

Memory requirements for applications can be empirically determined using the AIX "rmss"command. The "rmss" command is a test tool 
that dynamically reduces usable memory. The onset of paging indicates an application's minimum memory requirement. 

Finally, the "svmon" command can be used to list how much memory is used each process. The interpretation of the svmon output 
requires some expertise. See the AIX documentation for details. 


==================================================================
35 Volume group, logical volumes, and filesystem commands in HPUX:
==================================================================


35.1 Filesystems in HPUX:
-------------------------

HFS : used at HP-UX < v. 10
VxFS: used at HP-UX >= v. 10

Ofcourse, CDFS (cdroms), and other filesystem types, are supported.

HP-UX's implementation of a journaled file system, also known as JFS, is based on the version from 
VERITAS Software Inc. called VxFS.

Up through the 10.0 release of HP-UX, HFS has been the only available locally mounted read/write file system. 
Beginning at 10.01, you also have the option of using VxFS. (Note, however, that VxFS cannot be used 
as the root file system.)

As compared to HFS, VxFS allows much shorter recovery times in the event of system failure. 
It is also particularly useful in environments that require high performance or deal with large 
volumes of data. This is because the unit of file storage, called an extent, can be multiple blocks, 
allowing considerably faster I/O than with HFS. It also provides for minimal downtime by allowing 
online backup and administration - that is, unmounting the file system will not be necessary for 
certain tasks. You may not want to configure VxFS, though, on a system with limited memory 
because VxFS memory requirements are considerably larger than that for HFS.

Basic VxFS functionality is included with the HP-UX operating system software. Additional enhancements 
to VxFS are available as a separately orderable product called HP "OnlineJFS", product number B5117AA (Series 700) 
and B3928AA (Series 800). 


35.2 How to create a filesystem in HP-UX: an outline.
-----------------------------------------------------


-- Task 1. Estimate the Size Required for the Logical Volume  
 
-- Task 2. Determine If Sufficient Disk Space Is Available for the Logical Volume within Its Volume Group  
 
Use the vgdisplay command to calculate this information. vgdisplay will output data on one or more volume groups, 
including the physical extent size (under PE Size (Mbytes)) and the number of available physical extents 
(under Free PE). By multiplying these two figures together, you will get the number of megabytes available 
within the volume group. See vgdisplay(1M) for more information.

-- Task 3. Add a Disk to a Volume Group If Necessary 
 
If there is not enough space within a volume group, you will need to add a disk to a volume group.
To add a disk to an existing volume group, use pvcreate(1M) and vgextend(1M). You can also add a disk 
by creating a new volume group with pvcreate(1M) and vgcreate(1M).

-- Task 4. Create the Logical Volume  
 
Use lvcreate to create a logical volume of a certain size in the above volume group. See lvcreate(1M) for details.
Use lvcreate as in the following example:

Create a logical volume of size 100 MB in volume group /dev/vg03:
# lvcreate -L 100 /dev/vg03

-- Task 5. Create the New File System  
 
Create a file system using the newfs command. Note the use of the character device file. For example:
 
# newfs -F hfs /dev/vg02/rlvol1 
 
If you do not use the -F FStype option, by default, newfs creates a file system based on the content 
of your /etc/fstab file. If there is no entry for the file system in /etc/fstab, then the file system type 
is determined from the file /etc/default/fs. For information on additional options, see newfs(1M).

$ cat /etc/default/fs
LOCAL=vxfs


For HFS, you can explicitly specify that newfs create a file system that allows short file names or long file names 
by using either the -S or -L option. By default, these names will as short or long as those allowed 
by the root file system. Short file names are 14 characters maximum. Long file names allow up to 255 characters. 
Generally, you use long file names to gain flexibility in naming files. Also, files created on other systems 
that use long file names can be moved to your system without being renamed.

When creating a VxFS file system, file names will automatically be long.

After creating a filesystem, you need to mount it to make it accesible, for example like:


-- Task 6. mount the new local file system:

Choose an empty directory to serve as the mount point for the file system. Use the mkdir command to 
create the directory if it does not currently exist. For example, enter:
 
# mkdir /test 
 
Mount the file system using the mount command. Use the block device file name that contains the file system. 
You will need to enter this name as an argument to the mount command.

For example, enter
 
# mount /dev/vg01/lvol1 /test 


Note: 
The newfs command is a "friendly" front-end to the mkfs command (see mkfs(1M)). The newfs command 
calculates the appropriate parameters and then builds the file system by invoking the mkfs command.


35.3 HP-UX LVM commands:
========================

-- vgdisplay:
-- ----------

Displays information about volume groups.

Examples:

# vgdisplay
# vgdisplay -v vgdatadir


-- pvdisplay:
-- ----------

Display information about physical volumes within LVM volume group. 

EXAMPLES

Display the status and characteristics of a physical volume: 
# pvdisplay /dev/dsk/c1t0d0 

Display the status, characteristics, and allocation map of a physical volume: 
# pvdisplay -v /dev/dsk/c2t0d0 

# pvdisplay /dev/dsk/c102t9d3

--- Physical volumes ---
PV Name                     /dev/dsk/c43t9d3
PV Name                     /dev/dsk/c102t9d3   Alternate Link
VG Name                     /dev/vgora_e1atlas_data
PV Status                   available
Allocatable                 yes
VGDA                        2
Cur LV                      2
PE Size (Mbytes)            4
Total PE                    1668
Free PE                     102
Allocated PE                1566
Stale PE                    0
IO Timeout (Seconds)        default
Autoswitch                  On


-- lvdisplay:
-- ----------

Displays information about logical volumes.

Examples:

# lvdisplay lvora_p0gencfg_apps
# lvdisplay -v lvora_p0gencfg_apps
# lvdisplay -v /dev/vg00/lvol2

# lvdisplay /dev/vgora_e0etea_data/lvora_e0etea_data
--- Logical volumes ---
LV Name                     /dev/vgora_e0etea_data/lvora_e0etea_data
VG Name                     /dev/vgora_e0etea_data
LV Permission               read/write
LV Status                   available/syncd
Mirror copies               1
Consistency Recovery        MWC
Schedule                    parallel
LV Size (Mbytes)            17020
Current LE                  4255
Allocated PE                8510
Stripes                     0
Stripe Size (Kbytes)        0
Bad block                   on
Allocation                  strict
IO Timeout (Seconds)        default


-- vgchange:
-- ---------

Set volume group availability. This command activates or deactivates one or more volume groups as specified
by the -a option, namely y or n.

Activate a volume group:
# vgchange -a y /dev/vg03

Deactivate a volume group:
# vgchange -a n /dev/vg03


-- vgcreate:
-- ---------


/usr/sbin/vgcreate [-f] [-A autobackup] [-x extensibility] [-e max_pe] [-l max_lv] [-p max_pv] 
                   [-s pe_size] [-g pvg_name] vg_name pv_path ...

The vgcreate command creates a new volume group. vg_name is a symbolic name for the volume group and must be used 
in all references to it. vg_name is the path to a directory entry under /dev that must contain a character 
special file named group. Except for the group entry, the vg_name directory should be empty. 
The vg_name directory and the group file have to be created by the user (see lvm(7)).

vgcreate leaves the volume group in an active state.


EXAMPLES

1. Create a volume group named /dev/vg00 containing two physical volumes
with extent size set to 2 Mbytes.  If directory /dev/vg00 exists with
the character special file group, the volume group is created:

# vgcreate -s 2 /dev/vg00 /dev/dsk/c1d0s2 /dev/dskc2d0s2

2. Create a volume group named /dev/vg01 that can contain a maximum of
three logical volumes, with extent size set to 8 Mbytes:

# vgcreate -l 3 -s 8 /dev/vg01 /dev/dsk/c4d0s2

3. Create a volume group named /dev/vg00 and a physical volume group
named PVG0 with two physical volumes:

# vgcreate -g PVG0 /dev/vg00 /dev/dsk/c1d0s2 /dev/dsk/c2d0s2

3. Create a volume group named /dev/vg00 containing two physical volumes with extent size 
set to 2 MB, from scratch. 

First, create the directory /dev/vg00 with the character special file called group. 

mkdir /dev/vg00 
mknod /dev/vg00/group c 64 0x030000 

The minor number for the group file should be unique among all the volume groups on the system. 
It has the format 0xNN0000, where NN runs from 00 to ff. The maximum value of NN is controlled by the kernel 
tunable parameter maxvgs.

Initialize the disks using pvcreate(1M). 

pvcreate /dev/rdsk/c1t0d0 
pvcreate /dev/rdsk/c1t2d0 

Create the volume group. 

vgcreate -s 2 /dev/vg00 /dev/dsk/c1t0d0 /dev/dsk/c1t2d0 


Note About the "dsk" and "rdsk" notation:
-----------------------------------------

Physical volumes are identified by their device file names, for example

/dev/dsk/cntndn

/dev/rdsk/cntndn

Note that each disk has a block device file and a character or raw device file, the latter identified by the r. 
Which name you use depends on what task you are doing with the disk. In the notation above, the first name 
represents the block device file while the second is the raw device file.

-- Use a physical volume's raw device file for these two tasks only:

-> When creating a physical volume. Here, you use the device file for the disk. For example, 
this might be /dev/rdsk/c3t2d0 if the disk were at card instance 3, target address 2, and device number 0. 
(The absence of a section number beginning with s indicates you are referring to the entire disk.)

-> When restoring your volume group configuration.

For all other tasks, use the block device file. For example, when you add a physical volume to a volume group, 
you use the disk's block device file for the disk, such as /dev/dsk/c5t3d0.


-- vgextend:
-- ---------

Extends a volume group by adding physical volumes to it.

Examples:

Add physical volumes /dev/dsk/c1d0s2 and /dev/dsk/c2d0s2 to volume group /dev/vg03:
# vgextend /dev/vg03 /dev/dsk/c1d0s2 /dev/dsk/c2d0s2

# vgextend vg01 /dev/dsk/c0t4d0


-- pvcreate:
-- ---------

Creates physical volume for use in a volume group.

Examples:

# pvcreate -f /dev/rdsk/c1d0s2

# ioscan -fnC disk
# pvcreate -f /dev/rdsk/c0t1d0


-- lvcreate:
-- ---------

Create logical volume in LVM volume group 

The lvcreate command creates a new logical volume within the volume group specified by vg_name. 
Up to 255 logical volumes can be created in one volume group

SYNOPSIS
      /etc/lvcreate [-d schedule] {-l logical_extents_number | -L
      logical_volume_size} [-m mirror_copies] [-n lv_path] [-p permission]
      [-r relocate] [-s strict] [-C contiguous] [-M mirror_write_cache] [-c
      vol_group_name


Examples:

Create a logical volume in volume group /dev/vg02: 

# lvcreate /dev/vg02 

Create a logical volume in volume group /dev/vg03 with nonstrict allocation policy: 

# lvcreate -s n /dev/vg03 

Create a logical volume of size 100 MB in volume group /dev/vg03: 

# lvcreate -L 100 /dev/vg03 

Create a logical volume of size 90 MB striped across 3 disks with a stripe size of 64 KB: 

# lvcreate -L 90 -i 3 -I 64 /dev/vg03 


-- fstyp:
-- ------

Determines file system type.

SYNOPSIS
/usr/sbin/fstyp [-v] special

The fstyp command allows the user to determine the file system type of a mounted or unmounted file system. 
special represents a device special file (for example: /dev/dsk/c1t6d0).

The file system type is determined by reading the superblock of the supplied special file. If the superblock 
is read successfully, the command prints the file system type identifier on the standard output and exits 
with an exit status of 0. If the type of the file system cannot be identified, the error message 
unknown_fstyp (no matches) is printed and the exit status is 1. Exit status 2 is not currently returned, 
but is reserved for the situation where the file system matches more than one file system type. 
Any other error will cause exit status 3 to be returned.

The file system type is determined by reading the superblock of the supplied special file.

Examples:

Find the type of the file system on a disk, /dev/dsk/c1t6d0: 

# fstyp /dev/dsk/c1t6d0 

Find the type of the file system on a logical volume, /dev/vg00/lvol6: 

# fstyp /dev/vg00/lvol6 

Find the file system type for a particular device file and also information about its super block: 

# fstyp -v /dev/dsk/c1t6d0 


-- mkboot:
-- -------

mkboot is used to install or update boot programs on the specified device file.

The position on device at which boot programs are installed depends on the disk layout of the device. 
mkboot examines device to discover the current layout and uses this as the default. If the disk is uninitialized, 
the default is LVM layout on PA-RISC and Whole Disk on Itanium(R)-based systems. 
The default can be overridden by the -l, -H, or -W options.

Boot programs are stored in the boot area in Logical Interchange Format (LIF), which is similar to a file system. 
For a device to be bootable, the LIF volume on that device must contain at least the ISL 
(the initial system loader) and HPUX (the HP-UX bootstrap utility) LIF files. If, in addition, the device 
is an LVM physical volume, the LABEL file must be present (see lvlnboot(1M) ).

For the VERITAS Volume Manager (VxVM) layout on the Itanium-based system architecture, the only relevant 
LIF file is the LABEL file. All other LIF files are ignored. VxVM uses the LABEL file when the system boots 
to determine the location of the root, stand, swap, and dump volumes.

EXAMPLES

Install default boot programs on the specified disk, treating it as an LVM disk: 

# mkboot -l /dev/dsk/c0t5d0 

Use the existing layout, and install only SYSLIB and ODE files and preserve the EST file on the disk: 

# mkboot -i SYSLIB -i ODE -p EST /dev/rdsk/c0t5d0 

Install only the SYSLIB file and retain the ODE file on the disk. Use the Whole Disk layout. Use the file 
/tmp/bootlf to get the boot programs rather than the default. (The -i ODE option will be ignored): 

# mkboot -b /tmp/bootlf -i SYSLIB -i ODE -p ODE -W /dev/rdsk/c0t5d0 

Install EFI utilities to the EFI partition on an Itanium-based system, treating it as an LVM or VxVM disk: 

# mkboot -e -l /dev/dsk/c3t1d0 

Create AUTO file with the string autofile command on a device. If the device is on an Itanium-based system, 
the file is created as /EFI/HPUX/AUTO in the EFI partition. If the device is on a PA-RISC system, the file 
is created as a LIF file in the boot area. 

# mkboot -a "autofile command" /dev/dsk/c2t0d0 


-- bdf:
-- ----

Report number of free disk blocks.

bdf prints out the amount of free disk space available on the specified filesystem (/dev/dsk/c0d0s0, for example) 
or on the file system in which the specified file ($HOME, for example) is contained.
If no file system is specified, the free space on all of the normally mounted file systems is printed.  
The reported numbers are in kilobytes.
 
Examples:

# bdf

oranh300:/home/se1223>bdf | more
Filesystem          kbytes    used   avail %used Mounted on
/dev/vg00/lvol3     434176  165632  266504   38% /
/dev/vg00/lvol1     298928   52272  216760   19% /stand
/dev/vg00/lvol8    2097152 1584488  508928   76% /var
/dev/vg00/lvol11    524288    2440  490421    0% /var/tmp
/dev/vg00/lvucmd     81920    1208   75671    2% /var/opt/universal
/dev/vg00/lvol9    1048576  791925  240664   77% /var/adm
/dev/vg00/lvol10   2064384   47386 1890941    2% /var/adm/crash
/dev/vg00/lvol7    1548288 1262792  283320   82% /usr
/dev/vg00/vsaunixlv
                    311296  185096  118339   61% /usr/local/vsaunix
/dev/vg00/lvol4    1867776    5264 1849784    0% /tmp
/dev/vg00/lvol6    1187840  757456  427064   64% /opt
/dev/vg00/lvol5     262144   34784  225632   13% /home
/dev/vg00/lvbeheer  131072   79046   48833   62% /beheer
/dev/vg00/lvbeheertmp
                    655360   65296  553190   11% /beheer/tmp
/dev/vg00/lvbeheerlog
                    524288   99374  398407   20% /beheer/log
/dev/vg00/lvbeheerhistlog
..
..


# bdf /tmp
Filesystem          kbytes    used   avail %used Mounted on
/dev/vg00/lvol4    1867776    5264 1849784    0% /tmp


-- lvextend:
-- ---------

Increase number of physical extents allocated to a logical volume.

/etc/lvextend {-l logical_extents_number | -L logical_volume_size | -m
              mirror_copies} lv_path [physical_volume_path ...  |
              physical_vol_group_name...]

lvextend increases the number of mirrored copies or the size of the lv_path parameter.  
The change is determined according to which command options are specified.

WARNINGS
      The -m option cannot be used on HP-IB devices.

EXAMPLES
- Increase the number of the logical extents of a logical volume to one hundred:

# lvextend -l 100 /dev/vg01/lvol3

- Increase the logical volume size to 400 Mbytes:

# lvextend -L 400 /dev/vg01/lvol4

Allocate two mirrors (that is, three copies) for each logical extent of a logical volume:

# lvextend -m 2 /dev/vg01/lvol5


-- extendfs:
-- ---------

Extend file system size.

/etc/extendfs [-q] [-v] [-s size] special

If the original hfs filesystem image created on special does not make use of all of the available space, 
extendfs can be used to increase the capacity of an hfs filesystem by updating the filesystem structure
to include the extra space.
The command-line parameter special specifies the character device special file of either a logical volume 
or a disk partition. If special refers to a mounted filesystem, special must be un-mounted
before extendfs can be run (see mount(1M)).

The root filesystem cannot be extended using the extendfs command
because the root filesystem is always mounted, and extendfs only works
on unmounted filesystems.


EXAMPLES
To increase the capacity of a filesystem created on a logical volume, enter:

# umount /dev/vg00/lvol1

# lvextend -L larger_size /dev/vg00/lvol1

# extendfs /dev/vg00/rlvol1


-- fsadm:
-- ------

 
EXAMPLES
Convert a HFS file system from a nolargefiles file system to a largefiles file system: 

# fsadm -F hfs -o largefiles /dev/vg02/lvol1 

Display HFS relevant file system statistics: 

# fsadm -F hfs /dev/vg02/lvol1 


-- diskinfo:
-- ---------

diskinfo - describe characteristics of a disk device

SYNOPSIS
     /etc/diskinfo [-b|-v] character_devicefile

DESCRIPTION
      diskinfo determines whether the character special file named by
      character_devicefile is associated with a SCSI, CS/80, or Subset/80
      disk drive; if so, diskinfo summarizes the disk's characteristics.

Example:

# diskinfo /dev/rdsk/c31t1d3
SCSI describe of /dev/rdsk/c31t1d3:
             vendor: IBM
         product id: 2105800
               type: direct access
               size: 13671904 Kbytes
   bytes per sector: 512


35.4 Notes and further examples:
================================


Examples: More on how to create a filesystem on HP-UX:
------------------------------------------------------


Example 1: 
----------

Here we repeat the essentials of section 35.2:

Task 1. Estimate the Size Required for the Logical Volume  
Task 2. Determine If Sufficient Disk Space Is Available for the Logical Volume within Its Volume Group  
Task 3. Add a Disk to a Volume Group If Necessary 
 
Task 4. Create the Logical Volume  
 
Use lvcreate to create a logical volume of a certain size in the above volume group. See lvcreate(1M) for details.
Use lvcreate as in the following example:

Create a logical volume of size 100 MB in volume group /dev/vg03:

# lvcreate -L 100 /dev/vg03

-- Task 5. Create the New File System  
 
Create a file system using the newfs command. Note the use of the character device file. For example:
 
# newfs -F hfs /dev/vg02/rlvol1 
 
-- Task 6. mount the new local file system:

Choose an empty directory to serve as the mount point for the file system. Use the mkdir command to 
create the directory if it does not currently exist. For example, enter:
 
# mkdir /test 
 
Mount the file system using the mount command. Use the block device file name that contains the file system. 
You will need to enter this name as an argument to the mount command.

For example, enter
 
# mount /dev/vg01/lvol1 /test 


Example 2:
----------

This is an example of creating volume group vg01 & logical 
volume/partion data. 

Prepare for logical volume creation: 

root:/> mkdir /dev/vg01 
root:/> mknod /dev/vg01/group c 64 0x010000 
root:/> pvcreate -f /dev/rdsk/c0t5d0 
Physical volume "/dev/rdsk/c0t5d0" has been successfully created. 

root:/> vgcreate vg01 /dev/dsk/c0t5d0 
Volume group "/dev/vg01" has been successfully created. 
Volume Group configuration for /dev/vg01 has been saved in 
/etc/lvmconf/vg01.conf 

root:/> vgdisplay -v vg01 
root:/> lvcreate -L 100 -n data vg01 
Logical volume "/dev/vg01/data" has been successfully created with 
character device "/dev/vg01/rdata". 

Create HFS file system 

root:/> newfs -F hfs /dev/vg01/rdata 

Create Journal or Veritas file system 

root:/> newfs -F vxfs /dev/vg02/rdata 


Example 3:
----------

To create a VxFS file system 12288 sectors in size on VxVM volume, enter: 

# mkfs -F vxfs /dev/vx/rdsk/diskgroup/volume 12288

To use mkfs to create a VxFS file system on /dev/rdsk/c0t6d0: 

# mkfs -F vxfs /dev/rdsk/c0t6d0 1024 

To use mkfs to determine the command that was used to create the VxFS file system on /dev/rdsk/c0t6d0: 

# mkfs -F vxfs -m /dev/rdsk/c0t6d0 

To create a VxFS file system on /dev/vgqa/lvol1, with a Version 4 disk layout and largefiles capability: 

# mkfs -F vxfs -o version=4,largefiles /dev/vgqa/lvol1 


http://www.docs.hp.com/en/B2355-90672/index.html


Example 4:
----------

Example: Creating a Logical Volume Using HP-UX Commands

To create a logical volume:

Select one or more disks. ioscan(1M) shows the disks attached to the system and their device file names.
Initialize each disk as an LVM disk by using the pvcreate command. For example, enter
 
# pvcreate /dev/rdsk/c0t0d0 
 
Note that using pvcreate will result in the loss of any existing data currently on the physical volume.
You use the character device file for the disk.
Once a disk is initialized, it is called a physical volume.

- Pool the physical volumes into a volume group. To complete this step:

Create a directory for the volume group. For example:
 
# mkdir /dev/vgnn 
 
Create a device file named group in the above directory with the mknod command.
 
# mknod /dev/vgnn/group c 64 0xNN0000 
 
The c following the device file name specifies that group is a character device file.
The 64 is the major number for the group device file; it will always be 64.
The 0xNN0000 is the minor number for the group file in hexadecimal. Note that each particular NN must be a 
unique number across all volume groups.

For more information on mknod, see mknod(1M); for more information on major numbers and minor numbers, 
see Configuring HP-UX for Peripherals.

Create the volume group specifying each physical volume to be included using vgcreate. For example:
 
# vgcreate /dev/vgnn /dev/dsk/c0t0d0 
 
Use the block device file to include each disk in your volume group. You can assign all the physical volumes 
to the volume group with one command. No physical volume can already be part of an existing volume group.

Once you have created a volume group, you can now create a logical volume using lvcreate. For example:

# lvcreate /dev/vgnn 
 
Using the above command creates the logical volume /dev/vgnn/lvoln with LVM automatically assigning 
the n in lvoln.

When LVM creates the logical volume, it creates the block and character device files and places them in the directory 
/dev/vgnn.


VxFS can, theoretically, support files up to two terabytes in size because file system structures 
are no longer in fixed locations (see Chapter 2 "Disk Layout"). The maximum size tested and supported 
on HP-UX 11.x systems is one terabyte. Large files are files larger than two gigabytes in size.

 NOTE: Be careful when enabling large file capability. Applications and utilities such as backup may experience 
 problems if they are not aware of large files. 
 
 
Creating a File System with Large Files 

You can create a file system with large file capability by entering the following command:

# mkfs -F vxfs -o largefiles special_device size 
 
Specifying largefiles sets the largefiles flag, which allows the file system to hold files 
up to one terabyte in size. Conversely, the default nolargefiles option clears the flag and limits 
files being created to a size of two gigabytes or less:

# mkfs -F vxfs -o nolargefiles special_device size 


Notes:
------

Note 1: Create a System Mirror Disk:
------------------------------------

This note describes how to configure LVM mirroring of a system disk. In this example the HP server is STSRV1,
the primary boot device is SCSI=6 (/dev/dsk/c2t6d0) and the alternative mirrored bootdevice is 
SCSI=5 (/dev/dsk/c2t5d0). The following commands will do the trick:

# ioscan -fnC disk
# pvcreate -Bf /dev/rdsk/c2t5d0
# mkboot -l /dev/rdsk/c2t5d0
# mkboot -a "hpux -lq (;0)/stand/vmunix" /dev/rdsk/c2t5d0
# vgextend /dev/vg00 /dev/dsk/c2t5d0

# for P in 1 2 3 4 5 6 7 8 9 10
> do
> lvextend -m 1 /dev/vg00/lvol$P /dev/dsk/c2t5d0
> sleep 1
> done


Note 2: Create a System Mirror Disk:
------------------------------------

# ioscan -fnC disk 
Class I H/W Path Driver S/W State H/W Type Description 
===================================================================== 
disk 0 0/0/1/1.2.0 sdisk CLAIMED DEVICE HP 73.4GMAN3735MC 
                         /dev/dsk/c1t2d0 /dev/rdsk/c1t2d0 
disk 1 0/0/2/0.2.0 sdisk CLAIMED DEVICE HP 73.4GATLAS10K3_73_SCA 
                         /dev/dsk/c2t2d0 /dev/rdsk/c2t2d0 
  
Note: c1t2d0 is the boot disk and c2t2d0 is the mirrored disk. 
       
1) Initialize the disk and make it bootable 
        pvcreate -B /dev/rdsk/c2t2d0 
            Note: the -B parameter tells pvcreate that this will be a bootable disk. 
       
2) Add the physical volume to the volume group 
            vgextend /dev/vg00 /dev/dsk/c2t2d0 
       
3) Use mkboot to place the boot utilities in the boot area and add the AUTO file. 
            mkboot /dev/dsk/c2t2d0 
            mkboot -a "hpux -lq" /dev/rdsk/c2t2d0 
       
4) Use mkboot to update the AUTO file on the primary boot disk. 
            mkboot -a "hpux -lq" /dev/rdsk/c1t2d0 
       
5) Mirror the stand, root and swap logical volumes 
            lvextend -m 1 /dev/vg00/lvol1 
            lvextend -m 1 /dev/vg00/lvol2 
            lvextend -m 1 /dev/vg00/lvol3 


Note: LVM will resynchronize the new mirror copies. 


Repeat the lvextend for all other logical volumes on the boot mirror. 
            lvextend -m 1 /dev/vg00/lvol4 
            lvextend -m 1 /dev/vg00/lvol5 
            lvextend -m 1 /dev/vg00/lvol6 
            lvextend -m 1 /dev/vg00/lvol7 
            lvextend -m 1 /dev/vg00/lvol8 


6) Modify your alternate boot path to point to the mirror copy of the boot disk. 
Note: Use the Hardware path for your new boot disk. 
            setboot -a 0/0/2/0.2.0 


Note 3: Increase a filesystem in HP-UX:
---------------------------------------

Example 1:
----------

In this example, you would need to increase the file system size of /var by 10 MB, which actually needs 
to be rounded up to 12 MB.

Increase /var
Follow these steps to increase the size limit of /var.

- Determine if any space is available for the /dev/vg00:

# /sbin/vgdisplay /dev/vg00 

 
The Free PE indicates the number of 4 MB extents available, in this case 79 (equivalent to 316 MB).

- Change to single user state:

/sbin/shutdown

This allows /var to be unmounted.

- View mounted volumes:

# /sbin/mount

You see a display similar to the following:

/ on /dev/vg00/lvol1 defaults on Sat Mar 8 23:19:19 1997
/var on /dev/vg00/lvol7 defaults on Sat Mar 8 23:19:28 1997 


# Determine which logical volume maps to /var. In this example, it is /dev/vg00/lvol7

- Unmount /var:

# /sbin/umount /var

This is required for the next step, because extendfs can only work on unmounted volumes. If you get a 
"device busy" error at this point, reboot the system and log on in single-user mode before continuing.

- Extend the size of the logical volume:

# /sbin/lvextend -L new_size_in_MB /dev/vg00/lvol7

For example, to make this volume 332 MB:

# /sbin/lvextend -L 332 /dev/vg00/lvol7

To extend the file system size to the logical volume size:

# /sbin/extendfs /dev/vg00/rlvol7

Mount /var:

# /sbin/mount /var

Go back to the regular init state: init 3 or init 4, or reboot.


Example 2:
----------

To increase the capacity of a file system created on a logical volume, enter:

# umount /dev/vg00/lvol1
# lvextend -L larger_size /dev/vg00/lvol1
# extendfs -F hfs /dev/vg00/rlvol1          -- For operation like mkfs or extendfs, you should use raw device interface. 
# mount /dev/vg00/lvol1 mount_directory


Example 3:
----------

> 
> Date: 12/14/99 
> Document description: Extending /var, /usr, /tmp without Online JFS 
> Document id: KBRC00000204 
> 
> 
> You may provide feedback on this document 
> 
> 
> Extending /var, /usr, /tmp without Online JFS DocId: KBRC00000204 Updated: 
> 12/14/99 1:14:29 PM 
> 
> PROBLEM 
> Since /var, /usr, /tmp (and sometimes /opt) are always in use by the 
> operating system, they cannot be unmounted with the umount command. In order 
> to extend these filesystems, the system must be in single user mode. 
> 
> RESOLUTION 
> This example will show how to extend /usr to 400MB without Online JFS 
> 
> 
> 1.. Backup the filesystem before extending 
> 
> 
> 2.. Display disk information on the logical volume 
> 
> lvdisplay -v /dev/vg00/lvol4 | more 
> 
> 
> a.. Make sure this is enough Free PE's to increase this filesystem. 
> b.. Make sure that allocation is NOT strict/contiguous. 
> 
> 
> 3.. Reboot the machine 
> 
> shutdown -r now 
> 
> 
> 4.. When prompted, press "ESC" to interrupt the boot. 
> 
> 
> 5.. Boot from the primary device and invoke ISL interaction. 
> 
> bo pri isl 
> 
> NOTE: If prompted to interact with ISL, respond "y" 
> 
> 
> 6.. Boot into single user mode 
> 
> hpux -is 
> 
> NOTE:Nothing will be mounted. 
> 
> 
> 7.. Extend the logical volume that holds the filesystem. 
> 
> /sbin/lvextend -L 400 /dev/vg00/lvol4 
> 
> 
> 8.. Extend the file system. 
> 
> /sbin/extendfs -F hfs /dev/vg00/rlvol4 
> 
> NOTE: The use of the character device. 
> 
> 
> 9.. Ensure the filesystem now reports to be the new size 
> 
> bdf 
> 
> 
> 10.. Reboot the system to its normal running state. 
> 
> shutdown -r now 
> 
> 
> 
The only thing is that you have to have contiguous lvols to do that. The 
best way is to do an Ignite make_tape_recovery -i for vg00 and then 
resize it when you recreate it. If you have vg00 on a seperate disk then 
it is real easy, the backup can run in the background, and the restore 
interactive will take about 2.5 hours for a 9GB root disk, you can make 
the lvols any size you want and it also puts it back in place in order 
so you save space. 


Example 4:
----------

The right way to extend a file system with "OnLine jfs" is using the command "fsadm".
For example, if you want to extend the fs /mk2/toto in the
/dev/vgmk2/lvtoto in from 50Mbytes to 60 you must extend de logical volume

# lvextend -L 60 /dev/vgmk2/lvtoto

Now use fsadm ( I supose you have vxfs, if you are using hfs is not
possible to increase on-line, or at least I don't know how ).

# fsadm -F vxfs -b 61440 /mk2/toto

You will have your fs increased on line ... be carefull if your fs is 100% occupied the comand fsadm will fail, you
need some free space on the file system ( it depends on the fs type, size etc ..).

In general, Online jfs should be increased in the following way:

lvextend -L ???? /dev/vg??/lvol??

fsadm -F vxfs -b ????? /<filesystem name>

oranh300:/home/se1223>cat /etc/inittab | grep enab
vxen::bootwait:/sbin/fs/vxfs/vxenablef -a


Note 4:
-------

Extend OnlineJFS licenses on next D&ST servers:
aavnh400
oranh503
oranh603
orazh500
orazh601
orazh602

commands are:
swagentd -r
swinstall -x mount_all_filesystems=false -x enforce_dependencies=true -s hpdepot.ao.nl.abnamro.com:/beheer/depot/OnlineJFS_License OnlineJFS
swagentd -k


HP-UX errors: Error 23 filetable overflow:
------------------------------------------

Error: 23 is a infamous error, as shown in this thread:

thread:

Doc ID: Note:1018306.102 
Problem Description:
====================
You are backing up your database and are getting the following errors:

HP-UX Error 23: file table overflow

RMAN-569 file not found
LEM-00031 file not found
LEM-00033 lempgfm couldn't open message file
RMAN indicates that Recovery Manager is complete, however the database
and the catalog are not resync'd.
Problem Explanation:
====================
Recovery Manager cannot find or open the message file.
Search Words:
=============
Recovery Manager, LEM-33, LEM-31, RMAN-00569, message file, lempgfm,
error 23, HPUX error 23, HP-UX error 23
Solution Description:
=====================
You may need to increase the value of the unix kernel parameter 'nfile'.
Solution Explanation:
=====================
'nfile' needs to have a value in the thousands for a database server. 
If this parameter is < 1000, increase it to something like 5000 or 
greater. If there is enough memory on your system, this parameter can
be set to values > 30000.


35.5 Some important filesystem related kernel params:
=====================================================


nfile:
------

nfile defines the maximum number of files that can be open simultaneously, system-wide, at any given time.

Acceptable Values:
Minimum 
14 
Maximum 
Memory limited 
Default 
((16*(Nproc+16+MaxUsers)/10)+32+2*(Npty+Nstrpty) 

Specify integer value or use integer formula expression. For more information, see Specifying Parameter Values.

Description
nfile defines the maximum number files that can be open at any one time, system-wide.
It is the number of slots in the file descriptor table. Be generous with this number because the required memory 
is minimal, and not having enough slots restricts system processing capacity.

Related Parameters and System Factors
The value used for nfile must be sufficient to service the number of users and processes allowed by the combination 
of nproc, maxusers, npty , and nstrpty.

Every process uses at least three file descriptors per process (standard input, standard output, 
and standard error).

Every process has two pipes per process (one per side), each of which requires a pty. Stream pipes also use s
treams ptys which are limited by nstrpty.


35.6 HP-UX kernel parameters:
=============================

Take especially notice of the parameters nfile, nflocks, ninodes, nprocs.
They determine how many open files, open locks, simultaneous processes are possible *system-wide*.
Too low values may result in HP-UX errors when dealing with larger databases, huge App Servers
and the like.

Entering Values: 
 
Use the kcweb web interface or the kmtune command to view and change values. kcweb is described 
in the kcweb(1M) manpage and in the program's help topics. You can run kcweb from the command line 
or from the System Administration Manager (SAM); see sam(1M). You run kmtune from the command line; 
see kmtune(1M) for details.


Accounting
 acctresume Resume accounting when free space on the file system where accounting log files reside rises above acctresume plus minfree percent of total usable file system size. Manpage: acctsuspend(5).
 
Accounting
 acctsuspend
 Suspend accounting when free space on the file system where accounting log files reside drops below acctsuspend plus minfree percent of total usable file system size. Manpage: acctsuspend(5).
 
Asynchronous I/O
 aio_listio_max
 Maximum number of POSIX asynchronous I/O operations allowed in a single lio_listio() call. Manpage: aio_listio_max(5).
 
Asynchronous I/O
 aio_max_ops
 System-wide maximum number of POSIX asynchronous I/O operations allowed at one time. Manpage: aio_max_ops(5).
 
Asynchronous I/O
 aio_physmem_pct
 Maximum percentage of total system memory that can be locked for use in POSIX asynchronous I/O operations. Manpage: aio_physmem_pct(5).
 
Asynchronous I/O
 aio_prio_delta_max
 Maximum priority offset (slowdown factor) allowed in a POSIX asynchronous I/O control block (aiocb). Manpage: aio_prio_delta_max(5).
 
Memory Paging
 allocate_fs_swapmap
 Enable or disable preallocation of file system swap space when swapon() is called as opposed to allocating swap space when malloc() is called. Enabling allocation reduces risk of insufficient swap space and is used primarily where high availability is important. Manpage: allocate_fs_swapmap(5).
 
Kernel Crash Dump
 alwaysdump
 Select which classes of system memory pages are to be dumped if a kernel panic occurs. Manpage: alwaysdump(5).
 
Spinlock Pool
 bufcache_hash_locks
 Buffer-cache spinlock pool. NO MANPAGE. 
 
File System: Buffer
 bufpages
 Number of 4 KB pages in file system static buffer cache. Manpage: bufpages(5).
 
Spinlock Pool
 chanq_hash_locks
 Channel queue spinlock pool. Manpage: chanq_hash_locks(5).
 
IPC: Share
 core_addshmem_read
 Flag to include readable shared memory in a process core dump. Manpage: core_addshmem_read(5).
 
IPC: Share
 core_addshmem_write
 Flag to include read/write shared memory in a process core dump. Manpage: core_addshmem_write(5).
 
Miscellaneous: Links
 create_fastlinks
 Create fast symbolic links using a newer, more efficient format to improve access speed by reducing disk block accesses during path name look-up sequences. Manpage: create_fastlinks(5).
 
File System: Buffer
 dbc_max_pct
 Maximum percentage of memory for dynamic buffer cache. Manpage: dbc_max_pct(5).
 
File System: Buffer
 dbc_min_pct
 Minimum percentage of memory for dynamic buffer cache. Manpage: dbc_min_pct(5).
 
Miscellaneous: Disk I/O
 default_disk_ir
 Immediate reporting for disk writes; whether a write() returns immediately after the data is placed in the disk's write buffer or waits until the data is physically stored on the disk media. Manpage: default_disk_ir(5).
 
File System: Buffer
 disksort_seconds
 Maximum wait time for disk requests. NO MANPAGE.
 
Miscellaneous: Disk I/O
 dma32_pool_size
 Amount of memory to set aside for 32-bit DMA (bytes). Manpage: dma32_pool_size(5).
 
Spinlock Pool
 dnlc_hash_locks
 Number of locks for directory cache synchronization. NO MANPAGE.
 
Kernel Crash Dump
 dontdump
 Select which classes of system memory pages are not to be dumped if a kernel panic occurs. Manpage: dontdump(5).
 
Miscellaneous: Clock
 dst
 Enable/disable daylight savings time. Manpage: timezone(5).
 
Miscellaneous: IDS
 enable_idds
 Flag to enable the IDDS daemon, which gathers data for IDS/9000. Manpage: enable_idds(5).
 
Miscellaneous: Memory
 eqmemsize
 Number of pages of memory to be reserved for equivalently mapped memory, used mostly for DMA transfers. Manpage: eqmemsize(5).
 
ProcessMgmt: Process
 executable_stack
 Allows or denies program execution on the stack. Manpage: executable_stack(5).
 
File System: Write
 fs_async
 Enable/disable asynchronous writes of file system data structures to disk. Manpage: fs_async(5).
 
Spinlock Pool
 ftable_hash_locks
 File table spinlock pool. NO MANPAGE. 
 
Spinlock Pool
 hdlpreg_hash_locks
 Set the size of the pregion spinlock pool. Manpage: hdlpreg_hash_locks(5).
 
File System: Read
 hfs_max_ra_blocks
 The maximum number of read-ahead blocks that the kernel may have outstanding for a single HFS file system. Manpage: hfs_max_ra_blocks(5).
 
File System: Read
 hfs_max_revra_blocks
 The maximum number of reverse read-ahead blocks that the kernel may have outstanding for a single HFS file system. Manpage: hfs_max_revra_blocks(5).
 
File System: Read
 hfs_ra_per_disk
 The amount of HFS file system read-ahead per disk drive, in KB. Manpage: hfs_ra_per_disk(5).
 
File System: Read
 hfs_revra_per_disk
 The amount of memory (in KB) for HFS reverse read-ahead operations, per disk drive. Manpage: hfs_revra_per_disk(5).
 
File System: Read
 hp_hfs_mtra_enabled
 Enable or disable HFS multithreaded read-ahead. NO MANPAGE.
 
Kernel Crash Dump
 initmodmax
 Maximum size of the dump table of dynamically loaded kernel modules. Manpage: initmodmax(5).
 
Spinlock Pool
 io_ports_hash_locks I/O port spinlock pool. NO MANPAGE.  
Miscellaneous: Queue
 ksi_alloc_max
 Maximum number of system-wide queued signals that can be allocated. Manpage: ksi_alloc_max(5).
 
Miscellaneous: Queue
 ksi_send_max
 Maximum number of queued signals that a process can send and have pending at one or more receivers. Manpage: ksi_send_max(5).
 
ProcessMgmt: Memory
 maxdsiz
 Maximum process data storage segment space that can be used for statics and strings, as well as dynamic data space allocated by sbrk() and malloc() (32-bit processes). Manpage: maxdsiz(5).
 
ProcessMgmt: Memory
 maxdsiz_64bit
 Maximum process data storage segment space that can be used for statics and strings, as well as dynamic data space allocated by sbrk() and malloc() (64-bit processes). Manpage: maxdsiz(5).
 
File System: Open/Lock
 maxfiles
 Soft limit on how many files a single process can have opened or locked at any given time. Manpage: maxfiles(5).
 
File System: Open/Lock
 maxfiles_lim
 Hard limit on how many files a single process can have opened or locked at any given time. Manpage: maxfiles_lim(5).
 
ProcessMgmt: Memory
 maxrsessiz
 Maximum size (in bytes) of the RSE stack for any user process on the IPF platform. Manpage: maxrsessiz(5).
 
ProcessMgmt: Memory
 maxrsessiz_64bit
 Maximum size (in bytes) of the RSE stack for any user process on the IPF platform. Manpage: maxrsessiz(5).
 
ProcessMgmt: Memory
 maxssiz
 Maximum dynamic storage segment (DSS) space used for stack space (32-bit processes). Manpage: maxssiz(5).
 
ProcessMgmt: Memory
 maxssiz_64bit
 Maximum dynamic storage segment (DSS) space used for stack space (64-bit processes). Manpage: maxssiz(5).
 
ProcessMgmt: Memory
 maxtsiz
 Maximum allowable process text segment size, used by unchanging executable-code (32-bit processes). Manpage: maxtsiz(5).
 
ProcessMgmt: Memory
 maxtsiz_64bit
 Maximum allowable process text segment size, used by unchanging executable-code (64-bit processes). Manpage: maxtsiz(5).
 
ProcessMgmt: Process
 maxuprc
 Maximum number of processes that any single user can have running at the same time, including login shells, user interface processes, running programs and child processes, I/O processes, etc. If a user is using multiple, simultaneous logins under the same login name (user ID) as is common in X Window, CDE, or Motif environments, all processes are combined, even though they may belong to separate process groups. Processes that detach from their parent process group, where that is possible, are not counted after they detach (line printer spooler jobs, certain specialized applications, etc.). Manpage: maxuprc(5).
 
Miscellaneous: Users
 maxusers
 Maximum number of users expected to be logged in on the system at one time; used by other system parameters to allocate system resources. Manpage: maxusers(5).
 
File System: LVM
 maxvgs
 Maximum number of volume groups configured by the Logical Volume Manager on the system. Manpage: maxvgs(5).
 
Accounting
 max_acct_file_size
 Maximum size of the accounting file. Manpage: max_acct_file_size(5).
 
Asynchronous I/O
 max_async_ports
 System-wide maximum number of ports to the asynchronous disk I/O driver that processes can have open at any given time. Manpage: max_async_ports(5).
 
Memory Paging
 max_mem_window
 Maximum number of group-private 32-bit shared memory windows. Manpage: max_mem_window(5).
 
ProcessMgmt: Threads
 max_thread_proc
 Maximum number of threads that any single process can create and have running at the same time. Manpage: max_thread_proc(5).
 
IPC: Message
 mesg
 Enable or disable IPC messages at system boot time. Manpage: mesg(5).
 
Kernel Crash Dump
 modstrmax
 Maximum size, in bytes, of the savecrash kernel module table that contains module names and their locations in the file system. Manpage: modstrmax(5).
 
IPC: Message
 msgmap
 Size of free-space resource map for allocating shared memory space for messages. Manpage: msgmap(5).
 
IPC: Message
 msgmax
 System-wide maximum size (in bytes) for individual messages. Manpage: msgmax(5).
 
IPC: Message
 msgmnb
 Maximum combined size (in bytes) of all messages that can be queued simultaneously in a message queue. Manpage: msgmnb(5).
 
IPC: Message
 msgmni
 Maximum number of message queues allowed on the system at any given time. Manpage: msgmni(5).
 
IPC: Message
 msgseg
 Maximum number of message segments that can exist on the system. Manpage: msgseg(5).
 
IPC: Message
 msgssz
 Message segment size in bytes. Manpage: msgssz(5).
 
IPC: Message
 msgtql
 Maximum number of messages that can exist on the system at any given time. Manpage: msgtql(5).
 
File System: Buffer
 nbuf
 System-wide number of static file system buffer and cache buffer headers. Manpage: nbuf(5).
 
Miscellaneous: CD
 ncdnode
 Maximum number of entries in the vnode table and therefore the maximum number of open CD-ROM file system nodes that can be in memory. Manpage: ncdnode(5).
 
Miscellaneous: Terminal
 nclist
 Maximum number of cblocks available for data transfers through tty and pty devices. Manpage: nclist(5).
 
File System: Open/Lock
 ncsize
 Inode space needed for directory name lookup cache (DNLC). NO MANPAGE.
 
File System: Open/Lock
 nfile
 Maximum number of files that can be open simultaneously on the system at any given time. Manpage: nfile(5).
 
File System: Open/Lock
 nflocks
 Maximum combined number of file locks that are available system-wide to all processes at one time. Manpage: nflocks(5).
 
File System: Open/Lock
 ninode
 Maximum number of open inodes that can be in memory. Manpage: ninode(5).
 
ProcessMgmt: Threads
 nkthread
 Maximum number of kernel threads allowed on the system at the same time. Manpage: nkthread(5).
 
ProcessMgmt: Process
 nproc
 Defines the maximum number of processes that can be running simultaneously on the entire system, including remote execution processes initiated by other systems via remsh or other networking commands. Manpage: nproc(5).
 
Miscellaneous: Terminal
 npty
 Maximum number of pseudo-tty entries allowed on the system at any one time. Manpage: npty(5).
 
Streams
 NSTREVENT
 Maximum number of outstanding streams bufcalls that are allowed to exist at any given time on the system. This number should be equal to or greater than the maximum bufcalls that can be generated by the combined total modules pushed onto any given stream, and serves to limit run-away bufcalls. Manpage: nstrevent(5).
 
Miscellaneous: Terminal
 nstrpty
 System-wide maximum number of streams-based pseudo-ttys that are allowed on the system. Manpage: nstrpty(5).
 
Streams
 nstrpty
 System-wide maximum number of streams-based pseudo-ttys that are allowed on the system. Manpage: nstrpty(5).
 
Streams
 NSTRPUSH
 Maximum number of streams modules that are allowed to exist in any single stream at any one time on the system. This provides a mechanism for preventing a software defect from attempting to push too many modules onto a stream, but it is not intended as adequate protection against malicious use of streams. Manpage: nstrpush(5).
 
Streams
 NSTRSCHED
 Maximum number of streams scheduler daemons that are allowed to run at any given time on the system. This value is related to the number of processors installed in the system. Manpage: nstrsched(5).
 
Miscellaneous: Terminal
 nstrtel
 Number of telnet session device files that are available on the system. Manpage: nstrtel(5).
 
Memory Paging
 nswapdev
 Maximum number of devices, system-wide, that can be used for device swap. Set to match actual system configuration. Manpage: nswapdev(5).
 
Memory Paging
 nswapfs
 Maximum number of mounted file systems, system-wide, that can be used for file system swap. Set to match actual system configuration. Manpage: nswapfs(5).
 
Miscellaneous: Memory
 nsysmap
 Number of entries in the kernel dynamic memory virtual address space resource map (32-bit processes). Manpage: nsysmap(5).
 
Miscellaneous: Memory
 nsysmap64
 Number of entries in the kernel dynamic memory virtual address space resource map (64-bit processes). Manpage: nsysmap(5).
 
Miscellaneous: Disk I/O
 o_sync_is_o_dsync
 Specifies whether an open() or fcntl() with the O_SYNC flag set can be converted to the same call with the O_DSYNC flag instead. This controls whether the function can return before updating the file access. NO MANPAGE.
 
ProcessMgmt: Memory
 pa_maxssiz_32bit
 Maximum size (in bytes) of the stack for a user process running under the PA-RISC emulator on IPF. Manpage: pa_maxssiz(5).
 
ProcessMgmt: Memory
 pa_maxssiz_64bit
 Maximum size (in bytes) of the stack for a user process running under the PA-RISC emulator on IPF. Manpage: pa_maxssiz(5).
 
Spinlock Pool
 pfdat_hash_locks
 Pfdat spinlock pool. Manpage: pfdat_hash_locks(5).
 
Miscellaneous: Disk I/O
 physical_io_buffers
 Total buffers for physical I/O operations. Manpage: physical_io_buffers(5).
 
Spinlock Pool
 region_hash_locks
 Process-region spinlock pool. Manpage: region_hash_locks(5).
 
Memory Paging
 remote_nfs_swap
 Enable or disable swap to mounted remote NFS file system. Used on cluster clients for swapping to NFS-mounted server file systems. Manpage: remote_nfs_swap(5).
 
Miscellaneous: Schedule
 rtsched_numpri
 Number of distinct real-time interrupt scheduling priority levels are available on the system. Manpage: rtsched_numpri(5).
 
Miscellaneous: Terminal
 scroll_lines
 Defines the number of lines that can be scrolled on the internal terminal emulator (ITE) system console. Manpage: scroll_lines(5).
 
File System: SCSI
 scsi_maxphys
 Maximum record size for the SCSI I/O subsystem, in bytes. Manpage: scsi_maxphys(5).
 
File System: SCSI
 scsi_max_qdepth
 Maximum number of SCSI commands queued up for SCSI devices. Manpage: scsi_max_qdepth(5).
 
ProcessMgmt: Process
 secure_sid_scripts
 Controls whether setuid and setgid bits on scripts are honored. Manpage: secure_sid_scripts(5).
 
IPC: Semaphore
 sema
 Enable or disable IPC semaphores at system boot time. Manpage: sema(5).
 
IPC: Semaphore
 semaem
 Maximum value by which a semaphore can be changed in a semaphore "undo" operation. Manpage: semaem(5).
 
IPC: Semaphore
 semmni
 Maximum number of sets of IPC semaphores allowed on the system at any one time. Manpage: semmni(5).
 
IPC: Semaphore
 semmns
 Maximum number of individual IPC semaphores available to system users, system-wide. Manpage: semmns(5).
 
IPC: Semaphore
 semmnu
 Maximum number of processes that can have undo operations pending on any given IPC semaphore on the system. Manpage: semmnu(5).
 
IPC: Semaphore
 semmsl
 Maximum number of individual System V IPC semaphores per semaphore identifier. Manpage: semmsl(5).
 
IPC: Semaphore
 semume
 Maximum number of IPC semaphores that a given process can have undo operations pending on. Manpage: semume(5).
 
IPC: Semaphore
 semvmx
 Maximum value any given IPC semaphore is allowed to reach (prevents undetected overflow conditions). Manpage: semvmx(5).
 
Miscellaneous: Web
 sendfile_max
 The amount of buffer cache that can be used by the sendfile() system call on HP-UX web servers. Manpage: sendfile_max(5).
 
IPC: Share
 shmem
 Enable or disable shared memory at system boot time. Manpage: shmem(5).
 
IPC: Share
 shmmax
 Maximum allowable shared memory segment size (in bytes). Manpage: shmmax(5).
 
IPC: Share
 shmmni
 Maximum number of shared memory segments allowed on the system at any given time. Manpage: shmmni(5).
 
IPC: Share
 shmseg
 Maximum number of shared memory segments that can be attached simultaneously to any given process. Manpage: shmseg(5).
 
Streams
 STRCTLSZ
 Maximum number of control bytes allowed in the control portion of any streams message on the system. Manpage: strctlsz(5).
 
Streams
 streampipes
 Force all pipes to be streams-based. Manpage: streampipes(5).
 
Streams
 STRMSGSZ
 Maximum number of bytes that can be placed in the data portion of any streams message on the system. Manpage: strmsgsz(5).
 
File System: SCSI
 st_ats_enabled
 Flag whether to reserve a tape device on open. Manpage: st_ats_enabled(5).
 
File System: SCSI
 st_fail_overruns
 SCSI tape read resulting in data overrun causes failure. Manpage: st_fail_overruns(5).
 
File System: SCSI
 st_large_recs
 Enable large record support for SCSI tape. Manpage: st_large_recs(5).
 
Memory Paging
 swapmem_on
 Enable or disable pseudo-swap allocation. This allows systems with large installed memory to allocate memory space as well as disk swap space for virtual memory use instead of restricting availability to defined disk swap area. Manpage: swapmem_on(5).
 
Memory Paging
 swchunk
 Amount of space allocated for each chunk of swap area. Chunks are allocated from device to device by the kernel. Changing this parameter requires extensive knowledge of system internals. Without such knowledge, do not change this parameter from the normal default value. Manpage: swchunk(5).
 
Spinlock Pool
 sysv_hash_locks
 System V interprocess communication spinlock pool. Manpage: sysv_hash_locks(5).
 
Miscellaneous: Network
 tcphashsz
 TCP hash table size, in bytes. Manpage: tcphashsz(5).
 
ProcessMgmt: CPU
 timeslice
 Maximum time a process can use the CPU until it is made available to the next process having the same process execution priority. This feature also prevents runaway processes from causing system lock-up. Manpage: timeslice(5).
 
Miscellaneous: Clock
 timezone
 The offset between the local time zone and Coordinated Universal Time (UTC), often called Greenwich Mean Time or GMT. Manpage: timezone(5).
 
Miscellaneous: Memory
 unlockable_mem
 Amount of system memory to be reserved for system overhead and virtual memory management, that cannot be locked by user processes. Manpage: unlockable_mem(5).
 
Spinlock Pool
 vnode_cd_hash_locks
 Vnode clean/dirty spinlock pool. NO MANPAGE. 
 
Spinlock Pool
 vnode_hash_locks
 Vnode spinlock pool. NO MANPAGE. 
 
Memory Paging: Size
 vps_ceiling
 Maximum system-selected page size (in KB) if the user does not specify a page size. Manpage: vps_ceiling(5).
 
Memory Paging: Size
 vps_chatr_ceiling
 Maximum page size a user can specify with the chatr command in a program. Manpage: vps_chatr_ceiling(5).
 
Memory Paging: Size
 vps_pagesize
 Minimum user page size (in KB) if no page size is specified using chatr. Manpage: vps_pagesize(5).
 
File System: Journaled
 vxfs_max_ra_kbytes
 Maximum amount of read-ahead data, in KB, that the kernel may have outstanding for a single VxFS file system. Manpage: vxfs_max_ra_kbytes(5).
 
File System: Read
 vxfs_max_ra_kbytes
 Maximum amount of read-ahead data, in KB, that the kernel may have outstanding for a single VxFS file system. Manpage: vxfs_max_ra_kbytes(5).
 
File System: Journaled
 vxfs_ra_per_disk
 Maximum amount of VxFS file system read-ahead per disk, in KB. Manpage: vxfs_ra_per_disk(5).
 
File System: Read
 vxfs_ra_per_disk
 Maximum amount of VxFS file system read-ahead per disk, in KB. Manpage: vxfs_ra_per_disk(5).
 
File System: Journaled
 vx_fancyra_enable
 Enable or disable VxFS file system read-ahead. NO MANPAGE.
 
File System: Journaled
 vx_maxlink
 Number of subdirectories created within a directory. NO MANPAGE.
 
File System: Journaled
 vx_ncsize
 Memory space reserved for VxFS directory path name cache. Manpage: vx_ncsize(5).
 
File System: Journaled
 vx_ninode
 Number of entries in the VxFS inode table. NO MANPAGE
 

36. Some remarks about VI:
==========================

Before you run vi:
------------------

If you've connected to a central UCS computer to use vi, first tell that host about your communications software 
(e.g., NCSA Telnet). At IUB, your software will typically emulate a VT-100 terminal. 
To find out what shell program you use, type:

echo $SHELL 

Then if you use ksh, bash, or sh, type:
TERM=vt100; export TERM 

If you use csh or tcsh, type:
set term = vt100 

You can automate this task by adding the appropriate command to your default command shell's configuration file. 

Using vi modes:
---------------

Vi has three "modes": edit, insert, and colon.

- Edit mode (press Esc)
Vi enters edit mode by default when it starts up. Edit mode allows you to move the cursor and 
edit the text buffer. 

- Insert mode (press i)
Insert mode "drops" the cursor at a specific point in the buffer, allowing you to insert text. 
To enter insert mode, position the cursor where you want to place text and press i. 

If you make a typing mistake, press ESC to return to edit mode and then reposition the cursor at the error, 
and press i to get back to insert mode.

- Colon mode (press : with a command)
You enter colon mode from edit mode by typing a colon followed by a command. Some useful commands are:

:w           Write buffer to the current filename.
:w newname   Write buffer to file newname.
:r           Read the current filename into the buffer.
:r oldname   Read the file oldname into the buffer.
:q!          Quit vi without saving buffer.
:wq          Write buffer to current filename and quit vi.
:e filename  Close current buffer and edit (open) filename.
:e #         Close current buffer and edit (open) previous file.

Search and Replace:
-------------------

Replace: Same as with sed, Replace OLD with NEW: 
ESC,

 First occurrence on current line:      :s/OLD/NEW
 Globally (all) on current line:        :s/OLD/NEW/g 
 Between two lines #,#:                 :#,#s/OLD/NEW/g
 Every occurrence in file:              :%s/OLD/NEW/g 


The VI editor has two kinds of searches: string and character. For a string search, the / and ? commands are used. 
When you start these commands, the command just typed will be shown on the bottom line, where you type the particular 
string to look for. These two commands differ only in the direction where the search takes place. 
The / command searches forwards (downwards) in the file, while the ? command searches backwards (upwards) in the file. 
The n and N commands repeat the previous search command in the same or opposite direction, respectively. 
Some characters have special meanings to VI, so they must be preceded by a backslash (\) to be included as part 
of the search expression. 


36. ulimit:
===========

limit, ulimit, unlimit - set or get limitations on the  system resources available to the current shell and its 
descendents.

/usr/bin/ulimit
     Example 1:  Limiting the stack size

     To limit the stack size to 512 kilobytes:

     example% ulimit -s 512
     example% ulimit -a
     time(seconds)         unlimited
     file(blocks)            100
     data(kbytes)            523256
     stack(kbytes)           512
     coredump(blocks)        200
     nofiles(descriptors)    64
     memory(kbytes)          unlimited


ULIMIT - Sets the file size limit for the login. Units are disk blocks. Default is zero (no limit). 
Be sure to specify even numbers, as the ULIMIT variable accepts a number of 512-byte blocks.


$ ulimit -a    # Display limits for your session under sh or ksh
$ limit        # Display limits for your session under csh or tcsh
$ ulimit -c SIZE_IN_BLOCKS       # Limit core size under sh or ksh
$ limit coredumpsize SIZE_IN_KB  # Limit core size under csh or tcsh

If you see a core file lying around, just type "file core" to get some details about it. Example: 
$ file core
  core:ELF-64 core file - PA-RISC 2.0 from 'sqlplus' - received SIGABRT


Run the Unix process debugger to obtain more information about where and why the process abended. 
This information is normally requested by Oracle Support for in-depth analysis of the problem. Some example: 

      Solaris:
          $ gdb $ORACLE_HOME/bin/sqlplus core
            bt                 # backtrace of all stack frames
            quit

      HP-UX, Solaris, etc:
          $ adb $ORACLE_HOME/bin/sqlplus core
            $c
            $q

      Sequent:
          $ debug -c core $ORACLE_HOME/bin/sqlplus
          debug> stack
          debug> quit

AIX:


Purpose
Sets or reports user resource limits.

Syntax
ulimit [ -H ] [ -S ] [ -a ] [ -c ] [ -d ] [  -f ] [ -m ] [ -n ] [ -s ] [ -t ] [ Limit ]

Description
The ulimit command sets or reports user process resource limits, as defined in the /etc/security/limits file. 
This file contains these default limits:


fsize = 2097151
core = 2097151
cpu = -1
data = 262144
rss = 65536
stack = 65536
nofiles = 2000

These values are used as default settings when a new user is added to the system. The values are set with the 
mkuser command when the user is added to the system, or changed with the chuser command.

Limits are categorized as either soft or hard. With the ulimit command, you can change your soft limits, 
up to the maximum set by the hard limits. You must have root user authority to change resource hard limits.

Many systems do not contain one or more of these limits. The limit for a specified resource is set when the 
Limit parameter is specified. The value of the Limit parameter can be a number in the unit specified with 
each resource, or the value unlimited. To set the specific ulimit to unlimited, use the word unlimited


Note: Setting the default limits in the /etc/security/limits file sets system wide limits, not just limits 
taken on by a user when that user is created.
The current resource limit is printed when you omit the Limit parameter. The soft limit is printed unless 
you specify the -H flag. When you specify more than one resource, the limit name and unit is printed 
before the value. If no option is given, the -f flag is assumed.

Since the ulimit command affects the current shell environment, it is provided as a shell regular built-in command. 
If this command is called in a separate command execution environment, it does not affect the file size limit of 
the caller's environment. This would be the case in the following examples:


nohup ulimit -f 10000
env ulimit 10000
Once a hard limit has been decreased by a process, it cannot be increased without root privilege, even to revert 
to the original limit.

For more information about user and system resource limits, refer to the getrlimit, setrlimit, or vlimit 
subroutine in AIX 5L Version 5.2 Technical Reference: Base Operating System and Extensions Volume 1.

Flags

-a Lists all of the current resource limits. 
-c Specifies the size of core dumps, in number of 512-byte blocks. 
-d Specifies the size of the data area, in number of K bytes. 
-f Sets the file size limit in blocks when the Limit parameter is used, or reports the file size limit if no parameter is specified. The -f flag is the default. 
-H Specifies that the hard limit for the given resource is set. If you have root user authority, you can increase the hard limit. Anyone can decrease it. 
-m Specifies the size of physical memory, in number of K bytes. 
-n Specifies the limit on the number of file descriptors a process may have. 
-s Specifies the stack size, in number of K bytes. 
-S Specifies that the soft limit for the given resource is set. A soft limit can be increased up to the value of the hard limit. If neither the -H nor -S flags are specified, the limit applies to both. 
-t Specifies the number of seconds to be used by each process. 


You can check the current ulimit settings using the ulimit -a command, and at least the following 
three commands should be run, as the user account that will launch Java: 

ulimit -m unlimited

ulimit -d unlimited

ulimit -f unlimited 


=====================================
37. RAM disks:
=====================================


37.1 AIX:
=========

Example:
--------

# mkramdisk SIZE
/dev/rramdiskxx
# mkfs -V jfs /dev/ramdiskxx
# mount -V jfs -o nointegrity /dev/ramdiskxx /whatever_mountpoint


mkramdisk Command:
------------------
Purpose
Creates a RAM disk using a portion of RAM that is accessed through normal reads and writes.

Syntax
mkramdisk [ -u ] size[ M | G ]

Description
The mkramdisk command is shipped as part of bos.rte.filesystems, which allows the user to create a RAM disk. 
Upon successful execution of the mkramdisk command, a new RAM disk is created, a new entry added to /dev, 
the name of the new RAM disk is written to standard output, and the command exits with a value of 0. 
If the creation of the RAM disk fails, the command prints an internalized error message, and the command 
will exit with a nonzero value.

The size can be specified in terms of MB or GB. By default, it is in 512 byte blocks. A suffix of M will be used 
to specify size in megabytes and G to specify size in gigabytes.

The names of the RAM disks are in the form of /dev/rramdiskx where x is the logical RAM disk number (0 through 63).

The mkramdisk command also creates block special device entries (for example, /dev/ramdisk5) although use 
of the block device interface is discouraged because it adds overhead. The device special files in /dev are owned 
by root with a mode of 600. However, the mode, owner, and group ID can be changed using normal system commands.

Up to 64 RAM disks can be created. 

Note:
The size of a RAM disk cannot be changed after it is created.
The mkramdisk command is responsible for generating a major number, loading the ram disk kernel extension, 
configuring the kernel extension, creating a ram disk, and creating the device special files in /dev. 
Once the device special files are created, they can be used just like any other device special files through 
normal open, read, write, and close system calls.

RAM disks can be removed by using the rmramdisk command. RAM disks are also removed when the machine is rebooted.

By default, RAM disk pages are pinned. Use the -u flag to create RAM disk pages that are not pinned.

Flags
-u Specifies that the ram disk that is created will not be pinned. By default, the ram disk will be pinned. 

Parameters

size Indicates the amount of RAM (in 512 byte increments) to use for the new RAM disk. For example, typing: 

# mkramdisk 1

creates a RAM disk that uses 512 bytes of RAM. 

To create a RAM disk that uses approximately 20 MB of RAM, type: 

# mkramdisk 40000 

Exit Status
The following exit values are returned:

0 Successful completion. 
>0 An error occurred. 

Examples:

To create a new ram disk using a default 512-byte block size, and the size is 500 MBs (1048576 * 512), enter: 

# mkramdisk 1048576 
/dev/rramdisk0

The /dev/rramdisk0 ramdisk is created.

To create a new ramdisk with a size of 500 Megabytes, enter: 

# mkramdisk 500M 
/dev/rramdisk0

The /dev/rramdisk0 ramdisk is created. Note that the ramdisk has the same size as example 1 above.

To create a new ram disk with a 2-Gigabyte size, enter: 

# mkramdisk 2G 
/dev/rramdisk0

To set up a RAM disk that is approximately 20 MB in size and create a JFS file system on that RAM disk, 
enter the following: 

# mkramdisk 40000
# ls -l /dev | grep ram
# mkfs -V jfs /dev/ramdiskx
# mkdir /ramdisk0
# mount -V jfs -o nointegrity /dev/ramdiskx /ramdiskx

where x is the logical RAM disk number. 

Note:
If using file system on a RAM disk, the RAM disk must be pinned.


37.2 Linux:
===========

Redhat:

It is very easy to use a ramdisk. First of all, the default installation of RedHat >= 6.0 comes with ramdisk support.
 All you have to do is format a ramdisk and then mount it to a directory. To find out all the ramdisks you 
have available, do a "ls -al /dev/ram*". This gives you the preset ramdisks available to your liking. 
These ramdisks don't actually grab memory until you use them somehow (like formatting them). 
Here is a very simple example of how to use a ramdisk. 

# create a mount point:
mkdir /tmp/ramdisk0
# create a filesystem:
mke2fs /dev/ram0
# mount the ramdisk:
mount /dev/ram0 /tmp/ramdisk0


Those three commands will make a directory for the ramdisk , format the ramdisk (create a filesystem), 
and mount the ramdisk to the directory "/tmp/ramdisk0". Now you can treat that directory as a pretend partition! 
Go ahead and use it like any other directory or as any other partition. 
If the formatting of the ramdisk faild then you might have no support for ramdisk compiled into the Kernel. 
The Kernel configuration option for ramdisk is CONFIG_BLK_DEV_RAM . 
The default size of the ramdisk is 4Mb=4096 blocks. You saw what ramdisk size you got while you were running mke2fs. 
mke2fs /dev/ram0 should have produced a message like this: 

mke2fs 1.14, 9-Jan-1999 for EXT2 FS 0.5b, 95/08/09
Linux ext2 filesystem format
Filesystem label=
1024 inodes, 4096 blocks
204 blocks (4.98%) reserved for the super user
First data block=1
Block size=1024 (log=0)
Fragment size=1024 (log=0)
1 block group
8192 blocks per group, 8192 fragments per group
1024 inodes per group

Running df -k /dev/ram0 tells you how much of that you can really use (The filesystem takes also some space): 

>df -k /dev/ram0
Filesystem  1k-blocks  Used Available Use% Mounted on
/dev/ram0        3963    13      3746   0% /tmp/ramdisk0

What are some catches? Well, when the computer reboots, it gets wiped. Don't put any data there that isn't 
copied somewhere else. If you make changes to that directory, and you need to keep the changes, figure out 
some way to back them up.     

- Changing the size of the ramdisks
To use a ram disk you either need to have ramdisk support compiled into the Kernel or you need to compile 
it as loadable module. The Kernel configuration option is CONFIG_BLK_DEV_RAM . Compiling the ramdisk a loadable module 
has the advantage that you can decide at load time what the size of your ramdisks should be.

Okay, first the hard way. Add this line to your lilo.conf file: 

   ramdisk_size=10000 (or ramdisk=10000 for old kernels) 

and it will make the default ramdisks 10 megs after you type the "lilo" command and reboot the computer. 
Here is an example of my /etc/lilo.conf file. 

boot=/dev/hda
map=/boot/map
install=/boot/boot.b
prompt
timeout=50
image=/boot/vmlinuz
	label=linux
	root=/dev/hda2
	read-only
	ramdisk_size=10000

Actually, I got a little over 9 megs of usable space as the filesystem takes also a little space. 
When you compile ramdisk support as loadable module then you can decide at load time what the size should be. 
This is done either with an option line in the /etc/conf.modules file: 

options rd rd_size=10000

or as a command line parameter to ismod: 

insmod rd rd_size=10000

Here is an example which shows how to use the module: 
Unmount the ramdisk mounted in the previous chapter, umount /tmp/ramdisk0 . 
Unload the module (it was automatically loaded in the previous chapter), rmmod rd 
Load the ramdisk module and set the size to 20Mb, insmod rd rd_size=20000 
create a file system, mke2fs /dev/ram0 
mount the ramdisk, mount /dev/ram0 /tmp/ramdisk0 
  
- Example of how to use a RamDisk for a webserver.
Okay, here is an example of how to use 3 ramdisks for a webserver. Let us say you are 99% confident that your default installation of Apache for RedHat 6.0 won't use more than 9 megs for its cgi-scripts, html, and icons. Here is how to install one. 
First, issue this command to move the real copy of the document root directory of your webserver to a different place. Also, make the directories to mount the ramdisks . 
mv /home/httpd/ /home/httpd_real
mkdir /home/httpd
mkdir /home/httpd/cgi-bin
mkdir /home/httpd/html
mkdir /home/httpd/icons

Then, add these commands to the start procedure in your /etc/rc.d/init.d/httpd.init 
(or where ever the httpd gets started on your system): 

	### Make the ramdisk partitions
/sbin/mkfs -t ext2 /dev/ram0
/sbin/mkfs -t ext2 /dev/ram1
/sbin/mkfs -t ext2 /dev/ram2

	### Mount the ramdisks to their appropriate places

mount /dev/ram0 /home/httpd/cgi-bin
mount /dev/ram1 /home/httpd/icons
mount /dev/ram2 /home/httpd/html

	### Copying real directory to ramdisks (the
  ### data on the ramdisks is lost after a reboot)
tar -C /home/httpd_real -c . | tar -C /home/httpd -x
  
  ### After this you can start the web-server.

  
37.3 Solaris:
=============

Note 1:
-------

Solaris 9 and higher: use the ramdiskadm command:

Quick example:

Example: Creating a 2MB Ramdisk Named mydisk 

# ramdiskadm -a mydisk 2m
/dev/ramdisk/mydisk 

Example: Listing All Ramdisks

# ramdiskadm
Block Device                   Size  Removable
/dev/ramdisk/miniroot     134217728    No
/dev/ramdisk/certfs         1048576    No
/dev/ramdisk/mydisk         2097152    Yes 


-- The ramdiskadm command:


NAME
ramdiskadm- administer ramdisk pseudo device
SYNOPSIS
/usr/sbin/ramdiskadm -a name size [g | m | k | b]
/usr/sbin/ramdiskadm -d name 
/usr/sbin/ramdiskadm 

DESCRIPTION
The ramdiskadm command administers ramdisk(7D), the ramdisk driver. Use ramdiskadm to create a new named 
ramdisk device, delete an existing named ramdisk, or list information about exisiting ramdisks.

Ramdisks created using ramdiskadm are not persistent across reboots.

OPTIONS
The following options are supported:

-a name size 
Create a ramdisk named name of size size and its corresponding block and character device nodes.

name must be composed only of the characters a-z, A-Z, 0-9, _ (underbar), and - (hyphen), but it must not 
begin with a hyphen. It must be no more than 32 characters long. Ramdisk names must be unique.

The size can be a decimal number, or, when prefixed with 0x, a hexadecimal number, and can specify the size 
in bytes (no suffix), 512-byte blocks (suffix b), kilobytes (suffix k), megabytes (suffix m) 
or gigabytes (suffix g). The size of the ramdisk actually created might be larger than that specified, 
depending on the hardware implementation.

If the named ramdisk is successfully created, its block device path is printed on standard out.

-d name 
Delete an existing ramdisk of the name name. This command succeeds only when the named ramdisk is not open. 
The associated memory is freed and the device nodes are removed.

You can delete only ramdisks created using ramdiskadm. It is not possible to delete a ramdisk that was created 
during the boot process.

Without options, ramdiskadm lists any existing ramdisks, their sizes (in decimal), and whether they can be removed 
by ramdiskadm (see the description of the -d option, above).


Note 2:
-------

thread:

In Solaris =< version 8, its a bit of a pain.

This is what i asked:

Is there anyone who could tell me how to make a ram disk in Solaris 8?

I have a Sun Sparc Box running Solaris 8, and I want to use some of
it's memory to mount a new file-system

Thanks in advance,

The solution:

As many mentioned i could use tmpfs, lik this:

mkdir /ramdisk
mount -F tmpfs -o size=500m swap /ramdisk

However this is not a true ramdisk (it really uses VM, not RAM, and the size
is an upper limit, not a reservation) This is what Solaris provides.


======================
38. Software Packages:
======================


38.1 Software Packages on Solaris:
==================================

This section deals about software packages for Solaris. A software package is a collection of files
and directories in a defined format. It describes a software application such as manual pages and
line printer support. Solaris 8 has about 80 packages that total about 900MB.

A Solaris software package is the standard way to deliver bundeld and unbundled software.
Packages are administered by using the package administration commands, and are generally
identified by a SUNWxxx naming convention.

Software packages are grouped into software clusters, which are logical collections of
software packages. Some clusters contain just 1 or 2 packages, while another may contain more
packages.

Installing Software Packages:
-----------------------------

Solaris provides the tools for adding and removing software from a system.
You can use pkgadd command to install packages, and the pkgrm command to remove packages.
There are also GUI tools to install and remove packages.

Package files are delivered in package format and are unusable as they are delivered. The pkgadd command interprets the software package's 
control files, and then uncompresses and installs the product files onto the system's local disk.

Although the pkgadd and pkgrm commands do not log their output to a standard location, they do keep track of the product 
that is installed or removed. The pkgadd and pkgrm commands store information about a package that has been installed 
or removed in a software product database.

By updating this database, the pkgadd and pkgrm commands keep a record of all software products installed on the system.


-- pkgadd:
-- -------

pkgadd [-nv] [-a admin] [-d device] [[-M]-R root_path] [-r response] [-V fs_file] [pkginst...]
pkgadd -s spool [-d device] [pkginst...]


     -a admin
           Define an installation administration file, admin,  to
           be  used  in place of the default administration file.
           The token none overrides the use of  any  admin  file,
           and  thus  forces interaction with the user.  Unless a
           full path name is given, pkgadd  first  looks  in  the
           current working directory for the administration file.
           If the specified administration file  is  not  in  the
           current   working   directory,  pkgadd  looks  in  the
           /var/sadm/install/admin directory for the  administra-
           tion file.

     -d device
           Install or copy a package from device. device can be a
           full  path  name to a directory or the identifiers for
           tape, floppy disk, or  removable  disk  (for  example,
           /var/tmp  or   /floppy/floppy_name ). It can also be a
           device alias (for example, /floppy/floppy0).


pkgadd transfers the contents of a software package from the distribution medium or directory to install 
it onto the system. Used without the -d option, pkgadd looks in the default spool directory for 
the package (var/spool//pkg). Used with the -s option, it writes the package to a spool directory 
instead of installing it.

In general you would pkgadd as follows:

# pkgadd -a admin-file -d device-name pkgid

Or just

# pkgadd -d device-name pkgid


-a admin-file 
 (Optional) Specifies an administration file that the pkgadd command should consult during the installation. 
-d device-name 
 Specifies the absolute path to the software packages. device-name can be the path to a device, a directory, or a spool directory. 
 If you do not specify the path where the package resides, the pkgadd command checks the default spool directory (/var/spool/pkg). 
 If the package is not there, the package installation fails. 
pkgid 
 (Optional) Is the name of one or more packages (separated by spaces) to be installed. 
 If omitted, the pkgadd command installs all available packages.
   

After installing a package, verify the install with 

# pkgchk -v pkgid


Example 1:

following example shows how install the SUNWpl5u package from a mounted Solaris 9 CD. 
The example also shows how to verify that the package files were installed properly. 

# pkgadd -d /cdrom/cdrom0/s0/Solaris_9/Product SUNWpl5u
	.
Installation of <SUNWpl5u> was successful.
# pkgchk -v SUNWpl5u
/usr
/usr/bin
/usr/bin/perl
/usr/perl5
/usr/perl5/5.00503
 

Example 2:
# pkgadd -d /cdrom/cdrom0/s0/Solaris_2.6

Example 3:
# pkgadd -d /tmp/signed_pppd
The following packages are available:
  1  SUNWpppd     Solaris PPP Device Drivers
                  (sparc) 11.10.0,REV=2003.05.08.12.24

Select package(s) you wish to process (or 'all' to process
all packages). (default: all) [?,??,q]: all
Enter keystore password:

Example 4:
# pkgadd -d http://install/signed-video.pkg

## Downloading...
..............25%..............50%..............75%..............100%
## Download Complete

Example 5:
# pkgadd -d . DISsci    The command will create a new directory structure in /opt/DISsci

Example 6:
Spooling the packages to a spool directory

# pkgadd -d /cdrom/sol_8_sparc/s0/Solaris_8/Product -s /var/spool/pkg SUNWaudio


Example 7:

Installing Software Packages From a Remote Package Server
If the packages you want to install are available from a remote system, you can manually mount the directory that contains the packages 
(in package format) and install packages on the local system.

The following example shows how install software packages from a remote system. In this example, assume that the remote system 
named package-server has software packages in the /latest-packages directory. The mount command mounts the packages locally on /mnt, 
and the pkgadd command installs the SUNWpl5u package. 

# mount -F nfs -o ro package-server:/latest-packages /mnt
# pkgadd -d /mnt SUNWpl5u
	.
Installation of <SUNWpl5u> was successful. 


Other package related commands:
-------------------------------

- pkgrm
- pkgchk
- pkginfo
- pkgask
- pkgparam

Displays a package parameter values.
# pkgparam -d /cdrom/cdrom0/s0/Solaris_2.8/Product SUNWvolr SUNW_PKGTYPE
The system responds with the location where the application will be stored.


Using a Response File:
----------------------

A response file contains your answers to specific questions that are asked by an interactive package. An interactive package includes 
a request script that asks you questions prior to package installation, such as whether or not optional pieces of the package should be installed.

If prior to installation, you know that the package you want to install is an interactive package, and you want to store 
your answers to prevent user interaction during future installations of this package, you can use the pkgask command to save your response. 

Once you have stored your responses to the questions asked by the request script, you can use the pkgadd -r command 
to install the package without user interaction.


-- pkginfo
-- -------

# pkginfo
system      SUNWaccr       System Accounting, (Root)
system      SUNWaccu       System Accounting, (Usr)
system      SUNWadmap      System administration applications
system      SUNWadmc       System administration core libraries
.
.
etc..

Example-Displaying Detailed Information About Software Packages


# pkginfo -l SUNWcar

   PKGINST:  SUNWcar
      NAME:  Core Architecture, (Root)
  CATEGORY:  system
      ARCH:  sparc.sun4u
   VERSION:  11.9.0,REV=2001.10.16.17.05
   BASEDIR:  /
    VENDOR:  Sun Microsystems, Inc.
      DESC:  core software for a specific hardware platform group
    PSTAMP:  crash20011016171723
  INSTDATE:  Nov 02 2001 08:53
   HOTLINE:  Please contact your local service provider
    STATUS:  completely installed
     FILES:    111 installed pathnames
                36 shared pathnames
                40 directories
                56 executables
             17626 blocks used (approx) 


# pkginfo -d /export/host1/packages -l SUNWman

For the spool directory, you may use the token spool.


-- pkgrm:
-- ------

Always use the pkgrm command to remove installed packages. Do not use the rm command, which will corrupt 
the system's record-keeping of installed packages. 

Examples:

# pkgrm pkgid ... 

pkgid identifies the name of one or more packages (separated by spaces) to be removed. If omitted, pkgrm removes all available packages.


# pkgrm SUNWctu

The following package is currently installed:
   SUNWctu         Netra ct usr/platform links (64-bit)
                   (sparc.sun4u) 11.9.0,REV=2001.07.24.15.53

Do you want to remove this package? y

## Removing installed package instance <SUNWctu>
## Verifying package dependencies.
## Processing package information.
## Removing pathnames in class <none>

This example shows how to remove a spooled package.

# pkgrm -s /export/pkg SUNWdmfex.u
The following package is currently spooled:
   SUNWdmfex.u           Sun Davicom 10/100Mb Ethernet Driver (64-bit)
                         (sparc.sun4u) 11.9.0,REV=2001.07.24.15.53

Do you want to remove this package? y

Removing spooled package instance <SUNWdmfex.u> 


Some Graphical tools for installing packages:
---------------------------------------------

>>> admintool (Solaris 8,9 Not in Solaris 10)

>>> Solaris Product Registry

The Solaris Product Registry is a GUI tool that enables you to install and uninstall software packages.

To startup the Solaris Product Registry to view, install or uninstall software, use the command
/usr/bin/prodreg

>>> Solaris Management Console (smc) Patch Manager

The Solaris Management Console provides a new Patches Tool for managing patches. You can only use the Patches Tool 
to add patches to a system running the Solaris 9 or later release.


Installing Patches:
-------------------

#patchadd
#patchrm

patchadd [-d] [-u] [-B backout_dir] [-C net_install_image| -R client_root_path| -S service] patch 
patchadd [-d] [-u] [-B backout_dir] [-C net_install_image| -R client_root_path| -S service] -M patch_dir| patch_id...
         | patch_dir patch_list 
patchadd [-C net_install_image| -R client_root_path| -S service] -p 

Examples:

Example 1:
Show the patches on your system:
# showrev -p    shows all patches applied to a system
# patchadd -p   same as above
# pkgparam <pkgid> PATCHLIST  shows all patches applied to the package identified by <pkgid>

Example 2:
# patchadd /var/spool/patch/104945-02
# patchadd -R /export/root/client1  /var/spool/patch/104945-02
# patchadd -M /var/spool/patch 104945-02  104946-02 102345-02
# patchadd -M /var/spool/patch patchlist
# patchadd -M /var/spool/patch -R /export/root/client1 -B /export/backoutrepository 104945-02 104946-02 102345-02


The /var/sadm/install/contents file:
------------------------------------

The /var/sadm/install/contents file is the file which Solaris uses to keep track of all the files 
installed on a system, and their corresponding packages.

Every file installed on a Solaris OS using the pkgadd command has an entry in the database
of installed files /var/sadm/install/contents.
The contents is a textfile that contains one line per installed file.


38.2 Software Packages on AIX:
==============================

Installing software, filesets, packages, lpp:
---------------------------------------------

Similar to Solaris, AIX5L also has a specific terminology related to installable software.
There are 4 basic package concepts in AIX5L: fileset, package, LPP, and bundle.

- Fileset: 
A fileset is the smallest individually installable unit. It's a collection of files that provide a specific
function. For example, the "bos.net.tcp.client" is a fileset in the "bos.net" package. 

- Package:
A package contains a group of filesets with a common function, This is a single installable image,
for example "bos.net".

- LPP:
This is a complete software product collection, including all the packages and filesets required.
LPP's are separately orderable products that will run on the AIX operating system, for example 
BOS, DB2, CICS, ADSM and so on.


-- AIX verifying correct installation:
# lppchk

# lppchk -v     Fileset version consistency check
# lppchk -l     File link verification


P521:/apps $lppchk -l
lppchk:  No link found from /etc/security/mkuser.sys to /usr/lib/security/mkuser.sys.
lppchk:  No link found from /etc/security/mkuser.default to /usr/lib/security/mkuser.default.


-- AIX installing maintenance levels and fixes:
1. download the fix from IBM website 
   http://techsupport.services.ibm.com/server/support?view=pSeries
2. uncompress and untar the software archive
3. type 

smitty update_all


Install a fix with instfix:
---------------------------

P521:/apps $instfix
Usage: instfix [-T [-M platform]] [-s string] [ -k keyword | -f file ]
        [-d device] [-S] [-p | [-i [-c] [-q] [-t type] [-v] [-F]]] [-a]

Function: Installs or queries filesets associated with keywords or fixes.

        -a Display the symptom text (can be combined with -i, -k, or -f).
        -c Colon-separated output for use with -i. Output includes keyword
           name, fileset name, required level, installed level, status, and
           abstract.  Status values are < (down level), = (correct level),
           + (superseded), and ! (not installed).
        -d Input device (required for all but -i and -a).
        -F Returns failure unless all filesets associated with the fix
           are installed.
        -f Input file containing keywords or fixes. Use '-' for standard input.
           The -T option produces a suitable input file format for -f.
        -i Use with -k or -f option to display whether specified fixes or
           keywords are installed.  Installation is not attempted.
           If neither -k nor -f is specified, all known fixes are displayed.
        -k Install filesets for a keyword or fix.
        -M Use with -T option to display information for fixes present
           on the media that have to do with the platform specified.
        -p Use with -k or -f to print filesets associated with keywords.
           Installation is not attempted when -p is used.
        -q Quiet option for use with -i.  If -c is specified, no heading is
           displayed.  Otherwise, no output is displayed.
        -S Suppress multi-volume processing.
        -s Search for and display fixes on media containing a specified string.
        -T Display fix information for complete fixes present on the media.
        -t Use with -i option to limit search to a given type.  Currently
           valid types are 'f' (fix) and 'p' (preventive maintenance).
        -v Verbose option for use with -i.  Gives information about each
           fileset associated with a fix or keyword.
           to the environment provided.


Another option is to use the instfix command. Any fix can have a single fileset or multiple filesets that
comprise that fix. Fix information is organized in the Table of Contents (TOC) on the installation media.
After a fix is installed, fix information is kept on the system in a fix database.

instfix [ -T ] [ -s String ] [ -S ] [ -k Keyword | -f File ] [ -p ] [ -d Device ] [ -i [ -c ] [ -q ] 
        [ -t Type ] [ -v ] [ -F ] ] [ -a ]

Examples:

- If you want to install only a specific fix, use # instfix -k <fileset> -d <device>, for example
# instfix -k IX75893 -d /dev/cd0
# instfix -k IX75893 -d .
# instfix -k IY63533 -d .

- To list fixes that are on a CD-ROM in /dev/cd0, enter
# instfix -T -d /dev/cd0
IX75893

- To determine if for example APAR IX75893 is installed on the system, enter
# instfix -ik IX75893
Not all filesets for IX75893 were found.

You will always be able to determine if an APAR is installed on your system using the 
command instfix -ivk APAR_NUMBER , whereas installed PTFs are not trackable. 

- How to determine if all filesets of a ML are installed?

P521:/apps $instfix -i | grep ML
    All filesets for 5.2.0.0_AIX_ML were found.
    All filesets for 5200-01_AIX_ML were found.
    All filesets for 5200-02_AIX_ML were found.
    All filesets for 5200-03_AIX_ML were found.
    All filesets for 5200-04_AIX_ML were found.
    All filesets for 5200-05_AIX_ML were found.
    All filesets for 5200-06_AIX_ML were found.
    All filesets for 5200-07_AIX_ML were found.
    All filesets for 5200-08_AIX_ML were found.
    All filesets for 5200-09_AIX_ML were found.


The command "instfix -i | grep ML" is essentially the same as "instfix -i -tp".

- To detect incomplete AIX maintaince levels: 
# instfix -i |grep ML
 Not all filesets for 4.3.1.0_AIX_ML were found.
 Not all filesets for 4.3.2.0_AIX_ML were found.
 All filesets for 4.3.1.0_AIX_ML were found.
 Not all filesets for 4.3.2.0_AIX_ML were found.
 Not all filesets for 4.3.3.0_AIX_ML were found.
 Not all filesets for 4330-02_AIX_ML were found.
 All filesets for 4320-02_AIX_ML were found.
 Not all filesets for 4330-03_AIX_ML were found.
..
..

You can also use smitty:

# smitty instfix

                          Update Software by Fix (APAR)

Type or select a value for the entry field.
Press Enter AFTER making all desired changes.

                                                        [Entry Fields]
* INPUT device / directory for software              []                                                                   +


The lslpp command:
------------------


Purpose

Lists installed software products.

Syntax

lslpp { -d | -E | -f | -h | -i | -l | -L | -p } ] [ -a] [ -c] [ -J ] [ -q ] [ -I
] [ -O { [ r ] [ s ] [ u ] } ] [ [ FilesetName ... | FixID ... | all ]

lslpp -w [ -c ] [ -q ] [ -O { [ r ] [ s ] [ u ] } ] [ FileName ... | all ]

lslpp -L -c [ -v]

lslpp -S [A|O]

lslpp -e

Description

The lslpp command displays information about installed filesets or fileset
updates. The FilesetName parameter is the name of a software product. The FixID
(also known as PTF or program temporary fix ID) parameter specifies the
identifier of an update to a formatted fileset.

When only the -l (lowercase L) flag is entered, the lslpp command displays the
latest installed level of the fileset specified for formatted filesets. The base
level fileset is displayed for formatted filesets. When the -a flag is entered
along with the -l flag, the lslpp command displays information about all
installed filesets for the FilesetName specified. The -I (uppercase i) flag
combined with the -l (lowercase L) flag specifies that the output from the lslpp
command should be limited to base level filesets.


        -a Displays additional ("all") information when combined with
           other flags.  (Not valid with -f, only valid with -B when
           combined with -h)
        -B Permits PTF ID input.  (Not valid with -L)
        -c Colon-separated output.
           (Includes all deinstallable levels of software if -Lc)
        -d Dependents (filesets for which this is a requisite).
        -E License Agreements.
        -S Lists Automatically and Optionally installed filesets.
        -e Lists all efixes on the system.
        -f Files that belong to this fileset.
        -h History information.
        -I Limits listings to base level filesets (no updates displayed).
        -i Product Identification information (requested per fileset).
        -J Use list as the output format.  (Valid with -l and -L)
        -L Lists fileset names, latest level, states, and descriptions.
           (Consolidates usr, root and share part information.)
        -l Lists fileset names, latest level, states, and descriptions.
           (Separates usr, root and share part information.)
        -O Data comes from [r] root and/or [s] share and/or [u] usr.
           (Not valid with -L)
        -p Requisites of installed filesets.
        -q Quiet (no column headers).
        -v Lists additional information from vendor database.
      (Valid with -Lc only)
        -w Lists the fileset that owns this file.

         One of the following mutually exclusive flags: d,f,h,i,L,l,p,w,E,S,e
         must be specified.
P521:/apps $


To display information about installed filesets, you can use the lslpp command.

If you need to check whether certain filesets have been installed, use the lslpp command
as in the following example:

# lslpp -h bos.adt.include bos.adt.l1b bos.adt.l1bm \
           bos.net.ncs 1for_ls.compat 1for_ls.base

In the above example, we check whether those filesets have been installed.

lslpp options:

-l: Displays the name, level, state and description of the fileset.
-h: Displays the installation and update history for the fileset.
-p: Displays requisite information for the fileset.
-d: Displays dependent information for the fileset.
-f: Displays the filenames added to the system during installation of the fileset.
-w: Lists the fileset that owns a file or files.

Examples:

- To display the name, level of the bos.adt.include fileset, use
zd57l09 
# lslpp -l bos.adt.include

  Fileset                      Level  State      Description
  ----------------------------------------------------------------------------
Path: /usr/lib/objrepos
  bos.adt.include           5.2.0.95  COMMITTED  Base Application Development
                                                 Include Files


- To display all files in the inventory database which include vmstat, use

# lslpp -w "*vmstat*"
  File                                        Fileset               Type
  ----------------------------------------------------------------------------
  /usr/sbin/lvmstat                           bos.rte.lvm           File
  /usr/share/man/info/EN_US/a_doc_lib/cmds/aixcmds6/vmstat.htm
                               infocenter.man.EN_US.commands        File
  /usr/share/man/info/EN_US/a_doc_lib/cmds/aixcmds3/lvmstat.htm
                               infocenter.man.EN_US.commands        File
  /usr/bin/vmstat                             bos.acct              File
  /usr/bin/vmstat64                           bos.acct              File
  /usr/es/sbin/cluster/OEM/VxVM40/cllsvxvmstat
                                     cluster.es.server.utils        File

The same for trying to find out what contains the make command:

# lslpp -w "*make*"

  /usr/bin/makedev                            bos.txt.tfs           File
  /usr/ccs/bin/make                           bos.adt.base          File
  /usr/bin/make                               bos.adt.base          Symlink
  /usr/bin/makekey                            bos.adt.base          Symlink
  /usr/ccs/bin/makekey                        bos.adt.base          File


- To list the installation state for the most recent level of installed filesets for all of the bos.rte filesets, use
# lslpp -l "bos.rte.*"
# lslpp -l | grep bos.rte

So, "lslpp -l" shows all of the filesets

- To display the names of the files added to the system during installation of the bos.perf.perfstat fileset, use
# lslpp -f "*perf*"

- To check whether some certain filesets have been installed, like in the following example:
# lslpp -h bos.adt.include bos.adt.lib bos.adt.l1bm \
           bos.net.ncs 1for_ls.compat 1for_ls.base

- To check you have the SDD driver on your system:
# lslpp -L devices.sdd.*


- To check the Java filesets on your system:
# lslpp -l | grep Java


/root:>lslpp -l | grep Java
  Java131.rte.bin           1.3.1.16  COMMITTED  Java Runtime Environment
  Java131.rte.lib           1.3.1.16  COMMITTED  Java Runtime Environment
                                                 Java-based build tool.
                                                 JavaBeans(TM) (EJB(TM)).
                                                 Javadocs
                                                 Java(TM) technology-based Web
                                                 Java(TM) technology-based Web
                                                 Javadocs
  idebug.rte.hpj             9.2.5.0  COMMITTED  High-Performance Java Runtime
  idebug.rte.jre             9.2.5.0  COMMITTED  Java Runtime Environment
  idebug.rte.olt.Java        9.2.5.0  COMMITTED  Object Level Trace Java

# lslpp -l | grep Java13_64


# lslpp -l | grep App
                                                 Application Server Dynamic
                                                 WebSphere Application Server.
                                                 for WebSphere Application
                                                 the WebSphere Application
                                                 Application Profile, and
  X11.adt.bitmaps            5.2.0.0  COMMITTED  AIXwindows Application
  X11.adt.ext               5.2.0.30  COMMITTED  AIXwindows Application
  X11.adt.imake              5.2.0.0  COMMITTED  AIXwindows Application
  X11.adt.include           5.2.0.10  COMMITTED  AIXwindows Application
  X11.adt.lib               5.2.0.40  COMMITTED  AIXwindows Application
  X11.adt.motif              5.2.0.0  COMMITTED  AIXwindows Application
  X11.apps.aixterm          5.2.0.30  COMMITTED  AIXwindows aixterm Application
  X11.apps.clients           5.2.0.0  COMMITTED  AIXwindows Client Applications
                                                 Applications
  X11.apps.msmit            5.2.0.50  COMMITTED  AIXwindows msmit Application
                                                 Configuration Applications
                                                 Applications
  X11.apps.xdm              5.2.0.40  COMMITTED  AIXwindows xdm Application
  X11.apps.xterm             5.2.0.0  COMMITTED  AIXwindows xterm Application
                             5.2.0.0  COMMITTED  AIXwindows Client Application
  X11.msg.en_US.apps.config  5.2.0.0  COMMITTED  AIXwindows Config Application
  bos.adt.base              5.2.0.50  COMMITTED  Base Application Development
  bos.adt.debug             5.2.0.50  COMMITTED  Base Application Development
  bos.adt.graphics          5.2.0.40  COMMITTED  Base Application Development
  bos.adt.include           5.2.0.53  COMMITTED  Base Application Development
  bos.adt.lib               5.2.0.50  COMMITTED  Base Application Development
  bos.adt.libm              5.2.0.50  COMMITTED  Base Application Development
  bos.adt.sccs               5.2.0.0  COMMITTED  SCCS Application Development
  bos.adt.syscalls          5.2.0.50  COMMITTED  System Calls Application
  bos.adt.utils             5.2.0.50  COMMITTED  Base Application Development
  bos.net.tcp.adt           5.2.0.40  COMMITTED  TCP/IP Application Toolkit
                                                 Application Runtime
  xlC.adt.include            6.0.0.0  COMMITTED  C Set ++ Application
  bos.adt.data               5.2.0.0  COMMITTED  Base Application Development


Removing a fix:
---------------

On AIX you can use either the 
installp -r    command, or use the
smitty reject  fast path


Smitty fastpaths:
-----------------

-- AIX software maintenance:
# smitty maintain_software

From here you can commit or reject installed software. You can also copy the filesets from the installation media
to a directory on disk. The default directory for doing this is /usr/sys/inst.images  

-- Install new software:
# smitty install_update
# smitty install_latest

-- To commit software:
# smitty install_commit

-- To reject software:
# smitty install_reject

-- To remove installed and commited software:
# smitty install_remove

-- To see what fixes are installed on your system:
# smitty show_apar_stat

-- To install individual fix:
# smitty instfix   or
# smitty update_by_fix

-- To install all filesets:
# smitty update_all

-- To view already installed software:
# smitty list_installed


The AIX installp command:
-------------------------

installp Command
Purpose
Installs available software products in a compatible installation package.

Syntax
To Install with Apply Only or with Apply and Commit
installp [ -a | -ac [ -N ] ] [ -eLogFile ] [ -V Number ] [ -dDevice ] [ -b ] [ -S ] [ -B ] [ -D ] [ -I ] [ -p ] 
         [ -Q ] [ -q ] [ -v ] [ -X ] [ -F | -g ] [ -O { [ r ] [ s ] [ u ] } ] [ -tSaveDirectory ] [ -w ] [ -zBlockSize ] 
         { FilesetName [ Level ]... | -f ListFile | all }

To Commit Applied Updates
installp -c [ -eLogFile ] [ -VNumber ] [ -b ] [ -g ] [ -p ] [ -v ] [ -X ] [ -O { [ r ] [ s ] [ u ] } ] [ -w ] { FilesetName [ Level ]... | -f ListFile | all } 

To Reject Applied Updates
installp -r [ -eLogFile ] [ -VNumber ] [ -b ] [ -g ] [ -p ] [ -v ] [ -X ] [ -O { [ r ] [ s ] [ u ] } ] [ -w ] { FilesetName [ Level ]... | -f ListFile } 

To Deinstall (Remove) Installed Software
installp -u [ -eLogFile ] [ -VNumber ] [ -b ] [ -g ] [ -p ] [ -v ] [ -X ] [ -O { [ r ] [ s ] [ u ] } ] [ -w ] { FilesetName [ Level ]... | -f ListFile } 

To Clean Up a Failed Installation:
installp -C [ -b ] [ -eLogFile ] 

To List All Installable Software on Media
installp { -l | -L } [ -eLogFile ] [ -d Device ] [ -B ] [ -I ] [ -q ] [ -zBlockSize ] [ -O { [ s ] [ u ] } ] 

To List All Customer-Reported Problems Fixed with Software or Display All Supplemental Information
installp { -A|-i } [ -eLogFile ] [ -dDevice ] [ -B ] [ -I ] [ -q ] [ -z BlockSize ] [ -O { [ s ] [ u ] } ] { FilesetName [ Level ]... | -f ListFile | all } 

To List Installed Updates That Are Applied But Not Committed
installp -s [ -eLogFile ] [ -O { [ r ] [ s ] [ u ] } ] [ -w ] { FilesetName [ Level ]... | -fListFile | all }

fileset is the lowest installable base unit. For example, bos.net.tcp.client 4.1.0.0 is a fileset. 
A fileset update is an update with a different fix ID or maintenance level. 
For example, bos.net.tcp.client 4.1.0.2 and bos.net.tcp.client 4.1.1.0 are both fileset updates 
for bos.net.tcp.client 4.1.0.0. 

When a base level (fileset) is installed on the system, it is automatically committed. You can remove a fileset 
regardless of the state (committed, broken, committed with applied updates, committed with committed updates, etc.). 

When a fileset update is applied to the system, the update is installed. The current version of that software, 
at the time of the installation, is saved in a special save directory on the disk so that later you can return 
to that version if desired. Once a new version of a software product has been applied to the system, that version 
becomes the currently active version of the software. 

Updates that have been applied to the system can be either committed or rejected at a later time. 
The installp -s command can be used to get a list of applied updates that can be committed or rejected. 

When updates are committed with the -c flag, the user is making a commitment to that version of the software product, 
and the saved files from all previous versions of the software product are removed from the system, thereby making 
it impossible to return to a previous version of the software product. 
Software can be committed at the time of installation by using the -ac flags. Note that committing already 
applied updates does not change the currently active version of a software product. 
It merely removes saved files for previous versions of the software product. 

Examples:

To install all filesets within the bos.net software package in /usr/sys/inst.images directory in the
applied state, enter

# installp -avX -d/usr/sys/inst.images bos.net

To commit all updates, enter

# installp -cgX all

To list the software that is on your CDROM, enter

# installp -L -d /dev/cd0

A record of the installp output can be found in the /var/adm/sw/installp.summary
# cat /var/adm/sw/installp.summary

Used to cleanup after a failed lpp install/update:
# installp -C 

Commits all applied LPPs or PTFs:                                  
# installp -c -g -X all    

Lists the table of contents for the install/update media and saves it into a file named /tmp/toc.list                  
# installp -q -d/dev/rmt1.1 -l > /tmp/toc.list   

Lists the lpps that have been applied but not yet committed or rejected:                                                
# installp -s    

[P521]root@ol116u106:installp -s
0503-459 installp:  No filesets were found in the Software
        Vital Product Database in the APPLIED state.
                               

The AIX geninstall command:
---------------------------

A generic installer that installs software products of various packaging formats. 
For example, installp, RPM, and ISMP.

With the geninstall command, you can list and install packages from media that contains installation images 
packaged in any of the listed formats. The geninstall and gencopy commands recognize the non-installp 
installation formats and either call the appropriate installers or copy the images, respectively.


Beginning in AIX 5L, you can not only install installp formatted packages, but also RPM and 
Install Shield Mutli-Platform (ISMP) formatted packages. Use the Web-based System Manager, 
SMIT, or the geninstall command to install and uninstall these types of packages. 
The geninstall command is designed to detect the format type of a specified package and run the 
appropriate install command.

                                       
Syntax
geninstall -d Media [ -I installpFlags ] [ -E | -T ] [ -t ResponseFileLocation ] 
          [-e LogFile] [ -p ] [ -F ] [ -Y ] [ -Z ] [ -D ] { -f File | Install_List ] | all}

OR

geninstall -u [-e LogFile] [ -E | -T ] [ -t ResponseFileLocation ] [ -D ] {-f File | Uninstall_List...}

OR

geninstall -L -d Media [-e LogFile] [ -D ]

Description
Accepts all current installp flags and passes them on to installp. Some flags (for example, -L) are overloaded 
to mean list all products on the media. Flags that don't make sense for ISMP packaged products are ignored. 
This allows programs (like NIM) to continue to always send in installp flags to geninstall, but only the flags 
that make sense are used.

The geninstall command provides an easy way to see what modifications have been made to the configuration files 
listed in /etc/check_config.files. When these files have been changed during a geninstall installation or update 
operation, the differences between the old and new files will be recorded in the /var/adm/ras/config.diff. 
If /etc/check_config.files requests that the old file be saved, the old file can be found in the /var/adm/config 
directory.

The /etc/check_config.files file can be edited and can be used to specify whether old configuration files 
that have been changed should be saved (indicated by s) or deleted (indicated by d), and has the following format: 

d /etc/inittab

A summary of the geninstall command's install activity is kept at /var/adm/sw/geninstall.summary. 
This file contains colon-separated lists of filesets installed by installp and components installed 
by ISMP. This is used mainly to provide summary information for silent installs.

Note:
Refer to the README.ISMP file in the /usr/lpp/bos directory to learn more about ISMP-packaged installations 
and using response files.
 
Examples:

- To install all the products on a CD media that is in drive cd0, type:

# geninstall -d /dev/cd0 all

If ISMP images are present on the media, a graphical interface is presented. Any installp or RPM images 
are installed without prompting, unless the installp images are spread out over multiple CDs.


- If you using the geninstall command to install RPM or ISMP packages, use the prefix type to designate 
to the geninstall command the type of package you are installing. In AIX 5L, the package prefix types 
are the following:

I: installp format 
R: RPM format 
J: ISMP format 

For example, to install the cdrecord RPM package and the bos.games installp package, type the following:

# geninstall -d/dev/cd0 R:cdrecord I:bos.games

The geninstall command detects that the cdrecord package is an RPM package type and runs the rpm command 
to install cdrecord. The geninstall command then detects that bos.games is an installp package type and runs 
the installp command to install bos.games. The process for uninstallation is similar to the installation process.


Fixdist:
--------

There is a tool named fixdist you can use to download fixes from IBM.


Maintenance levels:
===================

Notes:

Note 1:
-------

Current versions of AIX5L are 5200-04, 05, 06, 07  

04: V5.2 with the 5200-04 Recommended Maintenance Package APAR IY56722
plus APAR IY60347 � 

05: V5.2 with the 5200-05 Recommended Maintenance Package   


Note 2: Go from 5200-00 to 5200-05:
-----------------------------------

Use this package to update to 5200-05 (ML 05) an AIX 5.2.0 system whose current ML is 5200-00 (i.e. base level) or higher.
(Nota: ML 05 notably brings the fileset bos.mp.5.2.0.54) 

AIX 5200-05 maintenance package:

AIX 5200-05 maintenance package 
Recommended maintenance for AIX 5.2.0 

This package, 5200-05, updates AIX 5.2 from base level (no maintenance level) to maintenance level  05 (5200-05). 
This package is a recommended maintenance package for AIX 5.2. IBM recommends that customers install the latest 
available maintenace package for their AIX release.  
 
To determine if AIX 5200-05 is already installed on your system, run the following command:
oslevel -r  
  
General description 

This package contains code corrections for the AIX operating system and many related subsystems. 
Unless otherwise stated, this package is released 
for all languages. For additional information, refer to the Package information   
 
Download and install instructions 
 
Package                   Released Size (Bytes) Checksum 
 520005.tar.gz (See Note) 01/20/05 750,314,420  2116147779 

Additional space needed to extract the filesets 1,034,141,696 
  
Note: IBM recommends that you create a separate file system for /usr/sys/inst.images to prevent the expansion 
of the /usr file system. 
More information

Click on the package name above. 
Put the package (a tar.gz file) in /usr/sys/inst.images 
Extract the filesets from the package. 
cd /usr/sys/inst.images 
gzip -d -c 520005.tar.gz | tar -xvf - 
Back up your system. 
Install the package by creating a table of contents for install to use. 
Then update the install subsystem itself. Run SMIT to complete the installation. 

# inutoc /usr/sys/inst.images 
# installp -acgXd /usr/sys/inst.images bos.rte.install 
# smit update_all 
Reboot your system. This maintenance package replaces critical operating system code. 
 
  
Installation Tips

 * You will need to be logged in as 'root' to perform the
   installation of this package.

 * Creating a system backup is recommended before starting the
   installation procedure. Refer to the mksysb command in the
   AIX 5.2 Commands Reference manual for additional information.

 * The latest AIX 5.2 installation hints and tips are available
   from the eServer Subscription Services web site at:

   https://techsupport.services.ibm.com/server/pseries.subscriptionSvcs


   These tips contain important information that should be
   reviewed before installing this update.


Installation

 To install selected updates from this package, use the command:

   smit update_by_fix

 To install all updates from this package that apply to installed
 filesets on your system, use the command:

   smit update_all

 It is highly recommended that you apply all updates from this
 package.

 After successful installation, a system reboot is required for
 this update to take effect.
 

Note 2: Go from 5200-04 to 5200-05:
-----------------------------------

AIX 5200(04)-05 maintenance package 
Recommended maintenance for AIX 5.2.0 

This package, 5200(04)-05, updates AIX 5.2 from maintenance level 04 (5200-04) to maintenance level 05 (5200-05). 
This package is a recommended maintenance package for AIX 5.2. IBM recommends that customers install the latest available 
maintenace package for their AIX release.  
 
To determine if AIX 5200-05 is already installed on your system, run the following command:
oslevel -r  
  
General description 
 
This package contains code corrections for the AIX operating system and many related subsystems. Unless otherwise stated, 
this package is released for all languages. For additional information, refer to the Package information  
  
Download and install instructions 
 
Package                   Released Size (Bytes) Checksum 
 520405.tar.gz (See Note) 01/20/05 637,751,943  3712904912 
Additional space needed to extract the filesets 856,494,080 
 
 
Note: IBM recommends that you create a separate file system for /usr/sys/inst.images to prevent 
the expansion of the /usr file system. 
 More information

Click on the package name above. 
Put the package (a tar.gz file) in /usr/sys/inst.images 
Extract the filesets from the package. 
cd /usr/sys/inst.images 
gzip -d -c 520405.tar.gz | tar -xvf - 
Back up your system. 
Install the package by creating a table of contents for install to use. Then update the install subsystem itself. 
Run SMIT to complete the installation. 

# inutoc /usr/sys/inst.images 
# installp -acgXd /usr/sys/inst.images bos.rte.install 
# smit update_all 

Reboot your system. This maintenance package replaces critical operating system code. 


Note 3: Go from 5200-05 to 5200-07:
-----------------------------------

Always run the inutoc command to ensure the installation subsystem will recognize the new fix packages 
you download. This command creates a new .toc file for the fix package. Run the inutoc command in 
the same directory where you downloaded the package filesets. For example, if you downloaded the 
filesets to /usr/sys/inst.images, run the following command: 

# inutoc /usr/sys/inst.images 

- For selected updates

To install selected updates from this package, use the following command: 

# smit update_by_fix 


- For all updates

To install all updates from this package that apply to the installed filesets on your system, 
use the following command: 

# smit update_all 

It is highly recommended that you apply all updates from this package. 

Reboot the system. A reboot is required for this update to take effect. 


--

First do the bos.rte.install

# installp -acgYqXd /software/ML07 bos.rte.install

# inutoc /software/ML07

# smitty update_all


Note 4: About the /usr/sys/inst.images fs:
------------------------------------------

Create a LV

# crfs -v jfs -a bf=true -dXXX##instlv -m/usr/sys/inst.images -Ayes -prw -tno -a nbpi=4096 -a ag=64 

# mount /usr/sys/inst.images 


Note 5: About the inutoc command:
---------------------------------

inutoc Command

Purpose
Creates a .toc file for directories that have backup format file install images. 
This command is used by the installp command and the install scripts.

Syntax
inutoc [ Directory ]

Description
The inutoc command creates the .toc file in Directory. If a .toc file already exists, it is recreated with new information. 
The default installation image Directory is /usr/sys/inst.images. The inutoc command adds table of contents entries 
in the .toc file for every installation image in Directory.

The installp command and the bffcreate command call this command automatically upon the creation or use 
of an installation image in a directory without a .toc file.

Examples
To create the .toc file for the /usr/sys/inst.images directory, enter: 
# inutoc

To create a .toc file for the /tmp/images directory, enter: 
# inutoc /tmp/images


Note 6: About the bffcreate command:
------------------------------------

bffcreate Command
Purpose
Creates installation image files in backup format. 

Syntax
bffcreate [ -q ] [ -S ] [ -U ] [ -v ] [ -X ] [ -d Device ] [ -t SaveDir ] [ -w Directory ] 
          [ -M Platform ] { [ -l | -L ] | -c [ -s LogFile ] | Package [Level ] ... | -f ListFile | all }

Description
The bffcreate command creates an installation image file in backup file format (bff) to support 
software installation operations.

The bffcreate command creates an installation image file from an installation image file 
on the specified installation media. Also, it automatically creates an installation image file from 
hyptertext images (such as those on the operating system documentation CD-ROMs). The installp command 
can use the newly created installation file to install software onto the system. The file is created 
in backup format and saved to the directory specified by SaveDir. The .toc file in the directory 
specified by the SaveDir parameter is updated to include an entry for the image file.

The bffcreate command determines the bff name according to this information:

Neutral Packages         package.v.r.m.f.platform.installtype 
POWER-based platform     Packages package.v.r.m.f.installtype 

Image Type                                             Target bff Name 
Installation image for the POWER-based platform        package.v.r.m.f.I 
Installation image for Neutral                         package.v.r.m.f.N.I 
3.1 update for the POWER-based platform                package.v.r.m.f.service# 
3.2 update for the POWER-based platform                package.v.r.m.f.ptf 
4.X** or later updates for the POWER-based platform    package.part.v.r.m.f.U 
Update image for Neutral                               package.v.r.m.f.N.U 
** 4.X or later updates contain one package only. In addition, AIX Version 4 and later updates do not contain ptf IDs.
 

package = the name of the software package as described by the PackageName parameter

v.r.m.f = version.release.modification.fix, the level associated with the software package. 
The PackageName is usually not the same as the fileset name.

ptf = program temporary fix ID (also known as FixID)

The installation image file name has the form Package.Level.I. The Package is the name of the software package, 
as described for the Package Name parameter. Level has the format of v.r.m.f, where v = version, r = release, 
m = modification, f = fix. The I extension means that the image is an installation image rather than an update image.

Update image files containing an AIX 3.1 formatted update have a service number extension following the level. 
The Servicenum parameter can be up to 4 digits in length. One example is xlccmp.3.1.5.0.1234.

Update image files containing an AIX 3.2 formatted update have a ptf extension following the level. 
One example is bosnet.3.2.0.0.U412345.

AIX Version 4 and later update image file names begin with the fileset name, not the PackageName. 
They also have U extensions to indicate that they are indeed update image files, not installation images. 
One example of an update image file is bos.rte.install.4.3.2.0.U.

The all keyword indicates that installation image files are created for every installable software package on the device.

You can extract a single update image with the AIX Version 4 and later bffcreate command. 
Then you must specify the fileset name and the v.r.m.f. parameter. As in example 3 in the Examples section, 
the PackageName parameter must be the entire fileset name, bos.net.tcp.client, not just bos.net.

Attention: Be careful when selecting the target directory for the extracted images, especially if 
that directory already contains installable images. If a fileset at a particular level exists as both 
an installation image and as an update image in the same directory, unexpected installation results can occur. 
In cases like this, installp selects the image it finds first in the table of contents (.toc) file. 
The image it selects may not be the one you intended and unexpected requisite failures can result. 
As a rule of thumb, you should extract maintenance levels to clean directories.


Examples
To create an installation image file from the bos.net software package on the tape in the /dev/rmt0 tape drive 
and use /var/tmp as the working directory, type: 
# bffcreate  -d /dev/rmt0.1 -w /var/tmp bos.net

To create an installation image file from the package software package on the diskette in the /dev/rfd0 
diskette drive and print the name of the installation image file without being prompted, type: 
# bffcreate  -q  -v package

To create a single update image file from the bos.net.tcp.client software package on the CD in /dev/cd0, type: 
# bffcreate  -d /dev/cd0 bos.net.tcp.client 4.2.2.1

To list the packages on the CD in /dev/cd0, type: 
# bffcreate  -l -d /dev/cd0

To create installation and/or update images from a CD in /dev/cd0 by specifying a list of PackageNames 
and Levels in a ListFile called my MyListFile, type: 
# bffcreate  -d /dev/cd0 -f MyListFile

To create installation or update images of all software packages on the CD-ROM media for the current platform, type: 
# bffcreate -d /dev/cd0 all

To list fileset information for the bos.games software package from a particular device, type: 
# bffcreate -d /usr/sys/inst.images/bos.games -l

To list all the Neutral software packages on the CD-ROM media, type: 
# bffcreate -d /dev/cd0 -MN -l


38.3 Software Packages on Linux:
================================


38.3.1 RPM packages on Linux (1):
---------------------------------

Note 1:
-------

First we show a few simple examples:

- Examples getting software info from your system:

# rpm -q kernel
kernel-2.4.7-10
 
# rpm -q glibc
glibc-2.2.4-19.3
 
# rpm -q gcc
gcc-2.96-98

Show everything:

# rpm -qa

- Examples installing rpm packages:

# rpm -Uvh libpng-1.2.2-22.i386.rpm

# rpm -Uvh gnome-libs-1.4.1.2.90-40.i386.rpm

# rpm -Uvh oracleasm-support-2.0.0-1.i386.rpm \
    oracleasm-lib-2.0.0-1.i386.rpm \
    oracleasm-2.6.9-5.0.5-ELsmp-2.0.0-1.i686.rpm

# rpm -Uvh /mnt/cdrom/RedHat/RPMS/tripwire*.rpm

Note: 
the U switch really means starting an Upgrade, but if nothing is there, an installation will take place.


Note 2:
-------

What is RPM?

RPM is the RPM Package Manager. It is an open packaging system available for anyone to use. 
It allows users to take source code for new software and package it into source and binary form 
such that binaries can be easily installed and tracked and source can be rebuilt easily. 
It also maintains a database of all packages and their files that can be used for verifying packages 
and querying for information about files and/or packages. 

Red Hat, Inc. encourages other distribution vendors to take the time to look at RPM and use it 
for their own distributions. RPM is quite flexible and easy to use, though it provides the base 
for a very extensive system. It is also completely open and available, though we would appreciate 
bug reports and fixes. Permission is granted to use and distribute RPM royalty free under the GPL. 

More complete documentation is available on RPM in the book by Ed Bailey, Maximum RPM. That book is 
available for download or purchase at www.redhat.com. 

RPM is a core component of many Linux distributions, such as Red Hat Enterprise Linux, the Fedora Project, 
SUSE Linux Enterprise, openSUSE, CentOS, Mandriva Linux, and many others. 
It is also used on many other operating systems as well, and the RPM format is part of the Linux Standard Base. 


Acquiring RPM
The best way to get RPM is to install Red Hat Linux. If you don't want to do that, you can still get 
and use RPM. It can be acquired from ftp.redhat.com. 

RPM Requirements
RPM itself should build on basically any Unix-like system. It has been built and used on Tru64 Unix, 
AIX, Solaris, SunOS, and basically all flavors of Linux. 

To build RPMs from source, you also need everything normally required to build a package, like gcc, make, etc. 


In its simplest form, RPM can be used to install packages: 

# rpm -i foobar-1.0-1.i386.rpm
    
The next simplest command is to uninstall a package: 

# rpm -e foobar
    
One of the more complex but highly useful commands allows you to install packages via FTP. 
If you are connected to the net and want to install a new package, all you need to do is specify 
the file with a valid URL, like so: 

# rpm -i ftp://ftp.redhat.com/pub/redhat/rh-2.0-beta/RPMS/foobar-1.0-1.i386.rpm
 

Please note, that RPM will now query and/or install via FTP. 

While these are simple commands, rpm can be used in a multitude of ways. To see which options are available 
in your version of RPM, type: 

# rpm --help

You can find more details on what those options do in the RPM man page, found by typing: 

# man rpm

RPM is a very useful tool and, as you can see, has several options. The best way to make sense of them 
is to look at some examples. I covered simple install/uninstall above, so here are some more examples: 

Let's say you delete some files by accident, but you aren't sure what you deleted. If you want to verify 
your entire system and see what might be missing, you would do: 

# rpm -Va 

Let's say you run across a file that you don't recognize. To find out which package owns it, you would do: 

# rpm -qf /usr/X11R6/bin/xjewel
	
The output would be sometime like: 

xjewel-1.6-1
	
You find a new koules RPM, but you don't know what it is. To find out some information on it, do: 

# rpm -qpi koules-1.2-2.i386.rpm

The output would be: 

Name        : koules                      Distribution: Red Hat Linux Colgate
Version     : 1.2                               Vendor: Red Hat Software
Release     : 2                             Build Date: Mon Sep 02 11:59:12 1996
Install date: (none)                        Build Host: porky.redhat.com
Group       : Games                         Source RPM: koules-1.2-2.src.rpm
Size        : 614939
Summary     : SVGAlib action game with multiplayer, network, and sound support
Description :

This arcade-style game is novel in conception and excellent in execution.
No shooting, no blood, no guts, no gore.  The play is simple, but you
still must develop skill to play.  This version uses SVGAlib to
run on a graphics console.
	
Now you want to see what files the koules RPM installs. You would do: 

# rpm -qpl koules-1.2-2.i386.rpm

The output is: 

/usr/doc/koules
/usr/doc/koules/ANNOUNCE
/usr/doc/koules/BUGS
/usr/doc/koules/COMPILE.OS2
/usr/doc/koules/COPYING
/usr/doc/koules/Card
/usr/doc/koules/ChangeLog
/usr/doc/koules/INSTALLATION
/usr/doc/koules/Icon.xpm
/usr/doc/koules/Icon2.xpm
/usr/doc/koules/Koules.FAQ
/usr/doc/koules/Koules.xpm
/usr/doc/koules/README
/usr/doc/koules/TODO
/usr/games/koules
/usr/games/koules.svga
/usr/games/koules.tcl
/usr/man/man6/koules.svga.6
	
 
SYNOPSIS
QUERYING AND VERIFYING PACKAGES:

rpm {-q|--query} [select-options] [query-options] 
rpm {-V|--verify} [select-options] [verify-options] 
rpm --import PUBKEY ... 
rpm {-K|--checksig} [--nosignature] [--nodigest] 
PACKAGE_FILE ... 


INSTALLING, UPGRADING, AND REMOVING PACKAGES:
rpm {-i|--install} [install-options] PACKAGE_FILE ... 
rpm {-U|--upgrade} [install-options] PACKAGE_FILE ... 
rpm {-F|--freshen} [install-options] PACKAGE_FILE ... 
rpm {-e|--erase} [--allmatches] [--nodeps] [--noscripts] 
[--notriggers] [--repackage] [--test] PACKAGE_NAME ... 


MISCELLANEOUS:
rpm {--initdb|--rebuilddb} 
rpm {--addsign|--resign} PACKAGE_FILE ... 
rpm {--querytags|--showrc} 
rpm {--setperms|--setugids} PACKAGE_NAME ... 


Note 3:
-------

NAME
rpm - RPM Package Manager 
SYNOPSIS
QUERYING AND VERIFYING PACKAGES:


rpm {-q|--query} [select-options] [query-options] 
rpm {-V|--verify} [select-options] [verify-options] 
rpm --import PUBKEY ... 
rpm {-K|--checksig} [--nosignature] [--nodigest] 
PACKAGE_FILE ... 


INSTALLING, UPGRADING, AND REMOVING PACKAGES:
rpm {-i|--install} [install-options] PACKAGE_FILE ... 
rpm {-U|--upgrade} [install-options] PACKAGE_FILE ... 
rpm {-F|--freshen} [install-options] PACKAGE_FILE ... 
rpm {-e|--erase} [--allmatches] [--nodeps] [--noscripts] 
[--notriggers] [--repackage] [--test] PACKAGE_NAME ... 


MISCELLANEOUS:
rpm {--initdb|--rebuilddb} 
rpm {--addsign|--resign} PACKAGE_FILE ... 
rpm {--querytags|--showrc} 
rpm {--setperms|--setugids} PACKAGE_NAME ... 

select-options

[PACKAGE_NAME] [-a,--all] [-f,--file FILE] 
[-g,--group GROUP] {-p,--package PACKAGE_FILE] 
[--fileid MD5] [--hdrid SHA1] [--pkgid MD5] [--tid TID] 
[--querybynumber HDRNUM] [--triggeredby PACKAGE_NAME] 
[--whatprovides CAPABILITY] [--whatrequires CAPABILITY] 


query-options

[--changelog] [-c,--configfiles] [-d,--docfiles] [--dump] 
[--filesbypkg] [-i,--info] [--last] [-l,--list] 
[--provides] [--qf,--queryformat QUERYFMT] 
[-R,--requires] [--scripts] [-s,--state] 
[--triggers,--triggerscripts] 


verify-options

[--nodeps] [--nofiles] [--noscripts] 
[--nodigest] [--nosignature] 
[--nolinkto] [--nomd5] [--nosize] [--nouser] 
[--nogroup] [--nomtime] [--nomode] [--nordev] 


install-options

[--aid] [--allfiles] [--badreloc] [--excludepath OLDPATH] 
[--excludedocs] [--force] [-h,--hash] 
[--ignoresize] [--ignorearch] [--ignoreos] 
[--includedocs] [--justdb] [--nodeps] 
[--nodigest] [--nosignature] [--nosuggest] 
[--noorder] [--noscripts] [--notriggers] 
[--oldpackage] [--percent] [--prefix NEWPATH] 
[--relocate OLDPATH=NEWPATH] 
[--repackage] [--replacefiles] [--replacepkgs] 
[--test] 


DESCRIPTION
rpm is a powerful Package Manager, which can be used to build, install, query, verify, update, and erase 
individual software packages. A package consists of an archive of files and meta-data used to install 
and erase the archive files. The meta-data includes helper scripts, file attributes, and descriptive 
information about the package. Packages come in two varieties: binary packages, used to encapsulate 
software to be installed, and source packages, containing the source code and recipe necessary 
to produce binary packages. 

One of the following basic modes must be selected: Query, Verify, Signature Check, Install/Upgrade/Freshen, 
Uninstall, Initialize Database, Rebuild Database, Resign, Add Signature, Set Owners/Groups, Show Querytags, 
and Show Configuration. 

GENERAL OPTIONS
These options can be used in all the different modes. 

-?, --help
Print a longer usage message then normal. 
--version
Print a single line containing the version number of rpm being used. 
--quiet
Print as little as possible - normally only error messages will be displayed. 
-v
Print verbose information - normally routine progress messages will be displayed. 
-vv
Print lots of ugly debugging information. 
--rcfile FILELIST
Each of the files in the colon separated FILELIST is read sequentially by rpm for configuration information. 
Only the first file in the list must exist, and tildes will be expanded to the value of $HOME. 
The default FILELIST is /usr/lib/rpm/rpmrc:/usr/lib/rpm/redhat/rpmrc:~/.rpmrc. 
--pipe CMD
Pipes the output of rpm to the command CMD. 
--dbpath DIRECTORY
Use the database in DIRECTORY rathen than the default path /var/lib/rpm 
--root DIRECTORY
Use the file system tree rooted at DIRECTORY for all operations. Note that this means the database within 
DIRECTORY will be used for dependency checks and any scriptlet(s) (e.g. %post if installing, or %prep if building, 
a package) will be run after a chroot(2) to DIRECTORY. 

INSTALL AND UPGRADE OPTIONS
The general form of an rpm install command is 


rpm {-i|--install} [install-options] PACKAGE_FILE ... 


This installs a new package. 

The general form of an rpm upgrade command is 

rpm {-U|--upgrade} [install-options] PACKAGE_FILE ... 

This upgrades or installs the package currently installed to a newer version. This is the same as install, 
except all other version(s) of the package are removed after the new package is installed. 


rpm {-F|--freshen} [install-options] PACKAGE_FILE ... 


This will upgrade packages, but only if an earlier version currently exists. The PACKAGE_FILE may be specified 
as an ftp or http URL, in which case the package will be downloaded before being installed. See FTP/HTTP OPTIONS 
for information on rpm's internal ftp and http client support. 


--aid
Add suggested packages to the transaction set when needed. 
--allfiles
Installs or upgrades all the missingok files in the package, regardless if they exist. 
--badreloc
Used with --relocate, permit relocations on all file paths, not just those OLDPATH's included in the binary package relocation hint(s). 
--excludepath OLDPATH
Don't install files whose name begins with OLDPATH. 
--excludedocs
Don't install any files which are marked as documentation (which includes man pages and texinfo documents). 
--force
Same as using --replacepkgs, --replacefiles, and --oldpackage. 
-h, --hash
Print 50 hash marks as the package archive is unpacked. Use with -v|--verbose for a nicer display. 
--ignoresize
Don't check mount file systems for sufficient disk space before installing this package. 
--ignorearch
Allow installation or upgrading even if the architectures of the binary package and host don't match. 
--ignoreos
Allow installation or upgrading even if the operating systems of the binary package and host don't match. 
--includedocs
Install documentation files. This is the default behavior. 
--justdb
Update only the database, not the filesystem. 
--nodigest
Don't verify package or header digests when reading. 
--nosignature
Don't verify package or header signatures when reading. 
--nodeps
Don't do a dependency check before installing or upgrading a package. 
--nosuggest
Don't suggest package(s) that provide a missing dependency. 
--noorder
Don't reorder the packages for an install. The list of packages would normally be reordered to satisfy dependancies. 
--noscripts
--nopre
--nopost
--nopreun
--nopostun
Don't execute the scriptlet of the same name. The --noscripts option is equivalent to 
--nopre --nopost --nopreun --nopostun 

and turns off the execution of the corresponding %pre, %post, %preun, and %postun scriptlet(s). 

--notriggers
--notriggerin
--notriggerun
--notriggerpostun
Don't execute any trigger scriptlet of the named type. The --notriggers option is equivalent to 
--notriggerin --notriggerun --notriggerpostun 

and turns off execution of the corresponding %triggerin, %triggerun, and %triggerpostun scriptlet(s). 

--oldpackage
Allow an upgrade to replace a newer package with an older one. 
--percent
Print percentages as files are unpacked from the package archive. This is intended to make rpm easy to run from other tools. 
--prefix NEWPATH
For relocateable binary packages, translate all file paths that start with the installation prefix in the package relocation hint(s) to NEWPATH. 
--relocate OLDPATH=NEWPATH
For relocatable binary packages, translate all file paths that start with OLDPATH in the package relocation hint(s) to NEWPATH. This option can be used repeatedly if several OLDPATH's in the package are to be relocated. 
--repackage
Re-package the files before erasing. The previously installed package will be named according to the macro %_repackage_name_fmt and will be created in the directory named by the macro %_repackage_dir (default value is /var/tmp). 
--replacefiles
Install the packages even if they replace files from other, already installed, packages. 
--replacepkgs
Install the packages even if some of them are already installed on this system. 
--test
Do not install the package, simply check for and report potential conflicts. 
ERASE OPTIONS
The general form of an rpm erase command is 


rpm {-e|--erase} [--allmatches] [--nodeps] [--noscripts] [--notriggers] [--repackage] [--test] PACKAGE_NAME ... 


The following options may also be used: 

--allmatches
Remove all versions of the package which match PACKAGE_NAME. Normally an error is issued if PACKAGE_NAME matches multiple packages. 
--nodeps
Don't check dependencies before uninstalling the packages. 
--noscripts
--nopreun
--nopostun
Don't execute the scriptlet of the same name. The --noscripts option during package erase is equivalent to 
--nopreun --nopostun 

and turns off the execution of the corresponding %preun, and %postun scriptlet(s). 

--notriggers
--notriggerun
--notriggerpostun
Don't execute any trigger scriptlet of the named type. The --notriggers option is equivalent to 
--notriggerun --notriggerpostun 

and turns off execution of the corresponding %triggerun, and %triggerpostun scriptlet(s). 

--repackage
Re-package the files before erasing. The previously installed package will be named according to the macro %_repackage_name_fmt and will be created in the directory named by the macro %_repackage_dir (default value is /var/tmp). 
--test
Don't really uninstall anything, just go through the motions. Useful in conjunction with the -vv option for debugging. 
QUERY OPTIONS
The general form of an rpm query command is 


rpm {-q|--query} [select-options] [query-options] 


You may specify the format that package information should be printed in. To do this, you use the 


--qf|--queryformat QUERYFMT 

option, followed by the QUERYFMT format string. Query formats are modifed versions of the standard printf(3) formatting. The format is made up of static strings (which may include standard C character escapes for newlines, tabs, and other special characters) and printf(3) type formatters. As rpm already knows the type to print, the type specifier must be omitted however, and replaced by the name of the header tag to be printed, enclosed by {} characters. Tag names are case insesitive, and the leading RPMTAG_ portion of the tag name may be omitted as well. 

Alternate output formats may be requested by following the tag with :typetag. Currently, the following types are supported: 

:armor

Wrap a public key in ASCII armor. 
:base64
Encode binary data using base64. 
:date
Use strftime(3) "%c" format. 
:day
Use strftime(3) "%a %b %d %Y" format. 
:depflags
Format dependency flags. 
:fflags
Format file flags. 
:hex
Format in hexadecimal. 
:octal
Format in octal. 
:perms
Format file permissions. 
:shescape
Escape single quotes for use in a script. 
:triggertype
Display trigger suffix. 
For example, to print only the names of the packages queried, you could use %{NAME} as the format string. To print the packages name and distribution information in two columns, you could use %-30{NAME}%{DISTRIBUTION}. rpm will print a list of all of the tags it knows about when it is invoked with the --querytags argument. 

There are two subsets of options for querying: package selection, and information selection. 

PACKAGE SELECTION OPTIONS:

PACKAGE_NAME
Query installed package named PACKAGE_NAME. 
-a, --all
Query all installed packages. 
-f, --file FILE
Query package owning FILE. 
--fileid MD5
Query package that contains a given file identifier, i.e. the MD5 digest of the file contents. 
-g, --group GROUP
Query packages with the group of GROUP. 
--hdrid SHA1
Query package that contains a given header identifier, i.e. the SHA1 digest of the immutable header region. 
-p, --package PACKAGE_FILE
Query an (uninstalled) package PACKAGE_FILE. The PACKAGE_FILE may be specified as an ftp or http style URL, in which case the package header will be downloaded and queried. See FTP/HTTP OPTIONS for information on rpm's internal ftp and http client support. The PACKAGE_FILE argument(s), if not a binary package, will be interpreted as an ASCII package manifest. Comments are permitted, starting with a '#', and each line of a package manifest file may include white space seperated glob expressions, including URL's with remote glob expressions, that will be expanded to paths that are substituted in place of the package manifest as additional PACKAGE_FILE arguments to the query. 
--pkgid MD5
Query package that contains a given package identifier, i.e. the MD5 digest of the combined header and payload contents. 
--querybynumber HDRNUM
Query the HDRNUMth database entry directly; this is useful only for debugging. 
--specfile SPECFILE
Parse and query SPECFILE as if it were a package. Although not all the information (e.g. file lists) is available, this type of query permits rpm to be used to extract information from spec files without having to write a specfile parser. 
--tid TID
Query package(s) that have a given TID transaction identifier. A unix time stamp is currently used as a transaction identifier. All package(s) installed or erased within a single transaction have a common identifier. 
--triggeredby PACKAGE_NAME
Query packages that are triggered by package(s) PACKAGE_NAME. 
--whatprovides CAPABILITY
Query all packages that provide the CAPABILITY capability. 
--whatrequires CAPABILITY
Query all packages that requires CAPABILITY for proper functioning. 
PACKAGE QUERY OPTIONS:

--changelog
Display change information for the package. 
-c, --configfiles
List only configuration files (implies -l). 
-d, --docfiles
List only documentation files (implies -l). 
--dump
Dump file information as follows: 


path size mtime md5sum mode owner group isconfig isdoc rdev symlink
        

This option must be used with at least one of -l, -c, -d. 

--filesbypkg
List all the files in each selected package. 
-i, --info
Display package information, including name, version, and description. This uses the --queryformat if one was specified. 
--last
Orders the package listing by install time such that the latest packages are at the top. 
-l, --list
List files in package. 
--provides
List capabilities this package provides. 
-R, --requires
List packages on which this package depends. 
--scripts
List the package specific scriptlet(s) that are used as part of the installation and uninstallation processes. 
-s, --state
Display the states of files in the package (implies -l). The state of each file is one of normal, not installed, or replaced. 
--triggers, --triggerscripts
Display the trigger scripts, if any, which are contained in the package. 
VERIFY OPTIONS
The general form of an rpm verify command is 


rpm {-V|--verify} [select-options] [verify-options] 


Verifying a package compares information about the installed files in the package with information about the files taken from the package metadata stored in the rpm database. Among other things, verifying compares the size, MD5 sum, permissions, type, owner and group of each file. Any discrepencies are displayed. Files that were not installed from the package, for example, documentation files excluded on installation using the "--excludedocs" option, will be silently ignored. 

The package selection options are the same as for package querying (including package manifest files as arguments). Other options unique to verify mode are: 

--nodeps
Don't verify dependencies of packages. 
--nodigest
Don't verify package or header digests when reading. 
--nofiles
Don't verify any attributes of package files. 
--noscripts
Don't execute the %verifyscript scriptlet (if any). 
--nosignature
Don't verify package or header signatures when reading. 
--nolinkto
--nomd5
--nosize
--nouser
--nogroup
--nomtime
--nomode
--nordev
Don't verify the corresponding file attribute. 
The format of the output is a string of 8 characters, a possible attribute marker: 


c %config configuration file.
d %doc documentation file.
g %ghost file (i.e. the file contents are not included in the package payload).
l %license license file.
r %readme readme file.

from the package header, followed by the file name. Each of the 8 characters denotes the result of a comparison of attribute(s) of the file to the value of those attribute(s) recorded in the database. A single "." (period) means the test passed, while a single "?" (question mark) indicates the test could not be performed (e.g. file permissions prevent reading). Otherwise, the (mnemonically emBoldened) character denotes failure of the corresponding --verify test: 


S file Size differs
M Mode differs (includes permissions and file type)
5 MD5 sum differs
D Device major/minor number mis-match
L readLink(2) path mis-match
U User ownership differs
G Group ownership differs
T mTime differs


DIGITAL SIGNATURE AND DIGEST VERIFICATION
The general forms of rpm digital signature commands are 


rpm --import PUBKEY ... 


rpm {--checksig} [--nosignature] [--nodigest] 
PACKAGE_FILE ... 


The --checksig option checks all the digests and signatures contained in PACKAGE_FILE to ensure the integrity and origin of the package. Note that signatures are now verified whenever a package is read, and --checksig is useful to verify all of the digests and signatures associated with a package. 

Digital signatures cannot be verified without a public key. An ascii armored public key can be added to the rpm database using --import. An imported public key is carried in a header, and key ring management is performed exactly like package management. For example, all currently imported public keys can be displayed by: 

rpm -qa gpg-pubkey* 

Details about a specific public key, when imported, can be displayed by querying. Here's information about the Red Hat GPG/DSA key: 

rpm -qi gpg-pubkey-db42a60e 

Finally, public keys can be erased after importing just like packages. Here's how to remove the Red Hat GPG/DSA key 

rpm -e gpg-pubkey-db42a60e 

SIGNING A PACKAGE

rpm --addsign|--resign PACKAGE_FILE ... 


Both of the --addsign and --resign options generate and insert new signatures for each package PACKAGE_FILE given, replacing any existing signatures. There are two options for historical reasons, there is no difference in behavior currently. 

USING GPG TO SIGN PACKAGES
In order to sign packages using GPG, rpm must be configured to run GPG and be able to find a key ring with the appropriate keys. By default, rpm uses the same conventions as GPG to find key rings, namely the $GNUPGHOME environment variable. If your key rings are not located where GPG expects them to be, you will need to configure the macro %_gpg_path to be the location of the GPG key rings to use. 

For compatibility with older versions of GPG, PGP, and rpm, only V3 OpenPGP signature packets should be configured. Either DSA or RSA verification algorithms can be used, but DSA is preferred. 

If you want to be able to sign packages you create yourself, you also need to create your own public and secret key pair (see the GPG manual). You will also need to configure the rpm macros 

%_signature
The signature type. Right now only gpg and pgp are supported. 
%_gpg_name
The name of the "user" whose key you wish to use to sign your packages. 
For example, to be able to use GPG to sign packages as the user "John Doe <jdoe@foo.com>" from the key rings located in /etc/rpm/.gpg using the executable /usr/bin/gpg you would include 


%_signature gpg
%_gpg_path /etc/rpm/.gpg
%_gpg_name John Doe <jdoe@foo.com>
%_gpgbin /usr/bin/gpg

in a macro configuration file. Use /etc/rpm/macros for per-system configuration and ~/.rpmmacros for per-user configuration. 

REBUILD DATABASE OPTIONS
The general form of an rpm rebuild database command is 


rpm {--initdb|--rebuilddb} [-v] [--dbpath DIRECTORY] [--root DIRECTORY] 


Use --initdb to create a new database, use --rebuilddb to rebuild the database indices from the installed package headers. 

SHOWRC
The command 

rpm --showrc 

shows the values rpm will use for all of the options are currently set in rpmrc and macros configuration file(s). 

FTP/HTTP OPTIONS
rpm can act as an FTP and/or HTTP client so that packages can be queried or installed from the internet. 
Package files for install, upgrade, and query operations may be specified as an ftp or http style URL: 

ftp://USER:PASSWORD@HOST:PORT/path/to/package.rpm 

If the :PASSWORD portion is omitted, the password will be prompted for (once per user/hostname pair). 
If both the user and password are omitted, anonymous ftp is used. In all cases, passive (PASV) ftp transfers 
are performed. 

rpm allows the following options to be used with ftp URLs: 

--ftpproxy HOST
The host HOST will be used as a proxy server for all ftp transfers, which allows users to ftp through firewall machines which use proxy systems. This option may also be specified by configuring the macro %_ftpproxy. 
--ftpport HOST
The TCP PORT number to use for the ftp connection on the proxy ftp server instead of the default port. This option may also be specified by configuring the macro %_ftpport. 
rpm allows the following options to be used with http URLs: 

--httpproxy HOST
The host HOST will be used as a proxy server for all http transfers. This option may also be specified by configuring the macro %_httpproxy. 
--httpport PORT
The TCP PORT number to use for the http connection on the proxy http server instead of the default port. This option may also be specified by configuring the macro %_httpport. 
LEGACY ISSUES
Executing rpmbuild
The build modes of rpm are now resident in the /usr/bin/rpmbuild executable. Although legacy compatibility provided by the popt aliases below has been adequate, the compatibility is not perfect; hence build mode compatibility through popt aliases is being removed from rpm. Install the rpmbuild package, and see rpmbuild(8) for documentation of all the rpm build modes previously documented here in rpm(8). 

Add the following lines to /etc/popt if you wish to continue invoking rpmbuild from the rpm command line: 


rpm     exec --bp               rpmb -bp
rpm     exec --bc               rpmb -bc
rpm     exec --bi               rpmb -bi
rpm     exec --bl               rpmb -bl
rpm     exec --ba               rpmb -ba
rpm     exec --bb               rpmb -bb
rpm     exec --bs               rpmb -bs 
rpm     exec --tp               rpmb -tp 
rpm     exec --tc               rpmb -tc 
rpm     exec --ti               rpmb -ti 
rpm     exec --tl               rpmb -tl 
rpm     exec --ta               rpmb -ta
rpm     exec --tb               rpmb -tb
rpm     exec --ts               rpmb -ts 
rpm     exec --rebuild          rpmb --rebuild
rpm     exec --recompile        rpmb --recompile
rpm     exec --clean            rpmb --clean
rpm     exec --rmsource         rpmb --rmsource
rpm     exec --rmspec           rpmb --rmspec
rpm     exec --target           rpmb --target
rpm     exec --short-circuit    rpmb --short-circuit

SEE ALSO

popt(3),
rpm2cpio(8),
rpmbuild(8),

http://www.rpm.org/ http://www.rpm.org/> 


39. Simplified overview Kernel parameters Solaris, AIX, Linux:
==============================================================

Throughout this document, you can find many other examples of settings.
This section is only a simplified overview.


39.1 Solaris:
-------------

The "/etc/system" file:

Available for Solaris Operating Environment, the /etc/system file contains definitions for kernel configuration limits 
such as the maximum number of users allowed on the system at a time, the maximum number of processes per user, 
and the inter-process communication (IPC) limits on size and number of resources. These limits are important because 
they affect, for example, DB2, Oracle performance on a Solaris Operating Environment machine. 

Some examples:

set shmsys:shminfo_shmmax=4294967295
set shmsys:shminfo_shmmin=1
set shmsys:shminfo_shmmni=100
set shmsys:shminfo_shmseg=10
set semsys:seminfo_semmni=100
set semsys:seminfo_semmsl=100
set semsys:seminfo_semmns=2500
set semsys:seminfo_semopm=100
set semsys:seminfo_semvmx=32767
..
..

You can use, among others, the "ipcs" command and "adb" command to retrieve kernel parameters and mem info.

Some remarks on Shared Memory and Semaphores:

- Shared Memory
Shared memory provides the fastest way for processes to pass large amounts of data to one another. 
As the name implies, shared memory refers to physical pages of memory that are shared by more than one process. 

Of particular interest is the "Intimate Shared Memory" facility, where the translation tables are shared 
as well as the memory. This enhances the effectiveness of the TLB (Translation Lookaside Buffer), 
which is a CPU-based cache of translation table information. Since the same information is used for 
several processes, available buffer space can be used much more efficiently. In addition, ISM-designated memory 
cannot be paged out, which can be used to keep frequently-used data and binaries in memory. 

Database applications are the heaviest users of shared memory. Vendor recommendations should be consulted 
when tuning the shared memory parameters. 

Solaris 10 only uses the shmmax and shmmni parameters. (Other parameters are set dynamically within the 
Solaris 10 IPC model.) 

shmmax (max-shm-memory in Solaris 10+): This is the maximum size of a shared memory segment 
(ie the largest value that can be used by shmget). Its theoretical maximum value is 4294967295 (4GB), 
but practical considerations usually limit it to less than this. There is no reason not to tune this value 
as high as possible, since no kernel resources are allocated based on this parameter. Solaris 10 sets shmmax 
to 1/4 physical memory by default, vs 512k for previous versions. 
shmmin: This is the smallest possible shared memory segment size. The default is 1 byte; this parameter 
should probably not be tuned. 
shmmni (max-shm-ids in Solaris 10+): Maximum number of shared memory identifiers at any given time. 
This parameter is used by kernel memory allocation to determine how much size to put aside for shmid_ds structures. 
Each of these is 112 bytes and requires an additional 8 bytes for a mutex lock; if it is set too high, memory useage 
can be a problem. The maximum setting for this variable in Solaris 2.5.1 and 2.6 is 2147483648 (2GB), and the 
default is 100. For Solaris 10, the default is 128 and the maximum is MAXINT. 
shmseg: Maximum number of segments per process. It is usually set to shmmni, but it should always be less 
than 65535. Sun documentations suggests a maximum for this parameter of 32767 and a default of 8 for 
Solaris 2.5.1 and 2.6. 

- Semaphores
Semaphores are a shareable resource that take on a non-negative integer value. They are manipulted 

by the P (wait) and V (signal) functions, which decrement and increment the semaphore, respectively. When a 
process needs a resource, a "wait" is issued and the semaphore is decremented. When the semaphore contains 
a value of zero, the resources are not available and the calling process spins or blocks (as appropriate) 
until resources are available. When a process releases a resource controlled by a semaphore, it increments 
the semaphore and the waiting processes are notified. 

Solaris 10 only uses the semmni, semmsl and semopm parameters. (Other parameters are dynamic within 
the Solaris 10 IPC model.) 

semmap: This sets the number of entries in the semaphore map. This should never be greater than semmni. If the number 
of semaphores per semaphore set used by the application is "n" then set semmap = ((semmni + n - 1)/n)+1
or more. Alternatively, we can set semmap to semmni x semmsl. An undersized semmap leads to "WARNING: 
rmfree map overflow" errors. The default setting is 10; the maximum for Solaris 2.6 is 2GB. The default for 
Solaris 9 was 25; Solaris 10 increased the default to 512. The limit is SHRT_MAX. 
semmni (max-sem-ids in Solaris 10+): Maximum number of systemwide semaphore sets. Each control structure consumes 
84 bytes. For Solaris 2.5.1-9, the default setting is 10; for Solaris 10, the default setting is 128. 
The maximum is 65535 
semmns: Maximum number of semaphores in the system. Each structure uses 16 bytes. This parameter should be set 
to semmni x semmsl. The default is 60; the maximum is 2GB. 
semmnu: Maximum number of undo structures in the system. This should be set to semmni so that each control structure 
has an undo structure. The default is 30, the maximum is 2 GB. 
semmsl (max-sem-nsems in Solaris 10+): Maximum number of semaphores per semaphore set. The default is 25, 
the maximum is 65535. 
semopm (max-sem-ops in Solaris 10+): Maximum number of semaphore operations that can be performed in each 
semop call. The default in Solaris 2.5.1-9 is 10, the maximum is 2 GB. Solaris 10 increased the default to 512. 
semume: Maximum number of undo structures per process. This should be set to semopm times the number of processes 
that will be using semaphores at any one time. The default is 10; the maximum is 2 GB. 
semusz: Number of bytes required for semume undo structures. This should not be tuned; it is set to 
semume x (1 + sizeof(undo)). The default is 96; the maximum is 2 GB. 
semvmx: Maximum value of a semaphore. This should never exceed 32767 (default value) unless SEM_UNDO 
is never used. The default is 32767; the maximum is 65535. 
semaem: Maximum adjust-on-exit value. This should almost always be left alone. The default is 16384; 
the maximum is 32767. 


39.2 Linux:
-----------

Kernel parameters used for system configuration are found in "/etc/sysctl.conf" and on a running system also in "/proc/sys/kernel", where you 
will find an individual file for each configuration parameter. Because these parameters have a direct effect on system 
performance and viability, you must have root access in order to modify them.

Occasionally, a prerequisite to a package installation requires the modification of kernel parameters. 
Since each parameter file contains a single line of data consisting of either a text 
string or numeric values, it is often easy to modify a parameter by simply using the echo command:

# echo 2048 > /proc/sys/kernel/msgmax

The aforementioned command will set the value of the msgmax parameter to 2048.

-- More on the proc File System:

The Linux kernel has two primary functions: to control access to physical devices on the computer 
and to schedule when and how processes interact with these devices. The /proc/ directory contains 
a hierarchy of special files which represent the current state of the kernel - allowing applications 
and users to peer into the kernel's view of the system. 

Within the /proc/ directory, one can find a wealth of information about the system hardware and any processes 
currently running. In addition, some of the files within the /proc/ directory tree can be manipulated by users 
and applications to communicate configuration changes to the kernel. 

Under Linux, all data are stored as files. Most users are familiar with the two primary types of files: 
text and binary. But the /proc/ directory contains another type of file called a virtual file. 
It is for this reason that /proc/ is often referred to as a virtual file system. 
These virtual files have unique qualities. Most of them are listed as zero bytes in size and yet when one 
is viewed, it can contain a large amount of information. In addition, most of the time and date settings 
on virtual files reflect the current time and date, indicative of the fact they constantly changing. 

Virtual files such as interrupts, /proc/meminfo, /proc/mounts, and /proc/partitions provide an 
up-to-the-moment glimpse of the system's hardware. Others, like /proc/filesystems and the /proc/sys/ 
directory provide system configuration information and interfaces. 

For organizational purposes, files containing information on a similar topic are grouped into virtual 
directories and sub-directories. For instance, /proc/ide/ contains information for all physical IDE devices. 
Likewise, process directories contain information about each running process on the system. 

By using the cat, more, or less commands on files within the /proc/ directory, you can immediately access 
an enormous amount of information about the system. For example, if you want to see what sort of CPU 
your computer has, type "cat /proc/cpuinfo" and you will see something similar to the following: 

processor	: 0
vendor_id	: AuthenticAMD
cpu family	: 5
model		: 9
model name	: AMD-K6(tm) 3D+ Processor
stepping	: 1
cpu MHz		: 400.919
cache size	: 256 KB
fdiv_bug	: no
hlt_bug		: no
f00f_bug	: no
coma_bug	: no
fpu		: yes
fpu_exception	: yes
cpuid level	: 1
wp		: yes
flags		: fpu vme de pse tsc msr mce cx8 pge mmx syscall 3dnow k6_mtrr
bogomips	: 799.53
 

When viewing different virtual files in the /proc/ file system, you will notice some of the information is 
easily understandable while some is not human-readable. This is in part why utilities exist to pull data 
from virtual files and display it in a useful way. Some examples of such applications are 
lspci, apm, free, and top. 

As a general rule, most virtual files within the /proc/ directory are read only. However, some can be used 
to adjust settings in the kernel. This is especially true for files in the /proc/sys/ subdirectory. 

To change the value of a virtual file, use the echo command and a > symbol to redirect the new value to the file. 
For instance, to change your hostname on the fly, you can type: 

echo bob.subgenius.com > /proc/sys/kernel/hostname 
 
Other files act as binary or boolean switches. For instance, if you type cat /proc/sys/net/ipv4/ip_forward, 
you will see either a 0 or a 1. A 0 indicates the kernel is not forwarding network packets. By using the 
echo command to change the value of the ip_forward file to 1, you can immediately turn packet forwarding on. 

Another command used to alter settings in the /proc/sys/ subdirectory is /sbin/sysctl.


-- sysctl:

Linux also provides the sysctl command to modify kernel parameters at runtime. 
Sysctl uses parameter information stored in a file called /etc/sysctl.conf. If, for example, we wanted to 
change the value of the msgmax parameter as we did above, but this time using sysctl, the command would 
look like this:

# sysctl -w kernel.msgmax=2048


- About the kernel:

Finding the Kernel
Locate the kernel image on your hard disk. It should be in the file /vmlinuz, or /vmlinux, or /boot/vmlinux
In some installations, /vmlinuz is a soft link to the actual kernel, so you may need to track down 
the kernel by following the links. On Redhat 6.1 it is in "/boot/vmlinuz". To find the kernel being used 
look in "/etc/lilo.conf".

You can also type "uname -a" to see the kernel version. 

/proc/cmdline

This file shows the parameters passed to the kernel at the time it is started. A sample /proc/cmdline file 
looks like this: 

ro root=/dev/hda2

This tell us the kernel is mounted read-only - signified by (ro) - off of the second partition 
on the first IDE device (/dev/hda2). 


- Kernel, memory tuning:

Most about tuning memory en kernel params seem to do with the "/etc/sysctl.conf" file:

In most distributions, the "/etc/sysctl.conf" determines the limits and/or behaviour of the kernel 
and memory.

If you type "sysctl -a |more" you will see a long list of kernel parameters. 
You can use this sysctl program to modify these parameters, for example:

# sysctl -w kernel.shmmax=100000000
# sysctl -w fs.file-max=65536
# echo "kernel.shmmax = 100000000" >> /etc/sysctl.conf


Example configuration: setting kernel parameters before installing Oracle 10g:
------------------------------------------------------------------------------

Most out of the box kernel parameters (of RHELS 3,4,5) are set correctly for Oracle
except a few.

You should have the following minimal configuration:

net.ipv4.ip_local_port_range	1024  65000
kernel.sem			250  32000  100  128
kernel.shmmni			4096
kernel.shmall			2097152
kernel.shmmax			2147483648
fs.file-max			65536


You can check the most important parameters using the following command:

# /sbin/sysctl -a | egrep 'sem|shm|file-max|ip_local'

net.ipv4.ip_local_port_range = 1024  65000
kernel.sem = 250  32000  100  128
kernel.shmmni = 4096
kernel.shmall = 2097152
kernel.shmmax = 2147483648
fs.file-max = 65536

If some value should be changed, you can change the "/etc/sysctl.conf" file and run the "/sbin/sysctl -p" command
to change the value immediately.
Every time the system boots, the init program runs the /etc/rc.d/rc.sysinit script. This script contains 
a command to execute sysctl using /etc/sysctl.conf to dictate the values passed to the kernel. 
Any values added to /etc/sysctl.conf will take effect each time the system boots. 


Example configuration: from: Installing Oracle 91 on Linux
-----------------------------------------------------------

For Linux, use the ipcs command to obtain a list of the system's current shared memory segments and 
semaphore sets, and their identification numbers and owner. 

Perform the following steps to modify the kernel parameters by using the /proc file system. 

Log in as the root user. 

Change to the /proc/sys/kernel directory. 

Review the current semaphore parameter values in the sem file by using the cat or more utility. 
For example, using the cat utility, enter the following command: 

# cat sem

The output lists, in order, the values for the SEMMSL, SEMMNS, SEMOPM, and SEMMNI parameters. 
The following example shows how the output appears: 

250 32000 32 128

In the preceding output example, 250 is the value of the SEMMSL parameter, 32000 is the value of the 
SEMMNS parameter, 32 is the value of the SEMOPM parameter, and 128 is the value of the SEMMNI parameter. 

Modify the parameter values by using the following command syntax: 

# echo SEMMSL_value SEMMNS_value SEMOPM_value SEMMNI_value > sem

Replace the parameter variables with the values for your system in the order that they are entered 
in the preceding example. For example: 

# echo 100 32000 100 100 > sem

Review the current shared memory parameters by using the cat or more utility. For example, using the cat utility, 
enter the following command: 

# cat shared_memory_parameter

In the preceding example, the variable shared_memory_parameter is either the SHMMAX or SHMMNI parameter. 
The parameter name must be entered in lowercase letters. 

Modify the shared memory parameter by using the echo utility. For example, to modify the SHMMAX parameter, 
enter the following command: 

# echo 2147483648 > shmmax

Modify the shared memory parameter by using the echo utility. For example, to modify the SHMMNI parameter, 
enter the following command: 

# echo 4096 > shmmni

Modify the shared memory parameter by using the echo utility. For example, to modify the SHMALL parameter, 
enter the following command: 

# echo 2097152 > shmall

Write a script to initialize these values during system startup, and include the script in your system init files. 

See Also: 
Your system vendor's documentation for more information on script files and init files.  

Set the File Handles by using ulimit -n and /proc/sys/fs/file-max. 

# echo 65536 > /proc/sys/fs/file-max
ulimit -n 65536

Set the Sockets to /proc/sys/net/ipv4/ip_local_port_range 

# echo 1024 65000 > /proc/sys/net/ipv4/ip_local_port_change

Set the Process limit by using ulimit -u. This will give you the number of processes per user. 

ulimit -u 16384


39.4 Linux modules:
-------------------


Modules on Linux (1):
---------------------

- insmod, rmmod, lsmod

lsmod:
------

lsmod - list loaded modules.   

SYNOPSIS
lsmod [-hV]   
DESCRIPTION
lsmod shows information about all loaded modules. 
The format is name, size, use count, list of referring modules. The information displayed is identical 
to that available from "/proc/modules". 

If the module controls its own unloading via a can_unload routine then the user count displayed by lsmod 
is always -1, irrespective of the real use count.   

insmod:
-------

insmod - install loadable kernel module 

SYNOPSIS
insmod [-fhkLmnpqrsSvVxXyYN] [-e persist_name] [-o module_name] [-O blob_name] [-P prefix] module [ symbol=value ... ] 
DESCRIPTION
insmod installs a loadable module in the running kernel. 
insmod tries to link a module into the running kernel by resolving all symbols from the kernel's 
exported symbol table. 

If the module file name is given without directories or extension, insmod will search for the module 
in some common default directories. The environment variable MODPATH can be used to override this default. 
If a module configuration file such as /etc/modules.conf exists, it will override the paths defined in MODPATH. 

The environment variable MODULECONF can also be used to select a different configuration file from the 
default /etc/modules.conf (or /etc/conf.modules (deprecated)). This environment variable will override 
all the definitions above. 

When environment variable UNAME_MACHINE is set, modutils will use its value instead of the machine field 
from the uname() syscall. This is mainly of use when you are compiling 64 bit modules in 32 bit user space 
or vice versa, set UNAME_MACHINE to the type of the modules. Current modutils does not support full 
cross build mode for modules, it is limited to choosing between 32 and 64 bit versions of the host architecture. 

rmmod:
------

rmmod - unload loadable modules   
SYNOPSIS
rmmod [ -aehrsvV ] module ...   
DESCRIPTION
rmmod unloads loadable modules from the running kernel. 
rmmod tries to unload a set of modules from the kernel, with the restriction that they are not in use 
and that they are not referred to by other modules. 

If more than one module is named on the command line, the modules will be removed in the given order. 
This supports unloading of stacked modules. 

With the option '-r', a recursive removal of modules will be attempted. This means that if a top module 
in a stack is named on the command line, all modules that are used by this module will be removed as well, 
if possible. 


More info about the mod commands:
---------------------------------

- Hardware Detection with the Help of hwinfo
hwinfo can detect the hardware of your system and select the drivers needed to run this hardware. 
Get a small introduction to this command with hwinfo --help. If you, for example, need information about 
your SCSI devices, use the command hwinfo --scsi.

All this information is also available in YaST in the hardware information module. 

- Handling Modules
The following commands are available:

insmod
insmod loads the requested module after searching for it in a subdirectory of /lib/modules/<version>. 
It is better, however, to use modprobe rather than insmod. 

rmmod
Unloads the requested module. This is only possible if this module is no longer needed. For example, 
the isofs module cannot be unloaded while a CD is still mounted. 

depmod
Creates the file modules.dep in /lib/modules/<version> that defines the dependencies of all the modules. 
This is necessary to ensure that all dependent modules are loaded with the selected ones. 
This file will be built after the system is started if it does not exist.

modprobe
Loads or unloads a given module while taking into account dependencies of this module. This command 
is extremely powerful and can be used for a lot of things (e.g., probing all modules of a given type 
until one is successfully loaded). In contrast to insmod, modprobe checks /etc/modprobe.conf and therefore 
is the preferred method of loading modules. For detailed information about this topic, refer to the 
corresponding man page. 

lsmod
Shows which modules are currently loaded as well as how many other modules are using them. Modules started 
by the kernel daemon are tagged with autoclean. This label denotes that these modules will automatically 
be removed once they reach their idle time limit. 

modinfo
Shows module information.

/etc/modprobe.conf
The loading of modules is affected by the files /etc/modprobe.conf and /etc/modprobe.conf.local 
and the directory /etc/modprobe.d. See man modprobe.conf. Parameters for modules that access hardware directly
must be entered in this file. Such modules may need system-specific options (e.g., CD-ROM driver or network driver). 
The parameters used here are described in the kernel sources. Install the package kernel-source and read the 
documentation in the directory /usr/src/linux/Documentation. 

Kmod - the Kernel Module Loader
The kernel module loader is the most elegant way to use modules. Kmod performs background monitoring 
and makes sure the required modules are loaded by modprobe as soon as the respective functionality is needed 
in the kernel. 

To use Kmod, activate the option `Kernel module loader' (CONFIG_KMOD) in the kernel configuration. 
Kmod is not designed to unload modules automatically; in view of today's RAM capacities, the potential memory savings 
would be marginal. For reasons of performance, monolithic kernels may be more suitable for servers 
that are used for special tasks and need only a few drivers. 


modprobe.conf:
--------------

Example 1:

# This file is autogenerated from /etc/modules.conf using generate-modprobe.conf command

alias eth1 sk98lin
alias eth0 ipw2200
alias sound-slot-0 snd-hda-intel
install scsi_hostadapter /sbin/modprobe ahci; /bin/true
remove snd-hda-intel /sbin/modprobe -r snd-pcm-oss; /sbin/modprobe --first-time -r --ignore-remove snd-hda-intel
install snd-hda-intel /sbin/modprobe --first-time --ignore-install snd-hda-intel && { /sbin/modprobe snd-pcm-oss; /bin/true; }
install usb-interface /sbin/modprobe uhci-hcd; /sbin/modprobe ehci-hcd; /bin/true
#alias eth1 eth1394
alias ieee1394-controller ohci1394
alias net-pf-10 off

#irda
alias tty-ldisc-11 irtty
alias char-major-161-* ircomm-tty

# Para nsc 383 SIO:
alias char-major-160-* nsc-ircc
alias irda0 nsc-ircc
options nsc-irc io=0x2f8 irq=3 dma=0
install nsc-ircc { /bin/setserial /dev/ttyS1 uart none; } ; /sbin/modprobe --first-time --ignore-install nsc-ircc

#irda: 0x2f8, irq 3, dma 0
#lpt: 0x3f8, irq 7, dma 1

options parport_pc io=0x378 irq=7 dma=1

Example 2:

alias ieee1394-controller ohci1394
alias eth0 eepro100
alias sound-slot-0 emu10k1
alias net-pf-10 off
install snd-emu10k1 /sbin/modprobe --first-time --ignore-install snd-emu10k1 
&& { /sbin/modprobe snd-pcm-oss; /bin/true; }
install usb-interface /sbin/modprobe usb-uhci; /sbin/modprobe ehci-hcd; /bin/true
remove snd-emu10k1 { /sbin/modprobe -r snd-pcm-oss; } ; /sbin/modprobe -r --first-time --ignore-remove snd-emu10k1 


/etc/sysconfig:
---------------

Note 1:
-------

SuSEconfig and /etc/sysconfig
The main configuration of SUSE LINUX can be made with the configuration files in /etc/sysconfig. 
Former versions of SUSE LINUX relied on /etc/rc.config for system configuration, but it became obsolete 
in previous versions. /etc/rc.config is not created at installation time, as all system configuration 
is controlled by /etc/sysconfig. However, if /etc/rc.config exists at the time of a system update, 
it remains intact.

The individual files in /etc/sysconfig are only read by the scripts to which they are relevant. This ensures 
that network settings, for instance, need to be parsed only by network-related scripts. Apart from that, 
there are many other system configuration files that are generated according to the settings in /etc/sysconfig. 
This task is performed by SuSEconfig. For example, if you change the network configuration, SuSEconfig is likely 
to make changes to the file /etc/host.conf as well, as this is one of the files relevant for the 
network configuration. 

If you change anything in these files manually, run SuSEconfig afterwards to make sure all the necessary 
changes are made in all the relevant places. If you change the configuration using the YaST sysconfig editor, 
all changes are applied automatically - YaST automatically starts SuSEconfig to update the configuration 
files as needed.

This concept enables you to make basic changes to your configuration without needing to reboot the system. 
Because some changes are rather complex, some programs must be restarted for the changes to take effect. 
For instance, changes to the network configuration may require a restart of the network programs concerned. 
This can be achieved by entering the commands rcnetwork stop and rcnetwork start.

Note 2:
-------

The Linux sysconfig directory
The /etc/sysconfig directory is where many of the files that control the system configuration are stored. 
This section lists these files and many of the optional values in the files used to make system changes. 
To get complete information on these files read the file /usr/doc/initscripts-4.48/sysconfig.txt. 

/etc/sysconfig/clock
Used to configure the system clock to Universal or local time and set some other clock parameters. An example file: 
UTC=false
ARC=false

Options: 
UTC - true means the clock is set to UTC time otherwise it is at local time 
ARC - Set true on alpha stations only. It indicates the ARC console's 42-year time offset is in effect. If not set to true, the normal Unix epoch is assumed. 
ZONE="filename" - indicates the zonefile under the directory /usr/share/zoneinfo that the /etc/localtime file is a copy of. This may be set to: 
ZONE="US/Eastern" 

/etc/sysconfig/init
This file is used to set some terminal characteristics and environment variables. A sample listing: 
# color => new RH6.0 bootup
# verbose => old-style bootup
# anything else => new style bootup without ANSI colors or positioning
BOOTUP=color
# column to start "[  OK  ]" label in 
RES_COL=60
# terminal sequence to move to that column. You could change this
# to something like "tput hpa ${RES_COL}" if your terminal supports it
MOVE_TO_COL="echo -en \\033[${RES_COL}G"
# terminal sequence to set color to a 'success' color (currently: green)
SETCOLOR_SUCCESS="echo -en \\033[1;32m"
# terminal sequence to set color to a 'failure' color (currently: red)
SETCOLOR_FAILURE="echo -en \\033[1;31m"
# terminal sequence to set color to a 'warning' color (currently: yellow)
SETCOLOR_WARNING="echo -en \\033[1;33m"
# terminal sequence to reset to the default color.
SETCOLOR_NORMAL="echo -en \\033[0;39m"
# default kernel loglevel on boot (syslog will reset this)
LOGLEVEL=1
# Set to something other than 'no' to turn on magic sysrq keys...
MAGIC_SYSRQ=no
# Set to anything other than 'no' to allow hotkey interactive startup...
PROMPT=yes

Options: 
BOOTUP=bootupmode - Choices are color, or verbose. The choice color sets new boot display. The choice verbose sets old style display. Anything else sets a new display without ANSI formatting. 
LOGLEVEL=number - Sets the initial console logging level for the kernel. The default is 7. The values are: 
emergency, panic - System is unusable 
alert - Action must be taken immediately 
crit - Critical conditions 
err, error (depreciated) - Error conditions 
warning, warn (depreciated) - Warning conditions 
notice - Normal but significant conditions 
info - Informational message 
debug - Debug level message 
RES_COL=number - Screen column to start status labels at. The Default is 60. 
MOVE_TO_COL=command - A command to move the cursor to $RES_COL. 
SETCOLOR_SUCCESS=command - Set the color used to indicate success. 
SETCOLOR_FAILURE=command - Set the color used to indicate failure. 
SETCOLOR_WARNING=command - Set the color used to indicate warning. 
SETCOLOR_NORMAL=command - Set the color used tor normal color 
MAGIC_SYSRQ=yes|no - Set to 'no' to disable the magic sysrq key. 
PROMPT=yes|no - Set to 'no' to disable the key check for interactive mode. 


/etc/sysconfig/keyboard
Used to configure the keyboard. Used by the startup script /etc/rc.d/rc.sysinit. An example file: 
KEYTABLE="us"

Options: 
KEYTABLE="keytable file" - The line [ KEYTABLE="/usr/lib/kbd/keytables/us.map" ] tells the system to use the file shown for keymapping. 
KEYBOARDTYPE=sun|pc - The selection, "sun", indicates attached on /dev/kbd is a sun keyboard. The selection "pc" indicates a PS/2 keyboard is on the ps/2 port. 


/etc/sysconfig/mouse
This file is used to configure the mouse. An example file: 
FULLNAME="Generic - 2 Button Mouse (PS/2)"
MOUSETYPE="ps/2"
XEMU3="yes"
XMOUSETYPE="PS/2"

Options: 
MOUSETYPE=type - Choices are microsoft, mouseman, mousesystems, ps/2, msbm, logibm, atibm, logitech, mmseries, or mmhittab. 
XEMU3=yes|no - If yes, emulate three buttons, otherwise not. 


/etc/sysconfig/network
Used to configure networking options. All IPX options default to off. An example file: 
NETWORKING=yes
FORWARD_IPV4="yes"
HOSTNAME="mdct-dev3"
GATEWAY="10.1.0.25"
GATEWAYDEV="eth0"

Options: 
NETWORKING=yes|no - Sets network capabilities on or off. 
HOSTNAME="hostname". To work with old software, the /etc/HOSTNAME file should contain the same hostname. 
FORWARD_IPV4=yes|no - Turns the ability to perform IP forwarding on or off. Turn it on if you want to use the machine as a router. Turn it off to use it as a firewall or IP masquerading. 
DEFRAG_IPV4=yes|no - Set this to automatically defragment IPv4 packets. This is good for masquerading, and a bad idea otherwise. It defaults to 'no'. 
GATEWAY="gateway IP" 
GATEWAYDEV="gateway device" Possible values include eth0, eth1, or ppp0. 
NISDOMAIN="nis domain name" 
IPX=yes|no - Turn IPX ability on or off. 
IPXAUTOPRIMARY=on|off - Must not be yes or no. 
IPXAUTOFRAME=on|off 
IPXINTERNALNETNUM="netnum" 
IPXINTERNALNODENUM="nodenum" 


/etc/sysconfig/static-routes
Configures static routes on a network. Used to set up static routing. An example file: 
eth1 net 192.168.199.0 netmask 255.255.255.0 gw 192.168.199.1
eth0 net 10.1.0.0 netmask 255.255.0.0 gw 10.1.0.153
eth1 net 255.255.255.255 netmask 255.255.255.255

The syntax is: 
device net network netmask netmask gw gateway 

The device may be a device name such as eth0 which is used to have the route brought up and down as the device is brought up or down. The value can also be "any" to let the system calculate the correct devices at run time. 


/etc/sysconfig/routed 
Sets up dynamic routing policies. An example file: 
EXPORT_GATEWAY="no"
SILENT="yes"

Options: 
SILENT=yes|no 
EXPORT_GATEWAY=yes|no 


/etc/sysconfig/pcmcia
Used to configure pcmcia network cards. An example file: 
PCMCIA=no
PCIC=
PCIC_OPTS=
CORE_OPTS=

Options: 
PCMCIA=yes|no 
PCIC=i82365|tcic 
PCIC_OPTS=socket driver (i82365 or tcic) timing parameters 
CORE_OPTS=pcmcia_core options 
CARDMGR_OPTS=cardmgr options 


/etc/sysconfig/amd
Used to configure the auto mount daemon. An example file: 
ADIR=/.automount
MOUNTPTS='/net /etc/amd.conf'
AMDOPTS=

Options: 
ADIR=/.automount (normally never changed) 
MOUNTPTS='/net /etc/amd.conf' (standard automount stuff) 
AMDOPTS= (extra options for AMD) 


/etc/sysconfig/tape
Used for backup tape device configuration. Options: 
DEV=/dev/nst0 - The tape device. Use the non-rewinding tape for these scripts. For SCSI tapes the device is /dev/nst#, where # is the number of the tape drive you want to use. If you only have one then use nst0. For IDE tapes the device is /dev/ht#. For floppy tape drives the device is /dev/ftape. 
ADMIN=root - The person to mail to if the backup fails for any reason 
SLEEP=5 - The time to sleep between tape operations. 
BLOCKSIZE=32768 - This worked fine for 8mm, then 4mm, and now DLT. An optimal setting is probably the amount of data your drive writes at one time. 
SHORTDATE=$(date +%y:%m:%d:%H:%M) - A short date string, used in backup log filenames. 
DAY=$(date +log-%y:%m:%d) - Used for the log file directory. 
DATE=$(date) - Date string, used in log files. 
LOGROOT=/var/log/backup - Root of the logging directory 
LIST=$LOGROOT/incremental-list - This is the file name the incremental backup will use to store the incremental list. It will be $LIST-{some number}. 
DOTCOUNT=$LOGROOT/.count - For counting as you go to know which incremental list to use. 
COUNTER=$LOGROOT/counter-file - For rewinding when done...might not use. 
BACKUPTAB=/etc/backuptab - The file in which we keep our list of backup(s) we want to make. 


/etc/sysconfig/sendmail
An example file: 
DAEMON=yes
QUEUE=1h

Options: 
DAEMON=yes|no - yes implies -bd 
QUEUE=1h - Given to sendmail as -q$QUEUE. The -q option is not given to sendmail if /etc/sysconfig/sendmail exists and QUEUE is empty or undefined. 


/etc/sysconfig/i18n
Controls the system font settings. The language variables are used in /etc/profile.d/lang.sh. An example i18n file: 
LANG="en_US"
LC_ALL="en_US"
LINGUAS="en_US"

Options: 
LANG= set locale for all categories, can be any two letter ISO language code. 
LC_CTYPE= localedata configuration for classification and conversion of characters. 
LC_COLLATE= localedata configuration for collation (sort order) of strings. 
LC_MESSAGES= localedata configuration for translation of yes and no messages. 
LC_NUMERIC= localedata configuration for non-monetary numeric data. 
LC_MONETARY= localedata configuration for monetary data. 
LC_TIME= localedata configuration for date and time. 
LC_ALL= localedata configuration overriding all of the above. 
LANGUAGE= can be a : separated list of ISO language codes. 
LINGUAS= can be a ' ' separated list of ISO language codes. 
SYSFONT= any font that is legal when used as /usr/bin/consolechars -f $SYSFONT ... (See console-tools package for consolechars command) 
UNIMAP= any SFM (screen font map, formerly called Unicode mapping table - see consolechars(8)) 
/usr/bin/consolechars -f $SYSFONT --sfm $UNIMAP 

SYSFONTACM= any ACM (application charset map - see consolechars(8)) 
/usr/bin/consolechars -f $SYSFONT --acm $SYSFONTACM 

The above is used by the /sbin/setsysfont command (which is run by rc.sysinit at boot time.) 


/etc/sysconfig/network-scripts/ifup:
/etc/sysconfig/network-scripts/ifdown:
These are symbolic links to /sbin/ifup and /sbin/ifdown, respectively. These symlinks are here for legacy purposes only. They will probably be removed in future versions. These scripts take one argument normally: the name of the device (e.g. eth0). They are called with a second argument of "boot" during the boot sequence so that devices that are not meant to be brought up on boot (ONBOOT=no, see below) can be ignored at that time. 


/etc/sysconfig/network-scripts/network-functions
This is not really a public file. Contains functions which the scripts use for bringing interfaces up and down. In particular, it contains most of the code for handling alternative interface configurations and interface change notification through netreport. 


/etc/sysconfig/network-scripts/ifcfg-interface
/etc/sysconfig/network-scripts/ifcfg-interface-clone
Defines an interface. An example file called ifcfg-eth0: 
DEVICE="eth0"
IPADDR="10.1.0.153"
NETMASK="255.255.0.0"
ONBOOT="yes"
BOOTPROTO="none"
IPXNETNUM_802_2=""
IPXPRIMARY_802_2="no"
IPXACTIVE_802_2="no"
IPXNETNUM_802_3=""
IPXPRIMARY_802_3="no"
IPXACTIVE_802_3="no"
IPXNETNUM_ETHERII=""
IPXPRIMARY_ETHERII="no"
IPXACTIVE_ETHERII="no"
IPXNETNUM_SNAP=""
IPXPRIMARY_SNAP="no"
IPXACTIVE_SNAP="no"

The /etc/sysconfig/network-scripts/ifcfg-interface-clone file only contains the parts of the definition that are different in a "clone" (or alternative) interface. For example, the network numbers might be different, but everything else might be the same, so only the network numbers would be in the clone file, but all the device information would be in the base ifcfg file.

Base items in the above two files: 

NAME="friendly name for users to see" - Most important for PPP. Only used in front ends. 
DEVICE="name of physical device" 
IPADDR= 
NETMASK= 
GATEWAY= 
ONBOOT=yes|no 
USERCTL=yes|no 
BOOTPROTO=none|bootp|dhcp - If BOOTPROTO is not "none", then the only other item that must be set is the DEVICE item; all the rest will be determined by the boot protocol. No "dummy" entries need to be created. 
Base items being deprecated: 
NETWORK="will be calculated automatically with ifcalc" 
BROADCAST="will be calculated automatically with ifcalc" 
Ethernet-only items: 
{IPXNETNUM,IPXPRIMARY,IPXACTIVE}_{802_2,802_3,ETHERII,SNAP} configuration matrix for IPX. Only used if IPX is active. Managed from /etc/sysconfig/network-scripts/ifup-ipx 
PPP/SLIP items: 
PERSIST=yes|no 
MODEMPORT=device - An example device is /dev/modem. 
LINESPEED=speed - An example speed is 115200. 
DEFABORT=yes|no - Tells netcfg whether or not to put default abort strings in when creating/editing the chat script and/or dip script for this interface. 
PPP-specific items 
WVDIALSECT="list of sections from wvdial.conf to use" - If this variable is set, then the chat script (if it exists) is ignored, and wvdial is used to open the PPP connection. 
PEERDNS=yes|no - Modify /etc/resolv.conf if peer uses msdns extension. 
DEFROUTE=yes|no - Set this interface as default route? 
ESCAPECHARS=yes|no -Simplified interface here doesn't let people specify which characters to escape; almost everyone can use asyncmap 00000000 anyway, and they can set PPPOPTIONS to asyncmap foobar if they want to set options perfectly). 
HARDFLOWCTL=yes|no - Yes implies "modem crtscts" options. 
PPPOPTIONS="arbitrary option string" - It is placed last on the command line, so it can override other options like asyncmap that were specified differently. 
PAPNAME="name $PAPNAME" - On pppd command line. Note that the "remotename" option is always specified as the logical ppp device name, like "ppp0" (which might perhaps be the physical device ppp1 if some other ppp device was brought up earlier...), which makes it easy to manage pap/chap files -- name/password pairs are associated with the logical ppp device name so that they can be managed together. 
REMIP="remote ip address" - Normally unspecified. 
MTU= 
MRU= 
DISCONNECTTIMEOUT="number of seconds" The current default is 5. This is the time to wait before re-establishing the connection after a successfully-connected session terminates before attempting to establish a new connection. 
RETRYTIMEOUT="number of seconds" - The current default is 60. This is the time to wait before re-attempting to establish a connection after a previous attempt fails. 
/etc/sysconfig/network-scripts/chat-interface - This is the chat script for PPP or SLIP connection intended to establish the connection. For SLIP devices, a DIP script is written from the chat script; for PPP devices, the chat script is used directly.


/etc/sysconfig/network-scripts/dip-interface
A write-only script created from the chat script by netcfg. Do not modify this. In the future, this file may disappear by default and created on-the-fly from the chat script if it does not exist.


/etc/sysconfig/network-scripts/ifup-post
Called when any network device EXCEPT a SLIP device comes up. Calls /etc/sysconfig/network-scripts/ifup-routes to bring up static routes that depend on that device. Calls /etc/sysconfig/network-scripts/ifup-aliases to bring up aliases for that device. Sets the hostname if it is not already set and a hostname can be found for the IP for that device. Sends SIGIO to any programs that have requested notification of network events. It could be extended to fix up nameservice configuration, call arbitrary scripts, etc, as needed.


/etc/sysconfig/network-scripts/ifup-routes
Set up static routes for a device. An example file: 
#!/bin/sh

# adds static routes which go through device $1

if [ "$1" = "" ]; then
	echo "usage: $0 <net-device>"
	exit 1
fi

if [ ! -f /etc/sysconfig/static-routes ]; then
	exit 0
fi

#note the trailing space in the grep gets rid of aliases
grep "^$1 " /etc/sysconfig/static-routes | while read device args; do
	/sbin/route add -$args $device
done


/etc/sysconfig/network-scripts/ifup-aliases
Bring up aliases for a device.


/etc/sysconfig/network-scripts/ifdhcpc-done
Called by dhcpcd once dhcp configuration is complete; sets up /etc/resolv.conf from the version dhcpcd dropped in /etc/dhcpc/resolv.conf 


Note 3:
-------

Red Hat Linux 8.0: The Official Red Hat Linux Reference Guide 
Prev Chapter 3. Boot Process, Init, and Shutdown Next 

--------------------------------------------------------------------------------

The /etc/sysconfig/ Directory
The following information outlines some of the files found in the /etc/sysconfig/ directory, their function, 
and their contents. This information is not intended to be complete, as many of these files have a variety 
of options that are only used in very specific or rare circumstances.

The /usr/share/doc/initscripts-<version-number>/sysconfig.txt file contains a more authoritative listing 
of the files found in the /etc/sysconfig directory and the configuration options available.

Files in the /etc/sysconfig/ Directory
The following files are normally found in the /etc/sysconfig/ directory:

amd
apmd
arpwatch
authconfig
cipe
clock
desktop
dhcpd
firstboot
gpm
harddisks
hwconf
i18n
identd
init
ipchains
iptables
irda
keyboard
kudzu
mouse
named
netdump
network
ntpd
pcmcia
radvd
rawdevices
redhat-config-users
redhat-logviewer
samba
sendmail
soundcard
squid
tux
ups
vncservers
xinetd

It is possible that your system may be missing a few of them if the corresponding program that would need 
that file is not installed.

Next, we will take a look at each one.

/etc/sysconfig/amd
The /etc/sysconfig/amd file contains various parameters used by amd allowing for the automounting and 
automatic unmounting of file systems.

/etc/sysconfig/apmd
The /etc/sysconfig/apmd file is used by apmd as a configuration for what things to start/stop/change 
on suspend or resume. It is set up to turn on or off apmd during startup, depending on whether your hardware 
supports Advanced Power Management (APM) or if you choose not to use it. apm is a monitoring daemon that works 
with power management code within the Linux kernel. It can alert you to a low battery if you are using 
Red Hat Linux on a laptop, among other things.

/etc/sysconfig/arpwatch
The /etc/sysconfig/arpwatch file is used to pass arguments to the arpwatch daemon at boot time. 
The arpwatch daemon maintains a table of Ethernet MAC addresses and their IP address pairings. 
For more information about what parameters you can use in this file, type man arpwatch. By default, 
this file sets the owner of the arpwatch process to the user pcap.

/etc/sysconfig/authconfig
The /etc/sysconfig/authconfig file sets the kind of authorization to be used on the host. 
It contains one or more of the following lines:

USEMD5=<value>, where <value> is one of the following:

yes - MD5 is used for authentication.
no - MD5 is not used for authentication.

USEKERBEROS=<value>, where <value> is one of the following:

yes - Kerberos is used for authentication.
no - Kerberos is not used for authentication.

USELDAPAUTH=<value>, where <value> is one of the following:

yes - LDAP is used for authentication.
no - LDAP is not used for authentication.

/etc/sysconfig/clock
The /etc/sysconfig/clock file controls the interpretation of values read from the system hardware clock.

The correct values are:

UTC=<value>, where <value> is one of the following boolean values:

true or yes - Indicates that the hardware clock is set to Universal Time.
false or no - Indicates that the hardware clock is set to local time.

ARC=<value>, where <value> is the following:

true or yes - Indicates the ARC console's 42-year time offset is in effect. This setting is only 
for ARC- or AlphaBIOS-based Alpha systems. Any other value indicates that the normal UNIX epoch is in use.

SRM=<value>, where <value> is the following:

true or yes - Indicates the SRM console's 1900 epoch is in effect. This setting is only for SRM-based 
Alpha systems. Any other value indicates that the normal UNIX epoch is in use.

ZONE=<filename> - Indicates the timezone file under /usr/share/zoneinfo that /etc/localtime is a copy of, such as:

ZONE="America/New York"


Earlier releases of Red Hat Linux used the following values (which are deprecated):

CLOCKMODE=<value>, where <value> is one of the following:

GMT - Indicates that the clock is set to Universal Time (Greenwich Mean Time).

ARC - Indicates the ARC console's 42-year time offset is in effect (for Alpha-based systems only).

/etc/sysconfig/desktop
The /etc/sysconfig/desktop file specifies the desktop manager to be run, such as:

DESKTOP="GNOME"

/etc/sysconfig/dhcpd
The /etc/sysconfig/dhcpd file is used to pass arguments to the dhcpd daemon at boot time. 
The dhcpd daemon implements the Dynamic Host Configuration Protocol (DHCP) and the Internet Bootstrap 
Protocol (BOOTP). DHCP and BOOTP assign hostnames to machines on the network. For more information 
about what parameters you can use in this file, type man dhcpd.

/etc/sysconfig/firstboot
Beginning with Red Hat Linux 8.0, the first time you boot the system, the /sbin/init program calls 
the etc/rc.d/init.d/firstboot script. This allows the user to install additional applications 
and documentation before the boot process completes.

The /etc/sysconfig/firstboot file tells the firstboot command not to run on subsequent reboots. 
If you want firstboot to run the next time you boot the system, simply remove /etc/sysconfig/firstboot 
and execute chkconfig --level 5 firstboot on.

/etc/sysconfig/gpm
The /etc/sysconfig/gpm file is used to pass arguments to the gpm daemon at boot time. The gpm daemon is the 
mouse server which allows mouse acceleration and middle-click pasting. For more information about what 
parameters you can use in this file, type man gpm. By default, it sets the mouse device to /dev/mouse.

/etc/sysconfig/harddisks
The /etc/sysconfig/harddisks file allows you to tune your hard drive(s). You can also use /
etc/sysconfig/hardiskhd[a-h], to configure parameters for specific drives.

 Warning 
  Do not make changes to this file lightly. If you change the default values stored here, you could 
  corrupt all of the data on your hard drive(s).
 
The /etc/sysconfig/harddisks file may contain the following:

USE_DMA=1, where setting this to 1 enables DMA. However, with some chipsets and hard drive combinations, 
DMA can cause data corruption. Check with your hard drive documentation or manufacturer before enabling this.

Multiple_IO=16, where a setting of 16 allows for multiple sectors per I/O interrupt. When enabled, 
this feature reduces operating system overhead by 30-50%. Use with caution.

EIDE_32BIT=3 enables (E)IDE 32-bit I/O support to an interface card.

LOOKAHEAD=1 enables drive read-lookahead.

EXTRA_PARAMS= specifies where extra parameters can be added.

/etc/sysconfig/hwconf
The /etc/sysconfig/hwconf file lists all the hardware that kudzu detected on your system, as well as 
the drivers used, vendor ID and device ID information. The kudzu program detects and configures new and/or 
changed hardware on a system. The /etc/sysconfig/hwconf file is not meant to be manually edited. 
If you do edit it, devices could suddenly show up as being added or removed.

/etc/sysconfig/i18n
The /etc/sysconfig/i18n file sets the default language, such as:

LANG="en_US"

/etc/sysconfig/identd
The /etc/sysconfig/identd file is used to pass arguments to the identd daemon at boot time. 
The identd daemon returns the username of processes with open TCP/IP connections. Some services on 
the network, such as FTP and IRC servers, will complain and cause slow responses if identd is not running. 
But in general, identd is not a required service, so if security is a concern, you should not run it. 
For more information about what parameters you can use in this file, type man identd. By default, 
the file contains no parameters.

/etc/sysconfig/init
The /etc/sysconfig/init file controls how the system will appear and function during the boot process.

The following values may be used:

BOOTUP=<value>, where <value> is one of the following:

BOOTUP=color means the standard color boot display, where the success or failure of devices and services starting up is shown in different colors.

BOOTUP=verbose means an old style display, which provides more information than purely a message of success or failure.

Anything else means a new display, but without ANSI-formatting.

RES_COL=<value>, where <value> is the number of the column of the screen to start status labels. Defaults to 60.

MOVE_TO_COL=<value>, where <value> moves the cursor to the value in the RES_COL line. Defaults to ANSI sequences output by echo -e.

SETCOLOR_SUCCESS=<value>, where <value> sets the color to a color indicating success. Defaults to ANSI sequences output by echo -e, setting the color to green.

SETCOLOR_FAILURE=<value>, where <value> sets the color to a color indicating failure. Defaults to ANSI sequences output by echo -e, setting the color to red.

SETCOLOR_WARNING=<value>, where <value> sets the color to a color indicating warning. Defaults to ANSI sequences output by echo -e, setting the color to yellow.

SETCOLOR_NORMAL=<value>, where <value> sets the color to 'normal'. Defaults to ANSI sequences output by echo -e.

LOGLEVEL=<value>, where <value> sets the initial console logging level for the kernel. The default is 7; 8 means everything (including debugging); 1 means nothing except kernel panics. syslogd will override this once it starts.

PROMPT=<value>, where <value> is one of the following boolean values:

yes - Enables the key check for interactive mode.

no - Disables the key check for interactive mode.

/etc/sysconfig/ipchains
The /etc/sysconfig/ipchains file contains information used by the kernel to set up ipchains packet filtering rules at boot time or whenever the service is started.

This file is modified by typing the command /sbin/service ipchains save when valid ipchains rules are in place. You should not manually edit this file. Instead, use the /sbin/ipchains command to configure the necessary packet filtering rules and then save the rules to this file using /sbin/service ipchains save.

Use of ipchains to set up firewall rules is not recommended as it is deprecated and may disappear from future releases of Red Hat Linux. If you need a firewall, you should use iptables instead.

/etc/sysconfig/iptables
Like /etc/sysconfig/ipchains, the /etc/sysconfig/iptables file stores information used by the kernel to set up packet filtering services at boot time or whenever the service is started.

You should not modify this file by hand unless you are familiar with how to construct iptables rules. The simplest way to add rules is to use the /usr/sbin/lokkit command or the gnome-lokkit graphical application to create your firewall. Using these applications will automatically edit this file at the end of the process.

If you wish, you can manually create rules using /sbin/iptables and then type /sbin/service iptables save to add the rules to the /etc/sysconfig/iptables file.

Once this file exists, any firewall rules saved there will persist through a system reboot or a service restart.

For more information on iptables see Chapter 13.

/etc/sysconfig/irda
The /etc/sysconfig/irda file controls how infrared devices on your system are configured at startup.

The following values may be used:

IRDA=<value>, where <value> is one of the following boolean values:

yes - irattach will be run, which periodically checks to see if anything is trying to connect to the infrared port, such as another notebook computer trying to make a network connection. For infrared devices to work on your system, this line must be set to yes.

no - irattach will not be run, preventing infrared device communication.

DEVICE=<value>, where <value> is the device (usually a serial port) that handles infrared connections.

DONGLE=<value>, where <value> specifies the type of dongle being used for infrared communication. This setting exists for people who use serial dongles rather than real infrared ports. A dongle is a device that is attached to a traditional serial port to communicate via infrared. This line is commented out by default because notebooks with real infrared ports are far more common than computers with add-on dongles.

DISCOVERY=<value>, where <value> is one of the following boolean values:d

yes - Starts irattach in discovery mode, meaning it actively checks for other infrared devices. This needs to be turned on for the machine to be actively looking for an infrared connection (meaning the peer that does not initiate the connection).

no - Does not start irattach in discovery mode.

/etc/sysconfig/keyboard
The /etc/sysconfig/keyboard file controls the behavior of the keyboard. The following values may be used:

KEYBOARDTYPE=sun|pc, which is used on SPARCs only. sun means a Sun keyboard is attached on /dev/kbd, and pc means a PS/2 keyboard connected to a PS/2 port.

KEYTABLE=<file>, where <file> is the name of a keytable file.

For example: KEYTABLE="us". The files that can be used as keytables start in /lib/kbd/keymaps/i386 and branch into different keyboard layouts from there, all labeled <file>.kmap.gz. The first file found beneath /lib/kbd/keymaps/i386that matches the KEYTABLE setting is used.

/etc/sysconfig/kudzu
The /etc/sysconfig/kuzdu allows you to specify a safe probe of your system's hardware by kudzu at boot time. A safe probe is one that disables serial port probing.

SAFE=<value>, where <value> is one of the following:

yes - kuzdu does a safe probe.

no - kuzdu does a normal probe.

/etc/sysconfig/mouse
The /etc/sysconfig/mouse file is used to specify information about the available mouse. The following values may be used:

FULLNAME=<value>, where <value> refers to the full name of the kind of mouse being used.

MOUSETYPE=<value>, where <value> is one of the following:

microsoft - A MicrosoftT mouse.

mouseman - A MouseManT mouse.

mousesystems - A Mouse SystemsT mouse.

ps/2 - A PS/2 mouse.

msbm - A MicrosoftT bus mouse.

logibm - A LogitechT bus mouse.

atibm - An ATIT bus mouse.

logitech - A LogitechT mouse.

mmseries - An older MouseManT mouse.

mmhittab - An mmhittab mouse.

XEMU3=<value>, where <value> is one of the following boolean values:

yes - The mouse only has two buttons, but three mouse buttons should be emulated.

no - The mouse already has three buttons.


XMOUSETYPE=<value>, where <value> refers to the kind of mouse used when X is running. The options here are the same as the MOUSETYPE setting in this same file.

DEVICE=<value>, where <value> is the mouse device.

In addition, /dev/mouse is a symbolic link that points to the actual mouse device.

/etc/sysconfig/named
The /etc/sysconfig/named file is used to pass arguments to the named daemon at boot time. The named daemon is a Domain Name System (DNS) server which implements the Berkeley Internet Name Domain (BIND) version 9 distribution. This server maintains a table of which hostnames are associated with IP addresses on the network.

Currently, only the following values may be used:

ROOTDIR="</some/where>", where </some/where> refers to the full directory path of a configured chroot environment under which named will run. This chroot environment must first be configured. Type info chroot for more information on how to do this.

OPTIONS="<value>", where <value> any option listed in the man page for named except -t. In place of -t, use the ROOTDIR line above instead.

For more information about what parameters you can use in this file, type man named. For detailed information on how to configure a BIND DNS server, see Chapter 16. By default, the file contains no parameters.

/etc/sysconfig/netdump
The /etc/sysconfig/netdump file is the configuration file for the /etc/init.d/netdump service. The netdump service sends both oops data and memory dumps over the network. In general, netdump is not a required service, so you should only run it if you absolutely need to. For more information about what parameters you can use in this file, type man netdump.

/etc/sysconfig/network
The /etc/sysconfig/network file is used to specify information about the desired network configuration. The following values may be used:

NETWORKING=<value>, where <value> is one of the following boolean values:

yes - Networking should be configured.

no - Networking should not be configured.

HOSTNAME=<value>, where <value> should be the Fully Qualified Domain Name (FQDN), such as hostname.domain.com, but can be whatever hostname you want.

 Note 
  For compatibility with older software that people might install (such as trn), the /etc/HOSTNAME file should contain the same value as here.
 

GATEWAY=<value>, where <value> is the IP address of the network's gateway.

GATEWAYDEV=<value>, where <value> is the gateway device, such as eth0.

NISDOMAIN=<value>, where <value> is the NIS domain name.

/etc/sysconfig/ntpd
The /etc/sysconfig/ntpd file is used to pass arguments to the ntpd daemon at boot time. The ntpd daemon sets and maintains the system clock to synchronize with an Internet standard time server. It implements version 4 of the Network Time Protocol (NTP). For more information about what parameters you can use in this file, point a browser at the following file: /usr/share/doc/ntp-<version>/ntpd.htm (where <version> is the version number of ntpd). By default, this file sets the owner of the ntpd process to the user ntp.

/etc/sysconfig/pcmcia
The /etc/sysconfig/pcmcia file is used to specify PCMCIA configuration information. The following values may be used:

PCMCIA=<value>, where <value> is one of the following:

yes - PCMCIA support should be enabled.

no - PCMCIA support should not be enabled.

PCIC=<value>, where <value> is one of the following:

i82365 - The computer has an i82365-style PCMCIA socket chipset.

tcic - The computer has a tcic-style PCMCIA socket chipset.

PCIC_OPTS=<value>, where <value> is the socket driver (i82365 or tcic) timing parameters.

CORE_OPTS=<value>, where <value> is the list of pcmcia_core options.

CARDMGR_OPTS=<value>, where <value> is the list of options for the PCMCIA cardmgr (such as -q for quiet mode; -m to look for loadable kernel modules in the specified directory, and so on). Read the cardmgr man page for more information.

/etc/sysconfig/radvd
The /etc/sysconfig/radvd file is used to pass arguments to the radvd daemon at boot time. The radvd daemon listens to for router requests and sends router advertisements for the IP version 6 protocol. This service allows hosts on a network to dynamically change their default routers based on these router advertisements. For more information about what parameters you can use in this file, type man radvd. By default, this file sets the owner of the radvd process to the user radvd.

/etc/sysconfig/rawdevices
The /etc/sysconfig/rawdevices file is used to configure raw device bindings, such as:

/dev/raw/raw1 /dev/sda1
/dev/raw/raw2 8 5

 
/etc/sysconfig/redhat-config-users
The /etc/sysconfig/redhat-config-users file is the configuration file for the graphical application, User Manager. Under Red Hat Linux 8.0 this file is used to filter out system users such as root, daemon, or lp. This file is edited by the Preferences => Filter system users and groups pull-down menu in the User Manager application and should not be edited by hand. For more information on using this application, see the chapter called User and Group Configuration in the Official Red Hat Linux Customization Guide.

/etc/sysconfig/redhat-logviewer
The /etc/sysconfig/redhat-logviewer file is the configuration file for the graphical, interactive log viewing application, Log Viewer. This file is edited by the Edit => Preferences pull-down menu in the Log Viewer application and should not be edited by hand. For more information on using this application, see the chapter called Log Files in the Official Red Hat Linux Customization Guide.

/etc/sysconfig/samba
The /etc/sysconfig/samba file is used to pass arguments to the smbd and the nmbd daemons at boot time. The smbd daemon offers file sharing connectivity for Windows clients on the network. The nmbd daemon offers NetBIOS over IP naming services. For more information about what parameters you can use in this file, type man smbd. By default, this file sets smbd and nmbd to run in daemon mode.

/etc/sysconfig/sendmail
The /etc/sysconfig/sendmail file allows messages to be sent to one or more recipients, routing the message over whatever networks are necessary. The file sets the default values for the Sendmail application to run. Its default values are to run as a background daemon, and to check its queue once an hour in case something has backed up.

The following values may be used:

DAEMON=<value>, where <value> is one of the following boolean values:

yes - Sendmail should be configured to listen to port 25 for incoming mail. yes implies the use of Sendmail's -bd options.

no - Sendmail should not be configured to listen to port 25 for incoming mail.

QUEUE=1h which is given to Sendmail as -q$QUEUE. The -q option is not given to Sendmail if /etc/sysconfig/sendmail exists and QUEUE is empty or undefined.

/etc/sysconfig/soundcard
The /etc/sysconfig/soundcard file is generated by sndconfig and should not be modified. The sole use of this file is to determine what card entry in the menu to pop up by default the next time sndconfig is run. Sound card configuration information is located in the /etc/modules.conf file.

It may contain the following:

CARDTYPE=<value>, where <value> is set to, for example, SB16 for a Soundblaster 16 sound card.

/etc/sysconfig/squid
The /etc/sysconfig/squid file is used to pass arguments to the squid daemon at boot time. The squid daemon is a proxy caching server for Web client applications. For more information on configuring a squid proxy server, use a Web browser to open the /usr/share/doc/squid-<version>/ directory (replace <version> with the squid version number installed on your system). By default, this file sets squid top start in daemon mode and sets the amount of time before it shuts itself down.

/etc/sysconfig/tux
The /etc/sysconfig/tux file is the configuration file for the Red Hat Content Accelerator (formerly known as TUX), the kernel-based web server. For more information on configuring the Red Hat Content Accelerator, use a Web browser to open the /usr/share/doc/tux-<version>/tux/index.html (replace <version> with the version number of TUX installed on your system). The parameters available for this file are listed in /usr/share/doc/tux-<version>/tux/parameters.html.

/etc/sysconfig/ups
The /etc/sysconfig/ups file is used to specify information about any Uninterruptible Power Supplies (UPS) connected to your system. A UPS can be very valuable for a Red Hat Linux system because it gives you time to correctly shut down the system in the case of power interruption. The following values may be used:

SERVER=<value>, where <value> is one of the following:

yes - A UPS device is connected to your system.

no - A UPS device is not connected to your system.

MODEL=<value>, where <value> must be one of the following or set to NONE if no UPS is connected to the system:

apcsmart - For a APC SmartUPST or similar device.

fentonups - For a Fenton UPST.

optiups - For an OPTI-UPST device.

bestups - For a Best PowerT UPS.

genericups - For a generic brand UPS.

ups-trust425+625 - For a TrustT UPS.

DEVICE=<value>, where <value> specifies where the UPS is connected, such as /dev/ttyS0.

OPTIONS=<value>, where <value> is a special command that needs to be passed to the UPS.

/etc/sysconfig/vncservers
The /etc/sysconfig/vncservers file configures the way the Virtual Network Computing (VNC) server starts up.

VNC is a remote display system which allows you to view a desktop environment not only on the machine where it is running but across different networks on a variety of architectures.

It may contain the following:

VNCSERVERS=<value>, where <value> is set to something like "1:fred", to indicate that a VNC server should be started for user fred on display :1. User fred must have set a VNC password using vncpasswd before attempting to connect to the remote VNC server.

Note that when you use a VNC server, your communication with it is unencrypted, and so it should not be used on an untrusted network. For specific instructions concerning the use of SSH to secure the VNC communication, please read the information found at http://www.uk.research.att.com/vnc/sshvnc.html. To find out more about SSH, see Chapter 9 or Official Red Hat Linux Customization Guide.

/etc/sysconfig/xinetd
The /etc/sysconfig/xinetd file is used to pass arguments to the xinetd daemon at boot time. 
The xinetd daemon starts programs that provide Internet services when a request to the port for that service 
is received. For more information about what parameters you can use in this file, type man xinetd. 
For more information on the xinetd service, see the Section called Access Control Using xinetd in Chapter 8.

Directories in the /etc/sysconfig/ Directory
The following directories are normally found in /etc/sysconfig/ and a basic description of what they contain:

apm-scripts - This contains the Red Hat APM suspend/resume script. You should not edit this file directly. If you need customization, simple create a file called /etc/sysconfig/apm-scripts/apmcontinue and it will be called at the end of the script. Also, you can control the script by editing /etc/sysconfig/apmd.

cbq - This directory contains the configuration files needed to do Class Based Queuing for bandwidth management on network interfaces.

networking - This directory is used by the Network Administration Tool (redhat-config-network) and its contents should not be edited manually. For more information about configuring network interfaces using the Network Administration Tool, see the chapter called Network Configuration in the Official Red Hat Linux Customization Guide.

network-scripts - This directory contains the following network-related configuration files:

Network configuration files for each configured network interface, such as ifcfg-eth0 for the eth0 Ethernet interface.

Scripts used to bring up and down network interfaces, such as ifup and ifdown.

Scripts used to bring up and down ISDN interfaces, such as ifup-isdn and ifdown-isdn

Various shared network function scripts which should not be edited directly.

For more information on the network-scripts directory, see Chapter 12

rhn - This directory contains the configuration files and GPG keys for the Red Hat Network. No files in this directory should be edited by hand. For more information on the Red Hat Network, see the Red Hat Network website at the following URL: https://rhn.redhat.com.


39.5 AIX kernel parameters:
---------------------------

Througout this document, you can find many AIX kernel parameter statements.
Most commands are related to retrieving or changing attributes on the sys0 object.

Please see section 9.2 for a complete description.

For example, take a look at the following example:

  maxuproc:    Specifies the maximum number of processes per user ID. 
  Values:      Default: 40; Range: 1 to 131072 
  Display:     lsattr -E -l sys0 -a maxuproc 
  Change:      chdev -l sys0 -a maxuproc=NewValue 
               Change takes effect immediately and is preserved over boot. If value is reduced, 
               then it goes into effect only after a system boot. 
  Diagnosis:   Users cannot fork any additional processes. 
  Tuning:      This is a safeguard to prevent users from creating too many processes. 


Kernel Tunable Parameters
Following are kernel parameters, grouped into the following sections:

-Scheduler and Memory Load Control Tunable Parameters 
-Virtual Memory Manager Tunable Parameters 
-Synchronous I/O Tunable Parameters 
-Asynchronous I/O Tunable Parameters 
-Disk and Disk Adapter Tunable Parameters 
-Interprocess Communication Tunable Parameters
-Scheduler and Memory Load Control Tunable Parameters
-Most of the scheduler and memory load control tunable parameters are fully described in the schedo man page. 
-The following are a few other related parameters:


40. NFS:
========

On Solaris:
-----------

NFS uses a number of deamons to handle its services. These services are initialized at startup
from the "/etc/init.d/nfs.server" and "/etc/init.d/nfs.client" startup scripts.

nfsd:		handles filesystem exporting and file access from remote systems
mountd:		handles mount requests from nfs clients. provides also info about which filesystems
		are mounted by which clients. use the showmount command to view this information.
lockd:		runs on nfs server and nfs clients and provides locking services
statd:		runs on nfs server and nfs clients and provides crash and recovery functions for lockd
rpcbind:	facilitates the initial connection between client and server
nfslogd:	provides logging 


On AIX:
-------

To start the NFS daemons for each system, whether client or Server, you can use either

# smitty mknfs
# mknfs -N  (or -B or -I)

The mknfs command configures the system to rum the NFS daemons. The command also adds an entry 
to the /etc/inittab file, so that the /etc/rc.nsf file is executed on system restart.

mknfs flags:

-B: adds an entry to the inittab and it also executes /etc/rc.nsf to start the daemons now.
-I: adds an entry to the inittab to execute rc.nfs at system restart.
-N: executes rc.nfs now to start the daemons.

The NFS daemons can be started individually or all at once. To start individual daemons, you can use
the System Resource Controller:

# startsrc -s daemon, like e.g. # startsrc -s nfsd

To start the complete nfs system:
(good command)

# startsrc -g nfs


Exporting NFS directories:

To export filesystems using smitty, follow this procedure:

1. Verify that NFS is already running using the command "lssrc -g nfs". The output should indicate
that the nfsd and rpc.mountd daemons are active.

# lssrc -g nfs
Subsystem           Group         PID        Status
biod                nfs           1234       active
nfsd                nfs           5678       active
rpc.mountd          nfs           9101       active
rpc.statd           nfs           1213       active
rpc.lockd           nfs           1516       active

2. To export the dirctory use either

# smitty mknfsexp       or
# mknfsexp              or
# edit the /etc/exports file, like for example
  vi /etc/exports

  /home1
  /home2
  etc..

  
41. NETWORK COMMANDS AND FILES:
===============================


41.1 SOLARIS:
=============

ifconfig:
---------

ifconfig enables or disables a network interface, sets its IP address, subnet mask, and sets
various other options.

syntax: 
ifconfig interface address options .. up

Examples:

# ifconfig -a
Displays the systems IP address and mac address.

# ifconfig en0 128.138.240.1 netmask 255.255.255 up
# ifconfig lo0 127.0.0.1 up
# ifconfig en0 128.138.243.151 netmask 255.255.255.192 broadcast 128.138.243.191 up

An identifier as en0 identifies the network interface to which the command applies.
Some common names are ie0, le0, le1, en0, we0, qe0, hme0, eth0, lan0, lo0

Under Solaris, network interfaces must be attached with "ifconfig interface plumb"
before they become configurable.

rpcinfo:
--------

This utility can list all registered RPC services running on a system, for example 

# rpcinfo -p 192.168.1.21

You can also unregister an rpc service using the -d option, for example

#rpcinfo -d sprayd 1  

which would stop spayd


route:
------

The route command defines static routes.

Syntax:
route [-f] add/delete destination gateway [hop-count]

# route add default gateway_ipaddress


files:
------

- /etc/hostname.interface
The file contains the hostname or IP address associated with the networkinterface.
Suppose the system is called system1 and the interface is le0
then the file would be "hostname.le0" and contains the entry "system1".

- /etc/nodename
The file should contain one entry: the hostname of the local machine.

- /etc/defaultdomain
The file is present if the network uses a name service. The file should contain
one entry: the fully qualified Domain name of the administrative domain to which
the local host belongs.

- /etc/inet/hosts or /etc/hosts
This is the well known local hosts file, which resolves names to IP addresses.
The /etc/hosts is a symbolic link to /etc/inet/hosts.

- /etc/defaultrouter
This file should contain an entry for each router directly connected to the network.

- /etc/inetd.conf
The inetd deamon runs on behalf of other networkservices. It starts the appropriate server process
when a request for that service is received. The /etc/inetd.conf file lists the services that
inetd is to provide

- /etc/services
This file lists the well known ports.

- /etc/hosts.equiv
This file contains a list of trusted hosts for a remote system, one per line.
It has the following structure:
system1
system2 user_a

If the user attemps to login remotely by using rlogin from one of the hosts listed
in this file, the system allows the user to login without a password.

~/.rhosts

This file is the user equivalent of /etc/hosts.equiv file. This is normally regarded as a security hole.
This file could be found in a user home directory. It could contain the name of a remote host
that want, for example, copy files to this host.

- /etc/resolv.conf

Create or edit /etc/resolv.conf

Here you tell it three things:

 What domain we're in 
 Specify any additional search domains 
 What the nameservers are (it will use them in the order you put them in the file) 
 When you're done it should look something like this:


# cat resolv.conf
domain yourdomain.com
search yourdomain.com
search client1.com
nameserver 192.168.0.9
nameserver 192.168.0.11


41.2 AIX:
=========

41.2.1 Network initialization at boot:
------------------------------------

At IPL time, the init process will run the /etc/rc.tcpip after starting the SRC.
This is so because in /etc/inittab the following record is present:

rctcpip:23456789:wait:/etc/rc.tcpip > /dev/console 2>&1 # Start TCP/IP daemons

The /etc/rc.tcpip file is a shell script that uses SRC commands to initialize selected deamons.
It can also be executed at any time from the command line.
These deamons are:

inetd (started by default),gated,routed,named,timed,rwhod

There are also deamons specific to the bos or to other applications that can be started through
the rc.tcpip file. These deamons are lpd, portmap, sendmail, syslogd (started by default)

The subsystems started from rc.tcpip can be stopped and restarted using the stopsrc and startsrc commands.

Example:
# stopsrc -s inetd

To configure tcp/ip use the command

# mktcpip

or use smitty

# smitty mktcpip (only for the first time)
# smitty tcpip
# smitty inet  OR smitty chgenet (for configuring the network interface)
# smitty configtcp (many advanced options)

or use the Web-based System manager. 

Smitty uses a number of screens to guide you through the process, As an example of the command, take
a look at the following example:

# mktcpip -h server1 -a 10.10.10.5 -m 255.255.255.0 -i en0 \
-n 10.10.10.254 -d abc.xyz.nl -g 10.10.10.254 -s -C -A no

If you need to further configure your network, use

# smitty configtcp


41.2.2 resolving hostnames and /etc/netsvc.conf:
-----------------------------------------

The default order in resolving host names is: 

- BIND/DNS (named) 
- Network Information Service (NIS) 
- Local /etc/hosts file 

The default order can be overwritten by creating the configuration file, /etc/netsvc.conf and specifying 
the desired order. Both the default and /etc/netsvc.conf can be overwritten with the environment variable NSORDER. 

You can override the order by creating the /etc/netsvc.conf file with an entry. 
If /etc/netsvc.conf does not exist, it will be just like you have the following entry: 

hosts = bind,nis,local

You can override the order by changing the NSORDER environment variable. If it is not set, 
it will be just like you have issued the command: 

export NSORDER=bind,nis,local


the /etc/resolv.conf file:
--------------------------

If you use name services, you can provide the minimal information needed through the mktcpip command.
Typically, the "/etc/resolv.conf" file stores your domain name and name server ip addresses.
The mktcpip command creates or updates the /etc/resolv.conf file for you.


41.2.3 Adapter:
---------------

When an adapter is added to the system, a logical device is created in the ODM, for example 
Ethernet adapters as follows:

# lsdev -Cc adapter | grep ent
ent0   Available 10-80   IBM PCI Ethernet Adapter (22100020)
ent1   Available 20-60   Gigabit Ethernet-SX PCI Adapter (14100401)


So you will have an adapter, and a corresponding interface, like for example
The Adapter is       : ent0
Then the interface is: en0

To list all interfaces on the system, use:

# lsdev -Cc if
en0 Defined   10-80  Standard Ethernet Network Interface
en1 Defined   20-60  Standard Ethernet Network Interface
et0 Defined   10-80  IEEE 802.3 Ethernet Network INterface
et1 Defined   20-60  IEEE 802.3 Ethernet Network INterface
lo0 Available        Loopback Network INterface

A corresponding network interface will allow tcpip to use the adapter.
Most of the time, we will deal with auto-detectable adapters, but in some cases an interface might 
need to be created manually with
# smitty inet  or   smitty mkinet

To change or view attributes like duplex settings, use
# smitty chgenet 

more info:

An Ethernet can have 2 interfaces: Standard ethernet (enX) or IEEE 802.3 (etX). X is the same number 
in the entX adapter name, like for example ent0 and en0. Only one of these interfaces can be using 
TCPIP at a time. The adapter ent0 can have en0 and et0 interfaces.
An ATM adapter (atmX) can have only one atm interface (atX). For example ATM adapter atm0 has an at0 interface.


41.2.4 Other stuff:
-------------------

iptrace:
--------

The iptrace command can be used to record the packets that are exchanged on an interface to and from
a remote host. This is like a Solaris snoop facility.

Examples

  1. To start the iptrace daemon with the System Resource Controller (SRC),
     enter:
     startsrc -s iptrace -a "/tmp/nettrace"

     To stop the iptrace daemon with SRC enter the following:
     stopsrc -s iptrace

  2. To record packets coming in and going out to any host on every interface,
     enter the command in the following format:

     iptrace /tmp/nettrace

     The recorded packets are received on and sent from the local host. All
     packet flow between the local host and all other hosts on any interface is
     recorded. The trace information is placed into the /tmp/nettrace file.
  3. To record packets received on an interface from a specific remote host,
     enter the command in the following format:

     iptrace - i en0 -p telnet -s airmail /tmp/telnet.trace

     The packets to be recorded are received on the en0 interface, from remote
     hostairmail, over the telnet port. The trace information is placed into the
     /tmp/telnet.trace file.
  4. To record packets coming in and going out from a specific remote host,
     enter the command in the following format:

     iptrace -i en0 -s airmail -b /tmp/telnet.trace

     The packets to be recorded are received on the en0 interface, from remote
     hostairmail. The trace information is placed into the /tmp/telnet.trace
     file.


Adding routes:
--------------

Use smitty mkroute
or use the route add command, like for example:

# route add -net 192.168.1 -netmask 255.255.255.0 9.3.1.124

Changing the IP Address:
------------------------

You can check the interfaces whether they have IP addresses asigned to them with
# ifconfig -a
# ifconfig <interface>

Changing the IP adress:

# smitty mktcpip
# smitty chinet

or use the ifconfig command, like for example:

# ifconfig tr0 up                                   # activate interface
# ifconfig tr0 down                                 # deactivate interface
# ifconfig tr0 detach                               # removes the interface
# ifconfig tr0                                      # put it back again
# ifconfig tr0 delete                               # delete the IP address
# ifconfig en0 10.1.2.3 netmask 255.255.255.0 up    # configure IP params on the interface

You can even use the chdev command like:

# chdev -l en0 -a netaddr='9.3.240.58' -a netmask='255.255.255.0'

Smitty and chdev will update the ODM database, and makes changes permanent, while ifconfig commands will not.


host.equiv and .rhost files:
----------------------------

- /etc/hosts.equiv
This file contains a list of trusted hosts for a remote system, one per line.
It has the following structure:
system1
system2 user_a

If the user attemps to login remotely by using rlogin from one of the hosts listed
in this file, the system allows the user to login without a password.

~/.rhosts

This file is the user equivalent of /etc/hosts.equiv file. This is normally regarded as a security hole.

For example, to allow all the users on the host toaster and machine to login to the local host,
you would have a host.equiv file like

toaster
starboss

To allow only the user bob to login from starboss, you would have

toaster
starboss bob

To allow the user lester to login from any host, you would have

toaster
starboss bob
+ lester

Show statistics and collisions of an interface:
-----------------------------------------------

# entstat -d en0

This command shows Media speed and that kind of stuff etc..


Check the current routing table:
--------------------------------

# netstat -nr

Add or change routes can be done by using "smitty mkroute".

If your system is going to be configured as a static router (it has 2 or more network interface cards),
then it needs to be enabled as a router by the no command, that is the network option command, for example

# no -o ipforwarding=1

note:
-----

The no command is used to configure network attributes. The no commands sets or displays current 
network attributes in the kernel. It will only operate on the currently running kernel.
Whether the commands sets or displays an attribute is determined by the accompanying flag:
the -o flag performs both actions.

Some examples:

# no -o thewall=3072
# no -o tcp_sendspace=16384
# no -o ipqmaxlen=512       (controls the number of incoming packets that can exists on the IP interrupt queue)


# no -a

                 arpqsize = 12
               arpt_killc = 20
              arptab_bsiz = 7
                arptab_nb = 149
                bcastping = 0
      clean_partial_conns = 1
                 delayack = 0
            delayackports = {}
         dgd_packets_lost = 3
            dgd_ping_time = 5
           dgd_retry_time = 5
       directed_broadcast = 0
         extendednetstats = 0
                 fasttimo = 200
        icmp6_errmsg_rate = 10
          icmpaddressmask = 0
ie5_old_multicast_mapping = 0
                   ifsize = 256
          inet_stack_size = 16
               ip6_defttl = 64
                ip6_prune = 1
            ip6forwarding = 0
       ip6srcrouteforward = 0
       ip_ifdelete_notify = 0
                 ip_nfrag = 200
             ipforwarding = 0
                ipfragttl = 2
        ipignoreredirects = 1
                ipqmaxlen = 100
          ipsendredirects = 1
        ipsrcrouteforward = 0
           ipsrcrouterecv = 0
           ipsrcroutesend = 0
          llsleep_timeout = 3
                  lo_perf = 1
                lowthresh = 90
                 main_if6 = 0
               main_site6 = 0
                 maxnip6q = 20
                   maxttl = 255
                medthresh = 95
               mpr_policy = 1
              multi_homed = 1
                nbc_limit = 891289
            nbc_max_cache = 131072
            nbc_min_cache = 1
         nbc_ofile_hashsz = 12841
                 nbc_pseg = 0
           nbc_pseg_limit = 1048576
           ndd_event_name = {all}
        ndd_event_tracing = 0
            ndp_mmaxtries = 3
            ndp_umaxtries = 3
                 ndpqsize = 50
                ndpt_down = 3
                ndpt_keep = 120
               ndpt_probe = 5
           ndpt_reachable = 30
             ndpt_retrans = 1
             net_buf_size = {all}
             net_buf_type = {all}
        net_malloc_police = 0
           nonlocsrcroute = 0
                 nstrpush = 8
              passive_dgd = 0
         pmtu_default_age = 10
              pmtu_expire = 10
 pmtu_rediscover_interval = 30
              psebufcalls = 20
                 psecache = 1
             pseintrstack = 24576
                psetimers = 20
           rfc1122addrchk = 0
                  rfc1323 = 0
                  rfc2414 = 1
             route_expire = 1
          routerevalidate = 0
                 rto_high = 64
               rto_length = 13
                rto_limit = 7
                  rto_low = 1
                     sack = 0
                   sb_max = 1048576
       send_file_duration = 300
              site6_index = 0
               sockthresh = 85
                  sodebug = 0
              sodebug_env = 0
                somaxconn = 1024
                 strctlsz = 1024
                 strmsgsz = 0
                strthresh = 85
               strturncnt = 15
          subnetsarelocal = 1
       tcp_bad_port_limit = 0
                  tcp_ecn = 0
       tcp_ephemeral_high = 65535
        tcp_ephemeral_low = 32768
             tcp_finwait2 = 1200
           tcp_icmpsecure = 0
          tcp_init_window = 0
    tcp_inpcb_hashtab_siz = 24499
              tcp_keepcnt = 8
             tcp_keepidle = 14400
             tcp_keepinit = 150
            tcp_keepintvl = 150
     tcp_limited_transmit = 1
              tcp_low_rto = 0
             tcp_maxburst = 0
              tcp_mssdflt = 1460
          tcp_nagle_limit = 65535
        tcp_nagleoverride = 0
               tcp_ndebug = 100
              tcp_newreno = 1
           tcp_nodelayack = 0
        tcp_pmtu_discover = 0
            tcp_recvspace = 16384
            tcp_sendspace = 16384
            tcp_tcpsecure = 0
             tcp_timewait = 1
                  tcp_ttl = 60
           tcprexmtthresh = 3
                  thewall = 1048576
         timer_wheel_tick = 0
       udp_bad_port_limit = 0
       udp_ephemeral_high = 65535
        udp_ephemeral_low = 32768
    udp_inpcb_hashtab_siz = 24499
        udp_pmtu_discover = 0
            udp_recvspace = 42080
            udp_sendspace = 9216
                  udp_ttl = 30
                 udpcksum = 1
                 use_isno = 1
           use_sndbufpool = 1


rcp command:
------------

Purpose
Transfers files between a local and a remote host or between two remote hosts.

Syntax

rcp [ -p] [ -F] [ -k realm ] { { User@Host:File | Host:File | File } 
    { User@Host:File | Host:File | File | User@Host:Directory | Host:Directory | Directory } | 
    [ -r] { User@Host:Directory | Host:Directory |Directory } { User@Host:Directory | Host:Directory | Directory } }

-r Recursively copies 

Description
The /usr/bin/rcp command is used to copy one or more files between the local host and a remote host, 
between two remote hosts, or between files at the same remote host.

Remote destination files and directories require a specified Host: parameter. If a remote host name is not 
specified for either the source or the destination, the rcp command is equivalent to the cp command. 
Local file and directory names do not require a Host: parameter

- Using Standard Authentication
The remote host allows access if one of the following conditions is satisfied:

The local host is included in the remote host /etc/hosts.equiv file and the remote user is not the root user. 
The local host and user name is included in a $HOME/.rhosts file on the remote user account.
Although you can set any permissions for the $HOME/.rhosts file, it is recommended that the permissions 
of the .rhosts file be set to 600 (read and write by owner only).

In addition to the preceding conditions, the rcp command also allows access to the remote host if the 
remote user account does not have a password defined. However, for security reasons, the use of a password 
on all user accounts is recommended.

- For Kerberos 5 Authentication
The remote host allows access only if all of the following conditions are satisfied:

The local user has current DCE credentials. 
The local and remote systems are configured for Kerberos 5 authentication (On some remote systems, 
this may not be necessary. It is necessary that a daemon is listening to the klogin port). 
The remote system accepts the DCE credentials as sufficient for access to the remote account. 
See the kvalid_user function for additional information.

Examples:

In the following examples, the local host is listed in the /etc/hosts.equiv file at the remote host.

- To copy a local file to a remote host, enter: 

# rcp localfile host2:/home/eng/jane
The file localfile from the local host is copied to the remote host host2.

- The following example uses rcp to copy the local file, YTD_sum from the directory /usr/reports 
on the local host to the file year-end in the directory /usr/acct on the remote host moon: 

# rcp /usr/reports/YTD_sum  moon:/usr/acct/year-end 

- To copy a remote file from one remote host to another remote host, enter:  

# rcp host1:/home/eng/jane/newplan host2:/home/eng/mary
The file /home/eng/jane/newplan is copied from remote host host1 to remote host host2.

- To send the directory subtree from the local host to a remote host and preserve the modification times and modes, 
enter: 
# rcp  -p  -r report jane@host2:report

The directory subtree report is copied from the local host to the home directory of user jane 
at remote host host2 and all modes and modification times are preserved. 
The remote file /home/jane/.rhosts includes an entry specifying the local host and user name. 

Note:
rcp is ofcourse used to copy files between unix systems. On nt/w2k/xp computers, rcp could be available
with some different syntax, like
rcp [{-a | -b}] [-h] [-r] [Host][.User:] [Source] [Host][.User:] [Path\Destination]


Notes on the FTP services:
==========================

Note 1:
=======

Have a look at '/usr/lpp/tcpip/samples/anon.ftp'. It is a shell script
and will set up a anonymous ftp site on your local RS/6000.  Note: the
ftpd that comes with AIX does not support the display messages every
time a user changes a directory or even when they login.

Note 2:
=======

ftpd Daemon
Purpose
Provides the server function for the Internet FTP protocol.

Syntax
Note: The ftpd daemon is normally started by the inetd daemon. It can also be controlled from the command line, 
using SRC commands.
/usr/sbin/ftpd [ -d ] [ -k ] [ -l ] [ -t TimeOut ] [ -T MaxTimeOut ] [ -s ] [ -u OctalVal ]


Description
The /usr/sbin/ftpd daemon is the DARPA Internet File Transfer Protocol (FTP) server process. The ftpd daemon 
uses the Transmission Control Protocol (TCP) to listen at the port specified with the ftp command service 
specification in the /etc/services file. 

Changes to the ftpd daemon can be made using the System Management Interface Tool (SMIT) or 
System Resource Controller (SRC), by editing the /etc/inetd.conf or /etc/services file. 
Entering ftpd at the command line is not recommended. The ftpd daemon is started by default when it is 
uncommented in the /etc/inetd.conf file.

The inetd daemon gets its information from the /etc/inetd.conf file and the /etc/services file.

- The ftpaccess.ctl file:

The /etc/ftpaccess.ctl file is searched for lines that start with allow:, deny:, readonly:, writeonly:, 
readwrite:, useronly:, grouponly:, herald: and/or motd:. Other lines are ignored. If the file doesn't exist, 
then ftp access is allowed for all hosts. The allow: and deny: lines are for restricting host access. 
The readonly:, writeonly: and readwrite: lines are for restricting ftp reads (get) and writes (put). 
The useronly: and grouponly: lines are for defining anonymous users. The herald: and motd: lines are 
for multiline messages before and after login.

- If the current authentication method is the Standard Operating system authentication method:
Before the ftpd daemon can transfer files for a client process, it must authenticate the client process. 
The ftpd daemon authenticates client processes according to these rules:

The user must have a password in the password database, /etc/security/passwd. 
(If the user's password is not null, the client process must provide that password.) 
The user name must not appear in the /etc/ftpusers file. 
The user's login shell must appear in the shells attribute of the /etc/security/login.cfg file. 
If the user name is anonymous, ftp or is a defined anonymous user in the /etc/ftpaccess.ctl file, 
an anonymous FTP account must be defined in the password file. In this case, the client process 
is allowed to log in using any password. By convention, the password is the name of the client host. 
The ftpd daemon takes special measures to restrict access by the client process to the anonymous account.


Note 3:
=======

FTP memory-to-memory transfer 
This is useful for testing network performance between two machines while eliminating 
disk I/O (1 GB transfer example):

ftp> bin
ftp> put "| dd if=/dev/zero bs=512k count=2000" /dev/null


Note 4:
=======

Subject:	ftp, anonymous setup, troubleshooting - hp

Document Text
Title	    : How to setup anonymous ftp, and troubleshooting ftp
Date	    : 970828
Type	    : EN
Document ID : A4786122

Problem Description

Can you explain the proper setup of anonymous FTP and how to
troubleshoot any problems?

Configuration Info

Operating System -HP-UX
    Version -10.10
Hardware System - HP 9000
    Series -K400

Solution

Verification and setup of services:

1.   Verify that the following line is in /etc/inetd.conf and not
     commented out (there should be no # in the first column):

     10.X:
	ftp	     stream tcp nowait root /usr/lbin/ftpd	ftpd

     9.X:
	ftp	     stream tcp nowait root /etc/ftpd		ftpd

     or
     netstat -a |grep ftp
     the output should look like:

     tcp      0	    0  *ftp.		    *.*

2.   Verify the following services are in /etc/services and not
     commented out (with no # in the first column):

     ftp-data	   20/tcp	     # File Transfer Protocol (Data)
     ftp	   21/tcp	     # File Transfer Protocol (Control)

    *Note: If you are using NIS (Network Information Services)
	   then verify on the master server that these services
	   are available, or do 'ypcat services |grep ftp'

Creation of anonymous FTP:

If possible use SAM to create anonymous ftp by entering SAM Areas:
Networking and Communications, and then Networking Services.  Select
the desired service then choose Actions and Enable.  If this method is
either undesirable or you are experiencing difficulties with SAM
then do the following steps:

1.   Create an ftp user in /etc/passwd:

     10.X:
	ftp:*:500:1:Anonymous FTP user:/home/ftp:/usr/bin/false

     9.X:
	ftp:*:500:1:Anonymous FTP user:/users/ftp:/bin/false

	*Note: If UID 500 is not available, use a UID that
	 is not currently being used.
	*Note: GID 1 is usually group 'other', verify that group 'other'

	 does exist, and match its group ID in this field.

2.   Create a home directory for the ftp user that is owned by ftp and
     has permissions set to 0555:

     10.X:
	mkdir /home/ftp
	chmod 555 /home/ftp
	chown ftp:other /home/ftp

     9.X:
	mkdir /users/ftp
	chmod 555 /users/ftp
	chown ftp:other /users/ftp

3.   Create a bin directory that is owned by root and has
     permissions set to	 0555:

     10.X:
	mkdir -p /home/ftp/usr/bin
	chmod 555 /home/ftp/usr/bin /home/ftp/usr
	chown root /home/ftp/usr/bin /home/ftp/usr

	*Note: ftp structure has changed from 9.X to 10.x, there is
	 no longer a /home/ftp/bin.  The bin directory was moved to
	 be under /home/ftp/usr:

     9.X:
	mkdir /users/ftp/bin
	chmod 555 /users/ftp/bin
	chown root /users/ftp/bin

4.   Copy 'ls' to the new bin directory with permissions set to 0111:

     10.X:
	cp /sbin/ls /home/ftp/usr/bin/ls
	chmod 111 /home/ftp/usr/bin/ls

     9.X:
	cp /bin/ls /users/ftp/bin/ls
	chmod 111 /users/ftp/bin/ls

5.   Create an etc directory that is owned by root and has permissions
     of 0555:

     10.X:
	mkdir /home/ftp/etc
	chmod 555 /home/ftp/etc
	chown root /home/ftp/etc

     9.X:
	mkdir /users/ftp/etc
	chmod 555 /users/ftp/etc
	chown root /users/ftp/etc

     This directory should contain versions of the files passwd and
     group.  These files must be owned by root and have
     permissions of 0444:

     10.X:
	cp /etc/passwd /etc/group /home/ftp/etc
	chown root /home/ftp/etc/passwd /home/ftp/etc/group
	chmod 444 /home/ftp/etc/passwd /home/ftp/etc/group

     9.X:
	cp /etc/passwd /etc/group /users/ftp/etc
	chown root /users/ftp/etc/passwd /users/ftp/etc/group
	chmod 444 /users/ftp/etc/passwd /users/ftp/etc/group

6.   OPTIONAL:
     Create a dist directory that is owned by root and has permissions
     of 755.  Superuser can put read-only files in this directory to
     make them available to anonymous ftp users.

     10.X:
	mkdir /home/ftp/dist
	chown root /home/ftp/dist
	chmod 755 /home/ftp/dist

     9.X:
	mkdir /users/ftp/dist
	chown root /users/ftp/dist
	chmod 755 /users/ftp/dist

7.   OPTIONAL:
     Create a pub directory that is owned by ftp and writable by all.
     Anonymous ftp users can put files in this directory to make them
     available to other anonymous ftp users.

     10.X:
	mkdir /home/ftp/pub
	chown ftp:other /home/ftp/pub
	chmod 777 /home/ftp/pub

     9.X:
	mkdir /users/ftp/pub
	chown ftp:other /users/ftp/pub
	chmod 777 /users/ftp/pub


Troubleshooting FTP:

1.   Verify the installation steps.

2.   If receiving message: ftp: connect: Connection refused.

     Verify that inetd is running by entering 'ps -ef|grep inetd'.
     You should see output like:

     root  3730	 2217  1 13:54:57 ttyp2	    0:00 grep inetd
     root  2324	    1  0 13:43:28 ?	    0:00 inetd

     *Note: You may not see the grep process.
     If inetd is not currently running, then as root type 'inetd'

3.   If receiving either message: 530 access denied login failed,
     or 530 User [name] access denied.

     A.	  Verify netrc. in the user's home directory.
	  If the netrc. file contains password or account information
	  for use other than for anonymous ftp, its owner must match
	  the effective user ID of the current process.	 Its read,

	  write, and execute permission bits for group and other must
	  all be zero, and it must be readable by its owner.
	  Otherwise, the file is ignored.

	  So if you are unsure about this file, rename it to netrc.old.
	  for troubleshooting purposes.

     B.	  Check /etc/ftpusers.
	  ftpd rejects remote logins to local user accounts that are
	  named in /etc/ftpusers.  Each restricted account name must
	  appear alone on a line in the file.  The line cannot contain
	  any white space.  User accounts that specify a restricted
	  login shell in /etc/passwd should be listed in /etc/ftpusers
	  because ftpd accesses local accounts without using their
	  login shells.

     C.	  You need to add or verify /etc/shells.
	  /etc/shells is an ASCII file containing a list of legal shells
	  on the system.  Each shell is listed in the file by its
	  absolute path name. To learn more about this file, run 'man
	  shells'.  To see the legal shells for your system run 'man
	  getusershell'.  This will list all valid shells for your
	  system.  If you use both 9.X and 10.X environments, include
	  the shells for both operating systems.

	  Example entries:

	  /bin/sh	   <<<-
	  /bin/rsh	       |
	  /bin/ksh	       |
	  /bin/rksh		> 9.X valid shells
	  /bin/csh	       |
	  /bin/pam	       |
	  /usr/bin/keysh       |
	  /bin/posix/sh	   <<<-

	  /sbin/sh	   <<<-
	  /usr/bin/sh	       |
	  /usr/bin/rsh	       |
	  /usr/bin/ksh		> 10.X valid shells
	  /usr/bin/rksh	       |
	  /usr/bin/csh	       |
	  /usr/bin/keysh   <<<-

	  All shells referred to in /etc/passwd or in the NIS passwd map
	  should be valid shells or links on this system and be listed
	  in /etc/shells.

4.   If receiving message: ftp: ftp/tcp: unknown service.

     Check your /etc/services file.  If you make a change to
     /etc/services, you must force the system to recognize the new
     changes by typing:
	  inetd -c

     Verify that permissions for /etc/services are 444 (-r--r--r--).

5.   If receiving message: 421 Service not available, remote server
     has closed connection.

     Verify that /var/adm/inetd.sec does not contain an ftp entry of
     either deny or allow.  When you allow one user, you deny all other
     users.  For troubleshooting purposes you could rename
     /var/adm/inetd.sec to /var/adm/inetd.sec.old.  inetd.sec is not
     needed unless you have a need for tightened security beyond login
     verification.

6.   If receiving message: 150 Opening ASCII mode data connection for
     /usr/bin/ls. crt0: ERROR couldn't open /usr/lib/dld.sl
     errno:000000002.

     You have the wrong version of the command ls in /home/ftp/usr/bin.
     To resolve this execute:
	  cp /sbin/ls /home/ftp/usr/bin/ls


Note 5:
=======

ftpd(1M), the file transfer protocol server, is run by the Internet daemon (see inetd(1M)) when a service request 
is received at the port indicated in /etc/services.

ftpd rejects remote logins to local user accounts named in /etc/ftpusers. Each restricted account name must appear 
by itself on a line in the file. The line cannot contain any spaces or tabs. User accounts with restricted 
login shells in /etc/passwd should be listed in /etc/ftpusers, because ftpd accesses local accounts without 
using their login shells. uucp accounts also should be listed in /etc/ftpusers. If /etc/ftpusers does not exist, 
ftpd skips the security check.


Note 6:
=======

On HP-UX:

Symptom: Some or all users can't ftp to an HP-UX system. 

If no users can ftp to a given system, check first of all that inetd is running on that system:

# ps -ef | grep inetd 
 
If inetd is not running, start it: 

It is also possible that the FTP service is disabled. Check /etc/inetd.conf for the following line: 
 
FTP stream tcp nowait root /usr/lbin/FTPd FTPd -l 

If this line does not exist, or is commented out (preceded by a pound sign, (#) add it (or remove the pound sign) 
and restart inetd:

# /usr/sbin/inetd -c 


Note 7:
=======

There are five files used to hold FTP configuration information. These files are listed here:

/etc/ftpd/ftpaccess       The primary configuration file defining the operation of the ftpd daemon. 
/etc/ftpd/ftpconversions  Defines options for compression/decompression and tar/untar operations.  
/etc/ftpd/ftphosts        Lets you allow/deny FTP account access according to source IP addresses and host names. 
/etc/ftpd/ftpusers        Restricts FTP access for specified users. For more information see ftpusers(4).
/etc/ftpd/ftpgroups       The group password file for use with the SITE GROUP and SITE GPASS commands.  


The /etc/ftpd/ftpaccess configuration file is the primary configuration file for defining how the 
ftpd daemon operates. It is not necessary to enable the ftpacess file inorder to run ftpd. 

The configuration files allow you to configure FTP features, such as the number of FTP login tries permitted, 
FTP banner displays, logging of incoming and outgoing file transfers, access permissions, 
use of regular expressions, etc. For complete details on these files, see the ftpaccess(4), ftpgroups(4), 
ftpusers(4), ftphosts(4), and ftpconversion(4) manpages.

- If the ftpaccess file is enabled:

Settings in the ftpaccess file override any similar settings in the other files.
Any settings in the other files that are not present in ftpaccess are treated as supplemental or additional 
configuration information.

- If the ftpaccess file is disabled:

The settings in the ftpusers, ftphosts, and ftpconversion files will be used.
The ftpgroups file will not be used.

Enabling/Disabling the /etc/ftpd/ftpaccess Configuration File 
 
-- To enable the /etc/ftpd/ftpaccess file, specify the -a option for the ftp entry in the /etc/inetd.conf file. 
For example, 

ftp  stream tcp nowait root /usr/lbin/ftpd ftpd -a -l -d
(The -l option logs all commands sent to the ftpd server into syslog. The -d option logs debugging information 
into syslog.)

-- To disable the /etc/ftpd/ftpaccess file, specify the -A option for the ftp entry in the /etc/inetd.conf file. 
For example,

ftp  stream tcp nowait root /usr/lbin/ftpd ftpd -A -L -d


Note 8: ftp commandline and batches:
------------------------------------

It can be interresting if you transfer a file with ftp from a scheduled script.
Here are some examples on how to do this:

Example 1:
----------

#!/usr/bin/ksh
ftp -v -n "YOUR.IP.ADD.RESS" << cmd
user "user" "passwd"
cd /distant/directory
lcd /local/directoryget ssh_install
get ( or put) your files
quit
cmd


Example 2:
----------

autounix.sh 

#!/bin/ksh 

# Declaring all the variables 
s_filepath='/sap/usr/sap/trans/data/' 
s_backuppath='/sap/usr/sap/trans/data/autozip/' 
s_unixfile1=$s_filepath'FILE1' 
s_unixfile2=$s_filepath'FILE2' 
s_unixfile3=$s_filepath'FILE3' 
  

# This has been changed to accepting parameter pass in as date 
#s_date=`date '+%Y%m%d'` 
s_date=$1 
s_filename='SAP.'$s_date'.ZIP' 
s_donefilename=$s_filename'.DONE' 

# Execute the zip command 

/usr/local/bin/pkzip -add -pass=test123 $s_backuppath$s_filename $s_unixfile1 $s_unixfile2 $s_unixfile3 

# Execute the FTP transfer 
user='ftp' 
passwd='ftp1234' 
destdir='data/test' 

cd $s_backuppath 
ftp -in ftp-out.sapservx.com << EndHere 
   user $user $passwd 
   cd $destdir 
   bin 
   put $s_filename 
   rename $s_filename $s_donefilename 
   quit 
EndHere 


41.3 Linux:
===========

Much of the above network related commands, like ifconfig, applies to Linux distro's as well.
But many items in sections 41.1 (Solaris) and 41.2 (AIX), is specific to those Operating Systems.
 
Here we describe some specifics for Linux.


41.3.1 About TCP Wrappers:
--------------------------

- What is it?

TCP wrappers and xinetd control access to services by hostname and IP addresses. In addition, these tools 
also include logging and utilization management capabilities that are easy to configure. 
TCP wrappers is installed by default with a server-class installation of Red Hat Linux 8.0, and provides 
access control to a variety of services. Most modern network services, such as SSH, Telnet, and FTP, 
make use of TCP wrappers, a program that is designed to stand guard between an incoming request 
and the requested service. 

The idea behind TCP wrappers is that client requests to server applications are "wrapped" by an 
authenticating service, allowing a greater degree of access control and logging for anyone attempting 
to use the service. 
The functionality behind TCP wrappers is provided by libwrap.a, a library that network services, 
such as xinetd, sshd, and portmap, are compiled against. Additional network services, even networking programs 
you may write, can be compiled against libwrap.a to provide this functionality. Red Hat Linux bundles 
the necessary TCP wrapper programs and library in the tcp_wrappers-<version> RPM file. 

- Host-Based Access Control Lists

Host-based access for services that use TCP wrappers is controlled by two files: 

/etc/hosts.allow and /etc/hosts.deny. 

These file use a simple format to control access to services on a server. 
If no rules are specified in either hosts.allow or hosts.deny, then the default rule is to allow anyone 
to access to the services. 
Order is important since rules in hosts.allow take precedence over rules specified in hosts.deny. 
Even if a rule specifically denying all access to a particular service is defined in hosts.deny, 
hosts specifically given access to the service in hosts.allow are allowed to access the service. 
In addition, all rules in each file take effect from the top down. 
Any changes to these files take effect immediately, so restarting services is not required. 

Formatting Rules
All access control rules are placed on lines within hosts.allow and hosts.deny, and any blank lines 
or lines that start with the comment character (#) are ignored. Each rule needs to be on its own line. 

The rules must be formatted in the following manner: 

<daemon_list>: <client_list>[: spawn <shell_command> ]
 
Patterns are particularly helpful when specifying groups of clients that may or may not access a certain service. 
By placing a "." character at the beginning of a string, all hosts that share the end of that string 
are applied to that rule. So, .domain.com would catch both system1.domain.com and system2.domain.com. 
The "." character at the end of a string has the same effect, except going the other direction. 
This is primarily used for IP addresses, as a rule pertaining to 192.168.0. would apply to the entire 
class C block of IP addresses. Netmask expressions can also be used as a pattern to control access to a 
particular group of IP addresses. You can even use asterisks (*) or question marks (?) to select entire 
groups of hostnames or IP addresses, so long as you do not use them in the same string as the other 
types of patterns. 

This access control "language" can be extended with the following wildcards. They may be used in the access 
control rules instead of using specific hosts or groups of hosts: 

ALL      - Matches every client with a service. To allow a client access to all services, 
           use the ALL in the daemons section. 
LOCAL    - Matches any host that does not contain a "." character. 
KNOWN    - Matches any host where the hostname and host address are known or where the user is known. 
UNKNOWN  - Matches any host where the hostname or host address are unknown or where the user is unknown. 
PARANOID - Matches any host where the hostname does not match the host address. 

You can use the above wildcards in combination with the EXCEPT operator.

Example:

# all domain.com hosts are allowed to connect
# to all services except cracker.domain.com
ALL: .domain.com EXCEPT cracker.domain.com

# 123.123.123.* addresses can use all services except FTP
ALL EXCEPT in.ftpd: 123.123.123.

Users that wish to prevent any hosts other than specific ones from accessing services usually place 
ALL: ALL in hosts.deny. Then, they place lines in hosts.allow, such as: 

in.telnetd: 10.0.1.24
in.ftpd: 10.0.1. EXCEPT 10.0.1.1
 
- Shell commands:

Beyond simply allowing or denying access to services for certain hosts, the TCP wrappers also supports 
the use of shell commands. These shell commands are most commonly used with deny rules to set up booby traps, 
which usually trigger actions that log information about failed attempts to a special file or email 
an administrator. Below is an example of a booby trap in the hosts.deny file which will write a log line 
containing the date and client information every time a host from the the IP range 10.0.1.0 to 10.0.1.255 
attempts to connect via Telnet: 

in.telnetd: 10.0.1.: spawn (/bin/echo `date` %c >> /var/log/telnet.log) &
 
The following expansions can be used:

%a - The client's IP address.
%A - The server's IP address.
%c - Supplies a variety of client information, such as the username and hostname, or the username and IP address. 
%d - The daemon process name.
%h - The client's hostname (or IP address, if the hostname is unavailable). 
%H - The server's hostname (or IP address, if the hostname is unavailable). 
%n - The client's hostname. If unavailable, unknown is printed.  
%N - The server's hostname. If unavailable, unknown is printed.  
%p - The daemon process ID.
%s - Various types of server information, such as the daemon process and the host or IP address of the server. 
%u - The client's username. If unavailable, unknown is printed. 


41.3.2 About xinetd:
--------------------

- Access Control Using xinetd
The benefits offered by TCP wrappers are enhanced when the libwrap.a library is used in conjunction 
with xinetd, a super-daemon that provides additional access, logging, binding, redirection and resource 
utilization control. 

Red Hat Linux configures a variety of popular network services to be used with xinetd, including FTP, 
IMAP, POP, and Telnet. When any of these services are accessed via their port numbers in /etc/services, 
the xinetd daemon handles the request. Before bringing up the requested network service, xinetd ensures 
that the client host information meets the access control rules, the number of instances of this service 
is under a particular threshold, and any other rules specified for that service or all xinetd services 
are followed. Once the target service is brought up for the connecting client, xinetd goes back to sleep, 
waiting for additional requests for the services it manages. 

- xinetd Configuration Files
The xinetd service is controlled by the "/etc/xinetd.conf" file, as well as the various service-specific 
files in the "/etc/xinetd.d/" directory. 
The xinetd.conf file is the parent of all xinetd-controlled service configuration files, as the 
service-specific files are also parsed every time xinetd starts. By default, xinetd.conf contains some basic 
configuration settings that apply to every service. Below is an example of a typical xinetd.conf: 

defaults
{
        instances               = 60
        log_type                = SYSLOG authpriv
        log_on_success          = HOST PID
        log_on_failure          = HOST
        cps                     = 25 30
}

includedir /etc/xinetd.d
 
- Files in the /etc/xinetd.d/ Directory
The files in the /etc/xinetd.d/ directory are read every time xinetd starts, due to the includedir 
/etc/xinetd.d/ statement at the bottom of /etc/xinetd.conf. These files, with names such as finger, 
ipop3, and rlogin, correlate to the services controlled by xinetd. 
The files in /etc/xinetd.d/ use the same conventions as /etc/xinetd.conf. The primary reason they are stored 
in separate configuration files is to make it easier to add and remove a service from xinetd without affecting 
other services. 

To get an idea of how these files are structured, consider the wu-ftp file: 

service ftp
{
        socket_type             = stream
        wait                    = no
        user                    = root
        server                  = /usr/sbin/in.ftpd
        server_args             = -l -a
        log_on_success          += DURATION USERID
        log_on_failure          += USERID
        nice                    = 10
        disable                 = yes
}
 

The first line defines the service's name. The lines within the brackets contain settings that define how this 
service is supposed to be started and used. The wu-ftp file states that the FTP service uses a 
stream socket type (rather than dgram), the binary executable file to use, the arguments to pass 
to the binary, the information to log in addition to the /etc/xinetd.conf settings, the priority with which 
to run the service, and more. 

The use of xinetd with a service also can serve as a basic level of protection from a 
Denial of Service (DoS) attack. The max_load option takes a floating point value to set a CPU usage 
threshold when no more connections for a particular service will be accepted, preventing certain services 
from overwhelming the system. The cps option accepts an integer value to set a rate limit on the number 
of connections available per second. Configuring this value to something low, such as 3, will help prevent 
attackers from being able to flood your system with too many simultaneous requests for a particular service. 

The xinetd host access control available through its various configuration files is different from 
the method used by TCP wrappers. While TCP wrappers places all of the access configuration within two files, 
/etc/hosts.allow and /etc/hosts.deny, each service's file in /etc/xinetd.d can contain access control rules 
based on the hosts that will be allowed to use that service. 

For example, the following /etc/xinetd.d/telnet file can be used to block telnet access to a system 
by a particular network group and restrict the overall time range that even legitimate users can log in: 

service telnet
{
        disable         = no
        flags           = REUSE
        socket_type     = stream
        wait            = no
        user            = root
        server          = /usr/sbin/in.telnetd
        log_on_failure  += USERID
        no_access       = 10.0.1.0/24
        log_on_success  += PID HOST EXIT
        access_times    = 09:45-16:15
}
 

In this example, when any system from the 10.0.1.0/24 subnet, such as 10.0.1.2, tries to telnet into the server, 
they will receive a message stating Connection closed by foreign host. In addition, their login attempt 
is logged in /var/log/secure.


41.3.3 Linux Network files:
---------------------------

- Network Scripts

Using Red Hat Linux, all network communications occur between configured interfaces and physical 
networking devices connected to the system. The different types of interfaces that exist are as varied 
as the physical devices they support. 
The configuration files for network interfaces and the scripts to activate and deactivate them are located in the 

"/etc/sysconfig/network-scripts/" directory. 

While the existence of interface files can differ from system to system, the three different types of files 
that exist in this directory, interface configuration files, interface control scripts, and network 
function files, work together to enable Red Hat Linux to use various network devices. 
This chapter will explore the relationship between these files and how they are used. 

- Network Configuration Files

Before we review the interface configuration files themselves, let us itemize the primary configuration files 
used by Red Hat Linux to configure networking. Understanding the role these files play in setting up the 
network stack can be helpful when customizing your system. 

The primary network configuration files are as follows: 

/etc/hosts - The main purpose of this file is to resolve hostnames that cannot be resolved any other way. 
             It can also be used on resolve hostnames on small networks with no DNS serer. Regardless of the 
             type of network the computer is on, this file should contain a line specifying the IP address 
             of the loopback device (127.0.0.1) as localhost.localdomain.  

/etc/resolv.conf - This file specifies the IP addresses of DNS servers and the search domain. 
                   Unless configured to do otherwise, the network initialization scripts populate this file. 

/etc/sysconfig/network - Specifies routing and host information for all network interfaces.  

/etc/sysconfig/network-scripts/ifcfg-<interface-name> - For each network interface on a Red Hat Linux system, 
                                                        there is a corresponding interface configuration script. 
                                                        Each of these files provide information specific to a 
                                                        particular network interface.
Caution 
The "/etc/sysconfig/networking/" directory is used by the Network Administration Tool (redhat-config-network) 
and its contents should not be edited manually.
 

- Interface Configuration Files

Interface configuration files control the operation of individual network interface device. 
As your Red Hat Linux system boots, it uses these files to determine what interfaces to bring up and how 
to configure them. These files are usually named "ifcfg-<name>", where <name> refers to the name of the device 
that the configuration file controls. 

Ethernet Interfaces
One of the most common interface files is ifcfg-eth0, which controls the first network interface card or 
NIC in the system. In a system with multiple NICs, you will also have multiple ifcfg-eth files, 
each one with a unique number at the end of the file name. Because each device has its own configuration file, 
you can control how each interface functions individually. 

Below is a sample "/etc/sysconfig/network-scripts/ifcfg-eth0" file for a system using a fixed IP address: 

DEVICE=eth0
BOOTPROTO=none
ONBOOT=yes
NETWORK=10.0.1.0
NETMASK=255.255.255.0
IPADDR=10.0.1.27
USERCTL=no
 
The values required in an interface configuration file can change based on other values. 
For example, the ifcfg-eth0 file for an interface using DHCP looks quite a bit different, 
because IP information is provided by the DHCP server: 

DEVICE=eth0
BOOTPROTO=dhcp
ONBOOT=yes

Most of the time you will probably want to use a GUI utility, such as Network Administration Tool 
(redhat-config-network) to make changes to the various interface configuration files. 

You can also edit the configuration file for a given network interface by hand. Below is a listing of the parameters 
one can expect to configure in an interface configuration file. 

Within each of the interface configuration files, the following values are common: 

BOOTPROTO=<protocol>, where <protocol> is one of the following: 
 none - No boot-time protocol should be used. 
 bootp - The BOOTP protocol should be used. 
 dhcp - The DHCP protocol should be used. 

BROADCAST=<address>, where <address> is the broadcast address. This directive is deprecated. 

DEVICE=<name>, where <name> is the name of the physical device (except dynamically-allocated PPP devices 
              where it is the logical name). 

DNS{1,2}=<address>, where <address> is a name server address to be placed in /etc/resolv.conf if the PEERDNS 
                    directive is set to yes. 

IPADDR=<address>, where <address> is the IP address. 

NETMASK=<mask>, where <mask> is the netmask value. 

NETWORK=<address>, where <address> is the network address. This directive is deprecated. 

ONBOOT=<answer>, where <answer> is one of the following: 

 yes - This device should be activated at boot-time. 
 no - This device should not be activated at boot-time. 

PEERDNS=<answer>, where <answer> is one of the following: 

 yes - Modify /etc/resolv.conf if the DNS directive is set. If you are using DCHP, then yes is the default. 
 no - Do not modify /etc/resolv.conf. 

SRCADDR=<address>, where <address> is the specified source IP address for outgoing packets. 

USERCTL=<answer>, where <answer> is one of the following: 

 yes - Non-root users are allowed to control this device. 
 no - Non-root users are not allowed to control this device. 


- Network Functions

Red Hat Linux makes use of several files that contain important functions that are used in various ways 
to bring interfaces up and down. Rather than forcing each interface control file to contain the same functions 
as another, these functions are grouped together in a few files that can be sourced when needed. 

The most common network functions file is network-functions, located in the /etc/sysconfig/network-scripts/ directory. 
This file contains a variety of common IPv4 functions useful to many interface control scripts, such as 
contacting running programs that have requested information about changes in an interface's status, setting 
host names, finding a gateway device, seeing if a particular device is down or not, and adding a default route. 

As the functions required for IPv6 interfaces are different than IPv4 interfaces, a network-functions-ipv6 file 
exists specifically to hold this information. IPv6 support must be enabled in the kernel in order to communicate 
via that protocol. A function is present in this file that checks for the presence of IPv6 support. 
Additionally, functions that configure and delete static IPv6 routes, create and remove tunnels, add and 
remove IPv6 addresses to an interface, and test for the existence of an IPv6 address on an interface can also 
be found in this file. 


41.3.4 Linux packet filtering :
-------------------------------

Linux comes with advanced tools for packet filtering - the process of controlling network packets as they enter, 
move through, and exit the network stack within the kernel. Pre-2.4 kernels relied on ipchains for 
packet filtering and used lists of rules applied to packets at each step of the filtering process. 
The introduction of the 2.4 kernel brought with it iptables (also called netfilter), which is similar 
to ipchains but greatly expands on the scope and control available for filtering network packets. 

This chapter focuses on packet filtering basics, defines the differences between ipchains and iptables, 
explains various options available with iptables commands, and shows how filtering rules can be preserved 
between system reboots. 

Warning 
The default firewall mechanism under the 2.4 kernel is iptables, but iptables cannot be used if ipchains 
are already running. If ipchains are present at boot time, the kernel will issue an error and fail 
to start iptables. 

- Packet Filtering

Traffic moves through a network in packets. A network packet is collection of data in a specific size 
and format. In order to transmit a file over a network, the sending computer must first break the file 
into packets using the rules of the network protocol. Each of these packets holds a small part of the file data. 
Upon receiving the transmission, the target computer reassembles the packets into the file. 

Every packet contains information which helps it navigate the network and move toward its destination. 
The packet can tell computers along the way, as well as the destination machine, where it came from, 
where it is going, and what type of packet it is, among other things. Most packets are designed to carry data, 
although some protocols use packets in special ways. For example, the Transmission Control Protocol (TCP) 
uses a SYN packet, which contains no data, to initiate communication between two systems. 

The Linux kernel contains the built-in ability to filter packets, allowing some of them into the system 
while stopping others. The 2.4 kernel's netfilter has three built-in tables or rules lists. They are as follows: 

 filter - This is the default table for handling network packets. 
 nat - This table used to alter packets that create a new connection. 
 mangle - This table is used for specific types of packet alteration. 

Each of these tables in turn have a group of built-in chains which correspond to the actions performed 
on the packet by the netfilter. 

The built-in chains for the filter table are as follows: 

 INPUT - This chain applies to packets received via a network interface. 
 OUTPUT - This chain applies to packets sent out via the same network interface which received the packets. 
 FORWARD - This chain applies to packets received on one network interface and sent out on another. 

The built-in chains for the nat table are as follows: 

 PREROUTING - This chain alters packets received via a network interface when they arrive. 
 OUTPUT - This chain alters locally-generated packets before they are routed via a network interface. 
 POSTROUTING - This chain alters packets before they are sent out via a network interface. 

The built-in chains for the mangle table are as follows: 

 PREROUTING - This chain alters packets received via a network interface before they are routed. 
 OUTPUT - This chain alters locally-generated packets before they are routed via a network interface. 

Every network packet received by or sent out of a Linux system is subject to at least one table. 
A packet may be checked against multiple rules within each rules list before emerging at the end of the chain. 
The structure and purpose of these rules may vary, but they usually seek to identify a packet coming from 
or going to a particular IP address or set of addresses when using a particular protocol and network service. 
Regardless of their destination, when packets match a particular rule on one of the tables, they are 
designated for a particular target or action to be applied to them. If the rule specifies an ACCEPT target 
for a matching packet, the packet skips the rest of the rule checks and is allowed to continue to 
its destination. If a rule specifies a DROP target, that packet is refused access to the system and nothing 
is sent back to the host that sent the packet. If a rule specifies a REJECT target, the packet is dropped, 
but an error packet is sent to the packet's originator. 

Every chain has a default policy to ACCEPT, DROP, REJECT, or QUEUE the packet to be passed to user-space. 
If none of the rules in the chain apply to the packet, then the packet is dealt with in accordance 
with the default policy. 

The iptables command allows you to configure these rule lists, as well as set up new tables to be used 
for your particular situation. 

- iptables command:


41.3.5 Redhat and BIND:
-----------------------

BIND as a Nameserver:
Red Hat Linux includes BIND, which is a very popular, powerful, open source nameserver. BIND uses the named 
daemon to provide name resolution services. 

BIND version 9 also includes a utility called /usr/sbin/rndc which allows the administration of the running 
named daemon. More information about rndc can be found in the Section called Using rndc. 


41.4: tcpip timeouts:
---------------------

Note 1:
-------

The defaults for TCP Timeouts are:
AIX: 75 seconds
Solaris: 180 Seconds
NT: 9 Seconds

To view: 
# /usr/sbin/no -o tcp_keepinit 
The output should be something like:
tcp_keepinit = 150

To set:
# /usr/sbin/no -d tcp_keepinit 100


Note 2:
-------

Changing the TCP/IP timeout setting on your event server
If the Situation Update Forwarder cannot reach a monitoring server to send an update, depending on the TCP/IP settings 
for the computer where your event server is running, it could be up to 15 minutes before the Situation Update Forwarder tries 
to connect to the monitoring server again. This might occur if your event server is running on an AIXr, Solaris, or HP-UX computer.

Use the following steps to change the TCP/IP timeout for your computer.

On AIX, run the following command:

no -o tcp_keepinit=<timeout_value>

where <timeout_value> is the length of the timeout period, in half seconds. To configure a timeout of 30 seconds, 
set the <timeout_value> value to 60.

On Solaris and HP-UX, run the following command:

ndd -set /dev/tcp tcp_ip_abort_cinterval <timeout_value>

where <timeout_value> is the length of the timeout period, in milliseconds. To configure a timeout of 30 seconds, 
set the <timeout_value> value to 30000.


========================
42. SOME NOTES ON IPSEC:
========================


This section describes some important features of the IPSec implementations on AIX, HP-UX and Linux Redhat.


42.1 What is IPSec?
===================


IP Security, known commonly as IPSec, is a protocol developed by the Internet Engineering Task Force (IETF), 
designed to provide "end-to-end" Authentication and/or cryptographically-based security for IP network connections. 
Though not yet an official standard, compatible IPSec implementations are available for almost 
all modern operating systems. Inclusion of IPSec is required in every IPv6 implementation, 
and it has been designed to work equally well with the more common IPv4 system currently in use 
by most public and private networks.

All IP Security implementations include a common set of protocols and tools to enable interoperatability 
between different platforms, and provide the following three benefits:

- Authentication: proof that the identity of the host on the other end of the connection is valid and correct. 
- Integrity Checking: assurance that no data sent over the network connection was modified in transit. 
- Encryption: the rendering of network communications indecipherable to anyone who might intercept the transmitted data. 

IPSec implementations also include a method of restricting connections to various services, 
based on their origin and destination. This feature, often present in firewall devices, 
is known as packet filtering.

IPsec protocols operate at the network layer, layer 3 of the OSI model. Other Internet security protocols 
in widespread use, such as SSL, TLS and SSH, operate from the transport layer up (OSI layers 4 - 7). 
This makes IPsec more flexible, as it can be used for protecting layer 4 protocols, including both TCP and UDP, 
the most commonly used transport layer protocols. IPSec has an advantage over SSL and other methods that operate 
at higher layers. For an application to use IPsec no code change in the applications is required whereas 
to use SSL and other higher level protocols, applications must undergo code changes.

IPsec was intended to provide either "transport mode" (end-to-end) security of packet traffic in which 
the end-point computers do the security processing, or "tunnel mode" (portal-to-portal) communications security 
in which security of packet traffic is provided to several machines (even to whole LANs) by a single node.

IPsec can be used to create Virtual Private Networks (VPN) in either mode, and this is the dominant use. 
Note, however, that the security implications are quite different between the two operational modes.

End-to-end communication security on an Internet-wide scale has been slower to develop than many had expected. 
Part of the reason is that no universal, or universally trusted, Public Key Infrastructure (PKI) has emerged 
(DNSSEC was originally envisioned for this); another part is that many users understand neither their needs 
nor the available options well enough to promote inclusion in vendors' products.
This is why a "shared key" (or symmetric key) is used in IPSec. Both the sender and receiver must use the same key.


-- Transport mode
-- --------------

In transport mode, only the payload (the data you transfer) of the IP packet is authenticated and/or encrypted. 
The routing is intact, since the IP header is neither modified nor encrypted; however, when the authentication 
header is used, the IP addresses cannot be translated, as this will invalidate the hash value. The transport 
and application layers are always secured by hash, so they cannot be modified in any way (for example by 
translating the port numbers). Transport mode is used for host-to-host communications.

In its most simple form, using only an Authentication Header (AH) for identifying your communication
partner, the packet looks like this:

  ---------------------------------------
  | Original IP header | AH | TCP| DATA |
  ---------------------------------------

In transport mode, IPSec inserts the AH header after the IP header. The IP data and header are used to calculate 
the AH authentication value. 


-- Tunnel mode
-- -----------

In tunnel mode, the entire IP packet (data plus the message headers) is encrypted and/or authenticated. 
It must then be encapsulated into a new IP packet for routing to work. Tunnel mode is used for 
network-to-network communications (secure tunnels between routers) or host-to-network and host-to-host 
communications over the Internet.

You should be aware that tunnel mode is probably the most widely used implementation.
Many organizations use the Internet, to tunnel their traffic from site to site.

In its most simple form, using only an Authentication Header (AH) for identifying your communication
partner, the packet looks like this:

  --------------------------
  |NEW IP Header | Payload |
  --------------------------

  which is

  ----------------------------------------------------
  |NEW IP Header| AH | Original IP header| TCP| DATA |
  ----------------------------------------------------

In Tunnel mode, IPSec traffic can pass transparently through existing IP routers.


AH and/or ESP: or, just Authentication and/or Authentication plus Data Encryption:
-------------------------------------------------------------------------------

The IPSec Authentication Header (AH) provides integrity and authentication but no privacy--
the IP data is not encrypted. The AH contains an authentication value based on a symmetric-key hash function. 

Symmetric key hash functions are a type of cryptographic hash function that take the data and a key as input 
to generate an authentication value. Cryptographic hash functions are usually one-way functions, 
so that starting with a hash output value, it is difficult to create an input value that would generate 
the same output value. This makes it difficult for a third party to intercept a message and replace 
it with a new message that would generate the same authentication value. 

Symmetric key hash functions are also known as shared key hash functions because the sender and receiver 
must use the same (symmetric) key for the hash functions. In addition, the key must only be known by the 
sender and receiver, so this class of hash functions is sometimes referred to as secret key hash functions.

So, secret key must not be confused with the well-know Public/Private key encryptions.

-- Most implementations support the following for the AH:

HMAC-SHA1 (Hashed Message Authentication Code-Secure Hash Algorithm 1, 128-bit key)
HMAC-MD5 (HMAC-Message Digest 5, 160-bit key)

Ofcourse, total encryption of the DATA is also possible, instead of only the AH.
The IPSec Encapsulating Security Payload (ESP) provides data privacy. The ESP protocol also defines 
an authenticated format that provides data authentication and integrity, with data privacy 

-- Most implementations support the following for ESP:

DES-CBC (Data Encryption Standard Cipher Block Chaining Mode, 56-bit key length)
3DES-CBC (Triple-DES CBC, three encryption iterations, each with a different 56-bit key)
AES128-CBC (Advanced Encryption Standard CBC, 128-bit key length).

To be exact, With authenticated ESP, that is AH and ESP,  IPSec encrypts the payload using one symmetric key, 
then calculates an authentication value for the encrypted data using a second symmetric key.


How the shared key is generated:
--------------------------------

The Internet Key Exchange (IKE) protocol is used, for automatically generating and distributing cryptography keys 
for ESP and AH. IKE also authenticates the identity of the remote system, so AH and authenticated ESP 
with IKE keys provides data origin authentication.

Internet Key Exchange (IKE) is an automated protocol for dynamically negotiating the IPSec parameters. 
IKE provides dynamic secret key generation and exchange for IPSec and allows for scalability.
Before IPSec sends authenticated or encrypted IP data, both the sender and receiver must agree on the 
protocols, encryption algorithms and keys to use. IPSec uses the Internet Key Exchange (IKE) protocol 
to negotiate the encryption and authentication methods, and generate shared encryption keys. 
The IKE protocol also provides primary authentication - verifying the identity of the remote system 
before negotiating the encryption algorithm and keys.

The IKE protocol is a hybrid of three other protocols: Internet Security Association and 
Key Management Protocol (ISAKMP), Oakley, and Versatile Secure Key Exchange Mechanism for 
Internet protocol (SKEME). ISAKMP provides a framework for authentication and key exchange, but does not 
define them (neither authentication nor key exchange). The Oakley protocol describes a series of modes 
for key exchange and the SKEME protocol defines key exchange techniques.

Manual Keys, is an alternative to IKE. Instead of dynamically generating and distributing cryptography keys 
for ESP and AH, the cryptography keys are static and manually distributed. Manual keys are typically used only 
when the remote system does not support .


So IPSec uses "shared key" technology. If you use the manual keys, its clear how they get
generated: by you. But even if you use IKE, you still have a "negotiation phase" before the
keys are actually determined. In this phase, two models can be used:

 -> IKE Preshared Key Authentication
 With preshared key authentication, you must manually configure the same, shared symmetric key 
 on both systems, a preshared key. The preshared key is used only for the primary authentication. 
 The two negotiating entities then generate dynamic shared keys for the IKE SAs and IPSec/QM SAs.
 Preshared keys do not require a Certificate Authority or Public Key Infrastructure.

 -> Digital Signatures
 Digital signatures are based on security certificates, and are managed using a Public Key Infrastructure (PKI). 
 So, here you have a Public key infrastructure, only used in the "negotiation phase" before the 
 actual shared key is constructed.

 Two well known PKI products are:
 -VeriSign Managed PKI (formerly VeriSign OnSite for VPNs)
 -Baltimore UniCERT 3.5


Notes:
-----

Note 1:
-------

IPSec can be employed between hosts (that is, end nodes), between gateways, or between a host and a gateway 
in an IP network. Some implementations, like HP-UX IPSec, can only be installed on end nodes.

Note 2:
-------

Next to the Authentication and/or Data Encryption, IPSec also covers, or has implemented, "filter rules",
on a Host or gateway (router) which "allow/permit" or "deny" traffic based on IP addresses, masks, portnumbers etc..
Basically, this looks like the stuff you can find in Firewall implementations.
Thus rules are collected in socalled IPSec policies.

Note 3:
-------

In IPSec, you will often see the term "SA". This stands for "Security Association", which is actually
a term discribing and collecting all relevant parameters like Destination Address, Security Parameter Index SPI, Key, 
Autentication Algolrithm, Key lifetime etc..


42.2 IPSec and AIX:
===================


- Installing IPSec:

Installing the IP Security pieces
The software components needed to implement IPSec are included with AIX on the base installation media. 
To determine if the required filesets are already installed, run the command:

lslpp -L '*ipsec*'

The output from that command should contain the following filesets:

Fileset                      Level  State  Description  
----------------------------------------------------------------------------  
bos.msg.en_US.net.ipsec    4.3.3.0    C    IP Security Messages - U.S.                                             
bos.net.ipsec.keymgt      4.3.3.50    C    IP Security Key Management  
bos.net.ipsec.rte         4.3.3.50    C    IP Security  
bos.net.ipsec.websm       4.3.3.25    C    IP Security WebSM


One additional piece of software is required: the bos.crypto fileset, found on the AIX Bonus Pack CD. 
The name of this fileset may differ, depending on the country. To determine if this fileset is installed 
on the system, run the command:

lslpp -L 'bos.crypto*'

- Set up IPSec logging:

The IP Security software uses syslog to process messages and errors that it generates. 
Messages are sent to syslogd at the local4 facility. It is a good idea to setup logging of these messages 
before activating IPSec, to make troubleshooting easier. 

To have syslogd write all messages received at the local4 facility to the logfile /var/adm/ipsec.log, 
add the following line to the /etc/syslog.conf file:

local4.debug                    /var/adm/ipsec.log 

Create the empty log file by running the command touch /var/adm/ipsec.log, and then make syslogd aware 
of the changes to its configuration by running the command refresh -s syslogd.

- Using IPSec to create "rules":
--------------------------------

You can use smitty:

# smitty ips4_basic   for basic configuration for IP version 4 
# smitty ips6_basic   for basic configuration for IP version 6

or use the commandline with, for example, the "genfilt", "lsfilt" and other commands.


1. The genfilt Command

Purpose
Adds a filter rule. 

Syntax
genfilt -v 4|6 [ -n fid] [ -a D|P] -s s_addr -m s_mask [-d d_addr] [ -M d_mask] [ -g Y|N ] 
               [ -c protocol] [ -o s_opr] [ -p s_port] [ -O d_opr] [ -P d_port] [ -r R|L|B ] [ -w I|O|B ] [ -l Y|N ] 
               [ -f Y|N|O|H ] [ -t tid] [ -i interface] 


Description
Use the genfilt command to add a filter rule to the filter rule table. The filter rules generated by this command 
are called manual filter rules. IPsec filter rules can be configured using the genfilt command, 
IPsec smit (IP version 4 or IP version 6), or Web-based System Manager in the Virtual Private Network submenu.

Examples:

# genfilt -v 4 -a D -s 0.0.0.0 -m 0.0.0.0 -d 0.0.0.0 -M 0.0.0.0 -c udp -o any -O eq -P 123 -l n -w I -i all


2. The lsfilt Command

Purpose
Lists filter rules from either the filter table or the IP Security subsystem. 

Syntax
lsfilt -v 4|6 [-n fid_list] [-a] [-d] 

Description
Use the lsfilt command to list filter rules and their status. 


Example using IPSec on AIX:
---------------------------

To configure IP Sec, tunnels and filters must be configured. When a simple tunnel is defined for all traffic 
to use, the filter rules can be automatically generated. If more complex filtering is desired, filter rules 
can be configured separately.

You can configure IP Sec using the Web-based System Manager application Network or SMIT. If using SMIT, 
the following fastpaths will take you directly to the configuration panels you need:

- ips4_basic 
Basic configuration for IP version 4 
- ips6_basic 
Basic configuration for IP version 6

This section on IP Security Configuration discusses the following topics:

.Tunnels versus Filters 
.Tunnels and Security Associations 
.Choosing a Tunnel Type 
.Basic Configuration 
.Static Filter Rules and Examples 
.Advanced Manual Tunnel Configuration 
.Configuring IKE Tunnels 
.Predefined Filter Rules 
.Logging Facilities 
.Coexistence of IP Security and IBM Secured Network Gateway 2.2/IBM Firewall 3.1 or 3.2 


=> Tunnels versus Filters:

There are two related but distinct parts of IP Security: tunnels and filters. Tunnels require filters, 
but filters do not require tunnels.

Filtering is a basic function in which incoming and outgoing packets can be accepted or denied based 
on a variety of characteristics. This allows a system administrator to configure the host to control 
the traffic between this host and other hosts. Filtering is done on a variety of packet properties, 
such as source and destination addresses, IP Version (4 or 6), subnet masks, protocol, port, 
routing characteristics, fragmentation, interface, and tunnel definition. This filtering is done 
at the IP layer, so no changes are required to the applications. 

Tunnels define a security association between two hosts. These security associations involve specific 
security parameters that are shared between end points of the tunnel.

A packet comes in the network adapter to the IP stack. From there, the filter module is called to determine 
if the packet should be permitted or denied. If a tunnel ID is specified, the packet will be checked against 
the existing tunnel definitions. If the decapsulation from the tunnel is successful, the packet will be passed 
to the upper layer protocol. This function will occur in reverse order for outgoing packets. The tunnel 
relies on a filter rule to associate the packet with a particular tunnel, but the filtering function can occur 
without passing the packet to the tunnel. 

=> Tunnels and Security Associations

Tunnels are used whenever it is desired to have data authenticated, or authenticated and encrypted. 
Tunnels are defined by specifying a security association between two hosts (see figure). The security 
association SA, defines the parameters for the encryption and authentication algorithms and characteristics 
of the tunnel.

  -----------                              ---------
  |Host A   |                              |Host B |
  |         |------------------------------|       |
  |         |------------------------------|       |
  |         |                              |       |
  -----------  SA A------------------->    ---------
                   <------------------ SA B

SA = Security Association, consisting of {Destination Address, SPI, Key, Autentication Algolrithm, Key lifetime}

The Security Parameter Index (SPI) and the destination address identify a unique security association. 
Therefore, these two parameters are required for uniquely specifying a tunnel. Other parameters such as 
cryptographic algorithm, authentication algorithm, keys, and lifetime can be specified or defaults can be used.

=> Choosing a Tunnel Type

The decision to use IBM tunnels, manual tunnels, or, for AIX versions 4.3.2 and later, IKE tunnels, 
depends on the tunnel support of the remote end and the type of key management desired. IKE tunnels 
are preferable (when available) because they offer secure key negotiation and key refreshment in an 
industry-standard way. They also take advantage of the new IETF ESP and AH header types and support 
anti-replay protection.

IBM tunnels offer similar security, but their support is limited to a smaller set of encryption and 
authentication algorithms, but they provide backward compatibility and ease of use with their import/export 
functions with the IBM Firewall.

If the remote end does not support IBM tunnels, or uses one of the algorithms requiring manual tunnels,
 manual tunnels should be used. Manual tunnels ensure interoperability with a large number of hosts. 
Because the keys are static and difficult to change and may be cumbersome to update, they are not as secure.

IBM Tunnels may be used between any two AIX machines running AIX Version 4.3 or higher, or between an AIX 4.3 host and 
a host running IBM Secure Network Gateway 2.2 or IBM Firewall 3.1/3.2. Manual tunnels may be used between a host 
running AIX Version 4.3 and any other machine running IP Security and having a common set of cryptographic 
and authentication algorithms. Almost all vendors offer Keyed MD5 with DES, or HMAC MD5 with DES. 
This is a base subset that works with almost all implementations of IP Security.

When setting up manual or IBM tunnels, the procedure depends on whether you are setting up the first host 
of the tunnel or setting up the second host, which must have parameters matching the first host's setup. 
When setting up the first host, the keys may be autogenerated, and the algorithms can be defaulted. 
When setting up the second host, it is best to import the tunnel information from the remote end, if possible.

Another important consideration is determining whether the remote system is behind a firewall. If it is, 
the setup must include information about the intervening firewall.


=>Basic Configuration (Manual or IBM Tunnels)

- Setting Up Tunnels and Filters
For the simplest case, setting up a manual tunnel, it is not necessary to separately configure the filter rules. 
As long as all traffic between two hosts goes through the tunnel, the necessary filter rules are automatically 
generated. The process of setting up a tunnel is to define the tunnel on one end, import the definition 
on the other end, and activate the tunnel and filter rules on both ends. Then the tunnel is ready to use.

Information about the tunnel must be made to match on both sides if it is not explicitly supplied (see figure). 
For instance, the encryption and authentication algorithms specified for the source will be used for the destination 
if the destination values are not specified. This makes creating the tunnel much simpler.

- Creating a Manual Tunnel on Host A
You can configure a tunnel using the Web-based System Manager application Network, the SMIT fast path ips4_basic 
(for IP Version 4) or ips6_basic (for IP version 6), or you can use the following procedure.

The following is a sample of the gentun command used to create a manual tunnel: 

# gentun -v 4 -t manual -s 5.5.5.19 -d 5.5.5.8 -a HMAC_MD5 -e DES_CBC_8 -N 23567 

This will create a tunnel with output (using lstun -v 4) that looks similar to: 

Tunnel ID            : 1
IP Version           : IP Version 4
Source               : 5.5.5.19
Destination          : 5.5.5.8
Policy               : auth/encr
Tunnel Mode          : Tunnel
Send AH Algo         : HMAC_MD5 
Send ESP Algo        : DES_CBC_8 
Receive AH Algo      : HMAC_MD5 
Receive ESP Algo     : DES_CBC_8 
Source AH SPI        : 300
Source ESP SPI       : 300
Dest AH SPI          : 23576
Dest ESP SPI         : 23576
Tunnel Life Time     : 480
Status               : Inactive
Target               : -
Target Mask          : -
Replay               : No
New Header           : Yes
Snd ENC-MAC Algo     : -
Rcv ENC-MAC Algo     : -

The tunnel will be activated when the mktun command is used: 

# mktun -v 4 -t1

The filter rules associated with the tunnel are automatically generated and output (using lsfilt -v 4) 
looks similar to: 

Rule 4:

Rule action           : permit 
Source Address        : 5.5.5.19 
Source Mask           : 255.255.255.255 
Destination Address   : 5.5.5.8 
Destination Mask      : 255.255.255.255 
Source Routing        : yes 
Protocol              : all 
Source Port           : any 0 
Destination Port      : any 0 
Scope                 : both  
Direction             : outbound 
Logging control       : no 
Fragment control      : all packets 
Tunnel ID number      : 1 
Interface             : all 
Auto-Generated        : yes 

Rule 5: 

Rule action           : permit 
Source Address        : 5.5.5.8 
Source Mask           : 255.255.255.255 
Destination Address   : 5.5.5.19 
Destination Mask      : 255.255.255.255 
Source Routing        : yes 
Protocol              : all 
Source Port           : any 0 
Destination Port      : any 0 
Scope                 : both  
Direction             : inbound 
Logging control       : no 
Fragment control      : all packets 
Tunnel ID number      : 1 
Interface             : all 
Auto-Generated        : yes 

These filter rules in addition to the default filter rules are activated by the mktun -v 4 -t 1 command. 

To set up the other side (when it is another AIX machine), the tunnel definition can be exported on host A 
then imported to host B.

To export:

# exptun -v 4 -t 1 -f /tmp

This will export the tunnel definition into a file named ipsec_tun_manu.exp and any associated filter rules 
to the file ipsec_fltr_rule.exp in the directory indicated by the -f flag.

- Creating a manual tunnel on Host B
To create the matching end of the tunnel, the export files are copied to the remote side and imported into 
that remote AIX 4.3 machine by using the command:

# imptun -v 4 -t 1 -f /tmp

where 1 is the tunnel to be imported and /tmp is the directory where the import files reside. This tunnel number 
is system generated and must be referenced from the output of the gentun command, or by using the lstun command 
to list the tunnels and determine the correct tunnel number to import. If there is only one tunnel in the 
import file, or if all the tunnels are to be imported, then the -t option is not needed.

If the remote machine is not AIX 4.3, the export file can be used as a reference for setting up the algorithm, 
keys, and SPI values for the other end of the tunnel.

Export files from the IBM Secure Network Gateway (SNG) can be imported to create tunnels in AIX 4.3. To do this, 
use the -n option when importing the file:

# imptun -v 4 -f /tmp -n

- Creating an IBM tunnel on Host A
Setting up an IBM tunnel is similar to a manual tunnel, but some of the choices are different for the crypto 
algorithms and the keys are negotiated dynamically, so there is no need to import keys. IBM tunnels are limited 
to Keyed MD5 for authentication. If the HMAC MD5 or HMAC SHA algorithms are desired, a manual tunnel must be used.

# gentun -s 9.3.100.1 -d 9.3.100.245 -t IBM -e DES_CBC_8 -n 35564 

As with manual tunnels, from this point the tunnel and filter table must be activated to make the tunnel active:

# mktun -v 4 -t1

To set up the other side, if the other host is an AIX 4.3 IP Security machine, the tunnel definition can be exported 
on host A, then imported to host B. 

To export: 

# exptun -v 4 -f /tmp

This will export the tunnel definition into a file named ipsec_tun_ibm.exp and any associated filter rules 
to the file ipsec_fltr_rule.exp in the directory indicated by the -f flag. 

- Creating an IBM tunnel on Host B
The procedure is the same for creating the second end of the tunnel on host B for an IBM tunnel. 
The tunnel definition is exported from host A and imported onto host B. The -n flag can be used for a file exported 
by an IBM Secure Network Gateway or an IBM Firewall 3.1/3.2.

- Static Filter Rules and Examples
Filtering can be set up to be simple, using mostly autogenerated filter rules, or can be complex by defining 
very specific filter functions based on the properties of the IP packets. Matches on incoming packets are done 
by comparing the source address and SPI value to those listed in the filter table. Therefore, this pair 
must be unique. 

Each line in the filter table is known as a rule. A collection of rules will determine what packets are accepted 
in and out of the machine, and how they will be directed. Filter rules can be written based on source and destination 
addresses and masks, protocol, port number, direction, fragment control, source routing, tunnel, and interface.

Below is a sample set of filter rules. Within each rule, fields are shown in the following order 
(an example of each field from rule 1 is shown in parentheses): Rule_number (1), Action (permit), 
Source_addr (0.0.0.0), Source_mask (0.0.0.0), Dest_addr (0.0.0.0), Dest_mask (0.0.0.0), Source_routing (no), 
Protocol (udp), Src_prt_operator (eq), Src_prt_value (4001), Dst_prt_operator (eq), Dst_prt_value (4001), 
Scope (both), Direction (both), Logging (no), Fragment (all packets), Tunnel (0), and Interface (all).

1 permit 0.0.0.0 0.0.0.0 0.0.0.0 0.0.0.0 no udp eq 4001 eq 4001 both both no all 
packets 0 all

2 permit 0.0.0.0 0.0.0.0 0.0.0.0 0.0.0.0 no ah any 0 any 0 both both no all 
packets 0 all

3 permit 0.0.0.0 0.0.0.0 0.0.0.0 0.0.0.0 no esp any 0 any 0 both both no all 
packets 0 all

4 permit 10.0.0.1 255.255.255.255 10.0.0.2 255.255.255.255 no all any 0 any 0 
both outbound no all packets 1 all

5 permit 10.0.0.2 255.255.255.255 10.0.0.1 255.255.255.255 no all any 0 any 0 
both inbound no all packets 1 all

6 permit 10.0.0.1 255.255.255.255 10.0.0.3 255.255.255.255 no tcp lt 1024 eq 514 
local outbound yes all packets 2 all

7 permit 10.0.0.3 255.255.255.255 10.0.0.1 255.255.255.255 no tcp/ack eq 514 lt 
1024 local inbound yes all packets 2 all

8 permit 10.0.0.1 255.255.255.255 10.0.0.3 255.255.255.255 no tcp/ack lt 1024 lt 
1024 local outbound yes all packets 2 all

9 permit 10.0.0.3 255.255.255.255 10.0.0.1 255.255.255.255 no tcp lt 1024 lt 
1024 local inbound yes all packets 2 all

10 permit 10.0.0.1 255.255.255.255 10.0.0.4 255.255.255.255 no icmp any 0 any 0 
local outbound yes all packets 3 all

11 permit 10.0.0.4 255.255.255.255 10.0.0.1 255.255.255.255 no icmp any 0 any 0 
local inbound yes all packets 3 all

12 permit 10.0.0.1 255.255.255.255 10.0.0.5 255.255.255.255 no tcp gt 1023 eq 21 
local outbound yes all packets 4 all

13 permit 10.0.0.5 255.255.255.255 10.0.0.1 255.255.255.255 no tcp/ack eq 21 gt 
1023 local inbound yes all packets 4 all

14 permit 10.0.0.5 255.255.255.255 10.0.0.1 255.255.255.255 no tcp eq 20 gt 1023 
local inbound yes all packets 4 all

15 permit 10.0.0.1 255.255.255.255 10.0.0.5 255.255.255.255 no tcp/ack gt 1023 
eq 20 local outbound yes all packets 4 all

16 permit 10.0.0.1 255.255.255.255 10.0.0.5 255.255.255.255 no tcp gt 1023 gt 
1023 local outbound yes all packets 4 all

17 permit 10.0.0.5 255.255.255.255 10.0.0.1 255.255.255.255 no tcp/ack gt 1023 
gt 1023 local inbound yes all packets 4 all

18 permit 0.0.0.0 0.0.0.0 0.0.0.0 0.0.0.0 no all any 0 any 0 both both yes all 
packets 

Rule 1 is for the IBM Session Key daemon and will only appear in IP Version 4 filter tables. It uses port number 
4001 to control packets for refreshing the session key. It is an example of how the port number can be used 
for a specific purpose. This filter rule should not be modified except for logging purposes.

Rules 2 and 3 are used to allow processing of Authentication Headers (AH) and Encapsulating Security Payload 
(ESP) headers. They should not be modified except for logging purposes.

Rules 4 and 5 are a set of autogenerated rules that filter traffic between addresses 10.0.0.1 and 10.0.0.2 
through tunnel #1. Rule 4 is for outbound traffic and rule 5 is for inbound traffic.

Rules 6 through 9 are a set of user-defined rules that filter outbound rsh, rcp, rdump, rrestore, and rdist 
services between addresses 10.0.0.1 and 10.0.0.3 through tunnel #2. Note that logging is set to yes so the 
administrator can monitor this type of traffic.

Rules 10 and 11 are a set of user-defined rules that filter both inbound and outbound icmp services of any type 
between addresses 10.0.0.1 and 10.0.0.4 through tunnel #3.

Rules 12 through 17 are user-defined filter rules that filter outbound FTP service from 10.0.0.1 and 10.0.0.5 
through tunnel #4.

Rule 18 is an autogenerated rule always placed at the end of the table. In this case, it permits all packets 
that do not match the other filter rules. It may be set to deny all traffic not matching the other filter rules.

Each rule may be viewed separately (using lsfilt) to make each field clear. 


42.3 IPSEC and HP:
===================


As you have read in section 42.1, you should know beforehand if you want AH or AH plus ESP,
Manual keys or IKE, Transport mode or Tunnel mode, and what "filter rules" you want to apply.
Depending on the number of NIC's in your Host, and what traffic you want to permit or deny,
you will invest a certain a amount of effort to create those rules.


Introducing Configuring IPSec:
------------------------------

You configure HP-UX IPSec using a couple of commandline utilities like:

ipsec_config
ipsec_report
ipsec_admin
ipsec_policy

To configure security certificates (used in the negotiation phase in IKE), use the "ipsec_mgr" utility, 
which has a graphical user interface (GUI). So you need an X terminal.
You can also use preshared key instead of certificates (the preshared key is used only for the 
primary authentication).

As an example of using the commandline, take a look at the following command:

# ipsec_config add host my_host_policy -source 10.1.1.1 \
  -destination 10.0.0.0/8/TELNET -pri 100 \
  -action ESP_AES128_HMAC_SHA1

The above creates a "rule" or policy in the policy database "/var/adm/ipsec/config.db".

The syntax with respect of addresses and ports, resembles somewhat the common syntax found in many 
types of router, gateway, firewall products.

For example 
0.0.0.0   means here all possible IPv4 addresses
10.0.0.0  means here all possible IPv4 addresses in 10.

Instead of using a serie of individual commands to configure IPSec, HP recommends to create a "batchfile" 
with statements. All statements are parsed first, and either all statements pass and are executed, or all fail, 
even if only one statement is incorrectt.

For the above example, a batchfile would look like:

add host my_host_policy -source 10.1.1.1 \
-destination 10.0.0.0/8/TELNET -pri 100 \
-action ESP_AES128_HMAC_SHA1

Notice that we have used the "add" option of the ipsec_config command, indeed used to "add" 
to the config DB. It also suggest that there are other options, which is true:

You can use:

ipsec_config add        to add to the db
ipsec_config batch      to use a batchfile
ipsec_config delete     to delete from the db
ipsec_config show       to show information from the db

For example, the "ipsec_config show all" command displays the entire contents of the database.


profiles:

An ipsec_config profile file contains default argument values that are evaluated in ipsec_config add commands 
if the user does not specify the values in the command. The values are evaluated once, when the policy is 
added to the configuration database. Values used from the profile file become part of the configuration record 
for the policy.

You can specify a profile file name with the -profile argument as part of an ipsec_config command. By default, 
ipsec_config uses the /var/adm/ipsec/.ipsec_profile profile file, which is shipped with HP-UX IPSec. 
In most topologies, you can use the default values supplied in the /var/adm/ipsec/.ipsec_profile file.


Installation:
-------------

The software takes about 110MB. Most of the software goes into /var/adm/ipsec.
As root:

As usual at installation on HP-UX, run the swinstall program using the command:

# swinstall

This opens the "Software Selection" window and the "Specify Source" window. 
On the Specify Source window, change the Source Host Name if necessary. 
Enter the mount point of the drive in the Source Depot Path field and click OK to return to the 
Software Selection window. 

The Software Selection window now contains a list of available software bundles to install.
Highlight the HP-UX IPSec software for your system type. 

Choose Mark for Install from the Actions menu to choose the product to be installed. With the exception of 
the manpages and user's manual, you must install the complete IPSec product.

swinstall loads the fileset, runs the control scripts for the fileset, and builds the kernel. 
Estimated time for processing: 3 to 5 minutes.

Click OK on the Note window to reboot the system.

When the system reboots, check the log files "/var/adm/sw/swinstall.log" and 
"/var/adm/sw/swagent.log" to make sure the installation was successful.  

-- Setting the HP-UX IPSec Password:

When you install HP-UX IPSec, the HP-UX IPSec password is set to ipsec. You must change the HP-UX IPSec password 
after installing the product to use the autoboot feature and to load and configure security certificates. 
HP-UX IPSec uses the password to encrypt certificate files that contain cryptography keys for 
security certificates, and to control access to the ipsec_mgr security certificate configuration GUI.

To set the password, run the following command:

# ipsec_admin -newpasswd

The ipsec_admin utility prompts you to establish the HP-UX IPSec password.


Configuring IPSec (2):
----------------------

From the HP-UX documentation, it is shown that you should do the following actions:

Step 1: Configuring Host IPSec Policies
Step 2: Configuring Tunnel IPSec Policies
Step 3: Configuring IKE Policies
Step 4: Configuring Preshared Keys Using Authentication Records (Or do Step 5)
Step 5: Configuring Certificates
Step 6: Configuring the Bypass List (Local IPv4 Addresses)
Step 7: Verify Batch File Syntax
Step 8: Committing the Batch File Configuration and Verifying Operation
Step 9: Configuring HP-UX IPSec to Start Automatically
Step 10: Creating Backup Copies of the Batch File and Configuration Database
 

43. SOLARIS OpenBoot PROM commands:
===================================

-- Getting help
-- ------------
ok help / ok help [category] / ok help command

For example, if you want to see the help messages for all commands in the category "diag", type the following:

ok help diag

-- Display your physical devices
-- -----------------------------
ok show-devs [device path]

-- Create or show device aliases
-- -----------------------------
A device pathnames can be long and hard to enter. A device alias allows a short name to represent
an entire device pathname. For example the alias "disk0" might represent the device
/sbus@1,f8000000/esp@0,40000/sd@3,0:a

ok devalias               displays all current devices aliases
ok <alias> <device name>  creates the alias corresponding to the physical device

The following example creates a device alias named "disk3" which represents a SCSI disk
with a target ID of 3.

ok devalias disk3 /iommu/sbus/espdma@f,400000/esp@f,800000/sd@3,0

To make this permanent in NVRAM use:
ok nvalias disk3 /iommu/sbus/espdma@f,400000/esp@f,800000/sd@3,0

-- OpenBoot Diagnostics
-- --------------------
Various hardware diagnostics can be run in OpenBoot.

ok probe-scsi      identifies devices attached to as SCSI bus
ok probe-ide       identifies IDE devices attached to the PCI bus
ok test device     executes the self-test method of the device
ok test-all        test all devices that have a build-in self-test method
ok watch-clock     tests the clock function
ok watch-net       monitors the network connection

-- OpenBoot NVRAM
-- --------------
System configuration parameters, like "auto-boot", are stored in NVRAM.
You can list or modify these configuration parameters and any changes you make
remain in effect, even after a power cycle because the are stored in NVRAM.

Some of the most important parameters:
auto-boot?     default true        if true, the machine boots automatically
boot-command   default boot        the command that is executed if auto-boot is true
boot-device    disk or net         device from which to start up
input-device   keyboard            console input device, usually keyboard, ttya, ttyb
security-mode  none                none, command, or full
etc..

To show a parameter: ok printenv <parameter>
To set a parameter : ok setenv <parameter> <value>

ok setenv auto-boot? false
ok printenv auto-boot?

Once unix is loaded, root can also use the /usr/sbin/eeprom command to view or change an OpenBoot parameter.
/usr/sbin/eeprom auto-boot?=true


44. Process priority:
===================== 


Solaris:
--------

NICE and PRIOCTL commands:

nice:
-----

A high nice value means a low priority for your process: you are goiing to be nice.
A low or negative value means a high priority: you are not very nice.

Examples:

# nice +10 ~/bin/longtask
# renice -5 8829

The nice command uses the programname as an argument. The renice command takes the PID as argument.

System	   Range
------     -----
Solaris    0-39
HPUX       0-39
Read Hat   -20-20
FreeBSD    -20-20

prioctl:
--------

Solaris uses the prioctl command, intended as an improvement over the nice command,
to modify process priorities.

Syntax:
# prioctl -s -p <new_priority> -i pid <process_id>

Example:
# prioctl -s -p -5 -i pid 8200


AIX:
----

In AIX we can use the nice and renice commands as well.

About the schedtune Command:
Purpose
Sets parameters for CPU scheduler and Virtual Memory Manager processing.

Syntax
schedtune [ -D | { [ -d n ] [ -e n ] [ -f n ] [ -h n ] [ -m n ] [ -p n ] [ -r n ] [ -t n ] [ -w n ] } ]

Description
Priority-Calculation Parameters
The priority of most user processes varies with the amount of CPU time the process has used recently. The CPU scheduler's priority calculations are based on two parameters that are set with schedtune: -r and -d. The r and d values are in thirty-seconds (1/32); that is, the formula used by the scheduler to calculate the amount to be added to a process's priority value as a penalty for recent CPU use is:

CPU penalty = (recently used CPU value of the process) * (r/32)
and the once-per-second recalculation of the recently used CPU value of each process is:

new recently used CPU value = (old recently used CPU value of the process) * (d/32)


44. ttymon and terminals:
=========================

Solaris:
--------

The configuration of terminals in Solaris 8,9 is somewhat more elaborate than
adding such a device on AIX, for example with the mkdev command.
Here we shall only show the configuration in Solaris 8,9.

Note 1:
-------

In Solaris, the usual getty is taken over by the portmonitor ttymon. 

$ cd /etc
$ ls -al get*
lrwxrwxrwx   1 root     root          21 Aug 10  2004 getty -> ../usr/lib/saf/ttymon


/var/saf/zsmon >sacadm -l
PMTAG          PMTYPE         FLGS RCNT STATUS     COMMAND
zsmon          ttymon         -    0    ENABLED    /usr/lib/saf/ttymon #


$ pmadm -l
PMTAG          PMTYPE         SVCTAG         FLGS ID       <PMSPECIFIC>
zsmon          ttymon         ttya           u    root     /dev/term/a I - /usr/bin/login - 9600 ldterm,ttcompat ttya login:  - tvi925 y  #
zsmon          ttymon         ttyb           u    root     /dev/term/b I - /usr/bin/login - 9600 ldterm,ttcompat ttyb login:  - tvi925 y  #


ls -al \dev\term

lrwxrwxrwx   1 root     root          48 Aug 10  2004 a -> ../../devices/pci@1e,600000/isa@7/serial@0,3f8:a
lrwxrwxrwx   1 root     root          48 Aug 10  2004 b -> ../../devices/pci@1e,600000/isa@7/serial@0,2e8:b


Note 2:
-------

Solaris 2.x systems come with a ttymon port monitor named zsmon and with serial ports A and B already 
configured with default settings for terminals, as shown in the following example:

castle% /usr/sbin/sacadm -l
PMTAG          PMTYPE         FLGS RCNT STATUS     COMMAND
zsmon          ttymon         -    0    ENABLED    /usr/lib/saf/ttymon #

castle% /usr/sbin/pmadm -l
PMTAG         PMTYPE         SVCTAG         FLGS ID       <PMSPECIFIC>
tcp      listen   lp          - root    - p -

$ sacadm -l
PMTAG          PMTYPE         FLGS RCNT STATUS     COMMAND
zsmon          ttymon         -    0    ENABLED    /usr/lib/saf/ttymon #

Note 3:
-------

$ tail -30 /var/saf/zsmon/log

Wed Mar 16 13:13:59 2005; 453; ********** ttymon starting **********
Wed Mar 16 13:13:59 2005; 453; PMTAG:            zsmon
Wed Mar 16 13:13:59 2005; 453; Starting state: enabled
Wed Mar 16 13:13:59 2005; 453; Got SC_ENABLE message
Wed Mar 16 13:13:59 2005; 453; max open files    = 1024
Wed Mar 16 13:13:59 2005; 453; max ports ttymon can monitor = 1017
Wed Mar 16 13:13:59 2005; 453; *ptr == 0
Wed Mar 16 13:13:59 2005; 453; SUCCESS
Wed Mar 16 13:13:59 2005; 453; *ptr == 0
Wed Mar 16 13:13:59 2005; 453; SUCCESS
Wed Mar 16 13:13:59 2005; 453; Initialization Completed
Mon Mar 21 08:02:27 2005; 453; caught SIGTERM
Mon Mar 21 08:02:27 2005; 453; ********** ttymon exiting ***********
Mon Mar 21 08:05:43 2005; 453;
Mon Mar 21 08:05:43 2005; 453; ********** ttymon starting **********
Mon Mar 21 08:05:43 2005; 453; PMTAG:            zsmon
Mon Mar 21 08:05:43 2005; 453; Starting state: enabled
Mon Mar 21 08:05:43 2005; 453; Got SC_ENABLE message
Mon Mar 21 08:05:43 2005; 453; max open files    = 1024
Mon Mar 21 08:05:43 2005; 453; max ports ttymon can monitor = 1017
Mon Mar 21 08:05:43 2005; 453; *ptr == 0
Mon Mar 21 08:05:43 2005; 453; SUCCESS
Mon Mar 21 08:05:43 2005; 453; *ptr == 0
Mon Mar 21 08:05:43 2005; 453; SUCCESS
Mon Mar 21 08:05:43 2005; 453; Initialization Completed

Note 4:
-------

     ttymon is a STREAMS-based TTY port monitor.  Its function is
     to  monitor  ports,  to  set terminal modes, baud rates, and
     line disciplines for the ports, and   to  connect  users  or
     applications  to  services  associated  with the ports. Nor-
     mally, ttymon is configured  to run under the Service Access
     Controller, sac(1M), as part of the  Service Access Facility
     (SAF). It is configured using the  sacadm(1M) command.  Each
     instance  of  ttymon  can  monitor multiple ports. The ports
     monitored by an instance of ttymon are specified in the port
     monitor's  administrative  file.  The administrative file is
     configured using the pmadm(1M) and ttyadm(1M) commands. When
     an  instance  of  ttymon  is  invoked by the sac command, it
     starts to monitor its ports. For  each  port,  ttymon  first
     initializes the line disciplines, if they are specified, and
     the speed and terminal settings. For ports with  entries  in
     /etc/logindevperm,  device  owner, group and permissions are
     set. (See logindevperm(4).) The values used for  initializa-
     tion  are  taken  from the appropriate entry in the TTY set-
     tings file. This file is maintained by the sttydefs(1M) com-
     mand.  Default  line disciplines on ports are usually set up
     by the autopush(1M) command of the Autopush Facility.

     ttymon then writes the prompt and waits for user  input.  If
     the user indicates that the speed is inappropriate by press-
     ing the BREAK key, ttymon tries the next  speed  and  writes
     the  prompt  again.  When  valid  input  is received, ttymon
     interprets the per-service configuration file  for the port,
     if  one  exists,  creates  a  utmpx  entry  if required (see
     utmpx(4)), establishes the  service  environment,  and  then
     invokes  the  service  associated with the port. Valid input
     consists of a string of at least one non-newline  character,
     terminated  by  a  carriage  return.  After the service ter-
     minates,  ttymon cleans up the utmpx entry, if  one  exists,
     and returns the port to its initial state.

     If autobaud is enabled for a port, ttymon will try to deter-
     mine  the  baud  rate on the port automatically.  Users must
     enter a carriage return before ttymon can recognize the baud
     rate  and  print  the prompt. Currently, the baud rates that
     can be determined by autobaud are 110, 1200, 2400, 4800, and
     9600.

SunOS 5.9           Last change: 11 Dec 2001                    1

System Administration Commands                         ttymon(1M)

     If a port is configured as a bidirectional port, ttymon will
     allow  users  to  connect  to a service, and, if the port is
     free, will allow uucico(1M), cu(1C), or ct(1C) to use it for
     dialing out. If a port is bidirectional, ttymon will wait to
     read a character before it prints a prompt.

     If the connect-on-carrier flag is set  for  a  port,  ttymon
     will immediately invoke the port's associated service when a
     connection request is received. The prompt message will  not
     be sent.

     If a port is disabled, ttymon will not start any service  on
     that  port.  If a disabled message is specified, ttymon will
     send out the disabled message when a connection  request  is
     received.  If  ttymon  is  disabled,  all  ports  under that
     instance of ttymon will also be disabled.

SERVICE INVOCATION
     The service ttymon invokes for a port is  specified  in  the
     ttymon  administrative file.  ttymon will scan the character
     string giving the service to be invoked for this port, look-
     ing for a %d or a %% two-character sequence. If %d is found,
     ttymon will modify the service command  to  be  executed  by
     replacing those two characters by the full path name of this
     port (the device  name).  If  %%  is  found,  they  will  be
     replaced  by  a  single %. When the service is invoked, file
     descriptor 0, 1, and 2 are opened to  the  port  device  for
     reading  and  writing.  The service is invoked with the user
     ID, group ID and current home directory set to that  of  the
     user  name  under  which  the  service  was  registered with
     ttymon. Two environment variables, HOME and  TTYPROMPT,  are
     added to the service's environment by ttymon. HOME is set to
     the home directory of the user name under which the  service
     is invoked. TTYPROMPT is set to the prompt string configured
     for the service on the port. This is provided so that a ser-
     vice  invoked  by  ttymon  has  a  means of determining if a
     prompt was actually issued by ttymon and, if so,  what  that
     prompt actually was.

     See ttyadm(1M) for options that can be set for  ports  moni-
     tored by ttymon under  the Service Access Controller.

SECURITY
     ttymon uses pam(3PAM) for session management.  The PAM  con-
     figuration  policy,  listed through /etc/pam.conf, specifies
     the modules to  be  used  for  ttymon.  Here  is  a  partial
     pam.conf file with entries for ttymon using the UNIX session
     management module.

     ttymon  session   required  /usr/lib/security/pam_unix.so.1

SunOS 5.9           Last change: 11 Dec 2001                    2

System Administration Commands                         ttymon(1M)

     If there are no entries for the  ttymon  service,  then  the
     entries for the "other" service will be used.

Note 5:
-------

To add a login service to configure an existing port. Follow these steps to configure the SAF for 
a character terminal:

1.  Become superuser. 
2.  Type sacadm -l and press Return. Check the output to make sure that a ttymon port monitor is configured. 
    It is unlikely that you will need to add a new port monitor. If you do need to add one, type 

    sacadm -a -p pmtag -t ttymon -c /usr/lib/saf/ttymon -v `ttymon -V` and press Return. 

3.  Type 

    pmadm -a -p pmtag -s svctag -i root -fu -v `ttymon -V` -m "`ttyadm -t terminfo-type -d dev-path \
    -l ttylabel -s /usr/bin/login`" 

    and press Return. The port is configured for a login service. 
4.  Attach all of the cords and cables to the terminal and turn it on. 


In this example, a ttymon port monitor called ttymon0 is created and a login is 
enabled for serial port /dev/term/00:


oak% su
Password:
# sacadm -l
PMTAG          PMTYPE    FLGS RCNT STATUS      COMMAND
zsmon        ttymon  -  O  ENABLED  /usr/lib/saf/ttymon #
# sacadm -a -p ttymonO -t ttymon -c /usr/lib/saf/ttymon -v`ttyadm -V`
# sacadm -l
PMTAG          PMTYPE    FLGS RCNT STATUS     COMMAND
ttymonmO     ttymon   -  O  STARTING   /usr/lib/saf/ttymon #
zsmon        ttymon  -  O  ENABLED  /usr/lib/saf/ttymon #
# pmadm -a -p ttymonO -s ttyOO -i root -fu
-v `ttyadm -V` -m "`ttyadm -t tvi925 -d
/dev/term/OO -l 96OO -s
/usr/bin/login`"
# pmadm -l
PMTAG          PMTYPE         SVCTAG        FLGS ID       <PMSPECIFIC>
zsmon        ttymon   ttya       u root     /dev/term/a I -
/usr/bin/login - 96OO ldterm,ttcompat ttya login:  - tvi925 y
#
zsmon        ttymon   ttyb       u root     /dev/term/b I -
/usr/bin/login - 96OO ldterm,ttcompat
ttyb login:  - tvi925 y
#
ttymonO         ttymon    ttyOO    u root     /dev/term/OO - - -
?/usr/bin/login - 96OO login: - tvi925 - #
#


Add a port monitor         sacadm -a -p pmtag -t ttymon -c /usr/lib/saf/ttymon -v `ttyadm -V` -y "comment"  
Disable a port monitor     sacadm -d -p pmtag  
Enable a port monitor      sacadm -e -p pmtag  
Kill a port monitor        sacadm -k -p pmtag  
List status information 
for a port monitor         sacadm -l -p pmtag  
Remove a port monitor      sacadm -r -p pmtag  
Start a port monitor       sacadm -s -p pmtag  
Add a listen port monitor  sacadm -a -p pmtag -t listen -c /usr/lib/saf/listen -v `ttyadm -V` -y "comment"  


Add a standard terminal service  pmadm -a -p pmtag -s svctag -i root -v `ttyadm -V` -m "`ttyadm -i `terminal disabled.' -l contty -m ldterm,ttcompat -d dev-path -s /usr/bin/login`"  
Disable a ttymon port monitor    pmadm -d -p pmtag -s svctag  
Enable a ttymon port monitor     pmadm -e -p pmtag -s svctag  
List all services                pmadm -l  
List status information for one 
ttymon port monitor              pmadm -l -p pmtag -s svctag  
Add a listen service             pmadm -a -p pmtag -s lp -i root -v `nlsadmin -V` -m "`nlsadmin -o /var/spool/lp/fifos/listenS5`"  
Disable a listen port monitor    pmadm -d -p pmtag -s lp  
Enable a listen port monitor     pmadm -e -p pmtag -s lp  
List status information for 
one ttymon port monitor          pmadm -l -p pmtag  


Note 7:
-------

3.23) What has happened to getty? What is pmadm and how do you use it? 
I was hoping you wouldn't ask. PMadm stands for Port Monitor Admin, and it's part of a ridiculously complicated 
bit of software over-engineering that is destined to make everybody an expert. 

Best advice for workstations: don't touch it! It works out of the box. For servers, you'll have to read the manual. 
This should be in admintool in Solaris 2.3 and later. For now, here are some basic instructions from Davy Curry. 

"Not guaranteed, but they worked for me." 

To add a terminal to a Solaris system: 

1. Do a "pmadm -l" to see what's running. The serial ports on the CPU board are probably already being monitored by "zsmon". 


PMTAG          PMTYPE         SVCTAG         FLGS ID       <PMSPECIFIC>
zsmon          ttymon         ttya           u    root     \
	    /dev/term/a I - /usr/bin/login - 9600 ldterm,ttcompat ttya \
	    login:  - tvi925 y  #

2. If the port you want is not being monitored, you need to create a new port monitor with the command 


	    sacadm -a -p PMTAG -t ttymon -c /usr/lib/saf/ttymon -v VERSION

where PMTAG is the name of the port monitor, e.g. "zsmon" or "alm1mon", and VERSION is the output of "ttyadm -V". 

3. If the port you want is already being monitored, and you want to change something, you need to delete the current instance of the port monitor. To do this, use the command 


	    pmadm -r -p PMTAG -s SVCTAG

where PMTAG and SVCTAG are as given in the output from "pmadm -l". Note that if the "I" is present in the <PMSPECIFIC> field (as it is above), you need to get rid of it. 

4. Now, to create a specific instance of ttymon for a port, issue the command: 


pmadm -a -p PMTAG -s SVCTAG -i root -fu -v 1 -m \
	    "`ttyadm -m ldterm,ttcompat -p 'PROMPT' -S YORN -T TERMTYPE \
	    -d DEVICE -l TTYID -s /usr/bin/login`"

Note the assorted quotes; Bourne shell (sh) and Korn (ksh) users leave off the second backslash! 

In the above: 

PMTAG is the port monitor name you made with "sacadm", e.g. "zsmon". 
SVCTAG is the service tag, which can be the name of the port, e.g., "ttya" or "tty21". 
PROMPT is the prompt you want to print, e.g. "login: ". 
YORN is "y" to turn software carrier on (you want this for directly connected terminals" and "n" to leave it off 
(you want this for modems). 
TERMTYPE is the value you want in $TERM. 
DEVICE is the name of the device, e.g. "/dev/term/a" or "/dev/term/21". 
TTYID is the line you want from /etc/ttydefs that sets the baud rate and stuff. I suggest you use one of the 
"contty" ones for directly connected terminals. 

5. To disable ("turn off") a terminal, run 


	    pmadm -d -p PMTAG -s SVCTAG

To enable ("turn on") a terminal, run 


	    pmadm -e -p PMTAG -s SVCTAG

Ports are enabled by default when you "create" them as above. 


Note 8:
-------


You use three SAF commands to administer modems and alphanumeric terminals: sacadm, pmadm, and ttyadm.

-- The sacadm command adds and removes port monitors. This command is your main link with the Service Access Controller (SAC) 
and its administrative file (/etc/saf/_sactab).

-- The pmadm command adds or removes a service and associates a service with a particular port monitor.

-- The ttyadm command formats information for inclusion in various SAF administrative files. A ttyadm command often is embedded 
within a sacadm or pmadm command to provide some of the data needed by those commands. 


Function  			   Program  		Description  
Overall administration  	   sacadm		Command for adding and removing port monitors  
Service Access Controller	   sac			SAF's master program  
Port monitors  			   ttymon		Monitors serial port login requests  
 				   listen		Monitors requests for network services  
Port monitor service administrator pmadm		Command for controlling port monitors' services  
Services  			   logins; 
				   remote procedure calls; 
				   other		Services to which SAF provides access  


45: CDE:
========

Start Login Manager:
--------------------

The login Server, also called the Login Manager, usually starts up the CDE environment when the system
is booted and the "/etc/rc2.d/S99dtlogin" script is run.
The login Server is a server responsible for displaying a graphical logon screen, authenticating users,
and starting a user session.
It can display a login screen on local or network bitmap displays 

It can also be started from the command line, for example, to start the Login Server use either:

# /etc/init.d/dtlogin start
or
# /usr/dt/bin/dtlogin -deamon; exit

To set the Login Manager to start CDE the next time the system is booted, give the command

# /usr/dt/bin/dtconfig -e


Stop Login manager:
-------------------

To stop the Login Manager, use

# /etc/init.d/dtlogin stop
or
# /usr/dt/bin/dtconfig -kill

If you do not want the CDE startup if the system is booted use

# /usr/dt/bin/dtconfig -d


Other facts of the Login manager:
---------------------------------

By default the Login manager stores its PID in /var/dt/Xpid

The login manager is configurable throug a number of files like "Xconfig".
You should copy "/usr/dt/config" to "/etc/dt/config" and make modifications there.
To tell the Login Manager to reread Xconfig, use

# /usr/dt/bin/dtconfig -reset


Displaying a Login screen:
--------------------------

Upon startup, the Login Server checks the Xservers file to determine if an X server needs to be
started and to determine if and how login screens should be displayed on local or network displays.
To modify Xservers, copy Xservers from /usr/dt/config to /etc/dt/config.
After modifying, tell the login server to reread Xservers by
# /usr/dt/bin/dtconfig -reset

The format of a record in Xservers is:

display_name display_class display_type X_server_command

display_name     = the connection name to use when connecting to the X server (:0)
                   An * is expanded to hostname:0
display_class    = identifies resources specific to this display (for example Local)
display_type     = tells the Login manager whether the display is local or a network display.
X_server_command = identifies the commandline, connection number, and other options the
                   Login server will use to start the X server (/usr/bin/X11/X :0)
                   The connection number must match the number specified in display_name.

The default Xservers line is similar to:

:0 Local local@console /usr/bin/X11/X :0


Running the Login Server without a Local bitmap display:
--------------------------------------------------------

If your login server has no bitmap display, you should comment ou the line shown above like:

# :0 Local local@console /usr/bin/X11/X :0

So when the login server starts, it runs in the background waiting for requests from
network displays.


46. Make command:
================

Note 1: (Not geared to any particular unix version):
----------------------------------------------------

ABOUT MAKE

The make utility executes a list of shell commands associated with each target, typically to create 
or update a file of the same name. makefile contains entries that describe how to bring a target 
up to date with respect to those on which it depends, which are called dependencies.

SYNTAX

/usr/ccs/bin/make [ -d ] [ -dd ] [ -D ] [ -DD ] [ -e ] [ -i ] [ -k ] [ -n ] [ -p ] [ -P ] [ -q ] 
[ -r ] [ -s] [ -S ] [ -t ] [ -V ] [ -f makefile ] ... [-K statefile ] ... [ target ... ] [ macro = value ... ]

/usr/xpg4/bin/make [ -d ] [ -dd ] [ -D ] [ -DD ] [ -e ] [ -i ] [ -k ] [ -n ] [ -p ] [ -P ] [ -q ] 
[ -r ] [ -s] [ -S ] [ -t ] [ -V ] [ -f makefile ] ... [ target... ] [ macro = value ... ]


DESCRIPTION
     The make utility executes a list of shell  commands  associ-
     ated  with each target, typically to create or update a file
     of the same name. makefile contains  entries  that  describe
     how  to  bring  a target up to date with respect to those on
     which it depends, which are called dependencies. Since  each
     dependency is a target, it may have dependencies of its own.
     Targets, dependencies, and sub-dependencies comprise a  tree
     structure  that  make traces when deciding whether or not to
     rebuild a target.

     The make utility recursively checks each target against  its
     dependencies,  beginning  with  the  first  target  entry in
     makefile if no target argument is supplied  on  the  command
     line. If, after processing all of its dependencies, a target
     file is found either to be missing, or to be older than  any
     of  its dependencies, make rebuilds it. Optionally with this
     version of make, a target can be treated as out-of-date when
     the commands used to generate it have changed since the last
     time the target was built.

     To build a given target, make executes the list of commands,
     called  a  rule.  This  rule may be listed explicitly in the
     target's makefile entry, or it may be supplied implicitly by
     make.

     If no target is specified on the command line, make uses the
     first target defined in makefile.

     If a target has no makefile entry, or if its  entry  has  no
     rule,  make attempts to derive a rule by each of the follow-
     ing methods, in turn, until a suitable rule is  found.  Each
     method is described under USAGE below.


Note 2: An example
------------------

# find . -name "make" -print
./usr/ccs/bin/make
./usr/share/lib/make
./usr/xpg4/bin/make
./usr/appserver/samples/rmi-iiop/cpp/src/client/make

/opt/app/oracle/product/9.2/sqlplus/lib >/usr/ccs/bin/make -f ins_sqlplus.mk install


If you want to do compilations on Solaris, it is best not have /usr/ucb
in your PATH. If you want to have /usr/ucb in the PATH it must be the last
entry. You also should put /usr/ccs/bin/ before /usr/xpg4/bin/ in the PATH
to make sure that /usr/ccs/bin/make is used and not /usr/xpg4/bin/make.

To be able to use 'make' 'as' and 'ld' you need to make sure that 
/usr/ccs/bin is in your path.

Alan Coopersmith <alanc@alum.calberkeley.org> wrote:
> rhugga@yahoo.com (Keg) writes in comp.sys.sun.admin:
> |Just curious what the stuff under /usr/ucb is for? I was looking at
> |the ps utility and apparently they are the same fiel in 2 different
> |places:

> For users and scripts that expect the BSD style options, in cases such
> as ps & ls where they are incompatible with the SvsV options found in
> the /usr/bin versions.

It's there for historical reasons.  SunOS 4.x was based on BSD unix.
Solaris 2.x (= SunOS 5.x) was based on SYSV, with a bunch of commands
having different syntax and behavior.  To ease the transition, the
/usr/ucb directory was created to hold the incompatible BSD versions.
People who really wanted BSD could put /usr/ucb before /usr in their
PATH.

Note 3:
-------

How to write a simple makefile.
Let use start with a very simple example. Suppose the executable sortit depends on the main Fortran source file
"sortit_main.f90" and 2 additional files "readN.f90" and "sortarray.f90". 
The source files can be compiled and linked in 1 f90 command:

f90 -o sortit sortit_main.f90 readN.f90 sortarray.f90

Now suppose only one file changes, and the files are not small but contains many codelines, then
a better approach could be this:
Suppose you seperate the compilation and linking stages:

- compile into objectfiles:
f90 -c sortit sortit_main.f90 readN.f90 sortarray.f90

- link the files:
f90 -o sortit sortit_main.o readN.f90.o sortarray.o

Suppose there were many source files, and thus many objectfiles.
In this case it's better to make one definitionfile which explains it all. So if one source changes,
the corresponding objectfile is out of date, and needs to be recreated. 
All that information can be in a definitionfile, for example:

sortit:  sortit_main.o readN.o sortarray.o
	f90 -o sortit sortit_main.o readN.o sortarray.o

sortit_main.o: sortit_main.f90
	f90 -c sortit_main.f90

readN.o: readN.f90
	f90 -c readN.f90

sortarray.o: sortarray.f90
	f90 -c sortarray.f90

By default, make looks for a makefile called "makefile" in the current directory. Alternative files can
be specified with the -f option followed by the name of the makefile, for example:

make -f makefile1.mk

or

make -f makefile1.mk install

One of the labels present in the Makefile happens to be named ' install ' .

Further explanation:
--------------------

The make utility is embedded in UNIX history. It is designed to decrease a programmer's need to remember things. 
I guess that is actually the nice way of saying it decreases a programmer's need to document. In any case, 
the idea is that if you establish a set of rules to create a program in a format make understands, you don't have 
to remember them again. 

To make this even easier, the make utility has a set of built-in rules so you only need to tell it what new things 
it needs to know to build your particular utility. For example, if you typed in make love, make would first look 
for some new rules from you. If you didn't supply it any then it would look at its built-in rules. One of those 
built-in rules tells make that it can run the linker (ld) on a program name ending in .o to produce the 
executable program. 

So, make would look for a file named love.o. But, it wouldn't stop there. Even if it found the .o file, 
it has some other rules that tell it to make sure the .o file is up to date. In other words, newer than 
the source program. The most common source program on Linux systems is written in C and its file name ends in .c. 

If make finds the .c file (love.c in our example) as well as the .o file, it would check their timestamps 
to make sure the .o was newer. If it was not newer or did not exist, it would use another built-in rule to 
build a new .o from the .c (using the C compiler). This same type of situation exists for other 
programming languages. The end result, in any case, is that when make is done, assuming it can find the 
right pieces, the executable program will be built and up to date. 

The old UNIX joke, by the way, is what early versions of make said when it could not find the necessary files. 
In the example above, if there was no love.o, love.c or any other source format, the program would have said:
make: don't know how to make love. Stop. 

Getting back to the task at hand, the default file for additional rules in Makefile in the current directory. 
If you have some source files for a program and there is a Makefile file there, take a look. It is just text. 
The lines that have a word followed by a colon are targets. That is, these are words you can type following 
the make command name to do various things. If you just type make with no target, the first target will be executed. 

What you will likely see at the beginning of most Makefile files are what look like some assignment statements. 
That is, lines with a couple of fields with an equal sign between them. Surprise, that is what they are. 
They set internal variables in make. Common things to set are the location of the C compiler (yes, there is a default), 
version numbers of the program and such. 

This now beings up back to configure. On different systems, the C compiler might be in a different place, you might 
be using ZSH instead of BASH as your shell, the program might need to know your host name, it might use a 
dbm library and need to know if the system had gdbm or ndbm and a whole bunch of other things. 
You used to do this configuring by editing Makefile. Another pain for the programmer and it also meant that 
any time you wanted to install software on a new system you needed to do a complete inventory of what was where. 

As more and more software became available and more and more POSIX-compliant platforms appeared, this got harder 
and harder. This is where configure comes in. It is a shell script (generally written by GNU Autoconf) that goes up 
and looks for software and even tries various things to see what works. It then takes its instructions 
from Makefile.in and builds Makefile (and possibly some other files) that work on the current system. 

Background work done, let me put the pieces together. 

You run configure (you usually have to type ./configure as most people don't have the current directory in their 
search path). This builds a new Makefile. 
Type make This builds the program. That is, make would be executed, it would look for the first target in Makefile 
and do what the instructions said. The expected end result would be to build an executable program. 
Now, as root, type make install. This again invokes make, make finds the target install in Makefile and files 
the directions to install the program. 
This is a very simplified explanation but, in most cases, this is what you need to know. With most programs, 
there will be a file named INSTALL that contains installation instructions that will fill you in on 
other considerations. For example, it is common to supply some options to the configure command to change the 
final location of the executable program. There are also other make targets such as clean that remove unneeded 
files after an install and, in some cases test which allows you to test the software between the make and 
make install steps.


47. mkitab:
===========

AIX:

mkitab Command
Purpose
Makes records in the /etc/inittab file.

Syntax
mkitab [ -i Identifier ] { [ Identifier ] : [ RunLevel ] : [ Action ] : [ Command ] }

Description
The mkitab command adds a record to the /etc/inittab file. 
The Identifier:RunLevel:Action:Command parameter string specifies the new entry to the /etc/inittab file. 
You can insert a record after a specific record using the -i Identifier flag. The command finds the field 
specified by the Identifier parameter and inserts the new record after the one identified by 
the -i Identifier flag.

Example:

To add a new record to the /etc/inittab file, telling the init command to handle a login on tty2, 
enter: 

mkitab "tty002:2:respawn:/usr/sbin/getty /dev/tty2"

To change currently existing entries from the file, use the chitab command. For example, to change 
tty2's runlevel, enter the command

chitab "tty002:23:respawn:/usr/sbin/getty /dev/tty2"

chitab "rcnfs:23456789:off:/etc/rc.nfs > /dev/console 2>&1 # Start NFS Daemons"


This is also why an /etc/inittab is usually much bigger in AIX compared to Solaris.

rmitab Command
Purpose
Removes records in the /etc/inittab file. 

Syntax
rmitab Identifier


Description
The rmitab command removes an /etc/inittab record. You can specify a record to remove by 
using the Identifier parameter. The Identifier parameter specifies a field of one to fourteen 
characters used to uniquely identify an object. If the Identifier field is not unique, the command is unsuccessful. 

Examples
To remove the tty entry for tty2 , enter:

rmitab "tty002"


48. Starting and stopping deamons:
==================================

AIX:
----

AIX has a unique way of managing processes: the System Resource Controller (SRC). The SRC takes 
the form of a daemon, "/usr/sbin/srcmstr", which is started by init via /etc/inittab. srcmstr manages requests 
to start, stop, or refresh a daemon or a group of daemons. Instead of typing the name of a 
daemon to start it, or instead of using the kill command to stop a daemon, you use an SRC command 
that does it for you. In this way you don't have to remember, for example, whether to use an ampersand 
when starting a daemon, or what signal to use when killing one. SRC also allows you to stop and start 
groups of related daemons with one command. 

AIX has a hierarchical organization of system processes, and this organization is configured into the ODM 
in the form of the SRCsubsys and SRCsubsvr object classes. Daemons at the lowest levels are subservers. 
On a newly loaded system the only subservers are those of the inetd subsystem: 
ftp, telnet, login, finger, etc. To view these subservers, use the odmget command: 

To start a subsystem, for example
# startsrc -s lpd

To stop a subsystem, for example
# stopsrc -s lpd

You can also use the refresh command, after for example editing a .conf file and you need the
subsystem to reparse the config file.
For example, you have started the httpd demon 

# startsrc -s httpd

Now you have edited the /etc/httpd.conf file. To refresh the deamon, use the following command:

# refresh -s httpd


To list the status of a subsystem, use for example
# lssrc -g nfs
# lssrc -s sshd

Subsystem     Group    Pid    Status
biod          nfs      11354  active
rpc.lockd     nfs      11108  active
nfsd          nfs             inoperative
rpc.statd     nfs             inoperative
rpc.mountd    nfs             inoperative
rpc.mountd    nfs             inoperative


Starting and stopping daemons in general:
-----------------------------------------

In general, and in most cases, daemons which are not under the control of some resource controller, can be
stopped or started in a way as shown in the following "stanza":

# <script_name> stop
# <script_name> start

In many occasions, a script associated with the daemon is available, that will take "stop"or "start"
as an argument.
 

49. Inodes, the superblock and related items:
=============================================


49.1 Solaris:
-------------

Following is a "light weight" discussion about the superblock and inodes in the UFS filesystem in Solaris:

When you create an UFS filesystem, the disk slice is divided into cylindergroups. The slice is then divided
into blocks to control and organize the structure of files within the cylinder group.
Each block performs a specific function in the filesystem. 
A UFS filesystem has the following types of blocks:

Boot block: stores information used when booting the system, and is the first 8KB in a slice (partition).
Superblock: stores much of the information about the filesystem. Its located after the bootblock.
Inode     : stores all information about a file except its name
datablock : stores data for each file

The bootblock stores the procedures used in booting the system. Without a bootblock the system does not boot.
If a filesystem is not used for booting, the bootblock is left blank. The bootblock appears only
in the first cylinder group (cylinder group 0) and is the first 8KB in a slice.

The superblock stores much of the information about the filesystem. Following are the items 
contained in a superblock:
- size and status of the fs
- label (filesystem name and volume name)
- size of the fs logical block
- date and time of the last update
- cylinder group size
- number of datablocks in a cylinder group
- summary data block
- fs state (clean, stable, or active)
- pathname of the last mount point

The superblock is located at the beginning of the disk slice and is replicated in each cilinder group.
Because it contains critical data, multiple superblocks are made when the fs is created.
A copy of the superblock for each filesystem is kept up-to-date in memory.
The sync command forces every superblock in memory to write its data to disk.

An inode contains all the information about a file except its name which is kept in a directory.
An inode is 128 bytes. For each file there corresponds one inode. 
The inode information is kept in the cylinder information block and contains the
following:

- the type of file (regular file, directory, block special, character special, link)
- mode of the file (rwxrwxrwx)
- number of hard links to the file
- userid of the owner
- groupid
- number of bytes in the file
- an array of 15 disk-block addresses
- date and time the file was last accessed
- date and time the file was last modified
- date and time the file was created

The maximum number of files per UFS file system is determined by the number of inodes
allocated for a filesystem. The number of inodes depends on the amount of diskspace that
is allocated for each inode and the total size of the filesystem.
By default, on inode is allocated for each 2KB of dataspace. You can change this default
with the newfs command. 

Inodes include pointers to the data blocks. Each inode contains 15 pointers: 


the first 12 pointers point directly to data blocks 
the 13th pointer points to an indirect block, a block containing pointers to data blocks 
the 14th pointer points to a doubly-indirect block, a block containing 128 addresses of singly indirect blocks 
the 15th pointer points to a triply indirect block (which contains pointers to doubly indirect blocks, etc.) 

-------------------------------
| | | | | | | | | | | | | | | |
-------------------------------
 | | | | | | | | | | | | | | |--------------------------
      data blocks        | |-----------|               |
                         |             |               |
                       -----         -----           -----
                       |   |         |   |           |   |
                       -----         -----           -----
                        |||           |||             |||
                        data         -----           -----
                                     |   |           |   |
                                     -----           -----
                                      |||             |||
                                      data           -----
                                                     |   |
                                                     -----
                                                      |||
                                                      data


---------------------------------------------------------------------------
|          |           | | | | | | | |        |  |  |  |      |  |        |
| B. B.    | S. B.     | Inodes  | | | ...    |  Many Data Blocks ......  |
|          |           | | | | | | | |        |  |  |  |      |  |        |
---------------------------------------------------------------------------

In order to create a UFS filesystem on a formatted disk that already has been divided into slices
you need to know the raw device filename of the slice that will contain the filesystem.
Example:

# newfs /dev/rdsk/c0t3d0s7

defaults on UFS on Solaris: 
blocksize 8192
fragmentsize 1024
one inode for each 2K of diskspace


49.2 AIX:
---------

Although we use the LVM to create Volume Groups, and Logical Volumes within a Volume Group,
a file system resides on a single logical volume. 
Every file and directory belongs to a file system within a logical volume. 

The mkfs (make file system) command, or crfs command, or the System Management Interface Tool (smit command) 
creates a file system on a logical volume. 

- crfs
The crfs command creates a file system on a logical volume within a previously created volume group. 
A new logical volume is created for the file system unless the name of an existing logical volume 
is specified using the -d. An entry for the file system is put into the /etc/filesystems file.

By the way, a newly installed AIX 5.x system has the following filesystem structure:

"/" root is a filesystem. Certain standard directories are present within "/", like for example /bin.
But also a set of separate filesystems like hd2=/usr, hd3=/tmp, hd9var=/var, are MOUNTED over the 
coresponding named directories or mountpoints.

                              /
                              |
                 ----------------------------------------
                 |      |     |      |      |     |     |
                /bin   /dev  /etc    /usr   /tmp  /var  /home
               directories           file systems


So, when you unmount all extra (later on) defined filesystems like /export, /software etc..
you still have / (with its standard directories like /etc, /bin etc..) and the standard filesystems 
like /usr etc..


inodes:
-------

-- Working with JFS i-nodes:
-- -------------------------


Files in the journaled file system (JFS) are represented internally as index nodes (i-nodes). Journaled file system 
i-nodes exist in a static form on disk and contain access information for the file as well as pointers to the 
real disk addresses of the file's data blocks. The number of disk i-nodes available to a file system is 
dependent on the size of the file system, the allocation group size (8 MB by default), and the number of bytes 
per i-node ratio (4096 by default). These parameters are given to the mkfs command at file system creation. 
When enough files have been created to use all the available i-nodes, no more files can be created, even if 
the file system has free space. The number of available i-nodes can be determined by using the df -v command. 
Disk i-nodes are defined in the /usr/include/jfs/ino.h file. 

When a file is opened, an in-core i-node is created by the operating system. The in-core i-node contains 
a copy of all the fields defined in the disk i-node, plus additional fields for tracking the in-core i-node. 
In-core i-nodes are defined in the /usr/include/jfs/inode.h file. 

Disk i-node Structure for JFS

Each disk i-node in the journaled file system (JFS) is a 128-byte structure. 

The offset of a particular i-node within the i-node list of the file system produces the unique number 
(i-number) by which the operating system identifies the i-node. A bit map, known as the i-node map, tracks the 
availability of free disk i-nodes for the file system. 

Disk i-nodes include the following information: 

Field         Contents  
i_mode        Type of file and access permission mode bits  
i_size        Size of file in bytes  
i_uid         Access permissions for the user ID  
i_gid         Access permissions for the group ID  
i_nblocks     Number of blocks allocated to the file  
i_mtime       Last time file was modified  
i_atime       Last time file was accessed  
i_ctime       Last time i-node was modified  
i_nlink       Number of hard links to the file  
i_rdaddr[8]   Real disk addresses of the data  
i_rindirect   Real disk address of the indirect block, if any  


It is impossible to change the data of a file without changing the i-node, but it is possible to change the i-node 
without changing the contents of the file. For example, when permission is changed, the information within the 
i-node (i_ctime) is modified, but the data in the file remains the same. 

The i_rdaddr field within the disk i-node contains 8 disk addresses. These addresses point to the first 
8 data blocks assigned to the file. The i_rindirect field address points to an indirect block. 
Indirect blocks are either single indirect or double indirect. Thus, there are three possible geometries 
of block allocation for a file: direct, indirect, or double indirect. Use of the indirect block and other 
file space allocation geometries are discussed in the article JFS File Space Allocation . 

Disk i-nodes do not contain file or path name information. Directory entries are used to link file names to 
i-nodes. Any i-node can be linked to many file names by creating additional directory entries with the 
link or symlink subroutine. To discover the i-node number assigned to a file, use the ls -i command. 

The i-nodes that represent files that define devices contain slightly different information from i-nodes 
for regular files. Files associated with devices are called special files. There are no data block addresses 
in special device files, but the major and minor device numbers are included in the i_rdev field. 

In normal situations, a disk i-node is released when the link count (i_nlink) to the i-node equals 0. 
Links represent the file names associated with the i-node. When the link count to the disk i-node is 0, 
all the data blocks associated with the i-node are released to the bit map of free data blocks for the file system. 
The i-node is then placed on the free i-node map. 

In-core i-node Structure

When a file is opened, the information in the disk i-node is copied into an in-core i-node for easier access. 
The in-core i-node structure contains additional fields which manage access to the disk i-node's valuable data. 
The fields of the in-core i-node are defined in the inode.h file. Some of the additional information tracked 
by the in-core i-node is: 

-Status of the in-core i-node, including flags that indicate: 
  An i-node lock 
  A process waiting for the i-node to unlock 
  Changes to the file's i-node information 
  Changes to the file's data 
-Logical device number of the file system that contains the file 
-i-number used to identify the i-node 
-Reference count. When the reference count field equals 0, the in-core i-node is released. 

When an in-core i-node is released (for instance with the close subroutine), the in-core i-node 
reference count is reduced by 1. If this reduction results in the reference count to the in-core i-node 
becoming 0, the i-node is released from the in-core i-node table, and the contents of the in-core i-node 
are written to the disk copy of the i-node (if the two versions differ). 


-- Working with JFS2 i-nodes:
-- --------------------------

Files in the enhanced journaled file system (JFS2) are represented internally as index nodes (i-nodes). 
JFS2 i-nodes exist in a static form on the disk and they contain access information for the files as well as 
pointers to the real disk addresses of the file's data blocks. The i-nodes are allocated dynamically by JFS2. 

When a file is opened, an in-core i-node is created by the operating system. The in-core i-node contains 
a copy of all the fields defined in the disk i-node, plus additional fields for tracking the in-core i-node. 
In-core i-nodes are defined in the /usr/include/j2/j2_inode.h file. 


Disk i-node Structure for JFS2
Each disk i-node in JFS2 is a 512 byte structure. The index of a particular i-node allocation map of the 
file system produces the unique number (i-number) by which the operating system identifies the i-node. 
The i-node allocation map tracks the location of the i-nodes on the disk as well as their availability. 

Disk i-nodes include the following information: 

Field      Contents  
di_mode    Type of file and access permission mode bits  
di_size    Size of file in bytes  
di_uid     Access permissions for the user ID  
di_gid     Access permissions for the group ID  
di_nblocks Number of blocks allocated to the file  
di_mtime   Last time file was modified  
di_atime   Last time file was accessed  
di_ctime   Last time i-node was modified  
di_nlink   Number of hard links to the file  
di_btroot  Root of B+ tree describing the disk addresses of the data  


50. sendmail:
=============

Solaris:
--------


To receive SMTP mail from the network, run sendmail as a daemon during system startup. The sendmail daemon listens 
to TCP port 25 and processes incoming mail. In most cases the code to start sendmail is already in one of 
your boot scripts. If it isn't, add it. 


# Start the sendmail daemon:
if [ -x /usr/sbin/sendmail ]; then
  echo "Starting sendmail daemon (/usr/sbin/sendmail -bd -q 15m)..."
  /usr/sbin/sendmail -bd -q 15m
fi

First, this code checks for the existence of the sendmail program. If the program is found, the code displays 
a startup message on the console and runs sendmail with two command-line options. 
One option, the -q option, tells sendmail how often to process the mail queue. In the sample code, the queue is 
processed every 15 minutes (-q15m), which is a good setting to process the queue frequently. 
Don't set this time too low. Processing the queue too often can cause problems if the queue grows very large, 
due to a delivery problem such as a network outage. For the average desktop system, every hour (-q1h) or 
half hour (-q30m) is an adequate setting.

The other option relates directly to receiving SMTP mail. The option (-bd) tells sendmail to run as a daemon 
and to listen to TCP port 25 for incoming mail. Use this option if you want your system to accept incoming TCP/IP mail.

The Linux example is a simple one. Some systems have a more complex startup script. 
Solaris 2.5, which dedicates the entire /etc/init.d/sendmail script to starting sendmail, is a notable example. 
The mail queue directory holds mail that has not yet been delivered. It is possible that the system went down while 
the mail queue was being processed. Versions of sendmail prior to sendmail V8, such as the version that comes 
with Solaris 2.5, create lock files when processing the queue. Therefore lock files may have been left 
behind inadvertently and should be removed during the boot. Solaris checks for the existence of the mail queue directory 
and removes any lock files found there. If a mail queue directory doesn't exist, it creates one. The additional 
code found in some startup scripts is not required when running sendmail V8. 

All you really need is the sendmail command with the -bd option.


nlih30207858-08:/etc/rc2.d $ ps -ef | grep "sendmail"
   smmsp   412     1  0   Jan 09 ?        0:00 /usr/lib/sendmail -Ac -q15m
    root   413     1  0   Jan 09 ?        0:03 /usr/lib/sendmail -bd -q15m


Setup sendmail user and group
Before doing anything else, check that the mail user and group are set up. 
Look in /etc/passwd for user smmsp with uid 25. Then check in /etc/group for group smmsp with gid 25. 
If they are there, good. If not, add them with: 

groupadd -g 25 smmsp 
useradd -u 25 -g smmsp -d / smmsp 

Then edit /etc/passwd and remove the shell. You want the line to look something like "smmsp:x:25:25::/:". 
I notice that Slackware has the line set to "smmsp:x:25:25:smmsp:/var/spool/clientmqueue:", and that's okay too, 
so I leave it at that. 

In Solaris you should have an entry in passwd as follows:
smmsp:x:25:25:SendMail Message Submission Program:/:/sbin/noshell


Stoping and starting sendmail
/etc/rc2.d/S88sendmail stop then start on Sun systems.
/etc/rc.d/init.d/sendmail stop then start on Linux systems.


Note: About mail:
-----------------

mail -f    = show mail in your box
enter the number at the ? prompt to read the mail

examples:

# mail -f
Mail [5.2 UCB] [AIX 5.X]  Type ? for help.
"/root/mbox": 0 messages


# mail -f
Mail [5.2 UCB] [AIX 5.X]  Type ? for help.
"/root/mbox": 3 messages
>   1 root              Tue Nov  1 17:05  13/594
    2 MAILER-DAEMON     Sun Oct 30 07:59 109/3527 "Postmaster notify: see trans"
    3 daemon            Wed Jan 26 10:59  34/1618
? 1
Message  1:
From root Tue Nov  1 17:05:34 2005
Date: Tue, 1 Nov 2005 17:05:34 +0100
From: root
To: root

..
..


51. SAR:
========

AIX:
----

sar Command 
Purpose
Collects, reports, or saves system activity information. 

Syntax
/usr/sbin/sar [ { -A | [ -a ] [ -b ] [ -c ] [ -k ] [ -m ] [ -q ] [ -r ] [ -u ] [ -V ] [ -v ] [ -w ] [ -y ] } ] 
[ -P ProcessorIdentifier, ... | ALL ] [ -ehh [ :mm [ :ss ] ] ] [ -fFile ] [ -iSeconds ] [ -oFile ] [ -shh [ :mm [ :ss ] ] ]
[ Interval [ Number ] ]

The sar command writes to standard output the contents of selected cumulative activity counters in the operating system. 
The accounting system, based on the values in the Number and Interval parameters, writes information 
the specified number of times spaced at the specified intervals in seconds. The default sampling interval 
for the Number parameter is 1 second. The collected data can also be saved in the file specified by the -o File flag.

The sar command extracts and writes to standard output records previously saved in a file. This file can be either 
the one specified by the -f flag or, by default, the standard system activity daily data file, 
the /var/adm/sa/sadd file, where the dd parameter indicates the current day.

To report system unit activity, enter: 
# sar

To report current tty activity for each 2 seconds for the next 20 seconds, enter: 
# sar -y -r 2 20

To watch system unit for 10 minutes and sort data, enter: 
# sar -o temp 60 10

To report cpu activity for the first two processors, enter: 
# sar -u -P 0,1
cpu  %usr  %sys  %wio  %idle
0      45    45     5      5
1      27    65     3      5

To report message, semaphore, and cpu activity for all processors and system-wide, enter: 
# sar -mu -P ALL
On a four-processor system, this produces output similar to the following (the last line indicates 
system-wide statistics for all processors): 
cpu  msgs/s  sema/s  %usr  %sys  %wio  %idle
0      7       2       45    45     5     5
1      5       0       27    65     3     5
2      3       0       55    40     1     4
3      4       1       48    41     4     7
-     19       3       44    48     3     5

To collect all the statistics that sar monitors at 60 second intervals for a 10 hour period. 
Also redirects console output to null device

# nohup sar -A -o /tmp/SAR.STATS 60 600 > /dev/null &

The -A switch will cause all of the data collected by sar to be reported. The -ubcwyaqvm switch prevents some 
data from being reported.

On the obsolete AIX versions 4.2 throught 5.1, you should also make sure that the schedtune and vmtune utilities 
can be found in /usr/samples/kernel . If they're not there, install bos.adt.samples. These utilites are used 
to report on the tunable parameters for the VMM and the scheduler, and SarCheck is much more useful if it can 
analyze the values of these parameters. On newer versions of AIX, this is not necessary because we look at 
ioo, schedo, vmo, and vmstat -v for the data we need.


Solaris:
--------

Some specifics for Solaris with regards to the sar command:

How to check File Access:
# sar -a

How to check Buffer Activity: (metadata= inodes, cylinder group blocks etc..)
# sar -b

How to check System Call Statistics:
# sar -c

How to check Disk Activity:
# sar -d

How to check Page-Out and memory:
# sar -g

How to check Kernel Memory Allocation:
# sar -k

How to check Interprocess Communication:
# sar -m

How to check Page-In activity:
# sar -p

How to check Queue Activity:
# sar -q

How to check Unused Memory:
# sar -r

How to check CPU Utilization:
# sar -u


52. Xwindows:
=============

52.1 About the XWindows system:
-------------------------------

The X Window System is a graphics system primarily used on Unix systems (and, less commonly, on VMS, MVS, 
and MS-Windows systems) that provides an inherently client/server oriented base for displaying windowed graphics. 
It provides a public protocol by which client programs can query and update information on X servers. 

The representation of "client" and "server" appears a little bit backwards from most client/server systems. 
Usually, people expect the "local" programs to be called a "client," and for the "server" to be something off 
in the back room. Which nicely represents the way database applications usually work, with many "clients" 
connecting to a central database "server." 

X reverses these roles, which, as the locations of the hosts are reversed, is quite appropriate: 


An X server is a program that manages a video system (and possibly other "interactive" I/O devices such as mice, 
keyboards, and some more unusual devices).

The X server thus typically runs on a user's desktop, typically a relatively non-powerful host that would commonly 
be termed a "client system." It is, in this context, nonetheless acting as a server as it provides graphics services. 

On the other hand, an X client is typically an application program which must connect to an X Server 
in order to display things. 

The client will often run on another host, often a powerful Unix box that would commonly be known as a "server." 
The X client might itself also be a "server process" from some other point of view; there is no contradiction here. 
(Although calling it such may be unwise as it will naturally result in further confusion.)

X nomenclature treats anything that provides display services as an X server. Which is not particularly different 
from someone saying that a program that provides database services is a database server. 

The upshot (and the point) of all this is that this allows use of the X system that allows processes on 
various computers on a network to display stuff on display devices elsewhere on the network.

- GNOME:

GNOME - GNU Network Object Model Environment
GNOME is not a window manager. 

GNOME is an application framework that consists of libraries to assist in application development and a set 
of applications that use those libraries. 

It seeks to provide: 

An API for interapplication communications. This will represent a set of objects running via a CORBA 
Object Request Broker called ORBit.

This is crucial piece of the infrastructure, with which they intend to implement a component architecture 
to build "compound documents" not entirely unlike OpenDoc; without this, GNOME is merely a "pretty face," 
consuming memory and disk space for relatively little value. 

This description strongly parallels that of CDE... 

- K Desktop Environment - KDE

The KDE (K Desktop Environment) Project is building an integrated desktop environment including a window manager, 
file manager/web browser, and other components using the Trolltech "Qt" toolset, a development toolset written 
for C++ that allows applications to be deployed atop either X11 or Win32. 

KDE had been using the MICO CORBA ORB to construct an application embedding framework known as KOM and OpenParts. 
According to the [ KDE-Two: Second KDE Developers Conference], they found themselves unable to use 
the standardized CORBA framework, citing problems with concurrency, reliability and performance, and have 
instead decided to create Yet Another IPC Framework involving a shared library called libICE. 

On the other hand, the KDE Technology Overview for Version 2.0 provides a somewhat different story, 
so it's not completely clear just what is going on; they indicate the use of an IPC scheme called DCOP, 
indicating it to be a layer atop libICE, with the option of also using XML-RPC as an IPC scheme.


52.2 Running Cygwin on a PC, to have a Xwin Server:
---------------------------------------------------

Example of starting a xwin session

C:\cygwin\usr\X11R6\bin\XWin.exe -query hostname -fullscreen -fp tcp/hostname:7100". 


X &
xhost +
export DISPLAY=:0

When using X from a terminal server session, take note of the right ip and port.


52.3 XWin on AIX:
-----------------

The xdm (X Display Manager) command manages a collection of X displays, which may be on the local host 
or remote servers. The design of the xdm command was guided by the needs of X terminals as well as 
the X Consortium standard XDMCP, the X Display Manager Control Protocol. The xdm command provides services 
similar to those provided by the init, getty, and login commands on character terminals: prompting for 
login name and password, authenticating the user, and running a session.

Starting xdm 
xdm is typically started at system boot time. This is typically done in either an rc file in the /etc directory, 
or in the inittab file. 

Starting xdm in an rc file is usually simply a matter of adding the desired command line to the file, 
as in the example below. 

/usr/bin/X11/xdm -daemon -config /usr/lib/X11/xdm/xdm-config &

IBM wants xdm to integrate into their src subsystem. The AIX version of the above command is a bit different. 

start /usr/bin/X11/xdm $src_running

The problem with this is that since xdm is not supported in R4 under AIX, it is not really integrated into 
the src subsystem, so the attendant startup, shutdown, and other src commands do not work properly. 
An alternative, which works on many other systems as well, is to start xdm from the inittab file. 

xdm:2:respawn:/usr/bin/X11/xdm -nodaemon -config /usr/lib/X11/xdm-config

The -nodaemon flag keeps xdm from starting a daemon and exiting, which would cause the respawn option 
to start another copy of xdm, whereupon the process would repeat itself, quickly filling up your 
process table and dragging your system to its knees attempting to run oodles of managers and servers. 
xdm attempts to use system lock calls to prevent this from happening. It nevertheless happens on some systems. 


52.4 XWin on Linux:
-------------------

52.4.1 Redhat:
--------------

While the heart of Red Hat Linux is the kernel, for many users, the face of the operating system is the 
graphical environment provided by the X Window System, also called simply X. 
This chapter is an introduction to the behind-the-scenes world of XFree86, the open-source implementation 
of X provided with Red Hat Linux. 

X uses a client-server architecture. An X server process is started and X client processes can connect to it 
via a network or local loopback interface. The server process handles the communication with the hardware, 
such as the video card, monitor, keyboard, and mouse. The X client exists in the user-space, issuing requests 
to the X server. 

The X server performs many difficult tasks using a wide array of hardware, requiring detailed configuration. 
If some aspect of your system changes, such as the monitor or video card, XFree86 will need 
to be reconfigured. In addition, if you are troubleshooting a problem with XFree86 that cannot 
be solved using a configuration utility, such as the X Configuration Tool (redhat-config-xfree86), 
you may need to access its configuration file directly. 

Red Hat Linux 8.0 uses XFree86 version 4.2 as the base X Window System, which includes the various 
necessary X libraries, fonts, utilities, documentation, and development tools. 

- The X Window System resides primarily in two locations in the file system: 

/usr/X11R6/ directory 
A directory containing X client binaries (the bin directory), assorted header files (the include directory), 
libraries (the lib directory), and manual pages (the man directory), and various other X documentation 
(the /usr/X11R6/lib/X11/doc/ directory). 

/etc/X11/ directory 
The /etc/X11/ directory hierarchy contains all of the configuration files for the various components 
that make up the X Window System. This includes configuration files for the X server itself, 
the X font server (xfs), the X Display Manager (xdm), and many other base components. 
Display managers such as gdm and kdm, as well as various window managers, and other X tools also store their 
configuration in this hierarchy. 


- The Redhat X configuration tool:

from command line: # redhat-config-xfree86
from X: go to the Main Menu Button (on the Panel) => System Tools => Display

- XFree86 configuration file "etc/X11/XF86Config"

XFree86 version 4 server is a single binary executable - /usr/X11R6/bin/XFree86. This server dynamically 
loads various X server modules at runtime from the "/usr/X11R6/lib/modules/" directory including video drivers, 
font engine drivers, and other modules as needed. Some of these modules are automatically loaded by the server, 
whereas some are optional features that you must specify in the XFree86 server's configuration file, 
"/etc/X11/XF86Config", before they can be used. The video drivers are located in the 
/usr/X11R6/lib/modules/drivers/ directory. The DRI hardware accelerated 3D drivers are located in the 
/usr/X11R6/lib/modules/dri/ directory. 

- Running a simple X client:

You do not have to run a complicated window manager in conjunction with a particular desktop environment 
to use X client applications. Assuming that you are not already in an X environment and do not have 
an .xinitrc file in your home directory, type the xinit command to start X with a basic terminal window 
(the default xterm application). You will see that this basic environment utilizes your keyboard, mouse,
video card, and monitor with the XFree86 server, using the server's hardware preferences. 
Type exit at the xterm prompt to leave this basic X environment.

- Running X: The startx command

When you start X using the "startx" command, a pre-specified desktop environment is utilized. 
To change the default desktop environment used when X starts, open a terminal and type the 
switchdesk command. This brings up a graphical utility that allows you to select the desktop environment 
or window manager to use the next time X starts.

Most users run X from runlevels 3 or 5. Runlevel 3 places your system in multi-user mode with full 
networking capabilities. The machine will boot to a text-based login prompt with all necessary 
preconfigured services started. Most servers are run in runlevel 3, as X is not necessary to provide 
any services utilized by most users. Runlevel 5 is similar to 3, except that it automatically starts X 
and provides a graphical login screen. Many workstation users prefer this method, because it never forces 
them to see a command prompt. 

The default runlevel used when your system boots can be found in the /etc/inittab file. 
If you have a line in that file that looks like id:3:initdefault:, then your system will boot 
to runlevel 3. If you have a line that looks like id:5:initdefault:, your system is set to boot 
into runlevel 5. As root, change the runlevel number in this file to set a different default. 
Save the file and restart your system to verify that it boots to the correct runlevel.

When in runlevel 3, the preferred way to start an X session is to type the startx command. 
startx, a front-end to the xinit program, launches the XFree86 server and connects the X clients to it.


53. TAPE DRIVES:
================

53.1 AIX:
---------

Some usefull examples, using a tape:
------------------------------------

# mksysb -i /dev/rmt0
# backup -0 -uf /dev/rmt0 /data
# tctl -f /dev/rmt0 rewind
# savevg -if /dev/rmt0 uservg

# lsdev -Cc tape
rmt0  Available  10-60-00-5,0  SCSI 8mm Tape Drive

# lsattr -El rmt0
mode           yes     Use DEVICE BUFFERS during writes    True
block_size     1024    Block size (0=variable length)      True
extfm          no      Use EXTENDED file marks             True
ret            no      RETENSION on tape change or reset   True
..
..
To list the default values for that tape device (-D flag), use
# lsattr -l -D rmt0

# lscfg -vl rmt0
Manufacturer...............EXABYTE
Machine Type and Model.....IBM-20GB
Device Specific(Z1)........38zA
Serial Number..............60089837
..
..

Its very important which /dev/rmtx.y you use in some backup command like tar. See the following table:

special file       rewind on close         retension on open     density setting
--------------------------------------------------------------------------------
/dev/rmtx          yes                     no                     #1
/dev/rmtx.1        no                      yes                    #1
/dev/rmtx.2        yes                     yes                    #1
/dev/rmtx.3        no                      yes                    #2
/dev/rmtx.4        yes                     no                     #2
/dev/rmtx.5        no                      no                     #2
/dev/rmtx.6        yes                     yes                    #2
/dev/rmtx.7        no                      yes                    #2


54. WSM Web based System Manager:
=================================

AIX only:
---------

Web based System manager is a graphical user interface administration tool for AIX 5.x systems.
This is a Java based suite of system management tools. 
To start WSM, use the following command from the command line of a graphical console:
# wsm

- The WSM can be run in stand-alone mode, that is, you can use the tool to perform system administration
on the AIX system you are currently running on. 
- However, the WSM also supports a client-server environment.
In this environment, it is possible to administer an AIX system from a remote PC or from another AIX system
using a graphics terminal.
In this environment, the AIX system being administered is the Server and the system you are
performing the administration functions from is the client.

The client can operate in either application mode on AIX with jave 1.3, or in applet mode
on platforms that support Java 1.3. Thus, the AIX system can be managed from another AIX system
or from a PC with a browser and Java.


55. SOFTWARE INSTALLATIONS ON AIX 5.x:
======================================


55.1 Installing VisualAge C++ / C compiler on AIX 5.x:
======================================================

IBM VisualAge is a commandline C and C++ compiler for the AIX operating system.
You can use VisualAge as a C compiler for files with a .c suffix, or as a C++ compiler
for files with a .C, .cc, .cpp or .cxx suffix. The compiler processes your text-based
program source files to create an executable object module.
In most cases you should use the xlC command to compile your C++ source files, 
and the xlc command to compile C source files.
You can use VisualAge to develop both 32 bit and 64 bit appliactions.

If you want to install VisualAge C++ for AIX, check first if the following required filesets are installed.

bos.adt.include                 Base Application Development Include Files
bos.adt.l1b                     Base Application Development Libraries
bos.adt.l1bm                    Base Application Development Math Libraries
bos.net.ncs                     Base Network Computing Services
1for_ls.compat                  License Use Management version 4 compatibility
1for_ls.base                    License Use Management version 4 Base

Use the following command to see whether these are installed:

# lslpp -h bos.adt.include bos.adt.l1b bos.adt.l1bm \
           bos.net.ncs 1for_ls.compat 1for_ls.base

For some components, the following needs to be installed as well:
X11.base.rte, bos.rte.11bpthreads, 1pfx.rte, 1for_ls.base.gu1, 1for_ls.client.gui

Make sure the AppDev package has been installed in order to have access to commands like "make" etc...


Notes:
======

Note 1:
-------

IBM C and C++ Compilers

  Usage:
     xlC [ option | inputfile ]...
     xlc [ option | inputfile ]...
     cc [ option | inputfile ]...
     c89 [ option | inputfile ]...
     xlC128 [ option | inputfile ]...
     xlc128 [ option | inputfile ]...
     cc128 [ option | inputfile ]...
     xlC_r [ option | inputfile ]...
     xlc_r [ option | inputfile ]...
     cc_r [ option | inputfile ]...
     xlC_r4 [ option | inputfile ]...
     xlc_r4 [ option | inputfile ]...
     cc_r4 [ option | inputfile ]...
     CC_r4 [ option | inputfile ]...
     xlC_r7 [ option | inputfile ]...
     xlc_r7 [ option | inputfile ]...
     cc_r7 [ option | inputfile ]...

  Description:
     The xlC and related commands compile C and C++ source files.
     They also processes assembler source files and object files. Unless the
     -c option is specified, xlC calls the linkage editor to produce a
     single object file. Input files may be any of the following:
       1. file name with .C suffix: C++ source file
       2. file name with .i suffix: preprocessed C or C++ source file
       3. file name with .c suffix: C source file
       4. file name with .o suffix: object file for ld command
       5. file name with .s suffix: assembler source file
       6. file name with .so suffix: shared object file


xlc : ANSI C compiler with UNIX header files. Use this command for most new C programs. 

cc  : Extended C compiler. This command invokes a non-ANSI compliant compiler. Use it for legacy C programs. 

c89 : Strict ANSI C compiler with ANSI header files. Use this command for maximum portability of your C programs. 

xlC : Native (i.e., non-cfront) C++ compiler. Use this command for compiling and linking all C++ code. 


The following additional command names, plus their "-tst" and "-old" variants, are also available at SLAC 
for compiling and linking reentrant programs: 
xlc_r, cc_r; xlC_r            : For use with POSIX threads 
xlc_r4, cc_r4; xlC_r4, CC_r4  : For use with DCE threads 


Note 2:
-------

install VisualAge C++:

- insert CD
- smitt install_latest
- press F4 to display all devices
- select CDROM device
- press F4 to select the filesets you want to install

After you have installed VisualAge C++ for AIX, you need to enroll your license for the product
before using it.

VisualAge C++ is not automatically installed in /usr/bin. To invoke the compiler without
having to specify the full path, do one of the following steps:
- create symbolic links for the specific driver contained in /usr/vacpp/bin and
  /usr/vac/bin to /usr/bin
- add /usr/vacpp/bin and /usr/vac/bin to your path


Note 3:
-------

Note: usage of vac examples:

Example 1:

xlc -I/usr/local/include -L/usr/local/lib simple.c -lcurl -lz 

Example 2:

The commands listed below invoke versions of the XL C compiler, which then translates C source code statements 
into object code, sends .s files to the assembler, and links the resulting object files with object files 
and libraries specified on the command line in the order in which they are listed, producing a single executable file 
called "a.out" by default. The -o flag may be used to rename the resulting executable file. 
Where commands are shown, they are generally given as generic examples. In any case, you type the appropriate 
command and press the Return (or Enter) key as usual. 

You compile a source program and/or subprograms by typing the following command: 

xlc cmd_line_opts input_files

input_files are source files (.c or .i), object files (.o), or assembler files (.s) 

For example, to compile a C program whose source is in source file "prog.c" you would enter the following command: 

xlc prog.c

After the xlc command completes, you will see a new executable file named "a.out" in your directory. 

If you specify -c as a compiler option, XL C only compiles the source program, producing an object file 
whose default name is that of the program with a .o extension. Before running the program, 
you must invoke the linkage editor phase. Either invoke the linker using the ld command or issue 
the xlc command a second time without the -c option, using the desired object (.o) filenames. 

For example, you may compile a subprogram "second.c" and then use it in your main program "prog.c" 
with the following sequence of commands: 

xlc -c second.c
xlc prog.c second.o 

Some important files on a test system:

# find -name "crt0_64.o" -print

/usr/lib/crt0_64.o
/usr/css/lib/crt0_64.o


# find -name "crt0_32.o" -print

/usr/lib/crt0_64.o
/usr/css/lib/crt0_64.o


Check out if vac is installed:

root@zd110l02:/root#lslpp -l vacpp*
lslpp: Fileset vacpp* not installed.


root@zd110l02:/root#lslpp -l xlC*
  Fileset                      Level  State      Description
  ----------------------------------------------------------------------------
Path: /usr/lib/objrepos
  xlC.aix50.rte              7.0.0.6  COMMITTED  C Set ++ Runtime for AIX 5.0
  xlC.cpp                    6.0.0.0  COMMITTED  C for AIX Preprocessor
  xlC.rte                    7.0.0.1  COMMITTED  C Set ++ Runtime


Note 4:
-------

At a certain organisation, the installation goes as follows:

install:

# cd /prj/tmp
# tar xv       (tape in rmt0)
# ./driver

config licentie:

# /usr/vac/bin/vac6_licentie
# l4blt -r6
# /usr/opt/ifor/ls/aix/bin/i4blt -r6

test:

- using existing sourcefile:

# cd /prj/vac/cctst
# cc fac.c -o fac
# ./fac

Or...

- make a simple c source and compile it:

#include <stdio.h>
int main(void)
{
   printf("Hello World!\n");
   return 0;
}

now compile it
# /usr/vac/bin/xlc hello.c -o hello

now run it
# ./hello


Note 5: LUM
-----------

i4lmd - Network License Server Subsystem

The i4lmd subsystem starts the network license server on the local node. 

Examples
Start a license server and do not log checkin, vendor, product, timeout, or message events: 

startsrc -s i4lmd -a "-no cvptm"

Start a license server changing the default log-file: 

startsrc -s i4lmd -a "-l /ifor/ls/my_log"


On an example p520 systeem:
---------------------------

In /etc/inittab:

i4ls:2:wait:/etc/i4ls.rc > /dev/null 2>&1 # Start i4ls

cat /etc/i4ls.rc

#!/bin/ksh
# IBM_PROLOG_BEGIN_TAG
# This is an automatically generated prolog.
#
# bos520 src/bos/usr/opt/ifor/var/i4ls.rc 1.8
#
# Licensed Materials - Property of IBM
#
# (C) COPYRIGHT International Business Machines Corp. 1996,2001
# All Rights Reserved
#
# US Government Users Restricted Rights - Use, duplication or
# disclosure restricted by GSA ADP Schedule Contract with IBM Corp.
#
# IBM_PROLOG_END_TAG
/usr/opt/ifor/ls/os/aix/bin/i4cfg -start -nopause
exit 0

On an example p550 system 29-12-2006, all apps down:
----------------------------------------------------

# ps -ef

     UID     PID    PPID   C    STIME    TTY  TIME CMD
    root       1       0   0   Dec 11      -  3:08 /etc/init
    root  327918       1   0   Dec 11      -  0:00 /usr/lib/errdemon
    root  352504       1   0   Dec 11      -  0:00 /usr/ccs/bin/shlap64
    root  360466       1   0   Dec 11      - 253:18 /usr/sbin/syncd 60
    root  548880 1724510   0 08:33:45  pts/0  0:00 -ksh
    root  585948  548880   1 09:11:19  pts/0  0:00 ps -ef
  cissys  880788 1060964   0 09:07:51      -  0:00 /usr/sbin/sftp-server
    root  983044 1011962   0   Dec 11      -  0:00 /usr/sbin/qdaemon
    root  999432       1   0   Dec 11      -  0:00 /usr/sbin/uprintfd
    root 1003764       1   0   Dec 11      -  0:34 /usr/sbin/cron
    root 1011962       1   0   Dec 11      -  0:00 /usr/sbin/srcmstr
    root 1024034       1   0   Dec 11      -  0:00 /usr/local/sbin/syslog-ng -f /usr/local/etc/syslog-ng.conf
    root 1028102       1   0   Dec 11      -  0:00 ./mflm_manager
    root 1036402 1011962   0   Dec 11      -  0:00 /etc/ncs/llbd
    root 1040402 1052716   0   Dec 11      -  0:00 /usr/opt/ifor/bin/i4lmd -l /var/ifor/logdb -n clwts
    root 1052716 1011962   0   Dec 11      -  0:44 /usr/opt/ifor/bin/i4lmd -l /var/ifor/logdb -n clwts
    root 1056788 1011962   0   Dec 11      -  0:00 /usr/sbin/rsct/bin/IBM.AuditRMd
  cissys 1060964 1532138   0 09:07:51      -  0:01 sshd: cissys@notty
    root 1065016 1011962   0   Dec 11      -  0:05 /usr/sbin/rsct/bin/IBM.CSMAgentRMd
    root 1073192 1011962   0   Dec 11      -  0:00 /usr/sbin/rsct/bin/IBM.ServiceRMd
    root 1077274       1   0   Dec 11      -  0:01 /opt/hitachi/HNTRLib2/bin/hntr2mon -d
    root 1081378 1011962   0   Dec 11      -  0:28 /usr/DynamicLinkManager/bin/dlmmgr
    root 1085478 1011962   0   Dec 11      -  0:06 /etc/ncs/glbd
    root 1089574 1101864   0   Dec 11      -  0:00 /usr/opt/ifor/bin/i4llmd -b -n wcclwts -l /var/ifor/llmlg
    root 1101864 1011962   0   Dec 11      -  3:14 /usr/opt/ifor/bin/i4llmd -b -n wcclwts -l /var/ifor/llmlg
    root 1110062 1011962   0   Dec 11      -  0:01 /usr/sbin/rsct/bin/rmcd -a IBM.LPCommands -r
    root 1114172 1011962   0   Dec 11      -  0:00 /usr/sbin/rsct/bin/IBM.ERrmd
    root 1122532 1167500   0 08:23:22      -  0:00 sshd: reserve [priv]
    root 1126476       1   0   Dec 27   lft0  0:00 -ksh
    root 1167500 1011962   0 03:17:38      -  0:00 /usr/sbin/sshd -D
  oracle 1175770       1   0   Dec 11      - 12:29 /apps/oracle/product/9.2/bin/tnslsnr listener -inherit
    root 1532138 1167500   0 09:07:50      -  0:00 sshd: cissys [priv]
    root 1708224 1126476   4 08:40:14   lft0  0:45 tar -cvf /dev/rmt0 /prj/was
 reserve 1724510 1786036   0 08:23:34  pts/0  0:00 -ksh
 reserve 1786036 1122532   0 08:23:34      -  0:00 sshd: reserve@pts/0


inittab:
--------

init:2:initdefault:
brc::sysinit:/sbin/rc.boot 3 >/dev/console 2>&1 # Phase 3 of system boot
powerfail::powerfail:/etc/rc.powerfail 2>&1 | alog -tboot > /dev/console # Power Failure Detection
mkatmpvc:2:once:/usr/sbin/mkatmpvc >/dev/console 2>&1
atmsvcd:2:once:/usr/sbin/atmsvcd >/dev/console 2>&1
load64bit:2:wait:/etc/methods/cfg64 >/dev/console 2>&1 # Enable 64-bit execs
tunables:23456789:wait:/usr/sbin/tunrestore -R > /dev/console 2>&1 # Set tunables
rc:23456789:wait:/etc/rc 2>&1 | alog -tboot > /dev/console # Multi-User checks
rcemgr:23456789:once:/usr/sbin/emgr -B > /dev/null 2>&1
fbcheck:23456789:wait:/usr/sbin/fbcheck 2>&1 | alog -tboot > /dev/console # run /etc/firstboot
srcmstr:23456789:respawn:/usr/sbin/srcmstr # System Resource Controller
rctcpip:23456789:wait:/etc/rc.tcpip > /dev/console 2>&1 # Start TCP/IP daemons
sniinst:2:wait:/var/adm/sni/sniprei > /dev/console 2>&1
: rcnfs:23456789:wait:/etc/rc.nfs > /dev/console 2>&1 # Start NFS Daemons
cron:23456789:respawn:/usr/sbin/cron
: piobe:2:wait:/usr/lib/lpd/pio/etc/pioinit >/dev/null 2>&1  # pb cleanup
qdaemon:23456789:wait:/usr/bin/startsrc -sqdaemon
: writesrv:23456789:wait:/usr/bin/startsrc -swritesrv
uprintfd:23456789:respawn:/usr/sbin/uprintfd
shdaemon:2:off:/usr/sbin/shdaemon >/dev/console 2>&1 # High availability daemon
l2:2:wait:/etc/rc.d/rc 2
logsymp:2:once:/usr/lib/ras/logsymptom # for system dumps
: itess:23456789:once:/usr/IMNSearch/bin/itess -start search >/dev/null 2>&1
diagd:2:once:/usr/lpp/diagnostics/bin/diagd >/dev/console 2>&1
: httpdlite:23456789:once:/usr/IMNSearch/httpdlite/httpdlite -r /etc/IMNSearch/httpdlite/httpdlite.conf & >/dev/console 2>&1
ha_star:h2:once:/etc/rc.ha_star >/dev/console 2>&1
cons:0123456789:respawn:/usr/sbin/getty /dev/console
hntr2mon:2:once:/opt/hitachi/HNTRLib2/etc/D002start
dlmmgr:2:once:startsrc -s DLMManager
ntbl_reset:2:once:/usr/bin/ntbl_reset_datafiles
rcml:2:once:/usr/sni/aix52/rc.ml > /dev/console 2>&1
perfstat:2:once:/usr/lib/perf/libperfstat_updt_dictionary >/dev/console 2>&1
ctrmc:2:once:/usr/bin/startsrc -s ctrmc > /dev/console 2>&1
tty1:2:off:/usr/sbin/getty /dev/tty1
tty0:2:off:/usr/sbin/getty /dev/tty0
: i4ls:2:wait:/etc/i4ls.rc > /dev/null 2>&1 # Start i4ls
mF:2345:wait:sh /etc/mflmrcscript > /dev/null 2>&1
i4ls:2:wait:/etc/i4ls.rc > /dev/null 2>&1 # Start i4ls
documentum:2:once:/etc/rc.documentum start >/dev/null 2>&1 


Note 7:
-------


IBM C/C++ Compilers
This describes the IBM implementation of the C and C++ compilers. 

Contents
Invoking the Compiler 
C Compiler Modes 
C++ Compiler Modes 
Source Files and Preprocessing 
Default Datatype Sizes 
Distributed-memory parallelism 
Shared-memory parallelism 
64-bit addressing 
Optimization 
Related Information 
Memory Management 
Porting programs from the Crays to the SP 
Mixing C and Fortran 

--------------------------------------------------------------------------------

Invoking the Compiler
The IBM C compiler is described in the IBM C for AIX User's Manual and the IBM C++ compiler is described 
in the IBM Visual Age C++ Batch Compiler manual. Both of these manuals are on line. 

As with the IBM XL Fortran compiler, there are several different commands that invoke the C or C++ compilers, 
each of which is really an alias for the main C or C++ command packaged with a set of commonly used options. 

The most basic C compile is of the form 

% xlc source.c

This will produce an executable named a.out. The other C Compiler modes are described below in the 
section C Compiler Modes. 

The most basic C++ compile is of the form 

%	xlC source.C

This will produce an executable named a.out. The other C++ Compiler modes are described below 
in the section C++ Compiler Modes. 

Note: There is no on-line man page for the C++ compiler. "man xlC" brings up the man page for the C compiler. 
For complete documentation of C++ specific options and conventions see the on-line C++ manual. 
The commands xlc, mpcc, and mpCC all have on-line man pages. 

C Compiler Modes
There are four basic compiler invocations for C compiles: xlc, cc, c89, and mpcc. All but c89 have one or more 
subinvocations with different defaults. 

xlc
xlc invokes the compiler for C with an ansi language level. This is the basic invocation that IBM recommends. 

These are the two most useful subinvocations of xlc: 

xlc_r 
This invokes the thread safe version of xlc. It should be used when any kind of multi-threaded code is being built. 
This is equivalent to invoking the compiler as xlc -D_THREAD_SAFE and the loader as 
xlc -L/usr/lib/threads -Lusr/lib/dce -lc_r -lpthreads. 

xlc128 
This is equivalent to invoking the compiler as xlc -qldbl128 -lC128. It increases the size of long double data types 
from 64 to 128 bits. 

cc
cc invokes the compiler for C with an extended language level. This is for source files with legacy C code 
that IBM refers to as "RT compiler extensions". This include older pre-ansi features such as those in the 
Kernighan and Ritchie's "The C Programming Language". 

The two most useful subinvocations are cc_r which is the cc equivalent of xlc_r and cc128 which is the cc equivalent 
of xlc128. 

c89
c89 should be used when strict conformance to the C ANSI ANSI standard (ISO/IEC 9899:1990) is desired. 
There are no subinvocations associated with this compiler invocation. 

mpcc
mpcc is a shell script that compiles C programs with the cc compiler while linking in the Partition Manager, 
Message Passing Interface (MPI), and/or Message Passing Library (MPL). Flags are passed by mpcc to the xlc command, 
so any of the xlc options can be used with mpcc as well. When mpcc is used to link a program the Partition Manager 
and message passing interface are automatically linked in. The script creates an executable that dynamically binds 
with the message passing libraries. 

There is one subinvocation with mpcc, mpcc_r which is the mpcc equivalent of cc_r. This invocation also links 
in the Partition Manager, the threaded implementation of Message Passing Interface (MPI), and Low-level 
Applications Programming Interface (LAPI). 

ANSI compliance can be achieved by compiling with the option -qlanglvl=ansi. 

Compiler summary
This table summarizes the features of several different C compiler invocations: 

Compiler Name Functionality 
C defaults DM Parallel SM Parallel 
xlc ansi No No 
xlc_r ansi No Yes 
xlc128 ansi No No 
cc extended No No 
cc_r extended No Yes 
cc128 extended No No 
c89 strict No No 
mpcc extended* Yes No 
mpcc_r extended* Yes Yes 
* ANSI compliance can be achieved by compiling with the option -qlanglvl=ansi. 
In the table above, C defaults indicates the default C standards behavior of the compiler. 

DM Parallel refers to distributed-memory parallelism through the MPI library. 

SM Parallel refers to shared-memory parallelism, available through OpenMP, IBM tasking directives, 
automatic parallelization by the compiler, or the pthreads API. 

C++ Compiler Modes
There are two basic compiler invocations for C++ compiles: xlC and mpCC. If a program consists of source code modules 
in different program languages, it must be linked with a form of one of these invocations in order to use the 
correct C++ run time libraries. 

All of the C++ invocations will compile source files with a .c suffix as ansi C source files unless the 
-+ option to the C++ compiler is specified. Any of the C compiler invocations will also compile a file with 
the appropriate suffix as a C++ file. 

xlC
Among the subinvocations of xlC are: 

xlC_r: the xlC equivalent of xlc_r 
xlC128: the xlC equivalent of xlc128 
xlC128_r: this combines the features of the xlC_r and xlC128 subinvocations. 
mpCC
mpCC is a shell script that compiles C++ programs with the xlC compiler while linking in the Partition Manager, 
Message Passing Interface (MPI), and/or Message Passing Library (MPL). Flags are passed by mpCC to the xlC command, 
so any of the xlC options can be used on the mpCC shell script. When mpCC is used to link a program the 
Partition Manager and message passing interface are automatically linked in. The script creates an executable 
that dynamically binds with the message passing libraries. 

By default, the mpCC compiler uses the regular C program MPI bindings. In order to use the full C++ MPI bindings 
use the compiler flag -cpp 

There is one mpCC subinvocation, mpCC_r. This invokes a shell script that compiles C++ programs while linking 
in the Partition Manager, the threaded implementation of Message Passing Interface (MPI), and Low-level Applications 
Programming Interface (LAPI). 

Source Files and Preprocessing
All of the C and C++ compiler invocations process assembler source files and object files as well as preprocessing 
and compiling C and C++ source files. Unless the -c option is specified, they also call the linkage editor to produce 
a single executable object file. 

All invocations of the C or C++ compilers follow these suffix conventions for input files: 

.C, .cc, .cpp, or .cxx - C++ source file. 
.c - C source file 
.i - preprocessed C source file 
.so - shared object file 
.o - object file for ld command 
.s - assembler source file 
By default, the preprocessor is run on both C and C++ source files. 

Default Datatype Sizes
These are the default sizes of the standard C/C++ datatypes. 

Type Length (bytes) 
bool1 1 
char 1 
wchar_t1 2 
short 2 
int 4 
long 4 /8 2 
float 4 
double 8 
long double 8 /163 
1C++ only.
264 bit mode -q64.
3 128 suffix compiling mode. 
Distributed-Memory Parallelism

Invoking any of the compilers starting with "mp" enables the program for running across several nodes. 
Of course, you are responsible for using a library such as MPI to arrange communication and coordination 
in such a program. Any of the mp compilers sets the include path and library paths to pick up the MPI library. 

To use the MPI with C++ or to use the MPI I/O subroutines, the thread-safe version of the compiler must be used. 

% mpcc_r a.c
% mpCC_r -cpp a.C

The example, hello.c, demonstrates the use of MPI from a C code. 

The example, hello.C, demonstrates the use of MPI from a C++ code. 

Shared-Memory Parallelism
The IBM C and C++ compilers support a variety of shared-memory parallelism. 

OpenMP
OpenMP directives are fully supported by the IBM C and C++ compilers when one of the invocations with _r suffix 
is used. See Using OpenMP on seaborg for details. 

Automatic Parallelization
The IBM C compiler will attempt to automatically parallelize simple loop constructs. Use the option "-qsmp" 
with one of the _r invocations: 

% xlc_r -qsmp a.c

64 Bit Addressing
Both the IBM C and C++ compilers support 64 bit addressing through the -q64 option. The default mode can be set 
through the environment variable OBJECT_MODE on Bassi, OBJECT_MODE=64 has been set to make 64-bit mode the default. 
On Seaborg the default is 32-bit addressing mode. In 64-bit mode all pointers are 64 bits in length and length 
of long datatypes increase from 32 to 64 bits. It does not change the default size of any other datatype. 

The following points should be kept in mind if 64-bit is used: 

If you have some object files that were compiled in 32-bit mode and others compiled in 64-bit mode the objects 
will not bind. You must recompile to ensure that all objects are in the same mode. 
Your link options must reflect the type of objects you are linking. If you compiled 64-bit objects, you must 
also link these objects with the -q64 option. 

Optimization
The default for all IBM compilers is for there to be no optimization. The NERSC/IBM recommended optimization options 
for both C and C++ compiles are -O3 -qstrict -qarch=auto -qtune=auto. 


55.2 Installing Tuxedo 8.1 or 9:
================================

Before installing make sure you understand the BEA and Tuxedo home dirs, and give appropriate
ownership/permissions to a dedicated BEA account.

GUI mode or console mode are available.

GUI:
====

Go to the directory where you downloaded the installer and invoke the installation procedure by entering 
the following command: 
prompt> sh filename.bin

where filename is the name of the BEA Tuxedo installer file.

Select the install set that you want installed on your system. The following seven choices are available:

Full Install (the default)-all Tuxedo server and client software components
Server Install-Tuxedo server software components only
Full Client Install-Tuxedo client software components only
Jolt Client Install-Jolt client software components only
ATMI (/WS) Client Install-Tuxedo ATMI client software components only
CORBA Client Install-Tuxedo CORBA client software components only
Custom Install-select specific Tuxedo server and client software components. The following table entry provides 
a summary of options for the Custom Install.

For a detailed list of software components for each install set, see Install Sets.

Select (add) or deselect (clear) one or more software components from the selected install set, 
or choose one of the other five install sets or Custom Set from the drop-down list menu and customize 
its software components. For a description of the JRLY component, see Jolt Internet Relay.

Observe the following software component mappings:

Server-contains ATMI server software; CORBA C++ server software; BEA Jolt server software; BEA SNMP Agent software, 
and BEA Tuxedo Administration Console software
ATMI Client-contains BEA ATMI Workstation (/WS) client software
CORBA Client-contains BEA CORBA C++ client software (C++ client ORB) including environmental objects
Jolt JRLY-contains BEA Jolt Relay software
Jolt Client-contains BEA Jolt client software

After selecting or deselecting one or more software components from the selected install set, 
click Next to continue with the installation. The appropriate encryption software for LLE and/or SSL 
is automatically included.

Specify the BEA Home directory that will serve as the central support directory for all BEA products 
installed on the target system. If you already have a BEA Home directory on your system, you can select 
that directory (recommended) or create a new BEA Home directory. If you choose to create a new directory, 
the BEA Tuxedo installer program automatically creates the directory for you. For details about the 
BEA Home directory, see BEA Home Directory.

Choose a BEA Home directory and then click Next to continue with the installation.

Console mode:
=============

Console-mode installation is the text-based method of executing the BEA Installation program. 
It can be run only on UNIX systems and is intended for UNIX systems with non-graphics consoles. 
Console-mode installation offers the same capabilities as graphics-based installation 

Go to the directory where you downloaded the installer and invoke the installation procedure 
by entering the following command: 

prompt> sh filename.bin -i console 

where filename is the name of the BEA Tuxedo installer file.

The tekstbased installation resembles from then on, the GUI installation.

Tuxedo 8.1 binaries and what can you do with them:
==================================================

/spl/SPLDEV1/product/tuxedo8.1/bin:>ls
AUTHSVR              TMNTSFWD_T           dmadmin              snmp_integrator.pbk  tpaclcvt
AUTHSVR.pbk          TMQFORWARD           dmadmin.pbk          snmp_version         tpacldel
BBL                  TMQUEUE              dmloadcf             snmp_version.pbk     tpaclmod
BBL.pbk              TMS                  dmloadcf.pbk         snmpget              tpaddusr
BRIDGE               TMS.pbk              dmunloadcf           snmpget.pbk          tpdelusr
BRIDGE.pbk           TMSYSEVT             dmunloadcf.pbk       snmpgetnext          tpgrpadd
BSBRIDGE             TMSYSEVT.pbk         epifreg              snmpgetnext.pbk      tpgrpdel
BSBRIDGE.pbk         TMS_D                epifregedt           snmptest             tpgrpmod
CBLDCLNT             TMS_QM               epifunreg            snmptest.pbk         tpmigldap
CBLDSRVR             TMS_QM.pbk           esqlc                snmptrap             tpmodusr
CBLVIEWC             TMS_SQL              evt2trapd            snmptrap.pbk         tpusradd
CBLVIEWC32           TMS_SQL.pbk          evt2trapd.pbk        snmptrapd            tpusrdel
DBBL                 TMUSREVT             genicf               snmptrapd.pbk        tpusrmod
DMADM                TMUSREVT.pbk         idl                  snmpwalk             tux_snmpd
DMADM.pbk            WSH                  idl2ir               snmpwalk.pbk         tux_snmpd.pbk
GWADM                WSH.pbk              idltojava            sql                  tuxadm
GWTDOMAIN            WSL                  idltojava.pbk        stop_agent           tuxadm.pbk
GWTDOMAIN.pbk        bldc_dce             ir2idl               stop_agent.pbk       tuxwsvr
GWTOPEND             blds_dce             irdel                tidl                 txrpt
ISH                  build_dgw            jrly                 tlisten              ud
ISH.pbk              buildclient          jrly.pbk             tlisten.pbk          ud32
ISL                  buildish             mkfldhdr             tlistpwd             uuidgen
ISL.pbk              buildobjclient       mkfldhdr32           tmadmin              viewc
JRAD                 buildobjserver       ntsadmin             tmadmin.pbk          viewc.pbk
JRAD.pbk             buildserver          qmadmin              tmboot               viewc32
JREPSVR              buildtms             reinit_agent         tmboot.pbk           viewc32.pbk
JSH                  buildwsh             reinit_agent.pbk     tmconfig             viewdis
JSH.pbk              cleanupsrv           restartsrv           tmipcrm              viewdis32
JSL                  cleanupsrv.pbk       restartsrv.pbk       tmipcrm.pbk          wgated
LAUTHSVR             cns                  rex                  tmloadcf             wgated.pbk
TMFFNAME             cnsbind              rmskill              tmloadcf.pbk         wlisten
TMFFNAME.pbk         cnsls                sbbl                 tmshutdown           wlisten.pbk
TMIFRSVR             cnsunbind            show_agent           tmshutdown.pbk       wtmconfig
TMNTS                cobcc                show_agent.pbk       tmunloadcf           wud
TMNTSFWD_P           cobcc.pbk            snmp_integrator      tpacladd             wud32


txrpt:
------

Name
txrpt-BEA TUXEDO system server/service report program 

Synopsis
txrpt [-t]  [-n names]  [-d mm/dd]  [-s time]  [-e time]
Description
txrpt analyzes the standard error output of a BEA TUXEDO system server to provide a summary 
of service processing time within the server. The report shows the number of times dispatched 
and average elapsed time in seconds of each service in the period covered. txrpt takes its input 
from the standard input or from a standard error file redirected as input. Standard error files 
are created by servers invoked with the -r option from the servopts(5) selection; the file can be 
named by specifying it with the -e servopts option. Multiple files can be concatenated into a single 
input stream for txrpt. Options to txrpt have the following meaning: 


-t 
order the output report by total time usage of the services, with those consuming the most total time printed first. 
If not specified, the report is ordered by total number of invocations of a service. 

-n names 
restrict the report to those services specified by names. names is a comma-separated list of service names. 

-d mm/dd 
limit the report to service requests on the month, mm, and day, dd, specified. The default is the current day. 

-s time 
restrict the report to invocations starting after the time given by the time argument. 
The format for time is hr[:min[:sec]]. 

-e time 
restrict the report to invocations that finished before the specified time. The format for time is the 
same as the -s flag. 
The report produced by txrpt covers only a single day. If the input file contains records from more than one day, 
the -d option controls the day reported on.

tuxadm:
-------

Name

tuxadm-BEA Tuxedo Administration Console CGI gateway.

Synopsis

http://cgi-bin/tuxadm[TUXDIR=tuxedo_directory | INIFILE=initialization_file][other_parameters]
Description

tuxadm is a common gateway interface (CGI) process used to initialize the Administration Console from a browser. 
As shown in the "Synopsis" section, this program can be used only as a location, or URL from a Web browser; 
normally it is not executed from a standard command line prompt. Like other CGI programs, 
tuxadm uses the QUERY_STRING environment variable to parse its argument list.

tuxadm parses its arguments and finds a Administration Console initialization file. If the TUXDIR parameter 
is present, the initialization file is taken to be $TUXDIR/udataobj/webgui/webgui.ini by default. 
If the INIFILE option is present, then the value of that parameter is taken to be the full path to the 
initialization file. Other parameters may also be present. 

Any additional parameters can be used to override values in the initialization file. See the wlisten 
reference page for a complete list of initialization file parameters. The ENCRYPTBITS parameter may not be 
overridden by the tuxadm process unless the override is consistent with the values allowed in the actual 
initialization file.

The normal action of tuxadm is to generate, to its standard output, HTML commands that build a Web page 
that launches the Administration Console applet. The general format of the Web page is controlled by 
the TEMPLATE parameter of the initialization file, which contains arbitrary HTML commands, 
with the special string %APPLET% on a line by itself in the place where the Administration Console applet 
should appear. Through the use of other parameters from the initialization file 
(such as CODEBASE, WIDTH, HEIGHT, and so on) a correct APPLET tag is generated that contains 
all the parameters necessary to create an instance of the Administration Console.

Errors

tuxadm generates HTML code that contains an error message if a failure occurs. Because of the way CGI 
programs operate, there is no reason to return an error code of any kind from tuxadm.

See Also

tuxwsvr(1), wlisten(1) 

MSTMACH:
--------

Is the machine name, and usually corresponds to the LMID, the logical machine ID.
There should be an entry of the hostname in /etc/hosts.


tmboot:
-------

tmboot(1)

Name

tmboot-Brings up a BEA Tuxedo configuration.

Synopsis

tmboot [-l lmid] [-g grpname] [-i srvid] [-s aout] [-o sequence] [-S] [-A] [-b] [-B lmid] [-T grpname] [-e command] 
       [-w] [-y] [-g] [-n] [-c] [-M] [-d1]

Description

tmboot brings up a BEA Tuxedo application in whole or in part, depending on the options specified. tmboot can be invoked 
only by the administrator of the bulletin board (as indicated by the UID parameter in the configuration file) 
or by root. The tmboot command can be invoked only on the machine identified as MASTER in the RESOURCES section 
of the configuration file, or the backup acting as the MASTER, that is, with the DBBL already running 
(via the master command in tmadmin(1)). Except, if the -b option is used; in that case, the system can be booted 
from the backup machine without it having been designated as the MASTER.

With no options, tmboot executes all administrative processes and all servers listed in the SERVERS section 
of the configuration file named by the TUXCONFIG and TUXOFFSET environment variables. If the MODEL is MP, 
a DBBL administrative server is started on the machine indicated by the MASTER parameter in the RESOURCES section. 
An administrative server (BBL) is started on every machine listed in the MACHINES section. For each group 
in the GROUPS section, TMS servers are started based on the TMSNAME and TMSCOUNT parameters for each entry. 
All administrative servers are started followed by servers in the SERVERS sections. Any TMS or gateway servers 
for a group are booted before the first application server in the group is booted. The TUXCONFIG file is propagated 
to remote machines as necessary. tmboot normally waits for a booted process to complete its initialization 
(that is, tpsvrinit()) before booting the next process.

Booting a gateway server implies that the gateway advertises its administrative service, and also advertises 
the application services representing the foreign services based on the CLOPT parameter for the gateway. 
If the instantiation has the concept of foreign servers, these servers are booted by the gateway at this time.

Booting an LMID is equivalent to booting all groups on that LMID.

Application servers are booted in the order specified by the SEQUENCE parameter, or in the order of server entries 
in the configuration file (see the description in UBBCONFIG(5)). If two or more servers in the SERVERS section 
of the configuration file have the same SEQUENCE parameter, then tmboot may boot these servers in parallel and 
will not continue until they all complete initialization. Each entry in the SERVERS section can have a 
MIN and MAX parameter. tmboot boots MIN application servers (the default is 1 if MIN is not specified for 
the server entry) unless the -i option is specified; using the -i option causes individual servers to be 
booted up to MAX occurrences.

If a server cannot be started, a diagnostic is written on the central event log (and to the standard output, 
unless -q is specified), and tmboot continues-except that if the failing process is a BBL, servers that depend 
on that BBL are silently ignored. If the failing process is a DBBL, tmboot ignores the rest of the 
configuration file. If a server is configured with an alternate LMID and fails to start on its primary machine, 
tmboot automatically attempts to start the server on the alternate machine and, if successful, sends a message 
to the DBBL to update the server group section of TUXCONFIG.

For servers in the SERVERS section, only CLOPT, SEQUENCE, SRVGRP, and SRVID are used by tmboot. Collectively, 
these are known as the server's boot parameters. Once the server has been booted, it reads the configuration file 
to find its run-time parameters. (See UBBCONFIG(5) for a description of all parameters.)

All administrative and application servers are booted with APPDIR as their current working directory. 
The value of APPDIR is specified in the configuration file in the MACHINES section for the machine on which 
the server is being booted.

The search path for the server executables is APPDIR, followed by TUXDIR/bin, followed by /bin and /usr/bin, 
followed by any PATH specified in the ENVFILE for the MACHINE. The search path is used only if an absolute pathname 
is not specified for the server. Values placed in the server's ENVFILE are not used for the search path.

When a server is booted, the variables TUXDIR, TUXCONFIG, TUXOFFSET, and APPDIR, with values specified in the 
configuration file for that machine, are placed in the environment. The environment variable LD_LIBRARY_PATH 
is also placed in the environment of all servers. Its value defaults to $APPDIR:$TUXDIR/lib:/lib:/usr/lib:lib> 
where <lib> is the value of the first LD_LIBRARY_PATH= line appearing in the machine ENVFILE. See UBBCONFIG(5) 
for a description of the syntax and use of the ENVFILE. Some Unix systems require different environment variables. 
For HP-UX systems, use the SHLIB_PATH environment variable. FOR AIX systems, use the LIBPATH environment variable.

The ULOGPFX for the server is also set up at boot time based on the parameter for the machine in the 
configuration file. If not specified, it defaults to $APPDIR/ULOG.

All of these operations are performed before the application initialization function, tpsvrinit(), is called.

Many of the command line options of tmboot serve to limit the way in which the system is booted and can be used 
to boot a partial system. The following options are supported.


-l lmid 

For each group whose associated LMID parameter is lmid, all TMS and gateway servers associated with the group 
are booted and all servers in the SERVERS section associated with those groups are executed.

-g grpname 


All TMS and gateway servers for the group whose SRVGRP parameter is grpname are started, followed by all servers 
in the SERVERS section associated with that group. TMS servers are started based on the TMSNAME and TMSCOUNT 
parameters for the group entry.

-i srvid 

All servers in the SERVERS section whose SRVID parameter is srvid are executed.

-s aout 

All servers in the SERVERS section with name aout are executed. This option can also be used to boot TMS and 
gateway servers; normally this option is used in this way in conjunction with the -g option.

-o sequence

All servers in the SERVERS section with SEQUENCE parameter sequence are executed.

-S

All servers in the SERVERS section are executed.

-A

All administrative servers for machines in the MACHINES section are executed. Use this option to guarantee 
that the DBBL and all BBL and BRIDGE processes are brought up in the correct order. (See also the description 
of the -M option.)

-b

Boot the system from the BACKUP machine (without making this machine the MASTER).

-B lmid 

A BBL is started on a processor with logical name lmid.

-M

This option starts administrative servers on the master machine. If the MODEL is MP, a DBBL administrative server 
is started on the machine indicated by the MASTER parameter in the RESOURCES section. A BBL is started on the 
MASTER machine, and a BRIDGE is started if the LAN option and a NETWORK entry are specified in the configuration file.

-d1

Causes command line options to be printed on the standard output. Useful when preparing to use sdb to debug 
application services.

-T grpname 

All TMS servers for the group whose SRVGRP parameter is grpname are started (based on the TMSNAME and TMSCOUNT 
parameters associated with the group entry). This option is the same as booting based on the TMS server name 
(-s option) and the group name (-g).

-e command 

Causes command to be executed if any process fails to boot successfully. command can be any program, script, 
or sequence of commands understood by the command interpreter specified in the SHELL environment variable. 
This allows an opportunity to bail out of the boot procedure. If command contains white space, the entire 
string must be enclosed in quotes. This command is executed on the machine on which tmboot is being run, 
not on the machine on which the server is being booted.

Note: If you choose to do redirection or piping on a Windows 2000 system, you must use one of the following methods:


Do redirection or piping from within a command file or script. 

To do redirection from within the queue manager administration program, precede the command with cmd. For example:
cmd /c ipconfig > out.txt 

If you choose to create a binary executable, you must allocate a console within the binary executable using 
the Windows AllocConsole() API function 

-w 

Informs tmboot to boot another server without waiting for servers to complete initialization. This option 
should be used with caution. BBLs depend on the presence of a valid DBBL; ordinary servers require a running BBL 
on the processor on which they are placed. These conditions cannot be guaranteed if servers are not started 
in a synchronized manner. This option overrides the waiting that is normally done when servers have sequence numbers.

-y 

Assumes a yes answer to a prompt that asks if all administrative and server processes should be booted. 
(The prompt appears only when the command is entered with none of the limiting options.)

-q 

Suppresses the printing of the execution sequence on the standard output. It implies -y.

-n

The execution sequence is printed, but not performed.

-c

Minimum IPC resources needed for this configuration are printed.

When the -l, -g, -i, -o, and -s options are used in combination, only servers that satisfy all qualifications 
specified are booted. The -l, -g, -s, and -T options cause TMS servers to be booted; the -l, -g, and -s options 
cause gateway servers to be booted; the -l, -g, -i, -o, -s, and -S options apply to application servers. 
Options that boot application servers fail if a BBL is not available on the machine.The -A, -M, and -B options 
apply only to administrative processes.

The standard input, standard output, and standard error file descriptors are closed for all booted servers.

Interoperability

tmboot must run on the master node, which in an interoperating application must be the highest release available. 
tmboot detects and reports configuration file conditions that would lead to the booting of administrative servers 
such as Workstation listeners on sites that cannot support them.

Portability 

tmboot is supported on any platform on which the BEA Tuxedo server environment is supported.

Environment Variables

During the installation process, an administrative password file is created. When necessary, the BEA Tuxedo system 
searches for this file in the following directories (in the order shown): APPDIR/.adm/tlisten.pw and 
TUXDIR/udataobj/tlisten.pw. To ensure that your password file will be found, make sure you have set the 
APPDIR and/or TUXDIR environment variables.

Link-Level Encryption

If the link-level encryption feature is in operation between tmboot and tlisten, link-level encryption will be 
negotiated and activated first to protect the process through which messages are authenticated.

Diagnostics

If TUXCONFIG is set to a non-existent file, two fatal error messages are displayed: 

error processing configuration file 

configuration file not found 
If tmboot fails to boot a server, it exits with exit code 1 and the user log should be examined for further details. 
Otherwise tmboot exits with exit code 0.

If tmboot is run on an inactive non-master node, a fatal error message is displayed:

tmboot cannot run on a non-master node.
If tmboot is run on an active node that is not the acting master node, the following fatal error message is displayed:

tmboot cannot run on a non acting-master node in an active application.
If the same IPCKEY is used in more than one TUXCONFIG file, tmboot fails with the following message: 

Configuration file parameter has been changed since last tmboot
If there are multiple node names in the MACHINES section in a non-LAN configuration, the following fatal error 
message is displayed:

Multiple nodes not allowed in MACHINES for non-LAN application.
If tlisten is not running on the MASTER machine in a LAN application, a warning message is printed. 
In this case, tmadmin(1) cannot run in administrator mode on remote machines; it is limited to read-only operations. 
This also means that the backup site cannot reboot the master site after failure.

Examples

To start only those servers located on the machines logically named CS0 and CS1, enter the following command:

tmboot -l CS0 -l CS1
To start only those servers named CREDEB that belong to the group called DBG1, enter the following command:

tmboot -g DBG1 -s CREDEB1

To boot a BBL on the machine logically named PE8, as well as all those servers with a location specified as PE8, 
enter the following command.

tmboot -B PE8 -l PE8

To view minimum IPC resources needed for the configuration, enter the following command.

tmboot -c

The minimum IPC requirements can be compared to the parameters set for your machine. See the system administration 
documentation for your machine for information about how to change these parameters. If the -y option is used, 
the display will differ slightly from the previous example.

Notices

The tmboot command ignores the hangup signal (SIGHUP). If a signal is detected during boot, the process continues.

Minimum IPC resources displayed with the -c option apply only to the configuration described in the configuration 
file specified; IPC resources required for a resource manager or for other BEA Tuxedo configurations are not 
considered in the calculation.

See Also

tmadmin(1), tmloadcf(1), tmshutdown(1), UBBCONFIG(5) 

Administering BEA Tuxedo Applications at Run Time


Notes in Dutch on Tuxedo:
=========================


Note 1 CDX or ETM application (middleware component, based on Tuxedo):
----------------------------------------------------------------------

Recompile van de tuxconfig.bin file, na changes in de ubb file.

ETM op AIX wordt geinstalleerd in het directory "/prj/spl/<naam_van_de_instance>", zoals
bijvoorbeeld "/prj/spl/ivocf01" of bijvoorbeeld "/prj/spl/SPLS3".

Het "gedrag" van Tuxedo wordt bijna geheel bepaald door de configuratie file
"/prj/spl/<ETM_Instance_Name>/etc/tuxconfig.bin".

De source van tuxconfig.bin, is de ascii file "/prj/spl/<ETM_Instance_Name>/etc/ubb".
Dit houdt in, dat als men een wijziging pleegt in de ubb file (bijv. het aantal servers verhogen),
dan moet er een nieuwe tuxconfig.bin file worden gegenereerd.

Hiervoor heeft SPL in het directory "/prj/spl/<ETM_Instance_Name>/bin" een shell script gemaakt,
met de naam "gentuxedo.sh".

Het script kan verschillende flags worden meegegeven. Zie opmerking 2 hieronder.

1. Logon als de ETM software owner (bijv. ccbsys of etmsys)
2. Check je environment (staan al je environment vars goed?)
3. Attach jezelf aan de juiste environment. Gebruikelijk is, dat er een "alias" bestaat
   met de naam die gelijk is aan de Instance Name. Je hoeft dan alleen maar de alias 
   vanaf de promt in te voeren.
   Voorbeeld: Stel de instance naam is "IVOCF01". Direkt na het inloggen
   als de ETM owner (of via "su - ownerccount"), kun je vanaf de prompt de alias
   aanroepen met IVOCF01, en je wordt geattached aan de IVOCF01 instance.

4. Change directory naar "/prj/spl/<ETM_Instance_Name>/bin"

5. Om nu, na een wijziging in de ubb file, de tuxconfig.bin file opnieuw te compileren,
   gebruik dan het commando

   ./gentuxedo.sh -m

   Gebruik "./gentuxedo.sh -u" om de ubb en de bin file vanaf de template te genereren.

Opmerkingen:

1. Hetzelfde kan bereikt worden met het tuxedo utility "tmloadcf"

   tmloadcf -y $SPLEBASE/etc/ubb

2. De flags die men aan gentuxedo.sh kan meegeven:

#%       USAGE:   gentuxedo.sh
#%       USAGE:    -h = HELP
#%       USAGE:    -r = Recreate the default tuxedo server
#%       USAGE:       This will recreate all of the default service lists
#%       USAGE:       ( see option -n ) as well as create UBB files
#%       USAGE:    -u = Create the UBB file from template only
#%       USAGE:    -m = use tmloadcf to recreate ubb binary from $SPLEBASE/etc/ubb
#%       USAGE:        Once modifications have been made to the $SPLEBASE/etc/ubb
#%       USAGE:        file it is necessary to compile those changes. use the -m
#%       USAGE:        option to do this.
#%       USAGE:    -s = Create the Servers
#%       USAGE:       This will create only the Servers as defined in the -n option.


Note 2:
-------

Connecten, of attachen, naar een AIX ETM instance

Op een AIX lpar (logical partition, ofwel een virtual machine, ofwel een volledig zelfstandige AIX machine), 
draaien 1 of meer ETM instance(s). Een ETM instance, is middleware, bestaande uit tuxedo services, een 
Cobol Application server, en Cobol business objects.

De ETM user (of software owner) kan zich "verbinden" met een dergelijke Instance, bijvoorbeeld om 
administratieve handelingen uit te voeren zoals het starten of stoppen van de Instance.

Op je te verbinden (of attachen) naar een bepaalde ETM instance, kun je het "splenviron.sh" script gebruiken 
welke is gelegen in "/prj/spl/<Instance_name>/bin" directory. Mogelijk is het pad toch iets anders van vorm, 
zoals bijvoorbeeld "/prj/etm_1520/IVOOCF/bin" of zoiets dergelijks. Het belangrijkste is om te weten dat 
binnen de directorytree die bij een instance hoort, dat er een "bin" directory bestaat met een aantal .sh 
shell scripts, waaronder dus ook het "splenviron.sh" script. 

Het .profile van de etm user, dient echter zodanig te zijn ingesteld, dat reeds een aantal environment variabelen 
"goed" zijn neergezet, en correct verwijzen naar de juiste Cobol, Tuxedo en DB2 locaties.
Er vanuit gaande dat het .profile goed is, kan de etm user zich verbinden met een Instance via:

splenviron.sh -e <Instance_Name>


Voorbeeld:

Stel op een AIX machine (of lpar) bestaat de ETM Instance "SPLDEV1" welke geinstalleerd is in het directory 
"/spl/SPLDEV1".

Men kan zich dan attachen via het command:

/spl/SPLDEV1/bin $ ./splenviron.sh -e SPLDEV1

Version ................ (SPLVERSION) : V1.5.20.0
Database Type ............... (SPLDB) : oracle
ORACLE_SID ............. (ORACLE_SID) : SPLDEV1
NLS_LANG ............... (NLS_LANG)   : AMERICAN_AMERICA.WE8ISO8859P15
App Dir - Logs ............. (SPLAPP) : /spl/splapp/SPLDEV1
Environment Name ....... (SPLENVIRON) : SPLDEV1
Environment Code Directory (SPLEBASE) : /spl/SPLDEV1
Build Directory .......... (SPLBUILD) : /spl/SPLDEV1/cobol/build
Runtime Directory .......... (SPLRUN) : /spl/SPLDEV1/runtime/oracle
Cobol Copy Path ......... (SPLCOBCPY) : /spl/SPLDEV1/cobol/source/cm:/spl/SPLDEV1/tuxedo/templates:
                                        /spl/SPLDEV1/tuxedo/genSources:/spl/SPLDEV1/cobol/source:
                                        /spl/SPLDEV1/product/tuxedo8.1/cobinclude

De belangrijkste functie van "splenviron.sh" is hier dan, dat een aantal variablen correct worden neergezet 
zodat alle kenmerken van deze applicatie (zoals build directory e.d.) goed staan.

Behalve het gebruik van splenviron.sh, is het heel goed mogelijk dat in het .profile van de etm user, reeds een 
aantal "aliases" zijn gedefinieerd.
Als er inderdaad aliases zijn gedefinieerd, is het attachen naar een Instance heel makkelijk. Men dient dan alleen 
nog maar de alias vanaf de unix prompt in te voeren.

Voorbeeld:

Stel dat in het .profile van de ETM user het volgende is opgenomen:

alias SPLDEV1='/spl/SPLDEV1/bin/splenviron.sh -e SPLDEV1'

Dan kan de ETM user zich direct aan SPLDEV1 attachen via het command: SPLDEV1

Dus om een aantal ETM instances te stoppen en weer te starten (bijv. in een backupscript):

#STOPPEN ETM instances:
su - cissys -c '/spl/SPLDEV1/bin/splenviron.sh -e SPLDEV1 -c "spl.sh -t stop"'
sleep 2
su - cissys -c '/spl/SPLDEV2/bin/splenviron.sh -e SPLDEV2 -c "spl.sh -t stop"'
sleep 2
su - cissys -c '/spl/SPLCONF/bin/splenviron.sh -e SPLCONF -c "spl.sh -t stop"'
sleep 2

#STARTEN ETM instances:
su - cissys -c '/spl/SPLDEV1/bin/splenviron.sh -e SPLDEV1 -c "spl.sh -t start"'
sleep 2
su - cissys -c '/spl/SPLDEV2/bin/splenviron.sh -e SPLDEV2 -c "spl.sh -t start"'
sleep 2
su - cissys -c '/spl/SPLCONF/bin/splenviron.sh -e SPLCONF -c "spl.sh -t start"'
sleep 2


Note 3:
-------

Hoe te (her-)compileren van de ETM Cobol objecten

Je kunt hiervoor het "co.sh" of het customized "co_BD.sh" script gebruiken.
Het co_BD.sh script, is een copy van het co.sh script in het /prj/spl/<INSTANCE_NAME>/bin directory, 
en dit script prompt de gebruiker voor de DB2USER / DB2PASSWORD credentials.

Syntax:

co_BD.sh -p <CobolSourceName>.cbl

Hoe te gebruiken:
1.	Logon als de juiste etmuser op AIX
2.	Run nu de alias van de juiste instance om de juiste environment in te stellen, en om je aan de 
        juiste-ETM instance te verbinden.
3.	Zorg ervoor dat je de juiste DB2User and DB2Password kent.

Nu kun je cobol objecten compileren, als in het volgende voorbeeld:

Voorbeeld op "S3" partition op AIX:

Na AIX logon als "ccbsys" (de etm instance owner op de S3 partition):

Type SPLS3 or SPLUI to set the environment
/home/ccbsys >SPLS3

060918.13:37:40 <info> DB2DIR Environment set to /prj/db2/admin/iinvu02/sqllib/
Version ................ (SPLVERSION) : V1.5.15.0
Database Type ............... (SPLDB) : db2
App Dir - Logs ............. (SPLAPP) : /prj/spl/splapp/SPLS3
Environment Name ....... (SPLENVIRON) : SPLS3
Environment Code Directory (SPLEBASE) : /prj/spl/SPLS3
Build Directory .......... (SPLBUILD) : /prj/spl/SPLS3/cobol/build
Runtime Directory .......... (SPLRUN) : /prj/spl/SPLS3/runtime/db2
Cobol Copy Path ......... (SPLCOBCPY) : /prj/spl/SPLS3/cobol/source/cm:/prj/spl/SPLS3/tuxedo/templates:/prj/spl/SPLS3/tuxedo/genSources:/prj/spl/SPLS3/cobol/source:/prj/spl/SPLS3/product/tuxedo8.1/cobinclude

/prj/spl/SPLS3 >cd /prj/spl/SPLS3/cobol/source/cm


/prj/spl/SPLS3/cobol/source/cm >co_BD.sh -p CMPCSU2B.cbl

060918.13:37:57 <info> co_BD.sh : Compile Started Mon Sep 18 13:37:57 CDT 2006
060918.13:37:57 <info> Build Directory = /prj/spl/SPLS3/cobol/build
060918.13:37:57 <info> Compiling for db2 database
060918.13:37:57 <info> Compilation requested by ccbsys for version V1.5.15.0
060918.13:37:57 <info> Environment SPLS3
060918.13:37:57 <info> Using cobol directory /opt/microfocus/cobol
060918.13:37:57 <info> DB2DIR Environment set to /prj/db2/admin/iinvu02/sqllib/

Please, type DBUSER userid to connect to database : 
Please, type in a password of c userid to connect to database : 

060918.13:38:27 <info> DB2 Compile : Local DB ALIAS = IVOOIS01
060918.13:38:27 <info> DB2 Compile : collection     = IVOOIS

060918.13:38:28 <info> Compiling one object only - CMPCSU2B.cbl
060918.13:38:29 <info> Program : CMPCSU2B ; Expand Return Code.. : 0
060918.13:38:29 <info> Program : CMPCSU2B ; Prepare Return Code. : 0
060918.13:38:30 <info> Program : CMPCSU2B ; Compile Return Code. : 0
060918.13:38:30 <info> FINISHED COMPILATION

Note 4:
-------

Hoe test je een DB2 connectie vanaf een AIX partition.
Indien op een AIX partition, "DB2 connect ESE" correct geinstalleerd is, wil je misschien de verbinding vanaf AIX, 
naar DB2 op Z testen. Dat kan zoals in het volgende voorbeeld:

1.	login als de juiste ETM instance owner (zoals bijv. iinvu02)
2.	type db2 <enter> 

De DB2 client utility wordt gestart en de bijbehorende prompt verschijnt:

db2 =>

Voer nu in:

db2=> Connect to <alias_name> user <user>

Vervolgens wordt om het password gevraagd, en hierbij is dan ook getest of de verbinding werkt.

Voorbeeld:

db2=> connect to IVOOCF01 user $SCCB60 using sa876dfy

Als extra test, kun je ook proberen om de huidige datum uit een DB2 dummy table op te vragen, via het commando:

db2=> select current date from sysibm.sysdummy1

Opmerking:

De ETM instance owner dient wel in zijn .profile een aantal DB2 environment variables te hebben staan, 
zodat DB2 correct werkt, zoals:

export DB2_HOME=/prj/db2/admin/<db2_user>
. $DB2_HOME/sqllib/db2profile


Note 5:
-------

Environment variabelen voor de ETM Instance owner op UNIX

De ETM software owner, of ook wel de ETM Instance owner, heeft op unix / AIX, een aantal noodzakelijke 
environment variabelen nodig in het .profile bestand. 

Stel dat we als voorbeeld nemen, de Instance SPLDEV1 die geinstalleerd is in "/spl/SPLDEV1"
In dat geval heeft de Instance owner zeker de volgende variabelen nodig. Je kunt ze aanpassen naar de environment 
die voor jou speelt, en direkt in het .profile file kopieren.

1. Algemene vars die verwijzen naar Support software als Java, Perl, DB2 connect e.d.

export COBDIR=/opt/SPLcobAS40		# of hogere versie
export COBMODE=64
export JAVA_HOME=/usr/java131		# of hogere versie
export LC_MESSAGES=C
export LANG=C
LD_LIBRARY_PATH=/spl/V1515_SFix2_BASE_SUN_DB2/runtime/db2:/spl/V1515_SFix2_BASE_SUN_DB2/product/tuxedo8.1/lib:/opt/SPLcobAS40/lib:/usr/local/lib:/opt/IBMdb2/db282/sqllib/lib::

3.	Vars mbt "deze" Instance

SPLAPP=/spl/splapp/V1515_SFix2_BASE_SUN_DB2
SPLBCKLOGDIR=/tmp
SPLBUILD=/spl/V1515_SFix2_BASE_SUN_DB2/cobol/build
SPLCOBCPY=/spl/V1515_SFix2_BASE_SUN_DB2/cobol/source/cm:/spl/V1515_SFix2_BASE_SUN_DB2/tuxedo/templates:/spl/V1515_SFix2_BASE_SUN_DB2/tuxedo/genSources:/spl/V1515_SFix2_BASE_SUN_DB2/cobol/source:/spl/V1515_SFix2_BASE_SUN_DB2/product/tuxedo8.1/cobinclude
SPLCOMMAND='ksh -o vi'
SPLCOMP=microfocus
SPLDB=db2
SPLEBASE=/spl/V1515_SFix2_BASE_SUN_DB2
SPLENVIRON=V1515_SFix2_BASE_SUN_DB2
SPLFUNCGETOP=''
SPLGROUP=cisusr
SPLHOST=sf-sunapp-22
SPLLOCALLOGS=/spl/vInd/local/logs
SPLLOGS=/spl/V1515_SFix2_BASE_SUN_DB2/logs
SPLQUITE=N
SPLRUN=/spl/V1515_SFix2_BASE_SUN_DB2/runtime/db2
SPLSOURCE=/spl/V1515_SFix2_BASE_SUN_DB2/cobol/source
SPLSUBSHELL=ksh
SPLSYSTEMLOGS=/spl/V1515_SFix2_BASE_SUN_DB2/logs/system
SPLUSER=cissys
SPLVERS=1
SPLVERSION=V1.5.15.1
SPLWEB=/spl/V1515_SFix2_BASE_SUN_DB2/cisdomain/applications
T=/spl/V135_MASTERTEMPLATE_UNIX
TERM=ansi
THREADS_FLAG=native
TUXCONFIG=/spl/V1515_SFix2_BASE_SUN_DB2/etc/tuxconfig.bin
TUXDIR=/spl/V1515_SFix2_BASE_SUN_DB2/product/tuxedo8.1
ULOGPFX=/spl/V1515_SFix2_BASE_SUN_DB2/logs/system/ULOG


55.3 Installing Micro focus:
============================


55.4 Installing Java or JRE:
============================

What is it?:
------------

- Java Compiler (javac):  Compiles programs written in the Java programming language into bytecodes.

- Java Interpreter (java):  Executes Java bytecodes.  In other words, it runs 
  programs written in the Java programming language.

- The Java 2 Runtime Environment is intended for software developers 
  and vendors to redistribute with their applications.

  The Java(TM) 2 Runtime Environment contains the Java virtual machine, 
  runtime class libraries, and Java application launcher that are 
  necessary to run programs written in the Java progamming language. 
  It is not a development environment and does not contain development 
  tools such as compilers or debuggers.  For development tools, see the 
  Java 2 SDK, Standard Edition.

- SDK, JDK
  Java 2 Platform, Standard Edition (J2SE) provides a complete environment for applications development 
  on desktops and servers and for deployment in embedded environments. It also serves as the foundation 
  for the Java 2 Platform, Enterprise Edition (J2EE) and Java Web Services.

- The PATH statement enables a system to find the executables (javac, java, javadoc, etc.) 
  from any current directory.

- The CLASSPATH tells the Java virtual machine and other applications (which are located in the 
  "jdk_<version>\bin" directory) where to find the class libraries, such as classes.zip file 
  (which is in the lib directory). 

The LIBPATH environment variable tells AIX applications, such as the JVM where to find shared libraries. 
This is equivalent to the use of LD_LIBRARY_PATH in other Unix-based systems. 

On AIX, LIBPATH must be set instead of LD_LIBRARY_PATH. On HP UX, SHLIB_PATH must be set instead of 
LD_LIBRARY_PATH. On Windows NT, no variable for shared libraries is required.


For AIX, a number of Java SDK's and JRE's are available, e.g.
December 2004 - SDK 1.3.1 32-bit PTF (APAR IY65310) released and JRE 1.3.1 32-bit refreshed, 
both using ca131-20041210 build (SR8). 
December 2004 - SDK 1.3.1 64-bit PTF (APAR IY65311) released and JRE 1.3.1 64-bit refreshed, 
both using caix64131-20041210 build (SR8). 


How to install?:
----------------

- Question: Can all these java releases co-exist on a machine? In which directories are these releases installed? 
    Answer: 
Yes, releases can co-exist. 
Java 1.1.8 installs in /usr/jdk_base 
Java 1.2.2 installs in /usr/java_dev2 
Java 1.3.0 installs in /usr/java130 
Java 1.3.1 64-bit install in /usr/java13_64 
Java 1.3.1 installs in /usr/java131 
Java 1.4 64-bit install in /usr/java14_64 
Java 1.4 installs in /usr/java14

- Question: What AIX levels are required for Java releases? 
    Answer: 
To take advantage of latest AIX fixes it is recommended/required that latest AIX Recommended Maintenance Level 
be used. The following is the minimum AIX level required at the time when a Java release was first released: 
Java 1.1.8 requires AIX 4.2.1 
Java 1.2.2 requires AIX 4.3.3 PLUS fixes 
Java 1.3.0 requires AIX 4.3.3.10 PLUS fixes 
Java 1.3.1 64-bit requires AIX 5.1.0.10 
Java 1.3.1 requires AIX 4.3.3.75 
Java 1.4 64-bit requires at least AIX 5.1.0.75 or AIX 5.2.0.10 
Java 1.4 requires at least AIX 5.1.0.75 or AIX 5.2.0.10


Question: What AIX levels are required for Java releases?
    Answer:
To take advantage of latest AIX fixes it is recommended/required that latest AIX Recommended Maintenance Level 
be used. The following is the minimum AIX level required at the time when a Java release was first released:
Java 1.1.8 requires AIX 4.2.1 
Java 1.2.2 requires AIX 4.3.3 PLUS fixes 
Java 1.3.0 requires AIX 4.3.3.10 PLUS fixes 
Java 1.3.1 64-bit requires AIX 5.1.0.10 
Java 1.3.1 requires AIX 4.3.3.75 
Java 1.4 requires at least AIX 5.1.0.75 or AIX 5.2.0.10
Java 5 requires at least AIX 5.2.0.75 or AIX 5.3.0.30


- Question: What paths do I need to set to use a specific Java release on my system? 
    Answer: 
Java 1.1.8: 
PATH=/usr/jdk_base/bin:$PATH 
Java 1.2.2: 
PATH=/usr/java_dev2/jre/sh:/usr/java_dev2/sh:$PATH 

Java 1.3.0 
PATH=/usr/java130/jre/bin:/usr/java130/bin:$PATH 

Java 1.3.1 64-bit: 
PATH=/usr/java13_64/jre/bin:/usr/java13_64/bin:$PATH 

Java 1.3.1 
PATH=/usr/java131/jre/bin:/usr/java131/bin:$PATH 

Java 1.4 64-bit: 
PATH=/usr/java14_64/jre/bin:/usr/java14_64/bin:$PATH 

Java 1.4 
PATH=/usr/java14/jre/bin:/usr/java14/bin:$PATH


Install JDK or SDK:

For base images after you downloaded either packagename.tar or the packagename.tar.gz file 
(the latter is recommended if you have gunzip utility available), you need to extract packagename from 
the downloaded file: 
# tar -xvf packagename.tar  (example: tar -xvf Java14.sdk.tar), or 
# gunzip -c packagename.tar.gz | tar -xvf -  (example: gunzip -c Java14.sdk.tar.gz | tar -xvf - ) 

For update images the .bff files are ready to be installed. Before installing, remove the old .toc file (if it exist) 
in the directory containing the .bff images. 

You can use the smitty command to install (both base and update images): 

        Run "smitty install"
        Select "Install and Update Software"
        Select "Install Software"
        Specify directory containing the images


Install JRE:

The JRE installation is simple. After downloading the package, create a directory where you want to install, 
then unpackage the files where /java_home is a directory of your choice and jre## refers to the specific 
JRE image from the download page. 
mkdir -p /java_home
cd /java_home 
tar -xvpf jre##.tar 
or 
gunzip -c < jre##.tar.gz | tar -xvpf -

 
How to check your java version?
-------------------------------

/software/java:>java -version
java version "1.3.1"
Java(TM) 2 Runtime Environment, Standard Edition (build 1.3.1)
Classic VM (build 1.3.1, J2RE 1.3.1 IBM AIX build ca131ifx-20040721a SR7P (JIT enabled: jitc))

/software/java:>java -fullversion
java full version "J2RE 1.3.1 IBM AIX build ca131ifx-20040721a SR7P"

/root:>which java
/usr/java131/bin/java

To check the Java filesets on your system:
# lslpp -l | grep Java


/root:>lslpp -l | grep Java
  Java131.rte.bin           1.3.1.16  COMMITTED  Java Runtime Environment
  Java131.rte.lib           1.3.1.16  COMMITTED  Java Runtime Environment
                                                 Java-based build tool.
                                                 JavaBeans(TM) (EJB(TM)).
                                                 Javadocs
                                                 Java(TM) technology-based Web
                                                 Java(TM) technology-based Web
                                                 Javadocs
  idebug.rte.hpj             9.2.5.0  COMMITTED  High-Performance Java Runtime
  idebug.rte.jre             9.2.5.0  COMMITTED  Java Runtime Environment
  idebug.rte.olt.Java        9.2.5.0  COMMITTED  Object Level Trace Java


Notes:
------

Note 1:
-------

thread

Q:

Unable to install Java 1.4 due to License Problem 
I am trying to install Java14_64.sdk package on AIX 5.3 ML4. The install fails with the message below, 
the license file IS on the system/media but for some reason is not recognized. 
When I try to install the license on its own it fails with a pre-requisite failure, the pre-requisite 
being the above package - Java14_64.sdk. 

Any ideas out there ?


Selected Filesets
-----------------
Java14_64.sdk 1.4.0.1 # Java SDK 64-bit

<< End of Success Section >>

FILESET STATISTICS
------------------
1 Selected to be installed, of which:
1 Passed pre-installation verification
----
1 Total to be installed

LICENSE AGREEMENT FAILURES
------------------
The installation cannot proceed because the following filesets
require software license agreement files which could not be
found
on the system or installation media:

Java14_64.sdk 


A:

The downloadable install images at:
http://www-128.ibm.com/developerwork...x/service.html
http://www-128.ibm.com/developerworks/java/jdk/aix/service.html
have both the 1.4.2.0 base images (ca1420-20040626) and the
current 'latest' level (ca142-20060824). You should be able
to use the 1.4.2.0 from that site plus your 1.4.2.1 update.

Paul Landay


Some jre versions on AIX:
-------------------------

AIX 5.1 ML5 comes with APAR IY52512
IBM SDK 1.3.1 SR7 32-bit (APAR IY52512) JavaTM 2 Runtime Environment, Standard Edition (build 1.3.1) Classic VM 
(build 1.3.1, J2RE 1.3.1 IBM AIX build ca131-20040517 (JIT enabled: jitc)) 

AIX 5.2 ML5 comes with APAR IY58350
Java(TM) 2 Runtime Environment, Standard Edition (build 1.3.1)
Classic VM (build 1.3.1, J2RE 1.3.1 IBM AIX build ca131ifx-20040721a SR7P (JIT enabled: jitc))


IY65305: JAVA142 32-BIT PTF : CA142IFX-20041203
== IY58350 : JAVA 1.3.1 32-bit SR7P : ca131ifx-20040721a

SDK 1.3.1 32-bit PTFs since GA:
ca131-20040517
					Java131.rte.bin
        APAR #    Fullversion		fileset level		"SR" #
        ------    -----------		---------------		------
	IY76252   ca131-20051025	1.3.1.18		SR 9
	IY65310   ca131-20041210	1.3.1.17		SR 8
  ->    IY58350   ca131ifx-20040721a    1.3.1.16                SR 7P
  ->	IY52512   ca131-20040517	1.3.1.15		SR 7
	IY50443   ca131-20031105	1.3.1.13		SR 6a
	IY49074   ca131-20031021	1.3.1.12		SR 6
	IY47055   ca131-20030630a	1.3.1.11		N/A
	IY45632   ca131-20030630	1.3.1.10		SR 5
	IY45288   ca131-20030329	1.3.1.9			N/A
	IY40440	  ca131-20030329	1.3.1.8			SR 4
	IY39508   ca131-20030122a	1.3.1.7			N/A
	IY38011	  ca131-20021107	1.3.1.6			SR 3W
	IY33957	  ca131-20021102	1.3.1.5			SR 3
        IY30887   ca131-20020706	1.3.1.2			SR 2


SDK 1.3.1 64-bit PTFs since GA:
					Java13_64.rte.bin
        APAR #    Fullversion		fileset level		"SR" #
        ------    -----------		-----------------	------

	IY76253   caix64131-20051025 	1.3.1.10		SR 9	
	IY65311   caix64131-20041210 	1.3.1.9			SR 8	
	IY58414   caix64131ifx-20040721 1.3.1.8			SR 7P	
	IY57370   caix64131-20040517	1.3.1.7			SR 7
        IY49076   caix64131-20031021	1.3.1.6			SR 6
	IY45633   caix64131-20030618	1.3.1.5			SR 5
	IY42844	  caix64131-20030329	1.3.1.4			SR 4
	IY34010   caix64131-20021102	1.3.1.3			SR 3
        IY30923   caix64131-20020706	1.3.1.2			SR 2


SDK 1.4 64-bit PTFs since 1.4.0 GA:
                                        Java14_64.sdk
    APAR #   Fullversion                fileset level   "SR" #
    ------   -----------                -------------   ------

    IY84054  caix64142-20060421           1.4.2.75      142 SR5 
    IY81444  caix64142ifx-20060209        1.4.2.51      142 SR4 (repackaged)
    IY77461  caix64142-20060120           1.4.2.50      142 SR4 (bad)
    IY75004  caix64142-20050929           1.4.2.20      142 SR3
    IY72502  caix64142-20050609           1.4.2.10      142 SR2
    IY70332  caix64142sr1aifx-20050414    1.4.2.5       N/A
    IY68122  caix64142sr1a-20050209       1.4.2.3       142 SR1a
 ==   IY62851  (IY63533 for download)
             caix64142-20040917           1.4.2.1       1.4.2 SR 1
    IY54664  caix641420-20040626          1.4.2.0       N/A (1.4.2 GA code)
    IY58415  caix641411ifx-20040810       1.4.1.4       1.4.1 SR 3
    IY52686  caix641411-20040301          1.4.1.3       1.4.1 SR 2
    IY48526  caix641411-20030930          1.4.1.2       1.4.1 SR 1
    IY47538  caix64141-20030703a          1.4.1.1       N/A
    IY43716  caix64141-20030522           1.4.1.0       N/A

Latest: 1.4.2 Service Release 5 (caix64142-20060421)


Other notes:
------------

jre 131 32 bit:
installs in /usr/java131

5100-08 (APAR IY70781)  - min AIX51
5200-06 (APAR IY67913)  - min AIX52
5300-02 (APAR IY69190)  - min AIX53

jre 131 64 bit:
installs in /usr/java13_64

5100-08 (APAR IY70781)  - min AIX51
5200-06 (APAR IY67913)  - min AIX52
5300-02 (APAR IY69190)  - min AIX53


Java 1.1.8, 1.2.2, and 1.4.1 are no longer supported by IBM. 

For AIX 4.3.3, which is out of support, Java 1.3.1 requires the AIX 4330-10 Recommended Maintenance Level.
For AIX 5.1, Java 1.3.1 requires the AIX 5100-03 Recommended Maintenance Level.
For AIX 5.2, Java 1.3.1 requires the AIX 5200-01 Recommended Maintenance Level.
For AIX 5.3, Java 1.3.1 requires Version 5.3.0.1 (APAR IY58143) or later.


Java version on AIX 5.3:
========================

The latest Java technology is included with base AIX 5L V5.3. 
The IBM 32-bit SDK for AIX 5L, Java 2 Technology Edition V1.4 ships with AIX 5L V5.3. 
The IBM 64-bit SDK for AIX 5L, Java 2 Technology Edition V1.4 is available on the AIX 5L V5.3 Expansion Pack 
and the AIX 5L Java Web site at ibm.com/developerworks/java/jdk/aix.


JVM problems and AIX Environment Variables in relation to Java:
===============================================================

Default Behavior of Java on AIX

This section describes the settings as they are right now. These settings may, and in most cases will, 
change over time. The README or SDK Guide accompanying the SDK are always the most up-to-date references 
for such settings.

Java uses the following environment settings:

AIXTHREAD_SCOPE=S 
This setting is used to ensure that each Java thread maps 1x1 to a kernel thread. The advantage of this approach 
is seen in several places; a notable example is how Java exploits Dynamic Logical Partitioning (DLPAR); 
when a new CPU is added to the partition, a Java thread can be scheduled on it. This setting should not be 
changed under normal circumstances. 

AIXTHREAD_COND_DEBUG, AIXTHREAD_MUTEX_DEBUG and AIXTHREAD_RWLOCK_DEBUG 
These flags are used for kernel debugging purposes. These may sometimes be set to OFF. If not, switching 
them off can provide a good performance boost.

LDR_CNTRL=MAXDATA=0x80000000 
This is the default setting on Java 1.3.1, and controls how large the Java heap can be allowed to grow. 
Java 1.4 decides the LDR_CNTRL setting based on requested heap. See Getting more memory in AIX for your 
Java applications for details on how to manipulate this variable.

JAVA_COMPILER 
This decides what the Just-In-Time compiler will be. The default is jitc, which points to the IBM JIT compiler. 
It can be changed to jitcg for the debug version of JIT compiler, or to NONE for switching the JIT compiler off 
(which in most cases is the absolute worst thing you can do for performance).

IBM_MIXED_MODE_THRESHOLD 
This decides the number of invocations after which the JVM JIT-compiles a method. This setting varies 
by platform and version; for example, it is 600 for Java 1.3.1 on AIX. 


Note 1:
-------

About o_maxdata and LDR_CNTRL:

... space for the native heap. Moving the fence down allows the native heap to grow, while reducing shared memory. 
For a setting of o_maxdata = N, the fence is placed at 0x30000000+N. For several good reasons, 
it is recommended to set o_maxdata to a value that is the start of a particular segment, 
such as 0xn0000000. In this case, the fence sits between segments 2+n and 3+n, which translates 
to n segments for the native heap, and 10-n segments for shared memory.

o_maxdata=8: 8 seg for native, 2 seg for shared
o_maxdata=7: 7 seg for native, 3 seg for shared
o_maxdata=6: 6 seg for native, 4 seg for shared
o_maxdata=5: 5 seg for native, 5 seg for shared
o_maxdata=4: 4 seg for native, 6 seg for shared
o_maxdata=3: 3 seg for native, 7 seg for shared *
o_maxdata=2: 2 seg for native, 8 seg for shared


By default, o_maxdata is set to 0x80000000, leaving 2 GB for native heap and 512 MB for shared memory. 
If you attempt to allocate a Java heap larger than 1 GB, it fails because Java tries to use shared memory 
for heap, and there is only 512 MB of shared memory available. If you set IBM_JAVA_MMAP_JAVA_HEAP 
in the environment and try to allocate a heap larger than 512 MB, JVM will be unable to allocate the heap. 
The solution is to adjust o_maxdata in such a way that the size of shared memory grows large enough 
to accommodate the Java heap. The next section shows you how to do this. 


So how do you go to a larger Java heap? You need to change o_maxdata to increase the amount of 
shared memory address space. You can use the following calculations to come up with the appropriate value 
for o_maxdata. Supposing you need a maximum heap size of J bytes, you would invoke Java as 

java -mxJ <other arguments> 

If J is less than 1 GB, and IBM_JAVA_MMAP_JAVA_HEAP is not set, the default setup will suffice. 
If J is > 1 GB, or if IBM_JAVA_MMAP_JAVA_HEAP is set, use o_maxdata = 0xn0000000 

where  n = (10 - ceil(J/256M)) or 8 

whichever is smaller. The function ceil rounds up the argument to the next integer. 

For example, if you need to allocate 1500 MB of heap, we have 

n = (10 - ceil(1500M/256M)) = (10 - 6) = 4. If you set o_maxdata = 0x40000000, 

you will be able to allocate the needed size of heap. To change o_maxdata, set the following 
environment variable: LDR_CNTRL=MAXDATA=<new o_maxdata value> 

The above example would set the following environment variable: LDR_CNTRL=MAXDATA=0x40000000
 

To verify that your calculation is accurate, you can try the following commands: 
$ export LDR_CNTRL=MAXDATA=0x40000000 
$ java -mx1500m -version
 
Setting the IBM_JAVA_MMAP_JAVA_HEAP variable

# export IBM_JAVA_MMAP_JAVA_HEAP=true


So, if you need to enhance memory for Websphere 5.x 32 bits, put the following lines
into the startServer.sh script, or in /prj/was/omgeving.rc:

export LDR_CNTRL=MAXDATA=0xn0000000
export IBM_JAVA_MMAP_JAVA_HEAP=true

try:

export AIXTHREAD_SCOPE=S
export AIXTHREAD_MUTEX_DEBUG=OFF
export AIXTHREAD_RWLOCK_DEBUG=OFF
export AIXTHREAD_COND_DEBUG=OFF
export LDR_CNTRL=MAXDATA=0x40000000 
export IBM_JAVA_MMAP_JAVA_HEAP=TRUE

or

export IBM_JAVA_MMAP_JAVA_HEAP=true
export LDR_CNTRL=MAXDATA=0x80000000

or

export IBM_JAVA_MMAP_JAVA_HEAP=true
export LDR_CNTRL=MAXDATA=0x80000000 


Note 2:
-------

I think the problem is that there are typically a lot of JNI allocations in
the heap that are pinned and are allocated for the life of the application.
Most of these are allocated during startup. If the min and max heap sizes
are the same, these pinned allocations are scattered throughout the heap.
Whereas if the min heap size is quite low, most of these allocations will be
closer together at the start of the heap, leaving the bulk of the heap (when
it's expanded) more free of pinned memory.


55.5 Installing Perl:
=====================

AIX supports dynamically loadable objects as well as shared libraries. Shared libraries by convention 
end with the suffix .a, which is a bit misleading, as an archive can contain static as well as 
dynamic members. For perl dynamically loaded objects we use the .so suffix also used on many other platforms.

Note that starting from Perl 5.7.2 (and consequently 5.8.0) and AIX 4.3 or newer Perl uses the AIX native 
dynamic loading interface in the so called runtime linking mode instead of the emulated interface that 
was used in Perl releases 5.6.1 and earlier or, for AIX releases 4.2 and earlier. This change does break 
backward compatibility with compiled modules from earlier perl releases. The change was made to make 
Perl more compliant with other applications like Apache/mod_perl which are using the AIX native interface. 
This change also enables the use of C++ code with static constructors and destructors in perl extensions, 
which was not possible using the emulated interface.


Starting from AIX 4.3.3 Perl 5 ships standard with AIX. (Perl 5.8.0 with AIX 5L V5.2, 5.6.0 with AIX 5L V5.1, 
5.005_03 with AIX 4.3.3.)

You either get the source code and compile Perl, or in some situations you might be happy with installing
a binary build.


55.6 Installing DB2 Connect Enterprise Edition 8.x:
===================================================

DB2 Connect 
DB2(R) Connect provides fast and robust connectivity to IBM(R) mainframe databases for e-business 
and other applications running under UNIX(R) and Windows(R) operating systems. 

DB2 Connect Personal Edition provides direct connectivity to host and iSeries DB2 servers, while 
DB2 Connect Enterprise Edition provides indirect connectivity that allows clients to access 
host and iSeries DB2 servers through the DB2 Connect server. DB2 Connect Unlimited Edition 7 and 
DB2 Connect Application Server Edition provide unique packaging solutions that make product selection 
and licensing easier. 


Note 1:
-------

Log on to the system as a user with root authority. 
Refer to the CD-ROM label to ensure that you are using the CD-ROM with your appropriate language. 
Change to the directory where the CD-ROM is mounted by entering the following command: 
   cd /cdrom 


- For AIX 4.3.3, HP-UX and Linux 

Enter the ./db2setup command to start the DB2 Setup wizard.
 
- For Solaris Operating Environment and AIX 5L 

Copy product.tar.Z, where product represents the product you are licensed to install, to a temporary filesystem. 
Enter the following command to start the DB2 Setup wizard: 

# zcat product.tar.Z | tar -xf - ; ./product/db2setup 

For example, if the product name for DB2 Enterprise Server Edition is ese, then enter the following command: 

# zcat ese.tar.Z | tar -xf - ; ./ese/db2setup 

After a moment, the IBM DB2 Setup Launchpad opens. 

When you have completed your installation, DB2 will be installed in the one of the following directories: 

For AIX: 
/usr/opt/db2_08_01 

For HP-UX, Linux, Solaris Operating Environment: 
/opt/IBM/db2/V8.1 

The installation logs db2setup.his, db2setup.log, and db2setup.err are located, by default, 
in the /tmp directory. You can specify the location of the log files. 
The db2setup.log file captures all DB2 installation information including errors. 
The db2setup.his records all DB2 installations on your machine. 
DB2 appends the db2setup.log file to the db2setup.his file. The db2setup.err file captures any error output 
that is returned by Java (for example, exceptions and trap information). 

If you want your DB2 product to have access to DB2 documentation either on your 
local computer or on another computer on your network, then you must install the DB2 Information Center. 
The DB2 Information Center contains documentation for DB2 Universal Database and DB2 related products. 


Note 2: db2admin
----------------

db2admin - DB2 Administration Server Command 
This utility is used to manage the DB2 Administration Server. 

Authorization 
Local administrator on Windows, or DASADM on UNIX based systems. 

Required connection 
None 

Command syntax 
>>-db2admin----------------------------------------------------->

>--+-----------------------------------------------------------------+-><
   +-START-----------------------------------------------------------+
   +-STOP--+--------+------------------------------------------------+
   |       '-/FORCE-'                                                |
   +-CREATE--+----------------------+--+---------------------------+-+
   |         '-/USER:--user-account-'  '-/PASSWORD:--user-password-' |
   +-DROP------------------------------------------------------------+
   +-SETID--user-account--user-password------------------------------+
   +-SETSCHEDID--sched-user--sched-password--------------------------+
   +- -?-------------------------------------------------------------+
   '- -q-------------------------------------------------------------

Note: 
If no parameters are specified, and the DB2 Administration Server exists, this command returns the name 
of the DB2 Administration Server. 

START 
Start the DB2 Administration Server. 

STOP /FORCE 
Stop the DB2 Administration Server. The force option is used to force the DB2 Administration Server to stop, 
regardless of whether or not it is in the process of servicing any requests. 

CREATE /USER: user-account /PASSWORD: user-password 
Create the DB2 Administration Server. If a user name and password are specified, the DB2 Administration Server 
will be associated with this user account. If the specified values are not valid, the utility returns 
an authentication error. The specified user account must be a valid SQL identifier, and must exist in the security database. It is recommended that a user account be specified to ensure that all DB2 Administration Server functions can be accessed. 

Note: 
To create a DAS on UNIX systems, use the dascrt command. 

Starting and stopping the DAS:

db2admin stop
db2admin start 


Note 3: db2start
----------------

db2start - Start DB2 Command 
Starts the current database manager instance background processes on a single database partition 
or on all the database partitions defined in a partitioned database environment. Start DB2 at the server 
before connecting to a database, precompiling an application, or binding a package to a database. 

db2start can be executed as a system command or a CLP command


Note 4: example cronjobs
------------------------

30 20 * * 1-6 /usr/opt/db2_08_01/adm/db2stop force >> /home/db2inst1/DBMaintenance/dbbkup.log 2>&1
31 20 * * 1-6 /usr/opt/db2_08_01/adm/db2start >> /home/db2inst1/DBMaintenance/dbbkup.log 2>&1


Note 5: sample db2 connect processes:
-------------------------------------

Using AIX, you would use the command ps -ef in order to examine processes. On Solaris and HP-UX, ps -ef 
will only show the db2sysc process (the main DB2 engine process) for all server-side processes 
(eg: agents, loggers, page cleaners, and prefetchers). If you're using Solaris or HP-UX, you can see these 
side processes with the command /usr/ucb/ps -axw. Both of these versions of the ps command work on Linux.

When performing this command on a computer running the DB2 Universal Database client or server software, 
you may see several DB2 processes listed. 

Example 1:

/root:#ps -ef | grep db2
 iinvu02 188456 422094   0 13:53:02      -  0:00 db2agent (idle) 0
   db2as 266468      1   0 13:52:10      -  0:00 /prj/db2/admin/db2as/das/adm/db2dasrrm
 iinvu02 282624 417996   0 13:53:03      -  1:13 db2disp 0
    root 295060      1   0 13:52:24      -  0:00 db2wdog 0
 iinvu01 299158 303256   0 13:52:26      -  0:00 db2resync 0
 iinvu01 303256 295060   0 13:52:24      -  0:00 db2sysc 0
    root 307350 303256   0 13:52:24      -  0:00 db2ckpwd 0
    root 311448 303256   0 13:52:24      -  0:00 db2ckpwd 0
    root 315546 303256   0 13:52:24      -  0:00 db2ckpwd 0
 iinvu01 319644 303256   0 13:52:24      -  0:00 db2gds 0
 iinvu01 323742 303256   0 13:52:24      -  0:00 db2ipccm 0
 iinvu01 327840 303256   0 13:52:25      -  0:00 db2tcpcm 0
 iinvu01 331938 303256   0 13:52:25      -  0:00 db2tcpcm 0
 iinvu01 336036 303256   0 13:52:25      -  0:00 db2tcpcm 0
 iinvu01 340134 303256   0 13:52:25      -  0:00 db2tcpcm 0
 iinvu01 344232 319644   0 13:52:26      -  0:00 db2srvlst 0
 iinvu01 348330 303256   0 13:52:26      -  0:00 db2spmrsy 0
 iinvu01 352428 319644   0 13:52:26      -  0:00 db2spmlw 0
 iinvu02 356606 401604   0 13:52:46      -  0:16 db2hmon 0
 iinvu01 360624 303256   0 13:52:26      -  0:18 db2hmon 0
 iinvu01 377016      1   0 13:52:32      -  3:00 /prj/db2/admin/iinvu01/sqllib/bin/db2fmd -i iinvu01 -m /prj/db2/admin/iinvu01/sqllib/lib/libdb2gcf.a
 iinvu02 389128 417996   0 13:52:46      -  0:00 db2srvlst 0
    root 393408      1   0 13:52:38      -  0:00 db2wdog 0
 iinvu02 397514 401604   0 13:52:46      -  0:00 db2resync 0
 iinvu02 401604 393408   0 13:52:38      -  0:00 db2sysc 0
    root 405702 401604   0 13:52:38      -  0:00 db2ckpwd 0
    root 409800 401604   0 13:52:38      -  0:00 db2ckpwd 0
    root 413898 401604   0 13:52:38      -  0:00 db2ckpwd 0
 iinvu02 417996 401604   0 13:52:38      -  0:00 db2gds 0
 iinvu02 422094 401604   0 13:52:39      -  0:00 db2ipccm 0
 iinvu02 426192      1   5 13:52:52      -  2:55 /prj/db2/admin/iinvu02/sqllib/bin/db2fmd -i iinvu02 -m /prj/db2/admin/iinvu02/sqllib/lib/libdb2gcf.a
    root 475370      1   0 13:53:50      -  0:18 /usr/opt/db2_08_01/bin/db2fmcd
   db2as 528466      1   0 13:55:52      -  0:00 /prj/db2/admin/db2as/das/bin/db2fmd -i db2as -m /prj/db2/admin/db2as/das/lib/libdb2dasgcf.a
 iinvu01 561230 323742   0 13:57:29      -  0:03 db2agent (idle) 0
 iinvu02 573686 417996   0 15:11:41      -  0:02 db2agent (idle) 0

Example 2:

    root 49504     1   0 13:13:07    -  0:00 db2wdog 
db2inst1 25844 35124   0 16:04:50    -  0:00 db2pfchr 
db2inst1 35124 65638   0 16:04:17    -  0:00 db2gds 
db2inst1 35540 35124   0 16:04:50    -  0:00 db2loggr (SAMPLE) 
db2inst1 41940 65638   0 16:04:19    -  0:00 db2resync 
db2inst1 45058 35124   0 16:04:50    -  0:00 db2pfchr 
db2inst1 49300 35124   0 16:04:19    -  0:00 db2srvlst 
db2inst1 49626 35124   0 16:04:50    -  0:00 db2dlock (SAMPLE) 
db2inst1 55852 65638   0 16:04:17    -  0:00 db2ipccm 
db2inst1 58168 35124   0 16:04:50    -  0:00 db2loggw (SAMPLE) 
db2inst1 59048 35124   0 16:04:50    -  0:00 db2pfchr 
db2inst1 64010 55852   0 16:04:50    -  0:00 db2agent (SAMPLE) 
db2inst1 65638 22238   0 16:04:17    -  0:00 db2sysc 
db2inst1 70018 35124   0 16:04:50    -  0:00 db2pclnr 
db2inst1 72120 35124   0 16:04:51    -  0:00 db2event (DB2DETAILDEADLOCK) 
db2inst1 74198 65638   0 16:04:17    -  0:00 db2syslog 
db2inst1 74578     1   0 16:04:47    -  0:00 /home/db2inst1/sqllib/bin/db2bp  
  50112C14631 5 


- db2dasrrm: The DB2 Admin Server process. This process supports both local and remote administration requests 
  using the DB2 Control Center 

- db2pclnr: I/O cleaners, associated with data cache buffers

- db2rebal: This process is used to perform a rebalancing of the data when a container is added to a DMS table space.

- db2disp: The DB2 agent dispatcher process. This process dispatches application connections between the 
  logical agent assigned to the application and the available coordinating agents when connection concentration 
  is enabled.

  This process will only exist when connection concentration is enabled.

- The db2ckpwd utility in DB2 is used to verify usernames and passwords for the operating system. 
  db2ckpwd takes a file descriptor as a command line argument and reads the username and password information 
  from that file descriptor.

- db2gds: The DB2 Global Daemon Spawner process that starts all DB2 EDUs (processes) on UNIX. 
  There is one db2gds per instance or database partition  

- db2ipccm: listener process for local applications.

- db2tcpcm: A remote client establishes TCP/IP communications through the db2tcpcm listener process. 

- db2sysc: The main DB2 system controller or engine. Without this process, the database server cannot function.

- db2resync: The resync manager process used to support applications that are using two-phase commit  

- db2wdog: The DB2 watchdog. This process is required since processes in UNIX can only track their 
  parent process ID. Each time a new process is started, the db2gds notifies the DB2 watchdog. 
  In the event that any DB2 process receive a ctrl-c or other abnormal signal, the process send the signal 
  to the watchdog, and it propagates the signal to all of the other processes in the instance. 

- db2ca: Starts the Configuration Assistant. The Configuration Assistant is a graphical interface 
  that is used to manage DB2 database configuration such as database manager configuration, DB2 registry, 
  node directory, database directory and DCS directory

- Agents

  An agent can be thought of as a 'worker' that performs all database operations on behalf of an application. 
  There are two main types of DB2 agents:

  Coordinator Agent (db2agent)
  A coordinator agent (or a coordinating agent) coordinates the work on behalf of an application and communicates 
  to other agents using interprocess communication (IPC) or remote communication protocols. 
  All connection requests from client applications, whether they are local or remote, are allocated a corresponding 
  coordinator agent. 

  Subagent (db2agntp)
  When the intra_parallel database manager configuration parameter is enabled, the coordinator agent distributes 
  the database requests to subagents (db2agntp). These agents perform the requests for the application. 
  Once the coordinator agent is created, it handles all database requests on behalf of its application 
  by coordinating subagents (db2agent) that perform requests on the database. 
  When an agent or subagent completes its work it becomes idle. When a subagent becomes idle, its name changes 
  from db2agntp to db2agnta.

  For example:

  db2agntp processes are active subagents which are currently performing work for the coordinator agent. 
  These processes will only exist when intra-partition parallelism is enabled.

  db2agnta processes are idle subagents that were used in the past by a coordinator agent.

- db2hmon: The db2hmon process has changed in DB2 Universal Database Version 8.2 and is no longer associated 
  with the HEALTH_MON database manager configuration parameter.  
  
  In DB2r Universal DatabaseT (DB2 UDB) Version 8.1, the db2hmon process was controlled by the HEALTH_MON 
  database manager configuration parameter. When HEALTH_MON was set to ON, a single-threaded independent 
  coordinator process named db2hmon would start. This process would terminate if HEALTH_MON was set to OFF. 
  In DB2 UDB Version 8.2, the db2hmon process is no longer controlled by the HEALTH_MON database manager 
  configuration parameter. Rather, it is a stand-alone process that is part of the database server 
  so when DB2 is started, the db2hmon process starts. db2hmon is a special multi-threaded DB2FMP process 
  that is named db2hmon on UNIX/Linux platforms and DB2FMP on Windows. 


Note 6: db2icrt
--------------- 

On UNIX-based systems, the db2icrt utility is located in the DB2DIR/instance directory, 
where DB2DIR represents /usr/opt/db2_08_01 on AIX, and /opt/IBM/db2/V8.1 on all other UNIX-based systems. 
If you have a FixPak or modification level installed in an alternate path, 
the DB2DIR directory is /usr/opt/db2_08_FPn on AIX and opt/IBM/db2/V8.FPn on all other 
UNIX-based systems, where n represents the number of 1 the FixPak or modification level.
The db2icrt utility creates an instance on the directory from which you invoke it. 

>>-db2icrt--+-----+--+-----+--+---------------+----------------->
            +- -h-+  '- -d-'  '- -a--AuthType-'
            '- -?-'

>--+---------------+--+---------------+--+----------------+----->
   '- -p--PortName-'  '- -s--InstType-'  '- -w--WordWidth-'

>--+---------------+--InstName---------------------------------><
   '- -u--FencedID-'

On an AIX machine, to create an instance called "db2inst1" on the directory /u/db2inst1/sqllib/bin, issue 
the following command from that directory: 

Example 1 

On a client machine: 1 
usr/opt/db2_08_01/instance/db2icrt db2inst1 
On a server machine: 1 
usr/opt/db2_08_01/instance/db2icrt -u db2fenc1 db2inst1 
where db2fenc1 is the user ID under which fenced user-defined functions and fenced stored procedures will run. 

Example 2 
On an AIX machine, if you have Alternate FixPak 1 installed, run the following command to create an instance 
running FixPak 1 code from the Alternate FixPak install path: 
/usr/opt/db2_08_FP1/instance/db2icrt -u db2fenc1 db2inst1 


Note 7: Whats an instance? Compared to Unix / Z
-----------------------------------------------

1. What's an Instance? And where is my Subsystem?

Posted 11/8/2005 | by Chris Eaton | Comments (0) | TrackBacks (0) 
If you are new to DB2 LUW from a DB2 for z/OS background then the first thing you probably noticed 
is that there is no subsystem on the LUW platform. Instead you will hear the term Instance used in a similar manner. 
An instance in DB2 for LUW is like a copy of the RDBMS including all the processes that run DB2 and memory 
(address spaces) associated with that instance of DB2 and some configuration parameters (ZPARMS) to control 
that instance. Think of it as a copy of the DB2 code running on a server. You can have as many instances 
as you like running on a single server. Associated with an instance is the concept of an instance owner. 
This is the user that "owns" that instance and has SYSADM authority over the instance and all databases 
inside that instance. SYSADM authority is the highest level of authority in DB2 and lets this user do anything 
within the databases it manages (create, drop, access all data, grant, revoke, etc). 

You can have one or more databases in each instance but a database is not exactly the same as you have 
on z/OS either. On z/OS you have one catalog per subsystem and a database is merely a logical collection of tables, 
indexes that usually have a distinct relationship to a given application. On the LUW platform each database has 
its own catalogs associated with it which stores all the metadata about that database. 

Why the difference? Well, as with many of the differences you will find at the server or storage layer, 
they are mostly due to the "culture" or "industry standard terms" that are typically used in a Linux, 
UNIX or for that matter a Windows environment. An Instance is a common term across a number of distributed 
platform RDBMSs to represent a copy of the database management code running on a server. And you won't likely 
find the term subsystem used to describe anything on a distributed platform (except for maybe some people 
talking about storage but if you dig a bit you will likely find that in a past life these people 
worked on a mainframe).

The other important distinction in this area is that your application connects to a database in the LUW 
environment (not a subsystem or instance). As well if you want to join tables across different databases 
you would use the federated query support built into DB2.


On MVS, OS390, zos:                         On UNIX/Windows:

----------------------------                ----------     -----------------------
| Subsystem                |                |INSTANCE|     |INSTANCE             |
----------------------------                ----------     -----------------------
 |             |       |                        |              |               |
---------     ----    ----                    ------------    ------------    ------------
|CATALOG|     |DB|    |DB|                    |DB+catalog|    |DB+catalog|    |DB+catalog|
---------     ----    ----                    ------------    ------------    ------------


   So, a  Z "Subsystem" <=corresponds to=> an Unix "Instance".


2. Aanvullende Info op Unix:

Na de installatie van DB2 kunt u alleen met DB2 communiceren door het instanti%ren van DB2. 
Met andere woorden, u maakt een object (lees: Database Manager) binnen DB2 aan,
die voor u de communicatie verzorgt.

  Dus een "Instance" <=corresponds to=> "Database Manager"

Stel u heeft een instance van een Database Manager aangemaakt. Deze Database Manager verzorgt de communicatie 
met zowel lokale als remote databases. U dient de Database Manager te instrueren hoe en op welke wijze 
bepaalde databases benaderd kunnen worden. Tevens geeft u aan onder welke `eenvoudige' naam deze set van 
instructies gebruikt kunnen worden. Dit is de zogenaamde Alias. 

In onderstaand figuur wordt schematisch weergegeven hoe de communicatie tussen de platformen 
wordt gerealiseerd.


       AIX                                                 z/OS
                                                                   
------------------------------              -------------------------------------     
|      -------------          |             |                                   |
|      |Application|          |             | Een Partitie                      |
|      ------|-------         |             |                ----------------   |
|            |                |             |                | DBMS 1 port A |  |      
| ------------------------    |             |                |        ----   |  |
| | Instance =           |    |        |------------------------>     |DB|   |  |
| | Database Manager     |    |        |    |                |        ----   |  |
| |                      |    |        |    |                | ----          |  |
| |         ----------   |    |        |---------------------->|DB|          |  |
| |         |Alias 1 |   |    |        |    |                | ----          |  |
| |         |        |------------------    |                ----------------   |
| |         ----------   |    |             |             ----------------      |
| |                      |    |             |             |DBMS 2 port B |      |
| |         ----------   |    |             |             |              |      |
| |         |Alias 2 |   |    |      alias  |             |       ----   |      |
| |         |        |--------------------------------------->    |DB|   |      |
| |         ----------   |    |             |             |       ----   |      |
| |                      |    |             |             |              |      |
| ------------------------    |             |             -----------------     |
-------------------------------             --------------------------------------


Zoals u ziet verloopt communicatie tussen een Applicatie, bijv. Websphere Application Server (WAS),
en het mainframe platform via DB2. Voor de eenvoud nemen we voor nu even aan dat de applicatie binnen 
de WAS de communicatie rechtstreeks aangaat. Binnen DB2 verzorgt een instance van een Database Manager 
de voorgedefinieerde connecties. De Database Manager stelt deze connecties ter beschikking 
middels een alias.

De middels pijltjes `eenvoudig' weergegeven connecties tussen de twee platformen dienen nader bekeken 
te worden. De pijl vanaf een alias tot aan een database op het mainframe wordt een node genoemd. 
Deze node dient vooraf voorzien te worden van informatie op basis waarvan toegang tot de betreffende 
database op het mainframe verkregen kan worden. Het pakket aan informatie gekoppeld aan een node noemen we 
een catalog. De node wordt alsvolgt opgebouwd:

 -------------------------------------------------------------------
 |Alias (heeft de database op het mainframe gekoppeld aan de Node) |
 -------------------------------------------------------------------
                         |
                         |
 -------------------------------------------------------------------------------------------
 |Node (kent het IP nummer van het mainframe en het poortnummer van de DBMS op de partitie)|
 -------------------------------------------------------------------------------------------


E,n alias heeft ,,n verbinding met ,,n DBMS op ,,n partitie op het mainframe. 
Binnen het DBMS `leven' namelijk meerdere databases. Indien een connectie moet worden gelegd 
tussen AIX en een andere database in een andere DBMS op dezelfde partitie dient een nieuwe alias 
(en dus node) aangemaakt te worden.


Bij het configureren van een Remote Database praten we dus over de connectie tussen DB2 
en een database op een partitie op het mainframe. De volgende stappen moeten we doorlopen om een 
werkende connectie aan te maken:

�	Aanmaken van de node;
�	Koppelen van node aan ip-nummer van het mainframe;
�	Koppelen van node aan poortnummer van een partitie op het mainframe;
�	Koppelen van database op het mainframe aan een alias op AIX

De implementatie zullen we laten zien aan de hand van een voorbeeld:

Catalog= ( node={IP + port} ( {Alias=DB} ) )

------------------------------------------------
	
db2=> Catalog tcpip node <nodenaam> <remote ip-adres mainframe> server <poortnummer>	

Het laatste commando initieert feitelijk de node gekoppeld aan het ipnummer 
van de mainframe en het poortnummer van de partitie, waarbij:

Nodenaam: De naam van de node. Deze kunt u zelf kiezen (bijv NOO49 : NOde Ontwikkeling 49).
Ip-adres mainframe:	T-partitie: 10.73.64.183
Poortnummer mainframe	T-partitie: 447 of 448 (afhankelijk van DBMS): BACDB2O = 447 (Ontwikkel omgeving)  
                                                                       BACDB2I  = 448 (Integratie omgeving)

-- Ter controle:


db2=> list node directory

Adds a Transmission Control Protocol/Internet Protocol (TCP/IP) node entry to the node directory. 
The TCP/IP communications protocol is used to access the remote node. 
The CATALOG TCPIP NODE command is run on a client. 

-----------------------------------------------

Vervolgens koppelen we de database op het mainframe middels een node aan een alias. Voer uit:

db2=> Catalog database databasenaam as alias at node nodenaam authentication dcs

Databasenaam:	Een bestaande database binnen het DBMS op het mainframe
Alias:       	Vrij te kiezen naam
Nodenaam:	De naam van de hierboven aangemaakte node

-- Ter controle:

db2=> list db directory

Nu doen we

db2=> Catalog dcs database databasenaam as DBMS

Databasenaam	De hierboven bestaande database binnen het DBMS op het mainframe
DBMS	        Dit is het DataBase Management Systeem op het mainframe.
                Bijv. T-partitie: BACDB2O (Ontwikkel omgeving)                
                                  BACDB2I (Integratie omgeving)

-- Ter controle:

db2=> list dcs directory

Vervolgens loggen wij in het mainframe in om de connectie te testen:

db2=> connect to aliasnaam user user using password

Aliasnaam	De zojuist hierboven aangemaakte alias voor een verbinding met het mainframe
User	        Uw userid of een userid met voldoende rechten op het mainframe (bijv. BDN account)
Password	Password van het toepaste userid

  Dus een sessie tot stand brengen gaat als in het onderstaande voorbeeld:

  connect to pscrx user u@mnx01 using PSWVDB2C;
  set current sqlid = 'F@MNX01'


Dus hoe zit het nu:
===================

Je wilt via DB2 connect naar een remote DB op een mainframe. Op de client doe je:

db2=> Catalog tcpip node <nodenaam> <remote ip-adres mainframe> server <poortnummer>
      db2=> list node directory         (=controle statement)
db2=> Catalog database databasenaam as alias at node nodenaam authentication dcs
      db2=> list db directory           (=controle statement)
db2=> Catalog dcs database databasenaam as DBMS
      db2=> list dcs directory          (=controle statement)

Je neemt een willekeurige handig nodenaam (door jou te kiezen dus) en koppel dat begrip
aan de remote IP en poort.
Dan koppel je de echte databasenaam aan een handige (door jou te kiezen dus) Alias, en dat koppel je dan ook
aan de nodenaam.

Voortaan kun je dan met de Alias een connectie opzetten !


Note: Connection via DB2 Connect:
---------------------------------

First of all install the DB2 client (for me it was DB2connect 7.1) and register it 
with the proper license (using db2licm).

Now you are ready to register your remote database.
I'll need to provide:
hostname,
port,
database name,
authentication method.

For every DB, I need three registrations: tcp/ip node, database and DCS.

Let's start from the tcp/ip node.

Connect to your db2 user (by default db2inst1):

db2inst1@brepredbls01:~> db2
(c) Copyright IBM Corporation 1993,2001
Command Line Processor for DB2 SDK 7.2.0

db2 =>


-- Now from the db2 client command prompt:

catalog tcpip node <nodename> remote <hostaname> server <port>

where nodename is an alias you choose, hostname is the DB2 remote hostname and the port is the DB2 listening port.

example:

catalog tcpip node RIHEP remote rihep.rit server 5023

to unregister it:

uncatalog node RIHEP 

and to list the register nodes:

db2 => list node directory

 Node Directory

 Number of entries in the directory = 3

Node 1 entry:

 Node name                      = AMDSPT
 Comment                        =
 Protocol                       = TCPIP
 Hostname                       = amdahlsvil.ras
 Service name                   = 5023

Node 2 entry:

 Node name                      = AMSVIL
 Comment                        =
 Protocol                       = TCPIP
 Hostname                       = amdahlsvil.ras
 Service name                   = 6021

Node 3 entry:

 Node name                      = RIHEP
 Comment                        =
 Protocol                       = TCPIP
 Hostname                       = rihep.rit
 Service name                   = 5023


-- Now you need to catalog your remote DB2 database:

catalog database <DBname> as <DBalias> at node <nodename> authentication DCS

Where DBname is the name of the remote database, DBalias is the name you are going to use in your connection 
and nodename is the node alias you registered above.
The chosen authentication has been DCS for my environment.

Example:

catalog database ITFINDB2 as ITFINDB2 at node RIHEP authentication DCS

If you wish to unregister the DB:

uncatalog database ITFINDB2 

for the list:

db2 => list db directory

 System Database Directory

 Number of entries in the directory = 3

Database 1 entry:

 Database alias                  = ITFINDB2
 Database name                   = ITFINDB2
 Node name                       = RIHEP
 Database release level          = 9.00
 Comment                         =
 Directory entry type            = Remote
 Authentication                  = DCS
 Catalog node number             = -1

Database 2 entry:

 Database alias                  = DB2PROD
 Database name                   = DB2PROD
 Node name                       = AMSVIL
 Database release level          = 9.00
 Comment                         =
 Directory entry type            = Remote
 Authentication                  = DCS
 Catalog node number             = -1

Database 3 entry:

 Database alias                  = DB2DSPT
 Database name                   = DB2DSPT
 Node name                       = AMDSPT
 Database release level          = 9.00
 Comment                         =
 Directory entry type            = Remote
 Authentication                  = DCS
 Catalog node number             = -1

-- Last registration step: the DCS.

catalog dcs database <DBname> as <DBalias>

example:

catalog dcs database ITFINDB2 as ITFINDB2

to unregister:

unregister dcs ITFINDB2

For the list:

db2 => list dcs directory

 Database Connection Services (DCS) Directory

 Number of entries in the directory = 3

DCS 1 entry:

 Local database name                = DB2DSPT
 Target database name               = DB2DSPT
 Application requestor name         =
 DCS parameters                     =
 Comment                            =
 DCS directory release level        = 0x0100

DCS 2 entry:

 Local database name                = DB2PROD
 Target database name               = DB2PROD
 Application requestor name         =
 DCS parameters                     =
 Comment                            =
 DCS directory release level        = 0x0100

DCS 3 entry:

 Local database name                = ITFINDB2
 Target database name               = ITFINDB2
 Application requestor name         =
 DCS parameters                     =
 Comment                            =
 DCS directory release level        = 0x0100


Now you can check if your configuration is correct:

db2 => connect to ITFINDB2 user sisbanc
Enter current password for sisbanc:

   Database Connection Information

 Database server        = DB2 OS/390 7.1.1
 SQL authorization ID   = SISBANC
 Local database alias   = ITFINDB2

This indicate a succesful connection.
An error or a command prompt without output indicates a failure.

ex:

db2 => connect to ITFINDB2 user sisbanc
Enter current password for sisbanc:

db2 => db2 => 


Note 9: license for DB2 Connect 8.x
----------------------------------- 

To license DB2 Connect 8.x, you typically use a statement like

/usr/opt/db2_08_01/adm/db2licm -a /prj/db2/install/udb/8.1/db2ese.lic


Note 10: DB2 Connect Configuration files:
-----------------------------------------

- db2nodes.cfg 

This topic provides information about the format of the node configuration file (db2nodes.cfg). 
The db2nodes.cfg file is used to define the database partition servers that participate in a DB2 instance. 
The db2nodes.cfg file is also used to specify the IP address or host name of a high-speed interconnect, 
if you want to use a high-speed interconnect for database partition server communication. 

The format of the db2nodes.cfg file 7 is as follows: 

nodenum    hostname    logical port   netname    resourcesetname 


Note 11: Most important Err messages:
-------------------------------------

1.

db2=> connect to <> user <> using <>

SQL1032N No start database manager command was issued.  SQLSTATE=57019

I keep getting the following error:
[IBM][CLI Driver] SL1032N No start database manager command was issued. SQLSTATE=57019

I tried starting the database, and I still get the above error.
Exacly how am I suppose to start the database, and how do I get rid of the above error?


56. Setting up an ASCII terminal on AIX:
========================================

The 3151 display can connect directly, or through a modem, to an AIX system.
The connection to the AIX system can be made to one of the native serial ports,
or to an asynchronous adapter. 
To add a TTY, use the following procedure:

- use "smitty tty" and select "Add a TTY" 
  or use "smitty maktty"

- or use mkdev

# mkdev -c tty -t tty -s rs232 -p sa0 -w s1 -a login=enable -a term=ibm3151
# mkdev -c tty -t tty -s rs232 -p sa0 -w s0 -a login=enable -a term=ibm3151

To validate that the tty has been added to the customized VPD object class, enter
# lscfg -vp | grep tty
tty0      01-S1-00-00       Asynchronous Terminal

To display the name of the systemconsole effective on the next startup, enter
# lscons -b
/dev/tty0


You can remove a terminal with
# rmdev -l tty_name -d

On the ASCII terminal, set the communications options as follows:
Line speed (baud rate) = 9600
Word Length (bits per character) = 8
Parity = no (none)
Number of Stop Bits = 1
Interface = RS-232C or RS-422A
Line Control = IPRTS


57: chroot:
===========

chroot

Run a command with a different root directory
'chroot' runs a command with a specified root directory. On many systems, only the super-user can do this. 


SYNTAX
     chroot NEWROOT [COMMAND [ARGS]...]

     chroot OPTION Ordinarily, filenames are looked up starting at the root of the directory structure, i.e. '/' 

'chroot' changes the root to the directory NEWROOT (which must exist) and then runs COMMAND with optional ARGS. 

If COMMAND is not specified, the default is the value of the `SHELL' environment variable or `/bin/sh' if not set, 
invoked with the `-i' option. 

The only options are `--help' and `--version' 

AIX:
----

chroot Command

Purpose
Changes the root directory of a command.

Syntax
chroot Directory Command

Description

Attention: If special files in the new root directory have different major and minor device numbers than the 
real root directory, it is possible to overwrite the file system.
The chroot command can be used only by a user operating with root user authority. 
If you have root user authority, the chroot command changes the root directory to the directory 
specified by the Directory parameter when performing the Command. The first / (slash) in any path name 
changes to Directory for the specified Command and any of its children.

The Directory path name is always relative to the current root. Even if the chroot command is in effect, 
the Directory path name is relative to the current root of the running process.

A majority of programs may not operate properly after the chroot command runs. For example, the commands 
that use the shared libraries are unsuccessful if the shared libraries are not in the new root file system. 
The most commonly used shared library is the /usr/ccs/lib/libc.a library.

The ls -l command is unsuccessful in giving user and group names if the current root location makes 
the /etc/passwd file beyond reach. In addition, utilities that depend on localized files (/usr/lib/nls/*) 
may also be unsuccessful if these files are not in the new root file system. It is your responsibility 
to ensure that all vital data files are present in the new root file system and that the path names 
accessing such files are changed as necessary.

Examples

Attention: The commands in the following examples may depend on shared libraries. Ensure that the shared 
libraries are in the new root file system before you run the chroot command.
To run the pwd command with the /usr/bin directory as the root file system, enter: 

# mkdir /usr/bin/lib
 
# cp /usr/ccs/lib/libc.a /usr/bin/lib
 
chroot /usr/bin pwd
To run a Korn shell subshell with another file system as the root file system, enter: 

# chroot /var/tmp /usr/bin/ksh

This makes the directory name / (slash) refer to the /var/tmp for the duration of the /usr/bin/ksh command. 
It also makes the original root file system inaccessible. The file system on the /var/tmp file must contain 
the standard directories of a root file system. In particular, the shell looks for commands in the 
/bin and /usr/bin files on the /var/tmp file system.

Running the /usr/bin/ksh command creates a subshell that runs as a separate process from your original shell.
 Press the END OF FILE (Ctrl-d) key sequence to end the subshell and go back to where you were 
in the original shell. This restores the environment of the original shell, including the meanings 
of the . (current directory) and the / (root directory).


58. The date command:
=====================

The date command can be very interesting to use on shell scripts, for example for testing purposes.
You can device a test like

daynumber=`date -u %d`
export daynumber

if daynumber=31 then
..
The following shows what can be done using date.

NAME
       date - print or set the system date and time

SYNOPSIS
       date [OPTION]... [+FORMAT]
       date [-u|--utc|--universal] [MMDDhhmm[[CC]YY][.ss]]

DESCRIPTION
       Display the current time in the given FORMAT, or set the system date.

       -d, --date=STRING
	      display time described by STRING, not `now'

       -f, --file=DATEFILE
	      like --date once for each line of DATEFILE

       -ITIMESPEC, --iso-8601[=TIMESPEC]
	      output  date/time	 in ISO 8601 format.  TIMESPEC=`date' for date
	      only, `hours', `minutes', or `seconds' for date and time to  the
	      indicated	 precision.   --iso-8601  without TIMESPEC defaults to
	      `date'.

       -r, --reference=FILE
	      display the last modification time of FILE

       -R, --rfc-822
	      output RFC-822 compliant date string

       -s, --set=STRING
	      set time described by STRING

       -u, --utc, --universal
	      print or set Coordinated Universal Time

       --help display this help and exit

       --version
	      output version information and exit

       FORMAT controls the output.  The only valid option for the second  form
       specifies Coordinated Universal Time.  Interpreted sequences are:

       %%     a literal %

       %a     locale's abbreviated weekday name (Sun..Sat)

       %A     locale's full weekday name, variable length (Sunday..Saturday)

       %b     locale's abbreviated month name (Jan..Dec)

       %B     locale's full month name, variable length (January..December)

       %c     locale's date and time (Sat Nov 04 12:02:33 EST 1989)

       %C     century  (year  divided  by  100	and  truncated	to an integer)
	      [00-99]

       %d     day of month (01..31)

       %D     date (mm/dd/yy)

       %e     day of month, blank padded ( 1..31)

       %F     same as %Y-%m-%d

       %g     the 2-digit year corresponding to the %V week number

       %G     the 4-digit year corresponding to the %V week number

       %h     same as %b

       %H     hour (00..23)

       %I     hour (01..12)

       %j     day of year (001..366)

       %k     hour ( 0..23)

       %l     hour ( 1..12)

       %m     month (01..12)

       %M     minute (00..59)

       %n     a newline

       %N     nanoseconds (000000000..999999999)

       %p     locale's upper case AM or PM indicator (blank in many locales)

       %P     locale's lower case am or pm indicator (blank in many locales)

       %r     time, 12-hour (hh:mm:ss [AP]M)

       %R     time, 24-hour (hh:mm)

       %s     seconds since `00:00:00 1970-01-01 UTC' (a GNU extension)

       %S     second (00..60); the 60 is necessary to accommodate a leap  sec-
	      ond

       %t     a horizontal tab

       %T     time, 24-hour (hh:mm:ss)

       %u     day of week (1..7);  1 represents Monday

       %U     week number of year with Sunday as first day of week (00..53)

       %V     week number of year with Monday as first day of week (01..53)

       %w     day of week (0..6);  0 represents Sunday

       %W     week number of year with Monday as first day of week (00..53)

       %x     locale's date representation (mm/dd/yy)

       %X     locale's time representation (%H:%M:%S)

       %y     last two digits of year (00..99)

       %Y     year (1970...)

       %z     RFC-822 style numeric timezone (-0500) (a nonstandard extension)

       %Z     time zone (e.g., EDT), or nothing if  no	time  zone  is	deter-
	      minable

       By  default, date pads numeric fields with zeroes.  GNU date recognizes
       the following modifiers between `%' and a numeric directive.

	      `-' (hyphen) do not pad the field `_' (underscore) pad the field
	      with spaces

ENVIRONMENT
       TZ     Specifies the timezone, unless overridden by command line param-
	      eters.  If neither is specified, the setting from /etc/localtime
	      is used.


DATE=$(date +%d"-"%B"-"%Y) 
ERRORDATE=$(date +%m%d0000%y) 

 
==================================
59. SOME NOTES ON LPARS ON POWER5:
==================================

This section is about pSeries and AIX only.

59.1 General architecture:
--------------------------

Before the POWER5 Architecture, you could only use lpars with dedicated cpu's, and disks dedicated to an lpar.
As from POWER5 you can use "Micro Partitioning" (assign cpu power in increments of 10% to lpars),
you can use "Dynamic LPAR" (reassign resouces to and from lpars without a reboot of lpars)
and every resource (SCSI, Netcards etc..) can be virtualized. But DLPAR was also available before Power5.

- "Virtual IO Server" (VIOS) must be installed on a partition (Nederlands: de beheer partitie) 
  to enable virtualization services.
  The other partitions can be AIX52, AIX53, Linux (Redhat, Suse) and i5/OS (AIX52 cannot use virtualized services,
  so you have to assign dedicated cpu's and disks to that partition).

  Also, if you do not have VIOS, you can only use the traditional lpars.
  VIOS provides the IO and ethernet resources to the other lpars.
  You cannot use VOIS as a usable operating system for applications. It is only used to provide
  virtual resources to other partitions. You must use the HMC or IVM to assign resources to lpars.

- You can use HMC to define partitions and administer partitions (e.g. start, shutdown an lpar)
  The HMC is a desktop connected with ethernet to the pSeries machine.

- You can use Integrated Virtualization Manager (IVM) on systems where an HMC is not used. 
  You can use the local IVM to create and administer lpars. This is a Webbased interface.
  If you want or need to use IVM, you need to install the VIOS on a nonpartitioned Server first.
  Then you can use a PC with a LAN connection to the Server, and use the browser interface.

- The Partion Load Manager (PLM) makes it possible to re-assign resources from lpars
  with lower needs (at a time) to lpars who needs higher number of resources (at a time).
  Policies can be defined on how to manage that.


HMC makes use of "partition profiles", in which you for example, can define for a lpar what the desired and
minimum and maximum resource values are. The IVM does not make use of profiles.
You can create a "system profile" that lists which partion profiles are to be used when the
Server is restarted.
Take notice of the fact that the HMC has the lpar configuration information in the form of saved profiles.

IVM does not have a commandline interface. You can telnet or ssh from your PC to the lpar for VOIS, and
use the "mkvt" command to create a vt to another lpar.

In order to use the PLM, you need to have a HMC connected to the managed Server, and you must have
an AIX 5.2 ML4 or 5.3 lpar or Server where PLM will be running.

You can create a Virtual Ethernet and VLAN's with VID's which enables lpars to communicate
with each other through this "internal" network.

Server Operating Systems can be placed in LPARS, like AIX 5.2, AIX 5.3, Linux and some others.
For AIX, only 5.3 can be a virtual client of virtualized resources.

Access to real storage devices is implemented through the Virtual SCSI services, a part of the VIOS.
Logical volumes that are created and exported on the Virtual I/O Server partition are shown at the
virtual storage client partition as a SCSI disk. 
The Virtual I/O Server supports logical mirroring and RAID. Logical Volumes created on RAID or JOBD
are bootable.

The VIOS and PLM is delivered on CD. 

To enable Power5 Partitioning, you must have obtained a key from IBM. But on the 570 and above,
this feature is per default implemented.

An AIX 5.2 lpar needs dedicated resources. AIX 5.3 can use all virtualization features.


59.2 Create an AIX logical partition and profile:
-------------------------------------------------

- logon to HMC
- Choose "Server and Partition"
- Choose "Server management"
- Choose your Server from list
- Rightclick on Partitions -> Click Create -> Click Logical Partition


59.3 Create a virtual ethernet adapter for AIX:
-----------------------------------------------

- logon to HMC
- Choose "Server and Partition"
- Choose "Server management"
- Choose your Server from list
- click on Partitions -> rightclick the partitionprofile of the
  partition who is about to use the virtual ethernet adapter 
  -> Select Dynamic Logical Partitions -> Virtual adapterrescouces -> Add/Remove
  -> Choose the tab Virtual I/O -> Choose Ethernet -> Create
  -> A dialog Properties will be displayed
  -> Fill in the slotnumber, Port Virtual LAN ID (PVID)


chown emcdmeu:emcdgeu

59.4 Installation of PLM:
-------------------------

Preparation:
============

1. Put the hostname of every lpar fullyqualified, like
lpar1.domain.com
lpar2.domain.com

2. If you do not use DNS, put in every hostfile of all lpars, 
   the hostname of the PLM Server, 
   the other hostnames of all other lpars,
   and the hostname of the HMC, like for example

172.16.0.30   lpar1.domain.com        lpar1
172.16.0.33   lpar2.domain.com        lpar2
172.16.0.100  plmserver1.domain.com   plmserver1
172.16.0.3    p5hmc1.domain.com       p5hmc1

3. Check whether Dynamic partitioning is possible for an lpar

# lssrc -a | grep rsct

If the deamon IBM.DRM is started, then an active RMC session is present on this lpar with the HMC.
RMC stands for Resource Monitoring and Control.

In order for DLPAR to work on an lpar, you need to see the following subsystems installed and active:

Subsystem	
ctrmc		Resource monitoring and control subsystem
IBM.CSMAgentRM	is for handshaking between the lpar and hmc		
IBM.ServiceRM		
IBM.DRM		is for executing the dlpar commands on the lpar 	
IBM.HostRM	is for obtaining OS information

On the HMC, you can check which lpars are ready for DLPAR with the following command:

# lspartition -dlpar


4. You need to have rsh and rcp access for all lpars.
If those are not enabled, the do the following:

- edit the .rhosts file on any lpar, and type in the lines

plmserver1 root
plmserver1.domain.com root

- chmod 4554 /usr/sbin/rshd
- chmod 4554 /usr/bin/rcp

- edit /etc/inetd.conf and make sure that this line is not commented out:
shell stream tcp6 nowait root /usr/sbin/rshd rshd

- Start the inetd deamon again with
refresh -s inetd

- Test the rsh access from the PLM Server with:
rsh root@lpar1 date
rsh root@lpar2 date

- Create the account "plmuser" on the PLM Server

- You need to have an ssh connection between the HMC and the PLM Server. 
Install Openssh on the PLM Server, and create a ssh user on the HMC.
To install Openssh on AIX, you need to have Openssl as well.
Create the ssh keys to make communication possible from HMC to PLM Server.

Installation:
=============


1. Place the PLM CD in the drive
2. smitty install_latest
3. The following filesets will be installed:
plm.license
plm.server.rte
plm.sysmgt.websm
plm.msg.en_US.server
plm.msg.en_US.websm


IOSCLI:
=======

The command line interface of the VIOS is called the IOSCLI.
The shell is a restricted shell, for example, you cannot change directories or change your PATH env. variable.

You can either work in the traditional mode or interactive mode.

- In traditional mode, you start a command with "ioscl", like in

# ioscli lsdev -virtual  (to list all virtual devices)

- In interactive mode, you use the aliases for the ioscli subcommands.
  That is, start the ioscli, and then just type the subcommand, like in

# ioscli
# lsdev -virtual

You cannot run external commands from interactive mode, like grep or sed.
First leave the interactive mode with "exit".

To escape from the limitations of ioscli, run "oem_setup_env" and you have access
to regular commands.


59.5 Important IOSCLI commands:
-------------------------------

Only on the VIOS:

lsmap command:
--------------

Displays the mapping between physical, logical, and virtual devices.

Syntax
lsmap { -vadapter ServerVirtualAdapter | -plc PhysicalLocationCode | -all }

lsmap [ -type BackingDeviceType | -net ]

lsmap [ -fmt Delimiter ] [ -field FieldNames ]

Description
The lsmap command displays the mapping between virtual host adapters and the physical devices they are backed to. 
Given a device name (ServerVirtualAdapter) or physical location code (PhysicalLocationCode) of a 
server virtual adapter, the device name of each connected virtual target device (child devices), 
its logical unit number, backing device(s) and the backing devices physical location code is displayed. 
If the -net flag is specified the supplied device must be a virtual server Ethernet adapter.

The -fmt flag divides the output by a user-specified delimiter/character (delimiter). The delimiter can be 
any non-white space character. This format is provided to facilitate scripting.

Examples:

- To list all virtual target devices and backing devices mapped to the server virtual SCSI adapter vhode2, type: 

# lsmap -vadapter vhost2 

The system displays a message similar to the following: 
 SVSA         Physloc                                     Client Partition ID
------------ -------------------------------------------- ------------------
vhost0       U9111.520.10004BA-V1-C2                      0x00000004

VTD                   vtscsi0
LUN                   0x8100000000000000
Backing device        vtd0-1
Physloc

VTD                   vtscsi1
LUN                   0x8200000000000000
Backing device        vtd0-2
Physloc

VTD                   vtscsi2
LUN                   0x8300000000000000
Backing device        hdisk2
Physloc               U787A.001.0397658-P1-T16-L5-L0


- To list the shared Ethernet adapter and backing device mapped to the virtual server Ethernet adapter ent4, type: 

# lsmap -vadapter ent4 -net

The system displays a message similar to the following: 
SVEA   Physloc
------ --------------------------------------------
ent4   P2-I1/E1

SEA                   ent5
Backing device        ent1
Physloc               P2-I4/E1


- To list the shared Ethernet adapter and backing device mapped to the virtual server Ethernet adapter ent5 
in script format separated by a : (colon), type: 

# lsmap -vadapter ent5 -fmt ":"

The system displays a message similar to the following: 
ent5:ent8:ent2


- To list all virtual target devices and backing devices, where the backing devices are of type disk or lv, type: 

# lsmap -all -type disk lv

The system displays a message similar to the following: 
SVSA            Physloc                                      Client Partition ID
--------------- -------------------------------------------- ------------------
vhost0          U9117.570.10D1B0E-V4-C3                      0x00000000

VTD                   vtscsi0
LUN                   0x8100000000000000
Backing device        hdisk0
Physloc               U7879.001.DQD0KN7-P1-T12-L3-L0

VTD                   vtscsi2
LUN                   0x8200000000000000
Backing device        lv04
Physloc                

SVSA            Physloc                                      Client Partition ID
--------------- -------------------------------------------- ------------------
vhost1          U9117.570.10D1B0E-V4-C4                      0x00000000

VTD                   vtscsi1
LUN                   0x8100000000000000
Backing device        lv03
Physloc 


mkvdev command:
---------------

Purpose
Adds a virtual device to the system.

Syntax
To create a virtual target device:

mkvdev [ -f ] {-vdev TargetDevice | -dplc TDPhysicalLocatonCode } { -vadapter VirtualServerAdapter | 
              -aplc VSAPhysicalLocationCode} [ -dev DeviceName ]

- To create a Shared Ethernet Adapter:

# mkvdev -sea TargetDevice -vadapter VirtualEthernetAdapter... -default DefaultVirtualEthernetAdapter 
       -defaultid SEADefaultPVID [ -attr Attribute=Value [ Attribute=Value... ] ]

- To create an Link Aggregation adapter:

# mkvdev -lnagg TargetAdapter... [ -attr Attribute=Value [ Attribute=Value... ] ]

- To create a VLAN Ethernet adapter:

# mkvdev -vlan TargetAdapter -tagid TagID

Description
The mkvdev command creates a virtual device. The name of the virtual device will be automatically generated 
and assigned unless the -dev DeviceName flag is specified, in which case DeviceName will become 
the device name. If the -lnagg flag is specified, a Link Aggregation or IEEE 802.3 Link Aggregation 
(automatic Link Aggregation) device is created. To create an IEEE 802.3 Link Aggregation set the mode attribute 
to 8023ad. If the -sea flag is specified, a Shared Ethernet Adapter is created. The TargetDevice may be a 
Link Aggregation adapter (note, however, that the VirtualEthernetAdapter may not be Link Aggregation adapters). 
The default virtual Ethernet adapter, DefaultVirtualEthernetAapter, must also be included as one of the 
virtual Ethernet adapters, VirtualEthernetAdapter. The -vlan flag is used to create a VLAN device and 
the -vdev flag creates a virtual target device which maps the VirtualServerAdapter to the TargetDevice.

If the backing device that is specified by the -vdev or -dplc flags is already in use, an error will be 
returned unless the -f flag is also specified.


Examples:
---------

Example 1:
----------

Suppose you have VIOS running, and you want to create three AIX53 client lpars, LPS1, LPS2 and LPS3.
Suppose from VIOS, you have created a number of virtual scsi controllers:

- Listing virtuele scsi controllers.

# lsdev -virtual

You will see a listing of virtual scsi controllers: vhost0, vhost1, en vhost2

- From VIOS, create Volume Groups.

Suppose hdisk2, hdisk3, and hdisk4 are not yet assigned, and thus are free to create VG's.

# mkvg -f -vg rootvg_lpar1  hdisk2
# mkvg -f -vg rootvg_lpar2  hdisk3
# mkvg -f -vg rootvg_lpar3  hdisk4


- Now create LV's.

# mklv -lv rootvg_lps1 rootvg_lpar1 15G
# mklv -lv rootvg_lps2 rootvg_lpar2 15G
# mklv -lv rootvg_lps3 rootvg_lpar3 15G

   Note: this could also be have done:
   # mklv -lv rootvg_lps1 rootvg_lpar1 15G
   # mklv -lv rootvg_lps2 rootvg_lpar1 15G
   # mklv -lv rootvg_lps3 rootvg_lpar1 15G


The lv's rootvg_lps1, rootvg_lps2, and rootvg_lps3 will become the rootvg's for the AIX53 client partitions.

- Create mappings.

# mkvdev -vdev rootvg_lps1 -vadapter vhost0
# mkvdev -vdev rootvg_lps2 -vadapter vhost1
# mkvdev -vdev rootvg_lps3 -vadapter vhost2


vhostx = LV \
vhosty = LV -> VG {disk(s)}
vhostz = LV /


More examples:
--------------

- From a AIX 5.3 client partition run the lsdev command, like

# lsdev -Cc disk -s vscsi
hdisk2 Available Virtual SCSI Disk Drive

# lscfg -vpl hdisk2
hdisk2 11.520.10DDEDC-V3-C5-T1-L810000000 Virtual SCSI Disk Drive

root@zd110l06:/root#lscfg -vpl hdisk2
  hdisk2           U9117.570.65B61FE-V6-C7-T1-L810000000000  Virtual SCSI Disk Drive

  PLATFORM SPECIFIC

  Name:  disk
    Node:  disk
    Device Type:  block


- To create the mapping of a virtual scsi adapter vhost0, to a logical volume (rootvg_nim) that an AIX partition
will use later as a disk, use

# mkvdev -vdev rootvg_nim -vadapter vhost0 -dev vnim

- To create a virtual target device that maps the logical volume lv20 as a virtual disk for a client partition 
hosted by the vhost0 virtual server adapter, type: 

# mkvdev -vdev lv20 -vadapter vhost0

The system displays a message similar to the following: 
vtscsi0 available

- To create a virtual target device that maps the physical volume hdisk6 as a virtual disk for a client partition 
served by the vhost2 virtual server adapter, type: 

# mkvdev -vdev hdisk6 -vadapter vhost2

The system displays a message similar to the following: 
vtscsi1 available

- To create a Shared Ethernet Adapter that maps the physical Ethernet adapter "ent4" as a virtual Ethernet adapter 
for the client partitions served by the virtual Ethernet adapters ent6, ent7, and ent9, using ent6 as the 
default adapter and 8 as the default ID, type: 

# mkvdev -sea ent4 -vadapter ent6,ent7,ent9 -default ent6 -defaultid 8

The system displays a message similar to the following: 
ent10 available           (which is the sea)

	Remember how to create a SEA on the VIOS:

	- To create a Shared Ethernet Adapter:

	# mkvdev -sea PhysTargetDevice -vadapter VirtualEthernetAdapter... -default DefaultVirtualEthernetAdapter 
       		-defaultid SEADefaultPVID [ -attr Attribute=Value [ Attribute=Value... ] ]


- To create an automatic Link Aggregation with primary adapters ent4 and ent5 and backup adapter ent6, type: 

# mkvdev -lnagg ent4,ent5 -attr backup_adapter=ent6 mode=6023ad

The system displays a message similar to the following: 
ent10 available


lsdev command (on VIOS):
------------------------

The lsdev command on a VIO Server has a bit of a different syntax compared to a regular AIX partition.
Commands like "lsdev -Cc tape" does not work on VIO.
Instead, you have a limited number of parameters you can give to the lsdev command.


Usage: lsdev [-type DeviceType ...] [-virtual] [-state DeviceState]
             [-field FieldName ...] [-fmt delimiter]
       lsdev {-dev DeviceName | -plc PhysicalLocationCode} [-child]
             [-field FieldName ...] [-fmt delimiter]
       lsdev {-dev DeviceName | -plc PhysicalLocationCode} [-parent |
             -attr [Attribute] | -range Attribute | -slot | -vpd]
       lsdev -slots
       lsdev -vpd

So normally you will use the following on a VIO Server:

lsdev -dev [options]  
                        like "lsdev -dev" 
                             "lsdev -dev <device>" 
                             "lsdev -dev <device> -vpd"
lsdev -slots 
lsdev -vpd
lsdev -virtual


>> Examples of Usage of lsdev on VIOS:


# tn vioserver1
Trying...
Connected to vioserver1.
Escape character is '^T'.

telnet (vioserver1)

IBM Virtual I/O Server

login: padmin
padmin's Password:
Last unsuccessful login: Mon Sep 24 04:25:04 CDT 2007 on /dev/vty0
Last login: Wed Nov 21 05:10:29 CST 2007 on /dev/pts/0 from starboss.antapex.org


Suppose you have logged on as padmin on a VIO server. Now you try the following commands
to retrieve information of the system:


$ lsdev -dev fcs*

name            status                                            description
fcs0            Available  FC Adapter
fcs1            Available  FC Adapter
fcs2            Available  FC Adapter
fcs3            Available  FC Adapter

$  lsdev -dev fcs0

name            status                                            description
fcs0            Available  FC Adapter


$ lsdev -dev fcs* -vpd|grep Z8
        Device Specific.(Z8)........20000000C95CDDEE
        Device Specific.(Z8)........20000000C95C88F1
        Device Specific.(Z8)........20000000C95AB49A
        Device Specific.(Z8)........20000000C95CDBFD


$ lsdev -dev fcs0 -vpd

  fcs0             U7879.001.DQDTZXG-P1-C6-T1  FC Adapter

        Part Number.................03N7069
        EC Level....................A
        Serial Number...............1B64505069
        Manufacturer................001B
        Feature Code/Marketing ID...280B
        FRU Number.................. 03N7069
        Device Specific.(ZM)........3
        Network Address.............10000000C95CDBFD
        ROS Level and ID............02881955
        Device Specific.(Z0)........1001206D
        Device Specific.(Z1)........00000000
        Device Specific.(Z2)........00000000
        Device Specific.(Z3)........03000909
        Device Specific.(Z4)........FF801413
        Device Specific.(Z5)........02881955
        Device Specific.(Z6)........06831955
        Device Specific.(Z7)........07831955
        Device Specific.(Z8)........20000000C95CDBFD
        Device Specific.(Z9)........TS1.91A5
        Device Specific.(ZA)........T1D1.91A5
        Device Specific.(ZB)........T2D1.91A5
        Device Specific.(YL)........U7879.001.DQDTZXG-P1-C6-T1

  PLATFORM SPECIFIC

  Name:  fibre-channel
    Model:  LP10000
    Node:  fibre-channel@1
    Device Type:  fcp
    Physical Location: U7879.001.DQDTZXG-P1-C6-T1


$ lsdev -slots

# Slot                      Description       Device(s)
U7311.D11.655157B-P1-C4     Logical I/O Slot  pci12 ent0 ent1
U7311.D11.655157B-P1-C5     Logical I/O Slot  pci13 fcs2
U7311.D11.655158B-P1-C6     Logical I/O Slot  pci14 fcs3
U7311.D20.655159B-P1-C04    Logical I/O Slot  pci9 sisscsia0
U7879.001.DQDTPAK-P1-C5     Logical I/O Slot  pci10 fcs1
U7879.001.DQDTZXG-P1-C6     Logical I/O Slot  pci8 fcs0
U7879.001.DQDTPAK-P1-T12    Logical I/O Slot  pci11 sisscsia1
U9117.570.65B61FE-V17-C0    Virtual I/O Slot  vsa0
U9117.570.65B61FE-V17-C11   Virtual I/O Slot  vhost0
U9117.570.65B61FE-V17-C12   Virtual I/O Slot  vhost1
..
U9117.570.65B61FE-V17-C324  Virtual I/O Slot  vhost33


$ lsdev -type disk

name            status                                            description
hdisk0          Available  16 Bit LVD SCSI Disk Drive
hdisk1          Available  16 Bit LVD SCSI Disk Drive
..
hdisk10         Available  16 Bit LVD SCSI Disk Drive
hdisk11         Available  SAN Volume Controller MPIO Device
hdisk12         Available  SAN Volume Controller MPIO Device
..
hdisk35         Available  SAN Volume Controller MPIO Device
vg01sanl02      Available  Virtual Target Device - Disk
vg01sanl03      Available  Virtual Target Device - Disk
..
vg04sanl14      Available  Virtual Target Device - Disk
vg05sanl14      Available  Virtual Target Device - Disk
vzd110l01       Available  Virtual Target Device - Logical Volume
vzd110l02       Available  Virtual Target Device - Logical Volume
..
vzd110l14       Available  Virtual Target Device - Logical Volume


$ lsdev -virtual

name            status                                            description
vhost0          Available  Virtual SCSI Server Adapter
vhost1          Available  Virtual SCSI Server Adapter
vhost2          Available  Virtual SCSI Server Adapter
..
vhost33         Available  Virtual SCSI Server Adapter
vsa0            Available  LPAR Virtual Serial Adapter
vg01sanl02      Available  Virtual Target Device - Disk
vg01sanl03      Available  Virtual Target Device - Disk
..
vg05sanl14      Available  Virtual Target Device - Disk
..
vzd110l01       Available  Virtual Target Device - Logical Volume
vzd110l14       Available  Virtual Target Device - Logical Volume


$ lsdev -vpd          # gives a huge list of output

INSTALLED RESOURCE LIST WITH VPD

The following resources are installed on your machine.

  Model Architecture: chrp
  Model Implementation: Multiple Processor, PCI bus

  sys0                                                                           System Object
  sysplanar0                                                                     System Planar
  vio0                                                                           Virtual I/O Bus
  vhost33          U9117.570.65B61FE-V17-C324                                    Virtual SCSI Server Adapter

        Device Specific.(YL)........U9117.570.65B61FE-V17-C324

  vg05sanl14       U9117.570.65B61FE-V17-C324-L2                                 Virtual Target Device - Disk
  vg03sanl14       U9117.570.65B61FE-V17-C324-L1                                 Virtual Target Device - Disk
  vhost32          U9117.570.65B61FE-V17-C323                                    Virtual SCSI Server Adapter

        Device Specific.(YL)........U9117.570.65B61FE-V17-C323

  vg04sanl14       U9117.570.65B61FE-V17-C323-L2                                 Virtual Target Device - Disk
  vg03sanl13       U9117.570.65B61FE-V17-C323-L1                                 Virtual Target Device - Disk
  vhost31          U9117.570.65B61FE-V17-C224                                    Virtual SCSI Server Adapter

        Device Specific.(YL)........U9117.570.65B61FE-V17-C224

  vg02sanl14       U9117.570.65B61FE-V17-C224-L1                                 Virtual Target Device - Disk
  vhost30          U9117.570.65B61FE-V17-C223                                    Virtual SCSI Server Adapter

        Device Specific.(YL)........U9117.570.65B61FE-V17-C223

..
..
  vg01sanl05       U9117.570.65B61FE-V17-C115-L1                                 Virtual Target Device - Disk
  vhost15          U9117.570.65B61FE-V17-C113                                    Virtual SCSI Server Adapter

        Device Specific.(YL)........U9117.570.65B61FE-V17-C113
  vg04sanl03       U9117.570.65B61FE-V17-C113-L3                                 Virtual Target Device - Disk
  vg03sanl03       U9117.570.65B61FE-V17-C113-L2                                 Virtual Target Device - Disk
  vg01sanl03       U9117.570.65B61FE-V17-C113-L1                                 Virtual Target Device - Disk
  vhost14          U9117.570.65B61FE-V17-C112                                    Virtual SCSI Server Adapter

        Device Specific.(YL)........U9117.570.65B61FE-V17-C112
..
..
        Device Specific.(YL)........U9117.570.65B61FE-V17-C0

  vty0             U9117.570.65B61FE-V17-C0-L0                                   Asynchronous Terminal
  pci6             U7311.D11.655158B-P1                                          PCI Bus

        Device Specific.(YL)........U7311.D11.655158B-P1

  pci14            U7311.D11.655158B-P1                                          PCI Bus

        Device Specific.(YL)........U7311.D11.655158B-P1

  fcs3             U7311.D11.655158B-P1-C6-T1                                    FC Adapter

        Part Number.................03N7069
        EC Level....................A
        Serial Number...............1B64504CA3
        Manufacturer................001B
        Feature Code/Marketing ID...280B
        FRU Number.................. 03N7069
        Device Specific.(ZM)........3
        Network Address.............10000000C95CDDEE
        ROS Level and ID............02881955
        Device Specific.(Z0)........1001206D
        Device Specific.(Z1)........00000000
        Device Specific.(Z2)........00000000
        Device Specific.(Z3)........03000909
        Device Specific.(Z4)........FF801413
        Device Specific.(Z5)........02881955
        Device Specific.(Z6)........06831955
        Device Specific.(Z7)........07831955
        Device Specific.(Z8)........20000000C95CDDEE
        Device Specific.(Z9)........TS1.91A5
        Device Specific.(ZA)........T1D1.91A5
        Device Specific.(ZB)........T2D1.91A5
        Device Specific.(YL)........U7311.D11.655158B-P1-C6-T1

  fcnet3           U7311.D11.655158B-P1-C6-T1                                    Fibre Channel Network Protocol Device
  fscsi3           U7311.D11.655158B-P1-C6-T1                                    FC SCSI I/O Controller Protocol Device
  pci5             U7311.D11.655157B-P1                                          PCI Bus

        Device Specific.(YL)........U7311.D11.655157B-P1


Other Example:
==============

See the differences between 1 and 2:


1. LPAR using only storage via VIO

root@zd110l06:/root#lspv
hdisk0          00cb61fe223c3926                    rootvg          active
hdisk1          00cb61fe2360b1b7                    rootvg          active
hdisk2          00cb61fe3339af9f                    appsvg          active
hdisk3          00cb61fe3339b066                    datavg          active

root@zd110l06:/root#lsdev -Cc disk -s vscsi
hdisk0 Available  Virtual SCSI Disk Drive
hdisk1 Available  Virtual SCSI Disk Drive
hdisk2 Available  Virtual SCSI Disk Drive
hdisk3 Available  Virtual SCSI Disk Drive

root@zd110l06:/root#lsdev -Cc disk
hdisk0 Available  Virtual SCSI Disk Drive
hdisk1 Available  Virtual SCSI Disk Drive
hdisk2 Available  Virtual SCSI Disk Drive
hdisk3 Available  Virtual SCSI Disk Drive

root@zd110l06:/root#lsdev -Cc adapter
ent0   Available       Virtual I/O Ethernet Adapter (l-lan)
ent1   Available 02-08 2-Port 10/100/1000 Base-TX PCI-X Adapter (14108902)
ent2   Available 02-09 2-Port 10/100/1000 Base-TX PCI-X Adapter (14108902)
ent3   Available 03-08 2-Port 10/100/1000 Base-TX PCI-X Adapter (14108902)
ent4   Available 03-09 2-Port 10/100/1000 Base-TX PCI-X Adapter (14108902)
vsa0   Available       LPAR Virtual Serial Adapter
vscsi0 Available       Virtual SCSI Client Adapter
vscsi1 Available       Virtual SCSI Client Adapter
vscsi2 Available       Virtual SCSI Client Adapter
vscsi3 Available       Virtual SCSI Client Adapter
vscsi4 Available       Virtual SCSI Client Adapter
vscsi5 Available       Virtual SCSI Client Adapter


2. LPAR using storage via VIO and dedicated FC cards

root@zd110l01.nl.eu.abnamro.com:/root#lspv
hdisk0          00cb61fe09fe92bd                    rootvg          active
hdisk1          00cb61fe0a47a802                    rootvg          active
hdisk2          00cb61fe336bc95b                    appsvg          active
hdisk3          00cb61fe321664d1                    datavg          active

root@zd110l01.nl.eu.abnamro.com:/root#lsdev -Cc disk -s vscsi
hdisk0 Available  Virtual SCSI Disk Drive
hdisk1 Available  Virtual SCSI Disk Drive

root@zd110l01.nl.eu.abnamro.com:/root#lsdev -Cc disk
hdisk0 Available          Virtual SCSI Disk Drive
hdisk1 Available          Virtual SCSI Disk Drive
hdisk2 Available 02-08-02 SAN Volume Controller MPIO Device
hdisk3 Available 02-08-02 SAN Volume Controller MPIO Device

root@zd110l01.nl.eu.abnamro.com:/root#lsdev -Cc adapter
ent0   Available       Virtual I/O Ethernet Adapter (l-lan)
fcs0   Available 02-08 FC Adapter
fcs1   Available 03-08 FC Adapter
vsa0   Available       LPAR Virtual Serial Adapter
vscsi0 Available       Virtual SCSI Client Adapter
vscsi1 Available       Virtual SCSI Client Adapter


>> More on lsdev on a VIOS:


Purpose

Displays Virtual I/O Server devices and their characteristics.

Syntax
To list devices

lsdev [ -type DeviceType... ] [ -virtual ] [ -field FieldName... ] [ -fmt Delimiter ] [-state State ]

To display information about a specific device:

lsdev { -dev DeviceName | -plc PhysicalLocationCode } [ -child ] [ -field FieldName... ] [ -fmt Delimiter ]

lsdev { -dev DeviceName | -plc PhysicalLocationCode } [ -attr [ Attribute ] | -range Attribute | -slot | -vpd | -parent]

lsdev -vpd

lsdev -slots

Description:

The lsdev command displays information about devices in the Virtual I/O Server. If no flags are specified, 
a list of all devices, both physical and virtual, in the Virtual I/O Server is displayed. 
To list devices, both physical and virtual, of a specific type use the -type DeviceType flag. 
Use the -virtual flag to list only virtual devices. Combining both the -type and -virtual flags 
will list the virtual devices of the specified type.

To display information about a specific device, use the -dev DeviceName or -plc PhysicalLocationCode. 
Use either the -child, -parent, -attr, -range, -slot, or -vpd flag to specify what type of information 
is displayed. If none of these flags are used, the name, status, and description of the device will be displayed.

Using the -vpd flag, without specifying a device, displays platform-specific information for all devices.


Examples

- To list all virtual adapters and display the name and status fields, type: 

# lsdev -type adapter -virtual -field name status

The system displays a message similar to the following: 
name  status

vhost0  Available
vhost1  Available
vhost2  Available
ent6    Available
ent7    Available
ent8    Available
ent9    Available

- To list all devices of type disk and display the name and physical location fields, type: 

# lsdev -type disk -field name physloc

The system displays a message similar to the following: 
name    physloc

hdisk0 U9111.520.10004BA-T15-L5-L0
hdisk1 U9111.520.10004BA-T15-L8-L0
hdisk2 U9111.520.10004BA-T16-L5-L0
hdisk3 U9111.520.10004BA-T16-L8-L0
hdisk4 UTMP0.02E.00004BA-P1-C4-T1-L8-L0
hdisk5 UTMP0.02E.00004BA-P1-C4-T2-L8-L0
hdisk6 UTMP0.02F.00004BA-P1-C8-T2-L8-L0
hdisk7 UTMP0.02F.00004BA-P1-C4-T2-L8-L0
hdisk8 UTMP0.02F.00004BA-P1-C4-T2-L11-L0
vtscsi0 U9111.520.10004BA-V1-C2-L1
vtscsi1 U9111.520.10004BA-V1-C3-L1
vtscsi2 U9111.520.10004BA-V1-C3-L2
vtscsi3 U9111.520.10004BA-V1-C4-L1
vtscsi4 U9111.520.10004BA-V1-C4-L2
vtscsi5 U9111.520.10004BA-V1-C5-L1


- To display the parent of a devices, type: 

# lsdev -dev hdisk0 -parent

The system displays a message similar to the following: 
parent

scsi0

- To display all I/O slots that are not hot-pluggable but can have DLPAR operations performed on them, type: 

# lsdev -slots

The system displays a message similar to the following: 
U787A.001.DNZ00Y1-P1-C1  Logical I/O Slot  pci4 sisscsia0   
U787A.001.DNZ00Y1-P1-T5  Logical I/O Slot  pci3 ent0 ent1   
U787A.001.DNZ00Y1-P1-T7  Logical I/O Slot  pci2 usbhc0 usbhc1   
U9111.520.10DFD8C-V2-C0  Virtual I/O Slot  vsa0   
U9111.520.10DFD8C-V2-C2  Virtual I/O Slot  vhost0   
U9111.520.10DFD8C-V2-C4  Virtual I/O Slot  Unknown


- The client partition accesses its assigned disks through a virtual SCSI client adapter.
The virtual scsi client adapter sees standard scsi devices and LUNs through this virtual adapter.
The commands in the following example show how the disks appear on a AIX 53 partition:

# lsdev -Cc disk -s vscsi
hdisk2 Available Virtual SCSI Disk Drive

# lscfg -vpl hdisk2
hdisk2 111.530.10DDEDC-V3-C5-T1  Virtual SCSI Disk Drive

- To configure an optical device as a virtual SCSI device is the same as configuring
a disk or logical volume into a vscsi device.
Using either a new or previously defined vhost adapter with the client partition, 
run the following command:

# mkvdev -vdev cd0 -vadapter vhost0
vtopt0  Available Virtual Target Device - Optical Media

On the client partition, run the cfmgr command and a cd0 device will be configured for use.
Mounting the CD device is now possible, as is using the mkdvd command.


rmvdev command:
---------------

Purpose
To remove the connection between a physical device and its associated virtual SCSI adapter.

Syntax
rmvdev [ -f ] { -vdev TargetDevice | -vtd VirtualTargetDevice } [-rmlv]

Description
The rmdev command removes the connection between a physical device and its associated virtual SCSI adapter. 
The connection can be identified by specifying the backing (physical) device or the virtual target device.
If the connection is specified by the device name and there are multiple connections between the 
physical device and virtual SCSI adapters and error is returned unless the -f flag is also specified.
If -f is included then all connections associated with the physical device are removed.

If the backing (physical) device is a logical volume and the -rmlv flag is specified, 
then logical volume will be removed as well.

Example:

# rmvdev -dev vhost0 -recursive

Example:

how to remove a dynamically allocated i/o slot in a DLPAR in IBM AIX 
Description

To remove a dynamically allocated I/O slot (must be a desired component) from a partition on a P-series 
IBM server partition:

1) Find the slot you wish to remove from the partition:

# lsslot -c slot
# Slot Description Device(s)
U1.5-P2/Z2 Logical I/O Slot pci15 scsi2 
U1.9-P1-I8 Logical I/O Slot pci13 ent0 
U1.9-P1-I10 Logical I/O Slot pci14 scsi0 scsi1 

In our case, it is pci14.

2) Delete the PCI adapter and all of its children in AIX before removal:

# rmdev -l pci14 -d -R
cd0 deleted
rmt0 deleted
scsi0 deleted
scsi1 deleted
pci14 deleted

3) Now, you can remove the PCI I/O slot device using the HMC:

a) Log in to the HMC

b) Select "Server and Partition", and then "Server Management"

c) Select the appropriate server and then the appropriate partition

d) Right click on the partition name, and then on "Dynamic Logical Partitioning"

e) In the menu, select "Adapters"

f) In the newly created popup, select the task "Remove resource from this partition"

g) Select the appropriate adapter from the list (only desired one will appear)

h) Select the "OK" button

i) You should have a popup window which tells you if it was successful. 


Example

lsslot -c slot; rmdev -l pci14 -d -R 


mkdvd command:
-------------- 

Examples of the mkdvd command:

To generate a bootable system backup to the DVD-R device named /dev/cd1, enter: 

# mkdvd -d /dev/cd1

To generate a system backup to the DVD-R or DVD-RAM device named /dev/cd1, enter: 

# mkdvd -d /dev/cd1

To generate a non-bootable volume group backup of the volume group myvg to /dev/cd1, enter: 

# mkdvd -d /dev/cd1 -v myvg

Note:
All savevg backup images are non-bootable.
To generate a non-bootable system backup, but stop mkdvd before the DVD is created and save 
the final images to the /mydata/my_cd file system, and create the other mkdvd file systems in myvg, enter: 

# mkdvd -B -I /mydata/my_cd -V myvg -S
To create a DVD or DVD that duplicates an existing directory structure 
/mycd/a
/mycd/b/d
/mycd/c/f/g
use the following command:

# mkdvd -r /mycd -d /dev/cd1
After mounting with mount -o ro /dev/cd1 /mnt, cd to /mnt; a find . -print command displays:

./a
./b
./b/d
./c
./c/f
./c/f/g


lparstat command:
-----------------

From the AIX prompt in a lpar, you can enter the lparstat -i command to get a list
of names and resources like, for example, if the partition is capped or uncapped etc..

# lparstat -i


cfgdev command:
---------------

On the VIOS partition, run the "cfgdev" command to rebuild the list of visible devices.
This is neccessary after you have created the partition and have added virtual controllers.

The virtual SCSI server adapters are now available to the VIOS. 
The name of these adapters are vhostx where x is a number assigned by the system.

Use the following command to make sure your adapters are available:

$ lsdev -virtual
name		status		description
ent2		Available	Virtual Ethernet Adapter
vhost0		Available	Virtual SCSI Server Adapter
vhost1		Available	Virtual SCSI Server Adapter
vhost2		Available	Virtual SCSI Server Adapter
vhost3		Available	Virtual SCSI Server Adapter
vsa0		Available	LPAR Virtual Serial Adapter


lspath command:
---------------

lspath Command
Purpose
Displays information about paths to a MultiPath I/O (MPIO) capable device.

Syntax
lspath [ -dev DeviceName ] [ -pdev Parent ] [ -status Status ] [ -conn Connection ] [ -field FieldName ] 
       [ -fmt Delimiter ]

lspath -dev DeviceName -pdev Parent [ -conn Connection ] -lsattr [ -attr Attribute... ]

lspath -dev DeviceName -pdev Parent [ -conn Connection ] -range -attr Attribute

Description
The lspath command displays one of three types of information about paths to an MPIO capable device. It either 
displays the operational status for one or more paths to a single device, or it displays one or more attributes 
for a single path to a single MPIO capable device. The first syntax shown above displays the operational status 
for one or more paths to a particular MPIO capable device. The second syntax displays one or more attributes 
for a single path to a particular MPIO capable device. Finally, the third syntax displays the possible range 
of values for an attribute for a single path to a particular MPIO capable device.

Displaying Path Status with the lspath Command
When displaying path status, the set of paths to display is obtained by searching the device configuration database 
for paths that match the following criteria:

The target device name matches the device specified with the -dev flag. If the -dev flag is not present, then the 
target device is not used in the criteria. 
The parent device name matches the device specified with the -pdev flag. If the -pdev flag is not present, then 
parent is not used in the criteria. 
The connection matches the connection specified with the -conn flag. If the -conn flag is not present, then 
connection is not used in the criteria. 
The path status matches status specified with the -status flag. If the -status flag is not present, the path 
status is not used in the criteria.
If none of the -dev, -pdev, -conn, or -status flags are specified, then all paths known to the system are displayed.

By default, this command will display the information in columnar form. When no flags are specified that qualify 
the paths to display, the format of the output is:

status device  parent
Possible values that can appear for the status column are:

-enabled 
Indicates that the path is configured and operational. It will be considered when paths are selected for IO. 
-disabled 
Indicates that the path is configured, but not currently operational. It has been manually disabled and will 
not be considered when paths are selected for IO. 
-failed 
Indicates that the path is configured, but it has had IO failures that have rendered it unusable. It will not be considered when paths are selected for IO. 
-defined 
Indicates that the path has not been configured into the device driver. 
-missing 
Indicates that the path was defined in a previous boot, but it was not detected in the most recent boot of the system. 
-detected 
Indicates that the path was detected in the most recent boot of the system, but for some reason it was not configured. A path should only have this status during boot and so this status should never appear as a result of the lspath command. 

Displaying Path Attributes with the lspath Command
When displaying attributes for a path, the path must be fully qualified. Multiple attributes for a path can be displayed, but attributes belonging to multiple paths cannot be displayed in a single invocation of the lspath command. Therefore, in addition to the -lsattr, -dev, and -pdev flags, the -conn flags are required to uniquely identify a single path. For example:

if only one path between a device and a specific parent, the -conn flag is not required 
if there are multiple paths between a device and a specific parent, the -conn flag is required
Furthermore, the -status flag is not allowed.

By default, this command will display the information in columnar form.

attribute   value    description         user_settableFlags
-attr Attribute Identifies the specific attribute to list. The 'Attribute' is the name of a path specific attribute. 
 When this flag is provided, only the identified attribute is displayed. Multiple instances of this flag may be 
 used to list multiple attributes. If this flag is not specified at all, all attributes associated with the 
 identified path will be listed. 
-lsattr Displays the attribute names, current values, descriptions, and user-settable flag values for a specific path. 
-dev Name Specifies the logical device name of the target device whose path information is to be displayed. 
-field FieldNames Specifies the list of fields to display. The following fields are supported: 
status 
Status of the path 
name 
Name of the device 
parent 
Name of the parent device 
conn 
Path connection.  
-fmt Delimiter Specifies a delimiter character to separate output fields. 
-pdev Parent Indicates the logical device name of the parent device of the path(s) to be displayed. 
-range Displays the legal values for an attribute name. The -range flag displays the list attribute values in a vertical column as follows: 
Value1
Value2
.
.
ValueN
The -range flag displays the range attribute values as x...n(+i) where x is the start of the range, n is the end of the range, and i is the increment. 
-status Status The -status Status flag indicates the status to use in qualifying the paths to be displayed. When displaying path information, the allowable values for this flag are: 
enabled 
Display paths that are enabled for MPIO path selection. 
disabled 
Display paths that are disabled from MPIO path selection. 
failed 
Display paths that are failed due to IO errors. 
available 
Display paths whose path_status is PATH_AVAILABLE (that is, paths that are configured in the system, includes enabled, disabled, and failed paths). 
defined 
Display paths whose path_status is PATH_DEFINED. 
missing 
Display paths whose path_status is PATH_MISSING.  
-conn Connection Indicates the connection information to use in qualifying the paths to be displayed. 

Exit Status
Return code Description 
1 Invalid status value. 

Examples:

To display, without column headers, the set of paths whose operational status is disabled, enter: 

# lspath -status disabled

The system will display a message similar to the following: 

disabled  hdisk1   scsi1 
disabled  hdisk2   scsi1 
disabled  hdisk23  scsi8 
disabled  hdisk25  scsi8

To display the set of paths whose operational status is failed, enter: 

# lspath -status failed

The system will display a message similar to the following: 
failed  hdisk1   scsi1 
failed  hdisk2   scsi1 
failed  hdisk23  scsi8 
failed  hdisk25  scsi8

If the target device is a SCSI disk, to display all attributes for the path to parent scsi0 at connection 5,0, 
use the command: 

# lspath -dev hdisk10 -pdev scsi0 -conn "5,0" -lsattr

The system will display a message similar to the following: 

weight     1      Order of path failover selection  true


To display the status of all paths to hdisk1 with column headers and I/O counts, type: 

# lspath -l hdisk1 -H

The system displays a message similar to the following: 

STATUS          PARENT          CONNECTION
enabled   (4)   scsi0           5,0
disabled  (0)   scsi1           5,0
missing         scsi2           5,0


To display without column headers, the set of paths whose operational status is disabled, type: 

# lspath -s disabled

The system displays a message similar to the following: 

hdisk1        scsi1        5, 0
hdisk2        scsi1        6, 0
hdisk23       scsi8        3, 0
hdisk25       scsi8        4, 0


chpath command:
---------------

chpath Command
Purpose
Changes the operational status of paths to an MultiPath I/O (MPIO) capable device, or changes an attribute 
associated with a path to an MPIO capable device.

Syntax
chpath -l Name -s OpStatus [ -p Parent ] [ -w Connection ]

chpath -l Name -p Parent [ -w Connection ] [ -P ] -a Attribute=Value [ -a Attribute=Value ... ]

chpath -h

Description
The chpath command either changes the operational status of paths to the specified device (the -l Name flag) 
or it changes one, or more, attributes associated with a specific path to the specified device. The required syntax 
is slightly different depending upon the change being made.

The first syntax shown above changes the operational status of one or more paths to a specific device. 
The set of paths to change is obtained by taking the set of paths which match the following criteria:

The target device matches the specified device. 
The parent device matches the specified parent (-p Parent), if a parent is specified. 
The connection matches the specified connection (-w Connection), if a connection is specified. 
The path status is PATH_AVAILABLE.
The operational status of a path refers to the usage of the path as part of MPIO path selection. The value of enable indicates that the path is to be used while disable indicates that the path is not to be used. It should be noted that setting a path to disable impacts future I/O, not I/O already in progress. As such, a path can be disabled, but still have outstanding I/O until such time that all of the I/O that was already in progress completes. As such, if -s disable is specified for a path and I/O is outstanding on the path, this fact will be output.

Disabling a path affects path selection at the device driver level. The path_status of the path is not changed in the device configuration database. The lspath command must be used to see current operational status of a path.

The second syntax shown above changes one or more path specific attributes associated with a particular path to a particular device. Note that multiple attributes can be changed in a single invocation of the chpath command; but all of the attributes must be associated with a single path. In other words, you cannot change attributes across multiple paths in a single invocation of the chpath command. To change attributes across multiple paths, separate invocations of chpath are required; one for each of the paths that are to be changed.

Flags
-a Attribute=Value Identifies the attribute to change as well as the new value for the attribute. The Attribute is the name of a path specific attribute. The Value is the value which is to replace the current value for the Attribute. More than one instance of the -a Attribute=Value can be specified in order to change more than one attribute. 
-h Displays the command usage message. 
-l Name Specifies the logical device name of the target device for the path(s) affected by the change. This flag is required in all cases. 
-p Parent Indicates the logical device name of the parent device to use in qualifying the paths to be changed. This flag is required when changing attributes, but is optional when change operational status. 
-P Changes the path's characteristics permanently in the ODM object class without actually changing the path. The change takes affect on the path the next time the path is unconfigured and then configured (possibly on the next boot). 
-w Connection Indicates the connection information to use in qualifying the paths to be changed. This flag is optional when changing operational status. When changing attributes, it is optional if the device has only one path to the indicated parent. If there are multiple paths from the parent to the device, then this flag is required to identify the specific path being changed. 
-s OpStatus Indicates the operational status to which the indicated paths should be changed. The operational status of a path is maintained at the device driver level. It determines if the path will be considered when performing path selection.The allowable values for this flag are: 
enable 
Mark the operational status as enabled for MPIO path selection. A path with this status will be considered for use when performing path selection. Note that enabling a path is the only way to recover a path from a failed condition. 
disable 
Mark the operational status as disabled for MPIO path selection. A path with this status will not be considered for use when performing path selection. 
This flag is required when changing operational status. When used in conjunction with the -a Attribute=Value flag, a usage error is generated. 

Security
Privilege Control: Only the root user and members of the system group have execute access to this command.

Auditing Events:

Event Information 
DEV_Change The chpath command line. 

Examples

To disable the paths between scsi0 and the hdisk1 disk device, enter: 

# chpath -l hdisk1 -p scsi0 -s disable

The system displays a message similar to one of the following: 

paths disabled
or 
some paths enabled

The first message indicates that all PATH_AVAILABLE paths from scsi0 to hdisk1 have been successfully enabled. 
The second message indicates that only some of the PATH_AVAILABLE paths from scsi0 to hdisk1 have been 
successfully disabled.


59.5 Example of usage of virtualization commands:
-------------------------------------------------


Suppose we have the following lpar:

# uname -L
12 zd110l12

# oslevel -r
5300-05

# lsdev -Cc disk
hdisk0 Available          Virtual SCSI Disk Drive
hdisk1 Available          Virtual SCSI Disk Drive <--------------------------------------
hdisk2 Available 02-08-02 SAN Volume Controller MPIO Device                            |
hdisk3 Available 02-08-02 SAN Volume Controller MPIO Device                            |
                                                                                       |
# lsdev -Cc disk -s vscsi                                                              |
hdisk0 Available  Virtual SCSI Disk Drive                                              |
hdisk1 Available  Virtual SCSI Disk Drive                                              |
                                                                                       |
# lscfg -vpl hdisk1                                                                    |
  hdisk1        U9117.570.65B61FE-V12-C6-T1-L810000000000  Virtual SCSI Disk Drive <----
                                                                                       |
# lsslot -c slot                                                                       |
# Slot                    Description       Device(s)                                  |
U7879.001.DQDTZXG-P1-C2   Logical I/O Slot  pci2 fcs0                                  |
U7879.001.DQDTPAK-P1-C6   Logical I/O Slot  pci3 fcs1                                  |
U9117.570.65B61FE-V12-C0  Virtual I/O Slot  vsa0                                       |
U9117.570.65B61FE-V12-C2  Virtual I/O Slot  ent0                                       |
U9117.570.65B61FE-V12-C5  Virtual I/O Slot  vscsi0                                     |
U9117.570.65B61FE-V12-C6  Virtual I/O Slot  vscsi1 <------------------------------------

#lscfg -vpl hdisk3
  hdisk3           U7879.001.DQDTZXG-P1-C2-T1-W50050768013029E5-L1000000000000  SAN Volume Controller MPIO Device

        Manufacturer................IBM
        Machine Type and Model......2145
        ROS Level and ID............0000
        Device Specific.(Z0)........0000043268101002
        Device Specific.(Z1)........0200640
        Serial Number...............600507680190014E3000000000000199   (LUN)


  PLATFORM SPECIFIC

  Name:  disk
    Node:  disk
    Device Type:  block


59.6 HMC commands:
==================


HMC commands:
lssyscfg	List the hardware resource configuration
mksyscfg	Creates the hardware resource configuration
chsyscfg	Changes the hardware resource configuration
rmsyscfg	Removes the hardware resource configuration

Example:

$ lssyscfg -r sys --all -z
name=ITSO_p690
state=Ready
model=7040-681
serial_number=021768A
..
..


Detail on lssyscfg:
-------------------


NAME 

lssyscfg - list system resources 

SYNOPSIS 

lssyscfg -r {lpar | prof | sys | sysprof | cage | frame} [-m managed-system | -e managed-frame] 
            [--filter "filter-data"] [-F [attribute-names] [--header]] [--help] 

DESCRIPTION 

lssyscfg lists the attributes of partitions, partition profiles, or system profiles for the managed-system. 
It can also list the attributes of the managed-system, and of all of the systems managed by this 
Hardware Management Console (HMC).

lssyscfg can also list the attributes of cages in the managed-frame, the attributes of the managed-frame, 
or the attributes of all of the frames managed by this HMC.

OPTIONS 

-r 
The type of resources to list. Valid values are lpar for partitions, prof for partition profiles, sys for 
managed systems, sysprof for system profiles, cage for managed frame cages, and frame for managed frames. 

-m 
The name of either the managed system to list, or the managed system which has the system resources to list. 
The name may either be the user-defined name for the managed system, or be in the form tttt-mmm*ssssssss, 
where tttt is the machine type, mmm is the model, and ssssssss is the serial number of the managed system. 
The tttt-mmm*ssssssss form must be used if there are multiple managed systems with the same user-defined name. 
This option is required when listing partitions, partition profiles, or system profiles. This option is optional 
when listing managed systems, and if it is omitted, then all of the systems managed by this HMC will be listed. 
This option is not valid when listing managed frame cages or managed frames. 

-e 
The name of either the managed frame to list, or the managed frame which contains the cages to list. 
The name may either be the user-defined name for the managed frame, or be in the form ttttmmm* ssssssss, 
where tttt is the type, mmm is the model, and ssssssss is the serial number of the managed frame. 
The tttt-mmm*ssssssss form must be used if there are multiple managed frames with the same user-defined name. 
This option is required when listing managed frame cages. This option is optional when listing managed frames, 
and if it is omitted, then all of the frames managed by this HMC will be listed. This option is not valid when 
listing partitions, partition profiles, system profiles, or managed systems. 

--filter 
The filter(s) to apply to the resources to be listed. Filters are used to select which resources of the specified 
resource type are to be listed. If no filters are used, then all of the resources of the specified resource type 
will be listed. For example, specific partitions can be listed by using a filter to specify the names or IDs of the partitions to list. Otherwise, if no filter is used, then all of the partitions in the managed system will be listed. The filter data consists of filter name/value pairs, which are in comma separated value (CSV) format. The filter data must be enclosed in double quotes. 
The format of the filter data is as follows:

"filter-name=value,filter-name=value,..."

Note that certain filters accept a comma separated list of values, as follows:

""filter-name=value,value,...",..."

When a list of values is specified, the filter name/value pair must be enclosed in double quotes. Depending on the 
shell being used, nested double quote characters may need to be preceded by an escape character, which is usually 
a '\' character. Unless otherwise indicated, multiple values can be specified for each filter.

Valid filter names for partitions:

lpar_names | lpar_ids | work_groups

Only one of these three filters may be specified.

Valid filter names for partition profiles:

lpar_names | lpar_ids

Either the name or the ID of the partition which has the partition profiles to be listed must be specified. 
Only one partition name or ID can be specified.

profile_names

Valid filter names for system profiles:

profile_names

This option is required when listing partition profiles. This option is not valid when listing managed systems, 
managed frame cages, or managed frames.

-F 
A delimiter separated list of attribute names for the desired attribute values to be displayed for each resource. 
If no attribute names are specified, then values for all of the attributes for the resource will be displayed. 
When this option is specified, only attribute values will be displayed. No attribute names will be displayed. 
The attribute values displayed will be separated by the delimiter which was specified with this option. 
This option is useful when only attribute values are desired to be displayed, or when the values of only 
selected attributes are desired to be displayed. 

--header 
Display a header record, which is a delimiter separated list of attribute names for the attribute values that 
will be displayed. This header record will be the first record displayed. This option is only valid when used 
with the -F option. 

--help 
Display the help text for this command and exit. 

EXAMPLES 

List all systems managed by this HMC:

lssyscfg -r sys

List only the user-defined name, machine type and model, and serial number for all of the systems managed by this HMC, 
and separate the output values with a colon:

lssyscfg -r sys -F name:type_model:serial_num

List the managed system system1:

lssyscfg -r sys -m system1

List all partitions in the managed system, and only display attribute values for each partition, following a header 
of attribute names:

lssyscfg -r lpar -m 9406-570*12345678 -F --header

List the partitions lpar1, lpar2, and lpar3:

lssyscfg -r lpar -m system1 --filter ""lpar_names=lpar1, lpar2,lpar3""

List only the names, IDs, and states of partitions lpar1, lpar2, and lpar3, and separate the output values 
with a comma:

lssyscfg -r lpar -m system1 --filter ""lpar_names=lpar1, lpar2,lpar3"" -F name,lpar_id,state

List all partition profiles defined for partition lpar2:

lssyscfg -r prof -m system1 --filter "lpar_names=lpar2"

List the partition profiles prof1 and prof2 defined for the partition that has an ID of 2:

lssyscfg -r prof -m system1 --filter "lpar_ids=2, "profile_names=prof1,prof2""

List all system profiles defined for the managed system:

lssyscfg -r sysprof -m 9406-520*100128A

List the system profile sysprof1:

lssyscfg -r sysprof -m system1 --filter "profile_names= sysprof1"

List all frames managed by this HMC:

lssyscfg -r frame

List the managed frame myFrame:

lssyscfg -r frame -e myFrame

List all cages in the managed frame:

lssyscfg -r cage -e 9119-59*000012C


Power 4 HMC Commands:
---------------------

To view partition state: 

get_partition_state 
To pop a hung partiton into the debugger (aka 'soft reset'): 

reset_partition -m <machine> -p <partition> -t soft 
To force a reboot of a hung partition (aka 'hard reset'): 

reset_partition -m <machine> -p <partition> -t hard 
To force a reboot of a "full system partition" (i.e. a system that is not partitioned) : 

chsysstate -r sys -n <machine> -o off --immed --restart 
To start a partition: 

start_partition -p <partition> -f <profile name> -m <machine> 
To get a listing of boot profiles: 

query_profile_names -m <machine> -p <partition> 


Power 5 HMC Commands:
---------------------

Viewing system state
To list all of the HMC-managed systems, issue the command: 

lssyscfg -r sys 

To list only the "name" field of all of the systems, use the -F flag, together with the name of the field 
(in this case, name): 

lssyscfg -r sys -F name 

To see system state for only a single system, issue: 

lssyscfg -r sys -m <machine> 

The above may be combined with the -F flag as well, to list only one attribute for one machine. 

To view the partition state, issue the command: 

lssyscfg -r lpar -m <machine> 

To see just the names and state: 

lssyscfg -r lpar -m <machine> -F name,state --header 

All frames managed by the HMC may be listed as: 

lssyscfg -r frame 

All cages in a frame may be listed by: 

lssyscfg -r cage -e <frame-name> 
Cages may be processors (cpu memory pci slots), and are identified as contents=sys, or they may be I/O drawers, 
and are identified as contents=io. 

To view the various profiles a partition can be booted into: 

lssyscfg -r prof -m <machine> --filter lpar_names=<partition> 


Changing system state, rebooting:
---------------------------------

To power on an lpar with a profile: 

chsysstate -m <machine> -o on -r lpar -n <lpar name> -f <profile> 
i.e. for example: 

chsysstate -m alpha -o on -r lpar -n alpha-lp1 -f default 
To power on a whole machine (CEC): 

chsysstate -m alpha -o on -r sys 
Etc. chsysstate, lssyscfg and other commands have good explanations if they're run without arguments. 

Issuing a 'soft reset', to push a hung machine into KDB/XMON, is not obvious. The magic incantation is: 

chsysstate -r lpar -m <machine> -n <partition> -o dumprestart 
To issue a 'hard reset', to turn off a partition, no matter what: 

chsysstate -r lpar -m <machine> -n <partition> -o shutdown --immed --restart 


Controlling virtual cpus:
-------------------------

To add one virtual CPU: (note these use -p instead of -n for the partition name) 

chhwres -r proc -m <machine> -p <partition> -o a --procs 1 
To add one-tenth of a cpu processing entitlement: 

chhwres -r proc -m <machine> -p <partition> --procunits 0.1 
To see nice report of: MACHINE,LPAR,PROFILE,STATE: 

lssyscfg -r sys -F name | while read mngsys; do lssyscfg -r lpar -F name,curr_profile,state -m $mngsys | 
sed "s/^/$mngsys,/"; done 


chhwres command:
----------------

This command you should use from the VIOS.


NAME 

chhwres - change hardware resources 

SYNOPSIS 

To add, remove, or move a physical I/O slot:

chhwres -r io -m managed-system -o {a | r | m} 
{-p partition-name | --id partition-ID} 
[{-t target-partition-name | 
--tid target-partition-ID}] 
-l slot-DRC-index [-a "attributes"] 
[-w wait-time] [-d detail-level] [--force]

To set physical I/O attributes:

chhwres -r io -m managed-system -o s 
{-p partition-name | --id partition-ID} 
--rsubtype {iopool | taggedio} 
-a "attributes"

To add or remove a virtual I/O adapter:

chhwres -r virtualio -m managed-system -o {a | r} 
{-p partition-name | --id partition-ID} 
[--rsubtype {eth | scsi | serial}] 
[-s virtual-slot-number] [-a "attributes"] 
[-w wait-time] [-d detail-level] [--force]

To set virtual I/O attributes:

chhwres -r virtualio -m managed-system -o s 
[{-p partition-name | --id partition-ID}] 
--rsubtype {eth | hsl | virtualopti} 
-a "attributes"

To add, remove, or move memory:

chhwres -r mem -m managed-system -o {a | r | m} 
{-p partition-name | --id partition-ID} 
[{-t target-partition-name | 
--tid target-partition-ID}] 
-q quantity 
[-w wait-time] [-d detail-level] [--force]

To set memory attributes:

chhwres -r mem -m managed-system -o s -a "attributes"

To add, remove, or move processing resources:

chhwres -r proc -m managed-system -o {a | r | m} 
{-p partition-name | --id partition-ID} 
[{-t target-partition-name | 
--tid target-partition-ID}] 
[--procs quantity] [--procunits quantity] 
[--5250cpwpercent percentage] 
[-w wait-time] [-d detail-level] [--force]

To set processing attributes:

chhwres -r proc -m managed-system -o s 
{-p partition-name | --id partition-ID} 
-a "attributes" 

DESCRIPTION 

chhwres changes the hardware resource configuration of the managed-system. chhwres is used to perform 
dynamic logical partitioning (DLPAR) operations. 

OPTIONS 

-r 

The type of hardware resources to change. Valid values are io for physical I/O, virtualio for virtual I/O, 
mem for memory, and proc for processing resources. 

--rsubtype 

The subtype of hardware resources to change. Valid physical I/O resource subtypes are slot for I/O slots, 
iopool for I/O pools, and taggedio for tagged I/O resources. Valid virtual I/O resource subtypes are eth 
for virtual ethernet, scsi for virtual SCSI, serial for virtual serial, hsl for High Speed Link (HSL) 
OptiConnect, and virtualopti for virtual OptiConnect resources. 
This option is required for physical I/O or virtual I/O set operations, and for virtual I/O add operations.

This option is not valid for memory or processor operations. 

-m 

The name of the managed system for which the hardware resource configuration is to be changed. 
The name may either be the user-defined name for the managed system, or be in the form tttt-mmm*ssssssss, 
where tttt is the machine type, mmm is the model, and ssssssss is the serial number of the managed system. 
The tttt-mmm*ssssssss form must be used if there are multiple managed systems with the same user-defined name. 

-o 

The operation to perform. Valid values are a to add hardware resources to a partition, r to remove hardware 
resources from a partition, m to move hardware resources from one partition to another, and s to set 
hardware resource related attributes for a partition or the managed-system. 

-p 

The name of the partition for which the operation is to be performed. For a move operation, this is the source 
partition (the partition the resources will be moved from) for the operation. To perform an add, remove, 
or move operation, the partition must be in the running state. 
You can either use this option to specify the name of the partition for which the operation is to be performed, 
or use the --id option to specify the partition's ID. The -p and the --id options are mutually exclusive.

A partition is required to be specified with this option or the --id option for all operations except 
a virtual ethernet or memory set operation. 

--id 

The ID of the partition for which the operation is to be performed. For a move operation, this is the source 
partition (the partition the resources will be moved from) for the operation. To perform an add, remove, 
or move operation, the partition must be in the running state. 
You can either use this option to specify the ID of the partition for which the operation is to be performed, 
or use the -p option to specify the partition's name. The --id and the -p options are mutually exclusive.

A partition is required to be specified with this option or the -p option for all operations except a virtual 
ethernet or memory set operation. 

-t 

The name of the target partition for a move operation. The partition must be in the running state. 
You can either use this option to specify the name of the target partition, or use the --tid option to specify 
the ID of the partition. The -t and the --tid options are mutually exclusive.

A target partition is required to be specified with this option or the --tid option for a move operation. 
This option is not valid for any other operation.

--tid 

The ID of the target partition for a move operation. The partition must be in the running state. 
You can either use this option to specify the ID of the target partition, or use the -t option to specify 
the name of the target partition. The --tid and the -t options are mutually exclusive.

A target partition is required to be specified with this option or the -t option for a move operation. 
This option is not valid for any other operation.

-l 

The DRC index of the physical I/O slot to add, remove, or move. 

-s 

The virtual slot number of the virtual I/O adapter to add or remove. 
When adding a virtual I/O adapter, if this option is not specified then the next available virtual slot number 
will be assigned to the virtual I/O adapter.

When removing a virtual I/O adapter, this option is required. 

-q 

The quantity of memory to add, remove, or move. The quantity specified must be in megabytes, it must be a multiple 
of the memory region size for the managed-system, and it must be greater than 0. 

--procs 

When adding or removing processing resources to or from a partition using dedicated processors, or when moving 
processing resources from a partition using dedicated processors to another partition using dedicated processors, 
use this option to specify the quantity of dedicated processors to add, remove, or move. 
When adding or removing processing resources to or from a partition using shared processors, or when moving processing 
resources from a partition using shared processors to another partition using shared processors, use this option 
to specify the quantity of virtual processors to add, remove, or move.

When moving processing resources from a partition using dedicated processors to a partition using shared processors, 
use this option to specify the quantity of dedicated processors to be moved from the source partition and added 
as shared processors to the target partition.

This option is not valid when moving processing resources from a partition using shared processors to a partition 
using dedicated processors. The --procunits option must be used instead.

The quantity of processing resources specified with this option must be a whole number greater than 0. 

--procunits 

When adding or removing processing resources to or from a partition using shared processors, or when moving 
processing resources from a partition using shared processors to another partition using shared processors, 
use this option to specify the quantity of processing units to add, remove, or move. 
When moving processing resources from a partition using shared processors to a partition using dedicated processors, 
use this option to specify the quantity of shared processors to be moved from the source partition and added as 
dedicated processors to the target partition.

This option is not valid when moving processing resources from a partition using dedicated processors to a partition 
using shared processors. The --procs option must be used instead.

When moving processing resources from a partition using shared processors to a partition using dedicated 
processors, the quantity of processing units specified with this option must be a whole number. Otherwise, 
the quantity of processing units specified with this option can have up to 2 decimal places. In either case, 
the quantity specified must be greater than 0. 

--5250cpwpercent 

The percentage of 5250 Commercial Processing Workload (CPW) to add, remove, or move. The percentage specified 
can have up to 2 decimal places, and it must be greater than 0. 
This option is only valid for i5/OS(TM) partitions and can only be used when the managed-system supports 
the assignment of 5250 CPW percentages to partitions. 

-w 

The elapsed time, in minutes, after which an add, remove, or move operation will be aborted. 
wait-time must be a whole number. If wait-time is 0, the operation will not be timed out.

If this option is not specified, a default value of 5 minutes is used.

This option is valid for all add, remove, and move operations for AIX(R), Linux(TM), and virtual I/O server 
partitions. This option is also valid for memory add, remove, and move operations for i5/OS partitions. 

-d 

The level of detail to be displayed upon return of an add, remove, or move operation. Valid values are 0 (none) 
through 5 (highest). 
If this option is not specified, a default value of 0 is used.

This option is valid for all add, remove, and move operations for AIX, Linux, and virtual I/O server partitions. 

--force 

This option allows you to force a remove or move operation to be performed for a physical I/O slot that is currently 
in use (varied on) by an i5/OS partition. 
This option also allows you to force an add, remove, or move operation to be performed for an AIX, Linux, 
or virtual I/O server partition that does not have an RMC connection to the HMC. If this command completes 
successfully, you will need to restart your operating system for the change to take affect. You should only use 
this option if you intentionally configured your LAN to isolate the HMC from the operating system of your partition. 

-a 

The configuration data needed to create virtual I/O adapters or set hardware resource related attributes. 
The configuration data consists of attribute name/value pairs, which are in comma separated value (CSV) format. 
The configuration data must be enclosed in double quotes. 

The format of the configuration data is as follows:

attribute-name=value,attribute-name=value,...

Note that certain attributes accept a comma separated list of values, as follows:

"attribute-name=value,value,...",...

When a list of values is specified, the attribute name/value pair must be enclosed in double quotes. Depending on 
the shell being used, nested double quote characters may need to be preceded by an escape character, 
which is usually a '\' character.

If '+=' is used in the attribute name/value pair instead of '=', then the specified value is added to the existing 
value for the attribute if the attribute is numerical. If the attribute is a list, then the specified value(s) 
is added to the existing list. 

If '-=' is used in the attribute name/value pair instead of '=', then the specified value is subtracted from 
the existing value for the attribute if the attribute is numerical. If the attribute is a list, then the 
specified value(s) is deleted from the existing list. 

Valid attribute names for attributes that can be set when adding, removing, or moving a physical I/O slot: 

slot_io_pool_id 
Valid attribute names for setting I/O pool attributes: 

lpar_io_pool_ids 
comma separated 
Valid attribute names for setting tagged I/O resources (i5/OS partitions only): 

load_source_slot 
DRC index of I/O slot, or virtual slot number 
alt_restart_device_slot 
DRC index of I/O slot, or virtual slot number 
console_slot 
DRC index of I/O slot, virtual slot number, or the value hmc 
alt_console_slot 
DRC index of I/O slot, or virtual slot number 
op_console_slot 
DRC index of I/O slot, or virtual slot number 
Valid attribute names for adding a virtual ethernet adapter: 

ieee_virtual_eth 
Valid values: 
0 - not IEEE 802.1Q compatible 
1 - IEEE 802.1Q compatible 

Required 
port_vlan_id 
Required 
addl_vlan_ids 
is_trunk 
Valid values: 
0 - no 
1 - yes 
trunk_priority 
Valid values are integers between 1 and 15, inclusive 
Required for a trunk adapter 
Valid attribute names for adding a virtual SCSI adapter: 

adapter_type 
Valid values are client or server (server adapters can only be added to i5/OS partitions on IBM(R) 
eServer(TM) i5 servers, or virtual I/O server partitions) 
Required 
remote_lpar_id | remote_lpar_name 
One of these attributes is required for a client adapter 
remote_slot_num 
Required for a client adapter 
Valid attribute names for adding a virtual serial adapter: 

adapter_type 
Valid values are client or server (client adapters cannot be added to i5/OS partitions on IBM System p5 or 
eServer p5 servers, and server adapters can only be added to i5/OS or virtual I/O server partitions) 
Required 
remote_lpar_id | remote_lpar_name 
One of these attributes is required for a client adapter 
remote_slot_num 
Required for a client adapter 
supports_hmc 
The only valid value is 0 for no 
Valid attribute names for setting virtual ethernet attributes: 

mac_prefix
Valid attribute names for setting HSL OptiConnect attributes (i5/OS partitions only): 

hsl_pool_id 
Valid values are: 
0 - HSL OptiConnect is disabled 
1 - HSL OptiConnect is enabled 
Valid attribute names for setting virtual OptiConnect attributes (i5/OS partitions only): 

virtual_opti_pool_id 
Valid values are: 
0 - virtual OptiConnect is disabled 
1 - virtual OptiConnect is enabled 
Valid attribute names for setting memory attributes: 

requested_num_sys_huge_pages 
Valid attribute names for setting processing attributes: 

sharing_mode 
Valid values are: 
keep_idle_procs - valid with dedicated processors 
share_idle_procs - valid with dedicated processors 
cap - valid with shared processors 
uncap - valid with shared processors 
uncap_weight 

--help 

Display the help text for this command and exit. 


EXAMPLES 

Add the I/O slot with DRC index 21010001 to partition p1 and set the I/O pool ID for the slot to 3:

chhwres -r io -m sys1 -o a -p p1 -l 21010001 
-a "slot_io_pool_id=3"

Add I/O pools 2 and 3 to the I/O pools in which partition p1 is participating:

chhwres -r io --rsubtype iopool -m 9406-520*1234321A -o s 
-p p1 -a ""lpar_io_pool_ids+=2,3""

Add a virtual ethernet adapter to the partition with ID 3:

chhwres -r virtualio -m 9406-520*1234321A -o a --id 3 
--rsubtype eth -a "ieee_virtual_eth=1, 
port_vlan_id=4,"addl_vlan_ids=5,6",is_trunk=1, 
trunk_priority=1"

Remove the virtual adapter in slot 3 from partition p1:

chhwres -r virtualio -m sys1 -o r -p p1 -s 3

Enable HSL OptiConnect for the i5/OS partition i5_p1:

chhwres -r virtualio -m sys1 -o s -p i5_p1 
--rsubtype hsl -a "hsl_pool_id=1"

Add 128 MB of memory to the partition with ID 1, and time out after 10 minutes:

chhwres -r mem -m sys1 -o a --id 1 -q 128 -w 10

Remove 512 MB of memory from the AIX partition aix_p1, return a detail level of 5:

chhwres -r mem -m 9406-520*1234321A -o r -p aix_p1 -q 512 
-d 5

Set the number of pages of huge page memory requested for the managed system to 2 (the managed system must be 
powered off):

chhwres -r mem -m sys1 -o s -a "requested_num_sys_huge_pages=2"

Move 1 processor from partition p1 to partition p2 (both partitions are using dedicated processors):

chhwres -r proc -m 9406-520*1234321A -o m -p p1 -t p2 
--procs 1

Move .5 processing units from the partition with ID 1 to the partition with ID 2 (both partitions are using 
shared processors):

chhwres -r proc -m sys1 -o m --id 1 --tid 2 --procunits .5

Add .25 processing units to the i5/OS partition i5_p1 and add 10 percent 5250 CPW:

chhwres -r proc -m sys1 -o a -p i5_p1 --procunits .25 
--5250cpwpercent 10


lshwres command:
----------------

lshwres -m "managed-system" [-p "partition-name" | -all ] -r [ resource-type ] [ -y "led-type" ] 
                            [ -F < format > ] [-help 

List system level memory information and include the minimum memory required to support a maximum of 1024 MB: 

# lshwres -r mem --level sys --maxmem 1024

List all memory information for partitions lpar1 and lpar2, and only display attribute values, following a 
header of attribute names: 

# lshwres -r mem --level lpar --filter "\"lpar_names=lpar1,lpar2\"" -F --header

List all I/O units on the system: 

# lshwres -r io --rsubtype unit

List all virtual Ethernet adapters on the managed system: 

# lshwres -r virtualio --rsubtype eth --level lpar

List all virtual slots for partition lpar1: 

# lshwres -r virtualio --rsubtype slot --level slot --filter "lpar_names=lpar1"

List only the installed and configurable processors on the system: 

# lshwres -r proc --level sys -F installed_sys_proc_units,configurable_sys_proc_units


lpar_netboot command:
---------------------

NAME 

lpar_netboot - retrieve MAC address and physical location code from network adapters for a partition or 
instruct a partition to network boot 

SYNOPSIS 

-- To retrieve MAC address and physical location code:

lpar_netboot -M -n [-v] [-x] [-f] [-i] [-A] -t ent [-D -s speed -d duplex -S server -G gateway -C client] 
             partition-name partition-profile managed-system

-- To perform network boot:

lpar_netboot [-v] [-x] [-f] [-i] [-g args] [{-A -D | [-D] -l physical-location-code | [-D] -m MAC-address}] 
             -t ent -s speed -d duplex -S server -G gateway -C client partition-name partition-profile managed-system

-- To retrieve MAC address and physical location code on a system supporting a full system partition:

lpar_netboot -M -n [-v] [-x] [-f] [-i] [-A] -t ent [-D -s speed -d duplex -S server -G gateway -C client] 
             managed-system managed-system

-- To perform network boot on a system supporting a full system partition:

lpar_netboot [-v] [-x] [-f] [-i] [-g args] [{-A -D | [-D] -l physical-location-code | [-D] -m MAC-address}] 
             -t ent -s speed -d duplex -S server -G gateway -C client managed-system managed-system

DESCRIPTION 

lpar_netboot instructs a logical partition to network boot by having it send out a bootp request to a server 
specified with the -S option. The server can be an AIX(R) NIM server serving SPOT resources or any server 
serving network boot images. If specified with the -M and -n options, lpar_netboot will return the 
Media Access Control (MAC) address and the physical location code for a network adapter of the type specified 
with the -t option. When the -m option is specified, lpar_netboot will boot a partition using the network adapter 
which has the specified MAC address. When the -l option is specified, lpar_netboot will boot a partition using 
the network adapter which has the specified physical location code. The MAC address and physical location code 
of a network adapter is dependent upon the hardware resource allocation in the partition profile the partition 
was booted with. The lpar_netboot command requires arguments for partition name, partition profile, and the name 
of the managed system which has the partition. 

OPTIONS 

-A 

Return all adapters of the type specified with the -t option. 
-C 

The IP address of the partition to network boot. 
-D 

Perform a ping test and use the adapter that successfully pings the server specified with the -S option. 
-G 

The gateway IP address of the partition specified with the -C option. 
-M 

Discover network adapter MAC address and physical location code. 
-S 

The IP address of the machine from which to retrieve the network boot image during network boot. 
-d 

The duplex setting of the partition specified with the -C option. Valid values are full, half, and auto. 
-f 

Force close the virtual terminal session for the partition. 
-g 

Specify generic arguments for booting the partition. 
-i 

Force immediate shutdown of the partition. If this option is not specified, a delayed shutdown will be performed. 
-l 

The physical location code of the network adapter to use for network boot. 
-m 

The MAC address of the network adapter to use for network boot. 
-n 

Instruct the partition to not network boot. 
-s 

The speed setting of the partition specified with the -C option. Valid values are 10, 100, 1000, and auto. 
-t 

The type of adapter for MAC address or physical location code discovery or for network boot. The only valid value is 
ent for ethernet. 
-v 

Display additional information during command execution. 
-x 

Display debug output during command execution. 
partition-name

The name of the partition.

partition-profile

The name of the partition profile.

managed-system

The name of the managed system which has the partition.

--help 

Display the help text for this command and exit. 

EXAMPLES 

To retrieve the MAC address and physical location code for partition machA with partition profile machA_prof 
on managed system test_sys:

$ lpar_netboot -M -n -t ent "machA" "machA_prof" "test_sys"

To network boot the partition machA with partition profile machA_prof on managed system test_sys:

$ lpar_netboot -t ent -s auto -d auto -S 9.3.6.49 -G 9.3.6.1 -C 9.3.6.234 "machA" "machA_prof" "test_sys"

To network boot the partition machA using the network adapter with a MAC address of 00:09:6b:dd:02:e8 with 
partition profile machA_prof on managed system test_sys:

$ lpar_netboot -t ent -m 00096bdd02e8 -s auto -d auto -S 9.3.6.49 -G 9.3.6.1 -C 9.3.6.234 "machA" "machA_prof" "test_sys"

To network boot the partition machA using the network adapter with a physical location code of 
U1234.121.A123456-P1-T6 with partition profile machA_prof on managed system test_sys:

$ lpar_netboot -t ent -l U1234.121.A123456-P1-T6 -s auto -d auto -S 9.3.6.49 -G 9.3.6.1 -C 9.3.6.234 "machA" 
  "machA_prof" "test_sys"

To perform a ping test along with a network boot of the partition machA with partition profile machA_prof on 
managed system test_sys:

$ lpar_netboot -t ent -D -s auto -d auto -S 9.3.6.49 -G 9.3.6.1 -C 9.3.6.234 "machA" "machA_prof" "test_sys"


Other HMC commands:
-------------------

Activate a logical partition						chsysstate  
Change the default partition profile for a logical partition		chsyscfg   
Close a virtual terminal session for an AIXr, Linuxr, or 
Virtual I/O Server partition						rmvterm   
Create a logical partition on a managed system				mksyscfg   
Create a logical partition profile on a managed system			mksyscfg  
Create a Virtual I/O Server						mksyscfg
Issue a command to a Virtual I/O Server					viosvrcmd   
Modify memory resources of a logical partition				chhwres   
Modify processing resources of a logical partition			chhwres   
Modify the properties of a logical partition profile			chsyscfg 
Modify the hardware resource configuration of a logical partition	chhwres   
Modify the I/O resources of a logical partition				chhwres   
Modify the keylock position on a logical partition			chsysstate   
Modify the properties of a logical partition				chsyscfg   
Modify virtual I/O resources of a logical partition			chhwres  
Open a virtual terminal session for an AIX, Linux, or 
Virtual I/O Server partition 						mkvterm   
Perform a Dynamic Logical Partitioning task				chhwres   
Perform a network boot of a logical partition				lpar_netboot   
Perform an operator panel function on a logical partition		chsysstate  
Remove a logical partition from the managed system			rmsyscfg   
Remove a logical partition profile					rmsyscfg  
Restart a logical partition						chsysstate   
Retrieve MAC address and location code for a partition			lpar_netboot   
Shut down a logical partition						chsysstate
View HCA adapter resources of a logical partition			lshwres
View I/O resources of a logical partition				lshwres
View logical partition profiles						lssyscfg
View logical partitions							lssyscfg
View processing resources of a logical partition			lshwres
View memory resources of a logical partition				lshwres
View reference codes for a logical partition				lsrefcode
View SNI adapter resources of a logical partition			lshwres
View virtual I/O resources of a logical partition			lshwres


Installation of the Virtual I/O Server software on the VIOS partition:
----------------------------------------------------------------------

You can install in one of three ways:
1. using the CD/DVD drive allocated to the VIOS partition and booting from it
2. Installing the VIOS software from the HMC
3. Installing the media using NIM and the HMC

In this example we assume you use the allocated CD/DVD drive.

1. put the DVD disk in the drive
2. activate the VIOS partition by right-clicking the partionname and selecting the
   Activate choice
3. Select the profile and check the "Open a terminal window" and click the "Advanced" button.
   Under the Bootmode choice, select "SMS" boot mode.
4. After booting the partition, the SMS menu appears.

   Main menu
   1. Select Language
   2. Setup Remote IPL
   3. Change SCSI Settings
   4. Select Console
   5. Select Boot Options

Choose 5 "select Boot Options".
Next, Choose 1 for "Select Install/Boot Device".
Next, choose 3 for CD/DVD.
Next, choose 4 for IDE.
Next, choose 1 for IDE CD-ROM.
Next, choose 2 for Normal Mode Boot.

When the installation has finished, use the padmin user to login.
After logging in, you will be placed in the IOSCLI. Type the following command
to accept the license:

# license -accept


Installing AIX using the CD-ROM device to install a partition with an HMC:
--------------------------------------------------------------------------

This information contains procedures to install the AIX operating system. For more information on concepts 
and considerations involved when performing a base operating system installation of AIX, 
or concepts and requirements involved when using the Network Installation Manager (NIM) to install 
and maintain AIX, refer to the AIX 5L Installation Guide and Reference.

Note:
For the installation method that you choose, ensure that you follow the sequence of steps as shown. 
Within each procedure, you must use AIX to complete some installation steps, while other steps are completed 
using the HMC interface.
In this procedure, you will perform a New and Complete Base Operating System Installation on 
a logical partition using the partition's CD-ROM device. This procedure assumes that there is an HMC 
attached to the managed system.

Prerequisites
Before you begin this procedure, you should have already used the HMC to create a partition 
and partition profile for the client. Assign the SCSI bus controller attached to the CD-ROM device, 
a network adapter, and enough disk space for the AIX operating system to the partition. 
Set the boot mode for this partition to be SMS mode. After you have successfully created the partition 
and partition profile, leave the partition in the Ready state. For instructions about how to create 
a logical partition and partition profile, refer to the Creating logical partitions and partition profiles 
article in the IBM eServer Hardware Information Center.

1. Activate and install the partition (perform these steps in the HMC interface)

__  Step 1. Activate the partition, as follows: 

Insert the AIX 5L Volume 1 CD into the CD device of the managed system. 
Right-click on the partition to open the menu. 
Select Activate. The Activate Partition menu opens with a selection of partition profiles. 
Be sure the correct profile is highlighted. 
Select Open a terminal window or console session at the bottom of the menu to open 
a virtual terminal (vterm) window. 

Select Advanced to open the Advanced options menu. 
For the Boot mode, select SMS. 
Select OK to close the Advanced options menu. 
Select OK. A vterm window opens for the partition.

__  Step 2. In the SMS menu on the vterm, do the following: 

Press the 5 key and press Enter to select 5. Select Boot Options.

PowerPC Firmware
Version SF220_001
SMS 1.5 (c) Copyright IBM Corp. 2000, 2003  All rights reserved.
-------------------------------------------------------------------------------
Main Menu

1. Select Language
2. Setup Remote IPL (Initial Program Load)
3. Change SCSI Settings
4. Select Console
5. Select Boot Options


-------------------------------------------------------------------------------


Press the 2 key and press Enter to select 2. Select Boot Devices. 
Press the 1 key and press Enter to select 1. Select 1st Boot Device. 
Press the 3 key and press Enter to select 3. CD/DVD. 
Select the media type that corresponds to the CD-ROM device and press Enter. 
Select the device number that corresponds to the CD-ROM device and press Enter. 
 The CD-ROM device is now the first device in the Current Boot Sequence list. 
Press the ESC key until you return to the Configure Boot Device Order menu. 
Press the 2 key to select 2. Select 2nd Boot Device. 
Press the 5 key and press Enter to select 5. Hard Drive. 
If you have more than one hard disk in your partition, determine which hard disk you will use 
to perform the AIX installation. Select the media type that corresponds to the hard disk and press Enter. 
Select the device number that corresponds to the hard disk and press Enter. 
Press the x key to exit the SMS menu. Confirm that you want to exit SMS.

__  Step 3. Boot from the AIX 5L Volume 1, as follows: 

Select console and press Enter. 
Select language for BOS Installation menus, and press Enter to open the Welcome to Base Operating System 
Installation and Maintenance menu. 
Type 2 to select Change/Show Installation Settings and Install in the Choice field and press Enter. 

                     Welcome to Base Operating System
                      Installation and Maintenance

Type the number of your choice and press Enter.  Choice is indicated by >>>.

    1 Start Install Now with Default Settings  

    2 Change/Show Installation Settings and Install

    3 Start Maintenance Mode for System Recovery

    88  Help ?
    99  Previous Menu
>>> Choice [1]: 2


__  Step 4. Verify or Change BOS Installation Settings, as follows: 

Type 1 in the Choice field to select the System Settings option. 
Type 1 for New and Complete Overwrite in the Choice field and press Enter. 
Note:
The installation methods available depend on whether your disk has a previous version of AIX installed.
When the Change Disk(s) screen displays, you can change the destination disk for the installation. 
If the default shown is correct, type 0 in the Choice field and press Enter. 
To change the destination disk, do the following: 

 Type the number for each disk you choose in the Choice field and press Enter. Do not press Enter 
 a final time until you have finished selecting all disks. If you must deselect a disk, type its number 
 a second time and press Enter. 

 When you have finished selecting the disks, type 0 in the Choice field and press Enter. 
 The Installation and Settings screen displays with the selected disks listed under System Settings.

If needed, change the primary language environment. Use the following steps to change the primary language 
used by this installation to select the language and cultural convention you want to use. 


Note:
Changes to the primary language environment do not take effect until after the Base Operating System Installation 
has completed and your system is rebooted.

Type 2 in the Choice field on the Installation and Settings screen to select the Primary Language Environment 
Settings option. 
Select the appropriate set of cultural convention, language, and keyboard options. 
Most of the options are a predefined combination, however, you can define your own combination of options. 
To choose a predefined Primary Language Environment, type that number in the Choice field and press Enter. 


Monitoring VIOS:
----------------

Note 1:
-------

With Virtual I/O Server fix pack 8.1.0, you can install and configure the 
"IBM Tivoli Monitoring System Edition for System pT agent" on the Virtual I/O Server. 
IBM Tivoli Monitoring System Edition for System p enables you to monitor the health and availability 
of multiple IBM System p servers (including the Virtual I/O Server) from the Tivoli EnterpriseT Portal.

IBM Tivoli Monitoring System Edition (SE) for System p V6.1 is a new offering of the popular IBM Tivoli 
Monitoring (ITM) product specifically designed for IBM System p AIX customers. ITM SE for System p V6.1 monitors 
the health and availability of System p servers, providing rich graphical views of your AIX, LPAR, CEC, 
and VIOS resources in a single console, delivering robust monitoring and quick time to value.

ITM SE for System p includes out-of-the-box best practice solutions created by AIX and VIOS developers. 
These best practice solutions include predefined thresholds for alerting on key metrics, Expert Advice that 
provides an explanation of the alert and recommends potential actions to take to resolve the issue, and the 
ability to take resolution actions directly from the Tivoli Enterprise Portal or set up automated actions. 
In addition, users have the ability to visualize the monitoring data in the Tivoli Enterprise Portal to determine 
the current state of the AIX, LPAR, CEC, and VIOS resources.

Note 2:
-------

How to monitor IBM's Virtual-IO-Server (VIO) with OpenSMART
The following steps tells you, how to monitor IBM's Virtual-IO-Server can be monitored by OpenSMART.

Download the latest agent- (or whole source-) pack from the OpenSMART home page.

Transfer this tar file to the VIO-Server (we assume to /tmp/opensmart-client-0.4.tar.gz) and do:

telnet vio-server

IBM Virtual I/O Server

login: padmin
padmin's Password: 
Last unsuccessful login: Tue Feb 28 03:08:08 CST 2006 on /dev/vty0 
Last login: Wed Mar 15 16:14:11 CST 2006 on /dev/pts/0 from 192.168.1.1

$ oem_setup_env
# mkdir /home/osmart
# useradd -c "OpenSMART Monitoring" -d /home/osmart osmart
# chown -R saicsadm:staff /home/osmart
# passwd osmart
Changing password for "saicsadm"
osmart's New password: ******
Enter the new password again: *****

# su - osmart
$ mkdir ostemp
$ cd ostemp
$ gunzip /tmp/opensmart-client-0.4.tar.gz
$ tar -xf /tmp/opensmart-client-0.4.tar
$ ./agent/install_agent ~
[ ... ]
Copy ../lib/opensmartresponse.dtd -> /usr/local/debis/os/etc/opensmartresponse.dtd
chmod 644 /usr/local/debis/os/etc/opensmartresponse.dtd


     **********************************************
     *   OpenSMART agent installed successfully   *
     **********************************************

$ cd ~
$ rm -rf ostemp
        
That's it - your installation is complete. Now you can configure your osagent (and do not forget to set up 
a cronjob for your osagent).

We recommend the following checks:


DISK Section 9.12, "Configuration for the disk check."
LOGS Section 9.20, "Configuration for the logs check."
ERRPT Section 9.14, "Configuration for the errpt check"
LOAD Section 9.19, "Configuration for the load check."
PROC Section 9.35, "Configuration for the proc check."
AIXSWRAID Section 9.8, "Configuration for the aixswraid check."


Note 3: lpar2rrd
----------------

LPAR2RRD Micro-Partitioning statistics tool
The tool is capable produce historical graphs of shared CPU usage on micro-partitioned systems.
Idea and rough design has been initiated by Ondrej Plachy, IBM Czech Republic. Tool itself has been written 
by Pavel Hampl, IBM Czech Republic.

FEATURES

It intended only for Micro-Partitioned systems 
it creates charts based on utilization data collected on HMC's 
it does not need any clients (agents) on LPARs 
it creates automatically a menu based WWW front end for viewing charts 
it is easy to install, configure and use (initial configuration should not take more than an hour! adding next HMC 
   takes up to 5mins) 
no any additional management when ANY change of LPAR configuration (tool recognizes new LPARs automatically) 
it supports all types of LPAR Micro-partitions and OSes (pSeries/iSeries, AIX/Linux/AS400) 
supported only on HMC >= V5R2.1 (it must support : Utilization data collection, check prerequisities for more) 
when Utilization data collection is supported, but disabled then the tool prompts you for enabling it (then it needs 2 hours at least to allow HMC collect data and let the tool show any data in charts) 
graphs are created 1 year back if historical utilization data on HMC's is available (note HMC saves hourly averages for last 2 months, daily averages for last 2 years when collection of data is enabled) 
initially the tool loads all historical data back to 1 year, then it loads only new data every hour, saves it in RRDTool databases and redraws the graphs 
it uses ssh-keys based access to HMC servers to get data automatically 
optionally for one time chart creation can be used whatever account on HMC and password authentification (you will be prompted to type a password couple of times when the tool is running) 
it does not cause considerable load on the hosted server where the tool is installed (it runs once an hour for couple of seconds) 
tool can be hosted on any *NIX platform, just needs a web server, ssh, perl and RRDTool installed. (check prerequisities bellow) 
it creates 4 kind of graphs for each lpar, shared pool and memory pool. First 3 (last day, week and month) are based 
   on hourly average data, last one (yearly chart) is based on daily averages 


More information about pSeries lpars an AIX:
--------------------------------------------

www.ibm.com/servers/eserver/pseries/lpar/
publib.boulder.ibm.com/infocenter/pseries/index.jsp?topic=/com.ibm.help.doc/welcome.htm


Errors at VIOS:
---------------

Note 1:
-------

Procedure: Install/Update HDLM drivers


# login to vio server as "padmin".
# Switch to "oem" prompt.
oem_setup_env
umount /mnt
mount bosapnim01:/export/lpp_source/hitachi /mnt

#  Install and update all filesets from the directories below.
#  "smitty install_all"
cd /mnt/hdlm_5601/aix_odm/V5.0.0.1
cd /mnt/hdlm_5601/aix_odm/V5.0.0.4u
cd /mnt/hdlm_5601/aix_odm/V5.0.1.4U
cd /mnt/hdlm_5601/aix_odm/V5.0.52.1U

#  Copy license file.
cd /mnt/hdlm_5601/license/enterprise
cp *.plk /var/tmp/hdlm_license

#  install and update all filesets from the above directory
#  "smitty install_all"
#  Fileset DLManager 5.60.1.100  Hitachi Dynamic Link Manager
cd /mnt/hdlm_5601

#  Leave the current Directory and unmount Driver Source Directory.
cd /
umount /mnt


Procedure: Install/Update VIO fixpack:
======================================

# Login to VIO server as "padmin"
# Obtain current IOS level
ioslevel

# Update VIO to latest IOS level
mount bosapnim01:/export/lpp_source/aix/vio_1200 /mnt
updateios -dev /mnt
	** Enter "y" to continue install

# Return to "root" shell prompt and HALT system.
oem_setup_env
shutdown -Fh

# Activate LPAR from HMC WebSM


Procedure: Configure VIO Server to utilize Boot Disks:
======================================================


# Login as "padmin"
# Switch to "oem" prompt
oem_setup_env

# Run in korn shell 93
  ksh93
  
# Remove any vhost adapter configuration settings
  for (( i=0; i<=48; ++i ))
  do
    /usr/ios/cli/ioscli rmdev -pdev vhost${i}
  done

# Remove all HDLM disks
  for i in $( lsdev -Cc disk -F name | grep dlmfdrv )
  do
    rmdev -Rdl ${i}
  done

# Remove all hdisks except for hdisk0 and hdisk1 - assumed to be rootvg
  for i in $( lsdev -Cc disk -F name | grep hdisk | egrep -v 'hdisk0$ | hdisk1$' )
  do
    rmdev -Rdl ${i}
  done

# If an HDLM unconfig file exists, rename it 
  [[ -f /usr/DynamicLinkManager/drv/dlmfdrv.unconf ]] &&
  mv /usr/DynamicLinkManager/drv/dlmfdrv.unconf \
     /usr/DynamicLinkManager/drv/$( date +"%Y%m%d").dlmfdrv.unconf

#  Verify "dlmfdrv.unconf" was renamed.  
   ls /usr/DynamicLinkManager/drv
	
# Set fast fail Parameter for SCSI Adapters and Reconfigure FC Adapters
                       -l fscsi0 -a fc_err_recov=fast_fail
  chdev -l fscsi1 -a fc_err_recov=fast_fail
    chdev -l fscsi2 -a fc_err_recov=fast_fail
    cfgmgr -vl fcs0
  cfgmgr -vl fcs1
  cfgmgr -vl fcs2

# Change HDLM settings
  cd /usr/DynamicLinkManager/bin
  print y | ./dlmodmset -e on
  print y | ./dlmodmset -b 68608

# Reconfigure HDLM disks
  ./dlmcfgmgr

# Turn off reserve settings on HDLM Driver
  ./dlnkmgr set -rsv on 0 -s

# Remove HDLM disks
  for i in $( lsdev -Cc disk -F name | grep dlmfdrv )
  do
    rmdev -Rdl ${i}
  done

# Change reserve policy on hdisks to "no_reserve"
  for i in $( lsdev -Cc disk -F name |
              grep hdisk |
              egrep -v 'hdisk0$|hdisk1$' )
  do
    chdev -l ${i} -a reserve_policy=no_reserve
  done

# Reconfigure HDLM disks
  ./dlmcfgmgr

# Verify all HDLM disks have an assigned PVID
  for i in $( lsdev -Cc disk -F name | grep dlmfdrv )
  do
    chdev -l ${i} -a pv=yes
  done
  lspv

# Remove any vhost adapter configuration settings
 /usr/ios/cli/ioscli lsmap -all

# Verify all vhosts adapters exist wihout Devices.
 SVSA            Physloc                                      Client Partition ID
--------------- -------------------------------------------- ------------------
vhost0          U9119.590.51A432C-V3-C10                     0x00000000

VTD                   NO VIRTUAL TARGET DEVICE FOUND

# Reboot VIO Server.
shutdown -Fr


# End of Final Procedure


# Do not perform this step as part of this procedure
 for (( i=0; i<=48; ++i ))
  do
    /usr/ios/cli/ioscli rmdev -pdev vhost${i}
  done


Other notes on VIOS:
--------------------

Note 1:
-------

Getting a Root Shell on VIOS 
IBM don't like people using the root account on their VIOS servers, but it is kind of useful for setting up 
things like the correct date. Just try the oem_setup_env command. 

Note 2:
-------

VIOS Install 
On our p5-550, I have allocated most physical devices to the VIOS LPAR so it can be used to divide these 
amongst AIX/Linux LPARs. The VIOS LPAR has four gigabit ethernet adapters allocated to it. 
Presently only two are in use as an aggregated link to the "real world". It also has a virtual ethernet adapter 
which connects internally to the p5-550. 

As for storage, there are 6 copper SCSI controllers/chains allocated and 3 FC HBAs. The ordinary SCSI stuff 
has 6 disks (2 per chain) with 4 36Gbyte disks and 2 144Gbyte disks. Two of the 36Gbyte disks are allocated 
to the rootvg volume group and are mirrored (with mirrorios). The remainder of the disks are allocated to a 
clients volume group. It is intended that new clients will have a logical volume for the operating system allocated 
out of this pool. 

Two of the Fiber Channel HBAs are assigned to the VIO partition and connected to port 13 of both switch ports 
in the SAN fabric. The SAN has been configured to attach an 860Gbyte RAID5 LUN to the IBM. Due to lack of 
multipathing support in VIOS, there are multiple apparent disks (hdisk6 ... hdisk13) which are in fact one. 
The first (hdisk6) was used to create the client_data volume group. It is intended that this volume group will 
be used for /data filesystems. 

Note: To add virtual devices dynamically to the VIOS partition, use the "Dynamic" option in the HMC. 


Networking on the VIOS LPAR:
---------------------------- 
Two (at present) of the gigabit adapters assigned to the VIOS LPAR are channelled together for redundancy. 
Telecomms will deliver all the relevant VLANS down this interface which can be bridged to internal Ethernets. 
Note that VLAN14 is configured as the native VLAN of the channel. To do this :- 


- Channel the two Ethernet NICs attached to the network: mkvdev -lnagg ent2 ent3 which produced ent5 
- Bridge between the channelled adapter and the internal network. 
  mkvdev -sea ent5 -vadapter ent4 -default ent4 -defaultid 1 which produced ent6 
- Configure the new bridge with an IP address: 
  mktcpip -hostname name -inetaddr 148.197.14.x -netmask 255.255.255.0 -gateway 148.197.14.254 -interface ent6 
- VLAN interfaces are unlikely to be necessary on the VIOS, but can be created :- 
  mkvdev -vlan ent6 -tagid 240. 

Creating a Client Logical Volume for System Disks:
-------------------------------------------------- 

-Create a logical volume for the relevant client. The name of it should be easily identifyable as 
 being assiciated with the relevant client. ... mklv -lv clientname_sys clients 18G. This creates a logical 
 volume 18Gbytes in size (enough for AIX or Linux operating system) on the clients volume group. 
-Mirror the logical volume for safety: mklvcopy lv_name. Warning, this is SLOW. 
-Assign the logical volume to a virtual adaptor: 
 mkvdev -vdev logical-volume -vadapter vhostN -dev name_for_target 

Creating a Client Logical Volume for Data Disks:
------------------------------------------------ 
- Find the virtual scsi adapter by running 
 lsdev -dev vvodka -parent 
 (of course this finds the vhost for vodka and not necessarily the one you are hunting for). 
- Create a logical volume for the relevant client. The name of it should be easily identifyable 
  as being assiciated with the relevant client. ... 
  mklv -lv clientname_data clients 100G. This creates a logical volume 100Gbytes in size on the 
  clients volume group. 
- Assign the logical volume to a virtual adaptor: 
  mkvdev -vdev logical-volume -vadapter vhostN -dev name_for_target 


Note 3:
-------

Installing AIX on an LPAR via NIM 

- On vodka (the NIM server), configure a hostname for the machine in /etc/hosts. The hostname should be the final 
  hostname of the machine to install with a '-i' added to the end (absinthe becomes absinthe-i) 
  as the connection to the NIM server is on subnet 14. This also means the LPAR to be installed needs 
  an IP address on subnet 14. 
- Go to the NIM smitty menu (smitty nim) and "Perfrom NIM Administration Tasks". 
- Select "Manage Machines", and "Define a Machine". Give the installation hostname of the machine (*absinthe-i), 
  and press Enter. 
- Select "ent" as the primary install interface. 
- On the large form, leave most things alone. But change the "Cable type" to "N/A" and hit Enter. On a previous attempt 
  it was also necessary to change the "Subnet Mask" to 255.255.255.0, and to change the "Default gateway" 
  to 148.197.14.254. 
- Back at the shell prompt, enter smitty nim_bosinst 
- Select the appropriate machine (if it isn't listed something went wrong). 
- Select an "rte" install type. 
- Select "lpp_source_530" as the LPP_SOURCE (package source) to use. 
- Select "spot_530" as the SPOT (install root) to use. 
- A long form then appears, scroll down to change the following parameters :- 
  RESOLV_CONF to use: (No choices!!) 
  ACCEPT new licenses: Change to "yes" (use Tab). 
  ACCEPT new license agreements: Change to "yes" (use Tab) 
  Press Enter to accept the changes. 
- Boot the LPAR to be installed via the HMC. Ensure that you select the "Advanced" button and specify "SMS" as the boot mode. 
- Once you have the console at the SMS menu, select "Setup Remote IPL". 
- Select the "Interpartition Logical LAN" device. 
- Select "IP parameters" 
- Specify the relevant IP addresses. 
- Go back to the main menu ("M") and select "Boot options". 
- Select "Configure Boot Device order" 
- Select "Select 1st boot device" 
- Select "Network" 
- Select "Virtual Ethernet" 
- After the virtual ethernet is specified as the boot device, exit SMS by entering "X" 
- The system should then boot over the network ... you will see lots of "IBM"'s appear on the screen followed by various messages proceeded by "BOOTP". The machine waits 60s for "Spanning Tree" ... this is normal (unless of course you have turned it off!). 
- The boot process should go through a BOOTP phase (when it obtains an address and various parameters) followed by a TFTP stage when the kernel is loaded. 
- After the kernel has booted, you will be asked to enter a digit (either '0' or '1') to select the system console for the install process. 
- Then you will be asked to enter a digit for the preferred installation language. 
- Finally you will be into the standard AIX installation process ... just accept the default settings. 

NIM Hacking 

Create an lpp_source (source of packages) without copying from CDs with nim -o define -t lpp_source -a server=master
 -a location=/nim/lpp_source/lpp_source_530  lpp_source_530 
You can create a SPOT resource using a suitable lpp_source (i.e. a full AIX source) as the source. 

Note 4: Errors in VIOS:
-----------------------

Error ED995F18
--------------

VSCSI_ERR3
ED995F18
000DRCFFFF FFF9

The Virtual SCSI server adapter (partition number and slot number) specified in the client adapter definition 
does not exist

On the HMC, correct the client adapter definition to associate it with a valid server adapter.

Error BFE4C025
--------------

BFE4C025 0222073404 P H sysplanar0 UNDETERMINED ERROR
BFE4C025   0113124506 P H sysplanar0     UNDETERMINED ERROR
BFE4C025 0126122306 P H sysplanar0 UNDETERMINED ERRR
BFE4C025 0112174806 P H sysplanar0 UNDETERMINED ERRR
BFE4C025   0919215904 P H sysplanar0     UNDETERMINED ERROR


thread:

A:

 had the error on aix 5.1, here is what IBM said : 

This is corrected with apar IY46874 that ships devices.chrp.base.rte 
5.1.0.53 

Thx to apply that apar , reboot the system. 

http://www-912.ibm.com/eserver/support/fixes/fcgui. jsp


Q:

 get a error "BFE4C025" from errpt, and this error often happen at 
almost all type RS/6000. 

when this happened, the system is running OK also, I also get none 
error information by 'diag' tools. 
I don't know what happen at the system. Who can help me? 


A:

Diags show the SRC, B700 F104, look it up here:
http://publib.boulder.ibm.com/infocenter/eserver/v1r3s/index.jsp?topic=/ipha6/refcodelist.htm

Operating System error 
Platform Licensed Internal Code terminated a partition.

If SRC word 3 is 0007, then a user may have initiated a function 22 prior to the operating system completing the IPL. 
If a function 22 was not performed, or if SRC word 3 is not 0007, look for other serviceable errors 
which occurred at same time frame.

Your word 3 is 0007 so it looks like some one forced a dump from the O/S or the Op Panel or the HMC, 
either by powering off an LPAR configured to dump or the LPAR crashed and dumped.

http://publib.boulder.ibm.com/infocenter/eserver/v1r3s/index.jsp?topic=/iphb5/f22msdc.htm

No indication of a hardware problem or any need to call support nagger.


DLPAR scripts:
==============


Note 1:
-------

Abstract 
 
DLPAR scripts are written by system administrators or software vendors to automate system resources in a dynamic 
LPAR environment. Scripts can be implemented in any scripting language, such as perl or shell, or it can be 
a compiled program. They are maintained by the system administrator using the drmgr command. 
The following Tip provides an overview of how to craft a script.

For related information about this topic, refer to the following IBM Redbooks publication:
AIX 5L Differences Guide Version 5.2 Edition, SG24-5765-02 
. 
Contents 
_ 
DLPAR scripts, used to automate LPAR reconfiguration, are written by system administrators or software vendors. 
Scripts can be implemented in any scripting language, such as perl or shell, or it can be a compiled program. 
They are maintained by the system administrator using the drmgr command. The syntax of the command is as follows: 

drmgr { -i script_name [-w minutes ] [ -f ] | -u script_name } [ -D hostname ]
drmgr [ -b ]
drmgr [ -R script_install_root_directory ]
drmgr [ -S syslog_ID ]
drmgr [ -l ] 

Descriptions of the most important flags for the drmgr command are provided in the following table. 
For a complete reference, refer to the man page or the documentation. 

Table 1. Flags of the drmgr command Flags Description 
-i script_name	This flag is used to install a script specified by the script_name parameter. By default, 
               	scripts are installed in the /usr/lib/dr/scripts/all directory. 
-w minutes	This flag is used to override the time limit value specified by the vendor for the script. 
		The script will be ended if it exceeds the specified time limit. 
-f		Using this flag forces an installed script to be overwritten. 
-u script_name	This flag is used to uninstall a script specified by the script_name parameter. 
-l		This option will display the details regarding the DLPAR scripts that are currently installed. 

For example, to install the /root/root_dlpar_test.sh script in the default directory, the following command could be used: 

drmgr -i /root/root_dlpar_test.sh 

To list the details, the drmgr -l command is used. The output is similar to the following: 

DR Install Root Directory: /usr/lib/dr/scripts
Syslog ID: DRMGR
------------------------------------------------------------
/usr/lib/dr/scripts/all/root_lpar_test.sh DLPAR test script
Vendor:IBM, Version:1.0, Date:19092002
Script Timeout:10, Admin Override Timeout:0
Resources Supported:
Resource Name: cpu Resource Usage: root_dlpar_test.sh command [parameter]
------------------------------------------------------------ 

DLPAR scripts get notified at each of the DLPAR operation phases explained previously. Notifying DLPAR scripts 
involves invoking the scripts in the appropriate environment with the appropriate parameters. 

The environment the script is executed in is as follows: 

The execution user ID and group ID are set to the uid or gid of the script. 
The PATH environment is set to /usr/bin:/etc:/usr/sbin. 
The working directory is /tmp. 
Environment variables that describe the DLPAR event are set. 

DLPAR scripts can write any necessary output to stdout. The format of the output should be name=value pair strings 
separated by newline characters to relay specific information to the drmgr. For example, the output DR_VERSION=1.0 
could be produced with the following ksh command: 

echo "DR_VERSION=1.0" 

Error and logging messages are provided by DLPAR scripts in the same way as regular output by writing 
name=value pairs to stdout. The DR_ERROR=message pair should be used to provide error descriptions. 
The name=value pairs contain information to be used to provide error and debug output for the syslog. 

DLPAR scripts can also write additional information to stdout that will be reflected to the HMC. 
The level of information that should be provided is based on the detail level passed to the script 
in the DR_DETAIL_LEVEL=N environment variable. N must be in the range of 0 to 5, where the default value 
of zero (0) signifies no information. A value of one (1) is reserved for the operating system and is used 
to present the high-level flow. The remaining levels (2-5) can be used by the scripts to provide information 
with the assumption that larger numbers provide greater detail. 

The syntax the DLPAR script is invoked with is as follows: 

[ input_name1=value1 ... ] scriptname command [ input_parameter1 ... ] 

Input variables are set as environment variables on the command line, followed by the script to be invoked that 
is provided with a command and with further parameters. A description of the function the commands should perform 
is provided in the following table. If the script is called with a command that is not implemented, 
it should exit with a return code of 10.


Table 2. DLPAR script commands Command and parameter Description 
scriptinfo Identifies the version, date, and vendor of the script. It is called when the script is installed. 
register Identifies the resources managed by the script. If the script returns the resource name (cpu or mem), the script will be automatically invoked when DLPAR attempts to reconfigure processors and memory, respectively. The register command is called when the script is installed with the DLPAR subsystem. 
usage resource_name Returns information describing how the resource is being used by the application. The description should be relevant so that the user can determine whether to install or uninstall the script. It should identify the software capabilities of the application that are impacted. The usage command is called for each resource that was identified by the register command. 
checkrelease resource_name Indicates whether the DLPAR subsystem should continue with the removal of the named resource. A script might indicate that the resource should not be removed if the application is not DLPAR-aware and the application is considered critical to the operation of the system. 
prerelease resource_name Reconfigures, suspends, or terminates the application so that its hold on the named resource is released. 
postrelease resource_name Reconfigures, resumes, or restarts the application. 
undoprerelease resource_name Invoked if an error is encountered and the resource is not released. Operations done in the prerelease command should be undone. 
checkacquire resource_name Indicates whether the DLPAR subsystem should proceed with the resource addition. It might be used by a license manager to prevent the addition of a new resource, for example, cpu, until the resource is licensed. 
preacquire resource_name Used to prepare for a resource addition. 
undopreacquire resource_name Invoked if an error is encountered in the preacquire phase or when the event is acted upon. Operations performed in the preacquire command should be undone. 
postacquire resource_name Reconfigure, resume, or start the application. 
The input variables that are provided as environment variables are dependent on the resource that is operated on. For memory add and remove operations, the variables provided in the following table are provided (one frame is equal to 4 KB): 

Table 3. Input variables for memory add/remove operations Input variable Description 

DR_FREE_FRAMES=0xFFFFFFFF 	The number of free frames currently in the system, in hexadecimal format. 
DR_MEM_SIZE_COMPLETED=n 	The number of megabytes that were successfully added or removed, in decimal format. 
DR_MEM_SIZE_REQUEST=n 		The size of the memory request in megabytes, in decimal format. 
DR_PINNABLE_FRAMES=0xFFFFFFFF  	The total number of pinnable frames currently in the system, in hexadecimal format. 
				This parameter provides valuable information when removing memory in that it can be used 
				to determine when the system is approaching the limit of pinnable memory, 
				which is the primary cause of failure for memory remove requests. 
DR_TOTAL_FRAMES=0xFFFFFFFF 	The total number of frames currently in the system, in hexadecimal format. 

The environment variables provided in the following table are set for processor add and remove operations:


Table 4. Input variables for processor add/remove operations Input Variable Description 

DR_BCPUID=N		The bind CPU ID of the processor that is being added or removed in decimal format. 
			A bindprocessor attachment to this processor does not necessarily mean that the attachment 
			has to be undone. This is only true if it is the Nth processor in the system, because 
			the Nth processor position is the one that is always removed in a CPU remove operation. 
			Bind IDs are consecutive in nature, ranging from 0 to N and are intended to identify only 
			online processors. Use the bindprocessor command to determine the number of online CPUs. 
DR_LCPUID=N 		The logical CPU ID of the processor that is being added or removed in decimal format. 

In the following example, an example Korn shell script in given that can be installed. For simplicity and demonstration 
purposes this script does not take any action. The actions for the process to control would need to be included 
in the appropriate command section: 

#!/usr/bin/ksh

if [[ $# -eq 0 ]]
then
echo "DR_ERROR= Script usage error"
exit 1
fi

ret_code=0
command=$1
case $command in
scriptinfo )
echo "DR_VERSION=1.0"
echo "DR_DATE=19092002"
echo "DR_SCRIPTINFO=DLPAR test script"
echo "DR_VENDOR=IBM";;
usage )
echo "DR_USAGE=root_dlpar_test.sh command [parameter]";;
register )
echo "DR_RESOURCE=cpu";;
checkacquire )
:;;
preacquire )
:;;
undopreaquire )
:;;
postacquire )
:;;
checkrelease )
:;;
prerelease )
:;;
undoprerelease )
:;;
postrelease )
:;;
* )
ret_code=10;;
esac

exit $ret_code
 
 
=======================================
60. SOME NOTES ON VIRTUALIZATION HP-UX:
=======================================


60.1 General information:
-------------------------

HP has had nPar hard partitions in the HP 9000 midrange and Superdome computers since the September 2000 launch 
of the Superdomes. These servers are based on a four-way cell board, and each cell board can be logically 
and electronically isolated from the others in the system, have its own HP-UX operating system installed on it, 
and function like a free-standing Unix server. In August 2001, HP announced vPar virtual partitions, 
which it rolled out first with the Superdomes and then cascaded down the HP 9000 server line. 
The Itanium-based Integrity server line has had static partitions for HP-UX and Windows operating systems 
at the high-end, and has supported HP-UX, Linux, and Windows at the low end. Only two weeks ago, HP announced 
that Linux was available on eight-way partitions on the 16-way and 64-way variants of the Integrity Superdome boxes 
through eight-way nPars. (Linux was not supported on the Superdomes until then.) 


1. nPar allows physical partioning of server 
2. vPar allows logical partioning of server. 

In both the above cases one server box can be devided in multiple servers, thus allowing consolidation. 

Each npar or vpar is a separate machine. You can transfer CPUs between vpars on the fly, but in a serious hardware 
failure you can lose all vpars. npar is more solid than vpar but you cannot transfer CPUs on the fly, it needs reboot 
and you can transfer only cell boards, I mean single CPU cannot be transfered to another npar. 

nPar is Node Partition. 
Basically distributing the IO ,CPU , Memory , and creating a virtule node within a single box. 
Superdome , V-Class , RP84XX 86XX , are nPar capable. 

v-Par : Virtual Partition. 
With Virtual Partitions (vPars) you can take almost any HP 9000 server and turn it into many "virtual" computers. 
These virtual computers can each be running their own instance of HP-UX and associated applications. 
The virtual computers are isolated from one another at the software level. Software running on one Virtual Partition 
will not affect software running in any other Virtual Partition. In the Virtual Partitions you can run different 
revisions of HP-UX, different patch levels of HP-UX, different applications, or any software you want and not affect 
other partitions. 


- Virtual Partitions versus Hard Partitions

A hard partition is a physical partition of a computer that divides the computer into groups of cell boards 
where each group operates independently of other groups. A hard partition can run a single instance of HP-UX 
or be further divided into virtual partitions.

A virtual partition is a software partition of a computer or hard partition where each virtual partition 
contains an instance of HP-UX. Though a hard partition can contain multiple virtual partitions, the inverse is 
not true. A virtual partition cannot span a hard partition boundary.
 

60.2 Bootsequence of vPar:
--------------------------

Boot Sequence: Quick Reference 

-- On a computer without vPars, a simplified boot sequence is:

1. ISL
   (Initial System Loader)
 
2. hpux
   (secondary system loader)
 
3. /stand/vmunix
   (kernel)
 

-- Adding vPars adds the monitor layer, so now hpux loads the monitor and then the monitor boots the kernels 
   of the virtual partitions. The boot sequence becomes

1. ISL
 
2. hpux
 
3. /stand/vpmon
  (vPars monitor and partition database) 
4. /stand/vmunix
  (kernels of the virtual partitions)
 

With or without vPars, the firmware loads and launches ISL.

ISL>

In a computer without vPars, at the ISL prompt, the secondary system loader hpux loads the kernel /stand/vmunix:

ISL> hpux /stand/vmunix

However, in a computer with vPars, at the ISL prompt, the secondary system loader hpux loads the 
vPars monitor /stand/vpmon:

ISL> hpux /stand/vpmon

The monitor loads the partition database (the default is /stand/vpdb) and internally creates (but does not boot) 
each virtual partition according to the resource assignments in the database.

Next, the vPars monitor runs in interactive mode (when no options to /stand/vpmon are given) with a 
command line interface.

MON>

To boot a kernel in a virtual partition (that is, to launch a virtual partition), use the monitor command 
vparload. For example, to launch the virtual partition named szilva1:

MON> vparload -p szilva1

In this example, the vPars monitor would load the virtual partition szilva1 and launch the kernel from the 
boot device specified for szilva1. (The boot device is assigned when the virtual partition is created and is 
recorded in the monitor database.)

HP-UX is now booted on the virtual partition szilva1.

Once a partition is running, you will be at the virtual console of a partition. Subsequent virtual partitions 
can be booted using the vPars command vparboot at the UNIX shell prompt of szilva1.


61. Alternate disk install AIX:
===============================

Its possible to install AIX onto another disk on the same system. This is not partitioning,
its just a second install of the BOS, on another disk.

You need to have 

"bos.alt_disk_install.rte" fileset installed. This fileset ships the "alt_disk_install" command, 
which allows cloning of the rootvg and installing an AIX mksysb to an alternate disk.

"bos.alt_disk_install.boot_images" fileset installed. This fileset shipts the boot images,
which is required to install mksysb images to an alternate disk.

Once you have installed these filesets, the alternate disk installation functions are available
to you. 

You can use the "smitty alt_install" or "smitty alt_clone" or "smitty alt_mksysb" fastpath:

# smitty alt_install

-----------------------------------------------

               Alternate Disk Installation

Move cursor to desired item and press Enter.

  Install mksysb on an Alternate Disk
  Clone the rootvg to an Alternate Disk

F1=Help  F2=Refresh   etc..
-----------------------------------------------

So, the Alternate Disk Installation can be used in one of two ways:
- Cloning the current rootvg to an alternate disk.
- Installing a mksysb image on another disk.


# smitty alt_mksysb

-----------------------------------------------
         Install mksysb on an Alternate Disk

Type or select values in entry fields.
Press Enter AFTER making all desired changes.

 Target Disk(s) to install          []
 Device or image name               []
 Phase to execute                    all
 image.data file                    []
 Customization script               []
 Set bootlist to boot from this disk
 on next reboot?                     yes
 Reboot when complete                no
 Verbose output?                     no
 Debug output?                       no
 resolv.conv file                   []

-----------------------------------------------


You can also use the "alt_disk_install" command to clone the rootvg to another disk.
The command creates an "altinst_rootvg" volumegroup on the destination disk and prepares
the same logical volumes as in the rootvg, except the names are prepended with "alt_",
for example, alt_hd1. Similar are the filesystems renamed to "/alt_inst/filesystemname"
and the original data (mksysb or rootvg) is copied.

After this first fase, a second fase begins where an optional configuration action 
can be performed, either a custom script or update of software, when cloning rootvg.

The third fase unmounts the /alt_inst/filesystems and renames the filesystems and logical volumes
by removing the alt names. Then the bootlist is altered to boot from the new disk.
After the system is rebooted, the original rootvg is renamed to old_rootvg.

Example:

# lspv
hdisk0      00fa7377474    rootvg
hdisk1      00hdgfh6374    None


# alt_disk_install -BC hdisk1


performs cloning hdisk0 to hdisk1 where hdisk1 will be the new rootvg.


Installing a second AIX52 partition using alt_disk_install:
-----------------------------------------------------------

You can use the alt_disk_install command to clone a system image to another disk, and you may use 
the -O option to remove references in the object data manager (ODM) and device (/dev) entries 
to the existing system. The -O flag tells the alt_disk_install command to call the devreset command, 
which resets the device database. The cloned disk can now be booted as if it were a new system.

An example of this scenario is as follows:

Boot the managed system as a Full System Partition so you have access to all the disks in the managed system. 
Configure the system and install the necessary applications. 
Run the alt_disk_install command to begin cloning the rootvg on hdisk0 to hdisk1, as follows: 

# /usr/sbin/alt_disk_install -O -B -C hdisk1


The cloned disk (hdisk1) will be named altinst_rootvg by default. 
Rename the cloned disk (hdisk1) to alt1, so you can repeat the operation with another disk. 
# /usr/sbin/alt_disk_install -v alt1 hdisk1

Run the alt_disk_install command again to clone to another disk and rename the cloned disk, as follows: 
# /usr/sbin/alt_disk_install -O -B -C hdisk2
# /usr/sbin/alt_disk_install -v alt2 hdisk2

Repeat steps 3 through 5 for all of the disks that you want to clone. 
Use the HMC to partition the managed system with the newly cloned disks. 
Each partition you create will now have a rootvg with a boot image. 
Boot the partition into SMS mode. Use the SMS MultiBoot menu to configure the 
first boot device to be the newly installed disk. Exit the SMS menus and boot the system. 


62. IBM LPAR FAQ:
=================

Logical partitioning
Frequently asked questions
   
  
DLPAR 

 
 What is required to enable dynamic capable LPARs? 
 Does the upgrade of the HMC or Platform Firmware affect my AIX 5.1 partitions? 
 What is the order for AIX, HMC, and Platform Hardware updates? 
 Where would I find latest versions or upgrades for: AIX or HMC or Platform Firmware? 
 Can dynamic and non-dynamic LPARs co-exist on the same pSeries? 
 Is Linux DLPAR capable? 
 Do all DLPAR operations have to be done through the HMC GUI? 
 What conditions may impede DLPAR operations? 
 Are there special rules for DLPAR operations? 
 How much time does it take for a DLPAR operation to complete? 
 How is the "detail level" option in the HMC used? 
 How is the timeout value for DLPAR operations used by the HMC? 
 With a timeout limit of zero, how can I stop a command that may not complete because the DLPAR command will not succeed? 
 If we do dynamic configuration, what will happen to the process pinned or accessing direct memory? 
 Are there special AIX filesets or PTF levels required for DLPAR? 
 Are applications affected by DLPAR operations? 
 What is a "DLPAR aware" application? 
 What is the relationship between DLPAR and Capacity Upgrade on Demand (CUoD)? 
 How does Dynamic Processor Deallocation work with Dynamic Processor Sparing? 
 How does affinity partitioning relate to DLPAR? 
 Are there any examples of using the HMC command line to automate DLPAR? 
 

Question: What is required to enable dynamic capable LPARs?

Answer: An upgrade of AIX, HMC, and Platform Firmware is required. The required levels are as follows:

AIX: 5.2
HMC: Release 3, Version 1.0
Platform Firmware: 10/2002 system firmware or later.

To determine platform firmware level, on any AIX partition type: 
lscfg -vp | grep -p Platform. 
The last 6 digits of the ROM Level represent the Platform Firmware date in the format: "YYMMDD". 


Question: Does the upgrade of the HMC or Platform Firmware affect my AIX 5.1 partitions?

Answer: The upgrade of Platform Firmware on some 5.1 systems may cause some systems difficulty in reboot. 
Thus, users are encouraged to apply APAR IY31961 on their AIX 5.1 partitions before upgrading Platform Firmware.


Question: What is the order for AIX, HMC, and Platform Hardware updates?

Answer: The recommended order is:
1. Install APAR IY31961 on AIX 5.1 partitions, if needed.
2. Upgrade the HMC to version 3.1.
3. Upgrade the Platform Firmware to 10/2002 or later.
4. For 5.2 partitions, perform AIX migration (from 5.1) or install.


Question: Where would I find latest versions or upgrades for: AIX or HMC or Platform Firmware?

Answer: Users should visit the software support sites:
AIX: techsupport.services.ibm.com/server/support?view=pSeries
HMC: techsupport.services.ibm.com/server/hmc
Users should consult their IBM Customer Engineers regarding latest Platform Firmware availability.


Question: Can dynamic and non-dynamic LPARs co-exist on the same pSeries?

Answer: Yes. The HMC GUI will not display Dynamic LPAR menus for partitions that are not DLPAR enabled.
  

Question: Is Linux DLPAR capable?

Answer: Yes. Linux distro's that use the Linux 2.6 Kernel or higher have the capability of supporting DLPAR on POWER5 systems. Currently both Novell/SUSE Linux for Power and RedHat Linux for Power Distro both support DLAR capabilities.

 
Question: Do all DLPAR operations have to be done through the HMC GUI?

Answer: While it is recommended that users use the HMC GUI for dynamic resource re-allocation, it is possible for a user or script to execute commands on the HMC command line to perform dynamic resource operations on a dynamic capable partition.


Question: What conditions may impede DLPAR operations?

Answer: There may be cases where the resource that users wish to deallocate are not available because they are in use by the operating system or applications. In those cases, the operation may not complete until these resources are freed. Dynamic LPAR operations are also constrained by the resource specifications in the active LPAR profile, such as maximum/minimum processors or memory, or required I/O slots.


Question: Are there special rules for DLPAR operations?

Answer: Dynamic operations with processors and memory typically require no special actions. However, the movement of "slots" does require special handling. When the user is moving a "slot", they are attempting to reallocate a resource that is attached to an adapter that is inserted in a slot. An example of this might be a CDROM drive or ethernet adapter that is used by one DLPAR partition that the administrator would like moved to another DLPAR partition. For cases involving slots, the user should:

deconfigure the child device connected to the parent adapter. 
use the SMIT PCI Hot Plug procedures to remove the adapter (but don't physically remove the card). 
use the HMC GUI to move the slot from one Dynamic-capable partition to another. 
after the movement of the slot, re-enable the adapter via the "Hot-Plug" process and reconfigure the parent adapter and then the child device. 


Question: How much time does it take for a DLPAR operation to complete?

Answer: In general, on a non-loaded system, a single processor move can take less than a minute. Memory moves may take a few more minutes than a processor move.

Question: How is the "detail level" option in the HMC used?

Answer: This sets the various level of debug output displayed during DLPAR operations. Additionally, this allows the user to see all the steps that AIX performed in the DLPAR operation providing tracing/logging information for debug and problem determination.


Question: How is the timeout value for DLPAR operations used by the HMC?

Answer: The user can set a time limit (in minutes) setting so that the DLPAR operation request will be canceled if the pre-set time limit is exceeded. An example is a situation requiring memory moves. When the memory cannot be re-allocated because resource memory is pinned to the physical memory, sometimes certain operations will take a very long time to complete. A time limit in this case may be used to limit the amount of retries that take place. A time limit of zero implies that there is no time limit.


Question: With a timeout limit of zero, how can I stop a command that may not complete because the DLPAR command will not succeed?

Answer: although a user may set the timeout limit to zero, HMC and AIX each have a set of default behaviors that will ensure a DLPAR command, that will eventually fail, will return with the appropriate error message.


Question: If we do dynamic configuration, what will happen to the process pinned or accessing direct memory?

Answer: Nothing. If a process has pinned memory, the virtual memory manager transparently migrates the data to a new pinned physical page and atomically updates the virtual to real page mappings to point to the new physical page.


Question: Are there special AIX filesets or PTF levels required for DLPAR?

Answer: The installation of AIX 5.2 is adequate for current pSeries LPARs to perform dynamic operations.


Question: Are applications affected by DLPAR operations?

Answer: A large majority of applications should be DLPAR unaware, which means they are not programmed to take advantage of DLPAR capabilities from within the application. Thus, they should not be affected by DLPAR. Only programs considered "DLPAR aware" might be affected by DLPAR actions.


Question: What is a "DLPAR aware" application?

Answer: A DLPAR aware application cares about the resource levels allocated to the partition and can alter its behavior based on changes in the resource levels. AIX provides APIs for applications that wish to be DLPAR aware.


Question: What is the relationship between DLPAR and Capacity Upgrade on Demand (CUoD)?

Answer: DLPAR can be used to bring online a resource that has been activated through CUoD.


Question: How does Dynamic Processor Deallocation work with Dynamic Processor Sparing?

Answer: If spare (unlicensed CUoD) processors are available, the partition should be able to assign and bring online these processors before it deactivates a failing processor.

 
Question: How does affinity partitioning relate to DLPAR?

Answer: Users can perform DLPAR operations on I/O slots with affinity partitions, but not with processor or memory resources.


Question: Are there any examples of using the HMC command line to automate DLPAR?

Answer: The DLPAR toolset avaliable on alphaworks provides tools that automate DLPAR operations using the HMC command line.


63. bosinst.data file:
======================

AIX only.

The bosinst.data file is an ascii file which controls the installation of AIX.
I can function as a sort of a "response file" in an unattended install.

If you are customizing the /bosinst.data file in order to have it become part of a system backup (mksysb), 
please note that starting with AIX Version 4.3.3, the mksysb command always updates the target_disk_data stanzas 
to reflect the current disks in the rootvg. If you do not want this update to occur you must create the file 
/save_bosinst.data_file. The existance of this file is checked by the mksysb command, before the 
target_disk_data stanzas are updated.

If you are editing the bosinst.data file, use one of the following procedures:


1. Create and Use a Backup Tape:
--------------------------------

Customize the bosinst.data file:

Change your directory, with the cd command, to the /var/adm/ras directory.

Copy the /var/adm/ras/bosinst.data file to a new name, such as bosinst.data.orig. 
This step preserves the original bosinst.data file.

Edit the bosinst.data file with an ASCII editor.

Verify the contents of the edited bosinst.data file using the bicheck command:

/usr/lpp/bosinst/bicheck filename

Copy the edited file to the root directory:

cp /var/adm/ras/bosinst.data /bosinst.data

If you do not want the target_disk_data file updated to reflect the current rootvg, 
create the file /save_bosinst.date_file by using the following command:

touch /save_bosinst.data_file

Create a backup image of the system:

Back up the system, using one of the following: the Web-based System Manager Backups application, 
the System Management Interface Tool (SMIT), or mksysb command. 

BOS installations from this backup will behave according to your customized bosinst.data file.


2. Create and Use a Client File:
--------------------------------

Create one customized bosinst.data file for each client and, using the Network Installation Manager (NIM), 
define the files as resources. Refer to AIX Version 4.3 Network Installation Management Guide and Reference 
for more information about how to use the bosinst.data file as a resource in network installations.


3. Create and Use a Supplementary Diskette:
-------------------------------------------

This procedure describes how to create the supplementary diskette and use it in future installations:

Customize the bosinst.data file:

Change your directory, with the cd command, to the /var/adm/ras directory.

Copy the /var/adm/ras/bosinst.data file to a new name, such as bosinst.data.orig. 
This step preserves the original bosinst.data file.

Edit the bosinst.data file with an ASCII editor.

Create an ASCII file consisting of one word:

data

Save the new ASCII file, naming it signature.

Create the diskette and use it for installation:

Back up the edited bosinst.data file and the new signature file to diskette with the following command:

ls ./bosinst.data ./signature | backup -iqv

OR

If you create a bundle file named mybundle, back up the edited bosinst.data file, 
the new signature file, and the bundle file to diskette with the following command:

ls ./bosinst.data ./signature ./mybundle | backup -iqv

Put the diskette in the diskette drive of the target machine you are installing.

Boot the target machine from the install media (tape, CD-ROM, or network) and install AIX.

The BOS installation program will use the diskette file, rather than the default bosinst.data file 
shipped with the installation media.

Example bosinst.data file:
--------------------------

The following is an example of a modified bosinst.data file that might be used in a nonprompted network installation:

control_flow:
   CONSOLE = Default
   INSTALL_METHOD = overwrite
   PROMPT = no
   EXISTING_SYSTEM_OVERWRITE = yes
   RUN_STARTUP = no
   RM_INST_ROOTS = yes
   ERROR_EXIT = 
   CUSTOMIZATION_FILE = 
   TCB = no
   BUNDLES = 
   RECOVER_DEVICES = Default
   BOSINST_DEBUG = no
   ACCEPT_LICENSES = yes
   INSTALL_CONFIGURATION = 
   DESKTOP = CDE
	INSTALL_DEVICES_AND_UPDATES = yes    
	IMPORT_USER_VGS = yes                
	ENABLE_64BIT_KERNEL = yes             
	CREATE_JFS2_FS = yes                  
	ALL_DEVICES_KERNELS = yes            
	GRAPHICS_BUNDLE = no                 
	DOC_SERVICES_BUNDLE = no             
	NETSCAPE_BUNDLE = yes                
	HTTP_SERVER_BUNDLE = yes             
	KERBEROS_5_BUNDLE = yes              
	SERVER_BUNDLE = yes                  
	ALT_DISK_INSTALL_BUNDLE = yes        
	REMOVE_JAVA_118 = no                 

target_disk_data:
   PVID = 
   CONNECTION = 
   LOCATION =
   SIZE_MB =
   HDISKNAME = hdisk0

locale:
   BOSINST_LANG = en_US
   CULTURAL_CONVENTION = en_US
   MESSAGES = en_US
   KEYBOARD = en_US


64. NIM:
========

64.1 Some notes about NIM:
==========================


AIX only.

Network Installation Management, or NIM, means that from a Server, via the network, clients can be
installed with AIX and possibly other software.

With NIM, you can have unattended installation of clients. The NIM Server also provides you with
the backup images of all your Servers (the NIM clients).

NIM objects:
------------
This topic explains the objects concept as it is used in the NIM environment.
The machines you want to manage in the NIM environment, their resources, and the networks through 
which the machines communicate are all represented as objects within a central database that resides 
on the master. Network objects and their attributes reflect the physical characteristics 
of the network environment. This information does not affect the running of a physical network 
but is used internally by NIM for configuration information. 
Each object in the NIM environment has a unique name that you specify when the object is defined. 
The NIM name is independent of any of the physical characteristics of the object it identifies 
and is only used for NIM operations. The benefit of unique names is that an operation can be performed 
using the NIM name without having to specify which physical attribute should be used. 
NIM determines which object attributes to use. For example, to easily identify NIM clients, 
the host name of the system can be used as the NIM object name, but these names are independent 
of each other. When an operation is performed on a machine, the NIM name is used, and all other data 
for the machine (including the host name) is retrieved from the NIM database. 

NIM machines:
-------------
The types of machines that can be managed in the NIM environment are standalone, diskless, 
and dataless clients. This section describes the differences between the machines, the attributes required 
to define the machines, and the operations that can be performed on them. 

The NIM environment is composed of two basic machine roles: master and client. The NIM master manages 
the installation of the rest of the machines in the NIM environment. The master is the only machine 
that can remotely run NIM commands on the clients. All other machines participating in the NIM environment 
are clients to the master, including machines that may also serve resources.


NIM Resources (from somewhat older source):
--------------
NIM allows you to customize installations and maintain clients on the network from a centralized location 
(the NIM master) or the NIM client itself. The master contains the NIM database and can serve resources. 
Resources in NIM are files or directories containing data that NIM will use to install, customize, 
and maintain NIM clients. A NIM client is any machine configured and defined in the NIM database. 
Some key NIM resources used in our setup are:

- Licensed Program Product Source Directory (lpp_source): This directory contains backup file format 
(BFF) images, which AIX installp uses to load software. One way to understand the role of the 
lpp_source directory in a BOS installation is to compare it to all the installation images needed 
to support any configuration (specifically different device configurations) along with a base core set 
of software (called simages) that are on the BASE installation CDs. We created a base 433 lpp_source, 
multiple lpp_sources containing different maintenance levels, and separate lpp_sources for our 32-bit 
and 64-bit third-party application software.

- Shared Product Object Tree (SPOT): This directory is created from an lpp_source and is equivalent 
in content to a /usr file-system on AIX. The purpose of a SPOT in a NIM installation is similar to the 
boot images and BOS installation scripts (bi_main, rc.boot, and rc.bosinst) on volume 1 of the 
BASE install CD. The SPOT must contain support for all boot environments (platform, network type, kernel type). 
We created several different SPOTs for the different data centers and maintenance levels we use to support our systems.

- bosinst_data: This data file contains information that drives the BOS install 
(e.g., prompt vs. no-prompt, which disk to install the OS on, and the type of installation 
(Overwrite, Preservation, or Migration) to name a few). First, we created separate bosinst_data resources 
for each machine type (S80, H70, B50, M80, P680, and 43P). Then, by specifying two disks to target 
in our bosinst_data resource and specifying copies in the image_data resource, we could set up 
mirroring during the initial load.

- image_data: This data file contains information about the characteristics of the OS being installed. 
For example, it includes the size of file systems, whether or not to mirror, and whether or not to 
disk stripe. We created separate image_data resources for each machine type (S80, H70, B50, M80, P680 and 43P).

- Installp_bundles: This data file contains a customized list of additional software to install 
after the base AIX software is loaded. If you have different configurations that you need to duplicate 
on a repeatable basis, this resource is very useful. In our environment, we have different OS software 
requirements for development, QA, and production above the minimal AIX software needed to support 
different hardware systems. The easiest way to facilitate and maintain these different requirements, 
which need to be consistent, is to use installp_bundles.

- mksysb: This is a backup archive file that contains a system image of rootvg. 
Because of our network security restrictions (no one machine could be connected to all the networks 
within our organization), we used mksysb and savevg tapes to replicate the NIM master to the other data centers. 
If we had one machine connected to the different data centers, we could have used NIM to replicate 
and update the NIM masters in the different data centers by BOS-installing a NIM mksysb resource and 
using a NIM script to restore the other volume group data.

- mac_group: This is a logical grouping of machine types (standalone, diskless, or dataless) 
that enables the systems administrator to target one or more machines with a single command or 
NIM operation. We did not use this feature, but we could have taken advantage of this by grouping all like 
systems and like configurations to install to more than one machine at a time.


We used the 43P systems as our NIM masters for each data center because they could complete remote 
installations of machines or be moved and directly connected to a server for OS installations. 
These NIM masters were also designated as the resource servers in our environment. 
To ensure consistency and standardization of each NIM master (for the different data centers), 
we created a standard NIM master machine, which we cloned. We made a stacked tape containing a mksysb image 
and a savevg image of the standard NIM master to sync up and update the other NIM masters. 
Here are the commands we ran on the standard NIM master to create this stacked single tape:


# mksysb -i /dev/rmt0 
# tctl -f/dev/rmt0.1 fsf4 
# savevg -i -m {volume_group_name} -f/dev/rmt0.1 
# mt -f/dev/rmt0 rewind 

To restore the tape to the other NIM masters, we did the following:

Booted and restored the mksysb image from the stacked tape 
# tctl -f/dev/rmt0.1 fsf4 
# restvg volume_group_name 


Setup NIM:
----------

Needed Filesets:

You should have the following installed on your master

# lslpp -l | grep bos.sysmgt.nim

bos.sysmgt.nim.client  5.1.0.25  COMMITTED Network Install Manager
bos.sysmgt.nim.master  5.1.0.25  COMMITTED Network Install Manager
bos.sysmgt.nim.spot    5.1.0.25  COMMITTED Network Install Manager

These are available on the AIX Product CD 1.

If you need to install the NIM client, master and spot filesets

# installp -qaX -d /dev/cd0 bos.sysmgt.nim.master bos.sysmgt.nim.client bos.sysmgt.nim.spot 

At the end of the install you should see the below

Installation Summary
Name Level Part Event Result

bos.sysmgt.nim.client 5.3.0.0 USR APPLY SUCCESS
bos.sysmgt.nim.spot 5.3.0.0 USR APPLY SUCCESS
bos.sysmgt.nim.master 5.3.0.0 USR APPLY SUCCESS
bos.sysmgt.nim.client 5.3.0.0 ROOT APPLY SUCCESS

To install NIM:

You can use the fast path

# smitty nim_config_env

to setup the basic NIM environment for the first time. It needs a minimum of two pieces of information.
- Input device for installation images
- Primary network interface

Default values are provided for the remaining options. Once this smitty panel has been completed successfully,
the following actions will have been completed:
. NIM master initialized on the primary interface
. NIM daemons running
. lpp_source created and available
. SPOT resource created and available (Shared Product Object Tree)

# smitty nim_config_env

                 Configure a Basic NIM Environment (Easy Startup)

     Initialize the NIM Master:
     * Primary Network Interface for the NIM Master            []
     Basic Installation Resources:
     * Input device for installation images                    []
     * LPP_SOURCE Name                                         [lpp_source]
     * LPP_SOURCE Directory                                    [/export/lpp_source]
       Create new filesystem for LPP_SOURCE?                   [yes]
       Filesystem SIZE (MB)                                    [650]
       VOLUME GROUP for new filesystem                         [rootvg]
     * SPOT Name                                               [spot1]
     * SPOT Directory                                          [/export/spot]
       Create new filesystem for SPOT?                         [yes]
       Filesystem SIZE (MB)                                    [350]                   
       VOLUME GROUP for new filesystem                         [rootvg]
     ..
     ..

  
EZNIM:
------

The "smit eznim" option installs the "bos.sysmgt.nim.master" fileset and configures the NIM environment.
The configuration involves creating the NIM database and populating it with several entries.
Several basic NIM resources will then be created and defined in the NIM database.

1. smitty eznim
2. Select "Configure as a NIM Master"
3. Select "Setup the NIM Master Environment"
4. Verify the default selections for software source, volume group etc..

To display the NIM resources that have been created, do the following:
use "smit eznim_master_panel" fast path, or select "Show the NIM environment".


The nim_master_setup command:
-----------------------------

The nim_master_setup command installs the bos.sysmgt.nim.master fileset, configures the NIM master, 
and creates the required resources for installation, including a mksysb system backup. 

The nim_master_setup command uses the rootvg volume group and creates an "/export/nim" file system, by default. 
You can change these defaults using the volume_group and file_system options. The nim_master_setup command 
also allows you to optionally not create a system backup, if you plan to use a mksysb image 
from another system. The nim_master_setup usage is as follows:

Usage nim_master_setup: Setup and configure NIM master.
      nim_master_setup [-a mk_resource={yes|no}]
	[-a file_system=fs_name]
	[-a volume_group=vg_name]
	[-a disk=disk_name]
	[-a device=device]
	[-B] [-v]

	-B    Do not create mksysb resource.
	-v    Enable debug output.

	Default values:
	mk_resource = yes
	file_system = /export/nim
	volume_group = rootvg
	device = /dev/cd0

To install the NIM master fileset and initialize the NIM environment using install media located 
in device /dev/cd1, type: 
# nim_master_setup -a device=/dev/cd1

To initialize the NIM environment without creating NIM install resources, type: 
# nim_master_setup -a mk_resource=no

To initialize the NIM environment, create NIM install resources without creating a backup image, 
using install media located under mount point /cdrom, type: 
# nim_master_setup -a device=/cdrom -B

To define NIM resources in an existing NIM environment, using install media located in device /dev/cd0, 
and create a new file system named /export/resources/NIM under volume group nimvg, type: 
# nim_master_setup -a volume_group=nimvg -a file_system=/export/resources/NIM


The nim_clients_setup command:
------------------------------

The nim_clients_setup command is used to define your NIM clients, allocate the installation resources, 
and initiate a NIM BOS installation on the clients.

The nim_clients_setup command uses the definitions in the basic_res_grp resource to allocate 
the necessary NIM resources to perform a mksysb restore operation on the selected clients. 
The usage for nim_clients_setup is as follows: 

Usage nim_clients_setup: Setup and Initialize BOS install for NIM clients.
       nim_clients_setup [-m mksysb_resource]
	[-c] [-r] [-v] client_objects
-m    specify mksysb resource object name -OR- absolute file path.
-c    define client objects from client.defs file.
-r    reboot client objects for BOS install.
-v    Enables debug output.

Note: If no client object names are given, all clients in the NIM environment are enabled for 
BOS installation; unless clients are defined using the -c option. 

Examples:
To define client objects from /export/nim/client.defs file, initialize the newly defined clients 
for BOS install using resources from the basic_res_grp resource group, and reboot the clients to begin install, type: 
# nim_clients_setup -c -r

To initialize clients client1 and client2 for BOS install, using the backup file 
/export/resource/NIM/530mach.sysb as the restore image, type: 
# nim_clients_setup -m /export/resource/NIM/530mach.sysb \ client1 client2

To initialize all clients in the NIM environment for native (rte) BOS install using resources 
from the basic_res_grp resource group, type: 
# nim_clients_setup -n


How to define a standalone machine in NIM.

      nim -o define -t standalone \
                -a platform=chrp \
                -a if1="subnet-74 FQDN of Machine 0" \
                -a cable_type1=tp \
                -a net_settings1="speed duplex" \
                -a netboot_kernel="up or mp \
                name of resource

How to initiate an install of a machine from a mksysb image.

      nim -o bos_inst \
                -a source=mksysb \
                -a spot=aix520-01_spot \
                -a mksysb=base520-02-64bit_mksysb or base520-02-32bit_mksysb \
                -a accept_licenses=yes \
                -a preserve_res=yes \
                -a installp_flags="cNgXY" \
                -a fb_script=osg-mksysb-install_firstboot \
                name of resource

If you do not want the machine to be rebooted right now, then add the following:

     -a no_client_boot=yes

How to reset the NIM state of a machine.

      nim -o reset \
                name of resource
  
You can add the following to force a reset

                -a force=yes

If after you try to reset the state and try to install again and you are told that the resource is 
still allocated run the following: 

      nim -Fo deallocate \
	-a subclass=all 
	name of resource


How to take a mksysb of a machine.

        nim -o define -t mksysb \
	          -a server=master \
	          -a location=/export/nim/mksysb/<name of resource>.mksysb \
                  -a source=resource name of machine to take mksysb \
 	          -a mk_image=yes \
 	          -a mksysb_flags='e'\
	          -a exclude_files=osg-default_exclude \
	          name of resource 
    
How to make a NIM exclude file.

        nim -o define -t exclude_files \
                  -a server=master
	          -a location=/export/nim/misc/osg-default.exclude \
	          osg-default_exclude
    
How to define a script resource in NIM.

        nim -o define -t script \
	          -a server=master \
	          -a location=/export/nim/misc/<name of the resource>.sh \
	          name of resource
    
How to define a firstboot script

        nim -o define -t fb_script \
	          name of the mksysb
    
How to remove a NIM resource.

        nim -o remove \
	            -a rm_image=yes \ 
	          name of the mksysb
    
Note that this process doesremove the mksysb file on disk. 

Updating installed software
      nimclient -o cust \
                -a lpp_source=lpp source \
                -a installp_bundle=installp bundle


Remark about nimsh:
-------------------

Using the NIM service handler for client communication
NIM makes use of the remote shell server (rshd) when it performs remote execution on clients. The server provides 
remote execution facilities with authentication based on privileged port numbers from trusted hosts.

AIXr 5.3 uses NIM Service Handler (NIMSH) to eliminate the need for rsh services during NIM client communication. 
The NIM client daemon (NIMSH) uses reserved ports 3901 and 3902, and it installs as part of the 
bos.sysmgt.nim.client fileset.

NIMSH allows you to query network machines by hostname. NIMSH processes query requests and returns NIM client 
configuration parameters used for defining hosts within a NIM environment. Using NIMSH, you can define 
NIM clients without knowing any system or network-specific information.

While NIMSH eliminates the need for rsh, it does not provide trusted authentication based on key encryption. 
To use cryptographic authentication with NIMSH, you can configure OpenSSL in the NIM environment. 
When you install OpenSSL on a NIM clients, SSL socket connections are established during NIMSH 
service authentication. Enabling OpenSSL provides SSL key generation and includes all cipher suites 
supported in SSL version 3.


64.2 Complete NIM Example:
==========================

(This is actually a nice example).


1.Installing the NIM filesets<top>

The required filesets for a NIM master server and client

bos.sysmgt.nim.client
bos.sysmgt.nim.master
bos.sysmgt.nim.spot


These are available on the AIX Product CD 1.

Install the NIM client, master and spot filesets

# installp -qaX -d /dev/cd0 bos.sysmgt.nim.master bos.sysmgt.nim.client bos.sysmgt.nim.spot 

At the end of the install you should see the below

Installation Summary
--------------------
Name Level Part Event Result

bos.sysmgt.nim.client 5.3.0.0 USR APPLY SUCCESS
bos.sysmgt.nim.spot 5.3.0.0 USR APPLY SUCCESS
bos.sysmgt.nim.master 5.3.0.0 USR APPLY SUCCESS
bos.sysmgt.nim.client 5.3.0.0 ROOT APPLY SUCCESS


2.Create a tftpboot filesystem and mount it <top>

# crfs -v jfs2 -g rootvg -a size=381M -m /tftpboot -A yes -t rw
# mount /tftpboot


3.Configure the NIM environment (ensure you have AIX product CD 1 loaded in the CD or DVD Drive<top>

# smitty nim_config_env

Select the defaults as below, apart from the size of the /export/lpp_source and /export/spot filesystems. 
As we are going to be copying additional products into these areas we need a reasonable amount of space

You also need to specify the primary network interface and path to the CD or DVD drive

Initialize the NIM Master:
* Primary Network Interface for the NIM Master [en0] 
Basic Installation Resources:
* Input device for installation images [/dev/cd0] 
* LPP_SOURCE Name [lpp_source1]
* LPP_SOURCE Directory [/export/lpp_source] 
Create new filesystem for LPP_SOURCE? [yes] 
Filesystem SIZE (MB) [6553] 
VOLUME GROUP for new filesystem [rootvg] 
* SPOT Name [spot1]
* SPOT Directory [/export/spot] 
Create new filesystem for SPOT? [yes] 
Filesystem SIZE (MB) [2097] 
VOLUME GROUP for new filesystem [rootvg] 
Create Diskless/Dataless Machine Resources? [no] 
Specify Resource Name to Define:
ROOT (required for diskless and dataless) [root1]
DUMP (required for diskless and dataless) [dump1]
PAGING (required for diskless) [paging1]
HOME (optional) [home1]
SHARED_HOME (optional) [shared_home1]
TMP (optional) [tmp1]
Diskless/Dataless resource directory [/export/dd_resource]
Create new filesystem for resources? [yes] 
Filesystem SIZE (MB) [150] 
VOLUME GROUP for new filesystem [rootvg] 
Define NIM System Bundles? [yes] 
Add Machines from a Definition File? [no] 
Specify Filename []
* Remove all newly added NIM definitions [no] 
and filesystems if any part of this
operation fails?


4.Populating the lpp_source1 resource with additional software<top>

Copy the contents of AIX Volume 2,5, Expansion Pack and the AIX ToolBox to the lpp_source, for each CD 
enter the below

# nim -o update -a packages=all -a source=/dev/cd0 lpp_source1


5.Updating the SPOT and lpp_source1 resources<top>

If the AIX CD's you are using to create the lpp and spot resources is a base level AIX CD, and the clients 
you are intending to build are at a higher level than the base level. You will need to update the 
lpp and spot resources. 

Identify the location of your update filesets and update with the below command

# nim_update_all -l lpp_source1 -s spot1 -d /location/of/filesets -u -B

Once complete, confirm the maintenance level of the spot1 resource with the below command

# lsnim -l spot1

In this example, I have updated the lpp_source1 and spot1 to AIX 5.3 ML 3

spot1:
class = resources
type = spot
plat_defined = chrp
arch = power
bos_license = yes
Rstate = ready for use
prev_state = verification is being performed
location = /export/spot/spot1/usr
version = 5
release = 3
mod = 0
oslevel_r = 5300-01
alloc_count = 0
server = master
Rstate_result = success
mk_netboot = yes
mk_netboot = yes
mk_netboot = yes


6.Defining NIM machines <top>

Before you can start a BOS install task you need to define the machines you are going to install. 

You need details of

a.server hostname
b.platform
c.netboot_kernel
d.subnet mask
e.default gateway of the master
f.master name

To define a NIM client, for eg sp-tsm2

# nim -o define -t standalone -a platform=chrp \
-a netboot_kernel=mp \
-a if1="find_net sp-tsm2.caledonia.speedy.wan 0" \
-a net_definition="ent 255.255.255.0 10.110.72.1 master" sp-tsm2

If you are adding a machine that is already running, you need to ensure the bos.sysmgt.nim.client fileset 
is installed and issue the following command on the client

note: change the name= and master= to match the client and master you are adding

# niminit -a name=pr-testerp -a master=pr-tsm -a pif_name=en0


The output from the following command will show your newly defined machine

# lsnim -c machines

[sp-tsm1] scripts # lsnim -c machines
master machines master
sp-tsm2 machines standalone

To get detailed output of your newly created machine, run the below

[sp-tsm1] scripts # lsnim -l sp-tsm2
sp-tsm2:
class = machines
type = standalone
connect = shell
platform = chrp
netboot_kernel = mp
if1 = speedy_network sp-tsm2.caledonia.speedy.wan 0
cable_type1 = N/A
Cstate = ready for a NIM operation
prev_state = not running
Mstate = currently running
cpuid = 00C13E8A4C00
Cstate_result = success


7.Configuring client communications

To configure SSL client communication as opposed to the traditional and un-secure rhost method perform the following

a.On the master server and clients install the openssl rpm from the AIX toolbox

# rpm -ivh openssl-0.9.7g-1

b.Next configure the NIM master for SSL

# nimconfig -c 

c.Then on each client configure as below

# mv /etc/niminfo /etc/niminfo.bak
# niminit -aname=pr-testdb -amaster=pr-tsm -a connect=nimsh
# nimclient -C

d.On the NIM master test the nimsh communication

# nim -o lslpp pr-testdb


8.Defining NIM groups<top>

Once you have defined your machines, add them to add mac_group. This will aid administration for future 
installation tasks

To define a group containing the sp-tsm2 machine run the below command
# nim -o define -t mac_group -a add_member=sp-tsm2 speedy_mac_group

For each machine to be added, use the option and argument `-a add_member=<hostname>' where <hostname> is the name 
of the server you are adding


9.Defining a bosinst.data file<top>

A bosinst data file is a file contained answers to questions usually asked during a manual BOS install. 
A standard Red Squared bosinst.data file contains the below information and is stored in the /export/bosinst 
directory. (note the highlighted areas, specifically the disk location. We will be mirroring the root disk 
as part of the post task during the BOS install procedure)

control_flow:
CONSOLE = Default
INSTALL_METHOD = overwrite
PROMPT = no
EXISTING_SYSTEM_OVERWRITE = yes
INSTALL_X_IF_ADAPTER = yes
RUN_STARTUP = yes
RM_INST_ROOTS = no
ERROR_EXIT =
CUSTOMIZATION_FILE =
TCB = no
INSTALL_TYPE =
BUNDLES =
RECOVER_DEVICES = no
BOSINST_DEBUG = no
ACCEPT_LICENSES = yes
DESKTOP = NONE
INSTALL_DEVICES_AND_UPDATES = yes
IMPORT_USER_VGS =
ENABLE_64BIT_KERNEL = yes
CREATE_JFS2_FS = yes
ALL_DEVICES_KERNELS = yes
GRAPHICS_BUNDLE = yes
MOZILLA_BUNDLE = no
KERBEROS_5_BUNDLE = no
SERVER_BUNDLE = yes
REMOVE_JAVA_118 = no
HARDWARE_DUMP = yes
ADD_CDE = yes
ADD_GNOME = no
ADD_KDE = no
ERASE_ITERATIONS = 0
ERASE_PATTERNS =
target_disk_data:
LOCATION =
SIZE_MB =
HDISKNAME = hdisk0

locale:
BOSINST_LANG = en_US
CULTURAL_CONVENTION = en_GB
MESSAGES = en_US
KEYBOARD = en_GB

large_dumplv:
DUMPDEVICE=lg_dumplv
SIZEGB=2

dump:
PRIMARY=/dev/lg_dumplv
SECONDARY=/dev/sysdumpnull
FORCECOPY=no
COPYDIR=/dump
ALLOWS_ALLOW=yes

Once you have created the bosinst.data file, you need to define it to the NIM environment with the below command

# nim -o define -t bosinst_data -a server=master \


10.Defining a post script resource<top>

A script resource is used as part of the bosinst task. The resource contains commands to be executed 
on the NIM client after the BOS install has completed. The inst_script file should reside in the "/export/bosinst" 
directory.

The below inst_script contains commands relevant to an Oracle database server

Note: In all instances the root disk should be mirrored

/usr/sbin/chdev -l sys0 -a maxuproc=5000
/usr/sbin/chdev -l sys0 -a autorestart=true
/usr/sbin/vmo -o lru_file_repage=0
/usr/sbin/vmo -o strict_maxclient=0
/usr/sbin/vmo -o maxperm%=45
/usr/sbin/vmo -o maxclient%=45
/usr/sbin/vmo -o minperm%=15
/usr/sbin/tunchange -f nextboot -t vmo -o lru_file_repage=0
/usr/sbin/tunchange -f nextboot -t vmo -o strict_maxclient=0
/usr/sbin/tunchange -f nextboot -t vmo -o maxperm%=45
/usr/sbin/tunchange -f nextboot -t vmo -o maxclient%=45
/usr/sbin/tunchange -f nextboot -t vmo -o minperm%=15
/usr/sbin/chfs -a size=+1024M /usr
/usr/sbin/chfs -a size=+512M /opt
/usr/sbin/chfs -a size=+512M /home
/usr/sbin/chfs -a size=+512M /tmp
/usr/sbin/chfs -a size=+512M /var
/usr/bin/mkgroup id=500 oinstall
/usr/bin/mkuser id=1001 groups=oinstall oracle
/usr/bin/mkgroup id=501 red2ops
/usr/bin/mkuser id=1002 groups=red2ops red2ops
/usr/bin/mkdir /home/root
/usr/bin/chuser home=/home/root root
/usr/sbin/crfs -v jfs2 -g rootvg -a size=128M -m /usr/local -A yes -t rw
/usr/sbin/crfs -v jfs2 -g rootvg -a size=128M -m /usr/red2 -A yes -t rw
/usr/sbin/crfs -v jfs2 -g rootvg -a size=8G -m /oracle_home -A yes -t rw
/usr/sbin/crfs -v jfs2 -g rootvg -a size=3G -m /oracle_base -A yes -t rw
/usr/sbin/crfs -v jfs2 -g rootvg -a size=3G -m /dump -A yes -t rw
/usr/sbin/mount -a
/usr/bin/chown oracleinstall /oracle_home
/usr/bin/chown oracleinstall /oracle_base
/usr/bin/echo 'sp-tsm1 root' >> /home/root/.rhosts
/usr/bin/rcp sp-tsm1:/home/root/.profile /home/root/.profile
/usr/bin/rcp sp-tsm1:/home/root/.kshrc /home/root/.kshrc
/usr/bin/rcp sp-tsm1:/etc/security/limits.nim /etc/security/limits
/usr/sbin/extendvg rootvg hdisk1
/usr/sbin/mirrorvg rootvg
/usr/sbin/syncvg -v rootvg
/usr/sbin/bosboot -a -d /dev/hdisk1
/usr/sbin/chvg -a 'y' -Q 'n' -x 'n' rootvg
/usr/bin/bootlist -m normal hdisk0 hdisk1
/usr/sbin/chps -s16 hd6
/usr/bin/sysdumpdev -K

Once created, define the script to the NIM server with the below command

# nim -o define -t script -a server=master \
-a location=/export/bosinst/inst_script inst_script


Details of your newly created script resource can be viewed with the below

[sp-tsm1] bosinst # lsnim -l speedy_inst_script
speedy_inst_script:
class = resources
type = script
Rstate = ready for use
prev_state = unavailable for use
location = /export/bosinst/inst_script
alloc_count = 0
server = master


11.Backing up and restoring the NIM database<top>

Now that you have created a number of resources and machines, it would be a good idea to add a cron job 
to take a backup of the NIM database on a weekly basis. This will by default be picked up by Tivoli and mksysb 
then sent to tape.

Create an executable script called nim_backup_db.sh located in /usr/red2/scripts

#!/bin/sh
#--------------------------------------------------------------------------------
#
# File : nim_backup_db.sh
#
# Author : Steve Burgess
#
# Description : Wrapper script to backup the NIM database
#
# Change History:
#
# Date Version Author Description
# ------- ------- ---------------- -----------------------------
#--------------------------------------------------------------------------------

#-------------------------
# Backup The NIM database
#-------------------------

/usr/lpp/bos.sysmgt/nim/methods/m_backup_db /etc/objrepos/nimdb_backup 2>&1 | tee /usr/red2/logs/nim_backup.log

if [ $? -ne 0 ]
then
echo "`date +%Y%m%d` NIM_BACKUP_FAILURE" | tee -a /usr/red2/logs/nim_backup.log
else
echo "`date +%Y%m%d` NIM_BACKUP_SUCCESS" | tee -a /usr/red2/logs/nim_backup.log
fi


Add the script to roots crontab (as below)


# Backup the NIM database once a week
1 00 * * 0 /usr/red2/scripts/nim_backup_db.sh > /dev/null 2>&1


To restore the NIM database following corruption or applying to another server

# /usr/lpp/bos.sysmgt/nim/methods/m_restore_db -f /etc/objrepos/nimdb.backup


12.Initiating a BOS Installation <top>

You are now ready to initiate a BOS install for one of your defined machines. Run the below command 
to initate a BOS install for sp-tsm2:

nim -o bos_inst -a source=rte \
-a lpp_source=lpp_source1 \
-a spot=spot1 \
-a filesets="Java14_64 bos.adt bos.iconv X11.adt vac.C vac.aix50 tivoli.tsm.client.api.32bit tivoli.tsm.client.ba openssl-0.9.7d-1.aix5.1.ppc.rpm openssh.base
openssh.license lsof-4.61-3.aix5.1.ppc.rpm zip-2.3-3.aix4.3.ppc.rpm unzip-5.51-1.aix5.1.ppc.rpm" \
-a accept_licenses=yes \
-a script=inst_script \
-a boot_client=no \
-a bosinst_data=bosinst \
sp-tsm2

This will make the previously created resources, inst_script and bosinst available to the server. 

Additional filesets, as defined in the line

-a filesets=<fileset names>

will be installed as part of the installation procedure. For additional filesets, append them to the filesets line

Next you need to follow the below procedure to boot your machine from the NIM server

� Begin with your machine turned off. 
If the system provided requires a System Management Services (SMS) diskette, insert it into the diskette drive of the client and turn on the machine. If you do not insert an SMS diskette at this time and one is required, you will be prompted to insert one later.
A graphics image is displayed on your screen. Press the F1 key as icons begin to display from left to right on the bottom of your display. 
The System Management Services menu displays on your screen. Select the Utilities option. 
From the System Management Services Utilities menu, select the Remote Initial Program Load Setup option. 
From the Network Parameters screen, select the IP Parameters option.
Set or change the values displayed so they are correct for snhent01. 

Specify the IP address of: 
The client machine you are booting in the client address field. : 10.20.5.253 
Your NIM server in the bootp server address field. : 10.20.5.254 
Your client's gateway in the gateway address field. 10.20.5.1 
Specify the subnet mask of 255.255.255.0 for the client machine if you are prompted for one in the subnet mask field. All machines in your subnet have the same subnet mask. 
After you specify the addresses, press Enter to save the addresses and continue. 
The Network Parameters screen is displayed. Select the Ping option. 
Select the network adapter to be used as the client's boot device. 
Verify that the displayed addresses are the same as the addresses you specified for your boot device. 
If the addresses are incorrect, press Esc until you return to the main menu. Then, go back to step 5. 
If the addresses are correct, press Enter to perform the ping test. The ping test may take several seconds to complete. 
If the ping test fails, verify that the addresses are correct, and perform network problem determination if necessary. If the ping test completes successfully, press Enter to acknowledge the success message. Then, press Esc until you return to the System Management Services menu. 
From the System Management Services menu, choose the Select Boot Devices option.
Select the network adapter to be used for the network boot from the list of displayed bootable devices. Be sure to select the correct network type Ethernet. After making your selection, the machine will boot over the network.

Following successful BOS installation, you will need to confirm the post tasks you defined in your inst_script have completed. Anything that has failed will need to be run manually


13.Taking a mksysb of your new server<top>

To take a mksysb of the newly created server onto the NIM server, you will need to create an new filesystem (not in rootvg) to hold the mksysb images. The filesystem should have a mount point of /export/mksysb_clients and of the type jfs2. To create a 20gb filesystem in tsmvg run the below command

# crfs -v jfs2 -g tsmvg -a size=20G -m /export/mksysb_clients -A yes -t rw


To take a mksysb of a NIM client, run the below command

nim -o define -t mksysb \
-a server=master \
-a location=/export/mksysb_clients/sp-tsm2 \
-a source=sp-tsm2 \
-a mk_image=yes \
-a mksysb_flags=-e \
sp-tsm2_image 

This will create a mksysb resource, as below

[sp-tsm1] scripts # lsnim -l sp-tsm2_image
sp-uat1_image:
class = resources
type = mksysb
arch = power
Rstate = ready for use
prev_state = unavailable for use
location = /export/mksysb_clients/sp-tsm2/sp-tsm2.mksysb
version = 5
release = 3
mod = 0
oslevel_r = 5300-03
alloc_count = 0
server = master


14.Restoring a host from a mksysb<top>

The procedure of restoring a host from a mksysb is fairly simple. In this example, we restore sp-tsm2

Enter the below command to initiate the restore from the NIM server

# nim -o bos_inst -a source=mksysb \
-a mksysb=sp-tsm2_image \
-a lpp_source=lpp_source1 \
-a spot=spot1 \
-a accept_licenses=yes \
-a boot_client=no \
sp-tsm2

Once entered, refer to section 11 to boot the server you are recovering over the network

15.Booting the server into diagnostics<top>

Occasionally you may need to boot the server into diagnostic mode to allow you to resolve a hardware issue. To do this, first enter the below


# nim -o diag -a spot=spot1 sp-tsm2

Once entered, refer to section 11 to boot the server into diagnostics over the network

16.Booting a server into maintenance<top>

Occasionally you may need to boot the server into maintenance mode. To do this, first enter the below

# nim -o maint_boot spot=spot1 sp-tsm2

Once entered, refer to section 11 to boot the server into diagnostics over the network

After successfully booting and defining the console, the System Maintenance menu is displayed. The maintenance menu options and their descriptions are described below. 

Access a Root Volume Group
This option allows you to activate the root volume group and start the maintenance shell with a full set of commands.
Copy a System Dump to Removable Media
This option allows you to copy a previous system dump to external media.
Access Advanced Maintenance Function
This option allows you to start a maintenance shell with a limited set of commands.


17.Installing additional software on a client<top>

Occasionally you may need to install additional filesets on a client. You first need to add the software to the lpp_source by simply copying it to the lpp_source directory. You then need to action the below command

# nim -Fo check lpp_source1

Following that you can initiate the install on the client

# nim -o cust -a lpp_source=lpp_source1 -a filesets=bos.adt \
-a installp_flags="a c g X p" sp-tsm2

Note: refer to the installp man page for options on installp_flags

To install a pre-defined or new installp bundle (output from # lsnim -t installp_bundle)

# nim -o cust -a lpp_source=lpp_source1 -a installp_bundle=openssh_server -a installp_flags=" a c g X p" pr-testdb

18.Update software on a client<top>

To update a client with the whole contents of an lpp resource, enter the below

# nim -o update -a packages=all -a source=lpp_source1 sp-tsm2

19.To add a new lpp resource that contains a new AIX level, then apply that update to a NIM client. <top>

First copy the contents of the ML to a filesystem area, then run

# nim -o define -t lpp_source -a location=/export/lpp_source/aix_maint_ML3 \ 
-a server=master aix_maint_ML3

To update a server from the new aix maint level # nim -o cust -a lpp_source=aix_maint_ML3 -a fixes=update_all \ 
-a installp_flags="a c g X p" sp-tsm2     Tutorial Tools 
 Show Printable Version  
 Email this Page  
 

65. ACCOUNTING:
===============

General in unix:
----------------

The following is a step-by-step summary of how UNIX system accounting works: 

When the UNIX system is switched into multiuser state, the /usr/lib/acct/startup program is executed. 
The startup program executes several other programs that invoke accounting: 
acctwtmp, turnacct, and remove. 

- acctwtmp adds a ``boot'' record to /var/adm/wtmp. In this record, the system name is shown 
  as the login name in the wtmp record. 

- turnacct, invoked with the on option, begins process accounting. Specifically, turnacct on executes 
  the accton program with the argument /var/adm/pacct. 

- remove ``cleans up'' the saved pacct and wtmp files left in the sum directory by runacct. 

Raw Accounting Data

The login and init programs record connect sessions by writing records into /var/adm/wtmp. 
Any date changes (made by running date with an argument) are also written to /var/adm/wtmp. 
Reboots and shutdowns (via acctwtmp) are also recorded in /var/adm/wtmp. 
When a process ends, the kernel writes one record per process, in the form of acct.h, in the /var/adm/pacct file. 

Two programs track disk usage by login: acctdusg and diskusg. They are invoked by the shell script dodisk. 

Every hour cron executes the ckpacct program to check the size of /var/adm/pacct. 
If the file grows past 500 blocks (default), turnacct switch is executed. (The turnacct switch program 
moves the pacct file and creates a new one.) The advantage of having several smaller pacct files 
becomes apparent when trying to restart runacct if a failure occurs when processing these records. 

If the system is shut down using shutdown, the shutacct program is executed automatically. 
The shutacct program writes a reason record into /var/adm/wtmp and turns off process accounting. 

If you provide services on a request basis (such as file restores), you can keep billing records 
by login by using the chargefee program. It allows you to add a record to /var/adm/fee each time a user 
incurs a charge. The next time runacct is executed, this new record is picked up and merged into the total 
accounting records. 

runacct is executed via cron each night. It processes the accounting files /var/adm/pacct?, 
/var/adm/wtmp, /var/adm/fee, and /var/adm/acct/nite/disktacct to produce command summaries 
and usage summaries by login. 

/usr/lib/acct/prdaily program is executed on a daily basis by runacct to write the daily accounting 
information collected by runacct (in ASCII format) in /var/adm/acct/sum/rprtMMDD. 

The monacct program should be executed on a monthly basis (or at intervals determined by you, 
such as the end of every fiscal period). The monacct program creates a report based on data stored 
in the sum directory that has been updated daily by runacct. After creating the report, monacct 
``cleans up'' the sum directory to prepare the directory's files for the new runacct data. 


On AIX:
-------

- Connect time accounting:
Connect time data is collected by the init and the login command. When you login, the login program
writes a record in the "/etc/utmp" file. This record includes your user name, the date and time of the login,
and the login port. Commands such as who, use this file to find out which users are logged into
the various display stations. 
If the /var/adm/wtmp connect-time accounting file exists, the login command adds a copy of this 
login record to it.

When your login program ends (when you logout), the init command records the end of the session
by writing another record in the "/var/adm/wtmp" file.
Both the login and logout records have the form described in the utmp.h file.

- Shutdown:
acctwtmp command:
The "acctwtmp" command also writes special entries in the /var/adm/wtmp file concerning
system shutdowns and startups.

- Process accounting:
accton command:
The system collects data on resource usage for each process as it runs, including
the memory use, elapsed time and processor time, user and group id under which the process runs etc..
The "accton" command records these data in the "/var/adm/pacct" file.

- Disk usage accounting:
dodisk command:
The dodisk command, run as specified by the cron demon, periodically writes disk-usage records
to the "/var/adm/acct/nite(x)/dacct" file. To accomplish this, the dodisk command calls other commands.
Depending upon the thoroughness of the accounting search, the diskusg command or the acctdusg command
can be used to collect data. The acctdisk command is used to write a total accounting record.
The total accounting record, in turn, is used by the acctmerg command to prepare the daily
accounting report.

- Printer usage accounting:
enq command:
The collection of printer usage data is a cooperative effort between the enq command and the queuing demon.
The enq command enqueues the user name, job number, and the name of the file to be printed.
After the file is printed, the qdaemon command writes an ascii record to a file, usually the
"/var/adm/qacct" file, containing the user name, user id, and the number of pages printed.
You can sort these records and convert them to total accounting records.


66. Combining cards, Link Aggregation,EtherChannel in AIX:
==========================================================

Note 1:
-------

EtherChannel and IEEE 802.3ad Link Aggregation are network port aggregation technologies that allow 
several Ethernet adapters to be aggregated together to form a single pseudo Ethernet device. 
For example, ent0 and ent1 can be aggregated into an EtherChannel adapter called ent3; interface en3 
would then be configured with an IP address. The system considers these aggregated adapters as one adapter. 
Therefore, IP is configured over them as over any Ethernet adapter. In addition, all adapters 
in the EtherChannel or Link Aggregation are given the same hardware (MAC) address, so they are treated 
by remote systems as if they were one adapter. Both EtherChannel and IEEE 802.3ad Link Aggregation require 
support in the switch so it is aware which switch ports should be treated as one.

The main benefit of EtherChannel and IEEE 802.3ad Link Aggregation is that they have the network bandwidth 
of all of their adapters in a single network presence. If an adapter fails, network traffic is automatically 
sent on the next available adapter without disruption to existing user connections. The adapter is automatically 
returned to service on the EtherChannel or Link Aggregation when it recovers.

There are some differences between EtherChannel and IEEE 802.3ad Link Aggregation. Consider the differences 
given in Table 15 to determine which would be best for your situation.

Table 15. 
Differences between EtherChannel and IEEE 802.3ad Link Aggregation. 

EtherChannel                                 IEEE 802.3ad 
Requires switch configuration                Little, if any, configuration of switch required to form aggregation. 
                                             Some initial setup of the switch may be required. 
Supports different packet distribution modes Supports only standard distribution mode 

Beginning with AIX 5.2 with 5200-03, Dynamic Adapter Membership functionality is available. 
This functionality allows you to add or remove adapters from an EtherChannel without having to disrupt 
any user connections. For more details, see Dynamic Adapter Membership.

Supported Adapters
EtherChannel and IEEE 802.3ad Link Aggregation are supported on the following Ethernet adapters:

10/100 Mbps Ethernet PCI Adapter 
Universal 4-Port 10/100 Ethernet Adapter 
10/100 Mbps Ethernet PCI Adapter II 
10/100/1000 Base-T Ethernet PCI Adapter 
Gigabit Ethernet-SX PCI Adapter 
10/100/1000 Base-TX Ethernet PCI-X Adapter 
Gigabit Ethernet-SX PCI-X Adapter 
2-port 10/100/1000 Base-TX Ethernet PCI-X Adapter 
2-port Gigabit Ethernet-SX PCI-X Adapter 
Gigabit Ethernet-SX Adapter 
10 Gigabit Ethernet PCI-X Adapter


Only the basic EtherChannel functions (operating exclusively in standard or round-robin mode without a backup) 
are supported in the following Ethernet adapters:

PCI Ethernet BNC/RJ-45 Adapter 
PCI Ethernet AUI/RJ-45 Adapter
For additional release information about new adapters, see the AIX Release Notes that correspond to your 
level of AIXr.

Important:
Mixing adapters of different speeds in the same EtherChannel, even if one of them is operating 
as the backup adapter, is not supported. This does not mean that such configurations will not work. 
The EtherChannel driver makes every reasonable attempt to work even in a mixed-speed scenario.
For information on configuring and using EtherChannel, see EtherChannel. For more information on configuring 
and using IEEE 802.3ad link aggregation, see IEEE 802.3ad Link Aggregation. For information on the different 
AIX and switch configuration combinations and the results they produce, see Interoperability Scenarios.

EtherChannel
The adapters that belong to an EtherChannel must be connected to the same EtherChannel-enabled switch. 
You must manually configure this switch to treat the ports that belong to the EtherChannel 
as an aggregated link. Your switch documentation might refer to this capability as link aggregation 
or trunking.

Traffic is distributed across the adapters in either the standard way (where the adapter over which 
the packets are sent is chosen depending on an algorithm) or on a round-robin basis (where packets 
are sent evenly across all adapters). Incoming traffic is distributed in accordance to the 
switch configuration and is not controlled by the EtherChannel operation mode.

In AIX, you can configure multiple EtherChannels per system, but it is required that all the links 
in one EtherChannel are attached to a single switch. Because the EtherChannel cannot be spread across 
two switches, the entire EtherChannel is lost if the switch is unplugged or fails. To solve this problem, 
a new backup option available in AIX 5.2 and later keeps the service running when the main EtherChannel fails. 
The backup and EtherChannel adapters should be attached to different network switches, which must be 
inter-connected for this setup to work properly. In the event that all of the adapters in the EtherChannel fail, 
the backup adapter will be used to send and receive all traffic. When any link in the EtherChannel is restored, 
the service is moved back to the EtherChannel.

For example, ent0 and ent1 could be configured as the main EtherChannel adapters, and ent2 as the backup adapter, 
creating an EtherChannel called ent3. Ideally, ent0 and ent1 would be connected to the same 
EtherChannel-enabled switch, and ent2 would be connected to a different switch. In this example, all traffic 
sent over en3 (the EtherChannel's interface) would be sent over ent0 or ent1 by default (depending on the 
EtherChannel's packet distribution scheme), whereas ent2 will be idle. If at any time both ent0 and ent1 fail, 
all traffic would be sent over the backup adapter, ent2. When either ent0 or ent1 recover, they will once again 
be used for all traffic.

Network Interface Backup, a mode of operation available for EtherChannel in AIX 4.3.3 and AIX 5.1, 
protects against a single point of Ethernet network failure. No special hardware is required to use 
Network Interface Backup, but the backup adapter should be connected a separate switch for maximum reliability. 
In Network Interface Backup mode, only one adapter at a time is actively used for network traffic. 
The EtherChannel tests the currently-active adapter and, optionally, the network path to a user-specified node. 
When a failure is detected, the next adapter will be used for all traffic. Network Interface Backup provides 
detection and failover with no disruption to user connections. Network Interface Backup was originally 
implemented as a mode in the EtherChannel SMIT menu. In AIX 5.2 and later, the backup adapter provides 
the equivalent function, so the mode was eliminated from the SMIT menu. To configure network interface backup 
in AIX 5.2 and later, see Configure Network Interface Backup.

Configuring EtherChannel:
-------------------------
Follow these steps to configure an EtherChannel.

Considerations
You can have up to eight primary Ethernet adapters and only one backup Ethernet adapter per EtherChannel. 
You can configure multiple EtherChannels on a single system, but each EtherChannel constitutes an additional 
Ethernet interface. The no command option, ifsize, may need to be increased to include not only the 
Ethernet interfaces for each adapter, but also any EtherChannels that are configured. 
In AIX 5.2 and earlier, the default ifsize is eight. In AIX 5.2 and later, the default size is 256. 
You can use any supported Ethernet adapter in an EtherChannel (see Supported Adapters). However, the Ethernet adapters 
must be connected to a switch that supports EtherChannel. See the documentation that came with your switch 
to determine if it supports EtherChannel (your switch documentation may refer to this capability also as 
link aggregation or trunking). 
All adapters in the EtherChannel should be configured for the same speed (100 Mbps, for example) and should be 
full duplex. 
The adapters used in the EtherChannel cannot be accessed by the system after the EtherChannel is configured. 
To modify any of their attributes, such as media speed, transmit or receive queue sizes, and so forth,
you must do so before including them in the EtherChannel. 
The adapters that you plan to use for your EtherChannel must not have an IP address configured on them 
before you start this procedure. When configuring an EtherChannel with adapters that were previously configured 
with an IP address, make sure that their interfaces are in the detach state. The adapters to be added 
to the EtherChannel cannot have interfaces configured in the up state in the Object Data Manager (ODM), 
which will happen if their IP addresses were configured using SMIT. This may cause problems bringing up 
the EtherChannel when the machine is rebooted because the underlying interface is configured before the 
EtherChannel with the information found in ODM. Therefore, when the EtherChannel is configured, it finds 
that one of its adapters is already being used. To change this, before creating the EtherChannel, 
type smit chinet, select each of the interfaces of the adapters to be included in the EtherChannel, 
and change its state value to "detach". This will ensure that when the machine is rebooted the EtherChannel 
can be configured without errors. 
For more information about ODM, see Object Data Manager (ODM) in AIX 5L Version 5.3 
General Programming Concepts: Writing and Debugging Programs.

If you will be using 10/100 Ethernet adapters in the EtherChannel, you may need to enable link polling 
on those adapters before you add them to the EtherChannel. Type "smit chgenet" at the command line. 
Change the Enable Link Polling value to yes, and press Enter. 

Note:
In AIX 5.2 with 5200-03 and later, enabling the link polling mechanism is not necessary. The link poller 
will be started automatically.
If you plan to use jumbo frames, you may need to enable this feature in every adapter before creating 
the EtherChannel and in the EtherChannel itself. Type smitty chgenet at the command line. 
Change the Enable Jumbo Frames value to yes and press Enter. Do this for every adapter for which you want 
to enable Jumbo Frames. You will enable jumbo frames in the EtherChannel itself later. 

Note:
In AIX 5.2 and later, enabling the jumbo frames in every underlying adapter is not necessary once it is enabled 
in the EtherChannel itself. The feature will be enabled automatically if you set the Enable Jumbo Frames attribute to yes.

Configure an EtherChannel:
--------------------------
Type "smit etherchannel" at the command line. 
Select Add an EtherChannel / Link Aggregation from the list and press Enter. 
Select the primary Ethernet adapters that you want on your EtherChannel and press Enter. If you are planning to use 
EtherChannel backup, do not select the adapter that you plan to use for the backup at this point. 
The EtherChannel backup option is available in AIX 5.2 and later. 

Note:
The Available Network Adapters displays all Ethernet adapters. If you select an Ethernet adapter that is already 
being used (has an interface defined), you will get an error message. You first need to detach this interface 
if you want to use it.

Enter the information in the fields according to the following guidelines: 

- EtherChannel / Link Aggregation Adapters: You should see all primary adapters that you are using 
in your EtherChannel. You selected these adapters in the previous step. 

- Enable Alternate Address: This field is optional. Setting this to yes will enable you to specify 
a MAC address that you want the EtherChannel to use. If you set this option to no, the EtherChannel 
will use the MAC address of the first adapter. 

- Alternate Address: If you set Enable Alternate Address to yes, specify the MAC address that you want 
to use here. The address you specify must start with 0x and be a 12-digit hexadecimal address 
(for example, 0x001122334455). 

- Enable Gigabit Ethernet Jumbo Frames: This field is optional. In order to use this, your switch 
must support jumbo frames. This will only work with a Standard Ethernet (en) interface, 
not an IEEE 802.3 (et) interface. Set this to yes if you want to enable it. 

- Mode: You can choose from the following modes: 

standard: In this mode the EtherChannel uses an algorithm to choose which adapter it will send 
the packets out on. The algorithm consists of taking a data value, dividing it by the number of adapters 
in the EtherChannel, and using the remainder (using the modulus operator) to identify the outgoing link. 
The Hash Mode value determines which data value is fed into this algorithm (see the Hash Mode attribute 
for an explanation of the different hash modes). For example, if the Hash Mode is standard, it will use 
the packet's destination IP address. If this is 10.10.10.11 and there are 2 adapters in the EtherChannel, 
(1 / 2) = 0 with remainder 1, so the second adapter is used (the adapters are numbered starting from 0). 
The adapters are numbered in the order they are listed in the SMIT menu. This is the default operation mode. 

round_robin: In this mode the EtherChannel will rotate through the adapters, giving each adapter one packet 
before repeating. The packets may be sent out in a slightly different order than they were given to the 
EtherChannel, but it will make the best use of its bandwidth. It is an invalid combination to select 
this mode with a Hash Mode other than default. If you choose the round-robin mode, leave the Hash Mode 
value as default. 

netif_backup: This option is available only in AIX 5.1 and AIX 4.3.3. In this mode, the EtherChannel 
will activate only one adapter at a time. The intention is that the adapters are plugged into different 
Ethernet switches, each of which is capable of getting to any other machine on the subnet or network. 
When a problem is detected either with the direct connection (or optionally through the inability 
to ping a machine), the EtherChannel will deactivate the current adapter and activate a backup adapter. 
This mode is the only one that makes use of the Internet Address to Ping, Number of Retries, and 
Retry Timeout fields. 
Network Interface Backup Mode does not exist as an explicit mode in AIX 5.2 and later. 
To enable Network Interface Backup Mode in AIX 5.2 and later, you must configure one adapter in the 
main EtherChannel and a backup adapter. For more information, see Configure Network Interface Backup.

8023ad: This options enables the use of the IEEE 802.3ad Link Aggregation Control Protocol (LACP) 
for automatic link aggregation. For more details about this feature, see IEEE 802.3ad Link Aggregation.

Hash Mode: You can choose from the following hash modes, which will determine which data value will be used 
by the algorithm to determine the outgoing adapter: 

default: In this hash mode the destination IP address of the packet will be used to determine the outgoing adapter. 
For non-IP traffic (such as ARP), the last byte of the destination MAC address is used to do the calculation. 
This mode will guarantee packets are sent out over the EtherChannel in the order they were received, but it may 
not make full use of the bandwidth. 
src_port: In this hash mode the source UDP or TCP port value of the packet will be used to determine the 
outgoing adapter. If the packet is not UDP or TCP traffic, the last byte of the destination IP address will be used. 
If the packet is not IP traffic, the last byte of the destination MAC address will be used. 
dst_port: In this hash mode the destination UDP or TCP port value of the packet will be used to determine 
the outgoing adapter. If the packet is not UDP or TCP traffic, the last byte of the destination IP will be used. 
If the packet is not IP traffic, the last byte of the destination MAC address will be used. 
src_dst_port: In this hash mode both the source and destination UDP or TCP port values of the packet will be used 
to determine the outgoing adapter (specifically, the source and destination ports are added and then divided 
by two before being fed into the algorithm). If the packet is not UDP or TCP traffic, the last byte of the 
destination IP will be used. If the packet is not IP traffic, the last byte of the destination MAC address 
will be used. This mode can give good packet distribution in most situations, both for clients and servers. 

Note:

It is an invalid combination to select a Hash Mode other than default with a Mode of round_robin.
To learn more about packet distribution and load balancing, see Load-balancing options. 

Backup Adapter: This field is optional. Enter the adapter that you want to use as your EtherChannel backup. 
EtherChannel backup is available in AIX 5.2 and later. 

Internet Address to Ping: This field is optional and only takes effect if you are running Network Interface 
Backup mode or if you have only one adapter in the EtherChannel and a backup adapter. The EtherChannel will 
ping the IP address or host name that you specify here. If the EtherChannel is unable to ping this address 
for the Number of Retries times in Retry Timeout intervals, the EtherChannel will switch adapters. 

Number of Retries: Enter the number of ping response failures that are allowed before the EtherChannel 
switches adapters. The default is three. This field is optional and valid only if you have set an 
Internet Address to Ping. 

Retry Timeout: Enter the number of seconds between the times when the EtherChannel will ping the Internet Address 
to Ping. The default is one second. This field is optional and valid only if you have set an Internet Address to Ping.

Press Enter after changing the desired fields to create the EtherChannel. 

Configure IP over the newly-created EtherChannel device by typing smit chinet at the command line. 
Select your new EtherChannel interface from the list. 
Fill in all the required fields and press Enter.

Configure Network Interface Backup
Network Interface Backup protects against a single point of network failure by providing failure detection 
and failover with no disruption to user connections. When operating in this mode, only one adapter is active 
at any given time. If the active adapter fails, another adapter in the EtherChannel will be used for all traffic. 
When operating in Network Interface Backup mode, it is not necessary to connect to EtherChannel-enabled switches.

The Network Interface Backup setup is most effective when the adapters are connected to different network switches, 
as this provides greater redundancy than connecting all adapters to one switch. When connecting to different switches, 
make sure there is a connection between the switches. This provides failover capabilities from one adapter 
to another by ensuring that there is always a route to the currently-active adapter.

In releases prior to AIX 5.2, Network Interface Backup mode was implemented as an explicit mode of operation 
in the EtherChannel SMIT menu. In AIX 5.2 and later, however, the backup adapter functionality provides 
the equivalent behavior, so the mode was eliminated from the SMIT menu.

Additionally, AIX 5.2 and later versions provide priority, meaning that the adapter configured in the primary 
EtherChannel will be used preferentially over the backup adapter. As long as the primary adapter is functional, 
it will be used. This contrasts from the behavior of Network Interface Backup mode in releases prior to AIX 5.2, 
where the backup adapter was used until it also failed, regardless of whether the primary adapter had 
already recovered.

For example, ent0 could be configured as the main adapter, and ent2 as the backup adapter, creating an 
EtherChannel called ent3. Ideally, ent0 and ent2 would be connected to two different switches. In this example, 
all traffic sent over en3 (the EtherChannel's interface) would be sent over ent0 by default, whereas ent2 
will be idle. If at any time ent0 fails, all traffic would be sent over the backup adapter, ent2. 
When ent0 recovers, it will once again be used for all traffic.

While operating in Network Interface Backup Mode, it is also possible to configure the EtherChannel to detect 
link failure and network unreachability. To do this, specify the IP address or host name of a remote host 
where connectivity should always be present. The EtherChannel will periodically ping this host to determine 
whether there is still a network path to it. If a specified number of ping attempts go unanswered, the EtherChannel 
will fail over to the other adapter in the hope that there is a network path to the remote host through the 
other adapter. In this setup, not only should every adapter be connected to a different switch, but each switch 
should also have a different route to the host that is pinged.

This ping feature is only available in Network Interface Backup mode. However, in AIX 5.2 and later, if there is 
a failover due to unanswered pings on the primary adapter, the backup adapter will remain the active channel as long 
as it is working. There is no way of knowing, while operating on the backup adapter, whether it is possible to reach 
the host being pinged from the primary adapter. To avoid failing over back and forth between the primary and 
the backup, it will simply keep operating on the backup (unless the pings go unanswered on the backup adapter 
as well, or if the backup adapter itself fails, in which case it would fail over to the primary adapter). 
However, if the failover occurred because the primary adapter failed (not because the pings went unanswered), 
the EtherChannel will then come back to the primary adapter as soon it has come back up, as usual.

To configure Network Interface Backup in AIX 5.2, see Configure Network Interface Backup in AIX 5.2 and later. 
To configure Network Interface Backup in previous versions of AIX, see Appendix D. Configure Network Interface Backup 
in previous AIX versions

Configure Network Interface Backup in AIX 5.2 and later
With root authority, type smit etherchannel on the command line. 
Select Add an EtherChannel / Link Aggregation from the list and press Enter. 
Select the primary Ethernet adapter and press Enter. This is the adapter that will be used until it fails. 
Note:
The Available Network Adapters displays all Ethernet adapters. If you select an Ethernet adapter that is already being used, you will get an error message and will need to detach this interface before you can use it. See the ifconfig command for information on how to detach an interface.
Enter the information in the fields according to the following guidelines: 
EtherChannel / Link Aggregation Adapters: You should see the primary adapter you selected in the previous step. 
Enable Alternate Address: This field is optional. Setting this to yes will enable you to specify a MAC address that you want the EtherChannel to use. If you set this option to no, the EtherChannel will use the MAC address of the primary adapter. 
Alternate Address: If you set Enable Alternate Address to yes, specify the MAC address that you want to use here. The address you specify must start with 0x and be a 12-digit hexadecimal address (for example 0x001122334455). 
Enable Gigabit Ethernet Jumbo Frames: This field is optional. In order to use this, your switch must support jumbo frames. This will only work with a Standard Ethernet (en) interface, not an IEEE 802.3 (et) interface. Set this to yes if you want to use it. 
Mode: It is irrelevant which mode of operation you select because there is only one adapter in the main EtherChannel. All packets will be sent over that adapter until it fails. There is no netif_backup mode because that mode can be emulated using a backup adapter. 
Hash Mode: It is irrelevant which hash mode you select because there is only one adapter in the main EtherChannel. All packets will be sent over that adapter until it fails. 
Backup Adapter: Enter the adapter that you want to be your backup adapter. After a failover, this adapter will be used until the primary adapter recovers. It is recommended to use the preferred adapter as the primary adapter. 
Internet Address to Ping: The field is optional. The EtherChannel will ping the IP address or host name that you specify here. If the EtherChannel is unable to ping this address for Number of Retries times in Retry Timeout intervals, the EtherChannel will switch adapters. 
Number of Retries: Enter the number of ping response failures that are allowed before the EtherChannel switches adapters. The default is three. This field is optional and valid only if you have set an Internet Address to Ping. 
Retry Timeout: Enter the number of seconds between the times when the EtherChannel will ping the Internet Address to Ping. The default is one second. This field is optional and valid only if you have set an Internet Address to Ping.
Press Enter after changing the desired fields to create the EtherChannel. 
Configure IP over the newly-created interface by typing smit chinet at the command line. 
Select your new EtherChannel interface from the list. 
Fill in all the required fields and press Enter.
For additional tasks that can be performed after the EtherChannel is configured, see Managing EtherChannel and IEEE 802.3ad Link Aggregation.

Load-balancing options
There are two load balancing methods for outgoing traffic in EtherChannel, as follows: round-robin, which spreads the outgoing traffic evenly across all the adapters in the EtherChannel; and standard, which selects the adapter using an algorithm. The Hash Mode parameter determines which numerical value is fed to the algorithm.

The following table summarizes the valid load balancing option combinations offered.

Table 16. Mode and Hash Mode combinations and the outgoing traffic distributions each will produce. Mode Hash Mode Outgoing Traffic Distribution 
standard or 8023ad default The traditional AIX behavior. The adapter selection algorithm uses the last byte of the destination IP address (for TCP/IP traffic) or MAC address (for ARP and other non-IP traffic). This mode is typically a good initial choice for a server with a large number of clients. 
standard or 8023ad src_dst_port The outgoing adapter path is selected by an algorithm using the combined source and destination TCP or UDP port values. Since each connection has a unique TCP or UDP port, the three port-based hash modes provide additional adapter distribution flexibility when there are several, separate TCP or UDP connections between an IP address pair. 
standard or 8023ad src_port The adapter selection algorithm uses the source TCP or UDP port value. In the netstat -an command output, the port is the TCP/IP address suffix value in the Local column. 
standard or 8023ad dst_port The outgoing adapter path is selected by the algorithm using the destination system port value. In the netstat -an command output, the TCP/IP address suffix in the Foreign column is the TCP or UDP destination port value. 
round-robin default Outgoing traffic is spread evenly across all the adapter ports in the EtherChannel. This mode is the typical choice for two hosts connected back-to-back (without an intervening switch). 

Round-Robin
All outgoing traffic is spread evenly across all of the adapters in the EtherChannel. It provides the highest bandwidth optimization for the AIX server system. While round-robin distribution is the ideal way to utilize all the links equally, consider that it also introduces the potential for out-of-order packets at the receiving system.

In general, round-robin mode is ideal for back-to-back connections running jumbo frames. In this environment, there is no intervening switch, so there is no chance that processing at the switch could alter the packet delivery time, order, or adapter path. On this direct cable network path, packets are received exactly as sent. Jumbo frames (9000 byte MTU) always yield better file transfer performance than traditional 1500 byte MTUs. In this case, however, they add another benefit. These larger packets take longer to send so it is less likely that the receiving host would be continuously interrupted with out-of-order packets.

Round-robin mode can be implemented in other environments but at increased risk of out-of-order packets at the receiving system. This risk is particularly high when there are few, long-lived, streaming TCP connections. When there are many such connections between a host pair, packets from different connections could be intermingled, thereby decreasing the chance of packets for the same connection arriving out-of-order. Check for out-of-order packet statistics in the tcp section of the netstat -s command output. A steadily-increasing value indicates a potential problem in traffic sent from an EtherChannel.

If out-of-order packets are a problem on a system that must use traditional Ethernet MTUs and must connected through a switch, try the various hash modes offered in standard mode operation. Each mode has a particular strength, but the default and src_dst_port modes are the logical starting points as they are more widely applicable.

Standard or 8032ad
Standard algorithm. The standard algorithm is used for both standard and IEEE 802.3ad-style link aggregations. AIX divides the last byte of the "numerical value" by the number of adapters in the EtherChannel and uses the remainder to identify the outgoing link. If the remainder is zero, the first adapter in the EtherChannel is selected; a remainder of one means the second adapter is selected, and so on (the adapters are selected in the order they are listed in the adapter_names attribute).

The Hash Mode selection determines the numerical value used in the calculation. By default, the last byte of the destination IP address or MAC address is used in the calculation, but the source and destination TCP or UDP port values may also be used. These alternatives allow you to fine-tune the distribution of outgoing traffic across the real adapters in the EtherChannel.

In default hash mode, the adapter selection algorithm is applied to the last byte of the destination IP address for IP traffic. For ARP and other non-IP traffic, the same formula is applied on the last byte of the destination MAC address. Unless there is an adapter failure which causes a failover, all traffic between a host pair in default standard mode goes out over the same adapter. The default hash mode may be ideal when the local host establishes connections to many different IP addresses.

If the local host establishes lengthy connections to few IP addresses, however, you will notice that some adapters carry a greater load than others, because all the traffic sent to a specific destination is sent over the same adapter. While this prevents packets from arriving out-of-order, it may not utilize bandwidth in the most effective fashion in all cases. The port-based hash modes still send packets in order, but they allow packets belonging to different UDP or TCP connections, even if they are sent to the same destination, to be sent over different adapters, thus utilizing better the bandwidth of all the adapters.

In src_dst_port hash mode, the TCP or UDP source and destination port values of the outgoing packet are added, then divided by two. The resultant whole number (no decimals) is plugged into the standard algorithm. TCP or UDP traffic is sent on the adapter selected by the standard algorithm and selected hash mode value. Non-TCP or UDP traffic will fall back to the default hash mode, meaning the last byte of either the destination IP address or MAC address. The src_dst_port hash mode option considers both the source and the destination TCP or UDP port values. In this mode, all of the packets in one TCP or UDP connection are sent over a single adapter so they are guaranteed to arrive in order, but the traffic is still spread out because connections (even to the same host) may be sent over different adapters. The results of this hash mode are not skewed by the connection establishment direction because it uses both the source and destination TCP or UDP port values.

In src_port hash mode, the source TCP or UDP port value of the outgoing packet is used. In dst_port hash mode, the destination TCP or UDP port value of the outgoing packet is used. Use the src_port or dst_port hash mode options if port values change from one connection to another and if the src_dst_port option is not yielding a desirable distribution.

Managing EtherChannel and IEEE 802.3ad Link Aggregation
This section will tell you how to perform the following tasks:

Listing EtherChannels or Link Aggregations 
Changing the Alternate Address 
Adding, removing, or changing adapters in an EtherChannel or Link Aggregation 
Remove an EtherChannel or Link Aggregation 
Configure or remove a backup adapter on an existing EtherChannel or Link Aggregation
Listing EtherChannels or Link Aggregations
On the command line, type smit etherchannel. 
Select List All EtherChannels / Link Aggregations and press Enter.
Changing the Alternate Address
This enables you to specify a MAC address for your EtherChannel or Link Aggregation.

On AIX 5.2 with 5200-01 and earlier, type ifconfig interface detach, where interface is your EtherChannel's or Link Aggregation's interface. (On AIX 5.2 with 5200-03 and later, you can change the alternate address of the EtherChannel without detaching its interface). 
On the command line, type smit etherchannel. 
Select Change / Show Characteristics of an EtherChannel and press Enter. 
If you have multiple EtherChannels, select the EtherChannel for which you want to create an alternate address. 
Change the value in Enable Alternate EtherChannel Address to yes. 
Enter the alternate address in the Alternate EtherChannel Address field. The address must start with 0x and be a 12-digit hexadecimal address (for example, 0x001122334455). 
Press Enter to complete the process. 
Note:
Changing the EtherChannel's MAC address at runtime may cause a temporary loss of connectivity. This is because the adapters need to be reset so they learn of their new hardware address, and some adapters take a few seconds to be initialized.
Dynamic Adapter Membership
Prior to AIX 5.2 with 5200-03, in order to add or remove an adapter from an EtherChannel, its interface first had to be detached, temporarily interrupting all user traffic. To overcome this limitation, Dynamic Adapter Membership (DAM) was added in AIX 5.2 with 5200-03. It allows adapters to be added or removed from an EtherChannel without having to disrupt any user connections. A backup adapter can also be added or removed; an EtherChannel can be initially created without a backup adapter, and one can be added a later date if the need arises

Not only can adapters be added or removed without disrupting user connections, it is also possible to modify most of the EtherChannel attributes at runtime. For example, you may begin using the "ping" feature of Network Interface Backup while the EtherChannel is in use, or change the remote host being pinged at any point.

You may also turn a regular EtherChannel into an IEEE 802.3ad Link Aggregation (or vice versa), allowing users to experiment with this feature without having to remove and recreate the EtherChannel.

Furthermore, with DAM, you may choose to create a one-adapter EtherChannel. A one-adapter EtherChannel behaves exactly like a regular adapter; however, should this adapter ever fail, it would be possible to replace it at runtime without ever losing connectivity. To accomplish this, you would add a temporary adapter to the EtherChannel, remove the defective adapter from the EtherChannel, replace the defective adapter with a working one using Hot Plug, add the new adapter to the EtherChannel, and then remove the temporary adapter. During this process you would never notice a loss in connectivity. If the adapter had been working as a standalone adapter, however, it would have had to be detached before being removed using Hot Plug, and during that time any traffic going over it would simply have been lost.

Adding, removing, or changing adapters in an EtherChannel or Link Aggregation
There are two ways to add, remove, or change an adapter in an EtherChannel or Link Aggregation. One method requires the EtherChannel or Link Aggregation interface to be detached, while the other does not (using Dynamic Adapter Membership, which is available in AIX 5.2 with 5200-03 and later).

Making changes to an EtherChannel using Dynamic Adapter Membership
Making changes using Dynamic Adapter Membership does not require you to stop all traffic going over the EtherChannel by detaching its interface. Consider the following before proceeding:

Notes:
When adding an adapter at runtime, note that different Ethernet adapters support different capabilities (for example, the ability to do checksum offload, to use private segments, to do large sends, and so forth). If different types of adapters are used in the same EtherChannel, the capabilities reported to the interface layer are those supported by all the adapters (for example, if all but one adapter supports the use of private segments, the EtherChannel will state it does not support private segments; if all adapters do support large send, the channel will state it supports large send). When adding an adapter to an EtherChannel at runtime, be sure that it supports at least the same capabilities as the other adapters already in the EtherChannel. If you attempt to add an adapter that does not support all the capabilities the EtherChannel supports, the addition will fail. Note, however, that if the EtherChannel's interface is detached, you may add any adapter (regardless of which capabilities it supports), and when the interface is reactivated the EtherChannel will recalculate which capabilities it supports based on the new list of adapters. 
If you are not using an alternate address and you plan to delete the adapter whose MAC address was used for the EtherChannel (the MAC address used for the EtherChannel is "owned" by one of the adapters), the EtherChannel will use the MAC address of the next adapter available (in other words, the one that becomes the first adapter after the deletion, or the backup adapter in case all main adapters are deleted). For example, if an EtherChannel has main adapters ent0 and ent1 and backup adapter ent2, it will use by default ent0's MAC address (it is then said that ent0 "owns" the MAC address). If ent0 is deleted, the EtherChannel will then use ent1's MAC address. If ent1 is then deleted, the EtherChannel will use ent2's MAC address. If ent0 were later re-added to the EtherChannel, it will continue to use ent2's MAC address because ent2 is now the owner of the MAC address. If ent2 were then deleted from the EtherChannel, it would start using ent0's MAC address again. 
Deleting the adapter whose MAC address was used for the EtherChannel may cause a temporary loss of connectivity, because all the adapters in the EtherChannel need to be reset so they learn of their new hardware address. Some adapters take a few seconds to be initialized.

If your EtherChannel is using an alternate address (a MAC address you specified), it will keep using this MAC address regardless of which adapters are added or deleted. Furthermore, it means that there will be no temporary loss of connectivity when adding or deleting adapters because none of the adapters "owns" the EtherChannel's MAC address.

Almost all EtherChannel attributes can now be modified at runtime. The only exception is Enable Gigabit Ethernet Jumbo Frames. To modify the Enable Gigabit Ethernet Jumbo Frames attribute, you must first detach the EtherChannel's interface before attempting to modify this value. 
For any attribute that cannot be changed at runtime (currently, only Enable Gigabit Ethernet Jumbo Frames), there is a field called Apply change to DATABASE only. If this attribute is set to yes, it is possible to change, at runtime, the value of an attribute that usually cannot be modified at runtime. With the Apply change to DATABASE only field set to yes the attribute will only be changed in the ODM and will not be reflected in the running EtherChannel until it is reloaded into memory (by detaching its interface, using rmdev -l EtherChannel_device and then mkdev -l EtherChannel_device commands), or until the machine is rebooted. This is a convenient way of making sure that the attribute is modified the next time the machine boots, without having to disrupt the running EtherChannel. 
To make changes to the EtherChannel or Link Aggregation using Dynamic Adapter Membership, follow these steps:

At the command line, type smit etherchannel. 
Select Change / Show Characteristics of an EtherChannel / Link Aggregation. 
Select the EtherChannel or Link Aggregation that you want to modify. 
Fill in the required fields according to the following guidelines: 
In the Add adapter or Remove adapter field, select the Ethernet adapter you want to add or remove. 
In the Add backup adapter or Remove backup adapter fields, select the Ethernet adapter you want to start or stop using as a backup. 
Almost all the EtherChannel attributes may be modified at runtime, although the Enable Gigabit Ethernet Jumbo Frames attribute cannot. 
To turn a regular EtherChannel into an IEEE 802.3ad Link Aggregation, change the Mode attribute to 8023ad. To turn an IEEE 802.3ad Link Aggregation into an EtherChannel, change the Mode attribute to standard or round_robin.
Fill in the necessary data, and press Enter.
Making changes on AIX 5.2 with 5200-01 and earlier
Follow these steps to detach the interface before making changes:

Type ifconfig interface detach, where interface is your EtherChannel's interface. 
On the command line type, smit etherchannel. 
Select Change / Show Characteristics of an EtherChannel / Link Aggregation and press Enter. 
Select the EtherChannel or Link Aggregation that you want to modify. 
Modify the attributes you want to change in your EtherChannel or Link Aggregation and press Enter. 
Fill in the necessary fields and press Enter.
Remove an EtherChannel or Link Aggregation
Type ifconfig interface detach, where interface is your EtherChannel's interface. 
On the command line type smit etherchannel. 
Select Remove an EtherChannel / and press Enter. 
Select the EtherChannel that you want to remove and press Enter.
Configure or remove a backup adapter on an existing EtherChannel or Link Aggregation
The following procedure configures or removes a backup adapter on an EtherChannel or Link Aggregation. This option is available only in AIX 5.2 and later.

Type ifconfig interface detach, where interface is your EtherChannel's or Link Aggregation's interface. 
On the command line, type smit etherchannel. 
Select Change / Show Characteristics of an EtherChannel / Link Aggregation. 
Select the EtherChannel or Link Aggregation that you are adding or modifying the backup adapter on. 
Enter the adapter that you want to use as your backup adapter in the Backup Adapter field, or select NONE if you wish to stop using the backup adapter.
Troubleshooting EtherChannel
If you are having trouble with your EtherChannel, consider the following:

Tracing EtherChannel
Use tcpdump and iptrace to troubleshoot the EtherChannel. The trace hook id for the transmission packets is 2FA and for other events is 2FB. You cannot trace receive packets on the EtherChannel as a whole, but you can trace each adapter's receive trace hooks.

Viewing EtherChannel Statistics
Use the entstat command to get the aggregate statistics of all the adapters in the EtherChannel. For example, entstat ent3 will display the aggregate statistics of ent3. Adding the -d flag will also display the statistics of each adapter individually. For example, typing entstat -d ent3 will show you the aggregate statistics of the EtherChannel as well as the statistics of each individual adapter in the EtherChannel.

Note:
In the General Statistics section, the number shown in Adapter Reset Count is the number of failovers. In EtherChannel backup, coming back to the main EtherChannel from the backup adapter is not counted as a failover. Only failing over from the main channel to the backup is counted. 
In the Number of Adapters field, the backup adapter is counted in the number displayed.

Improving Slow Failover
If the failover time when you are using network interface backup mode or EtherChannel backup is slow, verify that your switch is not running the Spanning Tree Protocol (STP). When the switch detects a change in its mapping of switch port to MAC address, it runs the spanning tree algorithm to see if there are any loops in the network. Network Interface Backup and EtherChannel backup may cause a change in the port to MAC address mapping.

Switch ports have a forwarding delay counter that determines how soon after initialization each port should begin forwarding or sending packets. For this reason, when the main channel is re-enabled, there is a delay before the connection is re-established, whereas the failover to the backup adapter is faster. Check the forwarding delay counter on your switch and make it as small as possible so that coming back to the main channel occurs as fast as possible.

For the EtherChannel backup function to work correctly, the forwarding delay counter must not be more than 10 seconds, or coming back to the main EtherChannel might not work correctly. Setting the forwarding delay counter to the lowest value allowed by the switch is recommended.

Adapters not Failing Over
If adapter failures are not triggering failovers and you are running AIX 5.2 with 5200-01 or earlier, check to see if your adapter card needs to have link polling enabled to detect link failure. Some adapters cannot automatically detect their link status. To detect this condition, these adapters must enable a link polling mechanism that starts a timer that periodically verifies the status of the link. Link polling is disabled by default. For EtherChannel to work correctly with these adapters, however, the link polling mechanism must be enabled on each adapter before the EtherChannel is created. If you are running AIX 5.2 with 5200-03 and later, the link polling is started automatically and this cannot be an issue.

Adapters that have a link polling mechanism have an ODM attribute called poll_link, which must be set to yes for the link polling to be enabled. Before creating the EtherChannel, use the following command on every adapter to be included in the channel:

smit chgenet
Change the Enable Link Polling value to yes and press Enter.

Using Jumbo Frames
For the jumbo frames option to work properly in AIX 5.2 and earlier, aside from enabling the use_jumbo_frame attribute on the EtherChannel, you must also enable jumbo frames on each adapter before creating the EtherChannel using the following command:

smitty chgenet
Change the Enable Jumbo Frames value to yes and press Enter. On AIX 5.2 and later, jumbo frames are enabled automatically in every underlying adapter when it is set to yes.

Remote Dump
Remote dump is not supported over an EtherChannel.

IEEE 802.3ad Link Aggregation
IEEE 802.3ad is a standard way of doing link aggregation. Conceptually, it works the same as EtherChannel in that several Ethernet adapters are aggregated into a single virtual adapter, providing greater bandwidth and protection against failures. For example, ent0 and ent1 can be aggregated into an IEEE 802.3ad Link Aggregation called ent3; interface en3 would then be configured with an IP address. The system considers these aggregated adapters as one adapter. Therefore, IP is configured over them as over any Ethernet adapter.

Like EtherChannel, IEEE 802.3ad requires support in the switch. Unlike EtherChannel, however, the switch does not need to be configured manually to know which ports belong to the same aggregation.

The advantages of using IEEE 802.3ad Link Aggregation instead of EtherChannel are that it creates the link aggregations in the switch automatically, and that it allows you to use switches that support the IEEE 802.3ad standard but do not support EtherChannel.

In IEEE 802.3ad, the Link Aggregation Control Protocol (LACP) automatically tells the switch which ports should be aggregated. When an IEEE 802.3ad aggregation is configured, Link Aggregation Control Protocol Data Units (LACPDUs) are exchanged between the server machine and the switch. LACP will let the switch know that the adapters configured in the aggregation should be considered as one on the switch without further user intervention.

Although the IEEE 802.3ad specification does not allow the user to choose which adapters are aggregated, the AIX implementation does allow the user to select the adapters. According to the specification, the LACP determines, completely on its own, which adapters should be aggregated together (by making link aggregations of all adapters with similar link speeds and duplexity settings). This prevents you from deciding which adapters should be used standalone and which ones should be aggregated together. The AIX implementation gives you control over how the adapters are used, and it never creates link aggregations arbitrarily.

To be able to aggregate adapters (meaning that the switch will allow them to belong to the same aggregation) they must be of the same line speed (for example, all 100 Mbps, or all 1 Gbps) and they must all be full duplex. If you attempt to place adapters of different line speeds or different duplex modes, the creation of the aggregation on the AIX system will succeed, but the switch may not aggregate the adapters together. If the switch does not successfully aggregate the adapters together, you may notice a decrease in network performance. For information on how to determine whether an aggregation on a switch has succeeded, see Troubleshooting IEEE 802.3ad.

According to the IEEE 802.3ad specification, packets going to the same IP address are all sent over the same adapter. Thus, when operating in 8023ad mode, the packets will always be distributed in the standard fashion, never in a round-robin fashion.

The backup adapter feature is available for IEEE 802.3ad Link Aggregations just as it is for EtherChannel. The backup adapter does not need to be connected to an IEEE 802.3ad-enabled switch, but if it is, the backup adapter will still follow the IEEE 802.3ad LACP.

You can also configure an IEEE 802.3ad Link Aggregation if the switch supports EtherChannel but not IEEE 802.3ad. In that case, you would have to manually configure the ports as an EtherChannel on the switch (just as if a regular EtherChannel had been created). By setting the mode to 8023ad, the aggregation will work with EtherChannel-enabled as well as IEEE 802.3ad-enabled switches. For more information about interoperability, see Interoperability Scenarios.

Note:
The steps to enable the use of IEEE 802.3ad varies from switch to switch. You should consult the documentation for your switch to determine what initial steps, if any, must be performed to enable LACP in the switch.
For information in how to configure an IEEE 802.3ad aggregation, see Configuring IEEE 802.3ad Link Aggregation.

Considerations
Consider the following before configuring an IEEE 802.3ad Link Aggregation:

Although not officially supported, the AIX implementation of IEEE 802.3ad will allow the Link Aggregation to contain adapters of different line speeds; however, you should only aggregate adapters that are set to the same line speed and are set to full duplex. This will help avoid potential problems configuring the Link Aggregation on the switch. Refer to your switch's documentation for more information on what types of aggregations your switch allows. 
If you will be using 10/100 Ethernet adapters in the Link Aggregation on AIX 5.2 with 5200-01 and earlier, you need to enable link polling on those adapters before you add them to the aggregation. Type smitty chgenet at the command line. Change the Enable Link Polling value to yes, and press Enter. Do this for every 10/100 Ethernet adapter that you will be adding to your Link Aggregation. 
Note:
In AIX 5.2 with 5200-03 and later, enabling the link polling mechanism is not necessary. The link poller will be started automatically.

Configuring IEEE 802.3ad Link Aggregation
Follow these steps to configure an IEEE 802.3ad Link Aggregation:

Type smit etherchannel at the command line. 
Select Add an EtherChannel / Link Aggregation from the list and press Enter. 
Select the primary Ethernet adapters that you want on your Link Aggregation and press Enter. If you are planning to use a backup adapter, do not select the adapter that you plan to use for the backup at this point. The backup adapter option is available in AIX 5.2 and later. 
Note:
The Available Network Adapters displays all Ethernet adapters. If you select an Ethernet adapter that is already being used (has an interface defined), you will get an error message. You first need to detach these interfaces if you want to use them.
Enter the information in the fields according to the following guidelines: 
EtherChannel / Link Aggregation Adapters: You should see all primary adapters that you are using in your Link Aggregation. You selected these adapters in the previous step. 
Enable Alternate Address: This field is optional. Setting this to yes will enable you to specify a MAC address that you want the Link Aggregation to use. If you set this option to no, the Link Aggregation will use the MAC address of the first adapter. 
Alternate Address: If you set Enable Alternate Address to yes, specify the MAC address that you want to use here. The address you specify must start with 0x and be a 12-digit hexadecimal address (for example, 0x001122334455). 
Enable Gigabit Ethernet Jumbo Frames: This field is optional. In order to use this, your switch must support jumbo frames. This will only work with a Standard Ethernet (en) interface, not an IEEE 802.3 (et) interface. Set this to yes if you want to enable it. 
Mode: Enter 8023ad. 
Hash Mode: You can choose from the following hash modes, which will determine which data value will be used by the algorithm to determine the outgoing adapter: 
default: In this hash mode the destination IP address of the packet will be used to determine the outgoing adapter. For non-IP traffic (such as ARP), the last byte of the destination MAC address is used to do the calculation. This mode will guarantee packets are sent out over the EtherChannel in the order they were received, but it may not make full use of the bandwidth. 
src_port: In this hash mode the source UDP or TCP port value of the packet will be used to determine the outgoing adapter. If the packet is not UDP or TCP traffic, the last byte of the destination IP address will be used. If the packet is not IP traffic, the last byte of the destination MAC address will be used. 
dst_port: In this hash mode the destination UDP or TCP port value of the packet will be used to determine the outgoing adapter. If the packet is not UDP or TCP traffic, the last byte of the destination IP will be used. If the packet is not IP traffic, the last byte of the destination MAC address will be used. 
src_dst_port: In this hash mode both the source and destination UDP or TCP port values of the packet will be used to determine the outgoing adapter (specifically, the source and destination ports are added and then divided by two before being fed into the algorithm). If the packet is not UDP or TCP traffic, the last byte of the destination IP will be used. If the packet is not IP traffic, the last byte of the destination MAC address will be used. This mode can give good packet distribution in most situations, both for clients and servers.
To learn more about packet distribution and load balancing, see Load-balancing options. 
Backup Adapter: This field is optional. Enter the adapter that you want to use as your backup. The backup adapter option is available in AIX 5.2 and later. 
Internet Address to Ping: This field is optional, and only available if you have only one adapter in the main aggregation and a backup adapter. The Link Aggregation will ping the IP address or host name that you specify here. If the Link Aggregation is unable to ping this address for the Number of Retries times in Retry Timeout intervals, the Link Aggregation will switch adapters. 
Number of Retries: Enter the number of ping response failures that are allowed before the Link Aggregation switches adapters. The default is three. This field is optional and valid only if you have set an Internet Address to Ping. 
Retry Timeout: Enter the number of seconds between the times when the Link Aggregation will ping the Internet Address to Ping. The default is one second. This field is optional and valid only if you have set an Internet Address to Ping.
Press Enter after changing the desired fields to create the Link Aggregation. 
Configure IP over the newly-created Link Aggregation device by typing smit chinet at the command line. 
Select your new Link Aggregation interface from the list. 
Fill in all the required fields and press Enter.
Managing IEEE 802.3ad
For management tasks that can be performed on an IEEE 802.3ad Link Aggregation after configuration, see Managing EtherChannel and IEEE 802.3ad Link Aggregation.

Troubleshooting IEEE 802.3ad
If you are having trouble with your IEEE 802.3ad Link Aggregation, use the following command to verify the mode of operation of the Link Aggregation:

entstat -d device
where device is the Link Aggregation device.

This will also make a best-effort determination of the status of the progress of LACP based on the LACPDUs received from the switch. The following status values are possible:

Inactive: LACP has not been initiated. This is the status when a Link Aggregation has not yet been configured, either because it has not yet been assigned an IP address or because its interface has been detached. 
Negotiating: LACP is in progress, but the switch has not yet aggregated the adapters. If the Link Aggregation remains on this status for longer than one minute, verify that the switch is correctly configured. For instance, you should verify that LACP is enabled on the ports. 
Aggregated: LACP has succeeded and the switch has aggregated the adapters together. 
Failed: LACP has failed. Some possible causes are that the adapters in the aggregation are set to different line speeds or duplex modes or that they are plugged into different switches. Verify the adapters' configuration. 
In addition, some switches allow only contiguous ports to be aggregated and may have a limitation on the number of adapters that can be aggregated. Consult the switch documentation to determine any limitations that the switch may have, then verify the switch configuration.

Note:
The Link Aggregation status is a diagnostic value and does not affect the AIX side of the configuration. This status value was derived using a best-effort attempt. To debug any aggregation problems, it is best to verify the switch's configuration.
Interoperability Scenarios
The following table shows several interoperability scenarios. Consider these scenarios when configuring your EtherChannel or IEEE 802.3ad Link Aggregation. Additional explanation of each scenario is given after the table.

Table 17. Different AIX and switch configuration combinations and the results each combination will produce. EtherChannel mode Switch configuration Result 
8023ad IEEE 802.3ad LACP OK - AIX initiates LACPDUs, which triggers an IEEE 802.3ad Link Aggregation on the switch. 
standard or round_robin EtherChannel OK - Results in traditional EtherChannel behavior. 
8023ad EtherChannel OK - Results in traditional EtherChannel behavior. AIX initiates LACPDUs, but the switch ignores them. 
standard or round_robin IEEE 802.3ad LACP Undesirable - Switch cannot aggregate. The result may be poor performance as the switch moves the MAC address between switch ports 

8023ad with IEEE 802.3ad LACP: 
This is the most common IEEE 802.3ad configuration. The switch can be set to passive or active LACP.

standard or round_robin with EtherChannel: 
This is the most common EtherChannel configuration.

8023ad with EtherChannel: 
In this case, AIX will send LACPDUs, but they will go unanswered because the switch is operating as an EtherChannel. However, it will work because the switch will still treat those ports as a single link.

Note:
In this case, the entstat -d command will always report the aggregation is in the Negotiating state.
standard or round_robin with IEEE 802.3ad LACP: 
This setup is invalid. If the switch is using LACP to create an aggregation, the aggregation will 
never happen because AIX will never reply to LACPDUs. For this to work correctly, 8023ad should be 
the mode set on AIX.


Note 5:
-------

Internet Protocol over Fibre Channel
Beginning with AIX 5.2 with 5200-03, IP packets can be sent over a physical fibre-channel connection. 
After a system is configured to use IP over Fibre Channel, its network activity will function just as 
if an Ethernet or Token-Ring adapter were being used.

In order to use IP over Fibre Channel, your system must have a Fibre Channel switch and either 
the 2 Gigabit Fibre Channel Adapter for 64-bit PCI Bus or the 2 Gigabit Fibre Channel PCI-X Adapter.

In addition, the following filesets must be installed:

devices.common.ibm.fc 
devices.pci.df1000f7 
devices.pci.df1080f9 
devices.pci.df1000f9

Configuring IP over Fiber Channel
The following procedure will lead you through a configuration of IP over Fibre Channel. 
The Fibre Channel IP device driver must first be enabled. Following the enablement, 
the cfgmgr command will be run to create the Fibre Channel interface. After the interface is created, 
the network attributes (such as its IP address, Network Mask, Nameserver, and Gateway) will be assigned.

Enable the Fibre Channel IP Device Driver
By default, the Fibre Channel IP device is not enabled. To enable this device, follow these steps:

From the command line, type smit dev. 
Select FC Adapter. 
Select FC Network Protocol Device. 
Select Enable a FC Network Device. 
Select the adapter that is going to be enabled. 
Use the cfgmgr command to create the Fibre Channel interface. 
See cfgmgr in AIX 5L Version 5.3 Commands Reference.

Assign network properties to Fibre Channel interface

After the adapter has been enabled, IP needs to be configured over it. Follow these steps to configure IP:

From the command line, type smit tcpip. 
Select Minimum Configuration & Startup. 
Select the interface that you want to configure. In this case, it will be fcx, where x is the 
minor number of the interface. 
Assign all required attributes.
After the IP attributes have been assigned, verify that the changes took place by tying the 
following command at the command line:

ifconfig -a
If your configuration was successful, you will see results similar to the following among the results:

fc1: flags=e000843 <UP,BROADCAST,NOTRAILERS,RUNNING,SIMPLEX,MULTICAST,GROUPRT,64BIT,PSEG,CHAIN>
        inet 11.11.11.18 netmask 0xffffff00 broadcast 11.11.11.255
Additionally, you can run the following command:

ifconfig fcx
where x is the minor number of the interface.


67. More on Memory in AIX:
==========================


67.1 Show memory in AIX: 
------------------------

# bootinfo -r
# lsattr -El sys0 -a realmem 
# ps -eo user,pid,pcpu,vsz,time,args    (vsz gives size per process)


To look at your virtual memory and its causes, you can use a combination of: 
  
# ipcs -bm  (shared memory) 
# lsps -a   (paging) 
# vmstat -v (shows all current values)
# vmo -a    (virtual memory options) 
# vmo -L    (all tunable VMM options and values)
# svmon -G  (basic memory allocations) 
# svmon -U  (virtual memory usage by user) 


67.2 AIX Memory Tune-ables:
---------------------------

Through Environment variables: See section 9 below.

Otherwise:

- vmtune command in AIX lower than AIX 5L, like AIX 4.1:
--------------------------------------------------------

The vmtune command can be used to modify the VMM parameters that control the behavior of the memory-management 
subsystem. Some options are available to alter the defaults for LVM and file systems; the options dealing 
with disk I/O are discussed in the following sections. 

To determine whether the vmtune command is installed and available, run the following command: 

# lslpp -lI bos.adt.samples

The executable program for the vmtune command is found in the /usr/samples/kernel directory. 
The vmtune command can only be executed by the root user. Changes made by this tool remain in place until 
the next reboot of the system. If a permanent change is needed, place an appropriate entry in the /etc/inittab file. 
For example: 

vmtune:2:wait:/usr/samples/kernel/vmtune -P 50

Note: The vmtune command is in the samples directory because it is VMM-implementation dependent. 
The vmtune code that accompanies each release of the operating system is tailored specifically to the VMM 
in that release. Running the vmtune command from one release on a system with a different VMM release might
result in an operating system failure. It is also possible that the functions of the vmtune command may change 
from release to release. Be sure to review the appropriate tuning information before using the vmtune command 
to change system parameters. 

How to use the vmtune command? Use vmtune with a flag representing the parameter you want to change,
for example "maxfree". 

maxfree
Purpose: The maximum size to which the VMM page-frame free list will grow by page stealing. 
Values: Default: configuration-dependent, Range: 16 to 204800 (4KB frames) 
Display: vmtune 
Change: vmtune -F NewValue  


- tuning commands in AIX 5L:
----------------------------

Introduction 
By default, AIX is tuned for a mixed workload, and will grow its VMM file cache up to 80% of physical RAM. 
While this may be great for an NFS server, SMTP relay or web server, it is very poor for running any application 
which does its own cache management. This includes most databases (Oracle, DB2, Sybase, PostgreSQL, 
MySQL using InnoDB tables, TSM) and some other software (eg. the Squid web cache). 

Common symptoms include high paging (high pgspin and pgspout in topas), high system CPU time, 
the lrud kernel thread using CPU, slow overall system throughput, slow backups and slow process startup. 

For most database systems, the ideal solution is to use raw logical volumes. If this is not acceptable, 
then direct I/O and concurrent I/O should be used. If for some reason this is not possible, then the last solution 
is to tune the AIX file caches to be less aggressive. 

Parameters 
The three main parameters that should be tuned are those controlling the size of the persistent file cache 
(minperm% and maxperm%) used for JFS filesystems, and the client file cache (maxclient%) used by 
NFS, CDRFS and JFS2 filesystems 

- numperm% 
  Defines the current size of the persistent file cache. 
- minperm% 
  Defines the minimum amount of RAM the persistent file cache may occupy. If numperm% is less than or equal 
  to minperm%, file pages will not be stolen when RAM is required. 
- maxperm% 
  Defines the maximum amount of RAM the persistent file cache may occupy before it is used as the sole source 
  of new pages by the page stealing algorithm. By default, numperm% may exceed maxperm% if there is 
  free memory available. The setting strict_maxperm may be set to one to change maxperm% into a hard limit, 
  guaranteeing numperm% will never exceed maxperm%. 
- strict_maxperm 
  As above, if set to 1, changes maxperm% into a hard limit. 
- numclient% 
  Defines the current size of the client file cache. 
- maxclient% 
  Defines the hard maximum size of the client file cache. 
- strict_maxclient 
  Introduced in 5.2 ML4, allows the changing of maxclient% into a soft limit, similar to strict_maxperm. 

Note that maxclient% may never exceed maxperm%. In later versions of vmtune, this is enforced by changing both 
parameters if necessary. 

Note: AIX 5.2 includes a compatibilty version of vmtune. It is probably most wise to become familiar with 
the new tools, instead of relying on the backwards compatibility commands. 

The main tool to use is /usr/sbin/vmo, installed as part of the bos.perf.tune fileset. To display current 
cache sizes (numperm% and numclient%) use vmstat -v. 

vmo can change both persistent (reboot) values as well as runtime values, and so does not need to be 
present in the startups. It stores the persistent values in the /etc/tunables/nextboot file. 

Current values and characteristics may be displayed using: 

# vmo -L
NAME                      CUR    DEF    BOOT   MIN    MAX    UNIT           TYPE
     DEPENDENCIES
--------------------------------------------------------------------------------
memory_frames             512K          512K                 4KB pages         S
--------------------------------------------------------------------------------
pinnable_frames           427718        427718               4KB pages         S
--------------------------------------------------------------------------------
maxfree                   128    128    128    16     200K   4KB pages         D
     minfree
     memory_frames
...


Kernel tuning parameters:

AIX 5.2 introduces a new method that is more flexible and centralized for setting most of the AIX kernel 
tuning parameters. It is now possible to make permanent changes without having to edit any rc files. 
This is achieved by placing the reboot values for all tunable parameters in a new stanza file, 
/etc/tunables/nextboot. When the machine is rebooted, the values in that file are automatically applied. 
Another stanza file, /etc/tunables/lastboot is automatically generated with all the values as they were set 
just after the reboot. This provides the capability to return to those values at any time. The log file for 
any changes made or impossible to make during reboot is stored in /etc/tunables/lastboot.log. There are sets 
of SMIT panels and a WebSm plug-in also available to manipulate current and reboot values for all tuning 
parameters as well as the files in the /etc/tunables directory.

There are four new commands introduced in AIX 5.2 to modify the tunables files. The tunsave command is used 
to save values to a stanza file. The tunrestore command is used to apply a file, for example, to change all 
tunables parameter values to those listed in a file. The command tuncheck must be used to validate a file 
created manually and the tundefault command is available to reset tunable parameters to their default values. 
All four commands work on both current and reboot tunables parameters values. See the respective man pages 
for more information.

Modifications to vmtune and schedtune
Vmtune and schedtune are being replaced by the newly supported commands called "vmo", "ioo", and "schedo". 
Both vmo and ioo together replace vmtune, while schedo replaces schedtune. All existing parameters are 
covered by the new commands.

The ioo command will handle all the I/O related tuning parameters, while the vmo command will handle 
all the other VMM parameters previously managed by vmtune. All three commands are part of the new fileset 
"bos.perf.tune" which also contains tunsave, tunrestore, tuncheck, and tundefault. 
The bos.adt.samples fileset will still include the vmtune and schedtune commands, which will simply 
be compatibility shell scripts calling vmo, ioo, and schedo as appropriate. The compatibility scripts 
only support changes to parameters which can be changed interactively. That is, parameters that need bosboot 
and then require a reboot of the machine to be effective are no longer supported by the vmtune script. 
To change those parameters, users must now use vmo -r. The options (all from vmtune) and parameters 
in question are as follows:

vmtune option     parameter name    new command 
-C 0|1            page coloring     vmo -r -o pagecoloring=0|1 

-g n1 
-L n2 large page size 
number of large pages to reserve vmo -r -o lpg_size=n1 -o lpg_regions=n2 
-m n memory pools vmo -r -o mempools=n 
-v n number of frames per memory pool vmo -r -o framesets=n 
-i n interval for special data segment identifiers vmo -r -o spec_dataseg_int=n 
-V n number of special data segment identifiers to reserve vmo -r -o num_spec_dataseg 
-y 0|1 p690 memory affinity vmo -r -o memory_affinity=0|1 

Enhancements to no and nfso
The no and nfso commands have been enhanced to support making permanent changes to tunable parameters. 
They now interact with the /etc/tunables/nextboot file to achieve this new functionality. 
They both also have a new -h flag which can be used to display help about any parameter. The content of the 
help includes the purpose of the parameter, the possible values (default, range and type), and diagnostic 
and tuning information to decide when to change the parameter value. This information is also listed entirely 
in the respective man pages. Note that all five tuning commands (ioo, nfso, no, vmo, and schedo) use 
the same common syntax. See the respective man pages for more details and also the complete list of 
tuning parameters supported.

-- The vmo command:
-------------------

Purpose
Manages Virtual Memory Manager tunable parameters.

Syntax
vmo [ -p | -r ] { -o Tunable [= Newvalue]}

vmo [ -p | -r ] {-d Tunable }

vmo [ -p | -r ] -D

vmo [ -p | -r ] -a

vmo -?

vmo -h [ Tunable ]

vmo -L [ Tunable ]

vmo -x [ Tunable ]

Note:
Multiple -o, -d, -x and -L are allowed.

Description
Note:
The vmo command can only be executed by root.
Use the vmo command to configure Virtual Memory Manager tuning parameters. This command sets or displays 
current or next boot values for all Virtual Memory Manager tuning parameters. This command can also make 
permanent changes or defer changes until the next reboot. Whether the command sets or displays a parameter 
is determined by the accompanying flag. The -o flag performs both actions. It can either display the 
value of a parameter or set a new value for a parameter.

The Virtual Memory Manager (VMM) maintains a list of free real-memory page frames. These page frames are
available to hold virtual-memory pages needed to satisfy a page fault. When the number of pages on the 
free list falls below that specified by the minfree parameter, the VMM begins to steal pages to add to 
the free list. The VMM continues to steal pages until the free list has at least the number of pages 
specified by the maxfree parameter.

If the number of file pages (permanent pages) in memory is less than the number specified by the 
minperm% parameter, the VMM steals frames from either computational or file pages, regardless 
of repage rates. If the number of file pages is greater than the number specified by the maxperm% parameter, 
the VMM steals frames only from file pages. Between the two, the VMM normally steals only file pages, 
but if the repage rate for file pages is higher than the repage rate for computational pages, 
computational pages are stolen as well.

You can also modify the thresholds that are used to decide when the system is running out of paging space. 
The npswarn parameter specifies the number of paging-space pages available at which the system begins 
warning processes that paging space is low. The npskill parameter specifies the number of paging-space 
pages available at which the system begins killing processes to release paging space.

Examples:

1. Configure large pages:

You must configure your system to use large pages and you must also specify the amount of physical memory 
that you want to allocate to back large pages. The system default is to not have any memory allocated 
to the large page physical memory pool. You can use the vmo command to configure the size of the large page 
physical memory pool. The following example allocates 4 GB to the large page physical memory pool:

# vmo -r -o lgpg_regions=64 -o lgpg_size=16777216
To use large pages for shared memory, you must enable the SHM_PIN shmget() system call with the following command, 
which persists across system reboots:

# vmo -p -o v_pinshm=1

To see how many large pages are in use on your system, use the vmstat -l command as in the following example:

# vmstat -l

kthr     memory             page              faults        cpu      large-page 
                                                                                
----- ----------- ------------------------ ------------ ----------- ------------
 r  b   avm   fre  re  pi  po  fr   sr  cy  in   sy  cs us sy id wa   alp   flp 
 2  1 52238 124523   0   0   0   0    0   0 142   41  73  0  3 97  0     16     16

From the above example, you can see that there are 16 active large pages, alp, and 16 free large pages, flp.


2. Tuning Examples:

Show Virtual Memory Tuning parameters: 
vmo -L

Show min and max values controlling file I/O cache: 
vmo -L minperm% -L maxperm%  -L maxclient%

Permanently adjust these values: 
vmo -p -o minperm%=5 -o maxperm%=20  -o maxclient%=20

Show Filesystem Tuning parameters: 
ioo -L

Show Network Tuning parameters: 
no -a

Show NFS Tuning paramters: 
nfso -LChange/

Show Kernel operating parameters: 

smitty chgsys

Enable Asynchronous I/O: 
smitty aio


Another example:
----------------

Suppose we have an Oracle DB instance on an AIX 5.3 machine. What is the best and simplest way
to tune the memory so its optimized for Oracle?

Take a look at the cache:

root@zd111l04:/root#vmo -L minperm% -L maxperm%  -L maxclient%
NAME                      CUR    DEF    BOOT   MIN    MAX    UNIT           TYPE
     DEPENDENCIES
--------------------------------------------------------------------------------
maxclient%                80     80     80     1      100    % memory          D
     maxperm%
     minperm%
--------------------------------------------------------------------------------
maxperm%                  80     80     80     1      100    % memory          D
     minperm%
     maxclient%
--------------------------------------------------------------------------------
minperm%                  20     20     20     1      100    % memory          D
     maxperm%
     maxclient%
--------------------------------------------------------------------------------


# vmo -p -o minperm%=5            # Was 20
# vmo -p -o maxclient%=10         # Was 80
# vmo -p -o maxperm%=10           # Was 80


67.3 Websphere and AIX Memory:
------------------------------

67.3.1 Errors you may find in Websphere logs

1. java.lang.OutOfMemory
2. javax.naming.NameNotFoundException
3. javax.servlet.ServletException
4. java.lang.StringIndexOutOfBoundsException
5. java.net.SocketException
6. java.io.IOException
7. java.io.FileNotFoundException
8. java.util.MissingResourceException
9. java.lang.ClassNotFoundException
10.java.lang.StringIndexOutOfBoundsException
11.java.io.InterruptedIOException
12.com.splwg.cis.common.NestedRuntimeException


The number that is associated with action determines the type of garbage
collection that is being done:
action=1 means a preemptive garbage collection cycle.
action=2 means a full allocation failure.
action=3 means that a heap expansion takes place.
action=4 means that all known soft references are cleared.
action=5 means that stealing from the transient heap is done.
action=6 means that free space is very low.


Note 1 on java.lang.OutOfMemory
-------------------------------

The Java process has two memory areas: the Java heap, and the "native heap", 
which combine total the memory usage of the process. 
The Java heap is controlled via the -Xms and -Xmx setting, and the space 
available to the native heap is that which isn't used by the Java heap. 


The act of reducing the maximum Java heap size has made the "native heap" 
bigger, and this is the area that was memory constrained. 
We know this because the OutOfMemoryError was generated the message informed 
you that the JVM was unable to allocate a new native stack, this is 
allocated onto the native heap (there is also a Java thread object which is 
created and allocated onto the Java heap). 


It is entirely possible that the amount of "native heap" available to the 
JVM was insufficient to allocate the underlying resources to run the Java 
process under the load that was being driven through it. The native heap is 
now 500MB bigger, and unless there is a memory leak or the load is 
significantly increased, this change should prevent any OutOfMemoryErrors 
based on the native heap. 

Note 2 on java.lang.OutOfMemory
-------------------------------

Hi,

I'm experiment with Tomcat with simple "Hello World" servlet.
When I send 50 concurrent requests, I got java.lang.outOfMemory error.
Tomcat works fine upto 40 concurrent requests for the same servlet.
I'm using Tomcat 3.1M1 with Java 1.2 on Solaris 2.7.

We try to add -mx swith to the Java invocation in tomcat.sh
(line 102)
    $JAVACMD -mx96m org.apache.tomcat.shell.Startup "$@" &
And it still out of memory.

Any suggestion?

Lishin

Hi Lishin

This could be to do with exceeding max file-descriptors - this gave us the
error below (45 connections)

We are running tomcat on Solaris 2.6.  Each new connection uses at least one
socket connection, which is treated as a file-descriptor.  There is a
default limit (user) of 64 file descriptors 

To check this try: 

ulimit -n

To increase this try

ulimit -n <num>

There will be a system limit - for Solaris this is default 1024:

system limit:

ulimit -Hn


I hope this helps - I had a very frustrating time solving this one!

Joe.

Note 3 on java.lang.OutOfMemory
-------------------------------

LDR_CNTRL Purpose: Allows tuning of the kernel loader. 
Values: Default: Not set Possible Values: PREREAD_SHLIB, LOADPUBLIC, IGNOREUNLOAD, USERREGS, MAXDATA, 
DSA, PRIVSEG_LOADS 
Display: echo $LDR_CNTRL 
Change: LDR_CNTRL={PREREAD_SHLIB | LOADPUBLIC| ...} export LDR_CNTRLChange takes effect immediately in this shell. 
Change is effective until logging out of this shell. Permanent change is made by adding the following line to 
the /etc/environment file: LDR_CNTRL={PREREAD_SHLIB | LOADPUBLIC| ...} 
Diagnosis: N/A 
Tuning: The LDR_CNTRL environment variable can be used to control one or more aspects of the system loader behavior. 
You can specify multiple options with the LDR_CNTRL variable. When doing this, separate the options using 
an @ character (that is, LDR_CNTRL=PREREAD_SHLIB@LOADPUBLIC). Specifying the PREREAD_SHLIB option will cause 
entire libraries to be read as soon as they are accessed. With VMM readahead tuned, a library can be read in from disk 
and be cached in memory by the time the program starts to access its pages. While this method can use more memory, 
it can enhance performance of programs that use many shared library pages providing the access pattern 
is non-sequential. (for example, Catia). Specifying the LOADPUBLIC option directs the system loader to load 
all modules requested by an application into the global shared library segment. If a module cannot be loaded 
publicly into the global shared library segment then it is loaded privately for the application. Specifying 
the IGNOREUNLOAD option will cause modules that are marked to be unloaded and used again 
(if the module has not been unloaded already). As a side effect of this option, you can end up with 
two different data instances for the module. Specifying the USERREGS option will tell the system to save 
all general-purpose user registers across system calls made by an application. This can be helpful in 
applications doing garbage collection. Specifying the MAXDATA option sets the maximum heap size for a process, 
including overriding any MAXDATA value specified in an executable. If you want to use Large Program Support 
with a data heap size of 0x30000000, then specify LDR_CNTRL=MAXDATA=0x30000000. To turn off Large Program Support, 
specify LDR_CNTRL=MAXDATA=0. Specifying the DSA (Dynamic Segment Allocation) option tells the system loader 
to run applications using Very Large Program Support. The DSA option is only valid for 32-bit applications. 
Specifying the PRIVSEG_LOADS option directs the system loader to put dynamically loaded private modules into 
the process private segment. This might improve the availability of memory in large memory model applications 
that perform private dynamic loads and tend to run out of memory in the process heap. If the process private segment 
lacks sufficient space, the PRIVSEG_LOADS option has no effect. The PRIVSEG_LOADS option is only valid for 
32-bit applications with a non-zero MAXDATA value. 


Note 4: Java SDK and Websphere for AIX :
----------------------------------------

At 06/06/06, the following versions for Websphere on AIX are frequently found:

5.0.2:
------

5.0.2.x  x in 2-16

5.0.2 ca131-20030618 (sr5) 

5.0.2.1
5.0.2.2
5.0.2.3
5.0.2.4
5.0.2.5
5.0.2.6
5.0.2.7
5.0.2.8
5.0.2.9
5.0.2.10
5.0.2.11
5.0.2.12
5.0.2.13
5.0.2.14
5.0.2.15 SDK is not updated 

5.1:
----

5.1 ca141-20031011 (sr1) 

5.1.1 ca1420-20040626 

5.1.1.1
5.1.1.2
5.1.1.3
5.1.1.4
5.1.1.5
5.1.1.6
5.1.1.7
5.1.1.8
5.1.1.9 SDK is not updated 

6.0:
----

6.0 ca142sr1w-20041028

6.0.0.2
6.0.0.3 SDK is not updated 

6.0.1 ca142sr1a-20050209(SR1a) 

6.0.1.1
6.0.1.2 SDK is not updated 

6.0.2 ca142-20050609 

6.0.2.1
6.0.2.3
6.0.2.5
6.0.2.7 SDK is not updated 

6.0.2.9 How critical is this fix pack? 
Recommended. This fix pack must be installed on top of WebSphere Application Server V6.0.2, 6.0.2.1, 6.0.2.3, 
6.0.2.5, or 6.0.2.7.


68. Kernel parameters AIX:
==========================

- Kernel Tunable Parameters
Following are kernel parameters, grouped into the following sections:

- Scheduler and Memory Load Control Tunable Parameters:

Virtual Memory Manager Tunable Parameters 
Synchronous I/O Tunable Parameters 
Asynchronous I/O Tunable Parameters 
Disk and Disk Adapter Tunable Parameters 
Interprocess Communication Tunable Parameters
Scheduler and Memory Load Control Tunable Parameters
Most of the scheduler and memory load control tunable parameters are fully described in the schedo man page. 
The following are a few other related parameters:

- maxuproc 
Purpose: Specifies the maximum number of processes per user ID. 
Values: Default: 40; Range: 1 to 131072 
Display: lsattr -E -l sys0 -a maxuproc 
Change: chdev -l sys0 -a maxuproc=NewValue 
Change takes effect immediately and is preserved over boot. If value is reduced, then it goes into effect 
only after a system boot. 
Diagnosis: Users cannot fork any additional processes. 
Tuning: This is a safeguard to prevent users from creating too many processes. 

- ncargs 
Purpose: Specifies the maximum allowable size of the ARG/ENV list (in 4KB blocks) when running exec() subroutines. 
Values: Default: 6; Range: 6 to 1024 
Display: lsattr -E -l sys0 -a ncargs 
Change: chdev -l sys0 -a ncargs=NewValue 
Change takes effect immediately and is preserved over boot. 
Diagnosis: Users cannot execute any additional processes because the argument list passed to the exec() 
system call is too long. A low default value might cause some programs to fail with the arg list too long 
error message, in which case you might try increasing the ncargs value with the chdev command above and then 
rerunning the program. 
Tuning: This is a mechanism to prevent the exec() subroutines from failing if the argument list 
is too long. Please note that tuning to a higher ncargs value puts additional constraints on system memory resources. 
 

- Virtual Memory Manager Tunable Parameters:

The complete listing of the virtual memory manager tunable parameters is located in the vmo man page.

- Synchronous I/O Tunable Parameters:

Most of the synchronous I/O tunable parameters are fully described in the ioo man page. 
The following are a few other related parameters:

maxbuf Purpose: Number of (4 KB) pages in the block-I/O buffer cache. 
Values: Default: 20; Range: 20 to 1000 
Display: lsattr -E -l sys0 -a maxbuf 
Change: chdev -l sys0 -a maxbuf=NewValue 
Change is effective immediately and is permanent. If the -T flag is used, the change is immediate and lasts until 
the next boot. If the -P flag is used, the change is deferred until the next boot and is permanent. 
Diagnosis: If the sar -b command shows breads or bwrites with %rcache and %wcache being low, you might want to 
tune this parameter. 
Tuning: This parameter normally has little performance effect on systems, where ordinary I/O does not use the 
block-I/O buffer cache. 
Refer to: Tuning Asynchronous Disk I/O 

maxpout Purpose: Specifies the maximum number of pending I/Os to a file. 
Values: Default: 0 (no checking); Range: 0 to n (n should be a multiple of 4, plus 1) 
Display: lsattr -E -l sys0 -a maxpout 
Change: chdev -l sys0 -a maxpout=NewValue 
Change is effective immediately and is permanent. If the -T flag is used, the change is immediate and lasts 
until the next boot. If the -P flag is used, the change is deferred until the next boot and is permanent. 
Diagnosis: If the foreground response time sometimes deteriorates when programs with large amounts 
of sequential disk output are running, sequential output may need to be paced. 
Tuning: Set maxpout to 33 and minpout to 16. If sequential performance deteriorates unacceptably, 
increase one or both. If foreground performance is still unacceptable, decrease both. 

minpout Purpose: Specifies the point at which programs that have reached maxpout can resume writing to the file. 
Values: Default: 0 (no checking); Range: 0 to n (n should be a multiple of 4 and should be at least 4 less than maxpout) 
Display: lsattr -E -l sys0 -a minpout 
Change: chdev -l sys0 -a minpout=NewValue 
Change is effective immediately and is permanent. If the -T flag is used, the change is immediate and lasts until 
the next boot. If the -P flag is used, the change is deferred until the next boot and is permanent. 
Diagnosis: If the foreground response time sometimes deteriorates when programs with large amounts of sequential 
disk output are running, sequential output may need to be paced. 
Tuning: Set maxpout to 33 and minpout to 16. If sequential performance deteriorates unacceptably, 
increase one or both. If foreground performance is still unacceptable, decrease both. 

mount -o nointegrity Purpose: A new mount option (nointegrity) may enhance local file system performance for 
certain write-intensive applications. This optimization basically eliminates writes to the JFS log. 
Note that the enhanced performance is achieved at the expense of metadata integrity. Therefore, use this 
option with extreme caution because a system crash can make a file system mounted with this option unrecoverable. 
Nevertheless, certain classes of applications do not require file data to remain consistent after a system crash, 
and these may benefit from using the nointegrity option. Two examples in which a nointegrity file system may be 
beneficial is for compiler temporary files, and for doing a nonmigration or mksysb installation. 

Paging Space Size Purpose: The amount of disk space required to hold pages of working storage. 
Values: Default: configuration-dependent; Range: 32 MB to n MB for hd6, 16 MB to n MB for non-hd6 
Display: lsps -a mkps or chps or smitty pgsp 
Change: Change is effective immediately and is permanent. Paging space is not necessarily put into use immediately, however. 
Diagnosis: Run: lsps -a. If processes have been killed for lack of paging space, monitor the situation with the psdanger() subroutine. 
Tuning: If it appears that there is not enough paging space to handle the normal workload, add a new paging space on another physical volume or make the existing paging spaces larger. 

syncd Interval Purpose: The time between sync() calls by syncd. 
Values: Default: 60; Range: 1 to any positive integer 
Display: grep syncd /sbin/rc.boot vi /sbin/rc.boot or 
Change: Change is effective at next boot and is permanent. An alternate method is to use the kill command to terminate the syncd daemon and restart it from the command line with the command /usr/sbin/syncd interval. 
Diagnosis: I/O to a file is blocked when syncd is running. 
Tuning: At its default level, this parameter has little performance cost. No change is recommended. Significant 
reductions in the syncd interval in the interests of data integrity (as for HACMPT) could have adverse performance 
consequences. 

Asynchronous I/O Tunable Parameters
maxreqs Purpose: Specifies the maximum number of asynchronous I/O requests that can be outstanding at any one time. 
Values: Default: 4096; Range: 1 to AIO_MAX (/usr/include/sys/limits.h) 
Display: lsattr -E -l aio0 -a maxreqs 
Change: chdev -l aio0 -a maxreqs=NewValue 
Change is effective after reboot and is permanent. 
Diagnosis: N/A 
Tuning: This includes requests that are in progress, as well as those that are waiting to be started. The maximum number of asynchronous I/O requests cannot be less than the value of AIO_MAX, as defined in the /usr/include/sys/limits.h file, but can be greater. It would be appropriate for a system with a high volume of asynchronous I/O to have a maximum number of asynchronous I/O requests larger than AIO_MAX. 
Refer to: Tuning Asynchronous Disk I/O 

maxservers Purpose: Specifies the maximum number of AIO kprocs per processor. 
Values: Default: 10 per processor 
Display: lsattr -E -l aio0 -a maxservers 
Change: chdev -l aio0 -a maxservers=NewValue 
Change is effective after reboot and is permanent. 
Diagnosis: N/A 
Tuning: This value limits the number of concurrent asynchronous I/O requests. The value should be about the same as the expected number of concurrent AIO requests. This tunable parameter only affects AIO on JFS file systems (or Virtual Shared Disks (VSD) before AIX 4.3.2). 
Refer to: Tuning Asynchronous Disk I/O 

minservers Purpose: Specifies the number of AIO kprocs that will be created when the AIO kernel extension is loaded. 
Values: Default: 1 
Display: lsattr -E -l aio0 -a maxservers 
Change: chdev -l aio0 -a minservers=NewValue 
Change is effective after reboot and is permanent. 
Diagnosis: N/A 
Tuning: Making this a large number is not recommended, because each process takes up some memory. Leaving this number small is acceptable in most cases because AIO will create additional kprocs up to maxservers as needed. This tunable is only effective for AIO on JFS file systems (or VSDs before AIX 4.3.2). 
Refer to: Tuning Asynchronous Disk I/O 

Disk and Disk Adapter Tunable Parameters
Disk Adapter Outstanding-Requests Limit Purpose: Maximum number of requests that can be outstanding on a SCSI bus. (Applies only to the SCSI-2 Fast/Wide Adapter.) 
Values: Default: 40; Range: 40 to 128 
Display: lsattr -E -l scsin -a num_cmd_elems 
Change: chdev -l scsin -a num_cmd_elems=NewValue 
Change is effective immediately and is permanent. If the -T flag is used, the change is immediate and lasts until the next boot. If the -P flag is used, the change is deferred until the next boot and is permanent. 
Diagnosis: Applications performing large writes to striped raw logical volumes are not obtaining the desired throughput rate. 
Tuning: Value should equal the number of physical drives (including those in disk arrays) on the SCSI bus, times the queue depth of the individual drives. 

Disk Drive Queue Depth Purpose: Maximum number of requests the disk device can hold in its queue. 
Values: Default: IBMr disks=3; Non-IBM disks=0; Range: specified by manufacturer 
Display: lsattr -E -l hdiskn 
Change: chdev -l hdiskn -a q_type=simple -a queue_depth=NewValue 
Change is effective immediately and is permanent. If the -T flag is used, the change is immediate and lasts until the next boot. If the -P flag is used, the change is deferred until the next boot and is permanent. 
Diagnosis: N/A 
Tuning: If the non-IBM disk drive is capable of request-queuing, make this change to ensure that the operating system takes advantage of the capability. 
Refer to: Setting SCSI-Adapter and Disk-Device Queue Limits 

Interprocess Communication Tunable Parameters
msgmax Purpose: Specifies maximum message size. 
Values: Dynamic with maximum value of 4 MB 
Display: N/A 
Change: N/A 
Diagnosis: N/A 
Tuning: Does not require tuning because it is dynamically adjusted as needed by the kernel. 

msgmnb Purpose: Specifies maximum number of bytes on queue. 
Values: Dynamic with maximum value of 4 MB 
Display: N/A 
Change: N/A 
Diagnosis: N/A 
Tuning: Does not require tuning because it is dynamically adjusted as needed by the kernel. 

msgmni Purpose: Specifies maximum number of message queue IDs. 
Values: Dynamic with maximum value of 131072 
Display: N/A 
Change: N/A 
Diagnosis: N/A 
Tuning: Does not require tuning because it is dynamically adjusted as needed by the kernel. 

msgmnm Purpose: Specifies maximum number of messages per queue. 
Values: Dynamic with maximum value of 524288 
Display: N/A 
Change: N/A 
Diagnosis: N/A 
Tuning: Does not require tuning because it is dynamically adjusted as needed by the kernel. 

semaem Purpose: Specifies maximum value for adjustment on exit. 
Values: Dynamic with maximum value of 16384 
Display: N/A 
Change: N/A 
Diagnosis: N/A 
Tuning: Does not require tuning because it is dynamically adjusted as needed by the kernel. 

semmni Purpose: Specifies maximum number of semaphore IDs. 
Values: Dynamic with maximum value of 131072 
Display: N/A 
Change: N/A 
Diagnosis: N/A 
Tuning: Does not require tuning because it is dynamically adjusted as needed by the kernel. 

semmsl Purpose: Specifies maximum number of semaphores per ID. 
Values: Dynamic with maximum value of 65535 
Display: N/A 
Change: N/A 
Diagnosis: N/A 
Tuning: Does not require tuning because it is dynamically adjusted as needed by the kernel. 

semopm Purpose: Specifies maximum number of operations per semop() call. 
Values: Dynamic with maximum value of 1024 
Display: N/A 
Change: N/A 
Diagnosis: N/A 
Tuning: Does not require tuning because it is dynamically adjusted as needed by the kernel. 

semume Purpose: Specifies maximum number of undo entries per process. 
Values: Dynamic with maximum value of 1024 
Display: N/A 
Change: N/A 
Diagnosis: N/A 
Tuning: Does not require tuning because it is dynamically adjusted as needed by the kernel. 

semvmx Purpose: Specifies maximum value of a semaphore. 
Values: Dynamic with maximum value of 32767 
Display: N/A 
Change: N/A 
Diagnosis: N/A 
Tuning: Does not require tuning because it is dynamically adjusted as needed by the kernel. 

shmmax Purpose: Specifies maximum shared memory segment size. 
Values: Dynamic with maximum value of 256 MB for 32-bit processes and 0x80000000u for 64-bit 
Display: N/A 
Change: N/A 
Diagnosis: N/A 
Tuning: Does not require tuning because it is dynamically adjusted as needed by the kernel. 

shmmin Purpose: Specifies minimum shared-memory-segment size. 
Values: Dynamic with minimum value of 1 
Display: N/A 
Change: N/A 
Diagnosis: N/A 
Tuning: Does not require tuning because it is dynamically adjusted as needed by the kernel. 

shmmni Purpose: Specifies maximum number of shared memory IDs. 
Values: Dynamic with maximum value of 131072 
Display: N/A 
Change: N/A 
Diagnosis: N/A 
Tuning: Does not require tuning because it is dynamically adjusted as needed by the 


69. AIX TUNABLE ENVIRONMENT PARAMETERS:
=======================================

Thread Support Tunable Parameters
Following is a list of thread support parameters that can be tuned:

AIXTHREAD_COND_DEBUG (AIX 4.3.3 and subsequent versions) Purpose: Maintains a list of condition variables for use by the debugger. 
Values: Default: ON 
Range: ON, OFF 
Display: echo $AIXTHREAD_COND_DEBUG (this is turned on internally, so the initial default value will not be seen with the echo command) 
Change: AIXTHREAD_COND_DEBUG={ON|OFF} 
export AIXTHREAD_COND_DEBUG 
Change takes effect immediately in this shell. Change is effective until logging out of this shell. 
Permanent change is made by adding AIXTHREAD_COND_DEBUG={ON|OFF} command to the /etc/environment file. 
Diagnosis: Leaving it on makes debugging threaded applications easier, but may impose some overhead. 
Tuning: If the program contains a large number of active condition variables and frequently creates and destroys condition variables, this may create higher overhead for maintaining the list of condition variables. Setting the variable to OFF will disable the list. 
Refer to Thread Debug Options. 

AIXTHREAD_ENRUSG Purpose: Enable or disable pthread resource collection. 
Values: Default: OFF 
Range: ON, OFF 
Display: echo $AIXTHREAD_ENRUSG (this is turned on internally, so the initial default value will not be seen with the echo command) 
Change: AIXTHREAD_ENRUSG={ON|OFF} 
export AIXTHREAD_ENRUSG 
Change takes effect immediately in this shell. Change is effective until logging out of this shell. Permanent change is made by adding AIXTHREAD_ENRUSG={ON|OFF} command to the /etc/environment file. 
Diagnosis: Turning it on allows for resource collection of all pthreads in a process, but will impose some overhead. 
Tuning:  
Refer to Thread Environment Variables. 

AIXTHREAD_GUARDPAGES (AIX 4.3 and later) Purpose: Controls the number of guard pages to add to the end of the pthread stack. 
Values: Default: 0Range: A positive integer 
Display: echo $AIXTHREAD_GUARDPAGES (This is turned on internally, so the initial default value will not be seen with the echo command) 
Change: AIXTHREAD_GUARDPAGES=nexport AIXTHREAD_GUARDPAGESChange takes effect immediately in this shell. 
Change is effective until logging out of this shell. Permanent change is made by adding AIXTHREAD_GUARDPAGES=n 
command to the /etc/environment file. 
Diagnosis: N/A 
Tuning: N/A 
Refer to Thread Environment Variables. 

AIXTHREAD_MINKTHREADS (AIX 4.3 and later) Purpose Controls the the minimum number of kernel threads that should be used. 
Values: Default: 8 
Range: A positive integer value 
Display: echo $AIXTHREAD_MINKTHREADS (this is turned on internally, so the initial default value will not be seen with the echo command) 
Change: AIXTHREAD_MINKTHREADS=nexport AIXTHREAD_MINKTHREADSChange takes effect immediately in this shell. Change is effective until logging out of this shell. Permanent change is made by adding AIXTHREAD_MINKTHREADS =n command to the /etc/environment file. 
Diagnosis: N/A 
Tuning: The library scheduler will not reclaim kernel threads below this figure. A kernel thread may be reclaimed at virtually any point. Generally, a kernel thread is targeted as a result of a pthread terminating. 
Refer to: Variables for Process-Wide Contention Scope 

AIXTHREAD_MNRATIO (AIX 4.3 and later) Purpose: Controls the scaling factor of the library. This ratio is used when creating and terminating pthreads. 
Values: Default: 8:1 
Range: Two positive values (p:k), where k is the number of kernel threads that should be employed to handle p runnable pthreads 
Display: echo $AIXTHREAD_MNRATIO (this is turned on internally, so the initial default value will not be seen with the echo command) 
Change: AIXTHREAD_MNRATIO=p:kexport AIXTHREAD_MNRATIOChange takes effect immediately in this shell. Change is effective until logging out of this shell. Permanent change is made by adding AIXTHREAD_MNRATIO=p:k command to the /etc/environment file. 
Diagnosis: N/A 
Tuning: May be useful for applications with a very large number of threads. However, always test a ratio of 1:1 because it may provide for better performance. 
Refer to: Variables for Process-Wide Contention Scope 

AIXTHREAD_MUTEX_DEBUG (AIX 4.3.3 and later) Purpose: Maintains a list of active mutexes for use by the debugger. 
Values: Default: OFF 
Range: ON, OFF 
Display: echo $AIXTHREAD_MUTEX_DEBUG (this is turned on internally, so the initial default value will not be seen with the echo command) 
Change: AIXTHREAD_MUTEX_DEBUG={ON|OFF}export AIXTHREAD_MUTEX_DEBUGChange takes effect immediately in this shell. Change is effective until logging out of this shell. Permanent change is made by adding AIXTHREAD_MUTEX_DEBUG={ON|OFF} command to the /etc/environment file. 
Diagnosis: Setting the variable to ON makes debugging threaded applications easier, but may impose some overhead. 
Tuning: If the program contains a large number of active mutexes and frequently creates and destroys mutexes, this may create higher overhead for maintaining the list of mutexes. Leaving the variable off disables the list. 
Refer to: Thread Debug Options 

AIXTHREAD_RWLOCK_DEBUG (AIX 4.3.3 and later) Purpose: Maintains a list of read-write locks for use by the debugger. 
Values: Default: ON 
Range: ON, OFF 
Display: echo $AIXTHREAD_RWLOCK_DEBUG (this is turned on internally, so the initial default value will not be seen with the echo command) 
Change: AIXTHREAD_RWLOCK_DEBUG={ON|OFF}export AIXTHREAD_RWLOCK_DEBUGChange takes effect immediately in this shell. Change is effective until logging out of this shell. Permanent change is made by adding AIXTHREAD_RWLOCK_DEBUG={ON|OFF} command to the /etc/environment file. 
Diagnosis: Leaving it on makes debugging threaded applications easier, but may impose some overhead. 
Tuning: If the program contains a large number of active read-write locks and frequently creates and destroys read-write locks, this may create higher overhead for maintaining the list of read-write locks. Setting the variable to OFF will disable the list. 
Refer to: Thread Debug Options 

AIXTHREAD_SCOPE (AIX 4.3.1 and later) Purpose: Controls contention scope. P signifies process-based 
contention scope (M:N). S signifies system-based contention scope (1:1). 
Values: Default: P 
Possible Values: P or S 
Display: echo $AIXTHREAD_SCOPE (this is turned on internally, so the initial default value will not be seen 
with the echo command) 
Change: AIXTHREAD_SCOPE={P|S}export AIXTHREAD_SCOPE Change takes effect immediately in this shell. 
Change is effective until logging out of this shell. Permanent change is made by adding AIXTHREAD_SCOPE={P|S} 
command to the /etc/environment file. 

Diagnosis: If fewer threads are being dispatched than expected, then system scope should be tried. 
Tuning: Tests on AIX 4.3.2 have shown that certain applications can perform much better with system based 
contention scope (S). The use of this environment variable impacts only those threads created with the 
default attribute. The default attribute is employed when the attr parameter to pthread_create is NULL. 
Refer to: Thread Environment Variables 


AIXTHREAD_SLPRATIO (AIX 4.3 and later) Purpose: Controls the number of kernel threads that should be held in reserve for sleeping threads. 
Values: Default: 1:12 
Range: Two positive values (k:p), where k is the number of kernel threads that should be held in reserve for p sleeping pthreads 
Display: echo $AIXTHREAD_SLPRATIO (this is turned on internally, so the initial default value will not be seen with the echo command) 
Change: AIXTHREAD_SLPRATIO=k:pexport AIXTHREAD_SLPRATIOChange takes effect immediately in this shell. Change is effective until logging out of this shell. Permanent change is made by adding AIXTHREAD_SLPRATIO=k:p command to the /etc/environment file. 
Diagnosis: N/A 
Tuning: In general, fewer kernel threads are required to support sleeping pthreads, because they are generally woken one at a time. This conserves kernel resources. 
Refer to: Variables for Process-Wide Contention Scope 

AIXTHREAD_STK=n (AIX 4.3.3 ML 09 and later) Purpose: The decimal number of bytes that should be allocated for each pthread. This value may be overridden by pthread_attr_setstacksize. 
Values: Default: 98,304 bytes for 32bit applications, 196,608 bytes for 64bit applications. 
Range: Decimal integer values from 0 to 268,435,455 which will be rounded up to the nearest page (currently 4,096). 
Display: echo $AIXTHREAD_STK (this is turned on internally, so the initial default value will not be seen with the echo command) 
Change: AIXTHREAD_STK=size export AIXTHREAD_STK Change takes effect immediately in this shell. Change is effective until logging out of this shell. Permanent change is made by adding AIXTHREAD_STK=size to the /etc/environment file. 
Diagnosis: If analysis of a failing program indicates stack overflow, the default stack size can be increased. 
Tuning: If trying to reach the 32,000 thread limit on a 32 bit application, it may be necessary to decrease the default stack size. 

MALLOCBUCKETS (Version 4.3.3.25 and later) Purpose: Enables buckets-based extension in the default memory allocator which may enhance performance of applications that issue large numbers of small allocation requests. 
Values: MALLOCTYPE=buckets 
 

MALLOCBUCKETS=[[ number_of_buckets:n | bucket_sizing_factor:n | blocks_per_bucket:n | bucket_statistics:[stdout|stderr|pathname]],...] 
The following table displays default values of MALLOCBUCKETS. MALLOCBUCKETS Default Values

MALLOCBUCKETS Options 
Default Value 
number_of_buckets1 
16 
bucket_sizing_factor (32-bit)2 
32 
bucket_sizing_factor (64-bit)3 
64 
blocks_per_bucket 
10244  
Notes:

1. The minimum value allowed is 1. The maximum value allowed is 128.

2. For 32-bit implementations, the value specified for bucket_sizing_factor must be a multiple of 8.

3. For 64-bit implementations, the value specified for bucket_sizing_factor must be a multiple of 16.

4. The bucket_statistics option is disabled by default.
 
Display: echo $MALLOCBUCKETS; echo $MALLOCTYPE 
Change: Use the shell specific method of exporting the environment variables. 
Diagnosis: If malloc performance is slow and many small malloc requests are issued, this feature may enhance performance. 
Tuning: To enable malloc buckets, the MALLOCTYPE environment variable has to be set to the value "buckets". 
 

The MALLOCBUCKETS environment variable may be used to change the default configuration of the malloc buckets, although the default values should be sufficient for most applications. 
 

The number_of_buckets:n option can be used to specify the number of buckets available per heap, where n is the number of buckets. The value specified for n will apply to all available heaps. 
 

The bucket_sizing_factor:n option can be used to specify the bucket sizing factor, where n is the bucket sizing factor in bytes. 
 

The blocks_per_bucket:n option can be used to specify the number of blocks initially contained in each bucket, where n is the number of blocks. This value is applied to all of the buckets. The value of n is also used to determine how many blocks to add when a bucket is automatically enlarged because all of its blocks have been allocated. 
 

The bucket_statistics option will cause the malloc subsystem to output a statistical summary for malloc buckets upon typical termination of each process that calls the malloc subsystem while malloc buckets is enabled. This summary will show buckets configuration information and the number of allocation requests processed for each bucket. If multiple heaps have been enabled by way of malloc multiheap, the number of allocation requests shown for each bucket will be the sum of all allocation requests processed for that bucket for all heaps. 
 

The buckets statistical summary will be written to one of the following output destinations, as specified with the bucket_statistics option. 
stdout 
Standard output 
stderr 
Standard error 
pathname 
A user-specified pathname 
 

If a user-specified pathname is provided, statistical output will be appended to the existing contents of the file (if any). Avoid using standard output as the output destination for a process whose output is piped as input into another process. 
Refer to: Malloc Buckets 

MALLOCMULTIHEAP (AIX 4.3.1 and later) Purpose: Controls the number of heaps within the process private segment. 
Values: Default: 16 for 4.3.1 and 4.3.2, 32 for 4.3.3 and later 
Range: A positive number between 1 and 32) 
Display: echo $MALLOCMULTIHEAP (this is turned on internally, so the initial default value will not be seen with the echo command) 
Change: MALLOCMULTIHEAP=[[heaps:n | considersize],...] export MALLOCMULTIHEAPChange takes effect immediately in this shell. Change is effective until logging out of this shell. Permanent change is made by adding MALLOCMULTIHEAP=[[heaps:n | considersize],...] command to the /etc/environment file. 
Diagnosis: Look for lock contention on the malloc lock (located in segment F) or fewer than expected runnable threads. 
Tuning: Smaller number of heaps can help reduce size of the process. Certain multithreaded user processes which use the malloc subsystem heavily may obtain better performance by exporting the environment variable MALLOCMULTIHEAP=1 before starting the application. 
 

The potential performance enhancement is particularly likely for multithreaded C++ programs, because these may make use of the malloc subsystem whenever a constructor or destructor is called. 
 

Any available performance enhancement will be most evident when the multithreaded user process is running on an SMP system, and particularly when system scope threads are used (M:N ratio of 1:1). However, in some cases, enhancement may also be evident under other conditions, and on uniprocessors. 
 

If the considersize option is specified, an alternate heap selection algorithm is used that tries to select an available heap that has enough free space to handle the request. This may minimize the working set size of the process by reducing the number of sbrk() calls. However, there is a bit more processing time required for this algorithm. 
Refer to: Thread Environment Variables 

SPINLOOPTIME Purpose: Controls the number of times to retry a busy lock before yielding to another processor (only for libpthreads). 
Values: Default: 1 on uniprocessors, 40 on multiprocessors 
Range: A positive integer 
Display: echo $SPINLOOPTIME (this is turned on internally, so the initial default value will not be seen with the echo command) 
Change: SPINLOOPTIME=nexport SPINLOOPTIMEChange takes effect immediately in this shell. Change is effective until logging out of this shell. Permanent change is made by adding SPINLOOPTIME=n command to the /etc/environment file. 
Diagnosis: If threads are going to sleep often (lot of idle time), then the SPINLOOPTIME may not be high enough. 
Tuning: Increasing the value from default of 40 on multiprocessor systems might be of benefit if there is pthread mutex contention. 
Refer to: Thread Environment Variables 

YIELDLOOPTIME Purpose: Controls the number of times to yield the processor before blocking on a busy lock (only for libpthreads). The processor is yielded to another kernel thread, assuming there is another runnable kernel thread with sufficient priority. 
Values: Default: 0 
Range: A positive value 
Display: echo $YIELDLOOPTIME (this is turned on internally, so the initial default value will not be seen with the echo command) 
Change: YIELDLOOPTIME=nexport YIELDLOOPTIMEChange takes effect immediately in this shell. Change is effective until logging out of this shell. Permanent change is made by adding YIELDLOOPTIME=n command to the /etc/environment file. 
Diagnosis: If threads are going to sleep often (lot of idle time), then the YIELDLOOPTIME may not be high enough. 
Tuning: Increasing the value from default value of 0 may benefit if you do not want the threads to go to sleep when waiting for locks. 
Refer to: Thread Environment Variables 

Miscellaneous Tunable Parameters
Following is a list of miscellaneous parameters that can be tuned:

EXTSHM (AIX 4.2.1 and later) Purpose: Turns on the extended shared memory facility. 
Values: Default: Not set 
Possible Value: ON 
Display: echo $EXTSHM 
Change: EXTSHM=ON export EXTSHMChange takes effect immediately in this shell. Change is effective until logging out of this shell. Permanent change is made by adding EXTSHM=ON command to the /etc/environment file. 
Diagnosis: N/A 
Tuning: Setting value to ON will allow a process to allocate shared memory segments as small as 1 byte (though this will be rounded up to the nearest page); this effectively removes the limitation of 11 user shared memory segments. Maximum size of all segments together can still only be 2.75 GB worth of memory for 32-bit processes. 64-bit processes do not need to set this variable since a very large number of segments is available. Some restrictions apply for processes that set this variable, and these restrictions are the same as with processes that use mmap buffers. 
Refer to: Extended Shared Memory (EXTSHM) 

LDR_CNTRL Purpose: Allows tuning of the kernel loader. 
Values: Default: Not set Possible Values: PREREAD_SHLIB, LOADPUBLIC, IGNOREUNLOAD, USERREGS, MAXDATA, DSA, PRIVSEG_LOADS 
Display: echo $LDR_CNTRL 
Change: LDR_CNTRL={PREREAD_SHLIB | LOADPUBLIC| ...} export LDR_CNTRLChange takes effect immediately in this shell. Change is effective until logging out of this shell. Permanent change is made by adding the following line to the /etc/environment file: LDR_CNTRL={PREREAD_SHLIB | LOADPUBLIC| ...} 
Diagnosis: N/A 
Tuning: The LDR_CNTRL environment variable can be used to control one or more aspects of the system loader behavior. You can specify multiple options with the LDR_CNTRL variable. When doing this, separate the options using an @ character (that is, LDR_CNTRL=PREREAD_SHLIB@LOADPUBLIC). Specifying the PREREAD_SHLIB option will cause entire libraries to be read as soon as they are accessed. With VMM readahead tuned, a library can be read in from disk and be cached in memory by the time the program starts to access its pages. While this method can use more memory, it can enhance performance of programs that use many shared library pages providing the access pattern is non-sequential. (for example, Catia). Specifying the LOADPUBLIC option directs the system loader to load all modules requested by an application into the global shared library segment. If a module cannot be loaded publicly into the global shared library segment then it is loaded privately for the application. Specifying the IGNOREUNLOAD option will cause modules that are marked to be unloaded and used again (if the module has not been unloaded already). As a side effect of this option, you can end up with two different data instances for the module. Specifying the USERREGS option will tell the system to save all general-purpose user registers across system calls made by an application. This can be helpful in applications doing garbage collection. Specifying the MAXDATA option sets the maximum heap size for a process, including overriding any MAXDATA value specified in an executable. If you want to use Large Program Support with a data heap size of 0x30000000, then specify LDR_CNTRL=MAXDATA=0x30000000. To turn off Large Program Support, specify LDR_CNTRL=MAXDATA=0. Specifying the DSA (Dynamic Segment Allocation) option tells the system loader to run applications using Very Large Program Support. The DSA option is only valid for 32-bit applications. Specifying the PRIVSEG_LOADS option directs the system loader to put dynamically loaded private modules into the process private segment. This might improve the availability of memory in large memory model applications that perform private dynamic loads and tend to run out of memory in the process heap. If the process private segment lacks sufficient space, the PRIVSEG_LOADS option has no effect. The PRIVSEG_LOADS option is only valid for 32-bit applications with a non-zero MAXDATA value. 

NODISCLAIM Purpose: Controls how calls to free() are being handled. When PSALLOC is set to early, all free() calls result in a disclaim() system call. When NODISCLAIM is set to True, this does not occur. 
Values: Default: Not set 
Possible Value: True 
Display: echo $NODISCLAIM 
Change: NODISCLAIM=true export NODISCLAIMChange takes effect immediately in this shell. Change is effective until logging out of this shell. Permanent change is made by adding NODISCLAIM=true command to the /etc/environment file. 
Diagnosis: If number of disclaim() system calls is very high, you may want to set this variable. 
Tuning: Setting this variable will eliminate calls to disclaim() from free() if PSALLOC is set to early. 
Refer to: Early Page Space Allocation 

NSORDER Purpose: Overwrites the set name resolution search order. 
Values: Default: bind, nis, local 
Possible Values: bind, local, nis, bind4, bind6, local4, local6, nis4, or nis6 
Display: echo $NSORDER (this is turned on internally, so the initial default value will not be seen with the echo command) 
Change: NSORDER=value, value, ... export NSORDERChange takes effect immediately in this shell. Change is effective until logging out of this shell. Permanent change is made by adding NSORDER=value command to the /etc/environment file. 
Diagnosis: N/A 
Tuning: NSORDER overrides the /etc/netsvc.conf file. 
Refer to: Tuning Name Resolution 

PSALLOC Purpose: Sets the PSALLOC environment variable to determine the paging-space allocation policy. 
Values: Default: Not set 
Possible Value: early 
Display: echo $PSALLOC 
Change: PSALLOC=early export PSALLOCChange takes effect immediately in this shell. Change is effective until logging out of this shell. 
Diagnosis: N/A 
Tuning: To ensure that a process is not killed due to low paging conditions, this process can preallocate paging space by using the Early Page Space Allocation policy. However, this may result in wasted paging space. You may also want to set the NODISCLAIM environment variable. 
Refer to: Allocation and Reclamation of Paging Space Slots and Early Page Space Allocation 

RT_GRQ (AIX 4.3.3.1 and later) Purpose: Causes thread to be put on a global run queue rather than on a per-CPU run queue. 
Values: Default: Not set; Range: ON, OFF 
Display: echo $RT_GRQ 
Change: RT_GRQ={OFF/ONexport RT_GRQChange takes effect immediately. Change is effective until next boot. Permanent change is made by adding RT_GRQ={ON|OFF} command to the /etc/environment file. 
Diagnosis: N/A 
Tuning: May be tuned on multiprocessor systems. Set to ON, will cause the thread to be put on a global run queue. In that case, the global run queue is searched to see which thread has the best priority. This might allow to get the thread dispatched sooner and can improve performance for threads that are running SCHED_OTHER, and are interrupt driven. 
Refer to: Scheduler Run Queue 

RT_MPC (AIX 4.3.3 and later) Purpose: When running the kernel in real-time mode (see bosdebug command), an MPC can be sent to a different CPU to interrupt it if a better priority thread is runnable so that this thread can be dispatched immediately. 
Values: Default: Not set; Range: ON 
Display: echo $RT_MPC 
Change: RT_MPC=ON 
export RT_MPC 
Change takes effect immediately. Change is effective until next boot. Permanent change is made by adding RT_MPC=ON command to the /etc/environment file. 
Diagnosis: N/A 


Note on LDR_CNTRL:
------------------


Setting the maximum number of AIX data segments that a process can use (LDR_CNTRL)
In AIX, Version 4.3.3 and later, the number of segments that a process can use for data is controlled 
by the LDR_CNTRL environment variable. It is defined in the parent process of the process that 
is to be affected. For example, the following defines one additional data segment: 

export LDR_CNTRL =MAXDATA=0x10000000
start_process
unset LDR_CNTRL

It is a good idea to unset the LDR_CNTRL environment variable, so that it does not unintentionally 
affect other processes. 

Unlike other environment variables for the IBM SecureWay Directory server process (slapd), 
the LDR_CNTRLenvironment variable cannot be set as a front-end variable in the slapd32.conf file. 
It must be set as an environment variable. 

The following table shows the LDR_CNTRL setting and memory increase for various numbers of data segments: 

LDP_CNTRL Setting  	Number of Additional Segments  Process Memory Limit Increase  
Unset  				0 (default)  		256 MB  
LDR_CNTRL=MAXDATA=0x1000000  	1  			512 MB  
LDR_CNTRL=MAXDATA=0x2000000  	2  			768 MB  
LDR_CNTRL=MAXDATA=0x3000000  	3  			1 GB  
LDR_CNTRL=MAXDATA=0x4000000  	4  			1.25 GB  
LDR_CNTRL=MAXDATA=0x5000000  	5 			1.5 GB  
LDR_CNTRL=MAXDATA=0x6000000  	6  			1.75 GB  
LDR_CNTRL=MAXDATA=0x7000000  	7  			2 GB  
LDR_CNTRL=MAXDATA=0x8000000  	8  			2.25 GB


70. Charactersets and Codepages:
================================


70.1 LANG variable on UNIX systems:
-----------------------------------

Most UNIX systems use the LANG variable to specify the desired locale. Different UNIX operating systems, however, 
require different locale names to specify the same language. Be sure to use a value for LANG that is supported 
by the UNIX operating system that you are using.

To obtain the locale names for your UNIX system, enter the following: 

# locale -a

As specified by open systems standards, other environment variables override LANG for some or all 
locale categories. These variables include the following: 

LC_COLLATE 
LC_CTYPE 
LC_MONETARY 
LC_NUMERIC 
LC_TIME 
LC_MESSAGES 
LC_ALL


To verify that you have a language package installed for your UNIX or Linux system, enter the following:

# locale 

If you had loaded a language package (for example bos.loc.iso.en_us), the output of the locale command would be:

LANG=en_US
LC_COLLATE="en_US"
LC_CTYPE="en_US"
LC_MONETARY="en_US"
LC_NUMERIC="en_US"
LC_TIME="en_US"
LC_MESSAGES="en_US"
LC_ALL=

If no language packages have been installed, the output would be:

LANG=en_US
LC_COLLATE="C"
LC_CTYPE="C"
LC_MONETARY="C"
LC_NUMERIC="C"
LC_TIME="C"
LC_MESSAGES="C"
LC_ALL=


Changing the LANG variable for the Unix shell session:
# export LANG=en_US

The  LANG  environment  variable  provides  the  ability  to specify  the user's requirements for native languages, 
localcustoms and character set, as an ASCII string in the form

LANG=language[_territory[.codeset]]

A user who speaks German as it is spoken in Austria and  has a  terminal which operates in ISO 8859/1 codeset, 
would want the setting of the LANG variable to be

# export LANG=De_A.88591

With this setting it should be possible  for  that  user  to  find any  relevant catalogs should they exist.
Should  the  LANG  variable  not  be  set,  the   value   of  LC_MESSAGES  as returned by setlocale() is used.  
If this is NULL, the default path as defined in <nl_types.h> is used.


70.2 UTF-8 on Unix/Linux:
-------------------------

The proper way to activate UTF-8 is the POSIX locale mechanism. A locale is a configuration setting that 
contains information about culture-specific conventions of software behaviour, including the character encoding, 
the date/time notation, alphabetic sorting rules, the measurement system and common office paper size, etc. 
The names of locales usually consist of ISO 639-1 language and ISO 3166-1 country codes, sometimes with 
additional encoding names or other qualifiers. 

You can get a list of all locales installed on your system (usually in /usr/lib/locale/) with the command 
locale -a. Set the environment variable LANG to the name of your preferred locale. When a C program executes 
the setlocale(LC_CTYPE, "") function, the library will test the environment variables 
LC_ALL, LC_CTYPE, and LANG in that order, and the first one of these that has a value will determine which 
locale data is loaded for the LC_CTYPE category (which controls the multibyte conversion functions). 
The locale data is split up into separate categories. For example, LC_CTYPE defines the character encoding 
and LC_COLLATE defines the string sorting order. The LANG environment variable is used to set the default locale 
for all categories, but the LC_* variables can be used to override individual categories. Do not worry too much 
about the country identifiers in the locales. Locales such as en_GB (English in Great Britain) and en_AU 
(English in Australia) differ usually only in the LC_MONETARY category (name of currency, rules for printing 
monetary amounts), which practically no Linux application ever uses. LC_CTYPE=en_GB and LC_CTYPE=en_AU have exactly 
the same effect. 

You can query the name of the character encoding in your current locale with the command locale charmap. 
This should say UTF-8 if you successfully picked a UTF-8 locale in the LC_CTYPE category. The command locale -m 
provides a list with the names of all installed character encodings. 

If you use exclusively C library multibyte functions to do all the conversion between the external character 
encoding and the wchar_t encoding that you use internally, then the C library will take care of using the right 
encoding according to LC_CTYPE for you and your program does not even have to know explicitly what the current 
multibyte encoding is. 

Users have to select a UTF-8 locale, for example with 

# export LANG=en_GB.UTF-8 
# export LANG en_US.UTF-8

in order to activate the UTF-8 support in applications. 

Note:

For some apps you must have the LANG and LC_ALL environment variables set to the appropriate locale 
in your current session before you start that app.

X11.loc.NN_NN for the UTF-8 locale 


70.3 Listing of locale env. vars:
---------------------------------

LANG
This variable determines the locale category for native language, local customs and coded character set 
in the absence of the LC_ALL and other LC_* (LC_COLLATE, LC_CTYPE, LC_MESSAGES, LC_MONETARY, LC_NUMERIC, 
LC_TIME) environment variables. This can be used by applications to determine the language to use for 
error messages and instructions, collating sequences, date formats, and so forth. 

LC_ALL
This variable determines the values for all locale categories. The value of the LC_ALL environment variable 
has precedence over any of the other environment variables starting with LC_ (LC_COLLATE, LC_CTYPE, LC_MESSAGES, 
LC_MONETARY, LC_NUMERIC, LC_TIME) and the LANG environment variable. 

LC_COLLATE
This variable determines the locale category for character collation. It determines collation information 
for regular expressions and sorting, including equivalence classes and multi-character collating elements, 
in various utilities and the strcoll() and strxfrm() functions. Additional semantics of this variable, if any, 
are implementation-dependent. 

LC_CTYPE
This variable determines the locale category for character handling functions, such as tolower(), toupper() 
and isalpha(). This environment variable determines the interpretation of sequences of bytes of text data 
as characters (for example, single- as opposed to multi-byte characters), the classification of characters 
(for example, alpha, digit, graph) and the behaviour of character classes. Additional semantics of 
this variable, if any, are implementation-dependent. 

LC_MESSAGES
This variable determines the locale category for processing affirmative and negative responses and the language 
and cultural conventions in which messages should be written. It also affects the behaviour of the 
catopen() function in determining the message catalogue. Additional semantics of this variable, if any, 
are implementation-dependent. The language and cultural conventions of diagnostic and informative messages 
whose format is unspecified by this specification set should be affected by the setting of LC_MESSAGES. 

LC_MONETARY
This variable determines the locale category for monetary-related numeric formatting information. 
Additional semantics of this variable, if any, are implementation-dependent. 

LC_NUMERIC
This variable determines the locale category for numeric formatting (for example, thousands separator 
and radix character) information in various utilities as well as the formatted I/O operations in printf() 
and scanf() and the string conversion functions in strtod(). Additional semantics of this variable, if any, 
are implementation-dependent. 

LC_TIME
This variable determines the locale category for date and time formatting information. It affects the behaviour 
of the time functions in strftime(). Additional semantics of this variable, if any, are implementation-dependent. 

NLSPATH
This variable contains a sequence of templates that the catopen() function uses when attempting to locate 
message catalogues. Each template consists of an optional prefix, one or more substitution fields, a filename 
and an optional suffix. For example: 
NLSPATH="/system/nlslib/%N.cat"


71. ar, ld commands:
====================

Note 1:
-------

ar Command

Purpose

Maintains the indexed libraries used by the linkage editor.

Syntax

ar [ -c ] [ -l ] [ -g | -o ] [ -s ] [ -v ] [ -C ] [ -T ] [ -z ] { -h | -p | -t |
-x } [ -X {32|64|32_64}] ArchiveFile [ File ... ]

ar [ -c ] [ -l ] [ -g | -o ] [ -s ] [ -v ] [ -C ] [ -T ] [ -z ] { -m | -r [ -u ]
} [ { -a | -b | -i } PositionName ] [ -X {32|64|32_64}] ArchiveFile File ...

ar [ -c ] [ -l ] [ -g | -o ] [ -s ] [ -v ] [ -C ] [ -T ] [ -z ] { -d | -q } [ -X
{32|64|32_64}] ArchiveFile File ...

ar [ -c ] [ -l ] [ -v ] [ -C ] [ -T ] [ -z ] { -g | -o | -s | -w } [ -X
{32|64|32_64}] ArchiveFile

Description

The ar command maintains the indexed libraries used by the linkage editor. The
ar command combines one or more named files into a single archive file written
in ar archive format. When the ar command creates a library, it creates headers
in a transportable format; when it creates or updates a library, it rebuilds the
symbol table. See the ar file format entry for information on the format and
structure of indexed archives and symbol tables.

There are two file formats that the ar command recognizes. The Big Archive
Format, ar_big, is the default file format and supports both 32-bit and 64-bit
object files. The Small Archive Format can be used to create archives that are
recognized on versions older than AIX 4.3, see the -g flag. If a 64-bit object
is added to a small format archive, ar first converts it to the big format,
unless -g is specified. By default, ar only handles 32-bit object files; any
64-bit object files in an archive are silently ignored. To change this behavior,
use the -X flag or set the OBJECT_MODE environment variable.

Flags

In an ar command, you can specify any number of optional flags from the set
cClosTv. You must specify one flag from the set of flags dhmopqrstwx. If you
select the -m or -r flag, you may also specify a positioning flag (-a, -b, or
-i); for the -a, -b, or -i flags, you must also specify the name of a file
within ArchiveFile (PositionName), immediately following the flag list and
separated from it by a blank.

-a PositionName Positions the named files after the existing file identified by
the PositionName parameter.

-b PositionName Positions the named files before the existing file identified by
the PositionName parameter.

-c Suppresses the normal message that is produced when library is created.

-C Prevents extracted files from replacing like-named files in the file system.

-d Deletes the named files from the library.

-g Orders the members of the archive to ensure maximum loader efficiency with a
minimum amount of unused space. In almost all cases, the -g flag physically
positions the archive members in the order in which they are logically linked.
The resulting archive is always written in the small format, so this flag can be
used to convert a big-format archive to a small-format archive. Archives that
contain 64-bit XCOFF objects cannot be created in or converted to the small
format.

-h Sets the modification times in the member headers of the named files to the
current date and time. If you do not specify any file names, the ar command sets
the time stamps of all member headers. This flag cannot be used with the -z
flag.

-i PositionName Positions the named files before the existing file identified by
the PositionName parameter (same as the -b).

-m Moves the named files to some other position in the library. By default, it
moves the named files to the end of the library. Use a positioning flag (abi) to
specify some other position.

-o Orders the members of the archive to ensure maximum loader efficiency with a
minimum amount of unused space. In almost all cases, the -o flag physically
positions the archive members in the order in which they are logically linked.
The resulting archive is always written in the big archive format, so this flag
can be used to convert a small-format archive to a big-format archive.

-p Writes to standard output the contents of the named in the Files parameter,
or all files specified in the ArchiveFile parameter if you do not specify any
files.

-q Adds the named files to the end of the library. In addition, if you name the
same file twice, it may be put in the library twice.

-r Replaces a named file if it already appears in the library. Because the named
files occupy the same position in the library as the files they replace, a
positioning flag does not have any additional effect. When used with the -u flag
(update), the -r flag replaces only files modified since they were last added to
the library file.

If a named file does not already appear in the library, the ar command adds it.
In this case, positioning flags do affect placement. If you do not specify a
position, new files are placed at the end of the library. If you name the same
file twice, it may be put in the library twice.

-s Forces the regeneration of the library symbol table whether or not the ar
command modifies the library contents. Use this flag to restore the library
symbol table after using the strip command on the library.

-t Writes to the standard output a table of contents for the library. If you
specify file names, only those files appear. If you do not specify any files,
the -t flag lists all files in the library.

-T Allows file name truncation if the archive member name is longer than the
file system supports. This option has no effect because the file system supports
names equal in length to the maximum archive member name of 255 characters.

-u Copies only files that have been changed since they were last copied (see the
-r flag discussed previously).

-v Writes to standard output a verbose file-by-file description of the making of
the new library. When used with the -t flag, it gives a long listing similar to
that of the ls -l command. When used with the -x flag, it precedes each file
with a name. When used with the -h flag, it lists the member name and the
updated modification times.

-w Displays the archive symbol table. Each symbol is listed with the name of the
file in which the symbol is defined.

-x Extracts the named files by copying them into the current directory. These
copies have the same name as the original files, which remain in the library. If
you do not specify any files, the -x flag copies all files out of the library.
This process does not alter the library.

-X mode Specifies the type of object file ar should examine. The mode must be
one of the following:

32
  Processes only 32-bit object files
64
  Processes only 64-bit object files
32_64
  Processes both 32-bit and 64-bit object files

The default is to process 32-bit object files (ignore 64-bit objects). The mode
can also be set with the OBJECT_MODE environment variable. For example,
OBJECT_MODE=64 causes ar to process any 64-bit objects and ignore 32-bit
objects. The -X flag overrides the OBJECT_MODE variable.


72. REMARKS ON PRINTING IN AIX:
===============================

The following defines terms commonly used when discussing UNIX printing.

* Print Job
A print job is a unit of work to be run on a printer. A print job can consist of
printing one or more files depending on how the print job is requested. The
system assigns a unique job number to each job it runs.

* Queue
The queue is where you direct a print job. It is a stanza in the /etc/qconfig
file whose name is the name of the queue and points to the associated
queue device.

* Queue Device
The queue device is the stanza in the /etc/qconfig file that normally follows
the local queue stanza. It specifies the /dev file (printer device) that should
be used.

* qdaemon
The qdaemon is a process that runs in the background and controls the
queues. It is generally started during IPL.

* Print Spooler
The spooler is not specifically a print job spooler. Instead, it provides a
generic spooling function that can be used for queuing various types of
jobs including print jobs queued to a printer.
The spooler does not normally know what type of job it is queuing

The main spooler command is the enq command. Although you can invoke
this command directly to queue a print job, three front-end commands are
defined for submitting a print job: The lp, lpr, and qprt commands. A print
request issued by one of these commands is first passed to the enq
command, which then places the information about the file in the queue for
the qdaemon to process.

* Real Printer
A real printer is the printer hardware attached to a serial or parallel port at
a unique hardware device address. The printer device driver in the kernel
communicates with the printer hardware and provides an interface
between the printer hardware and a virtual printer, but it is not aware of
the concept of virtual printers. Real printers sometimes run out of paper.

* Local and Remote Printers
When you attach a printer to a node or host, the printer is referred to as a
local printer. A remote print system allows nodes that are not directly
linked to a printer to have printer access.
To use remote printing facilities, the individual nodes must be connected
to a network using the Transmission Control Protocol/Internet Protocol
(TCP/IP) and must support the required TCP/IP applications.

* Printer Backend
The printer backend is a collection of programs called by the spooler's
qdaemon command to manage a print job that is queued for printing. The
printer backend performs the following functions:

- Receives from the qdaemon command a list of one or more files to be
printed
- Uses printer and formatting attribute values from the database
overridden by flags entered on the command line
- Initializes the printer before printing a file
- Runs filters as necessary to convert the print data stream to a format
supported by the printer
- Provides filters for simple formatting of ASCII documents
- Provides support for printing national language characters
- Passes the filtered print data stream to the printer device driver
- Generates header and trailer pages
- Generates multiple copies
- Reports paper out, intervention required, and printer error conditions
- Reports problems detected by the filters
- Cleans up after a print job is canceled
- Provides a print environment that a system administrator can
customize to address specific printing needs

AIX supports The AIX printsubsystem and the System5 BSD like printsubsystem.

- Devices and Drivers:

Local printing to serial and parallel attached printers for both printsubsystems
is done through standard AIX device drivers.
You can add printdevices with smitty, WSM, or commandline.

In order to show the present devices, use the lsdev command:

# lsdev -Cc printer
lp0 Available 00-00-0P-00 Lexmark...
lp1 Available 00-00-S2-00 IBM...
lp2 Available 00-00-S1-00 Hewlett-Packard...

Individual device files can be listed with the ls command, for example

# ls -al /dev/lp0
crw-rw-rw- 1 root system 25,0 Oct 19 13:62 /dev/lp0

* The Print Configuration File

The file that holds the configuration for the printers that exist on the system is
the /etc/qconfig file. It is the most important file in the spooler domain for
these reasons:

- It contains the definition of every queue known to the spooler.
- A system administrator can read this file and discern the function of each
queue.
- Although it is not recommended, this file can be edited to modify spooler
queues without halting the spooler.
The /etc/qconfig file describes all of the queues defined in the AIX operating
system. A queue is a named, ordered list of requests for a specific device. A
device is something (either hardware or software) than can handle those
requests one at a time. The queue provides serial access to the device.

The following is an example of the partial contents of the /etc/qconfig file.

..
..
lpforu:
device = lp0
lp0:
file = /dev/lp0
header = never
trailer = never
access = both
backend = /usr/lib/lpd/piobe


Quick checks:
-------------

Submit print jobs 	Status print jobs 	Cancel print jobs
-----------------       -----------------       -----------------
enq 			enq -A 			enq -x
qprt 			qchk 			qcan
lp 			lpstat 			lprm
lpr 			lpq


- The lpstat command displays information about the current status of the
line printer.
The lpstat command syntax is as follows:
lpstat [ -aList ] [ -cList ] [ -d ] [ -oList ] [ -pList ] [ -r ] [ -s ]
[ -t ] [ -uList ] [ -vList ] [ -W ]
An example of the lpstat command without any flags is as follows:
# lpstat
Queue Dev Status Job Files User PP% Blks Cp Rnk
------ ---- ------- --- ---------------- ------------ --- ---- -- ---
lpforu lp0 READY

- The qchk command displays the current status information regarding
specified print jobs, print queues, or users.
The qchk command syntax is as follows:
qchk [ -A ] [ -L | -W ] [ -P Printer ] [ -# JobNumber ] [ -q ] [ -u
UserName ] [ -w Delay ]
An example of the qchk command without any flags is as follows:
# qchk
Queue Dev Status Job Files User PP% Blks Cp Rnk
------ ---- ------- --- ---------------- ------------ --- ---- -- ---
lpforu lp0 READY

- The lpq command reports the status of the specified job or all jobs
associated with the specified UserName and JobNumber variables.
The lpq command syntax is as follows:
lpq [ + [ Number ] ] [ -l | -W ] [-P Printer ] [JobNumber] [UserName]
The following is an example of the lpq command without any flags.
# lpq
Queue Dev Status Job Files User PP% Blks Cp Rnk
------ ---- ------- --- ---------------- ------------ --- ---- -- ---
lpforu lp0 READY

- The lpr command uses a spooling daemon to print the named File
parameter when facilities become available.
The lpr command syntax is as follows:
lpr [ -f ] [ -g ] [ -h ] [ -j ] [ -l ] [ -m ] [ -n ] [ -p ] [ -r ] [ -s
] [ -P Printer ] [ -# NumberCopies ] [ -C Class ] [ -J Job ] [ -T Title
] [ -i [ NumberColumns ] ] [ -w Width ] [ File ... ]
The following is an example of using the lpr command to print the file
/etc/passwd.
# lpr /etc/passwd
# lpstat
Queue Dev Status Job Files User PP % Blks Cp Rnk
------ ---- -------- --- ---------------- -------- ---- -- ---- -- ---
lpforu lp0 RUNNING 3 /etc/passwd root 1 100 1 1 1


Example: 
--------

>>>> Stopping the Print Queue

In the following scenario, you have a job printing on a print queue, but you
need to stop the queue so that you can put more paper in the printer.

# lpstat -vlpforu

Queue Dev Status Job Files User PP % Blks Cp Rnk
------ ---- -------- --- ---------------- -------- ---- -- ---- -- ---
lpforu lp0 RUNNING 3 /etc/passwd root 1 100 1 1 1

Disable the print queue using the enq command as shown in the following
example. See Table 41 on page 355 for a list of enq command flags.

# enq -D -P 'lpforu:lp0'

Checking the printer queue using the qchk command as shown in the
following example. 

# qchk -Plpforu
Queue Dev Status Job Files User PP % Blks Cp Rnk
------ ---- -------- --- ---------------- -------- ---- -- ---- -- ---
lpforu lp0 DOWN 3 /etc/passwd root 1 100 1 1 1


>>>> Starting the Print Queue

You have replaced the paper, and you now want to restart the print queue so
that it will finish your print job. Here is how you would do this.

# lpstat -vlpforu
Queue Dev Status Job Files User PP % Blks Cp Rnk
------ ---- -------- --- ---------------- -------- ---- -- ---- -- ---
lpforu lp0 DOWN 3 /etc/passwd root 1 100 1 1 1

# enq -U -P 'lpforu:lp0'

# qchk -P lpforu
Queue Dev Status Job Files User PP % Blks Cp Rnk
------ ---- -------- --- ---------------- -------- ---- -- ---- -- ---
lpforu lp0 RUNNING 3 /etc/passwd root 1 100 1 1


- Adding a local print queue:

# smitty, or smitty printer, or smitty mkpq
or use
# mkque 
# mkquedev


There is a n:1 relation between queues and a device: 
multiple queues can be associated to one device.


- Displaying queue configuration information:

# smitty lsallq
# lsallq -c

_ Deleting a queue:

# smitty rmpq
or
# rmvirprt
# rmquedev
# rmque

- Enabling and disabling a queue:

This is the same as saying starting and stopping a queue.

# smitty qstop
# smitty qstart

Or use the "qadm" command to bring printers, queues, and the spooling system up or down.

Example:

To bring down the PCL-mv200 queue, enter one of the following commands:

# qadm -D PCL-mv200
# disable PCL-mv200


- Printing job management:

System5: lp
BSD    : lpr
AIX    : qprt

1. To submit a printjob, use either lp. lpr, or qprt. All jobs will go to the system default queue
unless the PRINTER or LPDEST variables are set. You can also specify on the command line which 
queue ti ose.
Use -d with lp or use -P with qprt and lpr.
All the printcommands lp, lpr, and qprt, actually call the "enq" command, which places
the print request in a queue.

To print multiple copies, use the "qprt -N #" or "lp -n #" command.
For lpr use just a dash followed by the number of copies, like "lpr - #".

Examples:

# qprt -P funjet /tmp/testfile
# lpr -P funjet /tmp/testfile
# lp -d funjet /tmp/testfile


- Checking status of jobs:

# smitty qstatus
# smitty qchk

System5: lpstat
BSD    : lpq
AIX    : qchk

- Cancelling a printjob:

System5: cancel
BSD    : lprm
AIX    : qcan

For example to cancel Job Number 127 on whatever queue the job is on, run

# qcan -x 127 
# cancel 127

To cancel all jobs queued on printer lp0, enter

# qcan -X -Plp0
# cancel lp0


- Demons:

System5 print service demon: lpsched
AIX print spooler demon    : qdaemon

Only one subsystem can be active at a time.

To switch between subsystems, you can use smitty or the switch.prt script.

# switch.prt -s System5
# switch.prt -s AIX


- System files associated with printing:

/etc/qconfig		describes the queues and devices available for use by printing commands
/var/spool		contains files and dirs used by printing programs and daemons
/var/spool/lpd/qdir	contains info about files queued to print
/var/spool/qdaemon	contains copies of the files spooled to print
/var/spool/lpd/stat	where the info on status of jobs is stored
/var/spool/lpd/pio	holds virtual printer defenitions


73. Apache:
===========

Apache webserver can be found on almost any flavour of Unix systems. We describe some apache features
on Redhat Linux and SuZE Linux.


73.1 Apache on Redhat:
----------------------

The Apache HTTP Server is a robust, commercial-grade open source Web server developed by the Apache 
Software Foundation (http://www.apache.org/). Red Hat Linux 8.0 includes the Apache HTTP Server version 2.0 
as well as a number of server modules designed to enhance its functionality. 

The default configuration file installed with the Apache HTTP Server works without alteration 
for most situations. This chapter, however, outlines how to customize the Apache HTTP Server 
configuration file (/etc/httpd/conf/httpd.conf) for situations where the default configuration does not 
suit your needs. 

- Apache HTTP Server 2.0
Red Hat Linux 8.0 ships with version 2.0 of the Apache HTTP Server. There are important differences 
between version 2.0 and version 1.3 - which shipped with earlier releases of Red Hat Linux. 
This section reviews some of the new features of Apache HTTP Server 2.0 and outlines important changes. 
If you need to migrate a version 1.3 configuration file to the new format, refer to the Section called 
Migrating Apache HTTP Server 1.3 Configuration Files. 

- Features of Apache HTTP Server 2.0
The arrival of Apache HTTP Server 2.0 brings with it a number of new features. Among them are the following: 

. New Apache API - The Apache HTTP Server has a new, more powerful set of Application Programing Interfaces 
  (APIs) for modules. 
  Caution 
  Modules built for Apache HTTP Server 1.3 will not work without being ported to the new API. 
  If you are unsure whether or not a particular module has been ported, consult with the package maintainer 
  before upgrading. 
. Filtering - Modules for Apache HTTP Server 2.0 have the ability to act as content filters. 
  See the Section called Modules and Apache HTTP Server 2.0 for more on how filtering works. 
. IPv6 Support - Apache HTTP Server 2.0 supports next generation IP addressing. 
. Simplified Directives - A number of confusing directives have been removed while others have been simplified. 
  See the Section called Configuration Directives in httpd.conf for more information about specific directives. 
. Multilingual Error Responses - When using Server Side Include (SSI) documents, customizable error 
  response pages can be delivered in multiple languages. 
. Multiprotocol Support - Apache HTTP Server 2.0 has the ability to serve multiple protocols. 

- Packaging Changes in Apache HTTP Server 2.0
Under Red Hat Linux 8.0 the Apache HTTP Server package has been renamed. Also, some related packages 
have been renamed, deprecated, or incorporated into other packages. 
Below is a list of the packaging changes: 

.The apache, apache-devel and apache-manual packages have been renamed as httpd, httpd-devel and httpd-manual 
 respectively. 

.The mod_dav package has been incorporated into the httpd package. 

.The mod_put and mod_roaming packages have been removed, since their functionality is a subset of that 
 provided by mod_dav. 

.The mod_auth_any and mod_bandwidth packages have been removed. 

.The version number for the mod_ssl package is now synchronized with the httpd package. This means that the 
 mod_ssl package for Apache HTTP Server 2.0 has a lower version number than mod_ssl package for 
 Apache HTTP Server 1.3. 

- File System Changes in Apache HTTP Server 2.0
The following changes to the file system layout occur when upgrading to Apache HTTP Server 2.0: 

. A new configuration directory, "/etc/httpd/conf.d/", has been added. - This new directory is used to store 
  configuration files for individually packaged modules, such as mod_ssl, mod_perl, and php. The server is 
  instructed to load configuration files from this location by the directive Include conf.d/*.conf within 
  the Apache HTTP Server configuration file, /etc/httpd/conf/httpd.conf. 

Warning 
It is vital that this line be inserted when migrating an existing configuration. 
 
. The ab and logresolve programs have been moved. - These utility programs have been moved from the 
  /usr/sbin/ directory and into the /usr/bin/ directory. This will cause scripts with absolute paths for 
  these binaries to fail. 

. The dbmmanage command has been replaced. - The dbmmanage command has been replaced by htdbm. 

. The logrotate configuration file has has been renamed. - The logrotate configuration file has been renamed 
  from /etc/logrotate.d/apache to /etc/logrotate.d/httpd. 

- After Installation
After you have installed the httpd package, the Apache HTTP Server's documentation is available by 
installing the httpd-manual package and pointing a Web browser to http://localhost/manual/ or you can 
browse the Apache documentation available on the Web at http://httpd.apache.org/docs-2.0/. 

The Apache HTTP Server's documentation contains a full list and complete descriptions of all 
configuration options. For your convenience, this chapter provides short descriptions of the configuration 
directives used by Apache HTTP Server 2.0. 

The version of the Apache HTTP Server included with Red Hat Linux includes the ability to set up secure Web servers 
using the strong SSL encryption provided by the mod_ssl and openssl packages. As you look through the 
configuration files, be aware that it includes both a non-secure and a secure Web server. 
The secure Web server runs as a virtual host, which is configured in the /etc/httpd/conf.d/ssl.conf file. 


- Starting and Stopping httpd (Apache)

The the httpd RPM installs the /etc/rc.d/init.d/httpd Bourne script, which is accessed using the 
/sbin/service command. 

 To start your server, as root type: 
 # /sbin/service httpd start
 
 Note 
  If you are running the Apache HTTP Server as a secure server, you will be prompted to type your password. 
 
 To stop your server, type the command: 
 # /sbin/service httpd stop
 

The command restart is a shorthand way of stopping and then starting your server. The restart command explicitly 
stops and then starts your server. You will be prompted for your password if you are running the Apache HTTP 
Server as a secure server. The restart command looks like the following: 

 # /sbin/service httpd restart
 
If you just finished editing something in your httpd.conf file, you do not need to explicitly stop and 
start your server. Instead, you can use the reload command. 

 Note 
  If you are running the Apache HTTP Server as a secure server, you will not need to type your password when 
  using the reload option as the password will remain cached across reloads. 
 
The reload command looks like the following example: 

 # /sbin/service httpd reload
 
By default, the httpd process will not start automatically when your machine boots. You will need to configure 
the httpd service to start up at boot time using an initscript utility, such as /sbin/chkconfig, /sbin/ntsysv, 
or the Services Configuration Tool program. 

Please refer to the chapter titled Controlling Access to Services in Official Red Hat Linux Customization Guide 
for more information regarding these tools. 

Note 
If you are running the Apache HTTP Server as a secure server, you will be prompted for the secure server's 
password after the machine boots, unless you generated a specific type of server key file. 

- Configuration Directives in httpd.conf

The Apache HTTP Server configuration file is /etc/httpd/conf/httpd.conf. The httpd.conf file is well-commented 
and mostly self-explanatory. Its default configuration will work for most situations, however you should 
become familiar some of the more important configuration options. 

If you need to configure the Apache HTTP Server, edit httpd.conf and then either reload, restart, 
or stop and start the httpd process. How to reload, stop and start the Apache HTTP Server is covered in the 
Section called Starting and Stopping httpd. 

- Default Modules
The Apache HTTP Server is distributed with a number of modules. By default the following modules are installed 
and enabled with the httpd package on Red Hat Linux: 

mod_access
mod_auth
mod_auth_anon
mod_auth_dbm
mod_auth_digest
mod_include
mod_log_config
mod_env
mod_mime_magic
mod_cern_meta
mod_expires
mod_headers
mod_usertrack
mod_unique_id
mod_setenvif
mod_mime
mod_dav
mod_status
mod_autoindex
mod_asis
mod_info
mod_cgi
mod_dav_fs
mod_vhost_alias
mod_negotiation
mod_dir
mod_imap
mod_actions
mod_speling
mod_userdir
mod_alias
mod_rewrite
 

Additionally, the following modules are available by installing additional packages: 

mod_auth_mysql
mod_auth_pgsql
mod_perl
mod_python
mod_ssl
php
squirrelmail
 
- Using Virtual Hosts
You can use the Apache HTTP Server's virtual hosts capability to run different servers for different IP addresses, 
different host names, or different ports on the same server. If you are interested in using virtual hosts, 
complete information is provided in the Apache documentation on your machine or on the Web at 
http://httpd.apache.org/docs-2.0/vhosts/. 

Note 
You cannot use name-based virtual hosts with your Red Hat Linux Advanced Server, because the SSL handshake 
occurs before the HTTP request which identifies the appropriate name-based virtual host. If you want to use 
name-based virtual hosts, they will only work with your non-secure Web server. 

Virtual hosts are configured within the httpd.conf file, as described in the Section called Configuration 
Directives in httpd.conf. Please review that section before you start to change the virtual hosts configuration 
on your machine. 

The Secure Web Server Virtual Host
The default configuration of your Web server runs a non-secure and a secure server. Both servers use the same 
IP address and host name, but they listen on different ports, and the secure server is a virtual host configured. 
This configuration enables you to serve both secure and non-secure documents in an manner. Setting up the secure 
HTTP transmission is very resource intensive, so generally you will be able to serve far fewer pages per second 
with a secure server. You need to consider this when you decide what information to include on the secure server 
and the non-secure server. 

The configuration directives for your secure server are contained within virtual host tags in the 
/etc/httpd/conf.d/ssl.conf file. If you need to change anything about the configuration of your secure server, 
you will need to change the configuration directives inside the virtual host tags. 

By default, both the secure and the non-secure Web servers share the same DocumentRoot. To change the DocumentRoot 
so that it is no longer shared by both the secure server and the non-secure server, change one of the DocumentRoot 
directives. The DocumentRoot either inside or outside of the virtual host tags in httpd.conf defines the 
DocumentRoot for the non-secure Web server. The DocumentRoot within the virtual host tags in 
conf.d/ssl.conf define the document root for the secure server. 

The secure the Apache HTTP Server server listens on port 443, while your non-secure Web server listens on port 80. 
To stop the non-secure Web server from accepting connections find the line which reads: 

Then comment out any line in httpd.conf which reads Listen 80. 

Setting Up Virtual Hosts
To create a virtual host, you will need to alter the virtual host lines, provided as an example in httpd.conf 
or create your own virtual host section. 

The virtual host example lines read as follows: 

#<VirtualHost  *>
#    ServerAdmin webmaster@dummy-host.example.com
#    DocumentRoot /www/docs/dummy-host.example.com
#    ServerName dummy-host.example.com
#    ErrorLog logs/dummy-host.example.com-error_log
#    CustomLog logs/dummy-host.example.com-access_log common
#</VirtualHost>
 

Uncomment all of the lines, and add the correct information for the virtual host. 
In the first line, change * to your server's IP address. Change the ServerName to a valid DNS name to use 
for the virtual host. 

You will also need to uncomment one of the NameVirtualHost lines below: 

NameVirtualHost *

Next change the IP address to the IP address, and port if necessary, for the virtual host. When finished it will 
look similar to the following example: 

NameVirtualHost 192.168.1.1:80 
 
If you set up a virtual host and want it to listen on a non-default port, you will need to set up a virtual host 
for that port and add a Listen directive for corresponding to that port. 

Then add the port number to the first line of the virtual host configuration as in the following example: 

<VirtualHost ip_address_of_your_server:12331>

This line would create a virtual host that listens on port 12331. 
You must restart httpd to start a new virtual host. See the Section called Starting and Stopping httpd for 
instructions on how to start and stop httpd. 

 
73.2 Apache on SuSE:
--------------------

- Using Apache
To display static web pages with Apache, simply place your files in the correct directory. In SUSE LINUX, 
the correct directory is /srv/www/htdocs. A few small example pages may already be installed there. 
Use these pages to check if Apache was installed correctly and is currently active. Subsequently, you can 
simply overwrite or uninstall these pages. Custom CGI scripts are installed in /srv/www/cgi-bin. 

During operation, Apache writes log messages to the file /var/log/httpd/access_log or /var/log/apache2/access_log. 
These messages show which resources were requested and delivered at what time and with which method 
(GET, POST, etc.). Error messages are logged to /var/log/apache2. 

- Active Contents
Apache provides several possibilities for the delivery of active contents. Active contents are HTML pages 
that are generated on the basis of variable input data from the client, such as search engines that respond 
to the input of one or several search strings (possibly interlinked with logical operators like AND or OR) 
by returning a list of pages containing these search strings.

Apache offers three ways of generating active contents:

Server Side Includes (SSI) 
These are directives that are embedded in an HTML page by means of special comments. Apache interprets 
the content of the comments and delivers the result as part of the HTML page. 

Common Gateway Interface (CGI) 
These are programs that are located in certain directories. Apache forwards the parameters transmitted by the 
client to these programs and returns the output of the programs. This kind of programming is quite easy, 
especially since existing command-line programs can be designed in such a way that they accept input 
from Apache and return their output to Apache.

Module 
Apache offers interfaces for executing any modules within the scope of request processing. Apache gives these 
programs access to important information, such as the request or the HTTP headers. Programs can take part 
in the generation of active contents as well as in other functions (such as authentication). The programming 
of such modules requires some expertise. The advantages of this approach are high performance and possibilities 
that exceed those of SSI and CGI.

While CGI scripts are executed directly by Apache (under the user ID of their owner), modules are controlled 
by a persistent interpreter that is embedded in Apache. In this way, separate processes do not need to be 
started and terminated for every request (this would result in a considerable overhead for the process management, 
memory management, etc.). Rather, the script is handled by the interpreter running under the ID of the web server.

However, this approach has a catch. Compared to modules, CGI scripts are relatively tolerant of careless 
programming. With CGI scripts, errors, such as a failure to release resources and memory, do not have a 
lasting effect, because the programs are terminated after the request has been processed. This results in the 
clearance of memory that was not released by the program due to a programming error. With modules, the 
effects of programming errors accumulate, as the interpreter is persistent. If the server is not restarted 
and the interpreter runs for several months, the failure to release resources, such as database connections, 
can be quite disturbing.

Server Side Includes: SSI
Server-side includes are directives that are embedded in special comments and executed by Apache. 
The result is embedded in the output. For example, the current date can be printed with 
<!--#echo var="DATE_LOCAL" -->. The # at the end of the opening comment mark "<!--" shows Apache that this 
is an SSI directive and not a simple comment.

SSIs can be activated in several ways. The easiest approach is to search all executable files for SSIs. 

Another approach is to specify certain file types to search for SSI.

Common Gateway Interface: CGI
CGI is the abbreviation for Common Gateway Interface. With CGI, the server does not simply deliver 
a static HTML page, but executes a program that generates the page. This enables the generation of pages 
representing the result of a calculation, such as the result of the search in a database. By means of 
arguments passed to the executed program, the program can return an individual response page for every request.

The main advantage of CGI is that this technology is quite simple. The program merely must exist in a 
specific directory to be executed by the web server just like a command-line program. The server sends 
the program output on the standard output channel (stdout) to the client.

GET and POST
Input parameters can be passed to the server with GET or POST. Depending on which method is used, the server 
passes the parameters to the script in various ways. 

> With POST, the server passes the parameters to the program 
  on the standard input channel (stdin). The program would receive its input in the same way when 
  started from a console.

> With GET, the server uses the environment variable QUERY_STRING to pass the parameters to the program. 
  An environment variable is a variable made available globally by the system (such as the variable PATH, 
  which contains a list of paths the system searches for executable commands when the user enters a command).

Languages for CGI
Theoretically, CGI programs can be written in any programming language. Usually, scripting languages 
(interpreted languages), such as Perl or PHP, are used for this purpose. If speed is critical, 
C or C++ may be more suitable.

In the simplest case, Apache looks for these programs in a specific directory (cgi-bin). This directory 
can be set in the configuration file.

If necessary, additional directories can be specified. In this case, Apache searches these directories 
for executable programs. However, this represents a security risk, as any user will be able to 
let Apache execute programs (some of which may be malicious). If executable programs are restricted 
to cgi-bin, the administrator can easily see who places which scripts and programs in this directory 
and check them for any malicious intent.

Generating Active Contents with Modules
A variety of modules is available for use with Apache. The term "module" is used in two different senses. 

> First, there are modules that can be integrated in Apache for the purpose of handling specific functions, 
  such as modules for embedding programming languages. These modules are introduced below.

> Second, in connection with programming languages, modules refer to an independent group of functions, 
  classes, and variables. These modules are integrated in a program to provide a certain functionality, 
  such as the CGI modules available for all scripting languages. These modules facilitate the programming 
  of CGI applications by providing various functions, such as methods for reading the request parameters 
  and for the HTML output.

mod_perl
Perl is a popular, proven scripting language. There are numerous modules and libraries for Perl, including 
a library for expanding the Apache configuration file. The home page for Perl is http://www.perl.com/. 
A range of libraries for Perl is available in the Comprehensive Perl Archive Network (CPAN) at http://www.cpan.org/.


Setting up mod_perl
To set up mod_perl in SUSE LINUX, simply install the respective package (see Section 15.6. "Installation"). 
Following the installation, the Apache configuration file will include the necessary entries 
(see /etc/apache2/mod_perl-startup.pl). Information about mod_perl is available at http://perl.apache.org/.

mod_perl versus CGI
In the simplest case, run a previous CGI script as a mod_perl script by requesting it with a different URL. 
The configuration file contains aliases that point to the same directory and execute any scripts it contains 
either via CGI or via mod_perl. All these entries already exist in the configuration file. The alias entry for 
CGI is as follows:

ScriptAlias /cgi-bin/ "/srv/www/cgi-bin/"
The entries for mod_perl are as follows:

<IfModule mod_perl.c> 
# Provide two aliases to the same cgi-bin directory, 
# to see the effects of the 2 different mod_perl modes. 
# for Apache::Registry Mode 
ScriptAlias /perl/          "/srv/www/cgi-bin/" 
# for Apache::Perlrun Mode 
ScriptAlias /cgi-perl/      "/srv/www/cgi-bin/" 
</IfModule> 

The following entries are also needed for mod_perl. These entries already exist in the configuration file.

#
# If mod_perl is activated, load configuration information
#
<IfModule mod_perl.c>
Perlrequire /usr/include/apache/modules/perl/startup.perl
PerlModule Apache::Registry

#
# set Apache::Registry Mode for /perl Alias
#
<Location /perl>
SetHandler  perl-script
PerlHandler Apache::Registry
Options ExecCGI
PerlSendHeader On
</Location>

#
# set Apache::PerlRun Mode for /cgi-perl Alias
#
<Location /cgi-perl>
SetHandler  perl-script
PerlHandler Apache::PerlRun
Options ExecCGI
PerlSendHeader On
</Location>

</IfModule>

These entries create aliases for the Apache::Registry and Apache::PerlRun modes. The difference between these 
two modes is as follows:

Apache::Registry 
All scripts are compiled and kept in a cache. Every script is applied as the content of a subroutine. 
Although this is good for performance, there is a disadvantage: the scripts must be programmed extremely 
carefully, as the variables and subroutines persist between the requests. This means that you must reset 
the variables to enable their use for the next request. If, for example, the credit card number of a customer 
is stored in a variable in an online banking script, this number could appear again when the next customer 
uses the application and requests the same script.

Apache::PerlRun 
The scripts are recompiled for every request. Variables and subroutines disappear from the namespace between 
the requests (the namespace is the entirety of all variable names and routine names that are defined at a 
given time during the existence of a script). Therefore, Apache::PerlRun does not necessitate painstaking 
programming, as all variables are reinitialized when the script is started and no values are kept from previous 
requests. For this reason, Apache::PerlRun is slower than Apache::Registry but still a lot faster than CGI 
(in spite of some similarities to CGI), because no separate process is started for the interpreter.

mod_php4
PHP is a programming language that was especially developed for use with web servers. In contrast to other languages 
whose commands are stored in separate files (scripts), the PHP commands are embedded in an HTML page 
(similar to SSI). The PHP interpreter processes the PHP commands and embeds the processing result in the HTML page.

The home page for PHP is http://www.php.net/. For PHP to work, install mod_php4-core and, in addition, 
apache2-mod_php4 for Apache 2. 

mod_python
Python is an object-oriented programming language with a very clear and legible syntax. An unusual but convenient 
feature is that the program structure depends on the indentation. Blocks are not defined with braces (as in C and 
Perl) or other demarcation elements (such as begin and end), but by their level of indentation. The package to 
install is apache2-mod_python.

More information about this language is available at http://www.python.org/. For more information about mod_python, 
visit the URL http://www.modpython.org/.

mod_ruby
Ruby is a relatively new, object-oriented high-level programming language that resembles certain aspects of Perl 
and Python and is ideal for scripts. Like Python, it has a clean, transparent syntax. On the other hand, Python 
has adopted abbreviations, such as $.r for the number of the last line read in the input file - a feature that 
is welcomed by some programmers and abhorred by others. The basic concept of Ruby closely resembles Smalltalk.


74. Distributed shell:
======================

Note 1:
-------

DSH - distributed shell
dsh is a program which runs a single command on multiple computers at the same time. It was designed 
as a cluster tool for beowulf-style supercomputers.
 
The link address is: http://dsh.sf.net/ 

Note 2:
-------

dsh12003 Sep 17Debian-Beowulf/DancerDancer Tools reference
NAME 

dsh - Distributed shell, or dancer's shell 
SYNOPSIS 

dsh [-m machinename | -a | -g groupname ] [-r remoteshellname ] [-c | -w | -i | -F forklimit ] -- commandline 
DESCRIPTION 

dsh executes command remotely on several different machines at the same time. An utility to effectively do a 
for a in $(seq 1 10); do rsh $a command; done in bourne shell. 

OPTIONS 

The options available are as follows. 
--verbose | -v 
Give verbose output of the execution process. 

--quiet | -q 
Makes output quieter. 

--machine | -m [machinename[,machinename]*] 
Adds machinename to the list of machines that the command is exeuted. The syntax of machinename allows 
username@machinename where remote shell is invoked with the option to make it of username. 
From version 0.21.4, it is possible to specify in the format of "username@machinename,username@machinename,
username@machinename" so that multiple hosts can be specified with comma-delimited values. 

--all | -a 
Add all machines found in /etc/dsh/machines.list to the list of machines that the specified command is executed. 

--group groupname | -g groupname 
Add all machines found in /etc/dsh/group/ groupname to the list of machines that the specified command is executed. 
If groupname is on the form @netgroup then the machines in the given netgroup is used to specify the list of machines 
to execute on. 

--file machinefile | -f machinefile 
Add all machines found in the specified file to the list of machines that the specified command is executed. 
The file should list one machine specification per line (with the same syntax as the machinename argument). 
Lines starting with "#" are ignored. 
From version 0.21.4, Specifying the same machine several times using any of the machine specification options 
will result in multiple invocations merged into one. 

--remoteshell shellname | -r shellname 
Execute remote shell shellname as the remote shell. Usually any of "rsh", "remsh" or "ssh" are available 

--remoteshellopt rshoption | -o rshoption 
Add one option rshoption to the list of options passed on to the remote shell. 

--help | -h 
Output help message and exits. 

--wait-shell | -w 
Executes on each machine and waits for the execution finishing before moving on to the next machine. 

--concurrent-shell | -c 
Executes shell concurrently. 

--show-machine-names | -M 
Prepends machine names on the standard output. Useful to be used in conjunction with the --concurrent-shell option 
so that the output is slightly more parsable. 

--duplicate-input | -i 
Duplicates the input to dsh process to individual process that are remotely invoked. Needs to have --concurrent-shell set. 
Due to limitations in current implementation, it is only useful for running shell. Terminate the shell session 
with ctrl-D. 

--bufsize | -b [ buffer-size in bytes ] 
Sets the buffer size used in replicating input for --duplicate-input option. 

--version | -V 
Outputs version information and exits. 

--num-topology | -N 
Changes the current topology from 1. 1 is the default behavior of spawning the shell from one node to every node. 
Changing the number to a value greater than 2 would result in dsh being spawned on other machines as well. 

--forklimit | -F fork limit 
Similar to -c with a limit on the number of simultaneous connections. dsh will wait before creating new connection 
if the limit is reached. Useful when the number of nodes to be accessed is going somewhere above 200, 
and using -N option is not possible. 

EXIT STATUS 

The first non-zero exit code of child processes is returned, or zero if none returned non-zero exit code. 
1 if error is found in command-line specifications. 2 if signal is received from child processes. 


EXAMPLES 

dsh -a w 
Shows list of users logged in on all workstations. 


dsh -r ssh -a -- w 
Shows list of users logged in on all workstations, and use ssh command to connect. 
(It should be of note that when using ssh, ssh-agent is handy.) 

dsh -r ssh -m node1 -m node2 -c -- 'echo $HOSTNAME $(cat/proc/loadavg )' 
Shows the load average of machines node1 and node2. 


FILES 

/etc/dsh/machines.list | $(HOME)/.dsh/machines.list 
List of machine names to be used for when -a command-line option is specified. 


/etc/dsh/group/ groupname | $(HOME)/.dsh/group/ groupname 
List of machine names to be used for when -g groupname command-line option is specified. 


/etc/dsh/dsh.conf | $(HOME)/.dsh/dsh.conf 
Configuration file containing the day-to-day default. 


Note 3:
-------

PSSP's distributed shell commands "dsh" and "dshbak" are now standard in AIX 5.2. They run commands in parallel 
on multiple hosts, and format the output. The dsh commands greatly simplify managing server farms. 

The set of nodes to which commands are sent can be set on the command line or by the contents of a file named 
by the DSH_LIST environment variable. 

Here are a couple simple examples how these commands can be used. (Assume DSH_LIST has been set to the name of the 
file containing the list of servers. In this case, just three servers: dodgers, surveyor and pioneer) 

Check the clock setting on all servers: 

# dsh date
dodgers: Fri Jun  4 14:46:06 PDT 2004
surveyor: Fri Jun 4 14:16:18 PDT 2004
pioneer: Fri Jun  4 14:32:28 PDT 2004

Identify servers running fix IX37151 

# dsh "instfix -ik IX37659"
dodgers:    There was no data for IX37659 in the fix database
surveyor:    All filesets for IY37659 were found
pioneer:     All filesets for IY37659 were found

Check the hardware error logs on all servers starting 6/4/04 

# dsh "errpt -s 0604000004" 

Or check the OS level on each server: 


# dsh "lslpp -L bos.rte | grep bos.rte"

You can also use "dshbak" to group common output from the # dsh command. This makes it easier to identify 
differences when you have a lot of servers. For example, we can consolidate the output of the above instfix command 
as follows. 


# dsh "lslpp -L bos.rte"  | dshbak
HOST: dodgers
---------------------
There was no data for IX37659 in the fix database.

HOST: surveyor, pioneer
----------------------------------
All filesets for IY37659 were found

Both commands are located in the /opt/csm/bin directory. They require a little customization. 
Check the AIX documentation for more information. 


=================
CLUSTER SECTIONS:
=================


========================================
75. General Parallel File System (GPFS):
========================================


Only AIX and Linux (pSeries) related.

General Parallel File System (GPFS) is a high performance "shared-disk file system" that can provide data access 
from nodes in a cluster environment. Parallel and serial applications can readily access shared files 
using standard UNIXr file system interfaces, and the same file can be accessed concurrently from multiple nodes. 
GPFS is designed to provide high availability through logging and replication, and can be configured for failover 
from both disk and server malfunctions.

GPFS operates often within the context of a HACMP cluster, but you can build just GPFS "clusters" as well.


75.1 Creating a 2 node GPFS Cluster:
====================================

Suppose we have two nodes named node2 and node3. Our goal is to create a single GPFS filesystem,
named "/my_gpfs", consisting of 2 disks used for data and metadata. These disks are housed by two
DS4300 storage subsystems. A tiebreaker disk, in a seperate DS4100, will be used to maintain node quorom
during single nodes failures. Additionally, a "filesystem descriptor" disk for /my_gpfs is located
at the same site.

Servers: 2 Nodes= 2 x lpar; per lpar 1 cpu, 2GB RAM, 2 x FC adapter, 2 x Ethernet adapter
Storage: 2 x DS4300 for GPFS and data, 1 x DS4100 for tiebreaker disk 

Suppose further that the nodes uses the following IP addresses:
Node2: 10.1.1.32
Node3: 10.1.1.33

The Ethernet adapters per Server, are Aggregated, or configured in NIB (backup standby mode).


  Note : What are Tiebreaker disks?

  GPFS can use two types of quorum mechanisms in order to determine service availability:
  - Disk quorom
  - Node quorom

  In case availability of either of these resources is less or equal to 50%, GPFS file system services are
  automatically stopped.

  When node quorom is not met, GPFS stops its cluster-wide services and access to all filesystems
  within the cluster is no longer possible. If less than 50% of disks serving a GPFS file system fail,
  disk quorom, that is the number of "filesystem descriptors" for that particular file system, 
  is no longer met and the filesystem will be unmounted.

  To eliminate the need of a tiebreaker node, as from GPFS 2.3, a new node quorom mechanism was introduced
  for a two node cluster. Its called a tiebreaker disk. 
  If one of the two nodes goes down, we still have "enough" node qourom to keep the GPFS system running.
  Basically, a tiebreaker disk replaces a "tiebreaker node".


-- Preparations:
-- -------------

1. The systems have AIX >= 5.3ML2 installed, and gpfs.base.xxxx installed
2. Make sure names resolution is ok, either by DNS or by /etc/hosts
3. Sync the system clocks, for example by NTP
4. Make sure rcp, ssh, scp is working (via ./rhosts etc.. or ssh protocols)
5. A distributed shell (DSH) is installed on each node.
6. During cluster setup some configuration files may be created and used with GPFS commands.
   These files reside in a subdirectories in /var/mmfs.

example:

root@starboss:/var/mmfs/etc#cat mmfs.cfg
#
#   WARNING:   This is a machine generated file.  Do not edit!
#   Use the mmchconfig command to change configuration parameters.
#
clusterName cluster_name.starboss
clusterId 729741152660153204
clusterType lc
autoload no
useDiskLease yes
maxFeatureLevelAllowed 912
tiebreakerDisks gpfs3nsd;gpfs4nsd
[zd110l13]
takeOverSdrServ yes


--  Creating the GPFS cluster:
-- ---------------------------

The first step is to create a GPFS cluster named TbrCl using the command:

# mmcrcluster -n /var/mmfs/conf/nodefile -p node2 -s node3 -C TbrCl -A

A file called "nodefile" contains the cluster node information, describing the function of each node:

  # Node2 can be a file system manager and is relevant for GPFS quorum
  node2:manager-quorom 
  # Node3 can be a file system manager and is relevant for GPFS quorum
  node3:manager-quorom

Each node can fullfill the function of a file system manager and is relevant for maintaining node quorom.
A GPFS cluster designates a primary cluster manager (node2) and appoints a backup (node3) in case the
primary fails. Cluster services will be started automatically during node boot (-A). After successfully
creating the cluster, you can verify your setup:

# mmlscluster 

  GPFS cluster information
  ========================

  GPFS cluster name:		TbrCl.node2
  GPFS cluster id:		720858653441148399
  GPFS UID domain:		TbrCl.node2
  Remote shell command:		/usr/bin/rsh
  Remote file copy command:	/usr/bin/rcp

  GPFS cluster configuration servers:
  -----------------------------------
  Primary server:		node2
  Secondary server: 		node3

  Node number Node name IP address    Full node name    Remarks
  -------------------------------------------------------------
  1           node2     10.1.1.32     node2              quorom node
  2           node3     10.1.1.33     node3              quorom node


The GPFS daemon has to be started on all nodes:

# mmstartup -a

With GPFS you can administer the whole cluster from any cluster node. After starting GPFS services you
should examine the state of the cluster:

# mmgetstate -aL

  Node number Node name Quorom    Nodes up  Total nodes GPFS state
  -------------------------------------------------------------
  1           node2     2         2         2           active    
  2           node3     2         2         2           active


At this point, the cluster software is running, but you haven't done anything yet on the filesystems.


-- Configuring GPFS disks
-- ----------------------

Before starting with the configuration of GPFS disks, you have to make sure that each cluster node has
access to each SAN attached disk when running in a shared disk environment. With AIX 5L, you can use
the lspv command to verify your disks (hdisk) are properly configured:

# lspv

hdisk2   none     none
hdisk3   none     none
hdisk4   none     none
hdisk5   none     none

If you look for LUN related information (e.g. volume names) issue the following command against a
dedicated hdisk:

# lsattr -El hdisk2

..
.... (in the output, you will also see SAN stuff)
..


Its very important to keep a well balanced disk configuration when using GPFS because this makes sure
you get optimal performance by distributing I/O requests evenly among storage subsystems and attached
data disks. Keep in mind that all GPFS disks belonging to a particular file system should be of same size.


GPFS uses a mechanism called Network Shared Disk (NSD) to provide file system access to cluster nodes,
which do not have direct physical access to file system disks. A diskless node accesses an NSD via the
cluster network and I/O operations are handled as if they run against a directly attached disk from
an operating systems perspective. A special device driver handles data shipping using the cluster network.
NSDs can also be used in a purely SAN based GPFS configuration where each node can directly access
any disk. In case a node looses direct disk access, it automatically switches to NSD-mode, sending I/O
requests via network to other direct direct disk attached nodes. This mechanism increases file system
availability, and should normally be used.

When using NSD, a primary and a backup server are assigned to each NSD. In case a node looses its
direct disk attachment, it contacts the primary NSD server, or backup server in case the primary
is not available.

In order to establish NSD you need to create "descriptor files" in order to describe each 
disk functionality. In our example, we will use the following file:

# cat /var/mmfs/conf/diskfile          

  #Description of disk attributes
  #<disk name>:<primary NSD server>:<2ndary NSD server>:<disk usage>:<failure group>:<NSD name>

  #Data and metadata disk for /my_gpfs, site A, DS4300_1
  hdisk2:node2:node3:dataAndMetadata:1:

  #Data and metadata disk for /my_gpfs, site B, DS4300_2
  hdisk3:node3:node2:dataAndMetadata:2:

  #File system descriptor disk for /my_gpfs, site C, DS4100
  hdisk4:::descOnly:3:

  #Tiebreaker disk, site C, DS4100
  hdisk5:::descOnly:-1:

Here, our cluster uses 4 disks with GPFS. Filesystem "/my_gpfs" uses hdisk2 and hdisk3 for data and metadata.
Therefore these disks will use the NSD mechanism to provide file system data access in case direct disk access
fails on one of the cluster nodes.
Node2 is the primary NSD server for hdisk2 with node3 being its backup. The same is true for hdisk3, but then
the other way around.
Each of these disks belongs to a different "failure group" (1=site A, 2=site B) which basically enables
replication of file system data and metadata between the two sites.

After successfully creating the "disk descriptor file", the following command is used to define the NSDs:


# mmcrnsd -F /var/mmfs/conf/diskfile -v yes


GPFS assigns a Physical Volume ID PVID to each of the disks. This information is written to sector 2
on the AIX5L hdisk. Since GPFS uses its own PVIDs, do not confuse them with AIX5L PVIDs.

After a successful creation of the NSDs, you can verify your setup using the mmlsnsd command:


# mmlsnsd -aL

File system    Disk name     NSD Volume ID     Primary node         Backup node
-------------------------------------------------------------------------------
(free disk)    gpfs1nsd      099CAF2043A04625  node2                node3
(free disk)    gpfs2nsd      099CAF2043A04627  node3                node2
(free disk)    gpfs3nsd      099CAF2043A04628  (directly attached)
(free disk)    gpfs4nsd      099CAF2043A04629  (directly attached)

During NSD creation, the diskfile was rewritten. Each hdisk stanza is commented out, and a
equivalent NSD stanza is inserted.

  #<disk name>:<primary NSD server>:<2ndary NSD server>:<disk usage>:<failure group>:<NSD name>

  #Data and metadata disk for /my_gpfs, site A, DS4300_1
  #hdisk2:node2:node3:dataAndMetadata:1:
  gpfs1nsd:::dataAndMetadata:1

  #Data and metadata disk for /my_gpfs, site B, DS4300_2
  #hdisk3:node3:node2:dataAndMetadata:2:
  gpfs2nsd:::dataAndMetadata:2

  #File system descriptor disk for /my_gpfs, site C, DS4100
  #hdisk4:::descOnly:3:
  gpfs3nsd:::descOnly:3

  #Tiebreaker disk, site C, DS4100
  #hdisk5:::descOnly:-1:
  gpfs4nsd:::descOnly:-1


After issuing the mmcrnsd command, we have made the disks available and ready to create GPFS filesystems.

`
-- Activating tiebreaker mode
-- --------------------------

When using a two node cluster with tiebraker disks, the cluster configuration must be switched
to tiebreaker mode. Ofcourse you need to know which disks are being used as tiebreaker disks.
Up to 3 disks are allowed. In our example, gpfs4nsd (that is hdisk5) is the only tiebreaker disk.
With the following command sequence, tiebreaker mode is turned on:

# mmshutdown -a
# mmstartup -a

A 2 node cluster running in tiebreaker mode can easily be identified by running the following command:

# mmgetstate -aL


  Node number Node name   Quorom    Nodes up  Total nodes GPFS state
  ---------------------------------------------------------------
  1           node2       1*        2         2           active    
  2           node3       1*        2         2           active


If the quorum information is displayed as "1*", this is a 2 node tiebreaker disk cluster.
Another nice command to check the status of the cluster is "mmlsconfig".

# mmlsconfig

  Configuration data for cluster TbrCl.node2:
  -------------------------------------------
  ClusterName TbrCl.node2
  ClusterId 8262362723390
  ClusterType 1c
  Multinode yes
  autoload yes
  useDiskLease yes
  MaxFeatureLevelAllowed 809
  tiebreakerDisks gpfs4nsd


-- Creating a GPFS Filesystem
-- --------------------------

GPFS generally maintains at least 3 filesystem descriptors, or quorum, per filesystem.
Best would be, to have the descriptors distributed over many disks. But you might have
only 2 disks, resulting in 2 copies on one disk, and 1 copy on the other disk.
That would be an unbalanced situation. GPFS always verifies if more than 50% of the
filesystem disks are available, and if not, it will unmount the filesystem.

Before we can create the /my_gpfs filesystem we need to prepare a file named "fsdisks_mygpfs"
describing all disks belonging to the filesystem.
In our example, we use only 2 disks for the filesystem, but we like to have a balanced situation
with at least 3 descriptor area's. For this, we can use "#hdisk4:::descOnly:3:"
as shown before as an entry in the "nsd diskfile".

Our "fdisk_mygpfs" looks like this:

  #<disk name>:<primary NSD server>:<2ndary NSD server>:<disk usage>:<failure group>:<NSD name>

  #Data and metadata disk for /my_gpfs, site A, DS4300_1
  gpfs1nsd:::dataAndMetadata:1

  #Data and metadata disk for /my_gpfs, site B, DS4300_2
  gpfs2nsd:::dataAndMetadata:2

  #File system descriptor disk for /my_gpfs, site C, DS4100
  gpfs3nsd:::descOnly:3


The next step is to create the file system:

# mmcrfs /my_gpfs /dev/my_gpfs -F /var/mmfs/conf/fdisk_mygpfs -A yes -m2 -M2 -r2 -R2 -v yes


The mountpoint is /my_gpfs and a device called /dev/my_gpfs is created. The option -F is used to specify
a configuration file describing the filesystem's NSDs. We want this filesystem to be mounted automatically
during startup (-A yes). When designing our cluster, we decided to use data and metadata replication (-r2,-m2)
to provide high availability.

If you intend to create several filesystems within your cluster, repeat all the steps as shown above.


-- mounting a GPFS Filesystem
-- --------------------------

Filesystem "/my_gpfs" will be mounted on each of the cluster nodes using the command:

# dsh -a mount -t mmfs

The command dsh is the Distributed Shell, wich should be available on your AIX53 systems.
Your GPFS filesystem is also registered in /etc/filesystems. Also, standard AIX commands can be used against
the GPFS filesystems, like for example:

# dsh -w node2,node3 df -k /my_gpfs

Filesystem /my_gpfs is now available to both nodes with all three file system descripters being well
balanced across failure groups and disks.

# mmlsdisk my_gpfs

disk            driver     sector   failure   holds    holds 
name            type       size     group     metadata data  status    availability  disk id  remarks
-----------------------------------------------------------------------------------------------------
gpfs1nsd        nsd        512      1         yes      yes   ready     up             1       desc
gpfs2nsd        nsd        512      2         yes      yes   ready     up             2       desc
gpfs3nsd        nsd        512      3         no       no    ready     up             3       desc


Notes:
------

Note 1: SDD driver

Subsystem Device Driver, SDD, is a pseudo driver designed to support the multipath configuration environments
in the IBM Totalstorage Enterprise Storage Server, the IBM TotalStorage DS family, and the IBM System Storage
SAN Volume Controller.  
You can see this driver installed, for example, in HACMP and GPFS systems.
 
At this time, SSD version 1.6.1.0 is not supported by VIOS. Ofcourse, this might change later.

Note 2: pv listing:

In a gpfs cluster, a lspv might show output like the following example:

root@zd110l13:/root# lspv
hdisk0          00cb61fe0b562af0                    rootvg          active
hdisk1          00cb61fe0fb40619                    rootvg          active
hdisk2          00cb61fe33429fa6                    vge0corddap01   active
hdisk3          00cb61fe3342a096                    vge0corddap01   active
hdisk4          00cb61fe3342a175                    gpfs3nsd
hdisk5          00cb61fe33536125                    gpfs4nsd

root@zd110l13:/root# mmlsnsd -aL

 File system   Disk name    NSD volume ID      Primary node             Backup node
---------------------------------------------------------------------------------------------
 gpfsfs0       gpfs3nsd     0A208FB64650A409   zd110l13                 zd110l14.nl.eu.abnamro.com
 gpfsfs0       gpfs4nsd     0A208FB64650A40D   zd110l13                 zd110l14.nl.eu.abnamro.com


Note 3: Other examples:

Other Examples of registration of a GPFS fileystem in /etc/filesystems:

..
..
/data/documentum/dmadmin:
        dev             = /dev/gpfsfs0
        vfs             = mmfs
        nodename        = -
        mount           = mmfs
        type            = mmfs
        account         = false
        options         = rw,mtime,atime,dev=gpfsfs0
..
..


root@zd110l13:/etc# mmlsdisk /dev/gpfsfs0

disk         driver   sector failure holds    holds                            storage
name         type       size   group metadata data  status        availability pool
------------ -------- ------ ------- -------- ----- ------------- ------------ ------------
gpfs3nsd     nsd         512       1 yes      yes   ready         up           system
gpfs4nsd     nsd         512       2 yes      yes   ready         up           system


75.2 GPFS commands:
===================


75.2.1. The mmcrcluster Command:
--------------------------------

Name
mmcrcluster - Creates a GPFS cluster from a set of nodes.

Synopsis
mmcrcluster -n NodeFile -p PrimaryServer [-s SecondaryServer] [-r RemoteShellCommand] 
               [-R RemoteFileCopyCommand] [-C ClusterName] [-U DomainName] [-A] [-c ConfigFile]

Description
Use the mmcrcluster command to create a GPFS cluster.

Upon successful completion of the mmcrcluster command, the /var/mmfs/gen/mmsdrfs and the /var/mmfs/gen/mmfsNodeData 
files are created on each of the nodes in the cluster. Do not delete these files under any circumstances. 
For further information, see the General Parallel File System: Concepts, Planning, and Installation Guide.

You must follow these rules when creating your GPFS cluster:

While a node may mount file systems from multiple clusters, the node itself may only be added to a single cluster 
using the mmcrcluster or mmaddnode command. 
The nodes must be available for the command to be successful. If any of the nodes listed are not available 
when the command is issued, a message listing those nodes is displayed. You must correct the problem on each node 
and issue the mmaddnode command to add those nodes. 
You must designate at least one node as a quorum node. You are strongly advised to designate the cluster 
configuration servers as quorum nodes. How many quorum nodes altogether you will have depends on whether 
you intend to use the node quorum with tiebreaker algorithm. or the regular node based quorum algorithm. 
For more details, see the General Parallel File System: Concepts, Planning, and Installation Guide and 
search for designating quorum nodes.

Parameters
-A 
Specifies that GPFS daemons are to be automatically started when nodes come up. The default is not to start 
daemons automatically. 
-C ClusterName 
Specifies a name for the cluster. If the user-provided name contains dots, it is assumed to be a fully 
qualified domain name. Otherwise, to make the cluster name unique, the domain of the primary configuration 
server will be appended to the user-provided name. 
If the -C flag is omitted, the cluster name defaults to the name of the primary GPFS cluster configuration server.

-c ConfigFile 
Specifies a file containing GPFS configuration parameters with values different than the documented defaults. 
A sample file can be found in /usr/lpp/mmfs/samples/mmfs.cfg.sample. See the mmchconfig command for a detailed 
description of the different configuration parameters. 
The -c ConfigFile parameter should only be used by experienced administrators. Use this file to only set up 
parameters that appear in the mmfs.cfg.sample |file. Changes to any other values may be ignored by GFPS. 
When in doubt, use the mmchconfig command instead.

-n NodeFile 
NodeFile consists of a list of node descriptors, one per line, to be included in the GPFS cluster. 
Node descriptors are defined as: 

NodeName:NodeDesignationswhere: 

NodeName is the hostname or IP address to be used by GPFS for node to node communication. 
The hostname or IP address must refer to the communications adapter over which the GPFS daemons communicate. 
Alias interfaces are not allowed. Use the original address or a name that is resolved by the host command 
to that original address. You may specify a node using any of these forms:

Format Example 
Short hostname   k145n01 
Long hostname    k145n01.kgn.ibm.com 
IP address       9.119.19.102 

NodeDesignations is an optional, '-' separated list of node roles. 
manager | client   - Indicates whether a node is part of the pool of nodes from which configuration and 
                     file system managers are selected. The default is client. 
quorum | nonquorum - Indicates whether a node is to be counted as a quorum node. The default is nonquorum.

You must provide a descriptor for each node to be added to the GPFS cluster.

-p PrimaryServer 
Specifies the primary GPFS cluster configuration server node used to store the GPFS configuration data. 
This node must be a member of the GPFS cluster. 
-R RemoteFileCopy 
Specifies the fully-qualified path name for the remote file copy program to be used by GPFS. The default value is 
/usr/bin/rcp. 
The remote copy command must adhere to the same syntax format as the rcp command, but may implement an 
alternate authentication mechanism.

-r RemoteShellCommand 
Specifies the fully-qualified path name for the remote shell program to be used by GPFS. The default value is 
/usr/bin/rsh. 
The remote shell command must adhere to the same syntax format as the rsh command, but may implement an 
alternate authentication mechanism.

-s SecondaryServer 
Specifies the secondary GPFS cluster configuration server node used to store the GPFS cluster data. 
This node must be a member of the GPFS cluster. 
It is suggested that you specify a secondary GPFS cluster configuration server to prevent the loss of 
configuration data in the event your primary GPFS cluster configuration server goes down. When the GPFS daemon 
starts up, at least one of the two GPFS cluster configuration servers must be accessible.

If your primary GPFS cluster configuration server fails and you have not designated a secondary server, 
the GPFS cluster configuration files are inaccessible, and any GPFS administrative commands that are issued fail. 
File system mounts or daemon startups also fail if no GPFS cluster configuration server is available.

-U DomainName 
Specifies the UID domain name for the cluster. 
A detailed description of the GPFS user ID remapping convention is contained in UID Mapping for GPFS In a 
Multi-Cluster Environment at www.ibm.com/servers/eserver/clusters/library/wp_aix_lit.html.

Exit status

0 
Successful completion. 
1 
A failure has occurred. 

Security
You must have root authority to run the mmcrcluster command.

You may issue the mmcrcluster command from any node in the GPFS cluster.

A properly configured .rhosts file must exist in the root user's home directory on each node in the GPFS cluster. 
If you have designated the use of a different remote communication program on either the mmcrcluster or the 
mmchcluster command, you must ensure:

Proper authorization is granted to all nodes in the GPFS cluster. 
The nodes in the GPFS cluster can communicate without the use of a password, and without any extraneous messages.


Example 1:
----------

To create a GPFS cluster made of all of the nodes listed in the file /u/admin/nodelist, using node k164n05 
as the primary server, and node k164n04 as the secondary server, issue:

# mmcrcluster  -n /u/admin/nodelist -p k164n05 -s k164n04

where /u/admin/nodelist has the these contents:

k164n04.kgn.ibm.com:quorum
k164n05.kgn.ibm.com:quorum
k164n06.kgn.ibm.com

The output of the command is similar to:

Mon Aug  9 22:14:34 EDT 2004: 6027-1664 mmcrcluster: Processing node
                              k164n04.kgn.ibm.com
Mon Aug  9 22:14:38 EDT 2004: 6027-1664 mmcrcluster: Processing node 
                              k164n05.kgn.ibm.com
Mon Aug  9 22:14:42 EDT 2004: 6027-1664 mmcrcluster: Processing node 
                              k164n06.kgn.ibm.com
mmcrcluster: Command successfully completed
mmcrcluster: 6027-1371 Propagating the changes to all affected.
                       nodes. This is an asynchronous process.

To confirm the creation, enter: 

# mmlscluster

The system displays information similar to:

GPFS cluster information
========================
  GPFS cluster name:         k164n05.kgn.ibm.com
  GPFS cluster id:           680681562214606028
  GPFS UID domain:           k164n05.kgn.ibm.com
  Remote shell command:      /usr/bin/rsh
  Remote file copy command:  /usr/bin/rcp

GPFS cluster configuration servers:
-------------------------------------
  Primary server:    k164n05.kgn.ibm.com
  Secondary server:  k164n04.kgn.ibm.com

 Node number  Node name  IP address      Full node name       Remarks

--------------------------------------------------------------------------
       1      k164n04    198.117.68.68   k164n04.kgn.ibm.com  quorum node
       2      k164n05    198.117.68.69   k164n05.kgn.ibm.com  quorum node
       3      k164n06    198.117.68.70   k164n06.kgn.ibm.com  


Example 2:
----------

# mmcrcluster  -n /home/root/nodelist -p zcnodeb -s n5nodea -r /usr/bin/rsh 
  -R /usr/bin/rcp -C MDLPR -A

Where the -C option determines the clustername.

You can start the cluster (GPFS daemon) by using

# mmstartup -a

Check if all nodes are registered in the cluster

# mmlscluster


75.2.2 Other GPFS commands:
---------------------------

The most common gpfs commands, will be illustrated by examples.


-- List cluster info: mmlscluster
-- ------------------------------

# mmlscluster

The system displays information similar to:

GPFS cluster information
========================
  GPFS cluster name:         k164n05.kgn.ibm.com
  GPFS cluster id:           680681562214606028
  GPFS UID domain:           k164n05.kgn.ibm.com
  Remote shell command:      /usr/bin/rsh
  Remote file copy command:  /usr/bin/rcp

GPFS cluster configuration servers:
-------------------------------------
  Primary server:    k164n05.kgn.ibm.com
  Secondary server:  k164n04.kgn.ibm.com

 Node number  Node name  IP address      Full node name       Remarks

--------------------------------------------------------------------------
       1      k164n04    198.117.68.68   k164n04.kgn.ibm.com  quorum node
       2      k164n05    198.117.68.69   k164n05.kgn.ibm.com  quorum node
       3      k164n06    198.117.68.70   k164n06.kgn.ibm.com  


-- Retrieving the Cluster status:
-- ------------------------------

# mmgetstate -aL

  Node number Node name Quorom    Nodes up  Total nodes GPFS state
  -------------------------------------------------------------
  1           node2     2         2         2           active    
  2           node3     2         2         2           active


-- Retreiving config data of the Cluster:
-- --------------------------------------

# mmlsconfig

  Configuration data for cluster TbrCl.node2:
  -------------------------------------------
  ClusterName TbrCl.node2
  ClusterId 8262362723390
  ClusterType 1c
  Multinode yes
  autoload yes
  useDiskLease yes
  MaxFeatureLevelAllowed 809
  tiebreakerDisks gpfs4nsd


root@zd110l13:/root#mmlsconfig
Configuration data for cluster cluster_name.zd110l13:
-----------------------------------------------------
clusterName cluster_name.zd110l13
clusterId 729741152660153204
clusterType lc
autoload no
useDiskLease yes
maxFeatureLevelAllowed 912
tiebreakerDisks gpfs3nsd;gpfs4nsd
[zd110l13]
takeOverSdrServ yes

File systems in cluster cluster_name.zd110l13:
----------------------------------------------
/dev/gpfsfs0


root@zd110l13:/var/adm/ras#df -k | grep /dev/gpfsfs0
/dev/gpfsfs0   2097152000 2009668608    5%   101193     5% /data/documentum/dmadmin


-- Change the status of a disk, and listing status: mmchdisk and mmlsdisk
-- ----------------------------------------------------------------------

You can even simulate the loss of a NSD disk from a Cluster, for example

# mmchdisk my_gpfs stop -d "gpfs1nsd"
# mmlsdisk my_gpfs -L

disk            driver     sector   failure   holds    holds 
name            type       size     group     metadata data  status    availability  disk id  remarks
-----------------------------------------------------------------------------------------------------
gpfs1nsd        nsd        512      1         yes      yes   ready     down           1       desc
gpfs2nsd        nsd        512      2         yes      yes   ready     up             2       desc
gpfs3nsd        nsd        512      3         no       no    ready     up             3       desc

We have used the example of the 2 node cluster of section 74.1 here. Since the quorom is still met,
even with one disk "down", the service is still working.


-- Changes GPFS cluster configuration data. 
-- ----------------------------------------

The mmchcluster command serves different purposes: 

Change the primary or secondary GPFS cluster data server. 
Synchronize the primary GPFS cluster data server. 
Change the remote shell and remote file copy programs to be used by the nodes in the cluster. 

To change the primary GPFS server for the cluster, enter: 

# mmchcluster -p k145n03

 
-- Changes the attributes of a GPFS file system
-- --------------------------------------------

Use the mmchfs command to change the attributes of a GPFS file system.

With the mmchfs command, you can for example change the number of inodes of GPFS filesystem, like
for example


# mmchfs gpfsfs0 -F 856064:856064

Now list the properties of the gpfsfs0 filesystem:


# mmdf /dev/gpfsfs0
disk                disk size  failure holds    holds              free KB             free KB
name                    in KB    group metadata data        in full blocks        in fragments
--------------- ------------- -------- -------- ----- -------------------- -------------------
Disks in storage pool: system
gpfs3nsd              7340032        1 yes      yes         5867008 ( 80%)        434992 ( 6%)
gpfs1nsd            314572800        1 yes      yes       268067328 ( 85%)      17170032 ( 5%)
gpfs2nsd            115343360        1 no       no                0 (  0%)             0 ( 0%)
                -------------                         -------------------- -------------------
(pool total)        437256192                             273934336 ( 63%)      17605024 ( 4%)

                =============                         ==================== ===================
(total)             437256192                             273934336 ( 63%)      17605024 ( 4%)

Inode Information
-----------------
Number of used inodes:          177011
Number of free inodes:          679053
Number of allocated inodes:     856064
Maximum number of inodes:       856064
                               2048006


# mmdf /dev/gpfsfs0
# mmchfs gpfsfs0 -F 2457612:2457612
                    1228806


mmchfs ommand syntax:

mmchfs Device [-A {yes | no | automount}] [-E {yes | no}] [-D {nfs4 | posix}] 
              [-F MaxNumInodes[:NumInodesToPreallocate]] [-k {posix | nfs4 | all}] 
              [-K {no | whenpossible | always}] [-m DefaultMetadataReplicas] 
              [-o MountOptions] [-Q {yes | no}]
              [-r DefaultDataReplicas] [-S {yes | no} ] [-T Mountpoint]
              [-V] [-z {yes | no}]


To change the default replicas for metadata to 2 and the default replicas for data to 2 for new files 
created in the fs0 file system, enter:

# mmchfs fs0 -m 2 -r 2

To confirm the change, enter:

# mmlsfs fs0 -m -r

The system displays information similar to:

flag value          description
---- -------------- -----------------------------------
 -m  2              Default number of metadata replicas
 -r  2              Default number of data replicas


With the mmchfs command, you can for example also change the number of inodes of GPFS filesystem, like
for example

# mmchfs gpfsfs0 -F 856064:856064


More examples:


-- Add a node to the cluster
-- -------------------------

The mmaddnode command adds nodes to a GPFS cluster.
Use the mmaddnode command to add nodes to an existing GPFS cluster. On each new node a mount point directory
and character mode device is created for each GPFS filesystem.

Example:
To add the nodes "k164n06" and "k164n07" as quorom nodes, designating "k164n06" to be available as 
manager node, use the following command:

# mmaddnode -N k164n06:quorom-manager,k164n07:quorom


-- Mounting and unmounting GPFS file
-- ----------------------------------

Use the mmmount and mmumount to mount or unmount GPFS filesystem on one or more nodes in the cluster.

Examples:

- To mount all GPFS filesystems on all of the nodes in the cluster:

# mmmount all -a

- To mount filesystem "fs2" read-only on the local node, use

# mmmount fs2 -o ro

- To mount fs1 on all NSD server nodes, use

# mmmount fs1 -N nsdnodes  

- To unmount fs1 on all nodes of the cluster, use

# mmumount fs1 -a


-- Creates cluster-wide names for Network Shared Disks (NSDs) used by GPFS
-- -----------------------------------------------------------------------

mmcrnsd -F DescFile [-v {yes |no}]

The mmcrnsd command is used to create cluster-wide names for NSDs used by GPFS.

This is the first GPFS step in preparing a disk for use by a GPFS file system. A disk descriptor file supplied 
to this command is rewritten with the new NSD names and that rewritten disk descriptor file can then be supplied 
as input to the mmcrfs command.

The name created by the mmcrnsd command is necessary since disks connected at multiple nodes may have differing 
disk device names in /dev on each node. The name uniquely identifies the disk. This command must be run 
for all disks that are to be used in GPFS file systems. The mmcrnsd command is also used to assign a 
primary and backup NSD server that can be used for I/O operations on behalf of nodes that do not have 
direct access to the disk.

To identify that the disk has been processed by the mmcrnsd command, a unique NSD volume ID is written on 
sector 2 of the disk. All of the NSD commands (mmcrnsd, mmlsnsd, and mmdelnsd) use this unique 
NSD volume ID to identify and process NSDs.

After the NSDs are created, the GPFS cluster data is updated and they are available for use by GPFS.

Examples:

To create your NSDs from the descriptor file nsdesc containing: 

 sdav1:k145n05:k145n06:dataOnly:4
 sdav2:k145n04::dataAndMetadata:5:ABC

enter:

# mmcrnsd -F nsdesc 


-- GPFS and inittab
-- ----------------

Usually, the following enry should be in place in /etc/inittab

mmfs:2:once:/usr/lpp/mmfs/bin/mmautoload >/dev/console 2>&1


75.3 Installing GPFS:
=====================

Installing GPFS V. 2.3 or v. 3.1


Installing GPFS on AIX 5L nodes
It is suggested you read Planning for GPFS and the GPFS FAQs at 
publib.boulder.ibm.com/infocenter/clresctr/topic/com.ibm.cluster.gpfs.doc/gpfs_faqs/gpfsclustersfaq.html.

Do not attempt to install GPFS if you do not have the prerequisites listed in Hardware requirements 
and Software requirements.

Ensure that the PATH environment variable on each node includes /usr/lpp/mmfs/bin.

The installation process includes:

-Files to ease the installation process 
-Verifying the level of prerequisite software 
-Installation procedures

>> Files to ease the installation process

Creation of a file that contains all of the nodes in your GPFS cluster prior to the installation of GPFS, 
will be useful during the installation process. Using either host names or IP addresses when constructing 
the file will allow you to use this information when creating your cluster through the mmcrcluster command.

For example, create the file /tmp/gpfs.allnodes, listing the nodes one per line: 

k145n01.dpd.ibm.com 
k145n02.dpd.ibm.com 
k145n03.dpd.ibm.com 
k145n04.dpd.ibm.com 
k145n05.dpd.ibm.com 
k145n06.dpd.ibm.com 
k145n07.dpd.ibm.com 
k145n08.dpd.ibm.com 


>> Verifying the level of prerequisite software

It is necessary to verify you have the correct levels of the prerequisite software installed. If the correct level 
of prerequisite software is not installed, see the appropriate installation manual before proceeding with your 
GPFS installation: 

1. AIX 5L Version 5 Release 2 with the latest level of service available 

   # WCOLL=/tmp/gpfs.allnodes dsh "oslevel"

   Output similar to this should be displayed: 
   5.2.0.10

2. AIX 5L Version 5 Release 3 with the latest level of service available 

   # WCOLL=/tmp/gpfs.allnodes dsh "oslevel"

   Output similar to this should be displayed: 
   5.3.0.0
   If you are utilizing NFS V4, at a minimum your output should include: 
   5.3.0.10


>>Installation procedures

The installation procedures are generalized for all levels of GPFS. Ensure you substitute the correct 
numeric value for the modification (m) and fix (f) levels, where applicable. The modification and fix 
level are dependent upon the level of PTF support.

Follow these steps to install the GPFS software using the installp command:

1. Electronic license agreement 
2. Creating the GPFS directory 
3. Creating the GPFS installation table of contents file 
4. Installing the GPFS man pages 
5. Installing GPFS on your network 
6. Existing GPFS files 
7. Verifying the GPFS installation


--1. Electronic license agreement

The GPFS software license agreements is shipped and viewable electronically. The electronic license agreement 
must be accepted before software installation can continue.

For additional software package installations, the installation cannot occur unless the appropriate 
license agreements are accepted. When using the installp command, use the -Y flag to accept licenses 
and the -E flag to view license agreement files on the media.

--2. Creating the GPFS directory

To create the GPFS directory:

On any node create a temporary subdirectory where GPFS installation images will be extracted. For example: 

# mkdir  /tmp/gpfslpp

Copy the installation images from the CD-ROM to the new directory, by issuing: 

# bffcreate -qvX -t /tmp/gpfslpp -d /dev/cd0 all

This will place the following GPFS images in the image directory :

gpfs.base 
gpfs.docs 
gpfs.msg.en_US


--3. Creating the GPFS installation table of contents file

Make the new image directory the current directory: 

# cd /tmp/gpfslpp

Use the inutoc command to create a .toc file. The .toc file is used by the installp command. 

# inutoc .

--4. Installing the GPFS man pages

In order to use the GPFS man pages you must install the gpfs.docs image. The GPFS manual pages will be 
located at /usr/share/man/.

Installation consideration:
The gpfs.docs image need not be installed on all nodes if man pages are not desired or local file system space 
on the node is minimal.

--5. Installing GPFS on your network

Install GPFS according to these directions, where localNode is the name of the node on which you are running:

If you are installing on a shared file system network, ensure the directory where the GPFS images can be found 
is NFS exported to all of the nodes planned for your GPFS cluster (/tmp/gpfs.allnodes). 

Ensure an acceptable directory or mountpoint is available on each target node, such as /tmp/gpfslpp. 
If there is not, create one: 

# WCOLL=/tmp/gpfs.allnodes dsh "mkdir /tmp/gpfslpp"

If you are installing on a shared file system network, to place the GPFS images on each node in your network, 
issue: 

# WCOLL=/tmp/gpfs.allnodes dsh "mount localNode:/tmp/gpfslpp /tmp/gpfslpp"

Otherwise, issue: 

# WCOLL=/tmp/gpfs.allnodes dsh "rcp localNode:/tmp/gpfslpp/gpfs* /tmp/gpfslpp"
# WCOLL=/tmp/gpfs.allnodes dsh "rcp localNode:/tmp/gpfslpp/.toc /tmp/gpfslpp"

Install GPFS on each node: 

# WCOLL=/tmp/gpfs.allnodes dsh "installp -agXYd /tmp/gpfslpp gpfs" 

--6. Existing GPFS files

If you have previously installed GPFS on your system, during the install process you may see 
messages similar to:

Some configuration files could not be automatically merged into the
system during the installation.  The previous versions of these files
have been saved in a configuration directory as listed below.  Compare
the saved files and the newly installed files to determine if you need
to recover configuration data.  Consult product documentation to
determine how to merge the data.

Configuration files which were saved in /lpp/save.config:
  /var/mmfs/etc/gpfsready
  /var/mmfs/etc/gpfsrecover.src
  /var/mmfs/etc/mmfsdown.scr
  /var/mmfs/etc/mmfsup.scr

If you have made changes to any of these files, you will have to reconcile the differences with the 
new versions of the files in directory /var/mmfs/etc. This does not apply to file /var/mmfs/etc/mmfs.cfg 
which is automatically maintained by GPFS.

--7. Verifying the GPFS installation

Use the lslpp command to verify the installation of GPFS file sets on each node:

lslpp -l gpfs\* 

Output similar to the following should be returned:

  Fileset                      Level  State      Description         
  ----------------------------------------------------------------------------
Path: /usr/lib/objrepos
gpfs.base              2.3.0.0  COMMITTED  GPFS File Manager
gpfs.docs.data         2.3.0.0  COMMITTED  GPFS Server Manpages
gpfs.msg.en_US         2.3.0.0  COMMITTED  GPFS Server Messages - U.S. English
Path: /etc/objrepos
gpfs.base              2.3.0.0  COMMITTED  GPFS File Manager


Example:

root@zd110l14:/root#lslpp -L "*gpfs*"
  Fileset                      Level  State  Type  Description (Uninstaller)
  ----------------------------------------------------------------------------
  gpfs.base                 3.1.0.11    C     F    GPFS File Manager
  gpfs.docs.data             3.1.0.4    C     F    GPFS Server Manpages and
                                                   Documentation
  gpfs.msg.en_US            3.1.0.10    C     F    GPFS Server Messages - U.S.
                                                   English


State codes:
 A -- Applied.
 B -- Broken.
 C -- Committed.
 E -- EFIX Locked.
 O -- Obsolete.  (partially migrated to newer version)
 ? -- Inconsistent State...Run lppchk -v.

Type codes:
 F -- Installp Fileset
 P -- Product
 C -- Component
 T -- Feature
 R -- RPM Package
 E -- Interim Fix


root@zd110l14:/root#lslpp -l gpfs\*
  Fileset                      Level  State      Description
  ----------------------------------------------------------------------------
Path: /usr/lib/objrepos
  gpfs.base                 3.1.0.11  COMMITTED  GPFS File Manager
  gpfs.msg.en_US            3.1.0.10  COMMITTED  GPFS Server Messages - U.S.
                                                 English

Path: /etc/objrepos
  gpfs.base                 3.1.0.11  COMMITTED  GPFS File Manager

Path: /usr/share/lib/objrepos
  gpfs.docs.data             3.1.0.4  COMMITTED  GPFS Server Manpages and
                                                 Documentation


75.4 GPFS error messages:
=========================


Note 1:
-------

The MMFS log
GPFS writes both operational messages and error data to the MMFS log file. The MMFS log can be found 
in the /var/adm/ras directory on each node. The MMFS log file is named mmfs.log.date.nodeName, where date 
is the time stamp when the instance of GPFS started on the node and nodeName is the name of the node. 
The latest mmfs log file can be found by using the symbolic file name /var/adm/ras/mmfs.log.latest. 
The MMFS log from the previous instance of GPFS can be found by using the symbolic file name 
/var/adm/ras/mmfs.log.previous. All other files have a timestamp and node name appended to the file name.

Example:

root@zd110l13:/var/adm/ras#cat mmfs.log.latest
Sun May 20 22:10:37 DFT 2007 runmmfs starting
Removing old /var/adm/ras/mmfs.log.* files:
Loading kernel extension from /usr/lpp/mmfs/bin . . .
GPFS: 6027-500 /usr/lpp/mmfs/bin/aix64/mmfs64 loaded and configured.
Sun May 20 22:10:39 2007: GPFS: 6027-310 mmfsd64 initializing. {Version: 3.1.0.11   Built: Apr  6 2007 09:38:56} ...
Sun May 20 22:10:44 2007: GPFS: 6027-1710 Connecting to 10.32.143.184 zd110l14.nl.eu.abnamro.com
Sun May 20 22:10:44 2007: GPFS: 6027-1711 Connected to 10.32.143.184 zd110l14.nl.eu.abnamro.com
Sun May 20 22:10:44 2007: GPFS: 6027-300 mmfsd ready
Sun May 20 22:10:44 DFT 2007: mmcommon mmfsup invoked
Sun May 20 22:10:44 DFT 2007: mounting /dev/gpfsfs0
Sun May 20 22:10:44 2007: Command: mount gpfsfs0 323816
Sun May 20 22:10:46 2007: Command: err 0: mount gpfsfs0 323816
Sun May 20 22:10:46 DFT 2007: finished mounting /dev/gpfsfs0


At GPFS startup, files that have not been accessed during the last ten days are deleted. 
If you want to save old files, copy them elsewhere.

This example shows normal operational messages that appear in the MMFS log file:

Tue Aug 31 16:02:43 edt 2004 runmmfs starting
Removing old /var/adm/ras/mmfs.log.* files:
mv: 0653-401 Cannot rename /var/adm/ras/mmfs.log.previous to /var/adm/ras/mmfs.log.previous.save:
             A file or directory in the path name does not exist.
Loading kernel extension from /usr/lpp/mmfs/bin . . .
/usr/lpp/mmfs/bin/vcmdummy64 loaded and configured
/usr/lpp/mmfs/bin/aix64/mmfs64 loaded and configured.
Tue Aug 31 16:02:44 2004: GPFS: 6027-310 mmfsd64 initializing. {Version: 3.7.0.0 
    Built: Aug 30 2004 17:10:20} ...
Tue Aug 31 16:02:54 2004: GPFS: 6027-1710 Connecting to 198.16.0.9 k154gn09
Tue Aug 31 16:02:55 2004: GPFS: 6027-1711 Connected to 198.16.0.9 k154gn09
Tue Aug 31 16:02:55 2004: GPFS: 6027-1709 Accepted and connected to 198.16.0.2 k154gn02
Tue Aug 31 16:02:55 2004: GPFS: 6027-1709 Accepted and connected to 198.16.0.18 k155gn02
Tue Aug 31 16:02:55 2004: GPFS: 6027-1709 Accepted and connected to 198.16.0.49 kolt1g_r1b32
Tue Aug 31 16:02:55 2004: GPFS: 6027-1709 Accepted and connected to 198.16.0.17 k155gn01
Tue Aug 31 16:02:55 2004: GPFS: 6027-1710 Connecting to 198.16.0.10 k154gn10
Tue Aug 31 16:02:55 2004: GPFS: 6027-1709 Accepted and connected to 198.16.0.35
Tue Aug 31 16:02:55 2004: GPFS: 6027-1709 Accepted and connected to 198.16.0.5
Tue Aug 31 16:02:57 2004: GPFS: 6027-1709 Accepted and connected to 198.16.0.23
Tue Aug 31 16:02:57 2004: GPFS: 6027-1709 Accepted and connected to 198.16.0.6
Tue Aug 31 16:02:57 2004: GPFS: 6027-1709 Accepted and connected to 198.16.0.21
Tue Aug 31 16:03:00 edt 2004 /var/mmfs/etc/gpfsready invoked
Tue Aug 31 16:03:00 2004: GPFS: 6027-300 mmfsd ready
Tue Aug 31 16:03:00 2004: GPFS: 6027-1709 Accepted and connected to 198.16.0.10 k154gn10
Tue Aug 31 16:03:00 edt 2004: mounting /dev/fs3
Tue Aug 31 16:03:00 2004: Command: mount fs3 594128 

Depending on the size and complexity of your system configuration, the amount of time to start GPFS varies. 
Taking your system configuration into consideration, after a reasonable amount of time if you cannot access 
the file system look in the log file for error messages.

The GPFS log is a repository of error conditions that have been detected on each node, as well as 
operational events such as file system mounts. The GPFS log is the first place to look when attempting 
to debug abnormal events. Since GPFS is a cluster file system, events that occur on one node may affect 
system behavior on other nodes, and all GPFS logs may have relevant data.


Note 2:
-------

GPFS for AIX 5L V2.2 in an HACMP Cluster
Problem Determination Guide

The operating system error log facility
GPFS records file system or disk failures using the error logging facility provided by the 
operating system: syslog facility on Linux and errpt facility on AIX. For the remainder of this book, 
the error logging facility will be referred to as 'the error log'.

These failures can be viewed by issuing this command: 

errpt -a
The error log contains information about several classes of events or errors. These classes are:

MMFS_ABNORMAL_SHUTDOWN 
MMFS_DISKFAIL 
MMFS_ENVIRON 
MMFS_FSSTRUCT 
MMFS_GENERIC 
MMFS_LONGDISKIO 
MMFS_PHOENIX 
MMFS_QUOTA 
MMFS_SYSTEM_UNMOUNT 
MMFS_SYSTEM_WARNING
MMFS_ABNORMAL_SHUTDOWN

The MMFS_ABNORMAL_SHUTDOWN error log entry means that GPFS has determined that it must shutdown all operations 
on this node because of a problem. This is most likely caused by some interaction with the Group Services component. 
Group services failures may result in abnormal shutdown, as well as possible loss of quorum. 
Insufficient memory on the node to handle critical recovery situations can also cause this error. 
In general there will be other error log entries from GPFS or some other component associated with this error log entry.

MMFS_DISKFAIL
The MMFS_DISKFAIL error log entry indicates that GPFS has detected the failure of a disk and forced the disk 
to the stopped state. Unable to access disks describes the actions taken in response to this error. 
This is ordinarily not a GPFS error but a failure in the disk subsystem or the path to the disk subsystem. 
the book AIX 5L System Management Guide: Operating System and Devices and search on logical volume. 
Follow the problem determination and repair actions specified.

MMFS_ENVIRON
MMFS_ENVIRON error log entry records are associated with other records of the MMFS_GENERIC or MMFS_SYSTEM_UNMOUNT types. 
They indicate that the root cause of the error is external to GPFS and usually in the network that supports GPFS. 
Check the network and its physical connections. The data portion of this record supplies the return code provided 
by the communications code.

MMFS_FSSTRUCT
The MMFS_FSSTRUCT error log entry indicates that GPFS has detected a problem with the on-disk structure of 
the file system. The severity of these errors depends on the exact nature of the inconsistent data structure. 
If it is limited to a single file, EIO errors will be reported to the application and operation will continue. 
If the inconsistency affects vital metadata structures, operation will cease on this file system. 
These errors are often associated with an MMFS_SYSTEM_UNMOUNT error log entry and will probably occur on all nodes. 
If the error occurs on all nodes, some critical piece of the file system is inconsistent. This may occur as a 
result of a GPFS error or an error in the disk system. Issuing the mmfsck command may repair the error:

Issue the mmfsck -n command to collect data. 
Issue the mmfsck -y command off-line to repair the file system.
If the file system is not repaired after issuing the mmfsck command, contact the IBM Support Center.

MMFS_GENERIC
The MMFS_GENERIC error log entry means that GPFS self diagnostics have detected an internal error, or that 
additional information is being provided with an MMFS_SYSTEM_UNMOUNT report. If the record is associated with an 
MMFS_SYSTEM_UNMOUNT report, the event code fields in the records will be the same. The error code and return code 
fields may describe the error. See Messages for a listing of codes generated by GPFS.

If the error is generated by the self diagnostic routines, service personnel should interpret the return and error 
code fields since the use of these fields varies by the specific error. Errors caused by the self checking logic 
will result in the shutdown of GPFS on this node.

MMFS_GENERIC errors may result from an inability to reach a critical disk resource. These errors may look different 
depending on the specific disk resource that has become unavailable, like logs and allocation maps. 
This type of error will usually be associated with other error indications. Other errors generated by disk subsystems, 
high availability components, and communications components at the same time as, or immediately preceding, 
the GPFS error should be pursued first because they may be the cause of these errors. MMFS_GENERIC error indications 
without an associated error of those types represent a GPFS problem that requires the IBM Support Center. 
See Information to collect before contacting the IBM Support Center.

MMFS_LONGDISKIO
The MMFS_LONGDISKIO error log entry indicates that GPFS is experiencing very long response time for disk requests. 
This is a warning message and may indicate that your disk system is overloaded or that a failing disk is requiring 
many I/O retries. Follow your operating system's instructions for monitoring the performance of your I/O subsystem 
on this node. The data portion of this error record specifies the disk involved. 
There may be related error log entries from the disk subsystems that will pinpoint the actual cause of the problem. 
See the book AIX 5L Performance Management Guide.

MMFS_PHOENIX
MMFS_PHOENIX error log entries reflect a failure in GPFS interaction with Group Services. Go to the book 
Reliable Scalable Cluster Technology: Administration Guide. Search for diagnosing group services problems. 
Follow the problem determination and repair action specified. These errors are usually not GPFS problems, 
although they will disrupt GPFS operation.

MMFS_QUOTA
The MMFS_QUOTA error log entry is used when GPFS detects a problem in the handling of quota information. 
This entry is created when the quota manager has a problem reading or writing the quota file. If the quota manager 
cannot read all entries in the quota file when mounting a file system with quotas enabled, the quota manager 
shuts down, but file system manager initialization continues. Client mounts will not succeed and will return 
an appropriate error message.

In order for GPFS quota accounting to work properly, the system administrator should ensure that the user and group 
information is consistent throughout the nodeset, such as the /etc/passwd and /etc/group files are identical across 
the nodeset. Otherwise, unpredictable and erroneous quota accounting will occur.

It may be necessary to run an off-line quota check (mmcheckquota) to repair or recreate the quota file. 
If the quota file is corrupted, mmcheckquota will not restore it. The file must be restored from the backup copy. 
If there is no backup copy, an empty file may be set as the new quota file. This is equivalent to recreating 
the quota file. To set an empty file or use the backup file, issue the mmcheckquota command with the 
appropriate operand:

-u UserQuotaFilename for the user quota file 
-g GroupQuotaFilename for the group quota file
Reissue the mmcheckquota command to check the file system inode and space usage.

MMFS_SYSTEM_UNMOUNT
The MMFS_SYSTEM_UNMOUNT error log entry means that GPFS has discovered a condition which may result in 
data corruption if operation with this file system continues from this node. GPFS has marked the file system 
as disconnected and applications accessing files within the file system will receive ESTALE errors. 
This may be the result of:

The loss of a path to all disks containing a critical data structure. 
An internal processing error within the file system.
See File system forced unmount. Follow the problem determination and repair actions specified.

MMFS_SYSTEM_WARNING
The MMFS_SYSTEM_WARNING error log entry means that GPFS has detected a system level value approaching its 
maximum limit. This may occur as a result of the number of inodes (files) reaching its limit. Issue the mmchfs 
command to increase the number of inodes for the file system so there is at least a minimum of 5% free.

Error log entry example
This is an example of an error log entry which indicates loss of the Group Services subsystem:

LABEL:          MMFS_ABNORMAL_SHUTD
IDENTIFIER:     1FB9260D

Date/Time:       Thu May 16 14:39:07 EDT 
Sequence Number: 759
Machine Id:      000196364C00
Node Id:         k145n01
Class:           S
Type:            PERM
Resource Name:   mmfs            

Description
SOFTWARE PROGRAM ABNORMALLY TERMINATED

Probable Causes
SOFTWARE PROGRAM

Failure Causes
SOFTWARE PROGRAM

        Recommended Actions
        CONTACT APPROPRIATE SERVICE REPRESENTATIVE

Detail Data
COMPONENT ID
5765B9500 
PROGRAM
mmfsd64 
DETECTING MODULE
/fs/mmfs/ts/phoenix/PhoenixInt.C
MAINTENANCE LEVEL
2.2.0.0 
LINE
        4409
RETURN CODE
         668
REASON CODE
0000 0000 
EVENT CODE
           0


Note 3:
-------

IY35279: MMFSD64 CORE DUMPS IN CLEANOLDSHAREDMEMORY__FV() 

A fix is available 
Download fix packs
 

APAR status
Closed as program error.

Error description: 
When starting gpfs, mmfsd64 on the 64-bit kernel may segfault
with a stack trace similar to:

cxiMapShSeg__Fv() at 0x1003579d4
CleanOldSharedMemory__Fv() at 0x1000025dc
mainBody__FiPPc(??, ??) at 0x100334c20
main(??, ??) at 0x10000257c

Local fix 

Problem summary 
When starting gpfs, mmfsd64 on the 64-bit kernel may segfault
with a stack trace similar to:

cxiMapShSeg__Fv() at 0x1003579d4
CleanOldSharedMemory__Fv() at 0x1000025dc
mainBody__FiPPc(??, ??) at 0x100334c20
main(??, ??) at 0x10000257c
SYMPTOM STRING

Problem conclusion 
Make sure to update the current cpu's ppda rather than another
cpu's ppda
Temporary fix 
Comments 
APAR information 
APAR number IY35279 
Reported component name AIX 5L POWER 
Reported component ID 5765E6100 
Reported release 510 
Status CLOSED PER 
PE NoPE 
HIPER NoHIPER 
Submitted date 2002-10-02 
Closed date 2002-10-02 
Last modified date 2002-11-07 
 

Note 4:
-------

IY56448: WHEN CLLSIF OUTPUT IS NOT CORRECT, MMCOMMON DOES NOT HANDLE 

A fix is available 
Obtain fix for this APAR


APAR status
Closed as program error.

Error description 
from GPFS log:
sort: 0653-655 Cannot open /var/mmfs/tmp/cllsifOutput.mmcommon.2
82794

mmcommon: 6027-1271 Unexpected error from getNodeGODMdata: sort
/var/mmfs/tmp/cllsifOutput.mmcommon.282794. Return code: 2

Could not run command /usr/lpp/mmfs/bin/mmcommon getNodeDataForD
aemon hacmp 2>/var/mmfs/tmp/mmcommon..6Qeya
GPFS: 6027-311 mmfsd64 is shutting down.
Reason for shutdown: Could not initialize cluster config

Local fix 
correct cluster infomation so that cllsif is correct.

Problem summary 
WHEN CLLSIF OUTPUT IS NOT CORRECT, MMCOMMON DOES NOT HANDLE
Problem conclusion 
add checks for invalid data from HACMP, RPD, or SDR when
getNodeData is called
Temporary fix 
Comments 
APAR information 
APAR number IY56448 
Reported component name GPFS FOR AIX 
Reported component ID 5765F6400 
Reported release 220 
Status CLOSED PER 
PE NoPE 
HIPER NoHIPER 
Submitted date 2004-05-03 
Closed date 2004-05-03 
Last modified date 2004-06-24 
 

Note 5: Troubleshooting: Some possible GPFS problems
----------------------------------------------------

http://book.opensourceproject.org.cn/enterprise/cluster/ibmcluster/opensource/7819/ddu0070.html


8.5 Troubleshooting: Some possible GPFS problems
Troubleshooting of a GPFS file system can be complex due its distributed nature. In this section, 
we describe the most common problems you may find when running GPFS and possible solutions. 
For further information on trouble shooting, refer to IBM General Parallel File System for Linux: 
Problem Determination Guide, GA22-7842.

8.5.1 Authorization problems

ssh and scp (or rsh and rcp) are used by GPFS administration commands to perform operations on other nodes. 
In order for these commands to be run, the sshd daemon must be running and configured to accept the connections 
from the other root users on the other nodes.

The first thing to check is the connection authorization from one node to other nodes and for extraneous messages 
in the command output. You can find information on OpenSSH customization in Appendix B, "Common facilities" 
on page 275. Check that all nodes can connect to all others without any password prompt.

You can also check if your GPFS cluster has been configured correctly to use the specified remote shell 
and remote copy commands by issuing the mmlscluster command, as in Example 8-17. Verify the contents 
of the remote shell command and remote file copy command fields.

Example 8-17: mmlscluster command 
 

[root@storage001 root]# mmlscluster

GPFS cluster information
========================
  Cluster id:  gpfs1035415317
  Remote shell command:      /usr/bin/ssh
  Remote file copy command:  /usr/bin/scp
  Primary network:           myrinet
  Secondary network:         ether

GPFS cluster data repository servers:
-------------------------------------
  Primary server:    storage001-myri0.cluster.com
  Secondary server:  (none)

Nodes in nodeset 1:
-------------------
   1  storage001-myri0 10.2.1.141     storage001-myri0.cluster.com  10.0.3.141
   2  node001-myri0  10.2.1.1         node001-myri0.cluster.com    10.0.3.1
   3  node002-myri0  10.2.1.2         node002-myri0.cluster.com    10.0.3.2
   4  node003-myri0  10.2.1.3         node003-myri0.cluster.com    10.0.3.3
   5  node004-myri0  10.2.1.4         node004-myri0.cluster.com    10.0.3.4
[root@storage001 root]#

 
8.5.2 Connectivity problems
Another reason why SSH may fail is that connectivity to a node has been lost. Error messages from mmdsh may indicate 
such a condition. For example:

mmdsh: node001 rsh process had return code 1.

There are many things that could cause this problem: cable failures, network cardproblems, switch failures, 
and so on. You can start by checking if the affected node is powered on. If the node is up, check the node connectivity 
and verify the sshd daemon is running on the remote node. If not, restart the daemon by issuing:

# service sshd start

Sometimes you may see a mmdsh error message due to the lack of an mmfsd process on some of the nodes, 
as in Example 8-18. Make sure the mmfsd is running on all nodes, using lssrc -a, as in Example 8-19.

Example 8-18: mmcrfs command 
 

[root@storage001 root]# mmcrfs /gpfs gpfs0 -F DescFile -v yes -r 1 -R 2
GPFS: 6027-624 No disks
GPFS: 6027-441 Unable to open disk 'gpfs2nsd'.
No such device
GPFS: 6027-538 Error accessing disks.
mmdsh: node001 rsh process had return code 19.
mmcommon: Unexpected error from runRemoteCommand_Cluster: mmdsh. Return code: 1
mmcrfs: tscrfs failed. Cannot create gpfs0
[root@storage001 root]#

 
Example 8-19: Verifying mmfsd is running 
 

# lssrc -a
Subsystem        Group        PID     Status
 cthats          cthats       843     active
 cthags          cthags       943     active
 ctrmc           rsct         1011    active
 ctcas           rsct         1018    active
 IBM.HostRM      rsct_rm      1069    active
 IBM.FSRM        rsct_rm      1077    active
 IBM.CSMAgentRM  rsct_rm      1109    active
 IBM.ERRM        rsct_rm      1110    active
 IBM.AuditRM     rsct_rm      1148    active
 mmfs            aixmm        1452    active
 IBM.SensorRM    rsct_rm              inoperative
 IBM.ConfigRM    rsct_rm              inoperative

 
8.5.3 NSD disk problems
In this section, we describe the two most common problems related to NSD and disks. These are not the only problems
 you might face, but they are the most common.

The disk has disappeared from the system
Sometimes you may face a disk failure and the disk appears to have disappeared from the system. 
This can happen if somebody simply removes an in-use hot-swap disk from the server or in the case of 
a particularly nasty disk failure.

In this situation, GPFS loses connectivity to the disk and, depending on how the file system was created, 
you may or may not lose access to the file system.

You can verify whether the disk is reachable by the operating system using mmlsnsd -m, as shown in Example 8-20. 
In this situation, the GPFS disk gpfs1nsd is unreachable. This could mean that the disk has been turned off, 
has been removed from its bay, or has failed for some other reason.

Example 8-20: mmlsnsd command 
 

[root@storage001 root]# mmlsnsd -m

 NSD name     PVID               Device       Node name    Remarks
-----------------------------------------------------------------------
 gpfs1nsd     0A0000013BF15AFD   -            node-a       (error) primary node
 gpfs2nsd     0A0000023BF15B0A   /dev/sdb1    node-b       primary node
 gpfs3nsd     0A0000033BF15B26   /dev/sdb1    node-c       primary node
 gpfs4nsd     0A0000013BF2F4EA   /dev/sda9    node-a       primary node
 gpfs5nsd     0A0000023BF2F4FF   /dev/sda3    node-b       primary node
 gpfs6nsd     0A0000033BF2F6E1   /dev/sda6    node-c       primary node
[root@storage001 root]#


To correct this problem, you must first verify whether the disk is correctly attached and that it is not dead. After that, you can verify whether the driver for the disk is operational, and reload the driver using the rmmod and insmod commands. If the disk had only been removed from its bay or turned off, reloading the driver will activate the disks again, and then you can enable them again following the steps in "The disk is down and will not come up" on page 241. If the disk had any kind of hardware problem that will require replacing the disk, refer to 8.1.3, "Replacing a failing disk in an existing GPFS file system" on page 230.

The disk is down and will not come up
Occasionally, disk problems will occur on a node and, even after the node has been rebooted, the disk connected to it does not come up again. In this situation, you will have to manually set the disk up again and then run some recovery commands in order to restore access to your file system.

For our example, we see that the gpfs0 file system has lost two of its three disks: gpfs1nsd and gpfs3nsd. In this situation, we have to recover the two disks, run a file system check, and then re-stripe the file system.

Because the file system check and re-stripe require access to the file system, which is down, you must first re-activate the disks. Once the file system is up again, recovery may be undertaken. In Example 8-21, we verify which disks are down using the mmlsdisk command, re-activate the disks by using the mmchdisk command, and then verify the disks again with mmlsdisk.

Example 8-21: Reactivating disks 
 

[root@storage001 root]# mmlsdisk gpfs0
disk         driver   sector failure holds    holds
name         type       size   group metadata data  status        availability
------------ -------- ------ ------- -------- ----- ------------- ------------
gpfs1nsd     nsd         512       1 yes      yes   ready         down
gpfs2nsd     nsd         512       2 yes      yes   ready         up
gpfs3nsd     nsd         512       3 yes      yes   ready         down

[root@storage001 root]# mmchdisk gpfs0 start -d "gpfs1nsd;gpfs3nsd"
Scanning file system metadata, phase 1 ...
Scan completed successfully.
Scanning file system metadata, phase 2 ...
Scan completed successfully.
Scanning file system metadata, phase 3 ...
Scan completed successfully.
Scanning user file metadata ...
  77 % complete on Tue Nov 27 00:13:38 2001
 100 % complete on Tue Nov 27 00:13:39 2001
Scan completed successfully.

[root@storage001 root]# mmlsdisk gpfs0
disk         driver   sector failure holds    holds
name         type       size   group metadata data  status        availability
------------ -------- ------ ------- -------- ----- ------------- ------------
gpfs1nsd     nsd         512       1 yes      yes   ready         up
gpfs2nsd     nsd         512       2 yes      yes   ready         up
gpfs3nsd     nsd         512       3 yes      yes   ready         up
[root@storage001 root]#


Now that we have the three disks up, it is time to verify the file system consistency. Additionally, because some operations could have occurred on the file system when only one of the disks was down, we must re-balance it. We show the output of the mmfsck and mmrestripefs commands in Example 8-22. The mmfsck command has some important options you may need to use, like -r, for read-only access, and -y, to automatically correct problems found in the file system.

Example 8-22: mmfsck and mmrestripefs commands 
 

[root@storage001 root]# mmfsck gpfs0
Checking "gpfs0"
Checking inodes
Checking inode map file
Checking directories and files
Checking log files
Checking extended attributes file
Checking file reference counts
Checking file system replication status

       33792 inodes
          14   allocated
           0   repairable
           0   repaired
           0   damaged
           0   deallocated
           0   orphaned
           0   attached

      384036 subblocks
        4045   allocated
           0   unreferenced
           0   deletable
           0   deallocated

         231 addresses
           0   suspended

File system is clean.
# mmrestripefs gpfs0 -r
Scanning file system metadata, phase 1 ...
Scan completed successfully.
Scanning file system metadata, phase 2 ...
Scan completed successfully.
Scanning file system metadata, phase 3 ...
Scan completed successfully.
Scanning user file metadata ...
  72 % complete on Tue Nov 27 00:19:24 2001
 100 % complete on Tue Nov 27 00:19:25 2001
Scan completed successfully.

[root@storage001 root]# mmlsdisk gpfs0 -e
All disks up and ready

[root@storage001 root]# mmlsdisk gpfs0
disk         driver   sector failure holds    holds
name         type       size   group metadata data  status        availability
------------ -------- ------ ------- -------- ----- ------------- ------------
gpfs1nsd     nsd         512       1 yes      yes   ready         up
gpfs2nsd     nsd         512       2 yes      yes   ready         up
gpfs3nsd     nsd         512       3 yes      yes   ready         up
[root@storage001 root]#


==========
76. HACMP:
==========


76.1: Overview Cluster solutions and terminology on AIX:
========================================================


-- CSM: (Management of Cluster)
-- ----------------------------

What is Cluster Systems Management (CSM)?
Cluster Systems Management (CSM) software provides a distributed system management solution that allows 
a system administrator to set up and maintain a cluster of nodes that run the AIXr or Linuxr operating system. 
CSM simplifies cluster administration tasks by providing management from a single point-of-control. 
CSM can be used to manage homogeneous clusters of servers that run Linux, homogeneous servers that run AIX, 
or mixed clusters which include both AIX and Linux.

You can use the following hardware for your CSM management server, install server, and nodes:

IBM System x: System x, IBM xSeriesr, IBM BladeCenterr*, and IBM eServer 325, |326, and 326m hardware |
IBM System p: System p, IBM pSeries, IBM BladeCenter*, System p5, IBM eServer OpenPower
*The BladeCenter JS models use the POWER architecture common to all System p servers.

The management server is the machine that is designated to operate, monitor, and maintain the rest of the cluster. 
Install servers are the machines that are used to install the nodes. By default, the management server 
is the install server. Managed nodes are instances of the operating system that you can manage in the cluster. 
Managed devices are the non-node devices for which CSM supports power control and remote console access. 
For hardware and software support information, see Planning for CSM software.

Communicating with CSM:
CSM offers you several options for issuing commands to the cluster:

-Command line interface 
-Distributed Command Execution Manager (DCEM) 
-IBMr Web-based System Manager 
-SMIT


-- GPFS:
-- -----

Introducing General Parallel File System

GPFS is a high-performance cluster file system for AIX 5L, Linux and mixed clusters that provides users 
with shared access to files spanning multiple disk drives. By dividing individual files into blocks 
and reading/writing these blocks in parallel across multiple disks, GPFS provides very high bandwidth; 
in fact, GPFS has won awards and set world records for performance. In addition, GPFS's multiple data paths 
can also eliminate single points of failure, making GPFS extremely reliable. GPFS currently powers many of 
the world's largest scientific supercomputers and is increasingly used in commercial applications requiring 
high-speed access to large volumes of data such as digital media, engineering design, business intelligence, 
financial analysis and geographic information systems. GPFS is based on a shared disk model, providing lower 
overhead access to disks not directly attached to the application nodes, and using a distributed protocol 
to provide data coherence for access from any node. 

IBM's General Parallel File System (GPFS) provides file system services to parallel and serial applications. 
GPFS allows parallel applications simultaneous access to the same files, or different files, from any node 
which has the GPFS file system mounted while managing a high level of control over all file system operations. 
GPFS is particularly appropriate in an environment where the aggregate peak need for data bandwidth exceeds 
the capability of a distributed file system server.

GPFS allows users shared file access within a single GPFS cluster and across multiple GPFS clusters. 
A GPFS cluster consists of: 

AIX 5LT nodes, Linuxr nodes, or a combination thereof (see GPFS cluster configurations). A node may be: 
An individual operating system image on a single computer within a cluster. 
A system partition containing an operating system. Some System p5T and pSeriesr machines allow multiple 
system partitions, each of which is considered to be a node within the GPFS cluster.

Network shared disks (NSDs) created and maintained by the NSD component of GPFS 
All disks utilized by GPFS must first be given a globally accessible NSD name. 
The GPFS NSD component provides a method for cluster-wide disk naming and access. 

On Linux machines running GPFS, you may give an NSD name to: 
 Physical disks 
 Logical partitions of a disk 
 Representations of physical disks (such as LUNs)

On AIXr machines running GPFS, you may give an NSD name to: 
 Physical disks 
 Virtual shared disks 
 Representations of physical disks (such as LUNs)

A shared network for GPFS communications allowing a single network view of the configuration. 
A single network, a LAN or a switch, is used for GPFS communication, including the NSD communication.


-- PSSP: (predecessor to Cluster Systems Management (CSM))
-- -------------------------------------------------------

Parallel System Support Programs (PSSP)

The PSSP 3.5 software is a comprehensive suite of applications to manage a system as a full-function 
parallel processing system. It provides administrative tasks that help increase productivity by enabling 
administrators to view, monitor, and operate the system from the control workstation, a single point of control. 
The PSSP software is discussed in terms of functional entities called components of PSSP. Most functions 
are base components of PSSP while others are optional; they come with the PSSP software, but you can choose 
whether to install and use them.

With PSSP 3.5, AIX 5L 5.1 or 5.2 must be on the control workstation. Note that your control workstation 
must be at the highest AIX level in the system. If you have any HMC-controlled servers in your system, 
AIX 5L 5.1 or 5.2 must be on each HMC-controlled server node. Other nodes can have AIX 5L 5.1 and PSSP 3.4, 
or AIX 4.3.3 with PSSP 3.4 or PSSP 3.2. However, you can only run with the 64-bit AIX kernel and switch 
between 64-bit and 32-bit AIX kernel mode on nodes with PSSP 3.5.

Parallel System Support Programs (PSSP) for AIXr
PSSP is the systems management predecessor to Cluster Systems Management (CSM) and does not support 
IBM System p servers or AIX 5L V5.3. New cluster deployments should use CSM and existing PSSP customers 
with software maintenance will be transitioned to CSM at no charge. 


-- Tivoli Workload Scheduler LoadLeveler
-- -------------------------------------

Used for dynamic workload scheduling, Tivoli Workload Scheduler LoadLeveler is a distributed network-wide 
job management facility designed to dynamically schedule work such as maximize resource utilization 
and minimize job completion time. Jobs are scheduled based on job priority, job requirements, 
resource availability and user-defined rules to match processing needs with resources. LoadLeveler provides 
consolidated accounting and reporting and supports IBM servers including IBM System p and System x environments. 


-- Engineering Scientific Subroutine Library (ESSL) and Parallel ESSL 
-- ------------------------------------------------------------------

ESSL is a collection of state-of-the-art mathematical subroutines specifically tuned to IBM hardware 
and offering significant performance improvement to any math-intensive scientific or engineering applications. 
Parallel ESSL extends the function of ESSL to support parallel applications that use the Message Passing 
Interface included in IBM Parallel Environment. ESSL and Parallel ESSL support C, C++ and Fortran applications. 


-- Parallel Environment (PE)
-- -------------------------

Parallel Environment for AIX 5L is a comprehensive development and execution environment for parallel 
applications (distributed-memory, message-passing applications running across multiple nodes). 
It is designed to help organizations develop, test, debug, tune and run high-performance parallel 
applications in C, C++ and Fortran on IBM System p and System x clusters. Parallel Environment runs 
on AIX 5L V5.2 and V5.3.  

-- HACMP:
-- ------

HACMP is designed to provide high availability for critical business applications and data through 
system redundancy and failover. HACMP constantly monitors the status of servers, networks and applications 
to detect failures or performance degradation and can respond by automatically restarting a troubled 
application on designated backup hardware, taking care of all network or storage connections in the process. 
With HACMP, clients can scale up to 32 nodes and mix and match system sizes and performance levels as well 
as network adapters and disk subsystems to satisfy specific application, network and disk performance needs. 

HACMP/XD extends HACMP's high availability capabilities across geographic sites with remote data 
mirroring (replication) and failover using this mirrored data; this combination can maintain application 
and data availability even if an entire site is disabled by a disaster. HACMP/XD provides IP-based data 
mirroring and also supports hardware-based mirroring products such as 
IBM Enterprise Storage Systems Metro-Mirror (formerly PPRC). 

-- RSCT:
-- -----

Reliable Scalable Cluster Technology. Since HACMP 5.1, HACMP relies on RSCT. So, in modern HACMP, RSCT is
a neccessary component or subsystem. For example, HACMP uses the heartbeat facility of RSCT.
RSCT is a standard component in AIX5L.

Reliable Scalable Cluster Technology, or RSCT, is a set of software components that together provide a 
comprehensive clustering environment for AIXr and Linuxr. RSCT is the infrastructure used by a variety 
of IBMr products to provide clusters with improved system availability, scalability, and ease of use. 
RSCT includes the following components: 

- Resource Monitoring and Control (RMC) subsystem. This is the scalable, reliable backbone of RSCT. 
  It runs on a single machine or on each node (operating system image) of a cluster and provides a common 
  abstraction for the resources of the individual system or the cluster of nodes. You can use RMC for 
  single system monitoring or for monitoring nodes in a cluster. In a cluster, however, RMC provides global 
  access to subsystems and resources throughout the cluster, thus providing a single monitoring and management 
  infrastructure for clusters. 
- RSCT core resource managers. A resource manager is a software layer between a resource 
  (a hardware or software entity that provides services to some other component) and RMC. A resource manager 
  maps programmatic abstractions in RMC into the actual calls and commands of a resource. 
- RSCT cluster security services, which provide the security infrastructure that enables RSCT components 
  to authenticate the identity of other parties. 
- Topology Services subsystem, which, on some cluster configurations, provides node and network failure detection. 
  Group Services subsystem, which, on some cluster configurations, provides cross-node/process coordination.


RSCT is the "glue" that holds the nodes together in a cluster. It is a group of low-level components 
that allow clustering technologies, such as High-Availability Cluster Multiprocessing (HACMP) and 
General Parallel File System (GPFS), to be built easily. 

RSCT technology was originally developed by IBM for RS/6000 SP systems (Scalable POWERparallel). 
As time passed, it became apparent that these capabilities could be used on a growing number of general 
computing applications, so they were moved into components closer to the operating system (OS), such as 
Resource Monitoring and Control (RMC), Group Services, and Topology Services. 

The components were originally packaged as part of the RS/6000 SP Parallel System Support Program (PSSP) 
and called RSCT. RSCT is now packaged as part of AIX 5L Version 5.1 and later. 

RSCT is also included in Cluster Systems Management (CSM) for Linux. Now, Linux nodes (with appropriate 
hardware and software levels) running CSM 1.3 for Linux can be part of the management domain cluster 1600, 
and RSCT (with RMC) is the common interface for clustering. For more information about this heterogeneous 
cluster, see An Introduction to CSM 1.3 for AIX 5L, SG24-6859. 

RSCT includes these components: 

-Resource Monitoring and Control (RMC) 
-Resource managers (RM) 
-Cluster Security Services (CtSec) 
-Group Services 
-Topology Services

Group Services and Topology Services

Group Services and Topology Services, although included in RSCT, are not used in the management 
domain structure of CSM. These two components are used in peer domain clusters for applications, 
such as High-Availability Cluster Multiprocessing (HACMP) and General Parallel File System (GPFS), 
providing node and process coordination and node and network failure detection. Therefore, for these 
applications, a .rhosts file may be needed (for example, for HACMP configuration synchronization). 

These services are often referred to as hats and hags: 
high availability Group Services daemon (hagsd) 
and high availability Topology Services daemon (hatsd). 

- What are management domains and peer domains?
In order to understand how the various RSCT components are used in a cluster, you should be aware 
that nodes of a cluster can be configured for either manageability or high availability.

>> You configure a set of nodes for manageability using the Clusters Systems Management (CSM) product as 
described in IBMr Cluster Systems Management: Administration Guide. The set of nodes configured for manageability 
is called a management domain of your cluster.

>>You configure a set of nodes for high availability using RSCT's Configuration resource manager. 
The set of nodes configured for high availability is called an RSCT peer domain of your cluster. 
For more information, refer to Creating and administering an RSCT peer domain.


-- HPSS:	 
-- -----

High Performance Storage System
What is High Performance Storage System? HPSS is software that manages petabytes of data on disk and robotic tape 
libraries. HPSS provides highly flexible and scalable hierarchical storage management that keeps recently 
used data on disk and less recently used data on tape. HPSS uses cluster, LAN and/or SAN technology to aggregate 
the capacity and performance of many computers, disks, and tape drives into a single virtual file system 
of exceptional size and versatility. This approach enables HPSS to easily meet otherwise unachievable demands 
of total storage capacity, file sizes, data rates, and number of objects stored. HPSS provides a variety of user 
and filesystem interfaces ranging from the ubiquitous vfs, ftp, samba and nfs to higher performance pftp, 
client API, local file mover and third party SAN (SAN3P). HPSS also provides hierarchical storage management 
(HSM) services for IBM General Parallel File System (GPFS). 


-- C-SPOC:
-- -------

The Cluster Single Point of Control (C-SPOC) utility lets system administrators perform administrative tasks 
on all cluster nodes from any node in the cluster.


-- HA Network Server:
-- ------------------

The High Availability Network Server (HA Network Server) is a complete solution that quickly and automatically 
configures certain network services in a high availability environment. HA Network Server solution is designed 
to enhance the HACMP product by offering a set of scripts that set up highly available network services 
such as Domain Name System (DNS), Dynamic Host Configuration Protocol (DHCP), Network File System (NFS), 
and printing services. This is possible by using the framework offered in HACMP to monitor and act upon 
potential problems with network services in order to extend high availability beyond just hardware recovery. 
Making these services highly available means there is no down time in services that are critical to running 
a business. This solution is now available by download.

HA Network Server components
The HA Network Server solution is comprised of three network service plug-ins providing for DNS, DHCP, 
and print services (HACMP already contains integrated support for high availability NFS (HANFS)). 
Each of these plug-ins is available on this Web site as a downloadable tar file. These example scripts start 
and stop the network service processes, verify that configuration files are present and stored in a 
shared filesystem, and assist the HACMP monitoring functions that check on the health of the network service process. 
These scripts are provided as examples that may be customized for your environment.

A setup program is also provided with each of these plug-ins to assist with the setup after downloading the plug-in. 
Since several prerequisites must be completed by the user before setup begins, please read the README file that is 
included within the plug-in tar file. After download and tar file expansion, the README will be located in 
/usr/es/sbin/cluster/plug-ins/<network_service>, where <network_service> will be dns, dhcp, or printserver 
depending on which plug-in was downloaded.


76.2 Overview architecture:
===========================

HACMP is an "High Availability" solution, and it's an IBM cluster technology, based on RSCT and additional daemons
and implementations, like, for example, the concept of a "Resource Group".

In an HACMP Cluster, most relevant hardware adapters in a system are doubled. For example, multiple
network adapters and multiple FC cards, are typical in a Cluster node, to avoid Single Points Of Failure (SPOFs).
 
Two main implementations are possible (we limit ourselves here to a 2-node Cluster):

- One node runs and owns an application (asssociated with a Resource Group), and in case of whatever
  failure, another node can take "ownership" of the Resource Group and starts running the application.
  Implementations is partly done with the aid of start- and stop scripts belonging to this application.

- But if you have a suitable application, it's also posible that both nodes runs the same application at the same time
  and thus parallel processing takes place.

So, many HACMP implementations, acts like an "active - passive" cluster, in which one node runs the app, and the
other node takes the role of "failover" node, Which is not to say that the failover node can't actively run other 
applications as well.
But do not forget, that when the right type of applications are used, real parallel processing
could be implemented.


         ------------------------------------------ public network
             | |                             | |
             | |                             | |
        ------------                    -------------
        |cluster   |                    |cluster    |
        |system    |Ethernet            |system     |
        |pSeries   |--------------------|pSeries    |
        |          |          heartbeat |           |
        |          |Or                  |           |
        |          |Serial Link         |           |
        |          |--------------------|           |
        |FC  FC    |                    |  FC    FC |
        ------------                    -------------
          |  |                             |    |
          |  |   ---------------------------    |
          |  |   |                              |
          |  ----|-------------------------     |
          |      |                        |     |
          --------  Resource Group:       -------- Resource Group:
          |hdisk1|  -Application_01       |hdisk1| -Application_02
          --------  -Volume Group(s)      -------- -Volume Group(s)
          --------  -File System(s)       -------- -File System(s)
          |hdisk2|                        |hdisk2|
          --------                        --------
          --------                        --------
          |hdisk3|                        |hdisk3|
          --------                        --------

A "Resource group" is a group of associated "resources", known under one name. 
It can consist of an Application, Volume Group(s), File System(s) and other resources.
You can define a Resource Group from smitty: smitty hacmp

Resource Groups can be available from a single node or, in the case of concurrent applications,
available simultaneously from multiple nodes.

The components in a Resource Group move together from one node to another node,
in the case of a node failure.

Fallover and Fallback:

- Fallover: Represents the movement of a resource group from one node to the backup node
  in response to a failure on that node.
- Fallback: Represents the movement of a resource group from the backup node to the previous
  node, when it becomes available.


Key tasks in setting up an HACMP Cluster are:
- define the right Resource Group(s) and failover (fallover and fallback) policy
- create the right start and stop scripts for the application(s)
- setup the right IP parameters, like IP addresses and takeover methodology, per node

To illustrate the above, it probably nice to take a look at this (very simple) thread from the Internet:

  thread:

  Q:

  Hi All, 
  We have 2 servers running HACMP 4.3.1 in 
  non-concurrent rotating mode with IP Take Over 
  Facility Enabled. We have only one resourse group 
  running on Server A. In case of Failure, Services 
  Transfer to Server B(backup Server with same 
  configuration). 

  Now I have question is it possible to create another 
  resource group B active on Server B when Resource 
  Group A is Active On Server A. i.e both resource group 
  keep active on Different Server and both servers act 
  as a backup for each other. 

  Any practical implementation? 

  A:

  The short answer is "yes". We have that scenario on our servers running 
  Peoplesoft. One system is "primary" for HR and one system is primary for 
  Financials. However, each system functions as a backup for the other 
  application in case of a failure. 

  Sorry - I'm not an HA expert as we had a contractor actually come in and do 
  the work for us - but it is possible, as you asked. 


76.3 Application Servers:
=========================

To put the application under HACMP control, you create an application server resource that associates 
a user-defined name with the names of specially written scripts to start and stop the application. 
By defining an application server, HACMP can start another instance of the application on the takeover node 
when a fallover occurs. This protects your application so that it does not become a single point of failure. 
An application server can also be monitored with the application monitoring feature and the Application 
Availability Analysis tool. 

After you define the application server, you can add it to a resource group. A resource group is a set of 
resources that you define so that the HACMP software can treat them as a single unit.

HACMP can monitor applications that are defined to application servers, in one of two ways: 

-Process monitoring detects the termination of a process, using RSCT Resource Monitoring and Control (RMC) capability. 
-Custom monitoring monitors the health of an application based on a monitor method that you define. 


76.4 Daemons:
=============

Cluster Services:
 
Notice that if you list the daemons in the AIX System Resource Controller (SRC), you will see ES appended 
to their names. The actual executables do not have the ES appended; the process table shows the executable 
by path (/usr/es/sbin/cluster...). 

The following lists the required and optional HACMP/ES daemons: 

-- Cluster Manager daemon (clstrmgr):

This daemon monitors the status of the nodes and their interfaces, and invokes the appropriate scripts 
in response to node or network events. It also centralizes the storage of and publishes updated information 
about HACMP-defined resource groups. The Cluster Manager on each node coordinates information gathered from 
the HACMP global ODM, and other Cluster Managers in the cluster to maintain updated information about the content, 
location, and status of all HACMP resource groups. This information is updated and synchronized among all nodes 
whenever an event occurs that affects resource group configuration, status, or location.
All cluster nodes must run the clstrmgr daemon.

-- Cluster SMUX Peer daemon (clsmuxpd):

This daemon maintains status information about cluster objects. This daemon works in conjunction with 
the Simple Network Management Protocol (snmpd) daemon. All cluster nodes must run the clsmuxpd daemon.
Note: The clsmuxpd daemon cannot be started unless the snmpd daemon is running.

-- Cluster Information Program daemon (clinfo):

This daemon provides status information about the cluster to cluster nodes and clients and invokes 
the /usr/es/sbin/cluster/etc/clinfo.rc script in response to a cluster event. The clinfo daemon is optional 
on cluster nodes and clients.

-- Cluster Lock Manager daemon (cllockd):
This daemon provides advisory locking services. The cllockd daemon is required on cluster nodes only if 
those nodes are part of a concurrent access configuration.

- Cluster Topology Services daemon (topsvcsd):
This daemon monitors the status of network adapters in the cluster. 
All cluster nodes must run the topsvcsd daemon.

-- Cluster Event Management daemon (emsvcsd):
This daemon matches information about the state of system resources with information about resource conditions 
of interest to client programs (applications, subsystems, and other programs).The emsvcsd daemon runs on each node 
of a domain.

-- Event Management AIX Operating System Resource Monitor (emaixos):
This daemon acts as a resource monitor for the event management subsystem and provides information about 
the operating system characteristics and utilization. The emaixos daemon is started automatically by Event Management

-- Cluster Group Services daemon (grpsvcsd):
This daemon manages all of the distributed protocols required for cluster operation. 
All cluster nodes must run the grpsvcsd daemon.

-- Cluster Globalized Server Daemon daemon (grpglsmd):
This daemon operates as a grpsvcs client; its function is to make switch adapter membership global across 
all cluster nodes. All cluster nodes must run the grpglsmd daemon. 

- Group Services Concurrent Logical Volume Manager (gsclvmd).
When extended concurrent Volume Groups are used, this process manages concurrent Volumes.

- high availability Group Services daemon (hagsd) 

- high availability Topology Services daemon (hatsd). 


The AIX System Resource Controller (SRC) controls the HACMP/ES daemons (except for cllockd, which is a 
kernel extension). It provides a consistent interface for starting, stopping, and monitoring processes 
by grouping sets of related programs into subsystems and groups. In addition, it provides facilities for 
logging of abnormal terminations of subsystems or groups and for tracing of one or more subsystems. 

 
The HACMP/ES daemons are collected into the following SRC subsystems and groups: 

Daemon 				Subsystem	Group 
/usr/es/sbin/cluster/clstrmgr	clstrmgrES	cluster 
/usr/es/sbin/cluster/clinfo	clinfoES	cluster 
/usr/es/sbin/cluster/clsmuxpd	clsmuxpdES	cluster 
/usr/es/sbin/cluster/cllockd	cllockdES	lock 
/usr/sbin/rsct/bin/emsvcs	emsvcs		emsvcs 
/usr/sbin/rsct/bin/topsvcs	topsvcs		topsvcs 
/usr/sbin/rsct/bin/hagsglsmd	grpglsm		grpsvcs 
/usr/sbin/rsct/bin/emaixos	emsvcs		emsvcs 
/usr/es/sbin/cluster/clcomd	clcomdES	clcomd

When using the SRC commands, you can control the clstrmgr, clinfo, and clsmuxpd daemons by specifying 
the SRC cluster group. 

The required and optional HACMP and RSCT daemons are:

- clcomdES	Cluster communication daemon
- clstrmgrES	Cluster manager
- clinfoES	Cluster information daemon
- rmcd		RSCT resource Monitoring and Control daemon 
- hatsd		RSCT Topology Services subsystem (includes hats_nim* which send and receives heartbeats)
- hagsd		RSCT group services subsystem
- grpglsmd	main function is to make switch adapter membership global accross all cluster nodes.

Starting with hacmp 5.3, the cluster manager process is always running. It can be in one of two states,
as displayed by the command

# lssrc -ls clstrmgrES

ST_INIT (start event has executed)
ST_NOTCONFIGURED (start event has not executed)

# lssrc -ls clstrmgrES

Current state: ST_STABLE
sccsid = "@(#)36   1.135.1.62   src/43haes/usr/sbin/cluster/hacmprd/main.C, hacmp.pe, 52haes_r540, 
                                                                            r540s001a 6/29/06 08:59:13"
i_local_nodeid 0, i_local_siteid -1, my_handle 1
ml_idx[1]=0     ml_idx[2]=1
There are 0 events on the Ibcast queue
There are 0 events on the RM Ibcast queue
CLversion: 9
local node vrmf is 5400
cluster fix level is "0"
The following timer(s) are currently active:
Current DNP values
DNP Values for NodeId - 1  NodeName - n5101l01
    PgSpFree = 1849678  PvPctBusy = 2  PctTotalTimeIdle = 99.147538
DNP Values for NodeId - 2  NodeName - zd101l01
    PgSpFree = 2095773  PvPctBusy = 0  PctTotalTimeIdle = 98.956015
root@n5101l01:/root#


76.5 Understanding Cluster Service Startup:
===========================================
 
You start cluster services on a node by executing the HACMP/ES /usr/es/sbin/cluster/etc/rc.cluster script. 
Or use the Start Cluster Services SMIT screen, described in this section. 

Using smitty:
-------------

To start the HACMP cluster (the HACMP Cluster Manager) on the cluster nodes, there are two methods.

1. The first method is the most convenient; however, it can only be used if rsh is enabled. It allows the 
Cluster Manager to be started on both nodes with a single command:

% smitty hacmp

Cluster System Management
-> HACMP Cluster Services
-> Start Cluster Services

 
2. Alternatively, it is possible to use a slightly different SMIT path to start the Cluster Manager 
on the local node. Of course, this requires logging into each node independently to activate both Cluster Managers. 

% smitty hacmp

Cluster Services

-> Start Cluster Services

Take the defaults and press <Enter>.


Using scripts:
--------------

The rc.cluster script initializes the environment required for HACMP/ES 
by setting environment variables and then calls the /usr/es/sbin/cluster/utilities/clstart script 
to start the HACMP/ES daemons. The clstart script is the HACMP/ES script that starts all the cluster services. 
The clstart script calls the SRC startsrc command to start the specified subsystem or group. 
The following figure illustrates the major commands and scripts called at cluster startup: 

rc.cluster -> clstart -> startsrc

The HACMP/ES daemons are started in the following order: 

-RSCT daemons (Group Services, Topology Services, then Event Management) 
-Cluster Manager 
-Cluster SMUX daemon 
-Cluster Information Program daemon (optional) 

Using the C-SPOC utility, you can start cluster services on any node (or on all nodes) in a cluster 
by executing the C-SPOC /usr/es/sbin/cluster/sbin/cl_rc.cluster command on a single cluster node. 
The C-SPOC cl_rc.cluster command calls the rc.cluster command to start cluster services on the nodes specified 
from the one node. The nodes are started in sequential order, not in parallel. The output of the command 
run on the remote node is returned to the originating node. Because the command is executed remotely, 
there can be a delay before the command output is returned. 

The following example shows the major commands and scripts executed on all cluster nodes when cluster 
services are started in clusters using the C-SPOC utility. 


        NODE A           NODE B  
        cl_rc.cluster
             |        \rsh
             |         \
           rc.cluster    rc.cluster 
             |             | 
             |             |
           clstart        clstart
             |             |
             |             |
           startsrc       startsrc


-- Automatically Restarting Cluster Services 
You can optionally have cluster services start whenever the system is rebooted. If you specify the -R flag 
to the rc.cluster command, or specify "restart or both" in the Start Cluster Services SMIT screen, 
the rc.cluster script adds the following line to the /etc/inittab file. 

hacmp:2:wait:/usr/es/sbin/cluster/etc/rc.cluster -boot> /dev/console 2>&1 
# Bring up Cluster 

At system boot, this entry causes AIX to execute the /usr/es/sbin/cluster/etc/rc.cluster script to start HACMP/ES. 

WARNING: Be aware that if the cluster services are set to restart automatically at boot time, you may face 
problems with node integration after a power failure and restoration, or you may want to test a node after 
doing maintenance work before having it rejoin the cluster. 

-- Starting Cluster Services with IP Address Takeover Enabled 
If IP address takeover is enabled, the /usr/es/sbin/cluster/etc/rc.cluster script calls the /etc/rc.net script 
to configure and start the TCP/IP interfaces and to set the required network options. 

-- Editing the rc.cluster File to Turn Deadman Switch Off 
In HACMP/ES, the Deadman Switch (DMS) is controlled by RSCT Topology Services. If, in a rare case, you want 
to turn the DMS off, you must edit the rc.cluster file as follows: 

There is a -D flag in clstart, located in /usr/es/sbin/cluster/utilities 
In the /usr/es/sbin/cluster/etc/rc.cluster file, find a call to "clstart" at about line #486. 
Edit this call to include the -D flag. 


76.6 Understanding Stopping Cluster Services:
=============================================
 
You stop cluster services on a node by executing the HACMP/ES /usr/es/sbin/cluster/utilities/clstop script. 
Use the HACMP for AIX Stop Cluster Services SMIT screen, described in the section Stopping Cluster Services 
to build and execute this command. The clstop script stops an HACMP/ES daemon or daemons. The clstop script 
starts all the cluster services or individual cluster services by calling the SRC command stopsrc. 

The following figure illustrates the major commands and scripts called at cluster shutdown: 

clstop -> stopsrc

Using the C-SPOC utility, you can stop cluster services on a single node or on all nodes in a cluster 
by executing the C-SPOC /usr/es/sbin/cluster/sbin/cl_clstop command on a single node. The C-SPOC cl_clstop 
command performs some cluster-wide verification and then calls the clstop command to stop cluster services 
on the specified nodes. The nodes are stopped in sequential order, not in parallel. The output of the command 
run on the remote node is returned to the originating node. Because the command is executed remotely, 
there can be a delay before the command output is returned. 

        NODE A           NODE B  
        cl_clstop
             |       \rsh
             |        \
           clstop       clstop
             |             | 
             |             |
           stopsrc      stopsrc


Starting and stopping using smitty:

To start cluster services, use

smit cl_admin -> Manage HACMP Services -> Start Cluster Services

To stop cluster services, use

smit cl_admin -> Manage HACMP Services -> Stop Cluster Services


76.7 Resource Groups:
=====================

If you consider the question of how the failover node takes control of a Resource Group, we can consider
the following options:

- Cascading resource groups:
  It defines a list of all the nodes that can control the Resource Group, and each node has a takeover
  priority. In case of a failure of the active node, the higest priority node aquires the Resource Group.
  If that node is unavailable, the next-highest node takes over, and so on.
  There are some differentiations if a lesser-higher node has taken over a RG, but a higher node
  becomes available. It's possible to define a Cascading method with fallback to the higher node,
  or to define it without fallback (CWOF).

- Rotating resource groups:
  
- Concurrent Access resource groups
- Custom Access resource groups


76.8: Cluster logfiles:
=======================

Cluster log files
HACMP for AIX scripts, daemons, and utilities write messages to the log files shown below.

HACMP log files Log file name Description 

/var/adm/cluster.log 	Contains time-stamped, formatted messages generated by HACMP for AIX scripts and daemons. 
			In this log file, there is one line written for the start of each event, and one line written 
			for the completion. 
/tmp/hacmp.out 		Contains time-stamped, formatted messages generated by the HACMP for AIX scripts. 
			In verbose mode, this log file contains a line-by-line record of each command executed 
			in the scripts, including the values of the arguments passed to the commands. By default, 
			the HACMP for AIX software writes verbose information to this log file; however, you can 
			change this default. Verbose mode is recommended. 
system error log 	Contains time-stamped, formatted messages from all AIX subsystems, including the HACMP 
			for AIX scripts and daemons. 

/usr/sbin/cluster/
history/cluster.mmdd 	Contains time-stamped, formatted messages generated by the HACMP for AIX scripts. 
			The system creates a new cluster history log file every day that has a cluster event 
			occurring. It identifies each day's file by the file name extension, where mm indicates 
			the month and dd indicates the day. 
/tmp/cm.log 		Contains time-stamped, formatted messages generated by HACMP for AIX clstrmgr activity. 
			Information in this file is used by IBM Support personnel when the clstrmgr is in debug mode. 
			Note that this file is overwritten every time cluster services are started; 
			so, you should be careful to make a copy of it before restarting cluster services on a 
			failed node. 
/tmp/cspoc.log 		Contains time-stamped, formatted messages generated by HACMP for AIX C-SPOC commands. 
			Because the C-SPOC utility lets you start or stop the cluster from a single cluster node, 
			the /tmp/cspoc.log is stored on the node that initiates a C-SPOC command. 
/tmp/dms_logs.out 	Stores log messages every time HACMP for AIX triggers the deadman switch. 
/tmp/emuhacmp.out 	Contains time-stamped, formatted messages generated by the HACMP for AIX Event Emulator. 
			The messages are collected from output files on each node of the cluster, and cataloged 
			together into the /tmp/emuhacmp.out log file. In verbose mode (recommended), this log file 
			contains a line-by-line record of every event emulated. Customized scripts within the event 
			are displayed, but commands within those scripts are not executed. 

/var/hacmp/clverify
/clverify.log		Contains messages when the cluster verification has run.


76.9 Oracle 10g, Oracle RAC 10g, and HACMP:
===========================================

Note 1:
-------

thread:

Q:

Hi Guys , I need some technical guidance regarding HACMP and Oracle Clusterware. I am designing an 
Oracle maximum Availability architecture for a client on 4 Nodes of IBM 570 PSeries servers on 
Oracle 10G RAC. The configuration includes IBM HACMP and Oracle Clusterware. No I need do know if I can 
fully rely on Oracle Clusterware as my Clusterware or I can configure both IBM HACMP and Oracle Clusterware 
for some services. Can these two clusterware coexist ?? 

A:

1) HACMP and Oracle Clusterware can co-exist
2) HACMP is optional
3) Oracle Clusterware is required for RAC whether or not you use HACMP.

A:

Yes they can co-exist. But my question is why complicate things. You cannot have a RAC cluster without 
the Oracle Clusterware. Meaning if you install HACMP you will have to install Oracle Clusterware also 
on top of this. Why complicate the stack... keep it simple.. we have been using Oracle clusteware on AIX 
without HACMP without any issues so far.


thread:

Q:

Any suggestions on how to provide a cold failover solution on two P5 
Series boxes with an Oracle database? With RAC being pricey, I don't 
think our business will be open to purchasing RAC licenses. Our UNIX 
Admin is adamant about using HACMP. Without RAC in place, how does HACMP 
interact with Oracle? From what I understand, since both nodes will be 
sharing the same disk storage, it should be as simple as starting the 
database on the second node with customized scripts in the event of a 
failure--is this true? HACMP apparantly does some sort of export from the 
primary node to the secondary node in the even of a failure, then runs 
customized scripts to start applications, etc....Seems too simplistic to 
me--am I missing something? 

I've also heard that if RAC is used for a cold failover solution, then the 
price is discounted. 

I'm struggiling with providing solutions to the business, knowing that new 
hardware and a network upgrade are going to incur a cost. 

Any thoughts, suggestions, etc would be much appreciated. 

A:


76.10 Other notes on HACMP:
===========================


Filesets and compatibility list HACMP versions - AIX versions:

Note 1:
-------

HACMP Version Compatibility Matrix 

http://www-03.ibm.com/support/techdocs/atsmastr.nsf/WebIndex/TD101347

Document Author:  
Shawn Bodily

Document ID: 
TD101347 

Doc. Organization: 
Advanced Technical Support 
 
Document Revised: 
03/06/2007 

Product(s) covered: 
HACMP 
 

Abstract: This document provides a HACMP Version Compatibility Matrix. 


HACMP 	Version Supported? 	AIX Level(s) MISC 
1.2 	NO 			3.2.5   
2.1 	NO 			3.2.5   
3.1.0 	NO 			3.2.5   
3.1.1 	NO 			3.2.5  
4.1.0 	NO 			4.1.X   
4.1.1 	NO 			4.1.X  
4.2 	NO 			4.1.4, 4.2.X  
4.2.1 	NO 			4.1.5, 4.2.X  
4.2.2 	NO 			4.1.5, 4.2.1, 4.3.X  
4.3 	NO 			4.3.2, 4.3.3  
4.3.1 	NO 			4.3.2, 4.3.3  
4.4 	NO 			4.3.3  
4.4.1 	NO 			4.3.3, 5.1  
4.5 	NO 			5.1, 5.2  
5.1 	NO-09/01/2006 		5.1, 5.2,5.3  
5.2 	Y-9/30/2007 		5.1, 5.2,5.3  
5.3 	Y-9/30/2008 		5.2(ML4), 5.3(ML2) AIX 5.2 RSCT 2.3.6 or higher AIX 5.3 RSCT 2.4.2 or higher  
5.4 	Yes 			5.2 (TL8), 5.3(TL4) AIX 5.2 RSCT 2.3.9 or higher AIX 5.3 RSCT 2.4.5. or higher 
 
 
Cross Reference Chart 

		AIX 4.3.3 AIX 5.1 AIX 5.1(64-bit) AIX 5.2 AIX 5.3 
HACMP 4.5 	No Yes No Yes No 
HACMP/ES 4.5 	No Yes Yes Yes No 
HACMP/ES 5.1 	No Yes Yes Yes Yes 
HACMP/ES 5.2 	No Yes Yes Yes Yes 
HACMP/ES 5.3 	No No No Yes Yes 
HACMP/ES 5.4 	No No No Yes Yes 
 

Note 2:
-------

HACMP 5.1 requires:
- AIX 5L v5.1 ML5 with RSCT v2.2.1.30 or higher
- AIX 5L v5.2 ML2 with RSCT v2.3.1.0 or higher
- c-spoc vpath support requires SDD 1.3.1.3 or higher

HACMP 5.2:
AIX 
Each cluster node must have one of the following installed: 
AIX 5L v5.1 plus the most recent maintenance level (minimum ML 5) 
AIX 5L v5.2 plus the most recent maintenance level (minimum ML 2) 

HACMP 5.3 is supported on AIX 5.2 and 5.3
- AIX 5.2 ML06 or later with RSCT 2.3.6 or later
- AIX 5.3 ML02 or later with RSCT 2.4.2 or later


Note 3: HACMP FAQ:
------------------


I have installed HACMP, now what? 
 
Why does HACMP require so many subnets for IP address takeover? 
 
Does HACMP have any limits? 
 
How can I avoid the nameserver as a single point-of-failure? 
 
What is a config_too_long event? 
 
Do all cluster nodes need to be at the same version of HACMP and AIX 5L operating system? 
 
Why do I need a non-IP heartbeat network? 
 
Can I put different types of processors, communications adapters, or disk subsystems in the same cluster? 
 
What kinds of applications are best suited for a high availability environment? 
 
Can I use Etherchannel with HACMP? 
 
Can I use an existing Enhanced Concurrent Mode volume group for disk heartbeat? Or do I need to define a new one? 
 
 
Question: I have installed HACMP, now what?

Answer: Before HACMP can manage and keep your application highly available, you need to tell HACMP about 
your cluster and the application. There are 4 steps:

Step 1) Define the nodes that will keep your application highly available

The local node (the one where you are configuring HACMP) is assumed to be one of the cluster nodes 
and you must give HACMP the name of the other nodes that make up the cluster. Just enter a hostname or IP address 
for each node. 

Step 2) Define the application you want to keep highly available 
There are 3 things you need to tell HACMP about the application: 
name-provide a name 
start script-specify a script for HACMP to use to start the application 
stop script-specify a script for HACMP to use to stop the application 

Step 3) Verify and synchronize the cluster 
HACMP will discover all the networks and disks connected to the nodes. A verification step will ensure 
that the cluster configuration will be able to keep the application highly available. When successful the 
configuration will be copied to the rest of the nodes in the cluster. 

Step 4) Manage the application 
When you start HACMP it will begin managing the application and keeping it highly available. You can also use 
the maintenance facilities provided by HACMP to move the application between nodes for maintenance purposes. 

To see just how easy it is to configure HACMP, look for Using the SMIT Assistant in Chapter 11 of the 
Installation Guide. View the online documentation for HACMP. HACMP for Linux does not include the advanced 
discovery and verification features available on AIX 5L. When configuring HACMP for Linux you must manually 
define the cluster, networks and network interfaces. Any changes to the configuration require HACMP for Linux 
to be restarted on all nodes. 


Question: Why does HACMP require so many subnets for IP address takeover?

Answer: HACMP (using RSCT) determines adapter state by sending heartbeats across a specific network interface
-as long as heartbeat messages can be sent through an interface, the interface is considered alive. 
Prior to AIX 5L V5, AIX did not allow more than one interface to own a subnet route but in AIX 5L V5.1 multiple 
interfaces can have a route to the same subnet. This is sometimes referred to as multipath routing or 
route striping and when this situation exists, AIX 5L will multiplex outgoing packets destined for a particular 
subnet across all interfaces with a route to that subnet. This interferes with RSCT's ability to reliably 
send heartbeats to a specific interface. Therefore the subnetting rules for boot, service and persistent labels 
are such that there will never be a duplicate subnet route created by the placement of these addresses.

HACMP V5 includes a new feature whereby you may be able to avoid some of the subnet requirements 
by configuring HACMP to use a different set of IP alias addresses for heartbeat. With this feature you provide 
a base or starting address and HACMP calculates a set of addresses in proper subnets-when cluster services 
are active, HACMP adds these addresses as IP alias addresses to the interfaces and then uses these alias 
addresses exclusively for heartbeat traffic. You can then assign your "regular" boot, service and persistent 
labels in any subnet, but be careful: although this feature avoids multipath routing for heartbeat, 
multipath routing may adversely affect your application. Heartbeat via IP Aliasing is discussed in Chapter 2 
of the Concepts and Facilities Guide and Chapter 3 of the Administration and Troubleshooting Guide. 
View the online documentation for HACMP.


Question: Does HACMP have any limits?

Answer: The functional limits for HACMP (e.g. number of nodes and networks) can be found in Chapter 1 
of the Planning and Installation Guide. View the online documentation for HACMP.


Question: How can I avoid the nameserver as a single point-of-failure?

Answer: 1) Make the nodes look at /etc/hosts first before the nameserver by creating a 
/etc/netsvc.conf file with the following entry:

hosts=local,bind 

where local tells it to look at /etc/hosts first and then the nameserver

2) Remove /etc/resolv.conf (or modify name to save it for later use) so it looks for name resolution 
in /etc/hosts first.

For information on updating the /etc/hosts file and nameserver configuration, Installation Guide. 
View the online documentation for HACMP. 


Question: What is a config_too_long event?

Answer: The config_too_long event is an informational event run by HACMP whenever a cluster event runs 
for longer that a preset time. This can occur when:

an AIX 5L command (e.g. fsck) is taking a long time to complete, or has hung 
there was an un-recoverable error encountered - in this case there will be an "EVENT FAILED" indication 
in hacmp.out 

If the config_too_long event is run, you should check the hacmp.out file to determine the cause and if manual 
intervention is required. For more information on recovery after an event failure, refer to Recover from HACMP 
Script Failure in Chapter 18 of the Administration and Troubleshooting Guide. 


Question: Do all cluster nodes need to be at the same version of HACMP and AIX 5L operating system?

Answer: No, though there are some restrictions when running mixed mode clusters.

Mixed levels of AIX 5L on cluster nodes do not cause problems for HACMP as long as the level of AIX 5L 
is adequate to support the level of HACMP being run on that node. All cluster operations are supported 
in such an environment. The HACMP install and update packaging will enforce the minimum level of AIX 5L 
required on each system.

Similarly for Linux on POWER, different levels of the operating system should not cause problems as long as 
the minimum supported version is installed. Mixing different platforms-AIX 5L, RedHat and SUSE-within the 
same cluster is not supported.

As a matter of practicality, it is recommended that all nodes be at the same levels of operating system 
and HACMP whenever possible. Keeping, the operating system, HACMP and the application at the same level 
on all nodes will make the administration of the cluster easier and less error prone, and will go a long way 
towards reducing the frustration of the administrators. The Planning Guide has advice for effectively managing 
different installation and migration scenarios.


Question: Why do I need a non-IP heart beat network?

Answer: The purpose of the non-IP heartbeat link is often misunderstood. The requirement comes from the following: 
HACMP heartbeats on IP networks are sent as UDP datagrams. This means that if a node or network is congested, 
the heartbeats can be discarded. If there were only IP networks, and if this congestion went on long enough, 
the node would be seen as having failed and HACMP would initiate a takeover. Since the node is still alive, 
HACMP takeover can cause both nodes to have the same IP address, and can cause the nodes to both try to own 
and access the shared disks. This situation is sometimes referred to as "split brain" or "partitioned cluster". 
Data corruption is all but inevitable in this circumstance.

HACMP therefore strongly recommends that there be at least one non-IP network connecting a node to at least one 
other node. For clusters with more than two nodes, the most reliable configuration includes two non-IP networks 
on each node. The distance limitations on non-IP links-particularly RS-232-has often made this requirement 
difficult to meet. For such clusters, HACMP disk heartbeating should be strongly considered. Disk heartbeating 
enables the easy creation of multiple non-IP networks without requiring additional hardware or software.


Question: Can I put different types of processors, communications adapters, or disk subsystems in the same cluster?

Answer: In general, yes, as long as the individual components are supported by HACMP. Note that there are some 
combinations which may not be reasonable or desirable. For example, putting two Ethernet adapters that run at 
different speeds on the same network will generally force all adapters on the network to run at the speed of 
the slower one. Likewise, having a low powered processor back up a high-powered processor may result in 
unacceptable performance should HACMP have to run the application on the lower powered one. (But see the 
questions on dynamic LPAR and CUoD for a way of dealing with this). As long as AIX 5L and the hardware support 
the interconnections, HACMP will support them as well.


Question: What kinds of applications are best suited for a high availability environment?

Answer: HACMP detects failures in the cluster then moves or restarts resources in order to keep the application 
highly available. For an application to work well in a high availability environment, the application itself 
must be capable of being managed (start, stop, restart) programmatically (no user intervention required) and must 
have no "hard coded" dependencies on specific resources. For example, if the application relies on the hostname 
of the server (and cannot dynamically accept a change in hostname), then it is practically impossible to 
restart the application on a backup server after a failure.

Question: Can I use Etherchannel with HACMP?

Answer: See Using Etherchannel with HACMP.


Question: Can I use an existing Enhanced Concurrent Mode volume group for disk heartbeat? 
Or do I need to define a new one?

Answer: To achieve the highest levels of availability under the widest range of failure scenarios, the best practice 
would be to configure one disk heartbeat connection per physical disk enclosure (or LUN).

The heartbeat operation itself involves reading and writing messages from a non-data area of the shared disk. 
Although the space used for heartbeat messages does not decrease the space available for the application 
(it is in the reserved area of the disk) there is some overhead when the disk seeks back and forth between 
the reserved area and the application data area.

If you configure the disk heartbeat path using the same disk and vg as is used by the application, the best practice 
is to select a disk which does not have frequently accessed or performance critical application data: 
although the disk heartbeat overhead is small (2-4 seeks/sec), it could potentially impact application performance or,
conversely, excess application access could cause the disk hb connection to appear to go up and down.

Ultimately the decision of which disk and volume group to use for heartbeat depends on what makes sense for 
your shared disk environment and management procedures. For example, using a separate vg just for heartbeat 
isolates the heartbeat from the application data, but adds another volume group that has to be maintained 
(during upgrades, changes, etc) and consumes another LUN.

If you decide on a separate vg for heartbeat, it does not need to be included in an HACMP resource group, 
however, the CSPOC utilities use a resource group node list as the set of nodes to perform operations: 
including the vg in a resource group with just the (sub)set of nodes connected to the disk will let you take 
advantage of the CSPOC functions. You can also define and use a disk which is not part of any volume group, 
though such a setup would have to be manually configured and maintained.

   
Note 5:
-------

thread:

Q:

Hi, 

I?m try to varyon a concurrent vg, but i receive this 
error: 

root@dgij:/ > varyonvg -nc hb_vg 
srcsrqt failed errno : SRC_NSVR 
Subsystem [gsclvmd] is not active 
tellclvmd: request failed rc = -9036 [SRC_NSVR] 
0516-1334 varyonvg: The command /usr/sbin/tellclvmd 
returned an error. 

I try to start the gsclvmd subsystem, but i receive 
this error in errpt: 

A:

You need to install the bos.clvm.rte fileset from the HACMP CD in order to make HACMP start the gsclvmd service 


Note 6: Monitoring an HACMP Cluster:
------------------------------------

HACMP provides the following tools for monitoring a cluster:
- HAView 
- Cluster Monitoring with Tivoli
- the "/usr/es/bin/cluster/clstat" command
- WebSMIT clstat 


# /usr/es/sbin/cluster/clstat -a -o

clstat - HACMP Cluster Status Monitor
-------------------------------------

Cluster: manet_monet (1089262563)
Fri Jul 9 14:53:04 2004
State: UP Nodes: 2
SubState: STABLE

Node: manet State: UP
Interface: manete_boot (0) Address: xxx.xxx.xxx.xxx
State: DOWN
Interface: manete_stby (0) Address: xxx.xxx.xxx.xxx
State: UP
Interface: manet_tty0_01 (1) Address: 0.0.0.0
State: UP
Interface: manete_rep_svc (0) Address: xxx.xxx.xxx.xxx
State: UP
Resource Group: cas1 State: On line

Node: monet State: UP
Interface: monete_boot (0) Address: xxx.xxx.xxx.xxx
State: UP
Interface: monete_stby (0) Address: xxx.xxx.xxx.xxx
State: UP
Interface: monet_tty0_01 (1) Address: 0.0.0.0
State: UP


Other example:

clstat - monitors the status of an IBM HACMP cluster 
Description

Monitors the status of an HACMP cluster.

To monitor the status of HACMP in a terminal (ASCII mode):

root@n5101l01:/root#clstat -a -o

                clstat - HACMP Cluster Status Monitor
                -------------------------------------

Cluster: scenter_pr     (1192098110)
Thu Oct 25 10:21:31 2007
                State: UP               Nodes: 2
                SubState: STABLE

        Node: n5101l01          State: UP
           Interface: n5101l01-boot (2)         Address: 10.17.4.11
                                                State:   UP
           Interface: n5101l01_hb01 (0)         Address: 0.0.0.0
                                                State:   UP
           Interface: n5101l01_hb02 (1)         Address: 0.0.0.0
                                                State:   UP
           Interface: sonriso (2)               Address: 10.17.3.100
                                                State:   UP
           Resource Group: scenter_pr                   State:  On line

        Node: zd101l01          State: UP
           Interface: zd101l01-boot (2)         Address: 10.17.4.10
                                                State:   UP
           Interface: zd101l01_hb01 (0)         Address: 0.0.0.0
                                                State:   UP
           Interface: zd101l01_hb02 (1)         Address: 0.0.0.0
                                                State:   UP


Note 6: Starting and stopping GPFS:
-----------------------------------

Starting and stopping GPFS
Before starting GPFS:

Ensure you have: 
Verified the installation of all prerequisite software. 
Verified there is no conflicting software installed. 
Properly configured and tuned your system for use by GPFS. This should be done prior to configuring GPFS. 
Completed all of the GPFS configuration considerations including running the mmconfig command.
For details see the General Parallel File System for AIX 5L in an HACMP Cluster: Concepts, Planning, 
and Installation Guide. 
If you are using the Data Management API (DMAPI) for GPFS to manage the data in your file system, you may 
customize the shell script gpfsready to synchronize the initialization of the GPFS daemon and the data management 
application. The script is invoked by the GPFS daemon as file systems are starting to be mounted, and can be used 
to verify the data management application is ready to handle mount events from the file system. For further 
information regarding the script, see the General Parallel File System: Data Management API Guide and search 
for initializing the Data Management application.

Start the daemons on all of the nodes in the nodeset by issuing the mmstartup command:

# mmstartup -C set1

Check the messages recorded in /var/adm/ras/mmfs.log.latest on one node for verification:

mmfsd initializing ...
GPFS: 6027-300 mmfsd ready

This indicates successful start-up of a quorum of nodes.

If GPFS does not start, see the General Parallel File System for AIX 5L in an HACMP Cluster: 
Problem Determination Guide and search for the GPFS daemon will not come up.

See the mmstartup Command for complete usage information.

If it becomes necessary to stop GPFS, you can do so from the command line by issuing the mmshutdown command:

# mmshutdown -C set1

The system displays information similar to:

Wed Aug 16 17:27:01 EDT 2000: 6027-1341 mmshutdown: Starting force unmount of GPFS file systems
k145n08:  forced unmount of /fs2
k145n08:  forced unmount of /fs1
k145n05:  forced unmount of /fs2
k145n05:  forced unmount of /fs1
Wed Aug 16 17:27:06 EDT 2000: 6027-1344 mmshutdown: Shutting down GPFS daemons
k145n08:  Shutting down!
k145n08:  0513-044 The mmfs Subsystem was requested to stop.
k145n05:  Shutting down!
k145n05:  0513-044 The mmfs Subsystem was requested to stop.
Wed Aug 16 17:27:10 EDT 2000: 6027-1345 mmshutdown: Finished

See the mmshutdown Command for complete usage information.


Note 7: Other remarks:
----------------------

7.1.
----

The main HACMP / RSCT services are:

- clcomdES	Cluster communication daemon
- clstrmgrES	Cluster manager
- clinfoES	Cluster information daemon

- rmcd		RSCT resource Monitoring and Control daemon 
- hatsd		RSCT Topology Services
- hagsd		RSCT group services

The following lines are added to inttab when you initially install hacmp. 

- It will start the clcomdES and clstrmgrES
subsystems if they are not already running.

hacmp:2:once:/usr/es/sbin/cluster/etc/rc.init > /dev/console 2>&1

- HACMP is configured for IP address takeover

harc:2:wait:/usr/es/sbin/cluster/etc/harc.net #HACMP newtwork startup


To start the cluster communication daemon:

# startsrc -s clcomdES


7.2.
----

To install HACMP:

From installation media

# smitty install_all


7.3.
----

The most commonly used shared storage in HACMP is:

> Fiber Attach Storage Server (FAStT)
> Enterprise Storage Servers (ESS/Shark)
> Serial Architecture Storage (SSA)

Devices supported:

- Traditional SCSI disks and enclosures
- SSA disks and enclosures
- FAStT / DS4xxx storage servers
- 2105 Enterprise Storage Servers nd DS8xxx and 6xxx
- Some 3rd party storage devices


The "cluster.es" and "cluster.cspoc" images which contain the HACMP runtime executable, 
are required and must be installed on all servers.


7.4.
----

Dynamic Node Priority:

# lssrc -ls clstrmgrES


7.5.
----

Shared Logical Volume:

Shared logical volume access can be made available in any of the following data accessing modes:

- Non-concurrent access mode
- Concurrent access mode
- Enhanced concurrent access mode

In a non-concurrent access configuration, only one cluster node can access the shared data at a time.
If the resource group containing the shared disk space moves to another node, the new node will activate
the disks and check the current state of the volume groups, logical volumes, and filesystems.

In a concurrent access configuration, data on the disks is available to all nodes concurrently.
Concurrent access mode is not supported for filesystems; 
instead you must use raw logical volumes or physical disks.

7.6.
----

>> Is my shared volume group online?

The following sequence will determine if the sharedvg volume group is currently online (often useful 
in application start scripts): 

if lsvg -o | grep -q -w sharedvg ; then
    echo sharedvg is online
else
    echo sharedvg is offline
fi


Note the use of the -w option on the grep invocation - this ensures that if you have a sharedvg and a sharedvg2 
volume group then the grep only finds the sharedvg line (if it exists). 
If you need to do something if the volume group is offline and don't need to do anything if it is online 
then use this: 

if lsvg -o | grep -q -w sharedvg ; then
    :	# null commmand if the volume group is online
else
    echo sharedvg is offline
fi


Some people don't like the null command in the above example. They may prefer the following alternative: 

lsvg -o | grep -q -w sharedvg
if [ $? -ne 0 ] ; then
    echo sharedvg is offline
fi

Although we're not particularily keen on the null command in the first approach, we really don't like the use 
of $? in if tests since it is far to easy for the command generating the $? value to become separated from the 
if test (a classic example of how this happens is if you add an echo command immediately before the if command 
when you're debugging the script). If we find ourselves needing to test the exit status of a command in an if test 
then we either use the command itself as the if test (as in the first approach) or we do the following: 

lsvg -o | grep -q -w sharedvg
rval=$?
if [ $rval -ne 0 ] ; then
    echo sharedvg is offline
fi

In our opinion (your's may vary), this makes it much more obvious that the exit status of the grep command is 
important and must be preserved. 


>>Starting a non-root process from within an application start script


A common requirement in an application start script is the need to start a program and/or shell script 
which is to be run by a non-root userid. This snippet does the trick: 

su - dbadmin -c "/usr/local/db/startmeup.sh"

This will run the startmeup.sh script in a process owned by the dbadmin user. Note that it is possible 
to pass parameters to the script/program as well: 


su - dbadmin -c "/usr/local/db/startmeup.sh PRODDB"

This runs the startmeup.sh script with a parameter indicating which database is to be started. 
A bit of formalism never hurts when it comes time later to do script maintenance. For example, use shell variables 
to specify the username and the command to be invoked: 

DBUSER=dbadmin
DBNAME=PRODDB
STARTCMD="/usr/local/db/startmeup.sh $DBNAME"
su - $DBUSER -c "$STARTCMD"

This makes it easy to change the username, database name or start command (this is particularily important 
if any of these appear more than once within the application start script). 
The double quotes around $STARTCMD in the su command are necessary as the command to be executed must be passed 
as a single parameter to the su command's -c option. 


>> Note 1: Killing processes owned by a user

A common requirement in application stop scripts is the need to terminate all processes owned by a 
particular user. The following snippet terminates all processes owned by the dbadmin user (this could be part 
of an application stop script that corresponds to the previous snippet that started the DB as dbadmin). 

DBUSER=dbadmin
kill ` ps -u $DBUSER -o pid= `

Since a simple kill is rarely enough and a kill -9 is a rather rude way to start a conversation, the following 
sequence might be useful: 

DBUSER=dbadmin
kill ` ps -u $DBUSER -o pid= `
sleep 10
kill -9 ` ps -u $DBUSER -o pid= `

To see how this works, just enter the ps command. It produces output along these lines: 
12276
12348

Note that equal sign in the pid= part is important as it eliminates the normal PID title which would appear 
at the top of the column of output. I.e. without the equal sign, you'd get this: 

  PID
12276
12348

Passing PID to the kill command is just a bad idea as writing scripts which normally produce error messages 
makes it much more difficult to know if things are working correctly. 


>> A more complete example of an application stop script

A common requirement in application stop scripts is the need to terminate all processes owned by a particular user. 
For example, a script along the following lines could be used to first gently and then forcibly terminate 
the database processes started in the previous example: 

#!/bin/ksh

DBUSER=dbadmin
STOPCMD="/usr/local/db/stopdb.sh"

# ask nicely
su - $DBUSER -c "$STOPCMD"

# wait twenty seconds and then get rude
sleep 20
kill ` ps -u $DBUSER -o pid= `

# wait ten more seconds and then get violent
sleep 10
kill -9 ` ps -u $DBUSER -o pid= `

# terminate any processes using our two shared filesystems
fuser -k /dev/sharedlv1
fuser -k /dev/sharedlv2

# make sure that our exit status is 0
exit 0


Good HACMP site: http://www.coredumps.de/doc/ibm/cluster/HAES/haes/gtoc.html


Note 8: Internal logic error in Group Services d
================================================


>> thread:


Q:

Hello,

I get below grpsvcs errors on my cluster nodes (HACMP 5.4 - cluster is UP and STABLE):

463A893D 0425054607 P O grpsvcs Internal logic error in Group Services d


A:

ok, I think I have found it - it is a bug in rsct 2.4.6 and cab be fixed installing fix for APAR IY91960

http://www-1.ibm.com/support/docview.wss?uid=isg1IY91960

it is:
rsct.basic.rte.2.4.6.3.bff
rsct.core.hostrm.2.4.6.1.bff
rsct.core.rmc.2.4.6.3.bff
rsct.core.sec.2.4.6.3.bff
rsct.core.sensorrm.2.4.6.1.bff
rsct.core.utils.2.4.6.3.bff
rsct.opt.saf.amf.2.4.6.1.bff
rsct.opt.storagerm.2.4.6.3.bff


>> thread:


APAR IY26257

APAR status
Closed as program error.

Error description 

detection of bouncing nodes is too slow
Local fix 

Problem summary 
When the RSCT Topology Services daemon exits in one node, it
takes a finite time for the node to be detected as down by
the other nodes on each of the networks being monitored.
This happens because the other nodes need to go through a
process of missing incoming heartbeats from the given node,
and can only declare the node down after enough heartbeats
are missed. If a new instance of the daemon is started
then it is possible for the old instance to be still
thought as alive by other nodes by the time the new
instance starts.

The current behavior may result in other nodes never
detecting the given node as down. This occurs especially
when different networks use different Topology Services
heartbeating tunable values -- which is often the case in
HACMP.

In HACMP/ES, if the cluster is stopped in one node and is
then restarted quickly, it is possible for the cluster to
become "hung", with this node being unable to join the
others in the cluster. The following AIX error log entry
may appear:

  LABEL:          GS_DOM_NOT_FORM_WA
  IDENTIFIER:     AA8DB7B3
  Type:            INFO
  Resource Name:   grpsvcs
  Description: Group Services daemon has not been
               established.

Other nodes may present the entry below:

  LABEL:          GS_ERROR_ER
  IDENTIFIER:     463A893D

The problem is aggravated by the presence of HAGEO
networks, which have very large timeout values.
Problem conclusion 
A number of protocol changes were introduced into the RSCT
Topology Services daemon. With the changes

   - nodes where the Topology Services daemon exits are
     going to be detected as down faster than before the
     fix.

   - nodes where the Topology Services daemon exits and
     is restarted quickly are going to be detected as
     down soon after the new instance starts.

With the fix, error log entries like GS_DOM_NOT_FORM_WA
should no longer occur when restarting the HACMP cluster
on a node. In addition, because the demise of the previous
instance of the Topology Services daemon is detected
sooner, the new instance is allowed to join its adapter
memberships faster.
Temporary fix 
Comments 
APAR information 
APAR number IY26257 
Reported component name RSCT 
Reported component ID 5765D5101 
Reported release 121 
Status CLOSED PER 
PE NoPE 
HIPER NoHIPER 
Submitted date 2001-12-11 
Closed date 2001-12-11 
Last modified date 2003-10-03 


>> thread:


Note 9:
-------

Typical 2 node /etc/hosts file:

#
# HACMP - Do not modify!
#
10.17.4.11      n5101l01-boot.nl.eu.abnamro.com n5101l01-boot
10.17.4.10      zd101l01-boot.nl.eu.abnamro.com zd101l01-boot
10.17.3.59      n5101l01.nl.eu.abnamro.com n5101l01
10.17.3.51      zd101l01.nl.eu.abnamro.com zd101l01
10.17.3.100     sonriso.nl.eu.abnamro.com sonriso
#
# End of HACMP
#


======================================================================
79. Notes on Installation and Migration AIX, HP-UX, Linux:
======================================================================


79.1 Migrations AIX 5.1,AIX 5.2,AIX 5.3:
---------------------------------------

-- New and Complete Overwrite 
This method installs AIX 5.3 on a new machine or completely overwrites any BOS version that exists on your system. 
For instructions on installing AIX on a new machine or to completely overwrite the BOS on an existing machine, 
refer to Installing new and complete BOS overwrite or preservation.

-- Preservation 
This method replaces an earlier version of the BOS but retains the root volume group, the user-created logical volumes, 
and the /home file system. The system file systems /usr, /var, /tmp, and / (root) are overwritten. 
Product (application) files and configuration data stored in these file systems will be lost. 
Information stored in other non-system file systems will be preserved. 
For instructions on preserving the user-defined structure of an existing BOS, refer to Installing new and 
complete BOS overwrite or preservation.

-- Migration 
This method upgrades from AIX 4.2, 4.3, 5.1, or 5.2 versions of the BOS to AIX 5.3 (see the release notes 
for restrictions). The migration installation method is used to upgrade from an existing version or release 
of AIX to a later version or release of AIX. A migration installation preserves most file systems, 
including the root volume group, logical volumes, and system configuration files. It overwrites the /tmp file system. 


Installation Steps 		New and Complete Overwrite 	Preservation 	Migration 
Create rootvg 			Yes 				No 		No 
Create file system /,/usr,/var 	Yes 				Yes 		No 
Create file system /home 	Yes 				No 		No 
Save Configuration 		No 				No 		Yes 
Restore BOS 			Yes 				Yes 		Yes 
Install Additional Filesets 	Yes 				Yes 		Yes 
Restore Configuration 		No 				No 		Yes 


Note 1:
-------

thread

Found that the prngd subsystem (used with ssh, random number generator) on AIX 5.1 is incompatible with the 
AIX 5.2 upgrade. BEFORE migration this subsystem should be disabled either in /etc/rc.local or erased completely: 
rmssys -s prngd

It has to be remade after migration (see customization). 
If prngd is not disabled, the final boot after 5.2 installation coredumps with 0C9 and the machine never recovers. 
In this case: 

Boot into maintenance mode (needs first 5.2 CD and SMS console) 
Limited function shell (or getrootfs) 
vi /etc/rc.local to disable prngd 

- Firmware/Microcode upgrade
It is wise to update the firmware/microcode of your system before upgrading the system. Checkout the IBM support 
site Directly via ftp site. 
- Base system
Straightforward like installing from scratch. When asked, select "Migration" instead of "Overwrite" installation. 


Note 2:
-------

thread:

Problem creating users on AIX 5.2
Reply from tcarlson on 6/28/2007 6:14:00 PM  

It appears to working now. Thanks for the replies. 
Not exactly sure what fixed it, but I ran usrck, grpck, and pwdck and it started working again. 


Note 3:
-------

AIX 5.2 Installation Tips (Doc Number=1612) 
  Fix Readme 

April 27, 2005 

--------------------------------------------------------------------------------
This document contains the latest tips for successful installation of AIX 5.2, and will be updated as new tips become available. 
APARs and PTFs mentioned in this document, when available, can be obtained from the following web site. 

http://www.ibm.com/servers/eserver/support/pseries/aixfixes.html 
http://www14.software.ibm.com/webapp/set2/sas/f/genunix3/aixfixes.html

The AIX installation CD-ROMs and the level of AIX pre-installed on new systems may not contain the latest fixes available at the time you install the system, and may contain errors. Some these fixes may be critical to the proper operation of your system. We recommend that you update to the latest service level, which can be obtained from http://www.ibm.com/servers/eserver/support/pseries/aixfixes.html. 
The compare_report command, which is documented in the AIX Commands Reference, can be used to determine which available updates are newer than those installed on your system. 


--------------------------------------------------------------------------------

Reads from Frozen JFS2 Filesystem Hang 
System Firmware for POWER 4 Systems 
Critical Updates for 5200-04 
CD-ROM Installation of JS20 Appears to Hang 
oslevel -r Does not Indicate 5200-04 After Update 
Documentation Library Service Broken Links 
lppchk Error on /usr/lib/perl with HACMP Installed 
License Agreement Failures 
Possible Data Error on DR Memory Remove 
Possible File Corruption Running defragfs 
ksh: clean_up_was: not found 
Installation of devices.artic960 5.2 
ARTIC960Hx SDLC and Bisync Support 
Inventory Scout 
HMC 3.2.0 Upgrade 
TSM 5.1 Not Supported on AIX 5.2 
AIXlink/X.25 Version 2 on AIX 5.2 


--------------------------------------------------------------------------------

Reads from Frozen JFS Filesystem Hang
After application of the 5.2.0.60 level kernels (bos.mp, bos.mp64, bos.up), which are included on the 5/2005 Update CD and in the 5200-06 Recommended Maintenance package, reads from a frozen JFS2 filesystem will no longer be possible. All reads from a frozen filesystem will be blocked until the filesystem is thawed. Because of this, a filesystem level backup, such as a backup using the backup command or through TSM, will appear to hang until the filesystem is thawed. This restriction will be lifted in APAR IY70225. 
Backups using FlashCopy or similar logical volume or device level backups are still possible on a frozen filesystem. 


--------------------------------------------------------------------------------

System Firmware for POWER 4 Systems
The AIX 5200-04 (or later) Recommended Maintenance package exposes a problem in older levels of POWER 4 system firmware that can manifest itself as either a system hang or hang in some diagnostic commands. The commands that expose the problem include, but may not be limited to snap, lscfg, and lsresource. This problem can be resolved by installing the latest system firmware, available at the following web site. 

http://techsupport.services.ibm.com/server/mdownload/ 
http://www14.software.ibm.com/webapp/set2/firmware/gjsn


--------------------------------------------------------------------------------

Critical Updates for 5200-04
When installing the 5200-04 Recommended Maintenance package, it is also recommended that you install the following APARs. 
IY64978  Possible system hang while concurrently renaming and unlinking under JFS. This APAR is currently available.  
IY63366  Loader may fail to find symbol even though the symbol is present in the symbol table. This can cause applications that use dynamically loaded modules to fail. Prior to APAR availability, an emergency fix is available at: 
ftp://service.software.ibm.com/aix/efixes/iy63366/  


Systems running bos.rte.lvm 5.2.0.41 or later should install APAR IY64691. APAR IY64691 fixes a problem with the chvg-B command that can cause data corruption on Big volume groups which were converted from normal volume groups. Prior to APAR availability, obtain the emergency fix for APAR IY64691 from: 
A href="ftp://service.software.ibm.com/aix/efixes/iy64691/">ftp://service.software.ibm.com/aix/efixes/iy64691/ 

Systems running bos.rte.lvm 5.2.0.50 should install APAR IY65001. APAR IY65001 fixes a possible corruption issue with mirrored logical volumes. This APAR also contains the fix for APAR IY64691. Prior to APAR availability, obtain the emergency fix for APAR IY65001 from: 
ftp://service.software.ibm.com/aix/efixes/iy65001/ 

Systems running bos.rte.aio 5.2.0.50 should install APAR IY64737. APAR IY64737 fixes a problem where applications that use Asynchronous I/O (AIO) can cause a system hang. Prior to APAR availability, obtain the emergency fix for APAR IY64737 from: 
ftp://service.software.ibm.com/aix/efixes/iy64737/ 


--------------------------------------------------------------------------------

CD-ROM Installation of JS20 Appears to Hang
Following installation of packages from CD-ROM volume 1, the installation appears to hang for 45 to 60 minutes before prompting for volume 2. During this time, the installation process is verifying the installed packages. The problem will be fixed in a later level of the AIX installation media. 


--------------------------------------------------------------------------------

oslevel -r Does not Indicate 5200-04 After Update
After updating from the 8/2004 Update CD or the 5200-04 Recommended Maintenance package, the 'oslevel -r' command may not indicate 5200-04. This occurs when updating from a level lower than 5200-03 because the bos.pmapi.tools update does not install. Performing a second update from the media will install the bos.pmapi.tools update and correct the problem. 


--------------------------------------------------------------------------------

Documentation Library Service Broken Links
A search of the Documentation Library Service will return hits, but the links result in "404 Not Found" errors. This problem occurs with the sysmgt.websm filesets at the 5.2.0.30 level. This can be resolved using the following command, where $DOC_LANG is any language directory installed under /usr/HTTPServer/htdocs (example: de_DE, en_US, etc.). 
ln -sf /usr/share/man/info /usr/HTTPServer/htdocs/$DOC_LANG/doc_link 


--------------------------------------------------------------------------------

lppchk Error on /usr/lib/perl with HACMP Installed
After installing the cluster.es.server.rte fileset at the 5.2.0.0 level, the lppchk command may return an error due to a missing /usr/lib/perl link. The error can be resolved by doing a force overwrite install of perl.rte from the base AIX media, or by running the following command as root. 
ln -s /usr/opt/perl5/lib /usr/lib/perl 


--------------------------------------------------------------------------------

License Agreement Failures
Installation of some device packages using SMIT may fail due to license agreement failures. This is caused by a missing SMIT "ACCEPT new license agreements" option. To resolve this issue, first install APAR IY52152, which is included in bos.sysmgt.smit 5.2.0.30 or later. 


--------------------------------------------------------------------------------

Possible Data Error on DR Memory Remove
For systems running Dynamic Logical Partitioning (DLPAR), it is imperative that the fix for APAR IY50852 be installed. To determine if APAR IY50852 is installed on your system, use the command: 
instfix -ik IY50852 
When doing a dynamic reconfiguration memory remove operation with DMA operations ongoing, it is possible that data actively being DMAed to pages within the memory being removed may be misdirected to memory that is no longer active in the partition. This could result in a program reading wrong data. 


--------------------------------------------------------------------------------

Possible File Corruption Running defragfs
For systems using JFS2 filesystems, it is imperative that the fix for APAR IY50791 be installed. To determine if APAR IY50791 is installed on your system, use the command: 
instfix -ik IY50791 
When data is synced (written to disk) soon after running the defragfs command on a JFS2 filesystem, incomplete data can be written. Sync operations can be performed with the sync command, but are also performed when unmounting a filesystem or during a system shutdown or reboot. 


--------------------------------------------------------------------------------

ksh: clean_up_was: not found
When attempting to run the clean_up_was shell script as part of installing the Websphere Application Server 5.0.1 trial on the AIX 5.2 Bonus Pack (LCD4-1141-02), the following error may be encountered. 
	ksh: clean_up_was: not found.
This problem is caused by control characters at the end each line within the script. To correct the script, use the following procedure: 
	mv clean_up_was clean_up_was.orig
	tr -d '\015' <clean_up_was.orig >clean_up_was
	chmod 755 clean_up_was


--------------------------------------------------------------------------------

Installation of devices.artic960 5.2
To successfully upgrade to devices.artic960 5.2 from a previous version, it is necessary to install APAR IY48642. 


--------------------------------------------------------------------------------

ARTIC960Hx SDLC and Bisync Support
The devices.pci.14108c00 fileset provides support for SDLC and bisynchronous protocols on the IBM ARTIC960Hx 4-Port Selectable PCI Adapter, (FC 2947). When combined with the installation of the devices.artic960 5.2.0.0 fileset, Enhanced Error Handling (EEH) support is provided. APAR IY44132 provides 64-bit support. 


--------------------------------------------------------------------------------

Inventory Scout
Inventory Scout introduces a new microcode management graphical user interface (GUI). This feature is available on your AIX system by installing an additional fileset, invscout.websm, onto the system, or if a Hardware Management Console (HMC) is attached, using the microcode update function. This GUI is a Web-based System Manager plug-in that surveys the microcode levels of the system, and on POWER4 systems, downloads and installs microcode. Inventory Scout continues to work with the applet found at https://techsupport.services.ibm.com/server/aix.invscoutMDS to survey only. 
This release of Inventory Scout significantly changes the method used to determine the microcode levels of systems, adapters, and devices to compare to the latest available levels. Previously, data was collected and sent to IBM to determine the current state of the system. 

The new microcode management feature does the following: 

Downloads a catalog of currently available levels to the system being examined 
Conducts a microcode survey on the system and compares to the latest available microcode 
On the POWER4 systems, allows you to download and flash to the latest microcode available 
This new microcode survey procedure might cause some problems with some customer techniques used today for surveying systems and might require changes to those procedures. 

This microcode management feature relies on system features that were not present in previous generations of systems. Support for microcode on these systems is limited to survey only. For more information about microcode updates, see http://techsupport.services.ibm.com/server/mdownload. To enable this new Inventory Scout functionality, you will need the following filesets at the specified levels or higher: 

	invscout.com            2.1.0.1
	invscout.ldb            2.1.0.2
	invscout.rte            2.1.0.1
	invscout.websm          2.1.0.1
To obtain the required filesets, order APAR IY44381. Go to the following URL: 

http://www.ibm.com/servers/eserver/support/pseries/aixfixes.html 
If you are using this microcode management feature tool through the HMC, your HMC must be at Release 3, Version 2.2. This can be obtain by ordering APAR IY45844. 

The HMC code can be obtained from http://techsupport.services.ibm.com/server/hmc/. 

Known Problems: 

The following devices supported in POWER4 systems have limitations in the ability to update microcode with this microcode management feature. 
SCSI Enclosure Services (ses) Microcode for 7311-D20, 7038-6M2 & 7028-6C4/6E4 
7040-61D SCSI I/O Drawer 
PCI 4-Channel Ultra3 SCSI RAID Adapter 
CD-ROM and DVD-ROM Drives 
RAID Devices 
SSA devices and adapters 

For more information about these devices, see the Readme files at http://techsupport.services.ibm.com/server/mdownload. 


When updating system firmware from an HMC, the connection between the HMC and the system might get out of sync. This situation can be recovered by going to your server management panel on the HMC and selecting Rebuild Managed System. 

Some adapters and devices do not support concurrent operation with microcode flashing. Such devices must be taken off-line to update microcode. This situation creates a problem when updating microcode for these communications adapters, such as Ethernet adapters used to communicate with the Internet to obtain the microcode updates or communicate with an HMC. In this case, if the adapters are on-line and the update is attempted, the final step of flashing the device is not completed. You can complete the update procedure by taking the device off-line, and going into diagnostic service aids to download microcode to that device. 

Due to the changes in how the survey works, you can no longer concatenate survey results prior to sending them to IBM. 

There is a known system firmware upgrade problem with pSeries 690 or pSeries 670 Servers that have six 7040-61D I/O Drawers and three Integrated Battery Features (IBFs) (battery backup) OR seven or more 7040-61D I/O Drawers, regardless of the number of IBFs. Systems with this configuration should not use the new GUI for microcode management to update the system firmware. For additional information, reference the 7040-681 and/or 7040-671 Readme files which can be found at http://techsupport.services.ibm.com/server/mdownload. 


--------------------------------------------------------------------------------

HMC 3.2.0 Upgrade
The following enhancements delivered for the p670 and p690 systems in May 2003 will not be available unless the Hardware Management Console (HMC) is upgraded to the 3.2.0 or later level. New HMCs delivered after May 2003 will be at the required level. You can obtain the latest HMC update from the Download section of the pSeries Support web pages at http://www.ibm.com/servers/eserver/support/pseries. 
32 partitions 
Distributed RMC 
CUoD - email home 
CUoD memory - permanent 
CUoD memory - try & buy 
CUoD processor - try & buy 
Customer firmware management 
Fast activation of a partition 
Flat Panel display support 
Full HMC command line 
Microcode download from HMC 


--------------------------------------------------------------------------------

TSM 5.1 Not Supported on AIX 5.2
Tivoli Storage Manager (TSM) 5.1 is not compatible with AIX 5.2 and will cause a system crash if installed. TSM 5.1 is shipped in error on the AIX 5L for POWER V5.2 Bonus Pack (LCD4-1141-00) dated 10/2002, and should not be installed. 
Once TSM 5.1 is installed on AIX 5.2, the system will crash on every reboot. To recover from this situation, the system will have to be booted in maintenance mode from the AIX 5.2 installation media or a system backup (mksysb) to uninstall the tivoli.tsm.* filesets. Alternatively, the following line can be uncommented in /etc/inittab by inserting a ':' (colon) at the beginning of the line. 


adsmsmext:2:wait:/etc/rc.adsmhsm > /dev/console 


AIXlink/X.25 Version 2 on AIX 5.2
To avoid a system crash when using TCP/IP over X.25 functionality, it is necessary to install APAR IY45606. 

See: http://www-1.ibm.com/support/docview.wss?uid=isg1SSRVAIX52TIPS081512_450 
 

Note 4:
-------

thread:

Q:

Attention msg during mksysb 

Hi,
I am running AIX 5.2 ML03, I am receving following Attention msg during the
mksysb

****ATTENTION****
The boot image you created might fail to boot because the size exceeds the
system limit. For information about fixes or workarounds,
see/usr/lpp/bos.sysmgt/README.
****ATTENTION****
..
Creating list of files to back up..
Backing up 569000 files.................................


What am I missing in it? any help or hints or tips will be of great value to
me. Thanks

A:

This solution DOES NOT WORK on models 7028, 7029, 7038, 7039, and 7040
systems, see option 4 regarding these models.
If APAR IY40824 (AIX 5.1) or IY40975 (AIX 5.2) was installed prior to making
the backup, then you may boot from the backup and go to the open firmware
prompt. To get to the open firmware prompt, when the system beeps twice after
powering it on, press F8 on the keyboard (or the 8 key on an ASCII terminal).
You can also get to the open firmware prompt from SMS. The open firmware
prompt is also referred to as the "OK" prompt. On some systems there will be
a menu option located on the initial SMS menu. On others, it will be located
under the Multiboot menu. From the open firmware prompt execute the following:

>setenv real-base 1000000
>reset-all

Notes:
a) To use this option, the backup must have this APAR in it and therefore
must be created after installing the APAR.
b) The above commands will have to be executed each time you boot from the
large boot image backup media.
 

Note 5:
-------

Configuring mksysb image on system backup tapes
Use the swmksysb command to ensure that the boot image, BOS Installation/Maintenance image, 
and the table of contents image are created with a tape block_size value of 512.

Bootable mksysb tapes comprise the following images: 
Boot image 
BOS Installation/Maintenance image 
Table of contents image 
System backup image 
The system backup image is the actual backup of the files in the rootvg in all JFS-mounted file systems.
The boot image, BOS Installation/Maintenance image, and the table of contents image must be created with a 
tape block_size value of 512. The mksysb command ensures that the block size is 512 when these images are created. 
There are no restrictions on the block size used for the fourth (system backup image) on the tape. 
The block size of the system, before it was temporarily set to 512, is used for the fourth image on the tape.

The value of the block size must be saved in the /tapeblksz file in the second image on the tape. 
The second and fourth images are stored in backup/restore format. Again, mksysb ensures the correctness 
of the tapes created by using the mksysb command.

If there are problems with the bosinst.data file, the image.data file, or the tapeblksz file, these files 
can be restored from the second image on the tape and checked. These files, as well as commands necessary 
for execution in the RAM file system (when running in maintenance mode after booting from the tape), 
are stored in the second image.


Note 6:
-------

thread

Before you migrate 5.1 -> 5.2, do as an absolute minimum the following:

- errpt, and resolve all serious issues. If you can't, then STOP.
- enough free space rootvg, /, /tmp, /usr, /var
- lppchk -v   If dependencies are not OK, then correct or STOP.
- check firmware. Is the current firmware ok for AIX52? Use "prtconf" or "lsmcode".

  Example:

  To display the system firmware level and service processor (if present), type: 
  # lsmcode -c

  The system displays a message similar to the following: 
  System Firmware level is TCP99256

  If the Firmware version is not current enough, then upgrade or STOP.

Or use
  # lscfg -vp | grep -p Platform

- Always create a mksysb tape.

Note: its quite likely that your apps still need a number of AIX fixes, APARS 
      before they can run on AIX52.


POWER5 Firmware releases:

Release End of Currency 
SF240   To be announced 
SF235   March 2007 
SF230   March 2007 
SF225   Out of Currency 
SF222   Out of Currency 
SF220   Out of Currency 
SF210   Out of Currency 

Firmware lower than SF230 is not usable for upgrades/migration.


Note 7:
-------

thread

Migration AIX 4.3.3 to 5.2, Rebooting don't stop
Reply from Simone on 2/9/2005 10:17:00 AM  

Hi, the Reboot problem during migration to 5.2 is SOLVED. 

As described in the previous messages, we have 4 networks cards and two are unused 
(no cable attached, no tcpip adress defined and no network mask allocated). 

During the reboot, file "/etc/rc.net" is executed (boot stage two). This one call "/usr/lib/methods/cfgif" 
which configure the network (ethernet adapter, server name, default gateway, static routes). 
Because of the two unconfigured cards and the execution of "/usr/lib/methods/cfgif", server do a "SYSTEM DUMP DEV" 
and reboot again. 

To solve this issue, two ways are possible: 
- detach the unconfigured cards (chdev -l en0 -a state=detach) 
- configure the cards 

Please note, that no informations have been founded into IBM documentation about this issue. 

THEN, BEFORE A AIX UPGRADE 4.3.3 TO 5.2, BE SURE TO HAVE ALL cards are correctly configured. 

Thanks 


Note 8:
-------


==========================
80. Some IP subnetting:
==========================

Traditional IP Classes are:

The  first byte in the 4 byte address corresponds to:

Class A: 1-126	        0xxxxxxx.yyyyyyyy.yyyyyyy.yyyyyyyy
Class B: 128-191	10xxxxxx.xxxxxxxx.yyyyyyy.yyyyyyyy
Class C: 192-223	110xxxxx.xxxxxxxx.xxxxxxx.yyyyyyyy
Class D: 224		1110----.--------.-------.--------  (not used, or multicast)

Notice the first bits in the first byte in the class address.


Note : 127.aaaaaaaa.aaaaaaa.aaaaaaaa is reserved for debugging/testing purpose, or local host


In an IP address aaaaaaaa.aaaaaaaa.aaaaaaaa.aaaaaaaa, the 8 bits "aaaaaaaa" corresponds  to

ax2macht7+ax2macht6+ax2macht6+ax2macht5+ax2macht4+ax2macht3+ax2macht2+ax2macht1+ax2macht0 

with "a" is a bit, that is, a=0 or a=1

1x128 + 1x64 + 1x32 + 1x16 + 1x8 + 1x4 + 1x2 + 1x1

So, in for example in class A: 0xxxxxxx, means that the first byte is at maximum value of
0x128 + 1x64 + 1x32 + 1x16 + 1x8 + 1x4 + 1x2 + 1x1 = 127, but 127 is reserved, so Class A runs from 1 - 126

Similar for Class B: it can be minimum of 10000000 or 10111111 in the first byte, and thats 128-191
Remember, by design, the first two bits in B, MUST BE "10", and the other 6 bits can vary.


Subnetting:

Class C subnetting:  
                    No of     No of     No of      No of
		    subnets   hosts	subnetbits hostbits
-----------------------------------------------------------
 *255.255.255.128        NA      NA           1        7     * not valid with most routers
  255.255.255.192         2      62           2        6
  255.255.255.224         6      30           3        5
  255.255.255.240        14      14           4        4
  255.255.255.248        30       6           5        3
  255.255.255.252        62       2           6        2


Class B subnetting:
                    No of     No of     No of      No of               
		    subnets   hosts	subnetbits hostbits
-----------------------------------------------------------
255.255.128.0            NA      NA           1       15
255.255.192.0             2   16382           2       14
255.255.224.0             6    8190           3       13
255.255.240.0            14    4094           4       12
255.255.248.0            30    2046           5       11
255.255.252.0            62    1022           6       10
255.255.254.0           126     510           7        9
255.255.255.0           254     254           8        8
255.255.255.128         510     126           9        7
255.255.255.192        1022      62          10        6
255.255.255.224        2046      30          11        5
255.255.255.240        4094      14          12        4
255.255.255.248        8190       6          13        3
255.255.255.252       16382       2          14        2


========================================
81. Notes on TSM:
========================================


81.1 Notes on TSM client dsmc:
==============================

Installing the TSM 5.1 client under IBM AIX 4.3.3 

These instructions will guide you through the installation and configuration of the Tivoli 
Storage Manager client, so you can back up your data using DoIT's Bucky Backup service. 
You should be familiar with the AIX operating system distribution and have root or root-equivalent 
access to the machine you are working with. These instructions and the AIX client are specific to 
the pSeries & RS/6000 architecture.

Data that has been backed up or archived from a TSM v5.1 client cannot be restored or retrieved to any 
previous level client. The data must be restored or retrieved by a v5.1.0 or higher level client. 
Once you migrate to 5.1 you cannot go back to an older client (but you can certainly restore older data). 
This is non-negotiable. You have been warned.

This product installs into /usr/tivoli/tsm/client. It requires 40 to 50 megabytes of space. 
During the installation SMIT will extend the filesystem if you do not have enough space. 
If you do not have space in rootvg, you can symlink /usr/tivoli/tsm/client into a directory 
where you do have enough space.

You must have registered a node and have received confirmation of your node name. Make sure you know 
the password that you specified when applying for the node.

You must have xlC.rte installed in order to install the client. If you wish to use the graphical client 
under AIX you must have AIXwindows X11R6, Motif 1.2 or Motif 2.0, and the CDE installed.

Acquire the software from Tivoli. You can use wget or lynx to retrieve the files from their web site 
(or use the "Save Target As..." feature of your browser):

ftp://service.boulder.ibm.com/storage/tivoli-storage-management/maintenance/client/v5r1/
Start SMIT to install the software:

smitty install

Select "Install and Update Software", then "Install and Update from LATEST Available Software". 
When it prompts you for the "INPUT device / directory for software" specify the directory in which 
you saved the installation files. Proceed to install the software ("_all_latest")

Change to the new directories created for the client:

cd /usr/tivoli/tsm/client/ba/bin


Create and edit the dsm.sys, dsm.opt, and inclexcl files for your system. Sample files are linked. 
At a minimum, you will have to edit dsm.sys and insert your node name.

Start dsmc by using the ./dsmc command. Enter the command "query schedule" and you will be prompted 
for your node's password. Enter your password and press enter. Once it successfully displays the node's 
backup schedule, enter the command "quit" to exit it. This saves your node's password, so that backups 
and other operations can happen automatically.

To start the TSM client on reboot, edit /etc/inittab and insert the line (all one line):

tsm:2:once:/usr/tivoli/tsm/client/ba/bin/dsmc schedule servername=bucky3 > /dev/null 2>&1 < /dev/null
Issue the following command on the command line, as root, to manually start dsmc:

nohup /usr/tivoli/tsm/client/ba/bin/dsmc schedule -servername=bucky3>/dev/null & 

Verify that the client has started and is working by checking the log files in /usr/tivoli/tsm/client/ba/bin.

You can perform a manual backup to test your settings using the command:

/usr/tivoli/tsm/client/ba/bin/dsmc incremental

Remember that if you change the settings in dsm.sys, dsm.opt, or inclexcl you need to restart the software.

Upgrading the TSM client from 4.2 to 5.1

To upgrade the TSM client from 4.2.1 to 5.1 use the following procedure:

Obtain a copy of the software (use the links at the top of this page). 

Kill the running copy of dsmc (a "ps -ef | grep dsmc" will show you what is running. Kill the parent process). 

Back up dsm.opt, dsm.sys, and inclexcl from your old configuration (probably in /usr/tivoli/tsm/client/ba/bin). 
The upgrade will preserve them, but it pays to have a backup copy. 

Upgrade the TSM client packages using "smitty install". Select "Install and Update Software", 
then "Update Installed Software to Latest Level (Update All)". Specify the directory in which 
the software was downloaded. 

Edit your dsm.sys file and ensure that the TCPServeraddress flag is set to buckybackup2.doit.wisc.edu 
OR buckybackup3.doit.wisc.edu (this just ensures future compatibility with changes to the service). 
This setting could be either server, depending on when you registered your node. 

Restart dsmc using the command:

nohup /usr/tivoli/tsm/client/ba/bin/dsmc schedule -servername=bucky2 >/dev/null & 

Watch your logs to ensure that a backup happened. You can also invoke a manual backup using 
"dsmc incremental" from the command line. 

So how to install:

Run the script tsm-instl 
Modify /usr/tivoli/tsm/client/ba/bin/dsm.opt 
Modify /usr/tivoli/tsm/client/ba/bin/dsm.sys 
Modify /var/tsm/inclexcl.dsm 
Register the target machine name with TSM 

The config files are thus "dsm.opt" and "dsm.sys"


zd77l06:/usr/tivoli/tsm/client/ba/bin>cat dsm.opt

SErvername      ZTSM01
dateformat      4
compressalways  no
followsymbolic  yes
numberformat    5
subdir          yes
timeformat      1


zd77l06:/usr/tivoli/tsm/client/ba/bin>cat dsm.sys

SErvername            ZTSM01
   COMMmethod         TCPip
   TCPPort            1500
   TCPServeraddress   cca-tsm01.ao.nl.abnamro.com
   HTTPPort           1581
   PASSWORDACCESS     GENERATE
   schedmode          PROMPTED
   nodename           zd77l06
   compression        yes
   SCHEDLogretention  7
   ERRORLogretention  7
   ERRORLogname       /beheer/log/tsm/dsmerror.log
   SCHEDLogname       /beheer/log/tsm/dsmsched.log

On HP these directories could be /opt/tivoli/tsm/client/ba/bin/dsm.opt


If you need to exclude a filesystem in the backup run, you can edit dsm.sys and put in an exclude statement
like in the following example:

SErvername            ZTSM01
Exclude "/data/documentum/dmadmin/*"
   COMMmethod         TCPip
   TCPPort            1500
   TCPServeraddress   cca-tsm01.ao.nl.abnamro.com
   HTTPPort           1581
   PASSWORDACCESS     GENERATE
   schedmode          PROMPTED
   nodename           zd110l14
   compression        yes
   SCHEDLogretention  7
   ERRORLogretention  7
   ERRORLogname       /beheer/log/tsm/dsmerror.log
   SCHEDLogname       /beheer/log/tsm/dsmsched.log


81.2 Examples of the dsmc command:
==================================

To view schedules that are defined for your client node, enter: 
# dsmc query schedule
# dsmc q ses

To change the password:
# /usr/bin/dsmc set password 

To make a test incr backup of /tmp now, to see if backups work:
# /usr/bin/dsmc inc /tmp
/usr/bin/dsmc inc /home/se1223

To see what the mksysb backup status is:
# dsmc q archive /mkcd/mksysb.img


81.3 Restore with dsmc:
=======================

-- Example 1:
To restore a file /a/b to /c/d :

# dsmc restore /a/b /c/d

-- Example 2:
Restore the most recent backup version of the /home/monnett/h1.doc file, even if the backup is inactive.

# dsmc restore /home/monnett/h1.doc -latest

-- Example 3:
Display a list of active and inactive backup versions of files from which you can select versions to restore.

# dsmc restore "/user/project/*"-pick -inactive

-- Example 4:
Restore the files in the /home file system and all of its subdirectories.

# dsmc restore /home/ -subdir=yes

-- Example 5:
Restore all files in the /home/mydir directory to their state as of 1:00 PM on August 17, 2002.

# dsmc restore -pitd=8/17/2002 -pitt=13:00:00 /home/mydir/

-- Example 6:
Restore all files from the /home/projecta directory that end with .bak to the /home/projectn/ directory.

# dsmc restore "/home/projecta/*.bak" /home/projectn/

# dsmc restore /data/documentum/dmadmin/backup1008/*
# dsmc restore /data/documentum/dmadmin/backup1608/*

small script for scheduling a restore job from cron:

!#/usr/bin/ksh

echo "Starting backup at: " >> /beheer/adhoc_restore.log
date >> /beheer/adhoc_restore.log

cd /data/documentum/dmadmin
dsmc restore /data/documentum/dmadmin/backup_3011/*

echo "End of backup at: " >> /beheer/adhoc_restore.log
date >> /beheer/adhoc_restore.log


-- Example 7:

- Use of FROMDate=date                                                      
Specify a beginning date for filtering backup versions. Do not  
restore files that were backed up before this date.             
                                                                        
You can use this option with the TODATE option to create a time 
window for backup versions. You can list files that were backed 
up between two dates.                                           
                                                                        
For example, to restore all the files that you backed up from   
the /home/case directory from April 7, 1995 through April 14,   
1995, enter:                                                    
                                                                        
# REStore "/home/case/" -FROMDate=04/07/1995 -TODate=04/14/1995 

As another example, to restore backup versions of files that were created during 
the last week of March 1995 for the files in the /home/jones directory, enter: 

# Restore -FROMDate=03/26/1995 -TODate=03/31/1995 /home/jones
                                                                        
The date must be in the format you select with the DATEFORMAT   
option. For example, the date for date format 1 is mm/dd/yyyy,  
where mm is month; dd is day; and yyyy is year. If you include  
DATEFORMAT with the command, it must precede FROMDATE and       
TODATE.                                                         
                                          
- Use of TODate=date                                                        
Specify an end date for filtering backup versions. ADSM does not
restore backup versions that were backed up after this date.    
                                                                        
You can use this option with the FROMDATE option to create a    
time window for backups. You can restore backup versions that   
were backed up between two dates                                
                                                                        
For example, to restore files from the /home/case directory that
were backed up between April 7, 1995 and April 14, 1995, enter  
the following:                                                  
                                                                        
res "/home/case/" -FROMDate=04/07/1995 -TODate=4/14/1995      
                                                                        
The date must be in the format you select with the DATEFORMAT   
option. For example, the date for date format 1 is mm/dd/yyyy,  
where mm is month; dd is day; and yyyy is year. If you include  
DATEFORMAT with the command, it must precede FROMDATE and       
TODATE.                                       


To start the clients Graphical User Interface enter dsm. The TSM GUI appears.


Example in Dutch:
-----------------

Bestanden terughalen van de backup.
Elke nacht worden er backup's gemaakt van alle data. Als U, om wat voor een reden dan ook, een bestand 
kwijt bent geraakt, kunt U dit zelf terughalen van de backup als er aan de volgende voorwaarden wordt voldaan: 

-het bestand heeft minstens 1 nacht op de server gestaan 
-U bent het minder dan 30 dagen geleden kwijtgeraakt 

Files die u dezelfde dag op het systeem zet en weer weggooid, kunt u niet met een restore terughalen! 

Om een bestand terug te halen (restore) gaat U als volgt te werk: 
Log in op de machine en tik op de commando prompt: 

$ dsmc restore -ina -subdir=yes "file-selectie"

waarbij voor "file-selectie" de filenaam opgegeven wordt van het bestand dat U kwijt bent. Weet U de naam 
niet meer precies tik dan in: 

dsmc restore -ina -subdir=yes "*"

met de "'s. 

--Een voorbeeld
Voorbeeld van een restore 

De voorbeeld file: 

-rw-rw-r--    1 faq  faq      269 Mar 18 16:12 testfile

File (per ongeluk) weggooien: 

$ rm testfile

Controle op het (niet meer) bestaan van de file: 

$ ls -l testfile
ls: testfile: No such file or directory

Starten van een restore: 

$ dsmc restore -pick "test*"

TSM Scrollable PICK Window - Restore

    #    Backup Date/Time        File Size A/I  File
---------------------------------------------------------------
 x  1. | 19-03-2002 03:13:22        269  B  A   /www/faq/testfile
       |
       |
       |
       |
       0---------10--------20--------30--------40--------50--------60--
<U>=Up  <D>=Down  <T>=Top  <B>=Bottom  <R#>=Right  <L#>=Left
=Goto Line #  <#>=Toggle Entry  <+>=Select All  <->=Deselect All
<#:#+>=Select A Range <#:#->=Deselect A Range  <O>=Ok  <C>=Cancel
pick>

Selecteer de te restoren file, in dit geval kiezen we voor testfile 

pick> 1

TSM Scrollable PICK Window - Restore

     #    Backup Date/Time        File Size A/I  File
        ---------------------------------------------------------------
x    1. | 19-03-2002 03:13:22        269  B  A   /www/faq/testfile
        |
        |
        |
        |
        0---------10--------20--------30--------40--------50--------60--
<U>=Up  <D>=Down  <T>=Top  <B>=Bottom  <R#>=Right  <L#>=Left
=Goto Line #  <#>=Toggle Entry  <+>=Select All  <->=Deselect All
<#:#+>=Select A Range <#:#->=Deselect A Range  <O>=Ok  <C>=Cancel
pick>

Bevestig met OK 

pick> o

 ** Interrupted **
ANS1114I Waiting for mount of offline media.

Opmerking: wachttijd is afhankelijk van de drukte op de TSM-server 

Restoring             269 /www/faq/testfile [Done]

Restore processing finished.

Total number of objects restored:         1
Total number of objects failed:           0
Total number of bytes transferred:      296  B
Data transfer time:                    0.00 sec
Network data transfer rate:        19,270.83 KB/sec
Aggregate data transfer rate:          0.00 KB/sec
Elapsed processing time:           00:04:05

Controle op het bestaan van de file: 


$ ls -l testfile
-rw-rw-r--    1 faq      faq           269 Mar 18 16:12 testfile

De (test-)file is weer te gebruiken. 


Example:
--------


- Backup von Dateien:
Mittels des Befehls:

C:>dsmc incremental -subdir=yes "d:\*.*"

kann beispielsweise der Inhalt der ganzen Festplatte (oder Partition) d:\ Ihres PCs gesichert werden. 
Analog lassen sich Unterverzeichnisse derselben Platte sichern. Hier noch ein Beispiel unter dem Betriebssystem Unix:

# dsmc incremental -subdir=no /usr/users/holo

Dieser Befehl sichert alle Dateien in dem Verzeichnis /usr/users/holo .

- Anzeige von mittels Backup gesicherten Dateien:
Um herauszufinden welche Dateien durch Backup im TSM Server gespeichert sind, gibt es folgendes Kommando:

dsmc query backup -subdir=yes "*"

womit Sie alle Ihre gesicherten Dateien aufgelistet bekommen. Die Option -inactive gestattet zus,tzlich 
das Auflisten aller gespeicherten fr_heren Versionen Ihrer Dateien. Haben Sie z.B. unter Unix im Verzeichnis 
/u/holo/briefe/1997 durch Znderungen mehrere Versionen Ihrer Briefe konferenz97.tex abgespeichert, 
so bekommen Sie eine Liste aller Dateien durch: 

# dsmc query backup -inactive \ /u/holo/briefe/1997/konferenz97.tex

Der obige \ ist die Unix Zeilenfortsetzung.

- Restore von Dateien:
Es ist passiert: Sie haben sich eine Datei die Sie sp,ter ben"tigen gel"scht. Mittels des Kommandos restore 
k"nnen Sie Dateien die mittels Backup gesichert wurden, wiederherstellen.

Der Befehl:

# dsmc restore -subdir=yes "/u/holo/briefe/*"

speichert alle aktiven Dateien im Unterverzeichnis /u/holo/briefe wieder zur_ck. Falls Sie dabei existierende 
Dateien _berschreiben wollen, so k"nnen Sie die Option

-replace=yes

angeben.

Es ist auch m"glich ein Zeitintervall anzugeben, in welchem die Sicherung erfolgt sein mu�, um so auf ,ltere 
Versionen zur_ckzugehen (die z.B. noch frei von Computerviren sind):

dsmc restore -replace=yes -todate=1997-12-22 d:\*.doc


81.4 Oracle and TSM: TDPO:
==========================

RMAN and Tivoli Data Protection for Oracle
The Oracle Recovery Manager provides consistent and secure backup, restore, and recovery performance 
for Oracle databases. While the Oracle RMAN initiates a backup or restore, TDP for Oracle acts as the 
interface to the Tivoli Storage Manager server (Version 4.1.0 or later). The Tivoli Storage Manager server 
then applies administrator-defined storage management policies to the data. TDP for Oracle Version 2.2.1 
implements Oracle defined Media Management API 2.0, which interfaces with RMAN for backup and restore 
operations and translates Oracle commands into Tivoli Storage Manager API calls to the Tivoli Storage Manager server. 

With the use of RMAN, TDP for Oracle allows you to perform the following functions: 

Full backup function for the following while online or offline: 
-databases 
-tablespaces 
-datafiles 
-archive log files 
-control files 
-Full database restore while offline 
-Tablespace and datafile restore while online or offline 

TDPO.OPT File 
This feature provides a centralized place to define all the options needed by RMAN for TDP for Oracle backup 
and restore operations. This eliminates the need to specify environment variables for each session, 
thereby reducing the potential for human error. This also simplifies the establishment of multiple sessions. 

The Data Protection for Oracle options file, tdpo.opt, contains options that determine the behavior and performance 
of Data Protection for Oracle. The only environment variable Data Protection for Oracle Version 5.2 recognizes 
within an RMAN script is the fully qualified path name to the tdpo.opt file. Therefore, some RMAN scripts may need 
to be edited to use TDPO_OPTFILE=fully qualified path and file name of options file variable in place of other 
environment variables. For example:

allocate channel t1 type 'sbt_tape' parms 
       'ENV=(TDPO_OPTFILE=/home/rman/scripts/tdpo.opt)'See Scripts for further information.

If a fully qualified path name is not provided, Data Protection for Oracle uses the tdpo.opt file located 
in the Data Protection for Oracle default installation directory. If this file does not exist, 
Data Protection for Oracle proceeds with default values.


Note 1:
-------

Installing Data Protection for Oracle 5.1
This chapter provides information on the required client environment for Data Protection for Oracle 
and instructions on installing Data Protection for Oracle.

Make sure these conditions exist before installing Data Protection for Oracle:

- Tivoli Storage Manager Server Version 5.1.0 (or later) is configured. 
- Tivoli Storage Manager API Version 5.1.5 (or later) is installed. This version of the 
  Tivoli Storage Manager API is included in the Data Protection for Oracle product media.

Attention: A root user must install the Tivoli Storage Manager API before installing Data Protection 
for Oracle on the workstation where the target database resides.
After Data Protection for Oracle is installed, you must perform the following configuration tasks:

-Define Data Protection for Oracle options in the tdpo.opt file. 
-Register the Data Protection for Oracle node to a Tivoli Storage Manager Server. 
-Define Tivoli Storage Manager options in the dsm.opt and dsm.sys files. 
-Define Tivoli Storage Manager policy requirements. 
-Initialize the password with a Tivoli Storage Manager Server.

See Configuring Data Protection for Oracle for detailed task instructions.

Note 2:
-------

New Password File Generation

Data Protection for Oracle uses a new password utility for 
password generation and maintenance.  The new TDPO configuration
utility, 'tdpoconf' replaces the previous executable 'aobpswd'.
To generate or update a password, invoke 'tdpoconf' as follows:

	tdpoconf password [-tdpo_optfile=/mydir/myfile.opt]

Successful execution of 'tdpoconf password' should generate apassword file
with the prefix 'TDPO.' followed by your nodename.

For more information, please see the 'Using the Utilities' section in the
Data Protection for Oracle User's Guide.


Note 3:
-------

thread

Q:

Any good step-by-step docs out there? I just need to get this thing setup and working quickly. 
Don't have time (unless it is my only choice of course) do filter through several manuals to pick out 
the key info... Help if you can - I surly would appreciate it 

A:

What needs to happen is this -- 


1. Get the manuals downloaded.

2. Download the TDP and perform default installations

3. Create a node name for the TDP - i.e TDP_Oracle _&lt;hostname>, register it using a simple password because 
   you are going to need it.

3. Add a stanza in dsm.sys for the TDP, this should be the second or third stanza since the first stanza is 
   for the client.

4. In the TDP installation directory, modify the tdpo.opt file - this is the configuration file for the TDP. 
   This file is self explanatory

5. Execute tdpconf showenv - you should get a response back from tsm.

6. Execute tdpoconf passwd - to create tdpo password - the password file will be created and stored 
   in the TDP home directory. If will have the host name as part of the file name.

7. Once you have gotten this far, in Oracle's home directory - create the dsm.opt file and make sure it contains 
   only one line, the servername line of the newly created stanza. The file needs to be owned by oracle.

8. If you are using tracing, the tdpo.opt file will identify the location.

9. Configure RMAN

10. Test and verify


Note 4:
-------

thread

Q:

I see this question has been asked several times in the list, but I fail
to see any answers on ADSM.ORG.

I'm getting the 
"ANS0263E Either the dsm.sys file was not found, or the Inclexcl file
specified in the dsm.sys was not found" 
error when trying to set the password after installing the 64 bit TDPO
on Solaris 8.

(The 32 bit version installs fine)

Anyone have the fix for this handy?

A:

Dale,
Did you check the basics of, as oracle, or your tdpo user:

   # env | grep DSM

Make sure the DSMI variables point to the right locations, then verify
those files are readable by your user.

If after verifying this, you might want to let us know what version of
oracle, tdpo and tsmc you have on this node.

A:

We had an issue with this and discovered that it was looking in the api
directory for the dsm.sys and not the ba/bin directory so we just put a link
in api to bin and it worked.

A:

You may want to break the link to prevent TDP from using the INCLEXCL file that's 
normally in a dsm.sys file.  If you don't, you'll generate errors.  If linked, and 
commented out, your normal backups
won't have an INCLEXCL file, hence, you'll backup everything on your client server 
during your regular client backup.

Note 5:
-------

http://www-1.ibm.com/support/docview.wss?rs=0&uid=swg24012732

TSM for Databases v5.3.3, Data Protection for Oracle Downloads by Platform 
Downloadable files 
 
Abstract 
Data Protection for Oracle v5.3.3 refresh.  
  
Download Description 
Data Protection for Oracle v5.3.3 refresh.

These packages contains no license file. The customer must already have a Paid version of the package 
to obtain the license file.

These packages contain fixes for APARs IC48436, IC48248, IC48056, IC46968, IC45462, IC43896, IC41501, IC38717, 
IC38681, IC38430, IC38061, IC37459, IC36686, IC36389 
   
Prerequisites 
A Paid version of the Data Protection for Oracle package is required. 
  
Installation Instructions 
See the README.TDPO in the download directory.  
 
 
Note 6:
-------

IC44171: ON AIX, FILESET TIVOLI.TSM.CLIENT.API.32BIT 5.3.0.0 IS A PRE-REQFOR INSTALLING 
TIVOLI.TSM.CLIENT.API.64BIT 5.3.0.0 

APAR status
Closed as documentation error.

Error description 
When installing tivoli.tsm.client.api.64bit 5.3.0.0 on AIX,
tivoli.tsm.client.api.32bit 5.3.0.0 is required as pre-requsite
for the installation. The installation will fail if
tivoli.tsm.client.api.32bit 5.3.0.0 is not avaiable for install.
tivoli.tsm.client.api.32bit 5.3.0.0 is needed because of
languages enhancement in 5.3.
Local fix 
Problem summary 
****************************************************************
* USERS AFFECTED: AIX CLIENTS                                  *
****************************************************************
* PROBLEM DESCRIPTION: API 32bit PREREQ for API 64bit not  in  *
* README.                                                      *
****************************************************************
* RECOMMENDATION: apply next available fix.                    *
****************************************************************
Problem conclusion 
Add info to README files and database.
 

Files in TSM client and tdpo: 

tivoli.tivguid                                                         Y
tivoli.tsm.books.en_US.client.htm                                      Y
tivoli.tsm.books.en_US.client.pdf                                      Y
tivoli.tsm.client.api.32bit                                            Y
tivoli.tsm.client.api.64bit                                            Y
tivoli.tsm.client.ba.32bit.base                                        Y
tivoli.tsm.client.ba.32bit.common                                      Y
tivoli.tsm.client.ba.32bit.image                                       Y
tivoli.tsm.client.ba.32bit.nas                                         Y
tivoli.tsm.client.ba.32bit.web                                         Y
tivoli.tsm.client.oracle.aix.64bit                                     Y
tivoli.tsm.client.oracle.books.htm                                     Y
tivoli.tsm.client.oracle.books.pdf                                     Y
tivoli.tsm.client.oracle.tools.aix.64bit   
 
 
81.5 Other stuff with TSM client, stopping and starting:
========================================================


The TSM scheduler dsmcad:
-------------------------

-- Check if its running:

ps -ef | grep dsm
root 245896      1   0   Jan 08      -  0:24 /usr/tivoli/tsm/client/ba/bin/dsmcad

-- To start the process:

#! /bin/sh
#       Copyright (c) 1989, Silicon Graphics, Inc.
#ident  "$Revision: 1.1 $"

if $IS_ON verbose ; then        # For a verbose startup and shutdown
    ECHO=echo
else                            # For a quiet startup and shutdown
    ECHO=:
fi

state=$1
case $state in

'start')
        set `who -r`
        if [ $8 != "0" ]
        then
                exit
        fi

        $ECHO "Starting dsm schedule:"
        DSM_LOG="/usr/tivoli/tsm/client/ba"
        DSM_CONFIG="/usr/tivoli/tsm/client/ba/bin/dsm.opt"
        DSM_DIR="/usr/tivoli/tsm/client/ba/bin"

        export DSM_LOG DSM_CONFIG DSM_DIR

        if [ -f /usr/tivoli/tsm/client/ba/bin/dsmcad ]; then
                /usr/tivoli/tsm/client/ba/bin/dsmcad > /dev/null 2>&1 &
                if [ $? -eq 0 ]; then
                        $ECHO " done"
                else
                        $ECHO " failed"
                        exit 2
                fi
        else
                echo " failed, no dsm installed"
                exit 3
        fi
        ;;

'stop')
        $ECHO "Stopping dsm schedule:"
        killall dsmcad
        ;;
esac


It is also possible now to start and stop dsmcad using the script. For example :

- To start dsmcad manually run:

/etc/init.d/dsmcad start

- To stop dsmcad run:

/etc/init.d/dsmcad stop

- To restart dsmcad (for example to refresh daemon after dsm.sys or dsm.opt modification)

/etc/init.d/dsmcad restart

- To check if dsmcad is running run:

/etc/init.d/dsmcad status
-or-
ps -ef | grep dsmcad 

Or use:

(root) /sbin/init.d/tsmclient stop
(root) /sbin/init.d/tsmclient start


It could also be implemented in a proprierty way as in the following example:

root@zd111l08:/etc#./rc.dsm stop
dsmcad en scheduler gestopt

root@zd111l08:/etc#./rc.dsm start

root@zd111l08:/etc#IBM Tivoli Storage Manager
Command Line Backup/Archive Client Interface - Version 5, Release 2, Level 3.0
(c) Copyright by IBM Corporation and other(s) 1990, 2004. All Rights Reserved.

Querying server for next scheduled event.
Node Name: ZD111L08
Session established with server ZTSM05: AIX-RS/6000
  Server Version 5, Release 2, Level 4.5
  Server date/time: 06.08.2007 08:19:13  Last access: 30.07.2007 14:54:54

Next operation scheduled:
------------------------------------------------------------
Schedule Name:         DAG01U_ALG_CCAZ
Action:                Incremental
Objects:
Options:               -su=yes
Server Window Start:   01:00:00 on 07.08.2007
------------------------------------------------------------
Waiting to be contacted by the server.


81.6 TSCM: Tivoli Security Compliance Manager :
----------------------------------------------


-- client.pref:

The client.pref configuration file contains configuration parameters for the Tivoli Security Compliance Manager 
client and is located in the client installation directory.

The default installation directories are:

UNIX 
/opt/IBM/SCM/client 


-- stopping and starting the client:


Stopping the client on UNIX systems
Note:
You must log in as the root user to complete this task.
To stop the client component on a UNIX system:

Open a command shell. 
Go to the directory where the client component is installed. 
The default installation directory is /opt/IBM/SCM/client.

Enter the following command: 
./jacclient stop

To start it, use
./jacclient start


-- Show status of the client:

./jacclient status
HCVIN0033I The Tivoli Security Compliance Manager client is currently running.


-- Start of the client on boot:

On AIX, the client is started from inittab:

#cat /etc/inittab | grep jac
ibmscmcli:2:once:/opt/IBM/SCM/client/jacclient start >/dev/console 2>&1 # Start Security Compliance Manager Client


81.7 ANS1005E TCP/IP read error on socket xx errono=73:
-------------------------------------------------------

ANS1005E TCP/IP read error on socket xx errono=73 
 Technote (FAQ) 
  
Problem 
ANS1005E TCP/IP read error on socket = 6, errno = 73, reason: 'A connection with a remote socket was reset 
by that socket.'.  
  
Cause 
The same ANR1005E message with errno 10054 is well-documented, but very little documentation exists for errno 73  
  
Solution 
ANS1005E TCP/IP read error on socket = 6, errno = 73, reason: 'A connection with a remote socket was reset 
by that socket.'.

The errno 73 seen in the message above indicates that the connection was reset by the peer, usually an indication 
that the session was cancelled or terminated on the TSM Server. In all likelihood these sessions were terminated 
on the server because they were in an idle wait for a period of time exceeding the idletimeout value on 
the TSM Server. We see that the sessions successfully reconnected and no further errors were seen. 
Sessions sitting in an idle wait is not uncommon and is frequently seen when backing up large amounts of data. 
With multi-threaded clients, some sessions are responsible for querying the server to identify which files 
are eligible to be backed up (producer sessions) while the other sessions are responsible for the actual transfer 
of data (consumer sessions). It usually takes longer to backup files across the network than it takes for a list 
of eligible files to be generated. Once the producer sessions have completed building lists of eligible files 
they will sit idle while the producer sessions actually backup these files to the TSM Server. After some time, 
the TSM Server will terminate the producer sessions because they have been idle for a period of time longer 
than the IDLETIMEOUT value specified on the server. 


Many times this issue can be seen in firewall environment and has been seen with network DNS problems and/or network 
config problems. One of the most common is when a passive device (router, switch, hub, etc.) is in between the 
client & the server. If the port on the passive device is set to Auto-Negotiate, it will automatically defer 
to the active device (the NIC in the client) to set the connection speed. If the NIC is also set to Auto-Negotiate 
(default in most OS's) this often causes excessive delays and interruptions in connectivity. This is because the NIC 
is looking to the network appliance to set the connection speed and vice-versa, so it takes some time before 
the network device will find a suitable connection speed (not always optimal, just suitable) and begin data transfer. 
This repeats every time a data packet is sent across the network. While the negotiating period is relatively short 
by human standards (usually in the nanosecond range) it adds up over time when trying to send a large amount 
of data at a high speed and causes the connection to be broken. The best workaround for that is to hard code 
both the NIC and the network port for a specific setting. This is usually 100Mb Full Duplex for a standard 
CAT-5 copper connection, although older equipment may require reconfiguration of 10/100 NICs to allow for that speed.

The other possible workaround for this issue is to estimate the file transfer time and increase the IDLETIMEOUT 
to a level higher than that time.
 

=========
82. LDAP:
=========


82.1: Introduction:
===================

The Lightweight Directory Access Protocol, better known as LDAP, is based on the X.500 standard, but is significantly 
simpler and more readily adapted to meet custom needs. Unlike X.500, LDAP supports TCP/IP, which is necessary 
for Internet access. The core LDAP specifications are all defined in RFCs.

Associated with LDAP, there must be an "information store" somewhere on Server(s), that holds objects
and all their related properties, like useraccounts and all properties belonging to those accounts.

Strictly speaking, though, LDAP isn't a database at all, but a protocol used to access information stored 
in an information directory (also known as an LDAP directory). 
So, the protocol does not make any assumptions on the actual type or sort of database which is involved,
but it does specify how to describe objects, classes, properties and how to retrieve and store
this information.

The socalled "schema" specifies all object classes and properties.

LDAP directory servers store their data hierarchically. If you've seen the top-down representations of 
DNS trees or UNIX file directories, an LDAP directory structure will be familiar ground. As with DNS host names, 
an LDAP directory record's Distinguished Name (DN for short) is read from the individual entry, 
backwards through the tree, up to the top level.
It's just a "way" to represent a LDAP entry (or record). It has a Distinguished Name (DN) that fully
and uniquely describes the object in the "tree", similar to a file in a subdirectory on a filesystem.

-- What's in a name? The DN of an LDAP entry:
 
All entries stored in an LDAP directory have a unique "Distinguished Name," or DN. The DN for each LDAP entry 
is composed of two parts: the Relative Distinguished Name (RDN) and the location within the LDAP directory 
where the record resides. 

Some people like to refer to "container objects", holding other objects, and "leaf objects" that are endpoints
in the tree. Containers are mostly referred to as "Organizational Units" or OU's.
OU's are completely compairable to the domain components (dc's) of a fully qualified Domain Name.

Compared to a filesystem, an OU looks a lot like a directory.


Some attributes of an object are required, while other attributes are optional. An objectclass definition 
sets which attributes are required and which are not for each entry.

An object is represented (or can be found) by listing all ou's or dc's until you have reached the "endpoint":

-- Example 1:

OU=com.Ou=shell.OU=research.harry   or better according to convention:  harry.OU=research.OU=shell.OU=com

There are quite a few implementations that describe objects. For example, in Novell NDS, a user's Distinguished Name
might be like the following example:

-- Example 2:

CN=jdoe.OU=hrs.O=ADMN

or abbreviated to

jdoe.hrs.admn

Note: In Novell NDS, the toplevel OU is called an Organization, or just "O".
As another example: CN=lpIII.OU=development.OU=engineering.O=VerySmallCompany

-- Example 3:

An object could also be described as in the following example.

cn=Oatmeal Deluxe,ou=recipes,dc=foobar,dc=com 

Which means: In com, then in foobar, then in recipes, we can find the object "Oatmeal Deluxe". 


An LDAP Server is capable of propagating its directories to other LDAP servers throughout the world, 
providing global access to information. Currently, however, LDAP is more commonly used within 
individual organizations, like universities, government departments, and private companies. 

LDAP is a client-server system. The server can use a variety of databases to store a directory, 
each optimized for quick and copious read operations. When an LDAP client application connects to an LDAP server 
it can either query a directory or upload information to it. In the event of a query, the server either answers 
the query or, if it can not answer locally, it can refer the query upstream to a higher level LDAP server 
which does have the answer. If the client application is attempting to upload information to an LDAP directory, 
the server verifies that the user has permission to make the change and then adds or updates the information. 


Note:
LDAP processes listen per default on port 389.


82.2 API's dealing with LDAP:
=============================

-- Programming:
-- ------------

Almost all programming languages, have libraries or modules to access a LDAP Directory Server, and
to query, or add, delete or modify objects.

-- Utilities:
-- ----------

Also, on many platforms, commandline utilities exist which can do manipulation of objects.
In this case, in many implementations, LDIF files can be used.

Lightweight Directory Interchange Format (LDIF) is a text-based format used to describe and modify,
add, and delete--directory entries. In the latter capacity, it provides input 
to command-line utilities.

The two LDIF files immediately following represent a directory entry for a printer. 
The string in the first line of each entry is the entry's name, called a distinguished name. 
The difference between the files is that the first describes the entry--that is, the format is an index 
of the information that the entry contains. The second, when used as input to the command-line utility, 
adds information about the speed of the printer.

Description

dn: cn=LaserPrinter1, ou=Devices, dc=acme,dc=com
objectclass: top
objectclass: printer
objectclass: epsonPrinter
cn: LaserPrinter1
resolution: 600
description: In room 407


Modification

dn: cn=LaserPrinter1, ou=Devices, dc=acme, dc=com
changetype: modify
add: pagesPerMinute
pagesPerMinute: 6


As a few examples in programming languages, consider the following examples:


Java example:
-------------

Listing 1 shows a simple JNDI program that will print out the cn attributes of all the Person type objects 
on your console. 


Listing 1. SimpleLDAPClient.java

                        public class SimpleLDAPClient {
    public static void main(String[] args) {
        Hashtable env = new Hashtable();

        env.put(Context.INITIAL_CONTEXT_FACTORY,"com.sun.jndi.ldap.LdapCtxFactory");
        env.put(Context.PROVIDER_URL, "ldap://localhost:10389/ou=system");
        env.put(Context.SECURITY_AUTHENTICATION, "simple");
        env.put(Context.SECURITY_PRINCIPAL, "uid=admin,ou=system");
        env.put(Context.SECURITY_CREDENTIALS, "secret");
        DirContext ctx = null;
        NamingEnumeration results = null;
        try {
            ctx = new InitialDirContext(env);
            SearchControls controls = new SearchControls();
            controls.setSearchScope(SearchControls.SUBTREE_SCOPE);
            results = ctx.search("", "(objectclass=person)", controls);
            while (results.hasMore()) {
                SearchResult searchResult = (SearchResult) results.next();
                Attributes attributes = searchResult.getAttributes();
                Attribute attr = attributes.get("cn");
                String cn = (String) attr.get();
                System.out.println(" Person Common Name = " + cn);
            }
        } catch (NamingException e) {
            throw new RuntimeException(e);
        } finally {
            if (results != null) {
                try {
                    results.close();
                } catch (Exception e) {
                }
            }
            if (ctx != null) {
                try {
                    ctx.close();
                } catch (Exception e) {
                }
            }
        }
    }
}

                     
VB.Net example:
---------------


 'To retrieve list of all  LDAP users 

 'This function returns HashTable
 _ldapServerName = ldapServerName

 Dim sServerName As String = "mail"

 Dim oRoot As DirectoryEntry = New DirectoryEntry("LDAP://" & ldapServerName & _
       "/ou=People,dc=mydomainname,dc=com")
 
 Dim oSearcher As DirectorySearcher = New DirectorySearcher(oRoot)
 Dim oResults As SearchResultCollection
 Dim oResult As SearchResult
 Dim RetArray As New Hashtable()

 Try

  oSearcher.PropertiesToLoad.Add("uid")
  oSearcher.PropertiesToLoad.Add("givenname")
  oSearcher.PropertiesToLoad.Add("cn")
  oResults = oSearcher.FindAll     

  For Each oResult In oResults

   If Not oResult.GetDirectoryEntry().Properties("cn").Value = "" Then
    RetArray.Add( oResult.GetDirectoryEntry().Properties("uid").Value, _
      oResult.GetDirectoryEntry().Properties("cn").Value)
   End If

  Next

 Catch e As Exception

  'MsgBox("Error is " & e.Message)
  Return RetArray

 End Try

 Return RetArray
  
 End Function</PRE>


Some frgaments in C++:
----------------------

Establishing an LDAP Connection:

LDAPConnection lc("localhost");
try {  
      lc.bind("cn=user,dc=example,dc=org","secret");
    } catch (LDAPException e) {  
    std::cerr << "Bind failed: " << e   << std::endl;


Create user in VB:
------------------

' From the book "Active Directory, Third Edition" 
' ISBN: 0-596-10173-2

Dim objParent As New DirectoryEntry("LDAP://ou=sales,dc=mycorp,dc=com", _
                                    "administrator@mycorp.com",_
                                    "MyPassword", _
                                    AuthenticationTypes.Secure)
Dim objChild As DirectoryEntry = objParent.Children.Add("cn=jdoe", "user")
objChild.Properties("sAMAccountName").Add("jdoe?)
objChild.CommitChanges()

objChild.NativeObject.AccountDisabled = False
objChild.CommitChanges()

Console.WriteLine("Added user")


82.3 Implementing LDAP on AIX:
==============================

IBM Directory Server needs to be configured to support user authentication through LDAP with both the 
AIX specific schema and the RFC 2307 schema on AIX.

Also on AIX, we have a client - Server relationship. Any LDAP client can be authenticated by the
LDAP Server. 

When users log in, the LDAP client sends a query to the LDAP server to get the user and group information 
from the centralized database. DB2r is a database used for storing the user and group information. 
The LDAP database stores and retrieves information based on a hierarchical structure of entries, 
each with its own distinguishing name, type, and attributes. The attributes (properties) define 
acceptable values for the entry. An LDAP database can store and maintain entries for many users. 

An LDAP security load module was implemented as from AIX Version 4.3. This load module provides  
user authentication and centralized user and group management functions through the IBM SecureWayr Directory. 
A user defined on an LDAP server can be configured to log in to an LDAP client even if that user 
is not defined locally. The AIX LDAP load module is fully integrated with the AIX operating system


82.3.1 Configuration of IBM Directory Server:
---------------------------------------------

http://www.ibm.com/developerworks/aix/library/au-ldapconfg/index.html?ca=drs-


IBM Directory Server on AIX can be configured with either: 

- The ldapcfg command line tool 
- The graphical version of the ldapcfg tool, called ldapxcfg 
- The mksecldap command 

The following file sets are required to configure IBM Directory Server: 

- "ldap.server" file sets 
- DB2, the back-end database software that is required by the IBM Directory Server 

AIX provides the mksecldap command to set up the IBM Directory servers and clients to exploit the servers.

The mksecldap command performs the following tasks for the new server setup: 

. Creates the ldapdb2 default DB2 instance. 
. Creates the ldapdb2 default DB2 database. 
. Creates the AIX tree DN (suffix) under which AIX user and group is stored. 
. Exports users and groups from security database files of the local host into the LDAP database. 
. Sets LDAP server administrator DN and password. 
. Optionally sets server to use Secure Sockets Layer (SSL) communication. 
. Installs the /usr/ccs/lib/libsecldapaudit, an AIX audit plug-in for the LDAP server. 
. Starts the LDAP server after all the above is done. 
. Adds the LDAP server entry (slapd) to /etc/inittab for automatic restart after reboot. 

Example of how to setup the LDAP Server:

# mksecldap -s -a cn=admin -p passwd -S rfc2307aix


82.3.1 Configuration of an AIX LDAP Client:
-------------------------------------------

The "ldap.client" file set contains the IBM Directory client libraries, header files, and utilities. 
You can use the mksecldap command to configure the AIX client against the IBM Directory Server, 
as in the following example:

# mksecldap -c -h <LDAP Server name> -a cn=admin -p adminpwd -S rfc2307aix

You must have the IBM Directory Server administrator DN and password to configure the AIX client. 
Once the AIX client is configured, the secldapclntd daemon starts running. Once the AIX client is configured 
against the IBM Directory Server, change the SYSTEM attribute in "/etc/security/user" file to LDAP OR compat 
or compat or LDAP to authenticate users against the AIX client system.

The "/usr/lib/security/methods.cfg" file contains the load module definition. The mksecldap command adds 
the following stanza to enable the LDAP load module during the client setup.

XX
LDAP:
	program = /usr/lib/security/LDAP
	program_64 = /usr/lib/security/LDAP64
 

The "/etc/security/ldap/ldap.cfg" file on the client machine has configuration information for the 
secldapclntd client daemon. This configuration file contains information about the IBM Directory Server name, 
binddn, and password information. The file is automatically updated by the mksecldap command during AIX client setup. 

The auth_type attribute in the /etc/security/ldap/ldap.cfg file specifies where the user needs to be authenticated. 
If the auth_type attribute is UNIX_AUTH, then the user is authenticated at the client system. If it is ldap_auth, 
then the user is authenticated on IBM Directory Server. 


82.3.3 LDAP utilities:
----------------------

Using LDAP Tools on Linux, Solaris, AIX, or HP-UX


ldapadd      -  Adds new entries to an LDAP directory.
 
ldapdelete   - Deletes entries from an LDAP directory server. The ldapdelete tool opens a connection 
               to an LDAP server, binds, and deletes one or more entries.
 
ldapmodify   - Opens a connection to an LDAP server, binds, and modifies or adds entries.
 
ldapmodrdn   - Modifies the relative distinguished name (RDN) of entries in an LDAP directory server. 
               Opens a connection to an LDAP server, binds, and modifies the RDN of entries. 
 
ldapsearch   - Searches entries in an LDAP directory server. Opens a connection to an LDAP server, binds, 
               and performs a search using the specified filter. The filter should conform to the string representation 
 

ldapcfg utility:
----------------

Using the ldapcfg utility:

The ldapcfg utility is a command-line tool that you can use to configure IBM Tivoli Directory Server. 
You can use ldapcfg instead of the Configuration Tool for the following tasks:

- Setting the administrator DN and password. See Setting the administrator DN and password for instructions. 
- Configuring a database. See Configuring the database for instructions. 
- Changing the password of the DB2 administrator in the server configuration file.  
- Enabling the change log. See Enabling the change log for instructions. 
- Adding a suffix. 


1. Setting the administrator DN and password

To define the administrator DN and password, type the following at a command prompt: 

# ldapcfg -u "adminDN" -p password

where 

adminDN is the administrator DN you want. 
password is the password for the administrator DN. 

Note:
Double byte character set (DBCS) characters in the password are not supported.
For example:

# ldapcfg -u "cn=root" -p secret

Note:
Do not use single quotation marks (') to define DNs with spaces in them. They are not interpreted correctly.
To accept the default administrator DN of cn=root and define a password, type the following command 
at a command prompt: 

# ldapcfg -p password
where password is the password for the administrator DN.

For example:

# ldapcfg -p secret


2. Configuring the database

When you configure the database, you must always specify a user ID and password on the command line. 
The instance name is, by default, the same as the user ID. The user ID must already exist and must meet 
certain requirements. If you want a different instance name you can specify it using the -t option. 
This name must also be an existing user ID that meets certain requirements. 
See Before you configure: creating the DB2 database owner and database instance owner for information about 
these requirements on both Windows and UNIX platforms.

Attention:
Before configuring the database, be sure that the environment variable DB2COMM is not set. 
Be sure to read this section before you use the ldapcfg command. Some options (such as -f and -s) have changed. 
Unpredictable results will occur if you use them incorrectly or as they were used in previous releases. 
The server must be stopped before you configure the database. 
To configure a database, the following options are available: 

-l location 
Specifies the location of the DB2 database. For UNIX systems, this is a directory name such as /home/ldapdb. 
For Windows systems, this is a drive letter such as C: 
-a id 
Specifies the DB2 administrator ID. 
-c 
Creates a database in UTF-8 format. (The default, if you do not specify this option, is to create a database 
that is in the local code page.) 
-i 
Destroys any instance currently configured with IBM Tivoli Directory Server. All databases associated with the 
instance are also destroyed. 
-w password 
Specifies the DB2 administrator password. 

Note:
The ldapcfg -w password command no longer changes the system password of the database owner. It only updates 
the ibmslapd.conf file. See Changing the DB2 administrator password for information about using the -w option alone.

-d database 
Specifies the DB2 database name. 

-t dbinstance 
Specifies the database instance. If you do not specify an instance, the instance name is the same as the 
DB2 administrator ID. 
-o 
Overwrites the database if one already exists. By default, the database being overwritten is not deleted. 
-r 
Destroys any database currently configured with IBM Tivoli Directory Server. 
-f 
Specifies the full path of a file to redirect output into. If used in conjunction with the -q option, 
only errors will be sent to the file. 
-q 
Runs in quiet mode. All output is suppressed except for errors. 
-n 
Runs in no prompt mode. All output is generated except for messages requiring user interaction. 

To configure a database on /home/ldapdb2 with a DB2 administrator name of db2admin, a password of mypassword, 
and a database name of dbName when there is not an existing database configured (that is, the first time), the command is: 

# ldapcfg -l /home/ldapdb2 -a db2admin -w mypassword -d dbName

To configure a database on /home/ldapdb2 with a DB2 administrator name of db2admin, a password of mypassword, 
a database name of dbName, and an instance name of dbInstance when there is not an existing database configured 
(that is, the first time), the command is: 

# ldapcfg -l /home/ldapdb2 -a db2admin -w mypassword -d dbName -t dbInstance

To configure a database on /home/ldapdb2 when a database is already configured and you want to overwrite it, 
the command is: 

# ldapcfg -l /home/ldapdb2 -a db2admin -w mypassword -d dbName -o


3. Changing the DB2 administrator password

If you change the password for the DB2 administrator through the operating system, you must also change it 
using ldapcfg with the -w option. This changes the password in the server configuration file. Similarly, 
if you change the password for the DB2 administrator with the ldapcfg command, you must also change it through 
the operating system.

To change the DB2 administrator password to newpassword, type the following command:


ldapcfg -w newpassword

Note:
Double byte character set (DBCS) characters in the password are not supported.


userid='sidnsl2'


Notes:
------


Note 1:
-------

http://www-03.ibm.com/systems/p/os/aix/whitepapers/ldap_client.html

Summary of the above paper:

AIX first implemented a LDAP security load module in version 4.32. The implementation worked well in a 
uniform AIX environment. However, users have found it hard to configure AIX systems to work with third party 
LDAP servers. This shortcoming is primarily the result of the proprietary schema used by AIX1.

Since AIX 5LT version 5.2, AIX supports the schema defined in RFC 2307 which is widely used among IBM peers 
and which is becoming the industry standard for network entities. The schema defines attributes and object classes 
for such entities as users, groups, networks, services, hosts, protocols, rpc, etc.3. 
The RFC 2307 schema is often referred to as the nisSchema. Both of these terms are used interchangeably 
in this paper.

Client support for the nisSchema in AIX is part of Configurable Schema Support Mechanism (CSSM), 
which is a bigger effort to support arbitrary schema. With CSSM, AIX systems can be configured to support 
LDAP directory servers using any schema. At present, CSSM is implemented for users and groups only.

Configuring AIX to do naming lookup through LDAP for network entities, including users and groups, 
is also implemented in AIX 5L v5.2. However, this paper deals only with issues related to user authentication and 
user/group management through LDAP. Naming lookup services for other network entities is addressed in a separate paper. 

This paper addresses only client configuration. Section 2 introduces the major components and their 
functionality in an AIX LDAPclient system. Section 3 gives step-by-step instruction on configuring 
an AIX client system. In Section 4, detailed behaviors and new features of the AIX LDAP client, 
including CSSM are presented and discussed. System management in respect of the LDAP load module and 
detailed steps to enable LDAP user authentication are given in Section 5.


Note 2:
-------

http://www.redbooks.ibm.com/abstracts/sg247165.html

Summary of the above Redbook:

This IBM Redbook is a technical planning reference for IT organizations that are adding AIX 5L clients 
to an existing LDAP authentication and user management environment. It presents integration scenarios 
for the AIX 5L LDAP client with IBM Tivoli Directory Server, the Sun ONE Directory Server, 
and Microsoft Active Directory.


Note 3:
-------

thread

Q:

All-
>
> Having a problem installing a DB2 client on a machine running AIX
> version 5.0. Client appeared to install one time succesfully, then
> was uninstalled and a reinstall was attempted. For some reasons, it
> does not complete the reinstall. See the status report from the GUI
> installer at the end of this note. Errors are towards the bottom.
> Everything installed in /usr/opt for DB2 but the sqllib folder that is
> supposed to be created in the home directory of the instance ownder is
> not installed (in our case the instance ownder is db2inst1). Have
> tried installing DB2 with the user db2inst1 already existing and not.
> Same error seems to appear. The key errors from the output below
> appear to be:
>
> ERROR:Could not switch current DB2INSTANCE to "db2inst1". The return
> code is
> "-2029059916".
> ERROR:DBI1122E Instance db2inst1 cannot be updated.[/color]

A:

Most likely, when you uninstalled, you removed the ~db2inst1/sqllib via
rm -rf, rather than via db2idrop. There are crumbs still sticking
around in your system.

Install the product, don't bother with the instance. Run
/usr/opt/db2_08_01/instance/db2ilist (as root). If it shows db2inst1
in the list, this is your problem. The solution is to recreate the
~db2inst1/sqllib directory (just use mkdir), then try db2idrop. Once
the instance is properly dropped, you can use db2isetup (also in the
..../instance directory) to recreate the instance.

Hope this helps,

A:

Works!! Thanks for your help!


Note 4:
-------

Technote:

http://www-1.ibm.com/support/docview.wss?rs=71&context=SSEPGG&q1=loopback+extshm&uid=swg21009742&loc=en_US&cs=utf-8&lang=en

DB2 issues SQL1224N and WebSphere Application Server (WAS) admin server fails with StaleConnectionException 
when attempting more than 10 local concurrent DB2 connections from a single process.

Problem 
On AIX 4.3.3 or later, DB2 will issue SQL1224N and WebSphere administration server will fail with 
StaleConnectionException when attempting more than 10 local concurrent DB2 connections from a single process. 
JDK 1.1.8 allows a maximum number of 10 local concurrent DB2 connections. JDK 1.2.2 allows a maximum of 4 
local connections. JDK 1.3.0 allows a maximum of 2 local connections.
 
Solution 
Symptoms 
DB2 errors: 

In db2diag.log, it has DIA9999E "An internal error occurred" with an error return code of 18 and sqlcode -1224 
appear when running DB2 with a WebSphere application:
2000-10-26-14.46.36.060751 Instance:db2ninst Node:000
PID:35928(java) Appid:
oper_system_services sqlocshr Probe:200 
DIA9999E An internal error occurred. Report the following error code : " 18".

Data Title:SQLCA PID:35928 Node:000
sqlcaid : SQLCA sqlcabc: 136 sqlcode: -1224 sqlerrml: 0
sqlerrmc:
sqlerrp : sqlearcn
sqlerrd : (1) 0x00000000 (2) 0x00000000 (3) 0x00000000
(4) 0x00000000 (5) 0x000000FF (6) 0x00000000
sqlwarn : (1) (2) (3) (4) (5) (6)
(7) (8) (9) (10) (11)
sqlstate:


The javacore.txt log file shows that an exception is thrown due to SQL1224N when the application attempts 
to connect to the database: 

COM.ibm.db2.jdbc.DB2Exception: [IBM][CLI Driver] SQL1224N A database agent could not be started to service a request, 
or was terminated as a result of a database system shutdown or a force command. SQLSTATE=55032

at java.lang.Throwable.<init>(Throwable.java:96)
at java.lang.Exception.<init>(Exception.java:44)
at java.sql.SQLException.<init>(SQLException.java:45)
at COM.ibm.db2.jdbc.DB2Exception.<init>(DB2Exception.java:93)
at COM.ibm.db2.jdbc.app.SQLExceptionGenerator.throw_SQLException(SQLExceptionGenerator.java:164)
at COM.ibm.db2.jdbc.app.SQLExceptionGenerator.check_return_code(SQLExceptionGenerator.java:402)
at COM.ibm.db2.jdbc.app.DB2Connection.connect(DB2Connection.java(Compiled Code))
at COM.ibm.db2.jdbc.app.DB2Connection.<init>(DB2Connection.java(Compiled Code))
at COM.ibm.db2.jdbc.app.DB2Driver.connect(DB2Driver.java(Compiled Code))
at java.sql.DriverManager.getConnection(DriverManager.java(Compiled Code))
at java.sql.DriverManager.getConnection(DriverManager.java:183)
at newtest.connectDM(newtest.java:35)
at newtest.run(newtest.java:109)
at java.lang.Thread.run(Thread.java:498)
 

Possible cause 

The error return code 18 indicates that there are too many files open and therefore, no available 
segment registers. The Websphere application has reached AIX's limit of 10 shared memory segments per process, 
and so DIA9999E is generated. 

SQL1224N and StaleConnectionException result as a result of DB2 not being able to obtain a new shared memory segment.

Action
DB2 UDB Version 7.2 (DB2 UDB Version 7.1 FixPak 3) or later
The support of EXTSHM has been added to V7.2 (V7.1 Fixpak 3). By default, AIX does not permit 32-bit applications 
to attach to more than 11 shared memory segments per process, of which a maximum of 10 can be used for 
local DB2 connections. To use EXTSHM with DB2, do the following:

In DB2 client sessions:
export EXTSHM=ON
When starting the DB2 UDB Server:
export EXTSHM=ON
db2set DB2ENVLIST=EXTSHM
db2start
On DB2 UDB EEE, also add the following lines to sqllib/db2profile:
EXTSHM=ON
export EXTSHM

The above information has been documented in the DB2 UDB Release Notes for Version 7.2 / Version V7.1 FixPak 3, page 366. 
You can get it from: ftp://ftp.software.ibm.com/ps/products/db2/info/vr7/pdf/letter/db2ire71.pdf


Note 5:
-------

http://publib.boulder.ibm.com/infocenter/wpdoc/v510/index.jsp?topic=/com.ibm.wp.ent.doc_5.1/wps/tbl_adm.html

When modifying user information via WebSphere Portal, if you receive the error Backend storage system failed. 
Please try again later. or the user attributes are not updated in LDAP, it might mean that the default 
tuning parameters for use with DB2 and IBM Tivoli Directory Server need to be adjusted.

Solution: The default DB2 parameters are:

APP_CTL_HEAP_SZ 128
APPLHEAP_SZ 128

The parameters above are too small for IBM Tivoli Directory Server and WebSphere Portal on AIX with 2000 user entries.

The HEAP size of UDB is required when updating or inserting data. WebSphere Portal spawns heavy transactions 
to the LDAP server in any phase, especially changing user attributes, which spawns several updates and inserts. 
To prevent this problem, the following WebSphere Portal tuning is required:

su -ldapdb2
db2 -c update db cfg for ldap using APP_CTL_HEAP_SZ 1024
db2 -c update db cfg for ldap using APPLHEAP_SZ 1024  


82.4 Implementing LDAP on HP-UX:
================================


82.5 Implementing LDAP on RedHat:
=================================


82.5.1 OpenLDAP Daemons and Utilities:
--------------------------------------

The suite of OpenLDAP libraries and tools is spread out over the following packages: 


openldap         - Contains the libraries necessary to run the openldap server and client applications. 

openldap-clients - Contains command-line tools for viewing and modifying directories on an LDAP server. 

openldap-server  - Contains the servers and other utilities necessary to configure and run an LDAP server. 


There are two servers contained in the openldap-servers package: the Standalone LDAP Daemon (/usr/sbin/slapd) 
and the Standalone LDAP Update Replication Daemon (/usr/sbin/slurpd). 

The slapd daemon is the standalone LDAP server while the slurpd daemon is used to synchronize changes from 
one LDAP server to other LDAP servers on the network. The slurpd daemon is only necessary when dealing 
with multiple LDAP servers. 

To perform administrative tasks, the openldap-server package installs the following utilities into the 
/usr/sbin/ directory: 


slapadd    - Adds entries from an LDIF file to an LDAP directory. For example, 
            /usr/sbin/slapadd -l ldif-input will read in the LDIF file, ldif-input, containing the new entries. 

slapcat    - Pulls entries out of an LDAP directory in the default format - Berkeley DB - and saves them 
             in an LDIF file. For example, the command /usr/sbin/slapcat -l ldif-output will output an LDIF file 
             called ldif-output containing the entries from the LDAP directory. 

slapindex  - Re-indexes the slapd directory based on the current content. 

slappasswd - Generates an encrypted user password value for use with ldapmodify or the rootpw value in the 
             slapd configuration file, /etc/openldap/slapd.conf. Execute /usr/sbin/slappasswd to create the password. 


 Warning 
  Be sure to stop slapd by issuing "/usr/sbin/service slapd stop" before using slapadd, slapcat or slapindex. 
  Otherwise, the consistency of the LDAP directory is at risk. 
 

The openldap-clients package installs tools used to add, modify, and delete entries in an LDAP directory 
into /usr/bin/. These tools include the following: 


ldapmodify  - Modifies entries in an LDAP directory, accepting input via a file or standard input. 

ldapadd     - Adds entries to your directory by accepting input via a file or standard input; 
              ldapadd is actually a hard link to ldapmodify -a. 

ldapsearch  - Searches for entries in the LDAP directory using a shell prompt. 

ldapdelete  - Deletes entries from an LDAP directory by accepting input via user input at the terminal or via a file. 


With the exception of ldapsearch, each of these utilities is more easily used by referencing a file containing 
the changes to be made rather than typing a command for each entry you wish to change in an LDAP directory. 
The format of such a file is outlined in each application's man page. 

NSS, PAM, and LDAP
In addition to the OpenLDAP packages, Red Hat Linux includes a package called nss_ldap which enhances LDAP's ability 
to integrate into both Linux and other UNIX environments. 

The nss_ldap package provides the following modules: 

/lib/libnss_ldap-<glibc-version>.so

/lib/security/pam_ldap.so

The libnss_ldap-<glibc-version>.so module allows applications to look up user, group, hosts, and other 
information using an LDAP directory via glibc's Nameservice Switch (NSS) interface. NSS allows applications 
to authenticate using LDAP in conjunction with Network Information Service (NIS) name service and 
flat authentication files. 

The pam_ldap module allows PAM-aware applications to authenticate users using information stored in an 
LDAP directory. PAM-aware applications include console login, POP and IMAP mail servers, and Samba. By deploying 
an LDAP server on your network, all of these login situations can authenticate against one user ID and 
password combination, greatly simplifying administration. 

PHP4, the Apache HTTP Server, and LDAP
Red Hat Linux includes a package containing LDAP module for the PHP server-side scripting language. 

The php-ldap package adds LDAP support to the PHP4 HTML-embedded scripting language via the 
/usr/lib/php4/ldap.so module. This module allows PHP4 scripts to access information stored in an LDAP directory. 
 

82.5.2 OpenLDAP Configuration Files:
------------------------------------

OpenLDAP configuration files are installed into the /etc/openldap/ directory. The following is a brief 
list highlighting the most important directories and files: 


/etc/openldap/schema/ directory - This subdirectory contains the schema used by the slapd daemon. 
                                  
/etc/openldap/ldap.conf         - This is the configuration file for all client applications which use 
                                  the OpenLDAP libraries. These include, but are not limited to, Sendmail, Pine, 
                                  Balsa, Evolution, and Gnome Meeting. 

/etc/openldap/slapd.conf        - This is the configuration file for the slapd daemon.  


 Note 
  If the nss_ldap package is installed, it will create a file named /etc/ldap.conf. This file is used by the 
  PAM and NSS modules supplied by the nss_ldap package. See the Section called Configuring Your System to 
  Authenticate Using OpenLDAP for more information about this configuration file. 
 

-- slapd.conf
In order to use the slapd LDAP server, you will need to modify its configuration file, 
/etc/openldap/slapd.conf. You must to edit this file to make it specific to your domain and server. 

The suffix line names the domain for which the LDAP server will provide information. The suffix line should be 
changed from: 

suffix          "dc=your-domain,dc=com"
 
so that it reflects your domain name. For example: 

suffix          "dc=example,dc=com"
 

The rootdn entry is the Distinguished Name (DN) for a user who is unrestricted by access controls or 
administrative limit parameters set for operations on the LDAP directory. The rootdn user can be thought of as 
the root user for the LDAP directory. In the configuration file, change the rootdn line from its default value 
to something like the example below: 

rootdn          "cn=root,dc=example,dc=com"
 
Change the rootpw line to something like the example below: 

rootpw          {SSHA}vv2y+i6V6esazrIv70xSSnNAJE18bb2u
 
In the rootpw example, you are using an encrypted root password, which is a much better idea than leaving a 
plain text root password in the slapd.conf file. To make this encrypted string, type the following command: 

# slappasswd
 
You will be prompted to type and then re-type a password. The program prints the resulting encrypted password 
to the terminal. 

 Warning 
  LDAP passwords, including the rootpw directive specified in /etc/openldap/slapd.conf, are sent over the network 
  in plain text unless you enable TLS encryption. 
 
For added security, the rootpw directive should only be used if the initial configuration and population 
of the LDAP directory occurs over a network. After the task is completed, it is best to comment out the rootpw 
directive by preceding it with a pound sign (#). 


 Tip 
  If you are using the slapadd command-line tool locally to populate the LDAP directory, using the rootpw directive 
  is not necessary. 
 

The /etc/openldap/schema/ Directory

The /etc/openldap/schema/ directory holds LDAP definitions, previously located in the slapd.at.conf and 
slapd.oc.conf files. All attribute syntax definitions and objectclass definitions are now located 
in the different schema files. The various schema files are referenced in /etc/openldap/slapd.conf 
using include lines, as shown in this example: 

include		/etc/openldap/schema/core.schema
include		/etc/openldap/schema/cosine.schema
include		/etc/openldap/schema/inetorgperson.schema
include		/etc/openldap/schema/nis.schema
include		/etc/openldap/schema/rfc822-MailMember.schema
include		/etc/openldap/schema/autofs.schema
include		/etc/openldap/schema/kerberosobject.schema
 
 Caution 
  You should not modify any of the schema items defined in the schema files installed by OpenLDAP. 
 
You can extend the schema used by OpenLDAP to support additional attribute types and object classes using 
the default schema files as a guide. To do this, create a local.schema file in the /etc/openldap/schema directory. 
Reference this new schema within slapd.conf by adding the following line below your default include schema lines: 

include          /etc/openldap/schema/local.schema
 
Next, go about defining your new attribute types and object classes within the local.schema file. 
Many organizations use existing attribute types and object classes from the schema files installed by default 
and modify them for use in the local.schema file. This can help you to learn the schema syntax while meeting 
the immediate needs of your organization. 

Extending schema to match certain specialized requirements is quite involved and beyond the scope of this chapter. 
Visit http://www.openldap.org/doc/admin/schema.html for information on writing new schema files. 


82.5.3 Setting up OpenLDAP on RedHat:
-------------------------------------

The basic steps for creating an LDAP server are as follows: 

1. Install the openldap, openldap-servers, and openldap-clients RPMs. 

2. Edit the /etc/openldap/slapd.conf file to reference your LDAP domain and server. 
   Refer to the Section called slapd.conf for more information on how to edit this file. 

3. Start slapd with the command:

/sbin/service/ldap start
 

After you have configured LDAP correctly, you can use chkconfig, ntsysv, or Services Configuration Tool 
to configure LDAP to start at boot time. For more information about configuring services, 
see to the chapter titled Controlling Access to Services in the Official Red Hat Linux Customization Guide. 

4. Add entries to your LDAP directory with ldapadd. 

5. Use ldapsearch to see if slapd is accessing the information correctly. 

6. At this point, your LDAP directory should be functioning properly and you can configure any LDAP-enabled 
   applications to use the LDAP directory. 


=========================
83. Introduction SAMBA:
=========================


83.1 Introduction:
==================


-- File and Print services:

Traditionally, unix machines have their own "usual" protocols and utilities on top of tcp/ip 
with regards to file and print services, like scp, ftp, http, rcp, lp, ipc mechanisms etc..

File and print services on Windows, traditionally uses "Server Message Blocks", otherwise known
as the SMB protocol. 

The SMB protocol can be installed on unix as well, making it "look" like a Windows Server
as far as Windows clients are concerned, who want to use a Server for file and print services. 
For this to make a reality, you can instal "Samba" on your unix machine.

-- Authentication:

Machines from both the Windows an unix worlds, have means to "authenticate" a user locally,
or let the user be authenticated by a remote entity.

For example, on a unix machine, a user "can logon locally", using the local password file (in reality,
this could be more complex), or be authenticated "remotely" by "NIS" (Network Information System),
or be authenticated by a ldap Server etc..

Also, on a Windows machine, a user might logon locally, or be authenticated remotely by a 
PDC or BDC (Domain Controllers in a NT4 network), or be authenticated by Active Directory (Win2K, Win2K3).

With samba, you can integrate a unix machine in the Windows-type of authentication, that is, let a unix
machine function as a Windows Domain Controller (NT4), or integrate it in Active Directory (2000, 2003).

In the next sections, we take a look on how samba can be used on HP-UX, Solaris, RedHat, and AIX.


=========================
84. AIX and SNA:
=========================


Note 1:
-------

SNA defines a set of rules that systems use to communicate. These rules define the layout of the data 
that flows between the systems and the action the systems take when they receive the data. 
SNA does not specify how a system implements the rules. A fundamental objective of SNA is to allow 
systems that have very different internal hardware and software designs to communicate. 
The only requirement is that the externals meet the rules of the architecture. 

Logical Unit (LU) is an SNA term used to describe a logical collection of services that can be accessed 
from a network. In this environment, you can think of a CICS region as an LU. SNA defines many different 
types of LUs, including devices like terminals and printers. The type of LU that is used for 
CICS intersystem communication is LU type 6.2. 

Each LU is identified by a name of up to eight characters, referred to as the LU name. An IBM mainframe-based 
CICS system uses the APPLID defined in the CICS system initialization table as its LU name 
(also referred to as a NETNAME). The LU name for a CICS OS/2 system is specified in the 
Communications Manager/2 Local LU definition, and the LU name for a CICS/400 system is defined 
in the APPL parameter of the ADDCICSSIT command. 

An SNA network also has a name of up to eight characters, called the network name. The network name 
is sometimes referred to as the network ID or the netid. An LU can be uniquely identified by combining 
its LU name with the network name of the network that owns it. The LU's name is then referred to as the 
network-qualified LU name or the fully-qualified LU name. For example, if an LU named CICSA belongs to 
a network named NETWORK1, its network-qualified LU name is NETWORK1.CICSA. 

For an LU to communicate with another LU, it must establish at least one session between them. 
The request to activate a session is referred to as a BIND request. It is used to pass details 
of the capabilities of the initiating LU to the receiving system, and also to determine a route 
through the network. The receiving LU then sends a description of its capabilities to the 
initiating LU in the BIND response. Once the session is established, it can be used for a number 
of intersystem requests and remains active for as long as the two LUs and the network between them are available. 

When you configure your network, you can set up different characteristics for the sessions established 
between a pair of LUs, such as in the route they take through the network. Session characteristics 
are referred to as modegroups. All the sessions associated with a modegroup have the same characteristics. 
A modegroup is identified by a modename of up to eight characters. 

When defining a CICS region, you must also identify the SNA synchronization level required. 
CICS supports all three synchronization levels defined by SNA: 


Synchronization level 0 (NONE)-- SNA provides no synchronization support. The application must code its own. 
Synchronization level 1 (CONFIRM)-- SNA provides the ability to send simple acknowledgment requests. 
Synchronization level 2 (SYNCPOINT)-- SNA provides the ability for two or more systems to treat the updates 
made by an application on these systems as one logical unit of work (LUW). 

There are many ways to connect CICS systems in a network. If the data is successfully transferred in the 
correct format, these CICS systems are unaware of the network makeup. SNA configuration is performed at two levels: 

-The logical level, described in the preceding paragraphs, incorporates the characteristics of the systems 
that wish to communicate. 
-The physical level incorporates the linking of actual machines, or nodes, in the network. Each node has 
physical links, or connections, to other nodes so that every node is connected to at least one other node. 
Data must sometimes travel along a number of links to get from one system to another. Also, these links 
can be of different types. For example, IBM Token Ring, Synchronous Data Link Control (SDLC), Ethernet, 
and X.25 are all physical links. These types of links are collectively referred to as 
data link control (DLC) protocols. 


Each node has a Physical Unit (PU). This is a combination of hardware and software that controls the links 
to other nodes. Several PU types with different capabilities and responsibilities exist, such as: 

-PU type 5--The best-known example is an IBM mainframe processor running VTAM. VTAM provides the support 
 for the Systems Services Control Point (SSCP) function defined in SNA. 
-PU type 4--This is a communications controller, such as an Advanced Communications Function for the 
 Network Control Program (ACF/NCP), that resides in the center of a network, routing and controlling the 
 data flow between machines. 
-PU type 2--This is a small machine, such as an advanced program-to-program communications (APPC) workstation. 
 It can communicate directly only with a PU type 4 or PU type 5 and relies on these PUs to route the data to the 
 correct system. 
-PU type 2.1--This is a more advanced PU type 2 that can also communicate with other PU type 2.1 nodes directly. 
 This node can support an independent LU. An independent LU can establish a session with another LU 
 without using VTAM. Communications Server for AIX is a PU type 2.1 node. 


PU type 2.1 nodes may have support for Advanced Peer-to-Peer Networking (APPN). This support enables a node 
to search for an LU in the network, rather than requiring a remote LU's location to be preconfigured locally. 
There are two types of APPN nodes: end nodes and network nodes. An end node can receive a search request 
for an LU and respond, indicating whether the LU is local to the node or not. A network node can issue search 
requests, as well as respond to them, and maintains a dynamic database that contains the results of 
the search requests. Support for APPN can greatly reduce the maintenance work in an SNA network, especially 
if the network is large or dynamic. Communications Server for AIX supports APPN.


Note 2:
-------

IBMr Communications Server for AIXr provides an essential foundation for enterprise networking

It helps provide a security-rich, scalable, and high-performance communications solution for the AIX operating system.

-Reaps the benefits of IBM's years of experience with SNA, TCP/IP, and network computing
-Enables customers and Business Partners to choose applications based on their business needs, 
 not their network infrastructure
-Provides an excellent offering for multi-protocol networking environments with Enterprise Extender, 
 enhanced TN3270E Server, Telnet Redirector, and Remote API client/server support
-Offers use of comprehensive Secure Sockets Layer (SSL) data encryption, and SSL client and server 
 authentication with the TN3270E Server, the Telnet Redirector and the Remote API Client/Server using 
 HTTPS connections for access to SNA networks
-Offers the ideal choice for customers who need more secure, robust Telnet and Remote API networking environments
-Includes full implementation of APPN (network node and end node), HPR, and DLUR, along with integrated 
 gateway capabilities, positioning itself as a participant in a host (hierarchical) or peer-to-peer distributed 
 network environment
-Operating systems supported: AIX


IBM Communications Server exist for:


Note 3:
-------

Introduction to SNA 
Summary: In the early 1970s, IBM discovered that large customers were reluctant to trust unreliable 
communications networks to properly automate important transactions. In response, IBM developed 
Systems Network Architecture (SNA). "Anything that can go wrong will go wrong," and SNA may be unique 
in trying to identify literally everything that could possibly go wrong in order to specify the proper response. 
Certain types of expected errors (such as a phone line or modem failure) are handled automatically. 
Other errors (software problems, configuration tables, etc.) are isolated, logged, and reported 
to the central technical staff for analysis and response. This SNA design worked well as long as communications 
equipment was formally installed by a professional staff. It became less useful in environments when any PC 
simply plugs in and joins the LAN. Two forms of SNA developed: Subareas (SNA Classic) managed by mainframes, 
and APPN (New SNA) based on networks of minicomputers. 

In the original design of SNA, a network is built out of expensive, dedicated switching minicomputers 
managed by a central mainframe. The dedicated minicomputers run a special system called NCP. No user programs 
run on these machines. Each NCP manages communications on behalf of all the terminals, workstations, and PCs 
connected to it. In a banking network, the NCP might manage all the terminals and machines in branch offices 
in a particular metropolitan area. Traffic is routed between the NCP machines and eventually into the central mainframe. 

The mainframe runs an IBM product called VTAM, which controls the network. Although individual messages 
will flow from one NCP to another over a phone line, VTAM maintains a table of all the machines and 
phone links in the network. It selects the routes and the alternate paths that messages can take between 
different NCP nodes. 

A subarea is the collection of terminals, workstations, and phone lines managed by an NCP. Generally, 
the NCP is responsible for managing ordinary traffic flow within the subarea, and VTAM manages the connections 
and links between subareas. Any subarea network must have a mainframe. 

The rapid growth in minicomputers, workstations, and personal computers forced IBM to develop a second kind of SNA. 
Customers were building networks using AS/400 minicomputers that had no mainframe or VTAM to provide control. 
The new SNA is called APPN (Advanced Peer to Peer Networking). APPN and subarea SNA have entirely different 
strategies for routing and network management. Their only common characteristic is support for applications 
or devices using the APPC (LU 6.2) protocol. Although IBM continues the fiction that SNA is one architecture, 
a more accurate picture holds that it is two compatible architectures that can exchange data. 

It is difficult to understand something unless you have an alternative with which to compare it. Anyone reading 
this document has found it from the PC Lube and Tune server on the Internet. This suggests the obvious 
comparison: SNA is not TCP/IP. This applies at every level in the design of the two network architectures. 
Whenever the IBM designers went right, the TCP/IP designers went left. As a result, instead of the two 
network protocols being incompatible, they turn out to be complimentary. An organization running both 
SNA and TCP/IP can probably solve any type of communications problem. 

An IP network routes individual packets of data. The network delivers each packed based on an address number 
that identifies the destination machine. The network has no view of a "session". When PC Lube and Tune sends 
this document through the network to your computer, different pieces can end up routed through different cities. 
TCP is responsible for reassembling the pieces after they have been received. 

In the SNA network, a client and server cannot exchange messages unless they first establish a session. 
In a Subarea network, the VTAM program on the mainframe gets involved in creating every session. 
Furthermore, there are control blocks describing the session in the NCP to which the client talks 
and the NCP to which the server talks. Intermediate NCPs have no control blocks for the session. 
In APPN SNA, there are control blocks for the session in all of the intermediate nodes through which 
the message passes. 

Every design has advantages and limitations. The IP design (without fixed sessions) works well in experimental 
networks built out of spare parts and lab computers. It also works well for its sponsor (the Department of Defense) 
when network components are being blown up by enemy fire. In exchange, errors in the IP network often go unreported 
and uncorrected, because the intermediate equipment reroutes subsequent messages through a different path. 
The SNA design works well to build reliable commercial networks out of dedicated, centrally managed devices. 
SNA, however, requires a technically trained central staff ready and able to respond to problems as they are 
reported by the network equipment. 

The mainframe-managed subarea network was originally designed so that every terminal, printer, or application 
program was configured by name on the mainframe before it could use the network. This worked when 3270 terminals 
were installed by professional staff and were cabled back to centrally managed control units. Today, when 
ordinary users buy a PC and connect through a LAN, this central configuration has become unwieldy. 
One solution is to create a "pool" of dummy device names managed by a gateway computer. PC's then power up 
and borrow an unused name from the pool. Recent releases allow VTAM to define a "prototype" PC and 
dynamically add new names to the configuration when devices matching the prototype appear on the network. 

A more formal solution, however, is provided by the APPN architecture designed originally for minicomputers. 
APPN has two kinds of nodes. An End Node (EN) contains client and server programs. Data flows in or out of 
an End Node, but does not go through it. A Network Node (NN) also contains clients and servers, 
but it also provides routing and network management. When an End Node starts up, it connects to one 
Network Node that will provide its access to the rest of the network. It transmits to that NN a list 
of the LUNAMEs that the End Node contains. The NN ends up with a table of its own LUNAMEs and those of all 
the EN's that it manages. 

When an EN client wants to connect to a server somewhere in the network, its sends a BIND message with 
the LUNAME of the server to the NN. The NN checks its own table, and if the name is not matched broadcasts 
a query that ultimately passes through every NN in the network. When some NN recognizes the LUNAME, 
it sends back a response that establishes both a session and a route through the NN's between the client 
and the server program. 

Most of APPN is the set of queries and replies that manage names, routes, and sessions. Like the rest of SNA, 
it is a fairly complicated and exhaustively documented body of code. 

Obviously workstations cannot maintain a dynamic table that spans massive networks or long distances. 
The solution to this problem is to break the APPN network into smaller local units each with a Network ID (NETID). 
In common use, a NETID identifies a cluster of workstations that are close to each other 
(in a building, on a campus, or in the same city). The dynamic exchange of LUNAMEs does not occur between 
clusters with different NETIDs. Instead, traffic to a remote network is routed based on the NETID, 
and traffic within the local cluster is routed based on the LUNAME. The combination of NETID and LUNAME 
uniquely identifies any server in the system, but the same LUNAME may appear in different NETID groups 
associated with different local machines. After all, one has little difficulty confusing "CHICAGO.PRINTER" 
from "NEWYORK.PRINTER" even though the LUNAME "PRINTER" is found in each city. 

TCP/IP is a rather simple protocol. The source code for programs is widely available. SNA is astonishing complex, 
and only IBM has the complete set of programs. It is built into the AS/400. Other important workstation products include: 

NS/DOS for DOS and Windows 
Communications Manager for OS/2 
SNA Services for AIX 
SNA Server for Windows NT [from Microsoft] 

The native programming interface for modern SNA networks is the Common Programming Interface for Communications 
(CPIC). This provides a common set of subroutines, services, and return codes for programs written in COBOL, 
C, or REXX. It is documented in the IBM paper publication SC26-4399, but it is also widely available in 
softcopy on CD-ROM. 

Under the IBM Communications Blueprint, SNA becomes one of several interchangeable "transport" options. 
It is a peer of TCP/IP. The Blueprint is being rolled out in products carrying the "Anynet" title. 
This allows CPIC programs to run over TCP/IP, or programs written to use the Unix "socket" interface can run 
over SNA networks. Choice of network then depends more on management characteristics. 

The traditional SNA network has been installed and managed by a central technical staff in a large corporation. 
If the network goes down, a company like Aetna Insurance is temporarily out of business. TCP/IP is designed to be 
casual about errors and to simply discard undeliverable messages.


Note 3:
-------

Using IBM Communications Server for AIX with CICS

--------------------------------------------------------------------------------

Starting AIX SNA
To start Communications Server for AIX, enter smitty sna and select these options: 

  -> Manage SNA Resources
      -> Start SNA Resources
          -> Start Node              

This command starts SNA, the node, and the main SNA process. It also starts the links that listen 
for other machines calling to activate links if the activation parameter on the configuration of the DLC, 
port, and link station is set to start the links at startup time. 

If you have defined a link that calls another machine, you can start this link by using the following command: 

  -> Manage SNA Resources
      -> Start SNA Resources
          -> Start Link Station              

You can start a session by using the following command: 

  -> Manage SNA Resources
      -> Start SNA Resources
          -> Start an SNA Session              

To start a session, you must supply either a local LU name or a local LU alias and either a partner LU alias 
or a fully-qualified partner LU name. You must also supply a modename. In the example below, 
OPENCICS is the LU alias and CICSESA is the partner LU alias. CICSISC0 is a modegroup 
that is valid for the connection. 


Figure 53. Starting an SNA Session


+--------------------------------------------------------------------------------+
|                                                Start an SNA Session            |
|                                                                                |
|Type or select values in entry fields.                                          |
|Press Enter AFTER making all desired changes.                                   |
|                                                                                |
|                                                        [Entry Fields]          |
|  Enter one of:                                                                 |
|        Local LU alias                               [OPENCICS]               + |
|        Local LU name                                []                       + |
|                                                                                |
|  Enter one of:                                                                 |
|        Partner LU alias                             [CICSESA]                + |
|        Fully-qualified Partner LU name              []                       + |
|                                                                                |
|* Mode name                                          [CICSISC0]               + |
|  Session polarity                                    POL_EITHER              + |
|  CNOS permitted?                                     YES                     + |
|                                                                                |
|                                                                                |
|F1=Help             F2=Refresh          F3=Cancel           F4=List             |
|F5=Reset            F6=Command          F7=Edit             F8=Image            |
|F9=Shell            F10=Exit            Enter=Do                                |
+--------------------------------------------------------------------------------+
If the command returns an error indicating that no sessions can be activated between LUs, one of the 
following problems exists: 

-The link station that is used by the connection is not active. 
-The maximum number of sessions has been started already. 
-The specified modename, although defined locally, is not known on the remote system. 
-The specified local or remote system name is not known on the remote machine. 
-The remote system is not accepting connection requests (for example, if it is a mainframe-based CICS system, 
 the connection possibly is not installed and in service). 
-Check that the configuration matches the values in the remote system. 


Note 4:
-------

Versions of SNA Services for AIX and Communications Server 
 Technote (troubleshooting) 
  
Problem(Abstract) 
Versions of IBM's SNA Services for AIX and Communications Server  
  
 
Resolving the problem 
The following table provides information about the different versions of Communications Server for AIX and 
the levels of AIX on which they will run.
The VRMF (Version.Release.Modification.Fixlevel) values and the external name of different AIX SNA levels 
have changed over time. 
You can check the VRMF level by issuing    : lslpp -h 'sna*' 
You can check the product number by issuing: lslpp -i 'sna*' 

The listed AIX levels are the minimum levels required for CS/AIX to function.

The only currently supported version is 6.3 on AIX 5.2 and higher.


External Name V.R.M.F. AIX Levels Product # 
Communications Server for AIX, V6.3 6.3.1.0 5.2 ML5,
5.3 ML2, 6.1 5765-E51 
6.3.0.x 5.2 ML5,
5.3 ML2 
Communications Server for AIX, V6.1 (EOS 09/30/2006; 12/31/2003 on AIX 4) 6.1.0.5 4.3.3, 5.1, 5.2, 5.3 
6.1.0.1  4.3.3, 5.1, 5.2 
6.1.0.0  4.3.3, 5.1  
Communications Server for AIX, V6 (EOS 06/30/2002) 6.0.x.x  4.1.5, 4.2.1, 4.3.2  
Communications Server for AIX, V5 (EOS 06/30/2001) 5.0.x.x  4.1.5, 4.2.1, 4.3  5765-D20  
Communications Server for AIX, V4.2 (EOS 11/30/1998) 3.1.2.x  4.1, 4.2, 4.3  5765-652  
Communications Server for AIX, V4.1 (EOS 11/30/1998) 3.1.1.x  4.1, 4.2  
Communications Server for AIX, V3 (EOS 03/31/1997) 3.1.0.x  4.1  5765-582 
AIX SNA Server/6000 V2.2 (EOS 04/26/1996) 2.2.x.x  4.1  5765-247  
AIX SNA Server/6000 V2.1 (EOS 12/31/1997) 1.3.x.x  3.2.5  
AIX SNA Services/6000 V1 (EOS 12/31/1995) 1.2.x.x  3.1, 3.2  5601-287 


EOS = End Of Service: No defect work will be performed after this date.  
 
 
84. The dd and od commands:
===========================

Note 1:
-------

http://www.codecoffee.com/tipsforlinux/articles/036.html

>> How and when to use the dd command?  
 

In this article, Sam Chessman explains the use of the dd command with a lot of useful examples. This article is not aimed at absolute beginners. 
Once you are familiar with the basics of Linux, you would be in a better position to use the dd command. 

The ' dd ' command is one of the original Unix utilities and should be in everyone's tool box. It can strip headers, extract parts of 
binary files and write into the middle of floppy disks; it is used by the Linux kernel Makefiles to make boot images. 
It can be used to copy and convert magnetic tape formats, convert between ASCII and EBCDIC, swap bytes, and force to upper and lowercase. 


For blocked I/O, the dd command has no competition in the standard tool set. One could write a custom utility to do specific I/O or 
formatting but, as dd is already available almost everywhere, it makes sense to use it. 

Like most well-behaved commands, dd reads from its standard input and writes to its standard output, unless a command line specification 
has been given. This allows dd to be used in pipes, and remotely with the rsh remote shell command. 

Unlike most commands, dd uses a keyword=value format for its parameters. This was reputedly modeled after IBM System/360 JCL, 
which had an elaborate DD 'Dataset Definition' specification for I/O devices. A complete listing of all keywords is available from GNU dd with 

$ dd --help

Some people believe dd means ``Destroy Disk'' or ``Delete Data'' because if it is misused, a partition or output file can be trashed very quickly. 
Since dd is the tool used to write disk headers, boot records, and similar system data areas, misuse of dd has probably trashed 
many hard disks and file systems. 

In essence, dd copies and optionally converts data. It uses an input buffer, conversion buffer if conversion is specified, and an output buffer. 
Reads are issued to the input file or device for the size of the input buffer, optional conversions are applied, and writes are issued 
for the size of the output buffer. This allows I/O requests to be tailored to the requirements of a task. Output to standard error reports 
the number of full and short blocks read and written. 


Example 1


A typical task for dd is copying a floppy disk. As the common geometry of a 3.5" floppy is 18 sectors per track, two heads and 80 cylinders, 
an optimized dd command to read a floppy is: 

Example 1-a : Copying from a 3.5" floppy

dd bs=2x80x18b if=/dev/fd0 of=/tmp/floppy.image 
1+0 records in
1+0 records out 

The 18b specifies 18 sectors of 512 bytes, the 2x multiplies the sector size by the number of heads, and the 80x is for the cylinders--
a total of 1474560 bytes. This issues a single 1474560-byte read request to /dev/fd0 and a single 1474560 write request to 
/tmp/floppy.image, whereas a corresponding cp command 

cp /dev/fd0 /tmp/floppy.image


issues 360 reads and writes of 4096 bytes. While this may seem insignificant on a 1.44MB file, when larger amounts of data are involved, 
reducing the number of system calls and improving performance can be significant. 


This example also shows the factor capability in the GNU dd number specification. This has been around since before the Programmers Work Bench and, 
while not documented in the GNU dd man page, is present in the source and works just fine, thank you. 


To finish copying a floppy, the original needs to be ejected, a new diskette inserted, and another dd command issued to write to the diskette: 

Example 1-b : Copying to a 3.5" floppy
dd bs=2x80x18b < /tmp/floppy.image > /dev/fd0 
1+0 records in 
1+0 records out 

Here is shown the stdin/stdout usage, in which respect dd is like most other utilities. 


Example 2


The original need for dd came with the 1/2" tapes used to exchange data with other systems and boot and install Unix on the PDP/11. 
Those days are gone, but the 9-track format lives. To access the venerable 9-track, 1/2" tape, dd is superior. With modern SCSI tape devices, 
blocking and unblocking are no longer a necessity, as the hardware reads and writes 512-byte data blocks. 

However, the 9-track 1/2" tape format allows for variable length blocking and can be impossible to read with the cp command. The dd command allows 
for the exact specification of input and output block sizes, and can even read variable length block sizes, by specifying an input buffer size larger 
than any of the blocks on the tape. Short blocks are read, and dd happily copies those to the output file without complaint, simply reporting on the 
number of complete and short blocks encountered. 


Then there are the EBCDIC datasets transferred from such systems as MVS, which are almost always 80-character blank-padded Hollerith Card Images! 
No problem for dd, which will convert these to newline-terminated variable record length ASCII. Making the format is just as easy and dd again 
is the right tool for the job. 

Example 2 : Converting EBCDIC 80-character fixed-length record to ASCII variable-length newline-terminated record 
dd bs=10240 cbs=80 conv=ascii,unblock if=/dev/st0 of=ascii.out
40+0 records in
38+1 records out 

The fixed record length is specified by the cbs=80 parameter, and the input and output block sizes are set with bs=10240. 
The EBCDIC-to-ASCII conversion and fixed-to-variable record length conversion are enabled with the conv=ascii,noblock parameter. 


Notice the output record count is smaller than the input record count. This is due to the padding spaces eliminated from the output file and 
replaced with newline characters. 


Example 3


Sometimes data arrives from sources in unusual formats. For example, every time I read a tape made on an SGI machine, the bytes are swapped. 
The dd command takes this in stride, swapping the bytes as required. The ability to use dd in a pipe with rsh means that the tape device 
on any *nix system is accessible, given the proper rlogin setup. 

Example 3 : Byte Swapping with Remote Access of Magnet Tape
rsh sgi.with.tape dd bs=256b if=/dev/rmt0 conv=swab | tar xvf -


The dd runs on the SGI and swaps the bytes before writing to the tar command running on the local host. 


Example 4

Murphy's Law was postulated long before digital computers, but it seems it was specifically targeted for them. 
When you need to read a floppy or tape, it is the only copy in the universe and you have a deadline past due, that is when you will have a bad spot 
on the magnetic media, and your data will be unreadable. To the rescue comes dd, which can read all the good data around the bad spot and continue 
after the error is encountered. Sometimes this is all that is needed to recover the important data. 

Example 4 : Error Handling
dd bs=265b conv=noerror if=/dev/st0 of=/tmp/bad.tape.image 


Example 5


The Linux kernel Makefiles use dd to build the boot image. In the Alpha Makefile /usr/src/linux/arch/alpha/boot/Makefile, 
the srmboot target issues the command: 

Example 5 : Kernel Image Makefile
dd if=bootimage of=$(BOOTDEV) bs=512 seek=1 skip=1 

This skips the first 512 bytes of the input bootimage file (skip=1) and writes starting at the second sector of the $(BOOTDEV) device (seek=1). 
A typical use of dd is to skip executable headers and begin writing in the middle of a device, skipping volume and partition data. 
As this can cause your disk to lose file system data, please test and use these applications with care.

 
Note 2:
-------

 
85. Openssl, certificates, AIX:
===============================

Note 1:
-------

Short for Secure Sockets Layer, a protocol developed by Netscape for transmitting private documents via the Internet. 
SSL uses a cryptographic system that uses two keys to encrypt data - a public key known to everyone and a private or secret key known only 
to the recipient of the message. Both Netscape Navigator and Internet Explorer support SSL, and many Web sites use the protocol 
to obtain confidential user information, such as credit card numbers. By convention, URLs that require an SSL connection 
start with https: instead of http:. 
Another protocol for transmitting data securely over the World Wide Web is Secure HTTP (S-HTTP). Whereas SSL creates a secure connection 
between a client and a server, over which any amount of data can be sent securely, S-HTTP is designed to transmit individual messages securely. 
SSL and S-HTTP, therefore, can be seen as complementary rather than competing technologies. Both protocols have been approved 
by the Internet Engineering Task Force (IETF) as a standard. 

Note 2:
-------

SSL (Secure Sockets Layer), also known as TLS (Transport Layer Security), is a protocol that allows two programs to communicate 
with each other in a secure way. Like TCP/IP, SSL allows programs to create "sockets," endpoints for communication, and make 
connections between those sockets. But SSL, which is built on top of TCP, adds the additional capability of encryption. 
The HTTPS protocol spoken by web browsers when communicating with secure sites is simply the usual World Wide Web HTTP protocol, 
"spoken" over SSL instead of directly over TCP. 
In addition to providing privacy, SSL encryption also allows us to verify the identity of the party we are talking to. 
This can be very important if we don't trust the Internet. While it is unlikely in practice that the root DNS servers 
of the Internet will be subverted, a "man in the middle" attack elsewhere on the network could substitute the address of one 
Internet site for another. SSL prevents this scenario by providing a mathematically sound way to verify the other program's identity. 
When you log on to your bank's website, you want to be very, very sure you are talking to your bank! 

-- How SSL Works
SSL provides both privacy and security using a technique called "public/private key encryption" (often called "asymmetric encryption" 
or simply "public key encryption"). 
A "public key" is a string of letters and numbers that can be used to encrypt a message so that only the owner of the public key can read it. 
This is possible because every public key has a corresponding private key that is kept secret by the owner of the public key. 

-- The SSL Handshake: Identity and Privacy
Let's suppose Jane wants to log into www.examplebank.com. When Jane's web browser makes an HTTPS connection to www.examplebank.com, 
her browser sends the bank's server a string of randomly generated data, which we'll call the "greeting." 
The web server responds with two things: its own public key encoded in an SSL certificate, which we'll examine more closely later, 
and the "greeting" encrypted with its private key. 

Jane's web browser then decrypts the greeting with the bank's public key. If the decrypted greeting matches the original greeting 
sent by the browser, then Jane's browser can be sure it is really talking to the owner of the private key - 
because only the holder of the private key can encrypt a message in such a way that the corresponding public key will decrypt it. 

Now, let's suppose Bob is monitoring this traffic on the Internet. He has the bank's public key, and Jane's greeting. 
But he doesn't have the bank's private key. So he can't encrypt the greeting and send it back. That means Jane can't be fooled by Bob. 

-- The Identity Problem
But what if Bob inserts himself into the picture even before Jane's browser connects to the bank? What if Jane's browser is actually 
talking to Bob's server from the very beginning? Then Bob can substitute his own public and private keys, encrypt the greeting successfully, 
and convince Jane's browser that his computer is the bank's. Not good! 
That's why the complete SSL handshake includes more than just the bank's public key. The public key is part of an "SSL certificate" issued 
by a certificate authority that Jane's browser already trusts. 

How does this work? When web browser software is installed on a computer, it already contains the public keys of several certificate authorities, 
such as GoDaddy, VeriSign and Thawte. Companies that want their secure sites to be "trusted" by web browsers must purchase 
an SSL certificate from one of these authorities. 

But what is the certificate, exactly? The SSL certificate consists essentially of the bank's public key and a statement 
identifying the bank, encrypted with the certificate authority's private key. 

When the bank's web server sends its certificate to Jane's browser, Jane's browser decrypts it with the public key of the 
certificate authority. If the certificate is fake, the decryption results in garbage. If the certificate is valid, out pops 
the bank's public key, along with the identifying statement. And if that statement doesn't include, among other information, 
the same hostname that Jane connected to, Jane receives an appropriate warning message and decides not to continue the connection. 

Now, let's return to Bob. Can he substitute himself convincingly for the bank? No, he can't, because he doesn't have the certificate authority's 
private key. That means he can't sign a certificate claiming that he is the bank. 

Now that Jane's browser is thoroughly convinced that the bank is what it appears to be, the conversation can continue. 


-- certlist 

Purpose
certlist lists the contents of one or more certificates.

Syntax
certlist [-c] [-a attr [attr....] ]tag [username]

Description
The certlist command lists the contents of one or more certificates. Using the -c option causes the output to be formatted 
as colon-separated data with the attribute names associated with each field on the previous line as follows: 

# name: attribute1: attribute2: ... 
User: value1: value2: ... The -f option causes the output to be formatted in stanza file format with the username attribute 
given as the stanza name. Each attribute=value pair is listed on a separate line: 

user: 
     attribute1=value 
     attribute2=value 
     attribute3=value 

When neither of these command line options are selected, the attributes are output as attribute=value pairs.

Flags
-c Displays the output in colon-separated records. 
-f Displays the output in stanzas. 
-a attr Selects one or more attributes to be displayed. 


=========================
86. OTHER STUFF SECTION:
=========================


Here we comment on a variety of programs, or demons, or commands, found in Solaris, HP, Linux or AIX.


lrud:
=====

lrud (least recently used) is a page managing memory process in AIX.

Solaris uses a completely different page stealing algorythm to AIX, so 
you cannot compare the 2. 

AIX uses LRUD and Solaris uses LIFO (last in 1st 
out). 

Again, AIX will build as large a filesystem cache as possible by 
default. when it hits minperm it is going to scan for pages to free up 
and free them according to LRUD algorythm... the pages it frees up are 
dependant on the number of filepages cached and the maxperm / minperm 
settings. 

If numperm is above maxperm it is non discriminate over what pages to 
mark as candidates to free up, but if numperm is below maxperm, then 
it will only mark file (persistient) pages as candidates to get the 
size of the fs cache down. 

NOTE: by default it only does this once you hit minfree.. 

To strictly set the maximum number of file pages cached you would set 
strict_maxperm, but you usually do not have to do this unless you are 
working with a very large amount of memory (64Gb and up) ... so, i 
would leave well alone if you only have a couple of GB... 


gil:
====

GIL is a kernel process, which does TCP/IP timing. It handles
transmission errors, ACKs, etc. Normally it shouldn't consume too much
CPU, but it can take quite a lot of CPU when the system is using the
network a lot (like with NFS filesystems which are heavily used).
.
The kproc gil runs the TCP/IP timer driven operations. Every 200ms, and
every 500ms the GIL thread is kicked to go run protocol timers. With TCP
up (which is ALWAYS the case), TCP timers are called which end up
looking at every connection on the system (to do retransmission, delayed
acks,etc). In version 4 this work is all done on a multi-threaded kproc
to promote concurrency and SMP scalability.gil.

GIL is one of the kprocs (kernel processes) in AIX 4.3.3, 5.1 and 5.2.
Since the advent of topas in AIX 4.3.3 and changes made to the ps
command in AIX 5.1, system administrators have become aware of this
class of processes, which are not new to AIX. These kprocs have no
user interfaces and have been largely undocumented in base
documentation. Once a kproc is started, typically it stays in the
process table until the next reboot. The system resources used by any
one kproc are accounted as kernel resources, so no separate account is
kept of resources used by an individual kproc.
.
Most of these kprocs are NOT described in base AIX documentation and
the descriptions below may be the most complete that can be found.
.
GIL term is an acronym for "Generalized Interrupt Level" and was
created by the Open Software Foundation (OSF), This is the networking
daemon responsible for processing all the network interrupts, including
incoming packets, tcp timers, etc.
.
Exactly how these kprocs function and much of their expected behavior
is considered IBM proprietary information.


picld:
------

The Platform Information and Control Library (PICL) provides a mechanism to publish platform-specific 
information for clients to access in a platform-independent way. picld maintains and controls access 
to the PICL information from clients and plug-in modules. 
The daemon is started in both single-user and multi-user boot mode.

Upon startup, the PICL daemon loads and initializes the plug-in modules. These modules use the 
libpicltree(3PICLTREE) interface to create nodes and properties in the PICL tree to publish 
platform configuration information. After the plug-in modules are initialized, the daemon opens 
the PICL daemon door to service client requests to access information in the PICL tree.

arraymon:
---------

arraymon is the disk array daemon process sometimes found in Solaris. It performs these major functions:

- Monitoring of the error information maintained by the disk array controllers.

- Reporting of events that require operator attention in a manner selected by the user via 
  the rmparams file and the rmscript file.

- Launching of the parityck utility at the designated time, if the parity check option is enabled.

arraymon maintains logs of the messages currently outstanding on the system console 
and in the file /etc/raid/rmlog.log. In addition, all error information is written to the 
system error log /var/adm/messages ).

sar:
----

System activity data can be accessed at the special  request of  a  user  (see  sar(1))  and  automatically, 
on a routine basis, as described  here.  The  operating  system  contains several  counters  that  are  
incremented  as various system actions occur. These include counters for  CPU  utilization,
buffer  usage,  disk  and  tape  I/O  activity,  TTY  device activity, switching and system-call  activity,  
file-access, queue  activity,  inter-process  communications, and paging.
For  more   general  system  statistics,  use  iostat  (1M), sar(1), or vmstat(1M).

Note 1:
-------

I'm paring down processes and port listners on a Solaris 8 server to have the very minimal services/ports open. 
I have followed their guidelines/blueprints for Solaris 6 hardening.

I need to find out what is listening on the ports below and how to disable services for them.

Specifically, listners on ports 5987, 898, and 32768. (See netstat output below)

Also what are: root 181 1 0 15:08:10 ? 
0:00 /usr/sadm/lib/smc/bin/smcboot root 182 181 0 15:08:10 ? 
0:00 /usr/sadm/lib/smc/bin/smcboot root 56 1 0 15:08:04 ? 
0:00 /usr/lib/sysevent/syseventd root 58 1 0 15:08:05 ? 
0:00 /usr/lib/sysevent/syseventconfd root 67 1 0 15:08:05 ? 
0:01 /usr/lib/picl/picld root 202 1 0 15:08:12 ? 
0:00 /usr/lib/efcode/sparcv9/efdaemon

And can they be disabled? How?

This host will only run standalone firewall and sendmail only. 
On Solaris 2.6 these listners and procs do not exist.


Regarding the "smcboot" process the answer is simple. This is the boot process for the 
Solaris Management Console (SMC) which is a GUI (well - more a framework with a several existing modules) 
to manage your system.

If you're not interested to manage your host using SMC, then you can safely disable this 
(remove or diable /etc/rc2.d/S90wbem). This smc process is also responsible for listening on port 898 and 5987.

The port 32768 is not used for a fixed service. You should check your system to idenfity 
which process is using this port. This can be done by using the pfiles command, e.g. 
"cd /proc; /usr/proc/bin/pfiles * > /tmp/pfiles.out" and then look in /tmp/pfiles.out for the portnumber.

The picld process is a new abstraction layer for programs who want to access platform specific information. 
Instead of using some platform specific program applications can use the picl library to access 
information in a generic way.

Disabling the picld daemon will affect applications which are using the libpicltree. 
You can use the "ldd" command to identify such applications and decide whether you're using them or not. 
Example applications are "prtpicl" or "locator" (see the manpages).

The "syseventd" is responsible for delivering system events and disabling this service will affect your 
ability to create new devices on the fly (e.g. for dynamic reconfiguration). The "efdaemon" is another example 
of such a process which is needed for dynamic reconfiguration.

Disabling syseventd and/or efdaemon havily depends on your required services. 
After creating your devices (boot -r) you can safely turn of these daemons but you'll run into trouble 
when trying dynamic reconfiguration... Without knowing your requirements we can't tell whether it's ok 
to disable those services or not.


bpbkar:
=======

bpbkar is part of the Veritas Netbackup client, usually installed at
/usr/openv/netbackup .


<defunct> process:
==================

Note 1:

In general, defunct processes are caused by a parent process not reaping its children. Find out which process 
is the parent process of all those zombies (ps -e). It's that process that has a bug. 

In Solaris 2.3 (and presumably earlier) there is a bug in the pseudo tty modules that makes them hang in close. 
This causes processes to hang forever while exiting. 

Fix: Apply patch 101415-02 (for 2.3). 

In all Solaris 2 releases prior to 2.5 (also fixed in the latest 2.4 kernel jumbo patch), 
init (process 1) calls sync() every five minutes which can hang init for some considerable time. 
This can cause a lot of zombies accumulating with process 1 as parent, but occurs only in rare circumstances. 

Note 2:

My app has a parent that forks a child. Sometimes, one of them dies and leaves a defunct process, 
along with shared memory segments. I try to get rid of the shared memory and kill the defunct task, 
but to no avail. I then have to reboot the system to clean up the shared memory and to get rid 
of the defunct process. How can I kill a defunct process and get rid of the associated shared memory ?

A defunct task is already dead. You can not kill a "zombie".
The problem is obviously that the app does not expect a child to die and does not make the 
necessary wait calls to relieve the child from its return code.
Did you stopp the app and see what happens?

use ipcrm to release shared memory. 

But a zombie indicates also a programming problem with the application. 
So it is time to redesign the application. 

Note 3:

A zombie process is a process which has died and whose parent process is still running 
and has not wait()ed for it. In other words, if a process becomes a zombie, it means 
that the parent process has not called wait() or waitpid() to obtain the child process's 
termination status. Once the parent retrieves a child's termination status, that child process 
no longer appears in the process table.

You cannot kill a zombie process, because it's already dead. It is taking up space in the process table, 
and that's about it.

If any process terminates before its children do, init inherits those children. 
When they die, init calls one of the wait() functions to retrieve the child's termination status, 
and the child disappears from the process table.

A zombie process is not, in and of itself, harmful, unless there are many of them taking up space 
in the process table. But it's generally bad programming practice to leave zombies lying around, in the same way
that it's generally a Bad Thing to never bother to free memory you've malloc()ed. 

Note 4:

Other than Windows, unix manages an explicit parent-child relationships between processes. 
When a child process dies, the parent will receive a notification. It is then the duty of the parent process 
to explicitly take notice of the childs demise by using the wait() system call. The return value of the wait() 
is the process ID of the child, which gives the parent exact control about which of its children are still alive. 
As long as the parent hasn't called wait(), the system needs to keep the dead child in the global process list, 
because that's the only place where the process ID is stored. The purpose of the "zombies" is really just for 
the system to remember the process ID, so that it can inform the parent process about it on request. 
If the parent "forgets" to collect on its children, then the zombie will stay undead forever. 
Well, almost forever. If the parent itself dies, then "init" (the system process with the ID 0) will take over 
fostership over its children and catch up on the neglected parental duties


S_IFCHR and S_IFDOOR:
=====================

Suppose you use the pfiles command on a PID

# /usr/proc/bin/pfiles 194
194:    /usr/sbin/nscd
  Current rlimit: 256 file descriptors
   0: S_IFCHR mode:0666 dev:85,1 ino:3291 uid:0 gid:3 rdev:13,2
      O_RDWR
   1: S_IFCHR mode:0666 dev:85,1 ino:3291 uid:0 gid:3 rdev:13,2
      O_RDWR
   2: S_IFCHR mode:0666 dev:85,1 ino:3291 uid:0 gid:3 rdev:13,2
      O_RDWR
   3: S_IFDOOR mode:0777 dev:275,0 ino:0 uid:0 gid:0 size:0
      O_RDWR FD_CLOEXEC  door to nscd[194]


# /usr/proc/bin/pfiles 254
254: /usr/dt/bin/dtlogin -daemon Current rlimit: 2014 file descriptors 
0: S_IFDIR mode:0755 dev:32,24 ino:2 uid:0 gid:0 size:512 
O_RDONLY|O_LARGEFILE 1: S_IFDIR mode:0755 dev:32,24 ino:2 uid:0 gid:0 size:512 
O_RDONLY|O_LARGEFILE 2: S_IFREG mode:0644 dev:32,24 ino:143623 uid:0 gid:0 size:41 
O_WRONLY|O_APPEND|O_LARGEFILE 3: S_IFCHR mode:0666 dev:32,24 ino:207727 uid:0 gid:3 rdev:13,12 
O_RDWR 4: S_IFSOCK mode:0666 dev:174,0 ino:4686 uid:0 gid:0 size:0 
O_RDWR|O_NONBLOCK 5: S_IFREG mode:0644 dev:32,24 ino:143624 uid:0 gid:0 size:4 
O_WRONLY|O_LARGEFILE advisory write lock set by process 245 7: 
S_IFSOCK mode:0666 dev:174,0 ino:3717 uid:0 gid:0 size:0 O_RDWR 8: 
S_IFDOOR mode:0444 dev:179,0 ino:65516 uid:0 gid:0 size:0 O_RDONLY|O_LARGEFILE FD_CLOEXEC door to nscd[171] 

This listing shows the files open by the dtlogin process. Notice how easy it is to decipher the file types 
in this output. We have: 

S_IFDIR directory files
S_IFREG regular files 
S_IFCHR character mode device 
S_IFSOCK sockets S_IFDOOR a "door" file

Flags That Specify Access Type
The following OFlag parameter flag values specify type of access:

O_RDONLY The file is opened for reading only. 
O_WRONLY The file is opened for writing only. 
O_RDWR The file is opened for both reading and writing. 


Limits on the number of files that a process can open can be changed system-wide in the /etc/system file. 

If you support a process that opens a lot of sockets, then you can monitor the number of open files 
and socket connections by using a command such as this: 

# /usr/proc/bin/pfiles <procID> | grep mode | wc -l 

The third limit determines how many file references can be held in memory at any time (in the inode cache). 
If you're running the sar utility, then a sar -v command will show you (in one column of its output (inod-sz)) 
the number of references in memory and the maximum possible. On most systems, these two numbers will be oddly 
stable throughout the day. The system maintains the references even after a process has stopped running 
-- just in case it might need them again. These references will be dropped and the space reused as needed. 
The sar output might look like this: 

00:00:00 proc-sz ov inod-sz ov file-sz ov 11:20:00 400/20440 0 41414/46666 0 1400/1400 0 0/0 

The 4th field reports the number of files currently referenced in the inode cache and 
the maximum that can be stored. 


EXP shell script:
=================

#!/usr/bin/ksh
NLS_LANG=AMERICAN_AMERICA.WE8ISO8859P1
export NLS_LANG
ORACLE_SID=ECM
export ORACLE_SID
cd /u03/dumps/ECM
mv ECM.dmp.Z ECMformer.dmp.Z
exp system/arcturus81 file=ECM.dmp full=y statistics=none
cp ECM.dmp /u01/dumps/ECM
compress -v ECM.dmp

xntpd:
======

The xntpd daemon sets and maintains a Unix system time-of-day in compliance with Internet standard time servers. 
The xntpd daemon is a complete implementation of the Network Time Protocol (NTP) version 3 standard, 
as defined by RFC 1305, and also retains compatibility with version 1 and 2 servers as defined by 
RFC 1059 and RFC 1119, respectively. The xntpd daemon does all computations in fixed point arithmetic 
and does not require floating point code. 

The xntpd daemon reads from a configuration file (/etc/ntp.conf is the default) at startup time. 
You can override the configuration file name from the command line. You can also specify a working, 
although limited, configuration entirely on the command line, eliminating the need for a configuration file. 
Use this method when configuring the xntpd daemon as a broadcast or multicast client, that determines 
all peers by listening to broadcasts at runtime. You can display the xntpd daemon internal variables with the 
ntpq command (Network Time Protocol (NTP) query program). You can alter configuration options 
with the xntpdc command. 

Note for AIX: checking the status of the xntpd subsystem:

# lssrc -s xntpd


utmpd:
======

Solaris:

NAME
     utmpd - utmpx monitoring daemon

SYNOPSIS
     utmpd [-debug]

DESCRIPTION
     The  utmpd daemon  monitors  the  /var/adm/utmpx  file.  See
     utmpx(4) (and utmp(4) for historical information).

     utmpd receives requests from  pututxline(3C)  by  way  of  a
     named  pipe.  It  maintains  a  table  of processes and uses
     poll(2) on /proc files to detect process  termination.  When
     utmpd  detects that a process has terminated, it checks that
     the process has removed its utmpx entry from /var/adm/utmpx.
     If  the  process'   utmpx entry has not been removed,  utmpd
     removes   the   entry.   By   periodically   scanning    the
     /var/adm/utmpx  file, utmpd also monitors processes that are
     not in its table.

OPTIONS
     -debug
           Run  in debug mode, leaving the process  connected  to
           the controlling terminal.  Write debugging information
           to standard output.

HP-UX 11i:

NAME    [Toc]    [Back]
      utmpd - user accounting database daemon

 SYNOPSIS    [Toc]    [Back]
      /usr/sbin/utmpd

 DESCRIPTION    [Toc]    [Back]
      utmpd, user accounting database daemon, manages the user accounting
      database which is the database of currently logged-in users.  This was
      previously maintained by /etc/utmp and /etc/utmpx files on HP-UX.

      Upon startup, utmpd writes its pid to the file
      /etc/useracct/utmpd_pid.  Applications can add, update, or query
      entries into the database using the getuts() APIs.  See the getuts(3C)
      manual page for more information.

      utmpd(1M) takes care of synchronizing the legacy /etc/utmpx file and
      its own in-memory database.  The synchronization is bi-directional
      from the utmpd's database to the /etc/utmpx and from the /etc/utmpx
      file to utmpd's database.  However, this synchronization does not
      happen in real time.  There is a time lag which could span from a few
      seconds on a lightly loaded system to a few minutes on a heavily
      loaded system.


pwconv:
=======

NAME
     pwconv - installs and updates /etc/shadow  with  information
     from /etc/passwd

DESCRIPTION
     The pwconv command  creates  and  updates  /etc/shadow  with
     information from /etc/passwd.

     pwconv relies on a special value  of  'x'  in  the  password
     field  of /etc/passwd.  This value of 'x' indicates that the
     password for the user is already in /etc/shadow  and  should
     not be modified.

     If the /etc/shadow file does not exist,  this  command  will
     create  /etc/shadow  with information from /etc/passwd.  The
     command populates /etc/shadow with the  user's  login  name,
     password, and password aging information.  If password aging
     information does not exist in /etc/passwd for a given  user,
     none  will  be  added  to  /etc/shadow.   However,  the last
     changed information will always be updated.

     If the /etc/shadow file does exist, the following tasks will
     be performed:

          Entries that are in the /etc/passwd file and not in the
          /etc/shadow file will be added to the /etc/shadow file.

          Entries that are in the /etc/shadow file and not in the
          /etc/passwd file will be removed from /etc/shadow.

          Password attributes (for example,  password  and  aging
          information) that exist in an /etc/passwd entry will be
          moved to the corresponding entry in /etc/shadow.

     The pwconv command can only be used by the super-user.


ESCON:
======

The Enterprise System Connection Architecture�, ESCON, was developed by IBM as a channel connection architecture 
with the intent of improving connectivity by incorporating fibre optics into a network. ESCON uses fibre optics 
to replace existing bus and tag cables in a new or existing data centre. Designed to connect a wide range of 
peripherals to IBM mainframe computers, the architecture supports data communications at a speed of 200Mbps.

Basically, its a fiber optic switch, connecting Control Units or other nodes.


FICON:
======

FICON (for Fiber Connectivity) is a high-speed input/output (I/O) interface for mainframe computer connections 
to storage devices or other nodes. As part of IBM's S/390 or z servers, FICON channels increase I/O capacity 
through the combination of a new architecture and faster physical link rates to make them up to eight times 
as efficient as ESCON (Enterprise System Connection), IBM's previous fiber optic channel standard.  


nscd:
=====

     nscd is a process that provides a cache for the most  common
     name  service requests. It starts up during multi-user boot.
     The default configuration-file /etc/nscd.conf determines the
     behavior of the cache daemon. See nscd.conf(4).

     nscd provides caching for the passwd(4), group(4), hosts(4),
     ipnodes(4),  exec_attr(4),  prof_attr(4),  and  user_attr(4)
     databases  through  standard  libc   interfaces,   such   as
     gethostbyname(3NSL),               getipnodebyname(3SOCKET),
     gethostbyaddr(3NSL), and others. Each cache has  a  separate
     time-to-live  for  its  data;  modifying  the local database
     (/etc/hosts, /etc/resolv.conf, and  so  forth)  causes  that
     cache  to become invalidated upon the next call to nscd. The
     shadow file is specifically not cached.  getspnam(3C)  calls
     remain uncached as a result.

     nscd also  acts  as  its  own  administration  tool.  If  an
     instance  of nscd is already running, commands are passed to
     the running version transparently.

     In order to preserve NIS+ security, the startup  script  for
     nscd (/etc/init.d/nscd) checks the permissions on the passwd
     table if NIS+ is being used. If this table cannot be read by
     unauthenticated  users,  then  nscd  will make sure that any
     encrypted password information returned from the NIS+ server
     is supplied only to the owner of that password.


A sample /etc/nscd.conf file, which minimizes the functionality of nscd, is as follows: 

logfile                 /var/adm/nscd.log
enable-cache            passwd          no
enable-cache            group           no
positive-time-to-live   hosts           3600
negative-time-to-live   hosts           5
suggested-size          hosts           211
keep-hot-count          hosts           20
old-data-ok             hosts           no
check-files             hosts           yes
enable-cache            exec_attr       no
enable-cache            prof_attr       no
enable-cache            user_attr       no


If your system has any instability with respect to host names and/or IP addresses, it is possible 
to substitute the following line for all the above lines containing hosts. 
This may slow down host name lookups, but it should fix the name translation problem. 

enable-cache            hosts           no


EBCDIC and unix:
================

thread 1:
---------

Take a look at the dd command with the option
dd conv=ebcdic

See man dd for more details.


thread 2:
---------

1. nvdmetoa command:

How to convert EBCDIC files to ASCII:
On your AIX system, the tool nvdmetoa might be present.

Examples:
 
nvdmetoa <AS400.dat  >AIXver3.dat 

Converts an EBCDIC file taken off an AS400 and converts to an ASCII file for the pSeries or RS/6000 

nvdmetoa 132 <AS400.txt  >AIXver3.txt 

Converts an EBCDIC file with a record length of 132 characters to an ASCII file with 132 bytes per line 
PLUS 1 byte for the linefeed character. 


thread 3:
---------

od command:

The od command translate a file into other formats, like for example hexadecimal format.
To translate a file into several formats at once, enter: 

# od -t cx a.out > a.xcd

This command writes the contents of the a.out file, in hexadecimal format (x) and character format (c), 
into the a.xcd file. 

thread 4:
---------

I'm using the DD command in UNIX to convert ASCII to EBCDIC so that I can print 
via "lp" to a AS/400 attached printer.  I'm using the AS/400 as a print server.  
The command below works fine except that the carriage return/line feed disappear.  
The file prints without the carriage return line feed.

Here is the unix command:

cat $file | dd ibs=80 cbs=132 conv=ebcdic | lp -d AS400PRNT -s


utmp, wtmp, failedlogin File Format:
====================================

AIX:
----

Purpose
Describes formats for user and accounting information.

Description
The utmp file, the wtmp file, and the failedlogin file contain records with user and accounting information.

When a user attempts to logs in, the login program writes entries in two files:

The /etc/utmp file, which contains a record of users logged into the system. 
The /var/adm/wtmp file (if it exists), which contains connect-time accounting records.
On an invalid login attempt, due to an incorrect login name or password, the login program makes an entry in:

The /etc/security/failedlogin file, which contains a record of unsuccessful login attempts.
The records in these files follow the utmp format, defined in the utmp.h header file.

To convert a binary record in wtmp format to an ASCII record called dummy.file, enter: 

/usr/sbin/acct/fwtmp < /var/adm/wtmp > /etc/dummy.file
The content of a binary wtmp file is redirected to a dummy ASCII file.

failedlogin:

Use the who command to read the contents of the /etc/security/failedlogin file:

# who /etc/security/failedlogin

# who /etc/security/failedlogin > /tmp/failed_login.txt

To clear the file use:

# cp /dev/null /etc/security/failedlogin


The /etc/default/login file:
==========================I=

If you uncomment, or put, the line

CONSOLE=/dev/console

into the "/etc/default/login" file, root can only logon from the console,
and not from other terminals.

Ofcourse, on a normal terminal, you can still logon with your useraccount and then su to root.


Notes on the libc libary in AIX 5L:
===================================

What is it?
-----------

Most unixes has a couple of important shared libraries. One of them is the libc.a lib on AIX.

libc = C Libary
glibc = GNU C library (on linux and open systems)

It is an XCOFF shared library under AIX and hence a critical part of the running
system. 

The standard C library, `libc.a', is automatically linked into your programs by the `gcc' control program. 
Or it is used by C/C++ compilers to create statically linked programs.
It provides many of the functions that are normally associated with C programs 

For each function or variable that the library provides, the definition of that symbol will include 
information on which header files to include in your source to obtain prototypes and type definitions 
relevant to the use of that symbol. 

Note that many of the functions in `libm.a' (the math library) are defined in `math.h' but are not present 
in libc.a. Some are, which may get confusing, but the rule of thumb is this--the C library contains 
those functions that ANSI dictates must exist, so that you don't need the -lm if you only use ANSI functions. 
In contrast, `libm.a' contains more functions and supports additional functionality such as the matherr 
call-back and compliance to several alternative 
standards of behavior in case of FP errors. 


Version:
--------

On AIX, you can determine the version of the libc fileset on your machine as follows:

# lslpp -l bos.rte.libc


Its gone, now what?
-------------------

Note: You might want to look at the "recsh" recovery shell command first.

Other ways to recover:

You can recover from this without rebooting or reinstalling, if you
have another copy of libc.a available that is also named "libc.a".  If
you moved libc.a to a different directory, you're in luck -- do the
following:

export LIBPATH=/other/directory


And your future commands will work.  But if you renamed libc.a, this
won't do it.  If you have an NFS mounted directory somewhere, you can
put libc.a on the that host, and point LIBPATH to that directory as
shown above.

Or..

If you have a good copy of from somewhere..

Copy the libc.a fix into place, e.g.,

a. # cp -f your_dir/locale_format/lib/libc.a /usr/ccs/lib/
b. # chown bin.bin /usr/ccs/lib/libc.a
c. # chmod 555 /usr/ccs/lib/libc.a
d. # ln -sf /usr/ccs/lib/libc.a /usr/lib/libs.a
e. # unset LIBPATH
f. # slibclean

Make sure that the new libraries will be picked up at
the next reboot.

Now Reboot.


IBM's version on how to recover:
--------------------------------


Restore Access to an Unlinked or Deleted System Library
When the existing libc.a library is not available, most operating system commands are not recognized. 
The most likely causes for this type of problem are the following: 

The link in /usr/lib no longer exists. 
The file in /usr/ccs/lib has been deleted.

The following procedure describes how to restore access to the libc.a library. This procedure requires 
system downtime. If possible, schedule your downtime when it least impacts your workload to protect 
yourself from a possible loss of data or functionality.

The information in this how-to was tested using AIXr 5.3. If you are using a different version or level of AIX, 
the results you obtain might vary significantly. 

Restore a Deleted Symbolic Link

Use the following procedure to restore a symbolic link from the /usr/lib/libc.a library to 
the /usr/ccs/lib/libc.a path:

With root authority, set the LIBPATH environment variable to point to the /usr/ccs/lib directory by typing 
the following commands: 
# LIBPATH=/usr/ccs/lib:/usr/lib
# export LIBPATH

At this point, you should be able to execute system commands. 
To restore the links from the /usr/lib/libc.a library and the /lib directory to the /usr/lib directory, 
type the following commands: 
ln -s /usr/ccs/lib/libc.a /usr/lib/libc.a
ln -s /usr/lib /lib

At this point, commands should run as before. If you still do not have access to a shell, 
skip the rest of this procedure and continue with the next section, Restore a Deleted System Library File. 
Type the following command to unset the LIBPATH environment variable. 

unset LIBPATH


Symbol resolution failed for /usr/lib/libc_r.alibc_r.a:
=======================================================


Note 1:
-------

libc_r.a is a standard re-entrant C library, which allows synchronization of the tasks at exit. 


Note 2:
-------

thread:

Q:

Hi there

I've just tried to install Informix 9.3 64-bit on AIX 52. It failed with the
error shown below. Any suggestions as to what could be wrong? I tried to
find information on the web as to what versions of Informix (if any) are
supported on AIX52, but could not find anything.

I would be grateful for any advice in this matter.


Disk Initializing Demo IBM Informix Dynamic Server
exec(): 0509-036 Cannot load program
/u01/app/informix-9.3-64/server/bin/oninit
because of the following errors:
0509-130 Symbol resolution failed for /usr/lib/libc_r.a[aio_64.o]
becau
e:
0509-136 Symbol kaio_rdwr64 (number 0) is not exported from
dependent module /unix.
0509-136 Symbol listio64 (number 1) is not exported from
dependent module /unix.
0509-136 Symbol acancel64 (number 2) is not exported from
dependent module /unix.
0509-136 Symbol iosuspend64 (number 3) is not exported from
dependent module /unix.
0509-136 Symbol aio_nwait (number 4) is not exported from
dependent module /unix.
0509-150 Dependent module libc_r.a(aio_64.o) could not be loaded.
0509-026 System error: Cannot run a file that does not have a valid
for
at.
0509-192 Examine .loader section symbols with the
'dump -Tv' command.
Bundle Install program has finished

A:

Did you enable AIX aio? If not then run the following smit command.

$ smit aio

Choose "Change / Show Characteristics of Asynchronous I/O"

Set the state to be configure at system restart to Available.
Set state of fast path to Enable.

Also check that you enabled 64-bit version of AIX run time.


Note 3:
-------

Q:

Suppose you get the error: Symbol resolution failed for /usr/lib/libc_r.a

Examples:

Error:  Exec(): 0509-036 Cannot load program
Article ID: 20180 
Software:  ArcGIS - ArcInfo 8.0.1, 8.0.2, 8.1 ArcView GIS 3.1, 3.2, 3.2a 
Platforms:  AIX 4.3.2.0, 4.3.3.0 

Error Message
Executing some ArcInfo Workstation commands, or running ArcView GIS cause the following errors to occur: 

Exec(): 0509-036 Cannot load program ... because of the 
0509-130 Symbol resolution failed for /usr/lib/libc_r.a(aio.o) because: 
0509-136 Symbol kaio_rdwr (number 0) is not exported from dependant module /unix 
0509-136 Symbol listio (number 1) is not exported from dependant 
module /unix 
0509-136 Symbol acancel (number 2) is not exported from dependant module /unix 
0509-136 Symbol iosuspend (number 3) is not exported from dependant module /unix 
0509-136 Symbol aio_nwait (number 4) is not exported from dependant module /unix 
0509-192 Examine .loader section symbols with the 'dump -Tv' command.


root@n5110l13:/appl/emcdctm/dba/log#cat dmw_et.log
Could not load program ./documentum:
Symbol resolution failed for /usr/lib/libc_r.a(aio.o) because:
        Symbol kaio_rdwr (number 0) is not exported from dependent
          module /unix.
        Symbol listio (number 1) is not exported from dependent
          module /unix.
        Symbol acancel (number 2) is not exported from dependent
          module /unix.
        Symbol iosuspend (number 3) is not exported from dependent
          module /unix.
        Symbol aio_nwait (number 4) is not exported from dependent
          module /unix.
        Symbol aio_nwait64 (number 5) is not exported from dependent
          module /unix.
        Symbol aio_nwait_timeout (number 6) is not exported from dependent
          module /unix.
        Symbol aio_nwait_timeout64 (number 7) is not exported from dependent
          module /unix.
System error: Error 0


A:

Cause
The AIX asynchronous I/O module has not been loaded.

Solution or Workaround
Load asynchronous I/O. You must do this as a ROOT user:

Use SMITTY and navigate to Devices > Async I/O > Change/Show. 
Make the defined option available. 
Reboot the machine. 

or

Enable AIO by running the following commands: 
/usr/sbin/chdev -l aio0 -a autoconfig=available 
/usr/sbin/mkdev -l aio0 


KDB kernel debugger and kdb command:
====================================

AIX Only

KDB kernel debugger and kdb command
This document describes the KDB kernel debugger and kdb command. The KDB kernel debugger and the kdb command 
are the primary tools a developer uses for debugging device drivers, kernel extensions, and the kernel itself. 
Although they appear similar to the user, the KDB kernel debugger and the kdb command are two separate tools:

-- KDB kernel debugger: 
The KDB kernel debugger is integrated into the kernel and allows full control of the system while a 
debugging session is in progress. The KDB kernel debugger allows for traditional debugging tasks such as 
setting breakpoints and single-stepping through code. 

-- kdb command: 
This command is implemented as an ordinary user-space program and is typically used for post-mortem analysis 
of a previously-crashed system by using a system dump file. The kdb command includes subcommands specific to the 
manipulation of system dumps. 

Both the KDB kernel debugger and kdb command allow the developer to display various structures normally found 
in the kernel's memory space. Both do the following:

-Provide numerous subcommands to decode various data structures found throughout the kernel. 
-Print the data structures in a user-friendly format. 
-Perform debugging at the machine instruction level. Although this is less convenient than source level debugging, 
 it allows the KDB kernel debugger and the kdb command to be used in the field where access to source code 
 might not be possible. 
-Process the debugging information found in XCOFF objects. This allows the use of symbolic names for functions 
 and global variables.


slibclean:
==========

AIX:

Note 1:

Removes any currently unused modules in kernel and library memory.

Syntax

# slibclean


Description
The slibclean command unloads all object files with load and use counts of 0. It can also be used to 
remove object files that are no longer used from both the shared library region and in the shared library 
and kernel text regions by removing object files that are no longer required.

Files
/usr/sbin/slibclean Contains the slibclean command. 


thread_getregs, thread_waitlock, sigprocmask:
=============================================

Note 1:
-------

thread:

Q:

thread_waitlock

Hello all 
Can someone please provide me with a link to where the above function is 
documented ?? I know its part of libc_r.a and is used for thread 
synchronization ... I need to get some details on the function as to 
what exactly it does since a program I'm trying to debug is getting a 
ENOTSUP error while calling this function ... 

Would really appreciate the help. 

A:

thread_waitlock()
Reply from Richard Joltes on 8/25/2003 5:48:00 PM  

This doesn't seem to be documented anywhere, but it appears this 
function is _not_ in libc(_r). I found this elsewhere: 

"...kernel symbols are defined by import lists found in /usr/lib. 
You'll need threads.exp and syscalls.exp. Look at the -bI option 
in the ld documentation." 

You'll find references to this function if you look through these 
two file so maybe that's your best option. Threads.exp even 
specifically says "the system calls listed below are not imported 
by libc.a." 


Note 2:
-------

thread:

APAR: IY17298 COMPID: 5765C3403 REL: 430 
ABSTRACT: ASSERT IN THREAD_SETSTATE_FAST 

PROBLEM DESCRIPTION: 
Program termination due to an assert in thread_setstate_fast. 

PROBLEM SUMMARY: 
Assert in thread_setstate_fast 

PROBLEM CONCLUSION: 
Increase lock scope. 


Note 3:
-------

thread:

Paul Pluzhnikov wrote: 
> "pankajtakawale" <pankaj.takaw...@gmail.com> writes: 

> > Here is the snippet of truss output 
> ... 
> > sbrk(0x00000060)                              Err#12 ENOMEM 


> > Do i need to increase swap space or thread stack size? 


> Increasing swap might help, but I would not expect it. 
> You are running out of *heap* space. Check your limits, e.g. 'ulimit 
> -a' in *sh or 'limit' in *csh. 

Yes process was running out of heap space. In my local environment I 
decreased soft limit of data segment and ran app. truss showed 'sbrk 
faild with ENOMEM'. Now Im planning to run app in very heavy 
configuration such that 'unlimited data segment' too will be 
insufficient. And on same configuration I will make app large addr 
space model by setting env variable LDR_CNTRL (app shud run in large 
addr space model). And will update thread with results. 
Thanks for your valuable help Paul. 


Note 4:
-------

+   On Linux, the interface exports a bunch of "#define __NR_foo 42" style 
+   definitions, so there is no implementation. 
+ 
+   On AIX, syscall numbers are not fixed ahead of time; in principle 
+   each process can have its own assignment of numbers to actual 
+   syscalls.  As a result we have a bunch of global variables to store 
+   the number for each syscall, which are assigned to at system 
+   startup, and a bunch of #defines which map "__NR_foo" names to 
+   these global variables.  Initially, when we don't know what a 
+   syscall's number is, it is set to __NR_AIX5_UNKNOWN. 
+ 
+   Therefore, on AIX, this module provides a home for those variables. 
+ 
+   It also provides VG_(aix5_register_syscall) to assign numbers to 
+   those variables. 
+*/ 

e.g.

+Int VG_(aix5_NR__sigqueue) = __NR_AIX5_UNKNOWN; 
+Int VG_(aix5_NR__sigsuspend) = __NR_AIX5_UNKNOWN; 
+Int VG_(aix5_NR__sigaction) = __NR_AIX5_UNKNOWN; 
+Int VG_(aix5_NR_sigprocmask) = __NR_AIX5_UNKNOWN; 
+Int VG_(aix5_NR_siglocalmask) = __NR_AIX5_UNKNOWN; 
+Int VG_(aix5_NR_count_event_waiters) = __NR_AIX5_UNKNOWN; 
+Int VG_(aix5_NR_thread_waitact) = __NR_AIX5_UNKNOWN; 
+Int VG_(aix5_NR_thread_waitlock_local) = __NR_AIX5_UNKNOWN; 
+Int VG_(aix5_NR_thread_waitlock) = __NR_AIX5_UNKNOWN; 
+Int VG_(aix5_NR_thread_wait) = __NR_AIX5_UNKNOWN; 
+Int VG_(aix5_NR_thread_unlock) = __NR_AIX5_UNKNOWN; 
+Int VG_(aix5_NR_thread_twakeup_unlock) = __NR_AIX5_UNKNOWN; 
+Int VG_(aix5_NR_thread_twakeup_event) = __NR_AIX5_UNKNOWN; 
+Int VG_(aix5_NR_thread_twakeup) = __NR_AIX5_UNKNOWN; 
+Int VG_(aix5_NR_thread_tsleep_event) = __NR_AIX5_UNKNOWN; 
+Int VG_(aix5_NR_thread_tsleep_chkpnt) = __NR_AIX5_UNKNOWN; 
+Int VG_(aix5_NR_thread_tsleep) = __NR_AIX5_UNKNOWN; 
+Int VG_(aix5_NR_thread_post_many) = __NR_AIX5_UNKNOWN; 
+Int VG_(aix5_NR_thread_post) = __NR_AIX5_UNKNOWN; 
+Int VG_(aix5_NR_ue_proc_unregister) = __NR_AIX5_UNKNOWN; 
+Int VG_(aix5_NR_ue_proc_register) = __NR_AIX5_UNKNOWN; 
+Int VG_(aix5_NR_kthread_ctl) = __NR_AIX5_UNKNOWN; 
+Int VG_(aix5_NR__thread_setsched) = __NR_AIX5_UNKNOWN; 
+Int VG_(aix5_NR_threads_runnable) = __NR_AIX5_UNKNOWN; 
+Int VG_(aix5_NR_thread_getregs) = __NR_AIX5_UNKNOWN; 
+Int VG_(aix5_NR_thread_terminate_unlock) = __NR_AIX5_UNKNOWN; 
+Int VG_(aix5_NR_thread_terminate_ack) = __NR_AIX5_UNKNOWN; 
+Int VG_(aix5_NR_thread_setstate_fast) = __NR_AIX5_UNKNOWN; 
etc....


skulker command:
================

AIX:

The skulker command is a command file for periodically purging obsolete or unneeded files from file systems. 
Candidate files include files in the /tmp directory, files older than a specified age, a.out files, core files, 
or ed.hup files. 

The skulker command is normally invoked daily, often as part of an accounting procedure run by the cron command 
during off-peak periods. Modify the skulker command to suit local needs following the patterns shown in the 
distributed version. System users should be made aware of the criteria for automatic file removal. 

The find command and the xargs command form a powerful combination for use in the skulker command. 
Most file selection criteria can be expressed conveniently with find expressions. The resulting file list 
can be segmented and inserted into rm commands using the xargs command to reduce the overhead that would 
result if each file were deleted with a separate command. 

Note 
Because the skulker command is run by a root user and its whole purpose is to remove files, 
it has the potential for unexpected results. Before installing a new skulker command, test any additions 
to its file removal criteria by running the additions manually using the xargs -p command. After you have 
verified that the new skulker command removes only the files you want removed, you can install it. 
 
To enable the skulker command, you should use the crontab -e command to remove the comment statement 
by deleting the # (pound sign) character from the beginning of the /usr/sbin/skulker line in the 
/var/spool/cron/crontabs/root file. 


L1 cache, L2 cache, L3 cache and differences:
=============================================


Note 1:
-------

Q:

Question from cmos "What is L2 cache? 

What is L3 cache? 

What is the major difference between the two." 

A:

Answer from LinksMaster "CPU Cache (the example CPU is a little old but the concepts are still the same) 

* The initial level of storage on a processor are the registers. The registers are where the actually processing 
input and output takes place.

* L1 cache - Then the level 1 cache comes next. It is logically the closest high speed memory 
to the CPU core / registers. It usually runs at the full speed (meaning the same as the CPU core clockspeed). 
L1 often comes in size of 8kB, 16kB, 32kB, 64kB or 128kB. But, it is very high speed even though the amount 
is relatively small.

* L2 cache - The next level of cache is L2, or level 2. Nowadays L2 is larger than L1 and it often comes in 
256kB, 512kB and 1,024MB amounts. L2 often runs at 1/4, 1/2 or full speed in relation to the CPU core clockspeed.

* L3 cache - Level 3 cache is something of a luxury item. Often only high end workstations and servers 
need L3 cache. L3 has been both "on-die", meaning part of the CPU or "external" meaning mounted near 
the CPU on the motherboard. It comes in many sizes and speeds.


Note 2:
-------

L2 cachhe, short for Level 2 cache, cache memory that is external to the microprocessor. In general, L2 cache memory, 
also called the secondary cache, resides on a separate chip from the microprocessor chip. 
Although, more and more microprocessors are including L2 caches into their architectures. 

As more and more processors begin to include L2 cache into their architectures, Level 3 cache is now the name 
for the extra cache built into motherboards between the microprocessor and the main memory. 

Quite simply, what was once L2 cache on motherboards now becomes L3 cache when used with microprocessors 
containing built-in L2 caches. 


xcom:
=====

Used for filetransfer between systems with many nice features like printing a report of the transfer,
queuing of transfers, EBCIDIC - ASCII conversion, scheduling etc..

The command "xcom62" is used for SNA networks.
The command "xcomtcp" is used for the tcpip networks.

The xcom daemon is "xcomd" in /opt/xcom/bin

Use:
xcomd -c	to kill/stop
xcomd		to start  the daemon.

logging and config of xcom events can be found in:
/usr/spool/xcom 

- xcom.glb has all conf. settings and is a superset of xcom.cnf
- After reconfig, stop and start the daemon
- atoe files are for ascii / ebcdic conversions
- xcomqm for maintaining queues
- xcom.trusted must be defined for each trusted transfer
- xcomtool is for GUI

sending of files is done with "xcomtcp -c[1234] other options"
where -c1 means sending a file

Example commands:
xcomtcp -c1 -f /tmp/xcom.cnf LOCAL_FILE=/tmp/xcomtest.txt REMOTE_FILE=Q:\
REMOTE_SYSTEM=NLPA020515.patest.nl.eu.abnamro.com QUEUE=NO PROTOCOL=TCPIP PORT=8044

xcomtcp -c1 -f /tmp/xcom.cnf LOCAL_FILE=/tmp/xcomtest.txt REMOTE_FILE=c:\test.txt
REMOTE_SYSTEM=NLPR020796.branches.nl.eu.abnamro.com QUEUE=NO PROTOCOL=TCPIP PORT=8044


xcomtcp -c1 -f /tmp/xcom.cnf LOCAL_FILE=/tmp/xcomtest.txt REMOTE_FILE=Q:\XCOM\DATA\IN\t.txt
REMOTE_SYSTEM=NLPA020515.patest.nl.eu.abnamro.com QUEUE=NO PROTOCOL=TCPIP PORT=8044

xcomtcp -c1 -f /tmp/xcom.cnf LOCAL_FILE=/tmp/xcomtest.txt REMOTE_FILE=Q:\
REMOTE_SYSTEM=NLPA020515.patest.nl.eu.abnamro.com QUEUE=NO PROTOCOL=TCPIP PORT=8044

where "/tmp/xcom.cnf" contains parameters (e.g. userid, password etc..) not specified at the prompt.


vmcid: IBM  minor code: E02
===========================


Note 1:
------- 

Seems there is no port 32888 assign on this systeem ZD111L08

/etc/services
#                                32775-32895            # Unassigned

Maybe this is the issue..

Note 2:
-------


AIX:  0403-031 The fork function failed:
========================================


run "svmon -P" to see the top consumers.
You might try using:

1. increase the paging space

chps -s number_of_PP's hd6 

Or 

2. increase the maxuproc might help if its too low:

maxuproc:    Specifies the maximum number of processes per user ID. 
Values:      Default: 40; Range: 1 to 131072 
Display:     lsattr -E -l sys0 -a maxuproc 
Change:      chdev -l sys0 -a maxuproc=NewValue 


3. Actually, you should increase RAM memory as a structural solution, or decrease
the number of processes.


AIX create Ramdisk:
===================

Example:

# mkramdisk 40000 
# ls -l /dev | grep ram 
# mkfs -V jfs /dev/ramdiskx 
# mkdir /ramdiskx 
# mount -V jfs -o nointegrity /dev/ramdiskx /ramdiskx 


AIX 5.3 ping error:
===================

thread:

Q:

Hi all, 

If a normal use trys to ping to my workstation then it gives the followin error "0821-067 ping: 
The socket creation call failed.: The file access permissions do not allow the specified action" 

And in my workstation if i login as non-root user then if I ping to some other system it gives the same error..
whereas it is not so with root user. 

Any suggestions what can be the problem? 

A:

Hi, 

looks like problems in program file ping rights... in my AIX system i have the following for /usr/sbin/ping 

# ls -l /usr/sbin/ping 
-r-sr-xr-x 1 root system 31598 Dec 17 2002 /usr/sbin/ping 
# 
100kggoud

A:

Technote IBM

Cannot ping as Non-root user 
 Technote (FAQ) 
  
Problem 

When trying to ping as a user that is not root, the following error message was 
displayed:

0821-067 Ping: the socket creation call failed.
the file access permissions do not allow the specified
actions.

  
Solution 

--------------------------------------------------------------------------------

Environment
AIX Version 5.x Change the setuid bit permissions for /usr/sbin/ping. Enter: 
chmod 4555 /usr/sbin/ping


Root Password Recovery on Solaris : 
===================================


Go to the OK Prompt - by pressing Stop +A . Put the 1st cd for Solaris in the cdrom � 
At OK prompt give the command # boot cdrom -s � Now mount the boot device onto /a 
( To check boot device you can use df command) 

#mount /dev/dsk/c0t0d0s0 /a

Now open the password file and remove the password entry i.e. 
the second field root:$1$NYDu1c8p$Mdm2n6IPb9k14pP2s2FXZ.:13063:0:99999:7::: 

# vi /a/etc/shadow

Now unmount the /a mount point 

#umount /a

Reboot the server in single mode 

#ok boot -s

Give a new password for root: 

        #passwd
          New Password:
          Verify Password:

This will reset the password for root and you will be able to login to the box using this password. 


itm_ora_App2


AIX: 0403-006 Execute permission denied:
========================================

Note 1:
-------

thread

Q:

hello all,now,I want to exe a shell script,the result of command "ls 
-l",it's permission: 
-rwxr-x--x 

but i use the "./proname" to exe it,the result is: 
0403-006 Execute permission denied 


WHY?? the permission is all eXecute!!! 

A:

If permissions seems ok, then chcek this:
Make sure there are no "empty" lines with an "~" below the last statement, like

~


because "~" cannot be executed.


AIX: The certlist command:
==========================

You can use the certlist command without any parameters or flags, which will show
you all installed certificates for your account on your system.

The man page of certlist:

certlist Command
Purpose
certlist lists the contents of one or more certificates.

Syntax
certlist [-c] [-a attr [attr....] ]tag [username]

Description
The certlist command lists the contents of one or more certificates. Using the -c option causes the output 
to be formatted as colon-separated data with the attribute names associated with each 
field on the previous line as follows: 

# name: attribute1: attribute2: ... 
User: value1: value2: ... 

The -f option causes the output to be formatted in stanza file format with the username attribute 
given as the stanza name. Each attribute=value pair is listed on a separate line: 

user: 
     attribute1=value 
     attribute2=value 
     attribute3=value 

When neither of these command line options are selected, the attributes are output as attribute=value pairs.

The -a option selects a list of one or more certificate attributes to output. In addition to the attributes 
supported by the load module, several pseudo-attributes shall also be provided for each certificate.

Those attributes are:

auth_user User's authentication certificate. 
distinguished_name User's subject distinguished name in the certificate. 
alternate_name User's subject alternate name in the certificate. 
validafter The date the user's certificate becomes valid. 
validuntil The date the user's certificate becomes invalid. 
tag The name that uniquely identifies this certificate. 
issuer The distinguished name of the certificate issuer. 
label The label that identifies this certificate in the private keystore. 
keystore The location of the private keystore for the private key of the certificate. 
serialnumber The serial number of the certificate. 
verified true indicates that the user poved that he is in possession of the private key. 

Flags
-c Displays the output in colon-separated records. 
-f Displays the output in stanzas. 
-a attr Selects one or more attributes to be displayed. 

The tag parameter selects which of the user's certificates is to be output. The reserved value ALL indicates 
that all certificates for the user are to be listed.

The username parameter specifies the name of the AIX user to be queried. If invoked without the username parameter, 
the certdelete command uses the name of the current user.

Exit Status
0 If successful. 
EINVAL If the command is ill-formed or the arguments are invalid. 
ENOENT If a) the user doesn't exist, b) the tag does not exist c) the file does not exist. 
EACCES If the attribute cannot be listed, for example, if the invoker does not have read_access to the user data-base. 
EPERM If the user identification and authentication fails. 
errno If system error. 

Security
This command can be executed by any user in order to list the attributes of a certificate. 
Certificates listed may be owned by another user.

Audit
This command records the following event information:

CERT_List <username>

Examples
$ certlist -f -a verified keystore label signcert bob
bob:
      verified=false
      keystore=file:/var/pki/security/keys/bob
      label=signcert

$ certlist -c -a validafter validbefore issuer signcert bob
#name:validafter:validuntil:issuer
bob:1018091201:1018091301:c=US,o=xyz

$ certlist -f ALL bob
bob:
      auth_cert=logincert
      distinguished_name=c=US,o=xyz,cn=bob
      alternate_name=bob@xyz.com
      validafter=0921154701
      validuntil=0921154801
      issuer=c=US,o=xyz
      tag=logincert
      verified=true
      label=loginkey
      keystore=file:/var/pki/security/keys/bob
      serialnumber=03
bob:
      auth_cert=logincert
      distinguished_name=c=US,o=xyz,cn=bob
      alternate_name=bob@ibm.com
      validafter=1018091201
      validuntil=1018091301
      issuer=c=US,o=xyz
      tag=signcert
      verified=false
      label=signkey
      keystore=file:/var/pki/security/keys/bob
      serialnumber=02Files
/usr/lib/security/pki/acct.cfg

/usr/lib/security/pki/policy.cfg


SAM on HP-UX:
=============


The easiest way to administer HP-UX is to use Sam.
As root, simply type "sam"... easy, huh?
If you're in text-mode, you'll get a curses-based window, and if you're in CDE / VUE, you will get 
a new window on your workspace... Simply navigate your way through - 
you can do a lot of your administration via sam.

Some example screens in textmode sam

# sam
..
..
Starting the terminal version of sam...

To move around in sam:

- use the "Tab" key to move between screen elements
- use the arrow keys to move within screen elements
- use "Ctrl-F" for context-sensitive help anywhere in sam

On screens with a menubar at the top like this:

        ------------------------------------------------------
       |File View Options Actions                         Help|
       | ---- ---- ------- ------------------------------- ---|

- use "Tab" to move from the list to the menubar
- use the arrow keys to move around
- use "Return" to pull down a menu or select a menu item
- use "Tab" to move from the menubar to the list without selecting a menu item
- use the spacebar to select an item in the list

On any screen, press "CTRL-K" for more information on how to use the keyboard.


+ ===             System Administration Manager (gavnh300) (1)                 +
YFile View Options Actions                                                Help Y
Y                       Press CTRL-K for keyboard help.                        Y
YSAM Areas                                                                     Y
Y------------------------------------------------------------------------------Y
Y  Source   Area                                                               Y
Y+---------------------------------------------------------------------------+ Y
YY SAM      Accounts for Users and Groups ->                                 ^ Y
YY SAM      Auditing and Security         ->                                   Y
YY SAM      Backup and Recovery           ->                                   Y
YY SAM      Clusters                      ->                                   Y
YY SAM      Disks and File Systems        ->                                   Y
YY SAM      Display                       ->                                   Y
YY SAM      Kernel Configuration          ->                                   Y


YY SAM      Networking and Communications ->                                   Y
YY SAM      Performance Monitors          ->                                   Y
YY SAM      Peripheral Devices            ->                                   Y
YY SAM      Printers and Plotters         ->                                   Y
YY SAM      Process Management            ->                                   Y
YY Other    Resource Management           ->                                   Y
YY SAM      Routine Tasks                 ->                                 v Y
Y+---------------------------------------------------------------------------+ Y
Y                                                                              Y
+------------------------------------------------------------------------------+


Choose "Accounts for Users and Groups" and the following screen shows:

+ ===             System Administration Manager (gavnh300) (1)                 +
YFile View Options Actions                                                Help Y
Y                       Press CTRL-K for keyboard help.                        Y
YSAM Areas:Accounts for Users and Groups                                       Y
Y------------------------------------------------------------------------------Y
Y  Source   Area                                                               Y
Y+---------------------------------------------------------------------------+ Y
YY ..(go up)                                                                 ^ Y
YY SAM      Groups                                                             Y
YY SAM      Users                                                              Y
YY                                                                             Y
YY                                                                             Y
YY                                                                             Y
YY                                                                             Y
YY                                                                             Y
YY                                                                             Y
YY                                                                             Y
YY                                                                             Y
YY                                                                             Y
YY                                                                             Y
YY                                                                           v Y
Y+---------------------------------------------------------------------------+ Y
Y    Working...                                                                Y
+------------------------------------------------------------------------------+


+ ===             Accounts for Users and Groups (gavnh300) (1)                 +
YFile List View Options Actions                                           Help Y
Y                       Y Add...                                Y              Y
YTemplate In Use: None  Y User Templates ->                     Y              Y
YFiltering:  Displaying Y Task Customization...                 Y              Y
Y-----------------------Y ===================================== Y--------------Y
YUsers                  Y Modify...                             Yf 314 selectedY
Y-----------------------Y Remove...                             Y--------------Y
Y  Login      User ID   Y Modify Secondary Group Membership...  Yce      Of    Y
Y  Name         (UID)   Y Modify User's Password                Ye       Lo    Y
Y+----------------------Y Reset User's Password                 Y------------+ Y
YY ru1160        6243   Y ------------------------------------- YLDEV     R. ^ Y
YY sa2064        6975   Y Deactivate...                         YRATOR    I  Y Y
YY sa2194        8172   Y ------------------------------------- YLMAN     S  Y Y
YY sc3060        5318   Y Modify Security Policies...           YMAN      KE Y Y
YY sc4634        8140   Y Set Authorized Login Times...         YDONLY    JH Y Y
YY sc6228        7027   +---------------------------------------+LMAN     P.   Y
YY se1223        8170   SEL                       sysman      SYSMAN      VA Y Y
YY sh0403        7735   SHARIF                    oper        OPERATOR    T. Y Y
YY si1608        7479   Sinha                     support     APPLMAN     R  Y Y
YY si1624        7391   Sinha                     support     APPLMAN     A  v Y
Y <------------------------------------------------------------------------->+ Y
Y                                                                              Y
+------------------------------------------------------------------------------+


AIX 5L vmstat output issues:
============================

Suppose you see output like this:

[pl003][tdbaeduc][/dbms/tdbaeduc/educroca/admin/dump/bdump] vmstat -v
              1572864 memory pages
              1506463 lruable pages
                36494 free pages
                    7 memory pools
               336124 pinned pages
                 80.0 maxpin percentage
                 20.0 minperm percentage
                 80.0 maxperm percentage
                 43.4 numperm percentage
               654671 file pages
                  0.0 compressed percentage
                    0 compressed pages
                 45.8 numclient percentage
                 80.0 maxclient percentage
               690983 client pages
                    0 remote pageouts scheduled
                    0 pending disk I/Os blocked with no pbuf
         -->  8868259 paging space I/Os blocked with no psbuf
         -->     2740 filesystem I/Os blocked with no fsbuf
         -->    13175 client filesystem I/Os blocked with no fsbuf
         -->  319766 external pager filesystem I/Os blocked with no fsbuf


What is the meaning, and interpretation, of the outputlines like "pending disk I/Os blocked with no pbuf" ?

Note 1:
-------

http://www.circle4.com/jaqui/eserver/eserver-AugSep06-AIXPerformance.pdf
..
..

The last five lines of the vmstat -v report are useful when you're looking for I/O problems. The first line is 
for disk I/Os that were blocked because there were no pbufs. Pbufs are pinned memory buffers used 
to hold I/O requests at the logical volumemanager layer. Prior to AIX v5.3, this was a systemwide parameter. 
It's now tuneable on a volume-group basis using the lvmo command. The ioo parameter that controls the default 
number of pbufs to add when a disk is added to a volume groupis pv_min_pbuf, and it defaults to 512. 
This specifies the minimum number of pbufs per PV that the LVM uses, and it's a global value that applies to all 
VGs on the system. If you see the pbuf blocked I/Os field above increasing over time, you may want to use the 
lvmo -a command to find out which volume groups are having problems with pbufs and then slowly increase 
pbufs for that volume group using the lvmo command. A reasonable value could be 1024.

Paging space I/Os blocked with no psbuf refers to the number of paging space I/O requests blocked 
because no psbuf was available. These are pinned memory buffers used to hold I/O requests at the 
virtual memory manager layer. If you see these increasing, then you need to either find out why the system 
is paging or increase the size of the page datasets. Filesystem I/Os blocked with no fsbufs refers to the 
number of filesystem I/O requests blocked because no fsbuf was available. Fsbufs are pinned memory buffers 
used to hold I/O requests in the filesystem layer. If this is constantly increasing, then it may be necessary 
to use ioo to increase numfsbufs so that more bufstructs are available. The default numfsbufs value
is determined by the system and seems to normally default to 196. I regularly increase this to either 1,024 or 2,048.

Client filesystem I/Os blocked with no fsbuf refers to the number of client filesystem I/O requests blocked 
because no fsbuf was available. Fsbufs are pinned memory buffers used to hold I/O requests in the 
filesystem layer. This includes NFS, VxFS (Veritas) and GPFS filesystems. Finally, ext pager 
filesystem I/Os blocked with no fsbuf refers to the number of external pager client filesystem I/O requests 
blocked because no fsbuf was available. JFS2 is an external pager client filesystem. If I see this growing, 
I typically set j2_nBufferPerPagerDevice=1024


Note 2:
-------

thread:

Q:

we have I/O issue on the AIX box for our Oracle DB
the disks having the Database files are always 100% busy
and the wa column in vmstat hits above 50
and the vmstat -v show the I/O's being blocked
    2238141 pending disk I/Os blocked with no pbuf
  13963233 paging space I/Os blocked with no psbuf
  2740 filesystem I/Os blocked with no fsbuf
  1423313 client filesystem I/Os blocked with no fsbuf
 1128548 external pager filesystem I/Os blocked with no fsbuf
  
What does these indicate, short of real mem or does some kernal parameters need to be adjusted?

A:

I'd up the number of fsbufs per filesystem.

What are your current settings?

ioo -L|egrep 'numfsbufs|j2_nBufferPerPagerDevice'

numfsbufs is for jfs filesystems
j2_nBuffer... is for jfs2 filesystems

if I'm not mistaken.

Note, if you change these values, you have to umount/mount the filesystems to take effect. 
I.e. you have to bring Oracle down.

HTH,

p5wizard
 
 
p5wizard, Thanks, I dont have the access to it,i will get the SA to get me the output.
Are these figures cummulative since the last reboot of the box.
what a good setting for this 
 
dbinsight (TechnicalUser) 10 Mar 06 14:06  
ioo -L |egrep 'numfsbufs|j2_nBufferPerPagerDevice'       
numfsbufs                 512    196    512    1      2G-1                     M
j2_nBufferPerPagerDevice  512    512    512    0      2G-1

The above are our settings, Are these the default settings?

  
p5wizard (IS/IT--Management) 13 Mar 06 3:35  
answers to your questions:

yes, cumulative (so depends on how long the system's been running to interpret the values).

no, already been increased for jfs
yes, defaults for jfs2

1st value = current situation
2nd value = system default
3rd value = value for nextboot
 
# ioo -L|head -3; ioo -L|egrep 'numfs|j2_nBuff' 
NAME                      CUR    DEF    BOOT   MIN    MAX    UNIT           TYPE
     DEPENDENCIES
--------------------------------------------------------------------------------
j2_nBufferPerPagerDevice  512    512    512    0      256K                     M
numfsbufs                 400    196    400    1      2G-1                     M

But as I said before, doesn't help to increase 'em unless you unmount/mount the filesystems. 
As your SA has upped the 'NEXTBOOT' values, I guess (s)he knows about that. 

Run "topas 2" for a few iterations, and post that screenful please.
Also "vmo -L|egrep 'perm%|client%'" output please.

You have a very high value:
  13963233 paging space I/Os blocked with no psbuf
On my large DB servers this is close to zero.

Run "lsps -a" and post that output also please.

I googled for "aix psbufs" and found an Oracle AIX performance Technical Brief, here's an excerpt:

# vmstat -v | tail -5 (we only need last 5 lines)
0 pending disk I/Os blocked with no pbuf
  o for pbufs, increase pv_min_pbuf using ioo,
0 paging space I/Os blocked with no psbuf
  o for psbufs, stop paging or add more paging space,
8755 filesystem I/Os blocked with no fsbuf ?? JFS
  o for fsbufs, increase numfsbufs using ioo,
  o default is 196, recommended starting value is 568,
0 client filesystem I/Os blocked with no fsbuf (NFS/Veritas)
  o for client filesystem fsbufs, increase:
     nfso's nfs_v3_pdts and nfs_v3_vm_bufs
2365 external pager filesystem I/Os blocked with no fsbuf (JFS2)
  o for external pager fsbufs, increase:
     j2_nBufferPerPagerDevice, default is 512, recommended value is 2048,
     j2_dynamicBufferPreallocation using ioo.


Note 3:
-------

thread:

4.2) File System Buffers.  By default, the number of file system buffers is set to 196.  For high I/O systems,
this is typically too small.  To see if you are blocking I/O due to not having enough 
file system buffers, run: vmstat -v.  

For JFS file systems, look at the "filesystem I/Os blocked with no fsbuf" line.  
For JFS2 file systems, look at the "client filesystem I/Os blocked with no fsbuf" line.  

If these values are more than a couple thousand, you may need to increase the respective parameters.  
For JFS file systems, you will need to change the numfsbufs parameter.  For JFS2 file systems, 
change the  j2_nBufferPerPagerDevice parameter.  Changing this parameter does not require a reboot, 
but will only take effect when the file system is mounted, so you will have to unmount/mount the file system.

4.2) JFS Log Devices.  Heavily used filesystems should ALWAYS have their own JFS log on a 
separate physical disk.  All writes to a JFS (or JFS2) file system are written to the JFS log.  
By default, there is only one JFS log created for any volume group containing JFS file systems.  
This means that ALL writes to ALL the file systems in the volume group go to ONE PHYSICAL DISK!!  
(This is, unless, your underlying disk structure is striped or another form of RAID for performance.)  
Creating separate JFS logs on different physical disks is very important to getting the most out 
of the AIX I/O subsystem.
 
 
/usr/ccs/bin/shlap64:
=====================

The /usr/ccs/bin/shlap64 process is the Shared Library Support Daemon.

The muxatmd, snmpmibd and aixmibd are Simple Network Managaement
Protocol (SNMP) daemons for AIX. All can be turned off by commenting
out the entries that start them in /etc/rc.tcpip. shlap64 is part of
the 64-bit environment and needs to be running if you are using a
64-bit kernel.

The IBM.* programs are part of the Reliable Scalable Cluster
Technology which IBM added to AIX v5 from their SP cluster systems.
These programs provide additional system monitoring (and alerting if
configured to do so). You should probably leave them running.


/etc/ncs/glbd:
==============

glbd Daemon

Purpose
Manages the global location broker database.

Syntax
/etc/ncs/glbd [ -create { -first [-family FamilyName] | -from HostName } ] [  -change_family FamilyName ] 
[ -listen FamilyList] [ -version ]

Description
The glbd daemon manages the global location broker (GLB) database. The GLB database, part of the 
Network Computing System (NCS), helps clients to clients to locate servers on a network or internet. 
The GLB database stores the locations (specifically, the network addresses and port numbers) of servers 
on which processes are running. The glbd daemon maintains this database and provides access to it.

There are two versions of the GLB daemon, glbd and nrglbd.


RBAC:
=====

Role Based Access Control

AIXr has provided a limited RBAC implementation since AIX 4.2.1.

Most environments require that different users manage different system administration duties. 
It is necessary to maintain separation of these duties so that no single system management user 
can accidentally or maliciously bypass system security. While traditional UNIXr system administration 
cannot achieve these goals, Role Based Access Control can.

RBAC allows the creation of roles for system administration and the delegation of administrative tasks 
across a set of trusted system users. In AIXr, RBAC provides a mechanism through which the administrative 
functions typically reserved for the root user can be assigned to regular system users.

Beginning with AIX 6.1, a new implementation of RBAC provides for a very fine granular mechanism 
to segment system administration tasks. Since these two RBAC implementations differ greatly in functionality, 
the following terms are used:

-Legacy RBAC Mode 
 The historic behavior of AIX roles that was introduced in AIX 4.2.1 
-Enhanced RBAC Mode 
 The new implementation introduced with AIX 6.1 

Both modes of operation are supported. However, Enhanced RBAC Mode is the default on a newly installed AIX 6.1 system. 


llbd:
=====

llbd Daemon


Purpose
Manages the information in the local location broker database. 


Syntax
llbd [-family FamilyName] [ -version] 


Description
The llbd daemon is part of the Network Computing System (NCS). It manages the local location broker (LLB) database, 
which stores information about NCS-based server programs running on the local host. 

A host must run the llbd daemon to support the location broker forwarding function or to allow remote access 
(for example, by the lb_admin tool) to the LLB database. In general, any host that runs an NCS-based server 
program should run an llbd daemon, and llbd should be running before any such servers are started. 
Additionally, any network or internet supporting NCS activity should have at least one host running a 
global location broker daemon (glbd). 

The llbd daemon is started in one of two ways: 

Through the System Resource Controller (the recommended method), by entering on the command line: 

startsrc -s llbd

By a person with root user authority entering on the command line: 

/etc/ncs/llbd &

TCP/IP must be configured and running on your system before you start the llbd daemon. 
(You should start the llbd daemon before starting the glbd or nrglbd daemon.) 


tripwire:
=========

Tripwire data integrity assurance software monitors the reliability of critical system files and directories 
by identifying changes made to them. Tripwire configuration options include the ability to receive alerts 
via email if particular files are altered and automated integrity checking via a cron job. Using Tripwire for 
intrusion detection and damage assessment helps you keep track of system changes. Because Tripwire can 
positively identify files that have been added, modified, or deleted, it can speed recovery from a break-in 
by keeping the number of files which must be restored to a minimum. 

Tripwire compares files and directories against a database of file locations, dates modified, and other data. 
The database contains baselines, which are snapshots of specified files and directories at a specific 
point in time. The contents of the baseline database should be generated before the system is at risk 
of intrusion. After creating the baseline database, Tripwire then compares the current system to the baseline 
and reports any modifications, additions, or deletions. 

While Tripwire is a valuable tool for auditing the security state of Red Hat Linux systems, Tripwire is not 
supported by Red Hat, Inc. Refer to the Tripwire project's website (http://www.tripwire.org/) for more 
information about Tripwire. 


SA-Agent uctsp0:
================

CONTROL-SA is a client/server solution that enables you to manage security systems distributed across multiple
platforms. CONTROL-SA synchronizes accounts and passwords across those systems.
 
On AIX, you can find the binaries and files in /usr/lpp/uctsp0.
On HP, you can find the binaries and files in /usr/local/uctsp0.

To stop the agent:
# su - uctsp0 -c stop-ctsa

To start the agent:
# su - uctsp0 -c start-ctsa


EMC Documentum:
===============

General:
--------

The following components are associated with the Content Server:

- A database containing relationships that relate to stored content (on filesystems).
  This database thus contains metadata (and not content).

- file storage of the actual content being managed.
  The file store and database constitue the Documentum Repository abstraction, called Docbase.

- A set of key processes that implement the Documentum content management solution
  such as the Document Broker.

- A set of housekeeping utilities, including a Web-based Admin tool.


Client Connect:
---------------

Access to Documentum Docbases is controlled through the Documentum 
client file dmcl.ini.

You need to understand the architecture of Content Server and
docbroker and how DMCL connects. Content Server and docbroker are 2
separate processes which are started (usually but not necessarily) on
the same machine. Since they are separate processes they listen on
different ports. Docbroker usually listens on a well-known port 1489.
Content Server will listen on the port you configure in the services
file. When you issue a DMCL connect (which is what DAB does - it is a
documentum client using DMCL) the DMCL first locates a docbroker. It
asks the docbroker for a list of docbases it knows about. The
docbroker returns the names of the docbases and provides details of
the Content Server(s) that service the docbase. This includes the host
and port details. The DMCL then issues requests directly to the
Content Server bypassing the docbroker. Thus you need both the
docbroker and the Content Server port open.


BPS http Listener (Installation and Configuration)
--------------------------------------------------
What is BPS 

Business Process Services (BPS) provides the gateway to access Docbase for a non Documentum user. 
It allows HTTP, SMTP of JMS message to be stored directly in the Docbase. When an http, SMTP or JMS message 
is sent to BPS http listener Servlet URL, email address or JMS queue; the listener intercepts the message 
and processes it to a protocol neutral format. The message is then passed to the BPS message handler. 
The message handler opens a connection to the Docbase and stores the message as virtual document. 
The attachment gets stored as child nodes of the virtual document. 

How http Listener Works 

The http message listener is implemented as a Servlet. It gets installed as a part of BPS web application. 
The URL to access the http listener Servlet is 

http://<servername>:<portnumber>/bps/http/http 

or 

https://<servername>:<portnumber>/bps/http/http 

As you can see from the URL, the http listener can use both http and https protocol. However it should 
be kept in mind that application server uses two separate ports to communicate with http and https protocol. 
If we provide http protocol port number (say 8080) to construct the https URL, it will not work. 
This is a common error one can make while configuring BPS http listener. In the following pages we will 
step through the installation, configuration and testing of BPS http listener. 


Configuring the BPS Handlers 

BPS configuration for handlers are kept in default.xml file. It is be located at drive:\Documentum\config\bps 
1) Navigate to The default.xml file and open it for edit using a ASCII editor like Notepad. 
2) Configuring <connections> element of this file. In the <connections> element we have to provide 
   the connection details to Docbase, such as Docbasename, username, password etc. A sample <connections> 
   element entry should look like this:- 

<connections> 
 <docbase-connection name="connection"> 
  <docbase-name>zebra</docbasename> 
  <user-name>dmadmin</user-name> 
  <password>mypassword</password> 
 </docbase-connection> 
</connections> 

3) Configuring <handlers> element. 

This element specifies the message handlers available to BPS message listeners. Many out of the box handlers 
are provided with BPS but all of them are disabled by surrounding them within XML comment tag <!- -> 

Either enable some of them or point towards your own custom handler class like this 

<handlers> 
.. 
.. 
<handler name="LinkToFolderExample"> 
<service-name> 
com.documentum.bps.handlers.LinkToFolderService 
</service-name> 
<params> 
<param name="folderName" value="/bpsinbound/"/> 
</params> 
</handler> 
</handlers> 

4) Configuring <listeners> element 

Listeners element turns on the SSL capabilities of Local and remote listeners. Set the <allow-non-ssl> 
flag true or false as per your requirements. For our http listener test, we would use non-ssl connection, 
so make sure the value for the element is "true". 

 
5) Save the default.xml. 

A complete default.xml file for our test setup will look something like this. Replace the bold letters 
with your own Docbase, user, password and connection names. You can have multiple connections defined for 
multiple docbases. 

?xml version="1.0? encoding="UTF-8??> 

<config xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"> 

<processors> 
<local name="default"/> 
</processors> 
<connections> 
<docbase-connection name="myconnection1?> 
<docbase-name>mydocbase1</docbase-name> 
<user-name>myname1</user-name> 
<password>mypassword1</password> 
</docbase-connection> 
<docbase-connection name="myconnection2?> 
<docbase-name>mydocbase2</docbase-name> 
<user-name>myname2</user-name> 
<password>mypassword2</password> 
</docbase-connection> 
</connections> 
<handlers> 
<handler name="ErrorHandlerService"> 
<service-name>com.documentum.bps.handlers.ErrorHandlerService</service-name> 
</handler> 
<handler name="Redirector"> 
<service-name>com.documentum.bps.handlers.SubjectMessageFilter</service-name> 
</handler> 
<handler name="LinkToFolderExample"> 
<service-name>com.documentum.bps.handlers.LinkToFolderService</service-name> 
<params> 
<param name="folderName" value="/bpsinbound"/> 
</params> 
</handler> 
</handlers> 
<listeners> 
<http-listener> 
<local-listener> 
<allow-non-ssl>true</allow-non-ssl> 
</local-listener> 
<remote-listener> 
<allow-non-ssl>true</allow-non-ssl> 
</remote-listener> 
</http-listener> 
</listeners> 
</config> 

Creating a Test html Page 

We would need a test html page to test the http listener. Create html page out of the code provided below. 
This simple page submits a form to http listener after populating BPS http listener parameters in the form parameters. 

<HTML> 
<h1>BPS http listener and LinkToFolder handler test</h1> 
<form method="post" enctype="multipart/form-data" ACTION="http://localhost:8080/bps/http/http"> 
<input type="hidden" name="DctmBpsHandler" value="LinkToFolderExample"> 
<input type="hidden" name="DctmBpsId" value="4b08ac1980001d29?> 
Connection name: <input type="text" name="DctmBpsConnection" size="20? ><br/> 
File1 to upload: <input type="file" name="file to upload1? id="file1? size="20?> 
<br/> 
File2 to upload: <input type="file" name="file to upload2? id="file2? size="20?> 
<br/> 
<br/> 
<input type="submit" value="Submit"> 
</form> 
</HTML> 

Create a file called test.html out of this code and save it in the bps root folder. 

Testing the Application 

Start the application server where BPS is deployed and then invoke the html page by typing the following URL 
in your browser 

http://<servername>:<portnumber>/bps/test.html 


A page should appear in your browser. If not then please check if your application server is running or 
if it has been installed properly 

Fill up the connection name such as myconnection1 and then select a file to upload and then hit submit. 
This will cause the html form to be submitted to the BPS http listener, which will pass the message 
to LinkToFolder message handler and the file will be stored in bpsinbound folder. Once message handler succeeded, 
it will present a success page.

Locating the Saved Message in the Docbase 

We have configured the LinkToFolder handler to save the message to bpsinbound folder. If you browse 
to the bpsinbound folder, you will found a new virtual document has been created by the LinkToFolder handler. 


Expanding the root virtual document will show the attached file. 

Summary- BPS http Installation and Configuration 

BPS http listener can be installed by selecting proper option in the BPS installer. To run the 
http listener, you will require an application server like Tomcat. The handler is implemented as Servlet. 
Before using the listener and the message handlers, BPS default.xml file needs to be configured. 
Please follow the instruction provided in this Whitepaper.to configure the default.xml file. Once it is configured; 
the http listener is ready for test. Use the test.html file provided in this White Paper to test 
the http listener


Example start and stop of Documentum processes:
-----------------------------------------------

TAKE NOTICE:

First of all, on any documentum server, find the account of the software owner.
Since there are serveral accounts, depending on the site, you must check this
before starting or stopping a Service.
You can allways check for the correct owner by looking at the owner of the
"/appl/emcdctm" directory

Example: on ZD111L13 you check

root@zd111l13:/appl#ls -al
total 16
drwxr-xr-x   4 root     staff           256 Jul 13 15:43 .
drwxr-xr-x  24 root     system         4096 Aug 21 15:09 ..
drwxr-xr-x  13 emcdmeu  emcdgeu        4096 Aug  9 15:04 emcdctm
drwxr-xr-x   3 root     staff           256 Jun 29 15:35 oracle

Now you do a swich user to the owner. In the example it would be "su - emcdmeu"

If you logon as the software owner (e.g."su - emcdmeu"), you have several environment variables
available, like $DOCUMENTUM which points to "/appl/emcdctm".


1. Docbroker processes:
-----------------------

Start
$DOCUMENTUM/dba/dm_launch_Docbroker

Stop
$DOCUMENTUM/dba/dm_stop_Docbroker

the startup calls 
./dmdocbroker -port 1489 $@  >> $tlogfile 2>&1 & 
(/product/5.3/bin/dmdocbroker)

Logs:
tail -f $DOCUMENTUM/dba/log/docbroker.<host name>.1489.log
* for example
tail -f $DOCUMENTUM/dba/log/docbroker.ZD110L12.nl.eu.abnamro.com.1489.log


2. Content Server:
------------------ 

Content servers have Docbrokers and a "Java Method Server"
There is also a service for each repository that has been installed


Start
$DOCUMENTUM/dba/dm_launch_Docbroker
$DOCUMENTUM/dba/dm_start_dmwpreu1
$DM_HOME/tomcat/bin/startup.sh

$DM_HOME/tomcat/bin/shutdown.sh
$DOCUMENTUM/dba/dm_shutdown_dmwpreu1
$DOCUMENTUM/dba/dm_stop_Docbroker

Stop
$DM_HOME/tomcat/bin/shutdown.sh
$DOCUMENTUM/dba/dm_shutdown_dmw_eu
$DOCUMENTUM/dba/dm_stop_Docbroker

Or if there are 2 filestores, like in ETNL:

Start
$DOCUMENTUM/dba/dm_launch_Docbroker
$DOCUMENTUM/dba/dm_start_dmw_et
$DOCUMENTUM/dba/dm_start_dmw_et3
$DM_HOME/tomcat/bin/startup.sh

Stop
$DM_HOME/tomcat/bin/shutdown.sh
$DOCUMENTUM/dba/dm_shutdown_dmw_et
$DOCUMENTUM/dba/dm_shutdown_dmw_et3
$DOCUMENTUM/dba/dm_stop_Docbroker

Logs
*Repository
tail -f $DOCUMENTUM/dba/log/dmw_et.log
*JMS
tail -f $DM_HOME/tomcat/logs/catalina.out


Or:

1) kill all processes that are being run by emcdm user. 
2) Run the following commands as user emcdm:

$DOCUMENTUM/dba/dm_launch_Docbroker
$DOCUMENTUM/dba/dm_start_dmw_et
$DM_HOME/tomcat/bin/startup.sh


3. BPS:
-------


Start
#	As user {NL} emcdm, or {EU} wasemceu
cd $DOCUMENTUM/dfc/bps/inbound/bin
./start_jms_listener.sh

Better is:

nohup ./start_jms_listener.sh &

Stop
#	As user {NL} emcdm, or {EU} wasemceu
ps -ef | grep bps
kill -9 <process id>


4. Index Server:
----------------

 Indexer - server IX
 Index servers have 3 services: Docbroker, Index Server, 
 and Index Agent {per repository}

Start
$DOCUMENTUM/dba/dm_launch_Docbroker
$DOCUMENTUM/fulltext/IndexServer/bin/startup.sh
$DOCUMENTUM_SHARED/IndexAgents/IndexAgent1/startupIndexAgent.sh

Stop
$DOCUMENTUM_SHARED/IndexAgents/IndexAgent1/shutdownIndexAgent.sh
$DOCUMENTUM/fulltext/IndexServer/bin/shutdown.sh
$DOCUMENTUM/dba/dm_stop_Docbroker
 
Logs
tail -f $DOCUMENTUM/dfc/logs/IndexAgent1.log


5. Websphere:
-------------

example 1: Syntax if rc.appserver exists:

su - wasemceu

/etc/rc.appserver start ETM1DAE
/etc/rc.appserver start ETM1DEU
/etc/rc.appserver stop ETM1DEU

/etc/rc.appserver stop ETM1DAN
/etc/rc.appserver stop ETM1DNL

/etc/rc.appserver start ETM1DAN
/etc/rc.appserver start ETM1DNL

example 2: Syntax if just using websphere scripts:

START:

/appl/was51/bin/startNode.sh
/appl/was51/bin/startServer.sh server1
/appl/was51/bin/startServer.sh STM1DNL
/appl/was51/bin/startServer.sh STM1DAN

tail -f /beheer/log/was51/server1/SystemOut.log
tail -f /beheer/log/was51/STM1DNL/SystemOut.log
tail -f /beheer/log/was51/STM1DNL/SystemErr.log
tail -f /beheer/log/was51/STM1DAN/SystemOut.log

STOP:

/appl/was51/bin/stopServer.sh STM1DAN
/appl/was51/bin/stopServer.sh STM1DNL

/appl/was51/bin/stopServer.sh server1
/appl/was51/bin/stopNode.sh


Backup options EMC Documentum:
------------------------------

1 cold backup:
--------------

If you want to backup a docbase the Documentum recommended way is:

1) Stop the Content Server 
2) Stop the database
3) Backup the database using standard database or OS tools as appropriate for your database
4) Backup the Content Store(s) using OS tools.

Using is referred to as a full, cold backup. There are options for hot and/or incremental backups but it does get 
more complicated (and possibly expensive). The full,cold backup is the simplest option available.


2. Online Hot backup:
---------------------

2.1 CYA hot backup software 


2.2 Platform Dynamics:  Recovery management for EMC Documentum


2.3 EMC NetWorker Module for Documentum 


Catalina:
=========
 
The Java Serlvet womb part of Apache Tomcat server. It lets Java Servlets handle HTTP requests. 
Catalina is the name of the Java class of Tomcat from version 4.0
Tomcat's servlet container was redesigned as Catalina in Tomcat version 4.x


XMWLM:
======

Note 1:
-------

xmwlm Command

Purpose

       Provides recording of system performance or WLM metrics.

Syntax

       xmwlm [ -d recording_dir ] [ -n recording_name ] [ -t trace_level ] [ -L ]

Description

The xmwlm agent provides recording capability for a limited set of local system performance metrics. These include
common CPU, memory, network, disk, and partition metrics typically displayed by the topas command. Daily recordings
are stored in the /etc/perf/daily directory. The topasout command is used to output these recordings in raw ASCII or
speadsheet format. The xmwlm agent can also be used to provide recording data from Workload Management (WLM). This is
the default format used when xmwlm is run without any flags. Daily recordings are stored in the /etc/perf/wlm
directory. The wlmmon command can be used to process WLM-related recordings. The xmwlm agent can be started from the
command line, from a user script, or can be placed near the end of the /etc/inittab file. All recordings cover 24-
hour periods and are only retained for two days.

#ps -ef | grep -i xmwlm
root 266378      1   0   Aug 06      - 272:17 /usr/bin/xmwlm -L


Note 2:
-------


IY78009: XMWLM HIGH RESOURCE CONSUMPTION, TOPASOUT COUNTERS 

 A fix is available 
Obtain fix for this APAR


APAR status
Closed as program error.

Error description 
xmwlm daemon may consume well over 1% of CPU resources
some disk counter values may be inaccurate in topasout output
Local fix 
Problem summary 
xmwlm daemon may consume well over 1% of CPU resources
some disk counter values may be inaccurate in topasout output
Problem conclusion 
Reduce SPMI instrumentations internal polling frequency for
filesystem metrics.  Update topasout for certain counter data
types.
Temporary fix 
Comments 
APAR information 
APAR number IY78009 
Reported component name AIX 5.3 
Reported component ID 5765G0300 
Reported release 530 
Status CLOSED PER 
PE NoPE 
HIPER NoHIPER 
Submitted date 2005-10-21 
Closed date 2005-10-21 
Last modified date 2005-11-17 

APAR is sysrouted FROM one or more of the following:

APAR is sysrouted TO one or more of the following:

Publications Referenced


Fix information 
Fixed component name AIX 5.3 
Fixed component ID 5765G0300 
 
 
Note 3:
-------

IY95912: XMWLM LOOPING IN SIGNAL HANDLERS INFINITELY 


     Subscribe to this APAR 
By subscribing, you will receive periodic email alerting you to the status of the APAR, and a link to download the fix once it becomes available.
 
 
 A specific fix for this item is not yet available electronically 
This record will be updated with a link to the fix if the APAR is new.
For APARs older than 365 days, contact your support center.
 

APAR status
Closed as program error.

Error description 
High cpu consumption by xmwlm
Local fix 
Problem summary 
High cpu consumption by xmwlm
Problem conclusion 
Stop xmwlm from looking infinitely in signal handler and
avoid xmwlm from crashing when it has to record more than
4096 metrics by recording only 4096 metrics at max.
Temporary fix 
Comments 
APAR information 
APAR number IY95912 
Reported component name AIX 5.3 
Reported component ID 5765G0300 
Reported release 530 
Status CLOSED PER 
PE NoPE 
HIPER NoHIPER 
Submitted date 2007-03-11 
Closed date 2007-03-11 
Last modified date 2007-03-15 

APAR is sysrouted FROM one or more of the following:

APAR is sysrouted TO one or more of the following:
IY96091
 
 
Second superuser:
=================


For safety reasons, you might want to have a second root user on your system.


Note 1:
-------

-- Creating a second root user

Follow these steps to create a second root user: 

Create a user. 
Manually edit the user ID field and group ID field in the /etc/passwd file. 
Change the user ID to ID 0. 
For a typical user ID, for example, change the entry from: 
   russ:!:206:1::/u/russ:/bin/ksh 

to 
   russ:!:0:0::/u/russ:/bin/ksh 

This creates a user (in this case, russ) with identical permissions to root. 


-- Creating special users with root authority

Special users that have root authority but can only execute one command may also be created. For instance, 
to create a user that can only reboot the system, create a regular user called shutdown and modify the /etc/passwd 
command to change the user and group ID to 0. For example, in AIX 3.2: 

   shutdown:!:0:0::/u/shutdown:/bin/ksh 

Change the initial program from /bin/ksh to /etc/shutdown -Fr: 

   shutdown:!:0:0::/u/shutdown:/etc/shutdown -Fr 

For AIX 4, the /etc/passwd entry for the user called shutdown should be: 

   shutdown:!:0:0::/u/shutdown:/usr/sbin/shutdown -Fr 

The shutdown command on AIX Version 4.1 is located in /usr/sbin. 
Now when user shutdown logs in, the system will shut down and reboot. 


Base AIX error codes:
=====================


Appendix A. Base Operating System Error Codes for Services That Require Path-Name Resolution
The following errors apply to any service that requires path name resolution:

EACCES	 Search permission is denied on a component of the path prefix. 
EFAULT	 The Path parameter points outside of the allocated address space of the process. 
EIO	 An I/O error occurred during the operation. 
ELOOP	 Too many symbolic links were encountered in translating the Path parameter. 
ENAMETOOLONG A component of a path name exceeded 255 characters and the process has the DisallowTruncation attribute (see the ulimit subroutine) or an entire path name exceeded 1023 characters. 
ENOENT	 A component of the path prefix does not exist. 
ENOENT	 A symbolic link was named, but the file to which it refers does not exist. 
ENOENT	 The path name is null. 
ENOTDIR	 A component of the path prefix is not a directory. 
ESTALE	 The root or current directory of the process is located in a virtual file system that is unmounted. 


clsprod@haflinger:/usr/include $ cat errlog.h
/* IBM_PROLOG_BEGIN_TAG                                                   */
/* This is an automatically generated prolog.                             */
/*                                                                        */
/* bos53D src/bos/usr/ccs/lib/liberrlog/errlog.h 1.7                      */
/*                                                                        */
/* Licensed Materials - Property of IBM                                   */
/*                                                                        */
/* Restricted Materials of IBM                                            */
/*                                                                        */
/* (C) COPYRIGHT International Business Machines Corp. 2000,2005          */
/* All Rights Reserved                                                    */
/*                                                                        */
/* US Government Users Restricted Rights - Use, duplication or            */
/* disclosure restricted by GSA ADP Schedule Contract with IBM Corp.      */
/*                                                                        */
/* IBM_PROLOG_END_TAG                                                     */
#ifndef H_errlog
#define H_errlog
/* @(#)74        1.7  src/bos/usr/ccs/lib/liberrlog/errlog.h, cmderrlg, bos53D, d2005_09B1 2/24/05 15:34:58 */

/*
 * COMPONENT_NAME: CMDERRLG   system error logging and reporting facility
 *
 * External definitions and declarations for liberrlog.a
 *
 */


#include <sys/types.h>
#include <sys/err_rec.h>

typedef void *errlog_handle_t;

/*
 *  These magic numbers will indicate which version of errlog
 *  entry is being returned.
 *  All users of errlog_entry_t should use only LE_MAGIC.
 */
#define LE_MAGIC_41 0x0C3DF420
/* LE_MAGIC434_INTERUM is an interum 43T magic, before le_errdiag was added. */
#define LE_MAGIC434_INTERUM 0x0C3DF434
#define LE_MAGIC434 0x0C4DF434
#define LE_MAGIC52F 0x0C4DF52F
#define LE_MAGIC53D 0x0C4DF53D
#define LE_MAGIC   LE_MAGIC53D          /* current errlog_open magic # */
/* VALID_LE_MAGIC gives valid magic numbers for an error log record. */
#define VALID_LE_MAGIC(m) (((m) == LE_MAGIC_41) || \
                ((m) == LE_MAGIC434_INTERUM) || ((m) == LE_MAGIC434))
/* VALID_LENTRY_MAGIC gives valid magic numbers for errlog_open(). */
#define VALID_LENTRY_MAGIC(m) (((m) == LE_MAGIC) || ((m) == LE_MAGIC434) ||\
                               ((m) == LE_MAGIC52F))

/*
 * Optional duplicate information.
 */
struct errdup {
    unsigned int        ed_dupcount;
    time32_t            ed_time1;
    time32_t            ed_time2;
};

/* Lengths of the various fields in the structure. */
#define LE_LABEL_MAX            20
#define LE_MACHINE_ID_MAX       32
#define LE_NODE_ID_MAX          32
#define LE_CLASS_MAX            2
#define LE_TYPE_MAX             5
#define LE_RESOURCE_MAX         16
#define LE_RCLASS_MAX           16
#define LE_RTYPE_MAX            16
#define LE_VPD_MAX              512
#define LE_IN_MAX               256
#define LE_CONN_MAX             20
#define LE_DETAIL_MAX           ERR_REC_MAX
#define LE_SYMPTOM_MAX          312
#define LE_ERRDUP_MAX           sizeof(struct errdup)

/* The data structure that contains an errlog entry */
typedef struct errlog_entry {
    unsigned int        el_magic;
    unsigned int        el_sequence;
    char                el_label[LE_LABEL_MAX];
    unsigned int        el_timestamp;
    unsigned int        el_crcid;
    unsigned int        el_errdiag;
    char                el_machineid[LE_MACHINE_ID_MAX];
    char                el_nodeid[LE_NODE_ID_MAX];
    char                el_class[LE_CLASS_MAX];
    char                el_type[LE_TYPE_MAX];
    char                el_resource[LE_RESOURCE_MAX];
    char                el_rclass[LE_RCLASS_MAX];
    char                el_rtype[LE_RTYPE_MAX];
    char                el_vpd_ibm[LE_VPD_MAX];
    char                el_vpd_user[LE_VPD_MAX];
    char                el_in[LE_IN_MAX];
    char                el_connwhere[LE_CONN_MAX];
    unsigned short      el_flags;
    unsigned short      el_detail_length;
    char                el_detail_data[LE_DETAIL_MAX];
    unsigned int        el_symptom_length;
    char                el_symptom_data[LE_SYMPTOM_MAX];
    struct errdup       el_errdup;
} errlog_entry_t;


/* Values for the el_flags element. */
#define LE_FLAG_ERR64           0x01
#define LE_FLAG_ERRDUP          0x100

/*
 *  This structure is used to pass search criteria to errlog_find_first.

 *  To use it an operation is put in em_op.  If it is a leaf operation,
 *  the field in errlog_entry_t to apply the op to is put in em_field and
 *  the value to compare against is put in em_strvalue or em_intvalue.
 *  Boolean values are put in em_intvalue.
 *
 *  To connect operations, a unary or binary operator is put in em_op.
 *  The operation(s) to apply the operator to are put in em_left and,
 *  if it's a binary operator, em_right.
 */

typedef struct errlog_match {
    unsigned int                em_op;
    union {
        struct errlog_match     *emu_left;
        unsigned int            emu_field;
    } emu1;
    union {
        struct errlog_match     *emu_right;
        unsigned int            emu_intvalue;
        unsigned char           *emu_strvalue;
    } emu2;
} errlog_match_t;

#define em_left         emu1.emu_left
#define em_field        emu1.emu_field
#define em_right        emu2.emu_right
#define em_intvalue     emu2.emu_intvalue
#define em_strvalue     emu2.emu_strvalue

/* Operators to use in the match structures for the find functions */
#define LE_OP_EQUAL             0x01
#define LE_OP_NE                0x02
#define LE_OP_SUBSTR            0x03
#define LE_OP_LT                0x04
#define LE_OP_LE                0x05
#define LE_OP_GT                0x06
#define LE_OP_GE                0x07
#define LE_OP_LEAF              0x100
#define LE_OP_NOT               0x101
#define LE_OP_AND               0x201
#define LE_OP_OR                0x202
#define LE_OP_XOR               0x203

/* Flags to combine with the field id to indicate the data type of the field */
#define LE_TYPE                 0xff00
#define LE_TYPE_INT             0x0100
#define LE_TYPE_STRING          0x0200
#define LE_TYPE_BOOLEAN         0x0300

/* Flags to indicate which field to match in the find functions. */
#define LE_MATCH_FIELD          0xff
#define LE_MATCH_SEQUENCE       (0x01|LE_TYPE_INT)
#define LE_MATCH_LABEL          (0x02|LE_TYPE_STRING)
#define LE_MATCH_TIMESTAMP      (0x03|LE_TYPE_INT)
#define LE_MATCH_CRCID          (0x04|LE_TYPE_INT)
#define LE_MATCH_MACHINEID      (0x05|LE_TYPE_STRING)
#define LE_MATCH_NODEID         (0x06|LE_TYPE_STRING)
#define LE_MATCH_CLASS          (0x07|LE_TYPE_STRING)
#define LE_MATCH_TYPE           (0x08|LE_TYPE_STRING)
#define LE_MATCH_RESOURCE       (0x09|LE_TYPE_STRING)
#define LE_MATCH_RCLASS         (0x0a|LE_TYPE_STRING)
#define LE_MATCH_RTYPE          (0x0b|LE_TYPE_STRING)
#define LE_MATCH_VPD_IBM        (0x0c|LE_TYPE_STRING)
#define LE_MATCH_VPD_USER       (0x0d|LE_TYPE_STRING)
#define LE_MATCH_IN             (0x0e|LE_TYPE_STRING)
#define LE_MATCH_CONNWHERE      (0x0f|LE_TYPE_STRING)
#define LE_MATCH_FLAG_ERR64     (0x10|LE_TYPE_BOOLEAN)
#define LE_MATCH_FLAG_ERRDUP    (0x11|LE_TYPE_BOOLEAN)
#define LE_MATCH_DETAIL_DATA    (0x12|LE_TYPE_STRING)
#define LE_MATCH_SYMPTOM_DATA   (0x13|LE_TYPE_STRING)
#define LE_MATCH_ERRDIAG        (0x14|LE_TYPE_INT)

/*
 *  Define the directions find can walk through the errlog file.
 */

#define LE_FORWARD              0x01
#define LE_REVERSE              0x02

/*
 * Define the errors that the functions can return.
 */

#define LE_ERR_INVARG   0x01            /* Invalid input argument */
#define LE_ERR_NOFILE   0x02            /* The errlog file can't be opened */
#define LE_ERR_INVFILE  0x03            /* The errlog file isn't valid */
#define LE_ERR_NOMEM    0x04            /* We're out of memory */
#define LE_ERR_NOWRITE  0x05            /* Can't write entry back */
#define LE_ERR_IO       0x06            /* IO error in the errlog file */
#define LE_ERR_DONE     0x07            /* The find function reached the end */

/*
 * These are the functions that comprise the API
 */
extern int errlog_open(char             *path,
                       int              mode,
                       unsigned int     magic,
                       errlog_handle_t  *handle);

extern int errlog_close(errlog_handle_t handle);

extern int errlog_find_first(errlog_handle_t    handle,
                             errlog_match_t     *filter,
                             errlog_entry_t     *result);

extern int errlog_find_next(errlog_handle_t     handle,
                            errlog_entry_t      *result);

extern int errlog_find_sequence(errlog_handle_t handle,
                                int             sequence,
                                errlog_entry_t  *result);

extern int errlog_set_direction(errlog_handle_t handle,
                                int             direction);

extern int errlog_write(errlog_handle_t         handle,
                        errlog_entry_t          *data);

#endif
clsprod@haflinger:/usr/include $


clsprod@haflinger:/usr/include/sys $ cat errno.h
/* IBM_PROLOG_BEGIN_TAG                                                   */
/* This is an automatically generated prolog.                             */
/*                                                                        */
/* bos530 src/bos/kernel/sys/errno.h 1.27.1.23                            */
/*                                                                        */
/* Licensed Materials - Property of IBM                                   */
/*                                                                        */
/* (C) COPYRIGHT International Business Machines Corp. 1985,1995          */
/* All Rights Reserved                                                    */
/*                                                                        */
/* US Government Users Restricted Rights - Use, duplication or            */
/* disclosure restricted by GSA ADP Schedule Contract with IBM Corp.      */
/*                                                                        */
/* IBM_PROLOG_END_TAG                                                     */
/* @(#)49       1.27.1.23  src/bos/kernel/sys/errno.h, incstd, bos530 1/25/01 16:31:11 */
/*
 * COMPONENT_NAME: (INCSTD) Standard Include Files
 *
 * FUNCTIONS:
 *
 * ORIGINS: 27,71
 *
 * (C) COPYRIGHT International Business Machines Corp. 1985, 1996
 * All Rights Reserved
 * Licensed Materials - Property of IBM
 *
 * US Government Users Restricted Rights - Use, duplication or
 * disclosure restricted by GSA ADP Schedule Contract with IBM Corp.
 */
/*
 * (c) Copyright 1990, 1991, 1992 OPEN SOFTWARE FOUNDATION, INC.
 * ALL RIGHTS RESERVED
 */

#ifndef _H_ERRNO
#define _H_ERRNO
#include <standards.h>

/*
 *      Error codes
 *
 *      The ANSI, POSIX, and XOPEN standards require that certain values be
 *      in errno.h.  The standards allow additional macro definitions,
 *      beginning with an E and an uppercase letter.
 *
 */

#ifdef _ANSI_C_SOURCE

#ifndef _KERNEL

#if defined(_THREAD_SAFE) || defined(_THREAD_SAFE_ERRNO)
/*
 * Per thread errno is provided by the threads provider. Both the extern int
 * and the per thread value must be maintained by the threads library.
 */
extern  int     *_Errno( void );
#define errno   (*_Errno())

#else

extern int errno;

#endif  /* _THREAD_SAFE || _THREAD_SAFE_ERRNO */

#endif  /* _KERNEL */

#ifdef _ALL_SOURCE

extern  char    *sys_errlist[];
extern  int     sys_nerr;

#endif /* _ALL_SOURCE */

#define EPERM   1       /* Operation not permitted              */
#define ENOENT  2       /* No such file or directory            */
#define ESRCH   3       /* No such process                      */
#define EINTR   4       /* interrupted system call              */
#define EIO     5       /* I/O error                            */
#define ENXIO   6       /* No such device or address            */
#define E2BIG   7       /* Arg list too long                    */
#define ENOEXEC 8       /* Exec format error                    */
#define EBADF   9       /* Bad file descriptor                  */
#define ECHILD  10      /* No child processes                   */
#define EAGAIN  11      /* Resource temporarily unavailable     */
#define ENOMEM  12      /* Not enough space                     */
#define EACCES  13      /* Permission denied                    */
#define EFAULT  14      /* Bad address                          */
#define ENOTBLK 15      /* Block device required                */
#define EBUSY   16      /* Resource busy                        */
#define EEXIST  17      /* File exists                          */
#define EXDEV   18      /* Improper link                        */
#define ENODEV  19      /* No such device                       */
#define ENOTDIR 20      /* Not a directory                      */
#define EISDIR  21      /* Is a directory                       */
#define EINVAL  22      /* Invalid argument                     */
#define ENFILE  23      /* Too many open files in system        */
#define EMFILE  24      /* Too many open files                  */
#define ENOTTY  25      /* Inappropriate I/O control operation  */
#define ETXTBSY 26      /* Text file busy                       */
#define EFBIG   27      /* File too large                       */
#define ENOSPC  28      /* No space left on device              */
#define ESPIPE  29      /* Invalid seek                         */
#define EROFS   30      /* Read only file system                */
#define EMLINK  31      /* Too many links                       */
#define EPIPE   32      /* Broken pipe                          */
#define EDOM    33      /* Domain error within math function    */
#define ERANGE  34      /* Result too large                     */
#define ENOMSG  35      /* No message of desired type           */
#define EIDRM   36      /* Identifier removed                   */
#define ECHRNG  37      /* Channel number out of range          */
#define EL2NSYNC 38     /* Level 2 not synchronized             */
#define EL3HLT  39      /* Level 3 halted                       */
#define EL3RST  40      /* Level 3 reset                        */
#define ELNRNG  41      /* Link number out of range             */
#define EUNATCH 42      /* Protocol driver not attached         */
#define ENOCSI  43      /* No CSI structure available           */
#define EL2HLT  44      /* Level 2 halted                       */
#define EDEADLK 45      /* Resource deadlock avoided            */

#define ENOTREADY       46      /* Device not ready             */
#define EWRPROTECT      47      /* Write-protected media        */
#define EFORMAT         48      /* Unformatted media            */

#define ENOLCK          49      /* No locks available           */

#define ENOCONNECT      50      /* no connection                */
#define ESTALE          52      /* no filesystem                */
#define EDIST           53      /* old, currently unused AIX errno*/

/* non-blocking and interrupt i/o */
/*
 * AIX returns EAGAIN where 4.3BSD used EWOULDBLOCK;
 * but, the standards insist on unique errno values for each errno.
 * A unique value is reserved for users that want to code case
 * statements for systems that return either EAGAIN or EWOULDBLOCK.
 */
#if _XOPEN_SOURCE_EXTENDED==1
#define EWOULDBLOCK     EAGAIN   /* Operation would block       */
#else /* _XOPEN_SOURCE_EXTENDED */
#define EWOULDBLOCK     54
#endif /* _XOPEN_SOURCE_EXTENDED */

#define EINPROGRESS     55      /* Operation now in progress */
#define EALREADY        56      /* Operation already in progress */

/* ipc/network software */

        /* argument errors */
#define ENOTSOCK        57      /* Socket operation on non-socket */
#define EDESTADDRREQ    58      /* Destination address required */
#define EDESTADDREQ     EDESTADDRREQ /* Destination address required */
#define EMSGSIZE        59      /* Message too long */
#define EPROTOTYPE      60      /* Protocol wrong type for socket */
#define ENOPROTOOPT     61      /* Protocol not available */
#define EPROTONOSUPPORT 62      /* Protocol not supported */
#define ESOCKTNOSUPPORT 63      /* Socket type not supported */
#define EOPNOTSUPP      64      /* Operation not supported on socket */
#define EPFNOSUPPORT    65      /* Protocol family not supported */
#define EAFNOSUPPORT    66      /* Address family not supported by protocol family */
#define EADDRINUSE      67      /* Address already in use */
#define EADDRNOTAVAIL   68      /* Can't assign requested address */

        /* operational errors */
#define ENETDOWN        69      /* Network is down */
#define ENETUNREACH     70      /* Network is unreachable */
#define ENETRESET       71      /* Network dropped connection on reset */
#define ECONNABORTED    72      /* Software caused connection abort */
#define ECONNRESET      73      /* Connection reset by peer */
#define ENOBUFS         74      /* No buffer space available */
#define EISCONN         75      /* Socket is already connected */
#define ENOTCONN        76      /* Socket is not connected */
#define ESHUTDOWN       77      /* Can't send after socket shutdown */

#define ETIMEDOUT       78      /* Connection timed out */
#define ECONNREFUSED    79      /* Connection refused */

#define EHOSTDOWN       80      /* Host is down */
#define EHOSTUNREACH    81      /* No route to host */

/* ERESTART is used to determine if the system call is restartable */
#define ERESTART        82      /* restart the system call */

/* quotas and limits */
#define EPROCLIM        83      /* Too many processes */
#define EUSERS          84      /* Too many users */
#define ELOOP           85      /* Too many levels of symbolic links      */
#define ENAMETOOLONG    86      /* File name too long                     */

/*
 * AIX returns EEXIST where 4.3BSD used ENOTEMPTY;
 * but, the standards insist on unique errno values for each errno.
 * A unique value is reserved for users that want to code case
 * statements for systems that return either EEXIST or ENOTEMPTY.
 */
#if defined(_ALL_SOURCE) && !defined(_LINUX_SOURCE_COMPAT)
#define ENOTEMPTY       EEXIST  /* Directory not empty */
#else   /* not _ALL_SOURCE */
#define ENOTEMPTY       87
#endif  /* _ALL_SOURCE */

/* disk quotas */
#define EDQUOT          88      /* Disc quota exceeded */

#define ECORRUPT        89      /* Invalid file system control data */

/* errnos 90-92 reserved for future use compatible with AIX PS/2 */

/* network file system */
#define EREMOTE         93      /* Item is not local to host */

/* errnos 94-108 reserved for future use compatible with AIX PS/2 */

#define ENOSYS          109     /* Function not implemented  POSIX */

/* disk device driver */
#define EMEDIA          110     /* media surface error */
#define ESOFT           111     /* I/O completed, but needs relocation */

/* security */
#define ENOATTR         112     /* no attribute found */
#define ESAD            113     /* security authentication denied */
#define ENOTRUST        114     /* not a trusted program */

/* BSD 4.3 RENO */
#define ETOOMANYREFS    115     /* Too many references: can't splice */

#define EILSEQ          116     /* Invalid wide character */
#define ECANCELED       117     /* asynchronous i/o cancelled */

/* SVR4 STREAMS */
#define ENOSR           118     /* temp out of streams resources */
#define ETIME           119     /* I_STR ioctl timed out */
#define EBADMSG         120     /* wrong message type at stream head */
#define EPROTO          121     /* STREAMS protocol error */
#define ENODATA         122     /* no message ready at stream head */
#define ENOSTR          123     /* fd is not a stream */

#define ECLONEME        ERESTART /* this is the way we clone a stream ... */

#define ENOTSUP         124     /* POSIX threads unsupported value */

#define EMULTIHOP       125     /* multihop is not allowed */
#define ENOLINK         126     /* the link has been severed */
#define EOVERFLOW       127     /* value too large to be stored in data type */

#endif /* _ANSI_C_SOURCE */

#endif /* _H_ERRNO */
clsprod@haflinger:/usr/include/sys $


clsprod@haflinger:/usr/include $ file sysexits.h
sysexits.h: ascii text
clsprod@haflinger:/usr/include $ cat sysexits.h
/* IBM_PROLOG_BEGIN_TAG                                                   */
/* This is an automatically generated prolog.                             */
/*                                                                        */
/* bos530 src/bos/usr/include/sysexits.h 1.6                              */
/*                                                                        */
/* Licensed Materials - Property of IBM                                   */
/*                                                                        */
/* (C) COPYRIGHT International Business Machines Corp. 1989,1991          */
/* All Rights Reserved                                                    */
/*                                                                        */
/* US Government Users Restricted Rights - Use, duplication or            */
/* disclosure restricted by GSA ADP Schedule Contract with IBM Corp.      */
/*                                                                        */
/* IBM_PROLOG_END_TAG                                                     */
/* @(#)30       1.6  src/bos/usr/include/sysexits.h, incstd, bos530 6/16/90 00:14:57 */
#ifndef _H_SYSEXITS
#define _H_SYSEXITS
/*
 * COMPONENT_NAME: (INCSTD) Standard Include Files
 *
 * FUNCTIONS:
 *
 * ORIGINS: 27
 *
 * (C) COPYRIGHT International Business Machines Corp. 1989
 * All Rights Reserved
 * Licensed Materials - Property of IBM
 *
 * US Government Users Restricted Rights - Use, duplication or
 * disclosure restricted by GSA ADP Schedule Contract with IBM Corp.
 */

/*
**  SYSEXITS.H -- Exit status codes for system programs.
**
**      This include file attempts to categorize possible error
**      exit statuses for system programs, notably delivermail
**      and the Berkeley network.
**
**      Error numbers begin at EX__BASE to reduce the possibility of
**      clashing with other exit statuses that random programs may
**      already return.  The meaning of the codes is approximately
**      as follows:
**
**      EX_USAGE -- The command was used incorrectly, e.g., with
**              the wrong number of arguments, a bad flag, a bad
**              syntax in a parameter, or whatever.
**      EX_DATAERR -- The input data was incorrect in some way.
**              This should only be used for user's data & not
**              system files.
**      EX_NOINPUT -- An input file (not a system file) did not
**              exist or was not readable.  This could also include
**              errors like "No message" to a mailer (if it cared
**              to catch it).
**      EX_NOUSER -- The user specified did not exist.  This might
**              be used for mail addresses or remote logins.
**      EX_NOHOST -- The host specified did not exist.  This is used
**              in mail addresses or network requests.
**      EX_UNAVAILABLE -- A service is unavailable.  This can occur
**              if a support program or file does not exist.  This
**              can also be used as a catchall message when something
**              you wanted to do doesn't work, but you don't know
**              why.
**      EX_SOFTWARE -- An internal software error has been detected.
**              This should be limited to non-operating system related
**              errors as possible.
**      EX_OSERR -- An operating system error has been detected.
**              This is intended to be used for such things as "cannot
**              fork", "cannot create pipe", or the like.  It includes
**              things like getuid returning a user that does not
**              exist in the passwd file.
**      EX_OSFILE -- Some system file (e.g., /etc/passwd, /etc/utmp,
**              etc.) does not exist, cannot be opened, or has some
**              sort of error (e.g., syntax error).
**      EX_CANTCREAT -- A (user specified) output file cannot be
**              created.
**      EX_IOERR -- An error occurred while doing I/O on some file.
**      EX_TEMPFAIL -- temporary failure, indicating something that
**              is not really an error.  In sendmail, this means
**              that a mailer (e.g.) could not create a connection,
**              and the request should be reattempted later.
**      EX_PROTOCOL -- the remote system returned something that
**              was "not possible" during a protocol exchange.
**      EX_NOPERM -- You did not have sufficient permission to
**              perform the operation.  This is not intended for
**              file system problems, which should use NOINPUT or
**              CANTCREAT, but rather for higher level permissions.
**              For example, kre uses this to restrict who students
**              can send mail to.
**
*/

# define EX_OK          0       /* successful termination */

# define EX__BASE       64      /* base value for error messages */

# define EX_USAGE       64      /* command line usage error */
# define EX_DATAERR     65      /* data format error */
# define EX_NOINPUT     66      /* cannot open input */
# define EX_NOUSER      67      /* addressee unknown */
# define EX_NOHOST      68      /* host name unknown */
# define EX_UNAVAILABLE 69      /* service unavailable */
# define EX_SOFTWARE    70      /* internal software error */
# define EX_OSERR       71      /* system error (e.g., can't fork) */
# define EX_OSFILE      72      /* critical OS file missing */
# define EX_CANTCREAT   73      /* can't create (user) output file */
# define EX_IOERR       74      /* input/output error */
# define EX_TEMPFAIL    75      /* temp failure; user is invited to retry */
# define EX_PROTOCOL    76      /* remote error in protocol */
# define EX_NOPERM      77      /* permission denied */
# define EX_CONFIG      78      /* configuration error */
# define EX_DB          79      /* database access error */

#endif /* _H_SYSEXITS */


104 sh             2396372       6.338403430       0.002943           return from execve. error ENOEXEC [36 usec]
101 sh             2396372       6.338414084       0.010654           open LR = 10003088


ioctl and related:
==================


The ioctl subroutine performs a variety of control operations on the object associated with the specified open file descriptor. 
This function is typically used with character or block special files, sockets, or generic device support 
such as the termio general terminal interface.

The control operation provided by this function call is specific to the object being addressed, 
as are the data type and contents of the Argument parameter. The ioctlx form of this function can be used to pass 
an additional extension parameter to objects supporting it. The ioct132 and ioct132x forms of this function behave in 
the same way as ioctl and ioctlx, but allow 64-bit applications to call the ioctl routine for an object that does not 
normally work with 64-bit applications.

Performing an ioctl function on a file descriptor associated with an ordinary file results in an error being returned.

EBADF	 The FileDescriptor parameter is not a valid open file descriptor. 
EFAULT	 The Argument or Ext parameter is used to point to data outside of the process address space. 
EINTR	 A signal was caught during the ioctl or ioctlx subroutine and the process had not enabled re-startable subroutines for the signal. 
EINTR	 A signal was caught during the ioctl , ioctlx , ioctl32 , or ioct132x subroutine and the process had not enabled re-startable subroutines for the signal. 
EINVAL	 The Command or Argument parameter is not valid for the specified object. 
ENOTTY	 The FileDescriptor parameter is not associated with an object that accepts control functions. 
ENODEV	 The FileDescriptor parameter is associated with a valid character or block special file, but the supporting device driver does not support the ioctl function. 
ENXIO	 The FileDescriptor parameter is associated with a valid character or block special file, but the supporting device driver is not in the configured state. 
	 Object-specific error codes are defined in the documentation for associated objects. 


tecad error:
============

IZ37728: TECAD_SNMP CRASHES ON AIX 5.3 SP8 OR GREATER
  

 A fix is available 
3.9.0.8-TIV-TEC-IF0106 IBM Tivoli Enterprise Console Version 3.9 Interim Fix

 
APAR status
Closed as program error.

Error description 
TEC 3.9
AIX 5.3 SP8 or greater
When traps are received by the snmp adapter, it crashes

Following is some data from the truss and adapter output:

Truss 1 Output:

open("/usr/lib/nls/msg/en_US/libc.cat", O_RDONLY) = 9
kioctl(9, 22528, 0x00000000, 0x00000000) Err#25 ENOTTY
kfcntl(9, F_SETFD, 0x00000001)   = 0
kioctl(9, 22528, 0x00000000, 0x00000000) Err#25 ENOTTY
kread(9, "\0\001 ?\007\007 I S O 8".., 4096) = 4096
lseek(9, 0, 1)     = 4096
lseek(9, 0, 1)     = 4096
lseek(9, 0, 1)     = 4096
_getpid()     = 101604
lseek(9, 0, 1)     = 4096
lseek(9, 8069, 0)    = 8069
kread(9, " T h e   s y s t e m   c".., 4096) = 4096
close(9)     = 0
__loadx(0x07000000, 0xF01E0438, 0x0000001A, 0xF015B6F8,
0x100140A3) = 0xF015C35C
__loadx(0x07000000, 0xF01E0444, 0x0000001A, 0xF015B6F8,
0x100140A3) = 0xF015C3A4
__loadx(0x07000000, 0xF01E0450, 0x0000001A, 0xF015B6F8,
0x100140A3) = 0xF015C314
__loadx(0x07000000, 0xF01E045C, 0x0000001A, 0xF015B6F8,
0x100140A3) = 0xF015C3EC
__loadx(0x07000000, 0xF01E0468, 0x0000001A, 0xF015B6F8,
0x100140A3) = 0xF015C428
__loadx(0x05000000, 0x2FF1F6A8, 0x00000960, 0xF015B6F8,
0x00000000) = 0x00000000
kread(8, " h o s t s   n i s _ l d".., 4096) = 0
close(8)     = 0
getdomainname(0xF023D178, 1024)   = 0
getdomainname(0xF023D178, 1024)   = 0
getdomainname(0xF023D178, 1024)   = 0
getdomainname(0xF023D178, 1024)   = 0
_getpid()     = 101604
getuidx(1)     = 0
kwrite(7, " 2 7", 2)    Err#32 EPIPE
    Received signal #13, SIGPIPE [default]
*** process killed ***

Truss 2 Output

open("/tmp/tec_ed ", O_WRONLY|O_CREAT|O_TRUNC,
S_IRUSR|S_IWUSR|S_IRGRP|S_IWGRP|S_IROTH|S_IWOTH) = 5
kioctl(5, 22528, 0x00000000, 0x00000000) Err#25 ENOTTY
kfcntl(5, F_GETFL, 0x00000008)   = 1
close(5)     = 0
kread(4, " #\r\n #   " $ I d :   @".., 4096) = 0
close(4)     = 0
open("/tmp/tec_ed ", O_WRONLY|O_CREAT|O_APPEND,
S_IRUSR|S_IWUSR|S_IRGRP|S_IWGRP|S_IROTH|S_IWOTH) = 4
klseek(4, 0, 0, 0x00000002)   = 0
kioctl(4, 22528, 0x00000000, 0x00000000) Err#25 ENOTTY
kioctl(4, 22528, 0x00000000, 0x00000000) Err#25 ENOTTY
kwrite(4, " S e p   2 2   2 3 : 2 2".., 114) = 114
close(4)     = 0
kread(3, " #   M o n   J u l   2 1".., 4096) = 0
kfcntl(3, F_GETFL, 0x00000008)   = 0
klseek(3, 0, 0, 0x00000000)   = 0
kread(3, " #   M o n   J u l   2 1".., 4096) = 195
kread(3, " #   M o n   J u l   2 1".., 4096) = 0
kfcntl(3, F_GETFL, 0x00000008)   = 0
klseek(3, 0, 0, 0x00000000)   = 0
kread(3, " #   M o n   J u l   2 1".., 4096) = 195
kread(3, " #   M o n   J u l   2 1".., 4096) = 0
close(3)     = 0
kioctl(1, 22528, 0x00000000, 0x00000000) = 0
kwrite(1, 0xF0220C70, 68)   = 68
sigprocmask(0, 0xF029D7B0, 0xF029D7A8)  = 0
kfork()      = 149840
thread_setmymask_fast(0x00000000, 0x00000000, 0x00000000,
0xD006DC80, 0x00000000, 0x1004671F, 0x1004671F, 0x00000000) =
0x00000000
_exit(0)

tecad_snmp.err output:

Tue Sep 23 15:05:57 2008  NORMAL: SELECT ,(00), ibtecad/select.c
line 0220: Correct is TRUE
Tue Sep 23 15:05:57 2008  NORMAL: SELECT ,(00), ibtecad/select.c
line 0267: Finished TECAD_EvalSelect, returning TRUE
Tue Sep 23 15:05:57 2008     LOW: KERNEL ,(00), ibtecad/kernel.c
line 0247: Found action is <DirectTalkStatus_Trap>
Tue Sep 23 15:05:57 2008  NORMAL: FETCH  ,(00), libtecad/fetch.c
line 0086: Entered Eval_Fetch
Tue Sep 23 15:05:57 2008  NORMAL: FETCH  ,(00), libtecad/fetch.c
line 0109: --get FetchVar, i=0
Tue Sep 23 15:05:57 2008  NORMAL: FETCH  ,(00), libtecad/fetch.c
line 0120: --calling EvalFetchExpression
Tue Sep 23 15:05:57 2008  NORMAL: FETCH  ,(00), libtecad/fetch.c
line 0187: Entered Eval_Fetch_Expression
Tue Sep 23 15:05:57 2008  NORMAL: FETCH  ,(00), libtecad/fetch.c
line 0190: -- argc1
Tue Sep 23 15:05:57 2008  NORMAL: FETCH  ,(00), libtecad/fetch.c
line 0197: -- argv not null
Tue Sep 23 15:05:57 2008  NORMAL: FETCH  ,(00), libtecad/fetch.c
line 0203: -- loop over all fetches, i=0
Tue Sep 23 15:05:57 2008  NORMAL: FETCH  ,(00), libtecad/fetch.c
line 0212: -- Current_Expression not null
Tue Sep 23 15:05:57 2008  NORMAL: FETCH  ,(00), libtecad/fetch.c
line 0187: Entered Eval_Fetch_Expression
Tue Sep 23 15:05:57 2008  NORMAL: FETCH  ,(00), libtecad/fetch.c
line 0190: -- argc0
Tue Sep 23 15:05:57 2008  NORMAL: FETCH  ,(00), libtecad/fetch.c
line 0197: -- argv not null
Tue Sep 23 15:05:57 2008  NORMAL: FETCH  ,(00), libtecad/fetch.c
line 0236: -- do the required operation
Tue Sep 23 15:05:57 2008  NORMAL: FETCH  ,(00), libtecad/fetch.c
line 0245:   -- Expression->Index=6
Tue Sep 23 15:05:57 2008  NORMAL: FETCH  ,(00), libtecad/fetch.c
line 0246:   -- argc=0
Tue Sep 23 15:05:57 2008  NORMAL: FETCH  ,(00), libtecad/fetch.c
line 0250:   -- argv not NULL
Tue Sep 23 15:05:57 2008  NORMAL: FETCH  ,(00), libtecad/fetch.c
line 0255:   -- Expression->Operator not NULL
Tue Sep 23 15:05:57 2008 VERBOSE: KERNEL ,(00), cad/evaluation.c
line 0271: TECAD_GetGlobalEntry Index <6>
Tue Sep 23 15:05:57 2008 VERBOSE: UTILS  ,(00), /configuration.c
line 0521: Entering TECAD_CopyAttributeEntry
Tue Sep 23 15:05:57 2008 VERBOSE: UTILS  ,(00), /configuration.c
line 0161: Entering TECAD_MakeAttributeEntry
Tue Sep 23 15:05:57 2008 VERBOSE: UTILS  ,(00), /configuration.c
line 0185: Leaving TECAD_MakeAttributeEntry
Tue Sep 23 15:05:57 2008 VERBOSE: UTILS  ,(00), /configuration.c
line 0563: Leaving TECAD_CopyAttributeEntry
Tue Sep 23 15:05:57 2008  NORMAL: FETCH  ,(00), libtecad/fetch.c
line 0267: -- clear the memory
Tue Sep 23 15:05:57 2008  NORMAL: FETCH  ,(00), libtecad/fetch.c
line 0270: Finished Eval_Fetch_Expression
Tue Sep 23 15:05:57 2008  NORMAL: FETCH  ,(00), libtecad/fetch.c
line 0222: -- result not null
Tue Sep 23 15:05:57 2008  NORMAL: FETCH  ,(00), libtecad/fetch.c
line 0236: -- do the required operation
Tue Sep 23 15:05:57 2008  NORMAL: FETCH  ,(00), libtecad/fetch.c
line 0245:   -- Expression->Index=1
Tue Sep 23 15:05:57 2008  NORMAL: FETCH  ,(00), libtecad/fetch.c
line 0246:   -- argc=1
Tue Sep 23 15:05:57 2008  NORMAL: FETCH  ,(00), libtecad/fetch.c
line 0250:   -- argv not NULL
Tue Sep 23 15:05:57 2008  NORMAL: FETCH  ,(00), libtecad/fetch.c
line 0255:   -- Expression->Operator not NULL
Local fix 
Problem summary 
****************************************************************
* USERS AFFECTED: All TEC users running the SNMP adapter on AIX.
****************************************************************
* PROBLEM DESCRIPTION: When traps are received by the SNMP
*   adapter running on AIX, it crashes.
****************************************************************
* RECOMMENDATION: Apply the maintenance listed below.
****************************************************************
Problem conclusion 
The adapter was being killed due to a SIGPIPE signal.  This
signal will now be ignored.

The fix for this APAR is contained in the following maintenance
packages:
  | interim fix | 3.9.0.8-TIV-TEC-IF0106
Temporary fix 
Comments  


Universal Command:
==================

UC facilitates jobscheduling from Mainframe to AIX and HP-UX systems.

AIX:
# lslpp -La 'UCmdP'
HP:
# swlist -l subproduct UCmd


Tiger:
======

Tiger is a security tool that can be used as a security and intrusion detection system. It works at many
platforms and is provided under the GPL license. So its free software. 
Its written entirely in shell language.


5FC2DD4B  PING TO REMOTE HOST FAILED:
=====================================

AIX ONLY:

In errpt you might find:

[pl101][tdbaprod][/home/tdbaprod] errpt
IDENTIFIER TIMESTAMP  T C RESOURCE_NAME  DESCRIPTION
5FC2DD4B   0225151808 I H ent2           PING TO REMOTE HOST FAILED
9F7B0FA6   0225151108 I H ent2           PING TO REMOTE HOST FAILED

LABEL:          ECH_PING_FAIL_BCKP
IDENTIFIER:     5FC2DD4B

Date/Time:       Mon Feb 25 14:41:06 2008
Sequence Number: 2140
Machine Id:      00CB85FF4C00
Node Id:         pl101
Class:           H
Type:            INFO
Resource Name:   ent2
Resource Class:  adapter
Resource Type:   ibm_ech
Location:

Description
PING TO REMOTE HOST FAILED

Probable Causes
CABLE
SWITCH
ADAPTER

Failure Causes
CABLES AND CONNECTIONS

        Recommended Actions
        CHECK CABLE AND ITS CONNECTIONS
        IF ERROR PERSISTS, REPLACE ADAPTER CARD.

Detail Data
FAILING ADAPTER
ent1
SWITCHING TO ADAPTER
ent0
Unable to reach remote host through backup adapter: switching over to primary adapter


-- thread 1:

All our servers every three minutes logs this message:

9F7B0FA6   0602080605 I H ent2           PING TO REMOTE HOST FAILED

The details of the message says it can't ping the default gateway through backup adapter.
Why does it try this? Why does it fail because if we pull the primary cable it switches 
to the backup adapter with no problems.

Cheers


-- thread 2:

Hello:

I've seen similar things happen when the switch is not on "port host" (meaning the port begins receiving and sending 
packets quickly, instead of running Spanning Tree Protocol before going in the FORWARDING state): in this case, 
the EtherChannel sends the ping packets, they are dropped because the switch is still initializing, 
and the cycle continues on and on. Still, 30 minutes sounds like a long time.

You can try the following:

- verify that the EtherChannel switch ports are set to "port host" (i.e., STP should be disabled)

on the VIOS, set the num_retries to a higher value (default is 3) and/or set the retry_time to a higher value (default is 1) 

Does this ONLY happen when updating from FP74 to FP8, or every time the VIOS boots?

Kind regards,


-- thread 3:

Hi All, 
I am getting the following error consistently on one of my servers. when i 
do a entstat -d ent3 | grep "Active channel", it does come back with Active 
channel: primary channel. Could you please provide me with any suggestions 
or steps I can take to fix this error? 

entstat -d ent2 | grep "Active channel"

Hi 
Just Etherchannel or Etherchannel with Backup Adapter connected to a failover Switch just in case everything fails ?? 
If so, please take a read of the following: 
http://publib.boulder.ibm.com/infocenter/clresctr/v xrx/index.jsp?topic=/com.ibm.cluster.rsct.doc/rsct _aix5l53/bl5adm05/bl5adm0559.html 
Hope this helps


-- thread 4:

A VIOS network failover test produces the above error messages, so in that case there is no real problem.


midaemon, scopeux, measureware:
===============================

The scopeux data collector, the midaemon (measurement interface daemon), and the alarmgen (alarm generator)
process, are all part of HP Openview Measureware software that can run on a node.

A typical process list might show the following:

zd57l08:/home/root>ps -ef | grep /usr/lpp/perf/bin/
    root  176232       1   0   Dec 20      -  0:17 /usr/lpp/perf/bin/midaemon
    root  188536       1   0   Dec 20      -  0:00 /usr/lpp/perf/bin/ttd
    root  200830  254112   0   Dec 20      -  5:06 /usr/lpp/perf/bin/alarmgen -svr 254112 -t alarmgen /var/opt/perf/datafiles//
    root  204918       1   0   Dec 20      - 106:40 /usr/lpp/perf/bin/scopeux
    root  233644       1   0   Dec 20      -  0:07 /usr/lpp/perf/bin/llbd
    root  254112  266370   0   Dec 20      -  6:56 /usr/lpp/perf/bin/agdbserver -t alarmgen /var/opt/perf/datafiles/
    root  266370       1   0   Dec 20      -  3:42 /usr/lpp/perf/bin/perflbd
    root  307362  266370   0   Dec 20      -  8:22 /usr/lpp/perf/bin/rep_server -t SCOPE /var/opt/perf/datafiles/logglob

You can start or stop or view the status of the processes, by using the mwa command:

oranh202:/home/se1223>mwa status
MeasureWare scope status:
WARNING: scopeux    is not active (MWA data collector)

MeasureWare background daemon status:
(Should always be running when the system is up)
    Running ttd                   (Transaction Tracker daemon) pid 1493

MeasureWare server status:
    Running alarmgen              (alarm generator) pid 12547
    Running agdbserver            (alarm database server) pid 12546
    Running perflbd               (location broker) pid 12075

    The following data sources have running repository servers:
                            PID  DATA SOURCE
    Running rep_server    12521  SCOPE


How does mwa start on boot?
---------------------------

There is an entry in "/etc/inittab":

root@zd110l01.nl.eu.abnamro.com:/etc#cat inittab | grep mwa
mwa:2:wait:/etc/rc.mwa start >/dev/console  # Start MeasureWare


Check on whether its running:

oper@zd110l03:/home/oper$ ps -ef | grep /usr/lpp/
    root 147480      1   0   May 15      -  0:10 /usr/lpp/perf/bin/midaemon
    root 151694      1   0   May 02      - 38:59 /usr/lpp/perf/bin/llbd
    root 229394 340068   0   May 15      -  7:30 /usr/lpp/perf/bin/rep_server -t SCOPE /var/opt/perf/datafiles/logglob
    root 258142 393340   0   Jul 18      -  9:04 /usr/lpp/mmfs/bin/aix64/mmfsd64
    root 270588 372908   0   May 15      -  4:07 /usr/lpp/perf/bin/alarmgen -svr 372908 -t alarmgen /var/opt/perf/datafiles//
    root 315572      1   0   May 15      - 40:33 /usr/lpp/perf/bin/scopeux
    root 340068      1   0   May 15      -  4:45 /usr/lpp/perf/bin/perflbd
    root 352326      1   0   May 15      -  0:00 /usr/lpp/perf/bin/ttd
    root 372908 340068   0   May 15      -  6:19 /usr/lpp/perf/bin/agdbserver -t alarmgen /var/opt/perf/datafiles/
    root 393340      1   0   Jul 18      -  0:00 /bin/ksh /usr/lpp/mmfs/bin/runmmfs
  uctsp0 434430      1   0   May 04      - 58:36 /usr/lpp/uctsp0/control-sa/exe/AIX-51/p_ctsce
  uctsp0 503814 434430   0 22:41:01      -  0:00 /usr/lpp/uctsp0/control-sa/exe/AIX-51/./p_ctscd 57671683 57671682 434430
  uctsp0 536756 434430   0 22:41:01      -  0:00 /usr/lpp/uctsp0/control-sa/exe/AIX-51/./p_ctscs 57671681 60817408 434430


-- How to stop and start the agents:

1. Use the /etc/rc.mwa script, if its available on your system:

root@zd110l13:/etc#rc.mwa stop

Shutting down Measureware collection software
         Shutting down scopeux, pid(s) 192688
         Waiting on 192688  (10 more tries)
         The Measureware collector, scopeux has been shut down successfully.

Shutting down the MeasureWare server daemons
         Shutting down the alarmgen process.  This may take a while
         depending upon how many monitoring systems have to be
         notified that MeasureWare Server is shutting down.


         Shutting down the perflbd process

         The perflbd process has terminated

         Shutting down the agdbserver process

         The agdbserver process has terminated

         Shutting down the rep_server processes

         The rep_server processes have terminated

The MeasureWare Server has been shut down successfully
root@zd110l13:/etc#rc.mwa start

The MeasureWare scope collector is being started.
         The Transaction Tracking daemon
         /usr/lpp/perf/bin/ttd has been started.

         The performance collection daemon
         /usr/lpp/perf/bin/scopeux has been started.

The MeasureWare server daemon "llbd" is being started.

The MeasureWare server daemons are being started.
         The MeasureWare Location Broker daemon
         /usr/lpp/perf/bin/perflbd has been started.


root@zd110l13:/etc#


Notes about mwa:
----------------

About MWA(MeasureWare Agent)
Introduction:
MeasureWare Agent software captures performance, resource, and transaction data from your 
HP 9000 server or workstation system.
Using minimal system resources, the software continuously collects, logs, summarizes, and 
timestamps data,and detects alarm conditions on current and historical data across your system.
You can analyze the data using spreadsheet programs, Hewlett-Packard analysis products such as PerfView,
or third-party analysis products.
Also, MeasureWare Agent provides data access to PerfView and sends alarm notifications to PerfView, 
HP OpenView, and IT/Operations.

HP OpenView MeasureWare Agent for UNIX has been renamed to HP OpenView Performance Agent for UNIX.

MeasureWare Agent uses data source integration (DSI) technology to receive, alarm on, and log data 
from external data sources such as applications, databases, networks, and other operating systems.
The comprehensive data logged and stored by MeasureWare Agent allows you to:
 Characterize the workloads in the environment.
 Analyze resource usage and load balance.
 Perform trend analyses on historical data to isolate and identify bottlenecks.
 Perform service-level management based on transaction response time.
 Perform capacity planning.
 Respond to alarm conditions.
 Solve system management problems before they arise.

Starting MWA Automatically
The process of starting MeasureWare Agent automatically whenever the system reboots is controlled 
by the configuration file /etc/rc.config.d/mwa.

This file defines two shell variables, MWA_START and MWA_START_COMMAND.

The default /etc/rc.config.d/mwa configuration file shipped with this version of MeasureWare Agent 
resides in /opt/perf/newconfig/
and assigns the following values to these variables:
MWA_START=1
MWA_START_COMMAND="/opt/perf/bin/mwa start"

When MeasureWare Agent is installed, the file is conditionally copied to /etc/rc.config.d/mwa and will 
not replace any existing /etc/rc.config.d/mwa configuration file that may have been customized by the user 
for a previous version of MeasureWare Agent.
When the file is copied to /etc/rc.config.d/mwa, the variable MWA_START=1 causes MeasureWare Agent to automatically 
start when the system reboots.
The variable MWA_START_COMMAND="/opt/perf/bin/mwa start" causes all MeasureWare Agent processes to initiate 
when the system reboots.

If you want MeasureWare Agent to start at system reboot using special options,
modify the /etc/rc.config.d/mwa file by changing MWA_START_COMMAND from its default value of 
"/opt/perf/bin/mwa start" to the desired value.

For example, to start up scopeux but not the servers, change the value to "/opt/perf/bin/mwa start scope".
To disable MeasureWare Agent startup when the system reboots, change the variable MWA_START=1 to MWA_START=0.

MWA Command:
SYNOPSIS
      mwa [action] [subsystem] [parms]

DESCRIPTION
      mwa is a script  that  is  used  to  start,  stop,  and  re-initialize MeasureWare Agent processes.

 ACTION
-?      List all mwa options.
        If your shell interprets ? as a wildcard character, use an invalid option such as -xxx nstead of -?.
start   Start all or part of MeasureWare Agent.  (default)
stop    Stop all or part of MeasureWare Agent.
restart Reinitialize all or part of MWA. This option causes some processes to be stopped and restarted.
status  List the status of all or part of MWA processes.
version List the version of the all or part of MWA files.

 SUBSYSTEM
all Perform the selected action on  all  MWA components. (default)
    scope Perform the selected action on the scopeux collector.
    The  restart  operation causes the scopeux collector to stop, then restart. 
    This causes the parm and ttd.conf files to be re-read.

server Perform the selected action on the MWA server components. 
          This affects the data repositories as well as the alarm generation subsystem. 
          The restart operation causes all repository server processes to terminate and restart.
          This causes the perflbd.rc and alarmdef files to be re-read.

alarm Perform the selected action on the MWA server alarm component.
         Restart is the only valid option and causes the alarmdef file to be reprocessed.

db  Perform the selected action on  the MWA server db component.

 PARMS
-midaemon <miparms> Provide the midaemon with parameters to initiate it with other than default parameters. 

Example:
phred01:/> mwa status
MeasureWare scope status:
WARNING: scopeux    is not active (MWA data collector)

MeasureWare background daemon status:
(Should always be running when the system is up)
    Running ttd                   (Transaction Tracker daemon) pid 1900

MeasureWare server status:
    Running alarmgen              (alarm generator) pid 2816
    Running agdbserver            (alarm database server) pid 2815
    Running perflbd               (location broker) pid 1945

    The following data sources have running repository servers:
                            PID  DATA SOURCE
    Running rep_server     2810  SCOPE
phred01:/> mwa stop

         Shutting down Measureware collection software..
NOTE:    The Transaction Tracker daemon, ttd  will be left running. pid 1900

         Shutting down the MeasureWare server daemons..
         Shutting down the alarmgen process.  This may take awhile
         depending upon how many monitoring systems have to be
         notified that MeasureWare Server is shutting down.
        
         The alarmgen process has terminated

         Shutting down the perflbd process

         The perflbd process has terminated

         The agdbserver process terminated

         The rep_server processes have terminated

         The MeasureWare Server has been shut down successfully

phred01:/> mwa start

The Transaction Tracker daemon is being started.
         The Transaction Tracker daemon
         /opt/perf/bin/ttd, is already running.

The MeasureWare scope collector is being started.
         The performance collection daemon
         /opt/perf/bin/scopeux has been started.
The MeasureWare server daemons are being started.
         The MeasureWare Location Broker daemon
         /opt/perf/bin/perflbd has been started.

phred01:/> mwa status
MeasureWare scope status:
    Running scopeux               (MWA data collector) pid 12361
    Running midaemon              (measurement interface daemon) pid 1936

MeasureWare background daemon status:
    Running ttd                   (Transaction Tracker daemon) pid 1900

MeasureWare server status:
    Running alarmgen              (alarm generator) pid 12907
    Running agdbserver            (alarm database server) pid 12906
    Running perflbd               (location broker) pid 12369

    The following data sources have running repository servers:
                            PID  DATA SOURCE
    Running rep_server    12905  SCOPE

References:
HP OpenView Performance Agent for HP-UX 10.20 and 11 Installation & Configuration Guide
man mwa(Command)


Thread 1 about scopeaux:
------------------------

Subject: restart vrs. start scopeux

mwa status reports scopeux not running. Manual states to use restart command to retain existing logs (status.scope). 
But, I'm more concerned about the database collected prior to "mysterious" end of scopeux. Will restart (or start) 
of scope (scopeux) preserve existing data?

Thanks.
Vic. 

Once the data is written to the log files, it stays there when scopeux stops and starts. The data is deleted 
after the logfile reachs its size limit and starts to wrap. The oldest data is overwritten first.

I just do a "mwa start scope" to restart scope. I usually don't do a "mwa restart" as sometimes one of the 
processes may not stop, usually perflbd. I do a "mwa stop", check everything "mwa status" then do a "mwa start".

Sometimes when scopeux crashes, it was in the act of writing a record and only a partial record is written. 
This partial record will corrupt the database and scopeux will not start. In this case, the only way to 
start scopeux is to delete the database. It is a good idea to backup the databases frequently if the data is 
important to you.

HTH
Marty

If you only want to work with the Scope Collector itself (I.E. All other MeasureWare processes are running) 
do the following:

mwa start scope
or
mwa restart scope

This will narrow down what part of the MeasureWare product you are working with.

The status.scope file might help you figure out why scope stopped.


To see what may have happened to the scope daemon, look in its status file /var/opt/perf/status.scope. 
You can also use "perfstat -t" to see the last few lines of all the OVPA status files.

Since the "perfstat" command shows glance and OVPA (mwa) status, I recommend using perfstat instead of 
"mwa status" (also less to type!).

Since I'm in a recommending mood, I also recommend AGAINST doing a "mwa start scope" (or restart scope). 
The reason is that its always better to restart the datacomm daemons when the underlying collector is restarted. 
Thus its better to just do a "mwa restart" or "mwa start" instead of restarting scope out from under 
rep_server and friends. 

In any case, if perfstat shows everything running but scopeux, then first find out why scope died 
(by looking at status.scope) before doing any restarts.


Thread 2 about scopeux:
-----------------------

Subject: HELP OVPA: midaemon and scopeux won't start        
Jamie Chui 
 Jun 21, 2007 03:23:12 GMT      

I could not get midaemon and scopeux to start. When using glance, the following error messages appears 
and what does it mean?

midaemon: Mon Jun 11 15:51:05 2007
mi_ki_init - only able to allocate 3 bufsets
Not enough space. 

I am using HP-UX 11.11 with OVPA C.04.55.00. 

Measureware ran for 10 days, and during this period, it had the following error message and then finally one day 
it stopped running. 

**** /opt/perf/bin/scopeux : 06/15/07 13:35:01 ****
WARNING: Measurement Buffers Lost see metric GBL_LOST_MI_TRACE_BUFFERS. (PE221-5
0)

How can I troubleshoot and get it running again?  
 

It looks like your OS is not allocating enough buffer space. You will need to increase your kernel parameters 
pertaining to buffer space and regen the kernel.

HTH
Marty


ctcasd daemon:
==============

The ctcasd daemon is used by the cluster security services library when UNIX-identity-based authentication 
is configured and active within the cluster environment. The cluster security services uses ctcasd 
when service requesters and service providers try to create a secured execution environment through 
a network connection. ctcasd is not used when service requesters and providers establish 
a secured execution environment through a local operating system connection such as a UNIX domain socket.

The daemon is actually part of RSCT, the reliable scalable cluster technology.
The Cluster Security (CtSec) component of RSCT provides a subservice
that performs authentication functions based on host identities.  This
subservice, called "ctcas", comes with a text formatted configuration
file shipped in /usr/sbin/rsct/cfg/ctcasd.cfg. 


Tivoli endpoint, lcfd process:
==============================


Tivoli Management Framework is the software infrastructure for many Tivoli software products. 
Using Tivoli Management Framework and a combination of Tivoli Enterprise products, you can manage 
large distributed networks with multiple operating systems, various network services, diverse system tasks, 
and constantly changing nodes and users. Tivoli Management Framework provides management services that are used 
by the installed Tivoli Enterprise products.

Tivoli Management Framework provides centralized control of a distributed environment, which can include 
mainframes, UNIX(R) operating systems, or Microsoft Windows operating systems. Using Tivoli Enterprise products, 
a single system administrator can perform the following tasks for thousands of networked systems:

-Manage user and group accounts 
-Deploy new or upgrade existing software 
-Inventory existing system configurations 
-Monitor the resources of systems either inside or outside the Tivoli environment 
-Manage Internet and intranet access and control 
-Manage third-party applications

Tivoli Management Framework lets you securely delegate system administration tasks to other administrators, 
giving you control over which systems an administrator can manage and what tasks that administrator 
can perform. Tivoli Management Framework includes the base infrastructure and base set of services 
that its related products use to provide direct control over specific resources in a distributed 
computing environment. Tivoli Management Framework provides a simple, consistent interface to 
diverse operating systems, applications, and distributed services.


>> Architecture overview:

Using this three-tiered hierarchy, the amount of communication with the Tivoli server is reduced. 
Endpoints do not communicate with the Tivoli server, except during the initial login process. 
All endpoint communication goes through the gateway. In most cases, the gateway provides all the support 
an endpoint needs without requiring communication with the Tivoli server.

In a smaller workgroup-size installation, you can create the gateway on the Tivoli server. 
The server can handle communication requirements when fewer computer systems are involved. 
This is not an acceptable option in large deployments. The Tivoli server in a large installation 
will be overloaded if it also serves as a gateway. Refer to Endpoints and gateways for more information 
about endpoint communication.


-- Tivoli servers

The Tivoli server includes the libraries, binaries, data files, and the graphical user interface (GUI) 
(the Tivoli desktop) needed to install and manage your Tivoli environment. The Tivoli server performs 
all authentication and verification necessary to ensure the security of Tivoli data. The following components 
comprise a Tivoli server: 

- An object database, which maintains all object data for the entire Tivoli region. 
- An object dispatcher, which coordinates all communication with managed nodes and gateways. 
  The object dispatcher process is the oserv, which is controlled by the oserv command. 
- An endpoint manager, which is responsible for managing all of the endpoints in the Tivoli region.

When you install the Tivoli server on a UNIX operating system, the Tivoli desktop is automatically installed. 
When you install the Tivoli server on a Windows operating system, you must install Tivoli Desktop for Windows 
separately to use the Tivoli desktop.


-- Managed nodes

A managed node runs the same software that runs on a Tivoli server. Managed nodes maintain their own 
object databases that can be accessed by the Tivoli server. When managed nodes communicate directly 
with other managed nodes, they perform the same communication or security operations that are performed 
by the Tivoli server.

The difference between a Tivoli server and a managed node is that the Tivoli server object database is global 
to the entire region including all managed nodes. In contrast, the managed node database is local to the 
particular managed node.

To manage a computer system that hosts the managed node, install an endpoint on that managed node.


-- Gateways

A gateway controls communication with and operations on endpoints. Each gateway can support thousands of endpoints. 
A gateway can launch methods on an endpoint or run methods on behalf of the endpoint.

A gateway is generally created on an existing managed node. This managed node provides access to the 
endpoint methods and provides the communication with the Tivoli server that the endpoints occasionally require. 
Refer to Endpoints and gateways for more information about gateways.

-- Endpoints
An endpoint provides the primary interface for system management. An endpoint is any system that runs 
the lcfd service (or daemon), which is configured using the lcfd command.

Typically, an endpoint is installed on a computer system that is not used for daily management operations. 
Endpoints run a very small amount of software and do not maintain an object database. The majority of systems 
in a Tivoli environment should be endpoints. The Tivoli desktop is not installed with the endpoint software. 
If you choose to run a desktop on an endpoint, you must install Tivoli Desktop for Windows or 
telnet to a UNIX managed node. Refer to Endpoints and gateways for more information about endpoints.


Note 1:
-------

thread:

Q:

Dear friends, 

When i run "lslpp -l" i have in a line : "Tivoli_Management_Agent.client.rte 
3.7.1.0 COMMITTED Management Framework Endpoint 
Runtime" ". 
What's the purpose of this fileset ? 

Ps : I have Aix 5.3 

thank's a lot. 

A:

The purpose of Tivoli Management Agent
Reply from nlx6976 on 3/6/2006 5:56:00 PM  

Its an agent that runs on your system as part of the Tivoli Distributed 
Monitoring. It reports various things about your sysem back to the Tivoli 
Enterprise Console - usually your help desk. The basic monitors include 
things like file system usage (e.g if a FS is more than 80% used the system 
gets flagged at the console), or monitoring log files. Basically you can 
configure it to monitor whatever you want.


Note 2:
-------


Problem 
The AIX server comes preloaded with the Tivoli Endpoint software installed. How can you make this process 
autostart at bootup?  
  
Solution 
Create the /etc/inittab entry:

mkitab "rctma1:2:wait:/etc/rc.tma1 > /dev/console 2>&1 # Tivoli Management Agent"

Create the startup file "/etc/rc.tma1"
#!/bin/sh
#
# Start the Tivoli Management Agent
#
if [ -f /Tivoli/lcf/dat/1/lcfd.sh ]; then
/Tivoli/lcf/dat/1/lcfd.sh start
fi

When the OS reboot the LCFD process will automatically start.  


Note 3:
-------

The lcfd.log file, found on each endpoint in the lcf/dat directory, contains logging messages for upcall methods, 
downcall methods, and the login activities of the endpoint. You also can view this log file from the http interface. 
In addition, lcfd.log can have different levels of debugging information written to it. 
To set the level of debugging, use the lcfd command with the -dlevel option, which sets the log_threshold option 
in the last.cfg file. Set the log_threshold at level 2 for problem determination, because level 3 often provides 
too much information.

Of the three log files, the lcfd.log file is sometimes the most useful for debugging endpoint problems. 
However, remote access to the endpoint is necessary for one-to-one contact.

Endpoint log messages have the following format: 

timestamp level app_name messageThe message elements are as follows: 

timestamp 
  Displays the date and time that the message was logged. 
level 
  Displays the logging level of the message. 
app_name 
  Displays the name of the application that generated the message. 
message 
  Displays the full message text. The content of message is provided by the application specified in app_name. 

The default limit of the log file is 1 megabyte, which you can adjust with the lcfd (or lcfd.sh) command 
with the -D log_size =max_size option. The valid range is 10240 through 10240000 bytes. When the maximum size 
is reached, the file reduces to a size of approximately 200 messages and continues to log. 

In addition to these three log files, the following files help troubleshoot endpoint problems 
located on the endpoint: 

last.cfg 
  A text file that contains the endpoint and gateway login configuration information from the last time 
  the endpoint successfully logged in to its assigned gateway. Use this file to review the configuration settings 
  for an endpoint. 
lcf.id 
  A text file that contains a unique ID number to represent the endpoint. This file is uniquely generated 
  if the TMEID.tag file does not exist. 
lcf.dat 
  A binary file that contains the gateway login information. You cannot modify this information; however, you can 
  view network configuration information from the http interface. 
  Of these files, the last.cfg file can be useful in determining problems with an endpoint. 
  The last.cfg file resides in the \dat subdirectory of the endpoint installation and also can be viewed 
  from the http interface. This file contains configuration information for the endpoint. 

The following example shows the contents of a last.cfg file: 

lcfd_port=9495
lcfd_preferred_port=9495
gateway_port=9494
protocol=TCPIP
log_threshold=1
start_timeout=120
run_timeout=120
lcfd_version=41100
logfile=C:\Program Files\Tivoli\lcf\dat\1\lcfd.log
config_path=C:\Program Files\Tivoli\lcf\dat\1\last.cfg
run_dir=C:\Program Files\Tivoli\lcf\dat\1
load_dir=C:\Program Files\Tivoli\lcf\bin\w32-ix86\mrt
lib_dir=C:\Program Files\Tivoli\lcf\bin\w32-ix86\mrt
cache_loc=C:\Program Files\Tivoli\lcf\dat\1\cache
cache_index=C:\Program Files\Tivoli\lcf\dat\1\cache\Index.v5
cache_limit=20480000
log_queue_size=1024
log_size=1024000
udp_interval=300
udp_attempts=6
login_interval=1800
lcs.machine_name=andrew1
lcs.crypt_mode=196608
lcfd_alternate_port=9496
recvDataTimeout=2
recvDataNumAttempts=10
recvDataQMaxNum=50
login_timeout=300
login_attempts=3

When you change endpoint configuration with the lcfd command, the last.cfg file changes. Therefore, you should 
not modify the last.cfg file. If you require changes, use the lcfd command to make any changes. 
However, running the lcfd command requires stopping and restarting the endpoint.

Another useful tool for endpoint problem determination is the output from the wtrace command. 
The wtrace command is useful for tracking upcall and downcall method failures. To learn more about the wtrace command, 
see Troubleshooting the Tivoli environment.


sample logfile "root@zd110l13:/beheer/Tivoli/lcf/dat/1/lcfd.log"


Nov 15 09:14:13 1 engineUpdate Sending msg amRaAdd
Nov 15 09:14:13 1 engineUpdate Sending msg amTmrRemove
Nov 15 09:14:13 1 engineUpdate Sending msg amMpeRemove
Nov 15 09:14:13 1 engineUpdate Sending msg amRaRemove
Nov 15 09:14:13 1 engineUpdate Sending msg amTmrAdd
Nov 15 09:14:13 1 engineUpdate Sending msg amMpeAdd
Nov 15 09:14:13 1 engineUpdate Sending msg amRaAdd
Nov 15 09:14:13 1 engineUpdate Sending msg amRaAdd
Nov 15 09:14:13 1 engineUpdate Sending msg amRaAdd
Nov 15 09:14:13 1 engineUpdate Sending msg amRaAdd
Nov 15 09:14:13 1 engineUpdate Sending msg amRaAddTasks
Nov 15 09:14:13 1 engineUpdate Sending msg amEndPush
Nov 15 09:18:48 1 lcfd Spawning: /beheer/Tivoli/lcf/dat/1/cache/bin/aix4-r1/TME/Tmw2k/tmw2k_ep, ses: 215c89e6
Nov 15 09:28:46 1 lcfd Spawning: /beheer/Tivoli/lcf/dat/1/cache/bin/aix4-r1/TME/Tmw2k/tmw2k_ep, ses: 215c8a22
Nov 15 09:33:51 1 lcfd Spawning: /beheer/Tivoli/lcf/dat/1/cache/bin/aix4-r1/TME/Tmw2k/tmw2k_ep, ses: 215c8a4e
Nov 15 09:48:55 1 lcfd Spawning: /beheer/Tivoli/lcf/dat/1/cache/bin/aix4-r1/TME/Tmw2k/tmw2k_ep, ses: 215c8ac6
Nov 15 10:03:58 1 lcfd Spawning: /beheer/Tivoli/lcf/dat/1/cache/bin/aix4-r1/TME/Tmw2k/tmw2k_ep, ses: 215c8b5b
Nov 15 10:19:02 1 lcfd Spawning: /beheer/Tivoli/lcf/dat/1/cache/bin/aix4-r1/TME/Tmw2k/tmw2k_ep, ses: 215c8c4a
Nov 15 10:34:05 1 lcfd Spawning: /beheer/Tivoli/lcf/dat/1/cache/bin/aix4-r1/TME/Tmw2k/tmw2k_ep, ses: 215c8cb6


root@zd110l05:/#find . -name "*tecad*" -print

./tmp/.tivoli/.tecad_logfile.fifo.zd110l05.aix-default
./tmp/.tivoli/.tecad_logfile.lock.zd110l05.aix-default
./tmp/.tivoli/.tecad_logfile.fifo.zd110l05.aix-defaultlogsourcepipe
./etc/Tivoli/tecad
./etc/Tivoli/tecad.1011792
./etc/Tivoli/tecad.1011792/bin/init.tecad_logfile
./etc/Tivoli/tec/tecad_logfile.cache
./etc/rc.tecad_logfile
./etc/rc.shutdown-pre-tecad_logfile
./etc/rc.tecad_logfile-pre-tecad_logfile
./etc/rc.tivoli_tecad_mqseries
find: 0652-023 Cannot open file ./proc/278708.
find: 0652-023 Cannot open file ./proc/315572.
find: 0652-023 Cannot open file ./proc/442616.
find: 0652-023 Cannot open file ./proc/475172.
./beheer/Tivoli/lcf/dat/1/cache/out-of-date/init.tecad_logfile
./beheer/Tivoli/lcf/dat/1/cache/out-of-date/tecad-remove-logfile.sh
./beheer/Tivoli/lcf/dat/1/cache/bin/aix4-r1/TME/TEC/adapters/bin/tecad_logfile.cfg
./beheer/Tivoli/lcf/dat/1/LCFNEW/CTQ/logs/trace_mqs_start_tecad__MQS_CC.Q3P0063__1__p1052790.log
./beheer/Tivoli/lcf/bin/aix4-r1/TME/TEC/adapters/bin/tecad_logfile
./beheer/Tivoli/lcf/bin/aix4-r1/TME/TEC/adapters/bin/init.tecad_logfile
./beheer/Tivoli/lcf/bin/aix4-r1/TME/TEC/adapters/bin/tecad-remove-logfile.sh
./beheer/Tivoli/lcf/bin/aix4-r1/TME/TEC/adapters/aix-default/etc/C/tecad_logfile.fmt
./beheer/Tivoli/lcf/bin/aix4-r1/TME/TEC/adapters/aix-default/etc/tecad_logfile.err
./beheer/Tivoli/lcf/bin/aix4-r1/TME/TEC/adapters/aix-default/etc/tecad_logfile.conf
./beheer/Tivoli/lcf/bin/aix4-r1/TME/TEC/adapters/aix-default/etc/tecad_logfile.cds
./beheer/Tivoli/lcf/bin/aix4-r1/TME/MQS/bin/tecad_mqseries.cfg
./beheer/Tivoli/lcf/bin/aix4-r1/TME/MQS/bin/tecad_mqseries.mqsc
./beheer/Tivoli/lcf/bin/aix4-r1/TME/MQS/bin/tecad_mqseries_nontme
./beheer/Tivoli/lcf/bin/aix4-r1/TME/MQS/bin/tecad_mqseries_tmegw
./beheer/Tivoli/lcf/bin/generic_unix/TME/MQS/sh/mqs_start_tecad.sh
./beheer/Tivoli/lcf/bin/generic_unix/TME/MQS/sh/mqs_stop_tecad.sh
./beheer/Tivoli/lcf/bin/generic_unix/TME/MQS/teccfg/tecad_mqseries.Q3P0063.cfg


dircmp:
=======

  
Linux / Unix dircmp command

About dircmp
Lists files in both directories and indicates whether the files in the directories are the same and/or different.

Syntax
dircmp [-d] [-s] [-w n] directoryone directorytwo.

-d Compare the contents of files with the same name in both directories and output a list telling what 
must be changed in the two files to bring them into agreement. The list format is described in diff(1). 
-s Does not tell you about the files that are the same. 
-w n Change the width of the output line to n characters. The default width is 72. 
directoryone The first directory for comparing. 
directorytwo The second directory for comparing. 

Examples
dircmp dir1 dir2 - Compares the directory dir1 with the directory dir2. Below is an example of the output 
you may receive when typing this command.

Feb 8 17:18 2001 Comparison of help issues Page 1

directory .
same ./favicon.ico
same ./logo.gif
same ./question.gif


kmcrca:
=======

Part of the IBM Tivoli OMEGAMON XE for WebSphere MQ suite.


KMCRCA Starts IBM Tivoli OMEGAMON XE for WebSphere MQ Configuration
KMQIRA Starts IBM Tivoli OMEGAMON XE for WebSphere MQ Monitoring
KRARLOFF Converts the historical data file to a neutral file format for use with
various analytical programs


FLASHCOPY:
==========

Some notes about flashcopy implementations:


Note 1:
=======

What is FlashCopy?
FlashCopy is a function designed to create an instant "copy" of some data. When an administrator issues a 
FlashCopy command that essentially says "make a copy of this data," SVC via FlashCopy immediately provides 
the appearance of having created a copy of the data, when in reality it creates the physical copy 
in the background before moving that copy to an alternative data-storage device, which can take some time 
depending on the size of the backup copy. However, it creates the appearance of having completed 
the copy instantaneously, so customers can have a backup copy available as soon as the command is issued, 
even though copying to a different storage medium takes place behind the scenes.

"Because it operates very quickly in this way, FlashCopy allows customers to make a copy and immediately 
move on to other work without having to wait for the data to actually physically be copied from one place 
to another," says Saul. "In that regard, SVC FlashCopy is very similar to FlashCopy on the DS8000, for example, 
with the difference being SVC FlashCopy operates on most storage devices attached to the SVC, spanning many 
different disk systems."


Note 2:
=======

FlashCopy
FlashCopy is an IBM feature supported on ESS (Enterprise Storage Servers) that allows you to make nearly 
instantaneous Point in Time copies of entire logical volumes or data sets. The HDS (Hitachi Data Systems) 
implementation providing similar function is branded as ShadowImage. Using either implementation, 
the copies are immediately available for both read and write access. 

-- FlashCopy Version 1
The first implementation of FlashCopy, Version 1 allowed entire volumes to be instantaneously "copied" to 
another volume by using the facilities of the newer Enterprise Storage Subsystems (ESS). 

Version 1 of FlashCopy had its limitations however. Although the copy (or "flash" of a volume occurred 
instantaneously, the FlashCopy commands were issued sequentially and the ESS required a brief moment 
to establish the new pointers. Because of this minute processing delay, the data residing on two volumes 
that were FlashCopied are not exactly time consistent. 

-- FlashCopy Version 2
FlashCopy Version 2 introduced the ability to flash individual data sets and more recently added support 
for "consistency groups". FlashCopy consistency groups can be used to help create a consistent point-in-time 
copy across multiple volumes, and even across multiple ESSs, thus managing the consistency of dependent writes. 

FlashCopy consistency groups are used in a single-site scenario in order to create a time-consistent copy of data 
that can then be backed-up and sent offsite, or in a multi-site Global Mirror for ESS implementation to force 
time consistency at the remote site. 

The implementation of consistency groups is not limited to FlashCopy. Global Mirror for z/Series (formerly known 
as XRC or eXtended Remote Copy) also creates consistency groups to asynchronously mirror disk data from one site 
to another over any distance. 


Note 3:
-------

http://www.ibm.com/developerworks/forums/thread.jspa?messageID=13967589


Q:

Using target volume from FlashCopy on same LPAR as source volume going thru VIO server 
Posted: Jun 28, 2007 12:21:09 PM       Reply  
 
Synopsis:

DS4500 logical drive mapped to a p5 550 VIO server, then mapped to an AIX partition. Without interrupting 
the source drive, created a flashcopy of the drive and mapped it to the same VIO server, then again to 
the same partition. This caused duplicate VGID on the system. Had to varyoff and export the volume group 
to run recreatevg against the flashcopy hdisk and make a new volume group with it. This works fine 
the first time, however after I varyoff the vg and export it, then disable the flashcopy, and re-create 
it I cannot import or varyon the vg on the partition. importvg and recreatevg both say the hdisk belongs 
to a different vg so they don't work. The varyvg fails because the descriptors are not constitent.

How do I create a flashcopyvg on this partition using virtual disk from the VIO so that the process 
is repeatable and thus scriptable without having to interrupt the source volume group everytime I do this. 
The intent is to be able run a backup process against the flashcopy then disable it and do it again a few hours 
later and repeat it several times each day. We are using legacy vsam instead of a DB and need to keep the 
data accessible to our CICS system, while being able to capture point in time backups throughout the day.  
 

A:
 
Did you rmdev the vpath and hdisks before recreating the flash copy? Then you will need to run recreatevg again, 
as restarting the flash copy will change the pvid back to the same as the source volume.

Why not just attach the flash copy to another host? Then you won't need to run recreate vg and you could assign 
the flash copy to the original host if you need to recover the data. 
 

==============================
2. NOTES ABOUT SHELL PROGRAMS:
==============================

-------------------------------------------------------------
NOTE 1:

# sh dothat         (the dothat file contains commands)

# chmod 755 dothat  (or use: chmod +x dothat
# dothat            (now it's executable)
-------------------------------------------------------------
NOTE 2:

# now=`date`    (variable is output of a command)`
# echo $now

This means that commands are read from the string between two ` `.
Usage in a nested command goes like this:
font=`grep font \`cat filelist\``

-------------------------------------------------------------
NOTE 3:

To extend the PATH variable, on most systems use a statement like the following example:

$ export PATH=$PATH:$ORACLE_HOME/bin
$ export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:$ORACLE_HOME/lib

PATH=.:/usr/bin:/$HOME/bin:/net/glrr/files1/bin  
export PATH  


-------------------------------------------------------------
NOTE 4:

Positional parameters:


program, function or shell                         $0
argument 1 through 9                               $1 .. $9
nth argument                                       ${n}
number of positional parameters                    $#
every positional parameter                         $@, $*
decimal value returned by last executed cmd        $?     if the former was successful this will be 0
pid of shell                                       $$
pid of last backgrounded command                   $!


Examples of usage of $n parameter:
----------------------------------

# dothis grapes apples pears

$1=grapes, $2=apples, $3=pears


# cat makelist
sort +1 -2 people | tr -d "[0-9]" | pr -h Distribution | lp

  This will only work on that filename, that is in this example, people.

# cat makelist
sort +1 -2 $1 | tr -d "[0-9]" | pr -h Distribution | lp

  This will work on ANY file given as an argument to makelist

  # makelist file1
  # makelist file2

-------------------------------------------------------------
NOTE 5:

# echo "Hi there $LOGNAME"
Hi there Albert

# echo 'Hi there $LOGNAME'
Hi there $LOGNAME

# echo "You owe me \$$amount" | mail 

Single-quotes are literal quotes.
Double-quotes can have their contents expanded

-------------------------------------------------------------
NOTE 6: 

How to set variables:
--------------------

- A variable name must begin with a letter and can contain letters, digits, and underscores,
but no special characters.

- A variable is set with an assignment of NAME=value
Be sure not to have any white space before or after the equals sign =.
Double quotes are used when white space is present in the text string you are assigning to the variable.
So here are a few examples:

ME=bill
BC="bill clinton"

Now the shell can react and use the variable $ME and it substitutes the value for that variable.

Local and environment variables:
--------------------------------

variables that you set are local to the current shell unless you mark them for excport.
Variables marked for export are called environment variables, and will be made available
to any command that the shell creates. The following command marks the variable BC for export:

export BC

You can list local variables by typing the set command.
You can list the environment variables by using the env command.

-------------------------------------------------------------
NOTE 7:

for file in list_of_values
do
  sort +1 -2 $file | tr -d "[0-9]" | pr -h Distribution | lp
done


if test $# -eq 0
  then echo "You must give a filename"
  exit 1
fi


-eq=equal, -ne=not equal, -gt=greater than, -lt=less then, -ge=greater or equal, -le=less or equal

-------------------------------------------------------------
NOTE 8:

UNIX$ for file in `ls /local/ssl/misc/*` 
> do 
> echo I found a config file $file
> echo Its type is `/usr/bin/file $file`
> done

-------------------------------------------------------------
NOTE 9:

If a script is to accept arguments then these can be referred to as ` $1 $2 $3..$9'. 
There is a logical limit of nine arguments to a Bourne script, but Bash handles the next arguments as `${10}'. 
`$0' is the name of the script itself. 

Here is a simple Bash script which prints out all its arguments. 

#!/bin/bash
# 
# Print all arguments (version 1)
#

for arg in $*
do
  echo Argument $arg
done

echo Total number of arguments was $#


The `$*' symbol stands for the entire list of arguments and `$#' is the total number of arguments.

-------------------------------------------------------------
NOTE 10: Start and End of Command

A command starts with the first word on a line or if it's the second command on a line 
with the first word after a";'.
A command ends either at the end of the line or whith a ";". So one can put several commands onto one line:

print -n "Name: "; read name; print ""


One can continue commands over more than one line with a "\" immediately followed by a newline sign 
which is made be the return key:

grep filename | sort -u | awk '{print $4}' | \
uniq -c >> /longpath/file

-------------------------------------------------------------
NOTE 11:

Bash and the Bourne shell has an array of tests. They are written as follows. 
Notice that `test' is itself not a part of the shell, but is a program which works out 
conditions and provides a return code. See the manual page on `test' for more details. 

test on file chracteristics:

test -f file        True if the file is a plain file 
test -d file        True if the file is a directory 
test -r file        True if the file is readable 
test -w file        True if the file is writable 
test -x file        True if the file is executable 
test -h file        True if the file is a symbolic link
test -s file        True if the file contains something 
test -g file        True if setgid bit is set 
test -u file        True if setuid bit is set 

string comparisons:

test s1 = s2        True if strings s1 and s2 are equal 
test s1 != s2       True if strings s1 and s2 are unequal 

numeric comparisons:

test x -eq y        True if the integers x and y are numerically equal 
test x -ne y        True if integers are not equal 
test x -gt y        True if x is greater than y 
test x -lt y        True if x is less than y 
test x -ge y        True if x>=y 
test x -le y        True if x <= y 

! Logical NOT operator 
-a Logical AND 
-o Logical OR 

Note that an alternate syntax for writing these commands is to use the square brackets,
instead of writing the word test. 

 [ $x -lt $y ]   "=="    test $x -lt $y

Just as with the arithmetic expressions, Bash 2.x provides a syntax for conditionals 
which are more similar to Java and C. While arithmetic C-like expressions can be used within double parentheses, 
C-like tests can be used within double square brackets. 

 [[ $var == "OK" || $var == "yes" ]]

This C-like syntax is not allowed in the Bourne shell, but is equivalent to 

[ $var = "OK" -o $var = "yes" ]

which is valid in both shells. 

Arithmetic C-like tests can be used within double parentheses so that under Bash 2.x the following tests are equivalent: 

 [ $x -lt $y ]   "==" (( x < y ))

Example:

#!/bin/ksh


if [ `whoami` != root ]
then
  echo RUN AS ROOT !!!
  exit
fi


Sometimes you will encounter the $variable==value syntax, like in

if $x==5
then

meaning if $x equals 5 then ..


-------------------------------------------------------------
NOTE 12:

alphabet="a b c d e"			# Initialise a string
count=0					# Initialise a counter
for letter in $alphabet			# Set up a loop control
do					# Begin the loop
    count=`expr $count + 1`		# Increment the counter
    echo "Letter $count is [$letter]"	# Display the result
done					# End of loop


alphabet="a b c d e"						# Initialise a string
count=0								# Initialise a counter
while [ $count -lt 5 ]						# Set up a loop control
do								# Begin the loop
    count=`expr $count + 1`					# Increment the counter
    position=`bc $count + $count - 1`   			# Position of next letter
    letter=`echo "$alphabet" | cut -c$position-$position`	# Get next letter 
    echo "Letter $count is [$letter]"				# Display the result
done								# End of loop


if [ -f $dirname/$filename ]
then
    echo "This filename [$filename] exists"
elif [ -d $dirname ]
then
    echo "This dirname [$dirname] exists"
else
    echo "Neither [$dirname] or [$filename] exist"
fi


if [$1 -le $2] ; then
    echo "$2 is de grootste "
else
    echo "$1 is de grootste"
fi

-------------------------------------------------------------
NOTE 13: Loops and conditionals:

loops:

  for-do-done
  while-do-done
  until-do-done

conditionals:

  if-then-else-fi
  case-esac
  &&
  ||


IF
==
The basic type of condition is "if". 

if [ $? -eq 0 ] ; then
	print we are okay
else
	print something failed
fi

IF the variable $? is equal to 0, THEN print out a message. Otherwise (else), print out a different message. 
FYI, "$?" checks the exit status of the last command run. 
The final 'fi' is required. This is to allow you to group multiple things together. 
You can have multiple things between if and else, or between else and fi, or both.
You can even skip the 'else' altogether, if you dont need an alternate case. 

if [ $? -eq 0 ] ; then
	print we are okay
	print We can do as much as we like here
fi


if [ -f /tmp/errlog ] 
   then
      rm /tmp/errlog
   else
      echo "no errorlog found"
fi


if [ ! -f /tmp/errlog ]
   then

#!/usr/bin/ksh
if [ `cat alert.log|wc -l` -gt 1 ]
then
   echo "something you want to say if alert.log contains more than 1 line"
else
   echo "something else you want to say"
fi


CASE
====
The case statement functions like 'switch' in some other languages. Given a particular variable, 
jump to a particular set of commands, based on the value of that variable. 
While the syntax is similar to C on the surface, there are some major differences; 

The variable being checked can be a string, not just a number 
There is no "fall through". You hit only one set of commands 
To make up for no 'fall through', you can 'share' variable states 
You can use WILDCARDS to match strings 

echo input yes or no
read  answer
case $answer in
	yes|Yes|y)
		echo got a positive answer
		# the following ';;' is mandatory for every set
		# of comparative xxx)  that you do
		;;
	no)
		echo got a 'no'
		;;
	q*|Q*)
		#assume the user wants to quit
		exit
		;;
		
	*)
		echo This is the default clause. we are not sure why or
		echo what someone would be typing, but we could take
		echo action on it here
		;;
esac

Sometimes you want to break out a while loop, which contains a case, like in this example:

#!/bin/sh

echo "Please talk to me ..."
while :
do
  read INPUT_STRING
  case $INPUT_STRING in
	hello)
		echo "Hello yourself!"
		;;
	bye)
		echo "See you again!"
		break
		;;
	*)
		echo "Sorry, I don't understand"
		;;
  esac
done
echo 
echo "That's all folks!"

In this example, the program loops if the user typed "hello". But if the user types "bye", the "break" statement will
quit the loop.

Note:  ":" evaluates to "true", but you might also have used "while true".


&& and ||
=========

The simples conditional in the Bourne shell is the double ampersand &&.
When two commands are separated by a double ampersand, the second command executes
only if the first command returns a zero exit status (succesful completion)

Example:

ls -ld /usr/bin > /dev/null && echo "Directory Found"

The opposite of && is the ||. When two commands are separated by ||, the second command executes
only if the first command returns a nonzero exit status (indicating failure).

Example:

ls -d /usr/foo || echo "No Directory Found"

If the directory does not exist, the message is displayed.


Loops

WHILE
=====
The basic loop is the 'while' loop; "while" something is true, keep looping.
There are two ways to stop the loop. The obvious way is when the 'something' is no longer true. 
The other way is with a 'break' command. 


keeplooping=1;
while [[ $keeplooping -eq 1 ]] ; do
	read quitnow
	if [[ "$quitnow" = "yes" ]] ; then
		keeplooping=0
	fi
	if [[ "$quitnow" = "q" ]] ; then
		break;
	fi
done


UNTIL
=====
The other kind of loop in ksh, is 'until'. The difference between them is that 'while' implies looping while 
something remains true.
'until', implies looping until something false, becomes true 

until [[ $stopnow -eq 1 ]] ; do
	echo just run this once
	stopnow=1;
	echo we should not be here again.
done


FOR
===
A "for loop", is a "limited loop". It loops a specific number of times, to match a specific number of items. 
Once you start the loop, the number of times you will repeat is fixed. 
The basic syntax is 

for i in eat run jump play
do
    echo See Albert $i
done

  See Albert eat
  See Albert run
  See Albert jump
  See Albert play


for i in "eat run jump play"
do
    echo See Albert $i
done

  See Albert eat run jump play


for var in one two three ; do
	echo $var
done

Whatever name you put in place of 'var', will be updated by each value following "in". 
So the above loop will print out 

one
two
three

But you can also have variables defining the item list. They will be checked ONLY ONCE, when you start the loop. 

list="one two three"
for var in $list ; do
	echo $var
	# Note: Changing this does NOT affect the loop items
	list="nolist"
done

The two things to note are: 
It stills prints out "one" "two" "three" 
Do NOT quote "$list", if you want the 'for' command to use multiple items 
If you used "$list" in the 'for' line, it would print out a SINGLE LINE, "one two three" 


for i in 1 2 3 4 5 6 7
do
    cp x.txt $i
done


-------------------------------------------------------------
NOTE 14: Arrays

Arrays 
Yes, you CAN have arrays in ksh, unlike old bourne shell. The syntax is as follows: 

# This is an OPTIONAL way to quickly null out prior values
set -A array
#
array[1]="one"
array[2]="two"
array[3]="three"
three=3

print ${array[1]}
print ${array[2]}
print ${array[3]}
print ${array[three]}


Briefly, an array contains a collection of values (elements) that may be accessed individually or as a group.  
Although newer versions of the Korn shell support more than one type of array, this tip will only apply 
to indexed arrays.

When assigning or accessing array elements, a subscript is used to indicate each element's position 
within the array.  The subscript is enclosed by brackets after the array name:
 
arrayname[subscript]
 
The first element in an array uses a subscript of 0, and the last element position (subscript value) 
is dependent on what version of the Korn shell you are using.  Review your system's Korn shell (ksh) 
man page to identify this value.

In this first example, the colors red, green, and blue are assigned to the first three positions of an array 
named colors:
 
$ colors[0]=RED
$ colors[1]=GREEN
$ colors[2]=BLUE
 
Alternatively, you can perform the same assignments using a single command:
 
$ set -A colors RED GREEN BLUE
 
Adding a dollar sign and an opening brace to the front of the general syntax and a closing brace on the end 
allows you to access individual array elements:
 
${arrayname[subscript]}

Using the array we defined above, let's access (print) each array element one by one:
 
$ print ${colors[0]}
RED
$ print ${colors[1]}
GREEN
$ print ${colors[2]}
BLUE
$
 
If you access an array without specifying a subscript, 0 will be used:
 
$ print ${colors[]}
RED
$

 
The while construct can be used to loop through each position in the array:
 
$ i=0
$ while [ $i -lt 3 ]
> do
> print ${colors[$i]}
> (( i=i+1 ))
> done
RED
GREEN
BLUE
$
 
Notice that a variable (i) was used for the subscript value each time through the loop.  
 

Special variables
There are some "special" variables that ksh itself gives values to. Here are the ones I find interesting 
PWD - always the current directory 
RANDOM - a different number every time you access it 
$$ - the current process id (of the script, not the user's shell) 
PPID - the "parent process"s ID. (BUT NOT ALWAYS, FOR FUNCTIONS) 
$? - exit status of last command run by the script 
PS1 - your "prompt". "PS1='$PWD:> '" is interesting. 
$1 to $9 - arguments 1 to 9 passed to your script or function 

Tweaks with variables
Both bourne shell and KSH have lots of strange little tweaks you can do with the ${} operator. 
The ones I like are below. 


To give a default value if and ONLY if a variable is not already set, use this construct: 


APP_DIR=${APP_DIR:-/usr/local/bin}

(KSH only)
You can also get funky, by running an actual command to generate the value. For example 


DATESTRING=${DATESTRING:-$(date)}


(KSH only)
To count the number of characters contained in a variable string, use ${#varname}. 


  echo num of chars in stringvar is ${#stringvar}

-------------------------------------------------------------
NOTE 15:

Appending dates to files and such:

Example 1:
----------

# mv logfile logfile.`date`
# mv logfile logfile.`date + %Y.%m.%d`

Example 2:
----------

MS korn shell:
# now=`date -u %d`;export now
# echo $now
24


------------------------------------------------------------
NOTE 16: tput

What is tput?

The tput command initializes and manipulates your terminal session through the terminfo database. Using tput, you can alter 
several terminal capabilities, such as moving or altering the cursor, changing text properties, 
and clearing specific areas of the terminal screen. 

Command-line introduction to tput

The tput command, like most commands in UNIX, can be used either at your shell command line or inside a shell script. 
To gain a better understanding of tput, this article starts with the command line, and then continues into shell script examples. 

Cursor attributes

Moving the cursor or altering its attributes can be helpful in UNIX shell scripts or at the command line. 
There may be times when you're required to enter sensitive information, such as a password, or enter information in two 
different areas of the screen. Using tput can help you in such conditions. 

Moving the cursor

Moving the cursor's position on the respective device is easily done with tput. Using the cup option, or cursor position, in tput, 
you can move the cursor to any X or Y coordinates in the device's rows and columns. 
The top left coordinates of the device are 0,0. 

To move the cursor to the fifth column (X) and the first row (Y) on a device, simply execute tput cup 5 1. 
Another example would be tput cup 23 45, which would move the cursor to the forty-fifth row in the twenty-third column. 

Moving the cursor and displaying information

Another useful cursor position trick is to move the cursor, execute a command to display information, 
and then return to the previous cursor location: 

(tput sc ; tput cup 23 45 ; echo "Input from tput/echo at 23/45" ; tput rc)
 

Let's break down the subshell commands:

tput sc

The current cursor location must be saved first. To save the current cursor position, include the sc option, 
or "save cursor position." 

tput cup 23 45
 
After the cursor location has been saved, the cursor coordinates will be moved to 23,45. 

echo "Input from tput/echo at 23/45"
 
Display information to stdout.

tput rc
 
When the information has been displayed, the cursor must return to the original location that was saved with tput sc. 
To return the cursor to its last saved location, include the rc option, or "restore cursor position." 


------------------------------------------------------------
NOTE 17: Doing some arithmetic


CleanOldArchiveFiles()
{
cd $T2_ARCH_DIR
COUNT_BEFORE=$(find ${T2_ARCH_DIR} -type f -name "T2*" -exec ls -al {} \; | wc -l)
PRESENT_DIR=`pwd`

if [ $PRESENT_DIR==$T2_ARCH_DIR ]       # Let.s make sure we are in the right directory.
   then
      find . -name "T2*" -mtime +30 -exec rm {} \;
      AF_EXITCODE=$?
      if (( AF_EXITCODE == 0 ))
      then
        EXIT_CODE=0
      else
        EXIT_CODE=1
      fi
fi

COUNT_AFTER =$(find ${T2_ARCH_DIR} -type f -name "T2*" -exec ls -al {} \; | wc -l)

DELTA=`expr $COUNT_BEFORE - $COUNT_AFTER`

}

------------------------------------------------------------
NOTE 18: Examples


Example 1:
----------

#!/usr/bin/ksh
# Monitor the SPL p550 server
# By Albert
# version 0.1

umask 022

date=`date +%d-%m-%y`
time=`date +%H:%M`
emailers=albertvandersel@zonnet.nl

echo "$date $time" > /tmp/topper
df >> /tmp/topper

mailx -r albertvandersel@zonnet.nl -s "::: Disk info p550 :::" $emailers < /tmp/topper
rm /tmp/topper

exit 0


cat /export/home/fas/RSP/RSP.log | grep "ORA-01000" > /tmp/brokencursor.err
mailx -r noreply@ricoh-europe.com -s "::: Process info NLIHblabla-08 :::" $emailers < /tmp/topper


'mailx' to send someone an email


Example 2:
----------

#!/bin/ksh
# Monitor rsp logfile
#
PATH=/usr/ucb:/bin:/usr/bin:/sbin:/usr/sbin:/usr/local/bin:/usr/local/etc:/usr/opt/SUNWmd/sbin
export PATH

umask 022

date=`date +%d-%m-%y`
time=`date +%H:%M`

emailers=nobuya.horii@ricoh-europe.com,Nathan.Bohn@firepond.com

cat /export/home/fas/RSP/RSP.log | grep "ORA-01000" > /tmp/brokencursor.err

if [ -s /tmp/brokencursor.err ] 
   then
      # echo "$date $time" > /tmp/brokencursor.err
      mailx -r noreply@ricoh-europe.com -s "::: Check on ORA-01000 :::" $emailers < /tmp/brokencursor.err
   else
      echo "all OK" >> /tmp/brokencursor.log
fi

/bin/rm /tmp/brokencursor.err

exit 0


Example 3: Automatic startup of an application
----------------------------------------------

#!/bin/ksh

# name: spl
# purpose: script that will start or stop the spl stuff.


case "$1" in
start )
        echo "starting spl"
        echo "su - ccbsys -c '/prj/spl/SPLS3/bin/splenviron.sh -e SPLS3 -c "spl.sh -t start"'"
        su - ccbsys -c '/prj/spl/SPLS3/bin/splenviron.sh -e SPLS3 -c "spl.sh -t start"'
        ;;
stop )
        echo "stopping spl"
        echo "su - ccbsys -c '/prj/spl/SPLS3/bin/splenviron.sh -e SPLS3 -c "spl.sh -t stop"'"
        su - ccbsys -c '/prj/spl/SPLS3/bin/splenviron.sh -e SPLS3 -c "spl.sh -t stop"'
        ;;
* )
        echo "Usage: $0 (start | stop)"
        exit 1
esac


Example 4: scheduled cleanup of logs
------------------------------------

#!/usr/bin/ksh

SAVEPERIOD=5

echo "/prj/spl/splapp/SPLQ3"| \
while read DIR
do
   cd $(DIR)
   find . -type f -mtime +$(SAVEPERIOD) -exec rm {} \;
done

exit 0


Example 5: some testing examples
--------------------------------

Initialise()
{
	export SPLcomplog=$SPLSYSTEMLOGS/initialSetup.sh.log
        if [ -f $SPLcomplog ]
        then
           rm -f $SPLcomplog
           export RSP=$?
           if [ $RSP -ne 0 ]
           then
              echo "ERROR - Cannot remove the old Log file $SPLcomplog "
              exitFunc $RSP
           fi
        fi
        touch $SPLcomplog
        export RSP=$?
        if [ $RSP -ne 0 ]
        then
           echo "ERROR - Cannot create Log file $SPLcomplog "
           exitFunc $RSP
        fi
	export TMP1=$SPLSYSTEMLOGS/initialSetup.sh.tmp
}

exitFunc()
{
	export RSP=$1
	Log "Exiting $SCRIPTNAME with return code $RSP"
	if [ -f $TMP1 ]
	then
		rm -f $TMP1 > /dev/null 2>&1
	fi
	exit $RSP
}

testDBconnection()
{
	Log "Testing Database connection parameters entered in configureEnv.sh"
	if [ `which db2|wc -w` -gt 1 ]
	then
		Log "ERROR : cannot find \"db2\" Program. This is a PATH prerequisit to the Install"
		exitFunc 1
	fi
	. cisconnect.sh > $TMP1 2>&1
	export RSP=$?
	if [ $RSP -ne 0 ]
	then
		Log "ERROR : connecting to Database:"
		Log -f "$TMP1"
		Log "ERROR : Rerun configureEnv.sh and ensure database connection parameters are correct"
		Log "ERROR : Check DB2 Connect configuration to ensure connection is o.K."
		exitFunc $RSP
	fi
		
}


Other example:

check_cron() {
# check of commando door cron of met de hand wordt uitgevoerd #
CRON_PID=`ps -ef | grep check_sudo | grep -v grep | awk '{print $3}'`
    if [[ `ps -p ${CRON_PID} | grep -v TIME | awk '{print $4}'` == "cron" ]]
    then
        CRON_RUN="yes"
        # Genereer een sleeptime nummer, voorkom daarmee dat alle clients tegelijk de Distroserver benaderen #
        random_sleeptime
    else
        CRON_RUN="no"
        SLEEPTIME="1"
    fi
}


Example 6:
----------

P550:/home/reserve/bin $ cat CheckAppl
#!/usr/bin/ksh
#
#
# variabelen initialisatie
appl=/prj/etm/1.5.20
#
# to start trace option execute set -x
if [ $1 = d ]
then
        set -x
fi
#set -x
#
# cleanscreen
clear
echo "Just a moment"
#
for i in `cat /etc/cistab | sed -e 's/:/ /g'| awk '{ print $1 }'`
do
        #aantal processen
        # initialisatie
        aantal_processen=0
        aantal_jsl=0
        aantal_jrepsvr=0
        aantal_bbl=0
        aantal_tuxfulladm=0
        aantal_tuxfullall=0
        #
        aantal_processen=`ps -ef|grep $i| grep -v grep | wc -l`
        if [ $aantal_processen = 0 ]
        then
                status=DOWN
        else
                status=UP
                # aantal JSL
                aantal_jsl=`ps -ef|grep $i | grep -i JSL | wc -l`
                # aantal JREPSVR
                aantal_jrepsvr=`ps -ef|grep $i | grep -i JREPSVR | wc -l`
                # aantal_BBL
                aantal_bbl=`ps -ef|grep $i | grep -i BBL | wc -l`
                # aantal_TUXFULLadm
                aantal_tuxfulladm=`ps -ef|grep $i | grep -i tuxfulladm | wc -l`
                # aantal_TUXFULLall
                aantal_tuxfullall=`ps -ef|grep $i | grep -i tuxfullall | wc -l`

        fi

        if [ $status = UP ] ; then

                 echo "$i $status BBL($aantal_bbl) TUXFULLall($aantal_tuxfullall) TUXFULLadm($aantal_tuxfulladm)    JSL( $aantal_jsl )(  JREPSVR(  $aantal_jrepsvr ) "
        else
                echo $i $status
        fi
done   | sort +1

# check logs
echo
echo "Check backup to rmt0"
echo "--------------------"
tail -2 /opt/back*/backup_to_rmt0.log
echo
echo "Check backup to rmt1"
echo "--------------------"
tail -7 /opt/backupscripts/backup_to_rmt1.log

echo
echo "Check backup from 520"
echo "---------------------"
ls -l /backups/520backups/oradb/conv.dmp
ls -l /backups/520backups/splenvs/*tar*


Example 7:
----------

#!/bin/sh

getinfo() {
        USER=$1
    PASS=$2
    DB=$3

    CONN="${USER}/${PASS}@${DB}"

    echo "
    set linesize 1000
    set pagesize 1000
    set trimspool on
    SELECT CIS_RELEASE_ID,':', CM_RELEASE_ID
    FROM CI_INSTALLATION;
    " | sqlplus -S $CONN | grep '[0-9a-zA_Z]'
}

if [ $# -gt 0 ]
then
        DB="$1"
else
        DB="$SPLENVIRON"
fi

if [ "x$DB" = x ]
then
        echo "dbvers: no environment"
        exit 1
fi

getinfo cisread cisread $DB | sed -e 1d -e 's/[         ]//g'


Example 8:
----------

#!/usr/bin/sh

. $HOME/.profile >/dev/null 2>&1

[ $# -ne 1 ] && exit 1

MARKER=/home/cissys/etc/marker-file

if [ $1 = "setmarker" ]
then
        /bin/touch $MARKER

        exit 0
fi

if [ $1 = "cleanup" ]
then
        [ \! -f $MARKER ] && exit 1

        for DIR  in `cut -d: -f4 /etc/cistab`
        do
                 /usr/bin/find $DIR \! -newer $MARKER -type f -exec rm -f {} \;
        done

        exit 0
fi

if [ $1 = "runbatch" ]
then
        for ETM  in `cut -d: -f1 /etc/cistab`
        do
                DIR1=`grep $ETM /etc/cistab|cut -d: -f3`
                DIR2=`grep $ETM /etc/cistab|cut -d: -f4`
                $DIR1/bin/splenviron.sh -q -e $ETM -c cdxcronbatch.sh \
                        >>$DIR2/cdxcronbatch.out 2>&1
        done

        exit 0
fi

exit 1


Example 9:
----------

date >> /opt/backupscripts/backupdatabases.log

cd /backups/oradb

if [ -f spltrain.dmp ]
   then
      echo "backup of spltrain is OK" >> /opt/backupscripts/backupdatabases.log
   else
      echo "error backup of spltrain " >> /opt/backupscripts/backupdatabases.log
fi


Example 10:
-----------

#!/usr/bin/ksh

# target : print configuration
#set -x
DAY=`date +%Y%m%d`
HOSTNAME=`hostname`
LOG=/home/reserve/log/ListConfiguration/LsConf.$DAY.$HOSTNAME.P522

date     >> $LOG
echo "++++++++++++++++++++++++++++++++++++++++++++" >> $LOG
echo "lsdev -P " >> $LOG
lsdev -P >> $LOG
echo "++++++++++++++++++++++++++++++++++++++++++++" >> $LOG
echo "lsdev -C " >> $LOG
lsdev -C >> $LOG
echo "++++++++++++++++++++++++++++++++++++++++++++" >> $LOG
echo "lsconf">> $LOG
lsconf   >> $LOG
echo "++++++++++++++++++++++++++++++++++++++++++++" >> $LOG
for i in `lsdev -C | awk '{ print $1 }`
do
        echo "lsdev -C -l $i" >> $LOG
        lsattr -E -l $i >> $LOG
        echo >> $LOG
done
echo "++++++++++++++++++++++++++++++++++++++++++++" >> $LOG
for i in `lsvg`
do
        lsvg -p $i
        lsvg -l $i
done >> $LOG
echo "++++++++++++++++++++++++++++++++++++++++++++" >> $LOG
for i in `lspv | awk '{ print $1 }'`
do
        lspv -l $i
        lspv -L $i
        lspv -M $i
        lspv -p $i
done >>$LOG


echo "++++++++++++++++++++++++++++++++++++++++++++" >> $LOG
echo "/etc/passwd">> $LOG
cat /etc/passwd >> $LOG
echo "++++++++++++++++++++++++++++++++++++++++++++" >> $LOG
echo "/etc/group" >> $LOG
cat /etc/group  >> $LOG


Example 11:
-----------

Make dynamic Oracle exports from a shell script. You do not need to list exp statements per database,
this will be extracted from som file, like /etc/oratab.

#!/usr/bin/ksh
DATE=`date +%Y%m%d`
HOSTNAME=`hostname`
ORACONF=/etc/rc.oratab
set -x
# MAKE SURE THE ENVIRONMENT IS OK
ORACLE_BASE=/apps/oracle; export ORACLE_BASE
ORACLE_HOME=/apps/oracle/product/9.2; export ORACLE_HOME
LIBPATH=/apps/oracle/product/9.2/lib; export LIBPATH
ORACLE_TERM=xterm;export ORACLE_TERM
export ORA_NLS33=$ORACLE_HOME/ocommon/nls/admin/data
LD_LIBRARY_PATH=$ORACLE_HOME/lib; export LD_LIBRARY_PATH
export TNS_ADMIN=/apps/oracle/product/9.2/network/admin
export ORAENV_ASK=NO

PATH=/usr/local/bin:/usr/bin:/etc:/usr/sbin:/usr/ucb:/usr/bin/X11:/sbin:/usr/java131/jre/bin;export PATH
PATH=$ORACLE_HOME/bin:$PATH;export PATH

# SAVE THE FORMER BACKUPS: LETS KEEP 1 EXTRA DAY ONLINE
# Lets copy the current file to another filesystem:
cd /backups/oradb
# Now lets save the current file on the same filesystem in 1dayago
cd /backups/oradb/1dayago
mv spl*dmp /backups/oradb/2dayago

cd /backups/oradb
mv spl*dmp /backups/oradb/1dayago

# Now create a new export file in /backups/oradb

ExpOracle()
{
set -x
        for i in `cat ${ORACONF} | grep -v \# | awk '{ print $1 }'`
        do
                SID_NAME=$i
                BOOT=`grep $SID_NAME $ORACONF | awk '{ print $2}'`
                if [ $BOOT = Y ] ;
                then
                        su - oracle -c "
                        ORACLE_SID=${SID_NAME}
                        export ORACLE_SID
                        cd /backups/oradb
                        exp system/cygnusx1@$SID_NAME file=$SID_NAME.$HOSTNAME.$DATE.dmp full=y statistics=none
                        EOF "
                fi
                sleep 5
                if [ -f $SID_NAME.$HOSTNAME.$DATE.dmp ]
                then
                        echo "backup of $SID_NAME is OK" >> /opt/backupscripts/backupdatabases.log
                else
                        echo "error backup of $SID_NAME " >> /opt/backupscripts/backupdatabases.log
                fi

        done
}

ExpOracle


Example 12:
-----------

Running sqlplus from a shell script.

$ORACLE_HOME/bin/sqlplus -s "/ as sysdba" <<EOF 1> $tmp_file 2>1
   set heading off feedback off
   whenever sqlerror exit
   select 'DB_NAME=' || name from v\$database;
   .. # possible other stuff
   exit
EOF


Example 13:
-----------

Kill a paricular process, that runs from "/dir/dir/abc" :

kill `ps -ef | grep /dir/dir/abc | grep -v grep | awk '(print $2)'`


Example 14:
-----------

#!/usr/bin/ksh
#
# description: start and stop the Documentum Content Server environment from dmadmin account
# called by:   dmadmin
#

DOCBASE_NM1=dmw_et
DOCBASE_NM2=dmw_et3

function log
{
        echo $(date +"%Y/%m/%d %H.%M.%S %Z") 'documentum.sh:' ${@}
}


# See how we were called.
case $1 in

  start)
    # Starting DocBroker
    cd $DOCUMENTUM/dba
    ./dm_launch_Docbroker
    ./dm_start_$DOCBASE_NM1
    ./dm_start_$DOCBASE_NM2
    # Starting Tomcat services
    cd $DM_HOME/tomcat/bin
    ./startup.sh
  ;;

  stop)
    # Stopping Tomcat services
    cd $DM_HOME/tomcat/bin
    ./shutdown.sh
    # Stopping DocBroker
    cd $DOCUMENTUM/dba
    ./dm_shutdown_$DOCBASE_NM1
    ./dm_shutdown_$DOCBASE_NM2
    ./dm_stop_Docbroker
  ;;

  clean_logs)
    # Call myself to stop stuff
    ${0} stop
    # Stopping Tomcat services
    find $DOCUMENTUM/dba/log -type f -name "*" -exec rm -rf {} \;
    # Call myself to restart stuff
    ${0} start
  ;;

  clean_performance)
    # Call myself to stop stuff
    ${0} stop
    # Stopping Tomcat services
    find $DOCUMENTUM/dba/log -type d -name "perftest*" -exec rm -rf {} \;
    find $DOCUMENTUM/dba/log -type d -name "perfuser*" -exec rm -rf {} \;
    find $DOCUMENTUM/dba/log -type d -name "RefAppUser*" -exec rm -rf {} \;
    # Call myself to restart stuff
    ${0} start
  ;;

  kill)
    cd $DOCUMENTUM/dba
    ./dm_launch_Docbroker -k
  ;;

  *)
  echo "Usage: $0 {start|stop|kill|clean_logs|clean_performance}"
  exit 1
esac

exit 0


Example 15: 
-----------

# Check bij aanloggen of voor dit domein een dmgr of nodeagent draait
WASUSR=`whoami`
echo ""

DMAOK=`ps -ef | grep $WASUSR | grep dmgr | grep -v grep`
if [ ! "$DMAOK" = "" ]; then
  echo "Deployment Manager is running!"
else
  echo "NOTE: Deployment Manager is NOT running!"
fi
echo ""

WASOK=`ps -ef | grep $WASUSR | grep nodeagent | grep -v grep`
if [ ! "$WASOK" = "" ]; then
  echo "Nodeagent is running!"
else
  echo "NOTE: Nodeagent is NOT running!"
fi
echo ""

if [ "$WASOK" = "" ] || [ "$DMAOK" = "" ]; then
  echo "The $WASUSR-menu can be used to start the Deployment manager or Nodeagent."
  echo "Type 'menu' on the command-prompt followed by ENTER to start the menu."
  echo ""
fi


if (( $# == 1 )) && [[ "${1}" = "?" ]]
if [ "$WASOK" = "" ] || [ "$DMAOK" = "" ]

Example 16:
-----------

Example with input from user:

echo ""
read CLEARMSG?"Press C or c to clear this message or any other key to keep it : "
if [ "${CLEARMSG}" = "C" ] || [ "${CLEARMSG}" = "c" ]; then
  if [ -f ExtraMessage.txt ]; then
    rm ~/ExtraMessage.txt
  fi
fi


Example 17:
-----------

# Check arguments
if [ ${#} != 3 ]
then
    log "Usage: ${0} <enviroment> <installFilesFolder> <installTarget>"
    exit 1
fi


#
if [ -z "$1" ]
then
	echo "use : build.sh PROGNAME
e.g.  build.sh  CLBDSYNC
"
	exit 1
fi


if [ "$OPSYS" = "AIX" ]||[ "$OPSYS" = "HP-UX" ]||[ "$OPSYS" = "Linux" ]
then
   ....
else
   ....


Example 18:
-----------


Read Input from User and from Files

- Read in a Variable
From a user we read with: read var. Then the users can type something in. One should first print something like: 

print -n "Enter your favorite haircolor: ";read var; print "". 

The -n suppresses the newline sign.

- Read into a File Line for Line
To get each line of a file into a variable iteratively do:

{ while read myline;do
   # process $myline
done } < filename

- To catch the output of a pipeline each line at a time in a variable use:

last | sort | {
while read myline;do
   # commands
done }


Example 19:
-----------

#!/bin/sh
# ****************************************************************************
# This script is used to start  Tomcat
# It calls the startup.sh script under $CATALINA_HOME/bin.
#
# ****************************************************************************

#JAVA_OPTS="-Xms512m -Xmx512m -XX:NewSize=128m -XX:SurvivorRatio=8 -verbosegc"
JAVA_OPTS="-Xms384m -Xmx384m"
export JAVA_OPTS
CATALINA_BASE=$SPLEBASE/tomcatBase 
export CATALINA_BASE

if [ ! -d $CATALINA_HOME/bin ]
then
   echo "Unable to find directory $CATALINA_HOME/bin"
else
   $CATALINA_HOME/bin/startup.sh
fi


Example 20:
-----------

export SCRIPTNAME=$0
export SPLQUITE=N
export SPLCOMMAND=""
export SPLENVIRON=""
export MYID=`id |cut -d'(' -f2|cut -d')' -f1`
export SPLSUBSHELL=ksh

# Current Platform and OS
export OPSYS=`uname -s`          
case $OPSYS in
        SunOS)  export OPSYSVER=`uname -r`
#               Supplied and Supported Tuxedo Version
                TUXVERS=tuxedo8.1
#               Supplied and Supported Cobol Application Server
                SPLCOBDIR=/opt/SPLcobAS40sp2
#               Supplied and Supported Java Version
		AWK=nawk
                ;;
        AIX)   export OPSYSVER=`oslevel | cut -c1-3`
               TUXVERS=tuxedo8.1
#              Supplied and Supported Cobol Application Server
               SPLCOBDIR=/opt/SPLcobAS40sp2
#              Supplied and Supported Java Version
               AWK=nawk
               ;;
	Linux) export OPSYSVER=`uname -r| cut -c1-3`
               TUXVERS=tuxedo8.1
#              Supplied and Supported Cobol Application Server
               SPLCOBDIR=/opt/SPLcobAS40sp2
               AWK=awk
	       SPLSUBSHELL=bash
               ;;
	      
        HP-UX)   export OPSYSVER=`uname -r|sed 's/^[A-Z]\.//'|cut -c1-2`
               TUXVERS=tuxedo8.1
#              Supplied and Supported Cobol Application Server
               SPLCOBDIR=/opt/SPLcobAS40sp2
#              Supplied and Supported Java Version
               AWK=awk
               ;;
esac


Example 21:
-----------

Get a certain number of columns from df -k output:

df -k |awk '{print $1,$2,$5,$8}' |grep -v "Filesystem" > /tmp/df.txt 


#!/usr/bin/ksh
for i in `df -k |awk '{print $7}' |grep -v "Filesystem"'`
do
   echo "Albert"

done


#!/usr/bin/ksh
cd ~
rm -rf /root/alert.log
echo "Important alerts in errorlog: " >> /root/alert.log
errpt | grep -i STORAGE >> /root/alert.log
errpt | grep -i QUORUM >> /root/alert.log
errpt | grep -i ADAPTER >> /root/alert.log
errpt | grep -i VOLUME >> /root/alert.log
errpt | grep -i PHYSICAL >> /root/alert.log
errpt | grep -i STALE >> /root/alert.log
errpt | grep -i DISK >> /root/alert.log
errpt | grep -i LVM >> /root/alert.log
errpt | grep -i LVD >> /root/alert.log
errpt | grep -i UNABLE >> /root/alert.log
errpt | grep -i USER >> /root/alert.log
errpt | grep -i CORRUPT >> /root/alert.log
cat /root/alert.log

if [ `cat alert.log|wc -l` -eq 1 ]
then
   echo "No critical errors found."
fi

echo " "
echo "Filesystems that might need attention, e.g. %used:"
df -k |awk '{print $4,$7}' |grep -v "Filesystem"|grep -v tmp  > /root/tmp.txt
cat /root/tmp.txt | sort -n | tail -3


Example 22:
-----------

for sid in `grep -v "^#" /etc/oratab | sed -e 's/:.*$//'`
do
echo "/data/oracle/$sid/admin/bdump/alert_$sid.log:/beheer/log/history/oracle/$sid:cat::::" >> /tmp/test
done


Notes:
------

lsnrctl>set Log_status off
mv Listener.log to listenerold.log
lsnrctl>set Log_Status on 

% cd /u01/app/oracle/product/9.2.0/network/log
% lsnrctl set log_status off
% mv listener.log listener.old
% lsnrctl set log_status on

case $IN in
start)
for dbase in `grep -v "^#" /etc/oratab | sed -e 's/:.*$//'`
do
su - $dbase -c "/beheer/oracle/cluster/orapkg.sh start"
done;;

for dbase in `grep -v "^#" /etc/oratab | sed -e 's/:.*$//'`
do
echo $dbase
done;;

for sid in `grep -v "^#" /etc/oratab | sed -e 's/:.*$//'`
do
echo "/data/oracle/$sid/admin/bdump/alert_$sid.log:/beheer/log/history/oracle/$sid:cat::::" >> /tmp/test
done


=====================
3. BOOT and Shutdown:
=====================


3.1 Shutdown systems:
=====================


3.1.1 Shutdown a Solaris system:
================================

Under Solaris, you need to use 

  init or shutdown are normally best: they run the kill scripts
  halt or reboot do not run the kill scripts properly

  /usr/sbin/shutdown -i5 -g0      -- this let the system go to the powerdown state
  /usr/sbin/shutdown -i6 -g0 -y   -- this let the system reboot
  /usr/sbin/shutdown -i0 -g0      -- shuts everything down, unmounts all fs

  shutdown -i6  (is a reboot in Solaris8)

  shutdown [-y no interactive confirmations] [-g grace period in seconds] [-i init state] [message]

- If you say init 6, or shutdown -i6, the system reboots an restart into a runstate as defined as the default
  in the inittab file.

- If you say init 0, the system cleanly shuts down, and you can power of the system
  If you say init 5, is equivalent to the poweroff command, and the system cleanly shuts down, 
  and you can power of the system


to achieve the desired effect. Be sure to read the man page for shutdown for your operating system. 
With no argument, shutdown will take the system into single user mode. 

- The /usr/sbin/reboot command: 
  is used when you want to reboot. The system does not go through the shutdown scripts. 
  Also, it usually sync's the filesystem. 
  Thus, the following is a safe bet on all Unixes: 
  sync;sync;sync;reboot

- The /usr/sbin/halt command: 
  syncs the filesystem and stops the processor. No shutdown scripts are fired up. 

- The fastboot/fasthalt command: 
  The fasthalt command halts the processor and creates a /fastboot file to tell the system to skip the 
  fsck operation upon reboot 

- The sync command: completes pending filesystem writes to disk (in other words, the buffer cache is dumped to disk). 
  Most Unix shutdown, reboot, and halt commands will do a sync. However, the reboot, fastboot, or halt commands will not 
  go through the shutdown scripts. 

If you manually sync, it is customary to do it multiple times (as we saw before). This is partly Unix superstition and 
part fact. 
The first sync is supposed to schedule a sync, not actually perform it. The second and subsequent syncs force the sync. 

sync<enter>

sync<enter>

init 0<enter>


- Shutdown scripts:
Like startup scripts, the system initialization directories (usually /etc/rcN.d) contains shutdown scripts which are fired up
by init during an orderly shutdown (i.e. when either the init command is used to change the runlevel or when the 
shutdown command is used). 
The usual convention is to use the letter K in front of a number, followed by a service name, such as K56network. 
The number determines the order in which the scripts are fired up when the system transitions into a particular run level. 


3.1.2 Shutdown an AIX system:
=============================

You can use the init, shutdown and halt commands. The shutdown command stops the system in an orderly fashion.

Bring the system from multi-user mode to maintenance mode, use
# shutdown -m

To restart the system, use
# shutdown -r

To gracefully shutdown the system, use
# shutdown

If you need a customized shutdown sequence, you can create a file called /etc/rc.shutdown.
If this file exists, it is called by the shutdown command and is executed first.
This can be usefull for example, if you need to close a database prior to a shutdown.
If rc.shutdown fails (non zero return code value), the shutdown cycle is terminated.

Example rc.shutdown:
--------------------

#cat /etc/rc.shutdown

#!/bin/ksh


# stop Control-SA/Agent
/etc/rc.ctsa stop
/etc/rc.mwa stop
/etc/rc.opc stop

# Stop TSM dsmcad en scheduler
/etc/rc.dsm stop


# Stop TSCM client
/opt/IBM/SCM/client/jacclient stop

# /etc/rc.shutdown SHOULD always end with a # Stop db2 instances as last line
/etc/rc.ihs stop
/etc/rc.ihs stop des
/etc/rc.appserver stop PRM1DES
/etc/rc.nodeagent stop
/etc/rc.dmgr stop
# Stop db2 instances
/etc/rc.db2_udb stop all

/etc/rc.directoryserver stop
 #Stop the Tivoli Enterprise Console Logfile Adapter
if [ -f /beheer/Tivoli/lcf/bin/aix4-r1/TME/TEC/adapters/bin/init.tecad_logfile ]; then
   /beheer/Tivoli/lcf/bin/aix4-r1/TME/TEC/adapters/bin/init.tecad_logfile stop aix-default >/dev/null 2>&1
   echo "Tivoli Enterprise Console Logfile Adapter stopped."
fi
exit 0


3.1.3 Shutdown a Linux system:
==============================

Note 1: Redhat systems:

To shut down Red Hat Linux, issue the shutdown command. You can read the shutdown man page for complete details, 
but the two most common uses are: 

/sbin/shutdown -h now
/sbin/shutdown -r now
 
You must run shutdown as root. After shutting everything down, the -h option will halt the machine, 
and the -r option will reboot. 

Non-root users can use the reboot and halt commands to shutdown the system while in runlevels 1 through 5. 
However, not all Linux operating systems support this feature. 

If your computer does not power itself down, be careful not turn off the computer until you see a message indicating 
that the system is halted. 


3.2 Booting:
============


3.2.1 Generic description booting a unix system:
================================================
 
1. Or some rom menu, or some command prompt, or automatic procedure,
provides for finding or selecting initial bootstraps code
on some device
 
2. kernel is loaded (unix, vmunix etc.. in /, or /boot, or /kernel, or some place else)
3. start init
4. run scripts

SunOs:    /vmunix
Solaris8 = SunOs 5.8: /kernel/unix
AIX: /unix


3.2.2 Bootprocedure Solaris:
============================


1. Boot overview
----------------

- Openboot PROM: 

  After a dignostice fase, it shows either 
    - the "ok" prompt or 
    - via Openboot parameter "auto_boot" goes on with the bootprocedure.
      Thanks to the Openboot parameter "boot_device" the path to the bootblk is known.

  You can also use the
  "ok boot alias/physical_device_name" to boot from the specified device, like for example
  "ok boot disk3"

  You can also use options as 
  "ok boot -a=interactive boot, -r=reconfiguration boot, -s=boot to single-user state, -v=verbose mode"

  The command "ok boot disk5 kernel/unix -s", the PROM will look for the primary bootprogram bootblk
  on the alias disk5, which could be a physical device as /iommu/sbus/espdma@f,400000/esp@f,800000/sd@0,0
  The primary startup command will then load "ufsboot". This will then load the kernel as specified.

  Thus, after the simple boot command, the boot process goes on in the following manner:

- bootblk  (from the default boot-device)
- bootblk find ufsboot (plus filesystem drivers)
- ufsboot starts the kernel, its merges genunix and unix (mounts the root fs)
- kernel mounts other fs
- start sched (swapper)
- start /sbin/init 
- init starts demons in rc scripts

You can also view the startup information via:

$ more /var/adm/messages
$ /usr/sbin/dmesg 

2. Booting with Other system file:
----------------------------------

1. login as root

2. create a backup copy of the /etc/system file
   cp /etc/system /etc/system.orig

3. Halt the system
   /usr/sbin/shutdown -y -g0 -i0

4. at the OK prompt, boot the system using the interactive option
   OK boot -a

5. You will be prompted to enter a filename for the kernel, and a default
   directory for modules. Enter a return for each of these questions.
   When prompted to use the default /etc/system file:

   Name of system file [etc/system]:

   enter the following:

   /etc/system.orig 


3. More about init:

init uses the /etc/inittab File
When you boot the system or change run levels with the init or shutdown command, the init daemon starts processes 
by reading information from the /etc/inittab file. This file defines three important items for the init process: 

-The system's default run level
-What processes to start, monitor, and restart if they terminate
-What actions to be taken when the system enters a new run level

Each entry in the /etc/inittab file has the following fields:

id:rstate:action:process


in /etc we find the links:
rc0 -> /sbin/rco
rc1 -> /sbin/rc1
rc2 -> /sbin/rc2
rc3 -> /sbin/rc3
rc4 -> /sbin/rc4
rc5 -> /sbin/rc5
rc6 -> /sbin/rc6

in /sbin we find the scripts rc0 - rc6, rcS. These are not links, but true shell scripts.
In /etc  we find the links   rc0 - rc6, rcS.

In /etc we find the (true) directories /etc/rc#.d.
So suppose the runlevel=3

1. init reads inittab

Contents /etc/inittab

\u@\h[\w]> more inittab
ap::sysinit:/sbin/autopush -f /etc/iu.ap
ap::sysinit:/sbin/soconfig -f /etc/sock2path
fs::sysinit:/sbin/rcS                   >/dev/console 2<>/dev/console </dev/console
is:3:initdefault:
p3:s1234:powerfail:/usr/sbin/shutdown -y -i5 -g0 >/dev/console 2<>/dev/console
s0:0:wait:/sbin/rc0                     >/dev/console 2<>/dev/console </dev/console
s1:1:wait:/usr/sbin/shutdown -y -iS -g0 >/dev/console 2<>/dev/console </dev/console
s2:23:wait:/sbin/rc2                    >/dev/console 2<>/dev/console </dev/console
s3:3:wait:/sbin/rc3                     >/dev/console 2<>/dev/console </dev/console
s5:5:wait:/sbin/rc5                     >/dev/console 2<>/dev/console </dev/console
s6:6:wait:/sbin/rc6                     >/dev/console 2<>/dev/console </dev/console
fw:0:wait:/sbin/uadmin 2 0              >/dev/console 2<>/dev/console </dev/console
of:5:wait:/sbin/uadmin 2 6              >/dev/console 2<>/dev/console </dev/console
rb:6:wait:/sbin/uadmin 2 1              >/dev/console 2<>/dev/console </dev/console
sc:234:respawn:/usr/lib/saf/sac -t 300
co:234:respawn:/usr/lib/saf/ttymon -g -h -p "`uname -n` console login: " -T sun
-d /dev/console -l console -m ldterm,ttcompat


2. init knows the runlevel, default it's 3


For each rc script in the /sbin directory, there is a corresponding directory named /etc/rcn.d that contains 
scripts to perform various actions for that run level. 
For example, /etc/rc2.d contains files used to start and stop processes for run level 2.


# ls /etc/rc2.d
K20spc@             S70uucp*            S80lp*
K60nfs.server*      S71rpc*             S80spc@
K76snmpdx*          S71sysid.sys*       S85power*
K77dmi*             S72autoinstall*     S88sendmail*
README              S72inetsvc*         S88utmpd*
S01MOUNTFSYS*       S73nfs.client*      S89bdconfig@
S05RMTMPFILES*      S74autofs*          S91leoconfig*
S20sysetup*         S74syslog*          S92rtvc-config*
S21perf*            S74xntpd*           S92volmgt*
S30sysid.net*       S75cron*            S93cacheos.finish*
S47asppp*           S76nscd*            S99audit*
S69inet*            S80PRESERVE*        S99dtlogin*
 

The /etc/rcn.d scripts are always run in ASCII sort order. The scripts have names of the form:

[K,S][0-9][0-9][A-Z][0-99]

Files beginning with K are run to terminate (kill) a system process. Files beginning with S are run to start a system process.

Run control scripts are also located in the /etc/init.d directory. 
These files are linked to corresponding run control scripts in the /etc/rc*.d directories.
One advantage of having individual scripts for each run level is that you can run scripts in the /etc/init.d directory individually 
to turn off functionality without changing a system's run level.

- Start and Stop of an individual process

The advantage to have individual scripts, is that you can stop or start individual processes
by running such a script, without rebooting or changing the run level.


Turn off functionality. 
# /etc/init.d/filename stop
 
Restart functionality
# /etc/init.d/filename start

For example, if you want to restart the NFS server, you can do the following:

# /etc/init.d/nfs.server stop
# /etc/init.d/nfs.server start


Use the ps and grep commands to verify whether the service has been stopped or started.
# ps -ef | grep service 


  Adding a Run Control Script:
  ----------------------------

  All scripts are in /etc/init.d
  You create the neccesary links in the corresponding /etc/rcn.d directory

  The /sbin/rcN scripts run the /etc/rcN.d scripts

  If you want to add a run control script to start and stop a service, 
  copy the script into the /etc/init.d directory and create links in the rc*.d 
  directory you want the service to start and stop. 
  See the README file in each /etc/rc*.d directory for more information on naming run control scripts. 
  The procedure below describes how to add a run control script.

  How to Add a Run Control Script
  Become superuser.

  Add the script to the /etc/init.d directory. 

  # cp filename /etc/init.d 
  # chmod 744 /etc/init.d/filename
  # chown root:sys /etc/init.d/filename

  Create links to the appropriate rc*.d directory.

  # cd /etc/init.d
  # ln filename /etc/rc2.d/Snnfilename
  # ln filename /etc/rcn.d/Knnfilename 

  (or 
  cd /etc/rc2d
  ln /etc/init.d/filename S22filename
  )

  Use the ls command to verify that the script has links in the specified directories.

  # ls /etc/init.d/ /etc/rc2.d/ /etc/rcn.d/
 
  Example-Adding a Run Control Script

  # cp xyz /etc/init.d
  # cd /etc/init.d
  # ln xyz /etc/rc2.d/S100xyz
  # ln xyz /etc/rc0.d/K100xyz
  # ls /etc/init.d /etc/rc2.d /etc/rc0.d


#!/bin/ksh

# name: spl
# purpose: script that will start or stop the spl stuff.


case "$1" in
start )
        echo "starting spl"
        echo "su - ccbsys -c '/prj/spl/SPLS3/bin/splenviron.sh -e SPLS3 -c "spl.sh -t start"'"
        su - ccbsys -c '/prj/spl/SPLS3/bin/splenviron.sh -e SPLS3 -c "spl.sh -t start"'
        ;;
stop )
        echo "stopping spl"
        echo "su - ccbsys -c '/prj/spl/SPLS3/bin/splenviron.sh -e SPLS3 -c "spl.sh -t stop"'"
        su - ccbsys -c '/prj/spl/SPLS3/bin/splenviron.sh -e SPLS3 -c "spl.sh -t stop"'
        ;;
* )
        echo "Usage: $0 (start | stop)"
        exit 1
esac


3. /sbin/rc3 script will be started


Contents /sbin/rc3

#!/sbin/sh
#       Copyright (c) 1984, 1986, 1987, 1988, 1989 AT&T
#         All Rights Reserved

#       THIS IS UNPUBLISHED PROPRIETARY SOURCE CODE OF AT&T
#       The copyright notice above does not evidence any
#       actual or intended publication of such source code.

#ident  "@(#)rc3.sh     1.12    94/12/19 SMI"   SVr4.0 1.11.2.2
#       "Run Commands" executed when the system is changing to init state 3,
#       same as state 2 (multi-user) but with remote file sharing.

PATH=/usr/sbin:/usr/bin
set `/usr/bin/who -r`
if [ -d /etc/rc3.d ]
   then
        for f in /etc/rc3.d/K*
        {
                if [ -s ${f} ]
                then
                        case ${f} in
                                *.sh)   .        ${f} ;;        # source it
                                *)      /sbin/sh ${f} stop ;;   # sub shell
                        esac
                fi
        }

for f in /etc/rc3.d/S*
        {
                if [ -s ${f} ]
                then
                        case ${f} in
                                *.sh)   .        ${f} ;;        # source it
                                *)      /sbin/sh ${f} start ;;  # sub shell
                        esac
                fi
        }
fi
 
modunload -i 0 & > /dev/null 2>&1

if [ $9 = 'S' -o $9 = '1' ]
then
  echo 'The system is ready.'
fi
       

4. From /sbin/rc3 all K* and S* scripts in /etc/rc3.d will be run

Oracle is installed on this host,  so there should be a /etc/rc3.d/S99oracle or similar
script. Now there indeed exists the S88dbora script.


5. There is an S88dbora script, so it will be called:

Oracle S88dbora script in /etc/rc3.d


Example:
--------

mt -f /dev/rm  rewind
tar -xvf /dev/rmt1.1  fielname
mt -f /dev/rmt0.1 fsf 2 (voor drie) (daarna staat tapepointer op begin 4)

fsf bsf

\u@\h[\w]> more S88dbora
#!/bin/sh

# 
# Startup for Oracle Databases
#

ORACLE_HOME=/opt/oracle/product/8.0.6
ORACLE_OWNER=oracle

if [ ! -f $ORACLE_HOME/bin/dbstart ] ;then
  echo "Oracle startup: cannot start"
  exit
fi

case "$1" in
  'start')
        # Start the Oracle databases
        su - $ORACLE_OWNER -c "$ORACLE_HOME/bin/dbstart" > /dev/null 2>&1
        su - $ORACLE_OWNER -c "$ORACLE_HOME/bin/lsnrctl start" > /dev/null 2>&1
        su - $ORACLE_OWNER -c "$ORACLE_HOME/bin/lsnrctl dbsnmp_start" > /dev/null 2>&1
        ;;

  'stop')
        # Stop the Oracle databases
        su - $ORACLE_OWNER -c "$ORACLE_HOME/bin/lsnrctl dbsnmp_stop" > /dev/null
 2>&1
        su - $ORACLE_OWNER -c "$ORACLE_HOME/bin/lsnrctl stop" > /dev/null 2>&1
        su - $ORACLE_OWNER -c "$ORACLE_HOME/bin/dbshut" > /dev/null 2>&1
        ;;

  *)
        echo "Usage: $0 { start | stop }"
        ;;
esac

Another example:
----------------

more /etc/init.d/dbora

nlih30207858-08:/etc/init.d $ more dbora
# Set ORA_HOME to be equivalent to the ORACLE_HOME
# from which you wish to execute dbstart and
# dbshut
# set ORA_OWNER to the user id of the owner of the
# Oracle database in ORA_HOME
ORA_HOME=/u01/app/oracle/product/9.2
ORA_OWNER=oraclown
if [ ! -f $ORA_HOME/bin/dbstart -o ! -d $ORA_HOME ]
then
echo "Oracle startup: cannot start"
exit
fi
case "$1" in
'start')
# Start the Oracle databases and listener:
su - $ORA_OWNER -c "$ORA_HOME/bin/lsnrctl start" &
su - $ORA_OWNER -c $ORA_HOME/bin/dbstart &
;;
'stop')
# Stop the Oracle databases and listener:
su - $ORA_OWNER -c $ORA_HOME/bin/lsnrctl stop &
su - $ORA_OWNER -c $ORA_HOME/bin/dbshut &
;;
esac


6. To start the database(s) en listener(s), the dbstart script is run:

\u@\h[\w]> more dbstart
:
#
# $Header: dbstart.sh.pp 1.1 95/02/22 14:37:29 rdhoopar Osd<unix> $ dbstart.sh.p
p Copyr (c) 1991 Oracle
#

###################################
#
# usage: dbstart
#
# This script is used to start ORACLE from /etc/rc(.local).
# It should ONLY be executed as part of the system boot procedure.
#
#####################################

ORATAB=/var/opt/oracle/oratab

trap 'exit' 1 2 3
case $ORACLE_TRACE in
    T)  set -x ;;
esac

# Set path if path not set (if called from /etc/rc)
case $PATH in
    "") PATH=/bin:/usr/bin:/etc
        export PATH ;;
esac

#
# Loop for every entry in oratab file and and try to start
# that ORACLE
#

cat $ORATAB | while read LINE
do
    case $LINE in
        \#*)            ;;      #comment-line in oratab
        *)
#       Proceed only if third field is 'Y'.
        if [ "`echo $LINE | awk -F: '{print $3}' -`" = "Y" ] ; then
            ORACLE_SID=`echo $LINE | awk -F: '{print $1}' -`
            if [ "$ORACLE_SID" = '*' ] ; then
                ORACLE_SID=""
            fi
#           Called programs use same database ID
            export ORACLE_SID
            ORACLE_HOME=`echo $LINE | awk -F: '{print $2}' -`
#           Called scripts use same home directory
            export ORACLE_HOME
#           Put $ORACLE_HOME/bin into PATH and export.
            PATH=$ORACLE_HOME/bin:/bin:/usr/bin:/etc ; export PATH

            PFILE=${ORACLE_HOME}/dbs/init${ORACLE_SID}.ora

#       Figure out if this is a V5, V6, or V7 database. Do we really need V5?
            if [ -f $ORACLE_HOME/bin/sqldba ] ; then
                VERSION=`$ORACLE_HOME/bin/sqldba command=exit | awk '
                        /SQL\*DBA: (Release|Version)/ {split($3, V, ".") ;
                        print V[1]}'`
            else
                if test -f $ORACLE_HOME/bin/svrmgrl; then
                        VERSION="7.3"

                else
                        VERSION="5"
                fi
            fi

           if test  -f $ORACLE_HOME/dbs/sgadef${ORACLE_SID}.dbf  -o \
                     -f $ORACLE_HOME/dbs/sgadef${ORACLE_SID}.ora
            then
                STATUS="-1"
            else
                STATUS=1
            fi
            case $STATUS in
                1)  if [ -f $PFILE ] ; then
                        case $VERSION in
                            5)  ior w pfile=$PFILE
                                ;;

                            6)  sqldba command=startup
                                ;;

                            7)  sqldba <<EOF
connect internal
startup
EOF
                                ;;

                           7.3) svrmgrl <<EOF
connect internal
startup
EOF
                                ;;
                        esac

                        if test $? -eq 0 ; then
                            echo ""
                            echo "Database \"${ORACLE_SID}\" warm started."
                        else
                            echo ""
                            echo "Database \"${ORACLE_SID}\" NOT started."
                        fi
                    else
                        echo ""
                        echo "Can't find init file for Database \"${ORACLE_SID}\
"."
                        echo "Database \"${ORACLE_SID}\" NOT started."
                    fi
                    ;;

                -1) echo ""
                    echo "Database \"${ORACLE_SID}\" possibly left running when
system went down (system crash?)."
                    echo "Notify Database Administrator."
                    case $VERSION in
                        5)  ior c
                            ;;

                        6)  sqldba "command=shutdown abort"
                            ;;

                        7)  sqldba <<EOF
connect internal
shutdown abort
EOF
                            ;;

                      7.3)  svrmgrl <<EOF
connect internal
shutdown abort
EOF

                           ;;
                    esac

                    if test $? -eq 0 ; then
                        if [ -f $PFILE ] ; then
                            case $VERSION in
                                5)  ior w pfile=$PFILE
                                    ;;

                                6)  sqldba command=startup
                                    ;;

                                7)  sqldba <<EOF
connect internal
startup
EOF
                                    ;;
                              7.3)  svrmgrl <<EOF
connect internal
startup
EOF
                                    ;;
                            esac
                            if test $? -eq 0 ; then
                                echo ""
                                echo "Database \"${ORACLE_SID}\" warm started."
                            else
                                echo ""
                                echo "Database \"${ORACLE_SID}\" NOT started."
                            fi
                        else
                            echo ""
                            echo "Can't find init file for Database \"${ORACLE_S
ID}\"."
                            echo "Database \"${ORACLE_SID}\" NOT started."
                        fi
                    else
                        echo "Database \"${ORACLE_SID}\" NOT started."
                    fi
                    ;;
            esac
        fi
        ;;
    esac
done


environment oracle user

DBPASSWORD=abc
DBPASSWORDFE=mrx
DBUSER=xyz
DBUSERFE=mry
EDITOR=vi
HOME=/opt/home/oracle
HZ=100
INPUTRC=/usr/local/etc/inputrc
LD_LIBRARY_PATH=/opt/oracle/product/8.0.6/lib
LESSCHARSET=latin1
LOG=/var/opt/oracle
LOGNAME=oracle
MANPATH=/usr/share/man:/usr/openwin/share/man:/usr/opt/SUNWmd/man:/opt/SUNWsymon
/man:/opt/SUNWswusg/man:/opt/SUNWadm/2.2/man:/opt/local/man
NLS_LANG=american_america.we8iso8859p1
OPENWINHOME=/usr/openwin
ORACLE_BASE=/opt/oracle
ORACLE_HOME=/opt/oracle/product/8.0.6
ORACLE_SID=ORCL
ORA_NLS33=/opt/oracle/product/8.0.6/ocommon/nls/admin/data
PATH=/usr/bin:/sbin:/usr/sbin:/usr/local/bin:/usr/ucb:/usr/openwin/bin:/opt/orac
le/product/8.0.6/bin
PROGRAMS=/opt/local/bin/oracle
PS1=\u@\h[\w]>
SHELL=/sbin/sh
TERM=vt100
TZ=MET
\u@\h[\w]>


3.2.3 Bootprocedure AIX 5.x: 
============================

http://publib.boulder.ibm.com/infocenter/pseries/index.jsp?topic=/com.ibm.aix.doc/aixbman/admnconc/under_sys.htm


Understanding the Boot Process
During the boot process, the system tests the hardware, loads and runs the operating system, 
and configures devices. To boot the operating system, the following resources are required:

. A boot image that can be loaded after the machine is turned on or reset. 
. Access to the root (/) and /usr file systems.

There are three types of system boots:

Hard Disk Boot           A machine is started for normal operations. For more information, 
                         see Understanding System Boot Processing. 
Diskless Network Boot    A diskless or dataless workstation is started remotely over a network. 
                         A machine is started for normal operations. One or more remote file servers provide the files 
                         and programs that diskless or dataless workstations need to boot. 
Maintenance Boot         A machine is started from a hard disk, network, tape, or CD-ROM in maintenance mode. 
                         A system administrator can perform tasks such as installing new or updated software and running 
                         diagnostic checks. For more information, see Understanding the Maintenance Boot Process. 

During a hard disk boot, the boot image is found on a local disk created when the operating system was installed. 
During the boot process, the system configures all devices found in the machine and initializes other basic software 
required for the system to operate (such as the Logical Volume Manager). At the end of this process, 
the file systems are mounted and ready for use. For more information about the file system used during boot processing, 
see Understanding the RAM File System.

The same general requirements apply to diskless network clients. They also require a boot image and access 
to the operating system file tree. Diskless network clients have no local file systems and get all their 
information by way of remote access.


Understanding System Boot Processing:

Most users perform a hard disk boot when starting the system for general operations. 
The system finds all information necessary to the boot process on its disk drive.

When the system is started by turning on the power switch (a cold boot) or restarted with the 
reboot or shutdown commands (a warm boot), a number of events must occur before the system is ready for use. 
These events can be divided into the following phases:

. Read Only Storage (ROS) Kernel Init Phase 
. Base Device Configuration Phase 
. Maintenance Boot Phase.

-- ROS Kernel Init Phase:

The ROS kernel resides in firmware. Its initialization phase involves the following steps:

1. The firmware checks to see if there are any problems with the system motherboard. Control is passed to ROS, 
which performs a power-on self-test (POST). 

2. The ROS initial program load (IPL) checks the user bootlist, a list of available boot devices. 
This boot list can be altered to suit your requirements using the bootlist command. If the user boot list 
in non-volatile random access memory (NVRAM) is not valid or if a valid boot device is not found, 
the default boot list is then checked. In either case, the first valid boot device found in the boot list 
is used for system startup. If a valid user boot list exists in NVRAM, the devices in the list are checked in order. 
If no user boot list exists, all adapters and devices on the bus are checked. In either case, devices are checked 
in a continuous loop until a valid boot device is found for system startup. 

Note:
The system maintains a default boot list located in ROS and a user boot list stored in NVRAM, 
for a normal boot. Separate default and user boot lists are also maintained for booting from the Service key position.

3. When a valid boot device is found, the first record or program sector number (PSN) is checked. 
If it is a valid boot record, it is read into memory and is added to the IPL control block in memory. 
Included in the key boot record data are the starting location of the boot image on the boot device, 
the length of the boot image, and instructions on where to load the boot image in memory. 

4. The boot image is read sequentially from the boot device into memory starting at the location 
specified in NVRAM. The disk boot image consists of the kernel, a RAM file system, and base customized 
device information (customized reduced ODM). 

5. Control is passed to the kernel, which begins system initialization. 

6. The kernel runs init, which runs phase 1 of the "/sbin/rc.boot" script.
When the kernel initialization phase is completed, base device configuration begins.


-- Base Device Configuration Phase:

The init process starts the rc.boot script. Phase 1 of the rc.boot script performs the base device configuration, 
and it includes the following steps:

. The boot script calls the restbase program to build the customized Object Data Manager (ODM) database 
  in the RAM file system from the compressed customized data. 
. The boot script starts the configuration manager, which accesses phase 1 ODM configuration rules to configure 
  the base devices. 
. The configuration manager starts the sys, bus, disk, SCSI, and the Logical Volume Manager (LVM) and 
  rootvg volume group configuration methods. 
. The configuration methods load the device drivers, create special files, and update the customized data 
  in the ODM database.

-- System Boot Phase:

The System Boot Phase involved the following steps:

The init process starts phase 2 running of the rc.boot script. Phase 2 of rc.boot includes the following steps: 
.Call the ipl_varyon program to vary on the rootvg volume group. 
.Mount the hard disk file systems onto their normal mount points. 
.Run the swapon program to start paging. 
.Copy the customized data from the ODM database in the RAM file system to the ODM database in the hard disk file system. 
.Exit the rc.boot script.

- After phase 2 of rc.boot, the boot process switches from the RAM file system to the hard disk root file system. 
- Then the init process runs the processes defined by records in the /etc/inittab file. 
One of the instructions in the /etc/inittab file runs phase 3 of the rc.boot script, 

  cat /etc/inittab | grep rc.boot
  brc::sysinit:/sbin/rc.boot 3 >/dev/console 2>&1 # Phase 3 of system boot

which includes the following steps: 
.Mount the /tmp hard disk file system. 
.Start the configuration manager phase 2 to configure all remaining devices. 
.Use the savebase command to save the customized data to the boot logical volume 
.Exit the rc.boot script.

At the end of this process, the system is up and ready for use.

- To display the current runlevel:

# who -r
# cat /etc/.init.state

- To display a history of previous runlevels:

# /usr/lib/acct/fwtmp < /var/adm/wtmp | grep run-level

- To change the runlevel:

# telinit m    # m in 0-9,   S,s,M,m,   a,b,c,    Q,q

S,s,M,m: maintenance mode
0,1    : reserved
2      : normal
3-9    : user defined
Q,q    : reparse inittab file


The telinit command directs the actions of the init process by taking a one character parameter
and signaling the init process to perform the appropriate action. 
So the telinit command sets the system at a specific runlevel.


Some important PCI systems an pSeries LED codes:
------------------------------------------------


hd4= /, hd5=boot, hd6=paging, hd2=/usr, hd3=/tmp, hd9var=/var
/etc is in "/", so the ODM database is in "/".

Describe LED codes (121, 223, 229, 551, 552, 553, 581, OC31, OC32) 


-reduced ODM from BLV copied into RAMFS: OK=510, NOT OK=LED 548: 
-LED 511: bootinfo -b is called to determine the last bootdevice
-ipl_varyon of rootvg: OK=517,ELSE 551,552,554,556: 
-LED 555,557: mount /dev/hd4 on temporary mountpoint /mnt
-LED 518: mount /usr, /var
-LED 553: syncvg rootvg, or inittab problem
-LED 549
-LED 581: tcp/ip is being configured, and there is some problem

Last phases in the boot is where cfgcon is called, to configure the console.
cfgcon LED codes include:
C31: Console not yet configured.
C32: Console is an LFT terminal
C33: Console is a TTY
C34: Console is a file on disk
C99: Could not detect a console device

LED 551: ipl_varyon of rootvg

201           : Damaged boot image
223-229       : Invalid boot list
551,555,557   : Corrupted filesystem, corrupted JFS log
552,554,556   : Superblock corrupted, corrupted customized ODM database
553           : Corrupted /etc/inittab file


More Detail on LED codes:
-------------------------


105
CPU planar board is not securely seated in the adapter slot on the microchannel bus.


--------------------------------------------------------------------------------

200 
Key is in SECURE mode and the system will NOT boot until the key is turned to either 
NORMAL or SERVICE mode.


--------------------------------------------------------------------------------

201 
LV hd5 (boot logical volume) has been corrupted. To correct this situation, perform the following: 

. Boot system in service mode. Either boot the system from boot diskettes or boot tape OF THE SAME VERSION AND 
LEVEL AS THE SYSTEM. 
. To perform system maintenance functions from the INSTALL and MAINTENANCE menu, enter the following command, 
where hdisk0 is the drive that contains the boot logical volume (/blv) 
/usr/sbin/getrootfs hdisk0 

. From maintenance mode make sure /tmp has at least enough free disk space to create the tape image when 
the 'bosboot' command is executed. 
. Make sure /dev/hd6 is swapped on via the lsps -a command. 
You don't want to get 'paging space low' messages when creating a new boot image on /dev/hd5. Recreate 
a new boot image by executing the command: 
bosboot -a -d /dev/hdisk0 
Turn key to normal mode 
shutdown -Fr 


--------------------------------------------------------------------------------
221 

The NVRAM is potentially corrupted. To correct this situtation, perform the following steps:

Boot system in service mode. Either boot the system from boot diskettes or boot tape 
Select option to perform system maintenance functions from the INSTALL and MAINTENANCE menu. 
Enter the following command: 
/usr/sbin/getrootfs hdisk0 from maintenance mode 
Enter the command 
bootlist -m normal hdisk0 or whatever your boot drive name is (eg., hdisk1) 
shutdown -Fr 
If the above method fails, try the following:

Shutdown your machine and unplug your system battery before you power up. 
Wait 30 minutes for battery to drain. 
Reconnect battery. 
Once you power up and a 221 is displayed on your LED 
flip the key to service mode then back to normal mode 
plug in system battery 
Once this is done, the NVRAM should return to normal.


--------------------------------------------------------------------------------

223/229 
Cannot boot in normal mode from any of the devices listed in the NVRAM bootlist.

Typically the cause of this problem is the machine has just been moved and the SCSI adapter card is not 
firmly seated in the adapter slot on the microchannel bus. Make sure the card is seated properly and all 
internal and external SCSI connectors are firmly attached. 
Another possibility is that a NEW SCSI device has been added to the system and there are two or more devices 
with the same SCSI ID. 


--------------------------------------------------------------------------------

233 
Attempting to IPL from devices specified in NVRAM device list. If diagnostics indicate a bad drive is 
suspected, BEFORE replacing the physical volume, replace the LOGIC ASSEMBLY on the drive housing first. 
Saves time in retrying to rebuild a system especially if full backups haven't been made recently.


--------------------------------------------------------------------------------

552
BAD ERROR. The VG rootvg could not be varied on. Most likely scenario is that the VGDA on the default 
boot drive (hdisk0) got hammered/corrupted. To resolve this problem, try the following:

1) Boot system in service mode. Either boot the system from boot diskettes or boot tape
2) Select option to perform system maintenance functions from the INSTALL and MAINTENANCE menu.
3) Enter the following command:/usr/sbin/getrootfs hdisk0 from maintenance mode. If there are at least two PVs in the VG rootvg, if one fails to work with this command, try any of the remaining PVs (eg, /etc/continue hdisk0 or /etc/continue hdisk1)
4) If the importvg command fails, as should the varyonvg command, then perform the following from the command line:

exportvg <VG_NAME> EXAMPLE: exportvg vg2 removes LV references from ODM but wont write any info to VGDA
importvg -y <VG_NAME> <PV_NAME> EXAMPLE: importvg -y vg2 hdisk1  restores ODM database from information read from VGDA
varyonvg -m1 <VG_NAME>  EXAMPLE: varyonvg vg2 This command will INSURE that the ODM database MATCHES 
the characteristics stored in the VGDA (syncs VGDA to ODM)

5) If no error messages are reported by importvg or varyonvg, then goto step '11'
6) Execute the command: mount
7) If /dev/ram0 is the only mounted filesystem, try the following script entered interactively from the command line: EXAMPLE: for VG rootvg - if it fails to varyon

for i in hd2 hd3 hd4
    do 
        synclvodm rootvg $i
        if [ "$?" -eq 0 ]; then
            fsck -fp /dev/$i
        fi
    done

8) If there are no error messages from the synclvodm command or the fsck command, then mount the following 
file systems:

mount /dev/hd3 /tmp
mount /dev/hd2 /usr
mount /dev/hd4 /mnt

9) If there are no error messages from these mount commands, then goto step '11'
10) If the previous step fails or the log redo process fails or indicates any filesystems with an 
unknown log device, then do the following 2 steps:    

/etc/aix/logform /dev/hd8 ( Answer 'y' to the "Destroy /dev/hd8 (y)?" prompt )
LogForm will reformat the log logical volume. The next IPL will take a little longer.

11) Turn key to normal mode
12) Shutdown the system via the command shutdown -Fr. If this doesn't appear to be working, type the following at the command line:

    sync; sync;
    halt

13) If the problem still persists, consult your local SE before you attempt to RE-INSTALL your system.


--------------------------------------------------------------------------------

553 
Your /etc/inittab file has been corrupted or truncated. To correct this situation, perform the following:

boot system in service mode. Either boot the system from boot diskettes or boot tape select option 5 
(perform system maintenance) from the INSTALL and MAINTENANCE menu. 
Enter the command /etc/continue hdisk0 from maintenance mode. 
Check to see that you have free space on those file systems that are mounted on logical volumes /dev/hd3 and /dev/hd4. 
If they are full, erase files that aren't needed. 
Some space needs to be free on these logical volumes for the system to boot properly. 
Check to see if the /etc/inittab file looks ok. If not, goto the next step, else consult your local SE 
for further advice. 
Place the MOST recent 'mksysb' tape into the tape drive. If you don't have a 'mksysb' tape, get your 
INSTALL/MAINT floppy and insert into your diskette drive. 
Extract the /etc/inittab file from the media device mentioned. 
Change directories to root (eg., cd /) first, then execute the following command: 
restore -xvf/dev/fd0 ./etc/inittab - if a floppy disk 
restore -xvf/dev/rmt0 ./etc/inittab - if a tape device 
This will restore the contents of the /etc/inittab file to a reasonable format to boot the system up with. 
Depending on how current the /etc/inittab file is, you may have to manually add, modify, or delete the 
contents of this file. 

shutdown -Fr 


--------------------------------------------------------------------------------

581 
This LED is displayed when the /etc/rc.net script is executed.

Verify this script is correct or if modifications have been made since the system was last rebooted. 
Any errors logged during the execution of this script are sent to the /tmp/rc.net.out file. 
top of page


--------------------------------------------------------------------------------

727
Printer port is being configured BUT there is NO cable connected to the configured port on the 16-port 
concentrator OR the RJ-45 cable from the concentrator back to the 64-port card isn't connected.

Either remove the printer in question from the ODM database (eg., rmdev -l lp0 -d) OR 
Reconnect the printer cable back to the port on the 16-port concentrator OR 
Re-connect the 16-port concentrator back to the 64-port adapter card. 
To determine WHICH concentrator box that printer is connected to

Count the number of 727s displayed on the LED 
Subtract two (first two 727s deal with the native serial ports). 
For example, if the LED count is 17 (minus the two for the native ports), then the second concentrator 
box is the problem. 


--------------------------------------------------------------------------------

869 
Most likely scenario is that you have two or more SCSI devices with the same SCSI id on one SCSI controller. 
To correct this situation...

Change one of the conflicting SCSI devices to use an UNUSED SCSI address (0-7). 
If this case fails, RESET your SCSI adapter(s). 

--------------------------------------------------------------------------------


Steps Required to Obtain a System Dump
If your CONSOLE device is still operational, perform the following steps:

sysdumpdev -l This will determine which device has been assigned as the primary and secondary dump devices 
sysdumpstart -p (initiate dump to primary device) 
sysdumpstart -s (initiate dump to secondary device) 
sysdumpdev -z (indicates if a NEW dump exists) 
sysdumpdev -L (indicates info about a previous dump) 
Press keyboard sequence: CTRL-ALT-NUMPAD1 (for primary device) 
Press keyboard sequence: CTRL-ALT-NUMPAD2 (for secondary device)

Insert a tape in the tape device you wish to dump the kernel data to /usr/sbin/snap -gfkD -o /dev/rmt0

If your system is hung, the user MUST initiate or force a dump of the kernel data via the following:

Turn the Key Mode Switch to the SERVICE position 
Press the RESET button 


Other remarks AIX 5.x bootprocess:
======================================


ROS IPL (Read Only Storage Initial Program Load). This phase includes a power-on selftest, the location
of the bootdevice, and loading of the boot kernel into memory.

At boottime,once the POST is completed, the system will search the boot list for a
bootable image. The system will attempt to boot from the first entry in the bootlist.
Pressing the F5 key (or 5) during boot, will invoke the service bootlist, which includes
the CDROM.

  Note: If you want to install AIX on a machine, insert the product media, start the machine,
        press the F5 key (or 5) to let it boot from CD, then press 1 (graphic display) or
        2 (ascii terminal) to define your terminal as the Console

In normal operation of AIX, to view the normal boot list, use
# bootlist -m normal -o

fd0
cd0
hdisk0

The bootlist can be changed using the same command, for example
# bootlist -m normal hdisk0 cd0

To see or trace the bootprocess, use the alog command.

Because no console is available during the bootphase, the boot messages are collected
in a special file, which by default is /var/adm/ras/bootlog.

To view the boot log, use
# alog -o -t boot

To record the current date and time in alog file named /tmp/mylog, enter
# date | alog -f /tmp/mylog

To see the list the logs defined in the alog database, run
# alog -L

AIX uses the default runlevel 2. This is the normal multi-user mode.
Runlevels 0,1 are reserved, 2 is normal, and 3-9 are configurable by the Administrator.

At a certain stage, /etc/init is started, and invokes
/sbin/rc.boot 3, and runs the entries in /etc/inittab.


Example of an AIX /etc/inittab file:
------------------------------------

init:2:initdefault:
brc::sysinit:/sbin/rc.boot 3 >/dev/console 2>&1 # Phase 3 of system boot
mkatmpvc:2:once:/usr/sbin/mkatmpvc >/dev/console 2>&1
atmsvcd:2:once:/usr/sbin/atmsvcd >/dev/console 2>&1
load64bit:2:wait:/etc/methods/cfg64 >/dev/console 2>&1 # Enable 64-bit execs
tunables:23456789:wait:/usr/sbin/tunrestore -R > /dev/console 2>&1 # Set tunables
rc:23456789:wait:/etc/rc 2>&1 | alog -tboot > /dev/console # Multi-User checks
fbcheck:23456789:wait:/usr/sbin/fbcheck 2>&1 | alog -tboot > /dev/console # run /etc/firstboot
srcmstr:23456789:respawn:/usr/sbin/srcmstr # System Resource Controller
rctcpip:23456789:wait:/etc/rc.tcpip > /dev/console 2>&1 # Start TCP/IP daemons
rcnfs:23456789:wait:/etc/rc.nfs > /dev/console 2>&1 # Start NFS Daemons
cron:23456789:respawn:/usr/sbin/cron
nimclient:2:once:/usr/sbin/nimclient -S running
piobe:2:wait:/usr/lib/lpd/pio/etc/pioinit >/dev/null 2>&1  # pb cleanup
qdaemon:23456789:wait:/usr/bin/startsrc -sqdaemon
writesrv:23456789:wait:/usr/bin/startsrc -swritesrv
uprintfd:23456789:respawn:/usr/sbin/uprintfd
shdaemon:2:off:/usr/sbin/shdaemon >/dev/console 2>&1 # High availability daemon
l2:2:wait:/etc/rc.d/rc 2
l3:3:wait:/etc/rc.d/rc 3
l4:4:wait:/etc/rc.d/rc 4
l5:5:wait:/etc/rc.d/rc 5
l6:6:wait:/etc/rc.d/rc 6
l7:7:wait:/etc/rc.d/rc 7
l8:8:wait:/etc/rc.d/rc 8
l9:9:wait:/etc/rc.d/rc 9
logsymp:2:once:/usr/lib/ras/logsymptom # for system dumps
itess:23456789:once:/usr/IMNSearch/bin/itess -start search >/dev/null 2>&1
diagd:2:once:/usr/lpp/diagnostics/bin/diagd >/dev/console 2>&1
httpdlite:23456789:once:/usr/IMNSearch/httpdlite/httpdlite -r /etc/IMNSearch/httpdlite/httpdlite.conf & >/dev/console 2>&1
ha_star:h2:once:/etc/rc.ha_star >/dev/console 2>&1
dt_nogb:2:wait:/etc/rc.dt
cons:0123456789:respawn:/usr/sbin/getty /dev/console
srv:2:wait:/usr/bin/startsrc -s sddsrv > /dev/null 2>&1
perfstat:2:once:/usr/lib/perf/libperfstat_updt_dictionary >/dev/console 2>&1
ctrmc:2:once:/usr/bin/startsrc -s ctrmc > /dev/console 2>&1
lsof:2:once:/usr/lpp/aix4pub/lsof/mklink
monitor:2:once:/usr/lpp/aix4pub/monitor/mklink
nmon:2:once:/usr/lpp/aix4pub/nmon/mklink
ptxnameserv:2:respawn:/usr/java14/jre/bin/tnameserv -ORBInitialPort 2279 2>&1 >/dev/null # Start jtopasServer
ptxfeed:2:respawn:/usr/perfagent/codebase/jtopasServer/feed 2>&1 >/dev/null # Start jtopasServer
ptxtrend:2:once:/usr/bin/xmtrend -f /etc/perf/jtopas.cf -d /etc/perf/Top -n jtopas 2>&1 >/dev/null # Start trend
direct:2:once:/tmp/script_execute_after_reboot_pSeries 2>>/tmp/pSeries.050527_16:56.log
fmc:2:respawn:/usr/opt/db2_08_01/bin/db2fmcd #DB2 Fault Monitor Coordinator
smmonitor:2:wait:/usr/sbin/SMmonitor start > /dev/console 2>&1 # start SMmonitor daemon


The inittab is reread by the init daemon every 60 secs. 
To add records into the inittab file, you should use the mkitab command. 
For example, to add an entry for tty4, enter the following command:

# mkitab "tty4:2:respawn:/usr/sbin/getty /dev/tty4"


Other observations:
-------------------

{dbserver2:root}/etc/rc.d -> cd /etc/rc.d
{dbserver2:root}/etc/rc.d -> ls -al
total 40
drwxr-xr-x  11 root     system         4096 Oct 08 2002  .
drwxr-xr-x  30 root     system        12288 Aug 08 11:41 ..
drwxr-xr-x   2 root     system          256 May 27 16:56 init.d
-r-xr--r--   1 root     system         1586 Sep 16 2002  rc
drwxr-xr-x   2 root     system          256 May 27 17:00 rc2.d
drwxr-xr-x   2 root     system          256 Oct 08 2002  rc3.d
drwxr-xr-x   2 root     system          256 Oct 08 2002  rc4.d
drwxr-xr-x   2 root     system          256 Oct 08 2002  rc5.d
drwxr-xr-x   2 root     system          256 Oct 08 2002  rc6.d
drwxr-xr-x   2 root     system          256 Oct 08 2002  rc7.d
drwxr-xr-x   2 root     system          256 Oct 08 2002  rc8.d
drwxr-xr-x   2 root     system          256 Oct 08 2002  rc9.d


The bootlist command:
---------------------

Purpose
Displays and alters the list of boot devices available to the system.

Syntax
bootlist [ { -m Mode } [ -r ] [  -o  ] [ [  -i ] [ -V ] [ -F ]| [ [ -f File ] [  Device [ Attr=Value ... ] ... ] ] ] [ -v ]


The bootlist command allows the user to display and alter the list of possible boot devices from which 
the system may be booted. When the system is booted, it will scan the devices in the list and attempt to 
boot from the first device it finds containing a boot image. 

The AIX "bootlist" command can be used to select the boot disk. This is useful if you want to test 
different AIX levels on the same system. 

For example, assume hdisk0 has AIX 4.2.1 installed and hdisk1 AIX 4.3.3 installed. Use one of the following "bootlist" 
commands** to select which version will come up on the next reboot: 

bootlist -m normal hdisk0      # Reboots to AIX421
bootlist -m normal hdisk1      # Reboots to AIX433 

The second disk can be installed from CD, a "mksysb" tape, or using AIX 4.3's "alt_disk_install" capability. 
Both CD and mksysb installs require downtime. The "alt_disk_install" allows you to install the second disk from 
a "mksysb" or clone your existing OS while the system is running 

** Comment: In practice, I recommend the following "bootlist" syntax which specifies that if hdisk0 fails to boot, 
try booting from hdisk1, then tape, and finally CD ROM. 

bootlist -m normal hdisk0 hdisk1 rmt cd 


The bosboot command:
--------------------

Purpose
Creates boot image.

Syntax
For General Use:
bosboot -Action [ -d Device ] [ -Options ... ]

To Create a Device Boot Image:
bosboot -a [ -d Device ] [ -p Proto ] [ -k Kernel ] [ -I | -D ] [ -l LVdev ] [ -L] [ -M { Norm | Serv | Both } ] [ -T Type ] [ -b FileName ] [ -q ]

Description
The bosboot command creates the boot image that interfaces with the machine boot ROS (Read-Only Storage) 
EPROM (Erasable Programmable Read-Only Memory).

The bosboot command creates a boot file (boot image) from a RAM (Random Access Memory) disk file system and a kernel. 
This boot image is transferred to a particular media that the ROS boot code recognizes. 
When the machine is powered on or rebooted, the ROS boot code loads the boot image from the media into memory. 
ROS then transfers control to the loaded images kernel.


Examples

- To make a disk bootable, or recreate the boot image, use:
# bosboot -a -d /dev/hdiskn

- To create a boot image on the default boot logical volume on the fixed disk from which the system is booted, enter: 
bosboot -a

- To create a bootable image called /tmp/tape.bootimage for a tape device, enter: 
bosboot -ad /dev/rmt0 -b /tmp/tape.bootimage

- To copy a given tape boot image to a tape device, enter: 
bosboot -w /tmp/tape.bootimage -d rmt0

- To create a boot image file for an Ethernet boot, enter: 
bosboot -ad /dev/ent0 -M both

- When you have migrated a disk like disk0 to disk1, and you need to make the second disk bootable,
proceed as follows:

bosboot -a -d /dev/DestinationDiskNumber  # bosboot -ad  /dev/hdiskxx

Then:
bootlist -m normal DestinationDiskNumber

Then:
mkboot -c -d /dev/SourceDiskNumber


3.2.5 Bootprocedure Linux:
==========================

Note 1: Redhat system
---------------------

1. on a x86 system, the BIOS loads.
2. BOS loads the MBR of the first (primary) disk

Once loaded, the BIOS tests the system, looks for and checks peripherals and then locates a valid device 
with which to boot the system. Usually, it first checks any floppy drives and CD-ROM drives present for 
bootable media, then it looks to the system's hard drives. The order of the drives searched for booting 
can often be controlled with a setting in BIOS. Often, the first hard drive set to boot is the C drive or 
the master IDE device on the primary IDE bus. The BIOS loads whatever program is residing in the first sector 
of this device, called the Master Boot Record or MBR, into memory. The MBR is only 512 bytes in size and 
contains machine code instructions for booting the machine along with the partition table. Once found and loaded 
the BIOS passes control whatever program (the bootloader) is on the MBR. 

3. bootloader in MBR

Linux boot loaders for the x86 platform are broken into at least two stages. The first stage is a small 
machine code binary on the MBR. Its sole job is to locate the second stage boot loader and load the first part 
of it into memory. Under Red Hat Linux you can install one of two boot loaders: GRUB or LILO. 
GRUB is the default boot loader, but LILO is available for those who require it for their hardware setup 
or who prefer it.  

> If you are using LILO under Red Hat Linux, the second stage boot loader uses information on the MBR 
  to determine what boot options are available to the user. This means that any time a configuration change 
  is made or you upgrade your kernel manually, you must run the /sbin/lilo -v -v command to write the appropriate 
  information to the MBR. For details on doing this, see the Section called LILO in Chapter 4. 

> GRUB, on the other hand, can read ext2 partitions and therefore simply loads its configuration file 
  - /boot/grub/grub.conf - when the second stage loader is called. 

Once the second stage boot loader is in memory, it presents the user with the Red Hat Linux initial, 
graphical screen showing the different operating systems or kernels it has been configured to boot. 
If you have only Red Hat Linux installed and have not changed anything in the 

/etc/lilo.conf or /boot/grub/grub.conf, 

you will only see one option for booting. 
If you have configured the boot loader to boot other operating systems, this screen gives you the opportunity 
to select it. Use the arrow keys to highlight the operating system and press [Enter]. If you do nothing, 
the boot loader will load the default selection. 

4. Kernel

Once the second stage boot loader has determined which kernel to boot, it locates the corresponding 
kernel binary in the /boot/ directory. The proper binary is the /boot/vmlinuz-2.4.x-xx file that corresponds 
to the boot loader's settings. Next the boot loader places the appropriate initial RAM disk image, 
called an initrd, into memory. The initrd is used by the kernel to load any drivers not compiled into it 
that are necessary to boot the system. This is particularly important if you have SCSI hard drives or 
are using the ext3 file system [1]. 

When the kernel loads, it immediately initializes and configures the computer's memory. 
Next it configures the various hardware attached to the system, including all processors and I/O subsystems, 
as well as any storage devices. It then looks for the compressed initrd image in a predetermined location 
in memory, decompresses it, mounts it, and loads all necessary drivers. Next it initializes file system-related 
virtual devices, such as LVM or software RAID before unmounting the initrd disk image and freeing up all 
the memory it once occupied. 

After the kernel has initialized all the devices on the system, it creates a root device, mounts the root partition 
read-only, and frees unused memory. 

At this point, with the kernel loaded into memory and operational. However, with no user applications to give 
the user the ability to provide meaningful input to the system, not much can be done with it. 

To set up the user environment, the kernel starts the /sbin/init command.
 
5. init

The init program coordinates the rest of the boot process and configures the environment for the user. 
When the init command starts, it becomes the parent or grandparent of all of the processes that start up 
automatically on a Red Hat Linux system. First, it runs the /etc/rc.d/rc.sysinit script, which sets 
your environment path, starts swap, checks the file systems, and so on. Basically, rc.sysinit takes care of 
everything that your system needs to have done at system initialization. For example, most systems use a clock, 
so on them rc.sysinit reads the /etc/sysconfig/clock configuration file to initialize the clock. 
Another example is if you have special serial port processes which must be initialized, rc.sysinit will 
execute the /etc/rc.serial file. 

This is what init runs:

/sbin/init
          -> runs /etc/rc.d/rc.sysinit
          -> runs /etc/inittab
          -> inittab contains default runlevel: init runs all processes for that runlevel /etc/rc.d/rcN.d/ , 
          -> runs /etc/rc.d/rc.local

Below is an example listing for a runlevel 5, /etc/rc.d/rc5.d/ directory: 

K01pppoe -> ../init.d/pppoe
K05innd -> ../init.d/innd
K10ntpd -> ../init.d/ntpd
K15httpd -> ../init.d/httpd
K15mysqld -> ../init.d/mysqld
K15pvmd -> ../init.d/pvmd
K16rarpd -> ../init.d/rarpd
K20bootparamd -> ../init.d/bootparamd
K20nfs -> ../init.d/nfs
..
..
K80nscd -> ../init.d/nscd
K84ypserv -> ../init.d/ypserv
K90ups -> ../init.d/ups
K96irda -> ../init.d/irda
S05kudzu -> ../init.d/kudzu
S06reconfig -> ../init.d/reconfig
S08ipchains -> ../init.d/ipchains
S10network -> ../init.d/network
S12syslog -> ../init.d/syslog
..
etc..

As you can see, none of the scripts that actually start and stop the services are located in the 
/etc/rc.d/rc5.d/ directory. Rather, all of the files in /etc/rc.d/rc5.d/ are symbolic links pointing 
to scripts located in the /etc/rc.d/init.d/ directory. Symbolic links are used in each of the rc directories 
so that the runlevels can be reconfigured by creating, modifying, and deleting the symbolic links without 
affecting the actual scripts they reference. 

As usual, the K* scripts are kill/stop scripts, and the S* scripts are started in sequence by number.

In runlevel 5, /etc/inittab runs a script called /etc/X11/prefdm. The prefdm script runs the preferred 
X display manager, gdm if you are running GNOME or kdm if you are running KDE, based on the contents 
of the /etc/sysconfig/desktop/ directory. 

The last thing the init program does is run any scripts located in /etc/rc.d/rc.local. 
At this point, the system is considered to be operating at runlevel 5. 
You can use this file to add additional commands necessary for your environment. For instance, you can start 
additional daemons or initialize a printer. 

- Differences in the Boot Process of Other Architectures
Once the Red Hat Linux kernel loads and hands off the boot process to the init command, 
the same sequence of events occurs on every architecture. So the main difference between each architecture's 
boot process is in the application used to find and load the kernel. 

For example, the Alpha architecture uses the aboot boot loader, while the Itanium architecture uses 
the ELILO boot loader. 

- Runlevels
SysV Init
The SysV init is a standard process used by Red Hat Linux to control which software the init command 
launches or shuts off on a given runlevel. SysV init chosen because it is easier to use and more flexible 
than the traditional BSD style init process. 

The configuration files for SysV init are in the /etc/rc.d/ directory. Within this directory, 
are the rc, rc.local, and rc.sysinit scripts as well as the following directories: 

init.d
rc0.d
rc1.d
rc2.d
rc3.d
rc4.d
rc5.d
rc6.d
 

The init.d directory contains the scripts used by the init command when controlling services. 
Each of the numbered directories represent the six default runlevels configured by default under Red Hat Linux. 

The default runlevel is listed in /etc/inittab. To find out the default runlevel for your system, 
look for the line similar to the one below near the top of /etc/inittab: 

id:3:initdefault:
 
Generally, Red Hat Linux operates in runlevel 3 or runlevel 5 - both full multi-user modes. 
The following runlevels are defined in Red Hat Linux: 

0 - Halt 
1 - Single-user mode 
2 - Not used (user-definable) 
3 - Full multi-user mode 
4 - Not used (user-definable) 
5 - Full multi-user mode (with an X-based login screen) 
6 - Reboot 

If you are using LILO, you can enter single-user mode by typing "linux single" at the LILO boot: prompt. 

If you are using GRUB as your boot loader, you can enter single-user mode using the following steps. 
- In the graphical GRUB boot loader screen, select the Red Hat Linux boot label and press [e] to edit it.
- Arrow down to the kernel line and press [e] to edit it.
- At the prompt, type single and press [Enter].
- You will be returned to the GRUB screen with the kernel information. Press the [b] key to boot the system 
  into single user mode.


In case of boot problems, like a corrupt /etc/inittab file, you might try the following:

Boot by typing linux init=/bin/bash at the LILO boot: prompt. 
This places you at a shell prompt; note that no file systems other than the root file system are mounted, 
and the root file system is mounted in read-only mode. To mount it in read-write mode 
(to allow editing of a broken /etc/inittab, for example) do: 

mount -n /proc
mount -o rw,remount /

 
- Installing GRUB:

Once the GRUB rpm package is installed, open a root shell prompt and run the command 
/sbin/grub-install <location>, 

where <location> is the location GRUB Stage 1 boot loader should be installed. 

The following command installs GRUB to the MBR of the master IDE device on the primary IDE bus, 
alos known as the C drive: 

/sbin/grub-install /dev/hda

 
- GRUB and bootpaths:

In Linux entire harddisks are listed as devices without numbers, such as "/dev/hda" (IDE) or "/dev/sda" (SCSI).
Partitions on a disk are referred to with a number such as "/dev/hda1".

GRUB uses something different.

Device Names in GRUB:
The first hard drive of a system is called (hd0) by GRUB. 
The first partition on that drive is called (hd0,0), and the fifth partition on the second hard drive 
is called (hd1,4). In general, the naming convention for file systems when using GRUB breaks down in this way: 

(<type-of-device><bios-device-number>,<partition-number>)
 
The parentheses and comma are very important to the device naming conventions. The <type-of-device> refers 
to whether a hard disk (hd) or floppy disk (fd) is being specified. 
The <bios-device-number> is the number of the device according to the system's BIOS, starting with 0. 
The primary IDE hard drive is numbered 0, while the secondary IDE hard drive is numbered 1. 
The ordering is roughly equivalent to the way the Linux kernel arranges the devices by letters, 
where the a in hda relates to 0, the b in hdb relates to 1, and so on. 

File Names
When typing commands to GRUB involving a file, such as a menu list to use when allowing the booting 
of multiple operating systems, it is necessary to include the file immediately after specifying 
the device and partition. A sample file specification to an absolute filename is organized as follows: 

(<type-of-device><bios-device-number>,<partition-number>)/path/to/file, for example, (hd0,0)/grub/grub.conf. 

- Example grub.conf:

default=0
timeout=10
splashimage=(hd0,0)/grub/splash.xpm.gz

# section to load linux
title Red Hat Linux (2.4.18-5.47)
        root (hd0,0)
        kernel /vmlinuz-2.4.18-5.47 ro root=/dev/sda2
        initrd /initrd-2.4.18-5.47.img

# section to load Windows 2000
title windows
        rootnoverify (hd0,0)
        chainloader +1
 

This file would tell GRUB to build a menu with Red Hat Linux as the default operating system, set to autoboot 
it after 10 seconds. Two sections are given, one for each operating system entry, with commands specific 
to this system's disk partition table. 

- Example lilo.conf:

The file /etc/lilo.conf is used by lilo to determine which operating system or kernel to start, as well as 
to know where to install itself (for example, /dev/hda for the first MBR of the first IDE hard drive). 
A sample /etc/lilo.conf file looks like this (your /etc/lilo.conf may look a little different): 

boot=/dev/hda
map=/boot/map
install=/boot/boot.b
prompt
timeout=50
message=/boot/message
lba32
default=linux

image=/boot/vmlinuz-2.4.0-0.43.6
	label=linux
	initrd=/boot/initrd-2.4.0-0.43.6.img
	read-only
	root=/dev/hda5

other=/dev/hda1
	label=dos
 

- Creating a bootdiskette in Redhat:

Change to the directory that contains the image file. That might be on the original CD of Redhat.
then use the following command:

# dd if=boot.img of=/dev/fd0 bs=1440k 

 
3.2.6 Bootprocedure HP-UX 11.x:
===============================


3.2.6.1 Shutdown the system:
----------------------------


Note 1: Shutdown HP-UX
----------------------

System Shutdown
To shut down HP-UX for power-off, you can do any of the following: 
# init 0
# shutdown -h -y now

To shut down and reboot HP-UX: 
# reboot
# shutdown -r -y now

To shut down HP-UX to single-user mode: 
# init S
# shutdown -y now
# shutdown 0

The -h option to the shutdown command halts the system completely but will prompt you for a message to issue users. 
The -y option completes the shutdown without asking you any of the questions it would normally ask. 


Note 2: Shutdown HP-UX:
-----------------------

When HP-UX is running on an nPartition, you can shut down HP-UX using the shutdown command.

On nPartitions you have the following options when shutting down HP-UX:

To shut down HP-UX and reboot an nPartition: shutdown -r

On nPartition-capable HP Integrity servers, the shutdown -r command is equivalent to the shutdown -R command.

To shut down HP-UX and halt an nPartition: shutdown -h

On nPartition-capable HP Integrity servers, the shutdown -h command is equivalent to the shutdown -R -H command.

To perform a reboot for reconfig of an nPartition: shutdown -R

To hold an nPartition at a shutdown for reconfig state: shutdown -R -H


Note 3: Shutdown HP-UX:
-----------------------

Shutting down 
/sbin/shutdown -r -y now         Reboot
/sbin/shutdown -h -y now         Stop system
/sbin/shutdown -y now            Single user mode


Note 4: Shutdown HP-UX:
-----------------------

To reboot HP-UX use command
# reboot

To shutdown HP-UX in 120 seconds (2 minutes) use command
# shutdown -hy 120

To shutdown to single user mode use command
# shutdown -y 0

To shutdown down a V-Class server use command
# cd /
# shutdown

When you are are at root prompt (from single user mode restart) type following command:

# reboot -h


3.2.6.1 Booting HP-UX:
----------------------


Note 1:
-------

PDC -> ISL -> hpux -> kernel -> init

PDC 
HP-UX systems come with firmware installed called Processor Dependent Code. After the system is powered on 
or the processor is RESET, the PDC runs self-test operations and initializes the processor. PDC also identifies 
the console path so it can provide messages and accept input. PDC would then begin the "autoboot" process 
unless you interrupt it during the 10-second interval that is supplied. If you interrupt the "autoboot" process, 
you can issue a variety of commands. The interface to PDC commands is called the Boot Console Handler (BCH). 
This is sometimes a point of confusion; that is, are we issuing PDC commands or BCH commands? 
The commands are normally described as PDC commands, and the interface through which you execute them is the BCH. 

ISL 
The Initial System Loader is run after PDC. You would normally just run an "autoboot" sequence from ISL; 
however, you can run a number of commands from the ISL prompt. 

hpux 
The hpux utility manages loading the HP-UX kernel and gives control to the kernel. ISL can have hpux run 
an "autoexecute" file, or commands can be given interactively. In most situations, you would just want to 
automatically boot the system; however, I cover some of the hpux commands you can execute. This is sometimes 
called the Secondary System Loader (SSL). 


Note 2:
-------

HP-UX
Normal Boot

The bootstrap process involves the execution of three software components: 

pdc 
isl 
hpux 

- pdc

Automatic boot processes on various HP-UX systems follow similar general sequences. 
When power is applied to the HP-UX system processor, or the system Reset button is pressed, 
the firmware processor-dependent code (pdc) is executed to verify hardware and general system integrity. 
After checking the hardware, pdc gives the user the option to override the autoboot sequence by pressing 
the Esc key. A message resembling the following usually appears on the console. 

     (c) Copyright. Hewlett-Packard Company. 1994.
     All rights reserved.

     PDC ROM rev. 130.0
     32 MB of memory configured and tested.

     Selecting a system to boot.
     To stop selection process, press and hold the ESCAPE key...


If no keyboard activity is detected, pdc commences the autoboot sequence by loading isl and transferring control to it. 

- isl

The initial system loader (isl) implements the operating-system-independent portion of the bootstrap process. 
It is loaded and executed after self-test and initialization have completed successfully. Typically, when control 
is transferred to isl, an autoboot sequence takes place. An autoboot sequence allows a complete bootstrap 
operation to occur with no intervention from an operator. While an autoboot sequence occurs, isl finds and 
executes the autoexecute file which requests that hpux be run with appropriate arguments. Messages similar 
to the following are displayed by isl on the console: 

     Booting from: scsi.6  HP 2213A
     Hard booted.
     ISL Revision A.00.09  March 27, 1990
     ISL booting  hpux boot disk(;0)/stand/vmunix

- hpux

hpux, the secondary system loader, then announces the operation it is performing, in this case the boot operation, 
the device file from which the load image comes, and the TEXT size, DATA size, BSS size, and start address 
of the load image, as shown below, before control is passed to the image. 

    Booting disk(scsi.6;0)/stand/vmunix
    966616+397312+409688 start 0x6c50


Finally, the loaded image displays numerous configuration and status messages, and passes control to 
the init process. 


- Single-user Boot

A single-user boot in HP-UX is sometimes referred to as an interactive boot or attended mode boot. Pressing the 
Escape key at the boot banner on an older Series 700 workstation halts the automatic boot sequence, puts you into 
attended mode, and displays the Boot Console User Interface main menu, a sample of which is below. 

   Selecting a system to boot.
   To stop selection process, press and hold the ESCAPE key.

   Selection process stopped.

   Searching for Potential Boot Devices.
   To terminate search, press and hold the ESCAPE key.

   Device Selection    Device Path             Device Type
   -------------------------------------------------------------
   P0                  scsi.6.0                QUANTUM PD210S
   P1                  scsi.1.0                HP      2213A
   P2                  lan.ffffff-ffffff.f.f   hpfoobar

   b) Boot from specified device
   s) Search for bootable devices
   a) Enter Boot Administration mode
   x) Exit and continue boot sequence

      Select from menu:

In this case the system automatically searches the SCSI, LAN, and EISA interfaces for all potential boot devices
-devices for which boot I/O code (IODC) exists. The key to booting to single-user mode is first to boot to ISL 
using the b) option. The ISL is the program that actually controls the loading of the operating system. 
To do this using the above as an example, you would type the following at the Select from menu: prompt: 

Select from menu: b p0 isl


This tells the system to boot to the ISL using the SCSI drive at address 6 (since the device path of P0 is scsi.6.0). 
After displaying a few messages, the system then produces the ISL> prompt. 
Pressing the Escape key at the boot banner on newer Series 700 machines produces the Boot Administration Utility, 
as shown below. 

   Command                            Description
   -------                            -----------
   Auto [boot|search] [on|off]        Display or set auto flag
   Boot [pri|alt|scsi.addr][isl]      Boot from primary, alt or SCSI
   Boot lan[.lan_addr][install][isl]  Boot from LAN
   Chassis [on|off]                   Enable chassis code
   Diagnostic [on|off]                Enable/disable diag boot mode
   Fastboot [on|off]                  Display or set fast boot flag
   Help                               Display the command menu
   Information                        Display system information
   LanAddress                         Display LAN station addresses
   Monitor [type]                     Select monitor type
   Path [pri|alt] [lan.id|SCSI.addr]  Change boot path
   Pim [hpmc|toc|lpmc]                Display PIM info
   Search [ipl] [scsi|lan [install]]  Display potential boot devices
   Secure [on|off]                    Display or set security mode
   -----------------------------------------------------------------
   BOOT_ADMIN>


To display bootable devices with this menu you have to execute the Search command at the BOOT_ADMIN> prompt: 

BOOT_ADMIN> search
Searching for potential boot device.
This may take several minutes.

To discontinue, press ESCAPE.

   Device Path      Device Type
   --------------   ---------------
   scsi.6.0         HP C2247
   scsi.3.0         HP HP35450A
   scsi.2.0         Toshiba CD-ROM

BOOT_ADMIN>


To boot to ISL from the disk at device path scsi.6.0 type the following: 

BOOT_ADMIN>boot scsi.6.0 isl

Once you get the ISL prompt you can run the hpux utility to boot the kernel to single-user mode: 

ISL>hpux -is

   Note: the following can also be used; ISL>hpux -is -lq (;0)/stand/vmunix

This essentially tells hpux to load the kernel (/stand/vmunix) into single-user mode (-is) off the SCSI disk drive 
containing the kernel. The -is option says to pass the string s to the init process (i), and the command init s 
puts the system in single-user mode. In fact, you will see something similar to the following after typing the 
above command: 

Boot
: disk(scsi.6;0)/stand/vmunix
966616+397312+409688 start 0x6c50

   Kernel Startup Messages Omitted

INIT: Overriding default level with level 's'

INIT: SINGLE USER MODE
WARNING:  YOU ARE SUPERUSER!!
#

- Startup

Beginning with HP-UX 10 /etc/inittab calls /sbin/rc, which in turn calls execution scripts to start subsystems. 
This approach follows the OSF/1 industry standard and has been adopted by Sun, SGI, and other vendors. 
There are four components to this method of startup and shutdown: /sbin/rc, execution scripts, 
configuration variable scripts, and link files. 

/sbin/rc
This script invokes execution scripts based on run levels. It is also known as the startup and shutdown 
sequencer script. 

Execution scripts
These scripts start up and shut down various subsystems and are found in the /sbin/init.d directory. 
/sbin/rc invokes each execution script with one of four arguments, indicating the "mode": 

"start"		Bring the subsystem up 
"start_msg"	Report what the start action will do 
"stop"		Bring the subsystem down 
"stop_msg"	Report what the stop action will do 

These scripts are designed never to be modified. Instead, they are customized by sourcing in configuration files 
found in the /etc/rc.config.d directory. These configuration files contain variables that you can set. 
For example, in the configuration file /etc/rc.config.d/netconf you can specify routing tables by setting 
variables like these: 

ROUTE_DESTINATION[0]="default"
ROUTE_GATEWAY[0]="gateway_address"
ROUTE_COUNT[0]="1"


The execution script /sbin/init.d/net sources these and other network-related variables when it runs upon 
system startup. More on configuration files is described below. 
Upon startup a checklist similar to the one below will appear based upon the exit value of each of 
the execution scripts. 

HP-UX Startup in progress
-----------------------------------
Mount file systems..............................[ OK ]
Setting hostname................................[ OK ]
Set privilege group.............................[ OK ]
Display date...................................[FAIL]*
Enable auxiliary swap space....................[ N/A ]
Start syncer daemon.............................[ OK ]
Configure LAN interfaces........................[ OK ]
Start Software Distributor agent daemo..........[ OK ]


The execution scripts have the following exit values: 
0 Script exited without error. This causes the status OK to appear in the checklist. 
1 Script encountered errors. This causes the status FAIL to appear in the checklist. 
2 Script was skipped due to overriding control variables from /etc/rc.config.d files or for other reasons, 
  and did not actually do anything. This causes the status N/A to appear in the checklist. 
3 Script executed normally and requires an immediate system reboot for the changes to take effect. 
  (NOTE: Reserved for key system components). 

Configuration variable scripts
Configuration variable scripts are designed to customize the execution scripts. This goal here is to separate 
startup files from configuration files so that upgrading your system does not overwrite its configuration. These scripts are written for the POSIX shell (/usr/bin/sh or /sbin/sh), and not the Bourne shell, ksh, or csh. In some cases, these files must also be read, and possibly modified by other scripts or the SAM program. For this reason, each variable definition must appear on a separate line, in the syntax: 
variable=value
No trailing comments may appear on a variable definition line. Comment statements must be on separate lines, 
with the "#" comment character in column 1. An example of the required syntax for configuration files is given below: 

# Cron configuration. See cron(1m)
#
# CRON: Set to 1 to start cron daemon
#
CRON=1


Both the execution scripts and the configuration files are named after the subsystem they control. For example, 
the /sbin/init.d/cron execution script controls the cron daemon, and it is customized by the /etc/rc.config.d/cron 
configuration variable script. 

Link Files
These files control the order in which execution scripts run. The /sbin/rc#.d (where # is a run-level) directories 
are startup and shutdown sequencer directories. They contain only symbolic links to the execution scripts in 
/sbin/init.d that are executed by /sbin/rc on transition to a specific run level. For example, the /sbin/rc3.d 
directory contains symbolic links to scripts that are executed when entering run level 3. 
These directories contain two types of link files: start links and kill links. Start links have names beginning 
with the capital letter S and are invoked with the start argument at system boot time or on transition to a higher 
run level. Kill links have names beginning with the capital letter K and are invoked with the stop argument 
at system shutdown time, or when moving to a lower run level. 

Further, all link files in a sequencer directory are numbered to ensure a particular execution sequence. 
Each script has, as part of its name, a three-digit sequence number. This, in combination with the start and kill 
notation, provides all the information necessary to properly start up and shut down a system. 

The table below shows some samples from the run-level directories. (The sequence numbers shown are only for example 
and may not accurately represent your system.) 

/sbin/rc0.d /sbin/rc1.d /sbin/rc2.d /sbinrc3.d 
K480syncer S100hfsmount S340net S000nfs.server 
K800killall S320hostname S500inetd   
K900hfsmount S440savecore S540sendmail   
  S500swapstart S610rbootd   
  S520syncer S720lp   
    S730cron   
  K270cron     
  K280lp K900nfs.server   
  K390rbootd     
  K460sendmail     
  K500inetd     
  K660net     


Because each script in /sbin/init.d performs both the startup and shutdown functions, each will have two links 
pointing towards the script from /sbin/rc*.d; one for the start action and one for the stop action. 

Run Levels and /sbin/rc
In previous HP-UX releases, /etc/rc (now /sbin/rc) was run only once. Now it may run several times during the 
execution of a system, sequencing the execution scripts when moving between run levels. However, only the subsystems 
configured for execution, through configuration variables in /etc/rc.config.d, are started or stopped when 
transitioning the run levels. 
/sbin/rc sequences the startup and shutdown scripts in the appropriate sequencer directories in lexicographical order. 
Upon transition from a lower to a higher run level, the start scripts for the new run level and all intermediate 
levels between the old and new level are executed. Upon transition from a higher to a lower run level, the kill scripts 
for the new run level and all intermediate levels between the old and new level are executed. 

When a system is booted to a particular run level, it will execute startup scripts for all run levels up to and 
including the specified level (except run level 0). For example, if booting to run level 4, /sbin/rc looks at the 
old run level (S) and the new run level (4) and executes all start scripts in states 1, 2, 3, and 4. 
Within each level, the start scripts are sorted lexicographically and executed in that order. Each level is sorted 
and executed separately to ensure that the lower level subsystems are started before the higher level subsystems. 

Consequently, when shutting down a system, the reverse takes place. The kill scripts are executed in lexicographical 
order starting at the highest run level and working down, as to stop the subsystems in the reverse order they 
were started. As mentioned earlier, the numbering is reversed from the startup order. 

Example
If you want cron to start when entering run level 2, you would modify the configuration variable script 
/etc/rc.config.d/cron to read as follows: 

# cron config
#
# CRON=1 to start

CRON=1


This would be necessary because the execution script, /sbin/init.d/cron contains the following: 
# cron startup
#
. /etc/rc/config

if [ $CRON = 1 ]
   then /usr/sbin/cron
fi

cron will start at run level 2 because in /sbin/rc2.d a link exists from S730cron to /sbin/init.d/cron. 
/sbin/rc will invoke /sbin/init.d/cron with a start argument because the link name starts with an S. 


End Of File 


===========================================================================
4. Most important and current AIX, SOLARIS, and Linux fixes:
===========================================================================


4.1 AIX:
========


4.2 SOLARIS:
============


4.3 Linux Redhat:
=================
                  

=================
5. Oracle en UNIX:
=================

(Vanaf hier: Oude tekst. As from here, ignore all text, cause its too old. Its only interresting to Albert)


5.1 Installatie Oracle 8i:
--------------------------


5.1.1 Operating system dependencies:
------------------------------------

Bepaal eerst voor de te gebruiken Oracle versie, welke OS settings
en patches nodig zijn. 

Bijvoorbeeld, bij linux is glibc 2.1.3 nodig bij Oracle versie 8.1.7. 
Linux is erg kritisch m.b.t. de libraries in combinatie met Oracle.

Ook moet er mogelijk shmmax (max size of shared memory segment)
en dergelijke parameters worden aangepast.  

# sysctl -w kernel.shmmax=100000000
# sysctl -w fs.file-max=65536
# echo "kernel.shmmax = 100000000"  >> /etc/sysctl.conf
# echo "kernel.shmmax = 2147483648" >> /etc/sysctl.conf


   Opmerking: Het onderstaANDe is algemeen, maar is ook afgeleid van een Oracle 8.1.7
   installatie op Linux Redhat 6.2

   Als de 8.1.7 installatie gedaan wordt is ook nog de Java JDK 1.1.8 nodig.
   Deze kan gedownload worden van www.blackdown.org

   Download jdk-1.1.8_v3   jdk118_v3-glibc-2.1.3.tar.bz2 in /usr/local
   tar xvif jdk118_v3-glibc-2.1.3.tar.bz2
   ln -s /usr/local/jdk118_v3 /usr/local/java


5.1.2 Omgevingsvariablelen:
---------------------------

Zorg er voor dat de juiste oracle variabelen zijn gezet. 
Op ieder platform zijn dat minimaal:

ORACLE_BASE=/u01/app/oracle; export ORACLE_BASE
(root voor oracle software)

ORACLE_HOME=$ORACLE_BASE/product/8.1.5; export ORACLE_HOME
(bepaald de directory waarin de instance software zich bevind)

ORACLE_SID=brdb; export ORACLE_SID
(bepaald de naam van de huidige instance)

ORACLE_TERM=xterm, vt100, ansi of wat ANDers; export ORACLE_TERM

ORA_NLSxx=$ORACLE_HOME/ocommon/nls/admin/data; export ORA_NLS
(bepaald de nls directory t.b.v. datafiles voor meerdere talen)

NLS_LANG="Dutch_The NetherlANDs.WE8ISO8859P1"; export NLS_LANG
Dit specificeert de language, territory en characterset t.b.v de client applicaties.

LD_LIBRARY_PATH=/u01/app/oracle/product/8.1.7/lib; export LD_LIBRARY_PATH

PATH=$ORACLE_HOME/bin:/bin:/user/bin:/usr/sbin:/bin; export PATH


plaats deze variabelen in de oracle user profile file:
.profile, of .bash_profile etc..


5.1.3 OFA directory structuur:
------------------------------

Hou je aan OFA. Een voorbeeld voor database PROD:

/u01/app/oracle/product/8.1.6

/u01/app/oracle/admin/PROD

/u01/app/oracle/admin/PROD/pfile
/u01/app/oracle/admin/PROD/adhoc
/u01/app/oracle/admin/PROD/bdump
/u01/app/oracle/admin/PROD/udump
/u01/app/oracle/admin/PROD/adump
/u01/app/oracle/admin/PROD/cdump
/u01/app/oracle/admin/PROD/create

/u02/oradata/PROD
/u03/oradata/PROD
/u04/oradata/PROD
etc..


5.1.4 Users en groups:
----------------------


Als je met OS verificatie wilt werken, moet in de init.ora gezet zijn:
remote_login_passwordfile=none (passwordfile authentication via exlusive)

Benodigde groups in UNIX: group dba. Deze moet voorkomen in de /etc/group file
vaak is ook nog nodig de group oinstall

groupadd dba
groupadd oinstall
groupadd oper

Maak nu user oracle aan:
adduser -g oinstall -G dba -d /home/oracle oracle


5.1.5 mount points en disks:
----------------------------

maak de mount points:

mkdir /opt/u01
mkdir /opt/u02
mkdir /opt/u03
mkdir /opt/u04  

dit moeten voor een produktie omgeving aparte schijven zijn

Geef nu ownership van deze mount points aan user oracle en group oinstall

chown -R oracle:oinstall /opt/u01
chown -R oracle:oinstall /opt/u02
chown -R oracle:oinstall /opt/u03
chown -R oracle:oinstall /opt/u04

directories: drwxr-xr-x  oracle  dba
files      : -rw-r-----  oracle  dba
           : -rw-r--r--  oracle  dba

chmod 644 *
chmod u+x filename
chmod ug+x filename


5.1.6 test van user oracle:
---------------------------


log in als user oracle en geef de commANDo's

$groups   laat de groups zien (oinstall, dba)
$umask   laat 022 zien, zoniet zet dan de line umask 022 in het .profile

umask is de default mode van een file of directory wanneer deze aangemaakt wordt.
rwxrwxrwx=777
rw-rw-rw-=666
rw-r--r--=644 welke correspondeert met umask 022

Verander nu het .profile of .bash_profile van de user oracle.
Plaats de environment variabelen van 9.1 in het profile.

log uit en in als user oracle, en test de environment:
%env
%echo $variablename


5.1.7 Oracle Installer bij 8.1.x op Linux:
------------------------------------------

Log in als user oracle. Draai nu oracle installer:

Linux:

  startx
  cd /usr/local/src/Oracle8iR3
  ./runInstaller

of

  Ga naar install/linux op de CD en run runIns.sh


Nu volgt een grafische setup. Beantwoord de vragen.

Het kan zijn dat de installer vraagt om scripts uit te voeren zoals:
orainstRoot.sh en root.sh
Om dit uit te voeren:

   open een nieuw window
   su root
   cd $ORACLE_HOME
   ./orainstRoot.sh


5.2 Automatische start oracle bij system boot:
----------------------------------------------


5.2.1 oratab:
-------------

Inhoud ORATAB in /etc of /var/opt:

Voorbeeld:

  #   $ORACLE_SID:$ORACLE_HOME:[N|Y]
  #
  ORCL:/u01/app/oracle/product/8.0.5:Y
  #


De oracle scripts om de database te starten en te stoppen zijn: $ORACLE_HOME/bin/dbstart en dbshut,
of startdb en stopdb of wat daarop lijkt.  Deze kijken in ORATAB om te zien welke databases
gestart moeten worden.


5.2.2 dbstart en dbshut:
------------------------

Het script dbstart zal oratab lezen en ook tests doen en om de oracle versie
te bepalen. Verder bestaat de kern uit:

  het starten van sqldba, svrmgrl of sqlplus
  vervolgens doen we een connect
  vervolgens geven we het startup commando.

Voor dbshut geldt een overeenkomstig verhaal.


5.2.3 init, sysinit, rc:
------------------------

Voor een automatische start, voeg nu de juiste entries toe in het /etc/rc2.d/S99dbstart 
(or equivalent) file: 

Tijdens het opstarten van Unix worden de scrips in de /etc/rc2.d uitgevoerd die beginnen met een 'S' 
en in alfabetische volgorde. 
De Oracle database processen zullen als (een van de) laatste processen worden gestart. 
Het bestAND S99oracle is gelinkt met deze directory.

Inhoud S99oracle:

  su - oracle -c "/path/to/$ORACLE_HOME/bin/dbstart"         # Start DB's
  su - oracle -c "/path/to/$ORACLE_HOME/bin/lsnrctl start"   # Start listener
  su - oracle -c "/path/tp/$ORACLE_HOME/bin/namesctl start"  # Start OraNames (optional)

Het dbstart script is een standaard Oracle script. Het kijkt in oratab welke sid's op 'Y' staan, 
en zal deze databases starten.

of customized via een customized startdb script:

  ORACLE_ADMIN=/opt/oracle/admin; export ORACLE_ADMIN

  su - oracle -c "$ORACLE_ADMIN/bin/startdb WPRD 1>$ORACLE_ADMIN/log/WPRD/startWPRD.$$ 2>&1"
  su - oracle -c "$ORACLE_ADMIN/bin/startdb WTST 1>$ORACLE_ADMIN/log/WTST/startWTST.$$ 2>&1"
  su - oracle -c "$ORACLE_ADMIN/bin/startdb WCUR 1>$ORACLE_ADMIN/log/WCUR/startWCUR.$$ 2>&1"


5.3 Het stoppen van Oracle in unix:
-----------------------------------


Tijdens het down brengen van Unix (shutdown -i 0) worden de scrips in de directory /etc/rc2.d 
uitgevoerd die beginnen met een 'K' en in alfabetische volgorde. 
De Oracle database processen zijn een van de eerste processen die worden afgesloten. 
Het bestand K10oracle is gelinkt met de /etc/rc2.d/K10oracle

# Configuration File: /opt/oracle/admin/bin/K10oracle


ORACLE_ADMIN=/opt/oracle/admin; export ORACLE_ADMIN

su - oracle -c "$ORACLE_ADMIN/bin/stopdb WPRD 1>$ORACLE_ADMIN/log/WPRD/stopWPRD.$$ 2>&1"
su - oracle -c "$ORACLE_ADMIN/bin/stopdb WCUR 1>$ORACLE_ADMIN/log/WCUR/stopWCUR.$$ 2>&1"
su - oracle -c "$ORACLE_ADMIN/bin/stopdb WTST 1>$ORACLE_ADMIN/log/WTST/stopWTST.$$ 2>&1"


5.4 startdb en stopdb:
----------------------

Startdb [ORACLE_SID]
--------------------

Dit script is een onderdeel van het script S99Oracle. Dit script heeft 1 parameter, ORACLE_SID

# Configuration File: /opt/oracle/admin/bin/startdb

# Algemene omgeving zetten

. $ORACLE_ADMIN/env/profile

ORACLE_SID=$1
echo $ORACLE_SID 

# Omgeving zetten RDBMS
. $ORACLE_ADMIN/env/$ORACLE_SID.env

# Het starten van de database
sqlplus /nolog << EOF
connect / as sysdba
startup
EOF

# Het starten van de listener
lsnrctl start $ORACLE_SID

# Het starten van de intelligent agent voor alle instances
#lsnrctl dbsnmp_start


Stopdb [ORACLE_SID]
-------------------

Dit script is een onderdeel van het script K10Oracle. Dit script heeft 1 parameter, ORACLE_SID

# Configuration File: /opt/oracle/admin/bin/stopdb

# Algemene omgeving zetten
. $ORACLE_ADMIN/env/profile

ORACLE_SID=$1
export $ORACLE_SID

# Settings van het RDBMS
. $ORACLE_ADMIN/env/$ORACLE_SID.env

# Het stoppen van de intelligent agent
#lsnrctl dbsnmp_stop

# Het stoppen van de listener
lsnrctl stop $ORACLE_SID

# Het stoppen van de database.
sqlplus /nolog << EOF
connect / as sysdba
shutdown immediate
EOF


5.5 Batches:
------------

De batches (jobs) worden gestart door het Unix proces cron

# Batches (Oracle)

# Configuration File: /var/spool/cron/crontabs/root
# Format of lines:
# min	hour	daymo	month	daywk	cmd
#
# Dayweek 0=sunday, 1=monday...
0        9        *       *       6  /sbin/sh /opt/oracle/admin/batches/bin/batches.sh  
>> /opt/oracle/admin/batches/log/batcheserroroutput.log 2>&1


# Configuration File: /opt/oracle/admin/batches/bin/batches.sh
# Door de op de commandline  ' BL_TRACE=T ; export BL_TRACE ' worden alle commando's getoond.
case $BL_TRACE in
    T)	set -x ;;
esac

ORACLE_ADMIN=/opt/oracle/admin; export ORACLE_ADMIN
ORACLE_HOME=/opt/oracle/product/8.1.6; export ORACLE_HOME

ORACLE_SID=WCUR ; export ORACLE_SID
su - oracle -c ". $ORACLE_ADMIN/env/profile ; . $ORACLE_ADMIN/env/$ORACLE_SID.env; 
cd $ORACLE_ADMIN/batches/bin; sqlplus /NOLOG @$ORACLE_ADMIN/batches/bin/Analyse_WILLOW2K.sql 1>
$ORACLE_ADMIN/batches/log/batches$ORACLE_SID.`date +"%y%m%d"` 2>&1"

ORACLE_SID=WCON ; export ORACLE_SID
su - oracle -c ". $ORACLE_ADMIN/env/profile ; . $ORACLE_ADMIN/env/$ORACLE_SID.env; 
cd $ORACLE_ADMIN/batches/bin; sqlplus /NOLOG @$ORACLE_ADMIN/batches/bin/Analyse_WILLOW2K.sql 1>
$ORACLE_ADMIN/batches/log/batches$ORACLE_SID.`date +"%y%m%d"` 2>&1"


=======================
7. INSTALLING SUNOS:
=======================

Installing Sun Solaris 2.8 

--------------------------------------------------------------------------------

Contents 

Overview 
Using Serial Console Connection 
Starting the Installation 
Answering the Screen Prompts 
Post-Installation Tasks 


--------------------------------------------------------------------------------


Overview 


This article documents installing the 2/02 release of Solaris 8 from CD-ROM. 
For the purpose of this example, I will be installing Solaris 8 on a Sun Blade 150 with the following configuration: 

Sun Blade 150 (UltraSPARC-IIe 650MHz), No Keyboard, OpenBoot 4.6 
1,792 MB RAM Memory 
40 GB IDE Western Digital Hard Drive - (/dev/dsk/c0t0d0) 
Built-in Ethernet - (eri0) 
CDROM - (/dev/dsk/c0t1d0) 
Installing Solaris 8 will require 2 CDs found in the Solaris media kit labeled SOLARIS 8 SOFTWARE 
- 1 of 2 / 2 of 2. Before starting the installation process, ensure that you have noted the following items: 


Determine the host name of the system you are installing 
Determine the language and locales you intend to use on the system 
If you intend to include the system in a network, gather the following information: 
Host IP address 
Subnet mask 
Type of name service (DNS, NIS, or NIS+, for example) 
Domain name 
Host name of server 
Host IP address of the name server 
Using Serial / Console Connection 


For a complete discussion of connecting to a Sun serial console from Linux, see my article "Using Serial Consoles 
- (Sun Sparcs)". 
For this particular installation, I will NOT be using a VGA monitor connected to the built-in 
frame-buffer (video card). The installation will be done using the serial port of the Sun Blade as a console. 
A serial cable (null modem) will be connected from the serial port of a Linux machine to the serial port 
of the Sun Blade. Keep in mind that you will not be able to make use of the serial console of the Sun Blade 
if it was booted with the keyboard/mouse plugged in. In order to make use of the serial console, you will need 
to disconnect the keyboard/mouse and reboot the Sun server. On the Sun Blade 100/150, if the keyboard/mouse 
are plugged in during the boot phase, all console output will be redirected to the VGA console. 

From the Linux machine, you can use a program called minicom. Start it up with the command "minicom". 
Press "Ctrl-A Z" to get to the main menu. Press "o" to configure minicom. Go to "Serial port setup" 
and make sure that you are set to the correct "Serial Device" and that the speed on line E matches the speed 
of the serial console you are connecting to. (In most cases with Sun, this is 9600.) Here are the settings 
I made when using Serial A / COM1 port on the Linux machine: 

+-----------------------------------------------------------------------+
| A -    Serial Device      : /dev/ttyS0                                |
| B - Lockfile Location     : /var/lock                                 |
| C -   Callin Program      :                                           |
| D -  Callout Program      :                                           |
| E -    Bps/Par/Bits       : 9600 8N1                                  |
| F - Hardware Flow Control : Yes                                       |
| G - Software Flow Control : No                                        |
|                                                                       |
|    Change which setting?                                              |
+-----------------------------------------------------------------------+
After making all necessary changes, hit the ESC key to go back to the "configurations" menu. 
Now go to "Modem and dialing". Change the "Init string" to "~^M~". Save the settings (as dflt), 
and then restart Minicom. You should now see a console login prompt. 

[root@bertha1 root]# minicom

Welcome to minicom 1.83.1

OPTIONS: History Buffer, F-key Macros, Search History Buffer, I18n
Compiled on Aug 28 2001, 15:09:33.

Press CTRL-A Z for help on special keys

alex console login: root
Password:
Last login: Tue Nov  4 18:55:41 on console
Nov  7 12:17:24 alex login: ROOT LOGIN /dev/console
Sun Microsystems Inc.   SunOS 5.8       Generic Patch   October 2001
#
# init 0
INIT: New run level: 0
The system is coming down.  Please wait.
System services are now being stopped.
Print services stopped.
Nov  7 12:17:38 alex syslogd: going down on signal 15
The system is down.
syncing file systems... done
Program terminated
ok
Starting the Installation 


The installation process starts at the ok prompt. The previous section of this document provides the steps 
required to not only gain access to the console port of the Sun SPARC server, but also how to get the server 
to an ok prompt. If when logging you, the machine is already booted (you have console login like the following:
 "alex console login:") you will need to bring the machine to its EEPROM (ok prompt) by initiating init 0 
like in the Using Serial / Console Connection section above. 
The first step in installing Solaris 8 it to boot the machine from Disk 1 of the SOLARIS 8 SOFTWARE CDs. 
You will need to get the machine to the ok prompt. You can do this by shutting the system down using init 0. 
Once at the ok prompt, type in boot cdrom. (Or in some cases, you can use reboot cdrom). From here, 
the installation program prompts you for system configuration information that is needed to complete the installation. 

NOTE: If you were performing a network installation, you would type: ok boot net. 

In almost all cases, you will be installing the Solaris 8 software on a new system where it will not be necessary 
to preserve any data already on the hard drive. Using this assumption, I will partition the single 40 GB IDE 
hard drive in the system. 

Answering the Screen Prompts 


Let's start the installation process! Put the SOLARIS 8 SOFTWARE (Disk 1 of 2) in the CDROM tray and boot to it: 
ok boot cdrom
Resetting ...

Sun Blade 150 (UltraSPARC-IIe 650MHz), No Keyboard
Copyright 1998-2002 Sun Microsystems, Inc.  All rights reserved.
OpenBoot 4.6, 1792 MB memory installed, Serial #52928138.
Ethernet address 0:3:ba:27:9e:8a, Host ID: 83279e8a.

Rebooting with command: boot cdrom
Boot device: /pci@1f,0/ide@d/cdrom@1,0:f  File and args:
SunOS Release 5.8 Version Generic_108528-13 64-bit
Copyright 1983-2001 Sun Microsystems, Inc.  All rights reserved.
 
The boot process may take several minutes to complete, but once done, you will start answering a series of prompts. 

The following section will walk you through many of the screen prompts from the installation. 

The first three prompts are from the command line interface (CLI) and are used to specify the language, 
locale and terminal. Use English for both Language and Locale. As for a terminal setting, I commonly telnet 
to a Linux server (that is connected from the serial port of the Linux server to the serial port of the Sun machine). 
From the Linux server, I use "minicom" to connect from the Linux server to the Sun server. 
The best terminal for this type of installation is "DEC VT100": 


  Language                             : English
  Locale                               : English
  What type of terminal are you using? : 3) DEC VT100
NOTE: You should be able to use a terminal type of "DEC VT100" or "X Terminal Emulator (xterms)".  

NOTE: Further installation through the terminal requires responses to the selections through ESC and function keys 
and space bar, which are mentioned on the installation screen.  


Many of the screens to follow will ask you about networking information. When asked if the system will be connected 
to a network, answer Yes. 

NOTE: Many of the screens should be easy to complete except for the "Names Services" section. In almost all cases, 
you will want to use DNS naming services, but if your machine is not currently configured within DNS, this section 
will fail and no information entered about Names Services will be stored and configured. 
If this is the case, you will need to select None under the Names Services section. 
The network configuration will then need to be completed after the installation process by updating certain 
network files on the local hard drive. This will be documented in the "Post Installation Procedures" of this document. 
 

--------------------------------------------------------------------------------


Screen 1 : The Solaris Installation Program 

This is the Solaris Installation Welcome screen. 

Hit ESC - F2 to continue 


Screen 2 : Identify This System 

This screen informs you about how you will need to identify the computer as it applies to network connectivity. 

Hit ESC - F2 to continue 


Screen 3 : Network Connectivity 


Networked
---------
[X] Yes
[ ] No
Hit ESC - F2 to continue 

Screen 4 : DHCP 


Use DHCP
--------
[ ] Yes
[X] No
Hit ESC - F2 to continue 

Screen 5 : Host Name 


Host name: alex
Hit ESC - F2 to continue 

Screen 6 : IP Address 


Host name: 192.168.1.102
Hit ESC - F2 to continue 

Screen 7 : Subnets 


System part of a subnet
-----------------------
[X] Yes
[ ] No
Hit ESC - F2 to continue 

Screen 8 : Netmask 


Netmask: 255.255.255.0
Hit ESC - F2 to continue 

Screen 9 : IPv6 


Enable IPv6
-----------
[ ] Yes
[X] No
Hit ESC - F2 to continue 

Screen 10 : Confirm Information 

This is a confirmation screen. Verify all data is correct. 

Hit ESC - F2 to continue 


Screen 11 : Configure Security Policy 


Configure Kerberos Security
---------------------------
[ ] Yes
[X] No
Hit ESC - F2 to continue 

Screen 12 : Confirm Information 

This is a confirmation screen. Verify all data is correct. 

Hit ESC - F2 to continue 


Screen 13 : Name Service 


Name service
------------
[ ] NIS+
[ ] NIS
[X] DNS
[ ] LDAP
[ ] None
Hit ESC - F2 to continue 

Screen 14 : Domain Name 


Host name: idevelopment.info
Hit ESC - F2 to continue 

Screen 15 : DNS Server Addresses 


Server's IP address: 63.67.120.18
Server's IP address: 63.67.120.23
Server's IP address: 
Hit ESC - F2 to continue 

Screen 16 : DNS Search List 


Search domain:
Search domain:
Search domain: 
Search domain:
Search domain:
Search domain:
Hit ESC - F2 to continue 

Screen 17 : Confirm Information 

This is a confirmation screen. Verify all data is correct. 

Hit ESC - F2 to continue 


Screen 18 : Time Zone 


Regions
-------
[ ] Asia, Western
[ ] Australia / New Zealand
[ ] Canada
[ ] Europe
[ ] Mexico
[ ] South America
[X] United States
[ ] other - offset from GMT
[ ] other - specify time zone file
Hit ESC - F2 to continue 

Screen 19 : Time Zone 


Time zones
----------
[X] Eastern
[ ] Central
[ ] Mountain
[ ] Pacific
[ ] East-Indiana
[ ] Arizona
[ ] Michigan
[ ] Samoa
[ ] Alaska
[ ] Aleutian
[ ] Hawaii
Hit ESC - F2 to continue 

Screen 20 : Date and Time 


Date and time: YYYY-MM-DD HH:MM

  Year   (4 digits) : <enter year>
  Month  (1-12)     : <enter month>
  Day    (1-31)     : <enter day>
  Hour   (0-23)     : <enter hour>
  Minute (0-59)     : <enter minute>
Hit ESC - F2 to continue 

Screen 21 : Confirm Information 

This is a confirmation screen. Verify all data is correct. 

Hit ESC - F2 to continue 


Screen 22 : Solaris Interactive Installation 

This screen recognizes if a previous version of Solaris is installed and whether you would like to upgrade or not. 
Always select the install option (F4_Initial). 

Hit ESC - F4 to continue 


Screen 23 : Solaris Interactive Installation 

There are two ways to install your Solaris software: "Standard" or "Flash". 
Choose the "Standard" method (Esc-2_Standard). 

Hit ESC - F2 to continue 


Screen 24 : Time Zone 


Select the geographic regions for which support should be installed.
--------------------------------------------------------------------
> [ ] Asia
> [ ] Eastern Europe
> [ ] Middle East
> [ ] Central America
> [ ] South America
> [ ] Northern Europe
> [ ] Southern Europe
> [ ] Central Europe
V [/] North America
  [ ]     Canada-English (ISO8859-1)
  [ ]     Canada-French (ISO8859-1)
  [ ]     French
  [ ]     Mexico (ISO8859-1)
  [X]     U.S.A. (en_US.ISO8859-1) [ ] Australasia
> [ ] Western Europe
> [ ] Northern Africa
Hit ESC - F2 to continue 

Screen 25 : Select Software 


Select the Solaris software to install on the system.
-----------------------------------------------------
[ ] Entire Distribution plus OEM support 64-bit  1432.00 MB
[X] Entire Distribution 64-bit ................. 1401.00 MB
[ ] Developer System Support 64-bit ............ 1350.00 MB
[ ] End User System Support 64-bit ............. 932.00 MB
[ ] Core System Support 64-bit ................. 396.00 MB
Hit ESC - F2 to continue 

Screen 26 : Select Disks 

You must select the disks for installing Solaris software. If there are several disks available, 
I always install the Solaris software on the boot disk c0t0d0. 

----------------------------------------------------------
Disk Device (Size)        Available Space
=============================================
[X] c0t0d0   (14592 MB) boot disk    14592 MB  (F4 to edit)

                    Total Selected:  14592 MB
                 Suggested Minimum:    974 MB


--------------------------------------------------------------------------------

I generally select ESC - F4 to edit the c0t0d0 disk to ensure that the root directory is going 
to be located on this disk. 

----------------------------------------------------------
On this screen you can select the disk for installing the 
root (/) file system of the Solaris software.

Original Boot Device : c0t0d0

          Disk
      ==============================
      [X] c0t0d0    (F4 to select boot device)


--------------------------------------------------------------------------------

On this screen, I typically select ESC - F4 to select boot device to ensure the root file system will be 
located on slice zero, c0t0d0s0. 

----------------------------------------------------------
On this screen you can select the specific slice for the root (/) file
system. If you choose Any of the Above, the Solaris installation program
will choose a slice for you.

Original Boot Device : c0t0d0s0

          [X]  c0t0d0s0
          [ ]  c0t0d0s1
          [ ]  c0t0d0s2
          [ ]  c0t0d0s3
          [ ]  c0t0d0s4
          [ ]  c0t0d0s5
          [ ]  c0t0d0s6
          [ ]  c0t0d0s7
          [ ]  Any of the Above
Hit ESC - F2 to after selecting Disk Slice 


--------------------------------------------------------------------------------

Hit ESC - F2 to continue with your Boot Disk selection 


--------------------------------------------------------------------------------


Screen 27 : Reconfigure EEPROM? 

Do you want to update the system's hardware (EEPROM) to always boot from c0t0d0? 

Hit ESC - F2 to Reconfigure EEPROM and Continue 


Screen 28 : Preserve Data? 

Do you want to preserve existing data? At least one of the disks you've selected for installing Solaris software 
has file systems or unnamed slices that you may want to save. 

Hit ESC - F2 to continue 


Screen 29 : Automatically Layout File Systems? 

Do you want to use auto-layout to automatically layout file systems? Manually laying out file systems 
requires advanced system administration skills. 

I typically perform an "Auto" File System Layout (F2_Auto Layout). 

Hit ESC - F2 to Perform Auto Layout. 


Screen 30 : Automatically Layout File Systems 

On this screen you must select all the file systems you want auto-layout to create, or accept the 
default file systems shown. 

File Systems for Auto-layout
========================================
[X]  /
[ ]  /opt
[ ]  /usr
[ ]  /usr/openwin
[ ]  /var
[X]  swap
Hit ESC - F2 to continue 

Screen 31 : File System and Disk Layout 

The summary below is your current file system and disk layout, based on the information you've supplied. 

NOTE: If you choose to customize, you should understand file systems, their intended purpose on the disk, 
and how changing them may affect the operation of the system. 

File system/Mount point           Disk/Slice             Size
=============================================================
/                                 c0t0d0s0            1338 MB
swap                              c0t0d0s1             296 MB
overlap                           c0t0d0s2           38162 MB
/export/home                      c0t0d0s7           36526 MB


--------------------------------------------------------------------------------

I generally select ESC - F4 (F4_Customize) to edit the partitions for disk c0t0d0. If this is a workstation, 
I make only three partitions: 


/ : I often get the sizes for the individual filesystems (/usr, /opt, and /var) incorrect. This is one reason 
I typically create only one partition as / that will be used for the entire system (minus swap space). 
In most cases, I will be installing addition disks for large applications like the Oracle RDBMS, 
Oracle Application Server, or other J2EE application servers. 
overlap : The overlap partition represents entire disk and is slice s2 of the disk. 
swap : The swap partition size depends on the size of RAM in the system. If you are not sure of its size, 
make it double the amount of RAM in your system. I typically like to make swap 1GB. 
------------------------------------------------
Boot Device: c0t0d0s0
=================================================
  Slice  Mount Point                 Size (MB)
     0   /                               37136
     1   swap                             1025
     2   overlap                         38162
     3                                       0
     4                                       0
     5                                       0
     6                                       0
     7                                       0
=================================================
                         Capacity:       38162 MB
                        Allocated:       38161 MB
                   Rounding Error:           1 MB
                             Free:           0 MB
Hit ESC - F2 to continue 


--------------------------------------------------------------------------------

This is what the File System and Disk Layout screen looks like now. 

File system/Mount point           Disk/Slice             Size
=============================================================
/                                 c0t0d0s0           37136 MB
swap                              c0t0d0s1            1025 MB
overlap                           c0t0d0s2           38162 MB
Hit ESC - F2 to continue 

Screen 32 : Mount Remote File Systems? 

Do you want to mount software from a remote file server? This may be necessary if you had to remove software 
because of disk space problems. 

Hit ESC - F2 to continue 


Screen 33 : Confirm Information 

This is a confirmation screen. Verify all data is correct. 

Hit ESC - F2 to continue 


Screen 34 : Reboot After Installation? 

After Solaris software is installed, the system must be rebooted. You can choose to have the system 
automatically reboot, or you can choose to manually reboot the system if you want to run scripts or do other 
customizations before the reboot. You can manually reboot a system by using the reboot(1M) command. 

[X] Auto Reboot
[ ] Manual Reboot
Hit ESC - F2 to Begin the Installation 

Screen 34 : Installation Progress 

Afterwards it starts configuring disk making partitions and installing software indicating the progress. 

Preparing system for Solaris install

Configuring disk (c0t0d0)
        - Creating Solaris disk label (VTOC)

Creating and checking UFS file systems
        - Creating / (c0t0d0s0)

==================================================================

MBytes Installed: 392.08

MBytes Remaining: 428.09

      Installing: JavaVM run time environment

***************
|    |     |     |     |     |  

0   20    40    60    80    100 
After the installation is complete it customizes system files, devices, and logs. 
The system then reboots or asks you to reboot depending upon the choice selected earlier in the Reboot 
After Installation? screen. 


Screen 36 : Create a root Password 

On this screen you can create a root password. 

A root password can contain any number of characters, but only the first eight characters in the password 
are significant. (For example, if you create `a1b2c3d4e5f6' as your root password, you can use `a1b2c3d4' 
to gain root access.) 

You will be prompted to type the root password twice; for security, the password will not be displayed 
on the screen as you type it. 

> If you do not want a root password, press RETURN twice. 


Root password:
Enter Your root Password and Press Return to continue. 

Screen 37 : Solaris 8 Software 2 of 2 

Please specify the media from which you will install Solaris 8 Software 2 of 2 (2/02 SPARC Platform Edition). 

Alternatively, choose the selection for "Skip" to skip this disc and go on to the next one. 


Media:

1. CD/DVD
2. Network File System
3. Skip

   Media [1]: 1

Screen 38 : Insert the CD/DVD for Solaris 8 Software 2 of 2 

Please insert the CD/DVD for Solaris 8 Software 2 of 2 (2/02 SPARC Platform Edition). 

After you insert the disc, please press Enter. 

Enter S to skip this disc and go on to the next one. To select a different media, enter B to go Back. 

[]

Screen 39 : Solaris 8 packages (part 2) 

After hitting <Enter> in the previous screen, the installation will continue installing the Solaris software (part 2) 

Reading Solaris 8 Software 2 of 2 (2/02 SPARC Platform Edition).... \

Launching installer for Solaris 8 Software 2 of 2 (2/02 SPARC Platform
Edition). Please Wait...

Installing Solaris 8 packages (part 2)
|-1%--------------25%-----------------50%-----------------75%--------------100%|


Installation details:

     Product                      Result     More Info
 1.  Solaris 8 packages (part 2)  Installed  Available

 2.  Done

   Enter the number corresponding to the desired selection for more
   information, or enter 2 to continue [2]:2

   <Press Return to reboot the system> 
Post-Installation Tasks 


After successfully installing the Solaris operating platform software, there may be several tasks that need 
to be performed depending on your configuration. 

Networking: 
If you will be using networking database files for your TCP/IP networking configuration, several files 
will need to be manually created and/or modified. I provided a step-by-step document on how to manually 
configure TCP/IP networking files to manually enable TCP/IP networking using files: 
Configuring TCP/IP on Solaris - TCP/IP Configuration Files - (Quick Config Guide) 


Solaris 8 Patch Cluster: 
It is advisable to install the latest Sun Solaris Patch Cluster to ensure a stable operating environment. 
I provided a step-by-step document on how to download and install the latest Sun Solaris 8 Patch Cluster: 
Patching Sun Solaris 2.8 


=======================
8. RAID Volumes on SUN:
=======================


8.1 SCSI, DISKS AND RAID:
=========================

8.1.1 General
-------------

 SCSI HBA-----------SCSI ID 5----Lun 0 Primary CDROM drive
               |              |--Lun 1 Slave CDROM drive
               |              |-- ....
               |              |--Lun 7 Slave CDROM drive
               |
               |----SCSI ID 6----Lun 0 Primary CDROM
               |              |--...
               |
               |----SCSI ID 0----...

Every SCSI Device can have 8 lun numbers from 0-7


A logical unit number (LUN) is a unique identifier used on a SCSI bus that enables it to differentiate between 
up to eight separate devices (each of which is a logical unit). Each LUN is a unique number that identifies 
a specific logical unit, which may be an disk. 

A SCSI (Small System Computer Interface) is a parallel interface, that can have up to eight devices 
all attached through a single cable; the cable and the host (computer) adapter make up the SCSI bus. 
The bus allows the interchange of information between devices independently of the host. 
In the SCSI program, each device is assigned a unique number, which is either a number between 
0 and 7 for an 8-bit (narrow) bus, or between 8 and 16 for a 16-bit (wide) bus. 
The devices that request input/output (I/O) operations are initiators and the devices that perform 
these operations are targets. Each target has the capacity to connect up to eight additional devices 
through its own controller; these devices are the logical units, each of which is assigned a unique number 
for identification to the SCSI controller for command processing. 

Short for logical unit number, a unique identifier used on a SCSI bus to distinguish between devices 
that share the same bus. SCSI is a parallel interface that allows up to 16 devices to be connected along a single cable. 
The cable and the host adapter form the SCSI bus, and this operates independently of the rest of the computer. 
Each of the eight devices is given a unique address by the SCSI BIOS, ranging from 0 to 7 for an 8-bit bus or 
0 to 15 for a 16-bit bus. Devices that request I/O processes are called initiators. Targets are devices that perform 
operations requested by initiators. Each target can accommodate up to eight other devices, known as logical units, 
and each is assigned an LUN. Commands that are sent to the SCSI controller identify devices based on their LUNs. 

So we might have a situation as:

- Drive C: is standard, Drive D: is SCSI Target 0 LUN 0.
- Drive C: is SCSI Target 0 LUN 0, Drive D:, if installed,is SCSI Target 0 LUN 1 or Target 1 LUN 0.


8.1.2 single-initiator
----------------------


A single-initiator SCSI bus has only one node connected to it, and provides host isolation and better 
performance than a multi-initiator bus. Single-initiator buses ensure that each node is protected 
from disruptions due to the workload, initialization, or repair of the other nodes.

When using a single- or dual-controller RAID array that has multiple host ports and provides 
simultaneous access to all the shared logical units from the host ports on the storage enclosure, 
the setup of the single-initiator SCSI buses to connect each cluster node to the RAID array is possible. 
If a logical unit can fail over from one controller to the other, the process must be transparent 
to the operating system. Note that some RAID controllers restrict a set of disks to a specific 
controller or port. In this case, single-initiator bus setups are not possible.


To set up a single-initiator SCSI bus configuration, perform the following steps:


Enable the onboard termination for each host bus adapter.

Enable the termination for each RAID controller. 

Use the appropriate SCSI cable to connect each host bus adapter to the storage enclosure.

Setting host bus adapter termination is done in the adapter BIOS utility during system boot. 
To set RAID controller termination, refer to the vendor documentation. 


  ---------   SI SCSI bus                   --------------
  |      T|---------------                  |  HBA        |
  |HBA    |               |       ----------|T            |
  |       |               |       |         --------------
  ---------               |       |
                          |       |
                     -------------------
                     |    T       T    |
                     |Storage Enclosure|
                     -------------------

Recommended in Linux an Sun clusters.


8.1.3 Multi Initiator SCSI
--------------------------


Multi Initiator SCSI configurations are configurations with two SCSI host adapter boards connect 
to a single SCSI bus like in the following example: 

  ______________                                              ______________
 |   System 1   |  SCSI   ___________    ___________   SCSI  |   System 2   |
 |(SCSI Adapter)|--------|SCSI Device|--|SCSI Device|--------|(SCSI Adapter)|
 |______________|  Bus   |___________|  |___________|  Bus   |______________|
                                                

  ---------   SI SCSI bus                   --------------
  |      T|-------------------------------- |T            |
  |       |                  |              |             |
  |HBA    |                  |              |HBA          |
  |       |                  |              |             |
  ---------                  |              ---------------           
                     -------------------
                     |       T         |
                     |Storage Enclosure|
                     -------------------


Not recommended for Linux or Solaris clusters.


8.2 Installing an A1000 on Solaris8:
====================================

contributed by Jim Shumpert, edited by Doug Hughes
Here is what you need to do to install An A1000 on Solaris8. The order is very particular. 
Much of this is by way of example. The particulars of your site will differ. Substitute the latest version of 
Raid Manager if there is a newer one available. Also, the exact firmware versions will change over time, 
so, do not take this too literally. 

-Install Solaris8 
-Install required OS patches
 (If you have an Ultra60, install 106455-09 or better - firmware patch - before proceeding) 
- Install Raid Manager 6.22 (RM 6.22) or better. 
# pkgadd -d . SUNWosar SUNWosafw SUNWosamn SUNWosau
  See also section 6.2

(contributed by Greg Whalin) Check /etc/osa/mnf and make sure that your controller name does NOT contain any periods. 
Change them to a _ instead. The RM software does not have any clue how to deal with a period. 
This kept me screwed up for quite a while. 

/etc/osa >more mnf
rebv-pegasu_001~1T94516518~ 0 1 2~~~0~3~~c1t0d0~~

Install patches 109571-02 (for Solaris8 FCS) and 108553-07 (or newer)
(for Solaris7/2.6 patch 108834-07 or newer) [ NOTE: 112125-01 and 112126-01 or better for RM 6.22.1] 
# patchadd 109571-02
# patchadd 108553-02 

Boot -r 
# touch /reconfigure
# reboot -- -r 

Upgrade the firmware on the A1000 

/usr/lib/osa/bin/raidutil -c c1t0d0 -i

LUNs found on c1t0d0.
  LUN 0    RAID 0    0 MB

Vendor ID         Symbios 
ProductID         StorEDGE A1000  
Product Revision  0205
Boot Level        02.05.01.00
Boot Level Date   12/02/97
Firmware Level    02.05.02.11
Firmware Date     04/09/98
raidutil succeeded!


Find lowest number firmware upgrade that is still greater than the firmware that is installed on your A1000. 
For the above example, with patch 108553, upgrade to 2.05.06.32 (do this first, VERY IMPORTANT!) 
# cd /usr/lib/osa/fw
# /usr/lib/osa/bin/fwutil 02050632.bwd c1t0d0
# /usr/lib/osa/bin/fwutil 02050632.apd c1t0d0 

Upgrade to the each next higher firmware in succession until you get to the most recent version. 
It is recommend that you do the upgrades in order. For this example, Upgrade to 3.01.02.33/5 
# /usr/lib/osa/bin/fwutil 03010233.bwd c1t0d0
# /usr/lib/osa/bin/fwutil 03010235.apd c1t0d0 

Upgrade to 03.01.03.60 (or better) 
# /usr/lib/osa/bin/fwutil 03010304.bwd c1t0d0
# /usr/lib/osa/bin/fwutil 03010360.apd c1t0d0 

Check that the array has the correct versions: 

# /usr/lib/osa/bin/raidutil -c c1t0d0 -i

LUNs found on c1t0d0.
  LUN 0    RAID 0    0 MB

Vendor ID         Symbios 
ProductID         StorEDGE A1000  
Product Revision  0301
Boot Level        03.01.03.00
Boot Level Date   10/22/99
Firmware Level    03.01.03.54
Firmware Date     03/30/00
raidutil succeeded!

Check to make sure that the RAID is attached and looks good 

# /usr/lib/osa/bin/drivutil -i c1t0d0

Drive Information for satl-adb1_a_001


Location  Capacity   Status         Vendor  Product          Firmware     Serial
	    (MB)                              ID             Version      Number
[1,0]     17274      Optimal        SEAGATE ST318404LSUN18G  4207         3BT0RHKA00    
[2,0]     17274      Optimal        SEAGATE ST318404LSUN18G  4207         3BT0QZM600    
[1,1]     17274      Optimal        SEAGATE ST318404LSUN18G  4207         3BT0QLRG00    
[2,1]     17274      Optimal        SEAGATE ST318404LSUN18G  4207         3BT0RHM400    
[1,2]     17274      Optimal        SEAGATE ST318404LSUN18G  4207         3BT0R9FZ00    
[2,2]     17274      Optimal        SEAGATE ST318404LSUN18G  4207         3BT0R9SZ00    
[1,3]     17274      Optimal        SEAGATE ST318404LSUN18G  4207         3BT0R9FY00    
[2,3]     17274      Optimal        SEAGATE ST318404LSUN18G  4207         3BT0QKVR00    
[1,4]     17274      Optimal        SEAGATE ST318404LSUN18G  4207         3BT0R79X00    
[2,4]     17274      Optimal        SEAGATE ST318404LSUN18G  4207         3BT0QX8500    
[1,5]     17274      Optimal        SEAGATE ST318404LSUN18G  4207         3BT0R9JS00    
[2,5]     17274      Optimal        SEAGATE ST318404LSUN18G  4207         3BT0RCY600    

drivutil succeeded!

Example: Create 1 large 10-disk RAID 5 configuration (LUN 0) of max size and then create 2 Hot Spare disks 

# /usr/lib/osa/bin/raidutil -c c1t0d0 -D 0

LUNs found on c1t0d0.
  LUN 0    RAID 0    0 MB
Deleting LUN 0.
Press Control C to abort.

 LUNs successfully deleted

raidutil succeeded!

# /usr/lib/osa/bin/raidutil -c c1t0d0 -l 5 -n 0 -s 0 -r fast -g 10,20,11,21,12,22,13,23,14,24

No existing LUNs were found on c1t0d0.
Capacity available in drive group:  317669184 blocks  (155111 MB).
Creating LUN 0

Registering new logical unit 0 with system.
Formatting logical unit 0  RAID 5   155111 MB 
LUNs found on c1t0d0.
  LUN 0    RAID 5    155111 MB

 LUNs successfully created

raidutil succeeded!

# /usr/lib/osa/bin/raidutil -c c1t0d0 -h 15,25

LUNs found on c1t0d0.
  LUN 0    RAID 5    155111 MB

raidutil succeeded!

Format new RAID by making only one slice 2 partition: 

# prtvtoc /dev/rdsk/c1t0d0s2
* /dev/rdsk/c1t0d0s2 partition map
*
* Dimensions:
*     512 bytes/sector
*      75 sectors/track
*      64 tracks/cylinder
*    4800 sectors/cylinder
*   65535 cylinders
*   65533 accessible cylinders
*
* Flags:
*   1: unmountable
*  10: read-only
*
* Unallocated space:
*       First     Sector    Last
*       Sector     Count    Sector 
*           0 314558400 314558399
*
*                          First     Sector    Last
* Partition  Tag  Flags    Sector     Count    Sector  Mount Directory
       2      5    01          0 314558400 314558399


Newfs new RAID 
# newfs /dev/dsk/c1t0d0s2 

Mount the RAID up as /raid 
# mkdir /raid
# echo "/dev/dsk/c1t0d0s2 /dev/rdsk/c1t0d0s2 /raid ufs 3 yes -" >> /etc/vfstab
# mount /raid 

Check to make sure that the new array is available via "df -lk" 

 # df -lk
 Filesystem            kbytes    used   avail capacity  Mounted on
 /dev/md/dsk/d0       2056211   43031 1951494     3%    /
 /dev/md/dsk/d6       4131866 1133180 2957368    28%    /usr
 /proc                      0       0       0     0%    /proc
 fd                         0       0       0     0%    /dev/fd
 mnttab                     0       0       0     0%    /etc/mnttab
 /dev/md/dsk/d5       2056211    9092 1985433     1%    /var
 swap                 1450208       8 1450200     1%    /var/run
 swap                 1450208       8 1450200     1%    /tmp
 /dev/md/dsk/d7       8089425  182023 7826508     3%    /export
 /dev/dsk/c1t0d0s2    154872105       9 153323375     1%    /raid


6.2 Install problem A1000
-------------------------

Hi.

Thanks for your kind responses. There are a few reply but tons of 
out of office mail. And sorry for forgetting to state that A1000
is not brand new one but used one. After some researches I found
this. here's my summary.

Conclusion:
If A1000 has previously defined LUNs and will be used to be array
as new one, you have to be remove old LUNs before define new LUNs
or your rm6 complains that cannot find raid modules.

---
if you can see more than 1 LUNs in boot prom via command "probe-scsi-all"
you have to insert disk into slot as many as LUNs than reboot with boot -rs.
Than you can see configured LUNs via /usr/lib/osa/bin/lad.
and /usr/lib/osa/bin/raidutil -c c#t#d# -X to delete all old LUNs.
Once you delete old LUNs you can boot normaly with just one disk and 
can find raid module.

Again, Thanks for your help.
-- 


6.3 Installing RM 6.22 on Solaris:
----------------------------------

Raid Manager 6.22 and A1000 config

-- Config and setup
-- ----------------

Firstly install the Raid manager 6.22 (6.221) software on the Solaris 8 system. 

	# pkgadd -d . SUNWosar SUNWosafw SUNWosamn SUNWosau

Defending upon your raid manager version and  scsi/fibre card type you will need to patch the system. 
The following patches are recommended for Solaris 8.

Solaris 8 & Raid manager 6.22        108553-07108982-09111085-02 
Solaris 8 & Raid manager 6.221       112125-01108982-09111085-02 
Ultra 60                             106455-09 
Fibre channel card                   109571-02 

It is probably worth giving the system a reconfigure reboot at this stage.


-- Firmware
-- --------

The first thing to do is check the firmware of the A1000. This can be done with the raidutil command. 
( I assume the A1000 is on controller 1. If not then change the controller as appropriate. 

	# raidutil -c c1t0d0 -i

If the returned values are less that those shown below you will have to upgrade the firmware using fwutil.

	Product		Revision  0301
	Boot Level        03.01.03.04
	Boot Level Date   07/06/00
	Firmware Level    03.01.03.60
	Firmware Date     06/30/00

To upgrade the firmware perform the following.

	# cd /usr/lib/osa/fw
	# fwutil 02050632.bwd c1t0d0
	# fwutil 02050632.apd  c1t0d0
	# fwutil 03010233.bwd  c1t0d0
	# fwutil 03010235.apd  c1t0d0
	# fwutil 03010304.bwd  c1t0d0
	# fwutil 03010360.apd  c1t0d0

You can now re-perform the "raidutil -c c1todo -i" command again to verify the firmware changes.


Clean up the array
I am assuming that the array is free for full use by ourselves and intend to remove any old luns that might be lying around. 

	# raidutil -c c1t0d0 -X
The above command resets the array internals.
We can now remove any old lun's.  To do this run "raidutil -c c1t0d0 -i" and note any luns that are configured.

To delete the luns perform the following command.
	# raidutil -c c1t0d0 -i
			LUNs found on c1t0d0.
 			LUN 0    RAID 1    10 MB

			Vendor ID         Symbios
			ProductID         StorEDGE A1000
			Product Revision  0301
			Boot Level        03.01.03.04
			Boot Level Date   07/06/00
			Firmware Level    03.01.03.60
			Firmware Date     06/30/00
			raidutil succeeded!

	# raidutil -c c1t0d0 -D 0
In the above example we are removing lun 0.  repeat this command changing the lun number as appropriate.

We can now give the array a name of our choice. (Do not use a .)
	# storutil -c c1t0d0 -n "dragon_array"


Creating Lun's
The disks are labelled on the front of the A1000 as controller number and disk number seperated by a comma eg. 1,0 1,2 and 2,0 etc, etc. We refer to the disks without using the comma. So the first disk on controller 1 is disk 10 and the 3rd disk on controller 2 is disk 23. we will use disks on both controllers when creating the mirrors. I am starting with the disks on each controller as viewed form the left. The next stage is to create the luns we require. In the below example I will configure a fully populated (12 disks) system which has 18Gb drives into the following sizes. Here we will use the raidutil command again. 

	# raidutil -c controller -n lun_number -l  raid_type  -s  size  -g  disk_list

LUN 0  	Size 8617mb of a stripped/mirror configuration across half of the first two disks.
		# raidutil -c c1t0d0 -n 0 -l 1+0 -s 8617 -g 10,20

LUN 1  	Size 8617mb of a stripped/mirror configuration across the second half of the first two disks.
		# raidutil -c c1t0d0 -n 1 -l 1+0 -s 8617 -g 10,20

LUN 2  	Size 8617mb of a stripped/mirror configuration across half of the next two disks.
		# raidutil -c c1t0d0 -n 2 -l 1+0 -s 8617 -g 11,21

LUN 3  	Size 8617mb of a stripped/mirror configuration across the second half of the next two disks.
		# raidutil -c c1t0d0 -n 3 -l 1+0 -s 8617 -g 11,21

LUN 4  	Size 34468mb of a stripped/mirror configuration across the next four disks.
		# raidutil -c c1t0d0 -n 4 -l 1+0 -s 34468 -g 12,13,22,23

LUN 5  	Size 17234mb of a stripped/mirror configuration across the next two disks.
		# raidutil -c c1t0d0 -n 5 -l 1+0 -s 34468 -g 14,24

LUN 6 	Size 17234mb of a non mirror configuration on the next disk.
		# raidutil -c c1t0d0 -n 6 -l 0 -s 34468 -g 15

This then leaves the disk 25 or disk 5 on the second controller free as a hot spare.
to set up this disk as a hot spare run
                # raidutil -h 25


Finishing off
We are now ready to reboot the system performing a reconfigure. When this is done we can format, partition, newfs 
and mount the disks in the normal way. 

Other commands
The following is a list of possibly useful raid manager commands 

rm6 (GUI interface) 
drivutil (drive / lun management) 
healtchk (helth check on a raid module 
lad (list array devices) 
logutil (log formatting program) 
nvutil (edit / modify NVSRAM) 
parityck (parity checker and repair) 
rdacutil (redundency controller for failed bits and load balancing) 
storutil (host and naming info) 


7.3 Sun StorEdge D1000:
=======================


Overview
The Sun StorEdge D1000 is a disk tray with hot-pluggable 


- Power supplies 
- Fans 
- Disks (If SPARCstorage Volume Manager configured). 

A D1000 is disk array attached to the hostname is configured as a RAID5 metadevice. 


Disk Terminology
Before you can effectively use the information in this section, you should be familiar with basic disk architecture. 
In particular, you should be familiar with the following terms: 

Track 
Cylinder 
Sector 
Disk controller 
Disk label 
Device drivers 
 

Disk Slices
Files stored on a disk are contained in file systems. Each file system on a disk is assigned to a slice-a group of 
cylinders set aside for use by that file system. Each disk slice appears to the operating system 
(and to the system administrator) as though it were a separate disk drive. 
Slices are sometimes referred to as partitions. 

Each disk slice holds only one file system. 

No file system can span multiple slices. 
On SPARC based systems, Solaris defines eight disk slices and assigns to each a conventional use. 
These slices are numbered 0 through 7. 

Slice File System Purpose 
0  root  Holds files and directories that make up the operating system.  
1  swap  Provides virtual memory, or swap space. Swap space is used when running programs are too large to fit 
   in a computer's memory. The Solaris operating environment then "swaps" programs from memory to the disk 
   and back as needed.  
2  Refers to the entire disk, by convention. It is defined automatically by the format and the Solaris 
   installation programs. The size of this slice should not be changed.  
3  /export  Holds alternative versions of the operating system. These alternative versions are required by client systems 
   whose architectures differ from that of the server. Clients with the same architecture type as the server 
   obtain executables from the /usr file system, usually slice 6.  
4  /export/swap  Provides virtual memory space for client systems.  
5  /opt  Holds application software added to a system. If a slice is not allocated for this file system 
   during installation, the /opt directory is put in slice 0.  
6  /usr  Holds operating system commands-also known as executables- designed to be run by users. 
   This slice also holds documentation, system programs (init and /tech/sun/commands/syslogd.html">syslogd, for example) 
   and library routines.  
7  /export/home  Holds files created by users.  

Or.. something like this is also seen on a single disk system:

/        Slice 0, partition  about 2G
swap     Slice 1, partition  about 4G
/export  Slice 3, partition  about 50G, maybe you link it to /u01
/var     Slice 4, partition  about 2G
/opt     Slice 5, partition  about 10G if you plan to install apps here
/usr     Slice 6, partition  about 2G
/u01     Slice 7, partition  optional, standard it's /home
         Depending on how you configure /export, size could be around 20G 

Raw Data Slices
The SunOS operating system stores the disk label in block 0, cylinder 0 of each disk. This means that using third-party 
database applications that create raw data slices must not start at block 0, cylinder 0, or the disk label 
will be overwritten and the data on the disk will be inaccessible. 

Do not use the following areas of the disk for raw data slices, which are sometimes created by third-party d
atabase applications: 

Block 0, cylinder 0  Where the disk label is stored.  
Cylinder 0  Avoid for improved performance. 
Slice 2  Represents the entire disk. 


Slice Arrangements on Multiple Disks 
Although a single disk that is large enough can hold all slices and their corresponding file systems, two or more disks are often used to hold a system's slices and file systems. A slice cannot be split between two or more disks. However, multiple swap slices on separate disks are allowed. 

For instance, a single disk might hold the root (/) file system, a swap area, and the /usr file system, while a separate disk is provided for the /export/home file system and other file systems containing user data. 

In a multiple disk arrangement, the disk containing the operating system software and swap space (that is, the disk holding the root (/) or /usr file systems or the slice for swap space) is called the system disk. Disks other than the system disk are called secondary disks or non-system disks. 

Locating a system's file systems on multiple disks allows you to modify file systems and slices on the secondary disks without having to shut down the system or reload operating system software. 

Having more than one disk also increases input-output (I/O) volume. By distributing disk load across multiple disks, you can avoid I/O bottlenecks. 

 
Determining Which Slices to Use
When you set up a disk's file systems, you choose not only the size of each slice, but also which slices to use. The system configuration requires the use of different slices. The table below lists these requirements. 


Slice Server 
0 root 
1 swap 
2  -  
3  /export  
4  /export/swap  
5 /opt 
6 /usr 
7 /export/home 

 
The format Utility
The format utility can be used to manipulate hard disk drives: 


Display slice information 
Divide a disk into slices 
Add a disk drive 
Reformat a disk drive 
Repair a disk drive 
 

Disk Labels
A special area of every disk is set aside for storing information about the disk's controller, geometry, and slices. That information is called the disk's label. Another term used to described the disk label is the VTOC (Volume Table of Contents). To label a disk means to write slice information onto the disk. You usually label a disk after changing its slices. 

If you fail to label a disk after creating slices, the slices will be unavailable because the operating system has no way of "knowing" about the slices. The partition table identifies a disk's slices, the slice boundaries (in cylinders), and total size of the slices. A disk's partition table can be displayed using the format utility. Partition flags and tags are assigned by convention and require no maintenance. 

The following partition table example is displayed from a 1.05-Gbyte disk using the format utility: 

Total disk cylinders available: 2036 + 2 (reserved cylinders)
Part      Tag    Flag     Cylinders        Size            Blocks
  0       root    wm       0 -  300      148.15MB    (301/0/0)   303408
  1       swap    wu     301 -  524      110.25MB    (224/0/0)   225792
  2     backup    wm       0 - 2035     1002.09MB    (2036/0/0) 2052288
  3 unassigned    wm       0               0         (0/0/0)          0
  4 unassigned    wm       0               0         (0/0/0)          0
  5 unassigned    wm       0               0         (0/0/0)          0
  6        usr    wm     525 - 2035      743.70MB    (1511/0/0) 1523088
  7 unassigned    wm       0               0         (0/0/0)          0

The partition table contains the following information: 


Column Name Description 
Part  Partition or (slice number). Valid numbers are 0-7. 
Tag  A numeric value that usually describes the file system mounted on this partition. 
0=UNASSIGNED 
1=BOOT 
2=ROOT 
3=SWAP 
4=USR 
5=BACKUP 
7=VAR 
8=HOME  
Flags  wm  Partition is writable and mountable. 
wu rm  Partition is writable and unmountable. Default state of partitions dedicated for swap areas. The mount command does not check the "not mountable" flag. 
top>rm  The partition is read only and mountable. 
 
Cylinders  The starting and ending cylinder number for the slice. 
Size  The slice size in Mbytes. 
Blocks  The total number of cylinders and the total number of sectors per slice in the far right column. 

The following example displays a disk label using the prtvtoc command. 


# prtvtoc /dev/rdsk/c0t1d0s0
* /dev/rdsk/c0t1d0s0 partition map
*
* Dimensions:
*     512 bytes/sector
*      72 sectors/track
*      14 tracks/cylinder
*    1008 sectors/cylinder
*    2038 cylinders
*    2036 accessible cylinders
*
* Flags:
*   1: unmountable
*  10: read-only
*
*                          First     Sector    Last
* Partition  Tag  Flags    Sector     Count    Sector  Mount Directory
       0      2    00          0    303408    303407   /
       1      3    01     303408    225792    529199
       2      5    00          0   2052288   2052287
       6      4    00     529200   1523088   2052287   /usr

The disk label includes the following information: 


Dimensions - Physical dimensions of the disk drive. 

Flags - Flags listed in the partition table section. 

Partition (or Slice) Table - Contains the following information: 

Column Name  Description  
Partition Slice number  
Flags Partition flag.  
First Sector  The first sector of the slice. 
Sector Count  The total number of sectors in the slice. 
Last Sector The last sector number in the slice. 
Mount Directory The last mount point directory for the file system. 

 
Dividing a Disk Into Slices
The format utility is most often used by system administrators to divide a disk into slices. The steps are: 

Determining which slices are needed 
Determining the size of each slice 
Using the format utility to divide the disk into slices 
Labeling the disk with new slice information 
Creating the file system for each slice 
The easiest way to divide a disk into slices is to use the modify command from the partition menu. The modify command allows you to create slices by specifying the size of each slice in megabytes without having to keep track of starting cylinder boundaries. It also keeps tracks of any disk space remainder in the "free hog" slice. 

 
Using the Free Hog Slice
When you use the format utility to change the size of one or more disk slices, you designate a temporary slice that will expand and shrink to accommodate the resizing operations. 

This temporary slice donates, or "frees," space when you expand a slice, and receives, or "hogs," the discarded space when you shrink a slice. For this reason, the donor slice is sometimes called the free hog. 

The donor slice exists only during installation or when you run the format utility. There is no permanent donor slice during day-to-day, normal operations. 

 
How to Identify the Disks on a System

Become superuser. 

Run the format utility. 

# format 
The format utility displays a list of disks that it recognizes under AVAILABLE DISK SELECTIONS. 
Here is sample format output: 


# format
Searching for disks...done


AVAILABLE DISK SELECTIONS:
       0. c1t0d0 <SUN36G cyl 24620 alt 2 hd 27 sec 107>
          /pci@8,600000/SUNW,qlc@4/fp@0,0/ssd@w21000004cf785d11,0
       1. c1t1d0 <SUN36G cyl 24620 alt 2 hd 27 sec 107>
          /pci@8,600000/SUNW,qlc@4/fp@0,0/ssd@w21000004cf78670e,0
       2. c2t0d0 <SUN36G cyl 24620 alt 2 hd 27 sec 107>
          /pci@8,600000/scsi@1/sd@0,0
       3. c2t1d0 <SUN36G cyl 24620 alt 2 hd 27 sec 107>
          /pci@8,600000/scsi@1/sd@1,0
       4. c2t8d0 <SUN36G cyl 24620 alt 2 hd 27 sec 107>
          /pci@8,600000/scsi@1/sd@8,0
       5. c2t9d0 <SUN36G cyl 24620 alt 2 hd 27 sec 107>
          /pci@8,600000/scsi@1/sd@9,0


The format output associates a disk's physical and local device name to the disk's marketing name which appears in angle brackets <>. This is an easy way to identify which local device names represent the disks connected to your system. The following example uses a wildcard to display the disks connected to a second controller. 

# format /dev/rdsk/c2*
AVAILABLE DISK SELECTIONS:
  0. /dev/rdsk/c2t0d0s0 
     /io-unit@f,e0200000/sbi@0,0/QLGC,isp@2,10000/sd@0,0
  1. /dev/rdsk/c2t1d0s0 
     /io-unit@f,e0200000/sbi@0,0/QLGC,isp@2,10000/sd@1,0
  2. /dev/rdsk/c2t2d0s0 
     /io-unit@f,e0200000/sbi@0,0/QLGC,isp@2,10000/sd@2,0
  3. /dev/rdsk/c2t3d0s0 
     /io-unit@f,e0200000/sbi@0,0/QLGC,isp@2,10000/sd@3,0
  4. /dev/rdsk/c2t5d0s0 
     /io-unit@f,e0200000/sbi@0,0/QLGC,isp@2,10000/sd@5,0
Specify disk (enter its number): 

The format output identifies that disk 2 (targets 0-5) are connected to the first SCSI host adapter (sbi@...), 
which is connected to the first SBus device (io-unit@). 


--------------------------------------------------------------------------------


Displaying Disk Slices
You can use the format utility to check whether or not a disk has the appropriate disk slices. If you determine 
that a disk does not contain the slices you want to use, use the format utility to re-create them and label the disk. 
The format utility uses the term partition in place of slice. 


Become superuser. 

Enter the format utility. 

Identify the disk for which you want to display slice information by selecting a disk listed 
under AVAILABLE DISK SELECTIONS. 

Specify disk (enter its number):1 

Enter the partition menu by typing partition at the format> prompt. 

format> partition 

Display the slice information for the current disk drive by typing print at the partition> prompt. 

partition> print 

Exit the format utility by typing q at the partition> prompt and typing q at the format> prompt. 

partition> q 
format> q 
# 

Verify displayed slice information by identifying specific slice tags and slices. If the screen output shows that 
no slice sizes are assigned, the disk probably does not have slices. 
 

Examples--Displaying Disk Slice Information
The following example displays slice information for disk /dev/rdsk/c2t0d0s0 


Total disk cylinders available: 24620 + 2 (reserved cylinders)

Part      Tag    Flag     Cylinders         Size            Blocks
  0 unassigned    wm       0                0         (0/0/0)            0
  1 unassigned    wm       0                0         (0/0/0)            0
  2     backup    wu       0 - 24619       33.92GB    (24620/0/0) 71127180
  3 unassigned    wm       0                0         (0/0/0)            0
  4 unassigned    wm       0                0         (0/0/0)            0
  5 unassigned    wm       0                0         (0/0/0)            0
  6 unassigned    wm       0 - 24618       33.91GB    (24619/0/0) 71124291
  7 unassigned    wm       0                0         (0/0/0)            0

The following example displays slice information for disk /dev/rdsk/c2t8d0s0 


Total disk cylinders available: 24620 + 2 (reserved cylinders)

Part      Tag    Flag     Cylinders         Size            Blocks
  0 unassigned    wm       0                0         (0/0/0)            0
  1 unassigned    wm       0                0         (0/0/0)            0
  2     backup    wu       0 - 24619       33.92GB    (24620/0/0) 71127180
  3 unassigned    wm       0                0         (0/0/0)            0
  4 unassigned    wm       0                0         (0/0/0)            0
  5 unassigned    wm       0                0         (0/0/0)            0
  6 unassigned    wm       0 - 24618       33.91GB    (24619/0/0) 71124291
  7 unassigned    wm       0                0         (0/0/0)            0

 
--------------------------------------------------------------------------------

 
Creating and Examining a Disk Label
Labeling a disk is usually done during system installation or when you are creating new disk slices. 
You might need to relabel a disk if the disk label is corrupted (for example, from a power failure). 
The format utility will attempt to automatically configure any unlabeled SCSI disk. If format is able 
to automatically configure an unlabeled disk, it will display a message like the following: 


c1t0d0:configured with capacity of 404.65MB 
 

How to Label a Disk

Become superuser. 

Enter the format utility. 

Enter the number of the disk that you want to label from the list displayed on your screen. 
Specify disk (enter its number):1 

If the disk is unlabeled and was successfully configured, format will ask if you want to label the disk. 
Go to step 5 to label the disk. 
If the disk was labeled and you want to change the type, or format was not able to automatically configure 
the disk you must specify the disk type. Go to steps 6-7 to set the disk type and label the disk. 


Label the disk by typing y at the Label it now? prompt. 

Disk not labeled. Label it now? y 

The disk is now labeled. Go to step 10 to exit the format utility. 

Enter type at the format> prompt. 

format> type 
Format displays the Available Drive Types menu. 

Select a disk type from the list of possible disk types. 

Specify disk type (enter its number)[12]: 12 

Label the disk. If the disk is not labeled, the following message is displayed. 

Disk not labeled. Label it now? y 
Otherwise you are prompted with this message: 

Ready to label disk, continue? y 

Use the verify command from the format main menu to verify the disk label. 

format> verify 

Exit the format utility by typing q at the format> prompt. 

partition> q 
format> q 
# 
 

Example-Labeling a Disk
The following example automatically configures and labels a 1.05-Gbyte disk. 


# format
 c1t0d0: configured with capacity of 1002.09MB
 AVAILABLE DISK SELECTIONS:
   0. c0t3d0 
     /iommu@f,e0000000/sbus@f,e0001000/espdma@f,400000/esp@f,800000/sd@1,0
   1. c1t0d0 
     /iommu@f,e0000000/sbus@f,e0001000/espdma@f,400000/esp@f,800000/sd@1,0
Specify disk (enter its number): 1
Disk not labeled.  Label it now?  yes
format> verify
#

 
How to Examine a Disk Label


Examine disk label information by using the prtvtoc(1M) command. See Chapter 28, Disk Management (Overview) for 
a detailed description of the disk label and the information displayed by the prtvtoc command. 

Become superuser. 

Display the disk label information by using the prtvtoc command. 

# prtvtoc /dev/rdsk/device-name 
 

Automatically Configuring SCSI Disk Drives
In Solaris 2.3 release and compatible versions, the format utility automatically configures SCSI disk drives even if 
that specific type of drive is not listed in the /etc/format.dat file. This feature enables you to format, slice, 
and label any disk driver compliant with SCSI-2 specification for disk device mode sense pages. 
The following steps are involved in configuring a SCSI drive using autoconfiguration: 

Shutting down the system 
Attaching the SCSI disk drive to the system 
Turning on the disk drive 
Performing a reconfiguration boot 
Using the format utility to automatically configure the SCSI disk drive 
After the reconfiguration boot, invoke the format utility. The format utility will attempt to configure the disk and, 
if successful, alert the user that the disk was configured. See How to Automatically Configure a SCSI Drive 
for step-by-step instructions on configuring a SCSI disk drive automatically. 

Here are the default slice rules that format uses to create the partition table. 


Disk Size Root File System Swap Slice 
0 - 180 Mbytes 16 Mbytes 16 Mbytes 
180 Mbytes - 280 Mbytes  16 Mbytes 32 Mbytes 
280 Mbytes - 380 Mbytes 24 Mbytes 32 Mbytes 
380 Mbytes - 600 Mbytes 32 Mbytes 32 Mbytes 
600 Mbytes - 1.0 Gbytes 32 Mbytes 64 Mbytes 
1.0 Gbytes - 2.0 Gbytes 64 Mbytes 128 Mbytes 
More than 2.0 Gbytes 128 Mbytes 128 Mbytes 

In all cases, slice 6 (for the /usr file system) gets the remainder of the space on the disk. 

Here's an example of a format-generated partition table for a 1.3-Gbyte SCSI disk drive. 


Part    Tag    Flag     Cylinders     Size        Blocks
   0     root    wm       0 -   96    64.41MB      (97/0/0)
   1     swap    wu      97 -  289   128.16MB     (193/0/0)
   2   backup    wu       0 - 1964     1.27GB    (1965/0/0)
   6      usr    wm     290 - 1964     1.09GB    (1675/0/0)

 
How to Automatically Configure a SCSI Drive


Become superuser. 

Create the /reconfigure file that will be read when the system is booted. 

# /tech/sun/commands/touch.html">touch /reconfigure 

Shut down the system. 

# /tech/sun/commands/shutdown.html">shutdown -i0 -g30 -y 
The ok or > prompt is displayed after the operating environment is shut down. 


Turn off power to the system and all external peripheral devices. 

Make sure the disk you are adding has a different target number than the other devices on the system. 
You will often find a small switch located at the back of the disk for this purpose. 

Connect the disk to the system and check the physical connections. 

Turn on the power to all external peripherals. 

Turn on the power to the system. The system will boot and display the login prompt. 

Login as superuser, invoke the format utility, and select the disk to be configured automatically. 

# format
Searching for disks...done
c1t0d0: configured with capacity of 1002.09MB
AVAILABLE DISK SELECTIONS:
  0. c0t1d0 
     /iommu@f,e0000000/sbus@f,e0001000/espdma@f,400000/esp@f,800000/sd@1,0
  1. c0t3d0 
     /iommu@f,e0000000/sbus@f,e0001000/espdma@f,400000/esp@f,800000/sd@3,0
Specify disk (enter its number): 1


Reply yes to the prompt to label the disk. Replying y will cause the disk label to be generated and written 
to the disk by the autoconfiguration feature. 

Disk not labeled. Label it now? y 

Verify the disk label with the verify command. 

format> verify 

Exit the format utility. 

format> q 
 

--------------------------------------------------------------------------------

 
SPARC: How to Create Disk Slices and Label a Disk

Become superuser. 

Start the format(1M) utility. 

# format 
A list of available disks is displayed. 

Enter the number of the disk that you want to repartition from the list displayed on your screen. 

Specify disk (enter its number): disk-number 

Go into the partition menu (which lets you set up the slices). 

format> partition 

Display the current partition (slice) table. 

partition> print 

Start the modification process. 

partition> modify 

Set the disk to all free hog. 

Choose base (enter number) [0]? 1 
See Using the Free Hog Slice for more information about the free hog slice. 

Create a new partition table by answering y when prompted to continue. 

Do you wish to continue creating a new partition table based on above table[yes]? y 

Identify the free hog partition (slice) and the sizes of the slices when prompted. When adding a system disk, 
you must set up slices for: root (slice 0) and swap (slice 1) and/or /usr (slice 6) After you identify the slices, 
the new partition table is displayed. 

Make the displayed partition table the current partition table by answering y when asked. Okay to make this 
the current partition table[yes]? y If you don't want the current partition table and you want to change it, 
answer no and go to Step 6 . 

Name the partition table. 

Enter table name (remember quotes): "partition-name" 

Label the disk with the new partition table when you have finished allocating slices on the new disk. 

Ready to label disk, continue? yes 

Quit the partition menu. 

partition> q 

Verify the disk label using the verify command. 

format> verify 

Quit the format menu. 

format> q 
 

SPARC: Example-Creating Disk Slices and Labeling a System Disk
The following example uses the format utility to divide a 1-Gbyte disk into three slices: 
one for the root (/) file system, one for the swap area, and one for the /usr file system. 


# format
Searching for disks...done
AVAILABLE DISK SELECTIONS:
   0. c0t1d0 
      /iommu@f,e0000000/sbus@f,e0001000/espdma@f,400000/esp@f,800000/sd@1,0
   1. c0t3d0 
      /iommu@f,e0000000/sbus@f,e0001000/espdma@f,400000/esp@f,800000/sd@3,0
Specify disk (enter its number): 0
selecting c0t1d0
[disk formatted]
format> partition
partition> print
partition> modify
Select partitioning base:
 0. Current partition table (original)
 1. All Free Hog
Choose base (enter number) [0]? 1
 Part      Tag    Flag     Cylinders        Size            Blocks
  0       root    wm       0               0         (0/0/0)          0
  1       swap    wu       0               0         (0/0/0)          0
  2     backup    wu       0 - 2035     1002.09MB    (2036/0/0) 2052288
  3 unassigned    wm       0               0         (0/0/0)          0
  4 unassigned    wm       0               0         (0/0/0)          0
  5 unassigned    wm       0               0         (0/0/0)          0
  6        usr    wm       0               0         (0/0/0)          0
  7 unassigned    wm       0               0         (0/0/0)          0
Do you wish to continue creating a new partition
table based on above table[yes]? yes
Free Hog partition[6]? 6
Enter size of partition `0' [0b, 0c, 0.00mb]: 200mb
Enter size of partition `1' [0b, 0c, 0.00mb]: 200mb
Enter size of partition `3' [0b, 0c, 0.00mb]:
Enter size of partition `4' [0b, 0c, 0.00mb]:
Enter size of partition `6' [0b, 0c, 0.00mb]:
Enter size of partition `7' [0b, 0c, 0.00mb]:
  Part      Tag    Flag     Cylinders        Size            Blocks
  0       root    wm       0 -  406      200.32MB    (407/0/0)   410256
  1       swap    wu     407 -  813      200.32MB    (407/0/0)   410256
  2     backup    wu       0 - 2035     1002.09MB    (2036/0/0) 2052288
  3 unassigned    wm       0               0         (0/0/0)          0
  4 unassigned    wm       0               0         (0/0/0)          0
  5 unassigned    wm       0               0         (0/0/0)          0
  6        usr    wm     814 - 2035      601.45MB    (1222/0/0) 1231776
  7 unassigned    wm       0               0         (0/0/0)          0
 Okay to make this the current partition table[yes]? yes
Enter table name (remember quotes): "disk0"
Ready to label disk, continue? yes
partition> quit
format> verify
format> quit

 
SPARC: Example-Creating Disk Slices and Labeling a Secondary Disk
The following example uses the format utility to divide a 1-Gbyte disk into one slice for the /export/home file system. 


# format
Searching for disks...done
AVAILABLE DISK SELECTIONS:
   0. c0t1d0 
      /iommu@f,e0000000/sbus@f,e0001000/espdma@f,400000/esp@f,800000/sd@1,0
   1. c0t3d0 
      /iommu@f,e0000000/sbus@f,e0001000/espdma@f,400000/esp@f,800000/sd@3,0
Specify disk (enter its number): 0
selecting c0t1d0
[disk formatted]
format> partition
partition> print
partition> modify
Select partitioning base:
 0. Current partition table (original)
 1. All Free Hog
Choose base (enter number) [0]? 1
 Part      Tag    Flag     Cylinders        Size            Blocks
  0       root    wm       0               0         (0/0/0)          0
  1       swap    wu       0               0         (0/0/0)          0
  2     backup    wu       0 - 2035     1002.09MB    (2036/0/0) 2052288
  3 unassigned    wm       0               0         (0/0/0)          0
  4 unassigned    wm       0               0         (0/0/0)          0
  5 unassigned    wm       0               0         (0/0/0)          0
  6        usr    wm       0               0         (0/0/0)          0
  7 unassigned    wm       0               0         (0/0/0)          0
Do you wish to continue creating a new partition
table based on above table[yes]? y
Free Hog partition[6]? 7
Enter size of partition '0' [0b, 0c, 0.00mb, 0.00gb]: 
Enter size of partition '1' [0b, 0c, 0.00mb, 0.00gb]: 
Enter size of partition '3' [0b, 0c, 0.00mb, 0.00gb]: 
Enter size of partition '4' [0b, 0c, 0.00mb, 0.00gb]: 
Enter size of partition '5' [0b, 0c, 0.00mb, 0.00gb]: 
Enter size of partition '6' [0b, 0c, 0.00mb, 0.00gb]:
 Part      Tag    Flag     Cylinders        Size            Blocks
  0       root    wm       0               0         (0/0/0)          0
  1       swap    wu       0               0         (0/0/0)          0
  2     backup    wu       0 - 2035     1002.09MB    (2036/0/0) 2052288
  3 unassigned    wm       0               0         (0/0/0)          0
  4 unassigned    wm       0               0         (0/0/0)          0
  5 unassigned    wm       0               0         (0/0/0)          0
  6        usr    wm       0               0         (0/0/0)          0
  7 unassigned    wm       0 - 2035     1002.09MB    (2036/0/0) 2052288 
Okay to make this the current partition table[yes]? yes
Enter table name (remember quotes): "home"
Ready to label disk, continue? y
partition> q
format> verify
format> q
# 


SPARC: How to Create File Systems


Become superuser. 

Create a file system for each slice with the newfs(1M) command. 

# newfs /dev/rdsk/cwtxdysz 

Verify the new file system by mounting it on an unused mount point. 

# mount /dev/dsk/cwtxdysz /mnt 
# ls 
lost+found 
 

How to Stop All Processes Accessing a File System

Become superuser. 

List all the processes that are accessing the file system, so you know which processes you are going to stop. 

# /tech/sun/commands/fuser.html">fuser -c [ -u ] mount-point 

Stop all processes accessing the file system. You should not stop a user's processes without warning. 

# /tech/sun/commands/fuser.html">fuser -c -k mount-point 
A SIGKILL is sent to each process using the file system. 

Verify that there are no processes accessing the file system. 

# /tech/sun/commands/fuser.html">fuser -c mount-point 
 

--------------------------------------------------------------------------------

 
Add Disk 
Follow the steps below to add a new external/internal disk: 


Bring the system down to the ok prompt. 

# init 0


Find an available target setting. This command will show what you currently have on your system. 

# probe-scsi

If the disk is on another scsi controller (another card off of an sbus slot) 

# probe-scsi-all


Attach the new disk with the correct target setting. Run probe-scsi again to make sure the system sees it. If it doesn't, the disk is either not connected properly, has a target conflict, or is defective. Resolve this issue before continuing. 
In this example, we'll say: 


T3 original internal drive 
T1 new/other internal drive where a duplicate copy of the OS will be placed. 

Perform a reconfiguration boot. 

# boot -rv
rv -> reconfigure in verbose mode.


Run format and partition the disk. (Here's our example): 


# format
Searching for disks...done

AVAILABLE DISK SELECTIONS:

1. c0t1d0 
/iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@1,0
2. c0t3d0 
/iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0
Specify disk (enter its number): 1
selecting c0t1d0
[disk formatted]

FORMAT MENU:
disk 		- select a disk
type 		- select (define) a disk type
partition 	- select (define) a partition table
current 	- describe the current disk
format 		- format and analyze the disk
repair 		- repair a defective sector
label 		- write label to the disk
analyze 	- surface analysis
defect 		- defect list management
backup 		- search for backup labels
verify 		- read and display labels
save 		- save new disk/partition definitions
inquiry 	- show vendor, product and revision
volname 	- set 8-character volume name
quit
format> part

PARTITION MENU:
0 	- change `0' partition
1 	- change `1' partition
2 	- change `2' partition
3 	- change `3' partition
4 	- change `4' partition
5 	- change `5' partition
6 	- change `6' partition
7 	- change `7' partition
select 	- select a predefined table
modify 	- modify a predefined partition table
name 	- name the current table
print 	- display the current table
label 	- write partition map and label to the disk
quit

partition> print

Current partition table (original):
Total disk cylinders available: 2036 + 2 (reserved cylinders)

Part 	Tag 	Flag 	Cylinders 	Size 			Blocks
0 	root 	wm 	0 - 203 	100.41MB 	(204/0/0) 	205632
1 	swap 	wu 	204 - 407 	100.41MB 	(204/0/0) 	205632
2 	backup 	wm 	0 - 2035 	1002.09MB 	(2036/0/0) 	2052288
3   unassigned 	wm 	0 		0 		(0/0/0) 	0
4 	var 	wm 	408 - 611 	100.41MB 	(204/0/0) 	205632
5   unassigned 	wm 	612 - 1018 	200.32MB 	(407/0/0) 	410256
6 	usr 	wm 	1019 - 2034 	500.06MB 	(1016/0/0) 	1024128
7   unassigned 	wm 	0 		0 		(0/0/0) 	0

partition>

****** Modify partitions to suit your needs ******
****** Do NOT alter partition 2, backup !!! ******


In this example we'll go with the current displayed partition table listed: 

partition> 0
Part 	    Tag 	Flag 	Cylinders 	Size 	     Blocks
0 	unassigned 	wm 	0 - 162 	80.23MB (163/0/0) 164304

Enter partition id tag[unassigned]:
Enter partition permission flags[wm]:
Enter new starting cyl[0]: o
`o' is not an integer.
Enter new starting cyl[0]: 0
Enter partition size[164304b, 163c, 80.23mb, 0.08gb]: 100.41mb
partition> pr
Current partition table (unnamed):
Total disk cylinders available: 2036 + 2 (reserved cylinders)

Part 	   Tag 		Flag Cylinders 		Size 		Blocks
0 	unassigned 	wm 	0 - 203 	100.41MB 	(204/0/0) 	205632
1 	unassigned 	wu 	163 - 423 	128.46MB 	(261/0/0) 	263088
2 	backup 		wu 	0 - 2035 	1002.09MB 	(2036/0/0) 	2052288
3 	unassigned 	wm 	0 		0 		(0/0/0) 	0
4 	unassigned 	wm 	424 - 749 	160.45MB 	(326/0/0) 	328608
5 	unassigned 	wm 	750 - 1109 	177.19MB 	(360/0/0) 	362880
6 	unassigned 	wm 	1110 - 2035 	455.77MB 	(926/0/0) 	933408
7 	unassigned 	wm 	0 		0 		(0/0/0) 	0

partition> 1
Part 	Tag 		Flag Cylinders 		Size 		Blocks
1 	unassigned 	wu 	163 - 423 	128.46MB 	(261/0/0) 	263088

Enter partition id tag[unassigned]:
Enter partition permission flags[wu]:
Enter new starting cyl[163]: 204
Enter partition size[263088b, 261c, 128.46mb, 0.13gb]: 100.41mb
partition> pr
Current partition table (unnamed):
Total disk cylinders available: 2036 + 2 (reserved cylinders)

Part 	Tag 		Flag Cylinders 		Size 			Blocks
0 	unassigned 	wm 	0 - 203 	100.41MB 	(204/0/0) 	205632
1 	unassigned 	wu 	204 - 407 	100.41MB 	(204/0/0) 	205632
2 	backup 		wu 	0 - 2035 	1002.09MB 	(2036/0/0) 	2052288
3 	unassigned 	wm 	0 		0 		(0/0/0) 	0
4 	unassigned 	wm 	424 - 749 	160.45MB 	(326/0/0) 	328608
5 	unassigned 	wm 	750 - 1109 	177.19MB 	(360/0/0) 	362880
6 	unassigned 	wm 	1110 - 2035 	455.77MB 	(926/0/0) 	933408
7 	unassigned 	wm 	0 		0 		(0/0/0) 	0


partition> 4
Part 	Tag 		Flag 	Cylinders 	Size 		Blocks
4 	unassigned 	wm 	424 - 749 	160.45MB 	(326/0/0) 328608

Enter partition id tag[unassigned]:
Enter partition permission flags[wm]:
Enter new starting cyl[424]: 408
Enter partition size[328608b, 326c, 160.45mb, 0.16gb]: 100.41mb
partition> pr
Current partition table (unnamed):
Total disk cylinders available: 2036 + 2 (reserved cylinders)

Part 	Tag 		Flag 	Cylinders 	Size 			Blocks
0 	unassigned 	wm 	0 - 203 	100.41MB 	(204/0/0) 	205632
1 	unassigned 	wu 	204 - 407 	100.41MB 	(204/0/0) 	205632
2 	backup 		wu 	0 - 2035 	1002.09MB 	(2036/0/0) 	2052288
3 	unassigned 	wm 	0 		0 		(0/0/0) 	0
4 	unassigned 	wm 	408 - 611 	100.41MB 	(204/0/0) 	205632
5 	unassigned 	wm 	750 - 1109 	177.19MB 	(360/0/0) 	362880
6 	unassigned 	wm 	1110 - 2035 	455.77MB 	(926/0/0) 	933408
7 	unassigned 	wm 	0 		0 		(0/0/0) 	0

partition> 5
Part 	Tag 		Flag 	Cylinders 	Size 			Blocks
5 	unassigned 	wm 	750 - 1109 	177.19MB 	(360/0/0) 	362880

Enter partition id tag[unassigned]:
Enter partition permission flags[wm]:
Enter new starting cyl[750]: 612
Enter partition size[362880b, 360c, 177.19mb, 0.17gb]: 177mb
partition> pr
Current partition table (unnamed):
Total disk cylinders available: 2036 + 2 (reserved cylinders)

Part 	Tag 		Flag 	Cylinders 	Size 			Blocks
0 	unassigned 	wm 	0 - 203 	100.41MB 	(204/0/0) 	205632
1 	unassigned 	wu 	204 - 407 	100.41MB 	(204/0/0) 	205632
2 	backup 		wu 	0 - 2035 	1002.09MB 	(2036/0/0) 	2052288
3 	unassigned 	wm 	0 		0 		(0/0/0) 	0
4 	unassigned 	wm 	408 - 611 	100.41MB 	(204/0/0) 	205632
5 	unassigned 	wm 	612 - 971 	177.19MB 	(360/0/0) 	362880
6 	unassigned 	wm 	1110 - 2035 	455.77MB 	(926/0/0) 	933408
7 	unassigned 	wm 	0 		0 		(0/0/0) 	0

partition> 6
Part 	Tag 		Flag 	Cylinders 	Size 			Blocks
6 	unassigned 	wm 	1110 - 2035 	455.77MB 	(926/0/0) 	933408

Enter partition id tag[unassigned]:
Enter partition permission flags[wm]:
Enter new starting cyl[1110]: 972
Enter partition size[933408b, 926c, 455.77mb, 0.45gb]: $
partition> pr
Current partition table (unnamed):
Total disk cylinders available: 2036 + 2 (reserved cylinders)

Part 	Tag 		Flag 	Cylinders 	Size 			Blocks
0 	unassigned 	wm 	0 - 203 	100.41MB 	(204/0/0) 	205632

1 unassigned wu 204 - 407 100.41MB (204/0/0) 205632
2 backup wu 0 - 2035 1002.09MB (2036/0/0) 2052288
3 unassigned wm 0 0 (0/0/0) 0
4 unassigned wm 408 - 611 100.41MB (204/0/0) 205632
5 unassigned wm 612 - 971 177.19MB (360/0/0) 362880
6 unassigned wm 972 - 2035 523.69MB (1064/0/0) 1072512
7 unassigned wm 0 0 (0/0/0) 0

partition>

NOTE: You will know for certain that your partitioning is correct if you add all the cylinder values [the values enclosed in ( )], like so, 204+204+204+360+1064=2036 which is the same value for slice 2 or the whole disk (Tag = backup). 

Now label the disk. This is important as this is what saves the partition table in your VTOC (Virtual Table Of Contents). It's also always recommended to do the labeling part twice to be certain that the VTOC gets saved. 


partition> label
partition> q
format> q

After partitioning c0t1d0 to be exactly the same as c0t3d0, be sure you label the disk so that VTOC gtes updated with the correct partition table. 

To recap, our scenario is: 


c0t3d0 (running Solaris 2.6) being copied to c0t1d0 (which will have the copied Solaris 2.6 slices/partitions) 
c0t3d0s0 / -> c0t1d0s0 /
c0t3d0s4 /var -> c0t1d0s4 /var
c0t3d0s5 /opt -> c0t1d0s5 /opt
c0t3d0s6 /usr -> c0t1d0s6 /usr


For each of the partitions that you wish to mount, run newfs to contruct a unix filesystem. 
So, newfs each partition. 


# newfs -v /dev/rdsk/c0t1d0s0
# newfs -v /dev/rdsk/c0t1d0s4
# newfs -v /dev/rdsk/c0t1d0s5
# newfs -v /dev/rdsk/c0t1d0s6


To ensure that they are clean and mounted properly, run fsck on these mounted partitions: 

# fsck /dev/rdsk/c0t1d0s0
# fsck /dev/rdsk/c0t1d0s4
# fsck /dev/rdsk/c0t1d0s5
# fsck /dev/rdsk/c0t1d0s6


Make the mount points. 

# /tech/sun/commands/mkdir.html">mkdir /mount_point

Create mountpoints for each slice/partition, like so: 


# /tech/sun/commands/mkdir.html">mkdir /root2
# /tech/sun/commands/mkdir.html">mkdir /var2
# /tech/sun/commands/mkdir.html">mkdir /opt2
# /tech/sun/commands/mkdir.html">mkdir /usr2


Mount the new partitions. 

# mount /dev/dsk/c0t1d0sX /mount_point

Mount each partition (of the new disk), like so: 

# mount /dev/dsk/c0t1d0s0 /root2
# mount /dev/dsk/c0t1d0s4 /var2
# mount /dev/dsk/c0t1d0s5 /opt2
# mount /dev/dsk/c0t1d0s6 /usr2


Now we /tech/sun/commands/ufsdump.html">ufsdump each slices/partitions: It is often difficult to copy from one disk to another disk. If you try to use dd, and the disks are of differing sizes, then you will undoubtedly run into trouble. Use this method to copy from disk to disk and you should not have any problems. Of course you're still on the old disk (that's where you booted from c0t3d0): 

# cd /

(Just ensures that you are in the root's parent/top directory). 


# /tech/sun/commands/ufsdump.html">ufsdump 0f - /dev/rdsk/c0t3d0s0 | (cd /root2; /tech/sun/commands/ufsrestore.html">ufsrestore rf -)
# /tech/sun/commands/ufsdump.html">ufsdump 0f - /dev/rdsk/c0t3d0s4 | (cd /var2; /tech/sun/commands/ufsrestore.html">ufsrestore rf -)
# /tech/sun/commands/ufsdump.html">ufsdump 0f - /dev/rdsk/c0t3d0s5 | (cd /opt2; /tech/sun/commands/ufsrestore.html">ufsrestore rf -)
# /tech/sun/commands/ufsdump.html">ufsdump 0f - /dev/rdsk/c0t3d0s6 | (cd /usr2; /tech/sun/commands/ufsrestore.html">ufsrestore rf -)

The gotcha here is that you can't really specify the directory name as /tech/sun/commands/ufsdump.html">ufsdump will interpret it as not being a block or character device. To illustrate this error: 

# cd /usr
# /tech/sun/commands/ufsdump.html">ufsdump 0f - /usr | (cd /usr2; /tech/sun/commands/ufsrestore.html">ufsrestore xf - )
DUMP: Writing 32 Kilobyte records
DUMP: Date of this level 0 dump: Wed Dec 10 17:33:42 1997
DUMP: Date of last level 0 dump: the epoch
DUMP: Dumping /dev/rdsk/c0t3d0s0 (tmpdns:/usr) to standard output
DUMP: Mapping (Pass I) [regular files]
DUMP: Mapping (Pass II) [directories]
DUMP: Estimated 317202 blocks (154.88MB)
DUMP: Dumping (Pass III) [directories]
DUMP: Broken pipe
DUMP: The ENTIRE dump is aborted

If you want to use the directory names to simplify your command line, use the tar command instead of /tech/sun/commands/ufsdump.html">ufsdump as follows: 

Example: 


# cd /usr
# tar cvfp - . | (cd /usr2; tar xvfp - )


OPTIONAL (This may be redundant BUT ensures that the copied files are once again clean and consistent). Checking the integrity of a filesystem is always highly recommended even if it becomes redundant in nature. Now, check and run fsck on the new partition/slices: 

# fsck /dev/rdsk/c0t1d0s0
# fsck /dev/rdsk/c0t1d0s4
# fsck /dev/rdsk/c0t1d0s5
# fsck /dev/rdsk/c0t1d0s6


Edit your /mount_point/etc/vfstab file to have this disk bootup from the correct disk/devices c0t1d0 as opposed to c0t3d0. 

# cd /root2
# vi /root2/etc/vfstab

Change c0tXd0sX devices to reflect the new disk! 

#device device mount FS fsck mount mount
#to mount to fsck point type pass at boot options
#
#/dev/dsk/c1d0s2 /dev/rdsk/c1d0s2 /usr ufs 1 yes -
fd - /dev/fd fd - no -
/proc - /proc proc - no -
/dev/dsk/c0t1d0s1 - - swap - no -
/dev/dsk/c0t1d0s0 /dev/rdsk/c0t1d0s0 / ufs 1 no -
/dev/dsk/c0t1d0s6 /dev/rdsk/c0t1d0s6 /usr ufs 1 no -
/dev/dsk/c0t1d0s4 /dev/rdsk/c0t1d0s4 /var ufs 1 no -
/dev/dsk/c0t1d0s5 /dev/rdsk/c0t1d0s5 /opt ufs 2 yes -
swap - /tmp tmpfs - yes -
:wq!


Now you must run /tech/sun/commands/installboot.html">installboot to load a new bootblk on that disk. Not loading a bootblk will leave this disk in an unbootable state as the boot strap program is contained within the bootblk, and this in turn is what loads the boot file called ufsboot after interfacing with the OBP (Open Boot PROM). 
You can do this from your current booted disk or you may choose to boot off from cdrom via ok> boot cdrom -sw (single-user mode, writeable mode off of cdrom's mini-root). 

If you choose to get bootblk from your current disk the location of the bootblk in Solaris 2.5 or higher is under: 


/usr/platform/`uname -i`/lib/fs/ufs/bootblk


# /usr/sbin/installboot /usr/platform/`uname -i`/lib/fs/ufs/bootblk \
/dev/rdsk/c0t1d0s0

If you choose to get bootblk from your cdrom image: 


ok> boot cdrom -sw
# /tech/sun/commands/installboot.html">installboot /cdrom/solaris_2_5_sparc/s0/export/exec/sparc.Solaris_2.5 \
/usr/platform/`uname -i`/lib/fs/ufs/bootblk /dev/rdsk/c0txd0s0

ANOTHER SPARC EXAMPLE: 
To install a ufs bootblock on slice 0 of target 0 on con- troller 1, of the platform where the command is being run, use: 


example# /tech/sun/commands/installboot.html">installboot /usr/platform/`uname -i`/lib/fs/ufs/bootblk \
/dev/rdsk/c1t0d0s0


Now create an alias for the other disk (this may be existent if it's off of the onboard/first scsi controller). 

ok> probe-scsi
    T3 original boot disk
    T1 new disk with copied slices

Verify via devalias command to see current /tech/sun/commands/aliases.html">aliases: disk1 is for sd@1,0 which is scsi id/target 1 


ok> devalias

ok> setenv boot-device disk1
ok> boot -rv

You do not necessarily need to do a reconfiguration boot as devices had already been created. This parameter will only be run if you attached new devices to your system. 

By default this will always boot from the new disk. If you want to boot from the old disk you can manually tell it to boot to that alias, like so: 


ok> boot disk
or
ok> boot disk3

(This will boot off from any Target 3/scsi id 3 internal disk). Also see INFODOC #'s 14046, 11855, 11854 for setting different boot devalias'es. 

NOTE: If the new disk encounters a problem on booting, most likely cause would be inappropriate /tech/sun/commands/devlinks.html">devlinks so, the course of action to take here is the /etc/path_to_inst, /dev, /devices fix: The following is a solution to solve problems with /dev, /devices, and/or /etc/path-to_inst. This routine extracts the defaults (with links intact) from the Solaris 2.x CD-ROM. 


ok> boot cdrom -sw

# mount /dev/dsk/c0t1d0s0 /a ** This step assumes your boot disk is
c0t1d0s0
# cd /tmp/dev
# tar cvfp - . | (cd /a/dev; tar xvfp - )
# cd /tmp/devices
# tar cvfp - . | (cd /a/devices; tar xvfp - )
# cd /tmp/root/etc
# /tech/sun/commands/cp.html">cp path_to_inst /a/etc/path_to_inst
# /tech/sun/commands/reboot.html">reboot -- -rv


If you plan to move this new disk you copied the OS on, you MUST ensure that it will be moved to a similar architecture and machine type as hardware address paths are usually different from one machine to another. 
Each hardware platform has a hardware device tree which must match the device tree information saved during installation in /devices and the /dev directories. 

Another reason is that a kernel from one architecture cannot boot on a machine of a different architecture. Customers often overlook these architecture differences (Sun 4/4c/4m/4d/4u). A boot drive moved from a SPARCstation 2 (sun4c architecture) cannot boot on a SPARCstation 5 (sun4m architecture). 

For more details on why you can't move Solaris 2.X boot disk between machines please see INFODOC 13911 and 13920. 

Also ensure that you have the correct /tech/sun/commands/hostname.html">hostname, IP address and vfstab entries for this new drive if you plan to move it to another machine. 

 
Add a New Disk II
In this example, the Sun StorEdge D1000 tray is connected to a UDWIS host adapter corresponding to controller c2 and a drive was added to slot 4 on the tray. The new drive appears as /dev/dsk/c2t4d0s[0-7] and /dev/rdsk/c2t4d0s[0-7]. 


Add the new device: 

# drvconfig (or devfsadm) 
# disks 

Verify the new disk has been created: 

# ls -l /dev/dsk/c1t4d0s* 

The new disk drive is now available for use as a block or character device. Refer to sd for more info. 


7.4 bare-metal restore procedure.
=================================

SUMMARY: Help troubleshooting bare-metal restore procedure.

Thank you:
   Anand Chouthai
   Roy Erickson

With help from contributors I identified two things wrong
w/ the procedure I was implementing:

 1. I needed to update /dev,/devices, and /etc/path_to_inst
    so that the replacement FC drive w/ a new WWN 
    was correctly recognized as the root disk (this was on a Sun 280R).

 2. After fixing that it helped me uncover the "lockup". It was because
    I didn't add the "-H" option to bprestore which indicates that
    to rename the hardlink targets as well as the normal
    source files (I thought bprestore had an AI module that figured
    all that out ;-).

 After making these two changes I was able to get the system
 back to a sane state.

 Below are the updated scripts. 

 The first one is the revised script I am using to do the actual
 bare metal restore.

 The second is a script I use to build a recovery image. Certain
 packages and tarballs are needed which aren't included but
 the general idea is there.

 --

 The only quirk I have observed is that /var/run doesn't get
 mounted as swap (the system creates a normal directory).

    mount: mount-point /var/run does not exist.

 Not sure about this one but for now I will live w/ it.

 Thanks again Sun Managers!

 Kevin Counts 

 --

Script #1:

#!/bin/sh
#------------------------------------------------------------------------
# $Id: recover-egate2.sh,v 1.7 2004/03/01 19:36:06 countskm Exp $
#------------------------------------------------------------------------
# Custom script to restore egate2 (run from jumpstart recovery image).
#-------------------------------------------------------------------------

#-------------------------------------------------------------------------
# Create pre-defined vtoc for 36GB FC Drive
#-------------------------------------------------------------------------

/usr/sbin/fmthard -s - /dev/rdsk/c1t0d0s2 <<EOF
       0      2    00          0   8389656   8389655
       1      3    01    8389656   8389656  16779311
       2      5    00          0  71127180  71127179
       3      7    00   16779312  16779312  33558623
       4      0    00   33558624  37516554  71075177
       6      0    00   71075178     26001  71101178
       7      0    00   71101179     26001  71127179
EOF

echo "y" | /usr/sbin/newfs /dev/rdsk/c1t0d0s0
echo "y" | /usr/sbin/newfs /dev/rdsk/c1t0d0s3
echo "y" | /usr/sbin/newfs /dev/rdsk/c1t0d0s4

/usr/sbin/fsck /dev/rdsk/c1t0d0s0
/usr/sbin/fsck /dev/rdsk/c1t0d0s3
/usr/sbin/fsck /dev/rdsk/c1t0d0s4

mount /dev/dsk/c1t0d0s0 /a
mkdir -p /a/var
mkdir -p /a/opt
mount /dev/dsk/c1t0d0s3 /a/var
mount /dev/dsk/c1t0d0s4 /a/opt

#------------------------------------------------------------------------
server=veritas
log=/var/tmp/bprestore.log
rename=/var/tmp/bprestore.rename
filelist=/var/tmp/bprestore.filelist

# extra_opt="-e 2/01/2004 -C egate2"
  extra_opt="-C egate2"

cat <<EOF > ${filelist}
/
!/egate
EOF

cat <<EOF > ${rename}
change / to /a
EOF

cat /dev/null > ${log}

cat <<EOF

--------------------------------------------------------------------
 Running bprestore in foreground.                                   

 View logfile: $log in another login session for status.
 
 (A message will appear in this window when the restore is complete)
--------------------------------------------------------------------

EOF

echo \
/usr/openv/netbackup/bin/bprestore -w                \
                                   -H                \
                                   -S ${server}      \
                                   -L ${log}         \
                                   -R ${rename}      \
                                   ${extra_opt}      \
                                   -f ${filelist}

/usr/openv/netbackup/bin/bprestore -w                \
                                   -H                \
                                   -S ${server}      \
                                   -L ${log}         \
                                   -R ${rename}      \
                                   ${extra_opt}      \
                                   -f ${filelist}


#-------------------------------------------------------------------------
# Make excluded /egate mountpoint
#-------------------------------------------------------------------------
mkdir -p /a/egate

#-------------------------------------------------------------------------
# Unconfigure disksuite mirror
#-------------------------------------------------------------------------
mv /a/etc/lvm/mddb.cf /a/etc/lvm/mddb.cf.bak

sed -e 's!md/!!g'         \
    -e 's!d10!c1t0d0s0!g' \
    -e 's!d20!c1t0d0s1!g' \
    -e 's!d30!c1t0d0s3!g' \
    -e 's!d40!c1t0d0s4!g' \
/a/etc/vfstab > /a/etc/vfstab.tmp

cp /a/etc/vfstab     /a/etc/vfstab.bak
cp /a/etc/vfstab.tmp /a/etc/vfstab

sed -e '/^rootdev/ s/^/*/' \
    -e '/^set md/  s/^/*/' \
/a/etc/system > /a/etc/system.tmp

cp /a/etc/system     /a/etc/system.bak
cp /a/etc/system.tmp /a/etc/system

#-------------------------------------------------------------------------
# Rebuild /dev and /devices and /etc/path_to_inst
# Typically we don't backup /dev so check if its even there.
#-------------------------------------------------------------------------
[ -d /a/dev ] && mv /a/dev /a/dev.bak
                 mv /a/devices  /a/devices.bak

mkdir /a/dev
mkdir /a/devices

cd /dev     ;     find . -depth -print  | cpio -pdm /a/dev
cd /devices ;     find . -depth -print  | cpio -pdm /a/devices
cd

mv /a/etc/path_to_inst \
   /a/etc/path_to_inst.bak

cp /tmp/root/etc/path_to_inst \
   /a/etc/path_to_inst


#-------------------------------------------------------------------------
# Make mount points excluded from backup
#-------------------------------------------------------------------------
mkdir          /a/tmp
chmod 1777     /a/tmp
chown root:sys /a/tmp

#-------------------------------------------------------------------------
# Umount the slices and install the ufs boot block 
#-------------------------------------------------------------------------
umount /a/var
umount /a/opt
umount /a

/usr/sbin/installboot /usr/platform/`uname -i`/lib/fs/ufs/bootblk /dev/rdsk/c1t0d0s0

echo "--------------------------------------------------------------------"
echo " Restore complete - type \"reboot -- -r\" to reboot the system."
echo "--------------------------------------------------------------------"

#-------------------------------------------------------------------------
# End.
#-------------------------------------------------------------------------


Script #2:


#!/bin/sh
#-------------------------------------------------------------------------
# Configuring Solaris 8 Boot Image
#-------------------------------------------------------------------------
root=/export/install/SOL8-RECOVER-TEST/Solaris_8/Tools/Boot/
noask=/export/depot/fileset/isconf/plat/sunos/5.8/etc/noask_pkgadd
depot=/export/depot/pkg/sunos/5.8

#-------------------------------------------------------------------------
perl -pi -e '/^root/ && s/NP/<your own hash>/' $root/etc/shadow

exit 0
pkgadd -d ${depot}/SMC/SMCncurs-5.3 -R $root  \
       -n -a ${noask} all

pkgadd -d ${depot}/MCC/MCCssh2-3.2.3 -R $root \
       -n -a ${noask} all

pkgadd -d ${depot}/SMC/SMCbash-2.05 -R $root  \
       -n -a ${noask} all


#-------------------------------------------------------------------------
perl -pi -e ' /^\s*install\)/ and print <<EOF

                recover)
                         cat < /dev/null > /tmp/._recover_startup
                         shift
                         ;;

EOF
' $root/sbin/rcS

#-------------------------------------------------------------------------
perl -pi -e ' m!#/usr/sbin/inetd -s! and print <<EOF

if [ -f /tmp/._recover_startup ] ; then
   /usr/sbin/inetd -s
fi
EOF
' $root/sbin/sysconfig

#-------------------------------------------------------------------------
perl -pi -e ' m!exec /sbin/suninstall! and print <<EOF
if [ -f /tmp/._recover_startup ] ; then
   exec /bin/ksh -o vi
fi

EOF
' $root/sbin/sysconfig

#-------------------------------------------------------------------------
cp -rp tmp_proto/openv $root/.tmp_proto/

ln -s /tmp/openv $root/usr/openv

#-------------------------------------------------------------------------
cat <<EOF >> $root/etc/services
#
# NetBackup services
#
bprd    13720/tcp       bprd
bpcd    13782/tcp       bpcd
vopied  13783/tcp       vopied
bpjava-msvc     13722/tcp       bpjava-msvc
EOF

#-------------------------------------------------------------------------
cat <<EOF >> $root/etc/inetd.conf
#
# netbackup services
#
bpcd    stream  tcp     nowait  root    /usr/openv/netbackup/bin/bpcd bpcd
vopied  stream  tcp     nowait  root    /usr/openv/bin/vopied vopied
bpjava-msvc     stream  tcp     nowait  root    /usr/openv/netbackup/bin/bpjava-msvc bpjava-msvc -transient
EOF
_______________________________________________


Example about FileSystems on a SunOs 5.9 server
===============================================


Devices are described in three ways in the Solaris environment, using three distinct naming
conventions: the physical device name, the instance name, and the logical device name.

- Physical devices:
A "physical device name" represents the full pathname of the device. 
Physical device files are found in the /devices directory and have the following
naming convention:

/devices/sbus@1,f8000000/esp@0,40000/sd@3,0:a

Each device has a unique name representing both the type of device and the location of that device
in the system-addressing structure called the "device tree". The OpenBoot firmware builds the 
device tree for all devices from information gathered at POST. The device tree is loaded in memory
and is used by the kernel during boot to identify all configured devices.
A device pathname is a series of node names separated by slashes. Each device has the following form: 
  
driver-name@unit-address:device-arguments

On our testmachine, we find:

/devices>ls -al
total 70
drwxr-xr-x   7 root     sys          512 Aug 10  2004 .
drwxr-xr-x  25 root     root         512 Aug 17  2004 ..
crw-------   1 root     sys      201,  0 Aug 10  2004 memory-controller@0,0:mc-us3i
drwxr-xr-x   4 root     sys          512 Aug 10  2004 pci@1c,600000
crw-------   1 root     sys      109,767 Aug 10  2004 pci@1c,600000:devctl
drwxr-xr-x   2 root     sys          512 Aug 10  2004 pci@1d,700000
crw-------   1 root     sys      109,1023 Aug 10  2004 pci@1d,700000:devctl
drwxr-xr-x   4 root     sys          512 Aug 10  2004 pci@1e,600000
crw-------   1 root     sys      109,511 Aug 10  2004 pci@1e,600000:devctl
drwxr-xr-x   2 root     sys          512 Aug 10  2004 pci@1f,700000
crw-------   1 root     sys      109,255 Aug 10  2004 pci@1f,700000:devctl
drwxr-xr-x   2 root     sys        29696 Aug 11  2004 pseudo


- Instance name:
The "instance name" represents the kernel's abbreviated name for every possible device
on the system. For example, sd0 and sd1 represents the instance names of two SCSI disk devices.
Instance names are mapped in the /etc/path_to_inst file, an are displayed by using the
commands dmesg, sysdef, and prtconf

/devices>cd /etc
/etc>more path_to_inst
#
#       Caution! This file contains critical kernel state
#
"/options" 0 "options"
"/pci@1f,700000" 0 "pcisch"
"/pci@1f,700000/network@2" 0 "bge"
"/pci@1f,700000/network@2,1" 1 "bge"
"/pci@1e,600000" 1 "pcisch"
"/pci@1e,600000/ide@d" 0 "uata"
"/pci@1e,600000/ide@d/sd@0,0" 30 "sd"
"/pci@1e,600000/isa@7" 0 "ebus"
"/pci@1e,600000/isa@7/power@0,800" 0 "power"
"/pci@1e,600000/isa@7/rmc-comm@0,3e8" 0 "rmc_comm"
"/pci@1e,600000/isa@7/i2c@0,320" 0 "pcf8584"
"/pci@1e,600000/isa@7/i2c@0,320/motherboard-fru-prom@0,a2" 0 "seeprom"
"/pci@1e,600000/isa@7/i2c@0,320/chassis-fru-prom@0,a8" 1 "seeprom"
"/pci@1e,600000/isa@7/i2c@0,320/power-supply-fru-prom@0,b0" 2 "seeprom"
"/pci@1e,600000/isa@7/i2c@0,320/power-supply-fru-prom@0,a4" 3 "seeprom"
"/pci@1e,600000/isa@7/i2c@0,320/dimm-spd@0,b6" 4 "seeprom"
"/pci@1e,600000/isa@7/i2c@0,320/dimm-spd@0,b8" 5 "seeprom"
"/pci@1e,600000/isa@7/i2c@0,320/dimm-spd@0,c6" 6 "seeprom"
"/pci@1e,600000/isa@7/i2c@0,320/dimm-spd@0,c8" 7 "seeprom"
"/pci@1e,600000/isa@7/i2c@0,320/nvram@0,50" 8 "seeprom"
"/pci@1e,600000/isa@7/i2c@0,320/gpio@0,70" 0 "pca9556"
"/pci@1e,600000/isa@7/i2c@0,320/gpio@0,44" 1 "pca9556"
"/pci@1e,600000/isa@7/i2c@0,320/gpio@0,46" 2 "pca9556"
"/pci@1e,600000/isa@7/i2c@0,320/gpio@0,4a" 3 "pca9556"
"/pci@1e,600000/isa@7/i2c@0,320/gpio@0,68" 4 "pca9556"
"/pci@1e,600000/isa@7/i2c@0,320/gpio@0,88" 5 "pca9556"
"/pci@1e,600000/isa@7/serial@0,3f8" 0 "su"
"/pci@1e,600000/isa@7/serial@0,2e8" 1 "su"
"/pci@1e,600000/pmu@6" 0 "pmubus"
"/pci@1e,600000/pmu@6/gpio@8a" 0 "pmugpio"
"/pci@1e,600000/pmu@6/i2c@0" 0 "smbus"
"/pci@1e,600000/pmu@6/gpio@80000000" 1 "pmugpio"
"/pci@1e,600000/pmu@6/i2c@0,0" 1 "smbus"
"/pci@1e,600000/usb@a" 0 "ohci"
"/pci@1c,600000" 2 "pcisch"
"/pci@1c,600000/scsi@2" 0 "glm"
"/pci@1c,600000/scsi@2/sd@0,0" 0 "sd"
"/pci@1c,600000/scsi@2/sd@1,0" 1 "sd"
"/pci@1c,600000/scsi@2/sd@2,0" 2 "sd"
"/pci@1c,600000/scsi@2/sd@3,0" 3 "sd"
"/pci@1c,600000/scsi@2/sd@4,0" 4 "sd"
"/pci@1c,600000/scsi@2/sd@5,0" 5 "sd"
"/pci@1c,600000/scsi@2/sd@6,0" 6 "sd"
"/pci@1c,600000/scsi@2/sd@8,0" 7 "sd"
"/pci@1c,600000/scsi@2/sd@9,0" 8 "sd"
"/pci@1c,600000/scsi@2/sd@a,0" 9 "sd"
"/pci@1c,600000/scsi@2/sd@b,0" 10 "sd"
"/pci@1c,600000/scsi@2/sd@c,0" 11 "sd"
"/pci@1c,600000/scsi@2/sd@d,0" 12 "sd"
"/pci@1c,600000/scsi@2/sd@e,0" 13 "sd"
"/pci@1c,600000/scsi@2/sd@f,0" 14 "sd"
"/pci@1c,600000/scsi@2/st@0,0" 0 "st"
"/pci@1c,600000/scsi@2/st@1,0" 1 "st"
"/pci@1c,600000/scsi@2/st@2,0" 2 "st"
"/pci@1c,600000/scsi@2/st@3,0" 3 "st"
"/pci@1c,600000/scsi@2/st@4,0" 4 "st"
"/pci@1c,600000/scsi@2/st@5,0" 5 "st"
"/pci@1c,600000/scsi@2/st@6,0" 6 "st"
"/pci@1c,600000/scsi@2/ses@0,0" 0 "ses"
"/pci@1c,600000/scsi@2/ses@1,0" 1 "ses"
"/pci@1c,600000/scsi@2/ses@2,0" 2 "ses"
"/pci@1c,600000/scsi@2/ses@3,0" 3 "ses"
"/pci@1c,600000/scsi@2/ses@4,0" 4 "ses"
"/pci@1c,600000/scsi@2/ses@5,0" 5 "ses"
"/pci@1c,600000/scsi@2/ses@6,0" 6 "ses"
"/pci@1c,600000/scsi@2/ses@7,0" 7 "ses"
"/pci@1c,600000/scsi@2/ses@8,0" 8 "ses"
"/pci@1c,600000/scsi@2/ses@9,0" 9 "ses"
"/pci@1c,600000/scsi@2/ses@a,0" 10 "ses"
"/pci@1c,600000/scsi@2/ses@b,0" 11 "ses"
"/pci@1c,600000/scsi@2/ses@c,0" 12 "ses"
"/pci@1c,600000/scsi@2/ses@d,0" 13 "ses"
"/pci@1c,600000/scsi@2/ses@e,0" 14 "ses"
"/pci@1c,600000/scsi@2/ses@f,0" 15 "ses"
"/pci@1c,600000/scsi@2,1" 1 "glm"
"/pci@1c,600000/scsi@2,1/sd@0,0" 15 "sd"
"/pci@1c,600000/scsi@2,1/sd@1,0" 16 "sd"
"/pci@1c,600000/scsi@2,1/sd@2,0" 17 "sd"
"/pci@1c,600000/scsi@2,1/sd@3,0" 18 "sd"
"/pci@1c,600000/scsi@2,1/sd@4,0" 19 "sd"
"/pci@1c,600000/scsi@2,1/sd@5,0" 20 "sd"
"/pci@1c,600000/scsi@2,1/sd@6,0" 21 "sd"
"/pci@1c,600000/scsi@2,1/sd@8,0" 22 "sd"
"/pci@1c,600000/scsi@2,1/sd@9,0" 23 "sd"
"/pci@1c,600000/scsi@2,1/sd@a,0" 24 "sd"
"/pci@1c,600000/scsi@2,1/sd@b,0" 25 "sd"
"/pci@1c,600000/scsi@2,1/sd@c,0" 26 "sd"
"/pci@1c,600000/scsi@2,1/sd@d,0" 27 "sd"
"/pci@1c,600000/scsi@2,1/sd@e,0" 28 "sd"
"/pci@1c,600000/scsi@2,1/sd@f,0" 29 "sd"
"/pci@1c,600000/scsi@2,1/st@0,0" 7 "st"
"/pci@1c,600000/scsi@2,1/st@1,0" 8 "st"
"/pci@1c,600000/scsi@2,1/st@2,0" 9 "st"
"/pci@1c,600000/scsi@2,1/st@3,0" 10 "st"
"/pci@1c,600000/scsi@2,1/st@4,0" 11 "st"
"/pci@1c,600000/scsi@2,1/st@5,0" 12 "st"
"/pci@1c,600000/scsi@2,1/st@6,0" 13 "st"
"/pci@1c,600000/scsi@2,1/ses@0,0" 16 "ses"
"/pci@1c,600000/scsi@2,1/ses@1,0" 17 "ses"
"/pci@1c,600000/scsi@2,1/ses@2,0" 18 "ses"
"/pci@1c,600000/scsi@2,1/ses@3,0" 19 "ses"
"/pci@1c,600000/scsi@2,1/ses@4,0" 20 "ses"
"/pci@1c,600000/scsi@2,1/ses@5,0" 21 "ses"
"/pci@1c,600000/scsi@2,1/ses@6,0" 22 "ses"
"/pci@1c,600000/scsi@2,1/ses@7,0" 23 "ses"
"/pci@1c,600000/scsi@2,1/ses@8,0" 24 "ses"
"/pci@1c,600000/scsi@2,1/ses@9,0" 25 "ses"
"/pci@1c,600000/scsi@2,1/ses@a,0" 26 "ses"
"/pci@1c,600000/scsi@2,1/ses@b,0" 27 "ses"
"/pci@1c,600000/scsi@2,1/ses@c,0" 28 "ses"
"/pci@1c,600000/scsi@2,1/ses@d,0" 29 "ses"
"/pci@1c,600000/scsi@2,1/ses@e,0" 30 "ses"
"/pci@1c,600000/scsi@2,1/ses@f,0" 31 "ses"
"/pci@1d,700000" 3 "pcisch"
"/pci@1d,700000/network@2" 2 "bge"
"/pci@1d,700000/network@2,1" 3 "bge"
"/memory-controller@0,0" 0 "mc-us3i"
"/memory-controller@1,0" 1 "mc-us3i"
"/pseudo" 0 "pseudo"
"/scsi_vhci" 0 "scsi_vhci"
/etc>


- Logical Device names.
The "Logical device names" are used with most Solaris file system commands to refer to devices.
Logical device files in the /dev directory are symbolically linked to physical device files
in the /devices directory. Logical device names are used to access disk devices in the
following circumstances:
  - adding a new disk to the system and partitioning the disk
  - moving a disk from one system to another
  - accessing or mounting a file system residing on a local disk
  - backing up a local file system
  - repairing a file system

  Logical devices are organized in subdirs under the /dev directory by their device types
  /dev/dsk    block interface to disk devices
  /dev/rdsk   raw or character interface to disk devices
  /dev/rmt    tape devices
  /dev/term   serial line devices 
  etc..

  Logical device files have a major and minor number that indicate device drivers, 
  hardware addresses, and other characteristics.
  Furthermore, a device filename must follow a specific naming convention.
  A logical device name for a disk drive has the following format:

  /dev/[r]dsk/cxtxdxsx

  where cx refers to the SCSI controller number, tx to the SCSI bus target number,
  dx to the disk number (always 0 except on storage arrays)
  and sx to the slice or partition number.
  
/dev/ls -al
..
lrwxrwxrwx   1 root     root          13 Aug 10  2004 rsd1a -> rdsk/c1t1d0s0
lrwxrwxrwx   1 root     root          13 Aug 10  2004 rsd1b -> rdsk/c1t1d0s1
lrwxrwxrwx   1 root     root          13 Aug 10  2004 rsd1c -> rdsk/c1t1d0s2
lrwxrwxrwx   1 root     root          13 Aug 10  2004 rsd1d -> rdsk/c1t1d0s3
lrwxrwxrwx   1 root     root          13 Aug 10  2004 rsd1e -> rdsk/c1t1d0s4
lrwxrwxrwx   1 root     root          13 Aug 10  2004 rsd1f -> rdsk/c1t1d0s5
lrwxrwxrwx   1 root     root          13 Aug 10  2004 rsd1g -> rdsk/c1t1d0s6
lrwxrwxrwx   1 root     root          13 Aug 10  2004 rsd1h -> rdsk/c1t1d0s7
lrwxrwxrwx   1 root     root          13 Aug 10  2004 rsd3a -> rdsk/c1t0d0s0
lrwxrwxrwx   1 root     root          13 Aug 10  2004 rsd3b -> rdsk/c1t0d0s1
lrwxrwxrwx   1 root     root          13 Aug 10  2004 rsd3c -> rdsk/c1t0d0s2
lrwxrwxrwx   1 root     root          13 Aug 10  2004 rsd3d -> rdsk/c1t0d0s3
lrwxrwxrwx   1 root     root          13 Aug 10  2004 rsd3e -> rdsk/c1t0d0s4
lrwxrwxrwx   1 root     root          13 Aug 10  2004 rsd3f -> rdsk/c1t0d0s5
lrwxrwxrwx   1 root     root          13 Aug 10  2004 rsd3g -> rdsk/c1t0d0s6
lrwxrwxrwx   1 root     root          13 Aug 10  2004 rsd3h -> rdsk/c1t0d0s7
lrwxrwxrwx   1 root     root          27 Aug 10  2004 rsm -> ../devices/pseudo/rsm@0:rsm
lrwxrwxrwx   1 root     root          13 Aug 10  2004 rsr0 -> rdsk/c0t0d0s2
lrwxrwxrwx   1 root     root           7 Aug 10  2004 rst12 -> rmt/0lb
lrwxrwxrwx   1 root     root           7 Aug 10  2004 rst20 -> rmt/0mb
lrwxrwxrwx   1 root     root           7 Aug 10  2004 rst28 -> rmt/0hb
lrwxrwxrwx   1 root     root           7 Aug 10  2004 rst36 -> rmt/0cb
lrwxrwxrwx   1 root     other         27 Aug 10  2004 rts -> ../devices/pseudo/rts@0:rts
drwxr-xr-x   2 root     sys          512 Aug 10  2004 sad
lrwxrwxrwx   1 root     root          12 Aug 10  2004 sd1a -> dsk/c1t1d0s0
lrwxrwxrwx   1 root     root          12 Aug 10  2004 sd1b -> dsk/c1t1d0s1
lrwxrwxrwx   1 root     root          12 Aug 10  2004 sd1c -> dsk/c1t1d0s2
lrwxrwxrwx   1 root     root          12 Aug 10  2004 sd1d -> dsk/c1t1d0s3
lrwxrwxrwx   1 root     root          12 Aug 10  2004 sd1e -> dsk/c1t1d0s4
lrwxrwxrwx   1 root     root          12 Aug 10  2004 sd1f -> dsk/c1t1d0s5
lrwxrwxrwx   1 root     root          12 Aug 10  2004 sd1g -> dsk/c1t1d0s6
lrwxrwxrwx   1 root     root          12 Aug 10  2004 sd1h -> dsk/c1t1d0s7
lrwxrwxrwx   1 root     root          12 Aug 10  2004 sd3a -> dsk/c1t0d0s0
lrwxrwxrwx   1 root     root          12 Aug 10  2004 sd3b -> dsk/c1t0d0s1
lrwxrwxrwx   1 root     root          12 Aug 10  2004 sd3c -> dsk/c1t0d0s2
lrwxrwxrwx   1 root     root          12 Aug 10  2004 sd3d -> dsk/c1t0d0s3
lrwxrwxrwx   1 root     root          12 Aug 10  2004 sd3e -> dsk/c1t0d0s4
lrwxrwxrwx   1 root     root          12 Aug 10  2004 sd3f -> dsk/c1t0d0s5
lrwxrwxrwx   1 root     root          12 Aug 10  2004 sd3g -> dsk/c1t0d0s6
lrwxrwxrwx   1 root     root          12 Aug 10  2004 sd3h -> dsk/c1t0d0s7
..

/dev>cd dsk
/dev/dsk>ls -al
total 58
drwxr-xr-x   2 root     sys          512 Aug 10  2004 .
drwxr-xr-x  14 root     sys         4096 Oct  4 14:15 ..
lrwxrwxrwx   1 root     root          42 Aug 10  2004 c0t0d0s0 -> ../../devices/pci@1e,600000/ide@d/sd@0,0:a
lrwxrwxrwx   1 root     root          42 Aug 10  2004 c0t0d0s1 -> ../../devices/pci@1e,600000/ide@d/sd@0,0:b
lrwxrwxrwx   1 root     root          42 Aug 10  2004 c0t0d0s2 -> ../../devices/pci@1e,600000/ide@d/sd@0,0:c
lrwxrwxrwx   1 root     root          42 Aug 10  2004 c0t0d0s3 -> ../../devices/pci@1e,600000/ide@d/sd@0,0:d
lrwxrwxrwx   1 root     root          42 Aug 10  2004 c0t0d0s4 -> ../../devices/pci@1e,600000/ide@d/sd@0,0:e
lrwxrwxrwx   1 root     root          42 Aug 10  2004 c0t0d0s5 -> ../../devices/pci@1e,600000/ide@d/sd@0,0:f
lrwxrwxrwx   1 root     root          42 Aug 10  2004 c0t0d0s6 -> ../../devices/pci@1e,600000/ide@d/sd@0,0:g
lrwxrwxrwx   1 root     root          42 Aug 10  2004 c0t0d0s7 -> ../../devices/pci@1e,600000/ide@d/sd@0,0:h
lrwxrwxrwx   1 root     root          43 Aug 10  2004 c1t0d0s0 -> ../../devices/pci@1c,600000/scsi@2/sd@0,0:a
lrwxrwxrwx   1 root     root          43 Aug 10  2004 c1t0d0s1 -> ../../devices/pci@1c,600000/scsi@2/sd@0,0:b
lrwxrwxrwx   1 root     root          43 Aug 10  2004 c1t0d0s2 -> ../../devices/pci@1c,600000/scsi@2/sd@0,0:c
lrwxrwxrwx   1 root     root          43 Aug 10  2004 c1t0d0s3 -> ../../devices/pci@1c,600000/scsi@2/sd@0,0:d
lrwxrwxrwx   1 root     root          43 Aug 10  2004 c1t0d0s4 -> ../../devices/pci@1c,600000/scsi@2/sd@0,0:e
lrwxrwxrwx   1 root     root          43 Aug 10  2004 c1t0d0s5 -> ../../devices/pci@1c,600000/scsi@2/sd@0,0:f
lrwxrwxrwx   1 root     root          43 Aug 10  2004 c1t0d0s6 -> ../../devices/pci@1c,600000/scsi@2/sd@0,0:g
lrwxrwxrwx   1 root     root          43 Aug 10  2004 c1t0d0s7 -> ../../devices/pci@1c,600000/scsi@2/sd@0,0:h
lrwxrwxrwx   1 root     root          43 Aug 10  2004 c1t1d0s0 -> ../../devices/pci@1c,600000/scsi@2/sd@1,0:a
lrwxrwxrwx   1 root     root          43 Aug 10  2004 c1t1d0s1 -> ../../devices/pci@1c,600000/scsi@2/sd@1,0:b
lrwxrwxrwx   1 root     root          43 Aug 10  2004 c1t1d0s2 -> ../../devices/pci@1c,600000/scsi@2/sd@1,0:c
lrwxrwxrwx   1 root     root          43 Aug 10  2004 c1t1d0s3 -> ../../devices/pci@1c,600000/scsi@2/sd@1,0:d
lrwxrwxrwx   1 root     root          43 Aug 10  2004 c1t1d0s4 -> ../../devices/pci@1c,600000/scsi@2/sd@1,0:e
lrwxrwxrwx   1 root     root          43 Aug 10  2004 c1t1d0s5 -> ../../devices/pci@1c,600000/scsi@2/sd@1,0:f
lrwxrwxrwx   1 root     root          43 Aug 10  2004 c1t1d0s6 -> ../../devices/pci@1c,600000/scsi@2/sd@1,0:g
lrwxrwxrwx   1 root     root          43 Aug 10  2004 c1t1d0s7 -> ../../devices/pci@1c,600000/scsi@2/sd@1,0:h

/dev/dsk>cd ..
/dev>cd rdsk
/dev/rdsk>ls -al
total 58
drwxr-xr-x   2 root     sys          512 Aug 10  2004 .
drwxr-xr-x  14 root     sys         4096 Oct  4 14:15 ..
lrwxrwxrwx   1 root     root          46 Aug 10  2004 c0t0d0s0 -> ../../devices/pci@1e,600000/ide@d/sd@0,0:a,raw
lrwxrwxrwx   1 root     root          46 Aug 10  2004 c0t0d0s1 -> ../../devices/pci@1e,600000/ide@d/sd@0,0:b,raw
lrwxrwxrwx   1 root     root          46 Aug 10  2004 c0t0d0s2 -> ../../devices/pci@1e,600000/ide@d/sd@0,0:c,raw
lrwxrwxrwx   1 root     root          46 Aug 10  2004 c0t0d0s3 -> ../../devices/pci@1e,600000/ide@d/sd@0,0:d,raw
lrwxrwxrwx   1 root     root          46 Aug 10  2004 c0t0d0s4 -> ../../devices/pci@1e,600000/ide@d/sd@0,0:e,raw
lrwxrwxrwx   1 root     root          46 Aug 10  2004 c0t0d0s5 -> ../../devices/pci@1e,600000/ide@d/sd@0,0:f,raw
lrwxrwxrwx   1 root     root          46 Aug 10  2004 c0t0d0s6 -> ../../devices/pci@1e,600000/ide@d/sd@0,0:g,raw
lrwxrwxrwx   1 root     root          46 Aug 10  2004 c0t0d0s7 -> ../../devices/pci@1e,600000/ide@d/sd@0,0:h,raw
lrwxrwxrwx   1 root     root          47 Aug 10  2004 c1t0d0s0 -> ../../devices/pci@1c,600000/scsi@2/sd@0,0:a,raw
lrwxrwxrwx   1 root     root          47 Aug 10  2004 c1t0d0s1 -> ../../devices/pci@1c,600000/scsi@2/sd@0,0:b,raw
lrwxrwxrwx   1 root     root          47 Aug 10  2004 c1t0d0s2 -> ../../devices/pci@1c,600000/scsi@2/sd@0,0:c,raw
lrwxrwxrwx   1 root     root          47 Aug 10  2004 c1t0d0s3 -> ../../devices/pci@1c,600000/scsi@2/sd@0,0:d,raw
lrwxrwxrwx   1 root     root          47 Aug 10  2004 c1t0d0s4 -> ../../devices/pci@1c,600000/scsi@2/sd@0,0:e,raw
lrwxrwxrwx   1 root     root          47 Aug 10  2004 c1t0d0s5 -> ../../devices/pci@1c,600000/scsi@2/sd@0,0:f,raw
lrwxrwxrwx   1 root     root          47 Aug 10  2004 c1t0d0s6 -> ../../devices/pci@1c,600000/scsi@2/sd@0,0:g,raw
lrwxrwxrwx   1 root     root          47 Aug 10  2004 c1t0d0s7 -> ../../devices/pci@1c,600000/scsi@2/sd@0,0:h,raw
lrwxrwxrwx   1 root     root          47 Aug 10  2004 c1t1d0s0 -> ../../devices/pci@1c,600000/scsi@2/sd@1,0:a,raw
lrwxrwxrwx   1 root     root          47 Aug 10  2004 c1t1d0s1 -> ../../devices/pci@1c,600000/scsi@2/sd@1,0:b,raw
lrwxrwxrwx   1 root     root          47 Aug 10  2004 c1t1d0s2 -> ../../devices/pci@1c,600000/scsi@2/sd@1,0:c,raw
lrwxrwxrwx   1 root     root          47 Aug 10  2004 c1t1d0s3 -> ../../devices/pci@1c,600000/scsi@2/sd@1,0:d,raw
lrwxrwxrwx   1 root     root          47 Aug 10  2004 c1t1d0s4 -> ../../devices/pci@1c,600000/scsi@2/sd@1,0:e,raw
lrwxrwxrwx   1 root     root          47 Aug 10  2004 c1t1d0s5 -> ../../devices/pci@1c,600000/scsi@2/sd@1,0:f,raw
lrwxrwxrwx   1 root     root          47 Aug 10  2004 c1t1d0s6 -> ../../devices/pci@1c,600000/scsi@2/sd@1,0:g,raw
lrwxrwxrwx   1 root     root          47 Aug 10  2004 c1t1d0s7 -> ../../devices/pci@1c,600000/scsi@2/sd@1,0:h,raw

# format
Searching for disks...done


AVAILABLE DISK SELECTIONS:
       0. c1t0d0 <SUN72G cyl 14087 alt 2 hd 24 sec 424>
          /pci@1c,600000/scsi@2/sd@0,0
       1. c1t1d0 <SUN72G cyl 14087 alt 2 hd 24 sec 424>
          /pci@1c,600000/scsi@2/sd@1,0
Specify disk (enter its number):


# prtvtoc /dev/rdsk/c1t0d0s2
* /dev/rdsk/c1t0d0s2 partition map
*
* Dimensions:
*     512 bytes/sector
*     424 sectors/track
*      24 tracks/cylinder
*   10176 sectors/cylinder
*   14089 cylinders
*   14087 accessible cylinders
*
* Flags:
*   1: unmountable
*  10: read-only
*
*                          First     Sector    Last
* Partition  Tag  Flags    Sector     Count    Sector  Mount Directory
       0      2    00    9514560   8191680  17706239
       1      3    01          0   8395200   8395199
       2      5    00          0 143349312 143349311
       3      7    00    8466432   1048128   9514559
       4      0    00   51266688  33560448  84827135
       5      0    00   17706240  33560448  51266687
       6      8    00   84827136  58522176 143349311
       7      0    00    8395200     71232   8466431


# prtvtoc /dev/rdsk/c1t1d0s2
* /dev/rdsk/c1t1d0s2 partition map
*
* Dimensions:
*     512 bytes/sector
*     424 sectors/track
*      24 tracks/cylinder
*   10176 sectors/cylinder
*   14089 cylinders
*   14087 accessible cylinders
*
* Flags:
*   1: unmountable
*  10: read-only
*
*                          First     Sector    Last
* Partition  Tag  Flags    Sector     Count    Sector  Mount Directory
       0      2    00    9514560   8191680  17706239
       1      3    01          0   8395200   8395199
       2      5    00          0 143349312 143349311
       3      7    00    8466432   1048128   9514559
       4      0    00   51266688  33560448  84827135
       5      0    00   17706240  33560448  51266687
       6      8    00   84827136  58522176 143349311
       7      0    00    8395200     71232   8466431
#


Intel's x86 processors and their clones are little endian. 
Sun's SPARC,Motorola's 68K, and the PowerPC families are all big endian. 


Contents on AIX /dev:
---------------------

# ls -al

..
..
crw-------   1 root     system       17,  0 Aug 08 12:00 tty0
crw-rw-rw-   1 root     system       22,  0 May 27 17:45 ttyp0
crw-rw-rw-   1 root     system       22,  1 May 27 17:45 ttyp1
crw-rw-rw-   1 root     system       22,  2 May 27 17:45 ttyp2
..
..
brw-rw----   1 root     system       10,  7 May 27 17:46 hd3
brw-rw----   1 root     system       10,  4 Jun 27 15:51 hd4
brw-rw----   1 root     system       10,  1 Aug 08 11:41 hd5
brw-rw----   1 root     system       10,  2 May 27 17:46 hd6
brw-rw----   1 root     system       10,  3 May 27 17:46 hd8
brw-rw----   1 root     system       10,  6 May 27 17:46 hd9var
brw-------   1 root     system       16,  7 May 27 17:44 hdisk0
brw-------   1 root     system       16,  2 May 27 17:44 hdisk1
brw-------   1 root     system       16, 10 May 27 18:23 hdisk10
brw-------   1 root     system       16, 12 May 27 18:23 hdisk11
brw-------   1 root     system       16,  5 May 27 17:44 hdisk2
brw-------   1 root     system       16, 20 May 27 18:23 hdisk20
brw-------   1 root     system       16, 21 May 27 18:23 hdisk21
brw-------   1 root     system       16, 22 May 27 18:23 hdisk22
..
..

The 'c' implies that the device is a character device, as 
opposed to a block device (b prefix) or ordinary file.


=========================
8. Current machines 2005:
=========================


8.1 AIX machines:
=================

1. eServer p5 family:
---------------------

               dimensions             cpu            Ghz           Max mem GB
  p5-510 Express 19" 2U                 1 or 2         1.5           32
  p5-520 Express 19" 4U or deskside     1 or 2         1.5           32
  p5-550 Express 19" 4U or deskside     1,2,4          1.5           64
  p5-570 Express 19" rack               2,4,8          1.5           128

               dimensions             cpu            Ghz           Max mem GB
  p5-510         19" 2U                 1 or 2         1.65          32
  p5-520         19" 4U or deskside     2              1.65          32
* p5-550         19" 4U or deskside     1,2 or 4       1.5 or 1.65   64
  p5-570         19" rack               2,4,8,12,16    1.65 or 1.90  128

               dimensions             cpu            Ghz           Max mem GB
  p5-575         24" frame              8              1.90          256
  p5-590         24" frame              8 to 32        1.65          256
  p5-595         24" frame              16 to 64       1.65 or 1.90  256 / 512


More info on the 550:
---------------------

5 hotplug PCI slots, 
1 embedded Ultra 320 SCSI dual channel controller,
1 10/100/1000 Mbps integrated dual port Ethernet controller,
2 service processor communications port,
2 USB 2 ports,
2 HMC ports,
2 RIO ports = Remote IO ports, e.g. for connecting the system to an external drawer
2 System Power Control Network SPCN ports
1 or 2 hotswap capable 4-diskbays. 4x146 or 8x 146 GB Ultra320 scsi disks 

SPCN interface: System Power Control Network is a microprocessor based operating system that controls all aspects
of the IBM power network, including power on/off, sequencing, fru isolation, battery testing etc..
So SPCN is used for power control

RIO interface: for connecting the system to external drawer.

SPC interface: Service Processor Communications port. Can be used to connect a terminal.


8.2 former pSeries:
===================

- pSeries 655
  (Rack-mount)

Easy-to-manage 4- or 8-way ultra-dense cluster-optimized server for HPC and BI.

Processor 64-bit POWER4+ 
Clock rates (Min/Max) 1.50GHz / 1.70GHz 
System memory (Std/Max) 4GB / 64GB 
Internal storage (Std/Max) 72.8GB / 2.6TB 
Performance (rPerf range)*** 15.22 to 21.87 

- pSeries 670
  (Rack-mount)

The 4- to 16-way p670 is packed with the same Capacity on Demand (CoD) capabilities and innovative 
technology as the flagship p690.

Processor POWER4+ 
Clock rates (Min/Max) 1.50GHz 
System memory (Std/Max) 4GB / 256GB 
Internal storage (Std/Max) 72.8GB / 7.0TB 
Performance (rPerf range)*** 13.66 to 46.79 

- pSeries 690

8- to 32-way enterprise-class AIX 5L/Linux server 

Processor 64-bit POWER4+ 
Clock rates (Min/Max) 1.50GHz / 1.90GHz 
System memory (Std/Max) 8GB / 1TB 
Internal storage (Std/Max) 72.8GB / 18.7TB 
Performance (rPerf range)*** 27.11 to 104.17 
 

8.3 Recent Sun machines:
========================

Sun Fire High-End:
------------------

Sun Fire High-end Servers
Sun's flagship servers powered by the new UltraSPARC IV+ processors work with the Solaris Operating System 
to offer a stellar enterprise consolidation platform for customers: over five times greater performance, 
doubled memory capacity and significantly increased I/O performance over previous generation systems. 
Compatible with multiple platforms and CPU speeds, the Solaris 10 OS allows customers to quickly and easily 
take advantage of the latest server technology and the industry's only true technology systems 
investment protection.

Sun Fire E20K

High-end computing, affordability, and scalability: The Sun Fire E20K server gives you 36 UltraSPARC IV+ processors 
and 72 simultaneous threads, with mainframe class reliability and security. Later, scale it up to the full 
Sun Fire E25K capacities.

Scales up to 36 UltraSPARC IV+ processors with 72 threads, over 5x faster performance over UltraSPARC III 
processor-based servers
Doubled memory capacity with 2GB DIMMs: 576 GB memory
Unique Uniboard technology to provision up to 9 CPU/Memory Uniboards on the fly
Easily upgraded up to 72 processors with 144 threads and over 1TB memory
High-bandwidth PCI-X support for I/O intensive applications
Mix and match UltraSPARC III, IV and IV+ processors in the same system
Dynamic System Domains and Solaris Containers
Predictive Self-Healing/Automatic System Recovery
Solaris Security Toolkit
Capacity on Demand 2.0 to add resources only when needed
Solaris 9 9/05 and Solaris 10 3/05 HW1


Sun Fire E25K 

Scales up to 72 UltraSPARC IV+ processors with 144 threads, over 5x faster performance compared to UltraSPARC III 
processor-based servers
Doubled memory capacity with 2GB DIMMs: over 1TB memory
Unique Uniboard technology to provision up to 18 CPU/Memory Uniboards on the fly
High-bandwidth PCI-X support for I/O intensive applications
Mix and match UltraSPARC III, IV and IV+ processors in the same system
Dynamic System Domains and Solaris Containers
Predictive Self-Healing/Automatic System Recovery
Solaris Security Toolkit
Capacity on Demand 2.0 to add resources only when needed
Solaris 9 9/05 and Solaris 10 3/05 HW1


Sun Fire Mid-Range:
-------------------

Sun Fire Midrange Servers
Designed for compute density backed by enterprise-class features, Sun's midrange servers deliver high performance 
and protect your IT investments over time. Sun has enhanced the industry's leading 64-bit server line with the 
UltraSPARC IV+ processor, offering up to five times the performance of previous UltraSPARC III systems. 
Yet you retain Sun's migration-enabling unbroken binary application compatibility with the Solaris OS. 

 
Sun Fire V490
Scales up to four processors, eight simultaneous compute threads 

Sun Fire V890
Scales up to eight processors, 16 simultaneous compute threads 

Sun Fire E2900
Scales up to 12 processors, 24 simultaneous compute threads 

Sun Fire E4900
Scales up to 12 processors, 24 simultaneous compute threads 

Sun Fire E6900
Scales up to 24 processors, 48 simultaneous compute threads 


Sun Fire Low-End:
-----------------

V125
V215
V245
V445

V210
V240
etc..

Sun Fire x64 Servers:
---------------------

These servers can run many operating system, including Solaris OS, Linux, Windows or VMware.
These are the Sun Blade machines. 


8.4 Recent HP Servers:
======================

 _ HP ProLiant servers 
 _ HP ProLiant DL 
 _ HP ProLiant ML 
 _ HP ProLiant BL blades 
 
 _ HP Integrity servers 
 _ Entry-class 
 _ Mid-range 
 _ Superdome (high-end) 
 _ HP Integrity BL blades 
 
 _ HP Integrity NonStop servers 
 _ HP 9000 servers 
 _ HP AlphaServer systems 
 _ Telco and carrier-grade servers 
 _ HP e3000 servers 
 

HP servers running HP-UX 11i :

 _ HP 9000 servers 
  PA-RISC powered servers

 _ HP Integrity servers 
  Industry standard 
  Itaniumr 2 based servers 

 _ HP Telco servers 
Specially designed for the telecom and service provider industries 


=======================================
9. Most important pSeries LED Codes:
=======================================

MCA LED codes:
--------------

Booting BIST phase: leds 100-195, defining hardware status
Booting POST phase: leds 200-2E7, during finding BLV
LED 200: key in secure position
LED 299: BLV will be loaded

PCI systems an pSeries LED codes:
---------------------------------

reduced ODM from BLV copied into RAMFS: OK=510, NOT OK=LED 548: 
LED 511: bootinfo -b is called to determine the last bootdevice
ipl_varyon of rootvg: OK=517,ELSE 551,552,554,556: 
LED 555,557: mount /dev/hd4 on temporary mountpoint /mnt
LED 518: mount /usr, /var
LED 553: syncvg rootvg, or inittab problem
LED 549
LED 581: tcp/ip is being configured, and there is some problem

Last phases in the boot is where cfgcon is called, to configure the console.
cfgcon LED codes include:
C31: Console not yet configured.
C32: Console is an LFT terminal
C33: Console is a TTY
C34: Console is a file on disk
C99: Could not detect a console device

LED 551: ipl_varyon of rootvg

201           : Damaged boot image
223-229       : Invalid boot list
551,555,557   : Corrupted filesystem, corrupted JFS log
552,554,556   : Superblock corrupted, corrupted customized ODM database
553           : Corrupted /etc/inittab file

Firmware that leads to LED code:
--------------------------------

LED Code 888 right after boot: software problem 102, OR, hardware or software problem 103

rc.boot LED codes:
------------------

rc.boot1

  init          success=F05 error=c06                             
  restbase      copies bootimage ODM -> RAM fs ODM:  success=510  error=548
  cfgmgr -f     configuration all base devices needed to access rootvg
  bootinfo -b

end rc.boot 1   LED=511


PCI / RS6000 LED Codes:
========================


=============
1.

Built-In Self-Test (BIST) Indicators
------------------------------------

100 BIST completed successfully; control was passed to IPL ROS.
101 BIST started following reset.
102 BIST started, following the system unit's power-on reset.
103 BIST could not determine the system model number.
104 Equipment conflict; BIST could not find the CBA.
105 BIST could not read from the OCS EPROM.
106 BIST failed: CBA not found
111 OCS stopped; BIST detected a module error.
112 A checkstop occurred during BIST; checkstop results could not be logged out.
113 Three checkstops have occurred.
120 BIST starting a CRC check on the 8752 EPROM.
121 BIST detected a bad CRC in the first 32K bytes of the OCS EPROM.
122 BIST started a CRC check on the first 32K bytes of the OCS EPROM.
123 BIST detected a bad CRC on the OCS area of NVRAM.
124 BIST started a CRC check on the OCS area of NVRAM.
125 BIST detected a bad CRC on the time-of-day area of NVRAM.
126 BIST started a CRC check on the time-of-day area of NVRAM.
127 BIST detected a bad CRC on the 8752 EPROM.
130 BIST presence test started.
140 Running BIST. (Box Manufacturing Mode Only)
142 Box manufacturing mode operation.
143 Invalid memory configuration.
144 Manufacturing test failure.
151 BIST started AIPGM test code.
152 BIST started DCLST test code.
153 BIST started ACLST test code.
154 BIST started AST test code.
160 Bad EPOW Signal/Power status signal.
161 BIST being conducted on BUMP I/O.
162 BIST being conducted on JTAG.
163 BIST being conducted on Direct I/O.
164 BIST being conducted on CPU.
165 BIST being conducted on DCB and Memory.
166 BIST being conducted on Interrupts.
170 BIST being conducted on Multi-Processors.
180 Logout in progress.
182 BIST COP bus not responding.
185 A checkstop condition occurred during the BIST.
186 System logic-generated checkstop (Model 250 only).
187 Graphics-generated checkstop (Model 250).
195 Checkstop logout complete
199 Generic SCSI backplane
888 BIST did not start.

Power-On Self-Test (POST) Indicators
------------------------------------ 

200 IPL attempted with keylock in the Secure position.
201 IPL ROM test failed or checkstop occurred (irrecoverable).
202 Unexpected machine check interrupt.
203 Unexpected data storage interrupt.
204 Unexpected instruction storage interrupt.
205 Unexpected external interrupt.
206 Unexpected alignment interrupt.
207 Unexpected program interrupt.
208 Unexpected floating point unavailable interrupt.
209 Unexpected SVC interrupt.
20c L2 cache POST error. (The display shows a solid 20c for 5 seconds.)
210 Unexpected SVC interrupt.
211 IPL ROM CRC comparison error (irrecoverable).
212 RAM POST memory configuration error or no memory found (irrecoverable).
213 RAM POST failure (irrecoverable).
214 Power status register failed (irrecoverable).
215 A low voltage condition is present (irrecoverable).
216 IPL ROM code being uncompressed into memory.
217 End of boot list encountered.
218 RAM POST is looking for good memory.
219 RAM POST bit map is being generated.
21c L2 cache is not detected. (The display shows a solid 21c for 2 seconds.)
220 IPL control block is being initialized.
221 NVRAM CRC comparison error during AIX IPL(key mode switch in Normal mode).
Reset NVRAM by reaccomplishing IPL in Service mode. For systems with an
internal, direct-bus-attached (DBA) disk, IPL ROM attempted to perform an IPL from
that disk before halting with this operator panel display value.
222 Attempting a Normal mode IPL from Standard I/O planar-attached devices specified 
in NVRAM IPL Devices List.
223 Attempting a Normal mode IPL from SCSI-attached devices specified in NVRAM IPL 
Devices List.
224 Attempting a Normal mode IPL from 9333 subsystem device specified in NVRAM IPL 
Devices List.
225 Attempting a Normal mode IPL from 7012 DBA disk-attached devices specified in 
NVRAM IPL Devices List.
226 Attempting a Normal mode IPL from Ethernet specified in NVRAM IPL Devices List.
227 Attempting a Normal mode IPL from Token-Ring specified in NVRAM IPL Devices List.
228 Attempting a Normal mode IPL from NVRAM expansion code.
229 Attempting a Normal mode IPL from NVRAM IPL Devices List; cannot IPL from any
of the listed devices, or there are no valid entries in the Devices List.
22c Attempting a normal mode IPL from FDDI specified in NVRAM IPL device list.
230 Attempting a Normal mode IPL from adapter feature ROM specified in IPL ROM
Device List.
231 Attempting a Normal mode IPL from Ethernet specified in IPL ROM Device List.
232 Attempting a Normal mode IPL from Standard I/O planar-attached devices specified
in ROM Default Device List.
233 Attempting a Normal mode IPL from SCSI-attached devices specified in IPL ROM
Default Device List.
234 Attempting a Normal mode IPL from 9333 subsystem device specified in IPL ROM
Device List.
235 Attempting a Normal mode IPL from 7012 DBA disk-attached devices specified in
IPL ROM Default Device List.
236 Attempting a Normal mode IPL from Ethernet specified in IPL ROM Default Device
List.
237 Attempting a Normal mode IPL from Token-Ring specified in IPL ROM Default
Device List.
238 Attempting a Normal mode IPL from Token-Ring specified by the operator.
239 System failed to IPL from the device chosen by the operator.
23c Attempting a normal mode IPL from FDDI specified in IPL ROM device list.
240 Attempting a Service mode IPL from adapter feature ROM.
241 Attempting a normal boot from devices specified in the NVRAM boot list.
242 Attempting a Service mode IPL from Standard I/O planar-attached devices specified
in the NVRAM IPL Devices List.
243 Attempting a Service mode IPL from SCSI-attached devices specified in the
NVRAM IPL Devices List.
244 Attempting a Service mode IPL from 9333 subsystem device specified in the
NVRAM IPL Devices List.
245 Attempting a Service mode IPL from 7012 DBA disk-attached devices specified in
the NVRAM IPL Devices List.
246 Attempting a Service mode IPL from Ethernet specified in the NVRAM IPL Devices
List.
247 Attempting a Service mode IPL from Token-Ring specified in the NVRAM Device
List.
248 Attempting a Service mode IPL from NVRAM expansion code.
249 Attempting a Service mode IPL from the NVRAM IPL Devices List; cannot IPL from
any of the listed devices, or there are no valid entries in the Devices List.
24c Attempting a service mode IPL from FDDI specified in NVRAM IPL device list.
250 Attempting a Service mode IPL from adapter feature ROM specified in the IPL ROM
Device List.
251 Attempting a Service mode IPL from Ethernet specified in the IPL ROM Default
Device List.
252 Attempting a Service mode IPL from Standard I/O planar-attached devices specified
in the ROM Default Device List.
253 Attempting a Service mode IPL from SCSI-attached devices specified in the IPL
ROM Default Device List.
254 Attempting a Service mode IPL from 9333 subsystem device specified in the IPL
ROM Devices List.
255 Attempting a Service mode IPL from 7012 DBA disk-attached devices specified in
IPL ROM Default Device List.
256 Attempting a Service mode IPL from Ethernet specified in the IPL ROM Devices
List.
257 Attempting a Service mode IPL from Token-Ring specified in the IPL ROM Devices
List.
258 Attempting a Service mode IPL from Token-Ring specified by the operator.
259 Attempting a Service mode IPL from FDDI specified by the operator.
25c Attempting a service mode IPL from FDDI specified in IPL ROM device list.
260 Information is being displayed on the display console.
261 No supported local system display adapter was found.
262 Keyboard not detected as being connected to the system's keyboard port.
263 Attempting a Normal mode IPL from adapter feature ROM specified in the NVRAM
Device List.
269 Stalled state - the system is unable to IPL.
270 Low Cost Ethernet Adapter (LCE) POST executing
271 Mouse and Mouse port POST.
272 Tablet Port POST.
276 10/100Mbps MCA Ethernet Adapter POST executing
277 Auto Token-Ring LANstreamer MC 32 Adapter.
278 Video ROM scan POST.
279 FDDI POST.
280 3com Ethernet POST.
281 Keyboard POST executing.
282 Parallel port POST executing.
283 Serial port POST executing.
284 POWER Gt1 graphics adapter POST executing.
285 POWER Gt3 graphics adapter POST executing.
286 Token-Ring adapter POST executing.
287 Ethernet adapter POST executing.
288 Adapter card slots being queried.
289 POWER GT0 Display Adapter POST.
290 IOCC POST error (irrecoverable).
291 Standard I/O POST running.
292 SCSI POST running.
293 7012 DBA disk POST running.
294 IOCC bad TCW memory module in slot location J being tested.
295 Graphics Display adapter POST, color or grayscale.
296 ROM scan POST.
297 System model number does not compare between OCS and ROS (irrecoverable).
298 Attempting a software IPL.
299 IPL ROM passed control to the loaded program code.
301 Flash Utility ROM test failed or checkstop occurred (irrecoverable
302 Flash Utility ROM: User prompt, move the key to the service position in order to
perform an optional Flash Update. LED 3d2 will only appear if the key switch is in
the secure position. This signals the user that a Flash Update may be initiated by
moving the key switch to the service position. If the key is moved to the service
position then LED 3d3 will be displayed, this signals the user to press the Reset
button and select optional Flash Update.
303 Flash Utility ROM: User prompt, press the Reset button in order to perform an
optional Flash Update. LED 3d2 will only appear if the key switch is the secure
position. This signals the user that a Flash Update may be initiated by moving the
key switch to the service position. If the key is moved to the service position LED
3d3 will be displayed, this signals the user to press the Reset button and select
optional Flash Update.
304 Flash Utility ROM IOCC POST error (irrecoverable).
305 Flash Utility ROM standard I/O POST running.
306 Flash Utility ROM is attempting IPL from Flash Update media device.
307 Flash Utility ROM system model number does not compare between OCS and
ROM (irrecoverable).
308 Flash Utility ROM: IOCC TCW memory is being tested.
309 Flash Utility ROM passed control to a Flash Update Boot Image.
311 Flash Utility ROM CRC comparison error (irrecoverable).
312 Flash Utility ROM RAM POST memory configuration error or no memory found
(irrecoverable).
313 Flash Utility ROM RAM POST failure (irrecoverable).
314 Flash Utility ROM Power status register failed (irrecoverable).
315 Flash Utility ROM detected a low voltage condition.
318 Flash Utility ROM RAM POST is looking for good memory.
319 Flash Utility ROM RAM POST bit map is being generated.
322 CRC error on media Flash Image. No Flash Update performed.
323 Current Flash Image is being erased.
324 CRC error on new Flash Image after Update was performed. (Flash Image is cor-rupted.)
325 Flash Update successful and complete.

Configuration Program Indicators
--------------------------------

500 Querying Standard I/O slot.
501 Querying card in Slot 1.
502 Querying card in Slot 2.
503 Querying card in Slot 3.
504 Querying card in Slot 4.
505 Querying card in Slot 5.
506 Querying card in Slot 6.
507 Querying card in Slot 7.
508 Querying card in Slot 8.
510 Starting device configuration.
511 Device configuration completed.
512 Restoring device configuration files from media.
513 Restoring basic operating system installation files from media.
516 Contacting server during network boot.
517 Mounting client remote file system during network IPL.
518 Remote mount of the root and /usr file systems failed during network boot.
520 Bus configuration running.
521 /etc/init invoked cfgmgr with invalid options; /etc/init has been corrupted or incor-rectly 
modified (irrecoverable error).
522 The configuration manager has been invoked with conflicting options (irrecoverable
error).
523 The configuration manager is unable to access the ODM database (irrecoverable
error).
524 The configuration manager is unable to access the config.rules object in the ODM
database (irrecoverable error).
525 The configuration manager is unable to get data from a customized device object in
the ODM database (irrecoverable error).
526 The configuration manager is unable to get data from a customized device driver
object in the ODM database ( irrecoverable error).
527 The configuration manager was invoked with the phase 1 flag; running phase 1 at
this point is not permitted (irrecoverable error).
528 The configuration manager cannot find sequence rule, or no program name was
specified in the ODM database (irrecoverable error).
529 The configuration manager is unable to update ODM data (irrecoverable error).
530 The program savebase returned an error.
531 The configuration manager is unable to access the PdAt object class (irrecoverable
error).
532 There is not enough memory to continue (malloc failure); irrecoverable error.
533 The configuration manager could not find a configure method for a device.
534 The configuration manager is unable to acquire database lock (irrecoverable error).
535 HIPPI diagnostics interface driver being configured.
536 The configuration manager encountered more than one sequence rule specified in
the same phase (irrecoverable error).
537 The configuration manager encountered an error when invoking the program in the
sequence rule.
538 The configuration manager is going to invoke a configuration method.
539 The configuration method has terminated, and control has returned to the configura-tion 
manager.
551 IPL vary-on is running.
552 IPL varyon failed.
553 IPL phase 1 is complete.
554 The boot device could not be opened or read, or unable to define NFS swap device
during network boot.
555 An ODM error occurred when trying to varyon the rootvg, or unable to create an
NFS swap device during network boot.
556 Logical Volume Manager encountered error during IPL vary-on.
557 The root filesystem will not mount.
558 There is not enough memory to continue the system IPL.
559 Less than 2 M bytes of good memory are available to load the AIX kernel.
570 Virtual SCSI devices being configured.
571 HIPPI common function device driver being configured.
572 HIPPI IPI-3 master transport driver being configured.
573 HIPPI IPI-3 slave transport driver being configured.
574 HIPPI IPI-3 transport services user interface device driver being configured.
575 A 9570 disk-array driver is being configured.
576 Generic async device driver being configured.
577 Generic SCSI device driver being configured.
578 Generic commo device driver being configured.
579 Device driver being configured for a generic device.
580 HIPPI TCPIP network interface driver being configured.
581 Configuring TCP/IP.
582 Configuring Token-Ring data link control.
583 Configuring an Ethernet data link control.
584 Configuring an IEEE Ethernet data link control.
585 Configuring an SDLC MPQP data link control.
586 Configuring a QLLC X.25 data link control.
587 Configuring a NETBIOS.
588 Configuring a Bisync Read-Write (BSCRW).
589 SCSI target mode device being configured.
590 Diskless remote paging device being configured.
591 Configuring an LVM device driver.
592 Configuring an HFT device driver.
593 Configuring SNA device drivers.
594 Asynchronous I/O being defined or configured.
595 X.31 pseudo-device being configured.
596 SNA DLC/LAPE pseudo-device being configured.
597 OCS software being configured.
598 OCS hosts being configured during system reboot.
599 Configuring FDDI data link control.
5c0 Streams-based hardware drive being configured.
5c1 Streams-based X.25 protocol being configured.
5c2 Streams-based X.25 COMIO emulator driver being configured.
5c3 Streams-based X.25 TCP/IP interface driver being configured.
5c4 FCS adapter device driver being configured.
5c5 SCB network device driver for FCS is being configured.
5c6 AIX SNA channel being configured.
600 Starting network boot portion of /sbin/rc.boot
602 Configuring network parent devices.
603 /usr/lib/methods/defsys, /usr/lib/methods/cfgsys, or /usr/lib/methods/cfgbus
failed.
604 Configuring physical network boot device.
605 Configuration of physical network boot device failed.
606 Running /usr/sbin/ifconfig on logical network boot device.
607 /usr/sbin/ifconfig failed.
608 Attempting to retrieve the client.info file with tftp.Note that a flashing 608 indicates
multiple attempt(s) to retrieve the client_info file are occurring.
609 The client.info file does not exist or it is zero length.
610 Attempting remote mount of NFS file system.
611 Remote mount of the NFS file system failed.
612 Accessing remote files; unconfiguring network boot device.
614 Configuring local paging devices.
615 Configuration of a local paging device failed.
616 Converting from diskless to dataless configuration.
617 Diskless to dataless configuration failed.
618 Configuring remote (NFS) paging devices.
619 Configuration of a remote (NFS) paging device failed.
620 Updating special device files and ODM in permanent filesystem with data from boot
RAM filesystem.
622 Boot process configuring for operating system installation.
650 IBM SCSD disk drive being configured
668 25MB ATM MCA Adapter being configured
680 POWER GXT800M Graphics Adapter
689 4.5GB Ultra SCSI Single Ended Disk Drive being configured
690 9.1GB Ultra SCSI Single Ended Disk Drive being configured
694 Eicon ISDN DIVA MCA Adapter for PowerPC Systems
700 Progress indicator. A 1.1 GB 8-bit SCSI disk drive being identified or configured.
701 Progress indicator. A 1.1 GB 16-bit SCSI disk drive is being identified or configured.
702 Progress indicator. A 1.1 GB 16-bit differential SCSI disk drive is being identified or
configured.
703 Progress indicator. A 2.2 GB 8-bit SCSI disk drive is being identified or configured.
704 Progress indicator. A 2.2 GB 16-bit SCSI disk drive is being identified or configured.
705 The configuration method for the 2.2 GB 16-bit differential SCSI disk drive is being
run. If an irrecoverable error occurs, the system halts.
706 Progress indicator. A 4.5 GB 16-bit SCSI disk drive is being identified or configured.
707 Progress indicator. A 4.5 GB 16-bit differential SCSI disk drive is being identified or
configured.
708 Progress indicator. A L2 cache is being identified or configured.
710 POWER GXT150M graphics adapter being identified or configured.
711 Unknown adapter being identified or configured.
712 Graphics slot bus configuration is executing.
713 The IBM ARTIC960 device is being configured.
714 A video capture adapter is being configured.
715 The Ultimedia Services audio adapter is being configured. This LED displays briefly
on the panel.
717 TP Ethernet Adapter being configured.
718 GXT500 Graphics Adapter being configured.
720 Unknown read/write optical drive type being configured.
721 Unknown disk or SCSI device being identified or configured.
722 Unknown disk being identified or configured.
723 Unknown CD-ROM being identified or configured.
724 Unknown tape drive being identified or configured.
725 Unknown display adapter being identified or configured.
726 Unknown input device being identified or configured.
727 Unknown async device being identified or configured.
728 Parallel printer being identified or configured.
729 Unknown parallel device being identified or configured.
730 Unknown diskette drive being identified or configured.
731 PTY being identified or configured.
732 Unknown SCSI initiator type being configured.
733 7GB 8mm tape drive being configured.
734 4x SCSI-2 640MB CD-ROM Drive
741 1080MB SCSI Disk Drive
745 16GB 4mm Tape Auto Loader
748 MCA keyboard/mouse adapter being configured.
749 7331 Model 205 Tape Library
754 1.1GB 16-bit SCSI disk drive being configured.
755 2.2GB 16-bit SCSI disk drive being configured.
756 4.5GB 16-bit SCSI disk drive being configured.
757 External 13GB 1.5M/s 1/4 inch tape being configured.
772 4.5GB SCSI F/W Disk Drive
773 9.1GB SCSI F/W Disk Drive
774 9.1GB External SCSI Disk Drive
77c Progress indicator. A 1.0 GB 16-bit SCSI disk drive being identified or configured.
783 4mm DDS-2 Tape Autoloader
789 2.6GB External Optical Drive
794 10/100MB Ethernet PX MC Adapter
797 Turboways 155 UTP/STP ATM Adapter being identified or configured.
798 Video streamer adapter being identified or configured.
800 Turboways 155 MMF ATM Adapter being identified or configured.
803 7336 Tape Library Robotics being configured
804 8x Speed SCSI-2 CD ROM drive being configured
807 SCSI Device Enclosure being configured
808 System Interface Full (SIF) configuration process
80c SSA 4-Port Adapter being identified or configured.
811 Processor complex being identified or configured.
812 Memory being identified or configured.
813 Battery for time-of-day, NVRAM, and so on being identified or configured, or system
I/O control logic being identified or configured.
814 NVRAM being identified or configured.
815 Floating-point processor test
816 Operator panel logic being identified or configured.
817 Time-of-day logic being identified or configured.
819 Graphics input device adapter being identified or configured.
821 Standard keyboard adapter being identified or configured.
823 Standard mouse adapter being identified or configured.
824 Standard tablet adapter being identified or configured.
825 Standard speaker adapter being identified or configured.
826 Serial Port 1 adapter being identified or configured.
827 Parallel port adapter being identified or configured.
828 Standard diskette adapter being identified or configured.
831 3151 adapter being identified or configured, or Serial Port 2 being identified or con-figured.
834 64-port async controller being identified or configured.
835 16-port async concentrator being identified or configured.
836 128-port async controller being identified or configured.
837 16-port remote async node being identified or configured.
838 Network Terminal Accelerator Adapter being identified or configured.
839 7318 Serial Communications Server being configured.
841 8-port async adapter (EIA-232) being identified or configured.
842 8-port async adapter (EIA-422A) being identified or configured.
843 8-port async adapter (MIL-STD 188) being identified or configured.
844 7135 RAIDiant Array disk drive subsystem controller being identified or configured.
845 7135 RAIDiant Array disk drive subsystem drawer being identified or configured.
846 RAIDiant Array SCSI 1.3GB Disk Drive
847 16-port serial adapter (EIA-232) being identified or configured.
848 16-port serial adapter (EIA-422) being identified or configured.
849 X.25 Interface Co-Processor/2 adapter being identified or configured.
850 Token-Ring network adapter being identified or configured.
851 T1/J1 Portmaster adapter being identified or configured.
852 Ethernet adapter being identified or configured.
854 3270 Host Connection Program/6000 connection being identified or configured.
855 Portmaster Adapter/A being identified or configured.
857 FSLA adapter being identified or configured.
858 5085/5086/5088 adapter being identified or configured.
859 FDDI adapter being identified or configured.
85c Progress indicator. Token-Ring High-Performance LAN adapter is being identified or
configured.
861 Optical adapter being identified or configured.
862 Block Multiplexer Channel Adapter being identified or configured.
865 ESCON Channel Adapter or emulator being identified or configured.
866 SCSI adapter being identified or configured.
867 Async expansion adapter being identified or configured.
868 SCSI adapter being identified or configured.
869 SCSI adapter being identified or configured.
870 Serial disk drive adapter being identified or configured.
871 Graphics subsystem adapter being identified or configured.
872 Grayscale graphics adapter being identified or configured.
874 Color graphics adapter being identified or configured.
875 Vendor generic communication adapter being configured.
876 8-bit color graphics processor being identified or configured.
877 POWER Gt3/POWER Gt4 being identified or configured.
878 POWER Gt4 graphics processor card being configured.
879 24-bit color graphics card, MEV2
880 POWER Gt1 adapter being identified or configured.
887 Integrated Ethernet adapter being identified or configured.
889 SCSI adapter being identified or configured.
890 SCSI-2 Differential Fast/Wide and Single-Ended Fast/Wide Adapter/A.
891 Vendor SCSI adapter being identified or configured.
892 Vendor display adapter being identified or configured.
893 Vendor LAN adapter being identified or configured.
894 Vendor async/communications adapter being identified or configured.
895 Vendor IEEE 488 adapter being identified or configured.
896 Vendor VME bus adapter being identified or configured.
897 S/370 Channel Emulator adapter being identified or configured.
898 POWER Gt1x graphics adapter being identified or configured.
899 3490 attached tape drive being identified or configured.
89c Progress indicator. A multimedia SCSI CD-ROM is being identified or configured.
901 Vendor SCSI device being identified or configured.
902 Vendor display device being identified or configured.
903 Vendor async device being identified or configured.
904 Vendor parallel device being identified or configured.
905 Vendor other device being identified or configured.
908 POWER GXT1000 Graphics subsystem being identified or configured.
910 1/4GB Fibre Channel/266 Standard Adapter being identified or configured.
911 Fibre Channel/1063 Adapter Short Wave
912 2.0GB SCSI-2 differential disk drive being identified or configured.
913 1.0GB differential disk drive being identified or configured.
914 5GB 8 mm differential tape drive being identified or configured.
915 4GB 4 mm tape drive being identified or configured.
916 Non-SCSI vendor tape adapter being identified or configured.
917 Progress indicator. 2.0GB 16-bit differential SCSI disk drive is being identified or
configured.
918 Progress indicator. 2GB 16-bit single-ended SCSI disk drive is being identified or
configured.
920 Bridge Box being identified or configured.
921 101 keyboard being identified or configured.
922 102 keyboard being identified or configured.
923 Kanji keyboard being identified or configured.
924 Two-button mouse being identified or configured.
925 Three-button mouse being identified or configured.
926 5083 tablet being identified or configured.
927 5083 tablet being identified or configured.
928 Standard speaker being identified or configured.
929 Dials being identified or configured.
930 Lighted program function keys (LPFK) being identified or configured.
931 IP router being identified or configured.
933 Async planar being identified or configured.
934 Async expansion drawer being identified or configured.
935 3.5-inch diskette drive being identified or configured.
936 5.25-inch diskette drive being identified or configured.
937 An HIPPI adapter is being configured.
942 POWER GXT 100 graphics adapter being identified or configured.
943 Progress indicator. 3480 and 3490 control units attached to a System/370 Channel
Emulator/A adapter are being identified or configured.
944 100MB ATM adapter being identified or configured
945 1.0GB SCSI differential disk drive being identified or configured.
946 Serial port 3 adapter is being identified or configured.
947 Progress indicator. A 730MB SCSI disk drive is being configured.
948 Portable disk drive being identified or configured.
949 Unknown direct bus-attach device being identified or configured.
950 Missing SCSI device being identified or configured.
951 670MB SCSI disk drive being identified or configured.
952 355MB SCSI disk drive being identified or configured.
953 320MB SCSI disk drive being identified or configured.
954 400MB SCSI disk drive being identified or configured.
955 857MB SCSI disk drive being identified or configured.
956 670MB SCSI disk drive electronics card being identified or configured.
957 120MB DBA disk drive being identified or configured.
958 160 MB DBA disk drive being identified or configured.
959 160MB SCSI disk drive being identified or configured.
960 1.37GB SCSI disk drive being identified or configured.
964 Internal 20GB 8mm tape drive identified or configured.
968 1.0GB SCSI disk drive being identified or configured.
970 Half-inch, 9-track tape drive being identified or configured.
971 150MB 1/4-inch tape drive being identified or configured.
972 2.3GB 8 mm SCSI tape drive being identified or configured.
973 Other SCSI tape drive being identified or configured.
974 CD-ROM drive being identified or configured.
975 Progress indicator. An optical disk drive is being identified or configured.
977 M-Audio Capture and Playback Adapter being identified or configured.
981 540MB SCSI-2 single-ended disk drive being identified or configured.
984 1GB 8-bit disk drive being identified or configured.
985 M-Video Capture Adapter being identified or configured.
986 2.4GB SCSI disk drive being identified or configured.
987 Progress indicator. Enhanced SCSI CD-ROM drive is being identified or configured.
989 200MB SCSI disk drive being identified or configured.
990 2.0GB SCSI-2 single-ended disk drive being identified or configured.
991 525MB 1/4-inch cartridge tape drive being identified or configured.
994 5GB 8 mm tape drive being identified or configured.
995 1.2GB 1/4 inch cartridge tape drive being identified or configured.
996 Progress indicator. Single-port, multi-protocol communications adapter is being
identified or configured.
997 FDDI adapter being identified or configured.
998 2.0GB4 mm tape drive being identified or configured.
999 7137 or 3514 Disk Array Subsystem being configured.
D81 T2 Ethernet Adapter being configured.

Diagnostic Load Progress Indicators
-----------------------------------

Note: When a lowercase c is listed, it displays in the lower half of the seven-segment
character position.

c00 AIX Install/Maintenance loaded successfully.
c01 Insert the first diagnostic diskette.
c02 Diskettes inserted out of sequence.
c03 The wrong diskette is in diskette drive.
c04 The loading stopped with a nonrecoverable error.
c05 A diskette error occurred.
c06 The rc.boot configuration shell script is unable to determine type of boot.
c07 Insert the next diagnostic diskette.
c08 RAM file system started incorrectly.
c09 The diskette drive is reading or writing a diskette.
c20 An unexpected halt occurred, and the system is configured to enter the kernel
debug program instead of entering a system dump.
c21 The ifconfig command was unable to configure the network for the client network
host.
c22 The tftp command was unable to read client's ClientHostName info file during a
client network boot.
c24 Unable to read client's ClientHostName.info file during a client network boot.
c25 Client did not mount remote miniroot during network install.
c26 Client did not mount the /usr file system during the network boot.
c29 The system was unable to configure the network device.
c31 Select the console display for the diagnostics. To select No console display, set the 
key mode switch to Normal then to Service. The diagnostic programs will then load
and run the diagnostics automatically.
c32 A direct-attached display (HFT) was selected.
c33 A tty terminal attached to serial ports S1 or S2 was selected.
c34 A file was selected. The console messages store in a file.
c40 Configuration files are being restored.
c41 Could not determine the boot type or device.
c42 Extracting data files from diskette.
c43 Cannot access the boot/install tape.
c44 Initializing installation database with target disk information.
c45 Cannot configure the console.
c46 Normal installation processing.
c47 Could not create a physical volume identifier (PVID) on disk.
c48 Prompting you for input.
c49 Could not create or form the JFS log.
c50 Creating root volume group on target disks.
c51 No paging devices were found.
c52 Changing from RAM environment to disk environment.
c53 Not enough space in the /tmp directory to do a preservation installation.
c54 Installing either BOS or additional packages.
c55 Could not remove the specified logical volume in a preservation installation.
c56 Running user-defined customization.
c57 Failure to restore BOS.
c58 Displaying message to turn the key.
c59 Could not copy either device special files, device ODM, or volume group information
from RAM to disk.
c61 Failed to create the boot image.
c62 Loading platform dependent debug files
c63 Loading platform dependent data files
c64 Failed to load platform dependent data files
c70 Problem Mounting diagnostic CDROM disc
c99 Diagnostics have completed. This code is only used when there is no console.


=============
2.

0c0 The dump completed successfully 
0c1 The dump failed due to an I/O error. 
0c2 A user-requested dump has started. You requested a dump using the SYSDUMPSTART command, a dump key sequence, or the Reset button. 

0c3 The dump is inhibit 
0c4 The dump did not complete. A partial dump was written to the dump device. There is not enough space on the dump deviceto contain the entire dump. To prevent this problem from occuring again, you must increase the size of your dumpmedia. 


0c5 The dump failed to start. An unecpected error occured while the system was attempting to write to the dump media. 
0c6 A dump to the secondary dump device was requested. Make the secondary dump device ready, then press CTRL-ALT-NUMPAD2. 
0c7 Reserved. 
0c8 The dump function is disabled. No primary dump device is configured. 
0c9 A dump is in progress. 
0cc Unknown dump failure 


---------- Diagnostics Load Progress Indicators ----------- 

c00 AIX Install/Maintenance loaded successfully. 
c01 Insert the first diagnostic diskette. 
c02 Diskettes inserted out of sequence. 
c03 The wrong diskette is in the drive. 
c04 The loading stopped with an irrecoverable error. 
c05 A diskette error occurred. 
c08 RAM filesystem started incorrectly. 
c07 Insert the next diagnostic diskette. 
c09 The diskette drive is reading or writing a diskette. 
c20 An unexpected halt occured, and the system is configured to enter the kernel debug program instead of entering asystem dump. 

c21 The 'ifconfig' command was unable to configure the network for the client network host. 
c22 The 'tftp' command was unable to read client's ClientHostName.info file during a client network boot. 
c24 Unable to read client's ClientHostName.info file during a client network boot. 
c25 Client did not mount remote miniroot during network install. 
c26 Client did not mount the /usr filesystem during the network boot. 
c29 System was unable to configure the network device. 
c31 Select the console display for the diagnostics. To select "No console display", set the key mode switch to normal thento Service. The diagnostic program will then load and run the diagnostics automatically. 

c32 A direct-attached display (HFT) was selected. 
c33 a TTY terminal attached to serial ports S1 or S2 was selected. 
c34 A file was selected. The console messages store in a file 
c40 Configuration files are been restored. 
c41 Could not determine the boot type or device. 
c42 Extracting data files from diskette. 
c43 Diagboot cannot be accessed. 
c44 Initialyzing installation database with target disk information. 
c45 Cannot configure the console. 
c46 Normal installation processing. 
c47 Could not create a physical volume identifier (PVID) on disk. 
c48 Prompting you for input. 
c49 Could not create or form the JFS log. 
c50 Creating rootvg volume group on target disk 
c51 No paging space were found. 
c52 Changing from RAM environment to disk environment. 
c53 Not enough space in the /tmp directory to do a preservation installation. 
c54 Installing either BOS or additionnal packages. 
c55 Could not remove the specified logical volume in a preservation installation. 
c56 Running user-defined customization. 
c57 Failure to restore BOS. 
c58 Display message to turn the key. 
c59 Could not copy either device special files, device ODM, or volume group information from RAM to disk. 
c61 Failed to create the boot image. 
c70 Problem Mounting diagnostics CDROM disc. 
c99 Diagnostics have completed. This code is only used when there is no console. 


--------Debugger Progress Indicators ---------- 

c20 Kernel debug program activated. An unexpected system halt has occured, and you have configured the system 
to enter the kernel debug program instead of performing a dump. 


---------Built-In Self Test (Bist) Indicators--------- 

100 BIST completed successfully. Control was passed to IPL ROS. 
101 BIST started following RESET 
102 BIST started following Power-on Reset 
103 BIST could not determine the system model number. 
104 Equipment conflict. BIST could not find the CBA. 
105 BIST could not read the OCS EPROM. 
106 BIST detected a module error. 
111 OCS stopped. BIST detected a module error. 
112 A checkstop occured during BIST. 
113 BIST checkstop count is greater than 1. 
120 BIST starting a CRC check on the 8752 EPROM. 
121 BIST detected a bad CRC in the first 32K of the OCS EPROM. 
122 BIST started a CRC check on the first 32K of the OCS EPROM. 
123 BIST detected a bad CRC on the OCS area of NVRAM. 
124 BIST started a CRC check on the OCS area of NVRAM. 
125 BIST detected a bad CRC on the time-of-day area of NVRAM. 
126 BIST started a CRC check on the time-of-day area of the NVRAM. 
127 BIST detected a bad CRC on the 8752 EPROM. 
130 BIST presence test started. 
140 BIST failed: procedure error 
142 BIST failed: procedure error 
143 Invalid memory configuration. 
144 BIST failed; procedure error. 
151 BIST started AIPGM test code. 
152 BIST started DCLST test code. 
153 BIST started ACLST test code. 
154 BIST started AST test code. 
160 Bad EPOW Signal/Power status signal 
161 BIST being conducted on BUMP I/O 
162 BIST being conducted on JTAG 
163 BIST being conducted on Direct I/O 
164 BIST being conducted on CPU 
165 BIST being conducted on DCB and Memory 
166 BIST being conducted on interrupts 
170 BIST being conducted on 'Multi-Processor 
180 BIST logout failed. 
182 BIST COP bus not responding 
185 A checkstop condition occured during the BIST 
186 System logic-generated checkstop (Model 250 only) 
187 Graphics-generated checkstop (Model 250) 
195 BIST logout completed. 
888 BIST did not start 


------- Power-On Self Test ------- 

200 IPL attempted with keylock in the SECURE position. 
201 IPL ROM test failed or checkstop occured (irrecoverable) 
202 IPL ROM test failed or checkstop occured (irrecoverable) 
203 Unexpected data storage interrupt. 
204 Unexpected instruction storage interrupt. 
205 Unexpected external interrupt. 
206 Unexpected alignment interrupt. 
207 Unexpected program interrupt. 
208 Unexpected floating point unavailable interrupt. 
209 Unexpected SVC interrupt. 
20c L2 cache POST error. (The display shows a solid 20c for 5 seconds 
210 Unexpected SVC interrupt. 
211 IPL ROM CRC comparison error (irrecoverable). 
212 RAM POST memory configuration error or no memory found (irrecoverable). 
213 RAM POST failure (irrecoverable). 
214 Power status register failed (irrecoverable). 
215 A low voltage condition is present (irrecoverable). 
216 IPL ROM code being uncompressed into memory. 
217 End of bootlist encountered. 
218 RAM POST is looking for 1M bytes of good memory. 
219 RAM POST bit map is being generated. 
21c L2 cache is not detected. (The display shows a solid 21c for 5 sec) 
220 IPL control block is being initialized. 
221 NVRAM CRC comparison error during AIX. 
IPL(Key Mode Switch in Normal mode). 
Reset NVRAM by reaccomplishing IPL in Service mode. For systems with an internal, direct-bus-attached(DBA)disk,IPL 
ROM attempted to perform an IPL from that disk before halting with this three-digit display value. 
222 Attempting a Normal mode IPL from Standard I/O planar attached devices specified in NVRAM IPL Devices List. 
223 Attempting a Normal mode IPL from SCSI attached devices specified in NVRAM IPL Devices List. 
Note: May be caused by incorrect jumper setting for external SCSI devices or by incorrect SCSI terminator. 
REFER FFC B88 
224 Attempting a Normal mode restart from 9333 subsystem device specified in NVRAM device list. 
225 Attempting a Normal mode IPL from IBM 7012 DBA disk attached devices specified in NVRAM IPL Devices List. 
226 Attempting a Normal mode restart from Ethernet specified in NVRAM device list. 
227 Attempting a Normal mode restart from Token Ring specified in NVRAM device list. 
228 Attempting a Normal mode IPL from NVRAM expansion code. 
229 Attempting a Normal mode IPL from NVRAM IPL Devices List; cannot IPL from any of the listed devices, or there are 
no valid entry in the Devices List. 
22c Attempting a normal mode IPL from FDDI specified in NVRAM IPL device list. 
230 Attempting a Normal mode restart from adapter feature ROM specified in IPL ROM devices list. 
231 Attempting a Normal mode restart from Ethernet specified in IPL ROM devices list. 
232 Attempting a Normal mode IPL from Standard I/O planar attached devices specified in Rom Default Device List. 
233 Attempting a Normal mode IPL from SCSI attached devices specified in IPL ROM Default Device List. 
234 Attempting a Normal mode restart from 9333 subsystem device specified in IPL ROM device list. 
235 Attempting a Normal mode IPL from IBM 7012 DBA disk attached devices specified in IPL ROM Default Device List. 
236 Attempting a Normal mode restart from Ethernet specified in IPL ROM default devices list. 
237 Attempting a Normal mode restart from Token Ring specified in IPL ROM default device list. 
238 Attempting a Normal mode restart from Token Ring specified by the operator. 
239 System failed to restart from the device chosen by the operator. 
23c Attempting a normal mode IPL from FDDI specified in IPL ROM device list. 
240 Attempting a Service mode restart from adapter feature ROM. 
241 Attempting a Normal mode IPL from devices specified in the NVRAM IPL Devices List. 
242 Attempting a Service mode IPL from Standard I/O planar attached devices specified in NVRAM IPL Devices List. 
243 Attempting a Service mode IPL from SCSI attached devices specified in NVRAM IPL Devices List. 
244 Attempting a Service mode restart from 9333 subsystem device specified in NVRAM device list. 
245 Attempting a Service mode IPL from IBM 7012 DBA disk attached devices specified in NVRAM IPL Devices List. 
246 Attempting a Service mode restart from Ethernet specified in NVRAM device list. 
247 Attempting a Service mode restart from Token Ring specified in NVRAM device list. 
248 Attempting a Service mode IPL from NVRAM expansion code. 
249 Attempting a Service mode IPL from NVRAM IPL Devices List; cannot IPL from any of the listed devices, or there areno valid entries in the Devices List. 

24c Attempting a service mode IPL from FDDI specified in NVRAM IPL device list. 
250 Attempting a Service mode restart from adapter feature ROM specified in IPL ROM device list. 
251 Attempting a Service mode restart from Ethernet specified in IPL ROM device list. 
252 Attempting a Service mode IPL from standard I/O planar attached devicesspecified in ROM Default Device List. 
253 Attempting a Service mode IPL from SCSI attached devices specified in IPL ROM Default Device List. 
254 Attempting a Service mode restart from 9333 subsystem device specified in IPL ROM device list. 
255 Attempting a Service mode IPL from IBM 7012 DBA disk'attached devices specified in IPL ROM Default Devices List. 
256 Attempting a Service mode restart from Ethernet specified in IPL ROM default device list. 
257 Attempting a Service mode restart from Token Ring specified in IPL ROM default device list. 
258 Attempting a Service mode restart from Token Ring specified by the operator. 
259 Attempting a Service mode restart from FDDI specified by the operator. 

25c Attempting a normal mode IPL from FDDI specified in IPL ROM device list. 
260 Information is being displayed on the display console. 
261 Information will be displayed on the tty terminal when the "1" key is pressed on the tty terminal keyboard. 
262 A keyboard was not detected as being connected to the system's 
NOTE: Check for blown planar fuses or for a corrupted boot on disk drive 
263 Attempting a Normal mode restart from adapter feature ROM specified in NVRAM device list. 
269 Stalled state - the system is unable to IPL 
271 Mouse port POST. 
272 Tablet port POST. 
277 Auto Token-Ring LANstreamer MC 32 Adapter 
278 Video ROM Scan POST. 
279 FDDI adapter POST. 
280 3COM Ethernet POST. 
281 Keyboard POST executing. 
282 Parallel port POST executing 
283 Serial port POST executing 
284 POWER Gt1 graphadapte POST executing 
285 POWER Gt3 graphadapte POST executing 
286 Token Ring adapter POST executing. 
287 Ethernet adapter POST executing. 
288 Adapter card slots being queried. 
289 GTO POST. 
290 IOCC POST error (irrecoverable). 
291 Standard I/O POST running. 
292 SCSI POST running. 
293 IBM 7012 DBA disk POST running. 
294 IOCC bad TCW SIMM in slot location J being tested. 
295 Graphics Display adapter POST, color or grayscale. 
296 ROM scan POST. 
297 System model number does not compare between OCS and ROS 
(irrecoverable). Attempting a software IPL. 
298 Attempting a software IPL (warm boot). 
299 IPL ROM passed control to the loaded program code. 
301 Flash Utility ROM failed or checkstop occured (irrecoverable) 
302 Flash Utility ROM failed or checkstop occured (irrecoverable) 
302 Flash Utility ROM: User prompt, move the key to the service in order to perform an optional Flash Update. LED 
will only appear if the key switch is in the SECURE position. This signals the user that a Flash Update may be 
initiated by moving the key switch to the SERVICE position. If the key is moved to the SERVICE position, 
LED 303 will be displayed. This signals the user to press the reset button and select optional Flash Update. 
303 Flash Utility ROM: User prompt, press the reset button in order to perform an optional Flash Update. LED 
only appear if the key switch is in the SECURE position. This signals the user that a Flash Update may be initiated 
by moving the key switch to the SERVICE position. If the key is moved to the SERVICE position, LED 303 will be 
displayed. This signals the user to press the reset button and select optional Flash Update. 
304 Flash Utility ROM IOCC POST error (irrecoverable) 
305 Flash Utility ROM standard I/O POST running. 
306 Flash Utility ROM is attempting IPL from Flash Update Boot Image. 
307 Flash Utility ROM system model number does not compare between OCS and ROM (irrecoverable). 
308 Flash Utility ROM: IOCC TCW memory is being tested. 
309 Flash Utility ROM passed control to a Flash Update Boot Image. 
311 Flash Utility ROM CRC comparison error (irrecoverable). 
312 Flash Utility ROM RAM POST memory configuration error or no memory found ( iirecoverable). 
313 Flash Utility ROM RAM POST failure( irrecoverable). 
314 Flash Utility ROM Power status register failed (irrecoverable). 
315 Flash Utility ROM detected a low voltage condition. 
318 Flash Utility ROM RAM POST is looking for good memory. 
319 Flash Utility ROM RAM POST bit map is being generated. 
322 CRC error on media Flash Image. No Flash Update performed. 
323 Current Flash Image is being erased. 
324 CRC error on new Flash Image after Update was performed. (Flash Image is corrupted). 
325 Flash Image successful and complete. 

500 Querying Native I/O slot. 
501 Querying card in Slot 1 
502 Querying card in Slot 2 
503 Querying card in Slot 3 
504 Querying card in Slot 4 
505 Querying card in Slot 5 
506 Querying card in Slot 6 
507 Querying card in Slot 7 
508 Querying card in Slot 8 
510 Starting device configuration. 
511 Device configuration completed. 
512 Restoring device configuration files from media. 
513 Restoring basic operating system installation files from media. 
516 Contacting server during network boot 
517 Mounting client remote file system during network IPL. 
518 Remote mount of the root and /usr filesystems failed during network boot. 
520 Bus configuration running. 
521 /etc/init invoked cfgmgr with invalid options; /etc/init has been corrupted or incorrectly modified 
(irrecoverable error). 
522 The configuration manager has been invoked with conflicting options (irrecoverable error). 
523 The configuration manager is unable to access the ODM database (irrecoverable error). 
524 The configuration manager is unable to access the config rules object in the ODM database (irrecoverable error). 
525 The configuration manager is unable to get data from a customized device object in the ODM database 
(irrecoverable error). 
526 The configuration manager is unable to get data from a customized device driver objet in the ODM database 
(irrecoverable error). 
527 The configuration manager was invoked with the phase 1 flag; running phase 1 flag; running phase 1 at this point 
is not permitted (irrecoverable error). 
528 The configuration manager cannot find sequence rule, or no program was specified in the ODM database 
(irrecoverable error). 
529 The configuration manager is unable to update ODM data 
(irrecoverable error). 
530 The program "savebase" returned an error. 
531 The configuration manager is unable to access PdAt object class 
(irrecoverable eroor) 
532 There is not enough memory to continue (malloc failure); 
irrecoverable error. 
533 The configuration manager could not find a configure method for a device. 
534 The configuration manager is unable to aquire database lock. irrecoverable error. 
536 The configuration manager encountered more than one sequence rule specified in the same phase. (irrecoverable error). 
537 The configuration manager encountered an error when invoking the program in the sequence rule. 
538 The configuration manager is going to invoke a configuration 
539 The configuration method has terminated, and control has returned to the configuration manager. 
551 IPL Varyon is running 

552 IPL Varyon failed. 
553 IPL phase 1 is complete. 
554 Unable to define NFS swap device during network boot 
555 Unable to define NFS swap device during network boot 
556 Logical Volume Manager encountered error during IPL varyon. 
557 The root filesystem will not mount. 
558 There is not enough memory to continue the IPL. 
559 Less than 2MB of good memory are available to load the AIX kernel. 
570 Virtual SCSI devices being configured. 
571 HIPPI common function device driver being configured. 
572 HIPPI IPI-3 master transport driver being configured. 
573 HIPPI IPI-3 slave transport driver being configured. 
574 HIPPI IPI-3 transport services user interface device driver being configured. 
576 Generic async device driver being configured. 
577 Generic SCSI device driver being configured. 
578 Generic commo device driver being configured. 
579 Device driver being configured for a generic device. 
580 HIPPI TCPIP network interface driver being configured. 
581 Configuring TCP/IP. 
582 Configuring token ring data link control. 
583 Configuring an Ethernet data link control. 
584 Configuring an IEEE ethernet data link control. 
585 Configuring an SDLC MPQP data link control. 
586 Configuring a QLLC X.25 data link control. 
587 Configuring NETBIOS. 
588 Configuring a Bisync Read-Write (BSCRW). 
589 SCSI target mode device being configured. 
590 Diskless remote paging device being configured. 
591 Configuring an LVM device driver 
592 Configuring an HFT device driver 
593 Configuring SNA device drivers. 
594 Asynchronous I/O being defined or configured. 
595 X.31 pseudo device being configured. 
596 SNA DLC/LAPE pseudo device being configured. 
597 OCS software being configured. 
598 OCS hosts being configured during system reboot. 
599 Configuring FDDI data link control. 
5c0 Streams-based hardware drive being configured. 
5c1 Streams-based X.25 protocol being configured. 
5c2 Streams-based X.25 COMIO emulator driver being configured. 
5c3 Streams-based X.25 TCP/IP interface driver being configured. 
5c4 FCS adapter device driver being configured. 
5c5 SCB network device driver for FCS is being configured. 
5c6 AIX SNA channel being configured. 
600 Starting network boot portion of /sbin/rs.boot 
602 Configuring network parent devices. 
603 /usr/lib/methods/defsys 
/usr/lib/methods/cggsys, or 
/usr/lib/methods/cggbus failed. 
604 Configuring physical network boot device. 
605 Configuring physical network boot device failed. 
606 Running /usr/sbin/ifconfig on logical network boot device. 
607 /usr/sbin/ifconfig failed. 
608 Attempting to retrieve the client.info file with tftp. Note that a flashing 608 indicates multiple attempts 
to retrieve the client_info file are occuring. 
609 The client.info file does not exist or it is zero length. 
610 Attempting remote mount of NFS file system 
611 Remote mount of the NFS filesystem failed. 
612 Accessing remote files; unconfiguring network boot device. 
614 Configuring local paging devices. 
615 Configuring of a local paging device failed. 
616 Converting from diskette to dataless configuration. 
617 Diskless to dataless configuration failed. 
618 Configuring remote (NFS) paging devices. 
619 Configuration of a remote (NFS) paging device failed. 
620 Updating special device files and ODM in permanent filesystem with data from boot RAM filesystem. 
622 Boot process configuring for operating system installation. 

650 IBM SCSD disk drive drive being configured 
700 Progress indicator. A 1.1GB 8-bit SCSI disk drive being identified or configured. 
701 Progress indicator. A 1.1GB 16-bit SCSI SE disk drive being identified or configured. 
702 Progress indicator. A 1.1GB 16-bit SCSI differential disk drive being identified or configured. 
703 Progress indicator. A 2.2GB 8-bit SCSI disk drive being identified or configured. 
704 Progress indicator. A 2.2GB 16-bit SCSI SE disk drive being identified or configured. 
705 The configuration method for the 2.2GB 16-bit differential SCSI disk drive is being run. If a irrecoverableerror occurs, the system halts. identified or configured. 

706 Progress indicator. A 4.5GB 16-bit SE SCSI disk drive is being identified or configured. 
707 Progress indicator. A 4.5GB 16-bit differential SCSI drive is being identified or configured. 
708 Progress indicator: A L2 cache is being identified or configured. 
710 POWER GXT150M graphics adapterbeing ientifyied or configured. 
711 Unknown adapter being identified or configured. 
712 Graphics slot bus configuration is executing. 
713 The IBM ARTIC960 device is being configured. 
714 A video capture adapter is being configured. 
715 The Ultimedia Services audio adapter is being configured. This LED displays briefly on the panel. 
720 Unknown read/write optical drive type being configured. 
721 Unknown disk or SCSI device being identified or configured. 
722 Unknown disk being identified or configured. 
723 Unknown CDROM being identified or configured. 
724 Unknown tape drive being identified or configured. 
725 Unknown display being identified or configured. 
726 Unknown input device being idenor configured 
727 Unknown adync device being idenor configured


=========================================== 
10. Diskless machines, NFS Implementations:
===========================================


Setting up nfs, NetBSD
Setting up nfs, OpenBSD
Setting up nfs, FreeBSD
Setting up nfs, Mac OS X and Darwin
Setting up nfs, Linux
Setting up nfs, SunOS
Setting up nfs, Solaris
Setting up nfs, NEWS-OS
Setting up nfs, NEXTSTEP
Setting up nfs, HP-UX 7 (couldn't get it to work)
Setting up nfs, HP-UX 9
Setting up nfs, HP-UX 10 and later


--------------------------------------------------------------------------------

NetBSD and OpenBSD
If you have built your own kernel, you need to make sure you have the following in your config file: 
options         NFSSERVER
The GENERIC kernel distributed with NetBSD has this compiled in. 

# mkdir -p /export/client/root/dev 

# mkdir /export/client/usr 

# mkdir /export/client/home 

# touch /export/client/swap 

# cd /export/client/root 

# tar -xvpzf /export/client/NetBSD-release/binary/sets/kern.tgz 

# mknod /export/client/root/dev/console c 0 0 

Add the following lines to /etc/exports: 
#/etc/exports
/export/client/root -maproot=root:wheel    client.test.net
/export/client/swap -maproot=root:wheel    client.test.net
/export/client/usr  -maproot=nobody:nobody client.test.net
/export/client/home -maproot=nobody:nobody client.test.net

# ps -aux | grep mountd
If mountd is running, then kill -HUP that process to force it to reread /etc/exports. Otherwise, you'll need to start it:
# /usr/sbin/mountd


# ps -aux | grep nfsd
If the nfsdaemons are not running, then you need to start them:
# /usr/sbin/nfsd -tun 4 
If the server isn't running the NFS daemons, the client will print: 

le(0,0,0,0): Unknown error: code -1
boot: Unknown error: code -1
If the server is running NFS, but isn't exporting the root directory to the client, the client will print: 
boot: no such file or directory
If everything is working properly, you will see a few numbers and a spinning cursor on the client. This means you have succeeded! At this point, your client isn't bootable. If you let it continue, it will panic when attempting to start init. 
Continue on to setting up the client filesystem 


--------------------------------------------------------------------------------

FreeBSD
The setup for FreeBSD 4.x is similar to NetBSD, but mountd needs different options and /etc/exports has a different format. 
# mkdir -p /export/client/root/dev 

# mkdir /export/client/usr 

# mkdir /export/client/home 

# touch /export/client/swap 

# cd /export/client/root 

# tar [--numeric-owner] -xvpzf /export/client/NetBSD-release/binary/sets/kern.tgz 

# mknod /export/client/root/dev/console c 0 0 

Add the following line to /etc/exports (see the FreeBSD Handbook, Section 17.4 on NFS): 
#/etc/exports
/export/client/root /export/client/swap -maproot=root:wheel    client.test.net 

FreeBSD is unable to export multiple directories within a filesystem (such as /export) to a client unless all of the directories are listed on a single line in /etc/exports. 
You will also need to make sure the your client's /home and /usr are stored in /export/client/root. FreeBSD is unable to set different properties for exported directories, defeating the point of exporting those directories separately (and without -maproot=root:wheel). 


# ps -aux | grep mountd
If mountd is running, then kill that process. You need it to be running with the -r option for the swap file to be mountable, and the -2 option is to force it to use NFS V2.
# /sbin/mountd -2r


# ps -aux | grep nfsd
If the nfsdaemons are not running, then you need to start them:
# /sbin/nfsd -tun 4 
If the server isn't running the NFS daemons, the client will print: 

le(0,0,0,0): Unknown error: code -1
boot: Unknown error: code -1
If the server is running NFS, but isn't exporting the root directory to the client, the client will print: 
boot: no such file or directory
If everything is working properly, you will see a few numbers and a spinning cursor on the client. This means you have succeeded! At this point, your client isn't bootable. If you let it continue, it will panic when attempting to start init. 
Continue on to setting up the client filesystem 


--------------------------------------------------------------------------------

Mac OS X and Darwin
This setup for Mac OS X and Darwin use the NetInfo system. There are ways to use typical BSD-style configuration files, but most systems are by default configured to use NetInfo. Here, we describe how to set up a default install of Mac OS X/Darwin (i.e. in its own local NetInfo domain). Read your netinfo(5) man page for more information. 

# mkdir -p /export/client/root/dev 

# mkdir /export/client/usr 

# mkdir /export/client/home 

# touch /export/client/swap 

# cd /export/client/root 

# tar -xvpzf /export/client/NetBSD-release/binary/sets/kern.tgz 

# mknod /export/client/root/dev/console c 0 0 

Modify the NetInfo database to export your shares. Note that you must escape the forward slashes in the path to your export twice. Once for the shell, and once for the NetInfo parser (since it uses forward slashes to delimit NetInfo properties). Just to add to the confusion, the NetInfo property we're adding to is called /exports. 
# nicl . -create /exports/\\/export\\/client\\/root opts maproot=root:wheel
# nicl . -create /exports/\\/export\\/client\\/root clients 192.168.0.10
# nicl . -create /exports/\\/export\\/client\\/swap opts maproot=root:wheel
# nicl . -create /exports/\\/export\\/client\\/swap clients 192.168.0.10
# nicl . -create /exports/\\/export\\/client\\/usr opts maproot=nobody:nobody
# nicl . -create /exports/\\/export\\/client\\/usr clients 192.168.0.10
# nicl . -create /exports/\\/export\\/client\\/home opts maproot=nobody:nobody
# nicl . -create /exports/\\/export\\/client\\/home clients 192.168.0.10

To later add another client for the same export, you would append to that property (as opposed to the initial create): 
# nicl . -append /exports/\\/export\\/client\\/root clients 192.168.0.12

To verify that everything looks good, read it back: 

# nicl . -read /exports/\\/export\\/client\\/root
name: /export/client/root
opts: maproot=root:wheel
clients: 192.168.0.10 192.168.0.12

# ps -aux | grep portmap
If the portmap is not running, then you need to start it:
# /usr/sbin/portmap 

# ps -aux | grep nfsd
If the nfsdaemons are not running, then you need to start them:
# /sbin/nfsd -t -u -n 6 

# ps -aux | grep mountd
If mountd is running, then kill -HUP that process to force it to reread the NetInfo database. If it's not running, then you need to start it:
# /usr/sbin/mountd 

Your system will always start the NFS daemons after reboots if the NetInfo /exports property is present. To remove all exports and prevent your system from starting NFS in the future, run:
# nicl . -delete /exports 
If the server isn't running the NFS daemons, the client will print: 

le(0,0,0,0): Unknown error: code -1
boot: Unknown error: code -1
If the server is running NFS, but isn't exporting the root directory to the client, the client will print: 
boot: no such file or directory
If everything is working properly, you will see a few numbers and a spinning cursor on the client. This means you have succeeded! At this point, your client isn't bootable. If you let it continue, it will panic when attempting to start init. 
Continue on to setting up the client filesystem 


--------------------------------------------------------------------------------

Linux
# mkdir -p /export/client/root/dev 

# mkdir /export/client/usr 

# mkdir /export/client/home 

# touch /export/client/swap 

# cd /export/client/root 

# tar [--numeric-owner] -xvpzf /export/client/NetBSD-release/binary/sets/kern.tgz 

# mknod /export/client/root/dev/console c 0 0 

Add the following lines to /etc/exports: 
#/etc/exports
/export/client/root client.test.net(rw,no_root_squash)
/export/client/swap client.test.net(rw,no_root_squash)
/export/client/usr client.test.net(rw,root_squash)
/export/client/home client.test.net(rw,root_squash)

# ps aux | grep mountd
If mountd is running, then kill -HUP that process. This will force it to reread the /etc/exports file. If it's not already running, then you need to:
# /sbin/rpc.mountd [--no-nfs-version 3]
You may need to add the --no-nfs-version 3 if you're having problems. See below. 

# ps aux | grep nfsd
If the nfsdaemons are running, then you need to restart them so that they reread the /etc/exports file. If they're not already running, then you need to:
# /sbin/rpc.nfsd 
If the server isn't running the NFS daemons, the client will print: 

le(0,0,0,0): Unknown error: code -1
boot: Unknown error: code -1
If the server is running NFS, but isn't exporting the root directory to the client, the client will print: 
boot: no such file or directory
If everything is working properly, you will see a few numbers and a spinning cursor on the client. This means you have succeeded! At this point, your client isn't bootable. If you let it continue, it will panic when attempting to start init. 
Kernel NFS Problem: 

Most versions of linux only implement NFS2, in which case NetBSD will try NFS3 and then automatically fall back. Some versions (notably RedHat 6.0) will incorrectly answer both NFS2 and NFS3 mount requests, then ignore any attempt to access the filesystem using NFS3. This causes untold pain and hassle.

The workaround is to kill mountd and start it with options preventing NFS3 problems (i.e., rpc.mountd --no-nfs-version 3). 
Continue on to setting up the client filesystem 


--------------------------------------------------------------------------------

SunOS
# mkdir -p /export/client/root/dev 

# mkdir /export/client/usr 

# mkdir /export/client/home 

# touch /export/client/swap 

# cd /export/client/root 

# tar [--numeric-owner] -xvpzf /export/client/NetBSD-release/binary/sets/kern.tgz 

# mknod /export/client/root/dev/console c 0 0 

Create (or add to) your /etc/exports file:

#/etc/exports
/export/client/root -root=client
/export/client/swap -root=client
/export/client/usr
/export/client/home

# rm -f /etc/xtab;touch /etc/xtab 

# exportfs -a 

# ps aux | grep nfsd
If nfsd not already running, then run:
# nfsd 8 & 

# ps aux | grep mountd
If mountd is not already running, then run:
# rpc.mountd -n & 
If the server isn't running the NFS daemons, the client will print: 

le(0,0,0,0): Unknown error: code -1
boot: Unknown error: code -1
If the server is running NFS, but isn't exporting the root directory to the client, the client will print: 
boot: no such file or directory
If everything is working properly, you will see a few numbers and a spinning cursor on the client. This means you have succeeded! At this point, your client isn't bootable. If you let it continue, it will panic when attempting to start init. 
Continue on to setting up the client filesystem 


--------------------------------------------------------------------------------

Solaris
# mkdir -p /export/client/root/dev 

# mkdir /export/client/usr 

# mkdir /export/client/home 

# touch /export/client/swap 

# cd /export/client/root 

# tar [--numeric-owner] -xvpzf /export/client/NetBSD-release/binary/sets/kern.tgz 

# mknod /export/client/root/dev/console c 0 0 

Add the following lines to /etc/dfs/dfstab: 
share -F nfs -o root=client /export/client/root
share -F nfs -o root=client /export/client/swap
share -F nfs -o rw=client   /export/client/usr
share -F nfs -o rw=client   /export/client/home
Be certain to use names, if you use numeric IP addresses, Solaris will deny access without any error messages. 


# /usr/bin/ps -ef | grep nfs
If the nfs daemons are running, then you merely need to run:
# shareall
Normally, you'd need to run unshareall;shareall, but you've only added entries, not deleted anything. 
If the nfs daemons aren't running, then you will need to run:
# /etc/init.d/nfs.server start 

If the server isn't running the NFS daemons, the client will print: 

le(0,0,0,0): Unknown error: code -1
boot: Unknown error: code -1
If the server is running NFS, but isn't exporting the root directory to the client, the client will print: 
boot: no such file or directory
If everything is working properly, you will see a few numbers and a spinning cursor on the client. This means you have succeeded! At this point, your client isn't bootable. If you let it continue, it will panic when attempting to start init. 
Continue on to setting up the client filesystem 


--------------------------------------------------------------------------------

NEWS-OS
# mkdir -p /export/client/root/dev 

# mkdir /export/client/usr 

# mkdir /export/client/home 

# touch /export/client/swap 

# cd /export/client/root 

# tar [--numeric-owner] -xvpzf /export/client/NetBSD-release/binary/sets/kern.tgz 

# mknod /export/client/root/dev/console c 0 0 

Create (or add to) your /etc/exports file:

#/etc/exports
/export/client/root -root=client
/export/client/swap -root=client
/export/client/usr
/export/client/home

# rm -f /etc/xtab;touch /etc/xtab 

# /usr/etc/exportfs -av 

# ps -aux | grep nfsd
If nfsd not already running, then run:
# /etc/nfsd 4 & 
If the server isn't running the NFS daemons, the client will print: 

le(0,0,0,0): Unknown error: code -1
boot: Unknown error: code -1
If the server is running NFS, but isn't exporting the root directory to the client, the client will print: 
boot: no such file or directory
If everything is working properly, you will see a few numbers and a spinning cursor on the client. This means you have succeeded! At this point, your client isn't bootable. If you let it continue, it will panic when attempting to start init. 
Continue on to setting up the client filesystem 


--------------------------------------------------------------------------------

NEXTSTEP
Note, NEXTSTEP doesn't support exporting a file. This means that swap will have to be a file on your root (nfs) filesystem, and not its own nfs mounted file. Keep this in mind in later steps involving swap. 
You may also wish to keep with NEXTSTEP convention and place all of your client files in /private/export/client instead of /export/client. 


# mkdir -p /export/client/root/dev 

# mkdir /export/client/usr 

# mkdir /export/client/home 

# touch /export/client/root/swap 

# cd /export/client/root 

# tar [--numeric-owner] -xvpzf /export/client/NetBSD-release/binary/sets/kern.tgz 

# mknod /export/client/root/dev/console c 0 0 

Launch /NextAdmin/NFSManager.app 

Click on the "Export From ..." menu item 

Select your NetInfo Domain (probably /) and click OK. 

Click on the top Add button to pick your Directory Name


Type in your client's name under "Root Access" and click that "Add" button. 

Click OK. If your client doesn't have a DNS or /etc/hosts entry, NEXTSTEP will not serve correctly. 

Click the "Quit" menu item. 
For reference, here is a snapshot of what the NFSManager Exported Directories window should look like. 

If the server isn't running the NFS daemons, the client will print: 

le(0,0,0,0): Unknown error: code -1
boot: Unknown error: code -1
If the server is running NFS, but isn't exporting the root directory to the client, the client will print: 
boot: no such file or directory
If everything is working properly, you will see a few numbers and a spinning cursor on the client. This means you have succeeded! At this point, your client isn't bootable. If you let it continue, it will panic when attempting to start init. 
Continue on to setting up the client filesystem 


--------------------------------------------------------------------------------

HP-UX 7
I couldn't get the HP-UX 7 rpc.mountd to start. Here's what I tried, if you think it might work for you. Let us know what we're doing wrong. 
I don't think HP-UX 7's NFS server allows for restricting root read/write access. 


# mkdir -p /export/client/root/dev 

# mkdir /export/client/usr 

# mkdir /export/client/home 

# touch /export/client/swap 

# cd /export/client/root 

# tar [--numeric-owner] -xvpzf /export/client/NetBSD-release/binary/sets/kern.tgz 

# mknod /export/client/root/dev/console c 0 0 

Add the following lines to /etc/exports: 
#/etc/exports
/export/client/root client.test.net
/export/client/swap client.test.net
/export/client/usr  client.test.net
/export/client/home client.test.net

# ps -ef | grep nfsd
If they're not running, then run:
# /etc/nfsd 4 

Make sure the rpc.mountd in /etc/inetd.conf is uncommented 
If the server isn't running the NFS daemons, the client will print: 

le(0,0,0,0): Unknown error: code -1
boot: Unknown error: code -1
If the server is running NFS, but isn't exporting the root directory to the client, the client will print: 
boot: no such file or directory
If everything is working properly, you will see a few numbers and a spinning cursor on the client. This means you have succeeded! At this point, your client isn't bootable. If you let it continue, it will panic when attempting to start init. 
Continue on to setting up the client filesystem 


--------------------------------------------------------------------------------

HP-UX 9
# mkdir -p /export/client/root/dev 

# mkdir /export/client/usr 

# mkdir /export/client/home 

# touch /export/client/swap 

# cd /export/client/root 

# tar [--numeric-owner] -xvpzf /export/client/NetBSD-release/binary/sets/kern.tgz 

# mknod /export/client/root/dev/console c 0 0 

Open sam and make sure that the kernel has NFS support compiled in.
Kernel Configuration -> Subsystems, NFS/9000
This will require a reboot if it's not. 

Add the following lines to /etc/exports: 
#/etc/exports
/export/client/root   -root=client.test.net
/export/client/swap   -root=client.test.net
/export/client/usr  -access=client.test.net
/export/client/home -access=client.test.net

# ps -ef | grep mountd
If mountd is not already running, then run:
# /usr/etc/rpc.mountd 

# ps -ef | grep nfsd
If nfsd isn't already running, then run:
# /etc/nfsd 4 

# /usr/etc/exportfs -a 
If the server isn't running the NFS daemons, the client will print: 

le(0,0,0,0): Unknown error: code -1
boot: Unknown error: code -1
If the server is running NFS, but isn't exporting the root directory to the client, the client will print: 
boot: no such file or directory
If everything is working properly, you will see a few numbers and a spinning cursor on the client. This means you have succeeded! At this point, your client isn't bootable. If you let it continue, it will panic when attempting to start init. 
Continue on to setting up the client filesystem 


--------------------------------------------------------------------------------

HP-UX 10
# mkdir -p /export/client/root/dev 

# mkdir /export/client/usr 

# mkdir /export/client/home 

# touch /export/client/swap 

# cd /export/client/root 

# tar [--numeric-owner] -xvpzf /export/client/NetBSD-release/binary/sets/kern.tgz 

# mknod /export/client/root/dev/console c 0 0 

Edit /etc/rc.config.d/nfsconf and make sure that:
NFS_SERVER=1
START_MOUNTD=1
If those are not set, then you will need to run:
# /sbin/init.d/nfs.server start 

Add the following lines to /etc/exports: 
#/etc/exports
/export/client/root   -root=client.test.net
/export/client/swap   -root=client.test.net
/export/client/usr  -access=client.test.net
/export/client/home -access=client.test.net

# /usr/sbin/exportfs -a 
If the server isn't running the NFS daemons, the client will print: 

le(0,0,0,0): Unknown error: code -1
boot: Unknown error: code -1
If the server is running NFS, but isn't exporting the root directory to the client, the client will print: 
boot: no such file or directory
If everything is working properly, you will see a few numbers and a spinning cursor on the client. This means you have succeeded! At this point, your client isn't bootable. If you let it continue, it will panic when attempting to start init. 
Continue on to setting up the client filesystem 


#######################################################################
antapex.org
albert van der sel
#######################################################################


#############################################################################################
#############################################################################################
#############################################################################################


==================================================================================
SECTION 19: BIG SECTION: Oracle RDBMS 8,8i,9i,10g system queries and architecture:
==================================================================================


CONTENTS:

0.  Common data dictionary queries for sessions, locks, perfoRMANce etc..
1.  DATA DICTIONARY QUERIES m.b.t. files, tablespaces, logs:
2.  NOTES ON PERFORMANCE:
3.  Data dictonary queries m.b.t perfoRMANce:
4.  IMP and EXP, IMPDB and EXPDB, and SQL*Loader Examples
5.  Add, Move AND Size Datafiles,logfiles, create objects etc..:
6.  Install Oracle 92 on Solaris:
7.  install Oracle 9i on Linux:
8.  Install Oracle 9.2.0.2 on OpenVMS:
9.  Install Oracle 9.2.0.1 on AIX
9.  Installation Oracle 8i - 9i:
10. CONSTRAINTS:
11. DBMS_JOB and scheduled Jobs:
12. Net8,9,10 / SQLNet:
13. Datadictionary queries Rollback segments:
14. Data dictionary queries m.b.t. security, permissions:
15. INIT.ORA parameters:
16. Snapshots:
17. Triggers:
19. BACKUP RECOVERY, TROUBLESHOOTING:
20. TRACING:
21. Overig:
22. DBA% and v$ views
23  TUNING:
24  RMAN:
25. UPGRADE AND MIGRATION
26. Some info on Rdb:
27. Some info on IFS 
28. Some info on 9iAS rel. 2
29 - 35 9iAS configurations and troubleshooting
30. BLOBS
31. BLOCK CORRUPTION
32. iSQL*Plus and EM 10g
33. ADDM 
34. ASM


=========================================================================
0.  QUICK INFO/VIEWS ON SESSIONS, LOCKS, AND UNDO/ROLLBACK INFORMATION:
=========================================================================


-- ---------------------------
-- 0.1 QUICK VIEW ON SESSIONS:
-- ---------------------------

SELECT substr(username, 1, 10), osuser, sql_address, to_char(logon_time, 'DD-MM-YYYY;HH24:MI'),
       sid, serial#, command, substr(program, 1, 30), substr(machine, 1, 30), substr(terminal, 1, 30)
FROM   v$session;

SELECT sql_text, rows_processed from v$sqlarea where address=''


-- ------------------------
-- 0.2 QUICK VIEW ON LOCKS: (use the sys.obj$ to find ID1:)
-- ------------------------

SELECT s.sid                       sid, 
       substr(s.username, 1, 10)   username, 
       substr(s.schemaname, 1, 10) schemaname, 
       substr(s.osuser, 1, 10)     osuser, 
       substr(s.program, 1, 30)    program, 
       s.command                   command,
       l.lmode                     lockmode, 
       l.block                     blocked
FROM   v$session s, v$lock l
WHERE  s.sid=l.sid and schemaname not in ('SYS','SYSTEM');


SELECT SID,TYPE,ID1,ID2,LMODE,REQUEST 
FROM   v$lock WHERE type='TX'; 

SELECT event,p1,p2,p3 
FROM   v$session_wait 
WHERE  wait_time=0 and event='enqueue'; 

SELECT  l.object_id, l.session_id, substr(l.oracle_username, 1, 10), substr(l.os_user_name, 1, 10), 
        l.process, l.locked_mode, substr(o.object_name, 1, 20)
FROM    v$locked_object l, dba_objects o
WHERE   l.object_id=o.object_id;

SELECT s.sid, substr(s.username, 1, 10), substr(s.schemaname, 1, 10), 
       substr(s.osuser, 1, 10), substr(s.program, 1, 30), s.command,
       l.lmode, l.block
FROM   v$session s, v$lock l
WHERE  s.sid=l.sid;

-- The following is for associating Orace users and unix process ids

SELECT 
  p.spid                      unix_spid,
  s.sid                       sid, 
  p.addr,
  s.paddr,
  substr(s.username, 1, 10)   username, 
  substr(s.schemaname, 1, 10) schemaname, 
  s.command                   command,
  substr(s.osuser, 1, 10)     osuser, 
  substr(s.machine, 1, 25)    machine
FROM   v$session s, v$process p
WHERE  s.paddr=p.addr
ORDER BY p.spid;
 
SELECT waiting_session, holding_session, lock_type, mode_held
FROM   dba_waiters;


-- -----------------------------
-- 0.3 QUICK VIEW ON TEMP USAGE:
-- -----------------------------

select total_extents, used_extents, total_extents, current_users, tablespace_name
from v$sort_segment;

select username, user, sqladdr, extents, tablespace from v$sort_usage;

SELECT b.tablespace,
       ROUND(((b.blocks*p.value)/1024/1024),2),
       a.sid||','||a.serial# SID_SERIAL,
       a.username,
       a.program
     FROM sys.v_$session a,
          sys.v_$sort_usage b,
          sys.v_$parameter p
    WHERE p.name  = 'db_block_size'
      AND a.saddr = b.session_addr
      ORDER BY b.tablespace, b.blocks;

-- --------------------------------
-- 0.4 QUICK VIEW ON UNDO/ROLLBACK:
-- --------------------------------

SELECT  substr(username, 1, 10), substr(terminal, 1, 10), substr(osuser, 1, 10),
        t.start_time, r.name, t.used_ublk "ROLLB BLKS", log_io, phy_io
FROM    sys.v_$transaction t, sys.v_$rollname r, sys.v_$session s
WHERE   t.xidusn = r.usn
AND     t.ses_addr = s.saddr;

SELECT substr(n.name, 1, 10), s.writes, s.gets, s.waits, s.wraps, s.extents, s.status, 
       s.optsize, s.rssize
FROM   V$ROLLNAME n, V$ROLLSTAT s
WHERE  n.usn=s.usn;

SELECT substr(r.name, 1, 10) "RBS", s.sid, s.serial#, s.taddr, t.addr,
       substr(s.username, 1, 10) "USER", t.status,
       t.cr_get, t.phy_io, t.used_ublk, t.noundo,
       substr(s.program, 1, 15) "COMMAND"
FROM   sys.v_$session s, sys.v_$transaction t, sys.v_$rollname r
WHERE  t.addr = s.taddr
  AND  t.xidusn = r.usn
ORDER  BY t.cr_get, t.phy_io;

SELECT substr(segment_name, 1, 20), substr(tablespace_name, 1, 20), status,
       INITIAL_EXTENT, NEXT_EXTENT, MIN_EXTENTS, MAX_EXTENTS, PCT_INCREASE   
FROM   DBA_ROLLBACK_SEGS;

select 'FREE',count(*) from sys.fet$ 
union 
select 'USED',count(*) from sys.uet$;

-- Quick view active transactions

SELECT NAME, XACTS "ACTIVE TRANSACTIONS"
FROM   V$ROLLNAME, V$ROLLSTAT
WHERE  V$ROLLNAME.USN = V$ROLLSTAT.USN;

SELECT  to_char(BEGIN_TIME, 'DD-MM-YYYY;HH24:MI'), to_char(END_TIME, 'DD-MM-YYYY;HH24:MI'), 
        UNDOTSN, UNDOBLKS, TXNCOUNT, MAXCONCURRENCY AS "MAXCON"
FROM    V$UNDOSTAT WHERE trunc(BEGIN_TIME)=trunc(SYSDATE);

select TO_CHAR(MIN(Begin_Time),'DD-MON-YYYY HH24:MI:SS')
                 "Begin Time",
    TO_CHAR(MAX(End_Time),'DD-MON-YYYY HH24:MI:SS')
                 "End Time",
    SUM(Undoblks)    "Total Undo Blocks Used",
    SUM(Txncount)    "Total Num Trans Executed",
    MAX(Maxquerylen)  "Longest Query(in secs)",
    MAX(Maxconcurrency) "Highest Concurrent TrCount",
    SUM(Ssolderrcnt),
    SUM(Nospaceerrcnt)
from V$UNDOSTAT;


SELECT used_urec FROM v$session s, v$transaction t 
WHERE s.audsid=sys_context('userenv', 'sessionid') and
s.taddr = t.addr;

(used_urec = Used Undo records)

SELECT a.sid, a.username, b.xidusn, b.used_urec, b.used_ublk
FROM v$session a, v$transaction b
WHERE a.saddr = b.ses_addr;


-- --------------------------------
-- 0.5 SOME EXPLANATIONS:
-- --------------------------------


-- explanation of "COMMAND":

1: CREATE TABLE 2: INSERT 3: SELECT 4: CREATE CLUSTER 5: ALTER CLUSTER 6: UPDATE 7: DELETE 8: DROP CLUSTER 
9: CREATE INDEX 10: DROP INDEX 11: ALTER INDEX 12: DROP TABLE 13: CREATE SEQUENCE 14: ALTER SEQUENCE 
15: ALTER TABLE 16: DROP SEQUENCE 17: GRANT 18: REVOKE 19: CREATE SYNONYM 20: DROP SYNONYM 21: CREATE VIEW 
22: DROP VIEW 23: VALIDATE INDEX 24: CREATE PROCEDURE 25: ALTER PROCEDURE 26: LOCK TABLE 27: NO OPERATION 
28: RENAME 29: COMMENT 30: AUDIT 31: NOAUDIT 32: CREATE DATABASE LINK 33: DROP DATABASE LINK 34: CREATE DATABASE 
35: ALTER DATABASE 36: CREATE ROLLBACK SEGMENT 37: ALTER ROLLBACK SEGMENT 38: DROP ROLLBACK SEGMENT 
39: CREATE TABLESPACE 40: ALTER TABLESPACE 41: DROP TABLESPACE 42: ALTER SESSION 43: ALTER USE 44: COMMIT 
45: ROLLBACK 46: SAVEPOINT 47: PL/SQL EXECUTE 48: SET TRANSACTION 49: ALTER SYSTEM SWITCH LOG 50: EXPLAIN 
51: CREATE USER 25: CREATE ROLE 53: DROP USER 54: DROP ROLE 55: SET ROLE 56: CREATE SCHEMA 57: CREATE CONTROL FILE 
58: ALTER TRACING 59: CREATE TRIGGER 60: ALTER TRIGGER 61: DROP TRIGGER 62: ANALYZE TABLE 63: ANALYZE INDEX 
64: ANALYZE CLUSTER 65: CREATE PROFILE 66: DROP PROFILE 67: ALTER PROFILE 68: DROP PROCEDURE 69: DROP PROCEDURE 
70: ALTER RESOURCE COST 71: CREATE SNAPSHOT LOG 72: ALTER SNAPSHOT LOG 73: DROP SNAPSHOT LOG 74: CREATE SNAPSHOT 
75: ALTER SNAPSHOT 76: DROP SNAPSHOT 79: ALTER ROLE 85: TRUNCATE TABLE 86: TRUNCATE COUSTER 88: ALTER VIEW 
91: CREATE FUNCTION 92: ALTER FUNCTION 93: DROP FUNCTION 94: CREATE PACKAGE 95: ALTER PACKAGE 96: DROP PACKAGE 
97: CREATE PACKAGE BODY 98: ALTER PACKAGE BODY 99: DROP PACKAGE BODY 


-- explanation of locks:

Locks:
		0, 'None',           /* Mon Lock equivalent */
		1, 'Null',           /* N */
		2, 'Row-S (SS)',     /* L */
		3, 'Row-X (SX)',     /* R */
		4, 'Share',          /* S */
		5, 'S/Row-X (SRX)',  /* C */
		6, 'Exclusive',      /* X */
		to_char(b.lmode)
                TX:  enqueu, waiting
                TM:  DDL on object
                MR:  Media Recovery

A TX lock is acquired when a transaction initiates its first change and is 
held until the transaction does a COMMIT or ROLLBACK. It is used mainly as 
a queuing mechanism so that other sessions can wait for the transaction to 
complete.

TM Per table locks are acquired during the execution of a transaction 
when referencing a table with a DML statement so that the object is 
not dropped or altered during the execution of the transaction, 
if and only if the dml_locks parameter is non-zero. 

LOCKS: locks op user objects, zoals tables en rows 
LATCH: locks op system objects, zoals shared data structures in memory en data dictionary rows

LOCKS - shared of exclusive
LATCH - altijd exclusive


UL= user locks, geplaats door programmatuur m.b.v. bijvoorbeeld DBMS_LOCK package

DML LOCKS: data manipulatie: table lock, row lock
DDL LOCKS: preserves de struktuur van object (geen simulane DML, DDL statements)

DML locks:

row lock (TX):            voor rows (insert, update, delete)
row lock plus table lock: row lock, maar ook voorkomen DDL statements
table lock (TM): automatisch bij insert, update, delete, ter voorkoming DDL op table

table lock:	S: share lock
		RS: row share
		RSX: row share exlusive
		RX: row exclusive

		X: exclusive (ANDere tansacties kunnen alleen SELECT..)

in V$LOCK lmode column:

0, None 
1, Null (NULL) 
2, Row-S (SS) 
3, Row-X (SX) 
4, Share (S) 
5, S/Row-X (SSX) 
6, Exclusive (X)  


-- Explanation of Waits:

SQL> desc v$system_event;
 Name                           
 ------------------------
 EVENT                          
 TOTAL_WAITS                    
 TOTAL_TIMEOUTS                 
 TIME_WAITED                    
 AVERAGE_WAIT                   
 TIME_WAITED_MICRO              

v$system_event
This view displays the count (total_waits) of all wait events since startup of the instance. 
If timed_statistics is set to true, the sum of the wait times for all events are also displayed 
in the column time_waited. The unit of time_waited is one hundreth of a second. 
Since 10g, an additional column (time_waited_micro) measures wait times in millionth of a second. 
total_waits where event='buffer busy waits' is equal the sum of count in v$waitstat. 
v$enqueue_stat can be used to break down waits on the enqueue wait event. While this view totals all 
events in an instance, v$session

select event, total_waits, time_waited
from v$system_event
where event like '%file%'
Order by total_waits desc;

column c1 heading 'Event|Name'             format a30
column c2 heading 'Total|Waits'            format 999,999,999
column c3 heading 'Seconds|Waiting'        format 999,999
column c4 heading 'Total|Timeouts'         format 999,999,999
column c5 heading 'Average|Wait|(in secs)' format 99.999
  
ttitle 'System-wide Wait Analysis|for current wait events'

select
   event                         c1,
   total_waits                   c2,
   time_waited / 100             c3,
   total_timeouts                c4,
   average_wait    /100          c5
from
   sys.v_$system_event
where
   event not in (
    'dispatcher timer',
    'lock element cleanup',
    'Null event',
    'parallel query dequeue wait',
    'parallel query idle wait - Slaves',
    'pipe get',
    'PL/SQL lock timer',
    'pmon timer',
    'rdbms ipc message',
    'slave wait',
    'smon timer',
    'SQL*Net break/reset to client',
    'SQL*Net message from client',
    'SQL*Net message to client',
    'SQL*Net more data to client',
    'virtual circuit status',
    'WMON goes to sleep'
   )
AND
 event not like 'DFS%'
and
   event not like '%done%'
and
   event not like '%Idle%'
AND
 event not like 'KXFX%'
order by
   c2 desc
;

Create table beg_system_event as select * from v$system_event
Run workload through system or user task
Create table end_system_event as select * from v$system_event
Issue SQL to determine true wait events
drop table beg_system_event;
drop table end_system_event;

SELECT b.event, 
(e.total_waits    - b.total_waits)    total_waits,
(e.total_timeouts - b.total_timeouts) total_timeouts,
(e.time_waited    - b.time_waited)    time_waited
FROM beg_system_event b, 
          end_system_event e
 	WHERE b.event = e.event; 


Cumulative info, after startup:
-------------------------------

SELECT *
    FROM v$system_event
   WHERE event = 'enqueue';

 SELECT * 
    FROM v$sysstat 
   WHERE class=4;

select c.name,a.addr,a.gets,a.misses,a.sleeps, 
a.immediate_gets,a.immediate_misses,a.wait_time, b.pid 
from v$latch a, v$latchholder b, v$latchname c 
where a.addr   = b.laddr(+) and a.latch# = c.latch# 
order by a.latch#; 


-- ---------------------------------------------------------------
-- 0.6. QUICK INFO ON HIT RATIO, SHARED POOL etc.. 
-- ---------------------------------------------------------------

  -- Hit ratio:

SELECT  (1-(pr.value/(dbg.value+cg.value)))*100
FROM    v$sysstat pr, v$sysstat dbg, v$sysstat cg
WHERE   pr.name = 'physical reads'
AND     dbg.name = 'db block gets'
AND     cg.name = 'consistent gets';

SELECT * FROM V$SGA;

  -- free memory shared pool:

SELECT * FROM v$sgastat 
WHERE name = 'free memory';

  -- hit ratio shared pool:

SELECT gethits,gets,gethitratio FROM v$librarycache
WHERE namespace = 'SQL AREA';

SELECT SUM(PINS) "EXECUTIONS",
       SUM(RELOADS) "CACHE MISSES WHILE EXECUTING"
       FROM V$LIBRARYCACHE;

SELECT sum(sharable_mem) FROM v$db_object_cache; 

  -- finding literals in SP:

SELECT substr(sql_text,1,40) "SQL", 
       count(*) , 
       sum(executions) "TotExecs"
FROM v$sqlarea
WHERE executions < 5
GROUP BY substr(sql_text,1,40)
HAVING count(*) > 30
ORDER BY 2;


-- ---------------------------------------
-- 0.7 Quick Table and object information
-- ---------------------------------------

SELECT distinct substr(t.owner, 1, 25), substr(t.table_name,1,50), substr(t.tablespace_name,1,20), 
       t.chain_cnt, t.logging, s.relative_fno
FROM   dba_tables t, dba_segments s
WHERE  t.owner not in ('SYS','SYSTEM', 'OUTLN','DBSNMP','WMSYS','ORDSYS','ORDPLUGINS','MDSYS','CTXSYS','XDB')
AND    t.table_name=s.segment_name
AND    s.segment_type='TABLE'
AND    s.segment_name like 'CI_PAY%';

SELECT substr(segment_name, 1, 30), segment_type, substr(owner, 1, 10),
       extents, initial_extent, next_extent, max_extents
FROM   dba_segments
WHERE  extents > max_extents - 100
AND    owner not in ('SYS','SYSTEM');

SELECT segment_name, owner, tablespace_name, extents
FROM   dba_segments
WHERE  owner='SALES'  -- you use the correct schema here
and    extents > 700;

SELECT owner, substr(object_name, 1, 30), object_type, created, 
       last_ddl_time, status
FROM   dba_objects 
WHERE  created > SYSDATE-1;

SELECT owner, substr(object_name, 1, 30), object_type, created, 
       last_ddl_time, status
FROM   dba_objects 
WHERE  status='INVALID';

Compare 2 owners:
-----------------

select table_name from dba_tables
where owner='MIS_OWNER' and
table_name not in (SELECT table_name from dba_tables where OWNER='MARPAT');

Table and column information:
-----------------------------

select
	substr(table_name, 1, 3) schema
	, table_name
	, column_name
	, substr(data_type,1 ,1) data_type
from
	user_tab_columns
where COLUMN_NAME='ENV_ID'
where
	 table_name like 'ALG%'
	 or table_name like 'STG%' 
	 or table_name like 'ODS%' 
	 or table_name like 'DWH%' 
	 or table_name like 'MKM%' 
order by 
	  decode(substr(table_name, 1, 3), 'ALG', 10, 'STG', 20, 'ODS', 30, 'DWH', 40, 'MKM', 50, 60)
	  , table_name
	  , column_id


Check on existence of JServer:
------------------------------

select count(*) from all_objects where object_name = 'DBMS_JAVA';
should return a count of 3


-- --------------------------------------
-- 0.8 QUICK INFO ON PRODUCT INFORMATION:
-- --------------------------------------
ersa
SELECT * FROM PRODUCT_COMPONENT_VERSION;
SELECT * FROM NLS_DATABASE_PARAMETERS;
SELECT * FROM NLS_SESSION_PARAMETERS;
SELECT * FROM NLS_INSTANCE_PARAMETERS;
SELECT * FROM V$OPTION;
SELECT * FROM V$LICENSE;
SELECT * FROM V$VERSION;


  Oracle RDBMS releases:
  ----------------------

9.2.0.1 is the terminal release for Oracle 9i. Rel 2.
        Normally it's patched to 9.2.0.4.
        As from october patches 9.2.0.5 and little later 9.2.0.6 were available

        9.2.0.4 is patch ID 3095277. 

9.0.1.4 is the terminal release for Oracle 9i Rel. 1.
8.1.7   is the terminal release for Oracle8i. Additional patchsets exists.
8.0.6   is the terminal release for Oracle8.  Additional patchsets exists.
7.3.4   is the terminal release for Oracle7.  Additional patchsets exists.

  IS ORACLE 32BIT or 64BIT? 
  -------------------------

Starting with version 8, Oracle began shipping 64bit versions of it's RDBMS product on UNIX platforms 
that support 64bit software. IMPORTANT:  64bit Oracle can only be installed on Operating Systems that are 64bit enabled.   
In general, if Oracle is 64bit, '64bit' will be displayed on the opening banners of Oracle executables 
such as 'svrmgrl', 'exp' and 'imp'.  It will also be displayed in the  headers of Oracle trace files. 
Otherwise if '64bit' is not display at these locations, it can be assumed that Oracle is 32bit. 

or

From the OS level:  % cd $ORACLE_HOME/bin  % file oracle    ...if 64bit, '64bit' will be indicated. 


  To verify the wordsize of a downloaded patchset: 
  ------------------------------------------------
The filename of the downloaded patchset usually dictates which version and  wordsize of Oracle 
it should be applied against. For instance: p1882450_8172_SOLARIS64.zip   is the 8.1.7.2 patchset for 64bit  
Oracle on Solaris.  Also refer to the README that is included with the patch or patch set and  this Note: 


  Win2k Server Certifications:
  ----------------------------
OS Product Certified With Version Status Addtl. Info. Components Other Install Issue 
2000 10g N/A N/A Certified Yes None None None 
2000 9.2 32-bit -Opteron N/A N/A Certified Yes None None None 
2000 9.2 N/A N/A Certified Yes None None None 
2000 9.0.1 N/A N/A Desupported Yes None N/A N/A 
2000 8.1.7 (8i) N/A N/A Desupported Yes None N/A N/A 
2000 8.1.6 (8i) N/A N/A Desupported Yes None N/A N/A 
2000, Beta 3 8.1.5 (8i) N/A N/A Withdrawn Yes N/A N/A N/A 

  Solaris Server certifications:
  ------------------------------
Server Certifications
OS Product Certified With Version Status Addtl. Info. Components Other Install Issue 
9 10g 64-bit N/A N/A Certified Yes None None None 
8 10g 64-bit N/A N/A Certified Yes None None None 
10 10g 64-bit N/A N/A Projected None  N/A N/A N/A 
9 9.2 64-bit N/A N/A Certified Yes None None None 
8 9.2 64-bit N/A N/A Certified Yes None None None 
10 9.2 64-bit N/A N/A Projected None  N/A N/A N/A 
2.6 9.2 N/A N/A Certified Yes None None None 
9 9.2 N/A N/A Certified Yes None None None 
8 9.2 N/A N/A Certified Yes None None None 
7 9.2 N/A N/A Certified Yes None None None 
10 9.2 N/A N/A Projected None  N/A N/A N/A 
9 9.0.1 64-bit N/A N/A Desupported Yes None N/A N/A 
8 9.0.1 64-bit N/A N/A Desupported Yes None N/A N/A 
2.6  9.0.1 N/A N/A Desupported Yes None N/A N/A 
9 9.0.1 N/A N/A Desupported Yes None N/A N/A 
8 9.0.1 N/A N/A Desupported Yes None N/A N/A 
7  9.0.1 N/A N/A Desupported Yes None N/A N/A 
9 8.1.7 (8i) 64-bit N/A N/A Desupported Yes None N/A N/A 
8 8.1.7 (8i) 64-bit N/A N/A Desupported Yes None N/A N/A 
2.6 8.1.7 (8i) N/A N/A Desupported Yes None N/A N/A 
9 8.1.7 (8i) N/A N/A Desupported Yes None N/A N/A 
8 8.1.7 (8i) N/A N/A Desupported Yes None N/A N/A 
7 8.1.7 (8i) N/A N/A Desupported Yes None N/A N/A 
everything below: desupported


  Oracle clients:
  ---------------

		Server Version								
Client Version	10.1.0	9.2.0	9.0.1	8.1.7	8.1.6	8.1.5	8.0.6	8.0.5	7.3.4
10.1.0		Yes	Yes	Was	Yes #2	No	No	No	No	No
9.2.0		Yes	Yes	Was	Yes	No	No	Was	No	No #1
9.0.1		Was	Was	Was	Was	Was	No	Was	No	Was
8.1.7		Yes	Yes	Was	Yes	Was	Was	Was	Was	Was
8.1.6		No	No	Was	Was	Was	Was	Was	Was	Was
8.1.5		No	No	No	Was	Was	Was	Was	Was	Was
8.0.6		No	Was	Was	Was	Was	Was	Was	Was	Was
8.0.5		No	No	No	Was	Was	Was	Was	Was	Was
7.3.4		No	Was	Was	Was	Was	Was	Was	Was	Was


-- -----------------------------------------------------
-- 0.9 QUICK INFO WITH REGARDS LOGS AND BACKUP RECOVERY:
-- -----------------------------------------------------

SELECT * from V$BACKUP;

SELECT file#, substr(name, 1, 30), status, checkpoint_change#            -- uit controlfile
FROM V$DATAFILE;

SELECT d.file#, d.status, d.checkpoint_change#, b.status, b.CHANGE#,
       to_char(b.TIME,'DD-MM-YYYY;HH24:MI'), substr(d.name, 1, 40)
FROM V$DATAFILE d, V$BACKUP b
WHERE d.file#=b.file#;  

SELECT file#, substr(name, 1, 30), status, fuzzy, checkpoint_change#      -- uit file header
FROM V$DATAFILE_HEADER;

SELECT first_change#, next_change#, sequence#, archived, substr(name, 1, 40),
COMPLETION_TIME, FIRST_CHANGE#, FIRST_TIME      
FROM V$ARCHIVED_LOG
WHERE COMPLETION_TIME > SYSDATE -2;

SELECT recid, first_change#, sequence#, next_change# 
FROM V$LOG_HISTORY;

SELECT resetlogs_change#, checkpoint_change#, controlfile_change#, open_resetlogs
FROM V$DATABASE;

SELECT * FROM  V$RECOVER_FILE  -- Which file needs recovery


-- ----------------------------------------------------------------------------
-- 0.10 QUICK INFO WITH REGARDS TO TABLESPACES, DATAFILES, REDO LOGFILES etc..:
-- -----------------------------------------------------------------------------

  -- online redo log informatie: V$LOG, V$LOGFILE:

SELECT l.group#, l.members, l.status, l.bytes, substr(lf.member, 1, 50)
FROM   V$LOG l, V$LOGFILE lf
WHERE  l.group#=lf.group#;

SELECT THREAD#, SEQUENCE#, FIRST_CHANGE#, FIRST_TIME, 
       to_char(FIRST_TIME, 'DD-MM-YYYY;HH24:MI')
FROM   V$LOG_HISTORY;
-- WHERE SEQUENCE#

SELECT GROUP#, ARCHIVED, STATUS FROM V$LOG;


 -- tablespace free-used:

SELECT Total.name "Tablespace Name",
       Free_space, (total_space-Free_space) Used_space, total_space
FROM
  (SELECT tablespace_name, sum(bytes/1024/1024) Free_Space
     FROM sys.dba_free_space
    GROUP BY tablespace_name
  ) Free,
  (SELECT b.name,  sum(bytes/1024/1024) TOTAL_SPACE
     FROM sys.v_$datafile a, sys.v_$tablespace B
    WHERE a.ts# = b.ts#
    GROUP BY b.name
  ) Total
WHERE Free.Tablespace_name = Total.name;

SELECT substr(file_name, 1, 70), tablespace_name FROM dba_data_files;


----------------------------------------------
-- AUDIT Statements:
----------------------------------------------


select v.sql_text, v.FIRST_LOAD_TIME, v.PARSING_SCHEMA_ID, b.username from
v$sqlarea v, dba_users b
where v.FIRST_LOAD_TIME > '2006-08-20'
and
v.PARSING_SCHEMA_ID=b.user_id
order by v.FIRST_LOAD_TIME ;

-----------------------------------------------
-- EXAMPLE OF DYNAMIC SQL:
-----------------------------------------------

select 'UPDATE '||t.table_name||' SET '||c.column_name||'=REPLACE('||c.column_name||','''',CHR(7));'
from user_tab_columns c, user_tables t
where c.table_name=t.table_name and t.num_rows>0 and c.DATA_LENGTH>10
and data_type like '%CHAR%'
ORDER BY t.table_name desc;

create public synonym EMPLOYEE for HARRY.EMPLOYEE;

select 'create public synonym '||table_name||' for CISADM.'||table_name||';'
from dba_tables where owner='CISADM';

select 'GRANT SELECT, INSERT, UPDATE, DELETE ON '||table_name||' TO CISUSER;' 
from dba_tables where owner='CISADM';

select 'GRANT SELECT ON '||table_name||' TO CISREAD;' 
from dba_tables where owner='CISADM';


select 'UPDATE '||table_name||' SET ENV_ID=854765 '||';' from user_tables where
table_name in (select table_name from user_tab_columns
where COLUMN_NAME='ENV_ID');

select 'GRANT SELECT, INSERT, UPDATE, 
DELETE ON '||table_name from user_tables||' TO CISUSER;'


SQL> connect piet/piet@SPLCONF2
Connected.
SQL> create table EMP
  2  (
  3  id number);

Table created.

SQL> connect gerrit/gerrit@SPLCONF2
Connected.
SQL>  create table EMP
  2   (
  3   id number);

Table created.

create public synonym for gerrit.emp;
lukt

create user jaap identified by jaap
default tablespace CISTS_01
temporary tablespace temp;

grant resource to jaap;
grant connect to jaap;

SQL> create table emp
  2  (
  3  id number);

Table created.

SQL> drop table emp;

Table dropped.

SQL> connect system/cygnusx1@SPLCONF2
Connected.
SQL> create table jaap.emp as select * from piet.emp;

Table created.


----
Volgens Albert: dit kun je gebruiken:
set linesize 1000
set pagesize 1000
set trimspool on

select 'TRUNCATE TABLE '||table_name||';' from user_tables
where table_name like 'CM_%';

note 1: the % is a Oracle wildcard.

note 2: even een nieuwe (!) kopie table maken op basis van een bestaande table, bijv

CREATE TABLE EMPLOYEE_2
AS SELECT * FROM EMPLOYEE

note 3: even een nieuwe (!) kopie table maken op basis van een bestaande table in een ANDERE DATABASE, bijv

eerst een database link maken in database A, die wijst naar database B

CREATE PUBLIC DATABASE LINK MY_LINK
CONNECT TO CISADM IDENTIFIED BY j0llycoffee
USING 'SPLTST2';

SELECT * FROM employee@MY_LINK;

create table Y as SELECT * FROM employee@MY_LINK;

table leegmaken:

TRUNCATE TABLE EMPLOYEE;


----


========================
1. NOTES ON PERFORMANCE:
=========================


1.1 POOLS:
==========

-- SHARED POOL:
-- ------------

A literal SQL statement is considered as one which uses literals in the predicate/s rather than bind variables 
where the value of the literal is likely to differ between various executions of the statement. 
Eg 1: 
  SELECT * FROM emp WHERE ename='CLARK';
    is used by the application instead of
  SELECT * FROM emp WHERE ename=:bind1;
SQL statement for this article as it can be shared.

-- Hard Parse
If a new SQL statement is issued which does not exist in the shared pool then this has to be parsed fully. 
Eg: Oracle has to allocate memory for the statement from the shared pool, check the statement syntactically 
and semantically etc... This is referred to as a hard parse and is very expensive in both terms of CPU used 
and in the number of latch gets performed. 

--Soft Parse
If a session issues a SQL statement which is already in the shared pool AND it can use an existing version 
of that statement then this is known as a 'soft parse'. 
As far as the application is concerned it has asked to parse the statement. 

if two statements are textually identical but cannot be shared then these are called 'versions' of the same statement. 
If Oracle matches to a statement with many versions it has to check each version in turn to see 
if it is truely identical to the statement currently being parsed. Hence high version counts are best avoided.

The best approach to take is that all SQL should be sharable unless it is adhoc or infrequently used SQL where 
it is important to give CBO as much information as possible in order for it to produce a good execution plan.  

--Eliminating Literal SQL
If you have an existing application it is unlikely that you could eliminate all literal SQL but you should 
be prepared to eliminate some if it is causing problems. By looking at the V$SQLAREA view it is possible 
to see which literal statements are good candidates for converting to use bind variables. The following query 
shows SQL in the SGA where there are a large number of similar statements: 

SELECT substr(sql_text,1,40) "SQL", 
       count(*) , 
       sum(executions) "TotExecs"
FROM v$sqlarea
WHERE executions < 5
GROUP BY substr(sql_text,1,40)
HAVING count(*) > 30
ORDER BY 2;

The values 40,5 and 30 are example values so this query is looking for different statements whose first 
40 characters are the same which have only been executed a few times each and there are at least 30 different 
occurrances in the shared pool. This query uses the idea it is common for literal statements to begin 
"SELECT col1,col2,col3 FROM table WHERE ..." with the leading portion of each statement being the same. 

--Avoid Invalidations
Some specific orders will change the state of cursors to INVALIDATE. These orders modify directly 
the context of related objects associated with cursors. That's orders are TRUNCATE, ANALYZE or DBMS_STATS.GATHER_XXX 
on tables or indexes, grants changes on underlying objects. The associated cursors will stay in the SQLAREA but when it 
will be reference next time, it should be reloaded and reparsed fully, so the global performance will be impacted. 

The following query could help us to better identify the concerned cursors: 

SELECT substr(sql_text, 1, 40) "SQL", invalidations from v$sqlarea
order by invalidations DESC;

-- CURSOR_SHARING parameter (8.1.6 onwards)
<Parameter:CURSOR_SHARING> is a new parameter introduced in Oracle8.1.6. It should be used with caution in this release. 
If this parameter is set to FORCE then literals will be replaced by system generated bind variables where possible. 
For multiple similar statements which differ only in the literals used this allows the cursors to be shared 
even though the application supplied SQL uses literals. 
The parameter can be set dynamically at the system or session level thus: 
ALTER SESSION SET cursor_sharing = FORCE;
       or
ALTER SYSTEM SET cursor_sharing = FORCE;
or it can be set in the init.ora file. 
Note: As the FORCE setting causes system generated bind variables to be used in place of literals, a different execution 
plan may be chosen by the cost based optimizer (CBO) as it no longer has the literal values available to it 
when costing the best execution plan. 
In Oracle9i, it is possible to set CURSOR_SHARING=SIMILAR. SIMILAR causes statements that may differ 
in some literals, but are otherwise identical, to share a cursor, unless the literals affect either the meaning 
of the statement or the degree to which the plan is optimized. This enhancement improves the usability of the parameter 
for situations where FORCE would normally cause a different, undesired execution plan. 
With CURSOR_SHARING=SIMILAR, Oracle determines which literals are "safe" for substitution with bind variables. 
This will result in some SQL not being shared in an attempt to provide a more efficient execution plan. 

-- SESSION_CACHED_CURSORS parameter
<Parameter:SESSION_CACHED_CURSORS> is a numeric parameter which can be set at instance level or at session level 
using the command: 
        ALTER SESSION SET session_cached_cursors = NNN;
The value NNN determines how many 'cached' cursors there can be in your session. 
Whenever a statement is parsed Oracle first looks at the statements pointed to by your private session cache - 
if a sharable version of the statement exists it can be used. This provides a shortcut access to frequently parsed 
statements that uses less CPU and uses far fewer latch gets than a soft or hard parse. 

To get placed in the session cache the same statement has to be parsed 3 times within the same cursor - a pointer to the 
shared cursor is then added to your session cache. If all session cache cursors are in use then the least recently 
used entry is discarded. 
If you do not have this parameter set already then it is advisable to set it to a starting value of about 50. 
The statistics section of the bstat/estat report includes a value for 'session cursor cache hits' which shows 
if the cursor cache is giving any benefit. The size of the cursor cache can then be increased or decreased as necessary. 
SESSION_CACHED_CURSORS are particularly useful with Oracle Forms applications when forms are frequently opened and closed.

-- SHARED_POOL_RESERVED_SIZE parameter
There are quite a few notes explaining <Parameter:SHARED_POOL_RESERVED_SIZE> already in circulation. The parameter 
was introduced in Oracle 7.1.5 and provides a means of reserving a portion of the shared pool for large memory allocations. 
The reserved area comes out of the shared pool itself. 
From a practical point of view one should set SHARED_POOL_RESERVED_SIZE to about 10% of SHARED_POOL_SIZE unless either 
the shared pool is very large OR SHARED_POOL_RESERVED_MIN_ALLOC has been set lower than the default value: 

  
If the shared pool is very large then 10% may waste a significant amount of memory when a few Mb will suffice. 
If SHARED_POOL_RESERVED_MIN_ALLOC has been lowered then many space requests may be eligible to be satisfied 
from this portion of the shared pool and so 10% may be too little. 
It is easy to monitor the space usage of the reserved area using the <View:V$SHARED_POOL_RESERVED> 
which has a column FREE_SPACE. 

-- SHARED_POOL_RESERVED_MIN_ALLOC parameter
In Oracle8i this parameter is hidden. 
SHARED_POOL_RESERVED_MIN_ALLOC should generally be left at its default value, although in certain cases values 
of 4100 or 4200 may help relieve some contention on a heavily loaded shared pool.  

-- SHARED_POOL_SIZE parameter
<Parameter:SHARED_POOL_SIZE> controls the size of the shared pool itself. The size of the shared pool can 
impact performance. If it is too small then it is likely that sharable information will be flushed from the pool 
and then later need to be reloaded (rebuilt). If there is heavy use of literal SQL and the shared pool is too large then 
over time a lot of small chunks of memory can build up on the internal memory freelists causing the shared pool latch 
to be held for longer which in-turn can impact performance. In this situation a smaller shared pool may perform better 
than a larger one. This problem is greatly reduced in 8.0.6 and in 8.1.6 onwards due to the enhancement in <bug:986149> . 
NB: The shared pool itself should never be made so large that paging or swapping occur as performance 
can then decrease by many orders of magnitude. 

-- _SQLEXEC_PROGRESSION_COST parameter (8.1.5 onwards)
This is a hidden parameter which was introduced in Oracle 8.1.5. The parameter is included here as 
the default setting has caused some problems with SQL sharability. Setting this parameter to 0 can avoid these 
issues which result in multiple versions statements in the shared pool. 
Eg: Add the following to the init.ora file 
        # _SQLEXEC_PROGRESSION_COST is set to ZERO to avoid SQL sharing issues

        # See Note:62143.1 for details

        _sqlexec_progression_cost=0
Note that a side effect of setting this to '0' is that the V$SESSION_LONGOPS view is not populated by long running queries.  

-- MTS, Shared Server and XA
The multi-threaded server (MTS) adds to the load on the shared pool and can contribute to any problems as the User Global Area (UGA) 
resides in the shared pool. This is also true of XA sessions in Oracle7 as their UGA is located in the shared pool. (In Oracle8/8i XA sessions 
do NOT put their UGA in the shared pool). In Oracle8 the Large Pool can be used for MTS reducing its impact on shared pool activity
- However memory allocations in the Large Pool still make use of the "shared pool latch". 
See <Note:62140.1> for a description of the Large Pool. 
Using dedicated connections rather than MTS causes the UGA to be allocated out of process private memory rather 
than the shared pool. Private memory allocations do not use the "shared pool latch" and so a switch from MTS to 
dedicated connections can help reduce contention in some cases. 

In Oracle9i, MTS was renamed to "Shared Server". For the purposes of the shared pool, the behaviour is essentially the same.  


Useful SQL for looking at memory and Shared Pool problems
---------------------------------------------------------

Indeling SGA:
-------------

SELECT * FROM V$SGA;

free memory shared pool:
------------------------

SELECT * FROM v$sgastat 
WHERE name = 'free memory';

hit ratio shared pool:
----------------------

SELECT gethits,gets,gethitratio FROM v$librarycache
WHERE namespace = 'SQL AREA';

SELECT SUM(PINS) "EXECUTIONS",
	SUM(RELOADS) "CACHE MISSES WHILE EXECUTING"
	FROM V$LIBRARYCACHE;


SELECT sum(sharable_mem) FROM v$db_object_cache; 


statistics:
-----------

SELECT class, value, name     
FROM v$sysstat;    


Executions:
-----------

SELECT substr(sql_text,1,50) "SQL", 
  	 count(*) , 
	 sum(executions) "TotExecs"
    FROM v$sqlarea
   WHERE executions > 5
   GROUP BY substr(sql_text,1,50)
  HAVING count(*) > 10
   ORDER BY 2
  ;

The values 40,5 and 30 are example values so this query is looking for 
different statements whose first 40 characters are the same 
which have only been executed a few times each and there are at least 30 different 
occurrances in the shared pool. This query uses the idea it is common for literal statements to begin 
"SELECT col1,col2,col3 FROM table WHERE ..." with the leading portion of each statement being the same. 

V$SQLAREA:

SQL_TEXT
 VARCHAR2(1000)
 First thousand characters of the SQL text for the current cursor
 
SHARABLE_MEM
 NUMBER
 Amount of shared memory used by a cursor. If multiple child cursors exist, then the sum of all 
 shared memory used by all child cursors.
 
PERSISTENT_MEM
 NUMBER
 Fixed amount of memory used for the lifetime of an open cursor. If multiple child cursors exist, 
 the fixed sum of memory used for the lifetime of all the child cursors.
 
RUNTIME_MEM
 NUMBER
 Fixed amount of memory required during execution of a cursor. If multiple child cursors exist, 
 the fixed sum of all memory required during execution of all the child cursors.
 
SORTS
 NUMBER
 Sum of the number of sorts that were done for all the child cursors
 
VERSION_COUNT
 NUMBER
 Number of child cursors that are present in the cache under this parent
 
LOADED_VERSIONS
 NUMBER
 Number of child cursors that are present in the cache and have their context heap (KGL heap 6) loaded
 
OPEN_VERSIONS
 NUMBER
 The number of child cursors that are currently open under this current parent
 
USERS_OPENING
 NUMBER
 The number of users that have any of the child cursors open
 
FETCHES
 NUMBER
 Number of fetches associated with the SQL statement
 
EXECUTIONS
 NUMBER
 Total number of executions, totalled over all the child cursors
 
USERS_EXECUTING
 NUMBER
 Total number of users executing the statement over all child cursors
 
LOADS
 NUMBER
 The number of times the object was loaded or reloaded
 
FIRST_LOAD_TIME
 VARCHAR2(19)
 Timestamp of the parent creation time
 
INVALIDATIONS
 NUMBER
 Total number of invalidations over all the child cursors
 
PARSE_CALLS
 NUMBER
 The sum of all parse calls to all the child cursors under this parent
 
DISK_READS
 NUMBER
 The sum of the number of disk reads over all child cursors
 
BUFFER_GETS
 NUMBER
 The sum of buffer gets over all child cursors
 
ROWS_PROCESSED
 NUMBER
 The total number of rows processed on behalf of this SQL statement
 
COMMAND_TYPE
 NUMBER
 The Oracle command type definition
 
OPTIMIZER_MODE
 VARCHAR2(10)
 Mode under which the SQL statement is executed
 
PARSING_USER_ID
 NUMBER
 The user ID of the user that has parsed the very first cursor under this parent
 
PARSING_SCHEMA_ID
 NUMBER
 The schema ID that was used to parse this child cursor
 
KEPT_VERSIONS
 NUMBER
 The number of child cursors that have been marked to be kept using the DBMS_SHARED_POOL package
 
ADDRESS
 RAW(4)
 The address of the handle to the parent for this cursor
 
HASH_VALUE
 NUMBER
 The hash value of the parent statement in the library cache
 
MODULE
 VARCHAR2(64)
 Contains the name of the module that was executing at the time that the SQL statement was first parsed as set 
 by calling DBMS_APPLICATION_INFO.SET_MODULE
 
MODULE_HASH
 NUMBER
 The hash value of the module that is named in the MODULE column
 
ACTION
 VARCHAR2(64)
 Contains the name of the action that was executing at the time that the SQL statement was first parsed 
 as set by calling DBMS_APPLICATION_INFO.SET_ACTION
 
ACTION_HASH
 NUMBER
 The hash value of the action that is named in the ACTION column
 
SERIALIZABLE_ABORTS
 NUMBER
 Number of times the transaction fails to serialize, producing ORA-08177 errors, totalled over all the child cursors
 
IS_OBSOLETE
 VARCHAR2(1)
 Indicates whether the cursor has become obsolete (Y) or not (N). This can happen if the number of child cursors 
 is too large.
 
CHILD_LATCH
 NUMBER
 Child latch number that is protecting the cursor
 

V$SQL:
------

V$SQL lists statistics on shared SQL area without the GROUP BY clause and contains one row for each child 
of the original SQL text entered.

Column Datatype Description 
SQL_TEXT
 VARCHAR2(1000)
 First thousand characters of the SQL text for the current cursor
 
SHARABLE_MEM
 NUMBER
 Amount of shared memory used by this child cursor (in bytes)
 
PERSISTENT_MEM
 NUMBER
 Fixed amount of memory used for the lifetime of this child cursor (in bytes)
 
RUNTIME_MEM
 NUMBER
 Fixed amount of memory required during the execution of this child cursor
 
SORTS
 NUMBER
 Number of sorts that were done for this child cursor
 
LOADED_VERSIONS
 NUMBER
 Indicates whether the context heap is loaded (1) or not (0)
 
OPEN_VERSIONS
 NUMBER
 Indicates whether the child cursor is locked (1) or not (0)
 
USERS_OPENING
 NUMBER
 Number of users executing the statement
 
FETCHES
 NUMBER
 Number of fetches associated with the SQL statement
 
EXECUTIONS
 NUMBER
 Number of executions that took place on this object since it was brought into the library cache
 
USERS_EXECUTING
 NUMBER
 Number of users executing the statement
 
LOADS
 NUMBER
 Number of times the object was either loaded or reloaded
 
FIRST_LOAD_TIME
 VARCHAR2(19)
 Timestamp of the parent creation time
 
INVALIDATIONS
 NUMBER
 Number of times this child cursor has been invalidated
 
PARSE_CALLS
 NUMBER
 Number of parse calls for this child cursor
 
DISK_READS
 NUMBER
 Number of disk reads for this child cursor
 
BUFFER_GETS
 NUMBER
 Number of buffer gets for this child cursor
 
ROWS_PROCESSED
 NUMBER
 Total number of rows the parsed SQL statement returns
 
COMMAND_TYPE
 NUMBER
 Oracle command type definition
 
OPTIMIZER_MODE
 VARCHAR2(10)
 Mode under which the SQL statement is executed
 
OPTIMIZER_COST
 NUMBER
 Cost of this query given by the optimizer
 
PARSING_USER_ID
 NUMBER
 User ID of the user who originally built this child cursor
 
PARSING_SCHEMA_ID
 NUMBER
 Schema ID that was used to originally build this child cursor
 
KEPT_VERSIONS
 NUMBER
 Indicates whether this child cursor has been marked to be kept pinned in the cache using the DBMS_SHARED_POOL package
 
ADDRESS
 RAW(4)
 Address of the handle to the parent for this cursor
 
TYPE_CHK_HEAP
 RAW(4)
 Descriptor of the type check heap for this child cursor
 
HASH_VALUE
 NUMBER
 Hash value of the parent statement in the library cache
 
PLAN_HASH_VALUE
 NUMBER
 Numerical representation of the SQL plan for this cursor. Comparing one PLAN_HASH_VALUE to another easily 
 identifies whether or not two plans are the same (rather than comparing the two plans line by line).
 
CHILD_NUMBER
 NUMBER
 Number of this child cursor
 
MODULE
 VARCHAR2(64)
 Contains the name of the module that was executing at the time that the SQL statement was first parsed, 
 which is set by calling DBMS_APPLICATION_INFO.SET_MODULE
 
MODULE_HASH
 NUMBER
 Hash value of the module listed in the MODULE column
 
ACTION
 VARCHAR2(64)
 Contains the name of the action that was executing at the time that the SQL statement was first parsed, 
 which is set by calling DBMS_APPLICATION_INFO.SET_ACTION
 
ACTION_HASH
 NUMBER
 Hash value of the action listed in the ACTION column
 
SERIALIZABLE_ABORTS
 NUMBER
 Number of times the transaction fails to serialize, producing ORA-08177 errors, per cursor
 
OUTLINE_CATEGORY
 VARCHAR2(64)
 If an outline was applied during construction of the cursor, then this column displays the category 
 of that outline. Otherwise the column is left blank.
 
CPU_TIME
 NUMBER
 CPU time (in microseconds) used by this cursor for parsing/executing/fetching
 
ELAPSED_TIME
 NUMBER
 Elapsed time (in microseconds) used by this cursor for parsing/executing/fetching
 
OUTLINE_SID
 NUMBER
 Outline session identifier
 
CHILD_ADDRESS
 RAW(4)
 Address of the child cursor
 
SQLTYPE
 NUMBER
 Denotes the version of the SQL language used for this statement
 
REMOTE
 VARCHAR2(1)
 (Y/N) Identifies whether the cursor is remote mapped or not
 
OBJECT_STATUS
 VARCHAR2(19)
 Status of the cursor (VALID/INVALID)
 
LITERAL_HASH_VALUE
 NUMBER
 Hash value of the literals which are replaced with system-generated bind variables and are to be matched, 
 when CURSOR_SHARING is used. This is not the hash value for the SQL statement. If CURSOR_SHARING is not used, 
 then the value is 0.
 
LAST_LOAD_TIME
 VARCHAR2(19)
  
IS_OBSOLETE
 VARCHAR2(1)
 Indicates whether the cursor has become obsolete (Y) or not (N). This can happen if the number of child cursors 
 is too large.
 
CHILD_LATCH
 NUMBER
 Child latch number that is protecting the cursor
 

Checking for high version counts: 
--------------------------------
        
SELECT address, hash_value,
                version_count ,
                users_opening ,
                users_executing,
                substr(sql_text,1,40) "SQL"
          FROM v$sqlarea
         WHERE version_count > 10
        ;

"Versions" of a statement occur where the SQL is character for character identical but the underlying objects or binds
 etc.. are different.

Finding statement/s which use lots of shared pool memory: 
--------------------------------------------------------

SELECT substr(sql_text,1,40) "Stmt", count(*),
                sum(sharable_mem)    "Mem",
                sum(users_opening)   "Open",
                sum(executions)      "Exec"
          FROM v$sql
         GROUP BY substr(sql_text,1,40)
        HAVING sum(sharable_mem) > 10000
        ;
where MEMSIZE is about 10% of the shared pool size in bytes. This should show if there are similar literal statements, 
or multiple versions of a statements which account for a large portion of the memory in the shared pool.


1.2 statistics:
---------------

- Rule based / Cost based 
- apply EXPLAIN PLAN in query

- ANALYZE COMMAND:

ANALYZE TABLE EMPLOYEE COMPUTE STATISTICS;
ANALYZE TABLE EMPLOYEE COMPUTE STATISTICS FOR ALL INDEXES;
ANALYZE INDEX scott.indx1 COMPUTE STATISTICS;
ANALYZE TABLE EMPLOYEE ESTIMATE STATISTICS SAMPLE 10 PERCENT;
ALTER TABLE EMPLOYEE DELETE STATISTICS;

- DBMS_UTILITY.ANALYZE_SCHEMA() procedure:

DBMS_UTILITY.ANALYZE_SCHEMA (
   schema           VARCHAR2, 
   method           VARCHAR2, 
   estimate_rows    NUMBER   DEFAULT NULL, 
   estimate_percent NUMBER   DEFAULT NULL, 
   method_opt       VARCHAR2 DEFAULT NULL);

DBMS_UTILITY.ANALYZE_DATABASE (
   method           VARCHAR2, 
   estimate_rows    NUMBER   DEFAULT NULL, 
   estimate_percent NUMBER   DEFAULT NULL, 
   method_opt       VARCHAR2 DEFAULT NULL);

method=compute, estimate, delete

To exexcute:

exec DBMS_UTILITY.ANALYZE_SCHEMA('CISADM','COMPUTE');


1.3 Storage parameters:
-----------------------

segement: pctfree, pctused, number AND size of extends in STORAGE clause

   - very low updates   : pctfree low
   - if updates, oltp   : pctfree 10, pctused 40
   - if only inserts    : pctfree low


1.4 rebuild indexes on regular basis:
-----------------------------------------

alter index SCOTT.EMPNO_INDEX rebuild
tablespace INDEX
storage (initial 5M next 5M pctincrease 0);

You should next use the ANALYZE TABLE COMPUTE STATISTICS command

1.5 Is an index used in a query?:
---------------------------------

De WHERE clause of a query must use the 'leading column' of (one of the) index(es):
Suppose an index 'indx1' exists on EMPLOYEE(city, state, zip)

   Suppose a user issues the query:
   SELECT .. FROM EMPLOYEE WHERE state='NY'

Then this query will not use that index!
Therfore you must pay attention to the cardinal column of any index.


1.6 set transaction parameters:
-------------------------------

ONLY ORACLE 7,8,8i:

Suppose you must perform an action which will generate a lot
of redo and rollback. 
If you want to influence which rollback segment will be used
in your transactions, you can use the statement

  set transaction use rollback segment SEGMENT_NAME


1.7 Reduce fragmentation of a dictionary managed tablespace:
------------------------------------------------------------

  alter tablespace DATA coalesce;


1.8 normalisation of tables:
----------------------------

  The more tables are 'normalized', the higher the performance costs for
  queries joining tables


1.9 commits na zoveel rows:
----------------------------

declare
  i number := 0;
  cursor s1 is SELECT * FROM tab1 WHERE col1 = 'value1'
               FOR UPDATE;
begin
  for c1 in s1 loop
      update tab1 set col1 = 'value2'
             WHERE current of s1;

      i := i + 1;              -- Commit after every X records
      if i > 1000 then
         commit;
         i := 0;
      end if;

  end loop;
  commit;
end;
/


-- ------------------------------

CREATE TABLE TEST
(
ID    NUMBER(10)    NULL,
DATUM DATE          NULL,
NAME  VARCHAR2(10)  NULL
);

declare
  i number := 1000;
begin
  while i>1 loop
       insert into TEST
        values (1, sysdate+i,'joop');

      i := i - 1;
         commit;
         
  end loop;
  commit;
end;
/

-- ------------------------------

CREATE TABLE TEST2
(
i     number        NULL,
ID    NUMBER(10)    NULL,
DATUM DATE          NULL,
DAG   VARCHAR2(10)  NULL,
NAME  VARCHAR2(10)  NULL
);

declare
  i number := 1;
  j date;
  k varchar2(10);
begin
  while i<1000000 loop
       j:=sysdate+i;
       k:=TO_CHAR(SYSDATE+i,'DAY');
       insert into TEST2
        values (i,1, j, k,'joop');

      i := i + 1;
         commit;
         
  end loop;
  commit;
end;
/


-- ------------------------------

CREATE TABLE TEST3
(
ID    NUMBER(10)    NULL,
DATUM DATE          NULL,
DAG   VARCHAR2(10)  NULL,
VORIG VARCHAR2(10)  NULL,
NAME  VARCHAR2(10)  NULL
);


declare
  i number := 1;
  j date;
  k varchar2(10);
  l varchar2(10);
begin
  while i<1000 loop
       j:=sysdate+i;
       k:=TO_CHAR(SYSDATE+i,'DAY');
       l:=TO_CHAR(SYSDATE+i-1,'DAY');
       insert into TEST3
       (ID,DATUM,DAG,VORIG,NAME)
        values (i, j, k, l,'joop');

      i := i + 1;
         commit;
         
  end loop;
  commit;
end;
/

1.10 explain plan commAND, autotrace:
-------------------------------------

1 explain plan commAND: 
-----------------------

First execute the utlxplan.sql script. 
This script will create the PLAN_TABLE table, needed for storage of performance data.
Now it's possible to do the following:

-- optionally, delete the former performance data
DELETE FROM plan_table WHERE statement_id = 'XXX'; COMMIT;

-- now you can run the query that is to be analyzed
EXPLAIN PLAN SET STATEMENT_ID = 'XXX' 
FOR 
SELECT * FROM EMPLOYEE WHERE city > 'Y%';

To view results, you can use the utlxpls.sql script.


2. set autotrace on / off
-------------------------

Deze maakt ook gebruik van de PLAN_TABLE en de PLUSTRACE role moet bestaan. 
Desgewenst kan het plustrce.sql script worden uitgevoerd (onder SYS).


Opmerking: Execution plan / access path bij een join query:

- nested loop: 1 table is de driving table met full table scan of gebruik van index, 
  en de tweede table wordt benadert m.b.v. een index van de 
  tweede table gebaseerd op de WHERE clause.  

- merge join: als er geen bruikbare index is, worden alle rows opgehaald, 
  gesorteerd, en gejoined naar een resultset.

- Hash join: bepaalde init.ora parameters moeten aanwezig zijn 
  (HASH_JOIN_ENABLE=TRUE, HASH_AREA_SIZE= , of via 
  ALTER SESSION SET HASH_JOIN_ENABLED=TRUE). 
  Meestal zeer effectief bij joins van een kleine table met een grote table. 
  De kleine table is de driving table in memory en het vervolg is een algolritme 
  wat lijkt op de nested loop

Kan ook worden afgedwongen met een hint:

SELECT /*+ USE_HASH(COMPANY) */ COMPANY.Name, 
SUM(Dollar_Amount) FROM COMPANY, SALES 
WHERE COMPANY.Company_ID = SALES.Company_ID GROUP BY COMPANY.Name;


3 SQL trace en TKPROFF
----------------------

SQL trace kan geactiveerd worden via init.ora of via 

ALTER SESSION SET SQL_TRACE=TRUE
DBMS_SYSTEM.SET_SQL_TRACE_IN_SESSION(sid, serial#, TRUE);
DBMS_SYSTEM.SET_SQL_TRACE_IN_SESSION(12, 398, TRUE);
DBMS_SYSTEM.SET_SQL_TRACE_IN_SESSION(12, 398, FALSE);
DBMS_SUPPORT.START_TRACE_IN_SESSION(12,398);

Turn SQL tracing on in session 448. The trace information will get written to user_dump_dest.

SQL> exec dbms_system.set_sql_trace_in_session(448,2288,TRUE); 

Turn SQL tracing off in session 448

SQL> exec dbms_system.set_sql_trace_in_session(448,2288,FALSE); 


Init.ora:

Max_dump_file_size in OS blocks
SQL_TRACE=TRUE	(kan zeer grote files opleveren, is voor alle sessions)
USER_DUMP_DEST= lokatie trace files


1.12 Indien de CBO niet het beste access path gebruikt: hints in query:
-----------------------------------------------------------------------

Goal hints: 		ALL_ROWS, FIRST_ROWS, CHOOSE, RULE
Access methods hints:	FULL, ROWID, CLUSTER, HASH, INDEX

SELECT /*+ INDEX(emp_pk) */
FROM emp WHERE empno=12345;

SELECT /*+ RULE */ ename, dname
FROM emp, dept WHERE emp.deptno=dept.deptno


==============================================
3. Data dictonary queries m.b.t perfoRMANce:
==============================================


3.1 Reads AND writes in files: 
------------------------------

V$FILESTAT, V$DATAFILE

- Relative File I/O (1)

SELECT fs.file#, df.file#, substr(df.name, 1, 50), 
fs.phyrds, fs.phywrts, df.status
FROM v$filestat fs, v$datafile df
WHERE fs.file#=df.file#

- Relative File I/O (2)

set pagesize 60 linesize 80 newpage 0 feedback off
ttitle skip centre 'Datafile IO Weights' skip centre
column Total_IO format 999999999
column Weigt format 999.99
column file_name format A40
break on drive skip 2
compute sum of Weight on Drive

SELECT
substr(DF.Name, 1, 6) Drive,
DF.Name File_Name,
FS.Phyblkrd+FS.Phyblkwrt Total_IO,
100*(FS.Phyblkrd+FS.Phyblkwrt) / MaxIO Weight
FROM V$FILESTAT FS, V$DATAFILE DF, 
(SELECT MAX(Phyblkrd+Phyblkwrt) MaxIO FROM V$FILESTAT)
WHERE
DF.File#=FS.File#
ORDER BY Weight desc
/


3.2 undocumented init parameters:
---------------------------------

SELECT *
FROM   SYS.X$KSPPI
WHERE  SUBSTR(KSPPINM,1,1) = '_';


3.3 Kans op gebruik index of niet?:
-----------------------------------
 
Kijk in

DBA_TAB_COLUMNS.NUM_DISTINCT
DBA_TABLES.NUM_ROWS

als num_distinct in de buurt komt van num_rows : 
index favoriet i.p.v. full table 

Kijk in
DBA_INDEXES, USER_INDEXES.CLUSTERING_FACTOR 
als clustering_factor = aantal blocks: ordered


3.4 snel overzicht hit ratio buffer cache:
------------------------------------------

    Hit ratio= (LR - PR) / LR

    Stel er zijn nauwelijk Physical Reads PR, ofwel PR=0, dan is de 
    Hit Ratio=LR/LR=1  Er worden dan geen blocks van disk gelezen.

    Praktijk: Hit ratio moet gemiddeld wel zo > 0,8 - 0,9 

    V$sess_io en v$sysstat en v$session kunnen geraadpleegd worden om de hit ratio te bepalen.

    V$sess_io:	sid, consistent_gets, physical_reads
    V$session:	sid, username


SELECT name, value
FROM v$sysstat
WHERE name IN ('db block gets', 'consistent gets','physical reads');

SELECT (1-(pr.value/(dbg.value+cg.value)))*100
     FROM v$sysstat pr, v$sysstat dbg, v$sysstat cg
     WHERE pr.name = 'physical reads'
          AND dbg.name = 'db block gets'
          AND cg.name = 'consistent gets';

-- uitgebeidere query m.b.t. hit ratio


CLEAR
SET HEAD ON
SET VERIFY OFF

col HitRatio format 999.99 heading 'Hit Ratio'
col CGets format 9999999999999 heading 'Consistent Gets'
col DBGets format 9999999999999 heading 'DB Block Gets'
col PhyGets format 9999999999999 heading 'Physical Reads'

SELECT substr(Username, 1, 10),
	 v$sess_io.sid,
       consistent_gets,
       block_gets,
       physical_reads,
       100*(consistent_gets+block_gets-physical_reads)/ (consistent_gets+block_gets) HitRatio
FROM v$session, v$sess_io
WHERE v$session.sid = v$sess_io.sid
AND (consistent_gets+block_gets) > 0
AND Username is NOT NULL
/

SELECT 'Hit Ratio' Database,
       cg.value CGets,
       db.value DBGets,
       pr.value PhyGets, 
       100*(cg.value+db.value-pr.value)/(cg.value+db.value) HitRatio
FROM v$sysstat db, v$sysstat cg, v$sysstat pr
WHERE db.name = 'db block gets'
AND cg.name = 'consistent gets'
AND pr.name = 'physical reads'
/


3.6 Wat zijn de actieve transacties?:
-------------------------------------

SELECT substr(username, 1, 10), substr(terminal, 1, 10), substr(osuser, 1, 10),
       t.start_time, r.name, t.used_ublk "ROLLB BLKS",
       decode(t.space, 'YES', 'SPACE TX',
          decode(t.recursive, 'YES', 'RECURSIVE TX',
             decode(t.noundo, 'YES', 'NO UNDO TX', t.status)
       )) status
FROM sys.v_$transaction t, sys.v_$rollname r, sys.v_$session s
WHERE t.xidusn = r.usn
  AND t.ses_addr = s.saddr


3.7 sid's, resource belasting en locks:
---------------------------------------

SELECT sid, lmode, ctime, block 
FROM v$lock

SELECT s.sid, substr(s.username, 1, 10), substr(s.schemaname, 1, 10), substr(s.osuser, 1, 10), 
substr(s.program, 1, 10), s.command,
l.lmode, l.block
FROM v$session s, v$lock l
WHERE s.sid=l.sid;

SELECT l.addr, s.saddr, l.sid, s.sid, l.type, l.lmode,
s.status, substr(s.schemaname, 1, 10), 
s.lockwait, s.row_wait_obj#
FROM v$lock l, v$session s
WHERE l.addr=s.saddr

SELECT sid, substr(owner, 1, 10), substr(object, 1, 10) 
FROM v$access

   SID   Session number that is accessing an object 
   OWNER Owner of the object 
   OBJECT Name of the object 
   TYPE  Type identifier for the object 
 
SELECT substr(s.username, 1, 10), s.sid,
t.log_io, t.phy_io
FROM v$session s, v$transaction t
WHERE t.ses_addr=s.saddr
      

3.8 latch use in SGA (locks op process):
----------------------------------------

SELECT c.name,a.gets,a.misses,a.sleeps, a.immediate_gets,a.immediate_misses,b.pid
FROM v$latch a, v$latchholder b, v$latchname c 
WHERE a.addr = b.laddr(+) 
AND a.latch# = c.latch# 
AND (c.name like 'redo%' or c.name like 'row%')
ORDER BY a.latch#; 


column latch_name format a40
SELECT name latch_name, gets, misses,
     round(decode(gets-misses,0,1,gets-misses)/
     decode(gets,0,1,gets),3) hit_ratio
     FROM v$latch WHERE name = 'redo allocation';

column latch_name format a40
SELECT name latch_name, immediate_gets, immediate_misses,
     round(decode(immediate_gets-immediate_misses,0,1,
     immediate_gets-immediate_misses)/
     decode(immediate_gets,0,1,immediate_gets),3) hit_ratio
     FROM v$latch WHERE name = 'redo copy';

column name format a40
column value format a10
SELECT name,value FROM v$parameter WHERE name in
     ('log_small_entry_max_size','log_simultaneous_copies',
     'cpu_count');


-- latches en locks in beeld

set pagesize 23
set pause on
set pause 'Hit any key...'

col sid format 999999
col serial# format 999999
col username format a12 trunc
col process format a8 trunc
col terminal format a12 trunc
col type format a12 trunc
col lmode format a4 trunc
col lrequest format a4 trunc
col object format a73 trunc

SELECT s.sid, s.serial#,
       decode(s.process, null,
          decode(substr(p.username,1,1), '?',   upper(s.osuser), p.username),
          decode(       p.username, 'ORACUSR ', upper(s.osuser), s.process)
       ) process,
       nvl(s.username, 'SYS ('||substr(p.username,1,4)||')') username,
       decode(s.terminal, null, rtrim(p.terminal, chr(0)),
              upper(s.terminal)) terminal,
       decode(l.type,
          -- Long locks
                      'TM', 'DML/DATA ENQ',   'TX', 'TRANSAC ENQ',
                      'UL', 'PLS USR LOCK',
          -- Short locks
                      'BL', 'BUF HASH TBL',  'CF', 'CONTROL FILE',
                      'CI', 'CROSS INST F',  'DF', 'DATA FILE   ',
                      'CU', 'CURSOR BIND ',
                      'DL', 'DIRECT LOAD ',  'DM', 'MOUNT/STRTUP',
                      'DR', 'RECO LOCK   ',  'DX', 'DISTRIB TRAN',
                      'FS', 'FILE SET    ',  'IN', 'INSTANCE NUM',
                      'FI', 'SGA OPN FILE',
                      'IR', 'INSTCE RECVR',  'IS', 'GET STATE   ',
                      'IV', 'LIBCACHE INV',  'KK', 'LOG SW KICK ',
                      'LS', 'LOG SWITCH  ',
                      'MM', 'MOUNT DEF   ',  'MR', 'MEDIA RECVRY',
                      'PF', 'PWFILE ENQ  ',  'PR', 'PROCESS STRT',
                      'RT', 'REDO THREAD ',  'SC', 'SCN ENQ     ',
                      'RW', 'ROW WAIT    ',
                      'SM', 'SMON LOCK   ',  'SN', 'SEQNO INSTCE',
                      'SQ', 'SEQNO ENQ   ',  'ST', 'SPACE TRANSC',
                      'SV', 'SEQNO VALUE ',  'TA', 'GENERIC ENQ ',
                      'TD', 'DLL ENQ     ',  'TE', 'EXTEND SEG  ',
                      'TS', 'TEMP SEGMENT',  'TT', 'TEMP TABLE  ',
                      'UN', 'USER NAME   ',  'WL', 'WRITE REDO  ',
                      'TYPE='||l.type) type,
       decode(l.lmode, 0, 'NONE', 1, 'NULL', 2, 'RS', 3, 'RX',
                       4, 'S',    5, 'RSX',  6, 'X',
                       to_char(l.lmode) ) lmode,
       decode(l.request, 0, 'NONE', 1, 'NULL', 2, 'RS', 3, 'RX',
                         4, 'S', 5, 'RSX', 6, 'X',
                         to_char(l.request) ) lrequest,
       decode(l.type, 'MR', decode(u.name, null,
                            'DICTIONARY OBJECT', u.name||'.'||o.name),
                      'TD', u.name||'.'||o.name,
                      'TM', u.name||'.'||o.name,
                      'RW', 'FILE#='||substr(l.id1,1,3)||
                      ' BLOCK#='||substr(l.id1,4,5)||' ROW='||l.id2,
                      'TX', 'RS+SLOT#'||l.id1||' WRP#'||l.id2,
                      'WL', 'REDO LOG FILE#='||l.id1,
                      'RT', 'THREAD='||l.id1,
                      'TS', decode(l.id2, 0, 'ENQUEUE',
                                             'NEW BLOCK ALLOCATION'),
                      'ID1='||l.id1||' ID2='||l.id2) object
FROM   sys.v_$lock l, sys.v_$session s, sys.obj$ o, sys.user$ u,
       sys.v_$process p
WHERE  s.paddr  = p.addr(+)
  AND  l.sid    = s.sid
  AND  l.id1    = o.obj#(+)
  AND  o.owner# = u.user#(+)
  AND  l.type   <> 'MR'
UNION ALL                          /*** LATCH HOLDERS ***/
SELECT s.sid, s.serial#, s.process, s.username, s.terminal,
       'LATCH', 'X', 'NONE', h.name||' ADDR='||rawtohex(laddr)
FROM   sys.v_$process p, sys.v_$session s, sys.v_$latchholder h
WHERE  h.pid  = p.pid
  AND  p.addr = s.paddr
UNION ALL                         /*** LATCH WAITERS ***/
SELECT s.sid, s.serial#, s.process, s.username, s.terminal,
       'LATCH', 'NONE', 'X', name||' LATCH='||p.latchwait
FROM   sys.v_$session s, sys.v_$process p, sys.v_$latch l
WHERE  latchwait is not null
  AND  p.addr      = s.paddr
  AND  p.latchwait = l.addr
/


========================================================
4. IMP and EXP, IMPDP and EXPDP, and SQL*Loader Examples
========================================================


4.1 EXPDP and IMPDP examples:
=============================

New for Oracle 10g, are the impdp and expdp utilities.

EXPDP practice/practice PARFILE=par1.par
EXPDP hr/hr DUMPFILE=export_dir:hr_schema.dmp LOGFILE=export_dir:hr_schema.explog
EXPDP system/******** PARFILE=c:\rmancmd\dpe_1.expctl


Oracle 10g provides two new views, DBA_DATAPUMP_JOBS and DBA_DATAPUMP_SESSIONS that allow the DBA to monitor the progress 
of all DataPump operations.

SELECT 
     owner_name
    ,job_name
    ,operation
    ,job_mode
    ,state
    ,degree
    ,attached_sessions
  FROM dba_datapump_jobs
;

SELECT 
     DPS.owner_name
    ,DPS.job_name
    ,S.osuser
  FROM 
     dba_datapump_sessions DPS
    ,v$session S
 WHERE S.saddr = DPS.saddr
;


Example 1. EXPDP parfile
------------------------

JOB_NAME=NightlyDRExport
DIRECTORY=export_dir
DUMPFILE=export_dir:fulldb_%U.dmp
LOGFILE=export_dir:NightlyDRExport.explog
FULL=Y
PARALLEL=2
FILESIZE=650M
CONTENT=ALL
STATUS=30
ESTIMATE_ONLY=Y

Example 2. EXPDP parfile, only for getting an estimate of export size
---------------------------------------------------------------

JOB_NAME=EstimateOnly
DIRECTORY=export_dir
LOGFILE=export_dir:EstimateOnly.explog
FULL=Y
CONTENT=DATA_ONLY
ESTIMATE=STATISTICS
ESTIMATE_ONLY=Y
STATUS=60

Example 3. EXPDP parfile, only 1 schema, writing to multiple files with %U variable, limited to 650M
----------------------------------------------------------------------------------------------

JOB_NAME=SH_TABLESONLY
DIRECTORY=export_dir
DUMPFILE=export_dir:SHONLY_%U.dmp
LOGFILE=export_dir:SH_TablesOnly.explog
SCHEMAS=SH
PARALLEL=2
FILESIZE=650M
STATUS=60

Example 4. EXPDP parfile, multiple tables, writing to multiple files with %U variable, limited
---------------------------------------------------------------------------------------- 

JOB_NAME=HR_PAYROLL_REFRESH
DIRECTORY=export_dir
DUMPFILE=export_dir:HR_PAYROLL_REFRESH_%U.dmp
LOGFILE=export_dir:HR_PAYROLL_REFRESH.explog
STATUS=20
FILESIZE=132K
CONTENT=ALL 
TABLES=HR.EMPLOYEES,HR.DEPARTMENTS,HR.PAYROLL_CHECKS,HR.PAYROLL_HOURLY,HR.PAYROLL_SALARY,HR.PAYROLL_TRANSACTIONS

Example 5. EXPDP parfile, Exports all objects in the HR schema, including metadata, asof just before midnight on April 10, 2005
-------------------------------------------------------------------------------------------------------------------------

JOB_NAME=HREXPORT
DIRECTORY=export_dir
DUMPFILE=export_dir:HREXPORT_%U.dmp
LOGFILE=export_dir:2005-04-10_HRExport.explog
SCHEMAS=HR
CONTENTS=ALL
FLASHBACK_TIME=TO_TIMESTAMP"('04-10-2005 23:59', 'MM-DD-YYYY HH24:MI')"

Example 6. IMPDP parfile, Imports data +only+ into selected tables in the HR schema, Multiple dump files will be used
----------------------------------------------------------------------------------------------------------------------

JOB_NAME=HR_PAYROLL_IMPORT
DIRECTORY=export_dir
DUMPFILE=export_dir:HR_PAYROLL_REFRESH_%U.dmp
LOGFILE=export_dir:HR_PAYROLL_IMPORT.implog
STATUS=20
TABLES=HR.PAYROLL_CHECKS,HR.PAYROLL_HOURLY,HR.PAYROLL_SALARY,HR.PAYROLL_TRANSACTIONS
CONTENT=DATA_ONLY
TABLE_EXISTS_ACTION=TRUNCATE

Example 7. IMPDP parfile,3 tables in the SH schema are the only tables to be refreshed,These tables will be truncated before loading
--------------------------------------------------------------------------------------------------------------------------------

DIRECTORY=export_dir
JOB_NAME=RefreshSHTables
DUMPFILE=export_dir:fulldb_%U.dmp
LOGFILE=export_dir:RefreshSHTables.implog
STATUS=30
CONTENT=DATA_ONLY
SCHEMAS=SH
INCLUDE=TABLE:"IN('COUNTRIES','CUSTOMERS','PRODUCTS','SALES')"
TABLE_EXISTS_ACTION=TRUNCATE

Example IMPDP parfile,Generates SQLFILE output showing the DDL statements,Note that this code is +not+ executed! 
----------------------------------------------------------------------------------------------------------------

DIRECTORY=export_dir
JOB_NAME=GenerateImportDDL
DUMPFILE=export_dir:hr_payroll_refresh_%U.dmp
LOGFILE=export_dir:GenerateImportDDL.implog
SQLFILE=export_dir:GenerateImportDDL.sql
INCLUDE=TABLE


Example: schedule a procedure which uses DBMS_DATAPUMP
------------------------------------------------------

BEGIN
	DBMS_SCHEDULER.CREATE_JOB (
		 job_name => 'HR_EXPORT'
		,job_type => 'PLSQL_BLOCK'
		,job_action => 'BEGIN HR.SP_EXPORT;END;'
		,start_date => '04/18/2005 23:00:00.000000'
		,repeat_interval => 'FREQ=DAILY'
		,enabled => TRUE
		,comments => 'Performs HR Schema Export nightly at 11 PM'
    );
END;
/


======================================
How to use the NETWORK_LINK paramater:
======================================

Note 1:
=======

Lora, the DBA at Acme Bank, is at the center of attention in a high-profile meeting of the bank's top management team. 
The objective is to identify ways of enabling end users to slice and dice the data in the company's main data warehouse. 
At the meeting, one idea presented is to create several small data marts�each based on a particular functional area�that 
can each be used by specialized teams. 

To effectively implement the data mart approach, the data specialists must get data into the data marts quickly and efficiently. 
The challenge the team faces is figuring out how to quickly refresh the warehouse data to the data marts, which run on 
heterogeneous platforms. And that's why Lora is at the meeting. What options does she propose for moving the data? 

An experienced and knowledgeable DBA, Lora provides the meeting attendees with three possibilities, as follows: 


Using transportable tablespaces 
Using Data Pump (Export and Import) 
Pulling tablespaces 

This article shows Lora's explanation of these options, including their implementation details and their pros and cons. 

Transportable Tablespaces: 

Lora starts by describing the transportable tablespaces option. The quickest way to transport an entire tablespace to 
a target system is to simply transfer the tablespace's underlying files, using FTP (file transfer protocol) 
or rcp (remote copy). 
However, just copying the Oracle data files is not sufficient; the target database must recognize and import the files 
and the corresponding tablespace before the tablespace data can become available to end users. 
Using transportable tablespaces 
involves copying the tablespace files and making the data available in the target database. 

A few checks are necessary before this option can be considered. First, for a tablespace TS1 to be transported to a 
target system, 
it must be self-contained. That is, all the indexes, partitions, and other dependent segments of the tables in the tablespace 
must be inside the tablespace. Lora explains that if a set of tablespaces contains all the dependent segments, 
the set is considered 
to be self-contained. For instance, if tablespaces TS1 and TS2 are to be transferred as a set and a table in TS1 has 
an index in TS2, the tablespace set is self-contained. However, if another index of a table in TS1 is in tablespace TS3, 
the tablespace set (TS1, TS2) is not self-contained. 


To transport the tablespaces, Lora proposes using the Data Pump Export utility in Oracle Database 10g. Data Pump is Oracle's 
next-generation data transfer tool, which replaces the earlier Oracle Export (EXP) and Import (IMP) tools. 
Unlike those older tools, which use regular SQL to extract and insert data, Data Pump uses proprietary APIs that bypass 
the SQL buffer, making the process extremely fast. In addition, Data Pump can extract specific objects, such as a particular 
stored procedure or a set of tables from a particular tablespace. Data Pump Export and Import are controlled by jobs, 
which the DBA can pause, restart, and stop at will. 

Lora has run a test before the meeting to see if Data Pump can handle Acme's requirements. Lora's test transports the 
TS1 and TS2 tablespaces as follows: 

1. Check that the set of TS1 and TS2 tablespaces is self- contained. Issue the following command: 


BEGIN 
  SYS.DBMS_TTS.TRANSPORT_SET_CHECK ('TS1','TS2'); 
END;


2. Identify any nontransportable sets. If no rows are selected, the tablespaces are self-contained: 


SELECT * FROM SYS.TRANSPORT_SET_VIOLATIONS;

no rows selected


3. Ensure the tablespaces are read-only: 


SELECT STATUS
FROM DBA_TABLESPACES
WHERE TABLESPACE_NAME IN ('TS1','TS2');

STATUS
---------
READ ONLY
READ ONLY

4. Transfer the data files of each tablespace to the remote system, into the directory /u01/oradata, 
using a transfer mechanism such as FTP or rcp. 

5. In the target database, create a database link to the source database (named srcdb in the line below). 


CREATE DATABASE LINK srcdb 
USING 'srcdb';


6. In the target database, import the tablespaces into the database, using Data Pump Import. 


impdp lora/lora123  TRANSPORT_DATAFILES="'/u01/oradata/ts1_1.dbf','/u01/oradata/ts2_1.dbf'" NETWORK_LINK='srcdb' 
TRANSPORT_TABLESPACES=\(TS1,TS2\) 
NOLOGFILE=Y


This step makes the TS1 and TS2 tablespaces and their data available in the target database. 
Note that Lora doesn't export the metadata from the source database. She merely specifies the value srcdb, 
the database link to the source database, for the parameter NETWORK_LINK in the impdp command above. 
Data Pump Import fetches the necessary metadata from the source across the database link and re-creates it in the target. 

7. Finally, make the TS1 and TS2 tablespaces in the source database read-write. 


ALTER TABLESPACE TS1 READ WRITE;
ALTER TABLESPACE TS2 READ WRITE;


Note 2:
=======


One of the most significant characteristics of an import operation is its mode, because the mode largely determines 
what is imported. The specified mode applies to the source of the operation, either a dump file set or another database 
if the NETWORK_LINK parameter is specified.

The NETWORK_LINK parameter initiates a network import. This means that the impdp client initiates the import request, 
typically to the local database. That server contacts the remote source database referenced by the database link 
in the NETWORK_LINK parameter, retrieves the data, and writes it directly back to the target database. 
There are no dump files involved.

In the following example, the source_database_link would be replaced with the name of a valid database link 
that must already exist.

impdp hr/hr TABLES=employees DIRECTORY=dpump_dir1 NETWORK_LINK=source_database_link EXCLUDE=CONSTRAINT

This example results in an import of the employees table (excluding constraints) from the source database. 
The log file is written to dpump_dir1, specified on the DIRECTORY parameter.


4.2 Export / Import examples:
=============================

In all Oracle versions 7,8,8i,9i,10g you can use the exp and imp utilities.


exp system/manager file=expdat.dmp compress=Y owner=(HARRY, PIET)
exp system/manager file=hr.dmp owner=HR indexes=Y
exp system/manager file=expdat.dmp TABLES=(john.SALES)

imp system/manager file=hr.dmp full=Y buffer=64000 commit=Y
imp system/manager file=expdat.dmp FROMuser=ted touser=john indexes=N commit=Y buffer=64000

c:\> cd [oracle_db_home]\bin
c:\> set nls_lang=american_america.WE8ISO8859P15
export NLS_LANG=AMERICAN_AMERICA.UTF8
c:\> imp system/manager fromuser=mis_owner touser=mis_owner file=[yourexport.dmp]


FROM Oracle8i one can use the QUERY= export parameter to SELECTively unload a subset of the data FROM a table. 
Look at this example: 
exp scott/tiger tables=emp query=\"WHERE deptno=10\"


What is exported?:
------------------

Tables, indexes, data, database links gets exported.

Example:
-------- 

exp system/manager file=oemuser.dmp owner=oemuser

Verbonden met: Oracle9i Enterprise Edition Release 9.0.1.4.0 - Production
With the Partitioning option
JServer Release 9.0.1.4.0 - Production.
Export is uitgevoerd in WE8MSWIN1252 tekenset en AL16UTF16 NCHAR-tekenset.

Export van opgegeven gebruikers gaat beginnen ...
. pre-schema procedurele objecten en acties wordt ge�xporteerd.
. bibliotheeknamen van verwijzende functie voor gebruiker OEMUSER worden ge�xpo
teerd
. objecttypedefinities voor gebruiker OEMUSER worden ge�xporteerd
Export van objecten van OEMUSER gaat beginnen ...
. databasekoppelingen worden ge�xporteerd.
. volgnummers worden ge�xporteerd.
. clusterdefinities worden ge�xporteerd.
. export van tabellen van OEMUSER gaat beginnen ... via conventioneel pad ...
. . tabel                      CUSTOMERS wordt ge�xporteerd.Er zijn           2
rijen ge�xporteerd.
. synoniemen worden ge�xporteerd.
. views worden ge�xporteerd.
. opgeslagen procedures worden ge�xporteerd.
. operatoren worden ge�xporteerd.
. referenti�le integriteitsbeperkingen worden ge�xporteerd.
. triggers worden ge�xporteerd.
. indextypen worden ge�xporteerd.
. bitmap, functionele en uit te breiden indexen worden ge�xporteerd.
. acties post-tabellen worden ge�xporteerd
. snapshots worden ge�xporteerd.
. logs voor snapshots worden ge�xporteerd.
. takenwachtrijen worden ge�xporteerd
. herschrijfgroepen en kinderen worden ge�xporteerd
. dimensies worden ge�xporteerd.
. post-schema procedurele objecten en acties wordt ge�xporteerd.
. statistieken worden ge�xporteerd.
Export is succesvol be�indigd zonder waarschuwingen.

D:\temp>


Can one import tables to a different tablespace?
-------------------------------------------------

Import the dump file using the INDEXFILE= option 
Edit the indexfile. Remove remarks and specify the correct tablespaces. 
Run this indexfile against your database, this will create the required tables 
in the appropriate tablespaces 
Import the table(s) with the IGNORE=Y option. 
Change the default tablespace for the user:


Revoke the "UNLIMITED TABLESPACE" privilege FROM the user 
Revoke the user's quota FROM the tablespace FROM WHERE the object was exported. 
This forces the import utility to create tables in the user's default tablespace. 
Make the tablespace to which you want to import the default tablespace for the user 
Import the table 

Can one export to multiple files?/ Can one beat the Unix 2 Gig limit?
---------------------------------------------------------------------

FROM Oracle8i, the export utility supports multiple output files. 
         exp SCOTT/TIGER FILE=D:\F1.dmp,E:\F2.dmp FILESIZE=10m LOG=scott.log

Use the following technique if you use an Oracle version prior to 8i: 

Create a compressed export on the fly. 


        # create a named pipe
        mknod exp.pipe p
        # read the pipe - output to zip file in the background
        gzip < exp.pipe > scott.exp.gz &
        # feed the pipe
        exp userid=scott/tiger file=exp.pipe ...


Some famous Errors:
-------------------

Error 1:
--------

EXP-00008: ORACLE error 6550 encountered 
ORA-06550: line 1, column 31: 
PLS-00302: component 'DBMS_EXPORT_EXTENSION' must be declared 

1. The errors indicate that 
$ORACLE_HOME/rdbms/admin/CATALOG.SQL 
and 
$ORACLE_HOME/rdbms/admin/CATPROC.SQL 
Should be run again, as has been previously suggested. Were these scripts run connected as SYS? 
Try SELECT OBJECT_NAME, OBJECT_TYPE FROM DBA_OBJECTS WHERE STATUS = 
'INVALID' AND OWNER = 'SYS'; 
Do you have invalid objects? Is DBMS_EXPORT_EXTENSION invalid? If so, try compiling it manually: 
ALTER PACKAGE DBMS_EXPORT_EXTENSION COMPILE BODY; 
If you receive errors during manual compilation, please show errors for further information. 

2. Or possibly different imp/exp versions are run to another version 
of the database.

The problem can be resolved by copying the higher version 
CATEXP.SQL and executed in the lesser version RDBMS. 

3. Other fix:

If there are problems in exp/imp from single byte to multibyte databases:

- Analyze which tables/rows could be affected by national characters before
  running the export
- Increase the size of affected rows.
- Export the table data once again.

Error 2:
--------

EXP-00091: Exporting questionable statistics.

Hi. This warning is generated because the statistics are questionable due to the 
client character set difference from the server character set. 
There is an article which discusses the causes of questionable statistics available 
via the MetaLink Advanced Search option by Doc ID: 
Doc ID: 159787.1 9i: Import STATISTICS=SAFE 
If you do not want this conversion to occur, you need to ensure the client NLS environment 
performing the export is set to match the server. 


Fix ~~~~  
a) If the statistics of a table are not required to include in export      
take the export with parameter STATISTICS=NONE    
Example:  $exp scott/tiger file=emp1.dmp tables=emp STATISTICS=NONE  
b) In case, the statistics are need to be included can use 
STATISTICS=ESTIMATE or COMPUTE (default is Estimate). 

Error 3:
--------

EXP-00056: ORACLE error 1403 encountered
ORA-01403: no data found
EXP-00056: ORACLE error 1403 encountered
ORA-01403: no data found
EXP-00000: Export terminated unsuccessfully

You can't export any DB with an exp utility of a newer version.
The exp version must be equal or older than the DB version

Doc ID </help/usaeng/Search/search.html>: 	Note:281780.1	Content Type: 	TEXT/PLAIN	
Subject: 	Oracle 9.2.0.4.0: Schema Export Fails with ORA-1403 (No Data Found) on Exporting Cluster Definitions	Creation Date: 	29-AUG-2004	
Type: 	PROBLEM	Last Revision Date: 	29-AUG-2004	
Status: 	PUBLISHED		
The information in this article applies to:  
- Oracle Server - Enterprise Edition - Version: 9.2.0.4 to 9.2.0.4 
- Oracle Server - Personal Edition   - Version: 9.2.0.4 to 9.2.0.4 
- Oracle Server - Standard Edition   - Version: 9.2.0.4 to 9.2.0.4 
This problem can occur on any platform. 
 
 
ERRORS 
------ 
 
EXP-56 ORACLE error encountered 
ORA-1403 no data found 
EXP-0: Export terminated unsuccessfully 
 
 
SYMPTOMS 
-------- 
 
A schema level export with the 9.2.0.4 export utility from a 9.2.0.4 or higher 
release database in which XDB has been installed, fails when exporting  
the cluster definitions with: 
 
... 
. exporting cluster definitions 
EXP-00056: ORACLE error 1403 encountered 
ORA-01403: no data found 
EXP-00000: Export terminated unsuccessfully 
 
 
You can confirm that XDB has been installed in the database: 
 
SQL> SELECT substr(comp_id,1,15) comp_id, status, substr(version,1,10) version, 
   substr(comp_name,1,30) comp_name FROM dba_registry ORDER BY 1; 
 
COMP_ID         STATUS      VERSION    COMP_NAME 
--------------- ----------- ---------- ------------------------------ 
... 
XDB             INVALID     9.2.0.4.0  Oracle XML Database 
XML             VALID       9.2.0.6.0  Oracle XDK for Java 
XOQ             LOADED      9.2.0.4.0  Oracle OLAP API 
 
 
You create a trace file of the ORA-1403 error: 
 
SQL> SHOW PARAMETER user_dump 
SQL> ALTER SYSTEM SET EVENTS '1403 trace name errorstack level 3'; 
System altered. 
 
-- Re-run the export 
 
SQL> ALTER SYSTEM SET EVENTS '1403 trace name errorstack off'; 
System altered. 
 
 
The trace file that was written to your USER_DUMP_DEST directory, shows: 
 
ksedmp: internal or fatal error 
ORA-01403: no data found 
Current SQL statement for this session: 
SELECT xdb_uid FROM SYS.EXU9XDBUID 
 
 
You can confirm that you have no invalid XDB objects in the database: 
 
SQL> SET lines 200 
SQL> SELECT status, object_id, object_type, owner||'.'||object_name  
   "OWNER.OBJECT" FROM dba_objects WHERE owner='XDB' AND status != 'VALID'  
   ORDER BY 4,2; 
 
no rows selected 
 
Note: If you do have invalid XDB objects, and the same ORA-1403 error occurs 
      when performing a full database export, see the solution mentioned in: 
      [NOTE:255724.1] <ml2_documents.showDocument?p_id=255724.1&p_database_id=NOT> 
    "Oracle 9i: Full Export Fails with ORA-1403 
    (No Data Found) on Exporting Cluster Defintions" 
 
 
CHANGES 
------- 
 
You recently restored the database from a backup or you recreated the  
controlfile, or you performed Operating System actions on your database  
tempfiles. 
 
 
CAUSE 
----- 
 
The Temporary tablespace does not have any tempfiles. 
 
Note that the errors are different when exporting with a 9.2.0.3 or earlier 
export utility: 
 
. exporting cluster definitions 
EXP-00056: ORACLE error 1157 encountered 
ORA-01157: cannot identify/lock data file 201 - see DBWR trace file 
ORA-01110: data file 201: 'M:\ORACLE\ORADATA\M9201WA\TEMP01.DBF' 
ORA-06512: at "SYS.DBMS_LOB", line 424 
ORA-06512: at "SYS.DBMS_METADATA", line 1140 
ORA-06512: at line 1 
EXP-00000: Export terminated unsuccessfully 
 
The errors are also different when exporting with a 9.2.0.5 or later export 
utility: 
 
. exporting cluster definitions 
EXP-00056: ORACLE error 1157 encountered 
ORA-01157: cannot identify/lock data file 201 - see DBWR trace file 
ORA-01110: data file 201: 'M:\ORACLE\ORADATA\M9205WA\TEMP01.DBF' 
EXP-00000: Export terminated unsuccessfully 
 
 
FIX 
--- 
 
1. If the controlfile does not have any reference to the tempfile(s),  
   add the tempfile(s): 
 
   SQL> SET lines 200 
   SQL> SELECT status, enabled, name FROM v$tempfile; 
   no rows selected 
 
   SQL> ALTER TABLESPACE temp ADD TEMPFILE  
     'M:\ORACLE\ORADATA\M9204WA\TEMP01.DBF' REUSE; 
 
or:  
 
   If the controlfile has a reference to the tempfile(s), but the files are 
   missing on disk, re-create the temporary tablespace, e.g.: 
 
   SQL> SET lines 200 
   SQL> CREATE TEMPORARY TABLESPACE temp2 TEMPFILE 
     'M:\ORACLE\ORADATA\M9204WA\TEMP201.DBF' SIZE 100m AUTOEXTEND ON  
     NEXT 100M MAXSIZE 2000M; 
   SQL> ALTER DATABASE DEFAULT TEMPORARY TABLESPACE temp2; 
   SQL> DROP TABLESPACE temp; 
   SQL> CREATE TEMPORARY TABLESPACE temp TEMPFILE 
     'M:\ORACLE\ORADATA\M9204WA\TEMP01.DBF' SIZE 100m AUTOEXTEND ON  
     NEXT 100M MAXSIZE 2000M; 
   SQL> ALTER DATABASE DEFAULT TEMPORARY TABLESPACE temp; 
   SQL> SHUTDOWN IMMEDIATE 
   SQL> STARTUP 
   SQL> DROP TABLESPACE temp2 INCLUDING CONTENTS AND DATAFILES; 
 
2. Now re-run the export. 


Other errors:
-------------


Doc ID </help/usaeng/Search/search.html>: 	Note:175624.1	Content Type: 	TEXT/X-HTML	
Subject: 	Oracle Server - Export and Import FAQ	Creation Date: 	08-FEB-2002	
Type: 	FAQ	Last Revision Date: 	16-FEB-2005	
Status: 	PUBLISHED		
PURPOSE
=======
This Frequently Asked Questions (FAQ) provides common Export and Import issues
in the following sections:
- GENERIC          - LARGE FILES   - INTERMEDIA         - TOP EXPORT DEFECTS
- COMPATIBILITY    - TABLESPACE    - ADVANCED QUEUING   - TOP IMPORT DEFECTS
- PARAMETERS       - ORA-942       - REPLICATION
- PERFORMANCE      - NLS           - FREQUENT ERRORS


GENERIC
=======
Question: What is actually happening when I export and import data?
          See Note 61949.1 </metalink/plsql/showdoc?db=NOT&id=61949.1> "Overview of Export and Import in Oracle7"

Question: What is important when doing a full database export or import?
          See Note 10767.1 </metalink/plsql/showdoc?db=NOT&id=10767.1> "How to perform full system Export/Import"
Question: Can data corruption occur using export & import (version 8.1.7.3 to 9.2.0)?
          See Note 199416.1 </metalink/plsql/showdoc?db=NOT&id=199416.1> "ALERT: EXP Can Produce Dump File with Corrupted Data"

Question: How to Connect AS SYSDBA when Using Export or Import?
          See Note 277237.1 </metalink/plsql/showdoc?db=NOT&id=277237.1> "How to Connect AS SYSDBA when Using Export or Import"


COMPATIBILITY
=============
Question: Which version should I use when moving data between different database releases?
          See Note 132904.1 </metalink/plsql/showdoc?db=NOT&id=132904.1> "Compatibility Matrix for Export & Import Between Different Oracle Versions"
          See Note 291024.1 </metalink/plsql/showdoc?db=NOT&id=291024.1> "Compatibility and New Features when Transporting Tablespaces with Export and Import"
          See Note 76542.1 </metalink/plsql/showdoc?db=NOT&id=76542.1> "NT: Exporting from Oracle8, Importing Into Oracle7"

Question: How to resolve the IMP-69 error when importing into a database?
          See Note 163334.1 </metalink/plsql/showdoc?db=NOT&id=163334.1> "Import Gets IMP-00069 when Importing 8.1.7 Export"
          See Note 1019280.102 </metalink/plsql/showdoc?db=NOT&id=1019280.102> "IMP-69 on Import"


PARAMETERS
==========
Question: What is the difference between a Direct Path and a Conventional Path Export?
          See Note 155477.1 </metalink/plsql/showdoc?db=NOT&id=155477.1> "Parameter DIRECT: Conventional Path Export versus Direct Path Export"

Question: What is the meaning of the Export parameter CONSISTENT=Y and when should I use it?
          See Note 113450.1 </metalink/plsql/showdoc?db=NOT&id=113450.1> "When to Use CONSISTENT=Y During an Export"

Question: How to use the Oracle8i/9i Export parameter QUERY=... and what does it do?
          See Note 91864.1 </metalink/plsql/showdoc?db=NOT&id=91864.1> "Query= Syntax in Export in 8i"
          See Note 277010.1 </metalink/plsql/showdoc?db=NOT&id=277010.1> "How to Specify a Query in Oracle10g Export DataPump and Import DataPump"

Question: How to create multiple export dumpfiles instead of one large file?
          See Note 290810.1 </metalink/plsql/showdoc?db=NOT&id=290810.1> "Parameter FILESIZE - Make Export Write to Multiple Export Files"


PERFORMANCE
===========
Question: Import takes so long to complete. How can I improve the performance of Import?
          See Note 93763.1 </metalink/plsql/showdoc?db=NOT&id=93763.1> "Tuning Considerations when Import is slow"

Question: Why has export performance decreased after creating tables with LOB columns?
          See Note 281461.1 </metalink/plsql/showdoc?db=NOT&id=281461.1> "Export and Import of Table with LOB Columns (like CLOB and BLOB) has Slow Performance"


LARGE FILES
===========
Question: Which commands to use for solving Export dump file problems on UNIX platforms?
          See Note 30528.1 </metalink/plsql/showdoc?db=NOT&id=30528.1> "QREF: Export/Import/SQL*Load Large Files in Unix - Quick Reference"

Question: How to solve the EXP-15 and EXP-2 errors when Export dump file is larger than 2Gb?
          See Note 62427.1 </metalink/plsql/showdoc?db=NOT&id=62427.1> "2Gb or Not 2Gb - File limits in Oracle"
          See Note 1057099.6 </metalink/plsql/showdoc?db=NOT&id=1057099.6> "Unable to export when export file grows larger than 2GB"
          See Note 290810.1 </metalink/plsql/showdoc?db=NOT&id=290810.1> "Parameter FILESIZE - Make Export Write to Multiple Export Files"

Question: How to export to a tape device by using a named pipe?
          See Note 30428.1 </metalink/plsql/showdoc?db=NOT&id=30428.1> "Exporting to Tape on Unix System"


TABLESPACE
==========
Question: How to transport tablespace between different versions?
          See Note 291024.1 </metalink/plsql/showdoc?db=NOT&id=291024.1> "Compatibility and New Features when Transporting Tablespaces with Export and Import"

Question: How to move tables to a different tablespace and/or different user?
          See Note 1012307.6 </metalink/plsql/showdoc?db=NOT&id=1012307.6> "Moving Tables Between Tablespaces Using EXPORT/IMPORT"
          See Note 1068183.6 </metalink/plsql/showdoc?db=NOT&id=1068183.6> "How to change the default tablespace when importing using the INDEXFILE option"

Question: How can I export all tables of a specific tablespace?
          See Note 1039292.6 </metalink/plsql/showdoc?db=NOT&id=1039292.6> "How to Export Tables for a specific Tablespace"


ORA-942
=======
Question: How to resolve an ORA-942 during import of the ORDSYS schema?
          See Note 109576.1 </metalink/plsql/showdoc?db=NOT&id=109576.1> "Full Import shows Errors when adding Referential Constraint on Cartrige Tables"

Question: How to resolve an ORA-942 during import of a snapshot (log) into a different schema?
          See Note 1017292.102 </metalink/plsql/showdoc?db=NOT&id=1017292.102> "IMP-00017 IMP-00003 ORA-00942 USING FROMUSER/TOUSER ON SNAPSHOT [LOG] IMPORT"

Question: How to resolve an ORA-942 during import of a trigger on a renamed table?
          See Note 1020026.102 </metalink/plsql/showdoc?db=NOT&id=1020026.102> "ORA-01702, ORA-00942, ORA-25001, When Importing Triggers"

Question: How to resolve an ORA-942 during import of one specific table?
          See Note 1013822.102 </metalink/plsql/showdoc?db=NOT&id=1013822.102> "ORA-00942: ON TABLE LEVEL IMPORT"


NLS
===
Question: Which effect has the client's NLS_LANG setting on an export and import?
          See Note 227332.1 </metalink/plsql/showdoc?db=NOT&id=227332.1> "NLS considerations in Import/Export - Frequently Asked Questions"
          See Note 15656.1 </metalink/plsql/showdoc?db=NOT&id=15656.1> "Export/Import and NLS Considerations"

Question: How to prevent the loss of diacritical marks during an export/import?
          See Note 96842.1 </metalink/plsql/showdoc?db=NOT&id=96842.1> "Loss Of Diacritics When Performing EXPORT/IMPORT Due To Incorrect Charactersets"


INTERMEDIA OBJECTS
==================
Question: How to solve an EXP-78 when exporting metadata for an interMedia Text index?
          See Note 130080.1 </metalink/plsql/showdoc?db=NOT&id=130080.1> "Problems with EXPORT after upgrading from 8.1.5 to 8.1.6"

Question: I dropped the ORDSYS schema, but now I get ORA-6550 and PLS-201 when exporting?
          See Note 120540.1 </metalink/plsql/showdoc?db=NOT&id=120540.1> "EXP-8 PLS-201 After Drop User ORDSYS"


ADVANCED QUEUING OBJECTS
========================
Question: Why does export show ORA-1403 and ORA-6512 on an AQ object, after an upgrade?
          See Note 159952.1 </metalink/plsql/showdoc?db=NOT&id=159952.1> "EXP-8 and ORA-1403 When Performing A Full Export"

Question: How to resolve export errors on DBMS_AQADM_SYS and DBMS_AQ_SYS_EXP_INTERNAL?
          See Note 114739.1 </metalink/plsql/showdoc?db=NOT&id=114739.1> "ORA-4068 while performing full database export"


REPLICATION OBJECTS
===================
Question: How to resolve import errors on DBMS_IJOB.SUBMIT for Replication jobs?
          See Note 137382.1 </metalink/plsql/showdoc?db=NOT&id=137382.1> "IMP-3, PLS-306 Unable to Import Oracle8i JobQueues into Oracle8"

Question: How to reorganize Replication base tables with Export and Import?
          See Note 1037317.6 </metalink/plsql/showdoc?db=NOT&id=1037317.6> "Move Replication System Tables using Export/Import for Oracle 8.X"


FREQUENTLY REPORTED EXPORT/IMPORT ERRORS
========================================
EXP-00002: Error in writing to export file
           Note 1057099.6 </metalink/plsql/showdoc?db=NOT&id=1057099.6> "Unable to export when export file grows 
larger than 2GB"

EXP-00002: error in writing to export file 
The export file could not be written to disk anymore, probably because the disk is full or the device has an error.
Most of the time this is followed by a device (filesystem) error message indicating the problem.

Possible causes are file systems that do not support a certain limit (eg. dump file size > 2Gb) or a disk/filesystem that ran out of space.


EXP-00003: No storage definition found for segment(%s,%s) (EXP-3 EXP-0)
           Note 274076.1 </metalink/plsql/showdoc?db=NOT&id=274076.1> "EXP-00003 When Exporting From Oracle9i 9.2.0.5.0 with a Pre-9.2.0.5.0 Export Utility"
           Note 124392.1 </metalink/plsql/showdoc?db=NOT&id=124392.1> "EXP-3 while exporting Rollback Segment definitions during FULL Database Export"

EXP-00067: "Direct path cannot export %s which contains object or lob data."
           Note 1048461.6 </metalink/plsql/showdoc?db=NOT&id=1048461.6> "EXP-00067 PERFORMING DIRECT PATH EXPORT"

EXP-00079: Data in table %s is protected (EXP-79)
           Note 277606.1 </metalink/plsql/showdoc?db=NOT&id=277606.1> "How to Prevent EXP-00079 or EXP-00080 Warning (Data in Table xxx is Protected) During Export"

EXP-00091: Exporting questionable statistics
           Note 159787.1 </metalink/plsql/showdoc?db=NOT&id=159787.1> "9i: Import STATISTICS=SAFE"

IMP-00016: Required character set conversion (type %lu to %lu) not supported
           Note 168066.1 </metalink/plsql/showdoc?db=NOT&id=168066.1> "IMP-16 When Importing Dumpfile into a Database Using Multibyte Characterset"
	
IMP-00020: Long column too large for column buffer size 
           Note 148740.1 </metalink/plsql/showdoc?db=NOT&id=148740.1> "ALERT: Export of table with dropped functional index may cause IMP-20 on import"

ORA-00904: Invalid column name (EXP-8 ORA-904 EXP-0)
           Note 106155.1 </metalink/plsql/showdoc?db=NOT&id=106155.1> "EXP-00008 ORA-1003 ORA-904 During Export"
           Note 172220.1 </metalink/plsql/showdoc?db=NOT&id=172220.1> "Export of Database fails with EXP-00904 and ORA-01003"
           Note 158048.1 </metalink/plsql/showdoc?db=NOT&id=158048.1> "Oracle8i Export Fails on Synonym Export with EXP-8 and ORA-904"
           Note 130916.1 </metalink/plsql/showdoc?db=NOT&id=130916.1> "ORA-904 using EXP73 against Oracle8/8i Database"
           Note 1017276.102 </metalink/plsql/showdoc?db=NOT&id=1017276.102> "Oracle8i Export Fails on Synonym Export with EXP-8 and ORA-904"

ORA-01406: Fetched column value was truncated (EXP-8 ORA-1406 EXP-0)
           Note 163516.1 </metalink/plsql/showdoc?db=NOT&id=163516.1> "EXP-0 and ORA-1406 during Export of Object Types"

ORA-01422: Exact fetch returns more than requested number of rows
           Note 221178.1 </metalink/plsql/showdoc?db=NOT&id=221178.1> "PLS-201 and ORA-06512 at 'XDB.DBMS_XDBUTIL_INT' while Exporting Database"
           Note 256548.1 </metalink/plsql/showdoc?db=NOT&id=256548.1> "Export of Database with XDB Throws ORA-1422 Error"

ORA-01555: Snapshot too old
           Note 113450.1 </metalink/plsql/showdoc?db=NOT&id=113450.1> "When to Use CONSISTENT=Y During an Export"

ORA-04030: Out of process memory when trying to allocate %s bytes (%s,%s) (IMP-3 ORA-4030 ORA-3113)
           Note 165016.1 </metalink/plsql/showdoc?db=NOT&id=165016.1> "Corrupt Packages When Export/Import Wrapper PL/SQL Code"

ORA-06512: at "SYS.DBMS_STATS", line ... (IMP-17 IMP-3 ORA-20001 ORA-6512)
           Note 123355.1 </metalink/plsql/showdoc?db=NOT&id=123355.1> "IMP-17 and IMP-3 errors referring dbms_stats package during import"

ORA-29344: Owner validation failed - failed to match owner 'SYS'
           Note 294992.1 </metalink/plsql/showdoc?db=NOT&id=294992.1> "Import DataPump: Transport Tablespace Fails with ORA-39123 and 29344 (Failed to match owner SYS)"

ORA-29516: Aurora assertion failure: Assertion failure at %s (EXP-8 ORA-29516 EXP-0)
           Note 114356.1 </metalink/plsql/showdoc?db=NOT&id=114356.1> "Export Fails With ORA-29516 Aurora Assertion Failure EXP-8"

PLS-00103: Encountered the symbol "," when expecting one of the following ... (IMP-17 IMP-3 ORA-6550 PLS-103)
           Note 123355.1 </metalink/plsql/showdoc?db=NOT&id=123355.1> "IMP-17 and IMP-3 errors referring dbms_stats package during import"
           Note 278937.1 </metalink/plsql/showdoc?db=NOT&id=278937.1> "Import DataPump: ORA-39083 and PLS-103 when Importing Statistics Created with Non "." NLS Decimal Character"


EXPORT TOP ISSUES CAUSED BY DEFECTS
===================================
 Release   : 8.1.7.2 and below
 Problem   : Export may fail with ORA-1406 when exporting object type definitions
 Solution  : apply patch-set 8.1.7.3
 Workaround: no, see Note 163516.1 </metalink/plsql/showdoc?db=NOT&id=163516.1> "EXP-0 and ORA-1406 during Export of Object Types"

Bug 1098503 </metalink/plsql/showdoc?db=Bug&id=1098503>
 Release   : Oracle8i (8.1.x) and Oracle9i (9.x)
 Problem   : EXP-79 when Exporting Protected Tables
 Solution  : this is not a defect
 Workaround: N/A, see Note 277606.1 </metalink/plsql/showdoc?db=NOT&id=277606.1> "How to Prevent EXP-00079 or EXP-00080 Warning (Data in Table xxx is Protected) During Export"

Bug 2410612 </metalink/plsql/showdoc?db=Bug&id=2410612>
 Release   : 8.1.7.3 and higher and 9.0.1.2 and higher
 Problem   : Conventional export may produce an export file with corrupt data
 Solution  : 8.1.7.5 and 9.2.0.x or check for Patch 2410612 <http://updates.oracle.com/ARULink/PatchDetails/process_form?patch_num=2410612> (for 8.1.7.x), 2449113 (for 9.0.1.x)
 Workaround: yes, see Note 199416.1 </metalink/plsql/showdoc?db=NOT&id=199416.1> "ALERT: Client Program May Give Incorrect Query Results
             (EXP Can Produce Dump File with Corrupted Data)"

 Release   : Oracle8i (8.1.x)
 Problem   : Full database export fails with EXP-3: no storage definition found for segment
 Solution  : Oracle9i (9.x)
 Workaround: yes, see Note 124392.1 </metalink/plsql/showdoc?db=NOT&id=124392.1> "EXP-3 while exporting Rollback Segment definitions during FULL Database Export"

Bug 2900891 </metalink/plsql/showdoc?db=Bug&id=2900891>
 Release   : 9.0.1.4 and below and 9.2.0.3 and below
 Problem   : Export with 8.1.7.3 and 8.1.7.4 from Oracle9i fails with invalid identifier SPOLICY
             (EXP-8 ORA-904 EXP-0)
 Solution  : 9.2.0.4 or 9.2.0.5
 Workaround: yes, see Bug 2900891 </metalink/plsql/showdoc?db=Bug&id=2900891> how to recreate view sys.exu81rls

Bug 2685696 </metalink/plsql/showdoc?db=Bug&id=2685696>
 Release   : 9.2.0.3 and below
 Problem   : Export fails when exporting triggers in call to XDB.DBMS_XDBUTIL_INT
             (EXP-56 ORA-1422 ORA-6512)
 Solution  : 9.2.0.4 or check for Patch 2410612 <http://updates.oracle.com/ARULink/PatchDetails/process_form?patch_num=2410612> (for 9.2.0.2 and 9.2.0.3)
 Workaround: yes, see Note 221178.1 </metalink/plsql/showdoc?db=NOT&id=221178.1> "ORA-01422 ORA-06512: at "XDB.DBMS_XDBUTIL_INT" while exporting
             full database"

Bug 2919120 </metalink/plsql/showdoc?db=Bug&id=2919120>
 Release   : 9.2.0.4 and below
 Problem   : Export fails when exporting triggers in call to XDB.DBMS_XDBUTIL_INT
             (EXP-56 ORA-1422 ORA-6512)
 Solution  : 9.2.0.5 or check for Patch 2919120 <http://updates.oracle.com/ARULink/PatchDetails/process_form?patch_num=2919120> (for 9.2.0.4)
 Workaround: yes, see Note 256548.1 </metalink/plsql/showdoc?db=NOT&id=256548.1> "Export of Database with XDB Throws ORA-1422 Error"


IMPORT TOP ISSUES CAUSED BY DEFECTS
===================================
Bug 1335408 </metalink/plsql/showdoc?db=Bug&id=1335408>
 Release   : 8.1.7.2 and below
 Problem   : Bad export file using a locale with a ',' decimal seperator (IMP-17 IMP-3 ORA-6550 PLS-103)
 Solution  : apply patch-set 8.1.7.3 or 8.1.7.4
 Workaround: yes, see Note 123355.1 </metalink/plsql/showdoc?db=NOT&id=123355.1> "IMP-17 and IMP-3 errors referring DBMS_STATS package during import"

Bug 1879479 </metalink/plsql/showdoc?db=Bug&id=1879479>
 Release   : 8.1.7.2 and below and 9.0.1.2 and below
 Problem   : Export of a wrapped package can result in a corrupt package being imported
             (IMP-3 ORA-4030 ORA-3113 ORA-7445 ORA-600[16201]).
 Solution  : in Oracle8i with 8.1.7.3 and higher; in Oracle9iR1 with 9.0.1.3 and higher
 Workaround: no, see Note 165016.1 </metalink/plsql/showdoc?db=NOT&id=165016.1> "Corrupt Packages When Export/Import Wrapper PL/SQL Code"

Bug 2067904 </metalink/plsql/showdoc?db=Bug&id=2067904>
 Release   : Oracle8i (8.1.7.x) and 9.0.1.2 and below
 Problem   : Trigger-name causes call to DBMS_DDL.SET_TRIGGER_FIRING_PROPERTY to fail during Import
             (IMP-17 IMP-3 ORA-931 ORA-23308 ORA-6512).
 Solution  : in Oracle9iR1 with patchset 9.0.1.3
 Workaround: yes, see Note 239821.1 </metalink/plsql/showdoc?db=NOT&id=239821.1> "ORA-931 or ORA-23308 in SET_TRIGGER_FIRING_PROPERTY 
             on Import of Trigger in 8.1.7.x and 9.0.1.x"

Bug 2854856 </metalink/plsql/showdoc?db=Bug&id=2854856>
 Release   : Oracle8i (8.1.7.x) and 9.0.1.2 and below
 Problem   : Schema-name causes call to DBMS_DDL.SET_TRIGGER_FIRING_PROPERTY to fail during Import
             (IMP-17 IMP-3 ORA-911 ORA-6512).
 Solution  : in Oracle9iR2 with patchset 9.2.0.4
 Workaround: yes, see Note 239890.1 </metalink/plsql/showdoc?db=NOT&id=239890.1> "ORA-911 in SET_TRIGGER_FIRING_PROPERTY on Import of Trigger 
             in 8.1.7.x and Oracle9i"


4.3 SQL*Loader examples:
-=======================

SQL*Loader is used for loading data from text files into
Oracle tables. The text file can have fixed column positions
or columns separated by a special character, for example an ",".

to call sqlloader

sqlldr system/manager control=smssoft.ctl
sqlldr parfile=bonus.par

Example 1:
----------

BONUS.PAR:

userid=scott
control=bonus.ctl
bad=bonus.bad
log=bonus.log
discard=bonus.dis
rows=2
errors=2
skip=0

BONUS.CTL:

LOAD DATA
INFILE bonus.dat
APPEND
INTO TABLE BONUS
(name position(01:08) char,
city position(09:19) char,
salary position(20:22) integer external)

Now you can use the command: 
$ sqlldr parfile=bonus.par

Example 2:
----------
LOAD1.CTL:

LOAD DATA
INFILE 'PLAYER.TXT'
INTO TABLE BASEBALL_PLAYER
FIELDS TERMINATED BY ',' OPTIONALLY ENCLOSED BY '"'
  (player_id,last_name,first_name,middle_initial,start_date)

SQLLDR system/manager CONTROL=LOAD1.CTL LOG=LOAD1.LOG
 BAD=LOAD1.BAD DISCARD=LOAD1.DSC


Example 3: another controlfile:
------------------------------
SMSSOFT.CTL:

LOAD DATA
INFILE 'SMSSOFT.TXT'
TRUNCATE
INTO TABLE SMSSOFTWARE
FIELDS TERMINATED BY ',' OPTIONALLY ENCLOSED BY '"'
(DWMACHINEID, SERIALNUMBER, NAME, SHORTNAME, SOFTWARE, CMDB_ID, LOGONNAME)

Example 4: another controlfile:
-------------------------------

LOAD DATA    
INFILE      *   
BADFILE     'd:\stage\loader\load.bad'   
DISCARDFILE 'd:\stage\loader\load.dsc'   
APPEND   
INTO TABLE TEST   
FIELDS TERMINATED BY "<tab>" TRAILING NULLCOLS   
(    
c1,    
c2 char,    
c3 date(8) "DD-MM-YY"   
)   
BEGINDATA   
1<tab>X<tab>25-12-00   
2<tab>Y<tab>31-12-00  

Note: The <tab> placeholder is only for illustration purposes, in the acutal implementation, 
one would use a real tab character which is not  visible.  

- Convential path load:
When the DIRECT=Y parameter is not used, the convential path is used.
This means that essentially INSERT statements are used, 
triggers and referential integrety are in normal use, and that
the buffer cache is used.

- Direct path load:
Buffer cache is not used. Existing used blocks are not used.
New blocks are written as needed.
Referential integrety and triggers are disabled during the load.


4.4 Creation of new table on basis of existing table:
=====================================================

CREATE TABLE EMPLOYEE_2
AS SELECT * FROM EMPLOYEE

insert into t SELECT * FROM t2;

insert into DSA_IMPORT    
SELECT * FROM MDB_DW_COMPONENTEN@SALES


4.5 Copy commAND om data uit een remote database te halen:
==========================================================

set copycommit 1
set arraysize 1000
copy FROM HR/PASSWORD@loc -
create EMPLOYEE -
using
SELECT * FROM employee -
WHERE state='NM'


=======================================================
5. Add, Move AND Size Datafiles, tablespaces, logfiles:
=======================================================


5.1 ADD OR DROP REDO LOGFILE GROUP:
===================================

ADD:
----

alter database
add logfile group 4
('/db01/oracle/CC1/log_41.dbf', '/db02/oracle/CC1/log_42.dbf') size 5M;

ALTER DATABASE
ADD LOGFILE ('/oracle/dbs/log1c.rdo', '/oracle/dbs/log2c.rdo') SIZE 500K;

will create automatically a new group

ALTER DATABASE
ADD LOGFILE ('G:\ORADATA\AIRM\REDO05.LOG') SIZE 20M;

DROP:
-----

-An instance requires at least two groups of online redo log files, 
 regardless of the number of members in the groups. (A group is one or more members.) 
-You can drop an online redo log group only if it is inactive. 
 If you need to drop the current group, first force a log switch to occur. 

ALTER DATABASE DROP LOGFILE GROUP 3;

ALTER DATABASE DROP LOGFILE 'G:\ORADATA\AIRM\REDO02.LOG';


5.2 ADD REDO LOGFILE MEMBER:
============================

alter database
add logfile member '/db03/oracle/CC1/log_3c.dbf' to group 4;


5.3 RESIZE DATABASE FILE:
=========================

alter database
datafile '/db05/oracle/CC1/data01.dbf' rezise 400M; (increase or decrease size)

alter tablespace DATA
datafile '/db05/oracle/CC1/data01.dbf' rezise 400M; (increase or decrease size)

5.4 ADD FILE TO TABLESPACE:
===========================

alter tablespace DATA
add datafile '/db05/oracle/CC1/data02.dbf'
size 50M
autoextend ON
maxsize unlimited;


5.5 ALTER STORAGE FOR FILE:
===========================

alter database
datafile '/db05/oracle/CC1/data01.dbf'
autoextend ON
maxsize unlimited;

alter database datafile '/oradata/temp/temp.dbf' autoextend off;

The AUTOEXTEND option cannot be turned OFF at for the entire tablespace with 
a single command. Each datafile within the tablespace must explicitly turn off 
the AUTOEXTEND option via the ALTER DATABASE command. 

+447960585647

5.6 MOVE OF DATA FILE:
======================

connect internal
shutdown

mv /db01/oracle/CC1/data01.dbf  /db02/oracle/CC1

connect / as SYSDBA
startup mount CC1

alter database rename file
'/db01/oracle/CC1/data01.dbf' to '/db02/oracle/CC1/data01.dbf';

alter database open;


5.7 MOVE OF REDO LOG FILE:
==========================

connect internal
shutdown

mv /db05/oracle/CC1/redo01.dbf  /db02/oracle/CC1

connect / as SYSDBA
startup mount CC1

alter database rename file
'/db05/oracle/CC1/redo01.dbf' to '/db02/oracle/CC1/redo01.dbf';

alter database open;

in case of problems:

ALTER DATABASE CLEAR LOGFILE GROUP n


example:
--------

shutdown immediate

op Unix:
mv /u01/oradata/spltst1/redo01.log /u02/oradata/spltst1/
mv /u03/oradata/spltst1/redo03.log /u02/oradata/spltst1/

startup mount pfile=/apps/oracle/admin/SPLTST1/pfile/init.ora

alter database rename file
'/u01/oradata/spltst1/redo01.log' to '/u02/oradata/spltst1/redo01.log';

alter database rename file
'/u03/oradata/spltst1/redo03.log' to '/u02/oradata/spltst1/redo03.log';

alter database open;


5.8 Put a datafile or tablespace ONLINE or OFFLINE:
===================================================

alter tablespace data offline;
alter tablespace data online;

alter database datafile 8 offline;
alter database datafile 8 online;


5.9 ALTER DEFAULT STORAGE:
==========================

alter tablespace AP_INDEX_SMALL
default storage (initial 5M next 5M pctincrease 0);


5.10 CREATE TABLESPACE STORAGE PARAMETERS:
==========================================

locally managed 9i style:

-- autoallocate:
----------------

CREATE TABLESPACE DEMO DATAFILE '/u02/oracle/data/lmtbsb01.dbf' size 100M
extent management local autoallocate; 

-- uniform size, 1M is default:
-------------------------------

CREATE TABLESPACE LOBS DATAFILE 'f:\oracle\oradata\pegacc\lobs01.dbf' SIZE 3000M
EXTENT MANAGEMENT LOCAL UNIFORM SIZE 64K;

CREATE TABLESPACE LOBS2 DATAFILE 'f:\oracle\oradata\pegacc\lobs02.dbf' SIZE 3000M
EXTENT MANAGEMENT LOCAL UNIFORM SIZE 1M;

CREATE TABLESPACE CISTS_01 DATAFILE '/u04/oradata/pilactst/cists_01.dbf' SIZE 1000M
EXTENT MANAGEMENT LOCAL UNIFORM SIZE 128K;

CREATE TABLESPACE CISTS_01 DATAFILE '/u01/oradata/spldev1/cists_01.dbf' SIZE 400M
EXTENT MANAGEMENT LOCAL UNIFORM SIZE 128K;


CREATE TABLESPACE CISTS_01 DATAFILE '/u07/oradata/spldevp/cists_01.dbf' SIZE 1200M
EXTENT MANAGEMENT LOCAL UNIFORM SIZE 128K;

CREATE TABLESPACE USERS DATAFILE '/u06/oradata/splpack/users01.dbf' SIZE 50M
EXTENT MANAGEMENT LOCAL UNIFORM SIZE 128K;

CREATE TABLESPACE INDX DATAFILE '/u06/oradata/splpack/indx01.dbf' SIZE 100M
EXTENT MANAGEMENT LOCAL UNIFORM SIZE 

CREATE TEMPORARY TABLESPACE TEMP TEMPFILE '/u07/oradata/spldevp/temp01.dbf' 
SIZE 200M 
EXTENT MANAGEMENT LOCAL UNIFORM SIZE 10M;

ALTER DATABASE DEFAULT TEMPORARY TABLESPACE TEMP;


ALTER TABLESPACE CISTS_01 
ADD DATAFILE '/u03/oradata/splplay/cists_02.dbf' SIZE 1000M
EXTENT MANAGEMENT LOCAL UNIFORM SIZE 128K;

alter tablespace DATA
add datafile '/db05/oracle/CC1/data02.dbf'
size 50M
autoextend ON
maxsize unlimited;

-- segment management manual or automatic:
-- ---------------------------------------

We can have a locally managed tablespace, but the segment space management, 
via the free lists and the pct_free and pct_used parameters, 
be still used manually.

To specify manual space management, use the SEGMENT SPACE MANAGEMENT MANUAL clause

CREATE TABLESPACE INDX2 DATAFILE '/u06/oradata/bcict2/indx09.dbf' SIZE 5000M
EXTENT MANAGEMENT LOCAL AUTOALLOCATE
SEGMENT SPACE MANAGEMENT MANUAL;

or if you want segement space management to be automatic:

CREATE TABLESPACE INDX2 DATAFILE '/u06/oradata/bcict2/indx09.dbf' SIZE 5000M
EXTENT MANAGEMENT LOCAL AUTOALLOCATE
SEGMENT SPACE MANAGEMENT AUTO;


-- temporary tablespace:
------------------------

CREATE TEMPORARY TABLESPACE TEMP TEMPFILE '/u04/oradata/pilactst/temp01.dbf' 
SIZE 200M 
EXTENT MANAGEMENT LOCAL UNIFORM SIZE 10M;

create user cisadm identified by cisadm
default tablespace cists_01
temporary tablespace temp;

create user cisuser identified by cisuser
default tablespace cists_01
temporary tablespace temp;

create user cisread identified by cisread
default tablespace cists_01
temporary tablespace temp;

grant connect to cisadm;
grant connect to cisuser;
grant connect to cisread;

grant resource to cisadm;
grant resource to cisuser;
grant resource to cisread;


CREATE TEMPORARY TABLESPACE TEMP TEMPFILE '/u04/oradata/bcict2/tempt01.dbf' 
SIZE 5000M 
EXTENT MANAGEMENT LOCAL UNIFORM SIZE 100M;

alter tablespace TEMP add tempfile '/u04/oradata/bcict2/temp02.dbf'
SIZE 5000M;

alter tablespace UNDO add file '/u04/oradata/bcict2/undo07.dbf' size 500M;

ALTER DATABASE datafile '/u04/oradata/bcict2/undo07.dbf' RESIZE 3000M;

CREATE TEMPORARY TABLESPACE TEMP2 TEMPFILE '/u04/oradata/bcict2/temp01.dbf' 
SIZE 5000M 
EXTENT MANAGEMENT LOCAL UNIFORM SIZE 100M;


ALTER TABLESPACE TEMP
ADD TEMPFILE '/u04/oradata/bcict2/tempt4.dbf'  SIZE 5000M;

         1 /u03/oradata/bcict2/temp.dbf         
         2 /u03/oradata/bcict2/temp01.dbf       
         3 /u03/oradata/bcict2/temp02.dbf       

ALTER DATABASE TEMPFILE '/u02/oracle/data/lmtemp02.dbf' DROP 
INCLUDING DATAFILES;


The extent management clause is optional for temporary tablespaces because all temporary tablespaces 
are created with locally managed extents of a uniform size. The Oracle default for SIZE is 1M. 
But if you want to specify another value for SIZE, you can do so as shown in the above statement.

The AUTOALLOCATE clause is not allowed for temporary tablespaces.

If you get errors:
------------------

If the controlfile does not have any reference to the tempfile(s),  
add the tempfile(s): 
 
SQL> SET lines 200 
SQL> SELECT status, enabled, name FROM v$tempfile; 
no rows selected 
 
SQL> ALTER TABLESPACE temp ADD TEMPFILE 'M:\ORACLE\ORADATA\M9204WA\TEMP01.DBF' REUSE; 
 
or:  
 
If the controlfile has a reference to the tempfile(s), but the files are 
missing on disk, re-create the temporary tablespace, e.g.: 
 
SQL> SET lines 200 
SQL> CREATE TEMPORARY TABLESPACE temp2 TEMPFILE 
    'M:\ORACLE\ORADATA\M9204WA\TEMP201.DBF' SIZE 100m AUTOEXTEND ON  
     NEXT 100M MAXSIZE 2000M; 
SQL> ALTER DATABASE DEFAULT TEMPORARY TABLESPACE temp2; 
SQL> DROP TABLESPACE temp; 
SQL> CREATE TEMPORARY TABLESPACE temp TEMPFILE 
     'M:\ORACLE\ORADATA\M9204WA\TEMP01.DBF' SIZE 100m AUTOEXTEND ON  
     NEXT 100M MAXSIZE 2000M; 
SQL> ALTER DATABASE DEFAULT TEMPORARY TABLESPACE temp; 
SQL> SHUTDOWN IMMEDIATE 
SQL> STARTUP 
SQL> DROP TABLESPACE temp2 INCLUDING CONTENTS AND DATAFILES; 


-- undo tablespace:
-- ----------------

CREATE UNDO TABLESPACE undotbs_02
DATAFILE '/u01/oracle/rbdb1/undo0201.dbf' SIZE 2M REUSE AUTOEXTEND ON;

ALTER SYSTEM SET UNDO_TABLESPACE = undotbs_02;


-- ROLLBACK TABLESPACE:
-- --------------------

create tablespace RBS
 datafile '/disk01/oracle/oradata/DB1/rbs01.dbf' size 25M
 default storage (
  initial     500K
  next        500K
  pctincrease 0
  minextents  2  );


#######################################################################################


CREATE TABLESPACE "DRSYS" LOGGING DATAFILE '/u02/oradata/pegacc/drsys01.dbf' 
SIZE 20M REUSE AUTOEXTEND ON NEXT 1024K MAXSIZE UNLIMITED 
EXTENT MANAGEMENT LOCAL SEGMENT SPACE MANAGEMENT  AUTO ;

CREATE TABLESPACE "INDX" LOGGING DATAFILE '/u02/oradata/pegacc/indx01.dbf' 
SIZE 100M REUSE AUTOEXTEND ON NEXT  1024K MAXSIZE UNLIMITED 
EXTENT MANAGEMENT LOCAL SEGMENT SPACE MANAGEMENT  AUTO ;

CREATE TABLESPACE "TOOLS" LOGGING DATAFILE '/u02/oradata/pegacc/tools01.dbf' 
SIZE 100M REUSE AUTOEXTEND ON NEXT 1024K MAXSIZE UNLIMITED 
EXTENT MANAGEMENT LOCAL SEGMENT SPACE MANAGEMENT  AUTO ;

CREATE TABLESPACE "USERS" LOGGING DATAFILE '/u02/oradata/pegacc/users01.dbf' 
SIZE 1000M REUSE AUTOEXTEND ON NEXT 1024K MAXSIZE UNLIMITED 
EXTENT MANAGEMENT LOCAL SEGMENT SPACE MANAGEMENT  AUTO ;

CREATE TABLESPACE "XDB" LOGGING DATAFILE '/u02/oradata/pegacc/xdb01.dbf' 
SIZE 20M REUSE AUTOEXTEND ON NEXT 1024K MAXSIZE UNLIMITED 
EXTENT MANAGEMENT LOCAL SEGMENT SPACE MANAGEMENT  AUTO ;

CREATE TABLESPACE "LOBS" LOGGING DATAFILE '/u02/oradata/pegacc/lobs01.dbf' 
SIZE 2000M REUSE AUTOEXTEND ON NEXT 1024K MAXSIZE UNLIMITED 
EXTENT MANAGEMENT LOCAL UNIFORM SIZE 1M ;

#######################################################################################


General form of a 8i type statement:

CREATE TABLESPACE DATA
DATAFILE 'G:\ORADATA\RCDB\DATA01.DBF' size 100M
EXTENT MANAGEMENT DICTIONARY
default storage (
                  initial 512K
                  next    512K
                  minextents 1
                  pctincrease 0 )
minimum extent 512K
logging
online
peRMANENTt;

More info:
----------

By declaring a tablespace as DICTIONARY managed, you are specifying that extent management for segments 
in this tablespace will be managed using the dictionary tables sys.fet$ and sys.uet$. Oracle updates 
these tables in the data dictionary whenever an extent is allocated, or freed for reuse. This is the default 
in Oracle8i when no extent management clause is used in the CREATE TABLESPACE statement. 
The sys.fet$ table is clustered in the C_TS# cluster. Because it is created without a SIZE clause, one block 
will be reserved in the cluster for each tablespace. Although, if a tablespace has more free extents 
than can be contained in a single cluster block, then cluster block chaining will occur which can significantly 
impact performance on the data dictionary and space management transactions in particular. Unfortunately, 
chaining in this cluster cannot be repaired without recreating the entire database. Preferably, the number 
of free extents in a tablespace should never be greater than can be recorded in the primary cluster block 
for that tablespace, which is about 500 free extents for a database with an 8K database block size. 

Used extents, on the other hand, are recorded in the data dictionary table sys.uet$, which is clustered in the 
C_FILE#_BLOCK# cluster. Unlike the C_TS# cluster, C_FILE#_BLOCK# is sized on the assumption that segments 
will have an average of just 4 or 5 extents each. Unless your data dictionary was specifically 
customized prior to database creation to allow for more used extents per segment, then creating segments 
with thousands of extents (like mentioned in the previous section) will cause excessive cluster block chaining 
in this cluster. The major dilemma with an excessive number of used and/or free extents is that they can 
misrepresent the operations of the dictionary cache LRU mechanism. Extents should therefore not be allowed to grow 
into the thousands, not because of the impact of full table scans, but rather the performance of the data dictionary 
and dictionary cache. 


A Locally Managed Tablespace is a tablespace that manages its own extents by maintaining a bitmap in each 
datafile to keep track of the free or used status of blocks in that datafile. Each bit in the bitmap corresponds 
to a block or a group of blocks. When the extents are allocated or freed for reuse, Oracle simply changes 
the bitmap values to show the new status of the blocks. These changes do not generate rollback information 
because they do not update tables in the data dictionary (except for tablespace quota information). This is the 
default in Oracle9i. If COMPATIBLE is set to 9.0.0, then the default extent management for any new tablespace is 
locally managed in Oracle9i. If COMPATIBLE is less than 9.0.0, then the default extent management for any 
new tablespace is dictionary managed in Oracle9i. 
While free space is represented in a bitmap within the tablespace, used extents are only recorded in the 
extent map in the segment header block of each segment, and if necessary, in additional extent map blocks 
within the segment. 

Keep in mind though, that this information is not cached in the dictionary cache. It must be obtained from the 
database block every time that it is required, and if those blocks are not in the buffer cache, 
that involves I/O and potentially lots of it. Take for example a query against DBA_EXTENTS. This query would 
be required to read every segment header and every additional extent map block in the entire database. 
It is for this reason that it is recommended that the number of extents per segment in locally managed tablespaces 
be limited to the number of rows that can be contained in the extent map with the segment header block. 
This would be approximately - (db_block_size / 16) - 7. For a database with a db block size of 8K, 
the above formula would be 505 extents. 


5.11 DEALLOCATE EN OPSPOREN VAN UNUSED SPACE IN EEN TABLE:
==========================================================

alter table emp
deallocate unused;

alter table emp
deallocate unused
keep 100K;

alter table emp
allocate extent (
  size 100K
  datafile '/db05/oradata/CC1/user05.dbf');

Deze datafile moet in dezelfde tablespace bestaan.

-- gebruik van de dbms_space.unused_space package

declare
var1 number;
var2 number;
var3 number;
var4 number;
var5 number; 
var6 number;
var7 number;

begin
dbms_space.unused_space('AUTOPROV1', 'MACADDRESS_INDEX', 'INDEX', 
           var1, var2, var3, var4, var5, var6, var7);
dbms_output.put_line('OBJECT_NAME = NOG ZON SLECHTE INDEX');
dbms_output.put_line('TOTAL_BLOCKS ='||var1);
dbms_output.put_line('TOTAL_BYTES ='||var2);
dbms_output.put_line('UNUSED_BLOCKS ='||var3);
dbms_output.put_line('UNUSED_BYTES ='||var4);
dbms_output.put_line('LAST_USED_EXTENT_FILE_ID ='||var5);
dbms_output.put_line('LAST_USED_EXTENT_BLOCK_ID ='||var6);
dbms_output.put_line('LAST_USED_BLOCK ='||var7);

end;
/


5.12 CREATE TABLE:
==================

-- STORAGE PARAMETERS EXAMPLE:
-- ---------------------------

create table emp
(
id number,
name varchar(2)
)
tablespace users
pctfree 10
storage 
   (initial 1024K
    next    1024K
    pctincrease 10
    minextents 2);


ALTER a COLUMN:
===============

ALTER TABLE GEWEIGERDETRANSACTIE
MODIFY (VERBRUIKTIJD DATE);


-- Creation of new table on basis of existing table:
-- -------------------------------------------------

CREATE TABLE EMPLOYEE_2
AS SELECT * FROM EMPLOYEE

insert into t SELECT * FROM t2;

insert into DSA_IMPORT    
SELECT * FROM MDB_DW_COMPONENTEN@SALES


-- Creation of a table with an autoincrement:
-- ------------------------------------------

CREATE SEQUENCE seq_customer
  INCREMENT BY 1
  START WITH 1
  MAXVALUE 99999
  NOCYCLE;


CREATE TABLE CUSTOMER ( 
  CUSTOMER_ID   NUMBER (10)    NOT NULL, 
  NAAM          VARCHAR2 (30)  NOT NULL, 
  CONSTRAINT PK_CUSTOMER
  PRIMARY KEY ( CUSTOMER_ID ) 
    USING INDEX 
     TABLESPACE INDX PCTFREE 10
     STORAGE ( INITIAL 16K NEXT 16K PCTINCREASE 0 )) 
 TABLESPACE USERS
   PCTFREE 10   PCTUSED 40
   INITRANS 1   MAXTRANS 255
 STORAGE ( 
   INITIAL 80K NEXT 80K PCTINCREASE 0
   MINEXTENTS 1 MAXEXTENTS 2147483645 )
   NOCACHE; 


CREATE OR REPLACE TRIGGER tr_CUSTOMER_ins
BEFORE INSERT ON CUSTOMER FOR EACH ROW
BEGIN
	SELECT seq_customer.NEXTVAL INTO :NEW.CUSTOMER_ID FROM dual;
END;


CREATE SEQUENCE seq_brains_verbruik
  INCREMENT BY 1
  START WITH 1750795
  MAXVALUE 100000000
  NOCYCLE;

CREATE OR REPLACE TRIGGER tr_PARENTEENHEID_ins
BEFORE INSERT ON PARENTEENHEID FOR EACH ROW
BEGIN
	SELECT seq_brains_verbruik.NEXTVAL INTO :NEW.VERBRUIKID FROM dual;
END;


5.13 REBUILD OF INDEX:
======================

ALTER INDEX emp_pk
REBUILD			-- online 8.16 or higher
NOLOGGING
TABLESPACE INDEX_BIG
PCTFREE 10
STORAGE ( INITIAL 5M
          NEXT    5M
          pctincrease 0
        );

ALTER INDEX emp_ename
      INITRANS 5
      MAXTRANS 10
      STORAGE (PCTINCREASE 50);

In situations where you have B*-tree index leaf blocks that can be freed up for reuse, you can merge 
those leaf blocks using the following statement: 

ALTER INDEX vmoore COALESCE;

DROP INDEX emp_ename:

-- Basic example of creating an index:

CREATE INDEX emp_ename ON emp(ename)
      TABLESPACE users
      STORAGE (INITIAL 20K
      NEXT 20k
      PCTINCREASE 75)
      PCTFREE 0;

If you have a LMT, you can just do:

create index cust_indx on customers(id) nologging;

This statement is without storage parameters.

-- Dropping an index:

DROP INDEX emp_ename:


5.14 MOVE TABLE TO OTHER TABLESPACE:
====================================

ALTER TABLE CHARLIE.CUSTOMERS MOVE TABLESPACE USERS2


5.15 SYNONYM (pointer to an object):
====================================

example:
create public synonym EMPLOYEE for HARRY.EMPLOYEE;

5.16 DATABASE LINK:
===================

CREATE PUBLIC DATABASE LINK SALESLINK
CONNECT TO FRONTEND IDENTIFIED BY cygnusx1
USING 'SALES';

SELECT * FROM employee@MY_LINK;

For example, using a database link to database sales.division3.acme.com, 
a user or application can reference remote data as follows:

SELECT * FROM scott.emp@sales.division3.acme.com;  # emp table in scott's schema
SELECT loc FROM scott.dept@sales.division3.acme.com;


If GLOBAL_NAMES is set to FALSE, then you can use any name for the link to sales.division3.acme.com. 
For example, you can call the link foo. Then, you can access the remote database as follows:

SELECT name FROM scott.emp@foo;  # link name different FROM global name

Synonyms for Schema Objects:

Oracle lets you create synonyms so that you can hide the database link name FROM the user. 
A synonym allows access to a table on a remote database using the same syntax that you would use 
to access a table on a local database. For example, assume you issue the following query 
against a table in a remote database:

SELECT * FROM emp@hq.acme.com;


You can create the synonym emp for emp@hq.acme.com 
so that you can issue the following query instead to access the same data:

SELECT * FROM emp;

View DATABASE LINKS:

select substr(owner,1,10), substr(db_link,1,50), substr(username,1,25),
substr(host,1,40), created from dba_db_links


5.17 TO CLEAR TABLESPACE TEMP:
==============================

alter tablespace TEMP default storage (pctincrease 0);
alter session set events 'immediate trace name DROP_SEGMENTS level TS#+1';


5.18 RENAME OF OBJECT:
======================

RENAME sales_staff TO dept_30; 
RENAME emp2 TO emp;


5.19 CREATE PROFILE:
====================

CREATE PROFILE DEVELOP_FIN LIMIT
    SESSIONS_PER_USER 4
    IDLE_TIME 30;

CREATE PROFILE PRIOLIMIT LIMIT
SESSIONS_PER_USER 10;
    
ALTER USER U_ZKN
     PROFILE EXTERNLIMIT;

ALTER PROFILE EXTERNLIMIT 
   LIMIT PASSWORD_REUSE_TIME 90 
   PASSWORD_REUSE_MAX UNLIMITED;

ALTER PROFILE EXTERNLIMIT 
LIMIT SESSIONS_PER_USER 20
    IDLE_TIME 20;


5.20 RECOMPILE OF FUNCTION, PACKAGE, PROCEDURE:
===============================================

ALTER FUNCTION schema.function COMPILE;
example: ALTER FUNCTION oe.get_bal COMPILE; 

ALTER PACKAGE schema.package COMPILE specification/body/package
example ALTER PACKAGE emp_mgmt COMPILE PACKAGE; 

ALTER PROCEDURE schema.procedure COMPILE;
example ALTER PROCEDURE hr.remove_emp COMPILE; 

TO FIND OBJECTS:

SELECT 'ALTER '||decode( object_type,
                        'PACKAGE SPECIFICATION'
                       ,'PACKAGE'
                       ,'PACKAGE BODY'
                       ,'PACKAGE'
                       ,object_type)
                ||' '||owner
                ||'.'|| object_name ||' COMPILE '
                ||decode( object_type,
                        'PACKAGE SPECIFICATION'
                       ,'SPECIFACTION'
                       ,'PACKAGE BODY'
                       ,'BODY'
                       , NULL)  ||';'
    FROM dba_objects WHERE status = 'INVALID';


5.21 CREATE PACKAGE:
====================

A package is a set of related functions and / or routines. 
Packages are used to group together PL/SQL code blocks which make up a common application 
or are attached to a single business function. Packages consist of a specification and a body. 
The package specification lists the public interfaces to the blocks within the package body. 
The package body contains the public and private PL/SQL blocks which make up the application, 
private blocks are not defined in the package specification and cannot be called by any routine
other than one defined within the package body. 
The benefits of packages are that they improve the organisation of procedure 
and function blocks, allow you to update the blocks that make up the package body 
without affecting the specification (which is the object that users have rights to) 
and allow you to grant execute rights once instead of for each and every block.

To create a package specification we use a variation on the CREATE command, 
all we need put in the specification is each PL/SQL block header that will 
be public within the package. An example follows :-


CREATE OR REPLACE PACKAGE MYPACK1 AS
PROCEDURE MYPROC1 (REQISBN IN NUMBER, MYVAR1 IN OUT CHAR,TCOST OUT NUMBER);
FUNCTION MYFUNC1;
END MYPACK1;


To create a package body we now specify each PL/SQL block that makes up the package, 
note that we are not creating these blocks separately (no CREATE OR REPLACE is 
required for the procedure and function definitions). An example follows :-

CREATE OR REPLACE PACKAGE BODY MYPACK1 AS
PROCEDURE MYPROC1
(REQISBN IN NUMBER,
MYVAR1 IN OUT CHAR,
TCOST OUT NUMBER)
TEMP_COST NUMBER(10,2))
IS BEGIN
   SELECT COST FROM JD11.BOOK INTO TEMP_COST WHERE ISBN = REQISBN;
   IF TEMP_COST > 0 THEN
      UPDATE JD11.BOOK SET COST = (TEMP_COST*1.175) WHERE ISBN = REQISBN;
   ELSE 
      UPDATE JD11.BOOK SET COST = 21.32 WHERE ISBN = REQISBN;
   END IF; 
   TCOST := TEMP_COST;
   COMMIT;
EXCEPTION
   WHEN NO_DATA_FOUND THEN
      INSERT INTO JD11.ERRORS (CODE, MESSAGE) VALUES(99, 'ISBN NOT FOUND');
END MYPROC1;
FUNCTION MYFUNC1
RETURN NUMBER
IS RCOST NUMBER(10,2); 
BEGIN
   SELECT COST FROM JD11.BOOK INTO RCOST WHERE ISBN = 21;
   RETURN (RCOST);
END MYFUNC1;
END MYPACK1;

You can execute a public package block like this :-
EXECUTE :PCOST := JD11.MYPACK1.MYFUNC1 - WHERE JD11 is the schema name that 
owns the package. You can use DROP PACKAGE and DROP PACKAGE BODY to remove the package objects FROM the database.


CREATE OR REPLACE PACKAGE schema.package

CREATE PACKAGE emp_mgmt AS
   FUNCTION hire (last_name VARCHAR2, job_id VARCHAR2,
 manager_id NUMBER, salary NUMBER, 
 commission_pct NUMBER, department_id NUMBER)
 RETURN NUMBER;
 FUNCTION create_dept(department_id NUMBER, location NUMBER)
 RETURN NUMBER;
 PROCEDURE remove_emp(employee_id NUMBER);
 PROCEDURE remove_dept(department_id NUMBER);
 PROCEDURE increase_sal(employee_id NUMBER, salary_incr NUMBER);
 PROCEDURE increase_comm(employee_id NUMBER, comm_incr NUMBER);
 no_comm EXCEPTION;
 no_sal EXCEPTION;
END emp_mgmt;
/

Before you can call this package's procedures and functions, 
you must define these procedures and functions in the package body. 


5.22 View a view:
=================

set long 2000

SELECT text                 
FROM sys.dba_views                 
WHERE view_name = 'CONTROL_PLAZA_V'; 


5.23 ALTER SYSTEM:
==================

ALTER SYSTEM CHECKPOINT;
ALTER SYSTEM ENABLE/DISABLE RESTRICTED SESSION;
ALTER SYSTEM FLUSH SHARED_POOL;
ALTER SYSTEM SWITCH LOGFILE;
ALTER SYSTEM SUSPEND/RESUME;
ALTER SYSTEM SET RESOURCE_LIMIT = TRUE;
ALTER SYSTEM SET LICENSE_MAX_USERS = 300;
ALTER SYSTEM SET GLOBAL_NAMES=FALSE;
ALTER SYSTEM SET COMPATIBLE = '9.2.0' SCOPE=SPFILE;


5.24 HOW TO ENABLE OR DISABLE TRIGGERS:
=======================================

Disable enable trigger:

  ALTER TRIGGER Reorder DISABLE;
  ALTER TRIGGER Reorder ENABLE;

Or in 1 time for all triggers on a table:

  ALTER TABLE Inventory
  DISABLE ALL TRIGGERS;


5.25 DIASABLING AND ENABLING AN INDEX:
======================================

alter index HEAT_CUSTOMER_POSTAL_CODE unusable;
alter index HEAT_CUSTOMER_POSTAL_CODE rebuild;


5.26 CREATE A VIEW:
===================

CREATE VIEW v1 AS SELECT
       LPAD(' ',40-length(size_tab.size_col)/2,' ') size_col 
       FROM size_tab;

CREATE VIEW X 
AS
SELECT * FROM gebruiker@aptest 

5.27 MAKE A USER:
=================

CREATE USER jward
    IDENTIFIED BY aZ7bC2
    DEFAULT TABLESPACE data_ts
    QUOTA 100M ON test_ts
    QUOTA 500K ON data_ts
    TEMPORARY TABLESPACE temp_ts
    PROFILE clerk;
GRANT connect TO jward;


create user jaap identified by jaap
default tablespace users
temporary tablespace temp;

grant connect to jaap;

grant resource to jaap;

Dynamic queries:
----------------

-- CREATE USER AND GRANT PERMISSION STATEMENTS
-- dynamic querieS

SELECT 'CREATE USER '||USERNAME||' identified by '||USERNAME||' default tableSpace '||
DEFAULT_TABLESPACE||' temporary tableSpace '||TEMPORARY_TABLESPACE||';'
FROM DBA_USERS 
WHERE USERNAME NOT IN ('SYS','SYSTEM','OUTLN','CTXSYS','ORDSYS','MDSYS');

SELECT 'GRANT CREATE SeSSion to '||USERNAME||';' FROM DBA_USERS
WHERE USERNAME NOT IN ('SYS','SYSTEM','OUTLN','CTXSYS','ORDSYS','MDSYS');

SELECT 'GRANT connect to '||USERNAME||';' FROM DBA_USERS
WHERE USERNAME NOT IN ('SYS','SYSTEM','OUTLN','CTXSYS','ORDSYS','MDSYS');

SELECT 'GRANT reSource to '||USERNAME||';' FROM DBA_USERS
WHERE USERNAME NOT IN ('SYS','SYSTEM','OUTLN','CTXSYS','ORDSYS','MDSYS');

SELECT 'GRANT unlimited tableSpace to '||USERNAME||';' FROM DBA_USERS
WHERE USERNAME NOT IN ('SYS','SYSTEM','OUTLN','CTXSYS','ORDSYS','MDSYS');


Becoming another user:
======================

- Do the query:

select 'ALTER USER '||username||' IDENTIFIED BY VALUES '||''''||password||''''||';'
from dba_users;

- change the password
- do what you need to do as the other account
- change the password back to the original value


-- grant <other roles or permissions> to <user>


5.28 CREATE A SEQUENCE:
=======================

Sequences are database objects from which multiple users can generate unique integers. 
You can use sequences to automatically generate primary key values. 

CREATE SEQUENCE           <sequence name> 
          INCREMENT BY    <increment number>
          START WITH      <start number>
          MAXVALUE        <maximum value>  
          CYCLE ;


CREATE SEQUENCE department_seq
  INCREMENT BY 1
  START WITH 1
  MAXVALUE 99999
  NOCYCLE;


5.29 STANDARD USERS IN 9i:
==========================

CTXSYS is the primary schema for interMedia. 
MDSYS, ORDSYS, and ORDPLUGINS are schemas required when installing any of the cartridges. 
MTSSYS is required for the Oracle Service for MTS and is specific to NT. 
OUTLN is an integral part of the database required for the plan stability feature in Oracle8i. 

While the interMedia and cartridge schemas can be recreated by running their associated 
scripts as needed, I am not 100% on the steps associated with the MTSSYS user. 

Unfortunately, the OUTLN user is created at database creation time when sql.bsq is run. 
The OUTLN user owns the package OUTLN_PKG which is used to manage stored outlines 
and their outline categories. 
There are other tables (base tables), indexes, grants, and synonyms related to this package. 


By default, are automatically created during database creation :    
SCOTT  by script $ORACLE_HOME/rdbms/admin/utlsampl.sql   
OUTLN  by script $ORACLE_HOME/rdbms/admin/sql.bsq   
Optionally:    
DBSNMP                    if Enterprise Manager Intelligent Agent is installed    
TRACESVR                  if Enterprise Manager is installed   
AURORA$ORB$UNAUTHENTICATED \   
AURORA$JIS$UTILITY$         -- if Oracle Servlet Engine (OSE) is installed   
OSE$HTTP$ADMIN             /   
MDSYS                     if Oracle Spatial option is installed    
ORDSYS                    if interMedia Audio option is installed     
ORDPLUGINS                if interMedia Audio option is installed     
CTXSYS                    if Oracle Text option is installed    
REPADMIN                  if Replication Option is installed    
LBACSYS                   if Oracle Label Security option is installed    
ODM                       if Oracle Data Mining option is installed   
ODM_MTR                   idem   
OLAPSYS                   if OLAP option is installed   
WMSYS                     if Oracle Workspace Manager script owmctab.plb is executed.   
ANONYMOUS                 if catqm.sql catalog script for SQL XML management   
XDB                       is executed 


====================================================
ORACLE INSTALLATIONS ON SOLARIS, LINUX, AIX, VMS:
====================================================

6: Install on Solaris
7: Install on Linux
8: Install on OpenVMS
9: Install on AIX


==================================
6.1. Install Oracle 92 on Solaris:
==================================


6.1 Tutorial 1:
===============


Short Guide to install Oracle 9.2.0 on SUN Solaris 8 

--------------------------------------------------------------------------------
 
The Oracle 9i Distribution can be found on Oracle Technet (http://technet.oracle.com)

The following, short Installation Guide shows how to install Oracle 9.2.0 for SUN Solaris 8. 
You may download our scripts to create a database, we suggest this way and NOT using DBASSIST. Besides this scripts, 
you can download our SQLNET configuration files TNSNAMES.ORA. LISTENER.ORA and SQLNET.ORA.

Check Hardware Requirements
Operating System Software Requirements
Java Runtime Environment (JRE)
Check Software Limits
Setup the Solaris Kernel
Create Unix Group �dba�
Create Unix User �oracle�
Setup ORACLE environment ($HOME/.profile) as follows
Install from CD-ROM ...
... or Unpacking downloaded installation files
Check oraInst.loc File
Install with Installer in interactive mode
Create the Database
Start Listener
Automatically Start / Stop the Database
Install Oracle Options (optional)
Download Scripts for Sun Solaris

For our installation, we used the following ORACLE_HOME and ORACLE_SID, please adjust these parameters 
for your own environment.

ORACLE_HOME = /opt/oracle/product/9.2.0 

ORACLE_SID = TYP2 

--------------------------------------------------------------------------------
 
 Check Hardware Requirements

Minimal Memory: 256 MB
Minimal Swap Space: Twice the amount of the RAM

To determine the amount of RAM memory installed on your system, enter the following command.

$ /usr/sbin/prtconf

To determine the amount of SWAP installed on your system, enter the following command and multiply 
the BLOCKS column by 512.

$ swap -l

Use the latest kernel patch from Sun Microsystems (http://sunsolve.sun.com)

 Operating System Software Requirements

Use the latest kernel patch from Sun Microsystems.

- Download the Patch from: http://sunsolve.sun.com
- Read the README File included in the Patch
- Usually the only thing you have to do is:

$ cd <patch cluster directory>
$ ./install_custer
$ cat /var/sadm/install_data/<luster name>_log
$ showrev -p 

- Reboot the system

To determine your current operating system information:

$ uname -a

To determine which operating system patches are installed:

$ showrev -p

To determine which operating system packages are installed:

$ pkginfo -i [package_name]

To determine if your X-windows system is working properly on your local system, but you can redirect the X-windows 
output to another system.

$ xclock

To determine if you are using the correct system executables:

$ /usr/bin/which make
$ /usr/bin/which ar
$ /usr/bin/which ld
$ /usr/bin/which nm

Each of the four commands above should point to the /usr/ccs/bin directory. If not, add /usr/ccs/bin to the 
beginning of the PATH environment variable in the current shell.

 Java Runtime Environment (JRE)

The JRE shipped with Oracle9i is used by Oracle Java applications such as the Oracle Universal Installer 
is the only one supported. You should not modify this JRE, unless it is done through a patch provided by 
Oracle Support Services. The inventory can contain multiple versions of the JRE, each of which can be used 
by one or more products or releases. The Installer creates the oraInventory directory 
the first time it is run to keep an inventory of products that it installs on your system as well as other 
installation information. The location of oraInventory is defined in /var/opt/oracle/oraInst.loc. 
Products in an ORACLE_HOME access the JRE through a symbolic link in $ORACLE_HOME/JRE to the actual location 
of a JRE within the inventory. You should not modify the symbolic link. 

 Check Software Limits

Oracle9i includes native support for files greater than 2 GB. Check your shell to determine 
whether it will impose a limit. 

To check current soft shell limits, enter the following command:
$ ulimit -Sa

To check maximum hard limits, enter the following command:
$ ulimit -Ha

The file (blocks) value should be multiplied by 512 to obtain the maximum file size imposed by the shell. 
A value of unlimited is the operating system default and is the maximum value of 1 TB.

 Setup the Solaris Kernel

Set to the sum of the PROCESSES parameter for each Oracle database, adding the largest one twice, then add 
an additional 10 for each database. 
For example, consider a system that has three Oracle instances with the PROCESSES parameter 
in their initSID.ora files set to the following values: 

ORACLE_SID=TYP1, PROCESSES=100
ORACLE_SID=TYP2, PROCESSES=100
ORACLE_SID=TYP3, PROCESSES=200

The value of SEMMNS is calculated as follows: 

SEMMNS = [(A=100) + (B=100)] + [(C=200) * 2] + [(# of instances=3) * 10] = 630 

Setting parameters too high for the operating system can prevent the machine from booting up. 
Refer to Sun Microsystems Sun SPARC Solaris system administration documentation for parameter limits. 

*
* Kernel Parameters on our SUN Enterprise with 640MB for Oracle 9
*
set shmsys:shminfo_shmmax=4294967295
set shmsys:shminfo_shmmin=1
set shmsys:shminfo_shmmni=100
set shmsys:shminfo_shmseg=10
set semsys:seminfo_semmni=100
set semsys:seminfo_semmsl=100
set semsys:seminfo_semmns=2500
set semsys:seminfo_semopm=100
set semsys:seminfo_semvmx=32767

  -- remarks:

  The parameter for shared memory (shminfo_shmmax) can be set to the maximum value; it will not impact Solaris in any way. 
  The values for semaphores (seminfo_semmni and seminfo_semmns) depend on the number of clients you want to collect 
  data from. 
  As a rule of the thumb, the values should be set to at least (2*nr of clients + 15). 
  You will have to reboot the system after making changes to the /etc/system file. 

  Solaris doesn't automatically allocate shared memory, unless you specify the
  value in /etc/system and reboot.

  Were I you, i'd put in lines in /etc/system that look something like this:
  only the first value is *really* important. It specifies the maximum amount
  of shared memory to allocate. I'd make this parameter be about 70-75% of your
  physical ram (assuming you have nothing else on this machine running besides
  Oracle ... if not, adjust down accordingly). Then this value will dictate
  your maximum SGA size as you build your database.

  set shmsys:shminfo_shmmax=4294967295 
  set shmsys:shminfo_shmmin=1
  set shmsys:shminfo_shmmni=100
  set shmsys:shminfo_shmseg=10
  set semsys:seminfo_semmsl=256
  set semsys:seminfo_semmns=1024
  set semsys:seminfo_semmni=400

  -- end remarks

 Create Unix Group �dba�

$ groupadd -g 400 dba
$ groupdel dba

 Create Unix User �oracle�

$ useradd -u 400 -c "Oracle Owner" -d /export/home/oracle \
  -g "dba" -m -s /bin/ksh oracle

 Setup ORACLE environment ($HOME/.profile) as follows

# Setup ORACLE environment

ORACLE_HOME=/opt/oracle/product/9.2.0; export ORACLE_HOME
ORACLE_SID=TYP2; export ORACLE_SID
ORACLE_TERM=xterm; export ORACLE_TERM
TNS_ADMIN=/export/home/oracle/config/9.2.0; export TNS_ADMIN
NLS_LANG=AMERICAN_AMERICA.WE8ISO8859P1; export NLS_LANG
ORA_NLS33=$ORACLE_HOME/ocommon/nls/admin/data; export ORA_NLS33
LD_LIBRARY_PATH=$ORACLE_HOME/lib:/lib:/usr/lib:/usr/openwin/lib
LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/usr/dt/lib:/usr/ucblib:/usr/local/lib
export LD_LIBRARY_PATH

# Set up the search paths:

PATH=/bin:/usr/bin:/usr/sbin:/opt/bin:/usr/ccs/bin:/opt/local/GNU/bin
PATH=$PATH:/opt/local/bin:/opt/NSCPnav/bin:$ORACLE_HOME/bin
PATH=$PATH:/usr/local/samba/bin:/usr/ucb:.
export PATH

# CLASSPATH must include the following JRE location(s):

CLASSPATH=$ORACLE_HOME/JRE:$ORACLE_HOME/jlib:$ORACLE_HOME/rdbms/jlib
CLASSPATH=$CLASSPATH:$ORACLE_HOME/network/jlib

 Install from CD-ROM ...

Usually the CD-ROM will be mounted automatically by the Solaris Volume Manager, if not, do it as follows as user root.

$ su root
$ mkdir /cdrom
$ mount -r -F hsfs /dev/.... /cdrom

exit or CTRL-D

 ... or Unpacking downloaded installation files

If you downloaded database installation files from Oracle site (901solaris_disk1.cpio.gz, 901solaris_disk2.cpio.gz and 
901solaris_disk3.cpio.gz) gunzip them somewhere and you'll get three .cpio files. The best way to download the huge files 
is to use the tool GetRight ( http://www.getright.com/ ) 

$ cd <somewhere>
$ mkdir Disk1 Disk2 Disk3
$ cd Disk1
$ gunzip 901solaris_disk1.cpio.gz
$ cat 901solaris_disk1.cpio | cpio -icd

This will extract all the files for Disk1, repeat steps for Disk2 and D3isk3. Now you should have three directories 
(Disk1, Disk2 and Disk3) containing installation files.

 Check oraInst.loc File

If you used Oracle before on your system, then you must edit the Oracle Inventory File, usually located in: 
/var/opt/oracle/oraInst.loc

inventory_loc=/opt/oracle/product/oraInventory

 Install with Installer in interactive mode

Install Oracle 9i with Oracle Installer

$ cd /Disk1
$ DISPLAY=<Any X-Window Host>:0.0
$ export DISPLAY
$ ./runInstaller


  example display:
  $ export DISPLAY=192.168.1.10:0.0


Answer the questions in the Installer, we use the following install directories

Inventory Location: /opt/oracle/product/oraInventory
Oracle Universal Installer in: /opt/oracle/product/oui
Java Runtime Environment in: /opt/oracle/product/jre/1.1.8

Edit the Database Startup Script /var/opt/oracle/oratab

TYP2:/opt/oracle/product/9.2.0:Y

 Create the Database

Edit and save the CREATE DATABASE File initTYP2.sql in $ORACLE_HOME/dbs, or create a symbolic-Link 
from $ORACLE_HOME/dbs to your Location.

$ cd $ORACLE_HOME/dbs
$ ln -s /export/home/oracle/config/9.2.0/initTYP2.ora initTYP2.ora
$ ls -l

initTYP2.ora -> /export/home/oracle/config/9.2.0/initTYP2.ora

First start the Instance, just to test your initTYP2.ora file for correct syntax and system resources.

$ cd /export/home/oracle/config/9.2.0/
$ sqlplus /nolog
SQL> connect / as sysdba
SQL> startup nomount
SQL> shutdown immediate

Now you can create the database

SQL> @initTYP2.sql
SQL> @shutdown immediate
SQL> startup

Check the Logfile: initTYP2.log

 Start Listener

$ lsnrctl start LSNRTYP2

 Automatically Start / Stop the Database

To start the Database automatically on Boot-Time, create or use our Startup Scripts dbora and lsnrora 
(included in ora_config_sol_920.tar.gz), 
which must be installed in /etc/init.d. Create symbolic Links from the Startup Directories.

lrwxrwxrwx 1 root root S99dbora -> ../init.d/dbora*
lrwxrwxrwx 1 root root S99lsnrora -> ../init.d/lsnrora*

 Install Oracle Options (optional)

You may want to install the following Options:

Oracle JVM 
Orcale XML 
Oracle Spatial 
Oracle Ultra Search 
Oracle OLAP 
Oracle Data Mining 
Example Schemas 
Run the following script install_options.sh to enable this options in the database. Before running this scripts 
adjust the initSID.ora paramaters 
as follows for the build process. After this, you can reset the paramters to smaller values.

parallel_automatic_tuning = false
shared_pool_size = 200000000
java_pool_size = 100000000

$ ./install_options.sh

 Download Scripts for Sun Solaris

These Scripts can be used as Templates. Please note, that some Parameters like ORACLE_HOME, ORACLE_SID and PATH 
must be adjusted on your own Environment. Besides this, you should check the initSID.ora Parameters 
for your Database (Size, Archivelog, ...)

 
6.2 Environment oracle user:
----------------------------

typical profile for Oracle account on most unix systems:

.profile
--------

MAIL=/usr/mail/${LOGNAME:?}
umask=022
EDITOR=vi; export EDITOR
ORACLE_BASE=/opt/app/oracle; export ORACLE_BASE
ORACLE_HOME=$ORACLE_BASE/product/9.2; export ORACLE_HOME
ORACLE_SID=OWS; export ORACLE_SID
ORACLE_TERM=xterm; export ORACLE_TERM
TNS_ADMIN=$ORACLE_HOME/network/admin; export TNS_ADMIN
NLS_LANG=AMERICAN_AMERICA.AL16UTF8; export NLS_LANG
ORA_NLS33=$ORACLE_HOME/ocommon/nls/admin/data; export ORA_NLS33
LD_LIBRARY_PATH=$ORACLE_HOME/lib:/lib:/usr/lib:/usr/openwin/lib
LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/usr/dt/lib:/usr/ucblib:/usr/local/lib
export LD_LIBRARY_PATH
PATH=.:/usr/bin:/usr/sbin:/sbin:/usr/ucb:/etc:$ORACLE_HOME/lib:/usr/oasys/bin:$ORACLE_HOME/bin:/usr/local/bin:
export PATH
PS1='$PWD >'
DISPLAY=172.17.2.128:0.0
export DISPLAY

/etc >more passwd
-----------------
root:x:0:1:Super-User:/:/sbin/sh
daemon:x:1:1::/:
bin:x:2:2::/usr/bin:
sys:x:3:3::/:
adm:x:4:4:Admin:/var/adm:
lp:x:71:8:Line Printer Admin:/usr/spool/lp:
uucp:x:5:5:uucp Admin:/usr/lib/uucp:
nuucp:x:9:9:uucp Admin:/var/spool/uucppublic:/usr/lib/uucp/uucico
smmsp:x:25:25:SendMail Message Submission Program:/:
listen:x:37:4:Network Admin:/usr/net/nls:
nobody:x:60001:60001:Nobody:/:
noaccess:x:60002:60002:No Access User:/:
nobody4:x:65534:65534:SunOS 4.x Nobody:/:
avdsel:x:1002:100:Albert van der Sel:/export/home/avdsel:/bin/ksh
oraclown:x:1001:102:Oracle owner:/export/home/oraclown:/bin/ksh
brighta:x:1005:102:Bright Alley:/export/home/brighta:/bin/ksh
customer:x:2000:102:Customer account:/export/home/customer:/usr/bin/tcsh

/etc >more group
----------------
root::0:root
other::1:
bin::2:root,bin,daemon
sys::3:root,bin,sys,adm
adm::4:root,adm,daemon
uucp::5:root,uucp
mail::6:root
tty::7:root,adm
lp::8:root,lp,adm
nuucp::9:root,nuucp
staff::10:
daemon::12:root,daemon
sysadmin::14:
smmsp::25:smmsp
nobody::60001:
noaccess::60002:
nogroup::65534:
dba::100:oraclown,brighta
oper::101:
oinstall::102:


=====================================
7. install Oracle 9i on Linux:
=====================================

====================
7.1.Article 1:
====================


The Oracle 9i Distribution can be found on Oracle Technet (http://technet.oracle.com)

The following short Guide shows how to install and configure Oracle 9.2.0 on RedHat Linux 7.2 / 8.0 You may download our 
Scripts to create a database, we suggest this way and NOT using DBASSIST. Besides these scripts, you can download our 
NET configuration files: LISTNER.ORA, TNSNAMES.ORA and SQLNET.ORA.

System Requirements
Create Unix Group �dba�
Create Unix User �oracle�
Setup Environment ($HOME/.bash_profile) as follows
Mount the Oracle 9i CD-ROM (only if you have the CD) ...
... or Unpacking downloaded installation files
Install with Installer in interactive mode
Create the Database
Create your own DB-Create Script (optional)
Start Listener
Automatically Start / Stop the Database
Setup Kernel Parameters ( if necessary )
Install Oracle Options (optional)
Download Scripts for RedHat Linux 7.2

For our installation, we used the following ORACLE_HOME AND ORACLE_SID, please adjust these parameters for 
your own environment.

ORACLE_HOME = /opt/oracle/product/9.2.0 
ORACLE_SID = VEN1 

--------------------------------------------------------------------------------
 
 System Requirements

Oracle 9i needs Kernel Version 2.4 and glibc 2.2, which is included in RedHat Linux 7.2.

Component
 Check with ...
 ... Output
 
Liunx Kernel Version 2.4
 rpm -q kernel
 kernel-2.4.7-10
 
System Libraries
 rpm -q glibc
 glibc-2.2.4-19.3
 
Proc*C/C++
 rpm -q gcc
 gcc-2.96-98
 

 Create Unix Group �dba�

$ groupadd -g 400 dba

 Create Unix User �oracle�

$ useradd -u 400 -c "Oracle Owner" -d /home/oracle \
  -g "dba" -m -s /bin/bash oracle

 Setup Environment ($HOME/.bash_profile) as follows

# Setup ORACLE environment

ORACLE_HOME=/opt/oracle/product/9.2.0; export ORACLE_HOME
ORACLE_SID=VEN1; export ORACLE_SID
ORACLE_TERM=xterm; export ORACLE_TERM
ORACLE_OWNER=oracle; export ORACLE_OWNER
TNS_ADMIN=/home/oracle/config/9.2.0; export TNS_ADMIN
NLS_LANG=AMERICAN_AMERICA.WE8ISO8859P1; export NLS_LANG
ORA_NLS33=$ORACLE_HOME/ocommon/nls/admin/data; export ORA_NLS33
CLASSPATH=$ORACLE_HOME/jdbc/lib/classes111.zip
LD_LIBRARY_PATH=$ORACLE_HOME/lib; export LD_LIBRARY_PATH

### see JSDK: export CLASSPATH

# Set up JAVA and JSDK environment:


export JAVA_HOME=/usr/local/jdk
export JSDK_HOME=/usr/local/jsdk
CLASSPATH=$CLASSPATH:$JAVA_HOME/lib:$JSDK_HOME/lib/jsdk.jar
export CLASSPATH


# Set up the search paths:

PATH=$POSTFIX/bin:$POSTFIX/sbin:$POSTFIX/sendmail
PATH=$PATH:/usr/local/jre/bin:/usr/local/jdk/bin:/bin:/sbin:/usr/bin:/usr/sbin
PATH=$PATH:/usr/local/bin:$ORACLE_HOME/bin:/usr/local/jsdk/bin
PATH=$PATH:/usr/local/sbin:/usr/bin/X11:/usr/X11R6/bin:/root/bin
PATH=$PATH:/usr/local/samba/bin
export PATH

 Mount the Oracle 9i CD-ROM (only if you have the CD) ...

Mount the CD-ROM as user root.

$ su root
$ mkdir /cdrom
$ mount -t iso9660 /dev/cdrom /cdrom
$ exit

 ... or Unpacking downloaded installation files

If you downloaded database installation files from Oracle site (Linux9i_Disk1.cpio.gz, Linux9i_Disk2.cpio.gz and 
Linux9i_Disk3.cpio.gz) gunzip them somewhere and you'll get three .cpio files. The best way to download the huge files 
is to use the tool GetRight ( http://www.getright.com/ ) 

$ cd <somewhere>
$ cpio -idmv < Linux9i_Disk1.cpio
$ cpio -idmv < Linux9i_Disk2.cpio
$ cpio -idmv < Linux9i_Disk3.cpio

Now you should have three directories (Disk1, Disk2 and Disk3) containing installation files. 

 Install with Installer in interactive mode

Install Oracle 9i with Oracle Installer

$ cd Disk1
$ DISPLAY=<Any X-Window Host>:0.0
$ export DISPLAY
$ ./runInstaller

Answer the questions in the Installer, we use the following install directories

Inventory Location: /opt/oracle/product/oraInventory
Oracle Universal Installer in: /opt/oracle/product/oui
Java Runtime Environment in: /opt/oracle/product/jre/1.1.8

Edit the Database Startup Script /etc/oratab

VEN1:/opt/oracle/product/9.2.0:Y

 Create the Database

Edit and save the CREATE DATABASE File initVEN1.sql in $ORACLE_HOME/dbs, or create a symbolic-Link from 
$ORACLE_HOME/dbs to your Location.

$ cd $ORACLE_HOME/dbs
$ ln -s /home/oracle/config/9.2.0/initVEN1.ora initVEN1.ora
$ ls -l

initVEN1.ora -> /home/oracle/config/9.2.0/initVEN1.ora

First start the Instance, just to test your initVEN1.ora file for correct syntax and system resources.

$ cd /home/oracle/config/9.2.0/
$ sqlplus /nolog
SQL> connect / as sysdba
SQL> startup nomount
SQL> shutdown immediate

Now you can create the database

SQL> @initVEN1.sql
SQL> @shutdown immediate
SQL> startup

Check the Logfile: initVEN1.log

 Create your own DB-Create Script (optional)

You can generate your own DB-Create Script using the Tool: $ORACLE_HOME/bin/dbca

 Start Listener

$ lsnrctl start LSNRVEN1

 Automatically Start / Stop the Database

To start the Database automatically on Boot-Time, create or use our Startup Scripts dbora and lsnrora (included in 
ora_config_linux_901.tar.gz), which must be installed in /etc/rc.d/init.d. Create symbolic Links from the 
Startup Directories in /etc/rc.d (e.g. /etc/rc.d/rc2.d).

lrwxrwxrwx 1 root root S99dbora -> ../init.d/dbora*
lrwxrwxrwx 1 root root S99lsnrora -> ../init.d/lsnrora*

 Setup Kernel Parameters ( if necessary )

Oracle9i uses UNIX resources such as shared memory, swap space, and semaphores extensively 
for interprocess communication. If your kernel parameter settings are insufficient for Oracle9i, 
you will experience problems during installation and instance startup. 
The greater the amount of data you can store in memory, the faster your database will operate. In addition, 
by maintaining data in memory, the UNIX kernel reduces disk I/O activity.

Use the ipcs command to obtain a list of the system�s current shared memory and semaphore segments, 
and their identification number and owner. 
You can modify the kernel parameters by using the /proc file system.

To modify kernel parameters using the /proc file system:

1. Log in as root user.

2. Change to the /proc/sys/kernel directory.

3. Review the current semaphore parameter values in the sem file using the cat or more utility

# cat sem

The output will list, in order, the values for the SEMMSL, SEMMNS, SEMOPM, and SEMMNI parameters. 
The following example shows how the output will appear.

250 32000 32 128

In the preceding example, 250 is the value of the SEMMSL parameter, 32000 is the value of the SEMMNS parameter, 32 
is the value of the SEMOPM parameter, and 128 is the value of the SEMMNI parameter.

4. Modify the parameter values using the following command:

# echo SEMMSL_value SEMMNS_value SEMOPM_value SEMMNI_value > sem

In the preceding command, all parameters must be entered in order.

5. Review the current shared memory parameters using the cat or more utility.

# cat shared_memory_parameter

In the preceding example, the shared_memory_parameter is either the SHMMAX or SHMMNI parameter. The parameter name must be 
entered in lowercase letters.

6. Modify the shared memory parameter using the echo utility. For example, to modify the SHMMAX parameter, enter the following:

# echo 2147483648 > shmmax

7. Write a script to initialize these values during system startup and include the script in your system init files. 
Refer to the following table to determine if your system shared memory and semaphore kernel parameters are set high enough for Oracle9i. 
The parameters in the following table are the minimum values required to run Oracle9i with a single database instance. 
You can put the initialization in the file /etc/rc.d/rc.local

# Setup Kernel Parameters for Oracle 9i

echo 250 32000 100 128 > /proc/sys/kernel/sem
echo 2147483648 > /proc/sys/kernel/shmmax
echo 4096 > /proc/sys/kernel/shmmni

 Install Oracle Options (optional)

You may want to install the following Options:

Oracle JVM 
Orcale XML 
Oracle Spatial 
Oracle Ultra Search 
Oracle OLAP 
Oracle Data Mining 
Example Schemas 
Run the following script install_options.sh to enable this options in the database. Before running this scripts adjust 
the initSID.ora paramaters as follows for the build process. After this, you can reset the paramters to smaller values.

parallel_automatic_tuning = false
shared_pool_size = 200000000
java_pool_size = 100000000

$ ./install_options.sh

 Download Scripts for RedHat Linux 7.2

These Scripts can be used as Templates. Please note, that some Parameters like ORACLE_HOME, ORACLE_SID and PATH must 
be adjusted on your own Environment. Besides this, you should check the initSID.ora Parameters for your Database (Size, Archivelog, ...)


====================
7.2.Article 2:
====================


Installing Oracle9i (9.2.0.5.0) on Red Hat Linux (Fedora Core 2) 

by Jeff Hunter, Sr. Database Administrator 

--------------------------------------------------------------------------------

Contents 


Overview 
Swap Space Considerations 
Configuring Shared Memory 
Configuring Semaphores 
Configuring File Handles 
Create Oracle Account and Directories 
Configuring the Oracle Environment 
Configuring Oracle User Shell Limits 
Downloading / Unpacking the Oracle9i Installation Files 
Update Red Hat Linux System - (Oracle Metalink Note: 252217.1) 
Install the Oracle 9.2.0.4.0 RDBMS Software 
Install the Oracle 9.2.0.5.0 Patchset 
Post Installation Steps 
Creating the Oracle Database 


--------------------------------------------------------------------------------

Overview 

The following article is a summary of the steps required to successfully install the Oracle9i (9.2.0.4.0) RDBMS software on Red Hat Linux Fedora Core 2. Also included in this article is a detailed overview for applying the Oracle9i (9.2.0.5.0) patchset. Keep in mind the following assumptions throughout this article: 

When installing Red Hat Linux Fedora Core 2, I install ALL components. (Everything). This makes it easier than trying to troubleshoot missing software components. 

As of March 26, 2004, Oracle includes the Oracle9i RDBMS software with the 9.2.0.4.0 patchset already included. This will save considerable time since the patchset does not have to be downloaded and installed. We will, however, be applying the 9.2.0.5.0 patchset. 

Although it is not required, it is recommend to apply the 9.2.0.5.0 patchset. 

The post installation section includes steps for configuring the Oracle Networking files, configuring the database to start and stop when the machine is cycled, and other miscellaneous tasks. 

Finally, at the end of this article, we will be creating an Oracle 9.2.0.5.0 database named ORA920 using supplied scripts. 


--------------------------------------------------------------------------------

Swap Space Considerations 

Ensure enough swap space is available. 

Installing Oracle9i requires a minimum of 512MB of memory. 
(An inadequate amount of swap during the installation will cause the Oracle Universal Installer to either "hang" or "die") 

To check the amount of memory / swap you have allocated, type either: 
# free 

- OR - 

# cat /proc/swaps 

- OR - 

# cat /proc/meminfo | grep MemTotal 


If you have less than 512MB of memory (between your RAM and SWAP), you can add temporary swap space by creating a temporary swap file. This way you do not have to use a raw device or even more drastic, rebuild your system. 
As root, make a file that will act as additional swap space, let's say about 300MB: 
# dd if=/dev/zero of=tempswap bs=1k count=300000 

Now we should change the file permissions: 
# chmod 600 tempswap 

Finally we format the "partition" as swap and add it to the swap space: 
# mke2fs tempswap
# mkswap tempswap
# swapon tempswap


--------------------------------------------------------------------------------

Configuring Shared Memory 

The Oracle RDBMS uses shared memory in UNIX to allow processes to access common data structures and data. 
These data structures and data are placed in a shared memory segment to allow processes the fastest form of 
Interprocess Communications (IPC) available. The speed is primarily a result of processes not needing to copy 
data between each other to share common data and structures - relieving the kernel from having to get involved. 
Oracle uses shared memory in UNIX to hold its Shared Global Area (SGA). This is an area of memory within 
the Oracle instance that is shared by all Oracle backup and foreground processes. It is important to size 
the SGA to efficiently hold the database buffer cache, shared pool, redo log buffer as well as other shared 
Oracle memory structures. Inadequate sizing of the SGA can have a dramatic decrease in performance of the database. 

To determine all shared memory limits you can use the ipcs command. The following example shows the values 
of my shared memory limits on a fresh RedHat Linux install using the defaults: 

# ipcs -lm

------ Shared Memory Limits --------
max number of segments = 4096
max seg size (kbytes) = 32768
max total shared memory (kbytes) = 8388608
min seg size (bytes) = 1
Let's continue this section with an overview of the parameters that are responsible for configuring the 
shared memory settings in Linux. 
SHMMAX 


The SHMMAX parameter is used to define the maximum size (in bytes) for a shared memory segment and should be set 
large enough for the largest SGA size. If the SHMMAX is set incorrectly (too low), it is possible that the 
Oracle SGA (which is held in shared segments) may be limited in size. An inadequate SHMMAX setting would result 
in the following: 
ORA-27123: unable to attach to shared memory segment
You can determine the value of SHMMAX by performing the following: 

# cat /proc/sys/kernel/shmmax
33554432
As you can see from the output above, the default value for SHMMAX is 32MB. This is often too small to configure the Oracle SGA. I generally set the SHMMAX parameter to 2GB. 
NOTE: With a 32-bit Linux operating system, the default maximum size of the SGA is 1.7GB. This is the reason I will often set the SHMMAX parameter to 2GB since it requires a larger value for SHMMAX. 
On a 32-bit Linux operating system, without Physical Address Extension (PAE), the physical memory is divided into a 3GB user space and a 1GB kernel space. It is therefore possible to create a 2.7GB SGA, but you will need make several changes at the Linux operating system level by changing the mapped base. In the case of a 2.7GB SGA, you would want to set the SHMMAX parameter to 3GB. 

Keep in mind that the maximum value of the SHMMAX parameter is 4GB. 
 

To change the value SHMMAX, you can use either of the following three methods: 

This is method I use most often. This method sets the SHMMAX on startup by inserting the following kernel parameter in the /etc/sysctl.conf startup file: 
# echo "kernel.shmmax=2147483648" >> /etc/sysctl.conf

If you wanted to dynamically alter the value of SHMMAX without rebooting the machine, you can make this change directly to the /proc file system. This command can be made permanent by putting it into the /etc/rc.local startup file: 
# echo "2147483648" > /proc/sys/kernel/shmmax

You can also use the sysctl command to change the value of SHMMAX: 
# sysctl -w kernel.shmmax=2147483648
SHMMNI 


We now look at the SHMMNI parameters. This kernel parameter is used to set the maximum number of shared memory segments system wide. The default value for this parameter is 4096. This value is sufficient and typically does not need to be changed. 
You can determine the value of SHMMNI by performing the following: 

# cat /proc/sys/kernel/shmmni
4096
SHMALL 


Finally, we look at the SHMALL shared memory kernel parameter. This parameter controls the total amount of shared memory (in pages) that can be used at one time on the system. In short, the value of this parameter should always be at least: 
ceil(SHMMAX/PAGE_SIZE)
The default size of SHMALL is 2097152 and can be queried using the following command: 
# cat /proc/sys/kernel/shmall
2097152
From the above output, the total amount of shared memory (in bytes) that can be used at one time on the system is: 
SM = (SHMALL * PAGE_SIZE)
   = 2097152 * 4096
   = 8,589,934,592 bytes
The default setting for SHMALL should be adequate for our Oracle installation. 
NOTE: The page size in Red Hat Linux on the i386 platform is 4096 bytes. You can, however, use bigpages which supports the configuration of larger memory page sizes.  


--------------------------------------------------------------------------------

Configuring Semaphores 

Now that we have configured our shared memory settings, it is time to take care of configuring our semaphores. A semaphore can be thought of as a counter that is used to control access to a shared resource. Semaphores provide low level synchronization between processes (or threads within a process) so that only one process (or thread) has access to the shared segment, thereby ensureing the integrity of that shared resource. When an application requests semaphores, it does so using "sets". 
To determine all semaphore limits, use the following: 

# ipcs -ls

------ Semaphore Limits --------
max number of arrays = 128
max semaphores per array = 250
max semaphores system wide = 32000
max ops per semop call = 32
semaphore max value = 32767
You can also use the following command: 
# cat /proc/sys/kernel/sem
250     32000   32      128
SEMMSL 


The SEMMSL kernel parameter is used to control the maximum number of semaphores per semaphore set. 
Oracle recommends setting SEMMSL to the largest PROCESS instance parameter setting in the init.ora file for all databases hosted on the Linux system plus 10. Also, Oracle recommends setting the SEMMSL to a value of no less than 100. 

SEMMNI 


The SEMMNI kernel parameter is used to control the maximum number of semaphore sets on the entire Linux system. 
Oracle recommends setting the SEMMNI to a value of no less than 100. 

SEMMNS 


The SEMMNS kernel parameter is used to control the maximum number of semaphores (not semaphore sets) on the entire Linux system. 
Oracle recommends setting the SEMMNS to the sum of the PROCESSES instance parameter setting for each database on the system, adding the largest PROCESSES twice, and then finally adding 10 for each Oracle database on the system. To summarize: 

SEMMNS =   sum of PROCESSES setting for each database on the system
         + ( 2 * [largest PROCESSES setting])
         + (10 * [number of databases on system]
To determine the maximum number of semaphores that can be allocated on a Linux system, use the following calculation. It will be the lesser of: 

SEMMNS  -or-  (SEMMSL * SEMMNI)
SEMOPM 


The SEMOPM kernel parameter is used to control the number of semaphore operations that can be performed per semop system call. 
The semop system call (function) provides the ability to do operations for multiple semaphores with one semop system call. A semaphore set can have the maximum number of SEMMSL semaphores per semaphore set and is therefore recommended to set SEMOPM equal to SEMMSL. 

Oracle recommends setting the SEMOPM to a value of no less than 100. 

Setting Semaphore Kernel Parameters 


Finally, we see how to set all semaphore parameters using several methods. In the following, the only parameter I care about changing (raising) is SEMOPM. All other default settings should be sufficient for our example installation. 
This is method I use most often. This method sets all semaphore kernel parameters on startup by inserting the following kernel parameter in the /etc/sysctl.conf startup file: 
# echo "kernel.sem=250 32000 100 128" >> /etc/sysctl.conf

If you wanted to dynamically alter the value of all semaphore kernel parameters without rebooting the machine, you can make this change directly to the /proc file system. This command can be made permanent by putting it into the /etc/rc.local startup file: 
# echo "250 32000 100 128" > /proc/sys/kernel/sem

You can also use the sysctl command to change the value of all semaphore settings: 
# sysctl -w kernel.sem="250 32000 100 128"


--------------------------------------------------------------------------------

Configuring File Handles 

When configuring our Linux database server, it is critical to ensure that the maximum number of file handles is large enough. The setting for file handles designate the number of open files that you can have on the entire Linux system. 
Use the following command to determine the maximum number of file handles for the entire system: 

# cat /proc/sys/fs/file-max
103062
Oracle recommends that the file handles for the entire system be set to at least 65536. In most cases, the default for Red Hat Linux is 103062. I have seen others (Red Hat Linux AS 2.1, Fedora Core 1, and Red Hat version 9) that will only default to 32768. If this is the case, you will want to increase this value to at least 65536. 

This is method I use most often. This method sets the maximum number of file handles (using the kernel parameter file-max) on startup by inserting the following kernel parameter in the /etc/sysctl.conf startup file: 
# echo "fs.file-max=65536" >> /etc/sysctl.conf

If you wanted to dynamically alter the value of all semaphore kernel parameters without rebooting the machine, you can make this change directly to the /proc file system. This command can be made permanent by putting it into the /etc/rc.local startup file: 
# echo "65536" > /proc/sys/fs/file-max

You can also use the sysctl command to change the maximum number of file handles: 
# sysctl -w fs.file-max=65536
NOTE: It is also possible to query the current usage of file handles using the following command: 
# cat /proc/sys/fs/file-nr
1140    0       103062
In the above example output, here is an explanation of the three values from the file-nr command: 
Total number of allocated file handles. 
Total number of file handles currently being used. 
Maximum number of file handles that can be allocated. This is essentially the value of file-max - (see above). 
 

NOTE: If you need to increase the value in /proc/sys/fs/file-max, then make sure that the ulimit is set properly. Usually for 2.4.20 it is set to unlimited. Verify the ulimit setting my issuing the ulimit command: 
# ulimit
unlimited
 

--------------------------------------------------------------------------------

Create Oracle Account and Directories 

Now let's create the Oracle UNIX account all all required directories: 
Login as the root user id. 
% su -
Create directories. 
# mkdir -p /u01/app/oracle
# mkdir -p /u03/app/oradata
# mkdir -p /u04/app/oradata
# mkdir -p /u05/app/oradata
# mkdir -p /u06/app/oradata
Create the UNIX Group for the Oracle User Id. 
# groupadd -g 115 dba
Create the UNIX User for the Oracle Software. 
# useradd -u 173 -c "Oracle Software Owner" -d /u01/app/oracle -g "dba" -m -s /bin/bash oracle
# passwd oracle
Changing password for user oracle.
New UNIX password: ************
BAD PASSWORD: it is based on a dictionary word
Retype new UNIX password: ************
passwd: all authentication tokens updated successfully.
Change ownership of all Oracle Directories to the Oracle UNIX User. 
# chown -R oracle:dba /u01
# chown -R oracle:dba /u03
# chown -R oracle:dba /u04
# chown -R oracle:dba /u05
# chown -R oracle:dba /u06
Oracle Environment Variable Settings 
NOTE: Ensure to set the environment variable: LD_ASSUME_KERNEL=2.4.1 Failing to set the LD_ASSUME_KERNEL parameter will cause 
the Oracle Universal Installer to hang!  


Verify all mount points. Please keep in mind that all of the following mount points can simply be directories if you only have one hard drive. 
For our installation, we will be using four mount points (or directories) as follows: 

/u01 : The Oracle RDBMS software will be installed to /u01/app/oracle. 

/u03 : This mount point will contain the physical Oracle files: 

Control File 1 
Online Redo Log File - Group 1 / Member 1 
Online Redo Log File - Group 2 / Member 1 
Online Redo Log File - Group 3 / Member 1 

/u04 : This mount point will contain the physical Oracle files: 

Control File 2 
Online Redo Log File - Group 1 / Member 2 
Online Redo Log File - Group 2 / Member 2 
Online Redo Log File - Group 3 / Member 2 

/u05 : This mount point will contain the physical Oracle files: 

Control File 3 
Online Redo Log File - Group 1 / Member 3 
Online Redo Log File - Group 2 / Member 3 
Online Redo Log File - Group 3 / Member 3 

/u06 : This mount point will contain the all physical Oracle data files. 

This will be one large RAID 0 stripe for all Oracle data files. 
All tablespaces including System, UNDO, Temporary, Data, and Index. 


--------------------------------------------------------------------------------

Configuring the Oracle Environment 

After configuring the Linux operating environment, it is time to setup the Oracle UNIX User ID for the installation of the Oracle RDBMS Software. 
Keep in mind that the following steps need to be performed by the oracle user id. 
Before delving into the details for configuring the Oracle User ID, I packaged an archive of shell scripts and configuration files to assist 
with the Oracle preparation and installation. You should download the archive "oracle_920_installation_files_linux.tar" as the Oracle User ID 
and place it in his HOME directory. 


Login as the oracle user id. 
% su - oracle

Unpackage the contents of the oracle_920_installation_files_linux.tar archive. After extracting the archive, you will have a new directory 
called oracle_920_installation_files_linux that contains all required files. The following set of commands descibe how to extract the file 
and where to copy/extract all required files: 
$ id
uid=173(oracle) gid=115(dba) groups=115(dba)

$ pwd
/u01/app/oracle

$ tar xvf oracle_920_installation_files_linux.tar
oracle_920_installation_files_linux/
oracle_920_installation_files_linux/admin.tar
oracle_920_installation_files_linux/common.tar
oracle_920_installation_files_linux/dbora
oracle_920_installation_files_linux/dbshut
oracle_920_installation_files_linux/.bash_profile
oracle_920_installation_files_linux/dbstart
oracle_920_installation_files_linux/ldap.ora
oracle_920_installation_files_linux/listener.ora
oracle_920_installation_files_linux/sqlnet.ora
oracle_920_installation_files_linux/tnsnames.ora
oracle_920_installation_files_linux/crontabORA920.txt

$ cp oracle_920_installation_files_linux/.bash_profile ~/.bash_profile

$ tar xvf oracle_920_installation_files_linux/admin.tar

$ tar xvf oracle_920_installation_files_linux/common.tar

$ . ~/.bash_profile
.bash_profile executed
$


--------------------------------------------------------------------------------

Configuring Oracle User Shell Limits 

Many of the Linux shells (including BASH) implement certain controls over certain critical resources like the number of file descriptors that 
can be opened and the maximum number of processes available to a user's session. In most cases, you will not need to alter any of these shell limits,
 but you find yourself getting errors when creating or maintaining the Oracle database, you may want to read through this section. 
You can use the following command to query these shell limits: 

# ulimit -a

core file size        (blocks, -c) 0
data seg size         (kbytes, -d) unlimited
file size             (blocks, -f) unlimited
max locked memory     (kbytes, -l) unlimited
max memory size       (kbytes, -m) unlimited
open files                    (-n) 1024
pipe size          (512 bytes, -p) 8
stack size            (kbytes, -s) 10240
cpu time             (seconds, -t) unlimited
max user processes            (-u) 16383
virtual memory        (kbytes, -v) unlimited
Maximum Number of Open File Descriptors for Shell Session 


Let's first talk about the maximum number of open file descriptors for a user's shell session. 
NOTE: Make sure that throughout this section, that you are logged in as the oracle user account since this is the shell account we want to test!  


Ok, you are first going to tell me, "But I've already altered my Linux environment by setting the system wide kernel parameter /proc/sys/fs/file-max". 
Yes, this is correct, but there is still a per user limit on the number of open file descriptors. This typically defaults to 1024. 
To check that, use the following command: 

% su - oracle
% ulimit -n
1024
If you wanted to change the maximum number of open file descriptors for a user's shell session, you could edit the /etc/security/limits.conf as the root account. For your Linux system, you would add the following lines: 
oracle           soft    nofile          4096
oracle           hard    nofile          101062
The first line above sets the soft limit, which is the number of files handles (or open files) that the Oracle user will have after logging in to the shell account. The hard limit defines the maximum number of file handles (or open files) are possible for the user's shell account. If the oracle user account starts to recieve error messages about running out of file handles, then number of file handles should be increased for the oracle using the user should increase the number of file handles using the hard limit setting. You can increase the value of this parameter to 101062 for the current session by using the following: 
% ulimit -n 101062
Keep in mind that the above command will only effect the current shell session. If you were to log out and log back in, the value would be set back to its default for that shell session. 
NOTE: Although you can set the soft and hard file limits higher, it is critical to understand to never set the hard limit for nofile for your shell account equal to /proc/sys/fs/file-max. If you were to do this, your shell session could use up all of the file descriptors for the entire Linux system, which means that the entire Linux system would run out of file descriptors. At this point, you would not be able to initiate any new logins since the system would not be able to open any PAM modules, which are required for login. Notice that I set my hard limit to 101062 and not 103062. In short, I am leaving 2000 spare!  


We're not totally done yet. We still need to ensure that pam_limits is configured in the /etc/pam.d/system-auth file. The steps defined below sould already be performed with a normal Red Hat Linux installation, but should still be validated! 

The PAM module will read the /etc/security/limits.conf file. You should have an entry in the /etc/pam.d/system-auth file as follows: 

session     required      /lib/security/$ISA/pam_limits.so
I typically validate that my /etc/pam.d/system-auth file has the following two entries: 
session     required      /lib/security/$ISA/pam_limits.so
session     required      /lib/security/$ISA/pam_unix.so
Finally, let's test our new settings for the maximum number of open file descriptors for the oracle shell session. Logout and log back in as the oracle user account then run the following commands. 

Let's first check all current soft shell limits: 

$ ulimit -Sa
core file size        (blocks, -c) 0
data seg size         (kbytes, -d) unlimited
file size             (blocks, -f) unlimited
max locked memory     (kbytes, -l) unlimited
max memory size       (kbytes, -m) unlimited
open files                    (-n) 4096
pipe size          (512 bytes, -p) 8
stack size            (kbytes, -s) 10240
cpu time             (seconds, -t) unlimited
max user processes            (-u) 16383
virtual memory        (kbytes, -v) unlimited
Finally, let's check all current hard shell limits: 
$ ulimit -Ha
core file size        (blocks, -c) unlimited
data seg size         (kbytes, -d) unlimited
file size             (blocks, -f) unlimited
max locked memory     (kbytes, -l) unlimited
max memory size       (kbytes, -m) unlimited
open files                    (-n) 101062
pipe size          (512 bytes, -p) 8
stack size            (kbytes, -s) unlimited
cpu time             (seconds, -t) unlimited
max user processes            (-u) 16383
virtual memory        (kbytes, -v) unlimited
The soft limit is now set to 4096 while the hard limit is now set to 101062. 
NOTE: There may be times when you cannot get access to the root user account to change the /etc/security/limits.conf file. You can set this value in the user's login script for the shell as follows: 
su - oracle
cat >> ~oracle/.bash_profile << EOF
ulimit -n 101062
EOF
 

NOTE: For this section, I used the BASH shell. The session values will not always be the same for other shells.  


Maximum Number of Processes for Shell Session 


This section is very similar to the previous section, "Maximum Number of Open File Descriptors for Shell Session" and deals with the same concept of soft limits and hard limits as well as configuring pam_limits. For most default Red Hat Linux installations, you will not need to be concerned with the maximum number of user processes as this value is generally high enough! 
NOTE: For this section, I used the BASH shell. The session values will not always be the same for other shells.  


Let's start by querying the current limit of the maximum number of processes for the oracle user: 

% su - oracle
% ulimit -u
16383
If you wanted to change the soft and hard limits for the maximum number of processes for the oracle user, (and for that matter, all users), you could edit the /etc/security/limits.conf as the root account. For your Linux system, you would add the following lines: 
oracle           soft    nproc          2047
oracle           hard    nproc          16384
NOTE: There may be times when you cannot get access to the root user account to change the /etc/security/limits.conf file. You can set this value in the user's login script for the shell as follows: 
su - oracle
cat >> ~oracle/.bash_profile << EOF
ulimit -u 16384
EOF
 

Miscellaneous Notes 


To check all current soft shell limits, enter the following command: 
$ ulimit -Sa
core file size        (blocks, -c) 0
data seg size         (kbytes, -d) unlimited
file size             (blocks, -f) unlimited
max locked memory     (kbytes, -l) unlimited
max memory size       (kbytes, -m) unlimited
open files                    (-n) 4096
pipe size          (512 bytes, -p) 8
stack size            (kbytes, -s) 10240
cpu time             (seconds, -t) unlimited
max user processes            (-u) 16383
virtual memory        (kbytes, -v) unlimited
To check maximum hard limits, enter the following command: 
$ ulimit -Ha
core file size        (blocks, -c) unlimited
data seg size         (kbytes, -d) unlimited
file size             (blocks, -f) unlimited
max locked memory     (kbytes, -l) unlimited
max memory size       (kbytes, -m) unlimited
open files                    (-n) 101062
pipe size          (512 bytes, -p) 8
stack size            (kbytes, -s) unlimited
cpu time             (seconds, -t) unlimited
max user processes            (-u) 16383
virtual memory        (kbytes, -v) unlimited
The file (blocks) value should be multiplied by 512 to obtain the maximum file size imposed by the shell. A value of unlimited is the operating system default and typically has a maximum value of 1 TB. 
NOTE: Oracle9i Release 2 (9.2.0) includes native support for files greater than 2 GB. Check your shell to determine whether it will impose a limit.  


--------------------------------------------------------------------------------

Downloading / Unpacking the Oracle9i Installation Files 


Most of the actions throughout the rest of this document should be done as the "oracle" user account unless otherwise noted. If you are not logged in as the "oracle" user account, do so now. 

Download Oracle9i from Oracle's OTN Site. 
(If you do not currently have an account with Oracle OTN, you will need to create one. This is a FREE account!) 
http://www.oracle.com/technology/software/products/oracle9i/htdocs/linuxsoft.html 


Download the following files to a temporary directory (i.e. /u01/app/oracle/orainstall: 

ship_9204_linux_disk1.cpio.gz (538,906,295 bytes) (cksum - 245082434) 
ship_9204_linux_disk2.cpio.gz (632,756,922 bytes) (cksum - 2575824107) 
ship_9204_linux_disk3.cpio.gz (296,127,243 bytes) (cksum - 96915247) 

Directions to extract the files. 

Run "gunzip <filename>" on all the files. 
% gunzip ship_9204_linux_disk1.cpio.gz
Extract the cpio archives with the command: "cpio -idmv < <filename>" 
% cpio -idmv < ship_9204_linux_disk1.cpio
NOTE: Some browsers will uncompress the files but leave the extension the same (gz) when downloading. If the above steps do not work for you, try skipping step 1 and go directly to step 2 without changing the filename. 
% cpio -idmv < ship_9204_linux_disk1.cpio.gz
 

You should now have three directories called "Disk1, Disk2 and Disk3" containing the Oracle9i Installation files: 
/Disk1
/Disk2
/Disk3


--------------------------------------------------------------------------------

Update Red Hat Linux System - (Oracle Metalink Note: 252217.1) 

The following RPMs, all of which are available on the Red Hat Fedora Core 2 CDs, will need to be updated as per the steps described in Metalink Note: 252217.1 - "Requirements for Installing Oracle 9iR2 on RHEL3". 
All of these packages will need to be installed as the root user: 

From Fedora Core 2 / Disk #1 

# cd /mnt/cdrom/Fedora/RPMS
# rpm -Uvh libpng-1.2.2-22.i386.rpm
From Fedora Core 2 / Disk #2 
# cd /mnt/cdrom/Fedora/RPMS
# rpm -Uvh gnome-libs-1.4.1.2.90-40.i386.rpm
From Fedora Core 2 / Disk #3 
# cd /mnt/cdrom/Fedora/RPMS
# rpm -Uvh compat-libstdc++-7.3-2.96.126.i386.rpm
# rpm -Uvh compat-libstdc++-devel-7.3-2.96.126.i386.rpm
# rpm -Uvh compat-db-4.1.25-2.1.i386.rpm
# rpm -Uvh compat-gcc-7.3-2.96.126.i386.rpm
# rpm -Uvh compat-gcc-c++-7.3-2.96.126.i386.rpm
# rpm -Uvh openmotif21-2.1.30-9.i386.rpm
# rpm -Uvh pdksh-5.2.14-24.i386.rpm
From Fedora Core 2 / Disk #4 
# cd /mnt/cdrom/Fedora/RPMS
# rpm -Uvh sysstat-5.0.1-2.i386.rpm
Set gcc296 and g++296 in PATH 
Put gcc296 and g++296 first in $PATH variable by creating the following symbolic links: 
# mv /usr/bin/gcc /usr/bin/gcc323
# mv /usr/bin/g++ /usr/bin/g++323
# ln -s /usr/bin/gcc296 /usr/bin/gcc
# ln -s /usr/bin/g++296 /usr/bin/g++
Check hostname 
Make sure the hostname command returns a fully qualified host name by amending the /etc/hosts file if necessary: 
# hostname
Install the 3006854 patch: 
The Oracle / Linux Patch 3006854 can be downloaded here. 
# unzip p3006854_9204_LINUX.zip
# cd 3006854
# sh rhel3_pre_install.sh


--------------------------------------------------------------------------------

Install the Oracle 9.2.0.4.0 RDBMS Software 

As the "oracle" user account: 

Set your DISPLAY variable to a valid X Windows display. 
% DISPLAY=<Any X-Windows Host>:0.0
% export DISPLAY


NOTE: If you forgot to set the DISPLAY environment variable and you get the following error: 

Xlib: connection to ":0.0" refused by server 
Xlib: Client is not authorized to connect to Server 


you will then need to execute the following command to get "runInstaller" working again:
% rm -rf /tmp/OraInstall 

If you don't do this, the Installer will hang without giving any error messages. Also make sure that "runInstaller" has stopped running in the background. If not, kill it. 


Change directory to the Oracle installation files you downloaded and extracted. Then run: runInstaller. 

$ su - oracle
$ cd orainstall/Disk1
$ ./runInstaller
Initializing Java Virtual Machine from /tmp/OraInstall2004-05-02_08-45-13PM/jre/bin/java. Please wait...
Screen Name Response 
Welcome Screen: Click "Next" 
Inventory Location: Click "OK" 
UNIX Group Name: Use "dba" 
Root Script Window: Open another window, login as the root userid, and run "/tmp/orainstRoot.sh". When the script has completed, return to the dialog from the Oracle Installer and hit Continue. 
File Locations: Leave the "Source Path" at its default setting. For the Destination name, I like to use "OraHome920". You can leave the Destination path at it's default value which should be "/u01/app/oracle/product/9.2.0". 
Available Products: Select "Oracle9i Database 9.2.0.4.0" and click "Next" 
Installation Types: Select "Enterprise Edition (2.84GB)" and click "Next" 
Database Configuration: Select "Software Only" and click "Next" 
Summary: Click "Install" 


Running root.sh script. 
When the "Link" phase is complete, you will be prompted to run the $ORACLE_HOME/root.sh script as the "root" user account. 


Shutdown any started Oracle processes 
The Oracle Universal Installer will succeed in starting some Oracle programs, in particular the Oracle HTTP Server (Apache), the Oracle Intelligent Agent, and possibly the Orcle TNS Listener. Make sure all programs are shutdown before attempting to continue in installing the Oracle 9.2.0.5.0 patchset: 


% $ORACLE_HOME/Apache/Apache/bin/apachectl stop

% agentctl stop

% lsnrctl stop


--------------------------------------------------------------------------------

Install the Oracle 9.2.0.5.0 Patchset 

Once you have completed installing of the Oracle9i (9.2.0.4.0) RDBMS software, you should now apply the 9.2.0.5.0 patchset. 
NOTE: The details and instructions for applying the 9.2.0.5.0 patchset in this article is not absolutely necessary. I provide it here simply as a convenience for those how do want to apply the latest patchset.  


The 9.2.0.5.0 patchset can be downloaded from Oracle Metalink: 

Patch Number: 3501955 
Description: ORACLE 9i DATABASE SERVER RELEASE 2 - PATCH SET 4 VERSION 9.2.0.5.0 
Product: Oracle Database Family 
Release: Oracle 9.2.0.5 
Select a Platform or Language: Linux x86 
Last Updated: 26-MAR-2004 
Size: 313M (328923077 bytes) 


Use the following steps to install the Oracle10g Universal Installer and then the Oracle 9.2.0.5.0 patchset. 


To start, let's unpack the Oracle 9.2.0.5.0 to a temporary directory: 
% cd orapatch
% unzip p3501955_9205_LINUX.zip
% cpio -idmv < 9205_lnx32_release.cpio

Next, we need to install the Oracle10g Universal Installer into the same $ORACLE_HOME we used to install the Oracle9i RDBMS software. 
NOTE: Using the old Universal Installer that was used to install the Oracle9i RDBMS software, (OUI release 2.2), cannot be used to install the 9.2.0.5.0 patchset and higher!  


Starting with the Oracle 9.2.0.5.0 patchset, Oracle requires the use of the Oracle10g Universal Installer to apply the 9.2.0.5.0 patchset and to perform all subsequent maintenance operations on the Oracle software $ORACLE_HOME. 

Let's get this thing started by installing the Oracle10g Universal Installer. This must be done by running the runInstaller that is included with the 9.2.0.5.0 patchset we extracted in the above step: 

% cd orapatch/Disk1
% ./runInstaller -ignoreSysPrereqs
Starting Oracle Universal Installer...

Checking installer requirements...

Checking operating system version: must be redhat-2.1, UnitedLinux-1.0, redhat-3, SuSE-7 or SuSE-8
                                      Failed <<<<


>>> Ignoring required pre-requisite failures. Continuing...

Preparing to launch Oracle Universal Installer from /tmp/OraInstall2004-08-30_07-48-15PM. Please wait ...
Oracle Universal Installer, Version
 10.1.0.2.0 Production
Copyright (C) 1999, 2004, Oracle. All rights reserved.
Use the following options in the Oracle Universal Installer to install the Oracle10g OUI: 
Screen Name Response 
Welcome Screen: Click "Next" 
File Locations: The "Source Path" should be pointing to the products.xml file by default. 
For the Destination name, choose the same one you created when installing the Oracle9i software. The name we used in this article was "OraHome920" and the destination path should be "/u01/app/oracle/product/9.2.0".
 
Select a Product to Install: Select "Oracle Universal Installer 10.1.0.2.0" and click "Next" 
Summary: Click "Install" 


Exit from the Oracle Universal Installer. 

Correct the runInstaller symbolic link bug. (Bug 3560961) 
After the installation of Oracle10g Universal Installer, there is a bug that does NOT update the $ORACLE_HOME/bin/runInstaller symbolic link to point to the new 10g installation location. Since the symbolic link does not get updated, the runInstaller command still points to the old installer (2.2) and will be run instead of the new 10g installer. 

To correct this, you will need to manually update the $ORACLE_HOME/bin/runInstaller symbolic link: 

% cd $ORACLE_HOME/bin
% ln -s -f $ORACLE_HOME/oui/bin/runInstaller.sh runInstaller

We now install the Oracle 9.2.0.5.0 patchset by executing the newly installed 10g Universal Installer: 
% cd
% runInstaller -ignoreSysPrereqs
Starting Oracle Universal Installer...

Checking installer requirements...

Checking operating system version: must be redhat-2.1, UnitedLinux-1.0, redhat-3, SuSE-7 or SuSE-8
                                      Failed <<<<


>>> Ignoring required pre-requisite failures. Continuing...

Preparing to launch Oracle Universal Installer from /tmp/OraInstall2004-08-30_07-59-30PM. Please wait ...
Oracle Universal Installer, Version
 10.1.0.2.0 Production
Copyright (C) 1999, 2004, Oracle. All rights reserved.
Here is an overview of the selections I made while performing the 9.2.0.5.0 patchset install: 
Screen Name Response 
Welcome Screen: Click "Next" 
File Locations: The "Source Path" should be pointing to the products.xml file by default. 
For the Destination name, choose the same one you created when installing the Oracle9i software. The name we used in this article was "OraHome920" and the destination path should be "/u01/app/oracle/product/9.2.0".
 
Select a Product to Install: Select "Oracle 9iR2 Patchsets 9.2.0.5.0" and click "Next" 
Summary: Click "Install" 


Running root.sh script. 
When the Link phase is complete, you will be prompted to run the $ORACLE_HOME/root.sh script as the "root" user account. Go ahead and run the root.sh script. 


Exit Universal Installer 
Exit from the Universal Installer and continue on to the Post Installation section of this article. 


--------------------------------------------------------------------------------

Post Installation Steps 

After applying the Oracle 9.2.0.5.0 patchset, we should perform several miscellaneous tasks like configuring the Oracle Networking files and setting up startup and shutdown scripts for then the machine is cycled. 
Configuring Oracle Networking Files: 
I already included sample configuration files (contained in the oracle_920_installation_files_linux.tar file) that can be simply copied to their proper location and started. Change to the oracle HOME directory and copy the files as follows: 

% cd
% cd oracle_920_installation_files_linux
% cp ldap.ora $ORACLE_HOME/network/admin/
% cp tnsnames.ora $ORACLE_HOME/network/admin/
% cp sqlnet.ora $ORACLE_HOME/network/admin/
% cp listener.ora $ORACLE_HOME/network/admin/

% cd
% lsnrctl start

Update /etc/oratab: 
The dbora script (below) relies on an entry in the /etc/oratab. Perform the following actions as the oracle user account: 

% echo "ORA920:/u01/app/oracle/product/9.2.0:Y" >> /etc/oratab

Configuring Startup / Shutdown Scripts: 
Also included in the oracle_920_installation_files_linux.tar file is a script called dbora. This script can be used by the init process to startup and shutdown the database when the machine is cycled. The following tasks will need to be performed by the root user account: 

% su -
# cp /u01/app/oracle/oracle_920_installation_files_linux/dbora /etc/init.d

# chmod 755 /etc/init.d/dbora

# ln -s /etc/init.d/dbora /etc/rc3.d/S99dbora
# ln -s /etc/init.d/dbora /etc/rc4.d/S99dbora
# ln -s /etc/init.d/dbora /etc/rc5.d/S99dbora
# ln -s /etc/init.d/dbora /etc/rc0.d/K10dbora
# ln -s /etc/init.d/dbora /etc/rc6.d/K10dbora


--------------------------------------------------------------------------------

Creating the Oracle Database 

Finally, let's create an Oracle9i database. This can be done using scripts that I already included with the oracle_920_installation_files_linux.tar download. The scripts are included in the ~oracle/admin/ORA920/create directory. To create the database, perform the following steps: 
% su - oracle
% cd admin/ORA920/create
% ./RUN_CRDB.sh
After starting the RUN_CRDB.sh, there will be no screen activity until the database creation is complete. You can, however, bring up a new console window to the Linux databse server as the oracle user account, navigate to the same directory you started the database creation from, and tail the crdb.log log file. 
$ telnet linux3
...
Fedora Core release 2 (Tettnang)
Kernel 2.6.5-1.358 on an i686
login: oracle
Password: xxxxxx
.bash_profile executed
[oracle@linux3 oracle]$ cd admin/ORA920/create
[oracle@linux3 create]$ tail -f crdb.log


=====================================
8. Install Oracle 9.2.0.2 on OpenVMS:
=====================================

VMS:
====

Using OUI to install Oracle9i Release 2 on an OpenVMS System

We have a PC running Xcursion and a 16 Processor GS1280 with the 2 built-in disks
In the examples we booted on disk DKA0:
Oracle account is on disk DKA100. Oracle and the database will be installed on DKA100.
Install disk MUST be ODS-5.

Installation uses the 9.2 downloaded from the Oracle website. It comes in a Java JAR file.
Oracle ships a JRE with its product. However, you will have to install Java on OpenVMS so you can unpack 
the 9.2 JAR file that comes from the Oracle website
Unpack the JAR file as described on the Oracle website. This will create two .BCK files.

Follow the instructions in the VMS_9202_README.txt file on how to restore the 2 backup save sets.

When the two backup save sets files are restored, you should end up with two directories:

[disk1] directory 
[disk2] directory

These directories will be in the root of a disk. In this example they are in the root of DKA100.
The OUI requires X-Windows. If the Alpha system you are using does not have a graphic head, 
use a PC with an X-Windows terminal such as Xcursion.

During this install we discovered a problem:
Instructions tell you to run 

@DKA100:[disk1]runinstaller.

This will not work because the RUNINSTALLER.COM file is not in the root of DKA100:[disk1]. 
You must first copy RUNINSTALLER.COM from the dka100:[disk1.000000] directory into dka100:[disk1]:

$ Copy dka100:[disk1.000000]runinstaller.com dka100:[disk1]

From a terminal window execute:

@DKA100:[disk1]runinstaller

- Oracle Installer starts
  Start the installation
  Click Next to start the installation.

- Assign name and directory structure for the Oracle Home ORACLE_HOME

  Assign a name for your Oracle home.
  Assign the directory structure for the home, for example

  Ora_home
  Dka100:[oracle.oracle9]

  This is where the OUI will install Oracle.
  The OUI will create the directories as necessary

- Select product to install
  Select Database.
  Click Next.
- Select type of installation
  Select Enterprise Edition (or Standard Edition or Custom).
  Click Next.
- Enable RAC
  Select No.
  Click Next.
- Database summary
  View list of products that will be installed.
  Click Install.
- Installation begins
  Installation takes from 45 minutes to an hour.
  Installation ends
  Click Exit.

Oracle is now installed in DKA100:[oracle.oracle9]. 
To create the first database, you must first set up Oracle logicals. 
To do this use a terminal and execute 

@[.oracle9]orauser .

The tool to create and manage databases is DBCA.
On the terminal, type DBCA to launch the Database Assistant.
Welcome to Database Configuration Assistant
DBCA starts.
Click Next.
Select an operation
Select Create a Database.
Click Next.
Select a template
Select New Database.
Click Next.
Enter database name and SID
Enter the name of the database and Oracle System Identifier (SID):
In this example, the database name is DB9I.
The SID is DB9I1.
Click Next.
Select database features
Select which demo databases are installed.
In the example, we selected all possible databases.
Click Next.
Select default node
Select the node in which you want your database to operate by default.
In the example, we selected Shared Server Mode.
Click Next.
Select memory
In the example, we selected the default.
Click Next.
Specify database storage parameters
Select the device and directory.
Use the UNIX device syntax I.E.
For example, DKA100:[oracle.oracle9.database] would be:

	/DKA100/oracle/oracle9/database/

In the example, we kept the default settings.
Click Next.

Select database creation options
Creating a template saves time when creating a database.
Click Finish.
Create a template
Click OK.
Creating and starting Oracle Instance
The database builds.
If it completes successfully, click Exit.
If it does not complete successfully, build it again.
Running the database
Enter �show system� to see the Oracle database up and running.
Set up some files to start and stop the database.
Example of a start file
This command sets the logicals to manage the database:

$ @dka100:[oracle.oracle9]orauser db9i1

The next line starts the Listener (needed for client connects).
The final lines start the database.
Stop database example
Example of how to stop the database.
Test database server
Use the Enterprise Manager console to test the database server.
Oracle Enterprise Manager
Enter address of server and SID.
Name the server.
Click OK.
Databases connect information
Select database.
Enter system account and password.
Change connection box to �AS SYSDBA.�
Click OK.
Open database
Database is opened and exposed.
Listener
Listener automatically picks up the SID from the database.
Start Listener before database and the SID will display in the Listener.
If you start the database before the Listener, the SID may not appear immediately.
To see if the SID is registered in the Listener, enter:

$lsnrctl stat

Alter a user
User is altered:

SQL> alter user oe identified by oe account unlock;
SQL> exit

Preferred method is to use the Enterprise Manager Console.


==================================================
9. Installation of Oracle 9i on AIX and other UNIX
==================================================

AIX:
====

9.1 Installation of Oracle 9i on AIX

 
Doc ID: 	Note:201019.1	Content Type: 	TEXT/PLAIN	   
Subject: 	AIX: Quick Start Guide - 9.2.0 RDBMS Installation	Creation Date: 	25-JUN-2002	   
Type: 	REFERENCE	Last Revision Date: 	14-APR-2004	   
Status: 	PUBLISHED		 
Quick Start Guide 
Oracle9i Release 2 (9.2.0) RDBMS Installation 
AIX Operating System 
 
 
Purpose 
======= 
 
This document is designed to be a quick reference that can be used when 
installing Oracle9i Release 2 (9.2.0) on an AIX platform.  It is NOT designed 
to replace the Installation Guide or other documentation.  A familiarity 
with the AIX Operating System is assumed.  If more detailed information is 
needed, please see the Appendix at the bottom of this document for additional 
resources. 
 
Each step should be done in the order that it is listed.  These steps are the 
bare minimum that is necessary for a typical install of the Oracle9i RDBMS. 
 
 
Verify OS version is certified with the RDBMS version 
====================================================== 
 
The following steps are required to verify your version of the AIX operating 
system is certified with the version of the RDBMS (Oracle9i Release 2 (9.2.0)): 
 
  1. Point your web browser to http://metalink.oracle.com. 
  2. Click the "Certify & Availability" button near the left. 
  3. Click the "Certifications" button near the top middle. 
  4. Click the "View Certifications by Platform" link. 
  5. Select "IBM RS/6000 AIX" and click "Submit". 
  6. Select Product Group "Oracle Server" and click "Submit". 
  7. Select Product "Oracle Server - Enterprise Edition" and click "Submit". 
  8. Read any general notes at the top of the page. 
  9. Select "9.2 (9i) 64-bit" and click "Submit". 
 
The "Status" column displays the certification status.  The links in the 
"Addt'l Info" and "Install Issue" columns may contain additional information 
relevant to a given version.  Note that if patches are listed under one of 
these links, your installation is not considered certified unless you apply 
them.  The "Addt'l Info" link also contains information about available 
patchsets.  Installation of patchsets is not required to be considered 
certified, but they are highly recommended. 
 
 
Pre-Installation Steps for the System Administrator 
==================================================== 
 
The following steps are required to verify your operating system meets minimum 
requirements for installation, and should be performed by the root user.  For 
assistance with system administration issues, please contact your system 
administator or operating system vendor. 
 
Use these steps to manually check the operating system requirements before 
attempting to install Oracle RDBMS software, or you may choose to use the 
convenient "Unix InstallPrep script" which automates these checks for you.  For 
more information about the script, including download information, please 
review the following article: 
 
   Note:189256.1   UNIX: Script to Verify Installation Requirements for 
                   Oracle 9.x version of RDBMS 
 
The InstallPrep script currently does not check requirements for AIX5L systems. 
 
 
The Following Steps Need to be Performed by the Root User: 
 
  1. Configure Operating System Resources: 
 
     Ensure that the system has at least the following resources: 
 
       ? 400 MB in /tmp * 
       ? 256 MB of physical RAM memory 
       ? Two times the amount of physical RAM memory for Swap/Paging space 
         (On systems with more than 2 GB of physical RAM memory, the 
         requirements for Swap/Paging space can be lowered, but Swap/Paging 
         space should never be less than physical RAM memory.) 
 
       * You may also redirect /tmp by setting the TEMP environment variable.  
         This is only recommended in rare circumstances where /tmp cannot be 
         expanded to meet free space requirements. 
 
 
  2. Create an Oracle Software Owner and Group: 
 
     Create an AIX user and group that will own the Oracle software. 
     (user = oracle, group = dba) 
 
       ? Use the "smit security" command to create a new group and user 
 
     Please ensure that the user and group you use are defined in the local 
     /etc/passwd (user) and /etc/group (group) files rather than resolved via 
     a network service such as NIS. 
 
  3. Create a Software Mount Point and Datafile Mount Points: 
 
     Create a mount point for the Oracle software installation. 
     (at least 3.5 GB, typically /u01) 
 
     Create a second, third, and fourth mount point for the database files. 
     (typically /u02, /u03, and /u04)  Use of multiple mount points is not 
     required, but is highly recommended for best performance and ease of 
     recoverability. 
 
  4. Ensure that Asynchronous Input Output (AIO) is "Available": 
 
     Use the following command to check the current AIO status: 
 
       # lsdev -Cc aio 
 
     Verify that the status shown is "Available".  If the status shown is 
     "Defined", then change the "STATE to be configured at system restart" 
     to "Available" after running the following command: 
 
       # smit chaio 
 
  5. Ensure that the math library is installed on your system: 
 
     Use the following command to determine if the math library is installed: 
 
       # lslpp -l bos.adt.libm 
 
     If this fileset is not installed and "COMMITTED", then you must install 
     it from the AIX operating system CD-ROM from IBM.  With the correct 
     CD-ROM mounted, run the following command to begin the process to load 
     the required bos.adt.libm fileset: 
 
       # smit install_latest 
 
     AIX5L systems also require the following filesets: 
 
       # lslpp -l bos.perf.perfstat 
       # lslpp -l bos.perf.libperfstat 
 
  6. Download and install JDK 1.3.1 from IBM.  At the time this article was 
     created, the JDK could be downloaded from the following URL: 
 
     http://www.ibm.com/developerworks/java/jdk/aix/index.html 
 
     Please contact IBM Support if you need assistance downloading or 
     installing the JDK. 
 
  7. Mount the Oracle CD-ROM: 
 
     Mount the Oracle9i Release 2 (9.2.0) CD-ROM using the command: 
 
       # mount -rv cdrfs /dev/cd0 /cdrom 
 
  8. Run the rootpre.sh script: 
 
     NOTE: You must shutdown ALL Oracle database instances (if any) before 
           running the rootpre.sh script.  Do not run the rootpre.sh script 
           if you have a newer version of an Oracle database already installed 
           on this system. 
 
     Use the following command to run the rootpre.sh script: 
 
       # /cdrom/rootpre.sh 
 
 
Installation Steps for the Oracle User 
======================================= 
 
The Following Steps Need to be Performed by the Oracle User: 
 
  1. Set Environment Variables 
 
     Environment variables should be set in the login script for the oracle 
     user.  If the oracle user's default shell is the C-shell (/usr/bin/csh), 
     then the login script will be named ".login".  If the oracle user's 
     default shell is the Bourne-shell (/usr/bin/bsh) or the Korn-shell 
     (/usr/bin/sh or /usr/bin/ksh), then the login script will be named 
     ".profile".  In either case, the login script will be located in the 
     oracle user's home directory ($HOME). 
 
     The examples below assume that your software mount point is /u01. 
 
       Parameter       Value 
       -----------     ----------------------------- 
 
       ORACLE_HOME     /u01/app/oracle/product/9.2.0 
 
       PATH            /u01/app/oracle/product/9.2.0/bin:/usr/ccs/bin: 
                       /usr/bin/X11: 
                       (followed by any other directories you wish to include) 
 
       ORACLE_SID      Set this to what you will call your database instance. 
                       (typically 4 characters in length) 
 
       DISPLAY         <ip-address>:0.0 
                       (review Note:153960.1 for detailed information) 
 
  2. Set the umask: 
 
     Set the oracle user's umask to "022" in you ".profile" or ".login" file. 
 
     Example: 
 
       umask 022 
 
  3. Verify the Environment 
 
     Log off and log on as the oracle user to ensure all environment variables 
     are set correctly.  Use the following command to view them: 
 
       % env | more 
 
     Before attempting to run the Oracle Universal Installer (OUI), verify that 
     you can successfully run the following command: 
 
       % /usr/bin/X11/xclock 
 
     If this does not display a clock on your display screen, please review the 
     following article: 
 
       Note:153960.1  FAQ: X Server testing and troubleshooting 
 
  4. Start the Oracle Universal Installer and install the RDBMS software: 
 
     Use the following commands to start the installer: 
 
       % cd /tmp 
       % /cdrom/runInstaller 
 
     Respond to the installer prompts as shown below: 
 
     ? When prompted for whether rootpre.sh has been run by root, enter "y". 
       This should have been done in Pre-Installation step 8 above. 
 
     ? At the "Welcome Screen", click Next. 
 
     ? If prompted, enter the directory to use for the "Inventory Location". 
       This can be any directory, but is usually not under ORACLE_HOME because 
       the oraInventory is shared with all Oracle products on the system. 
 
     ? If prompted, enter the "UNIX Group Name" for the oracle user (dba). 
 
     ? At the "File Locations Screen", verify the Destination listed is your 
       ORACLE_HOME directory.  Also enter a NAME to identify this ORACLE_HOME. 
       The NAME can be anything, but is typically "DataServer" and the first 
       three digits of the version.  For example: "DataServer920" 
 
     ? At the "Available Products Screen", choose Oracle9i Database, then click 
       Next. 
 
     ? At the "Installation Types Screen", choose Enterprise Edition, then 
       click Next. 
 
     ? If prompted, click Next at the "Component Locations Screen" to accept 
       the default directories. 
 
     ? At the "Database Configuration Screen", choose the the configuration 
       based on how you plan to use the database, then click Next. 
 
     ? If prompted, click Next at the "Privileged Operating System Groups 
       Screen" to accept the default values (your current OS primary group). 
 
     ? If prompted, enter the Global Database Name in the format 
       "ORACLE_SID.hostname" at the "Database Identification Screen". 
       For example: "TEST.AIXhost".  The SID entry should be filled in with 
       the value of ORACLE_SID.  Click Next. 
 
     ? If prompted, enter the directory where you would like to put datafiles 
       at the "Database File Location Screen".  Click Next. 
 
     ? If prompted, select "Use the default character set" (WE8ISO8859P1) at 
       the "Database Character Set Screen".  Click Next. 
 
     ? At the "Choose JDK Home Directory", enter the directory where you have 
       previously installed the JDK 1.3.1 from IBM.  This should have been 
       done in Pre-Installation step 6 above. 
 
     ? At the "Summary Screen", review your choices, then click Install. 
 
     The install will begin.  Follow instructions regarding running "root.sh" 
     and any other prompts.  When completed, the install will have created a 
     default database, configured a Listener, and started both for you. 
 
     Note: If you are having problems changing CD-ROMs when prompted to do so, 
           please review the following article: 
 
           Note:146566.1  How to Unmount / Eject First Cdrom 
 
 
Your Oracle9i Release 2 (9.2.0) RDBMS installation is now complete and ready 
for use. 
 
 
Appendix A 
========== 
 
Documentation is available from the following resources: 
 
 
Oracle9i Release 2 (9.2.0) CD-ROM Disk1 
---------------------------------------- 
 
Mount the CD-ROM, then use a web browser to open the file "index.htm" located 
at the top level directory of the CD-ROM.  On this CD-ROM you will find the 
Installation Guide, Administrator's Reference, and other useful documentation. 
 
 
Oracle Documentation Center 
--------------------------- 
 
Point your web browser to the following URL: 
 
   http://otn.oracle.com/documentation/content.html 
 
Select the highest version CD-pack displayed to ensure you get the most 
up-to-date information. 


Unattended install:
-------------------

Note 1:
-------

This note describes how to start the unattended install of patch 9.2.0.5 on AIX 5L, which can be applied
to 9.2.0.2, 9.2.0.3, 9.2.0.4

Shut down the existing Oracle server instance with normal or immediate priority. For example, 
shutdown all instances (cleanly) if running Parallel Server. 

Stop all listener, agent and other processes running in or against the ORACLE_HOME that will have 
the patch set installation. Run slibclean (/usr/sbin/slibclean) as root to remove ant currently unused 
modules in kernel and library memory. 

To perform a silent installation requiring no user intervention: 

Copy the response file template provided in the response directory where you unpacked 
the patch set tar file. 

Edit the values for all fields labeled as <Value Required> according to the comments and 
examples in the template. 

Start the Oracle Universal Installer from the directory described in Step 4 which applies to your situation. 
You should pass the full path of the response file template you have edited locally as the last argument 
with your own value of ORACLE_HOME and FROM_LOCATION. The following is an example of the command: 

% ./runInstaller -silent -responseFile full_path_to_your_response_file

Run the $ORACLE_HOME/root.sh script from a root session. If you are applying the patch set 
in a cluster database environment, the root.sh script should be run in the same way on both the local node 
and all participating nodes. 

Note 2:
-------

In order to make an unattended install of 9.2.0.1 on Win2K:

Running Oracle Universal Installer and Specifying a Response File
To run Oracle Universal Installer and specify the response file: 

Go to the MS-DOS command prompt. 

Go to the directory where Oracle Universal Installer is installed. 

Run the appropriate response file. For example, 

C:\program files\oracle\oui\install> setup.exe -silent -nowelcome -responseFile filename 

Where... Description 
filename
 Identifies the full path of the specific response file
 
-silent
 Runs Oracle Universal Installer in complete silent mode. The Welcome window is suppressed automatically. 
 This parameter is optional. If you use -silent, -nowelcome is not necessary.
 
-nowelcome
 Suppresses the Welcome window that appears during installation. This parameter is optional.


Note 3:
-------

Unattended install of 9.2.0.5 on Win2K:

To perform a silent installation requiring no user intervention: 

Make a copy of the response file template provided in the response directory where you unzipped 
the patch set file. 
Edit the values for all fields labeled as <Value Required> according to the comments and examples 
in the template. 

Start Oracle Universal Installer release 10.1.0.2 located in the unzipped area of the patch set. 
For example, Disk1\setup.exe. You should pass the full path of the response file template you have edited 
locally as the last argument with your own value of ORACLE_HOME and FROM_LOCATION. The syntax is as follows: 

setup.exe -silent -responseFile ORACLE_BASE\ORACLE_HOME\response_file_path


===============================
9.2 Oracle and UNIX and other OS:
===============================

You have the following options for creating your new Oracle database:

- Use the Database Configuration Assistant (DBCA). 

DBCA can be launched by the Oracle Universal Installer, depending upon the type of install that you select, 
and provides a graphical user interface (GUI) that guides you through the creation of a database. 
You can chose not to use DBCA, or you can launch it as a standalone tool at any time in the future to create a database. 

Run DCBA as

% dbca

- Create the database manually from a script. 

If you already have existing scripts for creating your database, you can still create your database manually. 
However, consider editing your existing script to take advantage of new Oracle features. Oracle provides a sample database 
creation script and a sample initialization parameter file with the database software files it distributes, 
both of which can be edited to suit your needs. 

- Upgrade an existing database. 

In all cases, the Oracle software needs to be installed on your host machine.


9.1.1 Operating system dependencies:
------------------------------------

First, determine for this version of Oracle, what OS settings
must be made, and if any patches must be installed.

For example, on Linux, glibc 2.1.3 is needed with Oracle version 8.1.7.
Linux could be quite critical with respect to libraries in combination
with Oracle.

Ook moet er mogelijk shmmax (max size of shared memory segment)
en dergelijke kernel parameters worden aangepast.  

# sysctl -w kernel.shmmax=100000000
# echo "kernel.shmmax = 100000000" >> /etc/sysctl.conf


   Opmerking: Het onderstaANDe is algemeen, maar is ook afgeleid van een Oracle 8.1.7
   installatie op Linux Redhat 6.2

   Als de 8.1.7 installatie gedaan wordt is ook nog de Java JDK 1.1.8 nodig.
   Deze kan gedownload worden van www.blackdown.org

   Download jdk-1.1.8_v3   jdk118_v3-glibc-2.1.3.tar.bz2 in /usr/local
   tar xvif jdk118_v3-glibc-2.1.3.tar.bz2
   ln -s /usr/local/jdk118_v3 /usr/local/java


9.1.2 Environment variables:
----------------------------

Make sure you have the following environment variables set:

ON UNIX:
========

Example 1:
----------

ORACLE_BASE=/u01/app/oracle; export ORACLE_BASE			(root voor oracle software)
ORACLE_HOME=$ORACLE_BASE/product/8.1.5; export ORACLE_HOME      (bepaald de directory waarin de instance software zich bevind)
ORACLE_SID=brdb; export ORACLE_SID                              (bepaald de naam van de huidige instance)
ORACLE_TERM=xterm, vt100, ansi of wat ANDers; export ORACLE_TERM
ORA_NLSxx=$ORACLE_HOME/ocommon/nls/admin/data; export ORA_NLS   (bepaald de nls directory t.b.v. datafiles voor meerdere talen)
NLS_LANG="Dutch_The NetherlANDs.WE8ISO8859P1"; export NLS_LANG  (Dit specificeert de language, territory en characterset 
                                                                 t.b.v de client applicaties.
LD_LIBRARY_PATH=/u01/app/oracle/product/8.1.7/lib; export LD_LIBRARY_PATH
PATH=$ORACLE_HOME/bin:/bin:/user/bin:/usr/sbin:/bin; export PATH

plaats deze variabelen in de oracle user profile file:
.profile, of .bash_profile etc..


Example 2:
----------

/dbs01	-	-	-	-	-	Db directory 1
/dbs01	/app	-	-	-	-	Constante
/dbs01	/app	/oracle	-	-		$ORACLE_BASE	Oracle base directory
/dbs01	/app	/oracle	/admin	-		$ORACLE_ADMIN	Oracle admin directory
/dbs01	/app	/oracle	/product	-	-		Constante
/dbs01	/app	/oracle	/product	/817	$ORACLE_HOME	Oracle home directory


# LISTENER.ORA Network Configuration File: 
/dbs01/app/oracle/product/817/network/admin/listener.ora

# TNSNAMES.ORA Network Configuration File: 
/dbs01/app/oracle/product/817/network/admin/tnsnames.ora

Example 3:
----------

/dbs01/app/orace	Oracle software
/dbs02/oradata          database files
/dbs03/oradata          database files
..
..
/var/opt/oracle	        network files
/opt/oracle/admin/bin 

Example 4:
----------

Mountpunt	Device	Omvang 		(Mbyte)	Doel
/		/dev/md/dsk/d1     	100	Unix Root-filesysteem
/usr		/dev/md/dsk/d3		1200	Unix usr-filesysteem
/var		/dev/md/dsk/d4		200	Unix var-filesysteem
/home		/dev/md/dsk/d5		200	Unix opt-filesysteem
/opt		/dev/md/dsk/d6		4700	Oracle_Home
/u01		/dev/md/dsk/d7		8700	Oracle datafiles	
/u02		/dev/md/dsk/d8		8700	Oracle datafiles	
/u03		/dev/md/dsk/d9		8700	Oracle datafiles	
/u04		/dev/md/dsk/d10		8700	Oracle datafiles	
/u05		/dev/md/dsk/d110	8700	Oracle datafiles	
/u06		/dev/md/dsk/d120	8700	Oracle datafiles	
/u07		/dev/md/dsk/d123	8650 	Oracle datafiles

Example 5:
----------

initBENE.ora	/opt/oracle/product/8.0.6/dbs
tnsnames.ora	/opt/oracle/product/8.0.6/network/admin
listener.ora	/opt/oracle/product/8.0.6/network/admin
alert log	/var/opt/oracle/bene/bdump
oratab	        /var/opt/oracle

Example 6:
----------

ORACLE_BASE   /u01/app/oracle  
ORACLE_HOME   $ORACLE_BASE/product/10.1.0/db_1  
ORACLE_PATH   /u01/app/oracle/product/10.1.0/db_1/bin:. 
Note: The period adds the current working directory to the search path. 
 
ORACLE_SID   SAL1    
ORAENV_ASK   NO  
SQLPATH      /home:/home/oracle:/u01/oracle  
TNS_ADMIN    $ORACLE_HOME/network/admin  
TWO_TASK  
Function  Specifies the default connect identifier to use in the connect string. If this environment variable is set, 
you do not need to specify the connect identifier in the connect string. For example, if the TWO_TASK environment variable 
is set to sales, you can connect to a database using the CONNECT username/password command 
rather than the CONNECT username/password@sales command.  
Syntax  Any connect identifier.  
Example  PRODDB_TCP  

to identify the SID and Oracle home directory for the instance that you want to shut down, enter the following command: 

Solaris: 

$ cat /var/opt/oracle/oratab

Other operating systems: 

$ cat /etc/oratab


ON NT/2000:
===========

SET ORACLE_BASE=G:\ORACLE
SET ORACLE_HOME=G:\ORACLE\ORA81
SET ORACLE_SID=AIRM
SET ORA_NLSxxx=G:\ORACLE\ORA81\ocommon\nls\admin\data
SET NLS_LANG=AMERICAN_AMERICA.WE8ISO8859P1


ON OpenVMS:
===========

When Oracle is installed on VMS, a root directory is chosen which is pointed 
to by the logical name ORA_ROOT. This directory can be placed anywhere on the VMS system.
 The majority of code, configuration files and command procedures are found below this root directory. 

When a new database is created a new directory is created in the root directory 
to store database specific configuration files. This directory is called [.DB_dbname]. 
This directory will normally hold the system tablespace data file as well 
as the database specific startup, shutdown and orauser files. 

The Oracle environment for a VMS user is set up by running the appropriate 
ORAUSER_dbname.COM file. This sets up the necessary command symbols and logical names 
to access the various ORACLE utilities. 
Each database created on a VMS system will have an ORAUSER file in it's home directory 
and will be named ORAUSER_dbname.COM, e.g. for a database SALES the file specification could be: 

ORA_ROOT:[DB_SALES]ORAUSER_SALES.COM

To have the environment set up automatically on login, run this command file in your login.com file. 
To access SQLPLUS use the following command with a valid username and password: 

$ SQLPLUS username/password

SQLDBA is also available on VMS and can be invoked similarly: 
$ SQLDBA username/password


9.1.3 OFA directory structuur:
------------------------------

Hou je aan OFA. Een voorbeeld voor database PROD:

/opt/oracle/product/8.1.6
/opt/oracle/product/8.1.6/admin/PROD

/opt/oracle/product/8.1.6/admin/pfile
/opt/oracle/product/8.1.6/admin/adhoc
/opt/oracle/product/8.1.6/admin/bdump
/opt/oracle/product/8.1.6/admin/udump
/opt/oracle/product/8.1.6/admin/adump
/opt/oracle/product/8.1.6/admin/cdump
/opt/oracle/product/8.1.6/admin/create

/u02/oradata/PROD
/u03/oradata/PROD
/u04/oradata/PROD

etc..


Example mountpoints and disks:
------------------------------

Mountpunt	Device	        Omvang 	Doel
/	       /dev/md/dsk/d1   100	Unix Root-filesysteem
/usr	       /dev/md/dsk/d3	1200	Unix usr-filesysteem
/var	       /dev/md/dsk/d4	200	Unix var-filesysteem
/home	       /dev/md/dsk/d5	200	Unix opt-filesysteem
/opt	       /dev/md/dsk/d6	4700	Oracle_Home
/u01	       /dev/md/dsk/d7	8700	Oracle datafiles	
/u02	       /dev/md/dsk/d8	8700	Oracle datafiles	
/u03	       /dev/md/dsk/d9	8700	Oracle datafiles	
/u04	       /dev/md/dsk/d10	8700	Oracle datafiles	
/u05	       /dev/md/dsk/d110	8700	Oracle datafiles	
/u06	       /dev/md/dsk/d120	8700	Oracle datafiles	
/u07	       /dev/md/dsk/d123	8650 	Oracle datafiles	


9.1.4 Users en groups:
----------------------


Als je met OS verificatie wilt werken, moet in de init.ora gezet zijn:
remote_login_passwordfile=none (passwordfile authentication via exlusive)

Benodigde groups in UNIX: group dba. Deze moet voorkomen in de /etc/group file
vaak is ook nog nodig de group oinstall

groupadd dba
groupadd oinstall
groupadd oper

Maak nu user oracle aan:
adduser -g oinstall -G dba -d /home/oracle oracle


#  groupadd dba
#  useradd oracle
#  mkdir /usr/oracle
#  mkdir /usr/oracle/9.0
#  chown -R oracle:dba /usr/oracle
#  touch /etc/oratab
#  chown oracle:dba /etc/oratab


9.1.5 mount points en disks:
----------------------------

maak de mount points:

mkdir /opt/u01
mkdir /opt/u02
mkdir /opt/u03
mkdir /opt/u04  

dit moeten voor een produktie omgeving aparte schijven zijn

Geef nu ownership van deze mount points aan user oracle en group oinstall

chown -R oracle:oinstall /opt/u01
chown -R oracle:oinstall /opt/u02
chown -R oracle:oinstall /opt/u03
chown -R oracle:oinstall /opt/u04

directories: drwxr-xr-x  oracle  dba
files      : -rw-r-----  oracle  dba
           : -rw-r--r--  oracle  dba

chmod 644 *
chmod u+x filename
chmod ug+x filename


9.1.6 test van user oracle:
---------------------------


log in als user oracle en geef de commANDo's

$groups   laat de groups zien (oinstall, dba)
$umask   laat 022 zien, zoniet zet dan de line umask 022 in het .profile

umask is de default mode van een file of directory wanneer deze aangemaakt wordt.
rwxrwxrwx=777
rw-rw-rw-=666
rw-r--r--=644 welke correspondeert met umask 022

Verander nu het .profile of .bash_profile van de user oracle.
Plaats de environment variabelen van 9.1 in het profile.

log uit en in als user oracle, en test de environment:
%env
%echo $variablename


9.1.7 Oracle Installer bij 8.1.x op Linux:
------------------------------------------

Log in als user oracle. Draai nu oracle installer:

Linux:

  startx
  cd /usr/local/src/Oracle8iR3
  ./runInstaller

of

  Ga naar install/linux op de CD en run runIns.sh


Nu volgt een grafische setup. Beantwoord de vragen.

Het kan zijn dat de installer vraagt om scripts uit te voeren zoals:
orainstRoot.sh en root.sh
Om dit uit te voeren:

   open een nieuw window
   su root
   cd $ORACLE_HOME
   ./orainstRoot.sh


Installatie database op Unix:
-----------------------------

  $ export PATH=$PATH:$ORACLE_HOME/bin
  $ export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:$ORACLE_HOME/lib
  $ dbca &

or

  $ cat "db1:/usr/oracle/9.0:Y >> /etc/oratab"
  $ cd $ORACLE_HOME/dbs
  $ cat initdw.ora |sed s/"#db_name = MY_DB_NAME"/"db_name = db1"/|sed s/#control_files/control_files/ > initdb1.ora
Start and create database : 
  $ export PATH=$PATH:$ORACLE_HOME/bin
  $ export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:$ORACLE_HOME/lib
  $ export ORACLE_SID=db1
  $ sqlplus /nolog <<!
  connect / as sysdba
  startup nomount
  create database db1
  !

This creates a default database with files in $ORACLE_HOME/dbs 
Now add the database meta data to actually make it useful : 

$ sqlplus /nolog <<!
  connect / as sysdba
  @?/rdbms/admin/catalog   # E.g: /apps/oracle/product/9.2/rdbms/admin
  @?/rdbms/admin/catproc
  !
Now create a user and give it wide ranging permissions : 
  $ sqlplus /nolog <<!
  connect / as sysdba
  create user myuser identified by password;
  grant create session,create any table to myuser;
  grant unlimited tablespace to myuser;
  !


9.1.8 OS or Password Authentication:
------------------------------------

-- Preparing to Use OS Authentication

To enable authentication of an administrative user using the operating system you must do the following:

Create an operating system account for the user. Add the user to the OSDBA or OSOPER operating system defined groups. 
Ensure that the initialization parameter, REMOTE_LOGIN_PASSWORDFILE, is set to NONE. This is the default value 
for this parameter. 

A user can be authenticated, enabled as an administrative user, and connected to a local database by typing 
one of the following SQL*Plus commands:

CONNECT / AS SYSDBA
CONNECT / AS SYSOPER

For a remote database connection over a secure connection, the user must also specify the net service name 
of the remote database:

CONNECT /@net_service_name AS SYSDBA
CONNECT /@net_service_name AS SYSOPER


OSDBA:
unix   :  dba
windows: ORA_DBA
 
OSOPER:
unix   : oper
windows: ORA_OPER
 

-- Preparing to Use Password File Authentication

To enable authentication of an administrative user using password file authentication you must do the following:

Create an operating system account for the user. 
If not already created, Create the password file using the ORAPWD utility: 

ORAPWD FILE=filename PASSWORD=password ENTRIES=max_users


Set the REMOTE_LOGIN_PASSWORDFILE initialization parameter to EXCLUSIVE. 
Connect to the database as user SYS (or as another user with the administrative privilege). 
If the user does not already exist in the database, create the user. Grant the SYSDBA or SYSOPER 
system privilege to the user: 
GRANT SYSDBA to scott;

This statement adds the user to the password file, thereby enabling connection AS SYSDBA.

For example, user scott has been granted the SYSDBA privilege, so he can connect as follows:

CONNECT scott/tiger AS SYSDBA


9.1.9 Create a 9i database:
---------------------------

Step 1: Decide on Your Instance Identifier (SID)

Step 2: Establish the Database Administrator Authentication Method

Step 3: Create the Initialization Parameter File

Step 4: Connect to the Instance

Step 5: Start the Instance.

Step 6: Issue the CREATE DATABASE Statement

Step 7: Create Additional Tablespaces

Step 8: Run Scripts to Build Data Dictionary Views

Step 9: Run Scripts to Install Additional Options (Optional)

Step 10: Create a Server Parameter File (Recommended)

Step 11: Back Up the Database.


Step 1:
-------

% ORACLE_SID=ORATEST; export ORACLE_SID 

Step 2: see above
-----------------

Step 3: init.ora
----------------

Sample Initialization Parameter File
# Cache and I/O
DB_BLOCK_SIZE=4096
DB_CACHE_SIZE=20971520

# Cursors and Library Cache
CURSOR_SHARING=SIMILAR
OPEN_CURSORS=300

# Diagnostics and Statistics
BACKGROUND_DUMP_DEST=/vobs/oracle/admin/mynewdb/bdump
CORE_DUMP_DEST=/vobs/oracle/admin/mynewdb/cdump
TIMED_STATISTICS=TRUE
USER_DUMP_DEST=/vobs/oracle/admin/mynewdb/udump

# Control File Configuration
CONTROL_FILES=("/vobs/oracle/oradata/mynewdb/control01.ctl",
               "/vobs/oracle/oradata/mynewdb/control02.ctl",
               "/vobs/oracle/oradata/mynewdb/control03.ctl")

# Archive
LOG_ARCHIVE_DEST_1='LOCATION=/vobs/oracle/oradata/mynewdb/archive'
LOG_ARCHIVE_FORMAT=%t_%s.dbf
LOG_ARCHIVE_START=TRUE

# Shared Server
# Uncomment and use first DISPATCHES parameter below when your listener is
# configured for SSL 
# (listener.ora and sqlnet.ora)
# DISPATCHERS = "(PROTOCOL=TCPS)(SER=MODOSE)",
#               "(PROTOCOL=TCPS)(PRE=oracle.aurora.server.SGiopServer)"
DISPATCHERS="(PROTOCOL=TCP)(SER=MODOSE)",
            "(PROTOCOL=TCP)(PRE=oracle.aurora.server.SGiopServer)",
             (PROTOCOL=TCP)

# Miscellaneous
COMPATIBLE=9.2.0
DB_NAME=mynewdb

# Distributed, Replication and Snapshot
DB_DOMAIN=us.oracle.com
REMOTE_LOGIN_PASSWORDFILE=EXCLUSIVE

# Network Registration
INSTANCE_NAME=mynewdb

# Pools
JAVA_POOL_SIZE=31457280
LARGE_POOL_SIZE=1048576
SHARED_POOL_SIZE=52428800

# Processes and Sessions
PROCESSES=150

# Redo Log and Recovery
FAST_START_MTTR_TARGET=300

# Resource Manager
RESOURCE_MANAGER_PLAN=SYSTEM_PLAN

# Sort, Hash Joins, Bitmap Indexes
SORT_AREA_SIZE=524288

# Automatic Undo Management
UNDO_MANAGEMENT=AUTO
UNDO_TABLESPACE=undotbs


Step 4: Connect to the Instance:
--------------------------------
Start SQL*Plus and connect to your Oracle instance AS SYSDBA.

$ SQLPLUS /nolog
CONNECT SYS/password AS SYSDBA

Step 5: Start the Instance:
---------------------------

Start an instance without mounting a database. Typically, you do this only during database creation or while performing
 maintenance on the database. Use the STARTUP command with the NOMOUNT option. In this example, because the initialization 
parameter file is stored in the default location, you are not required to specify the PFILE clause:

STARTUP NOMOUNT

At this point, there is no database. Only the SGA is created and background processes are started in preparation 
for the creation of a new database.

Step 6: Issue the CREATE DATABASE Statement:
--------------------------------------------

To create the new database, use the CREATE DATABASE statement. The following statement creates database mynewdb:

CREATE DATABASE mynewdb
   USER SYS IDENTIFIED BY pz6r58
   USER SYSTEM IDENTIFIED BY y1tz5p
   LOGFILE GROUP 1 ('/vobs/oracle/oradata/mynewdb/redo01.log') SIZE 100M,
           GROUP 2 ('/vobs/oracle/oradata/mynewdb/redo02.log') SIZE 100M,
           GROUP 3 ('/vobs/oracle/oradata/mynewdb/redo03.log') SIZE 100M
   MAXLOGFILES 5
   MAXLOGMEMBERS 5
   MAXLOGHISTORY 1
   MAXDATAFILES 100
   MAXINSTANCES 1
   CHARACTER SET US7ASCII
   NATIONAL CHARACTER SET AL16UTF16
   DATAFILE '/vobs/oracle/oradata/mynewdb/system01.dbf' SIZE 325M REUSE
   EXTENT MANAGEMENT LOCAL
   DEFAULT TEMPORARY TABLESPACE tempts1
      DATAFILE '/vobs/oracle/oradata/mynewdb/temp01.dbf' 
      SIZE 20M REUSE
   UNDO TABLESPACE undotbs 
      DATAFILE '/vobs/oracle/oradata/mynewdb/undotbs01.dbf'
      SIZE 200M REUSE AUTOEXTEND ON NEXT 5120K MAXSIZE UNLIMITED;


Step 7: Create Additional Tablespaces:
--------------------------------------

To make the database functional, you need to create additional files and tablespaces for users. 
The following sample script creates some additional tablespaces:

CONNECT SYS/password AS SYSDBA
-- create a user tablespace to be assigned as the default tablespace for users
CREATE TABLESPACE users LOGGING 
     DATAFILE '/vobs/oracle/oradata/mynewdb/users01.dbf' 
     SIZE 25M REUSE AUTOEXTEND ON NEXT  1280K MAXSIZE UNLIMITED 
     EXTENT MANAGEMENT LOCAL;
-- create a tablespace for indexes, separate from user tablespace
CREATE TABLESPACE indx LOGGING 
     DATAFILE '/vobs/oracle/oradata/mynewdb/indx01.dbf' 
     SIZE 25M REUSE AUTOEXTEND ON NEXT  1280K MAXSIZE UNLIMITED 
     EXTENT MANAGEMENT LOCAL;
EXIT


Step 8: Run Scripts to Build Data Dictionary Views:
---------------------------------------------------

Run the scripts necessary to build views, synonyms, and PL/SQL packages:

CONNECT SYS/password AS SYSDBA
@/vobs/oracle/rdbms/admin/catalog.sql
@/vobs/oracle/rdbms/admin/catproc.sql
EXIT

Do not forget to run as SYSTEM the script /sqlplus/admin/pupbld.sql;


The following table contains descriptions of the scripts:

Script Description 
CATALOG.SQL:  Creates the views of the data dictionary tables, the dynamic performance views, and public synonyms 
              for many of the views. Grants PUBLIC access to the synonyms.
 
CATPROC.SQL:  Runs all scripts required for or used with PL/SQL.
 

Step 10: Create a Server Parameter File (Recommended):
------------------------------------------------------

Oracle recommends you create a server parameter file as a dynamic means of maintaining initialization parameters. 
The following script creates a server parameter file from the text initialization parameter file and writes it 
to the default location. The instance is shut down, then restarted using the server parameter file (in the default location).

CONNECT SYS/password AS SYSDBA
-- create the server parameter file 
CREATE SPFILE='/vobs/oracle/dbs/spfilemynewdb.ora' FROM
       PFILE='/vobs/oracle/admin/mynewdb/scripts/init.ora';
SHUTDOWN 
-- this time you will start up using the server parameter file
CONNECT SYS/password AS SYSDBA
STARTUP 
EXIT


CREATE SPFILE='/opt/app/oracle/product/9.2/dbs/spfileOWS.ora' 
FROM  PFILE='/opt/app/oracle/admin/OWS/pfile/init.ora';

CREATE SPFILE='/opt/app/oracle/product/9.2/dbs/spfilePEGACC.ora' 
FROM PFILE='/opt/app/oracle/admin/PEGACC/scripts/init.ora';

CREATE SPFILE='/opt/app/oracle/product/9.2/dbs/spfilePEGTST.ora' 
FROM PFILE='/opt/app/oracle/admin/PEGTST/scripts/init.ora';


9.10 Oracle 9i licenses:
------------------------

Setting License Parameters

Oracle no longer offers licensing by the number of concurrent sessions. Therefore the LICENSE_MAX_SESSIONS and LICENSE_SESSIONS_WARNING 
initialization parameters have been deprecated.
 

- named user licesnsing:

If you use named user licensing, Oracle can help you enforce this form of licensing. You can set a limit on the number of users 
created in the database. Once this limit is reached, you cannot create more users.

Note: 
This mechanism assumes that each person accessing the database has a unique user name and that no people share a user name. 
Therefore, so that named user licensing can help you ensure compliance with your Oracle license agreement, do not allow 
multiple users to log in using the same user name. 

To limit the number of users created in a database, set the LICENSE_MAX_USERS initialization parameter in the 
database's initialization parameter file, as shown in the following example:

LICENSE_MAX_USERS = 200


- per-processor licensing:

Oracle encourages customers to license the database on the per-processor licensing model. With this licensing method 
you count up the number of CPUs in your computer, and multiply that number by the licensing cost of the database 
and database options you need.

Currently the Standard (STD) edition of the database is priced at $15,000 per processor, and the Enterprise (EE) edition is priced at 
$40,000 per processor. The RAC feature is $20,000 per processor extra, and you need to add 22 percent annually for the support contract.

It's possible to license the database on a per-user basis, which makes financial sense if there'll never be many users accessing
the database. However, the licensing method can't be changed after it is initially licensed. So if the business grows and 
requires significantly more users to access the database, the costs could exceed the costs under the per-processor model. 
You also have to understand what Oracle corporation considers to be a user for the purposes of licensing purposes. 
If 1,000 users access the database through an application server, which only makes five connections to the database, 
then Oracle will require that either 1,000 user licenses be purchased or that the database be licensed via 
the per-processor pricing model.

The Oracle STD edition is licensed at $300 per user (with a five user minimum), and EE edition costs $800 per user 
(with a 25 user minimum). There is still an annual support fee of 22 percent, which should be budgeted in addition to the licensing fees. 
If the support contract is not paid each year, then the customer is not licensed to upgrade to the latest version of the database and must 
re-purchase all of the licenses over again in order to upgrade versions. This section only gives you a brief overview of the available 
licensing options and costs, so if you have additional questions you really should contact an Oracle sales representative


9.11. Older Database installations:
-----------------------------------

CREATE DATABASE Examples on 8.x


The easiest way to create a 8i, 9i database, is using the "Database Configuration Assistant".
Using this tool, you are able to create a database and setup the NET configuration and the listener,
in a graphical environment.

It is also possible to use a script running in sqlpus (8i,9i) or svrmgrl (only in 8i).

Charactersets that are used a lot in europe:

WE8ISO8859P15
WE8MMSWIN1252


Example 1:
----------

$ SQLPLUS /nolog
CONNECT username/password AS sysdba

STARTUP NOMOUMT PFILE=<path to init.ora>

--  Create database
CREATE DATABASE rbdb1
    CONTROLFILE REUSE
    LOGFILE '/u01/oracle/rbdb1/redo01.log' SIZE 1M REUSE,
            '/u01/oracle/rbdb1/redo02.log' SIZE 1M REUSE,
            '/u01/oracle/rbdb1/redo03.log' SIZE 1M REUSE,
            '/u01/oracle/rbdb1/redo04.log' SIZE 1M REUSE
    DATAFILE '/u01/oracle/rbdb1/system01.dbf' SIZE 10M REUSE 
      AUTOEXTEND ON
      NEXT 10M MAXSIZE 200M 
    CHARACTER SET WE8ISO8859P1;

run catalog.sql
run catproq.sql

-- Create another (temporary) system tablespace
CREATE ROLLBACK SEGMENT rb_temp STORAGE (INITIAL 100 k NEXT 250 k);

-- Alter temporary system tablespace online before proceding
ALTER ROLLBACK SEGMENT rb_temp ONLINE;

-- Create additional tablespaces ...
-- RBS: For rollback segments
-- USERs: Create user sets this as the default tablespace
-- TEMP: Create user sets this as the temporary tablespace
CREATE TABLESPACE rbs
    DATAFILE '/u01/oracle/rbdb1/rbs01.dbf' SIZE 5M REUSE AUTOEXTEND ON
      NEXT 5M MAXSIZE 150M;
CREATE TABLESPACE users
    DATAFILE '/u01/oracle/rbdb1/users01.dbf' SIZE 3M REUSE AUTOEXTEND ON
      NEXT 5M MAXSIZE 150M;
CREATE TABLESPACE temp
    DATAFILE '/u01/oracle/rbdb1/temp01.dbf' SIZE 2M REUSE AUTOEXTEND ON
      NEXT 5M MAXSIZE 150M;

-- Create rollback segments.  
CREATE ROLLBACK SEGMENT rb1 STORAGE(INITIAL 50K NEXT 250K)
  tablespace rbs;
CREATE ROLLBACK SEGMENT rb2 STORAGE(INITIAL 50K NEXT 250K)
  tablespace rbs;
CREATE ROLLBACK SEGMENT rb3 STORAGE(INITIAL 50K NEXT 250K)
  tablespace rbs;
CREATE ROLLBACK SEGMENT rb4 STORAGE(INITIAL 50K NEXT 250K)
  tablespace rbs;

-- Bring new rollback segments online and drop the temporary system one
ALTER ROLLBACK SEGMENT rb1 ONLINE;
ALTER ROLLBACK SEGMENT rb2 ONLINE;
ALTER ROLLBACK SEGMENT rb3 ONLINE;
ALTER ROLLBACK SEGMENT rb4 ONLINE;

ALTER ROLLBACK SEGMENT rb_temp OFFLINE;
DROP ROLLBACK SEGMENT rb_temp ;


Example 2:
----------

connect internal
startup nomount pfile=/disk00/oracle/software/7.3.4/dbs/initDB1.ora

create database "DB1"
   maxinstances 2
   maxlogfiles 32
   maxdatafiles 254
   characterset "US7ASCII"  

datafile '/disk02/oracle/oradata/DB1/system01.dbf' size 128M
autoextent on next 8M maxsize 256M

logfile group 1 
        ('/disk03/oracle/oradata/DB1/redo1a.log', 
         '/disk04/oracle/oradata/DB1/redo1b.log') size 5M,
        group 2
        ('/disk05/oracle/oradata/DB1/redo2a.log',
        ('/disk06/oracle/oradata/DB1/redo2b.log') size 5M


REM * install data dictionary views    
@/disk00/oracle/software/7.3.4/rdbms/admin/catalog.sql
@/disk00/oracle/software/7.3.4/rdbms/admin/catproq.sql

create rollback segment  SYSROLL tablespace system
storage (initial 2M next 2M minextents 2 maxextents 255);

alter rollback segment SYSROLL online;


create tablespace RBS
 datafile '/disk01/oracle/oradata/DB1/rbs01.dbf' size 25M
 default storage (
  initial     500K
  next        500K
  pctincrease 0
  minextents  2  );

create rollback segment  RBS_1 tablespace RBS1
storage (initial 512K next 512K minextents 50);

create rollback segment  RBS02 tablespace RBS
storage (initial 500K next 500K minextents 2 optimal 1M);

etc..

alter rollback segment RBS01 online;
alter rollback segment RBS02 online;

etc..

create tablespace DATA
 datafile '/disk05/oracle/oradata/DB1/data01.dbf' size 25M
 default storage (
  initial     500K
  next        500K
  pctincrease 0
  maxextends  UNLIMITED );

etc.. other tablespaces you need
run other scripts you need.

alter user sys temporary tablespace TEMP;
alter user system default tablespace TOOLS temporary tablespace TEMP;

connect system/manager

@/disk00/oracle/software/7.3.4/rdbms/admin/catdbsyn.sql


@/disk00/oracle/software/7.3.4/rdbms/admin/pubbld.sql
t.b.v. PRODUCT_USER_PROFILE, SQLPLUS_USER_PROFILE

Example 3: on NT/2000 8i best example:
--------------------------------------

Suppose you want a second database on a NT/2000 Server:

1. create a service with oradim

oradim -new -sid -startmode -pfile

2. sqlplus /nolog (or use svrmgrl)

startup nomount pfile="G:\oracle\admin\hd\pfile\init.ora"

SVRMGR> CREATE DATABASE hd
        LOGFILE 'G:\oradata\hd\redo01.log' SIZE 2048K,
           'G:\oradata\hd\redo02.log' SIZE 2048K,
           'G:\oradata\hd\redo03.log' SIZE 2048K
        MAXLOGFILES 32
        MAXLOGMEMBERS 2
        MAXLOGHISTORY 1
        DATAFILE 'G:\oradata\hd\system01.dbf' SIZE 264M  REUSE AUTOEXTEND ON NEXT 10240K
        MAXDATAFILES 254
        MAXINSTANCES 1
        CHARACTER SET WE8ISO8859P1
        NATIONAL CHARACTER SET WE8ISO8859P1;


@catalog.sql
@catproq.sql
  

Oracle 9i:
----------

Example 1:
----------


CREATE DATABASE mynewdb
   USER SYS IDENTIFIED BY pz6r58
   USER SYSTEM IDENTIFIED BY y1tz5p
   LOGFILE GROUP 1 ('/vobs/oracle/oradata/mynewdb/redo01.log') SIZE 100M,
           GROUP 2 ('/vobs/oracle/oradata/mynewdb/redo02.log') SIZE 100M,
           GROUP 3 ('/vobs/oracle/oradata/mynewdb/redo03.log') SIZE 100M
   MAXLOGFILES 5
   MAXLOGMEMBERS 5
   MAXLOGHISTORY 1
   MAXDATAFILES 100
   MAXINSTANCES 1
   CHARACTER SET US7ASCII
   NATIONAL CHARACTER SET AL16UTF16
   DATAFILE '/vobs/oracle/oradata/mynewdb/system01.dbf' SIZE 325M REUSE
   EXTENT MANAGEMENT LOCAL
   DEFAULT TEMPORARY TABLESPACE tempts1
      DATAFILE '/vobs/oracle/oradata/mynewdb/temp01.dbf' 
      SIZE 20M REUSE
   UNDO TABLESPACE undotbs 
      DATAFILE '/vobs/oracle/oradata/mynewdb/undotbs01.dbf'
      SIZE 200M REUSE AUTOEXTEND ON NEXT 5120K MAXSIZE UNLIMITED;


9.2 Automatische start oracle bij system boot:
==============================================


9.2.1 oratab:
-------------

Inhoud ORATAB in /etc of /var/opt:

Voorbeeld:

  #   $ORACLE_SID:$ORACLE_HOME:[N|Y]
  #
  ORCL:/u01/app/oracle/product/8.0.5:Y
  #


De oracle scripts om de database te starten en te stoppen zijn: $ORACLE_HOME/bin/dbstart en dbshut,
of startdb en stopdb of wat daarop lijkt.  Deze kijken in ORATAB om te zien welke databases
gestart moeten worden.


9.2.2 dbstart en dbshut:
------------------------

Het script dbstart zal oratab lezen en ook tests doen en om de oracle versie
te bepalen. Verder bestaat de kern uit:

  het starten van sqldba, svrmgrl of sqlplus
  vervolgens doen we een connect
  vervolgens geven we het startup commando.

Voor dbshut geldt een overeenkomstig verhaal.


9.2.3 init, sysinit, rc:
------------------------

Voor een automatische start, voeg nu de juiste entries toe in het /etc/rc2.d/S99dbstart 
(or equivalent) file: 

Tijdens het opstarten van Unix worden de scrips in de /etc/rc2.d uitgevoerd die beginnen met een 'S' 
en in alfabetische volgorde. 
De Oracle database processen zullen als (een van de) laatste processen worden gestart. 
Het bestAND S99oracle is gelinkt met deze directory.

Inhoud S99oracle:

  su - oracle -c "/path/to/$ORACLE_HOME/bin/dbstart"         # Start DB's
  su - oracle -c "/path/to/$ORACLE_HOME/bin/lsnrctl start"   # Start listener
  su - oracle -c "/path/tp/$ORACLE_HOME/bin/namesctl start"  # Start OraNames (optional)

Het dbstart script is een standaard Oracle script. Het kijkt in oratab welke sid's op 'Y' staan, 
en zal deze databases starten.

of customized via een customized startdb script:

  ORACLE_ADMIN=/opt/oracle/admin; export ORACLE_ADMIN

  su - oracle -c "$ORACLE_ADMIN/bin/startdb WPRD 1>$ORACLE_ADMIN/log/WPRD/startWPRD.$$ 2>&1"
  su - oracle -c "$ORACLE_ADMIN/bin/startdb WTST 1>$ORACLE_ADMIN/log/WTST/startWTST.$$ 2>&1"
  su - oracle -c "$ORACLE_ADMIN/bin/startdb WCUR 1>$ORACLE_ADMIN/log/WCUR/startWCUR.$$ 2>&1"


9.3 Het stoppen van Oracle in unix:
-----------------------------------


Tijdens het down brengen van Unix (shutdown -i 0) worden de scrips in de directory /etc/rc2.d 
uitgevoerd die beginnen met een 'K' en in alfabetische volgorde. 
De Oracle database processen zijn een van de eerste processen die worden afgesloten. 
Het bestand K10oracle is gelinkt met de /etc/rc2.d/K10oracle

# Configuration File: /opt/oracle/admin/bin/K10oracle


ORACLE_ADMIN=/opt/oracle/admin; export ORACLE_ADMIN

su - oracle -c "$ORACLE_ADMIN/bin/stopdb WPRD 1>$ORACLE_ADMIN/log/WPRD/stopWPRD.$$ 2>&1"
su - oracle -c "$ORACLE_ADMIN/bin/stopdb WCUR 1>$ORACLE_ADMIN/log/WCUR/stopWCUR.$$ 2>&1"
su - oracle -c "$ORACLE_ADMIN/bin/stopdb WTST 1>$ORACLE_ADMIN/log/WTST/stopWTST.$$ 2>&1"


9.4 startdb en stopdb:
----------------------

Startdb [ORACLE_SID]
--------------------

Dit script is een onderdeel van het script S99Oracle. Dit script heeft 1 parameter, ORACLE_SID

# Configuration File: /opt/oracle/admin/bin/startdb

# Algemene omgeving zetten

. $ORACLE_ADMIN/env/profile

ORACLE_SID=$1
echo $ORACLE_SID 

# Omgeving zetten RDBMS
. $ORACLE_ADMIN/env/$ORACLE_SID.env

# Het starten van de database
sqlplus /nolog << EOF
connect / as sysdba
startup
EOF

# Het starten van de listener
lsnrctl start $ORACLE_SID

# Het starten van de intelligent agent voor alle instances
#lsnrctl dbsnmp_start


Stopdb [ORACLE_SID]
-------------------

Dit script is een onderdeel van het script K10Oracle. Dit script heeft 1 parameter, ORACLE_SID

# Configuration File: /opt/oracle/admin/bin/stopdb

# Algemene omgeving zetten
. $ORACLE_ADMIN/env/profile

ORACLE_SID=$1
export $ORACLE_SID

# Settings van het RDBMS
. $ORACLE_ADMIN/env/$ORACLE_SID.env

# Het stoppen van de intelligent agent
#lsnrctl dbsnmp_stop

# Het stoppen van de listener
lsnrctl stop $ORACLE_SID

# Het stoppen van de database.
sqlplus /nolog << EOF
connect / as sysdba
shutdown immediate
EOF


9.5 Batches:
------------

De batches (jobs) worden gestart door het Unix proces cron

# Batches (Oracle)

# Configuration File: /var/spool/cron/crontabs/root
# Format of lines:
# min	hour	daymo	month	daywk	cmd
#
# Dayweek 0=sunday, 1=monday...
0        9        *       *       6  /sbin/sh /opt/oracle/admin/batches/bin/batches.sh  
>> /opt/oracle/admin/batches/log/batcheserroroutput.log 2>&1

# Configuration File: /opt/oracle/admin/batches/bin/batches.sh
# Door de op de commandline  ' BL_TRACE=T ; export BL_TRACE ' worden alle commando's getoond.
case $BL_TRACE in
    T)	set -x ;;
esac

ORACLE_ADMIN=/opt/oracle/admin; export ORACLE_ADMIN
ORACLE_HOME=/opt/oracle/product/8.1.6; export ORACLE_HOME

ORACLE_SID=WCUR ; export ORACLE_SID
su - oracle -c ". $ORACLE_ADMIN/env/profile ; . $ORACLE_ADMIN/env/$ORACLE_SID.env; 
cd $ORACLE_ADMIN/batches/bin; sqlplus /NOLOG @$ORACLE_ADMIN/batches/bin/Analyse_WILLOW2K.sql 1>
$ORACLE_ADMIN/batches/log/batches$ORACLE_SID.`date +"%y%m%d"` 2>&1"

ORACLE_SID=WCON ; export ORACLE_SID
su - oracle -c ". $ORACLE_ADMIN/env/profile ; . $ORACLE_ADMIN/env/$ORACLE_SID.env; 
cd $ORACLE_ADMIN/batches/bin; sqlplus /NOLOG @$ORACLE_ADMIN/batches/bin/Analyse_WILLOW2K.sql 1>
$ORACLE_ADMIN/batches/log/batches$ORACLE_SID.`date +"%y%m%d"` 2>&1"


9.6 Autostart in NT/Win2K:
--------------------------

1) Older versions

delete the existing instance FROM the command prompt:
oradim80 -delete -sid SID

recreate the instance FROM the command prompt:

oradim -new -sid SID -intpwd <password> -startmode <auto> -pfile <path\initSID.ora>

Execute the command file FROM the command prompt: oracle_home\database\strt<sid>.cmd

Check the log file generated FROM this execution: oracle_home\rdbmsxx\oradimxx.log

2) NT Registry value

HKEY_LOCAL_MACHINE\SOFTWARE\ORACLE\HOME0\ORA_SID_AUTOSTART REG_EXPAND_SZ TRUE
 

9.7 Tools:
----------

Relink van Oracle:
------------------

info:

  showrev -p
  pkginfo -i

relink:

  mk -f $ORACLE_HOME/rdbms/lib/ins_rdbms.mk install
  mk -f $ORACLE_HOME/svrmgr/lib/ins_svrmgr.mk install
  mk -f $ORACLE_HOME/network/lib/ins_network.mk install

$ORACLE_HOME/bin

relink all

Relinking Oracle 

Background: Applications for UNIX are generally not distributed as complete executables.   Oracle, like many 
application vendors who create products for UNIX, distribute  individual object files, library archives of object files, 
and some source  files which then get ?relinked? at the operating system level during  installation to create 
usable executables.  This guarantees a reliable integration with functions provided by the OS system libraries.  
Relinking occurs automatically under these circumstances:   
- An Oracle product has been installed with an Oracle provided installer.  
- An Oracle patch set has been applied via an Oracle provided installer.   

[Step 1] Log into the UNIX system as the Oracle software owner
Typically this is the user 'oracle'. 

[STEP 2] Verify that your $ORACLE_HOME is set correctly: 
For all Oracle Versions and Platforms, perform this basic environment check  first:    
% cd $ORACLE_HOME  
% pwd       
...Doing this will ensure that $ORACLE_HOME is set correctly in your current environment. 

[Step 3] Verify and/or Configure the UNIX Environment for Proper Relinking: 
For all Oracle Versions and UNIX Platforms:  The Platform specific environment variables LIBPATH, LD_LIBRARY_PATH, 
&   SHLIB_PATH typically are already set to include system library locations like  '/usr/lib'.  
In most cases, you need only check what they are set to first,   then add the $ORACLE_HOME/lib directory to them 
where appropriate.  i.e.:  
% setenv LD_LIBRARY_PATH ${ORACLE_HOME}/lib:${LD_LIBRARY_PATH}  
(see [NOTE:131207.1] How to Set UNIX Environment Variables for help with setting UNIX environment variables) 

If on SOLARIS (Sparc or Intel) with: 
Oracle 7.3.X, 8.0.X, or 8.1.X:        
- Ensure that /usr/ccs/bin is before /usr/ucb in $PATH            
% which ld   ....should return '/usr/ccs/bin/ld'         
If using 32bit(non 9i) Oracle,         
- Set LD_LIBRARY_PATH=$ORACLE_HOME/lib         
If using 64bit(non 9i) Oracle,         
- Set LD_LIBRARY_PATH=$ORACLE_HOME/lib        
- Set LD_LIBRARY_PATH_64=$ORACLE_HOME/lib64          
Oracle 9.X.X (64Bit) on Solaris (64Bit) OS        
- Set LD_LIBRARY_PATH=$ORACLE_HOME/lib32          
- Set LD_LIBRARY_PATH_64=$ORACLE_HOME/lib          
Oracle 9.X.X (32Bit) on Solaris (64Bit) OS        
- Set LD_LIBRARY_PATH=$ORACLE_HOME/lib  

[Step 4] For all Oracle Versions and UNIX Platforms:  
Verify that you performed Step 2 correctly:  
 % env|pg  ....make sure that you see the correct absolute path for    $ORACLE_HOME in the variable definitions. 

[Step 5] Run the OS Commands to Relink Oracle:  
Before relinking Oracle, shut down both the database and the listener.  

Oracle 8.1.X or 9.X.X 
------------------------   
*** NEW IN 8i AND ABOVE ***     
A 'relink' script is provided in the $ORACLE_HOME/bin directory.      
% cd $ORACLE_HOME/bin      
% relink      ...this will display all of the command's options.        usage: relink <parameter>        
accepted values for parameter: 
all               Every product executable that has been installed  
oracle            Oracle Database executable only  
network           net_client, net_server, cman  
client            net_client, plsql  
client_sharedlib  Client shared library  
interMedia        ctx  
ctx               Oracle Text utilities  
precomp           All precompilers that have been installed  
utilities         All utilities that have been installed  
oemagent          oemagent 
                  Note: To give the correct permissions to the nmo and nmb executables, 
                  you must run the root.sh script after relinking oemagent. 
 
ldap              ldap, oid  

Note: ldap option is available only from 9i. In 8i, you would have to manually relink ldap.    
You can relink most of the executables associated with an Oracle Server Installation   
by running the following command:      % relink all       

This will not relink every single executable Oracle provides 
(you can discern which executables were relinked by checking their timestamp with   
'ls -l' in the $ORACLE_HOME/bin directory).  
However, 'relink all' will recreate the shared libraries that most executables rely on and thereby   
resolve most issues that require a proper relink.  

-or- 
Since the 'relink' command merely calls the traditional 'make' commands, you 
still have the option of running the 'make' commands independently: 
For executables: oracle, exp, imp, sqlldr, tkprof, mig, dbv, orapwd, rman, 
svrmgrl, ogms, ogmsctl 

% cd $ORACLE_HOME/rdbms/lib 
% make -f ins_rdbms.mk install 
For executables: sqlplus 
% cd $ORACLE_HOME/sqlplus/lib 
% make -f ins_sqlplus.mk install 
For executables: isqlplus 
% cd $ORACLE_HOME/sqlplus/lib 
% make -f ins_sqlplus install_isqlplus 
For executables: dbsnmp, oemevent, oratclsh 
% cd $ORACLE_HOME/network/lib 
% make -f ins_oemagent.mk install 
For executables: names, namesctl 
% cd $ORACLE_HOME/network/lib 
% make -f ins_names.mk install 
For executables: osslogin, trcasst, trcroute, onrsd, tnsping 
% cd $ORACLE_HOME/network/lib 
% make -f ins_net_client.mk install 
For executables: tnslsnr, lsnrctl 
% cd $ORACLE_HOME/network/lib 
% make -f ins_net_server.mk install 
For executables related to ldap (for example Oracle Internet Directory): 
% cd $ORACLE_HOME/ldap/lib 
% make -f ins_ldap.mk install 

Note:
Unix Installation/OS: RDBMS Technical Forum        

Displayed below are the messages of the selected thread. 

Thread Status: Closed 

From: Ray Stell 20-Apr-05 21:43 
Subject: solaris upgrade 

RDBMS Version: 9.2.0.4
Operating System and Version: Solaris 8
Error Number (if applicable): 
Product (i.e. SQL*Loader, Import, etc.): 
Product Version: 

solaris upgrade

I need to move a server from solaris 5.8 to 5.9. Does this 
require a new oracle 9.2.0 ee server install or relink or 
nothing at all? Thanks. 

--------------------------------------------------------------------------------

From: Samir Saad 21-Apr-05 03:28 
Subject: Re : solaris upgrade 


You must relink even if you find that the databases came up after Solaris upgrade and they seem fine. 

As for the existing Oracle installations, they will all be fine. 
Samir. 

--------------------------------------------------------------------------------

From: Oracle, soumya anand 21-Apr-05 10:59 
Subject: Re : solaris upgrade 


Hello Ray, 

As rightly pointed by Samir, after an OS upgrade it sufficient to 
relink the executables. 

Regards, 
Soumya 


Note: troubles after relink:
----------------------------

If you see on AIX something that resembles the following:

P522:/home/oracle $lsnrctl
exec(): 0509-036 Cannot load program lsnrctl because of the following errors:
        0509-130 Symbol resolution failed for /usr/lib/libc.a[aio_64.o] because:
        0509-136   Symbol kaio_rdwr64 (number 0) is not exported from
                   dependent module /unix.
        0509-136   Symbol listio64 (number 1) is not exported from
                   dependent module /unix.
        0509-136   Symbol acancel64 (number 2) is not exported from
                   dependent module /unix.
        0509-136   Symbol iosuspend64 (number 3) is not exported from
                   dependent module /unix.
        0509-136   Symbol aio_nwait (number 4) is not exported from
                   dependent module /unix.
        0509-150   Dependent module libc.a(aio_64.o) could not be loaded.
        0509-026 System error: Cannot run a file that does not have a valid format.
        0509-192 Examine .loader section symbols with the
                 'dump -Tv' command.


If this occurs, you have asynchronous I/O turned off. 

To turn on asynchronous I/O: 


Run smitty chgaio and set STATE to be configured at system restart from defined to available. 
Press Enter. 
Do one of the following: 
Restart your system. 
Run smitty aio and move the cursor to Configure defined Asynchronous I/O. Then press Enter. 


trace:
------

  truss -aef -o /tmp/trace svrmgrl

To trace what a Unix process is doing enter: 

  truss -rall -wall -p <PID>
  truss -p $ lsnrctl dbsnmp_start

NOTE: The "truss" command works on SUN and Sequent. Use "tusc" on HP-UX, "strace" on Linux, 
"trace" on SCO Unix or call your system administrator to find the equivalent command on your system. 
Monitor your Unix system: 


Logfiles:
---------

Unix message files record all system problems like disk errors, swap errors, NFS problems, etc. 
Monitor the following files on your system to detect system problems: 

  tail -f /var/adm/SYSLOG
  tail -f /var/adm/messages
  tail -f /var/log/syslog


===============
10. CONSTRAINTS:
===============


10.1 index owner en table owner information: DBA_INDEXES
-------------------------------------------

set linesize 100

SELECT DISTINCT
substr(owner, 1, 10)         as INDEX_OWNER, 
substr(index_name, 1, 40)    as INDEX_NAME,
substr(tablespace_name,1,40) as TABLE_SPACE,
substr(index_type, 1, 10)    as INDEX_TYPE, 
substr(table_owner, 1, 10)   as TABLE_OWNER, 
substr(table_name, 1, 40)    as TABLE_NAME,
BLEVEL,NUM_ROWS,STATUS
FROM DBA_INDEXES
order by index_owner;


SELECT DISTINCT
substr(owner, 1, 10)        as INDEX_OWNER, 
substr(index_name, 1, 40)   as INDEX_NAME,
substr(index_type, 1, 10)   as INDEX_TYPE, 
substr(table_owner, 1, 10)  as TABLE_OWNER, 
substr(table_name, 1, 40)   as TABLE_NAME
FROM DBA_INDEXES
WHERE table_name='HEAT_CUSTOMER';


SELECT 
substr(owner, 1, 10)        as INDEX_OWNER, 
substr(index_name, 1, 40)   as INDEX_NAME,
substr(index_type, 1, 10)   as INDEX_TYPE, 
substr(table_owner, 1, 10)  as TABLE_OWNER, 
substr(table_name, 1, 40)   as TABLE_NAME
FROM DBA_INDEXES
WHERE owner<>table_owner;

10.2 PK en FK constraint relations:
----------------------------------

SELECT 
c.constraint_type                  as TYPE, 
SUBSTR(c.table_name, 1, 40)        as TABLE_NAME,
SUBSTR(c.constraint_name, 1, 40)   as CONSTRAINT_NAME,
SUBSTR(c.r_constraint_name, 1, 40) as REF_KEY,
SUBSTR(b.column_name, 1, 40)       as COLUMN_NAME
FROM DBA_CONSTRAINTS c, DBA_CONS_COLUMNS b
WHERE 
c.constraint_name=b.constraint_name AND
c.OWNER in ('TRIDION_CM','TCMLOGDBUSER','VPOUSERDB')
AND c.constraint_type in ('P', 'R', 'U');


select c.constraint_name, c.constraint_type, c.table_name, 
(select table_name from c where c.r_constraint_name,
o.constraint_name, o.column_name
from dba_constraints c, dba_cons_columns o
where c.constraint_name=o.constraint_name and c.constraint_type='R'
and c.owner='BRAINS';


SELECT 'SELECT * FROM '||c.table_name||' WHERE '||b.column_name||' '||c.search_condition
FROM DBA_CONSTRAINTS c, DBA_CONS_COLUMNS b
WHERE 
c.constraint_name=b.constraint_name AND
c.OWNER='BRAINS' AND c.constraint_type = 'C';


SELECT 'ALTER TABLE PROJECTS.'||table_name||' enable constraint '||constraint_name||';'
FROM DBA_CONSTRAINTS
WHERE owner='PROJECTS' AND constraint_type='R';

SELECT 'ALTER TABLE BRAINS.'||table_name||' disable constraint '||constraint_name||';'
FROM USER_CONSTRAINTS
WHERE owner='BRAINS' AND constraint_type='R';


10.3 PK en FK constraint informatie: DBA_CONSTRAINTS
-----------------------------------

-- owner and all foreign key, constraints 

SELECT 
SUBSTR(owner, 1, 10)             as OWNER, 
constraint_type                  as TYPE, 
SUBSTR(table_name, 1, 40)        as TABLE_NAME,
SUBSTR(constraint_name, 1, 40)   as CONSTRAINT_NAME,
SUBSTR(r_constraint_name, 1, 40) as REF_KEY,
DELETE_RULE                      as DELETE_RULE,
status
FROM DBA_CONSTRAINTS
WHERE OWNER='BRAINS' AND constraint_type in ('R', 'P', 'U');

SELECT 
SUBSTR(owner, 1, 10)             as OWNER, 
constraint_type                  as TYPE, 
SUBSTR(table_name, 1, 30)        as TABLE_NAME,
SUBSTR(constraint_name, 1, 30)   as CONSTRAINT_NAME,
SUBSTR(r_constraint_name, 1, 30) as REF_KEY,
DELETE_RULE                      as DELETE_RULE,
status
FROM DBA_CONSTRAINTS
WHERE OWNER='BRAINS' AND constraint_type in ('R');

-- owner en alle primary key constraints bepalen van een bepaalde user, op bepaalde objects

Zelfde query: Zet OWNER='gewenste_owner' AND constraint_type='P'


10.4 opsporen bijbehorende index van een bepaalde constraint: DBA_INDEXES, DBA_CONSTRAINTS
------------------------------------------------------------

SELECT 
c.constraint_type                    as Type,
substr(x.index_name, 1, 40)          as INDX_NAME,
substr(c.constraint_name, 1, 40)     as CONSTRAINT_NAME,
substr(x.tablespace_name, 1, 40)     as TABLESPACE
FROM DBA_CONSTRAINTS c, DBA_INDEXES x
WHERE
c.constraint_name=x.index_name AND
c.constraint_name='UN_DEMO1';


SELECT 
c.constraint_type                   as Type,
substr(x.index_name, 1, 40)         as INDX_NAME,
substr(c.constraint_name, 1, 40)    as CONSTRAINT_NAME,
substr(c.table_name, 1, 40)         as TABLE_NAME,
substr(c.owner, 1, 10)              as OWNER
FROM DBA_CONSTRAINTS c, DBA_INDEXES x
WHERE
c.constraint_name=x.index_name AND
c.owner='JOOPLOC';


10.5 opsporen tablespace van een constraint of constraint owner:
---------------------------------------------------------------

SELECT 
substr(s.segment_name, 1, 40)       as Segmentname,
substr(c.constraint_name, 1, 40)    as Constraintname,
substr(s.tablespace_name, 1, 40)    as Tablespace,
substr(s.segment_type, 1, 10)       as Type
FROM DBA_SEGMENTS s, DBA_CONSTRAINTS c
WHERE
s.segment_name=c.constraint_name
AND
c.owner='PROJECTS';


10.6 Ophalen index create statements:
------------------------------------

DBA_INDEXES
DBA_IND_COLUMNS


SELECT 
substr(i.index_name, 1, 40)       as INDEX_NAME,
substr(i.index_type, 1, 15)       as INDEX_TYPE,
substr(i.table_name, 1, 40)       as TABLE_NAME,
substr(c.index_owner, 1, 10)      as INDEX_OWNER, 
substr(c.column_name, 1, 40)      as COLUMN_NAME, 
c.column_position                 as POSITION
FROM DBA_INDEXES i, DBA_IND_COLUMNS c
WHERE i.index_name=c.index_name AND i.owner='SALES';


10.7 Aan en uitzetten van constraints:
-------------------------------------

-- aanzetten:

   alter table tablename enable constraint constraint_name 

-- uitzetten:

   alter table tablename disable constraint constraint_name 

-- voorbeeld:

   ALTER TABLE EMPLOYEE DISABLE CONSTRAINT FK_DEPNO;
   ALTER TABLE EMPLOYEE ENABLE CONSTRAINT FK_DEPNO;

   maar ook kan:

   ALTER TABLE DEMO 
   ENABLE PRIMARY KEY;

-- Alle FK constraints van een schema in een keer uitzetten:

   SELECT 'ALTER TABLE MIS_OWNER.'||table_name||' disable constraint '||constraint_name||';'
   FROM DBA_CONSTRAINTS
   WHERE owner='MIS_OWNER' AND constraint_type='R'
   AND TABLE_NAME LIKE 'MKM%';


   SELECT 'ALTER TABLE MIS_OWNER.'||table_name||' enable constraint '||constraint_name||';'
   FROM DBA_CONSTRAINTS
   WHERE owner='MIS_OWNER' AND constraint_type='R'
   AND TABLE_NAME LIKE 'MKM%';


10.8 Constraint aanmaken en initieel uit:
----------------------------------------

Dit kan handig zijn bij bijvoorbeeld het laden van een
table waarbij mogelijk dubbele waarden voorkomen

ALTER TABLE CUSTOMERS
ADD CONSTRAINT PK_CUST PRIMARY KEY (custid) DISABLE;

Als nu blijkt dat bij het aanzetten van de constraint, er dubbele records voorkomen,
kunnen we deze dubbele records plaatsen in de EXCEPTIONS table:

1. aanmaken EXCEPTIONS table:

@ORACLE_HOME\rdbms\admin\utlexcpt.sql

2. Constraint aaNzetten:

ALTER TABLE CUSTOMERS
ENABLE PRIMARY KEY exceptions INTO EXCEPTIONS;

Nu bevat de EXCEPTIONS table de dubbele rijen.

3. Welke dubbele rijen:

SELECT c.custid, c.name
FROM CUSTOMERS c, EXCEPTIONS s
WHERE c.rowid=s.row_id;


10.9 Gebruik PK FK constraints:
------------------------------

10.9.1: Voorbeeld normaal gebruik met DRI:

create table customers
(
custid number not null,
custname varchar(10),
CONSTRAINT pk_cust PRIMARY KEY (custid) 
);


create table contacts
( 
contactid number not null,
custid number,
contactname varchar(10),
CONSTRAINT pk_contactid PRIMARY KEY (contactid),
CONSTRAINT fk_cust FOREIGN KEY (custid) REFERENCES customers(custid) 
);

   Hierbij kun je dus niet zondermeer een row met een bepaald custid
   uit customers verwijderen, indien er een row in contacts bestaat met hetzelfde custid.


10.9.2: Voorbeeld met ON DELETE CASCADE:


create table contacts
(
contactid number not null,
custid number,
contactname varchar(10),
CONSTRAINT pk_contactid PRIMARY KEY (contactid),
CONSTRAINT fk_cust FOREIGN KEY (custid) REFERENCES customers(custid) ON DELETE CASCADE 
);

Ook de clausule "ON DELETE SET NULL" kan gebruikt worden.

Nu is het wel mogelijk om in customers een row te verwijderen, terwijl
in contacts een overeenkomende custid bestaat. De row in contacts
wordt dan namelijk ook verwijdert.


10.10 Procedures voor insert, delete:
------------------------------------

Als voorbeeld op table customers:

CREATE OR REPLACE PROCEDURE newcustomer (custid NUMBER, custname VARCHAR) 
IS
BEGIN
INSERT INTO customers values (custid,custname);
commit;
END;
/


CREATE OR REPLACE PROCEDURE delcustomer (cust NUMBER) 
IS
BEGIN
delete from customers where custid=cust;
commit;
END;
/


10.11 User datadictonary views:
-----------------------------

We hebben al gezien dat we voor constraint informatie voornamelijk de onderstaande views raadplegen:

DBA_TABLES
DBA_INDEXES, 
DBA_CONSTRAINTS, 
DBA_IND_COLUMNS, 
DBA_SEGMENTS


Deze zijn echter voor de DBA.

Gewone users kunnen informatie opvragen uit USER_ en ALL_ views.

USER_ :  in the schema van de user
ALL_  :  waar de user bij kan

USER_TABLES,       ALL_TABLES
USER_INDEXES,      ALL_INDEXES
USER_CONSTRAINTS,  ALL_CONSTRAINTS
USER_VIEWS,        ALL_VIEWS
USER_SEQUENCES,    ALL_SEQUENCES
USER_CONS_COLUMNS, ALL_CONS_COLUMNS
USER_TAB_COLUMNS,  ALL_TAB_COLUMNS
USER_SOURCE,       ALL_SOURCE

cat
tab
col
dict


10.12 Create en drop index examples:
-----------------------------------

CREATE UNIQUE INDEX HEATCUST0 ON HEATCUST(CUSTTYPE) 
  TABLESPACE INDEX_SMALL PCTFREE 10  
  STORAGE(INITIAL 163840 NEXT 163840 PCTINCREASE 0 );

DROP INDEX indexname


10.13 Check the height of indexes:
---------------------------------

Is an index rebuild neccessary ?

SELECT index_name, owner, blevel, 
        decode(blevel,0,'OK BLEVEL',1,'OK BLEVEL', 
        2,'OK BLEVEL',3,'OK BLEVEL',4,'OK BLEVEL','BLEVEL HIGH') OK 
FROM dba_indexes 
WHERE owner='SALES'
and blevel > 3;  

10.14 Make indexes unusable (before a large dataload):
-----------------------------------------------------

-- Make Indexes unusable
alter index HEAT_CUSTOMER_DISCON_DATE   unusable;
alter index HEAT_CUSTOMER_EMAIL_ADDRESS unusable;
alter index HEAT_CUSTOMER_POSTAL_CODE   unusable;

-- Enable Indexes again
alter index HEAT_CUSTOMER_DISCON_DATE   rebuild;
alter index HEAT_CUSTOMER_EMAIL_ADDRESS rebuild;
alter index HEAT_CUSTOMER_POSTAL_CODE   rebuild;


================================
11. DBMS_JOB and scheduled Jobs:
================================

Used in Oracle 9i and lower versions.


11.1 SNP background process:
----------------------------

Scheduled jobs zijn mogelijk wanneer het SNP background process
geactiveerd is. Dit kan via de init.ora:


JOB_QUEUE_PROCESSES=1    aantal SNP processes (SNP0, SNP1), max 36 t.b.v. replication en jobqueue's
JOB_QUEUE_INTERVAL=60    check interval


11.2 DBMS_JOB package:
----------------------

DBMS_JOB.SUBMIT()
DBMS_JOB.REMOVE()
DBMS_JOB.CHANGE()
DBMS_JOB.WHAT()
DBMS_JOB.NEXT_DATE()
DBMS_JOB.INTERVAL()
DBMS_JOB.RUN()


11.2.1 DBMS_JOB.SUBMIT()
-----------------------

There are actually two versions SUBMIT() and ISUBMIT()

PROCEDURE DBMS_JOB.SUBMIT
   (job OUT BINARY_INTEGER,
    what IN VARCHAR2,
    next_date IN DATE DEFAULT SYSDATE,
    interval IN VARCHAR2 DEFAULT 'NULL',
    no_parse IN BOOLEAN DEFAULT FALSE);

PROCEDURE DBMS_JOB.ISUBMIT
   (job IN BINARY_INTEGER,
   what IN VARCHAR2,
   next_date in DATE DEFAULT SYSDATE
   interval IN VARCHAR2 DEFAULT 'NULL',
   no_parse in BOOLEAN DEFAULT FALSE);


The difference between ISUBMIT and SUBMIT is that ISUBMIT specifies a job number, 
whereas SUBMIT returns a job number generated by the DBMS_JOB package
 

  Look for submitted jobs:
  ------------------------

select job, last_date, next_date, interval, substr(what, 1, 50)
from dba_jobs;


  Submit a job:
  -------------- 

The jobnumber (if you use SUBMIT() ) will be derived from the sequence SYS.JOBSEQ

Suppose you have the following procedure:

create or replace procedure test1 is
begin
  dbms_output.put_line('Hallo grapjas.');
end;
/

  Example 1:
  ----------

variable jobno number;
begin
  DBMS_JOB.SUBMIT(:jobno, 'test1;', Sysdate, 'Sysdate+1');
  commit;
end;
/

DECLARE
   jobno   NUMBER;
BEGIN
   DBMS_JOB.SUBMIT
      (job  => jobno
      ,what => 'test1;'
      ,next_date => SYSDATE
      ,interval  => 'SYSDATE+1/24');
   COMMIT;
END;
/

So suppose you submit the above job at 08.15h. Then the next, and first time, 
that the job will run is at 09.15h.


  Example 2:
  ----------

variable jobno number;
begin
  DBMS_JOB.SUBMIT(:jobno, 'test1;', LAST_DAY(SYSDATE+1), 'LAST_DAY(ADD_MONTHS(LAST_DAY(SYSDATE+1),1))');
  commit;
end;
/


  Example 3:
  ----------

VARIABLE jobno NUMBER 
BEGIN
   DBMS_JOB.SUBMIT(:jobno, 
      'DBMS_DDL.ANALYZE_OBJECT(''TABLE'',
      ''CHARLIE'', ''X1'', 
      ''ESTIMATE'', NULL, 50);', 
      SYSDATE, 'SYSDATE + 1');
   COMMIT;
END;
/

PRINT jobno

JOBNO
----------
14144


  Example 4: this job is scheduled every hour
  -------------------------------------------

DECLARE
   jobno   NUMBER;
BEGIN
   DBMS_JOB.SUBMIT
      (job  => jobno
      ,what => 'begin space_logger; end;'
      ,next_date => SYSDATE
      ,interval  => 'SYSDATE+1/24');
   COMMIT;
END;
/


  Example 5: Examples of intervals
  --------------------------------

'SYSDATE + 7'                                                   :exactly seven days from the last execution  
'SYSDATE + 1/48'                                                :every half hour  
'NEXT_DAY(TRUNC(SYSDATE), ''MONDAY'') + 15/24'                  :every Monday at 3PM  
'NEXT_DAY(ADD_MONTHS(TRUNC(SYSDATE, ''Q''), 3), ''THURSDAY'')'  :first Thursday of each quarter  
'TRUNC(SYSDATE + 1)'                                            :Every day at 12:00 midnight
'TRUNC(SYSDATE + 1) + 8/24'                                     :Every day at 8:00 a.m.
'NEXT_DAY(TRUNC(SYSDATE ), "TUESDAY" ) + 12/24'                 :Every Tuesday at 12:00 noon
'TRUNC(LAST_DAY(SYSDATE ) + 1)'                                 :First day of the month at midnight
'TRUNC(ADD_MONTHS(SYSDATE + 2/24, 3 ), 'Q' ) - 1/24'            :Last day of the quarter at 11:00 p.m.
 NEXT_DAY(SYSDATE, "FRIDAY") ) ) + 9/24'                        :Every Monday, Wednesday, and Friday at 9:00 a.m.
 

---------------------------------------------------------------------------------
  Example 6:
  ----------

You have this testprocedure

create or replace procedure test1 as
id_next number;
begin
  select max(id)  into id_next from iftest;
  insert into iftest
  (id)
  values
  (id_next+1);
commit;
end;
/

Suppose on 16 juli at 9:26h you do:

variable jobno number;
begin
  DBMS_JOB.SUBMIT(:jobno, 'test1;', LAST_DAY(SYSDATE+1), 'LAST_DAY(ADD_MONTHS(LAST_DAY(SYSDATE+1),1))');
  commit;
end;
/

select job, to_char(this_date,'DD-MM-YYYY;HH24:MI'), to_char(next_date, 'DD-MM-YYYY;HH24:MI')
from dba_jobs;

       JOB TO_CHAR(THIS_DAT TO_CHAR(NEXT_DAT
---------- ---------------- ----------------
        25                  31-07-2004;09:26
 

Suppose on 16 juli at 9:38h you do:

variable jobno number;
begin
  DBMS_JOB.SUBMIT(:jobno, 'test1;', LAST_DAY(SYSDATE)+1, 'LAST_DAY(ADD_MONTHS(LAST_DAY(SYSDATE+1),1))');
  commit;
end;
/

       JOB TO_CHAR(THIS_DAT TO_CHAR(NEXT_DAT
---------- ---------------- ----------------
        25                  31-07-2004;09:26
        26                  01-08-2004;09:38

Suppose on 16 juli at 9:41h you do:

variable jobno number;
begin
  DBMS_JOB.SUBMIT(:jobno, 'test1;', SYSDATE, 'LAST_DAY(ADD_MONTHS(LAST_DAY(SYSDATE+1),1))');
  commit;
end;
/

       JOB TO_CHAR(THIS_DAT TO_CHAR(NEXT_DAT
---------- ---------------- ----------------
        27                  31-08-2004;09:41
        25                  31-07-2004;09:26
        26                  01-08-2004;09:39


Suppose on 16 juli at 9:46h you do:

variable jobno number;
begin
  DBMS_JOB.SUBMIT(:jobno, 'test1;', SYSDATE, 'TRUNC(LAST_DAY(SYSDATE + 1/24 ) )');
  commit;
end;
/

      JOB TO_CHAR(THIS_DAT TO_CHAR(NEXT_DAT
--------- ---------------- ----------------
       27                  31-08-2004;09:41
       28                  31-07-2004;00:00
       25                  31-07-2004;09:26
       29                  31-07-2004;00:00


--------------------------------------------------------------------------------------
variable jobno number;
begin
  DBMS_JOB.SUBMIT(:jobno, 'test1;', null, 'TRUNC(LAST_DAY(SYSDATE ) + 1)' );
  commit;
end;
/

In the job definition, use two single quotation marks around strings. 
Always include a semicolon at the end of the job definition.


11.2.2 DBMS_JOB.REMOVE()
------------------------

Removing a Job FROM the Job Queue
To remove a job FROM the job queue, use the REMOVE procedure in the DBMS_JOB package.

The following statements remove job number 14144 FROM the job queue:

BEGIN
DBMS_JOB.REMOVE(14144);
END;
/

11.2.3 DBMS_JOB.CHANGE()
------------------------

In this example, job number 14144 is altered to execute every three days:

BEGIN
DBMS_JOB.CHANGE(1, NULL, NULL, 'SYSDATE + 3');
END;
/

If you specify NULL for WHAT, NEXT_DATE, or INTERVAL when you call the 
procedure DBMS_JOB.CHANGE, the current value remains unchanged.


11.2.4 DBMS_JOB.WHAT()
----------------------

You can alter the definition of a job by calling the DBMS_JOB.WHAT procedure.
The following example changes the definition for job number 14144:

BEGIN
DBMS_JOB.WHAT(14144, 
      'DBMS_DDL.ANALYZE_OBJECT(''TABLE'',
      ''HR'', ''DEPARTMENTS'', 
      ''ESTIMATE'', NULL, 50);');
END;
/


11.2.5 DBMS_JOB.NEXT_DATE()
---------------------------

You can alter the next execution time for a job by calling the 
DBMS_JOB.NEXT_DATE procedure, as shown in the following example:

BEGIN
DBMS_JOB.NEXT_DATE(14144, SYSDATE + 4);
END;
/

11.2.6 DBMS_JOB.INTERVAL():
---------------------------

The following example illustrates changing the execution interval 
for a job by calling the DBMS_JOB.INTERVAL procedure:

BEGIN
DBMS_JOB.INTERVAL(14144, 'NULL');
END;
/

execute dbms_job.interval(<job number>,'SYSDATE+(1/48)'); 


In this case, the job will not run again after it successfully executes 
and it will be deleted FROM the job queue

11.2.7 DBMS_JOB.BROKEN():
-------------------------

A job is labeled as either broken or not broken. Oracle does not attempt to run broken jobs.

Example:

BEGIN
DBMS_JOB.BROKEN(10, TRUE);
END;
/

Example:

The following example marks job 14144 as not broken and sets its 
next execution date to the following Monday:

BEGIN
DBMS_JOB.BROKEN(14144, FALSE, NEXT_DAY(SYSDATE, 'MONDAY'));
END;
/

Example:

exec DBMS_JOB.BROKEN( V_JOB_ID, true);

Example:

select JOB into V_JOB_ID from DBA_JOBS
where  WHAT like '%SONERA%';

DBMS_SNAPSHOT.REFRESH( 'SONERA', 'C');

DBMS_JOB.BROKEN( V_JOB_ID, false);

fix broken jobs:
----------------

/* Filename on companion disk: job5.sql */*
CREATE OR REPLACE PROCEDURE job_fixer
AS
   /*
   || calls DBMS_JOB.BROKEN to try and set
   || any broken jobs to unbroken
   */
   
   /* cursor selects user's broken jobs */
   CURSOR broken_jobs_cur
   IS
   SELECT job
     FROM user_jobs
    WHERE broken = 'Y';
    
BEGIN
   FOR job_rec IN broken_jobs_cur
   LOOP
      DBMS_JOB.BROKEN(job_rec.job,FALSE);
   END LOOP;
END job_fixer;


11.2.8 DBMS_JOB.RUN():
----------------------

BEGIN
DBMS_JOB.RUN(14144);
END;
/


11.3 DBMS_SCHEDULER:
--------------------

Used in Oracle 10g.

BEGIN

DBMS_SCHEDULER.create_job (
job_name => 'test_self_contained_job',
job_type => 'PLSQL_BLOCK',
job_action => 'BEGIN DBMS_STATS.gather_schema_stats(''JOHN''); END;',
start_date => SYSTIMESTAMP,
repeat_interval => 'freq=hourly; byminute=0',
end_date => NULL,
enabled => TRUE,
comments => 'Job created using the CREATE JOB procedure.');
End;
/

BEGIN
DBMS_SCHEDULER.run_job (job_name => 'TEST_PROGRAM_SCHEDULE_JOB',
use_current_session => FALSE);
END;
/

BEGIN
DBMS_SCHEDULER.stop_job (job_name => 'TEST_PROGRAM_SCHEDULE_JOB');
END;
/

Jobs can be deleted using the DROP_JOB procedure:

BEGIN
DBMS_SCHEDULER.drop_job (job_name => 'TEST_PROGRAM_SCHEDULE_JOB');
DBMS_SCHEDULER.drop_job (job_name => 'test_self_contained_job');
END;
/ 


==================
12. Net8 / SQLNet:
==================

In bijvoorbeeld sql*plus vult men in: 

-----------------
Username:     system 
Password:     manager
Host String:  XXX
-----------------

NET8 bij de client kijkt in TNSNAMES.ORAnaar de eerste entry

XXX= (description.. protocol..host...port.. SERVICE_NAME=Y)

XXX is eigenlijk een alias en is dus willekeurig hoewel
het uiteraard aansluit bij de instance name of database name waarnaar
je wilt connecten.
Maar het zou dus zelfs pipo mogen zijn.

  Wordt XXX niet gevonden, dan meld de client:
  ORA-12154 TNS: could not resolve SERVICE NAME

Vervolgens wordt door NET8 via de connect descriptor Y
contact gemaakt met de listener op de Server die luistert naar Y

  Is Y niet wat de listener verwacht, dan meldt de listener aan de client:
  TNS: listener could not resolve SERVICE_NAME in connect descriptor


12.1 sqlnet.ora voorbeeld:
--------------------------

SQLNET.AUTHENTICATION_SERVICES= (NTS)

NAMES.DIRECTORY_PATH= (TNSNAMES)


12.2 tnsnames.ora voorbeelden:
------------------------------

voorbeeld 1.

  DB1=
     (DESCRIPTION=
        (ADDRESS_LIST=
           (ADDRESS=(PROTOCOL=TCP)(HOST=STARBOSS)(PORT=1521)
         )
         (CONNECT_DATA=
            (SERVICE_NAME=DB1.world)
         )
      )


voorbeeld 2.


  DB1.world=
     (DESCRIPTION=
        (ADDRESS_LIST=
           (ADDRESS=(COMMUNITY=tcp.world)(PROTOCOL=TCP)(HOST=STARBOSS)(PORT=1521)
         )
         (CONNECT_DATA=(SID=DB1)
         )
      )

  DB2.world=
     (... )

  DB3.world=
     (... )

  etc..

voorbeeld 3.

  RCAT =
    (DESCRIPTION =
      (ADDRESS_LIST =
        (ADDRESS = (PROTOCOL = TCP)(HOST = w2ktest)(PORT = 1521))
      )
      (CONNECT_DATA =
        (SERVICE_NAME = rcat.antapex)
      )
    )


12.3 listener.ora voorbeelden:
------------------------------

Example 1:
----------

LISTENER=
   (DESCRIPTION=
      (ADDRESS=(PROTOCOL=TCP)(HOST=STARBOSS)(PORT=1521))
   )
   SID_LIST_LISTENER=
      (SID_LIST=
          (SID_DESC=
              (GLOBAL_DBNAME=DB1.world)
              (ORACLE_HOME=D:\oracle8i)
              (SID_NAME=DB1)
           )
       )


Example 2:
----------

############## WPRD #####################################################
LOG_DIRECTORY_WPRD		= /opt/oracle/admin/WPRD/network/log
LOG_FILE_WPRD			= WPRD.log
TRACE_LEVEL_WPRD		= OFF #ADMIN
TRACE_DIRECTORY_WPRD		= /opt/oracle/admin/WPRD/network/trace
TRACE_FILE_WPRD			= WPRD.trc

WPRD =
  (DESCRIPTION_LIST =
    (DESCRIPTION =
      (ADDRESS_LIST=(ADDRESS=(PROTOCOL=TCP)(HOST=blnl01)(PORT=1521)))))

SID_LIST_WPRD =
  (SID_LIST =
    (SID_DESC =
      (GLOBAL_DBNAME = WPRD)
      (ORACLE_HOME = /opt/oracle/product/8.1.6)
      (SID_NAME = WPRD)))


############## WTST #####################################################
LOG_DIRECTORY_WTST		= /opt/oracle/admin/WTST/network/log
LOG_FILE_WTST			= WTST.log
TRACE_LEVEL_WTST		= OFF #ADMIN
TRACE_DIRECTORY_WTST		= /opt/oracle/admin/WTST/network/trace
TRACE_FILE_WTST			= WTST.trc

WTST =
  (DESCRIPTION_LIST =
    (DESCRIPTION =
      (ADDRESS_LIST=(ADDRESS=(PROTOCOL=TCP)(HOST=blnl01)(PORT=1522)))))

SID_LIST_WTST =
  (SID_LIST =
    (SID_DESC =
      (GLOBAL_DBNAME = WTST)
      (ORACLE_HOME = /opt/oracle/product/8.1.6)
      (SID_NAME = WTST)))


Example 3:
----------

# LISTENER.ORA Network Configuration File: D:\oracle\ora901\NETWORK\ADMIN\listener.ora
# Generated by Oracle configuration tools.

LISTENER =
  (DESCRIPTION_LIST =
    (DESCRIPTION =
      (ADDRESS = (PROTOCOL = IPC)(KEY = EXTPROC0))
    )
    (DESCRIPTION =
      (ADDRESS = (PROTOCOL = TCP)(HOST = missrv)(PORT = 1521))
    )
  )

SID_LIST_LISTENER =
  (SID_LIST =
    (SID_DESC =
      (SID_NAME = PLSExtProc)
      (ORACLE_HOME = D:\oracle\ora901)
      (PROGRAM = extproc)
    )
    (SID_DESC =
      (GLOBAL_DBNAME = o901)
      (ORACLE_HOME = D:\oracle\ora901)
      (SID_NAME = o901)
    )
    (SID_DESC =
      (SID_NAME = MAST)
      (ORACLE_HOME = D:\oracle\ora901)
      (PROGRAM = hsodbc)
    )
    (SID_DESC =
      (SID_NAME = NATOPS)
      (ORACLE_HOME = D:\oracle\ora901)
      (PROGRAM = hsodbc)
    )
    (SID_DESC =
      (SID_NAME = VRF)
      (ORACLE_HOME = D:\oracle\ora901)
      (PROGRAM = hsodbc)
    )
    (SID_DESC =
      (SID_NAME = DRILLS)
      (ORACLE_HOME = D:\oracle\ora901)
      (PROGRAM = hsodbc)
    )
    (SID_DESC =
      (SID_NAME = DDS)
      (ORACLE_HOME = D:\oracle\ora901)
      (PROGRAM = hsodbc)
    )
    (SID_DESC =
      (SID_NAME = IVP)
      (ORACLE_HOME = D:\oracle\ora901)
      (PROGRAM = hsodbc)
    (SID_DESC =
      (SID_NAME = ALBERT)
      (ORACLE_HOME = D:\oracle\ora901)
      (PROGRAM = hsodbc)
    )
  )


12.4: CONNECT TIME FAILOVER:
----------------------------


The connect-time failover feature allows clients to connect to another listener if the initial connection
to the first listener fails. Multiple listener locations are specified in the clients tnsnames.ora file.
If a connection attempt to the first listener fails, a connection request to the next listener
in the list is attempted. This feature increases the availablity of the Oracle service
should a listener location be unavailable.
Here is an example of what a tnsnames.ora file looks like with connect-time failover enabled:

ORCL=
(DESCRIPTION=
  (ADDRESS_LIST=
    (ADDRESS=(PROTOCOL=TCP)(HOST=DBPROD)(PORT=1521))
    (ADDRESS=(PROTOCOL=TCP)(HOST=DBFAIL)(PORT=1521))
  )
  (CONNECT_DATA=(SERVICE_NAME=PROD)(SERVER=DEDICATED)
  )
)


12.5: CLIENT LOAD BALANCING:
----------------------------

Client Load Balancing is a feature that allows clients to randomly select from a list of listeners.
Oracle Net moves through the list of listeners and balances the load of connection requests
accross the available listeners. 
Here is an example of the tnsnames.ora entry that allows for load balancing:


ORCL=
(DESCRIPTION=
  (LOAD_BALANCE=ON)
  (ADDRESS_LIST=
    (ADDRESS=(PROTOCOL=TCP)(HOST=MWEISHAN-DELL)(PORT=1522))
    (ADDRESS=(PROTOCOL=TCP)(HOST=MWEISHAN-DELL)(PORT=1521))
  )
  (CONNECT_DATA=(SERVICE_NAME=PROD)(SERVER=DEDICATED)
  )
)

Notice the additional parameter of LOAD_BALANCE. This enables load balancing between the
two listener locations specified.


12.6: ORACLE SHARED SERVER:
---------------------------

With the dedicated Server, each server process has a PGA, outside the SGA
When Shared Server is used, the user program area's are in the SGA in the large pool.

With a few init.ora parameters, you can configure Shared Server.


1. DISPATCHERS:

The DISPATCHERS parameter defines the number of dispatchers that should start when the instance is started.
For example, if you want to configure 3 TCP/IP dispatchers and to IPC dispatchers, 
you set the parameters as follows:

DISPATCHERS="(PRO=TCP)(DIS=3)(PRO=IPC)(DIS=2)"

For example, if you have 500 concurrent TCP/IP connections, and you want each dispatcher to manage 
50 concurrent connections, you need 10 dispatchers.
You set your DISPATCHERS parameter as follows:

DISPATCHERS="(PRO=TCP)(DIS=10)"

2. SHARED_SERVER:

The Shared_Servers parameter specifies the minimum number of Shared Servers to start and retain 
when the Oracle instance is started.


View information about dispatchers and shared servers with the following commands and queries:

lsnrctl services

SELECT name, status, messages, idle, busy, bytes, breaks
FROM v$dispatcher;


12.7: Keeping Oracle connections alive through a Firewall:
----------------------------------------------------------

Implementing keep alive packets:
SQLNET.INBOUND_CONNECT_TIMEOUT 


Notes:
=======

Note 1:
-------


Doc ID: 	Note:274130.1	Content Type: 	TEXT/PLAIN	   
Subject: 	SHARED SERVER CONFIGURATION	Creation Date: 	25-MAY-2004	   
Type: 	BULLETIN	Last Revision Date: 	24-JUN-2004	   
Status: 	PUBLISHED		 
PURPOSE 
------- 
 
 
This article discusses about the configuration of shared servers on 9i DB. 
  
SHARED SERVER CONFIGURATION: 
=========================== 
 
 1. Add the parameter shared_servers in the init.ora 
   
    SHARED_SERVERS specifies the number of server processes that you want to  
    create when an instance is started up. If system load decreases,  
    this minimum number of servers is maintained. Therefore, you should take 
    care not to set SHARED_SERVERS too high at system startup.  
       
    
    Parameter type  	Integer       
    Parameter class	Dynamic: ALTER SYSTEM       
     
     
 2. Add the parameter DISPATCHERS in the init.ora 
 
    DISPATCHERS configures dispatcher processes in the shared server 
    architecture. 
  
    USAGE: 
    ----- 
    DISPATCHERS = "(PROTOCOL=TCP)(DISPATCHERS=3)" 
     
 3. Save the init.ora file. 
  
 4. Change the connect string in tnsnames.ora from 
  
     ORACLE.IDC.ORACLE.COM = 
       (DESCRIPTION = 
         (ADDRESS_LIST = 
           (ADDRESS = (PROTOCOL = TCP)(HOST = xyzac)(PORT = 1521)) 
         ) 
         (CONNECT_DATA = 
           (SERVER = DEDICATED) 
           (SERVICE_NAME = oracle) 
         ) 
       ) 
        
        to 
         
         
      ORACLE.IDC.ORACLE.COM = 
       (DESCRIPTION = 
         (ADDRESS_LIST = 
           (ADDRESS = (PROTOCOL = TCP)(HOST = xyzac)(PORT = 1521)) 
         ) 
         (CONNECT_DATA = 
           (SERVER = SHARED) 
           (SERVICE_NAME = Oracle) 
         ) 
       )   
        
       Change SERVER=SHARED. 
        
  5. Shutdown and startup the database. 
   
  6. Make a new connection to database other than SYSDBA. 
      
     (NOTE: SYSDBA will always acquire dedicated connection by default.) 
      
  7. Check whether the connection is done through server server. 
   
     > Select server from v$session. 
      
	SERVER 
	--------- 
	DEDICATED 
	DEDICATED 
	DEDICATED 
	SHARED 
	DEDICATED 
	 
	 
   NOTE: 
   ==== 
     The following parameters are optional (if not specified, Oracle selects 
     defaults): 
       
     MAX_DISPATCHERS: 
     =============== 
      Specifies the maximum number of dispatcher processes that can run  
      simultaneously. 
       
     SHARED_SERVERS: 
     ============== 
      Specifies the number of shared server processes created when an instance 
      is started up. 
       
     MAX_SHARED_SERVERS: 
     ================== 
      Specifies the maximum number of shared server processes that can run  
      simultaneously. 
       
     CIRCUITS: 
     ======== 
      Specifies the total number of virtual circuits that are available for  
      inbound and outbound network sessions. 
       
     SHARED_SERVER_SESSIONS: 
     ====================== 
      Specifies the total number of shared server user sessions to allow. 
      Setting this parameter enables you to reserve user sessions for  
      dedicated servers. 
      
      
     Other parameters affected by shared server that may require adjustment: 
       
     LARGE_POOL_SIZE: 
     =============== 
      Specifies the size in bytes of the large pool allocation heap. Shared  
      server may force the default value to be set too high, causing  
      performance problems or problems starting the database. 
       
     SESSIONS: 
     ======== 
      Specifies the maximum number of sessions that can be created in the  
      system. May need to be adjusted for shared server. 


12.7 password for the listener:
-------------------------------

Note 1:

LSNRCTL> set password <password> where <password> is the password you want to use. 
To change a password, use "Change_Password" You can also designate a password when you configure the listener 
with the Net8 Assistant. These passwords are stored in the listener.ora file and although they will not show 
in the Net8 Assistant, they are readable in the listener.ora file. 

Note 2:

The password can be set either by specifying it through the command CHANGE_PASSWORD, or through a parameter 
in the listener.ora file. We saw how to do that through the CHANGE_PASSWORD command earlier. 
If the password is changed this way, it should not be specified in the listener.ora file. The password is not 
displayed anywhere. When supplying the password in the listener control utility, you must supply it at the 
Password: prompt as shown above. You cannot specify the password in one line as shown below. 

LSNRCTL> set password t0p53cr3t
LSNRCTL> stop
Connecting to (DESCRIPTION=(ADDRESS=(PROTOCOL=IPC)(KEY=EXTPROC)))
TNS-01169: The listener has not recognized the password
LSNRCTL>

Note 3:

 more correct method would be to password protect the listener functions.

See the net8 admin guide for info but in short -- you can:

LSNRCTL> change_password
Old password:    <just hit enter if you don't have one yet>
New password: 
Reenter new password: 
Connecting to (DESCRIPTION=(ADDRESS=(PROTOCOL=TCP)(HOST=slackdog)(PORT=1521)))
Password changed for LISTENER
The command completed successfully

LSNRCTL> set password
Password: 
The command completed successfully

LSNRCTL> save_config
Connecting to (DESCRIPTION=(ADDRESS=(PROTOCOL=TCP)(HOST=slackdog)(PORT=1521)))
Saved LISTENER configuration parameters.
Listener Parameter File   /d01/home/oracle8i/network/admin/listener.ora
Old Parameter File   /d01/home/oracle8i/network/admin/listener.bak
The command completed successfully
LSNRCTL> 

Now, you need to use a password to do various operations (such as STOP) but not 
others (such as STATUS)


=============================================
13. Datadictionary queries Rollback segments:
=============================================

13.1 naam, plaats en status van rollback segementen:
----------------------------------------------------

SELECT substr(segment_name, 1, 10), substr(tablespace_name, 1, 20), status,
       INITIAL_EXTENT, NEXT_EXTENT, MIN_EXTENTS, MAX_EXTENTS, PCT_INCREASE   
FROM DBA_ROLLBACK_SEGS;

13.2 indruk van aantal active transactions per rollback segment:
----------------------------------------------------------------

aantal actieve transacties: V$ROLLSTAT
naam rollback segment:      V$ROLLNAME

SELECT n.name, s.xacts
FROM V$ROLLNAME n, V$ROLLSTAT s
WHERE n.usn=s.usn;               
(usn=undo segment number)


13.3 grootte, naam, extents, bytes van de rollback segmenten:
-------------------------------------------------------------

SELECT substr(segment_name, 1, 15), bytes/1024/1024 Size_in_MB, blocks, 
extents, substr(tablespace_name, 1, 15)
FROM DBA_SEGMENTS WHERE segment_type='ROLLBACK';

SELECT n.name, s.extents, s.rssize
FROM V$ROLLNAME n, V$ROLLSTAT s
WHERE n.usn=s.usn;


Create Tablespace RBS
datafile '/db1/oradata/oem/rbs.dbf' SIZE 200M AUTOEXTEND ON NEXT 20M MAXSIZE 500M
LOGGING
DEFAULT STORAGE ( 
                  INITIAL 5M
                  NEXT 5M
                  MINEXTENTS 2
                  MAXEXTENTS 100
                  PCTINCREASE 0
                )
ONLINE
PERMANENT;


13.4 De optimal parameter:
--------------------------

SELECT n.name, s.optsize
FROM V$ROLLNAME n, V$ROLLSTAT s
WHERE n.usn=s.usn;


13.5 writes to rollback segementen:
-----------------------------------

Doe de query begin meting, en bij einde meting en bekijk het verschil

SELECT n.name, s.writes FROM V$ROLLNAME n, V$ROLLSTAT s
WHERE n.usn=s.usn


13.6 Wie en welke processes gebruiken de rollback segs:
-------------------------------------------------------

Query1: Query op v$lock, v$session, v$rollname

column rr heading 'RB Segment' format a15
column us heading 'Username'   format a10
column os heading 'OS user'    format a10
column te heading 'Terminal'   format a15

SELECT R.name rr, nvl(S.username, 'no transaction') us,
       S.Osuser os,
       S.Terminal te
FROM   V$LOCK L, V$SESSION S, V$ROLLNAME R
WHERE  L.Sid=S.Sid(+)
AND    trunc(L.Id1/65536)=R.usn
AND    L.Type='TX'
AND    L.Lmode=6
ORDER BY R.name
/


Query 2:

SELECT r.name "RBS", s.sid, s.serial#, s.username "USER", t.status,
       t.cr_get, t.phy_io, t.used_ublk, t.noundo,
       substr(s.program, 1, 78) "COMMAND"
FROM   sys.v_$session s, sys.v_$transaction t, sys.v_$rollname r
WHERE  t.addr = s.taddr
  AND  t.xidusn = r.usn
ORDER  BY t.cr_get, t.phy_io
/


13.7 Bepaling minimum aantal rollbacksegmenten:
------------------------------------------------

Bepaal in init.ora via "show parameter transactions"

transactions= a	                      (max no of transactions, stel 100)
transactions_per_rollback_segment= b  (allowed no of concurrent tr/rbs, stel 10)

minimum=a/b   (100/10=10)


13.8 Bepaling minimale grootte rollback segmenten:
--------------------------------------------------

lts=largest transaction size (normal production, niet af en toe batch loads)
min_size=minimum size van rollback segment
min_size= lts * 100 / (100 - (40 {%free} + 15 {iaiu} +5 {header}
min_size=lts * 1.67

Stel lts=700K, dan is de startwaarde rollbacksegment=1400K


=========================================================
14. Data dictionary queries m.b.t. security, permissions:
=========================================================


14.1 user information in datadictionary
---------------------------------------

SELECT username, user_id, password
FROM DBA_USERS
WHERE username='Kees';

14.2 default tablespace, account_status of users
------------------------------------------------

SELECT username, default_tablespace, account_status
FROM DBA_USERS;

14.3 tablespace quotas of users
-------------------------------

SELECT tablespace_name, bytes, max_bytes, blocks, max_blocks
FROM DBA_TS_QUOTAS
WHERE username='CHARLIE';

14.4 Systeem rechten van een user opvragen: DBA_SYS_PRIVS
---------------------------------------------------------

SELECT substr(grantee, 1, 15), substr(privilege, 1, 40),
admin_option 
FROM DBA_SYS_PRIVS WHERE grantee='CHARLIE';

SELECT * FROM dba_sys_privs
WHERE grantee='Kees';


14.5 Invalid objects in DBA_OBJECTS:
------------------------------------

SELECT substr(owner, 1, 10), substr(object_name, 1, 40),
substr(object_type, 1, 40), status
FROM DBA_OBJECTS
WHERE status='INVALID';


14.6 session information
------------------------

SELECT sid, serial#, substr(username, 1, 10), substr(osuser, 1, 10), substr(schemaname, 1, 10),
substr(program, 1, 15), substr(module, 1, 15), status, logon_time,
substr(terminal, 1, 15), substr(machine, 1, 15)
FROM V$SESSION;


14.7 kill a session
-------------------

alter system kill session 'SID, SERIAL#'


========================
15. INIT.ORA parameters:
========================


15.1 init.ora parameters en ARCHIVE MODE:
----------------------------------------

LOG_ARCHIVE_DEST=/oracle/admin/cc1/arch
LOG_ARCHIVE_START=TRUE
LOG_ARCHIVE_FORMAT=archcc1_%s.log

LOG_ARCHIVE_DEST_1=
LOG_ARCHIVE_DEST_2=
LOG_ARCHIVE_MAX_PROCESSES=2


15.2 init.ora en perfoRMANce en SGA:
------------------------------------

SORT_AREA_SIZE                	= 65536         (per PGA, max sort area)
SORT_AREA_RETAINED_SIZE       	= 65536         (size after sort)
PROCESSES                     	= 100           (alle processes)
DB_BLOCK_SIZE                 	= 8192
DB_BLOCK_BUFFERS              	= 3400          (DB_CACHE_SIZE in Oracle 9i)
SHARED_POOL_SIZE              	= 52428800
LOG_BUFFER                    	= 26214400 
LARGE_POOL_SIZE                 =
DBWR_IO_SLAVES                                  (DB_WRITER_PROCESSES)
DB_WRITER_PROCESSES             = 2
LGWR_IO_SLAVES=
DB_FILE_MULTIBLOCK_READ_COUNT	=16		(minimize io during table scans,
                                                it specifies max number of blocks in one
                                                io operation during sequential read)
BUFFER_POOL_RECYCLE             =
BUFFER_POOL_KEEP                =
TIMED_STATISTICES		=TRUE           (statistics related to time are collected or not)           
OPTIMIZER_MODE			=RULE, CHOOSE, FIRST_ROWS, ALL_ROWS

PARALLEL_MIN_SERVERS            = 2		(voor Parallel Query, en parallel recovery)
PARALLEL_MAX_SERVERS            = 4

RECOVERY_PARALLELISM            = 2		(set parallel recovery op database niveau)


SHARED_POOL_SIZE: in bytes or K or M
SHARED_POOL_SIZE specifies (in bytes) the size of the shared pool. The shared pool contains shared cursors, stored procedures, 
control structures, and other structures. If you set PARALLEL_AUTOMATIC_TUNING to false, 
Oracle also allocates parallel execution message buffers from the shared pool. Larger values improve perfoRMANce in multi-user systems.
 Smaller values use less memory. 
You can monitor utilization of the shared pool by querying the view V$SGASTAT. 

SHARED_POOL_RESERVED_SIZE:
The parameter was introduced in Oracle 7.1.5 and provides a means of reserving a portion of the shared pool 
for large memory allocations. The reserved area comes out of the shared pool itself. 
From a practical point of view one should set SHARED_POOL_RESERVED_SIZE to about 10% 
of SHARED_POOL_SIZE unless either the shared pool is very large OR SHARED_POOL_RESERVED_MIN_ALLOC 
has been set lower than the default value:


15.3 init.ora en jobs:
----------------------

JOB_QUEUE_PROCESSES=1    aantal SNP processes (SNP0, SNP1), max 36 t.b.v. replication en jobqueue's
JOB_QUEUE_INTERVAL=60    check interval


15.4 instance name, sid:
------------------------

db_name                       	= CC1
global_names                  	= TRUE
instance_name                 	= CC1
db_domain                     	= antapex.net


15.5 overige parameters:
------------------------

OS_AUTHENT_PREFIX             	= ""                  (stANDaard is dat OPS$)
REMOTE_OS_AUTHENTICATION        = TRUE or FALSE       (of een OS authentication via het netwerk kan)
REMOTE_LOGIN_PASSWORDFILEe     	= NONE or EXCLUSIVE


distributed_transactions        =0 or >0  (starts the RECO process)
aq_tm_processes                 =         (advanced queuing, message queues)

mts_servers                     =         (number of shared server processes in multithreaded server)
mts_max_servers                 =

audit_file_dest               	= /dbs01/app/oracle/admin/AMI_PRD/adump
background_dump_dest          	= /dbs01/app/oracle/admin/AMI_PRD/bdump
user_dump_dest                	= /dbs01/app/oracle/admin/AMI_PRD/udump
core_dump_dest                	= /dbs01/app/oracle/admin/AMI_PRD/cdump

resource_limit			=true		(specifies whether resource limits in profiles are in effect)

license_max_sessions            =               (max number of concurrent user sessions)
license_sessions_warning        =               (at this limit, warning in alert log)
license_max_users               =               (maximum number of users that can be created in the database)

                                                 are enforced)
compatible                    	= 8.1.7.0.0
control_files                 	= /dbs04/oradata/AMI_PRD/ctrl/cc1_01.ctl
control_files                 	= /dbs05/oradata/AMI_PRD/ctrl/cc1_02.ctl
control_files                 	= /dbs06/oradata/AMI_PRD/ctrl/cc1_03.ctl

db_files              		= 150            (max number of data files opened)
java_pool_size                	= 0
log_checkpoint_interval       	= 10000 
log_checkpoint_timeout        	= 1800 
max_dump_file_size            	= 10240
max_enabled_roles             	= 40
nls_date_format               	= "DD-MM-YYYY"
nls_language                  	= AMERICAN
nls_territory                 	= AMERICA
o7_dictionary_accessibility  	= TRUE      
open_cursors                  	= 250
optimizer_max_permutations    	= 1000
optimizer_mode                	= CHOOSE
parallel_max_servers          	= 5
pre_page_sga                  	= TRUE
service_names                 	= CC1
utl_file_dir           		= /app01/oradata/cc1/utl_file


All init.ora  parameters:
-------------------------

PARAMETER                       DESCRIPTION
------------------------------  ----------------------------------------
 O7_DICTIONARY_ACCESSIBILITY     Version 7 Dictionary Accessibility
                                  support [TRUE | FALSE]
 
 active_instance_count           Number of active instances in the
                                  cluster database [NUMBER]
 aq_tm_processes                 Number of AQ Time Managers to start [NUMBER]
 archive_lag_target              Maximum number of seconds of redos the
                                  standby could lose [NUMBER]
 asm_diskgroups                  Disk groups to mount automatically [CHAR]
 asm_diskstring                  Disk set locations for discovery  [CHAR]
 asm_power_limit                 Number of processes for disk rebalancing [NUMBER]
 audit_file_dest                 Directory in which auditing files are to reside ['Path']
 audit_sys_operations            Enable sys auditing [TRUE|FALSE]
 audit_trail                     Enable system auditing [NONE|DB|DB_EXTENDED|OS]

 background_core_dump            Core Size for Background Processes [partial | full]
 background_dump_dest            Detached process dump directory [file_path]
 backup_tape_io_slaves           BACKUP Tape I/O slaves [TRUE | FALSE]
 bitmap_merge_area_size          Maximum memory allow for BITMAP MERGE [NUMBER]
 blank_trimming                  Blank trimming semantics parameter [TRUE | FALSE]
 buffer_pool_keep                Number of database blocks/latches in
                                 keep buffer pool [CHAR: (buffers:n, latches:m)]
 buffer_pool_recycle             Number of database blocks/latches in
                                  recycle buffer pool [CHAR: (buffers:n, latches:m)]

 circuits                        Max number of virtual circuits [NUMBER]
 cluster_database                If TRUE startup in cluster database mode [TRUE | FALSE]
 cluster_database_instances      Number of instances to use for sizing
                                  cluster db SGA structures [NUMBER]
 cluster_interconnects           Interconnects for RAC use [CHAR]
 commit_point_strength           Bias this node has toward not preparing
                                  in a two-phase commit [NUMBER (0-255)]
 compatible                      Database will be completely compatible
                                  with this software version [CHAR: 9.2.0.0.0]
 control_file_record_keep_time   Control file record keep time in days [NUMBER]
 control_files                   Control file names list [file_path,file_path..]
 core_dump_dest                  Core dump directory [file_path]
 cpu_count                       Initial number of cpu's for this instance [NUMBER]
 create_bitmap_area_size         Size of create bitmap buffer for bitmap
                                  index [INTEGER]
 cursor_sharing                  Cursor sharing mode [EXACT | SIMILAR | FORCE]
 create_stored_outlines          Create stored outlines for DML statements [TRUE | FALSE | category_name] 
 cursor_space_for_time           Use more memory in order to get faster
                                  execution [TRUE | FALSE]

 db_16k_cache_size               Size of cache for 16K buffers [bytes]
 db_2k_cache_size                Size of cache for 2K buffers [bytes]
 db_32k_cache_size               Size of cache for 32K buffers [bytes]
 db_4k_cache_size                Size of cache for 4K buffers [bytes]
 db_8k_cache_size                Size of cache for 8K buffers [bytes]
 db_block_buffers                Number of database blocks to cache in memory
                                  [bytes: 8M or NUMBER of blocks (Ora7)]
 db_block_checking               Data and index block checking [TRUE | FALSE]
 db_block_checksum               Store checksum in db blocks and check
                                 during reads [TRUE | FALSE]
 db_block_size                   Size of database block [bytes]
 db_cache_advice                 Buffer cache sizing advisory [internal use only]
 db_cache_size                   Size of DEFAULT buffer pool for standard
                                  block size buffers [bytes]
 db_create_file_dest             Default database location ['Path_to_directory']
 db_create_online_log_dest_n     Online log/controlfile destination (where n=1-5) ['Path']
 db_domain                       Directory part of global database name
                                  stored with CREATE DATABASE [CHAR]
* db_file_multiblock_read_count   Db blocks to be read each IO [NUMBER]
 db_file_name_convert            Datafile name convert patterns and
                                 strings for standby/clone db [, ]
 db_files                        Max allowable # db files [NUMBER]
 db_flashback_retention_target   Maximum Flashback Database log retention time in minutes [NUMBER]
 db_keep_cache_size              Size of KEEP buffer pool for standard
                                  block size buffers [bytes]
 db_name                         Database name specified in CREATE
                                  DATABASE [CHAR]
 db_recovery_file_dest           Default database recovery file location [CHAR]
 db_recovery_file_dest_size      Database recovery files size limit [bytes]
 db_recycle_cache_size           Size of RECYCLE buffer pool for standard
                                  block size buffers [bytes]
 db_unique_name                  Database Unique Name [CHAR]
 db_writer_processes             Number of background database writer
                                  processes to start [NUMBER]
 dblink_encrypt_login            Enforce password for distributed login
                                  always be encrypted [TRUE | FALSE]
 dbwr_io_slaves                  DBWR I/O slaves [NUMBER]
 ddl_wait_for_locks              Disable NOWAIT DML lock acquisitions [TRUE | FALSE]
 dg_broker_config_file1          Data guard broker configuration file #1 ['Path']
 dg_broker_config_file2          Data guard broker configuration file #2 ['Path']
 dg_broker_start                 Start Data Guard broker framework (DMON
                                  process) [TRUE | FALSE]
 disk_asynch_io                  Use asynch I/O for random access devices [TRUE | FALSE]
 dispatchers                     Specifications of dispatchers
 (MTS_dispatchers in Ora 8)        [CHAR]
 distributed_lock_timeout        Number of seconds a distributed transaction
                                  waits for a lock [Internal]
 dml_locks                       Dml locks - one for each table modified
                                  in a transaction [NUMBER]
 drs_start                       Start DG Broker monitor (DMON process)[TRUE | FALSE]

 enqueue_resources                Resources for enqueues [NUMBER]
 event                           Debug event control - default null string [CHAR]
                                    
 fal_client                      FAL client [CHAR]
 fal_server                      FAL server list [CHAR]
 fast_start_io_target            Upper bound on recovery reads [NUMBER]
 fast_start_mttr_target          MTTR target of forward crash recovery
                                  in seconds [NUMBER]
 fast_start_parallel_rollback    Max number of parallel recovery slaves
                                  that may be used [LOW | HIGH | FALSE]
 file_mapping                    Enable file mapping [TRUE | FALSE]
 fileio_network_adapters         Network Adapters for File I/O [CHAR]
 filesystemio_options            IO operations on filesystem files [Internal]
 fixed_date                      Fix SYSDATE value for debugging[NONE or '2000_12_30_24_59_00']

 gc_files_to_locks               RAC/OPS - lock granularity number of 
                                  global cache locks per file (DFS) [CHAR]
 gcs_server_processes            Number of background gcs server processes to start [NUMBER]
 global_context_pool_size        Global Application Context Pool Size in
                                  Bytes [bytes]
 global_names                    Enforce that database links have same
                                  name as remote database [TRUE | FALSE]

 hash_area_size                  Size of in-memory hash work area (Shared Server)[bytes]
 hash_join_enabled               Enable/disable hash join (CBO) [TRUE | FALSE]
 hi_shared_memory_address        SGA starting address (high order 32-bits
                                  on 64-bit platforms) [NUMBER]
 hs_autoregister                 Enable automatic server DD updates in HS
                                  agent self-registration [TRUE | FALSE]

 ifile                           Include file in init.ora ['path_to_file']
 instance_groups                 List of instance group names [CHAR]
 instance_name                   Instance name supported by the instance [CHAR]
 instance_number                 Instance number [NUMBER]
 instance_type                   Type of instance to be executed
                                  RDBMS or Automated Storage Management [RDBMS | ASM]

 java_max_sessionspace_size      Max allowed size in bytes of a Java
                                  sessionspace [bytes]
 java_pool_size                  Size in bytes of the Java pool [bytes]
 java_soft_sessionspace_limit    Warning limit on size in bytes of a Java
                                  sessionspace [NUMBER]
 job_queue_processes             Number of job queue slave processes [NUMBER]

 large_pool_size                 Size in bytes of the large allocation pool [bytes]
 ldap_directory_access           RDBMS's LDAP access option [NONE | PASSWORD | SSL]
 license_max_sessions            Maximum number of non-system user sessions
                                  (concurrent licensing) [NUMBER]
 license_max_users               Maximum number of named users that can be created
                                  (named user licensing) [NUMBER]
 license_sessions_warning        Warning level for number of non-system
                                  user sessions [NUMBER]
 local_listener                  Define which listeners instances register with [CHAR]
 lock_name_space                 Used for generating lock names for standby/primary database 
                                  assign each a unique name space [CHAR]
 lock_sga                        Lock entire SGA in physical memory [Internal]
 log_archive_config               Log archive config 
                                  [SEND|NOSEND] [RECEIVE|NORECEIVE] [ DG_CONFIG]
 log_archive_dest                Archive logs destination ['path_to_directory']
 log_archive_dest_n              Archive logging parameters (n=1-10)
                                  Enterprise Edition [CHAR]
 log_archive_dest_state_n        Archive logging parameter status (n=1-10) [CHAR]
                                  Enterprise Edition [CHAR]
 log_archive_duplex_dest         Duplex archival destination ['path_to_directory']
 log_archive_format              Archive log filename format [CHAR: "MyApp%S.ARC"]
 log_archive_local_first         Establish EXPEDITE attribute default value [TRUE | FALSE]
 log_archive_max_processes       Maximum number of active ARCH processes [NUMBER]
 log_archive_min_succeed_dest    Minimum number of archive destinations
                                  that must succeed [NUMBER]
 log_archive_start               Start archival process on SGA initialization [TRUE | FALSE]
 log_archive_trace               Archive log tracing level [NUMBER]                                    
 log_buffer                      Redo circular buffer size [bytes]
 log_checkpoint_interval         Checkpoint threshold, # redo blocks [NUMBER]
 log_checkpoint_timeout          Checkpoint threshold, maximum time interval between
                                  checkpoints in seconds [NUMBER]
 log_checkpoints_to_alert        Log checkpoint begin/end to alert file [TRUE | FALSE]
 log_file_name_convert           Logfile name convert patterns and
                                  strings for standby/clone db [, ]
 log_parallelism                 Number of log buffer strands [NUMBER]
 logmnr_max_persistent_sessions  Maximum number of threads to mine [NUMBER]

 max_commit_propagation_delay    Max age of new snapshot in .01 seconds [NUMBER]
 max_dispatchers                 Max number of dispatchers [NUMBER]
 max_dump_file_size              Maximum size (blocks) of dump file [UNLIMITED or bytes]
 max_enabled_roles               Max number of roles a user can have enabled [NUMBER]
 max_rollback_segments           Max number of rollback segments in SGA cache [NUMBER]
 max_shared_servers              Max number of shared servers [NUMBER]
 mts_circuits                    Max number of circuits [NUMBER]
 mts_dispatchers                 Specifications of dispatchers [CHAR]
 mts_listener_address            Address(es) of network listener [CHAR]
 mts_max_dispatchers             Max number of dispatchers [NUMBER]
 mts_max_servers                 Max number of shared servers [NUMBER]
 mts_multiple_listeners          Are multiple listeners enabled? [TRUE | FALSE]
 mts_servers                     Number of shared servers to start up [NUMBER]
 mts_service                     Service supported by dispatchers [CHAR]
 mts_sessions                    max number of shared server sessions [NUMBER]

 nls_calendar                    NLS calendar system name (Default=GREGORIAN) [CHAR]
 nls_comp                        NLS comparison, Enterprise Edition [BINARY | ANSI]
 nls_currency                    NLS local currency symbol [CHAR]
 nls_date_format                 NLS Oracle date format [CHAR]
 nls_date_language               NLS date language name (Default=AMERICAN) [CHAR]
 nls_dual_currency               Dual currency symbol [CHAR]
 nls_iso_currency                NLS ISO currency territory name
                                  override the default set by NLS_TERRITORY [CHAR]
 nls_language                    NLS language name (session default) [CHAR]
 nls_length_semantics            Create columns using byte or char
                                  semantics by default [BYTE | CHAR]
 nls_nchar_conv_excp             NLS raise an exception instead of
                                  allowing implicit conversion [CHAR]
 nls_numeric_characters          NLS numeric characters [CHAR]
 nls_sort                        Case-sensitive or insensitive sort [Language]
                                  language may be BINARY, BINARY_CI, BINARY_AI,
                                  GERMAN, GERMAN_CI, etc
 nls_territory                   NLS territory name (country settings) [CHAR]
 nls_time_format                 Time format [CHAR]
 nls_time_tz_format              Time with timezone format [CHAR]
 nls_timestamp_format            Time stamp format [CHAR]
 nls_timestamp_tz_format         Timestamp with timezone format [CHAR]

 object_cache_max_size_percent   Percentage of maximum size over optimal
                                  of the user session's ob [NUMBER]
 object_cache_optimal_size       Optimal size of the user session's
                                  object cache in bytes [bytes]
 olap_page_pool_size             Size of the olap page pool in bytes [bytes]
 open_cursors                    Max # cursors per session [NUMBER]
 open_links                      Max # open links per session [NUMBER]
 open_links_per_instance         Max # open links per instance [NUMBER]
 optimizer_dynamic_sampling      Optimizer dynamic sampling [NUMBER]
 optimizer_features_enable       Optimizer plan compatibility 
                                  (oracle version e.g. 8.1.7) [CHAR]
 optimizer_index_caching         Optimizer index caching percent [NUMBER]
 optimizer_index_cost_adj        Optimizer index cost adjustment [NUMBER]
 optimizer_max_permutations      Optimizer maximum join permutations per
                                  query block [NUMBER]
 optimizer_mode                  Optimizer mode [RULE | CHOOSE | FIRST_ROWS | ALL_ROWS]
 oracle_trace_collection_name    Oracle TRACE default collection name [CHAR]
 oracle_trace_collection_path    Oracle TRACE collection path [CHAR]
 oracle_trace_collection_size    Oracle TRACE collection file max. size [NUMBER]
 oracle_trace_enable             Oracle Trace enabled/disabled [TRUE | FALSE]
 oracle_trace_facility_name      Oracle TRACE default facility name [CHAR]
 oracle_trace_facility_path      Oracle TRACE facility path [CHAR]
 os_authent_prefix               Prefix for auto-logon accounts [CHAR]
 os_roles                        Retrieve roles from the operating system [TRUE | FALSE]

 parallel_adaptive_multi_user    Enable adaptive setting of degree for
                                  multiple user streams [TRUE | FALSE]
 parallel_automatic_tuning       Enable intelligent defaults for parallel
                                  execution parameters  [TRUE | FALSE]
 parallel_execution_message_size Message buffer size for parallel
                                  execution [bytes]
 parallel_instance_group         Instance group to use for all parallel
                                  operations [CHAR]
 parallel_max_servers            Maximum parallel query servers per
                                  instance [NUMBER]
 parallel_min_percent            Minimum percent of threads required for
                                  parallel query [NUMBER]
 parallel_min_servers            Minimum parallel query servers per
                                  instance [NUMBER]
 parallel_server                 If TRUE startup in parallel server mode [TRUE | FALSE]
 parallel_server_instances       Number of instances to use for sizing
                                  OPS SGA structures [NUMBER]
 parallel_threads_per_cpu        Number of parallel execution threads per
                                  CPU [NUMBER]
 partition_view_enabled          Enable/disable partitioned views [TRUE | FALSE]
 pga_aggregate_target            Target size for the aggregate PGA memory
                                  consumed by the instance [bytes]
 plsql_code_type                 PL/SQL code-type [INTERPRETED | NATIVE]
 plsql_compiler_flags            PL/SQL compiler flags [CHAR]
 plsql_debug                     PL/SQL debug [TRUE | FALSE]
 plsql_native_c_compiler         plsql native C compiler [CHAR]
 plsql_native_library_dir        plsql native library dir ['Path_to_directory']
 plsql_native_library_subdir_count  plsql native library number of
                                     subdirectories [NUMBER]
 plsql_native_linker             plsql native linker [CHAR]
 plsql_native_make_file_name     plsql native compilation make file [CHAR]
 plsql_native_make_utility       plsql native compilation make utility [CHAR]
 plsql_optimize_level            PL/SQL optimize level [NUMBER]
 plsql_v2_compatibility          PL/SQL version 2.x compatibility flag [TRUE | FALSE]
 plsql_warnings                  PL/SQL compiler warnings settings [CHAR]
                                  See also DBMS_WARNING and DBA_PLSQL_OBJECT_SETTINGS 
 pre_page_sga                    Pre-page sga for process [TRUE | FALSE]
 processes                       User processes [NUMBER]

 query_rewrite_enabled           Allow rewrite of queries using materialized views
                                   if enabled [FORCE | TRUE | FALSE]
 query_rewrite_integrity         Perform rewrite using materialized views
                                  with desired integrity [STALE_TOLERATED | TRUSTED | ENFORCED]

 rdbms_server_dn                 RDBMS's Distinguished Name [CHAR]
 read_only_open_delayed          If TRUE delay opening of read only files
                                  until first access [TRUE | FALSE]
 recovery_parallelism            Number of server processes to use for
                                  parallel recovery [NUMBER]
 remote_archive_enable           Remote archival enable setting [RECEIVE[,SEND] | FALSE | TRUE]
 remote_dependencies_mode        Remote-procedure-call dependencies mode
                                  parameter [TIMESTAMP | SIGNATURE]
 remote_listener                 Remote listener [CHAR]
 remote_login_passwordfile       Use a password file [NONE | SHARED | EXCLUSIVE]
 remote_os_authent               Allow non-secure remote clients to use
                                  auto-logon accounts [TRUE | FALSE]
 remote_os_roles                 Allow non-secure remote clients to use
                                  os roles [TRUE | FALSE]
 replication_dependency_tracking Tracking dependency for Replication
                                  parallel propagation [TRUE | FALSE]
 resource_limit                  Master switch for resource limit [TRUE | FALSE]
 resource_manager_plan            Resource mgr top plan [Plan_Name]
 resumable_timeout               Set resumable_timeout, seconds [NUMBER] 
 rollback_segments               Undo segment list [CHAR]
 row_locking                     Row-locking [ALWAYS | DEFAULT | INTENT] (Default=always)

 serial_reuse                    Reuse the frame segments [DISABLE | SELECT|DML|PLSQL|ALL|NULL] 
 serializable                    Serializable [Internal]
 service_names                   Service names supported by the instance [CHAR]
 session_cached_cursors          Number of cursors to save in the session
                                  cursor cache [NUMBER]
 session_max_open_files          Maximum number of open files allowed per
                                  session  [NUMBER]
 sessions                        User and system sessions [NUMBER]
 sga_max_size                    Max total SGA size [bytes]
 sga_target                      Target size of SGA [bytes]
 shadow_core_dump                Core Size for Shadow Processes [PARTIAL | FULL | NONE]
 shared_memory_address           SGA starting address (low order 32-bits
                                  on 64-bit platforms) [NUMBER]
 shared_pool_reserved_size       Size in bytes of reserved area of shared
                                  pool [bytes]
 shared_pool_size                Size in bytes of shared pool [bytes]
 shared_server_sessions          Max number of shared server sessions [NUMBER]
 shared_servers                  Number of shared servers to start up [NUMBER]
 skip_unusable_indexes           Skip unusable indexes if set to true [TRUE | FALSE]
 sort_area_retained_size         Size of in-memory sort work area
                                  retained between fetch calls [bytes]
 sort_area_size                  Size of in-memory sort work area [bytes]
 smtp_out_server                 utl_smtp server and port configuration parameter [server_clause]
 spfile                          Server parameter file [CHAR]
 sp_name                         Service Provider Name [CHAR]
 sql92_security                  Require select privilege for searched
                                  update/delete [TRUE | FALSE]
 sql_trace                       Enable SQL trace [TRUE | FALSE]
 sqltune_category                Category qualifier for applying hintsets [CHAR]
 sql_version                     Sql language version parameter for
                                  compatibility issues [CHAR]
 standby_archive_dest            Standby database archivelog destination
                                  text string ['Path_to_directory']
 standby_file_management         If auto then files are created/dropped
                                  automatically on standby [MANUAL | AUTO]
 star_transformation_enabled     Enable the use of star transformation
                                  [TRUE | FALSE | DISABLE_TEMP_TABLE]
 statistics_level                Statistics level [ALL | TYPICAL | BASIC]
 streams_pool_size               Size in bytes of the streams pool [bytes]

 tape_asynch_io                  Use asynch I/O requests for tape devices [TRUE | FALSE]
 thread                          Redo thread to mount [NUMBER]
 timed_os_statistics             Internal os statistic gathering interval
                                  in seconds [NUMBER]
 timed_statistics                Maintain internal timing statistics [TRUE | FALSE]
 trace_enabled                   Enable KST tracing  (Internal parameter) [TRUE | FALSE]
 tracefile_identifier            Trace file custom identifier [CHAR]
 transaction_auditing            Transaction auditing records generated
                                 in the redo log [TRUE | FALSE]
 transactions                    Max. number of concurrent active
                                  transactions [NUMBER]
 transactions_per_rollback_segment Number of active transactions per
                                    rollback segment [NUMBER]

 undo_management                 Instance runs in SMU mode if TRUE, else
                                  in RBU mode [MANUAL | AUTO]
 undo_retention                  Undo retention in seconds [NUMBER]
 undo_suppress_errors            Suppress RBU errors in SMU mode [TRUE | FALSE]
 undo_tablespace                 Use or switch undo tablespace [Undo_tbsp_name]
 use_indirect_data_buffers       Enable indirect data buffers (very large
                                  SGA on 32-bit platforms [TRUE | FALSE]
 user_dump_dest                  User process dump directory ['Path_to_directory']
 utl_file_dir                    utl_file accessible directories list
                                   utl_file_dir='Path1', 'Path2'..
					                     or
                                   utl_file_dir='Path1'  # Must be
                                   utl_file_dir='Path2'  # consecutive entries

 workarea_size_policy            Policy used to size SQL working areas [MANUAL | AUTO]


db_file_multiblock_read_count:
The db_file_multiblock_read_count initialization parameter determines the  maximum number of database blocks 
read in one I/O operation during a full  table scan.  The setting of this parameter can reduce 
the number of I/O calls required for a full table scan, thus improving performance.     


15.6 9i UNDO or ROLLBACK parameters:
------------------------------------


- UNDO_MANAGEMENT
  If AUTO, use automatic undo management mode. If MANUAL, use manual undo management mode.
 
- UNDO_TABLESPACE
  A dynamic parameter specifying the name of an undo tablespace to use.
 
- UNDO_RETENTION
  A dynamic parameter specifying the length of time to retain undo. Default is 900 seconds.
 
- UNDO_SUPPRESS_ERRORS
  If TRUE, suppress error messages if manual undo management SQL statements are issued when operating 
  in automatic undo management mode. If FALSE, issue error message. This is a dynamic parameter.

 
If you're database is on manual, you can still use the following 8i type parameters:

- ROLLBACK_SEGMENTS
  Specifies the rollback segments to be acquired at instance startup
 
- TRANSACTIONS
  Specifies the maximum number of concurrent transactions
 
- TRANSACTIONS_PER_ROLLBACK_SEGMENT
  Specifies the number of concurrent transactions that each rollback segment is expected to handle
 
- MAX_ROLLBACK_SEGMENTS
  Specifies the maximum number of rollback segments that can be online for any instance
 

15.7 Oracle 9i init file examples:
---------------------------------=

Example 1:
----------

# Cache and I/O
DB_BLOCK_SIZE=4096
DB_CACHE_SIZE=20971520

# Cursors and Library Cache
CURSOR_SHARING=SIMILAR
OPEN_CURSORS=300

# Diagnostics and Statistics
BACKGROUND_DUMP_DEST=/vobs/oracle/admin/mynewdb/bdump
CORE_DUMP_DEST=/vobs/oracle/admin/mynewdb/cdump
TIMED_STATISTICS=TRUE
USER_DUMP_DEST=/vobs/oracle/admin/mynewdb/udump

# Control File Configuration
CONTROL_FILES=("/vobs/oracle/oradata/mynewdb/control01.ctl",
               "/vobs/oracle/oradata/mynewdb/control02.ctl",
               "/vobs/oracle/oradata/mynewdb/control03.ctl")

# Archive
LOG_ARCHIVE_DEST_1='LOCATION=/vobs/oracle/oradata/mynewdb/archive'
LOG_ARCHIVE_FORMAT=%t_%s.dbf
LOG_ARCHIVE_START=TRUE

# Shared Server
# Uncomment and use first DISPATCHES parameter below when your listener is
# configured for SSL 
# (listener.ora and sqlnet.ora)
# DISPATCHERS = "(PROTOCOL=TCPS)(SER=MODOSE)",
#               "(PROTOCOL=TCPS)(PRE=oracle.aurora.server.SGiopServer)"
DISPATCHERS="(PROTOCOL=TCP)(SER=MODOSE)",
            "(PROTOCOL=TCP)(PRE=oracle.aurora.server.SGiopServer)",
             (PROTOCOL=TCP)

# Miscellaneous
COMPATIBLE=9.2.0
DB_NAME=mynewdb

# Distributed, Replication and Snapshot
DB_DOMAIN=us.oracle.com
REMOTE_LOGIN_PASSWORDFILE=EXCLUSIVE

# Network Registration
INSTANCE_NAME=mynewdb

# Pools
JAVA_POOL_SIZE=31457280
LARGE_POOL_SIZE=1048576
SHARED_POOL_SIZE=52428800

# Processes and Sessions
PROCESSES=150

# Redo Log and Recovery
FAST_START_MTTR_TARGET=300

# Resource Manager
RESOURCE_MANAGER_PLAN=SYSTEM_PLAN

# Sort, Hash Joins, Bitmap Indexes
SORT_AREA_SIZE=524288

# Automatic Undo Management
UNDO_MANAGEMENT=AUTO
UNDO_TABLESPACE=undotbs


Example 2:
----------

##############################################################################
# Copyright (c) 1991, 2001 by Oracle Corporation
##############################################################################
 
###########################################
# Cache and I/O
###########################################
db_block_size=8192
db_cache_size=50331648
 
###########################################
# Cursors and Library Cache
###########################################
open_cursors=300
 
###########################################
# Diagnostics and Statistics
###########################################
background_dump_dest=D:\oracle\admin\iasdb\bdump
core_dump_dest=D:\oracle\admin\iasdb\cdump
timed_statistics=TRUE
user_dump_dest=D:\oracle\admin\iasdb\udump
 
###########################################
# Distributed, Replication and Snapshot
###########################################
db_domain=missrv.miskm.mindef.nl
remote_login_passwordfile=EXCLUSIVE
 
###########################################
# File Configuration
###########################################
control_files=("D:\oracle\oradata\iasdb\CONTROL01.CTL", "D:\oracle\oradata\iasdb\CONTROL02.CTL", "D:\oracle\oradata\iasdb\CONTROL03.CTL")
 
###########################################
# Job Queues
###########################################
job_queue_processes=4
 
###########################################
# MTS
###########################################
dispatchers="(PROTOCOL=TCP)(PRE=oracle.aurora.server.GiopServer)", "(PROTOCOL=TCP)(PRE=oracle.aurora.server.SGiopServer)"
 
###########################################
# Miscellaneous
###########################################
aq_tm_processes=1
compatible=9.0.0
db_name=iasdb
 
###########################################
# Network Registration
###########################################
instance_name=iasdb
 
###########################################
# Pools
###########################################
java_pool_size=41943040
shared_pool_size=33554432
 
###########################################
# Processes and Sessions
###########################################
processes=150
 
###########################################
# Redo Log and Recovery
###########################################
fast_start_mttr_target=300
 
###########################################
# Sort, Hash Joins, Bitmap Indexes
###########################################
pga_aggregate_target=33554432
sort_area_size=524288
 
###########################################
# System Managed Undo and Rollback Segments
###########################################
undo_management=AUTO
undo_tablespace=UNDOTBS


==============
17. Snapshots:
==============

Snapshots allow you to replicate data based on column- and/or row-level subsetting, 
while multimaster replication requires replication of the entire table. 

You need a database link to implement replication.

17.1 Database link:
-------------------

In de "local" database, waar de snapshot copy komt te staan, geef een statement als bijv:

  CREATE PUBLIC DATABASE LINK MY_LINK
  CONNECT TO HARRY IDENTIFIED BY password
  USING 'DB1';

De servicename "DB1" wordt via de tnsnames.ora geresolved in
een connectdescriptor, waarin de remote Servername, protocol, en 
SID van de remote database bekend is geworden.

Nu is het mogelijk om bijv. de table employee in de remote database "DB1"
te SELECTeren:

  SELECT * FROM employee@MY_LINK;

Ook 2PC is geimplementeerd:

  update employee  set amount=amount-100;

  update employee@my_link  set amount=amount+100;

  commit;


17.2 Snapshots:
---------------

There are in general 2 styles of snapshots available

Simple snapshot:

  One to one replication of a remote table to a local snapshot (=table).

  The refresh of the snapshot can be a complete refresh, with the refresh rate
  specified in the "create snapshot" command.
  Also a snapshot log can be used at the remote original table in order to replicate
  only the transaction data.

Complex snapshot:

  If multiple remote tables are joined in order to create/refresh a local snapshot,
  it is a "complex snapshot". Only complete refreshes are possible.
  If joins or complex query clauses are used, like group by, one can only
  use a "complex snapshot".

-> Example COMPLEX snapshot:

On the local database:

CREATE SNAPSHOT EMP_DEPT_COUNT

pctfree 5
tablespace SNAP
storage (initial 100K next 100K pctincrease 0)

REFRESH COMPLETE
START WITH SYSDATE
NEXT SYSDATE+7
AS
SELECT DEPTNO, COUNT(*) Dept_count
FROM EMPLOYEE@MY_LINK
GROUP BY Deptno;


Because the records in this snapshot will not correspond one to one
with the records in the master table (since the query contains a group by clause)
this is a complex snapshot. Thus the snapshot will be completely recreated
every time it is refreshed.


-> Example SIMPLE snapshot:

On the local database:

CREATE SNAPSHOT EMP_DEPT_COUNT
pctfree 5
tablespace SNAP
storage (initial 100K next 100K pctincrease 0)
REFRESH FAST
START WITH SYSDATE
NEXT SYSDATE+7
AS
SELECT * FROM EMPLOYEE@MY_LINK

In this case the refresh fast clause tells oracle to use a snapshot log to refresh the local snapshot.
When a snapshotlog is used, only the changes to the master table are sent to the targets.
The snapshot log must be created in the master database (WHERE the original object is)

create snapshot log on employee
tablespace data
storage (initial 100K next 100K pctincrease 0);


Snapshot groups:
----------------

A snapshot group in a replication system maintains a partial or complete copy of the objects 
at the target master group. Snapshot groups cannot span master group boundaries. 
Figure 3-7 displays the correlation between Groups A and B at the master site and Groups A and B at the snapshot site. 

Group A at the snapshot site (see Figure 3-7) contains only some of the objects in the corresponding Group A 
at the master site. Group B at the snapshot site contains all objects in Group B at the master site. 
Under no circumstances, however, could Group B at the snapshot site contain objects FROM Group A at the master site. 
As illustrated in Figure 3-7, a snapshot group has the same name as the master group on which the snapshot group is based. 
For example, a snapshot group based on a "PERSONNEL" master group is also named "PERSONNEL." 

In addition to maintaining organizational consistency between snapshot sites and master sites, 
snapshot groups are required for supporting updateable snapshots. 
If a snapshot does not belong to a snapshot group, then it must be a read-only snapshot. 

A snapshot group is used to organize snapshots in a logical manner.


Refresh groups:
---------------

If 2 or more master tables which have a PK-FK relationship, are replicated, it is possible'that the 
2 cooresponding snapshots violate the referential integrety, because of different refresh times and schedules etc..

Related snapshots can be collected int refresh groups. The purpose of a refresh group is to coordinate
the refresh schedules of it's members.

This is achieved via the DBMS_REFRESH package. The procedures in this package are 
MAKE, ADD, SUBSTRACT, CHANGE, DESTROY, and REFRESH

A refresh group could contain more than one snapshot groups.


Types of snapshots:
-------------------

Primary Key
-----------

Primary key snapshots are the default type of snapshot. They are updateable if the snapshot was 
created as part of a snapshot group and "FOR UPDATE" was specified when defining the snapshot. 
Changes are propagated according to the row-level changes that have occurred, as identified by 
the primary key value of the row (not the ROWID). The SQL statement for creating an updateable, 
primary key snapshot might look like: 

CREATE SNAPSHOT sales.customer FOR UPDATE AS
 SELECT * FROM sales.customer@dbs1.acme.com;

Primary key snapshots may contain a subquery so that you can create a horizontally partitioned subset 
of data at the remote snapshot site. This subquery may be as simple as a basic WHERE clause or as 
complex as a multilevel WHERE EXISTS clause. Primary key snapshots that contain a SELECTed class of subqueries 
can still be incrementally or fast refreshed. The following is a subquery snapshot with a WHERE 
clause containing a subquery: 

CREATE SNAPSHOT sales.orders REFRESH FAST AS
 SELECT * FROM sales.orders@dbs1.acme.com o
 WHERE EXISTS
   (SELECT 1 FROM sales.customer@dbs1.acme.com c
    WHERE o.c_id = c.c_id AND zip = 19555);

ROWID
-----

For backwards compatibility, Oracle supports ROWID snapshots in addition to the default primary 
key snapshots. A ROWID snapshot is based on the physical row identifiers (ROWIDs) of the rows in a master table. 
ROWID snapshots should be used only for snapshots based on master tables FROM an Oracle7 database, 
and should not be used when creating new snapshots based on master tables FROM Oracle release 8.0 or greater databases. 

CREATE SNAPSHOT sales.customer REFRESH WITH ROWID AS
 SELECT * FROM sales.customer@dbs1.acme.com;

Complex
-------

To be fast refreshed, the defining query for a snapshot must observe certain restrictions. 
If you require a snapshot whose defining query is more general and cannot observe the restrictions, 
then the snapshot is complex and cannot be fast refreshed. 

Specifically, a snapshot is considered complex when the defining query of the snapshot contains: 

A CONNECT BY clause 

Clauses that do not comply with the requirements detailed in Table 3-1, "Restrictions for Snapshots with Subqueries" 

A set operation, such as UNION, INTERSECT, or MINUS 

In most cases, a distinct or aggregate function, although it is possible 
to have a distinct or aggregate function in the defining query and still have a simple snapshot 

See Also: 
Oracle8i Data Warehousing Guide for more information about complex materialized views. 
"Snapshot" is synonymous with "materialized view" in Oracle documentation, and "materialized view" 
is used in the Oracle8i Data Warehousing Guide.  
  

The following statement is an example of a complex snapshot CREATE statement: 

CREATE SNAPSHOT scott.snap_employees AS
 SELECT emp.empno, emp.ename FROM scott.emp@dbs1.acme.com
  UNION ALL
 SELECT new_emp.empno, new_emp.ename FROM scott.new_emp@dbs1.acme.com;

Read Only
---------

Any of the previously described types of snapshots can be made read-only by 
omitting the FOR UPDATE clause or disabling the equivalent checkbox in the Replication Manager interface. 
Read-only snapshots use many of the same mechanisms as updateable snapshots, 
except that they do not need to belong to a snapshot group. 

Snapshot Registration at a Master Site
--------------------------------------

At the master site, an Oracle database automatically registers information about a 
snapshots based on its master table(s). 
The following sections explain more about Oracle's snapshot registration mechanism. 

DBA_REGISTERED_SNAPSHOTS and DBA_SNAPSHOT_REFRESH_TIMES dictionary views

You can query the DBA_REGISTERED_SNAPSHOTS data dictionary view to list the
following information about a remote snapshot: 

The owner, name, and database that contains the snapshot 
The snapshot's defining query 
Other snapshot characteristics, such as its refresh method (fast or complete) 

You can also query the DBA_SNAPSHOT_REFRESH_TIMES view at the master site to 
obtain the last refresh times for each snapshot. Administrators can use this information 
to monitor snapshot activity FROM master sites and coordinate changes to snapshot sites 
if a master table needs to be dropped, altered, or relocated. 

Internal Mechanisms
Oracle automatically registers a snapshot at its master database when you create the snapshot, 
and unregisters the snapshot when you drop it. 


Caution: 
Oracle cannot guarantee the registration or unregistration of a snapshot at 
its master site during the creation or drop of the snapshot, respectively. 
If Oracle cannot successfully register a snapshot during creation, 
Oracle completes snapshot registration during a subsequent refresh of the snapshot. 
If Oracle cannot successfully unregister a snapshot when you drop the snapshot, 
the registration information for the snapshot persists in the master database until 
it is manually unregistered. Complex snapshots might not be registered.  

Manual registration
-------------------

If necessary, you can maintain registration manually. 
Use the REGISTER_SNAPSHOT and UNREGISTER_SNAPSHOT procedures of the 
DBMS_SNAPSHOT package at the master site to add, modify, or remove snapshot registration information. 

 
Snapshot Log
------------

When you create a snapshot log for a master table, Oracle creates an underlying table 
as the snapshot log. A snapshot log holds the primary keys and/or the ROWIDs of rows 
that have been updated in the master table. A snapshot log can also contain filter columns 
to support fast refreshes of snapshots with subqueries. 
The name of a snapshot log's table is MLOG$_master_table_name. 
The snapshot log is created in the same schema as the target master table. 
One snapshot log can support multiple snapshots on its master table. 

As described in the previous section, the internal trigger adds change information 
to the snapshot log whenever a DML transaction has taken place on the target master table. 

There are three types of snapshot logs: 

Primary Key: The snapshot records changes to the master table based on the primary key of the affected rows. 
Row ID: The snapshot records changes to the master table based on the ROWID of the affected rows. 
Combination: The snapshot records changes to the master table based on both the primary key and the 
ROWID of the affected rows. This snapshot log supports both primary key and ROWID snapshots, which is helpful for mixed environments. 

A combination snapshot log works in the same manner as the primary key and ROWID snapshot log, 
except that both the primary key and the ROWID of the affected row are recorded. 

Though the difference between snapshot logs based on primary keys and ROWIDs is small 
(one records affected rows using the primary key, while the other records affected rows using the physical ROWID), 
the practical impact is large. Using ROWID snapshots and snapshot logs makes reorganizing and truncating your master tables 
difficult because it prevents your ROWID snapshots FROM being fast refreshed. 
If you reorganize or truncate your master table, your ROWID snapshot must be COMPLETE refreshed 
because the ROWIDs of the master table have changed. 


To delete a snapshot log, execute the DROP SNAPSHOT LOG SQL statement in SQL*Plus. 
For example, the following statement deletes the snapshot log for a table named CUSTOMERS in the SALES schema: 

DROP SNAPSHOT LOG ON sales.customers;

To delete the master table, use

truncate table TABLE_NAME purge snapshot log; 


=============
18. Triggers:
=============


A trigger is PL/SQL code block attached and executed by an event which occurs to a database table. 
Triggers are implicitly invoked by DML commands. Triggers are stored as text and compiled at 
execute time, because of this it is wise not to include much code in them but to call out to 
previously stored procedures or packages as this will greatly improve perfoRMANce. 
You may not use COMMIT, ROLLBACK and SAVEPOINT statements within trigger blocks. 
Remember that triggers may be executed thousands of times for a large update - 
they can seriously affect SQL execution perfoRMANce.

Triggers may be called BEFORE or AFTER the following events :-

INSERT, UPDATE and DELETE.

Triggers may be STATEMENT or ROW types. 

- STATEMENT triggers fire BEFORE or AFTER the execution of the statement 
  that caused the trigger to fire. 

- ROW triggers fire BEFORE or AFTER any affected row is processed.

An example of a statement trigger follows :-

CREATE OR REPLACE TRIGGER MYTRIG1 
BEFORE DELETE OR INSERT OR UPDATE ON JD11.BOOK
BEGIN
   IF (TO_CHAR(SYSDATE,'DAY') IN ('sat','sun')) OR (TO_CHAR(SYSDATE,'hh24:mi') NOT BETWEEN '08:30' AND '18:30') THEN
      RAISE_APPLICATION_ERROR(-20500,'Table is secured');
   END IF;
END;

After the CREATE OR REPLACE statement is the object identifier (TRIGGER) and the object name (MYTRIG1). 
This trigger specifies that before any data change event on the BOOK table this PL/SQL code block 
will be compiled and executed. The user will not be allowed to update the table outside of normal working hours.

An example of a row trigger follows :-

CREATE OR REPLACE TRIGGER MYTRIG2 
AFTER DELETE OR INSERT OR UPDATE ON JD11.BOOK
FOR EACH ROW
BEGIN
   IF DELETING THEN
      INSERT INTO JD11.XBOOK (PREVISBN, TITLE, DELDATE) VALUES (:OLD.ISBN, :OLD.TITLE, SYSDATE); 
   ELSIF INSERTING THEN
      INSERT INTO JD11.NBOOK (ISBN, TITLE, ADDDATE) VALUES (:NEW.ISBN, :NEW.TITLE, SYSDATE); 
   ELSIF UPDATING ('ISBN) THEN
      INSERT INTO JD11.CBOOK (OLDISBN, NEWISBN, TITLE, UP_DATE) VALUES (:OLD.ISBN :NEW.ISBN, :NEW.TITLE, SYSDATE);
   ELSE /* UPDATE TO ANYTHING ELSE THAN ISBN */
      INSERT INTO JD11.UBOOK (ISBN, TITLE, UP_DATE) VALUES (:OLD.ISBN :NEW.TITLE, SYSDATE); 
   END IF
END;

In this case we have specified that the trigger will be executed after any data change event on any affected row. 
Within the PL/SQL block body we can check which update action is being performed for the 
currently affected row and take whatever action we feel is appropriate. Note that we can 
specify the old and new values of updated rows by prefixing column names with the :OLD and :NEW qualifiers. 


--------------------------------------------------------------------------------


The following statement creates a trigger for the Emp_tab table: 

CREATE OR REPLACE TRIGGER Print_salary_changes
BEFORE DELETE OR INSERT OR UPDATE ON Emp_tab
FOR EACH ROW
WHEN (new.Empno > 0)
DECLARE
    sal_diff number;
BEGIN
    sal_diff  := :new.sal  - :old.sal;
    dbms_output.put('Old salary: ' || :old.sal);
    dbms_output.put('  New salary: ' || :new.sal);
    dbms_output.put_line('  Difference ' || sal_diff);
END;
/

If you enter a SQL statement, such as the following: 
UPDATE Emp_tab SET sal = sal + 500.00 WHERE deptno = 10;
Then, the trigger fires once for each row that is updated, 
and it prints the new and old salaries, and the difference. 


CREATE OR REPLACE TRIGGER "SALES".HENKILOROOLI_CHECK2
  AFTER INSERT OR UPDATE OR DELETE ON AH_HENKILOROOLI

BEGIN
  IF INSERTING OR DELETING THEN
    handle_delayed_triggers ('AH_HENKILOROOLI', 'HENKILOROOLI_CHECK');
  END IF;
  IF INSERTING OR UPDATING OR DELETING THEN                             /* FE */
    handle_delayed_triggers('AH_HENKILOROOLI', 'FRONTEND_FLAG');        /* FE */
  END IF;                                                               /* FE */

END;


A trigger is either a stored PL/SQL block or a PL/SQL, C, or Java procedure associated with a table, 
view, schema, or the database itself. Oracle automatically executes a trigger when a specified event takes place, 
which may be in the form of a system event or a DML statement being issued against the table. 


Triggers can be: 

-DML triggers on tables. 
-INSTEAD OF triggers on views.
-System triggers on DATABASE or SCHEMA: With DATABASE, triggers fire for each event for all users; 
                                       with SCHEMA, triggers fire for each event for that specific user. 

BEFORE and AFTER Options 

The BEFORE or AFTER option in the CREATE TRIGGER statement specifies exactly when to fire the 
trigger body in relation to the triggering statement that is being run. 
In a CREATE TRIGGER statement, the BEFORE or AFTER option is specified just before the triggering statement. 
For example, the PRINT_SALARY_CHANGES trigger in the previous example is a BEFORE trigger. 

INSTEAD OF Triggers 

The INSTEAD OF option can also be used in triggers. INSTEAD OF triggers provide a transparent way 
of modifying views that cannot be modified directly through UPDATE, INSERT, and DELETE statements. 
These triggers are called INSTEAD OF triggers because, unlike other types of triggers, 
Oracle fires the trigger instead of executing the triggering statement. 
The trigger performs UPDATE, INSERT, or DELETE operations directly on the underlying tables. 


CREATE TABLE Project_tab (
   Prj_level NUMBER, 
   Projno    NUMBER,
   Resp_dept NUMBER);
CREATE TABLE Emp_tab (
   Empno     NUMBER NOT NULL,
   Ename     VARCHAR2(10),
   Job       VARCHAR2(9),
   Mgr       NUMBER(4),
   Hiredate  DATE,
   Sal       NUMBER(7,2),
   Comm      NUMBER(7,2),
   Deptno    NUMBER(2) NOT NULL);
   
CREATE TABLE Dept_tab (
   Deptno    NUMBER(2) NOT NULL,
   Dname     VARCHAR2(14),
   Loc       VARCHAR2(13),
   Mgr_no    NUMBER,
   Dept_type NUMBER);


The following example shows an INSTEAD OF trigger for inserting rows into the MANAGER_INFO view. 

CREATE OR REPLACE VIEW manager_info AS
    SELECT e.ename, e.empno, d.dept_type, d.deptno, p.prj_level,
           p.projno
        FROM   Emp_tab e, Dept_tab d, Project_tab p
        WHERE  e.empno =  d.mgr_no
        AND    d.deptno = p.resp_dept;

CREATE OR REPLACE TRIGGER manager_info_insert
INSTEAD OF INSERT ON manager_info
REFERENCING NEW AS n                 -- new manager information

FOR EACH ROW
DECLARE
   rowcnt number;
BEGIN
   SELECT COUNT(*) INTO rowcnt FROM Emp_tab WHERE empno = :n.empno;
   IF rowcnt = 0  THEN
       INSERT INTO Emp_tab (empno,ename) VALUES (:n.empno, :n.ename);
   ELSE
      UPDATE Emp_tab SET Emp_tab.ename = :n.ename
         WHERE Emp_tab.empno = :n.empno;
   END IF;
   SELECT COUNT(*) INTO rowcnt FROM Dept_tab WHERE deptno = :n.deptno;
   IF rowcnt = 0 THEN
      INSERT INTO Dept_tab (deptno, dept_type) 
         VALUES(:n.deptno, :n.dept_type);
   ELSE
      UPDATE Dept_tab SET Dept_tab.dept_type = :n.dept_type
         WHERE Dept_tab.deptno = :n.deptno;
   END IF;
   SELECT COUNT(*) INTO rowcnt FROM Project_tab 
      WHERE Project_tab.projno = :n.projno;
   IF rowcnt = 0 THEN
      INSERT INTO Project_tab (projno, prj_level) 
         VALUES(:n.projno, :n.prj_level);
   ELSE
      UPDATE Project_tab SET Project_tab.prj_level = :n.prj_level
         WHERE Project_tab.projno = :n.projno;
   END IF;
END;
 

FOR EACH ROW Option

The FOR EACH ROW option determines whether the trigger is a row trigger or a statement trigger. 
If you specify FOR EACH ROW, then the trigger fires once for each row of the table that is affected 
by the triggering statement. The absence of the FOR EACH ROW option indicates that the trigger fires only once 
for each applicable statement, but not separately for each row affected by the statement. 

For example, you define the following trigger: 


--------------------------------------------------------------------------------
Note: 
You may need to set up the following data structures for certain examples to work: 

CREATE TABLE Emp_log (
   Emp_id     NUMBER, 
   Log_date   DATE,
   New_salary NUMBER, 
   Action     VARCHAR2(20));

  
--------------------------------------------------------------------------------
 

CREATE OR REPLACE TRIGGER Log_salary_increase
AFTER UPDATE ON Emp_tab
FOR EACH ROW
WHEN (new.Sal > 1000)
BEGIN
    INSERT INTO Emp_log (Emp_id, Log_date, New_salary, Action)
       VALUES (:new.Empno, SYSDATE, :new.SAL, 'NEW SAL');
END;


Then, you enter the following SQL statement: 

UPDATE Emp_tab SET Sal = Sal + 1000.0
    WHERE Deptno = 20;


If there are five employees in department 20, then the trigger fires five times when this statement is entered, 
because five rows are affected. 

The following trigger fires only once for each UPDATE of the Emp_tab table: 

CREATE OR REPLACE TRIGGER Log_emp_update
AFTER UPDATE ON Emp_tab
BEGIN
    INSERT INTO Emp_log (Log_date, Action)
        VALUES (SYSDATE, 'Emp_tab COMMISSIONS CHANGED');
END;


Trigger Size
The size of a trigger cannot be more than 32K. 

Valid SQL Statements in Trigger Bodies 
The body of a trigger can contain DML SQL statements. It can also contain SELECT statements, 
but they must be SELECT... INTO... statements or the SELECT statement in the definition of a cursor. 

DDL statements are not allowed in the body of a trigger. 
Also, no transaction control statements are allowed in a trigger. 
ROLLBACK, COMMIT, and SAVEPOINT cannot be used.For system triggers, {CREATE/ALTER/DROP} TABLE statements 
and ALTER...COMPILE are allowed. 


Recompiling Triggers 
Use the ALTER TRIGGER statement to recompile a trigger manually. 
For example, the following statement recompiles the PRINT_SALARY_CHANGES trigger: 

  ALTER TRIGGER Print_salary_changes COMPILE;

Disable enable trigger:

  ALTER TRIGGER Reorder DISABLE;
  ALTER TRIGGER Reorder ENABLE;

Or in 1 time for all triggers on a table:

  ALTER TABLE Inventory
  DISABLE ALL TRIGGERS;


ALTER DATABASE rename GLOBAL_NAME TO NEW_NAME; 


====================================
19 BACKUP RECOVERY, TROUBLESHOOTING:
====================================


19.1 SCN:
--------

The Control files and all datafiles contain the last SCN (System Change Number) after:

- checkpoint, for example via ALTER SYSTEM CHECKPOINT, 
- shutdown normal/immediate/transactional, 
- log switch occurs by the system
- via alter system switch logfile, 
- alter tablespace begin backup etc..

at checkpoint the following occurs:
------------------------------------

-  The database writer (DBWR) writes all modified database 
   blocks in the buffer cache back to datafiles, 
-  Log writer (LGWR) or Checkpoint process (CHKPT) updates both the controlfile and 
   the datafiles to indicate when the last checkpoint 
   occurred (SCN)

Log switching causes a checkpoint, but a checkpoint does
not cause a logswitch.

LGWR writes logbuffers to online redo log:
------------------------------------------

- at commit
- redolog buffers 1/3 full, > 1 MB changes
- before DBWR writes modified blocks to datafiles

LOG_CHECKPOINT_INTERVAL init.ora parameter:
-------------------------------------------

The LOG_CHECKPOINT_INTERVAL init.ora parameter controls how often a checkpoint 
operation will be performed based upon the number of operating system blocks 
that have been written to the redo log.  If this value is larger than the size 
of the redo log, then the checkpoint will only occur when Oracle performs a 
log switch FROM one group to another, which is preferred. 

NOTE: Starting with Oracle 8.1, LOG_CHECKPOINT_INTERVAL will be interpreted 
to mean that the incremental checkpoint should not lag the tail of the 
log by more than log_checkpoint_interval number of redo blocks. 

On most Unix systems the operating system block size is 512 bytes.  This means 
that setting LOG_CHECKPOINT_INTERVAL to a value of 10,000 (the default 
setting), causes a checkpoint to occur after 5,120,000 (5M) bytes are written 
to the redo log.  If the size of your redo log is 20M, you are taking 4 
checkpoints for each log. 

LOG_CHECKPOINT_TIMEOUT init.ora parameter:
------------------------------------------

The LOG_CHECKPOINT_TIMEOUT init.ora parameter controls how often a checkpoint 
will be performed based on the number of seconds that have passed since the 
last checkpoint.  

NOTE: Starting with Oracle 8.1, LOG_CHECKPOINT_TIMEOUT will be interpreted 
to mean that the incremental checkpoint should be at the log position 
WHERE the tail of the log was LOG_CHECKPOINT_TIMEOUT seconds ago. 

Checkpoint frequency impacts the time required for the 
database to recover FROM an unexpected failure.  Longer intervals between 
checkpoints mean that more time will be required during database recovery. 

LOG_CHECKPOINTS_TO_ALERT init.ora parameter:
--------------------------------------------

The LOG_CHECKPOINTS_TO_ALERT init.ora parameter, when set to a value of TRUE, 
allows you to log checkpoint start and stop times in the alert log.  This is 
very helpful in determining if checkpoints are occurring at the optimal 
frequency and gives a chronological view of checkpoints and other database 
activities occurring in the background. 

It is a misconception that setting LOG_CHECKPOINT_TIMEOUT to a given value 
will initiate a log switch at that interval, enabling a recovery window used 
for a stand-by database configuration.  Log switches cause a checkpoint, but a 
checkpoint does not cause a log switch.  The only way to cause a log switch is 
manually with ALTER SYSTEM SWITCH LOGFILE or resizing the redo logs to cause 
more 

FAST_START_MTTR_TARGET init.ora parameter:
------------------------------------------

FAST_START_MTTR_TARGET enables you to specify the number of seconds the database 
takes to perform crash recovery of a single instance. 
It is the number of seconds it takes to recover FROM crash recovery.
The lower the value, the more often DBWR will write the blocks to disk.
FAST_START_MTTR_TARGET 
can be overridden by either FAST_START_IO_TARGET or LOG_CHECKPOINT_INTERVAL. 


FAST_START_IO_TARGET init.ora paramater:
----------------------------------------

FAST_START_IO_TARGET (available only with the Oracle Enterprise Edition) 
specifies the number of I/Os that should be needed during crash or instance recovery. 

Smaller values for this parameter result in faster recovery times. 
This improvement in recovery perfoRMANce is achieved at the expense of 
additional writing activity during normal processing. 

ARCHIVE_LAG_TARGET init.ora parameter:
--------------------------------------

The following initialization parameter setting sets the log switch interval
to 30 minutes (a typical value).

ARCHIVE_LAG_TARGET = 1800


19.2 init.ora parameters and ARCHIVE MODE:
----------------------------------------

LOG_ARCHIVE_DEST=/oracle/admin/cc1/arch
LOG_ARCHIVE_DEST_1=d:\oracle\oradata\arc
LOG_ARCHIVE_START=TRUE
LOG_ARCHIVE_FORMAT=arc_%s.log

LOG_ARCHIVE_DEST_1=
LOG_ARCHIVE_DEST_2=
LOG_ARCHIVE_MAX_PROCESSES=2


19.3 Enabling or disabling archive mode:
----------------------------------

ALTER DATABASE ARCHIVELOG   (mounted, niet open)
ALTER DATABASE NOARCHIVELOG (mounted, niet open)


19.4 Implementation backup in archive mode via OS script:
--------------------------------------------------------


19.4.1 OS backup script in unix
------------------------------

###############################################
# Example archive log backup script in UNIX:  #
###############################################

# Set up the environment to point to the correct database

ORACLE_SID=CC1; export ORACLE_SID
ORAENV_ASK=NO; export ORAENV_ASK
.oraenv

# Backup the tablespaces

svrmgrl <<EOFarch1
connect internal

alter tablespace SYSTEM begin backup;
! tar -cvf /dev/rmt/0hc /u01/oradata/sys01.dbf 
alter tablespace data end backup;

alter tablespace DATA begin backup;
! tar -rvf /dev/rmt/0hc /u02/oradata/data01.dbf 
alter tablespace data end backup;
etc
..
..
# Now we backup the archived redo logs before we delete them.
# We must briefly stop the archiving process in order that
# we do not miss the latest files for sure.

archive log stop;
exit
EOFarch1

# Get a listing of all archived files.

FILES='ls /db01/oracle/arch/cc1/arch*.dbf'; export FILES

# Start archiving again

svrmgrl <<EOFarch2
connect internal
archive log start;
exit
EOFarch2

# Now backup the archived files to tape

tar -rvf /dev/rmt/0hc $FILES

# Delete the backupped archived files

rm -f $FILES

# Backup the control file

svrmgrl <<EOFarch3
connect internal
alter database backup controlfile to '/db01/oracle/cc1/cc1controlfile.bck';
exit
EOFarch3

tar -rvf /dev/rmt/0hc /db01/oracle/cc1/cc1controlfile.bck

###############################
# End backup script example   #
###############################


19.5 Tablespaces en datafiles online/offline in non-archive en archive mode:
---------------------------------------------------------------------------

Tablespace:

Een tablespace kan in archive mode en non-archive mode offline worden
geplaatst zonder dat media recovery nodig is. 
Dit is zo met de NORMAL clausule: alter tablespace offline normal;
Met de immediate clausule is wel recovery nodig.

Datafile;

Een datafile kan in archive mode offline worden gezet. 
Als de datafile online wordt gebracht, moet eerst media recovery wordfen toegepast.
Een datafile kan in non-archive mode niet offline worden geplaatst.

Backup mode:

When you issue ALTER TABLESPACE .. BEGIN BACKUP, it freezes the datafile header. 
This is so that we know what redo logs we need to apply to a given file to make 
it consistent.  While you are backing up that file hot, we are still writing to 
it -- it is logically inconsistent.  Some of the backed up blocks could be from 
the SCN in place at the time the backup began -- others from the time it ended 
and others from various points in between.


19.6 Recovery in archive mode:
-----------------------------

19.6.1: recovery waarbij een current controlfile bestaat
=======================================================

Media recovery na de loss van datafile(s) en dergelijke,
gebeurt normaliter op basis van de SCN in de controlfile.

A1: complete recovery:
------------------
RECOVER DATABASE            (database not open)
RECOVER TABLESPACE DATA     (database open, except this tablespace)
RECOVER DATAFILE 5          (database open, except this datafile)

A2: incomplete recovery:
------------------------

time based:    recover database until time '1999-12-31:23.40.00'
cancel based:  recover database until cancel
change bases:  recover database until change 60747681;

Bij beide recoveries worden de archived redo logs toegepast.

Een incomplete recovery altijd met 
"alter database open resetlogs;"
uitvoeren om de nieuwe logentries te purgen uit de online redo files


19.6.2: Recovery zonder huidige controlfile
========================================== 


media recovery wanneer er geen huidige controlfile bestaat

De control file bevat dus een SCN die te oud is t.o.v. de SCN's
in de archived redo logs.
Dit moet je Oracle laten weten via

RECOVER DATABASE UNTIL CANCEL USING BACKUP CONTROLFILE;

specifying "using backup controlfile" is effectively telling oracle that you've lost your controlfile, 
and thus SCN's in file headers cannot be compared to anything. So Oracle will happily keep applying archives 
until you tell it to stop (or run out) 


19.7 Queries om SCN te vinden:
-----------------------------

Iedere redo log is geassocieerd met een hoog en laag scn

In V$LOG_HISTORY, V$ARCHIVED_LOG, V$DATABASE, V$DATAFILE_HEADER, V$DATAFILE  staan scn's:

Queries:
--------

SELECT file#, substr(name, 1, 30), status, checkpoint_change#            -- uit controlfile
FROM V$DATAFILE;

SELECT file#, substr(name, 1, 30), status, fuzzy, checkpoint_change#      -- uit file header
FROM V$DATAFILE_HEADER;

SELECT first_change#, next_change#, sequence#, archived, substr(name, 1, 40) 
FROM V$ARCHIVED_LOG;

SELECT recid, first_change#, sequence#, next_change# 
FROM V$LOG_HISTORY;

SELECT resetlogs_change#, checkpoint_change#, controlfile_change#, open_resetlogs
FROM V$DATABASE;

SELECT * FROM  V$RECOVER_FILE  -- Which file needs recovery

Find the latest archived redologs:

   SELECT name
   FROM v$archived_log
   WHERE sequence# = (SELECT max(sequence#) FROM v$archived_log
                     WHERE 1699499 >= first_change#;


sequence#             : geeft het nummer aan van de archived redo log
first_change#         : eerste scn in archived redo log
next_change#          : laatste scn in archived redo log, en de eerste scn van de volgende log
checkpoint_change#    : laatste actuele SCN
FUZZY                 : Y/N, indien YES dan bevat de file changes die later zijn dan de scn in de header
A datafile that contains a block whose SCN is more recent than the SCN of its header is called a fuzzy datafile. 


19.8 Archived redo logs nodig voor recovery:
-------------------------------------------

In V$RECOVERY_LOG staan die archived logs vermeld
die nodig zijn bij een recovery.

Je kunt ook V$RECOVER_FILE gebruiken om te bepalen welke files moeten recoveren. 

SELECT * FROM v$recover_file; 

Hier vind je de FILE# en deze kun je weer gebruiken met v$datafile en v$tablespace:

SELECT d.name, t.name 
FROM v$datafile d, v$tablespace t 
WHERE t.ts# = d.ts# 
AND d.file# in (14,15,21);  # use values obtained FROM V$RECOVER_FILE query 


19.9 voorbeeld recovery 1 datafile:
----------------------------------

Stel 1 datafile is corrupt. Nu behoeft slechts die ene file te worden teruggezet
en daarna recovery toe te passen.

SVRMGRL>alter database datafile '/u01/db1/users01.dbf' offline;

$ cp /stage/users01.dbf /u01/db1

SVRMGRL>recover datafile '/u01/db1/users01.dbf';

en oracle komt met een suggestie van het toepassen van archived logfiles

SVRMGRL>alter database datafile '/u01/db1/users01.dbf' online;


19.10 voorbeeld recovery database:
---------------------------------

Stel meerdere datafiles zijn verloren. Zet nu backup files terug.

SVRMGRL>startup mount;
SVRMGRL>recover database;

en oracle zal de archived redo logfiles toepassen.

media recovery complete

SVRMGRL>alter database open;


19.11 restore naar ANDere disks:
-------------------------------

- alter database backup controlfile to trace;
- restore files naar nieuwe lokatie:
- edit control file met nieuwe lokatie files
- save dit als .sql script en voer het uit: 
SVRMGRL>@new.sql

controlfile:

startup nomount
create controlfile reuse database "brdb" noresetlogs archivelog
maxlogfiles 16
maxlogmembers 2
maxdatafiles 100
maxinstances 1
maxloghistory 226

logfile
group 1 ('/disk03/db1/redo/redo01a.dbf', '/disk04/db1/redo/redo01b.dbf') size 2M,
group 2 ('/disk03/db1/redo/redo02a.dbf', '/disk04/db1/redo/redo02b.dbf') size 2M

datafile
'/disk04/oracle/db1/sys01.dbf',
'/disk05/oracle/db1/rbs01.dbf',
'/disk06/oracle/db1/data01.dbf',
'/disk04/oracle/db1/index01.dbf',

character set 'us7ascii'
;
RECOVER DATABASE UNTIL CANCEL USING BACKUP CONTROLFILE;
ALTER DATABASE OPEN RESETLOGS;


19.12 Copy van database naar ANDere Server:
------------------------------------------

1. kopieer alle files precies van ene lokatie naar ANDere

2. source server: alter database backup controlfile to trace

3. Maak een juiste init.ora met references nieuwe server

4. edit de ascii versie controlfile uit stap 2 waarbij alle schijflokaties verwijzen naar de target

STARTUP NOMOUNT

CREATE CONTROLFILE REUSE SET DATABASE "FSYS" RESETLOGS noARCHIVELOG
MAXLOGFILES 8
MAXLOGMEMBERS 4
etc..


ALTER DATABASE OPEN resetlogs;

of

CREATE CONTROLFILE REUSE SET DATABASE "TEST" RESETLOGS ARCHIVELOG
..
#RECOVER DATABASE
ALTER DATABASE OPEN RESETLOGS;

ALTER DATABASE OPEN RESETLOGS;


CREATE CONTROLFILE REUSE DATABASE "PROD" NORESETLOGS ARCHIVELOG
..
..
RECOVER DATABASE
# All logs need archiving AND a log switch is needed.
ALTER SYSTEM ARCHIVE LOG ALL;
# Database can now be opened normally.
ALTER DATABASE OPEN;

5. SVRMGRL>@script

bij probleem: delete originele controlfiles en geen reuse.

Voorbeeld create controlfile:
-----------------------------

If you want another database name use CREATE CONTROLFILE SET DATABASE

STARTUP NOMOUNT
CREATE CONTROLFILE REUSE DATABASE "O901" RESETLOGS NOARCHIVELOG
    MAXLOGFILES 50
    MAXLOGMEMBERS 5
    MAXDATAFILES 100
    MAXINSTANCES 1
    MAXLOGHISTORY 113
LOGFILE
  GROUP 1 'D:\ORACLE\ORADATA\O901\REDO01.LOG'  SIZE 100M,
  GROUP 2 'D:\ORACLE\ORADATA\O901\REDO02.LOG'  SIZE 100M,
  GROUP 3 'D:\ORACLE\ORADATA\O901\REDO03.LOG'  SIZE 100M
DATAFILE
  'D:\ORACLE\ORADATA\O901\SYSTEM01.DBF',
  'D:\ORACLE\ORADATA\O901\UNDOTBS01.DBF',
  'D:\ORACLE\ORADATA\O901\CWMLITE01.DBF',
  'D:\ORACLE\ORADATA\O901\DRSYS01.DBF',
  'D:\ORACLE\ORADATA\O901\EXAMPLE01.DBF',
  'D:\ORACLE\ORADATA\O901\INDX01.DBF',
  'D:\ORACLE\ORADATA\O901\TOOLS01.DBF',
  'D:\ORACLE\ORADATA\O901\USERS01.DBF'
CHARACTER SET UTF8
;

Voorbeeld controlfile:
----------------------

STARTUP NOMOUNT
CREATE CONTROLFILE REUSE DATABASE "SALES" NORESETLOGS ARCHIVELOG
    MAXLOGFILES 5
    MAXLOGMEMBERS 2
    MAXDATAFILES 255
    MAXINSTANCES 2
    MAXLOGHISTORY 1363
LOGFILE
  GROUP 1 (
    '/oradata/system/log/log1.log',
    '/oradata/dump/log/log1.log'
  ) SIZE 100M,
  GROUP 2 (
    '/oradata/system/log/log2.log',
    '/oradata/dump/log/log2.log'
  ) SIZE 100M
DATAFILE
  '/oradata/system/system.dbf',
  '/oradata/rbs/rollback.dbf',
  '/oradata/rbs/rollbig.dbf',
  '/oradata/system/users.dbf',
  '/oradata/temp/temp.dbf',
  '/oradata/data_big/ahp_lkt_data_small.dbf',
  '/oradata/data_small/ahp_lkt_data_big.dbf',
  '/oradata/data_big/ahp_lkt_index_small.dbf',
  '/oradata/index_small/ahp_lkt_index_big.dbf',
  '/oradata/data_small/maniin_ah_data_small.dbf',
  '/oradata/index_small/maniin_ah_data_big.dbf',
  '/oradata/index_big/maniin_ah_index_small.dbf',
  '/oradata/index_big/maniin_ah_index_big.dbf',
  '/oradata/index_big/fe_heat_data_big.dbf',
  '/oradata/data_small/fe_heat_index_big.dbf',
  '/oradata/data_small/eksa_data_small.dbf',
  '/oradata/data_big/eksa_data_big.dbf',
  '/oradata/index_small/eksa_index_small.dbf',
  '/oradata/index_big/eksa_index_big.dbf',
  '/oradata/data_small/provisioning_data_small.dbf',
  '/oradata/data_small/softplan_data_small.dbf',
  '/oradata/index_small/provisioning_index_small.dbf',
  '/oradata/system/tools.dbf',
  '/oradata/index_small/fe_heat_index_small.dbf',
  '/oradata/data_small/softplan_data_big.dbf',
  '/oradata/index_small/softplan_index_small.dbf',
  '/oradata/index_small/softplan_index_big.dbf',
  '/oradata/data_small/fe_heat_data_small.dbf'
;
# Recovery is required if any of the datafiles are restored backups,
# or if the last shutdown was not normal or immediate.
RECOVER DATABASE UNTIL CANCEL USING BACKUP CONTROLFILE;
ALTER DATABASE OPEN RESETLOGS;


19.13 PROBLEMS DURING RECOVERY:
-------------------------------


    BEGIN BACKUP        END BACKUP        normal business
                                           |
      system=453        switch logfile     |
          users=455      |                 |                   CRASH
            tools=459    |                 |                   |
           |             |                 |                   |
------------------------------------------------------------------------------
          t=t0          t=t1              t=t2                t=t3


ORA-01194, ORA-01195:
---------------------

-------
Note 1:
-------

Suppose the system comes with:

ORA-01194: file 1 needs more recovery to be consistent 
ORA-01110: data file 1: '/u03/oradata/tstc/dbsyst01.dbf' 

Either you had the database in archive mode or in non archive mode:

archive mode

RECOVER DATABASE UNTIL CANCEL USING BACKUP CONTROLFILE;
ALTER DATABASE OPEN RESETLOGS;

non-archive mode:

# RECOVER DATABASE UNTIL CANCEL USING BACKUP CONTROLFILE;
ALTER DATABASE OPEN RESETLOGS;

If you have checked that the scn's of all files are the samed number,
you might try in the init.ora file:

_allow_resetlogs_corruption = true


-------
Note 2:
-------

Problem Description 
-------------------  
You restored your hot backup and you are trying to do a point-in-time recovery. 
When you tried to open your database you received the following error:   
ORA-01195: online backup of file <name> needs more recovery to be consistent       
Cause: An incomplete recovery session was started, but an insufficient              
number of redo logs were applied to make the file consistent.             

The reported file is an online backup that must be recovered to the time the backup ended.     
Action: Either apply more redo logs until the file is consistent or restore the file from an older backup 
and repeat the recovery.             
For more information about online backup, see the index entry              
"online backups" in the <Oracle7 Server Administrator's Guide>.   
This is assuming that the hot backup completed error free.   

Solution Description 
--------------------  
Continue to apply the requested logs until you are able to open the  database.   

Explanation 
-----------  
When you perform hot backups on a file, the file header is frozen.  For example, 
datafile01 may have a file header frozen at SCN #456.  When you backup the next datafile 
the SCN # may be differnet.   For example the file header for datafile02 may be frozen 
with SCN #457.   Therefore, you must apply archive logs until you reach the SCN #  of the 
last file that was backed up.  Usually, applying one or two more archive logs will solve 
the problem, unless  there was alot of activity on the database during the backup. 


-------
Note 3:
-------

ORA-01194: file 1 needs more recovery to be consistent

I am working with a test server, I can load it again but I would like to know if this 
kind of problem could be solved or not. Just to let you know, that I am new 
in Oracle Database Administration. 

I ran a hot backup script, which deleted the old ARCHIVE, logs at the end. 
After checking the script's log, I realized that the hot backup was not successful and it 
deleted the Archives. I tried to startup the database and an error occurred; 
"ORA-01589: must use RESETLOGS or NORESETLOGS option for database open" 
I tried to open it with the RESETLOGS option then another error occurred; 
"ORA-01195: online backup of file 1 needs more recovery to be consistent" 

Just because, it was a test environment, I have never taken any cold backups. 
I still have hot backups. I don't know how to recover from those. 
If anyone can tell me how to do it from SQLPLUS (SVRMGRL is not loaded), 
I would really appreciate it. 

Thanks, 

Hi Hima, 

The following might help. You now have a database that is operating 
like it's in noarchive mode since the logs are gone. 

1. Mount the database. 
2. Issue the following query: 

SELECT V1.GROUP#, MEMBER, SEQUENCE#, FIRST_CHANGE# 
FROM V$LOG V1, V$LOGFILE V2 
WHERE V1.GROUP# = V2.GROUP# ; 

This will list all your online redolog files and their respective 
sequence and first change numbers. 

3. If the database is in NOARCHIVELOG mode, issue the query: 

SELECT FILE#, CHANGE# FROM V$RECOVER_FILE; 

If the CHANGE# is GREATER than the minimum FIRST_CHANGE# 
of your logs, the datafile can be recovered. 

4. Recover the datafile, after taking offline, you cannot take 
system offline which is the file in error in your case. 

RECOVER DATAFILE '<full_path_file_name>' 


5. Confirm each of the logs that you are prompted for until you 
receive the message "Media recovery complete". If you are prompted for a non-existing 
archived log, Oracle probably needs one or more of the online logs to proceed with the recovery. 
Compare the sequence number referenced in the ORA-280 message with the sequence numbers 
of your online logs. Then enter the full path name of one of the members of the redo group 
whose sequence number matches the one you are being asked for. Keep entering 
online logs as requested until you receive the message "Media recovery 
complete". 

6. Bring the datafile online. No need for system. 

7. If the database is at mount point, open it 

Perform a full closed backup of the existing database 


-------
Note 4:
-------

Recover until time using backup controlfile

Hi, 

I am trying to perform an incomplete recovery to an arbitrary point in time in the past. Eg. I want 
to go back five minutes. 

I have a hot backup of my database. (Tablespaces into hotbackup mode, copy files, tablespaces out 
of hotbackup mode, archive current log, backup controlfile to a file and also to a trace). 
(yep im in archivelog mode as well) 

I shutdown the current database and blow the datafiles,online redo logs,controlfiles away. 

I restore my backup copy of the database - (just the datafiles) startup nomount and then run 
an edited controlfile trace backup (with resetlogs). 

I then RECOVER DATABSE UNTIL TIME 'whenever' USING BACKUP CONTROLFILE. 

I'm prompted for logs in the usual way but the recovery ends with an ORA-1547 - 

Recover succeeded but open resetlogs would give the following error. 
The next error is that datafile 1 (system ts) - would need more recovery. 

Now metalink tells me that this is usually due to backups being restored that are older 
than the archive redo logs - this isn't the case. I have all the archive redo logs I need to 
cover the time the backup was taken up to the present. The time specified in the recovery 
is after the backup as well. 
What am I missing here? Its driving me nuts. I'm off back to the docs again! 

Thanks in advance 

Tim 

--------------------------------------------------------------------------------

From: Anand Devaraj 15-Aug-02 15:15 
Subject: Re : Recover until time using backup controlfile 


The error indicates that Oracle requires a few more scns to get all the datafiles in sync. 
It is quite possible that those scns are present in the online redo logfiles which were lost. 
In such cases when Oracle asks for a non-existent archive log, you should provide the complete path 
of the online log file for the recovery to succeed. 
Since you dont have an online log file you should use 
RECOVER DATABASE UNTIL CANCEL USING BACKUP CONTROLFILE. 

In this case when you exhaust all the archive log files, you issue the cancel command which will 
automatically rollback all the incomplete transactions and get all the datafile headers 
in sync with the controlfile. 

To do an incomplete recovery using time,you usually require the online logfiles to be present. 

Anand 

--------------------------------------------------------------------------------

From: Radhakrishnan paramukurup 15-Aug-02 16:19 
Subject: Re : Recover until time using backup controlfile 


I am not sure whether you have missed this step or just missed in the note. 
You need to also to switch the log at the end of the back up (I do as a matter of practice else you 
need the next log which is not sure to be available in case of a failure). Otherwise some of the changes 
to reach a consistant state is still in the online log and you can never open untill 
you reach a consistent state. 

Hope this helps ........ 

--------------------------------------------------------------------------------

From: Mark Gokman 15-Aug-02 16:41 
Subject: Re : Recover until time using backup controlfile 

To successfully perform incomplete recovery, you need a full db backup that was completed prior 
to the point to which you want to recover, plus you need all archive logs containing all SCNs 
up to the point to which you want to recover. 
Applying these rules to your case, I have two questions: 
- are you recovering to the point in time AFTER the time the successful full backup was copleted? 
- is there an archive log that was generated AFTER the time you specify in until time? 
If both answers are yes, then you should have no problems. 
I actually recently performed such a recovery several times. 


--------------------------------------------------------------------------------

From: Tim Palmer 15-Aug-02 18:02 
Subject: Re : Re : Recover until time using backup controlfile 


Thanks Guys! I think Mark has hit the nail on the head here. I was being an idiot! 
Ive ran this exercise a few more times (with success) and I am convinced that what I was doing 
was trying to recover to a point in time that basically was before the latest scn of any one file 
in the hot backup set I was using - convinced myself that I wasnt - 
but I must have been..... perhaps I need a holiday! 

Thanks again 

Tim 

--------------------------------------------------------------------------------

From: Oracle, Rowena Serna 16-Aug-02 15:44 
Subject: Re : Recover until time using backup controlfile 


Thanks to mark for his input for helping you out. 

-------
Note 5:
-------

ORA-01547: warning: RECOVER succeeded but OPEN RESETLOGS would get error below
ORA-01152: file 2 was not restored from a sufficiently old backup
ORA-01110: data file 2: 'D:\ORACLE\ORADATA\<instance>\UNDOTBS01.DBF'

File number, name and directory may vary depending on Oracle configuration 

Details:
Undo tablespace data description

In an Oracle database, Undo tablespace data is an image or snapshot of the original contents 
of a row (or rows) in a table. This data is stored in Undo segments (formerly Rollback segments 
in earlier releases of Oracle) in the Undo tablespace. When a user begins to make a change to the data 
in a row in an Oracle table, the original data is first written to Undo segments in the Undo tablespace. 
The entire process (including the creation of the Undo data) is recorded in Redo logs before 
the change is completed and written in the Database Buffer Cache, and then the data files via the 
database writer (DBWn) process. 

If the transaction does not complete due to some error or should there be a user decision 
to reverse (rollback) the change, this Undo data is critical for the ability to roll back 
or undo the changes that were made. Undo data also ensures a way to provide read consistency 
in the database. Read consistency means that if there is a data change in a row of data that 
is not yet committed, a new query of this same row or table will not display any of the 
uncommitted data to other users, but will use the information from the Undo segments in the Undo tablespace 
to actually construct and present a consistent view of the data that only includes 
committed transactions or information. 

During recovery, Oracle uses its Redo logs to play forward through transactions in a database 
so that all lost transactions (data changes and their Undo data generation) are replayed into 
the database. Then, once all the Redo data is applied to the data files, Oracle uses the information 
in the Undo segments to undo or roll back all uncommitted transactions. Once recovery is complete, 
all data in the database is committed data, the System Change Numbers (SCN) on all data files 
and the control_files match, and the database is considered consistent. 

As for Oracle 9i, the default method of Undo management is no longer manual, but automatic; 
there are no Rollback segments in individual user tablespaces, and all Undo management is processed 
by the Oracle server, using the Undo tablespace as the container to maintain the Undo segments 
for the user tablespaces in the database. The tablespace that still maintains its own Rollback segments 
is the System tablespace, but this behavior is by design and irrelevant to the discussion here. 

If this configuration is left as the default for the database, and the 5.022 or 5.025 version of the 
VERITAS Backup Exec (tm) Oracle Agent is used to perform Oracle backups, the Undo tablespace 
will not be backed up. If Automatic Undo Management is disabled and the database administrator (DBA) 
has modified the locations for the Undo segments (if the Undo data is no longer in the Undo tablespace), 
this data may be located elsewhere, and the issues addressed by this TechNote may not affect 
the ability to fully recover the database, although it is still recommended that the upgrade 
to the 5.026 Oracle Agent be performed.


Scenario 1

The first scenario would be a recovery of the entire database to a previous point-in-time. 
This type of recovery would utilize the RECOVER DATABASE USING BACKUP CONTROLFILE statement 
and its customizations to restore the entire database to a point before the entry of improper 
or corrupt data or to roll back to a point before the accidental deletion of critical data. 
In this type of situation, the most common procedure for the restore is to just restore 
the entire online backup over the existing Oracle files with the database shutdown. 
(See the Related Documents section for the appropriate instructions on how to restore and recover 
an Oracle database to a point-in-time using an online backup.)

In this scenario, where the entire database would be rolled back in time, an offline restore 
would include all data files, archived log files, and the backup control_file from the tape 
or backup media. Once the RECOVER DATABASE USING BACKUP CONTROLFILE command was executed, 
Oracle would begin the recovery process to roll forward through the Redo log transactions, 
and it would then roll back or undo uncommitted transactions. 

At the point when the recovery process started on the actual Undo tablespace, Oracle would see that 
the SCN of that tablespace was too high (in relation to the record in the control_file). 
This would happen simply because the Undo tablespace wasn't on the tape or backup media that was restored, 
so the original Undo tablespace wouldn't have been overwritten, as were the other data files, 
during the restore operation. The failure would occur because the Undo tablespace would still be 
at its SCN before the restore from backup (an SCN in the future as related to the restored backup control_file). 
All other tablespaces and control_files would be back at their older SCNs (not necessarily consistent yet), 
and the Oracle server would respond with the following error messages:

ORA-01547: warning: RECOVER succeeded but OPEN RESETLOGS would get error below
ORA-01152: file 2 was not restored from a sufficiently old backup
ORA-01110: data file 2: 'D:\ORACLE\ORADATA\<instance>\UNDOTBS01.DBF'

At this point, the database cannot be opened with the RESETLOGS option, nor in a normal mode. 
Any attempt to do so yields the error referenced above.

SQL> alter database open resetlogs;
alter database open resetlogs
*

Error at line 1:
ORA-01152: file 2 was not restored from a sufficiently old backup
ORA-01110: data file 2: 'D:\ORACLE\ORADATA\DRTEST\UNDOTBS01.DBF'

The only recourse here is to recover or restore an older backup that contains an Undo tablespace, 
whether from an older online backup, or from a closed or offline backup or copy of the database. 
Without this ability to acquire an older Undo tablespace to rerun the recovery operation, 
it will not be possible to start the database. At this point, Oracle Technical Support must be contacted.


Scenario 2

The second scenario would involve the actual corruption or loss of the Undo tablespace's data files. 
If the Undo tablespace data is lost or corrupted due to media failure or other internal 
logical error or user error, this data/tablespace must be recovered. 

Oracle 9i does offer the ability to create a new Undo tablespace and to alter the Oracle Instance to use 
this new tablespace when deemed necessary by the DBA. One of the requirements to accomplish this change, though, 
is that there cannot be any active transactions in the Undo segments of the tablespace when it is time to 
actually drop it. In the case of data file corruption, uncommitted transactions in the database that have 
data in Undo segments can be extremely troublesome because the existence of any uncommitted transactions 
will lock the Undo segments holding the data so that they cannot be dropped. This will be evidenced by 
an "ORA-01548" error if this is attempted. This error, in turn, prevents the drop and recreation of 
the Undo tablespace, and thus prevents the successful recovery of the database. 

To overcome this problem, the transaction tables of the Undo segments can be traced to provide details 
on transactions that Oracle is trying to recover via rollback and these traces will also identify 
the objects that Oracle is trying to apply the undo to. Oracle Doc ID: 94114.1 may be referenced to set up 
a trace on the database startup so that the actual transactions that are locking the Undo segments 
can be identified and dropped. Dropping objects that contain uncommitted transactions that are holding 
locks on Undo segments does entail data loss, and the amount of loss depends on how much uncommitted data 
was in the Undo segments at the point of failure. 

When utilized, this trace is actually monitoring or dumping data from the transaction tables in the headers 
of the Undo segments (where the records that track the data in the Undo segments are located), 
but if the Undo tablespace's data file is actually missing, has been offline dropped, or if these 
Undo segment headers have been corrupted, even the ability to dump the transaction table data is lost 
and the only recourse at this point may be to open the database, export, and rebuild. At this point, 
Oracle Technical Support must be contacted.  

Backup Exec Agent for Oracle 5.022 and 5.025 should be upgraded to 5.026
When using the 5.022 or 5.025 version of the Backup Exec for Windows Servers Oracle Agent 
(see the Related Documents section for the appropriate instructions on how to identify the version 
of the Oracle Agent in use), the Oracle Undo tablespace is not available for backup because the 
Undo tablespace falls into the type category of Undo, and only tablespaces with a content type of 
PERMANENT are located and made available for backup. Normal full backups with all Oracle components 
selected will run without error and will complete with a successful status since the Undo tablespace 
is not actually flagged as a selection.

In most Oracle recovery situations, this absence of the Undo tablespace data for restore would not 
cause any problem because the original Undo tablespace is still available on the database server. 
Restores of User tablespaces, which do not require a rollback in time, would proceed normally 
since lost data or changes would be replayed back into the database, and Undo data would be 
available to roll back uncommitted transactions to leave the database in a consistent 
state and ready for user access. 

However, in certain recovery scenarios, (in which a rollback in time or full database recovery 
is attempted, or in the case of damaged or missing Undo tablespace data files) this missing Undo data 
can result in the inability to properly recover tablespaces back to a point-in-time, and could potentially 
render the database unrecoverable without an offline backup or the assistance of Oracle Technical Support. 
The scenarios in this TechNote describe two examples (this does not necessarily imply that these 
are the only scenarios) of how this absence of the Undo tablespace on tape or backup media, and thus its 
inability to be restored, can result in failure of the database to open and can result in actual data loss. 

The only solution to the problems referenced within this TechNote is to upgrade the Backup Exec for 
Windows Servers Oracle Agent to version 5.026, and to take new offline (closed database) and then 
new online (running database) backups of the entire Oracle 9i database as per the Oracle Agent 
documentation in the Backup Exec 9.0 for Windows Servers Administrator's Guide. Oracle 9i database backups 
made with the 5.022 and 5.025 Agent that shipped with Backup Exec 9.0 for Windows Servers 
build 4367 or build 4454 should be considered suspect in the context of the information 
provided in this TechNote.

Note: The 5.022, 5.025, and 5.026 versions of the Oracle Agent are compatible with 
Backup Exec 8.6 for Windows NT and Windows 2000, which includes support for Oracle 9i, 
as well as Backup Exec 9.0 for Windows Servers. See the Related Documents section for 
instructions on how to identify the version of the Oracle Agent in use.


-------
Note 6:
-------

- Backup 

a) Consistent backups 
A consistent backup means that all data files and control files are consistent 
to a point in time. I.e. they have the same SCN. This is the only method of 
backup when the database is in NO Archive log mode. 

b) Inconsistent backups 
An Inconsistent backup is possible only when the database is in Archivelog mode 
and proper Oracle aware software is used. Most default backup software can not 
backup open files. Special precautions need to be used and testing needs to be 
done. You must apply redo logs to the data files, in order to restore the 
database to a consistent state. 

c) Database Archive mode 
The database can run in either Archivelog mode or noarchivelog mode. 
When you first create the database, you specify if it is to be in Archivelog 
mode. Then in the init.ora file you set the parameter log_archive_start=true 
so that archiving will start automatically on startup. 
If the database has not been created with Archivelog mode enabled, you can 
issue the command whilst the database is mounted, not open. 
SVRMGR> alter database Archivelog;. 
SVRMGR> log archive start 
SVRMGR> alter database open 
SVRMGR> archive log list 
This command will show you the log mode and if automatic archival is set. 

d) Backup Methods 
Essentially, there are two backup methods, hot and cold, also known as online 
and offline, respectively. 
A cold backup is one taken when the database is shutdown. 
A hot backup is on taken when the database is running. 
Commands for a hot backup: 
1. Svrmgr>alter database Archivelog 
Svrmgr> log archive start 
Svrmgr> alter database open 
2. Svrmgr> archive log list 
--This will show what the oldest online log sequence is. As a precaution, 
always keep the all archived log files starting from the oldest online log 
sequence. 
3. Svrmgr> Alter tablespace tablespace_name BEGIN BACKUP 
4. --Using an OS command, backup the datafile(s) of this tablespace. 
5. Svrmgr> Alter tablespace tablespace_name END BACKUP 
--- repeat step 3, 4, 5 for each tablespace. 
6. Svrmgr> archive log list 
---do this again to obtain the current log sequence. You will want to make 
sure you have a copy of this redo log file. 
7. So to force an archived log, issue 
Svrmgr> ALTER SYSTEM SWITCH LOGFILE 
A better way to force this would be: 
svrmgr> alter system archive log current; 
8. Svrmgr> archive log list 
This is done again to check if the log file had been archived and to find 
the latest archived sequence number. 

9. Backup all archived log files determined from steps 2 and 8. 
Do not backup the online redo logs. These will contain the end-of-backup 
marker and can cause corruption if use doing recovery. 

10. Back up the control file: 
Svrmgr> Alter database backup controlfile to 'filename' 

e) Incremental backups 
These are backups that are taken on blocks that have been modified since the 
last backup. These are useful as they don't take up as much space and time. 
There are two kinds of incremental backups 
Cumulative and Non cumulative. 
Cumulative incremental backups include all blocks that were changed since the 
last backup at a lower level. This one reduces the work during restoration as 
only one backup contains all the changed blocks. 
Noncumulative only includes blocks that were changed since the previous backup 
at the same or lower level. 
Using rman, you issue the command "backup incremental level n" 
f) Support scenarios 
When the database crashes, you now have a backup. You restore the backup and 
then recover the database. Also, don't forget to take a backup of the control 
file whenever there is a schema change. 

RECOVERY 
========= 
There are several kinds of recovery you can perform, depending on the type of 
failure and the kind of backup you have. Essentially, if you are not running in 
archive log mode, then you can only recover the cold backup of the database and 
you will lose any new data and changes made since that backup was taken. 
If, however, the database is in Archivelog mode you will be able to restore the 
database up to the time of failure. 
There are three basic types of recovery: 
1. Online Block Recovery. 
This is performed automatically by Oracle.(pmon) Occurs when a process dies 
while changing a buffer. Oracle will reconstruct the buffer using the online 
redo logs and writes it to disk. 
2. Thread Recovery. 
This is also performed automatically by Oracle. Occurs when an instance 
crashes while having the database open. Oracle applies all the redo changes 
in the thread that occurred since the last time the thread was checkpointed. 
3. Media Recovery. 
This is required when a data file is restored from backup. The checkpoint 
count in the data files here are not equal to the check point count in the 
control file. 
This is also required when a file was offlined without checkpoint and when 
using a backup control file. 
Now let's explain a little about Redo vs Rollback. 
Redo information is recorded so that all commands that took place can be 
repeated during recovery. Rollback information is recorded so that you can undo 
changes made by the current transaction but were not committed. The Redo Logs 
are used to Roll Forward the changes made, both committed and non- committed 
changes. Then from the Rollback segments, the undo information is used to 
rollback the uncommitted changes. 
Media Failure and Recovery in Noarchivelog Mode 
In this case, your only option is to restore a backup of your Oracle 
files. 
The files you need are all datafiles, and control files. 
You only need to restore the password file or parameter files if they are lost 
or are corrupted. 
Media Failure and Recovery in Archivelog Mode 
In this case, there are several kinds of recovery you can perform, depending on 
what has been lost. The three basic kinds of recovery are: 
1. Recover database - here you use the recover database command and the database 
must be closed and mounted. Oracle will recover all datafiles that are online. 
2. Recover tablespace - use the recover tablespace command. The database can be 
open but the tablespace must be offline. 
3. Recover datafile - use the recover datafile command. The database can be 
open but the specified datafile must be offline. 
Note: You must have all archived logs since the backup you restored from, 
or else you will not have a complete recovery. 
a) Point in Time recovery: 
A typical scenario is that you dropped a table at say noon, and want to recover 
it. You will have to restore the appropriate datafiles and do a point-in-time 
recovery to a time just before noon. 
Note: you will lose any transactions that occurred after noon. 
After you have recovered until noon, you must open the database with resetlogs. 
This is necessary to reset the log numbers, which will protect the database 
from having the redo logs that weren't used be applied. 
The four incomplete recovery scenarios all work the same: 
Recover database until time '1999-12-01:12:00:00'; 
Recover database until cancel; (you type in cancel to stop) 
Recover database until change n; 
Recover database until cancel using backup controlfile; 
Note: When performing an incomplete recovery, the datafiles must be online. 
Do a select name, status from v$datafile to find out if there are any files 
which are offline. If you were to perform a recovery on a database which has 
tablespaces offline, and they had not been taken offline in a normal state, you 
will lose them when you issue the open resetlogs command. This is because the 
data file needs recovery from a point before the resetlogs option was used. 
b) Recovery without control file 
If you have lost the current control file, or the current control file is 
inconsistent with files that you need to recover, you need to recover either by 
using a backup control file command or create a new control file. You can also 
recreate the control file based on the current one using the 
'backup control file to trace' command which will create a script for you to 
run to create a new one. 
Recover database using backup control file command must be used when using a 
control file other that the current. The database must then be opened with 
resetlogs option. 
c) Recovery of missing datafile with rollback segment 
The tricky part here is if you are performing online recovery. Otherwise you 
can just use the recover datafile command. Now, if you are performing an 
online recovery, you must first ensure that in the init.ora file, you remove 
the parameter rollback_segments. Otherwise, oracle will want to use those 
rollback segments when opening the database, but can't find them and wont open. 
Until you recover the datafiles that contain the rollback segments, you need to 
create some temporary rollback segments in order for new transactions to work. 
Even if other rollback segments are ok, they will have to be taken offline. 
So, all the rollback segments that belong to the datafile need to be recovered. 
If all the datafiles belonging to the tablespace rollback_data were lost, you 
can now issue a recover tablespace rollback_data. 
Next bring the tablespace online and check the status of the rollback segments 
by doing a select segment_name, status from dba_rollback_segs; 
You will see the list of rollback segments that are in status Need Recovery. 
Simply issue alter rollback segment online command to complete. 
Don't forget to reset the rollback_segments parameter in the init.ora. 
d) Recovery of missing datafile without rollback segment 
There are three ways to recover in this scenario, as mentioned above. 
1. recover database 
2. recover datafile 'c:\orant\database\usr1orcl.ora' 
3. recover tablespace user_data 
e) Recovery with missing online redo logs 
Missing online redo logs means that somehow you have lost your redo logs before 
they had a chance to archived. This means that crash recovery cannot be 
performed, so media recovery is required instead. All datafiles will need to 
berestored and rolled forwarded until the last available archived log file is 
applied. This is thus an incomplete recovery, and as such, the recover 
database command is necessary. 
(i.e. you cannot do a datafile or tablespace recovery). 
As always, when an incomplete recovery is performed, you must open the database 
with resetlogs. 
Note: the best way to avoid this kind of a loss, is to mirror your online log 
files. 
f) Recovery with missing archived redo logs 
If your archives are missing, the only way to recover the database is to 
restore from your latest backup. You will have lost any uncommitted 
transactions which were recorded in the archived redo logs. Again, this is why 
Oracle strongly suggests mirroring your online redo logs and duplicating copies 
of the archives. 
g) Recovery with resetlogs option 
Reset log option should be the last resort, however, as we have seen from above, 
it may be required due to incomplete recoveries. (recover using a backup 
control file, or a point in time recovery). It is imperative that you backup 
up the database immediately after you have opened the database with reset logs. 
The reason is that oracle updates the control file and resets log numbers, and 
you will not be able to recover from the old logs. 
The next concern will be if the database crashes after you have opened the 
database with resetlogs, but have not had time to backup the database. 
How to recover? 
Shut down the database 
Backup all the datafiles and the control file 
Startup mount 
Alter database open resetlogs 
This will work, because you have a copy of a control file after the 
resetlogs point. 
Media failure before a backup after resetlogs. 
If a media failure should occur before a backup was made after you opened the 
database using resetlogs, you will most likely lose data. 
The reason is because restoring a lost datafile from a backup prior to the 
resetlogs will give an error that the file is from a point in time earlier, 
and you don't have its backup log anymore. 
h) Recovery with corrupted/missing rollback segments. 
If a rollback segment is missing or corrupted, you will not be able to open the 
database. The first step is to find out what object is causing the rollback to 
appear corrupted. If we can determine that, we can drop that object. 
If we can't we will need to log an iTar to engage support. 
So, how do we find out if it's actually a bad object? 
1. Make sure that all tablespaces are online and all datafiles are online. 
This can be checked through v$datafile, under the status column. 
For tablespaces associated with the datafiles, look in dba_tablespaces. 
If this doesn't show us anything, i.e., all are online, then 
2. Put the following in the init.ora: 
event = "10015 trace name context forever, level 10" 
This event will generate a trace file that will reveal information about the 
transaction Oracle is trying to roll back and most importantly, what object 
Oracle is trying to apply the undo to. 
Stop and start the database. 
3. Check in the directory that is specified by the user_dump_dest parameter 
(in the init.ora or show parameter command) for a trace file that was 
generated at startup time. 
4. In the trace file, there should be a message similar to: 
error recovery tx(#,#) object #. 
TX(#,#) refers to transaction information. 
The object # is the same as the object_id in sys.dba_objects. 
5. Use the following query to find out what object Oracle is trying to 
perform recovery on. 
select owner, object_name, object_type, status 
from dba_objects where object_id = <object #>; 
6. Drop the offending object so the undo can be released. An export or relying 
on a backup may be necessary to restore the object after the corrupted 
rollback segment goes away. 
7. After dropping the object, put the rollback segment back in the init.ora 
parameter rollback_segments, remove the event, and shutdown and startup 
the database. 
In most cases, the above steps will resolve the problematic rollback segment. 
If this still does not resolve the problem, it may be likely that the 
corruption is in the actual rollback segment. 
If in fact the rollback segment itself is corrupted, we should see if we can 
restore from a backup. However, that isn't always possible, there may not be a 
recent backup etc. In this case, we have to force the database open with the 
unsupported, hidden parameters, you will need to log an iTar to engage support. 
Please note, that this is potentially dangerous! 
When these are used, transaction tables are not read on opening of the database 
Because of this, the typical safeguards associated with the rollback segment 
are disabled. 
Their status is 'offline' in dba_rollback_segs. 
Consequently, there is no check for active transactions before dropping the 
rollback segment. If you drop a rollback segment which contains active 
transactions then you will have logical corruption. Possibly this corruption 
will be in the data dictionary. 
If the rollback segment datafile is physically missing, has been offlined 
dropped, or the rollback segment header itself is corrupt, there is no way to 
dump the transaction table to check for active transactions. So the only thing 
to do is get the database open, export and rebuild. Log an iTar to engage support 
to help with this process. 
If you cannot get the database open, there is no other alternative than 
restoring from a backup. 
i) Recovery with System Clock change. 
You can end up with duplicate timestamps in the datafiles when a system clock 
changes. 
A solution here is to recover the database until time 'yyyy-mm-dd:00:00:00', 
and set the time to be later than the when the problem occurred. That way it 
will roll forward through the records that were actually performed later, but 
have an earlier time stamp due to the system clock change. 
Performing a complete recovery is optimal, as all transactions will be applied. 
j) Recovery with missing System tablespace. 
The only option is to restore from a backup. 
k) Media Recovery of offline tablespace 
When a tablespace is offline, you cannot recover datafiles belonging to this 
tablespace using recover database command. The reason is because a recover 
database command will only recover online datafiles. Since the tablespace is 
offline, it thinks the datafiles are offline as well, so even if you recover 
database and roll forward, the datafiles in this tablespace will not be touched. 
Instead, you need to perform a recover tablespace command. Alternatively, you 
could restored the datafiles from a cold backup, mount the database and select 
from the v$datafile view to see if any of the datafiles are offline. If they 
are, bring them online, and then you can perform a recover database command. 
l) Recovery of Read-Only tablespaces 
If you have a current control file, then recovery of read only tablespaces is 
no different than recovering read-write files. 
The issues with read-only tablespaces arise if you have to use a backup control 
file. If the tablespace is in read-only mode, and hasn't changed to read-write 
since the last backup, then you will be able to media recovery using a backup 
control file by taking the tablespace offline. The reason here is that when you 
are using the backup control file, you must open the database with resetlogs. 
And we know that Oracle wont let you read files from before a resetlogs was 
done. However, there is an exception with read-only tablespaces. You will be 
able to take the datafiles online after you have opened the database. 
When you have tablespaces that switch modes and you don't have a current control 
file, you should use a backup control file that recognizes the tablespace in 
read-write mode. If you don't have a backup control file, you can create a new 
one using the create controlfile command. 
Basically, the point here is that you should take a backup of the control file 
every time you switch a tablespaces mod 


ORA-01547:
ORA-01110:
ORA-01588
ORA-00205:
----------


OTHER ERRORS:
=============


1. Control file missing

ORA-00202: controlfile: 'g:\oradata\airm\control03.ctl'
ORA-27041: unable to open file
OSD-04002: unable to open file
O/S-Error: (OS 2) The system cannot find the file specified.

Sat May 24 20:02:40 2003
ORA-205 signalled during: alter database airm mount...

Solution: just copy one of the present to the missing one

ORA=00214
---------

1. one Control file is different version

Solution: just copy one of the present to the different one


19.13 recovery FROM 
------------------

alter system disable distributed recovery


  ORA-2019 ORA-2058 ORA-2068 ORA-2050: FAILED DISTRIBUTED TRANSACTIONS 
for step by step instructions on how to proceed.

The above errors indicates that there is a failed distributed transaction that 
needs to be manually cleaned up. 

See <Note 1012842.102>  
In some cases, the instance may crash before the solutions are implemented.  
If this is the case, issue an  'alter system disable distributed recovery' 
immediately after the database starts to allow the database to run without 
having reco terminate the instance.


19.14 get a tablespace out of backup mode:
--------------------------------------

SVRMGR> connect internal
SVRMGR> startup mount
SVRMGR> SELECT df.name,bk.time FROM v$datafile df,v$backup bk
            2> WHERE df.file# = bk.file# and bk.status = 'ACTIVE';
Shows the datafiles currently in a hot backup state.
SVRMGR> alter database datafile
            2> '/u03/oradata/PROD/devlPROD_1.dbf' end backup;
Do an "end backup" on those listed hot backup datafiles.
SVRMGR> alter database open;


19.15 Disk full, corrupt archive log
---------------------------------

Archive mandatory in log_archive_dest is unavailable and it's impossible 
to make a full recovery.
 
Workaround 
 Configure log_archive_min_succeed_dest = 2 
 Do not use log_archive_duplex_dest


19.16 ORA-1578 ORACLE data block corrupted (file # %s, block # %s)
---------------------------------------------------------------

SELECT  segment_name ,  segment_type ,  owner , tablespace_name
FROM    sys.dba_extents
WHERE   file_id = &bad_file_id
AND     &bad_block_id BETWEEN block_id and block_id + blocks -1


19.17 Database does not start (1) SGADEF.DBF LK.DBF
--------------------------------------------------

Note:1034037.6 
Subject:  ORA-01102: WHEN STARTING THE DATABASE 
Type:  PROBLEM 
Status:  PUBLISHED 
 Content Type:  TEXT/PLAIN 
Creation Date:  25-JUL-1997 
Last Revision Date:  10-FEB-2000 
 

Problem Description: 
==================== 
 
You are trying to startup the database and you receive the following error:  
 
   ORA-01102:  cannot mount database in EXCLUSIVE mode 
       Cause:  Some other instance has the database mounted exclusive  
               or shared. 
      Action: Shutdown other instance or mount in a compatible mode. 
or

scumnt: failed to lock /opt/oracle/product/8.0.6/dbs/lkSALES
Fri Sep 13 14:29:19 2002
ORA-09968: scumnt: unable to lock file
SVR4 Error: 11: Resource temporarily unavailable
Fri Sep 13 14:29:19 2002
ORA-1102 signalled during: alter database  mount...
Fri Sep 13 14:35:20 2002
Shutting down instance (abort) 
 
Problem Explanation: 
==================== 
 
A database is started in EXCLUSIVE mode by default.  Therefore, the  
ORA-01102 error is misleading and may have occurred due to one of the  
following reasons: 
 
  - there is still an "sgadef<sid>.dbf" file in the "ORACLE_HOME/dbs" 
    directory  
  - the processes for Oracle (pmon, smon, lgwr and dbwr) still exist 
  - shared memory segments and semaphores still exist even though the  
    database has been shutdown 
  - there is a "ORACLE_HOME/dbs/lk<sid>" file 
  
 
Search Words: 
============= 
 
ORA-1102, crash, immediate, abort, fail, fails, migration

Solution Description: 
===================== 
 
Verify that the database was shutdown cleanly by doing the following: 
 
1. Verify that there is not a "sgadef<sid>.dbf" file in the directory 
   "ORACLE_HOME/dbs".   
 
        % ls $ORACLE_HOME/dbs/sgadef<sid>.dbf 
 
   If this file does exist, remove it. 
 
        % rm $ORACLE_HOME/dbs/sgadef<sid>.dbf 
 
2. Verify that there are no background processes owned by "oracle"  
 
        % ps -ef | grep ora_ | grep $ORACLE_SID 
 
   If background processes exist, remove them by using the Unix  
   command "kill".  For example: 
 
        % kill -9 <Process_ID_Number> 
 
3. Verify that no shared memory segments and semaphores that are owned  
   by "oracle" still exist 
 
        % ipcs -b 
 
   If there are shared memory segments and semaphores owned by "oracle", 
   remove the shared memory segments  
 
	% ipcrm -m <Shared_Memory_ID_Number> 
 
   and remove the semaphores  
 
	% ipcrm -s <Semaphore_ID_Number> 
 
   NOTE:  The example shown above assumes that you only have one  
          database on this machine.  If you have more than one 
          database, you will need to shutdown all other databases 
          before proceeding with Step 4. 
 
4. Verify that the "$ORACLE_HOME/dbs/lk<sid>" file does not exist 
 
5. Startup the instance 
 
 
Solution Explanation: 
===================== 
 
The "lk<sid>" and "sgadef<sid>.dbf" files are used for locking shared memory. 
It seems that even though no memory is allocated, Oracle thinks memory is 
still locked.  By removing the "sgadef" and "lk" files you remove any knowledge
oracle has of shared memory that is in use. Now the database can start.
.


19.18 Rollback segment missing, active transactions
------------------------------------------------


Note:1013221.6 
Subject:  RECOVERING FROM A LOST DATAFILE IN A ROLLBACK TABLESPACE 
Type:  PROBLEM 
Status:  PUBLISHED 
 Content Type:  TEXT/PLAIN 
Creation Date:  16-OCT-1995 
Last Revision Date:  18-JUN-2002 


Solution 1:
---------------

Error scenario:  

1. set transaction use rollback segment rb1;  
2. INSERTS into's... 
3. SHUTDOWN ABORT;  (simulate Media errors) 
4. Delete file rb1.ora (Tablespace RB1 with segment rb1 );  
5. Restore a backup of the file

Recover:  

1. comment out INIT.ORA ROLLBACK_SEGMENT parameter , so ORACLE does not try to 
find the incorrect segment rb1 
2. STARTUP MOUNT 
3. ALTER DATABASE DATAFILE  'rb1.ora' OFFLINE;
4. ALTER DATABASE OPEN  # now we are in business
5. CREATE ROLLBACK SEGMENT rbtemp TABLESPACE SYSTEM; 
   # We need Temporary RBS for further steps;
6. ALTER ROLLBACK SEGMENT rbtemp ONLINE;
7. RECOVER TABLESPACE RB1;
8. ALTER TABLESPACE RB1 ONLINE;
9. ALTER ROLLBACK SEGMENT rb1 ONLINE;
10. ALTER  ROLLBACK SEGMENT rbtemp OFFLINE;
11. DROP ROLLBACK SEGMENT rbtemp;

Result:  Successfully rollback uncommitted Transactions, no suspect instance.


Solution 2:
---------------

INTRODUCTION
------------
Rollback segments can be monitored through the data dictionary view,
dba_rollback_segs. There is a status column that describes what state the
rollback segment is currently in. Normal states are either online or offline.
Occasionally, the status of "needs recovery" will appear.

When a rollback segment is in this state, bringing the rollback segment
offline or online either through the alter rollback segment command or
removing it FROM the rollback_segments parameter in the init.ora usually 
has no effect.


UNDERSTANDING
-------------
A rollback segment falls into this status of needs recovery whenever
Oracle tries to roll back an uncommitted transaction in its transaction
table and fails.

Here are some examples of why a transaction may need to rollback:
  1-A user may do a dml transaction and decides to issue rollback
  2-A shutdown abort occurs and the database needs to do an instance recovery
    in which case, Oracle has to roll back all uncommitted transactions.

When a rollback of a transaction occurs, undo must be applied to the
data block the modified row/s are in.  If for whatever reason, that data
block is unavailable, the undo cannot be applied. The result is a 'corrupted'
rollback segment with the status of needs recovery.

What could be some reasons a datablock is unaccessible for undo?
  1-If a tablespace or a datafile is offline or missing.
  2-If the object the datablock belongs to is corrupted.
  3-If the datablock that is corrupt is actually in the rollback segment
    itself rather than the object.


HOW TO RESOLVE IT
-----------------
1-MAKE sure that all tablespaces are online and all datafiles are
  online. This can be checked through v$datafile, under the
  status column.  For tablespaces associated with the datafiles,
  look in dba_tablespaces.

If that still does not resolve the problem then

2-PUT the following in the init.ora-
  event = "10015 trace name context forever, level 10"

        Setting this event will generate a trace file that will reveal the
        necessary information about the transaction Oracle is trying to roll
        back and most importantly, what object Oracle is trying to apply
        the undo to.

3-SHUTDOWN the database (if normal does not work, immediate, if that does
  not work, abort) and bring it back up.

        Note: An ora-1545 may be encountered, or other errors. If the database
              cannot startup, contact customer support at this point.

4-CHECK in the directory that is specified by the user_dump_dest parameter
  (in the init.ora or show parameter command) for a trace file that was
  generated at startup time.

5-IN the trace file, there should be a message similar to-
  error recovery tx(#,#) object #.

        TX(#,#) refers to transaction information.
        The object # is the same as the object_id in sys.dba_objects.

6-USE the following query to find out what object Oracle is trying to
  perform recovery on.

        SELECT owner, object_name, object_type, status
        FROM dba_objects WHERE object_id = <object #>;

7-THIS object must be dropped so the undo can be released. An export or relying
  on a backup may be necessary to restore the object after the corrupted
  rollback segment goes away.

8-AFTER dropping the object, put the rollback segment back in the init.ora
  parameter rollback_segments, removed the event, and shutdown and startup
  the database.

In most cases, the above steps will resolve the problematic rollback segment.
If this still does not resolve the problem, it may be likely that the
corruption is in the actual rollback segment.
At this point, if the problem has not been resolved, please contact
customer support.


Solution 3:
---------------

Recovery FROM the loss of a Rollback segment datafile containing active transactions

How do I recover the datafile containing rollback segments having active transactions 
and if the backup is done with RMAN without using catalog. 
I have tried the case study FROM the Oracle recovery handbook,but 
when i tried to open the database after offlining the Rollback segment file I got the following errors 

ORA-00604: error occurred at recursive SQL level 2 
ORA-00376: file 2 cannot be read at this time 
ORA-01110:data file 2: '/orabackup/CCD1prod/oradata/rbs01CCD1prod.dbf' 

the status of the datafile was "Recover". 
Anyhow shutting down and starup mounting the database allows for the database or the datafile recovery,
but this was done through SVRMGRL. 

Here is whats happening. 

simulate the loss of datafile by removing FROM the os and shut down abort the database. 
mount the database so RMAN can restore the file, 
at this point offlining the file succeeds but you cannot open the database. 
so the question is can we offline a rollback segment datafile containing active transactions and open the database ? 
How to perform recovery in such case using an RMAN backup without using the catalog. 
I appreciate for any insight and tips into this issue. 

Madhukar 


FROM: Oracle, Tom Villane 01-May-02 21:04 
Subject: Re : Recovery FROM the loss of a Rollback segment datafile containing active transactions 


Hi, 

The only supported way to recover FROM the loss of a rollback segment datafile containing 
a rollback segment with a potentially active data dictionary transaction is to restore the datafile 
FROM backup and roll forward to a point in time prior to the loss of the datafile (assuming archivelog mode). 


Tom Villane Oracle Support Metalink Analyst 

FROM: Madhukar Yedulapuram 02-May-02 06:46 
Subject: Re : Recovery FROM the loss of a Rollback segment datafile containing active transactions 


Hi Tom, 
What does Rollforward upto a time prior to the loss of the datafile got to do with the recovery,
are you suggesting this so that active transaction is not lost,is it possible ? 
Because during the recovery the rollforward is followed by rollback and all the active transactions 
FROM the rollback segment's transaction table will be rolled back isnt it ?
My question is if I have a active transaction in a rollback segment and the file containing 
that rollback segment is lost and the database crashed or did a shutdown abort can we open the 
database after offlining the datafile and commenting out the rollback_segments parameter in the init.ora parameter,
I tried to do it and got the errors which I mentioned earlier.
So in this case I have to do offline recovery only or what ? 
Thanks, 
madhukar 

FROM: Oracle, Tom Villane 02-May-02 16:24 
Subject: Re : Re : Recovery FROM the loss of a Rollback segment datafile containing active transactions 


Hi, 

You won't be able to open the database if you lose a rollback segment datafile that contains an active transaction. 
You will have to: 
Restore a good backup of the file 
RECOVER DATAFILE '<name>' 
ALTER DATABASE DATAFILE '<name>' ONLINE; 

The only way you would be able to open the database is if the status of the rollback were OFFLINE, 
any other status requires that you recover as noted before. 

As recovering FROM rollback corruption needs to be done properly, 
you may want to log an iTAR if you have additional questions. 

Regards 
Tom Villane 
Oracle Support Metalink Analyst 

FROM: Madhukar Yedulapuram 03-May-02 07:22 
Subject: Re : Recovery FROM the loss of a Rollback segment datafile containing active transactions 


Hi Tom, 
Thank you for the reply.you said that the only way the database can be opened is if the status of the rollback segment 
was offline,but what happens to an active transaction which was using this rollback segment,
once the database is opened and the media recovery performed on the datafile,the database will show 
values which were part of an active transaction and not committed,isnt this the logical corruption? 

madhukar 


FROM: Madhukar Yedulapuram 05-May-02 08:14 
Subject: Re : Recovery FROM the loss of a Rollback segment datafile containing active transactions 


Tom, 
Can I get some reponse to my questions. 

Thank You, 
Madhukar 

FROM: Oracle, Tom Villane 07-May-02 13:53 
Subject: Re : Re : Recovery FROM the loss of a Rollback segment datafile containing active transactions 


Hi, 

Sorry for the confusion, I should not have said "rolling forward to a point in time..." in my previous reply. 
No, there won't be corruption or inconsistency. The redo logs will contain the information for both 
committed and uncommitted transactions. Since this includes changes made to rollback segment blocks, 
it follows that rollback data is also (indirectly) recorded in the redo log. 
To recover FROM a loss of Datafiles in the SYSTEM tablespace or 
datafiles with active rollback segments. You must perform closed database recovery. 
-Shutdown the database 
-Restore the file FROM backup 
-Recover the datafile 
-Open the database. 

References: 
Oracle8i Backup and Recovery Guide, chapter 6 under "Losing Datafiles in ARCHIVELOG Mode ". 


Regards 
Tom Villane 
Oracle Support Metalink Analyst 

FROM: Madhukar Yedulapuram 07-May-02 22:23 
Subject: Re : Recovery FROM the loss of a Rollback segment datafile containing active transactions 


Hi Tom, 
After offlining the rollback segment containing active transaction you can open the database and do the recovery 
and after that any active transactions should be rolled back and the data should not show up,
but I performed the following test and Oracle is showing logical corruption by showing data which was never committed. 

SVRMGR> create tablespace test_rbs datafile 
'/orabackup/CCD1prod/oradata/test_rbs01.dbf' size 10M 
2> default storage (initial 1M next 1M minextents 1 maxextents 1024); 
Statement processed. 
SVRMGR> create rollback segment test_rbs tablespace test_rbs; 
Statement processed. 
SVRMGR> create table case5 (c1 number) tablespace tools; 
Statement processed. 
SVRMGR> set transaction use rollback segment test_rbs; 
ORA-01598: rollback segment 'TEST_RBS' is not online 
SVRMGR> alter rollback segment test_rbs online; 
Statement processed. 
SVRMGR> set transaction use rollback segment test_rbs; 
Statement processed. 
SVRMGR> insert into case5 values (5); 
1 row processed. 
SVRMGR> alter rollback segment test_rbs offline; 
Statement processed. 
SVRMGR> shutdown abort 
ORACLE instance shut down. 
SVRMGR> startup mount 
ORACLE instance started. 
Total System Global Area 145981600 bytes 
Fixed Size 73888 bytes 
Variable Size 98705408 bytes 
Database Buffers 26214400 bytes 
Redo Buffers 20987904 bytes 
Database mounted. 
SVRMGR> alter database datafile '/orabackup/CCD1prod/oradata/test_rbs01.dbf' 
offline; 
Statement processed. 
SVRMGR> alter database open; 
Statement processed. 
SVRMGR> recover tablespace test_rbs; 
Media recovery complete. 
SVRMGR> alter tablespace test_rbs online; 
Statement processed. 
SVRMGR> SELECT * FROM case5; 
C1 
---------- 
5 
1 row SELECTed. 
SVRMGR> alter rollback segment test_rbs online; 
Statement processed. 
SVRMGR> SELECT * FROM case5; 
C1 
---------- 
5 
1 row SELECTed. 
SVRMGR> drop rollback segment test_rbs; 
drop rollback segment test_rbs 
* 
ORA-01545: rollback segment 'TEST_RBS' specified not available 
SVRMGR> SELECT segment_name,status FROM dba_rollback_segs; 
SEGMENT_NAME STATUS 
------------------------------ ---------------- 
SYSTEM ONLINE 
R0 OFFLINE 
R01 OFFLINE 
R02 OFFLINE 
R03 OFFLINE 
R04 OFFLINE 
R05 OFFLINE 
R06 OFFLINE 
R07 OFFLINE 
R08 OFFLINE 
R09 OFFLINE 
R10 OFFLINE 
R11 OFFLINE 
R12 OFFLINE 
BIG_RB OFFLINE 
TEST_RBS ONLINE 
16 rows SELECTed. 

SVRMGR> drop rollback segment test_rbs; 
drop rollback segment test_rbs 
* 
ORA-01545: rollback segment 'TEST_RBS' specified not available 

Here I have to bring the rollback segment offline to dropt it. 

Can this be explained or is this a bug,because this caused logical corruption. 

FROM: Oracle, Tom Villane 10-May-02 13:19 
Subject: Re : Re : Recovery FROM the loss of a Rollback segment datafile containing active transactions 


Hi, 

What you are showing is expected and normal, and not corruption. 
At the time that you issue the "alter rollback segment test_rbs online;" Oracle does an implicit commit 
becuase any "ALTER" statement is considered DDL and Oracle issues an 
implicit COMMIT before and after any data definition language (DDL)statement. 

Regards 
Tom Villane 
Oracle Support Metalink Analyst 


--------------------------------------------------------------------------------

FROM: Madhukar Yedulapuram 14-May-02 20:12 
Subject: Re : Recovery FROM the loss of a Rollback segment datafile containing active transactions 


Hi Tom, 
So what you are saying is the moment I say 
Alter rollback segment RBS# online,oracle will issue 
an implicit commit,but if you look at my test just after performing the tablespace recovery 
(had only one datafile in the RBS tablespace 
which was offlined before opening the database and doing the recovery),

I brought the tablespace online and did a SELECT FROM the table which was having 
the active transaction in one of the rollback segments,so this statement has issued an 
implicit commit and I could see the data which was never actually committed,doesnt this 
contradict the Oracle's stance that only that data will be shown which shown which is committed,
I think this statement is true for Intance and Crash recovery,not for media recovery as the case 
in point proves,but still if you say Oracle issues an implicit commit,then the stance of oracle is consistent. 

madhukar 


FROM: Oracle, Tom Villane 15-May-02 18:30 
Subject: Re : Re : Recovery FROM the loss of a Rollback segment datafile containing active transactions 


Hi, 

A slight correction to what I posted, I should have said the implicit commit happened 
when the rollback segment was altered offline. 

Whether it's an implicit commit (before and after a DDL statement like CREATE, DROP, RENAME, ALTER)
 or if the user did the commit, or if the user exits the application (forces a commit). 
All of the above are considered commits and the data will be saved. 

Regards 
Tom Villane 
Oracle Support Metalink Analyst 


FROM: Madhukar Yedulapuram 16-May-02 23:17 
Subject: Re : Recovery FROM the loss of a Rollback segment datafile containing active transactions 


Hi Tom, 
Thank You very much,so the moment i brought the RBS offline,the transaction was 
committed and the data saved in the table,is that what you are saying.
So the data was committed even before performing the recovery,so recovery is essentially not applying anything in this case. 

madhukar 


FROM: Oracle, Tom Villane 17-May-02 12:18 
Subject: Re : Re : Recovery FROM the loss of a Rollback segment datafile containing active transactions 


Hi, 

Yes, that is what happened. 

Regards 
Tom Villane 
Oracle Support Metalink Analyst 


19.19 After backup you increase a datafile.
------------------------------------------


problem 2: "the backed up
   datafile size is smaller, and Oracle won't
   accept it for recovery." 

isn't a problem because we most certainly will accept that file.  As a test you 
can do this (i just did)

o create a small 1m tablespace with a datafile.
o alter it and begin backup.
o copy the datafile
o alter it and end backup.
o alter the datafile and "autoextend on next 1m" it.
o create a table with initial 2m initial extent. This will 
  grow the datafile.
o offline the tablespace
o copy the 1m original file back.
o try to online it -- it'll tell you the file that needs 
  recovery (its already accepted the smaller file at this 
  point)
o alter database recover datafile 'that file';
o alter the tablespace online again -- all is well.


As for the questions:

1) There is such a command -- "alter database create datafile".  Here is an 
example I just ran through:

tkyte@TKYTE816> alter tablespace t begin backup;
Tablespace altered.

I copied the single datafile that is in T at this point

tkyte@TKYTE816> alter tablespace t end backup;
Tablespace altered.

tkyte@TKYTE816> alter tablespace t add datafile 'c:\temp\t2.dbf' size 1m;
Tablespace altered.

So, I added a datafile AFTER the backup...

tkyte@TKYTE816> alter tablespace t offline;
Tablespace altered.

At this point, I went out and erased the two datafiles associated with T.  I 
moved the copy of the one datafile in place...

tkyte@TKYTE816> alter tablespace t online;
alter tablespace t online
*
ERROR at line 1:
ORA-01113: file 9 needs media recovery
ORA-01110: data file 9: 'C:\TEMP\T.DBF'

So, it sees the copy is out of sync...

tkyte@TKYTE816> recover tablespace t;
ORA-00283: recovery session canceled due to errors
ORA-01157: cannot identify/lock data file 10 - see DBWR trace file
ORA-01110: data file 10: 'C:\TEMP\T2.DBF'

and now it tells of the missing datafile -- all we need do at this point is:

tkyte@TKYTE816> alter database create datafile 'c:\temp\t2.dbf';
Database altered.

tkyte@TKYTE816> recover tablespace t;
Media recovery complete.
tkyte@TKYTE816> alter tablespace t online;
Tablespace altered.

and we are back in business....


19.22 Setting Trace Events
-------------------------

database level via init.ora

EVENT="604 TRACE NAME ERRORSTACK FOREVER"
EVENT="10210 TRACE NAME CONTEXT FOREVER, LEVEL 10"

session level

ALTER SESSION SET EVENTS 'IMMEDIATE TRACE NAME BLOCKDUMP LEVEL 67109037';
ALTER SESSION SET EVENTS 'IMMEDIATE TRACE NAME CONTROLF LEVEL 10';
  
system trace dump file
ALTER SESSION SET EVENTS 'IMMEDIATE TRACE NAME SYSTEMSTATE LEVEL 10';  


19.23 DROP TEMP DATAFILE
-----------------------

SVRMGRL>startup mount
SVRMGRL>alter database open;
ora-01157 cannot identify datafile 4 - file not found
ora-01110 data file 4 '/oradata/temp/temp.dbf'
SVRMGRL>alter database datafile '/oradata/temp/temp.dbf' offline drop;
SVRMGRL>alter database open;
SVRMGRL>drop tablespace temp including contents;
SVRMGRL>create tablespace temp datafile '....


19.24 SYSTEM DATAFILE RECOVERY
-----------------------------

- a normal datafile can be taken offline and the database started up.
- the system file can be taken offline but the database cannot start

- restore a backup copy of the system file
- recover the file


19.25 Strange processes=.. and database does not start
-----------------------------------------------------

Does the PROCESSES initialization parameter of init.ora depend on some other parameter ?
We were getting the error as 
maximum no of process (50) exceeded.....
The value was initially set to 50, so when the value was....changed to 200, and the database
was restarted, it gave an error of "end-of-file on communication channel"
The value was reduced to 150 & 100 and the same error was encountered....
when it was set back to 50, the database started....
Can anyone clear ?

check out ur semaphore settings in /etc/system.
try increasing seminfo_semmns 


19.26 ORA-00600
--------------

I work with ORACLE DB ver.8.0.5
and recieved an error in alert.log
ksedmp: internal or fatal error
ORA-00600: internal error code, arguments: [12700], [3383], [41957137], [44], [], [], [], []


oerr ora 600
00600, 00000, "internal error code, arguments: [%s], [%s], [%s], [%s], [%s], [%s], [%s], [%s]"
Cause: This is the generic internal error number for 
Oracle program
exceptions. This indicates that a process has 
encountered an exceptional condition.
Action: Report as a bug - the first argument is the 
internal error number
Number [12700] indicates 
"invalid NLS parameter value (%s)"
Cause: An invalid or unknown NLS configuration 
parameter was specified.


19.27 segment has reached it's max_extents
-----------------------------------------

oracle later than 7.3.x

Version 7.3 and later:                  
You can set the MAXEXTENTS storage parameter value to UNLIMITED for any          
object.             
Rollback Segment           
================             
ALTER ROLLBACK SEGMENT rollback_segment STORAGE ( MAXEXTENTS UNLIMITED);  
           
Temporary Segment           
=================             
ALTER TABLESPACE tablespace DEFAULT STORAGE ( MAXEXTENTS UNLIMITED); 
            
Table Segment           
=============             
ALTER TABLE MANIIN_ASIAKAS STORAGE ( MAXEXTENTS UNLIMITED); 

ALTER TABLE MANIIN_ASIAKAS STORAGE ( NEXT 5M ); 
   
         
Index Segment           
=============             
ALTER INDEX index STORAGE ( MAXEXTENTS UNLIMITED);   
           
Table Partition Segment          
=======================           
ALTER TABLE table MODIFY PARTITION partition STORAGE (MAXEXTENTS UNLIMITED);   


19.28 max logs
--------------

Problem Description
-------------------  
In the "alert.log", you find the following warning messages:        
kccrsz: denied expansion of controlfile section 9 by 65535 record(s)      
the number of records is already at maximum value (65535)     
krcpwnc: following controlfile record written over:      
RECID #520891 Recno 53663 Record timestamp     ...     
kccrsz: denied expansion of controlfile section 9 by 65535 record(s)      
the number of records is already at maximum value (65535)     
krcpwnc: following controlfile record written over:      
RECID #520892 Recno 53664 Record timestamp  

The database is still running.  
The CONTROL_FILE_RECORD_KEEP_TIME init parameter is set to 7.  
If you display the records used in the LOG HISTORY section 9 of the controlfile:     

SQL> SELECT * FROM v$controlfile_record_section WHERE type='LOG HISTORY' ;     
TYPE           RECORDS_TOTAL RECORDS_USED FIRST_INDEX LAST_INDEX LAST_RECID    
-------------  ------------- ------------ ----------- ---------- ----------    
LOG HISTORY            65535        65535       33864      33863     520892   

The number of RECORDS_USED has reached the maximum allowed in RECORDS_TOTAL.   

Solution Description
--------------------  
Set the CONTROL_FILE_RECORD_KEEP_TIME to 0:   
* Insert the parameter CONTROL_FILE_RECORD_KEEP_TIME = 0 IN "INIT.ORA"    
 -OR-                              
* Set it momentarily if you cannot shut the database down now:                                                                                 
SQL> alter system set control_file_record_keep_time=0;                                                           

Explanation 
-----------  
The default value for      * the CONTROL_FILE_RECORD_KEEP_TIME is 7 days.       
SELECT value FROM v$parameter         
WHERE name='control_file_record_keep_time';                  
VALUE       
-----          
7       
* the MAXLOGHISTORY database parameter has already reached the maximum of       
65535 and it cannot be increased anymore.                                                                                
SQL> alter database backup controlfile to trace;                                    
=> in the trace file, MAXLOGHISTORY is 65535                                
The MAXLOGHISTORY increases dynamically when the       
CONTROL_FILE_RECORD_KEEP_TIME is set to a value different FROM 0,      
but does not exceed 65535.  Once reached, the message appears in the      
alert.log warning you that a controlfile record is written over.   


19.29 ORA-470 maxloghistory
--------------------------

Problem Description: 
==================== 
Instance cannot be started because of ORA-470. LGWR has also died 
creating a trace file with an ORA-204 error. It is possible that the 
maxloghistory limit of 65535 as specified in the controlfile has 
been reached. 
Diagnostic Required: 
==================== 
The following information should be requested for diagnostics: 
1. LGWR trace file produced 
2. Dump of the control file - using the command: 
ALTER SESSION SET EVENTS 'immediate trace name controlf level 10' 
3. Controlfile contents, using the command: 
ALTER DATABASE BACKUP CONTROLFILE TO TRACE; 
Diagnostic Analysis: 
==================== 
The following observations will indicate that we have the maxloghistory 
limit of 65535: 
1. The Lgwr trace file should show the following stack trace: 
- in 8.0.3 and 8.0.4, OSD skgfdisp returns ORA-27069, 
stack: 
kcrfds -> kcrrlh -> krcpwnc -> kccroc -> kccfrd -> kccrbl -> kccrbp 
- in 8.0.5 kccrbl causes SEGV before the call to skgfdisp 
with wrong block number. 
stack: 
kcrfds -> kcrrlh -> krcpwnc -> kccwnc -> kccfrd -> kccrbl 
2. FROM the 'dump of the controlfile': 
... 
... numerous lines omittted 
... 
LOG FILE HISTORY RECORDS: 
(blkno = 0x13, size = 36, max = 65535, in-use = 65535, last-recid= 188706) 
... 
the max value of 65535 reconfirms that the limit has been reached. 
3. Further confirmation can be seen FROM the controlfile trace: 
CREATE CONTROLFILE REUSE DATABASE "ORCL" NORESETLOGS NOARCHIVELOG 
MAXLOGFILES 16 
MAXLOGMEMBERS 2 
MAXDATAFILES 50 
MAXINSTANCES 1 
MAXLOGHISTORY 65535 
... 
Diagnostic Solution: 
=================== 
1. Set control_file_record_keep_time = 0 in the init.ora. 
This parameter specifies the minimum age of a log history record 
in days before it can be reused. With the parameter set to 0, 
reusable sections never expand and records are reused immediately 
as required. 
[NOTE:1063567.6] <ml2_documents.showDocument?p_id=1063567.6&p_database_id=NOT> 
gives a good description on the use of this parameter. 
2. Mount the database and retrieve details of online redo log files for use in 
step 6. Because the recovery will need to roll forward through current online 
redo logs, a list of online log details is required to indicate which redo 
log is current. This can be obtained using the following command: 
startup mount 
SELECT * FROM v$logfile; 
3. Open the database. 
This is a very important step. Although the startup will fail, it is a 
very important step before recreating the controlfile in step 5 and hense, 
enabling crash recovery to repair any incomplete log switch. Without this 
step it may be impossible to recover the database. 
alter database open 
4. Shutdown the database, if it did not already crash in step 3. 
5. Using the backup controlfile trace, recreate the controlfile with a smaller 
maxloghistory value. The MAXLOGHISTORY section of the current control file 
cannot be extended beyond 65536 entries. The value should reflect the amount 
of log history that you wish to maintain. 
An ORA-219 may be returned when the size of the controlfile, based on the 
values of the MAX- parameters, is higher then the maximum allowable size. 
[NOTE:1012929.6] <ml2_documents.showDocument?p_id=1012929.6&p_database_id=NOT> 
gives a good step-by-step guide to recreating the control file. 
6. Recover the database. 
The database will automatically be mounted due to the recreation of the 
controlfile in step 5 : 
Recover database using backup controlfile; 
At the recovery prompt apply the online logs in sequence by typing the 
unquoted full path and file name of the online redo log to apply, as noted 
in step 2. After applying the current redo log, you will receive the 
message 'Media Recovery Complete'. 
7. Once media recovery is complete, open the database as follows: 
alter database open resetlogs; 

19.30 Compatible init.ora change:
--------------------------------

Database files have the COMPATIBLE version in the file header. If you 
set the parameter to a higher value, all the headers will be updated at next 
database startup. This means that if you shutdown your database, downgrade the 
COMPATIBLE parameter, and try to restart your database, you'll receive an error 
message something like: 

ORA-00201: control file version 7.3.2.0.0 incompatible with ORACLE version 
7.0.12.0.0 
ORA-00202: control file: '/usr2/oracle/dbs/V73A/ctrl1V73A.ctl' 
In the above case, database was running with COMPATIBLE 7.3.2.0. I commented out 
the parameter in init.ora, that is; kernel uses default 7.0.12.0 and returns an 
error before mounting since kernel cannot read the controlfile header. 

- You may only change the value of COMPATIBLE after a COLD Backup. 
- You may only change the value of COMPATIBLE if the database has been 
shutdown in NORMAL/IMMEDIATE mode. 


This parameter allows you to use a new release, while at the same time guaranteeing backward 
compatibility with an earlier release (in case it becomes necessary to revert to the earlier release). 
This parameter specifies the release with which Oracle7 Server must maintain compatibility.
 Some features of the current release may be restricted. For example, if you are running release 7.2.2.0 
with compatibility set to 7.1.0.0 in order to guarantee compatibility, you will not be able to use 7.2 features. 
When using the standby database and feature, this parameter must have the same value on the primary 
and standby databases, and the value must be 7.3.0.0.0 or higher. This parameter allows you to immediately 
take advantage of the maintenance improvements of a new release in your production systems 
without testing the new functionality in your environment. The default value is the earliest release with which 
compatibility can be guaranteed. Ie: It is not possible to set COMPATIBLE to 7.3 on an Oracle8 database. 
-----------------

Hi Tom, Just installed DB9.0.1, I tried to modify parameter in init.ora file: compatible=9.0.0(default) to 8.1.0. 
After I restarted the 901 DB, I got error below when I login to sqlplus: ERROR: ORA-01033: 
ORACLE initialization or shutdown in progress Anything wrong with that? If I change back, everything is ok. 

The database could not start up. If you start the database manually, from the command line -- 
you would discover this. For example:
 
idle> startup pfile=initora920.ora
 ORACLE instance started. 
Total System Global Area 143725064 bytes 
Fixed Size 451080 bytes 
Variable Size 109051904 bytes
 Database Buffers 33554432 bytes 
Redo Buffers 667648 bytes 
Database mounted. 
ORA-00402: database changes by release 9.2.0.0.0 cannot be used by release 8.1.0.0.0 
ORA-00405: compatibility type "Locally Managed SYSTEM tablespace" .....
 Generally, compatible cannot be set DOWN as you are already using new features 
many times that are not compatible with the older release. 
You would have had to of created the database with 8.1 file formats (compatible set to 8.1 from the very beginning) 
------------------------------


19.31 ORA-27044: unable to write the header block of file:
---------------------------------------------------------

Problem Description:   
====================   
   
When you manually switch redo logs, or when the log buffer causes the redo  
threads to switch, you see errors similar to the following in your alert log:  
  
    ...  
    Fri Apr 24 13:42:00 1998  
    Thread 1 advanced to log sequence 170  
      Current log# 4 seq# 170 mem# 0: /.../rdlACPT04.rdl  
    Fri Apr 24 13:42:04 1998  
    Errors in file /.../acpt_arch_15973.trc:  
    ORA-202: controlfile: '/.../ctlACPT01.dbf'  
    ORA-27044: unable to write the header block of file  
    SVR4 Error: 48: Operation not supported  
    Additional information: 3  
    Fri Apr 24 13:42:04 1998  
    kccexpd: controlfile resize from 356 to 368 block(s) denied by OS  
    ...  
  
  
Note: The particular SVR4 error observed may differ in your case and is  
      irrelevant here.  
  
  
ORA-00202: "controlfile: '%s'"  
    Cause: This message reports the name file involved in other messages.  
   Action: See associated error messages for a description of the problem.  
  
ORA-27044: "unable to write the header block of file"  
    Cause: write system call failed, additional information indicates  
           which function encountered the error  
   Action: check errno  
  
 
Solution Description:   
=====================   
   
To workaround this problem you can:  
  
1. Use a database blocksize smaller than 16k.  This may not be practical  
   in all cases, and to change the db_block_size of a database  
   you must rebuild the database.  
  
- OR -  
 
2. Set the init.ora parameter CONTROL_FILE_RECORD_KEEP_TIME equal to   
   zero.  This can be done by adding the following line to your  
   init.ora file:  
  
       CONTROL_FILE_RECORD_KEEP_TIME = 0  
   
   The database must be shut down and restarted to have the changed  
   init.ora file read.   
  
   
Explanation:   
============   
   
This is [BUG:663726] <ml2_documents.showDocument?p_id=663726&p_database_id=BUG>, which is fixed in release 8.0.6.  
   
The write of a 16K buffer to a control file seems to fail during an implicit  
resize operation on the controlfile that came as a result of adding log  
history records (V$LOG_HISTORY) when archiving an online redo log after a log  
switch.  
 
Starting with Oracle8 the control file can grow to a much larger size than it  
was able to in Oracle7.  Bug 663726 <ml2_documents.showDocument?p_id=663726&p_database_id=BUG> 
is only reproducible when the control file  
needs to grow AND when the db_block_size = 16k.  This has been tested on  
instances with a smaller database block size and the problem has not been able  
to be reproduced.  
  
Records in some sections in the control file are circularly reusable while  
records in other sections are never reused. CONTROL_FILE_RECORD_KEEP_TIME  
applies to reusable sections. It specifies the minimum age in days that a  
record must have before it can be reused. In the event a new record needs to  
be added to a reusable section and the oldest record has not aged enough, the  
record section expands.   
  
If CONTROL_FILE_RECORD_KEEP_TIME is set to 0, then reusable sections never  
expand and records are reused as needed. 


19.32 ORA-04031 error shared_pool:
---------------------------------


DIAGNOSING AND RESOLVING ORA-04031 ERROR

For most applications, shared pool size is critical to Oracle perfoRMANce. The shared pool holds both the d
ata dictionary cache and the fully parsed or compiled representations of PL/SQL blocks and SQL statements. 
When any attempt to allocate a large piece of contiguous memory in the shared pool fails 
Oracle first flushes all objects 
that are not currently in use from the pool and the resulting free memory chunks are merged. 
If there is still not a single chunk large enough to satisfy the request ORA-04031 is returned. 
The message that you will get when this error appears is the following: 
Error: ORA 4031 
Text: unable to allocate %s bytes of shared memory (%s,%s,%s) 


The ORA-04031 error is usually due to fragmentation in the library cache 
or shared pool reserved space. Before of increasing the shared pool size consider 
to tune the application to use shared sql and tune 
SHARED_POOL_SIZE, SHARED_POOL_RESERVED_SIZE, and SHARED_POOL_RESERVED_MIN_ALLOC. 

First determine if the ORA-04031 was a result of fragmentation in the library 
cache or in the shared pool reserved space by issuing the following query: 

SELECT free_space, avg_free_size, used_space, 
avg_used_size, request_failures, last_failure_size 
FROM v$shared_pool_reserved;

The ORA-04031 is a result of lack of contiguous space in the shared pool 
reserved space if: 
REQUEST_FAILURES is > 0 and LAST_FAILURE_SIZE is > 
SHARED_POOL_RESERVED_MIN_ALLOC. 

To resolve this consider increasing SHARED_POOL_RESERVED_MIN_ALLOC to lower 
the number of objects being cached into the shared pool reserved space and 
increase SHARED_POOL_RESERVED_SIZE and SHARED_POOL_SIZE to increase the 
available memory in the shared pool reserved space. 
The ORA-04031 is a result of lack of contiguous space in the library cache if: 
REQUEST_FAILURES is > 0 and LAST_FAILURE_SIZE is < 
SHARED_POOL_RESERVED_MIN_ALLOC 
or 
REQUEST_FAILURES is 0 and LAST_FAILURE_SIZE is < SHARED_POOL_RESERVED_MIN_ALLOC 
The first step would be to consider lowering SHARED_POOL_RESERVED_MIN_ALLOC to 
put more objects into the shared pool reserved space and increase 
SHARED_POOL_SIZE. 

This view keeps information of every SQL statement and PL/SQL block executed in the database.
The following SQL can show you statements with literal values or candidates to include bind variables: 

SELECT substr(sql_text,1,40) "SQL", 
count(*) , 
sum(executions) "TotExecs" 
FROM v$sqlarea 
WHERE executions < 5 
GROUP BY substr(sql_text,1,40) 
HAVING count(*) > 30 
ORDER BY 2; 


19.33 ORA-4030 Out of memory:
----------------------------

Possibly no memory left in Oracle, or the OS does not grant more memory.
Also inspect the size of any swap file.

The errors is also reported if execute permissions are not in place
on some procedure.


19.34 wrong permissions on oracle:
----------------------------------

Hi,

I am under very confusing situation.
I'm running database (8.1.7)
My oracle is installed under ownership of userid "oracle"

when i login with unix id "TEST" and give oracle_sid,oracle_home,PATH variables 
and then do sqlplus sys

after logging in when i give 
"select file#,error from v$datafile_header;"

for some file# i get error as "CAN NOT READ HEADER"

but when i login through other unix id and do the same thing.
I'm not getting any error..

This seems very very confusing,
Could you tell me the reason behind this??


Thank & Regards,
Atul


Followup:  
sounds like you did not run the root.sh during the install and the permissions 
on the oracle binaries are wrong.

what does ls -l $ORACLE_HOME/bin/oracle look like.  it should look like this:


$ ls -l $ORACLE_HOME/bin/oracle
-rwsr-s--x    1 ora920   ora920   51766646 Mar 31 13:03 
/usr/oracle/ora920/bin/oracle


with the "s" bits set. 
 
rwsr-s--x   1 oracle   dba       494456 Dec  7  1999 lsnrctl


regardless of who I log in as, when you have a setuid program as the oracle 
binary is, it'll be running "as the owner"

tell me, what does ipcs -a show you, who is the owner of the shared memory 
segments associated with the SGA.  If that is not Oracle -- you are "getting 
confused" somewhere for the s bit would ensure that Oracle was the owner.


Some connection troubleshooting:
--------------------------------

19.35:
======

ORA-12545:
----------

This one is probaly due to the fact the IP or HOSTNAME in tnsnames is wrong.

ORA-12514:
----------

This one is probaly due to the fact the SERVICE_NAME in tnsnames is wrong or should be
fully qualified with domain name.

ORA-12154:
----------

This one is probaly due to the fact the alias you have used in the logon dialogbox
is wrong.
fully qualified with domain name.


ORA-12535:
----------

The TNS-12535 or ORA-12535 error is normally a timeout error associated
  with Firewalls or slow Networks.
+ It can also be an incorrect listener.ora parameter setting for the
  CONNECT_TIMEOUT_<listener_name> value specified.
+ In essence, the ORA-12535/TNS-12535 is a timing issue between the client and
  server.


19.36 ORA-12560
---------------

Note 1:
-------

Oracle classify this as a �generic protocol adapter error�. In my experience it indicates that 
Oracle client does not know what instance to connect to or what TNS alias to use.

Set the correct ORACLE_HOME ans ORACLE_SID variables. 

Note 2:
-------

Doc ID:  Note:73399.1 
Subject:  WINNT: ORA-12560 DB Start via SVRMGRL or SQL*PLUS ORACLE_SID is set correctly 
Type:  BULLETIN 
Status:  PUBLISHED 
 Content Type:  TEXT/PLAIN 
Creation Date:  28-JUL-1999 
Last Revision Date:  14-JAN-2004 

PURPOSE 
  
To assist in resolving ORA-12560 errors on Oracle8i. 
  
SCOPE & APPLICATION 
  
Support Analysts and customers. 
  
RELATED DOCUMENTS 
 
PR:1070749.6 
NOTE:1016454.102 <ml2_documents.showDocument?p_id=1016454.102&p_database_id=NOT> TNS 12560 DB CREATE VIA INSTALLATION OR CONFIGURATION ASSISTANT FAILS  
BUG:948671 <ml2_documents.showDocument?p_id=948671&p_database_id=BUG> ORADIM SUCCSSFULLY CREATES AN UNUSABLE SID WITH NON-ALPHANUMERIC  
           CHARACTER 
BUG:892253 <ml2_documents.showDocument?p_id=892253&p_database_id=BUG> ORA-12560 CREATING DATABASE WITH DB CONFIGURATION ASSISTANT IF 
           SID HAS NON-ALPHA 
 
If you encounter an ORA-12560 error when you try to start Server Manager 
or SQL*Plus locally on your Windows NT server, you should first check 
the ORACLE_SID value.  Make sure the SID is correctly set, either in the  
Windows NT registry or in your environment (with a set command).  Also, you  
must verify that the service is running.  See the entries above for more details. 
 
If you have verified that ORACLE_SID is properly set, and the service 
is running, yet you still get an ORA-12560, then it is possible that you 
have created an instance with a non-alphanumeric character. 
 
The Getting Started Guide for Oracle8i on Windows NT documents that SID 
names can contain only alphanumerics, however if you attempt to create a SID 
with an underscore or a dash on Oracle8i you are not prevented from doing so. 
The service will be created and started successfully, but attempts to connect 
will fail with an ORA-12560. 
 
You must delete the instance and recreate it with no special characters -  
only alphanumerics are allowed in the SID name. 
 
See BUG#948671, which was logged against 8.1.5 on Windows NT for this issue. 


Note 3:
-------

Doc ID </help/usaeng/Search/search.html>: 	Note:119008.1	Content Type: 	TEXT/PLAIN	
Subject: 	ORA-12560 Connecting to the Server on Unix - Troubleshooting	Creation Date: 	04-SEP-2000	
Type: 	PROBLEM	Last Revision Date: 	20-MAR-2003	
Status: 	PUBLISHED		
PURPOSE 
------- 
 
This note describes some of the possible reasons for ORA-12560 errors  
connecting to  server on Unix Box. The list below shows some of the  
causes, the symptoms and the action to take. It is possible you will hit  
a cause not described here, in that case the information above should allow  
it to be identified. 
  
 
SCOPE & APPLICATION 
------------------- 
 
Support Analysts and customers alike. 
 
 
ORA-12560 CONNECTING TO THE SERVER ON UNIX - TROUBLESHOOTING 
------------------------------------------------------------ 
 
ORA-12560: TNS:protocol adapter error 
Cause:  A generic protocol adapter error occurred. 
Action: Check addresses used for proper protocol specification. Before 
        reporting this error, look at the error stack and check for lower 
        level transport errors.  For further details, turn on tracing and 
        re execute the operation. Turn off tracing when the operation 
        is complete. 
 
This is a high-level error just reporting an error occurred in the actual 
transport layer. Look at the next error down the stack and process that. 
 
 
1. ORA-12500 ORA-12560 MAKING MULTIPLE CONNECTIONS TO DATABASE  
 
   Problem: 
   Trying to connect to the database via listener and the ORA-12500 are  
   prompted. You may see in the listener.log ORA-12500 and ORA-12560: 
 
    ORA-12500:  TNS:listener failed to start a dedicated server process 
        Cause:  The process of starting up a dedicated server process 
                failed.  The executable could not be found or the 
                environment maybe set up incorrectly. 
       Action:  Turn on tracing at the ADMIN level and re execute the 
                operation.  Verify that the ORACLE Server executable is 
                present and has execute permissions enabled.  Ensure that 
                the ORACLE environment is specified correctly in 
                LISTENER.ORA. If error persists, contact Worldwide 
                Customer Support. 
 
   In many cases the error ORA-12500 is caused due to leak of resources in the  
   Unix Box, if you are enable to connect to database and randomly you get 
   the error your operating system is reached the maximum values for some  
   resources. Otherwise, if you get the error in first connection the problem 
   may be in the configuration of the system. 
 
   Solution: 
   Finding the resource which is been reached is difficult, the note 2064862.102 <ml2_documents.showDocument?p_id=2064862.102&p_database_id=NOT> 
   indicates some suggestion to solve the problems.  
 
    
2. ORA-12538/ORA-12560 connecting to the database via SQL*Net 
    
   Problem: 
   Trying to connect to database via SQL*Net the error the error ORA-12538  
   is prompted. In the trace file you can see: 
  
      nscall: error exit 
      nioqper:  error from nscall 
      nioqper:    nr err code: 0 
      nioqper:    ns main err code: 12538 
      nioqper:    ns (2)  err code: 12560 
      nioqper:    nt main err code: 508 
      nioqper:    nt (2)  err code: 0 
      nioqper:    nt OS   err code: 0 
 
   Solution: 
   - Check the protocol used in the TNSNAMES.ORA by the connection string 
   - Ensure that the TNSNAMES.ORA you check is the one that is actually being 
     used by Oracle. Define the TNS_ADMIN environment variable to point to the 
     TNSNAMES directory.   
   - Using the $ORACLE_HOME/bin/adapters command, ensure the protocol is 
     installed. Run the command without parameters to check if the protocol is 
     installed, then run the command with parameters to see whether a  
     particular tool/application contains the protocol symbols e.g.: 
 
     1. $ORACLE_HOME/bin/adapters 
     2. $ORACLE_HOME/bin/adapters $ORACLE_HOME/bin/oracle 
        $ORACLE_HOME/bin/adapters $ORACLE_HOME/bin/sqlplus 
 
    Explanation: 
    If the protocol is not installed every connection attempting to use it will  
    fail with ORA-12538 because the executable doesn't contain the required  
    protocol symbol/s. 
 
Error ORA-12538 may also be caused by an issue with the 
'$ORACLE_HOME/bin/relink all' command. 'Relink All' does not relink the sqlplus 
executable. If you receive error ORA-12538 when making a sqlplus connection, it 
may be for this reason. 
 
    To relink sqlplus manually: 
    $ su - oracle 
    $ cd $ORACLE_HOME/sqlplus/lib  
    $ make -f ins_sqlplus.mk install 
    $ ls -l $ORACLE_HOME/bin/sqlplus --> should show a current date/time stamp 
 
3. ORA-12546 ORA-12560 connecting locally to the database  
 
   Problem: 
   Trying to connect to database locally with a different account to the  
   software owner, the error the error ORA-12546 is prompted. In the trace file  
   you can see: 
 
     nioqper:  error from nscall 
     nioqper:    nr err code: 0 
     nioqper:    ns main err code: 12546 
     nioqper:    ns (2)  err code: 12560 
     nioqper:    nt main err code: 516 
     nioqper:    nt (2)  err code: 13 
     nioqper:    nt OS   err code: 0 
 
   Solution: 
   Make sure the permissions of oracle executable are correct, this should be: 
  
   52224 -rwsr-sr-x   1 oracle dba  53431665 Aug 10 11:07 oracle 
 
   
   Explanation: 
   The problem occurs due to an incorrect setting on the oracle executable. 
 
 
4. ORA-12541 ORA-12560 TRYING TO CONNECT TO A DATABASE 
 
   Problem: 
   You are trying to connect to a database using SQL*Net and receive the  
   following error ORA-12541 ORA-12560 after change the TCP/IP port in the  
   listener.ora and you are using PARAMETER USE_CKPFILE_LISTENER in  
   listener.ora.  
 
    The following error struct appears in the SQLNET.LOG: 
 
    nr err code: 12203 
    TNS-12203: TNS:unable to connect to destination 
    ns main err code: 12541 
    TNS-12541: TNS:no listener 
    ns secondary err code: 12560 
    nt main err code: 511 
    TNS-00511: No listener 
    nt secondary err code: 239 
    nt OS err code: 0 
 
   Solution: 
   Check [NOTE:1061927.6] <ml2_documents.showDocument?p_id=1061927.6&p_database_id=NOT> to resolve the problem. 
 
   Explanation: 
   If TCP protocol is listed in the Listener.ora's ADDRESS_LIST section and  
   the parameter USE_CKPFILE_LISTENER = TRUE, the Listener ignores the TCP  
   port number defined in the ADDRESS section and listens on a random port. 
          
 
RELATED DOCUMENTS 
----------------- 
Note:39774.1 <ml2_documents.showDocument?p_id=39774.1&p_database_id=NOT>    LOG & TRACE Facilities on NET . 
Note:45878.1 <ml2_documents.showDocument?p_id=45878.1&p_database_id=NOT>    SQL*Net Common Errors & Diagnostic Worksheet 
Net8i Admin/Ch.11  Troubleshooting Net8 / Resolving the Most Common  
                   Error Messages 


19.37 ORA-12637
---------------

Packet received failed.

A process was unable to receive a packet from another process. Possible causes are: 1. The other process was terminated.
2. The machine on which the other process is running went down.
3. Some other communications error occurred.

Note 1:

Just edit the file sqlnet.ora and search for the string SQLNET.AUTHENTICATION_SERVICES. 
When it exists it�s set to = (TNS), change this to = (NONE). When it doesn�t exist, add the string 
SQLNET.AUTHENTICATION_SERVICES = (NONE)

Note 2:

What does SQLNET.AUTHENTICATION_SERVICES do?

SQLNET.AUTHENTICATION_SERVICES
Purpose
Use the parameter SQLNET.AUTHENTICATION_SERVICES to enable one or more authentication services. 
If authentication has been installed, it is recommended that this parameter be set to either none or to one 
of the authentication methods.

Default
None

Values
Authentication Methods Available with Oracle Net Services:
none for no authentication methods. A valid username and password can be used to access the database. 
all for all authentication methods 
nts for Windows NT native authentication 
Authentication Methods Available with Oracle Advanced Security:
kerberos5 for Kerberos authentication 
cybersafe for Cybersafe authentication 
radius for RADIUS authentication 
dcegssapi for DCE GSSAPI authentication 

See Also: 
Oracle Advanced Security Administrator's Guide
 

Example
SQLNET.AUTHENTICATION_SERVICES=(kerberos5, cybersafe)


Note 3:

ORA-12637 for members of one NT group, using OPS$ login

Being "identified externally", users can work fine until the user is added to a "wwwauthor" NT group to allow them 
to publish documents on Microsoft IIS (intranet) -- then they get ORA-12637 starting the Oracle c/s application 
(document management system). 
The environment is: Oracle 9.2.0.1.0 on Windows 2000 Advanced Server w. SP4, Windows 2003 domain controllers 
in W2K compatible mode, client workstations with W2K and Win XP. 
Any hint will be appreciated. 

Problem solved. Specific NT group (wwwauthor) which caused problems had existed already with specific permissions, 
then it was dropped and created again with exactly the same name (but, of course, with different internal ID). 
This situation have been identified as causing some kind of mess. 
A completely new group with different name has been created. 

Note 4:

ORA-12637 packet receive failure

I added a second instance to the Oracle server. Since then, on the server and all clients, 
I get ORA-12637 packet receive failure when I try to connect to this database. Why is this? 

Hello 

Try commenting out the SQLNET.CRYPTO_SEED and SQLNET.AUTHENTICATION_SERVICES in the server's SQLNET.ORA 
and on the client sqlnet file if they exist. 

Please also verify that the server's LISTENER.ORA file contains the following parameter: 
CONNECT_TIMEOUT_LISTENER=0 

Note 5:

Workaround is to turn off prespawned server processes in "listener.ora".   
In the "listener.ora", comment out or delete the prespawn parameters, ie:   
SID_LIST_LISTENER =   
(SID_LIST =     
   (SID_DESC = 
     (SID_NAME = prd)       
       (ORACLE_HOME = /raid/app/oracle/product/7.3.4) 
#      (PRESPAWN_MAX = 99) 
#      (PRESPAWN_LIST = 
#      (PRESPAWN_DESC = (PROTOCOL = TCP) (POOL_SIZE = 1) (TIMEOUT = 30)) #      )     )   ) 


Note 6:

Problem Description 
-------------------  
Connections to Oracle 9.2 using a Cybersafe authenticated user fails on Solaris  2.6 with ORA-12637 and a core dump is generated.   
Solution Description 
--------------------  
1) Shutdown Oracle, the listener and any clients.  
2) In $ORACLE_HOME/lib take a backup copy of the file sysliblist    
3) Edit sysliblist. Move the -lthread entry to the beginning. 
   So change from,  	-lnsl -lsocket -lgen -ldl -lsched -lthread  To,  	-lthread -lnsl -lsocket -lgen -ldl -lsched   
4) Do $ORACLE_HOME/bin/relink all 


Note 7:

fact: Oracle Server - Personal Edition 8.1
fact: MS Windows
symptom: Starting Server Manager (Svrmgrl) Fails
symptom: ORA-12637: Packet Receive Failed
cause: Oracle's installer will set the authentication to (NTS) by default.
However, if the Windows machine is not in a Domain where there
is a Windows Domain Controller, it will not be able to contact the
KDC (Key Distribtion Centre) needed for Authentication.


fix:

Comment out SQLNET.AUTHENTICATION_SERVICES=(NTS) in sqlnet.ora


19.38 ORA 02058:
================

dba_2pc_pending:
Lists all in-doubt distributed transactions. The view is empty until populated by an in-doubt transaction. 
After the transaction is resolved, the view is purged.

SQL> SELECT LOCAL_TRAN_ID, GLOBAL_TRAN_ID, STATE, MIXED, HOST, COMMIT#
  2  FROM DBA_2PC_PENDING
  3  /

LOCAL_TRAN_ID          GLOBAL_TRAN_ID                                                   
---------------------- ----------------------------------------------------------
6.31.5950              1145324612.10D447310B5FCE408A296417959EBEEC00000000              

SQL> select STATE, MIXED, HOST, COMMIT#
  2  FROM DBA_2PC_PENDING
  3  /

STATE            MIX HOST                                                               
---------------- --- ------------------------------------------------------------
forced rollback  no  REBV\PGSS-TST-TCM      

                                           
SQL> select * from dba_2pc_neighbors;

LOCAL_TRAN_ID          IN_ DATABASE                                             
---------------------- --- --------------------------------------------------
6.31.5950              in  O                                                    

SQL> select state, tran_comment, advice from dba_2pc_pending;

STATE            TRAN_COMMENT                                                   
---------------- ------------------------------------------------------------
prepared

SQL> rollback force '6.31.5950';

Rollback complete.

SQL> commit;


Doc ID:  Note:290405.1 
Subject:  ORA-30019 When Executing Dbms_transaction.Purge_lost_db_entry 
Type:  PROBLEM 
Status:  MODERATED 
 Content Type:  TEXT/X-HTML 
Creation Date:  11-NOV-2004 
Last Revision Date:  16-NOV-2004 


The information in this document applies to: 
Oracle Server - Enterprise Edition - Version: 9.2.0.5
This problem can occur on any platform.

Errors
ORA-30019 Illegal rollback Segment operation in Automatic Undo mode

Symptoms
Attempting to clean up the pending transaction using DBMS_TRANSACTION.PURGE_LOST_DB_ENTRY, getting ora-30019:

ORA-30019: Illegal rollback Segment operation in Automatic Undo mode 
Changes
AUTO UNDO MANAGEMENT is running 
Cause
DBMS_TRANSACTION.PURGE_LOST_DB_ENTRY is not supported in AUTO UNDO MANAGEMENT
This is due to fact that "set transaction use rollback segment.." cannot be done in AUM.

Fix
1.) alter session set "_smu_debug_mode" = 4;
2.) execute DBMS_TRANSACTION.PURGE_LOST_DB_ENTRY('local_tran_id'); 


19.39. ORA-600 [12850]: 
=======================

Doc ID </help/usaeng/Search/search.html>: 	Note:1064436.6	Content Type: 	TEXT/PLAIN	
Subject: 	ORA-00600 [12850], AND ORA-00600 [15265]: WHEN SELECT OR DESCRIBE ON TABLE	Creation Date: 	14-JAN-1999	
Type: 	PROBLEM	Last Revision Date: 	29-FEB-2000	
Status: 	PUBLISHED		
 
Problem Description:  
---------------------  
You are doing a describe or select on a table and receive: 
 
ORA-600 [12850]: 
Meaning:  12850 occurs when it can't find the user who owns the object 
          from the dictionary. 
 
If you try to delete the table, you receive: 
 
ORA-600 [15625]: 
Meaning:  The arguement 15625 is occuring because some index entry for the  
table is not found in obj$. 
 
Problem Explanation:  
--------------------  
The data dictionary is corrupt. 
 
You cannot drop the tables in question because the data dictionary doesn't know  
they exist. 
   
Search Words:  
-------------  
ORA-600 [12850] 
ORA-600 [15625] 
describe 
delete 
table 
 
Solution Description:  
--------------------- 
You need to rebuild the database. 
 
Solution Explanation:  
---------------------
  
Since the table(s) cannot be accessed or dropped because of the data dictionary  
corruption, rebuilding the database is the only option. 


19.40 ORA-01092:
================

-------------------------------------------------------------------------------------------

Doc ID </help/usaeng/Search/search.html>: 	Note:222132.1	Content Type: 	TEXT/PLAIN	
Subject: 	ORA-01599 and ORA-01092 while starting database	Creation Date: 	03-DEC-2002	
Type: 	PROBLEM	Last Revision Date: 	07-AUG-2003	
Status: 	PUBLISHED		
PURPOSE 
------- 
 
The purpose of this Note is to fix errors ORA-01599 & ORA-01092 when 
recieved at startup. 
  
 
SCOPE & APPLICATION 
------------------- 
 
All DBAs, Support Analyst. 
 
 
Symptom(s) 
~~~~~~~~~~ 
 
Starting the database gives errors similar to: 
 
ORA-01599: failed to acquire rollback segment (20), cache space is  
   full (currently has (19) entries) 
ORA-01092: ORACLE instance terminated 
 
 
Change(s) 
~~~~~~~~~~ 
 
Increased shared_pool_size parameter. 
Increased processes and/or sessions parameters. 
 
 
Cause 
~~~~~~~ 
 
Low value for max_rollback_segments 
The above changes changed the value for max_rollback_segments internally. 
 
 
Fix 
~~~~ 
 
The value for max_rollback_segments which is to be calculated as follows: 
 
max_rollback_segments = transactions/transactions_per_rollback_segment or  
30 whichever is greater. 
 
transactions = session * 1.1; 
 
sessions = (processes * 1.1) + 5; 
 
The default value for transactions_per_rollback_segment = 5; 
 
1. Use these calculations and find out the value for max_rollback_segments. 
2. Set it to this value or 30 whichever is greater. 
3. Startup database after this correct setting. 
 
 
Reference info 
~~~~~~~~~~~~~~ 
[BUG:2233336] <ml2_documents.showDocument?p_id=2233336&p_database_id=BUG> - 
RDBMS ERRORS AT STARTUP CAN CAUSE ODMA TO OMIT CLEANUP ACTIONS 
[NOTE:30764.1] <ml2_documents.showDocument?p_id=30764.1&p_database_id=NOT> - 
Init.ora Parameter "MAX_ROLLBACK_SEGMENTS" Reference Note

--------------------------------------------------------------------------------------------

Doc ID </help/usaeng/Search/search.html>: 	Note:1038418.6	Content Type: 	TEXT/PLAIN	
Subject: 	ORA-01092 STARTING UP ORACLE RDBMS DATABASE	Creation Date: 	17-NOV-1997	
Type: 	PROBLEM	Last Revision Date: 	06-JUL-1999	
Status: 	PUBLISHED		
 
Problem Summary: 
================ 
 
ORA-01092 starting up Oracle RDBMS database. 
 
 
Problem Description: 
==================== 
 
When you startup your Oracle RDBMS database, you receive the following error: 
 
     ORA-01092: ORACLE instance terminated. Disconnection forced. 
 
 
Problem Explanation: 
==================== 
 
Oracle cannot write to the alert_<SID>.log file because the 
ownership and/or permissions on the BACKGROUND_DUMP_DEST directory 
are incorrect. 
 
Solution Summary: 
================= 
 
Modify the ownership and permissions of directory BACKGROUND_DUMP_DEST. 
 
 
Solution Description: 
===================== 
 
To allow oracle to write to the BACKGROUND_DUMP_DEST directory (contains 
alert_<SID>.log), modify the ownership of directory BACKGROUND_DUMP_DEST  
so that the oracle user (software owner) is the owner and make the  
permissions on directory BACKGROUND_DUMP_DEST 755. 
 
Follow these steps: 
 
  1.  Determine the location of the BACKGROUND_DUMP_DEST parameter  
      defined in the init<SID>.ora or config<SID>.ora files. 
 
  2.  Login as root. 
 
  3.  Change directory to the location of BACKGROUND_DUMP_DEST. 
 
  4.  Change the owner of all the files and the directory to the  
      software owner. 
 
      For example: 
  
        % chown oracle * 
    
  5.  Change the permissions on the directory to 755. 
  
        % chmod 755 . 
 
 
Solution Explanation: 
===================== 
 
Changing the ownership and permissions of the BACKGROUND_DUMP_DEST 
directory, enables oracle to write to the alert_<SID>.log file. 


---------------------------------------------------------------------------

Doc ID </help/usaeng/Search/search.html>: 	Note:273413.1	Content Type: 	TEXT/X-HTML	
Subject: 	Database Does not Start, Ora-00604 Ora-25153 Ora-00604 Ora-1092	Creation Date: 	19-MAY-2004	
Type: 	PROBLEM	Last Revision Date: 	04-OCT-2004	
Status: 	MODERATED		
The information in this article applies to: 
Oracle Server - Enterprise Edition - Version: 8.1.7.4 to 10.1.0.4
This problem can occur on any platform.
Errors
ORA-1092 Oracle instance terminated.
ORA-25153 Temporary Tablespace is Empty
ORA-604 error occurred at recursive SQL level <num>
Symptoms
The database is not opening and in the alert.log the following errors are reported:

ORA-00604: error occurred at recursive SQL level 1
ORA-25153: Temporary Tablespace is Empty
Error 604 happened during db open, shutting down database
USER: terminating instance due to error 604
Instance terminated by USER, pid = xxxxx
ORA-1092 signalled during: alter database open...

You might find SQL in the trace file like:

select distinct d.p_obj#,d.p_timestamp from sys.dependency$ d, obj$ o where d.p_obj#>=:1 and d.d_obj#=o.obj# 
and o.status!=5

Cause
In the case where there's locally managed temp tablespace in the database,after controlfile is 
re-created using the statement generated by "alter database backup controlfile to trace", the database 
can't be opened again because it complains that temp tablespace is empty. However no tempfiles can be added 
to the temp tablespace, nor can the temp tablespace be dropped because the database is not yet open.

The query failed because of inadequate sort space(memory + disk)

Fix
We can increase the sort_area_size and sort_area_retained_size to a very high value so that the query completes. 
Then DB will open and we can take care of the TEMP tablespace

If the error still persists after increasing the sort_area_size and sort_area_retained_size to a high vale, 
then the only option remains is to restore and recover.

-------------------------------------------------------------------------------

Displayed below are the messages of the selected thread. 


Thread Status: Active 

From: Ronald Shaffer 17-Mar-05 19:23 
Subject: Deleted OUTLN and now I get ORA-1092 and ORA-18008 

RDBMS Version: 10G
Operating System and Version: RedHat ES 3
Error Number (if applicable): ORA-1092 and ORA-18008
Product (i.e. SQL*Loader, Import, etc.): 
Product Version: 

Deleted OUTLN and now I get ORA-1092 and ORA-18008

One of our DBAs dropped the OUTLN user in 10G and now the instance will not start. 
We get an ORA-18008 specifying the schema is missing and an ORA-1092 when it attempts to OPEN. 
Startup mount is as far as we can get. Any experience with this issue out there? 

Thanks... 


From: Fairlie Rego 23-Mar-05 01:26 
Subject: Re : Deleted OUTLN and now I get ORA-1092 and ORA-18008 


Hi Ronald, 

You are hitting bug 3786479 
AFTER DROPPING THE OUTLN USER/SCHEMA, DB WILL NO LONGER OPEN.ORA-18008 

http://metalink.oracle.com/metalink/plsql/ml2_documents.showDocument?p_database_id=BUG&p_id=3786479 

If this is still an issue file a Tar and get a backport. 

Regards, 
Fairlie Rego 

----------------------------------------------------------------------------------
 
Displayed below are the messages of the selected thread. 


Thread Status: Closed 

From: Henry Lau 06-Mar-03 10:38 
Subject: ORA-01092 while alter datbase open 

RDBMS Version: 9.0.1.3
Operating System and Version: Linux Redhat 7.1
Error Number (if applicable): ORA-01092
Product (i.e. SQL*Loader, Import, etc.): ORACLE DATABASE
Product Version: 9.0.1.3

ORA-01092 while alter datbase open

Hi, 

Since our undotbs is very large and we try to follow the Doc ID: 157278.1, we are trying to change the undotbs 
to a new one 

We try to 
1. Create UNDO tablespace undotb2 datafile $ORACLE_HOME/oradata/undotb2.dbf size 300M 
2. ALTER SYSTEM SET undo_tablespace=undotb2; 
3. Change undo = undotb2; 
4. Restart the database; 
5. alter tablespace undotbs offline; 
6. when we restart the database, it shows the following error. 

SQL> startup mount pfile=$ORACLE_HOME/admin/TEST/pfile/init.ora 
ORACLE instance started. 

Total System Global Area 386688540 bytes 
Fixed Size 280092 bytes 
Variable Size 318767104 bytes 
Database Buffers 67108864 bytes 
Redo Buffers 532480 bytes 
Database mounted. 
SQL> alter database nomount; 
alter database nomount 
* 
ERROR at line 1: 
ORA-02231: missing or invalid option to ALTER DATABASE 


SQL> alter database open; 
alter database open 
* 
ERROR at line 1: 
ORA-01092: ORACLE instance terminated. Disconnection forced 


I have checked the Log file as follow: 

SQL> 
/u01/oracle/product/9.0.1/admin/TEST/udump/ora_29151.trc 
Oracle9i Release 9.0.1.3.0 - Production 
JServer Release 9.0.1.3.0 - Production 
ORACLE_HOME = /u01/oracle/product/9.0.1 
System name: Linux 
Node name: utxrho01.unitex.com.hk 
Release: 2.4.2-2smp 
Version: #1 SMP Sun Apr 8 20:21:34 EDT 2001 
Machine: i686 
Instance name: TEST 
Redo thread mounted by this instance: 1 
Oracle process number: 9 
Unix process pid: 29151, image: oracle@utxrho01.unitex.com.hk (TNS V1-V3) 

*** SESSION ID:(8.3) 2003-03-06 17:25:38.615 
Evaluating checkpoint for thread 1 sequence 8 block 2 
ORA-00376: file 2 cannot be read at this time 
ORA-01110: data file 2: '/u01/oracle/product/9.0.1/oradata/TEST/undotbs01.dbf' 
~ 
~ 
~ 
~ 
Please help to check what the problem is ?? 
Thank you !! 

Regards, 
Henry 


From: Oracle, Pravin Sheth 07-Mar-03 09:31 
Subject: Re : ORA-01092 while alter datbase open 


Hi Henry, 
What you are seeing is bug 2360088, which is fixed in Oracle 9.2.0.2. 
I suggest that you log an iSR (formerly iTAR) for a quicker solution for the problem. 
Regards 
Pravin 

-----------------------------------------------------------------------------------


19.41 ORA-600 [qerfxFetch_01]
=============================

Note 1:
-------

Doc ID:  Note:255881.1 
Subject:  ORA-600 [qerfxFetch_01] 
Type:  REFERENCE 
Status:  PUBLISHED 
 Content Type:  TEXT/X-HTML 
Creation Date:  10-NOV-2003 
Last Revision Date:  12-NOV-2004 
 

<Internal_Only>

  This note contains information that has not yet been reviewed by the
  PAA Internals group or DDR.

  As such, the contents are not necessarily accurate and care should be
  taken when dealing with customers who have encountered this error.

  If you are going to use the information held in this note then please
  take whatever steps are needed to in order to confirm that the
  information is accurate. Until the article has been set to EXTERNAL, we
  do not guarantee the contents.

  Thanks. PAA Internals Group

(Note - this section will be deleted as the note moves to publication)

</Internal_Only>

Note: For additional ORA-600 related information please read Note 146580.1

PURPOSE:
  This article represents a partially published OERI note.

  It has been published because the ORA-600 error has been 
  reported in at least one confirmed bug.

  Therefore, the SUGGESTIONS section of this article may help
  in terms of identifying the cause of the error.

  This specific ORA-600 error may be considered for full publication
  at a later date. If/when fully published, additional information 
  will be available here on the nature of this error.

<Internal_Only>
PURPOSE:
  This article discusses the internal error "ORA-600 [qerfxFetch_01]", what
  it means and possible actions. The information here is only applicable
  to the versions listed and is provided only for guidance.

ERROR:
  ORA-600 [qerfxFetch_01]

VERSIONS:
  versions 9.2

DESCRIPTION:

  During database operations, user interrupts need to be handled correctly.

  ORA-600 [qerfxFetch_01] is raised when an interrupt has been trapped 
  but has not been handled correctly.

FUNCTIONALITY:
  Fixed table row source. 

IMPACT:
  NON CORRUPTIVE - No underlying data corruption.

</Internal_Only>
SUGGESTIONS:

  If the Known Issues section below does not help in terms of identifying
  a solution, please submit the trace files and alert.log to Oracle 
  Support Services for further analysis.

  Known Issues:

  Bug# 2306106   See Note 2306106.8
      OERI:[qerfxFetch_01] possible - affects OEM
      Fixed: 9.2.0.2, 10.1.0.2
 

<Internal_Only>

 INTERNAL ONLY SECTION - NOT FOR PUBLICATION OR DISTRIBUTION TO CUSTOMERS
 ========================================================================


Ensure that this note comes out on top in Metalink when searched
ora-600 ora-600 ora-600 ora-600 ora-600 ora-600 ora-600
ora-600 ora-600 ora-600 ora-600 ora-600 ora-600 ora-600
qerfxFetch_01 qerfxFetch_01 qerfxFetch_01 qerfxFetch_01 qerfxFetch_01
qerfxFetch_01 qerfxFetch_01 qerfxFetch_01 qerfxFetch_01 qerfxFetch_01
qerfxFetch_01 qerfxFetch_01 qerfxFetch_01 qerfxFetch_01 qerfxFetch_01
qerfxFetch_01 qerfxFetch_01 qerfxFetch_01 qerfxFetch_01 qerfxFetch_01

</Internal_Only>


Note 2:
-------

Doc ID </help/usaeng/Search/search.html>: 	
Note:2306106.8	Content Type: 	TEXT/X-HTML	
Subject: 	Support Description of Bug 2306106	
Creation Date: 	13-AUG-2003	
Type: 	PATCH	Last Revision Date: 	14-AUG-2003	
Status: 	PUBLISHED		

Click here <javascript:getdoc('NOTE:245840.1')> for details of sections in this note.
Bug 2306106 OERI:[qerfxFetch_01] possible - affects OEM
This note gives a brief overview of bug 2306106. 
Affects:
Product (Component)	Oracle Server (RDBMS)	
Range of versions believed to be affected	Versions >= 9.2 but < 10G 	
Versions confirmed as being affected	9.2.0.1 	
Platforms affected	Generic (all / most platforms affected)	
Fixed:
This issue is fixed in	9.2.0.2 (Server Patch Set)  10G Production Base Release 	
Symptoms:
Error may occur <javascript:taghelp('TAGS_ERROR')> 
Internal Error may occur (ORA-600) <javascript:taghelp('TAGS_OERI')> 
ORA-600 [qerfxFetch_01] 
Related To:
(None Specified) 
Description
ORA-600 [qerfxFetch_01] possible -  affects OEM


Note 3:
-------
      
Bug 2306106 is fixed in the 9.2.0.2 patchset. This bug is not published and thus cannot be viewed externally 
in MetaLink. All it says on this bug is 'ORA-600 [qerfxFetch_01] possible - affects OEM'. 


19.42 Undo corruption:
======================

Note 1:
-------

Doc ID </help/usaeng/Search/search.html>: 	Note:2431450.8	Content Type: 	TEXT/X-HTML	
Subject: 	Support Description of Bug 2431450	Creation Date: 	08-AUG-2003	
Type: 	PATCH	Last Revision Date: 	05-JAN-2004	
Status: 	PUBLISHED		
Click here <javascript:getdoc('NOTE:245840.1')> for details of sections in this note.

Bug 2431450 SMU Undo corruption possible on instance crash

This note gives a brief overview of bug 2431450. 
Affects:
Product (Component)	(Rdbms)	
Range of versions believed to be affected	Versions >= 9 but < 10G 	
Versions confirmed as being affected	9.0.1.4  9.2.0.3 	
Platforms affected	Generic (all / most platforms affected)	
Fixed:
This issue is fixed in	9.0.1.5 iAS Patch Set  9.2.0.4 (Server Patch Set)  10g Production Base Release 	
Symptoms:
Corruption (Physical) <javascript:taghelp('TAGS_CORR_PHY')> 
Internal Error may occur (ORA-600) <javascript:taghelp('TAGS_OERI')> 
ORA-600 [kteuPropTime-2] / ORA-600 [4191] 
Related To:
System Managed Undo 
Description
SMU (System Managed Undo) Undo corruption possible on instance crash.

This can result in subsequent ORA-600 errors due to the undo 
corruption.


Note 2:
-------

Doc ID </help/usaeng/Search/search.html>: 	Note:233864.1	Content Type: 	TEXT/X-HTML	
Subject: 	ORA-600 [kteuproptime-2]	Creation Date: 	28-MAR-2003	
Type: 	REFERENCE	Last Revision Date: 	07-APR-2005	
Status: 	PUBLISHED		

Note: For additional ORA-600 related information please read Note 146580.1 
</metalink/plsql/showdoc?db=NOT&id=146580.1>

PURPOSE:
  This article discusses the internal error "ORA-600 [kteuproptime-2]", 
  what it means and possible actions. The information here is only 
  applicable to the versions listed and is provided only for guidance.
 
ERROR:
  ORA-600 [kteuproptime-2]

VERSIONS:
  versions 9.0 to 9.2
 
DESCRIPTION:

  Oracle has encountered an error propagating Extent Commit Times in
  the Undo Segment Header / Extent Map Blocks, for System Managed Undo
  Segments

  The extent being referenced is not valid.
 
FUNCTIONALITY:      
  UNDO EXTENTS
 
IMPACT:
  INSTANCE FAILURE
  POSSIBLE PHYSICAL CORRUPTION

SUGGESTIONS:

  If instance is down and fails to restart due to this error then set the 
  following parameter, which will gather additional information to 
  assist support in identifing the cause:

  # Dump Undo Segment Headers during transaction recovery
  event="10015 trace name context forever, level 10" 
  
  Restart the instance and submit the trace files and alert.log to 
  Oracle Support Services for further analysis.

  Do not set any other undo/rollback_segment parameters without direction 
  from Support.

  Known Issues:

  Bug# 2431450   See Note 2431450.8 </metalink/plsql/showdoc?db=NOT&id=2431450.8>
      SMU Undo corruption possible on instance crash
      Fixed: 9.2.0.4, 10.1.0.2


Note 3:
-------

Hi, 

apply patchset 9.2.0.2, bug 2431450 is fixed in 9.2.0.2 that made 
SMU (System Managed Undo) Undo corruption possible on instance crash. 

It's a very rare scenario : 

This will only cause a problem if there was an instance crash after a 
transaction committed but before it propogated the extent commit times to all 
its extents AND there was a shrink of extents before the transaction could 
be recovered. 

But still, this bug was not published (not for any particular reason 
except it was found internal). 

Greetings, 


Note 4:
-------

From: Oracle, Ken Robinson 21-Feb-03 17:44 
Subject: Re : ORA-600 kteuPropTime-2 


Forgot to mention the second bug for this....bug 2689239. 

Regards, 
Ken Robinson 
Oracle Server EE Analyst 


ORA-600 [4191] possible on shrink of system managed undo segment.


Note 5:
-------

BUGBUSTER - System-managed undo segment corruption

Affects Versions: 9.2.0.1.0,   9.2.0.2.0,   9.2.0.3.0 
Fixed in: Patch 2431450,   9.2.0.4.0 
BUG# (if recognised) 2431450 
This info. correct on: 31-AUG-2003 

Symptoms

Oracle instance crashes and details of the ORA-00600 error are written to the alert.log
ORA-00600: internal error code, arguments: [kteuPropTime-2], [], [], []

Followed by
Fatal internal error happened while SMON was doing active transaction recovery.

Then
SMON: terminating instance due to error 600
Instance terminated by SMON, pid = 22972


This occurs as Oracle encounters an error when propagating Extent Commit Times in the Undo Segment Header Extent Map Blocks.
It could be because SMON is over-enthusiastic in shrinking extents in SMU segments. As a result, extent commit times 
do not get written to all the extents and SMON causes the instance to crash, leaving one or more of the undo segments 
corrupt.

When opening the database following the crash, Oracle tries to perform crash recovery and encounters problems 
recovering committed transactions stored in the corrupt undo segments. This leads to more ORA-00600 errors 
and a further instance crash. The net result is that the database cannot be opened:

"Error 600 happened during db open, shutting down database"


Workaround

Until the corrupt undo segment can be identified and offlined then unfortunately the database will not open. 
Identify the corrupt undo segment by setting the following parameters in the init.ora file:

_smu_debug_mode=1
event="10015 trace name context forever, level 10"

(set event 10511)
event="10511 trace name context forever, level 2" 


_smu_debug_mode simply collects diagnostic information for support purposes. Event 10015 is the undo segment 
recovery tracing event. Use this to identify corrupted rollback/undo segments when a database cannot be started.

With these parameters set, an attempt to open the database will still cause a crash, but Oracle will write 
vital information about the corrupt rollback/undo segments to a trace file in user_dump_dest. 
This is an extract from such a trace file, revealing that undo segment number 6 (_SYSSMU6$) is corrupt. 
Notice that the information stored in the segment header about the number of extents was inconsistent 
with the extent map.

Recovering rollback segment _SYSSMU6$
UNDO SEG (BEFORE RECOVERY): usn = 6 Extent Control Header
-----------------------------------------------------------------
Extent Header:: spare1: 0 spare2: 0 #extents: 7 #blocks: 1934
last map 0x00805f89 #maps: 1 offset: 4080
Highwater:: 0x0080005b ext#: 0 blk#: 1 ext size: 7
#blocks in seg. hdr's freelists: 0
#blocks below: 0
mapblk 0x00000000 offset: 0
Unlocked
Map Header:: next 0x00805f89 #extents: 5 obj#: 0 flag: 0x40000000
Extent Map
-----------------------------------------------------------------
0x0080005a length: 7
0x00800061 length: 8
0x0081ac89 length: 1024
0x00805589 length: 256
0x00805a89 length: 256

Retention Table
-----------------------------------------------------------
Extent Number:0 Commit Time: 1060617115
Extent Number:1 Commit Time: 1060611728
Extent Number:2 Commit Time: 1060611728
Extent Number:3 Commit Time: 1060611728
Extent Number:4 Commit Time: 1060611728


Comment out parameters undo_management and undo_tablespace and set the undocumented _corrupted_rollback_segments 
parameter to tell Oracle to ignore any corruptions and force the database open:

_corrupted_rollback_segments=(_SYSSMU6$)

This time, Oracle will start and open OK, which will allow you to check the status of the undo segments 
by querying DBA_ROLLBACK_SEGS.

select segment_id, segment_name, tablespace_name, status
from dba_rollback_segs
where owner='PUBLIC';

SEGMENT_ID SEGMENT_NAME TABLESPACE_NAME STATUS
---------- ------------ --------------- ----------------
         1 _SYSSMU1$    UNDOTS          OFFLINE
         2 _SYSSMU2$    UNDOTS          OFFLINE
         3 _SYSSMU3$    UNDOTS          OFFLINE
         4 _SYSSMU4$    UNDOTS          OFFLINE
         5 _SYSSMU5$    UNDOTS          OFFLINE
         6 _SYSSMU6$    UNDOTS          NEEDS RECOVERY
         7 _SYSSMU7$    UNDOTS          OFFLINE
         8 _SYSSMU8$    UNDOTS          OFFLINE
         9 _SYSSMU9$    UNDOTS          OFFLINE
        10 _SYSSMU10$   UNDOTS          OFFLINE

SMON will complain every 5 minutes by writing entries to the alert.log as long as there are undo segments 
in need of recovery

SMON: about to recover undo segment 6
SMON: mark undo segment 6 as needs recovery

At this point, you must either download and apply patch 2431450 or create private rollback segments.


Note 6:
-------

Repair UNDO log corruption  Don Burleson	
In rare cases (usually DBA error) the Oracle UNDO tablespace can become corrupted.

This manifests with this error: ORA-00376: file xx cannot be read at this time 

In cases of UNDO log corruption, you must:

� Change the undo_management parameter from �AUTO� to �MANUAL�
� Create a new UNDO tablespace
� Drop the old UNDO tablespace

Dropping the corrupt UNDO tablespace can be tricky and you may get the message: 

ORA-00376: file string cannot be read at this time

To drop a corrupt UNDO tablespace:

1 � Identify the bad segment:

select 
segment_name, 
status 
from 
dba_rollback_segs 
where 
tablespace_name='undotbs_corrupt'
and
status = �NEEDS RECOVERY�;

SEGMENT_NAME STATUS
------------------------------ ----------------
_SYSSMU22$ NEEDS RECOVERY

2. Bounce the instance with the hidden parameter �_offline_rollback_segments�, specifying the bad segment name:

_OFFLINE_ROLLBACK_SEGMENTS=_SYSSMU22$


3. Bounce database, nuke the corrupt segment and tablespace:
SQL> drop rollback segment "_SYSSMU22$";
Rollback segment dropped.

SQL > drop tablespace undotbs including contents and datafiles;
Tablespace dropped.


Note 7:
-------

Sometimes there can be trouble with an undo segment.
Actually there might be something with a normal object:

PUT the following in the init.ora-
event = "10015 trace name context forever, level 10"

Setting this event will generate a trace file that will reveal the
necessary information about the transaction Oracle is trying to
rollback and most importantly, what object Oracle is trying to apply
the undo to.

USE the following query to find out what object Oracle is trying to
perform recovery on.

select owner, object_name, object_type, status
from dba_objects where object_id = <object #>;

THIS object must be dropped so the undo can be released. An export or
relying on a backup may be necessary to restore the object after the corrupted
rollback segment goes away.


19.43 ORA-1653
==============

Note 1:
-------


Doc ID </help/usaeng/Search/search.html>: 	Note:151994.1	Content Type: 	TEXT/PLAIN	
Subject: 	Overview Of ORA-01653: Unable To Extend Table %s.%s By %s In Tablespace %s	Creation Date: 	12-JUL-2001	
Type: 	TROUBLESHOOTING	Last Revision Date: 	15-JUN-2004	
Status: 	PUBLISHED		
PURPOSE 
------- 
This bulletin is an overview of ORA-1653 error message for tablespace dictionary managed. 
 
SCOPE& APPLICATION 
------------------ 
It is for users requiring further information on ORA-01653 error message. 
 
When looking to resolve the error by using any of the solutions suggested, please 
consult the DBA for assistance. 
 
 
Error:  ORA-01653 
Text: unable to extend table %s.%s by %s in tablespace %s  
------------------------------------------------------------------------------- 
Cause:  Failed to allocate an extent for table segment in tablespace. 
Action: Use ALTER TABLESPACE ADD DATAFILE statement to add one or more 
        files to the tablespace indicated. 
 
 
Explanation: 
------------ 
This error does not necessarily indicate whether or not you have enough space  
in the tablespace, it merely indicates that Oracle could not find a large enough area of free 
contiguous space in which to fit the next extent. 
 
 
Diagnostic Steps: 
----------------- 
1. In order to see the free space available for a particular tablespace, you must 
   use the view DBA_FREE_SPACE.  Within this view, each record represents one 
   fragment of space. How the view DBA_FREE_SPACE can be used to determine  
   the space available in the database is described in: 
   [NOTE:121259.1] <ml2_documents.showDocument?p_id=121259.1&p_database_id=NOT> Using DBA_FREE_SPACE  
 
2. The DBA_TABLES view describes the size of next extent (NEXT_EXTENT) and the  
   percentage increase (PCT_INCREASE) for all tables in the database.  
   The "next_extent" size is the size of extent that is trying to be allocated (and for  
   which you have the error).  
     
   When the extent is allocated :  
		next_extent = next_extent * (1 + (pct_increase/100)) 
 
   Algorythm to allocate extent for segment is described in the Concept Guide 
   Chapter : Data Blocks, Extents, and Segments - How Extents Are Allocated 
 
3. Look to see if any users have the tablespace in question as their temporary tablespace. 
   This can be checked by looking at DBA_USERS (TEMPORARY_TABLESPACE). 
 
Possible solutions: 
------------------- 
- Manually Coalesce Adjacent Free Extents 
       ALTER TABLESPACE <tablespace name> COALESCE; 
   The extents must be adjacent to each other for this to work. 
 
- Add a Datafile:  
        ALTER TABLESPACE <tablespace name> ADD DATAFILE '<full path and file name>'  
        SIZE <integer> <k|m>;  
 
- Resize the Datafile:  
        ALTER DATABASE DATAFILE '<full path and file name>' RESIZE <integer> <k|m>;  
 
- Enable autoextend:  
       ALTER DATABASE DATAFILE '<full path and file name>' AUTOEXTEND ON  
       MAXSIZE UNLIMITED; 
 
- Defragment the Tablespace:  
 
- Lower "next_extent" and/or "pct_increase" size: 
        ALTER <segment_type> <segment_name> STORAGE ( next <integer> <k|m>  
        pctincrease <integer>);  
 
- If the tablespace is being used as a temporary tablespace, temporary segments may 
  be still holding the space. 
 
References: 
----------- 
[NOTE:1025288.6] <ml2_documents.showDocument?p_id=1025288.6&p_database_id=NOT> How to Diagnose and Resolve ORA-01650, ORA-01652, ORA-01653, ORA-01654, ORA-01688 : Unable to Extend < OBJECT > by %S in Tablespace 
[NOTE:1020090.6] <ml2_documents.showDocument?p_id=1020090.6&p_database_id=NOT> Script to Report on Space in Tablespaces 
[NOTE:1020182.6] <ml2_documents.showDocument?p_id=1020182.6&p_database_id=NOT> Script to Detect Tablespace Fragmentation 
[NOTE:1012431.6] <ml2_documents.showDocument?p_id=1012431.6&p_database_id=NOT> Overview of Database Fragmentation 
[NOTE:121259.1] <ml2_documents.showDocument?p_id=121259.1&p_database_id=NOT>  Using DBA_FREE_SPACE 
[NOTE:61997.1] <ml2_documents.showDocument?p_id=61997.1&p_database_id=NOT>   SMON - Temporary Segment Cleanup and Free Space Coalescing 


Note 2:
-------

Doc ID </help/usaeng/Search/search.html>: 	Note:1025288.6	Content Type: 	TEXT/PLAIN	
Subject: 	How to Diagnose and Resolve ORA-01650,ORA-01652,ORA-01653,ORA-01654,ORA-01688 : Unable to Extend < OBJECT > by %S in Tablespace %S	Creation Date: 	02-JAN-1997	
Type: 	TROUBLESHOOTING	Last Revision Date: 	10-JUN-2004	
Status: 	PUBLISHED		
PURPOSE 
------- 
 
This document can be used to diagnose and resolve space management errors - ORA-1650, ORA-1652, 
ORA-1653, ORA-1654 and ORA-1688. 
 
SCOPE & APPLICATION 
------------------- 
You are working with the database and have encountered one of the  
following errors:   
 
 ORA-01650: unable to extend rollback segment %s by %s in tablespace %s  
     Cause: Failed to allocate extent for the rollback segment in tablespace. 
    Action: Use the ALTER TABLESPACE ADD DATAFILE statement to add one or more  
            files to the specified tablespace. 
 
 ORA-01652: unable to extend temp segment by %s in tablespace %s  
     Cause: Failed to allocate an extent for temp segment in tablespace. 
    Action: Use ALTER TABLESPACE ADD DATAFILE statement to add one or more 
            files to the tablespace indicated or create the object in other 
            tablespace. 
 
 ORA-01653: unable to extend table %s.%s by %s in tablespace %s  
     Cause: Failed to allocate extent for table segment in tablespace. 
    Action: Use the ALTER TABLESPACE ADD DATAFILE statement to add one or more  
            files to the specified tablespace. 
  
 ORA-01654: unable to extend index %s.%s by %s in tablespace %s  
     Cause: Failed to allocate extent for index segment in tablespace. 
    Action: Use the ALTER TABLESPACE ADD DATAFILE statement to add one or more  
            files to the specified tablespace. 
 
 ORA-01688: unable to extend table %s.%s partition %s by %s in tablespace %s  
     Cause: Failed to allocate an extent for table segment in tablespace.  
    Action: Use ALTER TABLESPACE ADD DATAFILE statement to add one or more files  
            to the tablespace indicated. 
 
 
How to Solve the Following Errors About UNABLE TO EXTEND 
-------------------------------------------------------- 
 
An "unable to extend" error is raised when there is insufficient contiguous  
space available to extend the object. 
 
 
A. In order to address the UNABLE TO EXTEND issue, you need to get the following 
   information: 
 
   1. The largest contiguous space available for the tablespace 
 
      SELECT  max(bytes)  
      FROM    dba_free_space  
      WHERE   tablespace_name = '<tablespace name>';  
  
      The above query returns the largest available contiguous chunk of space.  
 
      Please note that if the tablespace you are concerned with is of type TEMPORARY, 
      then please refer to [NOTE:188610.1] <ml2_documents.showDocument?p_id=188610.1&p_database_id=NOT>. 
 
      If this query is done immediately after the failure, it will show that the 
      largest contiguous space in the tablespace is smaller than the next extent  
      the object was trying to allocate. 
        
   2. => "next_extent" for the object 
      => "pct_increase" for the object  
      => The name of the tablespace in which the object resides 
     
      Use the "next_extent" size with "pct_increase" in the following formula to 
      determine the size of extent that is trying to be allocated.  
  
      extent size = next_extent * (1 + (pct_increase/100)  
 
            next_extent = 512000  
            pct_increase = 50  
            => extent size =  512000 * (1 + (50/100)) =  512000 * 1.5 = 768000  
 
      ORA-01650 Rollback Segment  
      ==========================  
 
      SELECT  next_extent, pct_increase, tablespace_name 
      FROM    dba_rollback_segs  
      WHERE   segment_name = '<rollback segment name>';  
  
        Note:   pct_increase is only needed for early versions of Oracle, by  
        default in later versions pct_increase for a rollback segment is 0.  
 
      ORA-01652 Temporary Segment  
      ===========================  
  
      SELECT  next_extent, pct_increase, tablespace_name  
      FROM    dba_tablespaces  
      WHERE   tablespace_name = '<tablespace name>';  
  
        Temporary  segments take the default storage clause of the tablespace  
        in which they are created. 
 
      If this error is caused by a query, then try and ensure that the query  
      is tuned to perform its sorts as efficiently as possible. 
 
      To find the owner of a sort, please refer to [NOTE:1069041.6] <ml2_documents.showDocument?p_id=1069041.6&p_database_id=NOT> 
  
      ORA-01653 Table Segment  
      =======================  
  
      SELECT  next_extent, pct_increase , tablespace_name 
      FROM    dba_tables  
      WHERE   table_name = '<table name>' AND owner = '<owner>';  
 
      ORA-01654 Index Segment  
      ======================= 
  
      SELECT  next_extent, pct_increase, tablespace_name  
      FROM    dba_indexes  
      WHERE   index_name = '<index name>' AND owner = '<owner>'; 
 
      ORA-01688 Table Partition 
      ========================= 
 
      SELECT next_extent, pct_increase, tablespace_name 
      FROM   dba_tab_partitions 
      WHERE  partition_name='<partition name>' AND table_owner = '<owner>'; 
    
 
B. Possible Solutions 
     
   There are several options for solving errors due to failure to extend: 
  
 
   a. Manually Coalesce Adjacent Free Extents 
      --------------------------------------- 
 
      ALTER TABLESPACE <tablespace name> COALESCE; 
 
      The extents must be adjacent to each other for this to work. 
 
  
   b. Add a Datafile 
      -------------- 
  
      ALTER TABLESPACE <tablespace name>  
      ADD DATAFILE '<full path and file name>' SIZE <integer> <k|m>;  
 
  
   c. Lower "next_extent" and/or "pct_increase" size 
      ---------------------------------------------- 
  
      For non-temporary and non-partitioned segment problem:  
  
      ALTER <segment_type> <segment_name>  
      STORAGE ( next <integer> <k|m> pctincrease <integer>);  
  
      For non-temporary and partitioned segment problem:  
  
      ALTER TABLE <table_name> MODIFY PARTITION <partition_name> 
      STORAGE ( next <integer> <k|m> pctincrease <integer>);  
 
      For a temporary segment problem:  
  
      ALTER TABLESPACE <tablespace name>  
      DEFAULT STORAGE (initial <integer> next <integer> <k|m> pctincrease <integer>);  
 
 
    d. Resize the Datafile 
       ------------------- 
  
       ALTER DATABASE DATAFILE '<full path and file name>'  
       RESIZE <integer> <k|m>;  
 
  
    e. Defragment the Tablespace 
       ------------------------- 
  
       If you would like more information on fragmentation, the following  
       documents are available from Oracle WorldWide Support . 
       (this is not a comprehensive list)  
  
       [NOTE:1020182.6] <ml2_documents.showDocument?p_id=1020182.6&p_database_id=NOT>  Script to Detect Tablespace Fragmentation   
       [NOTE:1012431.6] <ml2_documents.showDocument?p_id=1012431.6&p_database_id=NOT>  Overview of Database Fragmentation 
       [NOTE:30910.1] <ml2_documents.showDocument?p_id=30910.1&p_database_id=NOT>    Recreating Database Objects  
 
Related Documents: 
================== 
 
[NOTE:15284.1] <ml2_documents.showDocument?p_id=15284.1&p_database_id=NOT>   Understanding and Resolving ORA-01547 
<Note.151994.1>  Overview Of ORA-01653  Unable To Extend Table %s.%s By %s In Tablespace %s:                        
<Note.146595.1>  Overview Of ORA-01654  Unable To Extend Index %s.%s By %s In Tablespace %s:    
[NOTE:188610.1] <ml2_documents.showDocument?p_id=188610.1&p_database_id=NOT>  DBA_FREE_SPACE Does not Show Information about Temporary Tablespaces 
[NOTE:1069041.6] <ml2_documents.showDocument?p_id=1069041.6&p_database_id=NOT> How to Find Creator of a SORT or TEMPORARY SEGMENT or Users  
                 Performing Sorts for Oracle8 and 9 
 
Search Words: 
============= 
 
ORA-1650 ORA-1652 ORA-1653 ORA-1654 ORA-1688  
ORA-01650 ORA-01652 ORA-01653 ORA-01654 ORA-01688  
1650 1652 1653 1654 1688 


19.44: Other ORA- errors on 9i:
===============================

Doc ID </help/usaeng/Search/search.html>: 	Note:201342.1	Content Type: 	TEXT/X-HTML	
Subject: 	Top Internal Errors - Oracle Server Release 9.2.0	Creation Date: 	27-JUN-2002	
Type: 	BULLETIN	Last Revision Date: 	24-MAY-2004	
Status: 	PUBLISHED		
Top Internal Errors - Oracle Server Release 9.2.0

Additional information or documentation on ORA-600 errors not listed here 
may be available from the ORA-600 Lookup tool :  <Note:153788.1 </metalink/plsql/showdoc?db=Not&id=153788.1>>


<Note:189908.1 </metalink/plsql/showdoc?db=Not&id=189908.1>> Oracle9i Release 2 (9.2) Support Status and Alerts 

ORA-600 [KSLAWE:!PWQ]			
Possible bugs:		Fixed in:	
<Bug:3566420 </metalink/plsql/showdoc?db=Bug&id=3566420>> 	BACKGROUND PROCESS GOT OERI:KSLAWE:!PWQ AND INSTANCE CRASHES 	9.2.0.6, 10G 	
			
References:			
<Note:271084.1 </metalink/plsql/showdoc?db=Not&id=271084.1>> 	ALERT: ORA-600[KSLAWE:!PWQ] RAISED IN V92040 OR V92050 ON SUN 64BIT ORACLE 		
			
ORA-600 [ksmals]			
Possible bugs:		Fixed in:	
<Bug:2662683 </metalink/plsql/showdoc?db=Bug&id=2662683>> 	ORA-7445 & HEAP CORRUPTION WHEN RUNNING APPS PROGRAM THAT DOES HEAVY INSERTS 	9.2.0.4 	
			
References:			
<Note:247822.1 </metalink/plsql/showdoc?db=Not&id=247822.1>> 	ORA-600 [ksmals] 		
			
ORA-600 [4000]			
Possible bugs:		Fixed in:	
<Bug:2959556 </metalink/plsql/showdoc?db=Bug&id=2959556>> 	STARTUP after an ORA-701 fails with OERI[4000] 	9.2.0.5, 10G 	
<Bug:1371820 </metalink/plsql/showdoc?db=Bug&id=1371820>> 	OERI:4506 / OERI:4000 possible against transported tablespace 	8.1.7.4, 9.0.1.4, 9.2.0.1 	
			
References:			
<Note:47456.1 </metalink/plsql/showdoc?db=Not&id=47456.1>> 	ORA-600 [4000] "trying to get dba of undo segment header block from usn" 		
			
ORA-600 [4454]			
Possible bugs:		Fixed in:	
<Bug:1402161 </metalink/plsql/showdoc?db=Bug&id=1402161>> 	OERI:4411/OERI:4454 on long running job 	8.1.7.3, 9.0.1.3, 9.2.0.1 	
			
References:			
<Note:138836.1 </metalink/plsql/showdoc?db=Not&id=138836.1>> 	ORA-600 [4454] 		
			
ORA-600 [kcbgcur_9]			
Possible bugs:		Fixed in:	
<Bug:2722809 </metalink/plsql/showdoc?db=Bug&id=2722809>> 	OERI:kcbgcur_9 on direct load into AUTO space managed segment 	9.2.0.4, 10G 	
<Bug:2392885 </metalink/plsql/showdoc?db=Bug&id=2392885>> 	Direct path load may fail with OERI:kcbgcur_9 / OERI:ktfduedel2 	9.2.0.4, 10G 	
<Bug:2202310 </metalink/plsql/showdoc?db=Bug&id=2202310>> 	OERI:KCBGCUR_9 possible from SMON dropping a rollback segment in locally managed tablespace 	9.0.1.4, 9.2.0.1 	
<Bug:2035267 </metalink/plsql/showdoc?db=Bug&id=2035267>> 	OERI:KCBGCUR_9 possible during TEMP space operations 	9.0.1.3, 9.2.0.1 	
<Bug:1804676 </metalink/plsql/showdoc?db=Bug&id=1804676>> 	OERI:KCBGCUR_9 possible from ONLINE REBUILD INDEX with concurrent DML 	8.1.7.3, 9.0.1.3, 9.2.0.1 	
<Bug:1785175 </metalink/plsql/showdoc?db=Bug&id=1785175>> 	OERI:kcbgcur_9 from CLOB TO CHAR or BLOB TO RAW conversion 	9.2.0.2, 10G 	
			
References:			
<Note:114058.1 </metalink/plsql/showdoc?db=Not&id=114058.1>> 	ORA-600 [kcbgcur_9] "Block class pinning violation" 		
			
ORA-600 [qerrmOFBu1], [1003]			
Possible bugs:		Fixed in:	
<Bug:2308496 </metalink/plsql/showdoc?db=Bug&id=2308496>> 	SQL*PLUS CRASH IN TTC LOGGING INTO ORACLE 7.3.4 DATABASE 		
			
References:			
<Note:209363.1 </metalink/plsql/showdoc?db=Not&id=209363.1>> 	ORA-600 [qerrmOFBu1] - "Error during remote row fetch operation 		
<Note:207319.1 </metalink/plsql/showdoc?db=Not&id=207319.1>> 	ALERT: Connections from Oracle 9.2 to Oracle7 are Not Supported 		
			
ORA-600 [ktsgsp5] or ORA-600 [kdddgb2]			
Possible bugs:		Fixed in:	
<Bug:2384289 </metalink/plsql/showdoc?db=Bug&id=2384289>> 	ORA-600 [KDDDGB2] [435816] [2753588] & PROBABLE INDEX CORRUPTION 	9.2.0.2 	
			
References:			
<Note:139037.1 </metalink/plsql/showdoc?db=Not&id=139037.1>> 	ORA-600 [kdddgb2] 		
<Note:139180.1 </metalink/plsql/showdoc?db=Not&id=139180.1>> 	ORA-600 [ktsgsp5] 		
<Note:197737.1 </metalink/plsql/showdoc?db=Not&id=197737.1>> 	ALERT: Corruption / Internal Errors possible after Upgrading to 9.2.0.1 		


19.45: ADJUST SCN:
==================

Note 1 Adjust SCN:
------------------

Doc ID:  Note:30681.1 
Subject:  EVENT: ADJUST_SCN - Quick Reference 
Type:  REFERENCE 
Status:  PUBLISHED 
 Content Type:  TEXT/PLAIN 
Creation Date:  20-OCT-1997 
Last Revision Date:  04-AUG-2000 
Language:  USAENG 
 

ADJUST_SCN Event
~~~~~~~~~~~~~~~~
*** WARNING ***
   This event should only ever be used under the guidance
   of an experienced Oracle analyst.
   If an SCN is ahead of the current database SCN, this indicates
   some form of database corruption. The database should be rebuilt
   after bumping the SCN. 
****************

    The ADJUST_SCN event is useful in some recovery situations where the
    current SCN needs to be incremented by a large value to ensure it 
    is ahead of the highest SCN in the database. This is typically 
    required if either:
      a. An ORA-600 [2662] error is signalled against database blocks
    or
      b. ORA-1555 errors keep occuring after forcing the database open
         or ORA-604 / ORA-1555 errors occur during database open.
         (Note: If startup reports ORA-704 & ORA-1555 errors together
                then the ADJUST_SCN event cannot be used to bump the
                SCN as the error is occuring during bootstrap.
                Repeated startup/shutdown attempts may help if the SCN
                mismatch is small)
    or
      c. If a database has been forced open used _ALLOW_RESETLOGS_CORRUPTION
         (See <Parameter:Allow_Resetlogs_Corruption> )


    The ADJUST_SCN event acts as described below.

  **NOTE: You can check that the ADJUST_SCN event has fired as it
  	  should write a message to the alert log in the form
	  "Debugging event used to advance scn to %s".
	  If this message is NOT present in the alert log the event
	  has probably not fired.


  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  If the database will NOT open:
  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
    Take a backup.
    You can use event 10015 to trigger an ADJUST_SCN on database open:

	startup mount;

	alter session set events '10015 trace name adjust_scn level 1';

        (NB: You can only use IMMEDIATE here on an OPEN database. If the 
	     database is only mounted use the 10015 trigger to adjust SCN, 
	     otherwise you get ORA 600 [2251], [65535], [4294967295] )

	alter database open;

	If you get an ORA 600:2256 shutdown, use a higher level and reopen.

    Do *NOT* set this event in init.ora or the instance will crash as soon
    as SMON or PMON try to do any clean up. Always use it with the 
    "alter session" command.

  ~~~~~~~~~~~~~~~~~~~~~~~~~~
  If the database *IS* OPEN: 
  ~~~~~~~~~~~~~~~~~~~~~~~~~~
    You can increase the SCN thus:

	alter session set events 'IMMEDIATE trace name ADJUST_SCN level 1';

    LEVEL:  Level 1 is usually sufficient - it raises the SCN to 1 billion
				            (1024*1024*1024)
	    Level 2 raises it to 2 billion etc...

	    If you try to raise the SCN to a level LESS THAN or EQUAL to its
	    current setting you will get <OERI:2256>    - See below.
	    Ie: The event steps the SCN to known levels. You cannot use
		the same level twice.

  Calculating a Level from 600 errors:
  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
	To get a LEVEL for ADJUST_SCN:

	a) Determine the TARGET scn:
 	    ora-600 [2662]    See <OERI:2662>  Use TARGET >= blocks SCN
	    ora-600 [2256]    See <OERI:2256>  Use TARGET >= Current SCN

  	b) Multiply the TARGET wrap number by 4. This will give you the level 
	   to use in the adjust_scn to get the correct wrap number.
	c) Next, add the following value to the level to get the desired base
	   value as well :

        Add to Level         Base
        ~~~~~~~~~~~~ ~~~~~~~~~~~~
                   0            0
                   1   1073741824
                   2   2147483648
                   3   3221225472


Note 2: Adjust SCN
------------------


Subject:  OERR: 600 2662 Block SCN is ahead of Current SCN
Creation Date:  21-OCT-1997 

ORA-600 [2662] [a] [b] [c] [d] [e]        
Versions: 7.0.16  - 8.0.5                                Source: kcrf.h
===========================================================================
Meaning: 
  There are 3 forms of this error. 

 	4/5 argument forms - 
  		The SCN found on a block (dependant SCN) was ahead of the 
		current SCN. See below for this

        1 Argument (before 7.2.3):
                 Oracle is in the process of writing a block to a log file.
                 If the calculated block checksum is less than or equal to 1
                 (0 and 1 are reserved) ORA-600 [2662] is returned.
                 This is a problem generating an offline immediate log marker
                 (kcrfwg).
                 *NOT DOCUMENTED HERE*
         
---------------------------------------------------------------------------
Argument Description:

  Until version 7.2.3 this internal error can be logged for two separate 
  reasons, which we will refer to as type I and type II.  The two types can 
  be distinguished by the number of arguments:
    Type I has four or five arguments after the [2662].  
    Type II has one argument after the [2662]. 
  From 7.2.3 onwards type II no longer exists.            

Type I
~~~~~~
    a.  Current SCN WRAP
    b.  Current SCN BASE
    c.  dependant SCN WRAP
    d.  dependant SCN BASE
    e.  Where present this is the DBA where the dependant SCN came from.
        From kcrf.h:
         If the SCN comes from the recent or current SCN then a dba
         of zero is saved. If it comes from undo$ because the undo segment is
         not available then the undo segment number is saved, which looks like
         a block from file 0. If the SCN is for a media recovery redo (i.e.
         block number == 0 in change vector), then the dba is for block 0 
         of the relevant datafile. If it is from another database for 
         distribute xact then dba is DBAINF(). If it comes from a TX lock 
         then the dba is really usn<<16+slot.

Type II
~~~~~~~
    a.  checksum -> log block checksum - zero if none (thread # in old format)  

---------------------------------------------------------------------------

Diagnosis:
~~~~~~~~~~      
  In addition to different basic types from above, there are different 
  situations and coherences where ORA-600 [2662] type 'I' can be raised. 

  For diagnosis we can split up startup-issues and no-startup-issues. 
  Usually the startup-issues are more critical. 

  Getting started:
  ~~~~~~~~~~~~~~~~
   (1) is the error raised during normal database operations (i.e. when the
       database is up) or during startup of the database?
   (2) what is the SCN difference [d]-[b] ( subtract argument 'b' from arg 'd')?
   (3) is there a fifth argument [e] ? 
       If so convert the dba to file# block# 
       Is it a data dictionary object? (file#=1)
       If so find out object name with the help of reference dictionary 
	from second database
   (4) What is the current SQL statement? (see trace)
       Which table is refered to?
       Does the table match the object you found in step before?

   Be careful at this point:
    there may be no relationship between DBA in [e] and real source of
    problem (blockdump).


  Deeper analysis:
  ~~~~~~~~~~~~~~~~
   - investigate trace file
     this will be a user trace file normally but could be an smon trace too

   - search for: 'buffer' 
     ("buffer dba" in Oracle7 dumps, "buffer tsn" in Oracle8 dumps)
     this will bring you to a blockdump which usually represents the 
     'real' source of OERI:2662
     WARNING: There may be more than one buffer pinned to the process
              so ensure you check out all pinned buffers.

   -> does the blockdump match the dba from e.?
   -> what kind of blockdump is it?          
       (a) rollbacksegment header
       (b) datablock
       (c) other

   SEE BELOW for EXAMPLES which demonstrate the sort of output you may 
   see in trace files and the things to check.

  Check list and possible causes
  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

   - If Parallel Server check both nodes are using the same lock manager
     instance & point at the same control files.

   - If not Parallel Server check that 2 instances haven't mounted the
     same database (Is there a second PMON process around ?? - shut
     down any other instances to be sure)

   Possible causes:
    - doing an open resetlogs with _ALLOW_RESETLOGS_CORRUPTION enabled    
    - a hardware problem, like a faulty controller, resulting in a failed 
      write to the control file or the redo logs     
    - restoring parts of the database from backup and not doing the 
      appropriate recovery     
    - restoring a control file and not doing a RECOVER DATABASE USING BACKUP 
      CONTROLFILE     
    - having _DISABLE_LOGGING set during crash recovery                      
    - problems with the DLM in a parallel server environment      
    - a bug      

    Solutions:
    - if the SCNs in the error are very close:
      Attempting a startup several times will bump up the dscn every time we
      open the database even if open fails. The database will open when 
      dscn=scn.
      
    - ** You can bump the SCN on open using <Event:ADJUST_SCN>
      See [NOTE:30681.1]
      Be aware that you should really rebuild the database if you use this
      option.

    - Once this has occurred you would normally want to rebuild the
      database via exp/rebuild/imp as there is no guarantee that some
      other blocks are not ahead of time.

Articles:
~~~~~~~~~
  Solutions:
   [NOTE:30681.1]   Details of the ADJUST_SCN Event
   [NOTE:1070079.6] alter system checkpoint

  Possible Causes:
   [NOTE:1021243.6] CHECK INIT.ORA SETTING _DISABLE_LOGGING
   [NOTE:74903.1]   How to Force the Database Open (_ALLOW_RESETLOGS_CORRUPTION)
   [NOTE:41399.1]   Forcing the database open with `_ALLOW_RESETLOGS_CORRUPTION`
   [NOTE:851959.9]  OERI:2662 DURING CREATE SNAPSHOT AT MASTER SITE


Known Bugs:
~~~~~~~~~~~

Fixed In. Bug No.      Description
---------+------------+----------------------------------------------------
7.0.14    BUG:153638
7.1.5     BUG:229873
7.1.3     Bug:195115   Miscalculation of SCN on startup for distributed TX ?
7.1.6.2.7 Bug:297197   Port specific Solaris OPS problem
7.3       Bug:336196   Port specific IBM SP AIX problem -> dlm issue    
7.3.4.5   Bug:851959   OERI:2662 possible from distributed OPS select

---------------------------------------------------------------------------
---------------------------------------------------------------------------

Examples:
~~~~~~~~
  Below are some examples of this type of error and the information
  you will see in the trace files.

~~~~~~~~~~
CASE (a)
~~~~~~~~~~
  blockdump should look like this:

***
buffer dba: 0x05000002 inc: 0x00000001 seq: 0x0001a9c6
       ver: 1 type: 1=KTU UNDO HEADER
 
  Extent Control Header
  -----------------------------------------------------------------
  Extent Control:: inc#: 716918 tsn: 4      object#: 0     
***

-> interpret:
dba: 0x05000002 -> 83886082 (0x05000002) =    5,2 
XXX tsn: 4 -> this is rollback segment 4
tsn: 4 -> this rollback segment is in tablespace 4
     
      
ORA-00600: Interner Fehlercode, Argumente:
[2662], [0], [71183], [0], [71195], [83886082], [], []
      

-> [e] > 0 and represents dba from block which is in trace
-> [d]-[b] = 71195 - 71183 = 12

-> convert [b] to hex: 71195 = 0x1161B
   so this value can be found in blockdump:

***
TRN TBL::

index  state cflags  wrap#    uel         scn            dba
------------------------------------------------------------------
...
0x4e    9    0x00  0x00d6  0xffff  0x0000.0001161b  0x00000000 
...
***

-> possible cause
so in this case the CURRENT SCN is LOWER than the SCN on this transaction
ie: The current SCN looks like it has decreased !!
This could happen if the database is opened with the 
_allow_resetlogs_corruption parameter

-> If some recovery steps have just been performed review these steps
   as the mismatch may be due to open resetlogs with 
   _allow_resetlogs_corruption enabled or similar. 
   See <Parameter:Allow_Resetlogs_corruption> for information on this 
   parameter.
------------------------------------------------------------------

~~~~~~~~~~
CASE (b)
~~~~~~~~~~
  blockdump looks like this:

***
buffer dba: 0x0100012f inc: 0x00000815 seq: 0x00000d48
       ver: 1 type: 6=trans data
 
Block header dump: dba: 0x0100012f
 Object id on Block? Y
 seg/obj: 0xe  csc: 0x00.5fed6  itc: 2  flg: O  typ: 1 - DATA
     fsl: 0  fnx: 0x0 
 
 Itl           Xid                  Uba         Flag  Lck        Scn/Fsc
0x01   0x0000.00b.0000036c  0x0100261c.0138.04  --U-    1  fsc 0x0000.0005fed7
0x02   0x0000.00a.0000037b  0x0100261d.0138.01  --U-    1  fsc 0x0000.0005fed4
 
data_block_dump
===============
...
***
      interpret:
      dba: 0x0100012f ->    8,10  ==>   16777519 (0x0100012f) =    1,303        
                                                               (0x1   0x12f)

***
SVRMGR> SELECT SEGMENT_NAME, SEGMENT_TYPE FROM DBA_EXTENTS
     2> WHERE FILE_ID = 1  AND 303 BETWEEN BLOCK_ID AND
     3> BLOCK_ID + BLOCKS - 1;
SEGMENT_NAME                                               SEGMENT_TYPE
---------------------------------------------------------- -----------------
UNDO$                                                      TABLE
1 row selected.      
***

-> current sql-statement (trace):
***
update undo$ set 
name=:2,file#=:3,block#=:4,status$=:5,user#=:6,
undosqn=:7,xactsqn=:8,scnbas=:9,scnwrp=:10,inst#=:11 where us#=:1

ksedmp: internal or fatal error
ORA-00600: internal error code, arguments: 
[2662], [0], [392916], [0], [392919], [0], [], []
***


-> e. = 0  info not available
-> d-b = 392919 - 392916 = 3
-> dba from blockdump matches the object from current sql statement
-> convert b. to hex:  = 0x5FED7
   so this value can be found in blockdump -> see ITL slot 0x01!


---------------------------------------------------------------------------
---------------------------------------------------------------------------
---------------------------------------------------------------------------

Some more internals:
~~~~~~~~~~~~~~~~~~~~

I will try to give another example in oder to answer question if current
SCN is decreased or dependant SCN increase.

hypothesis:
current SCN decreased

Evidence:
reproduced ORA-600 [2662] by aborting tx and using _allow_resetlog_corruption
while open resetlogs. check database SCN before!

Prerequisits: _allow_resetlogs_corruption = true in init<SID>.ora
shutdown/startup db

*** BEGIN TESTCASE

SVRMGR> drop table tx;
Statement processed.
SVRMGR> create table tx (scn# number);
Statement processed.
SVRMGR> insert into tx values( userenv('COMMITSCN') );
1 row processed.
SVRMGR> select * from tx;
SCN#
----------
    392942
1 row selected.

************ another session **************
SQL> connect scott/tiger
Connected.
SQL> update emp set sal=sal+1;
13 rows processed.
SQL>
-- no commit here
*******************************************

SVRMGR> insert into tx values( userenv('COMMITSCN') );
1 row processed.
SVRMGR> select * from tx;
SCN#
----------
    392942
    392943
2 rows selected.

-- so current SCN will be 392943

SVRMGR> shutdown abort
ORACLE instance shut down.

-- this breaks tx

SVRMGR> startup mount pfile=e:\jv734\initj734.ora
ORACLE instance started.
Total System Global Area      11018952 bytes
Fixed Size                       35760 bytes
Variable Size                  7698200 bytes
Database Buffers               3276800 bytes
Redo Buffers                      8192 bytes
Database mounted.

SVRMGR> recover database until cancel;
ORA-00279: Change 392925 generated at 10/26/99 17:13:03 needed for thread 1
ORA-00289: Suggestion : e:\jv734\arch\arch_2.arc
ORA-00280: Change 392925 for thread 1 is in sequence #2
 Specify log: {<RET>=suggested | filename | AUTO | CANCEL}

cancel
Media recovery cancelled.
SVRMGR> alter database open resetlogs;
alter database open resetlogs
*
ORA-00600: internal error code, arguments: 
[2662], [0], [392928], [0], [392931], [0], [], []

*** END TESTCASE

because we know current SCN before (392943) we see, that current SCN has
decreased 

after solving the problem with:
shutdown abort/startup -> works

SVRMGR> drop table tx;
Statement processed.
SVRMGR> create table tx (scn# number);
Statement processed.
SVRMGR> insert into tx values( userenv('COMMITSCN') );
1 row processed.
SVRMGR> select * from tx;
SCN#
----------
    392943
1 row selected. 

so we have exactly reached the current SCN from before 'shutdown abort' 
So current SCN was bumpt up from 392928 to 392942.


Note 3: Adjust SCN
------------------

Doc ID </help/usaeng/Search/search.html>: 	Note:28929.1	Content Type: 	TEXT/X-HTML	
Subject: 	ORA-600 [2662] "Block SCN is ahead of Current SCN"	Creation Date: 	21-OCT-1997	
Type: 	REFERENCE	Last Revision Date: 	15-OCT-2004	
Status: 	PUBLISHED		
<Internal_Only>

  This note contains information that was not reviewed by DDR.

  As such, the contents are not necessarily accurate and care should be
  taken when dealing with customers who have encountered this error.

  Thanks. PAA Internals Group

</Internal_Only>

Note: For additional ORA-600 related information please read Note 146580.1 </metalink/plsql/showdoc?db=NOT&id=146580.1>

PURPOSE:            
  This article discusses the internal error "ORA-600 [2662]", what 
  it means and possible actions. The information here is only applicable 
  to the versions listed and is provided only for guidance.
 
ERROR:              
  ORA-600 [2662] [a] [b] [c] [d] [e]
 
VERSIONS:
  versions 6.0 to 10.1
 
DESCRIPTION:

  A data block SCN is ahead of the current SCN.

  The ORA-600 [2662] occurs when an SCN is compared to the dependent SCN 
  stored in a UGA variable.

  If the SCN is less than the dependent SCN then we signal the ORA-600 [2662]
  internal error.

ARGUMENTS:
  Arg [a]  Current SCN WRAP
  Arg [b]  Current SCN BASE
  Arg [c]  dependent SCN WRAP
  Arg [d]  dependent SCN BASE 
  Arg [e]  Where present this is the DBA where the dependent SCN came from.
 
FUNCTIONALITY:      
  File and IO buffer management for redo logs
 
IMPACT:
  INSTANCE FAILURE
  POSSIBLE PHYSICAL CORRUPTION
 
SUGGESTIONS:        
     
  There are different situations where ORA-600 [2662] can be raised.

  It can be raised on startup or duing database operation.

  If not using Parallel Server, check that 2 instances have not mounted
  the same database.

  Check for SMON traces and have the alert.log and trace files ready
  to send to support.

  Check the SCN difference [argument d]-[argument b].

  If the SCNs in the error are very close, then try to shutdown and startup
  the instance several times. 

  In some situations, the SCN increment during startup may permit the 
  database to open. Keep track of the number of times you attempted a 
  startup.

  If the Known Issues section below does not help in terms of identifying
  a solution, please submit the trace files and alert.log to Oracle
  Support Services for further analysis.
 
  Known Issues:
  Bug# 2899477   See Note 2899477.8 </metalink/plsql/showdoc?db=NOT&id=2899477.8>
      Minimise risk of a false OERI[2662]
      Fixed: 9.2.0.5, 10.1.0.2
 
  Bug# 2764106   See Note 2764106.8 </metalink/plsql/showdoc?db=NOT&id=2764106.8>
      False OERI[2662] possible on SELECT which can crash the instance
      Fixed: 9.2.0.5, 10.1.0.2
 
  Bug# 2054025   See Note 2054025.8 </metalink/plsql/showdoc?db=NOT&id=2054025.8>
      OERI:2662 possible on new TEMPORARY index block
      Fixed: 9.0.1.3, 9.2.0.1
 
  Bug# 851959   See Note 851959.8 </metalink/plsql/showdoc?db=NOT&id=851959.8>
      OERI:2662 possible from distributed OPS select
      Fixed: 7.3.4.5
 
  Bug# 647927 P  See Note 647927.8 </metalink/plsql/showdoc?db=NOT&id=647927.8>
      Digital Unix ONLY: OERI:2662 could occur under heavy load
      Fixed: 8.0.4.2, 8.0.5.0
 

<Internal_Only>

 INTERNAL ONLY SECTION - NOT FOR PUBLICATION OR DISTRIBUTION TO CUSTOMERS
 ========================================================================

There were 2 forms of this error until 7.2.3:

 Type I:	 4/5 argument forms - 
  		 The SCN found on a block (dependent SCN) is ahead of the 
		 current SCN. See below for this

 Type II:        1 Argument (before 7.2.3 only):
                 Oracle is in the process of writing a block to a log file.
                 If the calculated block checksum is less than or equal to 1
                 (0 and 1 are reserved) ORA-600 [2662] is returned.
                 This is a problem generating an offline immediate log marker
                 (kcrfwg).
                 *NOT DOCUMENTED HERE*
         
Type I
~~~~~~
    a.  Current SCN WRAP
    b.  Current SCN BASE
    c.  dependent SCN WRAP
    d.  dependent SCN BASE
    e.  Where present this is the DBA where the dependent SCN came from.
        From kcrf.h:
         If the SCN comes from the recent or current SCN then a dba
         of zero is saved. If it comes from undo$ because the undo segment is
         not available then the undo segment number is saved, which looks like
         a block from file 0. If the SCN is for a media recovery redo (i.e.
         block number == 0 in change vector), then the dba is for block 0 
         of the relevant datafile. If it is from another database for a
         distributed transaction then dba is DBAINF(). If it comes from a TX  
         lock then the dba is really usn<<16+slot.

Type II
~~~~~~~
    a.  checksum -> log block checksum - zero if none (thread # in old format)  

---------------------------------------------------------------------------

Diagnosis:
~~~~~~~~~~      
  In addition to different basic types from above, there are different 
  situations where ORA-600 [2662] type I can be raised. 

Getting started:
~~~~~~~~~~~~~~~~
   (1) is the error raised during normal database operations (i.e. when the
       database is up) or during startup of the database?
   (2) what is the SCN difference [d]-[b] ( subtract argument 'b' from arg 'd')?
   (3) is there a fifth argument [e] ? 
       If so convert the dba to file# block# 
       Is it a data dictionary object? (file#=1)
       If so find out object name with the help of reference dictionary 
	from second database
   (4) What is the current SQL statement? (see trace)
       Which table is refered to?
       Does the table match the object you found in previous step?

   Be careful at this point: there may be no relationship between DBA in [e] 
   and the real source of problem (blockdump).


Deeper analysis:
~~~~~~~~~~~~~~~~
  (1) investigate trace file:
      this will be a user trace file normally but could be an smon trace too
  (2) search for: 'buffer' 
      ("buffer dba" in Oracle7 dumps, "buffer tsn" in Oracle8/Oracle9 dumps)
      this will bring you to a blockdump which usually represents the 
      'real' source of OERI:2662

      WARNING: There may be more than one buffer pinned to the process
               so ensure you check out all pinned buffers.

   -> does the blockdump match the dba from e.?
   -> what kind of blockdump is it?          
       (a) rollback segment header
       (b) datablock
       (c) other

   
Check list and possible causes
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

  If Parallel Server check both nodes are using the same lock manager
  instance & point at the same control files.

  Possible causes:

  (1) doing an open resetlogs with _ALLOW_RESETLOGS_CORRUPTION enabled    
  (2) a hardware problem, like a faulty controller, resulting in a failed 
      write to the control file or the redo logs     
  (3) restoring parts of the database from backup and not doing the 
      appropriate recovery     
  (4) restoring a control file and not doing a RECOVER DATABASE USING BACKUP 
      CONTROLFILE     
  (5) having _DISABLE_LOGGING set during crash recovery                      
  (6) problems with the DLM in a parallel server environment      
  (7) a bug      

  Solutions:

   (1) if the SCNs in the error are very close, attempting a startup several 
       times will bump up the dscn every time we open the database even if 
       open fails. The database will open when dscn=scn.
      
   (2)You can bump the SCN either on open or while the database is open 
      using <Event:ADJUST_SCN> (see Note 30681.1 </metalink/plsql/showdoc?db=NOT&id=30681.1>).
      Be aware that you should rebuild the database if you use this
      option.

   Once this has occurred you would normally want to rebuild the
   database via exp/rebuild/imp as there is no guarantee that some
   other blocks are not ahead of time.

Articles:
~~~~~~~~~
  Solutions:
   Note 30681.1 </metalink/plsql/showdoc?db=NOT&id=30681.1>   Details of the ADJUST_SCN Event
   Note 1070079.6 </metalink/plsql/showdoc?db=NOT&id=1070079.6> Alter System Checkpoint

  Possible Causes:
   Note 1021243.6 </metalink/plsql/showdoc?db=NOT&id=1021243.6> CHECK INIT.ORA SETTING _DISABLE_LOGGING
   Note 41399.1 </metalink/plsql/showdoc?db=NOT&id=41399.1>   Forcing the database open with `_ALLOW_RESETLOGS_CORRUPTION`
   Note 851959.9 </metalink/plsql/showdoc?db=NOT&id=851959.9>  OERI:2662 DURING CREATE SNAPSHOT AT MASTER SITE


Known Bugs:
~~~~~~~~~~~

Fixed In. Bug No.      Description
---------+------------+----------------------------------------------------
7.1.5     Bug 229873 </metalink/plsql/showdoc?db=Bug&id=229873>
7.1.3     Bug 195115 </metalink/plsql/showdoc?db=Bug&id=195115>   Miscalculation of SCN on startup for distributed TX ?
7.1.6.2.7 Bug 297197 </metalink/plsql/showdoc?db=Bug&id=297197>   Port specific Solaris OPS problem
7.3       Bug 336196 </metalink/plsql/showdoc?db=Bug&id=336196>   Port specific IBM SP AIX problem -> dlm issue    
7.3.4.5   Bug 851959 </metalink/plsql/showdoc?db=Bug&id=851959>   OERI:2662 possible from distributed OPS select
Not fixed Bug 2216823 </metalink/plsql/showdoc?db=Bug&id=2216823>  OERI:2662 reported when reusing tempfile with restored DB
8.1.7.4   Bug 2177050 </metalink/plsql/showdoc?db=Bug&id=2177050>  OERI:729 space leak possible (with tags "define var info"/"oactoid info")
                       can corrupt UGA and cause OERI:2662

---------------------------------------------------------------------------

Ensure that this note comes out on top in Metalink when searched
ora-600 ora-600 ora-600 ora-600 ora-600 ora-600 ora-600
ora-600 ora-600 ora-600 ora-600 ora-600 ora-600 ora-600
2662 2662 2662 2662 2662 2662 2662 2662 2662
2662 2662 2662 2662 2662 2662 2662 2662 2662


</Internal_Only


19.47: _allow_read_only_corruption
==================================


If you have a media failure and for some reason (such as having lost an archived log file) you cannot perform 
a complete recovery on some datafiles, then you might need this parameter. It is new for 8i. Previously there 
was only _allow_resetlogs_corruption which allowed you to do a RESETLOGS open of the database 
in such situations. Of course, a database forced open in this way would be in a crazy state 
because the current SCN would reflect the extent of the incomplete recovery, but some datafiles 
would have blocks in the future, which would lead to lots of nasty ORA-00600 errors 
(although there is an ADJUST_SCN event that could be used for relief). Once in this position, 
the only thing to do would be to do a full database export, rebuild the database, import and then assess the damage. 

The new _allow_read_only_corruption provides a much cleaner solution to the same problem. 
You should only use it if all other recovery options have been exhausted, and you cannot open 
the database read/write. Once again, the intent is to export, rebuild and import. Not pleasant, but sometimes 
better than going back to an older usable backup and performing incomplete recovery to a consistent state. 
Also, the read only open allows you to assess better which recovery option you want to take without committing 
you to either. 
 

19.48: _allow_resetlogs_corruption
==================================

log problem:

Try this approach to solve problems with redolog files:

1. create a backup of all datafiles, redolog files and controlfiles.
2.set next initialization parameter in init.ora

_allow_resetlogs_corruption = true

3.startup the database and try to open it
4. if the database can't be opened, then mount it and try to issue:

alter session set events '10015 trace name adjust_scn level 1';

#or if previous doesn't work increase the level to 2 

alter session set events '10015 trace name adjust_scn level 4096';

5. alter database open

You can try with recover database until cancel and then open iz with resetlogs option.

With this procedure I succesfully recovered from loosing my redolog files.

Using event 10015 you are forcing a SCN jump that will eventually syncronize the SCN values from your 
datafiles and controlfiles.
The level controls how much the SCN will be incremented with. In the case of a 9.0.1 I had, it worked 
only with 4096, however it may be that even a level of 1 to 3 would make the SCN jump 1 million.
So you have to dump those headers and compare the SCNs inside before and after the event 10015.
I was succeful too in opening a db after loosing controlfile and online redo logs  , 
however Oracle support made it pretty clear that the only usage for the database afterwards is to do a 
full export and recreate it from that. It would be better if Oracle support walks you through this procedure.


19.49: ORA-01503: CREATE CONTROLFILE failed
============================================

ORA-01503: CREATE CONTROLFILE failed
ORA-01161: database name PEGACC in file header does not match given name of
PEGSAV
ORA-01110: data file 1: '/u02/oradata/pegsav/system01.dbf'


Note 1:
=======

Problem:

You are attempting to recreate a controlfile with a 'createcontrolfile' 
script and the script fails with the following error when it tries to access 
one of the datafiles:

  ORA-1161, database name <name> in file header does not match given name

You are certain that the file is good and that it belongs to that database.


Solution:

Check the file's properties in Windows Explorer and verify that it is not 
a "Hidden" file.  


Explanation:

If you have set the "Show All Files' option under Explorer, View, Options,
you are able to see 'hidden' files that other users and/or applications 
cannot.  If any or all datafiles are marked as 'hidden' files, Oracle does 
not see them when it tries to recreate the controlfile.  

You must change the properties of the file by right-clicking on the file 
in Windows Explorer and then deselecting the check box marked "Hidden" under 
the General tab.  You should then be able to create the controlfile.


References:

Note 1084048.6 ORA-01503, ORA-01161: on Create Controlfile.


Note 2:
=======

This message may result, if the db_name in the init.ora does not match with the set "db_name" given 
while creating the controlfile. 

Also, remove any old controlfiles present in the specified directory. 

Thanks, 


Note 3:
=======

We ran into a similar problem when trying to create a new instance with datafiles from another database. 
The error comes in the create control file statement. Oracle uses REUSE as the default option when you do the 
alter database backup controlfile to trace. If you delete REUSE then the new database name you will change 
all the header information in all the database datafiles and you will be able to start up the instance. 
Hope this helps. 


Note 4:
=======

Try this command "CREATE CONTROLFILE SET DATABASE..." instead of "CREATE CONTROLFILE REUSE DATABASE..." 
I think it would be better. 


19.50. ORA-01031
================


Note 1:
-------

The 'OSDBA' and 'OSOPER' groups are chosen at installation time and usually    
both default to the group 'dba'. These groups are compiled into the 'oracle' executable and so are the same for 
all databases running from a given ORACLE_HOME directory. The actual groups being used for OSDBA and OSOPER 
can be checked thus:     
cd $ORACLE_HOME/rdbms/lib    
cat config.[cs]  
The line '#define SS_DBA_GRP "group"' should name the chosen OSDBA group. 
The line '#define SS_OPER_GRP "group"' should name the chosen OSOPER group. 

Note 2:
-------

 
Bookmark	Fixed font 	Go to End	 
  
Doc ID: 	Note:69642.1	Content Type: 	TEXT/PLAIN	   
Subject: 	UNIX: Checklist for Resolving Connect AS SYSDBA Issues	Creation Date: 	20-APR-1999	   
Type: 	TROUBLESHOOTING	Last Revision Date: 	31-DEC-2004	   
Status: 	PUBLISHED		 
Introduction: 
~~~~~~~~~~~~~ 
This bulletin lists the documented causes of getting  
 
   ---> prompted for a password when trying to CONNECT as SYSDBA  
   ---> errors such as ORA-01031, ORA-01034, ORA-06401, ORA-03113,ORA-09925, 
                       ORA-09817, ORA-12705, ORA-12547 
 
 
a) SQLNET.ORA Checks:  
---------------------  
1. The "sqlnet.ora" can be found in the following locations (listed by search order): 
      
   $TNS_ADMIN/sqlnet.ora  
   $HOME/sqlnet.ora  
   $ORACLE_HOME/network/admin/sqlnet.ora  
          
   Depending upon your operating system, it may also be located in:  
  
   /var/opt/oracle/sqlnet.ora 
   /etc/sqlnet.ora 
 
   A corrupted "sqlnet.ora" file, or one with security options set, will cause  
   a 'connect internal' request to prompt for a password. 
   To determine if this is the problem, locate the "sqlnet.ora" that is being used. 
   The one being used will be the first one found according to the search order 
   listed above. 
   Next, move the file so that it will not be found by this search: 
 
   % mv sqlnet.ora sqlnet.ora_save 
  
   Try to connect internal again. 
   If it still fails, search for other "sqlnet.ora" files according to the search order listed 
   above and repeat using the move command until you are sure there are no other 
   "sqlnet.ora" files being used. 
   If this does not resolve the issue, use the move command to put all the 
   "sqlnet.ora" files back where they were before you made the change:  
 
   % mv sqlnet.ora_save sqlnet.ora 
 
   If moving the "sqlnet.ora" resolves the issue, then verify the contents of the file: 
  
   a) SQLNET.AUTHENTICATION_SERVICES  
  
      If you are not using database links, comment this line out or try setting it to:  
  
      SQLNET.AUTHENTICATION_SERVICES = (BEQ,NONE)  
  
   b) SQLNET.CRYPTO_SEED  
  
      This should not be set in a "sqlnet.ora" file on UNIX. 
      If it is, comment the line out. (This setting is added to the "sqlnet.ora" 
      if it is built by one of Oracle's network cofiguration products shipped with client products) 
  
   c) AUTOMATIC_IPC  
  
      If this is set to "ON" it can force a "TWO_TASK" connection.  
      Try setting this to "OFF":  
     
      AUTOMATIC_IPC = OFF 
 
 
2. Set the permissions correctly in the "TNS_ADMIN" files. 
   The environment variable TNS_ADMIN defines the directory where the "sqlnet.ora", 
   "tnsnames.ora", and "listener.ora" files reside. 
   These files must contain the correct permissions, which are set when "root.sh" runs 
   during installation.  
   As root, run "root.sh" or edit the permissions on the "sqlnet.ora", "tnsnames.ora", 
   and "listener.ora" files by hand as follows: 
 
   $ cd $TNS_ADMIN 
   $ chmod 644 sqlnet.ora tnsnames.ora listener.ora 
   $ ls -l sqlnet.ora tnsnames.ora listener.ora 
 
   -rw-r--r--   1 oracle dba        1628 Jul 12 15:25 listener.ora 
   -rw-r--r--   1 oracle dba         586 Jun  1 12:07 sqlnet.ora 
   -rw-r--r--   1 oracle dba       82274 Jul 12 15:23 tnsnames.ora 
 
 
b) Software and Operating System Issues: 
---------------------------------------- 
1. Be sure $ORACLE_HOME is set to the correct directory and does not have any 
   typing mistakes: 
  
   % cd $ORACLE_HOME  
   % pwd  
    
   If this returns a location other than your "ORACLE_HOME" or is invalid, you 
   will need to reset the value of this environment variable:  
  
   sh or ksh: 
   ----------  
   $ ORACLE_HOME=<path_to_ORACLE_HOME>  
   $ export ORACLE_HOME  
  
   Example:  
   $ ORACLE_HOME=/u01/app/oracle/product/7.3.3  
   $ export ORACLE_HOME  
  
   csh:  
   ---- 
   % setenv ORACLE_HOME <path_to_ORACLE_HOME>  
  
   Example:  
   % setenv ORACLE_HOME /u01/app/oracle/product/7.3.3  
  
 
   If your "ORACLE_HOME" contains a link or the instance was started with the 
   "ORACLE_HOME" set to another value, the instance may try to start using the 
   memory location that another instance is using. 
   An example of this might be:  
  
   You have "ORACLE_HOME" set to "/u01/app/oracle/product/7.3.3" and start the 
   instance. 
   Then you do something like:  
  
   % ln -s /u01/app/oracle/product/7.3.3 /u01/app/oracle/7.3.3  
   % setenv ORACLE_HOME /u01/app/oracle/7.3.3  
   % svrmgrl  
 
   SVRMGR> connect internal 
 
   If this prompts for a password then most likely the combination of your 
   "ORACLE_HOME" and "ORACLE_SID" hash to the same shared memory address of 
   another running instance. Otherwise you may be able to connect internal 
   but you will receive an ORA-01034 "Oracle not available" error.  
  
   In most cases using a link as part of your "ORACLE_HOME" is fine as long as 
   you are consistent. 
   Oracle recommends that links not be used as part of the "ORACLE_HOME", but 
   their use is supported.  
            
2. Check that $ORACLE_SID is set to the correct SID, (including capitalization), 
   and does not have any typos: 
  
   % echo $ORACLE_SID                           
 
   Refer to Note:1048876.6 for more information. 
  
3. Ensure $TWO_TASK is not set. 
   To check if "TWO_TASK" is set, do the following: 
  
   sh, ksh or on HP/UX only csh:  
   ----------------------------- 
   env |grep -i two  
   - or - 
   echo $TWO_TASK 
  
   csh:  
   ---- 
   setenv |grep -i two   
  
  If any lines are returned such as:  
  
  TWO_TASK=  
  - or - 
  TWO_TASK=PROD 
 
  You will need to unset the environment variable "TWO_TASK":  
  
  sh or ksh:  
  ----------  
  unset TWO_TASK  
    
  csh:  
  ----  
  unsetenv TWO_TASK  
 
  Example : 
    
      $ TWO_TASK=V817 
      $ export TWO_TASK 
      $ sqlplus /nolog 
 
      SQL*Plus: Release 8.1.7.0.0 - Production on Fri Dec 31 10:12:25 2004 
      (c) Copyright 2000 Oracle Corporation.  All rights reserved. 
 
      SQL> conn / as sysdba 
      ERROR: 
      ORA-01031: insufficient privileges 
 
      $ unset TWO_TASK 
      $ sqlplus /nolog 
      SQL> conn / as sysdba 
      Connected. 
 
  If you are running Oracle release 8.0.4, and upon starting "svrmgrl" you 
  receive an ORA-06401 "NETCMN: invalid driver designator" error, you should 
  also unset two_task. 
  The login connect string may be getting its value from the TWO_TASK 
  environment variable if this is set for the user. 
  
4. Check the permissions on the Oracle executable:    
     
   % cd $ORACLE_HOME/bin  
   % ls -l oracle                 ('ls -n oracle' should work as well) 
  
   The permissions should be rwsr-s--x, or 6751.  
   If the permissions are incorrect, do the following as the "oracle" 
   software owner: 
  
   % chmod 6751 oracle  
    
   If you receive an ORA-03113 "end-of-file on communication" error followed 
   by a prompt for a password, then you may also need to check the ownership 
   and permissions on the dump directories. 
   These directories must belong to Oracle, group dba, (or the appropriates names 
   for your installation). 
   This error may occur while creating a database.  
 
   Permissions should be:  755 (drwxr-xr-x) 
 
   Also, the alert.log must not be greater than 2 Gigabytes in size. 
   When you start up "nomount" an Oracle pseudo process will try to write the 
   "alert.log" file in "udump". 
   When Oracle cannot do this (either because of permissions or because of the 
   "alert.log" being greater than 2 Gigabytes in size), it will issue the 
   ORA-03113 error. 
 
5. "osdba" group checks: 
 
   a. Make sure the operating system user issuing the CONNECT INTERNAL belongs  
      to the "osdba" group as defined in the "$ORACLE_HOME/rdbms/lib/config.s"  
      or "$ORACLE_HOME/rdbms/lib/config.c". Typically this is set to "dba". 
      To verify the operating system groups the user belongs to, do the following: 
  
      % id  
      uid=1030(oracle) gid=1030(dba)  
  
      The "gid" here is "dba" so the "config.s" or "config.c" may contain an 
      entry such as:  
  
       /* 0x0008         15 */         .ascii  "dba\0"  
  
      If these do not match, you either need to add the operating system user 
      to the group as it is seen in the "config" file, or modify the "config"  
      file and relink the "oracle" binary. 
    
      Refer to entry [NOTE:50507.1] section 3 for more details. 
    
   b. Be sure you are not logged in as the "root" user and that the environment 
      variables "USER", "USERNAME", and "LOGNAME" are not set to "root". 
      The "root" user is a special case and cannot connect to Oracle as the 
      "internal" user unless the effective group is changed to the "osdba" group, 
      which is typically "dba". 
      To do this, either modify the "/etc/password" file (not recommended) or  
      use the "newgrp" command: 
  
      # newgrp dba  
  
      "newgrp" always opens a new shell, so you cannot issue "newgrp" from 
      within a shell script. 
      Keep this in mind if you plan on executing scripts as the "root" user. 
 
   c. Verify that the "osdba" group is only listed once in the "/etc/group" file:  
  
      % grep dba /etc/group  
      dba::1010:  
      dba::1100:  
  
      If more than one line starting with the "osdba" group is returned, you  
      need to remove the ones that are not correct. 
      It is not possible to have more than one group use a group name.   
  
   d. Check that the oracle user uid and gid are matching with /etc/passwd and  
      /etc/group : 
 
      $ id 
      uid=500(oracle) gid=235(dba) 
    
      $ grep oracle /etc/passwd 
      oracle:x:500:235:oracle:/home/oracle:/bin/bash 
                   ^^^ 
      $ grep dba /etc/group 
      dba:x:253:oracle 
            ^^^  
      The mismatch also causes an ORA-1031 error. 
 
 
6. Verify that the file system is not mounted no set uid:  
  
   % mount  
   /u07 on /dev/md/dsk/d7 nosuid/read/write  
  
   If the filesytem is mounted "nosuid", as seen in this example, you will need 
   to unmount the filesystem and mount it without the "nosuid" option. 
   Consult your operating system documentation or your operating system vendor 
   for instruction on modifying mount options. 
     
7. Please read the following warning before you attempt to use the information 
   in this step: 
 
   ******************************************************************   
   *                                                                * 
   *  WARNING: If you remove segments that belong to a running      * 
   *           instance you will crash the instance, and this may   * 
   *           cause database corruption.                           * 
   *                                                                * 
   *           Please call Oracle Support Services for assistance   * 
   *           if you have any doubts about removing shared memory  * 
   *           segments.                                            * 
   *                                                                * 
   ****************************************************************** 
    
   If an instance crashed or was killed off using "kill" there may be shared  
   memory segments hanging around that belong to the down instance. 
   If there are no other instances running on the machine you can issue: 
  
   % ipcs -b  
 
         T         ID       KEY        MODE    OWNER      GROUP   SEGSZ  
      Shared Memory:  
         m          0   0x50000ffe --rw-r--r-- root       root         68  
         m       1601   0x0eedcdb8 --rw-r----- oracle      dba    4530176  
  
 
   In this case the "ID" of "1601" is owned by "oracle" and if there are no 
   other instances running in most cases this can safely be removed: 
  
   % ipcrm -m 1601  
  
   If your SGA is split into multiple segments you will have to remove all 
   segments associated with the instance. If there are other instances 
   running, and you are not sure which memory segments belong to the failed 
   instance, you can do the following: 
  
   a. Shut down all the instances on the machine and remove whatever shared 
      memory still exists that is owned by the software owner.  
   b. Reboot the machine.  
   c. If your Oracle software is release 7.3.3 or newer, you can connect into 
      each instance that is up and identify the shared memory owned by that 
      instance:  
  
      % svrmgrl  
      SVRMGR> connect internal  
      SVRMGR> oradebug ipc   
                   
      In Oracle8:  
      ----------- 
      Area #0 `Fixed Size', containing Subareas 0-0 
      Total size 000000000000b8c0, Minimum Subarea size 00000000 
      Subarea  Shmid             Size      Stable Addr 
            0   7205 000000000000c000         80000000    
  
 
      In Oracle7: 
      -----------  
  
       -------------- Shared memory --------------  
       Seg Id       Address   Size  
         2016       80000000  4308992  
        Total: # of segments = 1, size = 4308992  
  
     Note the "Shmid" for Oracle8 and "Seg Id" for Oracle7 for each running instance. 
     By process of elimination find the segments that do not belong to an 
     instance and remove them.  
                    
8.  If you are prompted for a password and then receive error ORA-09925 "unable 
    to create audit trail file" or error ORA-09817 "write to audit file failed", 
    along with "SVR4 Error: 28: No space left on device", do the following:  
 
    Check your "pfile". It is typically in the "$ORACLE_HOME/dbs" directory 
    and will be named "init<your_sid>.ora, where "<your_sid>" is the value of 
    "ORACLE_SID" in your environment. If the "init<your_sid>.ora" file has 
    the "ifile" parameter set, you will also have to check the included file 
    as well. You are looking for the parameter "audit_file_dest".  
  
    If "audit_file_dest" is set, change to that directory; otherwise change to 
    the "$ORACLE_HOME/rdbms/audit" directory, as this is the default location 
    for audit files. If the directory does not exist, create it. 
    Ensure that you have enough space to create the audit file. 
    The audit file is generally 600 bytes in size. 
    If it does exist, verify you can write to the directory: 
  
    % touch afile  
  
    If it could not create the called "afile", you need to change the permissions 
    on your audit directory:  
  
    % chmod 751   
 
9.  If connect internal prompts you for a password and then you receive an  
    ORA-12705 "invalid or unknown NLS parameter value specified" error, you 
    need to verify the settings for "ORA_NLS", "ORA_NLS32", "ORA_NLS33" or 
    "NLS_LANG". 
    You will need to consult your Installation and Configuration Guide for the 
    proper settings for these environment variables. 
 
10. If you have installed Oracle software and are trying to connect with 
    Server Manager to create or start the database, and receive a TNS-12571 
    "packet writer failure" error, please refer to Note:1064635.6 
 
11. If in SVRMGRL (Server Manager line mode), you are running the "startup.sql" 
    script and receive the following error:  
   
    ld:so.1: oracle_home/bin/svrmgrl fatal relocation error  
    symbol not found kgffiop                  
 
    RDBMS v7.3.2 is installed. 
    RDBMS v8.0.4 is a separate "oracle_home", and you are attempting to have 
    it coexist. 
    This is due to the wrong version of the client shared library "libclntsh.so.1" 
    being used at runtime. 
    Verify environment variable settings. 
 
    You need to ensure that "ORACLE_HOME" and "LD_LIBRARY_PATH" are set correctly. 
 
    For C-shell, type:  
 
    % setenv LD_LIBRARY_PATH $ORACLE_HOME/lib  
    % setenv ORACLE_HOME /u01/app/oracle/product/8.0.4  
 
    For Bourne or Korn shell, type:  
 
    $ LD_LIBRARY_PATH=$ORACLE_HOME/lib 
    $ export LD_LIBRARY_PATH  
    $ ORACLE_HOME=/u01/app/oracle/product/8.0.4 
    $ export ORACLE_HOME 
 
12. Ensure that the disk the instance resides on has not reached 100% capacity. 
 
    % df -k  
 
    If it has reached 100% capacity, this may be the cause of 'connect internal' 
    prompting for a password. 
    Additional disk space will need to be made available before 'connect internal' 
    will work.  
 
    For additional information refer to Note:97849.1 
 
13. Delete process.dat and regid.dat files in $ORACLE_HOME/otrace/admin directory. 
    Oracle Trace is enabled by default on 7.3.2 and 7.3.3 (depends on platform) 
    This can caused high disk space usage by these files and cause a number of 
    apparently mysterious side effects. 
    See Note:45482.1 for more details. 
 
14. When you get ora-1031 "Insufficient privileges" on connect internal after you 
    supply a valid password and you have multiple instances running from the same 
    ORACLE_HOME, be sure that if an instance has REMOTE_LOGIN_PASSWORDFILE set to 
    exclusive that the file $ORACLE_HOME/dbs/orapw<sid> does exist, otherwise it 
    defaults to the use of the file orapw that consequently causes access problems 
    for any other database that has the parameter set to shared. 
    Set the parameter REMOTE_LOGIN_PASSWORDFILE to shared for all instances that share 
    the common password file and create an exclusive orapw<sid> password files for any 
    instances that have this set to exclusive. 
 
15. Check permissions on /etc/passwd file (Unix only). 
    If Oracle cannot open the password file, connect internal fails with  
    ORA-1031, since Oracle is not able to verify if the user trying to connect  
    is indeed in the dba group. 
    Example: 
    -------- 
    # chmod 711 /etc/passwd 
    # ls -ltr passwd 
    -rwx--x--x   1 root     sys          901 Sep 21 14:26 passwd 
     
    $ sqlplus '/ as sysdba' 
  
    SQL*Plus: Release 9.2.0.1.0 - Production on Sat Sep 21 16:21:18 2002 
  
    Copyright (c) 1982, 2002, Oracle Corporation.  All rights reserved. 
  
    ERROR: 
    ORA-01031: insufficient privileges 
 
    Trussing sqlplus will show also the problem: 
 
    25338:  munmap(0xFF210000, 8192)                        = 0 
    25338:  lwp_mutex_wakeup(0xFF3E0778)                    = 0 
    25338:  lwp_mutex_lock(0xFF3E0778)                      = 0 
    25338:  time()                                          = 1032582594 
    25338:  open("/etc/passwd", O_RDONLY)                   Err#13 EACCES 
    25338:  getrlimit(RLIMIT_NOFILE, 0xFFBE8B28)            = 0 
 
 
c) Operating System Specific checks:  
------------------------------------ 
1. On OpenVMS, check that the privileges have been granted at the Operating System 
   level: 
        
   $ SET DEFAULT SYS$SYSTEM:   
   $ RUN AUTHORIZE   
     
   If the list returned by AUTHORIZE does not contain ORA_<SID>_DBA, or ORA_DBA, 
   then you do not have the correct OS privileges to issue a connect internal. 
   If ORA_<SID>_DBA was added AFTER ORA_DBA, then ORA_DBA needs to be removed 
   and granted again to be updated. 
   Please refer to Note:1010852.6 for more details. 
 
2. On Windows NT, check if DBA_AUTHORIZATION is set to BYPASS in the registry. 
 
3. On Windows NT, if you are able to connect internally but then startup fails 
   for some reason, successive connect internal attempts might prompt for a 
   password. You may also receive errors such as: 
 
   ORA-12705: invalid or unknown NLS parameter value specified 
   ORA-01012: not logged on 
   LCC-00161: Oracle error (possible syntax error) 
   ORA-01031: insufficient privileges 
 
   Refer to entry Note:1027964.6 for suggestions on how to resolve this problem 
 
4. If you are using Multi-Threaded Server (MTS), make sure you are using a dedicated 
   server connection. 
   A dedicated server connection is required to start up or shutdown the database. 
   Unless the database alias in the "TNSNAMES.ORA" file includes a parameter to make 
   a dedicated server connection, it will make a shared connection to a dispatcher. 
   See Note:1058680.6 for more details. 
 
5. On Solaris, if the file "/etc/.name_service_door" has incorrect permissions, 
   Oracle cannot read the file. You will receive a message that "The Oracle  
   user cannot access "/etc/.name_service_door" (permission denied). 
   This file is a flavor of IPC specific to Solaris which Oracle software is using 
   This can also cause connect internal problems. See entry Note:1066589.6 
  
6. You are on Digital Unix, running SVRMGRL (Server Manager line mode), and you 
   receive an ORA-12547 "TNS:lost contact" error and a password prompt. 
 
   This problem occurs when using Parallel Server and the True Cluster software together. 
   If Parallel Server is not linked in, svrmgrl works as expected. 
 
   Oracle V8.0.5 requires an Operating System patch which previous versions of 
   Oracle did not require. 
   The above patch allows svrmgrl to communicate with the TCR software. 
 
   You can determine if the patch is applied by running: 
   % nm /usr/ccs/lib/libssn.a | grep adjust 
 
   If this returns nothing, then you need to: 
 
   1. Obtain the patch for TCR 1.5 from Digital. 
      This patch is for the MC SCN and adds the symbol "adjustSequenceNumber" 
      to the library /usr/ccs/lib/libssn.a. 
   2.  Apply the patch. 
   3.  Relink Oracle 
 
   Another possibility is that you need to raise the value of kernel parameter 
 
   per-proc-stack-size 
 
   when increased from its default value of 2097152 to 83886080 resolved this problem. 
 
7. You are on version 6.2 of the Silicon Graphics UNIX (IRIX) operating system 
   and you have recently installed RDBMS release 8.0.3. 
   If you are logged on as "oracle/dba" and an attempt to log in to Server Manager 
   using "connect/internal" prompts you for a password, you should refer to entry 
   Note:1040607.6 
 
8. On AIX 4.3.3 after applying ML5 or higher you can not longer connect as internal 
   or if on 9.X '/ as sysdba' does not work as well. 
   This is a known AIX bug and it occurs on all RS6000 ports including SP2. 
   There is two workarounds and one solution. They are as follows: 
 
   1) Use mkpasswd command to remove the index. 
      This is valid until a new user is added to "/etc/passwd" or modified: 
 
      # mkpasswd -v -d 
 
   2) Touch the "/etc/passwd" file. 
      If the "/etc/passwd" file is newer than the index it will not use the 
      password file index: 
 
      # touch /etc/passwd 
 
   3) Obtain APAR IY22458 from IBM. 
      Any questions about this APAR should be directed to IBM.     
 
 
d) Additional Information: 
--------------------------  
1. In the "Oracle7 Administrator's Reference for UNIX", there is a note that states: 
           
   If REMOTE_OS_AUTHENT is set to true, users who are members of the  dba group 
   on the remote machine are able to connect as INTERNAL without a password. 
   However, if you are connecting remotely, that is connecting via anything 
   except the bequeath adapter, you will be prompted for a password regardless 
   of the value of "REMOTE_OS_AUTHENT". 
   Refer to bug 644988 
   
       
References:  
~~~~~~~~~~~ 
[NOTE:1048876.6]  UNIX: Connect internal prompts for password after install  
[NOTE:1064635.6]  ORA-12571: PACKET WRITER FAILURE WHEN STARTING SVRMGR 
[NOTE:1010852.6]  OPENVMS: ORA-01031: WHEN ISSUING "CONNECT INTERNAL" IN SQL*DBA OR SERVER MANAGER 
[NOTE:1027964.6]  LCC-00161 AND ORA-01031 ON STARTUP 
[NOTE:1058680.6]  ORA-00106 or ORA-01031 ERROR when trying to STARTUP or SHUTDOWN DATABASE 
[NOTE:1066589.6]  UNIX: Connect Internal asks for password when TWO_TASK is set  
[NOTE:1040607.6]  SGI: ORA-01012 ORA-01031: WHEN USING SRVMGR AFTER 8.0.3 INSTALL 
[NOTE:97849.1]    Connect internal Requires Password 
[NOTE:50507.1]    SYSDBA and SYSOPER Privileges in Oracle8 and Oracle7  
[NOTE:18089.1]    UNIX: Connect INTERNAL / AS SYSBDA Privilege on Oracle 7/8 
[BUG:644988]      REMOTE_OS_AUTHENT=TRUE: NOT ALLOWING USERS TO CONNECT INTERNAL WITHOUT PASSWORD 
 
  
Search Words: 
~~~~~~~~~~~~~  
svrmgrm sqldba sqlplus sqlnet 
remote_login_passwordfile 


Note 3:
-------

ORA-01031: insufficient privileges 
Cause: An attempt was made to change the current username or password without the appropriate privilege. 
This error also occurs if attempting to install a database without the necessary operating system privileges. 
Action: Ask the database administrator to perform the operation or grant the required privileges.

Note 4:
-------

ORA-01031: insufficient privileges 
 In most cases, the user receiving this error lacks a privilege to create an object (such as a table, view, 
procedure and the like). Grant the required privilege like so: 
grant create table to user_lacking_privilege;
Startup
If someone receives this error while trying to startup the instance, the logged on user must belong 
to the ora_dba group on Windows or dba group on Unix. 

Note 5:
-------

I am not sure it is the same, but I got this error today in windows when sql_authentication in sqlnet.ora was NONE.
Changing it to NTS solved the problem.


19.51 ORA-00600: internal error code, arguments: [17059]:
=========================================================

Note 1:
-------

Doc ID </help/usaeng/Search/search.html>: 	Note:138554.1	Content Type: 	TEXT/PLAIN	
Subject: 	ORA-600 [17059]	Creation Date: 	02-APR-2001	
Type: 	REFERENCE	Last Revision Date: 	09-DEC-2004	
Status: 	PUBLISHED		
Note: For additional ORA-600 related information please read [NOTE:146580.1] <ml2_documents.showDocument?p_id=146580.1&p_database_id=NOT> 
 
PURPOSE:             
  This article discusses the internal error "ORA-600 [17059]", what  
  it means and possible actions. The information here is only applicable  
  to the versions listed and is provided only for guidance. 
 
ERROR: 
  ORA-600 [17059] [a] 
 
VERSIONS: 
  versions 7.1 to 10.1 
 
DESCRIPTION: 
 
  While building a table to hold the list of child cursor dependencies  
  relating to a given parent cursor, we exceed the maximum possible size  
  of the table. 
 
ARGUMENTS: 
  Arg [a] Object containing the table 
 
FUNCTIONALITY: 
  Kernel Generic Library cache manager 
 
IMPACT: 
  PROCESS FAILURE 
  NON CORRUPTIVE - No underlying data corruption. 
 
SUGGESTIONS: 
 
  One symptom of this error is that the session will appear to hang for a  
  period of time prior to this error being reported. 
 
  If the Known Issues section below does not help in terms of identifying 
  a solution, please submit the trace files and alert.log to Oracle 
  Support Services for further analysis. 
 
  Issuing this SQL as SYS (SYSDBA) may help show any problem 
  objects in the dictionary: 
 
   select do.obj#, 
        po.obj# ,  
        p_timestamp, 
        po.stime , 
        decode(sign(po.stime-p_timestamp),0,'SAME','*DIFFER*') X 
    from sys.obj$ do, sys.dependency$ d,  sys.obj$ po 
   where P_OBJ#=po.obj#(+)  
     and D_OBJ#=do.obj# 
     and do.status=1 /*dependent is valid*/ 
     and po.status=1 /*parent is valid*/ 
     and po.stime!=p_timestamp /*parent timestamp not match*/ 
   order by 2,1 
   ; 
 
  Normally the above select would return no rows. If any rows are 
  returned the listed dependent objects may need recompiling. 
 
 
  Known Issues: 
 
  Bug# 3555003   See [NOTE:3555003.8] <ml2_documents.showDocument?p_id=3555003.8&p_database_id=NOT> 
      View compilation hangs / OERI:17059 after DBMS_APPLY_ADM.SET_DML_HANDLER 
      Fixed: 9.2.0.6 
  
  Bug# 2707304   See [NOTE:2707304.8] <ml2_documents.showDocument?p_id=2707304.8&p_database_id=NOT> 
      OERI:17059 / OERI:kqlupd2 / PLS-907 after adding partitions to Partitioned IOT 
      Fixed: 9.2.0.3, 10.1.0.2 
  
  Bug# 2636685   See [NOTE:2636685.8] <ml2_documents.showDocument?p_id=2636685.8&p_database_id=NOT> 
      Hang / OERI:[17059] after adding a list value to a partition 
      Fixed: 9.2.0.3, 10.1.0.2 
  
  Bug# 2626347   See [NOTE:2626347.8] <ml2_documents.showDocument?p_id=2626347.8&p_database_id=NOT> 
      OERI:17059 accessing view after ADD / SPLIT PARTITION 
      Fixed: 9.2.0.3, 10.1.0.2 
  
  Bug# 2306331   See [NOTE:2306331.8] <ml2_documents.showDocument?p_id=2306331.8&p_database_id=NOT> 
      Hang / OERI[17059] on view after SET_KEY or SET_DML_INVOKATION on base table 
      Fixed: 9.2.0.2 
  
  Bug# 1115424   See [NOTE:1115424.8] <ml2_documents.showDocument?p_id=1115424.8&p_database_id=NOT> 
      Cursor authorization and dependency lists too long - can impact shared pool / OERI:17059 
      Fixed: 8.0.6.2, 8.1.6.2, 8.1.7.0 
  
  Bug# 631335   See [NOTE:631335.8] <ml2_documents.showDocument?p_id=631335.8&p_database_id=NOT> 
      OERI:17059 from extensive re-user of a cursor 
      Fixed: 8.0.4.2, 8.0.5.0, 8.1.5.0 
  
  Bug# 558160   See [NOTE:558160.8] <ml2_documents.showDocument?p_id=558160.8&p_database_id=NOT> 
      OERI:17059 from granting privileges multiple times 
      Fixed: 8.0.3.2, 8.0.4.0, 8.1.5.0 
  
 
Note 2:
-------

Doc ID </help/usaeng/Search/search.html>: 	Note:234457.1	Content Type: 	TEXT/X-HTML	
Subject: 	ORA-600 [17059] Error When Compiling A Package	Creation Date: 	19-FEB-2003	
Type: 	PROBLEM	Last Revision Date: 	24-AUG-2004	
Status: 	PUBLISHED		


fact: 
fact: Oracle Server - Enterprise Edition

fact: Partitioned Tables / Indexes

symptom: ORA-600 [17059] Error When Compiling A Package

symptom: When Compiling a Package

symptom: The Package Accesses a Partitioned Table

symptom: ORA-00600: internal error code, arguments: [%s], [%s], [%s], [%s], 
[%s], [%s], [%s]

symptom: internal error code, arguments: [17059], [352251864]

symptom: Calling Location kglgob

symptom: Calling Location kgldpo

symptom: Calling Location kgldon

symptom: Calling Location pkldon

symptom: Calling Location pkloud

symptom: Calling Location - phnnrl_name_resolve_by_loading

cause: This is due to <bug:2073948 </metalink/plsql/showdoc?db=bug&id=2073948>> fixed in 10i, and occurs when accessing a 
partitioned table via a dblink within the package, where DDL (such as 
adding/dropping partitions) is performed on the table.


fix:

This is fixed in 9.0.1.4, 9.2.0.2 & 10i.   One-off patches are available 
for 8.1.7.4.  A workaround is to flush the shared pool.


Note 3:
-------


Doc ID </help/usaeng/Search/search.html>: 	Note:239796.1	Content Type: 	TEXT/PLAIN	
Subject: 	ORA-600 [17059] when querying dba_tablespaces, dba_indexes, dba_ind_partitions etc	Creation Date: 	28-MAY-2003	
Type: 	PROBLEM	Last Revision Date: 	13-AUG-2004	
Status: 	PUBLISHED		
Problem: 
~~~~~~~~ 
 
The information in this article applies to: 
 
Internal Error ORA-600 [17059] when querying Data dictionary views like dba_tablespaces, 
dba_indexes, dba_ind_partitions etc 
 
 
Symptom(s) 
~~~~~~~~~~ 
While querying Data dictionary views like dba_tablespaces, 
dba_indexes, dba_ind_partitions etc, getting internal error ORA-600 [17059] 
 
Change(s) 
~~~~~~~~~~ 
You probably altered some objects or executed some cat*.sql scripts. 
 
Cause 
~~~~~~~ 
Some SYS objects are INVALID. 
 
Fix 
~~~~ 
Connect SYS 
run $ORACLE_HOME/rdbms/admin/utlrp.sql and make sure all the objects are valid. 
 

19.52: ORA-00600: internal error code, arguments: [17003]
=========================================================

Note 1:
-------

The information in this article applies to: 
Oracle Forms - Version: 9.0.2.7 to 9.0.2.12
Oracle Server - Enterprise Edition - Version: 9.2
This problem can occur on any platform.

Errors
ORA 600 "internal error code, arguments: [%s],[%s],[%s], [%s], [%s],

Symptoms
The following error occurs when compiling a form or library ( fmb / pll ) against RDBMS 9.2

PL/SQL ERROR 0 at line 0, column 0
ORA-00600: internal error code, arguments: [17003], [0x11360BC], [275], [1], [], [], [], []

The error reproduces everytime.

Triggers / local program units in the form / library contain calls to stored 
database procedures and / or functions.

The error does not occur when compiling against RDBMS 9.0.1 or lower. 
Cause
This is a known bug / issue. The compilation error occurs when the form contains a call to a stored database 
function / procedure which has two DATE IN variables receiving DEFAULT values such as SYSDATE.
Reference:
<Bug:2713384> Abstract: INTERNAL ERROR [1401] WHEN COMPILE FUNCTION WITH 2 
DEFAULT DATE VARIABLES ON 9.2 
Fix
The bug is fixed in Oracle Forms 10g (9.0.4). There is no backport fix available for 
Forms 9i (9.0.2)

To work-around, modify the offending calls to the stored database procedure/ functions so that DEFAULT parameter values 
are not passed directly . 
For example, pass the DEFAULT value SYSDATE indirectly to the stored database procedure/ function by first 
assigning it to a local variable in the form. 


Note 2:
-------

Doc ID </help/usaeng/Search/search.html>: 	Note:138537.1	Content Type: 	TEXT/PLAIN	
Subject: 	ORA-600 [17003]	Creation Date: 	02-APR-2001	
Type: 	REFERENCE	Last Revision Date: 	15-OCT-2004	
Status: 	PUBLISHED		
Note: For additional ORA-600 related information please read [NOTE:146580.1] <ml2_documents.showDocument?p_id=146580.1&p_database_id=NOT> 
 
PURPOSE:             
  This article discusses the internal error "ORA-600 [17003]", what  
  it means and possible actions. The information here is only applicable  
  to the versions listed and is provided only for guidance. 
 
ERROR:               
  ORA-600 [17003] [a] [b] [c] 
  
VERSIONS: 
  versions 7.0 to 10.1 
 
DESCRIPTION:         
 
  The error indicates that we have tried to lock a library cache object by  
  using the dependency number to identify the target object and have found 
  that no such dependency exists. 
 
  Under this situation we will raise an ORA-600 [17003] if the dependency  
  number that we are using exceeds the number of entries in the dependency  
  table or the dependency entry is not marked as invalidated. 
 
ARGUMENTS: 
  Arg [a] Library Cache Object Handle 
  Arg [b] Dependency number 
  Arg [c] 1 or 2 (indicates where the error was raised internally) 
 
FUNCTIONALITY:       
  Kernel Generic Library cache manager 
  
IMPACT:              
  PROCESS MEMORY FAILURE 
  NO UNDERLYING DATA CORRUPTION. 
 
SUGGESTIONS:  
 
  A common condition where this error is seen is problematic upgrades. 
 
  If a patchset has recently been applied, please confirm that there were  
  no errors associated with this upgrade. 
 
  Specifically, there are some XDB related bugs which can lead to this error 
  being reported. 
 
  Known Issues: 
  Bug# 2611590   See [NOTE:2611590.8] <ml2_documents.showDocument?p_id=2611590.8&p_database_id=NOT> 
      OERI:[17003] running XDBRELOD.SQL 
      Fixed: 9.2.0.3, 10.1.0.2 
  
 
  Bug# 3073414 
      XDB may not work after applying a 9.2 patch set 
      Fixed: 9.2.0.5 
 

19.53: ORA-00600: internal error code, arguments: [qmxiUnpPacked2], [121], [], [], [], [], [], []
=================================================================================================


Note 1.
-------

 
Doc ID: 	Note:222876.1	Content Type: 	TEXT/PLAIN	   
Subject: 	ORA-600 [qmxiUnpPacked2]	Creation Date: 	09-DEC-2002	   
Type: 	REFERENCE	Last Revision Date: 	15-OCT-2004	   
Status: 	PUBLISHED		 
Note: For additional ORA-600 related information please read [NOTE:146580.1] 
 
PURPOSE: 
  This article discusses the internal error "ORA-600 [qmxiUnpPacked2]", what 
  it means and possible actions. The information here is only applicable 
  to the versions listed and is provided only for guidance. 
 
ERROR: 
  ORA-600 [qmxiUnpPacked2] [a] 
 
VERSIONS: 
  versions 9.2 to 10.1 
 
DESCRIPTION: 
 
  When unpickling an XOB or an array of XOBs an unexpected datatype was 
  found.  
 
  Generally due to XMLType data that has not been successfully upgraded from 
  a previous version. 
 
ARGUMENTS: 
  Arg [a] Type of XOB 
 
FUNCTIONALITY: 
  Qernel xMl support Xob to/from Image 
 
IMPACT: 
  PROCESS FAILURE 
  NON CORRUPTIVE - No underlying data corruption. 
 
SUGGESTIONS: 
 
  Please review the following article on Metalink : 
 
     [NOTE:235423.1] How to resolve ORA-600 [qmxiUnpPacked2] during upgrade 
 
  If you still encounter the error having tried the suggestions in the  
  above article, or the article isn't applicible to your environment then  
  ensure that the upgrade to current version was completed succesfully  
  without error.  
 
  If the Known Issues section below does not help in terms of identifying 
  a solution, please submit the trace files and alert.log to Oracle 
  Support Services for further analysis. 
 
  Known Issues: 
  Bug# 2607128   See [NOTE:2607128.8] 
      OERI:[qmxiUnpPacked2] if CATPATCH.SQL/XDBPATCH.SQL fails 
      Fixed: 9.2.0.3 
  
 
  Bug# 2734234 
      CONSOLIDATION BUG FOR ORA-600 [QMXIUNPPACKED2] DURING CATPATCH.SQL 9.2.0.2 
 

Note 2.
-------
   

Doc ID: 	Note:235423.1	Content Type: 	TEXT/X-HTML	   
Subject: 	How to resolve ORA-600 [qmxiUnpPacked2] during upgrade	Creation Date: 	14-APR-2003	   
Type: 	HOWTO	Last Revision Date: 	18-MAR-2005	   
Status: 	PUBLISHED		 

The information in this article applies to:

Oracle 9.2.0.2
Multiple Platforms, 64-bit


Symptom(s)
~~~~~~~~~~

ORA-600 [qmxiUnpPacked2] []


Cause
~~~~~

If the error is seen after applying 9.2.0.2 on a 9.2.0.1 database or if
using DBCA in 9.2.0.2 to create a new database (which is using the 9.2.0.1
seed database) then it is very likely that either shared_pool_size or
java_pool_size was too small when catpatch.sql was executed. 

Error is generally seen as 

ORA-600: internal error code, arguments: [qmxiUnpPacked2], [121]

There are 3 options to proceed from here:-


Fix
~~~~

  Option 1
  ========
 
  If your shared_pool_size and java_pool_size are less than 150Mb the do the
  following :-

  1/ Set your shared_pool_size and java_pool_size to 150Mb each. In some case 
     you may need to use larger pool sizes.

  2/ Get the xdbpatch.sql script from Note 237305.1

  3/ Copy xdbpatch.sql to $ORACLE_HOME/rdbms/admin/xdbpatch.sql having taken a
     backup of the original file first

  4/ Restart the instance with:

       startup migrate;

  5/ spool catpatch

       @?/rdbms/admin/catpatch.sql

  Option 2
  ========

  If you already have shared_pool_size and java_pool_size set at greater than
  150Mb then the problem may be caused by the shared memory allocated during 
  the JVM upgrade is not released properly. In which case do the following :-

  1/ Set your shared_pool_size and java_pool_size to 150Mb each. In some case 
     you may need to use larger pool sizes.

  2/ Get the xdbpatch.sql script from Note 237305.1

  3/ Edit the xdbpatch.sql script and add the following as the first line in 
     the script:-

       alter system flush shared_pool;

  3/ Copy xdbpatch.sql to $ORACLE_HOME/rdbms/admin/xdbpatch.sql having taken a
     backup of the original file first

  3/ Restart the instance with:

       startup migrate;

  4/ spool catpatch

       @?/rdbms/admin/catpatch.sql

  Option 3
  ========

  If XDB is NOT in use and there are NO registered XML Schemas an alternative 
  is to drop, and maybe re-install XDB :-

  1/ To drop the XDB subsystem connect as sys and run

       @?/rdbms/admin/catnoqm.sql

  2/ You can then run catpatch.sql to perform the upgrade

       startup migrate;
       
       @?/rdbms/admin/catpatch.sql

  3/ Once complete you may chose to re-install the XDB subsystem, if so 
     connect as sys and run catqm.sql

       @?/rdbms/admin/catqm.sql <XDB_PASSWD> <TABLESPACE> <TEMP_TABLESPACE>


  If the error is seen during normal database operation, ensure that upgrade 
  to current version was completed succesfully without error. Once this is 
  confirmed attempt to reproduce the error, if successful forward ALERT.LOG, 
  trace files and full error stack to Oracle Support Services for further 
  analysis. 


References
~~~~~~~~~~~

Bug 2734234   CONSOLIDATION BUG FOR ORA-600 [QMXIUNPPACKED2] DURING CATPATCH.SQL 9.2.0.2
Note 237305.1 Modified xdbpatch.sql


19.54 ORA-00600: internal error code, arguments: [kcbget_37], [1], [], [], [], [], [], []
=========================================================================================

ORA-00600: internal error code, arguments: [kcbso1_1], [], [], [], [], [], [], []
ORA-00600: internal error code, arguments: [kcbget_37], [1], [], [], [], [], [], []

Doc ID:  Note:2652771.8 
Subject:  Support Description of Bug 2652771 
Type:  PATCH 
Status:  PUBLISHED 
 Content Type:  TEXT/X-HTML 
Creation Date:  13-AUG-2003 
Last Revision Date:  14-AUG-2003 
 
 Click here for details of sections in this note.

Bug 2652771  AIX: OERI[1100] / OERI[KCBGET_37] SGA corruption
 This note gives a brief overview of bug 2652771. 

Affects:
Product (Component) Oracle Server (RDBMS) 
Range of versions believed to be affected Versions < 10G  
Versions confirmed as being affected 8.1.7.4 
9.2.0.2 
 
Platforms affected Aix 64bit 5L 
Aix 64bit 433 
 

Fixed:
This issue is fixed in 9.2.0.3 (Server Patch Set) 
 

Symptoms:
Memory Corruption 
Internal Error may occur (ORA-600) 
ORA-600 [1100] / ORA-600 [kcbget_37] 

  Known Issues:   Bug# 2652771 P  See [NOTE:2652771.8]       
AIX: OERI[1100] / OERI[KCBGET_37] SGA corruption       Fixed: 9.2.0.3    


19.55 ORA-00600: internal error code, arguments: [kcbzwb_4], [], [], [], [], [], [], []
=======================================================================================


Doc ID:  Note:4036717.8 
Subject:  Bug 4036717 - Truncate table in exception handler can causes OERI:kcbzwb_4 
Type:  PATCH 
Status:  PUBLISHED 
 Content Type:  TEXT/X-HTML 
Creation Date:  25-FEB-2005 
Last Revision Date:  09-MAR-2005 
 
 Click here for details of sections in this note.

Bug 4036717  Truncate table in exception handler can causes OERI:kcbzwb_4
 This note gives a brief overview of bug 4036717. 

Affects:
Product (Component) PL/SQL (Plsql) 
Range of versions believed to be affected Versions < 10.2  
Versions confirmed as being affected 10.1.0.3 
 
Platforms affected Generic (all / most platforms affected) 

Fixed:
This issue is fixed in 9.2.0.7 (Server Patch Set) 
10.1.0.4 (Server Patch Set) 
10g Release 2 (future version) 
 

Symptoms: Related To: 
Internal Error May Occur (ORA-600) 
ORA-600 [kcbzwb_4] 
 PL/SQL 
Truncate 
 

Description
Truncate table in exception handler can cause OERI:kcbzwb_4
with the fix for bug 3768052 installed.

Workaround: 
   Turn off or deinstall the fix for bug 3768052.  
   Note that the procedure containing the affected transactional commands 
   will have to be recompiled after backing out the bug fix.


Doc ID:  Note:4036717.8 
Subject:  Bug 4036717 - Truncate table in exception handler can causes OERI:kcbzwb_4 
Type:  PATCH 
Status:  PUBLISHED 
 Content Type:  TEXT/X-HTML 
Creation Date:  25-FEB-2005 
Last Revision Date:  09-MAR-2005 
 
 Click here for details of sections in this note.

Bug 4036717  Truncate table in exception handler can causes OERI:kcbzwb_4
 This note gives a brief overview of bug 4036717. 

Affects:
Product (Component) PL/SQL (Plsql) 
Range of versions believed to be affected Versions < 10.2  
Versions confirmed as being affected 10.1.0.3 
 
Platforms affected Generic (all / most platforms affected) 

Fixed:
This issue is fixed in 9.2.0.7 (Server Patch Set) 
10.1.0.4 (Server Patch Set) 
10g Release 2 (future version) 
 

Symptoms: Related To: 
Internal Error May Occur (ORA-600) 
ORA-600 [kcbzwb_4] 
 PL/SQL 
Truncate 
 

Description
Truncate table in exception handler can cause OERI:kcbzwb_4
with the fix for bug 3768052 installed.

Workaround: 
   Turn off or deinstall the fix for bug 3768052.  
   Note that the procedure containing the affected transactional commands 
   will have to be recompiled after backing out the bug fix.


19.56 ORA-00600: internal error code, arguments: [kcbgtcr_6], [], [], [], [], [], [], []
========================================================================================

Doc ID:  Note:248874.1 
Subject:  ORA-600 [kcbgtcr_6] 
Type:  REFERENCE 
Status:  PUBLISHED 
 Content Type:  TEXT/X-HTML 
Creation Date:  18-SEP-2003 
Last Revision Date:  25-MAR-2004 
 

<Internal_Only>

  This note contains information that has not yet been reviewed by DDR.

  As such, the contents are not necessarily accurate and care should be
  taken when dealing with customers who have encountered this error.

  Thanks. PAA Internals Group

</Internal_Only>

Note: For additional ORA-600 related information please read Note 146580.1

PURPOSE:
  This article discusses the internal error "ORA-600 [kcbgtcr_6]", what
  it means and possible actions. The information here is only applicable
  to the versions listed and is provided only for guidance.

ERROR:
  ORA-600 [kcbgtcr_6] [a]

VERSIONS:
  versions 8.0 to 10.1

DESCRIPTION:

  Two buffers have been found in the buffer cache that are both current
  and for the same DBA (Data Block Address).

  We should not have two 'current' buffers for the same DBA in the cache,
  if this is the case then this error is raised. 

ARGUMENTS:
  Arg [a] Buffer class

  Note that for Oracle release 9.2 and earlier there are no additional 
  arguments reported with this error.

FUNCTIONALITY:
  Kernel Cache Buffer management

IMPACT:
  PROCESS FAILURE
  POSSIBLE INSTANCE FAILURE
  NON CORRUPTIVE - No underlying data corruption.

SUGGESTIONS:

  Retry the operation.

  Does the error still occur after an instance bounce?

  If using 64bit AIX then ensure that minimum version in use is 9.2.0.3
  or patch for Bug 2652771 has been applied.

  If the Known Issues section below does not help in terms of identifying
  a solution, please submit the trace files and alert.log to Oracle
  Support Services for further analysis.

  Known Issues:
  Bug 2652771 Shared data structures corrupted around latch code on 64bit
              AIX ports.
  Fixed 9.2.0.3
  backports available for older versions (8.1.7) from Metalink.


<Internal_Only>

ORA-600 [kcbgtcr_6]
Versions: 8.0.5 - 10.1                                         Source: kcb.c


Meaning: 

  We have two 'CURRENT' buffers for the same DBA.

Argument Description:

  None

---------------------------------------------------------------------------  
Explanation:

  We have identified two 'CURRENT' buffers for the same DBA in the cache,
  this is incorrect, and this error will be raised. 
    
---------------------------------------------------------------------------
Diagnosis:

  Check the trace file, this will show the buffers i.e :-

  BH (0x70000003ffe9800) file#: 39 rdba: 0x09c131e6 (39/78310) class 1 ba: 
  0x70000003fcf0000
  set: 6 dbwrid: 0 obj: 11450 objn: 11450
  hash: [70000000efa9b00,70000004d53a870] lru: 
  [70000000efa9b68,700000006fb8d68]
  ckptq: [NULL] fileq: [NULL]
  st: XCURRENT md: NULL rsop: 0x0 tch: 1
  LRBA: [0x0.0.0] HSCN: [0xffff.ffffffff] HSUB: [255] RRBA: [0x0.0.0]

  BH (0x70000000efa9b00) file#: 39 rdba: 0x09c131e6 (39/78310) class 1 ba: 
  0x70000000e4f6000
  set: 6 dbwrid: 0 obj: 11450 objn: 11450
  hash: [70000004d53a870,70000003ffe9800] lru: 
  [700000012fbaf68,70000003ffe9868]
  ckptq: [NULL] fileq: [NULL]
  st: XCURRENT md: NULL rsop: 0x0 tch: 2
  LRBA: [0x0.0.0] HSCN: [0xffff.ffffffff] HSUB: [255] RRBA: [0x0.0.0]

  Here it is clear that we have two current buffers for the dba.

  Most likely cause for this is 64bit AIX Bug 2652771.

  If this isn't the case check the error reproduces consistently after 
  bouncing the instance?

  Via SQLplus? What level of concurrency to reproduce? Is a testcase 
  available?

  Check OS memory for errors.
  
---------------------------------------------------------------------------
Known Bugs:    

  Bug 2652771 Shared data structures corrupted around latch code on 64bit
              AIX ports.
              - Fixed 9.2.0.3, backports available for older versions.


19.57 ORA-00600: internal error code, arguments: [1100], [0x7000002FDF83F40], [0x7000002FDF83F40], [], [], [], [], []
=====================================================================================================================


Doc ID:  Note:138123.1 
Subject:  ORA-600 [1100] 
Type:  REFERENCE 
Status:  PUBLISHED 
 Content Type:  TEXT/X-HTML 
Creation Date:  28-MAR-2001 
Last Revision Date:  08-FEB-2005 
 

Note: For additional ORA-600 related information please read Note 146580.1
 
PURPOSE:            
  This article discusses the internal error "ORA-600 [1100]", what 
  it means and possible actions. The information here is only applicable 
  to the versions listed and is provided only for guidance.

ERROR:               
  ORA-600 [1100] [a] [b] [c] [d] [e]
 
VERSIONS:
  versions 6.0 to 9.2
 
DESCRIPTION:

  This error relates to the management of standard double-linked (forward 
  and backward) lists. 

  Generally, if the list is damaged an attempt to repair the links is 
  performed.

  Additional information will accompany this internal error. A dump of the 
  link and often a core dump will coincide with this error. 

  This is a problem with a linked list structure in memory.
 
FUNCTIONALITY:      
  GENERIC LINKED LISTS
 
IMPACT:             
  PROCESS FAILURE
  POSSIBLE INSTANCE FAILURE IF DETECTED BY PMON PROCESS
  No underlying data corruption.
 
SUGGESTIONS:

  Known Issues:

  Bug# 3724548   See Note 3724548.8
      OERI[kglhdunp2_2] / OERI[1100] under high load
      Fixed: 9.2.0.6, 10.1.0.4, 10.2
 
  Bug# 3691672 +  See Note 3691672.8
      OERI[17067]/ OERI[26599] / dump (kgllkdl) from JavaVM / OERI:1100 from PMON
      Fixed: 10.1.0.4, 10.2
 
  Bug# 2652771 P  See Note 2652771.8
      AIX: OERI[1100] / OERI[KCBGET_37] SGA corruption
      Fixed: 9.2.0.3
 
  Bug# 1951929   See Note 1951929.8
      ORA-7445 in KQRGCU/kqrpfr/kqrpre possible
      Fixed: 8.1.7.3, 9.0.1.2, 9.2.0.1
 
  Bug# 959593   See Note 959593.8
      CTRL-C During a truncate crashes the instance
      Fixed: 8.1.6.3, 8.1.7.0
 

<Internal_Only>

INTERNAL ONLY SECTION - NOT FOR PUBLICATION OR DISTRIBUTION TO CUSTOMERS

No internal information at the present time.


Ensure that this note comes out on top in Metalink when searched              
ora-600 ora-600 ora-600 ora-600 ora-600 ora-600 ora-600                       
ora-600 ora-600 ora-600 ora-600 ora-600 ora-600 ora-600                       
1100 1100 1100 1100 1100 1100 1100 1100 1100 1100                                       
1100 1100 1100 1100 1100 1100 1100 1100 1100 1100               

</Internal_Only>


Note 2:
-------

Doc ID:  Note:3724548.8 
Subject:  Bug 3724548 - OERI[kglhdunp2_2] / OERI[1100] under high load 
Type:  PATCH 
Status:  PUBLISHED 
 Content Type:  TEXT/X-HTML 
Creation Date:  24-SEP-2004 
Last Revision Date:  13-JAN-2005 
 
 Click here for details of sections in this note.

Bug 3724548  OERI[kglhdunp2_2] / OERI[1100] under high load
 This note gives a brief overview of bug 3724548. 

Affects:
Product (Component) Oracle Server (Rdbms) 
Range of versions believed to be affected Versions < 10.2  
Versions confirmed as being affected 9.2.0.4 
9.2.0.5 
 
Platforms affected Generic (all / most platforms affected) 

Fixed:
This issue is fixed in 9.2.0.6 (Server Patch Set) 
10.1.0.4 (Server Patch Set) 
10g Release 2 (future version) 
 

Symptoms: Related To: 
Memory Corruption 
Internal Error May Occur (ORA-600) 
ORA-600 [kglhdunp2_2] 
ORA-600 [1100] 
 (None Specified) 
 

Description
When an instance is under high load it is possible for sessions to get 
ORA-600[KGLHDUNP2_2] and ORA-600 [1100] errors. This can also show
as a corrupt linked list in the SGA.


The full bug text (if published) can be seen at <Bug:3724548> (This link will not work for UNPUBLISHED bugs)
You can search for any interim patches for this bug here <Patch:3724548> (This link will Error if no interim patches exist)


19.58 Compilation problems DBI DBD:
===================================


We upgraded Oracle from 8.1.6 to 9.2.0.5 and I tried to rebuild the 
DBD::Oracle module but it threw errors like:

.
gcc: unrecognized option `-q64'
ld: 0711-736 ERROR: Input file /lib/crt0_64.o:
        XCOFF64 object files are not allowed in 32-bit mode.
collect2: ld returned 8 exit status
make: 1254-004 The error code from the last command is 1.
Stop.

After some digging I found out that this is because the machine is AIX 5.2 
running under 32-bit and it is looking at the oracle's lib directory which 
has 64 bit libraries. So after running "perl Makefile.PL", I edited the 
Makefile
1. changing the references to Oracle's ../lib to ../lib32, 
2. changing change crt0_64.o to crt0_r.o. 
3. Remove the -q32 and/or -q64 options from the list of libraries to link 
with.

Now when I ran "make" it went smoothly, so did make test and make install. 
I ran my own simple perl testfile which connects to the Oracle and gets 
some info and it works fine. 

Now I have an application which can be customised to call perl scripts and 
when I call this test script from that application it fails with:

install_driver(Oracle) failed: Can't load 
'/usr/local/perl/lib/site_perl/5.8.5/a
ix/auto/DBD/Oracle/Oracle.so' for module DBD::Oracle:   0509-022 Cannot 
load mod
ule /usr/local/perl/lib/site_perl/5.8.5/aix/auto/DBD/Oracle/Oracle.so.
        0509-150   Dependent module 
/u00/oracle/product/9.2.0/lib/libclntsh.a(sh
r.o) could not be loaded.
        0509-103   The module has an invalid magic number.
        0509-022 Cannot load module 
/u00/oracle/product/9.2.0/lib/libclntsh.a.
        0509-150   Dependent module 
/u00/oracle/product/9.2.0/lib/libclntsh.a co
uld not be loaded. at /usr/local/perl/lib/5.8.5/aix/DynaLoader.pm line 
230.
 at (eval 3) line 3
Compilation failed in require at (eval 3) line 3.
Perhaps a required shared library or dll isn't installed where expected
 at /opt/dscmdevc/src/udps/test_oracle_dbd.pl line 45

whats happening here is that the application sets its own LIBPATH to 
include oracle's lib(instead of lib32) in the beginning and that makes 
perl look at the wrong place for the file - libclntsh.a .Unfortunately it 
will take too long for the application developers to change this in their 
application and I am looking for a quick solution. The test script is 
something like:

        use Env;
        use strict;
        use lib qw( /opt/harvest/common/perl/lib ) ;
        #use lib qw( $ORACLE_HOME/lib32 ) ;
        use DBI;
        my $connect_string="dbi:Oracle:";
        my $datasource="d1ach2";
        $ENV{'LIBPATH'} = "${ORACLE_HOME}/lib32:$ENV{'LIBPATH'}" ;
        .
        .
        my $dbh = DBI->connect($connect_string, $dbuser, $dbpwd)
                or die "Can't connect to $datasource: $DBI::errstr";
        .
        .

Adding 'use lib' or using'$ENV{LIBPATH}' to change the LIBPATH is not 
working because I need to make this work in this perl script and the "use 
DBI" is run (or whatever the term is) in the compile-phase before the 
LIBPATH is set in the run-phase.

I have a work around for it: write a wrapper ksh script which exports the 
LIBPATH and then calls the perl script which works fine but I was 
wondering if there is a way to set the libpath or do something else inside 
the current perl script so that it knows where to look for the right 
library files inspite of the wrong LIBPATH? 

Or did I miss something when I changed the Makefile and did not install 
everything right? Is there anyway I check this? (the make install didnot 
throw any errors) 

Any help or thoughts on this would be much appreciated.

Thanks!
Rachana.


note 12:
--------

P550:/ # find . -name "libclnt*" -print
./apps/oracle/product/9.2/lib/libclntst9.a
./apps/oracle/product/9.2/lib/libclntsh.a
./apps/oracle/product/9.2/lib32/libclntst9.a
./apps/oracle/product/9.2/lib32/libclntsh.a
./apps/oracle/oui/bin/aix/libclntsh.so.9.0
P550:/ #


19.59 Listener problem: IBM/AIX RISC System/6000 Error: 13: Permission denied
-----------------------------------------------------------------------------

When starting listener
start listener

TNS-12546: TNS:permission denied
 TNS-12560: TNS:protocol adapter error
  TNS-00516: Permission denied
   IBM/AIX RISC System/6000 Error: 13: Permission denied


Note 1:

'TNS-12531: TNS:cannot allocate memory' may be misleading, it seems to be a permission problem 
(see also IBM/AIX RISC System/6000 Error: 13: Permission denied). A possible reason is:
Oracle (more specific the listener) is unable to read /etc/hosts, because of permission problems. 
So host resolution is not possible. 

..
..
The problem really was in permissions of /etc/hosts on the node2. It was -rw-r----- (640).
Now it is -rw-rw-r-- (664) and everything goes ok.
Thank you! 


BUGS WITH REGARDS TO PRO*COBOL ON 9i:


19.60: 64BIT PRO*COBOL IS NOT THERE EVNN AFTER UPGRDING TO 9.2.0.3 ON AIX-5L BOX 
--------------------------------------------------------------------------------

 
Bookmark	Fixed font 	Go to End	Monitor Bug	 
  
Bug No.	2859282	   
Filed	19-MAR-2003	Updated	01-NOV-2003	   
Product	Precompilers	Product Version 	9.2.0.3	   
Platform	AIX5L Based Systems (64-bit)	Platform Version	5.*	   
Database Version	9.2.0.3	Affects Platforms 	Port-Specific	   
Severity 	Severe Loss of Service	Status	Closed, Duplicate Bug	   
Base Bug	2440385	Fixed in Product Version	No Data	 

Problem statement:

64BIT PRO*COBOL IS NOT THERE EVNN AFTER UPGRDING TO 9.2.0.3 ON AIX-5L BOX 

 
*** 03/19/03 10:13 am *** 
2889686.996 
. 
=========================     
PROBLEM: 
. 
 1. Clear description of the problem encountered: 
. 
cst. has upgraded from 9.2.0.2 to 9.2.0.3 on a AIX 5L 64-Bit Box and is not  
seeing the 64-bit Procob executable. Actually the same problem existed when  
upgraded from 9.2.0.1 to 9.2.0.2, but the one-off patch has been provided in  
the Bug#2440385 to resolve the issue. As per the Bug, problem has been fixed  
in 9.2.0.3. But My Cst. is facing the same problem on 9.2.0.3 also. 
. 
This is what the Cst. says 
============================ 
This is the original bug # 2440385.  The fix provides 64 bit versions of  
Pro*Cobol.There are two versions of the patch for the bug: one is for the  
9.2.0.1 RDBMS and the other  is for 9.2.0.2.  So the  last time I hit this  
issue, I applied the 9.2.0.2 RDBMS patch to the 9.2.0.1  install.  The 9.2.0.2  
patch also experienced the relinking problem on rtsora just like the 9.2.0.1  
install did.  I ignored the  error to complete the patch application.  Then I  
used the patch for the 2440385 bug to get 64 bit procob/rtsora executables  
(the patch actually provides executables rather than performing a successful  
relinking) to get the Pro*Cobol 1.8.77 precompiler to work with the  
MicroFocus Server Express 2.0.11 (64 bit) without encountering "bad magic  
number" error. 
. 
The current install that I am performing I've downloaded the Oracle 9.2.0.3   
Pro*Cobol capability fix either so the rtsora relinking fails as well.  Thus I  
don't have a working Pro*Cobol precompiler to allow me to generate our Cobol  
programs against the database. 
. 
 2. Pertinent configuration information (MTS/OPS/distributed/etc) 
. 
 3. Indication of the frequency and predictability of the problem   
. 
 4. Sequence of events leading to the problem   
. 
 5. Technical impact on the customer. Include persistent after effects. 
. 
=========================     
DIAGNOSTIC ANALYSIS: 
. 
One-off patch should be provided on top of 9.2.0.3 as provided on top of 
9.2.0.2/9.2.0.1 
. 
=========================    
WORKAROUND: 
. 
. 
=========================    
RELATED BUGS: 
. 
2440385 
. 
=========================    
REPRODUCIBILITY: 
. 
 1. State if the problem is reproducible; indicate where and predictability 
. 
 2. List the versions in which the problem has reproduced 
. 
    9.2.0.3 
. 
 3. List any versions in which the problem has not reproduced 


Further notes on PRO*COBOL:
===========================


Note 1:
=======
		9201,9202,9203,9204,9205
32 bit cobol: 	procob32 or procob18_32.
64 bit cobol: 	procob or procob18


PATCHES:

1. Patch 2663624: (Cobol patch for 9202 AIX 5L)
-----------------------------------------------

PSE FOR BUG2440385 ON 9.2.0.2 FOR AIX5L PORT 212
Patchset Exception: 2663624 / Base Bug 2440385
#-------------------------------------------------------------------------
#
#  DATE: November 26, 2002
#  -----------------------
#  Platform Patch for : AIX Based Systems (Oracle 64bit) for 5L
#  Product Version #  : 9.2.0.2
#  Product Patched    : RDBMS
#
#  Bugs Fixed by this patch:
#  -------------------------
#  2440385 : PLEASE PROVIDE THE PATCH FOR SUPPORTING 64BIT PRO*COBOL
#
#  Patch Installation Instructions:
#  --------------------------------
#  To apply the patch, unzip the PSE container file;
#
#    % unzip p2440385_9202_AIX64-5L.zip
#
#  Set your current directory to the directory where the patch
#  is located:
#
#    % cd 2663624
#
#  Ensure that the directory containing the opatch script appears in
#  your $PATH; then enter the following command:
#
#    % opatch apply


2. Patch 2440385:
-----------------

Results for Platform : AIX5L Based Systems (64-bit)  
 
 
Patch  Description  Release  Updated  Size        
2440385 Pro*COBOL:  PATCH FOR SUPPORTING 64BIT PRO*COBOL 9.2.0.3 27-APR-2003 34M   
2440385 Pro*COBOL:  PATCH FOR SUPPORTING 64BIT PRO*COBOL 9.2.0.2 26-NOV-2002 17M   
2440385 Pro*COBOL:  PATCH FOR SUPPORTING 64BIT PRO*COBOL 9.2.0.1 01-OCT-2002 17M 
 
 
3. Patch 3501955 9205:
----------------------

Also includes 2440385.  Provide the patch for supporting 64-bit Pro*COBOL.  


Note 2:
=======

Problem precompiling Cobol program under Oracle 9i......

Hi, we recently upgraded to 9i. However, we still have 32 bit Cobol, so we're using the procob18_32 precompiler 
to compile our programs. Some of my compiles have worked successfully. However, I'm receiving the follow error 
in one of my compiles: 

1834 183400 01 IB0-STATUS PIC 9. 7SA 
350 
1834 ...................................^ 
PCC-S-0018: Expected "PICTURE clause", but found "9" at line 1834 in file 

What's strange is that if I compile the program against the same DB using procob instead of procob18_32, 
it compiles cleanly. I noticed in my compile that failed using procob18_32, it had the following message: 

System default option values taken from: /u01/app/oracle/product/9.2.0.4/precomp 
/admin/pcccob.cfg 


Yet, when I used procob, it had this message: 

System default option values taken from: /u01/app/oracle/product/9.2.0.4/precomp 
/admin/pcbcfg.cfg 

..
..

Hi, I started using procob32 instead of procob18_32, and that resolved my problem. 
Thanks for any help you may have already started to provide. 


Note 3:
=======

 
Doc ID: 	Note:257934.1	Content Type: 	TEXT/X-HTML	   
Subject: 	Pro*COBOL Application Fails in Runtime When Using Customized old Make Files With Signal 11 (MF Errror 114)	Creation Date: 	20-NOV-2003	   
Type: 	PROBLEM	Last Revision Date: 	04-APR-2005	   
Status: 	MODERATED		 
The information in this article applies to: 
Precompilers - Version: 9.2.0.4
This problem can occur on any platform.
Symptoms
After upgrading from Oracle server and Pro*COBOL 9.2.0.3.0 to 9.2.0.4.0 
application are failing with cobol runtime error 114 when using 32-bit builds. 
Platform is AIX 4.3.3 which does not support 64-bit builds with Micro Focus 
Server Express 2.0.11. 

Execution error : file 'sample1' 
error code: 114, pc=0, call=1, seg=0 
114 Attempt to access item beyond bounds of memory (Signal 11) 
Changes
Upgraded from 9.2.0.3.0 to 9.2.0.4.0. 
Cause
The customized old make files for building 32-bit applications invoked the 64-bit 
precompilers procob or procob18 instead of procob32 or procob18_32. 
Fix
Use the Oracle Supplied make templates or change the customized old make files for 32-bit application builds 
$ORACLE_HOME/precomp/demo/procob2/demo_procob_32.mk, 
$ORACLE_HOME/precomp/demo/procob/demo_procob_32.mk and 
$ORACLE_HOME/precomp/demo/procob/demo_procob18_32.mk 
invoke the wrong precompiler. 

To fix the problem add the following to 
$ORACLE_HOME/precomp/demo/procob2/demo_procob_32.mk: 

PROCOB=procob32 

Using $ORACLE_HOME/precomp/demo/procob/demo_procob_32.mk: 

PROCOB_32=procob32 

Using $ORACLE_HOME/precomp/demo/procob/demo_procob18_32.mk 

PROCOB18_32=procob18_32 

The change can be added to the bottom of the make file. 
References
Bug 3220095 - Procobol App Fails114 Attempt To Access Item Beyond Bounds Of Memory (Signal 11)


Note 4:
=======


Displayed below are the messages of the selected thread. 
Thread Status: Closed 
From: Jean-Daniel DUMAS 23-Nov-04 16:39 
Subject: PROCOB18_32 Problem at execution ORA-00933 


PROCOB18_32 Problem at execution ORA-00933

We try to migrate from Oracle 8.1.7.4 to Oracle 9.2.0.5. 
We've got problems with a lot of procobol programs using host table variables in PL SQL blocks like: 

EXEC SQL EXECUTE 
BEGIN 
FOR nIndice IN 1..:WI-NB-APPELS-TFO009S LOOP 
UPDATE tmp_edition_erreur 
SET mon_nb_dec = :WTI-S2-MON-NB-DEC (nIndice) 
WHERE mon_cod = :WTC-S2-MON-COD (nIndice) 
AND run_id = :WC-O-RUN-ID; 
END LOOP; 
END; 
END-EXEC 

At execution, we've got "ORA-00933 SQL command not properly ended". 
The problem seems to appear only if the host table variable is used inside a SELECT,UPDATE or DELETE command. 
For the INSERT VALUES command, it seems that we've got no problem. 

A workaround consists to assign host table variables into oracle table variables and replace inside SQL command host table 
variables by oracle table variables. 
But, as we've got a lot a program like this, we don't enjoy to do this. 
Have somebody another idea ? 

jddumas@eram.fr 


From: Oracle, Amit Joshi 05-Jan-05 06:26 
Subject: Re : PROCOB18_32 Problem at execution ORA-00933 

Hi 

Please refer to bug 3802067 on Metalink. 

From the details provided , it seems you are hitting the same. 

Best Regards 
Amit Joshi 


Note 5:
=======

Re: Server Express 64bit and Oracle 9i problem (114) on AIX 5.2
Hi Wayne (and Panos)

Apologies if you're aware of some of this already, but I just wanted to
clarify the steps involved in creating and executing a Pro*COBOL application
with Micro Focus Server Express on UNIX.

When installing Pro*COBOL on UNIX (as part of the main Oracle installation),
you need to have your COBOL environment setup, in order for the installer to
relink a COBOL RTS containing the Oracle support libraries
(rtsora/rtsora32/rtsora64).

The 64-bit edition of Oracle 9i on AIX 5.x creates rtsora -- the 64-bit
version of the run-time -- and rtsora32 -- the 32-bit version of the
run-time.

It's imperative that you use the correct edition of Server Express, i.e.
32-bit or 64-bit -- note well, that these are separate products on this
platform -- for the mode in which you wish to use Oracle. In addition, you
need to ensure that LIBPATH is set to point to the correct Oracle 'lib'
directory -- $ORACLE_HOME/lib32 for 32-bit, or $ORACLE_HOME/lib for 64-bit

If you wish to recreate those executables, say if you've updated your COBOL
environment since installing Oracle, then from looking at the makefiles --
ins_precomp.mk and env_precomp.mk -- then the effective commands to use to
re-link the run-time correctly are as follows (logged in under your Oracle
user ID) :

either mode:
<set up COBDIR, ORACLE_HOME, ORACLE_BASE, ORACLE_SID as appropriate for your
installation>
export PATH=$COBDIR/bin:$ORACLE_HOME/bin:$PATH

32-bit :
export LIBPATH=$COBDIR/lib:$ORACLE_HOME/lib32:$LIBPATH
cd $ORACLE_HOME/precomp/lib
make LIBDIR=lib32 -f ins_precomp.mk EXE=rtsora32 rtsora32

64-bit:
export LIBPATH=$COBDIR/lib:$ORACLE_HOME/lib:$LIBPATH
cd $ORACLE_HOME/precomp/lib
make -f ins_precomp.mk rtsora

Regarding precompiling your application, Oracle provide two versions of
Pro*COBOL. Again, you need to use the correct one depending on whether
you're creating a 32-bit or 64-bit application, as the precompiler will
generate different code.

If invoking Pro*COBOL directly, you need to use :

32-bit : procob32 / procob18_32 , e.g.
procob32 myapp.pco
cob -it myapp.cob
rtsora32 myapp.int

or
64-bit : procob / procob18 , e.g.
procob myapp.pco
cob -it myapp.cob
rtsora myapp.int

If you're using Server Express 2.2 SP1 or later, you can also compile using
the Cobsql preprocessor, which will invoke the correct version of Pro*COBOL
under the covers, allowing for a single precompile-compile step, e.g.

cob -ik myapp.pco -C "p(cobsql) csqlt==oracle8 endp"

This method also aids debugging, as you will see the original source code
while animating, rather than the output from the precompiler. See the Server
Express Database Access manual. Prior to SX 2.2 SP1, Cobsql only supported
the creation of 32-bit applications.

I hope this helps -- if you're still having problems, please let me know.

Regards,
SimonT.


Re: Re: Server Express 64bit and Oracle 9i problem (114) on AIX 5.2
Hi Simon (and anyone else)

Thanks for that. We still seem to be getting a very unusual error with our c
ompiles in or makes.

A bit of background: we are "upgrading" from Oracle8i, SAS6, Solaris, MF COB
OL 4.5 to AIX 5L, Oracle9i, SAS8 and MF Server Express COBOL.

When we attempt to compile our COBOL it works fine. However if the COBOL has
 embedded Oracle SQL our procomp makes try to access ADA. We do not use ADA.
 I thought this must have been included by accident; but can find no flag or
 install option for it. So can you give us any clues as to why we are suffer
ing an ADA plague :-))

Wayne


Re: Server Express 64bit and Oracle 9i problem (114) on AIX 5.2
Hi Wayne.

On the surface, it appears as if you're not picking up the correct Pro*COBOL
binary.

If you invoke 'procob' from the command line, you should see something along
the lines of :

Pro*COBOL: Release 9.2.0.4.0 - Production on Mon Apr 19 13:38:07 2004

followed by a list of Pro*COBOL options.

Do you see this, or do you see a different banner (say, Pro*ADA, or
Pro*Fortran)? Assuming you see something other than a Pro*COBOL banner, then
if you invoke 'whence procob', does it show procob as being picked up from
your Oracle bin directory (/home/oracle/9.2.0/bin/procob in my case) ?

If you're either not seeing the correct Pro*COBOL banner, or it's not
located in the correct directory, I'd suggest rebuilding the procob and
procob32 binaries. Logged in under your Oracle user ID, with the Oracle
environment set up :

cd $ORACLE_HOME/precomp/lib
make -f ins_precomp.mk procob32 procob

and then try your compilation process again.

Regards,
SimonT.


Re: Re: Server Express 64bit and Oracle 9i problem (114) on AIX 5.2
Hi Simon

Firstly, thanks for all your help, it was greatly appreciated.

We have the solution to our problem:

The problem is resolved by modifying the line in the job from:

	make -f $SRC_DIR/procob.mk COBS="$SRC_DIR/PFEM025A.cob SYSDATE.cob CNTLGET.
cob" EXE=$SRC_DIR/PFEM025A
to
	make -f $SRC_DIR/procob.mk build COBS="$SRC_DIR/PFEM025A.cob SYSDATE.cob CN
TLGET.cob" EXE=$SRC_DIR/PFEM025A

It appears this (build keyword) is not a requirement for the job to run on S
olaris but is for AIX.

All is working fine.

Cheers

Wayne


Note 6:
=======

 
Doc ID: 	Note:2440385.8	Content Type: 	TEXT/X-HTML	   
Subject: 	Support Description of Bug 2440385	Creation Date: 	08-AUG-2003	   
Type: 	PATCH	Last Revision Date: 	15-AUG-2003	   
Status: 	PUBLISHED		 
Click here for details of sections in this note.
Bug 2440385 AIX: Support for 64 bit ProCobol
This note gives a brief overview of bug 2440385. 
Affects:
 
Product (Component)	Precompilers (Pro*COBOL)	   
Range of versions believed to be affected	Versions >= 7 but < 10G 	   
Versions confirmed as being affected	9.2.0.3 	   
Platforms affected	Aix 64bit 5L 	 
Fixed:
 
This issue is fixed in	9.2.0.4 (Server Patch Set) 	 
Symptoms:
(None Specified) 
Related To:
Pro* Precompiler 
Description
Add support for 64 bit ProCobol

The full bug text (if published) can be seen at Bug 2440385
This link will not work for UNPUBLISHED bugs. 


Note 7:
=======

Displayed below are the messages of the selected thread. 


Thread Status: Closed 

From: Cathy Agada 18-Sep-03 21:40 
Subject: How do I relink rtsora for 64 bit processing 


How do I relink rtsora for 64 bit processing

I have the following error while relinking "rtsora" on AIX 5L/64bit platform on oracle 9.2.0.3 
(I believe my patch is up-to-date). Our Micro Focus compiler version is 2.0.11 

$>make -f ins_precomp.mk relink EXENAME=rtsora 
/bin/make -f ins_precomp.mk LIBDIR=lib32 EXE=/app/oracle/product/9.2.0/precomp/lib/rtsora rtsora32 
Linking /app/oracle/product/9.2.0/precomp/lib/rtsora 
cob64: bad magic number: /app/oracle/product/9.2.0/precomp/lib32/cobsqlintf.o 
make: 1254-004 The error code from the last command is 1. 

Stop. 
make: 1254-004 The error code from the last command is 2. 

My environment variable is as follows: 
COBDIR=/usr/lpp/cobol 
LD_LIBRARY_PATH=$ORACLE_HOME/lib:/app/oracle/product/9.2.0/network/lib 
SHLIB_PATH=$ORACLE_HOME/lib64:/app/oracle/product/9.2.0/lib32 

I added 'define=bit64' on precomp config file. 

Any ideas on what could be wrong. Thanks. 


From: Oracle, Amit Chitnis 19-Sep-03 05:26 
Subject: Re : How do I relink rtsora for 64 bit processing 


Cathy, 

Support for 64 bit Pro*Cobol 9.2.0.3 on AIX 5.1 was provided through one off patch for bug 2440385 

You will need to download and apply the patch for bug 2440385. 

==OR== 

You can dowload and apply the latest 9.2.0.4 patchset where the bug is fixed. 


Thanks, 
Amit Chitnis. 


Note 8:
=======

 
Doc ID: 	Note:215279.1	Content Type: 	TEXT/X-HTML	   
Subject: 	Building Pro*COBOL Programs Fails With "cob64: bad magic number:"	Creation Date: 	08-APR-2003	   
Type: 	PROBLEM	Last Revision Date: 	15-APR-2003	   
Status: 	PUBLISHED		 


fact: Pro*COBOL 9.2.0.2

fact: Pro*COBOL 9.2.0.1

fact: AIX-Based Systems (64-bit)

symptom: Building Pro*COBOL programs fails

symptom: cob64: bad magic number: %s

symptom: /oracle/product/9.2.0/precomp/lib32/cobsqlintf.o

cause: Bug 2440385 AIX: Support for 64 bit ProCobol


fix:

This is fixed in Pro*COBOL 9.2.0.3
One-Off patch for Pro*COBOL 9.2.0.2 has been provided in Metalink Patch Number 
2440385

Reference:

How to Download a Patch from Oracle


Note 9:
=======

If you wish to recreate those executables, say if you've updated your COBOL
environment since installing Oracle, then from looking at the makefiles --
ins_precomp.mk and env_precomp.mk -- then the effective commands to use to
re-link the run-time correctly are as follows (logged in under your Oracle
user ID) :

either mode:
<set up COBDIR, ORACLE_HOME, ORACLE_BASE, ORACLE_SID as appropriate for your
installation>
export PATH=$COBDIR/bin:$ORACLE_HOME/bin:$PATH

32-bit :
export LIBPATH=$COBDIR/lib:$ORACLE_HOME/lib32:$LIBPATH
cd $ORACLE_HOME/precomp/lib
make LIBDIR=lib32 -f ins_precomp.mk EXE=rtsora32 rtsora32

64-bit:
export LIBPATH=$COBDIR/lib:$ORACLE_HOME/lib:$LIBPATH
cd $ORACLE_HOME/precomp/lib
make -f ins_precomp.mk rtsora


Note 10:
========

On 9.2.0.5, try to get the pro cobol patch for 9203. Then just copy the procobol files
to the cobol directory. 


19.61: ORA-12170:
=================

Connection Timeout.

 
Doc ID: 	Note:274303.1	Content Type: 	TEXT/X-HTML	   
Subject: 	Description of parameter SQLNET.INBOUND_CONNECT_TIMEOUT	Creation Date: 	26-MAY-2004	   
Type: 	BULLETIN	Last Revision Date: 	10-FEB-2005	   
Status: 	MODERATED		 

***
This article is being delivered in Draft form and may contain
errors.  Please use the MetaLink "Feedback" button to advise
Oracle of any issues related to this article.
***

PURPOSE
-------

To specify the time, in seconds, for a client to connect with the database server 
and provide the necessary authentication information.

 
Description of parameter SQLNET.INBOUND_CONNECT_TIMEOUT
-------------------------------------------------------
This parameter has been introduced in 9i version. 
This has to be configured in sqlnet.ora file.
 

Use the SQLNET.INBOUND_CONNECT_TIMEOUT parameter to specify the time,
in seconds, for a client to connect with the database server 
and provide the necessary authentication information.
    
If the client fails to establish a connection and complete authentication 
in the time specified, then the database server terminates the connection.
In addition, the database server logs the IP address of the client 
and an ORA-12170: TNS:Connect timeout occurred error message to the sqlnet.log 
file. The client receives either an ORA-12547: TNS:lost contact or 
an ORA-12637: Packet receive failed error message.
    
Without this parameter, a client connection to the database server can stay open 
indefinitely without authentication. Connections without authentication can 
introduce possible denial-of-service attacks, whereby malicious clients attempt to flood database servers with
connect requests that consume resources.
    
To protect both the database server and the listener, 
Oracle Corporation recommends setting this parameter in combination with the 
INBOUND_CONNECT_TIMEOUT_listener_name parameter in the listener.ora file.
When specifying values for these parameters, 
consider the following recommendations:
    
   *Set both parameters to an initial low value.
   *Set the value of the INBOUND_CONNECT_TIMEOUT_listener_name parameter to a 
    lower value than the SQLNET.INBOUND_CONNECT_TIMEOUT parameter.
For example, you can set INBOUND_CONNECT_TIMEOUT_listener_name to 2 seconds and
INBOUND_CONNECT_TIMEOUT parameter to 3 seconds. 
If clients are unable to complete connections within the specified time 
due to system or network delays that are normal for the particular
environment, then increment the time as needed.

By default is set to None

Example
SQLNET.INBOUND_CONNECT_TIMEOUT=3


RELATED DOCUMENTS
-----------------

Oracle9i Net Services Reference Guide, Release 2 (9.2), Part Number A96581-02


SQLNET.EXPIRE_TIME:
-------------------

Purpose: 
 Determines time interval to send a probe to verify the session is alive 

See Also: Oracle Advanced Security Administrator's Guide 
 
Default:  
 None 
 
Minimum Value:  
 0 minutes 
 
Recommended Value:  
 10 minutes 
 
Example: 
 sqlnet.expire_time=10


sqlnet.expire_time
Enables dead connection detection, that is, after the specifed time (in minutes) the server checks 
if the client is still connected. 
If not, the server process exits. This parameter must be set on the server


PROBLEM:
Long query (20 minutes) returns ORA-01013 after about a minute.

SOLUTION:
The SQLNET.ORA parameter SQLNET.EXPIRE_TIME was set to a one(1).
The parameter was changed to...
SQLNET.EXPIRE_TIME=2147483647
This allowed the query to complete.
This is documented in the Oracle Troubleshooting manual on page 324.
The manual part number is A54757.01.

Keywords:

SQLNET.EXPIRE_TIME,SQLNET.ORA,ORA-01013

sqlnet.expire_time should be set on the server. The server sends keep alive traffic over connections 
that have already been established. You won't need to change your firewall. 

sqlnet.expire_time is actually intended to test connections in order to allow oracle to clean up resources 
from connection that abnormally terminated. 

The architecture to do that means that the server will send a probe packet to the client. That probe packet 
is viewed by the most firewalls as traffic on the line. That will in short reset the idle timers on the firewall. 
If you happen to have the disconnects from idle timers then it may help. 
It was not intended for that feature but it is a byproduct of the design. 


19.62: Tracing SQLNET:
======================

Note 1:
-------


Doc ID:  Note:219968.1 
Subject:  SQL*Net, Net8, Oracle Net Services - Tracing and Logging at a Glance 
Type:  BULLETIN 
Status:  PUBLISHED 
 Content Type:  TEXT/X-HTML 
Creation Date:  20-NOV-2002 
Last Revision Date:  26-AUG-2003 
 

TITLE
-----

SQL*Net, Net8, Oracle Net Services - Tracing and Logging at a Glance.


PURPOSE
-------

The purpose of Oracle Net tracing and logging is to provide detailed
information to track and diagnose Oracle Net problems such as connectivity
issues, abnormal disconnection and connection delay. Tracing provides varying
degrees of information that describe connection-specific internal operations
during Oracle Net usage. Logging reports summary, status and error messages.

Oracle Net Services is the replacement name for the Oracle Networking product
formerly known as SQL*Net (Oracle7 [v2.x]) and Net8 (Oracle8/8i [v8.0/8.1]).
For consistency, the term Oracle Net is used thoughout this article and refers
to all Oracle Net product versions.


SCOPE & APPLICATION
-------------------

The aim of this document is to overview SQL*Net, Net8, Oracle Net Services
tracing and logging facilities. The intended audience includes novice Oracle
users and DBAs alike. Although only basic information on how to enable and
disable tracing and logging features is described, the document also serves
as a quick reference. The document provides the reader with the minimum
information necessary to generate trace and log files with a view to
forwarding them to Oracle Support Services (OSS) for further diagnosis. The
article does not intend to describe trace/log file contents or explain how to
interpret them.


LOG & TRACE PARAMETER OVERVIEW
------------------------------

The following is an overview of Oracle Net trace and log parameters.

  TRACE_LEVEL_[CLIENT|SERVER|LISTENER]     = [0-16|USER|ADMIN|SUPPORT|OFF]
  TRACE_FILE_[CLIENT|SERVER|LISTENER]      = <FILE NAME>
  TRACE_DIRECTORY_[CLIENT|SERVER|LISTENER] = <DIRECTORY>
  TRACE_UNIQUE_[CLIENT|SERVER|LISTENER]    = [ON|TRUE|OFF|FALSE]
  TRACE_TIMESTAMP_[CLIENT|SERVER|LISTENER] = [ON|TRUE|OFF|FALSE]   #Oracle8i+
  TRACE_FILELEN_[CLIENT|SERVER|LISTENER]   = <SIZE in KB>          #Oracle8i+
  TRACE_FILENO_[CLIENT|SERVER|LISTENER]    = <NUMBER>              #Oracle8i+

  LOG_FILE_[CLIENT|SERVER|LISTENER]        = <FILE NAME>
  LOG_DIRECTORY_[CLIENT|SERVER|LISTENER]   = <DIRECTORY NAME>
  LOGGING_LISTENER                         = [ON|OFF]

  TNSPING.TRACE_LEVEL                      = [0-16|USER|ADMIN|SUPPORT|OFF]
  TNSPING.TRACE_DIRECTORY                  = <DIRECTORY>

  NAMES.TRACE_LEVEL                        = [0-16|USER|ADMIN|SUPPORT|OFF]
  NAMES.TRACE_FILE                         = <FILE NAME> 
  NAMES.TRACE_DIRECTORY                    = <DIRECTORY>
  NAMES.TRACE_UNIQUE                       = [ON|OFF]
  NAMES.LOG_FILE                           = <FILE NAME>
  NAMES.LOG_DIRECTORY                      = <DIRECTORY>
  NAMES.LOG_UNIQUE                         = [ON|OFF]

  NAMESCTL.TRACE_LEVEL                     = [0-16|USER|ADMIN|SUPPORT|OFF]
  NAMESCTL.TRACE_FILE                      = <FILE NAME>
  NAMESCTL.TRACE_DIRECTORY                 = <DIRECTORY>
  NAMESCTL.TRACE_UNIQUE                    = [ON|OFF]

  Note: With the exception of parameters suffixed with LISTENER, all other
        parameter suffixes and prefixes [CLIENT|NAMES|NAMESCTL|SERVER|TNSPING]
        are fixed and cannot be changed. For parameters suffixed with LISTENER,
        the suffix name should be the actual Listener name. For example, if
        the Listener name is PROD_LSNR, an example trace parameter name would
        be TRACE_LEVEL_PROD_LSNR=OFF.


CONFIGURATION FILES
-------------------

Files required to enable Oracle Net tracing and logging features include:

  Oracle Net Listener        LISTENER.ORA                  LISTENER.TRC
  Oracle Net - Client        SQLNET.ORA on client          SQLNET.TRC
  Oracle Net - Server        SQLNET.ORA on server          SQLNET.TRC
  TNSPING Utility            SQLNET.ORA on client/Server   TNSPING.TRC
  Oracle Name Server         NAMES.ORA                     NAMES.TRC
  Oracle NAMESCTL            SQLNET.ORA on server
  Oracle Connection Manager  CMAN.ORA


CONSIDERATIONS WHEN USING LOGGING/TRACING
-----------------------------------------

1. Verify which Oracle Net configuration files are in use.
   By default, Oracle Net configuration files are sought and resolved from
   the following locations:

   TNS_ADMIN environment variable (incl. Windows Registry Key)
   /etc or /var/opt/oracle (Unix)
   $ORACLE_HOME/network/admin (Unix)
   %ORACLE_HOME%/Network/Admin or %ORACLE_HOME%/Net80/Admin (Windows)

   Note: User-specific Oracle Net parameters may also reside in
         $HOME/sqlnet.ora file.
         An Oracle Net server installation is also a client.
   
2. Oracle Net tracing and logging can consume vast quantities of disk space.
   Monitor for sufficient disk space when tracing is enabled.
   On some Unix operating systems, /tmp is used for swap space.
   Although generally writable by all users, this is not an ideal location for
   trace/log file generation.

3. Oracle Net tracing should only be enabled for the duration of the issue at
   hand. Oracle Net tracing should always be disabled after problem resolution.

4. Large trace/log files place an overhead on the processes that generate them.
   In the absence of issues, the disabling of tracing and/or logging will
   improve Oracle Net overall efficiency.
   Alternatively, regularly truncating log files will also improve efficiency.

5. Ensure that the target trace/log directory is writable by the connecting
   user, Oracle software owner and/or user that starts the Net Listener.


LOG & TRACE PARAMETERS
----------------------

This section provides a detailed description of each trace and log parameter.

  TRACE LEVELS

    TRACE_LEVEL_[CLIENT|SERVER|LISTENER] = [0-16|USER|ADMIN|SUPPORT|OFF]
    Determines the degree to which Oracle Net tracing is provided.
    Configuration file is SQLNET.ORA, LISTENER.ORA.
    Level 0 is disabled - level 16 is the most verbose tracing level.
    Listener tracing requires the Net Listener to be reloaded or restarted
    after adding trace parameters to LISTENER.ORA.
    Oracle Net (client/server) tracing takes immediate effect after tracing
    parameters are added to SQLNET.ORA.
    By default, the trace level is OFF.
  
    OFF     (equivalent to 0) disabled - provides no tracing.
    USER    (equivalent to 4) traces to identify user-induced error conditions. 
    ADMIN   (equivalent to 6) traces to identify installation-specific problems. 
    SUPPORT (equivalent to 16) trace information required by OSS for 
            troubleshooting.

  TRACE FILE NAME

    TRACE_FILE_[CLIENT|SERVER|LISTENER] = <FILE NAME>
    Determines the trace file name.
    Any valid operating system file name.
    Configuration file is SQLNET.ORA, LISTENER.ORA.
    Trace file is automatically appended with '.TRC'.
    Default trace file name is SQLNET.TRC, LISTENER.TRC.

  TRACE DIRECTORY

    TRACE_DIRECTORY_[CLIENT|SERVER|LISTENER] = <DIRECTORY>
    Determines the directory in which trace files are written.
    Any valid operating system directory name.
    Configuration file is SQLNET.ORA, LISTENER.ORA.
    Directory should be writable by the connecting user and/or Oracle software
    owner.
    Default trace directory is $ORACLE_HOME/network/trace.

  UNIQUE TRACE FILES

    TRACE_UNIQUE_[CLIENT|SERVER|LISTENER] = [ON|TRUE|OFF|FALSE]
    Allows generation of unique trace files per connection.
    Trace file names are automatically appended with '_<PID>.TRC'.
    Configuration file is SQLNET.ORA, LISTENER.ORA.
    Unique tracing is ideal for sporadic issues/errors that occur infrequently
    or randomly.
    Default value is OFF

  TRACE TIMING

    TRACE_TIMESTAMP_[CLIENT|SERVER|LISTENER] = [ON|TRUE|OFF|FALSE]
    A timestamp in the form of [DD-MON-YY 24HH:MI;SS] is recorded against each
    operation traced by the trace file.
    Configuration file is SQLNET.ORA, LISTENER.ORA
    Suitable for hanging or slow connection issues.
    Available from Oracle8i onwards.
    Default value is is OFF.

  MAXIMUM TRACE FILE LENGTH

    TRACE_FILELEN_[CLIENT|SERVER|LISTENER] = <SIZE>
    Determines the maximum trace file size in Kilobytes (Kb).
    Configuration file is SQLNET.ORA, LISTENER.ORA.
    Available from Oracle8i onwards.
    Default value is UNLIMITED.

  TRACE FILE CYCLING

    TRACE_FILENO_[CLIENT|SERVER|LISTENER] = <NUMBER>
    Determines the maximum number of trace files through which to perform
    cyclic tracing.
    Configuration file is SQLNET.ORA, LISTENER.ORA.
    Suitable when disk space is limited or when tracing is required to be
    enabled for long periods.
    Available from Oracle8i onwards.
    Default value is 1 (file).

  LOG FILE NAME

    LOG_FILE_[CLIENT|SERVER|LISTENER] = <FILE NAME>
    Determines the log file name.
    May be any valid operating system file name.
    Configuration file is SQLNET.ORA, LISTENER.ORA.
    Log file is automatically appended with '.LOG'.
    Default log file name is SQLNET.LOG, LISTENER.LOG.

  LOG DIRECTORY

    LOG_DIRECTORY_[CLIENT|SERVER|LISTENER] = <DIRECTORY NAME>
    Determines the directory in which log files are written.
    Any valid operating system directory name.
    Configuration file is SQLNET.ORA, LISTENER.ORA.
    Directory should be writable by the connecting user or Oracle software
    owner.
    Default directory is $ORACLE_HOME/network/log.
  

  DISABLING LOGGING

    LOGGING_LISTENER = [ON|OFF]
    Disables Listener logging facility.
    Configuration file is LISTENER.ORA.
    Default value is ON.


ORACLE NET TRACE/LOG EXAMPLES
-----------------------------

  CLIENT (SQLNET.ORA)
    trace_level_client = 16
    trace_file_client = cli
    trace_directory_client = /u01/app/oracle/product/9.0.1/network/trace
    trace_unique_client = on
    trace_timestamp_client = on
    trace_filelen_client = 100
    trace_fileno_client = 2
    log_file_client = cli
    log_directory_client = /u01/app/oracle/product/9.0.1/network/log
    tnsping.trace_directory = /u01/app/oracle/product/9.0.1/network/trace
    tnsping.trace_level = admin

  SERVER (SQLNET.ORA)

    trace_level_server = 16
    trace_file_server = svr
    trace_directory_server = /u01/app/oracle/product/9.0.1/network/trace
    trace_unique_server = on
    trace_timestamp_server = on
    trace_filelen_server = 100
    trace_fileno_server = 2
    log_file_server = svr
    log_directory_server = /u01/app/oracle/product/9.0.1/network/log

    namesctl.trace_level = 16
    namesctl.trace_file = namesctl
    namesctl.trace_directory = /u01/app/oracle/product/9.0.1/network/trace
    namesctl.trace_unique = on

  LISTENER (LISTENER.ORA)

    trace_level_listener = 16
    trace_file_listener = listener
    trace_directory_listener = /u01/app/oracle/product/9.0.1/network/trace
    trace_timestamp_listener = on
    trace_filelen_listener = 100
    trace_fileno_listener = 2
    logging_listener = off
    log_directory_listener = /u01/app/oracle/product/9.0.1/network/log
    log_file_listener=listener

  NAMESERVER TRACE (NAMES.ORA)

    names.trace_level = 16
    names.trace_file = names
    names.trace_directory = /u01/app/oracle/product/9.0.1/network/trace
    names.trace_unique = off

  CONNECTION MANAGER TRACE (CMAN.ORA)

    tracing = yes


RELATED DOCUMENTS
-----------------

Note 16658.1   (7) Tracing SQL*Net/Net8
Note 111916.1  SQLNET.ORA Logging and Tracing Parameters
Note 39774.1   Log & Trace Facilities on Net v2
Note 73988.1   How to Get Cyclic SQL*Net Trace Files when Disk Space is Limited
Note 1011114.6 SQL*Net V2 Tracing
Note 1030488.6 Net8 Tracing


Note 2:
-------

Doc ID:  Note:39774.1 
Subject:  LOG & TRACE Facilities on NET v2. 
Type:  FAQ 
Status:  PUBLISHED 
 Content Type:  TEXT/X-HTML 
Creation Date:  25-JUL-1996 
Last Revision Date:  31-JAN-2002 
 

                      LOG AND TRACE FACILITIES ON SQL*NET V2
                      ======================================
 
This article describes the log and trace facilities that can be used to
examine application connections that use SQL*Net. This article is based on 
usage of SQL*NET v2.3. It explains how to invoke the trace facility and how 
to use the log and trace information to diagnose and resolve operating problems.
Following topics are covered below:

               o  What the log facility is

               o  What the trace facility is

               o  How to invoke the trace facility

               o  Logging and tracing parameters 

               o  Sample log output

               o  Sample trace output

Note: Information in this section is generic to all operating system 
      environments. You may require further information from the Oracle 
      operating system-specific documentation for some details of your specific 
      operating environment.


________________________________________

1. What is the Log Facility?
============================

All errors encountered in SQL*Net are logged to a log file for evaluation by a 
network or database administrator. The log file provides additional information 
for an administrator when the error on the screen is inadequate to understand 
the failure. The log file, by way of the error stack, shows the state of the 
TNS software at various layers. The properties of the log file are: 

    o  Error information is appended to the log file when an error occurs.

    o  Generally, a log file can only be replaced or erased by an administrator,
        although client log files can be deleted by the user whose application 
       created them. (Note that in general it is bad practice to delete these 
       files while the program using them is still actively logging.)


    o  Logging of errors for the client, server, and listener cannot be 
       disabled. This is an essential feature that ensures all errors are 
       recorded. 

    o  The Navigator and Connection Manager components of the MultiProtocol
       Interchange may have logging turned on or off. If on, logging includes
       connection statistics. 

    o  The Names server may have logging turned on or off. If on, a Names 
       server's operational events are written to a specified logfile. You set 
       logging parameters using the Oracle Network Manager.


________________________________________
 

2. What is the Trace Facility?
==============================

The trace facility allows a network or database administrator to obtain more
information on the internal operations of the components of a TNS network
than is provided in a log file. Tracing an operation produces a detailed
sequence of statements that describe the events as they are executed. All
trace output is directed to trace output files which can be evaluated after
the failure to identify the events that lead up to an error. The trace
facility is typically invoked during the occurrence of an abnormal
condition, when the log file does not provide a clear indication of the
cause.

Attention: The trace facility uses a large amount of disk space and may have
           a significant impact upon system performance. Therefore, you are 
           cautioned to turn the trace facility ON only as part of a diagnostic 
           procedure and to turn it OFF promptly when it is no longer necessary.

Components that can be traced using the trace facility are:

    o  Network listener
    o  SQL*Net version 2 components
       -  SQL*Net client
       -  SQL*Net server
    o  MultiProtocol Interchange components
       -  the Connection Manager and pumps
       -  the Navigator
    o  Oracle Names
       -   Names server
       -  Names Control Utility

The trace facility can be used to identify the following types of problems:
    -  Difficulties in establishing connections
    -  Abnormal termination of established connections
    -  Fatal errors occurring during the operation of TNS network
       components


________________________________________


3. What is the Difference between Logging and Tracing?
======================================================

While logging provides the state of the TNS components at the time of an
error, tracing provides a description of all software events as they occur,
and therefore provides additional information about events prior to an
error. There are three levels of diagnostics, each providing more
information than the previous level. The three levels are:

  1. The reported error from Oracle7 or tools; this is the single error that
     is commonly returned to the user.

  2. The log file containing the state of TNS at the time of the error. This 
     can often uncover low level errors in interaction with the underlying 
     protocols.

  3. The trace file containing English statements describing what the TNS 
     software has done from the time the trace session was initiated until the 
     failure is recreated.


When an error occurs, a simple error message is displayed and a log file is
generated. Optionally, a trace file can be generated for more information.
(Remember, however, that using the trace facility has an impact on your
system performance.)

In the following example, the user failed to use Oracle Network Manager to
create a configuration file, and misspelled the word "PORT" as "POT" in the
connect descriptor. It is not important that you understand in detail the
contents of each of these results; this example is intended only to provide
a comparison.

Reported Error (On the screen in SQL*Forms):

        ERROR: ORA-12533: Unable to open message file (SQL-02113)

Logged Error (In the log file, SQLNET.LOG): 

        ****************************************************************
        Fatal OSN connect error 12533, connecting to:
        (DESCRIPTION=(CONNECT_DATA=(SID=trace)(CID=(PROGRAM=)(HOST=lala)
        (USER=ginger)))(ADDRESS_LIST=(ADDRESS=(PROTOCOL=ipc)
        (KEY=bad_port))(ADDRESS=(PROTOCOL=tcp)(HOST=lala)(POT=1521))))

        VERSION INFORMATION:
        TNS for SunOS: Version 2.0.14.0.0 - Developer's Release
        Oracle Bequeath NT Protocol Adapter for SunOS: Version
        2.0.14.0.0 - Developer's Release
        Unix Domain Socket IPC NT Protocol Adaptor for SunOS: Version
        2.0.14.0.0 - Developer's Release
        TCP/IP NT Protocol Adapter for SunOS: Version 2.0.14.0.0 -
        Developer's Release
        Time: 07-MAY-93 17:38:50
        Tracing to file: /home/ginger/trace_admin.trc
        Tns error struct:
        nr err code: 12206
        TNS-12206: TNS:received a TNS error while doing navigation
        ns main err code: 12533
        TNS-12533: TNS:illegal ADDRESS parameters
        ns secondary err code: 12560
        nt main err code: 503
        TNS-00503: Illegal ADDRESS parameters
        nt secondary err code: 0
        nt OS err code: 0

Example of Trace of Error
-------------------------

The trace file, SQLNET.TRC at the USER level, contains the 
following information:

        --- TRACE CONFIGURATION INFORMATION FOLLOWS ---
        New trace stream is "/private1/oracle/trace_user.trc"
        New trace level is 4
        --- TRACE CONFIGURATION INFORMATION ENDS ---

        --- PARAMETER SOURCE INFORMATION FOLLOWS ---
        Attempted load of system pfile source
        /private1/oracle/network/admin/sqlnet.ora
        Parameter source was not loaded
        Error stack follows:
        NL-00405: cannot open parameter file

        Attempted load of local pfile source /home/ginger/.sqlnet.ora
        Parameter source loaded successfully

        -> PARAMETER TABLE LOAD RESULTS FOLLOW <-
        Some parameters may not have been loaded
        See dump for parameters which loaded OK
        -> PARAMETER TABLE HAS THE FOLLOWING CONTENTS <-
        TRACE_DIRECTORY_CLIENT = /private1/oracle
        trace_level_client = USER
        TRACE_FILE_CLIENT = trace_user
        --- PARAMETER SOURCE INFORMATION ENDS ---

        --- LOG CONFIGURATION INFORMATION FOLLOWS ---
        Attempted open of log stream "/tmp_mnt/home/ginger/sqlnet.log"
        Successful stream open
        --- LOG CONFIGURATION INFORMATION ENDS ---

        Unable to get data from navigation file tnsnav.ora
        local names file is /home/ginger/.tnsnames.ora
        system names file is /etc/tnsnames.ora
        -<ERROR>- failure, error stack follows
        -<ERROR>- NL-00427: bad list
        -<ERROR>- NOTE: FILE CONTAINS ERRORS, SOME NAMES MAY BE MISSING

        Calling address:
        (DESCRIPTION=(CONNECT_DATA=(SID=trace)(CID=(PROGRAM=)(HOST=lala)(USER=ginger)))
        (ADDRESS_LIST=(ADDRESS=(PROTOCOL=ipc)(KEY=bad_port))(ADDRESS=(PROTOCOL=tcp)(HOST
        Getting local community information
        Looking for local addresses setup by nrigla
        No addresses in the preferred address list
        TNSNAV.ORA is not present. No local communities entry.
        Getting local address information
        Address list being processed...
        No community information so all addresses are "local"
        Resolving address to use to call destination or next hop
        Processing address list...
        No community entries so iterate over address list
        This a local community access
        Got routable address information
        Making call with following address information:
        (DESCRIPTION=(EMPTY=0)(ADDRESS=(PROTOCOL=ipc)(KEY=bad_port)))
        Calling with outgoing connect data
        (DESCRIPTION=(CONNECT_DATA=(SID=trace)(CID=(PROGRAM=)(HOST=lala)(USER=ginger)))
        (ADDRESS_LIST=(ADDRESS=(PROTOCOL=tcp)(HOST=lala)(POT=1521))))
        (DESCRIPTION=(EMPTY=0)(ADDRESS=(PROTOCOL=ipc)(KEY=bad_port)))
        KEY = bad_port
        connecting...
        opening transport...
        -<ERROR>- sd=8, op=1, resnt[0]=511, resnt[1]=2, resnt[2]=0
        -<ERROR>- unable to open transport
        -<ERROR>- nsres: id=0, op=1, ns=12541, ns2=12560; nt[0]=511, nt[1]=2,
        nt[2]=0
        connect attempt failed
        Call failed...
        Call made to destination
        Processing address list so continuing
        Getting local community information
        Looking for local addresses setup by nrigla
        No addresses in the preferred address list
        TNSNAV.ORA is not present. No local communities entry.
        Getting local address information
        Address list being processed...
        No community information so all addresses are "local"
        Resolving address to use to call destination or next hop
        Processing address list...
        No community entries so iterate over address list
        This a local community access
        Got routable address information
        Making call with following address information:
        (DESCRIPTION=(EMPTY=0)(ADDRESS=(PROTOCOL=tcp)(HOST=lala)(POT=1521)))
        Calling with outgoing connect data
        (DESCRIPTION=(CONNECT_DATA=(SID=trace)(CID=(PROGRAM=)(HOST=lala)(USER=ginger)))
        (ADDRESS_LIST=(ADDRESS=(PROTOCOL=tcp)(HOST=lala)(POT=521))))
        (DESCRIPTION=(EMPTY=0)(ADDRESS=(PROTOCOL=tcp)(HOST=lala)(POT=1521)))

        -<FATAL?>- failed to recognize: POT

        -<ERROR>- nsres: id=0, op=13, ns=12533, ns2=12560; nt[0]=503, nt[1]=0,
        nt[2]=0
        Call failed...
        Exiting NRICALL with following termination result -1
        -<ERROR>- error from nricall
        -<ERROR>- nr err code: 12206
        -<ERROR>- ns main err code: 12533
        -<ERROR>- ns (2) err code: 12560
        -<ERROR>- nt main err code: 503
        -<ERROR>- nt (2) err code: 0
        -<ERROR>- nt OS err code: 0
        -<ERROR>- Couldn't connect, returning 12533


In the trace file, note that unexpected events are preceded with an
-<ERROR>- stamp. These events may represent serious errors, minor errors, or
merely unexpected results from an internal operation. More serious and
probably fatal errors are stamped with the -<FATAL?>- prefix.

In this example trace file, you can see that the root problem, the
misspelling of "PORT," is indicated by the trace line: -<FATAL?>- failed to
recognize: POT

Most tracing is very similar to this. If you have a basic understanding of
the events the components will perform, you can identify the probable cause
of an error in the text of the trace.
________________________________________
 

4. Log File Names
=================

Log files produced by different components have unique names. The default
file names are:

 SQLNET.LOG           Contains client and/or server
                      information

 LISTENER.LOG         Contains listener information

 INTCHG.LOG           Contains Connection Manager and pump
                      information

 NAVGATR.LOG          Contains Navigator information

 NAMES.LOG            Contains Names server information

You can control the name of the log file. For each component, any valid
string can be used to create a log file name. The parameters are of the
form:

LOG_FILE_component = string

For example:

LOG_FILE_LISTENER = TEST

Some platforms have restrictions on the properties of a file name. See your
Oracle operating system specific manuals for platform specific restrictions.

_____________________________________

5. Using Log Files
==================

Follow these steps to track an error using a log file:

1. Browse the log file for the most recent error that matches the error
number you have received from the application. This is almost always the
last entry in the log file. Notice that an entry or error stack in the log
file is usually many lines in length. In the example earlier in this
chapter, the error number was 12207.

2. Starting at the bottom, look up to the first non-zero entry in the error
report. This is usually the actual cause. In the example earlier in this
chapter, the last non-zero entry is the "ns" error 12560.

3. Look up the first non-zero entry in later chapters of this book for its
recommended cause and action. (For example, you would find the "ns" error
12560 under ORA-12560.) To understand the notation used in the error report,
see the previous chapter, "Interpreting Error Messages."

4. If that error does not provide the desired information, move up the error
stack to the second to last error and so on.

5. If the cause of the error is still not clear, turn on tracing and
re-execute the statement that produced the error message. The use of the
trace facility is described in detail later in this chapter. Be sure to turn
tracing off after you have re-executed the command.

________________________________________
 

6. Using the Trace Facility
===========================

The steps used to invoke tracing are outlined here. Each step is fully
described in subsequent sections.

1. Choose the component to be traced from the list:

       o  Client 
       o  Server
       o  Listener
       o  Connection Manager and pump (cmanager)
       o  Navigator (navigator)
       o  Names server
       o  Names Control Utility

2. Save existing trace file if you need to retain information on it. By default
   most trace files will overwrite an existing ones. TRACE_UNIQUE parameter needs
   to be included in appropriate config. files if unique trace files are required.
   This appends Process Id to each file.
   For Example:
       For Names server tracing, NAMES.TRACE_UNIQUE=ON needs to be set in NAMES.
       ORA file. For Names Control Utility, NAMESCTL.TRACING_UNIQUE=TRUE needs 
       to be in SQLNET.ORA. TRACE_UNIQUE_CLIENT=ON in SQLNET.ORA for Client 
       Tracing.


3. For any component, you can invoke the trace facility by editing the 
   component configuration file that corresponds to the component traced. The 
   component config. files are SQLNET.ORA, LISTENER.ORA, INTCHG.ORA, and NAMES.
   ORA. 

4. Execute or start the component to be traced. If the trace component 
   configuration files are modified while the component is running, the 
   modified trace parameters will take effect the next time the component is 
   invoked or restarted. Specifically for each component:


   CLIENT:   Set the trace parameters in the client-side SQLNET.ORA and invoke 
             a client application, such as SQL*Plus, a Pro*C application, or 
             any application that uses the Oracle network products. 

   SERVER:   Set the trace parameters in the server-side SQLNET.ORA. The next 
             process started by the listener will have tracing enabled. The 
             trace parameters must be created or edited manually.


   LISTENER: Set the trace parameters in the LISTENER.ORA

   CONNECTION MANAGER: 
             Set the trace parameters in INTCHG.ORA and start the Connection 
             Manager from the Interchange Control Utility or command line. The 
             pumps are started automatically with the Connection Manager, and 
             their trace files are controlled by the trace parameters for the 
             Connection Manager. 

                 
   NAVIGATOR:Again, set the trace parameters in INTCHG.ORA and start the 
             Navigator 

   NAMES SERVER:  
             Trace parameters needs to be set in NAMES.ORA and start the Names 
             server. 
 
   NAMES CONTROL UTILITY:
             Set the trace parameters in SQLNET.ORA and start the Names Control 
             Utility 


5. Be sure to turn tracing off when you do not need it for a specific
   diagnostic purpose.

________________________________________
 

7. Setting Trace Parameters
===========================

The trace parameters are defined in the same configuration files as the log
parameters. Table below shows the configuration files for different network
components and the default names of the trace files they generate.


 --------------------------------------------------------
| Trace Parameters  | Configuration   |                  |
| Corresponding to  | File            | Output Files     |
|-------------------|-----------------|------------------| 
|                   |                 |                  |
| Client            | SQLNET.ORA      | SQLNET.TRC       |
| Server            |                 | SQLNET.TRC       |       
| TNSPING Utility   |                 | TNSPING.TRC      |
| Names Control     |                 |                  |
|   Utility         |                 | NAMESCTL.TRC     |
|-------------------|-----------------|------------------|
| Listener          | LISTENER.ORA    | LISTENER.TRC     |
|-------------------|-----------------|------------------|
| Interchange       | INTCHG.ORA      |                  |
|   Connection      |                 |                  | 
|     Manager       |                 | CMG.TRC          |
|   Pumps           |                 | PMP.TRC          |
|   Navigator       |                 | NAV.TRC          |
|-------------------|-----------------|------------------|
| Names server      | NAMES.ORA       | NAMES.TRC        |
|___________________|_________________|__________________|

The configuration files for each component are located on the computer
running that component.

The trace characteristics for two or more components of an Interchange are
controlled by different parameters in the same configuration file. For
example, there are separate sets of parameters for the Connection Manager
and the Navigator that determine which components will be traced, and at
what level.

Similarly, if there are multiple listeners on a single computer, each
listener is controlled by parameters that include the unique listener name
in the LISTENER.ORA file.

For each component, the configuration files contain the following
information:

         o  A valid trace level to be used (Default is OFF)
         o  The trace file name (optional)
         o  The trace file directory (optional)


________________________________________

 
7a. Valid SQLNET.ORA Diagnostic Parameters
==========================================

The SQLNET.ORA caters for:
         o Client Logging & Tracing 
         o Server Logging & Tracing 
         o TNSPING utility
         o NAMESCTL program


 ------------------------------------------------------------------------------
|                        |                |                                    |
| PARAMETERS             | VALUES         | Example (DOS client, UNIX server)  |
|                        |                |                                    |
|------------------------|----------------|------------------------------------|
|Parameters for Client                                                         |
|=====================                                                         |
|------------------------------------------------------------------------------|
|                        |                |                                    |
| TRACE_LEVEL_CLIENT     | OFF/USER/ADMIN | TRACE_LEVEL_CLIENT=USER            |
|                        |                |                                    |
| TRACE_FILE_CLIENT      | string         | TRACE_FILE_CLIENT=CLIENT           |
|                        |                |                                    |
| TRACE_DIRECTORY_CLIENT | valid directory| TRACE_DIRECTORY_CLIENT=c:\NET\ADMIN|
|                        |                |                                    |
| TRACE_UNIQUE_CLIENT    | OFF/ON         | TRACE_UNIQUE_CLIENT=ON             |
|                        |                |                                    |
| LOG_FILE_CLIENT        | string         | LOG_FILE_CLIENT=CLIENT             |
|                        |                |                                    |
| LOG_DIRECTORY_CLIENT   | valid directory| LOG_DIRECTORY_CLIENT=c:\NET\ADMIN  |
|------------------------------------------------------------------------------|
|Parameters for Server                                                         |
|=====================                                                         |
|------------------------------------------------------------------------------|
|                        |                |                                    |
| TRACE_LEVEL_SERVER     | OFF/USER/ADMIN | TRACE_LEVEL_SERVER=ADMIN           |
|                        |                |                                    |
| TRACE_FILE_SERVER      | string         | TRACE_FILE_SERVER=unixsrv_2345.trc |
|                        |                |                                    |
| TRACE_DIRECTORY_SERVER | valid directory| TRACE_DIRECTORY_SERVER=/tmp/trace  |
|                        |                |                                    |
| LOG_FILE_SERVER        | string         | LOG_FILE_SERVER=unixsrv.log        |
|                        |                |                                    |
| LOG_DIRECTORY_SERVER   | valid directory| LOG_DIRECTORY_SERVER=/tmp/trace    |
|------------------------------------------------------------------------------|

 
 ---(SQLNET.ORA Cont.)---------------------------------------------------------
|                        |                |                                    |
| PARAMETERS             | VALUES         | Example (DOS client, UNIX server)  |
|                        |                |                                    |
|------------------------|----------------|------------------------------------|
|
|Parameters for TNSPING                                                        |
|======================                                                        |
|------------------------------------------------------------------------------|
|                        |                |                                    |
| TNSPING.TRACE_LEVEL    | OFF/USER/ADMIN | TNSPING.TRACE_LEVEL=user           |
|                        |                |                                    |
| TNSPING.TRACE_DIRECTORY| directory      |TNSPING.TRACE_DIRECTORY=            |
|                        |                |             /oracle7/network/trace |
|                        |                |                                    |
|------------------------------------------------------------------------------|
|Parameters for Names Control Utility                                          |
|====================================                                          |
|------------------------------------------------------------------------------|
|                        |                |                                    |
| NAMESCTL.TRACE_LEVEL   | OFF/USER/ADMIN |NAMESCTL.TRACE_LEVEL=user           |
|                        |                |                                    |
| NAMESCTL.TRACE_FILE    | file           |NAMESCTL.TRACE_FILE=nc_south.trc    |
|                        |                |                                    |
| NAMESCTL.TRACE_DIRECTORY| directory     |NAMESCTL.TRACE_DIRECTORY=/o7/net/trace|
|                        |                |                                    |
| NAMESCTL.TRACE_UNIQUE  |  TRUE/FALSE    |NAMESCTL.TRACE_UNIQUE=TRUE or ON/OFF|
|                        |                |                                    |
 ------------------------------------------------------------------------------


Note: You control log and trace parameters for the client through Oracle
      Network Manager. You control log and trace parameters for the server by
      manually adding the desired parameters to the SQLNET.ORA file.

      Parameters for Names Control Utility & TNSPING Utility need to be added
      manually to SQLNET.ORA file. You cannot create them using Oracle Network Manager.


________________________________________
 

7b. Valid LISTENER.ORA Diagnostic Parameters
============================================

The following table shows the valid LISTENER.ORA parameters used in logging
and tracing of the listener.

 ------------------------------------------------------------------------------
|                        |                |                                    |
| PARAMETERS             | VALUES         | Example (DOS client, UNIX server)  |
|                        |                |                                    |
|------------------------|----------------|------------------------------------|
|                        |                |                                    |
|TRACE_LEVEL_LISTENER    | USER           | TRACE_LEVEL_LISTENER=OFF           |
|                        |                |                                    |
|TRACE_FILE_LISTENER     | string         | TRACE_FILE_LISTENER=LISTENER       |
|                        |                |                                    |
|TRACE_DIRECTORY_LISTENER| valid directory| TRACE_DIRECTORY_LISTENER=$ORA_SQLNETV2 |
|                        |                |                                    |      
|LOG_FILE_LISTENER       | string         | LOG_FILE_LISTENER=LISTENER         |
|                        |                |                                    |    
|LOG_DIRECTORY_LISTENER  | valid directory| LOG_DIRECTORY_LISTENER=$ORA_ERRORS |
|                        |                |                                    |    
 ------------------------------------------------------------------------------

________________________________________
 

7c. Valid INTCHG.ORA Diagnostic Parameters
==========================================

The following table shows the valid INTCHG.ORA parameters used in logging
and tracing of the Interchange. 


 ----------------------------------------------------------------------------------
|                        |                    |                                    |
| PARAMETERS             | VALUES             | Example (DOS client, UNIX server)  |
|                        |           (default)|                                    |
|------------------------|--------------------|------------------------------------|
|                        |                    |                                    |
|TRACE_LEVEL_CMANAGER    | OFF|USER|ADMIN     | TRACE_LEVEL_CMANAGER=USER          |
|                        |                    |                                    |
|TRACE_FILE_CMANAGER     | string (CMG.TRC)   | TRACE_FILE_CMANAGER=CMANAGER       |
|                        |                    |                                    |
|TRACE_DIRECTORY_CMANAGER| valid directory    | TRACE_DIRECTORY_CMANAGER=C:\ADMIN  |
|                        |                    |                                    |
|LOG_FILE_CMANAGER       | string (INTCHG.LOG)| LOG_FILE_CMANAGER=CMANAGER         |
|                        |                    |                                    |
|LOG_DIRECTORY_CMANAGER  | valid directory    | LOG_DIRECTORY_CMANAGER=C:\ADMIN    |
|                        |                    |                                    |
|LOGGING_CMANAGER        | OFF/ON             | LOGGING_CMANAGER=ON                |
|                        |                    |                                    |
|LOG_INTERVAL_CMANAGER   | Any no of minutes  | LOG_INTERVAL_CMANAGER=60           |
|                        |        (60 minutes)|                                    |
|TRACE_LEVEL_NAVIGATOR   | OFF/USER/ADMIN     | TRACE_LEVEL_NAVIGATOR=ADMIN        |
|                        |                    |                                    | 
|TRACE_FILE_NAVIGATOR    | string    (NAV.TRC)| TRACE_FILE_NAVIGATOR=NAVIGATOR     | 
|                        |                    |                                    |
|TRACE_DIRECTORY_NAVIGATOR| valid directory   | TRACE_DIRECTORY_NAVIGATOR=C:\ADMIN |
|                        |                    |                                    | 
|LOG_FILE_NAVIGATOR      |string (NAVGATR.LOG)| LOG_FILE_NAVIGATOR=NAVIGATOR       |
|                        |                    |                                    | 
|LOG_DIRECTORY_NAVIGATOR | valid directory    | LOG_DIRECTORY_NAVIGATOR=C:\ADMIN   |
|                        |                    |                                    |
|LOGGING_NAVIGATOR       | OFF/ON             | LOGGING_NAVIGATOR=OFF              |
|                        |                    |                                    |
|LOG_LEVEL_NAVIGATOR     | ERRORS|ALL (ERRORS)| LOG_LEVEL_NAVIGATOR=ERRORS         |
|                        |                    |                                    |
 ----------------------------------------------------------------------------------

  Note: The pump component shares the trace parameters of the Connection
        Manager, but it generates a separate trace file with the unchangeable
        default name PMPpid.TRC.

________________________________________
 
7d. Valid NAMES.ORA Diagnostic Parameters
=========================================

The following table shows the valid NAMES.ORA parameters used in logging and
tracing of the Names server. 


 ------------------------------------------------------------------------------
|                        |                |                                    |
| PARAMETERS             | VALUES         | Example (DOS client, UNIX server)  |
|                        |       (default)|                                    |
|------------------------|----------------|------------------------------------|
|                        |                |                                    |
| NAMES.TRACE_LEVEL      | OFF/USER/ADMIN | NAMES.TRACE_LEVEL=ADMIN            |
|                        |                |                                    |
| NAMES.TRACE_FILE       | file(names.trc)| NAMES.TRACE_FILE=nsrv3.trc         |
|                        |                |                                    |
| NAMES.TRACE_DIRECTORY  | directory      | NAMES.TRACE_DIRECTORY=/o7/net/trace|
|                        |                |                                    |
| NAMES.TRACE_UNIQUE     | TRUE/FALSE     | NAMES.TRACE_UNIQUE=TRUE  or ON/OFF |
|                        |                |                                    |  
| NAMES.LOG_FILE         | file(names.log)| NAMES.LOG_FILE=nsrv1.log           |
|                        |                |                                    |  
| NAMES.LOG_DIRECTORY    | directory      | NAMES.LOG_DIRECTORY= /o7/net/log   |
|                        |                |                                    |    
 ------------------------------------------------------------------------------
                     

________________________________________
 
8. Example of a Trace File
===========================
In the following example, the SQLNET.ORA file includes the following line:

                TRACE_LEVEL_CLIENT = ADMIN

The following trace file is the result of a connection attempt that failed
because the hostname is invalid.

The trace output is a combination of debugging aids for Oracle specialists
and English information for network administrators. Several key events can
be seen by analyzing this output from beginning to end:

        (A)  The client describes the outgoing data in the connect
             descriptor used to contact the server.

        (B)  An event is received (connection request).

        (C)  A connection is established over the available transport
             (in this case TCP/IP).

        (D)  The connection is refused by the application, which is the 
             listener. 

        (E)  The trace file shows the problem, as follows: 
                
                -<FATAL?>- ***hostname lookup failure! ***

        (F)  Error 12545 is reported back to the client.

If you look up Error 12545 in Chapter 3 of this Manual, you will find the
following description:

        ORA-12545 TNS:Name lookup failure

        Cause:  A protocol specific ADDRESS parameter cannot be resolved.
        Action: Ensure the ADDRESS parameters have been entered correctly;
                the most likely incorrect value is the node name.


++++++ NOTE: TRACE FILE EXTRACT +++++++

        --- TRACE CONFIGURATION INFORMATION FOLLOWS ---
        New trace stream is "/private1/oracle/trace_admin.trc"
        New trace level is 6
        --- TRACE CONFIGURATION INFORMATION ENDS ---

++++++ NOTE: Loading Parameter files now. +++++++

        --- PARAMETER SOURCE INFORMATION FOLLOWS ---
        Attempted load of system pfile source
        /private1/oracle/network/admin/sqlnet.ora
        Parameter source was not loaded
        Error stack follows:
        NL-00405: cannot open parameter file

        Attempted load of local pfile source /home/ginger/.sqlnet.ora
        Parameter source loaded successfully

        -> PARAMETER TABLE LOAD RESULTS FOLLOW <-
        Some parameters may not have been loaded
        See dump for parameters which loaded OK
        -> PARAMETER TABLE HAS THE FOLLOWING CONTENTS <-
        TRACE_DIRECTORY_CLIENT = /private1/oracle
        trace_level_client = ADMIN
        TRACE_FILE_CLIENT = trace_admin
        --- PARAMETER SOURCE INFORMATION ENDS ---

++++++ NOTE: Reading Parameter files. +++++++

        --- LOG CONFIGURATION INFORMATION FOLLOWS ---
        Attempted open of log stream "/private1/oracle/sqlnet.log"
        Successful stream open
        --- LOG CONFIGURATION INFORMATION ENDS ---

        Unable to get data from navigation file
        tnsnav.ora
        local names file is /home/ginger/.tnsnames.ora
        system names file is /etc/tnsnames.ora
        initial retry timeout for all servers is 500 csecs
        max request retries per server is 2
        default zone is [root]
        Using nncin2a() to build connect descriptor for (possibly remote) database.
        initial load of /home/ginger/.tnsnames.ora
        -<ERROR>- failure, error stack follows
        -<ERROR>- NL-00405: cannot open parameter file
        -<ERROR>- NOTE: FILE CONTAINS ERRORS, SOME NAMES MAY BE MISSING

        initial load of /etc/tnsnames.ora
        -<ERROR>- failure, error stack follows
        -<ERROR>- NL-00427: bad list
        -<ERROR>- NOTE: FILE CONTAINS ERRORS, SOME NAMES MAY BE MISSING

        Inserting IPC address into connect descriptor returned from nncin2a().


++++++ NOTE: Looking for Routing Information. +++++++

        Calling address:
        (DESCRIPTION=(CONNECT_DATA=(SID=trace)(CID=(PROGRAM=)(HOST=lala)
        (USER=ginger)))(ADDRESS_LIST=(ADDRESS=(PROTOCOL=ipc
        (KEY=bad_host))(ADDRESS=(PROTOCOL=tcp)(HOST=lavender)
        (PORT=1521))))
        Getting local community information
        Looking for local addresses setup by nrigla
        No addresses in the preferred address list
        TNSNAV.ORA is not present. No local communities entry.
        Getting local address information
        Address list being processed...
        No community information so all addresses are "local"
        Resolving address to use to call destination or next hop
        Processing address list...
        No community entries so iterate over address list
        This a local community access
        Got routable address information


++++++ NOTE: Calling first address (IPC). +++++++

        Making call with following address information:
        (DESCRIPTION=(EMPTY=0)(ADDRESS=(PROTOCOL=ipc)(KEY=bad_host)))
        Calling with outgoing connect data
        (DESCRIPTION=(CONNECT_DATA=(SID=trace)(CID=(PROGRAM=)(HOST=lala)
        (USER=ginger)))(ADDRESS_LIST=(ADDRESS=(PROTOCOL=tcp)
        (HOST=lavender)(PORT=1521))))
        (DESCRIPTION=(EMPTY=0)(ADDRESS=(PROTOCOL=ipc)(KEY=bad_host)))
        KEY = bad_host
        connecting...
        opening transport...
        -<ERROR>- sd=8, op=1, resnt[0]=511, resnt[1]=2, resnt[2]=0
        -<ERROR>- unable to open transport
        -<ERROR>- nsres: id=0, op=1, ns=12541, ns2=12560; nt[0]=511, nt[1]=2,
        nt[2]=0
        connect attempt failed
        Call failed...
        Call made to destination
        Processing address list so continuing


++++++ NOTE: Looking for Routing Information. +++++++

        Getting local community information
        Looking for local addresses setup by nrigla
        No addresses in the preferred address list
        TNSNAV.ORA is not present. No local communities entry.
        Getting local address information
        Address list being processed...
        No community information so all addresses are "local"
        Resolving address to use to call destination or next hop
        Processing address list...
        No community entries so iterate over address list
        This a local community access
        Got routable address information


++++++ NOTE: Calling second address (TCP/IP). +++++++

        Making call with following address information:
        (DESCRIPTION=(EMPTY=0)(ADDRESS=(PROTOCOL=tcp)
        (HOST=lavender)(PORT=1521)))
        Calling with outgoing connect data
        (DESCRIPTION=(CONNECT_DATA=(SID=trace)(CID=(PROGRAM=)(HOST=lala)
        (USER=ginger)))(ADDRESS_LIST=(ADDRESS=(PROTOCOL=tcp)
        (HOST=lavender) (PORT=1521))))
        (DESCRIPTION=(EMPTY=0)(ADDRESS=(PROTOCOL=tcp)
        (HOST=lavender)(PORT=1521)))
        port resolved to 1521
        looking up IP addr for host: lavender

-<FATAL?>- *** hostname lookup failure! ***

        -<ERROR>- nsres: id=0, op=13, ns=12545, ns2=12560; nt[0]=515, nt[1]=0,
        nt[2]=0
        Call failed...
        Exiting NRICALL with following termination result -1
        -<ERROR>- error from nricall
        -<ERROR>- nr err code: 12206
        -<ERROR>- ns main err code: 12545
        -<ERROR>- ns (2) err code: 12560
        -<ERROR>- nt main err code: 515
        -<ERROR>- nt (2) err code: 0
        -<ERROR>- nt OS err code: 0
        -<ERROR>- Couldn't connect, returning 12545

Most tracing is very similar to this. If you have a basic understanding of
the events the components will perform, you can identify the probable cause
of an error in the text of the trace.


19.63 ORA-01595: error freeing extent (2) of rollback segment (9)):
===================================================================

Note 1:

ORA-01595, 00000, "error freeing extent (%s) of rollback segment (%s))"
Cause:  Some error occurred while freeing inactive rollback segment extents.
Action: Investigate the accompanying error.

Note 2:

Two factors are necessary for this to happen. 

A rollback segment has extended beyond OPTIMAL. 

There are two or more transactions sharing the rollback segment at the time of the shrink. 
What happens is that the first process gets to the end of an extent, notices the need to shrink 
and begins the recursive transaction to do so. But the next transaction blunders past the end 
of that extent before the recursive transaction has been committed. 
The preferred solution is to have sufficient rollback segments to eliminate the sharing of 
rollback segments between processes. Look in V$RESOURCE_LIMIT for the high-water-mark of transactions. 
That is the number of rollback segments you need. The alternative solution is to raise OPTIMAL 
to reduce the risk of the error. 

Note 3:

This error is harmless. You can try (and probably should) set optimal to null 
and
maxextents to unlimited (which might minimize the frequency of these errors).

These errors happen sometimes when oracle is shrinking the rollback segments 
upto the optimal
size. The undo data for shrinking is also kept in the rollback segments. So 
when it attempts to
shrink the same rollback segment where its trying to write the undo, it throws 
this warning.

Its not a failure per se .. since oracle will retry and succeed.


19.64: OUI-10022: oraInventory cannot be used because it is in an invalid state
===============================================================================

Note 1:
-------

If there are other products installed through the OUI, create a copy of = 
the 
oraInst.loc file (depending on the UNIX system, 
possibly in /etc or /var/opt/oracle). 

Modify the inventory_loc parameter to point to a different location for = 
the OUI to create the oraInventory directory. 

Run the installer using the -invPtrLoc parameter 
(eg: runInstaller -invPtrLoc /PATH/oraInst.loc). 

This will retain the existing oraInventory directory and create a new = 
one for use by the new product. 


19.65: Failure to extend rollback segment because of 30036 condition
====================================================================

Not a serious problem. Do some undo tuning.


19.66: ORA-06502: PL/SQL: numeric or value error: character string buffer too small
===================================================================================


Note 1:

Hi,

I am having a strange problem with an ORA-06502 error I am getting and don't understand why.  
I would expect this error to be quite easy to fix, it would suggest that a variable is not large enough 
to cope with a value being assigned to it.  But I'm fairly sure that isn't the problem.  Anyway  I have 
a stored procedure similar to the following:

PROCEDURE myproc(a_user IN VARCHAR2,
                              p_1 OUT <my_table>.<my_first_column>%TYPE,
                              p_2 OUT <my_table>.<my_second_column>%TYPE)
IS

BEGIN

  SELECT my_first_column,
              my_second_column
  INTO    p_1,
             p_2
  FROM my_table
  WHERE user_id = a_user;

END;
/

The procedure is larger than this, but using error_position variables I have tracked it down 
to one SQL statement.  But I don't understand why I'm getting the ORA-06502, because the variables I am selecting 
into are defined as the same types as the columns I'm selecting.  The variable I am selecting into is in fact 
a VARCHAR2(4), but if I replace the sql statement with p_1 := 'AB'; it still fails.  
It succeeds if I do p_1 := 'A';

Has anyone seen this before or anything similar that they might be able to help me with please?

Thanks,

mtae.

-- Answer 1:

It is the code from which you are calling it that has the problem, e.g.

DECLARE
  v1 varchar2(1);
  v2 varchar2(1);
BEGIN
  my_proc ('USER',v1,v2);
END;
/


-- Answer 2

try this:

PROCEDURE myproc(a_user IN VARCHAR2,
                              p_1 OUT varchar2,
                              p_2 OUT varchar2)
IS
   v_1 <my_table>.<my_first_column>%TYPE;
   v_2 <my_table>.<my_second_column>%TYPE;
BEGIN

  SELECT my_first_column,
              my_second_column
  INTO    v_1,
             v_2
  FROM my_table
  WHERE user_id = a_user;
  p_1 := v_1;
  p_2 := v_2;
END;
/

Comment from mtae 
Date: 07/28/2004 04:24AM PDT
 Author Comment  


It was the size of the variable that was being used as the actual parameter being passed in.  
Feeling very silly, but thanks, sometimes you can look at a problem too long.


19.67 ORA-00600: internal error code, arguments: [LibraryCacheNotEmptyOnClose], [], [], [], [], [], [], []
==========================================================================================================

thread:

see this error every time I shutdown a 10gR3 grid control database on 10.2.0.3 RDBMS, even though all opmn and OMS 
processes are down. So far, I have not seen any problems, apart from the annoying shutdown warning.

Note 365103.1 seems to indicate it can be ignored:

Cause
This is due to unpublished Bug 4483084 'ORA-600 [LIBRARYCACHENOTEMPTYONCLOSE]'

This is a bug in that an ORA-600 error is reported when it is found that something is still going
on during shutdown. It does not indicate any damage or a problem in the system.


Solution

At the time of writing, it is likely that the fix will be to report a more meaningful external error, although this 
has not been finalised.

The error is harmless so it is unlikely that this will be backported to 10.2.

The error can be safely ignored as it does not indicate a problem with the database. 


thread:

ORA-00600: internal error code, arguments: [LibraryCacheNotEmptyOnClose], [],[], [], [], [], [], [] 
14-DEC-06 05:15:35 GMT

Hi,

There is no patch available for the bug 4483084.

You need to Ignore this error, as there is absolutely no impact to the database due to this error.

Thanks,
Ram

 
thread:


19.68:
----------------


=====================
20. DATABASE TRACING:
=====================

-- Trace a session:
-- ----------------

Examples:
---------

exec DBMS_SYSTEM.SET_SQL_TRACE_IN_SESSION(sid, serial#, TRUE);
exec DBMS_SYSTEM.SET_SQL_TRACE_IN_SESSION(23, 54071, TRUE);

DBMS_SYSTEM has some mysterious and apparently dangerous procedures in it. Obtaining any information 
about SET_EV and READ_EV was very difficult and promises to be more difficult in the future since 
the package header is no longer exposed in Oracle 8.0.

In spite of Oracle's desire to keep DBMS_SYSTEM "under wraps," I feel strongly that the SET_SQL_TRACE_IN_SESSION 
procedure is far too valuable to be hidden away in obscurity. DBAs and developers need to find out exactly 
what is happening at runtime when a user is experiencing unusual performance problems, 
and the SQL trace facility is one of the best tools available for discovering what the database 
is doing during a user's session. This is especially useful when investigating problems with software packages 
where source code (including SQL) is generally unavailable.

So how can we get access to the one program in DBMS_SYSTEM we want without exposing those other dangerous 
elements to the public? The answer, of course, is to build a package of our own to encapsulate DBMS_SYSTEM 
and expose only what is safe. In the process, we can make DBMS_SYSTEM easier to use as well. 
Those of us who are "keyboard-challenged" (or just plain lazy) would certainly appreciate 
not having to type a procedure name with 36 characters.

I've created a package called trace to cover DBMS_SYSTEM and provide friendlier ways to set SQL tracing on or off 
in other user's sessions. Here is the package specification:

*/ Filename on companion disk: trace.sql */*
CREATE OR REPLACE PACKAGE trace
IS

type rr_rec is record (
     v_sid           number,
     v_serial        number
);

r_rec rr_rec;

   /*
   || Exposes DBMS_SYSTEM.SET_SQL_TRACE_IN_SESSION
   || with easier to call programs
   ||
   || Author:  John Beresniewicz, Savant Corp
   || Created: 07/30/97
   ||
   || Compilation Requirements:
   || SELECT on SYS.V_$SESSION
   || EXECUTE on SYS.DBMS_SYSTEM (or create as SYS)
   || 
   || Execution Requirements:
   || 
   */
   
   /* turn SQL trace on by session id */
   PROCEDURE Xon(sid_IN IN NUMBER);

   /* turn SQL trace off by session id */
   PROCEDURE off(sid_IN IN NUMBER);

   /* turn SQL trace on by username */
   PROCEDURE Xon(user_IN IN VARCHAR2);

   /* turn SQL trace off by username */
   PROCEDURE off(user_IN IN VARCHAR2);

END trace;


The trace package provides ways to turn SQL tracing on or off by session id or username. 
One thing that annoys me about DBMS_SYSTEM.SET_SQL_TRACE_IN_SESSION is having to figure out and pass 
a session serial number into the procedure. There should always be only one session per sid at any time 
connected to the database, so trace takes care of figuring out the appropriate serial number behind the scenes.

Another improvement (in my mind) is replacing the potentially confusing BOOLEAN parameter sql_trace 
with two distinct procedures whose names indicate what is being done. Compare the following commands, 
either of which might be used to turn SQL tracing off in session 15 using SQL*Plus:

SQL> execute trace.off(sid_IN=>15);

SQL> execute SYS.DBMS_SYSTEM.SET_SQL_TRACE_IN_SESSION(15,4567,FALSE);

The first method is both more terse and easier to understand.

The xon and off procedures are both overloaded on the single IN parameter, with versions accepting 
either the numeric session id or a character string for the session username. Allowing session selection 
by username may be easier than by sids. Why? Because sids are transient and must be looked up at runtime, 
whereas username is usually permanently associated with an individual. Beware, though, that multiple sessions 
may be concurrently connected under the same username, and invoking trace.xon by username will turn tracing on 
in all of them.

Let's take a look at the trace package body: 

/* Filename on companion disk: trace.sql */*
CREATE OR REPLACE PACKAGE BODY trace 
IS

   /*
   || Use DBMS_SYSTEM.SET_SQL_TRACE_IN_SESSION to turn tracing on 
   || or off by either session id or username.  Affects all sessions
   || that match non-NULL values of the user and sid parameters.
   */
   PROCEDURE set_trace
      (sqltrace_TF BOOLEAN
      ,user IN VARCHAR2 DEFAULT NULL
      ,sid IN NUMBER DEFAULT NULL)
   IS
   BEGIN
      /*
      || Loop through all sessions that match the sid and user
      || parameters and set trace on in those sessions.  The NVL 
      || function in the cursor WHERE clause allows the single
      || SELECT statement to filter by either sid OR user.
      */
      FOR sid_rec IN 
         (SELECT sid,serial# 
            FROM v$session   S
           WHERE S.type='USER'
             AND S.username = NVL(UPPER(user),S.username)
             AND S.sid      = NVL(sid,S.sid) )
      LOOP
         SYS.DBMS_SYSTEM.SET_SQL_TRACE_IN_SESSION
            (sid_rec.sid, sid_rec.serial#, sqltrace_TF);
      END LOOP;
   END set_trace;

   /*
   || The programs exposed by the package all simply
   || call set_trace with different parameter combinations.
   */
   PROCEDURE Xon(sid_IN IN NUMBER)
   IS
   BEGIN
      set_trace(sqltrace_TF => TRUE, sid => sid_IN);
   END Xon;

   PROCEDURE off(sid_IN IN NUMBER)
   IS
   BEGIN
      set_trace(sqltrace_TF => FALSE, sid => sid_IN);
   END off;

   PROCEDURE Xon(user_IN IN VARCHAR2)
   IS
   BEGIN
      set_trace(sqltrace_TF => TRUE, user => user_IN);
   END Xon;

   PROCEDURE off(user_IN IN VARCHAR2)
   IS
   BEGIN
      set_trace(sqltrace_TF => FALSE, user => user_IN);
   END off;

END trace;


All of the real work done in the trace package is contained in a single private procedure called set_trace. 
The public procedures merely call set_trace with different parameter combinations. This is a structure 
that many packages exhibit: private programs with complex functionality exposed through public programs 
with simpler interfaces. 

One interesting aspect of set_trace is the cursor used to get session identification data from V_$SESSION. 
I wanted to identify sessions for tracing by either session id or username. I could have just defined 
two cursors on V_$SESSION with some conditional logic deciding which cursor to use, but that just did 
not seem clean enough. After all, less code means fewer bugs. The solution I arrived at: 
make use of the NVL function to have a single cursor effectively ignore either the sid or the user parameter 
when either is passed in as NULL. Since set_trace is always called with either sid or user, but not both, 
the NVLs act as a kind of toggle on the cursor. I also supplied both the sid and user parameters to set_trace 
with the default value of NULL so that only the parameter being used for selection needs be passed in the call.

Once set_trace was in place, the publicly visible procedures were trivial. 

A final note about the procedure name "xon": I wanted to use the procedure name "on," but ran afoul of the 
PL/SQL compiler since ON is a reserved word in SQL and PL/SQL.

You can also try:

Alter system set sql_trace=true;
Setting sql_trace=true is a prerequisite when using tk prof. 

-- TRACING a session:
-----------------------

Enable tracing a session to generate a tarce file.
This file can be formatted with TKPROF

6.1.
The following INIT.ORA parameters must be set:   
          #SQL_TRACE = TRUE   
          USER_DUMP_DEST = <preferred directory for the trace output>   
          TIMED_STATISTICS = TRUE   
          MAX_DUMP_FILE_SIZE = <optional, determines trace output file size>   
   

6.2
To enable the SQL trace facility for your current session, enter: 

ALTER SESSION SET SQL_TRACE = TRUE;

or use

DBMS_SUPPORT.START_TRACE_IN_SESSION( SID , SERIAL# );
DBMS_SUPPORT.STOP_TRACE_IN_SESSION( SID , NULL );
DBMS_SYSTEM.SET_SQL_TRACE_IN_SESSION(sid, serial#, TRUE);


To enable the SQL trace facility for your instance, set the value of the 
SQL_TRACE initialization parameter to TRUE. Statistics will be collected for all sessions. 

Once the SQL trace facility has been enabled for the instance, 
you can disable it for an individual session by entering: 
ALTER SESSION SET SQL_TRACE = FALSE;

6.3

Examples of TKPROF

   TKPROF ora53269.trc ora 53269.prf
   SORT = (PRSDSK, EXEDSK, FCHDSK)
   PRINT = 10

To analyze the sql statements:

      1. tkprof  ora_11598.trc  myfilename     
      2. tkprof  ora_11598.trc  /tmp/myfilename    
      3. tkprof  ora_11598.trc  /tmp/myfilename explain=ap/ap  
      4. tkprof  ora_23532.trc  myfilename  explain=po/po  sort=execpu  

7 STATSPACK:
------------

Statspack is a set of SQL, PL/SQL, and SQL*Plus scripts that allow the collection, 
automation, storage, and viewing of perfoRMANce data (see Table 2). 
The installation script (statscre.sql) calls several other scripts in order 
to create the entire Statspack environment. (Note: You should run only the 
installation script, not the base scripts that statscre.sql invokes.) 
All the scripts you need for installing and running Statspack are in the 
ORACLE_HOME/rdbms/admin directory for UNIX platforms and in 
%ORACLE_HOME%\rdbms\admin for Microsoft Windows NT systems. 


The simplest interactive way to take a snapshot is to log in to SQL*Plus 
as the owner perfstat and execute the statspack.snap procedure: 

SQL> connect perfstat/perfstat
SQL> execute statspack.snap;

You can use dbms_job to automate statistics collection. 
The file statsauto.sql contains an example of how to do this, 
scheduling a snapshot every hour. When you create a job by using dbms_job, 
Oracle assigns the job a unique number that you can use for changing or removing the job. 
In order to use dbms_job to schedule snapshots automatically, you must set the job_queue_processes 
initialization parameter to greater than 0 in the init.ora file: 
# Set to enable the job-queue process to start.
# This allows dbms_job to schedule automatic
# statistics collection, using Statspack
job_queue_processes=1


Change the interval of statistics collection by using the dbms_job.interval procedure: 

execute dbms_job.interval(<job number>, 
'SYSDATE+(1/48)'); 

In this case, SYSDATE+(1/48)' causes the statistics to be gathered each 1/48 day-every half hour. 
To stop and remove the automatic-collection job: 

execute dbms_job.remove(<job number>);


Install Statspack:

CREATE USER perfstat identified by perfstat
default tableSpace TOOLS temporary tableSpace TEMP; 

GRANT CREATE SeSSion to PERFSTAT;
GRANT connect to PERFSTAT;
GRANT reSource to PERFSTAT;
GRANT unlimited tableSpace to PERFSTAT;


sqlplus sys
--
-- Install Statspack
-- Enter tablespace names when prompted
--
@?/rdbms/admin/spcreate.sql
--
-- Drop Statspack
-- Reverse of spcreate.sql
--
-- @?/rdbms/admin/spdrop.sql
--

 
The spcreate.sql install script automatically calls 3 other scripts needed:

spcusr - creates the user and grants privileges 
spctab - creates the tables 
spcpkg - creates the package 
Check each of the three output files produced (spcusr.lis, spctab.lis, spcpkg.lis) 
by the installation to ensure no errors were encountered, before continuing on to the next step. 
 
  
 Using Statspack (gathering data): 
sqlplus perfstat
--
-- Take a perfoRMANce snapshot 
--
execute statspack.snap;
--
-- Get a list of snapshots
--
column snap_time format a21
SELECT snap_id,to_char(snap_time,'MON dd, yyyy hh24:mm:ss') snap_time
FROM sp$snapshot;
--

NOTE: To include important timing information set the init.ora parameter timed_statistics to true.

To examine the change in instancewide statistics between two time periods, the SPREPORT.SQL file is run 
while connected to the PERFSTAT user. The SPREPORT.SQL command file is located in the rdbms/admin directory 
of the Oracle home.


You are prompted for the following:

The beginning snapshot ID 
The ending snapshot ID 
The name of the report text file to be created 
  

===========
21. Overig:
===========


20.1 NLS:
=========

Bij Server: 

1. characterset specificatie bij CREATE DATABASE
2. De Sever kan wel meerdere locale in runtime laden uit files gespecificeerd in
   $ export ORA_NLSxx=$ORACLE_HOME/ocommon/nls/admin/data
3. NLS init.ora parameters t.b.v. de user sessions.


If clients using different character sets will access the database, then choose a superset that includes 
all client character sets. Otherwise, character conversions may be necessary at the cost of 
increased overhead and potential data loss.


client:

1. client heeft lokaal een NLS environment setting
2. client connect naar database, een session wordt gevormd, en de NLS enviroment wordt gemaakt
   aan de hAND van de NLS init.ora parameters.
   Is bij de clent de NLS_LANG environment variable gezet, dan communiceerd
   de client dat naar de server session. Hierdoor zijn beide hetzelfde.
   Is er geen NLS_LANG, dan gelden de init.ora NLS parameters voor de server session
3. De session NLS kan worden verANDert via ALTER SESSION. Dit heeft alleen effect
   op de PL/SQL en SQL statements executed op de server


init.ora parameters bij server    : invloed op sessions op server
environment variables bij client  : locale bij client, overrides session
alter session statement           : verANDert de session, overides init.ora
expliciet in SQL statement        : overides alles

Voorbeeld van override:

in init.ora:   NLS_SORT=ENGLISH
bij client:    ALTER SESSION SET NLS_SORT=FRENCH;

Examples:
---------

Example 1:
----------

ALTER SESSION SET nls_date_format = 'dd/mm/yy'
ALTER SESSION SET NLS_DATE_FORMAT = 'DD-MON-YYYY'

ALTER SESSION SET NLS_LANGUAGE='ENGLISH';

ALTER SESSION SET NLS_LANGUAGE='NEDERLANDS';

export NLS_NUMERIC_CHARACTERS=',.'
ALTER SESSION SET NLS_NUMERIC_CHARACTERS=',.'

ALTER SESSION SET NLS_TERRITORY=France;
ALTER SESSION SET NLS_TERRITORY=America;


In SQL functions:

NLS parameters can be used explicitly to hardcode NLS behavior within a SQL function. 
Doing so will override the default values that are set for the session in the initialization parameter file, 
set for the client with environment variables, or set for the session by the ALTER SESSION statement. 
For example:

TO_CHAR(hiredate, 'DD/MON/YYYY', 'nls_date_language = FRENCH')

SELECT last_name FROM employees WHERE hire_date > 
TO_DATE('01-JAN-1999','DD-MON-YYYY', 'NLS_DATE_LANGUAGE = AMERICAN');


Example 2:
----------

SQL> ALTER SESSION SET NLS_NUMERIC_CHARACTERS=',.'
  2  ;

Session altered.

SQL> select * from ap2;

NAME              SAL
---------- ----------
ap              12,53
piet             89,7


SQL> ALTER SESSION SET NLS_NUMERIC_CHARACTERS='.,';

Session altered.

SQL> select * from ap2;

NAME              SAL
---------- ----------
ap              12.53
piet             89.7


priority:
---------

1. expliciet in SQL
2. ALTER SESSION
3. environment variable
4. init.ora


NLS parameters, te zetten via:

NLS_CALENDAR             init.ora, env, alter session
NLS_COMP                 init.ora, env, alter session
NLS_CREDIT                -        env  -
NLS_CURRENCY             init.ora, env, alter session
NLS_DATE_FORMAT          init.ora, env, alter session
NLS_DATE_LANGUAGE        init.ora, env, alter session
NLS_DEBIT                 -        env  -
NLS_ISO_CURRENCY         init.ora, env, alter session
NLS_LANG                  -        env  -
NLS_LANGUAGE             init.ora, -  , alter session
NLS_LIST_SEPERATOR        -        env  -   
NLS_MONETARY_CHARACTERS   -        env  -
NLS_NCHAR                 -        env  -
NLS_NUMMERIC_CHARACTERS  init.ora, env, alter session
NLS_SORT                 init.ora, env, alter session
NLS_TERRITORY            init.ora, -  , alter session
NLS_DUAL_CURRENCY        init.ora, env, alter session


DATA DICTIONARY VIEWS:
----------------------

Applications can check the session, instance, and database NLS parameters by querying 
the following data dictionary views:

NLS_SESSION_PARAMETERS shows the NLS parameters and their values for the session that is querying 
the view. It does not show information about the character set. 

NLS_INSTANCE_PARAMETERS shows the current NLS instance parameters that have been explicitly set 
and the values of the NLS instance parameters. 

NLS_DATABASE_PARAMETERS shows the values of the NLS parameters that were used when the database was created. 


Example:
--------

SQL> desc ap1;
 Name                                      Null?    Type
 ----------------------------------------- -------- -------------
 NAME                                               VARCHAR2(10)
 SAL                                                VARCHAR2(10)

SQL> select * from ap1;

NAME       SAL
---------- ----------
ap         12,53
piet       89,7

SQL> desc ap2;
 Name                                      Null?    Type
 ----------------------------------------- -------- ----------------------------
 NAME                                               VARCHAR2(10)
 SAL                                                NUMBER


SQL> select * from ap2;

NAME              SAL
---------- ----------
ap              12.53
piet             89.7


SQL> insert into ap2
  2  select * from ap1;
select * from ap1
       *
ERROR at line 2:
ORA-01722: invalid number


SQL> ALTER SESSION SET NLS_NUMERIC_CHARACTERS=',.';

Session altered.

SQL> insert into ap2
  2  select * from ap1;

2 rows created.


20.2 More on AL32UTF8, AL16UTF16, UTF8:
=======================================

1) What is the National Character Set?
-------------------------------------- 
The National Character set (NLS_NCHAR_CHARACTERSET) is a character set which is defined 
in addition to the (normal) database character set and  is used for data stored in 
NCHAR, NVARCHAR2 and NCLOB columns. Your current value for the NLS_NCHAR_CHARACTERSET can be found 
with this select:  select value from NLS_DATABASE_PARAMETERS where parameter='NLS_NCHAR_CHARACTERSET'; 
You cannot have more than 2 charactersets defined in Oracle: 
The NLS_CHARACTERSET is used for CHAR, VARCHAR2, CLOB columns; 
The NLS_NCHAR_CHARACTERSET is used for NCHAR, NVARCHAR2, NCLOB columns. 
NLS_NCHAR_CHARACTERSET is defined when the database is created  and specified with the 
CREATE DATABASE command. The NLS_NCHAR_CHARACTERSET defaults to AL16UTF16 if nothing is specified. 

From 9i onwards the NLS_NCHAR_CHARACTERSET can have only 2 values: 
UTF8 or AL16UTF16 who are Unicode charactersets. 
See Note 260893.1 Unicode character sets in the Oracle database for more info about the difference 
between them. Al lot of people think that they *need* to use the NLS_NCHAR_CHARACTERSET 
to have UNICODE support in Oracle, this is not true, NLS_NCHAR_CHARACTERSET (NCHAR, NVARCHAR2) 
is in 9i always Unicode but you can perfectly use "normal" CHAR and VARCHAR2 columns for storing unicode  
in a database who has a AL32UTF8 / UTF8 NLS_CHARACTERSET. 
See also point 15. When trying to use another 
NATIONAL characterset, the CREATE DATABASE command will fail with "ORA-12714 
invalid national character set specified".
The character set identifier is stored with the column definition itself.

2) Which datatypes use the National Character Set?
--------------------------------------------------

 There are three datatypes which can store data in the national character set:

NCHAR     - a fixed-length national character set character string. 
            The length of the column is ALWAYS defined in characters 
            (it always uses CHAR semantics)

NVARCHAR2 - a variable-length national character set character string.  
            The length of the column is ALWAYS defined in characters 
            (it always uses CHAR semantics)

NCLOB     - stores national character set data of up to four gigabytes.     
	    Data is always stored in UCS2 or AL16UTF16, even if the 
	    NLS_NCHAR_CHARACTERSET is UTF8.
	    This has very limited impact, for more info about this please see:
	    Note 258114.1 <http://metalink.oracle.com/metalink/plsql/showdoc?db=NOT&id=258114.1> 
            Possible action for CLOB/NCLOB storage after 10g upgrade
	    and if you use DBMS_LOB.LOADFROMFILE see 
	    Note 267356.1 <http://metalink.oracle.com/metalink/plsql/showdoc?db=NOT&id=267356.1> 
            Character set conversion when using DBMS_LOB

 If you don't know what CHAR semantics is, then please read
 Note 144808.1 <http://metalink.oracle.com/metalink/plsql/showdoc?db=NOT&id=144808.1> Examples and limits of BYTE and CHAR semantics usage

 If you use N-types, DO use the (N'...') syntax when coding it so that Literals are 
 denoted as being in the national character set by prepending letter 'N', for example:

   create table test(a nvarchar2(100));
   insert into test values(N'this is a NLS_NCHAR_CHARACTERSET string');


3) How to know if I use N-type columns?
---------------------------------------

 This select list all tables containing a N-type column:

  select distinct OWNER, TABLE_NAME from DBA_TAB_COLUMNS where DATA_TYPE in ('NCHAR','NVARCHAR2', 'NCLOB');

 On a 9i database created without (!) the "sample" shema you will see these rows (or less) returned:

OWNER                          TABLE_NAME
------------------------------ ------------------------------
SYS                            ALL_REPPRIORITY
SYS                            DBA_FGA_AUDIT_TRAIL
SYS                            DBA_REPPRIORITY
SYS                            DEFLOB
SYS                            STREAMS$_DEF_PROC
SYS                            USER_REPPRIORITY
SYSTEM                         DEF$_LOB
SYSTEM                         DEF$_TEMP$LOB
SYSTEM                         REPCAT$_PRIORITY

9 rows selected.

  These SYS and SYSTEM tables may contain data if you are using:

  * Fine Grained Auditing -> DBA_FGA_AUDIT_TRAIL 
  * Advanced Replication -> ALL_REPPRIORITY, DBA_REPPRIORITY, USER_REPPRIORITY
                            DEF$_TEMP$LOB , DEF$_TEMP$LOB and REPCAT$_PRIORITY
  * Advanced Replication or Deferred Transactions functionality -> DEFLOB
  * Oracle Streams -> STREAMS$_DEF_PROC


 If you do have created the database with the DBCA and included
 the sample shema then you will see typically:

OWNER                         TABLE_NAME                                        
------------------------------------------------------------                    
OE                            BOMBAY_INVENTORY                                  
OE                            PRODUCTS                                          
OE                            PRODUCT_DESCRIPTIONS                              
OE                            SYDNEY_INVENTORY                                  
OE                            TORONTO_INVENTORY                                 
PM                            PRINT_MEDIA                                       
SYS                           ALL_REPPRIORITY                                   
SYS                           DBA_FGA_AUDIT_TRAIL                               
SYS                           DBA_REPPRIORITY                                   
SYS                           DEFLOB                                            
SYS                           STREAMS$_DEF_PROC                                 
SYS                           USER_REPPRIORITY                                  
SYSTEM                        DEF$_LOB                                          
SYSTEM                        DEF$_TEMP$LOB                                     
SYSTEM                        REPCAT$_PRIORITY                                  

15 rows selected.

 The OE and PM tables contain just sample data and can be dropped if needed.

4) Should I worry when I upgrade from 8i or lower to 9i or 10g?
---------------------------------------------------------------

* When upgrading from version 7:

    The National Character Set did not exist in version 7, 
    so you cannot have N-type columns.
    Your database will just have the -default- AL16UTF16 NLS_NCHAR_CHARACTERSET
    declaration and the standard sys/system tables.
    So there is nothing to worry about...

* When upgrading from version 8 and 8i:

  - If you have only the SYS / SYSTEM tables listed in point 3)
    then you don't have USER data using N-type columns.

    Your database will just have the -default- AL16UTF16 NLS_NCHAR_CHARACTERSET
    declaration after the upgrade and the standard sys/system tables.
    So there is nothing to worry about...

    We recommend that you follow this note:
    Note 159657.1 <http://metalink.oracle.com/metalink/plsql/showdoc?db=NOT&id=159657.1> Complete Upgrade Checklist for Manual Upgrades from 8.X / 9.0.1 to Oracle9i

  - If you have more tables then the SYS / SYSTEM tables listed in point 3) 
    (and they are also not the "sample" tables) then there are two possible cases:

  * Again, the next to points are *only* relevant when you DO have n-type USER data *
  
    a) Your current 8 / 8i NLS_NCHAR_CHARACTERSET is in this list:

       JA16SJISFIXED , JA16EUCFIXED , JA16DBCSFIXED , ZHT32TRISFIXED
       KO16KSC5601FIXED , KO16DBCSFIXED , US16TSTFIXED , ZHS16CGB231280FIXED
       ZHS16GBKFIXED , ZHS16DBCSFIXED , ZHT16DBCSFIXED , ZHT16BIG5FIXED
       ZHT32EUCFIXED

       Then the new NLS_NCHAR_CHARACTERSET will be AL16UTF16
       and your data will be converted to AL16UTF16 during the upgrade.

       We recommend that you follow this note:
       Note 159657.1 <http://metalink.oracle.com/metalink/plsql/showdoc?db=NOT&id=159657.1> Complete Upgrade Checklist for Manual Upgrades from 8.X / 9.0.1 to Oracle9i

    b) Your current 8 / 8i NLS_NCHAR_CHARACTERSET is UTF8:

       Then the new NLS_NCHAR_CHARACTERSET will be UTF8
       and your data not be touched during the upgrade.

       We still recommend that you follow this note:
       Note 159657.1 <http://metalink.oracle.com/metalink/plsql/showdoc?db=NOT&id=159657.1> Complete Upgrade Checklist for Manual Upgrades from 8.X / 9.0.1 to Oracle9i

    c) Your current 8 / 8i NLS_NCHAR_CHARACTERSET is NOT in the list of point a) 
       and is NOT UTF8:

       Then your will need to export your data and drop it before upgrading.
       We recommend that you follow this note: 
       Note 159657.1 <http://metalink.oracle.com/metalink/plsql/showdoc?db=NOT&id=159657.1> Complete Upgrade Checklist for Manual Upgrades from 8.X / 9.0.1 to Oracle9i

   For more info about the National Character Set in Oracle8 see Note 62107.1 <http://metalink.oracle.com/metalink/plsql/showdoc?db=NOT&id=62107.1>

5) The NLS_NCHAR_CHARACTERSET is NOT changed to UTF8 or AL16UTF16 after upgrading to 9i.
----------------------------------------------------------------------------------------

 That may happen if you have not set the ORA_NLS33 environment parameter correctly
 to the 9i Oracle_Home during the upgrade.
  Note 77442.1 <http://metalink.oracle.com/metalink/plsql/showdoc?db=NOT&id=77442.1> ORA_NLS (ORA_NLS32, ORA_NLS33, ORA_NLS10) Environment Variables explained.

 We recommend that you follow this note for the upgrade: 
 Note 159657.1 <http://metalink.oracle.com/metalink/plsql/showdoc?db=NOT&id=159657.1> Complete Upgrade Checklist for Manual Upgrades from 8.X / 9.0.1 to Oracle9i

 Strongly consider to restore your backup and do the migration again
 or log a TAR, refer to this note and ask to assign the TAR to the 
 NLS/globalization team. That team can then assist you further.
 However please do note that not all situations can be corrected,
 so you might be asked to do the migration again...

      
6) Can I change the AL16UTF16 to UTF8 / I hear that there are problems with AL16UTF16.
--------------------------------------------------------------------------------------

a) If you do *not* use N-types then there is NO problem at all with AL16UTF16
   because you are simply not using it and we strongly advice you the keep 
   the default AL16UTF16 NLS_NCHAR_CHARACTERSET.

b) If you *do* use N-types then there will be a problem with 8i clients and 
   lower accessing the N-type columns (note that you will NOT have a problem 
   selecting from "normal" non-N-type columns).
   More info about that is found there:
   Note 140014.1 <http://metalink.oracle.com/metalink/plsql/showdoc?db=NOT&id=140014.1> ALERT Oracle8/8i to Oracle9i/10g using New "AL16UTF16" National Character Set
   Note 236231.1 <http://metalink.oracle.com/metalink/plsql/showdoc?db=NOT&id=236231.1> New Character Sets Not Supported For Use With Developer 6i And Older Versions

   If this is a situation you find yourself in we recommend to simply use UTF8 
   as NLS_NCHAR_CHARACTERSET or create a second 9i db using UTF8 as NCHAR and use this as "inbetween" between the 8i and the 9i db
   you can create views in this new database that do a select from the AL16UTF16 9i db
   the data will then be converted from AL16UTF16 to UTF8 in the "inbetween" database and that can 
   be read by oracle 8i

   This is one of the 2 reasons why you should use UTF8 as NLS_NCHAR_CHARACTERSET.
   If you are NOT using N-type columns with pre-9i clients then there is NO reason to go to UTF8.

c) If you want to change to UTF8 because you are using transportable tablespaces from 8i database
   then check if are you using N-types in the 8i database that are included in the tablespaces that you are transporting.

     select distinct OWNER, TABLE_NAME from DBA_TAB_COLUMNS where DATA_TYPE in ('NCHAR','NVARCHAR2', 'NCLOB');

   If yes, then you have the second reason to use UTF8 as as NLS_NCHAR_CHARACTERSET.

   If not, then leave it to AL16UTF16 and log a tar for the solution of the ORA-19736
   and refer to this document.

d) You are in one of the 2 situations where it's really needed to change from AL16UTF16 to UTF8,
   log a tar so that we can assist you.

   provide:
   1) the output from: 

     select distinct OWNER, TABLE_NAME, COLUMN_NAME, CHAR_LENGTH 
     from DBA_TAB_COLUMNS where DATA_TYPE in ('NCHAR','NVARCHAR2', 'NCLOB');

   2) a CSSCAN output

  IMPORTANT:
  Please *DO* install the version 1.2 or higher from TechNet for you version.
  http://technet.oracle.com/software/tech/globalization/content.html
  and use this.

  copy all scripts and executables found in the zip file you downloaded
  to your oracle_home overwriting the old versions.

  Then run csminst.sql using these commands and SQL statements:

    cd $ORACLE_HOME/rdbms/admin 
    set oracle_sid=<your SID>
    sqlplus "sys as sysdba"
    SQL>set TERMOUT ON
    SQL>set ECHO ON
    SQL>spool csminst.log
    SQL> START csminst.sql

  Check the csminst.log for errors.

  Then run CSSCAN

    csscan FULL=Y FROMNCHAR=AL16UTF16 TONCHAR=UTF8 LOG=Ncharcheck CAPTURE=Y

  ( note the usage of fromNchar and toNchar )

  Upload the 3 resulting files and the output of the select while creating the tar

important:

   Do NOT use the N_SWITCH.SQL script, this will corrupt you NCHAR data !!!!!!

7) Is the AL32UTF8 problem the same as the AL16UTF16 / do I need the same patches?
----------------------------------------------------------------------------------
 No, they may look similar but are 2 different issues.

 For information about the possible AL32UTF8 issue please see 
  Note 237593.1 <http://metalink.oracle.com/metalink/plsql/showdoc?db=NOT&id=237593.1> 
  Problems connecting to AL32UTF8 databases from older versions (8i and lower) 
 
8) But I still want <characterset> as NLS_NCHAR_CHARACTERSET, like I had in 8(i)!
---------------------------------------------------------------------------------

 This is simply not possible.

 From 9i onwards the NLS_NCHAR_CHARACTERSET can have only 2 values: UTF8 or AL16UTF16.

 Both UTF8 and AL16UTF16 are unicode charactersets, so they can
 store whatever <characterset> you had as NLS_NCHAR_CHARACTERSET in 8(i).

 If you are not using N-types then keep the default AL16UTF16 or use UTF8,
 it doesn't matter if you don't use the types.

 There is one condition in which this "limitation" can have a undisired affect,
 when you are importing an Oracle8i Transportable Tablespace into Oracle9i
 you can run into a ORA-19736 (as wel with AL16UTF16 as with UTF8).
 In that case log a TAR, refer to this note and ask to assign the TAR to the 
 NLS/globalization team. That team can then assist you to work around this 
 issue.

9) Do i need to set NLS_LANG to AL16UTF16 when creating/using the NLS_NCHAR_CHARACTERSET ?
------------------------------------------------------------------------------------------

As clearly stated in 
 Note 158577.1 <http://metalink.oracle.com/metalink/plsql/showdoc?db=NOT&id=158577.1> 
 NLS_LANG Explained (How does Client-Server Character Conversion Work?)
 point "1.2 What is this NLS_LANG thing anyway?"

* NLS_LANG is used to let Oracle know what characterset you client's OS is USING
  so that Oracle can do (if needed) conversion from the client's characterset to the 
  database characterset.

NLS_LANG is a CLIENT parameter has has no influance on the database side.

10) I try to use AL32UTF8 as NLS_NCHAR_CHARACTERSET but it fails with ORA-12714 
-------------------------------------------------------------------------------

 From 9i onwards the NLS_NCHAR_CHARACTERSET can have only 2 values:
 UTF8 or AL16UTF16.

UTF8 is possible so that you can use it (when needed) for 8.x backwards compatibility.
In all other conditions AL16UTF16 is the preferred and best value.
AL16UTF16 has the same unicode revision as AL23UTF8, 
so there is no need for AL32UTF8 as NLS_NCHAR_CHARACTERSET.

11) I have the message "( possible ncharset conversion )" during import.
------------------------------------------------------------------------

in the import log you see something similar to this:

Import:  Release 9.2.0.4.0 - Production on Fri Jul 9 11:02:42 2004 
 Copyright (c) 1982, 2002, Oracle Corporation.  All rights reserved. 
  
Connected to: Oracle9i Enterprise Edition Release 9.2.0.4.0 - 64bit Production 
JServer Release 9.2.0.4.0 - Production 
 Export file created by EXPORT:V08.01.07 via direct path 
import done in WE8ISO8859P1 character set and AL16UTF16 NCHAR character set 
export server uses WE8ISO8859P1 NCHAR character set (possible ncharset conversion)

This is normal and is not a error condition.

- If you do not use N-types then this is a pure informative message.

- But even in the case that you use N-types like NCHAR or NCLOB then this is not a problem:

  * the database will convert from the "old" NCHAR characterset to the new one automatically.
    (and - unlike the "normal" characterset - the NLS_LANG has no impact on this conversion
     during exp/imp)

  * AL16UTF16 or UTF8 (the only 2 possible values in 9i) are unicode characterset and so
    can store any character... So no data loss is to be expected.

12) Can i use AL16UTF16 as NLS_CHARACTERSET ?
----------------------------------------------

No, AL16UTF16 can only be used as  NLS_NCHAR_CHARACTERSET in 9i and above.
Trying to create a database with a AL16UTF16 NLS_CHARACTERSET will fail.

13) I'm inserting <special character> in a Nchar or Nvarchar2 col but it comes back as ? or ? ...
--------------------------------------------------------------------------------------------------

see point 13 in  Note 227330.1 <http://metalink.oracle.com/metalink/plsql/showdoc?db=NOT&id=227330.1> 
Character Sets & Conversion - Frequently Asked Questions

14) Do i need to change the NLS_NCHAR_CHARACTERSET in 8i to UTF8 BEFORE upgrading to 9i/10g?
--------------------------------------------------------------------------------------------

No, see point 4) in this note.

15) Having a UTF8 NLS_CHARACTERSET db is there a advantage to use AL16UTF16 N-types ?
-------------------------------------------------------------------------------------

there migth be 2 reasons:

a) one possible advantage is storage (disk space).

UTF8 uses 1 up to 3 bytes, AL16UTF16 always 2 bytes.
If you have a lot of non-western data (cyrillic, Chinese, Japanese, Hindi languages..)
then i can be advantageous to use N-types for those columns.
For western data (english, french, spanish, dutch, german, portuguese etc...)
UTF8 will use in most cases less disk space then AL16UTF16.

 Note 260893.1 <http://metalink.oracle.com/metalink/plsql/showdoc?db=NOT&id=260893.1>
 Unicode character sets in the Oracle database

This is not true for (N)CLOB, they are both encoded a internal fixed-width Unicode character set
Note 258114.1 <http://metalink.oracle.com/metalink/plsql/showdoc?db=NOT&id=258114.1> 
Possible action for CLOB/NCLOB storage after 10g upgrade
so they will use the same amount of disk space.

b) other possible advantage is extending the limits of CHAR semantics

   For a single-byte character set encoding, the character and byte length are
   the same. However, multi-byte character set encodings do not correspond to 
   the bytes, making sizing the column more difficult.

   Hence the reason why CHAR semantics was introduced. However, we still have some
   physical underlying byte based limits and development has choosen to allow the full usage
   of the underlying limits. This results in the following table giving the maximum amount 
   of CHARarcters occupying the MAX datalength that can be stored for a cer 
   datatype in 9i and up.

   The MAX colum is the MAXIMUM amount of CHARACTERS that can be stored 
   occupying the MAXIMUM data len seen that UTF8 and AL32UTF8 are VARRYING 
   charactersets this means that a string of X chars can be X to X*3 (or X*4 for AL32) bytes.

   The MIN col is the maximum size that you can *define* and that Oracle can store if all data
   is the MINIMUM datalength (1 byte for AL32UTF8 and UTF8) for that characet.

   N-types (NVARCHAR2, NCHAR) are *always* defined in CHAR semantics, you cannot define them in BYTE.

   all numbers are CHAR definitions

                UTF8 (1 to 3 bytes)  AL32UTF8 (1 to 4 bytes)  AL16UTF16 ( 2 bytes)
                MIN       MAX         MIN      MAX             MIN      MAX
   CHAR         2000      666        2000      500             N/A      N/A

   VARCHAR2     4000     1333        4000     1000             N/A      N/A

   NCHAR        2000      666         N/A      N/A            1000     1000

   NVARCHAR2    4000     1333         N/A      N/A            2000     2000

                (N/A means not possible)

   This means that if you try to store more then 666 characters
   that occupy 3 bytes in UTF8 in a CHAR UTF8 colum you still will get a
   ORA-01401: inserted value too large for column
   (or from 10g onwards: ORA-12899: value too large for column )
   error, even if you have defined the colum as CHAR (2000 CHAR) 
   so here it might be a good idea to define that column as NCHAR
   that will raise the MAX to 1000 char's ...

   Note 144808.1 <http://metalink.oracle.com/metalink/plsql/showdoc?db=NOT&id=144808.1> Examples and limits of BYTE and CHAR semantics usage

Disadvantages using N-types:

* You might have some problems with older clients if using AL16UTF16
  see point 6) b) in this note
* Be sure that you use (AL32)UTF8 as NLS_CHARACTERSET , otherwise you will run into
  point 13 of this note.
* Do not expect a higher *performance* by using AL16UTF16, it might be faster
  on some systems, but that has more to do with I/O then with the database kernel.
* If you use N-types, DO use the (N'...') syntax when coding it so that Literals are 
  denoted as being in the national character set by prepending letter 'N', for example:
   
   create table test(a nvarchar2(100));
   insert into test values(N'this is NLS_NCHAR_CHARACTERSET string');

Normally you will choose to use VARCHAR (using a (AL32)UTF8 NLS_CHARACTERSET)
for simplicity, to avoid confusion and possible other limitations who might be
imposed by your application or programming language to the usage of N-types.

16) I have a message running DBUA (Database Upgrade Assistant) about NCHAR type when upgrading from 8i .


AL16UTF16
The default Oracle character set for the SQL NCHAR data type, which is used for the national character set. 
It encodes Unicode data in the UTF-16 encoding.

AL32UTF8
An Oracle character set for the SQL CHAR data type, which is used for the database character set. 
It encodes Unicode data in the UTF-8 encoding.

Unicode
Unicode is a universal encoded character set that allows you information from any language to be stored 
by using a single character set. Unicode provides a unique code value for every character, regardless 
of the platform, program, or language.

Unicode database
A database whose database character set is UTF-8.

Unicode code point
A 16-bit binary value that can represent a unit of encoded text for processing and interchange. 
Every point between U+0000 and U+FFFF is a code point.

Unicode datatype
A SQL NCHAR datatype (NCHAR, NVARCHAR2, and NCLOB). You can store Unicode characters in columns 
of these datatypes even if the database character set is not Unicode.

unrestricted multilingual support
The ability to use as many languages as desired. A universal character set, such as Unicode, 
helps to provide unrestricted multilingual support because it supports a very large character 
repertoire, encompassing most modern languages of the world.

UTFE
A Unicode 3.0 UTF-8 Oracle database character set with 6-byte supplementary character support. 
It is used only on EBCDIC platforms.

UTF8
The UTF8 Oracle character set encodes characters in one, two, or three bytes. 
It is for ASCII-based platforms. The UTF8 character set supports Unicode 3.0. 
Although specific supplementary characters were not assigned code points in Unicode until 
version 3.1, the code point range was allocated for supplementary characters in Unicode 3.0. 
Supplementary characters are treated as two separate, user-defined characters that occupy 6 bytes.

UTF-8
The 8-bit encoding of Unicode. It is a variable-width encoding. One Unicode character can 
be 1 byte, 2 bytes, 3 bytes, or 4 bytes in UTF-8 encoding. Characters from the European scripts 
are represented in either 1 or 2 bytes. Characters from most Asian scripts are represented in 
3 bytes. Supplementary characters are represented in 4 bytes.

UTF-16
The 16-bit encoding of Unicode. It is an extension of UCS-2 and supports the supplementary characters 
defined in Unicode 3.1 by using a pair of UCS-2 code points. 
One Unicode character can be 2 bytes or 4 bytes in UTF-16 encoding. 
Characters (including ASCII characters) from European scripts and most Asian scripts are 
represented in 2 bytes. Supplementary characters are represented in 4 bytes.

wide character
A fixed-width character format that is useful for extensive text processing because it allows data to be processed in consistent, fixed-width chunks. Wide characters are intended to support internal character processing


Oracle started supporting Unicode based character sets in Oracle7.  
Here is a summary of the Unicode character sets supported in Oracle: 
 
+------------+---------+-----------------+ 
|  Charset   |  RDBMS  | Unicode version |  
+------------+---------+-----------------+ 
| AL24UTFFSS | 7.2-8.1 | 1.1             | 
|            |         |                 | 
| UTF8       | 8.0-10g | 2.1 (8.0-8.1.7) | 
|            |         | 3.0 (8.1.7-10g) | 
|            |         |                 | 
| UTFE       | 8.0-10g | 2.1 (8.0-8.1.7) | 
|            |         | 3.0 (8.1.7-10g) | 
|            |         |                 | 
| AL32UTF8   | 9.0-10g | 3.0 (9.0)       | 
|            |         | 3.1 (9.2)       | 
|            |         | 3.2 (10.1)      | 
|            |         |                 | 
| AL16UTF16  | 9.0-10g | 3.0 (9.0)       | 
|            |         | 3.1 (9.2)       | 
|            |         | 3.2 (10.1)      | 
+------------+---------+-----------------+ 
 
AL24UTFFSS 
AL24UTFFSS was the first Unicode character set supported by Oracle. Is was  
introduced in Oracle 7.2. The AL24UTFFSS encoding scheme was based on the  
Unicode 1.1 standard, which is now obsolete. AL24UTFFSS has been de-supported  
from Oracle9i. The migration path for existing AL24UTFFSS databases is to  
upgrade the database to 8.0 or 8.1, then upgrade the character set to UTF8  
before upgrading the database further to 9i or 10g. 
[NOTE:234381.1] <http://metalink.oracle.com/metalink/plsql/ml2_documents.showDocument?p_id=234381.1&p_database_id=NOT> Changing AL24UTFFSS to UTF8 - AL32UTF8 with ALTER DATABASE CHARACTERSET  
 
UTF8 
UTF8 was the UTF-8 encoded character set in Oracle8 and 8i. It followed the  
Unicode 2.1 standard between Oracle 8.0 and 8.1.6, and was upgraded to Unicode  
version 3.0 for versions 8.1.7, 9i and 10g. To maintain compatibility with  
existing installations this character set will remain at Unicode 3.0 in future  
Oracle releases. Although specific supplementary characters were not assigned  
to Unicode until version 3.1, the allocation for these characters were already  
defined in 3.0. So if supplementary characters are inserted in a UTF8 database, 
it will not corrupt the actual data inside the database. They will be treated as 
2 separate undefined characters, occupying 6 bytes in storage. We recommend that 
customers switch to AL32UTF8 for full supplementary character support. 
 
UTFE 
This is the UTF8 database character set for the EDCDIC platforms. It has the  
same properties as UTF8 on ASCII based platforms. The EBCDIC Unicode  
transformation format is documented in Unicode Technical Report #16 UTF-EBCDIC. 
Which can be found at http://www.unicode.org/unicode/reports/tr16/ 
 
AL32UTF8 
This is the UTF-8 encoded character set introduced in Oracle9i. AL32UTF8 is the 
database character set that supports the latest version (3.2 in 10g) of the  
Unicode standard. It also provides support for the newly defined supplementary 
characters. All supplementary characters are stored as 4 bytes. 
AL32UTF8 was introduced because when UTF8 was designed (in the times of Oracle8) 
there was no concept of supplementary characters, therefore UTF8 has a maximum  
of 3 bytes per character. Changing the design of UTF8 would break backward  
compatibility, so a new character set was introduced. The introduction of  
surrogate pairs should mean that no significant architecture changes are needed  
in future versions of the Unicode standard, so the plan is to keep enhancing  
AL32UTF8 as necessary to support future version of the Unicode standard, for 
example work is now underway to make sure we support Unicode 4.0 in AL32UTF8 
in the release after 10.1. 
 
AL16UTF16 
This is the first UTF-16 encoded character set in Oracle. It was introduced in  
Oracle9i as the default national character set (NLS_NCHAR_CHARACTERSET). 
AL16UTF16 supports the latest version (3.2 in 10g) of the Unicode standard. 
It also provides support for the newly defined supplementary characters. 
All supplementary characters are stored as 4 bytes. 
As with AL32UTF8, the plan is to keep enhancing AL16UTF16 as  
necessary to support future version of the Unicode standard. 
AL16UTF16 cannot be used as a database character set (NLS_CHARACTERSET), 
only as the national character set (NLS_NCHAR_CHARACTERSET). 
The database character set is used to identify and to hold SQL, 
SQL metadata and PL/SQL source code. It must have either single byte 7-bit ASCII 
or single byte EBCDIC as a subset, whichever is native to the deployment  
platform. Therefore, it is not possible to use a fixed-width, multi-byte  
character set (such as AL16UTF16) as the database character set. 
Trying to create a database with AL16UTF16 a characterset in 9i and up will give 
"ORA-12706: THIS CREATE DATABASE CHARACTER SET IS NOT ALLOWED". 
 
Further reading 
--------------- 
All the above information is taken from the white paper "Oracle Unicode database 
support". The paper itself contains much more information and is available from:  
http://otn.oracle.com/tech/globalization/pdf/TWP_Unicode_10gR1.pdf 
 
 
References 
---------- 
The following URLs contain a complete list of hex values and character  
descriptions for every Unicode character: 
Unicode Version 3.2: http://www.unicode.org/Public/3.2-Update/UnicodeData-3.2.0.txt 
Unicode Version 3.1: http://www.unicode.org/Public/3.1-Update/UnicodeData-3.1.0.txt  
Unicode Version 3.0: http://www.unicode.org/Public/3.0-Update/UnicodeData-3.0.0.txt  
Unicode Versions 2.x: http://www.unicode.org/unicode/standard/versions/enumeratedversions.html  
Unicode Version 1.1: http://www.unicode.org/Public/1.1-Update/UnicodeData-1.1.5.txt  
A description of the file format can be found at: 
http://www.unicode.org/Public/UNIDATA/UnicodeData.html 
For a glossarry of unicode terms, see: 
http://www.unicode.org/glossary/ 
 
On above locations you can find the unicode standard, all characters are there  
referenced with their UCS-2 codepoint 


Some further notes:
===================


Note 1:
-------

Thanks for the detailed reply.
>
> >Furthermore the use of NLS columns on a utf8 database (al32utf8 would be
> better by the way) is
> >subject to questions. Correct me if I'm wrong but I believe that most
> >asian character sets can be translated into utf8 without loosing any
> >information. The only exception to this statement is for surrogate pairs
> >and that's the only difference between al32utf8 and utf8 in Oracle.
> >al32utf8 supports surrogate pairs.
>
> I found from Oracle documentation that UTF8 supports surrogate pairs but
> requires 6 bytes for surrogate pairs.

I should have clarified : the jdbc drivers don't support these 6-bytes
utf8 surrogate pairs. That's the reason why we introduced al32utf8 as
one of the native character set (ascii, isolatin1, utf8, al32utf8, ucs2,
al24utffss).

Note 2:
-------

> AL32UTF8
> The AL32UTF8 character set encodes characters in one to three bytes.
> Surrogate
> pairs require four bytes. It is for ASCII-based platforms.
>
> UTF8
> The UTF8 character set encodes characters in one to three bytes. Surrogate
> pairs
> require six bytes. It is for ASCII-based platforms.
>
> AL32UTF8
> ---------
> Advantages
> ----------
> 1. Surrogate pair Unicode characters
> are stored in the standard 4 bytes
> representation, and there is no
> data conversion upon retrieval
> and insertion of those surrogate
> characters. Also, the storage for
> those characters requires less disk
> space than that of the same
> characters encoded in UTF8.
>
> Disadvantages
> -------------
> 1. You cannot specify the length of SQL CHAR
> types in the number of characters (Unicode
> code points) for surrogate characters. For
> example, surrogate characters are treated as
> one code point rather than the standard of two
> code points.
> 2. The binary order for SQL CHAR columns is
> different from that of SQL NCHAR columns
> when the data consists of surrogate pair
> Unicode characters. As a result, CHAR columns
> NCHAR columns do not always have the same
> sort for identical strings.
>
> UTF8
> ----
> Advantages
> ----------
>  1. You can specify the length of SQL
> CHAR types as a number of
> characters.
> 2. The binary order on the SQL CHAR
> columns is always the same as
> that of the SQL NCHAR columns
> when the data consists of the same
> surrogate pair Unicode characters.
> As a result, CHAR columns and
> NCHAR columns have the same
> sort for identical strings.
>
> Disadvantages
> -------------
> 1. Surrogate pair Unicode characters are stored
> as 6 bytes instead of the 4 bytes defined by the
> Unicode standard. As a result, Oracle has to
> convert data for those surrogate characters.
>
> I dont understand the 1st disadvantage of AL32UTF8 encoding !! If surrogate
> characters are considered 1 codepoint, then if I declare a CHAR column as of
> length 40 characters (codepoints) , then I can enter 40 surrogate
> characters.

Note 3:
-------

Universal Character Sets			 
====================			 
Character Set Name  Description	                                 Comments	   Language, Country or Region 
=================  =====================================	 =========	   ========================== 
AL16UTF16	   Unicode 3.1 UTF-16Universal character set     MB, EURO, FIXED   Universal Unicode 
AL32UTF8	   Unicode 3.1 UTF-8 Universal character set	 MB, ASCII, EURO   Universal Unicode 
UTF8	           Unicode 3.0 UTF-8 Universal character set	 MB, ASCII, EURO   Universal Unicode 
                    CESU-8 compliant 
UTFE	           EBCDIC form of Unicode 3.0UTF-8               MB, EURO	   Universal Unicode 
                    Universal character set	 

Note 4:
-------

WE8ISO is a single byte character set.  It has 255 characters.

Korean data requires a multi-byte character set -- each character could be 1, 2, 
3 or more bytes.  It is a variable length encoding scheme.  It has more then, 
way more then 255 characters.  I don't see it fitting into we8iso unless they 
use RAW in which case it is just bytes, not characters at all.

Note 5:
-------

Hi Tom,

We migrated our DB 8.1.7 to 9.2.In 8.1.7 we used UTF8 character set.It remains 
same in 9.2.
We know that Oracle 9.2 doesn't have UTF8 but AL32UTF8.
Can we keep this UTF8 or have to change to AL32UTF8.
If we need to change, may we do it by :
alter database character set AL32UTF8 
or
we must use exp/imp utility?

Regards  


Followup:  
what do you mean -- utf8 is still a valid character set? 
 
Note 6:
-------

Hi Tom,

 We are migrating from oracle 8.1.6 to oracle 9 R2. We have about 14 oracle 
instance. All instances have WE8ISO88591P1
character set. Our company is expanding globally so we are thinking to use 
unicode character set with oracle 9.
I have few questions on this issue.

1) What is the difference between UTF-8,UTF-16
  Is AL32UTF8 and UTF-8 is same character set or they are different? 
  Is UTF-16 and AL16UTF16 is same character set or different ?

2) Which character is super set of all character set?
   If there is any, Does oracle support that character set?

3) Do we have to change our pl/sql procedure if we move to unicode database ?  
The reason for this question is our developer is using ascii character for 
carrage return and line feed like chr(10) and chr(13) and some other ascii 
character .

4) What is impact on CLOB ?

5) What will be the size of the database? Our production DB size is currently 
50GB. What it would be in unicode?

Thanks 


basically utf8 is unicode 3.0 support, utf16 is unicode 3.1
there is no super super "top" set.

Your plsql routines may will have to change -- your data model may well have to 
change.

You'll find that in utf, european characters (except ascii -- 7bit data) all 
take 2 bytes.  That varchar2(80) you have in your database?  It might only hold 
40 characters of eurpean data (or even less of other kinds of data).  It is 80 
bytes (you can use the new 9i syntax varchar2( N char ) -- it'll allocate in 
characters, not bytes).

So, you could find your 80 character description field cannot hold 80 
characters.

You might find that x := a || b; fails -- with string to long in your plsql code 
due to the increased size.

You might find that your string intensive routines run slower (substr(x,1,80) is 
no longer byte 1 .. byte 80 -- Oracle has to look through the string to find 
where characters start and stop -- it is more complex)


chr(10) and chr(13) should work find, they are simple ASCII.


On clob -- same impact as on varchar2, same issues.


Your database could balloon to 200gb, but it will be somewhere between 50 and 
200.  As unicode is a VARYING WIDTH encoding scheme, it is impossible to be 
precise -- it is not a fixed width scheme, so we don't know how big your strings 
will get to be. 
 

21.3 Oracle Rowid's
-------------------

Rowid's: Every table row has an internal rowid which contains information about
object_id, block_id, file#.
Also you can query on the "logical" number rownum.

SQL> SELECT * FROM charlie.xyz;

       ID NAME
--------- --------------------
        1 joop
        2 gerrit

SQL> SELECT rownum FROM charlie.xyz;

   ROWNUM
---------
        1
        2

SQL> SELECT rowid FROM SALES.xyz;

ROWID
------------------
AAAI92AAQAAAFXbAAA
AAAI92AAQAAAFXbAAB

- DBMS_ROWID:

DBMS_ROWID.


Every row has a rowid. Every row has also an associated
logical "rownum" on which you can query.

The rowid is an 18 byte structure that stores the
location of blockid WHERE the row is in.

The old format is the restricted format of Oracle 7
The new format is the extended format of Oracle 8, 8i

format: OOOOOOFFFBBBBBRRRR

000000=object_id
FFF=relative datafile number
BBBBB=block_id
RRR=row in block

The dbms package DBMS_ROWID has several function to convert FROM
the one format to the other.

DBMS_ROWID EXAMPLES:
--------------------

SELECT DBMS_ROWID.ROWID_TO_EXTENDED(ROWID,null,null,0),
       DBMS_ROWID.ROWID_TO_RESTRICTED(ROWID,0), rownum
FROM   CHARLIE.XYZ;

SELECT dbms_rowid.rowid_block_number(rowid)     
FROM   emp 
WHERE  ename = 'KING';

SELECT dbms_rowid.rowid_block_number(rowid)     
FROM   TCMLOGDBUSER.EVENTLOG
WHERE  id = 5;

This example returns the ROWID for a row in the EMP table, extracts the data object number 
FROM the ROWID, using the ROWID_OBJECT function in the DBMS_ROWID package, then displays the object number: 

DECLARE
    object_no   INTEGER;
    row_id      ROWID;
    BEGIN
      SELECT ROWID INTO row_id FROM TCMLOGDBUSER.EVENTLOG
      WHERE id=5;
      object_no := dbms_rowid.rowid_object(row_id);
      dbms_output.put_line('The obj. # is '|| object_no);
  END;
/

PL/SQL procedure successfully completed.

SQL> set serveroutput on
SQL> /
The obj. # is 28954

PL/SQL procedure successfully completed.

SQL> select * from dba_objects where object_id=28954;

OWNER
------------------------------
OBJECT_NAME
-----------------------------------------------------------
SUBOBJECT_NAME                  OBJECT_ID DATA_OBJECT_ID
------------------------------ ---------- --------------
OBJECT_TYPE        CREATED   LAST_DDL_ TIMESTAMP
------------------ --------- --------- -------------------
STATUS  T G S
------- - - -
TCMLOGDBUSER
EVENTLOG
                                    28954          28954
TABLE              05-DEC-04 05-DEC-04 2004-12-05:22:26:10
VALID   N N N


21.4 HETEROGENEOUS SERVICES:
----------------------------

Generic connectivity is intended for low-end data integration solutions  
requiring the ad hoc query capability to connect 
from Oracle8i to non-Oracle  database systems. Generic connectivity is enabled 
by Oracle Heterogeneous  Services, 
allowing you to connect to non-Oracle systems with improved  performance and throughput.  
Generic connectivity is implemented as a Heterogeneous Services ODBC agent. 
An  ODBC agent is included as part of your Oracle8i system.

To access the non-Oracle data store using generic connectivity, the agent works  
with an ODBC driver. Oracle8i provides support for the ODBC driver interface.  
The driver that you use must be on the same machine as the agent. 
The non-Oracle  data stores can reside on the same machine as Oracle8i or a different machine.

Agent processes are usually started when a user session makes its first 
non-Oracle system access through a database link. These connections are made using 
Oracle's remote data access software, Oracle Net Services, which enables both 
client-server and server-server communication. The agent process continues to run 
until the user session is disconnected or the database link is explicitly closed.

Multithreaded agents behave slightly differently. They have to be explicitly started 
and shut down by a database administrator instead of automatically being spawned by Oracle Net Services.

Oracle has Generic Connectivity agents for ODBC and OLE DB that enable you to use 
ODBE and OLEDB drivers to access non-Oracle systems that have an ODBC or an OLE DB interface.


Setup:
------

1. HS datadictonary
-------------------

To install the data dictionary tables and views for Heterogeneous Services, you must run a script 
that creates all the Heterogeneous Services data dictionary tables, views, and packages. 
On most systems the script is called caths.sql and resides in $ORACLE_HOME/rdbms/admin.

Check for the existence of Heterogeneous Services data dictionary views, 

All normal standard preparations for HS needs to be in place in Oracle 9i. 
To recap this here, if you must install HS from scratch:

-	run caths.sql as SYS on Ora9i DB Server.
-	The HS Agent will be installed as part of 9i DB install. 
        It will be started as part of the listener.
-	On NT/2000, The agent works with a OLEDB or ODBC driver to connect 
        to target db
-	The DB Server will connect to the agent through NET8, which is why 
         a tnsnames.ora and a listener.ora entry needs to be setup

You van also check on HS installation. Just check on existence of the 
HS% views in the SYS schema, for example, SYS.HS_FDS_CLASS.

2. tnsnames.ora and listener.ora
--------------------------------

To initiate a connection to the non-Oracle system, the Oracle9i server starts an agent process 
through the Oracle Net listener. For the Oracle9i server to be able to connect to the agent, you must
configure tnsnames.ora and listener.ora


------------------------------------------------------------------------------

tnsnames examples:

Sybase_sales= (DESCRIPTION=
                     (ADDRESS=(PROTOCOL=tcp)
                              (HOST=dlsun206)  -- local machine
                              (PORT=1521)
                     )
                     (CONNECT_DATA = (SERVICE_NAME=SalesDB)
                     )
                     (HS = OK)
              )


TNSNAMES.ORA  hsmsql =   
         (DESCRIPTION =     
            (ADDRESS_LIST =       
              (ADDRESS = (PROTOCOL = tcp)(host=winhost)(port=1521))     ) -- local machine
              (CONNECT_DATA =       
              (SID = msql)            
              )                                   -- needs to match the sid in listener.ora.
              (HS=OK)                 
              )      
         )

TG4MSQL.WORLD =   
        (DESCRIPTION =     
              (ADDRESS = (PROTOCOL = TCP)(HOST = ukp15340)(PORT = 1528)     )     
        (CONNECT_DATA =       (SID = tg4msql)     
        )     
        (HS = OK)   
        ) 
-------------------------------------------------------------------------------

listener.ora examples:

LISTENER =
   (ADDRESS_LIST =
      (ADDRESS= (PROTOCOL=tcp)
                (HOST = dlsun206)
                (PORT = 1521)
      )
  )
... 
SID_LIST_LISTENER = 
  (SID_LIST = 
      (SID_DESC = (SID_NAME=SalesDB)
                  (ORACLE_HOME=/home/oracle/megabase/9.0.1)
                  (PROGRAM=tg4mb80)
                  (ENVS=LD_LIBRARY_PATH=non_oracle_system_lib_directory)
      )
  )

 
LISTENER.ORA  
LISTENER =   (DESCRIPTION_LIST =     
(DESCRIPTION =       
(ADDRESS_LIST =         
(ADDRESS = (PROTOCOL = TCP)(HOST = winhost)(PORT = 1521))       )     )  

SID_LIST_LISTENER =   (SID_LIST =     (SID_DESC =       
(SID_NAME = msql)          <== needs to match the sid in tnsnames.ora       
(ORACLE_HOME = E:\Ora816)       
(PROGRAM = hsodbc)         <== hsodbc is the executable            )   )  


3. create the initialization file:
----------------------------------

Create the Initialization file. Oracle supplies a sample initialization file named     
"inithsodbc.ora" which is stored in the $ORACLE_HOME\hs\admin directory.     
To create an initialization file, copy the appropriate sample file and rename     
the file to initHS_SID.ora. In this example the sid noted in the listener and     
tnsnames is msql so our new initialization file is called initmsql.ora. 

INITMSQL.ORA 
# HS init parameters 
# 
HS_FDS_CONNECT_INFO = msql            <= odbc data_source_name 
HS_FDS_TRACE_LEVEL = 0                <= trace levels 0 - 4  (4 is verbose) 
HS_FDS_TRACE_FILE_NAME = hsmsql.trc   <= trace file name # 
# Environment variables required for the non-Oracle system # 
#set <envvar>=<value> 

HS_FDS_SHAREABLE_NAME
Default value: 
 none 
 
Range of values: 
 not applicable  
 
 
HS_FDS_SHAREABLE_NAME:
Specifies the full path name to the ODBC library. This parameter is required when you are 
using generic connectivity to access data from an ODBC provider on a UNIX machine. 


4. create a database link:
--------------------------

CREATE DATABASE LINK sales
USING `Sybase_sales';


Common Errors:
--------------

AGTCTL.exe = ORA-28591 unable to access parameter file, ORA-28592 agent SID not set
agentctl
hsodbc.exe =
caths.sql

What is the difference between agtctl and lsnrctl dbsnmp_start 


Error:	ORA-28591 Text:	agent control utility: unable to access parameter file  
--------------------------------------------------------------------------- 
Cause:	The agent control utility was unable to access its parameter file.  	
This could be because it could not find its admin directory or because  	
permissions on directory were not correctly set.  Action:	
The agent control utility puts its parameter file in either the  	
directory pointed to by the environment variable AGTCTL_ADMIN or in the  	
directory pointed to by the environment variable TNS_ADMIN. Make sure  	
that at least one of these environment variables is set and that it  	
points to a directory that the agent has access to. 

SET AGTCTL_ADMIN=\OPT\ORACLE\ORA81\HS\ADMIN


Error:	ORA-28592 Text:	agent control utility: agent SID not set  
--------------------------------------------------------------------------- 
Cause:	The agent needs to know the value of the AGENT_SID parameter before it  	
can process any commands. If it does not have a value for AGENT_SID  	
then all commands will fail.  
Action:	Issue the command SET AGENT_SID <value> and then retry the command  	
that failed. 

Error:
------

fix:

Set the HS_FDS_TRACE_FILE_NAME to a filename:
HS_FDS_TRACE_FILE_NAME = test.log

or comment it out:

#HS_FDS_TRACE_FILE_NAME

Error: incorrect characters 
------

Change the HS_LANGUAGE to a correct NLS
like AMERICAN_AMERICA.WE8MSWIN1252


Error: ORA-02085
----------------

HS_FDS_CONNECT_INFO = <SystemDSN_name>
HS_FDS_TRACE_LEVEL = 0 
HS_FDS_TRACE_FILE_NAME = c:\hs.log  
HS_DB_NAME = exhsodbc               -- case sensitive
HS_DB_DOMAIN = ch.oracle.com        -- case sensitive

ERROR: ORA-02085
----------------

SET GLOBAL_NAMES TRUE

ERORR:ORA-02068 and ORA-28511
-----------------------------

LD_LIBRARY_PATH=/u06/home/oracle/support/network/ODBC/lib  
f the LD_LIBRARY_PATH does not contain the path to the ODBC library, a
dd the ODBC library path and start the listener with this environment. 

LD_LIBRARY_PATH=/u01/app/oracle/product/8.1.7/lib; export LD_LIBRARY_PATH


When the listener launches the agent hsodbc, the agent inherits the 
environment from the listener and needs to have the ODBC library path in order 
to access the ODBC shareable file.  The shareable file is defined in 
the init<sid>.ora file located in the $ORACLE_HOME/hs/admin directory.  
HS_FDS_SHAREABLE_NAME=/u06/home/oracle/support/network/ODBC/lib/libodbc.so 
         

21.5 SET EVENTS:
----------------

Note 1:
-------

- What is a database EVENT and how does one set it?

Oracle trace events are useful for debugging the Oracle database server. The following two examples 
are simply to demonstrate syntax. Refer to later notes on this page for an explanation of what these 
particular events do. 
Events can be activated by either adding them to the INIT.ORA parameter file. E.g. 

	 event='1401 trace name errorstack, level 12'
... or, by issuing an ALTER SESSION SET EVENTS command: E.g. 
	 alter session set events '10046 trace name context forever, level 4';

The alter session method only affects the user's current session, whereas changes to the INIT.ORA file will 
affect all sessions once the database has been restarted. 

- What database events can be set?

The following events are frequently used by DBAs and Oracle Support to diagnose problems: 
10046 trace name context forever, level 4
Trace SQL statements and show bind variables in trace output. 

10046 trace name context forever, level 8
This shows wait events in the SQL trace files 

10046 trace name context forever, level 12
This shows both bind variable names and wait events in the SQL trace files 

1401 trace name errorstack, level 12
1401 trace name errorstack, level 4
1401 trace name processstate
Dumps out trace information if an ORA-1401 "inserted value too large for column" error occurs. 
The 1401 can be replaced by any other Oracle Server error code that you want to trace. 

60 trace name errorstack level 10
Show where in the code Oracle gets a deadlock (ORA-60), and may help to diagnose the problem. 

- The following list of events are examples only. They might be version specific, so please call Oracle before using them: 
10210 trace name context forever, level 10
10211 trace name context forever, level 10
10231 trace name context forever, level 10
These events prevent database block corruptions 

10049 trace name context forever, level 2
Memory protect cursor 

10210 trace name context forever, level 2
Data block check 

10211 trace name context forever, level 2
Index block check 

10235 trace name context forever, level 1
Memory heap check 

10262 trace name context forever, level 300
Allow 300 bytes memory leak for connections 


- How can one dump internal database structures?
The following (mostly undocumented) commands can be used to obtain information about internal database structures. 

-- Dump control file contents
alter session set events 'immediate trace name CONTROLF level 10'
/

-- Dump file headers
alter session set events 'immediate trace name FILE_HDRS level 10'
/

-- Dump redo log headers
alter session set events 'immediate trace name REDOHDR level 10'
/

-- Dump the system state
-- NOTE: Take 3 successive SYSTEMSTATE dumps, with 10 minute intervals
alter session set events 'immediate trace name SYSTEMSTATE level 10'
/

-- Dump the process state
alter session set events 'immediate trace name PROCESSSTATE level 10'
/

-- Dump Library Cache details
alter session set events 'immediate trace name library_cache level 10'
/

-- Dump optimizer statistics whenever a SQL statement is parsed (hint: change statement or flush pool)
alter session set events '10053 trace name context forever, level 1'
/

-- Dump a database block (File/ Block must be converted to DBA address)
-- Convert file and block number to a DBA (database block address). Eg:
        variable x varchar2;
        exec :x := dbms_utility.make_data_block_address(1,12);
        print x
alter session set events 'immediate trace name blockdump level 50360894'
/


ALTER SESSION SET EVENTS '1652 trace name errorstack level 1 '; 


or 
alter system set events '1652 trace name errorstack level 1 '; 
alter system set events '1652 trace name errorstack off '; 


Note 2:
-------

Doc ID </help/usaeng/Search/search.html>: 	Note:218105.1	Content Type: 	TEXT/PLAIN	
Subject: 	Introduction to ORACLE Diagnostic EVENTS	Creation Date: 	11-NOV-2002	
Type: 	BULLETIN	Last Revision Date: 	20-NOV-2002	
Status: 	PUBLISHED		
PURPOSE 
------- 
 
This document describes the different types of Oracle EVENT that exist to help  
customers and Oracle Support Services when investigating Oracle RDBMS related 
issues. 
 
This note will only provide information of a general nature. 
 
Specific information on the usage of a given event should be provided by  
Oracle Support Services or the Support related article that is suggesting the  
use of a given event. This note will not provide that level of detail. 
 
SCOPE & APPLICATION 
------------------- 
 
The information held here is of use to Oracle DBAs, developers and Oracle 
Support Services. 
 
Introduction to ORACLE Diagnostic EVENTS 
---------------------------------------- 
 
Before proceeding, please review the following note as it contain some  
important additional information on Events. 
 
[NOTE:75713.1] <ml2_documents.showDocument?p_id=75713.1&p_database_id=NOT> "Important Customer information about 
using Numeric Events" 
 
EVENTS are primarily used to produce additional diagnostic information  
when insufficient information is available to resolve a given problem. 
 
EVENTS are also used to workaround or resolve problems by changing Oracle's 
behaviour or enabling undocumented features. 
 
*WARNING* Do not use an Oracle Diagnostic Event unless directed to do so by  
Oracle Support Services or via a Support related article on Metalink.  
Incorrect usage can result in disruptions to the database services. 
 
Setting EVENTS 
-------------- 
 
There are a number of ways in which events can be set. 
 
How you set an event depends on the nature of the event and the circumstances 
at the time. As stated above, specific information on how you set a given event  
should be provided by Oracle Support Services or the Support related article  
that is suggesting the use of a given event. 
 
Most events can be set using more than one of the following methods : 
 
      o As INIT parameters 
      o In the current session 
      o From another session using a Debug tool 
 
INIT Parameters 
~~~~~~~~~~~~~~~ 
 
Syntax: 
 
EVENT = "<event_name> <action>" 
 
Reference:  
 
[NOTE:160178.1] <ml2_documents.showDocument?p_id=160178.1&p_database_id=NOT> How to set EVENTS in the SPFILE 
 
Current Session 
~~~~~~~~~~~~~~~ 
 
Syntax: 
 
ALTER SESSION SET EVENTS '<event_name> <action>'; 
 
From another Session using a Debug tool 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
 
There are a number of debug tools : 
 
      o ORADEBUG 
      o ORAMBX (VMS only) 
 
  ORADEBUG : 
  ======== 
 
     Syntax: 
 
     Prior to Oracle 9i,  
 
     SVRMGR> oradebug event <event_name> <action> 
 
     Oracle 9i and above : 
 
     SQL> oradebug event <event_name> <action> 
 
     Reference:  
 
[NOTE:29786.1] <ml2_documents.showDocument?p_id=29786.1&p_database_id=NOT>   "SUPTOOL:  ORADEBUG 7.3+ (Server Manager/SQLPLUS Debug Commands)" 
[NOTE:1058210.6] <ml2_documents.showDocument?p_id=1058210.6&p_database_id=NOT> "HOW TO ENABLE SQL TRACE FOR ANOTHER SESSION USING ORADEBUG" 
 
  ORAMBX : on OpenVMS is still available and described under : 
  ====== 
 
  [NOTE:29062.1] <ml2_documents.showDocument?p_id=29062.1&p_database_id=NOT> "SUPTOOL:  ORAMBX (VMS) - Quick Reference" 
 
This note will not enter into additional details on these tools. 
 
EVENT Categories 
---------------- 
 
The most commonly used events fall into one of four categories : 
 
      o Dump diagnostic information on request 
      o Dump diagnostic information when an error occurs 
      o Change Oracle's behaviour 
      o Produce trace diagnostic information as the instance runs 
 
Dump diagnostic information on request (Immediate Dump) 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
 
An immediate dump Event will result in information immediately being  
written to a trace file. 
 
Some common immediate dump Events include :  
 
SYSTEMSTATE, ERRORSTACK, CONTROLF, FILE_HDRS and REDOHDR 
 
These type of events are typically set in the current session. 
 
For example: 
 
ALTER SESSION SET EVENTS 'IMMEDIATE trace name ERRORSTACK level 3'; 
 
Dump Diagnostic information when an error occurs (On-Error Dump) 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
 
The on-error dump Event is similar to the immediate dump Event with the  
difference being that the trace output is only produced when the given 
error occurs. 
 
You can use virtually any standard Oracle error to trigger this type of 
event. 
 
For example, an ORA-942 "table or view does not exist" error does not include 
the name of the problem table or view. When this is not obvious from the 
application (due to its complexity), then it can be difficult to investigate 
the source of the problem. However, an On-Error dump against the 942 error can  
help narrow the search. 
 
These type of events are typically set as INIT parameters. 
 
For example, using the 942 error : 
 
EVENT "942 trace name ERRORSTACK level 3" 
 
Once established, the next time a session encounters an ORA-942 error, a  
trace file will be produced that shows (amongst other information) the current  
SQL statement being executed. This current SQL can now be checked and the  
offending table or view more easily discovered. 
 
Change Oracle's behaviour 
~~~~~~~~~~~~~~~~~~~~~~~~~ 
 
Instance behaviour can be changed or hidden features can be enabled using 
these type of Event 
 
A common event in this category is 10262 which is discussed in  
 
[NOTE:21235.1] <ml2_documents.showDocument?p_id=21235.1&p_database_id=NOT> EVENT: 10262 "Do not check for memory leaks" 
 
These type of events are typically set as INIT parameters. 
 
For example: 
 
EVENT "10262 trace name context forever, level 4000" 
 
Produce trace diagnostic information as the instance runs (Trace Events) 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
 
Trace events produce diagnostic information as processes are running. 
 
They are used to gather additional information about a problem. 
 
A common event in this category is 10046 which is discussed in 
 
[NOTE:21154.1] <ml2_documents.showDocument?p_id=21154.1&p_database_id=NOT> EVENT: 10046 "enable SQL statement tracing (including binds/waits)" 
 
These type of events are typically set as INIT parameters. 
 
For example: 
 
EVENT = "10046 trace name context forever, level 12" 
 
Summary 
------- 
 
EVENT usage and syntax can be very complex and due to the possible impact on  
the database, great care should be taken when dealing with them. 
 
Oracle Support Services (or a Support article) should provide information 
on the appropriate method to be adopted and syntax to be used when  
establishing a given event. 
 
If it is possible to do so, test an event against a development system  
prior to doing the same thing on a production system. 
 
The misuse of events can lead to a loss of service. 
 
RELATED DOCUMENTS 
----------------- 
 
[NOTE:75713.1] <ml2_documents.showDocument?p_id=75713.1&p_database_id=NOT>   Important Customer information about using Numeric Events 
[NOTE:21235.1] <ml2_documents.showDocument?p_id=21235.1&p_database_id=NOT>   EVENT: 10262 "Do not check for memory leaks" 
[NOTE:21154.1] <ml2_documents.showDocument?p_id=21154.1&p_database_id=NOT>   EVENT: 10046 "enable SQL statement tracing (including binds/waits)" 
[NOTE:160178.1] <ml2_documents.showDocument?p_id=160178.1&p_database_id=NOT>  How to set EVENTS in the SPFILE 
[NOTE:1058210.6] <ml2_documents.showDocument?p_id=1058210.6&p_database_id=NOT> HOW TO ENABLE SQL TRACE FOR ANOTHER SESSION USING ORADEBUG 
[NOTE:29786.1] <ml2_documents.showDocument?p_id=29786.1&p_database_id=NOT>   SUPTOOL:  ORADEBUG 7.3+ (Server Manager/SQLPLUS Debug Commands) 
[NOTE:29062.1] <ml2_documents.showDocument?p_id=29062.1&p_database_id=NOT>   SUPTOOL:  ORAMBX (VMS) - Quick Reference 


======================
22. DBA% and v$ views
======================

NLS:
----

VIEW_NAME                      OWNER
------------------------------ ------------------------------
NLS_DATABASE_PARAMETERS        SYS
NLS_INSTANCE_PARAMETERS        SYS
NLS_SESSION_PARAMETERS         SYS


DBA:
----

VIEW_NAME                      OWNER
------------------------------ ------------------------------
DBA_2PC_NEIGHBORS              SYS
DBA_2PC_PENDING                SYS
DBA_ALL_TABLES                 SYS
DBA_ANALYZE_OBJECTS            SYS
DBA_ASSOCIATIONS               SYS
DBA_AUDIT_EXISTS               SYS
DBA_AUDIT_OBJECT               SYS
DBA_AUDIT_SESSION              SYS
DBA_AUDIT_STATEMENT            SYS
DBA_AUDIT_TRAIL                SYS
DBA_CACHEABLE_OBJECTS          SYS
DBA_CACHEABLE_TABLES           SYS
DBA_CACHEABLE_TABLES_BASE      SYS
DBA_CATALOG                    SYS
DBA_CLUSTERS                   SYS
DBA_CLUSTER_HASH_EXPRESSIONS   SYS
DBA_CLU_COLUMNS                SYS
DBA_COLL_TYPES                 SYS
DBA_COL_COMMENTS               SYS
DBA_COL_PRIVS                  SYS
DBA_CONSTRAINTS                SYS
DBA_CONS_COLUMNS               SYS
DBA_CONTEXT                    SYS
DBA_DATA_FILES                 SYS
DBA_DB_LINKS                   SYS
DBA_DEPENDENCIES               SYS
DBA_DIMENSIONS                 SYS
DBA_DIM_ATTRIBUTES             SYS
DBA_DIM_CHILD_OF               SYS
DBA_DIM_HIERARCHIES            SYS
DBA_DIM_JOIN_KEY               SYS
DBA_DIM_LEVELS                 SYS
DBA_DIM_LEVEL_KEY              SYS
DBA_DIRECTORIES                SYS
DBA_DMT_FREE_SPACE             SYS
DBA_DMT_USED_EXTENTS           SYS
DBA_ERRORS                     SYS
DBA_EXP_FILES                  SYS
DBA_EXP_OBJECTS                SYS
DBA_EXP_VERSION                SYS
DBA_EXTENTS                    SYS
DBA_FREE_SPACE                 SYS
DBA_FREE_SPACE_COALESCED       SYS
DBA_FREE_SPACE_COALESCED_TMP1  SYS
DBA_FREE_SPACE_COALESCED_TMP2  SYS
DBA_FREE_SPACE_COALESCED_TMP3  SYS
DBA_IAS_CONSTRAINT_EXP         SYS
DBA_IAS_GEN_STMTS              SYS
DBA_IAS_GEN_STMTS_EXP          SYS
DBA_IAS_OBJECTS                SYS
DBA_IAS_OBJECTS_BASE           SYS
DBA_IAS_OBJECTS_EXP            SYS
DBA_IAS_POSTGEN_STMTS          SYS
DBA_IAS_PREGEN_STMTS           SYS
DBA_IAS_SITES                  SYS
DBA_IAS_TEMPLATES              SYS
DBA_INDEXES                    SYS
DBA_INDEXTYPES                 SYS
DBA_INDEXTYPE_OPERATORS        SYS
DBA_IND_COLUMNS                SYS
DBA_IND_EXPRESSIONS            SYS
DBA_IND_PARTITIONS             SYS
DBA_IND_SUBPARTITIONS          SYS
DBA_INTERNAL_TRIGGERS          SYS
DBA_JAVA_POLICY                SYS
DBA_JOBS                       SYS
DBA_JOBS_RUNNING               SYS
DBA_LIBRARIES                  SYS
DBA_LMT_FREE_SPACE             SYS
DBA_LMT_USED_EXTENTS           SYS
DBA_LOBS                       SYS
DBA_LOB_PARTITIONS             SYS
DBA_LOB_SUBPARTITIONS          SYS
DBA_METHOD_PARAMS              SYS
DBA_METHOD_RESULTS             SYS
DBA_MVIEWS                     SYS
DBA_MVIEW_AGGREGATES           SYS
DBA_MVIEW_ANALYSIS             SYS
DBA_MVIEW_DETAIL_RELATIONS     SYS
DBA_MVIEW_JOINS                SYS
DBA_MVIEW_KEYS                 SYS
DBA_NESTED_TABLES              SYS
DBA_OBJECTS                    SYS
DBA_OBJECT_SIZE                SYS
DBA_OBJECT_TABLES              SYS
DBA_OBJ_AUDIT_OPTS             SYS
DBA_OPANCILLARY                SYS
DBA_OPARGUMENTS                SYS
DBA_OPBINDINGS                 SYS
DBA_OPERATORS                  SYS
DBA_OUTLINES                   SYS
DBA_OUTLINE_HINTS              SYS
DBA_PARTIAL_DROP_TABS          SYS
DBA_PART_COL_STATISTICS        SYS
DBA_PART_HISTOGRAMS            SYS
DBA_PART_INDEXES               SYS
DBA_PART_KEY_COLUMNS           SYS
DBA_PART_LOBS                  SYS
DBA_PART_TABLES                SYS
DBA_PENDING_TRANSACTIONS       SYS
DBA_POLICIES                   SYS
DBA_PRIV_AUDIT_OPTS            SYS
DBA_PROFILES                   SYS
DBA_QUEUES                     SYS
DBA_QUEUE_SCHEDULES            SYS
DBA_QUEUE_TABLES               SYS
DBA_RCHILD                     SYS
DBA_REFRESH                    SYS
DBA_REFRESH_CHILDREN           SYS
DBA_REFS                       SYS
DBA_REGISTERED_SNAPSHOTS       SYS
DBA_REGISTERED_SNAPSHOT_GROUPS SYS
DBA_REPAUDIT_ATTRIBUTE         SYS
DBA_REPAUDIT_COLUMN            SYS
DBA_REPCAT                     SYS
DBA_REPCATLOG                  SYS
DBA_REPCAT_REFRESH_TEMPLATES   SYS
DBA_REPCAT_TEMPLATE_OBJECTS    SYS
DBA_REPCAT_TEMPLATE_PARMS      SYS
DBA_REPCAT_TEMPLATE_SITES      SYS
DBA_REPCAT_USER_AUTHORIZATIONS SYS
DBA_REPCAT_USER_PARM_VALUES    SYS
DBA_REPCOLUMN                  SYS
DBA_REPCOLUMN_GROUP            SYS
DBA_REPCONFLICT                SYS
DBA_REPDDL                     SYS
DBA_REPFLAVORS                 SYS
DBA_REPFLAVOR_COLUMNS          SYS
DBA_REPFLAVOR_OBJECTS          SYS
DBA_REPGENERATED               SYS
DBA_REPGENOBJECTS              SYS
DBA_REPGROUP                   SYS
DBA_REPGROUPED_COLUMN          SYS
DBA_REPGROUP_PRIVILEGES        SYS
DBA_REPKEY_COLUMNS             SYS
DBA_REPOBJECT                  SYS
DBA_REPPARAMETER_COLUMN        SYS
DBA_REPPRIORITY                SYS
DBA_REPPRIORITY_GROUP          SYS
DBA_REPPROP                    SYS
DBA_REPRESOLUTION              SYS
DBA_REPRESOLUTION_METHOD       SYS
DBA_REPRESOLUTION_STATISTICS   SYS
DBA_REPRESOL_STATS_CONTROL     SYS
DBA_REPSCHEMA                  SYS
DBA_REPSITES                   SYS
DBA_RGROUP                     SYS
DBA_ROLES                      SYS
DBA_ROLE_PRIVS                 SYS
DBA_ROLLBACK_SEGS              SYS
DBA_RSRC_CONSUMER_GROUPS       SYS
DBA_RSRC_CONSUMER_GROUP_PRIVS  SYS
DBA_RSRC_MANAGER_SYSTEM_PRIVS  SYS
DBA_RSRC_PLANS                 SYS
DBA_RSRC_PLAN_DIRECTIVES       SYS
DBA_RULESETS                   SYS
DBA_SEGMENTS                   SYS
DBA_SEQUENCES                  SYS
DBA_SNAPSHOTS                  SYS
DBA_SNAPSHOT_LOGS              SYS
DBA_SNAPSHOT_LOG_FILTER_COLS   SYS
DBA_SNAPSHOT_REFRESH_TIMES     SYS
DBA_SOURCE                     SYS
DBA_STMT_AUDIT_OPTS            SYS
DBA_SUBPART_COL_STATISTICS     SYS
DBA_SUBPART_HISTOGRAMS         SYS
DBA_SUBPART_KEY_COLUMNS        SYS
DBA_SUMMARIES                  SYS
DBA_SUMMARY_AGGREGATES         SYS
DBA_SUMMARY_DETAIL_TABLES      SYS
DBA_SUMMARY_JOINS              SYS
DBA_SUMMARY_KEYS               SYS
DBA_SYNONYMS                   SYS
DBA_SYS_PRIVS                  SYS
DBA_TABLES                     SYS
DBA_TABLESPACES                SYS
DBA_TAB_COLUMNS                SYS
DBA_TAB_COL_STATISTICS         SYS
DBA_TAB_COMMENTS               SYS
DBA_TAB_HISTOGRAMS             SYS
DBA_TAB_MODIFICATIONS          SYS
DBA_TAB_PARTITIONS             SYS
DBA_TAB_PRIVS                  SYS
DBA_TAB_SUBPARTITIONS          SYS
DBA_TEMP_FILES                 SYS
DBA_TRIGGERS                   SYS
DBA_TRIGGER_COLS               SYS
DBA_TS_QUOTAS                  SYS
DBA_TYPES                      SYS
DBA_TYPE_ATTRS                 SYS
DBA_TYPE_METHODS               SYS
DBA_UNUSED_COL_TABS            SYS
DBA_UPDATABLE_COLUMNS          SYS
DBA_USERS                      SYS
DBA_USTATS                     SYS
DBA_VARRAYS                    SYS
DBA_VIEWS                      SYS

V_$:
----

 VIEW_NAME                      OWNER
------------------------------ ------------------------------
V_$ACCESS                      SYS
V_$ACTIVE_INSTANCES            SYS
V_$AQ                          SYS
V_$AQ1                         SYS
V_$ARCHIVE                     SYS
V_$ARCHIVED_LOG                SYS
V_$ARCHIVE_DEST                SYS
V_$ARCHIVE_PROCESSES           SYS
V_$BACKUP                      SYS
V_$BACKUP_ASYNC_IO             SYS
V_$BACKUP_CORRUPTION           SYS
V_$BACKUP_DATAFILE             SYS
V_$BACKUP_DEVICE               SYS
V_$BACKUP_PIECE                SYS
V_$BACKUP_REDOLOG              SYS
V_$BACKUP_SET                  SYS
V_$BACKUP_SYNC_IO              SYS
V_$BGPROCESS                   SYS
V_$BH                          SYS
V_$BSP                         SYS
V_$BUFFER_POOL                 SYS
V_$BUFFER_POOL_STATISTICS      SYS
V_$CIRCUIT                     SYS
V_$CLASS_PING                  SYS
V_$COMPATIBILITY               SYS
V_$COMPATSEG                   SYS
V_$CONTEXT                     SYS
V_$CONTROLFILE                 SYS
V_$CONTROLFILE_RECORD_SECTION  SYS
V_$COPY_CORRUPTION             SYS
V_$DATABASE                    SYS
V_$DATAFILE                    SYS
V_$DATAFILE_COPY               SYS
V_$DATAFILE_HEADER             SYS
V_$DBFILE                      SYS
V_$DBLINK                      SYS
V_$DB_CACHE_ADVICE             SYS
V_$DB_OBJECT_CACHE             SYS
V_$DB_PIPES                    SYS
V_$DELETED_OBJECT              SYS
V_$DISPATCHER                  SYS
V_$DISPATCHER_RATE             SYS
V_$DLM_ALL_LOCKS               SYS
V_$DLM_CONVERT_LOCAL           SYS
V_$DLM_CONVERT_REMOTE          SYS
V_$DLM_LATCH                   SYS
V_$DLM_LOCKS                   SYS
V_$DLM_MISC                    SYS
V_$DLM_RESS                    SYS
V_$DLM_TRAFFIC_CONTROLLER      SYS
V_$ENABLEDPRIVS                SYS
V_$ENQUEUE_LOCK                SYS
V_$EVENT_NAME                  SYS
V_$EXECUTION                   SYS
V_$FAST_START_SERVERS          SYS
V_$FAST_START_TRANSACTIONS     SYS
V_$FILESTAT                    SYS
V_$FILE_PING                   SYS
V_$FIXED_TABLE                 SYS
V_$FIXED_VIEW_DEFINITION       SYS
V_$GLOBAL_BLOCKED_LOCKS        SYS
V_$GLOBAL_TRANSACTION          SYS
V_$HS_AGENT                    SYS
V_$HS_PARAMETER                SYS
V_$HS_SESSION                  SYS
V_$INDEXED_FIXED_COLUMN        SYS
V_$INSTANCE                    SYS
V_$INSTANCE_RECOVERY           SYS
V_$KCCDI                       SYS
V_$KCCFE                       SYS
V_$LATCH                       SYS
V_$LATCHHOLDER                 SYS
V_$LATCHNAME                   SYS
V_$LATCH_CHILDREN              SYS
V_$LATCH_MISSES                SYS
V_$LATCH_PARENT                SYS
V_$LIBRARYCACHE                SYS
V_$LICENSE                     SYS
V_$LOADCSTAT                   SYS
V_$LOADISTAT                   SYS
V_$LOADPSTAT                   SYS
V_$LOADTSTAT                   SYS
V_$LOCK                        SYS
V_$LOCKED_OBJECT               SYS
V_$LOCKS_WITH_COLLISIONS       SYS
V_$LOCK_ACTIVITY               SYS
V_$LOCK_ELEMENT                SYS
V_$LOG                         SYS
V_$LOGFILE                     SYS
V_$LOGHIST                     SYS
V_$LOGMNR_CONTENTS             SYS
V_$LOGMNR_DICTIONARY           SYS
V_$LOGMNR_LOGS                 SYS
V_$LOGMNR_PARAMETERS           SYS
V_$LOG_HISTORY                 SYS
V_$MAX_ACTIVE_SESS_TARGET_MTH  SYS
V_$MLS_PARAMETERS              SYS
V_$MTS                         SYS
V_$MYSTAT                      SYS
V_$NLS_PARAMETERS              SYS
V_$NLS_VALID_VALUES            SYS
V_$OBJECT_DEPENDENCY           SYS
V_$OBSOLETE_PARAMETER          SYS
V_$OFFLINE_RANGE               SYS
V_$OPEN_CURSOR                 SYS
V_$OPTION                      SYS
V_$PARALLEL_DEGREE_LIMIT_MTH   SYS
V_$PARAMETER                   SYS
V_$PARAMETER2                  SYS
V_$PQ_SESSTAT                  SYS
V_$PQ_SLAVE                    SYS
V_$PQ_SYSSTAT                  SYS
V_$PQ_TQSTAT                   SYS
V_$PROCESS                     SYS
V_$PROXY_ARCHIVEDLOG           SYS
V_$PROXY_DATAFILE              SYS
V_$PWFILE_USERS                SYS
V_$PX_PROCESS                  SYS
V_$PX_PROCESS_SYSSTAT          SYS
V_$PX_SESSION                  SYS
V_$PX_SESSTAT                  SYS
V_$QUEUE                       SYS
V_$RECOVERY_FILE_STATUS        SYS
V_$RECOVERY_LOG                SYS
V_$RECOVERY_PROGRESS           SYS
V_$RECOVERY_STATUS             SYS
V_$RECOVER_FILE                SYS
V_$REQDIST                     SYS
V_$RESERVED_WORDS              SYS
V_$RESOURCE                    SYS
V_$RESOURCE_LIMIT              SYS
V_$ROLLNAME                    SYS
V_$ROLLSTAT                    SYS
V_$ROWCACHE                    SYS
V_$ROWCACHE_PARENT             SYS
V_$ROWCACHE_SUBORDINATE        SYS
V_$RSRC_CONSUMER_GROUP         SYS
V_$RSRC_CONSUMER_GROUP_CPU_MTH SYS
V_$RSRC_PLAN                   SYS
V_$RSRC_PLAN_CPU_MTH           SYS
V_$SESSION                     SYS
V_$SESSION_CONNECT_INFO        SYS
V_$SESSION_CURSOR_CACHE        SYS
V_$SESSION_EVENT               SYS
V_$SESSION_LONGOPS             SYS
V_$SESSION_OBJECT_CACHE        SYS
V_$SESSION_WAIT                SYS
V_$SESSTAT                     SYS
V_$SESS_IO                     SYS
V_$SGA                         SYS
V_$SGASTAT                     SYS
V_$SHARED_POOL_RESERVED        SYS
V_$SHARED_SERVER               SYS
V_$SORT_SEGMENT                SYS
V_$SORT_USAGE                  SYS
V_$SQL                         SYS
V_$SQLAREA                     SYS
V_$SQLTEXT                     SYS
V_$SQLTEXT_WITH_NEWLINES       SYS
V_$SQL_BIND_DATA               SYS
V_$SQL_BIND_METADATA           SYS
V_$SQL_CURSOR                  SYS
V_$SQL_SHARED_CURSOR           SYS
V_$SQL_SHARED_MEMORY           SYS
V_$STATNAME                    SYS
V_$SUBCACHE                    SYS
V_$SYSSTAT                     SYS
V_$SYSTEM_CURSOR_CACHE         SYS
V_$SYSTEM_EVENT                SYS
V_$SYSTEM_PARAMETER            SYS
V_$SYSTEM_PARAMETER2           SYS
V_$TABLESPACE                  SYS
V_$TARGETRBA                   SYS
V_$TEMPFILE                    SYS
V_$TEMPORARY_LOBS              SYS
V_$TEMPSTAT                    SYS
V_$TEMP_EXTENT_MAP             SYS
V_$TEMP_EXTENT_POOL            SYS
V_$TEMP_PING                   SYS
V_$TEMP_SPACE_HEADER           SYS
V_$THREAD                      SYS
V_$TIMER                       SYS
V_$TRANSACTION                 SYS
V_$TRANSACTION_ENQUEUE         SYS
V_$TYPE_SIZE                   SYS
V_$VERSION                     SYS
V_$WAITSTAT                    SYS
V_$_LOCK                       SYS


==========
23 TUNING:
==========


1. init.ora settings
--------------------

background_dump_dest = /var/opt/oracle/SALES/bdump
control_files = ( /oradata/arc/control/ctrl1SALES.ctl
, /oradata/temp/control/ctrl2SALES.ctl
, /oradata/rbs/control/ctrl3SALES.ctl)

db_block_size = 16384
db_name = SALES
db_block_buffers = 17500
db_block_checkpoint_batch = 16
db_files = 255
db_file_multiblock_read_count = 10
license_max_users = 170
#core_dump_dest = /var/opt/oracle/SALES/cdump
core_dump_dest = /oradata/rbs/cdump
distributed_transactions = 40
dml_locks = 1000
job_queue_processes = 2
log_archive_buffers = 20
log_archive_buffer_size = 256
log_archive_dest = /oradata/arc
log_archive_format = arcSALES_%s.arc
log_archive_start = true
log_buffer = 163840
log_checkpoint_interval = 1250
log_checkpoint_timeout = 1800
log_simultaneous_copies = 4
max_dump_file_size = 100240
max_enabled_roles = 50
oracle_trace_enable = true
open_cursors = 2000
open_links = 20
processes = 200
remote_os_authent = true
rollback_segments = (r1, r2, r3, rbig,rbig2)
sequence_cache_entries = 30
sequence_cache_hash_buckets = 23
shared_pool_size = 750M
sort_area_retained_size = 15728640
sort_area_size = 15728640
sql_trace = false
timed_statistics = true
resource_limit = true
user_dump_dest = /var/opt/oracle/SALES/udump
utl_file_dir = /var/opt/oracle/utl
utl_file_dir = /var/opt/oracle/utl/frontend


SORT_AREA_SIZE                	= 65536         (per PGA, max sort area)
SORT_AREA_RETAINED_SIZE       	= 65536         (size after sort)
PROCESSES                     	= 100           (alle processes)
DB_BLOCK_SIZE                 	= 8192
DB_BLOCK_BUFFERS              	= 3400          (DB_CACHE_SIZE in Oracle 9i)
SHARED_POOL_SIZE              	= 52428800
LOG_BUFFER                    	= 26215400 
                                   4194304
LARGE_POOL_SIZE                 =
DBWR_IO_SLAVES                                  (DB_WRITER_PROCESSES)
DB_WRITER_PROCESSES             = 2
LGWR_IO_SLAVES=
DB_FILE_MULTIBLOCK_READ_COUNT	=16		(minimize io during table scans,
                                                it specifies max number of blocks in one
                                                io operation during sequential read)
BUFFER_POOL_RECYCLE             =
BUFFER_POOL_KEEP                =
TIMED_STATISTICES		=TRUE           (statistics related to time are collected or not)           
OPTIMIZER_MODE			=RULE, CHOOSE, FIRST_ROWS, ALL_ROWS

PARALLEL_MIN_SERVERS            = 2		(voor Parallel Query, en parallel recovery)
PARALLEL_MAX_SERVERS            = 4

RECOVERY_PARALLELISM            = 2		(set parallel recovery op database niveau)


2. UTLBSTAT and UTLESTAT
------------------------

- if wanted change default tablespace of SYS to TOOLS
- set timed_statistics=true
- in $ORACLE_HOME/rdbms/admin you find utlbstat.sql and utlestat.sql

to create perfoRMANce table and insert baseline: run utlbstat
let the database run for some time
to gather statistics, run utlestat which drop tables and generate report.txt


3. STATSPACK:
-------------

Available as of 8.1.6

installation:

- connect internal
- @$ORACLE_HOME/rdbms/admin/statscre.sql

It will create user PERFSTAT who ownes the new statistics tables
You will be prompted for TEMP and DEFAULT tablespaces

Gather statistices:

- connect perfstat/perfstat
- execute statspack.snap

Or use DBMS_JOB to schedule the generation of snapshots

Create report:

- connect perfstat/perfstat
- @ORACLE_HOME/rdbms/admin/statsrep.sql

This will ask for beginning snapshot id and ending snapshot id.
Then you can enter the filename for the report.


4. QUERIES:
-----------

-- 4.1 HIT RATIO buffercache

SELECT  (1-(pr.value/(dbg.value+cg.value)))*100
FROM    v$sysstat pr, v$sysstat dbg, v$sysstat cg
WHERE   pr.name = 'physical reads'
AND     dbg.name = 'db block gets'
AND     cg.name = 'consistent gets';

-- 4.2 redo noWait ratio

SELECT  (req.value*5000)/entries.value
FROM    v$sysstat req, v$sysstat entries
WHERE   req.name ='redo log space requests'
AND     entries.name='redo entries';

-- 4.3 Library cache and shared pool

Overview memory:

SELECT * FROM V$SGA;

Free memory shared pool:

SELECT * FROM v$sgastat 
WHERE name = 'free memory';

How often an object has to be reloaded into the cache once it has been loaded

SELECT sum(pins) Executions, sum(reloads) Misses, sum(reloads)/sum(pins) Ratio
FROM   v$librarycache;

SELECT gethits,gets,gethitratio FROM v$librarycache
WHERE  namespace = 'SQL AREA';

SELECT sum(sharable_mem) FROM v$db_object_cache; 

-- 4.4 TABLE OR INDEX REBUILD NECCESARY?

SELECT substr(segment_name, 1, 30), segment_type, substr(owner, 1, 10),
       extents, initial_extent, next_extent, max_extents
FROM   dba_segments
WHERE  extents > max_extents - 100
AND    owner not in ('SYS','SYSTEM');

SELECT index_name, blevel, 
        decode(blevel,0,'OK BLEVEL',1,'OK BLEVEL', 
        2,'OK BLEVEL',3,'OK BLEVEL',4,'OK BLEVEL','BLEVEL HIGH') OK 
FROM dba_indexes 
WHERE owner='SALES';  


EXAMPLE OF A SCRIPT THAT YOU MIGHT SCHEDULE ONCE A DAY:
-------------------------------------------------------

-- report 1.

set linesize 500
set pagesize 500
set serveroutput on
set trimspool on
spool d:\logs\


exec dbms_output.put_line('DAILY REPORT SALES DATABASE ON SERVER SUPER');
exec dbms_output.put_line('RUNTIME: '||to_char(SYSDATE, 'DD-MM-YYYY;HH24:MI'));
exec dbms_output.put_line('Please read all sections carefully, takes only 1 minute.');
exec dbms_output.put_line('  ');
exec dbms_output.put_line('===================================================');
exec dbms_output.put_line('SECTION 1: OBJECTS AND USERS');
exec dbms_output.put_line('===================================================');
exec dbms_output.put_line('  ');
exec dbms_output.put_line('---------------------------------------------------');
exec dbms_output.put_line('1.1 INVALID OBJECTS AS FOUND RIGHT NOW:');
exec dbms_output.put_line('  ');

SELECT substr(object_name, 1. 30), substr(object_type, 1, 20), owner, status
FROM dba_objects WHERE status='INVALID';

exec dbms_output.put_line('  ');
exec dbms_output.put_line('Remark: If invalid objects are found intervention is required.');
exec dbms_output.put_line('  ');
exec dbms_output.put_line('---------------------------------------------------');
exec dbms_output.put_line('1.2 TABLE/INDEX REACHING MAX NO OF EXTENTS:');
exec dbms_output.put_line('  ');

SELECT substr(segment_name, 1, 30), segment_type, substr(owner, 1, 10),
       extents, initial_extent, next_extent, max_extents
FROM   dba_segments
WHERE  extents > max_extents - 50
AND    owner not in ('SYS','SYSTEM');

exec dbms_output.put_line('  ');
exec dbms_output.put_line('Remark: If objects are found intervention is required.');
exec dbms_output.put_line('  ');
exec dbms_output.put_line('---------------------------------------------------');
exec dbms_output.put_line('1.3 SKEWED or BAD INDEXES with blevel > 3:');
exec dbms_output.put_line('  ');

SELECT index_name, owner, blevel, 
        decode(blevel,0,'OK BLEVEL',1,'OK BLEVEL', 
        2,'OK BLEVEL',3,'OK BLEVEL',4,'OK BLEVEL','BLEVEL HIGH') OK 
FROM dba_indexes 
WHERE owner in ('SALES','FRONTEND')
and blevel > 3;  

exec dbms_output.put_line('  ');
exec dbms_output.put_line('Remark: If indexes are found rebuild is required.');
exec dbms_output.put_line('  ');
exec dbms_output.put_line('---------------------------------------------------');
exec dbms_output.put_line('1.4. NEW OBJECTS CREATED SINCE YESTERDAY:');
exec dbms_output.put_line('  ');

SELECT owner, substr(object_name, 1, 30), object_type, created, 
       last_ddl_time, status
FROM   dba_objects 
WHERE  created > SYSDATE-5;

exec dbms_output.put_line('  ');
exec dbms_output.put_line('---------------------------------------------------');
exec dbms_output.put_line('1.5. NEW ORACLE USERS CREATED SINCE YESTERDAY:');
exec dbms_output.put_line('  ');

SELECT substr(username, 1, 20), account_status, 
default_tablespace, temporary_tablespace, created
FROM dba_users WHERE created > SYSDATE -10;

exec dbms_output.put_line('  ');

exec dbms_output.put_line('===================================================');
exec dbms_output.put_line('SECTION 2: TABLESPACES, DATAFILES, ROLLBACK SEGS');
exec dbms_output.put_line('===================================================');
exec dbms_output.put_line('  ');
exec dbms_output.put_line('---------------------------------------------------');
exec dbms_output.put_line('2.1 FREE/USED SPACE OF TABLESPACES RIGHT NOW:');
exec dbms_output.put_line('  ');

SELECT Total.name "Tablespace Name",
       Free_space, (total_space-Free_space) Used_space, total_space
FROM
  (SELECT tablespace_name, sum(bytes/1024/1024) Free_Space
     FROM sys.dba_free_space
    GROUP BY tablespace_name
  ) Free,
  (SELECT b.name,  sum(bytes/1024/1024) TOTAL_SPACE
     FROM sys.v_$datafile a, sys.v_$tablespace B
    WHERE a.ts# = b.ts#
    GROUP BY b.name
  ) Total
WHERE Free.Tablespace_name = Total.name;

exec dbms_output.put_line('  ');
exec dbms_output.put_line('REMARK: FOR MONTHLY INTERNET BILLING AT LEAST 50MB SPACE MUST');
exec dbms_output.put_line('BE AVAILABLE IN EACH OF THE MANIIN% TABLESPACES.  ');
exec dbms_output.put_line('  ');
exec dbms_output.put_line('---------------------------------------------------');
exec dbms_output.put_line('2.2 STATUS DATABASE FILES RIGHT NOW:');
exec dbms_output.put_line('  ');

SELECT substr(file_name, 1, 50), tablespace_name, status
FROM dba_data_files;

exec dbms_output.put_line('  ');
exec dbms_output.put_line('Remark: status of all files should be available ');
exec dbms_output.put_line('  ');
exec dbms_output.put_line('---------------------------------------------------');
exec dbms_output.put_line('2.3 STATUS ROLLBACK SEGMENTS RIGHT NOW:');
exec dbms_output.put_line('  ');

SELECT substr(segment_name, 1, 20), substr(tablespace_name, 1, 20), status,
       INITIAL_EXTENT, NEXT_EXTENT, MIN_EXTENTS, MAX_EXTENTS, PCT_INCREASE   
FROM DBA_ROLLBACK_SEGS;

exec dbms_output.put_line('  ');
exec dbms_output.put_line('===================================================');
exec dbms_output.put_line('SECTION 3: PERFORMANCE STATS SINCE DATABASE STARTUP');
exec dbms_output.put_line('===================================================');
exec dbms_output.put_line('  ');
exec dbms_output.put_line('---------------------------------------------------');
exec dbms_output.put_line('3.1 ORACLE MEMORY (SGA LAYOUT):');
exec dbms_output.put_line('  ');

SELECT * FROM V$SGA;

exec dbms_output.put_line('  ');
exec dbms_output.put_line('---------------------------------------------------');
exec dbms_output.put_line('3.2 FREE MEMORY SHARED POOL:');
exec dbms_output.put_line('  ');

SELECT * FROM v$sgastat 
WHERE name = 'free memory';
 
exec dbms_output.put_line('  ');
exec dbms_output.put_line('---------------------------------------------------');
exec dbms_output.put_line('3.3 LIBRARY (pl/sql) HIT RATIO:');
exec dbms_output.put_line('  ');

SELECT sum(pins) Executions, sum(reloads) Misses, sum(reloads)/sum(pins) Ratio
FROM   v$librarycache;

exec dbms_output.put_line('  ');
exec dbms_output.put_line('Remark: above Ratio should be low ');
exec dbms_output.put_line('  ');

exec dbms_output.put_line('---------------------------------------------------');
exec dbms_output.put_line('3.4 DATABASE BUFFERS HIT RATIO:');
exec dbms_output.put_line('  ');

SELECT  (1-(pr.value/(dbg.value+cg.value)))*100
FROM    v$sysstat pr, v$sysstat dbg, v$sysstat cg
WHERE   pr.name = 'physical reads'
AND     dbg.name = 'db block gets'
AND     cg.name = 'consistent gets';

exec dbms_output.put_line('  ');
exec dbms_output.put_line('Remark: above Ratio should be high  ');
exec dbms_output.put_line('  ');
exec dbms_output.put_line('---------------------------------------------------');
exec dbms_output.put_line('3.5 REDO BUFFERS WAITS:');
exec dbms_output.put_line('  ');

SELECT  (req.value*5000)/entries.value
FROM    v$sysstat req, v$sysstat entries
WHERE   req.name ='redo log space requests'
AND     entries.name='redo entries';

exec dbms_output.put_line('  ');
exec dbms_output.put_line('Remark: above Ratio should be very low  ');
exec dbms_output.put_line('  ');
exec dbms_output.put_line('===================================================');
exec dbms_output.put_line('SECTION 4: LOCKS');
exec dbms_output.put_line('===================================================');
exec dbms_output.put_line('  ');
exec dbms_output.put_line('---------------------------------------------------');
exec dbms_output.put_line('4.1 OBJECT LOCKS RIGHT NOW:');
exec dbms_output.put_line('  ');

SELECT  l.object_id                      object_id, 
        l.session_id                     session_id,
        substr(l.oracle_username, 1, 10) username, 
        substr(l.os_user_name, 1, 30)    osuser, 
        l.process                        process,
        l.locked_mode                    lockmode, 
        substr(o.object_name, 1, 20)     objectname
FROM    v$locked_object l, dba_objects o
WHERE   l.object_id=o.object_id;

exec dbms_output.put_line('  ');
exec dbms_output.put_line('---------------------------------------------------');
exec dbms_output.put_line('4.2 PERSISTENT LOCKS SINCE YESTERDAY:');
exec dbms_output.put_line('  ');

SELECT OBJECT_ID,SESSION_ID,USERNAME,OSUSER,PROCESS,LOCKMODE,       
       OBJECT_NAME, to_char(DATUM, 'DD-MM-YYYY;HH24:MI')
FROM PROJECTS.LOCKLIST
WHERE DATUM > SYSDATE-2
ORDER BY DATUM;

exec dbms_output.put_line('  ');
exec dbms_output.put_line('---------------------------------------------------');
exec dbms_output.put_line('4.3 BLOCKED SESSIONS RIGHT NOW:');
exec dbms_output.put_line('  ');

SELECT s.sid                       sid, 
       substr(s.username, 1, 10)   username, 
       substr(s.schemaname, 1, 10) schemaname, 
       substr(s.osuser, 1, 10)     osuser, 
       substr(s.program, 1, 30)    program, 
       s.command                   command,
       l.lmode                     lockmode, 
       l.block                     blocked
FROM   v$session s, v$lock l
WHERE  s.sid=l.sid and schemaname not in ('SYS','SYSTEM');

exec dbms_output.put_line('  ');
exec dbms_output.put_line('===================================================');
exec dbms_output.put_line('SECTION 5: ONLY NEEDED FOR oracle-dba     ');
exec dbms_output.put_line('           INFO NEEDED FOR RECOVERY       ');
exec dbms_output.put_line('===================================================');
exec dbms_output.put_line('  ');
exec dbms_output.put_line('scn datafiles:  ');
exec dbms_output.put_line('scn controlfiles:  ');
exec dbms_output.put_line('latest 20 archived redo: ');
exec dbms_output.put_line('  ');
exec dbms_output.put_line('  ');

exec dbms_output.put_line('---------------------------------------------------');
exec dbms_output.put_line('---------------------------------------------------');
exec dbms_output.put_line('END REPORT 1');
exec dbms_output.put_line('Thanks a lot for reading this report !!!');
exit
/


========
24 RMAN:
========

24.1 Introduction:
------------------

Recovery Manager (RMAN) is an Oracle tool that allows you to back up, 
copy, restore, and recover datafiles, control files, and archived redo logs. 
It is included with the Oracle server and does not require separate installation. 
You can invoke RMAN as a command line utility from the operating system (O/S) prompt 
or use the GUI-based Enterprise Manager Backup Manager. 

RMAN users "server sessions" to automate many of the backup and recovery tasks that 
were formerly performed manually. For example, instead of requiring you to 
locate appropriate backups for each datafile, copy them to the correct place using 
operating system commands, and choose which archived logs to apply, 
RMAN manages these tasks automatically. 

RMAN stores metadata about its backup and recovery operations in the recovery catalog, 
which is a centralized repository of information, or exclusively in the control file. 
Typically, the recovery catalog is stored in a separate database. 
If you do not use a recovery catalog, RMAN uses the control file as its repository of metadata. 


RMAN can be used on a database in archive mode or no archive mode.

!!!! But, for open backups, the database MUST BE in ARCHIVE MODE. 
That's true for Oracle 8, 8i, 9i and 10g.

RMAN doesn't do a "begin backup".  It is not necessary when you use RMAN.  
RMAN does an intelligent copy of the database blocks (as opposed to a simple OS 
copy) and it ensures we do not copy a fractured block.  The whole purpose of the 
begin backup (of the OS type of backup) is to record more info into the redo logs 
in the event an OS copy 
copies a "fractured block" - where the head and tail do not match (can happen 
since we are WRITING to the database at the same time the backup would be 
reading).  When RMAN hits such a block -- it re-reads it to get a clean copy.


How to start RMAN? 

- You can call from unix, or cmd prompt, the RMAN utility:

$ rman

RMAN>

Once started you will see the RMAN> prompt.

- Or you can give command line paramaters along with the rman call

% rman target sys/sys_pwd@prod1 catalog rman/rman@rcat


24.2 Types of commands, and interactive mode or batch mode:
-----------------------------------------------------------

RMAN uses two basic types of commands: stand-alone commands and job commands.

- The job commands always appear within the brackets of a run command.
- The stand-alone command can be issued right after the RMAN prompt.

You can run RMAN in interactive mode or batch mode

- examples of interactive mode:


RMAN> run {
2>  allocate channel d1 type disk;
3>  backup database; 
4>  }


RMAN> run {
      allocate channel c1 type disk;
      copy datafile 6 to 'F:\oracle\backups\oem01.cpy';
      release channel c1;
      }

RMAN> run {
      allocate channel c1 type disk;
      backup format 'F:\oracle\backups\oem01.rbu' ( datafile 6 );
      release channel c1;
      }

RMAN> run {
     allocate channel c1 type 'sbt_tape';
     restore database;
     recover database;
      }

Note about 'channel':

You must allocate a 'channel" before you execute backup and recovery commands. 
Each allocated channel establishes a connection from RMAN to a target database 
by starting a server session on the instance. This server session performs 
the backup and recovery operations. 
Only one RMAN session communicates with the allocated server sessions. 

You can allocate multiple channels, thus allowing a single RMAN command 
to read or write multiple backups or image copies in parallel. 
Thus, the number of channels that you allocate affects the degree of parallelism 
within a command. 
When backing up to tape you should allocate one channel for each physical device, 
but when backing up to disk you can allocate as many channels 
as necessary for maximum throughput. 


The simplest way to determine whether RMAN encountered an error is to examine its return code. 
RMAN returns 0 to the operating system if no errors occurred, 1 otherwise. 
For example, if you are running UNIX and using the C shell, 
RMAN outputs the return code into a shell variable called $status. 

The second easiest way is to search the Recovery Manager output for the
string RMAN-00569, which is the message number for the error stack banner. 
All RMAN errors are preceded by this error message. 
If you do not see an RMAN-00569 message in the output, then there are no errors. 


- example of batch mode:

You can type RMAN commands into a file, and then run the command file 
by specifying its name on the command line. 
The contents of the command file should be identical 
to commands entered at the command line. Suppose the commandfile is
called 'b_whole_l0.rcv', then the rman call could be as in the following example:


$ rman target / catalog rman/rman@rcat @b_whole_l0.rcv log rman_log.f

Another example:

c:> rman target xxx/yyy@target rcvcat aaa/bbb@catalog cmdfile bkdb.scr msglog bkdb.log                                                                             


24.3. Recovery Manager Repository or RMAN Catalog:
--------------------------------------------------

Storage of the RMAN Repository in the Recovery Catalog, or exclusively in the
target database controlfile:

The RMAN repository is the collection of metadata about your target databases 
that RMAN uses to conduct its backup, recovery, and maintenance operations. 
You can either create a recovery catalog in which to store this information, 
or let RMAN store it exclusively in the target database control file. 
Although RMAN can conduct all major backup and recovery operations using 
just the control file, some RMAN commands function only when you use a recovery catalog. 

The recovery catalog is maintained solely by RMAN; the target database never 
accesses it directly. RMAN propagates information about the database structure, 
archived redo logs, backup sets, and datafile copies 
into the recovery catalog from the target database's control file. 

A single recovery catalog is able to store information for multiple target databases.

What is in the recovery catalog?
--------------------------------

-Datafile and archived redo log backup sets and backup pieces. 
-Datafile copies. 
-Archived redo logs and their copies. 
-Tablespaces and datafiles on the target database. 
-Stored scripts, which are named user-created sequences of RMAN and SQL commands. 

Resynchronization of the Recovery Catalog
-----------------------------------------

The recovery catalog obtains crucial RMAN metadata from the target database control file. 
Resynchronization of the recovery catalog ensures that the metadata that RMAN obtains 
from the control file stays current. 

Resynchronizations can be full or partial. In a partial resynchronization, 
RMAN reads the current control file to update changed data, but does not resynchronize 
metadata about the database physical schema: datafiles, tablespaces, redo threads, 
rollback segments (only if the database is open), and online redo logs. 
In a full resynchronization, RMAN updates all changed records, including schema records. 

When you issue certain commands in RMAN, the program automatically detects when it needs 
to perform a full or partial resynchronization and executes the operation as needed. 
You can also force a full resynchronization by issuing a 'resync catalog' command. 

It is a good idea to run RMAN once a day or so and issue the resync catalog command 
to ensure that the catalog stays current. 
Because the control file employs a circular reuse system, 
backup and copy records eventually get overwritten.

A single recovery catalog is able to store information for multiple target databases. 

24.4 Media Manager:
-------------------

To utilize tape storage for your database backups, RMAN requires a media manager. 
A media manager is a utility that loads, labels, 
and unloads sequential media such as tape drives for the purpose of backing up and recovering data.
Note that Oracle does not need to connect to the media management 
library (MML) software when it backs up to disk. 

Software that is compliant with the MML interface enables an Oracle server session 
to issue commands to the media manager to back up or restore a file. 
The media manager responds to the command by loading, labeling, or unloading the requested tape. 


24.5 Backups:
-------------

When you execute the backup command, you create one or more backup sets. 
A backup set, which is a logical construction, contains one or more physical backup pieces. 
Backup pieces are operating system files that contain the backed up datafiles, 
control files, or archived redo logs. You cannot split a file across different backup sets 
or mix archived redo logs and datafiles into a single backup set. 

A backup set is a complete set of backup pieces that constitute a full or incremental 
backup of the objects specified in the backup command. Backup sets are in an RMAN-specific format; 
image copies, in contrast, are available for use without additional processing. 

So, for example:
You can have a backupset 'backupset 1' containing just 1 datafile.
You can have a backupset 'backupset 2' containing many datafiles, as blocks.
You can have a backupset 'backupset 3' containing archived redologs


You can either let RMAN determine a unique name for the backup piece or use the format parameter 
to specify a name. If you do not specify a filename, RMAN uses the %U substitution variable 
to guarantee a unique name. The backup command provides substitution variables 
that allow you to generate unique filenames. 


24.6 Starting RMAN Sessions:
----------------------------

Example 1: connect to target database
-------------------------------------

$ ORACLE_SID=brdb;export ORACLE_SID

$rman
RMAN>connect target sys/password
RMAN .. connected

Example 2: connect to catalog database
--------------------------------------

$rman
RMAN>connect catalog rman/rman
RMAN .. connected


Starting and stopping target database

$ ORACLE_SID=brdb;export ORACLE_SID

$rman
RMAN>connect target sys/password
RMAN .. connected

RMAN>startup      -- will start the target database

RMAN>shutdown     -- will stop the target database


Example 3: starting RMAN with command parameters:
-------------------------------------------------

$ ORACLE_SID=brdb;export ORACLE_SID

$ rman target sys/password@prod1 catalog rman/rman@rcat


24.7 Creating the Recovery Catalog:
-----------------------------------

- create a database for the Recovery Catalog, for example rcdb
- create the user that will hold the catalog, rman with password rman

create user rman identified by rman
default tablespace rman
temporary tablespace temp;

- give the right permissions:

grant connect, resource, recovery_catalog_owner to rman;

- create the catalog in database rcdb

In 8.0, to setup Recovery Catalog, you can run 
$ORACLE_HOME/rdbms/admin/catrman.sql while connected to RMAN database.

In 8.1 and later, to setup the Recovery Catalog, use the create catalog command.

$ rman
RMAN>connect catalog rman/rman

RMAN-06008 connected to recovery catalog database
RMAN-06428 recovery catalog is not installed


RMAN>create catalog tablespace rman;
RMAN-06431 recovery catalog created

You can expect something like the following to exist
in the rcdb database:

SQL> select table_name, tablespace_name, owner
  2  from dba_tables where owner='RMAN';

TABLE_NAME                     TABLESPACE_NAME                OWNER
------------------------------ ------------------------------ ------
AL                             DATA                           RMAN
BCB                            DATA                           RMAN
BCF                            DATA                           RMAN
BDF                            DATA                           RMAN
BP                             DATA                           RMAN
BRL                            DATA                           RMAN
BS                             DATA                           RMAN
CCB                            DATA                           RMAN
CCF                            DATA                           RMAN
CDF                            DATA                           RMAN
CKP                            DATA                           RMAN
CONFIG                         DATA                           RMAN
DB                             DATA                           RMAN
DBINC                          DATA                           RMAN
DF                             DATA                           RMAN
DFATT                          DATA                           RMAN
OFFR                           DATA                           RMAN
ORL                            DATA                           RMAN
RCVER                          DATA                           RMAN
RLH                            DATA                           RMAN
RR                             DATA                           RMAN
RT                             DATA                           RMAN
SCR                            DATA                           RMAN
SCRL                           DATA                           RMAN
TS                             DATA                           RMAN
TSATT                          DATA                           RMAN
XCF                            DATA                           RMAN
XDF                            DATA                           RMAN

28 rows selected.

SQL> select view_name, owner
  2  from dba_views where owner='RMAN';

VIEW_NAME                      OWNER
------------------------------ -----
RC_ARCHIVED_LOG                RMAN
RC_BACKUP_CONTROLFILE          RMAN
RC_BACKUP_CORRUPTION           RMAN
RC_BACKUP_DATAFILE             RMAN
RC_BACKUP_PIECE                RMAN
RC_BACKUP_REDOLOG              RMAN
RC_BACKUP_SET                  RMAN
RC_CHECKPOINT                  RMAN
RC_CONTROLFILE_COPY            RMAN
RC_COPY_CORRUPTION             RMAN
RC_DATABASE                    RMAN
RC_DATABASE_INCARNATION        RMAN
RC_DATAFILE                    RMAN
RC_DATAFILE_COPY               RMAN
RC_LOG_HISTORY                 RMAN
RC_OFFLINE_RANGE               RMAN
RC_PROXY_CONTROLFILE           RMAN
RC_PROXY_DATAFILE              RMAN
RC_REDO_LOG                    RMAN
RC_REDO_THREAD                 RMAN
RC_RESYNC                      RMAN
RC_STORED_SCRIPT               RMAN
RC_STORED_SCRIPT_LINE          RMAN
RC_TABLESPACE                  RMAN

24 rows selected.


The recovery catalog is now installed in the database rcdb.

Compatibility:
---------------

If you use an 8.1.6 RMAN executable to execute the "create catalog" command,
then the recovery catalog is created as a release 8.1.6 recovery catalog.
Compatibility=8.1.6
You cannot use the 8.1.6 catalog with a pre-8.1.6 release of the RMAN executable.

If you use an 8.1.6 RMAN executable to execute the "upgrade catalog" command,
then the recovery catalog is upgraded from a pre-8.1.6 release to a release 8.1.6 catalog.
Compatibility=8.0.4
The 8.1.6 catalog is backwards compatible with older releases of the RMAN executable. 

To view compatibility:

SQL> SELECT value FROM config WHERE name='compatible';

Use an older RMAN to create the catalog.
Use the newer RMAN to upgrade the catalog.

You can allwys do:

RMAN> configure compatible = 8.1.5;


*** EXTRA: different RMAN CATALOGS in 1 DATABASE ***

Different versions in one database:
-----------------------------------

In general, the rules of RMAN compatibility are as follows: 

- The RMAN catalog schema version (tables/views) should be greater than or equal 
  to the catalog database version. 
- The RMAN catalog is backwards compatible with target databases from earlier releases.
- The versions of the RMAN executable and the target database should be the same

- RMAN cannot create release 8.1 or later catalog schemas in 8.0 catalog databases. 


Suppose you have 8.0.5 and 9i target databases.

- create one 9i database rcdb
- create 2 tablespaces: RCAT80 and RCAT9I
- create corresponding rman users


Create the 8.0.5 catalog in the 9.2.0 catalog database.     
#  sql syntax for creating logical catalog 8.0.5 structure.    
create tablespace RCAT80 datafile '/export/home/dfreneuil/D817F/           
DATAFILES/rcat80_01.dbf' size 20M ;  

Create the 9.2.0 catalog in the 9.2.0 catalog database.     
#  sql syntax for creating logical catalog 8i structure.    
create tablespace RCAT9I datafile '/export/home/dfreneuil/D920F/           
DATAFILES/rcat9i_01.dbf' size 20M ;  

#  sql syntax for creating catalog 8.0.5 user owner.    
create user RMAN80 identified by rman80    
default tablespace RCAT80    
temporary tablespace temp    
quota unlimited on RCAT80 ; 

grant connect, resource,recovery_catalog_owner to rman80 ;  

#  sql syntax for creating catalog 9i user owner. 
create user RMAN9I identified by rman9i    
default tablespace RCAT9I    
temporary tablespace temp    
quota unlimited on RCAT9I ;  

grant connect, resource,recovery_catalog_owner to rman9i ; 

- make tnsnames.ora OK

- Create the 2 catalogs:

9.2.0 catalog views creation.        

$ rman catalog rman9i/rman9i    -- to connect locally.
or   
$ rman catalog rman8i/rman9i@alias    to connect through NET8.       
         
RMAN> create catalog ;  


8.0.5 catalog views creation.            

Since the catalogs database is an 8.1.7 database, connect to the 8.0.5 catalog 
via 8.0.5 SQL*Plus.  

$ sqlplus rman80/rman80@alias_to_rcat80          
--> connect from the target machine to the 8.0.5 catalog.        
SQL> @?/rdbms/admin/catrman.sql  

Backup an 8.0.5 database with 8.0.5 RMAN into an 8.0.5 catalog in an 9.2.0 catalog database. 

$ rman rcvcat rman80/rman80@V817  

8.0.5 db ----> 8.0.5 RMAN ----> 8.0.5 catalog in 9.2.0 db
9.2.0 db ----> 9.2.0 RMAN ----> 9.2.0 catalog in 9.2.0 db

*** END EXTRA ***


24.8 Registering and un-registering the target database:
--------------------------------------------------------

Register:
---------

Now we must 'register' the target database.
Suppose the target database is called 'airm'.

Connect to the target and the catalog:

$ rman target / catalog rman/rman@rcdb

or

$ rman system/passw@airm catalog rman/rman@rcdb

RMAN-06005 connected to target database: AIRM
RMAN-06008 connected to recovery catalog database

RMAN>register database

And the airm database will be registered in the catalog.

  If you connected to rcdb before the registering and
  give the following queries before and after registering airm:

  SQL> connect system/manager@rcdb
  Connected.

 
  before registering:
  SQL> select * from rman.db;

  no rows selected

  after registering:
  SQL> select * from rman.db;

      DB_KEY      DB_ID CURR_DBINC_KEY
  ---------- ---------- --------------
           1 2092303715              2


Unregister:
-----------

It's best to unregister the backups from the catalog
first:

RMAN> list backup of database;
RMAN-03022: compiling command: list

  shows possible backupsets with their numbers
  fore example 989

RMAN> allocate channel for maintenance type disk;
      change backupset 989 delete;


Next we un-register the target database. You will
not use rman, but a special procedure.
You must use this procedure with the DB_KEY and DB_ID
parameters as values.

In SQL*Plus:

SQL>execute dbms_rcvcat.unregisterdatabase(1,2092303715)

and the airm database will be unregistered.


24.9 Reset of the catalog:
--------------------------

If you have opened the target database with the 'RESETLOGS' option,
you have in fact created a new 'incarnation' of the database.

This information must be 'told' to the recovery catalog via
the 'reset database' command:

$ rman target sys/passw catalog rman/rman@rcdb

RMAN>reset database;


24.10 List and Report commands:
-------------------------------

List commands query the catalog or control file, to determine which
backups or copies are available.
List commands provide for basic information.

Report commands can provide for much more detail.


List commands:
--------------

- Query on the incarnations of the target database

RMAN> list incarnation of database;
RMAN-03022: compiling command: list

List of Database Incarnations
DB Key  Inc Key DB Name  DB ID            CUR Reset SCN  Reset Time
------- ------- -------- ---------------- --- ---------- ----------
1       2       AIRM     2092303715       YES 1          24-DEC-02


- Query on tablespace backups

You can ask for lists of tablespace backups, as shown in the 
following example:

RMAN> list backup of tablespace users;

- Query on database backups

RMAN> list backup of database;


Report commands:
----------------

RMAN>report schema;

Shows the physical structure of the target database.


RMAN> report obsolete;

RMAN-03022: compiling command: report
RMAN-06147: no obsolete backups found


-- REPORT COMMAND:
-- ---------------

About Reports of RMAN Backups
Reports enable you to confirm that your backup and recovery strategy is in fact meeting your requirements 
for database recoverability. The two major forms of REPORT used to determine whether your database 
is recoverable are:

RMAN> REPORT NEED BACKUP;

Reports which database files need to be backed up to meet a configured or specified retention policy

RMAN> REPORT UNRECOVERABLE;

Reports which database files require backup because they have been affected by some NOLOGGING operation 
such as a direct-path insert

You can report backup sets, backup pieces and datafile copies that are obsolete, that is, not needed 
to meet a specified retention policy, by specifying the OBSOLETE keyword. If you do not specify any 
other options, then REPORT OBSOLETE displays the backups that are obsolete according to the current 
retention policy, as shown in the following example:

RMAN> REPORT OBSOLETE;


In the simplest case, you could crosscheck all backups on disk, tape or both, using any one 
of the following commands:

RMAN> CROSSCHECK BACKUP DEVICE TYPE DISK;
RMAN> CROSSCHECK BACKUP DEVICE TYPE SBT;
RMAN> CROSSCHECK BACKUP; # crosshecks all backups on all devices 


The REPORT SCHEMA command lists and displays information about the database files.

After connecting RMAN to the target database and recovery catalog (if you use one), issue REPORT SCHEMA 
as shown in this example:

RMAN> REPORT SCHEMA;


-- LIST COMMAND:
-- -------------

About RMAN Reports Generated by the LIST Command
You can control how the output is displayed by using the BY BACKUP and BY FILE options of the LIST command 
and choosing between the SUMMARY and VERBOSE options.

The primary purpose of the LIST command is to determine which backups are available. For example, you can list:

. Backups and proxy copies of a database, tablespace, datafile, archived redo log, or control file
. Backups that have expired
. Backups restricted by time, path name, device type, tag, or recoverability
. Incarnations of a database

Note that the V$BACKUP_FILES also contains list information for backups.

By default, RMAN lists backups by backup, which means that it serially lists each backup or proxy copy 
and then identifies the files included in the backup. You can also list backups by file.

By default, RMAN lists in verbose mode. You can also list backups in a summary mode if the verbose mode 
generates too much output.

Listing Backups by Backup
To list backups by backup, connect to the target database and recovery catalog (if you use one), and then 
execute the LIST BACKUP command. Specify the desired objects with the listObjList clause. For example, 
you can enter:

LIST BACKUP;       # lists backup sets, image copies, and proxy copies
LIST BACKUPSET;    # lists only backup sets and proxy copies
LIST COPY;         # lists only disk copies

Example:

RMAN> LIST BACKUP OF DATABASE;


By default the LIST output is detailed, but you can also specify that RMAN display the output in summarized form. 
Specify the desired objects with the listObjectList or recordSpec clause. If you do not specify an object, 
then LIST BACKUP displays all backups.

After connecting to the target database and recovery catalog (if you use one), execute LIST BACKUP, 
specifying the desired objects and options. For example:

LIST BACKUP SUMMARY;  # lists backup sets, proxy copies, and disk copies

You can also specify the EXPIRED keyword to identify those backups that were not found during a crosscheck:

LIST EXPIRED BACKUP SUMMARY;


-- CONTROLFILE AUTOBACKUP
-- ----------------------

Configuring Control File and Server Parameter File Autobackup
RMAN can be configured to automatically back up the control file and server parameter file whenever 
the database structure metadata in the control file changes and whenever a backup record is added. 
The autobackup enables RMAN to recover the database even if the current control file, catalog, and server 
parameter file are lost.

Because the filename for the autobackup uses a well-known format, RMAN can search for it without access 
to a repository, and then restore the server parameter file. After you have started the instance with the 
restored server parameter file, RMAN can restore the control file from an autobackup. After you mount 
the control file, the RMAN repository is available and RMAN can restore the datafiles and find 
the archived redo log.

You can enable the autobackup feature by running this command:

CONFIGURE CONTROLFILE AUTOBACKUP ON;

You can disable the feature by running this command:

CONFIGURE CONTROLFILE AUTOBACKUP OFF;

Backing Up Control Files with RMAN
You can back up the control file when the database is mounted or open. RMAN uses a snapshot control file 
to ensure a read-consistent version. If CONFIGURE CONTROLFILE AUTOBACKUP is ON (by default it is OFF), 
then RMAN automatically backs up the control file and server parameter file after every backup 
and after database structural changes. The control file autobackup contains metadata about the previous backup, 
which is crucial for disaster recovery.

If the autobackup feature is not set, then you must manually back up the control file in one of the following ways:

.Run BACKUP CURRENT CONTROLFILE
.Include a backup of the control file within any backup by using the INCLUDE CURRENT CONTROLFILE option
 of the BACKUP command
.Back up datafile 1, because RMAN automatically includes the control file and SPFILE in backups of datafile 1

Note:

If the control file block size is not the same as the block size for datafile 1, then the control file 
cannot be written into the same backup set as the datafile. RMAN writes the control file into a backup set 
by itself if the block size is different.
A manual backup of the control file is not the same as a control file autobackup. In manual backups, 
only RMAN repository data for backups within the current RMAN session is in the control file backup, 
and a manually backed-up control file cannot be automatically restored.


24.11 Create scripts:
---------------------

If you are connected to the target and the catalog,
you can create and store scripts in the catalog.

Example:


== XXX

RMAN> create script complet_bac1 {
2> allocate channel c1 type disk;
3> allocate channel c2 type disk;
4> backup database;
5> sql 'ALTER SYSTEM ARCHIVE LOG ALL';
6> backup archivelog all;
7> }

RMAN-03022: compiling command: create script
RMAN-03023: executing command: create script
RMAN-08085: created script complet_bac1

To run such a script:

$ rman target sys/passw@airm catalog rman/rman@rcdb

RMAN>run { execute scipt complet_bac1; }

You can also replace a script:

RMAN>replace script b_whole_l0 {
     # back up whole database and archived logs
       allocate channel d1 type disk;
       allocate channel d2 type disk;
       allocate channel d3 type disk;
       backup
         incremental level 0
         tag b_whole_l0
         filesperset 6
         format '/dev/backup/prod1/df/df_t%t_s%s_p%p'  -- name of the backup piece
          (database);
         sql 'ALTER SYSTEM ARCHIVE LOG CURRENT';
         backup
          filesperset 20
          format '/dev/backup/prod1/al/al_t%t_s%s_p%p'
          (archivelog all
           delete input);
}
 

RMAN> SET CONTROLFILE AUTOBACKUP FORMAT FOR DEVICE TYPE DISK TO 'controlfile_%F';
RMAN> BACKUP AS COPY DATABASE;
RMAN> RUN { 
       SET CONTROLFILE AUTOBACKUP FORMAT FOR DEVICE TYPE DISK TO '/tmp/%F.bck'; 
       BACKUP AS BACKUPSET DEVICE TYPE DISK DATABASE;
      }


24.12 Parallization:
--------------------

RMAN executes commands serially; that is, it completes the current command 
before starting the next one. Parallelism is exploited only within the context 
of a single command. Consequently, if you want 5 datafile copies, 
issue a single copy command specifying all 5 copies rather than 5 separate copy commands. 


In the following example, you allocate 5 channels, and then
you issued 5 separate copy commands.
So, all copy commands are performed one after the other.

run { 
    allocate channel  c1 type disk; 
    allocate channel  c2 type disk; 
    allocate channel  c3 type disk; 
    allocate channel  c4 type disk; 
    allocate channel  c5 type disk; 
    copy datafile 22 to '/dev/prod/backup1/prod_tab5_1.dbf'; 
    copy datafile 23 to '/dev/prod/backup1/prod_tab5_2.dbf'; 
    copy datafile 24 to '/dev/prod/backup1/prod_tab5_3.dbf'; 
    copy datafile 25 to '/dev/prod/backup1/prod_tab5_4.dbf'; 
    copy datafile 26 to '/dev/prod/backup1/prod_tab6_1.dbf'; 
}


To get the copy command run in parallel, use the following
command:

run { 
    allocate channel  c1 type disk; 
    allocate channel  c2 type disk; 
    allocate channel  c3 type disk; 
    allocate channel  c4 type disk; 
    allocate channel  c5 type disk; 
copy datafile 5 to '/dev/prod/backup1/prod_tab5_1.dbf',
         datafile 23 to '/dev/prod/backup1/prod_tab5_2.dbf',
         datafile 24 to '/dev/prod/backup1/prod_tab5_3.dbf',
         datafile 25 to '/dev/prod/backup1/prod_tab5_4.dbf',
         datafile 26 to '/dev/prod/backup1/prod_tab6_1.dbf';
}


24.13 Creating backups:
-----------------------


1. Image copy and Backup set:
-----------------------------

- you can make 'image copies', which are actual
complete copies of database files, controlfiles, or
archived redologs, to disk.
These are not stored in the special RMAN format, and can
be used 'ouside' of rman if neccessary.

- you can make for example backups of database files
in a 'backup set' which are in the special rman format.
You must use rman to process them.

Examples:

- image copy, using the copy command:

RMAN>run { allocate channel c1 type disk;
copy
datafile 1 to '/staging/system01.dbf',
datafile 2 to '/staging/data01.dbf',
datafile 3 to '/staging/users01.dbf',
current controlfile to '/staging/control1.ctl'; }

RMAN> run {
2> allocate channel c1 type disk;
3> copy datafile 1 to 'df1.bak';
4> }


- backup set, using the backup command:

RMAN> run
{ allocate channel c1 type disk;
backup tablespace users
including current controlfile; }

RMAN> run {
2> allocate channel c1 type disk;
3> backup tablespace system;
4> }

RMAN>

This example backs up the tablespace to its default backup location, which is port-specific: 
on UNIX systems the location is $ORACLE_HOME/dbs. Because you do not specify the format parameter, 
RMAN automatically assigns the backup a unique filename. 


2. Archive mode and No archive mode:
------------------------------------

If the database is in ARCHIVELOG mode, then the target database can be open or closed; 
you do not need to close the database cleanly (although Oracle recommends 
you do so that the backup is consistent). 

If the database is in NOARCHIVELOG mode, then you must close it cleanly 
prior to taking a backup. 

The following example shows that a tablespace backup
does not work if the database is open and in no archive mode.

RMAN> run {
2> allocate channel c1 type disk;
3> backup tablespace users;
4> }


RMAN-03022: compiling command: allocate
RMAN-03023: executing command: allocate
RMAN-08030: allocated channel: c1
RMAN-08500: channel c1: sid=17 devtype=DISK

RMAN-03022: compiling command: backup
RMAN-03023: executing command: backup
RMAN-08008: channel c1: starting full datafile backupset
RMAN-08502: set_count=2 set_stamp=482962114 creation_time=10-JAN-03
RMAN-08010: channel c1: specifying datafile(s) in backupset
RMAN-00571: ===========================================================
RMAN-00569: =============== ERROR MESSAGE STACK FOLLOWS ===============
RMAN-00571: ===========================================================
RMAN-03007: retryable error occurred during execution of command: backup
RMAN-07004: unhandled exception during command execution on channel c1
RMAN-10035: exception raised in RPC: ORA-19602: cannot backup or copy active file 
in NOARCHIVELOG mode
RMAN-10031: ORA-19624 occurred during call to DBMS_BACKUP_RESTORE.BACKUPDATAFILE


3. Names and sizes:
-------------------

Filenames for Backup Pieces:

You can either let RMAN determine a unique name for the backup piece or use 
the format parameter to specify a name. If you do not specify a filename, 
RMAN uses the %U substitution variable to guarantee a unique name. 
The backup command provides substitution variables that allow you to generate unique filenames. 

Number and Size of Backup Set:

Use the backupSpec clause to list what you want to back up as well as specify 
other useful options. The number and size of backup sets depends on: 

The number of backupSpec clauses that you specify. 

The number of input files specified or implied in each backupSpec clause. 

The number of channels that you allocate. 

The filesperset parameter, which limits the number of files for a backup set. 

The setsize parameter, which limits the overall size in bytes of a backup set. 

The most important rules in the algorithm for backup set creation are: 

Each allocated channel that performs work in the backup job--that is, 
that is not idle--generates at least one backup set. 
By default, this backup set contains one backup piece. 

RMAN always tries to divide the backup load so that all allocated channels have roughly 
the same amount of work to do. 

The maximum upper limit for the number of files per backup set is determined by the 
filesperset parameter of the backup command. 

The maximum upper limit for the size in bytes of a backup set is determined by the 
setsize parameter of the backup command. 


The filesperset parameter limits the number of files that can go in a backup set. 
The default value of this parameter is calculated by RMAN as follows: 
RMAN compares the value 64 to the rounded-up ratio of number of files / number of channels, 
nd sets filesperset to the lower value. For example, if you back up 70 files with one channel, 
RMAN divides 70/1, compares this value to 64, and sets filesperset to 64 
because it is the lowest value. 

The number of backup sets produced by RMAN is the rounded-up ratio of number of 
datafiles / filesperset. For example, if you back up 70 datafiles and filesperset is 64, 
then RMAN produces 2 backup sets. 


setsize:  Sets the maximum size in bytes of the backup set without 
          specifying a limit to the number of files in the set. 

filesperset:  Sets a limit to the number of files in the backup set without 
              specifying a maximum size in bytes of the set. 


4. Examples:
------------


- Backup Database:
------------------

$ rman target / catalog rman/rman@rcat

To write the output to a log file, specify the file at startup. For example, enter: 

$ rman target / catalog rman/rman@rcat log /oracle/log/mlog.f

Allocate one or more channels of type disk or type 'sbt_tape'. 
This example backs up all the datafiles as well as the control file. 
It does not specify a format parameter, so RMAN gives each backup piece 
a unique name automatically and stores it in the port-specific 
default location ($ORACLE_HOME/dbs on UNIX).

Whole database backups automatically include the current control file,
but the current control file does not contain a record of the whole database backup. 
To obtain a control file backup with a record of the whole database backup, 
make a backup of the control file after executing the whole database backup. 
Include a backup of the control file within any backup by specifying 
the include current controlfile option. 

Optionally, use the set duplex command to create multiple identical backupsets.

run { 
     allocate channel ch1 type disk;
     backup database;
     sql 'ALTER SYSTEM ARCHIVE LOG CURRENT'; # archives current redo log as well as 
                                             # all unarchived logs
}

Optionally, use the format parameter to specify a filename for the backup piece. 
For example, enter: 

run { 
     allocate channel ch1 type disk;
     backup database
     format '/oracle/backup/%U';  # %U generates a unique filename
}

Optionally, use the tag parameter to specify a tag for the backup. For example, enter: 

run { 
     allocate channel ch1 type 'sbt_tape';
     backup database
     tag = 'weekly_backup';   # gives the backup a tag identifier
}


This script backs up the database and the archived redo logs: 

RMAN> run {
     allocate channel ch1 type disk;
     allocate channel ch2 type disk;
     backup database;
     sql 'ALTER SYSTEM ARCHIVE LOG ALL';
     backup archivelog all;
}


RMAN> run {
     allocate channel ch1 type disk;
     allocate channel ch2 type disk;
     backup format 'i:\backup\full_db.bck' (database);
     sql 'ALTER SYSTEM ARCHIVE LOG CURRENT';
     backup archivelog all;
}

- Backup tablespace:
--------------------

run { 
     allocate channel ch1 type disk;
     allocate channel ch2 type disk;
     allocate channel ch3 type disk;
     backup filesperset = 3
       tablespace inventory, sales
       include current controlfile;
}

- Backup datafiles:
-------------------

run { 
     allocate channel ch1 type disk;
     backup
       (datafile 1,2,3,4,5,6
       filesperset 3)
       datafilecopy '/oracle/copy/tbs_1_c.f';
}

RMAN> run {
      allocate channel c1 type disk;
      copy datafile 6 to 'F:\oracle\backups\oem01.cpy';
      release channel c1;
      }

RMAN> run {
      allocate channel c1 type disk;
      backup format 'F:\oracle\backups\oem01.rbu' ( datafile 6 );
      release channel c1;
      }

RMAN> run {
     allocate channel ch1 type disk;
     allocate channel ch2 type disk;
     allocate channel ch3 type disk;
     backup
      (datafile 1,2,3 filesperset = 1 channel ch1)
      (datafilecopy '/oracle/copy/cf.f' filesperset = 2 channel ch2)
      (archivelog from logseq 100 until logseq 102 thread 1 filesperset = 3 channel ch3);
}

- Backup archived redologs:
---------------------------

To back up archived logs, issue backup archivelog with the desired filtering options: 

run { 
     allocate channel ch1 type 'sbt_tape';
     backup archivelog all         # Backs up all archived redo logs.    
       delete input;               # Optionally, delete the input logs
}

You can also specify a range of archived redo logs by time, SCN, or log sequence number. 
This example backs up all archived logs created more than 7 and less than 30 days ago: 

run { 
     allocate channel ch1 type disk;
     backup archivelog 
       from time 'SYSDATE-30' until time 'SYSDATE-7';
}


- Incremental backups:
----------------------

This example makes a level 0 backup of the database: 

run { 
     allocate channel ch1 type disk;
     backup 
       incremental level = 0
       database;
}


This example makes a level 1 backup of the database: 

run { 
     allocate channel ch1 type disk;
     backup 
       incremental level = 1
       database;
}


Further examples:
------------------

Your database has to be in archive log mode for this script to work 

RMAN> run {
2> # backup the database to disk
3> allocate channel d1 type disk;
4> backup
5> full
6> tag full_db
7> format '/backups/db_%t_%s_p%p'
8> (database);
9> release channel d1;
10> }
          
----

This script will backup all archive logs. Your database has to be 
in archive log mode for this script to work. 

RMAN> run {
2> allocate channel d1 type disk;
3> backup
4> format '/backups/log_t%t_s%s_p%p'
5> (archivelog all);
6> release channel d1;
7> }
          
          
----

This script will backup all the datafiles. 

resync catalog;
run { 
allocate channel c1 type disk; 
copy datafile 1 to 'C:\rman1.dbf'; 
copy datafile 2 to 'C:\rman2.dbf'; 
copy datafile 3 to 'C:\rman3.dbf';
copy datafile 4 to 'C:\rman4.dbf'; 
copy datafile 5 to 'C:\rman5.dbf'; 
}

exit
echo exiting after successful hot backup using RMAN
          
-----

run {
sql 'alter database close';
allocate channel d1 type disk;
backup full
tag full_offline_backup 
format 'c:\backup\db_t%t_s%s_p%p'
(database);
release channel d1;
sql 'alter database open';
}
          

5. Complete Examples:
---------------------

***************************************************************

L=0 BACKUP

run {
allocate channel d1 type disk;
backup
incremental level = 0
tag db_whole_l0
format 'i:\backup\l0_%d_t%t_s%s_p%p' (database);
sql 'ALTER SYSTEM ARCHIVE LOG CURRENT';
backup
format 'i:\backup\log_%d_t%t_s%s_p%p' (archivelog all);
}

or

  run {
  allocate channel d1 type disk;
  allocate channel d2 type disk;
  backup
  incremental level = 0
  tag db_whole_l0
  format 'i:\backup\l0_%d_t%t_s%s_p%p' (database channel d1);
  sql 'ALTER SYSTEM ARCHIVE LOG CURRENT';
  backup
  format 'i:\backup\log_%d_t%t_s%s_p%p' (archivelog all channel d2);
  }

L=1 BACKUP

run {
allocate channel d1 type disk;
backup
incremental level = 1
tag db_whole_l1
format 'i:\backup\l1_%d_t%t_s%s_p%p' (database);
sql 'ALTER SYSTEM ARCHIVE LOG CURRENT';
backup
format 'i:\backup\log_%d_t%t_s%s_p%p' (archivelog all);
}

*****************************************************************

RMAN>create script db_whole_l0 {
     # back up whole database and archived logs
       allocate channel d1 type disk;
       backup
         incremental level 0
         tag db_whole_l0
         filesperset 15
         format 'i:\backup\l0_%d_t%t_s%s_p%p'  -- name of the backup piece
          (database);
         sql 'ALTER SYSTEM ARCHIVE LOG CURRENT';
         backup
          filesperset 20
          format 'i:\backup\log_%d_t%t_s%s_p%p'
          (archivelog all
           delete input);
}

RMAN>create script db_whole_l1 {
     # back up whole database and archived logs
       allocate channel d1 type disk;
       backup
         incremental level 1
         tag db_whole_l0
         filesperset 15
         format 'i:\backup\l1_%d_t%t_s%s_p%p'  -- name of the backup piece
          (database);
         sql 'ALTER SYSTEM ARCHIVE LOG CURRENT';
         backup
          filesperset 20
          format 'i:\backup\log_%d_t%t_s%s_p%p'
          (archivelog all
           delete input);
}

On sunday : schedule RMAN>run { execute scipt db_whole_l0; }
Other days: schedule RMAN>run { execute scipt db_whole_l1; }


**********************************************

replace script backup_all_archives {
  execute script alloc_all_disks;
  backup
    filesperset 50
    format '/bkup/SID/%d_al_t%t_s%s_p%p'
    (archivelog all  delete input);
  execute script rel_all_disks;
}


# Incremental level 0 (whole) database backup
# The control file is automatically included each time file 1 of the
# system tablespace is backed up. 
# replace script backup_db_level_0_disk {
# execute script alloc_all_disks;
#  set maxcorrupt for datafile 1 to 0;
run {
  allocate channel c2 type disk;
  backup
    incremental level = 0
    tag backup_db_level_0
    # The skip inaccessible clause ensures the backup will continue 
    # if any of the datafiles are inaccessible. 
    skip inaccessible
    filesperset 9
    format 'i:\backup\L0_%d.bck'
    (database);
  sql 'alter system archive log current';  
  execute script backup_all_archives;
}

*************************************************************

-- SUNDAY LEVEL 0 BACKUP
run {
allocate channel d1 type disk;
setlimit channel d1 kbytes 2097150 maxopenfiles 32 readrate 200;
set maxcorrupt for datafile 1,2,3,4,5,6 to 0; 
backup
incremental level 0 cumulative
skip inaccessible
tag sunday_level_0
format 'c:\temp\df_t%t_s%s_p%p'
database;
copy current controlfile to 'c:\temp\sunday.ctl';
sql 'alter system archive log current';
backup
format 'c:\temp\al_t%t_s%s_p%p'
archivelog all
delete input;
release channel d1;
}
-- MONDAY LEVEL 2 BACKUP
run {
allocate channel d1 type disk;
setlimit channel d1 kbytes 2097150 maxopenfiles 32 readrate 200;
set maxcorrupt for datafile 1,2,3,4,5,6 to 0; 
backup
incremental level 2 cumulative
skip inaccessible
tag monday_level_2
format 'c:\temp\df_t%t_s%s_p%p'
database;
copy current controlfile to 'c:\temp\monday.ctl';
sql 'alter system archive log current';
backup
format 'c:\temp\al_t%t_s%s_p%p'
archivelog all
delete input;
release channel d1;
}
-- TUESDAY LEVEL 2 BACKUP
run {
allocate channel d1 type disk;
setlimit channel d1 kbytes 2097150 maxopenfiles 32 readrate 200;
set maxcorrupt for datafile 1,2,3,4,5,6 to 0; 
backup
incremental level 2 cumulative
skip inaccessible
tag tueday_level_2
format 'c:\temp\df_t%t_s%s_p%p'
database;
copy current controlfile to 'c:\temp\tuesday.ctl';
sql 'alter system archive log current';
backup
format 'c:\temp\al_t%t_s%s_p%p'
archivelog all
delete input;
release channel d1;
}
-- WEDNESDAY LEVEL 2 BACKUP
run {
allocate channel d1 type disk;
setlimit channel d1 kbytes 2097150 maxopenfiles 32 readrate 200;
set maxcorrupt for datafile 1,2,3,4,5,6 to 0; 
backup
incremental level 2 cumulative
skip inaccessible
tag wednesday_level_2
format 'c:\temp\df_t%t_s%s_p%p'
database;
copy current controlfile to 'c:\temp\wednesday.ctl';
sql 'alter system archive log current';
backup
format 'c:\temp\al_t%t_s%s_p%p'
archivelog all
delete input;
release channel d1;
}
-- THURSDAY LEVEL 1 BACKUP
run {
allocate channel d1 type disk;
setlimit channel d1 kbytes 2097150 maxopenfiles 32 readrate 200;
set maxcorrupt for datafile 1,2,3,4,5,6 to 0; 
backup
incremental level 1 cumulative
skip inaccessible
tag thursday_level_1
format 'c:\temp\df_t%t_s%s_p%p'
database;
copy current controlfile to 'c:\temp\thursday.ctl';
sql 'alter system archive log current';
backup
format 'c:\temp\al_t%t_s%s_p%p'
archivelog all
delete input;
release channel d1;
}
-- FRIDAY LEVEL 2 BACKUP
run {
allocate channel d1 type disk;
setlimit channel d1 kbytes 2097150 maxopenfiles 32 readrate 200;
set maxcorrupt for datafile 1,2,3,4,5,6 to 0; 
backup
incremental level 2 cumulative
skip inaccessible
tag friday_level_2
format 'c:\temp\df_t%t_s%s_p%p'
database;
copy current controlfile to 'c:\temp\friday.ctl';
sql 'alter system archive log current';
backup
format 'c:\temp\al_t%t_s%s_p%p'
archivelog all
delete input;
release channel d1;
}
-- SATURDAY LEVEL 2 BACKUP
run {
allocate channel d1 type disk;
setlimit channel d1 kbytes 2097150 maxopenfiles 32 readrate 200;
set maxcorrupt for datafile 1,2,3,4,5,6 to 0; 
backup
incremental level 2 cumulative
skip inaccessible
tag saturday_level_2
format 'c:\temp\df_t%t_s%s_p%p'
database;
copy current controlfile to 'c:\temp\saturday.ctl';
sql 'alter system archive log current';
backup
format 'c:\temp\al_t%t_s%s_p%p'
archivelog all
delete input;
release channel d1;
}


6. Third Party:
---------------


You can use rman in combination with third party storage managers.
In this case, rman is used with a MML library and possibly some API 
that uses it's own configuration files, for example:

backup.scr script:

run
{
   allocate channel t1 type 'sbt_tape' parms 
            'ENV=(TDPO_OPTFILE=c:\RMAN\scripts\tdpo.opt)';
   allocate channel t2 type 'sbt_tape' parms 
            'ENV=(TDPO_OPTFILE=c:\RMAN\scripts\tdpo.opt)';
 
   backup
      filesperset 5
      format 'df_%t_%s_%p'
      (database);
 
   release channel t1;
   release channel t2;
}    


run {
allocate channel d1 type 'sbt_tape' connect 'internal/manager@scdb2' parms 
'ENV=(TDPO_OPTFILE=/usr/tivoli/tsm/client/oracle/bin64/tdpo.opt)';
allocate channel d2 type 'sbt_tape' connect 'internal/manager@scdb1' parms 
'ENV=(TDPO_OPTFILE=/usr/tivoli/tsm/client/oracle/bin64/tdpo.opt)';
backup
  format 'ctl_t%t_s%s_p%p'
  tag cf
  (current controlfile);
backup
 full
  filesperset 8
  format 'db_t%t_s%s_p%p'
  tag fulldb
  (database);
release channel d1;
release channel d2;
}


The PARMS parameter sends instructions to the media manager. For example, the following 
vendor-specific PARMS setting instructs the media manager to back up to 
a volume pool called oracle_tapes:

PARMS='ENV=(NSR_DATA_VOLUME_POOL=oracle_tapes)'
parms='ENV=(DSMO_FS=oracle)'

Another example:

RUN
{
  ALLOCATE CHANNEL c1 DEVICE TYPE sbt 
    PARMS='ENV=(NSR_SERVER=tape_srv,NSR_GROUP=oracle_tapes)';
}


If you do not receive an error message, then Oracle successfully l
oaded the shared library. However, channel allocation can fail with the ORA-27211 error:

To delete an old backup:

run
{
   allocate channel for delete type 'sbt_tape' parms
            'ENV=(TDPO_OPTFILE=c:\RMAN\scripts\tdpo.opt)';
 
   change backupset primary_key delete;
 
}
                       
 
To schedule scripts:
--------------------

orcschedppim.cmd

rem ==================================================
rem orcsched.cmd
rem ==================================================

rem ==================================================
rem set rman executable
rem ==================================================
set ora_exe=d:\oracle\ora81\bin\rman

rem ==================================================
rem set script and log directory
rem ==================================================
rem set ora_script_dir=d:\oracle\scripts\
set ora_script_dir=c:\progra~1\tivoli\tsm\agentoba\
rem ==================================================
rem run the backup script
rem ==================================================

%ora_exe% target system/manager@ppim rcvcat rman_db1/rman_db1@orcl 
cmdfile %ora_script_dir%bkdbppim.scr msglog %ora_script_dir%bkdbppim.log

bkdbppim.scr

run
{
   allocate channel t1 type 'sbt_tape' parms
'ENV=(TDPO_OPTFILE=C:\Progra~1\Tivoli\TSM\AgentOBA\tdpoppim.opt)';
   allocate channel t2 type 'sbt_tape' parms
'ENV=(TDPO_OPTFILE=C:\Progra~1\Tivoli\TSM\AgentOBA\tdpoppim.opt)';
      backup
      filesperset 5
      format 'df_%t_%s_%p'
      (database);

   release channel t1;
   release channel t2;
}

------------------------------------

Remarks:
--------

The following is what needs to be changed.

- Old Way

allocate channel for maintenance type 'sbt_tape' parms
'ENV=(DSMO_NODE=tora,
DSMI_ORC_CONFIG=/opt/tivoli/tsm/client/oracle/bin/dsm.opt)'


allocate channel t1 type 'sbt_tape' parms
>         'ENV=(DSMO_NODE=rx_r50,
>               DSMI_CONFIG=/usr/tivoli/tsm/client/ba/bin/dsm.opt,
>               DSMO_PSWDPATH=/usr/tivoli/tsm/client/oracle/bin,
>               DSMI_DIR=/usr/tivoli/tsm/client/ba/bin,
>               DSMO_AVG_SIZE00)';
>   
 
- New Way
allocate channel for maintenance type 'sbt_tape' parms
'ENV=(TDPO_OPTFILE=/opt/tivoli/tsm/client/oracle/bin/tdpo.opt)'

Contents of tdpo.opt


DSMI_ORC_CONFIG    /opt/tivoli/tsm/client/oracle/bin/dsm.opt
DSMI_LOG          /opt/tivoli/tsm/client/oracle/bin/tdpoerror.log 

TDPO_FS           rman_fs 
TDPO_NODE         tora 
*TDPO_OWNER         
TDPO_PSWDPATH      /opt/tivoli/tsm/client/oracle/bin

*TDPO_DATE_FMT      1
*TDPO_NUM_FMT       1
*TDPO_TIME_FMT      1

*TDPO_MGMT_CLASS2   mgmtclass2
*TDPO_MGMT_CLASS3   mgmtclass3
*TDPO_MGMT_CLASS4   mgmtclass4

It is recomended TDP_NUM_BUFFERS be set to a value of 1 only. 

7. Recovery:
------------

A restore can be as easy as:

RMAN> RESTORE DATABASE;
RMAN> RECOVER DATABASE;

Or a single tablespace:

Restore the tablespace or datafile with the RESTORE command, and recover it with the RECOVER command. 
(Use configured channels, or if desired, use a RUN block and allocate channels to improve performance 
of the RESTORE and RECOVER commands.)

RMAN> RESTORE TABLESPACE users;

RMAN> RECOVER TABLESPACE users;

If RMAN reported no errors during the recovery, then bring the tablespace back online:

RMAN> SQL 'ALTER TABLESPACE users ONLINE';


Use the RMAN restore command to restore datafiles, control files, or archived redo logs 
from backup sets or image copies. 
RMAN restores backups from disk or tape, but image copies only from disk. 

Restore files to either: 

- The default location, which overwrites the files with the same name.
- A new location specified by the set newname command.

Restoring the Database to its Default Location
----------------------------------------------

If you do not specify set newname commands for the datafiles during a restore job, 
the database must be closed or the datafiles must be offline. 

RMAN> run {
     allocate channel c1 type 'sbt_tape';
     restore database;
     recover database;
      }

run { 
     set until logseq 5 thread 1;
     allocate auxiliary channel dupdb1 type disk; 
     duplicate target database to dupdb;

} 


Restoring the Database to a point in time (same incarnation)
------------------------------------------------------------

RMAN> run
2 {
3    set until time '23-DEC-2006 13:45:00';
4    restore database;
5    recover database;
6 }

   
Moving the Target Database to a New Host with the Same File System
------------------------------------------------------------------

A media failure may force you to move a database by restoring a backup from 
one host to another. You can perform this procedure so long as you have 
a valid backup and a recovery catalog or control file. 

Because your restored database will not have the online redo logs of your production database, 
you will need to perform incomplete recovery up to the lowest SCN of the most recently 
archived redo log in each thread and then open the database with the RESETLOGS option. 


To restore the database from HOST_A to HOST_B with a recovery catalog:

Copy the initialization parameter file for HOST_A to HOST_B using an operating system utility. 
Connect to the HOST_B target instance and HOST_A recovery catalog. For example, enter: 

% rman target sys/change_on_install@host_b catalog rman/rman@rcat


Start the instance without mounting it: 

startup nomount

Restore and mount the control file. Execute a run command with the following sub-commands: 

Allocate at least one channel. 
Restore the control file. 
Mount the control file. 

run {
     allocate channel ch1 type disk;
     restore controlfile;
     alter database mount;
}


Because there may be multiple threads of redo, use change-based recovery. 
Obtain the SCN for recovery termination by finding the lowest SCN among the most 
recent archived redo logs for each thread. 

Start SQL*Plus and use the following query to determine the necessary SCN: 

SELECT min(scn) 
FROM (SELECT max(next_change#) scn 
      FROM v$archived_log 
      GROUP BY thread#);


Execute a run command with the following sub-commands: 

Set the SCN for recovery termination using the value obtained from the previous step. 

Allocate at least one channel. 
Restore the database. 
Recover the database. 
Open the database with the RESETLOGS option. 

run {
     set until scn = 500;  # use appropriate SCN for incomplete recovery
     allocate channel ch1 type 'sbt_tape';
     restore database;
     recover database;
     alter database open resetlogs;
}

Moving the Target Database to a New Host with a different File System
---------------------------------------------------------------------

Follow the procedure as above, but now use the 'set newname' command.

run { 
     set until scn 500;  # use appropriate SCN for incomplete recovery
     allocate channel ch1 type disk; 
     set newname for datafile 1 to '/disk1/%U'; # rename each datafile manually
     set newname for datafile 2 to '/disk1/%U'; 
     set newname for datafile 3 to '/disk1/%U'; 
     set newname for datafile 4 to '/disk1/%U'; 
     set newname for datafile 5 to '/disk1/%U'; 
     set newname for datafile 6 to '/disk2/%U'; 
     set newname for datafile 7 to '/disk2/%U'; 
     set newname for datafile 8 to '/disk2/%U'; 
     set newname for datafile 9 to '/disk2/%U'; 
     set newname for datafile 10 to '/disk2/%U'; 
     alter database mount; 
     restore database; 
     switch datafile all;  # points the control file to the renamed datafiles
     recover database;
     alter database open resetlogs; 
}  


Warning:

restore with use catalog:
If you issue switch commands, RMAN considers the restored database as the target database, 
and the recovery catalog becomes corrupted. If you do not issue switch commands, 
RMAN considers the restored datafiles as image copies that are candidates for future restore operations. 

restore with no catalog: 
If you issue switch commands, RMAN considers the restored database as the target database. 
If you do not issue switch commands, the restore operation has no effect on the repository. 

Restoring a tablespace:
-----------------------

Suppose tablespace DATA_BIG has become unusable.

run {
     allocate channel ch1 type disk;
     restore tablespace data_big;
}

run {
     allocate channel ch1 type disk;
     recover tablespace data_big;
}


This script will perform datafile recovery 

RMAN> run {
2> allocate channel d1 type disk;
3> sql "alter tablespace users offline immediate";
4> restore datafile 5;
5> recover datafile 5;
6> sql "alter tablespace users online";
7> release channel d1;
8> }


RMAN> run {
     allocate channel ch1 type disk;
     restore database;
     recover database;
     alter database open resetlogs;
}


Duplexing the Target Database to a New Host:
---------------------------------------------

- create instance on second host
- create init.ora, password file etc..
- create similar directories on second host
- make sure net8 works from target en rman to second host
- startup nomount
- neccesary archived redologs are present on second host

$ rman target sys/target_pwd@target_str catalog rman/cat_pwd@cat_str 
auxiliary sys/aux_pwd@aux_str


run { 
     allocate auxiliary channel ch1 type 'sbt_tape';
     duplicate target database to dupdb 
       nofilenamecheck;
}


run {  
     # allocate at least one auxiliary channel of type disk or tape 
     allocate auxiliary channel dupdb1 type 'sbt_tape'; 
     . . . 
     # set new filenames for the datafiles
     set newname for datafile 1 TO '$ORACLE_HOME/dbs/dupdb_data_01.f'; 
     set newname for datafile 2 TO '$ORACLE_HOME/dbs/dupdb_data_02.f'; 
     . . .
     # issue the duplicate command
     duplicate target database to dupdb 
     # create at least two online redo log groups
     logfile
       group 1 ('$ORACLE_HOME/dbs/dupdb_log_1_1.f', 
                '$ORACLE_HOME/dbs/dupdb_log_1_2.f') size 200K, 
       group 2 ('$ORACLE_HOME/dbs/dupdb_log_2_1.f', 
                '$ORACLE_HOME/dbs/dupdb_log_2_2.f') size 200K; 
}


24.14 Common RMAN errors:
-------------------------

What are the common RMAN errors (with solutions)?
Some of the common RMAN errors are: 

PROBLEM 1.
----------

RMAN-20242: Specification does not match any archivelog in the recovery catalog.

Add to RMAN script: sql 'alter system archive log current';

PROBLEM 2.
----------

RMAN-06089: archived log xyz not found or out of sync with catalog

Execute from RMAN: change archivelog all validate;

PROBLEM 3.
----------

fact: Oracle Server - Enterprise Edition 8
fact: Oracle Server - Enterprise Edition 9
fact: Recovery Manager (RMAN)
symptom: RMAN backup fails
symptom: RMAN-10035: exception raised in RPC
symptom: ORA-19505: failed to identify file <file>
symptom: ORA-27037: unable to obtain file status
symptom: SVR4 error:2:no such file or directory
cause: Datafile existed in previous backup set, but has been subsequently 
removed or renamed.

fix:

Resync the RMAN Catalog
	$ rman target sys/<passwd>@target catalog rman/<passwd>@catalog 
	RMAN> resync catalog;
Or
Validate the backup pieces.
	$ rman target sys/<passwd>@target catalog rman/<passwd>@catalog  
        RMAN> allocate channel for maintenance type disk;
	RMAN> crosscheck backup;
	RMAN> resync catalog;


PROBLEM 4.
----------

RMAN> connect target sys/change_on_install@TARGETDB

   RMAN-00569: ================error message stack follows
   RMAN-04005: error from target database: 
               ORA-01017: invalid username/password; logon denied


Problem Explanation:

Recovery Manager automatically requests a connection to the target database 
as SYSDBA.

Solution Description:

Recovery Manager automatically requests a connection to the target database as 
SYSDBA.  In order to connect to the target database as SYSDBA, you must either:

1. Be part of the operating system DBA group with respect to the target 
   database.  This means that you have the ability to CONNECT INTERNAL 
   to the trget database without a password.

    - or -

2. Have a password file setup.  This requires the use of the "orapwd" command 
   and the initialization parameter "remote_login_passwordfile". See Chapter 1 
   of the Oracle8(TM) Server Administrator's Guide, Release 8.0 for details.
   Note that changes to the password file will not take affect until after
   the database is shutdown and restarted.

For Unix, also ensure TWO_TASK is _not_ set. 
e.g. % env | grep -i two
     If set, unset it.
     % unsetenv TWO_TASK

PROBLEM 5.
---------

RMAN cannot connect to the target database through a multi-threaded server (MTS) dispatcher: 
it requires a dedicated server process

Create a net service name in the tnsnames.ora file that connects to the non-shared SID. 
For example, enter: 

inst1_ded =
  (description=
    (address=(protocol=tcp)(host=inst1_host)(port1521))
    (connect_data=(service_name=inst1)(server=dedicated))
  )

$ rman target sys/oracle@inst1_ded catalog rman/rman@rcat


PROBLEM 6.
---------

No MML libary found.
RMAN will:

1. Attempts to load the library indicated by the SBT_LIBRARY parameter in the 
ALLOCATE CHANNEL or CONFIGURE CHANNEL command. If the SBT_LIBRARY parameter 
is not specified, then Oracle proceeds to the next step. 

2. Attempts to load the default media management library. The filename of the default library 
is operating system specific. On UNIX, the library filename is $ORACLE_HOME/lib/libobk.so, 
with the extension name varying according to platform: .so, .sl, .a, and so forth. 
On Windows NT the library is named %ORACLE_HOME%\bin\orasbt.dll. 

If Oracle is unable to locate the MML library,then RMAN issues an ORA-27211 error and exits. 

Whenever channel allocation fails, Oracle writes a trace file to the USER_DUMP_DEST directory. 
The following shows sample output:

SKGFQ OSD: Error in function sbtinit on line 2278
SKGFQ OSD: Look for SBT Trace messages in file /oracle/rdbms/log/sbtio.log
SBT Initialize failed for /oracle/lib/libobk.so


24.15 RMAN 10g Notes:
---------------------


==========================
25. UPGRADE AND MIGRATION:
==========================


25.1 Version and release numbers:
---------------------------------

Oracle 7      -> 8,8i,9i
Oracle 8      -> 8i
Oracle 8.1.x  -> 8.1.y
Oracle 8,8i   -> 9i

Upgrade:   move upwarde from one release in the same version to a higher release
           within the same base version, for example 8.1.6 -> 8.1.7
Migration: move to a different version, for example 7.4.3 -> 8.1.5
Patches  : bugfixes
Patchset : smaller patches combined to latest patchset

Example version:

8.1.6.2 -> 
8=version,1=release number,6=maintenance release number,2=patch number

Exp Imp matrix:
---------------

1. Migration to Oracle9i release 1 - 9.0.1.x :    
-------------------------------------------    
Direct migration with a full database export and full database import    
is only supported if the source database is:    
- Oracle7 : 7.3.4    
- Oracle8 : 8.0.6    
- Oracle8i: 8.1.5 or 8.1.6 or 8.1.7 

Migration to Oracle9i release 2 - 9.2.0.x :    
-------------------------------------------    
Direct migration with a full database export and full database import    
is only supported if the source database is:    
- Oracle7 : 7.3.4    
- Oracle8 : 8.0.6    
- Oracle8i: 8.1.7    
- Oracle9i: 9.0.1 


Tools that can be used to migrate from one version to another:
--------------------------------------------------------------

- exp/imp
- MIG Migration Utility
- ODMA Oracle Data Migration Assistant

There also exists the "Migration Workbench" for migrating 
Access, SQL Server etc.. to Oracle.


25.2 Migration From 7 to 8,8i:
------------------------------

Take into account the following:

- Changed standard directories of init, alert, dump
- Changed and obsolete init.ora parameters
- Changed and obsolete sqlnet.ora, tnsnames.ora and listener.ora parameters
- Rowid values have changed from "restricted" to "extended" format

Obsolete init.ora parameters:

 init_sql_files
 lm_domains
 lm_non_fault_tolerant
 parallel_default_max_scans
 parallel_default_scansize
 sequence_cache_hash_buckets
 serializable
 session_cached_cursors
 v733_plans_enabled

Change init.ora parameters:

 compatible
 snapshot_refresh_interval -> job_queue_interval
 snapshot_refresh_process  -> job_queue_processes
 db_writers                -> dbwr_io_slaves
 user_dump_dest, background_dump_dest, ifile

Three main tools:

- exp/imp

OWNER=  or FULL exp/imp
In case of a full exp/imp you must run catalog.sql of new database

- Migration utility

This is a command line utility.
From 7 to 8 or higher: the Rowid will not be changed automatically.
Migration utility will create a "conversion file" instance_name.dbf
Move this file to the /dbs directory of Oracle 8,9.  
Startup svrmgrl or sqlplus
  alter database convert;
  alter database open resetlogs;

- ODMA

This tool uses a GUI.
 
25.2 Example Upgrade of 8.1.6 to 9 using ODMA:
----------------------------------------------

1. Install the Oracle 9i software in it's own ORACLE_HOME.
2. Prepare the original init.ora
   DB_DOMAIN=correct domain
   JOB_QUEUE_PROCESS=0
   AQ_TM_PROCESSES=0
   REMOTE_LOGIN_PASSWORDFILE=NONE
3. Resize the SYSTEM tablespace to have more than 100M free
4. Prepare the system rollbacksegment to be big enough
   alter rollback segment system storage(maxextents 505 optimal null next 1M);
5. Verify that SYSTEM is the default tablespace for SYS and SYSTEM
6. Make sure there is no user MIGRATE. ODMA will use a user called MIGRATE.
7. Shutdown the database cleanly.
8. Make a backup

9. Setup the environment variables for the 9i software.
   Also, ODMA uses the java GUI, just like the OUI
10. Start ODMA

$ cd $ORACLE_HOME/bin
$ odma

11. Basically, follow the instructions.
    
ODMA will ask you the instance that must be upgraded.
On unix, this is read from the oratab file.

Then it will ask you to confirm both the old and new ORACLE_HOME.
It will also ask for the location of the init.ora file.

Then it will proceed in the upgrade.
The upgrade is primarily about the datadictionary.

12. When ODMA is ready, do the following:
    check the alert log and other logs
    Also check oratab, optionally run utlrp.sql to automatically 
    rebuild any invalid objects.
    Check for invalid objects and check indexes.
    Analyze all tables plus indexes.


25.3 Example Upgrade of 8.1.6 to 8.1.7:
---------------------------------------

1. Install the new Oracle software in a different $ORACLE_HOME
For example
$ cd $ORACLE_BASE
$ cd product
$ ls
8.1.6 8.1.7

Backup and shutdown the 8.1.6 database, and stop the listener

2. Set the correct env variables for 8.1.7
3. Create a softlink in the new $ORACLE_HOME/dbs to the init.ora
   in the $ORACLE_BASE/admin/sid/pfile directory

Startup the database with new Oracle release

4. Startup the database using the new Oracle software
   sqlplus internal  (or via svrmgrl)
   startup restrict;
5. Run the upgrade script $ORACLE_HOME/rdbms/admin/u0801060.sql
   This will also rebuild the datadictionary (catalog, catproc)
6. You optionally run utlrp.sql to automatically rebuild any invalid objects
7. Change on unix oratab for new $ORACLE_HOME
8. Change listener.ora for $ORACLE_HOME value
9. Set COMPATIBLE in init.ora
10. Checks:
check the alert log and other logs
Also check oratab, optionally run utlrp.sql to automatically 
rebuild any invalid objects.
Check for invalid objects and check indexes.
Analyze all tables plus indexes.


=====================
26. Some info on Rdb:
=====================

Rdb is most often seen on Digtal unix, or OpenVMS VAX, or OpenVMS alpha,
but there exists a port to NT / 2000 as well.


Samples directory:
------------------

- digital unix: /usr/lib/dbs/vnn/examples
- OpenVMS: SQL$EXAMPLE 

In digital unix, to create a sample database:
$/usr/lib/dbs/sql/vnn/examples/personnel <database-form> <dir>
<database-form>: S, M, MSDB
<dir>: enter a directory where you want the database to be created.
$/usr/lib/dbs/sql/vnn/examples/personnel m /tmp/


Invoking SQL:
--------------

- In OpenVMS. Create a symbol
  $ SQL:==$SQL$
  
  $ SQL
  SQL>

- In digital unix:
  $ SQL
  SQL>


Attach to database:
-------------------

SQL>ATTACH 'FILENAME mf_personnel';

SQL>ATTACH 'FILENAME DISK$1:[GERALDO.DB]SUPPLIES MULTISCHEMA IS OFF'


Detach from database:
---------------------

SQL>exit
$

or

SQL>DISCONNECT DEFAULT;
SQL>


Editing a SQL Statement:
------------------------

SQL>EDIT

...

EXIT


OpenVMS: Defining a Logical name for a database:
------------------------------------------------

$ DEFINE SQL$DATABASE DISK01:[FIELDMAN.DBS]mf_personnel

You do not need to attach to the database anymore.


Digital unix: Defining a configuration parameter:
-------------------------------------------------

$ SQL_DATABASE /usr/fieldman/dbs/mf_personnel


SHOW Statements:
----------------

SQL> SHOW TABLES   -- shows all tables
SQL> SHOW TABLE *
SQL> SHOW ALL TABLES
SQL> SHOW TABLE WORK_STATUS  -- displays info about table WORK_STATUS
SQL> SHOW VIEWS  -- shows all views
SQL> SHOW VIEW CURRENT_SALARY  -- shows info about this view only
SQL> SHOW DOMAINS -- display all domains
SQL> SHOW DOMAIN DATE_DOM
SQL> SHOW INDEXES
SQL> SHOW INDEXES ON SALARY_HISTORY
SQL> SHOW INDEX DEG_EMP_ID
SQL> SHOW DATABASE  -- returns the database name
SQL> SHOW STORAGE AREAS


Single file or multifile database:
----------------------------------

A database that stores tables in one file (file type .rdb) is a
single file database. Alternately, you can have a database in which
system information is stored in a database root file (.rdb) and the data 
and metadata are stored in one or more storage area files (type .rda).

Single file: 
- a database root file which contains all user data and information
  about the status of all database operations.
- a snapshot file (.snp file) which contains copies of rows (before images) that
  are beiing modified by users updating the database.

Multifile:
- a database root file which contains information about the status of all database operations.
- a storage area file, .rda file, for the system tables (RDB$SYSTEM)
- one or more .rda files for user data.
- snapshot files for each .rda file and for the database root file.


Create multifile database example:
----------------------------------

$ SQL
SQL> CREATE DATABASE FILENAME mf_personnel_test
cont>       ALIAS MF_PERS
cont>       RESERVE 6 JOURNALS
cont>       RESERVE 15 STORAGE AREAS
cont>       DEFAULT STORAGE AREA default_area
cont>       SYSTEM INDEX COMPRESSION IS ENABLED
cont>    CREATE STORAGE AREA default_area FILENAME default_area
cont>    CREATE STORAGE AREA RDB$SYSTEM FILENAME pers_system_area;


Datatypes:
----------

Rdb                                         Oracle
--------------------------------------------------

CHAR                                        CHAR, NCHAR                                        
VARCHAR                                     VARCHAR2, NVARCHAR2
SMALLINT (16 bits)                          NUMBER(L,P)
INTEGER  (32 bits) can be used with         NUMBER(L,P)
          a scale factor INTEGER(2)
BIGINT   (64 bits)                          RAW, LONG, LONG RAW,
VARYING
DATE ANSI (year, month, day)
TIME
INTERVAL
TIMESTAMP (year, month, day,                DATE
           hours, min, sec)
DATE VMS


ODBC for RDB:
-------------

---------------
The current driver version is 3.00.02.05 which 
doesnt work, and the older driver version (which does 
work) is 2.10.17.00 (DriverConf1 outputs attached).

---------------
I am trying to run a DTS job to import data from an Oracle 7.3 RDB (DEC) platform into SLQ Server 2000. 
I have an odbc connection set up and I am using it in MS Access 2000 to view the table that I want to import. 
When I create the job in SQL Server, I can preview the data and everything looks fine, as in the Access table, 
but when I try and run the job I get an: 
[ORACLE][ODBC]Function Sequence Error 
error message. Any experience with these type of errors and RDB. 
Thanks, 
John Campbell 


This can - I understand - occur where the version of the ODBC drivers on the NT box with SQL Server running 
is incompatible with the services running on the VMS box. 
I can't remember the various numbers I'm afraid (or even where I found the stuff - it was some time ago). 

We're running VMS 7.2-1 and Oracle 7.3 and found that this produced a similar error with the most recent version 
of the Oracle ODBC Drivers for RdB - but we have no problems running the v2.10 drivers (v2.10.17 to be exact). 

HTH 
---------------

ODBC driver for RDB uses
SQSAPI32.ini 


JInitiator:
-----------

Oracle heeft deze standaard aangepast, specifiek gericht op het uitvoeren van Webforms. 
Deze aanpassingen houden verband met stabiliteit (bugfixes) en performance verbetering, 
zoals JAR file caching, incremental JAR file loading en applet caching. Met behulp van JInitiator 
kunnen Oracle Forms in een browser (Webforms) worden uitgevoerd. 

JInitiator is g��n JVM, maar een extensie op de JVM standaard, waarmee Oracle Webforms 
op een stabiele �n ondersteunde wijze in een browser kunnen worden uitgevoerd. 
JInitiator is alleen beschikbaar voor het Windows platform. Op dit moment is het niet mogelijk 
om Webforms uit te voeren in de standaard Microsoft JVM. Jinitiator zal in de volgende release 
niet meer terugkeren. Webforms wordt gecertificeerd op de standaard Java Plugin. 
De Microsoft JVM conformeert zich ook aan deze standaard (g��n certificatie), waardoor Webforms 
op termijn in een standaard Microsoft Internet Explorer browser uitgevoerd zal kunnen worden. 
Dit kan echter pas met zekerheid gesteld worden na grondig testen. 


Installatie JInitiator
JInitiator wordt bij het eerste gebruik automatisch gedownload vanaf de Application Server. 
Overigens kan de JInitiator ook handmatig worden ge�nstalleerd op de client machines.


============================
28. Some info on IFS 
============================

First some remarks about IFS in versions 9.0.2 and 9.0.3:

9.0.2
=====

In version 9.0.2, IFS (Internet File System) is a separate product.

9.0.3
=====

In version 9.0.3, CM SDK runs in conjunction with Oracle9i Application Server and an Oracle9i database. 
The Oracle Content Management SDK (Oracle CM SDK) is the new name for the product formerly known as the 
Oracle Internet File System (Oracle 9iFS). This new naming is official as of version 9.0.3. 
Oracle CM SDK runs in conjunction with Oracle9i Application Server and an Oracle9i database. 
Written entirely in Java, Oracle CM SDK is an extensible content management system with file server convenience. 


27.1 IFS 9.0.2
--------------
--------------

We first will turn our attention to iFS 9.0.2:
----------------------------------------------

The Oracle 9i database stores all content that comprises the filesystem,
from the files themselves to metadata like owners and group information.

On most occasions, 9iFS stores the files contents as LOB's in the database.

Tools:
------

- Oracle 9iFS Configuration Assistant.
Allows you to create a new 9iFS Domain, and add nodes etc..

- Oracle 9IFS Credential Manager Configuration Assistant.
To change the default credential manager to be applied to each user.

- OEM for 9iAS website (9iAS Home Page)
You can manage 9iFS from the 9iAS OEM website.

- OEM console (Oracle Enterprise Manager)
You can manage 9iFS from the OEM console.

- Oracle 9iFS Manager
Graphical java based interface on iFS.

- Webinterface iFS manager

- Command line utilities
ifsshell etc..

- Import/Export utility
The Import/Export utility exports Oracle 9iFS objects (content and users)
into an export file.

Domain:
-------

9iFS is organized in a Domain concept, with an administrative
Domain controller and possibly other nodes as members in the Domain.

Repository:
-----------

All data managed by 9iFS resides in an 9i database schema, called
the 9iFS repository. You specify the database instance and schemaname
during installation of 9iFS.

Commands:
---------

Stop IFS:

Oracle Internet File System 1.1.x
 ORACLE_HOME\ifs1.1\bin\ifsstop.bat
 
Oracle 9iFS 9.0.1 (and higher)
 ORACLE_HOME\9ifs\bin\ifsstopdomain.bat 


start iFS OC4J instance 
Windows NT or 2K:        > ifsstartoc4j.bat

start up ifs domain controller process 
Windows NT or 2K        > ifslaunchdc.bat  

Start ifs node processes
Windows NT or 2K        > ifslaunchnode.bat 

Activate the iFS domain controller and Nodes  
Windows NT or 2K        > ifsstartdomain.bat  

Here is a script example to run on windows NT or 2K:  

StartIfs902.bat
=============== 

D:\ora902\9ifs\bin\ifsstartoc4j.bat 
start D:\ora902\9ifs\bin\ifslaunchdc.bat 
start D:\ora902\9ifs\bin\ifslaunchdomain.bat 
D:\ora902\9ifs\bin\ifsstartdomain -s myifshost:53140 ifssys 
echo "iFS 902 started"  


- Home:

Oracle CM SDK must be installed in the Oracle9i Application Server, Release 2 home. 
Make sure to select the file location carefully; 
once installed, the Oracle CM SDK software cannot be moved without deinstalling and reinstalling.

Oracle 9iFS requires an Oracle 9.0.2 home, which means you must install and configure 
Oracle9i Application Server, Release 2 in an Oracle home separate from that of the database. 
The Oracle home can be on the same machine (resources allowing), or on a different machine. 


- Install with Oracle Universal Installer.

Installation and configuration of Oracle 9iFS starts from the Oracle Universal Installer, 
the graphical user interface wizard that copies all necessary software to the Oracle home 
on the target machine. 

The Oracle 9iFS Configuration tool launches automatically at the end of the Oracle Universal Installer process 
and guides you through the process of identifying the Oracle database to be used for the 
Oracle Internet File System schema; selecting the type of authentication to use 
(native Oracle 9iFS credential manager or Oracle Internet Directory for credential management); 
and various other configuration tasks. The specific configuration tasks vary, depending on the type 
of deployment (new Oracle 9iFS domain vs. additional Oracle 9iFS nodes, for example)
 

- Starting install wizard again:

ORACLE_HOME\ifs\cmsdk\bin\ifsca.bat 

- connect to database:

The Oracle CM SDK Configuration Assistant attempts to make a connection as SYS AS SYSDBA using a database string, 
and therefore needs the database to be configured with a password file. 

- Directory service:

Select either CMSDK Directory Service or Oracle Internet Directory Service for user authentication. 

The default Oracle Internet Directory super user name/password is cn=orcladmin/welcome1. 
The default Oracle Internet Directory root Oracle context is set to cn=OracleContext. 

- Launch Internet File System Manager from a Web browser: 

http://hostname.mycompany.com:7778/cmsdk/admin


Access paths and directory structure:
-------------------------------------

- Oracle FileSync Client Software:

In addition to using the networking protocols or client applications native to the Windows operating system, 
Windows users can install and use Oracle FileSync to keep local directories on a desktop machine and folders 
in Oracle CM SDK synchronized. 

Double-click Setup.exe to run the installation program, 
or run O:\ifs\clients\filesync\setup.exe from the Windows Start...Run Menu. 

- CUP (Command-line Utilities Protocol) Client

The Oracle Command-line Utilities Protocol server enables administrators and developers to perform a 
variety of tasks quickly and easily from a Windows command-line or a UNIX shell. 

copy /ifs/clients/cmdline/win32
to a local directory.


============================
28. Some info on 9iAS rel. 2
============================


28.1 General Information:
=========================


Oracle9i Application Server (Oracle9iAS) is a part of the Oracle9i platform, 
a complete and integrated e-business platform. Oracle9i platform consists of: 

- Oracle9i Developer Suite for developing applications 

- Oracle9i Application Server for deploying Internet applications 

- Oracle9i Database Server for storing content 


9iAS is not just a webserver. A webserver is only part of the 9iAS system. 9iAS offers OC4J 
(Oracle Containers for J2EE), portals, webserver and webcache, and BusinessIntelligence and other components.

OC4J:
-----

The "core" of the AS (thus the application part), is the OC4J architecture. The OC4J infrastructure supports
EJB, JSP and Servlet applications. Developers can write J2EE applications, like EJB, Servlet and JSP applications,
that will run on 9iAS.
OC4J itself is written in Java and runs on a Java virtual machine.

BusinessIntelligence:
---------------------

A set of services and client applications that make reports and all types of analysis possible.
For example, the 'Oracle Reports service' , an application in the middle tier, uses a queue for 
submitted client requests. These request might create reports of a Datawarehouse in a 
Customer database etc...


28.1.1 Components:
------------------

There are 3 install types:

-J2EE and Web Cache 
-Portal and Wireless 
-BusinessIntelligence and Forms 

Note: 

The Oracle 9iAS 9.0.2 Concepts and the 9iAS Install guides mentions 3 install types,
but the Admin guide Rel. 9.0.2 mentions 4 install types.
The fourth additional one is "Unified Messaging". This  Enables you to integrate different 
types of messages into a single framework. 
It includes all of the components available in the Business Intelligence and Forms install type. 


Component 			J2EE and Web Cache 	Portal and Wireless 	BusinessInt. and Forms 
Oracle9iAS Web Cache 		YES 			YES 			YES
Oracle HTTP Server		YES			YES			YES
Oracle9iAS Container for J2EE 	YES 			YES 			YES 
Oracle EM Web site 		YES 			YES 			YES
Oracle9iAS Portal 		no 			YES 			YES
Oracle9iAS Wireless 		no 			YES 			YES
Oracle9iAS Discoverer 		no 			no 			YES
Oracle9iAS Reports Services 	no 			no 			YES
Oracle9iAS Clickstream Int. 	no 			no 			YES
Oracle9iAS Forms Services 	no 			no 			YES
Oracle9iAS Personalization 	no 			no 			YES

 
28.1.2. Need of Oracle9iAS Infrastructure:
------------------------------------------

Prior to installing an instance of the "Portal and Wireless" 
or "Business Intelligence and Forms" install type, 
you must install and configure the Oracle9iAS Infrastructure 
somewhere in your network, optimally on a separate computer. 

The J2EE and Web Cache install type does not require Oracle9iAS Infrastructure. 

You can install single or multiple instances of Oracle9iAS install types, J2EE and Web Cache, Portal and Wireless, 
and Business Intelligence and Forms, on the same host, which is not a very realistic scenarion.

Multiple instances of different Oracle9iAS install types, can use one instance of Oracle9iAS Infrastructure,
and this could be a realistic scenario.


28.1.3. Metadata Repository in the Infrastructure:
--------------------------------------------------

The Oracle9iAS Infrastructure installation consists of: 

- Oracle9iAS Metadata Repository: 
  Pre-seeded database containing metadata needed to run Oracle9iAS instances. 

- Oracle Internet Directory OID: 
  Directory service that enables sharing information about dispersed users and network resources. 
  Oracle Internet Directory implements LDAP v3. 

- Oracle9iAS Single Sign-On SSO: 
  Creates an enterprise-wide user authentication to access multiple accounts 
  and Oracle9iAS applications. 

- Oracle Management Server OMS: 
  Processes system management tasks and administers the distribution of these tasks 
  across the network using the Oracle Enterprise Manager Console. 
  The Console and its three-tier architecture can be used with the 
  Oracle Enterprise Manager Web site to manage not only Oracle9iAS, but your entire Oracle environment. 

- J2EE and Web Cache: 
  For internal use with Oracle9iAS Infrastructure. Not used for component application deployment. 


Application server installations and their components use an infrastructure in the following ways: 

-- Components and applications use the Single Sign-on service provided by Oracle9iAS Single Sign-On. 

-- Application server installations and components store configuration information and 
   user and group privileges in Oracle Internet Directory. 

-- Components use schemas that reside in the metadata repository. 

SSO is required for "Portal and Wireless" and "Business Intelligence and Forms" install types. 
Also required for application server clustering with J2EE and Web Cache install type. 


28.1.4. Customer database:
--------------------------

This could be any database on any Host, containing business data.
But,

The following components require a customer database: 

Oracle9iAS Discoverer 

Oracle9iAS Personalization 

Oracle9iAS Unified Messaging 

If you configure any of these components during installation, their setup and configuration will not be 
complete at the end of installation. You need to take additional steps to install and tune a customer database, 
load schemas into the database, and finish configuring the component to use the customer database. 


28.1.5. Oracle Home:
--------------------

Oracle home is the directory in which Oracle software is installed. 

Different Oracle versions always get their own Oracle Homes.

Multiple instances of Oracle9iAS install types (J2EE and Web Cache, Business Intelligence and Forms, 
and Portal and Wireless) must be installed in separate Oracle homes on the same computer.

You must install Oracle9iAS Infrastructure in its own Oracle home directory, preferably on a separate host. 
The Oracle9iAS installation cannot exist in the same Oracle home as the Oracle9iAS Infrastructure installation. 


28.1.6. Oracle9iAS Infrastructure Port Usage:
---------------------------------------------

!! Oracle9iAS Infrastructure requires exclusive use of port 1521

Installation of Oracle9iAS Infrastructure requires exclusive use of port 1521 on your computer. 
If one of your current system applications uses this port, 
then complete one of the following actions before installing Oracle9iAS Infrastructure: 

If you have an existing application using port 1521, 
then reconfigure the existing application to use another port. 

If you have an existing Oracle Net listener and an Oracle9i database, then proceed 
with the installation of Oracle9iAS Infrastructure. 
Your Oracle9iAS Infrastructure will use the existing Oracle Net listener. 

If you have an existing Net8 listener in use by an Oracle8i database, 
then you must upgrade to the Oracle9i Net listener version by installing Oracle9iAS Infrastructure. 


28.1.6. Using the Oracle Enterprise Manager Console:
----------------------------------------------------

The Oracle Enterprise Manager console provides a wider view of your Oracle environment, 
beyond Oracle9iAS. Use the Console to automatically discover and manage databases, 
application servers, and Oracle applications across your entire network. 

The Console and its related components are installed with the Oracle Management Server 
as part of the Oracle9iAS Infrastructure installation option. 
The Console is part of the Oracle Management Server component of the Oracle9iAS Infrastructure. 
The Management Server, the Console, and Oracle Agent are installed 
on the Oracle9iAS Infrastructure host, along with the other infrastructure components. 

28.1.7. Starting and Stopping the Oracle Management Server on Windows:
----------------------------------------------------------------------

On Windows systems, use the Services control panel to start and stop the management server. 
The name of the service is in the following format: 

OracleORACLE_HOMEManagementServer

For example: 

OracleOraHome902ManagementServer


28.1.8. OEM Website:
--------------------

You can verify the Enterprise Manager Web site is started by pointing your browser to the Web site URL. 
For example: 

http://hostname:1810

get console http://hostname:1810  http://127.0.0.1:1810
get welcome http://hostname:7777

To start or stop the Enterprise Manager Web site on Windows, use the Services control panel. 
The name of the service is in the following format: 

OracleORACLE_HOMEEMwebsite

Or
Start the Enterprise Manager Web site

(UNIX)    ORACLE_HOME/bin/emctl start
(Windows) ORACLE_HOME\bin\emctl start

 
Stop the Enterprise Manager Web site
 emctl stop 


Example Services:

   Oracleias902Discoverer
   Oracleias902ProcessManager
   Oracleias902WebCache
   Oracleias902WebCacheAdmin
   Oracleinfra902Agent                        = Agent for Management Server
   Oracleinfra902EMWebsite                    = Enterprise Manager Web site
   Oracleinfra902InternetDirectory_iasdb
   Oracleinfra902ManagementServer             = OEM Management Server
   Oracleinfra902ProcessManager
   OracleOraHome901TNSListener                = just the Listener
   OracleServiceIASDB		              = infra structure db
   OracleServiceO901		              = regular customer db


Note for Oracle 10g RDBMS EM DB console:
========================================

Sites:
------

Enterprise Manager Database Control URL - (dbname) :
http://hostname:1158/em
http://127.0.0.1:1810
http://127.0.0.1:1158

The iSQL*Plus URL is:
http://localhost:5561/isqlplus

The iSQL*Plus DBA URL is:
http://localhost:5561/isqlplus/dba

emctl prompt tool:
------------------

C:\ora10g\product\10.2.0\db_1\NETWORK\ADMIN>emctl status dbconsole
Oracle Enterprise Manager 10g Database Control Release 10.2.0.1.0
Copyright (c) 1996, 2005 Oracle Corporation.  All rights reserved.
http://xpwsora:1158/em/console/aboutApplication
Oracle Enterprise Manager 10g is running.

Logs are generated in directory C:\ora10g\product\10.2.0\db_1/xpwsora_SPLCONF/sysman/log

Services:
---------

C:\ora10g\product\10.2.0\db_1\NETWORK\ADMIN>net start | find "Ora"
   OracleDBConsolesplconf
   OracleOraDb10g_home1iSQL*Plus
   OracleOraDb10g_home1TNSListener
   OracleServiceSPLCONF

C:\ora10g\product\10.2.0\db_1\NETWORK\ADMIN>


28.1.9. emctl tool :  for controlling EM website:
-------------------------------------------------

Enterprise manager homepage http://hostname:1810 can only be accessed if EM webste is running.

Usage:: 
       emctl start|stop|status
       emctl reload | upload 
       emctl set credentials [<Target_name>[:<Target_Type>]]
       emctl gencertrequest
       emctl installcert [-ca|-cert] <certificate base64 text file>
       emctl set ssl test|on|off|password [<old password> <new password>]
       emctl set password <old password> <new password> 
       emctl authenticate <pwd> 
       emctl switch home [-silent <new_home>]
       emctl config <options>

emctl start                      : Start the Enterprise Manager Web site.
emctl stop                       : Stop the Enterprise Manager Web site (requires ias_admin password).
emctl status                     : Verify the status of the Enterprise Manager Web site.
emctl set password new_password  : Reset the ias_admin password.
emctl authenticate password      : Verify that the supplied password is the ias_admin password.

emctl config options can be listed by typing "emctl config"

emctl status
C:\temp>emctl status
EMD is up and running : 200 OK


28.1.10. OEMCTL tool: for controlling Management Server:
--------------------------------------------------------

EM control

D:\temp>oemctl
"Syntax: OEMCTL START  OMS                                                   "
"    OEMCTL STOP   OMS          <EM Username>/<EM Password>"
"    OEMCTL STATUS OMS          <EM Username>/<EM Password>[@<OMS-HostName>]"
"    OEMCTL PING OMS "
"    OEMCTL START  PAGING       [BootHost Name]                          "
"    OEMCTL STOP   PAGING       [BootHost Name]                          "
"    OEMCTL ENABLE EVENTHANDLER"
"    OEMCTL DISABLE EVENTHANDLER"
"    OEMCTL EXPORT EVENTHANDLER <filename>"
"    OEMCTL IMPORT EVENTHANDLER <filename>"
"    OEMCTL DUMP EVENTHANDLER"
"    OEMCTL IMPORT REGISTRY <filename> <Rep Username>/<Rep Password>@<RepAlias>"
"    OEMCTL EXPORT REGISTRY <Rep Username>/<Rep Password>@<RepAlias>"
"    OEMCTL CONFIGURE RWS"


28.1.11. The Intelligent Agent:
-------------------------------

The Oracle Intelligent Agent is installed whenever you install Oracle9iAS on a host computer. 
For example, if you select the J2EE and Web Cache installation type, the Oracle Universal Installer 
installs Oracle Enterprise Manager Web site and the Oracle Intelligent Agent, 
along with the J2EE and Web Cache software. This means the Intelligent Agent software 
is always available if you decide to use the Console and the Management Server 
to manage your Oracle9iAS environment. 

The Console and Management Server are installed as part of the Oracle9iAS Infrastructure. 
In most cases, you install the Infrastructure on a dedicated host that can be used to 
centrally manage multiple application server instances. The Infrastructure includes 
Oracle Internet Directory, Single Sign-On, the metadata repository, the Intelligent Agent, 
and Oracle Management Server.  

You only need to run the Intelligent Agent if you are using Oracle Management Server in your enterprise. 
In order for Oracle Management Server to detect application server installations on a host, 
you must make sure the Intelligent Agent is started. 
Note that one Intelligent Agent is started per host and must be started after every system boot. 


28.1.12. AGENTCTL: for controlling the Intelligent Agent:
---------------------------------------------------------

(UNIX) You can run the following commands in the Oracle home of the primary installation 
(the first installation on the host) to get status and start the Intelligent Agent: 

ORACLE_HOME/bin/agentctl status agent
ORACLE_HOME/bin/agentctl start agent

(Windows) You can check the status and start the Intelligent Agent using the Services control panel. 
The name of the service is in the following format: 

OracleORACLE_HOMEAgent (the executable is agntsrvc.exe)

start the Intelligent Agent in the Oracle home of the primary installation: 

ORACLE_HOME/bin/agentctl start agent


28.1.13. Backup and Restore:
----------------------------

To ensure that you can make a full recovery from media failures, 
you should perform regular backups of the following: 

- Application Server and Infrastructure Oracle Homes 
- Oracle Internet Directory 
- Metadata Repository 
- Customer Databases 

You should perform regular backups of all files in the Oracle home of each application server 
and infrastructure installation in your enterprise using your preferred method of filesystem backup. 

Oracle Internet Directory offers command-line tools for backing up and restoring 
the Oracle Internet Directory schema and subtree. 

The metadata repository is an Oracle9i Enterprise Edition Database that you can back up and restore 
using several different tools and operating system commands. 

The customer databases can be backupped using any standard method, the same way you would do
for any other 9iEE database.

 
Applications:
=============
 

28.2 Report services:
---------------------

Client contacts the Report Server
- Web,through url
- Nonweb, rwclient

-requests goes to a jobqueue

-users with webbrowser:
 http Server must be running, and you use or reports servlet, a JSP, or CGI components on 9iAS
 
 The reports server must be running.

- default it is an inprocess server
  httpd -> mod_oc4j {reports servlet} -> Reports Server

- CGI
  httpd -> CGI -> Reports Server

- starting from URL:
  http://machine:port/reports/rwservlet

  commandline:
  rwserver server=machinename

- The servlet is part of the OC4J instance: OC4J_BI_FORMS

- its possible to make it a service of its own:
  rwserver -install autostart=yes/no

- verify the Reports Servlet and Server Are Running:

  http://missrv/rwservlet/help
  (show help page with rwservlet command line arguments)

  http://machine:port/reports/rwservlet/showjobs?server=server_name
  (show a listing of the jobqueue)

  IP:7778/reports/rwservlet/showenv
  http://<hostname>:<port>/reports/rwservlet/getserverinfo? 
  http://<hostname>:<port>/reports/rwservlet/getserverinfo?authid=orcladmin/<password of ias_admin> 
  http://machinename/servlet/RWServlet/showmap?server=Rep60_servername 

- stopping Reports Server:

  commandline:
  rwserver server=machinename shutdown=normal/immediate authid=admin/password

  Enterprise Manager: stop Reports Server

The reports servlet uses the PORT parameter configured in the 
httpd.conf 

  reports_user/welcome1
  ias_admin/welcome1
  orcladmin /welcome1


Reports Servlet
url			: http://missrv:7778/reports/rwservlet
em username	        : reports_user
em password	        : welcome1
reports store	        : d:\reports (change in registry, key is REPORTS_PATH)


- Reports Server configuration files:

  ORACLE_HOME\reports\conf\server_name.conf
  ORACLE_HOME\reports\dtd\rwserverconf.dtd
  ORACLE_HOME\reports\conf\rwbuilder.conf
  ORACLE_HOME\reports\conf\rwservlet.properties


- Check miskm.propery files:

$9ias_home\j2ee\OC4J_iFS_cmsdk\applications\brugpaneel\FrontOffice\WEB-INF\classes.
Het gaat om de volgende bestanden:

  misIfs.properties	: parameters van iFs interface/front office.
  miskm.properties	: parameters van MIS Front Office applications
  XSQLConfig.xml	: XSQL Parameters, moet wijzen naar mis_owner schema. 

Er wordt ook gebruik gemaakt van JDBC. De instellingen van deze connectie staan in het bestand: 
$9ias_home\j2ee\OC4J_iFS_cmsdk\applications\brugpaneel\META-INF\data-sources.xml

miskm.properties:
-----------------

# miskm.reports parameters are used in order to display reports that are built
# using Oracle Reports.

# The action of the hidden form.
#miskm.reports.action=http://dgas40.mindef.nl/reports/rwservlet
miskm.reports.action=http://missrv.miskm.mindef.nl:7778/reports/rwservlet

# The schemaname/schemapassword@tns_names entry where the data is stored.
#miskm.reports.connectstring=mis_owner/mis_owner@miskm_demo
miskm.reports.connectstring=mis_owner/mis_owner@miskm_dev

# The name of the Reports Server (after default installation: rep_missrv)
#miskm.reports.repserver=rep_dgas40
miskm.reports.repserver=rep_missrv

# The location where the output is placed on the server.
miskm.reports.destype=cache

# The output of the the generated report (e.g html, pdf, etc.)
#miskm.reports.desformat=pdf
miskm.reports.desformat=rtf&mimetype=application/msword

# The reports server is a partner application, therefore a sso username/password
# is required.
miskm.reports.ssoauthid=reports_user/welcome1


- Reports Server configuration files:

  ORACLE_HOME\reports\conf\server_name.conf
  ORACLE_HOME\reports\dtd\rwserverconf.dtd
  ORACLE_HOME\reports\conf\rwbuilder.conf
  ORACLE_HOME\reports\conf\rwservlet.properties (inprocess or standallone)

reports_server_name.conf
cgicmd.dat
jdbcpds.conf
proxyinfo.xml
rwbuilder.conf
rwserver.template
rwservlet.properties
textpds.conf
xmlpds.conf		in  ORACLE_HOME/reports/conf


Reports Servlet 9i

Rapportages worden gemaakt met behulp van de Reports Builder en moeten worden opgeslagen 
in een directory op de applicatieserver (standaard is dit d:\reports). 
Om de Reports Servlet te laten weten waar allemaal reports zijn opgeslagen dient de registersleutel 
van Windows REPORTS_PATH  te worden uitgebreid met de directory waar de rapportages zijn opgeslagen.

De servlet is onderdeel van de OC4J instance: OC4J_BI_FORMS, dus om hier gebruik van te maken,
moet deze instance opgestart zijn.

De servlet maakt gebruik van Oracle SSO en daarom dient een er een SSO gebruiker aangemaakt te worden 
die in staat is om gebruik te maken van de servlet:
1.	Ga naar http://missrv.miskm.mindef.nl:7777/oiddas
2.	Log in als de portal gebruiker (standaard portal/welcome1)
3.	Maak een nieuwe gebruiker aan, bijvoorbeeld: reports_user.
4.	Sta deze gebruiker de privilege: �Allow resource management for Oracle Reports and Forms� toe.
5.	Controleer of deze gebruiker overeenstemt met de sleutel: miskm.reports.ssoauthid in het bestand miskm.properties


28.3 Internet Directory and Single Sign-On:
-------------------------------------------

Oracle Internet Directory, an LDAP directory, provides a single repository and administration for user accounts. 

Oracle9iAS Single Sign-On enables users to login to Oracle9iAS and gain access to those applications for which they 
are authorized, without requiring them to re-enter a user name and password for each application. 
It is fully integrated with Oracle Internet Directory, which stores user information. It supports LDAP-based 
user and password management through OID. 

Oracle Internet Directory is installed as part of the Oracle9iAS Infrastructure installation. 
Oracle9iAS Single Sign-On is installed as part of the Oracle9iAS Infrastructure installation. 

SSO is Portal's authentication engine. In 9iAS all applications may use SSO.
Without a functioning SSO, users will not be able to logon and use SSO. The first test following
a failure to authenticate is to login directly using SSO:

http://servername:port/pls/orasso


Examples:

Single Sign-On Server	: oasdocs.us.oracle.com:7777
Internet Directory	: oasdocs.us.oracle.com:389
Infrastructure database : iasdb.oasdocs.us.oracle.com	

missrv.miskm.mindef.nl:1521:iasdb

In a start script, you may find commands like the following to start the OID server:

%INFRA_BIN%\oidmon start
%INFRA_BIN%\oidctl server=oidldapd instance=1 start

In a stop script, you may notice the following commands to stop the OID server:

%INFRA_BIN%\oidctl server=oidldapd instance=1 stop
%INFRA_BIN%\oidmon stop


When oidctl is executed, it connects to the database as user ODSCOMMON and simply inserts/updates rows 
into a table ODS.ODS_PROCESS depending on the options used in the command. A row is inserted if the START option 
is used, and  updated if the STOP or RESTART option is used. So there are no processes started at this point, 
and LDAP server is not started. 


Both the listener/dispatcher process and server process are called oidldapd on unix, and oidldapd.exe on NT.
Oidmon is also a process (called oidmon on unix, oidmon.exe/oidservice.exe on windows). 

To control the processes (servers) we need to have OID Monitor (oidmon) running. This monitor is often called 
daemon or guardian process as well. When oidmon is running, it periodically connects to the database and reads 
the ODS.ODS_PROCESS table in order to start/stop/restart related processes.   
 
NOTE:

Because the only task oidctl has is to insert / update table ODS.ODS_PROCESS in the database, 
it's obvious that the database and listener have to be fully accessible when oidctl is used.

Also, oidmon connects periodically to the database. So the database and listener must be
accessible for oidmon to connect.


28.4 Example and default values:
--------------------------------

Information                                 Example Values Your Information 
Oracle home location                        D:\ora9ias

Instance Name                               instance1
ias_admin Password                          welcome1

Single Sign-On Server HostName/server       oasdocs.us.oracle.com
Single Sign-On Port Number                  7777

Internet Directory Hostname/server          oasdocs.us.oracle.com
Internet Directory Port Number              389 / 4032
Internet Directory Username                 orcladmin, cn=orcladmin  (the Oracle Internet Directory administrator) 
Internet Directory Password                 welcome1 

9iAS Metadata Repository                    oasdocs.us.oracle.com
9iAS Reports Services Outgoing Mail Server  oasdocs.us.oracle.com

http Server                                 oasdocs.us.oracle.com:7777

Metadata database connection string         oasdocs.us.oracle.com:1521:iasdb:iasdb.oasdocs.us.oracle.com

Oracle Universal Installer creates a file showing the port assignments during installation of Oracle9iAS components. 
This file is ORACLE_HOME\install\portlist.ini
It contains entries like the following default values:

Oracle HTTP Server port = 7777
Oracle HTTP Server SSL port = 4443
Oracle HTTP Server listen port = 7778
Oracle HTTP Server SSL listen port = 4444
Oracle HTTP Server Jserv port = 8007
Enterprise Manager Servlet port = 1810

The ID username and password are defined in Oracle Internet Directory as either the: 

- orcladmin (root user) 
- a user who is member of the IASAdmins group in Oracle Internet Directory 

The SSO schema is now 'ORASSO' and the ORASSO user is registered with OID after an infra install. 
THe default user is 'orcladmin' with a login of your ias_admin password. 


EM Website:          http://<hostname.domain>:<port>  
(port 1810 assigned by default)           
You will login using the 'ias_admin' username and the password you entered          
during the Infrastructure installation.        

SSO Login Page:          http://<hostname.domain>:<port>/pls/orasso           
You will login using the 'orcladmin' username and the password for the 'ias_admin'.          
The port will be the HTTP Server port of your Infrastructure, (port 7777 by default)  
http://missrv.miskm.mindef.nl:7777/pls/orasso 
      

OID_DAS Page:          http://<hostname.domain>:<port>/oiddas           
You will login using the 'orcladmin' username and the password for the 'ias_admin'.          
The port will be the HTTP Server port of your Infrastructure, (port 7777 by default).          
The OC4J_DAS component must be UP for this test to succeed.  


28.5 Management tools:
----------------------

28.5.1. OEM Website:
-------------------

You can access the Welcome Page by pointing your browser to the HTTP Server URL for your installation. 
For example, the default HTTP Server URL is: 

http://hostname:7777

This page offer many options to explore features of 9iAS.

You can also go directly to the Oracle Enterprise Manager Web site using the following instructions: 

http://hostname:1810  http://

Enterprise manager homepage http://hostname:1810 can only be accessed if EM webste is running.
This corresponds to the service like "Oracleinfra902EMWebsite". 

The username for the administrator user is ias_admin. 
The password is defined during the installation of Oracle9iAS. The default password is welcome1.

Depending upon the options you have installed, the Administration section of the Oracle9iAS Instance Home Page 
provides additional features that allow you to perform the following tasks: 

-Associate the current instance with an existing Oracle9iAS Infrastructure. 
-Configure additional Oracle9iAS components that have been installed, but not configured 
-Change the password or default schema for a component 


Start or stop on NT/W2K:

To start or stop the Enterprise Manager Web site on Windows, use the Services control panel. 
The name of the service is in the following format: 

OracleORACLE_HOMEEMwebsite

For example, if the name of the Oracle Home is OraHome902, the service name is: 
OracleOraHome902EMWebsite

You can also use 
net start OracleOraHome902EMWebsite
net stop OracleOraHome902EMWebsite

Start or stop on UNIX:

Start the Enterprise Manager Web site:  emctl start
 
Stop the Enterprise Manager Web site:  emctl stop 
Or use the kill command if it does not respond
 

Changing the ias_admin Password:

1. Using Oracle Enterprise Manager Web Site: 
   
   Navigate to the Instance Home Page. Select Preferences in the top right corner. 
   This displays the Change Password Page. 

   Enter the new password and new password confirmation. Click OK. 
   This resets the ias_admin password for all application server installations on the host. 

   Restart the Oracle Enterprise Manager Web site. 

2. Using the emctl Command-Line Tool: 

   To change the ias_admin user password using a command-line tool: 

   Enter the following command in the Oracle home of the primary installation 
   (the first installation on the host): 

  (UNIX) ORACLE_HOME/bin/emctl set password new_password
  (Windows) ORACLE_HOME\bin\emctl set password new_password

  For example: 

  (UNIX) ORACLE_HOME/bin/emctl set password m5b8r5
  (Windows) ORACLE_HOME\bin\emctl set password m5b8r5

  Restart the Enterprise Manager Web site. 


The Enterprise Manager Web site relies on various technologies to discover, monitor, 
and administer the Oracle9iAS environment. These technologies include: 

- Oracle Dynamic Monitoring Service (DMS) 
  The Enterprise Manager Web site uses DMS to gather performance data about your Oracle9iAS components. 

- Oracle HTTP Server and Oracle Containers for J2EE (OC4J) 
  the Enterprise Manager Web site also uses HTTP Server and OC4J to deploy its management components. 

- Oracle Process Management Notification (OPMN) 
  OPMN manages Oracle HTTP Server and OC4J processes within an application server instance. 
  It channels all events from different component instances to all components interested in receiving them. 

- Distributed Configuration Management (DCM) 
  This will be used with clusters or farms. 
  DCM manages configurations among application server instances 
  that are associated with a common Infrastructure (members of an Oracle9iAS farm). 
  It enables Oracle9iAS cluster-wide deployment so you can deploy an application to an entire cluster, 
  or make a single host or instance configuration change applicable across all instances in a cluster. 


28.5.2 OEM Console:
-------------------

The console is a non Web, Java tool, and part of the 3-tier OMS architecture.
See also section 28.1.

The Oracle Enterprise Manager console provides a wider view of your Oracle environment, 
beyond Oracle9iAS. Use the Console to automatically discover and manage databases, 
application servers, and Oracle applications across your entire network. 

The Console and its related components are installed with the Oracle Management Server 
as part of the Oracle9iAS Infrastructure installation option. 
The Console is part of the Oracle Management Server component of the Oracle9iAS Infrastructure. 
The Management Server, the Console, and Oracle Agent are installed 
on the Oracle9iAS Infrastructure host, along with the other infrastructure components. 

The Console offers advanced management features, such as an Event system to notify administrators 
of changes in your environment and a Job system to automate standard and repetitive tasks, 
such as executing a SQL script or executing an operating system command. 

The Console and Management Server are installed as part of the Oracle9iAS Infrastructure.

Use the OEMCTL commandline tool for controlling OMS. See section 28.1.10.


29. Starting and stopping 9iAS and components:
==============================================


29.1 Starting a simple Webcache/J2EE installation:
--------------------------------------------------

Start the Enterprise Manager Web site. 
Even though you are not using the Web site, this ensures that the processes to support the 
dcmctl command-line tool are started. To start the Web site, execute the following command 
in the Oracle home of the primary installation on your host: 

  (UNIX) ORACLE_HOME/bin/emctl start
  (Windows) ORACLE_HOME\bin\emctl start

Start Oracle HTTP Server and OC4J (the rest of the commands in this section should be executed 
in the Oracle home of the J2EE and Web Cache instance): 

 (UNIX) ORACLE_HOME/dcm/bin/dcmctl start
 (Windows) ORACLE_HOME\dcm\bin\dcmctl start

If Web Cache is configured, start Web Cache: 

 (UNIX) ORACLE_HOME/bin/webcachectl start
 (Windows) ORACLE_HOME\bin\webcachectl start


29.2 Startin and stopping Advanced 9iAS installations
-----------------------------------------------------


Start/Stop Enterprise:
----------------------

Starting an Application Server Enterprise:
The order in which to start the pieces of an application server enterprise is as follows: 

1. Start the infrastructure. 
   If your enterprise contains more than one infrastructure, start the primary infrastructure first. 

2. Start customer databases. 
   If your enterprise contains customer databases, you can start them using several methods, 
   including SQL*Plus and Oracle Enterprise Manager Console. 
   Remember that iFS could also be installed into the customer database.

3. Start application server instances. 
   You can start application server instances in any order. 
   If instances are part of a cluster, start them as part of starting the cluster. 


The order in which to stop the pieces of an application server enterprise is as follows: 

1. Stop application server instances. 
   You can stop application server instances in any order. 
   If instances are part of a cluster, stop them as part of stopping the cluster. 

2. Stop customer databases. 
   If your enterprise contains customer databases, you can stop them using several methods, 
   including SQL*Plus and Oracle Enterprise Manager Console. 

3. Stop the infrastructure. 
   If your enterprise contains more than one infrastructure, stop the primary infrastructure last. 


Start/Stop Instance:
--------------------

Start:

First you have started the infrastructure instance, and customer database instance.


1. Preliminary:

- Enterprise Manager Web Site (Required):

The first step before starting an application server instance is to ensure that the Enterprise Manager Web site 
is running on the host. The Web site provides underlying processes required to run an application server instance 
and must be running even if you intend to use command-line tools to start your instance.

There is one Enterprise Manager Web site per host. It resides in the primary installation (or first installation) 
on that host. The primary installation can be an application server installation or an infrastructure. 
This Web site usually listens on port 1810 and provides services to all application server instances 
and infrastructures on that host. 

To verify the status of the Enterprise Manager Web site, run the following command in the Oracle home of the 
primary installation: 

(UNIX) ORACLE_HOME/bin/emctl status
(Windows) ORACLE_HOME\bin\emctl status

To start the Enterprise Manager Web site, run the following command in the Oracle home of the primary installation: 

(UNIX) ORACLE_HOME/bin/emctl start
(Windows) ORACLE_HOME\bin\emctl start

Or on NT/W2K: net start OracleORACLE_HOMEEMwebsite

- Intelligent Agent (Optional)

You only need to run the Intelligent Agent if you are using Oracle Management Server in your enterprise. 
In order for Oracle Management Server to detect application server installations on a host, 
you must make sure the Intelligent Agent is started. Note that one Intelligent Agent is started per host 
and must be started after every system boot. 

(UNIX) You can run the following commands in the Oracle home of the primary installation 
(the first installation on the host) to get status and start the Intelligent Agent: 

ORACLE_HOME/bin/agentctl status agent
ORACLE_HOME/bin/agentctl start agent

(Windows) You can check the status and start the Intelligent Agent using the Services control panel. 
The name of the service is in the following format: 

OracleORACLE_HOMEAgent


2. Start the instance using OEM Website:

You can start, stop, and restart all types of application server instances using the 
Instance Home Page on the Enterprise Manager Web site.

Or...

3. Start the 'J2EE and Web Cache' instance using commands:

  Start OEM Website:  ORACLE_HOME\bin\emctl start  or  net start OracleORACLE_HOMEEMwebsite

  Start Oracle HTTP Server and OC4J: ORACLE_HOME\dcm\bin\dcmctl start

  If Web Cache is configured, start Web Cache: ORACLE_HOME\bin\webcachectl start


4. Stop the 'J2EE and Web Cache' instance using commands:

  ORACLE_HOME\bin\webcachectl stop
  ORACLE_HOME\dcm\bin\dcmctl stop


Start/Stop components:
----------------------

You can start, stop, and restart individual components using the Instance Home Page or the component home page 
on the Enterprise Manager Web site. You can also start and stop some components using command-line tools. 

Oracle HTTP Server
 Start: ORACLE_HOME\dcm\bin\dcmctl start -ct ohs
 Stop : ORACLE_HOME\dcm\bin\dcmctl stop -ct ohs
 
Individual OC4J Instances
 Start:  ORACLE_HOME\dcm\bin\dcmctl start -co instance_name
 Stop :  ORACLE_HOME\dcm\bin\dcmctl stop -co instance_name
 
All OC4J Instances
 Start:  ORACLE_HOME\dcm\bin\dcmctl start -ct oc4j
 Stop :  ORACLE_HOME\dcm\bin\dcmctl stop -ct oc4j
 
Web Cache
 Start:  ORACLE_HOME\bin\webcachectl start
 Stop :  ORACLE_HOME\bin\webcachectl stop
 
Reports
 Start:  ORACLE_HOME\bin\rwserver server=name
 Stop :  ORACLE_HOME\bin\rwserver server=name shutdown=yes
 

You cannot start or stop some components. The radio buttons in the Select column on the Instance Home Page 
are disabled for these components, and their component home pages do not have Start, Stop, or Restart buttons. 


Start/Stop the Infrastructure:
------------------------------

No matter which procedure you use, starting an infrastructure involves performing the following steps in order: 

Start the Metadata Repository = infrastructure database
Start OID, Oracle Internet Directory 
Start the Enterprise Manager Web site. 
Start OHS, Oracle HTTP Server
Start the OC4J_DAS instance 
Start Web Cache (optional) 
Start Oracle Management Server and Intelligent Agent (optional) 


No matter which procedure you use, stopping an infrastructure involves performing the following steps in order: 

Stop all middle-tier application server instances that use the infrastructure. 
Stop Oracle Management Server and Intelligent Agent (optional) 
Stop Web Cache (optional) 
Stop OC4J instances 
Stop Oracle HTTP Server 
Stop Oracle Internet Directory 
Stop the Metadata Repository 


The next section describes how to start an infrastructure using command-line tools on Windows. 
Except where noted, all commands should be run in the Oracle home of the infrastructure. 


-- ---------------------------------------------------------------------

 -Start the metadata repository listener: 

  ORACLE_HOME\bin\lsnrctl start

 -Set the ORACLE_SID environment variable to the metadata repository system identifier (default is iasdb). 

  You can set the ORACLE_SID system variable using the System Properties control panel. 

 -Start the metadata repository instance using SQL*Plus: 

  ORACLE_HOME\bin\sqlplus /nolog
  sql> connect sys/password_for_sys as sysdba
  sql> startup
  sql> quit

-- ---------------------------------------------------------------------
 - Start Oracle Internet Directory. 

   Make sure the ORACLE_SID is set to the metadata repository system identifier (refer to previous step). 
   Start the Oracle Internet Directory monitor: 

   ORACLE_HOME\bin\oidmon start

 -Start the Oracle Internet Directory server: 

  ORACLE_HOME\bin\oidctl server=oidldapd configset=0 instance=n start

  where n is any instance number (1, 2, 3...) that is not in use. For example: 
  ORACLE_HOME\bin\oidctl server=oidldapd configset=0 instance=1 start

-- ---------------------------------------------------------------------
 - Start the Enterprise Manager Web site. 

  Even though you are using command-line, the Web site is required because it provides underlying support 
  for the command-line tools. The Web site must be started after every system boot. 
  You can check the status and start the Enterprise Manager Web site using the Services control panel. 
  The name of the service is in the following format: OracleORACLE_HOMEEMwebsite

  You can also start the service using the following command line: 

  net start WEB_SITE_SERVICE_NAME

-- ---------------------------------------------------------------------
 -Start Oracle HTTP Server. 

  ORACLE_HOME\dcm\bin\dcmctl start -ct ohs

  Note that starting Oracle HTTP Server also makes Oracle9iAS Single Sign-On available. 

-- ---------------------------------------------------------------------
 - Start the OC4J_DAS instance. 

  ORACLE_HOME\dcm\bin\dcmctl start -co OC4J_DAS

  Note that the infrastructure instance contains other OC4J instances, such as OC4J_home and OC4J_Demos, 
  but these do not need to be started; their services are not required and incur unnecessary overhead. 

-- ---------------------------------------------------------------------
 -Start Web Cache (optional). 

  Web Cache is not configured in the infrastructure by default, but if you have configured it, start it as follows: 

  ORACLE_HOME\bin\webcachectl start
-- ---------------------------------------------------------------------
 - Start Oracle Management Server and Intelligent Agent (optional). 

  Perform these steps only if you have configured Oracle Management Server. 
  Start Oracle Management Server: 

  ORACLE_HOME\bin\oemctl start oms

-- ---------------------------------------------------------------------

  Start the Intelligent Agent. 

  In order for Oracle Management Server to detect the infrastructure and any other application server 
  installations on this host, you must make sure the Intelligent Agent is started. Note that one Intelligent Agent 
  is started per host and must be started after every reboot. 

  You can check the status and start the Intelligent Agent using the Services control panel. 
  The name of the service is in the following format: 

  OracleORACLE_HOMEAgent


30. Creating a Database Access Descriptor (DAD) for mod_plsql:
---------------------------------------------------------------

Oracle HTTP Server contains the mod_plsql module, which provide support for building PL/SQL-based 
applications on the Web. 
PL/SQL stored procedures retrieve data from a database and generate HTTP responses containing data and code 
to display in a Web browser. 

In order to use mod_plsql you must install the PL/SQL Web Toolkit into a database and create a 
Database Access Descriptor (DAD) which provides mod_plsql with connection information for the database. 


31. Configuring HTTP Server, OC4J, and Web Cache:
--------------------------------------------------


You can use the OEM website in order to configure components as HTTP Server, OC4J, and Web Cache,
or 
you can manually edit configuration files.


If you edit Oracle HTTP Server or OC4J configuration files manually, instead of using the Enterprise Manager Web site, 
you must use the DCM command-line utility dcmctl to notify the DCM repository of the changes. Otherwise, 
your changes will not go into effect and will not be reflected in the Enterprise Manager Web site. 

Note that the dcmctl tool is located in: 
UNIX) ORACLE_HOME/dcm/bin/dcmctl
(Windows) ORACLE_HOME\dcm/bin\dcmctl

To notify DCM of changes made to: Use this command: 

Oracle HTTP Server configuration files:  dcmctl updateConfig -ct ohs
 
OC4J configuration files              :  dcmctl updateConfig -ct oc4j
 
All configuration files               :  dcmctl updateConfig
 
- HTTP Server:

You can configure Oracle HTTP Server using the Oracle HTTP Server Home Page on the Oracle Enterprise Manager Web site. 
You can perform tasks such as modifying directives, changing log properties, specifying a port for a listener, 
modifying the document root directory, managing client requests, and editing server configuration files. 

You can access the Oracle HTTP Server Home Page in the Name column of the System Components table on the Instance Home Page. 

- OC4J:

You can configure Oracle9iAS Containers for J2EE (OC4J) using the Enterprise Manager Web site. 
You can use the Instance Home Page to create and delete OC4J instances, each of which has its own OC4J Home Page. 
You can use each individual OC4J Home Page to configure the corresponding OC4J instance and its deployed applications. 

Creating an OC4J Instance.

Every application server instance has a default OC4J instance named OC4J_home. 
You can create additional instances, each with a unique name, within an application server instance. 

To create a new OC4J instance: 

- Navigate to the Instance Home Page on the Oracle Enterprise Manager Web site. Scroll to the System Components section. 
- Click Create OC4J Instance. This opens the Create OC4J Instance Page. 
- In the Create OC4J Instance Page, type a unique instance name in the OC4J instance name field. Click Create. 
- A new OC4J instance is created with the name you provided. 
- This OC4J instance shows up on the Instance Home Page in the System Components section. 
- The instance is initially in a stopped state and can be started any time after creation. 

Each OC4J instance has its own OC4J Home Page which allows you to configure global services 
and deploy applications to that instance. 


32. 9iAS CONFIG FILES:
-----------------------

---------------------------------------------
32.1 9iAS Rel. 2 most obvious config files:
---------------------------------------------


Oracle HTTP Server:
-------------------
httpd.conf		
oracle_apache.conf
access.conf
magic
mime.types
mod_oc4j.conf
srm.conf		in ORACLE_HOME/Apache/Apache/conf

JServ:
------ 
jserv.conf
jserv.properties
zone.properties		in ORACLE_HOME/Apache/Jserv/etc
			   
mod_oradav:
-----------
moddav.conf		in ORACLE_HOME/Apache/oradav/conf

mod_plsql:
---------- 
cache.conf
dads.conf		in ORACLE_HOME/Apache/modplsql/conf

Oracle9iAS Web Cache:
--------------------- 
internal.xml
internal_admin.xml
webcache.xml		in ORACLE_HOME/webcache

Oracle9iAS Reports Services:
----------------------------
reports_server_name.conf
cgicmd.dat
jdbcpds.conf
proxyinfo.xml
rwbuilder.conf
rwserver.template
rwservlet.properties
textpds.conf
xmlpds.conf		in  ORACLE_HOME/reports/conf

Oracle9iAS Discoverer:
----------------------
configuration.xml	in ORACLE_HOME/j2ee/OC4J_BI_Forms/applications/discoverer/web/WEB-INF/lib
viewer_config.xml	in ORACLE_HOME/j2ee/OC4J_BI_Forms/applications/discoverer/web/viewer_files
plus_config.xml		in ORACLE_HOME/j2ee/OC4J_BI_Forms/applications/discoverer/web/plus_files
portal_config.xml	in  ORACLE_HOME/j2ee/OC4J_BI_Forms/applications/discoverer/web/portal
pref.txt		in  ORACLE_HOME/discoverer902/util
.reg_key.dc		in  ORACLE_HOME/discoverer902/bin/.reg


---------------------------------------------
32.2 9iAS Rel. 2 list of all .conf files:
---------------------------------------------
 
Now as an example, follows a listing of all .conf configuration files of a real 9iAS Server.


-- -------------------------------------------------------------------
-- BEGIN LISTING FROM AN REAL LIFE 9iAS rel. 9.0.2 Server:
-- -------------------------------------------------------------------

 Directory of D:\ORACLE\ias902\Apache\Apache\conf

06/25/2002  10:55p                 293 access.conf
12/01/2003  02:07p              46,178 httpd.conf
12/01/2003  02:07p               3,342 mod_oc4j.conf
12/01/2003  02:07p                 517 mod_osso.conf
12/01/2003  02:07p                 811 oracle_apache.conf
06/25/2002  10:55p                 305 srm.conf
12/01/2003  02:07p                 551 wireless_sso.conf
               7 File(s)         51,997 bytes

 Directory of D:\ORACLE\ias902\Apache\Apache\conf\osso

04/23/2003  08:41p                 433 osso.conf
               1 File(s)            433 bytes

 Directory of D:\ORACLE\ias902\Apache\Jserv\conf

04/23/2003  08:38p              10,745 jserv.conf
               1 File(s)         10,745 bytes

 Directory of D:\ORACLE\ias902\Apache\jsp\conf

12/01/2003  02:07p                 594 ojsp.conf
               1 File(s)            594 bytes

 Directory of D:\ORACLE\ias902\Apache\modplsql\conf

12/01/2003  02:07p                 840 cache.conf
12/01/2003  02:07p               2,122 dads.conf
12/01/2003  02:07p               1,598 plsql.conf
               3 File(s)          4,560 bytes

 Directory of D:\ORACLE\ias902\Apache\oradav\conf

12/01/2003  02:07p                 785 moddav.conf
12/01/2003  02:07p                 396 oradav.conf
               2 File(s)          1,181 bytes

 Directory of D:\ORACLE\ias902\click\conf

12/01/2003  02:07p                 427 click-apache.conf
               1 File(s)            427 bytes

 Directory of D:\ORACLE\ias902\click\conf\templates

01/14/2002  11:21p                 445 click-apache.conf
               1 File(s)            445 bytes

 Directory of D:\ORACLE\ias902\dcm\config

02/17/2004  01:31p                 186 dcm.conf
               1 File(s)            186 bytes

 Directory of D:\ORACLE\ias902\dcm\config\plugins\apache

06/27/2002  11:01p              43,623 httpd.conf
               1 File(s)         43,623 bytes

 Directory of D:\ORACLE\ias902\dcm\repository.install\dcm\config

04/23/2003  08:57p                 185 dcm.conf
               1 File(s)            185 bytes

 Directory of D:\ORACLE\ias902\forms90\server

12/01/2003  02:07p               2,997 forms90.conf
               1 File(s)          2,997 bytes

 Directory of D:\ORACLE\ias902\ldap\das

12/01/2003  02:07p                 165 oiddas.conf
               1 File(s)            165 bytes

 Directory of D:\ORACLE\ias902\opmn\conf

02/17/2004  01:31p                  45 ons.conf
               1 File(s)             45 bytes

 Directory of D:\ORACLE\ias902\portal\conf

12/01/2003  02:07p               1,407 portal.conf
               1 File(s)          1,407 bytes

 Directory of D:\ORACLE\ias902\RDBMS\demo

12/01/2003  02:07p                 482 aqxml.conf
               1 File(s)            482 bytes

 Directory of D:\ORACLE\ias902\reports\conf

04/28/2003  02:59p               3,386 Copy (2) of rep_vbas99.conf
05/17/2002  08:45p               7,421 jdbcpds.conf
04/28/2003  02:59p               3,386 rep_vbas99.conf
05/17/2002  08:45p               6,381 textpds.conf
05/17/2002  08:45p                 454 xmlpds.conf
               5 File(s)         21,028 bytes

 Directory of D:\ORACLE\ias902\ultrasearch\webapp\config

12/01/2003  02:07p                 320 ultrasearch.conf
               1 File(s)            320 bytes

 Directory of D:\ORACLE\ias902\xdk\admin

12/01/2003  02:07p                 294 xml.conf
               1 File(s)            294 bytes

 Directory of D:\ORACLE\infra902\Apache\Apache\conf

06/25/2002  10:55p                 293 access.conf
04/23/2003  08:23p              46,224 httpd.conf
04/23/2003  08:23p               1,500 mod_oc4j.conf
04/23/2003  08:23p                 519 mod_osso.conf
04/23/2003  08:23p                 747 oracle_apache.conf
06/25/2002  10:55p                 305 srm.conf
               6 File(s)         49,588 bytes

 Directory of D:\ORACLE\infra902\Apache\Apache\conf\osso

04/23/2003  08:20p                 433 osso.conf
               1 File(s)            433 bytes

 Directory of D:\ORACLE\infra902\Apache\Jserv\conf

04/23/2003  08:04p              10,763 jserv.conf
               1 File(s)         10,763 bytes

 Directory of D:\ORACLE\infra902\Apache\jsp\conf

04/23/2003  08:23p                 598 ojsp.conf
               1 File(s)            598 bytes

 Directory of D:\ORACLE\infra902\Apache\modplsql\conf

04/23/2003  08:23p                 842 cache.conf
04/23/2003  08:23p               1,485 dads.conf
04/23/2003  08:23p               1,606 plsql.conf
               3 File(s)          3,933 bytes

 Directory of D:\ORACLE\infra902\Apache\oradav\conf

04/23/2003  08:23p                 789 moddav.conf
04/23/2003  08:23p                   2 oradav.conf
               2 File(s)            791 bytes

 Directory of D:\ORACLE\infra902\dcm\config

02/17/2004  01:31p                 188 dcm.conf
               1 File(s)            188 bytes

 Directory of D:\ORACLE\infra902\dcm\config\plugins\apache

06/27/2002  11:01p              43,623 httpd.conf
               1 File(s)         43,623 bytes

 Directory of D:\ORACLE\infra902\dcm\repository.install\dcm\config

04/23/2003  08:24p                 187 dcm.conf
               1 File(s)            187 bytes

 Directory of D:\ORACLE\infra902\ldap\das

04/23/2003  08:23p                 165 oiddas.conf
               1 File(s)            165 bytes

 Directory of D:\ORACLE\infra902\oem_webstage

04/23/2003  08:23p                 943 oem.conf
               1 File(s)            943 bytes

 Directory of D:\ORACLE\infra902\opmn\conf

02/17/2004  01:31p                  45 ons.conf
               1 File(s)             45 bytes

 Directory of D:\ORACLE\infra902\RDBMS\demo

04/23/2003  08:23p                 477 aqxml.conf
               1 File(s)            477 bytes

 Directory of D:\ORACLE\infra902\sqlplus\admin

04/23/2003  08:23p               1,454 isqlplus.conf
               1 File(s)          1,454 bytes

 Directory of D:\ORACLE\infra902\sso\conf

04/23/2003  08:23p                 154 sso_apache.conf
               1 File(s)            154 bytes

 Directory of D:\ORACLE\infra902\ultrasearch\webapp\config

04/23/2003  08:23p                 324 ultrasearch.conf
               1 File(s)            324 bytes

 Directory of D:\ORACLE\infra902\xdk\admin

04/23/2003  08:23p                 291 xml.conf
               1 File(s)            291 bytes

 Directory of D:\ORACLE\ora901\Apache\Apache\conf

08/20/2001  11:00a                 285 access.conf
04/23/2003  07:26p              43,205 httpd.conf
04/23/2003  07:33p                 472 oracle_apache.conf
08/20/2001  11:00a                 297 srm.conf
               4 File(s)         44,259 bytes

 Directory of D:\ORACLE\ora901\Apache\Jserv\conf

04/23/2003  07:26p               6,710 jserv.conf
               1 File(s)          6,710 bytes

 Directory of D:\ORACLE\ora901\Apache\jsp\conf

04/23/2003  07:33p                 511 ojsp.conf
               1 File(s)            511 bytes

 Directory of D:\ORACLE\ora901\Apache\modose\conf

04/23/2003  07:27p                 637 ose.conf
               1 File(s)            637 bytes

 Directory of D:\ORACLE\ora901\Apache\modplsql\cfg

04/23/2003  07:29p                 318 plsql.conf
               1 File(s)            318 bytes

 Directory of D:\ORACLE\ora901\BC4J

04/23/2003  07:33p                 121 bc4j.conf
               1 File(s)            121 bytes

 Directory of D:\ORACLE\ora901\oem_webstage

04/23/2003  07:33p                 682 oem.conf
               1 File(s)            682 bytes

 Directory of D:\ORACLE\ora901\rdbms\demo

04/23/2003  07:26p                 326 aqxml.conf
               1 File(s)            326 bytes

 Directory of D:\ORACLE\ora901\sqlplus\admin

04/23/2003  07:33p               1,476 isqlplus.conf
               1 File(s)          1,476 bytes

 Directory of D:\ORACLE\ora901\ultrasearch\jsp\admin\config

05/02/2001  08:26p              10,681 mod__ose.conf
               1 File(s)         10,681 bytes

 Directory of D:\ORACLE\ora901\xdk\admin

04/23/2003  07:33p                 253 xml.conf
               1 File(s)            253 bytes

     Total Files Listed:
              71 File(s)        321,045 bytes
 

33. Deploying J2EE Applications:
----------------------------------
 
You can deploy J2EE applications using the OC4J Home Page on the Enterprise Manager Web site. 
To navigate to an OC4J Home Page, do the following: 

-Navigate to the Instance Home Page where the OC4J instance resides. 
 Scroll to the System Components section. 

-Select the OC4J instance in the Name column. This opens the OC4J Home Page for that OC4J instance. 

-Scroll to the Deployed Applications section on the OC4J Home Page. 

Clicking Deploy EAR File or Deploy WAR File starts the deployment wizard, which deploys the application 
to the OC4J instance and binds any Web application to a URL context. 

Your J2EE application can contain the following modules: 

-- Web applications 
   The Web applications module (WAR files) includes servlets and JSP pages. 

-- EJB applications  
   The EJB applications module (EJB JAR files) includes Enterprise JavaBeans (EJBs). 

-- Client application contained within a JAR file 

Now archive the JAR and WAR files that belong to an enterprise Java application into an EAR file 
for deployment to OC4J. The J2EE specifications define the layout for an EAR file. 

The internal layout of an EAR file should be as follows: 


<appname>-
          |--META_INF
          |     |
          |     -----application.xml
          |
          |--EJB JAR file
          |
          |--WEB WAR file
          |
          |--Client JAR file
          |


When you deploy an application within a WAR file, the application.xml file is created 
for the Web application. 
When you deploy an application within an EAR file, you must create the application.xml 
file within the EAR file. 
Thus, deploying a WAR file is an easier method for deploying a Web application. 


-------------
34. Errors:
-------------

-- TROUBLESHOOTING 9iAS Rel. 2
-- Version 2.0
-- 4 juli 2004
-- Albert van der Sel


With an 9iAS Release 2 Full install (Business Inteligence install), a tremendous amount
of errors might be encountered.
Here you will find my own experiences, as well as some threads from metalink.


OPMN        = Oracle Process Manager and Notification Server
JAZN / JAAS = Oracle Application Server Java Authentication and Authorization Service 
DCM         = Distributed Configuration Management


OPMN stands for 'oracle process management notification' and is Oracle's 'high availability' system. 
OPMN monitors processes and brings them up again automatically if they go down. 
It is started when you start enterprise manager website with emctl start from the prompt 
in the infrastructure oracle home, and doing this starts 2 opmn processes for each oracle home. 
OPMN consists of two components - Oracle Process Manager and Oracle Notification System.

DCM stands for 'distributed component management' and is the framework by which all 
IAS R2 components hang together. DCM is a layer that ensures that if something is changed in one components, 
others like Enterprise Manager are made aware as well. It is not a process as such, 
but rather a generic term for a framework and utilities. 
It is controlled directly with the dcmctl command. 

DMS Dynamic Monitoring Services . These processes are started when you start ohs. 
DMS basically gathers information on components. 

Jserv Jserv works in much the same way as R1 except oracle components no longer use this servlet architecture, 
but use oc4j instead. 

mod_plsql works the same way as R1. 

mod_oradav oradav allows web folders to be shared with clients e.g. PC's and accessed as if they were NT folders. 

OC4J_DAS is used by Portal for the management of users and groups. You access this via http://machine:port/oiddas


============================
PART 1: GENERAL 9iAS ERRORS:
============================


1. troubleshooting the targets.xml:
===================================

If you change the HOSTNAME for the repository (infrastructure) database, 
then you need to update the ssoServerMachineName property for the oracle SSO target 
in INFRA_ORACLE_HOME/sysman/emd/targets.xml 

The $ORACLE_HOME/sysman/emd/targets.xml file is created during installation of 9iAS and 
includes descriptions of all currently known targets. 
This file is used as the source of targets for the EM Website.

sample targets.xml:

- <Targets>
- <Target TYPE="oracle_webcache" NAME="ias902dev.missrv.miskm.mindef.nl_Web Cache" DISPLAY_NAME="Web Cache">
  <Property NAME="HTTPPort" VALUE="7778" /> 
  <Property NAME="logFileName" VALUE="webcache.log" /> 
  <Property NAME="authrealm" VALUE="Oracle Web Cache Administrator" /> 
  <Property NAME="AdminPort" VALUE="4003" /> 
  <Property NAME="HTTPProtocol" VALUE="http" /> 
  <Property NAME="logFileDir" VALUE="/sysman/log" /> 
  <Property NAME="HTTPMachine" VALUE="missrv.miskm.mindef.nl" /> 
  <Property NAME="HTTPQuery" VALUE="" /> 
  <Property NAME="controlFile" VALUE="d:\oracle\ias902/bin/webcachectl.exe" /> 
  <Property NAME="MonitorPort" VALUE="4005" /> 
  <Property NAME="HTTPPath" VALUE="/" /> 
  <Property NAME="authpwd" VALUE="98574abda4f0a0cadcfe3e420f09854b" ENCRYPTED="TRUE" /> 
  <Property NAME="authuser" VALUE="98574abda4f0a0cadcfe3e420f09854b" ENCRYPTED="TRUE" /> 
- <CompositeMembership>
  <MemberOf TYPE="oracle_ias" NAME="ias902dev.missrv.miskm.mindef.nl" ASSOCIATION="null" /> 
  </CompositeMembership>
  </Target>
+ <Target TYPE="oracle_clkagtmgr" NAME="ias902dev.missrv.miskm.mindef.nl_Clickstream" DISPLAY_NAME="Clickstream Collector" ON_HOST="missrv.miskm.mindef.nl">
- <CompositeMembership>
  <MemberOf TYPE="oracle_ias" NAME="ias902dev.missrv.miskm.mindef.nl" /> 
  </CompositeMembership>
  </Target>
   ..
   ..
- <Target TYPE="oracle_repserv" NAME="ias902dev.missrv.miskm.mindef.nl_Reports:rep_missrv" DISPLAY_NAME="Reports:rep_missrv" VERSION="1.0" ON_HOST="missrv.miskm.mindef.nl">
  <Property NAME="OracleHome" VALUE="d:\oracle\ias902" /> 
  <Property NAME="UserName" VALUE="repadmin" /> 
  <Property NAME="Servlet" VALUE="http://missrv.miskm.mindef.nl:7778/reports/rwservlet" /> 
  <Property NAME="Server" VALUE="rep_missrv" /> 
  <Property NAME="Password" VALUE="ced9a541f77e7df6" ENCRYPTED="TRUE" /> 
  <Property NAME="host" VALUE="missrv.miskm.mindef.nl" /> 
- <CompositeMembership>
  <MemberOf TYPE="oracle_ias" NAME="ias902dev.missrv.miskm.mindef.nl" ASSOCIATION="null" /> 
  </CompositeMembership>
  </Target>
  </Target>
- <Target TYPE="oracle_ifs" NAME="iFS_missrv.miskm.mindef.nl:1521:o901:IFSDP">
  <Property NAME="DomainName" VALUE="ifs://missrv.miskm.mindef.nl:1521:o901:IFSDP" /> 
  <Property NAME="IfsRootHome" VALUE="d:\oracle\ias902\ifs" /> 
  <Property NAME="SysadminUsername" VALUE="system" /> 
  <Property NAME="SysadminPassword" VALUE="973dc46d050ca537" ENCRYPTED="TRUE" /> 
  <Property NAME="IfsHome" VALUE="d:\oracle\ias902\ifs\cmsdk" /> 
  <Property NAME="SchemaPassword" VALUE="daeffdd4f05cd456" ENCRYPTED="TRUE" /> 
- <CompositeMembership>
  <MemberOf TYPE="oracle_ias" NAME="ias902dev.missrv.miskm.mindef.nl" /> 
  </CompositeMembership>
  </Target>

The above file stores amongst other things, the encrypted passwords that EM uses for access to components. 
Search for oracle_portal, oracle_repserv etc. Although encrypted, you can change these to be a password 
in Englidh as long as you flag it ENCRYPTED=FALSE. This should only be done for specific bug problems 
as recommended by oracle support. 
Do not change these passwords for any other reason!! 

The following is a list of things to check when there appears to be a problem with targets.xml.

1.  Check the permissions on the active targets.xml file and restart all the infrastructure components 
    (database, listener, oid, emctl in that order). 
    The targets.xml file should be owned by the user who installed 9iAS and who starts emctl. 
    Accidentally  starting emctl as root recreates the targets.xml under root ownership.
    Fix this by changing ownership on targets.xml and restarting emctl.

2.  Check which targets are listed, to ensure there is information on each expected target.

3.  Check whether the hosts file and targets.xml have matching hostnames, 
    and whether both have fully qualified hostnames. 

4.  What should be done if targets.xml is empty, or missing targets?

a.  Restore targets.xml from backup

b.  Copy $ORACLE_HOME/sysman/emd/discoveredTargets.xml to $ORACLE_HOME/sysman/emd/targets.xml, 
    although it may not be complete if additional targets were installed following installation.

    See EM Website has no Entries for the 9iAS Instances 226226.1
    and EM Web Site Fails to Display Application Servers 210552.1
    and Login as ias_admin to 9iAS R2 Enterprise Manager, A Blank List is Displayed for Targets 209540.1

c.  Check the amount of disk space available. See Bug 2508930 - TARGETS.XML IS EMPTY IF WE HAVE NO DISK SPACE.

d.  Reinstall. See De-Installing 9iAS Release 2 (9.0.2) From Unix Platforms 218277.1


5.  Is there an Infrastructure and Mid-Tier install on the system?

    When installing both the infrastructure and a mid-tier on the same server (in different homes), 
    the installation of the infrastructure creates the emtab file pointing to its own home. 
    During installation of the mid-tier, the mid-tier installation routine uses the emtab file 
    pointing to the infrastructure home so it knows where to write configuration information 
    required for the infrastructure EM Website, so it can see not only information concerning itself 
    but also information related to the mid-tier.

    If the emtab file is removed/renamed after installation of of the infrastructure but before installation 
    of the mid-tier, a new emtab file is created pointing to the mid-tier home. The configuration file 
    routines of the mid-tier installation therefore do not know about the existence of the infrastructure 
    and write the new configuration information into files in its own home and not into the files 
    in the infrastructure home.

In addition to entries in the targets.xml in the infrastructure home, other files such as the 
ias.properties file in the infrastructure home are also updated with information concerning the mid-tier.

Merging the targets.xml file from both homes may solve some of the display problems, 
though they may not solve control of component issues due to incomplete configuration files 
in the infrastructure home.

References to renaming the emtab file should be disregarded when performing infrastructure/mid-tier 
installs on the same server, and may have in fact been specific to certain platforms and specific 
for certain circumstances.


The EM Web Site is launched as a J2EE application. The configuration files consist of many XML files 
and properties files. Here are some of those files: 

targets.xml 
emd.properties 
logging.properties 
iasadmin.properties 


2. Cleanly Restarting OID After A 9iAS 9.0.2 Crash:
===================================================


A problem that often seems to happen when Oracle 9iAS 9.0.2 crashes is that you can't seem 
to restart OID using OIDCTL.

For example, a situation might arise when a server is bounced without 9iAS being shut down cleanly. 
When you reboot the PC, and use DCMCTL to check the status of the OC4J instances prior to starting them, 
you get the following error message:

C:\ocs_onebox\infra\dcm\bin>dcmctl getState -V

ADMN-202026
A problem has occurred accessing the Oracle9iAS infrastructure database.
Base Exception:
oracle.ias.repository.schema.SchemaException:Unable to connect to Directory Server
:javax.naming.CommunicationException: markr.plusconsultancy.co.
uk:4032 [Root exception is java.net.ConnectException: Connection refused: connect]
Please, refer to the base exception for resolution, or call Oracle support.


Or, when you watch an ias start script, at the point oid get started, you will see

C:\ocs_onebox\infra\bin>oidctl server=oidldapd configset=0 instance=1 start

which should startup an OID instance. However, sometimes this fails to work and you get the error message:

C:\ocs_onebox\infra\bin>oidctl server=oidldapd configset=0 instance=1 start
*** Instance Number already in use. ***
*** Please try a different Instance number. ***

oidmon is the 'monitor' process. It pools the database ( table ODS.ODS_PROCESS ) for new 
ldap server launch requests, and if it finds one, (also placed there by oidctl as user ODSCOMMON ) , 
then it starts a 'dispatcher/listener process.' As such, oidctl does not actually start the ldap processes. 
Oidmon then spawns 'dispatcher' and 'server' oidldapd processes. 


What actually happens behind the scenes is that a row is inserted or updated in the ODS.ODS_PROCESS table 
that contains the instance name (which must be unique), the process ID, and a flag called 'state', 
which has three values - 0,1,2 and 3 which stand for stop, start, running and restart. A second process, OIDMON, 
polls the ODS.ODS_PROCESS table and when it finds a row with state=0, it reads the pid and stops the process. 
When it finds a state=1, oidmon starts a new process and updates pid with a new process id. With state=2, 
oidmon reads the pid, and checks that the process with the same pid is running. If it's not, oidmon starts 
a new process and updates the pid. Lastly, with state=3, oidmon reads the pid, stops the process, starts a new one 
and updates the pid accordingly. If oidmon can't start the server for some reason, it retries 10 times, and if 
still unsuccessful, it deletes the row from the ODS.ODS_PROCESS table. Therefore, OIDCTL only inserts or updates 
state information, and OIDMON reads rows from ODS.ODS_PROCESS, and performs specified tasks based on the value of 
the state column.

This all works fine except when 9iAS crashes; when this happens, OIDMON exits but the OIDLDAPD processes are 
not killed, and in addition, stray rows are often left in the ODS.ODS_PROCESS table that are detected when you try 
to restart the oidldapd instance after a reboot.


The way to properly deal with this is to take two steps.

1. Kill any stray OIDLDAPD processes still running (if you haven't rebooted the server since the crash) 
2. Delete any rows in the ODS.ODS_PROCESS table

connect to the IASDB database as the ODS user, or as SYSTEM

select * from ODS.ODS_PROCESS; (there should be at least one row)
delete form ODS.ODS_PROCESS;
commit;

3. Restart the OID instance again, using 

C:\ocs_onebox\infra\bin>oidctl server=oidldapd configset=0 instance=1 start 


OID uses the configfile:

$INFRA_ORACLE_HOME/network/admin/ldap.ora 

Sample:

  # LDAP.ORA Network Configuration File: d:\oracle\infra902\network\admin\ldap.ora
  # Generated by Oracle configuration tools.

  DEFAULT_ADMIN_CONTEXT = ""

  DIRECTORY_SERVERS= (missrv.miskm.mindef.nl:4032:4031)

  DIRECTORY_SERVER_TYPE = OID


3. Deobfuscate Errors After Reboot, Crash, or Network Change.
=============================================================
 

This can occur occur under these scenarios:     
*  A reboot has just occurred for the first time after 9iAS was installed.        
   (And, you had to change the /etc/hosts file during installation)   OR    
*  A system crash occurred, and trying to recover. The 9iAS installion       
   is placed on a machine with the same hostname and IP address as before the crash occurred. OR    
*  Hardware changes have occurred to the machine. (ie, CPU, NIC)   AND    
*  Everything was working under the current 9iAS configuration.       
   (A 9iAS configuration change causing this can be a different problem)  


There are different times when this error can occur. But, it basically occurs when a 
change to the system has been done. This can be after a reboot or a crash, but there is 
a difference on the machine before and after the occurance. 
It is usually a network configuration change that has caused the problem.  


When you try to start the Oracle HTTP Server, the following error might appear in the opmn logs:   
"Syntax error on line 6 of OH/Apache/Apache/conf/mod_osso.conf:  Unable to deobfuscate the SSO server config file,  
OH/Apache/Apache/conf/osso/osso.conf, error  Bad padding pattern detected in the last block."  
Most of the Mid-Tier components will fail to connect to the Infrastructure, and will give the following error:  
"oracle.ias.repository.schema.SchemaException:Password could not be retrieved"   

Possible solution:


1. Start Infrastructure DB
2. Start the Infrastructure OID
3. Include $ORACLE_HOME/lib in the LD_LIBRARY_PATH, SHLIB_PATH, or LIBPATH environment variable, 
depending on your platform.       
-For AIX      LIBPATH=$ORACLE_HOME/lib:$ORACLE_HOME/lib64:$LIBPATH; export LIBPATH           
-For HPUX     SHLIB_PATH=$ORACLE_HOME/lib32:$ORACLE_HOME/lib:$SHLIB_PATH; export SHLIB_PATH       
-For Solaris, Linux and Tru64      LD_LIBRARY_PATH=$ORACLE_HOME/lib:$LD_LIBRARY_PATH; export LD_LIBRARY_PATH 

4. Run the command to reset the iAS password. Please use the SAME password, as we are not attempting 
to change the password you enter when signing onto EM. That is done with the emctl utility. 
This command changes it internally, and we want to re-instate the current obfuscated password: 

resetiASpasswd.sh "cn=orcladmin" <orcladminpassword_given_before> <$ORACLE_HOME> 
      
Note: There is a resetiASpasswd.bat on Windows, to be used the same way, just in case these steps 
are followed on Windows. The above stated problem is specific to UNIX, but there may be occasions 
to run through the same steps. 

5. Use the showPassword utility to obtain the password for the orasso user.      
Then, re-register the listener, being sure to add this information to  the ossoreg command in Step 6:            
-schema orasso          
-pass ReplaceWithPassword  

6. Run the command to re-register mod_osso.       
* Make sure there are no spaces after the trailing '\'s        
If on Windows, use all one line, withouth the "\"      
* Replace the uppercase with proper items      
* The following assumes the to-be registered http server is on the mid-tier      
* If on Windows, use "SYSTEM", instead of "root" for -u    

java -jar $ORACLE_HOME/sso/lib/ossoreg.jar \   -host $INFRA_HOST \   -port 1521 \   -sid iasdb \   
-site_name MID_HOST:MID_PORT\   -oracle_home_path $ORACLE_HOME \   
-success_url http://MID_HOST:MID_PORT/osso_login_success \   
-logout_url http://MID_HOST:MID_PORT/osso_logout_success \   
-cancel_url http://MID_HOST:MID_PORT/  \   
-home_url http://MID_HOST:MID_PORT/  \   
-config_mod_osso TRUE  \   -u root \   -sso_server_version v1.2 \   -schema orasso \   -pass <ReplaceWithPassword>


NOTE:   The following command will not work on 9iAS 9.0.2.0.x, unless a patched dcm.jar   
has previously been applied with a patch (or 9.0.2.1). Since this cannot be run on previous versions, 
just proceed to step 8.    

7. Run following commands on the machine where the change occurred, (not the associated Mid-Tiers):      
a. Solaris         
i.  $ORACLE_HOME/dcm/bin/dcmctl resetHostInformation         
ii. $ORACLE_HOME/bin/emctl set password <previous_password>      

b. NT         
i.  Make sure the Oracle9iAS is stopped         
ii. Edit %ORACLE_HOME%\sysman\j2ee\config\jazn-data.xml          
iii.Search for �ias_admin�         
iv. Replace obfuscated text between <credentials> and </credentials> with "!<password>" 
where "<password>" is the password. 	             
Example:     <credentials>!welcome1</credentials>         
v.  Save the file. 

8. Continue starting 9iAS, as in Note 200475.1. The next step is:            

% dcmctl start -ct ohs       

This is what was originally failing. After successfully starting OHS, You may want to take a backup 
of the deobfuscated information as described  in Note215955.1. 


3. Not able to access the Middle Tier from EM Website.
======================================================

3.1
---

Thread Status: Active 
From: Ishaq Baig  <mailto:ishaq@alrabie.com>19-Nov-03 10:47 
Subject: Enable to Access the Middle Tier Instance from EM Website 

RDBMS Version: 8.1.7
Operating System and Version: WIN2K Service Pack3
Product (i.e., OAS, IAS, etc): IAS 
Product Version: 9.0.2
JDK Version: 1.3.1.9
Error number: 

Enable to Access the Middle Tier Instance from EM Website

Hi, 
We have an 9IAS (9.0.2) Infrastructure and Middle Tier 
instance running on ONE Box (Win2k),thing we fine until 
while trying to implement the Single Sigon after 
making the changes as instructed in Note:199072.1 
we stopped the HTTP Server so that change could take 
effect,but every since we have stopped the HTTP Server 
we couldn't gain access to the Middle Instance from the 
EM WEB SITE the page just hangs......on the other hand 
the INFRASTRUCTURE instance is working fien.We even tried 
starting the HTTP server through the DCM UTILITY the following was 
the message 

Content-Type: text/html 
Response: 0 of 1 processes started. 

Check opmn log files such as ipm.log and ons.log for detailed.". 
Resolve the indicated problem at the Oracle9iAS instance where it occurred thenresync the instance 
Remote Execute Exception 806212 
oracle.ias.sysmgmt.exception.ProcessMgmtException: OPMN operation failure 
at oracle.ias.sysmgmt.clustermanagement.OpmnAgent.validateOperation(Unknown Source) 
at oracle.ias.sysmgmt.clustermanagement.OpmnAgent.startOHS(Unknown Source) 
at oracle.ias.sysmgmt.clustermanagement.StartInput.execute(Unknown Source) 
at oracle.ias.sysmgmt.clustermanagement.ClusterManager.execute(Unknown Source) 
at oracle.ias.sysmgmt.task.ClusterManagementAdapter.execute(Unknown Source) 
at oracle.ias.sysmgmt.task.TaskMaster.execute(Unknown Source) 
at oracle.ias.sysmgmt.task.TaskMasterReceiver.process(Unknown Source) 
at oracle.ias.sysmgmt.task.TaskMasterReceiver.handle(Unknown Source) 
at oracle.ias.sysmgmt.task.TaskMasterReceiver.run(Unknown Source) 

is Any Inputs highly appreciated,we need to get it up 
as soon as possible. 

Regards 
Ishaq Baig 


From: Oracle, Rhoderick Butial  <mailto:rhoderick.butial@oracle.com>19-Nov-03 14:36 
Subject: Re : Enable to Access the Middle Tier Instance from EM Website 

Hello, 

What type of changes did you alter? 
Did you try restarting all of the other components on the mid tier? 

There should be some errors generated in the error_log file, please post these errors 
in your next reply. You may want to review the following notes: 

Note.236112.1 Wrong user supplied to ossoreg causing ADMN-906025 exception, 806212 
Note.223586.1 Starting Oracle HTTP Server gives ADMN-906025 error 
Note.222051.1 Starting Oracle HTTP Server gives ADMN-906025 Error 

Also, I noticed that you have listed your 9iAS version as 9.0.2, did you apply the latest patchsets before implementing the changes? 

If not, you will need to apply the patchsets first before making the changes. Please review.. 

Note.215882.1 9iAS Release 2 Patching Recommendations Within the Version Lifecycle 
Thank you, 

Rod 
Oracle Technical Support 


3.2
---

Displayed below are the messages of the selected thread. 
Thread Status: Closed 
From: Ron Miller  <mailto:ron.miller@tccd.edu>28-Oct-03 16:13 
Subject: EM Website extremely slow for 9iAS 

RDBMS Version:: 9.0.1.3.0
Operating System and Version:: AIX 4.3.3
Product (i.e. Trace, DB Diff, Expert, etc):: Oracle9i Application Server
Product Version:: 9.0.2.2.0
OEM Console Operating System and Version:: Windows 2000

EM Website extremely slow for 9iAS

When I use the EM website to access the components of my 9i App server, the response time is very slow. 
It takes 2 or 3 minutes to go from screen to another. I have found information on this forum that others 
are experiencing the same problem. The response from Oracle support has been that this is a known problem 
and there is a bug, 2756262, which is to be fixed in 9.0.4. However, I cannot find any information on when 
this release will be available. It seems to keep getting pushed back. Does anyone know a release date? 
Has anyone requested a backport of this fix to an earlier release? Thanks for any response. 


From: Oracle, Kathy Ting  <mailto:Kathy.Ting@oracle.com>29-Oct-03 05:41 
Subject: Re : EM Website extremely slow for 9iAS 

The base architecture is being redesign. Due to the redesign, backports are not being accepted. 

Look for a much better improved EM website in future releases. 

Thank you for using the MetaLink Forum, 
Kathy 
Oracle Support. 


From: Ron Miller  <mailto:ron.miller@tccd.edu>29-Oct-03 14:52 
Subject: Re : Re : EM Website extremely slow for 9iAS 

Thanks for the reply Kathy. I will look forward to the redesign since the current product is pretty much useless. 


From: Oracle, Kathy Ting  <mailto:Kathy.Ting@oracle.com>29-Oct-03 22:04 
Subject: Re : Re : Re : EM Website extremely slow for 9iAS 


As do we. 


Thank you for using the MetaLink Forum, 
Kathy 
Oracle Support. 


4. Explanation of IAS_ADMIN and ORCLADMIN Accounts 
==================================================

Note:244161.1 
Subject: Explanation of IAS_ADMIN and ORCLADMIN Accounts 
Type: BULLETIN 
Status: PUBLISHED


PURPOSE 
------- 
 
To provide an explanation for the IAS_ADMIN and ORCLADMIN accounts that are  
established with Oracle9i Application Server (9iAS) Release 2 (9.0.2.x). 
 
  
SCOPE & APPLICATION 
------------------- 
 
Website Administrators installing and maintaining 9iAS 
 
 
Explanation of IAS_ADMIN and ORCLADMIN Accounts 
------------------------------------------------ 
  
There are two users that can create some confusion: ias_admin and orcladmin.  
However, the interaction is more or less internally managed. You log into the EM  
Website with ias_admin, but use the orcladmin password after initially installing 
9iAS. So when changing the orcladmin password, you may not get the results 
intended with the ias_admin login.  
 
But, if the obfuscation gets skewed, we found you sometimes need to reinstate  
the password obfuscation between the two with the resetiASpasswd script. This  
assumes the same password is used, and no resulting changes are noted. The  
*change* occurred internally. These changes, and methods, can cause some  
confusion.  
 
You can actually change the EM Website login separately with the emctl utility. 
Or, change the orcladmin username separately, depending on how your want  
to manage this.  
 
 
IAS_ADMIN Account  
----------------- 
 
In EM 9.0.2 and 9.0.3, you will need to use the IAS_ADMIN account to access the  
EM Website Home Page. This account is not known within the database or to the  
Oracle Management Server. Instead, it is a new account used only for access to  
the 9iAS Administration (EM) Web Site. The following note can be used to  
supplement the Documentation and Release Notes dealing with modifying this  
password:  
 
[NOTE:204182.1] <http://metalink.oracle.com/metalink/plsql/ml2_documents.showDocument?p_id=204182.1&p_database_id=NOT> 
How to Change the IAS_ADMIN password for Enterprise Manager  
 
NOTE:  
If you change the IAS_ADMIN password (as described in Note:204182.1),  
the ORCLADMIN password does not change. 
   
 
ORCLADMIN Account  
----------------- 
 
ORCLADMIN is used as a superuser account for administering 9iAS. During the  
initial installation of 9iAS, the installer prompts you to create the IAS_ADMIN 
password. This password is then also assigned to the ORCLADMIN account.   
 
To reset (not change) the ORCLADMIN password, you must run the script, ResetiASpasswd.sh.  
 
$ORACLE_HOME/bin/resetiASpasswd.sh "cn=orcladmin" <orcladminpassword_given_before> <$ORACLE_HOME> 
 
Note:  
There is a resetiASpasswd.bat on Windows, to be used the same way. 
 
If you suspect that the encryption is skewed, use the SAME password, to *reset* 
this. If you desire to change the password you enter when signing onto EM, use  
the emctl utility, (as described in Note:204182.1). 
 
If you actually want to change the ORCLADMIN password, you should use the  
Oracle Directory Manager, to modify this super user. 
 
   - Start the Directory Manager from $ORACLE_HOME/bin/oidadmin 
 
   - In the navigator pane, expand Oracle Internet Directory Servers.  
 
   - Select a server. The group of tab pages for that server appear in the right pane.  
 
   - Select the System Passwords tab. This page displays the current user names  
     and passwords for each type of user. Note that passwords are not displayed  
     in the password fields.  
 
 
SUMMARY 
------- 
 
Is the goal to reset the internally encrypted ias_admin password, change the  
actual orcladmin password, or just change the password when logging onto EM? 
Thats the main question to ask.  
 
1.  
To reset the internally encrypted ias_admin password, use the resetiASpasswd  
script, and use the same password as previously given. 
 
2. 
To change the orcladmin password, it is best to use the Oracle Directory Manager. 
Please see the Oracle Internet Directory Administrator's Guide for more information. 
 
3. 
Change the EM website or emctl password: 
Within the EM Web Site...Preferences link...top right-hand side of the screen.  
Or, on command line, using emctl. 

RELATED DOCUMENTS 
----------------- 
 
[NOTE:234712.1] <http://metalink.oracle.com/metalink/plsql/ml2_documents.showDocument?p_id=234712.1&p_database_id=NOT> Managing Schemas of the 9iAS Release 2 Metadata Repository 
 
[NOTE:253149.1] <http://metalink.oracle.com/metalink/plsql/ml2_documents.showDocument?p_id=253149.1&p_database_id=NOT> Resetting the Single Sign-On password for ORCLADMIN 

. 
 

5. Password for ORASSO Database Schema 
======================================

Password for ORASSO Database Schema 
goal: What is the password for the ORASSO database schema? 
fact: Oracle9i Application Server Enterprise Edition 9.0.2 
fact: Oracle9iAS Single Sign-On 9.0.2 

fix: During installation a random password is generated for the ORASSO database schema. 
You need to look up this password in the Oracle Internet Directory. 
The following text is taken from the Interoperability Patch Readme 
(a patch that was mandatory for 9.0.2.0.0 but is no longer needed for 9.0.2.0.1): 

If you do not know the password for the orasso schema, you can use the following procedure 
to determine the password: Note: Do not use the "alter user" SQL command to change the orasso password. 
If you need to change the orasso password, use Enterprise Manager so that it can propagate the password 
to all components that need to access orasso. 

Start up the Oracle Internet Directory administration tool from infrastructure machine. 

prompt> $ORACLE_HOME/bin/oidadmin 

Log into the oidadmin tool using the OID administrator account (cn=orcladmin) for 
the Infrastructure installation. 
Username: cn=orcladmin 
Password: administrator_password 
Server : host running Oracle Internet Directory and port number where Oracle Internet Directory 
is listening 
The administrator password is the same as the ias_admin password. 
The default port for Oracle Internet Directory is 389 (without SSL). 
Navigate the Single Sign-On schema (orasso) entry using the administration tool. 

> cn=orcladmin@OID_hostname:OID_port (for example: cn=orcladmin@infra.acme. com:389) 
> Entry Management 
> cn=OracleContext 
> cn=Products 
> cn=IAS 
> cn=Infrastructure Databases 
> orclReferenceName=Single Sign-On database SID:Single Sign-On Server hostname 
(for example: orclReferenceName=iasdb:infra.acme.com) 
> orclResourceName=ORASSO Click the above entry and look for the orclpasswordattribute 
attribute value on the right panel. This value is the password for the orasso schema. 
NOTE: If you have multiple Infrastructures installed using one Oracle Internet Directory, 
ensure that you are looking at the correct Single Sign-On database entry since all 
the infrastructure instances would have an ORASSO schema entry, but only one of them is actually being used. 


6. Windows Script to Determine orasso Password in 9iAS Release 2 (9.0.2) 
========================================================================

Note:205984.1 
Subject:  Windows Script to Determine orasso Password in 9iAS Release 2 (9.0.2) 
Type:  BULLETIN 
Status:  PUBLISHED 


PURPOSE 
------- 
 
The showPassword utility was developed to avoid having to use the oidadmin 
tool to look up various OID passwords, by using ldapsearch with Oracle9i  
Application Server (9iAS) Release 2 (9.0.2).  
 
As a script, varying on different environments, it is not supported by Oracle  
Support Services. It is intended as an example, to aid in the understanding 
of the product. 
 
SCOPE & APPLICATION 
------------------- 
 
9iAS Administrators and Windows Administrators 
 
 
Windows Script to Determine orasso Password in 9iAS Release 2 (9.0.2) 
--------------------------------------------------------------------- 
 
1. Paste the following script in a file named showPassword.bat and copy it in  
a directory. Please also ensure that ldapserach is there in PATH on your  
widnows machine. 
 
8<8<8<8<8<8<8<8<8<8<8<8<8<8<8<8<8<8<8<8<8<8<8<8<8<8< 
 
set OIDHOST=bldel18.in.oracle.com 
set OIDPORT=4032 
if "%1"== ""  goto cont 
if "%2"== "" goto cont 
ldapsearch -h %OIDHOST%  -p %OIDPORT% -D "cn=orcladmin" -w "%1" -b "cn=IAS Infrastructure  
Databases,cn=IAS,cn=Products,cn=OracleContext" -s sub "orclResourceName=%2"  
orclpasswordattribute 
goto :end 
:cont 
echo Correct Syntax is 
echo showpassword.bat orcladminpassword username 
:end 
 
8<8<8<8<8<8<8<8<8<8<8<8<8<8<8<8<8<8<8<8<8<8<8<8<8<8< 
 
Note that the "ldapsearch...orclpasswordattribute" commands should be put on 
one line. 
 
2. Edit the script and update with your own hostname and OID port  
OIDHOST=bldel18.in.oracle.com 
OIDPORT=4032 
 
3. Ensure that you have ldapsearch from the correct ORACLE_HOME in the PATH 
 
4. Check that OID is up and running before proceeding.  
 
5. Run the script, and enter the schema name as: orasso, and the password value 
   is shown. 
 
 
For example: 
(all ONE line...may be easier to copy/paste from Notepad) 
 
C:\> showPassword.bat oracle1 orasso 
OrclResourceName=ORASSO,orclReferenceName=iasdb.bldel18.in.oracle.com,cn=IAS Inf 
rastructure Databases,cn=IAS,cn=Products,cn=OracleContext 
orclpasswordattribute=Gbn3Fd24 
 
The orasso password in this example is Gbn3Fd24. 
 

6. STARTING AND STOPPING 9iAS WITH SCRIPTS.
===========================================

----------------------------------------------------------------
5.1 From metalink:


a) StartInfrastructure.bat: 
REM #################################################### 
REM #################################################### 
REM ## Script to start Infrastructure ## 
REM ## ## 
REM #################################################### 
REM #################################################### 
REM ## 
REM ## Set environment variables for Infrastructure 
REM #################################################### 
set ORACLE_HOME=D:\IAS90201I 
set ORACLE_SID=IASDB 
set PATH=%ORACLE_HOME%\bin;%ORACLE_HOME%\dcm\bin;%ORACLE_HOME%\opmn\bin;%PATH%; 
REM ##################################################### 
REM ## Start Oracle Internet Directory processes 
REM ##################################################### 
echo .....Starting %ORACLE_HOME% Internet Directory ...... 
oidmon start 
oidctl server=oidldapd instance=1 start 
timeout 20 
REM ##################################################### 
REM ## Start Oracle HTTP Server and OC4J processes 
REM ##################################################### 
echo .....Starting OHS and OC4J processes....... 
call dcmctl start -ct ohs 
call dcmctl start -ct oc4j 
REM ##################################################### 
REM ## Check OHS and OC4J processes are running 
REM ##################################################### 
echo .....Checking OHS and OC4J status..... 
call dcmctl getstate -v 
pause 


REM #################################################### 
b) StartMidTier.bat: 
REM #################################################### 
REM #################################################### 
REM ## Script to start MidTier ## 
REM ## ## 
REM #################################################### 
REM #################################################### 
REM ## 
REM ## Set environment variables for Midtier 
REM #################################################### 
set ORACLE_HOME=D:\IAS90201J 
set PATH=%ORACLE_HOME%\bin;%ORACLE_HOME%\dcm\bin;%ORACLE_HOME%\opmn\bin;%PATH%; 
REM ##################################################### 
REM ## Start Oracle HTTP Server and OC4J processes 
REM ##################################################### 
echo .....Starting OHS and OC4J processes....... 
call dcmctl start -ct ohs 
call dcmctl start -ct oc4j 
REM ##################################################### 
REM ## Check OHS and OC4J processes are running 
REM ##################################################### 
echo .....Checking OHS and OC4J status..... 
call dcmctl getstate -v 
REM #################################################### 
REM ## Start Webcache 
REM #################################################### 
echo .....Starting Webcache.......... 
webcachectl start 
REM #################################################### 
REM ## Start Enterprise Manager Website 
REM #################################################### 
echo .....Starting EM Website..... 
net start Oracleias90201iEMWebsite 
echo ....Done 
pause 
REM #################################################### 
c) StopMidTier.bat: 
REM #################################################### 
REM #################################################### 
REM ## Script to stop Midtier ## 
REM ## ## 
REM #################################################### 
REM #################################################### 
REM ## 
REM ## Set environment variables for Midtier 
REM #################################################### 
set ORACLE_HOME=D:\IAS90201J 
set PATH=%ORACLE_HOME%\bin;%ORACLE_HOME%\dcm\bin;%ORACLE_HOME%\opmn\bin;%PATH%; 
REM #################################################### 
REM ## Stop Enterprise Manager Website 
REM #################################################### 
echo .....Stopping EM Website..... 
net stop Oracleias90201iEMWebsite 
REM #################################################### 
REM ## Stop Webcache 
REM #################################################### 
echo .....Stopping %ORACLE_HOME% Webcache.......... 
webcachectl stop 
REM #################################################### 
REM ## Stop Oracle HTTP Server and OC4J processes 
REM #################################################### 
echo .....Stopping %ORACLE_HOME% OHS and OC4J........ 
dcmctl shutdown 
echo ....Done 
pause 
REM #################################################### 
d)StopInfrastructure.bat: 
REM #################################################### 
REM #################################################### 
REM ## Script to stop Infrastructure ## 
REM ## ## 
REM #################################################### 
REM #################################################### 
REM ## 
REM ## Set environment variables for Infrastructure 
REM #################################################### 
set ORACLE_HOME=D:\IAS90201I 
set ORACLE_SID=IASDB 
set PATH=%ORACLE_HOME%\bin;%ORACLE_HOME%\dcm\bin;%ORACLE_HOME%\opmn\bin;%PATH%; 
set EM_ADMIN_PWD=<your_pwd> 
REM #################################################### 
REM ## Stop Enterprise Manager Website 
REM #################################################### 
echo .....Stopping EM Website..... 
call emctl stop 
REM #################################################### 
REM ## Stop Oracle HTTP Server and OC4J processes 
REM #################################################### 
echo .....Stopping %ORACLE_HOME% OHS and OC4J........ 
call dcmctl shutdown 
REM ##################################################### 
REM ## Stop Oracle Internet Directory processes 
REM ##################################################### 
echo .....Stopping %ORACLE_HOME% Internet Directory ...... 
oidctl server=oidldapd configset=0 instance=1 stop 
timeout 20 
oidmon stop 
echo ....Done 
pause 
REM ##################################################### 


----------------------------------------------------------------
5.2 Our scripts:

Starting:
=========

@ECHO OFF
TITLE Startup all
REM **********************************************************
REM Adjust the following values
set ORACLE_BASE=D:\oracle
set IAS_HOME=%ORACLE_BASE%\ias902
set IAS_BIN=%IAS_HOME%\bin
set INFRA_HOME=%ORACLE_BASE%\infra902
set INFRA_BIN=%INFRA_HOME%\bin
REM **********************************************************

echo **********************************************************
echo Parameters used are:
echo ORACLE_BASE = %ORACLE_BASE%
echo IAS_HOME    = %IAS_HOME%
echo IAS_BIN     = %IAS_BIN%
echo INFRA_HOME  = %INFRA_HOME%
echo INFRA_BIN   = %INFRA_BIN%
echo **********************************************************

echo **********************************************************
echo "Starting up infra"
echo **********************************************************

echo "Starting iasdb instance"
echo connect sys/change_on_install as sysdba > $$tmp$$
echo startup >> $$tmp$$
echo exit >> $$tmp$$
%INFRA_BIN%\sqlplus /nolog < $$tmp$$
del $$tmp$$

echo "Starting Oracle Internet Directory..."
%INFRA_BIN%\oidmon start
%INFRA_BIN%\oidctl server=oidldapd instance=1 start
timeout 10

echo "Starting Enterprise manager Services..."
net start Oracleinfra902EMWebsite

echo "Starting OEM ..."
net start Oracleinfra902ManagementServer

rem net start Oracleinfra902TNSListener
net start Oracleinfra902Agent

echo "Starting up infra services..."
%INFRA_HOME%\opmn\bin\opmnctl startall


echo **********************************************************
echo "Done kickin' up infra!"
echo **********************************************************
echo.

echo **********************************************************
echo "Starting all mid tier services..."
echo **********************************************************

%IAS_HOME%\opmn\bin\opmnctl startall

echo "Starting webcache..."
%IAS_BIN%\webcachectl start

echo "Starting all services..."
net start Oracleias902Discoverer
rem net start Oracleias902ProcessManager
rem net start Oracleias902WebCacheAdmin
rem net start Oracleias902WebCache

echo **********************************************************
echo "Done starting up mid tier!"
echo **********************************************************

pause


Stopping:
=========

@ECHO OFF
TITLE Shutdown all
REM **********************************************************
REM Adjust the following values
set ORACLE_BASE=D:\oracle
set IAS_HOME=%ORACLE_BASE%\ias902
set IAS_BIN=%IAS_HOME%\bin
set INFRA_HOME=%ORACLE_BASE%\infra902
set INFRA_BIN=%INFRA_HOME%\bin
REM **********************************************************

echo **********************************************************
echo Parameters used are:
echo ORACLE_BASE = %ORACLE_BASE%
echo IAS_HOME    = %IAS_HOME%
echo IAS_BIN     = %IAS_BIN%
echo INFRA_HOME  = %INFRA_HOME%
echo INFRA_BIN   = %INFRA_BIN%
echo **********************************************************

echo **********************************************************
echo "Shutting down mid tier..."
echo **********************************************************

echo "Stopping all mid tier services..."
%IAS_HOME%\opmn\bin\opmnctl stopall

echo "Stopping webcache..."
%IAS_BIN%\webcachectl stop

echo "Stopping Discoverer service..."
net stop Oracleias902Discoverer

echo "Sanity stops for WebCache"
net stop Oracleias902WebCache
net stop Oracleias902WebCacheAdmin

echo **********************************************************
echo "Done shutting down mid tier!"
echo **********************************************************
echo.
echo **********************************************************
echo "Shutting down Infrastructure..."
echo **********************************************************

echo "Stopping Enterprise Manager Website"
call %INFRA_BIN%\emctl stop welcome1

echo "Stopping Enterprise Manager Management Console..."
call %INFRA_BIN%\oemctl stop oms sysman/sysman

echo "Stopping Infra Services..."
%INFRA_HOME%\opmn\bin\opmnctl stopall

echo "Stopping Oracle Internet Directory..."
%INFRA_BIN%\oidctl server=oidldapd instance=1 stop
timeout 10
%INFRA_BIN%\oidmon stop

echo "Stopping infra database..."
echo connect sys/change_on_install as sysdba > $$tmp$$
echo shutdown immediate >> $$tmp$$
echo exit >> $$tmp$$
%INFRA_BIN%\sqlplus /nolog < $$tmp$$
del $$tmp$$


echo "Stopping all Remaining NT Services..."
rem net stop Oracleinfra902TNSListener
net stop Oracleinfra902Agent

echo **********************************************************
echo "Done shutting down infra!"
echo **********************************************************

pause


Starting BI:
============

@echo off
title Starting Oracle Reports
rem ********************************************************************
set IAS_HOME=d:\oracle\ias902
set IAS_BIN=%IAS_HOME%\bin
rem ********************************************************************

echo ********************************************************************
echo Parameters used:
echo. 
echo IAS_HOME = %IAS_HOME%
echo IAS_BIN  = %IAS_BIN%
echo ********************************************************************
echo.
echo ********************************************************************
echo Bringing up OC4J_BI_Forms (Business Intelligence/Forms)
echo ********************************************************************
call %IAS_HOME%\dcm\bin\dcmctl start -co OC4J_BI_Forms -v
timeout 5

echo Check to see if the instance really started up:
echo.
call %IAS_HOME%\dcm\bin\dcmctl getReturnStatus
echo Done starting up OC4J_BI_FORMS...

pause


Starting CMSDK:
===============

@echo off
title Starting Oracle CM SDK 9.0.3.1.
rem ********************************************************************
set IAS_HOME=d:\oracle\ias902
set IAS_BIN=%IAS_HOME%\bin
rem ********************************************************************

echo ********************************************************************
echo Parameters used:
echo. 
echo IAS_HOME = %IAS_HOME%
echo IAS_BIN  = %IAS_BIN%
echo ********************************************************************
echo.
echo ********************************************************************
echo Bringing up Domain Controller, note default password is: ifsdp
echo ********************************************************************
call %IAS_HOME%\ifs\cmsdk\bin\ifsctl start
echo Done bringing up Domain Controller
echo.
echo ********************************************************************
echo Bringing up OC4J Instance...
echo ********************************************************************
call %IAS_HOME%\dcm\bin\dcmctl start -co OC4J_iFS_cmsdk -v
timeout 5

echo Check to see if the instance really started up:
echo.
call %IAS_HOME%\dcm\bin\dcmctl getReturnStatus
echo Done starting up OC4J Instance.
echo Done starting up CM SDK.

pause


8. Warning: Stop EMD Before Using DCMCTL Utility. 
=================================================

Note:207208.1 
Subject:  Warning: Stop EMD Before Using DCMCTL Utility 
Type:  BULLETIN 
Status:  PUBLISHED 


PURPOSE 
------- 
 
Issue a warning for the use of the dcmctl utility when administering the Oracle9i 
Application Server (9iAS) Release 2 (9.0.2.0.x). There is now a Patch available which  
resolves the issue of running DCM and EM at the same time. 
  
SCOPE & APPLICATION 
------------------- 
 
This article is intended for 9iAS Administrators. It gives a general description 
of a problem that can occur when dcmctl is used without precautions. 
 
 
DCMCTL RESTRICTIONS 
------------------- 
 
1.  
Do not use dcmctl while EMD (Enterprise Manager Console/Website) is running. 
  
The dcmctl utility is issuing DCM commands to control the state of components 
in 9iAS. The same is done from the EMD, which is generally reachable at the following URLs: 
 
http://yourserver:1810/emd/console 
http://yourserver:1810/ 
 
When the dcmctl utility is used while EMD is running, this may cause out-of-sync 
problems with your 9iAS instance. This is caused by only one DCM daemon being 
available to 'listen' to requests. 
 
   How to Avoid Problems 
   --------------------- 
 
     Stop EMD: 
      $  emctl stop 
     Issue your command with dcmctl 
     When you are done, restart EMD: 
      $  emctl start 
 
2. 
If an Infrastructure and Mid-Tier(s) are installed on same server, EM must be 
stopped when issuing dcmctl from either the Infrastructure or a Mid-tier directories. 
This is because EM is common to all 9iAS instances on the server. Stopping multiple 
instances of EM across multiple servers is not neccessary. The DCM/EM concurrency  
conflict will only come into play with instances on a given machine.   
 
3.  
Do not issue multiple DCM commands at once, and do not issue a DCM command 
while one might still be running. 
 
4. 
If you start a component with DCM, it is recommended to also stop it with DCM.  
If you start a component with the EM Website, it is recommended stop it with  
the EM Website.  
 
 
SOLUTION 
-------- 
 
If out-of-sync errors occur because of EM being up while using dcmctl, then a  
reinstall may be neccessary. Please apply the following patches in order to 
prevent this concurrency problem from happening inadvertently: 
 
Patch 2542920 : 9iAS 9.0.2.1 Core Patchset 
Patch 2591631 : DCM/EM Concurrency Fix 
 
   * The 9.0.2.1 Patchset is a pre-requisite of the DCM Patch.  
   * Both patches should be applied to all associated 9iAS Tiers.  
   * Please refer to the readme for important information.  
   * Future releases (9.0.2.2+) will have this fix included. 
 

9. MISCELLANEOUS:
=================


9.1 Change of hostname:
-----------------------

If you change the HOSTNAME for the repository (infrastructure) database, 
then you need to update the ssoServerMachineName property for the oracle SSO target 
in INFRA_ORACLE_HOME/sysman/emd/targets.xml 

If you change the PORT for the repository database, discoverer is affected - update the port 
for discodemo in tnsnames.ora. 


9.2 Files with IP in the name:
------------------------------

 
9.3 ldapcheck and ldapsearch examples:
--------------------------------------

List users and or passwords: use ldapcheck and ldapsearch


Example 1:
----------

ldapsearch -h uks799 -p 4032 -D "cn=orcladmin" -w your_ias_or_oid_password -b 
"cn=Users,dc=uk,dc=oracle,dc=com" -s sub -v "objectclass=*" 

set OIDHOST=bldel18.in.oracle.com 
set OIDPORT=4032 
if "%1"== ""  goto cont 
if "%2"== "" goto cont 
ldapsearch -h %OIDHOST%  -p %OIDPORT% -D "cn=orcladmin" -w "%1" -b "cn=IAS Infrastructure  
Databases,cn=IAS,cn=Products,cn=OracleContext" -s sub "orclResourceName=%2"  
orclpasswordattribute 
goto :end 
:cont 
echo Correct Syntax is 
echo showpassword.bat orcladminpassword username 
:end 

C:\> showPassword.bat oracle1 orasso 
OrclResourceName=ORASSO,orclReferenceName=iasdb.bldel18.in.oracle.com,cn=IAS Inf 
rastructure Databases,cn=IAS,cn=Products,cn=OracleContext 
orclpasswordattribute=Gbn3Fd24 
 
The orasso password in this example is Gbn3Fd24. 


Example 2:
----------


9.4 dcmctl commands:
--------------------

On a simple 9iAS webcache/j2ee installation, you might try the following command:

F:\oracle\ias902\dcm\bin>dcmctl getstate -V

Current State for Instance:ias902dev.localhost

    Component               Type          Up Status     In Sync Status

===========================================================================

1   home                    oc4j          Up            True
2   HTTP Server             ohs           Up            True
3   OC4J_Demos              oc4j          Up            True
4   OC4J_iFS_cmsdk          oc4j          Up            True


dcmctl getstate -ct ohs   - show status of ohs of the current instance ONLY. 


dcmctl updateConfig            Atempt to update DCM's view of the world after a manual configuration change. 
dcmctl getstate -v             determines which component aren't starting. 
dcmctl resyncInstance -force   force resync of the instance. 


9.5 Fault tolerance:
====================

217368.1 from Metalink - "Advanced Configurations and Topologies for Enterprise Deployments of E-Business"

Hot site Oracle disaster recovery configuration 
Oracle failover with Oracle standby database 
Oracle failover with Oracle9i Dataguard 
Oracle failover with Oracle9i TAF (Transparent Application Failover) 
Oracle failover with Oracle9i Real Application Clusters (RAC) 


   |----------------------------------|
   |Machine A                         |
   |                                  |
   | |-----------------------------|  |
   | |Instance A                   |  |
   | | - Cluster manager           |  |
   | | - Distributed Lock Manager  |  |
   | | - OS Shared Disk Driver     |  |--------------
   |  -----------------------------   |              |
   |----------------------------------|              |
                   |                                 |
                   | interconnect                ------------
                   |                             | Shared   |
   |----------------------------------|          | Disk     |
   |Machine B                         |          | Subsystem|
   |                                  |          ------------
   | |-----------------------------|  |              |
   | |Instance B                   |  |              |
   | | - Cluster manager           |  |              |
   | | - Distributed Lock Manager  |  |              |
   | | - OS Shared Disk Driver     |  |---------------
   |  -----------------------------   |
   |----------------------------------|


Note 1:
-------

Local Clustering Definition 
Local cluster is defined as two or more physical machines (nodes) that share common disk storage 
and logical IP address. Clustered nodes exchange cluster information over heartbeat link(s). 
Cluster software collects information and checks the situation on both nodes. On error condition, 
software will execute a predefined script and switch the clustered services over to a secondary machine. 
Oracle instance, as one of clustered services, will be switched off together with listener process, 
and restarted on the secondary (surviving) node. 

HA Oracle Agent 
HA Oracle Agent software controls Oracle database activity on Sun Cluster nodes. The agent performs 
fault checking using two processes on the local node and two process on the remote node by querying 
V$SYSSTAT table for active sessions. If the database has no active sessions, HA Agent will open 
a test transaction (connect and execute in serial create, insert, update, drop table commands). 
Return error codes from HA Agent have been validated against a special action file on location. 

/etc/opt/SUNWscor/haoracle_config_V1: 

# Action file for HA-DBMS Oracle fault monitor
# State DBMS_er proc_di log_msg timeout int_err new_sta action  message
---
co      *       *       *       *       1       *       
  stop    Internal HA-DBMS Oracle error connecting to db 
on      28      *       *       *       *       di      
  none    Session killed by DBA, will reconnect
*       50      *       *       *       *       di      
  takeover  O/S error occurred while obtaining an enqueue
co      0       *       *       1       0       *       
  restart A timeout has occured during connect
--

Takeover - cluster software will switch to another node. 

Stop - cluster will stop DBMS 

None - no action taken 

Restart - database restarted locally on the same node 

HA Oracle Agent requires Oracle configuration files (listener.ora, oratab and tnsnames.ora) 
on unique predefined location /var/opt/oracle


Note 2:
-------

You Asked (Jump to Tom's latest followup)

If I want to use Oracle Fail Safe and Dataguard do the servers have to be 
clustered?  Right now I have a primary database on one server and a separate 
server for the logical standby database.  I want automatic failover, but it 
looks like Oracle Fail Safe requires clustered servers.

The DATAGUARD manual mentions that you can use ORACLE FAIL SAFE on the windows 
platform, but the ORACLE FAIL SAFE documentation doesn't say squat about 
DATAGUARD or how to configure for it.   Is there any documentation of this 
subject that you can refer me to?


and we said...

Fail Safe is a clustering solution.

The two (data guard & failsafe) are complimentary but somewhat orthogonal here.

Failsafe is designed to keep the single database up and available -- in a single 
data center.  As long as that room exists -- failsafe keeps the database up.

data guard is a disaster recovery solution.  It is for when the room the data 
center is in "goes away" for whatever reason.

Data guard wants the machines to be independent (no clusters) of eachother and 
separated by some geographic distance.

Failsafe, like 9i RAC, wants the machines to be tethered together - sitting 
right next to eachother in a cluster.

Failsafe is HA (high availability)
Data guard is DR (disaster recovery)


Failsafe will give you automated failover.  As long as the data center exists, 
that database is up.

With data guard -- you do not WANT automated failover (many *think* they do but 
you don't).  Do you really want your DR solution to kick in due to a WAN 
failure?  No, not really.  For DR to take over, you want a human to say "yup, 
data center burnt to the ground, lets head for the mountains".  You do not want 
the DR site to kick in because "it thinks the primary site is gone" -- you need 
to tell it "the primary site is gone".  In a cluster -- the machines are very 
aware of eachother and automated failover is "safe"


So, data guards reference to failsafe is incidental.
That failsafe doesn't talk about data guard is of no real consequence.

They are independent feature/functions. 


Note 3: terms:
--------------

Note 4:
-------

FAQ RAC:

Real Application Clusters
General RAC
Is it supported to install CRS and RAC as different users. (09-SEP-04) 
I have changed my spfile with alter system set <parameter_name> =.... scope=spfile. The spfile is on 
ASM storage and the database will not start. (18-APR-04) 
Is it difficult to transition from Single Instance to RAC? (18-JUL-05) 
What are the dependencies between OCFS and ASM in Oracle10g ? (05-MAY-05) 
What software is necessary for RAC? Does it have a separate installation CD to order? (05-MAY-05) 
Do we have to have Oracle RDBMS on all nodes? (02-APR-04) 
What kind of HW components do you recommend for the interconnect? (02-APR-04) 
Is rcp and/or rsh required for normal RAC operation ? (06-NOV-03) 
Are there any suggested roadmaps for implementing a new RAC installation? (26-NOV-02) 
What is Cache Fusion and how does this affect applications? (26-NOV-02) 
Can I use iSCSI storage with my RAC cluster? (13-JUL-05) 
Can I use RAC in a distributed transaction processing environment? (16-JUN-05) 
Is it a good idea to add anti-virus software to my RAC cluster? (31-JAN-05) 
When configuring the NIC cards and switch for a GigE Interconnect should it be set to FULL or Half duplex in RAC? (05-NOV-04) 
What would you recomend to customer, Oracle clusterware or Vendor Clusterware (I.E. MC Service Guard, HACMP, Sun Cluster, Veritas etc.) with Oracle Database 10g Real Application Clusters? (21-OCT-04) 
What is Standard Edition RAC? (01-SEP-04) 
High Availability
If I use Services with Oracle Database 10g, do I still need to set up Load Balancing ? (16-JUN-05) 
Why do we have a Virtual IP (VIP) in 10g? Why does it just return a dead connection when its primary node fails? (12-MAR-04) 
I am receiving an ORA-29740 error. What should I do? (02-DEC-02) 
Can RMAN backup Real Application Cluster databases? (26-NOV-02) 
What is Server-side Transparent Application Failover (TAF) and how do I use it? (07-JUL-05) 
What is CLB_GOAL and how should I set it? (16-JUN-05) 
Can I use TAF and FAN/FCF? (16-JUN-05) 
What clients provide integration with FAN and FCF? (28-APR-05) 
What are my options for load balancing with RAC? Why do I get an uneven number of connections on my instances? (15-MAR-05) 
Can our 10g VIP fail over from NIC to NIC as well as from node to node ? (10-DEC-04) 
Can I use ASM as mechanism to mirror the data in an Extended RAC cluster? (18-OCT-04) 
What does the Virtual IP service do? I understand it is for failover but do we need a separate network card? Can we use the existing private/public cards? What would happen if we used the public ip? (15-MAR-04) 
What do the VIP resources do once they detect a node has failed/gone down? Are the VIPs automatically acquired, and published, or is manual intervention required? Are VIPs mandatory? (15-MAR-04) 
Scalability
I am seeing the wait events 'ges remote message', 'gcs remote message', and/or 'gcs for action'. What should I do about these? (02-APR-04) 
What are the changes in memory requirements from moving from single instance to RAC? (02-DEC-02) 
What is the Load Balancing Advisory? (16-JUN-05) 
What is Runtime Connection Load Balancing? (16-JUN-05) 
How do I enable the load balancing advisory? (16-JUN-05) 
Manageability
How do I stop the GSD? (22-MAR-04) 
How should I deal with space management? Do I need to set free lists and free list groups? (16-JUN-03) 
I was installing RAC and my Oracle files did not get copied to the remote node(s). What went wrong? (26-NOV-02) 
What is the Cluster Verification Utiltiy (cluvfy)? (16-JUN-05) 
What versions of the database can I use the cluster verification utility (cluvfy) with? (16-JUN-05) 
What are the implications of using srvctl disable for an instance in my RAC cluster? I want to have it available to start if I need it but at this time to not want to run this extra instance for this database. (31-MAR-05) 
Platform Specific
How many nodes can be had in an HP/Sun/IBM/Compaq/NT/Linux cluster? (21-OCT-04) 
Is crossover cable supported as an interconnect with 9iRAC/10gRAC on any platform ? (21-FEB-05) 
Is it possible to run RAC on logical partitions (i.e. LPARs) or virtual separate servers. (18-MAY-04) 
Can the Oracle Database Configuration Assistant (DBCA) be used to create a database with Veritas DBE / AC 3.5? (10-JAN-03) 
How do I check RAC certification? (26-NOV-02) 
Where I can find information about how to setup / install RAC on different platforms ? (08-AUG-02) 
Is Veritas Storage Foundation 4.0 supported with RAC? (05-OCT-04) 
Platform Specific -- Linux
Is 3rd Party Clusterware supported on Linux such as Veritas or Redhat? (11-MAY-05) 
Can you have multiple RAC $ORACLE_HOME's on Linux? (19-JUL-05) 
After installing patchset 9013 and patch_2313680 on Linux, the startup was very slow (20-DEC-04) 
Is CFS Available for Linux? (20-DEC-04) 
Where can I find more information about hangcheck-timer module on Linux ? And how do we configure hangcheck-timer module ? (20-DEC-04) 
Can RAC 10g and 9i RAC be installed and run on the same physical Linux cluster? (20-DEC-04) 
Is the hangcheck timer still needed with Oracle Database 10g RAC? (20-DEC-04) 
How to configure bonding on Suse SLES8. (29-NOV-04) 
How to configure bonding on Suse SLES9. (29-NOV-04) 
Platform Specific -- Solaris
Does RAC run faster with Sun-cluster or Veritas cluster-ware? (these being alternatives with Sun hardware) Is there some clusterware that would make RAC run faster? (20-DEC-04) 
Platform Specific -- HP-UX
Is HMP supported with 10g on all HP platforms ? (20-DEC-04) 
Platform Specific -- Windows
Does the Oracle Cluster File System (OCFS) support network access through NFS or Windows Network Shares? (27-JAN-05) 
Can I run my 9i RAC and RAC 10g on the same Windows cluster? (01-JUL-05) 
My customer wants to understand what type of disk caching they can use with their Windows RAC Cluster, the install guide tells them to disable disk caching? (31-MAR-05) 
Platform Specific -- IBM AIX
Do I need HACMP/GPFS to store my OCR/Voting file on a shared device. (20-DEC-04) 
Platform Specific -- IBM-z/OS (Mainframe)
Can I run Oracle RAC 10g on my IBM Mainframe Sysplex environment (z/OS)? (07-JUL-05) 
Diagnosibility
What are the cdmp directories in the background_dump_dest used for? (11-AUG-03) 
EBusiness Suite with RAC
What is the optimal migration path to be used while migrating the E-Business suite to RAC? (08-JUL-05) 
Is the Oracle E-Business Suite (Oracle Applications) certified against RAC? (04-JUN-03) 
Can I use TAF with e-Business in a RAC environment? (02-APR-03) 
How to configure concurrent manager in a RAC environment? (20-SEP-02) 
Should functional partitioning be used with Oracle Applications? (20-SEP-02) 
Which e-Business version is prefereable? (20-SEP-02) 
Can I use Automatic Undo Management with Oracle Applications? (20-SEP-02) 
Clustered File Systems
Can I use OCFS with SE RAC? (01-SEP-04) 
What are the maximum number of nodes under OCFS on Linux ? (06-NOV-03) 
Where can I find documentation on OCFS ? (06-NOV-03) 
What files can I put on Linux OCFS? (14-AUG-03) 
Is Sun QFS supported with RAC? What about Sun GFS? (19-JAN-05) 
Is Red Hat GFS(Global File System) is certified by Oracle for use with Real Application Clusters? (22-NOV-04) 
Oracle Clusterware (CRS)
Is it possible to use ASM for the OCR and voting disk? (19-JUL-05) 
Is it supported to rerun root.sh from the Oracle Clusterware installation ? (05-MAY-05) 
Is it supported to allow 3rd Party Clusterware to manage Oracle resources (instances, listeners, etc) and turn off Oracle Clusterware management of these? (05-MAY-05) 
What is the High Availability API? (05-MAY-05) 
How to move the OCR location ? (24-MAR-04) 
Does Oracle Clusterware support application vips? (11-JUL-05) 
Why is the home for Oracle Clusterware not recommended to be subdirectory of the Oracle base directory? (11-JUL-05) 
Can I use Oracle Clusterware to provide cold failover of my 9i or 10g single instance Oracle Databases? (01-JUL-05) 
How do I put my application under the control of Oracle Clusterware to achieve higher availability? (16-JUN-05) 
How do I protect the OCR and Voting in case of media failure? (05-MAY-05) 
How do I use multiple network interfaces to provide High Availability for my interconnect with Oracle Clusterware? (06-APR-05) 
How to Restore a Lost Voting Disk used by Oracle Clusterware 10g (02-DEC-04) 
With Oracle Clusterware 10g, how do you backup the OCR? (02-DEC-04) 
Does the hostname have to match the public name or can it be anything else? (05-NOV-04) 
Is it a requirement to have the public interface linked to ETH0 or does it only need to be on a ETH lower than the private interface?: - public on ETH1 - private on ETH2 (05-NOV-04) 
How do I restore OCR from a backup? On Windows, can I use ocopy? (27-OCT-04) 
What should the permissions be set to for the voting disk and ocr when doing a RAC Install? (22-OCT-04) 
Which processes access to OCR ? (22-OCT-04) 
Can I change the name of my cluster after I have created it when I am using Oracle Database 10g Clusterware? (05-OCT-04) 
Can I change the public hostname in my Oracle Database 10g Cluster using Oracle Clusterware? (05-OCT-04) 
During CRS installation, I am asked to define a private node name, and then on the next screen asked to define which interfaces should be used as private and public interfaces. What information is required to answer these questions? (24-MAR-04) 
Answers
I have changed my spfile with alter system set <parameter_name> =.... scope=spfile. The spfile is on 
ASM storage and the database will not start.
How to recover: 
 

In $ORACLE_HOME/dbs

. oraenv <instance_name>

sqlplus "/ as sysdba" 

startup nomount

create pfile='recoversp' from spfile 
/
shutdown immediate
quit
 
Now edit the newly created pfile to change the parameter to something sensible.

Then:
 
sqlplus "/ as sysdba"

startup pfile='recoversp' (or whatever you called it in step one).
 
create spfile='+DATA/GASM/spfileGASM.ora' from pfile='recoversp' 
/
N.B.The name of the spfile is in your original init<instance_name>.ora so adjust to suit

shutdown immediate
startup
quit

   Modified: 18-APR-04    Ref #: ID-5068 


--------------------------------------------------------------------------------

Is it supported to install CRS and RAC as different users.
Yes, CRS and RAC can be installed as different users. The CRS user and the RAC user must both have "oinstall" as their primary group, and the RAC user should be a member of the OSDBA group. 
   Modified: 09-SEP-04    Ref #: ID-5769 


--------------------------------------------------------------------------------

Do we have to have Oracle RDBMS on all nodes?
Each node of a cluster will typically have the RDBMS and RAC software loaded on it, but not actual datafiles (these need to be available via shared disk). For example, if you wish to run RAC on 2 nodes of a 4-node cluster, you would need to install it on all nodes, but it would only need to be licensed on the two nodes running the RAC database. Note that using a clustered file system, or NAS storage can provide a configuration that does not necessarily require the Oracle binaries to be installed on all nodes. 
   Modified: 02-APR-04    Ref #: ID-4024 


--------------------------------------------------------------------------------

What kind of HW components do you recommend for the interconnect?
The general recommendation for the interconnect is to provide the highest bandwith interconnect, together with the lowest latency protocol that is available for a given platform. In practice, Gigabit Ethernet with UDP has proven sufficient in every case it has been implemented, and tends to be the lowest common denominator across platforms. 
   Modified: 02-APR-04    Ref #: ID-4049 


--------------------------------------------------------------------------------

Are there any suggested roadmaps for implementing a new RAC installation?
Yes, Oracle Support recommends the following best practices roadmap to successfully implement RAC:

A Smooth Transition to Real Application Clusters

The Purpose of this document is to provide a best practices road map to successfully implement Real Application Clusters.

   Modified: 26-NOV-02    Ref #: ID-4062 


--------------------------------------------------------------------------------

What is Cache Fusion and how does this affect applications?
Cache Fusion is a new parallel database architecture for exploiting clustered computers to achieve scalability of all types of applications. Cache Fusion is a shared cache architecture that uses high speed low latency interconnects available today on clustered systems to maintain database cache coherency. Database blocks are shipped across the interconnect to the node where access to the data is needed. This is accomplished transparently to the application and users of the system. Cache Fusion scales to clusters with a large numbers of nodes. For more information about cache fusion see the following links: 
Additional Information can be found at:

Understanding 9i Real Application Clusters Cache Fusion

There is also a whitepaper ""Cache Fusion Delivers Scalability"" available at http://otn.oracle.com/products/oracle9i/content.html

Cache Fusion in the Oracle Documentation

   Modified: 26-NOV-02    Ref #: ID-4065 


--------------------------------------------------------------------------------

Is it difficult to transition from Single Instance to RAC?
If the cluster and the cluster software are not present, these components must be installed and configured.  The RAC option must be added using the Oracle Universal Installer, which necessitates the existing DB instance must be shut down.  There are no changes necessary on the user data within the database.  However, a shortage of freelists and freelist groups can cause contention with header blocks of tables and indexes as multiple instances vie for the same block.  This may cause a performance problem and require data partitioning.  However, the need for these changes should be rare. 

Recommendation: apply automatic space segment management to perform these changes automatically.  The free space management will replace the freelists and freelist groups and is better.  The database requires one Redo thread and one Undo tablespace for each instance, which are easily added with SQL commands or with Enterprise Manager tools.

Datafiles will need to be moved to either a clustered file system (CFS) or raw devices so that all nodes can access it.  Also, the MAXINSTANCES parameter in the control file must be greater than or equal to number of instances you will start in the cluster.

For more detailed information, please see Migrating from single-instance to RAC in the Oracle Documentation

With Oracle Database 10g Release 2, $ORACLE_HOME/bin/rconfig tool can be used to convert Single instance database to RAC. This tool takes in a xml input file and convert the Single Instance database whose information is provided in the xml. You can run this tool in "verify only" mode prior to performing actual conversion. This is documented in the RAC admin book and a sample xml can be found $ORACLE_HOME/assistants/rconfig/sampleXMLs/ConvertToRAC.xml. Grid Control 10g Release 2 provides a easy to use wizard to perform this function.
Note: Please be aware that you may hit bug 4456047 (shutdown immediate hangs) as you convert the database. The bug is updated with workaround and the w/a should is release noted as well.

   Modified: 18-JUL-05    Ref #: ID-4101 


--------------------------------------------------------------------------------

What are the dependencies between OCFS and ASM in Oracle10g ?
In an Oracle Database 10g RAC environment, there is no dependency between Automatic Storage Management (ASM) 
and Oracle Cluster File System (OCFS).
OCFS is not required if you are using Automatic Storage Management (ASM) for database files. You can use OCFS 
on Windows( Version 2 on Linux ) for files that ASM does not handle - binaries (shared oracle home), 
trace files, etc. Alternatively, you could place these files on local file systems even though it's not 
as convenient given the multiple locations.
If you do not want to use ASM for your database files, you can still use OCFS for database files in Oracle Database 10g.
Please refer to ASM and OCFS Positioning 
   Modified: 05-MAY-05    Ref #: ID-4116 


--------------------------------------------------------------------------------

Is rcp and/or rsh required for normal RAC operation ?
rcp"" and ""rsh"" are not required for normal RAC operation. However ""rsh"" and ""rcp"" should to be enabled for RAC and patchset installation. In future releases, ssh will be used for these operations. 
   Modified: 06-NOV-03    Ref #: ID-4117 


--------------------------------------------------------------------------------

What software is necessary for RAC? Does it have a separate installation CD to order?
Real Application Clusters is an option of Oracle Database and therefore part of the Oracle Database CD. With Oracle 9i, RAC is part of Oracle9i Enterprise Edition. If you install 9i EE onto a cluster, and the Oracle Universal Installer (OUI) recognizes the cluster, you will be provided the option of installing RAC. Most UNIX platforms require an OSD installation for the necessary clusterware. For Intel platforms (Linux and Windows), Oracle provides the OSD software within the Oracle9i Enterprise Edition release.

With Oracle Database 10g, RAC is an option of EE and available as part of SE. Oracle provides Oracle Clusterware on its own CD included in the database CD pack.

Please check the certification matrix (Note 184875.1) or with the appropriate platform vendor for more information.

@ Sent by Karin Brandauer

   Modified: 05-MAY-05    Ref #: ID-4132 


--------------------------------------------------------------------------------

What is Standard Edition RAC?
With Oracle Database 10g, a customer who has purchased Standard Edition is allowed to use the RAC option within the limitations of Standard Edition(SE). For licensing restrictions you should read the Oracle Database 10g License Doc. At a high level this means that you can have a max of 4 cpus in the cluster, you must use ASM for all database files. Oracle Cluster File System (OCFS) is not supported for use with SE RAC. 
   Modified: 01-SEP-04    Ref #: ID-5750 


--------------------------------------------------------------------------------

Can I use iSCSI storage with my RAC cluster?
For iSCSI, Oracle has made the statement that, as a block protocol, this technology does not require validation for single instance database. There are many early adopter customers of iSCSI running Oracle9i and Oracle Database 10g. As for RAC, Oracle has chosen to validate the iSCSI technology (not each vendor's targets) for the 10g platforms - this has been completed for Linux, Unix and Windows. For Windows we have tested up to 4 nodes - Any Windows iSCSI products that are supported by the host and storage device are supported by Oracle. No vendor-specific information will be posted on Certify. 
   Modified: 13-JUL-05    Ref #: ID-5788 


--------------------------------------------------------------------------------

What would you recomend to customer, Oracle clusterware or Vendor Clusterware (I.E. MC Service Guard, HACMP, 
Sun Cluster, Veritas etc.) with Oracle Database 10g Real Application Clusters?

You will be installing and using Oracle Clusterware whether or not you use the Vendor Clusterware. The question 
you need to ask is whether the Vendor Clusterware gives you something that Oracle Clusterware does not. 
Is the RAC database on the same server as the application server? Are there any other processes on the same server 
as the database that you require Vendor Clusterware to fail over to another server in the cluster if the server 
it is running on fails? IF this is the case, you may want the vendor clusterware, if not, why spend the extra money 
when Oracle Clusterware supplies everything you need to for the clustered database included with your RAC license. 
   Modified: 21-OCT-04    Ref #: ID-5968 


--------------------------------------------------------------------------------

When configuring the NIC cards and switch for a GigE Interconnect should it be set to FULL or Half duplex in RAC?
You've got to use Full Duplex, regardless of RAC or not, but for all network communication. Half Duplex means you can only either send OR receive at the same time. 
   Modified: 05-NOV-04    Ref #: ID-6048 


--------------------------------------------------------------------------------

Is it a good idea to add anti-virus software to my RAC cluster?
For customers who choose to run anti-virus (AV) software on their database servers, they should be aware that the nature of AV software is that disk IO bandwidth is reduced slightly as most AV software checks disk writes/reads. Also, as the AV software runs, it will use CPU cycles that would normally be consumed by other server processes (e.g your database instance). As such, databases will have faster performance when not using AV software. As some AV software is known to lock the files whilst is scans then it is a good idea to exclude the Oracle Datafiles/controlfiles/logfiles from a regular AV scan 
   Modified: 31-JAN-05    Ref #: ID-6595 


--------------------------------------------------------------------------------

Can I use RAC in a distributed transaction processing environment?
YES. Best practices is to have all tightly coupled branches of a distributed transaction running on a RAC database must run on the same instance. Between transactions and between services, transactions can be load balanced across all of the database instances.
You can use services to manage DTP environments. By defining the DTP property of a service, the service is guaranteed to run on one instance at a time in a RAC database. All global distributed transactions performed through the DTP service are ensured to have their tightly-coupled branches running on a single RAC instance. 
   Modified: 16-JUN-05    Ref #: ID-6864 


--------------------------------------------------------------------------------

Why do we have a Virtual IP (VIP) in 10g? Why does it just return a dead connection when its primary node fails?
Its all about availability of the application.
When a node fails, the VIP associated with it is supposed to be automatically failed over to some other node. When this occurs, two things happen. (1) the new node re-arps the world indicating a new MAC address for the address. For directly connected clients, this usually causes them to see errors on their connections to the old address; (2) Subsequent packets sent to the VIP go to the new node, which will send error RST packets back to the clients. This results in the clients getting errors immediately.
This means that when the client issues SQL to the node that is now down, or traverses the address list while connecting, rather than waiting on a very long TCP/IP time-out (~10 minutes), the client receives a TCP reset. In the case of SQL, this is ORA-3113. In the case of connect, the next address in tnsnames is used.
Without using VIPs, clients connected to a node that died will often wait a 10 minute TCP timeout period before getting an error.
As a result, you don't really have a good HA solution without using VIPs. 
   Modified: 12-MAR-04    Ref #: ID-4609 


--------------------------------------------------------------------------------

If I use Services with Oracle Database 10g, do I still need to set up Load Balancing ?
Yes, Services allow you granular definition of workload and the DBA can dynamically define which instances provide the service. Connection Load Balancing still needs to be set up to allow the user connections to be balanced across all instances providing a service. 
   Modified: 16-JUN-05    Ref #: ID-6731 


--------------------------------------------------------------------------------

Can RMAN backup Real Application Cluster databases?
Absolutely. RMAN can be configured to connect to all nodes within the cluster to parallelize the backup of the database files and archive logs. If files need to be restored, using set AUTOLOCATE ON alerts RMAN to search for backed up files and archive logs on all nodes.

RAC with RMAN in the Oracle Documentation

   Modified: 26-NOV-02    Ref #: ID-4035 


--------------------------------------------------------------------------------

I am receiving an ORA-29740 error. What should I do?
This error can occur when problems are detected on the cluster:

Error: ORA-29740 (ORA-29740)
Text: evicted by member %s, group incarnation %s 
---------------------------------------------------------------------------
Cause:  This member was evicted from the group by another member of the 
        cluster database for one of several reasons, which may include a 
        communications error in the cluster, failure to issue a heartbeat 
        to the control file, etc. 
Action: Check the trace files of other active instances in the cluster 
        group for indications of errors that caused a reconfiguration. 

For more information on troubleshooting this error, see the following Metalink note:

Note 219361.1
Troubleshooting ORA-29740 in a RAC Environment

   Modified: 02-DEC-02    Ref #: ID-4093 


--------------------------------------------------------------------------------

What does the Virtual IP service do? I understand it is for failover but do we need a separate network card? Can we use the existing private/public cards? What would happen if we used the public ip?
The 10g Virtual IP Address (VIP) exists on every RAC node for public network communication. All client communication should use the VIPs in their TNS connection descriptions. The TNS ADDRESS_LIST entry should direct clienst to VIPs rather than using hostnames. During normal runtime, the behaviour is the same as hostnames, however when the node goes down or is shutdown the VIP is hosted elsewhere on the cluster, and does not accept connection requests. This results in a silent TCP/IP error and the client fails immediately to the next TNS address. If the network interface fails within the node, the VIP can be configured to use alternate interfaces in the same node. The VIP must use the public interface cards. There is no requirement to purchase additional public interface cards (unless you want to take advantage of within-node card failover.) 
   Modified: 15-MAR-04    Ref #: ID-4636 


--------------------------------------------------------------------------------

What do the VIP resources do once they detect a node has failed/gone down? Are the VIPs automatically acquired, and published, or is manual intervention required? Are VIPs mandatory?
When a node fails, the VIP associated with the failed node is automatically failed over to one of the other nodes in the cluster. When this occurs, two things happen: 
The new node re-arps the world indicating a new MAC address for this IP address. For directly connected clients, this usually causes them to see errors on their connections to the old address; 
Subsequent packets sent to the VIP go to the new node, which will send error RST packets back to the clients. This results in the clients getting errors immediately. 
In the case of existing SQL conenctions, errors will typically be in the form of ORA-3113 errors, while a new connection using an address list will select the next entry in the list. Without using VIPs, clients connected to a node that died will often wait for a TCP/IP timeout period before getting an error. This can be as long as 10 minutes or more. As a result, you don't really have a good HA solution without using VIPs. 
   Modified: 15-MAR-04    Ref #: ID-4638 


--------------------------------------------------------------------------------

What are my options for load balancing with RAC? Why do I get an uneven number of connections on my instances?
All the types of load balancing available currently (9i-10g) occur at connect time.
This means that it is very important how one balances connections and what these connections do on a long term basis.
Since establishing connections can be very expensive for your application, it is good programming practice to connect once and stay connected. This means one needs to be careful as to what option one uses. Oracle Net Services provides load balancing or you can use external methods such as hardware based or clusterware solutions.
The following options exist:
Random
Either client side load balancing or hardware based methods will randomize the connections to the instances.
On the negative side this method is unaware of load on the connections or even if they are up meaning they might cause waits on TCP/IP timeouts.
Load Based
Server side load balancing (by the listener) redirects connections by default depending on the RunQ length of each of the instances. This is great for short lived connections. Terrible for persistent connections or login storms. Do not use this method for connections from connection pools or applicaton servers
Session Based
Server side load balancing can also be used to balance the number of connections to each instance. Session count balancing is method used when you set a listener parameter, prefer_least_loaded_node_listener-name=off. Note listener name is the actual name of the listener which is different on each node in your cluster and by default is listener_nodename.
Session based load balancing takes into account the number of sessions connected to each node and then distributes ne connections to balance the number of sessions across the different nodes. 
   Modified: 15-MAR-05    Ref #: ID-4940 


--------------------------------------------------------------------------------

Can I use ASM as mechanism to mirror the data in an Extended RAC cluster?
Yes, but it cannot replicate everything that needs replication.
ASM works well to replicate any object you can put in ASM. But you cannot put the OCR or Voting Disk in ASM.
In 10gR1 they can either be mirrored using a different mechanism (which could then be used instead of ASM) or the OCR needs to be restored from backup and the Voting Disk can be recreated.
In the future we are looking at providing Oracle redundancy for both. 
   Modified: 18-OCT-04    Ref #: ID-5948 


--------------------------------------------------------------------------------

Can our 10g VIP fail over from NIC to NIC as well as from node to node ?
Yes the 10g VIP implementation is capable from failing over within a node from NIC to NIC and back if the failed NIC is back online again, and also we fail over between nodes. The NIC to NIC failover is fully redundant if redundant switches are installed. 
   Modified: 10-DEC-04    Ref #: ID-6348 


--------------------------------------------------------------------------------

What clients provide integration with FAN and FCF?
With Oracle Database 10g Release 1, JDBC clients (both thick and thin driver) are integrated with FAN by providing FCF. With Oracle Database 10g Release 2, we have added ODP.NET and OCI. Other applications can integrate with FAN by using the API to subscribe to the FAN events. 
   Modified: 28-APR-05    Ref #: ID-6735 


--------------------------------------------------------------------------------

What is CLB_GOAL and how should I set it?
CLB_GOAL is the connection load balancing goal for a service. There are 2 options, CLB_GOAL_SHORT and CLB_GOAL_LONG (default).
Long is for applications that have long-lived connections. This is typical for connection pools and SQL*Forms sessions. Long is the default connection load balancing goal.
Short is for applications that have short-lived connections.
The GOAL for a service can be set with EM or DBMS_SERVICE.
Note: You must still configure load balancing with Oracle Net Services 
   Modified: 16-JUN-05    Ref #: ID-6854 


--------------------------------------------------------------------------------

Can I use TAF and FAN/FCF?
With Oracle Database 10g Release 1, NO. With Oracle Database 10g Release 2, the answer is YES for OCI and ODP.NET, it is recommended. For JDBC, you should not use TAF and FCF even with the Thick JDBC driver. 
   Modified: 16-JUN-05    Ref #: ID-6866 


--------------------------------------------------------------------------------

What is Server-side Transparent Application Failover (TAF) and how do I use it?
Oracle Database 10g Release 2, introduces server-side TAF when using services. After you create a service, you can use the dbms_service.modify_service pl/sql procedure to define the TAF policy for the service. Only the basic method is supported. Note this is different than the TAF policy (traditional client TAF) that is supported by srvctl and EM Services page. If your service has a server side TAF policy defined, then you do not have to encode TAF on the client connection string. If the instance where a client is connected, fails, then the connection will be failed over to another instance in the cluster that is supporting the service. All restrictions of TAF still apply.
NOTE: both the client and server must be 10.2 and aq_ha_notifications must be set to true for the service.
Sample code to modify service: 
execute dbms_service.modify_service (service_name => 'gl.us.oracle.com' -
, aq_ha_notifications => true -
, failover_method => dbms_service.failover_method_basic -
, failover_type => dbms_service.failover_type_select -
, failover_retries => 180 -
, failover_delay => 5 -
, clb_goal => dbms_service.clb_goal_long); 

   Modified: 07-JUL-05    Ref #: ID-6912 


--------------------------------------------------------------------------------

I am seeing the wait events 'ges remote message', 'gcs remote message', and/or 'gcs for action'. What should I do about these?
These are idle wait events and can be safetly ignored. The 'ges remote message' might show up in a 9.0.1 statspack report as one of the top wait events. To have this wait event not show up you can add this event to the PERFSTAT.STATS$IDLE_EVENT table so that it is not listed in Statspack reports.

   Modified: 02-APR-04    Ref #: ID-4092 


--------------------------------------------------------------------------------

What are the changes in memory requirements from moving from single instance to RAC?
If you are keeping the workload requirements per instance the same, then about 10% more buffer cache and 15% more shared pool is needed.  The additional memory requirement is due to data structures for coherency management.  The values are heuristic and are mostly upper bounds.  Actual esource usage can be monitored by querying current and maximum columns for the gcs resource/locks and ges resource/locks entries in V$RESOURCE_LIMIT.

But in general, please take into consideration that memory requirements per instance are reduced when the same user population is distributed over multiple nodes.  In this case:

Assuming the same user population N number of nodes M buffer cache for a single system then

(M / N) + ((M / N )*0.10) [ + extra memory to compensate for failed-over users ]

Thus for example with a M=2G & N=2 & no extra memory for failed-over users

=( 2G / 2 ) + (( 2G / 2 )) *0.10

=1G + 100M

   Modified: 02-DEC-02    Ref #: ID-4030 


--------------------------------------------------------------------------------

What is the Load Balancing Advisory?
To assist in the balancing of application workload across designated resources, Oracle Database 10g Release 2 provides the Load Balancing Advisory. This Advisory monitors the current workload activity across the cluster and for each instance where a service is active; it provides a percentage value of how much of the total workload should be sent to this instance as well as service quality flag. The feedback is provided as an entry in the Automatic Workload Repository and a FAN event is published. 
   Modified: 16-JUN-05    Ref #: ID-6858 


--------------------------------------------------------------------------------

What is Runtime Connection Load Balancing?
Runtime connection load balancing enables the connection pool to route incoming work requests to the available database connection that will provide it with the best service. This will provide the best service times globally, and routing responds fast to changing conditions in the system. Oracle has implemented runtime connection load balancing with ODP.NET and JDBC connection pools. Runtime Connection Load Balancing is tightly integrated with the automatic workload balancing features introduced with Oracle Database 10g I.E. Services, Automatic Workload Repository, and the new Load Balancing Advisory. 
   Modified: 16-JUN-05    Ref #: ID-6860 


--------------------------------------------------------------------------------

How do I enable the load balancing advisory?
The load balancing advisory requires the use of services and Oracle Net connection load balancing.
To enable it, on the server: set a goal (service_time or throughput, for ODP.NET enable AQ_HA_NOTIFICATIONS=>true, and set CLB_GOAL ) on your service.
For client, you must be using the connection pool.
For JDBC, enable the datasource parameter FastConnectionFailoverEnabled.
For ODP.NET enable the datasource parameter Load Balancing=true. 
   Modified: 16-JUN-05    Ref #: ID-6862 


--------------------------------------------------------------------------------

How do I stop the GSD?
If you are on 9.0 on Unix you would issue:

$ ps -ef | grep jre
$ kill -9 <gsd process> 

Stop the OracleGSDService on Windows.

Note: Make sure that this is the process in use by GSD

If you are on 9.2 you would issue:

$ gsdctl stop


   Modified: 22-MAR-04    Ref #: ID-4091 


--------------------------------------------------------------------------------

How should I deal with space management? Do I need to set free lists and free list groups?
Manually setting free list groups is a complexity that is no longer required.

We recommend using Automatic Segment Space Management rather than trying to manage space manually. Unless you are migrating from an earlier database version with OPS and have already built and tuned the necessary structures, Automatic Segment Space Management is the preferred approach.

Automatic Segment Space Management is NOT the default, you need to set it.

For more information see:

Automatic Space Segment Management in RAC Environments

   Modified: 16-JUN-03    Ref #: ID-4074 


--------------------------------------------------------------------------------

I was installing RAC and my Oracle files did not get copied to the remote node(s). What went wrong?
First make sure the cluster is running and is available on all nodes. You should be able to see all nodes 
when running an 'lsnodes -v' command. 

If lsnodes shows that all members of the cluster are available, then you may have an rcp/rsh problem on Unix 
or shares have not been configured on Windows. 

You can test rcp/rsh on Unix by issuing the following from each node:

[node1]/tmp> touch test.tst
[node1]/tmp> rcp test.tst node2:/tmp

[node2]/tmp> touch test.tst
[node2]/tmp> rcp test.tst node1:/tmp

On Windows, ensure that each node has administrative access to all these directories within the Windows environment by running the following at the command prompt: 

NET USE \\host_name\C$ 

Clustercheck.exe also checks for this.

More information can be found in the Step-by-Step RAC notes available on Metalink. To find these search Metalink for 'Step-by-Step Installation of RAC'.

   Modified: 26-NOV-02    Ref #: ID-4094 


--------------------------------------------------------------------------------

What are the implications of using srvctl disable for an instance in my RAC cluster? I want to have it available 
to start if I need it but at this time to not want to run this extra instance for this database.
During node reboot, any disabled resources will not be started by the Clusterware, therefore this instance 
will not be restarted. It is recommended that you leave the vip, ons,gsd enabled in that node. For example, 
VIP address for this node is present in address list of database services, so a client connecting to these services 
will still reach some other database instance providing that service via listener redirection. J
ust be aware that by disabling an Instance on a node, all that means is that the instance itself is not starting. 
However, if the database was originally created with 3 instances, that means there are 3 threads of redo. 
So, while the instance itself is disabled, the redo thread is still enabled, and will occasionally cause 
log switches. The archived logs for this 'disabled' instance would still be needed in any potential database 
recovery scenario. So, if you are going to disable the instance through srvctl, you may also want to consider 
disabling the redo thread for that instance.

srvctl disable instance -d orcl -i orcl2

SQL> alter database disable public thread 2;

Do the reverse to enable the instance.

SQL> alter database enable public thread 2;

srvctl enable instance -d orcl -i orcl2 
   Modified: 31-MAR-05    Ref #: ID-6672 


--------------------------------------------------------------------------------

What is the Cluster Verification Utiltiy (cluvfy)?
The Cluster Verification Utility (CVU) is a validation tool that you can use to check all the important components that need to be verified at different stages of deployment in a RAC environment. The wide domain of deployment of CVU ranges from initial hardware setup through fully operational cluster for RAC deployment and covers all the intermediate stages of installation and configuration of various components. Cluvfy does not take any corrective action following the failure of a verification task, does not enter into areas of performance tuning or monitoring, does not perform any cluster or RAC operation, and does not attempt to verify the internals of cluster database or cluster elements. 
   Modified: 16-JUN-05    Ref #: ID-6850 


--------------------------------------------------------------------------------

What versions of the database can I use the cluster verification utility (cluvfy) with?
The cluster verification utility is release with Oracle Database 10g Release 2 but can also be used with Oracle Database 10g Release 1. 
   Modified: 16-JUN-05    Ref #: ID-6852 


--------------------------------------------------------------------------------

How many nodes can be had in an HP/Sun/IBM/Compaq/NT/Linux cluster?
The number of nodes supported is not limited by Oracle, but more generally by the clustering software/hardware 
in question.

When using Solely Oracle Clusterware: 63 nodes (9i or 10gR1)


When using a third party clusterware: 

Sun: 8 

HP UX: 16 

HP Tru64: 8 

IBM AIX: 

 * 8 nodes for Physical Shared (CLVM) SSA disk

 * 16 nodes for Physical Shared (CLVM) non-SSA disk 

 * 128 nodes for Virtual Shared Disk (VSD) 

 * 128 nodes for GPFS 

 * Subject to storage subsystem limitations 

Veritas: 8-16 nodes (check w/ Veritas)


   Modified: 21-OCT-04    Ref #: ID-4047 


--------------------------------------------------------------------------------

Where I can find information about how to setup / install RAC on different platforms ?
There is a roadmap for implementing Real Application Clusters' available at: 

A Smooth Transition to Real Application Clusters

There are also Step-by-Step notes available for each platform available on the Metalink 'Top Tech Docs' for RAC:

High Availability - Real Application Clusters Library Page Index

Additional information can be found on OTN:

http://technet.oracle.com/products/oracle9i/content.html --> 'Oracle Real Application Clusters'

   Modified: 08-AUG-02    Ref #: ID-4067 


--------------------------------------------------------------------------------

Is it possible to run RAC on logical partitions (i.e. LPARs) or virtual separate servers.
Yes, it is possible. The E10K and other high end servers can be partitioned into domains of smaller sizes, each domain with its own CPU(s) and operating system. Each domain is effectively a virtual server. RAC can be run on cluster comprises of domains. The benefits of using this is similar to a regular cluster, any domain failure will have little effect on other domains. Besides, the management of the cluster may be easier since there is only one physical server. Note however, since one E10K is still just one server. There are single points of failures. Any failures, such as back plane failure, that crumble the entire server will shutdown the virtual cluster. That is the tradeoff users have to make in how best to build a cluster database. 
   Modified: 18-MAY-04    Ref #: ID-4075 


--------------------------------------------------------------------------------

How do I check RAC certification?
See the following Metalink note:

Note 184875.1
How To Check The Certification Matrix for Real Application Clusters

Please note that certifications for Real Application Clusters are performed against the Operating System and Clusterware versions. The corresponding system hardware is offered by System vendors and specialized Technology vendors.  Some system vendors offer pre-installed, pre-configured RAC clusters. These are included below under the corresponding OS platform selection within the certification matrix.


   Modified: 26-NOV-02    Ref #: ID-4095 


--------------------------------------------------------------------------------

Can the Oracle Database Configuration Assistant (DBCA) be used to create a database with Veritas DBE / AC 3.5?
DBCA can be used to create databases on raw devices in 9i RAC Release 1 and 9i Release 2. Standard database creation scripts using SQL commands will work with file system and raw.

DBCA cannot be used to create databases on file systems on Oracle 9i Release 1. The user can choose to set up a database on raw devices, and have DBCA output a script. The script can then be modified to use cluster file systems instead.

With Oracle 9i RAC Release 2 (Oracle 9.2), DBCA can be used to create databases on a cluster filesystem. If the ORACLE_HOME is stored on the cluster filesystem, the tool will work directly. If ORACLE_HOME is on local drives on each system, and the customer wishes to place database files onto a cluster file system, they must invoke DBCA as follows: dbca -datafileDestination /oradata where /oradata is on the CFS filesystem. See 9iR2 README and bug 2300874 for more info.

   Modified: 10-JAN-03    Ref #: ID-4124 


--------------------------------------------------------------------------------

Is crossover cable supported as an interconnect with 9iRAC/10gRAC on any platform ?
 
 
 NO. CROSS OVER CABLES ARE NOT SUPPORTED. 
The requirement is to use a switch: 

Detailed Reasons:
 1) cross-cabling limits the expansion of RAC to two nodes 
 2) cross-cabling is unstable:
   a) Some NIC cards do not work properly with it.
   b) Instability.  We have seen different problems e.g.. ORA-29740 at configurations using crossover cable, and other errors.

Due to the benefits and stability provided by a switch, and their afforability, this is the only supported configuration.

Please see certify.us.oracle.com as well.

(content consolidated from that of Massimo Castelli, Roland Knapp and others)
 

   Modified: 21-FEB-05    Ref #: ID-4150 


--------------------------------------------------------------------------------

Is Veritas Storage Foundation 4.0 supported with RAC?
Veritas Storage Foundation 4.0 is certified on AIX, Solaris and HPUX for 9i RAC and Oracle Database 10g RAC. Veritas is production also on Linux, but it is not certified by Oracle. If customers choose Veritas on Linux, Oracle will support the Oracle products in the stack, but they do not qualify for Unbreakable Linux support. 
   Modified: 05-OCT-04    Ref #: ID-5888 


--------------------------------------------------------------------------------

Is 3rd Party Clusterware supported on Linux such as Veritas or Redhat?
No, Oracle RAC 10g does not support 3rd Party clusterware on Linux. This means that if a cluster file system requires a 3rd party clusterware, the cluster file system is not supported. 
   Modified: 11-MAY-05    Ref #: ID-6743 


--------------------------------------------------------------------------------

Can you have multiple RAC $ORACLE_HOME's on Linux?
No, there should be only one Oracle Cluster Manager (ORACM) running on each node. All RAC databases should run out of the $ORACLE_HOME that ORACM is installed in. 
   Modified: 19-JUL-05    Ref #: ID-6931 


--------------------------------------------------------------------------------

After installing patchset 9013 and patch_2313680 on Linux, the startup was very slow
 
Please carefully read the following new information about configuring Oracle Cluster Management on Linux, provided as part of the patch README:

Three parameters affect the startup time:

soft_margin (defined at watchdog module load)

-m (watchdogd startup option)

WatchdogMarginWait (defined in nmcfg.ora).

WatchdogMarginWait is calculated using the formula:

WatchdogMarginWait = soft_margin(msec) + -m + 5000(msec).

[5000(msec) is hardcoded]

Note that the soft_margin is measured in seconds, -m and WatchMarginWait are measured in milliseconds.

Based on benchmarking, it is recommended to set soft_margin between 10 and 20 seconds. Use the same value for -m (converted to milliseconds) as used for soft_margin. Here is an example:

soft_margin=10 -m=10000 WatchdogMarginWait = 10000+10000+5000=25000

If CPU utilization in your system is high and you experience unexpected node reboots, check the wdd.log file. If there are any 'ping came too late' messages, increase the value of the above parameters.

   Modified: 20-DEC-04    Ref #: ID-4069 


--------------------------------------------------------------------------------

Is CFS Available for Linux?
 
Yes, OCFS (Oracle Cluster Filesystem) is now available for Linux. The following Metalink note has information for obtaining the latest version of OCFS:

Note 238278.1 - How to find the current OCFS version for Linux

   Modified: 20-DEC-04    Ref #: ID-4089 


--------------------------------------------------------------------------------

Where can I find more information about hangcheck-timer module on Linux ? And how do we configure hangcheck-timer module ?
In releases 9.2.0.2.0 and later, Oracle recommends using a new I/O fencing model -- HangCheck-Timer module. Hangcheck-Timer
module monitors the Linux kernel for long operating system hangs that could affect the reliability of a RAC node. You can configure hangcheck-timer module using 3 parameters -- hangcheck_tick, hangcheck_margin and MissCount. 

For more details, please review Note :: 259487.1
   Modified: 20-DEC-04    Ref #: ID-4179 


--------------------------------------------------------------------------------

Can RAC 10g and 9i RAC be installed and run on the same physical Linux cluster?
Yes - CRS / CSS and oracm can coexist. 
   Modified: 20-DEC-04    Ref #: ID-4408 


--------------------------------------------------------------------------------

Is the hangcheck timer still needed with Oracle Database 10g RAC?
YES!  The hangcheck-timer module monitors the Linux kernel for extended operating system hangs that could affect the reliability
of the RAC node ( I/O fencing) and cause database corruption.  To verify the hangcheck-timer module is running on every node:

as root user:
/sbin/lsmod | grep hangcheck

If the hangcheck-timer module is not listed enter the following command as the root user:

/sbin/insmod hangcheck-timer hangcheck_tick=30 hangcheck_margin=180

To ensure the module is loaded every time the system reboots, verify that the local system startup file (/etc/rc.d/rc.local) contains the command above.

For additional information please review the  Oracle RAC Install and Configuration Guide (5-41).


   Modified: 20-DEC-04    Ref #: ID-6208 


--------------------------------------------------------------------------------

How to configure bonding on Suse SLES8.
Please see note:291958.1 
   Modified: 29-NOV-04    Ref #: ID-6288 


--------------------------------------------------------------------------------

How to configure bonding on Suse SLES9.
Please see note:291962.1
   Modified: 29-NOV-04    Ref #: ID-6290 


--------------------------------------------------------------------------------

Does RAC run faster with Sun-cluster or Veritas cluster-ware? (these being alternatives with Sun hardware) Is there some clusterware that would make RAC run faster?
RAC scalability and performance are independent of the clusterware. However, we recommend that the customer uses a very
fast memory based interconnect if one wants to optimize the performance. For Example, Sun can use FireLink, a very fast proprietary interconnect which is more optimal for RAC, while Veritas is limited to using Gigabit Ethernet. 

Starting with 10g there will be an alternative to SunCluster and Veritas Cluster than this is Oracle CRS/CSS.

   Modified: 20-DEC-04    Ref #: ID-4088 


--------------------------------------------------------------------------------

Is HMP supported with 10g on all HP platforms ?
 
- 10g RAC + HMP + PA-RISC = yes

- 10g RAC + HMP + Itanium, "Oracle has no plans and will likely never
support RAC over HMP on IPF."

- 10g RAC + UDP + Itanium = yes (even over Hyperfabric)


"Oracle recommends that HMP not be used. UDP is the recommended interconnect protocol across all platforms."


   Modified: 20-DEC-04    Ref #: ID-5488 


--------------------------------------------------------------------------------

Does the Oracle Cluster File System (OCFS) support network access through NFS or Windows Network Shares?
No, in the current release the Oracle Cluster File System (OCFS) is not supported for use by network access approaches like NFS or Windows Network Shares. 
   Modified: 27-JAN-05    Ref #: ID-4122 


--------------------------------------------------------------------------------

My customer wants to understand what type of disk caching they can use with their Windows RAC Cluster, the install guide tells them to disable disk caching?
If the write cache identified is local to the node then that is bad for RAC. If the cache is visible to all nodes as a 'single cache', typically in the storage array, and is also 'battery backed' then that is OK. 
   Modified: 31-MAR-05    Ref #: ID-6670 


--------------------------------------------------------------------------------

Can I run my 9i RAC and RAC 10g on the same Windows cluster?
Yes but the 9i RAC database must have the 9i Cluster Manager and you must run Oracle Clusterware for the Oracle Database 10g. 9i Cluster Manager can coexsist with Oracle Clusterware 10g. 
   Modified: 01-JUL-05    Ref #: ID-6889 


--------------------------------------------------------------------------------

Do I need HACMP/GPFS to store my OCR/Voting file on a shared device.

The prerequisites doc for AIX clearly says: 

"If you are not using HACMP, you must use a GPFS file system to store the Oracle CRS files" ==> 
this is a documentation bug and this will be fixed with 10.1.0.3 

----- 

On AIX it is important to put the reserve_lock=no/reserve_policy =no_reserve 

in order to allow AIX to access the devices from more than one node simultaneously. 

Use the /dev/rhdisk devices (character special) for the crs and voting disk and change the attribute with the command 
  

chdev -l hdiskn -a reserve_lock=no 

(for ESS, EMC, HDS, CLARiiON, and MPIO-capable devices you have to do an chdev -l hdiskn -a reserve_policy=no_reserve) 


   Modified: 20-DEC-04    Ref #: ID-5288 


--------------------------------------------------------------------------------

Can I run Oracle RAC 10g on my IBM Mainframe Sysplex environment (z/OS)?
YES! There is no separate documentation for RAC on z/OS. What you would call "clusterware" is built in to the OS 
and the native file systems are global. IBM z/OS documentation explains how to set up a Sysplex Cluster; 
once the customer has done that it is trivial to set up a RAC database. The few steps involved are covered 
in in Chapter 14 of the Oracle for z/OS System Admin Guide, which you can read here. There is also an Install Guide 
for Oracle on z/OS ( here) but I don't think there are any RAC-specific steps in the installation. By the way, 
RAC on z/OS does not use Oracle's clusterware (CSS/CRS/OCR). 
   Modified: 07-JUL-05    Ref #: ID-6910 


--------------------------------------------------------------------------------

What are the cdmp directories in the background_dump_dest used for?
These directories are produced by the diagnosibility daemon process (DIAG). DIAG is a process related to RAC 
which as one of its tasks, performs cash dumping. The DIAG process dumps out tracing to file when it discovers 
the death of an essential process (foreground or background) in the local instance. A dump directory named something 
like cdmp_ is created in the bdump or background_dump_dest directory, and all the trace dump files DIAG creates are 
placed in this directory. 
   Modified: 11-AUG-03    Ref #: ID-4152 


--------------------------------------------------------------------------------

Is the Oracle E-Business Suite (Oracle Applications) certified against RAC?
Yes. (There is no seperate certification required for RAC.) "" 
   Modified: 04-JUN-03    Ref #: ID-4029 


--------------------------------------------------------------------------------

What is the optimal migration path to be used while migrating the E-Business suite to RAC?
Following is the recommended and most optimal path to migrate you E-Business suite to RAC environment:

1. Migrate the existing application to new hardware. (If applicable). 

2. Use Clustered File System for all data base files or migrate all database files to raw devices. (Use dd for Unix or ocopy for NT) 

3. Install/upgrade to the latest available e-Business suite. 

4. Upgrade database to Oracle9i (Refer document 216550.1 on Metalink) 

5. In step 4, install RAC option while installing Oracle9i and use Installer to perform install for all the nodes. 

6. Clone Oracle Application code tree.

Reference Documents:
Oracle E-Business Suite Release 11i with 9i RAC: Installation and Configuration : Metalink Note# 279956.1
E-Business Suite 11i on RAC : Configuring Database Load balancing & Failover: Metalink Note# 294652.1
Oracle E-Business Suite 11i and Database - FAQ : Metalink# 285267.1

   Modified: 08-JUL-05    Ref #: ID-4107 


--------------------------------------------------------------------------------

How to configure concurrent manager in a RAC environment?
Large clients commonly put the concurrent manager on a separate server now (in the middle tier) to reduce the load on the database server. The concurrent manager programs can be tied to a specific middle tier (e.g., you can have CMs running on more than one middle tier box). It is advisable to use specilize CM. CM middle tiers are set up to point to the appropriate database instance based on product module being used.

   Modified: 20-SEP-02    Ref #: ID-4108 


--------------------------------------------------------------------------------

Should functional partitioning be used with Oracle Applications?
We do not recommend functional partitioning unless throughput on your server architecture demands it. Cache fusion has been optimized to scale well with non-partitioned workload.

If your processing requirements are extreme and your testing proves you must partition your workload in order to reduce internode communications, you can use Profile Options to designate that sessions for certain applications Responsibilities are created on a specific middle tier server. That middle tier server would then be configured to connect to a specific database instance.

To determine the correct partitioning for your installation you would need to consider several factors like number of concurrent users, batch users, modules used, workload characteristics etc.

   Modified: 20-SEP-02    Ref #: ID-4109 


--------------------------------------------------------------------------------

Which e-Business version is prefereable?
Versions 11.5.5 onwards are certified with Oracle9i and hence with Oracle9i RAC. However we recommend the latest available version.

   Modified: 20-SEP-02    Ref #: ID-4110 


--------------------------------------------------------------------------------

Can I use Automatic Undo Management with Oracle Applications?
Yes. In a RAC environment we highly recommend it.

   Modified: 20-SEP-02    Ref #: ID-4111 


--------------------------------------------------------------------------------

Can I use TAF with e-Business in a RAC environment?
TAF itself does not work with e-Business suite due to Forms/TAF limitations, but you can configure the tns failover clause. On instance failure, when the user logs back into the system, their session will be directed to a surviving instance, and the user will be taken to the navigator tab. Their committed work will be available; any uncommitted work must be re-started.

We also recommend you configure the forms error URL to identify a fallback middle tier server for Forms processes, if no router is available to accomplish switching across servers.

   Modified: 02-APR-03    Ref #: ID-4112 


--------------------------------------------------------------------------------

Can I use OCFS with SE RAC?
It is not supported to use OCFS with Standard Edition RAC. All database files must use ASM (redo logs, recovery area, 
datafiles, control files etc). We recommend that the binaries and trace files (non-ASM supported files) to be 
replicated on all nodes. This is done automatically by install. 
   Modified: 01-SEP-04    Ref #: ID-5748 


--------------------------------------------------------------------------------

What are the maximum number of nodes under OCFS on Linux ?
Oracle 9iRAC on Linux, using OCFS for datafiles, can scale to a maximum of 32 nodes. 
   Modified: 06-NOV-03    Ref #: ID-4118 


--------------------------------------------------------------------------------

Where can I find documentation on OCFS ?
For Main Page >>> http://oss.oracle.com/projects/ocfs/ For User Manual >>> http://oss.oracle.com/projects/ocfs/documentation/ For OCFS Files >>> http://oss.oracle.com/projects/ocfs/files/supported/ 
   Modified: 06-NOV-03    Ref #: ID-4119 


--------------------------------------------------------------------------------

What files can I put on Linux OCFS?
For optimal performance, you should only put the following files on Linux OCFS:

- Datafiles
- Control Files
- Redo Logs
- Archive Logs
- Shared Configuration File (OCR)
- Quorum / Voting File
- SPFILE

   Modified: 14-AUG-03    Ref #: ID-4156 


--------------------------------------------------------------------------------

Is Sun QFS supported with RAC? What about Sun GFS?
 
 
Sun QFS is supported with Oracle 9i RAC.
Sun is planning to certify QFS with Oracle Database 10g and RAC but as of November 15,2004, this certification is "planned".

For 9i, Software Stack details:

For SVM you need Solaris 9 9/04 (Solaris 9 update 7),SVM Patch 116669-03(this is required SUN patch), Sun Cluster 3.1 Update 3, Oracle 9.2.0.5 + Oracle patch 3366258

For SharedQFS you need Solaris 9 04/03 and above or Solaris 8 02/02 and above, QFS 4.2, Sun Cluster 3.1 Update 2 or above, Oracle 9.2.0.5 + Oracle patch 3566420
Differently, Sun GFS (Global File System) is only supported for Oracle binary and archive logs only, but NOT for database files. 
   Modified: 19-JAN-05    Ref #: ID-6128 


--------------------------------------------------------------------------------

Is Red Hat GFS(Global File System) is certified by Oracle for use with Real Application Clusters?
Sistina Cluster Filesystem is not part of the standard RedHat kernel and therefore is not certified under 
the unbreakable Linux but falls under a kernel extension. This however, does not mean that Oracle RAC is 
not certified with it. As a fact, Oracle RAC does not certify against a filesystem per se, but certifies 
against an operating system. If, as is the case with Sistina filesystem, the filesystem is certified with 
the operating system, this only means that the combination does not fall under the unbreakable Linux combination 
and Oracle does not provide direct support and fix the filesystem in case of an error. Customer will have to contact 
the filesystem provider for support. 
   Modified: 22-NOV-04    Ref #: ID-6228 


--------------------------------------------------------------------------------

How to move the OCR location ?
- stop the CRS stack on all nodes using "init.crs stop" - Edit /var/opt/oracle/ocr.loc on all nodes and set up ocrconfig_loc=new OCR device - Restore from one of the automatic physical backups using ocrconfig -restore. - Run ocrcheck to verify. - reboot to restart the CRS stack. - additional information can be found at http://st-doc.us.oracle.com/10/101/rac.101/b10765/storage.htm#i1016535 
   Modified: 24-MAR-04    Ref #: ID-4728 


--------------------------------------------------------------------------------

Is it supported to rerun root.sh from the Oracle Clusterware installation ?
Rerunning root.sh after the initial install is expressly discouraged and unsupported. We strongly recommend not doing it. 
   Modified: 05-MAY-05    Ref #: ID-4730 


--------------------------------------------------------------------------------

Is it supported to allow 3rd Party Clusterware to manage Oracle resources (instances, listeners, etc) and turn off 
Oracle Clusterware management of these?
In 10g we do not support using 3rd Party Clusterware for failover and restart of Oracle resources. Oracle Clusterware 
resources should not be disabled. 
   Modified: 05-MAY-05    Ref #: ID-6528 


--------------------------------------------------------------------------------

What is the High Availability API?
An application-programming interface to allow processes to be put under the High Availability infrastructure that is part of the Oracle Clusterware distributed with Oracle Database 10g. A user written script defines how Oracle Clusterware should start, stop and relocate the process when the cluster node status changes. This extends the high availability services of the cluster to any application running in the cluster. Oracle Database 10g Real Application Clusters (RAC) databases and associated Oracle processes (E.G. listener) are automatically managed by the clusterware. 
   Modified: 05-MAY-05    Ref #: ID-6741 


--------------------------------------------------------------------------------

Is it possible to use ASM for the OCR and voting disk?
No, the OCR and voting disk must be on raw or CFS (cluster filesystem). 
   Modified: 19-JUL-05    Ref #: ID-6929 


--------------------------------------------------------------------------------

During CRS installation, I am asked to define a private node name, and then on the next screen asked to define which interfaces should be used as private and public interfaces. What information is required to answer these questions?
The private names on the first screen determine which private interconnect will be used by CSS.
Provide exactly one name that maps to a private IP address, or just the IP address itself. If a logical name is used, then the IP address this maps to can be changed subsequently, but if you IP address is specified CSS will always use that IP address. CSS cannot use multiple private interconnects for its communication hence only one name or IP address can be specified.

The private interconnect enforcement page determines which private interconnect will be used by the RAC instances.
It's equivalent to setting the CLUSTER_INTERCONNECTS init.ora parameter, but is more convenient because it is a cluster-wide setting that does not have to be adjusted every time you add nodes or instances. RAC will use all of the interconnects listed as private in this screen, and they all have to be up, just as their IP addresses have to be when specified in the init.ora paramter. RAC does not fail over between cluster interconnects; if one is down then the instances using them won't start.

   Modified: 24-MAR-04    Ref #: ID-4724 


--------------------------------------------------------------------------------

Can I change the name of my cluster after I have created it when I am using Oracle Database 10g Clusterware?
No, you must properly deinstall CRS and then re-install. To properly de-install CRS, you MUST follow the directions in the Installation Guide Chapter 10. This will ensure the ocr gets cleaned out. 
   Modified: 05-OCT-04    Ref #: ID-5890 


--------------------------------------------------------------------------------

Can I change the public hostname in my Oracle Database 10g Cluster using Oracle Clusterware?
Hostname changes are not supported in CRS, unless you want to perform a deletenode followed by a new addnode operation. 
   Modified: 05-OCT-04    Ref #: ID-5892 


--------------------------------------------------------------------------------

What should the permissions be set to for the voting disk and ocr when doing a RAC Install?
The Oracle Real Application Clusters install guide is correct. It describes the PRE INSTALL ownership/permission requirements for ocr and voting disk. This step is needed to make sure that the CRS install succeeds. Please don't use those values to determine what the ownership/permmission should be POST INSTALL. The root script will change the ownership/permission of ocr and voting disk as part of install. The POST INSTALL permissions will end up being : OCR - root:oinstall - 640 Voting Disk - oracle:oinstall - 644 
   Modified: 22-OCT-04    Ref #: ID-5988 


--------------------------------------------------------------------------------

Which processes access to OCR ?
Oracle Cluster Registry (OCR) is used to store the cluster configuration information among other things. OCR needs to be accessible from all nodes in the cluster. If OCR became inaccessible the CSS daemon would soon fail, and take down the node. PMON never needs to write to OCR. To confirm if OCR is accessible, try ocrcheck from your ORACLE_HOME and ORA_CRS_HOME. 
   Modified: 22-OCT-04    Ref #: ID-5990 


--------------------------------------------------------------------------------

How do I restore OCR from a backup? On Windows, can I use ocopy?
The only recommended way to restore an OCR from a backup is "ocrconfig -restore ". The ocopy command will not be able to perform the restore action for OCR. 
   Modified: 27-OCT-04    Ref #: ID-6008 


--------------------------------------------------------------------------------

Does the hostname have to match the public name or can it be anything else?
When there is no vendor clusterware, only CRS, then the public node name must match the host name. When vendor clusterware is present, it determines the public node names, and the installer doesn't present an opportunity to change them. So, when you have a choice, always choose the hostname. 
   Modified: 05-NOV-04    Ref #: ID-6050 


--------------------------------------------------------------------------------

Is it a requirement to have the public interface linked to ETH0 or does it only need to be on a ETH lower than the private interface?: - public on ETH1 - private on ETH2
There is no requirement for interface name ordering. You could have - public on ETH2 - private on ETH0 Just make sure you choose the correct public interface in VIPCA, and in the installer's interconnect classification screen. 
   Modified: 05-NOV-04    Ref #: ID-6052 


--------------------------------------------------------------------------------

How to Restore a Lost Voting Disk used by Oracle Clusterware 10g
Please read Note:279793.1 and for OCR Note:268937.1
   Modified: 02-DEC-04    Ref #: ID-6308 


--------------------------------------------------------------------------------

With Oracle Clusterware 10g, how do you backup the OCR?
There is an automatic backup mechanism for OCR.  The default location is : $ORA_CRS_HOME\cdata\"clustername"\

To display backups : ocrconfig -showbackup
To restore a backup :  ocrconfig -restore 

The automatic backup mechanism keeps upto about a week old copy. So, if you want to retain a backup copy more than that, then you should copy that "backup" file to some other name.

Unfortunately there are a couple of bugs regarding backup file manipulation, and changing default backup dir on Windows. These will be fixed in 10.1.0.4.  OCR backup on Windows are absent. Only file in the backup directory is 
temp.ocr which would be the last backup.  You can restore this most recent backup by using the command ocr -restore temp.ocr

If you want to take a logical copy of OCR at any time use : ocrconfig -export 
, and use -import option to restore the contents back.

   Modified: 02-DEC-04    Ref #: ID-6328 


--------------------------------------------------------------------------------

How do I protect the OCR and Voting in case of media failure?
In Oracle Database 10g Release 1 the OCR and Voting device are not mirrored within Oracle,hence both must be mirrored via a storage vendor method, like RAID 1.
Starting with Oracle Database 10g Release 2 Oracle Clusterware will multiplex the OCR and Voting Disk (two for the OCR and three for the Voting).
Please read Note:279793.1 and Note:268937.1 regarding backup and restore a lost Voting/OCR and FAQ 6238 regarding OCR backup. 
   Modified: 05-MAY-05    Ref #: ID-6612 


--------------------------------------------------------------------------------

How do I use multiple network interfaces to provide High Availability for my interconnect with Oracle Clusterware?
This needs to be done externally to Oracle Clusterware usually by some OS provided nic bonding which gives Oracle Clusterware a single ip address for the interconnect but provide failover across multiple nic cards. There are several articles in Metalink on how to do this. For example for Sun Solaris search for IPMP. On Linux, read the doc on rac.us
Configure Redundant Network Cards / Switches for Oracle Database 10g Release 1 Real Application Cluster on Linux 
   Modified: 06-APR-05    Ref #: ID-6680 


--------------------------------------------------------------------------------

How do I put my application under the control of Oracle Clusterware to achieve higher availability?
First write a control agent. It must accept 3 different parameters: start-The control agent should start the application, check-The control agent should check the application, stop-The Control agent should start the application. Secondly you must create a profile for your application using crs_profile. Thirdly you must register your application as a resource with Oracle Clusterware (crs_register). See the RAC Admin and Deployment Guide for details. 
   Modified: 16-JUN-05    Ref #: ID-6846 


--------------------------------------------------------------------------------

Can I use Oracle Clusterware to provide cold failover of my 9i or 10g single instance Oracle Databases?
Oracle does not provide the necessary wrappers to fail over single-instance databases using Oracle Clusterware 10g Release 2. But since it's possible for customers to use Oracle Clusterware to wrap arbitrary applications, it'd be possible for them to wrap single-instance databases this way. 
   Modified: 01-JUL-05    Ref #: ID-6891 


--------------------------------------------------------------------------------

Does Oracle Clusterware support application vips?
Yes, with Oracle Database 10g Release 2, Oracle Clusterware now supports an "application" vip. This is to support putting applications under the control of Oracle Clusterware using the new high availability API and allow the user to use the same URL or connection string regardless of which node in the cluster the application is running on. The application vip is a new resource defined to Oracle Clusterware and is a functional vip. It is defined as a dependent resource to the application. There can be many vips defined, typically one per user application under the control of Oracle Clusterware. You must first create a profile (crs_profile), then register it with Oracle Clusterware (crs_register). The usrvip script must run as root. 
   Modified: 11-JUL-05    Ref #: ID-6893 


--------------------------------------------------------------------------------

Why is the home for Oracle Clusterware not recommended to be subdirectory of the Oracle base directory?
If anyone other than root has write permissions to the parent directories of the CRS home, then they can give themselves root escalations. This is a security issue. The CRS home itself is a mix of root and non-root permissions, as appropriate to the security requirements. Please follow the install docs about who is your primary group and what other groups you need to create and be a member of. 
   Modified: 11-JUL-05    Ref #: ID-6915 


--------------------------------------------------------------------------------
 
.  

--------------------------------------------------------------------------------
 
 Copyright �  


9.6 JRE:
========

JRE:
----

Oracle 9.2 uses JRE 1.3.1

- Java Compiler (javac):  Compiles programs written in the Java programming language into bytecodes.

- Java Interpreter (java):  Executes Java bytecodes.  In other words, it runs 
  programs written in the Java programming language.

- Jave Runtime Interpreter (jre):  Similar to the Java Interpreter (java), but intended for
  end users who do not require all the development-related options available with the java tool.

The PATH statement enables Windows to find the executables (javac, java, javadoc, etc.) 
from any current directory.

The CLASSPATH tells the Java virtual machine and other applications (which are located in the 
"jdk_<version>\bin" directory) where to find the class libraries, such as classes.zip file 
(which is in the lib directory). 

Note 1:
-------

Suppose on a Solaris 5.9 machine with Oracle 9.2, we search for jre:

# find . -name "jre*" -print

./opt/app/oracle/product/9.2/inventory/filemap/jdk/jre
./opt/app/oracle/product/9.2/jdk/jre
./opt/app/oracle/jre
./opt/app/oracle/jre/1.1.8/bin/sparc/native_threads/jre
./opt/app/oracle/jre/1.1.8/bin/jre
./opt/app/oracle/jre/1.1.8/jre_config.txt
./usr/j2se/jre
./usr/iplanet/console5.1/bin/base/jre
./usr/java1.2/jre

Suppose on a AIX 5.2 machine with Oracle 9.2, we search for jre:

./apps/oracle/product/9.2/inventory/filemap/jdk/jre
./apps/oracle/product/9.2/inventory/filemap/jre
./apps/oracle/product/9.2/jdk/jre
./apps/oracle/product/9.2/jre
./apps/oracle/oraInventory/filemap/apps/oracle/jre
./apps/oracle/oraInventory/filemap/apps/oracle/jre/1.3.1/jre
./apps/oracle/jre
./apps/oracle/jre/1.1.8/bin/jre
./apps/oracle/jre/1.1.8/bin/aix/native_threads/jre
./apps/oracle/jre/1.3.1/jre
./apps/ora10g/product/10.2/jdk/jre
./apps/ora10g/product/10.2/jre
./usr/java131/jre
./usr/idebug/jre


Note 2:
-------

jre - The Java Runtime Interpreter (Solaris)
jre interprets (executes) Java bytecodes. 
SYNOPSIS
jre [ options ] classname <args>

DESCRIPTION
The jre command executes Java class files. The classname argument is the name of the class to be executed. 
Any arguments to be passed to the class must be placed after the classname on the command line. 
Class paths for the Solaris version of the jre tool can be specified using the CLASSPATH environment variable 
or by using the -classpath or -cp options. The Windows version of the jre tool ignores the CLASSPATH 
environment variable. For both Solaris and Windows, the -cp option is recommend for specifying class paths 
when using jre. 


OPTIONS
-classpath   path(s) 
Specifies the path or paths that jre uses to look up classes. Overrides the default or the CLASSPATH environment 
variable if it is set. If more than one path is specified, they must be separated by colons. 
Each path should end with the directory containing the class file(s) to be executed. 
However, if a file to be executed is a zip or jar file, the path to that file must end with the file's name. 
Here is an example of an argument for -classpath that specifies three paths consisting of the current directory 
and two additional paths: 
   .:/home/xyz/classes:/usr/local/java/classes/MyClasses.jar


-cp   path(s) 
Prepends the specified path or paths to the base classpath or path given by the CLASSPATH environment variable. 
If more than one path is specified, they must be separated by colons. Each path should end with the directory 
containing the class file(s) to be executed. However, if a file to be executed is a zip or jar file, 
the path to that file must end with the file's name. Here is an example of an argument for -cp that specifies 
three paths consisting of the current directory and two additional paths: 
   .:/home/xyz/classes:/usr/local/java/classes/MyClasses.jar

-help 
Print a usage message. 

-mx   x 
Sets the maximum size of the memory allocation pool (the garbage collected heap) to x. 
The default is 16 megabytes of memory. x must be greater than or equal to 1000 bytes. 
By default, x is measured in bytes. You can specify x in either kilobytes or megabytes by appending the letter 
"k" for kilobytes or the letter "m" for megabytes. 

-ms   x 
Sets the startup size of the memory allocation pool (the garbage collected heap) to x. The default is 1 megabyte 
of memory. x must be > 1000 bytes. 
By default, x is measured in bytes. You can specify x in either kilobytes or megabytes by appending the letter 
"k" for kilobytes or the letter "m" for megabytes. 

-noasyncgc 
Turns off asynchronous garbage collection. When activated no garbage collection takes place unless 
it is explicitly called or the program runs out of memory. Normally garbage collection runs as an 
asynchronous thread in parallel with other threads. 

-noclassgc 
Turns off garbage collection of Java classes. By default, the Java interpreter reclaims space for unused 
Java classes during garbage collection. 

-nojit 
Specifies that any JIT compiler should be ignored and instead invokes the default Java interpreter. 

-ss   x 
Each Java thread has two stacks: one for Java code and one for C code. The -ss option sets the maximum stack size 
that can be used by C code in a thread to x. Every thread that is spawned during the execution of the program 
passed to jre has x as its C stack size. The default units for x are bytes. The value of x must be greater than 
or equal to 1000 bytes. 
You can modify the meaning of x by appending either the letter "k" for kilobytes or the letter "m" for megabytes. 
The default stack size is 128 kilobytes ("-ss 128k"). 

-oss   x 
Each Java thread has two stacks: one for Java code and one for C code. The -oss option sets the maximum stack size 
that can be used by Java code in a thread to x. Every thread that is spawned during the execution of the program 
passed to jre has x as its Java stack size. The default units for x are bytes. The value of x must be greater 
than or equal to 1000 bytes. 
You can modify the meaning of x by appending either the letter "k" for kilobytes or the letter "m" for megabytes. 
The default stack size is 400 kilobytes ("-oss 400k"). 

-v,   -verbose 
Causes jre to print a message to stdout each time a class file is loaded. 

-verify 
Performs byte-code verification on the class file. Beware, however, that java -verify does not perform 
a full verification in all situations. Any code path that is not actually executed by the interpreter 
is not verified. Therefore, java -verify cannot be relied upon to certify class files unless all code paths 
in the class file are actually run. 

-verifyremote 
Runs the verifier on all code that is loaded into the system via a classloader. verifyremote is the default 
for the interpreter. 

-noverify 
Turns verification off. 

-verbosegc 
Causes the garbage collector to print out messages whenever it frees memory. 

-DpropertyName=newValue 
Defines a property value. propertyName is the name of the property whose value you want to change and newValue 
is the value to change it to. For example, this command line 
% jre -Dawt.button.color=green ...

sets the value of the property awt.button.color to "green". jre accepts any number of -D options on the command line. 

ENVIRONMENT VARIABLES
CLASSPATH 
You can use the CLASSPATH environment variable to specify the path to the class file or files that you want to execute. 
CLASSPATH consists of a colon-separated list of directories that contain the class files to be executed. For example: 
   .:/home/xyz/classes

If the file to be executed is a zip file or a jar file, the path should end with the file name. For example: 
   .:/usr/local/java/classes/MyClasses.jar

SEE ALSO
CLASSPATH 


Note 3:
-------

Solaris: Installing IBM JRE, Version 1.3.1
To install JRE 1.3.1 on Solaris, follow these steps:

Log on as root. 
Insert the IBM Tivoli Access Manager for Solaris CD. 
Install the IBM JRE 1.3.1 package: 
pkgadd -d /cdrom/cdrom0/solaris -a /cdrom/cdrom0/solaris/pddefault SUNWj3rt 
where -d /cdrom/cdrom0/solaris specifies the location of the package and -a /cdrom/cdrom0/solaris/pddefault 
specifies the location of the installation administration script.

Set the PATH environmental variable: 
PATH=/usr/j2se/jre/bin:$PATH 
export PATH
After you install IBM JRE 1.3.1, no configuration is necessary.

###################################################################################


=========
30 LOBS:
=========


30.1 General LOB info:
----------------------


Note 1:
=======

A LOB is a Large Object.  LOBs are used to store large, unstructured data, such as video, audio, 
photo images etc.  With a LOB you can store up to 4 Gigabytes of data. 
They are similar to a LONG or LONG RAW but differ from them in quite a few ways.  

LOBs offer more features to the developer than a LONG or LONG RAW.  The main differences between 
the data types also indicate why you would use a LOB instead of a LONG or LONG RAW. These differences 
include the following: - 
�	You can have more than one LOB column in a table, whereas you are restricted to just one LONG 
        or LONG RAW column per table.
�	When you insert into a LOB, the actual value of the LOB is stored in a separate segment 
        (except for in-line LOBs) and only the LOB locator is stored in the row, thus making it more 
        efficient from a storage as well as query perspective.  With LONG or LONG RAW, the entire data 
        is stored in-line with the rest of the table row. 
�	LOBs allow a random access to its data, whereas with a LONG you have to go in for a sequential read 
        of the data from beginning to end.
�	The maximum length of a LOB is 4 Gig as compared to a 2 Gig limit on LONG 
�	Querying a LOB column returns the LOB locator and not the entire value of the LOB.  
        On the other hand, querying LONG returns the entire value contained within the LONG column

You can have two categories of LOBs based on their location with respect to the database.  The categories 
include internal LOBs and external LOBs.  As the names suggest, internal LOBs are stored within the database, 
as table columns. External LOBs are stored outside the database as operating system files.  
Only a reference to the actual OS file is stored in the database.  An internal LOB can also be persistent 
or temporary depending on the life of the internal LOB. 

An internal LOB can be one of three different data types as follows: - 
�	CLOB � A Character LOB.  Used to store character data.
�	BLOB � A Binary LOB.  Used to store binary, raw data
�	NCLOB � A LOB that stores character data that corresponds to the national character set 
                defined for the database.

The only external LOB data type in Oracle 8i is called a BFILE.  
�	BFILE - Short for Binary File.  These hold references to large binary data stored as physical files 
        in the OS outside the database. 


DBA_LOBS displays the BLOBs and CLOBs contained in all tables in the database. BFILEs are stored outside the database, 
so they are not described by this view. This view's columns are the same as those in "ALL_LOBS".

NCLOB and CLOB, are both encoded a internal fixed-width Unicode character set.

CLOB   = Character Large Object 4Gigabytes 
NCLOB  = National Character Large Object   4Gigabytes   
BLOB   = Binary Large Object    4Gigabytes   
BFILE  = pointer to binary file on disk   4Gigabytes 

- A limited number of BFILEs can be open simultaneously per session. The initialization parameter, 
  SESSION_MAX_OPEN_FILES defines an upper limit on the number of simultaneously open files in a session. 

  The default value for this parameter is 10. That is, you can open a maximum of 10 files at the same time 
  per session if the default value is utilized. If you want to alter this limit, the database administrator 
  can change the value of this parameter in the init.ora file. For example: 

  SESSION_MAX_OPEN_FILES=20

  If the number of unclosed files exceeds the SESSION_MAX_OPEN_FILES value then you will not be able 
  to open any more files in the session. To close all open files, use the FILECLOSEALL call. 


- LOB locators
  Regardless of where the value of the internal LOB is stored, a locator is stored in the row. 
  You can think of a LOB locator as a pointer to the actual location of the LOB value. A LOB locator 
  is a locator to an internal LOB while a BFILE locator is a locator to an external LOB. 
  When the term locator is used without an identifying prefix term, it refers to both LOB locators and BFILE locators. 

- Internal LOB Locators
  For internal LOBs, the LOB column stores a locator to the LOB's value which is stored in a database tablespace. 
  Each LOB column/attribute for a given row has its own distinct LOB locator and copy of the LOB value 
  stored in the database tablespace. 

- LOB Locator Operations
  Setting the LOB Column/Attribute to contain a locator
  Before you can start writing data to an internal LOB, the LOB column/attribute must be made non-null, 
  that is, it must contain a locator. Similarly, before you can start accessing the BFILE value, 
  the BFILE column/attribute must be made non-null. 

  For internal LOBs, you can accomplish this by initializing the internal LOB to empty in an 
  INSERT/UPDATE statement using the functions EMPTY_BLOB() for BLOBs or EMPTY_CLOB() for CLOBs and NCLOBs. 

  For external LOBs, you can initialize the BFILE column to point to an external file 
  by using the BFILENAME() function. 


Note 2:
=======

From: Oracle, Kalpana Malligere 29-Aug-01 14:50 
Subject: Re : What is my best LOB choice 


Hello, 

There are several articles/discussions available in the MetaLink Repository which discuss LOBs, including BFILEs. 
They are accessible via the Search option and the following articles should assist you to make you choice: 

66431.1 LOBS - Storage, Redo and Performance Issues 
66046.1 Oracle8i: LOBs 
107441.1 Comparison between LOBs, and LONG & LONG Raw Datatypes 

To find any performance comparison between BFILEs and BLOBs, the best 
suggestion is to try a small scale test. One of the customer wrote that his rule of thumb is that a small number 
of large LOBs => bfile, and a large number of small LOBs => BLOB. 

The BLOB datatype can store up to 4Gb of data. BLOBs can participate fully in transactions. 
Changes made to a BLOB value by the DBMS_LOB package, PL/SQL, or the OCI can be committed or rolled back. 
The BFILE datatype stores unstructured binary data (such as image files) in operating-system files 
outside the database. A BFILE column or attribute stores a file locator that points to an external file 
containing the data. BFILEs can also store up to 4Gb of data. 

Howerver, BFILEs are read-only; you cannot modify them. They support only random (not sequential) reads, 
and they do not participate in transactions. The underlying operating system must maintain the file integrity 
and durability for BFILEs. The database administrator must ensure that the file exists and that Oracle processes 
have operating-system read permissions on the file. 

Your application will have an impact on which is preferable. BFILEs will really help if your application is 
WEB based because you can access them through an annonymous FTP connect into the browser by passing 
the URL to the HTML. You can also do this through a regular BLOB, but this would make you drag the 
entire image through the Oracle server buffer cache everytime it is requested. The separation of the backup 
can be beneficial especially if the the image files are mostly static. This reduces the backup volume of 
the database itself. You also don't need a special program for loading them into the database. 
You just copy the files to the OS and run a DML statement to add them. This way you also avoid the redo 
created by inserting them as an internal BLOB. 

On the other side of the coin, you will have to devise a file naming convention/directory structure to prevent 
overwriting the BFILE's. 
You may want to do only one backup instead of both. With BLOBs, if you backup the database, 
you have everything needed. You won't be able to update a BFILE through the database, you will always have to 
make modifcations through the OS. LOB types can be replicated, but not BFILE. 

The Oracle 8i Application Developer's Guide - Large Objects (LOBs), provides information on the various 
programmatic environments and how to operate on LOB and BFILE data. Questions on these capabilities 
should be posted to the appropriate forum (i.e. Oracle PL/SQL, Oracle Call Interface, Oracle Precompiler, etc.). 

To answer your question, it depends on how you want to use the data. 
A LOB is stored in line by default if it is less than 3,960 bytes, whereas an out-of-line LOB takes about 
20 bytes per row. An inline LOB (i.e. one that is actually stored in the row) is always logged, but an out-of-line 
can be made non-logging. Preference is always to DISABLE STORAGE IN ROW, but if your LOBs are actually very small, 
and the way you use them is sufficiently special then you may want to store them in line. 
But if so, they could probably become simple varchar2(4000). 
Note - the minimum size an out-of-line LOB can use is one Oracle block (plus a bit of extra space in the LOBINDEX). 

Thanks! 
Kalpana 
Oracle Technical Support 


Note 3:
=======

Doc ID </help/usaeng/Search/search.html>: 	Note:66431.1	Content Type: 	TEXT/PLAIN	
Subject: 	LOBS - Storage, Redo and Performance Issues	Creation Date: 	05-NOV-1998	
Type: 	BULLETIN	Last Revision Date: 	25-JUL-2002	
Status: 	PUBLISHED		

Introduction 
~~~~~~~~~~~~ 
  This is a short note on the internal storage of LOBs. The information 
  here is intended to supplement the documentation and other notes 
  which describe how to use LOBS. The focus is on the storage characteristics 
  and configuration issues which can affect performance. 
 
  There are 4 types of LOB: 
	   CLOB, BLOB, NCLOB	stored internally to Oracle 
	   BFILE		stored externally  
 
  The note mainly discusses the first 3 types of LOB which as stored INTERNALLY 
  within the Oracle DBMS. BFILE's are pointers to external files and 
  are only mentioned briefly.   
  Examples of handling LOBs can be found in 
  [NOTE:47740.1] <ml2_documents.showDocument?p_id=47740.1&p_database_id=NOT> 
 
 
Attributes 
~~~~~~~~~~ 
  There are many attributes associated with LOB columns. The aim here 
  is to cover the fundamental points about each of the main attributes. 
  The attributes for each LOB column are specified using the  
  "LOB (lobcolname) STORE AS ..." syntax. 
 
  A table containing LOBs (CLOB, NCLOB and BLOB) creates 2 additional  
  disk segments per LOB column - a LOBINDEX and a LOBSEGMENT. These 
  can be viewed, along with the LOB attributes, using the dictionary views:  
 
	DBA_LOBS, ALL_LOBS or USER_LOBS 
 
  which give the columns: 
 
	OWNER              Table Owner 
	TABLE_NAME         Table name 
	COLUMN_NAME        Column name in the table  
	SEGMENT_NAME       Segment name of the LOBSEGMENT 
	INDEX_NAME         Segment name of the LOBINDEX 
	CHUNK              Chunk size (bytes)  
	PCTVERSION         PctVersion  
	CACHE              Cache option of the LOB Segment	(yes/no) 
	LOGGING            Logging mode of the LOB segment	(yes/no) 
	IN_ROW             Whether storage in row is allowed 	(yes/no) 
 

SELECT
l.table_name as "TABLE",
l.column_name as "COLUMN",
l.segment_name as "SEGMENT",
l.index_name as "INDEX",
l.chunk as "CHUNKSIZE", l.LOGGING, l.IN_ROW, t.tablespace_name
FROM DBA_LOBS l, DBA_TABLES t
WHERE l.table_name=t.table_name AND 
l.owner in ('VPOUSERDB','TRIDION_CM');
 

 Storage Parameters 
 ~~~~~~~~~~~~~~~~~~ 
  By default LOB segments are created in the same tablespace as the 
  base table using the tablespaces default storage details. You can  
  specify the storage attributes of the LOB segments thus: 
 
Create table DemoLob ( A number, B clob ) 
       LOB(b)  
	STORE AS lobsegname (  
	  TABLESPACE lobsegts  
	  STORAGE (lobsegment storage clause)  
	  INDEX lobindexname ( 
		TABLESPACE lobidxts 
		STORAGE ( lobindex storage clause )  
	  )  
	) 
	TABLESPACE tables_ts 
	STORAGE( tables storage clause ) 
; 

CREATE TABLE t_lob 
(DOCUMENT_NR NUMBER(16,0) NOT NULL, 
DOCUMENT_BLOB BLOB NOT NULL 
) 
STORAGE 
(INITIAL 100k 
NEXT 100K 
PCTINCREASE 0 
MAXEXTENTS 100 
) 
TABLESPACE system 
lob (DOCUMENT_BLOB) store as DOCUMENT_LOB 
(tablespace ts storage 
(initial 30K next 30K pctincrease 30 maxextents 3) 
index (tablespace ts_index storage 
(initial 40K next 40K pctincrease 40 maxextents 4))); 

 
   In 8.0 the LOB INDEX can be stored separately from the lob segment. 
   If a tablespace is specified for the LOB SEGMENT then the LOB INDEX 
   will be placed in the same tablespace UNLESS a different tablespace 
   is explicitly specified. 
   Unless you specify names for the LOB segments system generated names 
   are used. 
 
 
 In ROW Versus Out of ROW 
 ~~~~~~~~~~~~~~~~~~~~~~~~ 
  LOB columns can be allowed to store data within the row or not as detailed 
  below. Whether in-line storage is allowed or not can ONLY be specified 
  at creation time. 
 
  "STORE AS ( enable storage in row )" 
	Allows LOB data to be stored in the TABLE segment provided 
	it is less than about 4000 bytes.   
 
	The actual maximum in-line LOB is 3964 bytes. 
 
	If the lob value is greater than 3964 bytes then the LOB data is 
	stored in the LOB SEGMENT (ie: out of line). An out of line 
	LOB behaves as described under 'disable storage in row' except that 
	if its size shrinks to 3964 or less the LOB can again be stored  
	inline. 
 
	When a LOB is stored out-of-line in an 'enable storage in row' 
	LOB column between 36 and 84 bytes of control data remain in-line  
	in the row piece. 
 
	In-line LOBS are subject to normal chaining and row migration 
	rules within Oracle. Ie: If you store a 3900 byte LOB in a row 
	with a 2K block size then the row piece will be chained across 
	two or more blocks. 
 
	Both REDO and UNDO are written for in-line LOBS as they are part 
	of the normal row data.  
 
 
  "STORE AS ( disable storage in row )" 
	This option prevents any size of LOB from being stored in-line. 
 
	Instead a 20 byte LOB locator is stored in the ROW which gives 
	a unique identifier for a LOB in the LOB segment for this column. 
 
	The Lob Locator actually gives a key into the LOB INDEX which  
	contains a list of all blocks (or pages) that make up the LOB. 
 
	The minimum storage allocation for an out of line LOB is 1 Database 
	BLOCK per LOB ITEM and may be more if CHUNK is larger than a  
	single block. 
 
	UNDO is only written for the column locator and LOB INDEX changes. 
 
	No UNDO is generated for pages in the LOB SEGMENT. 
	Consistent Read is achieved by using page versions. 
	Ie: When you update a page of a LOB the OLD page remains and a 
	    new page is created. This can appear to waste space but 
	    old pages can be reclaimed and reused. 
 
 
 CHUNK size 
 ~~~~~~~~~~ 
  "STORE AS ( CHUNK bytes ) " 
	Can ONLY be specified at creation time. 
 
	In 8.0 values of CHUNK are in bytes and are rounded to the next  
	highest multiple of DB_BLOCK_SIZE without erroring.  
	Eg: If you specify a CHUNK of 3000 with a block size of 2K then 
	    CHUNK is set to 4096 bytes. 
 
	"bytes" / DB_BLOCK_SIZE determines the unit of allocation of 
	blocks to an 'out of line' LOB in the LOB segment.  
	Eg: if CHUNK is 32K and the LOB is 'disable storage in row'   
	    then even if the LOB is only 10 bytes long 32K will be  
	    allocated in the LOB SEGMENT. 
 
	CHUNK does NOT affect in-line LOBS. 
 
 
 PCTVERSION 
 ~~~~~~~~~~ 
  "STORE AS ( PCTVERSION n )" 
	PCTVERSION can be changed after creation using: 
                ALTER TABLE tabname MODIFY LOB (lobname) ( PCTVERSION n ); 
 
	PCTVERSION affects the reclamation of old copies of LOB data. 
	This affects the ability to perform consistent read. 
 
	If a session is attempting to use an OLD version of a LOB 
	and that version gets overwritten (because PCTVERSION is too small) 
	then the user will typically see the errors: 
		ORA-01555: snapshot too old:  
				rollback segment number  with name "" too small 
		ORA-22924: snapshot too old 
 
	PCTVERSION can prevent OLD pages being used and force the segment 
	to extend instead.  
 
	Do not expect PCTVERSION to be an exact percentage of space as there  
	is an internal fudge factor applied. 
 
 
 CACHE 
 ~~~~~ 
  "STORE AS ( CACHE )" or "STORE AS ( NOCACHE )" 
	This option can be changed after creation using: 
		ALTER TABLE tabname MODIFY LOB (lobname) ( CACHE ); 
	or 
		ALTER TABLE tabname MODIFY LOB (lobname) ( NOCACHE ); 
 
	With NOCACHE set (the default) reads from and writes to the 
	LOB SEGMENT occur using direct reads and writes. This means that 
	the blocks are never cached in the buffer cache and the the Oracle 
	shadow process performs the reads/writes itself. 
	The reads / writes show up under the wait events "direct path read" 
	and "direct path write" and multiple blocks can be read/written at  
	a time (provided the caller is using a large enough buffer size). 
 
	When set the CACHE option causes the LOB SEGMENT blocks to 
	be read / written via the buffer cache . Reads show up as  
	"db file sequential read" but unlike a table scan the blocks are  
	placed at the most-recently-used end of the LRU chain. 
 
	The CACHE options for LOB columns is different to the CACHE 
	option for tables as CACHE_SIZE_THRESHOLD does not limit the  
	size of LOB read into the buffer cache. This means that extreme 
	caution is required otherwise the read of a long LOB can effectively 
	flush the cache. 
 
	In-line LOBS are not affected by the CACHE option as they reside 
	in the actual table block (which is typically accessed via the buffer  
	cache any way). 
 
	The cache option can affect the amount of REDO generated for 
	out of line LOBS. With NOCACHE blocks are direct loaded and 
	so entire block images are written to the REDO stream. If CHUNK 
	is also set then enough blocks to cover CHUNK are written to REDO. 
	If CACHE is set then the block changes are written to REDO.  
	Eg: In the extreme case  'DISABLE STORAGE IN ROW  NOCACHE  CHUNK 32K' 
	    would write redo for the whole 32K even if the LOB was only 
	    5 characters long. CACHE would write a redo record describing the 
	    5 byte change (taking about 100-200 bytes). 
 
 
 LOGGING 
 ~~~~~~~ 
   "STORE AS ( NOCACHE LOGGING )" or "STORE AS ( NOCACHE NOLOGGING )" 
	This option can be changed after creation but the LOGGING / NOLOGGING 
	attribute must be prefixed by the NOCACHE option. The CACHE option 
  	implicitly enables LOGGING. 
 
	The default for this option is LOGGING. 
 
	If a LOB is set to NOCACHE NOLOGGING then updates to the LOB SEGMENT 
	are not logged to the redo logs. However, updates to in-line LOBS 
	are still logged as normal. As NOCACHE operations use direct 
	block updates then all LOB segment operations are affected. 
	NOLOGGING of the LOB segment means that if you have to recover the  
	database then sections of the LOB segment will be marked as corrupt  
	during recovery.  
 
 
Space required for updates 
~~~~~~~~~~~~~~~~~~~~~~~~~~ 
  If a LOB is out-of-line then updates to pages if the LOB cause new  
  versions of those pages to be created. Rollback is achieved by reverting 
  back to the pre-updated page versions. This has implications on the  
  amount of space required when a LOB is being updated as the LOB SEGMENT 
  needs enough space to hold both the OLD and NEW pages concurrently in case 
  your transaction rolls back. 
  Eg: Consider the following: 
	INSERT a large LOB		LOB SEGMENT extends take the new pages 
	COMMIT; 
	DELETE the above LOB		The LOB pages are not yet free as 
					they will be needed in case of  
					rollback. 
	INSERT a new LOB		Hence this insert may require more  
					space in the LOB SEGMENT 
	COMMIT;				Only after this point could the 
					deleted pages be used. 
 
Performance Issues 
~~~~~~~~~~~~~~~~~~~ 
  Working with LOBs generally requires more than one round trip to the database. 
  The application first has to obtain the locator and only then can perform 
  operations against that locator. This is true for inline or out of line  
  LOBS. 
 
  The buffer size used to read / write the LOB can have a significant 
  impact on performance, as can the SQL*Net packet sizes. 
  Eg: With OCILobRead() a buffer size is specified for handling the LOB. 
      If this is small (say 2K) then there can be a round trip to the database 
      for each 2K chunk of the LOB. To make the issue worse the server will 
      only fetch the blocks needed to satisfy the current request so may  
      perform single block reads against the LOB SEGMENT. If however a larger  
      chunk size is used (say 32K) then the server can perform multiblock  
      operations and pass the data back in larger chunks. 
 
  There is a LOB buffering subsystem which can be used to help improve 
  the transfer of LOBs between the client and server processes. See the 
  documentation for details of this. 
 
 
BFILEs 
~~~~~~ 
  BFILEs are quite different to internal LOBS as the only real storage 
  issue is the space required for the inline locator. This is about 20 bytes 
  PLUS the length of the directory and filename elements of the BFILENAME. 
 
  The performance implications of the buffer size are the same as for internal 
  LOBS. 
 
References 
~~~~~~~~~~ 
 
[NOTE:162345.1] <ml2_documents.showDocument?p_id=162345.1&p_database_id=NOT> 
LOBS - Storage, Read-consistency and Rollback 

Note 4:
=======

 
Doc ID: 	Note:159995.1	Content Type: 	TEXT/X-HTML	   
Subject: 	Different Behaviors of Lob and Lobindex Segments in 8.0, 8i and 9i	Creation Date: 	05-OCT-2001	   
Type: 	BULLETIN	Last Revision Date: 	27-MAR-2003	   
Status: 	PUBLISHED		 
PURPOSE
------- 
This bulletin lists the different behaviors of a lob index segment regarding 
tablespace and storage values: 
-> When creating the table, the lob and lob index segments 
-> Altering the associated lob segment and/or lob index segment. 
SCOPE & APPLICATION
------------------- 
For all DBAs who manage different versions of Oracle with databases containing 
LOB segments, and who need to maintain the associated lob indexes. 
Under 8i and 9i 
In Oracle8i SQL Reference and Oracle9i SQL Reference, it is clearly stated that: 
lob_index_clause 
This clause is deprecated as of Oracle8i. Oracle generates an index for each LOB column. 
Oracle names and manages the LOB indexes internally. Although it is still possible for 
you to specify this clause, Oracle Corporation strongly recommends that you no longer do 
so. In any event, do not put the LOB index in a different tablespace from the LOB data. 
1.Lob and lobindex specifications at table creation 
If you create a new table in release 8i and 9i and specify a tablespace 
and storage values for the LOB index for a non-partitioned table, the 
tablespace specification and storage values are ignored. 
The LOB index is located in the same tablespace as the LOB segment 
with the same storage values, except the NEXT and MAXEXTENTS values. 
the NEXT value of the lobindex = INITIAL default value of the tablespace (LOB segment) 

the MAXEXTENTS value of the lobindex = unlimited value (2Gb) 

SQL> CREATE TABLE t_lob 
2 (DOCUMENT_NR NUMBER(16,0) NOT NULL, 
3 DOCUMENT_BLOB BLOB NOT NULL 
4 ) 
5 STORAGE 
6 (INITIAL 100k 
7 NEXT 100K 
8 PCTINCREASE 0 
9 MAXEXTENTS 100 
10 ) 
11 TABLESPACE system 
12 lob (DOCUMENT_BLOB) store as DOCUMENT_LOB 
13 (tablespace ts storage 
14 (initial 30K next 30K pctincrease 30 maxextents 3) 
15 index (tablespace ts_index storage 
16 (initial 40K next 40K pctincrease 40 maxextents 4))); 

Table created. 

SQL> select segment_name, segment_type, tablespace_name, 
2 initial_extent, next_extent, pct_increase, max_extents 
3 from user_segments; 
SEGMENT_NAME SEGMENT_TY TABLESPA INITIAL NEXT_EXT PCT_INC MAX_EXT 
----------------------- ----------- --------- -------- -------- ------- --------- 
T_LOB TABLE SYSTEM 102400 102400 0 100 
SYS_IL0000020297C00002$$ LOBINDEX TS 30720 10240 30 2147483645 
DOCUMENT_LOB LOBSEGMENT TS 30720 30720 30 3 
All storage modifications are based on this original table t_lob. 
2.Lob and lobindex storage modifications 
When you modify the storage values for the lob and lob index segments, 
the values of the lob index are kept as initially set, except the PCT_INCREASE. 
The value of the lob segment PCTINCREASE spreads out on the lob index: 
SQL> alter table t_lob 
2 modify lob (document_blob) 
3 (storage (next 60K pctincrease 60 maxextents 6) 
4 index (storage (next 70K pctincrease 70 maxextents 7))); 
Table altered. 
SQL> select segment_name, segment_type, tablespace_name, 
2 initial_extent, next_extent, pct_increase, max_extents 
3 from user_segments; 
SEGMENT_NAME SEGMENT_TY TABLESPA INITIAL NEXT_EXT PCT_INC MAX_EXT 
----------------------- ----------- --------- -------- -------- ------- --------- 
T_LOB TABLE SYSTEM 102400 102400 0 100 
SYS_IL0000020297C00002$$ LOBINDEX TS 30720 10240 60 2147483645 
DOCUMENT_LOB LOBSEGMENT TS 30720 61440 60 6 
3.Storage modifications of lob segment only 
If you modify the storage values for the lob segment only, you get the same behaviour: 
SQL> alter table t_lob 
2 modify lob (document_blob) 
3 (storage (next 60K pctincrease 60 maxextents 6)); 
Table altered. 
SQL> select segment_name, segment_type, tablespace_name, 
2 initial_extent, next_extent, pct_increase, max_extents 
3 from user_segments; 
SEGMENT_NAME SEGMENT_TY TABLESPA INITIAL NEXT_EXT PCT_INC MAX_EXT 
----------------------- ----------- --------- -------- -------- ------- --------- 
T_LOB TABLE SYSTEM 102400 102400 0 100 
SYS_IL0000020297C00002$$ LOBINDEX TS 30720 10240 60 2147483645 
DOCUMENT_LOB LOBSEGMENT TS 30720 61440 60 3 
4.Storage modifications of lobindex segment only 
If you modify the storage values for the lob index segment only, nothing is altered: 
SQL> alter table t_lob 
2 modify lob (document_blob) 
3 (index (storage (next 70K pctincrease 70 maxextents 7))) 
4 ; 
Table altered. 
SQL> select segment_name, segment_type, tablespace_name, 
2 initial_extent, next_extent, pct_increase, max_extents 
3 from user_segments; 
SEGMENT_NAME SEGMENT_TY TABLESPA INITIAL NEXT_EXT PCT_INC MAX_EXT 
----------------------- ----------- --------- -------- -------- ------- --------- 
T_LOB TABLE SYSTEM 102400 102400 0 100 
SYS_IL0000020297C00002$$ LOBINDEX TS 30720 10240 30 2147483645 
DOCUMENT_LOB LOBSEGMENT TS 30720 30720 30 3 
If you attempt to modify the storage values of the lob index directly, 
you get an error message: 
SQL> alter index SYS_IL0000020297C00002$$ storage (pctincrease 80); 
alter index SYS_IL0000020297C00002$$ storage (pctincrease 80) 
* 
ERROR at line 1: 
ORA-22864: cannot ALTER or DROP LOB indexes 
SQL> alter index SYS_IL0000020297C00002$$ rebuild storage (pctincrease 60); 
alter index SYS_IL0000020297C00002$$ rebuild storage (pctincrease 60) 
* 
ERROR at line 1: 
ORA-02327: cannot create index on expression with datatype LOB 
Under 8.0 
1.Lob and lobindex specifications at table creation 
If you create a new table in release 8.0 and specify a tablespace for the LOB index for 
a non-partitioned table, the tablespace specification and storage values are encountered. 
The LOB index is located in the defined tablespace with the user-defined storage values. 
SQL> CREATE TABLE t_lob 
2 (DOCUMENT_NR NUMBER(16,0) NOT NULL, 
3 DOCUMENT_BLOB BLOB NOT NULL 
4 ) 
5 STORAGE 
6 (INITIAL 100k 
7 NEXT 100K 
8 PCTINCREASE 0 
9 MAXEXTENTS 100 
10 ) 
11 TABLESPACE system 
12 lob (DOCUMENT_BLOB) store as DOCUMENT_LOB 
13 (tablespace ts storage 
14 (initial 30K next 30K pctincrease 30 maxextents 3) 
15 index (tablespace ts_index storage 
16 (initial 40K next 40K pctincrease 40 maxextents 4))); 
Table created. 
SQL> select segment_name, segment_type, tablespace_name, 
2 initial_extent, next_extent, pct_increase, max_extents 
3 from user_segments; 
SEGMENT_NAME SEGMENT_TY TABLESPA INITIAL NEXT_EXT PCT_INC MAX_EXT 
----------------------- ----------- --------- -------- -------- ------- --------- 
T_LOB TABLE SYSTEM 102400 102400 0 100 
SYS_IL0000020297C00002$$ LOBINDEX TS_INDEX 40960 40960 40 4 
DOCUMENT_LOB LOBSEGMENT TS 32768 30720 30 3 
All storage modifications are based on this original table t_lob. 
2.Lob and lobindex storage modifications 
When you modify the storage values for the lob and lob index segments, 
the values for the lobindex are kept as initially set: 
SQL> alter table t_lob 
2 modify lob (document_blob) 
3 (storage (next 60K pctincrease 60 maxextents 6) 
4 index (storage (next 70K pctincrease 70 maxextents 7))); 
Table altered. 
SQL> select segment_name, segment_type, tablespace_name, 
2 initial_extent, next_extent, pct_increase, max_extents 
3 from user_segments; 
SEGMENT_NAME SEGMENT_TY TABLESPA INITIAL NEXT_EXT PCT_INC MAX_EXT 
----------------------- ----------- --------- -------- -------- ------- --------- 
T_LOB TABLE SYSTEM 102400 102400 0 100 
SYS_IL0000020297C00002$$ LOBINDEX TS_INDEX 40960 40960 40 4 
DOCUMENT_LOB LOBSEGMENT TS 32768 61440 60 6 
3.Storage modifications of lob segment only 
If you modify the storage values for the lob segment only, you get the same behavior: 
SQL> alter table t_lob 
2 modify lob (document_blob) 
3 (storage (next 60K pctincrease 60 maxextents 6)); 
Table altered. 
SQL> select segment_name, segment_type, tablespace_name, 
2 initial_extent, next_extent, pct_increase, max_extents 
3 from user_segments; 

SEGMENT_NAME SEGMENT_TY TABLESPA INITIAL NEXT_EXT PCT_INC MAX_EXT 
----------------------- ----------- --------- -------- -------- ------- --------- 
T_LOB TABLE SYSTEM 102400 102400 0 100 
SYS_IL0000020297C00002$$ LOBINDEX TS_INDEX 40960 40960 40 4 
DOCUMENT_LOB LOBSEGMENT TS 32768 61440 60 6 

Again, the lob segment storage values do not impact the lob index. 
4.Storage modifications of lobindex segment only 
If you modify the storage values for the lob index segment only, nothing is altered: 
SQL> alter table t_lob 
2 modify lob (document_blob) 
3 (index (storage (next 70K pctincrease 70 maxextents 7))) 
4 ; 
Table altered. 
SQL> select segment_name, segment_type, tablespace_name, 
2 initial_extent, next_extent, pct_increase, max_extents 
3 from user_segments; 

SEGMENT_NAME SEGMENT_TY TABLESPA INITIAL NEXT_EXT PCT_INC MAX_EXT 
----------------------- ----------- --------- -------- -------- ------- --------- 
T_LOB TABLE SYSTEM 102400 102400 0 100 
SYS_IL0000020297C00002$$ LOBINDEX TS_INDEX 40960 40960 40 4 
DOCUMENT_LOB LOBSEGMENT TS 32768 30720 30 3 

If you attempt to modify the storage values of the lob index directly, 
you get an error message: 
SQL> alter index SYS_IL0000020297C00002$$ storage (pctincrease 20); 
alter index SYS_IL0000020297C00002$$ storage (pctincrease 20) 
* 
ERROR at line 1: 
ORA-22864: cannot ALTER or DROP LOB indexes 
Migration from 7 to 9i 
The "Oracle9i Database Migration Release 1 (9.0.1)" documentation states: 
LOB Index Clause 
If you used the LOB index clause to store LOB index data in a tablespace 
separate from the tablespace used to store the LOB, the index data 
is relocated to reside in the same tablespace as the LOB. 
If you used Export/Import to migrate from Oracle7 to Oracle9i, the index 
data was relocated automatically during migration. However, the index data 
was not relocated if you used the Migration utility or the Oracle Data 
Migration Assistant. 
RELATED DOCUMENTS
----------------- 
<Note:66431.1> LOBS - Storage, Redo and Performance Issues 
<Bug:1353339> ALTER TABLE MODIFY DEFAULT ATTRIBUTES LOB DOES NOT UPDATE LOB INDEX DEFAULT TS 
<Bug:1864548> LARGE LOB INDEX SEGMENT SIZE 
<Bug:747326> ALTER TABLE MODIFY LOB STORAGE PARAMETER DOES'T WORK 
<Bug:1244654> UNABLE TO CHANGE STORAGE CHARACTERISTICS FOR LOB INDEXES 


Note 5:
=======

Calculate sizes:

Example 
------- 
SQL> create table my_lob 
2 (idx number null, a_lob clob null, b_lob blob null) 
3 storage (initial 20k maxextents 121 pctincrease 0 ) 
4 lob (a_lob, b_lob) store as 
5 ( storage ( initial 100k next 100K maxextents 999 pctincrease 0)); 
Table created. 
SQL> select object_name,object_type,object_id from user_objects order by 2; 
OBJECT_NAME OBJECT_TYPE OBJECT_ID 
---------------------------------------- ------------------ ---------- 
SYS_LOB0000004017C00002$$ LOB 4018 
SYS_LOB0000004017C00003$$ LOB 4020 
MY_LOB TABLE 4017 
SQL> select bytes, s.segment_name,s.segment_type 
2 from dba_segments s 
3 where s.segment_name='MY_LOB'; 
BYTES SEGMENT_NAME SEGMENT_TYPE 
---------- ------------------------------ ------------------ 
65536 MY_LOB TABLE 
SQL> select sum(bytes), s.segment_name, s.segment_type 
2 from dba_lobs l, dba_segments s 
3 where s.segment_type = 'LOBSEGMENT' 
4 and l.table_name = 'MY_LOB' 
5 and s.segment_name = l.segment_name 
6 group by s.segment_name,s.segment_type; 
SUM(BYTES) SEGMENT_NAME SEGMENT_TYPE 
---------- ------------------------------ ------------------ 
131072 SYS_LOB0000004017C00002$$ LOBSEGMENT 
131072 SYS_LOB0000004017C00003$$ LOBSEGMENT 
Therefore the total size for the table MY_LOB is: 
65536 (for the table) + 131072 (for CLOB segment) + 131072 (for BLOB segment) 
=> 327680 bytes 


Note 6:
=======


Doc ID:  Note:268476.1 
Subject:  LOB Performance Guideline 
Type:  WHITE PAPER 
Status:  PUBLISHED 
 Content Type:  TEXT/X-HTML 
Creation Date:  09-APR-2004 
Last Revision Date:  22-JUN-2004 
 
LOB Performance Guidelines


An Oracle White Paper

April 2004


LOB Performance Guidelines
Executive Overview.............................................................................. 3
LOB Overview...................................................................................... 3
Important Storage Parameters................................................................ 4
CHUNK............................................................................................ 4
Definition...................................................................................... 4
Points to Note............................................................................... 4
Recommendation........................................................................... 4
In-line and Out-of-Line storage: ENABLE STORAGE IN ROW and DISABLE STORAGE IN ROW 4
Definition...................................................................................... 4
Points to Note............................................................................... 5
Recommendation........................................................................... 5
CACHE, NOCACHE....................................................................... 5
Definition...................................................................................... 5
Points to Note............................................................................... 6
Recommendation........................................................................... 6
Consistent Reads on LOBs: RETENTION and PCTVERSION...... 6
Definition...................................................................................... 6
Points to Note............................................................................... 6
Recommendation........................................................................... 7
LOGGING, NOLOGGING............................................................. 7
Definition...................................................................................... 7
Points to Note............................................................................... 7
Recommendation........................................................................... 7
Performance GUIDELINE ? LOB Loading.......................................... 8
Points to Note................................................................................... 8
Use array operations for LOB inserts............................................. 8
Scalability problem ? with LOB disable storage in row option...... 8
Row Chaining problem ? with the use of OCILobWrite API......... 8
High number of consistent read blocks created and examined...... 9
CPU time and Elapsed time - not reported accurately................... 9
Reads/Writes are done one chunk at a time in synchronous way 10
High CPU system time................................................................. 11
Buffer cache sizing problem......................................................... 11
Multi-byte character set conversion............................................. 11
HWM enqueue contention........................................................... 11
RAC environment issues.............................................................. 12
Other LOB performance related issues....................................... 12

APPENDIX A..................................................................................... 13
LONG API access to LOB datatype............................................... 13
APPENDIX B..................................................................................... 15
Migration from in-line to out-of-line (and out-of-line to in-line) storage 15
APPENDIX C..................................................................................... 16
How LOB data is stored.................................................................. 16
In-line LOB ? LOB size less than 3964 bytes............................. 16
In-line LOB ? LOB size = 3965 bytes (1 byte greater than 3964) 16
In-line LOB ? LOB size greater than 12 chunk addresses........... 17
Out-of-line LOBs ? All LOB sizes.............................................. 17


LOB Performance Guidelines

Executive Overview

This document gives a brief overview of Oracle?s LOB data structure, emphasizing various storage parameter options 
and describes scenarios where those storage parameters are best used. The purpose of the latter is to help describe 
the effects of readers select the appropriate LOB storage options. This paper assumes that most customers load 
LOB data once and retrieve many times (less than 10% of DML is update and delete), so performance guidelines provided 
here are for LOB loading.

LOBs were designed to efficiently store and retrieve large amounts of data. Small LOBs (< 1MB) perform better 
than LONGs for inserts, and have comparable performance on selects. Large LOBs perform better than LONGs in general. 

Oracle recommends the use of LOBs to store unstructured or semi-structured data, and has provided a LONG API 
to allow ease of migration from LONGs to LOBs. Oracle plans to de-support LONGs in the future.

LOB Overview

Whenever a table containing a LOB column is created, two segments are created to hold the specified LOB column. 
These segments are of type LOBSEGMENT and LOBINDEX. 
The LOBINDEX segment is used to access LOB chunks/pages that are stored in the LOBSEGMENT segment.

CREATE TABLE foo (pkey NUMBER, bar BLOB);

SELECT segment_name, segment_type FROM user_extents;

9792 is the object_id of the parent table FOO 
(if a table has more than one LOB column, LOB segment names are generated differently, 
use dba|user_lobs view to get parent table association). 
 

SEGMENT_NAME                          SEGMENT_TYPE 
FOO                                   TABLE
SYS_IL0000009792C00002$$              LOBINDEX
SYS_LOB0000009792C00002$$             LOBSEGMENT (also referred as LOB chunks/pages)
 

The LOBSEGMENT and the LOBINDEX segments are stored in the same tablespace as the table containing the LOB, 
unless otherwise specified.[1] 

Important Storage Parameters

This section defines the important storage parameters of a LOB column (or a LOB attribute) - . 
?fFor each definition we describe the effects of the parameter, and give recommendations for on how to get 
better performance and to avoid errors. 

CHUNK

Definition

CHUNK is the smallest unit of LOBSEGMENT allocation. It is a multiple of DB_BLOCK_SIZE.

Points to Note

?         For example, if the value of CHUNK is 8K and an inserted LOB is only 1K in size, then 1 chunk 
          is allocated and 7K are wasted in that chunk. The CHUNK option does NOT affect in-line LOBs 
          (see the definition in the next section)

?         Choose an appropriate chunk size for best performance also to avoid space wastage. 
          The maximum chunk size is 32K.

?         The CHUNK parameter cannot be altered.

Recommendation

Choose a chunk size for optimal performance and minimum space wastage. For LOBs that are less than 32K, 
a chunk size that is 60% (or more) of the LOB size is a good starting point. For LOBs larger than 32K, 
choose a chunk size equal to the frequent update size.

In-line and Out-of-Line storage: ENABLE STORAGE IN ROW and DISABLE STORAGE IN ROW

Definition

LOB storage is said to be Inin-line when the LOB data is stored with the other column data in the row. 
A LOB can only be stored inline if its size is less than ~4000 bytes. For in-line LOB data, space is allocated 
in the table segment (the LOBINDEX and LOBSEGMENT segments are empty).

LOB storage is said to be out-of-line when the LOB data is stored , in CHUNK sized blocks in the LOBSEGMENT segment, 
separate from the other columns? data.

ENABLE STORAGE IN ROW allows LOB data to be stored in the table segment provided it is less than ~4000 bytes.

DISABLE STORAGE IN ROW prevents LOB data from being stored in-line, regardless of the size of the LOB. 
Instead only a 20-byte LOB locator is stored with the other column data in the table segment.

Points to Note

?         In-line LOBs are subject to normal chaining and row migration rules within Oracle. If you store a 
          3900 byte LOB in a row with 2K block size then the row will be chained across two or more blocks. 
          Both REDO and UNDO are written for in-line LOBs as they are part of the normal row data. 
          The CHUNK option does not affect in-line LOBs.

?         With out-of-line storage, UNDO is written only for the LOB locator and LOBINDEX changes. 
          No UNDO is generated for chunks/pages in the LOBSEGMENT. Consistent Read is achieved by using 
          page versions (see the RETENTION or PCTVERSION options).

?         DML operations on out-of-line LOBs can generate high amounts of redo information, because redo is 
          generated for the entire chunk. For example, in the extreme case, 
          ?DISABLE STORAGE IN ROW CHUNK 32K? would write redo for the whole 32K even if the LOB changes were was 
          only 5 bytes. 

?         When in-line LOB data is updated, and if the new LOB size is greater than 3964 bytes, then it is 
          migrated and stored out-of-line. If this migrated LOB is updated again and its size becomes less 
          than 3964 bytes, it is not moved back in-line (except when we use LONG API for update).

?         ENABLE|DISABLE STORAGE IN ROW parameters cannot be altered.

Recommendation

Use ENABLE STORAGE IN ROW, except in cases where the LOB data is not retrieved as much as other columns? data. 
In this case, if the LOB data is stored out-of-line, the biggest gain is achieved while performing full table scans, 
as the operation does not retrieve the LOB?s data.

CACHE, NOCACHE

Definition

The CACHE storage parameter causes LOB data blocks to be read/written via the buffer cache. 

With the NOCACHE storage parameter, LOB data is read/written using direct reads/writes. This means that the LOB data 
blocks are never in the buffer cache and the Oracle server process performs the reads/writes. 


Points to Note

?         With the CACHE option, LOB data reads show up as wait event ?db file sequential read?, writes are performed 
          by the DBWR process. With the NOCACHE option, LOB data reads/writes show up as wait events 
          direct path read (lob)?/?direct path write (lob)?. Corresponding statistics are ?physical reads direct (lob)? 
          and ?physical writes direct (lob)?.

?         In-line LOBs are not affected by the CACHE option as they reside with the other column data, 
          which is typically accessed via the buffer cache. 

?         The CACHE option gives better read/write performance than the NOCACHE option.

?         The CACHE option for LOB columns is different from the CACHE option for tables. This means that caution 
          is required otherwise the read of a large LOB can effectively flush the buffer cache.

?         The CACHE|NOCACHE option can be altered. 

Recommendation

Enable caching, except for cases where caching LOBs would severely impact performance for other online users, 
by forcing these users to perform disk reads rather than getting cache hits.

Consistent Reads on LOBs: RETENTION and PCTVERSION

Consistent Read (CR) on LOBs uses a different mechanism than that used for other data blocks in Oracle. 
Older versions of the LOB are retained in the LOB segment and CR is used on the LOB index to access these 
older versions (for in-line LOBs which are stored in the table segment, the regular UNDO mechanism is used). 
There are two ways to control how long older versions are maintained. 

Definition

?         RETENTION ? time-based: this specifies how long older versions are to be retained.

?         PCTVERSION ? space-based: this specifies what percentage of the LOB segment is to be used 
          to hold older versions.

Points to Note

?         RETENTION is a keyword in the LOB column definition. No value can be specified for RETENTION. 
          The RETENTION value is implicit,.. If a LOB is created with database compatibility set to 
          9.2.0.0 or higher, undo_management=TRUE and PCTVERSION is not explicitly specified, 
          time-based retention is used. The LOB RETENTION value is always equal to the value of the 
          UNDO_RETENTION database instance parameter.

?         You cannot specify both PCTVERSION and RETENTION.

?         PCTVERSION is applicable only to LOB chunks/pages allocated in LOBSEGMENTS. Other LOB related data 
          in the table column and the LOBINDEX segment use regular undo mechanism.

?         PCTVERSION=0: the space allocated for older versions of LOB data in LOBSEGMENTS can be reused 
          by other transactions and can cause ?snapshot too old? errors.

?         PCTVERSION=100: the space allocated by older versions of LOB data can never be reused by other transactions. 
          LOB data storage space is never reclaimed and it always increases. 

?         RETENTION and PCTVERSION can be altered

Recommendation

Time-based retention using the RETENTION keyword is preferred.

A high value for RETENTION or PCTVERSION may be needed to avoid ?snapshot too old? errors in environments 
with high concurrent read/write LOB access.

LOGGING, NOLOGGING

Definition

LOGGING: enables logging of LOB data changes to the redo logs.

NOLOGGING: changes to LOB data (stored in LOBSEGMENTs) are not logged into the redo logs, however in-line LOB 
changes are still logged as normal.

Points to Note

?         The CACHE option implicitly enables LOGGING.

?         If NOLOGGING was set, and if you have to recover the database, 
          then sections of the LOBSEGMENT will be marked as corrupt during recovery 
          (LOBINDEX changes are logged to redo logs and are recovered, but the corresponding LOBSEGMENTs 
          are not logged for recovery).

?         LOGGING|NOLOGGING can be altered. The NOCACHE option is required to turn off LOGGING, e.g. (NOCACHE NOLOGGING).

Recommendation

Use NOLOGGING only when doing bulk loads or migrating from LONG to LOB. 
Backup is recommended after bulk operations.


Performance GUIDELINE  LOB Loading

In the rest of the document, you will notice LOB API and LONG API methods being referenced many times. 
The difference between these APIs is as follows:

LOB API: the LOB data is accessed by first selecting the LOB locator. 
LONG API: the LOB data is accessed without using the LOB locator.

Points to Note
Use array operations for LOB inserts
Scalability problem with LOB disable storage in row option

BUG 3180333 - LOB LOADING USING SQLLDR DOESN'T SCALE

Problem scenario: 2 (or more) concurrent sqlldr processes trying to load LOB data (LOB column defined with 
DISABLE STORAGE IN ROW). Loading will run almost serially. Serialization point is getting a CR copy of the LOBINDEX block.

Workaround: use ENABLE STORAGE IN ROW even for LOBs whose size is greater than 3964 bytes. 
With ENABLE STORAGE IN ROW, we store the first 12 chunk addresses in the table row and if the inserted LOB data size 
can be addressed within these first 12 chunk addresses, then LOBINDEX is empty. Generating a CR version of a table block 
is more efficient and,,, in some cases, not required. This code path provides much better scalability. Please note that 
if LOB data is larger than 12 chunk addresses, then we may see CR contention with the ENABLE STORAGE IN ROW option as well.

 
Row Chaining problem with the use of OCILobWrite API

TAR 2760194.995 (UK) - LOADING SMALL (AVG LEN 1120) CLOB DATA INTO TABLE PRODUCES MUCH CHAINING, WHY?

Problem scenario: in 10gR1 (and older releases), SQL*Loader uses OCILobWrite API for LOB loading. 
This leads to a row chaining problem, as described below:

CREATE TABLE foo (pkey NUMBER NOT NULL, bar BLOB);

Load 3 rows with LOB data size as 3700, 3000 and 3400 respectively. 

SQL*Loader loads the LOB columns, first by inserting empty_blob, and second, by writing the LOB data using the LOB locator.
In the first step, the average row length is pkey length + empty_blob length= 4 + 40 bytes = ~44 bytes. 
Assuming that DB_BLOCK_SIZE=8192, these 3 rows can be inserted into one data block. 

In the second step, loading LOB data, the 1st row, 3700 bytes of LOB, and the 2nd row, 3000 bytes of LOB, can be inserted 
into the same block. However, for the 3rd row of LOB data, there is no space left in that block, so the row must be chained. 

Workaround: the first workaround could be to increase the value of PCTFREE. It may help solve this problem, 
but it unnecessarily wastes space. The second workaround is to write a loader program using the LONG API method 
(please note that an enhancement request against sqlldr component is filed for this problem, and there is a plan 
to fix it in the future release).


High number of consistent read blocks created and examined

BUG 3297800 - SQLLDR MAY NEED TO USE LONG API INTERFACE FOR LOBS LESS THAN 2GB

Problem scenario: 2 (or more) concurrent sqlldr processes loading LOB data in conventional mode. Using the LOB API method 
for loading the LOB data in a single user environment may also cause a high number of CR block creation to occur.

As mentioned earlier, loading the LOB data is performed in 2 steps. . In the first step, sqlldr inserts empty_blob 
for LOB columns. Then, with this LOB locator, the LOB data is written using an OCILobWrite call. 
In a multi-user loading environment, before OCILobWrite is invoked, if other loading processes change the data block, 
it may be required to examine the block and, if required, a CR version of the block is created. 

Workaround: None, other than writing a loader program using he LONG API method

 
CPU time and Elapsed time - not reported accurately 

BUG 3504487 - DBMS_LOB/OCILob* CALL RESOURCE USAGE IS NOT REPORTED, AS THEY ARE NOT PART OF A CURSOR

Problem scenario: the work done using LOB API calls is not part of the cursor, so reporting resource usage while 
collecting statistics for the LOB workload, such as the CPU time or the elapsed time, may not be accurate. 

Example to illustrate this situation:

(We have already a table created as: CREATE TABLE foo (pkey NUMBER, bar BLOB);)

Declare
            lob_loc             blob;
            buffer              raw(32767);
        lob_amt             binary_integer := 16384;
begin
        buffer := utl_raw.cast_to_raw(rpad('FF', 32767, 'FF'));
        for j in 1..10000 loop
                    select bar into lob_loc from foo where pkey = j for update;
            dbms_lob.write(lob_loc, lob_amt, 1, buffer );
            commit;
        end loop;
        dbms_output.put_line ('Write test finished ');
end;
/

After executing the above PL/SQL, query V$SQL to measure cpu_time and elapsed time resource usage. 

select sql_text, cpu_time/100000, elapsed_time/100000 
from v$sql 
where sql_text like '%foo%' or sql_text like ?%dbms_lob%?;

SQL_TEXT 
------------------------------------------------------------------------------------------------ 
CPU_TIME/1000000 ELAPSED_TIME/1000000
-------------------------- -----------------------------------

declare       lob_loc             blob;       buffer              raw(32767);  
lob_amt             binary_integer := 16384 ; 
                 begin       buffer := utl_raw.cast_to_raw(rpad('FF', 32767, 'FF'));       
                for j in 1..10000 loop    
                select 
bar into lob_loc from foo where pkey = j for update;       
                  dbms_lob.write(lob_loc, lob_amt, 1, buffer );         
                commit;       end loop;       dbms_output.put_line ('Write test finished '); end;
                19.54                                           19.28
 
 
SELECT bar from foo where pkey = :b1 for update
5.00                                             4.81
 

As you can see, the PL/SQL block took about 19.54 seconds in CPU time and 19.28 seconds in elapsed time respectively. 
Out of 19.54 secondss , the SELECT statement contributed to 5.00 seconds, so the remaining 14 seconds (approximately) 
were spent in dbms_lob.write. This is not reported, because the work done by dbms_lob.write is not part of a cursor. 
Similarly OCILOB API calls were not part of a cursor as well.
 

Workaround: None
Reads/Writes are done one chunk at a time in synchronous way


BUG 3437770 - LOB DIRECT PATH READ/WRITES ARE LIMITED BY CHUNK SIZE

Problem scenario: The Oracle server process does NOCACHE LOB reads/writes using a direct path mechanism. 
The limitation here is that reads/writes are done one chunk at a time in a synchronous way. Consider the example below:

Assuming CHUNK size=8K, DB_BLOCK_SIZE=2k, LOB data = 64K, 8 writes are done (each doing 4 blocks of write at a time) 
to load the entire LOB data, waiting for each write to complete before issuing another write.

Workaround: use as many loader processes as possible to maximize disk throughput.

 
High CPU system time 


BUG 3437770 - LOB DIRECT PATH READ/WRITES ARE LIMITED BY CHUNK SIZE

This is probably due to the above limitation (reads/writes are done one chunk at time in synchronous way)


Buffer cache sizing problem

Problem scenario: loading LOB data with the CACHE option will most likely fill up even a large buffer cache. 
Under this condition, a degradation in the load rate can be seen if the database writer doesn?t keep up with 
the foreground free buffer requests.

Workaround: follow the general instance tuning guidelines

- use asynchronous I/O (if not possible, use multiple db writer processes)

- stripe datafiles across many spindles

- use the NOCACHE option

The CACHE option will also force other online users to perform physical disk reads. 
This can be avoided by using multiple block sizes.

For example, keep online user objects in 4k (or 8k) block size tablespace and and cached LOB data in 8kK (or 16k) 
block size tablespace. Allocate the required amount of buffer cache for each block sizes 
(e.g. db_4k_block_buffer=500M, db_8k_block_buffer=2000M)

Multi-byte character set conversion

BUG 3324897 - LOBS LESS THAN 3964 BYTES ARE STORED OUT-OF-LINE WHILE LOADING USING SQLLDR

Problem scenario: wWhen dealing with multi-byte character set, additional bytes are required for CLOB data. This may cause 
client side CLOB data of ~ 4000 bytes, being stored out-of-line in the database.

Workaround: None


Use array operations for LOB inserts

HWM enqueue contention

BUG 3537749 - HW ENQUEUE CONTENTION WHEN LOADING LOB DATA

Problem scenario: given the large size of LOB data (compare to relational table row size), blocks under 
HWM are filled rapidly (under high concurrent load condition) and can cause HW enqueue contention.

Workaround: ASSM with larger extent size may help. 


RAC environment issues

BUG 3429986 - CONVENTIONAL LOAD OF LOB FROM 2 RAC NODE DO NOT SCALE DUE TO LOG FLUSH LATENCIES

Problem scenario: In a RAC environment, when loading LOB data into one partition, you may notice contention on 1st level 
bitmap and LOB header segment with ASSM. You may notice the same contention on a single instance 
(with a large number of CPUs) with a high number of concurrent loaders.

Workaround: loading into separate partitions will avoid this situation. If this is not possible, use range-hash partition 
instead of just range partitions. FREEPOOLS should help in this situation, but we need to do more testing to see 
the effect of this parameter.but didn?t provide any improvement in our testing.
 

Other LOB performance related issues

BUG 3234751 - EXCESSIVE USAGE OF TEMP TS WHILE LOADING LOB USING SQLLDR IN CONVENTIONAL MODE

BUG 3230541 - LOB LOADING USING SQLLDR DIRECT PATH SLOWER THAN CONVENTIONAL

BUG 3189083 - OPEN/CLOSE OF DATAFILE FOR EVERY LOB CHUNK WRITEWRITES

APPENDIX A

APPENDIX A 

LONG API access to LOB datatype

 
Oracle provides transparent access to LOBs from applications that use LONG and LONG RAW datatypes. If your application 
uses DML (INSERT, UPDATE, DELETE) statements from OCI or PL/SQL (PRO*C etc) for LONG or LONG RAW data, no application 
changes are required after the column is converted to a LOB. 

For example, you can SELECT a CLOB into a character variable, or a BLOB into a RAW variable. You can define a CLOB column 
as SQLT_CHR or a BLOB column as SQLT_BIN and select the LOB data directly into a CHARACTER or RAW buffer without selecting 
out the locator first. 

The following example demonstrates this concept:

create table foo 
(
pkey number(10) not null, 
bar long raw
);
 
set serveroutput on
 
declare
   in_buf              raw(32767);
   out_buf            raw(32767);
   out_pkey         number;
begin
in_buf := utl_raw.cast_to_raw (rpad('FF', 32767, 'FF'));
 
for j in 1..10 loop
   insert into foo values (j, in_buf) ;
   commit;
end loop;
dbms_output.put_line ('Write test finished ');
      
for j in 1..10 loop
   select pkey, bar into  out_pkey, out_buf from foo where pkey=j ;
end loop;
dbms_output.put_line ('Read test finished ');
 
end;
/
 
Now migrate LONG RAW column to BLOB column
 
alter table foo modify (bar blob);

That works.

alter table foo modify (bar long raw);
ERROR at line 1:
ORA-22859: invalid modification of columns

So that does not work.

 
There are few things customer should note when doing the LONG to LOB migration. This alter table migration statement runs 
serially in 9i. i (what about 8i,10g). Indexes need to be rebuilt and statistics recollected.

After the LONG to LOB migration, the above PL/SQL block will work without any modifications. 

Advanced LOB features may require the use of the LOB API, described in the Oracle Documentation[2] 


APPENDIX B

Migration FROM from in-line to out-of-line (and out-of-line to in-line) STORAGE
This section explains one major difference between the LOB API and LONG API methods.

If a change to the in-line LOB data makes it larger than 3964 bytes, then it is automatically moved out of table segment 
and stored out-of-line. If during future operations, the LOB data shrinks to under 3964 bytes, it will remain out-of-line.

In other words, once a LOB is migrated out, it is always stored out-of-line irrespective of its size, with the following 
exception scenario.

Consider a scenario where you used the LONG API to update the LOB datatype

[..]
begin
in_buf := utl_raw.cast_to_raw (rpad('FF', 3964, 'FF'));
insert into foo values (1, in_buf) ; 
commit;
[..]
 
Above LOB is stored in-line, update the LOB to a size more than 3964 bytes
 
[..]
in_buf := utl_raw.cast_to_raw (rpad('FF', 4500, 'FF'));
update foo set bar=buffer where pkey=1;
commit;
[..]
 
After the update LOB is stored out-of-line, now update the LOB to a size smaller than 3964 bytes
 
[..]
in_buf := utl_raw.cast_to_raw (rpad('FF', 3000, 'FF'));
update foo set bar=buffer where pkey=1;
commit;
[..]
 
LOB is stored in-line again.
 

When using the LONG API for update, the older LOB is deleted (or space is reclaimed as per RETENTION or PCTVERSION setting) 
and a new LOB is created, with a new LOB locator. This is different from using LOB API, where DML on LOB is possible only 
using the LOB locator (the LOB locator doesn?t change)

APPENDIX C 

How LOB data is stored

The purpose of this section is to differentiate how the ENABLE STORAGE IN ROW option is different from the 
DISABLE STORAGE IN ROW option for LOB data size greater than 3964 bytes. It also highlights customers when LOBINDEX 
is really used (following example scenarios assume Solaris OS and Oracle 9204 32 bit version)..

In-line LOB LOB size less than 3964 bytes
LOB can be NULL, EMPTY_BLOB, and actual LOB data

create table foo 
(
pkey number(10) not null, 
bar BLOB
)  
lob (bar) store as (enable storage in row chunk 2k);

 
declare
inbuf     raw(3964);
 
begin
inbuf := utl_raw.cast_to_raw(rpad('FF', 3964, 'FF')); 
insert into foo values (1, NULL);
insert into foo values (2, EMPTY_BLOB() );
insert into foo values (3, inbuf );
commit;
end;
/
 
note: RPAD ('-', 60, '-')==>'------------------------------------------------------------'

Now Foo table rows are:
 
Pkey=1
 Bar=0 byte (nothing is stored)
 
Pkey=2
 Bar=36 byte (10 byte metadata + 10 byte LobId + 16 byte Inode)
 
Pkey=3
Bar=4000 byte (36 byte + 3964 byte of LOB data, nothing stored in LOBINDEX and LOBSEGMENT
 

LobId - LOB Locator
 

In-line LOB ? LOB size = 3965 bytes (1 byte greater than 3964)

LOB is defined as in-line, but actual data is greater than 3964 bytes, so moved out ? please note this is different from 
LOB being defined as out-of-line.

[..]
inbuf := utl_raw.cast_to_raw(rpad('FF', 3965, 'FF'));
insert into foo values (4, inbuf );
[..]
 
Foo table row
Pkey=4
 Bar=40 bytes (36 byte + 4 byte for one chunk RDBA). Using this RDBA, we directly access LOB data in LOBSEGMENT. 
Nothing stored in LOBINDEX
 
RDBA ? Relative Database Block Address
 
In-line LOB ? LOB size greater than 12 chunk addresses 

With in-line LOB option, we store the first 12 chunk addresses in the table row. This takes 84 bytes (36+4*12) of size 
in table row. LOBs that are less than 12 chunks in size will not have entries in the LOBINDEX if 
ENABLE STORAGE IN ROW is used

[..]
inbuf := utl_raw.cast_to_raw(rpad('FF', 32767, 'FF'));
insert into foo values (5, inbuf );
[..]
 
Here, we are inserting 32767 bytes of LOB data, given our chunk
size of 2k, we need approximately 16 blocks (32767/2048). So we store first 12 chunk RDBAs in table row and the rest in LOBINDEX
 
Foo table row
 
Pkey=5
 Bar=84 bytes (36 byte + 4*12 byte for first 12 chunk RDBA). Using this RDBA, we directly
access 12 LOB chunks in LOBSEGMENT. Then using the LobId, we lookup LOBINDEX to get rest of the LOB chunk RDBAs.
 

Out-of-line LOBs ? All LOB sizes

With out-of-line LOB option, only LOB locator is stored in table row. Using LOB locator, we lookup LOBINDEX and find the range 
of chunk RDBAs, using this RDBAs we read LOB data from LOBSEGMENT

create table foo (pkey number(10) not null, bar BLOB)  
lob (bar) store as (disable storage in row chunk 2k);
 
[..]
inbuf := utl_raw.cast_to_raw(rpad('FF', 20, 'FF'));
insert into foo values (6, inbuf );
[..]
 
Foo table rows
Pkey=6
 Bar=20 bytes (10 byte metadata + 10 byte LobId). Please note Inode and chunk RDBAs are stored in LOBINDEX. 
 

LOB Performance Guidelines

April 2004

Author: V. Jegraj (Vinayagam.Djegaradjane)


Acknowledgements: Vishy Karra, Krishna Kunchithapadam, Cecilia Gervasio

 
Oracle Corporation
World Headquarters
500 Oracle Parkway
Redwood Shores, CA 94065
U.S.A.


Worldwide Inquiries:
Phone: +1.650.506.7000
Fax: +1.650.506.7200
www.oracle.com


Copyright ? 2004 Oracle Corporation

All rights reserved.


--------------------------------------------------------------------------------

[1] In Oracle8i, users can specify storage parameters for LOB index, but from Oracle9i Database onwards, 
    specifying storage parameters for a LOB index is ignored without any error and the index is stored 
    in the same tablespace as the LOB segment, with an Oracle generated index name.

[2] Large Objects (LOBs) in Oracle9i Application Developer's Guide, DBMS_LOB package in Oracle9i 
    Supplied PL/SQL Packages and Types Reference, LOB and FILE Operations in Oracle Call Interface Programmer?s guide

.  

--------------------------------------------------------------------------------
 
 Copyright � 2005, Oracle. All rights reserved. Legal Notices and Terms of Use. 


Note 7:
=======

 
Doc ID: 	Note:1071540.6	Content Type: 	TEXT/PLAIN	   
Subject: 	Converting a Long datatype to Clob in Oracle8i?	Creation Date: 	27-MAY-1999	   
Type: 	BULLETIN	Last Revision Date: 	24-JUN-2004	   
Status: 	PUBLISHED		 
PURPOSE  
  This note describes the Oracle 8.1.x function that converts data stored in  
  LONG and LONG RAW datatypes to CLOB and BLOB datatypes respectively.  This  
  is done using the TO_LOB function.  
  
 
Converting a long datatype to a Clob: 
========================================= 
  
The TO_LOB function is provided in Oracle 8.1.x to convert LONG and LONG RAW 
datatypes to CLOB and BLOB datatypes respectively. 
 
Note: The TO_LOB function is not provided in Oracle 8.0.x. 
 
Oracle recommends that long datatypes be converted to CLOBs, NCLOB or BLOBs.   
 
Note: When a LOB is stored in a table, the data (LOB VALUE) and a pointer to 
the data called a LOB LOCATOR, are stored separately.  The data may be stored 
along with the locator in the table itself or in a separate table.  The LOB 
clause in the create table command can be used to specify whether an attempt 
should be made to store data in the main table or a separate one.  The LOB 
clause may also be used to specify a separate tablespace and storage clause 
for both the LOB table and its associated index. 
 
 
Example: 
 
SQL> create table long_data (c1 number, c2 long); 
 
Table created. 
 
SQL> desc long_data 
 Name                            Null?    Type 
 ------------------------------- -------- ---- 
 C1                                       NUMBER 
 C2                                       LONG 
 
SQL> insert into long_data values 
  2  (1, 'This is some long data to be migrated to a CLOB'); 
 
1 row created. 
 
 
Note: The TO_LOB function may be used in CREATE TABLE AS SELECT or 
      INSERT...SELECT statements: 
 
Example: 
 
SQL> create table test_lobs 
  2  (c1 number, c2 clob); 
 
Table created. 
 
SQL> desc test_lobs 
 Name                            Null?    Type 
 ------------------------------- -------- ---- 
 C1                                       NUMBER 
 C2                                       CLOB 
 
SQL> insert into test_lobs 
  2  select c1, to_lob(c2) from long_data; 
 
1 row created. 
 
SQL> select c2 from test_lobs; 
 
C2 
----------------------------------------------- 
This is some long data to be migrated to a CLOB 
 
  
References:  
===========  
 
Oracle8i SQL Reference Volume 1 
[NOTE:66046.1]  Oracle8i: LOBs 
  
 
30.2 How to access LOB data:
============================


30.2.1 SQL DML:
---------------

Using SQL DML for Basic Operations on LOBs
SQL DML provides basic operations -- INSERT, UPDATE, SELECT, DELETE -- that let you make changes 
to the entire values of internal LOBs within the Oracle ORDBMS. To work with parts of internal LOBs, 
you will need to use one of the interfaces that have been developed to handle more complex requirements. 

Oracle8 supports read-only operations on external LOBs. So if you need to update/write to external LOBs, 
you will have to develop client side applications suited to your needs 

Suppose you have the following table:

create table multimedia_tab
(
clip_id number,
story       clob,
flsub       nclob,
photo       bfile,
frame       blob,
sound       blob,
voiced_ref  voiced_type,
inseg_ntab  inseg_type,
music       bfile,
map_obj     map_typ
);


create table multimedia_tab
(
clip_id number,
story       clob,
flsub       nclob,
photo       bfile,
frame       blob,
sound       blob,
music       bfile
);

The following INSERT statement populates story with the character string 'JFK interview', 
sets flsub, frame and sound to an empty value, sets photo to NULL, and 
initializes music to point to the file 'JFK_interview' located under the logical directory 
'AUDIO_DIR' (see the CREATE DIRECTORY command in the Oracle8i Reference. Character strings are inserted 
using the default character set for the instance. 

INSERT INTO Multimedia_tab 
VALUES (101, 'JFK interview', EMPTY_CLOB(), NULL,
    EMPTY_BLOB(), EMPTY_BLOB(), NULL, NULL, 
    BFILENAME('AUDIO_DIR', 'JFK_interview'), NULL);


Similarly, the LOB attributes for the Map_typ column in Multimedia_tab can be initialized to NULL 
or set to empty as shown below. Note that you cannot initialize a LOB object attribute with a literal. 

INSERT INTO Multimedia_tab 
VALUES (1, EMPTY_CLOB(), EMPTY_CLOB(), NULL, EMPTY_BLOB(), 
          EMPTY_BLOB(), NULL, NULL, NULL, 
          Map_typ('Moon Mountain', 23, 34, 45, 56, EMPTY_BLOB(), NULL);


SELECTing a LOB
Performing a SELECT on a LOB returns the locator instead of the LOB value. In the following PL/SQL fragment 
you select the LOB locator for story and place it in the PL/SQL locator variable Image1 defined 
in the program block. When you use PL/SQL DBMS_LOB functions to manipulate the LOB value, you refer 
to the LOB using the locator. 


DECLARE
    Image1       BLOB;
    ImageNum     INTEGER := 101;
BEGIN
    SELECT story INTO Image1 FROM Multimedia_tab
        WHERE clip_id = ImageNum;
    DBMS_OUTPUT.PUT_LINE('Size of the Image is: ' ||
        DBMS_LOB.GETLENGTH(Image1));
    /* more LOB routines */
END;


DECLARE
    Image1       BLOB;
    ImageNum     INTEGER := 101;
BEGIN
    SELECT content INTO Image1 FROM binaries2
        WHERE id = 1211;
    DBMS_OUTPUT.PUT_LINE('Size of the Image is: ' ||
        DBMS_LOB.GETLENGTH(Image1));
    /* more LOB routines */
END;
/

XXX So you can retrieve all kinds of info with DBMS_LOB


30.2.2 The EMPTY_BLOB and EMPTY_CLOB functions:
-----------------------------------------------

The EMPTY_BLOB function returns an empty locator of type BLOB (binary large object). 
The specification for the EMPTY_BLOB function is: 

FUNCTION EMPTY_BLOB RETURN BLOB;
You can call this function without any parentheses or with an empty pair. Here are some examples: 

INSERT INTO family_member (name, photo)
   VALUES ('Steven Feuerstein', EMPTY_BLOB());

DECLARE
   my_photo BLOB := EMPTY_BLOB;
BEGIN

Use EMPTY_BLOB to initialize a BLOB to "empty." Before you can work with a BLOB, either to reference it 
in SQL DML statements such as INSERTs or to assign it a value in PL/SQL, it must contain a locator. 
It cannot be NULL. The locator might point to an empty BLOB value, but it will be a valid BLOB locator. 


The EMPTY_CLOB function returns an empty locator of type CLOB. The specification for the EMPTY_CLOB function is: 

FUNCTION EMPTY_CLOB RETURN CLOB;
You can call this function without any parentheses or with an empty pair. Here are some examples: 

INSERT INTO diary (entry, text) 
VALUES (SYSDATE, EMPTY_CLOB());

DECLARE
   the_big_novel CLOB := EMPTY_CLOB;
BEGIN

Use EMPTY_CLOB to initialize a CLOB to "empty". Before you can work with a CLOB, either to reference it 
in SQL DML statements such as INSERTs or to assign it a value in PL/SQL, it must contain a locator. 
It cannot be NULL. The locator might point to an empty CLOB value, but it will be a valid CLOB locator. 


30.2.3 DBMS_LOB
---------------

Simple example to get the length of a lob:

DECLARE
    Image1       BLOB;
    ImageNum     INTEGER := 101;
BEGIN
    SELECT content INTO Image1 FROM binaries2
        WHERE id = 1211;
    DBMS_OUTPUT.PUT_LINE('Size of the Image is: ' ||
        DBMS_LOB.GETLENGTH(Image1));
    /* more LOB routines */
END;
/


DBMS_LOB 
The DBMS_LOB package provides subprograms to operate on BLOBs, CLOBs, NCLOBs, BFILEs, and temporary LOBs. 
You can use DBMS_LOB to access and manipulation specific parts of a LOB or complete LOBs. 

DBMS_LOB can read and modify BLOBs, CLOBs, and NCLOBs; it provides read-only operations for BFILEs. 
The bulk of the LOB operations are provided by this package. 

Example:

Load Text Files to CLOB then Write Back Out to Disk - (PL/SQL) 

Overview 

The following example is part of the Oracle LOB Examples Collection. 
This example provides two PL/SQL procedures that demonstrate how to populate a CLOB column with 
a text file (an XML file) then write it back out to the file system as a different file name. 


- Load_CLOB_From_XML_File: 

  This PL/SQL procedure loads an XML file on disk to a CLOB column using a BFILE reference variable. 
  Notice that I use the new PL/SQL procedure DBMS_LOB.LoadCLOBFromFile(), introduced in Oracle 9.2, 
  that handles uploading to a multi-byte UNICODE database. 

- Write_CLOB_To_XML_File: 

  This PL/SQL procedure writes the contents of the CLOB column in the database piecewise 
  back to the file system. 


Let's first take a look at an example XML file: 

DatabaseInventoryBig.xml:

  <?xml version="1.0" ?> 
  <!DOCTYPE DatabaseInventory (View Source for full doctype...)> 
- <DatabaseInventory>
- <DatabaseName>
  <GlobalDatabaseName>production.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>production</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <DatabaseAttributes Type="Production" Version="9i" /> 
  <Comments>The following database should be considered the most stable for up-to-date data. The backup strategy includes running the database in Archive Log Mode and performing nightly backups. All new accounts need to be approved by the DBA Group before being created.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>development.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>development</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <DatabaseAttributes Type="Development" Version="9i" /> 
  <Comments>The following database should contain all hosted applications. Production data will be exported on a weekly basis to ensure all development environments have stable and current data.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing1.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing1</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host more than half of the testing for our hosting environment.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing2.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing2</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the HR department only.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing3.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing3</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the Finance department only.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing4.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing4</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the HQ department only.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing5.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing5</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the Engineering department only.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing6.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing6</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the IT department only.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing7.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing7</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the Marketing department only.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing8.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing8</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the Purchasing department only.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing9.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing9</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the Accounts Payable department only.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing10.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing10</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing OEM.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing11.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing11</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing XMLDB.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing12.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing12</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for tuning.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing13.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing13</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for UAT.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing14.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing14</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for additional monitoring.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing15.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing15</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing upgrades.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing16.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing16</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for certification tesing.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing17.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing17</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing18.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing18</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing19.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing19</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing20.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing20</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing21.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing21</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing22.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing22</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing23.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing23</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
+ <DatabaseName>
  <GlobalDatabaseName>testing24.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing24</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
+ <DatabaseName>
  <GlobalDatabaseName>testing25.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing25</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
+ <DatabaseName>
  <GlobalDatabaseName>testing26.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing26</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
+ <DatabaseName>
  <GlobalDatabaseName>testing27.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing27</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
+ <DatabaseName>
  <GlobalDatabaseName>testing28.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing28</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing29.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing29</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing30.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing30</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing31.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing31</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing32.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing32</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing33.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing33</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing34.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing34</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing35.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing35</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing36.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing36</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing37.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing37</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing38.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing38</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing39.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing39</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing40.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing40</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing41.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing41</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing42.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing42</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing43.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing43</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing44.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing44</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing45.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing45</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing46.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing46</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing47.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing47</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing48.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing48</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing49.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing49</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing50.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing50</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing51.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing51</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing52.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing52</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing53.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing53</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing54.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing54</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing55.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing55</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing56.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing56</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing57.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing57</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing58.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing58</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing59.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing59</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the DBA department for testing of all ERP application modules.</Comments> 
  </DatabaseName>
- <DatabaseName>
  <GlobalDatabaseName>testing60.iDevelopment.info</GlobalDatabaseName> 
  <OracleSID>testing60</OracleSID> 
  <DatabaseDomain>iDevelopment.info</DatabaseDomain> 
  <Administrator EmailAlias="jhunter" Extension="6007">Jeffrey Hunter</Administrator> 
  <Administrator EmailAlias="mhunter" Extension="6008">Melody Hunter</Administrator> 
  <Administrator EmailAlias="ahunter">Alex Hunter</Administrator> 
  <DatabaseAttributes Type="Testing" Version="9i" /> 
  <Comments>The following database will host a testing database to be used by the Sales Force Automation department.</Comments> 
  </DatabaseName>
  </DatabaseInventory 

After downloading the above XML file, create all Oracle database objects: 


DROP TABLE test_clob CASCADE CONSTRAINTS
/

Table dropped.


CREATE TABLE test_clob (
      id           NUMBER(15)
    , file_name    VARCHAR2(1000)
    , xml_file     CLOB
    , timestamp    DATE
)
/

Table created.


CREATE OR REPLACE DIRECTORY EXAMPLE_LOB_DIR AS '/u01/app/oracle/lobs'
/

Directory created.

Now, let's define our two example procedures: 

CREATE OR REPLACE PROCEDURE Load_CLOB_From_XML_File
IS

    dest_clob   CLOB;
    src_clob    BFILE  := BFILENAME('EXAMPLE_LOB_DIR', 'DatabaseInventoryBig.xml');
    dst_offset  number := 1 ;
    src_offset  number := 1 ;
    lang_ctx    number := DBMS_LOB.DEFAULT_LANG_CTX;
    warning     number;

BEGIN

    DBMS_OUTPUT.ENABLE(100000);

    -- -----------------------------------------------------------------------
    -- THE FOLLOWING BLOCK OF CODE WILL ATTEMPT TO INSERT / WRITE THE CONTENTS
    -- OF AN XML FILE TO A CLOB COLUMN. IN THIS CASE, I WILL USE THE NEW 
    -- DBMS_LOB.LoadCLOBFromFile() API WHICH *DOES* SUPPORT MULTI-BYTE
    -- CHARACTER SET DATA. IF YOU ARE NOT USING ORACLE 9iR2 AND/OR DO NOT NEED
    -- TO SUPPORT LOADING TO A MULTI-BYTE CHARACTER SET DATABASE, USE THE
    -- FOLLOWING FOR LOADING FROM A FILE:
    -- 
    --     DBMS_LOB.LoadFromFile(
    --         DEST_LOB => dest_clob
    --       , SRC_LOB  => src_clob
    --       , AMOUNT   => DBMS_LOB.GETLENGTH(src_clob)
    --     );
    --
    -- -----------------------------------------------------------------------

    INSERT INTO test_clob(id, file_name, xml_file, timestamp) 
        VALUES(1001, 'DatabaseInventoryBig.xml', empty_clob(), sysdate)
        RETURNING xml_file INTO dest_clob;


    -- -------------------------------------
    -- OPENING THE SOURCE BFILE IS MANDATORY
    -- -------------------------------------
    DBMS_LOB.OPEN(src_clob, DBMS_LOB.LOB_READONLY);

    DBMS_LOB.LoadCLOBFromFile(
          DEST_LOB     => dest_clob
        , SRC_BFILE    => src_clob
        , AMOUNT       => DBMS_LOB.GETLENGTH(src_clob)
        , DEST_OFFSET  => dst_offset
        , SRC_OFFSET   => src_offset
        , BFILE_CSID   => DBMS_LOB.DEFAULT_CSID
        , LANG_CONTEXT => lang_ctx
        , WARNING      => warning
    );

    DBMS_LOB.CLOSE(src_clob);

    COMMIT;

    DBMS_OUTPUT.PUT_LINE('Loaded XML File using DBMS_LOB.LoadCLOBFromFile: (ID=1001).');

END;
/

SQL> @load_clob_from_xml_file.sql

Procedure created.


CREATE OR REPLACE PROCEDURE Write_CLOB_To_XML_File
IS

  clob_loc          CLOB;
  buffer            VARCHAR2(32767);
  buffer_size       CONSTANT BINARY_INTEGER := 32767;
  amount            BINARY_INTEGER;
  offset            NUMBER(38);

  file_handle       UTL_FILE.FILE_TYPE;
  directory_name    CONSTANT VARCHAR2(80) := 'EXAMPLE_LOB_DIR';
  new_xml_filename  CONSTANT VARCHAR2(80) := 'DatabaseInventoryBig_2.xml';

BEGIN

    DBMS_OUTPUT.ENABLE(100000);

    -- ----------------
    -- GET CLOB LOCATOR
    -- ----------------
    SELECT xml_file INTO clob_loc
    FROM   test_clob
    WHERE  id = 1001;


    -- --------------------------------
    -- OPEN NEW XML FILE IN WRITE MODE
    -- --------------------------------
    file_handle := UTL_FILE.FOPEN(
        location     => directory_name,
        filename     => new_xml_filename,
        open_mode    => 'w',
        max_linesize => buffer_size);

    amount := buffer_size;
    offset := 1;

    -- ----------------------------------------------
    -- READ FROM CLOB XML / WRITE OUT NEW XML TO DISK
    -- ----------------------------------------------
    WHILE amount >= buffer_size
    LOOP

        DBMS_LOB.READ(
            lob_loc    => clob_loc,
            amount     => amount,
            offset     => offset,
            buffer     => buffer);

        offset := offset + amount;

        UTL_FILE.PUT(
            file      => file_handle,
            buffer    => buffer);

        UTL_FILE.FFLUSH(file => file_handle);

    END LOOP;

    UTL_FILE.FCLOSE(file => file_handle);

END;
/

SQL> @write_clob_to_xml_file.sql

Procedure created.


Now lets test it:

SQL> set serveroutput on

SQL> exec Load_CLOB_From_XML_File
Loaded XML File using DBMS_LOB.LoadCLOBFromFile: (ID=1001).

PL/SQL procedure successfully completed.


SQL> exec Write_CLOB_To_XML_File

PL/SQL procedure successfully completed.


SQL> SELECT id, DBMS_LOB.GETLENGTH(xml_file) Length FROM test_clob;

        ID     LENGTH
---------- ----------
      1001      41113


SQL> host ls -l DatabaseInventory*
-rw-r--r--   1 oracle   dba        41113 Sep 20 15:02 DatabaseInventoryBig.xml
-rw-r--r--   1 oracle   dba        41113 Sep 20 15:48 DatabaseInventoryBig_2.xml


30.2.4 REMOTE SELECTS, INSERTS, UPDATES:
----------------------------------------

Valid operations on LOB columns in remote tables include: 

CREATE TABLE as select * from table1@remote_site; 
INSERT INTO t select * from table1@remote_site; 
UPDATE t set lobcol = (select lobcol from table1@remote_site); 
INSERT INTO table1@remote... 
UPDATE table1@remote... 
DELETE table1@remote...   


30.2.5: Export a BLOB to a file with Java:
------------------------------------------


First we create a Java stored procedure that accepts a file name and a BLOB as parameters:

CREATE OR REPLACE JAVA SOURCE NAMED "BlobHandler" AS
import java.lang.*;
import java.sql.*;
import oracle.sql.*;
import java.io.*;

public class BlobHandler
{
  
  public static void ExportBlob(String myFile, BLOB myBlob) throws Exception
  {
    // Bind the image object to the database object
    // Open streams for the output file and the blob
    File binaryFile = new File(myFile);
    FileOutputStream outStream = new FileOutputStream(binaryFile);
    InputStream inStream = myBlob.getBinaryStream();

    // Get the optimum buffer size and use this to create the read/write buffer
    int size = myBlob.getBufferSize();
    byte[] buffer = new byte[size];
    int length = -1;

    // Transfer the data
    while ((length = inStream.read(buffer)) != -1)
    {
      outStream.write(buffer, 0, length);
      outStream.flush();
    }

    // Close everything down
    inStream.close();
    outStream.close();
  } 

};
/

ALTER java source "BlobHandler" compile;
show errors java source "BlobHandler"


Next we publish the Java call specification so we can access it via PL/SQL:


CREATE OR REPLACE PROCEDURE ExportBlob (p_file  IN  VARCHAR2,
                                        p_blob  IN  BLOB)
AS LANGUAGE JAVA 
NAME 'BlobHandler.ExportBlob(java.lang.String, oracle.sql.BLOB)';
/

Next we grant the Oracle JVM the relevant filesystem permissions:

EXEC Dbms_Java.Grant_Permission( -
'SCHEMA-NAME', -
'java.io.FilePermission', -
'<<ALL FILES>>', -
'read ,write, execute, delete');

Finally we can test it:

CREATE TABLE tab1 (col1 BLOB);
INSERT INTO tab1 VALUES(empty_blob());
COMMIT;                                

DECLARE
  v_blob BLOB;
BEGIN
  SELECT col1
  INTO   v_blob
  FROM   tab1;
  
  ExportBlob('c:\MyBlob',v_blob);
END;
/


30.2.6 Import into a BLOB from a file:
--------------------------------------

Import BLOB Contents
The following article presents a simple methods for importing a file into a BLOB datatype. 
First a directory object is created to point to the relevant filesystem directory:

CREATE OR REPLACE DIRECTORY images AS 'C:\';
Next we create a table to hold the BLOB:

CREATE TABLE tab1 (col1 BLOB);

Finally we import the file into a BLOB datatype and insert it into the table:


DECLARE
  v_bfile  BFILE;
  v_blob   BLOB;
BEGIN
  INSERT INTO tab1 (col1)
  VALUES (empty_blob())
  RETURN col1 INTO v_blob;

  v_bfile := BFILENAME('IMAGES', 'MyImage.gif');
  Dbms_Lob.Fileopen(v_bfile, Dbms_Lob.File_Readonly);
  Dbms_Lob.Loadfromfile(v_blob, v_bfile, Dbms_Lob.Getlength(v_bfile));
  Dbms_Lob.Fileclose(v_bfile);

  COMMIT;
END;
/
Hope this helps. Regards Tim...


30.2.7 Import into a CLOB from a file:
--------------------------------------

Import CLOB Contents
The following article presents a simple methods for importing a file into a CLOB datatype. 
First a directory object is created to point to the relevant filesystem directory:

CREATE OR REPLACE DIRECTORY documents AS 'C:\';
Next we create a table to hold the CLOB:

CREATE TABLE tab1 (col1 CLOB);

Finally we import the file into a CLOB datatype and insert it into the table:


DECLARE
  v_bfile  BFILE;
  v_clob   CLOB;
BEGIN
  INSERT INTO tab1 (col1)
  VALUES (empty_clob())
  RETURN col1 INTO v_clob;

  v_bfile := BFILENAME('DOCUMENTS', 'Sample.txt');
  Dbms_Lob.Fileopen(v_bfile, Dbms_Lob.File_Readonly);
  Dbms_Lob.Loadfromfile(v_clob, v_bfile, Dbms_Lob.Getlength(v_bfile));
  Dbms_Lob.Fileclose(v_bfile);

  COMMIT;
END;
/
Hope this helps. Regards Tim...


Note 5:
-------

You Asked (Jump to Tom's latest followup)

I have a table with a blob column.
It's possible to specify an extra
storage clause for this column ?
 
and we said...


Yes, the following example is cut and pasted from the SQL Reference Manual, the 
CREATE TABLE command:

CREATE TABLE lob_tab (col1 BLOB, col2 CLOB) STORAGE (INITIAL 512 NEXT 256)
LOB (col1, col2) STORE AS
    (TABLESPACE lob_seg_ts
       STORAGE (INITIAL 6144 NEXT 6144)
       CHUNK 4
       NOCACHE LOGGING
       INDEX (TABLESPACE lob_index_ts
       STORAGE (INITIAL 256 NEXT 256)
    )
);


The table will be stored in the users default tablespace with (INITIAL 512 NEXT 
256).  The actual lob data will be in LOB_SEG_TS with (INITIAL 6144 NEXT 6144).  
The lob index built on the pages constituting the lob will be stored in yet a 
3'rd tablespace -- lob_index_ts with (INITIAL 256 NEXT 256).
 
   Reviews      
GOTO a page to Bookmark Review | Bottom | Top 
lob storage recovery  May 07, 2004 
Reviewer:  bob  from PA 

Tom,

If the LOB tablespace is not backed up, can the table data (a different 
tablespace) be recovered in a failure scenario?

I know with TSPITR the process validates that no objects cross tablespaces that 
are not included in the set being recovered with the TSPITR check/validate 
funcion.  This doesn't mean the tablespace won't be recovered in the auxiliary 
db, it just means the automated process won't continue through to export the 
objects, and re-import unless you pass the check.   (or at least that was what 
happened in the test I ran).  

I am just curious about what would happen to this table if its lob tablespace 
was lost and non-recoverable.  can just the regular data be recovered
 

Followup:  
well, it's going to be problematic as the lob locators will point to "garbage".  
You cannot really TSPITR a table with lobs without doing the *same* to the lob 
segments.

You'd have to sort of update the lobs to NULL and pull it manually -- but then I 
would ask "why have the lobs in the first place, must not be very important"?

so yes, we'd be able to get the scalar data back (complete recovery would be 
best here), update the lob to null and go forward with that. 
 
 
30.3 Errors in LOB:
===================

30.3.1:
-------

Doc ID:  Note:293515.1 
Subject:  ORA-1578 ORA-26040 in a LOB segment - Script to solve the errors 
Type:  PROBLEM 
Status:  PUBLISHED 
 Content Type:  TEXT/X-HTML 
Creation Date:  09-DEC-2004 
Last Revision Date:  25-FEB-2005 
 

Purpose
============
- The purpose of this article is to provide a script to solve errors 
ORA-1578 / ORA-26040 when a lob block is accessed by a sql statement. 

- Note that the data inside the corrupted lob blocks is not salvageable.  
This procedure will update the lob column with an empty lob to avoid errors 
ORA-1578 / ORA-26040.

- After applying this solution dbverify would still produce error 
DBV-200 until block marked as corrupted is reused and reformatted.


Symptoms
===========
- ORA-1578 and ORA-26040 are produced when accesing a lob column in a table:

ORA-1578 : ORACLE data block corrupted (file # %s, block # %s)
ORA-26040: Data block was loaded using the NOLOGGING option 

- dbverify for the datafile that produces the errors fails with:

DBV-00200: Block, dba <dba number>, already marked corrupted

Example:

dbv file=/oracle/oradata/data.dbf blocksize=8192

DBV-00200: Block, dba 54528484, already marked corrupted
.....


The dba can be used to get the relative file number and block number:

Relative File number:

SQL> select dbms_utility.data_block_address_file(54528484) from dual;

DBMS_UTILITY.DATA_BLOCK_ADDRESS_FILE(54528484)
----------------------------------------------
                                            13

Block Number:

SQL> select dbms_utility.data_block_address_block(54528484) from dual;

DBMS_UTILITY.DATA_BLOCK_ADDRESS_BLOCK(54528484)
-----------------------------------------------
                                           2532


Cause
==========
- LOB segment has been defined as NOLOGGING
- LOB Blocks were marked as corrupted by Oracle after a datafile restore / recovery.


Identify the table referencing the lob segment - Example
=========================================================
Error example when accessing the lob column by a sql statement:

ORA-01578 : ORACLE data block corrupted (file #13 block # 2532)
ORA-01110 : datafile 13: '/oracle/oradata/data.dbf'
ORA-26040 : Data block was loaded using the NOLOGGING option.

1. Query dba_extents to find out the lob segment name

select owner, segment_name, segment_type 
from   dba_extents
where  file_id = 13
and    2532 between block_id and block_id + blocks - 1;

In our example it returned:

owner=SCOTT
segment_name=SYS_LOB0000029815C00006$$
segment_type=LOBSEGMENT


2. Query dba_lobs to identify the table_name and lob column name:

select table_name, column_name 
from   dba_lobs
where  segment_name = 'SYS_LOB0000029815C00006$$'
and    owner = 'SCOTT';

In our example it returned:

table_name  = EMP
column_name = EMPLOYEE_ID_LOB


Fix
======

1. Identify the table rowid's referencing the corrupted lob segment blocks by 
running the following plsq script:

rem ********************* Script begins here ********************


create table corrupted_data (corrupted_rowid rowid);

set concat #
 
declare
error_1578 exception;
pragma exception_init(error_1578,-1578);
n number;
begin
  for cursor_lob in (select rowid r, &&lob_column from &table_owner.&table_with_lob) loop
    begin
      n:=dbms_lob.instr(cursor_lob.&&lob_column,hextoraw('8899')) ;
    exception
    when error_1578 then
       insert into corrupted_data values (cursor_lob.r);
       commit;
    end;
  end loop;
end;
/    
undefine lob_column 


rem ********************* Script ends here ********************


When prompted by variable values and following our example:

Enter value for lob_column: EMPLOYEE_ID_LOB
Enter value for table_owner: SCOTT
Enter value for table_with_lob: EMP            

2. Update the lob column with empty lob to avoid ORA-1578 and ORA-26040:

SQL> set concat #
SQL> update &table_owner.&table_with_lob 
     set &lob_column = empty_blob() 
     where rowid in (select corrupted_rowid from corrupted_data);

if &lob_column is a CLOB datatype, replace empty_blob by empty_clob.


Reference
==============
Note 290161.1 - The Gains and Pains of Nologging Operations


30.3.2:
-------

Displayed below are the messages of the selected thread. 


Thread Status: Closed 

From: Neil Bullen 26-Mar-02 08:26 
Subject: How do you alter NOLOGGING in lob index partition 

RDBMS Version: 8.1.7.2.1
Operating System and Version: Compaq Tru64 Unix 5.2
Error Number (if applicable): 
Product (i.e. SQL*Loader, Import, etc.): 
Product Version: 

How do you alter NOLOGGING in lob index partition

I have discovered that a lob index partition is set to NOLOGGING, how can I alter this to LOGGING. 

The lob is set to CACHE and LOGGING, the index def_logging is set to NONE 
and the tablespace is set to LOGGING. 

--------------------------------------------------------------------------------

From: Oracle, Rowena Serna 02-Apr-02 03:26 
Subject: Re : How do you alter NOLOGGING in lob index partition 


You could find the system generated lobindex name and use the "alter index" command. 

Regards, 
Rowena Serna 
Oracle Corporation 


-------------------------------------------------------------------------------

From: Neil Bullen 03-Apr-02 23:42 
Subject: Re : How do you alter NOLOGGING in lob index partition 

Using alter index on a lob segment index results in error ORA-22864 cannot ALTER or DROP LOB indexes, 
the solution I found was to alter the lob caching setting, even though dba_lobs showed the CACHE and LOGGING 
settings to be 'YES' by issuing the ALTER TABLE <tablename> MODIFY LOB(<lobname>) (CACHE); command 
all partitions of the associated index were changed to LOGGING. What threw me was the CACHE and LOGGING settings 
in dba_lobs already being set correctly, however resetting these again was the key. 

--------------------------------------------------------------------------------

From: Oracle, Rowena Serna 09-Apr-02 02:46 
Subject: Re : How do you alter NOLOGGING in lob index partition 


Thanks for updating. 

Regards, 
Rowena Serna 
Oracle Corporation 


30.3.4 exp/imp errors and LOBS:
-------------------------------

Note 1:
-------

Doc ID:  Note:48023.1 
Subject:  OERR: IMP 64 Definition of LOB was truncated by export 
Type:  REFERENCE 
Status:  PUBLISHED 
 Content Type:  TEXT/PLAIN 
Creation Date:  07-NOV-1997 
Last Revision Date:  26-MAR-2001 
 

Error:  IMP 64 Text:   Definition of LOB was truncated by export  
--------------------------------------------------------------------------- 
Cause:  While producing the dump file, Export was unable to write the * entire          
contents of a LOB. Import is therefore unable to * reconstruct the          
contents of the LOB. The remainder of the * import of the current table          
will be skipped.  Action: Delete the offending row in the exported database and retry the * 
export. 
. 

Note 2:
-------

An export or import of a table with a Large Object (LOB) column, 
has a slower performance than an export or import of a table without LOB 
columns: 
-- create two tables: TESTTAB1 with a VARCHAR2 column, and TESTTAB2 with a 
-- CLOB column: 
connect / as sysdba 
create table scott.testtab1 (nr number, txt varchar2(2000)); 
create table scott.testtab2 (nr number, txt clob); 
-- populate both tables with the same 500,000 rows: 
declare 
x varchar2(50); 
begin 
for i in 1..500000 loop 
x := 'This is a line with the number: ' || i; 
insert into scott.testtab1 values(i,x); 
insert into scott.testtab2 values(i,x); 
commit; 
end loop; 
end; 
/ 
-- export both tables: 
% exp system/manager file=exp_testtab1.dmp tables=scott.testtab1 direct=y 
% exp system/manager file=exp_testtab1a.dmp tables=scott.testtab1 
% exp system/manager file=exp_testtab2.dmp tables=scott.testtab2 

No CLOB No CLOB With CLOB 
DIRECT CONVENTIONAL column 
------------ ------------ ------------ 
8.1.7.4.0 0:13 0:20 7:49 
9.2.0.4.0 0:14 0:18 7:37 
9.2.0.5.0 0:12 0:15 7:03 
10.1.0.2.0 0:16 0:31 7:15 


Note 3:
-------

Doc ID: 	Note:157024.1	Content Type: 	TEXT/X-HTML	   
Subject: 	Insert/Import of Table with Lob Fails IMP-00003 ORA-3237	
Creation Date: 	24-MAY-2001	   
Type: 	PROBLEM	Last Revision Date: 	21-OCT-2003	   
Status: 	PUBLISHED		 


fact: Oracle Server - Enterprise Edition
fact: Import Utility (IMP)
symptom: Import fails with error
symptom: Insert fails
symptom: Table with LOB column
symptom: Locally managed tablespace
symptom: IMP-00003: ORACLE error %lu encountered
symptom: IMP-00017: following statement failed with ORACLE error %lu:
symptom: ORA-03237: Initial Extent of specified size cannot be allocated
cause: Extent size specified for the tablespace is not large enough.

fix:

For LOBS, ensure that the extent size specification in the tablespace is least 
three times the db_block_size.

For example:
If the db_block_size is 8192, then the extent size for the tablespace should be 
at least 24576.

Explaination:
Certain objects may require larger extents by virtue of how they are built 
internally (Example: an RBS requires at least four blocks and a LOB at least 
three).

References:
<Bug:1186625>
SQL Reference Guide, Create Tablespace


Note 4:
-------

 
Doc ID: 	Note:211721.1	Content Type: 	TEXT/X-HTML	   
Subject: 	Unable to Import Tables with LOB Columns	Creation Date: 	13-SEP-2002	   
Type: 	PROBLEM	Last Revision Date: 	03-OCT-2003	   
Status: 	PUBLISHED		 


fact: Oracle Server - Enterprise Edition 9+-
fact: Oracle Server - Enterprise Edition 8.1
fact: Oracle Server - Enterprise Edition 8
fact: Import Utility (IMP)
symptom: Import fails
symptom: ORA-01658: unable to create INITIAL extent for segment in 
tablespace '%s'
symptom: ORA-01652: unable to extend temp segment by %s in tablespace %s
symptom: Table contains LOB column
symptom: Problem does not occur for tables without LOB columns

cause: No LOB storage specifications were specified on the table creation 
for those tables with LOB columns. LOB data is stored both within and outwith 
the table depending on how much data the column contains.

A new database was created and the data reimported into a tablespace with 
1.7GB default initial extent size. The LOB storage outwith the table defaults 
to the initial extent of the tablespace and this storage requirement could not 
be fulfilled.

fix:

As a user with dba privileges issue

alter tablespace <tablespace_name> default storage (initial <x>M)&
#059;

where <tablespace_name> and <x> are replaced with appropriate 
values.

See also :
Note:1074731.6 ORA-01658 During 'Create Table' Statement 


Note 5:
-------

Doc ID:  Note:197699.1 
Subject:  "IMP-00003 ORA-00959 ON IMPORT OF TABLE WITH CLOB DATATYPES" 
Type:  PROBLEM 
Status:  PUBLISHED 
 Content Type:  TEXT/PLAIN 
Creation Date:  31-MAY-2002 
Last Revision Date:  29-AUG-2002 
 

Problem Description 
-------------------  
You are attempting to import a table that has CLOB datatype and you receive the following errors:
      IMP-00003: ORACLE error 959 encountered       
ORA-00959: tablespace <tablespace_name> does not exist     
Solution Description 
-------------------- 
Create the table that has CLOB datatypes before the import, specifying tablespaces  
that exist on the target system, and import using IGNORE=Y.   
Here is a simple example where you can get this problem and how to resolve it:  
I have a user "TEST"  with default tablespace has "USERS"  
Step-1: Create tst Tablespace 
=================================  
SQL>  create tablespace tst datafile 'c:\temp\tst1.dbf' size 5m;  Tablespace created.   
Step-2: Create table with CLOB datatype by login to "TEST" user 
=================================================================  
SQL> CREATE TABLE "TEST"."PX2000" ("ID" NUMBER(*,0), "SUBMITDATE" DATE,       
"COMMENTS" VARCHAR2(4000),"RECOMMENDEDTIMELONG" CLOB)        
PCTFREE 10 PCTUSED 40  INITRANS 1 MAXTRANS 255        
STORAGE(INITIAL 65536 FREELISTS 1 FREELIST GROUPS 1)       
TABLESPACE "TST" LOGGING LOB ("RECOMMENDEDTIMELONG")       
STORE AS (TABLESPACE "TST" ENABLE STORAGE IN ROW CHUNK 8192       
PCTVERSION 10 NOCACHE        
STORAGE(INITIAL 65536 FREELISTS 1 FREELIST GROUPS 1)) ;   

SQL> select table_name,tablespace_name from user_tables   
2  where table_name='PX2000';  

TABLE_NAME                     TABLESPACE_NAME 
------------------------------ ------------------------------ 
PX2000                         TST  

SQL> select username,default_tablespace from user_users;  
USERNAME                       DEFAULT_TABLESPACE 
------------------------------ ------------------------------ 
TEST                            USERS   

Step-3: Export the Table 
=========================  	
exp test/test file=px2000.dmp tables=px2000 . . 
exporting table                         PX2000          
0 rows exported   

Step-4: Drop the "TST" tablespace including contents:
 Please note that 'AND datafiles' is a new option in version 9i.  
Omit this clause if running version prior to 9i.  
============================================================  
SQL> drop tablespace tst including contents and datafiles;  
Tablespace dropped.   
Step-5: Import the table back to test schema 
==============================================       
imp test/test file=px2000.dmp tables=px2000      
IMP-00017: following statement failed with ORACLE error 959:    
"CREATE TABLE "PX2000" ("ID" NUMBER(*,0), "SUBMITDATE" DATE, "COMMENTS" 
VARC"    "HAR2(4000), "RECOMMENDEDTIMELONG" CLOB)  PCTFREE 10 PCTUSED 40 
INITRANS 1 M"    "AXTRANS 255 STORAGE(INITIAL 65536 FREELISTS 1 FREELIST GROUPS 1) 
TABLESPACE"    " "TST" LOGGING LOB ("RECOMMENDEDTIMELONG") 
STORE AS  (TABLESPACE "TST" ENAB"    "LE STORAGE IN ROW CHUNK 8192 
PCTVERSION 10 NOCACHE  STORAGE(INITIAL 65536 F"    "REELISTS 1 FREELIST GROUPS 1))"    
IMP-00003: ORACLE error 959 encountered    ORA-00959: tablespace 'TST' does not exist    
Import terminated successfully with warnings.  

Step-6: Workaround is to extract the DDL from the dumpfile,change the tablespace 
to target database. Create the table manually and import with ignore=y option
 ==================================================================================     
% imp test/test file=px2000.dmp full=y show=y log=<logFile>      

Step-7: Use the logFile to pre-create the table, then ignore object creation errors.  
====================================================================================    
% imp test/test file=px2000.dmp full=y ignore=y   

Explanation 
-----------  
For most of the DDL's (except for Partitioned tables,tables without CLOB  datatypes), i
mport will automatically create the objects to the users default tablespace if the 
specified tablespace does not exist. DDL's with tables with CLOB datatypes and partitioned tables
an IMP-00003 and ORA-00959 will result if the tablespace does not exists in target database.
      References ---------- [NOTE:1058330.6]  
"IMP-00003 ORA-00959 ON IMPORT OF PARTITIONED TABLE" [BUG:1982168] 
"IMP-3 / ORA-959 importing table with CLOB using IGNORE=Y into variable width charset DB" 
[BUG:2398272] "IMPORT TABLE WITH CLOB DATATYPE FAILS WITH IMP-3 AND ORA-959"  
Oracle Utilites Manual 
.  

Note 6:
-------

Displayed below are the messages of the selected thread. 
Thread Status: Closed 
From: Helmut Daiminger 12-Dec-00 21:50 
Subject: MOVE table with LOB column to another tablespace 

RDBMS Version: 8.1.6.1.2
Operating System and Version: Win2k, SP1
Error Number (if applicable): 
Product (i.e. SQL*Loader, Import, etc.): 
Product Version: 

MOVE table with LOB column to another tablespace

Hi! 

I'm having a problem here: I want to move a table with a LOB column (i.e. LOB index segment) 
to a different tablespace. In the beginning the table and the LOB segment were in the USERS 
tablespace. 
I then exported the table using the EXP tool. Then I revoked the user's quota to the 
USERS tablespace and only gave him quota on the default tablespace. 

Then I run IMP and import that LOB-table. The table gets recreated in the new tablespace, 
but the creation of the LOB index fails with an error message that I don't have privileges 
to write to the USERS tablespace. 

How do I completey move the table and the LOB index segment to a new tablespace? 

This is 8.1.6 on Windows 2000 Server. 

Thanks, 
Helmut 


From: Oracle, Ken Robinson 14-Dec-00 21:05 
Subject: Re : MOVE table with LOB column to another tablespace 

I believe you can do the following: 

ALTER TABLE foo MOVE 
TABLESPACE new_tbsp STORAGE(new_storage) 
LOB (lobcol) STORE AS lobsegment 
(TABLESPACE new_tbsp STORAGE (new_storage)); 

Regards, 
Ken Robinson 
Oracle Server EE Analyst 

Note 7.
-------

 
Doc ID: 	Note:176898.1	Content Type: 	TEXT/X-HTML	   
Subject: 	Import Fails with IMP-00032 and IMP-00008	Creation Date: 	15-FEB-2002	   
Type: 	PROBLEM	Last Revision Date: 	24-JUN-2003	   
Status: 	PUBLISHED		 


fact: Oracle Server - Enterprise Edition
fact: Import Utility (IMP)
symptom: IMP-00032: SQL statement exceeded buffer length
symptom: IMP-00008: unrecognized statement in the export file

cause: The insert statement run when importing exceeds the default or 
specified buffer size. 
 
For import of tables containing LONG, LOB, BFILE, REF, ROWID, LOGICALROWID 
or type columns, rows are inserted individually. The size of the buffer must be 
large enough to contain the entire row inserted.


fix:

Increase the buffer size, and make sure that it is big enough to contain the 
biggest row in the table(s) imported.
For example: imp system/manager file=test.dmp full=y log=test.log buffer=
10000000

Note 8:
-------

For tables with LOB columns, make sure the tablespace already exists in the
target database before the import is done.
Also, make sure the extent size is large enough.

Note 9:
-------

With imp/exp I hit a problem that on remote database users tablespace is called 
'users', while on local it's 'users_data'. Now I have to go to documentation to 
figure out if those stupid switches would save the day...

Also with schlobs the elegant
insert into t2 select * from t1@remote_db_link;
doesn't work.

I wonder why export/import is not plain sqlplus statements where I can just 
specify the right 'where' clause...
 

Followup:  
Yes, when you deal with multi segment objects (tables with LOBS, partitioned 
table, IOTs with overflows for example), using EXP/IMP is complicated if the 
target database doesn't have the same tablespace structure.  That is because the 
CREATE statement contains many tablespaces and IMP will only "rewrite" the first 
TABLESPACE in it (it will not put multi-tablespace objects into a single 
tablespace, the object creation will fail of the tablespaces needed by that 
create do not exist).

I dealt with this issue in my book,  in there, I recommend you do an:

imp .... full=y indexfile=temp.sql

In temp.sql, you will have all of the DDL for indexes and tables.  Simply delete 
all index creates and uncomment any table creates you want.  Then, you can 
specify the tablespaces for the various components -- precreate the objects and 
run imp with ignore=y.  The objects will now be populated.


You are incorrect with the "schlobs" comment (both in spelling and in 
conclusion).

scott@ORA815.US.ORACLE.COM> create table t ( a int, b blob );

Table created.

scott@ORA815.US.ORACLE.COM> desc t
 Name                                Null?    Type
 ----------------------------------- -------- ------------------------
 A                                            NUMBER(38)
 B                                            BLOB

scott@ORA815.US.ORACLE.COM> select a, dbms_lob.getlength(b) from t;

no rows selected

scott@ORA815.US.ORACLE.COM> insert into t select x, y from t@ora8i.world;

1 row created.

scott@ORA815.US.ORACLE.COM> select a, dbms_lob.getlength(b) from t;

         A DBMS_LOB.GETLENGTH(B)
---------- ---------------------
         1               1000011

So, the "elegant insert into select * from" does work.

imp/exp can be plain sqlplus statements -- use indexfile=y (if you get my book, 
I use this over and over in various places to get the DDL).  In 9i, there is a 
simple stored procedure interface as well. 
 

Note 10:
--------

Tom
Without using the export import( show=y) Is there any query to find out in which 
Tablespace the LOB column is stored
Thanks in advance 


Followup:  
select * from user_segments

you can join user_segments to user_lobs if you like as well.


user_segments will give you tablespace info.
user_lobs will give you the lob segment name. 
 

Note 11:
--------

IMP-00003 ORACLE error number encountered

Cause: Import encountered the referenced Oracle error. 

Action: Look up the Oracle message in the ORA message chapters of this manual, 
and take appropriate action.


IMP-00020 long column too large for column buffer size (number)

Cause: The column buffer is too small. This usually occurs when importing LONG data. 

Action: Increase the insert buffer size 10,000 bytes at a time (for example). 
Use this step-by-step approach because a buffer size that is too large may cause a similar problem. 


IMP-00064 Definition of LOB was truncated by export

Cause: While producing the dump file, Export was unable to write the entire contents of a LOB. 
Import is therefore unable to reconstruct the contents of the LOB. The remainder of the import 
of the current table will be skipped.

Action: Delete the offending row in the exported database and retry the export.


IMP-00070 Lob definitions in dump file are inconsistent with database.

Cause: The number of LOBS per row in the dump file is different than the number of LOBS per row 
in the table being populated.

Action: Modify the table being imported so that it matches the column attribute layout 
of the table that was exported.

Note 12:
--------

we have a 10 Mill rows table with a BLOB column in it
the size of the lob varies: from 1K up ward to a few megabytes, but most are in the 2K-3K range.

So currently, we have ENABLE STORAGE IN ROW.
and want to do DISABLE STORAGE IN ROW b/c
we are starting to do a lot of range scan on the table.

When we export/import the table and during import
have moved all the lobs out of line.. the total space
used during the import bloated 5 times from
a 2GIG tablespace into a 10GIG tablespace??? Why?

The database block size is 8K, running 9.2.0.6 with
auto sgement management in the tablespace

CREATE TABLESPACE "BLOB_DATA" LOGGING 
DATAFILE 'D:ORACLEORADATATESTDBBLOB_DATA01.ora' SIZE 2048M
REUSE AUTOEXTEND OFF 
EXTENT MANAGEMENT LOCAL UNIFORM SIZE 8M
SEGMENT SPACE MANAGEMENT AUTO 


Note 13:
--------

To relocate tables using lobs:

Method 1:
=========

1. export data using exp cmd
2. drop all tables 
3. create a new LOB tablespace
4. re-create all the tables with the LOB Storage clause, for example

create table FOO (
col1 NUMBER 
,col2 BLOB
)
tablespace DATA_TBLSPCE
LOB ( col2 ) STORE AS col2_blob
(
tablespace BLOB_TBLSPCE disable storage in row 
chunk 8192 pctversion 10 cache 
storage (initial 64K next 64K 
minextents 1 maxextents unlimited 
pctincrease 0
) 

5. import data with ignore=y 


Method 2:
=======

Doc ID:  Note:130814.1 
Subject:  How to move LOB Data to Another Tablespace 
Type:  HOWTO 
Status:  PUBLISHED 
 Content Type:  TEXT/X-HTML 
Creation Date:  19-DEC-2000 
Last Revision Date:  05-AUG-2003 
 

Purpose
-------

The purpose of this article is to provide the syntax for altering the storage 
parameters of a table that contains one or more LOB columns.


Scope & Application
-------------------

This article will be useful for Oracle DBSs, Developers, and Support Analysts.


How to move LOB Data to Another Tablespace
------------------------------------------
If you want to make no other changes to the table containing a lob other than
to rebuild it, use:

  ALTER TABLE foo MOVE;

This will rebuild the table segment.  It does not affect any of the lob 
segments associated with the lob columns which is the desired optimization.

If you want to change one or more of the physical attibutes of the table containing
the lob, however no attributes of the lob columns are to be changed,
use the following syntax:

  ALTER TABLE foo MOVE TABLESPACE new_tbsp STORAGE(new_storage);

This will rebuild the table segment.  It does not rebuild any of the lob 
segments associated with the lob columns which is the desired optimization.

If a table containing a lob needs no changes to the physical attributes of the 
table segment, but you want to change one or more lob segments; for example,
you want to move the lob column to a new tablespace as well as the lob's 
storage attributes, use the following syntax:

  ALTER TABLE foo MOVE LOB(lobcol) STORE AS lobsegment 
  (TABLESPACE new_tbsp STORAGE (new_storage));

Note that this will also rebuild the table segment (although, in this case, in the
same tablespace and without changing the table segment physical attributes).

If a table containing a lob needs changes to both the table attributes as well 
as the lob attributes then use the following syntax:

  ALTER TABLE foo MOVE
  TABLESPACE new_tbsp STORAGE(new_storage)
  LOB (lobcol) STORE AS lobsegment
  (TABLESPACE new_tbsp STORAGE (new_storage));


Explanation
-----------

The 'ALTER TABLE foo MODIFY LOB (lobcol) ...' syntax does not allow
for a change of tablespace

  ALTER TABLE  my_lob
   MODIFY LOB (a_lob)
   (TABLESPACE new_tbsp);

  (TABLESPACE new_tbsp)
   *
  ORA-22853: invalid LOB storage option specification

You have to use the MOVE keyword instead as shown in the examples.


References
----------

Note 66431.1 LOBS - Storage, Redo and Performance Issues
Bug 747326   ALTER TABLE MODIFY LOB STORAGE PARAMETER DOES'T WORK


Additional Search Words
-----------------------

ora-1735 ora-906 ora-2143 ora-22853 clob nclob blob


Method 3:
=========

Move doesnt support Long datatypes. You can either convert them to LOBs and then move 
or do exp/imp of the table with the LONG column or create the table with LONG 
in the locally managed tablespace and copy the data from the old table using PL/SQL loop 
or CTAS with to_lob in the locally managed tablespace..


SQL> desc t
 Name                                      Null?    Type
 ----------------------------------------- -------- ----------------------------
 X                                                  NUMBER(38)
 Y                                                  LONG

SQL> alter table t move;
alter table t move
*
ERROR at line 1:
ORA-00997: illegal use of LONG datatype

-- You can create the new table in the Locally Managed tablespace 

SQL> create table t_lob  tablespace users as select x,to_lob(y) y from t;

Table created.

SQL> desc t_lob
 Name                                      Null?    Type
 ----------------------------------------- -------- ----------------------------
 X                                                  NUMBER(38)
 Y                                                  CLOB

-- Now you can drop the old table and rename the new table

-- Or you can move the LOB table to the locally managed tablespace

SQL> alter table t_lob move;

Table altered.

-- Or you can precreate the new table with LONG in the locally managed tablespace and do exp/imp

-- export the Long table
SQL> !exp / file=t.dmp tables=t compress=n 

Export: Release 9.2.0.3.0 - Production on Tue Mar 2 09:32:30 2004

Copyright (c) 1982, 2002, Oracle Corporation.  All rights reserved.

Connected to: Oracle9i Enterprise Edition Release 9.2.0.3.0 - Production
With the Partitioning, OLAP and Oracle Data Mining options
JServer Release 9.2.0.3.0 - Production
Export done in WE8ISO8859P1 character set and AL16UTF16 NCHAR character set

About to export specified tables via Conventional Path ...
. . exporting table                              T          2 rows exported
Export terminated successfully without warnings.

-- just rename the old table for reference purposes
SQL> rename t to tbak;

Table renamed.

-- Create the LONG table in the locally managed tablespace

SQL> create table t(x int,y long) tablespace users;

Table created.

-- now import the data 

SQL> !imp / file=t.dmp tables=t ignore=y          

Import: Release 9.2.0.3.0 - Production on Tue Mar 2 09:33:43 2004

Copyright (c) 1982, 2002, Oracle Corporation.  All rights reserved.

Connected to: Oracle9i Enterprise Edition Release 9.2.0.3.0 - Production
With the Partitioning, OLAP and Oracle Data Mining options
JServer Release 9.2.0.3.0 - Production

Export file created by EXPORT:V09.02.00 via conventional path
import done in WE8ISO8859P1 character set and AL16UTF16 NCHAR character set
. importing OPS$ORACLE's objects into OPS$ORACLE
. . importing table                            "T"          2 rows imported
Import terminated successfully without warnings.

SQL> desc t
 Name                                      Null?    Type
 ----------------------------------------- -------- ----------------------------
 X                                                  NUMBER(38)
 Y                                                  LONG


Note 14:
--------

Doc ID </help/usaeng/Search/search.html>: 	Note:225337.1	Content Type: 	TEXT/PLAIN	
Subject: 	ORA-22285 ON ACCESSING THE BFILE COLUMN OF A TABLE	Creation Date: 	08-JAN-2003	
Type: 	PROBLEM	Last Revision Date: 	17-DEC-2004	
Status: 	PUBLISHED		
Fact(s) 
~~~~~~~ 
  
  *The directory alias for the relevant directory exists. 
 
  *This condition might be encountered in general or particularly after     
   successful export/import of 'table with bfile column' from one schema 
   to another. 
   
  *Non-bfile columns of the table could be accessed but not the bfile  
   column.  
   
    
Symptom(s) 
~~~~~~~~~~ 
 
  Accessing the bfile column of table gives the following errors: 
 
          ORA-22285: non-existent directory or file for ..... 
          ORA-06512: at "SYS.DBMS_LOB", line ...  
   
 
Diagnosis: 
~~~~~~~~~~ 
 
-- create the exporting user schema and the table with bfile data-- 
 
SQL>connect system/manager 
 
SQL>create user test2 identified by test2 default tablespace users  
    quota  50 m on users 
/ 
SQL>grant connect, create table, create any directory to test2 
/ 
SQL>conn test2/test2 
 
SQL>create table test_lobs ( 
   c1 number, 
   c2 clob, 
   c3 bfile, 
   c4 blob  
) 
LOB (c2) STORE AS (ENABLE STORAGE IN ROW) 
LOB (c4) STORE AS (DISABLE STORAGE IN ROW) 
/ 
 
create two files (rec2.txt , rec3.txt) using OS utilities in some  
directory say ( /tmp ) 
 
--create the directory alias -- 
 
SQL>create directory tmp_dir as '/tmp' 
/ 
 
-- Populate the table-- 
 
SQL>insert into test_lobs values (1,null,null,null) 
/ 
SQL>insert into test_lobs values (2,EMPTY_CLOB(),BFILENAME('TMP_DIR','rec2.txt'),EMPTY_BLOB()) 
/ 
SQL>insert into test_lobs values (3,'Some data for record3.',    
    BFILENAME('TMP_DIR','rec2.txt'), 
   '48656C6C6F'||UTL_RAW.CAST_TO_RAW('there!')) 
/ 
 
-- access the table-- 
 
SQL>column len_c2 format 9999 
SQL>column len_c3 format 9999 
SQL>column len_c4 format 9999 
 
 
SQL>select c1, DBMS_LOB.GETLENGTH(c2) len_c2, 
    DBMS_LOB.GETLENGTH(c3) len_c3, 
    DBMS_LOB.GETLENGTH(c4) len_c4 from test_lobs 
/ 
 
                        C1 LEN_C2 LEN_C3 LEN_C4 
-------------------------- ------ ------ ------ 
                         1 
                         2      0    124      0 
                         3     22    124     11 
 
-- carry out the schema level export-- 
 
$ exp  system/manager file=exp44.dmp log=logr44.log owner=test2 
 
IMPORTING DATABASE: 
 
 
create same two files (rec2.txt , rec3.txt) using OS utilities in some  
directory say ( /tmp ) 
 
--create the directory alias -- 
 
SQL>conn system/manager 
 
SQL>create directory tmp_dir as '/tmp' 
/ 
 
-- create the importing user schema-- 
 
SQL>create user test3 identified by test3 default tablespace users  
    quota  50 m on users 
/ 
SQL>grant connect, create table, create any directory to test3 
/ 
 
 
--carry out the successful schema level import-- 
 
$ imp system/manager fromuser=test2 touser=test3 file=exp44.dmp log=log44.log 
 
 
--try to access the imported table as below (same statement as by the 
   exporting user-- 
 
SQL>select c1, DBMS_LOB.GETLENGTH(c2) len_c2, DBMS_LOB.GETLENGTH(c3) len_c3, 
    DBMS_LOB.GETLENGTH(c4) len_c4 from test_lobs 
/ 
 
ERROR: 
ORA-22285: non-existent directory or file for GETLENGTH operation 
ORA-06512: at "SYS.DBMS_LOB", line 547 
 
-- However non bfile columns could be accessed-- 
 
  
Cause 
~~~~~ 
   
    The importing user lacks the read access on the corresponding directory/ 
    directory alias.   
     
Solution(s) 
~~~~~~~~~~~ 
 
 
   grant read access on the corressponding directory to the user who tries to   
   access the  bfile table as below: 
 
   SQL> conn system/manager 
        Connected. 
   SQL> grant read on directory tmp_dir to test3; ( please see the example  
       above ) 
 
   Once the read permission is granted ,the bfile column of the said table  
   is accessible since the corresponding directory (/alias) is accessible. 
 
 
Refrences: 
~~~~~~~~~~ 
 
[NOTE:66046.1] <ml2_documents.showDocument?p_id=66046.1&p_database_id=NOT>:   LOBs, Longs, and other Datatypes 


Note 15:
--------


Doc ID:  Note:279722.1 
Subject:  IMPORT OF TABLE WITH LOB GENERATES CORE DUMP 
Type:  PROBLEM 
Status:  MODERATED 
 Content Type:  TEXT/X-HTML 
Creation Date:  31-JUL-2004 
Last Revision Date:  02-AUG-2004 
 

The information in this article applies to: 
Oracle Server - Enterprise Edition - Version: 9.2.0.3
This problem can occur on any platform.

Symptoms
IMPORT OF TABLE WITH LOB GENERATES CORE DUMP 
Cause
<Bug:3091499>

Importing a table having a clob created with chunksize = 32k


Error Details:
-------------

. importing DBAPIDB1's objects into DBAPIDB1
. . importing table "TE2006"Segmentation fault

Trace from the Core Dump:
------------------------
lmmstrmlr 44
lmmstmrg D4
lmmstmrg D4
lmmstfree 104
lmmfree C0
impmfr 24
impplb 5BC
impins 22B8
do_insert 48C
imptabwrk F4
impdta 41C
impdrv 2D68
main 14
__start 94 
Fix
FIX:
---
Apply the patch for Bug:3091499


WORKAROUND:
----------
Before import, create the table with chunksize <= 16K and run import setting ignore=y 
References
<BUG:3091499> - Import Of Table With Lob Generates Core Dump


Note 16: keep LOBS at manageble size.
-------------------------------------

(1) Look at PCTVERSION:

Since the LOB segments are usually very large, they are  treated differently from other columns. While other columns 
can be guaranteed to give consistent reads, these columns are not. This is because, it is difficult to manage 
with LOB data rollback segments due to their size unlike other columns. So they do not use rollback segments. 
Usually only one copy exists, so the queries reading that column may not get consistent reads while 
other queries modify them. In these cases, the other queries will get "ORA-22924 snapshot too old" errors.

To maintain read consistency Oracle creates new LOB page versions every time a lob changes. PCTVERSION is 
the percentage of all used LOB data space that can be occupied by old versions of LOB data pages. As soon as 
old versions of LOB data pages start to occupy more than the PCTVERSION amount of used LOB space, 
Oracle tries to reclaim the old versions and reuse them. In other words, PCTVERSION is the percent of used 
LOB data blocks that is available for versioning old LOB data. The PCTVERSION can be set to the percentage 
of LOB's that are occasionally updated. 

Often a table's a LOB column usually gets the data uploaded only once, but is read multiple times. 
Hence it is not necessary to keep older versions of LOB data. It is recommended that this value be changed to "0". 

By default PCTVERSION is set to 10%. So, most of the instances usually have it set to 10%, 
it must be set to 0% explicitly. The value can be changed any time in a running system.

Use the following query to find out currently set value for PCTVERSION:

SQL> select PCTVERSION from dba_lobs where TABLE_NAME = 'table_name' and COLUMN_NAME='column_name';

PCTVERSION
----------
        10

PCTVERSION can be changed using the following SQL (it can be run anytime in a running system):

ALTER TABLE FND_LOBS MODIFY LOB (FILE_DATA) ( PCTVERSION 0 );


Note 17: difference 9iR1 9iR2 with respect to Locally managed tablespace
------------------------------------------------------------------------


Doc ID:  Note:159078.1 
Subject:  Cannot Create Table with LOB Column in Locally Managed Tablespace 
Type:  PROBLEM 
Status:  PUBLISHED 
 Content Type:  TEXT/X-HTML 
Creation Date:  26-SEP-2001 
Last Revision Date:  04-AUG-2004 
 

fact: Oracle Server - Enterprise Edition 9.0.1
symptom: Creating new OEM repository fails
symptom: Create table SMP_LMV_SEARCH_OBJECT fails
symptom: ORA-03001: unimplemented feature
symptom: Table with LOB column
cause: You try to create a LOB segment in a bitmapped (locally managed) 
tablespace.

This is a limitation for bitmapped segments in 9i. This is being documented in
the SQL Reference- the restriction will be lifted in 9i Release 2.


fix:

Create the table in a tablespace that was created with clause
SEGMENT SPACE MANAGEMENT MANUAL


Note 18:
--------

In a trace file you either get

ORA-00600: internal error code, arguments: [kkdoilsn1], [], [], [], [], [], [], []
or
ORA-00600: internal error code, arguments: [15265], [], [], [], [], [], [], []

description: 
in a 9.2 database, a table with lob and indexsegments was moved to another tablespace.

Explanation:

9202 2405258 Dictionary corruption / OERI:15265 from MOVE LOB to existing segment name 
2405258 Dictionary corruption / OERI:15265 from MOVE LOB to existing segment name  

This is Bug 2405258      Fixed: 9202 
Corruption 
LOB Related (CLOB/BLOB/BFILE) 
Dictionary corruption / ORA-600 [15265] from MOVE LOB toexisting segment name.
Eg: 
ALTER TABLE mytab MOVE LOB (artist_bio)    STORE AS lobsegment (STORAGE(INITIAL 1M NEXT 1M));
corrupts the dictionary if "logsegment" already exists.


Bug 2405258 Dictionary corruption / OERI:15265 from MOVE LOB to existing segment name
This note gives a brief overview of bug 2405258. 
Affects:
Product (Component)	Oracle Server (RDBMS)	
Range of versions believed to be affected	Versions >= 8 but < 10G 	
Versions confirmed as being affected	9.2.0.1 	
Platforms affected	Generic (all / most platforms affected)	
Fixed:
This issue is fixed in	9.2.0.2 (Server Patch Set)  10G Production Base Release 	
Symptoms:
Corruption (Dictionary) <javascript:taghelp('TAGS_CORR_DIC')> 
Internal Error may occur (ORA-600) <javascript:taghelp('TAGS_OERI')> 
ORA-600 [15265] 
Related To:
Datatypes - LOBs (CLOB/BLOB/BFILE) 
Description
Dictionary corruption / ORA-600 [15265] from MOVE LOB to 
existing segment name.
Eg: ALTER TABLE mytab MOVE LOB (artist_bio) 
    STORE AS lobsegment (STORAGE(INITIAL 1M NEXT 1M)); 
corrupts the dictionary if "logsegment" already exists.


=====================
31. BLOCK CORRUPTION:
=====================


Note 1:
=======

Doc ID </help/usaeng/Search/search.html>: 	Note:47955.1	Content Type: 	TEXT/PLAIN	
Subject: 	Block Corruption FAQ	Creation Date: 	14-NOV-1997	
Type: 	FAQ	Last Revision Date: 	17-AUG-2004	
Status: 	PUBLISHED		
ORACLE SERVER 

------------- 
BLOCK CORRUPTION 
---------------- 
FREQUENTLY ASKED QUESTIONS 
-------------------------- 
25-JAN-2000 
 
CONTENTS 
-------- 
1. What does the error ORA-01578 mean? 
2. How to determine what object is corrupted?  
3. What are the recovery options if the object is a table? 
4. What are the recovery options if the object is an index? 
5. What are the recovery options if the object is a rollback segment? 
6. What are the recovery options if the object is a data dictionary object? 
7. What methods are available to assist in pro-actively identifying corruption? 
8. How can corruption be prevented? 
9. What are the common causes of corruption? 
 
 
QUESTIONS & ANSWERS 
 
1. What does the error ORA-01578 mean? 
 
   An Oracle data block is written in an internal binary format which conforms  
   to a defined structure.  The size of the physical data block is determined  
   by the "init.ora" parameter DB_BLOCK_SIZE set at the time of database  
   creation.  The format of the block is similar regardless of the type of data 
   contained in the block.   
 
  Each formatted block on disk has a wrapper which consists of a block header  
  and footer. Unformatted blocks should be zero throughout.  Whenever a block  
  is read into the buffer cache, the block wrapper information is checked for  
  validity.  The checks include verifying that the block passed to Oracle by  
  the operating system is the block requested (data block address) and also  
  that certain information stored in the block header matches information  
  stored in the block footer in case of a split (fractured) block. 
 
  On a read from disk, if an inconsistency in this information is found, the  
  block is considered to be corrupt and 

  ORA-01578: ORACLE data block corrupted  (file # %s, block # %s) 

  is signaled where file# is the file ID of the Oracle  
  datafile and block# is the block number, in Oracle blocks, within that file. 
  However, this does not always mean that the block on disk is truely  
  physically corrupt.  That fact needs to be confirmed.   
 
 
2. How to determine what object is corrupted? 
 
   The following query will display the segment name, type, and owner:  
 
       SELECT SEGMENT_NAME, SEGMENT_TYPE, OWNER 
       FROM SYS.DBA_EXTENTS 
       WHERE FILE_ID = <f>   
       AND <b> BETWEEN BLOCK_ID AND BLOCK_ID + BLOCKS - 1; 
 
   Where <f> is the file number and <b> is the block number reported in the  
   ORA-01578 error message. 

   Suppose block 82817 from table 'USERS' is corrupt:

SQL> select extent_id, block_id, blocks from dba_extents where segment_name='USERS';

 EXTENT_ID   BLOCK_ID     BLOCKS
---------- ---------- ----------
         0      82817          8
         1      82825          8
         2      82833          8
         3      82841          8
         4      82849          8

SQL> SELECT SEGMENT_NAME, SEGMENT_TYPE, OWNER 
  2  FROM SYS.DBA_EXTENTS 
  3  WHERE FILE_ID = 9  
  4  AND 82817 BETWEEN BLOCK_ID AND BLOCK_ID + BLOCKS - 1; 

SEGMENT_NAME                                                                      SEGMENT_TYPE       OWNER
--------------------------------------------------------------------------------- ------------------
USERS                                                                             TABLE              VPOUSERDB
 
 
3. What are the recovery options if the object is a table? 
 
   The following options exist for resolving non-index block corruption in a  
   table which is not part of the data dictionary: 
 
       o Restore and recover the database from backup (recommended). 
       o Recover the object from an export. 
       o Select the data out of the table bypassing the corrupted block(s). 
 
   If the table is a Data Dictionary table, you should contact Oracle Support  
   Services.  The recommended recovery option is to restore the database from  
   backup. 
 
   [NOTE:28814.1] <ml2_documents.showDocument?p_id=28814.1&p_database_id=NOT> 
   contains information on how to handle ORA-1578 errors in  Oracle7. 
 
   References: 
   ----------- 
   [NOTE:28814.1] <ml2_documents.showDocument?p_id=28814.1&p_database_id=NOT>  
   TECH ORA-1578 and Data Block Corruption in Oracle7 

4. What are the recovery options if the object is  an index? 
  
   If the object is an index which is not part of the data dictionary and the  
   base table does not contain any corrupt blocks, you can simply drop and  
   recreate the index. 
 
   If the index is a Data Dictionary index, you should contact Oracle Support  
   Services. The recommended recovery option is to restore the database from  
   backup.  There is a possibility you might be able to drop the index and then 
   recreate it based on the original create SQL found in the administrative  
   scripts.  Oracle Support Services will be able to make the determination as  
   to whether this is a viable option for you. 
 
 
5. What are the recovery options if the object is a rollback segment? 
 
   If the object is a rollback segment, you should contact Oracle Support  
   Services.  The recommended recovery option is to restore the database  
   from backup. 
 
 
6. What are the recovery options if the object is a data dictionary object? 
 
   If the object is a Data Dictionary object, you should contact Oracle Support 
   Services.  The recommended recovery option is to restore the database from  
   backup.   
 
   If the object is an index on a Data Dictionary table, you might be able to  
   drop the index and then recreate it based on the original create SQL found  
   in the administrative scripts.  Oracle Support Services will be able to make 
   the determination as to whether this is a viable option. 
 
 
7. What methods are available to assist in pro-actively identifying corruption? 
     
   ANALYZE TABLE/INDEX/CLUSTER ... VALIDATE STRUCTURE is a SQL command which  
   can be executed against a table, index, or cluster which scans every block  
   and reports a failure upon encountering any potentially corrupt blocks. The 
   CASCADE option checks all associated indices and verifies the 1 to 1  
   correspondence between data and index rows. This is the most detailed block  
   check available, but requires the database to be open. 
 
   DB Verify is a utility which can be run against a datafile of a database  
   that will scan every block in the datafile and generate a report identifying 
   any potentially corrupt blocks.  DB Verify performs basic block checking  
   steps, however it does not provide the capability to verify the 1 to 1  
   correspondence between data and index rows. It can be run when the database  
   is closed. 
 
   Export will read the blocks allocated to each table being exported and  
   report any potential block corruptions encountered. 
 
 
   References: 
   ----------- 
 
   [NOTE:35512.1] <ml2_documents.showDocument?p_id=35512.1&p_database_id=NOT> 
   DBVERIFY - Database file Verification Utility (7.3.2 onwards) 
 
 
8. How can corruption be prevented? 
 
   Unfortunately, there is no way to totally eliminate the risk of corruption. 
   You can only minimize the risk and plan accordingly. 
 
 
9. What are the common causes of corruption? 
 
   o Bad I/O, H/W, Firmware. 
   o Operating System I/O or caching problems. 
   o Memory or paging problems. 
   o Disk repair utilities. 
   o Part of a datafile being overwritten. 
   o Oracle incorrectly attempting to access an unformatted block. 
   o Oracle or operating system bug. 
 
   Note 77587.1 <ml2_documents.showDocument?p_id=77587.1&p_database_id=NOT> 
   discusses block corruptions in Oracle and how they are related  
   to the underlying operating system and hardware. 
 
   References: 
   ----------- 
 
   [NOTE:77587.1] <ml2_documents.showDocument?p_id=77587.1&p_database_id=NOT>  BLOCK CORRUPTIONS ON ORACLE AND UNIX 


Note 2:
=======

ORA-00600: Internal message code, arguments: [01578] [...] [...] [] [] [].
ORA-01578: Oracle data block corrupted (file ..., block ...).

Having encountered the Oracle data block corruption, we must firstly investigate which database segment
(name and type) the corrupted block is allocated to. Chances are that the block belongs either to an index 
or to a table segment, since these two type of segments fill the major part of our databases. 
The following query will reveil the segment that holds the corrupted block identified by
<filenumber> and <blocknumber> (which were given to you in the error message):

	SELECT ds.*
	FROM dba_segments ds, sys.uet$ e
	WHERE ds.header_file=e.segfile#
	and ds.header_block=e.segblock#
	and e.file#=<filenumber>
	and <blocknumber> between e.block# and e.block#+e.length-1;


If the segment turns out to be an index segment, then the problem can be very quickly solved. 
Since all the table data required for recreating the index is still accessable, we can drop and recreate the index 
(since the block will reformatted, when taken FROM the free-space list and reused for the index).
If the segment turns out to be a table segment a number of options for solving the problem are available:

- restore and recovery of datafile the block is in
- imp table
- sql

The last option involves using SQL to SELECT as much data as possible FROM the current 
corrupted table segment and save the SELECTed rows into a new table.
SELECTing data that is stored in segment blocks that preceede the corrupted block
can be easily done using a full table scan (via a cursor). 
Rows stored in blocks after the corrupted block cause a problem. 
A full table scan will never reach these. However these rows can still be
fetched using rowids (single row lookups).


2.1 Table was indexed

Using an optimizer hint we can write a query that SELECTs the rows FROM the table 
via an index scan (using rowid's), instead of via a full table scan. 
Let's assume our table is named X with columns a, b and c. And table X is indexed 
uniquely on columns a and b by index X_I, the query would look like:

SELECT /*+index(X X_I) */ a, b, c
FROM X;

We must now exclude the corrupt block FROM being accessed to avoid the 
internal exception ORA-00600[01578]. Since the blocknumber is a substring 
of the rowid ( ) this can very easily be achieved:

SELECT /*+index(X X_I) */ a, b, c
FROM X
WHERE rowid not like <corrupt_block_number>||'.%.'||<file_number>;


But it is important to realize that the WHERE-clause gets evaluated right 
after the index is accessed and before the table is accessed. 
Otherwise we would still get the ORA-00600[01578] exception. 
Using the above query as a subquery in an insert statement we can restore 
all rows of still valid blocks to a new table.

Since the index holds the actual column values of the indexed columns we could 
also use the index to restore all indexed columns of rows that reside in the corrupt block. 
The following query,

	SELECT /*+index(X X_I) */ a, b
	FROM X
	WHERE rowid like <corrupt_block_number>||'.%.'||<file_number>;

retreives only indexed columns a and b FROM rows inside the corrupt block. 
The optimizer will not access the table for this query. 
It can retreive the column values using the index segment only. 

Using this technique we are able to restore all indexed column values of the 
rows inside the corrupt block, without accessing the corrupt block at all. 
Suppose in our example that column c of table X was also indexed by index X_I2. 
This enables us to completely restore rows inside the corrupt block.

First restore columns a and b using index X_I:

	create table X_a_b(rowkey,a,b) as
	SELECT /*+index(X X_I) */ rowid, a, b
	FROM X
	WHERE rowid like <corrupt_block_number>||'.%.'||<file_number>;

Then restore column c using index X_I2:

	create table X_c(rowkey,c) as
	SELECT /*+index(X X_I2) */ rowid, c
	FROM X
	WHERE rowid like <corrupt_block_number>||'.%.'||<file_number>;

And finally join the columns together using the restored rowid:

	SELECT x1.a, x1.b, x2.c
	FROM X_a_b x1, X_c x2
	WHERE x1.rowkey=x2.rowkey;

In summary:
Indexes on the corrupted table segment can be used to restore all columns of all rows 
that are stored outside the corrupted data blocks. 
Of rows inside the corrupted data blocks, only the columns that were indexed can be restored. 
We might even be able to use an old version of the table (via Import)
to further restore non-indexed columns of these records.


2.2 Table has no indexes

This situation should rarely occur since every table should have a primary key and therefore a unique index. 
However when no index is present, all rows of corrupted blocks should be considered lost. 
All other rows can be retrieved using rowid's. 
Since there is no index we must build a rowid generator ourselves. 
The SYS.UET$ table shows us exactly which extents (file#, startblock, endblock)
 we need to inspect for possible rows of our table X. 
If we make an estimate of the maximum number of rows per block for table X, 
we can build a PL/SQL-loop that generates possible rowid's of records inside table X. 
By handling the 'invalid rowid' exception, and skipping the corrupted data block, 
we can restore all rows except those inside the corrupted block.

declare
v_rowid  varchar2(18);
v_xrec   X%rowtype;
e_invalid_rowid exception;
pragma exception_init(e_invalid_rowid,-1410);

begin 

for v_uetrec in (SELECT file# file, block# start_block, block#+length#-1 end_block
			FROM uet$
			WHERE segfile#=6 and segblock#=64) -- Identifies our segment X.
      loop for v_blk in v_uetrec.start_block..v_uetrec.end_block
	    loop if v_uetrec.file<>6 and v_blk<>886 -- 886 in file 6 is our corrupted block.
		 then for v_row in 0..200 -- 200 is maximum number of rows per block for segment X.
		      loop begin SELECT a,b,c into v_rec
				FROM x
				WHERE rowid=chartorowid('0000'||hex(v_blk)||'.'||
						hex(v_row)||'.'||hex(v_uetrec.file);
				insert into x_saved(a,b,c) values(v_rec.a,v_rec.b,v_rec.c);
				commit;
			   exception when e_invalid_rowid then null;
			   end;
		      end loop; /*row-loop*/
		 end if;
	    end loop; /*blk-loop*/
      end loop; /*uet-loop*/
end;
/

The above code assumes that block id's never exceed 4 hexadecimal places. 
A definition of the hex-function which is used in the above code can be found in the appendix.


Note 3:
=======

Doc ID </help/usaeng/Search/search.html>: 	Note:33405.1	Content Type: 	TEXT/PLAIN	
Subject: 	Extracting Data from a Corrupt Table using SKIP_CORRUPT_BLOCKS or Event 10231	Creation Date: 	24-JAN-1996	
Type: 	BULLETIN	Last Revision Date: 	13-SEP-2000	
Status: 	PUBLISHED		
***************** 
*** *** 
***************** 
  This note is an extension to article [NOTE:28814.1]
  <ml2_documents.showDocument?p_id=28814.1&p_database_id=NOT> about handling  
  block corruption errors where the block wrapper of a datablock indicates  
  that the block is bad.  (Typically for ORA-1578 errors).   
  The details here will not work if only the block internals are  
  corrupt (eg: for ORA-600 or other errors). 
 
  Please read [NOTE:28814.1] <ml2_documents.showDocument?p_id=28814.1&p_database_id=NOT> before reading this note. 
 
Introduction 
~~~~~~~~~~~~ 
	This short article explains how to skip corrupt blocks on an object 
        either using the Oracle8i SKIP_CORRUPT table flag or the special  
        Oracle event number 10231 which is available in Oracle releases 7  
        through 8.1 inclusive. 
	The information here explains how to use these options. 
 
	Before proceeding you should: 
		a)  Be certain that the corrupt block is on a USER table. 
		    (i.e.: not a data dictionary table) 
		b)  Have contacted Oracle Support Services and been advised to  
                    use event 10231 or the SKIP_CORRUPT flag. 
		c)  Have decided how you are to recreate the table.  
		    Eg: Export , and disk space is available etc.. 
		d)  You have scheduled down-time to attempt the salvage 
		    operation. 
		e)  Have a backup of the database. 
		f)  Have the SQL to rebuild the problem table, its indexes 
		    constraints, triggers, grants etc...  
		    This SQL should include relevant storage clauses. 
 
 
What is event 10231 ? 
~~~~~~~~~~~~~~~~~~~~~ 
	This event allows Oracle to skip certain types of corrupted blocks 
	on full table scans ONLY hence allowing export or "create table as 
	select" type operations to retrieve rows from the table which are not 
	in the corrupt block. Data in the corrupt block is lost. 
 
	The scope of this event is limited for Oracle versions prior to 
	Oracle 7.2 as it only allows you to skip 'soft corrupt' blocks. 
	Most ORA 1578 errors are a result of media corruptions and in such  
	cases event 10231 is useless. 
 
	From Oracle 7.2 onwards the event allows you to skip many forms of  
	media corrupt blocks in addition to soft corrupt blocks and so is 
	far more useful. It is still *NOT* guaranteed to work.  
	[NOTE:28814.1] <ml2_documents.showDocument?p_id=28814.1&p_database_id=NOT> 
        describes alternatives which can be used if this event  
	fails. 
 
What is the SKIP_CORRUPT flag ? 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
        In Oracle8i the functionality of the 10231 event has been externalised 
        on a PER-SEGMENT basis such that it is possible to mark a TABLE or 
        PARTITION to skip over corrupt blocks when possible. The flag is 
        set or cleared using the DBMS_REPAIR package. DBA_TABLES has a  
        SKIP_CORRUPT column which indicates if this flag is set for an  
        object or not. 
 
Setting the event or flag 
~~~~~~~~~~~~~~~~~~~~~~~~~ 
 	The event can either be set within the session or at database instance 
        level. If you intend to use a CREATE TABLE AS SELECT then setting 
  	the event in the session may suffice. If you want to EXPORT the table 
	data then it is best to set the event at instance level, or set the 
        SKIP_CORRUPT table attribute if on Oracle8i. 
 
  Oracle8i  
  ~~~~~~~~ 
        Connect as a DBA user and mark the table as needing to skip  
        corrupt blocks thus: 
          execute DBMS_REPAIR.SKIP_CORRUPT_BLOCKS('<schema>','<tablename>'); 
 
        or for a table partition: 
          execute DBMS_REPAIR.SKIP_CORRUPT_BLOCKS('<schema>','<tablename>'.'<partition>'); 
 
	Now you should be able to issue a CREATE TABLE AS SELECT operation 
	against the corrupt table to extract data from all non-corrupt 
	blocks, or EXPORT the table. 
	Eg: 
		CREATE TABLE salvage_emp  
		 AS SELECT * FROM corrupt_emp; 
 
        To clear the attribute for a table use: 
        execute DBMS_REPAIR.SKIP_CORRUPT_BLOCKS('<schema>','<tablename>', 
                      flags=>dbms_repair.noskip_flag); 

execute DBMS_REPAIR.SKIP_CORRUPT_BLOCKS('VPOUSERDB','USERS', flags=>dbms_repair.noskip_flag); 
 
 
  Setting the event in a Session 
  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
 	Connect to Oracle as a user with access to the corrupt table and 
	issue the command: 
 
		ALTER SESSION SET EVENTS 
			'10231 TRACE NAME CONTEXT FOREVER, LEVEL 10'; 
 
	Now you should be able to issue a CREATE TABLE AS SELECT operation 
	against the corrupt table to extract data from all non-corrupt 
	blocks, but an export would still fail as the event is only set  
        within your current session. 
	Eg: 
		CREATE TABLE salvage_emp  
		 AS SELECT * FROM corrupt_emp; 
 
  Setting the event at Instance level 
  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
	This requires that the event be added to the init$ORACLE_SID.ora file 
 	used to start the instance: 
 
		shutdown the database 
 
		Edit your init<SID>.ora startup configuration file and ADD 
		a line that reads: 
 
			event="10231 trace name context forever, level 10" 
 
		  Make sure this appears next to any other EVENT= lines in the 
	 	  init.ora file.  
 
		STARTUP  
			If the instance fails to start check the syntax 
			of the event parameter matches the above exactly. 
			Note the comma as it is important.  
 
		SHOW PARAMETER EVENT  
			To check the event has been set in the correct place. 
			You should see the initial portion of text for the 
			line in your init.ora file. If not check which  
			parameter file is being used to start the database. 
 
		Select out the data from the table using a full table scan 
		operation. 
			Eg: Use a table level export  
			    or create table as select. 
 
  Export Warning: If the table is very large then some versions of export 
  may not be able to write more than 2Gb of data to the 
  export file. See [NOTE:62427.1] <ml2_documents.showDocument?p_id=62427.1&p_database_id=NOT> for general information 
  on 2Gb limits in various Oracle releases. 
 
			     
Salvaging data from the corrupt block itself 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
  SKIP_CORRUPT and event 10231 extract data from good blocks but 
  skip over corrupt blocks. To extract information from the corrupt 
  block there are three main options: 
 
    - Select column data from any good indexes 
        This is discussed towards the end of the following 2 articles: 
           Oracle7 - using ROWID range scans    [NOTE:34371.1] <ml2_documents.showDocument?p_id=34371.1&p_database_id=NOT> 
           Oracle8/8i - using ROWID range scans [NOTE:61685.1] <ml2_documents.showDocument?p_id=61685.1&p_database_id=NOT> 
 
    - See if Oracle Support can extract any data from HEX dumps of the 
      corrupt block. 
    - It may be possible to salvage some data using Log Miner 
         
 
Once you have the data extracted 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
	Once you have the required data extracted either into an export file 
	or into another table make sure you have a valid database backup before 
	proceeding. The importance of this cannot be over-emphasised. 
 
	Double check you have the SQL to rebuild the object and its indexes  
	etc.. 
 
	Double check that you have any diagnostic information if requested by  
	Oracle support. Once you proceed with dropping the object certain  
	information is destroyed so it is important to capture it now. 
 
	Now you can: 
 
	    	If 10231 was set at instance level: 
		   Remove the 'event' line from the init.ora file  
 
		   SHUTDOWN and RESTART the database. 
 
		   SHOW PARAMETER EVENT 
			Make sure the 10231 event is no longer shown 
 
		RENAME or DROP the problem table 
			If you have space it is advisable to RENAME the 
			problem table rather than DROP it at this stage. 
 
		Recreate the table. 
			Eg: By importing. 
			    Take special care to get the storage clauses  
			    correct when recreating the table. 
 
		Create any indexes, triggers etc.. required 
			Again take care with storage clauses. 
 
		Re-grant any access to the table. 
 
		If you RENAMEd the original table you can drop it once 
		the new table has been tested. 

. 


Note 4: Analyze table validate structure:
=========================================

validate structure table:

  ANALYZE TABLE CHARLIE.CUSTOMERS VALIDATE STRUCTURE;

validate structure index:

  ANALYZE INDEX CHARLIE.PK_CUST VALIDATE STRUCTURE;

Als er geen corrupte blocks worden gevonden, is de output slechts "table analyzed".
Als er wel corrupte blocks worden gevonden, moet een aangemaakte trace file
worden bekeken.


Note 5: DBVERIFY Utility:
=========================

Vanaf de OS prompt kan het dbv utility gedraaid worden om een datafile
te onderzoeken.

$ dbv FILE=/u02/oracle/cc1/data01.dbf BLOCKSIZE=8192


Note 6: DBMS_REPAIR package:
============================

Het DBMS_REPAIR package wordt aangemaakt door bmprpr.sql script.

Stap 1.

via ANALYZE TABLE ben je er achter gekomen dat van een table
een of meer blocks corrupt zijn.

Stap 2.

Gebruik eerst DBMS_REPAIR.ADMIN_TABLES om de REPAIR_TABLE aan te maken.
Deze table zal dan gegevens gaan bevatten over de blocks, en of die
gemarkeerd zijn als zijnde corrupt e.d.

  declare

  begin
    dbms_repair.admin_tables('REPAIR_TABLE, dbms_repair.repair_table, dbms_repair.create_action, 'users');
  end;
  /

Stap 3.

Gebruik nu de DBMS_REPAIR.CHECK_OBJECT procedure op het object
om de repair_table uit stap 2 te vullen met corruptie gegevens.


set serveroutput on
declare rpr_count int;

begin
  rpr_count:=0;
    
  dbms_repair.check_object('CHARLIE', 'CUSTOMERS', 'REPAIR_TABLE', rpr_count);

  dbms_output.put_line('repair_block_count :'||to_char(rpr_count));
end;
/


Note 7:
=======

Tom,

If I have this information:
select * from V$DATABASE_BLOCK_CORRUPTION;

     FILE#     BLOCK#     BLOCKS CORRUPTION_CHANGE# CORRUPTIO
---------- ---------- ---------- ------------------ ---------
        11      12357         12          197184960 LOGICAL
and 
select * from v$backup_corruption;

     RECID      STAMP  SET_STAMP  SET_COUNT     PIECE#      FILE#     BLOCK#     
BLOCKS CORRUPTION_CHANGE# MAR CO
---------- ---------- ---------- ---------- ---------- ---------- ---------- 
---------- ------------
         1  533835361  533835140       3089          1         11      12357     
    12          197184960 NO  LOGICAL

How can I get more details of what data resides on this blocks? and being 
'Logical' can they be recoverd without loosing that data at all?

Any extra details would be appreciated.

Thanks,

Orlando 

Followup:  
select * from dba_extents 
where file_id = 11
and 12357 between block_id an block_id+blocks-1;

if it is something "rebuildable" -- like an index, drop and recreate might be 
the path of least resistance, else you would go back to your backups -- to 
before this was detected and restore that file/range of blocks (rman can do 
block level recovery)

 
Tom

trace file generated by analyze contained

table scan: segment: file# 55 block# 229385
            skipping corrupt block file# 55 block# 251372

This is repeated every day (analyzed each morning)
but daily direct export / import succeeds.

SQL> select segment_type from dba_extents 
    where file_id=55 
    and 229385 between block_id and 
    (block_id +( blocks -1));

SEGMENT_TYPE
----------------------------------------
TABLE


$ dbv     file=/u03/oradata/emu/emu_data_large02.dbf \
    blocksize=8192 logfile=/dbv.log

DBVERIFY: Release 8.1.7.2.0 - Production on Mon Aug 10 10:10:13 2004

(c) Copyright 2000 Oracle Corporation.  All rights reserved.


DBVERIFY - Verification starting : FILE = /u03/oradata/emu/emu_data_large02.dbf
Block Checking: DBA = 230938092, Block Type = KTB-managed data block
Found block already marked corrupted

DBVERIFY - Verification complete

Total Pages Examined         : 256000
Total Pages Processed (Data) : 253949
Total Pages Failing   (Data) : 0
Total Pages Processed (Index): 0
Total Pages Failing   (Index): 0
Total Pages Processed (Other): 11
Total Pages Empty            : 2040
Total Pages Marked Corrupt   : 0
Total Pages Influx           : 0


Any thoughts ?

Thanks


Note 6:
-------

Detect And Correct Corruption
Oracle provides a number of methods to detect and repair corruption within datafiles:

DBVerify 
ANALYZE .. VALIDATE STRUCTURE 
DB_BLOCK_CHECKING. 
DBMS_REPAIR. 
Other Repair Methods. 

DBVerify
DBVerify is an external utility that allows validation of offline datafiles. 
In addition to offline datafiles it can be used to check the validity of backup datafiles:

C:>dbv file=C:\Oracle\oradata\TSH1\system01.dbf feedback=100 blocksize=4096

ANALYZE .. VALIDATE STRUCTURE
The ANALYZE command can be used to verify each data block in the analyzed object. 
If any corruption is detected rows are added to the INVALID_ROWS table:

-- Create the INVALID_ROWS table.
SQL> @C:\Oracle\901\rdbms\admin\UTLVALID.SQL

-- Validate the table structure.
SQL> ANALYZE TABLE scott.emp VALIDATE STRUCTURE;

-- Validate the table structure along with all it's indexes.
SQL> ANALYZE TABLE scott.emp VALIDATE STRUCTURE CASCADE;

-- Validate the index structure.
SQL> ANALYZE INDEX scott.pk_emp VALIDATE STRUCTURE;

DB_BLOCK_CHECKING
When the DB_BLOCK_CHECKING parameter is set to TRUE Oracle performs a walk through of the data 
in the block to check it is self-consistent. Unfortunately block checking can add 
between 1 and 10% overhead to the server. Oracle recommend setting this parameter to TRUE 
if the overhead is acceptable.

DBMS_REPAIR
Unlike the previous methods dicussed, the DBMS_REPAIR package allows you to detect and 
repair corruption. The process requires two administration tables to hold a list of 
corrupt blocks and index keys pointing to those blocks. These are created as follows:

BEGIN
  Dbms_Repair.Admin_Tables (
    table_name => 'REPAIR_TABLE',
    table_type => Dbms_Repair.Repair_Table,
    action => Dbms_Repair.Create_Action,
    tablespace => 'USERS');

  Dbms_Repair.Admin_Tables (
    table_name => 'ORPHAN_KEY_TABLE',
    table_type => Dbms_Repair.Orphan_Table,
    action => Dbms_Repair.Create_Action,
    tablespace => 'USERS');
END;
/

With the administration tables built we are able to check the table of interest using the 
CHECK_OBJECT procedure:

SET SERVEROUTPUT ON
DECLARE 
  v_num_corrupt INT;
BEGIN
  v_num_corrupt := 0;
  Dbms_Repair.Check_Object (
    schema_name => 'SCOTT',
    object_name => 'DEPT',
    repair_table_name => 'REPAIR_TABLE',
    corrupt_count => v_num_corrupt);
  Dbms_Output.Put_Line('number corrupt: ' || TO_CHAR (v_num_corrupt));
END;
/

Assuming the number of corrupt blocks is greater than 0 the CORRUPTION_DESCRIPTION and 
the REPAIR_DESCRIPTION columns of the REPAIR_TABLE can be used to get more information 
about the corruption.

At this point the currupt blocks have been detected, but are not marked as corrupt. 
The FIX_CORRUPT_BLOCKS procedure can be used to mark the blocks as corrupt, allowing them 
to be skipped by DML once the table is in the correct mode:

SET SERVEROUTPUT ON
DECLARE
  v_num_fix INT;
BEGIN 
  v_num_fix := 0;
  Dbms_Repair.Fix_Corrupt_Blocks (
    schema_name => 'SCOTT',
    object_name=> 'DEPT',
    object_type => Dbms_Repair.Table_Object,
    repair_table_name => 'REPAIR_TABLE',
    fix_count=> v_num_fix);
  Dbms_Output.Put_Line('num fix: ' || to_char(v_num_fix));
END;
/

Once the corrupt table blocks have been located and marked all indexes must be checked to see 
if any of their key entries point to a corrupt block. This is done using the 
DUMP_ORPHAN_KEYS procedure:

SET SERVEROUTPUT ON
DECLARE
  v_num_orphans INT;
BEGIN
  v_num_orphans := 0;
  Dbms_Repair.Dump_Orphan_Keys (
    schema_name => 'SCOTT',
    object_name => 'PK_DEPT',
    object_type => Dbms_Repair.Index_Object,
    repair_table_name => 'REPAIR_TABLE',
    orphan_table_name=> 'ORPHAN_KEY_TABLE',
    key_count => v_num_orphans);
  Dbms_Output.Put_Line('orphan key count: ' || to_char(v_num_orphans));
END;
/

If the orphan key count is greater than 0 the index should be rebuilt.

The process of marking the table block as corrupt automatically removes it from the freelists. 
This can prevent freelist access to all blocks following the corrupt block. 
To correct this the freelists must be rebuilt using the REBUILD_FREELISTS procedure:

BEGIN
  Dbms_Repair.Rebuild_Freelists (
    schema_name => 'SCOTT',
    object_name => 'DEPT',
    object_type => Dbms_Repair.Table_Object);
END;
/

The final step in the process is to make sure all DML statements ignore the data blocks 
marked as corrupt. This is done using the SKIP_CORRUPT_BLOCKS procedure:

BEGIN
  Dbms_Repair.Skip_Corrupt_Blocks (
    schema_name => 'SCOTT',
    object_name => 'DEPT',
    object_type => Dbms_Repair.Table_Object,
    flags => Dbms_Repair.Skip_Flag);
END;
/

The SKIP_CORRUPT column in the DBA_TABLES view indicates if this action has been successful.

At this point the table can be used again but you will have to take steps to correct any data 
loss associated with the missing blocks.

Other Repair Methods
Other methods to repair corruption include:

Full database recovery. 
Individual datafile recovery. 
Block media recovery (BMR), available in Oracle9i when using RMAN. 
Recreate the table using the CREATE TABLE .. AS SELECT command, taking care to avoid the 
corrupt blocks by retricting the where clause of the query. 
Drop the table and restore it from a previous export. This may require some manual effort 
to replace missing data. 
Hope this helps. Regards Tim...

Note 7:
-------

If you know the file number and the block number indicating the corruption, you can salvage 
the data in the corrupt table by selecting around the bad blocks.

Set event 10231 in the init.ora file to cause Oracle to skip software- and media-
corrupted blocks when performing full table scans:

Event="10231 trace name context forever, level 10"

Set event 10233 in the init.ora file to cause Oracle to skip software- and media-
corrupted blocks when performing index range scans:

Event="10233 trace name context forever, level 10"


Note 8:
-------

Detecting and reporting data block corruption using the DBMS_REPAIR package:

Note: Note that this event can only be used if the block "wrapper" is marked corrupt. 

Eg: If the block reports ORA-1578. 

1. Create DBMS_REPAIR administration tables:

To Create Repair tables, run the below package.

SQL> EXEC DBMS_REPAIR.ADMIN_TABLES(�REPAIR_ADMIN�, 1,1, �REPAIR_TS�);

Note that table names prefix with �REPAIR_� or �ORPAN_�. If the second variable is 1, it will create 
�REAIR_key tables, if it is 2, then it will create �ORPAN_key tables. 

If the thread variable is 

1 then package performs �create� operations.
2 then package performs �delete� operations.
3 then package performs �drop� operations.

2. Scanning a specific table or Index using the DBMS_REPAIR.CHECK_OBJECT procedure:

In the following example we check the table employee for possible corruption�s that belongs to the schema TEST. 
Let�s assume that we have created our administration tables called REPAIR_ADMIN in schema SYS.

To check the table block corruption use the following procedure:

SQL> VARIABLE A NUMBER;
SQL> EXEC DBMS_REPAIR.CHECK_OBJECT (�TEST�,�EMP�, NULL, 
                               1,�REPAIR_ADMIN�, NULL, NULL, NULL, NULL,:A);
SQL> PRINT A; 

To check which block is corrupted, check in the REPAIR_ADMIN table.
SQL> SELECT * FROM REPAIR_ADMIN;

3. Fixing corrupt block using the DBMS_REPAIR.FIX_CORRUPT_BLOCK procedure:

         SQL> VARIABLE A NUMBER;
         SQL> EXEC DBMS_REPAIR.FIX.CORRUPT_BLOCKS (�TEST�,�EMP�, NULL, 
                                                                            1,�REPARI_ADMIN�, NULL,:A);
         SQL> SELECT MARKED FROM REPAIR_ADMIN;

If u select the EMP table now you still get the error ORA-1578.

4. Skipping corrupt blocks using the DBMS_REPAIR. SKIP_CORRUPT_BLOCK procedure:

SQL> EXEC DBMS_REPAIR. SKIP_CORRUPT.BLOCKS (�TEST�, �EMP�, 1,1);

Notice the verification of running the DBMS_REPAIR tool. You have lost some of data. One main advantage of 
this tool is that you can retrieve the data past the corrupted block. However we have lost some data in the table. 

5. This procedure is useful in identifying orphan keys in indexes that are pointing to corrupt rows of the table:

SQL> EXEC DBMS_REPAIR. DUMP ORPHAN_KEYS (�TEST�,�IDX_EMP�, NULL, 
                                             2, �REPAIR_ADMIN�, �ORPHAN_ADMIN�, NULL,:A);

If u see any records in ORPHAN_ADMIN table you have to drop and re-create the index to avoid any inconsistencies 
in your queries.

6. The last thing you need to do while using the DBMS_REPAIR package is to run the 
DBMS_REPAIR.REBUILD_FREELISTS procedure to reinitialize the free list details in the data dictionary views. 

SQL> EXEC DBMS_REPAIR.REBUILD_FREELISTS (�TEST�,�EMP�, NULL, 1);

NOTE

Setting events 10210, 10211, 10212, and 10225 can be done by adding the following line for each event 
in the init.ora file:

Event = "event_number trace name errorstack forever, level 10"

- When event 10210 is set, the data blocks are checked for corruption by checking their integrity. 
  Data blocks that don't match the format are marked as soft corrupt.

- When event 10211 is set, the index blocks are checked for corruption by checking their integrity. 
  Index blocks that don't match the format are marked as soft corrupt.

- When event 10212 is set, the cluster blocks are checked for corruption by checking their integrity. 
  Cluster blocks that don't match the format are marked as soft corrupt.

- When event 10225 is set, the fet$ and uset$ dictionary tables are checked for corruption 
  by checking their integrity. Blocks that don't match the format are marked as soft corrupt.

- Set event 10231 in the init.ora file to cause Oracle to skip software- and media-corrupted blocks 
  when performing full table scans:

Event="10231 trace name context forever, level 10"

- Set event 10233 in the init.ora file to cause Oracle to skip software- and media-corrupted blocks 
  when performing index range scans:

Event="10233 trace name context forever, level 10"

To dump the Oracle block you can use below command from 8.x on words:

SQL> ALTER SYSTEM DUMP DATAFILE 11 block 9;
This command dumps datablock 9 in datafile11, into USER_DUMP_DEST directory.

Dumping Redo Logs file blocks:

SQL> ALTER SYSTEM DUMP LOGFILE �/usr/oracle8/product/admin/udump/rl. log�;

Rollback segments block corruption, it will cause problems (ORA-1578) while starting up the database.
With support of oracle, can use below under source parameter to startup the database.

_CORRUPTED_ROLLBACK_SEGMENTS=(RBS_1, RBS_2)

DB_BLOCK_COMPUTE_CHECKSUM

This parameter is normally used to debug corruption�s that happen on disk.

The following V$ views contain information about blocks marked logically corrupt: 

V$ BACKUP_CORRUPTION, V$COPY_CORRUPTION

When this parameter is set, while reading a block from disk to catch, oracle will compute the checksum 
again and compares it with the value that is in the block.

If they differ, it indicates that the block is corrupted on disk. Oracle makes the block as corrupt and 
signals an error. There is an overhead involved in setting this parameter. 

DB_BLOCK_CACHE_PROTECT=�TRUE�

Oracle will catch stray writes made by processes in the buffer catch. 

Oracle 9i new RMAN futures:

Obtain the datafile numbers and block numbers for the corrupted blocks. Typically, you obtain this output 
from the standard output, the alert.log, trace files, or a media management interface. 
For example, you may see the following in a trace file: 

ORA-01578: ORACLE data block corrupted (file # 9, block # 13) 
ORA-01110: data file 9: '/oracle/dbs/tbs_91.f' 
ORA-01578: ORACLE data block corrupted (file # 2, block # 19) 
ORA-01110: data file 2: '/oracle/dbs/tbs_21.f' 

$rman target =rman/rman@rmanprod
RMAN> run {
       2> allocate channel ch1 type disk;
       3> blockrecover datafile 9 block 13 datafile 2 block 19;
       4> }

Recovering Data blocks Using Selected Backups:

# restore from backupset 
BLOCKRECOVER DATAFILE 9 BLOCK 13 DATAFILE 2 BLOCK 19 FROM BACKUPSET; 

# restore from datafile image copy 
BLOCKRECOVER DATAFILE 9 BLOCK 13 DATAFILE 2 BLOCK 19 FROM DATAFILECOPY; 

# restore from backupset with tag "mondayAM" 
BLOCKRECOVER DATAFILE 9 BLOCK 13 DATAFILE 2 BLOCK 199 FROM TAG = mondayAM; 

# restore using backups made before one week ago 
BLOCKRECOVER DATAFILE 9 BLOCK 13 DATAFILE 2 BLOCK 19 RESTORE 
UNTIL 'SYSDATE-7'; 

# restore using backups made before SCN 100 
BLOCKRECOVER DATAFILE 9 BLOCK 13 DATAFILE 2 BLOCK 19 RESTORE UNTIL SCN 100; 

# restore using backups made before log sequence 7024 
BLOCKRECOVER DATAFILE 9 BLOCK 13 DATAFILE 2 BLOCK 19 RESTORE 
UNTIL SEQUENCE 7024;


Note 9:
=======

Displayed below are the messages of the selected thread. 


Thread Status: Closed 

From: nitinpawar@birlasunlife.com 23-Feb-05 11:51 
Subject: ORA-01578 on system datafile 

RDBMS Version: Oracle9i Enterprise Edition Release 9.2.0.1.0
Operating System and Version: Windows 2000
Error Number (if applicable): ORA-01578
Product (i.e. SQL*Loader, Import, etc.): 
Product Version: 

ORA-01578 on system datafile

A data block in SYSTEM tablespace datafile is corrupted. 
The error has been occuring since past 7 months. I noticed it recently when I took over the support. 
The database is in archivelog mode. We don't have any old hot backups of the database files. 
Both export and alert log indicate corrupt block to be # 7873, but dbverify declares block #7875 to be corrupt. 
It seems there is no object using the block. 

Following is the extract from the alert log. 

*** 
Corrupt block relative dba: 0x00401ec1 (file 1, block 7873) 
Fractured block found during buffer read 
Data in bad block - 
type: 16 format: 2 rdba: 0x00401ec1 
last change scn: 0x0000.00007389 seq: 0x1 flg: 0x04 
consistency value in tail: 0x23430601 
check value in block header: 0x5684, computed block checksum: 0x396b 
spare1: 0x0, spare2: 0x0, spare3: 0x0 
*** 
Reread of rdba: 0x00401ec1 (file 1, block 7873) found same corrupted data 


From: Oracle, Fahad Abdul Rahman 25-Feb-05 08:18 
Subject: Re : ORA-01578 on system datafile 


Nitin, 
I would suggest you to relocate the system datafiles to a new location on disk and see 
if the corruption is removed. If the issue still persist ,then I would suggest you to log a TAR 
with Oracle Support for further research. 


========================
32. iSQL*Plus and EM 10:
========================


32.1 iSQL*Plus:
===============

Note 1:
-------

How to start iSql*Plus:
-----------------------

lsnrctl start
emctl start dbconsole
isqlplusctl start

http://localhost:5561/isqlplus/


Note 2:
-------


Doc ID: 	Note:281946.1	Content Type: 	TEXT/X-HTML	   
Subject: 	How to Verify that iSQL*Plus 10i is Running and How to Restart the Processes?	Creation Date: 	31-AUG-2004	   
Type: 	HOWTO	Last Revision Date: 	06-APR-2005	   
Status: 	PUBLISHED		 
The information in this document applies to: 
SQL*Plus - Version: 10.1.0
Information in this document applies to any platform.
Goal
How to verify that iSQL*Plus 10i is running, and how to restart the processes? 

Fix
How to Verify that iSQL*Plus is running?
=======================================
UNIX Platform
-------------------
Check whether the iSQL*Plus process is running by entering the following command:

ps -eaf |grep java
The iSQL*Plus process looks something like the following:
oracle 18488 1 0 16:01:30 pts/8 0:36 $ORACLE_HOME/jdk/bin/java -Djava.awt.headless=true 
-Doracle.oc4j.localhome=/ora

Windows Platform
--------------------------
Check whether the iSQL*Plus process is running by opening the Windows services dialog from the Control Panel and checking 
the status of the iSQL*Plus service. 
The iSQL*Plus service will be called "OracleOracle_Home_NameiSQL*Plus".

How to Start and Stop iSQL*Plus?
===============================
UNIX Platform
--------------------
To start iSQL*Plus, enter the command:
$ORACLE_HOME/bin/isqlplusctl start

To stop iSQL*Plus, enter the command:
$ORACLE_HOME/bin/isqlplusctl stop

Windows Platform
--------------------------
Use the Windows service to start and stop iSQL*Plus. 
The service is set to start automatically on installation and when the operating system is started.


Note 3:
-------

 
Doc ID: 	Note:281847.1	Content Type: 	TEXT/X-HTML	   
Subject: 	How do I configure or test iSQL*Plus 10i?	Creation Date: 	30-AUG-2004	   
Type: 	HOWTO	Last Revision Date: 	25-MAR-2005	   
Status: 	PUBLISHED		 
The information in this document applies to: 
SQL*Plus - Version: 10.1.0.0 to 10.1.0
Information in this document applies to any platform.
Goal
How do I configure or test?iSQL*Plus after the install or Oracle Enterprise Edition 10i? 
Fix
iSQL*Plus 10.x is automatically installed and configured with Enterprise Edition 10i. 
At the end of the installation process a file called $ORACLE_HOME/install/readme.txt has the information needed to configure or test iSQL*Plus:
readme.txt example:
----------------
The following J2EE Applications have been deployed and are accessible at the URLs listed below.
Your database configuration files have been installed in?$ORACLE_HOME while other components selected for installation have been installed in $ORACLE_HOME\Db_1.? Be cautious not to accidentally delete these configuration files.
Ultra Search URL:
:5620/ultrasearch"http://<your host name>:5620/ultrasearch
Ultra Search Administration Tool URL:
:5620/ultrasearch/admin"http://<your host name>:5620/ultrasearch/admin
iSQL*Plus URL:
:5560/isqlplus"http://<your host name>:5560/isqlplus
Enteprise Manager 10g Database Control URL:
:5500/em"http://<your host name>:5500/em
----------------
The URL for your iSQL*Plus server is:

:port/isqlplus" target=_blankhttp://<your host name>:port /isqlplus

:port/isqlplus/dba" target=_blankhttp://<your host name>:port /isqlplus/dba

The port number is likely to be 5560.

If this URL does not display the iSQL*Plus log in page, check that iSQL*Plus has been started 
For more additional information about iSQL*Plus please check the following Metalink notes:
Note 281947.1 How to Troubleshoot iSQLPlus 10i when it is not Starting on Unix? 
Note 281946.1?How to Verify that iSQLPlus 10i is Running and How to Restart the Processes? 
Note 283114.1?How to connect as sysdba/sysoper through iSQL*Plus in Oracle 10g 


Note 4:
-------


Doc ID: 	Note:283114.1	Content Type: 	TEXT/X-HTML	   
Subject: 	How to connect as sysdba/sysoper through iSQL*Plus in Oracle 10g	Creation Date: 	16-SEP-2004	   
Type: 	HOWTO	Last Revision Date: 	12-JAN-2005	   
Status: 	MODERATED		 
  
This document is being delivered to you via Oracle Support's Rapid Visibility (RaV) process, and therefore has not been subject to an independent technical review. 	 
The information in this document applies to: 
SQL*Plus - Version: 10.0.1
Information in this document applies to any platform.
Goal
Enabling iSQL*Plus DBA Access. 
Fix
Inorder to connect as SYSDBA through iSQL*Plus you will have to use iSQL*Plus DBA URL. Given below is a sample DBS URL in iSQL*Plus.

" target=_blankhttp://Hostname:Port/isqlplus/dba


Enabling iSQL*Plus DBA Access
=============================

To access the iSQL*Plus DBA URL, you must set up the OC4J user manager. You can set up OC4J to use:

The XML-based provider type, jazn-data.xml

The LDAP-based provider type, Oracle Internet Directory

This document discusses how to set up the iSQL*Plus DBA URL to use the XML-based provider. For information on how to set up the LDAP-based provider, see the Oracle9iAS Containers for J2EE documentation.


To set up the iSQL*Plus DBA URL
=================================

1. Create users for the iSQL*Plus DBA URL.

2. Grant the webDba role to users.

3. Test iSQL*Plus DBA Access

The Oracle JAAS Provider, otherwise known as JAZN (Java AuthoriZatioN), is Oracle's implementation of the Java Authentication and Authorization Service (JAAS). Oracle's JAAS Provider is referred to as JAZN in the remainder of this document. See the Oracle9iAS Containers for J2EE documentation for more information about JAZN, the Oracle JAAS Provider.


Create and Manage Users for the iSQL*Plus DBA URL
=================================================

The actions available to manage users for the iSQL*Plus DBA URL are:

1. Create users

2. List users

3. Grant the webDba role

4. Remove users

5. Revoke the webDba role

6. Change user passwords


You perform these actions from the $ORACLE_HOME/oc4j/j2ee/isqlplus/application-deployments/isqlplus directory.

$JAVA_HOME is the location of your JDK (1.4 or above). It should be set to $ORACLE_HOME/jdk, but you may use another JDK.

admin_password is the password for the iSQL*Plus DBA realm administrator user, admin. The password for the admin user is set to 'welcome' by default. You should change this password as soon as possible.

A JAZN shell option, and a command line option are given for all steps.

To start the JAZN shell, enter:

$JAVA_HOME/bin/java -Djava.security.properties=$ORACLE_HOME/sqlplus/admin/iplus/provider -jar $ORACLE_HOME/oc4j/j2ee/home/jazn.jar -user "iSQL*Plus DBA/admin" -password admin_password -shell
To exit the JAZN shell, enter:

EXIT
Create Users
You can create multiple users who have access to the iSQL*Plus DBA URL. To create a user from the JAZN shell, enter:

JAZN> adduser "iSQL*Plus DBA" username password
To create a user from the command-line, enter:

$JAVA_HOME/bin/java -Djava.security.properties=$ORACLE_HOME/sqlplus/admin/iplus/provider -jar $ORACLE_HOME/oc4j/j2ee/home/jazn.jar -user "iSQL*Plus DBA/admin" -password admin_password -adduser "iSQL*Plus DBA" username password
username and password are the username and password used to log into the iSQL*Plus DBA URL.

To create multiple users, repeat the above command for each user.

List Users
You can confirm that users have been created and added to the iSQL*Plus DBA realm. To confirm the creation of a user using the JAZN shell, enter:

JAZN> listusers "iSQL*Plus DBA"
To confirm the creation of a user using the command-line, enter:

$JAVA_HOME/bin/java -Djava.security.properties=$ORACLE_HOME/sqlplus/admin/iplus/provider -jar $ORACLE_HOME/oc4j/j2ee/home/jazn.jar -user "iSQL*Plus DBA/admin" -password admin_password -listusers "iSQL*Plus DBA"
The usernames you created are displayed.

Grant Users the webDba Role
Each user you created above must be granted access to the webDba role. To grant a user access to the webDba role from the JAZN shell, enter:

JAZN> grantrole webDba "iSQL*Plus DBA" username
To grant a user access to the webDba role from the command-line, enter:

$JAVA_HOME/bin/java -Djava.security.properties=$ORACLE_HOME/sqlplus/admin/iplus/provider -jar $ORACLE_HOME/oc4j/j2ee/home/jazn.jar -user "iSQL*Plus DBA/admin" -password admin_password -grantrole webDba "iSQL*Plus DBA" username
Remove Users
To remove a user using the JAZN shell, enter:

JAZN> remuser "iSQL*Plus DBA" username
To remove a user using the command-line, enter:

$JAVA_HOME/bin/java -Djava.security.properties=$ORACLE_HOME/sqlplus/admin/iplus/provider -jar $ORACLE_HOME/oc4j/j2ee/home/jazn.jar -user "iSQL*Plus DBA/admin" -password admin_password -remuser "iSQL*Plus DBA" username
Revoke the webDba Role
To revoke a user's webDba role from the JAZN shell, enter:

JAZN> revokerole webDba "iSQL*Plus DBA" username
To revoke a user's webDba role from the command-line, enter:

$JAVA_HOME/bin/java -Djava.security.properties=$ORACLE_HOME/sqlplus/admin/iplus/provider -jar $ORACLE_HOME/oc4j/j2ee/home/jazn.jar -user "iSQL*Plus DBA/admin" -password admin_password -revokerole "iSQL*Plus DBA" username
Change User Passwords
To change a user's password from the JAZN shell, enter:

JAZN> setpasswd "iSQL*Plus DBA" username old_password new_password
To change a user's password from the command-line, enter:

$JAVA_HOME/bin/java -Djava.security.properties=$ORACLE_HOME/sqlplus/admin/iplus/provider -jar $ORACLE_HOME/oc4j/j2ee/home/jazn.jar -user "iSQL*Plus DBA/admin" -password admin_password -setpasswd "iSQL*Plus DBA" username old_password new_password
Test iSQL*Plus DBA Access
Test iSQL*Plus DBA access by entering the iSQL*Plus DBA URL in your web browser:

" target=_blankhttp://machine_name.domain:5560/isqlplus/dba
A dialog is displayed requesting authentication for the iSQL*Plus DBA URL. Log in as the user you created above. You may need to restart iSQL*Plus for the changes to take effect. 


Help us improve our service. Please email us your comments for this document. . 


What is a wire protocol ODBC driver?
====================================

A DBMS is written using an application programming interface (API), which is specific to that database. 
For example, an Oracle 9i database has its own version of the API specification (called Net9), 
which must run on each client application.

Developers write applications compliant to the ODBC specification and use ODBC drivers to access the database. 
The ODBC driver communicates with the vendor's native API. Then, the native API passes instructions 
to another vendor-specific low-level API. Finally the wire protocol API communicates with the database.

The wire protocol architecture eliminates the need for the database's native API (for example, Net9), 
so the driver communicates directly to the database through the database's own wire level protocol. 
This effectively removes an entire communication layer.


==============
33. ADDM:
==============

Note 1:
=======


Doc ID: 	Note:250655.1	Content Type: 	TEXT/PLAIN	   
Subject: 	How to use the Automatic Database Diagnostic Monitor	Creation Date: 	09-OCT-2003	   
Type: 	BULLETIN	Last Revision Date: 	10-JUN-2004	   
Status: 	PUBLISHED		 
PURPOSE 
------- 
 
    The purpose of this article is to show an introduction on how to use the  
    Automatic Database Diagnostic Monitor feature. The ADDM consists of  
    functionality built into the Oracle kernel to assist in making tuning an  
    Oracle instance less elaborate. 
 
  
SCOPE & APPLICATION 
------------------- 
 
    Audience         : Oracle developers and DBAs 
    Use              : Using the Automatic Database Diagnostic Monitor feature 
                       as a first step in the creation of an autotunable  
                       database 
    Level of detail  : medium 
    Limitation on use: none 
 
 
USING THE AUTOMATIC DATABASE DIAGNOSTIC MONITOR 
----------------------------------------------- 
 
Introduction: 
------------- 
 
    The Automatic Database Diagnostic Monitor (hereafter called ADDM) is an  
    integral part of the Oracle RDBMS capable of gathering performance  
    statistics and advising on changes to solve any exitsing performance issues  
    measured. 
 
    For this it uses the Automatic Workload Repository ( hereafter called AWR),  
    a repository defined in the database to store database wide usage statistics  
    at fixed size intervals (60 minutes). 
 
    To make use of ADDM, a PL/SQL interface called DBMS_ADVISOR has been  
    implemented. This PL/SQL interface may be called through the supplied  
    $ORACLE_HOME/rdbms/admin/addmrpt.sql script, called directly, or used in  
    combination with the Oracle Enterprise Manager application. Besides this  
    PL/SQL package, a number of views (with names starting with the DBA_ADVISOR_  
    prefix) allow retrieval of the results of any actions performed with the  
    DBMS_ADVISOR API. The preferred way of accessing ADDM is through the  
    Enterprise Manager interface, as it shows a complete performance overview  
    including recommendations on how to solve bottlenecks on a single screen. 
    When accessing ADDM manually, you should consider using the ADDMRPT.SQL  
    script provided with your Oracle release, as it hides the complexities  
    involved in accessing the DBMS_ADVISOR package. 
 
    To use ADDM for advising on how to tune the instance and SQL, you need to 
    make sure that the AWR has been populated with at least 2 sets of  
    performance data. When the STATISTICS_LEVEL is set to TYPICAL or ALL  
    the database will automatically schedule the AWR  
    to be populated at 60 minute intervals.  
 
    When you wish to create performance snapshots outside of the fixed  
    intervals, then you can use the DBMS_WORKLOAD_REPOSITORY package for this,  
    like in: 
        BEGIN 
            DBMS_WORKLOAD_REPOSITORY.CREATE_SNAPSHOT('TYPICAL'); 
        END; 
        / 
 
    The snapshots need be created before and after the action you wish to  
    examine. E.g. when examining a bad performing query, you need to have  
    performance data snapshots from the timestamps before the query was started  
    and after the query finished. 
 
    You may also change the frequency of the snapshots and the duration for which they 
    are saved in the AWR. Use the DBMS_WORKLOAD_REPOSITORY package as in the following example:

    execute DBMS_WORKLOAD_REPOSITORY.MODIFY_SNAPSHOT_SETTINGS(interval=>60,retention=>43200);

Example: 
-------- 
 
    You can use ADDM through the PL/SQL API and query the various advisory views 
    in SQL*Plus to examine how to solve performance issues. 
 
    The example is based on the SCOTT account executing the various tasks. To  
    allow SCOTT to both generate AWR snapshots and sumbit ADDM recommendation  
    jobs, he needs to be granted proper access: 
        CONNECT / AS SYSDBA 
        GRANT ADVISOR TO scott; 
        GRANT SELECT_CATALOG_ROLE TO scott; 
        GRANT EXECUTE ON dbms_workload_repository TO scott; 
 
    Furthermore, the buffer cache size (DB_CACHE_SIZE) has been reduced to 24M. 
 
    The example presented makes use of a table called BIGEMP, residing in the  
    SCOTT schema. The table (containing about 14 million rows) has been created  
    with: 
        CONNECT scott/tiger 
        CREATE TABLE bigemp AS SELECT * FROM emp; 
        ALTER TABLE bigemp MODIFY (empno NUMBER); 
        DECLARE 
            n NUMBER; 
        BEGIN 
            FOR n IN 1..18 
            LOOP 
                INSERT INTO bigemp SELECT * FROM bigemp; 
            END LOOP; 
            COMMIT; 
        END; 
        / 
        UPDATE bigemp SET empno = ROWNUM; 
        COMMIT; 
 
    The next step is to generate a performance data snapshot: 
        EXECUTE dbms_workload_repository.create_snapshot('TYPICAL'); 
 
    Execute a query on the BIGEMP table to generate some load: 
        SELECT * FROM bigemp WHERE deptno = 10; 
 
    After this, generate a second performance snapshot: 
        EXECUTE dbms_workload_repository.create_snapshot('TYPICAL'); 
 
    The easiest way to get the ADDM report is by executing: 
        @?/rdbms/admin/addmrpt 
 
    Running this script will show which snapshots have been generated, asks for  
    the snapshot IDs to be used for generating the report, and will generate the  
    report containing the ADDM findings. 
 
    When you do not want to use the script, you need to submit and execute the  
    ADDM task manually. First, query DBA_HIST_SNAPSHOT to see which snapshots  
    have been created. These snapshots will be used by ADDM to generate  
    recommendations: 
        SELECT * FROM dba_hist_snapshot ORDER BY snap_id; 
 
           SNAP_ID       DBID INSTANCE_NUMBER 
        ---------- ---------- --------------- 
        STARTUP_TIME 
        ----------------------------------------------------------------------- 
        BEGIN_INTERVAL_TIME 
        ----------------------------------------------------------------------- 
        END_INTERVAL_TIME 
        ----------------------------------------------------------------------- 
        FLUSH_ELAPSED 
        ----------------------------------------------------------------------- 
        SNAP_LEVEL ERROR_COUNT 
        ---------- ----------- 
                 1  494687018               1 
        17-NOV-03 09.39.17.000 AM 
        17-NOV-03 09.39.17.000 AM 
        17-NOV-03 09.50.21.389 AM 
        +00000 00:00:06.6 
                 1           0 
                 2  494687018               1 
        17-NOV-03 09.39.17.000 AM 
        17-NOV-03 09.50.21.389 AM 
        17-NOV-03 10.29.35.704 AM 
        +00000 00:00:02.3 
                 1           0 
                 3  494687018               1 
        17-NOV-03 09.39.17.000 AM 
        17-NOV-03 10.29.35.704 AM 
        17-NOV-03 10.35.46.878 AM 
        +00000 00:00:02.1 
                 1           0 
 
    Mark the 2 snapshot IDs (such as the lowest and highest ones) for use in  
    generating recommendations. 
 
    Next, you need to submit and execute the ADDM task manually, using a script  
    similar to: 
        DECLARE 
            task_name VARCHAR2(30) := 'SCOTT_ADDM'; 
            task_desc VARCHAR2(30) := 'ADDM Feature Test'; 
            task_id NUMBER; 
        BEGIN 
    (1)     dbms_advisor.create_task('ADDM', task_id, task_name, task_desc,  
                null); 
    (2)     dbms_advisor.set_task_parameter('SCOTT_ADDM', 'START_SNAPSHOT', 1); 
            dbms_advisor.set_task_parameter('SCOTT_ADDM', 'END_SNAPSHOT', 3); 
            dbms_advisor.set_task_parameter('SCOTT_ADDM', 'INSTANCE', 1); 
            dbms_advisor.set_task_parameter('SCOTT_ADDM', 'DB_ID', 494687018); 
    (3)     dbms_advisor.execute_task('SCOTT_ADDM'); 
        END; 
        / 
 
    Here is the explanation of the steps you need to take to successfully  
    execute an ADDM job: 
    1) The first step is to create the task. For this, you need to specify the  
       name under which the task will be known in the ADDM task system. Along  
       with the name you can provide a more readable description on what the job  
       should do. The task type must be 'ADDM' in order to have it executed in  
       the ADDM environment. 
    2) After having defined the ADDM task, you must define the boundaries within  
       which the task needs to be executed. For this you need to set the  
       starting and ending snapshot IDs, instance ID (especially necessary when  
       running in a RAC environment), and database ID for the newly created job. 
    3) Finally, the task must be executed. 
 
    When querying DBA_ADVISOR_TASKS you see the just created job: 
        SELECT * FROM dba_advisor_tasks; 
 
        OWNER                             TASK_ID TASK_NAME 
        ------------------------------ ---------- ------------------------------ 
        DESCRIPTION 
        ------------------------------------------------------------------------ 
        ADVISOR_NAME                   CREATED   LAST_MODI PARENT_TASK_ID 
        ------------------------------ --------- --------- -------------- 
        PARENT_REC_ID READ_ 
        ------------- ----- 
        SCOTT                                   5 SCOTT_ADDM 
        ADDM Feature Test 
        ADDM                           17-NOV-03 17-NOV-03              0 
                    0 FALSE 
 
    When the job has successfully completed, examine the recommendations made by  
    ADDM by calling the DBMS_ADVISOR.GET_TASK_REPORT() routine, like in: 
        SET LONG 1000000 PAGESIZE 0 LONGCHUNKSIZE 1000 
        COLUMN get_clob FORMAT a80 
        SELECT dbms_advisor.get_task_report('SCOTT_ADDM', 'TEXT', 'TYPICAL') 
        FROM   sys.dual; 
 
    The recommendations supplied should be sufficient to investigate the  
    performance issue, as in: 
 
                  DETAILED ADDM REPORT FOR TASK 'SCOTT_ADDM' WITH ID 5 
                  ---------------------------------------------------- 
 
                      Analysis Period: 17-NOV-2003 from 09:50:21 to 10:35:47 
                 Database ID/Instance: 494687018/1 
                       Snapshot Range: from 1 to 3 
                        Database Time: 4215 seconds 
                Average Database Load: 1.5 active sessions 
 
        ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
 
 
        FINDING 1: 65% impact (2734 seconds) 
        ------------------------------------ 
        PL/SQL execution consumed significant database time. 
 
           RECOMMENDATION 1: SQL Tuning, 65% benefit (2734 seconds) 
              ACTION: Tune the PL/SQL block with SQL_ID fjxa1vp3yhtmr. Refer to  
              the "Tuning PL/SQL Applications" chapter of Oracle's "PL/SQL  
              User's Guide and Reference" 
                 RELEVANT OBJECT: SQL statement with SQL_ID fjxa1vp3yhtmr 
                 BEGIN EMD_NOTIFICATION.QUEUE_READY(:1, :2, :3); END; 
 
        FINDING 2: 35% impact (1456 seconds) 
        ------------------------------------ 
        SQL statements consuming significant database time were found. 
 
           RECOMMENDATION 1: SQL Tuning, 35% benefit (1456 seconds) 
              ACTION: Run SQL Tuning Advisor on the SQL statement with SQL_ID 
                 gt9ahqgd5fmm2. 
                 RELEVANT OBJECT: SQL statement with SQL_ID gt9ahqgd5fmm2 and 
                 PLAN_HASH 547793521 
                 UPDATE bigemp SET empno = ROWNUM 
 
        FINDING 3: 20% impact (836 seconds) 
        ----------------------------------- 
        The throughput of the I/O subsystem was significantly lower than expected. 
 
           RECOMMENDATION 1: Host Configuration, 20% benefit (836 seconds) 
              ACTION: Consider increasing the throughput of the I/O subsystem. 
                 Oracle's recommended solution is to stripe all data file using  
                 the SAME methodology. You might also need to increase the  
                 number of disks for better performance. 
 
           RECOMMENDATION 2: Host Configuration, 14% benefit (584 seconds) 
              ACTION: The performance of file  
                 D:\ORACLE\ORADATA\V1010\UNDOTBS01.DBF was significantly worse  
                 than other files. If striping all files using the SAME  
                 methodology is not possible, consider striping this file over  
                 multiple disks. 
                 RELEVANT OBJECT: database file 
                 "D:\ORACLE\ORADATA\V1010\UNDOTBS01.DBF" 
 
           SYMPTOMS THAT LED TO THE FINDING: 
              Wait class "User I/O" was consuming significant database time.  
              (34% impact [1450 seconds]) 
 
        FINDING 4: 11% impact (447 seconds) 
        ----------------------------------- 
        Undo I/O was a significant portion (33%) of the total database I/O. 
 
           NO RECOMMENDATIONS AVAILABLE 
 
           SYMPTOMS THAT LED TO THE FINDING: 
              The throughput of the I/O subsystem was significantly lower than 
              expected. (20% impact [836 seconds]) 
                 Wait class "User I/O" was consuming significant database time.  
                 (34% impact [1450 seconds]) 
 
        FINDING 5: 9.9% impact (416 seconds) 
        ------------------------------------ 
        Buffer cache writes due to small log files were consuming significant  
        database time. 
 
           RECOMMENDATION 1: DB Configuration, 9.9% benefit (416 seconds) 
              ACTION: Increase the size of the log files to 796 M to hold at  
                 least 20 minutes of redo information. 
 
           SYMPTOMS THAT LED TO THE FINDING: 
              The throughput of the I/O subsystem was significantly lower than 
              expected. (20% impact [836 seconds]) 
                 Wait class "User I/O" was consuming significant database time.  
                 (34% impact [1450 seconds]) 
 
        FINDING 6: 9.2% impact (387 seconds) 
        ------------------------------------ 
        Individual database segments responsible for significant user I/O wait  
        were found. 
 
           RECOMMENDATION 1: Segment Tuning, 7.2% benefit (304 seconds) 
              ACTION: Run "Segment Advisor" on database object "SCOTT.BIGEMP"  
                 with object id 49634. 
                 RELEVANT OBJECT: database object with id 49634 
              ACTION: Investigate application logic involving I/O on database  
                 object "SCOTT.BIGEMP" with object id 49634. 
                 RELEVANT OBJECT: database object with id 49634 
 
           RECOMMENDATION 2: Segment Tuning, 2% benefit (83 seconds) 
              ACTION: Run "Segment Advisor" on database object 
                 "SYSMAN.MGMT_METRICS_RAW_PK" with object id 47084. 
                 RELEVANT OBJECT: database object with id 47084 
              ACTION: Investigate application logic involving I/O on database  
                 object "SYSMAN.MGMT_METRICS_RAW_PK" with object id 47084. 
                 RELEVANT OBJECT: database object with id 47084 
 
           SYMPTOMS THAT LED TO THE FINDING: 
              Wait class "User I/O" was consuming significant database time.  
              (34% impact [1450 seconds]) 
 
        FINDING 7: 8.7% impact (365 seconds) 
        ------------------------------------ 
        Individual SQL statements responsible for significant physical I/O were  
        found. 
 
           RECOMMENDATION 1: SQL Tuning, 8.7% benefit (365 seconds) 
              ACTION: Run SQL Tuning Advisor on the SQL statement with SQL_ID 
                 gt9ahqgd5fmm2. 
                 RELEVANT OBJECT: SQL statement with SQL_ID gt9ahqgd5fmm2 and 
                 PLAN_HASH 547793521 
                 UPDATE bigemp SET empno = ROWNUM 
 
           RECOMMENDATION 2: SQL Tuning, 0% benefit (0 seconds) 
              ACTION: Tune the PL/SQL block with SQL_ID fjxa1vp3yhtmr. Refer to  
                 the "Tuning PL/SQL Applications" chapter of Oracle's "PL/SQL  
                 User's Guide and Reference" 
                 RELEVANT OBJECT: SQL statement with SQL_ID fjxa1vp3yhtmr 
                 BEGIN EMD_NOTIFICATION.QUEUE_READY(:1, :2, :3); END; 
 
           SYMPTOMS THAT LED TO THE FINDING: 
              The throughput of the I/O subsystem was significantly lower than 
              expected. (20% impact [836 seconds]) 
                 Wait class "User I/O" was consuming significant database time.  
                 (34% impact [1450 seconds]) 
 
        FINDING 8: 8.3% impact (348 seconds) 
        ------------------------------------ 
        Wait class "Configuration" was consuming significant database time. 
 
           NO RECOMMENDATIONS AVAILABLE 
 
           ADDITIONAL INFORMATION: Waits for free buffers were not consuming 
              significant database time. 
              Waits for archiver processes were not consuming significant  
              database time. 
              Log file switch operations were not consuming significant database  
              time while waiting for checkpoint completion. 
              Log buffer space waits were not consuming significant database  
              time. 
              High watermark (HW) enqueue waits were not consuming significant 
              database time. 
              Space Transaction (ST) enqueue waits were not consuming  
              significant database time. 
              ITL enqueue waits were not consuming significant database time. 
 
 
        ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
 
                  ADDITIONAL INFORMATION 
                  ---------------------- 
 
        An explanation of the terminology used in this report is available when  
        you run the report with the 'ALL' level of detail. 
 
        The analysis of I/O performance is based on the default assumption that  
        the average read time for one database block is 5000 micro-seconds. 
 
        Wait class "Administrative" was not consuming significant database time. 
        Wait class "Application" was not consuming significant database time. 
        Wait class "Cluster" was not consuming significant database time. 
        Wait class "Commit" was not consuming significant database time. 
        Wait class "Concurrency" was not consuming significant database time. 
        CPU was not a bottleneck for the instance. 
        Wait class "Network" was not consuming significant database time. 
        Wait class "Scheduler" was not consuming significant database time. 
        Wait class "Other" was not consuming significant database time. 
 
    =============================    END OF ADDM REPORT ====================== 
 
    ADDM points out which events cause the performance problems to occur and  
    suggests directions to follow to fix these bottlenecks. The ADDM  
    recommendations show amongst others that the query on BIGEMP needs to be  
    examined; in this case it suggests to run the Segment Advisor to check  
    whether the data segment is fragmented or not; it also advices to check  
    the application logic involved in accessing the BIGEMP table. Furthermore,  
    it shows the system suffers from I/O problems (which is in this example  
    caused by not using SAME and placing all database files on a single disk  
    partition).  
 
    The findings are sorted descending by impact: the issues causing the  
    greatest performance problems are listed at the top of the report. Solving  
    these issues will result in the greatest performance benefits. Also, in the last 
    section of the report ADDM indicates the areas that are not representing 
    a problem for the performance of the instance 
 
    In this example the database is rather idle. As such the Enterprise Manager  
    notification job (which runs frequently) is listed at the top. You need not  
    worry about this job at all. 
 
    Please notice that the output of the last query may differ depending on what 
    took place on your database at the time the ADDM recommendations were  
    generated. 
 
 
RELATED DOCUMENTS 
----------------- 
 
Oracle10g Database Performance Guide Release 1 (10.1) 
Oracle10g Database Reference Release 1 (10.1) 
PL/SQL Packages and Types Reference Release 1 (10.1) 


Note 2:
=======


To determine which segments will benefit from segment shrink, you can invoke Segment Advisor.

alter table hr.employees enable row movement;

After the Segment Advisor has been invoked to give recommendations, the findings are available
in BDA_ADVISOR_FINDINGS and DBA_ADVISOR_RECOMMENDATIONS.

variable task_id number;

declare
   name varchar2(100);
   desc varchar2(500);
   obj_id number;
begin
   name:='';
   desc:='Check HR.EMPLOYEE';
   DBMS_ADVISOR.CREATE_TASK('Segment Advisor', :task_id, name, descr, NULL);
   DBMS_ADVISOR.CREATE_OBJECT(name,'TABLE','HR','EMPLOYEES', NULL,NULL,obj_id);
   DBMS_ADVISOR.SET_TASK_PARAMETER(name,'RECOMMEND_ALL','TRUE');
   DBMS_ADVISOR.EXECUTE_TASK(name);
end;


PL/SQL procedure successfully completed.


print task_id

TASK_ID
-------
6


SELECT owner, task_id, task_name, type, message, more_info
FROM DBA_ADVISOR_FINDINGS
WHERE task_id=6;

OWNER   TASK_ID    TASK_NAME    TYPE        MESSAGE
-----   -------    ---------    ----        --------------------------------------------------
RJB           6    TASK_00003   INFORMATION Perform shrink, estimated savings is 107602 bytes.


In DBA_ADVISOR_ACTIONS, you can even find the exact SQL statement to shrink the hr.employees segment.


alter table hr.employees shrink space;


==============================
34. ASM and RAC in Oracle 10g:
==============================


34.1 ASM
========


========
Note 1:
========

Automatic Storage Management (ASM) in Oracle Database 10g


With ASM, Automatic Storage Management, there is a separate lightweight 10g database involved.
This ASM database (+ASM), contains all metadata about the ASM system.
It also acts as the interface between the regular database and the filesystems.

ASM will provide for presentation and implementation of a special filesystem, on which a number
of redundancy/availability and performance features are implemented.

In addition to the normal database background processes like CKPT, DBWR, LGWR, SMON, and PMON, 
an ASM instance uses at least two additional background processes to manage data storage operations. 
The Rebalancer process, RBAL, coordinates the rebalance activity for ASM disk groups, 
and the Actual ReBalance processes, ARBn, handle the actual rebalance of data extent movements. 
There are usually several ARB background processes (ARB0, ARB1, and so forth). 

Every database instance that uses ASM for file storage, will also need two new processes. 
The Rebalancer background process (RBAL) handles global opens of all ASM disks in the ASM Disk Groups, 
while the ASM Bridge process (ASMB) connects as a foreground process into the ASM instance when the 
regular database instance starts. ASMB facilitates communication between the ASM instance and 
the regular database, including handling physical file changes like data file creation and deletion. 

ASMB exchanges messages between both servers for statistics update and instance health validation. 
These two processes are automatically started by the database instance when a new Oracle file type - 
for example, a tablespace's datafile -- is created on an ASM disk group. When an ASM instance mounts 
a disk group, it registers the disk group and connect string with Group Services. The database instance 
knows the name of the disk group, and can therefore use it to locate connect information for 
the correct ASM instance.


========
Note 2: 
========

Some terminology in RAC:

CRS cluster ready services - Clusterware:

For Oracle10g on Linux and Windows-based platforms, CRS co-exists with but does not inter-operate 
with vendor clusterware. You may use vendor clusterware for all UNIX-based operating systems 
except for Linux. Even though, many of the Unix platforms have their own clusterware products, 
you need to use the CRS software to provide the HA support services. CRS (cluster ready services) 
supports services and workload management and helps to maintain the continuous availability of the services. 
CRS also manages resources such as virtual IP (VIP) address for the node and the global services daemon.
Note that the "Voting disks" and the "Oracle Cluster Registry", are regarded as part of the CRS.

OCR:

The Oracle Cluster Registry (OCR) contains cluster and database configuration information 
for Real Application Clusters Cluster Ready Services (CRS), including the list of nodes 
in the cluster database, the CRS application, resource profiles, and the authorizations for 
the Event Manager (EVM). The OCR can reside in a file on a cluster file system or on a shared raw device. 
When you install Real Application Clusters, you specify the location of the OCR.

OCFS:

OCFS is a shared disk cluster filesystem. Version 1 released for Linux is specifically designed 
to alleviate the need for manag-ing raw devices. It can contain all the 
oracle datafiles, archive log files and controlfiles.  It is however not designed as a 
general purpose filesystem.

OCFS2 is the next generation of the Oracle Cluster File System for Linux. It is an extent based, 
POSIX compliant file system. Unlike the previous release (OCFS), OCFS2 is a general-purpose 
file system that can be used for shared Oracle home installations making management of 
Oracle Real Application Cluster (RAC) installations even easier. Among the new features and benefits are: 

Node and architecture local files using Context Dependent Symbolic Links (CDSL) 
Network based pluggable DLM 
Improved journaling / node recovery using the Linux Kernel "JBD" subsystem 
Improved performance of meta-data operations (space allocation, locking, etc). 
Improved data caching / locking (for files such as oracle binaries, libraries, etc) 

- OCFS1 does NOT support a shared Oracle Home
- OCFS2 does     support a shared Oracle Home

Though ASM appears to be the intended replacement for Oracle Cluster File System (OCFS) 
for the Real Applications Cluster (RAC).
ASM supports Oracle Real Application Clusters (RAC), so there is no need 
for a separate Cluster LVM or a Cluster File System.

So it boils down to:
- You use or OCFS2, or ASM for your database files.

Storage Option				Oracle Clusterware	Database	Recovery area
--------------				------------------	--------	-------------
Automatic Storage Management 		No 			Yes 		Yes 
Cluster file system (OCFS) 		Yes 			Yes 		Yes 
Shared raw storage 			Yes 			Yes 		No 


========
Note 3:
========

Automatic Storage Management (ASM) simplifies database administration. It eliminates the need for you, 
as a DBA, to directly manage potentially thousands of Oracle database files. It does this by enabling 
you to create disk groups, which are comprised of disks and the files that reside on them. You only need 
to manage a small number of disk groups.

In the SQL statements that you use for creating database structures such as tablespaces, redo log and 
archive log files, and control files, you specify file location in terms of disk groups. 
Automatic Storage Management then creates and manages the associated underlying files for you.

Automatic Storage Management extends the power of Oracle-managed files. With Oracle-managed files, 
files are created and managed automatically for you, but with Automatic Storage Management you get 
the additional benefits of features such as mirroring and striping.
The primary component of Automatic Storage Management is the disk group. You configure Automatic Storage Management 
by creating disk groups, which, in your database instance, can then be specified as the default 
location for files created in the database. Oracle provides SQL statements that create and manage 
disk groups, their contents, and their metadata.

A disk group consists of a grouping of disks that are managed together as a unit. These disks are referred 
to as ASM disks. Files written on ASM disks are ASM files, whose names are automatically generated 
by Automatic Storage Management. You can specify user-friendly alias names for ASM files, 
but you must create a hierarchical directory structure for these alias names.

You can affect how Automatic Storage Management places files on disks by specifying failure groups. 
Failure groups define disks that share components, such that if one fails then other disks sharing 
the component might also fail. An example of what you might define as a failure group would be a set 
of SCSI disks sharing the same SCSI controller. Failure groups are used to determine which ASM disks 
to use for storing redundant data. For example, if two-way mirroring is specified for a file, 
then redundant copies of file extents must be stored in separate failure groups.


If you would take a look at the v$datafile, v$logfile, and v$controlfile of the regular Database,
you would see information like in the following example:

SQL> select file#, name from v$datafile;

1  +DATA1/rac0/datafile/system.256.1
2  +DATA1/rac0/datafile/undotbs.258.1
3  +DATA1/rac0/datafile/sysaux.257.1
4  +DATA1/rac0/datafile/users.259.1
5  +DATA1/rac0/datafile/example.269.1


SQL> select name from v$controlfile;

+DATA1/rac0/controlfile/current.261.3
+DATA1/rac0/controlfile/current.260.3


-- Initialization Parameters (init.ora or SPFILE) for ASM Instances

The following initialization parameters relate to an ASM instance. Parameters that start with ASM_ 
cannot be set in database instances.

Name             Description 
INSTANCE_TYPE    Must be set to INSTANCE_TYPE = ASM. 
                 Note: This is the only required parameter. All other parameters take suitable defaults 
                 for most environments.
 
DB_UNIQUE_NAME   Unique name for this group of ASM instances within the cluster or on a node. 
Default: +ASM    (Needs to be modified only if trying to run multiple ASM instances on the same node)
 
ASM_POWER_LIMIT  The maximum power on an ASM instance for disk rebalancing. 
Default: 1       Can range from 1 to 11. 1 is the lowest priority. 

See Also: "Tuning Rebalance Operations"
 
ASM_DISKSTRING   Limits the set of disks that Automatic Storage Management considers for discovery. 
Default: NULL    (This default causes ASM to find all of the disks in a platform-specific location to which 
                  it has read/write access.).
                  Example: /dev/raw/*

ASM_DISKGROUPS   Lists the names of disk groups to be mounted by an ASM instance at startup, 
                 or when the ALTER DISKGROUP ALL MOUNT statement is used. 
Default: NULL    (If this parameter is not specified, then no disk groups are mounted.)

Note: This parameter is dynamic and if you are using a server parameter file (SPFILE), then you should 
rarely need to manually alter this value. Automatic Storage Management automatically adds a disk group 
to this parameter when a disk group is successfully mounted, and automatically removes a disk group that 
is specifically dismounted. However, when using a traditional text initialization parameter file, 
remember to edit the initialization parameter file to add the name of any disk group that you want automatically 
mounted at instance startup, and remove the name of any disk group that you no longer want automatically mounted.


-- ASM Views:

The ASM configuration can be viewed using the V$ASM_% views, which often contain different information 
depending on whether they are queried from the ASM instance, or a dependant database instance.

Viewing ASM Instance Information Via SQL Queries
Finally, there are several dynamic and data dictionary views available to view an ASM configuration from within 
the ASM instance itself:

ASM Dynamic Views: FROM ASM Instance Information
 
View Name        Description
 
V$ASM_ALIAS      Shows every alias for every disk group mounted by the ASM instance
 
V$ASM_CLIENT     Shows which database instance(s) are using any ASM disk groups that are being mounted by this ASM instance
 
V$ASM_DISK       Lists each disk discovered by the ASM instance, including disks that are not part of any ASM disk group
 
V$ASM_DISKGROUP  Describes information about ASM disk groups mounted by the ASM instance
 
V$ASM_FILE       Lists each ASM file in every ASM disk group mounted by the ASM instance
 
V$ASM_OPERATION  Like its counterpart, V$SESSION_LONGOPS, it shows each long-running ASM operation in the ASM instance
 
V$ASM_TEMPLATE   Lists each template present in every ASM disk group mounted by the ASM instance
 
 
-- Managing disk groups

The SQL statements introduced in this section are only available in an ASM instance. 
You must first start the ASM instance. 

Creating disk group examples:

Example 1:
----------

Creating a Disk Group: Example

The following examples assume that the ASM_DISKSTRING is set to '/devices/*'. Assume the following:

ASM disk discovery identifies the following disks in directory /devices.

/devices/diska1 
/devices/diska2 
/devices/diska3 
/devices/diska4 
/devices/diskb1 
/devices/diskb2 
/devices/diskb3 
/devices/diskb4

The disks diska1 - diska4 are on a separate SCSI controller from disks diskb1 - diskb4.


The following SQL*Plus session illustrates starting an ASM instance and creating a disk group named dgroup1.

% SQLPLUS /NOLOG
SQL> CONNECT / AS SYSDBA

SQL> CREATE DISKGROUP dgroup1 NORMAL REDUNDANCY 
  2  FAILGROUP controller1 DISK
  3 '/devices/diska1',
  4 '/devices/diska2',
  5 '/devices/diska3',
  6 '/devices/diska4',
  7 FAILGROUP controller2 DISK
  8 '/devices/diskb1',
  9 '/devices/diskb2',
 10 '/devices/diskb3',
 11 '/devices/diskb4';

In this example, dgroup1 is composed of eight disks that are defined as belonging to either 
failure group controller1 or controller2. Since NORMAL REDUNDANCY level is specified for the disk group, 
then Automatic Storage Management provides redundancy for all files created in dgroup1 according to the 
attributes specified in the disk group templates.

For example, in the system default template shown in the table in "Managing Disk Group Templates", 
normal redundancy for the online redo log files (ONLINELOG template) is two-way mirroring. This means that 
when one copy of a redo log file extent is written to a disk in failure group controller1, a mirrored copy 
of the file extent is written to a disk in failure group controller2. You can see that to support normal 
redundancy level, at least two failure groups must be defined.

Since no NAME clauses are provided for any of the disks being included in the disk group, 
the disks are assigned the names of dgroup1_0001, dgroup1_0002, ..., dgroup1_0008.


Example 2:
----------

CREATE DISKGROUP disk_group_1 NORMAL REDUNDANCY
  FAILGROUP failure_group_1 DISK
    '/devices/diska1' NAME diska1,
    '/devices/diska2' NAME diska2,
  FAILGROUP failure_group_2 DISK
    '/devices/diskb1' NAME diskb1,
    '/devices/diskb2' NAME diskb2;


Example 3:
----------

At some point in using OUI in installing the software, and creating a database, you will
see the following screen:

----------------------------------------------------
|SPECIFY Database File Storage Option               |
|                                                   |
|  o File system                                    |
|    Specify Database file location: #########      |
|                                                   |
|  o Automatic Storage Management (ASM)             |
|                                                   |
|  o Raw Devices                                    |
|                                                   |
|    Specify Raw Devices mapping file: ##########   |
----------------------------------------------------

Suppose that you have on a Linux machine the following raw disk devices:

/dev/raw/raw1	8GB
/dev/raw/raw2	8GB
/dev/raw/raw3	6GB
/dev/raw/raw4	6GB
/dev/raw/raw5	6GB
/dev/raw/raw6	6GB

Then you can choose ASM in the upper screen, and see the following screen, where
you can create the initial diskgroup and assign disks to it:

-----------------------------------------------------
| Configure Automatic Storage Management              |
|                                                     |
| Disk Group Name:  data1                             |
|                                                     |
| Redundancy                                          |
| o High  o Normal  o External                        |             
|                                                     |
| Add member Disks                                    |
| |--------------------------------                   |
| | select  Disk Path              |                  |
| |[#]     /dev/raw/raw1           |                  |
| |[#]     /dev/raw/raw2           |                  | 
| |[ ]     /dev/raw/raw3           |                  |
| |[ ]     /dev/raw/raw4           |                  |
|  --------------------------------                   |
|                                                     |
-----------------------------------------------------


-- Mounting and Dismounting Disk Groups

Disk groups that are specified in the ASM_DISKGROUPS initialization parameter are mounted automatically 
at ASM instance startup. This makes them available to all database instances running on the same node 
as Automatic Storage Management. The disk groups are dismounted at ASM instance shutdown. 
Automatic Storage Management also automatically mounts a disk group when you initially create it, 
and dismounts a disk group if you drop it.

There may be times that you want to mount or dismount disk groups manually. For these actions use 
the ALTER DISKGROUP ... MOUNT or ALTER DISKGROUP ... DISMOUNT statement. You can mount or dismount 
disk groups by name, or specify ALL.

If you try to dismount a disk group that contains open files, the statement will fail, unless you also
specify the FORCE clause.


Example

The following statement dismounts all disk groups that are currently mounted to the ASM instance:

ALTER DISKGROUP ALL DISMOUNT;


The following statement mounts disk group dgroup1:

ALTER DISKGROUP dgroup1 MOUNT; 


========
Note 4:
========


-- Installing Oracle ASMLib for Linux:

ASMLib is a support library for the Automatic Storage Management feature of Oracle Database 10g. 
This document is a set of tips for installing the Linux specific ASM library and its assocated driver. 
This library is provide to enable ASM I/O to Linux disks without the limitations of the 
standard Unix I/O API. The steps below are steps that the system administrator must follow. 

The ASMLib software is available from the Oracle Technology Network. Go to ASMLib download page 
and follow the link for your platform. 
You will see 4-6 packages for your Linux platform. 

-The oracleasmlib package provides the actual ASM library. 
-The oracleasm-support package provides the utilities used to get the ASM driver 
 up and running. Both of these packages need to be installed. 
-The remaining packages provide the kernel driver for the ASM library. Each package provides 
 the driver for a different kernel. You must install the appropriate package for the kernel you are running. 
 Use the "uname -r command to determine the version of the kernel. The oracleasm kerel driver package 
 will have that version string in its name. For example, if you were running Red Hat Enterprise Linux 4 AS, 
 and the kernel you were using was the 2.6.9-5.0.5.ELsmp kernel, you would choose the 
 oracleasm-2.6.9-5.0.5-ELsmp package. 

So, for example, to install these packages on RHEL4 on an Intel x86 machine,  you might use the command: 

rpm -Uvh oracleasm-support-2.0.0-1.i386.rpm \
    oracleasm-lib-2.0.0-1.i386.rpm \
    oracleasm-2.6.9-5.0.5-ELsmp-2.0.0-1.i686.rpm

Once the command completes, ASMLib is now installed on the system. 

-- Configuring ASMLib: 
 
Now that the ASMLib software is installed, a few steps have to be taken by the system administrator 
to make the ASM driver available. The ASM driver needs to be loaded, and the driver filesystem needs 
to be mounted. This is taken care of by the initialization script, "/etc/init.d/oracleasm". 
Run the "/etc/init.d/oracleasm" script with the "configure" option. It will ask for the user and group 
that default to owning the ASM driver access point. If the database was running as the 'oracle' user 
and the 'dba' group, the output would look like this: 

[root@ca-test1 /]# /etc/init.d/oracleasm configure
  Configuring the Oracle ASM library driver.
 
  This will configure the on-boot properties of the Oracle ASM library
  driver.  The following questions will determine whether the driver is
  loaded on boot and what permissions it will have.  The current values
  will be shown in brackets ('[]').  Hitting  without typing an
  answer will keep that current value.  Ctrl-C will abort.

  Default user to own the driver interface []: oracle
  Default group to own the driver interface []: dba
  Start Oracle ASM library driver on boot (y/n) [n]: y
  Fix permissions of Oracle ASM disks on boot (y/n) [y]: y
  Writing Oracle ASM library driver configuration            [  OK  ]
  Creating /dev/oracleasm mount point                        [  OK  ]
  Loading module "oracleasm"                                 [  OK  ]
  Mounting ASMlib driver filesystem                          [  OK  ]
  Scanning system for ASM disks                              [  OK  ]
 

This should load the oracleasm.o driver module and mount the ASM driver filesystem. 
By selecting enabled = 'y' during the configuration, the system will always load the module 
and mount the filesystem on boot. 
The automatic start can be enabled or disabled with the 'enable' and 'disable' options 
to /etc/init.d/oracleasm: 

  [root@ca-test1 /]# /etc/init.d/oracleasm disable
  Writing Oracle ASM library driver configuration            [  OK  ]
  Unmounting ASMlib driver filesystem                        [  OK  ]
  Unloading module "oracleasm"                               [  OK  ]

  [root@ca-test1 /]# /etc/init.d/oracleasm enable
  Writing Oracle ASM library driver configuration            [  OK  ]
  Loading module "oracleasm"                                 [  OK  ]
  Mounting ASMlib driver filesystem                          [  OK  ]
  Scanning system for ASM disks                              [  OK  ]


-- Making Disks Available to ASMLib: 
 
The system administrator has one last task. Every disk that ASMLib is going to be accessing 
needs to be made available. This is accomplished by creating an ASM disk. The /etc/init.d/oracleasm script 
is again used for this task: 

  [root@ca-test1 /]# /etc/init.d/oracleasm createdisk VOL1 /dev/sdg1
  Creating Oracle ASM disk "VOL1"                            [  OK  ]

 
Disk names are ASCII capital letters, numbers, and underscores. They must start with a letter. 
Disks that are no longer used by ASM can be unmarked as well: 

  [root@ca-test1 /]# /etc/init.d/oracleasm deletedisk VOL1
  Deleting Oracle ASM disk "VOL1"                            [  OK  ]

Any operating system disk can be queried to see if it is used by ASM: 

  [root@ca-test1 /]# /etc/init.d/oracleasm querydisk /dev/sdg1
  Checking if device "/dev/sdg1" is an Oracle ASM disk        [  OK  ]
  [root@ca-test1 /]# /etc/init.d/oracleasm querydisk /dev/sdh1
  Checking if device "/dev/sdh1" is an Oracle ASM disk        [FAILED]

Existing disks can be listed and queried: 

  [root@ca-test1 /]# /etc/init.d/oracleasm listdisks
  VOL1
  VOL2
  VOL3
  [root@ca-test1 /]# /etc/init.d/oracleasm querydisk VOL1
  Checking for ASM disk "VOL1"                               [  OK  ]

When a disk is added to a RAC setup, the other nodes need to be notified about it. 
Run the 'createdisk' command on one node, and then run 'scandisks' on every other node: 

  [root@ca-test1 /]# /etc/init.d/oracleasm scandisks
  Scanning system for ASM disks                              [  OK  ]


-- Discovery Strings for Linux ASMLib: 
 
ASMLib uses discovery strings to determine what disks ASM is asking for. The generic Linux ASMLib 
uses glob strings. The string must be prefixed with "ORCL:". Disks are specified by name. 
A disk created with the name "VOL1" can be discovered in ASM via the discovery string "ORCL:VOL1". 
Similarly, all disks that start with the string "VOL" can be queried with the discovery string "ORCL:VOL*". 
Disks cannot be discovered with path names in the discovery string. If the prefix is missing, 
the generic Linux ASMLib will ignore the discovery string completely, expecting that it is intended 
for a different ASMLib. The only exception is the empty string (""), which is considered a full wildcard. 
This is precisely equivalent to the discovery string "ORCL:*". 

NOTE: Once you mark your disks with Linux ASMLib, Oracle Database 10g R1 (10.1) OUI will not be able 
to discover your disks. It is recommended that you complete a Software Only install and then use DBCA 
to create your database (or use the custom install). 

 
========
Note 5:
========

Automatic Storage Management (ASM) is a new feature that has be introduced in Oracle 10g to 
simplify the storage of Oracle datafiles, controlfiles and logfiles. 


- Overview of Automatic Storage Management (ASM) 
- Initialization Parameters and ASM Instance Creation 
- Startup and Shutdown of ASM Instances 
- Administering ASM Disk Groups 
- Disks 
- Templates 
- Directories 
- Aliases 
- Files 
- Checking Metadata 
- ASM Filenames 
- ASM Views 
- SQL and ASM 
- Migrating to ASM Using RMAN 

Overview of Automatic Storage Management (ASM)
Automatic Storage Management (ASM) simplifies administration of Oracle related files by allowing 
the administrator to reference disk groups rather than individual disks and files, which are managed by ASM. 
The ASM functionality is an extention of the Oracle Managed Files (OMF) functionality that also includes 
striping and mirroring to provide balanced and secure storage. The new ASM functionality can be used in 
combination with existing raw and cooked file systems, along with OMF and manually managed files.

The ASM functionality is controlled by an ASM instance. This is not a full database instance, 
just the memory structures and as such is very small and lightweight.

The main components of ASM are disk groups, each of which comprise of several physical disks that are controlled 
as a single unit. The physical disks are known as ASM disks, while the files that reside on the disks 
are know as ASM files. The locations and names for the files are controlled by ASM, but user-friendly aliases and directory structures can be defined for ease of reference.

The level of redundancy and the granularity of the striping can be controlled using templates. 
Default templates are provided for each file type stored by ASM, but additional templates can be defined as needed.

Failure groups are defined within a disk group to support the required level of redundancy. 
For two-way mirroring you would expect a disk group to contain two failure groups so individual files 
are written to two locations.

In summary ASM provides the following functionality:

Manages groups of disks, called disk groups. 
Manages disk redundancy within a disk group. 
Provides near-optimal I/O balancing without any manual tuning. 
Enables management of database objects without specifying mount points and filenames. 
Supports large files. 
Initialization Parameters and ASM Instance Creation

The init.ora / spfile initialization parameters that are of specific interest for an ASM instance are:

INSTANCE_TYPE   - Set to ASM or RDBMS depending on the instance type. The default is RDBMS. 
DB_UNIQUE_NAME  - Specifies a globally unique name for the database. This defaults to +ASM but 
                  must be altered if you intend to run multiple ASM instances. 
ASM_POWER_LIMIT - The maximum power for a rebalancing operation on an ASM instance. The valid values range 
                  from 1 to 11, with 1 being the default. The higher the limit the more resources are allocated 
                  resulting in faster rebalancing operations. This value is also used as the default 
                  when the POWER clause is omitted from a rebalance operation. 
ASM_DISKGROUPS  - The list of disk groups that should be mounted by an ASM instance during instance startup, 
                  or by the ALTER DISKGROUP ALL MOUNT statement. ASM configuration changes are automatically 
                  reflected in this parameter. 
ASM_DISKSTRING -  Specifies a value that can be used to limit the disks considered for discovery. 
                  Altering the default value may improve the speed of disk group mount time and the speed 
                  of adding a disk to a disk group. Changing the parameter to a value which prevents 
                  the discovery of already mounted disks results in an error. The default value is NULL 
                  allowing all suitable disks to be considered. 

Incorrect usage of parameters in ASM or RDBMS instances result in ORA-15021 errors.

To create an ASM instance first create a file called init+ASM.ora in the /tmp directory 
containing the following information.

INSTANCE_TYPE=ASM 

Next, using SQL*Plus connect to the ide instance.

export ORACLE_SID=+ASM

sqlplus / as sysdba

Create an spfile using the contents of the init+ASM.ora file.

SQL> CREATE SPFILE FROM PFILE='/tmp/init+ASM.ora';

File created.

Finally, start the instance with the NOMOUNT option.

SQL> startup nomount
ASM instance started

Total System Global Area  125829120 bytes
Fixed Size                  1301456 bytes
Variable Size             124527664 bytes
Database Buffers                  0 bytes
Redo Buffers                      0 bytes
SQL>

The ASM instance is now ready to use for creating and mounting disk groups. 
To shutdown the ASM instance issue the following command.

SQL> shutdown
ASM instance shutdown
SQL>

Once an ASM instance is present disk groups can be used for the following parameters 
in database instances (INSTANCE_TYPE=RDBMS) to allow ASM file creation:

DB_CREATE_FILE_DEST 
DB_CREATE_ONLINE_LOG_DEST_n 
DB_RECOVERY_FILE_DEST 
CONTROL_FILES 
LOG_ARCHIVE_DEST_n 
LOG_ARCHIVE_DEST 
STANDBY_ARCHIVE_DEST 

Startup and Shutdown of ASM Instances
ASM instance are started and stopped in a similar way to normal database instances. The options 
for the STARTUP command are:

FORCE - Performs a SHUTDOWN ABORT before restarting the ASM instance. 
MOUNT - Starts the ASM instance and mounts the disk groups specified by the ASM_DISKGROUPS parameter. 
NOMOUNT - Starts the ASM instance without mounting any disk groups. 
OPEN - This is not a valid option for an ASM instance. 

The options for the SHUTDOWN command are:

NORMAL - The ASM instance waits for all connected ASM instances and SQL sessions to exit then shuts down. 
IMMEDIATE - The ASM instance waits for any SQL transactions to complete then shuts down. 
            It doesn't wait for sessions to exit. 
TRANSACTIONAL - Same as IMMEDIATE. 
ABORT - The ASM instance shuts down instantly. 

Aministering ASM Disk Groups

Disk groups are created using the CREATE DISKGROUP statement. This statement allows you to specify 
the level of redundancy:

NORMAL REDUNDANCY   - Two-way mirroring, requiring two failure groups. 
HIGH REDUNDANCY     - Three-way mirroring, requiring three failure groups. 
EXTERNAL REDUNDANCY - No mirroring for disks that are already protected using hardware mirroring or RAID. 

In addition failure groups and preferred names for disks can be defined. If the NAME clause is omitted 
the disks are given a system generated name like "disk_group_1_0001". The FORCE option can be used 
to move a disk from another disk group into this one.

CREATE DISKGROUP disk_group_1 NORMAL REDUNDANCY
  FAILGROUP failure_group_1 DISK
    '/devices/diska1' NAME diska1,
    '/devices/diska2' NAME diska2,
  FAILGROUP failure_group_2 DISK
    '/devices/diskb1' NAME diskb1,
    '/devices/diskb2' NAME diskb2;

Disk groups can be deleted using the DROP DISKGROUP statement.

DROP DISKGROUP disk_group_1 INCLUDING CONTENTS;

Disks can be added or removed from disk groups using the ALTER DISKGROUP statement. 
Remember that the wildcard "*" can be used to reference disks so long as the resulting string does not match 
a disk already used by an existing disk group.

-- Add disks.
ALTER DISKGROUP disk_group_1 ADD DISK
  '/devices/disk*3',
  '/devices/disk*4';

-- Drop a disk.
ALTER DISKGROUP disk_group_1 DROP DISK diska2;

Disks can be resized using the RESIZE clause of the ALTER DISKGROUP statement. 
The statement can be used to resize individual disks, all disks in a failure group or all disks 
in the disk group. If the SIZE clause is omitted the disks are resized to the size of the disk returned by the OS.

-- Resize a specific disk.
ALTER DISKGROUP disk_group_1
  RESIZE DISK diska1 SIZE 100G;

-- Resize all disks in a failure group.
ALTER DISKGROUP disk_group_1
  RESIZE DISKS IN FAILGROUP failure_group_1 SIZE 100G;

-- Resize all disks in a disk group.
ALTER DISKGROUP disk_group_1
  RESIZE ALL SIZE 100G;The UNDROP DISKS clause of the ALTER DISKGROUP statement allows pending disk drops 
to be undone. It will not revert drops that have completed, or disk drops associated with the dropping of a disk group.

ALTER DISKGROUP disk_group_1 UNDROP DISKS;

Disk groups can be rebalanced manually using the REBALANCE clause of the ALTER DISKGROUP statement. 
If the POWER clause is omitted the ASM_POWER_LIMIT parameter value is used. Rebalancing is only needed 
when the speed of the automatic rebalancing is not appropriate. 

ALTER DISKGROUP disk_group_1 REBALANCE POWER 5;

Disk groups are mounted at ASM instance startup and unmounted at ASM instance shutdown. 
Manual mounting and dismounting can be accomplished using the ALTER DISKGROUP statement as seen below.

ALTER DISKGROUP ALL DISMOUNT;
ALTER DISKGROUP ALL MOUNT;
ALTER DISKGROUP disk_group_1 DISMOUNT;
ALTER DISKGROUP disk_group_1 MOUNT;

Templates
Templates are named groups of attributes that can be applied to the files within a disk group. 
The following example show how templates can be created, altered and dropped.

-- Create a new template.
ALTER DISKGROUP disk_group_1 ADD TEMPLATE my_template ATTRIBUTES (MIRROR FINE);

-- Modify template.
ALTER DISKGROUP disk_group_1 ALTER TEMPLATE my_template ATTRIBUTES (COARSE);

-- Drop template.
ALTER DISKGROUP disk_group_1 DROP TEMPLATE my_template;Available attributes include:

UNPROTECTED - No mirroring or striping regardless of the redundancy setting. 
MIRROR - Two-way mirroring for normal redundancy and three-way mirroring for high redundancy. 
         This attribute cannot be set for external redundancy. 
COARSE - Specifies lower granuality for striping. This attribute cannot be set for external redundancy. 
FINE - Specifies higher granularity for striping. This attribute cannot be set for external redundancy. 

Directories
A directory heirarchy can be defined using the ALTER DISKGROUP statement to support ASM file aliasing. 
The following examples show how ASM directories can be created, modified and deleted.

-- Create a directory.
ALTER DISKGROUP disk_group_1 ADD DIRECTORY '+disk_group_1/my_dir';

-- Rename a directory.
ALTER DISKGROUP disk_group_1 RENAME DIRECTORY '+disk_group_1/my_dir' TO '+disk_group_1/my_dir_2';

-- Delete a directory and all its contents.
ALTER DISKGROUP disk_group_1 DROP DIRECTORY '+disk_group_1/my_dir_2' FORCE;Aliases
Aliases allow you to reference ASM files using user-friendly names, rather than the fully qualified ASM filenames. 
-- Create an alias using the fully qualified filename.
ALTER DISKGROUP disk_group_1 ADD ALIAS '+disk_group_1/my_dir/my_file.dbf'
  FOR '+disk_group_1/mydb/datafile/my_ts.342.3';

-- Create an alias using the numeric form filename.
ALTER DISKGROUP disk_group_1 ADD ALIAS '+disk_group_1/my_dir/my_file.dbf'
  FOR '+disk_group_1.342.3';

-- Rename an alias.
ALTER DISKGROUP disk_group_1 RENAME ALIAS '+disk_group_1/my_dir/my_file.dbf'
  TO '+disk_group_1/my_dir/my_file2.dbf';

-- Delete an alias.
ALTER DISKGROUP disk_group_1 DELETE ALIAS '+disk_group_1/my_dir/my_file.dbf';

Attempting to drop a system alias results in an error.

Files
Files are not deleted automatically if they are created using aliases, as they are not Oracle Managed Files (OMF), 
or if a recovery is done to a point-in-time before the file was created. For these circumstances 
it is necessary to manually delete the files, as shown below.

-- Drop file using an alias.
ALTER DISKGROUP disk_group_1 DROP FILE '+disk_group_1/my_dir/my_file.dbf';

-- Drop file using a numeric form filename.
ALTER DISKGROUP disk_group_1 DROP FILE '+disk_group_1.342.3';

-- Drop file using a fully qualified filename.
ALTER DISKGROUP disk_group_1 DROP FILE '+disk_group_1/mydb/datafile/my_ts.342.3';

Checking Metadata
The internal consistency of disk group metadata can be checked in a number of ways using the CHECK clause 
of the ALTER DISKGROUP statement.

-- Check metadata for a specific file.
ALTER DISKGROUP disk_group_1 CHECK FILE '+disk_group_1/my_dir/my_file.dbf'

-- Check metadata for a specific failure group in the disk group.
ALTER DISKGROUP disk_group_1 CHECK FAILGROUP failure_group_1;

-- Check metadata for a specific disk in the disk group. 
ALTER DISKGROUP disk_group_1 CHECK DISK diska1;

-- Check metadata for all disks in the disk group. 
ALTER DISKGROUP disk_group_1 CHECK ALL;

ASM Views
The ASM configuration can be viewed using the V$ASM_% views, which often contain different information 
depending on whether they are queried from the ASM instance, or a dependant database instance.

Viewing ASM Instance Information Via SQL Queries
Finally, there are several dynamic and data dictionary views available to view an ASM configuration from within 
the ASM instance itself:

-- ASM Dynamic Views: FROM ASM Instance Information
 
View Name        Description
 
V$ASM_ALIAS      Shows every alias for every disk group mounted by the ASM instance
 
V$ASM_CLIENT     Shows which database instance(s) are using any ASM disk groups that are being mounted by this ASM instance
 
V$ASM_DISK       Lists each disk discovered by the ASM instance, including disks that are not part of any ASM disk group
 
V$ASM_DISKGROUP  Describes information about ASM disk groups mounted by the ASM instance
 
V$ASM_FILE       Lists each ASM file in every ASM disk group mounted by the ASM instance
 
V$ASM_OPERATION  Like its counterpart, V$SESSION_LONGOPS, it shows each long-running ASM operation in the ASM instance
 
V$ASM_TEMPLATE   Lists each template present in every ASM disk group mounted by the ASM instance
 

I was also able to query the following dynamic views against my database instance to view the related ASM storage 
components of that instance:

-- ASM Dynamic Views: FROM Database Instance Information
 
View Name          Description
 
V$ASM_DISKGROUP    Shows one row per each ASM disk group that's mounted by the local ASM instance
 
V$ASM_DISK         Displays one row per each disk in each ASM disk group that are in use by the database instance
 
V$ASM_CLIENT       Lists one row per each ASM instance for which the database instance has any open ASM files
 

ASM Filenames
There are several ways to reference ASM file. Some forms are used during creation and some for 
referencing ASM files. The forms for file creation are incomplete, relying on ASM to create the fully qualified name, 
which can be retrieved from the supporting views. The forms of the ASM filenames are summarised below.

Filename Type Format 
Fully Qualified ASM Filename +dgroup/dbname/file_type/file_type_tag.file.incarnation 
Numeric ASM Filename +dgroup.file.incarnation 
Alias ASM Filenames +dgroup/directory/filename 
Alias ASM Filename with Template +dgroup(template)/alias 
Incomplete ASM Filename +dgroup 
Incomplete ASM Filename with Template +dgroup(template) 

SQL and ASM
ASM filenames can be used in place of conventional filenames for most Oracle file types, including controlfiles, 
datafiles, logfiles etc. For example, the following command creates a new tablespace with a datafile 
in the disk_group_1 disk group.

CREATE TABLESPACE my_ts DATAFILE '+disk_group_1' SIZE 100M AUTOEXTEND ON;Migrating to ASM Using RMAN
The following method shows how a primary database can be migrated to ASM from a disk based backup:

Disable change tracking (only available in Enterprise Edition) if it is currently being used.

SQL> ALTER DATABASE DISABLE BLOCK CHANGE TRACKING;Shutdown the database.

SQL> SHUTDOWN IMMEDIATEModify the parameter file of the target database as follows:

Set the DB_CREATE_FILE_DEST and DB_CREATE_ONLINE_LOG_DEST_n parameters to the relevant ASM disk groups. 
Remove the CONTROL_FILES parameter from the spfile so the control files will be moved to the DB_CREATE_* destination 
and the spfile gets updated automatically. If you are using a pfile the CONTROL_FILES parameter must be set 
to the appropriate ASM files or aliases. 


Start the database in nomount mode.

RMAN> STARTUP NOMOUNTRestore the controlfile into the new location from the old location.

RMAN> RESTORE CONTROLFILE FROM 'old_control_file_name';Mount the database.

RMAN> ALTER DATABASE MOUNT;Copy the database into the ASM disk group.

RMAN> BACKUP AS COPY DATABASE FORMAT '+disk_group';Switch all datafile to the new ASM location.

RMAN> SWITCH DATABASE TO COPY;Open the database.

RMAN> ALTER DATABASE OPEN;Create new redo logs in ASM and delete the old ones.


Enable change tracking if it was being used.

SQL> ALTER DATABASE ENABLE BLOCK CHANGE TRACKING;Form more information see:

Using Automatic Storage Management 
Migrating a Database into ASM 
Hope this helps. Regards Tim...


Note 6:
=======

Good example !!!!

How to Use Oracle10g release 2 ASM on Linux:

[root@danaly etc]# fdisk /dev/cciss/c0d0

The number of cylinders for this disk is set to 8854.
There is nothing wrong with that, but this is larger than 1024,
and could in certain setups cause problems with:
1) software that runs at boot time (e.g., old versions of LILO)
2) booting and partitioning software from other OSs
   (e.g., DOS FDISK, OS/2 FDISK)

Command (m for help): p

Disk /dev/cciss/c0d0: 72.8 GB, 72833679360 bytes
255 heads, 63 sectors/track, 8854 cylinders
Units = cylinders of 16065 * 512 = 8225280 bytes

           Device Boot      Start         End      Blocks   Id  System
/dev/cciss/c0d0p1   *           1          33      265041   83  Linux
/dev/cciss/c0d0p2              34         555     4192965   82  Linux swap
/dev/cciss/c0d0p3             556         686     1052257+  83  Linux
/dev/cciss/c0d0p4             687        8854    65609460    5  Extended
/dev/cciss/c0d0p5             687        1730     8385898+  83  Linux
/dev/cciss/c0d0p6            1731        2774     8385898+  83  Linux
/dev/cciss/c0d0p7            2775        3818     8385898+  83  Linux
/dev/cciss/c0d0p8            3819        4601     6289416   83  Linux

Command (m for help): n
First cylinder (4602-8854, default 4602): 
Using default value 4602
Last cylinder or +size or +sizeM or +sizeK (4602-8854, default 8854): +20000M    

Command (m for help): n
First cylinder (7035-8854, default 7035): 
Using default value 7035
Last cylinder or +size or +sizeM or +sizeK (7035-8854, default 8854): +3000M 

Command (m for help): n
First cylinder (7401-8854, default 7401): 
Using default value 7401
Last cylinder or +size or +sizeM or +sizeK (7401-8854, default 8854): +3000M

Command (m for help): p

Disk /dev/cciss/c0d0: 72.8 GB, 72833679360 bytes
255 heads, 63 sectors/track, 8854 cylinders
Units = cylinders of 16065 * 512 = 8225280 bytes

           Device Boot      Start         End      Blocks   Id  System
/dev/cciss/c0d0p1   *           1          33      265041   83  Linux
/dev/cciss/c0d0p2              34         555     4192965   82  Linux swap
/dev/cciss/c0d0p3             556         686     1052257+  83  Linux
/dev/cciss/c0d0p4             687        8854    65609460    5  Extended
/dev/cciss/c0d0p5             687        1730     8385898+  83  Linux
/dev/cciss/c0d0p6            1731        2774     8385898+  83  Linux
/dev/cciss/c0d0p7            2775        3818     8385898+  83  Linux
/dev/cciss/c0d0p8            3819        4601     6289416   83  Linux
/dev/cciss/c0d0p9            4602        7034    19543041   83  Linux
/dev/cciss/c0d0p10           7035        7400     2939863+  83  Linux
/dev/cciss/c0d0p11           7401        7766     2939863+  83  Linux

Command (m for help): w
The partition table has been altered!

Calling ioctl() to re-read partition table.

WARNING: Re-reading the partition table failed with error 16: Device or resource busy.
The kernel still uses the old table.
The new table will be used at the next reboot.
Syncing disks.


[root@danaly data1]# /etc/init.d/oracleasm createdisk VOL5 /dev/cciss/c0d0p10
Marking disk "/dev/cciss/c0d0p10" as an ASM disk: [  OK  ]
[root@danaly data1]# /etc/init.d/oracleasm createdisk VOL6 /dev/cciss/c0d0p11
Marking disk "/dev/cciss/c0d0p11" as an ASM disk: [  OK  ]
[root@danaly data1]# /etc/init.d/oracleasm listdisks
VOL1
VOL2
VOL3
VOL4
VOL5
VOL6


(THE FOLLOWING QUERIES ARE ISSUED FROM THE ASM INSTANCE.)

[oracle@danaly ~]$ export ORACLE_SID=+ASM
[oracle@danaly ~]$ sqlplus "/ as sysdba"

SQL*Plus: Release 10.2.0.1.0 - Production on Sun Sep 3 00:28:09 2006

Copyright (c) 1982, 2005, Oracle.  All rights reserved.

Connected to an idle instance.

SQL> startup
ASM instance started

Total System Global Area   83886080 bytes
Fixed Size                  1217836 bytes
Variable Size              57502420 bytes
ASM Cache                  25165824 bytes
ASM diskgroups mounted

SQL> select group_number,disk_number,mode_status from v$asm_disk;

GROUP_NUMBER DISK_NUMBER MODE_STATUS
------------ ----------- --------------
           0           4 ONLINE
           0           5 ONLINE
           1           0 ONLINE
           1           1 ONLINE
           1           2 ONLINE
           1           3 ONLINE

6 rows selected.

SQL> select group_number,disk_number,mode_status,name from v$asm_disk;

GROUP_NUMBER DISK_NUMBER MODE_STATUS    NAME
------------ ----------- -------------- ---------------------------------
           0           4 ONLINE
           0           5 ONLINE
           1           0 ONLINE         VOL1
           1           1 ONLINE         VOL2
           1           2 ONLINE         VOL3
           1           3 ONLINE         VOL4

6 rows selected.

SQL> create diskgroup orag2 external redundancy disk 'ORCL:VOL5';

Diskgroup created.

SQL> select group_number,disk_number,mode_status,name from v$asm_disk;

GROUP_NUMBER DISK_NUMBER MODE_STATUS    NAME
------------ ----------- -------------- -------------------------------------
           0           5 ONLINE
           1           0 ONLINE         VOL1
           1           1 ONLINE         VOL2
           1           2 ONLINE         VOL3
           1           3 ONLINE         VOL4
           2           0 ONLINE         VOL5

6 rows selected.


(THE FOLLOWING QUERIES ARE ISSUED FROM THE DATABASE INSTANCE.)

[oracle@danaly ~]$ export ORACLE_SID=danaly
[oracle@danaly ~]$ sqlplus "/ as sysdba"

SQL*Plus: Release 10.2.0.1.0 - Production on Sun Sep 3 00:47:04 2006

Copyright (c) 1982, 2005, Oracle.  All rights reserved.

Connected to an idle instance.

SQL> startup
ORACLE instance started.

Total System Global Area  943718400 bytes
Fixed Size                  1222744 bytes
Variable Size             281020328 bytes
Database Buffers          654311424 bytes
Redo Buffers                7163904 bytes
Database mounted.
Database opened.

SQL> select name from v$datafile;

NAME
--------------------------------------------------------------------------------
+ORADG/danaly/datafile/system.264.600016955
+ORADG/danaly/datafile/undotbs1.265.600016969
+ORADG/danaly/datafile/sysaux.266.600016977
+ORADG/danaly/datafile/users.268.600016987


SQL> create tablespace eygle datafile '+ORAG2' ;

Tablespace created.

SQL> select name from v$datafile;

NAME
---------------------------------------------------------------------------------
+ORADG/danaly/datafile/system.264.600016955
+ORADG/danaly/datafile/undotbs1.265.600016969
+ORADG/danaly/datafile/sysaux.266.600016977
+ORADG/danaly/datafile/users.268.600016987
+ORAG2/danaly/datafile/eygle.256.600137647


oracle@danaly log]$ export ORACLE_SID=+ASM
[oracle@danaly log]$ sqlplus "/ as sysdba"

SQL*Plus: Release 10.2.0.1.0 - Production on Sun Sep 3 01:36:37 2006

Copyright (c) 1982, 2005, Oracle.  All rights reserved.


Connected to:
Oracle Database 10g Enterprise Edition Release 10.2.0.1.0 - Production
With the Partitioning, Oracle Label Security, OLAP and Data Mining Scoring Engine options

SQL> alter diskgroup orag2 add disk 'ORCL:VOL6';

Diskgroup altered.


============
Note 7: OMF
============


Using Oracle-managed files simplifies the administration of an Oracle database. Oracle-managed files eliminate 
the need for you, the DBA, to directly manage the operating system files comprising an Oracle database. 
You specify operations in terms of database objects rather than filenames. Oracle internally uses standard 
file system interfaces to create and delete files as needed for the following database structures:

Tablespaces 
Online redo log files 
Control files 


The following initialization parameters init.ora/spfile.ora allow the database server to use 
the Oracle Managed Files feature:


- DB_CREATE_FILE_DEST
  Defines the location of the default file system directory where Oracle creates datafiles 
  or tempfiles when no file specification is given in the creation operation. Also used as the default 
  file system directory for online redo log and control files if DB_CREATE_ONLINE_LOG_DEST_n is not specified.
 
- DB_CREATE_ONLINE_LOG_DEST_n
  Defines the location of the default file system directory for online redo log files and 
  control file creation when no file specification is given in the creation operation. You can use this 
  initialization parameter multiple times, where n specifies a multiplexed copy of the online redo log 
  or control file. You can specify up to five multiplexed copies
 
Example:

DB_CREATE_FILE_DEST         = '/u01/oradata/payroll'
DB_CREATE_ONLINE_LOG_DEST_1 = '/u02/oradata/payroll'
DB_CREATE_ONLINE_LOG_DEST_2 = '/u03/oradata/payroll'


34.2 RAC 10g:
=============

===========================================
Note 1: High Level Overview Oracle 10g RAC
===========================================


- RAC Architecture Overview

Let's begin with a brief overview of RAC architecture.

A cluster is a set of 2 or more machines (nodes) that share or coordinate resources to perform the same task. 
A RAC database is 2 or more instances running on a set of clustered nodes, with all instances accessing 
a shared set of database files. 
Depending on the O/S platform, a RAC database may be deployed on a cluster that uses vendor clusterware 
plus Oracle's own clusterware (Cluster Ready Services), or on a cluster that solely uses 
Oracle's own clusterware.
Thus, every RAC sits on a cluster that is running Cluster Ready Services. srvctl is the primary tool DBAs use 
to configure CRS for their RAC database and processes.


- Cluster Ready Services and the OCR

Cluster Ready Services, or CRS, is a new feature for 10g RAC. Essentially, it is Oracle's own clusterware. 
On most platforms, Oracle supports vendor clusterware; in these cases, CRS interoperates with the vendor 
clusterware, providing high availability support and service and workload management. On Linux and Windows clusters, 
CRS serves as the sole clusterware. In all cases, CRS provides a standard cluster interface that is consistent 
across all platforms.

CRS consists of four processes (crsd, occsd, evmd, and evmlogger) and two disks: 
the Oracle Cluster Registry (OCR), and the voting disk. 

CRS manages the following resources: 

. The ASM instances on each node 
. Databases 
. The instances on each node 
. Oracle Services on each node 
. The cluster nodes themselves, including the following processes, or "nodeapps":
  . VIP 
  . GSD 
  . The listener 
  . The ONS daemon

CRS stores information about these resources in the OCR. If the information in the OCR for one of these 
resources becomes damaged or inconsistent, then CRS is no longer able to manage that resource. 
Fortunately, the OCR automatically backs itself up regularly and frequently.


10g RAC (10.2) uses, or depends on,:

- Oracle Clusterware (10.2), formerly referred to as CRS "Cluster Ready Services" (10.1).
- Oracle's optional Cluster File System OCFS (This is optional), or use ASM and RAW.
- Oracle Database extensions

RAC is "scale out" technology: just add commodity nodes to the system.
The key component is "cache fusion". Data are transferred from one node
to another via very fast interconnects. 
Essential to 10g RAC is a "Shared Cache" technology.

Automatic Workload Repository (AWR) plays a role also.  The Fast Application Notification (FAN) mechanism
that is part of RAC, publishes events that describe the current service level being provided
by each instance, to AWR. The load balancing advisory information is then used to determine
the best instance to serve the new request.

. With RAC, ALL Instances of ALL nodes in a cluster, access a SINGLE database.
. But every instance has it's own UNDO tablespace, and REDO logs.

The Oracle Clusterware comprise several background processes that facilitate cluster operations.
The Cluster Synchronization Service CSS, Event Management EVM, and Oracle Cluster components
communicate with other cluster components layers in the other instances within the same 
cluster database environment.


Questions per implementation arise in the following points:
. Storage
. Computer Systems/Storage-Interconnect
. Database
. Application Server
. Public and Private networks
. Application Control & Display

On the Storage level, it can be said that 10g RAC supports
- Automatic Storage Management (ASM)
- Oracle Cluster File System (OCFS)
- ??? Network File System (NFS) - limited (only theoretical actually)
- Disk raw partitions
- Third party cluster file systems

For application control and tools, it can be said that 10g RAC supports
- OEM Grid Control     http://hostname:5500/em
  OEM Database Control http://hostname:1158/em
- "svrctl" is a command line interface to manage the cluster configuration,
   for example, starting and stopping all nodes in one command.
- Cluster Verification Utility (cluvfy) can be used for an installation and sanity check.

Failure in Client connections:

Depending on the Net configuration, type of connection, type of transaction etc.., 
Oracle Net services provides a feature called "Transparant Application Failover", or TAF,
which can fail over a client session to another backup connection.

About HA and DR:

- RAC is HA       , High Availability, that will keep things Up and Running in one site.
- Data Guard is DR, Disaster Recovery, and is able to mirror one site to another remote site.


====================================================
Note 2: 10g RAC processes, services, daemons, tools
====================================================


==============================================
Note 3: Installation notes 10g RAC on Windows
==============================================


3.1 Before you install:
-----------------------

Each node in a cluster requires the following:

> One private internet protocol (IP) address for each node to serve as the private interconnect. 
 The following must be true for each private IP address:

 -It must be separate from the public network
 -It must be accessible on the same network interface on each node
 -It must have a unique address on each node

 The private interconnect is used for inter-node communication by both Oracle Clusterware and RAC. 
 If the private address is available from a network name server (DNS), then you can use that name. 
 Otherwise, the private IP address must be available in each node's C:\WINNT\system32\drivers\etc\hosts file.

> One public IP address for each node, to be used as the Virtual IP (VIP) address for client connections 
and for connection failover. The name associated with the VIP must be different from the default host name.

This VIP must be associated with the same interface name on every node that is part of your cluster. 
In addition, the IP addresses that you use for all of the nodes that are part of a cluster must be from 
the same subnet. 

> One public fixed hostname address for each node, typically assigned by the system administrator 
during operating system installation. If you have a DNS, then register both the fixed IP and the VIP address 
with DNS. If you do not have DNS, then you must make sure that the public IP and VIP addresses for all 
nodes are in each node's host file.

For example, with a two node cluster where each node has one public and one private interface, 
you might have the configuration shown in the following table for your network interfaces, 
where the hosts file is %SystemRoot%\system32\drivers\etc\hosts:

Node Interface Name 	Type 		IP Address 	Registered In 
rac1 rac1 		Public 		143.46.43.100 	DNS (if available, else the hosts file) 
rac1 rac1-vip 		Virtual 	143.46.43.104 	DNS (if available, else the hosts file) 
rac1 rac1-priv 		Private 	10.0.0.1 	Hosts file 
rac2 rac2 		Public 		143.46.43.101 	DNS (if available, else the hosts file) 
rac2 rac2-vip 		Virtual 	143.46.43.105 	DNS (if available, else the hosts file) 
rac2 rac2-priv 		Private 	10.0.0.2 	Hosts file 

The virtual IP addresses are assigned to the listener process.

To enable VIP failover, the configuration shown in the preceding table defines the public and VIP addresses 
of both nodes on the same subnet, 143.46.43. When a node or interconnect fails, then the associated VIP 
is relocated to the surviving instance, enabling fast notification of the failure to the clients connecting 
through that VIP. If the application and client are configured with transparent application failover options, 
then the client is reconnected to the surviving instance.

To disable Windows Media Sensing for TCP/IP, you must set the value of the DisableDHCPMediaSense parameter to 1 
on each node. Disable Media Sensing by completing the following steps on each node of your cluster:

Use Registry Editor (Regedt32.exe) to view the following key in the registry:

HKEY_LOCAL_MACHINE\System\CurrentControlSet\Services\Tcpip\Parameters

Add the following registry value:

Value Name: DisableDHCPMediaSense
Data Type: REG_DWORD -Boolean
Value: 1


- External shared disks for storing Oracle Clusterware and database files.
The disk configuration options available to you are described in Chapter 3, "Storage Pre-Installation Tasks". 
Review these options before you decide which storage option to use in your RAC environment. However, note 
that when Database Configuration Assistant (DBCA) configures automatic disk backup, it uses a 
database recovery area which must be shared. The database files and recovery files do not necessarily have 
to be located on the same type of storage.

Determine the storage option for your system and configure the shared disk. Oracle recommends that 
you use Automatic Storage Management (ASM) and Oracle Managed Files (OMF), or a cluster file system. 
If you use ASM or a cluster file system, then you can also take advantage of OMF and other Oracle Database 10g 
storage features. If you use RAC on Oracle Database 10g Standard Edition, then you must use ASM.

If you use ASM, then Oracle recommends that you install ASM in a separate home from the 
Oracle Clusterware home and the Oracle home. 

Oracle Database 10g Real Application Clusters installation is a two-phase installation. 
In phase one, use Oracle Universal Installer (OUI) to install Oracle Clusterware. 
In phase two, install the database software using OUI.

When you install Oracle Clusterware or RAC, OUI copies the Oracle software onto the node from which 
you are running it. If your Oracle home is not on a cluster file system, then OUI propagates the software 
onto the other nodes that you have selected to be part of your OUI installation session. 

- Shared Storage for Database Recovery Area
When you configure a database recovery area in a RAC environment, the database recovery area must be on 
shared storage. When Database Configuration Assistant (DBCA) configures automatic disk backup, it uses 
a database recovery area that must be shared.

If the database files are stored on a cluster file system, then the recovery area can also be shared through 
the cluster file system.

If the database files are stored on an Automatic Storage Management (ASM) disk group, then the recovery area 
can also be shared through ASM.

If the database files are stored on raw devices, then you must use either a cluster file system or ASM 
for the recovery area.

Note:

ASM disk groups are always valid recovery areas, as are cluster file systems. Recovery area files do not have 
to be in the same location where datafiles are stored. For instance, you can store datafiles on raw devices, 
but use ASM for the recovery area.

Data files are not placed on NTFS partitions, because they cannot be shared. 
Data files can be placed on Oracle Cluster File System (OCFS), on raw disks using ASM, or on raw disks.


- Oracle Clusterware
You must provide OUI with the names of the nodes on which you want to install Oracle Clusterware. 
The Oracle Clusterware home can be either shared by all nodes, or private to each node, depending 
on your responses when you run OUI. The home that you select for Oracle Clusterware must be different 
from the RAC-enabled Oracle home.

Versions of cluster manager previous to Oracle Database 10g were sometimes referred to as "Cluster Manager". 
In Oracle Database 10g, this function is performed by a Oracle Clusterware component known as 
Cluster Synchronization Services (CSS). The OracleCSService, OracleCRService, and OracleEVMService 
replace the service known previous to Oracle Database 10g as OracleCMService9i.


3.2 cluvfy or runcluvfy.bat:
----------------------------

Once you have installed Oracle Clusterware, you can use CVU by entering cluvfy commands on the command line. 
To use CVU before you install Oracle Clusterware, you must run the commands using a command file available 
on the Oracle Clusterware installation media. Use the following syntax to run a CVU command run from the 
installation media, where media is the location of the Oracle Clusterware installation media and options 
is a list of one or more CVU command options:

media\clusterware\cluvfy\runcluvfy.bat options

The following code example is of a CVU help command, run from a staged copy of the Oracle Clusterware 
directory downloaded from OTN into a directory called stage on your C: drive:

C:\stage\clusterware\cluvfy> runcluvfy.bat comp nodereach -n node1,node2 -verbose

For a quick test, you can run the following CVU command that you would normally use after you have completed 
the basic hardware and software configuration:

prompt> media\clusterware\cluvfy\runcluvfy.bat stage �post hwos �n node_list

Use the location of your Oracle Clusterware installation media for the media value and a list of the nodes, 
separated by commas, in your cluster for node_list. Expect to see many errors if you run this command 
before you or your system administrator complete the cluster pre-installation steps.

On Oracle Real Application Clusters systems, each member node of the cluster must have user equivalency 
for the Administrative privileges account that installs the database. This means that the administrative 
privileges user account and password must be the same on all nodes.

- Checking the Hardware and Operating System Setup with CVU
You can use two different CVU commands to check your hardware and operating system configuration. 
The first is a general check of the configuration, and the second specifically checks for the components required 
to install Oracle Clusterware.

The syntax of the more general CVU command is:

cluvfy stage �post hwos �n node_list [-verbose]

where node_list is the names of the nodes in your cluster, separated by commas. However, because you have 
not yet installed Oracle Clusterware, you must execute the CVU command from the installation media using a command 
like the one following. In this example, the command checks the hardware and operating system of a two-node 
cluster with nodes named node1 and node2, using a staged copy of the installation media in a directory called 
stage on the C: drive:

C:\stage\clusterware\cluvfy> runcluvfy.bat stage �post hwos �n node1,node2 -verbose

You can omit the -verbose keyword if you do not wish to see detailed results listed as CVU performs 
each individual test.

The following example is a command, without the -verbose keyword, to check for the readiness of the cluster 
for installing Oracle Clusterware:

C:\stage\clusterware\cluvfy> runcluvfy.bat comp sys -n node1,node2 -p crs

- Checking the Network Setup
Enter a command using the following syntax to verify node connectivity between all of the nodes 
for which your cluster is configured:

cluvfy comp nodecon -n node_list [-verbose]

- Verifying Cluster Privileges
Before running Oracle Universal Installer, from the node where you intend to run the Installer, 
verify that you have administrative privileges on the other nodes. To do this, enter the following command 
for each node that is a part of the cluster:

net use \\node_name\C$

where node_name is the node name. If your installation will access drives in addition to the C: drive, repeat 
this command for every node in the cluster, substituting the drive letter for each drive you plan to use.

For the installation to be successful, you must use the same user name and password on each node in a cluster 
or use a domain user name. If you use a domain user name, then log on under a domain with a user name and password 
to which you have explicitly granted local administrative privileges on all nodes.


3.3 Shared disk considerations:
-------------------------------

Preliminary Shared Disk Preparation
Complete the following steps to prepare shared disks for storage:

-- Disabling Write Caching
You must disable write caching on all disks that will be used to share data between nodes in your cluster. 
To disable write caching, perform these steps:

Click Start, then click Settings, then Control Panel, then Administrative Tools, then Computer Management, 
then Device Manager, and then Disk drives
Expand the Disk drives and double-click the first drive listed
Under the Disk Properties tab for the selected drive, uncheck the option that enables the write cache
Double-click each of the other drives listed in the Disk drives hive and disable the write cache as described 
in the previous step

Caution:

Any disks that you use to store files, including database files, that will be shared between nodes, 
must have write caching disabled.

-- Enabling Automounting for Windows 2003
If you are using Windows 2003, then you must enable disk automounting, depending on the Oracle products 
you are installing and on other conditions.

You must enable automounting when using:

Raw partitions for Oracle Real Application Clusters (RAC)
Cluster file system for Oracle Real Application Clusters
Oracle Clusterware
Raw partitions for a single-node database installation
Logical drives for Automatic Storage Management (ASM)

To enable automounting:

Enter the following commands at a command prompt:

c:\> diskpart
DISKPART> automount enable
Automatic mounting of new volumes enabled.

Type exit to end the diskpart session

Repeat steps 1 and 2 for each node in the cluster.


3.4 Reviewing Storage Options for Oracle Clusterware, Database, and Recovery Files:
-----------------------------------------------------------------------------------

This section describes supported options for storing Oracle Clusterware files, Oracle Database software, 
and database files. 

-- Overview of Oracle Clusterware Storage Options

Note that Oracle Clusterware files include the Oracle Cluster Registry (OCR) and 
the Oracle Clusterware voting disk.

There are two ways to store Oracle Clusterware files:

1. Oracle Cluster File System (OCFS): The cluster file system Oracle provides for the Windows and Linux communities. 
If you intend to store Oracle Clusterware files on OCFS, then you must ensure that OCFS volume sizes 
are at least 500 MB each.

2. Raw storage: Raw logical volumes or raw partitions are created and managed by Microsoft Windows 
disk management tools or by tools provided by third party vendors.

Note that you must provide disk space for one mirrored Oracle Cluster Registry (OCR) file, 
and two mirrored voting disk files.

-- Overview of Oracle Database and Recovery File Options

There are three ways to store Oracle Database and recovery files on shared disks:

1. Automatic Storage Management (database files only): Automatic Storage Management (ASM) is an integrated, 
high-performance database file system and disk manager for Oracle files. Because ASM requires an 
Oracle Database instance, it cannot contain Oracle software, but you can use ASM to manage database 
and recovery files.

2. Oracle Cluster File System (OCFS): Note that if you intend to use OCFS for your database files, 
then you should create partitions large enough for the database files when you create partitions 
for Oracle Clusterware

Note:

If you want to have a shared Oracle home directory for all nodes, then you must use OCFS.

3. Raw storage: Note that you cannot use raw storage to store Oracle database recovery files.

The storage option that you choose for recovery files can be the same as or different to the option 
you choose for the database files.


Storage Option				Oracle Clusterware	Database	Recovery area
--------------				------------------	--------	-------------
Automatic Storage Management 		No 			Yes 		Yes 
Cluster file system (OCFS) 		Yes 			Yes 		Yes 
Shared raw storage 			Yes 			Yes 		No 


-- Checking for Available Shared Storage with CVU
To check for all shared file systems available across all nodes on the cluster, use the following CVU command:

cluvfy comp ssa -n node_list

Remember to use the full path name and the runcluvfy.bat command on the installation media and include 
the list of nodes in your cluster, separated by commas, for the node_list. The following example is for 
a system with two nodes, node1 and node2, and the installation media on drive F:

F:\clusterware\cluvfy> runcluvfy.bat comp ssa -n node1,node2

If you want to check the shared accessibility of a specific shared storage type to specific nodes 
in your cluster, then use the following command syntax:

cluvfy comp ssa -n node_list -s storageID_list

In the preceding syntax, the variable node_list is the list of nodes you want to check, separated by commas, 
and the variable storageID_list is the list of storage device IDs for the storage devices managed by the 
file system type that you want to check.


=====================================
Note 4: Installation on Redhat Linux
=====================================

4.2 Prepare your nodes:
-----------------------


4.2.1 Scetch of a 2-node Linux cluster

			192.168.2.0
         ------------------------------------------ public network 
             |                              |
             |                              |
        ------------                    -------------
        |InstanceA |Private network     |InstanceB  |
        |          |Ethernet            |           |
        |          |--------------------|           |
        |          |192.168.1.0         |           |
        |          |                    |           |
        |          |____________        |           |
        |          |  -----    -|---    |           |
        |          |--|PWR|    |PWR|----|           |
        |          |  -----    -----    |           |
        |          |    |_______________|           |
        |          |                    |           |
        ------------                    -------------
             | SCSI bus or Fible Channel      |
             ------------------  --------------
               Interconnect   |  |
                              |  |
Fig 4.1                   -----------
                          |Shared   |  - has Single DB on ASM or OCFS or RAW
                          |Disk     |  - has OCR and Voting disk on OCFS or RAW (not ASM)
                          |Storage  |  - has Recovery area on ASM or OCFS (not RAW)
                          ----------- 


4.2.2 Storage Options

Storage					Oracle Clusterware	Database	Recovery area
--------------				------------------	--------	-------------
Automatic Storage Management 		No 			Yes 		Yes 
Cluster file system (OCFS) 		Yes 			Yes 		Yes 
Shared raw storage 			Yes 			Yes 		No 


In the following, we will do an example installation on 3 nodes.


4.2.3 Install Redhat on all nodes with all options.

4.2.4 create oracle user and groups dba, oinstall on all nodes.
      Make sure they all have the same UID and GUI.

4.2.5 Make sure the user oracle has an appropriate .profile or .bash_profile

4.2.6 Every node needs a private network connection and a public network connection (at least
      two networkcards).

4.2.7 Linux kernel parameters:

Most out of the box kernel parameters (of RHELS 3,4,5) are set correctly for Oracle
except a few.

You should have the following minimal configuration:

net.ipv4.ip_local_port_range	1024  65000
kernel.sem			250  32000  100  128
kernel.shmmni			4096
kernel.shmall			2097152
kernel.shmmax			2147483648
fs.file-max			65536


You can check the most important parameters using the following command:

# /sbin/sysctl -a | egrep 'sem|shm|file-max|ip_local'

net.ipv4.ip_local_port_range = 1024  65000
kernel.sem = 250  32000  100  128
kernel.shmmni = 4096
kernel.shmall = 2097152
kernel.shmmax = 2147483648
fs.file-max = 65536

If some value should be changed, you can change the "/etc/sysctl.conf" file and run the "/sbin/sysctl -p" command
to change the value immediately.
Every time the system boots, the init program runs the /etc/rc.d/rc.sysinit script. This script contains 
a command to execute sysctl using /etc/sysctl.conf to dictate the values passed to the kernel. 
Any values added to /etc/sysctl.conf will take effect each time the system boots. 
 

4.2.8 make sure ssh and scp are working on all nodes without asking for a password.
      Use shh-keygen to arrange that.


4.2.9 Example "/etc/host" on the nodes:

Suppose you have the following 3 hosts, with their associated public and private names:

public  private
oc1	poc1
oc2	poc2
oc3	poc3

Then this could be a valid host file on the nodes: 

127.0.0.1	localhost.localdomain	localhost

192.168.2.99	rhes30
192.168.2.166	oltp
192.168.2.167	mw

192.168.2.101	oc1	#public1
192.168.1.101	poc1	#private1
192.168.2.176	voc1	#virtual1

192.168.2.102	oc2	#public2
192.168.1.102	poc2	#private2
192.168.2.177	voc2	#virtual2

192.168.2.103	oc3	#public3
192.168.1.103	poc3	#private3
192.168.2.178	voc3	#virtual3


4.2.10 Example disk devices

On all nodes, the shared disk devices should be accessible through the same devices names.

Raw Device Name		Physical Device Name	Purpose
/dev/raw/raw1		/dev/sda1		ASM Disk 1: +DATA1
/dev/raw/raw2		/dev/sdb1		ASM Disk 1: +DATA1
/dev/raw/raw3		/dev/sdc1		ASM Disk 2: +RECOV1
/dev/raw/raw4		/dev/sdd1		ASM Disk 2: +RECOV1
/dev/raw/raw5		/dev/sde1		OCR Disk (on RAW device)
/dev/raw/raw6		/dev/sdf1		Voting Disk (on RAW device)


4.3 CRS installation:
---------------------

4.3.1 First install CRS in its own home directory

First install CRS in its own home directory, e.g. CRS10gHome, apart from the Oracle home dir.

As Oracle user:

./runInstaller

 ---------------------------------------------------
 |                                                 |  Screen 1
 |Specify File LOcations                           |
 |                                                 |
 |Source                                           |
 |Path: /install/crs10g/Disk1/stage/products.xml   |
 |                                                 |
 |Destination                                      |
 |Name: CRS10gHome                                 |
 |Path: /u01/app/oracle/product/10.1.0/CRS10gHome  |
 |                                                 |
 ---------------------------------------------------


 ---------------------------------------------------
 |                                                 |  Screen 2
 |Cluster Configuration                            |
 |                                                 |
 |Cluster Name: lec1                               |
 |                                                 |
 | Public Node Name            Private Node Name   |
 | ---------------------------------------------   |
 | |oc1                 | p0c1                  |  |
 | |--------------------------------------------   |
 | |oc2                 | p0c2                  |  |
 | |--------------------------------------------   |
 | |oc3                 | poc3                  |  |
 | |--------------------------------------------   |
 ---------------------------------------------------

In the next screen, you specify which of your networks is to be used as
the public interface (to connect to the public network) and which will be used
for the private interconnect to support cache fushion and the cluster heartbeat.

 ---------------------------------------------------
 |                                                 |  Screen 3
 |Private Interconnect Enforcement                 |
 |                                                 |
 |                                                 |
 |                                                 |
 | Interface Name   Subnet          Interface type |
 | ---------------------------------------------   |
 | |eth0           |192.168.2.0   |Public      |   |
 | |--------------------------------------------   |
 | |eth1           |192.168.1.0   |Private     |   |
 | |--------------------------------------------   |
 |                                                 |
 ---------------------------------------------------

In the next screen, you specify /dev/raw/raw5 as the raw disk for the Oracle Cluster Registry.

 ---------------------------------------------------
 |                                                 |  Screen 4
 |Oracle Cluster Registry                          |
 |                                                 |
 |Specify OCR Location: /dev/raw/raw5              |
 |                                                 |
 ---------------------------------------------------

In a similar fashion you specify the location of the Voting Disk.

 ---------------------------------------------------
 |                                                 |  Screen 5
 |Voting Disk                                      |
 |                                                 |
 |Specify Voting Disk: /dev/raw/raw6               |
 |                                                 |
 ---------------------------------------------------

You now have to execute the /u01/app/oracle/orainventory/orainstRoot.sh script
on all Cluster Nodes as the root user.

After this, you can continue with the other window, and see an "Install Summary" screen.
Now you click "Install" and the installation begins.
Apart from the node you work on, the software will also be copied to the other nodes as well.

After the installation is complete, you are once again prompted to run a script as root
on each node of the Cluster.
This is the script "/u01/app/oracle/product/10.1.0/CRS10gHome/root.sh".

-- The olsnodes command.

After finishing the CSR installation, you can verify that the installation completed successfully
by running on any node the following command:

# cd /u01/app/oracle/product/10.1.0/CRS10gHome/bin
# olsnodes -n
oc1   1
oc2   2
oc3   3


4.4 Database software installation:
-----------------------------------

You can install the database software into the same directory in each node.
With OCFS2, you might do one install in a common shared directory for all nodes.

Because CSR is already running, the OUI detects that, and because its cluster aware, it
provides you with the options to install a clustered implementation.

You start the installation by running ./runInstaller as the oracle user on one node.
For most part, it looks the same as a single-instance installation.

After the file location screen, that is source and destination, you will see this screen:

 ---------------------------------------------------
 |                                                 |  
 |Specify Hardware Cluster Installation Mode       |
 |                                                 |
 | o Cluster installation mode                     |
 |                                                 |
 |  Node name                                      |
 |  ---------------------------------------------  |
 |  | [] oc1                                    |  |
 |  | [] oc2                                    |  |
 |  | [] oc3                                    |  |
 |  ---------------------------------------------  |
 |                                                 |
 | o Local installation (non cluster)              |
 |                                                 |
 |-------------------------------------------------|

Most of the time, you will do a "software only" installation, and create the database later
with the DBCA.

For the first node only, after some time, the Virtual IP Configuration Assistant, VIPCA, will start.
Here you can configure the Virtual IP adresses you will use for application failover
and the Enterprise Manager Agent.
Here you will select the Virtual IP's for all nodes.
VIPCA only needs to run once per Cluster.


4.5 Creating the RAC database with DBCA:
----------------------------------------

Launching the DBCA for installing a RAC database is much the same as launching DBCA for a single instance.
If DBCA detects cluster software installed, it gives you the option to install a RAC database 
or a single instance.

as oracle user:

% dbca &

 ---------------------------------------------------
 |                                                 |  
 |Welcome to the database configuration assistant  |
 |                                                 |
 |                                                 |
 |                                                 |
 | o Oracle Real Application Cluster database      |
 |                                                 |
 | o Oracle single instance database               |
 |                                                 |
 |-------------------------------------------------|

After selecting RAC, the next screen gives you the option to select nodes:

 ---------------------------------------------------
 |                                                 |  
 |Select the nodes on which you want to create     |
 |the cluster database. The local node oc1 will    |
 |always be used whether or not it is selected.    |
 |                                                 |
 |  Node name                                      |
 |  ---------------------------------------------  |
 |  | [] oc1                                    |  |
 |  | [] oc2                                    |  |
 |  | [] oc3                                    |  |
 |  ---------------------------------------------  |
 |                                                 |
 |                                                 |
 |-------------------------------------------------|
 
In the next screens, you can choose the type of database (oltp, dw etc..), and all
other items, just like a single instance install.
At a cetain point, you can choose to use ASM diskgroups, flash-recovery area etc..


===========================================
Note 5. RAC tools an utilities.
===========================================


Example 1: removing and adding a failed node
--------------------------------------------

Suppose, using above example, that instance rac3 on node oc3, fails. Suppose that you need to repair
the node (e.g. harddisk crash).

-- Remove the instance:

% srvctl remove instance -d rac -i rac3
Remove instance rac3 for the database rac (y/n)? y

-- Remove the node from the cluster:

# cd /u01/app/oracle/product/10.1.0/CRS10gHome/bin
# ./olsnode -n
oc1   1
oc2   2
oc3   3
# cd ../install
# ./rootdeletenode.sh oc3,3
# cd ../bin
# ./olsnode -n
oc1   1
oc2   2
#

Suppose that you have repared host oc3. We now want to add it back into the cluster.
Host oc3 has the OS newly installed, and its /etc/host file is just like it is on the other nodes.

-- Add the node at the clusterware layer:

From oc1 or oc2, go to the $CRS_Home/oui/bin directory, and run

# ./addNode.sh

A graphical screen pops up, and you are able to add oc3 to the cluster.
Al CRS files are copied to the new node.

To start the services on the new node, you are then prompted to run "rootaddnode.sh" on the active node
and "root.sh" on the new node.

# ./rootaddnode.sh

# ssh oc3
# cd /u01/app/oracle product/10.1.0/CRS10gHome
# ./root.sh

-- Install the Oracle software on the new node:


Example 2: showing all nodes from a node
----------------------------------------

# lsnodes -v

# cd /u01/app/oracle/product/10.1.0/CRS10gHome/bin
# ./olsnodes -n
oc1   1
oc2   2
oc3   3


Example 3: using svrctl
-----------------------

The Server Control SVRCTL utility is installed on each node by default. 
You can use SRVCTL to start and stop the database and instances, manage configuration information,
and to move or remove instances and services.

Some SVRCTL operations store configuration information in the OCR. 
SVRCTL performs other operations, such as starting and stopping instances, by sending request
to the Oracle Clusterware process CSRD, which then starts or stops the Oracle Clusterware resources.

srvctl must be run from the $ORACLE_HOME of the RAC you are administering. 
The basic format of a srvctl command is 

srvctl <command> <target> [options]

where command is one of

enable|disable|start|stop|relocate|status|add|remove|modify|getenv|setenv|unsetenv|config

and the target, or object, can be a 
-database, 
-instance, 
-service, 
-ASM instance, or the 
-nodeapps.


-- Example 1: To view help:

% svrctl -h
% svrctl command -h

-- Example 2: To see the SRVCTL version number, enter

% svrctl -V

-- Example 3. Bring up the MYSID1 instance of the MYSID database.

% srvctl start instance -d MYSID -i MYSID1

-- Example 4. Stop the MYSID database: all its instances and all its services, on all nodes.

% srvctl stop database -d MYSID

The following command mounts all of the non-running instances, using the default connection information:

% srvctl start database -d orcl -o mount

-- Example 5. Stop the nodeapps on the myserver node. NB: Instances and services also stop.

% srvctl stop nodeapps -n myserver

-- Example 6. Add the MYSID3 instance, which runs on the myserver node, to the MYSID clustered database.

% srvctl add instance -d MYSID -i MYSID3 -n myserver

-- Example 7. Add a new node, the mynewserver node, to a cluster.

% srvctl add nodeapps -n mynewserver -o $ORACLE_HOME -A 149.181.201.1/255.255.255.0/eth1
(The -A flag precedes an address specification.)

-- Example 8. To change the VIP (virtual IP) on a RAC node, use the command

% srvctl modify nodeapps -A new_address

-- Example 9. Status of components 

. Find out whether the nodeapps on mynewserver are up.

 % srvctl status nodeapps -n mynewserver
  VIP is running on node: mynewserver
  GSD is running on node: mynewserver
  Listener is not running on node: mynewserver
  ONS daemon is running on node: mynewserver

. Find out whether the ASM  is running:

  % srvctl status asm -n docrac1
  ASM instance +ASM1 is running on node docrac1.

. Find status of cluster database

  % srvctl status database -d EOPP
  Instance EOPP1 is running on node dbq0201
  Instance EOPP2 is running on node dbq0102

  % srvctl config database -d EOPP
  dbq0201 EOPP1 /ora/product/10.2.0/db
  dbq0102 EOPP2 /ora/product/10.2.0/db

  % srvctl config service -d EOPP
  opp.et.supp PREF: EOPP1 AVAIL: EOPP2
  opp.et.grid PREF: EOPP1 AVAIL: EOPP2


-- Example 10. The following command and output show the expected configuration for a three node 
               database called ORCL.

% srvctl config database -d ORCL

server01 ORCL1 /u01/app/oracle/product/10.1.0/db_1
server02 ORCL2 /u01/app/oracle/product/10.1.0/db_1
server03 ORCL3 /u01/app/oracle/product/10.1.0/db_1


-- Example 11. Disable the ASM instance on myserver for maintenance.

% srvctl disable asm -n myserver


-- Example 12. Debugging srvctl

Debugging srvctl in 10g couldn't be easier. Simply set the SRVM_TRACE environment variable.

% export SRVM_TRACE=true


-- Example 13. Question Version 10G RAC

Q: how to add a listener to the nodeapps using the srvctl command ??
or even if it can be added using srvctl ??

A: just edit listener.ora on all concerned nodes and add entries ( the usual way).
srvctl will automatically make use of it.
For example

% srvctl start database -d SAMPLE

will start database SAMPLE and its associated listener LSNR_SAMPLE. 


-- Example 14. Adding services.

% srvctl add database -d ORCL -o /u01/app/oracle/product/10.1.0/db_1
% srvctl add instance -d ORCL -i ORCL1 -n server01
% srvctl add instance -d ORCL -i ORCL2 -n server02
% srvctl add instance -d ORCL -i ORCL3 -n server03


-- Example 15. Administering ASM Instances with SRVCTL in RAC

You can use SRVCTL to add, remove, enable, and disable an ASM instance as described in the following procedure:

Use the following to add configuration information about an existing ASM instance:
% srvctl add asm -n node_name -i asm_instance_name -o oracle_home

Use the following to remove an ASM instance:
% srvctl remove asm -n node_name [-i asm_instance_name]

-- Example 16. Stop multiple instances.

The following command provides its own connection information to shut down the two instances orcl3 and orcl4
using the IMMEDIATE option:

% srvctl stop instance -d orcl -i "orcl3,orcl4" -o immediate -c "sysback/oracle as sysoper" 

-- Example 17. Showing policies.

Clusterware can automatically start your RAC database when the system restarts.
You can use Automatic or Manual "policies", to control whether clusterware restarts RAC.

To display the current policy:

% srvctl config database -d database_name -a

To change to another policy:

% srvctl modify database -d database_name -y policy_name

-- Example 18.

% srvctl start service -d DITOB


-- More examples

% srvctl remove instance -d rac -i rac3
% srvctl disable instance -d orcl -i orcl2
% srvctl enable instance -d orcl -i orcl2 


Example 4: crsctl
-----------------

Use CRSCTL to Control Your Clusterware

Oracle Clusterware enables servers in an Oracle database Real Application Cluster to coordinate simultaneous 
workload on the same database files. The crsctl command provides administrators many useful capabilities. 
For example, with crsctl, you can check Clusterware health disable/enable Oracle Clusterware startup on boot, 
find information on the voting disk and check the Clusterware version, and more.

1. Do you want to check the health of the Clusterware?
# crsctl check crs
CSS appears healthy
CRS appears healthy
EVM appears healthy

2. Do you want to reboot a node for maintenance without Clusterware coming up on boot?
## Disable clusterware on machine2 bootup:
# crsctl disable crs
## Stop the database then stop clusterware processes:
# srvctl stop instance �d db �i db2
# crsctl stop crs
# reboot 

## Enable clusterware on machine bootup:
# crsctl enable crs
# crsctl start crs
# srvctl start instance �d db �i db2 

3. Do you wonder where your voting disk is?
# crsctl query css votedisk
0. 0 /dev/raw/raw2 

4. Do you need to find out what clusterware version is running on a server?
# crsctl query crs softwareversion
CRS software version on node [db2] is [10.2.0.2.0]

5. Adding and Removing Voting Disks

You can dynamically add and remove voting disks after installing Oracle RAC. Do this using the following 
commands where path is the fully qualified path for the additional voting disk. Run the following command 
as the root user to add a voting disk:

# crsctl add css votedisk path

Run the following command as the root user to remove a voting disk:

# crsctl delete css votedisk path


Example 5: cluvfy
-----------------

The Cluster Verification Utility pre or post validates an Oracle Clusterware environment or configuration.  
We found the CVU utility to be very useful for checking a cluster server environment for RAC. 
The CVU can check shared storage, interconnects, server systems and user permissions. The Universal Installer runs 
the verification utility at the end of the cluster ware install. The utility can also be run from the command line 
with parameters and options to validate components. 
 
For example, a script that verifies a cluster using cluvfy is named runcluvfy.sh and is located on 
the /clusterware/cluvfy directory in the installation area. This script unpacks the utility, sets environment 
variables and executes the verification command.
 
This command verifies that the hosts atlanta1, atlanta2 and atlanta3 are ready for a clustered database 
install of release 2.
 
./runcluvfy.sh stage -pre dbinst -n atlanta1,atlanta2,atlanta3 -r 10gR2 -osdba dba �verbose
 
The results of the command above check user and group equivalence across machines, connectivity, 
interface settings, system requirements like memory, disk space and kernel settings and versions, 
required Linux package existence and so on. Any problems are reported as errors, all successful 
checks are marked as passed.
 
Many other aspects of the cluster can be verified with this utility for Release 2 or Release 1.

Some more examples:

-- Checking for Available Shared Storage with CVU
To check for all shared file systems available across all nodes on the cluster, use the following CVU command:

% cluvfy comp ssa -n node_list

Remember to use the full path name and the runcluvfy.bat command on the installation media and include 
the list of nodes in your cluster, separated by commas, for the node_list. The following example is for 
a system with two nodes, node1 and node2, and the installation media on drive F:

% runcluvfy.bat comp ssa -n node1,node2

If you want to check the shared accessibility of a specific shared storage type to specific nodes 
in your cluster, then use the following command syntax:

% cluvfy comp ssa -n node_list -s storageID_list

In the preceding syntax, the variable node_list is the list of nodes you want to check, separated by commas, 
and the variable storageID_list is the list of storage device IDs for the storage devices managed by the 
file system type that you want to check.


=================================
Note 6: Example tnsnames.ora in RAC
=================================

Example 1:
----------

tnsnames.ora File

TEST =
(DESCRIPTION =
(LOAD_BALANCE = ON)
(ADDRESS_LIST =
(ADDRESS = (PROTOCOL = TCP)(HOST = testlinux1)(PORT = 1521))
(ADDRESS = (PROTOCOL = TCP)(HOST = testlinux2)(PORT = 1521)))
(CONNECT_DATA =
(SERVICE_NAME = TEST))))

TEST1 =
(DESCRIPTION =
(ADDRESS_LIST =
(LOAD_BALANCE = ON)
(ADDRESS = (PROTOCOL = TCP)(HOST = testlinux1)(PORT = 1521)))
(CONNECT_DATA =
(SERVICE_NAME = TEST)(INSTANCE_NAME = TEST1)))

TEST2 =
(DESCRIPTION =
(ADDRESS_LIST =
(LOAD_BALANCE = ON)
(ADDRESS = (PROTOCOL = TCP)(HOST = testlinux2)(PORT = 1521)))
(CONNECT_DATA =
(SERVICE_NAME = TEST)(INSTANCE_NAME = TEST2)))

EXTPROC_CONNECTION_DATA =
(DESCRIPTION =
(ADDRESS_LIST =
(ADDRESS = (PROTOCOL = IPC)(KEY = EXTPROC)))
(CONNECT_DATA =
(SID=PLSExtProc)(PRESENTATION = RO)))
 
LISTENERS_TEST =
(ADDRESS = (PROTOCOL = TCP)(HOST = testlinux1)(PORT = 1521))
(ADDRESS = (PROTOCOL = TCP)(HOST = testlinux2)(PORT = 1521))


Example 2:
----------

Connect-Time Failover
From the clients end, when your connection fails at one node or service, you can then do a look up 
from your tnsnames.ora file and go on seeking a connection with the other available node. Take this example 
of our 4-node VMware ESX 3.x Oracle Linux Servers:

FOKERAC =
  (DESCRIPTION =
    (ADDRESS = (PROTOCOL = TCP)(HOST = nick01.wolga.com)(PORT = 1521))
    (ADDRESS = (PROTOCOL = TCP)(HOST = nick02.wolga. com)(PORT = 1521))
    (ADDRESS = (PROTOCOL = TCP)(HOST = brian01.wolga. com)(PORT = 1521))
    (ADDRESS = (PROTOCOL = TCP)(HOST = brian02.wolga. com)(PORT = 1521))
    (CONNECT_DATA =
      (SERVICE_NAME = fokerac)
    )
  )

Here the first address in the list is tried at the client�s end. Should the connection to nick01.wolga.nl fail, 
then the next address, nick02.wolga.nl, will be tried. This phenomenon is called connection-time failover. 
You could very well have a 32-node RAC cluster monitoring the galactic system at NASA and thus have all 
those nodes typed in your tnsnames.ora file. Moreover, these entries do not necessarily have to be part 
of the RAC cluster. So it is possible that you are using Streams, Log Shipping or Advanced Replication 
to maintain your HA (High Availability) model. These technologies facilitate continued processing of the 
database by such a HA (High Availability) model in a non-RAC environment. In a RAC environment we know 
(and expect) the data to be the same across all nodes since there is only one database.

Example 3:
----------

TAF (Transparent Application Failover)
Transparent Application Failover actually refers to a failover that occurs when a node or instance 
is unavailable due to an outage or other reason that prohibits a connection to be established on that node. 
This can be set to on with the following parameter FAILOVER. Setting it to ON will activate the TAF. 
It is turned on by default unless you set it to OFF to disable it. Now, when you turn it on you have two types 
of connections available by the means of the FAILOVER_MODE parameter. The type can be session, which is default 
or select. When the type is SESSION, if the instance fails, then the user is automatically connected to the next 
available node without the user�s manual intervention. The SQL statements need to be carried out again 
on the next node. However, when you set the TYPE to SELECT, then if you are connected and are in the middle 
of your query, then your query will be restarted after you have been failed over to the next available node. 
Take this example of our tnsnames.ora file, (go to the section beginning with CONNECT_DATA):

 (CONNECT_DATA =
      (SERVER = DEDICATED)
      (SERVICE_NAME = fokerac.wolga.com)
      (FAILOVER_MODE =
        (TYPE = SELECT)
        (METHOD = BASIC)
	(RETRIES = 180)
	(DELAY = 5)
      )
  )


==============================================
Note 7: Notes about Backup and Restore of RAC
==============================================


7.1 Backing up Voting Disk:
---------------------------

Run the following command to backup a voting disk. Perform this operation on every voting disk
as needed where 'voting_disk_name' is the name of the active voting disk, and 'backup_file_name'
is the name of the file to which you want to backup the voting disk contents:

# dd if=voting_disk_name of=backup_file_name

When you use the dd command for making backups of the voting disk, the backup can be performed while 
the Cluster Ready Services (CRS) process is active; you do not need to stop the crsd.bin process 
before taking a backup of the voting disk.

-- Adding and Removing Voting Disks
You can dynamically add and remove voting disks after installing Oracle RAC. Do this using the following 
commands where path is the fully qualified path for the additional voting disk. Run the following command 
as the root user to add a voting disk:

# crsctl add css votedisk path

Run the following command as the root user to remove a voting disk:

# crsctl delete css votedisk path


7.2 Recovering Voting Disk:
---------------------------

Run the following command to recover a voting disk where 'backup_file_name'
is the name of the voting disk backupfile, and 'voting_disk_name' is the name of the active
voting disk:

# dd if=backup_file_name of=voting_disk_name


7.3 Backup and Recovery OCR:
----------------------------

Oracle Clusterware automatically creates OCR backups every 4 hours. At any one time, Oracle Clusterware 
always retains the latest 3 backup copies of the OCR that are 4 hours old, 1 day old, and 1 week old.

You cannot customize the backup frequencies or the number of files that Oracle Clusterware retains. 
You can use any backup software to copy the automatically generated backup files at least once daily 
to a different device from where the primary OCR file resides. The default location for generating backups 
on Red Hat Linux systems is "CRS_home/cdata/cluster_name" where cluster_name is the name of your cluster 
and CRS_home is the home directory of your Oracle Clusterware installation.


-- Viewing Available OCR Backups
To find the most recent backup of the OCR, on any node in the cluster, use the following command:

# ocrconfig -showbackup

-- Backing Up the OCR
Because of the importance of OCR information, Oracle recommends that you use the ocrconfig tool to make copies 
of the automatically created backup files at least once a day.

In addition to using the automatically created OCR backup files, you should also export the OCR contents 
to a file before and after making significant configuration changes, such as adding or deleting nodes 
from your environment, modifying Oracle Clusterware resources, or creating a database. 
Exporting the OCR contents to a file lets you restore the OCR if your configuration changes cause errors. 
For example, if you have unresolvable configuration problems, or if you are unable to restart your cluster database 
after such changes, then you can restore your configuration by importing the saved OCR content 
from the valid configuration.

To export the contents of the OCR to a file, use the following command, where backup_file_name is the name 
of the OCR backup file you want to create:

# ocrconfig -export backup_file_name


-- Recovering the OCR
This section describes two methods for recovering the OCR. The first method uses automatically generated 
OCR file copies and the second method uses manually created OCR export files.

In event of a failure, before you attempt to restore the OCR, ensure that the OCR is unavailable. 
Run the following command to check the status of the OCR:

# ocrcheck 

If this command does not display the message 'Device/File integrity check succeeded' for at least one copy 
of the OCR, then both the primary OCR and the OCR mirror have failed. You must restore the OCR from a backup.

-- Restoring the Oracle Cluster Registry from Automatically Generated OCR Backups
When restoring the OCR from automatically generated backups, you first have to determine which backup file 
you will use for the recovery.

To restore the OCR from an automatically generated backup on a Red Hat Linux system:

Identify the available OCR backups using the ocrconfig command:

# ocrconfig -showbackup

Note:

You must be logged in as the root user to run the ocrconfig command.

Review the contents of the backup using the following ocrdump command, where file_name is the name 
of the OCR backup file:

$ ocrdump -backupfile file_name

As the root user, stop Oracle Clusterware on all the nodes in your Oracle RAC cluster by executing 
the following command:

# crsctl stop crs

Repeat this command on each node in your Oracle RAC cluster.

As the root user, restore the OCR by applying an OCR backup file that you identified in step 1 
using the following command, where file_name is the name of the OCR that you want to restore. 
Make sure that the OCR devices that you specify in the OCR configuration exist, and that these OCR devices 
are valid before running this command.

# ocrconfig -restore file_name

As the root user, restart Oracle Clusterware on all the nodes in your cluster by restarting each node, 
or by running the following command:

# crsctl start crs

Repeat this command on each node in your Oracle RAC cluster.

Use the Cluster Verify Utility (CVU) to verify the OCR integrity. Run the following command, 
where the -n all argument retrieves a list of all the cluster nodes that are configured as part of your cluster:

$ cluvfy comp ocr -n all [-verbose]


-- Recovering the OCR from an OCR Export File
Using the ocrconfig -export command enables you to restore the OCR using the -import option if your 
configuration changes cause errors.

To restore the previous configuration stored in the OCR from an OCR export file:

Place the OCR export file that you created previously with the ocrconfig -export command in an accessible 
directory on disk.

As the root user, stop Oracle Clusterware on all the nodes in your Oracle RAC cluster by executing 
the following command:

# crsctl stop crs

Repeat this command on each node in your Oracle RAC cluster.

As the root user, restore the OCR data by importing the contents of the OCR export file using the 
following command, where file_name is the name of the OCR export file:

# ocrconfig -import file_name

As the root user, restart Oracle Clusterware on all the nodes in your cluster by restarting each node, 
or by running the following command:

# crsctl start crs

Repeat this command on each node in your Oracle RAC cluster.

Use the CVU to verify the OCR integrity. Run the following command, where the -n all argument retrieves 
a list of all the cluster nodes that are configured as part of your cluster:

$ cluvfy comp ocr -n all [-verbose]


7.4 RMAN snapshot controlfile:
------------------------------

RMAN> SHOW SNAPSHOT CONTROLFILE NAME;

RMAN> CONFIGURE SNAPSHOT CONTROLFILE NAME TO 'ORACLE_HOME/dbf/scf/snap_prod.cf';


=================================
Note 8: Noticable items in 10g RAC
=================================

8.1 SPFILE:
-----------

If an initialization parameter applies to all instances, use *.<parameter> notation, otherwise
prefix the parameter with the name of the instance.
For example:

*.OPEN_CURSORS=500
prod1.OPEN_CURSORS=1000


8.2 Start and stop of RAC:
--------------------------

8.2.1 Stopping RAC:
-------------------


#### NOTE 1: ####

> Stop Oracle Clusterware or Cluster Ready Services Processes
If you are modifying an Oracle Clusterware or Oracle Cluster Ready Services (CRS) installation, 
then shut down the following Oracle Database 10g services.

Note:

You must perform these steps in the order listed.
Shut down any processes in the Oracle home on each node that might be accessing a database; for example, 
shut down Oracle Enterprise Manager Database Control.

Note:

Before you shut down any processes that are monitored by Enterprise Manager Grid Control, set a blackout in 
Grid Control for the processes that you intend to shut down. This is necessary so that the availability 
records for these processes indicate that the shutdown was planned downtime, rather than an unplanned system outage.
Shut down all Oracle RAC instances on all nodes. To shut down all Oracle RAC instances for a database, 
enter the following command, where db_name is the name of the database:

$ oracle_home/bin/srvctl stop database -d db_name

Shut down all ASM instances on all nodes. To shut down an ASM instance, enter the following command, 
where node is the name of the node where the ASM instance is running:

$ oracle_home/bin/srvctl stop asm -n node

Stop all node applications on all nodes. To stop node applications running on a node, enter the following command, 
where node is the name of the node where the applications are running

$ oracle_home/bin/srvctl stop nodeapps -n node

Log in as the root user, and shut down the Oracle Clusterware or CRS process by entering the following command 
on all nodes:

# CRS_home/bin/crsctl stop crs

#### END NOTE 1 ####


#### NOTE 2: ####

To stop process in an existing Oracle Real Application Clusters Database, where you want to shut down 
the entire database, complete the following steps.

-- Shut Down Oracle Real Application Clusters Databases
Shut down any existing Oracle Database instances on each node, with normal or immediate priority.
If Automatic Storage Management (ASM) is running, then shut down all databases that use ASM, and then shut down 
the ASM instance on each node of the cluster.

Note:

-- Stop All Oracle Processes
Stop all listener and other processes running in the Oracle home directories where you want to modify 
the database software.

Note:

If you shut down ASM instances, then you must first shut down all database instances that use ASM, 
even if these databases run from different Oracle homes.

-- Stop Oracle Clusterware or Cluster Ready Services Processes
If you are modifying an Oracle Clusterware or Oracle Cluster Ready Services (CRS) installation, 
then shut down the following Oracle Database 10g services.

Note:

You must perform these steps in the order listed.
Shut down any processes in the Oracle home on each node that might be accessing a database; for example, shut down 
Oracle Enterprise Manager Database Control.

Note:

Before you shut down any processes that are monitored by Enterprise Manager Grid Control, set a blackout in 
Grid Control for the processes that you intend to shut down. This is necessary so that the availability records 
for these processes indicate that the shutdown was planned downtime, rather than an unplanned system outage.
Shut down all Oracle RAC instances on all nodes. To shut down all Oracle RAC instances for a database, 
enter the following command, where db_name is the name of the database:

$ oracle_home/bin/srvctl stop database -d db_name

Shut down all ASM instances on all nodes. To shut down an ASM instance, enter the following command, 
where node is the name of the node where the ASM instance is running:

$ oracle_home/bin/srvctl stop asm -n node

Stop all node applications on all nodes. To stop node applications running on a node, enter the following command, 
where node is the name of the node where the applications are running

$ oracle_home/bin/srvctl stop nodeapps -n node

Log in as the root user, and shut down the Oracle Clusterware or CRS process by entering the following command 
on all nodes:

# CRS_home/bin/crsctl stop crs


#### END NOTE 2 ####


Notes about Starting up:
------------------------

crsd  : Cluster Ready Services Daemon (CRSD)
occsd : Oracle Cluster Synchronization Server Daemon (OCSSD), the CCS. 
evmd  : Event Manager Daemon (EVMD).  
evmlogger

The CRSD manages the HA functionality by starting, stopping, and failing over the application resources 
and maintaining the profiles and current states in the Oracle Cluster Registry (OCR) whereas the OCSSD 
manages the participating nodes in the cluster by using the voting disk. The OCSSD also protects against 
the data corruption potentially caused by "split brain" syndrome by forcing a machine to reboot. 


>Linux:

# cat /etc/inittab | grep crs
h3:35:respawn:/etc/init.d/init.crsd run > /dev/null 2>&1 </dev/null

# cat /etc/inittab | grep evmd
h1:35:respawn:/etc/init.d/init.evmd run > /dev/null 2>&1 </dev/null

# cat /etc/inittab | grep css
h2:35:respawn:/etc/init.d/init.cssd fatal > /dev/null 2>&1 </dev/null

/etc/init.d> ls -al *init*
init.crs
init.crsd
init.cssd
init.evmd

# cat /etc/inittab
..
..
h1:35:respawn:/etc/init.d/init.evmd run > /dev/null 2>&1 </dev/null
h2:35:respawn:/etc/init.d/init.cssd fatal > /dev/null 2>&1 </dev/null
h3:35:respawn:/etc/init.d/init.crsd run > /dev/null 2>&1 </dev/null

init.crsd -> calls crsd

correct order for stopping:   Reverse order of startup. crsd should be shutdown before
cssd and evmd. evmd should be shutdown before cssd.

init.crs stop:
	init.crsd
	init.evmd
	init.cssd

init.crs start
	init.cssd autostart|manualstart


-------------------------------------------
links:
http://dmx0201.nl.eu.abnamro.com:7900/wi
https://dmp0101.nl.eu.abnamro.com:1159/em
-------------------------------------------


=========================
Note 9: Oracle and HACMP:
=========================

9.1

IBM Smart Assist Program   
  
HACMP Smart Assists streamline implementation and configuration
The optional HACMP Smart Assist package simplifies implementation and configuration of HACMP in DB2�, Oracle� 
and WebSphere� environments by reading the application configuration data and configuring HACMP accordingly; 
the Smart Assists also provide all the necessary application monitors and scripts. All three Smart Assists 
are included in one inexpensive package. 

DB2 requires a cluster manager for its high availability/disaster recovery configurations and HACMP is the #1 
cluster manager for AIX. The DB2 Smart Assist is designed to deploy HACMP into an existing DB2 environment; 
the Smart Assist reads the existing DB2 configuration information and configures HACMP accordingly, 
providing all the necessary application monitors, start and stop scripts and so forth 
Because HACMP must be installed before Oracle itself is installed, the Oracle Smart Assist manages the entire 
Oracle installation process in addition to configuring HACMP. The Oracle Smart Assist automatically initiates 
the Oracle installation, then resumes control and completes the cluster configuration 
Although the WebSphere Application Server Network Deployment, Tivoli Directory Server (TDS), and DB2� components are 
inherently highly available, HACMP extends their capabilities to significantly reduce your risk of downtime or outages.
Like the DB2 Smart Assist, the WebSphere Smart Assist configures HACMP based on the existing WebSphere configuration 
and supplies the necessary application monitors, scripts and so forth. 

 
============================
Note 10: 10g RAC errors
============================


Note 1:
-------

OCFS not supported by kernel 
Could not get a raw device slot for disk access. 
Format error: Could not run "mkfs.ocfs", No such file or directory 
ocmstart.sh: Error: Restart is too freequent 
ocmstart.sh: Info: Check the system configuration and fix the problem 
ocmstart.sh: Info: After you fixed the problem, remove the timestamp file 
orace.ops.mgmt.cluster.ClusterException: PRKC-1007: Problem in creating directories on the nodes 
ORA-01092: ORACLE instance terminated. Disconnection Forced 
PRKP-1003: Startup operation partially failed 
ORA-01157: cannot identify/lock data file 
ORA-01110: data file 11 
ORA-00119: invalid specification for system parameter remote_listener 
ORA-00132: syntax error or unresolved network name 'LISTENERS_TEST' 
ORA-01078: failure in processing system parameters 
stat for /u01/oradata/test/spfiletest.ora failed - is not a valid raw device. 
Failed to start GSD on local node 
PRKR-1007: getting of cluster database test configuration failed 
PRKC-1019: Error creating handle to daemon on the node 
PRKO-2005: Application error: Failure in getting Cluster Database Configuration 
PRKP-1040: Failed to get status of the listeners associated with instance 
PRKR-1007: getting of cluster database configuration failed 
PRKC-1018: Error getting coordinator node 
CMCLI ERROR: OpenCommPort: connect failed with error 111. 
PRKC-1021: Problem in the clusterware - Failed to get list of active nodes from clusterware 
PRKR-1005: adding of cluster database test configuration failed 
PRKR-1064: General Exception in OCR 
Error: Oracle Cluster Registry can exist only as a shared file system file or as a shared raw partition 
Error: Some of the configuration assistans failed 
PRIF-12: failed to initialize cluster support services 
Warning: oracle1:4948 already configured 
INIT: id "hx" respawning too fast : disabled for 5 minutes 
Error: the value for SID may contain only alpha, numeric, and a few additional characters 
ASM: The pasword for the user "SYS" is not valid. Please specify a valid password. 
VIPCA: Starting GSD hangs at 65% 
ORA-12154: TNS: could not resolve the connect identifier specified 
Failed to add instances [fast1] on nodes [oracle1] of cluster database "fast" 
PRKR-1008: adding of instance fast1 on node oracle1 to cluster database fast failed 
CRS-0211: Resource ora.fast.fast1 inst has already been registered 
Handle to the cluster database is invalid, "null" 
Failed to start listeners on nodes "[oracle2]" of cluster database "fast" 
CRS-0215: Could not start resource ora.oracle2.LISTENER_ORACLE2.lsnr 
ORA-01078: failure in processing system parameters 
ORA-27040: file create error, unable to create file 
PRKP-1001: error starting instance last2 on node oracle2 
ORA-01078: failure in processing system parameters 
CRS-0215: Could not start resource ora.last.last2.inst 
ORA-17503: ksfdopn:2 Failed to open file +DSKGRP01/past/spfilepast.ora 
ORA-15055: unable to connect to ASM instance 
ORA-15100: invalid or missing diskgroup name 
ORA-15032: not all alterations performed 
ORA-15063: diskgroup "DSKGRP01" lacks quorum of 2 PST disks; 0 found 
PRKO-2105: Error in checking condition of VIP on node: oracle1 
PRKO-2106: Error in checking condition of GSD on node: oracle1 
PRKO-2016: Error in checking condition of listener on node: oracle1 
PRKO-2116: Error in checking condition of ONS on node: oracle1 
error connecting to CRSD at [(ADDRESS=(PROTOCOL=ipc)(KEY=ora_crsqs))] clsccon 184 
VIP address has been assigned to the wrong NIC! (10g) 
CRS-0233: Resource or relatives are currently involved with another operation 
ORA-15031: disk specification 'ORCL:DISK01' matches no disks 
ORA-15014: location 'ORCL:DISK01' is not in the discovery set 
PRKR-1007: getting of cluster database test configuration failed 
PRKR-1078: Database test cannot be administered using current version of srvconfig. 
PRKO-2005: Application error: Failure in getting Cluster Database Configuration for: test 
PRKP-1040: Failed to get status of listeners associated with instance test1 on nodeoracle1 
PRKH-1001: HASContext Internal Error 
ORA-09817: Write to audit failed 
Error in creating link from /u01/crs/oracle... to /u01/crs/oracle... 
10G release 2 sqlplus: error while loading shared libraries: libaio.so.1: cannot open shared object file 
CRS-0168: Cannot create the backup file. 

Note 2:
-------

Q:
On my 2 node home RAC on 32 bit CentOS Linux, I had a crash on one of the
nodes. Afterwards, I am not able to start my database from either of the
nodes. In alert log, I find that, it is not able to do instance crash
recovery because of a corrupt block in redo log. (ORA-1172). Is there
something you can suggest to get the DB back up?I don't have a back up of
the database.

Q:
hpunix
oracle9205 rac
??node1?????????,????vg??????RAC????
startup ?node.
????,???????ORA-12545: Connect failed because target host or object does not exist ,???????.?????,????????
local_listener?remote_listener.???

Q:
Hello all, 

We are running Oracle 10.1.0.4 two node RAC on Egenera Blade Frame 
using RedHat AS 3.0 with EMC CX500 as storage. 


When there is heavy load on this system (i.e. running export and RMAN), 
one of the nodes get evicted from the cluster. Usually this is the 
error you see in the alert log on Node A 


Sun Mar 26 22:24:33 2006 
skgxpvfymmtu: process 10781 failed because of a resource problem in the 
OS. The OS has most likely run out of buffers 
Sun Mar 26 22:24:33 2006 
Errors in file 
/u01/app/oracle/admin/P1/bdump/p11_ora_10781.trc:ORA-00603: ORACLE 
server session terminated by fatal error 
ORA-27504: IPC error creating OSD context 

A:
Turns out, this was resolved by setting the MTU size on the private 
interconnect network down to 1500.  The default MTU size for Egenera 
blades (because of their high speed switched backplane) is 16896.  The 
heart of the issues is how RHEL 3 handles is memory management. 
Typical packets are buffered in memory and will fit nicely into the 
standard 4K block.  The larger MTU size (I don't remember what the 
boundary was) requires larger blocks (32K) and it is more difficult to 
find contiguous (and that's the key) blocks to buffer the data.  We 
weren't running out of memory per se, but we were running out of 
contiguous memory blocks, and in the time it took to free up the 
necessary memory, the packet was dropped, the node was determined to be 
unreachable, and summarily evicted to preserve data integrity. 

We have confirmed that setting the MTU size to 1500 has resolved the 
node eviction issue.  We are also considering moving to RHEL 4 (perhaps 
64-bit even) as another solution as we understand the memory management 
features in RHEL 4 are better than what is available in RHEL 3. 

Q:
I�ve been doing a lot of 4 node 10gR2 RAC stuff lately on 4 different clusters. I ran into a problem that I 
thought I�d share since I can�t find very much on the web or in Metalink. It is a real headache, but �easy� 
to fix. I put easy in quotes because it is only easy to fix if you know what to do. First, the problem.

ORA-30012
I was running a Real Application Clusters stress test that I developed called thrash. It is not sophisticated, 
but it does put a lot of pain on the instances, servers and storage. It consists of staggered instance rebooting 
followed by the creation of a small tablespace�one per instance. Once the tablespaces are created, 
a set of sqlplus sessions alter the tablespaces adding a significant number of random sized data files. 
The tablespaces are dropped and the instances are rebooted in a staggered fashion. As I said, 
each instance is sustaining this workload. I know it is nothing like a production workload. 
It�s just one test I run in hopes of exposing file manipulation bugs associated with tablespace creation, 
datafile addition and tablespace dropping. That is, I�m looking for filesystem bugs exposed by Oracle tablespace 
manipulation.

Bumps in the Road
On occasion I was getting instances that would stop participating in the thrash. I read the alert logs 
and found that the error was ORA-30012�which made no sense to me since it was happening out of the blue. 
In fact, it was happening after running thrash for as long as 12 hours. To show you the oddity of these alert 
log entries, I�ll provide grep(1) output from all the alert logs (ORACLE_BASE is on a CFS):

$ grep �does not exist or of wrong type� *log
alert_PROD2.log:ORA-30012: undo tablespace �UNDOTBS2' does not exist or of wrong type
alert_PROD2.log:ORA-30012: undo tablespace � does not exist or of wrong type
alert_PROD3.log:ORA-30012: undo tablespace �UNDOTBS3' does not exist or of wrong type
alert_PROD3.log:ORA-30012: undo tablespace � does not exist or of wrong type
alert_PROD4.log:ORA-30012: undo tablespace �UNDOTBS4' does not exist or of wrong type

How strange! What is an undo tablespace with a NULL name? Part of the SPFILE is shown later in this post 
establishing the fact that I assign undo to instances explicity so what gives?

The following is a snippet to show that the error was occuring during ALTER DATABASE OPEN:

Errors in file /u01/app/oracle/admin/PROD/udump/prod2_ora_1569.trc:
ORA-30012: undo tablespace �UNDOTBS2' does not exist or of wrong type
Fri Mar 16 04:41:38 2007
Error 30012 happened during db open, shutting down database
USER: terminating instance due to error 30012
Instance terminated by USER, pid = 1569
ORA-1092 signalled during: ALTER DATABASE OPEN�

The trace file wasn�t much help:

$ more /u01/app/oracle/admin/PROD/udump/prod2_ora_1569.trc
/u01/app/oracle/admin/PROD/udump/prod2_ora_1569.trc
Oracle Database 10g Enterprise Edition Release 10.2.0.1.0 - 64bit Production
With the Partitioning, Real Application Clusters, OLAP and Data Mining options
ORACLE_HOME = /u01/app/oracle/product/10.2.0/rac_1
System name:    Linux
Node name:      qar14s22
Release:        2.6.9-42mxs351RHELupdate4
Version:        #1 SMP Tue Mar 6 16:37:53 PST 2007
Machine:        x86_64
Instance name: PROD2
Redo thread mounted by this instance: 2
Oracle process number: 16
Unix process pid: 1569, image: oracle@qar14s22 (TNS V1-V3)
*** SERVICE NAME:() 2007-03-16 04:41:35.038
kspgetpeeq:  kspasci not KSPASCNOP (0�110001 != 0�0)
*** SESSION ID:(137.25) 2007-03-16 04:41:38.788
ORA-30012: undo tablespace �UNDOTBS2' does not exist or of wrong type

After exhausting my patience spelunking for information on the web and Metalink, I asked my fellow OakTable 
Network members. Jo�e Senegacnik of dbprof.com replied with:

If this is a RAC database then you need to specify the instance name in the init.ora or spfile together 
with the undo_tablespace parameter. 

Where did he come up with that? I asked him if he�d actually hit this before. His answer was:

Yes, a couple of weeks ago I have experienced it on RAC on Windows. One node had problems with the undo_tablespace 
parameter after an unplanned database restart. The undo_tablespace parameter was changed in runtime months 
before but obviously it lacked the SID information and this caused problems after database restart.

A DBCA NO-NO
Folks, this database was created with DBCA. I don�t know how I haven�t seen this issue before, but the problem 
is that DBCA does not configure the SPFILE with explicit assignments for the INSTANCE_NAME parameter. 
For example, the following are a couple of strings(1)|grep(1) command pipelines that would return the 4 
assigments of the INSTANCE_NAME parameter had DBCA set it up that way:

$ strings - spfilePROD.ora | grep �I instance_name
$ strings - spfilePROD.ora | grep �^PROD�
PROD2.__db_cache_size=71303168
PROD4.__db_cache_size=67108864
PROD3.__db_cache_size=75497472
PROD1.__db_cache_size=71303168
PROD3.__java_pool_size=4194304
PROD2.__java_pool_size=4194304
PROD4.__java_pool_size=4194304
PROD1.__java_pool_size=4194304
PROD3.__large_pool_size=4194304
PROD2.__large_pool_size=4194304
PROD4.__large_pool_size=4194304
PROD1.__large_pool_size=4194304
PROD2.__shared_pool_size=79691776
PROD4.__shared_pool_size=83886080
PROD3.__shared_pool_size=75497472
PROD1.__shared_pool_size=79691776
PROD3.__streams_pool_size=0
PROD2.__streams_pool_size=0
PROD4.__streams_pool_size=0

PROD1.__streams_pool_size=0
PROD3.instance_number=3
PROD4.instance_number=4
PROD2.instance_number=2
PROD1.instance_number=1
PROD3.thread=3
PROD2.thread=2
PROD4.thread=4
PROD1.thread=1
PROD1.undo_tablespace=�UNDOTBS1'
PROD3.undo_tablespace=�UNDOTBS3'
PROD4.undo_tablespace=�UNDOTBS4'
PROD2.undo_tablespace=�UNDOTBS2'

Remedy
It was exactly what Joze said. I set explicit assignments for the ISNTANCE_NAME parameters as follows 
and the problem has gone away.

PROD1.INSTANCE_NAME=PROD1
PROD2.INSTANCE_NAME=PROD2
PROD3.INSTANCE_NAME=PROD3
PROD4.INSTANCE_NAME=PROD4

Hope this helps some googler someday.


Q:
On 2/7/07, A Ebadi <ebadi01@xxxxxxxxx> wrote:     We are trying to install a 
2-node RAC with ASM (Oracle 10.2.0.2.0 on Solaris 10) and getting the error 
below when using dbca to create the database.  The error occurs when dbca is 
done creating the DB (100%).  Any suggestions? 
   
  PRKP-1001: Error starting instance atlprd2 on node f10bb5-01
  CRS-0215: Could not start resource 'ora.atlprd.atlprd2.inst'
   
  We have tried starting atlprd2 instance manually and get the error below 
regarding an issue with spfile which is on ASM.
   
  ORA-01565: error in identifying file '+SYS_DG/atlprd/spfileatlprd.ora'
  ORA-17503: ksfdopn:2 Failed to open file +SYS_DG/atlprd/spfileatlprd.ora
  ORA-03113: end-of-file on communication channel
   
  By the way, instance one (atlprd1) is fine.


A:
Cause
=====
  Installing the 10.2.0.2 patchset in a RAC installation on any Unix platform 
does not correctly update the libknlopt.a file on all nodes. The local node 
where the installer is run does update libknlopt.a but remote nodes do not get 
the updated file. This can lead to dumps or internal errors on the remote nodes 
if Oracle is subsequently relinked.
  Solution
========
  There are two solutions for this problem: 
   
  1) Manual copy of the "libknlopt.a" library to the offending nodes : 
      -  ensure all instances are shut down 
    -  manually copy $ORACLE_HOME/rdbms/lib/libknlopt.a from the local node to 
all remote nodes 
    -  relink Oracle on all nodes : 
       make -f ins_rdbms.mk ioracle 
   2) Install the patchset on every node using the "-local" option: 
  On Unix: 
runInstaller -updateNodeList  -local ORACLE_HOME=$ORACLE_HOME 
CLUSTER_NODES=node1,node2,... 
  On Windows: 
setup.exe -updateNodeList  -local ORACLE_HOME=%ORACLE_HOME% 
CLUSTER_NODES=node1,node2,... 


root@zd110l04:/etc#cat inittab | grep ora
orapw:2:wait:/etc/loadext -L /etc


#############################################################################################
#############################################################################################
#############################################################################################


=====================================================================================
Secton 20. How to trace in UNIX.
=====================================================================================


IMPORTANT NOTICE:


>>> This small note is ONLY about tracing a process, in order to find out  <<<
>>> what objects it (tries) to access (e.g. files) and what syscalls       <<<
>>> it makes. So the purpose of this note, is simply trying to find out    <<<
>>> what a process "does under the hood".                                  <<<
>>> The objective is ofcourse, to have some extra help if you need         <<<
>>> to troubleshoot an illbehaving or failing process.                     <<<
>>>                                                                        <<<
>>> But an application might have it's own tracing facility, or it might   <<<
>>> be switched to run in some verbose mode, which might present you much  <<<
>>> better troubleshooting information, than a trace run from the OS.      <<<
>>> You MUST realise, that the use of the "right" trace facility of most   <<<
>>> commercial (large) programs, often is the only proper way to arrive    <<<
>>> to correct conclusions.                                                <<<
>>>                                                                        <<<
>>> Please be aware that this note does not pretent to be anything more    <<<
>>  than just a "light-weight" and incomplete introduction on this subject.<<<
>>> Still, I hope I can demonstrate a few instructive examples on          <<<
>>> AIX, HP, Solaris and Linux.                                            <<<
>>> But please remember: it's really limited in scope.                     <<<
>>>                                                                        <<<
>>> Tracing to a logfiles can produce really large logs.                   <<<
>>> So be sure you have sufficient space in the filesystem you want        <<<
>>> to save your logs to, or first experiment with short times that your   <<<
>>> trace runs.                                                            <<<
>>>                                                                        <<<
>>> Be carefull on production systems.                                     <<<
>>> Always first test your trace setup, on a test environment.             <<<


============================================================================
1. First some info before you trace:
============================================================================ 


When you study your trace files, you may come accross a number of error messages or error codes.
The errorcodes we mean here, are the codes that are also visible in the file "errno.h". 
This is a header file in the standard library of C programming language. 

Those are a subset of the codes that a program might get when it requests a service 
from the system (like for example, "open file").

That's certainly is not all there is that you might run into about errors and corresponding codes, 
but it constitues an important base of what you can encounter in traces.

Suppose you find something like this in a trace:

  vnop_lookup(dvp = F100010034228BF8, flag = 0002) = 0002, *vpp = 0000
  return from statx. error ENOENT [13 usec]

What can ENOENT mean? If you don't find some more "explaining text" 'close' to this line, then you can find
from the table below, that it means "No such file or directory".

Actually, I produced 2 lists, one from Linux and one from AIX, 
just to prove they are quite the same (there is no garantee that they are *exactly* the same on all systems).

By the way, if you go search for that "errno.h" file (or similar name), on your own system, 
and take a look at the contents, you can create the list yourself for your particular unix/linux system.
You can find that file (likely) in "/usr/include/sys" 
But for easy reference, we list the most important errno's for 2 representative unixes.
(Yes.. one listing would have been quite sufficient).


1.1 Errcodes Linux (generic):
=============================


#define EPERM            1      /* Operation not permitted */
#define ENOENT           2      /* No such file or directory */
#define ESRCH            3      /* No such process */
#define EINTR            4      /* Interrupted system call */
#define EIO              5      /* I/O error */
#define ENXIO            6      /* No such device or address */
#define E2BIG            7      /* Arg list too long */
#define ENOEXEC          8      /* Exec format error */
#define EBADF            9      /* Bad file number */
#define ECHILD          10      /* No child processes */
#define EAGAIN          11      /* Try again */
#define ENOMEM          12      /* Out of memory */
#define EACCES          13      /* Permission denied */
#define EFAULT          14      /* Bad address */
#define ENOTBLK         15      /* Block device required */
#define EBUSY           16      /* Device or resource busy */
#define EEXIST          17      /* File exists */
#define EXDEV           18      /* Cross-device link */
#define ENODEV          19      /* No such device */
#define ENOTDIR         20      /* Not a directory */
#define EISDIR          21      /* Is a directory */
#define EINVAL          22      /* Invalid argument */
#define ENFILE          23      /* File table overflow */
#define EMFILE          24      /* Too many open files */
#define ENOTTY          25      /* Not a typewriter */
#define ETXTBSY         26      /* Text file busy */
#define EFBIG           27      /* File too large */
#define ENOSPC          28      /* No space left on device */
#define ESPIPE          29      /* Illegal seek */
#define EROFS           30      /* Read-only file system */
#define EMLINK          31      /* Too many links */
#define EPIPE           32      /* Broken pipe */
#define EDOM            33      /* Math argument out of domain of func */
#define ERANGE          34      /* Math result not representable */
#define EDEADLK         35      /* Resource deadlock would occur */
#define ENAMETOOLONG    36      /* File name too long */
#define ENOLCK          37      /* No record locks available */
#define ENOSYS          38      /* Function not implemented */
#define ENOTEMPTY       39      /* Directory not empty */
#define ELOOP           40      /* Too many symbolic links encountered */
#define EWOULDBLOCK     EAGAIN  /* Operation would block */
#define ENOMSG          42      /* No message of desired type */
#define EIDRM           43      /* Identifier removed */
#define ECHRNG          44      /* Channel number out of range */
#define EL2NSYNC        45      /* Level 2 not synchronized */
#define EL3HLT          46      /* Level 3 halted */
#define EL3RST          47      /* Level 3 reset */
#define ELNRNG          48      /* Link number out of range */
#define EUNATCH         49      /* Protocol driver not attached */
#define ENOCSI          50      /* No CSI structure available */
#define EL2HLT          51      /* Level 2 halted */
#define EBADE           52      /* Invalid exchange */
#define EBADR           53      /* Invalid request descriptor */
#define EXFULL          54      /* Exchange full */
#define ENOANO          55      /* No anode */
#define EBADRQC         56      /* Invalid request code */
#define EBADSLT         57      /* Invalid slot */
#define EDEADLOCK       EDEADLK
#define EBFONT          59      /* Bad font file format */
#define ENOSTR          60      /* Device not a stream */
#define ENODATA         61      /* No data available */
#define ETIME           62      /* Timer expired */
#define ENOSR           63      /* Out of streams resources */
#define ENONET          64      /* Machine is not on the network */
#define ENOPKG          65      /* Package not installed */
#define EREMOTE         66      /* Object is remote */
#define ENOLINK         67      /* Link has been severed */
#define EADV            68      /* Advertise error */
#define ESRMNT          69      /* Srmount error */
#define ECOMM           70      /* Communication error on send */
#define EPROTO          71      /* Protocol error */
#define EMULTIHOP       72      /* Multihop attempted */
#define EDOTDOT         73      /* RFS specific error */
#define EBADMSG         74      /* Not a data message */
#define EOVERFLOW       75      /* Value too large for defined data type */
#define ENOTUNIQ        76      /* Name not unique on network */
#define EBADFD          77      /* File descriptor in bad state */
#define EREMCHG         78      /* Remote address changed */
#define ELIBACC         79      /* Can not access a needed shared library */
#define ELIBBAD         80      /* Accessing a corrupted shared library */
#define ELIBSCN         81      /* .lib section in a.out corrupted */
#define ELIBMAX         82      /* Attempting to link in too many shared libraries */
#define ELIBEXEC        83      /* Cannot exec a shared library directly */
#define EILSEQ          84      /* Illegal byte sequence */
#define ERESTART        85      /* Interrupted system call should be restarted */
#define ESTRPIPE        86      /* Streams pipe error */
#define EUSERS          87      /* Too many users */
#define ENOTSOCK        88      /* Socket operation on non-socket */
#define EDESTADDRREQ    89      /* Destination address required */
#define EMSGSIZE        90      /* Message too long */
#define EPROTOTYPE      91      /* Protocol wrong type for socket */
#define ENOPROTOOPT     92      /* Protocol not available */
#define EPROTONOSUPPORT 93      /* Protocol not supported */
#define ESOCKTNOSUPPORT 94      /* Socket type not supported */
#define EOPNOTSUPP      95      /* Operation not supported on transport endpoint */
#define EPFNOSUPPORT    96      /* Protocol family not supported */
#define EAFNOSUPPORT    97      /* Address family not supported by protocol */
#define EADDRINUSE      98      /* Address already in use */
#define EADDRNOTAVAIL   99      /* Cannot assign requested address */
#define ENETDOWN        100     /* Network is down */
#define ENETUNREACH     101     /* Network is unreachable */
#define ENETRESET       102     /* Network dropped connection because of reset */
#define ECONNABORTED    103     /* Software caused connection abort */
#define ECONNRESET      104     /* Connection reset by peer */
#define ENOBUFS         105     /* No buffer space available */
#define EISCONN         106     /* Transport endpoint is already connected */
#define ENOTCONN        107     /* Transport endpoint is not connected */
#define ESHUTDOWN       108     /* Cannot send after transport endpoint shutdown */
#define ETOOMANYREFS    109     /* Too many references: cannot splice */
#define ETIMEDOUT       110     /* Connection timed out */
#define ECONNREFUSED    111     /* Connection refused */
#define EHOSTDOWN       112     /* Host is down */
#define EHOSTUNREACH    113     /* No route to host */
#define EALREADY        114     /* Operation already in progress */
#define EINPROGRESS     115     /* Operation now in progress */
#define ESTALE          116     /* Stale NFS file handle */
#define EUCLEAN         117     /* Structure needs cleaning */
#define ENOTNAM         118     /* Not a XENIX named type file */
#define ENAVAIL         119     /* No XENIX semaphores available */
#define EISNAM          120     /* Is a named type file */
#define EREMOTEIO       121     /* Remote I/O error */
#define EDQUOT          122     /* Quota exceeded */
#define ENOMEDIUM       123     /* No medium found */
#define EMEDIUMTYPE     124     /* Wrong medium type */


The list above should actually be sufficient, but we shall show next, the corresponding
list for AIX:


1.2 errcodes AIX:
=================


#define EPERM   1       /* Operation not permitted              */
#define ENOENT  2       /* No such file or directory            */
#define ESRCH   3       /* No such process                      */
#define EINTR   4       /* interrupted system call              */
#define EIO     5       /* I/O error                            */
#define ENXIO   6       /* No such device or address            */
#define E2BIG   7       /* Arg list too long                    */
#define ENOEXEC 8       /* Exec format error                    */
#define EBADF   9       /* Bad file descriptor                  */
#define ECHILD  10      /* No child processes                   */
#define EAGAIN  11      /* Resource temporarily unavailable     */
#define ENOMEM  12      /* Not enough space                     */
#define EACCES  13      /* Permission denied                    */
#define EFAULT  14      /* Bad address                          */
#define ENOTBLK 15      /* Block device required                */
#define EBUSY   16      /* Resource busy                        */
#define EEXIST  17      /* File exists                          */
#define EXDEV   18      /* Improper link                        */
#define ENODEV  19      /* No such device                       */
#define ENOTDIR 20      /* Not a directory                      */
#define EISDIR  21      /* Is a directory                       */
#define EINVAL  22      /* Invalid argument                     */
#define ENFILE  23      /* Too many open files in system        */
#define EMFILE  24      /* Too many open files                  */
#define ENOTTY  25      /* Inappropriate I/O control operation  */
#define ETXTBSY 26      /* Text file busy                       */
#define EFBIG   27      /* File too large                       */
#define ENOSPC  28      /* No space left on device              */
#define ESPIPE  29      /* Invalid seek                         */
#define EROFS   30      /* Read only file system                */
#define EMLINK  31      /* Too many links                       */
#define EPIPE   32      /* Broken pipe                          */
#define EDOM    33      /* Domain error within math function    */
#define ERANGE  34      /* Result too large                     */
#define ENOMSG  35      /* No message of desired type           */
#define EIDRM   36      /* Identifier removed                   */
#define ECHRNG  37      /* Channel number out of range          */
#define EL2NSYNC 38     /* Level 2 not synchronized             */
#define EL3HLT  39      /* Level 3 halted                       */
#define EL3RST  40      /* Level 3 reset                        */
#define ELNRNG  41      /* Link number out of range             */
#define EUNATCH 42      /* Protocol driver not attached         */
#define ENOCSI  43      /* No CSI structure available           */
#define EL2HLT  44      /* Level 2 halted                       */
#define EDEADLK 45      /* Resource deadlock avoided            */
#define ENOTREADY       46      /* Device not ready             */
#define EWRPROTECT      47      /* Write-protected media        */
#define EFORMAT         48      /* Unformatted media            */
#define ENOLCK          49      /* No locks available           */
#define ENOCONNECT      50      /* no connection                */
#define ESTALE          52      /* no filesystem                */
#define EDIST           53      /* old, currently unused AIX errno*/
#define EINPROGRESS     55      /* Operation now in progress */
#define EALREADY        56      /* Operation already in progress */
#define ENOTSOCK        57      /* Socket operation on non-socket */
#define EDESTADDRREQ    58      /* Destination address required */
#define EDESTADDREQ     EDESTADDRREQ /* Destination address required */
#define EMSGSIZE        59      /* Message too long */
#define EPROTOTYPE      60      /* Protocol wrong type for socket */
#define ENOPROTOOPT     61      /* Protocol not available */
#define EPROTONOSUPPORT 62      /* Protocol not supported */
#define ESOCKTNOSUPPORT 63      /* Socket type not supported */
#define EOPNOTSUPP      64      /* Operation not supported on socket */
#define EPFNOSUPPORT    65      /* Protocol family not supported */
#define EAFNOSUPPORT    66      /* Address family not supported by protocol family */
#define EADDRINUSE      67      /* Address already in use */
#define EADDRNOTAVAIL   68      /* Can't assign requested address */
#define ENETDOWN        69      /* Network is down */
#define ENETUNREACH     70      /* Network is unreachable */
#define ENETRESET       71      /* Network dropped connection on reset */
#define ECONNABORTED    72      /* Software caused connection abort */
#define ECONNRESET      73      /* Connection reset by peer */
#define ENOBUFS         74      /* No buffer space available */
#define EISCONN         75      /* Socket is already connected */
#define ENOTCONN        76      /* Socket is not connected */
#define ESHUTDOWN       77      /* Can't send after socket shutdown */
#define ETIMEDOUT       78      /* Connection timed out */
#define ECONNREFUSED    79      /* Connection refused */
#define EHOSTDOWN       80      /* Host is down */
#define EHOSTUNREACH    81      /* No route to host */
#define ERESTART        82      /* restart the system call */
#define EPROCLIM        83      /* Too many processes */
#define EUSERS          84      /* Too many users */
#define ELOOP           85      /* Too many levels of symbolic links      */
#define ENAMETOOLONG    86      /* File name too long                     */
#define EDQUOT          88      /* Disc quota exceeded */
#define ECORRUPT        89      /* Invalid file system control data */
#define EREMOTE         93      /* Item is not local to host */
#define ENOSYS          109     /* Function not implemented  POSIX */
#define EMEDIA          110     /* media surface error */
#define ESOFT           111     /* I/O completed, but needs relocation */
#define ENOATTR         112     /* no attribute found */
#define ESAD            113     /* security authentication denied */
#define ENOTRUST        114     /* not a trusted program */
#define ETOOMANYREFS    115     /* Too many references: can't splice */
#define EILSEQ          116     /* Invalid wide character */
#define ECANCELED       117     /* asynchronous i/o cancelled */
#define ENOSR           118     /* temp out of streams resources */
#define ETIME           119     /* I_STR ioctl timed out */
#define EBADMSG         120     /* wrong message type at stream head */
#define EPROTO          121     /* STREAMS protocol error */
#define ENODATA         122     /* no message ready at stream head */
#define ENOSTR          123     /* fd is not a stream */
#define ECLONEME        ERESTART /* this is the way we clone a stream ... */
#define ENOTSUP         124     /* POSIX threads unsupported value */
#define EMULTIHOP       125     /* multihop is not allowed */
#define ENOLINK         126     /* the link has been severed */
#define EOVERFLOW       127     /* value too large to be stored in data type */


Actually, this is only a very small list of errors and code: 
It is ONLY associated with the interaction of a process with the system. 
And even in that context, this is a limited list.

There are ofcourse also many classes of errors you will never see in a trace.
Think of the possible errors that can be seen at boottime of a system, or what an 
error logging daemon might write in a logfile, can all be a very different story.


============================================================================
2. Tracing in Linux:
============================================================================ 


2.1.strace:
===========


>>> strace example on Linux:

One main trace utility on most Linux distro's, is the "strace" command.
You can use it with many parameters, but the "-o outputfile" is very important, in order to save the output to a file.

Use it like:

# strace -o logfile <name_of_command_or_program_you_want_to_trace> 

# strace -o logfile -p <process_id>     # In cases where you want to trace a process that is already running, 
                                        # pass the -p option to strace.


Because strace will show you the systemcalls and signals, you can use it to reveal whether a program cannot
find a file, or does not have permissions to read (or write to) a file. In such a case, a program might fail.


Example 1:
----------

Suppose we have a file called "/etc/security.conf". Now we run a utility to read the file (like cat, pg, more, less etc..)
as a normal user, which user does not have permissions to read the file. Let's trace that event to a logfile, and see
what we can discover.

$ strace -o strace_example.log less /etc/security.conf

A trace file can get pretty long, but you should just browse it and be alert on what seems to be an error reported.
So, if we take a look in the logfile "strace_example.log"

  ..
  ..
  open("/etc/security.conf", O_RDONLY|O_LARGEFILE) = -1 EACCES (Permission denied)
  write(2, "/etc/security.conf: Permission denied\n", 32) = 32
  ..
  ..

We can clearly see, that our program failed due to lack of permission.

Example 2:
----------

You can use strace in many ways. One other famous "error" you might find using strace, is that a program needs a libary,
but can't find it.
Like in this example;

  ..
  open("/opt/tux/cbl/lib/libdcpybk.so", O_RDONLY) = -1 
  ENOENT (No such file or directory)
  ..

Remark:

To find out what libraries a program needs, you might also try the ldd command.
For example, what uuencode needs is shown with:

$ ldd uuencode
uuencode needs:
         /usr/lib/libc.a(shr.o)
         /unix
         /usr/lib/libcrypt.a(shr.o)


2.2. ltrace:
============

While "strace" deals with systemcalls, if you want to track what library calls an application does, 
you can use the "ltrace" command.
It works really similar to "strace".

Example:

$ ltrace -o ls_example_trace_file.trc ls


2.3. LTT Linux Trace Toolkit:
=============================

Strace, as we have seen above, will trace only one process and present the result in text form. To trace many processes in
a given period of time, Linux Trace Toolkit (LTT) is a better choice. LTT is distributed as free software under GPL. 
The trace toolkit provides a daemon, which will capture the events and write it to disk. 

It's (generally) not a standard feature of Linux, and you need to obtain it elswhere. If you are interested, just Google on
Linux Trace Toolkit, to find current info.

Basically, you run the tracedaemon, and after a while, you use the tracevisualizer to view results
in graphical form. 


2.4. Other possible usefull Linux commands (limited list):
==========================================================

Although not directly related to tracing, the following limited list of commands might help in creating a better view of
your system and processes. I am sure you are familiair with them, but let's list them anyway:

-- Show your OS version:

# cat /proc/version 
# uname -a

-- Show the open files that a process uses:

# pfiles pid

-- Show the jobs that are scheduled (in the account you use) from cron:

# crontab -l

-- What are the standard mounted filesystems: That's defined in "/etc/fstab"

# cat /etc/fstab

-- Which processes are using a certain filesystem?

# fuser -c /filesystem     # We mean the "mountpoint", like for example "/apps/oracle"

-- Show memory usage of a process:

# pmap -d pid                       # (Most important options: -x  Show the extended format; -d Show the device format.)
                                    # (And pid is the process-id, as visible in the command "ps -ef".)

-- Show system memory:

# cat /proc/meminfo
# /usr/sbin/dmesg | grep "Physical"
# free                              # (the free command)   

-- Swap usage:

# cat /proc/swaps                   # Above 60%-70% it's getting scary
# cat /proc/meminfo

-- cpu info:

# cat /proc/cpuinfo

-- user and process limits:

Sometimes, when a process runs under some account, and it fails for no immediate reason, it might be
worth checking the "ulimit" of that account (like max filesize, max open files, number of files etc..)
use it under that account as:

# ulimit (-a)

-- Show processtree of parent and children:

# pstree pid                       # on some distros ptree is implemented


-- Show the system error report / error log:

# cat /var/log/messages | more    (# more will ensure that not all contents scroll at your screen "at once", until the end is reached)


-- Determine the type of a file (e.g. is it ascii, or another type of file?)

# file file_name                  # (the command is really named "file")


-- Show free/used space of the filesystems:

# df -m                           # m in MB; k in KB

If there are many filesystems, you might want to see just the top 5 that are the lowest on free space:

# df -k |awk '{print $4,$7}' |grep -v "Filesystem" | sort -n | tail -5

-- How to become another user, or possibly root:

# su - accountname       # (switch to that accountname like "su - albert")
# su -                   # (switch to root)
                         # if the sudo utility is implemented, you might try the command "sudo -l" to see what you might execute.

-- Carefull!! How to kill a process "the hard way"?

# kill -9 PID              # carefull, don't kill the wrong one; not recommended unless you don't have a choice.

-- Carefull!! How to kill all your processes "the hard way", all at once?

# kill -9 -1               # very carefull; not recommended unless you don't have a choice.
# killall                  # implemented on some distros. very carefull; not recommended unless you don't have a choice.

-- Show your uid (userid) and gid (groupid):

# id

-- refreshing (restarting) inetd after modifying "/etc/inetd.conf"

# service xinetd restart	    # depending on the distro, like RedHat					
# /etc/init.d/inetd restart	

-- To show the init runlevel:

# who -r 

-- Show uptime of system plus average load (15 minutes)

# uptime

-- Show the last logged on users: account name & pts & date (history since last restart)

# last | more


============================================================================
3. Tracing in AIX:
============================================================================

In AIX, tracing commands are available like "truss", "syscalls" and "trace".

First we will talk about the "trace" facility, to which AIX also offers a userfriendly 
interface. It's a menu based system (via smitty). But you can use "trace" on the commandline as well.
The neat thing here is that you can trace a PID, a program, or just all.

We will start with the command "smitty trace". We will instruct the system to create 
a raw tracefile first (not easily readable), and then, after we have stopped tracing, we create
an ascii (readable) file, from the raw file.


3.1. Setting up a trace with "smitty trace":
============================================


>>> Define and start the trace:
-------------------------------

You can start with

$ smitty trace

The following menu appears:

Move cursor to desired item and press Enter.

  START Trace
  STOP Trace
  Generate a Trace Report
  Manage Trace
  Manage Event Groups


First we choose "START Trace"

The following menu appears:

FIG. 1.

                                                        [Entry Fields]
  EVENT GROUPS to trace                              []               
  ADDITIONAL event IDs to trace                      []               
  Event Groups to EXCLUDE from trace                 []               
  Event IDs to EXCLUDE from trace                    []               
  Process IDs to Trace                               []               
  Program to Trace                                   []
  Propagate Tracing to                               [new processes and threads] 
  Trace MODE                                         [alternate]                 
  STOP when log file full?                           [no]                        
  LOG FILE                                           [/var/adm/ras/trcfile]
  SAVE PREVIOUS log file?                            [no]                  
  Omit PS/NM/LOCK HEADER to log file?                [yes]                 
  Omit DATE-SYSTEM HEADER to log file?               [no]                  
  Run in INTERACTIVE mode?                           [no]                  
  Trace BUFFER SIZE in bytes                         [262144]              
  LOG FILE SIZE in bytes                             [2621440]             
  Buffer Allocation                                  [automatic]  

Now move to the item:

- "LOG FILE":

Now we adjust the logfile location from the default "/var/adm/ras/trcfile" to another suitable filesystem and filename,
like "/tmp/trcraw" (the /var filesystem is usually not a good idea to store your own large tracefile)
In this example, we use "/tmp" as the filesystem to store our tracefile (if there is enough free space).
And we let the tracefile has the name of "trcraw", because it will not contain readable text (at first),
hence the "raw".

Next, move to the item:

- "LOG FILE SIZE in bytes":

It might be a good idea to limit the size of the tracefile. For exmple, if you only have 1GB free in /tmp,
you must stay well below that size.
But you will see that tracing to file, is like "exploding" the filesize. It can grow incredibly fast, also
depending on the event groups you trace.
Undoubtly, you will see that for yourself. If you trace on too many events, it can be as bad as 500MB per minute.
But in this example, we stay "modest" in sizes.

So here, we have taken the example value of 100MB (104857600 bytes)


FIG. 2.
                                                        [Entry Fields]
  EVENT GROUPS to trace                              []      
  ADDITIONAL event IDs to trace                      []      
  Event Groups to EXCLUDE from trace                 []      
  Event IDs to EXCLUDE from trace                    []      
  Process IDs to Trace                               []      
  Program to Trace                                   []
  Propagate Tracing to                               [new processes and threads]     
  Trace MODE                                         [alternate]                     
  STOP when log file full?                           [yes]                           
  LOG FILE                                           [/tmp/trcraw]
  SAVE PREVIOUS log file?                            [no]         
  Omit PS/NM/LOCK HEADER to log file?                [yes]        
  Omit DATE-SYSTEM HEADER to log file?               [no]         
  Run in INTERACTIVE mode?                           [no]         
  Trace BUFFER SIZE in bytes                         [262144]     
  LOG FILE SIZE in bytes                             [104857600]        (changed to 100MB)                                                                                         #
  Buffer Allocation                                  [automatic]   

Next, move to

- "STOP when log file full?"

Decide whether you want to stop logging when the size limit has been reached (generally a good idea).
You can choose between "yes" and "no" via the F4 key.

Next, we move to 

- "EVENT GROUPS to trace":

When you have your cursor at this item, press F4. An impressive list of "counters" or trace-able events, is shown.
With the F7 key, you can toggle "Select event" to on/off.
Remember, the more event(groups) you choose, the more "intensive" the system will trace, and the faster
your tracefile will grown.

believe me: if you want to create a relatively simple trace for troubleshooting purposes, then the selection of
- fop - FILE OPENS (reserved)
- fact - FILE ACTIVITY (open,close,read,write) (reserved)

can be sufficient. Because many process failures are related to permission problems (on files and directories) and
not able to find files (like libaries, logfiles etc..).

So, in this we just choose those eventgroups, and press Enter.


FIG. 3.

                                         +--------------------------------------------------------------------------+
                                         �                          EVENT GROUPS to trace                           �
                                         �                                                                          �
                                         � Move cursor to desired item and press F7. Use arrow keys to scroll.      �
  EVENT GROUPS to trace                  �     ONE OR MORE items can be selected.                                   �
  ADDITIONAL event IDs to trace          � Press Enter AFTER making all selections.                                 �
  Event Groups to EXCLUDE from trace     �                                                                          �
  Event IDs to EXCLUDE from trace        � [TOP]                                                                    �
  Process IDs to Trace                   �   tidhk - Hooks needed to display thread name (reserved)                 �
  Program to Trace                       �   gka - GENERAL KERNEL ACTIVITY (files,execs,dispatches) (reserved)      �
  Propagate Tracing to                   �   gkasc - GENERAL KERNEL ACTIVITY + SYSTEM CALLS (reserved)              �
  Trace MODE                             �   fop - FILE OPENS (reserved)                                            �
  STOP when log file full?               �   fact - FILE ACTIVITY (open,close,read,write) (reserved)                �
  LOG FILE                               �   proc - EXECS, FORKS, EXITS (reserved)                                  �
  SAVE PREVIOUS log file?                �   procd - EXECS, FORKS, DISPATCHES (reserved)                            �
  Omit PS/NM/LOCK HEADER to log file?    �   filephys - FILE ACTIVITY (with physical file system) (reserved)        �
  Omit DATE-SYSTEM HEADER to log file?   �   filepfsv - FILE ACTIVITY (with physical file system and VMM) (reserved �
  Run in INTERACTIVE mode?               �   filepvl - FILE ACTIVITY (with physical file system, VMM, and LVM) (res �
  Trace BUFFER SIZE in bytes             �   filepvld - FILE ACTIVITY (w/ phys. file sys., VMM, LVM, and disk) (res �
  LOG FILE SIZE in bytes                 �   syscall - SYSTEM CALLS (reserved)                                      �
  Buffer Allocation                      �   inthands - FLIHS and SLIHS (reserved)                                  �
                                         �   lfs - LOGICAL FILE SYSTEM (deprecated, use vnops and vfsops) (reserved �
                                         �   pfs - PHYSICAL FILE SYSTEM (reserved)                                  �
                                         �   vmm - VIRTUAL MEMORY MANAGER (reserved)                                �
                                         �   vmmsvc - VMM SERVICES (reserved)                                       �
                                         �   lvm - LOGICAL VOLUME MANAGER (reserved)                                �
                                         �   lvmbb - LOGICAL VOLUME MANAGER BADBLOCK EVENTS (reserved)              �
                                         �   ipcgen - IPC: GENERAL (reserved)                                       �
                                         �   ipcsm - IPC: SHARED MEMORY (reserved)                                  �
                                         �   ipcmsgs - IPC: MESSAGES (reserved)                                     �
                                         �   ipcsem - IPC: SEMAPHORES (reserved)                                    �
                                         �   ipcmmap - IPC: MMAP (reserved)                                         �
                                         �   ipcmsem - IPC: MSEMAPHORES (reserved)                                  �
                                         �   errlg - ERROR LOGGING (reserved)                                       �
                                         �   parpdd - DEVICE DRIVER: PARALLEL PRINTER (reserved)                    �
                                         �   tapedd - DEVICE DRIVER: TAPE (reserved)                                �
                                         �   entdd - DEVICE DRIVER: ETHERNET - HIGH PERFORMANCE LAN ADAPTER (8ef5)  �
                                         �   tokdd - DEVICE DRIVER: TOKEN RING - HIGH PERFORMANCE ADAPTER (8fc8) (r �
                                         �   c3270dd - DEVICE DRIVER: C3270 (reserved)                              �
                                         �   fddd - DEVICE DRIVER: FLOPPY DISK (reserved)                           �
                                         �   scsidd - DEVICE DRIVER: SCSI (reserved)                                �
                                         �   sisadd - DEVICE DRIVER: PCI-X SCSI (reserved)                          �
                                         �   sissasdd - DEVICE DRIVER: SAS (reserved)                               �
                                         �   diskdd - DEVICE DRIVER: DISK (reserved)                                �
                                         �   mpqdd - DEVICE DRIVER: MULTI-PROTOCAL ADAPTERS (reserved)              �
                                         �   graphdd - DEVICE DRIVER: GRAPHICS (reserved)                           �
                                         �   ttydd - DEVICE DRIVER: pty (reserved)                                  �
                                         �   rs232dd - DEVICE DRIVER: rs232 (reserved)                              �
                                         �   64portdd - DEVICE DRIVER: 64 PORT ASYNC CONTROLLER (reserved)          �
                                         �   x25dd - DEVICE DRIVER: X25 (reserved)                                  �
                                         �   harierdd - DEVICE DRIVER: HARRIER2 (reserved)                          �
                                         �   scsitgdd - DEVICE DRIVER: SCSI Target Mode (reserved)                  �
                                         �   lpfkdd - DEVICE DRIVER: Dials/LPFKeys (reserved)                       �
                                         � [MORE...36]                                                              �
                                         �                                                                          �
                                         � F1=Help                 F2=Refresh              F3=Cancel                �
F1=Help                                F2� F7=Select               F8=Image                F10=Exit                 � F4=List
F5=Reset                               F6� Enter=Do                /=Find                  n=Find Next              � F8=Image


Now the trace wil start and you should see the file "/tmp/trcraw" grow in size.
You can see that with:

$ ls -al /tmp/trcraw

Also, try this command from the prompt:

$ ps -ef | grep trace

and you should see your trace running in the process list.

IMPORTANT:

Did you note that we did not select a PID (process ID) to trace on? So, actually, we trace on all processes,
"which do something" on the eventgroups we selected.

Ofcourse, if you know a PID on which you want to trace, you just fill that in the menu shown in Fig. 2.

If you select to trace on a PID (only), the your tracefile will ofcourse not grow that fast, as it would in our example.

But even in our example (where we trace on all processes on the selected eventgroups), we can see marvelous things.
Suppose Oracle and/or Websphere, or monitoring tools, (or you name it), are running. Later on, when you inspect the tracefile,
you can find very valuable information about what those processes do "under the hood".


Remember, we are creating a raw trace file here. We still need to do one extra step, after stopping the trace.


>>> Stop the trace and create a readable file:
----------------------------------------------


Ok, if you have left smitty, start it up again.

$ smitty trace

In the menu that follows, just select " STOP Trace".

  START Trace
  STOP Trace
  Generate a Trace Report
  Manage Trace
  Manage Event Groups

and the trace facility will stop tracing.

Next, we want to have a readable file, which we can view (use cat, pg, more, grep etc..).
In smitty, there are options available to create a trace report, but I think it's more instructive
to do this from the prompt. Here we go:

We have a raw trace in the file /tmp/trcraw
Lets create a readable file from the raw file, and call it "/tmp/trctxt".

You can do that with for example:

$ trcrpt -O pid=on,exec=on trcraw > trcnew


Please be aware that the textfile is typically 2 or 3 times larger than the raw file. So, always be aware on available
space in the filesystem where you want to create the file.

Now you can open the file, or grep it on an identifier etc..


3.2. A few examples of using the truss command:
===============================================


With "truss" you can trace a command, or trace an existing process. It shows all system calls (or a selection) made, with their arguments
and the return code. System call parameters are displayed symbolically. 
It also prints information about all signals received by a process.

You can use truss in the following way:

# truss [options] command

You must understand that in this way, you actually start the command, and let truss attach, and then it will 
display the calls to the system and external libaries.

# truss [options] -p PID

In this case, you 'attach' to an existing process.

There are many parameters (or options) you can use, but a few of the most important options are:

-o truss.log		# So here you save the truss trace to the logfile "truss.log"
-t [!] Syscall		# If you leave out -t, you trace on all syscalls. Indeed, the default is "-tall".
                        # If you use -t, you can also give a comma seperated list on the calls you want to
                        # trace on, like "-t open,statx,close", where you will only trace on open, close, statx.
                        # You can also excluse certain syscalls, by using "-t ! syscall".
-u [!] [LibraryName]    # Here you can give a comma seperated list on which you want to trace the calls to.
                        # using -u ! LibraryName, you can exclude a certain library from the trace.


let's take a look at a few simple examples:

Example 1:
----------

Suppose in /opt/app/cc we have a program called "test".
Somebody from your group tries to run it, but it immediately dies, and you don't have a clue to what caused it.
It was supposed to present colleque a menuscreen to work with, but that never happened.

Ofcourse, any well behaved program should give a messsage on the screen, or write
status information in a logile.
But suppose we are dealing with a program without those nice features.

$ ./test

And it dies, while we were expecting a menuscreen to work with.
Why did it die?


Let's try truss:


$ truss ./test
execve("test", 0xFFBFFDEC, 0xFFBFFDF4)  argc = 1
getcwd("/home/albert", 1015)               = 0
stat("/home/albert/test", 0xFFBFFBC8)   = 0
open("/var/ld/ld.config", O_RDONLY)             Err#2 ENOENT
stat("/opt/csw/lib/libc.so.1", 0xFFBFF6F8)      Err#2 ENOENT
stat("/lib/libc.so.1", 0xFFBFF6F8)              = 0
resolvepath("/lib/libc.so.1", "/lib/libc.so.1", 1023) = 14
open("/lib/libc.so.1", O_RDONLY)                = 3
memcntl(0xFF280000, 139692, MC_ADVISE, MADV_WILLNEED, 0, 0) = 0
close(3)                                        = 0
getcontext(0xFFBFF8C0)
getrlimit(RLIMIT_STACK, 0xFFBFF8A0)             = 0
getpid()                                        = 7895 [7894]
setustack(0xFF3A2088)
open("/opt/app/etc/cc.conf", O_RDONLY)          Err#13 EACCES [file_dac_read]     <--- !!!
ioctl(1, TCGETA, 0xFFBFEF14)                    = 0


Now note the line that I have marked with "!!!". Here you see Err#13 EACCES.

From the lists in Section 1, we can find that Error 13 corresponds to "Permission denied".

So, suppose that you go to "/opt/app/etc/" and check the permissions on the file "cc.conf", you would find
that the permission on that file should be altered.
After using the following command: 
$ chmod g+r cc.conf                    # here we give the group read permission on "cc.conf"

Now the program runs without errors. Probably this was a program that first wanted to read configuration information
from "/opt/app/etc/cc.conf", and if that fails, the program would just terminate without any message.
Ofcourse, that program could have been designed much better. 
But we have seen an example where truss was of use.


Example 2:
----------

Let's run the program "lsps -s" (show pagingspace) from my home dir, and let's truss it, to see what systemcalls it makes:

albert@sharky:/home/albert $ truss lsps -s

execve("/usr/sbin/lsps", 0x2FF22A4C, 0x2000EB28)  argc: 2
__loadx(0x03000000, 0x2FF22870, 0x000000F0, 0x10000000, 0x20000E14) = 0x00000000
__loadx(0x0A040000, 0xD0572CD4, 0x0000000A, 0x00000000, 0x00000000) = 0x00000000
sbrk(0x00000000)                                = 0x20004570
vmgetinfo(0x2FF21C30, 7, 16)                    = 0
sbrk(0x00000000)                                = 0x20004570
__libc_sbrk(0x00000000)                         = 0x20004570
getuidx(4)                                      = 6318
getuidx(2)                                      = 6318
getuidx(1)                                      = 6318
getgidx(4)                                      = 1105
getgidx(2)                                      = 1105
getgidx(1)                                      = 1105
__loadx(0x01000080, 0x2FF216E0, 0x00000960, 0x2FF22160, 0x00000000) = 0xD0149130
__loadx(0x0A040000, 0xD0572CA0, 0x2FF22FFC, 0x0000D0B2, 0x00000000) = 0x00000000
__loadx(0x01000180, 0x2FF216E0, 0x00000960, 0xF028CC4C, 0xF028CB7C) = 0xF03358D8
__loadx(0x0A040000, 0xD0572CA0, 0x2FF22FFC, 0x0000D0B2, 0x00000000) = 0x00000000
__loadx(0x07080000, 0xF028CC1C, 0xFFFFFFFF, 0xF03358D8, 0x00000000) = 0xF0336808
__loadx(0x07080000, 0xF028CB5C, 0xFFFFFFFF, 0xF03358D8, 0x00000000) = 0xF0336814
__loadx(0x07080000, 0xF028CC2C, 0xFFFFFFFF, 0xF03358D8, 0x00000000) = 0xF0336844
__loadx(0x07080000, 0xF028CB6C, 0xFFFFFFFF, 0xF03358D8, 0x00000000) = 0xF0336850
__loadx(0x07080000, 0xF028CBEC, 0xFFFFFFFF, 0xF03358D8, 0x00000000) = 0xF0336820
__loadx(0x07080000, 0xF028CB8C, 0xFFFFFFFF, 0xF03358D8, 0x00000000) = 0xF0336838
__loadx(0x07080000, 0xF028CBFC, 0xFFFFFFFF, 0xF03358D8, 0x00000000) = 0xF033685C
__loadx(0x07080000, 0xF028CC0C, 0xFFFFFFFF, 0xF03358D8, 0x00000000) = 0xF033688C
__loadx(0x07080000, 0xF028CB9C, 0xFFFFFFFF, 0xF03358D8, 0x00000000) = 0xF0336874
__loadx(0x07080000, 0xF028CBAC, 0xFFFFFFFF, 0xF03358D8, 0x00000000) = 0xF0336910
getuidx(4)                                      = 6318
getuidx(2)                                      = 6318
getuidx(1)                                      = 6318
getgidx(4)                                      = 1105
getgidx(2)                                      = 1105
getgidx(1)                                      = 1105
__loadx(0x01000080, 0x2FF216E0, 0x00000960, 0x2FF22160, 0x00000000) = 0xD0149130
getuidx(4)                                      = 6318
getuidx(2)                                      = 6318
getuidx(1)                                      = 6318
getgidx(4)                                      = 1105
getgidx(2)                                      = 1105
getgidx(1)                                      = 1105
__loadx(0x01000080, 0x2FF216E0, 0x00000960, 0x2FF22160, 0x00000000) = 0xD0149130
getuidx(4)                                      = 6318
getuidx(2)                                      = 6318
getuidx(1)                                      = 6318
getgidx(4)                                      = 1105
getgidx(2)                                      = 1105
getgidx(1)                                      = 1105
__loadx(0x01000080, 0x2FF216E0, 0x00000960, 0x2FF22160, 0x00000000) = 0xD0149130
getuidx(4)                                      = 6318
getuidx(2)                                      = 6318
getuidx(1)                                      = 6318
getgidx(4)                                      = 1105
getgidx(2)                                      = 1105
getgidx(1)                                      = 1105
__loadx(0x01000080, 0x2FF216E0, 0x00000960, 0x2FF22160, 0x00000000) = 0xD0149130
getuidx(4)                                      = 6318
getuidx(2)                                      = 6318
getuidx(1)                                      = 6318
getgidx(4)                                      = 1105
getgidx(2)                                      = 1105
getgidx(1)                                      = 1105
__loadx(0x01000080, 0x2FF216E0, 0x00000960, 0x2FF22160, 0x00000000) = 0xD0149130
access("/usr/lib/nls/msg/en_US/cmdps.cat", 0)   = 0
_getpid()                                       = 483490
psdanger(0)                                     = 524288
psdanger(-1)                                    = 521468
open("/usr/lib/nls/msg/en_US/cmdps.cat", O_RDONLY) = 3
kioctl(3, 22528, 0x00000000, 0x00000000)        Err#25 ENOTTY
kfcntl(3, F_SETFD, 0x00000001)                  = 0
kioctl(3, 22528, 0x00000000, 0x00000000)        Err#25 ENOTTY
kread(3, "\0\001 �\001\001 I S O 8".., 4096)    = 4096
lseek(3, 0, 1)                                  = 4096
lseek(3, 0, 1)                                  = 4096
lseek(3, 0, 1)                                  = 4096
_getpid()                                       = 483490
lseek(3, 0, 1)                                  = 4096
kioctl(1, 22528, 0x00000000, 0x00000000)        = 0
Total Paging Space   Percent Used
kwrite(1, " T o t a l   P a g i n g".., 34)     = 34
      2048MB               1%
kwrite(1, "             2 0 4 8 M B".., 30)     = 30
__loadx(0x04000000, 0x2FF22080, 0x00000800, 0x0000D0B2, 0x00000000) = 0x00000000
kfcntl(1, F_GETFL, 0x00000001)                  = 67110914
kfcntl(2, F_GETFL, 0xF02DF418)                  = 67110914
_exit(0)


There is a lot of output on the screen. I entered "lsps -s", and truss will watch what syscalls are done
and shows that on your screen.
In fact, many of the first lines deal with "getuidx" and that kind of calls. The system would like to know
who (and in what groups he/she is) issued the command.
You can ignore the output, because it's not that interresting. I only "published" it here, to give you an
idea on how much output those tracing commands (like truss) generates.


If I just want to store that information to a logfile, for example "truss.log", I should use the following command:

albert@sharky:/home/albert $ truss -o truss.log lsps -s


3.3. Other possible usefull AIX commands:
=========================================

Although not directly related to tracing, the following limited list of commands might help in creating a better view of
your system and processes. I am sure you are familiair with them, but let's list them anyway::


-- Show your AIX version:

# oslevel -r

-- Show the jobs that are scheduled (in the account you use) from cron:

# crontab -l

-- What are the standard mounted filesystems?: That's defined in "/etc/filesystems"

# cat /etc/filesystems | more

-- Which processes are using a certain filesystem?

# fuser -c /filesystem     # We mean the "mountpoint", like for example /appl/oracle

-- Show memory usage of a process:

# procmap pid              # pid is the process-id, as visible in the command "ps -ef"   

-- Show the open files that a process uses:

# pfiles pid               # also take a look at the "lsof" command: man lsof            

-- Show system memory:

# bootinfo -r
# lsattr -E -l mem0
# lsattr -E -l sys0 -a realmem
# svmon -G
# vmstat -v
# vmo -L                # ( lots of output )
# svmon -U -g -t 10     # ( top 10 users paging space)

-- Swap usage:

# lsps -s                 # more than 60%-70% used? It get's really scary. More than 75% used? Oh boy!
# pstat -s

-- cpu info:

# lparstat (-i)       
# prtconf | grep proc
# pmcycles -m
# lscfg | grep proc
# pstat -S

-- ulimit:

Sometimes, when a process runs under some ones credentials, and it fails for no immediate reason, it might be
worth checking the "ulimit" of that account (like max filesize, max open files, number of files etc..)
use it under that account as:

# ulimit -a

-- Show process tree of parent and children:

# proctree pid        # Tip: take a look at the "proc tools" on AIX               


-- Show the system error report / error log:

# errpt                           # or "errpt | more" 
# errpt -aj <ERRID> | more        # view details of an error record. ERRID is the 1st identifier in such a record.

-- Determine the type of a file (e.g. is it ascii, or another type of file?)

# file file_name          # (yes..., the command is really "file")

-- Show free/used space of the filesystems:

# df -m         # m in MB; k in KB; g in GB

If there are many filesystems, you might want to see just the top 5 that have the lowest on free space:

# df -k |awk '{print $4,$7}' |grep -v "Filesystem" | sort -n | tail -5


-- How to become another user, or possibly root:

# su - accountname       # (switch to that accountname like "su - albert")
# su -                   # (switch to root)
                         # if the sudo utility is implemented, you might try the command "sudo -l" to see what you might execute.

-- Carefull!! How to kill a process "the hard way"?

# kill -9 PID              # carefull, don't kill the wrong one; not recommended unless you don't have a choice.

-- Carefull!! How to kill all your processes "the hard way", all at once?

# kill -9 -1               # be very carefull; not recommended unless you don't have a choice.
# killall                  # be very carefull; not recommended unless you don't have a choice.


-- Show your uid (userid) and gid (groupid):

# id

-- refresh inetd after modifying "/etc/inetd.conf":

# refresh -s inetd

-- Show the last logged on users + date (history since last restart):

# last | more

-- To show the init runlevel:

# who -r 

-- Show uptime of system plus average load (15 minutes):

# uptime

-- Clean memory with ipcrm (be carefull):

# ipcrm -m 50855977      # (clear memory segment, identfied by example id 50855977; Be carefull)
# ipcrm -s 2228248       # (remove semaphore, identfied by example id 2228248; Be carefull) 
# ipcrm -q 5111883       # (remove queue, identfied by example id 5111883; Be carefull) )
                         # (see man pages ipcrm)

-- To clear out unused system modules (currently unused modules in kernel and library memory):

# slibclean


============================================================================
4. Solaris:
============================================================================

A similar "story" will be put here, but then ofcourse for Solaris.

 
============================================================================
5. Other:
============================================================================


5.1 Some trivial remarks:
=========================


Now for some trivial remarks...... 

- kernel parameters

If you have problems installing a program, or if fails to run properly, are you sure all
required kernel parameters have been set? 

- Environment variables

If you have problems installing a program, or if fails to run properly, are you sure all
required Environment variables have been set? 
Many "large" programs really have an impressive list of variables you need to set in place
before it will run properly.

- Dependencies on other stuff.

Most (commercial) programs depend heavily on installed support programs or tools, like perl, java,  etc..
They may even have very strict requirements on versions of those support programs.

- Cluttered memory (ipc identifiers, semaphores, shared memory)

If you have started an application, and terminated it roughly, it's possible that
"stuff" still remains in memory. 
In such a case, it's possible that your app will not be able to restart.
You need to use a tool like "ipcrm" to clean memory, or
you might even consider to reboot the system.


#############################################################################################
#############################################################################################
#############################################################################################


=====================================================================================
Secton 21. How to undelete a file in UNIX.
=====================================================================================


IMPORTANT NOTICE:

>>>  This document contains some selected theads from the Internet.           <<<
>>>  It just contains some "pointers" in case you have a file or fs problem.  <<<
>>>  Do NOT regard the information as being "directly usable" in any sense!   <<<
>>>  Its only ment as a possible pointer, or hint,                            <<<
>>>  on which you may investigate further.                                    <<<
>>>  Also, it's vital to understand that on the subject of "undelete",        <<<
>>>  this document ONLY contains some pointers on that subject.               <<<
>>>  It does not pretent to be any more than that.                            <<<


Contents:
---------

1. Some Filesystem errors
2. How to delete "weird" files
3. Some possible hints on howto "undelete" files (if no backups are available)
4. Some other stuff


For some pointers on the subject of "undelete", you might want to jump to
section 3 right away.


###############################################################
1. Some Filesystem errors:
###############################################################


----------------------------------------------------------------------------------------
Note 1.1         : Possible way how to save files from A corrupt directory
Works on OS      : all unix
probable message : ksh: Invalid file system control data detected:
----------------------------------------------------------------------------------------


>>>> Question:

Anybody recognize this? This directory seems to be missing the ".", I can't 
umount, can't remove the directory, can't copy a good directory over it, 
etc. 

spiderman# cd probes 
spiderman# pwd 
/opt/diagnostics/probes 
spiderman# ls -la 
ls: 0653-341 The file . does not exist. 
spiderman# cd .. 
spiderman# ls -la probes 
ls: probes: Invalid file system control data detected. 
total 0 
spiderman# 

spiderman# fuser /opt 
/opt: 
spiderman# umount /opt 
umount: 0506-349 Cannot unmount /dev/hd10opt: The requested resource is 
busy. 
spiderman# umount /dev/hd10opt 
umount: 0506-349 Cannot unmount /dev/hd10opt: The requested resource is 
busy. 

spiderman# fsck /opt 

** Checking /dev/hd10opt (/opt) MOUNTED FILE SYSTEM; WRITING SUPPRESSED; 
Checking a mounted filesystem does not produce dependable results. 
** Phase 1 - Check Blocks and Sizes 
** Phase 2 - Check Pathnames 
DIRECTORY CORRUPTED (NOT FIXED) 
DIRECTORY CORRUPTED (NOT FIXED) 
Directory /diagnostics/probes, '.' entry is missing. (NOT FIXED) 
Directory /diagnostics/probes, '..' entry is missing. (NOT FIXED) 
** Phase 3 - Check Connectivity 
** Phase 4 - Check Reference Counts 
link count directory I@98 owner=bin mode$0755 
sizeQ2 mtime=May 13 14:54 2005 
count 3 should be 2 (NOT ADJUSTED) 
link count directory I@99 owner=bin mode$0755 
size24 mtime=Jan 10 13:45 2005 
count 2 should be 1 (NOT ADJUSTED) 
Unreferenced file IA06 owner=bin mode0555 
sizee56 mtime=Jul 07 14:25 2004 (NOT RECONNECTED) 
Unreferenced file IA06 (NOT CLEARED) 
Unreferenced file IA07 owner=bin mode0555 
size)12 mtime=Jul 07 14:25 2004 (NOT RECONNECTED) 
etc....


>>>> Answer:

Some good news here. Yes, your directory is hosed, but the important 
things is that all a directory is a repository for storing inode numbers 
and associated (human readable) file names. Since fsck is so nicely 
generating all of those now currently inaccessible inode numbers, a find 
command can be used to move them into a new directory. Once the old 
directory is empty, you can (hopefully) rm -r it. 

Here's what you need to do. 

a) Get all the inode numbers generated from your fsck 
b) put them into a variable (e.g. lost_inodes="4099 4106....etc." 
c) Make a target directory for the lost inodes to be moved into: 
mkdir /tmp/recovery 
d) cd into your problem File System: 
cd /opt 
d) Run a loop using find: 

for i in ${lost_inodes} 
do 
find . -inum ${i} mv * /tmp/recovery \; 
echo "Moved and recovered inode # ${i}" 
done 

That should do it. Let me know if it works ok! BTW, the new "file 
name" should be the inode number of the file. You will have to rename 
the files as needed. 


Note that this mehod saved the files from the corrupt directory.


----------------------------------------------------------------------------------------
Note 1.2         : A superblock issue
Works on OS      : all unix
probable message : probably fsck gives you a message
disks            : local disks, most likely not SAN
----------------------------------------------------------------------------------------


>>>> Method 1:

Use this command in case the superblock is corrupted. This will restore the BACKUP COPY of the superblock 
to the CURRENT copy.

# dd count=1 bs=4k skip=31 seek=1 if=/dev/hd4 of=/dev/hd4    (hd4 is an example)

# fsck /dev/hd4 2>&1 | tee /tmp/fsck.errors

OR

>>>>> Method 2:

If you have a dirty superblock you might try to do �fsck�. If this does not work try the following (This procedure does not promise 100% success).
(The following example relats to a bad filesystem in slv4.0)

1. Copy the original Superblock into a file (calld sd0 in /tmp - places can be changed):
dd if=/dev/rslv4.0 of=/tmp/sb0 bs=4k count=1 skip=1

Note: if=Input File, of=Output file, bs=Block Size.

2. Copy the backup Superblock into a file (calld sd1 in /tmp - places can be changed):
dd if=/dev/rslv4.0 of=/tmp/sb1 bs=4k count=1 skip=31

3. Copy the Backup Superblock file over the original Superblock:
dd if=/tmp/sb1 of=/dev/rslv4.0 bs=4k seek=1

4. Do �fsck� again on this filesystem

Note:
If you want to restore the original Superblock, do:
dd if=/tmp/sb0 of=/dev/rslv4.0 bs=4k seek=1


----------------------------------------------------------------------------------------
Note 1.3         : A superblock issue
Works on OS      : AIX
probable message : probably fsck gives you a message
disks            : local disks, most likely not SAN
----------------------------------------------------------------------------------------


>>>> Method 1:

-- Fixing a corrupted magic number in the file system superblock.

If the superblock of a file system is damaged, the file system cannot be accessed. You can fix a 
corrupted magic number in the file system superblock.

Most damage to the superblock cannot be repaired. The following procedure describes how to repair a superblock 
in a JFS file system when the problem is caused by a corrupted magic number. If the primary superblock is corrupted 
in a JFS2 file system, use the fsck command to automatically copy the secondary superblock and repair the primary 
superblock.

In the following scenario, assume /home/myfs is a JFS file system on the physical volume /dev/lv02.

The information in this how-to was tested using AIX� 5.2. If you are using a different version or level of AIX, 
the results you obtain might vary significantly. 

1. Unmount the /home/myfs file system, which you suspect might be damaged, using the following command: 

# umount /home/myfs

2. To confirm damage to the file system, run the fsck command against the file system. For example: 

# fsck -p /dev/lv02

If the problem is damage to the superblock, the fsck command returns one of the following messages: 

fsck: Not an AIXV5 file system
OR 
Not a recognized filesystem type

3. With root authority, use the od command to display the superblock for the file system, 
as shown in the following example: 

# od -x -N 64 /dev/lv02 +0x1000

Where the -x flag displays output in hexadecimal format and the -N flag instructs the system to format 
no more than 64 input bytes from the offset parameter (+), which specifies the point in the file where 
the file output begins. The following is an example output: 

0001000  1234 0234 0000 0000 0000 4000 0000 000a
0001010  0001 8000 1000 0000 2f6c 7633 0000 6c76
0001020  3300 0000 000a 0003 0100 0000 2f28 0383
0001030  0000 0001 0000 0200 0000 2000 0000 0000
0001040

In the preceding output, note the corrupted magic value at 0x1000 (1234 0234). If all defaults were taken 
when the file system was created, the magic number should be 0x43218765. If any defaults were overridden, 
the magic number should be 0x65872143. 

4. Use the od command to check the secondary superblock for a correct magic number. An example command 
and its output follows: 

# od -x -N 64 /dev/lv02 +0x1f000

001f000  6587 2143 0000 0000 0000 4000 0000 000a
001f010  0001 8000 1000 0000 2f6c 7633 0000 6c76
001f020  3300 0000 000a 0003 0100 0000 2f28 0383
001f030  0000 0001 0000 0200 0000 2000 0000 0000
001f040

Note the correct magic value at 0x1f000. 

5. Copy the secondary superblock to the primary superblock. An example command and output follows: 

# dd count=1 bs=4k skip=31 seek=1 if=/dev/lv02 of=/dev/lv02

dd: 1+0 records in.
dd: 1+0 records out.

Use the fsck command to clean up inconsistent files caused by using the secondary superblock. For example: 

# fsck /dev/lv02 2>&1 | tee /tmp/fsck.errs

For more information

The fsck and od command descriptions in AIX 5L Version 5.3 Commands Reference, Volume 4 
AIX Logical Volume Manager from A to Z: Introduction and Concepts, an IBM Redbook 
AIX Logical Volume Manager from A to Z: Troubleshooting and Commands, an IBM Redbook 
"Boot Problems" in Problem Solving and Troubleshooting in AIX 5L, an IBM Redbook 


OR

>>>>> Method 2:

If you experience a dirty superblock, which causes a filesystem to be 
not mountable, you can use backup copy of superblock to copy it over the 
corrupted one. 


With little unix experience it can be a tough task, because the steps 
required are as follows: 


- boot from bootable media (install cd/tape, mksysb tape) 
- access rootvg before mounting fs 
- fsck -y on corrupted fs's 
- logform on logdevice 
- dd count=1 bs=4k skip=31 seek=1 if=/dev/<corrupted_lv> of=/dev/<corrupted_lv> 


----------------------------------------------------------------------------------------
Note 1.3         : A superblock issue
Works on OS      : Solaris
probable message : probably fsck gives you a message
disks            : local disks, most likely not SAN
----------------------------------------------------------------------------------------

>>>> Method 1:

Boot from OK prompt to single user mode, for example from CDROM

OK boot cdrom -sw
 

Attempt to fsck(1M) boot disk. This could fail with a super block error. 

# fsck /dev/rdsk/device

Find the locations of alternate super blocks. BE SURE TO USE AN UPPERCASE -N. For example: 

# newfs -N /dev/rdsk/c0t0d0s0
/dev/rdsk/c0t0d0s0:     2048960 sectors in 1348 cylinders of 19 tracks, 
80 sectors 1000.5MB in 85 cyl groups (16 c/g, 11.88MB/g, 5696 i/g)
super-block backups (for fsck -F ufs -o b=#) at:
32, 24432, 48832, 73232, 97632, 122032, 146432, 170832, 195232, 219632,
244032, 268432, 292832, 317232, 341632, 366032, 390432, 414832, 439232,
463632, 488032, 512432, 536832, 561232, 585632, 610032, 634432, 658832,
683232, 707632, 732032, 756432, 778272, 802672, 827072, 851472, 875872,
900272, 924672, 949072, 973472, 997872, 1022272, 1290672, ... 


Using an alternate super block, run fsck(1M) on the disk. You might have to try more than one alternate super block 
to make this to work. Pick a couple from the beginning, the middle, and the end. 

# fsck -o b=<altblk> /dev/rdsk/c0t0d0s0 


The boot block is probably bad too. Restore it while you are booted from the CD-ROM. 

# /usr/sbin/installboot /usr/platform/architecture/lib/fs/ufs/bootblk 
/dev/rdsk/c0t0d0s0 


Reboot the operating environment. 

# reboot 

OR:

>>>>> Method 2:

#newfs -N /dev/rdsk/<device>  (like c0t0d0s7)

it will generate the identical superblock.

then run.......

#fsck -o b=535952 /dev/rdsk/<device> (like c0t0d0s7)


OR:

>>>>>>> Method 3:

Restore a Bad Superblock (Solaris 8,9 and 10)
February 25, 2008 by sun4u 


Become superuser or assume an equivalent role. 
Determine whether the bad superblock is in the root (/), /usr, or /var file system and select one of
the following:

If the bad superblock is in either the root (/), /usr, or /var file system, 

then boot from the network or a locally connected CD.

From a locally-connected CD, use the following command:
ok boot cdrom -s

From the network where a boot or install server is already setup, use the following command:
ok boot net -s

If the bad superblock is not in either the root (/), /usr, /var file system, change to a directory
outside the damaged file system and unmount the file system.

# umount /mount-point

Caution � Be sure to use the newfs -N in the next step. If you omit the -N option, you will destroy
all of the data in the file system and replace it with an empty file system.

Display the superblock values by using the newfs -N command. 
# newfs -N /dev/rdsk/device-name

Provide an alternate superblock by using the fsck command. 
# fsck-F ufs -o b=block-number /dev/rdsk/device-name

The fsck command uses the alternate superblock you specify to restore the primary superblock. You
can always try 32 as an alternate block. Or, use any of the alternate blocks shown by the newfs -N
command.

 
Restoring a Bad Superblock (Solaris 8, 9, and 10 Releases)
The following example shows how to restore the superblock copy 5264.

# newfs -N /dev/rdsk/c0t3d0s7
/dev/rdsk/c0t3d0s7: 163944 sectors in 506 cylinders of 9 tracks, 36 sectors
83.9MB in 32 cyl groups (16 c/g, 2.65MB/g, 1216 i/g)
super-block backups (for fsck -b #) at:
32, 5264, 10496, 15728, 20960, 26192, 31424, 36656, 41888,
47120, 52352, 57584, 62816, 68048, 73280, 78512, 82976, 88208,
93440, 98672, 103904, 109136, 114368, 119600, 124832, 130064, 135296,
140528, 145760, 150992, 156224, 161456,

# fsck-F ufs -o b=5264 /dev/rdsk/c0t3d0s7
Alternate superblock location: 5264.
** /dev/rdsk/c0t3d0s7
** Last Mounted on
** Phase 1- Check Blocks and Sizes
** Phase 2 - Check Pathnames
** Phase 3 - Check Connectivity
** Phase 4 - Check Reference Counts
** Phase 5 - Check Cyl groups
36 files, 867 used, 75712 free (16 frags, 9462 blocks, 0.0% fragmentation)
***** FILE SYSTEM WAS MODIFIED *****
#


----------------------------------------------------------------------------------------
Note 1.4         : A superblock issue
Works on OS      : Linux ext2 filesystem
probable message : probably fsck gives you a message
disks            : local disks, most likely not SAN
----------------------------------------------------------------------------------------


DAMAGED SUPERBLOCK


If a filesystem check fails and returns the error message �Damaged Superblock� you're lost . . . . . . . 
or not ?
Well, not really, the damaged �superblock� can be restored from a backup. There are several backups stored 
on the harddisk. But let me first have a go at explaining what a �superblock�is.

A superblock is located at position 0 of every partition, contains vital information about the filesystem 
and is needed at a filesystem check.

The information stored in the superblock are about what sort of fiesystem is used, the I-Node counts, 
block counts, free blocks and I-Nodes, the numer of times the filesystem was mounted, date of the 
last filesystem check and the first I-Node where / is located.

Thus, a damaged superblock means that the filesystem check will fail. 

Our luck is that there are backups of the superblock located on several positions and we can restore 
them with a simple command.

The usual ( and only ) positions are: 8193, 32768, 98304, 163840, 229376 and 294912. ( 8193 in many cases 
only on older systems, 32768 is the most current position for the first backup )
You can check this out and have a lot more info about a particular partition you have on your HD by:

  
# dumpe2fs /dev/hda5 

You will see that the primary superblock is located at position 0, and the first backup on position 32768.
O.K. let�s get serious now, suppose you get a �Damaged Superblock� error message at filesystem check 
( after a power failure ) and you get a root-prompt in a recovery console, then you give the command:


# e2fsck -b 32768 /dev/hda5 


don�t try this on a mounted filesystem

It will then check the filesystem with the information stored in that backup superblock and if the check 
was successful it will restore the backup to position 0.
Now imagine the backup at position 32768 was damaged too . . . then you just try again with the backup 
stored at position 98304, and 163840, and 229376 etc. etc. until you find an undamaged backup  
( there are five backups so if at least one of those five is okay it�s bingo ! )

So next time don�t panic . . just get the paper where you printed out this Tip and give the magic command
 
# e2fsck -b 32768 /dev/hda5  


----------------------------------------------------------------------------------------
Note 1.5         : Root filesystem full or nearly full
Works on OS      : most unixes
----------------------------------------------------------------------------------------


Always take care that the "/" root filesystem does not get near 100% full.

Potential problems

1. Some systems will not boot anymore in the normal multi-user way
2. On many systems new logons are not possible anymore
3. Some apps write or create unamed pipes "somewhere" in the root fs: they may stall or even crash
   
Remarks on 2:

This is caused by a full file system and the system has no space
to write its utmpx (login info) entry.

To get around this condition the system must be booted up
into single user mode, or you may need to boot from CDROM or from network etc..
Then you might be able to clear logfiles under /var/..
Or just increase the / filesystem with some additional space.


###############################################################
2. How to delete "weird" files
############################################################### 


----------------------------------------------------------------------------------------
Note 2.1         : You cannot rm a file in the "normal" way, or
                   How to Delete or Remove Files With Inode Number
Works on OS      : all unix
----------------------------------------------------------------------------------------

>>>>>> Question: 

How can I remove a bizarre, irremovable file from a directory? I've tried every way of using 
/bin/rm and nothing works." 


>>>>>> Answer: 

In some rare cases a strangely-named file will show itself in your directory and appear to be 
un-removable with the rm command. Here is will the use of ls -li and find with its -inum [inode] 
primary does the job. 
Let's say that ls -l shows your irremovable as 

-rw-------  1 smith  smith  0 Feb  1 09:22 ?*?*P

Type: 

ls -li

to get the index node, or inode. 

153805 -rw-------  1 smith  smith  0 Feb  1 09:22 ?*?^P

The inode for this file is 153805. Use find -inum [inode] to make sure that the file is correctly identified. 


%  find -inum 153805 -print
./?*?*P

Here, we see that it is. Then used the -exec functionality to do the remove. . 
  
% find . -inum 153805 -print -exec /bin/rm {} \;

Note that if this strangely named file were not of zero-length, it might contain accidentally misplaced 
and wanted data. Then you might want to determine what kind of data the file contains and move the file 
to some temporary directory for further investigation, for example: 

% find . -inum 153805 -print -exec /bin/mv {} unknown.file \;

Will rename the file to unknown.file, so you can easily inspect it. 

Another way to remove strangely-named files is to use "ls -q" or "cat -v" to show the special characters, 
and then use shell's globbing mechanism to delete the file. 

$ ls
-????*'?
$ ls | cat -v
-^B^C?^?*'

$ rm ./-'^B'*           -- achieved by typing control-V control-B
$ ls


the argument given to rm is a judicious selection of glob wildcards (*'s) and sufficient control characters 
to uniquely identify the file. The leading "./" is useful when the file begins with a hyphen. 
These binary name files are caused by: 

* accidental cut-and-pastes to shell prompts - especially when you paste something of the form: "junk > garbage" 
because the shell creates the file "garbage" before trying to execute the command "junk" 

* filesystem corruption (in which case touching the filesystem any more can really stuff things up) 
If you discover that you have two files of the same name, one of the files probably has a bizarre 
(and unprintable) character in its name. Most probably, this unprintable character is a backspace. 

For example: 


    $ ls
    filename filename
    $ ls -q
    filename fl?ilename
    $ ls | cat -v
    filename
    fl^Hilename


----------------------------------------------------------------------------------------
Note 2.2         : You cannot rm a file in the "normal" way, or
                   How to Delete or Remove Files With Inode Number
Works on OS      : all unix
Same problem as noted in note 2.1.
----------------------------------------------------------------------------------------


An inode identifies the file and its attributes such as file size, owner, and so on. A unique inode number 
within the file system identifies each inode. But, why to delete file by an inode number? 
Sure, you can use rm command to delete file. Sometime accidentally you creates filename with control characters 
or characters which are unable to be input on a keyboard or special character such as ?, * ^ etc. 
Removing such special character filenames can be problem. Use following method to delete a file with strange characters in its name:

Please note that the procedure outlined below works with Solaris, FreeBSD, Linux, or any other Unixish oses out there:


Find out file inode 
First find out file inode number with any one of the following command:

stat {file-name}

OR 

ls -il {file-name}

Use find command to remove file:
Use find command as follows to find and remove a file:

find . -inum [inode-number] -exec rm -i {} \;

When prompted for confirmation, press Y to confirm removal of the file.

Let us try to delete file using inode number.

(a) Create a hard to delete file name:
$ cd /tmp
$ touch "\+Xy \+\8"
$ ls 
(b) Try to remove this file with rm command:
$ rm \+Xy \+\8

(c) Remove file by an inode number, but first find out the file inode number:
$ ls -ilOutput: 

781956 drwx------  3 viv viv 4096 2006-01-27 15:05 gconfd-viv
781964 drwx------  2 viv viv 4096 2006-01-27 15:05 keyring-pKracm
782049 srwxr-xr-x  1 viv viv    0 2006-01-27 15:05 mapping-viv
781939 drwx------  2 viv viv 4096 2006-01-27 15:31 orbit-viv
781922 drwx------  2 viv viv 4096 2006-01-27 15:05 ssh-cnaOtj4013
781882 drwx------  2 viv viv 4096 2006-01-27 15:05 ssh-SsCkUW4013
782263 -rw-r--r--  1 viv viv    0 2006-01-27 15:49 \+Xy \+\8Note: 782263 is inode number.

(d) Use find command to delete file by inode:
Find and remove file using find command, type the command as follows:
$ find . -inum 782263 -exec rm -i {} \;
Note you can also use add \ character before special character in filename to remove it directly so the command would be:

$ rm "\+Xy \+\8"

If you have file like name like name "2005/12/31" then no UNIX or Linux command can delete this file by name. 
Only method to delete such file is delete file by an inode number. Linux or UNIX never allows creating filename like 2005/12/31 
but if you are using NFS from MAC OS or Windows then it is possible to create a such file.

OR

read this thead:


Become superuser or assume an equivalent role. 
Determine whether the bad superblock is in the root (/), /usr, or /var file system and select one of
the following:

If the bad superblock is in either the root (/), /usr, or /var file system, 

then boot from the network or a locally connected CD.

From a locally-connected CD, use the following command:
ok boot cdrom -s

From the network where a boot or install server is already setup, use the following command:
ok boot net -s

If the bad superblock is not in either the root (/), /usr, /var file system, change to a directory
outside the damaged file system and unmount the file system.

# umount /mount-point

Caution � Be sure to use the newfs -N in the next step. If you omit the -N option, you will destroy
all of the data in the file system and replace it with an empty file system.

Display the superblock values by using the newfs -N command. 
# newfs -N /dev/rdsk/device-name

Provide an alternate superblock by using the fsck command. 
# fsck-F ufs -o b=block-number /dev/rdsk/device-name

The fsck command uses the alternate superblock you specify to restore the primary superblock. You
can always try 32 as an alternate block. Or, use any of the alternate blocks shown by the newfs -N
command.

 
Restoring a Bad Superblock (Solaris 8, 9, and 10 Releases)
The following example shows how to restore the superblock copy 5264.

# newfs -N /dev/rdsk/c0t3d0s7
/dev/rdsk/c0t3d0s7: 163944 sectors in 506 cylinders of 9 tracks, 36 sectors
83.9MB in 32 cyl groups (16 c/g, 2.65MB/g, 1216 i/g)
super-block backups (for fsck -b #) at:
32, 5264, 10496, 15728, 20960, 26192, 31424, 36656, 41888,
47120, 52352, 57584, 62816, 68048, 73280, 78512, 82976, 88208,
93440, 98672, 103904, 109136, 114368, 119600, 124832, 130064, 135296,
140528, 145760, 150992, 156224, 161456,

# fsck-F ufs -o b=5264 /dev/rdsk/c0t3d0s7
Alternate superblock location: 5264.
** /dev/rdsk/c0t3d0s7
** Last Mounted on
** Phase 1- Check Blocks and Sizes
** Phase 2 - Check Pathnames
** Phase 3 - Check Connectivity
** Phase 4 - Check Reference Counts
** Phase 5 - Check Cyl groups
36 files, 867 used, 75712 free (16 frags, 9462 blocks, 0.0% fragmentation)
***** FILE SYSTEM WAS MODIFIED *****
#


###############################################################
3. UNDELETE OF FILES IF NO BACKUPS ARE AVAILABLE:
###############################################################

Few things are so lousy, as loosing an important file.

Ofcourse, all sysdamins have professional backup software running on their systems.

But in some rare cases, for whatever reason, a backup might not be available.
In such a situation it *might* still be possible to recover files
after you have accidently deleted them.

In general however, there is no more than a pessimistic prognose for file undelete.

When a file is deleted using the �rm� command, three actions occur. First, the filename and pointer 
are removed from its directory block. Second, the kernel frees up file's data blocks for general use. 
Third, the kernel frees up the file's indexing record, or inode, for general use. 
Thus, quite litteraly, the file is effectively destroyed from the operating system's standpoint.
But, it's not "gone" yet ! If you act quickly, you might salvage the file.

Some unixes provides for a sort of "unrm" or "undelete" (shell) utility, suitable for
some types of filesystems, which may produce good results if you start using it immediately
after you mistakingly deleted the file. But it's likely that you still need to do a lot of work after
using that "unrm" tool, like processing the results with "Lazarus" or similar tool.
Any case, its still worth to check with your sysadmin or check your system.
Also, in general, if an important file was deleted by mistake, (try to) stop all write activity
on that filesystem. 

Maybe, this section provides you with a pointer on how to move on.
Also, there might be "tools" out there that can help the user with such a problem.
Here are some notes on the subject of undelete on Unix.


----------------------------------------------------------------------------------------
Note 1:
----------------------------------------------------------------------------------------

http://www.cyberciti.biz/tips/linuxunix-recover-deleted-files.html

Using grep (traditional UNIX way) to recover files
Use following grep syntax:

grep -b 'search-text' /dev/partition > file.txt
OR

grep -a -B[size before] -A[size after] 'text' /dev/[your_partition] > file.txt

Where,

-i : Ignore case distinctions in both the PATTERN and the input files i.e. match both uppercase and lowercase character. 
-a : Process a binary file as if it were text 
-B Print number lines/size of leading context before matching lines. 
-A: Print number lines/size of trailing context after matching lines. 

To recover text file starting with "nixCraft" word on /dev/sda1 you can try following command:

# grep -i -a -B10 -A100 'nixCraft' /dev/sda1 > file.txt

Next use vi to see file.txt. This method is ONLY useful if deleted file is text file. 
If you are using ext2 file system, try out recover command. .


----------------------------------------------------------------------------------------
Note 2:
----------------------------------------------------------------------------------------

Bring back deleted files with lsof
By Michael Stutz on November 16, 2006 (8:00:00 AM) 

Briefly, a file as it appears somewhere on a Linux filesystem is actually just a link to an inode, 
which contains all of the file's properties, such as permissions and ownership, as well as the addresses 
of the data blocks where the file's content is stored on disk. When you rm a file, you're removing the link 
that points to its inode, but not the inode itself; other processes (such as your audio player) might still 
have it open. It's only after they're through and all links are removed that an inode and the data blocks 
it pointed to are made available for writing.

This delay is your key to a quick and happy recovery: if a process still has the file open, the data's there 
somewhere, even though according to the directory listing the file already appears to be gone.

This is where the Linux process pseudo-filesystem, the /proc directory, comes into play. Every process on 
the system has a directory here with its name on it, inside of which lies many things -- 
including an fd ("file descriptor") subdirectory containing links to all files that the process has open. 
Even if a file has been removed from the filesystem, a copy of the data will be right here:

/proc/process id/fd/file descriptor 

To know where to go, you need to get the id of the process that has the file open, and the file descriptor. 
These you get with lsof, whose name means "list open files." (It actually does a whole lot more than this 
and is so useful that almost every system has it installed. If yours isn't one of them, you can grab the latest 
version straight from its author.)

Once you get that information from lsof, you can just copy the data out of /proc and call it a day.

This whole thing is best demonstrated with a live example. First, create a text file that you can delete 
and then bring back:

$ man lsof | col -b > myfile 

Then have a look at the contents of the file that you just created:

$ less myfile 

You should see a plaintext version of lsof's huge man page looking out at you, courtesy of less.

Now press Ctrl-Z to suspend less. Back at a shell prompt make sure your file is still there:

$ ls -l myfile
-rw-r--r--  1 jimbo jimbo 114383 Oct 31 16:14 myfile
$ stat myfile
  File: `myfile'
  Size: 114383          Blocks: 232        IO Block: 4096   regular file
Device: 341h/833d       Inode: 1276722     Links: 1
Access: (0644/-rw-r--r--)  Uid: ( 1010/    jimbo)   Gid: ( 1010/    jimbo)
Access: 2006-10-31 16:15:08.423715488 -0400
Modify: 2006-10-31 16:14:52.684417746 -0400
Change: 2006-10-31 16:14:52.684417746 -0400
Yup, it's there all right. OK, go ahead and oops it:

$ rm myfile
$ ls -l myfile
ls: myfile: No such file or directory
$ stat myfile
stat: cannot stat `myfile': No such file or directory
$
It's gone.

At this point, you must not allow the process still using the file to exit, because once that happens, 
the file will really be gone and your troubles will intensify. Your background less process in this walkthrough 
isn't going anywhere (unless you kill the process or exit the shell), but if this were a video or sound file that 
you were playing, the first thing to do at the point where you realize you deleted the file would be to 
immediately pause the application playback, or otherwise freeze the process, so that it doesn't eventually 
stop playing the file and exit. 

Now to bring the file back. First see what lsof has to say about it:

$ lsof | grep myfile
less      4158    jimbo    4r      REG       3,65   114383   1276722 /home/jimbo/myfile (deleted)
The first column gives you the name of the command associated with the process, the second column is the 
process id, and the number in the fourth column is the file descriptor (the "r" means that it's a regular file). 
Now you know that process 4158 still has the file open, and you know the file descriptor, 4. That's everything 
you have to know to copy it out of /proc.

You might think that using the -a flag with cp is the right thing to do here, since you're restoring the file -- 
but it's actually important that you don't do that. Otherwise, instead of copying the literal data contained 
in the file, you'll be copying a now-broken symbolic link to the file as it once was listed in its original directory:

$ ls -l /proc/4158/fd/4
lr-x------  1 jimbo jimbo 64 Oct 31 16:18 /proc/4158/fd/4 -> /home/jimbo/myfile (deleted)
$ cp -a /proc/4158/fd/4 myfile.wrong
$ ls -l myfile.wrong
lrwxr-xr-x  1 jimbo jimbo 24 Oct 31 16:22 myfile.wrong -> /home/jimbo/myfile (deleted)
$ file myfile.wrong
myfile.wrong: broken symbolic link to `/home/jimbo/myfile (deleted)'
$ file /proc/4158/fd/4
/proc/4158/fd/4: broken symbolic link to `/home/jimbo/myfile (deleted)'
So instead of all that, just a plain old cp will do the trick:

$ cp /proc/4158/fd/4 myfile.saved 

And finally, verify that you've done good:

$ ls -l myfile.saved
-rw-r--r--  1 jimbo jimbo 114383 Oct 31 16:25 myfile.saved
$ man lsof | col -b > myfile.new
$ cmp myfile.saved myfile.new
No complaints from cmp -- your restoration is the real deal.

Incidentally, there are a lot of useful things you can do with lsof in addition to rescuing lost files.


----------------------------------------------------------------------------------------
Note 3:
----------------------------------------------------------------------------------------

Recover Deleted Files
Files on Unix may be deleted, but still held open by another process. While most Unix would require a utility to read a file 
by the filesystem and inode(5) number, the special /proc filesystem on Linux allows the recovery of deleted but held open files:

Use lsof(1) to discover the deleted file, and record the Process ID (PID) and File Descriptor (FD) open to this file. 
Recover the file: 

cp /proc/$PID/fd/$FD /var/tmp/recovered 

The deleted file should appear as a broken symbolic link under the /proc/$PID/fd directory. 
Despite this, /proc still allows the file to be copied elsewhere. For related information, see how to debug Unix systems.


----------------------------------------------------------------------------------------
Note 4:
----------------------------------------------------------------------------------------

HOWTO recover deleted files on an Linux ext3 file system

Please see:

http://www.xs4all.nl/~carlo17/howto/undelete_ext3.html

Or see

Tom Pycke, Recovering Files in Linux, available at www.recover.source.net/linux


For Linux ext2 file system:

1. R-Linux undelete utility: 
Take a look here:
http://3d2f.com/tags/undelete/recover/unix/

2. The ext2 file system has an addon program called e2undel[1] which allows file undeletion, although the similar ext3 file system 
does not support that kind of undeletion.

3. Also, mabe the following "unrm" can be of help on Linux:
http://freshmeat.net/projects/unrm/


Another "unrm" pointer:
http://staff.washington.edu/dittrich/talks/blackhat/tct/man/man1/unrm.1.html


----------------------------------------------------------------------------------------
Note 5:
----------------------------------------------------------------------------------------

Possible AIX undelete tool:

http://www.compunix.com/products.html
http://www.compunix.com/prod/analyse.html
http://www.compunix.com/eval/list.html


For AIX and JFS:

http://www.phase2.net/2008/03/04/aix-recovering-a-deleted-file-undelete/

When you are really good with the fsdb tool (included in AIX), you might be able
to recover files yourself. See another note in this document for an example of using fsdb.

See man page for fsdb or 
http://publib.boulder.ibm.com/infocenter/pseries/v5r3/index.jsp?topic=/com.ibm.aix.cmds/doc/aixcmds2/fsdb.htm


----------------------------------------------------------------------------------------
Note 6:
----------------------------------------------------------------------------------------

1. Solaris Recovery:

-- Kernel Recovery for Solaris Sparc
Kernel Recovery for Solaris Sparc is a do-it-yourself data recovery software. Software performs read-only scan, 
which helps you to recover your important data in minutes. File System supported for recovery is UFS File system.

http://www.download.com/Kernel-Recovery-for-Solaris-Sparc/3000-2248_4-10578170.html
http://www.download3k.com/Press-Launch-of-Kernel-Recovery-for-Solaris-SPARC.html
http://www.tucows.com/preview/505583
http://www.programurl.com/kernel-recovery-for-solaris-sparc.htm

Nucleus Technologies.com: http://www.nucleustechnologies.com 

-- Other Solaris Data Recovery Software:

http://solaris-data-recovery-software.qarchive.org/


2. R-Tools technology: Undelete tool for Linux and Solaris:

http://www.data-recovery-software.net/


----------------------------------------------------------------------------------------
Note 7:
----------------------------------------------------------------------------------------

For AIX and JFS filesystem: an undelete program
Not tested by writer of this document:


/*****************************************************************************
 * rsb.c - Read Super Block. Allows a jfs superblock to be dumped, inode
 * table to be listed or specific inodes data pointers to be chased and
 * dumped to standard out (undelete).
 *
 * Phil Gibbs - Trinem Consulting (pgibbs@trinem.co.uk)
 ****************************************************************************/
#include <stdio.h>
#include <jfs/filsys.h>
#include <jfs/ino.h>
#include <sys/types.h>
#include <pwd.h>
#include <grp.h>
#include <unistd.h>
#include <time.h>

#define FOUR_MB		(1024*1024*4)
#define THIRTY_TWO_KB	(1024*32)

extern int optind;
extern int Optopt;
extern int Opterr;
extern char *optarg;

void PrintSep()
{
	int k=80;

	while (k)
	{
		putchar('-');
		k--;
	}
	putchar('\n');
}

char *UserName(uid_t uid)
{
char replystr[10];
struct passwd *res;

res=getpwuid(uid);
if (res->pw_name[0])
{
	return res->pw_name;
}
else
{
	sprintf(replystr,"%d",uid);
	return replystr;
}
}

char *GroupName(gid_t gid)
{
struct group *res;
res=getgrgid(gid);
return res->gr_name;
}


ulong NumberOfInodes(struct superblock *sb)
{
	ulong MaxInodes;
	ulong TotalFrags;

	if (sb->s_version==fsv3pvers)
	{
		TotalFrags=(sb->s_fsize*512)/sb->s_fragsize;
		MaxInodes=(TotalFrags/sb->s_agsize)*sb->s_iagsize;
	}
	else
	{
		MaxInodes=(sb->s_fsize*512)/sb->s_bsize;
	}
	return MaxInodes;
}


void AnalyseSuperBlock(struct superblock *sb)
{
	ulong TotalFrags;

	PrintSep();
	printf("SuperBlock Details:\n-------------------\n");
	printf("File system size:  %ld x 512 bytes (%ld Mb)\n",
				sb->s_fsize,
				(sb->s_fsize*512)/(1024*1024));
	printf("Block size:        %d bytes\n",sb->s_bsize);
	printf("Flags:             ");
	switch (sb->s_fmod)
	{
		case (char)FM_CLEAN:
			break;
		case (char)FM_MOUNT:
			printf("mounted ");
			break;
		case (char)FM_MDIRTY:
			printf("mounted dirty ");
			break;
		case (char)FM_LOGREDO:
			printf("log redo failed ");
			break;
		default:
			printf("Unknown flag ");
			break;
	}
	if (sb->s_ronly) printf("(read-only)");
	printf("\n");
	printf("Last SB update at: %s",ctime(&(sb->s_time)));
	printf("Version:           %s\n",
	sb->s_version?"1 - fsv3pvers":"0 - fsv3vers");
	printf("\n");
	if (sb->s_version==fsv3pvers)
	{
		TotalFrags=(sb->s_fsize*512)/sb->s_fragsize;
		printf("Fragment size:     %5d         ",sb->s_fragsize);
		printf("inodes per alloc:  %8d\n",sb->s_iagsize);
		printf("Frags per alloc:   %5d         ",sb->s_agsize);
		printf("Total Fragments:   %8d\n",TotalFrags);
		printf("Total Alloc Grps:  %5d         ",
						TotalFrags/sb->s_agsize);
		printf("Max inodes:        %8ld\n",NumberOfInodes(sb));
	}
	else
	{
		printf("Total Alloc Grps:  %5d         ",
				(sb->s_fsize*512)/sb->s_agsize);
		printf("inodes per alloc:  %8d\n",sb->s_agsize);
		printf("Max inodes:      %8ld\n",NumberOfInodes(sb));
	}
	PrintSep();
}

void ReadInode(	FILE *in,
		ulong StartInum,
		struct dinode *inode,
		ulong InodesPerAllocBlock,
		ulong AllocBlockSize)
{
	off_t			SeekPoint;
	long			BlockNumber;
	int			OffsetInBlock;
	static struct dinode	I_NODES[PAGESIZE/DILENGTH];
	ulong			AllocBlock;
	ulong			inum;
	static off_t		LastSeekPoint=-1;

	AllocBlock=(StartInum/InodesPerAllocBlock);
	BlockNumber=(StartInum-(AllocBlock*InodesPerAllocBlock))/
			(PAGESIZE/DILENGTH);
	OffsetInBlock=(StartInum-(AllocBlock*InodesPerAllocBlock))-
			(BlockNumber*(PAGESIZE/DILENGTH));
	SeekPoint=(AllocBlock)?
		(BlockNumber*PAGESIZE)+(AllocBlock*AllocBlockSize):
		(BlockNumber*PAGESIZE)+(INODES_B*PAGESIZE);
	if (SeekPoint!=LastSeekPoint)
	{
		sync();
		fseek(in,SeekPoint,SEEK_SET);
		fread(I_NODES,PAGESIZE,1,in);
		LastSeekPoint=SeekPoint;
	}
	*inode=I_NODES[OffsetInBlock];
}

void DumpInodeContents(	long	inode,
			FILE	*in,
			ulong	InodesPerAllocBlock,
			ulong	AllocBlockSize,
			ulong	Mask,
			ulong	Multiplier)
{
	struct dinode		DiskInode;
	ulong			SeekPoint;
	char			Buffer[4096];
	ulong			FileSize;
	int			k;
	int			BytesToRead;
	ulong			*DiskPointers;
	int			NumPtrs;

	ReadInode(	in,
			inode,
			&DiskInode,
			InodesPerAllocBlock,
			AllocBlockSize);
	FileSize=DiskInode.di_size;

	if (FileSize>FOUR_MB)
	{
		/* Double indirect mapping */
	}
	else
	if (FileSize>THIRTY_TWO_KB)
	{
		/* Indirect mapping */
		SeekPoint=DiskInode.di_rindirect & Mask;
		SeekPoint=SeekPoint*Multiplier;
		DiskPointers=(ulong *)malloc(1024*sizeof(ulong));
		fseek(in,SeekPoint,SEEK_SET);
		fread(DiskPointers,1024*sizeof(ulong),1,in);
		NumPtrs=1024;
	}
	else
	{
		/* Direct Mapping */
		DiskPointers=&(DiskInode.di_rdaddr[0]);
		NumPtrs=8;
	}

	for (k=0;k<=NumPtrs && FileSize;k++)
	{
		SeekPoint=(DiskPointers[k] & Mask);
		SeekPoint=SeekPoint*Multiplier;

		BytesToRead=(FileSize>sizeof(Buffer))?sizeof(Buffer):FileSize;
		fseek(in,SeekPoint,SEEK_SET);
		fread(Buffer,BytesToRead,1,in);
		FileSize=FileSize-BytesToRead;
		write(1,Buffer,BytesToRead);
	}
}

void DumpInodeList(	FILE	*in,
			ulong	MaxInodes,
			ulong	InodesPerAllocBlock,
			ulong	AllocBlockSize)
{
	long			inode;
	struct dinode		DiskInode;
	struct tm		*TimeStruct;

	printf("   Inode Links     User    Group     Size    ModDate\n");
	printf("-------- ----- -------- -------- --------    -------\n");
	for (inode=0;inode<=MaxInodes;inode++)
	{
		ReadInode(	in,
				inode,
				&DiskInode,
				InodesPerAllocBlock,
				AllocBlockSize);
		if (DiskInode.di_mtime)
		{
			TimeStruct=localtime((long *)&DiskInode.di_mtime);
			printf("%8d %5d %8s %8s %8d %02d/%02d/%4d\n",
				inode,
				DiskInode.di_nlink,
				UserName(DiskInode.di_uid),
				GroupName(DiskInode.di_gid),
				DiskInode.di_size,
				TimeStruct->tm_mday,
				TimeStruct->tm_mon,
				TimeStruct->tm_year+1900);
		}
	}
}

void ExitWithUsageMessage()
{
	fprintf(stderr,"USAGE: rsb [-i inode] [-d] [-s] <block_device>\n");
	exit(1);
}

main(int argc,char **argv)
{
	FILE			*in;
	struct superblock	SuperBlock;
	short			Valid;
	long			inode=0;
	struct dinode		DiskInode;
	ulong			AllocBlockSize;
	ulong			InodesPerAllocBlock;
	ulong			MaxInodes;
	ulong			Mask;
	ulong			Multiplier;
	int			option;
	int			DumpSuperBlockFlag=0;
	int			DumpFlag=0;

	while ((option=getopt(argc,argv,"i:ds")) != EOF)
	{
		switch(option)
		{
			case 'i':
				/* Inode specified */
				inode=atol(optarg);
				break;
			case 'd':
				/* Dump flag */
				DumpFlag=1;
				break;
			case 's':
				/* List Superblock flag */
				DumpSuperBlockFlag=1;
				break;
			default:
				break;
		}
	}

	if (strlen(argv[optind])) in=fopen(argv[optind],"r");
	else ExitWithUsageMessage();

	if (in)
	{
		fseek(in,SUPER_B*PAGESIZE,SEEK_SET);
		fread(&SuperBlock,sizeof(SuperBlock),1,in);
		switch (SuperBlock.s_version)
		{
			case fsv3pvers:
				Valid=!strncmp(SuperBlock.s_magic,fsv3pmagic,4);
				InodesPerAllocBlock=SuperBlock.s_iagsize;
				AllocBlockSize=
				SuperBlock.s_fragsize*SuperBlock.s_agsize;
				Multiplier=SuperBlock.s_fragsize;
				Mask=0x3ffffff;
				break;
			case fsv3vers:
				Valid=!strncmp(SuperBlock.s_magic,fsv3magic,4);
				InodesPerAllocBlock=SuperBlock.s_agsize;
				AllocBlockSize=SuperBlock.s_agsize*PAGESIZE;
				Multiplier=SuperBlock.s_bsize;
				Mask=0xfffffff;
				break;
			default:
				Valid=0;
				break;
		}
		if (Valid)
		{
			if (DumpSuperBlockFlag==1)
			{
				AnalyseSuperBlock(&SuperBlock);
			}
			MaxInodes=NumberOfInodes(&SuperBlock);
			if (DumpFlag==1)
			{
				if (inode)
				DumpInodeContents(inode,in,InodesPerAllocBlock,AllocBlockSize,Mask,Multiplier);
				else
				DumpInodeList(in,MaxInodes,InodesPerAllocBlock,AllocBlockSize);
			}
		}
		else
		{
			fprintf(stderr,"Superblock - bad magic number\n");
			exit(1);
		}
	}
	else
	{
		fprintf(stderr,"couldn't open ");
		perror(argv[optind]);
		exit(1);
	}
}


----------------------------------------------------------------------------------------
Note 8:
----------------------------------------------------------------------------------------

http://wiki.yak.net/592


HOWTO rescue deleted Linux files | undelete | unremove | unrm | rm -v
Here's how we rescued a LaTeX *.tex file that was accidentally removed on a Linux box. 


Stop doing anything else on the system. The idea is to use the disk as little as possible. (We stopped short of killing idle daemons, 
because we didn't want them scribbling stuff in log files. ) 

Know the first few bytes of the file you want. Hopefully they are fairly unique. The LaTeX document we wanted began with the characters 
"\document", so we used that pattern. 

Write a program that will read each sector from the raw partition (you must be root) (assuming 512 byte sectors is safest) 
and see if it begins with the pattern. If not, it loops and reads the next 512 bytes... If it finds it, it saves that sector and some 
fixed amount of following sectors (we did 600 more sectors, which is 300 KBytes) in a rescue file. Save probably twice as long a file as you think 
you're looking for. Save them to an extra partition -- or invoke "scp" or something to save them on another machine. 
(Usually ext2 & ext3 store files contiguously on disk -- especially if they are not too big & are written all at once.) 

The following TCL script did the job. Make it open the exact partition you want to scan. It needs another partition to write the rescue files to. 
grope.tcl 

 #
 #  This is in the language Tcl.
 #  Usage:
 #      tclsh scriptname < /dev/hda1   (the partition with the deleted file)
 #
 #  Notice:  change the MOUNT below to a different partition!
 #
 #  Also fix the "string match" pattern -- we used \document for a LaTeX document.
 #
 #  Occasinally sector numbers are written out, to indicate progress.
 #       ( 1 sector == 512 bytes == 0.5KBytes )


 set i 0
 set n 0
 fconfigure stdin -translation binary -encoding binary
 while true {
 	set x [read stdin 512 ]
 	if {$x==""} break
 
 	if {[string match {\\document*} $x ]} {
 		incr i
 		puts stderr "SAVING $i"
 		set f [open /WRITABLE_MOUNT_TO_SAVE_FILES_IN_GOES_HERE/rescue.$i w]
 		fconfigure $f -translation binary -encoding binary
 		puts -nonewline $f $x 
 		puts -nonewline $f [read stdin [expr 600*512] ]
 		close $f
 	}
 
 	incr n
 	if { ($n % 200000)==0 } { puts -nonewline stderr $n. }
 } 


Use "less" to examine the rescue files to see if you can find your data. Also the "strings" command is very good about 
extracting ASCII text portions. 

Even better, if you have physical access to the machine, shut down the system IMMEDIATELY and physically install its disk 
as an extra drive in another unix box. Do your scanning of the raw disk from there. (In our recent case, we didn't have access to this box.) 
Or boot a KNOPPIX CD (which will not write to any partitions unless you specifically mount them writeable from a root shell.) 

I've also used this kind of technique to rescue JPEG files from a digital camera's Compact Flash with a corrupted FAT file system. 
We wrote a program that started a new rescue file every time it found "JFIF" as the first 4 bytes of a sector, even if it was still 
saving the previous rescue file. We completely rescued about 3/4 of the images this way, and fragments of more. 

Obviously the data you are rescuing must be important enough to warrent this much trouble with no guarentee of successfull results. 

Your file could always have been overwritten, or it could be fragmented so you don't find the pieces. But the couple of times I've had 
to do this (for someone else's data!) we've had pretty good success. 


----------------------------------------------------------------------------------------
Note 9: special case: text file edited with vi
----------------------------------------------------------------------------------------

If the file that was deleted, was a text file, and recently edited by vi, then there still might be a version 
available on your system.

On most unix systems, vi keep tracks of former versions.
Check

/var/preserve/username (or similar directory: vi -r )

or a similar directory, depending on the unix version, where there still might exist a recent
version of your text file.


----------------------------------------------------------------------------------------
Note 10:
----------------------------------------------------------------------------------------

Subject: Undelete of a file on AIX, using fsdb.

Remark   : Quite an elaborate procedure but it seems to work for small files.
Important: Be carefull in using fsdb.


Document:

http://www.phase2.net/2008/03/04/aix-recovering-a-deleted-file-undelete/


-- Contents repeated here:

This is a document I wrote a while back for work that I thought I would release in hopes that some people out there would find it useful.

Preferably, you have a backup of the file system that you can use. If not, the filesystem you are about to try to to recover a file on 
must meet these requirements:

No new files have been created on the filesystem. 
No files have been extended. 
The filesystem is able to be unmounted. 
It is a JFS filesystem, not JFS2 
If so, then please, drink a few more beers and continue, but before you do�

BACKUP THE CURRENT FILESYSTEM!

Also, note that if you are dealing with a directory that has been deleted and would like to recover both the directory 
and the files under that directory, you should try Recovering a Deleted Directory ( a document I have yet to post.. ). 
It follows many of the same steps, but has some very important differences. Do not try and use this procedure to recover 
deleted directories and the files that were contained within them. You will mess up.

Before we begin, I need to note a few things. I take no responsibility if this screws up your system. Use this at your own risk. 
Also, the example presented here is an actual representation of me recovering a deleted file, this is not just made up numbers. 
Also, this only works on jfs filesystems, not jfs2. The jfs2 fsdb is much different and I haven�t had a chance to play with it 
to determine the proper way of doing this.

Now that I�ve said that, we can begin. We�ll use an example directory with some example files. Our directory is called 
/test and our filesystem is testlv, otherwise known as /dev/testlv. In our example, our Junior System Admin, Myron, 
has accidentally deleted a perl script called testfile.pl and needs to recover it.

Note: If you are performing this operation on a filesystem while in maintenance mode, do NOT use option 1 when asked on how to mount 
the filesystems. ALWAYS use option 2, which specifies to start a shell before mounting the filesystems. Otherwise, the system will force 
a fsck -y on the filesystem and delete your files.

Step 1.
First, run this command:

ls -id /testOutput:

[test:/]# ls -id /test
    2 /test/

This informs us that the inode for the directory /test is 2. Record this for future use.

Step 2.
Unmount /test

umount /test

Output: None

We must unmount the directory. We don�t want anyone to try and use it while we are attempting to restore the file.

Step 3
Now we�ll start up the filesystem debugger.

fsdb /dev/testlv

Output:

[test:/]# fsdb /dev/testlv

File System:                           /dev/testlv

File System Size:                         193200128  (512 byte blocks)
Disk Map Size:                                 1660  (4K blocks)
Inode Map Size:                                 831  (4K blocks)
Fragment Size:                                 4096  (bytes)
Allocation Group Size:                        16384  (fragments)
Inodes per Allocation Group:                   8192
Total Inodes:                              12075008
Total Fragments:                           24150016

This starts the filesystem debugger on our testlv filesystem.

Step 4
Now we look at our inode number.

2i

Output:

2i
i#:      2  md: d-g-rwxr-xr-x  ln:    4  uid:    3  gid:    3
szh:        0  szl:      512  (actual size:      512)
a0: 0x25d       a1: 0x00        a2: 0x00        a3: 0x00
a4: 0x00        a5: 0x00        a6: 0x00        a7: 0x00
at: Mon Jan 10 11:19:17 2005
mt: Mon Jan 10 11:11:26 2005
ct: Mon Jan 10 11:11:26 2005

The INODE in the command is the inode number we recorded in step #1. This will display the inode information for the directory. 
The field a0 contains the block number of the directory. The following steps assume only field a0 is used. If a value appears in a1, etc, 
it may be necessary to repeat steps #5 and #6 for each block until the file to be recovered is found.

Step 5
Move to the block

a0b

Output:

a0b
0x000025d000  :  0x00000000 (0)

This moves to the block pointed to by field �a0? of this inode.

Step 6
Now we need to print out some data.

p256c

Output:

p256c

0x000025d000:   \0 \0 \0 \? \0 \? \0 \? .  \0 \0 \0 \0 \0 \0 \?
0x000025d010:   \0 \? \0 \? .  .  \0 \0 \0 \0 \0 \? \0 \? \0 \n
0x000025d020:   l  o  s  t  +  f  o  u  n  d  \0 \0 \0 \0 \0 \?
0x000025d030:   \0 $  \0 \? m  e  m  _  r  e  p  o  r  t  _  2
0x000025d040:   0  0  4  1  1  0  1  .  d  m  p  .  g  z  \0 \0
0x000025d050:   \0 \0 \0 \? \0 \s \0 \? o  r  a  s  c  r  a  t
0x000025d060:   c  h  .  c  p  i  o  .  g  z  \0 \0 \0 \0 \0 \?
0x000025d070:   \0 (  \0 \s u  s  e  r  _  a  c  t  i  v  i  t
0x000025d080:   y  _  2  0  0  4  1  1  0  1  .  d  m  p  .  g
0x000025d090:   z  \0 \0 \0 \0 \0 \0 \? \0 ,  \0 !  u  s  e  r
0x000025d0a0:   _  a  c  t  i  v  i  t  y  _  d  e  t  _  2  0
0x000025d0b0:   0  4  1  1  0  1  .  d  m  p  .  g  z  \0 \0 \0
0x000025d0c0:   \0 \? `  \0 \? @  \0 \? E  C  R  1  X  \0 \0 \0
0x000025d0d0:   \0 \0 \0 \? \? 0  \0 \? t  e  s  t  f  i  l  e
0x000025d0e0:   .  p  l  \0 \?    \0 \a t  e  s  t  d  i  r  \0
0x000025d0f0:   j  d  u  c  k  o  .  t  x  t  \0 \0 \0 \0 \0 \?

The command p256c stands for �print 256 bytes in character mode�. You could type �p128c� and it would print 128 bytes in character mode 
and so on. The beginning left column is the address of the first character in that row. The important thing in this output is 
to find which line the file to be recovered is on. Our file ( testfile.pl ) is located on line 0�000025d0d0. Next, we have to find 
the address of the first character of our filename. To do this, starting at 0, count in hexidecimal until you reach the first character 
of the filename. In our example, the �t� of testfile.pl is at address 0�000025d0d8. Record this address.

If you cannot find your filename here, issue the command again. It will print the next 256 bytes in character mode. 
Do this until you find your filename.

Here�s a layout to help you in figuring out how we got the address:

Address:        0  1  2  3  4  5  6  7  8  9  A  B  C  D  E  F
0�000025d0d0:   \0 \0 \0 \? \? 0  \0 \? t  e  s  t  f  i  l  eStep 7

Reset our position.

a0b

Output:

a0b
0x000025d000  :  0x00000000 (0)

This resets our position back to the beginning of the a0 block. This is necessary whenever you want to reprint out 
the byte data. Remember, however, that if you had to use the �p� command many times to find your filename, you will probably 
have to use it many times each time you reset back to the beginning.

Step 8
Print our data in decimal

p256e

Output:

p256e

0x000025d000:         0       2      12       1   11776       0       0       2
0x000025d010:        12       2   11822       0       0      16      20      10
0x000025d020:     27759   29556   11110   28533   28260       0       0      17
0x000025d030:        36      26   28005   27999   29285   28783   29300   24370
0x000025d040:     12336   13361   12592   12590   25709   28718   26490       0
0x000025d050:         0      18      28      18   28530   24947   25458   24948
0x000025d060:     25448   11875   28777   28462   26490       0       0      19
0x000025d070:        40      29   30067   25970   24417   25460   26998   26996
0x000025d080:     31071   12848   12340   12593   12337   11876   28016   11879
0x000025d090:     31232       0       0      20      44      33   30067   25970
0x000025d0a0:     24417   25460   26998   26996   31071   25701   29791   12848
0x000025d0b0:     12340   12593   12337   11876   28016   11879   31232       0
0x000025d0c0:        18   24576     320       5   17731   21041   22528       0
0x000025d0d0:         0      21     304      11   29797   29556   26217   27749
0x000025d0e0:     11888   27648     288       7   29797   29556   25705   29184
0x000025d0f0:     27236   30051   27503   11892   30836       0       0      23
0x000025d100:       260      16   27233   28005   29549   24947   29537   29281
0x000025d110:     11892   30836       0       0       0       0       0       0
0x000025d120:         0       0       0       0       0       0       0       0
0x000025d130:         0       0       0       0       0       0       0       0
0x000025d140:         0       0       0       0       0       0       0       0

The command �p256e� stands for �print 256 bytes in decimal word format�. This output can be helpful and confusing at the same time. 
First, find the beginning address that our file name is on. In our example, this was 0�000025d0d0. The line in decimal format reads:

0x000025d0d0:         0      21     304      11   29797   29556   26217   27749

For each file, assume the following:

   {ADDRESS}:  x    x    x    x    x    x    x    x    x
               |    |    |    |    |---- filename -----|
     inode # --+----+    |    |
                         |    +-- filename length
         record LENGTH --+

Note that the inode # may begin on any part of the line. The reason we print the data in decimal format is to help us 
determine where in the line the inode number is. There are several ways to help you do this, here are some:

Count the number of characters in your filename, then try and find that number in our address line. 
( eg: There are 11 characters in the filename �testfile.pl�. ) You can see on our line there is a matching number 11. 
Recount to the address 0�000025d0d8, assuming each column represents two numbers. The first column is 0 and 1. The second column is 2 and 3, 
then 4 and 5, etc. When you reach the column that matches your address, go back one column. The number in this column should match up 
with your filename length. Unless, of course, your filename is over 255 characters. 
Once you are sure you have the the correct column for your filename length, you are going to count back three more columns. 
This should put at the first column of the inode number. We�ll use our example decimal line to explain this more:

0x000025d0d0:         0      21     304      11   29797   29556   26217   27749

Like we mentioned before, testfile.pl is 11 characters. We find a matching number 11 in the 4th column. That means that the column 
with �304' is our record length field and the 0 and 21 columns make up our inode. Now, that we know which columns our inode is in ( columns 1 and 2 ), 
we must translate this number into our real inode number.

Step 9
Reset our position again.

a0b

Output:

a0b
0x000025d000  :  0x00000000 (0)

Again, we have to reset our position back to the beginning because this time, we�re going to print the information in hex.

Step 10
Print our data in hex.

p256x

Output:

p256x

0x000025d000:    0000  0002  000C  0001  2E00  0000  0000  0002
0x000025d010:    000C  0002  2E2E  0000  0000  0010  0014  000A
0x000025d020:    6C6F  7374  2B66  6F75  6E64  0000  0000  0011
0x000025d030:    0024  001A  6D65  6D5F  7265  706F  7274  5F32
0x000025d040:    3030  3431  3130  312E  646D  702E  677A  0000
0x000025d050:    0000  0012  001C  0012  6F72  6173  6372  6174
0x000025d060:    6368  2E63  7069  6F2E  677A  0000  0000  0013
0x000025d070:    0028  001D  7573  6572  5F61  6374  6976  6974
0x000025d080:    795F  3230  3034  3131  3031  2E64  6D70  2E67
0x000025d090:    7A00  0000  0000  0014  002C  0021  7573  6572
0x000025d0a0:    5F61  6374  6976  6974  795F  6465  745F  3230
0x000025d0b0:    3034  3131  3031  2E64  6D70  2E67  7A00  0000
0x000025d0c0:    0012  6000  0140  0005  4543  5231  5800  0000
0x000025d0d0:    0000  0015  0130  000B  7465  7374  6669  6C65
0x000025d0e0:    2E70  6C00  0120  0007  7465  7374  6469  7200
0x000025d0f0:    6A64  7563  6B6F  2E74  7874  0000  0000  0017
0x000025d100:    0104  0010  6A61  6D65  736D  6173  7361  7261
0x000025d110:    2E74  7874  0000  0000  0000  0000  0000  0000
0x000025d120:    0000  0000  0000  0000  0000  0000  0000  0000
0x000025d130:    0000  0000  0000  0000  0000  0000  0000  0000
0x000025d140:    0000  0000  0000  0000  0000  0000  0000  0000

First, we find the line that begins with our address 0�000025d0d0. There it is!

0x000025d0d0:    0000  0015  0130  000B  7465  7374  6669  6C65

Next, find the two columns that we know our inode is in. For us, that�s column 1 and 2. Column 1 is all 0�s, so we can disregard it. 
Column 2, however, is 0015. Open up a calculator and translate 15 from hexidecimal to decimal. As you can see, this number turns into 21, 
which is our real inode number.

Some of you may be asking why we just didn�t use the inode number from the decimal output in step 8. The reason is because it always isn�t 
always this easy. Take, for example, the address above ours. The directory ECR1X is on this address. Its inode number, like ours, is in 
columns 1 and 2. However, if you compare the lines between hexidecimal and decimal, you can immediately see the difference.

Decimal:
0x000025d0c0:      18  24576
Hex:
0x000025d0c0:    0012  6000

If you translate 12600 from hexidecimal to decimal, the output is 1204224, which is the correct inode number for the ECR1X directory. 
If you can figure out how to translate 18 24576 into 1204224, please let me know and I�ll update this document.

In any case, we now know the inode number of the missing file. We�re close to recovery!

Step 11
We go to our new inode number

21i

Output:

21i
i#:     21  md: f---rw-r--r--  ln:    0  uid:    0  gid:    3
szh:        0  szl:       45  (actual size:       45)
a0: 0xeff       a1: 0x00        a2: 0x00        a3: 0x00
a4: 0x00        a5: 0x00        a6: 0x00        a7: 0x00
at: Mon Jan 10 14:16:40 2005
mt: Mon Jan 10 14:16:48 2005
ct: Mon Jan 10 14:16:53 2005

From this output, you can see that we have a file.

Step 12
21i.ln=1

Output:

21i.ln=1
0x0000020a88  :  0x00000001 (1)

This sets the link count of the file back to 1. You can verify this by reissuing the command from step #11 and noticing that the �ln� field has incremented.

21i
i#:     21  md: f---rw-r--r--  ln:    1  uid:    0  gid:    3
szh:        0  szl:       45  (actual size:       45)
a0: 0xeff       a1: 0x00        a2: 0x00        a3: 0x00
a4: 0x00        a5: 0x00        a6: 0x00        a7: 0x00
at: Mon Jan 10 14:16:40 2005
mt: Mon Jan 10 14:16:48 2005
ct: Mon Jan 10 14:16:53 2005

We have now told the filesystem that the link count for inode 21 should be 1. This means that there should be a filename pointing 
at this inode. This basically reverses what the OS actually does when deleting files. It doesn�t actually erase the file data, 
instead, it unlinks the filename from its inode number, effectively preventing you from seeing the data.

Step 13
Quit.

q

Output:

q
[test:/]#

This quits out of the fsdb.

Step 14
Fsck our volume

fsck /dev/testlv

Output:

[test:/]# fsck /dev/testlv

** Checking /dev/rtestlv (/test)
** Phase 1 - Check Blocks and Sizes
** Phase 2 - Check Pathnames
** Phase 3 - Check Connectivity
** Phase 4 - Check Reference Counts
Unreferenced file  I=21  owner=root mode=100644
size=45 mtime=Jan 10 14:16 2005 ; RECONNECT? y
** Phase 5 - Check Inode Map
Bad Inode Map; SALVAGE? y
** Phase 5b - Salvage Inode Map
** Phase 6 - Check Block Map
Bad Block Map; SALVAGE? y
** Phase 6b - Salvage Block Map
18 files 21893872 blocks 171306256 free
***** Filesystem was modified *****

This does a filesystem check on /dev/testlv. As you can see, it finds an inode claiming it is linked to, but no file that links to it. 
We answer �y� to tell it to reconnect the inode to a filename, effectively giving us our file back!

Step 15
Remount our directory.

mount /test

Output: None

We must remount our filesystem to get back at our file.

Step 16
Go into lost and found. It�s where all lost little kiddies go. Duh.

cd /test/lost+found

Output: None

Our file is now located in lost+found. If you do an �ls� in this directory, you will see something like the following:

[test:/test/lost+found]# ls -l
total 8
-rw-r�r�   1 root     sys              45 Jan 10 14:16 21

And if we cat the file 21, we get the following:

[test:/test/lost+found]# cat 21
#!/usr/bin/perl

print �this is a test\n�;

Ta-da! It�s Myron�s missing perl script!

As a final aside, I will say that there may be different and much better ways of recovering files on AIX, however, this is the way 
I constructed from notes I found on various mailing lists and a few days of fooling around with it. So if you see some mistakes in this document 
or have some suggestions for better ways of doing this, please, let me know! I will happily update this document with better information as it is provided.

I hope this helps some of you who have to deal with certain people who accidentally delete files on your systems. Nothing beats a good backup 
but when you don�t have one of those, this can always be used as a fallback.


----------------------------------------------------------------------------------------
Note 11:
----------------------------------------------------------------------------------------

Subject: Undelete of a file on AIX, using fsdb.

http://faqs.cs.uu.nl/na-dir/aix-faq/part1.html


Contents repeated here:


RECOVERING REMOVED FILES AND DIRECTORIES IN A FILESYSTEM

If a file is Deleted from the system, the filesytem blocks composing 
that file still exist, but are no longer allocated. As long as no new
files are created or existing files extended within the same filesystem, 
the blocks will remain untouched. It is possible to reallocate the 
blocks to the previous file using the "fsdb" command (filesystem debugger).


 MAKE A BACKUP OF THE ENTIRE FILESYSTEM BEFORE PERFORMING THESE STEPS!!!
 ELSE ( BANG !!!!! ).

 It is possible to send a mail for have some informations ...

                   Bernard.Kozyra@bull.net


Steps to recover a deleted file
-------------------------------

1) "ls -id {dir}" 
   (where dir is directory where file resided)
   Record INODE number for next step.

2) Unmount the filesystem.

3) "fsdb /{Mountpoint}" or "fsdb /dev/{LVname}"
   (where Mountpoint is the filesystem mount point, and LVname is 
   the logical volume name of the filesystem)

4) "{INODE}i"
   (where INODE is the inode number recorded in step 1)
   This will display the inode information for the directory. The
   field a0 contains the block number of the directory.
   The following steps assume only field a0 is used. If a value 
   appears in a1, etc, it may be necessary to repeat steps #5 and 
   #6 for each block until the file to be recovered is found.

5) "a0b"
   (moves to block pointed to by field "a0" of this inode)

6) "p128c"
   (prints 128 bytes of directory in character format)
   Look for missing filename. If not seen, repeat this step until
   filename is found. Record address where filename begins. Also
   record address where PRIOR filename begins. If filename does 
   not appear, return to step #5, and selecting a1b, a2b, etc.

   Note that the address of the first field is shown to the far left.
   Increment the address by one for each position to the right,
   counting in octal.

7) "a0b"
   (moves to block pointed to by field "a0" of this inode)
   If the filename was found in block 1, use a1b instead, etc.

8) "p128e"
   (prints first 128 bytes in decimal word format)
   Find the address of the file to recover (as recorded in step 6) 
   in the far left column. If address is not shown, repeat until found.

9) Record the address of the file which appeared immediately PRIOR to 
   the file you want to recover.

10) Find the ADDRESS of the record LENGTH field for the file in step 
   #9 assuming the following format:

   {ADDRESS}:  x    x    x    x    x    x    x    x    x    x  ...
               |    |    |    |    |-------- filename ------|
     inode # --+----+    |    |
                         |    +-- filename length
         record LENGTH --+

   Note that the inode number may begin at any position on the line.
   Note also that each number represents two bytes, so the address
   of the LENGTH field will be `{ADDRESS} + (#hops * 2) + 1'

11) Starting with the first word of the inode number, count in OCTAL
    until you reach the inode number of the file to be restored, 
    assuming each word is 2 bytes.

12) "0{ADDRESS}B={BYTES}"
    (where ADDRESS is the address of the record LENGTH field found
    in step #10, and BYTES is the number of bytes [octal] counted 
    in step #11)

13) If the value found in the LENGTH field in step #10 is greater than
    255, also type the following:

    "0{ADDRESS-1}B=0"
    (where ADDRESS-1 is one less than the ADDRESS recorded in step #10)
    This is necessary to clear out the first byte of the word.

14) "q"
    (quit fsdb)

15) "fsck {Mountpoint}" or "fsck /dev/{LVname}"
    This command will return errors for each recovered file asking if
    you wish to REMOVE the file. Answer "n" to all questions.
    For each file that is listed, record the associated INODE number.

16) "fsdb /{Mountpoint}" or "fsdb /dev/{LVname}"

17) {BLOCK}i.ln=1
    (where BLOCK is the block number recoded in step #15)
    This will change the link count for the inode associated with
    the recovered file. Repeat this step for each file listed in
    step #15.

18) "q"
    (quit fsdb)

19) "fsck {Mountpoint}" or "fsck /dev/{LVname}"
    The REMOVE prompts should no longer appear. Answer "y" to
    all questions pertaining to fixing the block map, inode map,
    and/or superblock.

20) If the desired directory or file returns, send money to the author
    of this document.


----------------------------------------------------------------------------------------
Note 12:
----------------------------------------------------------------------------------------

This note has some interresting feautures. You can't use it for all types of un-delete,
but maybe you want to take a look.

Original:

http://lde.sourceforge.net/UNERASE.txt

Here the contents is repeated:


	I imagine that most of the people initially using this package
will be the ones who have recently deleted something.  After all,
that's what finally inspired me to learn enough about the different
file systems to write this package.  Undelete under unix really isn't
that hard, it really only suffers the same problems that DOS undelete
does which is -- you can't recover data that someone else has just
overwritten.

	If you are quick and have very few users on your system there
is a good chance that the data will be intact and you can go ahead
with a successful undelete.  I don't recommend using this package to
undelete your /usr/bin directory or really any directory, but if you
have trashed a piece of irreplaceable code or data, undelete is where
it's at.  If you can reinstall or have recent backups I'd recommend
you try them.  But it's up to you, besides, sometimes playing with
lde/undelete for a while is a lot more fun than going back and
recoding a few hours worth of lost work.

	Before I tell you how to undelete stuff, have a look at
doc/minix.tex (or the ps or dvi version).  Even if you aren't using a
minix file system, read it carefully, it will get you used to the
terms and the general idea behind things here.

These are the steps for a successful undelete:

#########################  STEP ONE  ##################################

	Unmount the partition which has the erased file on it.  If you
want to, you can remount it read-only, but it isn't necessary.  

NOTE: lde does some checks to see if the file system is mounted, but
it does not check if it was mounted read-only.  Some functions will be
deactivated for any (read-only or read/write) mounted partition.

#########################  STEP TWO  ##################################

	Figure out what you want to undelete.  If you know what kind
of file you are looking for (tar file, compressed file, C file),
finding it will be a lot easier.  There are a few ways to look for
file data.

	lde supports a type search and a string search for data at the
beginning of a file.  Currently, the supported types include gz
(gzip), tgz (tarred gzip file), and script (those beginning with
"#!/").

---- EXAMPLE ----
String search (search for a PKzip file - starts with PK, -O 0 not required):
	lde -S PK -O 0 /dev/hda1 

String search (search for JPEG files - JIFF starts at byte 6):
	lde -S JIFF -O 6 /dev/hda1

Type search (search for a gzipped tar file):
	lde -T tgz /dev/hda1
-------------------

	When searching by type, you can also include the filename;
the desired pattern will be extracted from the file.  You should
specify an offest (-O) and length (-L) when using this option.  This
option was included to make generalized searches easier.  You can
find pattern, length, and offset information in /etc/magic which you
can use to generate your own template files, or specify lengths and
offsets so that existing files may be used as templates.

---- EXAMPLE ----
Type search (search for core file - see /etc/magic to determine -O/-L):
	lde -T /proc/kcore -O 216 -L 4 /dev/hda1
-----------------

If you add --recoverable to the command line, it will check to see if
another active inode uses any blocks in this inode.  If no blocks are
marked used by another inode, "recovery possible" will be printed.  If
blocks are used by another file "recovery NOT possible" will be
printed to the screen.  You may still be able to get some data back
even when it reports that recovery is not possible.  To get an idea of
how many blocks are in use, you will have to check its recoverablilty
from lde via its curses interface.

---- EXAMPLE ----
./lde --paranoid -T script --ilookup --recoverable /dev/hda5
---- OUTPUT  ----
Paranoid flag set.  Opening device "/dev/hda5" read-only.
User requested autodetect filesystem. Checking device . . .
Found ext2fs on device.
Match at block 0x107, check inode 0xB, recovery possible.
Match at block 0x421E7, no unused inode found.
-----------------

	When you run lde in these mode, it will report a block (and
inode if you are lucky and used the --ilookup flag) where a match was
found. Take this inode number and go to step (3).

	If lde doesn't report anything on its own, or the search
detailed above does not suit your needs, you can use grep to search
the partition for data and pipe it through lde which will attempt to
find a block and inode again.  The recommended procedure (all this can
go on one line, the '\' indicates continuation) is:

   grep -b SEARCH DEVICE | awk '{FS = ":" } ; {print $1 }' | \
	 lde ${LDE_OPT} --grep DEVICE

A shell script (crash_recovery/grep-inode) is included that will do
this for you.

   grep-inode [grep_options] search_string device

---- EXAMPLE ----
   grep-inode -i MyDevelopment.h /dev/hda1
-----------------

	If none of these search methods are productive, you can page
through the disk with an editor (emacs /dev/hda2) or the preferred
choice might be to page through it with lde.  Fire up lde and go into
block mode (hit 'b') then use PG_UP/PG_DN to flip through all the
blocks until you find one you like.  Hitting '^R' while displaying the
block will attempt to find an inode which references the block.

########################  STEP THREE  #################################

	If you have an inode number, things are looking good.  Go into
inode mode and display this inode.  Then hit 'R' (use capital 'R') to
copy the inode information to the recovery block list and enter
recovery mode.  Now hit 'R' again and lde will prompt you for a file
name (you can include a full path).  Make sure you write it to a FILE
SYSTEM OTHER THAN THE ONE WHICH THE DELETED FILE RESIDES ON or you
will probably overwrite it as you go.  One day, when lde supports disk
writes, it will be able to undelete the file to its original location,
but for now this is safer.

	The recovered file will be a little larger than the original
as the last block will be padded with zeroes (or whatever was on the
disk at the end of the last block).  If you did find an inode for the
deleted file, you can copy its old size to the new inode by using lde
to edit the two inodes (don't use lde's copy/paste as it will copy the
entire inode and undo all the work you just did to restore the file).

######################  OTHER OPTIONS  ################################

	If you were unable to find an intact inode, things are going
to be tough.  You will have to find all the blocks in the file in
order.  If your disk is relatively unfragmented, you can hopefully
find everything in order or close by at least.  Currently, you have to
tag all the direct blocks, then find the indirect blocks and tag them.

	If the indirect block was wiped or you are unable to find it,
you've got a lot of work to do.  You can copy individual blocks one at
a time to the recovery file by using 'w' in block mode.  Display the
next block in the file, hit 'w', then enter the filename (if you hit
enter, the last filename will be reused and the block will be appended
to the file).  lde will always ask if you want to append, overwrite,
or cancel when a file exists.  You can override this by setting the
append flag from the flags menu ('f' from most modes).

	If you find any type of indirect block, you can copy it to the
recovery inode in its corresponding position and recover a whole bunch
of blocks at once.  Leave the direct blocks filled with zeros.

	Another option is to use dd.  Real programmers still probably
use emacs and dd to hack a fs. ;) If you know there are a bunch (one
or more) of contiguous blocks on the disk, you can use the unix
command dd to copy them from the device to a file.

---- EXAMPLE ----
To copy blocks 200-299 from the device /dev/hda1 to /home/recover/file1:

   dd if=/dev/hda1 of=/home/recover/file1 bs=1024 count=100 skip=200

	if    input file or device
	of    output file or device
	bs    blocksize (will be 1024 for most linux fs's)
	count number of blocks to copy
	skip  number of blocks to skip from the start of the device 
-----------------

Read the dd man page for more info.

####################  ABOUT INDIRECT BLOCKS  ##########################

[ Mail from to an lde user ]

> 1 - install a routine that lets you read what the indirect blocks
> are pointing to in the chain, I mean, I know that file X has 2
> indirect blocks but what blocks do these point to and how do I find
> out?

        This is hard to describe, but if you have figured out how to
use inode mode any you are looking at the blocklist contained in that
inode (it should list all the direct blocks and the 1x, 2x, and 3x
indirect blocks), when you hit 'B' when the cursor is sitting on the
1x indirect block, it will take you to that block in block mode, then
each entry in that block (most likely each entry is 4 bits -- as in
the ext2 fs) points to another block in the chain.

I.E.

        INDIRECT BLOCK:   0x000200

   Now look at block 0x000200

       0000:   01 00 00 00 02 00 00 00 : 04 04 04 00 10 01 00 00

   This would indicate the the next 4 blocks in the file are

        0x00000001, 0x00000002, 0x00040404, 0x00000110

The same is true for double indirect blocks, but the double indirect
blocks contains pointers to more indirect block which you must then
look up as above.

That was a pretty lousy explaination, someday I do plan to add a
feature where you may view all the blocks in a file without doing the
indirect indexing yourself.  For now, lde is mostly a crutch for last
ditch efforts at file recovery, but I'm glad if people find other uses
for it.


#################  RECOVERING WITHOUT INODES  #######################

[ This is mail to a person who was unable to find an inode, it gives
  some last ditch suggestions before giving up. ]

        In a perfect world, or on a virgin disk, everything would
be sequential.  But with things like unix and (network) file sharing,
many people can write to the disk at the same time, so the blocks
can get interleaved.  Also depending on the free space situation of
the disk, the two free blocks may not exist sequentially on the disk.
Also, there are file "holes" in ext2 where there are block pointers of
zero on the disk.  Normally an indirect block would point to 256
direct blocks, but with zero entries it may be less than this.

        If things are perfect, here is how I imagine your disk is
layed out:

        Direct blocks 1-9: you already know where these are and they
                           are in that tiny recovery file (9k).  These
                           were not sequential, so it makes me wonder
                           if the rest of the bytes will be layed out
			   in order.

        Indirect block:    This takes up one block and ideally your
                           data would start right after it.
        256 blocks of data:
        2x indirect block: Should only have one entry, pointing to the
                           next block on the disk
        indirect block:    pointed to by the 2xindirect block
        88 blocks of data:

So my last ditch recommendation is to use dd to copy the blocks off
the disk and then cat all the dd'ed files together.

        0x5e65e - 0x5e660  |
        0x61a72            |
        0x5e661            +--  These are the direct blocks, you could
        0x61ad4            |    use the lde recovered file instead of
        0x5e662 - 0x5e664  |    dd + cat.

        0x5e665 - 0x5e764  - 256 blocks of data
        0x5e750 - 0x5e7a8  - 88 blocks of data

Things look bad becuse the numbers are out of sequence (those 256
blocks of data should end right before the 2x indirect block at 0x5e74
there's 0x10 blocks unaccounted for (maybe this is just some of the
ext2 file system data which is dispersed about the disk -- it could
fall anywhere in that data range if it's there).

        So try:

---- EXAMPLE ----
	lde (recover direct blocks to /home/recover/block1)
        dd if=/dev/sdb1 of=/home/recover/block2 bs=1024 count=256 skip=386661
        dd if=/dev/sdb1 of=/home/recover/block3 bs=1024 count=88  skip=386896
        cat block.1 block2 block3 > access_file.dos
-----------------

####################  TRIPLE INDIRECT BLOCKS  #########################

[ This is a response to one persons request for immediate help
  recovering a very large file -- the stuff about the triple block
  having _three_ entries was specific to this persons problem.  In general
  though, the triple indirect block will not have very many entries, so
  this method might be viable until I get things together and write in
  the triple indirect block support. ]

        lde allows you to append a single block to the recover file
(use 'w' from block mode) -- you can page through the triple indirect
blocks to figure out the block order and then write each block to the
recover file.  I.e. after piecing things together from the triple
indirect block, you should have a list of all the blocks in the file,
now display the first block on the screen, write it to the file,
display the second block, write it to the file . . . I really don't
think it's worth it for 145,000 blocks though.

        The semi-automated way to do this is to make some fake inodes.
The triple indirect inode should be pretty empty - maybe 3 entires.
Each of these entries points to a double indirect block.  Solution:

        1) Recover any direct/indirect/double indirect blocks in 
           the original inode to a file.  Do this with lde.

        2) Look at the triple indirect block.  It should have 3
           entries.  Write down the 3 double indirect blocks listed here.

        3) Use the recover mode fake inode, fill in all entires with
           zeroes.  Now fill in the 1st double indirect block that
           you wrote down in step 2 in the slot for the 2x indirect 
           block.

        4) Execute a recover, dump it to a file, say "file1".  Repeat
           step 3 with the other two double indirect inodes from step 2.

        5) Now you should have 4 files, catenate them all together and
           with any luck, it will un-tar.
 

----------------------------------------------------------------------------------------
Note 13:
----------------------------------------------------------------------------------------

>>> Some tools or info that might be usefull:


1. Midnight Commander 
  is GNU (free) software that runs on UNIX based operating systems. 
  At the time of writing, the undelete feature only works on ext2 filesystems.
  Midnight Commander can be obtained at http://www.ibiblio.org/mc/

2. Opensource forensic:
  http://www.opensourceforensics.org/tools/unix.html

3. R-Linux, recovery and undelete tool for Ext2 fs
   http://3d2f.com/tags/undelete/recover/unix/

4. http://foremost.sourceforge.net/
   Also take a look at
   Tom Pycke, Recovering Files in Linux, available at www.recover.source.net/linux

5. R-Linux 1.0
   Data Recovery and Undelete Tool for Ext2FS (Linux) file system. 
   http://www.supershareware.com/info/r-linux.html

6. Compunix AIX undelete tool:
   http://www.compunix.com/prod/analyse.html
   http://www.compunix.com/eval/list.html


7. Check out a tool called "Lazarus" which can work in combination with unrm

8. For Linux (ext2, ext3 fs) and Solaris (ufs fs)
   R-Tools technology: Undelete tool for Linux and Solaris:
   http://www.data-recovery-software.net/

9. Solaris undelete tools:

   -- Kernel Recovery for Solaris Sparc
   http://www.download.com/Kernel-Recovery-for-Solaris-Sparc/3000-2248_4-10578170.html
   http://www.download3k.com/Press-Launch-of-Kernel-Recovery-for-Solaris-SPARC.html
   http://www.tucows.com/preview/505583
   http://www.programurl.com/kernel-recovery-for-solaris-sparc.htm

   Nucleus Technologies.com: http://www.nucleustechnologies.com 

   -- Other Solaris Data Recovery Software:
   http://solaris-data-recovery-software.qarchive.org/


   R-Tools technology: Undelete tool for Linux and Solaris:
   http://www.data-recovery-software.net/

10. General info on undelete intentions on ext2 fs:
    http://amadeus.uprm.edu/~undelete/Presentacion.html

11. Patents on undelete feature in Unix (requires a change in how inodes are freed)
    http://www.patentstorm.us/patents/6615224.html
    http://www.freepatentsonline.com/6615224.html


###############################################################
4. OTHER STUFF:
###############################################################


----------------------------------------------------------------------------------------
Note 1:
----------------------------------------------------------------------------------------

Carefull in using "utilities" in removing accounts and other items.
The following story explains it all:


From: dbrillha@dave.mis.semi.harris.com (Dave Brillhart)
Organization: Harris Semiconductor

We can laugh (almost) about it now, but...

Our operations group, a VMS group but trying to learn UNIX, was assigned
account administration. They were cleaning up a few non-used accounts
like they do on VMS - backup and purge. When they came across the
account "sccs", which had never been accessed, away it went. The
"deleteuser" utility fom DEC asks if you would like to delete all
the files in the account. Seems reasonable, huh?

Well, the home directory for "sccs" is "/". Enough said :-(


(Note: funny story, but filemodes or permissions should actually make this impossible)


----------------------------------------------------------------------------------------
Note 2:
----------------------------------------------------------------------------------------

You already have seen some examples of using the dd and od commands. These commands are available on almost
all unix versions. They are extremely powerfull, and could be very dangerous also, if not used properly.
Because you can dump any diskblock, or blocks from tape, to any output, with possible conversion of data,
you might even recover data which would otherwise be considered as lost.

The following article is very instructive on how to use the dd command.


http://www.codecoffee.com/tipsforlinux/articles/036.html

>> How and when to use the dd command?  
 

In this article, Sam Chessman explains the use of the dd command with a lot of useful examples. This article is not aimed at absolute beginners. 
Once you are familiar with the basics of Linux, you would be in a better position to use the dd command. 

The ' dd ' command is one of the original Unix utilities and should be in everyone's tool box. It can strip headers, extract parts of 
binary files and write into the middle of floppy disks; it is used by the Linux kernel Makefiles to make boot images. 
It can be used to copy and convert magnetic tape formats, convert between ASCII and EBCDIC, swap bytes, and force to upper and lowercase. 


For blocked I/O, the dd command has no competition in the standard tool set. One could write a custom utility to do specific I/O or 
formatting but, as dd is already available almost everywhere, it makes sense to use it. 

Like most well-behaved commands, dd reads from its standard input and writes to its standard output, unless a command line specification 
has been given. This allows dd to be used in pipes, and remotely with the rsh remote shell command. 

Unlike most commands, dd uses a keyword=value format for its parameters. This was reputedly modeled after IBM System/360 JCL, 
which had an elaborate DD 'Dataset Definition' specification for I/O devices. A complete listing of all keywords is available from GNU dd with 

$ dd --help

Some people believe dd means ``Destroy Disk'' or ``Delete Data'' because if it is misused, a partition or output file can be trashed very quickly. 
Since dd is the tool used to write disk headers, boot records, and similar system data areas, misuse of dd has probably trashed 
many hard disks and file systems. 

In essence, dd copies and optionally converts data. It uses an input buffer, conversion buffer if conversion is specified, and an output buffer. 
Reads are issued to the input file or device for the size of the input buffer, optional conversions are applied, and writes are issued 
for the size of the output buffer. This allows I/O requests to be tailored to the requirements of a task. Output to standard error reports 
the number of full and short blocks read and written. 


Example 1


A typical task for dd is copying a floppy disk. As the common geometry of a 3.5" floppy is 18 sectors per track, two heads and 80 cylinders, 
an optimized dd command to read a floppy is: 

Example 1-a : Copying from a 3.5" floppy

dd bs=2x80x18b if=/dev/fd0 of=/tmp/floppy.image 
1+0 records in
1+0 records out 

The 18b specifies 18 sectors of 512 bytes, the 2x multiplies the sector size by the number of heads, and the 80x is for the cylinders--
a total of 1474560 bytes. This issues a single 1474560-byte read request to /dev/fd0 and a single 1474560 write request to 
/tmp/floppy.image, whereas a corresponding cp command 

cp /dev/fd0 /tmp/floppy.image


issues 360 reads and writes of 4096 bytes. While this may seem insignificant on a 1.44MB file, when larger amounts of data are involved, 
reducing the number of system calls and improving performance can be significant. 


This example also shows the factor capability in the GNU dd number specification. This has been around since before the Programmers Work Bench and, 
while not documented in the GNU dd man page, is present in the source and works just fine, thank you. 


To finish copying a floppy, the original needs to be ejected, a new diskette inserted, and another dd command issued to write to the diskette: 

Example 1-b : Copying to a 3.5" floppy
dd bs=2x80x18b < /tmp/floppy.image > /dev/fd0 
1+0 records in 
1+0 records out 

Here is shown the stdin/stdout usage, in which respect dd is like most other utilities. 


Example 2


The original need for dd came with the 1/2" tapes used to exchange data with other systems and boot and install Unix on the PDP/11. 
Those days are gone, but the 9-track format lives. To access the venerable 9-track, 1/2" tape, dd is superior. With modern SCSI tape devices, 
blocking and unblocking are no longer a necessity, as the hardware reads and writes 512-byte data blocks. 

However, the 9-track 1/2" tape format allows for variable length blocking and can be impossible to read with the cp command. The dd command allows 
for the exact specification of input and output block sizes, and can even read variable length block sizes, by specifying an input buffer size larger 
than any of the blocks on the tape. Short blocks are read, and dd happily copies those to the output file without complaint, simply reporting on the 
number of complete and short blocks encountered. 


Then there are the EBCDIC datasets transferred from such systems as MVS, which are almost always 80-character blank-padded Hollerith Card Images! 
No problem for dd, which will convert these to newline-terminated variable record length ASCII. Making the format is just as easy and dd again 
is the right tool for the job. 

Example 2 : Converting EBCDIC 80-character fixed-length record to ASCII variable-length newline-terminated record 
dd bs=10240 cbs=80 conv=ascii,unblock if=/dev/st0 of=ascii.out
40+0 records in
38+1 records out 

The fixed record length is specified by the cbs=80 parameter, and the input and output block sizes are set with bs=10240. 
The EBCDIC-to-ASCII conversion and fixed-to-variable record length conversion are enabled with the conv=ascii,noblock parameter. 


Notice the output record count is smaller than the input record count. This is due to the padding spaces eliminated from the output file and 
replaced with newline characters. 


Example 3


Sometimes data arrives from sources in unusual formats. For example, every time I read a tape made on an SGI machine, the bytes are swapped. 
The dd command takes this in stride, swapping the bytes as required. The ability to use dd in a pipe with rsh means that the tape device 
on any *nix system is accessible, given the proper rlogin setup. 

Example 3 : Byte Swapping with Remote Access of Magnet Tape
rsh sgi.with.tape dd bs=256b if=/dev/rmt0 conv=swab | tar xvf -


The dd runs on the SGI and swaps the bytes before writing to the tar command running on the local host. 


Example 4

Murphy's Law was postulated long before digital computers, but it seems it was specifically targeted for them. 
When you need to read a floppy or tape, it is the only copy in the universe and you have a deadline past due, that is when you will have a bad spot 
on the magnetic media, and your data will be unreadable. To the rescue comes dd, which can read all the good data around the bad spot and continue 
after the error is encountered. Sometimes this is all that is needed to recover the important data. 

Example 4 : Error Handling
dd bs=265b conv=noerror if=/dev/st0 of=/tmp/bad.tape.image 


Example 5


The Linux kernel Makefiles use dd to build the boot image. In the Alpha Makefile /usr/src/linux/arch/alpha/boot/Makefile, 
the srmboot target issues the command: 

Example 5 : Kernel Image Makefile
dd if=bootimage of=$(BOOTDEV) bs=512 seek=1 skip=1 

This skips the first 512 bytes of the input bootimage file (skip=1) and writes starting at the second sector of the $(BOOTDEV) device (seek=1). 
A typical use of dd is to skip executable headers and begin writing in the middle of a device, skipping volume and partition data. 
As this can cause your disk to lose file system data, please test and use these applications with care.

 
----------------------------------------------------------------------------------------
Note 3:
----------------------------------------------------------------------------------------


od Command


Purpose
Displays files in a specified format. 
dump files in octal and other formats


Syntax

To Display Files Using a Type-String to Format the Output
od [  -v ] [  -A AddressBase ] [  -N Count ] [  -j Skip ] [  -t TypeString ... ] [ File ... ] 

type is a string of one or more of the below type indicator characters. If you include more than one type indicator character 
in a single type string or use this option more than once, od writes one copy of each output line using each of the data types 
that you specified, in the order that you specified. 

a named character 
c ASCII character or backslash escape 
d signed decimal 
f floating point 
o octal 
u unsigned decimal 
x hexadecimal 
C char 
S short 
I int 
L long 
For floating point (f): 
F float 
D double 
L long double 


Examples:

>> To display a file in octal, a page at a time, enter: 

od a.out | pg

This command displays the a.out file in octal format and pipes the output through the pg command. 

>> To translate a file into several formats at once, enter: 

od -t cx a.out > a.xcd

This command writes the contents of the a.out file, in hexadecimal format ( x) and character format ( c), into the a.xcd file. 

>> To start displaying a file in the middle (using the first syntax format), enter: 

od -t acx -j 100 a.out

This command displays the a.out file in named character ( a), character ( c), and hexadecimal ( x) formats, starting from the 100th byte. 

>> To start in the middle of a file (using the second syntax format), enter: 

od -bcx a.out +100.

This displays the a.out file in octal-byte ( -b), character ( -c), and hexadecimal ( -x) formats, starting from the 100th byte. 
The . (period) after the offset makes it a decimal number. Without the period, the output would start from the 64th (100 octal) byte. 

% dir | od -c | more
% cat my_file | od -c |more
% od my_file |more
Comparison of different outputs:

>> Show 16 first characters from a binary file (/bin/sh) as ASCII characters or backslash escapes (octal):

% od -N 16 -c /bin/sh
output: 
0000000 177 E L F 001 001 001 \0 \0 \0 \0 \0 \0 \0 \0 \0

>> Show the same binary as named ASCII characters:

% od -N 16 -a /bin/sh
output:

0000000 del E L F soh soh soh nul nul nul nul nul nul nul nul nul

>> Show the same binary as short hexcadecimals:

% od -N 16 -t x1 /bin/sh
output:

0000000 7f 45 4c 46 01 01 01 00 00 00 00 00 00 00 00 00


>> Show the same binary as octal numbers:

% od -N 16 /bin/sh
output:

% 0000000 042577 043114 000401 000001 000000 000000 000000 000000


#############################################################################################
#############################################################################################
#############################################################################################


=====================================================================================
Secton 22. Oracle 10g/11g RAC.
=====================================================================================


/***************************************************************************/
/* Document     : Quick Intro: Oracle 10g/11g RAC                          */
/* Doc. Versie  : 3                                                        */
/* File         : ora10g11gRAC.txt                                         */
/* Date         : 11/10/2008                                               */
/* Content      : Elementary Introduction on Real Application Clusters     */ 
/* Compiled by  : Albert van der Sel                                       */ 
/***************************************************************************/


-----------------------------------------------------
REMARK:
This is a very simple, and incomplete, document, 
providing a birds-eye view on Oracle RAC technology.
-----------------------------------------------------


An "Oracle Real Application Cluster" (RAC), is about a clustered Oracle database.

If a RAC is properly set up, all the nodes (Servers) are active at the same time, acting
on the same one Database. This is very different from a failover Cluster.

Let's first take on a discussion about Cluster systems in general. See Section 1. The example system used
here is Linux, but the discussion about Cluster systems (Not targeted at RAC) here is "general" enough
to be of use. As from section 2, we will discuss RAC.


===============================================
1. Discussion about Cluster systems in general:
=============================================== 


1.1 Cluster Overview (in general):
----------------------------------


To set up a cluster, an administrator must connect the cluster systems (often referred to as member
systems) to the cluster hardware, and configure the systems into the cluster environment. The foundation
of a cluster is an advanced host membership algorithm. This algorithm ensures that the cluster
maintains complete data integrity at all times by using the following methods of inter-node communication:

� Quorum partitions on shared disk storage to hold system status
� Ethernet (and optional serial or other type of connections) between the cluster systems 
  for heartbeat channels

To make an application and data highly available in a cluster, the administrator must configure a 
"cluster service" � a discrete group of service properties and resources, such as an application and shared
disk storage. A service can be assigned an IP address to provide transparent client access to the service.
For example, an administrator can set up a cluster service that provides clients with access to
highly-available database application data.
Both cluster systems can run any service and access the service data on shared disk storage. 

However, each service can run on only one cluster system at a time, in order to maintain data integrity. 
Administrators can set up

- an "active-active" configuration in which both cluster systems run different services,
or 
- an "active-passive" (hot-standby) configuration in which a primary cluster system runs all the services, 
  and a backupcluster system takes over only if the primary system fails.

  NOTE:
  So this is actually a difference from Oracle 10g Real Application Cluster (RAC), where both instances,
  or multiple instances (from 2 - 100), accesses the single database on shared storage, at the same time !


Scetch of a 2-node Linux cluster


         ------------------------------------------ public network
             |                              |
             |                              |
        ------------                    -------------
        |cluster   |                    |cluster    |
        |system    |Ethernet            |system     |
        |          |--------------------|           |
        |          |heartbeat           |           |
        |          |                    |           |
        |          |____________        |           |
        |ServiceA  |  -----    -|---    |           |
        |ServiceB  |--|PWR|    |PWR|----|ServiceC   |
        |          |  -----    -----    |           |
        |          |    |_______________|           |
        |          |                    |           |
        ------------                    -------------
             | SCSI bus or Fible Channel      |
             ------------------  --------------
               Interconnect   |  |
                              |  |
Fig 1.1                   -----------
                          |Shared   |  - has Quorum partitions (or disks)
                          |Disk     |  - has partitions (or disks) for ServiceA, B, C
                          |Storage  |
                          ----------- 


Figure 1�1, shows an example of a cluster in an active-active configuration.
If a hardware or software failure occurs, the cluster will automatically restart the failed system�s services
on the functional cluster system. This service failover capability ensures that no data is lost,
and there is little disruption to users. When the failed system recovers, the cluster can re-balance the
services across the two systems.
In addition, a cluster administrator can cleanly stop the services running on a cluster system and then
restart them on the other system. This service relocation capability enables the administrator to maintain
application and data availability when a cluster system requires maintenance.

-- Service configuration framework:

Clusters enable an administrator to easily configure individual services to make data and applications
highly available. To create a service, an administrator specifies the resources used in the
service and properties for the service, including the service name, application start and stop script,
disk partitions, mount points, and the cluster system on which an administrator prefers to run the
service. After the administrator adds a service, the cluster enters the information into the cluster
database on shared storage, where it can be accessed by both cluster systems.
The cluster provides an easy-to-use framework for database applications. For example, a database
service serves highly-available data to a database application. The application running on a cluster
system provides network access to database client systems, such as Web servers. If the service
fails over to another cluster system, the application can still access the shared database data. A
network-accessible database service is usually assigned an IP address, which is failed over along
with the service to maintain transparent access for clients.
The cluster service framework can be easily extended to other applications, as well.

-- Multiple cluster communication methods:

To monitor the health of the other cluster system, each cluster system monitors the health of the
remote power switch, if any, and issues heartbeat pings over network and serial channels to monitor
the health of the other cluster system. In addition, each cluster system periodically writes a
timestamp and cluster state information to two (or more) quorum partitions located on shared disk storage.
System state information includes whether the system is an active cluster member. Service state
information includes whether the service is running and which cluster system is running the service.
Each cluster system checks to ensure that the other system�s status is up to date.
To ensure correct cluster operation, if a system is unable to write to both quorum partitions at
startup time, it will not be allowed to join the cluster. In addition, if a cluster system is not updating
its timestamp, and if heartbeats to the system fail, the cluster system will be removed from the
cluster.

If a hardware or software failure occurs, the cluster will take the appropriate action to maintain application
availability and data integrity. For example, if a cluster system completely fails, the other
cluster system will restart its services. Services already running on this system are not disrupted.
When the failed system reboots and is able to write to the quorum partitions, it can rejoin the
cluster and run services. Depending on how the services are configured, the cluster can re-balance
the services across the two cluster systems.

-- Manual service relocation capability:

In addition to automatic service failover, a cluster enables administrators to cleanly stop services
on one cluster system and restart them on the other system. This allows administrators to perform
planned maintenance on a cluster system, while providing application and data availability.

-- Event logging facility:

To ensure that problems are detected and resolved before they affect service availability, the cluster
daemons log messages by using the conventional Linux syslog subsystem. Administrators can
customize the severity level of the logged messages.

-- Application Monitoring:

The cluster services infrastructure can optionally monitor the state and health of an application. In
this manner, should an application-specific failure occur, the cluster will automatically restart the
application. In response to the application failure, the application will attempt to be restarted on
the member it was initially running on; failing that, it will restart on the other cluster member.

-- Status Monitoring Agent:

A cluster status monitoring agent is used to gather vital cluster and application state information.
This information is then accessible both locally on the cluster member as well as remotely. A
graphical user interface can then display status information from multiple clusters in a manner
which does not degrade system performance.


1.2 Just an example of a more detailed view of an almost "No single point of failure" 2-Node Clustered System:
--------------------------------------------------------------------------------------------------------------

                            ----------
                            |NETWORK |
         -------------------|SWITCH  |-----------------------
         |                  ----------                      | public network
         |                      |		            |               
 ---------------------          |		    ---------------------   
 |network interface  |      ----------		    |network interface  |   
 |--------------------      |terminal|		    |--------------------   
 |serial port        |------|server  |--------------|serial port        |
 |--------------------      ----------		    |--------------------   
 |CLUSTER            | 				    |CLUSTER            | 
 |SYSTEM             |				    |SYSTEM             |
 |--------------------	private network             |--------------------
 |network interface  |------------------------------|network interface  | 
 |--------------------				    |--------------------
 |serial port        |------------------------------|serial port        |
 |--------------------				    |--------------------
 |serial port        |-----------------\	    |                   |
 |--------------------   -----       -----	    |--------------------   
 |power plug         |---|PWR|       |PWR|----------|power plug         |
 |--------------------   -----       -----	    |--------------------   
 |                   |	   |			    |-------------------|
 |                   |	   -------------------------|serial port        |
 |--------------------				    |--------------------
 |SCSI adapter (T)   |				    |SCSI adapter (T)   |
 ---------------------				    ---------------------
       |                                                       |
       |                                                       |
       -----------           -----------------------------------
                 |           |
                 |           |                 (T)         (T)
             -------------------------------------------------------
             | Port A/in | Port B/in |    | Port A/Out| Port B/Out |
             |------------------------------------------------------
             |     |           |                |          |       |
             |  -------------------         --------------------   |
             |  |controller 1     |         |controller 2      |   |
             |  -------------------         --------------------   |
             |          |                            |             | 
             |          |                            |             |
  RAID       |         ( )                          ( )            |
             |          |                            |             |
             |         ( )                          ( )            |
             |                                                     |
             |    mirrored shared disks                            |
             -------------------------------------------------------


=====================================================================================
2. Overview of the architecture of a Single Oracle Instance compared to a RAC system:
=====================================================================================

Let's first take a birds-eye overview of a single Instance architecture, compared to RAC
architecture.


2.1 Single Instance:
--------------------

If you look at a (traditional) Single Server where a single Oracle Instance (8i, 9i, 10g, 11g) is involved,
you would see the following situation.


>>>>> Files:

You can find a number of database files, residing on a disksystem, amongst others are:

 . system.dbf:    # this contains the dictionary (users, grants, table properties, packages etc..)
 . undo.dbf:      # this contains "undo/rollback" information about all modifying SQL statements, and
                    thus containing the "former situation" before transactions are committed to the DB.
 . redo logs:     # in case of a crash, these write ahead logs can be used to redo committed transactions 
                    that which were not written to the datafiles yet, but were logged in the redo logs.
 
 . user defined   # These are data files, organized in the logical concept of "tablespaces".
   data and index   These tablespaces contain the tables (and indexes) 
   tablespaces:
                   

Note: a tablespace consist of one or more files. To the Operating system, there are only files
to be concerned of, but from the Database perspective, the DBA can create a logical entity 
called "tablespace", consisting of possibly multiple files, possibly distributed over multiple disks.
Then, if the DBA then creates a table (or index), he or she should specify a tablespace, and thereby
distributing the (future) tablecontent over multiple files, which might increase I/O performance.
So, the DBA might create tablespaces with names like for example "DATA_BIG", "DATA_SMALL", 
"INDEX_BIG" etc.. 


>>>>> Memory structure and processes:

The Instance gets created in memory when the DBA (or the system) "starts the database". 
Starting the database means that a number of processes gets active, and that a rather complex shared 
memory area gets created. This memory area is called the SGA (System Global Area) and contains some
buffers and other pools, of which the following are most noticable:

SGA contains:

buffer cache	: datablocks from disk, are cached in this buffer. Most of this cached data
                  are blocks from tables and indexes.
log buffer	: small memory area which contains modified data which is about to be written
                  to the redologs
Shared pool	: All used SQL queries and procedures are cached in this pool
Library cache	: The systems metadata is cached in this structure

By the way, an Oracle Instance can be highly configured by a configuration file (traditionally that is
the file "init.ora" which is an ascii file and can be edited to adjust values. The modern variant of "init.ora"
is a binary file "spfile.ora".).
Some of the parameters in that file, determine the sizes of the different caches and pools.
For example, here is a section that determines the SGA of a small database:

db_cache_size        =268435456
java_pool_size       = 67108864
shared_pool_size     = 67108864
java_pool_size       = 67108864
streams_pool_size    = 67108864 


So, "an instance" is ofcourse not synonym to the database files on disk, but is really the "stuff" that
gets loaded or get created in memory.
After an Oracle Database has started, a number of processes are running, among which the most notable are:

pmon	: process monitor
smon	: system monitor
chkpt	: checkpoint process
dbwr	: database writer process
lgwr	: the process that writes the redologs


2.2 Scetch of a 2-node RAC Architecture:
----------------------------------------

+: network (with example IP addresses in picture)              user pc's/terminals
-: Fiber, or other Storage connection                          [ ] [ ] [ ]
                                                                +   +   +
                                                                +   +   +
                (subnet 1: 192.168.1 ) Public Network           +   +   +
        ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
                 +                                                       +   
                 +192.168.1.1 (vip=192.168.1.100)           192.168.1.2  + (vip=192.168.1.101)
                 +                                                       +
   Server Node:  +                                                       +
  ==================					   Server Node:  +       
  |Server A        |					  ==================  |--> SGA, and
  |--------------- |					  |Server B        |  |    processes like
  |Oracle          | Fiber				  |--------------- |  |    pmon, smon etc..
  |Instance A      |-----				--|Oracle          |<--
  |----------------|    |				| |Instance B      |
  |Listener A      |    |				| |----------------|
  |--------------- |    |				| |Listener B      | If ASM is used as a shared
  |-local rdbms    |    |				| |--------------- | storage for the database,
  | binaries etc.in|    |				| |-local rdbms    | each node also has an
  | (ORACLE_HOME)  |    |        (subnet 2: 192.168.2)  | | binaries etc.in| ASM Instance.
--|-CRS: Cluster   |               Private Network        | (ORACLE_HOME)  | 
| | ready services |++++++++++++++++++++++++++++++++++++++|-CRS: Cluster   |---
| | (CRS_HOME)     | 192.168.2.1              192.168.2.2 | ready services |  |
| -----------------          	                          | (CRS_HOME)     |  |
| |possibly vendor |    |    	                        | |-----------------  |
| |Clusterware     |    |                               | |possibly vendor |  |
| ==================    |      Shared Disks:		| |Clusterware     |  |
|  vendor               |    	----------------------	| ==================  |
|  clusterware          |--<>--	( 1 Shared Database: )<>|                     |
|  is not needed        |      	(system.dbf          )  |                     |
|  but might be in      |      	(temp.dbf            )  |                     |
|  Extended             |      	(users.dbf           )  |                     |
|  (long distance) RAC  |      	(data tablespaces    )  |                     |
|                       |      	(index tablespaces   )  |                     |
|                       |      	(etc..               )  |                     |
|                       |      	(------------------- )  |                     |
|			|---<>--(private redolog(s) A)  |                     |
|                       |---<>--(private undo.dbf   A)  |                     |
|                              	(private redolog(s) B)<>|                     |
|                              	(private undo.dbf   B)<>|                     |
|				(====================)                        |
----------------------<>--------( OCR:Oracle Cluster )--------<>--------------
				(      Registry      )
				(- Voting Disk       )
				----------------------


2.3 Overview RAC:
-----------------


RAC Architecture Overview


Let's begin by reviewing the structure of a Real Applications Cluster. Physically, a RAC consists 
of several nodes (servers), connected to each other by a private interconnect, which most of the time will be
a "private" Ethernet. 
The database files are kept on a shared storage subsystem, where they're accessible to all nodes. And each node has 
a public network connection. 

A cluster is a set of 2 or more machines (nodes) that share or coordinate resources to perform the same task. 
A RAC system is 2 or more instances running on a set of clustered nodes, with all instances accessing 
a shared set of database files (one Database). 
Depending on the O/S platform, a RAC database may be deployed on a cluster that uses vendor clusterware 
plus Oracle's own clusterware (Cluster Ready Services, CRS), or on a cluster that solely uses 
Oracle's own clusterware.
Thus, every RAC sits on a cluster that is running Cluster Ready Services. srvctl is the primary tool DBAs use 
to configure CRS for their RAC database and processes.


-- Cache Fushion:

Each cluster database instance in an Oracle RAC cluster uses its own memory structures and background processes. 
Oracle RAC uses Cache Fusion to synchronize the data stored in the buffer cache of each cluster database instance. 
Cache Fusion moves current data blocks (which reside in memory) between database instances, rather than having one database instance 
write the data blocks to disk and requiring another database instance to reread the data blocks from disk. When a data block located in the 
buffer cache of one instance is required by another instance, Cache Fusion transfers the data block directly between the instances 
using the interconnect, enabling the Oracle RAC database to access and modify data as if the data resided in a single buffer cache.

The CRS processes deal with cluster management, failover, services, OCR, Voting Disk etc..
But "the real cache fushion", that is Inter-instance block management, and lock management,
that is dealt with by a number of specialized Oracle background processes.
So, if an Instance reads a block from disk, then maybe a usersession will modify rows within that block.
Another Instance may request the same block, because an other user session connected to this second Instance,
wants to read (or modify) rows from that same block as well.

Single Instance:

> You probably know that a Single Instance Oracle database, consists of shared memory, where we can distinquish
certain areas. This shared memory is called the SGA (System Global Area), which "contains" the 
Buffer cache (diskblocks cached from disk), the shared pool (parsed sql, plsql codes), and a number of other area's.
like the large pool and the java pool.

>Next to this shared memory, a number of background processes make up the Instance, most notably are

pmon 
smon  
dbw0  
lgwr  
ckpt

and a number of other processes are running in a Single Instance.

If you have installed a RAC database, you have two or more Instances running on two ore more Nodes.
In this case, a number of additional structures and background processes deal with all the aspects of resource and
lock management that can occur if two or more instances wants to access the same blocks.
So, this in NOT the CRS processes who deals with that, but instead here we are talking of processes belonging
to the Oracle kernel.
In RAC, as more than one instance is accessing the resource, the instances require better coordination, at
the resource level (database objects, blocks, locks).
Sure, in RAC, the buffer cache of one Instance may contain data that is requested by an Instance at another node.

In former versions of RAC, namely Oracle Parallel Server (e.g. OPS in Oracle 8i), this management framework 
was called DLM (Distributed Lock Management).
In RAC 10g, 11g, this framework is made up by the Global Cache Services (GCS), the Global Enqueue Service (GES), 
and the Global Resource Directory (GRD).
The datastructures in memory distributed among all nodes (GRD), along with the specialized RAC background processes, collaborate
to enable "Cache Fushion".

In RAC Instances, we can expect, among others, to see the following additional background processes:

lms	Global Cache Services process
lmon	Global Enqueue Services Monitor
lmd	Global Enqueue Services daemon
lck0	Instance Enqueue process

When datablocks are requested from users attached to different instances, a lot of "tracking" and "synchronization" needs to be done
by, or with the aid of, the GCS, GES and GRD.

GCS:
Global Cache Service (GCS) is the main component of Oracle Cache Fusion technology. This is represented by background process LMSn. 
There can be max 10 LMS process for an instance. The main function of GCS is to track the status and location of data blocks. 
Status of data block means the mode and role of data block. GCS is the main mechanism by which cache coherency among �multiple cache� 
is maintained. GCS is also responsible for block transfer between the instances

GES:
Global Enqueue Service (GES) tracks the status of all Oracle enqueuing mechanism. This involves all non-cache fusion intra instance operations. 
GES performs concurrency control on dictionary cache locks, library cache locks and transactions. If performs this operation for resources 
that are accessed by more then once instance.
Enqueue services are also present in single instance database. These are responsible for locking the rows on a table using different locking modes.

GRD:
GES and GCS together maintains Global Resource Directory (GRD). GRD is like a in-memory database which contains details about all 
the blocks that are present in cache. GRD know what is the location of latest version of block, what is the mode of block, 
what is the role of block etc. When ever a user ask for any data block GCS gets all the information from GRD. 
GRD is a distributed resource, meaning that each instance maintain some part of GRD. This distributed nature of GRD is a key 
to fault tolerance of RAC. GRD is stored in the SGA of each Instance..


-- Cluster Ready Services and the OCR

Cluster Ready Services, or CRS, is a new feature for 10g RAC. Essentially, it is Oracle's own clusterware. 
On most platforms, Oracle supports vendor clusterware; in these cases, CRS interoperates with the vendor 
clusterware, providing high availability support and service and workload management. On Linux and Windows clusters, 
CRS serves as the sole clusterware. In all cases, CRS provides a standard cluster interface that is consistent 
across all platforms.

CRS consists of four processes (crsd, occsd, evmd, and evmlogger) and two disks (partitions): 
the Oracle Cluster Registry (OCR), and the voting disk. 

The CRSD manages the HA functionality by starting, stopping, and failing over the application resources 
and maintaining the profiles and current states in the Oracle Cluster Registry (OCR) whereas the OCSSD 
manages the participating nodes in the cluster by using the voting disk. The OCSSD also protects against 
the data corruption potentially caused by "split brain" syndrome by forcing a machine to reboot. 


-- CRS Processes

About those processes, we can show you how they run, and how they are started, on a unix system.

CRS consists of four processes (on most platforms: oprocd, crsd, occsd, evmd) and two disks: 
the Oracle Cluster Registry (OCR), and the voting disk. 

On most platforms, you may see the following processes:

oprocd	the Process Monitor Daemon
crsd	Cluster Ready Services Daemon (CRSD)
occsd	Oracle Cluster Synchronization Service Daemon
evmd	Event Volume Manager Daemon

Oracle CRS is Oracle's own clusterware tightly coupled with Oracle Real Application Clusters (RAC). CRS must be installed prior 
to the installation of Oracle RAC. It can also work over any third-party clustering software but there is no longer 
a requirement to buy and deploy such software. 

In short, Oracle CRS is primarily responsible for managing the high-availability (HA) architecture of Oracle RAC with the help 
of Cluster Ready Services Daemon (CRSD), Oracle Cluster Synchronization Server Daemon (OCSSD) and the Event Manager Daemon (EVMD). 
The CRSD manages the HA functionality by starting, stopping, and failing over the application resources and maintaining the profiles 
and current states in the Oracle Cluster Registry (OCR) whereas the OCSSD manages the participating nodes in the cluster 
by using the voting disk. The OCSSD also protects against the data corruption potentially caused by "split brain" syndrome 
by forcing a machine to reboot. 

Although Oracle CRS replaces the Oracle Cluster Manager (ORACM) in Oracle9i RAC, it does continue support for the Global Services Daemon (GSD), 
which in Oracle9i is responsible for communicating with the Oracle RAC database. In Oracle 10g, GSD's sole purpose is to serve Oracle9i 
clients (such as SRVCTL, Database Configuration Assistant, and Oracle Enterprise Manager). Financially, this is a very positive benefit 
since one is not bound to buy new client licenses and hardware to support an Oracle 10g database. 


To start and stop CRS when the machine starts or shutdown, on unix there are rc scripts in place.

You can also, as root, manually start, stop, enable or disable the services with:

/etc/init.d/init.crs start
/etc/init.d/init.crs stop
/etc/init.d/init.crs enable
/etc/init.d/init.crs disable

Or with

# crsctl start crs
# crsctl stop crs
# crsctl enable crs
# crsctl disable crs

On a unix system, you may find the following in the /etc/inittab file.

# cat /etc/inittab | grep crs
h3:35:respawn:/etc/init.d/init.crsd run > /dev/null 2>&1 </dev/null

# cat /etc/inittab | grep evmd
h1:35:respawn:/etc/init.d/init.evmd run > /dev/null 2>&1 </dev/null

# cat /etc/inittab | grep css
h2:35:respawn:/etc/init.d/init.cssd fatal > /dev/null 2>&1 </dev/null

/etc/init.d> ls -al *init*
init.crs
init.crsd
init.cssd
init.evmd

# cat /etc/inittab
..
..
h1:35:respawn:/etc/init.d/init.evmd run > /dev/null 2>&1 </dev/null
h2:35:respawn:/etc/init.d/init.cssd fatal > /dev/null 2>&1 </dev/null
h3:35:respawn:/etc/init.d/init.crsd run > /dev/null 2>&1 </dev/null


-- CRS logs:

Locating the Oracle Clusterware Alert Log
Oracle Clusterware posts alert messages when important events occur. For example, you might see alert messages from 
the Cluster Ready Services (CRS) daemon process when it starts, if it aborts, if the failover process fails, 
or if automatic restart of a CRS resource failed.

The location of the Oracle Clusterware log file is 

CRS_home/log/hostname/alerthostname.log, 

where CRS_home is the directory in which Oracle Clusterware was installed and hostname is the host name of the local node.

Oracle RAC uses a unified log directory structure to store all the Oracle Clusterware component log files. This consolidated structure 
simplifies diagnostic information collection and assists during data retrieval and problem analysis.

>>> The log files for the CRS daemon, crsd, can be found in the following directory:

CRS_home/log/hostname/crsd/

>>> The log files for the CSS deamon, cssd, can be found in the following directory:

CRS_home/log/hostname/cssd/

>>> The log files for the EVM deamon, evmd, can be found in the following directory:

CRS_home/log/hostname/evmd/

>>> The log files for the Oracle Cluster Registry (OCR) can be found in the following directory:

CRS_home/log/hostname/client/

>>> The log files for the Oracle RAC high availability component can be found in the following directories:

CRS_home/log/hostname/racg/
$ORACLE_HOME/log/hostname/racg


-- Enabling Debugging for an Oracle Clusterware Resource
You can use crsctl commands to enable resource debugging using the following syntax, where resource_name is the name 
of an Oracle Clusterware resource, such as ora.docrac1.vip, and debugging_level is a number from 1 to 5:

# crsctl debug log res resource_name:debugging_level

-- Running the Oracle Clusterware Diagnostics Collection Script

Run the diagcollection.pl script as the root user to collect diagnostic information from an Oracle Clusterware installation. 
The diagnostics provide additional information so that Oracle Support Services can resolve problems. Run this script from 
the operating system prompt as follows, where CRS_home is the home directory of your Oracle Clusterware installation:

# CRS_home/bin/diagcollection.pl --collect

This command displays the status of the Cluster Synchronization Services (CSS), Event Manager (EVM), 
and the Cluster Ready Services (CRS) daemons.


-- CRS manages the following resources: 

. The ASM instances on each node (for an explanation of ASM, see section 4)
. Databases 
. The instances on each node 
. Oracle Services on each node 
. The cluster nodes themselves, including the following processes, or "nodeapps":
  . VIP 
  . GSD 
  . The listener 
  . The ONS daemon

CRS stores information about these resources in the OCR. If the information in the OCR for one of these 
resources becomes damaged or inconsistent, then CRS is no longer able to manage that resource. 
Fortunately, the OCR automatically backs itself up regularly and frequently.


10g RAC (10.2) uses, or depends on,:

- Oracle Clusterware (10.2), formerly referred to as CRS "Cluster Ready Services" (10.1).
- Oracle's optional Cluster File System OCFS (This is optional), or use ASM and RAW.
- Oracle Database extensions

RAC is "scale out" technology: just add commodity nodes to the system.
The key component is "cache fusion". Data are transferred from one node
to another via very fast interconnects. 
Essential to 10g RAC is a "Shared Cache" technology.

Automatic Workload Repository (AWR) plays a role also.  The Fast Application Notification (FAN) mechanism
that is part of RAC, publishes events that describe the current service level being provided
by each instance, to AWR. The load balancing advisory information is then used to determine
the best instance to serve the new request.

. With RAC, ALL Instances of ALL nodes in a cluster, access a SINGLE database.
. But every instance has it's own UNDO tablespace, and REDO logs.

The Oracle Clusterware comprise several background processes that facilitate cluster operations.
The Cluster Synchronization Service CSS, Event Management EVM, and Oracle Cluster components
communicate with other cluster components layers in the other instances within the same 
cluster database environment.


Questions per implementation arise in the following points:
. Storage
. Computer Systems/Storage-Interconnect
. Datbase
. Application Server
. Public and Private networks
. Application Control & Display

On the Storage level, it can be said that 10g RAC supports
- Automatic Storage Management (ASM)
- Oracle Cluster File System (OCFS)
- Network File System (NFS) - limited (only theoretical actually, except for 11g)
- Disk raw partitions
- Third party cluster file systems, like GPFS

For application control and tools, it can be said that 10g RAC supports
- OEM Grid Control     http://hostname:5500/em
  OEM Database Control http://hostname:1158/em
- "svrctl" is a command line interface to manage the cluster configuration,
   for example, starting and stopping all nodes in one command.
- Cluster Verification Utility (cluvfy) can be used for an installation and sanity check.

Failure in Client connections:

Depending on the Net configuration, type of connection, type of transaction etc.., 
Oracle Net services provides a feature called "Transparant Application Failover" 
which can fail over a client session to another backup connection.

About HA and DR:

- RAC is HA       , High Availability, that will keep things Up and Running in one site.
- Data Guard is DR, Disaster Recovery, and is able to mirror one site to another remote site.


2.4 Storage with RAC:
---------------------

We have the following Database storage options:

Raw			Raw devices, no filesystem present
ASM			Automatic Storage Management 
Third party CFS		Vendor's Cluster File System 
OCFS			Oracle Cluster File System 
LVM			Logical Volume Manager 
NFS			Network File System (must be on certified NAS device) 

Storage					Oracle Clusterware OCR and Voting Disk	Database Recovery area
--------------				--------------------------------------	-------- -------------
Automatic Storage Management 		No 					Yes 	 Yes 
Cluster file system (OCFS or Other)	Yes		 			Yes 	 Yes 
Shared raw storage 			Yes 					Yes 	 No 


Here is a description about file types. A regular single-instance database has three basic types of files: 

1. database software and dump files (alertlog, trace files and that stuff); 
2. datafiles, spfile, control files and log files, often referred to as "database files"; 
3. and it may have recovery files, if using RMAN. 
   and, in case of RAC:
4. A RAC database has an additional type of file referred to as "CRS files". These consist of the 
   Oracle Cluster Registry (OCR) and the voting disk. 

Not all of these files have to be on the shared storage subsystem. The database files and CRS files 
must be accessible to all instances, so these *must be* on the shared storage subsystem. 

The database software can be on the shared subsystem and shared between nodes; or each node can have 
its own ORACLE_HOME. The flash recovery area must be shared by all instances, if used. 

Some storage options can't handle all of these file types. To take an obvious example, the database software 
and dump files can't be stored on raw devices. This isn't important for the dump files, 
but it does mean that choosing raw devices precludes having a shared ORACLE_HOME on the shared storage device. 

Remarks:

1.
On a particular platform, there might exist a vendor specific solution for shared storage.
For example, on AIX it is usually IBM GPFS that is used as a shared file system. But for
this platform you might also use SFRAC of Veritas.

VERITAS Storage Foundation for Oracle Real Application Clusters (SFRAC) provides an integrated solution stack 
for using clustered filesystems with Oracle RAC on AIX, as an alternative to using raw logical volumes, 
Automatic Storage Management (ASM) or the AIX General Parallel Filesystem (GPFS). 

If your OS is Linux Redhat, then investigate your options with the Redhat Global FileSystem GFS.


2.
SAN solutions:

And as far as SAN, there's no inherent SAN protocol that allows for 
block-level locking between hosts.  Your clustered filesystem is 
responsible for providing that.


========================================================
3. Oracle 10g RAC Installation example on Redhat Linux:
========================================================


This section shows how to install 10g RAC on a couple of Linux machines. But the method used,
represents an installation on any platform. For the most part, on all platforms the installation
is the same.


3.1 Prepare your nodes:
-----------------------


3.1.1 Scetch of a 2-node Linux cluster

			192.168.2.0
         ---------------------------------------------- public network 
             |                                 |
  Server A   |                    Server B     |
        ------------                       -------------
        |InstanceA |Private network        |InstanceB  |
        |          |Ethernet (interconnect)|           |
        |          |-----------------------|           |
        |          |192.168.1.0            |           |
        |          |                       |           |
        |          |____________           |           |
        |          |  -----    -|---       |           |
        |          |--|PWR|    |PWR|-------|           |
        |          |  -----    -----       |           |
        |          |    |__________________|           |
        |          |                       |           |
        ------------                       -------------
             | SCSI bus or Fible Channel       |
             ------------------  ---------------
                              |  |
                              |  |
                          -----------
                          |Shared   |  - has Single DB on ASM, or OCFS (or other Cluster FS), or RAW
                          |Disk     |  - has OCR and Voting disk on OCFS (or other Cluster FS), or RAW
                          |Storage  |
                          ----------- 


3.1.2 Storage Options

Storage					Oracle Clusterware	Database	Recovery area
--------------				------------------	--------	-------------
Automatic Storage Management 		No 			Yes 		Yes 
Cluster file system (OCFS) 		Yes 			Yes 		Yes 
Shared raw storage 			Yes 			Yes 		No 


In the following, we will do an example installation on 3 nodes.


3.1.3 Install Redhat on all nodes with all options.

3.1.4 create oracle user and groups dba, oinstall on all nodes.
      Make sure they all have the same UID and GUI.

3.1.5 Make sure the user oracle has an appropriate .profile or .bash_profile

3.1.6 Every node needs a private network connection and a public network connection (at least
      two networkcards).

3.1.7 Linux kernel parameters:

Most out of the box kernel parameters (of RHELS 3,4,5) are set correctly for Oracle
except a few.

You should have the following minimal configuration:

net.ipv4.ip_local_port_range	1024  65000
kernel.sem			250  32000  100  128
kernel.shmmni			4096
kernel.shmall			2097152
kernel.shmmax			2147483648
fs.file-max			65536


You can check the most important parameters using the following command:

# /sbin/sysctl -a | egrep 'sem|shm|file-max|ip_local'

net.ipv4.ip_local_port_range = 1024  65000
kernel.sem = 250  32000  100  128
kernel.shmmni = 4096
kernel.shmall = 2097152
kernel.shmmax = 2147483648
fs.file-max = 65536

If some value should be changed, you can change the "/etc/sysctl.conf" file and run the "/sbin/sysctl -p" command
to change the value immediately.
Every time the system boots, the init program runs the /etc/rc.d/rc.sysinit script. This script contains 
a command to execute sysctl using /etc/sysctl.conf to dictate the values passed to the kernel. 
Any values added to /etc/sysctl.conf will take effect each time the system boots. 
 

3.1.8 make sure ssh and scp are working on all nodes without asking for a password.
      Use shh-keygen to arrange that.


3.1.9 Example "/etc/host" on the nodes:

Suppose you have the following 3 hosts, with their associated public and private names:

public  private
oc1	poc1
oc2	poc2
oc3	poc3

Then this could be a valid "/etc/hosts" file on the nodes: 

127.0.0.1	localhost.localdomain	localhost

192.168.2.99	rhes30
192.168.2.166	oltp
192.168.2.167	mw

192.168.2.101	oc1	#public1
192.168.2.179	voc1	#virtual1
192.168.1.101	poc1	#private1

192.168.2.102	oc2	#public2
192.168.2.177	voc2	#virtual2
192.168.1.102	poc2	#private2

192.168.2.103	oc3	#public3
192.168.2.178	voc3	#virtual3
192.168.1.103	poc3	#private3


3.1.10 Example disk devices

On all nodes, the shared disk devices should be accessible through the same devices names.

Raw Device Name		Physical Device Name	Purpose
/dev/raw/raw1		/dev/sda1		ASM Disk 1: +DATA1
/dev/raw/raw2		/dev/sdb1		ASM Disk 1: +DATA1
/dev/raw/raw3		/dev/sdc1		ASM Disk 2: +RECOV1
/dev/raw/raw4		/dev/sdd1		ASM Disk 2: +RECOV1
/dev/raw/raw5		/dev/sde1		OCR Disk    (on RAW device)
/dev/raw/raw6		/dev/sdf1		Voting Disk (on RAW device)

So as you can see, we use a combination of ASM (for database files and recovery area),
and RAW (for the OCR and Voting Disk).


3.2 CRS installation:
---------------------

3.2.1 First install CRS in its own home directory

First install CRS in its own home directory, e.g. CRS10gHome, apart from the Oracle home dir.
In fact, you NEED to install CRS first, before installing any Oacle RDBMS software.

The special "thing" here, is that you perform the installation from one node, and that the setup program
will in fact install CRS on all three nodes. Ofcourse, given that scp and shh works OK on all nodes, and
that the accounts are all the same.
But, you still need to run a few scripts on all individual nodes. 


As Oracle user:

./runInstaller

 ---------------------------------------------------
 |                                                 |  Screen 1
 |Specify File LOcations                           |
 |                                                 |
 |Source                                           |
 |Path: /install/crs10g/Disk1/stage/products.xml   |
 |                                                 |
 |Destination                                      |
 |Name: CRS10gHome                                 |
 |Path: /u01/app/oracle/product/10.1.0/CRS10gHome  |
 |                                                 |
 ---------------------------------------------------


 ---------------------------------------------------
 |                                                 |  Screen 2
 |Cluster Configuration                            |
 |                                                 |
 |Cluster Name: lec1                               |
 |                                                 |
 | Public Node Name            Private Node Name   |
 | ---------------------------------------------   |
 | |oc1                 | p0c1                  |  |
 | |--------------------------------------------   |
 | |oc2                 | p0c2                  |  |
 | |--------------------------------------------   |
 | |oc3                 | poc3                  |  |
 | |--------------------------------------------   |
 ---------------------------------------------------

In the next screen, you specify which of your networks is to be used as
the public interface (to connect to the public network) and which will be used
for the private interconnect to support cache fushion and the cluster heartbeat.

 ---------------------------------------------------
 |                                                 |  Screen 3
 |Private Interconnect Enforcement                 |
 |                                                 |
 |                                                 |
 |                                                 |
 | Interface Name   Subnet          Interface type |
 | ---------------------------------------------   |
 | |eth0           |192.168.2.0   |Public      |   |
 | |--------------------------------------------   |
 | |eth1           |192.168.1.0   |Private     |   |
 | |--------------------------------------------   |
 |                                                 |
 ---------------------------------------------------

In the next screen, you specify /dev/raw/raw5 as the raw disk for the Oracle Cluster Registry.

 ---------------------------------------------------
 |                                                 |  Screen 4
 |Oracle Cluster Registry                          |
 |                                                 |
 |Specify OCR Location: /dev/raw/raw5              | (you are able to specify one extra mirror location of the OCR)
 |                                                 |
 ---------------------------------------------------

In a similar fashion you specify the location of the Voting Disk.

 ---------------------------------------------------
 |                                                 |  Screen 5
 |Voting Disk                                      |
 |                                                 |
 |Specify Voting Disk: /dev/raw/raw6               | (you are able to specify two extra locations 
 |                                                 | for copies of the VD)
 ---------------------------------------------------

You now have to execute the /u01/app/oracle/orainventory/orainstRoot.sh script
on all Cluster Nodes as the root user.

After this, you can continue with the other window, and see an "Install Summary" screen.
No you click "Install" and the installation begins.
Apart from the node you work on, the software will also be copied to the other nodes as well.

After the installation is complete, you are once again prompted to run a script as root
on each node of the Cluster.
This is the script "/u01/app/oracle/product/10.1.0/CRS10gHome/root.sh".


-- The olsnodes command.

After finishing the CSR installation, you can verify that the installation completed successfully
by running on any node the following command:

# cd /u01/app/oracle/product/10.1.0/CRS10gHome/bin
# olsnodes -n
oc1   1
oc2   2
oc3   3


3.3 Database software installation:
-----------------------------------

You can install the database software into the same directory in each node.
With OCFS2, you might do one install in a common shared directory for all nodes.

Because CSR is already running, the OUI detects that, and because its cluster aware, it
provides you with the options to install a clustered implementation.

You start the installation by running ./runInstaller as the oracle user on one node.
For most part, it looks the same as a single-instance installation.

After the file location screen, that is source and destination, you will see this screen:

 ---------------------------------------------------
 |                                                 |  
 |Specify Hardware Cluster Installation Mode       |
 |                                                 |
 | o Cluster installation mode                     |
 |                                                 |
 |  Node name                                      |
 |  ---------------------------------------------  |
 |  | [] oc1                                    |  |
 |  | [] oc2                                    |  |
 |  | [] oc3                                    |  |
 |  ---------------------------------------------  |
 |                                                 |
 | o Local installation (non cluster)              |
 |                                                 |
 |-------------------------------------------------|

Most of the time, you will do a "software only" installation, and create the database later
with the DBCA.

For the first node only, after some time, the Virtual IP Configuration Assistant, VIPCA, will start.
Here you can configure the Virtual IP adresses you will use for application failover
and the Enterprise Manager Agent.
Here you will select the Virtual IP's for all nodes.
VIPCA only needs to run once per Cluster.


3.4 Creating the RAC database with DBCA:
----------------------------------------

Launching the DBCA for installing a RAC database is much the same as launching DBCA for a single instance.
If DBCA detects cluster software installed, it gives you the option to install a RAC database 
or a single instance.

as oracle user:

% dbca &

 ---------------------------------------------------
 |                                                 |  
 |Welcome to the database configuration assistant  |
 |                                                 |
 |                                                 |
 |                                                 |
 | o Oracle Real Application Cluster database      |
 |                                                 |
 | o Oracle single instance database               |
 |                                                 |
 |-------------------------------------------------|

After selecting RAC, the next screen gives you the option to select nodes:

 ---------------------------------------------------
 |                                                 |  
 |Select the nodes on which you want to create     |
 |the cluster database. The local node oc1 will    |
 |always be used whether or not it is selected.    |
 |                                                 |
 |  Node name                                      |
 |  ---------------------------------------------  |
 |  | [] oc1                                    |  |
 |  | [] oc2                                    |  |
 |  | [] oc3                                    |  |
 |  ---------------------------------------------  |
 |                                                 |
 |                                                 |
 |-------------------------------------------------|
 
In the next screens, you can choose the type of database (oltp, dw etc..), and all
other items, just like a single instance install.
At a cetain point, you can choose to use ASM diskgroups or RAW etc.., choose a flash-recovery area etc..
The way you install the database really resembles a normal single instance install, so we won't discuss
that here.


==============================
5. ASM and RAC in Oracle 10g:
==============================

A number of notes will explain ASM, and the integration of ASM into a RAC system.


========
Note 1:
========

Automatic Storage Management (ASM) in Oracle Database 10g


With ASM, Automatic Storage Management, there is a separate lightweight 10g database involved.
This ASM database (+ASM), contains all metadata about the ASM system.
It also acts as the interface between the regular database and the filesystems.

ASM will provide for presentation and implementation of a special filesystem, on which a number
of redundancy/availability and performance features are implemented.

In addition to the normal database background processes like CKPT, DBWR, LGWR, SMON, and PMON, 
an ASM instance uses at least two additional background processes to manage data storage operations. 
The Rebalancer process, RBAL, coordinates the rebalance activity for ASM disk groups, 
and the Actual ReBalance processes, ARBn, handle the actual rebalance of data extent movements. 
There are usually several ARB background processes (ARB0, ARB1, and so forth). 

Every database instance that uses ASM for file storage, will also need the two new processes. 
The Rebalancer background process (RBAL) handles global opens of all ASM disks in the ASM Disk Groups, 
while the ASM Bridge process (ASMB) connects as a foreground process into the ASM instance when the 
regular database instance starts. ASMB facilitates communication between the ASM instance and 
the regular database, including handling physical file changes like data file creation and deletion. 

ASMB exchanges messages between both servers for statistics update and instance health validation. 
These two processes are automatically started by the database instance when a new Oracle file type - 
for example, a tablespace's datafile -- is created on an ASM disk group. When an ASM instance mounts 
a disk group, it registers the disk group and connect string with Group Services. The database instance 
knows the name of the disk group, and can therefore use it to locate connect information for 
the correct ASM instance.


========
Note 2: 
========

Some terminology in RAC:

CRS cluster ready services - Clusterware:

For Oracle10g on Linux and Windows-based platforms, CRS co-exists with, but does not inter-operate, 
with vendor clusterware. You may use vendor clusterware for all UNIX-based operating systems 
except for Linux. Even though, many of the Unix platforms have their own clusterware products, 
you *need to use* the CRS software to provide the RAC HA support services. CRS (cluster ready services) 
supports services and workload management and helps to maintain the continuous availability of the services. 
CRS also manages resources such as virtual IP (VIP) address for the node and the global services daemon.
Note that the "Voting disks" and the "Oracle Cluster Registry", are regarded as part of the CRS.

OCR:

The Oracle Cluster Registry (OCR) contains cluster and database configuration information 
for Real Application Clusters Cluster Ready Services (CRS), including the list of nodes 
in the cluster database, the CRS application, resource profiles, and the authorizations for 
the Event Manager (EVM). The OCR can reside in a file on a cluster file system or on a shared raw device. 
When you install Real Application Clusters, you specify the location of the OCR.

OCFS (not used often):

OCFS is a shared disk cluster filesystem. Version 1 released for Linux is specifically designed 
to alleviate the need for manag-ing raw devices. It can contain all the 
oracle datafiles, archive log files and controlfiles.  It is however not designed as a 
general purpose filesystem.

OCFS2 is the next generation of the Oracle Cluster File System for Linux. It is an extent based, 
POSIX compliant file system. Unlike the previous release (OCFS), OCFS2 is a general-purpose 
file system that can be used for shared Oracle home installations making management of 
Oracle Real Application Cluster (RAC) installations even easier. Among the new features and benefits are: 

Node and architecture local files using Context Dependent Symbolic Links (CDSL) 
Network based pluggable DLM 
Improved journaling / node recovery using the Linux Kernel "JBD" subsystem 
Improved performance of meta-data operations (space allocation, locking, etc). 
Improved data caching / locking (for files such as oracle binaries, libraries, etc) 

- OCFS1 does NOT support a shared Oracle Home
- OCFS2 does     support a shared Oracle Home

Though ASM appears to be the intended replacement for Oracle Cluster File System (OCFS) 
for the Real Applications Cluster (RAC).
ASM supports Oracle Real Application Clusters (RAC), so there is no need 
for a separate Cluster LVM or a Cluster File System.

So it boils down to:
- You use or OCFS2, or RAW, or ASM (preferrably) for your database files.

Storage Option				Oracle Clusterware	Database	Recovery area
--------------				------------------	--------	-------------
Automatic Storage Management 		No 			Yes 		Yes 
Cluster file system (OCFS) 		Yes 			Yes 		Yes 
Shared raw storage 			Yes 			Yes 		No 


========
Note 3:
========

Automatic Storage Management (ASM) simplifies database administration. It eliminates the need for you, 
as a DBA, to directly manage potentially thousands of Oracle database files. It does this by enabling 
you to create disk groups, which are comprised of disks and the files that reside on them. You only need 
to manage a small number of disk groups.

In the SQL statements that you use for creating database structures such as tablespaces, redo log and 
archive log files, and control files, you specify file location in terms of disk groups. 
Automatic Storage Management then creates and manages the associated underlying files for you.

Automatic Storage Management extends the power of Oracle-managed files. With Oracle-managed files, 
files are created and managed automatically for you, but with Automatic Storage Management you get 
the additional benefits of features such as mirroring and striping.
The primary component of Automatic Storage Management is the disk group. You configure Automatic Storage Management 
by creating disk groups, which, in your database instance, can then be specified as the default 
location for files created in the database. Oracle provides SQL statements that create and manage 
disk groups, their contents, and their metadata.

A disk group consists of a grouping of disks that are managed together as a unit. These disks are referred 
to as ASM disks. Files written on ASM disks are ASM files, whose names are automatically generated 
by Automatic Storage Management. You can specify user-friendly alias names for ASM files, 
but you must create a hierarchical directory structure for these alias names.

You can affect how Automatic Storage Management places files on disks by specifying failure groups. 
Failure groups define disks that share components, such that if one fails then other disks sharing 
the component might also fail. An example of what you might define as a failure group would be a set 
of SCSI disks sharing the same SCSI controller. Failure groups are used to determine which ASM disks 
to use for storing redundant data. For example, if two-way mirroring is specified for a file, 
then redundant copies of file extents must be stored in separate failure groups.


If you would take a look at the v$datafile, v$logfile, and v$controlfile of the regular Database,
you would see information like in the following example:

SQL> select file#, name from v$datafile;

1  +DATA1/rac0/datafile/system.256.1
2  +DATA1/rac0/datafile/undotbs.258.1
3  +DATA1/rac0/datafile/sysaux.257.1
4  +DATA1/rac0/datafile/users.259.1
5  +DATA1/rac0/datafile/example.269.1


SQL> select name from v$controlfile;

+DATA1/rac0/controlfile/current.261.3
+DATA1/rac0/controlfile/current.260.3


-- Initialization Parameters (init.ora or SPFILE) for ASM Instances

The following initialization parameters relate to an ASM instance. Parameters that start with ASM_ 
cannot be set in database instances.

Name             Description 
INSTANCE_TYPE    Must be set to INSTANCE_TYPE = ASM. 
                 Note: This is the only required parameter. All other parameters take suitable defaults 
                 for most environments.
 
DB_UNIQUE_NAME   Unique name for this group of ASM instances within the cluster or on a node. 
Default: +ASM    (Needs to be modified only if trying to run multiple ASM instances on the same node)
 
ASM_POWER_LIMIT  The maximum power on an ASM instance for disk rebalancing. 
Default: 1       Can range from 1 to 11. 1 is the lowest priority. 

See Also: "Tuning Rebalance Operations"
 
ASM_DISKSTRING   Limits the set of disks that Automatic Storage Management considers for discovery. 
Default: NULL    (This default causes ASM to find all of the disks in a platform-specific location to which 
                  it has read/write access.).
                  Example: /dev/raw/*

ASM_DISKGROUPS   Lists the names of disk groups to be mounted by an ASM instance at startup, 
                 or when the ALTER DISKGROUP ALL MOUNT statement is used. 
Default: NULL    (If this parameter is not specified, then no disk groups are mounted.)

Note: This parameter is dynamic and if you are using a server parameter file (SPFILE), then you should 
rarely need to manually alter this value. Automatic Storage Management automatically adds a disk group 
to this parameter when a disk group is successfully mounted, and automatically removes a disk group that 
is specifically dismounted. However, when using a traditional text initialization parameter file, 
remember to edit the initialization parameter file to add the name of any disk group that you want automatically 
mounted at instance startup, and remove the name of any disk group that you no longer want automatically mounted.


-- ASM Views:

The ASM configuration can be viewed using the V$ASM_% views, which often contain different information 
depending on whether they are queried from the ASM instance, or a dependant database instance.

Viewing ASM Instance Information Via SQL Queries
There are several dynamic and data dictionary views available to view an ASM configuration from within 
the ASM instance itself:

ASM Dynamic Views: FROM ASM Instance Information
 
View Name        Description
 
V$ASM_ALIAS      Shows every alias for every disk group mounted by the ASM instance
 
V$ASM_CLIENT     Shows which database instance(s) are using any ASM disk groups that are being mounted by this ASM instance
 
V$ASM_DISK       Lists each disk discovered by the ASM instance, including disks that are not part of any ASM disk group
 
V$ASM_DISKGROUP  Describes information about ASM disk groups mounted by the ASM instance
 
V$ASM_FILE       Lists each ASM file in every ASM disk group mounted by the ASM instance
 
V$ASM_OPERATION  Like its counterpart, V$SESSION_LONGOPS, it shows each long-running ASM operation in the ASM instance
 
V$ASM_TEMPLATE   Lists each template present in every ASM disk group mounted by the ASM instance
 
 
-- Managing disk groups

The SQL statements introduced in this section are only available in an ASM instance. 
You must first start the ASM instance. 

Creating disk group examples:

Example 1:
----------

Creating a Disk Group: Example

The following examples assume that the ASM_DISKSTRING is set to '/devices/*'. Assume the following:

ASM disk discovery identifies the following disks in directory /devices.

/devices/diska1 
/devices/diska2 
/devices/diska3 
/devices/diska4 
/devices/diskb1 
/devices/diskb2 
/devices/diskb3 
/devices/diskb4

The disks diska1 - diska4 are on a separate SCSI controller from disks diskb1 - diskb4.


The following SQL*Plus session illustrates starting an ASM instance and creating a disk group named dgroup1.

% SQLPLUS /NOLOG
SQL> CONNECT / AS SYSDBA

SQL> CREATE DISKGROUP dgroup1 NORMAL REDUNDANCY 
  2  FAILGROUP controller1 DISK
  3 '/devices/diska1',
  4 '/devices/diska2',
  5 '/devices/diska3',
  6 '/devices/diska4',
  7 FAILGROUP controller2 DISK
  8 '/devices/diskb1',
  9 '/devices/diskb2',
 10 '/devices/diskb3',
 11 '/devices/diskb4';

In this example, dgroup1 is composed of eight disks that are defined as belonging to either 
failure group controller1 or controller2. Since NORMAL REDUNDANCY level is specified for the disk group, 
then Automatic Storage Management provides redundancy for all files created in dgroup1 according to the 
attributes specified in the disk group templates.

For example, in the system default template shown in the table in "Managing Disk Group Templates", 
normal redundancy for the online redo log files (ONLINELOG template) is two-way mirroring. This means that 
when one copy of a redo log file extent is written to a disk in failure group controller1, a mirrored copy 
of the file extent is written to a disk in failure group controller2. You can see that to support normal 
redundancy level, at least two failure groups must be defined.

Since no NAME clauses are provided for any of the disks being included in the disk group, 
the disks are assigned the names of dgroup1_0001, dgroup1_0002, ..., dgroup1_0008.


Example 2:
----------

CREATE DISKGROUP disk_group_1 NORMAL REDUNDANCY
  FAILGROUP failure_group_1 DISK
    '/devices/diska1' NAME diska1,
    '/devices/diska2' NAME diska2,
  FAILGROUP failure_group_2 DISK
    '/devices/diskb1' NAME diskb1,
    '/devices/diskb2' NAME diskb2;


Example 3:
----------

At some point in using OUI in installing the software, and creating a database, you will
see the following screen:

----------------------------------------------------
|SPECIFY Database File Storage Option               |
|                                                   |
|  o File system                                    |
|    Specify Database file location: #########      |
|                                                   |
|  o Automatic Storage Management (ASM)             |
|                                                   |
|  o Raw Devices                                    |
|                                                   |
|    Specify Raw Devices mapping file: ##########   |
----------------------------------------------------

Suppose that you have on a Linux machine the following raw disk devices:

/dev/raw/raw1	8GB
/dev/raw/raw2	8GB
/dev/raw/raw3	6GB
/dev/raw/raw4	6GB
/dev/raw/raw5	6GB
/dev/raw/raw6	6GB

Then you can choose ASM in the upper screen, and see the following screen, where
you can create the initial diskgroup and assign disks to it:

-----------------------------------------------------
| Configure Automatic Storage Management              |
|                                                     |
| Disk Group Name:  data1                             |
|                                                     |
| Redundancy                                          |
| o High  o Normal  o External                        |             
|                                                     |
| Add member Disks                                    |
| |--------------------------------                   |
| | select  Disk Path              |                  |
| |[#]     /dev/raw/raw1           |                  |
| |[#]     /dev/raw/raw2           |                  | 
| |[ ]     /dev/raw/raw3           |                  |
| |[ ]     /dev/raw/raw4           |                  |
|  --------------------------------                   |
|                                                     |
-----------------------------------------------------


-- Mounting and Dismounting Disk Groups

Disk groups that are specified in the ASM_DISKGROUPS initialization parameter are mounted automatically 
at ASM instance startup. This makes them available to all database instances running on the same node 
as Automatic Storage Management. The disk groups are dismounted at ASM instance shutdown. 
Automatic Storage Management also automatically mounts a disk group when you initially create it, 
and dismounts a disk group if you drop it.

There may be times that you want to mount or dismount disk groups manually. For these actions use 
the ALTER DISKGROUP ... MOUNT or ALTER DISKGROUP ... DISMOUNT statement. You can mount or dismount 
disk groups by name, or specify ALL.

If you try to dismount a disk group that contains open files, the statement will fail, unless you also
specify the FORCE clause.


Example

The following statement dismounts all disk groups that are currently mounted to the ASM instance:

ALTER DISKGROUP ALL DISMOUNT;


The following statement mounts disk group dgroup1:

ALTER DISKGROUP dgroup1 MOUNT; 


========
Note 4:
========


-- Installing Oracle ASMLib for Linux:

ASMLib is a support library for the Automatic Storage Management feature of Oracle Database 10g. 
This document is a set of tips for installing the Linux specific ASM library and its assocated driver. 
This library is provide to enable ASM I/O to Linux disks without the limitations of the 
standard Unix I/O API. The steps below are steps that the system administrator must follow. 

The ASMLib software is available from the Oracle Technology Network. Go to ASMLib download page 
and follow the link for your platform. 
You will see 4-6 packages for your Linux platform. 

-The oracleasmlib package provides the actual ASM library. 
-The oracleasm-support package provides the utilities used to get the ASM driver 
 up and running. Both of these packages need to be installed. 
-The remaining packages provide the kernel driver for the ASM library. Each package provides 
 the driver for a different kernel. You must install the appropriate package for the kernel you are running. 
 Use the "uname -r command to determine the version of the kernel. The oracleasm kerel driver package 
 will have that version string in its name. For example, if you were running Red Hat Enterprise Linux 4 AS, 
 and the kernel you were using was the 2.6.9-5.0.5.ELsmp kernel, you would choose the 
 oracleasm-2.6.9-5.0.5-ELsmp package. 

So, for example, to install these packages on RHEL4 on an Intel x86 machine,  you might use the command: 

rpm -Uvh oracleasm-support-2.0.0-1.i386.rpm \
    oracleasm-lib-2.0.0-1.i386.rpm \
    oracleasm-2.6.9-5.0.5-ELsmp-2.0.0-1.i686.rpm

Once the command completes, ASMLib is now installed on the system. 

-- Configuring ASMLib: 
 
Now that the ASMLib software is installed, a few steps have to be taken by the system administrator 
to make the ASM driver available. The ASM driver needs to be loaded, and the driver filesystem needs 
to be mounted. This is taken care of by the initialization script, "/etc/init.d/oracleasm". 
Run the "/etc/init.d/oracleasm" script with the "configure" option. It will ask for the user and group 
that default to owning the ASM driver access point. If the database was running as the 'oracle' user 
and the 'dba' group, the output would look like this: 

[root@ca-test1 /]# /etc/init.d/oracleasm configure
  Configuring the Oracle ASM library driver.
 
  This will configure the on-boot properties of the Oracle ASM library
  driver.  The following questions will determine whether the driver is
  loaded on boot and what permissions it will have.  The current values
  will be shown in brackets ('[]').  Hitting  without typing an
  answer will keep that current value.  Ctrl-C will abort.

  Default user to own the driver interface []: oracle
  Default group to own the driver interface []: dba
  Start Oracle ASM library driver on boot (y/n) [n]: y
  Fix permissions of Oracle ASM disks on boot (y/n) [y]: y
  Writing Oracle ASM library driver configuration            [  OK  ]
  Creating /dev/oracleasm mount point                        [  OK  ]
  Loading module "oracleasm"                                 [  OK  ]
  Mounting ASMlib driver filesystem                          [  OK  ]
  Scanning system for ASM disks                              [  OK  ]
 

This should load the oracleasm.o driver module and mount the ASM driver filesystem. 
By selecting enabled = 'y' during the configuration, the system will always load the module 
and mount the filesystem on boot. 
The automatic start can be enabled or disabled with the 'enable' and 'disable' options 
to /etc/init.d/oracleasm: 

  [root@ca-test1 /]# /etc/init.d/oracleasm disable
  Writing Oracle ASM library driver configuration            [  OK  ]
  Unmounting ASMlib driver filesystem                        [  OK  ]
  Unloading module "oracleasm"                               [  OK  ]

  [root@ca-test1 /]# /etc/init.d/oracleasm enable
  Writing Oracle ASM library driver configuration            [  OK  ]
  Loading module "oracleasm"                                 [  OK  ]
  Mounting ASMlib driver filesystem                          [  OK  ]
  Scanning system for ASM disks                              [  OK  ]


-- Making Disks Available to ASMLib: 
 
The system administrator has one last task. Every disk that ASMLib is going to be accessing 
needs to be made available. This is accomplished by creating an ASM disk. The /etc/init.d/oracleasm script 
is again used for this task: 

  [root@ca-test1 /]# /etc/init.d/oracleasm createdisk VOL1 /dev/sdg1
  Creating Oracle ASM disk "VOL1"                            [  OK  ]

 
Disk names are ASCII capital letters, numbers, and underscores. They must start with a letter. 
Disks that are no longer used by ASM can be unmarked as well: 

  [root@ca-test1 /]# /etc/init.d/oracleasm deletedisk VOL1
  Deleting Oracle ASM disk "VOL1"                            [  OK  ]

Any operating system disk can be queried to see if it is used by ASM: 

  [root@ca-test1 /]# /etc/init.d/oracleasm querydisk /dev/sdg1
  Checking if device "/dev/sdg1" is an Oracle ASM disk        [  OK  ]
  [root@ca-test1 /]# /etc/init.d/oracleasm querydisk /dev/sdh1
  Checking if device "/dev/sdh1" is an Oracle ASM disk        [FAILED]

Existing disks can be listed and queried: 

  [root@ca-test1 /]# /etc/init.d/oracleasm listdisks
  VOL1
  VOL2
  VOL3
  [root@ca-test1 /]# /etc/init.d/oracleasm querydisk VOL1
  Checking for ASM disk "VOL1"                               [  OK  ]

When a disk is added to a RAC setup, the other nodes need to be notified about it. 
Run the 'createdisk' command on one node, and then run 'scandisks' on every other node: 

  [root@ca-test1 /]# /etc/init.d/oracleasm scandisks
  Scanning system for ASM disks                              [  OK  ]


-- Discovery Strings for Linux ASMLib: 
 
ASMLib uses discovery strings to determine what disks ASM is asking for. The generic Linux ASMLib 
uses glob strings. The string must be prefixed with "ORCL:". Disks are specified by name. 
A disk created with the name "VOL1" can be discovered in ASM via the discovery string "ORCL:VOL1". 
Similarly, all disks that start with the string "VOL" can be queried with the discovery string "ORCL:VOL*". 
Disks cannot be discovered with path names in the discovery string. If the prefix is missing, 
the generic Linux ASMLib will ignore the discovery string completely, expecting that it is intended 
for a different ASMLib. The only exception is the empty string (""), which is considered a full wildcard. 
This is precisely equivalent to the discovery string "ORCL:*". 

NOTE: Once you mark your disks with Linux ASMLib, Oracle Database 10g R1 (10.1) OUI will not be able 
to discover your disks. It is recommended that you complete a Software Only install and then use DBCA 
to create your database (or use the custom install). 

 
========
Note 5:
========

Automatic Storage Management (ASM) is a new feature that has be introduced in Oracle 10g to 
simplify the storage of Oracle datafiles, controlfiles and logfiles. 


- Overview of Automatic Storage Management (ASM) 
- Initialization Parameters and ASM Instance Creation 
- Startup and Shutdown of ASM Instances 
- Administering ASM Disk Groups 
- Disks 
- Templates 
- Directories 
- Aliases 
- Files 
- Checking Metadata 
- ASM Filenames 
- ASM Views 
- SQL and ASM 
- Migrating to ASM Using RMAN 

Overview of Automatic Storage Management (ASM)
Automatic Storage Management (ASM) simplifies administration of Oracle related files by allowing 
the administrator to reference disk groups rather than individual disks and files, which are managed by ASM. 
The ASM functionality is an extention of the Oracle Managed Files (OMF) functionality that also includes 
striping and mirroring to provide balanced and secure storage. The new ASM functionality can be used in 
combination with existing raw and cooked file systems, along with OMF and manually managed files.

The ASM functionality is controlled by an ASM instance. This is not a full database instance, 
just the memory structures and as such is very small and lightweight.

The main components of ASM are disk groups, each of which comprise of several physical disks that are controlled 
as a single unit. The physical disks are known as ASM disks, while the files that reside on the disks 
are know as ASM files. The locations and names for the files are controlled by ASM, but user-friendly aliases and directory structures can be defined for ease of reference.

The level of redundancy and the granularity of the striping can be controlled using templates. 
Default templates are provided for each file type stored by ASM, but additional templates can be defined as needed.

Failure groups are defined within a disk group to support the required level of redundancy. 
For two-way mirroring you would expect a disk group to contain two failure groups so individual files 
are written to two locations.

In summary ASM provides the following functionality:

Manages groups of disks, called disk groups. 
Manages disk redundancy within a disk group. 
Provides near-optimal I/O balancing without any manual tuning. 
Enables management of database objects without specifying mount points and filenames. 
Supports large files. 
Initialization Parameters and ASM Instance Creation

The init.ora / spfile initialization parameters that are of specific interest for an ASM instance are:

INSTANCE_TYPE   - Set to ASM or RDBMS depending on the instance type. The default is RDBMS. 
DB_UNIQUE_NAME  - Specifies a globally unique name for the database. This defaults to +ASM but 
                  must be altered if you intend to run multiple ASM instances. 
ASM_POWER_LIMIT - The maximum power for a rebalancing operation on an ASM instance. The valid values range 
                  from 1 to 11, with 1 being the default. The higher the limit the more resources are allocated 
                  resulting in faster rebalancing operations. This value is also used as the default 
                  when the POWER clause is omitted from a rebalance operation. 
ASM_DISKGROUPS  - The list of disk groups that should be mounted by an ASM instance during instance startup, 
                  or by the ALTER DISKGROUP ALL MOUNT statement. ASM configuration changes are automatically 
                  reflected in this parameter. 
ASM_DISKSTRING -  Specifies a value that can be used to limit the disks considered for discovery. 
                  Altering the default value may improve the speed of disk group mount time and the speed 
                  of adding a disk to a disk group. Changing the parameter to a value which prevents 
                  the discovery of already mounted disks results in an error. The default value is NULL 
                  allowing all suitable disks to be considered. 

Incorrect usage of parameters in ASM or RDBMS instances result in ORA-15021 errors.

To create an ASM instance first create a file called init+ASM.ora in the /tmp directory 
containing the following information.

INSTANCE_TYPE=ASM 

Next, using SQL*Plus connect to the ide instance.

export ORACLE_SID=+ASM

sqlplus / as sysdba

Create an spfile using the contents of the init+ASM.ora file.

SQL> CREATE SPFILE FROM PFILE='/tmp/init+ASM.ora';

File created.

Finally, start the instance with the NOMOUNT option.

SQL> startup nomount
ASM instance started

Total System Global Area  125829120 bytes
Fixed Size                  1301456 bytes
Variable Size             124527664 bytes
Database Buffers                  0 bytes
Redo Buffers                      0 bytes
SQL>

The ASM instance is now ready to use for creating and mounting disk groups. 
To shutdown the ASM instance issue the following command.

SQL> shutdown
ASM instance shutdown
SQL>

Once an ASM instance is present disk groups can be used for the following parameters 
in database instances (INSTANCE_TYPE=RDBMS) to allow ASM file creation:

DB_CREATE_FILE_DEST 
DB_CREATE_ONLINE_LOG_DEST_n 
DB_RECOVERY_FILE_DEST 
CONTROL_FILES 
LOG_ARCHIVE_DEST_n 
LOG_ARCHIVE_DEST 
STANDBY_ARCHIVE_DEST 


Here is an example of how to create a datafile using a default disk group specified by an initialization parameter setting. 
Suppose the Database initialization parameter file is set as follows:

DB_CREATE_FILE_DEST = �+dskgrp01�

If you now create a tablespace

SQL> CREATE TABLESPACE SALESDATA;

it will be stored in +dskgrp01


Startup and Shutdown of ASM Instances
ASM instance are started and stopped in a similar way to normal database instances. The options 
for the STARTUP command are:

FORCE - Performs a SHUTDOWN ABORT before restarting the ASM instance. 
MOUNT - Starts the ASM instance and mounts the disk groups specified by the ASM_DISKGROUPS parameter. 
NOMOUNT - Starts the ASM instance without mounting any disk groups. 
OPEN - This is not a valid option for an ASM instance. 

The options for the SHUTDOWN command are:

NORMAL - The ASM instance waits for all connected ASM instances and SQL sessions to exit then shuts down. 
IMMEDIATE - The ASM instance waits for any SQL transactions to complete then shuts down. 
            It doesn't wait for sessions to exit. 
TRANSACTIONAL - Same as IMMEDIATE. 
ABORT - The ASM instance shuts down instantly. 

Aministering ASM Disk Groups

Disk groups are created using the CREATE DISKGROUP statement. This statement allows you to specify 
the level of redundancy:

NORMAL REDUNDANCY   - Two-way mirroring, requiring two failure groups. 
HIGH REDUNDANCY     - Three-way mirroring, requiring three failure groups. 
EXTERNAL REDUNDANCY - No mirroring for disks that are already protected using hardware mirroring or RAID. 

In addition failure groups and preferred names for disks can be defined. If the NAME clause is omitted 
the disks are given a system generated name like "disk_group_1_0001". The FORCE option can be used 
to move a disk from another disk group into this one.

CREATE DISKGROUP disk_group_1 NORMAL REDUNDANCY
  FAILGROUP failure_group_1 DISK
    '/devices/diska1' NAME diska1,
    '/devices/diska2' NAME diska2,
  FAILGROUP failure_group_2 DISK
    '/devices/diskb1' NAME diskb1,
    '/devices/diskb2' NAME diskb2;

Disk groups can be deleted using the DROP DISKGROUP statement.

DROP DISKGROUP disk_group_1 INCLUDING CONTENTS;

Disks can be added or removed from disk groups using the ALTER DISKGROUP statement. 
Remember that the wildcard "*" can be used to reference disks so long as the resulting string does not match 
a disk already used by an existing disk group.

-- Add disks.
ALTER DISKGROUP disk_group_1 ADD DISK
  '/devices/disk*3',
  '/devices/disk*4';

-- Drop a disk.
ALTER DISKGROUP disk_group_1 DROP DISK diska2;

Disks can be resized using the RESIZE clause of the ALTER DISKGROUP statement. 
The statement can be used to resize individual disks, all disks in a failure group or all disks 
in the disk group. If the SIZE clause is omitted the disks are resized to the size of the disk returned by the OS.

-- Resize a specific disk.
ALTER DISKGROUP disk_group_1
  RESIZE DISK diska1 SIZE 100G;

-- Resize all disks in a failure group.
ALTER DISKGROUP disk_group_1
  RESIZE DISKS IN FAILGROUP failure_group_1 SIZE 100G;

-- Resize all disks in a disk group.
ALTER DISKGROUP disk_group_1
  RESIZE ALL SIZE 100G;The UNDROP DISKS clause of the ALTER DISKGROUP statement allows pending disk drops 
to be undone. It will not revert drops that have completed, or disk drops associated with the dropping of a disk group.

ALTER DISKGROUP disk_group_1 UNDROP DISKS;

Disk groups can be rebalanced manually using the REBALANCE clause of the ALTER DISKGROUP statement. 
If the POWER clause is omitted the ASM_POWER_LIMIT parameter value is used. Rebalancing is only needed 
when the speed of the automatic rebalancing is not appropriate. 

ALTER DISKGROUP disk_group_1 REBALANCE POWER 5;

Disk groups are mounted at ASM instance startup and unmounted at ASM instance shutdown. 
Manual mounting and dismounting can be accomplished using the ALTER DISKGROUP statement as seen below.

ALTER DISKGROUP ALL DISMOUNT;
ALTER DISKGROUP ALL MOUNT;
ALTER DISKGROUP disk_group_1 DISMOUNT;
ALTER DISKGROUP disk_group_1 MOUNT;

Templates
Templates are named groups of attributes that can be applied to the files within a disk group. 
The following example show how templates can be created, altered and dropped.

-- Create a new template.
ALTER DISKGROUP disk_group_1 ADD TEMPLATE my_template ATTRIBUTES (MIRROR FINE);

-- Modify template.
ALTER DISKGROUP disk_group_1 ALTER TEMPLATE my_template ATTRIBUTES (COARSE);

-- Drop template.
ALTER DISKGROUP disk_group_1 DROP TEMPLATE my_template;Available attributes include:

UNPROTECTED - No mirroring or striping regardless of the redundancy setting. 
MIRROR - Two-way mirroring for normal redundancy and three-way mirroring for high redundancy. 
         This attribute cannot be set for external redundancy. 
COARSE - Specifies lower granuality for striping. This attribute cannot be set for external redundancy. 
FINE - Specifies higher granularity for striping. This attribute cannot be set for external redundancy. 

Directories
A directory heirarchy can be defined using the ALTER DISKGROUP statement to support ASM file aliasing. 
The following examples show how ASM directories can be created, modified and deleted.

-- Create a directory.
ALTER DISKGROUP disk_group_1 ADD DIRECTORY '+disk_group_1/my_dir';

-- Rename a directory.
ALTER DISKGROUP disk_group_1 RENAME DIRECTORY '+disk_group_1/my_dir' TO '+disk_group_1/my_dir_2';

-- Delete a directory and all its contents.
ALTER DISKGROUP disk_group_1 DROP DIRECTORY '+disk_group_1/my_dir_2' FORCE;Aliases
Aliases allow you to reference ASM files using user-friendly names, rather than the fully qualified ASM filenames. 
-- Create an alias using the fully qualified filename.
ALTER DISKGROUP disk_group_1 ADD ALIAS '+disk_group_1/my_dir/my_file.dbf'
  FOR '+disk_group_1/mydb/datafile/my_ts.342.3';

-- Create an alias using the numeric form filename.
ALTER DISKGROUP disk_group_1 ADD ALIAS '+disk_group_1/my_dir/my_file.dbf'
  FOR '+disk_group_1.342.3';

-- Rename an alias.
ALTER DISKGROUP disk_group_1 RENAME ALIAS '+disk_group_1/my_dir/my_file.dbf'
  TO '+disk_group_1/my_dir/my_file2.dbf';

-- Delete an alias.
ALTER DISKGROUP disk_group_1 DELETE ALIAS '+disk_group_1/my_dir/my_file.dbf';

Attempting to drop a system alias results in an error.

Files
Files are not deleted automatically if they are created using aliases, as they are not Oracle Managed Files (OMF), 
or if a recovery is done to a point-in-time before the file was created. For these circumstances 
it is necessary to manually delete the files, as shown below.

-- Drop file using an alias.
ALTER DISKGROUP disk_group_1 DROP FILE '+disk_group_1/my_dir/my_file.dbf';

-- Drop file using a numeric form filename.
ALTER DISKGROUP disk_group_1 DROP FILE '+disk_group_1.342.3';

-- Drop file using a fully qualified filename.
ALTER DISKGROUP disk_group_1 DROP FILE '+disk_group_1/mydb/datafile/my_ts.342.3';

Checking Metadata
The internal consistency of disk group metadata can be checked in a number of ways using the CHECK clause 
of the ALTER DISKGROUP statement.

-- Check metadata for a specific file.
ALTER DISKGROUP disk_group_1 CHECK FILE '+disk_group_1/my_dir/my_file.dbf'

-- Check metadata for a specific failure group in the disk group.
ALTER DISKGROUP disk_group_1 CHECK FAILGROUP failure_group_1;

-- Check metadata for a specific disk in the disk group. 
ALTER DISKGROUP disk_group_1 CHECK DISK diska1;

-- Check metadata for all disks in the disk group. 
ALTER DISKGROUP disk_group_1 CHECK ALL;

ASM Views
The ASM configuration can be viewed using the V$ASM_% views, which often contain different information 
depending on whether they are queried from the ASM instance, or a dependant database instance.

Viewing ASM Instance Information Via SQL Queries
Finally, there are several dynamic and data dictionary views available to view an ASM configuration from within 
the ASM instance itself:

-- ASM Dynamic Views: FROM ASM Instance Information
 
View Name        Description
 
V$ASM_ALIAS      Shows every alias for every disk group mounted by the ASM instance
 
V$ASM_CLIENT     Shows which database instance(s) are using any ASM disk groups that are being mounted by this ASM instance
 
V$ASM_DISK       Lists each disk discovered by the ASM instance, including disks that are not part of any ASM disk group
 
V$ASM_DISKGROUP  Describes information about ASM disk groups mounted by the ASM instance
 
V$ASM_FILE       Lists each ASM file in every ASM disk group mounted by the ASM instance
 
V$ASM_OPERATION  Like its counterpart, V$SESSION_LONGOPS, it shows each long-running ASM operation in the ASM instance
 
V$ASM_TEMPLATE   Lists each template present in every ASM disk group mounted by the ASM instance
 

I was also able to query the following dynamic views against my database instance to view the related ASM storage 
components of that instance:

-- ASM Dynamic Views: FROM Database Instance Information
 
View Name          Description
 
V$ASM_DISKGROUP    Shows one row per each ASM disk group that's mounted by the local ASM instance
 
V$ASM_DISK         Displays one row per each disk in each ASM disk group that are in use by the database instance
 
V$ASM_CLIENT       Lists one row per each ASM instance for which the database instance has any open ASM files
 

ASM Filenames
There are several ways to reference ASM file. Some forms are used during creation and some for 
referencing ASM files. The forms for file creation are incomplete, relying on ASM to create the fully qualified name, 
which can be retrieved from the supporting views. The forms of the ASM filenames are summarised below.

Filename Type Format 
Fully Qualified ASM Filename +dgroup/dbname/file_type/file_type_tag.file.incarnation 
Numeric ASM Filename +dgroup.file.incarnation 
Alias ASM Filenames +dgroup/directory/filename 
Alias ASM Filename with Template +dgroup(template)/alias 
Incomplete ASM Filename +dgroup 
Incomplete ASM Filename with Template +dgroup(template) 

SQL and ASM
ASM filenames can be used in place of conventional filenames for most Oracle file types, including controlfiles, 
datafiles, logfiles etc. For example, the following command creates a new tablespace with a datafile 
in the disk_group_1 disk group.

CREATE TABLESPACE my_ts DATAFILE '+disk_group_1' SIZE 100M AUTOEXTEND ON;Migrating to ASM Using RMAN
The following method shows how a primary database can be migrated to ASM from a disk based backup:

Disable change tracking (only available in Enterprise Edition) if it is currently being used.

SQL> ALTER DATABASE DISABLE BLOCK CHANGE TRACKING;Shutdown the database.

SQL> SHUTDOWN IMMEDIATEModify the parameter file of the target database as follows:

Set the DB_CREATE_FILE_DEST and DB_CREATE_ONLINE_LOG_DEST_n parameters to the relevant ASM disk groups. 
Remove the CONTROL_FILES parameter from the spfile so the control files will be moved to the DB_CREATE_* destination 
and the spfile gets updated automatically. If you are using a pfile the CONTROL_FILES parameter must be set 
to the appropriate ASM files or aliases. 


Start the database in nomount mode.

RMAN> STARTUP NOMOUNTRestore the controlfile into the new location from the old location.

RMAN> RESTORE CONTROLFILE FROM 'old_control_file_name';Mount the database.

RMAN> ALTER DATABASE MOUNT;Copy the database into the ASM disk group.

RMAN> BACKUP AS COPY DATABASE FORMAT '+disk_group';Switch all datafile to the new ASM location.

RMAN> SWITCH DATABASE TO COPY;Open the database.

RMAN> ALTER DATABASE OPEN;Create new redo logs in ASM and delete the old ones.


Enable change tracking if it was being used.

SQL> ALTER DATABASE ENABLE BLOCK CHANGE TRACKING;Form more information see:

Using Automatic Storage Management 
Migrating a Database into ASM 
Hope this helps. Regards Tim...


Note 6:
=======


How to Use Oracle10g release 2 ASM on Linux:

[root@danaly etc]# fdisk /dev/cciss/c0d0

The number of cylinders for this disk is set to 8854.
There is nothing wrong with that, but this is larger than 1024,
and could in certain setups cause problems with:
1) software that runs at boot time (e.g., old versions of LILO)
2) booting and partitioning software from other OSs
   (e.g., DOS FDISK, OS/2 FDISK)

Command (m for help): p

Disk /dev/cciss/c0d0: 72.8 GB, 72833679360 bytes
255 heads, 63 sectors/track, 8854 cylinders
Units = cylinders of 16065 * 512 = 8225280 bytes

           Device Boot      Start         End      Blocks   Id  System
/dev/cciss/c0d0p1   *           1          33      265041   83  Linux
/dev/cciss/c0d0p2              34         555     4192965   82  Linux swap
/dev/cciss/c0d0p3             556         686     1052257+  83  Linux
/dev/cciss/c0d0p4             687        8854    65609460    5  Extended
/dev/cciss/c0d0p5             687        1730     8385898+  83  Linux
/dev/cciss/c0d0p6            1731        2774     8385898+  83  Linux
/dev/cciss/c0d0p7            2775        3818     8385898+  83  Linux
/dev/cciss/c0d0p8            3819        4601     6289416   83  Linux

Command (m for help): n
First cylinder (4602-8854, default 4602): 
Using default value 4602
Last cylinder or +size or +sizeM or +sizeK (4602-8854, default 8854): +20000M    

Command (m for help): n
First cylinder (7035-8854, default 7035): 
Using default value 7035
Last cylinder or +size or +sizeM or +sizeK (7035-8854, default 8854): +3000M 

Command (m for help): n
First cylinder (7401-8854, default 7401): 
Using default value 7401
Last cylinder or +size or +sizeM or +sizeK (7401-8854, default 8854): +3000M

Command (m for help): p

Disk /dev/cciss/c0d0: 72.8 GB, 72833679360 bytes
255 heads, 63 sectors/track, 8854 cylinders
Units = cylinders of 16065 * 512 = 8225280 bytes

           Device Boot      Start         End      Blocks   Id  System
/dev/cciss/c0d0p1   *           1          33      265041   83  Linux
/dev/cciss/c0d0p2              34         555     4192965   82  Linux swap
/dev/cciss/c0d0p3             556         686     1052257+  83  Linux
/dev/cciss/c0d0p4             687        8854    65609460    5  Extended
/dev/cciss/c0d0p5             687        1730     8385898+  83  Linux
/dev/cciss/c0d0p6            1731        2774     8385898+  83  Linux
/dev/cciss/c0d0p7            2775        3818     8385898+  83  Linux
/dev/cciss/c0d0p8            3819        4601     6289416   83  Linux
/dev/cciss/c0d0p9            4602        7034    19543041   83  Linux
/dev/cciss/c0d0p10           7035        7400     2939863+  83  Linux
/dev/cciss/c0d0p11           7401        7766     2939863+  83  Linux

Command (m for help): w
The partition table has been altered!

Calling ioctl() to re-read partition table.

WARNING: Re-reading the partition table failed with error 16: Device or resource busy.
The kernel still uses the old table.
The new table will be used at the next reboot.
Syncing disks.


[root@danaly data1]# /etc/init.d/oracleasm createdisk VOL5 /dev/cciss/c0d0p10
Marking disk "/dev/cciss/c0d0p10" as an ASM disk: [  OK  ]
[root@danaly data1]# /etc/init.d/oracleasm createdisk VOL6 /dev/cciss/c0d0p11
Marking disk "/dev/cciss/c0d0p11" as an ASM disk: [  OK  ]
[root@danaly data1]# /etc/init.d/oracleasm listdisks
VOL1
VOL2
VOL3
VOL4
VOL5
VOL6


(THE FOLLOWING QUERIES ARE ISSUED FROM THE ASM INSTANCE.)

[oracle@danaly ~]$ export ORACLE_SID=+ASM
[oracle@danaly ~]$ sqlplus "/ as sysdba"

SQL*Plus: Release 10.2.0.1.0 - Production on Sun Sep 3 00:28:09 2006

Copyright (c) 1982, 2005, Oracle.  All rights reserved.

Connected to an idle instance.

SQL> startup
ASM instance started

Total System Global Area   83886080 bytes
Fixed Size                  1217836 bytes
Variable Size              57502420 bytes
ASM Cache                  25165824 bytes
ASM diskgroups mounted

SQL> select group_number,disk_number,mode_status from v$asm_disk;

GROUP_NUMBER DISK_NUMBER MODE_STATUS
------------ ----------- --------------
           0           4 ONLINE
           0           5 ONLINE
           1           0 ONLINE
           1           1 ONLINE
           1           2 ONLINE
           1           3 ONLINE

6 rows selected.

SQL> select group_number,disk_number,mode_status,name from v$asm_disk;

GROUP_NUMBER DISK_NUMBER MODE_STATUS    NAME
------------ ----------- -------------- ---------------------------------
           0           4 ONLINE
           0           5 ONLINE
           1           0 ONLINE         VOL1
           1           1 ONLINE         VOL2
           1           2 ONLINE         VOL3
           1           3 ONLINE         VOL4

6 rows selected.

SQL> create diskgroup orag2 external redundancy disk 'ORCL:VOL5';

Diskgroup created.

SQL> select group_number,disk_number,mode_status,name from v$asm_disk;

GROUP_NUMBER DISK_NUMBER MODE_STATUS    NAME
------------ ----------- -------------- -------------------------------------
           0           5 ONLINE
           1           0 ONLINE         VOL1
           1           1 ONLINE         VOL2
           1           2 ONLINE         VOL3
           1           3 ONLINE         VOL4
           2           0 ONLINE         VOL5

6 rows selected.


(THE FOLLOWING QUERIES ARE ISSUED FROM THE DATABASE INSTANCE.)

[oracle@danaly ~]$ export ORACLE_SID=danaly
[oracle@danaly ~]$ sqlplus "/ as sysdba"

SQL*Plus: Release 10.2.0.1.0 - Production on Sun Sep 3 00:47:04 2006

Copyright (c) 1982, 2005, Oracle.  All rights reserved.

Connected to an idle instance.

SQL> startup
ORACLE instance started.

Total System Global Area  943718400 bytes
Fixed Size                  1222744 bytes
Variable Size             281020328 bytes
Database Buffers          654311424 bytes
Redo Buffers                7163904 bytes
Database mounted.
Database opened.

SQL> select name from v$datafile;

NAME
--------------------------------------------------------------------------------
+ORADG/danaly/datafile/system.264.600016955
+ORADG/danaly/datafile/undotbs1.265.600016969
+ORADG/danaly/datafile/sysaux.266.600016977
+ORADG/danaly/datafile/users.268.600016987


SQL> create tablespace eygle datafile '+ORAG2' ;

Tablespace created.

SQL> select name from v$datafile;

NAME
---------------------------------------------------------------------------------
+ORADG/danaly/datafile/system.264.600016955
+ORADG/danaly/datafile/undotbs1.265.600016969
+ORADG/danaly/datafile/sysaux.266.600016977
+ORADG/danaly/datafile/users.268.600016987
+ORAG2/danaly/datafile/eygle.256.600137647


oracle@danaly log]$ export ORACLE_SID=+ASM
[oracle@danaly log]$ sqlplus "/ as sysdba"

SQL*Plus: Release 10.2.0.1.0 - Production on Sun Sep 3 01:36:37 2006

Copyright (c) 1982, 2005, Oracle.  All rights reserved.


Connected to:
Oracle Database 10g Enterprise Edition Release 10.2.0.1.0 - Production
With the Partitioning, Oracle Label Security, OLAP and Data Mining Scoring Engine options

SQL> alter diskgroup orag2 add disk 'ORCL:VOL6';

Diskgroup altered.


============
7: OMF
============

In a way, OMF is partly a predeccessor for ASM, on Oracle 9i. The OMF feature only provides for
easy management of tablespace files, like in create statements, where you do not have to
"worry" anymore on filelocations.


Using Oracle-managed files simplifies the administration of an Oracle database. Oracle-managed files eliminate 
the need for you, the DBA, to directly manage the operating system files comprising an Oracle database. 
You specify operations in terms of database objects rather than filenames. Oracle internally uses standard 
file system interfaces to create and delete files as needed for the following database structures:

Tablespaces 
Online redo log files 
Control files 


The following initialization parameters init.ora/spfile.ora allow the database server to use 
the Oracle Managed Files feature:


- DB_CREATE_FILE_DEST
  Defines the location of the default file system directory where Oracle creates datafiles 
  or tempfiles when no file specification is given in the creation operation. Also used as the default 
  file system directory for online redo log and control files if DB_CREATE_ONLINE_LOG_DEST_n is not specified.
 
- DB_CREATE_ONLINE_LOG_DEST_n
  Defines the location of the default file system directory for online redo log files and 
  control file creation when no file specification is given in the creation operation. You can use this 
  initialization parameter multiple times, where n specifies a multiplexed copy of the online redo log 
  or control file. You can specify up to five multiplexed copies
 
Example:

DB_CREATE_FILE_DEST         = '/u01/oradata/payroll'
DB_CREATE_ONLINE_LOG_DEST_1 = '/u02/oradata/payroll'
DB_CREATE_ONLINE_LOG_DEST_2 = '/u03/oradata/payroll'


=============================================
8: Installation notes 10g RAC on Windows
==============================================


8.1 Before you install:
-----------------------

Each node in a cluster requires the following:

> One private internet protocol (IP) address for each node to serve as the private interconnect. 
 The following must be true for each private IP address:

 -It must be separate from the public network
 -It must be accessible on the same network interface on each node
 -It must have a unique address on each node

 The private interconnect is used for inter-node communication by both Oracle Clusterware and RAC. 
 If the private address is available from a network name server (DNS), then you can use that name. 
 Otherwise, the private IP address must be available in each node's C:\WINNT\system32\drivers\etc\hosts file.

> One public IP address for each node, to be used as the Virtual IP (VIP) address for client connections 
and for connection failover. The name associated with the VIP must be different from the default host name.

This VIP must be associated with the same interface name on every node that is part of your cluster. 
In addition, the IP addresses that you use for all of the nodes that are part of a cluster must be from 
the same subnet. 

> One public fixed hostname address for each node, typically assigned by the system administrator 
during operating system installation. If you have a DNS, then register both the fixed IP and the VIP address 
with DNS. If you do not have DNS, then you must make sure that the public IP and VIP addresses for all 
nodes are in each node's host file.

For example, with a two node cluster where each node has one public and one private interface, 
you might have the configuration shown in the following table for your network interfaces, 
where the hosts file is %SystemRoot%\system32\drivers\etc\hosts:

Node Interface Name 	Type 		IP Address 	Registered In 
rac1 rac1 		Public 		143.46.43.100 	DNS (if available, else the hosts file) 
rac1 rac1-vip 		Virtual 	143.46.43.104 	DNS (if available, else the hosts file) 
rac1 rac1-priv 		Private 	10.0.0.1 	Hosts file 
rac2 rac2 		Public 		143.46.43.101 	DNS (if available, else the hosts file) 
rac2 rac2-vip 		Virtual 	143.46.43.105 	DNS (if available, else the hosts file) 
rac2 rac2-priv 		Private 	10.0.0.2 	Hosts file 

The virtual IP addresses are assigned to the listener process.

To enable VIP failover, the configuration shown in the preceding table defines the public and VIP addresses 
of both nodes on the same subnet, 143.46.43. When a node or interconnect fails, then the associated VIP 
is relocated to the surviving instance, enabling fast notification of the failure to the clients connecting 
through that VIP. If the application and client are configured with transparent application failover options, 
then the client is reconnected to the surviving instance.

To disable Windows Media Sensing for TCP/IP, you must set the value of the DisableDHCPMediaSense parameter to 1 
on each node. Disable Media Sensing by completing the following steps on each node of your cluster:

Use Registry Editor (Regedt32.exe) to view the following key in the registry:

HKEY_LOCAL_MACHINE\System\CurrentControlSet\Services\Tcpip\Parameters

Add the following registry value:

Value Name: DisableDHCPMediaSense
Data Type: REG_DWORD -Boolean
Value: 1


- External shared disks for storing Oracle Clusterware and database files.
The disk configuration options available to you are described in Chapter 3, "Storage Pre-Installation Tasks". 
Review these options before you decide which storage option to use in your RAC environment. However, note 
that when Database Configuration Assistant (DBCA) configures automatic disk backup, it uses a 
database recovery area which must be shared. The database files and recovery files do not necessarily have 
to be located on the same type of storage.

Determine the storage option for your system and configure the shared disk. Oracle recommends that 
you use Automatic Storage Management (ASM) and Oracle Managed Files (OMF), or a cluster file system. 
If you use ASM or a cluster file system, then you can also take advantage of OMF and other Oracle Database 10g 
storage features. If you use RAC on Oracle Database 10g Standard Edition, then you must use ASM.

If you use ASM, then Oracle recommends that you install ASM in a separate home from the 
Oracle Clusterware home and the Oracle home. 

Oracle Database 10g Real Application Clusters installation is a two-phase installation. 
In phase one, use Oracle Universal Installer (OUI) to install Oracle Clusterware. 
In phase two, install the database software using OUI.

When you install Oracle Clusterware or RAC, OUI copies the Oracle software onto the node from which 
you are running it. If your Oracle home is not on a cluster file system, then OUI propagates the software 
onto the other nodes that you have selected to be part of your OUI installation session. 

- Shared Storage for Database Recovery Area
When you configure a database recovery area in a RAC environment, the database recovery area must be on 
shared storage. When Database Configuration Assistant (DBCA) configures automatic disk backup, it uses 
a database recovery area that must be shared.

If the database files are stored on a cluster file system, then the recovery area can also be shared through 
the cluster file system.

If the database files are stored on an Automatic Storage Management (ASM) disk group, then the recovery area 
can also be shared through ASM.

If the database files are stored on raw devices, then you must use either a cluster file system or ASM 
for the recovery area.

Note:

ASM disk groups are always valid recovery areas, as are cluster file systems. Recovery area files do not have 
to be in the same location where datafiles are stored. For instance, you can store datafiles on raw devices, 
but use ASM for the recovery area.

Data files are not placed on NTFS partitions, because they cannot be shared. 
Data files can be placed on Oracle Cluster File System (OCFS), on raw disks using ASM, or on raw disks.


- Oracle Clusterware
You must provide OUI with the names of the nodes on which you want to install Oracle Clusterware. 
The Oracle Clusterware home can be either shared by all nodes, or private to each node, depending 
on your responses when you run OUI. The home that you select for Oracle Clusterware must be different 
from the RAC-enabled Oracle home.

Versions of cluster manager previous to Oracle Database 10g were sometimes referred to as "Cluster Manager". 
In Oracle Database 10g, this function is performed by a Oracle Clusterware component known as 
Cluster Synchronization Services (CSS). The OracleCSService, OracleCRService, and OracleEVMService 
replace the service known previous to Oracle Database 10g as OracleCMService9i.


8.2 cluvfy or runcluvfy.bat:
----------------------------

Once you have installed Oracle Clusterware, you can use CVU by entering cluvfy commands on the command line. 
To use CVU before you install Oracle Clusterware, you must run the commands using a command file available 
on the Oracle Clusterware installation media. Use the following syntax to run a CVU command run from the 
installation media, where media is the location of the Oracle Clusterware installation media and options 
is a list of one or more CVU command options:

media\clusterware\cluvfy\runcluvfy.bat options

You do not have to be the root user to use the CVU and the CVU assumes that the current user is the oracle user.


The following code example is of a CVU help command, run from a staged copy of the Oracle Clusterware 
directory downloaded from OTN into a directory called stage on your C: drive:

C:\stage\clusterware\cluvfy> runcluvfy.bat comp nodereach -n node1,node2 -verbose

For a quick test, you can run the following CVU command that you would normally use after you have completed 
the basic hardware and software configuration:

prompt> media\clusterware\cluvfy\runcluvfy.bat stage �post hwos �n node_list

Use the location of your Oracle Clusterware installation media for the media value and a list of the nodes, 
separated by commas, in your cluster for node_list. Expect to see many errors if you run this command 
before you or your system administrator complete the cluster pre-installation steps.

On Oracle Real Application Clusters systems, each member node of the cluster must have user equivalency 
for the Administrative privileges account that installs the database. This means that the administrative 
privileges user account and password must be the same on all nodes.

- Checking the Hardware and Operating System Setup with CVU
You can use two different CVU commands to check your hardware and operating system configuration. 
The first is a general check of the configuration, and the second specifically checks for the components required 
to install Oracle Clusterware.

The syntax of the more general CVU command is:

cluvfy stage �post hwos �n node_list [-verbose]

where node_list is the names of the nodes in your cluster, separated by commas. However, because you have 
not yet installed Oracle Clusterware, you must execute the CVU command from the installation media using a command 
like the one following. In this example, the command checks the hardware and operating system of a two-node 
cluster with nodes named node1 and node2, using a staged copy of the installation media in a directory called 
stage on the C: drive:

C:\stage\clusterware\cluvfy> runcluvfy.bat stage �post hwos �n node1,node2 -verbose

You can omit the -verbose keyword if you do not wish to see detailed results listed as CVU performs 
each individual test.

The following example is a command, without the -verbose keyword, to check for the readiness of the cluster 
for installing Oracle Clusterware:

C:\stage\clusterware\cluvfy> runcluvfy.bat comp sys -n node1,node2 -p crs

- Checking the Network Setup
Enter a command using the following syntax to verify node connectivity between all of the nodes 
for which your cluster is configured:

cluvfy comp nodecon -n node_list [-verbose]

- Verifying Cluster Privileges
Before running Oracle Universal Installer, from the node where you intend to run the Installer, 
verify that you have administrative privileges on the other nodes. To do this, enter the following command 
for each node that is a part of the cluster:

net use \\node_name\C$

where node_name is the node name. If your installation will access drives in addition to the C: drive, repeat 
this command for every node in the cluster, substituting the drive letter for each drive you plan to use.

For the installation to be successful, you must use the same user name and password on each node in a cluster 
or use a domain user name. If you use a domain user name, then log on under a domain with a user name and password 
to which you have explicitly granted local administrative privileges on all nodes.

To verify your configuration BEFORE installing CRS:

[oracle] $ cd /staging_area/clusterware/cluvfy
[oracle] $ ./runcluvfy.sh stage -pre crsinst -n docrac1,docrac2 -verbose

To verify your configuration AFTER installing CRS:
$ ./runcluvfy.sh stage -post crsinst -n docrac1,docrac2


8.3 Shared disk considerations:
-------------------------------

Preliminary Shared Disk Preparation
Complete the following steps to prepare shared disks for storage:

-- Disabling Write Caching
You must disable write caching on all disks that will be used to share data between nodes in your cluster. 
To disable write caching, perform these steps:

Click Start, then click Settings, then Control Panel, then Administrative Tools, then Computer Management, 
then Device Manager, and then Disk drives
Expand the Disk drives and double-click the first drive listed
Under the Disk Properties tab for the selected drive, uncheck the option that enables the write cache
Double-click each of the other drives listed in the Disk drives hive and disable the write cache as described 
in the previous step

Caution:

Any disks that you use to store files, including database files, that will be shared between nodes, 
must have write caching disabled.

-- Enabling Automounting for Windows 2003
If you are using Windows 2003, then you must enable disk automounting, depending on the Oracle products 
you are installing and on other conditions.

You must enable automounting when using:

Raw partitions for Oracle Real Application Clusters (RAC)
Cluster file system for Oracle Real Application Clusters
Oracle Clusterware
Raw partitions for a single-node database installation
Logical drives for Automatic Storage Management (ASM)

To enable automounting:

Enter the following commands at a command prompt:

c:\> diskpart
DISKPART> automount enable
Automatic mounting of new volumes enabled.

Type exit to end the diskpart session

Repeat steps 1 and 2 for each node in the cluster.


8.4 Reviewing Storage Options for Oracle Clusterware, Database, and Recovery Files:
-----------------------------------------------------------------------------------

This section describes supported options for storing Oracle Clusterware files, Oracle Database software, 
and database files. 

-- Overview of Oracle Clusterware Storage Options

Note that Oracle Clusterware files include the Oracle Cluster Registry (OCR) and 
the Oracle Clusterware voting disk.

There are two ways to store Oracle Clusterware files:

1. Oracle Cluster File System (OCFS): The cluster file system Oracle provides for the Windows and Linux communities. 
If you intend to store Oracle Clusterware files on OCFS, then you must ensure that OCFS volume sizes 
are at least 500 MB each.

2. Raw storage: Raw logical volumes or raw partitions are created and managed by Microsoft Windows 
disk management tools or by tools provided by third party vendors.

Note that you must provide disk space for one mirrored Oracle Cluster Registry (OCR) file, 
and two mirrored voting disk files.

-- Overview of Oracle Database and Recovery File Options

There are three ways to store Oracle Database and recovery files on shared disks:

1. Automatic Storage Management (database files only): Automatic Storage Management (ASM) is an integrated, 
high-performance database file system and disk manager for Oracle files. Because ASM requires an 
Oracle Database instance, it cannot contain Oracle software, but you can use ASM to manage database 
and recovery files.

2. Oracle Cluster File System (OCFS): Note that if you intend to use OCFS for your database files, 
then you should create partitions large enough for the database files when you create partitions 
for Oracle Clusterware

Note:

If you want to have a shared Oracle home directory for all nodes, then you must use OCFS.

3. Raw storage: Note that you cannot use raw storage to store Oracle database recovery files.

The storage option that you choose for recovery files can be the same as or different to the option 
you choose for the database files.


Storage Option				Oracle Clusterware	Database	Recovery area
--------------				------------------	--------	-------------
Automatic Storage Management 		No 			Yes 		Yes 
Cluster file system (OCFS) 		Yes 			Yes 		Yes 
Shared raw storage 			Yes 			Yes 		No 


-- Checking for Available Shared Storage with CVU
To check for all shared file systems available across all nodes on the cluster, use the following CVU command:

cluvfy comp ssa -n node_list

Remember to use the full path name and the runcluvfy.bat command on the installation media and include 
the list of nodes in your cluster, separated by commas, for the node_list. The following example is for 
a system with two nodes, node1 and node2, and the installation media on drive F:

F:\clusterware\cluvfy> runcluvfy.bat comp ssa -n node1,node2

If you want to check the shared accessibility of a specific shared storage type to specific nodes 
in your cluster, then use the following command syntax:

cluvfy comp ssa -n node_list -s storageID_list

In the preceding syntax, the variable node_list is the list of nodes you want to check, separated by commas, 
and the variable storageID_list is the list of storage device IDs for the storage devices managed by the 
file system type that you want to check.


===========================================
9. RAC tools an utilities.
===========================================


9.1: removing and adding a failed node:
=======================================

Suppose, using above example, that instance rac3 on node oc3, fails. Suppose that you need to repair
the node (e.g. harddisk crash).

-- Remove the instance:

% srvctl remove instance -d rac -i rac3
Remove instance rac3 for the database rac (y/n)? y

or use DBCA to delete the rac3 instance

-- Remove the node from the cluster:

# cd /u01/app/oracle/product/10.1.0/CRS10gHome/bin
# ./olsnode -n
oc1   1
oc2   2
oc3   3
# cd ../install
# ./rootdeletenode.sh oc3,3
# cd ../bin
# ./olsnode -n
oc1   1
oc2   2
#

Suppose that you have repared host oc3. We now want to add it back into the cluster.
Host oc3 has the OS newly installed, and its /etc/host file is just like it is on the other nodes.

-- Add the node at the clusterware layer:

From oc1 or oc2, go to the $CRS_Home/oui/bin directory, and run

# ./addNode.sh

A graphical screen pops up, and you are able to add oc3 to the cluster.
Al CRS files are copied to the new node.

To start the services on the new node, you are then prompted to run "rootaddnode.sh" on the active node
and "root.sh" on the new node.

# ./rootaddnode.sh

# ssh oc3
# cd /u01/app/oracle product/10.1.0/CRS10gHome
# ./root.sh


9.2 CRSCTL, OCRDUMP, OCRCONFIG, OCRCHECK:
=========================================

These are the Primary tools to view status, manipulate OCR and Voting Disk:

Example: Checking status Voting Disks:
$ crsctl query css votedisk

[root@node1-pub ~]# crsctl query css votedisk
 0.     0    /u02/ocfs2/vote/VDFile_0
 1.     0    /u02/ocfs2/vote/VDFile_1
 2.     0    /u02/ocfs2/vote/VDFile_2
Located 3 voting disk(s).

Example: You can dynamically add and remove voting disks after installing Oracle RAC. 
Do this using the following commands where path is the fully qualified path for the additional voting disk. 
Run the following command as the root user to add a voting disk:

# crsctl add css votedisk /dev/raw/raw9

Example: Run the following command as the root user to remove a voting disk:

# crsctl delete css votedisk path


Example: Adding a extra OCR file:
# ocrconfig -replace ocr /dev/raw/raw2

Example: show the automatic ocr backups:
# ocrconfig -showbackup


9.3 Some other commandline tools:
=================================

- olsnodes

Example: If you want to list the nodes in the cluster running CRS
$ olsnodes
oranode1
oranode2


5. The "crs_" tools:

There are a few other tools that provide information of your cluster. These tools are targeted at registered applications like the nodeapps or other apps.

For example, to list the status of the apps in the cluster, use crs_stat:

/home/oracle-->$CRS_HOME/bin/crs_stat -t


Name           Type           Target    State     Host
------------------------------------------------------------
ora....SM1.asm application    ONLINE    ONLINE    aix1
ora....x1.lsnr application    ONLINE    ONLINE    aix1
ora....ix1.gsd application    ONLINE    ONLINE    aix1
ora....ix1.ons application    ONLINE    ONLINE    aix1
ora....ix1.vip application    ONLINE    ONLINE    aix1
ora....SM2.asm application    ONLINE    ONLINE    aix2
ora....x2.lsnr application    ONLINE    ONLINE    aix2
ora....ix2.gsd application    ONLINE    ONLINE    aix2
ora....ix2.ons application    ONLINE    ONLINE    aix2
ora....ix2.vip application    ONLINE    ONLINE    aix2
ora....test.db application    ONLINE    ONLINE    aix1
ora....x1.inst application    ONLINE    ONLINE    aix1
ora....x2.inst application    ONLINE    ONLINE    aix2
/home/oracle--> 


Note 3: showing all nodes from a node:
--------------------------------------

# lsnodes -v

# cd /u01/app/oracle/product/10.1.0/CRS10gHome/bin
# ./olsnodes -n
oc1   1
oc2   2
oc3   3


9.4: using svrctl:
==================


The Server Control SVRCTL utility is installed on each node by default. 
You can use SRVCTL to start and stop the database and instances, manage configuration information,
and to move or remove instances and services.

Some SVRCTL operations store configuration information in the OCR. 
SVRCTL performs other operations, such as starting and stopping instances, by sending request
to the Oracle Clusterware process CSRD, which then starts or stops the Oracle Clusterware resources.

srvctl must be run from the $ORACLE_HOME of the RAC you are administering. 
The basic format of a srvctl command is 

srvctl <command> <target> [options]

where command is one of

enable|disable|start|stop|relocate|status|add|remove|modify|getenv|setenv|unsetenv|config

and the target, or object, can be a 
-database, 
-instance, 
-service, 
-ASM instance, or the 
-nodeapps.

1. SVRCTL:
srvctl is the primary tool DBAs use to configure and manipulate Instances, RAC database and processes.

Example 1. Bring up the MYSID1 instance of the MYSID database.
$ srvctl start instance -d MYSID -i MYSID1

Example 2. 
Stop the MYSID database and all its instances, on all nodes:
$ srvctl stop database -d MYSID
$ srvctl stop database -d MYSID -o immediate
Start the MYSID database and all instances:
$ srvctl start database -d MYSID

Example 3. Stop the orcl3 and orcl4 instances, associated to the orcl database, immediate.
$ srvctl stop instance -d orcl -i "orcl3,orcl4" -o immediate -c "sysback/oracle as sysoper" 

Example 4. Stop the nodeapps (listener and other apps) on node1
$ srvctl stop nodeapps �n node1 

Example 5. Check status RAC database
$ svrctl status database -d orcl

srvctl <command> <target> [options]
command: enable|disable|start|stop|relocate|status|add|remove|modify|getenv|setenv|unsetenv|config
target: database, instance, service, ASM instance, or the nodeapps

srvctl commands, will affect the state information in the OCR


-- Example 1: To view help:

% svrctl -h
% svrctl command -h

-- Example 2: To see the SRVCTL version number, enter

% svrctl -V

-- Example 3. Bring up the MYSID1 instance of the MYSID database.

% srvctl start instance -d MYSID -i MYSID1

-- Example 4. Stop the MYSID database: all its instances and all its services, on all nodes.

% srvctl stop database -d MYSID

The following command mounts all of the non-running instances, using the default connection information:

% srvctl start database -d orcl -o mount

-- Example 5. Stop the nodeapps on the myserver node. NB: Instances and services also stop.

% srvctl stop nodeapps -n myserver

-- Example 6. Add the MYSID3 instance, which runs on the myserver node, to the MYSID clustered database.

% srvctl add instance -d MYSID -i MYSID3 -n myserver

-- Example 7. Add a new node, the mynewserver node, to a cluster.

% srvctl add nodeapps -n mynewserver -o $ORACLE_HOME -A 149.181.201.1/255.255.255.0/eth1
(The -A flag precedes an address specification.)

-- Example 8. To change the VIP (virtual IP) on a RAC node, use the command

% srvctl modify nodeapps -A new_address

-- Example 9. Status of components 

. Find out whether the nodeapps on mynewserver are up.

 % srvctl status nodeapps -n mynewserver
  VIP is running on node: mynewserver
  GSD is running on node: mynewserver
  Listener is not running on node: mynewserver
  ONS daemon is running on node: mynewserver

. Find out whether the ASM  is running:

  % srvctl status asm -n docrac1
  ASM instance +ASM1 is running on node docrac1.

. Find status of cluster database

  % srvctl status database -d EOPP
  Instance EOPP1 is running on node dbq0201
  Instance EOPP2 is running on node dbq0102

  % srvctl config database -d EOPP
  dbq0201 EOPP1 /ora/product/10.2.0/db
  dbq0102 EOPP2 /ora/product/10.2.0/db

  % srvctl config service -d EOPP
  opp.et.supp PREF: EOPP1 AVAIL: EOPP2
  opp.et.grid PREF: EOPP1 AVAIL: EOPP2


-- Example 10. The following command and output show the expected configuration for a three node 
               database called ORCL.

% srvctl config database -d ORCL

server01 ORCL1 /u01/app/oracle/product/10.1.0/db_1
server02 ORCL2 /u01/app/oracle/product/10.1.0/db_1
server03 ORCL3 /u01/app/oracle/product/10.1.0/db_1


-- Example 11. Disable the ASM instance on myserver for maintenance.

% srvctl disable asm -n myserver


-- Example 12. Debugging srvctl

Debugging srvctl in 10g couldn't be easier. Simply set the SRVM_TRACE environment variable.

% export SRVM_TRACE=true


-- Example 13. Question Version 10G RAC

Q: how to add a listener to the nodeapps using the srvctl command ??
or even if it can be added using srvctl ??

A: just edit listener.ora on all concerned nodes and add entries ( the usual way).
srvctl will automatically make use of it.
For example

% srvctl start database -d SAMPLE

will start database SAMPLE and its associated listener LSNR_SAMPLE. 


-- Example 14. Adding services.

% srvctl add database -d ORCL -o /u01/app/oracle/product/10.1.0/db_1
% srvctl add instance -d ORCL -i ORCL1 -n server01
% srvctl add instance -d ORCL -i ORCL2 -n server02
% srvctl add instance -d ORCL -i ORCL3 -n server03

% srvctl remove instance -d rac -i rac3
% srvctl disable instance -d orcl -i orcl2
% srvctl enable instance -d orcl -i orcl2 


-- Example 15. Administering ASM Instances with SRVCTL in RAC

You can use SRVCTL to add, remove, enable, and disable an ASM instance as described in the following procedure:

Use the following to add configuration information about an existing ASM instance:
% srvctl add asm -n node_name -i asm_instance_name -o oracle_home

Use the following to remove an ASM instance:
% srvctl remove asm -n node_name [-i asm_instance_name]

-- Example 16. Stop multiple instances.

The following command provides its own connection information to shut down the two instances orcl3 and orcl4
using the IMMEDIATE option:

% srvctl stop instance -d orcl -i "orcl3,orcl4" -o immediate -c "sysback/oracle as sysoper" 

-- Example 17. Showing policies.

Clusterware can automatically start your RAC database when the system restarts.
You can use Automatic or Manual "policies", to control whether clusterware restarts RAC.

To display the current policy:

% srvctl config database -d database_name -a

To change to another policy:

% srvctl modify database -d database_name -y policy_name

-- Example 18.

% srvctl start service -d DITOB

-- Example 19.

Relocate a service from one instance to another

% srvctl relocate  service -d ORACLE -s CRM -i RAC04 -t RAC01

-- Example 20.

Suppose you defined the HR service using the following command:
% srvctl add service -d RACDB -s HR -r RAC02,RAC03 -a RAC01

After a while you realize that there is a workload peak on the HR service,
and you decide to temporarily start HR on the RAC04 instance.

% srvctl modify service -d RACDB -s HR -i RAC02,RAC03,RAC04 -a ROC01
% srvctl stop service -d RACDB -s HR
% srvctl start service -d RACDB -s HR

-- Example 21.

How to disable oracle autostart function in RAC? 

You should be able to modify the service with:

srvctl modify database -d <db_name> -y MANUAL 


9.6: crsctl:
============

crsctl is the primary tool to manipulate CRS.

Example 1: Where is the Voting Disk (part of CRS) located?
$ crsctl query css votedisk
0. 0 /dev/raw/raw2 

Example 2: Do you want to check the health of the Clusterware?
$ crsctl check crs
CSS appears healthy
CRS appears healthy
EVM appears healthy

Example 3: start or stop or enable or disable CRS
$ crsctl start crs
$ crsctl stop crs
$ crsctl enable crs
$ crsctl disable crs

Example 4: Checking CRS Status on local node:
[root@node1]# crsctl check crs
Cluster Synchronization Services appears healthy
Cluster Ready Services appears healthy
Event Manager appears healthy

Example 5: Checking CRS Status in cluster
[root@node1]# crsctl check cluster
node1-pub    ONLINE
node2-pub    ONLINE 


Use CRSCTL to Control Your Clusterware

Oracle Clusterware enables servers in an Oracle database Real Application Cluster to coordinate simultaneous 
workload on the same database files. The crsctl command provides administrators many useful capabilities. 
For example, with crsctl, you can check Clusterware health disable/enable Oracle Clusterware startup on boot, 
find information on the voting disk and check the Clusterware version, and more.

>>> 1. Check the health of the Clusterware
# crsctl check crs
CSS appears healthy
CRS appears healthy
EVM appears healthy

You can also check the status of an individual daemon using the following syntax, where daemon is one of crsd, cssd, or evmd:

# crsctl check daemon


>>> 2. Do you want to reboot a node for maintenance without Clusterware coming up on boot?
## Disable clusterware on machine2 bootup:
# crsctl disable crs
## Stop the database then stop clusterware processes:
# srvctl stop instance �d db �i db2
# crsctl stop crs
# reboot 

## Enable clusterware on machine bootup:
# crsctl enable crs
# crsctl start crs
# srvctl start instance �d db �i db2 


>>> 3. Check the location of the voting disk:
# crsctl query css votedisk
0. 0 /dev/raw/raw2 


>>> 4. Do you need to find out what clusterware version is running on a server?
# crsctl query crs softwareversion
CRS software version on node [db2] is [10.2.0.2.0]


9.7: Adding and Removing Voting Disks:
======================================

You can dynamically add and remove voting disks after installing Oracle RAC. Do this using the following 
commands where path is the fully qualified path for the additional voting disk. Run the following command 
as the root user to add a voting disk:

# crsctl add css votedisk path

Run the following command as the root user to remove a voting disk:

# crsctl delete css votedisk path


9.9 CLUVFY:
===========

The Cluster Verification Utility pre or post validates an Oracle Clusterware environment or configuration.  
We found the CVU utility to be very useful for checking a cluster server environment for RAC. 
The CVU can check shared storage, interconnects, server systems and user permissions. The Universal Installer runs 
the verification utility at the end of the cluster ware install. The utility can also be run from the command line 
with parameters and options to validate components. 
 
For example, a script that verifies a cluster using cluvfy is named runcluvfy.sh and is located on 
the /clusterware/cluvfy directory in the installation area. This script unpacks the utility, sets environment 
variables and executes the verification command.
 
This command verifies that the hosts atlanta1, atlanta2 and atlanta3 are ready for a clustered database 
install of release 2.
 
./runcluvfy.sh stage -pre dbinst -n atlanta1,atlanta2,atlanta3 -r 10gR2 -osdba dba �verbose
 
The results of the command above check user and group equivalence across machines, connectivity, 
interface settings, system requirements like memory, disk space and kernel settings and versions, 
required Linux package existence and so on. Any problems are reported as errors, all successful 
checks are marked as passed.
 
Many other aspects of the cluster can be verified with this utility for Release 2 or Release 1.

Some more examples:

-- Checking for Available Shared Storage with CVU
To check for all shared file systems available across all nodes on the cluster, use the following CVU command:

% cluvfy comp ssa -n node_list

Remember to use the full path name and the runcluvfy.bat command on the installation media and include 
the list of nodes in your cluster, separated by commas, for the node_list. The following example is for 
a system with two nodes, node1 and node2, and the installation media on drive F:

% runcluvfy.bat comp ssa -n node1,node2

If you want to check the shared accessibility of a specific shared storage type to specific nodes 
in your cluster, then use the following command syntax:

% cluvfy comp ssa -n node_list -s storageID_list

In the preceding syntax, the variable node_list is the list of nodes you want to check, separated by commas, 
and the variable storageID_list is the list of storage device IDs for the storage devices managed by the 
file system type that you want to check.


More on Using CLUVFY:
---------------------

Using the Cluster Verification Utility to Diagnose Problems:

The Cluster Verification Utility (CVU) can assist you in diagnosing a wide variety of configuration problems. 

This section contains the following topics:

-- Enabling Tracing

-- Checking the Settings for the Interconnect

-- Troubleshooting a Node with Status of UNKNOWN

-- Verifying the Existence of Node Applications

-- Verifying the Integrity of Oracle Clusterware Components

-- Verifying the Integrity of the Oracle Cluster Registry

-- Verifying the Integrity of Your Entire Cluster

Enabling Tracing:
You can enable tracing by setting the environment variable SRVM_TRACE to true. After setting this variable to true, run the command 
that you want to trace. The CVU trace files are created in the CRS_HOME/cv/log directory. Oracle RAC automatically rotates the log files, 
and the most recently created log file has the name cvutrace.log.0. You should remove unwanted log files or archive them 
to reclaim disk space, if needed. The CVU does not generate trace files unless you enable tracing.

Checking the Settings for the Interconnect:
Cache Fusion enhances the performance of Oracle RAC by utilizing a high-speed interconnect to send data blocks 
to another instance's buffer cache. The high-speed interconnect should be a private network with the highest bandwidth to maximize performance.

For network connectivity verification, the CVU discovers all the available network interfaces if you do not specify 
an interface on the CVU command line.

To verify the accessibility of the cluster nodes from the local node or from any other cluster node, use the component verification 
command nodereach as follows:

$ cluvfy comp nodereach -n node_list [ -srcnode node ] [-verbose]

To verify that the other cluster nodes can be reached from the local node through all the available network interfaces or through 
specific network interfaces, use the component verification command nodecon as follows:

$ cluvfy comp nodecon -n node_list [ -i interface_list ] [-verbose]

You can also use the nodecon command without the -i option, as shown in the following example:

$ cluvfy comp nodecon -n all [-verbose]

When you issue the nodecom command as shown in the previous example, it instructs the CVU to perform the following tasks:

. Discover all the network interfaces that are available on the cluster nodes.
. Review the corresponding IP addresses and subnets for the interfaces.
. Obtain the list of interfaces that are suitable for use as VIPs and the list of interfaces to private interconnects.
. Verify the connectivity among all the nodes through those interfaces.

You can run the nodecon command in verbose mode to identify the mappings between the interfaces, IP addresses, and subnets. 
To verify the connectivity among the nodes through specific network interfaces, use the comp nodecon command with the -i option. 
For example, you can verify the connectivity among the nodes docrac1, docrac2, and docrac3, through interface eth0 by running the following command:

$ cluvfy comp nodecon -n docrac1, docrac2, docrac3 -i eth0 -verbose

Troubleshooting a Node with Status of UNKNOWN:
If you run the cluvfy command using the -verbose argument and the CVU responds with UNKNOWN for a particular node, then this is because 
the CVU cannot determine whether a check passed or failed. The cause of this could be because a node is not reachable, 
or as a result of any system problem that was occurring on that node at the time that the CVU was performing a check.

The following is a list of possible causes for an UNKNOWN response:

. The node is down.
. Executable files that the CVU requires are missing in the CRS_home/bin directory or the $ORACLE_HOME/bin directory.
. The user account that ran the CVU does not have privileges to run common operating system executable files on the node.
. The node is missing an operating system patch or required package.
. The kernel parameters on that node were not configured correctly and the CVU cannot obtain the operating system 
  resources required to perform its checks.

Verifying the Existence of Node Applications:
To verify the existence of node applications, namely the virtual IP (VIP), Oracle Notification Services (ONS), 
and Global Service Daemon (GSD), on all the nodes, use the CVU comp nodeapp command, using the following syntax:

$ cluvfy comp nodeapp [ -n node_list] [-verbose]

Verifying the Integrity of Oracle Clusterware Components:
To verify the existence of all the Oracle Clusterware components, use the component verification comp crs command, 
using the following syntax:

$ cluvfy comp crs [ -n node_list] [-verbose]

Verifying the Integrity of the Oracle Cluster Registry
To verify the integrity of the Oracle Cluster Registry, use the component verification comp ocr command, using the following syntax:

$ cluvfy comp ocr [ -n node_list] [-verbose]

Verifying the Integrity of Your Entire Cluster
To verify that all nodes in the cluster have the same view of the cluster configuration, use the component verification comp clu command, as follows:

$ cluvfy comp clu

 - cluvfy The Cluster Verification Utility
This tool has a very extended syntax and usage. You can use it to verify your systems before, during, and after installations as a sanity check..

Some examples that should give you an impression:
$ ./runcluvfy.sh stage �post hwos �n node1,node2 -verbose
$ ./runcluvfy.sh stage -pre crsinst -n docrac1,docrac2 -verbose
You can easily make a whole day study, of all ins and outs of this tool.


9.9 Further checking:
=====================

Use the following command to obtain component names, where module_name is crs, evm, css or the name of the module:

# crsctl lsmodules module_name

For example, viewing the components of the css module might return the following results:

# crsctl lsmodules css
The following are the CSS modules :: 
CSSD
COMMCRS
COMMNS

>>> Enabling Debugging of Oracle Clusterware Components:

You can enable debugging for the Oracle Cluster daemons, Event Manager (EVM), and their modules by running crsctl commands as follows, 
where component_name is the name of an Oracle Clusterware component for which you want to enable debugging, 
such as crs, evm, or css, module is the name of module as it appears in the output for the crcstl lsmodules command, 
and debugging_level is a number from 1 to 5:

# crsctl debug log component module:debugging_level

For example, to enable tracing for the CSSD module of the css component, you could use the following command:

# crsctl debug log css CSSD:1

To enable extra debugging on the currently running CRS daemons as well as those that will run in the future:

# crsctl debug log crs

>>> Enabling Debugging for an Oracle Clusterware Resource:

You can use crsctl commands to enable resource debugging using the following syntax, where resource_name is the name 
of an Oracle Clusterware resource, such as ora.docrac1.vip, and debugging_level is a number from 1 to 5:

# crsctl debug log res resource_name:debugging_level

To obtain a list of the resources available for debugging, use the following command:

# crs_stat


By default, Oracle enables traces for DBCA and the Database Upgrade Assistant (DBUA). 
For the CVU, GSDCTL, SRVCTL, and VIPCA, you can set the SRVM_TRACE environment variable to TRUE 
to make Oracle generate traces. Oracle writes traces to log files. For example, Oracle writes traces 
to log files in Oracle home/cfgtoollogs/dbca and Oracle home/cfgtoollogs/dbua for DBCA 
and the Database Upgrade Assistant (DBUA) respectively.


=================================
10: Example tnsnames.ora in RAC
=================================


Example 1:
----------

Lets take an example of a RAC service called SVC with two instances SVC1 and SVC2 running on host1 and host2 
(with virtual addresses host1_vip and host2_vip). The client tnsnames would look something like this:

SVC =(DESCRIPTION =     
      (ADDRESS = (PROTOCOL = TCP)(HOST = host1_vip)(PORT = 1521))     
      (ADDRESS = (PROTOCOL = TCP)(HOST = host2_vip)(PORT = 1521))     
      (LOAD_BALANCE = yes)     
      (CONNECT_DATA =       
      (SERVER = DEDICATED) (SERVICE_NAME = SVC)      
      (FAILOVER_MODE=(TYPE=select)(METHOD=basic)(RETRIES=10)(DELAY=1))))


Example 2:
----------

tnsnames.ora File


RAC =
(DESCRIPTION =
(LOAD_BALANCE = ON)
(ADDRESS_LIST =
(ADDRESS = (PROTOCOL = TCP)(HOST = linux1_vip)(PORT = 1521))
(ADDRESS = (PROTOCOL = TCP)(HOST = linux2_vip)(PORT = 1521))
(CONNECT_DATA =
(SERVICE_NAME = RAC)))

RAC1 =
(DESCRIPTION =
(ADDRESS_LIST =
(LOAD_BALANCE = ON)
(ADDRESS = (PROTOCOL = TCP)(HOST = linux1_vip)(PORT = 1521))
(CONNECT_DATA =
(SERVICE_NAME = TEST)(INSTANCE_NAME = RAC1)))

RAC2 =
(DESCRIPTION =
(ADDRESS_LIST =
(LOAD_BALANCE = ON)
(ADDRESS = (PROTOCOL = TCP)(HOST = linux2_vip)(PORT = 1521))
(CONNECT_DATA =
(SERVICE_NAME = TEST)(INSTANCE_NAME = RAC2)))

The entries RAC1 and RAC2 are optional.


Example 3:
----------

TEST =
(DESCRIPTION =
(LOAD_BALANCE = ON)
(ADDRESS_LIST =
(ADDRESS = (PROTOCOL = TCP)(HOST = testlinux1)(PORT = 1521))
(ADDRESS = (PROTOCOL = TCP)(HOST = testlinux2)(PORT = 1521)))
(CONNECT_DATA =
(SERVICE_NAME = TEST))))

TEST1 =
(DESCRIPTION =
(ADDRESS_LIST =
(LOAD_BALANCE = ON)
(ADDRESS = (PROTOCOL = TCP)(HOST = testlinux1)(PORT = 1521)))
(CONNECT_DATA =
(SERVICE_NAME = TEST)(INSTANCE_NAME = TEST1)))

TEST2 =
(DESCRIPTION =
(ADDRESS_LIST =
(LOAD_BALANCE = ON)
(ADDRESS = (PROTOCOL = TCP)(HOST = testlinux2)(PORT = 1521)))
(CONNECT_DATA =
(SERVICE_NAME = TEST)(INSTANCE_NAME = TEST2)))

EXTPROC_CONNECTION_DATA =
(DESCRIPTION =
(ADDRESS_LIST =
(ADDRESS = (PROTOCOL = IPC)(KEY = EXTPROC)))
(CONNECT_DATA =
(SID=PLSExtProc)(PRESENTATION = RO)))
 
LISTENERS_TEST =
(ADDRESS = (PROTOCOL = TCP)(HOST = testlinux1)(PORT = 1521))
(ADDRESS = (PROTOCOL = TCP)(HOST = testlinux2)(PORT = 1521))


Example 4:
----------

TAF (Transparent Application Failover)
Transparent Application Failover actually refers to a failover that occurs when a node or instance 
is unavailable due to an outage or other reason that prohibits a connection to be established on that node. 
This can be set to on with the following parameter FAILOVER. Setting it to ON will activate the TAF. 
It is turned on by default unless you set it to OFF to disable it. Now, when you turn it on you have two types 
of connections available by the means of the FAILOVER_MODE parameter. The type can be session, which is default 
or select. When the type is SESSION, if the instance fails, then the user is automatically connected to the next 
available node without the user�s manual intervention. The SQL statements need to be carried out again 
on the next node. However, when you set the TYPE to SELECT, then if you are connected and are in the middle 
of your query, then your query will be restarted after you have been failed over to the next available node. 
Take this example of our tnsnames.ora file, (go to the section beginning with CONNECT_DATA):

 (CONNECT_DATA =
      (SERVER = DEDICATED)
      (SERVICE_NAME = fokerac.wolga.com)
      (FAILOVER_MODE =
        (TYPE = SELECT)
        (METHOD = BASIC)
	(RETRIES = 180)
	(DELAY = 5)
      )
  )


==============================================
11: Notes about Backup and Restore of RAC
==============================================


11.1 Backing up Voting Disk:
---------------------------

Run the following command to backup a voting disk. Perform this operation on every voting disk
as needed where 'voting_disk_name' is the name of the active voting disk, and 'backup_file_name'
is the name of the file to which you want to backup the voting disk contents:

# dd if=voting_disk_name of=backup_file_name

When you use the dd command for making backups of the voting disk, the backup can be performed while 
the Cluster Ready Services (CRS) process is active; you do not need to stop the crsd.bin process 
before taking a backup of the voting disk.

-- Adding and Removing Voting Disks
You can dynamically add and remove voting disks after installing Oracle RAC. Do this using the following 
commands where path is the fully qualified path for the additional voting disk. Run the following command 
as the root user to add a voting disk:

# crsctl add css votedisk path

Run the following command as the root user to remove a voting disk:

# crsctl delete css votedisk path


11.2 Recovering Voting Disk:
---------------------------

Run the following command to recover a voting disk where 'backup_file_name'
is the name of the voting disk backupfile, and 'voting_disk_name' is the name of the active
voting disk:

# dd if=backup_file_name of=voting_disk_name


11.3 Backup and Recovery OCR:
----------------------------

Oracle Clusterware includes two important components: the voting disk and the OCR. The voting disk is a file 
that manages information about node membership, and the OCR is a file that manages 
cluster and Oracle RAC database configuration information.


Oracle Clusterware automatically creates OCR backups every 4 hours. At any one time, Oracle Clusterware 
always retains the latest 3 backup copies of the OCR that are 4 hours old, 1 day old, and 1 week old.
This happes on one node, the second of the cluster install.
The auto backup is stored in the CRS_HOME/cdata directory.

You cannot customize the backup frequencies or the number of files that Oracle Clusterware retains. 
You can use any backup software to copy the automatically generated backup files at least once daily 
to a different device from where the primary OCR file resides. The default location for generating backups 
on Red Hat Linux systems is "CRS_home/cdata/cluster_name" where cluster_name is the name of your cluster 
and CRS_home is the home directory of your Oracle Clusterware installation.


-- Viewing Available OCR Backups
To find the most recent backup of the OCR, on any node in the cluster, use the following command:

# ocrconfig -showbackup

-- Backing Up the OCR
Because of the importance of OCR information, Oracle recommends that you use the ocrconfig tool to make copies 
of the automatically created backup files at least once a day.

In addition to using the automatically created OCR backup files, you should also export the OCR contents 
to a file before and after making significant configuration changes, such as adding or deleting nodes 
from your environment, modifying Oracle Clusterware resources, or creating a database. 
Exporting the OCR contents to a file lets you restore the OCR if your configuration changes cause errors. 
For example, if you have unresolvable configuration problems, or if you are unable to restart your cluster database 
after such changes, then you can restore your configuration by importing the saved OCR content 
from the valid configuration.

>>> Export the contents of the OCR to a file:
---------------------------------------------

To export the contents of the OCR to a file, use the following command, where backup_file_name is the name 
of the OCR backup file you want to create:

# ocrconfig -export backup_file_name


>>> Recovering the OCR:
-----------------------

This section describes two methods for recovering the OCR. The first method uses automatically generated 
OCR file copies and the second method uses manually created OCR export files.

In event of a failure, before you attempt to restore the OCR, ensure that the OCR is unavailable. 
Run the following command to check the status of the OCR:

# ocrcheck 

This command does an Integrety Check of the OCR.
If this command does not display the message 'Device/File integrity check succeeded' for at least one copy 
of the OCR, then both the primary OCR and the OCR mirror have failed. You must restore the OCR from a backup.

Example:

CRS > ocrcheck
Status of Oracle Cluster Registry is as follows :
Version : 2
Total space (kbytes) : 102508
Used space (kbytes) : 5360
Available space (kbytes) : 97148
ID : 1255672694
Device/File Name : /dev/dba_ocr_disk
Device/File integrity check succeeded

Device/File not configured

Cluster registry integrity check succeeded


>>> Restoring the Oracle Cluster Registry from Automatically Generated OCR Backups:
-----------------------------------------------------------------------------------

When restoring the OCR from automatically generated backups, you first have to determine which backup file 
you will use for the recovery.

To restore the OCR from an automatically generated backup on a Red Hat Linux system:

Identify the available OCR backups using the ocrconfig command:

# ocrconfig -showbackup

Note:

You must be logged in as the root user to run the ocrconfig command.

Review the contents of the backup using the following ocrdump command, where file_name is the name 
of the OCR backup file:

# ocrdump -backupfile file_name

As the root user, stop Oracle Clusterware on all the nodes in your Oracle RAC cluster by executing 
the following command:

# crsctl stop crs

Repeat this command on each node in your Oracle RAC cluster.

As the root user, restore the OCR by applying an OCR backup file that you identified in step 1 
using the following command, where file_name is the name of the OCR that you want to restore. 
Make sure that the OCR devices that you specify in the OCR configuration exist, and that these OCR devices 
are valid before running this command.

# ocrconfig -restore file_name

As the root user, restart Oracle Clusterware on all the nodes in your cluster by restarting each node, 
or by running the following command:

# crsctl start crs

Repeat this command on each node in your Oracle RAC cluster.

Use the Cluster Verify Utility (CVU) to verify the OCR integrity. Run the following command, 
where the -n all argument retrieves a list of all the cluster nodes that are configured as part of your cluster:

$ cluvfy comp ocr -n all [-verbose]


>>> Recovering the OCR from an OCR Export File:
-----------------------------------------------

Using the ocrconfig -export command enables you to restore the OCR using the -import option if your 
configuration changes cause errors.

To restore the previous configuration stored in the OCR from an OCR export file:

Place the OCR export file that you created previously with the ocrconfig -export command in an accessible 
directory on disk.

As the root user, stop Oracle Clusterware on all the nodes in your Oracle RAC cluster by executing 
the following command:

# crsctl stop crs

Repeat this command on each node in your Oracle RAC cluster.

As the root user, restore the OCR data by importing the contents of the OCR export file using the 
following command, where file_name is the name of the OCR export file:

# ocrconfig -import file_name

As the root user, restart Oracle Clusterware on all the nodes in your cluster by restarting each node, 
or by running the following command:

# crsctl start crs

Repeat this command on each node in your Oracle RAC cluster.


>> >Replacing an OCR:
---------------------

If you need to change the location of an existing OCR, or change the location of a failed OCR to the location of a working one, 
you can use the following procedure as long as one OCR file remains online.

To change the location of an OCR:

Use the OCRCHECK utility to verify that a copy of the OCR other than the one you are going to replace is online using the following command:

$ ocrcheck 

  Note:

  The OCR that you are replacing can be either online or offline.

Verify that Oracle Clusterware is running on the node on which the you are going to perform the replace operation using the following command:

$ crsctl check crs

Run the following command to replace the OCR using either destination_file or disk to indicate the target OCR:

# ocrconfig -replace ocr destination_file
# ocrconfig -replace ocr disk

Run the following command to replace an OCR mirror location using either destination_file or disk to indicate the target OCR:

# ocrconfig -replace ocrmirror destination_file
# ocrconfig -replace ocrmirror disk

If any node that is part of your current Oracle RAC environment is shut down, then run the following command on the stopped node 
to let that node rejoin the cluster after the node is restarted:

# ocrconfig -repair


You may need to repair an OCR configuration on a particular node if your OCR configuration changes while that node is stopped. 
For example, you may need to repair the OCR on a node that was not up while you were adding, replacing, or
removing an OCR. To repair an OCR configuration, run the following command on the node on which you have stopped the Oracle Clusterware daemon:

ocrconfig -repair ocrmirror device_name

This operation only changes the OCR configuration on the node from which you run this command. For example, if the OCR mirror 
device name is /dev/raw1, then use the command syntax ocrconfig -repair ocrmirror /dev/raw1 on this node to repair its OCR configuration.


>>> Add or remove Voting Disk:
------------------------------

You can dynamically add and remove voting disks after installing Oracle RAC. Do this using the following commands 
where path is the fully qualified path for the additional voting disk. Run the following command as the root user to add a voting disk:

# crsctl add css votedisk path
Run the following command as the root user to remove a voting disk:

# crsctl delete css votedisk path


Use the CVU to verify the OCR integrity. Run the following command, where the -n all argument retrieves 
a list of all the cluster nodes that are configured as part of your cluster:

$ cluvfy comp ocr -n all [-verbose]


===================================
12. Adding Nodes and Instances:
===================================


12.1 Adding a node:
===================


This chapter describes how to add nodes and instances in Oracle Real Application Clusters (Oracle RAC) environments. 
You can use these methods when configuring a new Oracle RAC cluster, or when scaling up an existing Oracle RAC cluster.

This chapter includes the following sections:

A: Preparing Access to the New Node

B: Extending the Oracle Clusterware Home Directory

C: Extending the Oracle Automatic Storage Management Home Directory

D: Extending the Oracle RAC Software Home Directory

E: Creating a Listener on the New Node

F: Adding a New Cluster Instance on the New Node

For this chapter, it is very important that you perform each step in the order shown.

Suppose we want to add the node docrac3 to the existing cluster of the systems docrac1 en docrac2.


>>> A: Preparing Access to the New Node

To prepare the new node prior to installing the Oracle software, refer to Chapter 2, "Preparing Your Cluster".
It is critical that you follow the configuration steps for the following procedures to work. These steps include, 
but are not limited to the following:

Adding the public and private node names for the new node to the /etc/hosts file on the existing nodes, docrac1 and docrac2

Verifying the new node can be accessed (using the ping command) from the existing nodes

Running the following command on either docrac1 or docrac2 to verify the new node has been properly configured:

# cluvfy stage -pre crsinst -n docrac3


>>> B: Extending the Oracle Clusterware Home Directory

Now that the new node has been configured to support Oracle Clusterware, you use Oracle Universal Installer (OUI) 
to add an Oracle Clusterware home to the node being added to your Oracle RAC cluster. This chapter assumes that you are adding 
a node named docrac3 and that you have already successfully installed Oracle Clusterware on docrac1 in a nonshared home, 
where CRS_home represents the successfully installed Oracle Clusterware home.

To extend the Oracle Clusterware installation to include the new node:

1. Verify the $ORACLE_HOME environment variable on docrac1 directs you to the successfully installed Oracle Clusterware home on that node.

2. Go to CRS_home/oui/bin and run the addNode.sh script.

cd /opt/oracle/crs/oui/bin
./addNode.sh

OUI starts and first displays the Welcome window.

3. Click Next.

The Specify Cluster Nodes to Add to Installation window appears.

4. Select the node or nodes that you want to add. After selecting docrac3, click Next.

5. Verify the entries that OUI displays on the Summary Page and click Next.

6. Run the rootaddNode.sh script from the CRS_home/install/ directory on docrac1 when prompted to do so.

   Basically, this script adds the node applications of the new node to the OCR configuration.

7. Run the orainstRoot.sh script on the node docrac3 if OUI prompts you to do so.

8. Run the CRS_home/root.sh script on the node docrac3 to start Oracle Clusterware on the new node.

9. Add the new node's Oracle Notification Services (ONS) configuration information to the shared Oracle 
   Cluster Registry (OCR). Obtain the ONS port identifier used by the new node, which you need to know 
   for the next step, by running the following command from the CRS_home/opmn/conf directory on the 
   docrac1 node:

   cat ons.config

   After you locate the ONS port number for the new node, you must make sure that the ONS on 
   docrac1 can communicate with the ONS on the new node, docrac3.

10. From the CRS_home/bin directory on the node docrac1, run the Oracle Notification Services configuration utility
    as shown in the following example, where remote_port is the port number from step 9, and docrac3 is the name 
    of the node that you are adding:

    ./racgons add_config docrac3:remote_port

    At the end of the cloning process, you should have Oracle Clusterware running on the new node. 
    To verify the installation of Oracle Clusterware on the new node, you can run the following command 
    as the root user on the newly configured node, docrac3:

    CRS_home/bin/cluvfy stage -post crsinst -n docrac3 -verbose


>>> C: Extending the Oracle Automatic Storage Management Home Directory

To extend an existing Oracle RAC database to a new node, you must configure the shared storage 
for the new database instances that will be created on new node. You must configure access to the same 
shared storage that is already used by the existing database instances in the cluster. For example, 
the sales cluster database in this guide uses Oracle Automatic Storage Management (ASM) for the database 
shared storage, so you must configure ASM on the node being added to the cluster.

Because you installed ASM in its own home directory, you must configure an ASM home on the new node using OUI. 
The procedure for adding an ASM home to the new node is very similar to the procedure you just completed 
for extending Oracle Clusterware to the new node.

To extend the ASM installation to include the new node:

1. Ensure that you have successfully installed the ASM software on at least one node in your cluster environment. 
   To use these procedures as shown, your $ASM_HOME environment variable must identify your successfully 
   installed ASM home directory.

2. Go to the $ASM_HOME/oui/bin directory on docrac1 and run the addNode.sh script.

3. When OUI displays the Node Selection window, select the node to be added (docrac3), then click Next.

4. Verify the entries that OUI displays on the Summary window, then click Next.

5. Run the root.sh script on the new node, docrac3, from the ASM home directory on that node when OUI prompts you to do so.

You now have a copy of the ASM software on the new node.


>>> D: Extending the Oracle RAC Software Home Directory

Now that you have extended the Oracle Clusterware and ASM homes to the new node, you must extend the Oracle Database home 
on docrac1 to docrac3. The following steps assume that you have already completed the previous tasks described in this chapter, 
and that docrac3 is already a member node of the cluster to which docrac1 belongs.

The procedure for adding an Oracle RAC home to the new node is very similar to the procedure 
you just completed for extending ASM to the new node.

To extend the Oracle RAC installation to include the new node:

1. Ensure that you have successfully installed the Oracle RAC software on at least one node in your 
   cluster environment. To use these procedures as shown, your $ORACLE_HOME environment variable 
   must identify your successfully installed Oracle RAC home directory.

2. Go to the $ORACLE_HOME/oui/bin directory on docrac1 and run the addNode.sh script.

3. When OUI displays the Specify Cluster Nodes to Add to Installation window, select the node to be added (docrac3), then click Next.

4. Verify the entries that OUI displays in the Cluster Node Addition Summary window, then click Next.

5. Run the root.sh script on the new node, docrac3, from the $ORACLE_HOME directory on that node when OUI prompts you to do so.


After completing these steps, you should have an installed Oracle RAC home on the new node.


>>> E: Creating a Listener on the New Node

To service database instance connection requests on the new node, you must create a Listener on that node. 
Use the Oracle Net Configuration Assistant (NETCA) to create a Listener on the new node. 
Before beginning this procedure, ensure that your existing nodes have the $ORACLE_HOME environment variable 
set correctly.

To create a new Listener on the new node using Oracle Net Configuration Assistant:

1. Start the Oracle Net Configuration Assistant by entering netca at the system prompt from the 
   $ORACLE_HOME/bin directory.

   NETCA displays the Welcome window. Click Help on any NETCA window for additional information.

2. Select Listener configuration, and click Next.

   NETCA displays the Listener Configuration, Listener window.

3. Select Add to create a new Listener, then click Next.

   NETCA displays the Listener Configuration, Listener Name window.

4. Accept the default value of LISTENER for the Listener name by clicking Next.

   NETCA displays the Listener Configuration, Select Protocols window.

5. Choose TCP and move it to the Selected Protocols area, then click Next.

   NETCA displays the Listener Configuration, TCP/IP Protocol window.

6. Choose Use the standard port number of 1521, then click Next.

   NETCA displays the Real Application Clusters window.

7. Select Cluster configuration for the type of configuration to perform, then click Next.

   NETCA displays the Real Application Clusters, Active Nodes window.

8. Select the name of the node you are adding, for example docrac3, then click Next.

   NETCA creates a Listener using the configuration information provided. You can now exit NETCA.

You should now have a Listener named LISTENER running on the new node.


>>> F: Adding a New Cluster Instance on the New Node

You can use the Oracle Database Configuration Assistant (DBCA) to add database instances to new nodes. 
Before beginning this procedure, ensure that your existing nodes have the $ORACLE_HOME environment variable 
set correctly.

To create a new cluster instance on the new node using DBCA:

1. Start DBCA by entering dbca at the system prompt from the $ORACLE_HOME/bin directory.

   DBCA displays the Welcome window for Oracle RAC. Click Help on any DBCA page for additional information.

2. Select Oracle Real Application Clusters database, and then click Next.

   DBCA displays the Operations window.

3. Select Instance Management, and then click Next.

   DBCA displays the Instance Management window.

4. Select Add an Instance, then click Next.

   DBCA displays the List of Cluster Databases window, which shows the databases and their current status, 
   such as ACTIVE or INACTIVE.

5. In the List of Cluster Databases window, select the active Oracle RAC database to which you want to add an instance, 
   for example sales. Enter the user name and password for the database user that has SYSDBA privileges. Click Next.

   DBCA will spend a few minutes performing tasks in the background, then it will display the 
   Instance naming and node selection window.

6. In the Instance naming and node selection window, enter the instance name in the field at the top of this window 
   if the default instance name provided by DBCA does not match your existing instance naming scheme. 
   For example, instead of the sales3 instance, you might want to create the sales_03 instance.

   Click Next to accept the default instance name of sales3.

   DBCA displays the Instance Storage window.

7. In the Instance Storage window, you have the option of changing the default storage options and file locations 
   for the new database instance. In this example, you accept all the default values and click Finish.

   DBCA displays the Summary window.

8. Review the information in the Summary window, then click OK to start the database instance addition operation. 
   DBCA displays a progress dialog box showing DBCA performing the instance addition operation.

9. During the instance addition operation, if you are using ASM for your cluster database storage, 
   DBCA detects the need for a new ASM instance on the new node.

   When DBCA displays a dialog box, asking if you want to ASM to be extended, click Yes.

   After DBCA extends ASM on the new node and completes the instance addition operation, DBCA displays a dialog box 
   asking whether or not you want to perform another operation. Click No to exit DBCA.

You should now have a new cluster database instance and ASM instance running on the new node. 
After you terminate your DBCA session, you should run the following command to verify the administrative privileges 
on the new node and obtain detailed information about these privileges:

# CRS_home/bin/cluvfy comp admprv -o db_config -d oracle_home -n docrac3 -verbose


12.2 Deleting a Node:
=====================

Suppose the 4th node needs to be removed.

1. Delete the Instance on the Node with DBCA.
2. Clean up ASM

Go to your first node

$ srvctl stop asm -n rac4
$ srvctl remove asm -n rac4

3. Goto node 4 and rm -rf the ASM software

4. On node  4, remove the listener with NETCA

5. Remove the node from the Database

On node 4:

$ ./runInstaller -updateNodeList ORACLE_HOME=$ORACLE_HOME "CLUSTER_NODES={rac4}" -local
Choose to deinstall 

On node 1:

$ ./runInstaller -updateNodeList ORACLE_HOME=$ORACLE_HOME "CLUSTER_NODES={rac1,rac2,rac3}"

6. Remove the node from clusterware:

On node 1:

$ CRS_HOME/bin/racgons remove_config rac4:6200         # choose the right port

On node 4 as root:

# cd $CRS_HOME/install
# ./rootdelete.sh

On node 1 as root:

# olsnodes -n       # to obtain the nodenumber for rac4
# cd $CRS_HOME/install
# ./rootdeletenode rac4,4


==============================
13: Starting and Stopping RAC:
==============================


13.1 Stopping the Cluster:
==========================
						
						
Before you shut down any processes that are monitored by Enterprise Manager Grid Control, set a blackout in 						
Grid Control for the processes that you intend to shut down. This is necessary so that the availability records 						
for these processes indicate that the shutdown was planned downtime, rather than an unplanned system outage.						
Shut down all Oracle RAC instances on all nodes. To shut down all Oracle RAC instances for a database, 						
enter the following command, where db_name is the name of the database:	
					
$ oracle_home/bin/srvctl stop database -d db_name						
						
Shut down all ASM instances on all nodes. To shut down an ASM instance, enter the following command, 						
where node is the name of the node where the ASM instance is running:						

$ oracle_home/bin/srvctl stop asm -n node						
						
Stop all node applications on all nodes. To stop node applications running on a node, enter the following command, 						
where node is the name of the node where the applications are running						

$ oracle_home/bin/srvctl stop nodeapps -n node						
						
Log in as the root user, and shut down the Oracle Clusterware or CRS process by entering the following command 						
on all nodes:						

# CRS_home/bin/crsctl stop crs                        # as root						
						
						
13.2 Starting the Cluster:						
==========================
						
# CRS_home/bin/crsctl start crs                       # as root						
$ oracle_home/bin/srvctl start nodeapps -n node						
$ oracle_home/bin/srvctl start asm -n node						
$ oracle_home/bin/srvctl start database -d db_name    # will start all instances of the Database						
						
						
Example of stopping or starting Services on a RAC node:						
						
# shutdown services on a RAC node						
sudo -u oracle /opt/oracle/product/10.2.0/crs/bin/srvctl stop instance -d p01cfd -i pl01cfd2 -o immediate						
sudo -u oracle /opt/oracle/product/10.2.0/crs/bin/srvctl stop asm -n t-prod-oranode-11						
sudo -u oracle /opt/oracle/product/10.2.0/crs/bin/srvctl stop nodeapps -n t-prod-oranode-11						
						
# startup services on a RAC node						
sudo -u oracle /opt/oracle/product/10.2.0/crs/bin/srvctl start nodeapps -n t-prod-oranode-11						
sudo -u oracle /opt/oracle/product/10.2.0/crs/bin/srvctl start asm -n t-prod-oranode-11						
sudo -u oracle /opt/oracle/product/10.2.0/crs/bin/srvctl start instance -d p01cfd -i pl01cfd2 						
						

====================================
14: Other Noticable items in 10g RAC
====================================


14.1 SPFILE:
============


If an initialization parameter applies to all instances, use *.<parameter> notation, otherwise
prefix the parameter with the name of the instance.
For example:

*.OPEN_CURSORS=500
prod1.OPEN_CURSORS=1000

Assume that you start an instance with an SPFILE containing the following entries:

*.OPEN_CURSORS=500
prod1.OPEN_CURSORS=1000

For the instance with the Oracle system identifier (sid) prod1, the OPEN_CURSORS parameter remains set to 1000 even though it has a 
database-wide setting of 500. The instance-specific parameter setting in the parameter file for an instance prevents 
database-wide alterations of the setting. This gives you control over parameter settings for instance prod1. These two types of settings 
can appear in any order in the parameter file.

If another DBA runs the following statement, then Oracle updates the setting on all instances except the instance with sid prod1:

ALTER SYSTEM SET OPEN_CURSORS=1500 sid='*' SCOPE=MEMORY;

In the example instance with sid prod1, the parameter begins accepting ALTER SYSTEM values set by other instances 
if you change the parameter setting by running the following statement:

ALTER SYSTEM RESET OPEN_CURSORS SCOPE=MEMORY sid='prod1';

Then if you execute the following statement on another instance, the instance with sid prod1 also assumes the new setting of 2000:

ALTER SYSTEM SET OPEN_CURSORS=2000 sid='*' SCOPE=MEMORY;


14.2 Rolling upgrades, opatch :
===============================

>>> Patch:
Oracle issues product fixes for its software called patches. When you apply the patch to your Oracle software installation, 
a small collection of files are replaced to fix certain bugs. 
OPatch is an Oracle supplied utility that facilitates Oracle software patching.

The opatch binary file is located in the $ORACLE_HOME/OPatch directory. You can either specify this path when executing OPatch, 
or you can update the PATH environment variable to include the OPatch directory. For example, on RedHat Linux systems 
you would use a shell command similar to the following:

$ export PATH=$PATH:/opt/oracle/10gR2/db_1/OPatch


>>> Patchset:
A group of patches form a patch set. When you apply a patch set, many different files and utilities are modified. 
This results in a version change for your Oracle software, for example, from Oracle Database 10.2.0.1.0 
to Oracle Database 10.2.0.2.0. To apply a patch set you use the Oracle Universal Installer (OUI).


>>> RAC Rolling Upgrade

"One-off" patches or interim patches to database software are usually applied to implement known fixes for software problems 
an installation has encountered or to apply diagnostic patches to gather information regarding a problem. Such patch application 
is often carried out during a scheduled maintenance outage.

Oracle now provides the capability to do rolling patch upgrades with Real Application Clusters with little or no database downtime. 
The tool used to achieve this is the "opatch" command-line utility.

The advantage of a RAC rolling upgrade is that it enables at least some instances of the RAC installation to be available 
during the scheduled outage required for patch upgrades. Only the RAC instance that is currently being patched 
needs to be brought down. The other instances can continue to remain available. This means that the impact on the application 
downtime required for such scheduled outages is further minimized. Oracle's opatch utility enables the user to apply the patch 
successively to the different instances of the RAC installation.

Rolling upgrade is available only for patches that have been certified by Oracle to be eligible for rolling upgrades. 
Typically, patches that can be installed in a rolling upgrade include:

-Patches that do not affect the contents of the database such as the data dictionary
-Patches not related to RAC internode communication
-Patches related to client-side tools such as SQL*PLUS, Oracle utilities, development libraries, and Oracle Net
-Patches that do not change shared database resources such as datafile headers, control files, and common header definitions of kernel modules

Rolling upgrade of patches is currently available for one-off patches only. It is not available for patch sets.

Rolling patch upgrades are not available for deployments where the Oracle Database software is shared across the different nodes. 
This is the case where the Oracle home is on Cluster File System (CFS) or on shared volumes provided by file servers 
or NFS-mounted drives. The feature is only available where each node has its own copy of the Oracle Database software.

The opatch utility applies a patch successively to nodes of the RAC cluster. The nature of the patch enables a RAC installation 
to run in a mixed environment. Different instances of the database may be operating at the same time, and the patch may have been applied 
to some instances and not others. The opatch utility automatically detects the nodes of the cluster on which a specific RAC deployment 
has been implemented. The patch is applied to each node, one at a time. For each node, the DBA is prompted to shut down the instance. 
The patch is applied to the database software install on that node. After the current node has been patched, 
the instance can be restarted. After the patch is applied on the current node, the DBA is allowed to choose the next RAC node 
to apply the patch to. The cycle of instance shutdown, patch application, and instance startup is repeated. 
Thus, at any time during the patch application, only one node needs to be down.

To check if a patch is a rolling patch, execute the following on UNIX platforms:

$ opatch query -is_rolling

You can check to see whether a patch has been marked rolling upgradeable by running "$ opatch query �is_rolling" or 
by checking the online_rac_installable flag for a value of TRUE within "etc/config/inventory" directory. 
Usually, the patch README file should detail whether the patch is rolling upgradeable, but when in doubt, you can use one of the aforementioned methods.


Patches can be rolled back with the opatch utility. This enables the DBA to remove a troublesome patch or a patch 
that is no longer required. This can be done as a rolling procedure.

To roll back a patch across all nodes of a RAC cluster, execute the following command:

$ opatch rollback -id patch_id -ph patch_location

To roll back a patch on the local node only, enter the following command:

$ opatch rollback -local -id patch_id -ph patch_location

To check the results of a patch rollback, check the logs in the following location:

$ORACLE_HOME/.patch_storage/patch_id/patch_id_RollBack_timestamp.log


14.3 Redo Thread in RAC:
========================

Example:
--------

Suppose you have one RAC database, and two instances: RAC01 and RAC02.
Right now, you have 3 enabled threads of 2 group each:
- thread 3 consisting of log groups 5 and 6 is currently being used by instance RAC01
- thread 2 consisting of log groups 3 and 4 is currently being used by instance RAC02
- thread 1 consisting of log groups 1 and 2 is not being used

Now we want RAC01 to use thread 1 with log groups 1 and 2:

SQL> alter system set thread=1 scope=spfile sid='RAC01';


============================================
15. TROUBLESHOOTING RAC
============================================


Note 1: VIP on wrong Interface:
===============================

I  was installing RAC, and during the clusterware install I picked up the wrong interfaces for public and private. 
I had 10.x.x.x for the public IP on eth0 and 192.x.x.x for the private IP (interconnect) on eth1. 
I also had 10.x.x.x for the VIP. During the install I choose eth1 to be the public interface. Right after the install 
I lost connection to the machine via the 10.x.x.x IP.

What had happened was I had a 10.x.x.x IP on both eth0 and eth1, which was messing up the routing.

The solution? Simply modify the VIP in the cluster configuration. There�s actually a metalink article about this. 
Here are the essential commands:

srvctl stop nodeapps -n NODE1
srvctl stop nodeapps -n NODE2
srvctl modify nodeapps -n NODE1 -A 10.5.5.101/255.255.255.0/eth0
srvctl modify nodeapps -n NODE2 -A 10.5.5.102/255.255.255.0/eth0
srvctl start nodeapps -n NODE1
srvctl start nodeapps -n NODE2

They worked just fine. So if you ever mess up the interfaces, this is how you fix it.
If you need to change the private interface, 
then you need to use oifcfg. To verify your current settings use: 

oifcfg getif
eth0  10.5.5.0  global  public
eth1  192.168.0.0  global  cluster_interconnect

And use delif/addif to remove and re-create your private interface.


Note 2: Troubleshooting and debuging:
=====================================


You can enable debugging for the CRS, OCR, CSS, and EVM modules and their components by setting environment variables 
or by issuing crsctl debug commands using the following syntax:

# crsctl debug log module_name component:debugging_level

Run the following command to obtain component names where module_name is the name of the module, crs, evm, or css:

# crsctl lsmodules module_name

You must issue the crsctl debug command as the root user, and supply the following information:

- module_name�The name of the module: CRS, EVM, or CSS.
- component�The name of a component for the CRS, OCR, EVM, or CSS module. See Table F-1 for a list of all of the components.
- debugging_level�A number from 1 to 5 to indicate the level of detail you want the debug command to return, 
  where 1 is the least amount of debugging output and 5 provides the most detailed debugging output
 

For example, to enable tracing for the CSSD module of the css component, you could use the following command:

# crsctl debug log css CSSD:1

Other examples:
# crsctl debug log crs "CRSRTI:1,CRSCOMM:2"
# crsctl debug log evm "EVMCOMM:1" 
# crsctl debug log crs "CRSRTI:1,CRSCOMM:2,OCRSRV:4"

You can use crsctl commands to enable resource debugging using the following syntax:

# crsctl debug log res "ora.node1.vip:1"

This has the effect of setting the environment variable USER_ORA_DEBUG, to 1, before running 
the start, stop, or check action scripts for the ora.node1.vip resource.


# crsctl debug statedump evm - dumps state info for evm objects
# crsctl debug statedump crs - dumps state info for crs objects
# crsctl debug statedump css - dumps state info for css objects


To enable extra debugging on the currently running CRS daemons as well as those that will run in the future,
use the following command. Debugging information remains in the Oracle Cluster Registry (OCR) for use during the next startup.

# crsctl debug log crs


-- Running the Oracle Clusterware Diagnostics Collection Script

Run the diagcollection.pl script as the root user to collect diagnostic information from an Oracle Clusterware installation. 
The diagnostics provide additional information so that Oracle Support Services can resolve problems. Run this script from 
the operating system prompt as follows, where CRS_home is the home directory of your Oracle Clusterware installation:

# CRS_home/bin/diagcollection.pl --collect

This command displays the status of the Cluster Synchronization Services (CSS), Event Manager (EVM), 
and the Cluster Ready Services (CRS) daemons.

Debugging srvctl in 10g couldn't be easier. Simply set the SRVM_TRACE environment variable.

% export SRVM_TRACE=true


#############################################################################################
#############################################################################################
#############################################################################################


========================================
Section 23. MQ errors and messages:
========================================


Some Common Reason Codes:
=========================

# 0 (0000) (RC0): MQRC_NONE
# 900 (0384) (RC900): MQRC_APPL_FIRST
# 999 (03E7) (RC999): MQRC_APPL_LAST
# 2001 (07D1) (RC2001): MQRC_ALIAS_BASE_Q_TYPE_ERROR
# 2002 (07D2) (RC2002): MQRC_ALREADY_CONNECTED
# 2003 (07D3) (RC2003): MQRC_BACKED_OUT
# 2004 (07D4) (RC2004): MQRC_BUFFER_ERROR
# 2005 (07D5) (RC2005): MQRC_BUFFER_LENGTH_ERROR
# 2006 (07D6) (RC2006): MQRC_CHAR_ATTR_LENGTH_ERROR
# 2007 (07D7) (RC2007): MQRC_CHAR_ATTRS_ERROR
# 2008 (07D8) (RC2008): MQRC_CHAR_ATTRS_TOO_SHORT
# 2009 (07D9) (RC2009): MQRC_CONNECTION_BROKEN
# 2010 (07DA) (RC2010): MQRC_DATA_LENGTH_ERROR
# 2011 (07DB) (RC2011): MQRC_DYNAMIC_Q_NAME_ERROR
# 2012 (07DC) (RC2012): MQRC_ENVIRONMENT_ERROR
# 2013 (07DD) (RC2013): MQRC_EXPIRY_ERROR
# 2014 (07DE) (RC2014): MQRC_FEEDBACK_ERROR
# 2016 (07E0) (RC2016): MQRC_GET_INHIBITED
# 2017 (07E1) (RC2017): MQRC_HANDLE_NOT_AVAILABLE
# 2018 (07E2) (RC2018): MQRC_HCONN_ERROR
# 2019 (07E3) (RC2019): MQRC_HOBJ_ERROR
# 2020 (07E4) (RC2020): MQRC_INHIBIT_VALUE_ERROR
# 2021 (07E5) (RC2021): MQRC_INT_ATTR_COUNT_ERROR
# 2022 (07E6) (RC2022): MQRC_INT_ATTR_COUNT_TOO_SMALL
# 2023 (07E7) (RC2023): MQRC_INT_ATTRS_ARRAY_ERROR
# 2024 (07E8) (RC2024): MQRC_SYNCPOINT_LIMIT_REACHED
# 2025 (07E9) (RC2025): MQRC_MAX_CONNS_LIMIT_REACHED
# 2026 (07EA) (RC2026): MQRC_MD_ERROR
# 2027 (07EB) (RC2027): MQRC_MISSING_REPLY_TO_Q
# 2029 (07ED) (RC2029): MQRC_MSG_TYPE_ERROR
# 2030 (07EE) (RC2030): MQRC_MSG_TOO_BIG_FOR_Q
# 2031 (07EF) (RC2031): MQRC_MSG_TOO_BIG_FOR_Q_MGR
# 2033 (07F1) (RC2033): MQRC_NO_MSG_AVAILABLE
# 2034 (07F2) (RC2034): MQRC_NO_MSG_UNDER_CURSOR
# 2035 (07F3) (RC2035): MQRC_NOT_AUTHORIZED
# 2036 (07F4) (RC2036): MQRC_NOT_OPEN_FOR_BROWSE
# 2037 (07F5) (RC2037): MQRC_NOT_OPEN_FOR_INPUT
# 2038 (07F6) (RC2038): MQRC_NOT_OPEN_FOR_INQUIRE
# 2039 (07F7) (RC2039): MQRC_NOT_OPEN_FOR_OUTPUT
# 2040 (07F8) (RC2040): MQRC_NOT_OPEN_FOR_SET
# 2041 (07F9) (RC2041): MQRC_OBJECT_CHANGED
# 2042 (07FA) (RC2042): MQRC_OBJECT_IN_USE
# 2043 (07FB) (RC2043): MQRC_OBJECT_TYPE_ERROR
# 2044 (07FC) (RC2044): MQRC_OD_ERROR
# 2045 (07FD) (RC2045): MQRC_OPTION_NOT_VALID_FOR_TYPE
# 2046 (07FE) (RC2046): MQRC_OPTIONS_ERROR
# 2047 (07FF) (RC2047): MQRC_PERSISTENCE_ERROR
# 2048 (0800) (RC2048): MQRC_PERSISTENT_NOT_ALLOWED
# 2049 (0801) (RC2049): MQRC_PRIORITY_EXCEEDS_MAXIMUM
# 2050 (0802) (RC2050): MQRC_PRIORITY_ERROR
# 2051 (0803) (RC2051): MQRC_PUT_INHIBITED
# 2052 (0804) (RC2052): MQRC_Q_DELETED
# 2053 (0805) (RC2053): MQRC_Q_FULL
# 2055 (0807) (RC2055): MQRC_Q_NOT_EMPTY
# 2056 (0808) (RC2056): MQRC_Q_SPACE_NOT_AVAILABLE
# 2057 (0809) (RC2057): MQRC_Q_TYPE_ERROR
# 2058 (080A) (RC2058): MQRC_Q_MGR_NAME_ERROR
# 2059 (080B) (RC2059): MQRC_Q_MGR_NOT_AVAILABLE
# 2061 (080D) (RC2061): MQRC_REPORT_OPTIONS_ERROR
# 2062 (080E) (RC2062): MQRC_SECOND_MARK_NOT_ALLOWED
# 2063 (080F) (RC2063): MQRC_SECURITY_ERROR
# 2065 (0811) (RC2065): MQRC_SELECTOR_COUNT_ERROR
# 2066 (0812) (RC2066): MQRC_SELECTOR_LIMIT_EXCEEDED
# 2067 (0813) (RC2067): MQRC_SELECTOR_ERROR
# 2068 (0814) (RC2068): MQRC_SELECTOR_NOT_FOR_TYPE
# 2069 (0815) (RC2069): MQRC_SIGNAL_OUTSTANDING
# 2070 (0816) (RC2070): MQRC_SIGNAL_REQUEST_ACCEPTED
# 2071 (0817) (RC2071): MQRC_STORAGE_NOT_AVAILABLE
# 2072 (0818) (RC2072): MQRC_SYNCPOINT_NOT_AVAILABLE
# 2075 (081B) (RC2075): MQRC_TRIGGER_CONTROL_ERROR
# 2076 (081C) (RC2076): MQRC_TRIGGER_DEPTH_ERROR
# 2077 (081D) (RC2077): MQRC_TRIGGER_MSG_PRIORITY_ERR
# 2078 (081E) (RC2078): MQRC_TRIGGER_TYPE_ERROR
# 2079 (081F) (RC2079): MQRC_TRUNCATED_MSG_ACCEPTED
# 2080 (0820) (RC2080): MQRC_TRUNCATED_MSG_FAILED
# 2082 (0822) (RC2082): MQRC_UNKNOWN_ALIAS_BASE_Q
# 2085 (0825) (RC2085): MQRC_UNKNOWN_OBJECT_NAME
# 2086 (0826) (RC2086): MQRC_UNKNOWN_OBJECT_Q_MGR
# 2087 (0827) (RC2087): MQRC_UNKNOWN_REMOTE_Q_MGR
# 2090 (082A) (RC2090): MQRC_WAIT_INTERVAL_ERROR
# 2091 (082B) (RC2091): MQRC_XMIT_Q_TYPE_ERROR
# 2092 (082C) (RC2092): MQRC_XMIT_Q_USAGE_ERROR
# 2093 (082D) (RC2093): MQRC_NOT_OPEN_FOR_PASS_ALL
# 2094 (082E) (RC2094): MQRC_NOT_OPEN_FOR_PASS_IDENT
# 2095 (082F) (RC2095): MQRC_NOT_OPEN_FOR_SET_ALL
# 2096 (0830) (RC2096): MQRC_NOT_OPEN_FOR_SET_IDENT
# 2097 (0831) (RC2097): MQRC_CONTEXT_HANDLE_ERROR
# 2098 (0832) (RC2098): MQRC_CONTEXT_NOT_AVAILABLE
# 2099 (0833) (RC2099): MQRC_SIGNAL1_ERROR
# 2100 (0834) (RC2100): MQRC_OBJECT_ALREADY_EXISTS
# 2101 (0835) (RC2101): MQRC_OBJECT_DAMAGED
# 2102 (0836) (RC2102): MQRC_RESOURCE_PROBLEM
# 2103 (0837) (RC2103): MQRC_ANOTHER_Q_MGR_CONNECTED
# 2104 (0838) (RC2104): MQRC_UNKNOWN_REPORT_OPTION
# 2105 (0839) (RC2105): MQRC_STORAGE_CLASS_ERROR
# 2106 (083A) (RC2106): MQRC_COD_NOT_VALID_FOR_XCF_Q
# 2107 (083B) (RC2107): MQRC_XWAIT_CANCELED
# 2108 (083C) (RC2108): MQRC_XWAIT_ERROR
# 2109 (083D) (RC2109): MQRC_SUPPRESSED_BY_EXIT
# 2110 (083E) (RC2110): MQRC_FORMAT_ERROR
# 2111 (083F) (RC2111): MQRC_SOURCE_CCSID_ERROR
# 2112 (0840) (RC2112): MQRC_SOURCE_INTEGER_ENC_ERROR
# 2113 (0841) (RC2113): MQRC_SOURCE_DECIMAL_ENC_ERROR
# 2114 (0842) (RC2114): MQRC_SOURCE_FLOAT_ENC_ERROR
# 2115 (0843) (RC2115): MQRC_TARGET_CCSID_ERROR
# 2116 (0844) (RC2116): MQRC_TARGET_INTEGER_ENC_ERROR
# 2117 (0845) (RC2117): MQRC_TARGET_DECIMAL_ENC_ERROR
# 2118 (0846) (RC2118): MQRC_TARGET_FLOAT_ENC_ERROR
# 2119 (0847) (RC2119): MQRC_NOT_CONVERTED
# 2120 (0848) (RC2120): MQRC_CONVERTED_MSG_TOO_BIG
# 2121 (0849) (RC2121): MQRC_NO_EXTERNAL_PARTICIPANTS
# 2122 (084A) (RC2122): MQRC_PARTICIPANT_NOT_AVAILABLE
# 2123 (084B) (RC2123): MQRC_OUTCOME_MIXED
# 2124 (084C) (RC2124): MQRC_OUTCOME_PENDING
# 2125 (084D) (RC2125): MQRC_BRIDGE_STARTED
# 2126 (084E) (RC2126): MQRC_BRIDGE_STOPPED
# 2127 (084F) (RC2127): MQRC_ADAPTER_STORAGE_SHORTAGE
# 2128 (0850) (RC2128): MQRC_UOW_IN_PROGRESS
# 2129 (0851) (RC2129): MQRC_ADAPTER_CONN_LOAD_ERROR
# 2130 (0852) (RC2130): MQRC_ADAPTER_SERV_LOAD_ERROR
# 2131 (0853) (RC2131): MQRC_ADAPTER_DEFS_ERROR
# 2132 (0854) (RC2132): MQRC_ADAPTER_DEFS_LOAD_ERROR
# 2133 (0855) (RC2133): MQRC_ADAPTER_CONV_LOAD_ERROR
# 2134 (0856) (RC2134): MQRC_BO_ERROR
# 2135 (0857) (RC2135): MQRC_DH_ERROR
# 2136 (0858) (RC2136): MQRC_MULTIPLE_REASONS
# 2137 (0859) (RC2137): MQRC_OPEN_FAILED
# 2138 (085A) (RC2138): MQRC_ADAPTER_DISC_LOAD_ERROR
# 2139 (085B) (RC2139): MQRC_CNO_ERROR
# 2140 (085C) (RC2140): MQRC_CICS_WAIT_FAILED
# 2141 (085D) (RC2141): MQRC_DLH_ERROR
# 2142 (085E) (RC2142): MQRC_HEADER_ERROR
# 2143 (085F) (RC2143): MQRC_SOURCE_LENGTH_ERROR
# 2144 (0860) (RC2144): MQRC_TARGET_LENGTH_ERROR
# 2145 (0861) (RC2145): MQRC_SOURCE_BUFFER_ERROR
# 2146 (0862) (RC2146): MQRC_TARGET_BUFFER_ERROR
# 2148 (0864) (RC2148): MQRC_IIH_ERROR
# 2149 (0865) (RC2149): MQRC_PCF_ERROR
# 2150 (0866) (RC2150): MQRC_DBCS_ERROR
# 2152 (0868) (RC2152): MQRC_OBJECT_NAME_ERROR
# 2153 (0869) (RC2153): MQRC_OBJECT_Q_MGR_NAME_ERROR
# 2154 (086A) (RC2154): MQRC_RECS_PRESENT_ERROR
# 2155 (086B) (RC2155): MQRC_OBJECT_RECORDS_ERROR
# 2156 (086C) (RC2156): MQRC_RESPONSE_RECORDS_ERROR
# 2157 (086D) (RC2157): MQRC_ASID_MISMATCH
# 2158 (086E) (RC2158): MQRC_PMO_RECORD_FLAGS_ERROR
# 2159 (086F) (RC2159): MQRC_PUT_MSG_RECORDS_ERROR
# 2160 (0870) (RC2160): MQRC_CONN_ID_IN_USE
# 2161 (0871) (RC2161): MQRC_Q_MGR_QUIESCING
# 2162 (0872) (RC2162): MQRC_Q_MGR_STOPPING
# 2163 (0873) (RC2163): MQRC_DUPLICATE_RECOV_COORD
# 2173 (087D) (RC2173): MQRC_PMO_ERROR
# 2183 (0887) (RC2183): MQRC_API_EXIT_LOAD_ERROR
# 2184 (0888) (RC2184): MQRC_REMOTE_Q_NAME_ERROR
# 2185 (0889) (RC2185): MQRC_INCONSISTENT_PERSISTENCE
# 2186 (088A) (RC2186): MQRC_GMO_ERROR
# 2187 (088B) (RC2187): MQRC_CICS_BRIDGE_RESTRICTION
# 2188 (088C) (RC2188): MQRC_STOPPED_BY_CLUSTER_EXIT
# 2189 (088D) (RC2189): MQRC_CLUSTER_RESOLUTION_ERROR
# 2190 (088E) (RC2190): MQRC_CONVERTED_STRING_TOO_BIG
# 2191 (088F) (RC2191): MQRC_TMC_ERROR
# 2192 (0890) (RC2192): MQRC_PAGESET_FULL
# 2192 (0890) (RC2192): MQRC_STORAGE_MEDIUM_FULL
# 2193 (0891) (RC2193): MQRC_PAGESET_ERROR
# 2194 (0892) (RC2194): MQRC_NAME_NOT_VALID_FOR_TYPE
# 2195 (0893) (RC2195): MQRC_UNEXPECTED_ERROR
# 2196 (0894) (RC2196): MQRC_UNKNOWN_XMIT_Q
# 2197 (0895) (RC2197): MQRC_UNKNOWN_DEF_XMIT_Q
# 2198 (0896) (RC2198): MQRC_DEF_XMIT_Q_TYPE_ERROR
# 2199 (0897) (RC2199): MQRC_DEF_XMIT_Q_USAGE_ERROR
# 2201 (0899) (RC2201): MQRC_NAME_IN_USE
# 2202 (089A) (RC2202): MQRC_CONNECTION_QUIESCING
# 2203 (089B) (RC2203): MQRC_CONNECTION_STOPPING
# 2204 (089C) (RC2204): MQRC_ADAPTER_NOT_AVAILABLE
# 2206 (089E) (RC2206): MQRC_MSG_ID_ERROR
# 2207 (089F) (RC2207): MQRC_CORREL_ID_ERROR
# 2208 (08A0) (RC2208): MQRC_FILE_SYSTEM_ERROR
# 2209 (08A1) (RC2209): MQRC_NO_MSG_LOCKED
# 2210 (08A2) (RC2210): MQRC_SOAP_DOTNET_ERROR
# 2211 (08A3) (RC2211): MQRC_SOAP_AXIS_ERROR
# 2212 (08A4) (RC2212): MQRC_SOAP_URL_ERROR
# 2217 (08A9) (RC2217): MQRC_CONNECTION_NOT_AUTHORIZED
# 2218 (08AA) (RC2218): MQRC_MSG_TOO_BIG_FOR_CHANNEL
# 2219 (08AB) (RC2219): MQRC_CALL_IN_PROGRESS
# 2220 (08AC) (RC2220): MQRC_RMH_ERROR
# 2222 (08AE) (RC2222): MQRC_Q_MGR_ACTIVE
# 2223 (08AF) (RC2223): MQRC_Q_MGR_NOT_ACTIVE
# 2224 (08B0) (RC2224): MQRC_Q_DEPTH_HIGH
# 2225 (08B1) (RC2225): MQRC_Q_DEPTH_LOW
# 2226 (08B2) (RC2226): MQRC_Q_SERVICE_INTERVAL_HIGH
# 2227 (08B3) (RC2227): MQRC_Q_SERVICE_INTERVAL_OK
# 2228 (08B4) (RC2228): MQRC_RFH_HEADER_FIELD_ERROR
# 2229 (08B5) (RC2229): MQRC_RAS_PROPERTY_ERROR
# 2232 (08B8) (RC2232): MQRC_UNIT_OF_WORK_NOT_STARTED
# 2233 (08B9) (RC2233): MQRC_CHANNEL_AUTO_DEF_OK
# 2234 (08BA) (RC2234): MQRC_CHANNEL_AUTO_DEF_ERROR
# 2235 (08BB) (RC2235): MQRC_CFH_ERROR
# 2236 (08BC) (RC2236): MQRC_CFIL_ERROR
# 2237 (08BD) (RC2237): MQRC_CFIN_ERROR
# 2238 (08BE) (RC2238): MQRC_CFSL_ERROR
# 2239 (08BF) (RC2239): MQRC_CFST_ERROR
# 2241 (08C1) (RC2241): MQRC_INCOMPLETE_GROUP
# 2242 (08C2) (RC2242): MQRC_INCOMPLETE_MSG
# 2243 (08C3) (RC2243): MQRC_INCONSISTENT_CCSIDS
# 2244 (08C4) (RC2244): MQRC_INCONSISTENT_ENCODINGS
# 2245 (08C5) (RC2245): MQRC_INCONSISTENT_UOW
# 2246 (08C6) (RC2246): MQRC_INVALID_MSG_UNDER_CURSOR
# 2247 (08C7) (RC2247): MQRC_MATCH_OPTIONS_ERROR
# 2248 (08C8) (RC2248): MQRC_MDE_ERROR
# 2249 (08C9) (RC2249): MQRC_MSG_FLAGS_ERROR
# 2250 (08CA) (RC2250): MQRC_MSG_SEQ_NUMBER_ERROR
# 2251 (08CB) (RC2251): MQRC_OFFSET_ERROR
# 2252 (08CC) (RC2252): MQRC_ORIGINAL_LENGTH_ERROR
# 2253 (08CD) (RC2253): MQRC_SEGMENT_LENGTH_ZERO
# 2255 (08CF) (RC2255): MQRC_UOW_NOT_AVAILABLE
# 2256 (08D0) (RC2256): MQRC_WRONG_GMO_VERSION
# 2257 (08D1) (RC2257): MQRC_WRONG_MD_VERSION
# 2258 (08D2) (RC2258): MQRC_GROUP_ID_ERROR
# 2259 (08D3) (RC2259): MQRC_INCONSISTENT_BROWSE
# 2260 (08D4) (RC2260): MQRC_XQH_ERROR
# 2261 (08D5) (RC2261): MQRC_SRC_ENV_ERROR
# 2262 (08D6) (RC2262): MQRC_SRC_NAME_ERROR
# 2263 (08D7) (RC2263): MQRC_DEST_ENV_ERROR
# 2264 (08D8) (RC2264): MQRC_DEST_NAME_ERROR
# 2265 (08D9) (RC2265): MQRC_TM_ERROR
# 2266 (08DA) (RC2266): MQRC_CLUSTER_EXIT_ERROR
# 2267 (08DB) (RC2267): MQRC_CLUSTER_EXIT_LOAD_ERROR
# 2268 (08DC) (RC2268): MQRC_CLUSTER_PUT_INHIBITED
# 2269 (08DD) (RC2269): MQRC_CLUSTER_RESOURCE_ERROR
# 2270 (08DE) (RC2270): MQRC_NO_DESTINATIONS_AVAILABLE
# 2271 (08DF) (RC2271): MQRC_CONN_TAG_IN_USE
# 2272 (08E0) (RC2272): MQRC_PARTIALLY_CONVERTED
# 2273 (08E1) (RC2273): MQRC_CONNECTION_ERROR
# 2274 (08E2) (RC2274): MQRC_OPTION_ENVIRONMENT_ERROR
# 2277 (08E5) (RC2277): MQRC_CD_ERROR
# 2278 (08E6) (RC2278): MQRC_CLIENT_CONN_ERROR
# 2279 (08E7) (RC2279): MQRC_CHANNEL_STOPPED_BY_USER
# 2280 (08E8) (RC2280): MQRC_HCONFIG_ERROR
# 2281 (08E9) (RC2281): MQRC_FUNCTION_ERROR
# 2282 (08EA) (RC2282): MQRC_CHANNEL_STARTED
# 2283 (08EB) (RC2283): MQRC_CHANNEL_STOPPED
# 2284 (08EC) (RC2284): MQRC_CHANNEL_CONV_ERROR
# 2285 (08ED) (RC2285): MQRC_SERVICE_NOT_AVAILABLE
# 2286 (08EE) (RC2286): MQRC_INITIALIZATION_FAILED
# 2287 (08EF) (RC2287): MQRC_TERMINATION_FAILED
# 2288 (08F0) (RC2288): MQRC_UNKNOWN_Q_NAME
# 2289 (08F1) (RC2289): MQRC_SERVICE_ERROR
# 2290 (08F2) (RC2290): MQRC_Q_ALREADY_EXISTS
# 2291 (08F3) (RC2291): MQRC_USER_ID_NOT_AVAILABLE
# 2292 (08F4) (RC2292): MQRC_UNKNOWN_ENTITY
# 2294 (08F6) (RC2294): MQRC_UNKNOWN_REF_OBJECT
# 2295 (08F7) (RC2295): MQRC_CHANNEL_ACTIVATED
# 2296 (08F8) (RC2296): MQRC_CHANNEL_NOT_ACTIVATED
# 2297 (08F9) (RC2297): MQRC_UOW_CANCELED
# 2298 (08FA) (RC2298): MQRC_FUNCTION_NOT_SUPPORTED
# 2299 (08FB) (RC2299): MQRC_SELECTOR_TYPE_ERROR
# 2300 (08FC) (RC2300): MQRC_COMMAND_TYPE_ERROR
# 2301 (08FD) (RC2301): MQRC_MULTIPLE_INSTANCE_ERROR
# 2302 (08FE) (RC2302): MQRC_SYSTEM_ITEM_NOT_ALTERABLE
# 2303 (08FF) (RC2303): MQRC_BAG_CONVERSION_ERROR
# 2304 (0900) (RC2304): MQRC_SELECTOR_OUT_OF_RANGE
# 2305 (0901) (RC2305): MQRC_SELECTOR_NOT_UNIQUE
# 2306 (0902) (RC2306): MQRC_INDEX_NOT_PRESENT
# 2307 (0903) (RC2307): MQRC_STRING_ERROR
# 2308 (0904) (RC2308): MQRC_ENCODING_NOT_SUPPORTED
# 2309 (0905) (RC2309): MQRC_SELECTOR_NOT_PRESENT
# 2310 (0906) (RC2310): MQRC_OUT_SELECTOR_ERROR
# 2311 (0907) (RC2311): MQRC_STRING_TRUNCATED
# 2312 (0908) (RC2312): MQRC_SELECTOR_WRONG_TYPE
# 2313 (0909) (RC2313): MQRC_INCONSISTENT_ITEM_TYPE
# 2314 (090A) (RC2314): MQRC_INDEX_ERROR
# 2315 (090B) (RC2315): MQRC_SYSTEM_BAG_NOT_ALTERABLE
# 2316 (090C) (RC2316): MQRC_ITEM_COUNT_ERROR
# 2317 (090D) (RC2317): MQRC_FORMAT_NOT_SUPPORTED
# 2318 (090E) (RC2318): MQRC_SELECTOR_NOT_SUPPORTED
# 2319 (090F) (RC2319): MQRC_ITEM_VALUE_ERROR
# 2320 (0910) (RC2320): MQRC_HBAG_ERROR
# 2321 (0911) (RC2321): MQRC_PARAMETER_MISSING
# 2322 (0912) (RC2322): MQRC_CMD_SERVER_NOT_AVAILABLE
# 2323 (0913) (RC2323): MQRC_STRING_LENGTH_ERROR
# 2324 (0914) (RC2324): MQRC_INQUIRY_COMMAND_ERROR
# 2325 (0915) (RC2325): MQRC_NESTED_BAG_NOT_SUPPORTED
# 2326 (0916) (RC2326): MQRC_BAG_WRONG_TYPE
# 2327 (0917) (RC2327): MQRC_ITEM_TYPE_ERROR
# 2328 (0918) (RC2328): MQRC_SYSTEM_BAG_NOT_DELETABLE
# 2329 (0919) (RC2329): MQRC_SYSTEM_ITEM_NOT_DELETABLE
# 2330 (091A) (RC2330): MQRC_CODED_CHAR_SET_ID_ERROR
# 2331 (091B) (RC2331): MQRC_MSG_TOKEN_ERROR
# 2332 (091C) (RC2332): MQRC_MISSING_WIH
# 2333 (091D) (RC2333): MQRC_WIH_ERROR
# 2334 (091E) (RC2334): MQRC_RFH_ERROR
# 2335 (091F) (RC2335): MQRC_RFH_STRING_ERROR
# 2336 (0920) (RC2336): MQRC_RFH_COMMAND_ERROR
# 2337 (0921) (RC2337): MQRC_RFH_PARM_ERROR
# 2338 (0922) (RC2338): MQRC_RFH_DUPLICATE_PARM
# 2339 (0923) (RC2339): MQRC_RFH_PARM_MISSING
# 2340 (0924) (RC2340): MQRC_CHAR_CONVERSION_ERROR
# 2341 (0925) (RC2341): MQRC_UCS2_CONVERSION_ERROR
# 2342 (0926) (RC2342): MQRC_DB2_NOT_AVAILABLE
# 2343 (0927) (RC2343): MQRC_OBJECT_NOT_UNIQUE
# 2344 (0928) (RC2344): MQRC_CONN_TAG_NOT_RELEASED
# 2345 (0929) (RC2345): MQRC_CF_NOT_AVAILABLE
# 2346 (092A) (RC2346): MQRC_CF_STRUC_IN_USE
# 2347 (092B) (RC2347): MQRC_CF_STRUC_LIST_HDR_IN_USE
# 2348 (092C) (RC2348): MQRC_CF_STRUC_AUTH_FAILED
# 2349 (092D) (RC2349): MQRC_CF_STRUC_ERROR
# 2350 (092E) (RC2350): MQRC_CONN_TAG_NOT_USABLE
# 2351 (092F) (RC2351): MQRC_GLOBAL_UOW_CONFLICT
# 2352 (0930) (RC2352): MQRC_LOCAL_UOW_CONFLICT
# 2353 (0931) (RC2353): MQRC_HANDLE_IN_USE_FOR_UOW
# 2354 (0932) (RC2354): MQRC_UOW_ENLISTMENT_ERROR
# 2355 (0933) (RC2355): MQRC_UOW_MIX_NOT_SUPPORTED
# 2356 (0934) (RC2356): MQRC_WXP_ERROR
# 2357 (0935) (RC2357): MQRC_CURRENT_RECORD_ERROR
# 2358 (0936) (RC2358): MQRC_NEXT_OFFSET_ERROR
# 2359 (0937) (RC2359): MQRC_NO_RECORD_AVAILABLE
# 2360 (0938) (RC2360): MQRC_OBJECT_LEVEL_INCOMPATIBLE
# 2361 (0939) (RC2361): MQRC_NEXT_RECORD_ERROR
# 2362 (093A) (RC2362): MQRC_BACKOUT_THRESHOLD_REACHED
# 2363 (093B) (RC2363): MQRC_MSG_NOT_MATCHED
# 2364 (093C) (RC2364): MQRC_JMS_FORMAT_ERROR
# 2365 (093D) (RC2365): MQRC_SEGMENTS_NOT_SUPPORTED
# 2366 (093E) (RC2366): MQRC_WRONG_CF_LEVEL
# 2367 (093F) (RC2367): MQRC_CONFIG_CREATE_OBJECT
# 2368 (0940) (RC2368): MQRC_CONFIG_CHANGE_OBJECT
# 2369 (0941) (RC2369): MQRC_CONFIG_DELETE_OBJECT
# 2370 (0942) (RC2370): MQRC_CONFIG_REFRESH_OBJECT
# 2371 (0943) (RC2371): MQRC_CHANNEL_SSL_ERROR
# 2373 (0945) (RC2373): MQRC_CF_STRUC_FAILED
# 2374 (0946) (RC2374): MQRC_API_EXIT_ERROR
# 2375 (0947) (RC2375): MQRC_API_EXIT_INIT_ERROR
# 2376 (0948) (RC2376): MQRC_API_EXIT_TERM_ERROR
# 2377 (0949) (RC2377): MQRC_EXIT_REASON_ERROR
# 2378 (094A) (RC2378): MQRC_RESERVED_VALUE_ERROR
# 2379 (094B) (RC2379): MQRC_NO_DATA_AVAILABLE
# 2380 (094C) (RC2380): MQRC_SCO_ERROR
# 2381 (094D) (RC2381): MQRC_KEY_REPOSITORY_ERROR
# 2382 (094E) (RC2382): MQRC_CRYPTO_HARDWARE_ERROR
# 2383 (094F) (RC2383): MQRC_AUTH_INFO_REC_COUNT_ERROR
# 2384 (0950) (RC2384): MQRC_AUTH_INFO_REC_ERROR
# 2385 (0951) (RC2385): MQRC_AIR_ERROR
# 2386 (0952) (RC2386): MQRC_AUTH_INFO_TYPE_ERROR
# 2387 (0953) (RC2387): MQRC_AUTH_INFO_CONN_NAME_ERROR
# 2388 (0954) (RC2388): MQRC_LDAP_USER_NAME_ERROR
# 2389 (0955) (RC2389): MQRC_LDAP_USER_NAME_LENGTH_ERR
# 2390 (0956) (RC2390): MQRC_LDAP_PASSWORD_ERROR
# 2391 (0957) (RC2391): MQRC_SSL_ALREADY_INITIALIZED
# 2392 (0958) (RC2392): MQRC_SSL_CONFIG_ERROR
# 2393 (0959) (RC2393): MQRC_SSL_INITIALIZATION_ERROR
# 2394 (095A) (RC2394): MQRC_Q_INDEX_TYPE_ERROR
# 2395 (095B) (RC2395): MQRC_CFBS_ERROR
# 2396 (095C) (RC2396): MQRC_SSL_NOT_ALLOWED
# 2397 (095D) (RC2397): MQRC_JSSE_ERROR
# 2398 (095E) (RC2398): MQRC_SSL_PEER_NAME_MISMATCH
# 2399 (095F) (RC2399): MQRC_SSL_PEER_NAME_ERROR
# 2400 (0960) (RC2400): MQRC_UNSUPPORTED_CIPHER_SUITE
# 2401 (0961) (RC2401): MQRC_SSL_CERTIFICATE_REVOKED
# 2402 (0962) (RC2402): MQRC_SSL_CERT_STORE_ERROR
# 2406 (0966) (RC2406): MQRC_CLIENT_EXIT_LOAD_ERROR
# 2407 (0967) (RC2407): MQRC_CLIENT_EXIT_ERROR
# 2409 (0969) (RC2409): MQRC_SSL_KEY_RESET_ERROR
# 2411 (096B) (RC2411): MQRC_LOGGER_STATUS
# 2412 (096C) (RC2412): MQRC_COMMAND_MQSC
# 2413 (096D) (RC2413): MQRC_COMMAND_PCF
# 2414 (096E) (RC2414): MQRC_CFIF_ERROR
# 2415 (096F) (RC2415): MQRC_CFSF_ERROR
# 2416 (0970) (RC2416): MQRC_CFGR_ERROR
# 2417 (0971) (RC2417): MQRC_MSG_NOT_ALLOWED_IN_GROUP
# 2418 (0972) (RC2418): MQRC_FILTER_OPERATOR_ERROR
# 2419 (0973) (RC2419): MQRC_NESTED_SELECTOR_ERROR
# 2420 (0974) (RC2420): MQRC_EPH_ERROR
# 2421 (0975) (RC2421): MQRC_RFH_FORMAT_ERROR
# 2422 (0976) (RC2422): MQRC_CFBF_ERROR
# 2423 (0977) (RC2423): MQRC_CLIENT_CHANNEL_CONFLICT
# 2424 (0978) (RC2424): MQRC_SD_ERROR
# 2425 (0979) (RC2425): MQRC_TOPIC_STRING_ERROR
# 2426 (097A) (RC2426): MQRC_STS_ERROR
# 2428 (097C) (RC2428): MQRC_NO_SUBSCRIPTION
# 2429 (097D) (RC2429): MQRC_SUBSCRIPTION_IN_USE
# 2430 (097E) (RC2430): MQRC_STAT_TYPE_ERROR
# 2431 (097F) (RC2431): MQRC_SUB_USER_DATA_ERROR
# 2432 (0980) (RC2432): MQRC_SUB_ALREADY_EXISTS
# 2434 (0982) (RC2434): MQRC_IDENTITY_MISMATCH
# 2435 (0983) (RC2435): MQRC_ALTER_SUB_ERROR
# 2436 (0984) (RC2436): MQRC_DURABILITY_NOT_ALLOWED
# 2437 (0985) (RC2437): MQRC_NO_RETAINED_MSG
# 2438 (0986) (RC2438): MQRC_SRO_ERROR
# 2440 (0988) (RC2440): MQRC_SUB_NAME_ERROR
# 2441 (0989) (RC2441): MQRC_OBJECT_STRING_ERROR
# 2442 (098A) (RC2442): MQRC_PROPERTY_NAME_ERROR
# 2443 (098B) (RC2443): MQRC_SEGMENTATION_NOT_ALLOWED
# 2444 (098C) (RC2444): MQRC_CBD_ERROR
# 2445 (098D) (RC2445): MQRC_CTLO_ERROR
# 2446 (098E) (RC2446): MQRC_NO_CALLBACKS_ACTIVE
# 2448 (0990) (RC2448): MQRC_CALLBACK_NOT_REGISTERED
# 2452 (0994) (RC2452): MQRC_CALLBACK_ERROR
# 2453 (0995) (RC2453): MQRC_CALLBACK_STILL_ACTIVE
# 2457 (0999) (RC2457): MQRC_OPTIONS_CHANGED
# 2458 (099A) (RC2458): MQRC_READ_AHEAD_MSGS
# 2459 (099B) (RC2459): MQRC_SELECTOR_SYNTAX_ERROR
# 2460 (099C) (RC2460): MQRC_HMSG_ERROR
# 2461 (099D) (RC2461): MQRC_CMHO_ERROR
# 2462 (099E) (RC2462): MQRC_DMHO_ERROR
# 2463 (099F) (RC2463): MQRC_SMPO_ERROR
# 2464 (09A0) (RC2464): MQRC_IMPO_ERROR
# 2465 (09A1) (RC2465): MQRC_PROPERTY_NAME_TOO_BIG
# 2466 (09A2) (RC2466): MQRC_PROP_VALUE_NOT_CONVERTED
# 2467 (09A3) (RC2467): MQRC_PROP_TYPE_NOT_SUPPORTED
# 2469 (09A5) (RC2469): MQRC_PROPERTY_VALUE_TOO_BIG
# 2470 (09A6) (RC2470): MQRC_PROP_CONV_NOT_SUPPORTED
# 2471 (09A7) (RC2471): MQRC_PROPERTY_NOT_AVAILABLE
# 2472 (09A8) (RC2472): MQRC_PROP_NUMBER_FORMAT_ERROR
# 2473 (09A9) (RC2473): MQRC_PROPERTY_TYPE_ERROR
# 2478 (09AE) (RC2478): MQRC_PROPERTIES_TOO_BIG
# 2479 (09AF) (RC2479): MQRC_PUT_NOT_RETAINED
# 2480 (09B0) (RC2480): MQRC_ALIAS_TARGTYPE_CHANGED
# 2481 (09B1) (RC2481): MQRC_DMPO_ERROR
# 2482 (09B2) (RC2482): MQRC_PD_ERROR
# 2483 (09B3) (RC2483): MQRC_CALLBACK_TYPE_ERROR
# 2484 (09B4) (RC2484): MQRC_CBD_OPTIONS_ERROR
# 2485 (09B5) (RC2485): MQRC_MAX_MSG_LENGTH_ERROR
# 2486 (09B6) (RC2486): MQRC_CALLBACK_ROUTINE_ERROR
# 2487 (09B7) (RC2487): MQRC_CALLBACK_LINK_ERROR
# 2488 (09B8) (RC2488): MQRC_OPERATION_ERROR
# 2489 (09B9) (RC2489): MQRC_BMHO_ERROR
# 2490 (09BA) (RC2490): MQRC_UNSUPPORTED_PROPERTY
# 2492 (09BC) (RC2492): MQRC_PROP_NAME_NOT_CONVERTED
# 2494 (09BE) (RC2494): MQRC_GET_ENABLED
# 2495 (09BF) (RC2495): MQRC_MODULE_NOT_FOUND
# 2496 (09C0) (RC2496): MQRC_MODULE_INVALID
# 2497 (09C1) (RC2497): MQRC_MODULE_ENTRY_NOT_FOUND
# 2498 (09C2) (RC2498): MQRC_MIXED_CONTENT_NOT_ALLOWED
# 2499 (09C3) (RC2499): MQRC_MSG_HANDLE_IN_USE
# 2500 (09C4) (RC2500): MQRC_HCONN_ASYNC_ACTIVE
# 2501 (09C5) (RC2501): MQRC_MHBO_ERROR
# 2502 (09C6) (RC2502): MQRC_PUBLICATION_FAILURE
# 2503 (09C7) (RC2503): MQRC_SUB_INHIBITED
# 2504 (09C8) (RC2504): MQRC_SELECTOR_ALWAYS_FALSE
# 2507 (09CB) (RC2507): MQRC_XEPO_ERROR
# 2509 (09CD) (RC2509): MQRC_DURABILITY_NOT_ALTERABLE
# 2510 (09CE) (RC2510): MQRC_TOPIC_NOT_ALTERABLE
# 2512 (09D0) (RC2512): MQRC_SUBLEVEL_NOT_ALTERABLE
# 2513 (09D1) (RC2513): MQRC_PROPERTY_NAME_LENGTH_ERR
# 2514 (09D2) (RC2514): MQRC_DUPLICATE_GROUP_SUB
# 2515 (09D3) (RC2515): MQRC_GROUPING_NOT_ALTERABLE
# 2516 (09D4) (RC2516): MQRC_SELECTOR_INVALID_FOR_TYPE
# 2517 (09D5) (RC2517): MQRC_HOBJ_QUIESCED
# 2518 (09D6) (RC2518): MQRC_HOBJ_QUIESCED_NO_MSGS
# 2519 (09D7) (RC2519): MQRC_SELECTION_STRING_ERROR
# 2520 (09D8) (RC2520): MQRC_RES_OBJECT_STRING_ERROR
# 2521 (09D9) (RC2521): MQRC_CONNECTION_SUSPENDED
# 2522 (09DA) (RC2522): MQRC_INVALID_DESTINATION
# 2523 (09DB) (RC2523): MQRC_INVALID_SUBSCRIPTION
# 2524 (09DC) (RC2524): MQRC_SELECTOR_NOT_ALTERABLE
# 2525 (09DD) (RC2525): MQRC_RETAINED_MSG_Q_ERROR
# 2526 (09DE) (RC2526): MQRC_RETAINED_NOT_DELIVERED
# 2527 (09DF) (RC2527): MQRC_RFH_RESTRICTED_FORMAT_ERR
# 2528 (09E0) (RC2528): MQRC_CONNECTION_STOPPED
# 2529 (09E1) (RC2529): MQRC_ASYNC_UOW_CONFLICT
# 2530 (09E2) (RC2530): MQRC_ASYNC_XA_CONFLICT
# 2531 (09E3) (RC2531): MQRC_PUBSUB_INHIBITED
# 2532 (09E4): MQRC_MSG_HANDLE_COPY_FAILURE
# 2533 (09E5) (RC2533): MQRC_DEST_CLASS_NOT_ALTERABLE
# 2534 (09E6) (RC2534): MQRC_OPERATION_NOT_ALLOWED
# 2535 (09E7): MQRC_ACTION_ERROR
# 2537 (09E9) (RC2537): MQRC_CHANNEL_NOT_AVAILABLE
# 2538 (09EA) (RC2538): MQRC_HOST_NOT_AVAILABLE
# 2539 (09EB) (RC2539): MQRC_CHANNEL_CONFIG_ERROR
# 2540 (09EC) (RC2540): MQRC_UNKNOWN_CHANNEL_NAME
# 2541 (09ED) (RC2541): MQRC_LOOPING_PUBLICATION
# 6100 (17D4) (RC6100): MQRC_REOPEN_EXCL_INPUT_ERROR
# 6101 (17D5) (RC6101): MQRC_REOPEN_INQUIRE_ERROR
# 6102 (17D6) (RC6102): MQRC_REOPEN_SAVED_CONTEXT_ERR
# 6103 (17D7) (RC6103): MQRC_REOPEN_TEMPORARY_Q_ERROR
# 6104 (17D8) (RC6104): MQRC_ATTRIBUTE_LOCKED
# 6105 (17D9) (RC6105): MQRC_CURSOR_NOT_VALID
# 6106 (17DA) (RC6106): MQRC_ENCODING_ERROR
# 6107 (17DB) (RC6107): MQRC_STRUC_ID_ERROR
# 6108 (17DC) (RC6108): MQRC_NULL_POINTER
# 6109 (17DD) (RC6109): MQRC_NO_CONNECTION_REFERENCE
# 6110 (17DE) (RC6110): MQRC_NO_BUFFER
# 6111 (17DF) (RC6111): MQRC_BINARY_DATA_LENGTH_ERROR
# 6112 (17E0) (RC6112): MQRC_BUFFER_NOT_AUTOMATIC
# 6113 (17E1) (RC6113): MQRC_INSUFFICIENT_BUFFER
# 6114 (17E2) (RC6114): MQRC_INSUFFICIENT_DATA
# 6115 (17E3) (RC6115): MQRC_DATA_TRUNCATED
# 6116 (17E4) (RC6116): MQRC_ZERO_LENGTH
# 6117 (17E5) (RC6117): MQRC_NEGATIVE_LENGTH
# 6118 (17E6) (RC6118): MQRC_NEGATIVE_OFFSET
# 6119 (17E7) (RC6119): MQRC_INCONSISTENT_FORMAT
# 6120 (17E8) (RC6120): MQRC_INCONSISTENT_OBJECT_STATE
# 6121 (17E9) (RC6121): MQRC_CONTEXT_OBJECT_NOT_VALID
# 6122 (17EA) (RC6122): MQRC_CONTEXT_OPEN_ERROR
# 6123 (17EB) (RC6123): MQRC_STRUC_LENGTH_ERROR
# 6124 (17EC) (RC6124): MQRC_NOT_CONNECTED
# 6125 (17ED) (RC6125): MQRC_NOT_OPEN
# 6126 (17EE) (RC6126): MQRC_DISTRIBUTION_LIST_EMPTY
# 6127 (17EF) (RC6127): MQRC_INCONSISTENT_OPEN_OPTIONS
# 6128 (17FO) (RC6128): MQRC_WRONG_VERSION
# 6129 (17F1) (RC6129): MQRC_REFERENCE_ERROR


Processes:
==========


MQ Processes: List 1:
=====================


MQSERIES PROCESSES BY PLATFORM 

PLATFORM =AIX 
ProcName        Process Function 
amqhasmx        logger 
amqharmx        log formatter,used only if the queue manager has linear logging selected 
amqzllp0        checkpoint processor 
amqzlaa0        queue manager agent(s) 
amqzxma0        processing controller 
runmqsc 	MQ Command interface 
amqpcsea        PCF command processor 
amqcrsta        Any remotely started channel over TCP/IP - Could be RECEIVER,REQUESTER,CLUSRCVR,SVRCONN,SENDER,SERVER 
amqcrs6a        Any remotely started channel over LU62/SNA - Could be RECEIVER,REQUESTER,CLUSRCVR,SVRCONN,SENDER,SERVER 
runmqchl        Any locally started channel over any protocol - Could be SENDER,SERVER,CLUSSDR,REQUESTER 
runmqlsr        listener process 
runmqchi        channel initiator 


PLATFORM = AS/400 
ProcName        Process Function 
AMQHIXK4        Storage Manager (Housekeeper) 
AMQMCPRA        Data Store (Object Cache) 
AMQCLMAA        Listener 
AMQALMP4        Check Point Process 
AMQRMCLA        Sender channel 
AMQPCSVA        PCF command processor 
AMQRIMNA        Channel initiator (trigger monitor to start channel) 
AMQIQES4        Quiesce (forces user logoffs - for upgrades) 
AMQIQEJ4        Quiesce (without user logoffs - for daily use if desired) 
AMQCRSTA        Any remotely started channel over TCP/IP - Could be RECEIVER,REQUESTER,CLUSRCVR,SVRCONN,SENDER,SERVER 
AMQCRS6A        Any remotely started channel over LU62/SNA - Could be RECEIVER,REQUESTER,CLUSRCVR,SVRCONN,SENDER,SERVER 


PLATFORM = HP/UX 
ProcName        Process Function 
amqhasmx        logger 
amqharmx        log formatter, used only if the queue manager has linear logging selected 
amqzllp0        checkpoint processor 
amqzlaa0        queue manager agents 
amqzxma0        processing controller 
runmqsc         MQ Command interface 
amqpcsea        PCF command processor 
amqcrsta        Any remotely started channel over TCP/IP - Could be RECEIVER,REQUESTER,CLUSRCVR,SVRCONN,SENDER,SERVER 
amqcrs6a        Any remotely started channel over LU62/SNA - Could be RECEIVER,REQUESTER,CLUSRCVR,SVRCONN,SENDER,SERVER 
runmqchl        Any locally started channel over any protocol - Could be SENDER,SERVER,CLUSSDR,REQUESTER 
runmqlsr        listener process 
runmqchi        channel initiator 


PLATFORM = OS2 
ProcName        Process Function 
AMQHASM2.EXE    The logger 
AMQHARM2.EXE    Log formatter (LINEAR logs only) 
AMQZLLP0.EXE    Checkpoint process 
AMQZLAA0.EXE    LQM agents 
AMQZXMA0.EXE    Execution controller 
AMQXSSV2.EXE    Shared memory servers 
RUNMQSC.EXE     MQSeries Command processor 
AMQPCSEA.EXE    PCF command processor 
AMQCRSTA.EXE    Any remotely started channel over TCP/IP - Could be RECEIVER,REQUESTER,CLUSRCVR,SVRCONN,SENDER,SERVER 
AMQCRS6A.EXE    Any remotely started channel over LU62/SNA - Could be RECEIVER,REQUESTER,CLUSRCVR,SVRCONN,SENDER,SERVER 
RUNMQCHL.EXE    Any locally started channel over any protocol - Could be SENDER,SERVER,CLUSSDR,REQUESTER 
RUNMQLSR        LISTENER PROCESS 
RUNMQCHI        CHANNEL INITIATOR 


PLATFORM = SOLARIS 
ProcName        Process Function 
amqhasmx        logger 
amqharmx        log formatter, used only if the queue manager has linear logging selected 
amqzllp0        checkpoint processor 
amqzlaa0        queue manager agents 
amqzxma0        processing controller 
amqcrsta        Any remotely started channel over TCP/IP - Could be RECEIVER,REQUESTER,CLUSRCVR,SVRCONN,SENDER,SERVER 
amqcrs6a        Any remotely started channel over LU62/SNA - Could be RECEIVER,REQUESTER,CLUSRCVR,SVRCONN,SENDER,SERVER 
runmqchl        Any locally started channel over any protocol - Could be SENDER,SERVER,CLUSSDR,REQUESTER 
runmqlsr        listener process 
runmqchi        channel initiator 
runmqsc         MQ Command interface 
amqpcsea        PCF command processor 


Windows/NT 
ProcName        Process Function 
AMQHASMN.EXE   The logger 
AMQHARMN.EXE   Log formatter (LINEAR logs only) 
AMQZLLP0.EXE   Checkpoint process 
AMQZLAA0.EXE   LQM agents 
AMQZTRCN.EXE   Trace 
AMQZXMA0.EXE   Execution controller 
AMQXSSVN.EXE   Shared memory servers 
AMQCRSTA.EXE   Any remotely started channel over TCP/IP - Could be RECEIVER,REQUESTER,CLUSRCVR,SVRCONN,SENDER,SERVER 
AMQCRS6A.EXE   Any remotely started channel over LU62/SNA - Could be RECEIVER,REQUESTER,CLUSRCVR,SVRCONN,SENDER,SERVER 
RUNMQCHL.EXE   Any locally started channel over any protocol - Could be SENDER,SERVER,CLUSSDR,REQUESTER 
RUNMQLSR       LISTENER PROCESS 
RUNMQCHI       CHANNEL INITIATOR 
RUNMQSC.EXE    MQSeries Command processor 
AMQPCSEA.EXE   PCF command processor 
AMQSCM.EXE     Service Control Manager 


Process Names     Process Function 
amqpcsea        Command server 
amqhasmx        Logger 
amqharmx        Log formatter (linear logs only) 
amqzllp0        Checkpoint processor 
amqzlaa0        Queue manager agents 
amqzfuma        OAM process 
amqzxma0        Processing controller 
amqrrmfa        Repository process (for clusters) 
amqzdmaa        Deferred message processor 
 

OS/390 and z/OS are very simple: 

qmgrMSTR - the main address space (manager, API calls etc) 
qmgrCHIN - communications (listener, channels) 

qmgr = name of the queue manager


MQ Processes: List 2:
=====================


Windows/NT 
AMQHASMN.EXE - The logger 
AMQHARMN.EXE - Log formatter (LINEAR logs only) 
AMQZLLP0.EXE - Checkpoint process 
AMQZLAA0.EXE - LQM agents 
AMQZTRCN.EXE - Trace 
AMQZXMA0.EXE - Execution controller 
AMQXSSVN.EXE - Shared memory servers 
AMQCRSTA.EXE - Any remotely started channel over TCP/IP 
             - Could be RECEIVER,REQUESTER,CLUSRCVR,SVRCONN,SENDER,SERVER 
AMQCRS6A.EXE - Any remotely started channel over LU62/SNA 
             - Could be RECEIVER,REQUESTER,CLUSRCVR,SVRCONN,SENDER,SERVER 
RUNMQCHL.EXE - Any locally started channel over any protocol 
             - Could be SENDER,SERVER,CLUSSDR,REQUESTER 
RUNMQLSR     - LISTENER PROCESS 
RUNMQCHI     - CHANNEL INITIATOR 
RUNMQSC.EXE  - MQSeries Command processor 
AMQPCSEA.EXE - PCF command processor 
AMQSCM.EXE   - Service Control Manager 

SOLARIS 
amqhasmx - logger 
amqharmx - log formatter, used only if the queue manager has linear logging selected 
amqzllp0 - checkpoint processor 
amqzlaa0 - queue manager agents 
amqzxma0 - processing controller 
amqcrsta - Any remotely started channel over TCP/IP 
         - Could be RECEIVER,REQUESTER,CLUSRCVR,SVRCONN,SENDER,SERVER 
amqcrs6a - Any remotely started channel over LU62/SNA 
         - Could be RECEIVER,REQUESTER,CLUSRCVR,SVRCONN,SENDER,SERVER 
runmqchl - Any locally started channel over any protocol 
         - Could be SENDER,SERVER,CLUSSDR,REQUESTER 
runmqlsr - listener process 
runmqchi - channel initiator 
runmqsc  - MQ Command interface 
amqpcsea - PCF command processor 


AS/400 
AMQHIXK4 - Storage Manager (Housekeeper) 
AMQMCPRA - Data Store (Object Cache) 
AMQCLMAA - Listener 
AMQALMP4 - Check Point Process 
AMQRMCLA - Sender channel 
AMQCRSTA - Any remotely started channel over TCP/IP 
         - Could be RECEIVER,REQUESTER,CLUSRCVR,SVRCONN,SENDER,SERVER 
AMQCRS6A - Any remotely started channel over LU62/SNA 
         - Could be RECEIVER,REQUESTER,CLUSRCVR,SVRCONN,SENDER,SERVER 
AMQPCSVA - PCF command processor 
AMQRIMNA - Channel initiator (trigger monitor to start channel) 
AMQIQES4 - Quiesce (forces user logoffs - for upgrades) 
AMQIQEJ4 - Quiesce (without user logoffs - for daily use if desired) 


AIX 
amqhasmx - logger 
amqharmx - log formatter, used only if the queue manager has linear logging selected 
amqzllp0 - checkpoint processor 
amqzlaa0 - queue manager agent(s) 
amqzxma0 - processing controller 
amqcrsta - Any remotely started channel over TCP/IP 
         - Could be RECEIVER,REQUESTER,CLUSRCVR,SVRCONN,SENDER,SERVER 
amqcrs6a - Any remotely started channel over LU62/SNA 
         - Could be RECEIVER,REQUESTER,CLUSRCVR,SVRCONN,SENDER,SERVER 
runmqchl - Any locally started channel over any protocol 
         - Could be SENDER,SERVER,CLUSSDR,REQUESTER 
runmqlsr - listener process 
runmqchi - channel initiator 
runmqsc  - MQ Command interface 
amqpcsea - PCF command processor 

HP/UX 
amqhasmx - logger 
amqharmx - log formatter, used only if the queue manager has linear logging selected 
amqzllp0 - checkpoint processor 
amqzlaa0 - queue manager agents 
amqzxma0 - processing controller 
amqcrsta - Any remotely started channel over TCP/IP 
         - Could be RECEIVER,REQUESTER,CLUSRCVR,SVRCONN,SENDER,SERVER 
amqcrs6a - Any remotely started channel over LU62/SNA 
         - Could be RECEIVER,REQUESTER,CLUSRCVR,SVRCONN,SENDER,SERVER 
runmqchl - Any locally started channel over any protocol 
         - Could be SENDER,SERVER,CLUSSDR,REQUESTER 
runmqlsr - listener process 
runmqchi - channel initiator 
runmqsc  - MQ Command interface 
amqpcsea - PCF command processor 

OS2 
AMQHASM2.EXE - The logger 
AMQHARM2.EXE - Log formatter (LINEAR logs only) 
AMQZLLP0.EXE - Checkpoint process 
AMQZLAA0.EXE - LQM agents 
AMQZXMA0.EXE - Execution controller 
AMQXSSV2.EXE - Shared memory servers 
AMQCRSTA.EXE - Any remotely started channel over TCP/IP 
             - Could be RECEIVER,REQUESTER,CLUSRCVR,SVRCONN,SENDER,SERVER 
AMQCRS6A.EXE - Any remotely started channel over LU62/SNA 
             - Could be RECEIVER,REQUESTER,CLUSRCVR,SVRCONN,SENDER,SERVER 
RUNMQCHL.EXE - Any locally started channel over any protocol 
             - Could be SENDER,SERVER,CLUSSDR,REQUESTER 
RUNMQLSR     - LISTENER PROCESS 
RUNMQCHI     - CHANNEL INITIATOR 
RUNMQSC.EXE  - MQSeries Command processor 
AMQPCSEA.EXE - PCF command processor 


MQSERIES PROCESSES BY PLATFORM

PLATFORM =AIX
ProcName        Process Function
amqhasmx        logger
amqharmx        log formatter,used only if the queue manager has linear
logging selected
amqzllp0        checkpoint processor
amqzlaa0        queue manager agent(s)
amqzxma0        processing controller
amqcrsta        TCPIP Receiver channel & Client Connection
amqcrs6a        LU62 Receiver channel & Client Connection
runmqchl        Sender Channel
runmqsc MQ Command interface
amqpcsea        PCF command processor


PLATFORM = AS/400
ProcName        Process Function

AMQHIXK4        Storage Manager (Housekeeper)
AMQMCPRA        Data Store (Object Cache)
AMQCLMAA        Listener
AMQALMP4        Check Point Process
AMQRMCLA        Sender channel
AMQCRSTA        TCP/IP Receiver channel & Client Connection
AMQCRS6A        LU62 Receiver channel & Client Connection
AMQPCSVA        PCF command processor
AMQRIMNA        Channel initiator (trigger monitor to start channel)
AMQIQES4        Quiesce (forces user logoffs - for upgrades)
AMQIQEJ4        Quiesce (without user logoffs - for daily use if desired)


PLATFORM = HP/UX
ProcName        Process Function
amqhasmx        logger
amqharmx        log formatter, used only if the queue manager has linear
logging selected
amqzllp0        checkpoint processor
amqzlaa0        queue manager agents
amqzxma0        processing controller
amqcrsta        TCPIP Receiver channel & Client Connection
amqcrs6a        LU62 Receiver channel & Client Connection
runmqchl        Sender Channel
runmqsc MQ Command interface
amqpcsea        PCF command processor

PLATFORM = OS2
ProcName                Process Function
AMQHASM2.EXE    The logger
AMQHARM2.EXE    Log formatter (LINEAR logs only)
AMQZLLP0.EXE    Checkpoint process
AMQZLAA0.EXE    LQM agents
AMQZXMA0.EXE    Execution controller
AMQXSSV2.EXE    Shared memory servers
AMQCRSTA.EXE    TCPIP Receiver channel & Client Connection
AMQCRS6A.EXE    LU62 Receiver channel & Client Connection
RUNMQCHL.EXE    Sender Channel
RUNMQSC.EXE             MQSeries Command processor
AMQPCSEA.EXE    PCF command processor


PLATFORM = SOLARIS
ProcName        Process Function
amqhasmx        logger
amqharmx        log formatter, used only if the queue manager has linear
logging selected
amqzllp0        checkpoint processor
amqzlaa0        queue manager agents
amqzxma0        processing controller
amqcrsta        TCPIP Receiver channel & Client Connection


Process Names     Process Function

amqpcsea        Command server
amqhasmx        Logger
amqharmx        Log formatter (linear logs only)
amqzllp0        Checkpoint processor
amqzlaa0        Queue manager agents
amqzfuma        OAM process
amqzxma0        Processing controller
amqrrmfa        Repository process (for clusters)
amqzdmaa        Deferred message processor


MQ Processes: List 3:
=====================

Description of WebSphere MQ tasks
When a queue manager is running, you see some or all of the following batch jobs running under the QMQM user profile 
in the WebSphere MQ subsystem. The jobs are described briefly in Table 1, to help you decide how to prioritize them.

Table 1. WebSphere MQ tasks. Job name Function 
AMQALMPX The checkpoint processor that periodically takes journal checkpoints 
AMQCLMAA Non-threaded TCP/IP listener 
AMQCRSTA TCP/IP-invoked channel responder 
AMQCRS6B LU62 receiver channel and client connection (see note). 
AMQFCXBA Broker worker job 
AMQPCSEA PCF command processor that handles PCF and remote administration requests 
AMQRMPPA Channel process pooling job 
AMQRRMFA Repository manager for clusters 
AMQZDMAA Deferred message handler 
AMQZFUMA Object authority manager (OAM) 
AMQZLAA0 Queue manager agents that perform the bulk of the work for applications that connect to the queue manager using MQCNO_STANDARD_BINDING 
AMQZLAS0 Queue manager agent 
AMQZXMA0 The execution controller that is the first job started by the queue manager. It deals with MQCONN requests, 
          and starts agent processes to process WebSphere MQ API calls 
AMQZMGR0 Process controller. This job is used to start up and manage listeners and services. 
AMQZMUC0 Utility manager. This job executes critical queue manager utilities, for example the journal chain manager. 
AMQZMUR0 Utility manager. This job executes critical queue manager utilities, for example the journal chain manager. 
RUNMQBRK Broker control job 
RUNMQCHI The channel initiator 
RUNMQCHL Sender channel job that is started for each sender channel 
RUNMQDLQ Dead letter queue handler 
RUNMQLSR Threaded TCP/IP listener 
RUNMQTRM Trigger monitor 
Note:
The LU62 receiver job runs in the communications subsystem and takes its run-time properties from the routing and communications 
entries that are used to start the job. See WebSphere MQ Intercommunication for more details. 

You can view all jobs connected to a queue manager, except listeners (which do not connect), 
using option 22 on the Work with Queue Manager (WRKMQM) panel. You can view listeners using the WRKMQMLSR command


WebSphere(R) MQ Explorer messages:
==================================

AMQ4000New object not created because the default object for the object type could not be found.
Severity:
10 : Warning

Explanation:
The creation of an object requires a default template for each object type. The required default template for this object type could not be found.

Response:
Determine why the default object is not available, or create a new one. Then try the request again.

AMQ4001The queue manager specified is already shown in the console.
Severity:
0 : Information

Response:
Message for information only.

AMQ4002Are you sure you want to delete the <insert_0> named <insert_1>?
Severity:
10 : Warning

Explanation:
A confirmation is required before the specified object is deleted. The type of object and name are provided in the message.

Response:
Continue only if you want to permanently delete the object.

AMQ4003WebSphere MQ system objects are used internally by WebSphere MQ. You are advised not to delete them. Do you want to keep the WebSphere MQ system object?
Severity:
0 : Information

Explanation:
A confirmation is required before an internal WebSphere MQ system object (for example SYSTEM.DEFAULT.LOCAL.QUEUE) is deleted.

Response:
Continue only if you want to permanently delete the system object.

AMQ4004Clear all messages from the queue?
Severity:
10 : Warning

Explanation:
The removal of the messages from the queue is an irreversible action. If the command is allowed to proceed the action cannot be undone.

Response:
Continue only if you want to permanently delete the messages.

AMQ4005The object has been replaced or deleted. The properties could not be applied.
Severity:
10 : Warning

Explanation:
During the process of updating the properties of the object, it was determined that the object has either been deleted or replaced by another instance. The properties have not been applied.

AMQ4006WebSphere MQ successfully sent data to the remote queue manager and received the data returned.
Severity:
0 : Information

Explanation:
An open channel has been successfully verified by WebSphere MQ as the result of a user request.

Response:
Message for information only.

AMQ4007The message sequence number for the channel was reset.
Severity:
0 : Information

Explanation:
A channel has had its sequence number successfully reset by WebSphere MQ as the result of a user request.

Response:
Message for information only.

AMQ4008The request to start the channel was accepted.
Severity:
0 : Information

Explanation:
A channel has been started successfully by WebSphere MQ as the result of a user request.

Response:
Message for information only.

AMQ4009The request to stop the channel was accepted.
Severity:
0 : Information

Explanation:
A channel has been stopped successfully by WebSphere MQ as the result of a user request.

Response:
Message for information only.

AMQ4010The 'in-doubt' state was resolved.
Severity:
0 : Information

Explanation:
A channel has had its 'in-doubt' state resolved successfully by WebSphere MQ as the result of a user request.

Response:
Message for information only

AMQ4011The queue has been cleared of messages.
Severity:
0 : Information

Explanation:
The CLEAR command completed successfully and has removed all messages from the target queue. If the CLEAR was performed using the MQGET API command, uncommitted messages might still be on the queue.

AMQ4012The object was created successfully but it is not visible with the current settings for visible objects.
Severity:
0 : Information

Response:
Message for information only.

AMQ4014The character <insert_0> is not valid.
Severity:
10 : Warning

AMQ4015Supply a non-blank name.
Severity:
0 : Information

Response:
Enter a valid name.

AMQ4016The test message was put successfully.
Severity:
0 : Information

Explanation:
The request to place a message on the target queue has completed successfully. The queue now contains the message.

Response:
Message for information only.

AMQ4019An object called <insert_0> already exists. Do you want to replace the definition of the existing object?
Severity:
0 : Information

Response:
Confirm that you want to replace the definition.

AMQ4020The changes you are making to the attributes of page "<insert_0>" will affect the operation of the queue manager or another program currently using the object. Do you want to force the change to the object's attributes?
Severity:
10 : Warning

Explanation:
You are trying to change a object that cannot be changed because the object is in use, or the change will affect other programs or queue managers. Some changes can be forced anyway.

Response:
Select Yes to try forcing the changes, or No to abandon the change.

AMQ4021Failed to access one or more WebSphere MQ objects.
Severity:
10 : Warning

Explanation:
The objects' icons have been marked to indicate the objects in error.

AMQ4022The name specified for the initiation queue is the same as the name of the queue itself.
Severity:
0 : Information

Response:
Specify a different name to that of the object being created or altered.

AMQ4023The queue manager <insert_0> does not exist on this computer.
Severity:
0 : Information

Response:
Message for information only.

AMQ4024The object cannot be replaced.
Severity:
0 : Information

Explanation:
The request to replace the object was unsuccessful.

Response:
To define this object, delete the existing object and try the operation again.

AMQ4025The changes made to the cluster attributes of the queue will take effect once they have propagated across the network.
Severity:
0 : Information

Response:
Refresh any views containing the cluster queues in the affected clusters to show the changes.

AMQ4026You have created a queue which is shared in one or more clusters. The queue will be available as a cluster queue once its definition has propagated across the network.
Severity:
0 : Information

Response:
Refresh any views containing the cluster queues in the affected clusters to show the cluster queue.

AMQ4027An error occurred connecting to the queue manager. Are you sure you want to show this queue manager in the folder anyway?
Severity:
10 : Warning

Explanation:
A connection could not be made to the specified remote queue manager.

Response:
Ensure that the named queue manager is running on the host and port specified, and has a channel corresponding to the specified name. Ensure that you have the authority to connect to the remote queue manager, and ensure that the network is running. Select Yes if you believe that the problem can be resolved later. Select No if you want to correct the problem now and try again.

AMQ4028Platform not supported. This queue manager cannot be administered by the WebSphere MQ Explorer because it is running on an unsupported platform. The value <insert_0> for the Platform attribute of the queue manager is not supported by the WebSphere MQ Explorer.
Severity:
20 : Error

AMQ4029Command level too low. This queue manager cannot be administered by the WebSphere MQ Explorer because the command level of the queue manager is less than <insert_0>.
Severity:
20 : Error

Response:
If you want to administer this queue manager, you must upgrade it to a newer version of WebSphere MQ.

AMQ4030Queue manager cannot be administered because codepage conversion table not found.
Severity:
20 : Error

Explanation:
This queue manager cannot be administered by the WebSphere MQ Explorer because a codepage conversion table was not found.

Response:
Install a codepage conversion table from CCSID <insert_0> to CCSID <insert_1> on the computer on which the WebSphere MQ Explorer is running.

AMQ4031Queue manager cannot be administered because CCSID not found.
Severity:
20 : Error

Explanation:
This queue manager cannot be administered by the WebSphere MQ Explorer because CCSID <insert_0> cannot be found in the CCSID table. The WebSphere MQ Explorer cannot convert character data to or from the unrecognized CCSID.

AMQ4032Command server not responding within timeout period.
Severity:
10 : Warning

Response:
Ensure that the command server is running and that the queue called 'SYSTEM.ADMIN.COMMAND.QUEUE' is configured to enable programs to get messages from it.

AMQ4033Cannot get messages from the queue.
Severity:
0 : Information

Explanation:
A reason code returned when the object was opened for input indicated that the queue is disabled for MQGET request.

Response:
To get messages from this queue, enable it for GET requests.

AMQ4034Message too long. You tried to put a message on a queue that was bigger than the maximum allowed for the queue or queue manager.
Severity:
10 : Warning

Explanation:
The request to put a message on a queue returned a reason code indicating that the data length of the message exceeds the maximum allowed in the definition of the queue.

Response:
Either change the MAXMSGL attribute of the queue so that it is equal to or greater than the length of the message, or reduce the length of the message being put on the queue.

AMQ4035No message available. The response message did not arrive within a reasonable amount of time.
Severity:
0 : Information

Explanation:
The request to get a message from a queue returned a reason code indicating that there are currently no messages on the queue that meet the selection criteria specified on the GET request.

AMQ4036Access not permitted. You are not authorized to perform this operation.
Severity:
10 : Warning

Explanation:
The queue manager's security mechanism has indicated that the userid associated with this request is not authorized to access the object.

AMQ4037Object definition changed since it was opened.
Severity:
0 : Information

Explanation:
Object definitions that affect this object have been changed since the Hobj handle used on this call was returned by the MQOPEN call.

Response:
Issue an MQCLOSE call to return the handle to the system. It is then usually sufficient to reopen the object and try the operation again.

AMQ4038Object damaged.
Severity:
10 : Warning

Explanation:
The object is damaged and cannot be accessed.

Response:
The object should be deleted. Alternatively, it might be possible to recover it from a media image or backup.

AMQ4039Object in use. The object is already opened by another application.
Severity:
10 : Warning

Explanation:
An MQOPEN call was issued, but the object in question has already been opened by this or another application with options that conflict with those specified in the Options parameter. This arises if the request is for shared input, but the object is already open for exclusive input. It also arises if the request is for exclusive input, but the object is already open for input (of any sort).

Response:
To change the attributes of an object, specify the Force option as 'Yes' to apply the changes. If you do this, any applications using the object must close and reopen the object to proceed.

AMQ4040Cannot put messages on this queue.
Severity:
0 : Information

Explanation:
MQPUT and MQPUT1 calls are currently inhibited for the queue, or for the queue to which this queue resolves.

AMQ4042Queue full. The queue contains the maximum number of messages.
Severity:
10 : Warning

Explanation:
On an MQPUT or MQPUT1 call, the call failed because the queue is full; that is, it already contains the maximum number of messages possible.

AMQ4043Queue manager not available for connection.
Severity:
10 : Warning

Response:
Ensure that the queue manager is running. If the queue manager is running on another computer, ensure it is configured to accept remote connections.

AMQ4044Queue manager stopping.
Severity:
0 : Information

Explanation:
An MQI call was issued, but the call failed because the queue manager is shutting down. If the call was an MQGET call with the MQGMO_WAIT option, the wait has been canceled.

Response:
You cannot issue any more MQI calls.

AMQ4045Queue not empty. The queue contains one or more messages or uncommitted PUT or GET requests.
Severity:
0 : Information

Explanation:
An operation that requires the queue to be empty has failed because the queue either contains messages or has uncommitted PUT or GET requests outstanding.

AMQ4046Insufficient system resources available.
Severity:
20 : Error

AMQ4047Insufficient storage available.
Severity:
20 : Error

AMQ4048The request received an unexpected reason code from an underlying API or command request. The reason code was <insert_0>
Severity:
20 : Error

Explanation:
While executing the requested operation, an unexpected return code was received. This has resulted in the operation not completing as expected.

Response:
Use the appropriate documentation to determine why the reason code was returned.

AMQ4049Unknown object name.
Severity:
10 : Warning

Explanation:
A command or API request was issued, but the object cannot be found.

AMQ4050Allocation failed. An attempt to allocate a conversation to a remote system failed.
Severity:
10 : Warning

Explanation:
The error might be due to an incorrect entry in the channel definition or it might be that the listening program on the remote system was not running.

AMQ4051Bind failed. The bind to a remote system during session negotiation failed.
Severity:
10 : Warning

AMQ4052Coded character-set ID error. Cannot convert a command message to the CCSID of the target queue manager.
Severity:
10 : Warning

AMQ4053Channel in doubt. Operation not completed.
Severity:
10 : Warning

Explanation:
The operation could not complete because the channel was in doubt.

AMQ4054Channel in use.
Severity:
10 : Warning

Explanation:
An attempt was made to perform an operation on a channel, but the channel is currently active.

AMQ4055Channel status not found.
Severity:
10 : Warning

Explanation:
No channel status is available for this channel. This might indicate that the channel has not been used.

AMQ4056Command failed.
Severity:
10 : Warning

AMQ4057Configuration error in the channel definition or communication subsystem.
Severity:
10 : Warning

Explanation:
Allocation of a conversation is not possible.

AMQ4058Connection closed.
Severity:
10 : Warning

Explanation:
The connection to a remote system has unexpectedly broken while receiving data.

AMQ4059Could not establish a connection to the queue manager.
Severity:
10 : Warning

Explanation:
The attempt to connect to the queue manager failed. This could be because the queue manager is incorrectly configured to allow a connection from this system, or the connection has been broken.

Response:
Try the operation again. If the error persists, examine the problem determination information to see if any information has been recorded.

AMQ4060Dynamic queue scope error.
Severity:
10 : Warning

Explanation:
The Scope attribute of the queue was set to MQSCO_CELL but this is not allowed for a dynamic queue.

AMQ4061Remote system not available. Could not allocate a conversation to a remote system.
Severity:
10 : Warning

Response:
The error might be transitory; try again later.

AMQ4062An MQINQ call failed when the queue manager inquired about a WebSphere MQ object.
Severity:
10 : Warning

Response:
Check the queue manager's error log for more information about the error.

AMQ4063An MQOPEN call failed when the queue manager tried to open a WebSphere MQ object.
Severity:
10 : Warning

Response:
Check the queue manager's error log for more information about the error.

AMQ4064An MQSET call failed when the queue manager tried to set the values of the attributes of a WebSphere MQ object.
Severity:
10 : Warning

Response:
Check the queue manager's error log for more information about the error.

AMQ4065Message sequence number error.
Severity:
10 : Warning

Explanation:
The message sequence number parameter was not valid.

AMQ4066Message truncated because it is larger than the command server's maximum valid message size.
Severity:
10 : Warning

AMQ4067Communications manager not available.
Severity:
20 : Error

Explanation:
The communications subsystem is not available.

AMQ4068Queue is not a transmission queue.
Severity:
10 : Warning

Explanation:
The queue specified in the channel definition was not a transmission queue.

AMQ4069Object already exists.
Severity:
10 : Warning

Explanation:
Could not create object because the object already existed.

AMQ4070Object is open.
Severity:
10 : Warning

Explanation:
An attempt was made to delete, change or clear an object that is in use.

Response:
Wait until the object is not in use, then try again.

AMQ4071Object has wrong type. Could not replace a queue object of a different type.
Severity:
10 : Warning

AMQ4072Queue already exists in cell.
Severity:
10 : Warning

Explanation:
Cannot define a queue with cell scope or change the scope of an existing queue from queue-manager scope to cell scope, because a queue with that name already exists in the cell.

AMQ4073Ping error. You can only ping a sender or server channel. If the local channel is a receiver channel, ping from the remote queue manager.
Severity:
10 : Warning

AMQ4074Receive failed, possibly due to a communications failure.
Severity:
10 : Warning

AMQ4075Error while receiving data from a remote system, possibly due to a communications failure.
Severity:
10 : Warning

AMQ4076Remote queue manager terminating.
Severity:
10 : Warning

Explanation:
The channel stopped because the remote queue manager was terminating.

AMQ4077Remote queue manager not available.
Severity:
10 : Warning

Explanation:
The channel could not be started because the remote queue manager was not available.

Response:
Ensure that the remote queue manager is started, and that it is configured to accept incoming communication requests.

AMQ4078Send failed. An error occurred while sending data to a remote system, possibly due to a communications failure.
Severity:
10 : Warning

AMQ4079Channel closed by security exit.
Severity:
10 : Warning

AMQ4080Remote channel not known.
Severity:
10 : Warning

Explanation:
There is no definition of this channel on the remote system.

AMQ4081User exit not available.
Severity:
10 : Warning

Explanation:
The channel was closed because the user exit specified does not exist.

AMQ4082Unexpected WebSphere MQ error (<insert_0>).
Severity:
20 : Error

AMQ4083Queue manager name not known.
Severity:
10 : Warning

Explanation:
If the queue manager is remote, this might indicate that another queue manager is incorrectly using the same connection name. Queue managers using TCP/IP on the same computer must listen on different port numbers. This means that they will also have different connection names.

AMQ4084Cell directory is not available.
Severity:
10 : Warning

Explanation:
The Scope attribute of the queue was set to MQSCO_CELL but no name service supporting a cell directory has been configured.

Response:
Configure a name service to support the cell directory.

AMQ4085No name supplied for transmission queue.
Severity:
10 : Warning

Response:
Supply a non-blank transmission queue name for this channel type.

AMQ4086No connection name supplied.
Severity:
10 : Warning

Response:
Supply a non-blank connection name for this channel type.

AMQ4087An error occurred while trying to use a cluster resource.
Severity:
10 : Warning

Response:
Check that the queues whose names start with 'SYSTEM.CLUSTER.' are not full and that messages are allowed to be put on them.

AMQ4088Cannot share transmission queue in cluster.
Severity:
10 : Warning

Explanation:
The queue is a transmission queue and cannot be shared in a cluster.

AMQ4089PUT commands inhibited for system command queue called 'SYSTEM.ADMIN.COMMAND.QUEUE'.
Severity:
10 : Warning

AMQ4090Are you sure you want to inhibit PUT and GET commands for the queue called 'SYSTEM.ADMIN.COMMAND.QUEUE'? If you do, you will no longer be able to administer the queue manager using the WebSphere MQ Explorer.
Severity:
10 : Warning

Explanation:
WebSphere MQ Explorer uses the queue called 'SYSTEM.ADMIN.COMMAND.QUEUE' to administer the queue manager.

Response:
Continue only if you really want to inhibit PUT or GET commands for this queue and stop using the WebSphere MQ Explorer to administer the queue manager.

AMQ4091Cannot connect to remote queue manager.
Severity:
10 : Warning

Explanation:
The remote queue manager is using an unsupported protocol for connections. The WebSphere MQ Explorer only supports connections to remote queue managers using the TCP/IP protocol.

AMQ4092The queue manager could not be removed from the cluster because its membership of the cluster is defined using namelist <insert_0>.
Severity:
10 : Warning

Response:
To remove the queue manager from the cluster, remove it from the namelist. Ensure that you do not inadvertently affect the definitions of other objects using the namelist.

AMQ4093The cluster specified is already shown in the console.
Severity:
0 : Information

AMQ4094An error occurred adding this cluster to the console. Are you sure you want to show this cluster in the console anyway?
Severity:
10 : Warning

Response:
Select Yes if you believe that the problem can be resolved later. Select No if you want to correct the problem now and try again.

AMQ4095Queue manager <insert_0> is not a repository queue manager for cluster <insert_1>.
Severity:
0 : Information

Explanation:
To administer a cluster, the WebSphere MQ Explorer needs a connection to the repository queue manager.

AMQ4096Are you sure you want to clear the password for this channel?
Severity:
0 : Information

Response:
Check with the user before clearing the password from the channel. Continue only if you really want to clear the password.

AMQ4097Unmatched quotation mark.
Severity:
10 : Warning

Explanation:
An unmatched quotation mark has been found in a list of attributes. Each value in the list can be enclosed in a pair of single or double quotation marks. (Only required for values which contain spaces, commas or quotation marks.)

Response:
Check that all opening and closing quotation marks are in pairs. (To include a quotation mark within an attribute, use two together with no space between.)

AMQ4098Incorrect list format.
Severity:
10 : Warning

Explanation:
The attribute can contain a list of values which must be separated by a space or a comma. Each value in the list can be enclosed in a pair of single or double quotation marks. (Only required for values which contain spaces, commas or quotation marks.)

Response:
Check that values are separated by a space or a comma, and that all opening and closing quotation marks are in pairs. (To include a quotation mark within an attribute, use two together with no space between.)

AMQ4099Cannot communicate with one or more repository queue managers. Cluster <insert_0> is configured to use one or more repository queue managers which communicate using a protocol other than TCP/IP.
Severity:
10 : Warning

Explanation:
The WebSphere MQ Explorer can only establish connections to remote queue managers using TCP/IP.

Response:
To complete removal of the queue manager from the cluster, issue the RESET CLUSTER ACTION(FORCEREMOVE) command from the repository queue manager.

AMQ4103An error occurred connecting to the queue manager. Are you sure you want to show this queue manager in the folder?
Severity:
10 : Warning

Explanation:
A connection could not be made to the specified remote queue manager.

Response:
Ensure that the named queue manager is running on the machine specified in the selected channel definition table. Ensure that you have the authority to connect to the remote queue manager, and ensure that the network is up and running. Select Yes if you believe that the problem can be resolved later. Select No if you want to correct the problem now and try again.

AMQ4104The specified file <insert_0> does not contain a client definition table in the correct format.
Severity:
10 : Warning

Explanation:
The given channel definition table is not in the correct format.

Response:
Specify a file in the correct format.

AMQ4105The remote queue manager has not been removed because it is still required by other plug-ins.
Severity:
10 : Warning

Explanation:
Other plug-ins have responded to the attempted removal of this queue manager by indicating that they are still using it.

Response:
Ensure that the other plug-ins have finished using the queue manager before trying to delete it again.

AMQ4117This action cannot be undone. Are you sure you want to delete the WebSphere MQ queue manager <insert_0> from your system?
Severity:
10 : Warning

Explanation:
A confirmation is required before the queue manager is deleted.

Response:
Continue only if you want to permanently delete the queue manager.

AMQ4121The MQGET request received an unexpected reason code of <insert_0>.
Severity:
10 : Warning

Explanation:
An unexpected reason code was returned from a MQGET API request. Use the reason code to determine the underlying reason why the request failed.

Response:
The MQGET request was not successful. Some messages might not have been retrieved.

AMQ4122The MQPUT request received an unexpected reason code of <insert_0>.
Severity:
10 : Warning

Explanation:
An unexpected reason code was returned from a MQPUT API request. Use the reason code to determine the underlying reason why the request failed.

Response:
MQPUT processing was unsuccessful. No message was placed on the queue.

AMQ4123The <insert_0> <insert_1> was deleted successfully.
Severity:
0 : Information

Explanation:
The object of the specified type and name has been successfully deleted.

Response:
none.

AMQ4124The MQOPEN request received an unexpected reason code of <insert_0>.
Severity:
10 : Warning

Explanation:
An unexpected reason code was returned from an MQOPEN API request. The queue has not been opened.

Response:
Use the reason code to determine the underlying reason for the failure.

AMQ4125Putting a test message on the queue received an unexpected reason code <insert_0>.
Severity:
10 : Warning

Explanation:
One of the underlying API requests was unsuccessful. The test message was not placed on the queue.

AMQ4126The value of one of the properties specified is not valid. The request was not processed.
Severity:
20 : Error

Response:
Specify a different value.

AMQ4127WebSphere MQ failed to read queue manager information from disk because the file format is not valid. The request was not processed.
Severity:
20 : Error

Explanation:
The format of the WebSphere MQ_Handles file is incorrect. This file has been backed up and removed, meaning that any remote queue manager definitions are lost. All local queue managers should be detected automatically and displayed in the WebSphere MQ Explorer.

Response:
Ensure that the Eclipse workspace has not been corrupted.

AMQ4128Could not start the iKeyMan program.
Severity:
30 : Severe error

Explanation:
An error was encountered when trying to execute the iKeyMan program.

Response:
Try again, and if symptoms persist contact the System Administrator.

AMQ4129Could not query the user ID from Java.
Severity:
10 : Warning

Explanation:
The Java API System.getProperty("user.id") threw a SecurityException.

Response:
Configure your Java security environment using the 'policytool' to allow WebSphere MQ Explorer to query the 'user.id'.

AMQ4130A Browser Control could not be opened. Make sure Mozilla has been installed.
Severity:
10 : Warning

Explanation:
The SWT Browser control depends on Mozilla being installed.

Response:
Ensure that the Mozilla browser is correctly installed.

AMQ4131A Browser Control could not be opened.
Severity:
10 : Warning

Explanation:
The SWT Browser control depends on the system browser being installed.

Response:
Ensure that the system browser is correctly installed.

AMQ4132Are you sure you want to stop the <insert_0> named "<insert_1>"?
Severity:
10 : Warning

Explanation:
A confirmation is required before the specified object is stopped. The type of object and name are provided in the message.

Response:
Continue only if you want to stop the object.

AMQ4133When a queue manager is removed, WebSphere MQ Explorer destroys the connection information for that queue manager. 
To see the queue manager at a later date use the Add Queue Manager wizard. 
Remove the queue manager <insert_0> ?
Severity:
10 : Warning

Response:
Continue only if you want to remove the queue manager.

AMQ4134The default channel used by remote queue managers to administer this queue manager does not exist. 
Do you want to create the default remote administration channel SYSTEM.ADMIN.SVRCONN to allow this queue manager to be administered by other queue managers?
Severity:
0 : Information

Response:
Select Yes to create the channel.

AMQ4135The default channel used by remote queue managers to administer this queue manager is SYSTEM.ADMIN.SVRCONN. 
Do you want to delete this channel to prevent the queue manager being administered by other queue managers?
Severity:
0 : Information

Response:
Select Yes to delete the channel.

AMQ4136Are you sure you want to clear all FFSTs and Trace from this machine? This operation cannot be undone.
Severity:
10 : Warning

Explanation:
Deleting all FFSTs and Trace from this machine means that any historical error logs and trace will be lost.

Response:
Select Yes to clear FFSTs and Trace

AMQ4137The default remote administration channel SYSTEM.ADMIN.SVRCONN has been deleted successfully.
Severity:
0 : Information

Response:
Message for information only.

AMQ4138Are you sure you want to import new settings that will overwrite the current settings? This operation cannot be undone.
Severity:
10 : Warning

Explanation:
Importing settings into the WebSphere MQ Explorer will overwrite the current settings.

Response:
Continue only if you want to overwrite the current settings.

AMQ4139The default remote administration channel SYSTEM.ADMIN.SVRCONN was created successfully.
Severity:
0 : Information

Response:
Message for information only.

AMQ4140The custom CipherSpec is not valid.
Severity:
10 : Warning

AMQ4141The Distinguished Names specification is not valid.
Severity:
10 : Warning

AMQ4142The default remote administration channel SYSTEM.ADMIN.SVRCONN could not be created.
Severity:
10 : Warning

Explanation:
A problem has occurred when issuing a command to the command server to create the channel.

Response:
Try again, and if symptoms persist contact the Systems Administrator.

AMQ4143The default remote administration channel SYSTEM.ADMIN.SVRCONN could not be created.
Severity:
10 : Warning

Explanation:
A problem has occurred with the underlying data model, and the channel could not be created.

Response:
Try again, and if symptoms persist contact the Systems Administrator.

AMQ4144The default remote administration channel SYSTEM.ADMIN.SVRCONN could not be deleted.
Severity:
10 : Warning

Explanation:
A problem has occurred issuing a command to the command server to delete the channel.

Response:
Ensure that the channel is not in use and try again. If symptoms persist, contact the Systems Administrator.

AMQ4145An error occurred connecting to the remote queue manager using the intermediate queue manager. Are you sure you want to show this queue manager in the folder anyway?
Severity:
10 : Warning

Explanation:
A connection could not be made to the specified remote queue manager.

Response:
Ensure that the intermediate queue manager is available and that the named remote queue manager is running, and is accessible from the intermediate queue manager. Ensure that you have the authority to connect to the remote queue manager, and ensure that the network is up and running. Select Yes if you believe that the problem can be resolved later. Select No if you want to correct the problem now and try again.

AMQ4146Eclipse cannot create or read the workspace for WebSphere MQ Explorer.
Severity:
40 : Stop Error

Explanation:
To load the WebSphere MQ Explorer, a valid workspace is required.

Response:
Ensure that you can write to the Eclipse workspace.

AMQ4147Eclipse cannot write to the workspace for WebSphere MQ Explorer in <insert_0>.
Severity:
40 : Stop Error

Explanation:
To load the WebSphere MQ Explorer, write access to the workspace is required.

Response:
Ensure that you can write to the Eclipse workspace.

AMQ4148The object was created successfully.
Severity:
0 : Information

Response:
Message for information only.

AMQ4149The request to start the listener was accepted.
Severity:
0 : Information

Explanation:
A user request to start the listener was accepted by WebSphere MQ.

Response:
Message for information only.

AMQ4150The request to stop the listener was accepted.
Severity:
0 : Information

Explanation:
A user request to stop the listener was accepted by WebSphere MQ.

Response:
Message for information only.

AMQ4151The request to start the service was accepted.
Severity:
0 : Information

Explanation:
A user request to start the service was accepted by WebSphere MQ.

Response:
Message for information only.

AMQ4152The request to stop the service was accepted.
Severity:
0 : Information

Explanation:
A user request to stop the service was accepted by WebSphere MQ.

Response:
Message for information only.

AMQ4153WebSphere MQ cannot stop the listener because it is not running.
Severity:
10 : Warning

AMQ4154WebSphere MQ cannot start the service because no start command has been specified.
Severity:
10 : Warning

Response:
Ensure that the service has a start command specified.

AMQ4155WebSphere MQ cannot stop the service because no stop command has been specified.
Severity:
10 : Warning

Response:
Ensure that the service has a stop command specified.

AMQ4156WebSphere MQ cannot stop the service because the service is not running.
Severity:
10 : Warning

AMQ4157WebSphere MQ cannot start the service because the services is already running.
Severity:
10 : Warning

AMQ4158WebSphere MQ cannot start the listener because it is already running.
Severity:
10 : Warning

AMQ4159WebSphere MQ cannot start the client connection channel because one or more of the properties are incorrectly specified.
Severity:
10 : Warning

Response:
Ensure that the client connection has the correct queue manager name and connection name before trying to start.

AMQ4160WebSphere MQ cannot process the request because the executable specified cannot be started.
Severity:
10 : Warning

Explanation:
The requested was unsuccessful because the program which was defined to be run to complete the action could not be started. 
Reasons why the program could not be started are :- 
The program does not exist at the specified location. 
The WebSphere MQ user does not have sufficient access to execute the program. 
If StdOut or StdErr are defined for the program, the WebSphere MQ user does not have sufficient access to the locations specified.

Response:
Check the Queue Manager error logs for further details on the cause of the failure, correct the problem and try again.

AMQ4161The parameter specified is not valid.
Severity:
20 : Error

Explanation:
The parameter specified when trying to create or alter an object is not valid.

Response:
Ensure that valid parameters are specified, then try again.

AMQ4162The password cannot be cleared.
Severity:
0 : Information

Response:
Try to clear the password again later.

AMQ4163The password cannot be changed.
Severity:
10 : Warning

Explanation:
The attempt to change the password failed because of an error.

Response:
Try a different password

AMQ4164The password was successfully changed.
Severity:
0 : Information

Response:
Message for information only.

AMQ4165No password entered in the new password field. No change applied.
Severity:
10 : Warning

Explanation:
You must enter a new password in both the new and confirm password fields.

Response:
Enter a new password in the new password field.

AMQ4166No password entered in the confirm new password field. No change applied.
Severity:
10 : Warning

Explanation:
You must enter a new password in both the new and confirm password fields.

Response:
Re-enter the new password in the confirm new password field.

AMQ4167Passwords do not match. No change applied.
Severity:
10 : Warning

Explanation:
You must enter the same new password in both the new and confirm password fields.

Response:
Ensure that the passwords in the new and confirm fields match.

AMQ4168WebSphere MQ failed to start listening for objects.
Severity:
20 : Error

Explanation:
No objects will be displayed in the currently selected view.

Response:
Check the problem determination information, and ensure that WebSphere MQ and the queue manager in question are both running correctly.

AMQ4169WebSphere MQ failed to set the object filter.
Severity:
20 : Error

Explanation:
The WebSphere MQ Explorer cannot listen for objects, so no objects will be displayed in the currently selected view.

Response:
Check the problem determination information, and ensure that WebSphere MQ and the queue manager in question are both running correctly.

AMQ4170The object name specified is not valid.
Severity:
20 : Error

Explanation:
The object name specified when trying to create or alter an object is not valid.

Response:
Ensure that a valid object name is specified, then try again.

AMQ4171There was an error when communicating with the queue manager.
Severity:
20 : Error

Explanation:
A request for information from the queue manager failed.

Response:
Try the operation again. If the error persists, examine the problem determination information to see if any details have been recorded.

AMQ4172There was an error when trying to set or retrieve information.
Severity:
20 : Error

Explanation:
There was an error when trying to set or retrieve information from the queue manager. This might have happened because you specified incorrect or inconsistent attributes when trying create or update an object.

Response:
If this error occurred during object creation or modification, ensure that the attributes specified are correct for this type of object. If the error persists, examine the problem determination information to see if any details have been recorded.

AMQ4173WebSphere MQ cannot clear one or more Trace and FFST files.
Severity:
10 : Warning

Explanation:
WebSphere MQ cannot clear some files, possibly because these files are currently in use, or the WebSphere MQ Explorer does not have the appropriate access permission.

Response:
Check that tracing is disabled, and that the WebSphere MQ Explorer has appropriate access permission to delete the Trace and FFST files.

AMQ4174FFSTs and Trace were cleared successfully.
Severity:
0 : Information

Response:
Message for information only.

AMQ4175WebSphere MQ cannot process your request because the value specified is not valid.
Severity:
20 : Error

Explanation:
Only certain combinations and values are valid for the object your are trying to alter or create. Please check the values and try again.

Response:
Specify a valid value and try again.

AMQ4176WebSphere MQ cannot process your request because the object name specified is not valid.
Severity:
20 : Error

Explanation:
Only certain combinations and values are valid for the object your are trying to alter or create. You might also see this message if you have specified an invalid QSG disposition.

Response:
Check all values are valid for this type of object and try again. If you have altered the disposition of this object, check that the value is correct.

AMQ4177The WebSphere MQ Explorer cannot process your request because the connection to WebSphere MQ is quiescing.
Severity:
20 : Error

Explanation:
The connection to WebSphere MQ is quiescing, so no new information can be queried.

Response:
Wait for the connection to end, then try reconnecting.

AMQ4178WebSphere MQ cannot process your request because there was a disposition conflict detected.
Severity:
20 : Error

Explanation:
A disposition conflict was detected. Please ensure that all disposition related fields are correct for this type of object.

Response:
Ensure that all disposition related fields are correct for this type of object and try again.

AMQ4179WebSphere MQ cannot process your request because the string provided was of an incorrect length.
Severity:
20 : Error

Explanation:
A string value has been modified or supplied that is too long when creating or modifying an object.

Response:
Check the values being supplied and try again.

AMQ4180WebSphere MQ cannot process your request because there was a parameter conflict.
Severity:
20 : Error

Explanation:
When creating or modifying an object, the combination of parameters specified is not valid.

Response:
Check that the combination specified is valid for the object and try again.

AMQ4181WebSphere MQ is not responding. Do you want to continue waiting?
Severity:
10 : Warning

Explanation:
WebSphere MQ does not appear to be responding. This could be because of a heavily loaded remote system, or a slow network connection. However there could have been a system failure. Choosing not to continue could leave the WebSphere MQ Explorer in an unknown state, so you should restart it.

Response:
If you choose not to continue waiting, restart the WebSphere MQ Explorer, if the problem persists check for problem determination information.

AMQ4182No objects were found.
Severity:
10 : Warning

Explanation:
The query did not find any objects.

Response:
If you were expecting objects to be found, check the problem determination information, and ensure that WebSphere MQ and the queue manager in question are both running correctly.

AMQ4183Query failed because the queue manager is not in a queue-sharing group.
Severity:
10 : Warning

Explanation:
WebSphere MQ issued a query that required the queue manager to be a member of a queue-sharing group.

Response:
Try the operation again, if the problem persists check the problem determination information for more details.

AMQ4184WebSphere MQ is unable to perform your request because the channel is not active.
Severity:
20 : Error

AMQ4185WebSphere MQ failed to import your settings.
Severity:
20 : Error

Response:
Try again. If the error persists, examine the problem determination information to see if any details have been recorded.

AMQ4186WebSphere MQ failed to export your settings.
Severity:
20 : Error

Response:
Try again. If the error persists, examine the problem determination information to see if any details have been recorded.

AMQ4187WebSphere MQ has succesfully imported your settings. (You must restart WebSphere MQ Explorer to apply the imported settings.)
Severity:
0 : Information

Response:
Restart WebSphere MQ explorer to apply the imported settings

AMQ4188Are you sure you want to remove queue manager <insert_0> from cluster <insert_1>?
Severity:
10 : Warning

Explanation:
A confirmation is required before the queue manager is removed from the cluster.

Response:
Continue only if you want to permanently remove the queue manger from the cluster.

AMQ4189The queue manager could not be suspended from the cluster. The operation failed with error <insert_0>.
Severity:
20 : Error

Explanation:
The queue manager has not been removed from the cluster.

Response:
Try the operation again. If the error persists, examine the problem determination information to see if any information has been recorded.

AMQ4190An error occurred when clearing the queue manager's REPOS field. The operation failed with error <insert_0>.
Severity:
20 : Error

Explanation:
The queue manager has only partially been removed from the cluster. The queue manager has been suspended from the cluster. The REPOS field of the queue manager and the CLUSTER fields of the associated cluster channels have not been cleared.

Response:
Try the operation again. If the error persists, examine the problem determination information to see if any information has been recorded.

AMQ4191An error occurred when clearing the CLUSTER field of channel <insert_0>. The operation failed with error <insert_1>.
Severity:
20 : Error

Explanation:
The queue manager has only partially been removed from the cluster. The queue manager has been suspended from the cluster and the queue manager's REPOS field has been cleared. Some of the CLUSTER fields of other associated cluster channels might also have been cleared.

Response:
To completely remove the queue manager, ensure that all the CLUSTER fields of associated cluster channels are cleared.

AMQ4192The queue manager could not be removed from a cluster because channel <insert_0> is using cluster namelist <insert_1>.
Severity:
10 : Warning

Response:
Remove the cluster channel from the cluster namelist. Ensure that you do not inadvertently affect the definitions of other objects using the namelist. Then try removing the queue manager again.

AMQ4193The information supplied could not be correctly converted to the required code page.
Severity:
20 : Error

Explanation:
All or part of the information entered required conversion to a different code page. One or more characters could not be converted to an equivalent character in the new code page.

Response:
Change the characters used, then try the operation again.

AMQ4194Request failed because the queue manager attempted to use a default transmission queue which is not valid.
Severity:
20 : Error

Explanation:
An MQOPEN or MQPUT1 call specified a remote queue as the destination. The queue manager used the default transmission queue, as there is no queue defined with the same name as the destination queue manager, but the attempt failed because the default transmission queue is not a valid local queue.

Response:
Check that the queue manager's default transmission queue property (DefXmitQName) specifies a valid local queue.

AMQ4195WebSphere MQ Explorer is now in an unknown state and should be restarted. Do you want to restart WebSphere MQ Explorer?
Severity:
10 : Warning

Explanation:
You have chosen not to wait for WebSphere MQ to respond to a request. WebSphere MQ Explorer is therefore in an unknown state and should be restarted.

Response:
Restart the WebSphere MQ Explorer and try the operation again. If the problem persists check for problem determination information.

AMQ4196The command or operation is not valid against the type of object or queue specified
Severity:
20 : Error

Explanation:
You have attempted a command or operation against an object or queue whose type is not valid for the operation specified. For instance: browsing a remote queue; issuing the clear command against a queue whose type is not QLOCAL; clearing by API calls, a queue who type cannot be opened for input.

Response:
Retry the command or operation against an object or queue whose type is valid for the operation requested.

AMQ4197An MQOPEN or MQPUT1 call was issued specifying an alias queue as the target, but the BaseQName in the alias queue attributes is not recognized as a queue name.
Severity:
20 : Error

Explanation:
An MQOPEN or MQPUT1 call was issued specifying an alias queue as the target, but the BaseQName in the alias queue attributes is not recognized as a queue name. This reason code can also occur when BaseQName is the name of a cluster queue that cannot be resolved successfully.

Response:
Correct the queue definitions.

AMQ4207The path specified is not valid.
Severity:
20 : Error

Response:
Check the path specified and try again.

AMQ4400Explorer cannot administer the queue manager because the queue <insert_0> is not defined.
Severity:
10 : Warning

Explanation:
Explorer uses the queue <insert_0> to administer queue managers.

Response:
Define the queue <insert_0> and retry.

AMQ4401Explorer cannot administer the queue manager because the user is not authorised to open the queue <insert_0>.
Severity:
10 : Warning

Explanation:
Explorer uses the queue <insert_0> to administer this queue manager.

Response:
Allow Explorer to open the queue <insert_0> and retry.

AMQ4402The queue <insert_0> could not be opened for reason <insert_1>.
Severity:
10 : Warning

Explanation:
Explorer uses the queue <insert_0> to administer this queue manager.

Response:
Allow Explorer to open the queue <insert_0> and retry.

AMQ4500Are you sure you want to forcibly remove queue manager <insert_0> from cluster <insert_1>?
Severity:
10 : Warning

Explanation:
You should only forcibly remove a queue manager from a cluster when it has already been deleted and cannot be removed from the cluster in the normal way. A confirmation is required before the queue manager is forcibly removed.

Response:
Continue only if you want to forcibly remove the queue manager.

AMQ4501The queue manager was successfully removed from the cluster.
Severity:
0 : Information

Explanation:
The queue manager will still appear as a member of the cluster until the configuration changes have been sent across the network and the cluster channels to the queue manager have become inactive. This might take a long time.

AMQ4502You have shared the queue in cluster <insert_0>. The queue manager is not a member of this cluster.
Severity:
10 : Warning

Response:
To make the queue available to the members of this cluster, you must join the queue manager to the cluster.

AMQ4503The list of values is too long.
Severity:
10 : Warning

Explanation:
The list of values that you have entered is too long. The maximum number of characters allowed for this value is <insert_0>.

AMQ4504The value is too long.
Severity:
10 : Warning

Explanation:
You have entered a value containing too many characters. The maximum number of characters allowed for each value of this attribute is <insert_0>.

AMQ4505There are too many entries in the list.
Severity:
10 : Warning

Explanation:
You have entered too many values in the list. The maximum number of values is <insert_0>.

AMQ4506Cannot connect to queue manager <insert_0>. It cannot be removed from the cluster in the normal way.
Severity:
10 : Warning

Response:
Try the operation again when the queue manager is available. If the queue manager no longer exists, you can choose to forcibly remove the queue manager from the cluster.

AMQ4507The remote queue manager is not using TCP/IP.
Severity:
10 : Warning

Explanation:
The connection information available for the remote queue manager uses a communication protocol other than TCP/IP. The WebSphere MQ Explorer cannot connect to the queue manager to remove it from the cluster in the normal way.

Response:
If the queue manager no longer exists, you can choose to forcibly remove the queue manager from the cluster.

AMQ4508The queue manager successfully left the cluster.
Severity:
0 : Information

Explanation:
The queue manager will still appear as a member of the cluster until the configuration changes have been sent across the network and the cluster channels to the queue manager have become inactive. This might take a long time.

AMQ4509The request to suspend membership of the cluster has been accepted.
Severity:
0 : Information

Response:
Message for information only.

AMQ4510The request to resume membership of the cluster has been accepted.
Severity:
0 : Information

Response:
Message for information only.

AMQ4511The queue manager is not a member of the cluster.
Severity:
0 : Information

Response:
Message for information only.

AMQ4513The request to refresh the information about the cluster has been accepted.
Severity:
0 : Information

Response:
Message for information only.

AMQ4514The queue manager is not a member of cluster <insert_0>.
Severity:
10 : Warning

Explanation:
The object that you have shared in the cluster will not be available to other members of the cluster until you make this queue manager a member of the cluster.

AMQ4515The repository queue manager for cluster <insert_0> is not available for connection.
Severity:
10 : Warning

Explanation:
Views showing cluster queues in this cluster might not be complete.

AMQ4516Cluster workload exit error.
Severity:
10 : Warning

Explanation:
The queue manager's cluster workload exit failed unexpectedly or did not respond in time.

AMQ4517Cluster resolution error.
Severity:
10 : Warning

Explanation:
The definition of the cluster queue could not be resolved correctly because a response from a repository queue manager was not available.

AMQ4518AMQ4518=A call was stopped by the cluster exit.
Severity:
10 : Warning

Explanation:
The queue manager's cluster workload exit rejected a call to open or put a message onto a cluster queue.

AMQ4519No destinations are available.
Severity:
10 : Warning

Explanation:
At the time that the message was put, there were no longer any instances of the queue in the cluster.

AMQ4520The WebSphere MQ Explorer could not initialize TCP/IP. Administration of remote queue managers and clusters is not possible.
Severity:
10 : Warning

AMQ4521The text you entered contained a comma (,) which is used as a list separator character.
Severity:
10 : Warning

Explanation:
This value does not accept lists.

Response:
If you want to use a comma as part of a value, enclose the value in double quotes.

AMQ4522The wizard was unable to add the queue manager to the cluster. 
All changes will be rolled back.
Severity:
10 : Warning

Explanation:
A problem occurred while defining objects or modifying the queue manager's properties.

Response:
Ensure that the default objects exist for the queue manager.

AMQ4523The wizard was unable to add one of the queue managers to the cluster. 
All changes will be rolled back.
Severity:
10 : Warning

Explanation:
A problem occurred while defining objects or modifying one of the queue managers' properties.

Response:
Ensure that the default objects exist for the queue manager.

AMQ4571Are you sure you want to change the location of the Key Repository for queue manager <insert_0>?
Severity:
10 : Warning

Explanation:
You might prevent the queue manager from starting if you change the Key Repository field to a location which is not valid.

Response:
Ensure that the location specified is correct before continuing.

AMQ4572The request to refresh the information about all clusters has been accepted.
Severity:
0 : Information

Response:
Message for information only.

AMQ4574IBM WebSphere Explorer is already running.
Severity:
30 : Severe error

AMQ4575An error occurred initializing the data model.
Severity:
30 : Severe error

AMQ4576The working directory <insert_0> is not valid.
Severity:
30 : Severe error

AMQ4577An error occurred initializing the process.
Severity:
30 : Severe error

AMQ4578An error occurred loading the messages file <insert_0>.
Severity:
30 : Severe error

AMQ4579An error occurred loading the system libraries.
Severity:
30 : Severe error

AMQ4580An internal method detected an unexpected system return code. The method <insert_0> returned <insert_1>.
Severity:
30 : Severe error

Response:
Examine the problem determination information on this computer to establish the cause of the error.

AMQ4581Parameter check failed on the internal function <insert_0>. The error was <insert_1>.
Severity:
30 : Severe error

Response:
Examine the problem determination information on this computer to establish the cause of the error.

AMQ4582Queue manager <insert_0> is not available for client connection.
Severity:
30 : Severe error

Response:
Ensure the queue manager is running and is configured to accept remote connections.

AMQ4583Queue manager <insert_0> is not available for connection.
Severity:
30 : Severe error

Response:
Ensure the queue manager is running.

AMQ4584Queue manager <insert_0> is not available for cluster connection.
Severity:
30 : Severe error

Response:
Ensure that the queue manager is running. If the queue manager has been deleted it might continue to be displayed as a member of a cluster for up to 30 days.

AMQ4585An internal method <insert_0> encountered an unexpected error.
Severity:
30 : Severe error

Response:
Examine the problem determination information on this computer to establish the cause of the error.

AMQ4586The attempt to create the URL for file <insert_0> failed.
Severity:
30 : Severe error

Explanation:
The file name specified was not recognized.

Response:
Ensure that the file exists at the specified location and can be read.

AMQ4587The attempt to read from URL <insert_0> failed.
Severity:
30 : Severe error

Explanation:
There was an error when the system tried to read the Client channel definition table.

Response:
Ensure that the file exists at the specified location and can be read.

AMQ4588The attempt to read from URL <insert_0> failed.
Severity:
30 : Severe error

Explanation:
There was an error when the system tried to read the file.

Response:
Ensure that the file exists at the specified location and can be read.

AMQ4589No connection was found to application <insert_0>.
Severity:
10 : Warning

Explanation:
The connection was not found. Possibly the connection was closed before the command was issued.

Response:
Check that the application connection has not been closed in the background.

AMQ4590The queue manager connection to application <insert_0> could not be closed.
Severity:
20 : Error

Explanation:
The connection could not be closed due to a PCF error.

Response:
Check for FFSTs.

AMQ4591The command server for <insert_0> is not running.
Severity:
30 : Severe error

Explanation:
The command server has stopped for some reason, so the request cannot be processed.

Response:
Start the command server. If the error persists, examine the problem determination information to see if any details have been recorded.

AMQ4592The connection was closed successfully.
Severity:
0 : Information

Explanation:
The request to close the connection to an application was successful.

Response:
Message for information only.

AMQ4593Do you really want to stop the connection to application "<insert_0>"
Severity:
0 : Information

Explanation:
WebSphere MQ explorer is about to stop a connection, stopping the connection will prevent further communication between MQ and the application in question.

Response:
Select yes if you want to stop the connection.

AMQ4700PCF command identifier (<insert_0>) not valid for queue manager <insert_1>.
Severity:
10 : Warning

Explanation:
The specified PCF command is not supported by this queue manager.

AMQ4800Error initializing <insert_0>.
Severity:
30 : Severe error

Explanation:
An error occurred while starting this application.

Response:
Check that the WebSphere MQ runtime libraries are available. 
Check that the PATH system environment variable includes the directory for these runtime libraries.)

AMQ4801Error getting the location of the help system.
Severity:
10 : Warning

Explanation:
To launch the standalone Eclipse help system, the WebSphere MQ file transfer application needs to know where it is installed.

Response:
Check that Eclipse has been installed with WebSphere MQ.

AMQ4802Error launching the help system.
Severity:
10 : Warning

Explanation:
The WebSphere MQ file transfer application failed to create an instance of the Eclipse standalone help system.

Response:
Check that Eclipse has been installed with WebSphere MQ.

AMQ4803Error starting the help system.
Severity:
10 : Warning

Explanation:
The WebSphere MQ file transfer application failed to start the standalone Eclipse system.

Response:
Check that Eclipse has been installed with WebSphere MQ.

AMQ4805Error saving the history log file.
Severity:
10 : Warning

Explanation:
The WebSphere MQ file transfer application could not read the history log file. This file is called com.ibm.mq.fta.log.ser and is in your home directory. 
On Windows, this is %APPDATA%\IBM\WebSphere MQ FileTransferApp 
On Unix, this is $HOME/.mqdata

Response:
Check that the read/write properties on this file allow you to write to it.

AMQ4806Error reading the history log.
Severity:
10 : Warning

Explanation:
The WebSphere MQ file transfer application could not read the history log file. This file is called com.ibm.mq.fta.log.ser and is in your home directory. 
On Windows, this is %APPDATA%\IBM\WebSphere MQ FileTransferApp 
On Unix, this is $HOME/.mqdata

Response:
Check that the read/write properties on this file allow you to write to it.

AMQ4807The message size specified (<insert_0>) is outside the permitted range.
Severity:
10 : Warning

Response:
Specify a value of 1000 to 100 000 000.


XXXX

4200-4217 - WebSphere MQ Default Configuration
AMQ4200There is a problem with the default configuration. Unable to display the Default Configuration window.
Explanation:
There is a problem with WebSphere MQ.

Response:
Use the 'Details>>' button to show further details about the problem and contact your systems administrator.

AMQ4201Unable to check if the computer exists.
Explanation:
WebSphere MQ was unable to check if the computer name you entered exists on your computer's domain.

Response:
Retry the operation, if the problem persists contact your systems administrator.

AMQ4202Unable to contact the computer '%1'.
Explanation:
WebSphere MQ was unable to locate a computer with this name on your computer's TCP/IP domain.

Response:
Enter a different computer name.

AMQ4203Unable to set up the default configuration.
Explanation:
WebSphere MQ was unable to set up the default configuration. This error may occur if WebSphere MQ is busy with another operation.

Response:
Retry the operation. If the problem persists, use the 'Details>>' and 'Print' buttons to record further details about the problem and contact your systems administrator.

AMQ4204Unable to join the default cluster.
Explanation:
WebSphere MQ was unable to join your computer to the default cluster. This error may occur if WebSphere MQ is busy with another operation.

Response:
Retry the operation. If the problem persists, use the 'Details>>' and 'Print' buttons to record further details about the problem and contact your systems administrator.

AMQ4205Unable to allow remote administration of the queue manager.
Explanation:
WebSphere MQ was unable change the configuration of your queue manager to allow it to be remotely administered. This error may occur if WebSphere MQ is busy with another operation.

Response:
Retry the operation. If the problem persists, use the 'Details>>' and 'Print' buttons to record further details about the problem and contact your systems administrator.

AMQ4206Unable to prevent remote administration of the queue manager.
Explanation:
WebSphere MQ was unable change the configuration of your queue manager to prevent it from being remotely administered. This error may occur if WebSphere MQ is busy with another operation.

Response:
Retry the operation. If the problem persists, use the 'Details>>' and 'Print' buttons to record further details about the problem and contact your systems administrator.

AMQ4208Show this panel again the next time the queue manager is started?
Explanation:
You can choose whether you want the same panel to be shown the next time this queue manager is started, and the default configuration is not complete.

Response:
Select whether you want the panel to be shown next time.

AMQ4209The TCP/IP name of the remote computer must not be your own computer name.
Explanation:
You have selected that the repository queue manager is on another computer, but you have entered the name of your own computer.

Response:
Enter the correct name of the repository queue manager.

AMQ4210The command server must be active to complete this operation. Use the WebSphere MQ Services to start it, then retry the operation.
Explanation:
The operation you requested needs the command server to be running.

Response:
Use WebSphere MQ Services to start the command server, then retry the operation.

AMQ4211The computer name entered must be on your local domain ('%1').
AMQ4212Unable to complete this task because you do not have authority to administer WebSphere MQ. You must be in the Administrators group, in the mqm group or logged in with the SYSTEM ID to administer WebSphere MQ.
Explanation:
Your userid is not authorized to carry out the operation you requested.

Response:
Retry the operation on a userid with the required authority, or contact your systems administrator.

AMQ4213Unable to delete the queue manager '%1' because it is being used by another program. Close any program using the queue manager, then click 'Retry'.
Explanation:
WebSphere MQ was unable to delete the old default configuration queue manager because another program is using the queue manager.

Response:
Close the programs that are using the queue manager, and click Retry.

AMQ4214The computer '%1' is not known on the network.
Explanation:
WebSphere MQ is unable to locate a computer with this name on your network.

Response:
Enter a different computer name.

AMQ4215Upgrade of the default configuration was cancelled.
Explanation:
You pressed 'Cancel' while running the default configuration wizard to upgrade the default configuration.

Response:
None

AMQ4216The WebSphere MQ services component does not have the authority it requires.
AMQ4217The MQSeriesServices component does not have the authority to create the default configuration.


4235-4238 - Prepare WebSphere MQ Wizard
AMQ4235WebSphere MQ running on this computer was unable to retrieve group membership information for user '%1'
AMQ4236WebSphere MQ running on this computer can now retrieve group membership information for user '%1'.
AMQ4237WebSphere MQ running on this computer is still unable to retrieve group membership information for user '%1'.
AMQ4238You are not authorized to run the Prepare WebSphere MQ Wizard. To run this wizard, you must be in the in the 'Administrators' group.


4250-4274 - 'Postcard' messages
AMQ4250No nickname supplied - Please supply one.
AMQ4251Cannot Initialise WinSock - TCP/IP may not be installed. Please install TCP/IP and try again.
Explanation:
Postcard was not able to initialize the interface to TCP/IP.

Response:
Check that TCP/IP has been installed successfully. If the problem persists, refer to your systems administrator.

AMQ4252Cannot Find WinSock - TCP/IP may not be installed. Please install TCP/IP and try again.
Explanation:
Postcard was not able to find the interface to TCP/IP.

Response:
Check that TCP/IP has been installed successfully. If the problem persists, refer to your systems administrator.

AMQ4253Cannot get fully qualified TCP/IP domain name - Please ensure that the TCP/IP protocol is configured.
Explanation:
Postcard was not able to determine the TCP/IP domain name for your computer.

Response:
Check that TCP/IP has been installed successfully. If the problem persists, refer to your systems administrator.

AMQ4254Failed to Allocate System Memory - Please contact your system administrator.
Explanation:
Postcard was not able to allocate enough memory to run correctly.

Response:
Close other programs to release system memory. If the problem persists, refer to your systems administrator.

AMQ4255Please supply a user name with which you wish to communicate.
AMQ4256Please supply %s's computer name (this must be a TCP/IP name). Please supply %s's computer name (this must be a TCP/IP name).
AMQ4257The call MQCONN failed while preparing for a Put operation, with Completion Code [%s (%ld)], Reason Code [%s (%ld)].
Explanation:
An error occurred when Postcard tried to connect to the queue manager in order to send the postcard. This error may occur if WebSphere MQ is busy with another operation.

Response:
Try to send the postcard again. If the problem persists contact your systems administrator.

AMQ4258The call MQOPEN failed while preparing for a Put operation, with Completion Code [%s (%ld)], Reason Code [%s (%ld)].
Explanation:
An error occurred when Postcard tried to open a queue in order to send the postcard. This error may occur if WebSphere MQ is busy with another operation.

Response:
Try to send the postcard again. If the problem persists contact your systems administrator.

AMQ4259The call MQCLOSE failed while preparing for a Put operation, with Completion Code [%s (%ld)], Reason Code [%s (%ld)].
Explanation:
An error occurred when Postcard tried to close the queue after sending the postcard. This error may occur if WebSphere MQ is busy with another operation.

Response:
If the problem persists contact your systems administrator.

AMQ4260The call MQDISC failed while preparing for a Put operation, with Completion Code [%s (%ld)], Reason Code [%s (%ld)].
Explanation:
An error occurred when Postcard tried to disconnect from the queue manager after sending the postcard. This error may occur if WebSphere MQ is busy with another operation.

Response:
If the problem persists contact your systems administrator.

AMQ4261The call MQPUT failed with Completion Code [%s (%ld)], Reason Code [%s (%ld)].
Explanation:
An error occurred when Postcard tried to send the postcard by putting its data to the queue. This error may occur if WebSphere MQ is busy with another operation.

Response:
Try to send the postcard again. If the problem persists contact your systems administrator.

AMQ4262The call MQCONN failed while preparing for a Get operation, with Completion Code [%s (%ld)], Reason Code [%s (%ld)].
Explanation:
An error occurred when Postcard tried to connect to the queue manager in order to receive postcards. This error may occur if WebSphere MQ is busy with another operation.

Response:
Restart Postcard. If the problem persists contact your systems administrator.

AMQ4263The call MQOPEN failed while preparing for a Get operation, with Completion Code [%s (%ld)], Reason Code [%s (%ld)].
Explanation:
An error occurred when Postcard tried to open a queue in order to send the postcard. This error may occur if WebSphere MQ is busy with another operation.

Response:
Restart Postcard. If the problem persists contact your systems administrator.

AMQ4264The call MQCLOSE failed while preparing for a Get operation, with Completion Code [%s (%ld)], Reason Code [%s (%ld)].
Explanation:
An error occurred when Postcard tried to close the queue after receiving postcards. This error may occur if WebSphere MQ is busy with another operation.

Response:
If the problem persists contact your systems administrator.

AMQ4265The call MQDISC failed while preparing for a Get operation, with Completion Code [%s (%ld)], Reason Code [%s (%ld)].
Explanation:
An error occurred when Postcard tried to disconnect from the queue manager after receiving postcards. This error may occur if WebSphere MQ is busy with another operation.

Response:
If the problem persists contact your systems administrator.

AMQ4266Please type in a message that you wish to send to %s.
AMQ4267The call MQGET failed with Completion Code [%s (%ld)], Reason Code [%s (%ld)].
Explanation:
An error occurred when Postcard tried to receive a postcards by getting its data from the queue. This error may occur if WebSphere MQ is busy with another operation.

Response:
Restart Postcard. If the problem persists contact your systems administrator.

AMQ4268MQI Postcard is unable to contact the queue manager on the remote computer. Verify that the default configuration is up and running on the remote computer.
AMQ4269Unable to run MQI Postcard because you do not have authority to use WebSphere MQ. You must be in the Administators group, in the mqm group, or logged in with the SYSTEM ID to use WebSphere MQ.
Explanation:
Your user Id is not authorized to use Postcard. You must be in the Administrator's group, in the mqm group, or logged in with the SYSTEM ID to use WebSphere MQ.

Response:
Use Postcard on a user Id with the required authority, or contact your systems administrator.

AMQ4270MQI Postcard is unable to send messages to the remote computer. MQI Postcard can only exchange messages with computers that are on the same TCP/IP domain as this computer (%1).
AMQ4271Unable to open a local queue called '%1' on the mailbox queue manager '%2'. Use WebSphere MQ Explorer to create the queue, then restart MQI Postcard.
Explanation:
Postcard was unable to automatically create the queue it uses on the queue manager.

Response:
Use WebSphere MQ Explorer to create the queue, and restart Postcard.

AMQ4272The mailbox queue manager '%1' does not exist on this computer.
Explanation:
The mailbox queue manager name specified after the '-m' parameter to Postcard does not exist on this computer.

Response:
Restart Postcard specifying the name of a queue manager that does exist on this computer.

AMQ4273Unable to contact the target mailbox '%1'.
Explanation:
Postcard was unable send the message as it could not contact the target mailbox.

Response:
Click 'Retry' to attempt to send the message again, otherwise click 'Cancel'.

AMQ4274MQI Postcard has detected that '%1' is the name of a computer and a queue manager.
Explanation:
Postcard has detected that the destination mailbox name is the name of a computer and of a queue manager.

Response:
Select whether you want to send the message to the computer or the queue manager with this name, then click OK.


4300-4309 - WebSphere MQ API Exerciser
AMQ4300Please supply some text in order for the MQPUT(1) operation to succeed.
Explanation:
No text has been supplied for the user so that the MQPUT or MQPUT1 operation can proceed.

Response:
Supply some text in the editable area so that the MQPUT or MQPUT1 operation can proceed.

AMQ4301Please supply some text in order for the MQPUT operation to succeed.
Explanation:
No text has been supplied for the user so that the MQPUT operation may proceed.

Response:
Supply some text in the editable area so that the MQPUT may proceed.

AMQ4302Please supply some text in order for the MQPUT1 operation to succeed.
Explanation:
No text has been supplied for the user so that the MQPUT1 operation may proceed.

Response:
Supply some text in the editable area so that the MQPUT1 may proceed.

AMQ4303The command server for the queue manager [%s] is not started. Start the command server and try again.
Explanation:
In order for the API Exerciser to function, a command server must be running.

Response:
Either start the command server from the MQServices application or run strmqcsv <Queue Manager> from the command line.

AMQ4304API Exerciser cannot enumerate objects for queue manager [%s].
Explanation:
The API Exerciser encountered a problem trying to enumerate queues.

Response:
Ensure that the command server is running (from the Service application) and that there are queues configured for the queue manager.

AMQ4305There are no queue managers present in the system. Please create one and try again.
Explanation:
The API Exerciser could not find any queue managers on the system.

Response:
Use the Services application to create one or run crtmqm <Queue Manager>.

AMQ4306Memory allocation failure. Stop some other applications and try again.
Explanation:
There are not sufficient system resources available in the system to satisfy the running of API Exerciser.

Response:
Shut some other applications down and try running the API Exerciser again.

AMQ4307API Exerciser encountered a COM failure and cannot continue. Please ensure that WebSphere MQ has been correctly installed and configured and that your user id. is a member of the mqm group.
Explanation:
When the API Exerciser started, it was unable to make a COM connection to WebSphere MQ Services.

Response:
Ensure that WebSphere MQ has been correctly installed and configured, and that your user ID is a member of the mqm group. If the problem persists, refer to your systems administrator.

AMQ4308API Exerciser cannot continue. Please ensure that the userid you are using is a member of the mqm group.
AMQ4309API Exerciser cannot continue. Please ensure that the userid you are using is a member of the Administrator group.


4350-4764 - Installation messages
AMQ4350Setup cannot continue; a later version of this product is installed.
Explanation:
Installation detected that a version of this product later than version 5.3 is already installed on the computer.

Response:
Do not attempt to install version 5.3 when a later version is already installed.

AMQ4351Uninstallation cannot continue; uninstallation is already running.
Explanation:
An attempt was made to run two copies of uninstallation at once.

Response:
Run only one copy of uninstallation at a time.

AMQ4352Setup cannot continue; a supported version of Windows(R) is required.
AMQ4353Setup cannot continue; '%s' is not an Administrator.
Explanation:
The user running installation does not have administrator authority.

Response:
Log off and log back on using a user ID with administrator authority.

AMQ4354No repository computer name entered.
AMQ4355Repository computer name is not valid.
AMQ4356Enter a remote computer name.
AMQ4357Registration failed for file '%s' (code 0x%8.8lx).
AMQ4358Unregistration failed for file '%s' (code 0x%8.8lx).
AMQ4359Unable to register file '%s'.
AMQ4360Unable to unregister file '%s'.
AMQ4361Uninstall cannot continue; Administrator logon required.
AMQ4362Failed to create the default configuration.
AMQ4363Setup could not detect the Windows NT(R) Service Pack level (Service Pack 3 or later is required). Is Service Pack 3 or later installed?
AMQ4364Setup could not detect the Windows NT Service Pack level (Service Pack 6a or later is required). Is Service Pack 6a or later installed?
AMQ4365Setup cannot continue because Service Pack 3 is not installed.
AMQ4366Setup cannot continue because Service Pack 6a or later is not installed.
AMQ4367Setup cannot continue because Internet Explorer Version 4.01 SP1 is not installed.
AMQ4368Select at least one component to proceed.
AMQ4369The 'Web Administration Server' component requires the 'Server' component.
AMQ4370Uninstallation of the 'Server' component requires uninstallation of the 'Web Administration Server' component.
AMQ4371The 'Documentation in Other Languages' component requires the 'Documentation in English' component.
AMQ4372Uninstallation of the 'Documentation in English' component requires uninstallation of the 'Documentation in Other Languages' component.
AMQ4373There is not enough space on drive %s (program files) to install these components. Please free up some disk space or modify your selections
AMQ4374There is not enough space on drive %s (data files) to install these components. Please free up some disk space or modify your selections
AMQ4375The program files top-level folder is not valid.
Explanation:
The program files top-level folder is not a valid path.

Response:
Enter a valid path.

AMQ4376The data files top-level folder is not valid.
Explanation:
The data files top-level folder is not a valid path.

Response:
Enter a valid path.

AMQ4377The log files folder is not valid.
Explanation:
The log files folder name is not a valid path.

Response:
Enter a valid path.

AMQ4378A root folder is not allowed for the program files top-level folder.
Explanation:
WebSphere MQ cannot be installed in a root folder, for example 'c:\'.

Response:
Enter a non-root folder.

AMQ4379A root folder is not allowed for the data files top-level folder.
Explanation:
WebSphere MQ cannot be installed in a root folder, for example 'c:\'.

Response:
Enter a non-root folder.

AMQ4380A root folder is not allowed for the log files folder.
Explanation:
WebSphere MQ cannot be installed in a root folder, for example 'c:\'.

Response:
Enter a non-root folder.

AMQ4381There is not enough space on drive %s (log files) to install these components. Please free up some disk space or modify your selections
AMQ4382Unable to create or replace folder '%s'
AMQ4383Uninstallation cannot continue; failed to save queue manager configuration.
Explanation:
An error occurred while saving the current queue manager configuration to a file.

Response:
Check that the registry keys under

'HKEY_LOCAL_MACHINE\SOFTWARE\IBM\WebSphere MQ'are readable by an administrator. Check that there is enough space on the drive containing the data files folder (where the configuration is being saved in file \config\config.reg). If the error persists, contact your systems administrator.

AMQ4385Unknown language specified ('%s')
AMQ4386Codepage (%d) for specified language not available.
AMQ4387Before Setup can display help, this computer's help system needs upgrading to HTML Help 1.3. Would you like to upgrade now? (You might need to restart the computer.)
AMQ4388WebSphere MQ Setup or uninstallation is already running.
AMQ4389Setup could not create a local 'mqm' group (code %d).
Explanation:
An error occurred creating a local user group called 'mqm'.

Response:
Review the installation log file for details of any problems. If the error persists, contact your systems administrator.

AMQ4390Setup could not create a global 'Domain mqm' group (code %d).
Explanation:
An error occurred creating a local user group called 'mqm'.

Response:
Review the installation log file for details of any problems. If the error persists, contact your systems administrator.

AMQ4391Setup could not find the global 'Domain mqm' group.
Explanation:
The global 'mqm ' group was created, but could not then be found.

Response:
Review the installation log file for details of any problems. If the error persists, contact your systems administrator.

AMQ4392Setup could not add the global 'Domain mqm' group to the local 'mqm' group (code %d).
Explanation:
An error occurred adding the global 'mqm' group to the local 'mqm' group.

Response:
Review the installation log file for details of any problems. If the error persists, contact your systems administrator.

AMQ4393No ports were specified; no listeners will be created.
AMQ4394No queue managers are selected for remote administration.
AMQ4395One or more 'Server' component prerequisites were not selected; the component cannot be installed.
AMQ4396One or more prerequisite upgrades were not selected; WebSphere MQ will not operate correctly.
AMQ4397Cannot install on a network drive (drive %s),
AMQ4703One or more problems occurred during Setup. Review '%s' for details
Explanation:
Setup was only partially successful.

Response:
Review the installation log file for details of any problems.

AMQ4704If specified, TCP/IP domain must be '%s'
AMQ4705Current maintenance level is '%s'. Re-apply maintenance after Setup completes.
Explanation:
Some service has been applied to the current installation. Installation or reinstallation of WebSphere MQ components might regress some files.

Response:
Review the instructions that came with the service that was applied. If necessary re-apply the service.

AMQ4706Dialog '%s' failed.
AMQ4707Error migrating '%s'.
Explanation:
An error occurred migrating a .ini file to the registry.

Response:
Review the installation log file for details of any problems. If the error persists, contact your systems administrator.

AMQ4708Error creating remote administration channel for queue manager '%s'.
AMQ4709Error creating TCP/IP listener for queue manager '%s'.
AMQ4710Error updating '%s' environment variable.
AMQ4711One or more problems occurred during uninstallation. Review '%s' for details
Explanation:
Uninstallation was only partially successful.

Response:
If the installation log file is available, review it for details of any problems. If the error persists, contact your systems administrator.

AMQ4712The WebSphere MQ Service failed to stop.
Explanation:
An error occurred trying to stop the WebSphere MQ service

Response:
Review the installation log file for details of any problems. If the error persists, contact your systems administrator.

AMQ4713The WebSphere MQ Service failed to start.
Explanation:
An error occurred trying to start the WebSphere MQ service

Response:
Review the installation log file for details of any problems. If the error persists, contact your systems administrator.

AMQ4714Failed to delete the WebSphere MQ Service (code %d).
Explanation:
An error occurred trying to delete the WebSphere MQ service

Response:
Review the installation log for details of any problems. If the error persists, contact your systems administrator.

AMQ4715Failed to add the WebSphere MQ Service.
Explanation:
An error occurred trying to create the WebSphere MQ service

Response:
Review the installation log for details of any problems. If the error persists, contact your systems administrator.

AMQ4716Can't load '%s'.
AMQ4717Can't start dialog '%s'.
AMQ4718Can't load performance counters (code 0x%8.8lx).
Explanation:
An error occurred trying to register the WebSphere MQ performance counter library.

Response:
Review the installation log for details of any problems. If the error persists, contact your systems administrator.

AMQ4719Error migrating queue manager command files.
Explanation:
An error occurred migrating queue manager command files.

Response:
Review the installation log for details of any problems. If the error persists, contact your systems administrator.

AMQ4720Error initializing security environment.
AMQ4721WebSphere MQ messages language changed to %s.
AMQ4722Setup cannot continue without VGA or better screen resolution.
Explanation:
Setup was run using a monitor resolution less than VGA resolution.

Response:
Use a monitor with resolution equal to or better than 640 x 480 pixels.

AMQ4723Error during uninstaller initialization. You might not be able to uninstall WebSphere MQ.
AMQ4724Error restoring queue manager configuration.
Explanation:
An error occurred restoring queue manager configuration from the config.reg file in the data directory.

Response:
Review the installation log file for details of any problems. If the error persists, contact your systems administrator.

AMQ4725The Server feature cannot be installed without 800 x 600 or better screen resolution.
AMQ4726The 'Internet Gateway' component requires the 'Windows NT Client' component.
AMQ4727Uninstallation of the 'Windows NT Client' component requires uninstallation of the 'Internet Gateway' component.
AMQ4728Setup could not create a default configuration because some files were locked. Run WebSphere MQ First Steps after restarting the computer.
AMQ4729You cannot install the Windows Client from the client CD because WebSphere MQ server components are already installed on this computer. To install the Windows Client on this computer, use the server CD.
Explanation:
An attempt has been made to install a feature using the Windows client CD when one or more features have already been installed using the server CD. This is not allowed. Either uninstall the server features first, or use only the server CD.

Response:
Review the installation log file for details of any problems. If the error persists, contact your systems administrator.

AMQ4730Java(TM) support is now separately available; see the installation notes. Setup will delete existing MQSeries(R) V5.1 Java files.
AMQ4731Setup cannot continue without SVGA or better screen resolution (800 x 600).
AMQ4732No installation language specified. Use the TRANSFORMS property.
Explanation:
An attempt was made to invoke an installation without specifying a user-interface language. Use the TRANSFORMS property to specify a language.

Response:
Review the installation log file for details of any problems. If the error persists, contact your systems administrator.

AMQ4733Unable to launch program '%s'.
Explanation:
An error occurred trying to execute the indicated program.

Response:
Review the installation log file for details of any problems. If the error persists, contact your systems administrator.

AMQ4734Can't open file '%s'.
Explanation:
Setup was unable to open the indicated file for reading.

Response:
Review the installation log file for details of any problems. If the error persists, contact your systems administrator.

AMQ4735Error %1 reading response file '%2'.
Explanation:
An error occurred.

Response:
Review the installation log file for details of any problems. If the error persists, contact your systems administrator.

AMQ4736Error %1 creating response file '%2'.
Explanation:
An error occurred restoring queue manager configuration from the config.reg file in the data directory.

Response:
Review the installation log file for details of any problems. If the error persists, contact your systems administrator.

AMQ4737Unknown value '%1' for property '%2' in '%3'.
AMQ4738Repair option is not supported.
AMQ4739One or more problems occurred. Review the trace and/or log file for details.
AMQ4740Unknown feature(s) '%1' in command-line property '%2'.
Explanation:
A property, for example ADDLOCAL, containing a feature-list was specified, but one or more of the feature names was invalid.

Response:
Remove the invalid feature name. If the error persists, contact your systems administrator.

AMQ4742Unknown feature(s) '%1' in property '%2' in '%3'.
Explanation:
A property, for example ADDLOCAL, containing a feature-list was specified, but one or more of the feature names was invalid.

Response:
Remove the invalid feature name. If the error persists, contact your systems administrator.

AMQ4743Unknown property '%1' in '%2'.
AMQ4744Have you purchased sufficient license units to install IBM(R) WebSphere MQ on this computer? (For further information on license units refer to the Quick Beginnings book.)
Explanation:
You must purchase the appropriate number of license units for the number of processors in this computer.

Response:
If you have purchased the appropriate number of license units, reply "Yes", otherwise reply "No".

AMQ4745After the upgrade you might need to reboot. Is it OK to proceed?
AMQ4745This installation requires %d license units to have been purchased with IBM WebSphere MQ (for further information on license units refer to the Quick Beginnings book). If you do not know how many license units have been purchased, ask your system administrator or vendor. Have sufficient license units been purchased for this installation?
AMQ4746Setup needs to install or upgrade this computer to version 2.0 of Microsoft(R) Windows Installer. (MSI). OK to proceed (you might need to reboot)?
Explanation:
A version of Microsoft Windows Installer (MSI) earlier than 2.0 is installed. WebSphere MQ Setup requires at least version 2.0.

Response:
Reply "Yes" to install MSI version 2.0, otherwise "No". To install WebSphere MQ, version 2.0 is required.

AMQ4747You must reboot before continuing with installation. Do you want to reboot now?
AMQ4748Can't install on top of an Early-Release installation. Uninstall the Early Release first.
Explanation:
An attempt was made to install WebSphere MQ on top of an Early Version ("beta").

Response:
Uninstall the Early Version before proceeding. If the error persists, contact your systems administrator.

AMQ4749Can't install an Early Release on top of Version %s. Uninstall first.
Explanation:
An attempt was made to install an Early Version ("beta") on top of another version of WebSphere MQ.

Response:
Uninstall the existing version of WebSphere MQ before proceeding with the Early Version. If the error persists, contact your systems administrator.

AMQ4750Can't convert. Need production version of WebSphere MQ.
Explanation:
The property TRIALTOPROD has been specified but the WebSphere MQ version on the CD is not a production version.

Response:
Do not specify TRIALTOPROD if the version of WebSphere MQ you are using is not a production version. If the error persists, contact your systems administrator.

AMQ4751Can't convert. Installed product is not an Evaluation Copy.
Explanation:
The property TRIALTOPROD has been specified but the WebSphere MQ version installed is not an Evaluation Copy.

Response:
Do not specify TRIALTOPROD if the installed version of WebSphere MQ is not an Evaluation Copy. If the error persists, contact your systems administrator.

AMQ4752You have insufficient license units for this installation, and you must purchase additional units from your vendor. You can continue to install WebSphere MQ, but this status will be recorded in the error log. For information on how to inform WebSphere MQ when you have purchased sufficient license units, refer to the System Administration Guide. Do you want to proceed with WebSphere MQ installation?
Explanation:
You replied "No" to message number AMQ4744.

Response:
Reply "Yes" to continue the installation, or "No" to cancel. Make sure that you purchase the appropriate number of license units.

AMQ4753SupportPac(TM) MC74 (Microsoft Cluster Server support) is installed on this system. You must uninstall the SupportPac before installing WebSphere MQ server; see the Installation Guide.
Explanation:
WebSphere MQ installation requires that SupportPac MC74 be uninstalled.

Response:
For more information, see the WebSphere MQ Installation Guide.

AMQ4754SupportPac MS0J (Web Administration Server) is installed on this computer. You must uninstall the SupportPac before uninstalling WebSphere MQ server.
AMQ4756IBM WebSphere MQ Version %s is not installed.
AMQ4757IBM WebSphere MQ files are in use. Stop activity and retry.
Explanation:
One or more WebSphere MQ files are being used by processes running on the system.

Response:
Ensure that all queue managers, listeners, channels and WebSphere MQ services are stopped. Use the Windows task manager to ensure that there are no processes running with an "amq", "runmq", or "strmq" prefix. Stop any tools used to monitor WebSphere MQ and stop any Performance Monitor tasks. If the process locking WebSphere MQ files cannot be identified, installation can usually procede following a reboot, with the WebSphere MQ service disabled.

AMQ4758Maintenance levels out of sequence. Latest maintenance applied is %s.
AMQ4759SupportPac MA0C (Publish And Subscribe) is installed on this system. You must uninstall the SupportPac before installing WebSphere MQ server; see the Installation Guide.
AMQ4760Error installing %s. Examine log file '%s'.
AMQ4761Error uninstalling %s. Examine log file '%s'.
AMQ4762No maintenance is installed.
AMQ4763Cannot overwrite WebSphere MQ Version %1 with Version %2.
Explanation:
The service level you are trying to install is less than that already installed. The installation will not proceed.

Response:
Check that you are attempting to install the latest CSD.

AMQ4764WebSphere MQ 'Client' feature at version %1 maintenance level %2 is not installed.


5000-5999 - Installable services
See Reading a message for an explanation of how to interpret these messages.

AMQ5005Unexpected error
Severity:
20 : Error

Explanation:
An unexpected error occurred in an internal function of the product.

Response:
Save the generated output files and contact your IBM support center.

AMQ5006Unexpected error: rc = <insert_1>
Severity:
20 : Error

Explanation:
An unexpected error occurred in an internal function of the product.

Response:
Save the generated output files and contact your IBM support center.

AMQ5008An essential WebSphere MQ process <insert_1> (<insert_3>) cannot be found and is assumed to be terminated.
Severity:
40 : Stop Error

Explanation:
1) A user has inadvertently terminated the process. 2) The system is low on resources. Some operating systems terminate processes to free resources. If your system is low on resources, it is possible it has terminated the process so that a new process can be created.

Response:
WebSphere MQ will stop all MQ processes. Inform your systems administrator. When the problem is rectified WebSphere MQ can be restarted.

AMQ5009WebSphere MQ agent process <insert_1> has terminated unexpectedly.
Severity:
40 : Stop Error

Explanation:
WebSphere MQ has detected that an agent process has terminated unexpectedly. The queue manager connection(s) that this process is responsible for will be broken.

Response:
Use any previous FFSTs to determine the reason for the failure. Try to eliminate the following reasons before contacting your IBM support center. 
1) A user has inadvertently terminated the process. 
2) The system is low on resources. Some operating systems terminate processes to free resources. If your system is low on resources, it is possible that the operating system has terminated the process so that a new process can be created.

AMQ5010The system is restarting the WorkLoad Management Server process.
Severity:
10 : Warning

Explanation:
The system has detected that the WorkLoad Management server process (amqzlwa0, pid:<insert_1>) has stopped and is restarting it.

Response:
Save the generated output files which may indicate the reason why the WorkLoad Management process stopped. If the reason the WorkLoad Management Server process stopped is a problem in a WorkLoad Management user exit, correct the problem, otherwise contact your IBM support center.

AMQ5011The Queue Manager ended for reason <insert_1> <insert_3>
Severity:
10 : Warning

Explanation:
The Queue Manager ended because of a previous error <insert_1> or <insert_3>

Response:
This message should be preceded by a message or FFST information from the internal routine that detected the error. Take the action associated with the earlier error information.

AMQ5019Unable to access program <insert_3>.
Severity:
40 : Stop Error

Explanation:
A request was made to execute the program <insert_3>, however the operation was unsuccessful because the program could not be found in the specified location.

Response:
Check the definition of the service specifies the correct and full path to the program to run. If the path is correct then verify that the program exists in the specified location and that WebSphere MQ userid has permission to access it.

AMQ5020Permission denied attempting to execute program <insert_3>.
Severity:
40 : Stop Error

Explanation:
A request was made to execute the program <insert_3>, however the operation was unsuccessful because the WebSphere MQ operating environment has insufficient permissions to access the program file.

Response:
Check the access permissions of the of the program to be executed and if necessary alter them to include execute permission for the WebSphere MQ userId. Also check that the WebSphere MQ userId has search access on all directories which compose the path to the program file.

AMQ5021Unable to start program <insert_3>.
Severity:
40 : Stop Error

Explanation:
A request was made to execute the program <insert_3> however the operation was unsuccessful. Reasons for the failure may include 
a shortage of available system resources 
a problem with the program to be started

Response:
If the problem persists then the WebSphere MQ error logs should be consulted for further information related to this error. The Operating System error recording facilities should also be consutled for information relating to shortage of system resources.

AMQ5022The Channel Initiator has started. ProcessId(<insert_1>).
Severity:
0 : Information

Explanation:
The Channel Initiator process has started.

Response:
None.

AMQ5023The Channel Initiator has ended. ProcessId(<insert_1>).
Severity:
0 : Information

Explanation:
The Channel Initiator process has ended.

Response:
None.

AMQ5024The Command Server has started. ProcessId(<insert_1>).
Severity:
0 : Information

Explanation:
The Command Server process has started.

Response:
None.

AMQ5025The Command Server has ended. ProcessId(<insert_1>).
Severity:
0 : Information

Explanation:
The Command Server process has ended.

Response:
None.

AMQ5026The Listener <insert_3> has started. ProcessId(<insert_1>).
Severity:
0 : Information

Explanation:
The Listener process has started.

Response:
None.

AMQ5027The Listener <insert_3> has ended. ProcessId(<insert_1>).
Severity:
0 : Information

Explanation:
The Listener process has ended.

Response:
None.

AMQ5028The Server <insert_3> has started. ProcessId(<insert_1>).
Severity:
0 : Information

Explanation:
The Server process has started.

Response:
None.

AMQ5029The Server <insert_3> has ended. ProcessId(<insert_1>).
Severity:
0 : Information

Explanation:
The Server process has ended.

Response:
None.

AMQ5030The Command <insert_3> has started. ProcessId(<insert_1>).
Severity:
0 : Information

Explanation:
The Command has started.

Response:
None.

AMQ5032Error (<insert_4>) accessing file <insert_3>.
Severity:
40 : Stop Error

Explanation:
While attempting to access the file <insert_3> the error <insert_4> occurred.

Response:
Use the information contained in the error to locate and correct the cause of the failure.

AMQ5036Error detected processing line <insert_1>, position <insert_2> in service environment file.
Severity:
40 : Stop Error

Explanation:
While processing the environment file <insert_3> an error was detected on line <insert_1> at position <insert_2>. Possible causes are 
Variable name too long 
Variable value too long 
Incorrectly formed line. Lines must be in the format <name>=<value>. There should be no blank characters in name field. All characters following the '=' are part of the value field.

Response:
This error will not stop the command from executing but any data on the invalid line is not processed.

AMQ5037The Queue Manager task <insert_3> has started.
Severity:
0 : Information

Explanation:
The Utility Task Manager, processId(<insert_1>) type(<insert_2>), has started the <insert_3> task.

Response:
None.

AMQ5038The Queue Manager task <insert_3> failed to start with error-code <insert_1>.
Severity:
40 : Stop Error

Explanation:
The Utility Task Manager, attempted to start the task <insert_3> but the start request failed with error code <insert_1>.

Response:
The failure to start the identified task may not be critical to queue-manager operation however all of the queue manager functionality may not be available. Further details of the failure are available in WebSphere MQ error logs.

AMQ5041The Queue Manager task <insert_3> has ended.
Severity:
0 : Information

Explanation:
The Queue Manager task <insert_3> has ended.

Response:
None.

AMQ5042Request to start <insert_3> failed.
Severity:
40 : Stop Error

Explanation:
The request to start the process <insert_3> failed.

Response:
Consult the Queue Manager error logs for further details on the cause of the failure.

AMQ5043Statistics recording is unavailable due to error code <insert_1>.
Severity:
40 : Stop Error

Explanation:
The statistics collection task was unable to start due the error code <insert_1>. Statistics collection will be unavailable until the problem is rectified and the Queue Manager is restarted.

Response:
Consult the Queue Manager error logs for further details on the cause of the failure.

AMQ5044<insert_3> task operation restricted due to Reason Code <insert_1>.
Severity:
10 : Warning

Explanation:
The <insert_3> task encountered a non-fatal error which may effect the operation of the task.

Response:
Using the Reason Code <insert_1> and any previous messages recorded in the Error Logs correct the error. It may be necessary to restart the Queue Manager in order remove the restriction caused by the failure.

AMQ5045System reconfiguration event received
Severity:
0 : Information

Explanation:
The Queue Manager received a system re-configuration event. This is likely to have been caused by an administrative change in the configuration of the machine (for example dynamically adding or removing resources such as memory or processors).

Response:
No action is required unless this notification was unexpected.

AMQ5203An error occurred calling the XA interface.
Severity:
0 : Information

Explanation:
The error number is <insert_2> where a value of 
1 indicates the supplied flags value of <insert_1> was invalid, 
2 indicates that there was an attempt to use threaded and non-threaded libraries in the same process, 
3 indicates that there was an error with the supplied queue manager name <insert_3>, 
4 indicates that the resource manager id of <insert_1> was invalid, 
5 indicates that an attempt was made to use a second queue manager called <insert_3> when another queue manager was already connected, 
6 indicates that the Transaction Manager has been called when the application isn't connected to a queue manager, 
7 indicates that the XA call was made while another call was in progress, 
8 indicates that the xa_info string <insert_3> in the xa_open call contained an invalid parameter value for parameter name <insert_4>, 
9 indicates that the xa_info string <insert_3> in the xa_open call is missing a required parameter, parameter name <insert_4>, and 
10 indicates that MQ was called in dynamic registration mode but cannot find the ax_reg and ax_unreg functions ! Either call MQ in non-dynamic registration mode or supply the correct library name via the AXLIB parameter in the xa_open string.

Response:
Correct the error and try the operation again.

AMQ5204A non-threaded application tried to run as a Trusted application.
Severity:
10 : Warning

Explanation:
Only applications linked with the threaded MQ libraries can run as Trusted applications.

Response:
Make sure that the application is relinked with the threaded MQ libraries, or set the the environment variable MQ_CONNECT_TYPE to STANDARD.

AMQ5205File or directory <insert_3> not owned by user <insert_4>.
Severity:
40 : Stop Error

Explanation:
WebSphere MQ has detected that the file or directory <insert_3> is not owned by the user <insert_4>. This is not necessarily an error but you should investigate further if this is unexpected.

Response:

If this is unexpected then you should alter the ownership of the file or directory back to the user <insert_4>. 
If this is expected, then WebSphere MQ will continue however WebSphere MQ will be unable to verify the security of this file or directory. If the access permissions are too strict then you may encounter problems if WebSphere MQ cannot access the contents of the file or directory. If the access permissions are too relaxed then there may be an increased risk to the security of the WebSphere MQ system.

AMQ5206Duplicate parameters detected.
Severity:
10 : Warning

Explanation:
WebSphere MQ has detected that the activity about to be displayed contains two or more parameters in the same group with the same parameter identifier. The activity may be displayed incorrectly.

Response:
Inform the author of the activity that there may be an error in it.

AMQ5358WebSphere MQ could not load AX support module <insert_3>.
Severity:
20 : Error

Explanation:
An error has occurred loading the AX support module <insert_3>. This module needs to be loaded so that dynamically-registering resource managers, such as DB2, can participate in global units of work.

Response:
Look for a previous message outlining the reason for the load failure. Message AMQ6175 should have been issued if the load failed because of a system error. If this is the case then follow the guidance given in message AMQ6175 to resolve the problem. In the absence of prior messages or FFST information related to this problem check that the AX support module and the mqmax library have been correctly installed on your system.

AMQ5501There was not enough storage to satisfy the request
Severity:
20 : Error

Explanation:
An internal function of the product attempted to obtain storage, but there was none available.

Response:
Stop the product and restart it. If this does not resolve the problem, save the generated output files and contact your IBM support center.

AMQ5502The CDS directory name <insert_3> is not in the correct format.
Severity:
20 : Error

Explanation:
An internal function of the DCE Naming service found a CDS directory name in the wrong format. The name was expected to start with either '/...' for a fully qualified name (from global root), or '/.:' for a partially qualified name (from local cell root).

Response:
Save the generated output files and contact your IBM support center.

AMQ5503The name of the local DCE cell cannot be determined, status = <insert_1>
Severity:
20 : Error

Explanation:
The DCE Naming Service attempted to determine the name of the local DCE cell by calling 'dce_cf_get_cell_name()', which returned a nonzero return code.

Response:
Save the generated output files and contact your IBM support center.

AMQ5504DCE error. No value for the XDS attribute found.
Severity:
20 : Error

Explanation:
The DCE Naming service called om_get() to get the entry from the object returned by ds_read(). Although the status was correct, no objects were returned.

Response:
Save the generated output files and contact your IBM support center.

AMQ5505DCE error. No value for the XDS attribute number <insert_1> found.
Severity:
20 : Error

Explanation:
The DCE Naming service called om_get() to get the entry from the object returned by ds_read(). Although the status was correct, no objects were returned.

Response:
Save the generated output files and contact your IBM support center.

AMQ5506DCE error. <insert_3> returned <insert_1> for attribute number <insert_2>.
Severity:
20 : Error

Explanation:
The DCE Naming service queried an object by calling <insert_3> which returned a nonzero return code.

Response:
Save the generated output files and contact your IBM support center.

AMQ5507DCE error. <insert_3> failed for an unknown reason.
Severity:
20 : Error

Explanation:
An unexpected error occurred in an internal function of the DCE Naming service.

Response:
Save the generated output files and contact your IBM support center.

AMQ5508DCE error. The requested attribute is not present.
Severity:
20 : Error

Explanation:
The DCE Naming service was attempting to extract the value from an attribute, but the attribute cannot be found in the XDS object.

Response:
Save the generated output files and contact your IBM support center.

AMQ5509DCE error. The XDS workspace cannot be initialized.
Severity:
20 : Error

Explanation:
The DCE Naming service called 'ds_initialize()' to initialize the XDS workspace, but 'ds_initialize()' returned a nonzero return code.

Response:
Save the generated output files and contact your IBM support center.

AMQ5510DCE error. <insert_3> returned with problem <insert_1>.
Severity:
20 : Error

Explanation:
The DCE Naming service found an unexpected XDS error.

Response:
Save the generated output files and contact your IBM support center.

AMQ5511Installable service component <insert_3> returned <insert_4>.
Severity:
20 : Error

Explanation:
The internal function, that adds a component to a service, called the component initialization process. This process returned an error.

Response:
Check the component was installed correctly. If it was, and the component was supplied by IBM, then save the generated output files and contact your IBM support center. If the component was not supplied by IBM, save the generated output files and follow the support procedure for that component.

AMQ5511 (iSeries)An installable service component returned an error.
Severity:
20 : Error

Explanation:
Installable service component <insert_3> returned <insert_4>. The internal function, that adds a component to a service, called the component initialization process. This process returned an error.

Response:
Check the component was installed correctly. If it was, and the component was supplied by IBM, then save the generated output files and contact your IBM support center. If the component was not supplied by IBM, save the generated output files and follow the support procedure for that component.

AMQ5512Installable service component <insert_3> returned <insert_4> for queue manager name = <insert_5>.
Severity:
20 : Error

Explanation:
An installable service component returned an unexpected return code.

Response:
Check the component was installed correctly. If it was, and the component was supplied by IBM, then save the generated output files and contact your IBM support center. If the component was not supplied by IBM, save the generated output files and follow the support procedure for that component.

AMQ5512 (iSeries)An installable service component returned an unexpected return code.
Severity:
20 : Error

Explanation:
Installable service component <insert_3> returned <insert_4> for queue manager name = <insert_5>.

Response:
Check the component was installed correctly. If it was, and the component was supplied by IBM, then save the generated output files and contact your IBM support center. If the component was not supplied by IBM, save the generated output files and follow the support procedure for that component.

AMQ5513<insert_3> returned <insert_1>.
Severity:
20 : Error

Explanation:
An unexpected error occurred.

Response:
Save the generated output files and contact your IBM support center.

AMQ5519Bad DCE identity. Status = <insert_1>, auth = <insert_2>, keytab file = <insert_3>, principal = <insert_4>.
Severity:
20 : Error

Explanation:
The keytab file was not installed correctly, or the WebSphere MQ user ID has a different password from that used to create the keytab file.

Response:
Make sure that the MQ user ID defined when the product was installed has the same password as that defined by the keytab file, and that the keytab file has been installed correctly.

AMQ5519 (iSeries)Bad DCE identity.
Severity:
20 : Error

Explanation:
Status = <insert_1>, auth = <insert_2>, keytab file = <insert_3>, principal = <insert_4>. The keytab file was not installed correctly, or the WebSphere MQ user ID has a different password from that used to create the keytab file.

Response:
Make sure that the MQ user ID defined when the product was installed has the same password as that defined by the keytab file, and that the keytab file has been installed correctly.

AMQ5520The system could not load the module <insert_5> for the installable service <insert_3> component <insert_4>. The system return code was <insert_1>. The Queue Manager is continuing without this component.
Severity:
10 : Warning

Explanation:
The queue manager configuration data included a stanza for the installable service <insert_3> component <insert_4> with the module <insert_5>. The system returned <insert_1> when it tried to load this module. The Queue Manager is continuing without this component.

Response:
Make sure that the module can be loaded. Put the module into a directory where the system can load it, and specify its full path and name in the configuration data . Then stop and restart the queue manager.

AMQ5520 (iSeries)The system could not load a module. The Queue Manager is continuing without this component.
Severity:
10 : Warning

Explanation:
The queue manager configuration data included a stanza for the installable service <insert_3> component <insert_4> with the module <insert_5>. The system returned <insert_1> when it tried to load this module. The Queue Manager is continuing without this component.

Response:
Make sure that the module can be loaded. Put the module into a directory where the system can load it, and specify its full path and name in the configuration data . Then stop and restart the queue manager.

AMQ5521The system could not open "<insert_3>".
Severity:
10 : Warning

Explanation:
The system failed to open the default object "<insert_3>" at connect time for reason <insert_4>. This may be because "<insert_3>" has been deleted or changed.

Response:
Recreate the default objects by running "strmqm -c <qmgr>" (where <qmgr> is the name of the queue manager) and retry the application.

AMQ5522A WebSphere MQ installable service component could not be initialized.
Severity:
20 : Error

Explanation:
An installable service component returned an unexpected return code.

Response:
Check the queue manager error logs for messages explaining which installable service could not be initialized and why that service could not be initialized. Check the component was installed correctly. If it was, and the component was supplied by IBM, then save the generated output files and contact your IBM support center. If the component was not supplied by IBM, save the generated output files and follow the support procedure for that component.

AMQ5524The WebSphere MQ Object Authority Manager has failed to migrate authority data.
Severity:
20 : Error

Explanation:
The Object Authority Manager has attempted to migrate existing queue manager authority data from a previous version of an Object Authority Manager and failed.

Response:
Check this log for any previous related messages, follow their recommendations then restart the queue manager.

AMQ5525The WebSphere MQ Object Authority Manager has failed.
Severity:
20 : Error

Explanation:
The Object Authority Manager has failed to complete an MQ request.

Response:
Check the queue manager error logs for messages explaining the failure and try to correct the problem accordingly.

AMQ5526The WebSphere MQ Object Authority Manager has failed with reason <insert_1>
Severity:
20 : Error

Explanation:
The Object Authority Manager has failed an operation on the Object Authority Manager's data queue <insert_3> with reason <insert_1>.

Response:
Investigate why the error has occured and correct the problem.

AMQ5527The WebSphere MQ Object Authority Manager has failed to locate an essential authority file
Severity:
20 : Error

Explanation:
The Object Authority Manager has failed to locate the authority file <insert_3>. The migration of authority data cannot continue until the file has been restored. The queue manager will shutdown.

Response:
Restore the authority file mentioned above and restart the queue manager.

AMQ5528The WebSphere MQ Object Authority Manager has failed to locate an object's authority file
Severity:
20 : Error

Explanation:
The Object Authority Manager has failed to locate the authority file for the object <insert_3> of type (<insert_1>). The authority access to this object will initially be limited to members of the mqm group. Where type is one of the following: 
1) Queue 
2) Namelist 
3) Process 
5) Queue Manager

Response:
To extend access to this object use the setmqaut command, see the WebSphere MQ System Administration documentation for details.

AMQ5529The Remote OAM Service is not available.
Severity:
20 : Error

Explanation:
The Remote OAM service is not available. The <insert_1> call returned <insert_1>, errno <insert_2> : <insert_3>. The context string is <insert_4>

Response:
To extend access to this object use the setmqaut command, see the WebSphere MQ System Administration documentation for details.

AMQ5600Usage: crtmqm [-z] [-q] [-c Text] [-d DefXmitQ] [-h MaxHandles] 
[-g ApplicationGroup]
Severity:
0 : Information

Explanation:
This shows the correct usage.

Response:
None.

AMQ5601[-t TrigInt] [-u DeadQ] [-x MaxUMsgs] [-lp LogPri] [-ls LogSec]
Severity:
0 : Information

Response:
None.

AMQ5602[-lc | -ll] [-lf LogFileSize] [-ld LogPath] QMgrName
Severity:
0 : Information

Explanation:
This shows the correct usage.

Response:
None.

AMQ5602 (iSeries)[-ll] [-lf LogFileSize] [-ld LogPath] [-lz ASPNum] QMgrName
Severity:
0 : Information

Explanation:
This shows the correct usage.

Response:
None.

AMQ5603Usage: dltmqm [-z] QMgrName
Severity:
0 : Information

Explanation:
This shows the correct usage.

Response:
None.

AMQ5604Usage: dspmqaut [-m QMgrName] [-n ObjName] -t ObjType (-p Principal | -g Group) [-s ServiceComponent]
Severity:
0 : Information

Response:
None.

AMQ5605Usage: endmqm [-z] [-c | -w | -i | -p] QMgrName
Severity:
0 : Information

Explanation:
This shows the correct usage.

Response:
None.

AMQ5606Usage: setmqaut [-m QMgrName] [-n ObjName] -t ObjType (-p Principal | -g Group) [-s ServiceComponent] Authorizations
Severity:
0 : Information

Response:
None.

AMQ5607Usage: strmqm [-a|-c|-r][-d none|minimal|all][-z][-ns] [QMgrName]
Severity:
0 : Information

Explanation:
This shows the correct usage.

Response:
None.

AMQ5608Usage: dspmqtrn [-m QMgrName] [-e] [-i]
Severity:
0 : Information

Explanation:
This shows the correct usage.

Response:
None.

AMQ5609Usage: rsvmqtrn -m QMgrName (-a | ((-b | -c | -r RMId) Transaction,Number))
Severity:
0 : Information

Explanation:
This shows the correct usage.

Response:
None.

AMQ5610 (iSeries)Usage: strmqtrc [-m QMgrName] [-e] [-t TraceType] [-o mqm|pex|all] [-x TraceType] [-l MaxFileSize] [-d UserDataSize]
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ5610 (Unix)Usage: strmqtrc [-m QMgrName] [-e] [-t TraceType] [-x TraceType] 
[-l MaxFileSize] [-d UserDataSize]
Severity:
0 : Information

Explanation:
This applies to UNIX systems. MaxFileSize is the maximum size of a trace file in millions of bytes. UserDataSize is the size of user data to be traced in bytes.

Response:
None.

AMQ5610 (Windows)Usage: strmqtrc [-t TraceType] [-x TraceType] [-l MaxFileSize] [-d UserDataSize]
Severity:
0 : Information

Explanation:
This applies to Windows NT and Windows 2000 systems only. MaxFileSize is the maximum size of a trace file in millions of bytes. UserDataSize is the size of user data to be traced in bytes.

Response:
None.

AMQ5611 (Unix)Usage: endmqtrc [-m QMgrName] [-e] [-a]
Severity:
0 : Information

Explanation:
This applies to UNIX systems.

Response:
None.

AMQ5611 (Windows)Usage: endmqtrc
Severity:
0 : Information

Explanation:
This applies to Windows NT and Windows 2000 systems only.

Response:
None.

AMQ5612Usage: dspmqtrc [-t TemplateFile] [-hs] [-o OutputFileName] [-C InputFileCCSID] InputFileName(s)
Severity:
0 : Information

Explanation:
Options: -t Template file for formatting trace data -h Skip the trace file header -s Summary (format only the trace header) -o Save trace output to file -C Specifies the CCSID value for the input file

Response:
None.

AMQ5613Usage: dspmq [-m QMgrName] [-o status | -s] [-o default]
Severity:
0 : Information

AMQ5614Usage: setmqtry
Severity:
0 : Information

Explanation:
This shows the correct usage.

Response:
None.

AMQ5615Default objects cannot be created: CompCode = <insert_1> Reason = <insert_2>.
Severity:
20 : Error

Explanation:
During the creation of a queue manager, using the crtmqm command, the default objects could not be created. Possible reasons for this include another command, issued elsewhere, quiescing or stopping the queue manager, or insufficient storage being available.

Response:
Use the Completion and Reason codes shown in the message to determine the cause of the failure, then re-try the command.

AMQ5616Usage: setmqprd LicenseFile
Severity:
0 : Information

Explanation:
This shows the correct usage.

Response:
None.

AMQ5617Default objects cannot be created.
Severity:
20 : Error

Explanation:
During the creation of a queue manager using the crtmqm command, the default objects could not be created. The most likely reason for this error is that the queue manager was started before the crtmqm command had completed.

Response:
Ensure that the queue manager being created is not started before the create request completes. Stop the queue manager if it is already running. Restart the queue manager using the strmqm command with the '-c' option to request that the default objects are created.

AMQ5618integer
Severity:
0 : Information

AMQ5619string
Severity:
0 : Information

AMQ5620channel_name
Severity:
0 : Information

AMQ5621process_name
Severity:
0 : Information

AMQ5622q_name
Severity:
0 : Information

AMQ5623connection_name
Severity:
0 : Information

AMQ5624generic_channel_name
Severity:
0 : Information

AMQ5625generic_process_name
Severity:
0 : Information

AMQ5626generic_q_name
Severity:
0 : Information

AMQ5627qalias_name
Severity:
0 : Information

AMQ5628qmodel_name
Severity:
0 : Information

AMQ5629qlocal_name
Severity:
0 : Information

AMQ5630qremote_name
Severity:
0 : Information

AMQ5631namelist_name
Severity:
0 : Information

AMQ5632generic_namelist_name
Severity:
0 : Information

AMQ5633generic_Q_Mgr_name
Severity:
0 : Information

AMQ5634generic_cluster_name
Severity:
0 : Information

AMQ5635The argument supplied with the -l flag is not valid.
Severity:
20 : Error

Explanation:
The argument supplied with the -l flag must be in the range 1 - 4293.

Response:
Submit the command again with a valid argument.

AMQ5636cluster_name
Severity:
0 : Information

AMQ5637 (AIX)The environment variable EXTSHM is set to "ON". This is incompatable with the way WebSphere MQ uses shared memory. Reset the environment variable EXTSHM and retry the command.
Severity:
20 : Error

Explanation:
On AIX the environment variable EXTSHM causes shared memory segments to be fixed size. WebSphere MQ expects to be able to extend shared memory segments.

Response:
Reset the environment variable EXTSHM and retry the command.

AMQ5646Usage: setmqcap Processors
Severity:
0 : Information

AMQ5647Usage: dspmqcap
Severity:
0 : Information

AMQ5648Usage: dmpmqaut [-m QMgrName] [-n Profile | -l] [-t ObjType] [-p Principal | -g Group] [-s ServiceComponent] [-e | -x]
Severity:
0 : Information

Response:
None.

AMQ5649generic_authinfo_name
Severity:
0 : Information

AMQ5650authinfo_name
Severity:
0 : Information

AMQ5651qmname
Severity:
0 : Information

AMQ5652The Deferred Message process failed to connect to the WebSphere MQ queue manager for reason <insert_1>.
Severity:
30 : Severe error

Explanation:
The WebSphere MQ queue manager <insert_3> might have generated earlier messages or FFST information explaining why the deferred message process (amqzdmaa) could not connect.

Response:
Correct any configuration errors. Configuration errors that can cause this problem include badly configured CLWL Exit modules. If the problem persists contact your IBM service representative.

AMQ5653The mqm user is not defined.
Severity:
30 : Severe error

Explanation:
The system call getpwnam("mqm") failed with errno <insert_1>. The program was running as <insert_3>.

Response:
Create the mqm user as a member of the mqm group and retry the operation.

AMQ5654Usage: dspmqrte [-c] [-n] [-l Persistence] [-m QMgrName] [-o] [-p Priority]
Severity:
0 : Information

Explanation:
This shows the correct usage of the DSPMQRTE command.

Response:
None.

AMQ5655[-rq ReplyQName [-rqm ReplyQMgrName]] [-ro ReportOptions]
Severity:
0 : Information

Explanation:
This shows the correct usage of the DSPMQRTE command.

Response:
None.

AMQ5656[-xs Expiry] [-xp Pass] [-qm TargetQMgrName] [-ac [-ar]]
Severity:
0 : Information

Explanation:
This shows the correct usage of the DSPMQRTE command.

Response:
None.

AMQ5657[-d Delivery] [-f Forwarding] [-s Activities] [-t Detail]
Severity:
0 : Information

Explanation:
This shows the correct usage of the DSPMQRTE command.

Response:
None.

AMQ5658[-i CorrelId] [-b] [-v Verbosity] [-w WaitTime] -q TargetQName
Severity:
0 : Information

Explanation:
This shows the correct usage of the DSPMQRTE command.

Response:
None.

AMQ5700listener_name
Severity:
0 : Information

AMQ5701service_name
Severity:
0 : Information

AMQ5749display_cmd
Severity:
0 : Information

AMQ5750filter_keyword
Severity:
0 : Information

AMQ5751operator
Severity:
0 : Information

AMQ5752filter_value
Severity:
0 : Information

AMQ5805WebSphere MQ Publish/Subscribe broker currently running for queue manager.
Severity:
10 : Warning

Explanation:
The command was unsuccessful because queue manager <insert_3> currently has an WebSphere MQ Publish/Subscribe broker running.

Response:
None.

AMQ5806WebSphere MQ Publish/Subscribe broker started for queue manager <insert_3>.
Severity:
0 : Information

Explanation:
WebSphere MQ Publish/Subscribe broker started for queue manager <insert_3>.

Response:
None.

AMQ5807WebSphere MQ Publish/Subscribe broker for queue manager <insert_3> ended.
Severity:
0 : Information

Explanation:
The WebSphere MQ Publish/Subscribe broker on queue manager <insert_3> has ended.

Response:
None.

AMQ5808WebSphere MQ Publish/Subscribe broker for queue manager <insert_3> is already quiescing.
Severity:
10 : Warning

Explanation:
The endmqbrk command was unsuccessful because an orderly shutdown of the WebSphere MQ Publish/Subscribe broker running on queue manager <insert_3> is already in progress.

Response:
None.

AMQ5808 (iSeries)WebSphere MQ Publish/Subscribe broker is already quiescing.
Severity:
10 : Warning

Explanation:
The endmqbrk command was unsuccessful because an orderly shutdown of the broker, running on queue manager <insert_3>, is already in progress.

Response:
None.

AMQ5809WebSphere MQ Publish/Subscribe broker for queue manager <insert_3> starting.
Severity:
0 : Information

Explanation:
The dspmqbrk command has been issued to query the state of the WebSphere MQ Publish/Subscribe broker. The WebSphere MQ Publish/Subscribe broker is currently initializing.

Response:
None.

AMQ5810WebSphere MQ Publish/Subscribe broker for queue manager <insert_3> running.
Severity:
0 : Information

Explanation:
The dspmqbrk command has been issued to query the state of the WebSphere MQ Publish/Subscribe broker. The WebSphere MQ Publish/Subscribe broker is currently running.

Response:
None.

AMQ5811WebSphere MQ Publish/Subscribe broker for queue manager <insert_3> quiescing.
Severity:
0 : Information

Explanation:
The dspmqbrk command has been issued to query the state of the WebSphere MQ Publish/Subscribe broker. The WebSphere MQ Publish/Subscribe broker is currently performing a controlled shutdown.

Response:
None.

AMQ5812WebSphere MQ Publish/Subscribe broker for queue manager <insert_3> stopping.
Severity:
0 : Information

Explanation:
Either the dspmqbrk command or the endmqbrk command has been issued. The WebSphere MQ Publish/Subscribe broker is currently performing an immediate shutdown. If the endmqbrk command has been issued to request that the broker terminate, the command is unsuccessful because the broker is already performing an immediate shutdown.

Response:
None.

AMQ5813WebSphere MQ Publish/Subscribe broker for queue manager <insert_3> not active.
Severity:
0 : Information

Explanation:
An WebSphere MQ Publish/Subscribe broker administration command has been issued to query or change the state of the broker. The WebSphere MQ Publish/Subscribe broker is not currently running.

Response:
None.

AMQ5814WebSphere MQ Publish/Subscribe broker for queue manager <insert_3> ended abnormally.
Severity:
0 : Information

Explanation:
The dspmqbrk command has been issued to query the state of the WebSphere MQ Publish/Subscribe broker. The WebSphere MQ Publish/Subscribe broker has ended abnormally.

Response:
Refer to the queue manager error logs to determine why the broker ended abnormally.

AMQ5815Invalid WebSphere MQ Publish/Subscribe broker initialization file stanza for queue manager (<insert_3>).
Severity:
20 : Error

Explanation:
The broker was started using the strmqbrk command. The broker stanza in the queue manager initialization file is not valid. The broker will terminate immediately. The invalid attribute is <insert_5>.

Response:
Correct the broker stanza in the queue manager initialization file.

AMQ5815 (iSeries)Invalid WebSphere MQ Publish/Subscribe broker initialization file stanza.
Severity:
20 : Error

Explanation:
The broker was started using the strmqbrk command. The Broker stanza in the queue manager(<insert_3>) initialization file is not valid. The broker will terminate immediately. The invalid attribute is <insert_5>.

Response:
Correct the Broker stanza in the queue manager initialization file.

AMQ5815 (Windows)The WebSphere MQ Publish/Subscribe broker configuration for queue manager (<insert_3>) is not valid.
Severity:
20 : Error

Explanation:
The broker was started using the strmqbrk command. The broker configuration information is not valid. The broker will terminate immediately. The invalid attribute is <insert_5>.

Response:
Correct the broker attribute using the cfgmqbrk configuration tool.

AMQ5816Unable to open WebSphere MQ Publish/Subscribe broker control queue for reason <insert_1>,<insert_2>.
Severity:
20 : Error

Explanation:
The broker has failed to open the broker control queue (<insert_3>). The attempt to open the queue failed with completion code <insert_1> and reason <insert_2>. The most likely reasons for this error are that an application program has opened the broker control queue for exclusive access, or that the broker control queue has been defined incorrectly. The broker will terminate immediately.

Response:
Correct the problem and restart the broker.

AMQ5817An invalid stream queue has been detected by the broker.
Severity:
10 : Warning

Explanation:
WebSphere MQ has detected an attempt to use a queue (<insert_3>) as a stream queue, but the attributes of the queue make it unsuitable for use as a stream queue. The most likely reason for this error is that the queue is: (1) Not a local queue; (2) A shareable queue; (3) A temporary dynamic queue. If the queue was created using implicit stream creation, the model stream might have been defined incorrectly. The message that caused the stream to be created will be rejected or put to the dead-letter queue, depending upon the message report options and broker configuration.

Response:
Correct the problem and resubmit the request.

AMQ5818Unable to open WebSphere MQ Publish/Subscribe broker stream queue.
Severity:
10 : Warning

Explanation:
The broker has failed to open a stream queue (<insert_3>). The attempt to open the queue failed with completion code <insert_1> and reason <insert_2>. The most likely reason for this error is that an application has the queue open for exclusive access. The stream will be temporarily shut down and an attempt will be made to restart the stream after a short interval.

Response:
Correct the problem.

AMQ5819An WebSphere MQ Publish/Subscribe broker stream has ended abnormally.
Severity:
10 : Warning

Explanation:
The broker stream (<insert_3>) has ended abnormally for reason <insert_1>. The broker will attempt to restart the stream. If the stream should repeatedly fail then the broker will progressively increase the time between attempts to restart the stream.

Response:
Investigate why the problem occurred and take appropriate action to correct the problem. If the problem persists, contact your IBM service representative.

AMQ5820WebSphere MQ Publish/Subscribe broker stream (<insert_3>) restarted.
Severity:
0 : Information

Explanation:
The broker has restarted a stream that ended abnormally. This message will frequently be preceded by message AMQ5867 or AMQ5819 indicating why the stream ended.

Response:
Correct the problem.

AMQ5821WebSphere MQ Publish/Subscribe broker unable to contact parent broker.
Severity:
10 : Warning

Explanation:
The broker has been started specifying a parent broker. The broker has been unable to send a message to the parent broker (<insert_3>) for reason <insert_1>. The broker will terminate immediately.

Response:
Investigate why the problem occurred and take appropriate action to correct the problem. The problem is likely to be caused by the parent broker name not resolving to the name of a transmission queue on the local broker.

AMQ5822WebSphere MQ Publish/Subscribe broker failed to register with parent broker.
Severity:
10 : Warning

Explanation:
The broker has been started specifying a parent broker (<insert_3>). The broker attempted to register as a child of the parent broker, but received an exception response (<insert_1>) indicating that this was not possible. The broker will attempt to reregister as a child of the parent periodically. The child might not be able to process global publications or subscriptions correctly until this registration process has completed normally.

Response:
Investigate why the problem occurred and take appropriate action to correct the problem. The problem is likely to be caused by the parent broker not yet existing, or a problem with the SYSTEM.BROKER.INTER.BROKER.COMMUNICATIONS queue at the parent broker.

AMQ5823Exit path attribute invalid in WebSphere MQ Publish/Subscribe broker stanza.
Severity:
10 : Warning

Explanation:
The broker exit path attribute <insert_3> is not valid. The attribute should be specified as: <path><module name>(<function name>). The broker will terminate immediately.

Response:
Correct the problem with the attribute and restart the broker.

AMQ5824WebSphere MQ Publish/Subscribe broker exit module could not be loaded.
Severity:
10 : Warning

Explanation:
The broker exit module <insert_3> could not be loaded for reason <insert_1>:<insert_4>. The broker will terminate immediately.

Response:
Correct the problem with the broker exit module <insert_3> and restart the broker.

AMQ5825The address of the WebSphere MQ Publish/Subscribe broker exit function could not be found.
Severity:
10 : Warning

Explanation:
The address of the broker exit function <insert_4> could not be found in module <insert_3> for reason <insert_1>:<insert_5>. The broker will terminate immediately.

Response:
Correct the problem with the broker exit function <insert_4> in module <insert_3>, and restart the broker.

AMQ5826The WebSphere MQ Publish/Subscribe broker has failed to propagate a subscription to another broker.
Severity:
10 : Warning

Explanation:
The broker failed to propagate subscription to stream (<insert_4>) at broker (<insert_3>). Reason codes <insert_1> and <insert_2>. An application has either registered or deregistered a global subscription to stream (<insert_4>). The broker has attempted to propagate the subscription change to broker (<insert_3>) but the request has not been successful. The message broker will immediately attempt to refresh the state of the global subscriptions for stream (<insert_4>) at broker (<insert_3>). Until the subscription state has been successfully refreshed, messages published on stream (<insert_4>) through broker (<insert_3>) might not reach this broker.

Response:
Use the reason codes to investigate why the problem occurred and take appropriate action to correct the problem.

AMQ5827An WebSphere MQ Publish/Subscribe broker internal subscription has failed.
Severity:
10 : Warning

Explanation:
The broker failed to subscribe to stream (<insert_4>) at broker (<insert_3>) with reason codes <insert_1> and <insert_2>. Related brokers learn about each others configuration by subscribing to information published by each other. A broker has discovered that one of these internal subscriptions has failed. The broker will reissue the subscription immediately. The broker cannot function correctly without knowing some information about neighboring brokers. The information that this broker has about broker (<insert_3>) is not complete and this could lead to subscriptions and publications not being propagated around the network correctly.

Response:
Investigate why the problem occurred and take appropriate action to correct the problem. The most likely cause of this failure is a problem with the SYSTEM.BROKER.CONTROL.QUEUE at broker (<insert_3>), or a problem with the definition of the route between this broker and broker (<insert_3>).

AMQ5828WebSphere MQ Publish/Subscribe broker exit returned an ExitResponse that is not valid.
Severity:
10 : Warning

Explanation:
The broker exit returned an ExitResponse <insert_1> that is not valid. The message has been allowed to continue and an FFST has been generated that contains the entire exit parameter structure.

Response:
Correct the problem with the broker exit.

AMQ5829Usage: <insert_3> [-m QMgrName] [-p ParentQMgrName]
Severity:
0 : Information

Explanation:
This shows the correct usage.

Response:
None.

AMQ5830Usage: endmqbrk [-c | -i] [-m QMgrName]
Severity:
0 : Information

Explanation:
This shows the correct usage.

Response:
None.

AMQ5831Usage: dspmqbrk [-m QMgrName]
Severity:
0 : Information

Explanation:
This shows the correct usage.

Response:
None.

AMQ5832WebSphere MQ Publish/Subscribe broker failed to publish configuration information on SYSTEM.BROKER.ADMIN.STREAM.
Severity:
10 : Warning

Explanation:
Related brokers learn about each others configuration by subscribing to information published by each other. A broker has discovered that one of these internal publications has failed. The broker will republish the information immediately. Brokers cannot function correctly without knowing some information about neighboring brokers. The information that neighboring brokers have of this broker might not be complete and this could lead to some subscriptions and publications not being propagated around the network.

Response:
Investigate why the problem occurred and take appropriate action to correct the problem.

AMQ5833A loop has been detected in the WebSphere MQ Publish/Subscribe broker hierarchy.
Severity:
20 : Error

Explanation:
The broker, on queue manager (<insert_3>), introduced a loop in the broker hierarchy. This broker will terminate immediately.

Response:
Remove broker (<insert_3>) from the hierarchy, either by deleting the broker, or by removing knowledge of the broker's parent, using the clrmqbrk command.

AMQ5834Conflicting queue manager names in the WebSphere MQ Publish/Subscribe broker hierarchy.
Severity:
10 : Warning

Explanation:
The names of the queue managers (<insert_3>) and (<insert_4>) in the broker hierarchy both start with the same 12 characters. The first 12 characters of a broker's queue manager name should be unique to ensure that no confusion arises within the broker hierarchy, and to guarantee unique message ID allocation.

Response:
Use a queue manager naming convention that guarantees uniqueness of the first 12 characters of the queue manager name.

AMQ5835WebSphere MQ Publish/Subscribe broker failed to inform its parent of a relation for reason <insert_1>.
Severity:
0 : Information

Explanation:
The failed to notify its parent on queue manager (<insert_3>) of the relation (<insert_4>) in the broker hierarchy. The notification message will be put to the parent's dead-letter queue. A failure to notify a broker of a new relation will mean that no loop detection can be performed for the new relation.

Response:
Diagnose and correct the problem on the parent queue manager. One possible reason for this is that the parent broker does not yet exist.

AMQ5836Duplicate queue manager name located in the WebSphere MQ Publish/Subscribe hierarchy.
Severity:
0 : Information

Explanation:
Multiple instances of the queue manager name (<insert_3>) have been located. This could either be the result of a previously resolved loop in the broker hierarchy, or multiple queue managers in the broker hierarchy having the same name.

Response:
If this broker introduced a loop in the hierarchy (typically identified by message AMQ5833), this message can be ignored. It is strongly recommended that every queue manager in a broker hierarchy has a unique name. It is not recommended that multiple queue managers use the same name.

AMQ5837WebSphere MQ Publish/Subscribe broker failed to quiesce queue (<insert_3>) for reason <insert_1>.
Severity:
10 : Warning

Explanation:
When a broker is deleted, the broker's input queues are quiesced by making the queue get inhibited, and writing the contents of the queue to the dead-letter queue (depending upon the report options of the message). The broker was unable to quiesce the named queue for the reason shown. The attempt to delete the broker will fail.

Response:
Investigate why the problem occurred, take appropriate action to correct the problem, and reissue the dltmqbrk command. Likely reasons include the queue being open for input by another process, there being no dead-letter queue defined at this queue manager, or the operator setting the queue to get inhibited while the dltmqbrk command is running. If there is no dead-letter queue defined, the reason will be reported as MQRC_UNKNOWN_OBJECT_NAME. If the problem occurs because there is no dead-letter queue defined at this broker, the operator can either define a dead-letter queue, or manually empty the queue causing the problem.

AMQ5837 (iSeries)WebSphere MQ Publish/Subscribe broker failed to quiesce queue.
Severity:
10 : Warning

Explanation:
When a broker is deleted, the broker's input queues are quiesced by making the queue get inhibited, and writing the contents of the queue to the dead-letter queue (depending upon the report options of the message). The broker was unable to quiesce the queue (<insert_3>) for reason <insert_1>. The attempt to delete the broker will fail.

Response:
Investigate why the problem occurred, take appropriate action to correct the problem, and reissue the dltmqbrk command. Likely reasons include the queue being open for input by another process, there being no dead-letter queue defined at this queue manager, or the operator setting the queue to get inhibited while the dltmqbrk command is running. If there is no dead-letter queue defined, the reason will be reported as MQRC_UNKNOWN_OBJECT_NAME. If the problem occurs because there is no dead-letter queue defined at this broker, the operator can either define a dead-letter queue, or manually empty the queue causing the problem.

AMQ5838WebSphere MQ Publish/Subscribe broker cannot be deleted.
Severity:
10 : Warning

Explanation:
The broker cannot be deleted as child (<insert_3>) is still registered. A broker cannot be deleted until all other brokers that have registered as children of that broker, have deregistered as its children.

Response:
Use the clrmqbrk and dltmqbrk commands to change the broker topology so that broker (<insert_3>) is not registered as a child of the broker being deleted.

AMQ5839WebSphere MQ Publish/Subscribe broker received an unexpected inter-broker communication.
Severity:
10 : Warning

Explanation:
A broker has received an inter-broker communication that it did not expect. The message was sent by broker (<insert_3>). The message will be processed according to the report options in that message. The most likely reason for this message is that the broker topology has been changed while inter-broker communication messages were in transit (for example, on a transmission queue) and that a message relating to the previous broker topology has arrived at a broker in the new topology. This message may be accompanied by an informational FFST including details of the unexpected communication.

Response:
If the broker topology has changed and the broker named in the message is no longer related to the broker issuing this message, this message can be ignored. If the clrmqbrk command was issued to unilaterally remove knowledge of broker (<insert_3>) from this broker, the clrmqbrk command should also be used to remove knowledge of this broker from broker (<insert_3>). If the clrmqbrk command was issued to unilaterally remove knowledge of this broker from broker (<insert_3>), the clrmqbrk command should also be used to remove knowledge of broker (<insert_3>) at this broker.

AMQ5840WebSphere MQ Publish/Subscribe broker unable to delete queue.
Severity:
10 : Warning

Explanation:
The broker has failed to delete the queue (<insert_3>) for reason <insert_2>. The broker typically attempts to delete queues during dltmqbrk processing, in which case the dltmqbrk command will fail.

Response:
The most likely reason for this error is that some other process has the queue open. Determine why the queue cannot be deleted, remove the inhibitor, and retry the failed operation. In a multi-broker environment, it is likely that a message channel agent might have queues open, which the broker needs to delete for a dltmqbrk command to complete.

AMQ5841WebSphere MQ Publish/Subscribe broker (<insert_3>) deleted.
Severity:
0 : Information

Explanation:
The broker (<insert_3>) has been deleted using the dltmqbrk command.

Response:
None.

AMQ5842WebSphere MQ Publish/Subscribe broker (<insert_3>) cannot be deleted for reason <insert_1>:<insert_5>.
Severity:
20 : Error

Explanation:
An attempt has been made to delete the broker (<insert_3>) but the request has failed for reason <insert_1>:<insert_5>.

Response:
Determine why the dltmqbrk command cannot complete successfully. The message logs for the queue manager might contain more detailed information on why the broker cannot be deleted. Resolve the problem that is preventing the command from completing and reissue the dltmqbrk command.

AMQ5842 (iSeries)WebSphere MQ Publish/Subscribe broker cannot be deleted.
Severity:
20 : Error

Explanation:
An attempt has been made to delete the WebSphere MQ Publish/Subscribe broker (<insert_3>) but the request has failed for reason <insert_1>:<insert_5>.

Response:
Determine why the dltmqbrk command cannot complete successfully. The message logs for the queue manager might contain more detailed information on why the broker cannot be deleted. Resolve the problem that is preventing the command from completing and reissue the dltmqbrk command.

AMQ5843WebSphere MQ Publish/Subscribe broker (<insert_3>) cannot be started as it is partially deleted.
Severity:
10 : Warning

Explanation:
An attempt has been made to start a broker that is in a partially deleted state. An earlier attempt to delete the broker has failed. The broker deletion must be completed before the broker will be allowed to restart. When broker deletion is successful, message AMQ5841 is issued, indicating that the broker has been deleted. If this message is not received on completion of a dltmqbrk command, the broker deletion has not been completed and the command will have to be reissued.

Response:
Investigate why the earlier attempt to delete the broker failed. Resolve the problem and reissue the dltmqbrk command.

AMQ5843 (iSeries)WebSphere MQ Publish/Subscribe broker cannot be started as it is partially deleted.
Severity:
10 : Warning

Explanation:
An attempt has been made to start the broker <insert_3> that is in a partially deleted state. An earlier attempt to delete the broker has failed. The broker deletion must be completed before the broker will be allowed to restart. When broker deletion is successful, message AMQ5841 is issued, indicating that the broker has been deleted. If this message is not received on completion of a dltmqbrk command, the broker deletion has not been completed and the command will have to be reissued.

Response:
Investigate why the earlier attempt to delete the broker failed. Resolve the problem and reissue the dltmqbrk command.

AMQ5844The relation between two WebSphere MQ Publish/Subscribe brokers is unknown.
Severity:
10 : Warning

Explanation:
The clrmqbrk command has been issued in an attempt to remove a brokers knowledge of a relation of that broker. The relative (<insert_4>) is unknown at broker (<insert_3>). If the "-p" flag was specified, the broker does not currently have a parent. If the "-c" flag was specified, the broker does not recognize the named child.

Response:
Investigate why the broker is unknown.

AMQ5845Usage: dltmqbrk -m QMgrName
Severity:
0 : Information

Explanation:
This shows the correct usage.

Response:
None.

AMQ5846Usage: clrmqbrk -p | -c ChildQMgrName -m QMgrName
Severity:
0 : Information

Explanation:
This shows the correct usage.

Response:
None.

AMQ5847WebSphere MQ Publish/Subscribe broker (<insert_3>) has removed knowledge of relation (<insert_4>).
Severity:
0 : Information

Explanation:
The clrmqbrk command has been used to remove knowledge of broker (<insert_4>) from broker (<insert_3>).

Response:
None.

AMQ5847 (iSeries)WebSphere MQ Publish/Subscribe broker relation removed.
Severity:
0 : Information

Explanation:
The clrmqbrk command has been used to remove knowledge of broker (<insert_4>) from broker (<insert_3>).

Response:
None.

AMQ5848WebSphere MQ Publish/Subscribe broker (<insert_3>) has failed to remove references to relation (<insert_4>) for reason <insert_1>:<insert_5>.
Severity:
20 : Error

Explanation:
An attempt has been made to remove references to broker (<insert_4>) from broker (<insert_3>) using the clrmqbrk command, but the request has been unsuccessful.

Response:
Determine why the clrmqbrk command cannot complete successfully. The message logs for the queue manager might contain more detailed information on why the broker cannot be deleted. Resolve the problem that is preventing the command from completing and then reissue the clrmqbrk command.

AMQ5848 (iSeries)WebSphere MQ Publish/Subscribe broker has failed to remove references to a related broker.
Severity:
20 : Error

Explanation:
An attempt has been made to remove references to broker (<insert_4>) from broker (<insert_3>) using the clrmqbrk command, but the request has been unsuccessful for reason <insert_1>:<insert_5>.

Response:
Determine why the clrmqbrk command cannot complete successfully. The message logs for the queue manager might contain more detailed information on why the broker cannot be deleted. Resolve the problem that is preventing the command from completing and then reissue the clrmqbrk command.

AMQ5849WebSphere MQ Publish/Subscribe broker may not change parent.
Severity:
10 : Warning

Explanation:
An attempt has been made to start broker (<insert_3>), nominating broker (<insert_4>) as its parent. The broker (<insert_3>) has previously been started, nominating broker (<insert_5>) as its parent. The strmqbrk command cannot be used to change an existing relationship.

Response:
Do not attempt to change the broker topology by using the strmqbrk command. The dltmqbrk and clrmqbrk commands are the only supported means of changing the broker topology. Refer to the documentation of those commands for guidance on changing the broker topology.

AMQ5850WebSphere MQ Publish/Subscribe broker interrupted while creating queue.
Severity:
10 : Warning

Explanation:
The broker was interrupted while creating queue (<insert_3>) for user ID (<insert_4>). When the broker creates a queue, it first creates the queue with default security attributes and it then sets the appropriate security attributes for the queue. If the broker should be interrupted during this operation (for example the queue manager is shut down), the broker cannot reliably detect that the security attributes have not been set correctly. The broker was creating a queue, but was interrupted before it could complete creation of the queue and setting the initial authority. If the interrupt occurred before the initial authority of the queue could be set, it might be necessary for the operator to set the appropriate authorities using the setmqaut command.

Response:
Confirm that the named queue has the appropriate security attributes and modify them as necessary.

AMQ5851WebSphere MQ Publish/Subscribe broker interrupted while creating internal queue.
Severity:
10 : Warning

Explanation:
The broker was interrupted while creating internal queue (<insert_3>) for user ID (<insert_4>). When the broker creates an internal queue, it first creates the queue with default security attributes and it then sets the appropriate security attributes for the queue. If the broker should be interrupted during this operation (for example the queue manager is shut down), the broker attempts to delete and redefine the queue. If the internal queue is available to users (for example, the default stream or the administration stream), it is possible that a user will put a message on the queue while it is in this invalid state, or that a user application has the queue open. In this situation the broker does not automatically redefine the queue and cannot be restarted until the queue has been emptied or closed.

Response:
Examine any messages on the named queue and take appropriate action to remove them from the queue. Ensure that no applications have the queue open.

AMQ5852WebSphere MQ Publish/Subscribe broker failed to propagate delete publication command.
Severity:
0 : Information

Explanation:
The broker failed to propagate delete publication command for stream (<insert_3>) to related broker (<insert_4>) for reason <insert_1>. When an application issues a delete publication command to delete a global publication, the command has to be propagated to all brokers in the sub-hierarchy supporting the stream. The broker reporting the error has failed to forward a delete publication command to a related broker (<insert_4>) who supports stream (<insert_3>). Delete publication commands are propagated without MQRO_DISCARD_MSG and the command message might have been written to a dead-letter queue. The topic for which the delete publication has failed is (<insert_5>).

Response:
If the delete publication has failed because the stream has been deleted at the related broker, this message can be ignored. Investigate why the delete publication has failed and take the appropriate action to recover the failed command.

AMQ5853WebSphere MQ Publish/Subscribe failed to propagate a delete publication command.
Severity:
0 : Information

Explanation:
The broker failed to propagate a delete publication command for stream (<insert_3>) to a previously related broker. When an application issues a delete publication command to delete a global publication, the command is propagated to all brokers in the sub-hierarchy supporting the stream. The broker topology was changed after deleting the publication, but before a broker removed by the topology change processed the propagated delete publication message. The topic for which the delete publication has failed is (<insert_5>).

Response:
It is the user's responsibility to quiesce broker activity before changing the broker topology using the clrmqbrk command. Investigate why this delete publication activity was not quiesced. The delete publication command will have been written to the dead-letter queue at the broker that was removed from the topology. In this case, further action might be necessary to propagate the delete publication command that was not quiesced before the clrmqbrk command was issued. If this message occurs as a result of the dltmqbrk command, the publication will have been deleted as a result of the dltmqbrk command, and the delete publication message will have been written to the dead-letter queue at the queue manager where the broker was deleted. In this case the delete publication message on the dead-letter queue can be discarded.

AMQ5854WebSphere MQ Publish/Subscribe broker failed to propagate a delete publication command.
Severity:
0 : Information

Explanation:
When an application issues a delete publication command to delete a global publication, the command has to be propagated to all brokers in the sub-hierarchy supporting the stream. At the time the delete publication was propagated, broker (<insert_4>) was a known relation of this message broker supporting stream (<insert_3>). Before the delete publication command arrived at the related broker, the broker topology was changed so that broker (<insert_4>) no longer supported stream (<insert_3>). The topic for which the delete publication has failed is (<insert_5>).

Response:
It is the user's responsibility to quiesce broker activity before changing the stream topology of the broker. Investigate why this delete publication activity was not quiesced. The delete publication command will have been written to the dead-letter queue at broker (<insert_4>).

AMQ5855WebSphere MQ Publish/Subscribe broker ended.
Severity:
10 : Warning

Explanation:
An attempt has been made to run the broker (<insert_3>) but the broker has ended for reason <insert_1>:<insert_5>.

Response:
Determine why the broker ended. The message logs for the queue manager might contain more detailed information on why the broker cannot be started. Resolve the problem that is preventing the command from completing and reissue the strmqbrk command.

AMQ5856Broker publish command message cannot be processed. Reason code <insert_1>.
Severity:
10 : Warning

Explanation:
The WebSphere MQ Publish/Subscribe broker failed to process a publish message for stream (<insert_3>). The broker was unable to write the publication to the dead-letter queue and was not permitted to discard the publication. The broker will temporarily stop the stream and will restart the stream and consequently retry the publication after a short interval.

Response:
Investigate why the error has occurred and why the publication cannot be written to the dead-letter queue. Either manually remove the publication from the stream queue, or correct the problem that is preventing the broker from writing the publication to the dead-letter queue.

AMQ5857Broker control command message cannot be processed. Reason code <insert_1>.
Severity:
10 : Warning

Explanation:
The WebSphere MQ Publish/Subscribe broker failed to process a command message on the SYSTEM.BROKER.CONTROL.QUEUE. The broker was unable to write the command message to the dead-letter queue and was not permitted to discard the command message. The broker will temporarily stop the stream and will restart the stream and consequently retry the command message after a short interval. Other broker control commands cannot be processed until this command message has been processed successfully or removed from the control queue.

Response:
Investigate why the error has occurred and why the command message cannot be written to the dead-letter queue. Either, manually remove the command message from the stream queue, or correct the problem that is preventing the broker from writing the command message to the dead-letter queue.

AMQ5858Broker could not send publication to subscriber queue.
Severity:
10 : Warning

Explanation:
A failure has occurred sending a publication to subscriber queue (<insert_4>) at queue manager (<insert_3>) for reason <insert_1>. The broker configuration options prevent it from recovering from this failure by discarding the publication or by sending it to the dead-letter queue. Instead the broker will back out the unit of work under which the publication is being sent and retry the failing command message a fixed number of times. If the problem still persists, the broker will then attempt to recover by failing the command message with a negative reply message. If the issuer of the command did not request negative replies, the broker will either discard or send to the dead-letter queue the failing command message. If the broker configuration options prevent this, the broker will restart the affected stream, which will reprocess the failing command message again. This behavior will be repeated until such time as the failure is resolved. During this time the stream will be unable to process further publications or subscriptions.

Response:
Usually the failure will be due to a transient resource problem, for example, the subscriber queue, or an intermediate transmission queue, becoming full. Use reason code <insert_1> to determine what remedial action is required. If the problem persists for a long time, you will notice the stream being continually restarted by the broker. Evidence of this occurring will be a large number of AMQ5820 messages, indicating stream restart, being written to the error logs. In such circumstances, manual intervention will be required to allow the broker to dispose of the failing publication. To do this, you will need to end the broker using the endmqbrk command and restart it with appropriate disposition options. This will allow the publication to be sent to the rest of the subscribers, while allowing the broker to discard or send to the dead-letter queue the publication that could not be sent.

AMQ5859WebSphere MQ Publish/Subscribe broker stream is terminating due to an internal resource problem.
Severity:
10 : Warning

Explanation:
The broker stream (<insert_3>) has run out of internal resources and will terminate with reason code <insert_1>. If the command in progress was being processed under syncpoint control, it will be backed out and retried when the stream is restarted by the broker. If the command was being processed out of syncpoint control, it will not be able to be retried when the stream is restarted.

Response:
This message should only be issued in very unusual circumstances. If this message is issued repeatedly for the same stream, and the stream is not especially large in terms of subscriptions, topics, and retained publications, save all generated diagnostic information and contact your IBM Support Center for problem resolution.

AMQ5862WebSphere MQ Publish/Subscribe broker for queue manager <insert_3> migrating.
Severity:
0 : Information

Explanation:
The dspmqbrk command has been issued to query the state of the broker. The broker is currently being migrated.

Response:
None.

AMQ5863WebSphere MQ Integrator broker not ready for migration. See message logs for guidance.
Severity:
10 : Warning

Explanation:
The migmqbrk command was unsuccessful because the WebSphere MQ Integrator broker was not ready to accept messages. The state of the WebSphere MQ Publish/Subscribe message broker is exported to the WebSphere MQ Integrator broker in a series of messages sent to queue SYSTEM.BROKER.INTERBROKER.QUEUE. Before migration commences the WebSphere MQ Publish/Subscribe broker checks whether the WebSphere MQ Integrator broker is ready to accept messages on this queue. This check has failed for reason <insert_1> so migration has been abandoned.

Response:
Reason code <insert_1> should be used to determine the nature of the problem. A value of 1 means that queue SYSTEM.BROKER.INTERBROKER.QUEUE does not exist. This is probably because no WebSphere MQ Integrator broker has been defined yet on this queue manager. A value of 2 means that the WebSphere MQ Integrator broker does not have the queue open probably because it hasn't been started or the first message flow has yet to be deployed for it. If both of these steps have been taken then the WebSphere MQ Integrator broker may have been created incorrectly. In particular, it should have been created in migration mode. If the broker was not created with the migration flag set then it will need to be deleted and recreated before migration can commence. Any other value in the reason code will need to be reported to your IBM Support Center for problem resolution. Note that until the problem has been resolved the WebSphere MQ Publish/Subscribe broker can still be restarted with the the strmqbrk command.

AMQ5864Broker reply message could not be sent. The command will be retried.
Severity:
10 : Warning

Explanation:
While processing a publish/subscribe command, the WebSphere MQ Publish/Subscribe broker could not send a reply message to queue (<insert_4>) at queue manager (<insert_3>) for reason <insert_1>. The broker was also unable to write the message to the dead-letter queue. Since the command is being processed under syncpoint control, the broker will attempt to retry the command in the hope that the problem is only of a transient nature. If, after a set number of retries, the reply message still could not be sent, the command message will be discarded if the report options allow it. If the command message is not discardable, the stream will be restarted, and processing of the command message recommenced.

Response:
Use reason code <insert_1> to determine what remedial action is required. If the failure is due to a resource problem (for example, a queue being full), you might find that the problem has already cleared itself. If not, this message will be issued repeatedly each time the command is retried. In this case you are strongly advised to define a dead-letter queue to receive the reply message so that the broker can process other commands while the problem is being investigated. Check the application from which the command originated and ensure that it is specifying its reply-to queue correctly.

AMQ5865Broker reply message could not be sent.
Severity:
10 : Warning

Explanation:
While processing a publish/subscribe command, the WebSphere MQ Publish/Subscribe broker could not send a reply message to queue (<insert_4>) at queue manager (<insert_3>) for reason <insert_1>. The broker was also unable to write the message to the dead-letter queue. As the command is not being processed under syncpoint control, the broker is not able to retry the command.

Response:
Use reason code <insert_1> to determine what remedial action is required. If the failure is due to a resource problem (for example, a queue being full), you might find that the problem has already cleared itself. If not, check the application from which the command originated and ensure that it is specifying its reply-to queue correctly. You might find that defining a dead-letter queue to capture the reply message on a subsequent failure will help you with this task.

AMQ5866Broker command message has been discarded. Reason code <insert_1>.
Severity:
10 : Warning

Explanation:
The WebSphere MQ Publish/Subscribe broker failed to process a publish/subscribe command message, which has now been discarded. The broker will begin to process new command messages again.

Response:
Look for previous error messages to indicate the problem with the command message. Correct the problem to prevent the failure from happening again.

AMQ5867WebSphere MQ Publish/Subscribe broker stream has ended abnormally.
Severity:
10 : Warning

Explanation:
The broker stream (<insert_3>) has ended abnormally for reason <insert_1>. The broker will attempt to restart the stream. If the stream should repeatedly fail, the broker will progressively increase the time between attempts to restart the stream.

Response:
Use the reason code <insert_1> to investigate why the problem occurred. A reason code of 1 indicates that the stream ended because a command message could not be processed successfully. Look in the error logs for earlier messages to determine the reason why the command message failed. A reason code of 2 indicates that the stream ended because the broker exit could not be loaded. Until the problem with the broker exit has been resolved, the stream will continue to fail.

AMQ5868User is no longer authorized to subscribe to stream.
Severity:
0 : Information

Explanation:
The broker has attempted to publish a publication to a subscriber, but the subscriber no longer has browse authority to stream queue (<insert_4>). The publication is not sent to the subscriber and his subscription is deregistered. An event publication containing details of the subscription that was removed is published on SYSTEM.BROKER.ADMIN.STREAM. While user ID (<insert_3>) remains unauthorized, the broker will continue to deregister subscriptions associated with that user ID.

Response:
If the authority of user ID (<insert_3>) was intentionally removed, consider removing all of that user IDs subscriptions immediately by issuing an MQCMD_DEREGISTER_SUBSCRIBER command, specifying the MQREGO_DEREGISTER_ALL option on the subscriber's behalf. If the authority was revoked accidentally, reinstate it, but be aware that some, if not all, of the subscriber's subscriptions will have been deregistered by the broker.

AMQ5869WebSphere MQ Publish/Subscribe broker is checkpointing registrations.
Severity:
0 : Information

Explanation:
A large number of changes have been made to the publisher and subscriber registrations of stream (<insert_3>). These changes are being checkpointed, in order to minimize both stream restart time and the amount of internal queue space being used.

Response:
None.

AMQ5870(Unexpected Error)
Severity:
0 : Information

Explanation:
N/A

Response:
N/A

AMQ5871(Resource Problem)
Severity:
0 : Information

Explanation:
N/A

Response:
N/A

AMQ5872(WebSphere MQ Publish/Subscribe broker has a known child)
Severity:
0 : Information

Explanation:
N/A

Response:
N/A

AMQ5873(WebSphere MQ Publish/Subscribe broker active)
Severity:
0 : Information

Explanation:
N/A

Response:
N/A

AMQ5874(One or more queues could not be quiesced)
Severity:
0 : Information

Explanation:
N/A

Response:
N/A

AMQ5875WebSphere MQ Publish/Subscribe broker cannot write a message to the dead-letter queue.
Severity:
10 : Warning

Explanation:
The broker attempted to put a message to the dead-letter queue (<insert_3>) but the message could not be written to the dead-letter queue for reason <insert_1>:<insert_4>. The message was being written to the dead-letter queue with a reason of <insert_2>:<insert_5>.

Response:
Determine why the message cannot be written to the dead-letter queue. Also, if the message was not deliberately written to the dead-letter queue, for example by a message broker exit, determine why the message was written to the dead-letter queue and resolve the problem that is preventing the message from being sent to its destination.

AMQ5876A parent conflict has been detected in the WebSphere MQ Publish/Subscribe broker hierarchy.
Severity:
20 : Error

Explanation:
The broker (<insert_3>) has been started, naming this broker as its parent. This broker was started naming broker (<insert_3>) as its parent. The broker will send an exception message to broker (<insert_3>) indicating that a conflict has been detected. The most likely reason for this message is that the broker topology has been changed while inter-broker communication messages were in transit (for example, on a transmission queue) and that a message relating to the previous broker topology has arrived at a broker in the new topology. This message may be accompanied by an informational FFST including details of the unexpected communication.

Response:
If the broker topology has changed and the broker named in the message no longer identifies this broker as its parent, this message can be ignored - for example, if the command "clrmqbrk -m <insert_3> -p" was issued. If broker (<insert_3>) has been defined as this broker's parent, and this broker has been defined as broker (<insert_3>)'s parent, the clrmqbrk or the dltmqbrk commands should be used to resolve the conflict.

AMQ5877WebSphere MQ Publish/Subscribe broker stream has ended abnormally.
Severity:
10 : Warning

Explanation:
A broker stream (<insert_3>) has ended abnormally for reason <insert_1>. The broker recovery routines failed to reset the stream state and the stream cannot be restarted automatically.

Response:
Investigate why the stream failed and why the broker's recovery routine could not recover following the failure. Take appropriate action to correct the problem. Depending upon the broker configuration and the nature of the problem it will be necessary to restart either the broker,or both the queue manager and the broker, to make the stream available. If the problem persists contact your IBM service representative.

AMQ5878WebSphere MQ Publish/Subscribe broker recovery failure detected.
Severity:
10 : Warning

Explanation:
An earlier problem has occurred with the broker, and either a stream has been restarted or the broker has been restarted. The restarted stream or broker has detected that the previous instance of the stream or broker did not clean up successfully and the restart will fail.

Response:
Investigate the cause of the failure that caused a stream or broker restart to be necessary, and why the broker or stream was unable to clean up its resources following the failure. When the broker processes with a non trusted routing exit (RoutingExitConnectType=STANDARD), the broker runs in a mode where it is more tolerant of unexpected failures and it is likely that the restart will succeed after a short delay. In the case of a stream restart, the broker will normally periodically retry the failing restart. In the case of a broker restart, it will be necessary to manually retry the broker restart after a short delay. When the broker processes without a routing exit, or with a trusted routine exit (RoutingExitConnectType=FASTPATH), the broker runs in a mode where it is less tolerant of unexpected failures and a queue manager restart will be necessary to resolve this problem. When the broker is running in this mode, it is important that the broker processes are not subjected to unnecessary asynchronous interrupts, for example, kill. If the problem persists, contact your IBM service representative.

AMQ5879WebSphere MQ Publish/Subscribe broker has been migrated.
Severity:
10 : Warning

Explanation:
The command was unsuccessful because the MQ Pub/Sub broker at queue manager <insert_3> has been migrated. After migration the only command which can be issued against the migrated broker is the dltmqbrk command.

Response:
Issue the dltmqbrk command to delete the migrated broker.

AMQ5880User is no longer authorized to subscribe to stream.
Severity:
0 : Information

Explanation:
The broker has attempted to publish a publication to a subscriber but the subscriber no longer has altusr authority to stream queue (<insert_4>). The publication is not sent to the subscriber and that user IDs subscription is deregistered. An event publication containing details of the subscription that was removed is published on SYSTEM.BROKER.ADMIN.STREAM. While user ID (<insert_3>) remains unauthorized, the broker will continue to deregister subscriptions associated with that user ID.

Response:
If the authority of user ID (<insert_3>) was intentionally removed, consider removing subscriptions immediately by issuing an MQCMD_DEREGISTER_SUBSCRIBER command for the appropriate topics on the subscriber's behalf. If the authority was revoked accidentally, reinstate it, but be aware that some, if not all, of the subscriber's subscriptions will have been deregistered by the broker.

AMQ5881The WebSphere MQ Publish/Subscribe broker configuration parameter combination <insert_1> is not valid.
Severity:
20 : Error

Explanation:
A combination of Broker stanzas in the queue manager initialization file is not valid. The broker will not operate until this has been corrected. 
An combination of (1) indicates that SyncPointIfPersistent has been set to TRUE and DiscardNonPersistentInputMsg has been set to FALSE. DiscardNonPersistentInputMsg must be set to TRUE when SyncPointIfPersistent is set to TRUE. 
An combination of (2) indicates that SyncPointIfPersistent has been set to TRUE and DiscardNonPersistentResponse has been set to FALSE. DiscardNonPersistentResponse must be set to TRUE when SyncPointIfPersistent is set to TRUE. 
An combination of (3) indicates that SyncPointIfPersistent has been set to TRUE and DiscardNonPersistentPublication has been set to FALSE. DiscardNonPersistentPublication must be set to TRUE when SyncPointIfPersistent is set to TRUE.

Response:
Alter the message broker stanzas to comply with the above rules and retry the command.

AMQ5881 (Windows)The WebSphere MQ Publish/Subscribe broker configuration parameter combination <insert_1> is not valid.
Severity:
20 : Error

Explanation:
A combination of Broker parameters in the broker configuration information is not valid. The broker will not operate until this has been corrected. 
An combination of (1) indicates that SyncPointIfPersistent has been set to TRUE and DiscardNonPersistentInputMsg has been set to FALSE. DiscardNonPersistentInputMsg must be set to TRUE when SyncPointIfPersistent is set to TRUE. 
An combination of (2) indicates that SyncPointIfPersistent has been set to TRUE and DiscardNonPersistentResponse has been set to FALSE. DiscardNonPersistentResponse must be set to TRUE when SyncPointIfPersistent is set to TRUE. 
An combination of (3) indicates that SyncPointIfPersistent has been set to TRUE and DiscardNonPersistentPublication has been set to FALSE. DiscardNonPersistentPublication must be set to TRUE when SyncPointIfPersistent is set to TRUE.

Response:
Alter the message broker configuration information using the cfgmqbrk tool to comply with the above rules and retry the command.

AMQ5882WebSphere MQ Publish/Subscribe broker has written a message to the dead-letter queue.
Severity:
10 : Warning

Explanation:
The broker has written a message to the dead-letter queue (<insert_3>) for reason <insert_1>:<insert_5>. Note. To save log space, after the first occurrence of this message for stream (<insert_4>), it will only be written periodically.

Response:
If the message was not deliberately written to the dead-letter queue, for example by a message broker exit, determine why the message was written to the dead-letter queue, and resolve the problem that is preventing the message from being sent to its destination.

AMQ5883WebSphere MQ Publish/Subscribe broker state not recorded.
Severity:
10 : Warning

Explanation:
The broker state on stream (<insert_3>) not recorded while processing a publication outside of syncpoint. A nonpersistent publication has requested a change to either a retained message or a publisher registration. This publication is being processed outside of syncpoint because the broker has been configured with the SyncPointIfPersistent option set. A failure has occurred hardening either the publisher registration or the retained publication to the broker's internal queue. All state changes attempted as a result of this publication will be backed-out. Processing of the publication will continue and the broker will attempt to deliver it to all subscribers.

Response:
Investigate why the failure occurred. It is probably due to a resource problem occurring on the broker. The most likely cause is 'queue full' on a broker queue. If your publications also carry state changes, you are advised to send them either as persistent publications or turn off the SyncPointIfPersistent option. In this way, they will be carried out under syncpoint and the broker can retry them in the event of a failure such as this.

AMQ5884WebSphere MQ Publish/Subscribe broker control queue is not a local queue.
Severity:
10 : Warning

Explanation:
WebSphere MQ Publish/Subscribe has detected that the queue 'SYSTEM.BROKER.CONTROL.QUEUE' exists and is not a local queue. This makes the queue unsuitable for use as the control queue of the broker. The broker will terminate immediately.

Response:
Delete the definition of the existing queue and, if required, re-create the queue to be of type MQQT_LOCAL. If you do not re-create the queue the broker will automatically create one of the correct type when started.

AMQ5885Usage: migmqbrk -m QMgrName
Severity:
0 : Information

Explanation:
This shows the correct usage.

Response:
None.

AMQ5886WebSphere MQ Publish/Subscribe broker is being migrated.
Severity:
10 : Warning

Explanation:
The command cannot be issued at this time because the MQ Pub/Sub broker at queue manager <insert_3> is being migrated.

Response:
Once migration has commenced then the only command which can be issued against the MQ Pub/Sub broker is the endmqbrk command to cancel the migration. Once the broker has ended if migration did not complete then it can be reattempted using the migmqbrk command again. Alternatively it can be cancelled by restarting the broker using the strmqbrk command.

AMQ5887Migration started for stream <insert_3>
Severity:
0 : Information

Explanation:
Migration of stream <insert_3> has started.

Response:
None.

AMQ5888Migration complete for stream <insert_3>
Severity:
0 : Information

Explanation:
All of the state of stream <insert_3> has been exported to the WebSphere MQ Integrator broker.

Response:
None.

AMQ5889WebSphere MQ Publish/Subscribe broker has been successfully migrated. It will now be deleted.
Severity:
0 : Information

Explanation:
Migration of the broker has completed successfully. The broker will now be deleted.

Response:
The broker is no longer startable. If deletion fails then the dltmqbrk command will need to be re-issued to complete its deletion.

AMQ5890The migration of the WebSphere MQ Publish/Subscribe broker has failed.
Severity:
10 : Warning

Explanation:
The WebSphere MQ Publish/Subscribe broker is being migrated. During this migration all persistent state, for example subscriptions, are exported to the WebSphere MQ Integrator broker as a series of messages sent to queue <insert_3>. A migration message could not be written to this queue for reason <insert_1>.

Response:
Use the MQPUT failure code <insert_1> to determine why the message cannot be written to the queue. The reason code could indicate that the queue manager is terminating in which case the migmqbrk command will need to be re-issued after the queue manager has restarted. Alternatively there may be a problem with the queue which may need to be rectified before migration can be attempted again.

AMQ5891WebSphere MQ Publish/Subscribe broker has failed to receive a reply while exporting its state to WebSphere MQ Integrator.
Severity:
10 : Warning

Explanation:
The WebSphere MQ Publish/Subscribe broker is being migrated. During this migration all persistent state, for example subscriptions, are exported to the WebSphere MQ Integrator broker as a series of messages. A reply message for one of the migration messages could not be retrieved from queue <insert_3> for reason <insert_1>. The migration of the WebSphere MQ Publish/Subscribe broker has failed.

Response:
Use the MQGET failure code <insert_3> to determine why the reply message could not be received from the reply queue. The reason code could indicate that the queue manager is terminating in which the migmqbrk command will need to be re-issued after the queue manager has restarted. A reason code of 2033 indicates that no reply message was received within a 30 second wait interval. In this case the problem is more likely to have occurred at the WebSphere MQ Integrator broker. Check for error messages issued at the WebSphere MQ Integrator broker.

AMQ5892Migration of stream <insert_3> has failed for reason <insert_1>:<insert_4>.
Severity:
0 : Information

Explanation:
Migration of stream <insert_3> has failed.

Response:
Use reason code <insert_1> to investigate the reason for the failure. Once the problem has been resolved, re-issue the migmqbrk command to retry migration.

AMQ5892 (iSeries)Migration of stream <insert_3> has failed.
Severity:
0 : Information

Explanation:
Migration of stream <insert_3> has failed for reason <insert_1>:<insert_4>.

Response:
Use reason code <insert_1> to investigate the reason for the failure. Once the problem has been resolved, re-issue the migmqbrk command to retry migration.

AMQ5893WebSphere MQ Publish/Subscribe broker (<insert_3>) cannot be migrated for reason <insert_1>:<insert_5>.
Severity:
20 : Error

Explanation:
An attempt has been made to migrate the WebSphere MQ Publish/Subscribe broker (<insert_3>) but the request has failed for reason <insert_1>:<insert_5>.

Response:
Determine why the migmqbrk command cannot complete successfully. The message logs for the queue manager might contain more detailed information outlining why the broker cannot be migrated. Resolve the problem that is preventing the command from completing and reissue the migmqbrk command.

AMQ5893 (iSeries)WebSphere MQ Publish/Subscribe broker cannot be migrated.
Severity:
20 : Error

Explanation:
An attempt has been made to migrate the broker (<insert_3>) but the request has failed for reason <insert_1>:<insert_5>.

Response:
Determine why the migmqbrk command cannot complete successfully. The message logs for the queue manager might contain more detailed information outlining why the broker cannot be migrated. Resolve the problem that is preventing the command from completing and reissue the migmqbrk command.

AMQ5894WebSphere MQ Publish/Subscribe broker cannot be migrated.
Severity:
10 : Warning

Explanation:
The WebSphere MQ Publish/Subscribe broker cannot be migrated yet because the state of stream <insert_3> is not consistent with respect to related broker <insert_4>. While an WebSphere MQ Publish/Subscribe broker is being migrated a check is made to ensure that the state of each stream is consistent with respect to all of the broker's relations. This check has failed because an inconsistency has been detected in the state of stream <insert_3> with respect to broker <insert_4>. The problem will most likely be of a transient nature, caused because the WebSphere MQ Publish/Subscribe broker has yet to complete processing a recent change to the topology of the broker network. For example, the stream in question may have recently been created or deleted at related broker <insert_4> and this broker has yet to complete its processing for this change. Another cause maybe that either this broker, or broker <insert_4>, have just been added into the broker network and subscriptions have yet to be exchanged the two brokers. If this is the case then the brokers will be inconsistent with respect to all streams. If no recent topology changes have been made then there maybe a current failure with the propagation of subscriptions to broker <insert_4>.

Response:
In all cases migration of the WebSphere MQ Publish/Subscribe broker will need to be suspended until the inconsistency has been resolved. You will need to restart the broker using the strmqbrk command so that it can resolve the problem. After a short while, the broker can be ended and migration reattempted. If repeated attempts to migrate the broker all fail with this message then try to resolve the underlying problem. Look for earlier occurrences of message AMQ5826 and follow the guidance given there. In all cases ensure that the channels between the two brokers are running.

AMQ5895WebSphere MQ Publish/Subscribe broker cannot be migrated.
Severity:
10 : Warning

Explanation:
A topic has been detected which cannot be exported to the WebSphere MQ Integrator broker. The topic <insert_3> cannot be migrated because it contains wildcard characters recognised by the WebSphere MQ Integrator broker. The wildcard characters used by WebSphere MQ Integrator are the '+' and the '#' characters. The state associated with the topic isn't migrated and migration of the WebSphere MQ Publish/Subscribe broker fails.

Response:
The WebSphere MQ Publish/Subscribe broker cannot be migrated while topic <insert_3> is in use. All applications using topics which contain either the '+' or '#' characters will need to be redesigned to use different topic strings. Note that the amqspsd sample can be used to dump the state of the WebSphere MQ Publish/Subscribe broker. Within the dump produced by this program locate topic <insert_3> to determine information about the publishing or subscribing applications concerned. Until the problem has been resolved the WebSphere MQ Publish/Subscribe broker can be restarted as normal using the strmqbrk command.

AMQ5896Unknown attribute for WebSphere MQ Publish/Subscribe broker configuration parameter GroupId.
Severity:
20 : Error

Explanation:
The broker has attempted to create stream <insert_4> belonging to group <insert_3>, this group is unknown.

Response:
Modify the attribute for broker configuration parameter GroupId, to a group that exists, or create the group <insert_3>.


6000-6999 - Common services
See Reading a message for an explanation of how to interpret these messages.

AMQ6004An error occurred during WebSphere MQ initialization or ending.
Severity:
30 : Severe error

Explanation:
An error was detected during initialization or ending of MQ. The MQ error recording routine has been called.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ6005 (iSeries)An error occurred during WebSphere MQ startup.
Severity:
30 : Severe error

Explanation:
An attempt to start the storage monitor process (job QMQM in subsystem QSYSWRK) was unsuccessful.

Response:
Check the joblog for this job and for the QMQM job for possible reasons for failure, correct the error and try the command again. If the problem is not resolved, a problem may have been logged. Use WRKPRB to record the problem identifier, and to save the QPSRVDMP, QPJOBLOG, and QPDSPJOB files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ6015The operating system is either too busy or has insufficient resources to complete a system request.
Severity:
30 : Severe error

Explanation:
A system request <insert_3> was rejected by the operating system with return code <insert_1>. WebSphere MQ retried the request, but it continued to fail. This failure may indicate that the operating system is either too busy or has insufficient resources to complete the request.

Response:
Investigate whether the system is constrained by the workload on this system or by the workload on a server that it is using, and reduce the workload.

AMQ6025Program not found.
Severity:
30 : Severe error

Explanation:
WebSphere MQ is unable to start program <insert_3> because it was not found.

Response:
Check the program name is correctly specified and rerun the program.

AMQ6026A resource shortage prevented the creation of a WebSphere MQ process.
Severity:
30 : Severe error

Explanation:
An attempt to create an MQ process was rejected by the operating system due to a process limit (either the number of processes for each user or the total number of processes running system wide), or because the system does not have the resources necessary to create another process.

Response:
Investigate whether a process limit is preventing the creation of the process and if so why the system is constrained in this way. Consider raising this limit or reducing the workload on the system.

AMQ6035WebSphere MQ failed, no storage available.
Severity:
30 : Severe error

Explanation:
An internal function of the product attempted to obtain storage, but there was none available.

Response:
Stop the product and restart it. If this does not resolve the problem, save the generated output files and contact your IBM support center.

AMQ6037WebSphere MQ was unable to obtain enough storage.
Severity:
20 : Error

Explanation:
The product is unable to obtain enough storage. The product's error recording routine may have been called.

Response:
Stop the product and restart it. If this does not resolve the problem see if a problem has been recorded. If a problem has been recorded, use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ6047Conversion not supported.
Severity:
30 : Severe error

Explanation:
WebSphere MQ is unable to convert string data tagged in CCSID <insert_1> to data in CCSID <insert_2>.

Response:
Check the WebSphere MQ Application Programming Reference Appendix and the appropriate National Language Support publications to see if the CCSIDs are supported by your system.

AMQ6048DBCS error
Severity:
30 : Severe error

Explanation:
WebSphere MQ is unable to convert string data due to a DBCS error. Conversion is from CCSID <insert_1> to CCSID <insert_2>.

Response:
Check the WebSphere MQ Application Programming Reference Appendix and the appropriate National Language Support publications to see if the CCSIDs are supported by your system.

AMQ6049DBCS-only string not valid.
Severity:
30 : Severe error

Explanation:
WebSphere MQ is unable to convert string data in CCSID <insert_1> to data in CCSID <insert_2>. Message descriptor data must be in single-byte form. CCSID <insert_2> is a DBCS-only CCSID.

Response:
Check the CCSID of your job or system and change it to one supporting SBCS or mixed character sets. Refer to the WebSphere MQ Application Programming Reference Appendix and the appropriate National Language Support publications for character sets and CCSIDs supported.

AMQ6050CCSID error.
Severity:
30 : Severe error

Explanation:
WebSphere MQ is unable to convert string data in CCSID <insert_1> to data in CCSID <insert_2>.

Response:
Check the WebSphere MQ Application Programming Reference Appendix and the appropriate National Language Support publications to see if the CCSIDs are supported by your system.

AMQ6051Conversion length error.
Severity:
30 : Severe error

Explanation:
WebSphere MQ is unable to convert string data in CCSID <insert_1> to data in CCSID <insert_2>, due to an input length error.

AMQ6052Conversion length error.
Severity:
30 : Severe error

Explanation:
WebSphere MQ is unable to convert string data in CCSID <insert_1> to data in CCSID <insert_2>.

AMQ6053CCSID error
Severity:
30 : Severe error

Explanation:
WebSphere MQ is unable to convert string data in CCSID <insert_1> to data in CCSID <insert_2>.

Response:
One of the CCSIDs is not supported by the system. Check the WebSphere MQ Application Programming Reference Appendix and the appropriate National Language Support publications to see if the CCSIDs are supported by your system.

AMQ6064An internal WebSphere MQ error has occurred.
Severity:
30 : Severe error

Explanation:
An error has been detected, and the MQ error recording routine has been called.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ6088 (iSeries)An internal WebSphere MQ error has occurred.
Severity:
40 : Stop Error

Explanation:
An internal error occurred when API call <insert_3> was made.

Response:
Use WRKPRB to record the problem identifier, and to save the QPSRVDMP, QPJOBLOG, and QPDSPJOB files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ6089 (iSeries)WebSphere MQ was unable to display an error message.
Severity:
30 : Severe error

Explanation:
An attempt to display an error message was unsuccessful. This may be because the AMQMSG message file could not be found. The message identifier is <insert_3>.

Response:
Check that the library list is set up correctly to access the AMQMSG message file. If a change is necessary, rerun the failing application and record the error message. If you are unable to resolve the problem, contact your IBM support center.

AMQ6090WebSphere MQ was unable to display an error message <insert_6>.
Severity:
0 : Information

Explanation:
MQ has attempted to display the message associated with return code hexadecimal <insert_6>. The return code indicates that there is no message text associated with the message. Associated with the request are inserts <insert_1> : <insert_2> : <insert_3> : <insert_4> : <insert_5>.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ6091An internal WebSphere MQ error has occurred.
Severity:
0 : Information

Explanation:
Private memory has detected an error, and is abending due to <insert_3>. The error data is <insert_1>.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ6092 (Windows)Manual conversion required for CCSID: <insert_1>
Severity:
0 : Information

Explanation:
CCSID <insert_1> exists in new format but could not be reconciled against your old format.

Response:
Manually edit CCSID entry <insert_1> in conv\table\ccsid.tbl if you wish to retain your old conversion. For assistance call your Service Representative.

AMQ6100An internal WebSphere MQ error has occurred.
Severity:
0 : Information

Explanation:
MQ has detected an error, and is abending due to <insert_3>. The error data is <insert_1>.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ6103 (iSeries)WebSphere MQ job submission error.
Severity:
30 : Severe error

Explanation:
WebSphere MQ is unable to submit job <insert_3>.

AMQ6107CCSID not supported.
Severity:
30 : Severe error

Explanation:
WebSphere MQ is unable to convert string data in CCSID <insert_1> to data in CCSID <insert_2>, because one of the CCSIDs is not recognized.

Response:
Check the WebSphere MQ Application Programming Reference Appendix and the appropriate National Language Support publications to see if the CCSIDs are supported by your system.

AMQ6109An internal WebSphere MQ error has occurred.
Severity:
30 : Severe error

Explanation:
An error has been detected, and the MQ error recording routine has been called.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ6110An internal WebSphere MQ error has occurred.
Severity:
30 : Severe error

Explanation:
An error has been detected, and the MQ error recording routine has been called.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ6112 (iSeries)WebSphere MQ CCSID <insert_1> is using a default value.
Severity:
10 : Warning

Explanation:
When initializing WebSphere MQ, no valid job CCSID was found, so the CCSID used is the default 37. This warning message will be issued until a valid CCSID has been set correctly.

Response:
Set the job CCSID.

AMQ6114 (iSeries)An internal WebSphere MQ error has occurred.
Severity:
30 : Severe error

Explanation:
An error has been detected, and the MQ error recording routine has been called.

Response:
Use WRKPRB to record the problem identifier, and to save the QPSRVDMP, QPJOBLOG, and QPDSPJOB files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ6115An internal WebSphere MQ error has occurred.
Severity:
10 : Warning

Explanation:
An error has been detected, and the MQ error recording routine has been called.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ6118An internal WebSphere MQ error has occurred (<insert_1>)
Severity:
40 : Stop Error

Explanation:
An error has been detected, and the MQ error recording routine has been called.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ6119An internal WebSphere MQ error has occurred (<insert_3>)
Severity:
40 : Stop Error

Explanation:
MQ detected an unexpected error when calling the operating system. The MQ error recording routine has been called.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ6120An internal WebSphere MQ error has occurred.
Severity:
40 : Stop Error

Explanation:
An error has been detected, and the MQ error recording routine has been called.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ6121An internal WebSphere MQ error has occurred.
Severity:
40 : Stop Error

Explanation:
An error has been detected, and the MQ error recording routine has been called.

Response:
MQ has detected a parameter count of <insert_1> that is not valid. Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ6122An internal WebSphere MQ error has occurred.
Severity:
40 : Stop Error

Explanation:
An error has been detected, and the MQ error recording routine has been called.

Response:
MQ has detected parameter <insert_1> that is not valid, having value <insert_2><insert_3>. Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ6125An internal WebSphere MQ error has occurred.
Severity:
40 : Stop Error

Explanation:
An internal error has occurred with identifier <insert_1>. This message is issued in association with other messages.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ6134 (iSeries)Trace continues in buffer
Severity:
0 : Information

AMQ6135 (iSeries)Stopping early trace
Severity:
0 : Information

AMQ6136 (iSeries)Stopping early trace <insert_3> system time
Severity:
0 : Information

AMQ6137 (iSeries)Resuming MQI trace
Severity:
0 : Information

AMQ6138 (iSeries)Resuming MQI trace <insert_3> system time
Severity:
0 : Information

AMQ6139 (iSeries)Stopping MQI trace
Severity:
0 : Information

AMQ6140 (iSeries)Stopping MQI trace <insert_3> system time
Severity:
0 : Information

AMQ6141 (iSeries)Starting MQI trace
Severity:
0 : Information

AMQ6142 (iSeries)Starting MQI trace <insert_3> system time
Severity:
0 : Information

AMQ6143 (iSeries)WebSphere MQ function stack
Severity:
0 : Information

AMQ6144 (iSeries)No stack available
Severity:
0 : Information

AMQ6145 (iSeries)Terminating MQI trace
Severity:
0 : Information

AMQ6146 (iSeries)Entering end job processing
Severity:
0 : Information

AMQ6147 (iSeries)Terminating MQI trace <insert_3> system time
Severity:
0 : Information

AMQ6148An internal WebSphere MQ error has occurred.
Severity:
0 : Information

Explanation:
MQ has detected an error, and is abending due to <insert_3>. The error data is <insert_1>.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ6150 (iSeries)WebSphere MQ resource <insert_3> busy.
Severity:
30 : Severe error

Explanation:
MQ was unable to access an MQ object within the normal timeout period of <insert_1> minutes.

Response:
MQ will continue to wait for access. Ensure that all jobs using MQ are released. If the situation persists, quiesce the queue manager.

AMQ6150 (Windows)WebSphere MQ semaphore is busy.
Severity:
10 : Warning

Explanation:
WebSphere MQ was unable to acquire a semaphore within the normal timeout period of <insert_1> minutes.

Response:
MQ will continue to wait for access. If the situation does not resolve itself and you suspect that your system is locked then investigate the process which owns the semaphore. The PID of this process will be documented in the accompanying FFST.

AMQ6151 (iSeries)WebSphere MQ resource <insert_3> released.
Severity:
30 : Severe error

Explanation:
An MQ resource, for which another process has been waiting, for a period of over <insert_1> minutes has been released.

Response:
No recovery is needed.

AMQ6152 (iSeries)WebSphere MQ failed to end commitment control while attempting to quiesce a queue manager.
Severity:
30 : Severe error

Explanation:
WebSphere MQ failed to end commitment control whilst quiescing queue manager <insert_3>.

Response:
There are one or more active resources under commitment control. Use the Work with Job (WRKJOB) command with the OPTION(*CMTCTL) parameter to display the active resources under commitment control. Check the job log for previously issued messages.

AMQ6153 (iSeries)The attempt to quiesce queue manager <insert_3> failed
Severity:
30 : Severe error

Explanation:
The attempt to quiesce queue manager <insert_3> was unsuccessful

Response:
Check the job log for previously issued messages. If the quiesce was issued with the *CNTRLD option, re-issue the command with the *IMMED option. If a low TIMEOUT retry delay was used, re-issue the request with a higher value.

AMQ6154 (iSeries)Queue manager <insert_3> has been quiesced.
Severity:
0 : Information

Explanation:
The queue manager has been successfully quiesced.

Response:
None.

AMQ6158 (iSeries)SBCS CCSID not found.
Severity:
30 : Severe error

Explanation:
WebSphere MQ is unable to find an SBCS CCSID which corresponds to mixed DBCS-SBCS CCSID <insert_1>.

Response:
Check the CCSID of your job or system and check it has a SBCS equivalent. Refer to the National Language Support Planning Guide for character sets and CCSIDs supported. If the CCSID used does have an SBCS equivalent, save the job log containing this message and contact your IBM support center.

AMQ6159 (iSeries)WebSphere MQ job submission error.
Severity:
30 : Severe error

Explanation:
WebSphere MQ for iSeries is unable to release job <insert_3>.

Response:
Contact you System Administrator to remove job <insert_3>. Ensure you have *JOBCTL authority and try again.

AMQ6160EXPLANATION:
Severity:
0 : Information

AMQ6161ACTION:
Severity:
0 : Information

AMQ6162An error has occurred reading an INI file.
Severity:
20 : Error

Explanation:
An error has occurred when reading the MQS.INI file or a queue manager QM.INI file.

Response:
If you have been changing the INI file content check and correct the change. If you have not changed the INI file, use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ6162 (Windows)An error occurred when reading the configuration data.
Severity:
20 : Error

Explanation:
An error has occurred when reading the configuration data.

Response:
If you have changed the configuration data, check and correct the change. If you have not changed the configuration data, use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ6163An error has occurred locking an INI file.
Severity:
10 : Warning

Explanation:
An error has occurred locking the MQS.INI file or a queue manager QM.INI file.

Response:
If you have been changing the INI file permissions check and correct the change. If you have not changed the INI file, use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ6163 (Windows)An error has occurred locking the configuration data.
Severity:
10 : Warning

Explanation:
An error has occurred locking the configuration data.

Response:
If you have changed the the registry permissions, check and correct the change. If you have not changed the registry, use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ6164An expected stanza in an INI file is missing or contains errors.
Severity:
10 : Warning

Explanation:
An expected stanza is missing from the MQS.INI file or a queue manager QM.INI file or the stanza contains errors.

Response:
If you have been changing the INI file content check and correct the change.

AMQ6164 (Windows)An expected stanza in the configuration data is missing or contains errors.
Severity:
10 : Warning

Explanation:
An expected stanza is missing from the configuration data or the stanza contains errors.

Response:
If you have changed the configuration data, check and correct the change.

AMQ6165Unable to access an INI file.
Severity:
10 : Warning

Explanation:
Access to the MQS.INI file or a queue manager QM.INI file is denied.

Response:
If you have been changing the INI file permissions check and correct the change.

AMQ6165 (Windows)Unable to access the configuration data.
Severity:
10 : Warning

Explanation:
Access to the configuration data is denied.

Response:
If you have changed the configuration data permissions, check and correct the changes.

AMQ6166An INI file is missing.
Severity:
20 : Error

Explanation:
The MQS.INI file or a queue manager QM.INI file is missing.

Response:
If you have been changing the INI file recover the previous file and retry the operation.

AMQ6166 (Windows)An entry in the configuration data is missing.
Severity:
20 : Error

Explanation:
A required entry in the configuration data is missing.

Response:
If you have changed the configuration data, recover the previous configuration data and retry the operation.

AMQ6172No codeset found for current locale.
Severity:
20 : Error

Explanation:
No codeset could be determined for the current locale. Check that the locale in use is supported.

Response:
None.

AMQ6173No CCSID found for codeset <insert_3>.
Severity:
20 : Error

Explanation:
Codeset <insert_3>. has no supported CCSID. Check that the locale in use is supported. CCSIDs can be added by updating the file /var/mqm/conv/table/ccsid.tbl.

Response:
None.

AMQ6174The library <insert_3> was not found. The queue manager will continue without this module.
Severity:
0 : Information

Explanation:
The dynamically loadable library <insert_3> was not found.

Response:
Check that the file exists and is either fully qualified or is in the appropriate directory.

AMQ6174 (iSeries)The library was not found.
Severity:
0 : Information

Explanation:
The dynamically loadable file <insert_3> was not found. The queue manager will continue without this module.

Response:
Check that the file exists and is either fully qualified or is in the appropriate directory.

AMQ6174 (Unix)The dynamically loadable shared library <insert_3> was not found. The system returned error number <insert_2> and error message <insert_4>. The queue manager will continue without this module.
Severity:
0 : Information

Explanation:
This message applies to UNIX systems. The shared library <insert_3> was not found.

Response:
Check that the file exists, and is either fully qualified or is in the appropriate director, also check the file access permissions.

AMQ6175 (AIX)The system could not dynamically load the shared library <insert_3>. The system returned error number <insert_2> and error message <insert_4>. The queue manager will continue without this module.
Severity:
30 : Severe error

Explanation:
This message applies to AIX systems. The shared library <insert_3> failed to load correctly due to a problem with the library.

Response:
Check the file access permissions and that the file has not been corrupted.

AMQ6175 (Unix)The system could not dynamically load the shared library <insert_3>. The system returned error message <insert_4>. The queue manager will continue without this module.
Severity:
30 : Severe error

Explanation:
This message applies to UNIX systems. The shared library <insert_3> failed to load correctly due to a problem with the library.

Response:
Check the file access permissions and that the file has not been corrupted.

AMQ6175 (Windows)The system could not dynamically load the library <insert_3>. The system return code was <insert_1>. The queue manager will continue without this module.
Severity:
30 : Severe error

Explanation:
This message applies to Windows NT and Windows 2000 systems only. The dynamically loadable file <insert_3> failed to load correctly due to an internal error. The MQ error recording routine has been called.

Response:
Check that the file has not been corrupted then use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ6177 (Windows)An internal WebSphere MQ error has occurred.
Severity:
40 : Stop Error

Explanation:
An error has been detected, and the MQ error recording routine has been called.

Response:
Details of the error have been stored at <insert_3>. A synopsis is given in the data section below. Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ6179The system could not find symbol <insert_5> in the dynamically loaded library <insert_3>. The system returned error number <insert_2> and error message <insert_4>.
Severity:
30 : Severe error

Explanation:
The library <insert_3> does not contain symbol <insert_5> or it has not been exported.

Response:
Check that symbol name <insert_5> is correct and has been exported from the library.

AMQ6179 (Unix)The system could not find the symbol <insert_5> in the dynamically loaded shared library <insert_3>. The system returned error message <insert_4>.
Severity:
30 : Severe error

Explanation:
This message applies to UNIX systems. The shared library <insert_3> does not contain symbol <insert_5> or it has not been exported.

Response:
Check that symbol name <insert_5> is correct and has been exported from the library.

AMQ6180 (Windows)Default conversion not supported.
Severity:
30 : Severe error

Explanation:
WebSphere MQ is unable to convert string data tagged in CCSID <insert_1> to data in CCSID <insert_2>.

Response:
Check the default CCSIDs specified in the ccsid.tbl file and make sure that conversion is supported between these CCSIDs.

AMQ6182Error found in line <insert_1> of ccsid.tbl
Severity:
30 : Severe error

Explanation:
Line <insert_1> contains and error. The content of the line is <insert_3>. Processing continues but the line in error is ignored.

Response:
Correct the line and rerun the program or command giving this message.

AMQ6183An internal WebSphere MQ error has occurred.
Severity:
10 : Warning

Explanation:
An error has been detected, and the WebSphere MQ error recording routine has been called. The failing process is process <insert_1>.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ6184An internal WebSphere MQ error has occurred on queue manager <insert_3>.
Severity:
10 : Warning

Explanation:
An error has been detected, and the WebSphere MQ error recording routine has been called. The failing process is process <insert_1>.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ6184 (iSeries)An internal WebSphere MQ error has occurred.
Severity:
10 : Warning

Explanation:
An internal MQ error has occurred on queue manager <insert_3> and the MQ error recording routine has been called. The failing process is process <insert_1>.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ6187User is not authorized for RestrictedMode queue manager.
Severity:
40 : Stop Error

Explanation:
All users must be in the RestrictedMode application_group.

AMQ6188 (AIX)The system could not dynamically load the shared library <insert_3> as the entry point to the library, symbol 'MQStart', could not be located within the library. The queue manager will continue without this library.
Severity:
30 : Severe error

Explanation:
This message applies to AIX systems. The shared library <insert_3> failed to load correctly due to a problem with the library.

Response:
Check that the entry point to the library, symbol 'MQStart', exists and has been exported from the library.

AMQ6188 (Unix)The system could not dynamically load the shared library <insert_3> as the entry point to the library, symbol 'MQStart', could not be located within the library. The system returned error message <insert_4>. The queue manager will continue without this library.
Severity:
30 : Severe error

Explanation:
This message applies to UNIX systems. The shared library <insert_3> failed to load correctly due to a problem with the library.

Response:
Check that the entry point to the library, symbol 'MQStart', exists and has been exported from the library.

AMQ6188 (Windows)The system could not dynamically load the library <insert_3> due to a problem with the dll. The errno was <insert_1>. The queue manager will continue without this module.
Severity:
30 : Severe error

Explanation:
This message applies to Windows NT and Windows 2000 systems only. The dynamically loadable file <insert_3> failed to load correctly due to a problem with the dll.

Response:
Check that the dll is in the correct place with the correct file permissions etc. and has not been corrupted.

AMQ6190 (Windows)Program <insert_3> not found.
Severity:
30 : Severe error

Explanation:
The program <insert_3> cannot be found.

Response:
Check that the program specified is available on your system. If the program name is not fully qualified, ensure that the PATH environment variable includes the directory where the program is located.

AMQ6191 (Windows)Program <insert_3> failed to start, return code <insert_1>.
Severity:
30 : Severe error

Explanation:
The program <insert_3> was invoked, but failed to start. The failure reason code is <insert_1>.

Response:
Check that the program specified is available on your system, and that sufficient system resources are available. Where applicable, verify that the user is authorized to run the program.

AMQ6192 (Windows)IBM WebSphere MQ Utilities
Severity:
0 : Information

AMQ6193 (Windows)The registry entry <insert_3> was not found.
Severity:
20 : Error

Explanation:
WebSphere MQ for Windows NT and Windows 2000 sets the registry entry <insert_3> when the product is installed, but the entry is now missing.

Response:
If the registry has been edited, restore the previous version. If the product is newly installed, check whether the installation was successful, and reinstall the product if necessary.

AMQ6196An error has occurred whilst processing a temporary INI file <insert_3>
Severity:
20 : Error

Explanation:
An error has occurred when creating a backup of an INI file. The backup file <insert_4> already exists

Response:
You may have created a backup of the INI file with the name <insert_4>, or an earlier operation may have failed. Move or delete the file <insert_4> and reattempt the operation. If you have not changed the INI file, use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ6207 (AIX)Failed to attach shared memory segment as Segment table is Full.
Severity:
20 : Error

Explanation:
When running in native mode an application may attach only 10 shared memory segments. The application which issued this message attempted to exceed this number. By setting the environment variable EXTSHM=ON this limit can be removed. Further explanation on using this variable and other options available, may be found in the documentation.

Response:
Either reduce the number of segments to which your application needs to attach or set the EXTSHM=ON variable in your environment before starting the application.

AMQ6208<insert_3>
Severity:
10 : Warning

Explanation:
<insert_4>

Response:
<insert_5>

AMQ6209An unexpected asynchronous signal (<insert_1> : <insert_3>) has been received and ignored.
Severity:
10 : Warning

Explanation:
Process <insert_2> received an unexpected asynchronous signal and ignored it. This has not caused an error but the source of the signal should be determined as it is likely that the signal has been generated externally to WebSphere MQ.

Response:
Determine the source of the signal and prevent it from re-occurring.

AMQ6212Failed to load Library <insert_3> as C++ environment is not initialised.
Severity:
10 : Warning

Explanation:
An attempt was made to load the identified C++ shared library. However, the attempt failed because the C++ environment has not been initialized for the current process.

Response:
Ensure the application is linked with the appropriate C++ runtime environment.

AMQ6218 (AIX)EXTSHM variable detected with unrecognised value <insert_3> and has been reset to <insert_4>.
Severity:
20 : Error

Explanation:
Processes that access the internal queue manager control blocks must use the AIX Extended Shared Memory model, and while one such process was starting, WebSphere MQ detected that the EXTSHM variable was set but did not contain an appropriate value. This value has been reset and the process will continue with the new setting.

Response:
No further action is required. To prevent this message being issued in future, correct the value of the EXTSHM variable in your environment.

AMQ6230Message <insert_3> suppressed <insert_1> times in the last <insert_4> seconds.
Severity:
20 : Error

Explanation:
Message <insert_3> was issued <insert_2> times in the last <insert_4> seconds but only the first instance of the message was written to the log. The suppressed messages may have included differing message arguments.

Response:
If you wish to see all occurrences of this message you should alter the definition of the SuppressMessage attribute in the Queue Manager configuration.

AMQ6231 (Unix)Usage: mqchkdir <insert_-a | -m QMgrName> [-f]
Severity:
20 : Error

Explanation:
An incorrect option was specified on the command line for the command.

Response:
Reissue the command specifying the correct parameters.

AMQ6232 (Unix)Operating System userid <insert_3> not found.
Severity:
20 : Error

Explanation:
A request was made to the operating system to lookup the details of the identified userid but the request failed.

Response:
Using the operating system supplied tools check for the existence of the identified userid, and if missing then recreate it.

AMQ6233 (Unix)Operating System authorisation group <insert_3> not found.
Severity:
20 : Error

Explanation:
A request was made to the operating system to lookup the details of the identified group but the request failed.

Response:
Using the operating system supplied tools check for the existence of the identified group, and if missing then recreate it.

AMQ6234 (Unix)Unknown Queue Manager name specified.
Severity:
20 : Error

Explanation:
An invalid Queue Manager name <insert_3> was specified in the parameters to the command.

Response:
Reissue the command specifying a valid Queue Manager name.

AMQ6235 (Unix)Directory <insert_3> missing.
Severity:
20 : Error

Explanation:
The identified directory is missing.

Response:
Reissue the command selecting the option to create missing directories.

AMQ6236 (Unix)Missing directory <insert_3> has been created.
Severity:
20 : Error

Explanation:
The identified directory was missing but has been created.

Response:
None

AMQ6237 (Unix)File <insert_3> missing.
Severity:
20 : Error

Explanation:
The identified file is missing.

Response:
Reissue the command selecting the option to create missing files.

AMQ6238 (Unix)Missing file <insert_3> has been created.
Severity:
20 : Error

Explanation:
The identified file was missing but has been created.

Response:
None

AMQ6239 (Unix)Permission denied attempting to access filesystem location <insert_3>.
Severity:
20 : Error

Explanation:
An attempt to query the filesystem object identified failed because the command issued did not have authority to access the object.

Response:
Check the authority on the object and of the user executing the command and reissue the command.

AMQ6240 (Unix)You must be an operating system superuser to run this command.
Severity:
20 : Error

Explanation:
In irder to run this command you must be logged on as a user with superuser privelages.

Response:
Log in as an appropriate user and reissue the command.

AMQ6241 (Unix)The filesystem object <insert_3> is a symbolic link.
Severity:
20 : Error

Explanation:
While checking the filesystem, an object was found which is a symbolic link.

Response:
This is not an error however you should verify that the symbolic link is expected and that the destination of the symbolic link is correct.

AMQ6242 (Unix)Incorrect ownership for <insert_3>. Current(<insert_1>) Expected(<insert_2>)
Severity:
20 : Error

Explanation:
The filesystem object <insert_3> is owned by the user with uid <insert_1> when it was expected to be owned by the user with uid <insert_2>.

Response:
Correct the ownership using operating system commands or reissue the command selecting the option to fix the incorrect owership.

AMQ6243 (Unix)Incorrect group ownership for <insert_3>. Current(<insert_1>) Expected(<insert_2>)
Severity:
20 : Error

Explanation:
The filesystem object <insert_3> is owned by the group with gid <insert_1> when it was expected to be owned by the group with gid <insert_2>.

Response:
Correct the ownership using operating system commands or reissue the command selecting the option to fix the incorrect owership.

AMQ6244 (Unix)Incorrect permissions on object <insert_3>. Current(<insert_4>) Expected(<insert_5>)
Severity:
20 : Error

Explanation:
The filesystem object <insert_3> has the wrong file permissions.

Response:
Correct the ownership using operating system commands or reissue the command selecting the option to fix the incorrect owership.

AMQ6245 (Unix)Error executing system call <insert_3> on file <insert_4> error <insert_2>.
Severity:
20 : Error

Explanation:
The execution of the system call <insert_3> on file <insert_4> failed and the error code <insert_2> was returned.

Response:
Investigate the cause of the failure using the operating system error code <insert_1> and reissue the command.

AMQ6251 (Unix)The system could not dynamically load the shared library <insert_3>. The queue manager will continue without this module.
Severity:
30 : Severe error

Explanation:
This message applies to UNIX systems. The shared library <insert_3> failed to load as it is probably a <insert_1>-bit library, a <insert_2>-bit library is required. Note that MQ tried to find a <insert_2>-bit library named either <insert_4> or <insert_5>, but failed. The following message gives details of the original failure.

Response:
Supply the name of a <insert_2>-bit library.

AMQ6252 (Unix)The system could not dynamically load the shared library <insert_3>. The queue manager will continue without this module.
Severity:
30 : Severe error

Explanation:
This message applies to UNIX systems. The shared library <insert_3> failed to load as it is probably a <insert_1>-bit library, a <insert_2>-bit library is required. Note that MQ found and loaded a <insert_2>-bit library named <insert_4> however this also failed to load with the system returning error message <insert_5>. The following message gives details of the original failure.

Response:
Supply the name of a <insert_2>-bit library.

AMQ6253 (Unix)The system could not dynamically load the shared library <insert_3>. The queue manager will continue without this module.
Severity:
30 : Severe error

Explanation:
This message applies to UNIX systems. The shared library <insert_3> failed to load as it is probably a <insert_1>-bit library, a <insert_2>-bit library is required. Note that MQ attempted to locate and load a <insert_2>-bit library named either of these: <insert_4>. The first library failed to load as it also is probably a <insert_1>-bit library, the second library is a <insert_2>-bit library, however this also failed to load with the system returning error message <insert_5>. The following message gives details of the original failure.

Response:
Supply the name of a <insert_2>-bit library.

AMQ6254 (Unix)The system could not dynamically load the shared library <insert_3>, library <insert_4> has been used instead.
Severity:
0 : Information

Explanation:
This message applies to UNIX systems. The shared library <insert_3> failed to load as it is probably a <insert_1>-bit library, a <insert_2>-bit library is required. Note that MQ has sucessfully located and loaded a <insert_2>-bit library named <insert_4>.

Response:
Supply the name of a <insert_2>-bit library or put the library (alternatively a symbolic link can be used) in the appropriate place: 32-bit libraries in /var/mqm/exits; 64-bit libraries in /var/mqm/exits64.

AMQ6255 (Unix)The system could not dynamically load the shared library <insert_3>. The queue manager will continue without this module.
Severity:
30 : Severe error

Explanation:
This message applies to UNIX systems. The shared library <insert_3> failed to load as it is probably a <insert_1>-bit library, a <insert_2>-bit library is required. The following message gives details of the original failure.

Response:
Supply the name of a <insert_2>-bit library.

AMQ6256 (Unix)The system could not dynamically load the shared library <insert_3>. The queue manager will continue without this module.
Severity:
30 : Severe error

Explanation:
This message applies to UNIX systems. The shared library <insert_3> failed to load as it is probably a <insert_1>-bit library, a <insert_2>-bit library is required. Note that MQ tried to find a <insert_2>-bit library named <insert_4>, but failed. The following message gives details of the original failure.

Response:
Supply the name of a <insert_2>-bit library.

AMQ6257Message suppression enabled for message numbers (<insert_3>).
Severity:
0 : Information

Explanation:
The message contain's a list of message id's for which entries repeated within the <insert_1> suppression interval will be suppressed.

Response:
If you wish to see all occurrences of these messages you should alter the definition of the SuppressMessage attribute in the Queue Manager configuration.

AMQ6258Message exclusion enabled for message numbers (<insert_3>).
Severity:
0 : Information

Explanation:
The message contain's a list of message id's which have been excluded. Requests to write these messages to the error log will be discarded.

Response:
If you wish to see instances of these messages you should alter the definition of the ExcludeMessage attribute in the Queue Manager configuration.

AMQ6259Message <insert_3> cannot be <insert_4>.
Severity:
10 : Warning

Explanation:
Message <insert_3> cannot be excluded or suppressed but was specified in the ExcludeMessage or SuppressMessage configuration for the Queue Manager. The Queue Manager will continue however the request to suppress or exclude this message will be ignored.

Response:
Update the Queue Manager configuration to remove the specified message identifier.

AMQ6260Help Topic not found
Severity:
10 : Warning

Explanation:
The requested help topic could not be located. 
For further assistance, please refer to the WebSphere MQ manuals.

Response:
Ensure that the WebSphere MQ InfoCenter is installed.

AMQ6261 (Unix)An exception occurred trying to dynamically load shared library <insert_3>. The queue manager will continue without this module.
Severity:
30 : Severe error

Explanation:
This message applies to UNIX systems. Exception number <insert_1> name <insert_4>, occurred trying to dynamically load shared library <insert_3>.

Response:
Check the shared library has not been corrupted. If the shared library contains any initializer functions, ensure these are not causing the problem and that they conform to the expected function prototype.

AMQ6261 (Windows)An exception occurred trying to load DLL <insert_3>. The queue manager will continue without this module.
Severity:
30 : Severe error

Explanation:
This message applies to Windows systems only. Exception number <insert_1> error <insert_4>, occurred trying to load DLL <insert_3>.

Response:
Check the DLL has not been corrupted. If the DLL contains any initializer functions, ensure these are not causing the problem and that they conform to the expected function prototype.

AMQ6666 (iSeries)Required WebSphere MQ system profile(s) can not be accessed.
Severity:
40 : Stop Error

Explanation:
The required MQ system profile(s) QMQM and/or QMQMADM are not found or have been disabled. MQ can not continue processing the command without the profiles existing and enabled on the system. The major error code is <insert_3>, the minor error code is <insert_4>. The major error codes and their meanings are as follows: *DISABLED - The user profile has been disabled. *PWDEXP - The password for the user profile has expired . *EXIST - The user profile does not exist. If none of these error codes are shown the major error code contains the exception identifier. The minor error code identifies the user profile which cannot be accessed.

Response:
Check that both QMQM and QMQMADM profiles exist and are both enabled using the DSPUSRPRF command, or contact the WebSphere MQ system administrator.

AMQ6708A disk full condition was encountered when formatting a new log file in location <insert_3>.
Severity:
20 : Error

Explanation:
The queue manager attempted to format a new log file in directory <insert_3>. The drive or file system containing this directory did not have sufficient free space to contain the new log file.

Response:
Increase the amount of space available for log files and retry the request.

AMQ6708 (iSeries)A disk full condition was encountered when formatting a new log file.
Severity:
20 : Error

Explanation:
The queue manager attempted to format a new log file in directory <insert_3>. The drive or file system containing this directory did not have sufficient free space to contain the new log file.

Response:
Increase the amount of space available for log files and retry the request.

AMQ6709The log for the Queue manager is full.
Severity:
20 : Error

Explanation:
This message is issued when an attempt to write a log record is rejected because the log is full. The queue manager will attempt to resolve the problem.

Response:
This situation may be encountered during a period of unusually high message traffic. However, if you persistently fill the log, you may have to consider enlarging the size of the log. You can either increase the number of log files by changing the values in the queue manager configuration file. You will then have to stop and restart the queue manager. Alternatively, if you need to make the log files themselves bigger, you will have to delete and recreate the queue manager.

AMQ6710Queue manager unable to access directory <insert_3>.
Severity:
20 : Error

Explanation:
The queue manager was unable to access directory <insert_3> for the log. This could be because the directory does not exist, or because the queue manager does not have sufficient authority.

Response:
Ensure that the directory exists and that the queue manager has authority to read and write to it. Ensure that the LogPath attribute in the queue manager's configuration file matches the intended log path.

AMQ6767Log file <insert_3> could not be opened for use.
Severity:
20 : Error

Explanation:
Log file <insert_3> could not be opened for use. Possible reasons include the file being missing, the queue manager being denied permission to open the file or the contents of the file being incorrect.

Response:
If the log file was required to start the queue manager, ensure that the log file exists and that the queue manager is able to read from and write to it. If the log file was required to recreate an object from its media image and you do not have a copy of the required log file, delete the object instead of recreating it.

AMQ6774Log file <insert_3> did not contain the requested log record.
Severity:
20 : Error

Explanation:
Log file <insert_3> does not contain the log record whose LSN is <insert_4>. This is because the log file numbers have wrapped and the log file name <insert_3> has been reused by a newer file. Once a log file name has been reused, it is not possible to access the data in the previous versions of the file to use this name. The operation which requested this log record cannot be completed.

AMQ6782The log file numbers have wrapped.
Severity:
0 : Information

Explanation:
Each log file formatted is assigned a number which makes up part of its file name. The numbers are allocated sequentially and consist of seven digits giving a maximum of 10 million different log file names. Once all available numbers have been allocated, the queue manager again starts allocating numbers starting from zero. Once a file number has been re-allocated, you can no longer access data in the previous log files allocated the same number. The file numbers wrapped at log sequence number <insert_3>.

Response:
You should periodically take media images of all WebSphere MQ objects. You must ensure that media images of all objects which you may need to recreate do not span more than 10 million log files.

AMQ6901 (iSeries)WebSphere MQ for iSeries
AMQ6902 (iSeries)WebSphere MQ for iSeries - Samples
AMQ6903 (iSeries)Installation failed, WebSphere MQ resources are still active.
Severity:
30 : Severe error

Explanation:
An attempt to install WebSphere MQ was unsuccessful because MQ resources from a previous installation of MQ are still active. This failure may indicate that a queue manager from a previous installation of MQ is still running or has active jobs.

Response:
Ensure that all queue managers from previous installations of WebSphere MQ have been quiesced, and that the QMQM subsystem is not active using the WRKSBS and ENDSBS commands. Refer to the installation section in the WebSphere MQ for iSeries Quick Beginnings publication for further details.

AMQ6904 (iSeries)Installation of WebSphere MQ for iSeries failed due to previous release installed.
Explanation:
Some releases of WebSphere MQ for iSeries require migration before a later release can be installed.

Response:

If you wish to retain your current MQ information you must step through the migration process - see the Quick Beginnings Manual. 
If you do not wish to retain your current MQ information remove the current version of MQ before retrying the install.

AMQ6905 (iSeries)Found <insert_3> new MQ jobs to end, and <insert_4> MQ jobs currently ending.
Severity:
0 : Information

Explanation:
Jobs with locks on library QMQM are ended so that WebSphere MQ may be deleted or updated.

Response:
None.

AMQ6906 (iSeries)<insert_3> jobs still ending.
Severity:
40 : Stop Error

Explanation:
Jobs report state of 'already being deleted' after timeout.

Response:
If system is heavily loaded wait and reissue the command CALL QMQM/AMQIQES4 to try to delete jobs using WebSphere MQ resources. If this message is issued again, issue the command WRKOBJLCK for library QMQM to see which jobs have not been deleted, and end them manually.

AMQ6907 (iSeries)All WebSphere MQ pre-requisite PTFs on OS/400 programs are installed.
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ6908 (iSeries)WebSphere MQ pre-requisite PTF <insert_4> for program <insert_3> is not installed.
Severity:
40 : Stop Error

Explanation:
PTF <insert_3>-<insert_4> is not installed on system in state 'Permanently applied' 'Temporarily applied' or 'Superseded'. WebSphere MQ installation will proceed, but you must install the PTF before starting WebSphere MQ.

Response:
Use the command GO CMDPTF to display commands to order and apply the required PTF <insert_3>-<insert_4>..

AMQ6909 (iSeries)User space recovery failed, WebSphere MQ is running.
Severity:
30 : Severe error

Explanation:
An attempt to recover user space was unsuccessful because MQ was running.

Response:
Quiesce WebSphere MQ for iSeries and try again. See the section on "Quiescing WebSphere MQ" in the WebSphere MQ for iSeries Quick Beginnings.

AMQ6910 (iSeries)The attempt to quiesce the queue manager failed.
Severity:
30 : Severe error

Explanation:
The attempt to quiesce the queue manager was unsuccessful because the current job has locks on library QMQM.

Response:
Sign off the current job, sign on and attempt to quiesce the queue manager again. See the section on "Quiescing WebSphere MQ" in the WebSphere MQ for iSeries Quick Beginnings.

AMQ6911 (iSeries)WebSphere MQ quiesce is performing a RCDMQMIMG, please wait.
Severity:
0 : Information

Explanation:
WebSphere MQ quiesce is performing a Record Object Image (RCDMQMIMG) for all objects. Please wait, there may be some delay until this is completed.

Response:
None.

AMQ6912 (iSeries)WebSphere MQ for iSeries - Java
AMQ6913 (iSeries)WebSphere MQ for iSeries - Java samples
AMQ6988yes
Severity:
0 : Information

AMQ6988 (iSeries)Yes
AMQ6989no
Severity:
0 : Information

AMQ6989 (iSeries)No
AMQ6992 (iSeries)Program <insert_3> parameter error.
Severity:
40 : Stop Error

Explanation:
WebSphere MQ for iSeries program <insert_3> has an incorrect number of parameters, or an error in the parameter value.

Response:
Display the job log, using the DSPJOBLOG command, for more information on the problem.

AMQ6993 (iSeries)Program <insert_3> ended abnormally.
Severity:
40 : Stop Error

Explanation:
A WebSphere MQ for iSeries program, <insert_3>, is ending abnormally.

Response:
Display the job log, using the DSPJOBLOG command, for information why the job or subsystem ended abnormally. Correct the error and retry the request.

AMQ6994 (Windows)5724-H72 (C) Copyright IBM Corp. 1994, 2004. ALL RIGHTS RESERVED.
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ6995 (iSeries)xcsFFST has been called; take a look at the job log.
Severity:
0 : Information

AMQ6998 (iSeries)An internal WebSphere MQ error has occurred.
Severity:
40 : Stop Error

Explanation:
WebSphere MQ for iSeries is diagnosing an unexpected error.

Response:
Save the job log, and contact your IBM support center.

AMQ6999 (iSeries)An internal WebSphere MQ error has occurred.
Severity:
0 : Information

Explanation:
WebSphere MQ has experienced an internal failure, from which it could not recover.

Response:
Use WRKPRB to check if a problem has been created. If one has, record the problem identifier, and save the QPSRVDMP, QPJOBLOG, and QPDSPJOB files. If a problem has not been created, save the job log. Contact your IBM support center. Do not discard these files until the problem has been resolved.

7000-7999 - WebSphere MQ product
See Reading a message for an explanation of how to interpret these messages.

AMQ7001The location specified for creation of the queue manager is not valid.
Severity:
40 : Stop Error

Explanation:
The directory under which queue managers are to be created is not valid. It may not exist, or there may be a problem with authorization.

Response:
The location is specified in the machine-wide ini file. Correct the file and submit the request again.

AMQ7001 (Windows)The location specified for the creation of the queue manager is not valid.
Severity:
40 : Stop Error

Explanation:
The directory under which the queue managers are to be created is not valid. It may not exist, or there may be a problem with authorization.

Response:
The location is specified in the configuration data. Correct the configuration data and submit the request again.

AMQ7002An error occurred manipulating a file.
Severity:
40 : Stop Error

Explanation:
An internal error occurred while trying to create or delete a queue manager file. It is likely that the error was caused by a disk having insufficient space, or by problems with authorization to the underlying filesystem.

Response:
Identify the file that caused the error, using problem determination techniques. For example check if there are any FFST files, which may identify the queue manager file causing the error. This error may also be caused if users have created, renamed or deleted that file. Correct the error in the filesystem and submit the request again.

AMQ7002 (Windows)An error occurred manipulating a file.
Severity:
40 : Stop Error

Explanation:
An internal error occurred while trying to create or delete a queue manager file. 
In the case of a failure to delete a file a common reason for this error is that a non MQ process, such as the windows explorer or a virus checker, is accessing the file. In the case where the object that cannot be deleted is a directory then a non MQ process may be accessing a file within the directory or one of its subdirectories. 
It is also possible that the error was caused by a disk having insufficient space, or by problems with authorization to the underlying filesystem.

Response:
Identify the file that caused the error, using problem determination techniques. For example check if there are any FFST files, which may identify the queue manager file causing the error. This error may also be caused if users have created, renamed or deleted that file. Correct the error in the filesystem and submit the request again.

AMQ7005The queue manager is running.
Severity:
40 : Stop Error

Explanation:
You tried to perform an action that requires the queue manager stopped, however, it is currently running. You probably tried to delete or start a queue manager that is currently running.

Response:
If the queue manager should be stopped, stop the queue manager and submit the failed command again.

AMQ7006Missing attribute <insert_5> on stanza starting on line <insert_1> of ini file <insert_3>.
Severity:
20 : Error

Explanation:
The <insert_4> stanza starting on line <insert_1> of configuration file <insert_3> is missing the required <insert_5> attribute.

Response:
Check the contents of the file and retry the operation.

AMQ7006 (Windows)Missing attribute <insert_5> from configuration data.
Severity:
20 : Error

Explanation:
The <insert_4> stanza in the configuration data is missing the required <insert_5> attribute.

Response:
Check the contents of the configuration data and retry the operation.

AMQ7008The queue manager already exists.
Severity:
40 : Stop Error

Explanation:
You tried to create a queue manager that already exists.

Response:
If you specified the wrong queue manager name, correct the name and submit the request again.

AMQ7010The queue manager does not exist.
Severity:
40 : Stop Error

Explanation:
You tried to perform an action against a queue manager that does not exist. You may have specified the wrong queue manager name.

Response:
If you specified the wrong name, correct it and submit the command again. If the queue manager should exist, create it, and then submit the command again.

AMQ7011The queue manager files have not been completely deleted.
Severity:
40 : Stop Error

Explanation:
While deleting the queue manager, an error occurred deleting a file or directory. The queue manager may not have been completely deleted.

Response:
Follow problem determination procedures to identify the file or directory and to complete deletion of the queue manager.

AMQ7012The specified trigger interval is not valid.
Severity:
40 : Stop Error

Explanation:
You specified a value for the trigger interval that is not valid. The value must be not less than zero and not greater than 999 999 999.

Response:
Correct the value and resubmit the request.

AMQ7013There is an error in the name of the specified dead-letter queue.
Severity:
40 : Stop Error

Explanation:
You specified a name for the dead-letter queue that is not valid.

Response:
Correct the name and resubmit the request.

AMQ7014There is an error in the name of the specified default transmission queue.
Severity:
40 : Stop Error

Explanation:
You specified a name for the default transmission queue that is not valid.

Response:
Correct the name and submit the command again.

AMQ7015There is an error in the maximum number of open object handles specified.
Severity:
40 : Stop Error

Explanation:
You specified a value for the maximum number of open object handles to be allowed that is not valid. The value must be not less than zero and not greater than 999 999 999.

Response:
Correct the value and submit the command again.

AMQ7016There is an error in the maximum number of uncommitted messages specified.
Severity:
40 : Stop Error

Explanation:
You specified a value for the maximum number of uncommitted messages to be allowed that is not valid. The value must be not less than 1 and not greater than 999 999 999.

Response:
Correct the value and submit the command again.

AMQ7017Log not available.
Severity:
40 : Stop Error

Explanation:
The queue manager was unable to use the log. This could be due to a log file being missing or damaged, or the log path to the queue manager being inaccessible.

Response:
Ensure that the LogPath attribute in the queue manager configuration file is correct. If a log file is missing or otherwise unusable, restore a backup copy of the file, or the entire queue manager.

AMQ7018The queue manager operation cannot be completed.
Severity:
20 : Error

Explanation:
An attempt has been made to perform an operation on a queue manager. Resources required to perform the operation are not available.

AMQ7019An error occurred while creating the directory structure for the new queue manager.
Severity:
40 : Stop Error

Explanation:
During creation of the queue manager an error occurred while trying to create a file or directory.

Response:
Identify why the queue manager files cannot be created. It is probable that there is insufficient space on the specified disk, or that there is a problem with access control. Correct the problem and submit the command again.

AMQ7020The operation was carried out, but one or more transactions remain in-doubt.
Severity:
10 : Warning

Explanation:
The queue manager tried to resolve all internally coordinated transactions which are in-doubt. In-doubt transactions still remain after the queue manager has attempted to deliver the outcome of these transactions to the resource managers concerned. Transactions remain in-doubt when the queue manager cannot deliver the outcome of the transaction to each of the participating resource managers. For example, a resource manager may not be available at this time.

Response:
Use the DSPMQTRN command to display the remaining in-doubt transactions.

AMQ7020 (iSeries)The operation was carried out, but one or more transactions remain in-doubt.
Severity:
10 : Warning

Explanation:
The queue manager tried to resolve all internally coordinated transactions which are in-doubt. In-doubt transactions still remain after the queue manager has attempted to deliver the outcome of these transactions to the resource managers concerned. Transactions remain in-doubt when the queue manager cannot deliver the outcome of the transaction to each of the participating resource managers. For example, a resource manager may not be available at this time.

Response:
Use the Work with Transactions (WRKMQMTRN) command to display the remaining in-doubt transactions.

AMQ7021An error occurred while deleting the directory structure for the queue manager.
Severity:
40 : Stop Error

Explanation:
While deleting the queue manager, an error occurred deleting a file or directory. The queue manager may not have been completely deleted.

Response:
Follow problem determination procedures to identify the file or directory and to complete deletion of the queue manager.

AMQ7022The resource manager identification number is not recognized.
Severity:
20 : Error

Explanation:
The identification number of the resource manager you supplied was not recognized.

Response:
Ensure that you entered a valid resource manager identification number. Use the DSPMQTRN command to display a list of resource managers and their identification numbers.

AMQ7023The resource manager was in an invalid state.
Severity:
20 : Error

Explanation:
The resource manager, the identification number of which you supplied, was in an invalid state.

Response:
Ensure that you entered the correct resource manager identification number. Use the DSPMQTRN command to display a list of resource managers and their identification numbers. A resource manager is in an invalid state, if it is still available to resolve the transaction, use the -a optional flag to resolve this and all other internally coordinated in-doubt transactions.

AMQ7024Arguments supplied to a command are not valid.
Severity:
20 : Error

Explanation:
You supplied arguments to a command that it could not interpret. It is probable that you specified a flag not accepted by the command, or that you included extra flags.

Response:
Correct the command and submit it again. Additional information on the arguments causing the error may be found in the error logs for the queue, or queue manager, referenced in the command.

AMQ7025Error in the descriptive text argument (-c parameter) of the crtmqm command.
Severity:
40 : Stop Error

Explanation:
The descriptive text you supplied to the crtmqm command was in error.

Response:
Correct the descriptive text argument and submit the command again.

AMQ7026A principal or group name was invalid.
Severity:
40 : Stop Error

Explanation:
You specified the name of a principal or group which does not exist.

Response:
Correct the name and resubmit the request.

AMQ7027Argument <insert_3> supplied to command <insert_4> is invalid.
Severity:
20 : Error

Explanation:
The argument <insert_3> was supplied to the command <insert_4> which could not be interpreted. This argument is either not accepted by the command, or an extra flag has been included.

Response:
Correct the command and submit it again.

AMQ7028The queue manager is not available for use.
Severity:
40 : Stop Error

Explanation:
You have requested an action that requires the queue manager running, however, the queue manager is not currently running.

Response:
Start the required queue manager and submit the command again.

AMQ7030Quiesce request accepted. The queue manager will stop when all outstanding work is complete.
Severity:
0 : Information

Explanation:
You have requested that the queue manager end when there is no more work for it. In the meantime, it will refuse new applications that attempt to start, although it allows those already running to complete their work.

Response:
None.

AMQ7031The queue manager is stopping.
Severity:
40 : Stop Error

Explanation:
You issued a command that requires the queue manager running, however, it is currently in the process of stopping. The command cannot be run.

Response:
None

AMQ7041Object already exists.
Severity:
40 : Stop Error

Explanation:
A Define Object operation was performed, but the name selected for the object is already in use by an object that is unknown to WebSphere MQ. The object name selected by MQ was <insert_3>, in directory <insert_4>, of object type <insert_5>.

Response:
Remove the conflicting object from the MQ system, then try the operation again.

AMQ7042Media image not available for object <insert_3> of type <insert_4>.
Severity:
20 : Error

Explanation:
The media image for object <insert_3>, type <insert_4>, is not available for media recovery. A log file containing part of the media image cannot be accessed.

Response:
A previous message indicates which log file could not be accessed. Restore a copy of the log file and all subsequent log files from backup. If this is not possible, you must delete the object instead.

AMQ7042 (iSeries)Media image not available for object <insert_3>.
Severity:
20 : Error

Explanation:
The media image for object <insert_3>, type <insert_4>, is not available for media recovery. A log file containing part of the media image cannot be accessed.

Response:
A previous message indicates which log file could not be accessed. Restore a copy of the log file and all subsequent log files from backup. If this is not possible, you must delete the object instead.

AMQ7044Media recovery not allowed.
Severity:
20 : Error

Explanation:
Media recovery is not possible on a queue manager using a circular log. Damaged objects must be deleted on such a queue manager.

Response:
None.

AMQ7047An unexpected error was encountered by a command.
Severity:
40 : Stop Error

Explanation:
An internal error occurred during the processing of a command.

Response:
Follow problem determination procedures to identify the cause of the error.

AMQ7048The queue manager name is either not valid or not known
Severity:
40 : Stop Error

Explanation:
Either the specified queue manager name does not conform to the rules required by WebSphere MQ or the queue manager does not exist. The rules for naming MQ objects are detailed in the WebSphere MQ Command Reference.

Response:
Correct the name and submit the command again.

AMQ7053The transaction has been committed.
Severity:
0 : Information

Explanation:
The prepared transaction has been committed.

Response:
None.

AMQ7054The transaction has been backed out.
Severity:
0 : Information

Explanation:
The prepared transaction has been backed out.

Response:
None.

AMQ7055The transaction number is not recognized.
Severity:
20 : Error

Explanation:
The number of the transaction you supplied was not recognized as belonging to an in-doubt transaction.

Response:
Ensure that you entered a valid transaction number. It is possible that the transaction number you entered corresponds to a transaction which was committed or backed out before you issued the command to resolve it.

AMQ7056Transaction number <insert_1>,<insert_2>.
Severity:
0 : Information

Explanation:
This message is used to report the number of an in-doubt transaction.

Response:
None.

AMQ7059An error has occurred reading an INI file.
Severity:
20 : Error

Explanation:
An error has occurred when reading the MQS.INI file or a queue manager QM.INI file.

Response:
If you have been changing the INI file content check and correct the change. If you have not changed the INI file, use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ7059 (Windows)An error occurred when reading the configuration data.
Severity:
20 : Error

Explanation:
An error has occurred when reading the configuration data.

Response:
If you have changed the configuration data, check and correct the change. If you have not changed the configuration data, use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ7060An error has occurred locking an INI file.
Severity:
20 : Error

Explanation:
An error has occurred locking the MQS.INI file or a queue manager QM.INI file.

Response:
If you have been changing the INI file permissions check and correct the change. If you have not changed the INI file, use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ7060 (Windows)An error has occurred locking the configuration data.
Severity:
20 : Error

Explanation:
An error has occurred locking the configuration data.

Response:
If you have changed the configuration data permissions, check and correct the change. If you have not changed the configuration data, use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ7061An expected stanza in an INI file is missing or contains errors.
Severity:
20 : Error

Explanation:
An expected stanza is missing from the MQS.INI file or a queue manager QM.INI file or the stanza contains errors.

Response:
If you have been changing the INI file content check and correct the change.

AMQ7061 (Windows)An expected stanza in the configuration data is missing or contains errors.
Severity:
20 : Error

Explanation:
An expected stanza is missing from the configuration data or the stanza contains errors.

Response:
If you have changed the configuration data, check and correct the change.

AMQ7062Unable to access an INI file.
Severity:
20 : Error

Explanation:
Access to the MQS.INI file or a queue manager QM.INI file is denied.

Response:
If you have been changing the INI file permissions check and correct the change.

AMQ7062 (Windows)Unable to access the configuration data.
Severity:
20 : Error

Explanation:
Access to the configuration data is denied.

Response:
If you have changed the configuration data permissions, check and correct the change.

AMQ7063An INI file is missing.
Severity:
20 : Error

Explanation:
The MQS.INI file or a queue manager QM.INI file is missing.

Response:
If you have been changing the INI file recover the previous file and retry the operation.

AMQ7063 (Windows)Configuration data is missing.
Severity:
20 : Error

Explanation:
The configuration data for WebSphere MQ is missing.

Response:
If you have changed the configuration data, recover the previous configuration data and retry the operation.

AMQ7064Log path not valid or inaccessible.
Severity:
40 : Stop Error

Explanation:
The supplied log path could not be used by the queue manager. Possible reasons for this include the path not existing, the queue manager not being able to write to the path, or the path residing on a remote device.

Response:
Ensure that the log path exists and that the queue manager has authority to read and write to it. If the queue manager already exists, ensure that the LogPath attribute in the queue manager's configuration file matches the intended log path.

AMQ7064 (iSeries)Auxiliary storage pool identifier not found.
Explanation:
The auxiliary storage pool identifier supplied does not exist on the system and could not be used by the queue manager to create a journal receiver.

Response:
Specify *SYSTEM, or the identifier of an existing auxiliary storage pool and try the request again. You can use WRKDSKSTS to check the assignment of disk units to auxiliary storage pools.

AMQ7065Insufficient space on disk.
Severity:
40 : Stop Error

Explanation:
The operation cannot be completed due to shortage of disk space.

Response:
Either make more disk space available, or reduce the disk requirements of the command you issued.

AMQ7066There are no prepared transactions.
Severity:
10 : Warning

Explanation:
There are no prepared transactions to be resolved.

Response:
None.

AMQ7068Authority file contains an authority stanza that is not valid.
Severity:
40 : Stop Error

Explanation:
A syntax error has been found in one of the files containing authorization information for the queue manager.

Response:
Correct the contents of the incorrect authorization file by editing it.

AMQ7069The queue manager was created successfully, but cannot be made the default.
Severity:
40 : Stop Error

Explanation:
The queue manager was defined to be the default queue manager for the machine when it was created. However, although the queue manager has been created, an error occurred trying to make it the default. There may not be a default queue manager defined for the machine at present.

Response:
There is probably a problem with the machine-wide ini file. Verify the existence of the file, its access permissions, and its contents. If its backup file exists, reconcile the contents of the two files and then delete the backup. Finally, either update the machine-wide ini file by hand to specify the desired default queue manager, or delete and recreate the queue manager.

AMQ7069 (Windows)The queue manager was created successfully, but cannot be made the default.
Severity:
40 : Stop Error

Explanation:
The queue manager was defined to be the default queue manager for the machine when it was created. However, although the queue manager has been created, an error occurred trying to make it the default. There may not be a default queue manager defined for the machine at present.

Response:
There is probably a problem with the configuration data. Update the configuration data to specify the desired default queue manager, or delete and recreate the queue manager.

AMQ7072Invalid QM.INI file stanza. Refer to the error log for more information.
Severity:
40 : Stop Error

Explanation:
An invalid QM.INI file stanza was found. Refer to the error log for more information.

Response:
Correct the error and then retry the operation.

AMQ7072 (Windows)Stanza not valid. Refer to the error log for more information.
Severity:
40 : Stop Error

Explanation:
A stanza that is not valid was found. Refer to the error log for more information.

Response:
Correct the error and retry the operation.

AMQ7073Log size not valid.
Severity:
40 : Stop Error

Explanation:
Either the number of log files or the size of the log files was outside the accepted values.

Response:
Make sure that the log parameters you enter lie within the valid range.

AMQ7074Unknown stanza key <insert_4> on line <insert_1> of ini file <insert_3>.
Severity:
10 : Warning

Explanation:
Line <insert_1> of the configuration file <insert_3> contained a stanza called <insert_3>. This stanza is not recognized.

Response:
Check the contents of the file and retry the operation.

AMQ7074 (iSeries)Unknown stanza key.
Severity:
10 : Warning

Explanation:
Line <insert_1> of the configuration file <insert_3> contained a stanza key <insert_4>. This stanza is not recognized.

Response:
Check the contents of the file and retry the operation.

AMQ7074 (Windows)Unknown stanza key <insert_4> at <insert_3> in the configuration data.
Severity:
10 : Warning

Explanation:
Key <insert_3> contained a stanza called <insert_4>. This stanza is not recognized.

Response:
Check the contents of the configuration data and retry the operation.

AMQ7075Unknown attribute in ini file.
Severity:
10 : Warning

Explanation:
Line <insert_1> of the configuration file <insert_3> contained an attribute called <insert_4> that is not valid. This attribute is not recognized in this context.

Response:
Check the contents of the file and retry the operation.

AMQ7075 (Windows)Unknown attribute <insert_4> at <insert_3> in the configuration data.
Severity:
10 : Warning

Explanation:
Key <insert_3> in the configuration data contained an attribute called <insert_4> that is not valid. This attribute is not recognized in this context.

Response:
Check the contents of the configuration data and retry the operation.

AMQ7076Invalid value for attribute in ini file.
Severity:
10 : Warning

Explanation:
Line <insert_1> of the configuration file <insert_3> contained value <insert_5> that is not valid for the attribute <insert_4>.

Response:
Check the contents of the file and retry the operation.

AMQ7076 (Windows)Value <insert_5> not valid for attribute <insert_4> at <insert_3> in the configuration data.
Severity:
10 : Warning

Explanation:
Key <insert_3> in the configuration data contained value <insert_5> that is not valid for the attribute <insert_4>.

Response:
Check the contents of the configuration data and retry the operation.

AMQ7077You are not authorized to perform the requested operation.
Severity:
40 : Stop Error

Explanation:
You tried to issue a command for the queue manager. You are not authorized to perform the command.

Response:
Contact your system administrator to perform the command for you. Alternatively, request authority to perform the command from your system administrator.

AMQ7078You entered an object type that is invalid with a generic profile name.
Severity:
40 : Stop Error

Explanation:
You entered an object type of *ALL or *MQM and an object name that contains generic characters, this is an invalid combination.

Response:
Correct the command and submit it again.

AMQ7080No objects processed.
Severity:
10 : Warning

Explanation:
No objects were processed, either because no objects matched the criteria given, or because the objects found did not require processing.

Response:
None.

AMQ7081Object <insert_3>, type <insert_4> recreated.
Severity:
0 : Information

Explanation:
The object <insert_3>, type <insert_4> was recreated from its media image.

Response:
None.

AMQ7082Object <insert_3>, type <insert_4> is not damaged.
Severity:
10 : Warning

Explanation:
Object <insert_3>, type <insert_4> cannot be recreated since it is not damaged.

Response:
None

AMQ7083A resource problem was encountered by a command.
Severity:
20 : Error

Explanation:
The command failed due to a resource problem. Possible causes include the log being full or the command running out of memory.

Response:
Look at the previous messages to diagnose the problem. Rectify the problem and retry the operation.

AMQ7084Object <insert_3>, type <insert_4> damaged.
Severity:
20 : Error

Explanation:
The object <insert_3>, type <insert_4> was damaged. The object must be deleted or, if the queue manager supports media recovery, recreated from its media image.

Response:
Delete the object or recreate it from its media image.

AMQ7085Object <insert_3>, type <insert_4> not found.
Severity:
20 : Error

Explanation:
Object <insert_3>, type <insert_4> cannot be found.

Response:
None.

AMQ7086Media image for object <insert_3>, type <insert_4> recorded.
Severity:
0 : Information

Explanation:
The media image for object <insert_3>, type <insert_4>, defined in Queue Manager <insert_5>, has been recorded.

Response:
None.

AMQ7087Object <insert_3>, type <insert_4> is a temporary object
Severity:
20 : Error

Explanation:
Object <insert_3>, type <insert_4> is a temporary object. Media recovery operations are not permitted on temporary objects.

Response:
None.

AMQ7088Object <insert_3>, type <insert_4> in use.
Severity:
20 : Error

Explanation:
Object <insert_3>, type <insert_4> is in use. Either an application has it open or, if it is a local queue, there are uncommitted messages on it.

Response:
Ensure that the object is not opened by any applications, and that there are no uncommitted messages on the object, if it is a local queue. Then, retry the operation.

AMQ7089Media recovery already in progress.
Severity:
20 : Error

Explanation:
Another media recovery operation is already in progress. Only one media recovery operation is permitted at a time.

Response:
Wait for the existing media recovery operation to complete and retry the operation.

AMQ7090 (iSeries)The queue manager CCSID is not valid.
Severity:
40 : Stop Error

Explanation:
The CCSID to be used by the QMGR is not valid for the iSeries platform. The CCSID encoding must be a valid EBCDIC value.

Response:
Check that the CCSID that you have entered is a valid EBCDIC value.

AMQ7090 (Windows)The queue manager CCSID is not valid.
Severity:
40 : Stop Error

Explanation:
The CCSID to be used by the QMGR is not valid, because: 
1) It is a DBCS CCSID. 
2) The CCSID encoding is not ASCII or ASCII related. EBCDIC or UCS2 encodings are not valid on this machine. 
3) The CCSID encoding is unknown.

Response:
Check the CCSID is valid for the machine on which you are working.

AMQ7091You are performing authorization for the queue manager, but you specified an object name.
Severity:
40 : Stop Error

Explanation:
Modification of authorizations for a queue manager can be performed only from that queue manager. You must not specify an object name.

Response:
Correct the command and submit it again.

AMQ7092An object name is required but you did not specify one.
Severity:
40 : Stop Error

Explanation:
The command needs the name of an object, but you did not specify one.

Response:
Correct the command and submit it again.

AMQ7093An object type is required but you did not specify one.
Severity:
40 : Stop Error

Explanation:
The command needs the type of the object, but you did not specify one.

Response:
Correct the command and submit it again.

AMQ7094You specified an object type that is not valid, or more than one object type.
Severity:
40 : Stop Error

Explanation:
Either the type of object you specified was not valid, or you specified multiple object types on a command which supports only one.

Response:
Correct the command and submit it again.

AMQ7095An entity name is required but you did not specify one.
Severity:
40 : Stop Error

Explanation:
The command needs one or more entity names, but you did not specify any. Entities can be principals or groups.

Response:
Correct the command and submit it again.

AMQ7096An authorization specification is required but you did not provide one.
Severity:
40 : Stop Error

Explanation:
The command sets the authorizations on WebSphere MQ objects. However you did not specify which authorizations are to be set.

Response:
Correct the command and submit it again.

AMQ7097You gave an authorization specification that is not valid.
Severity:
40 : Stop Error

Explanation:
The authorization specification you provided to the command contained one or more items that could not be interpreted.

Response:
Correct the command and submit it again.

AMQ7098The command accepts only one entity name. You specified more than one.
Severity:
40 : Stop Error

Explanation:
The command can accept only one principal or group name. You specified more than one.

Response:
Correct the command and submit it again.

AMQ7099Entity <insert_3> has the following authorizations for object <insert_4>:
Severity:
0 : Information

Explanation:
Informational message. The list of authorizations follows.

Response:
None.

AMQ7104Resource manager <insert_1> has prepared.
Severity:
0 : Information

Explanation:
This message reports the state of a resource manager with respect to an in-doubt transaction.

Response:
None.

AMQ7105Resource manager <insert_1> has committed.
Severity:
0 : Information

Explanation:
This message reports the state of a resource manager with respect to an in-doubt transaction.

Response:
None.

AMQ7106Resource manager <insert_1> has rolled back.
Severity:
0 : Information

Explanation:
This message reports the state of a resource manager with respect to an in-doubt transaction.

Response:
None.

AMQ7107Resource manager <insert_1> is <insert_3>.
Severity:
0 : Information

Explanation:
This message reports the identification number and name of a resource manager.

Response:
None.

AMQ7108Any in-doubt transactions have been resolved.
Severity:
0 : Information

Explanation:
All, if there were any, of the internally coordinated transactions which were in-doubt, have now been resolved. This message reports successful completion of the RSVMQTRN command when the -a option is used.

Response:
None.

AMQ7108 (iSeries)Any in-doubt transactions have been resolved.
Severity:
0 : Information

Explanation:
All, if there were any, of the internally coordinated transactions which were in-doubt, have now been resolved.

Response:
None.

AMQ7109A decision on behalf of the unavailable resource manager has been delivered.
Severity:
0 : Information

Explanation:
A decision for an internally coordinated transaction which was in-doubt, has now been delivered on behalf of the unavailable resource manager. This message reports successful completion of the RSVMQTRN command when the -r option is used.

Response:
None.

AMQ7110Media image for the syncfile recorded.
Severity:
0 : Information

Explanation:
The media image for the syncfile has been recorded.

Response:
None.

AMQ7111Resource manager <insert_1> has participated.
Severity:
0 : Information

Explanation:
This message reports the state of a resource manager with respect to an in-doubt transaction.

Response:
None.

AMQ7112Transaction number <insert_1>,<insert_2> has encountered an error.
Severity:
0 : Information

Explanation:
This message is used to report the number of an in-doubt transaction which has encountered an error with one or more resource managers.

Response:
Refer to the queue manager error log for more information about which resource managers are in error. Ensure that the resource managers that were in error, are working correctly, restart the queue manager. If the problem persists, use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ7113The Database Name argument, -rn, is missing from the command crtmqm
Severity:
20 : Error

Explanation:
The required flag, -rn, was omitted from the command crtmqm

Response:
Add the flag and associated database name and submit it again.

AMQ7114The Database Password argument, -rp, is missing from the command crtmqm
Severity:
20 : Error

Explanation:
The required flag, -rp, was omitted from the command crtmqm

Response:
Add the flag and associated database password and submit it again.

AMQ7115The Database Type argument, -rt, is missing from the command crtmqm
Severity:
20 : Error

Explanation:
The required flag, -rt, was omitted from the command crtmqm

Response:
Add the flag and associated database type and submit it again

AMQ7116The Database Type argument, -rt, is greater than 8 characters long
Severity:
20 : Error

Explanation:
The argument supplied with the flag -rt, is greater than 8 characters long

Response:
Reduce the length of the database type argument and submit it again

AMQ7117The MSD shared library failed to load.
Severity:
20 : Error

Explanation:
The MSD shared library was either not located or failed to load correctly.

Response:
Ensure that the database type is specified correctly when creating a queue manager since this is used to form the name of the shared library to be loaded. Further information on the failure may be found in the FFST logs. Also, ensure that ensure that the MSD shared library is installed correctly.

AMQ7120The Trial Period license for this copy of WebSphere MQ has expired.
Severity:
20 : Error

Explanation:
This copy of WebSphere MQ was licensed to be used in trial mode for a limited period only. This period has expired.

Response:
Install a Production license for this copy of WebSphere MQ.

AMQ7121The trial period for this copy of WebSphere MQ has now expired.
Severity:
20 : Error

Explanation:
This copy of WebSphere MQ was licensed for a limited period only. This period has now expired.

Response:
Install a Production license for this copy of WebSphere MQ.

AMQ7122The Trial Period License Agreement was not accepted.
Severity:
10 : Warning

Explanation:
When the Trial Period License Agreement is displayed, the user must accept it before this copy of WebSphere MQ can be used.

Response:
Submit the command again and accept the agreement.

AMQ7123There is one day left in the trial period for this copy of WebSphere MQ.
Severity:
0 : Information

Explanation:
This copy of WebSphere MQ is licensed for a limited period only.

Response:
None.

AMQ7124This is the final day of the trial period for this copy of WebSphere MQ.
Severity:
10 : Warning

Explanation:
This copy of WebSphere MQ is licensed for a limited period only.

Response:
Install a Production license for this copy of WebSphere MQ.

AMQ7125There are <insert_1> days left in the trial period for this copy of WebSphere MQ.
Severity:
0 : Information

Explanation:
This copy of WebSphere MQ is licensed for a limited period only.

Response:
None.

AMQ7126This copy of WebSphere MQ is now running in Production mode.
Severity:
0 : Information

Explanation:
A Production license has been installed for this copy of WebSphere MQ.

Response:
None.

AMQ7127
Press Enter when you have read the messages
Severity:
0 : Information

Explanation:
One or more messages have been displayed. They will disappear when the user presses the Enter key.

Response:
Press the Enter key when the messages are no longer required.

AMQ7128No license installed for this copy of WebSphere MQ.
Severity:
20 : Error

Explanation:
The installation of WebSphere MQ is invalid since no Production, Beta, or Trial Period license has been installed.

Response:
Check that the installation steps described in the Quick Beginnings book have been followed, and if the problem persists contact your IBM service representative.

AMQ7129The trial period for this copy of WebSphere MQ has already been started.
Severity:
0 : Information

Explanation:
This copy of WebSphere MQ is licensed for a limited period only and the trial period has been started previously.

Response:
None.

AMQ7130This copy of WebSphere MQ is running in Production mode.
Severity:
0 : Information

Explanation:
A Production license has been installed for this copy of WebSphere MQ. A beta or trial period cannot be started.

Response:
None.

AMQ7131International License Agreement for Evaluation of Programs 
Part 1 - General Terms 
PLEASE READ THIS AGREEMENT CAREFULLY BEFORE USING THE PROGRAM. IBM WILL LICENSE THE PROGRAM TO YOU ONLY IF YOU FIRST ACCEPT THE TERMS OF THIS AGREEMENT. BY USING THE PROGRAM YOU AGREE TO THESE TERMS. IF YOU DO NOT AGREE TO THE TERMS OF THIS AGREEMENT, PROMPTLY RETURN THE UNUSED PROGRAM TO IBM.
Severity:
0 : Information

Explanation:
This is part of the Trial Period License Agreement which must be accepted before a trial period can be started. A trial period allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7132
The Program is owned by International Business Machines Corporation or one of its subsidiaries (IBM) or an IBM supplier, and is copyrighted and licensed, not sold. 
The term "Program" means the original program and all whole or partial copies of it. A Program consists of machine-readable instructions, its components, data, audio-visual content (such as images, text, recordings, or pictures), and related licensed materials.
Severity:
0 : Information

Explanation:
This is part of the Trial Period License Agreement which must be accepted before a trial period can be started. A trial period allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7133
This Agreement includes Part 1 - General Terms and Part 2 - Country Unique Terms and is the complete agreement regarding the use of this Program, and replaces any prior oral or written communications between you and IBM. The terms of Part 2 may replace or modify those of Part 1.
Severity:
0 : Information

Explanation:
This is part of the Trial Period License Agreement which must be accepted before a trial period can be started. A trial period allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7134
1. License 
Use of the Program 
IBM grants you a nonexclusive, nontransferable license to use the Program. 
You may 1) use the Program only for internal evaluation, testing or demonstration purposes, on a trial or "try-and-buy" basis and 2) make and install a reasonable number of copies of the Program in support of such use, unless IBM identifies a specific number of copies in the documentation accompanying the Program. The terms of this license apply to each copy you make. You will reproduce the copyright notice and any other legends of ownership on each copy, or partial copy, of the Program.
Severity:
0 : Information

Explanation:
This is part of the Trial Period License Agreement which must be accepted before a trial period can be started. A trial period allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7135
THE PROGRAM MAY CONTAIN A DISABLING DEVICE THAT WILL PREVENT IT FROM BEING USED UPON EXPIRATION OF THIS LICENSE. YOU WILL NOT TAMPER WITH THIS DISABLING DEVICE OR THE PROGRAM. YOU SHOULD TAKE PRECAUTIONS TO AVOID ANY LOSS OF DATA THAT MIGHT RESULT WHEN THE PROGRAM CAN NO LONGER BE USED.
Severity:
0 : Information

Explanation:
This is part of the Trial Period License Agreement which must be accepted before a trial period can be started. A trial period allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7136
You will 1) maintain a record of all copies of the Program and 2) ensure that anyone who uses the Program does so only for your authorized use and in compliance with the terms of this Agreement. 
You may not 1) use, copy, modify or distribute the Program except as provided in this Agreement; 2) reverse assemble, reverse compile, or otherwise translate the Program except as specifically permitted by law without the possibility of contractual waiver; or 3) sublicense, rent or lease the Program.
Severity:
0 : Information

Explanation:
This is part of the Trial Period License Agreement which must be accepted before a trial period can be started. A trial period allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7137
This license begins with your first use of the Program and ends 1) as of the duration or date specified in the documentation accompanying the Program or 2) when the Program automatically disables itself. Unless IBM specifies in the documentation accompanying the Program that you may retain the Program (in which case, an additional charge may apply), you will destroy the Program and all copies made of it within ten days of when this license ends.
Severity:
0 : Information

Explanation:
This is part of the Trial Period License Agreement which must be accepted before a trial period can be started. A trial period allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7138
2. No Warranty 
SUBJECT TO ANY STATUTORY WARRANTIES WHICH CANNOT BE EXCLUDED, IBM MAKES NO WARRANTIES OR CONDITIONS EITHER EXPRESS OR IMPLIED, INCLUDING WITHOUT LIMITATION, THE WARRANTY OF NON-INFRINGEMENT AND THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE, REGARDING THE PROGRAM OR TECHNICAL SUPPORT, IF ANY. IBM MAKES NO WARRANTY REGARDING THE CAPABILITY OF THE PROGRAM TO CORRECTLY PROCESS, PROVIDE AND/OR RECEIVE DATE DATA WITHIN AND BETWEEN THE 20TH AND 21ST CENTURIES. 
This exclusion also applies to any of IBM's subcontractors, suppliers or program developers (collectively called "Suppliers"). 
Manufacturers, suppliers, or publishers of non-IBM Programs may provide their own warranties.
Severity:
0 : Information

Explanation:
This is part of the Trial Period License Agreement which must be accepted before a trial period can be started. A trial period allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7139
3. Limitation of Liability 
NEITHER IBM NOR ITS SUPPLIERS ARE LIABLE FOR ANY DIRECT OR INDIRECT DAMAGES, INCLUDING WITHOUT LIMITATION, LOST PROFITS, LOST SAVINGS, OR ANY INCIDENTAL, SPECIAL, OR OTHER ECONOMIC CONSEQUENTIAL DAMAGES, EVEN IF IBM IS INFORMED OF THEIR POSSIBILITY. SOME JURISDICTIONS DO NOT ALLOW THE EXCLUSION OR LIMITATION OF INCIDENTAL OR CONSEQUENTIAL DAMAGES, SO THE ABOVE EXCLUSION OR LIMITATION MAY NOT APPLY TO YOU.
Severity:
0 : Information

Explanation:
This is part of the Trial Period License Agreement which must be accepted before a trial period can be started. A trial period allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7140
4. General 
Nothing in this Agreement affects any statutory rights of consumers that cannot be waived or limited by contract.
Severity:
0 : Information

Explanation:
This is part of the Trial Period License Agreement which must be accepted before a trial period can be started. A trial period allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7141
IBM may terminate your license if you fail to comply with the terms of this Agreement. If IBM does so, you must immediately destroy the Program and all copies you made of it. 
You may not export the Program. 
Neither you nor IBM will bring a legal action under this Agreement more than two years after the cause of action arose unless otherwise provided by local law without the possibility of contractual waiver or limitation. 
Neither you nor IBM is responsible for failure to fulfill any obligations due to causes beyond its control. 
There is no additional charge for use of the Program for the duration of this license. 
IBM does not provide program services or technical support, unless IBM specifies otherwise.
Severity:
0 : Information

Explanation:
This is part of the Trial Period License Agreement which must be accepted before a trial period can be started. A trial period allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7142Reply 'yes' to accept the Agreement. Reply 'no' if you do not agree to the terms of the Agreement. Reply 'no' and submit the command again, if you want to read the Agreement again.
Severity:
0 : Information

Explanation:
The Trial Period License Agreement has been displayed to the user and the user should now accept or reject the Agreement.

Response:
Reply 'yes' or 'no' and press 'Enter'.

AMQ7143
Press Enter to continue
Severity:
0 : Information

Explanation:
Part of the Trial Period License Agreement has been displayed to the user. The user should press the Enter key to indicate that they are ready for the next part of the Agreement to be displayed.

Response:
Press the Enter key when ready for the next part of the Agreement to be displayed.

AMQ7144
The laws of the country in which you acquire the Program govern this Agreement, except 1) in Australia, the laws of the State or Territory in which the transaction is performed govern this Agreement; 2) in Albania, Armenia, Belarus, Bosnia/Herzegovina, Bulgaria, Croatia, Czech Republic, Georgia, Hungary, Kazakhstan, Kirghizia, Former Yogoslav Republic of Macedonia (FYROM), Moldova, Poland, Romania, Russia, Slovak Republic, Slovenia, Ukraine, and Federal Republic of Yugoslavia, the laws of Austria govern this Agreement; 3) in the United Kingdom, all disputes relating to this Agreement will be governed by English law and will be submitted to the exclusive jurisdiction of the English courts; 4) in Canada, the laws of the Province of Ontario govern this Agreement; and 5) in the United States and Puerto Rico, and People's Republic of China, the laws of the State of New York govern this Agreement.
Severity:
0 : Information

Explanation:
This is part of the Trial Period License Agreement which must be accepted before a trial period can be started. A trial period allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7145
Part 2 - Country Unique Terms 
AUSTRALIA: 
No Warranty (Section 2): 
The following paragraph is added to this Section: 
Although IBM specifies that there are no warranties, you may have certain rights under the Trade Practices Act 1974 or other legislation and are only limited to the extent permitted by the applicable legislation. 
Limitation of Liability (Section 3): 
The following paragraph is added to this Section:
Severity:
0 : Information

Explanation:
This is part of the Trial Period License Agreement which must be accepted before a trial period can be started. A trial period allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7146
Where IBM is in breach of a condition or warranty implied by the Trade Practices Act 1974, IBM's liability is limited to the repair or replacement of the goods, or the supply of equivalent goods. Where that condition or warranty relates to right to sell, quiet possession or clear title, or the goods are of a kind ordinarily acquired for personal, domestic or household use or consumption, then none of the limitations in this paragraph apply.
Severity:
0 : Information

Explanation:
This is part of the Trial Period License Agreement which must be accepted before a trial period can be started. A trial period allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7147
NEW ZEALAND: 
No Warranty (Section 2): 
The following paragraph is added to this Section: 
Although IBM specifies that there are no warranties, you may have certain rights under the Consumer Guarantees Act 1993 or other legislation which cannot be excluded or limited. The Consumer Guarantees Act 1993 will not apply in respect of any goods or services which IBM provides, if you require the goods and services for the purposes of a business as defined in the Act.
Severity:
0 : Information

Explanation:
This is part of the Trial Period License Agreement which must be accepted before a trial period can be started. A trial period allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7148
Limitation of Liability (Section 3): 
The following paragraph is added to this Section: 
Where products or services are not acquired for the purposes of a business as defined in the Consumer Guarantees Act 1993, the limitations in this Section are subject to the limitations in that Act.
Severity:
0 : Information

Explanation:
This is part of the Trial Period License Agreement which must be accepted before a trial period can be started. A trial period allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7149
GERMANY: No Warranty (Section 2): 
The following paragraphs are added to this Section: 
The minimum warranty period for Programs is six months. 
In case a Program is delivered without specifications, we will only warrant that the Program information correctly describes the Program and that the Program can be used according to the Program information. You have to check the usability according to the Program information within the "money-back guaranty" period. 
Limitation of Liability (Section 3): 
The following paragraph is added to this Section: 
The limitations and exclusions specified in the Agreement will not apply to damages caused by IBM with fraud or gross negligence, and for express warranty.
Severity:
0 : Information

Explanation:
This is part of the Trial Period License Agreement which must be accepted before a trial period can be started. A trial period allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7150
INDIA: 
General (Section 4): 
The following replaces the fourth paragraph of this Section: 
If no suit or other legal action is brought, within two years after the cause of action arose, in respect of any claim that either party may have against the other, the rights of the concerned party in respect of such claim will be forfeited and the other party will stand released from its obligations in respect of such claim.
Severity:
0 : Information

Explanation:
This is part of the Trial Period License Agreement which must be accepted before a trial period can be started. A trial period allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7151
IRELAND: 
No Warranty (Section 2): 
The following paragraph is added to this Section: 
Except as expressly provided in these terms and conditions, all statutory conditions, including all warranties implied, but without prejudice to the generality of the foregoing all warranties implied by the Sale of Goods Act 1893 or the Sale of Goods and Supply of Services Act 1980 are hereby excluded. 
ITALY: 
Limitation of Liability (Section 3): 
This section is replaced by the following: 
Unless otherwise provided by mandatory law, IBM is not liable for any damages which might arise.
Severity:
0 : Information

Explanation:
This is part of the Trial Period License Agreement which must be accepted before a trial period can be started. A trial period allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7152
UNITED KINGDOM: 
Limitation of Liability (Section 3): 
The following paragraph is added to this Section at the end of the first paragraph: 
The limitation of liability will not apply to any breach of IBM's obligations implied by Section 12 of the Sales of Goods Act 1979 or Section 2 of the Supply of Goods and Services Act 1982.
Severity:
0 : Information

Explanation:
This is part of the Trial Period License Agreement which must be accepted before a trial period can be started. A trial period allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7153A license could not be installed for this copy of WebSphere MQ.
Severity:
20 : Error

Explanation:
A Production, Beta or Trial Period license could not be installed for this copy of WebSphere MQ. This is because the 'nodelock' file in the 'qmgrs/@SYSTEM' directory could not be created or updated.

Response:
Check the ownership and permissions of the 'qmgrs/@SYSTEM' directory.

AMQ7154The Production license for this copy of WebSphere MQ has expired.
Severity:
20 : Error

Explanation:
The production license for this copy of WebSphere MQ has an expiry date. This date has been passed.

Response:
Contact your IBM support center.

AMQ7155License file <insert_3> not found or not valid.
Severity:
20 : Error

Explanation:
The program requires that the file <insert_3> is present, available and is a valid license file.

Response:
Check that the installation steps described in the Quick Beginnings book have been followed, and if the problem persists contact your IBM service representative.

AMQ7156This copy of WebSphere MQ is already running in Production mode.
Severity:
0 : Information

Explanation:
A Production license has previously been installed for this copy of WebSphere MQ.

Response:
None.

AMQ7157The Production license is not valid for this copy of WebSphere MQ.
Severity:
20 : Error

Explanation:
The license <insert_3> has been installed but it is not a valid production license for this copy of WebSphere MQ.

Response:
Submit the SETMQPRD command again specifying the name of a valid production license.

AMQ7158The Trial Period license is not valid for this copy of WebSphere MQ.
Severity:
20 : Error

Explanation:
The license <insert_3> has been installed but it is not a valid trial period license for this copy of WebSphere MQ.

Response:
Check that the correct version of the file is available.

AMQ7159A FASTPATH application has ended unexpectedly.
Severity:
10 : Warning

Explanation:
A FASTPATH application has ended in a way which did not allow the queue manager to clean up the resources owned by that application. Any resources held by the application can only be released by stopping and restarting the queue manager.

Response:
Investigate why the application ended unexpectedly. Avoid ending FASTPATH applications in a way which prevents WebSphere MQ from releasing resources held by the application.

AMQ7160Queue Manager Object
Severity:
0 : Information

AMQ7161Object Catalogue
Severity:
0 : Information

AMQ7162The setmqaut command completed successfully.
Severity:
0 : Information

AMQ7163 (iSeries)WebSphere MQ job <insert_2> started for <insert_3>.
Severity:
0 : Information

Explanation:
The job's PID is <insert_2> the CCSID is <insert_1>. The job name is <insert_4>.

Response:
None

AMQ7164 (iSeries)WebSphere MQ is waiting for a job to start.
Severity:
0 : Information

Explanation:
WebSphere MQ has been waiting <insert_1> seconds to start job <insert_3> for Queue Manager: <insert_5>

Response:
Check that the job queue that is associated with job description <insert_4> is not held, and that the subsystem that is associated with the job queue is active.

AMQ7165The Beta license for this copy of WebSphere MQ has expired.
Severity:
20 : Error

Explanation:
This copy of WebSphere MQ was licensed to be used for Beta testing for a limited period only. This period has expired.

Response:
Install a Production license for this copy of WebSphere MQ.

AMQ7166The Beta period for this copy of WebSphere MQ has now expired.
Severity:
20 : Error

Explanation:
This copy of WebSphere MQ was licensed for a limited period only. This period has now expired.

Response:
Install a Production license for this copy of WebSphere MQ.

AMQ7167The 'Early Release of Programs License Agreement' was not accepted.
Severity:
10 : Warning

Explanation:
When the IBM International License Agreement for Early Release of Programs is displayed, the user must accept it before this copy of WebSphere MQ can be used.

Response:
Submit the command again and accept the agreement.

AMQ7168There is one day left in the Beta test period for this copy of WebSphere MQ.
Severity:
0 : Information

Explanation:
This copy of WebSphere MQ is licensed for a limited period only.

Response:
None.

AMQ7169This is the final day of the Beta test period for this copy of WebSphere MQ.
Severity:
10 : Warning

Explanation:
This copy of WebSphere MQ is licensed for a limited period only.

Response:
Install a Production license for this copy of WebSphere MQ.

AMQ7170 (iSeries)Option is not valid for this transaction.
Severity:
20 : Error

Explanation:
The Resolve option is not valid for external transactions. The Commit and Backout options are not valid for internal transactions.

Response:
Select a different option for this transaction.

AMQ7171IBM International License Agreement for Early Release of Programs 
Part 1 - General Terms 
PLEASE READ THIS AGREEMENT CAREFULLY BEFORE USING THE PROGRAM. IBM WILL LICENSE THE PROGRAM TO YOU ONLY IF YOU FIRST ACCEPT THE TERMS OF THIS AGREEMENT. BY USING THE PROGRAM YOU AGREE TO THESE TERMS. IF YOU DO NOT AGREE TO THE TERMS OF THIS AGREEMENT, PROMPTLY RETURN THE UNUSED PROGRAM TO IBM.
Severity:
0 : Information

Explanation:
This is part of the Early Release of Programs License Agreement which must be accepted before a Beta test period can be started. A Beta test version allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7172
The Program is owned by International Business Machines Corporation or one of its subsidiaries (IBM) or an IBM supplier, and is copyrighted and licensed, not sold. 
The term "Program" means the original program and all whole or partial copies of it. A Program consists of machine-readable instructions, its components, data, audio-visual content (such as images, text, recordings, or pictures), and related licensed materials.
Severity:
0 : Information

Explanation:
This is part of the Early Release of Programs License Agreement which must be accepted before a Beta test period can be started. A Beta test version allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7173
The term "Early Release" means that the Program is not formally released or generally available. The term does not imply that the Program will be formally released or made generally available. IBM does not guarantee that a Program formally released or made generally available will be similar to, or compatible with, Early Release versions. 
THIS AGREEMENT INCLUDES PART 1 - GENERAL TERMS AND PART 2 - COUNTRY-UNIQUE TERMS AND IS THE COMPLETE AGREEMENT REGARDING THE USE OF THIS PROGRAM, AND REPLACES ANY PRIOR ORAL OR WRITTEN COMMUNICATIONS BETWEEN YOU AND IBM. THE TERMS OF PART 2 MAY REPLACE OR MODIFY THOSE OF PART 1.
Severity:
0 : Information

Explanation:
This is part of the Early Release of Programs License Agreement which must be accepted before a Beta test period can be started. A Beta test version allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7174
1.License 
Use of the Program 
IBM grants you a nonexclusive, nontransferable license to use the Program. 
You may 
1) use the Program only for internal evaluation or testing purposes and 
2) make and install a reasonable number of copies of the Program in support of such use, unless IBM identifies a specific number of copies in the documentation accompanying the Program. The terms of this license apply to each copy you make. You will reproduce the copyright notice and any other legends of ownership on each copy, or partial copy, of the Program.
Severity:
0 : Information

Explanation:
This is part of the Early Release of Programs License Agreement which must be accepted before a Beta test period can be started. A Beta test version allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7175
THE PROGRAM MAY CONTAIN A DISABLING DEVICE THAT WILL PREVENT IT FROM BEING USED UPON EXPIRATION OF THIS LICENSE. YOU WILL NOT TAMPER WITH THIS DISABLING DEVICE OR THE PROGRAM. YOU SHOULD TAKE PRECAUTIONS TO AVOID ANY LOSS OF DATA THAT MIGHT RESULT WHEN THE PROGRAM CAN NO LONGER BE USED. 
You will 
1) maintain a record of all copies of the Program and 
2) ensure that anyone who uses the Program does so only for your authorized use and in compliance with the terms of this Agreement.
Severity:
0 : Information

Explanation:
This is part of the Early Release of Programs License Agreement which must be accepted before a Beta test period can be started. A Beta test version allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7176
You may not 
1) use, copy, modify, or distribute the Program except as provided in this Agreement; 
2) reverse assemble, reverse compile, or otherwise translate the Program except as specifically permitted by law without the possibility of contractual waiver; or 
3) sublicense, rent, or lease the Program.
Severity:
0 : Information

Explanation:
This is part of the Early Release of Programs License Agreement which must be accepted before a Beta test period can be started. A Beta test version allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7177
This license begins with your first use of the Program and ends 
1) as of the duration or date specified in the documentation accompanying the Program, 
2) when the Program automatically disables itself, or 
3) when IBM makes the Program generally available. Unless IBM specifies in the documentation accompanying the the Program that you may retain the Program (in which case, an additional charge may apply), you will destroy the Program and all copies made of it within ten days of when this license ends.
Severity:
0 : Information

Explanation:
This is part of the Early Release of Programs License Agreement which must be accepted before a Beta test period can be started. A Beta test version allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7178
2.No Warranty 
SUBJECT TO ANY STATUTORY WARRANTIES WHICH CANNOT BE EXCLUDED, IBM MAKES NO WARRANTIES OR CONDITIONS EITHER EXPRESS OR IMPLIED, INCLUDING WITHOUT LIMITATION, THE WARRANTY OF NON-INFRINGEMENT AND THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE, REGARDING THE PROGRAM OR TECHNICAL SUPPORT, IF ANY.. IBM MAKES NO WARRANTY REGARDING THE CAPABILITY OF THE PROGRAM TO CORRECTLY PROCESS, PROVIDE AND/OR RECEIVE DATE DATA WITHIN AND BETWEEN THE 20TH AND 21ST CENTURIES. 
This exclusion also applies to any of IBM's subcontractors, suppliers or program developers (collectively called "Suppliers"). 
Manufacturers, suppliers, or publishers of non-IBM Programs may provide their own warranties.
Severity:
0 : Information

Explanation:
This is part of the Early Release of Programs License Agreement which must be accepted before a Beta test period can be started. A Beta test version allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7179
3.Limitation of Liability 
NEITHER IBM NOR ITS SUPPLIERS ARE LIABLE FOR ANY DIRECT OR INDIRECT DAMAGES, INCLUDING WITHOUT LIMITATION, LOST PROFITS, LOST SAVINGS, OR ANY INCIDENTAL, SPECIAL, OR OTHER ECONOMIC CONSEQUENTIAL DAMAGES, EVEN IF IBM IS INFORMED OF THEIR POSSIBILITY. SOME JURISDICTIONS DO NOT ALLOW THE EXCLUSION OR LIMITATION OF INCIDENTAL OR CONSEQUENTIAL DAMAGES, SO THE ABOVE EXCLUSION OR LIMITATION MAY NOT APPLY TO YOU. 
4.Rights In Data 
You hereby assign to IBM all right, title, and interest (including ownership of copyright) in any data, suggestions, and written materials related to your use of the Program you provide to IBM. If IBM requires it, you will sign an appropriate document to assign such rights.
Severity:
0 : Information

Explanation:
This is part of the Early Release of Programs License Agreement which must be accepted before a Beta test period can be started. A Beta test version allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7180
5.General 
Nothing in this Agreement affects any statutory rights of consumers that cannot be waived or limited by contract. 
IBM may terminate your license if you fail to comply with the terms of this Agreement. If IBM does so, you must immediately destroy the Program and all copies you made of it. 
You may not export the Program.
Severity:
0 : Information

Explanation:
This is part of the Early Release of Programs License Agreement which must be accepted before a Beta test period can be started. A Beta test version allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7181
Neither you nor IBM will bring a legal action under this Agreement more than two years after the cause of action arose unless otherwise provided by local law without the possibility of contractual waiver or limitation. 
Neither you nor IBM is responsible for failure to fulfill any obligations due to causes beyond its control. 
There is no additional charge for use of the Program for the duration of this license. 
Neither of us will charge the other for rights in data or any work performed as a result of this Agreement. 
IBM does not provide program services or technical support, unless IBM specifies otherwise.
Severity:
0 : Information

Explanation:
This is part of the Early Release of Programs License Agreement which must be accepted before a Beta test period can be started. A Beta test version allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7182
The laws of the country in which you acquire the Program govern this Agreement, except 
1) in Australia, the laws of the State or Territory in which the transaction is performed govern this Agreement; 
2) in Albania, Armenia, Belarus, Bosnia/Herzegovina, Bulgaria, Croatia, Czech Republic, Georgia, Hungary, Kazakhstan, Kirghizia, Former Yugoslav Republic of Macedonia (FYROM), Moldova, Poland, Romania, Russia, Slovak Republic, Slovenia, Ukraine, and Federal Republic of Yugoslavia, the laws of Austria govern this Agreement; 
3) in the United Kingdom, all disputes relating to this Agreement will be governed by English Law and will be submitted to the exclusive jurisdiction of the English courts; 
4) in Canada, the laws of the Province of Ontario govern this Agreement; and 
5) in the United States and Puerto Rico, and People's Republic of China, the laws of the State of New York govern this Agreement.
Severity:
0 : Information

Explanation:
This is part of the Early Release of Programs License Agreement which must be accepted before a Beta test period can be started. A Beta test version allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7183
Part 2 - Country-unique Terms 
AUSTRALIA: No Warranty (Section 2): The following paragraph is added to this Section: Although IBM specifies that there are no warranties, you may have certain rights under the Trade Practices Act 1974 or other legislation and are only limited to the extent permitted by the applicable legislation. 
Limitation of Liability (Section 3): The following paragraph is added to this Section: Where IBM is in breach of a condition or warranty implied by the Trade Practices Act 1974, IBM's liability is limited to the repair or replacement of the goods, or the supply of equivalent goods. Where that condition or warranty relates to right to sell, quiet possession or clear title, or the goods are of a kind ordinarily acquired for personal, domestic or household use or consumption, then none of the limitations in this paragraph apply.
Severity:
0 : Information

Explanation:
This is part of the Early Release of Programs License Agreement which must be accepted before a Beta test period can be started. A Beta test version allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7184
GERMANY: No Warranty (Section 2): The following paragraphs are added to this Section: The minimum warranty period for Programs is six months. In case a Program is delivered without Specifications, IBM will only warrant that the Program information correctly describes the Program and that the Program can be used according to the Program information. You have to check the usability according to the Program information within the "money-back guaranty" period. 
Limitation of Liability (Section 3): The following paragraph is added to this Section: The limitations and exclusions specified in the Agreement will not apply to damages caused by IBM with fraud or gross negligence, and for express warranty.
Severity:
0 : Information

Explanation:
This is part of the Early Release of Programs License Agreement which must be accepted before a Beta test period can be started. A Beta test version allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7185
INDIA: General (Section 5): The following replaces the fourth paragraph of this Section: If no suit or other legal action is brought, within two years after the cause of action arose, in respect of any claim that either party may have against the other, the rights of the concerned party in respect of such claim will be forfeited and the other party will stand released from its obligations in respect of such claim.
Severity:
0 : Information

Explanation:
This is part of the Early Release of Programs License Agreement which must be accepted before a Beta test period can be started. A Beta test version allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7186
IRELAND: No Warranty (Section 2): The following paragraph is added to this Section: Except as expressly provided in these terms and conditions, all statutory conditions, including all warranties implied, but without prejudice to the generality of the foregoing, all warranties implied by the Sale of Goods Act 1893 or the Sale of Goods and Supply of Services Act 1980 are hereby excluded.
Severity:
0 : Information

Explanation:
This is part of the Early Release of Programs License Agreement which must be accepted before a Beta test period can be started. A Beta test version allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7187
ITALY: Limitation of Liability (Section 3): This Section is replaced by the following: Unless otherwise provided by mandatory law, IBM is not liable for any damages which might arise.
Severity:
0 : Information

Explanation:
This is part of the Early Release of Programs License Agreement which must be accepted before a Beta test period can be started. A Beta test version allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7188
JAPAN: Rights In Data (Section 4): The following paragraph is added to this Section: You also agree to assign to IBM the rights regarding derivative works, as defined in Articles 27 and 28 of the Japanese Copyright Law. You also agree not to exercise your moral rights.
Severity:
0 : Information

Explanation:
This is part of the Early Release of Programs License Agreement which must be accepted before a Beta test period can be started. A Beta test version allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7189
NEW ZEALAND: No Warranty (Section 2): The following paragraph is added to this Section: Although IBM specifies that there are no warranties, you may have certain rights under the Consumer Guarantees Act 1993 or other legislation which cannot be excluded or limited. The Consumer Guarantees Act 1993 will not apply in respect of any goods or services which IBM provides, if you require the goods and services for the purposes of a business as defined in that Act. 
Limitation of Liability (Section 3): The following paragraph is added to this Section: Where Programs are not acquired for the purposes of a business as defined in the Consumer Guarantees Act 1993, the limitations in this Section are subject to the limitations in that Act.
Severity:
0 : Information

Explanation:
This is part of the Early Release of Programs License Agreement which must be accepted before a Beta test period can be started. A Beta test version allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7190
UNITED KINGDOM: Limitation of Liability (Section 3): The following paragraph is added to this Section at the end of the first paragraph: The limitation of liability will not apply to any breach of IBM's obligations implied by Section 12 of the Sale of Goods Act 1979 or Section 2 of the Supply of Goods and Services Act 1982.
Severity:
0 : Information

Explanation:
This is part of the Early Release of Programs License Agreement (VZ125-5544-01 10/97 (MK002))which must be accepted before a Beta test period can be started. A Beta test version allows a copy of WebSphere MQ to be used for a limited period only.

Response:
None.

AMQ7191There are <insert_1> days left in the beta test period for this copy of WebSphere MQ.
Severity:
0 : Information

Explanation:
This copy of WebSphere MQ is licensed for a limited period only.

Response:
None.

AMQ7192The Beta test period for this copy of WebSphere MQ has already been started.
Severity:
0 : Information

Explanation:
This copy of WebSphere MQ is licensed for a limited period only and the Beta test period has been started previously.

Response:
None.

AMQ7193Reply 'yes' to accept the Agreement. Reply 'no' if you do not agree to the terms of the Agreement. Reply 'no' and submit the command again, if you want to read the Agreement again.
Severity:
0 : Information

Explanation:
The IBM International License Agreement for Early Release of Programs has been displayed to the user and the user should now accept or reject the Agreement.

Response:
Reply 'yes' or 'no' and press 'Enter'.

AMQ7194
Press Enter to continue
Severity:
0 : Information

Explanation:
Part of the IBM International License Agreement for Early Release of Programs has been displayed to the user. The user should press the Enter key to indicate that they are ready for the next part of the Agreement to be displayed.

Response:
Press the Enter key when ready for the next part of the Agreement to be displayed.

AMQ7195The Beta test license is not valid for this copy of WebSphere MQ.
Severity:
20 : Error

Explanation:
The license <insert_3> has been installed but it is not a valid trial period license for this copy of WebSphere MQ.

Response:
Check that the correct version of the file is available.

AMQ7196By installing this product, you accept the terms of the International Program License Agreement and the License Information supplied with the product.
Severity:
0 : Information

Response:
None.

AMQ7197A production or trial license could not be installed for this copy of WebSphere MQ.
Severity:
20 : Error

Explanation:
This copy of WebSphere MQ is a beta version and cannot be used with a production or trial license.

Response:
Uninstall the beta version of WebSphere MQ and install the production or trial version.

AMQ7198Insufficient license units.
Severity:
10 : Warning

Explanation:
The purchased processor allowance (<insert_1>) is less than the number of processors (<insert_2>) in this machine.

Response:
Ensure sufficient license units have been purchased and use the MQ setmqcap command to set the purchased processor allowance for this installation. Refer to the Quick Beginnings book for more information.

AMQ7198 (iSeries)Insufficient license units.
Severity:
10 : Warning

Explanation:
The purchased processor allowance for this installation is zero.

Response:
Ensure sufficient license units have been purchased and use the MQ CHGMQMCAP command to set the purchased processor allowance for this installation. Refer to the Quick Beginnings book for more information.

AMQ7199The purchased processor allowance is set to <insert_1>.
Severity:
0 : Information

Explanation:
The purchased processor allowance for this installation has been set to <insert_1> using the MQ setmqcap command.

Response:
None.

AMQ7199 (iSeries)The purchased processor allowance is set to <insert_1>.
Severity:
0 : Information

Explanation:
The purchased processor allowance for this installation has been set to <insert_1> using the MQ CHGMQMCAP command.

Response:
None.

AMQ7200The purchased processor allowance is <insert_1>
Severity:
0 : Information

Explanation:
The purchased processor allowance is currently set to <insert_1>

Response:
Ensure sufficient license units have been purchased and, if necessary, use the MQ setmqcap command to change the purchased processor allowance for this installation. Refer to the Quick Beginnings book for more information.

AMQ7200 (iSeries)The purchased processor allowance is <insert_1>
Severity:
0 : Information

Explanation:
The purchased processor allowance is currently set to <insert_1>

Response:
Ensure sufficient license units have been purchased and, if necessary, use the MQ CHGMQMCAP command to change the purchased processor allowance for this installation. Refer to the Quick Beginnings book for more information.

AMQ7201The number of processors in this machine is <insert_1>
Severity:
0 : Information

Explanation:
The operating system reports that the number of processors in this machine is <insert_1>

Response:
None.

AMQ7202The number of license units is sufficient for all future possible upgrades to this machine.
Severity:
0 : Information

Explanation:
The purchased processor allowance for this installation has been set to -1, which allows any permitted processor configuration.

Response:
None.

AMQ7203Purchased processor allowance not set (use setmqcap).
Severity:
10 : Warning

Explanation:
The purchased processor allowance for this installation has not been set.

Response:
Ensure sufficient license units have been purchased and use the MQ setmqcap command to set the purchased processor allowance for this installation. Refer to the Quick Beginnings book for more information.

AMQ7203 (iSeries)Purchased processor allowance not set (use CHGMQMCAP).
Severity:
10 : Warning

Explanation:
The purchased processor allowance for this installation has not been set.

Response:
Ensure sufficient license units have been purchased and use the MQ CHGMQMCAP command to set the purchased processor allowance for this installation. Refer to the Quick Beginnings book for more information.

AMQ7204Intel Hyper-Threading support enabled with Logical Processor Mask <insert_1>.
Severity:
0 : Information

Explanation:
Install has detected Intel Hyper=Threading support enabled for Logical Processors. The current Processor Mask is set to <insert_1>.

Response:
None.

AMQ7205System call SetProcessAffinityMask failed for Logical Processor Mask <insert_1>. Process Affinity Mask is <insert_2>, process is continuing.
Severity:
0 : Information

Explanation:
Microsoft Windows system call SetProcessAffinityMask failed. If this problem persists contact your IBM service representative.

Response:
None.

AMQ7206 (Windows)Group name has been truncated.
Severity:
40 : Stop Error

Explanation:
WebSphere MQ only supports group names up to 12 characters long. The operating system is attempting to return a group longer than this.

Response:
Reduce the group name to 12 characters or less.

AMQ7207 (Windows)User ID longer than 12 characters.
Severity:
40 : Stop Error

Explanation:
WebSphere MQ only supports user names up to 12 characters long. This operation is being attempted from a user name longer than this.

Response:
Reduce the user name to 12 characters or less.

AMQ7208The queue manager failed to pass a PCF message to another queue manager.
Severity:
10 : Warning

Explanation:
The queue manager attempted to put a PCF message to <insert_3> to start the channel <insert_4> to cluster queue manager <insert_5>. The put failed with reason <insert_1>. When the queue manager resolves a cluster queue to a remote cluster queue manager, the message is put to the SYSTEM.CLUS.TRANSMIT.QUEUE. If the channel to the remote cluster queue manager is not running, the queue manager attempts to start the channel by sending a PCF message to <insert_3>.

Response:
Resolve the problem with <insert_3> and if necessary start the channel manually.

AMQ7209The queue manager attempted to open SYSTEM.CHANNEL.INITQ which failed with reason <insert_3>
Severity:
10 : Warning

Explanation:
When the queue manager resolves a cluster queue to a remote cluster queue manager, the message is put to the SYSTEM.CLUS.TRANSMIT.QUEUE. If the channel to the remote cluster queue manager is not running, the queue manager attempts to start the channel by sending a PCF message to the SYSTEM.CHANNEL.INITQ

Response:
Resolve the problem with the SYSTEM.CHANNEL.INITQ and if necessary start the channels manually.

AMQ7210The Cluster Workload exit module could not be loaded.
Severity:
10 : Warning

Explanation:
The Cluster Workload exit module <insert_3> could not be loaded for reason <insert_4>.

Response:
Correct the problem with the Cluster Workload exit module <insert_3>

AMQ7211The Queue Manager is still waiting for a reply from the Cluster Workload Exit server process.
Severity:
10 : Warning

Explanation:
The Queue Manager is configured to run the Cluster Workload Exit in SAFE mode. This means that the Cluster Workload Exit is run by a server process (amqzlw0). The Queue Manager has been waiting <insert_1> seconds for this server process to reply to a request to run the Cluster Workload Exit. It is possible that the exit is hung or is looping.

Response:
End the Queue Manager, resolve the problem with the Cluster Workload Exit and restart the Queue Manager

AMQ7212The address of the Cluster exit function could not be found.
Severity:
10 : Warning

Explanation:
The address of the Cluster exit function <insert_4> could not be found in module <insert_3> for reason <insert_1> <insert_5>.

Response:
Correct the problem with the Cluster exit function <insert_4> in the module <insert_3>

AMQ7214The module for API Exit <insert_3> could not be loaded.
Severity:
40 : Stop Error

Explanation:
The module <insert_4> for API Exit <insert_3> could not be loaded for reason <insert_5>.

Response:
Correct the problem with the API Exit module <insert_3>.

AMQ7215The API Exit <insert_3> function <insert_4> could not be found in the module <insert_5>.
Severity:
40 : Stop Error

Explanation:
The API Exit <insert_3> function <insert_4> could not be found in the module <insert_5>. The internal return code was <insert_1>.

Response:
Correct the problem with the API Exit <insert_3>.

AMQ7215 (iSeries)Could not find a function in API Exit <insert_3>.
Severity:
40 : Stop Error

Explanation:
The API Exit <insert_3> function <insert_4> could not be found in the module <insert_5>. The internal return code was <insert_1>.

Response:
Correct the problem with the API Exit <insert_3>.

AMQ7216An API Exit initialization function returned an error.
Severity:
10 : Warning

Explanation:
The API Exit <insert_3> function <insert_4> in the module <insert_5> returned CompCode <insert_1> and ReasonCode <insert_2>.

Response:
Correct the problem with the API Exit <insert_3>

AMQ7217The response set by the exit is not valid.
Severity:
10 : Warning

Explanation:
The API Exit <insert_3> module <insert_4> function <insert_5> returned a response code <insert_1> that is not valid in the ExitResponse field of the API Exit parameters (MQAXP).

Response:
Investigate why the API Exit <insert_3> set a response code that is not valid.

AMQ7219profile: <insert_3>
Severity:
0 : Information

AMQ7220object type: <insert_3>
Severity:
0 : Information

AMQ7221entity: <insert_3>
Severity:
0 : Information

AMQ7222entity type: <insert_3>
Severity:
0 : Information

AMQ7223authority: <insert_3>
Severity:
0 : Information

AMQ7224profile: <insert_3>, object type: <insert_4>
Severity:
0 : Information

AMQ7225No matching authority records.
Severity:
0 : Information

Explanation:
No authority records match the specified parameters.

AMQ7226The profile name is invalid.
Severity:
20 : Error

Explanation:
The profile name contains invalid characters, contains an invalid wildcard specification, or is of invalid length.

Response:
Correct the profile name and submit it again.

AMQ7227WebSphere MQ encountered the following network error: <insert_3>
Severity:
10 : Warning

Explanation:
MQ failed to successfully complete a network operation due to the specified error.

Response:
Ensure that your network is functioning correctly.

AMQ7228 (iSeries)Display MQ Authority Records for <insert_3>
Severity:
0 : Information

AMQ7229<insert_1> log records accessed on queue manager <insert_3> during the log replay phase.
Severity:
0 : Information

Explanation:
<insert_1> log records have been accessed so far on queue manager <insert_3> during the log replay phase in order to bring the queue manager back to a previously known state.

Response:
None.

AMQ7230Log replay for queue manager <insert_3> complete.
Severity:
0 : Information

Explanation:
The log replay phase of the queue manager restart process has been completed for queue manager <insert_3>.

Response:
None.

AMQ7231<insert_1> log records accessed on queue manager <insert_3> during the recovery phase.
Severity:
0 : Information

Explanation:
<insert_1> log records have been accessed so far on queue manager <insert_3> during the recovery phase of the transactions manager state.

Response:
None.

AMQ7232Transaction manager state recovered for queue manager <insert_3>.
Severity:
0 : Information

Explanation:
The state of transactions at the time the queue manager ended has been recovered for queue manager <insert_3>.

Response:
None.

AMQ7233<insert_1> out of <insert_2> in-flight transactions resolved for queue manager <insert_3>.
Severity:
0 : Information

Explanation:
<insert_1> transactions out of <insert_2> in-flight at the time queue manager <insert_3> ended have been resolved.

Response:
None.

AMQ7234<insert_1> messages from queue <insert_4> loaded on queue manager <insert_3>.
Severity:
0 : Information

Explanation:
<insert_1> messages from queue <insert_4> have been loaded on queue manager <insert_3>.

Response:
None.

AMQ7235 (iSeries)Queue manager library <insert_3> already exists.
Severity:
40 : Stop Error

Explanation:
The library <insert_3> already exists.

Response:
Specify a library which does not already exist.

AMQ7236WebSphere MQ queue manager <insert_3> activated.
Severity:
0 : Information

Explanation:
WebSphere MQ queue manager <insert_3> has been activated.

Response:
None.

AMQ7237WebSphere MQ queue manager <insert_3> is not a backup queue manager.
Severity:
10 : Warning

Explanation:
WebSphere MQ queue manager <insert_3> is not a backup queue manager and so cannot be activated. A possible reason might be that the queue manager is configured for circular logging.

Response:
Re-try the command without the '-a' option.

AMQ7238WebSphere MQ queue manager <insert_3> replay completed.
Severity:
0 : Information

Explanation:
WebSphere MQ queue manager <insert_3> replay has completed.

Response:
None.

AMQ7249WebSphere MQ queue manager <insert_3> cannot be started for replay.
Severity:
20 : Error

Explanation:
WebSphere MQ queue manager <insert_3> cannot be started for replay. A possible reason might be that the queue manager is configured for circular logging.

Response:
Re-try the command without the '-r' option.

AMQ7250WebSphere MQ queue manager <insert_3> has not been activated.
Severity:
20 : Error

Explanation:
WebSphere MQ queue manager <insert_3> cannot be started because it has previously been started for replay but has not been activated.

Response:
Activate the queue manager and try starting the queue manager again.

AMQ7253The command <insert_3> requires one of the following arguments: <insert_4>.
Severity:
20 : Error

Explanation:
The command <insert_3> required at least one of the following arguments, none of which you supplied: <insert_4>.

Response:
Check the WebSphere MQ System Administration documentation for details on the usage of the command, correct the command and then retry.

AMQ7305Trigger message could not be put on an initiation queue.
Severity:
10 : Warning

Explanation:
The attempt to put a trigger message on queue <insert_4> on queue manager <insert_5> failed with reason code <insert_1>. The message will be put on the dead-letter queue.

Response:
Ensure that the initiation queue is available, and operational.

AMQ7306The dead-letter queue must be a local queue.
Severity:
10 : Warning

Explanation:
An undelivered message has not been put on the dead-letter queue <insert_4> on queue manager <insert_5>, because the queue is not a local queue. The message will be discarded.

Response:
Inform your system administrator.

AMQ7307A message could not be put on the dead-letter queue.
Severity:
10 : Warning

Explanation:
The attempt to put a message on the dead-letter queue <insert_4> on queue manager <insert_5> failed with reason code <insert_1>. The message will be discarded.

Response:
Ensure that the dead-letter queue is available and operational.

AMQ7308Trigger condition <insert_1> was not satisfied.
Severity:
0 : Information

Explanation:
At least one of the conditions required for generating a trigger message was not satisfied, so a trigger message was not generated. If you were expecting a trigger message, consult the WebSphere MQ Application Programming Guide for a list of the conditions required. (Note that arranging for condition <insert_1> to be satisfied might not be sufficient because the conditions are checked in an arbitrary order, and checking stops when the first unsatisfied condition is discovered.)

Response:
If a trigger message is required, ensure that all the conditions for generating one are satisfied.

AMQ7310Report message could not be put on a reply-to queue.
Severity:
10 : Warning

Explanation:
The attempt to put a report message on queue <insert_4> on queue manager <insert_5> failed with reason code <insert_1>. The message will be put on the dead-letter queue.

Response:
Ensure that the reply-to queue is available and operational.

AMQ7315Failed to put message to accounting queue. Reason(<insert_1>)
Severity:
20 : Error

Explanation:
The attempt to put a messsage containing accounting data to the queue <insert_3> failed with reason code <insert_1>. The message data has been discarded. 
This error message will be written only once for attempts to put a message to the queue as part of the same operation which fail for the same reason.

Response:
Ensure that the queue <insert_3> is available and operational.

AMQ7316Failed to put message to statistics queue. Reason(<insert_1>)
Severity:
20 : Error

Explanation:
The attempt to put a messsage containing statistics data to the queue <insert_3> failed with reason code <insert_1>. The message data has been discarded. 
This error message will be written only once for attempts to put a message to the queue as part of the same operation which fail for the same reason.

Response:
Ensure that the queue <insert_3> is available and operational.

AMQ7432 (iSeries)WebSphere MQ journal entry not available for replay.
Severity:
40 : Stop Error

Explanation:
A journal replay operation was attempted, but the operation required journal entries from journal receivers that are not currently present on the system.

Response:
Restore the required journal receivers from backup. Then try the operation again.

AMQ7433 (iSeries)An Error occured while performing a journal replay.
Severity:
40 : Stop Error

Explanation:
WebSphere MQ encountered a problem reading one or more journal entries while performing a journal replay operation.

Response:
If you have previously created a journal receiver for a queue manager or are performing a cold restart of a queue manager, delete the QMQMCHKPT file from the queue manager subdirectory in /QIBM/UserData/mqm/qmgrs/ and attempt to restart the queue manager. If the problem persists, use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ7434 (iSeries)The MQ commitment control exit program was called incorrectly. Code <insert_1>.
Severity:
40 : Stop Error

Explanation:
The WebSphere MQ commitment control exit program was called with incorrect parameters.

Response:
If the program was called by OS/400 as part of a commit or rollback, save the job log, and contact your IBM support center.

AMQ7435 (iSeries)The MQ commitment control exit program failed. Code <insert_1>.
Severity:
40 : Stop Error

Explanation:
The WebSphere MQ commitment control exit program failed due to an unexpected error.

Response:
Save the job log, and contact your IBM support center.

AMQ7459 (iSeries)WebSphere MQ journal receiver <insert_3> is the oldest in the chain
Severity:
0 : Information

Explanation:
The oldest journal receiver in the receiver chain is <insert_3> in library <insert_4>.

Response:
None

AMQ7460 (iSeries)WebSphere MQ startup journal information.
Severity:
0 : Information

Explanation:
This message is issued periodically by WebSphere MQ to help you identify which journal receivers can be removed from the system because they are no longer required for startup recovery.

Response:
None

AMQ7461 (iSeries)WebSphere MQ object re-created - reapply authorities.
Severity:
0 : Information

Explanation:
A previously damaged object has been re-created, either automatically, or by explicit use of the Recreate Object (RCRMQMOBJ) command. The authorities that applied to this object have not been re-created.

Response:
Use the Grant Authority (GRTMQMAUT) command, as appropriate, to re-create the required authorities to this MQ object.

AMQ7462 (iSeries)WebSphere MQ media recovery journal information.
Severity:
0 : Information

Explanation:
This message is issued periodically by WebSphere MQ to help you identify which journal receivers can be removed from the system because they are no longer required for media recovery.

Response:
None

AMQ7463The log for queue manager <insert_3> is full.
Severity:
20 : Error

Explanation:
This message is issued when an attempt to write a log record is rejected because the log is full. The queue manager will attempt to resolve the problem.

Response:
This situation may be encountered during a period of unusually high message traffic. However, if you persistently fill the log, you may have to consider enlarging the size of the log. You can either increase the number of log files by changing the values in the queue manager configuration file. You will then have to stop and restart the queue manager. Alternatively, if you need to make the log files themselves bigger, you will have to delete and recreate the queue manager.

AMQ7464The log for queue manager <insert_3> is no longer full.
Severity:
0 : Information

Explanation:
This message is issued when a log was previously full, but an attempt to write a log record has now been accepted. The log full situation has been resolved.

Response:
None

AMQ7465The log for queue manager <insert_3> is full.
Severity:
20 : Error

Explanation:
An attempt to resolve a log full situation has failed. This is due to the presence of a long-running transaction.

Response:
Try to ensure that the duration of your transactions is not excessive. Commit or roll back any old transactions to release log space for further log records.

AMQ7466There is a problem with the size of the logfile.
Severity:
10 : Warning

Explanation:
The log for queue manager <insert_3> is too small to support the current data rate. This message is issued when the monitoring tasks maintaining the log cannot keep up with the current rate of data being written.

Response:
The number of primary log files configured should be increased to prevent possible log full situations.

AMQ7467The oldest log file required to start queue manager <insert_3> is <insert_4>.
Severity:
0 : Information

Explanation:
The log file <insert_4> contains the oldest log record required to restart the queue manager. Log records older than this may be required for media recovery.

Response:
You can move log files older than <insert_4> to an archive medium to release space in the log directory. If you move any of the log files required to recreate objects from their media images, you will have to restore them to recreate the objects.

AMQ7468The oldest log file required to perform media recovery of queue manager <insert_3> is <insert_4>.
Severity:
0 : Information

Explanation:
The log file <insert_4> contains the oldest log record required to recreate any of the objects from their media images. Any log files prior to this will not be accessed by media recovery operations.

Response:
You can move log files older than <insert_4> to an archive medium to release space in the log directory.

AMQ7469Transactions rolled back to release log space.
Severity:
0 : Information

Explanation:
The log space for the queue manager is becoming full. One or more long-running transactions have been rolled back to release log space so that the queue manager can continue to process requests.

Response:
Try to ensure that the duration of your transactions is not excessive. Consider increasing the size of the log to allow transactions to last longer before the log starts to become full.

AMQ7472Object <insert_3>, type <insert_4> damaged.
Severity:
10 : Warning

Explanation:
Object <insert_3>, type <insert_4> has been marked as damaged. This indicates that the queue manager was either unable to access the object in the file system, or that some kind of inconsistency with the data in the object was detected.

Response:
If a damaged object is detected, the action performed depends on whether the queue manager supports media recovery and when the damage was detected. If the queue manager does not support media recovery, you must delete the object as no recovery is possible. If the queue manager does support media recovery and the damage is detected during the processing performed when the queue manager is being started, the queue manager will automatically initiate media recovery of the object. If the queue manager supports media recovery and the damage is detected once the queue manager has started, it may be recovered from a media image using the rcrmqmobj command or it may be deleted.

AMQ7472 (iSeries)Object <insert_3>, type <insert_4> damaged.
Severity:
10 : Warning

Explanation:
Object <insert_3>, type <insert_4> has been marked as damaged. This indicates that the queue manager was either unable to access the object in the file system, or that some kind of inconsistency with the data in the object was detected.

Response:
If a damaged object is detected, the action performed depends on whether the queue manager supports media recovery and when the damage was detected. If the queue manager does not support media recovery, you must delete the object as no recovery is possible. If the queue manager does support media recovery and the damage is detected during the processing performed when the queue manager is being started, the queue manager will automatically initiate media recovery of the object. If the queue manager supports media recovery and the damage is detected once the queue manager has started, it may be recovered from a media image using the RCRMQMOBJ command or it may be deleted.

AMQ7477 (iSeries)WebSphere MQ session no longer active.
Severity:
10 : Warning

Explanation:
The commitment control exit program was called during a commit or rollback operation. The queue manager was stopped while the program was registered. This might have resulted in the rolling back of some uncommitted message operations.

Response:
Inform your system administrator that uncommitted message operations might have been rolled back when the queue manager was stopped.

AMQ7484Failed to put message to logger event queue. Reason(<insert_2>)
Severity:
20 : Error

Explanation:
The attempt to put a logger event messsage to the queue <insert_3> failed with reason code <insert_2>. The message data has been discarded.

Response:
Ensure that the queue <insert_3> is available and operational. Current logger status information can be displayed with the DISPLAY QMSTATUS runmqsc command.

AMQ7601Duplicate XA resource manager is not valid.
Severity:
40 : Stop Error

Explanation:
Line <insert_1> of the configuration file <insert_3> contained a duplicate XA resource manager <insert_5>. This is not valid for attribute <insert_4>. Each XA resource manager must be given a unique name.

Response:
Check the contents of the file and retry the operation.

AMQ7601 (Windows)Duplicate XA resource manager <insert_5> not valid for attribute <insert_4> at <insert_3> in the configuration data.
Severity:
40 : Stop Error

Explanation:
Key <insert_3> in the configuration data contained a duplicate XA resource manager <insert_5>. This is not valid for attribute <insert_4>. Each XA resource manager must be given a unique name.

Response:
Check the contents of the configuration data and retry the operation.

AMQ7602 (iSeries)The MQ commitment control exit program was called incorrectly.
Severity:
40 : Stop Error

Explanation:
The WebSphere MQ commitment control exit program was called with incorrect parameters.

Response:
If the program was called by OS/400 as part of a commit or rollback, save the job log, and contact your IBM support center.

AMQ7603WebSphere MQ has been configured with invalid resource manager <insert_3>.
Severity:
20 : Error

Explanation:
The XA switch file <insert_4> for resource manager <insert_3> indicates that an attempt has been made to configure another queue manager as an external resource manager. This is not allowed so the queue manager will terminate.

Response:
Remove the offending XAResourceManager stanza from the qm.ini configuration file and restart the queue manager.

AMQ7603 (Windows)WebSphere MQ has been configured with resource manager <insert_3> that is not valid.
Severity:
20 : Error

Explanation:
The XA switch file <insert_4> for resource manager <insert_3> indicates that an attempt has been made to configure another queue manager as an external resource manager. This is not allowed, so the queue manager will terminate.

Response:
Remove the offending XAResourceManager stanza from the configuration data and restart the queue manager.

AMQ7604The XA resource manager <insert_3> was not available when called for <insert_4>. The queue manager is continuing without this resource manager.
Severity:
10 : Warning

Explanation:
The XA resource manager <insert_3> has indicated that it is not available, by returning XAER_RMERR on an xa_open request or XAER_RMFAIL when called for something else. Normally this indicates that the resource manager has been shut down. In this case the resource manager cannot participate in any new transactions. Any in-flight transactions in which it was involved will be backed out, and any transactions in which it is in-doubt will only be resolved when contact with the resource manager is re-established. A further message will be issued when the queue manager has been able to do this. If the problem occurred on an xa_open request, and the resource manager should be available, then there may be a configuration problem.

Response:
Try to establish the reason why the resource manager is unavailable. It may be that an invalid XAOpenString has been defined for the resource manager in the 'qm.ini' configuration file. If this is the case, stop and then restart the queue manager so that any change will be picked up. Alternatively, the queue manager may be reaching a resource constraint with this resource manager. For example, the resource manager may not be able to accommodate all of the queue manager processes being connected at one time, you may need to alter one of its tuning parameters.

AMQ7604 (iSeries)The XA resource manager was not available when called.
Severity:
10 : Warning

Explanation:
The XA resource manager <insert_3> has indicated that it is not available, by returning XAER_RMERR on an xa_open request or XAER_RMFAIL when called for <insert_4>. The queue manager is continuing without this resource manager. Normally this indicates that the resource manager has been shut down. In this case the resource manager cannot participate in any new transactions. Any in-flight transactions in which it was involved will be backed out, and any transactions in which it is in-doubt will only be resolved when contact with the resource manager is re-established. A further message will be issued when the queue manager has been able to do this. If the problem occurred on an xa_open request, and the resource manager should be available, then there may be a configuration problem.

Response:
Try to establish the reason why the resource manager is unavailable. It may be that an invalid XAOpenString has been defined for the resource manager in the 'qm.ini' configuration file. If this is the case, stop and then restart the queue manager so that any change will be picked up. Alternatively, the queue manager may be reaching a resource constraint with this resource manager. For example, the resource manager may not be able to accommodate all of the queue manager processes being connected at one time, you may need to alter one of its tuning parameters.

AMQ7605The XA resource manager <insert_3> has returned an unexpected return code <insert_1>, when called for <insert_4>.
Severity:
20 : Error

Explanation:
WebSphere MQ received an unexpected return code when calling XA resource manager <insert_3> at its <insert_4> entry point. This indicates an internal error, either within MQ or the resource manager.

Response:
Try to determine the source of the error. A trace of the failure could be used to look at the XA flows between MQ and the resource manager. MQ has allocated an RMId of <insert_2> to this resource manager. This will be useful when isolating the flows associated with the resource manager concerned. If the error occurs on an xa_commit or xa_rollback request, the queue manager will not attempt to redeliver the commit or rollback instruction for this transaction, until after the queue manager has been restarted. The transaction indoubt is identified by the following XID of X<insert_5>. If you think that the error lies within the queue manager, contact your IBM support center. Do not discard any information describing the problem until after the problem has been resolved.

AMQ7605 (iSeries)The XA resource manager has returned an unexpected return code.
Severity:
20 : Error

Explanation:
WebSphere MQ received unexpected return code <insert_1> when calling XA resource manager <insert_3> at its <insert_4> entry point. This indicates an internal error, either within MQ or the resource manager.

Response:
Try to determine the source of the error. A trace of the failure could be used to look at the XA flows between MQ and the resource manager. MQ has allocated an RMId of <insert_2> to this resource manager. This will be useful when isolating the flows associated with the resource manager concerned. If the error occurs on an xa_commit or xa_rollback request, the queue manager will not attempt to redeliver the commit or rollback instruction for this transaction, until after the queue manager has been restarted. The transaction indoubt is identified by the following XID of X<insert_5>. If you think that the error lies within the queue manager, contact your IBM support center. Do not discard any information describing the problem until after the problem has been resolved.

AMQ7606A transaction has been committed but one or more resource managers have backed out.
Severity:
20 : Error

Explanation:
WebSphere MQ was processing the commit operation for a transaction involving external resource managers. One or more of these resource managers failed to obey the commit request and instead rolled back their updates. The outcome of the transaction is now mixed and the resources owned by these resource managers may now be out of synchronization. MQ will issue further messages to indicate which resource managers failed to commit their updates.

Response:
The transaction with the mixed outcome is identified by the following XID of X<insert_3>. The messages which identify the failing resource managers will also contain this same XID. If the transaction has completed it won't be displayed by the dspmqtrn command and all other transaction participants will have committed their updates. If the transaction is displayed by the dspmqtrn command then there are some participants still in prepared state. In order to preserve data integrity you will need to perform recovery steps local to the failing resource managers.

AMQ7607A transaction has been rolled back but one or more resource managers have committed.
Severity:
20 : Error

Explanation:
WebSphere MQ was rolling back a transaction involving external resource managers. One or more of these resource managers failed to obey the rollback request and instead committed their updates. The outcome of the transaction is now mixed and the resources owned by these resource managers may now be out of synchronization. MQ will issue further messages to indicate which resource managers failed to roll back their updates.

Response:
The transaction with the mixed outcome is identified by the following XID of X<insert_3>. The messages which identify the failing resource managers will also contain this same XID. If the transaction has completed it won't be displayed by the dspmqtrn command and all other transaction participants will have rolled back their updates. If the transaction is displayed by the dspmqtrn command then there are some participants still in prepared state. In order to preserve data integrity you will need to perform recovery steps local to the failing resource managers.

AMQ7608XA resource manager returned a heuristic return code.
Severity:
20 : Error

Explanation:
This message is associated with an earlier AMQ7606 message reporting a mixed transaction outcome. It identifies one of the resource managers (<insert_4>) that failed to commit its updates. The transaction associated with this failure is identified by the following XID of X<insert_3>.

Response:
Use the return code <insert_1> returned by the resource manager to determine the effects of the failure. The return code indicates that the resource manager made a heuristic decision about the outcome of the transaction which disagrees with the commit decision of the queue manager. In order to preserve data integrity you will need to perform recovery steps local to this resource manager.

AMQ7609XA resource manager returned a heuristic return code.
Severity:
20 : Error

Explanation:
This message is associated with an earlier AMQ7607 message reporting a mixed transaction outcome. It identifies one of the resource managers (<insert_4>) that failed to rollback its updates. The transaction associated with this failure is identified by the following XID of X<insert_3>.

Response:
Use the return code <insert_1> returned by the resource manager to determine the effects of the failure. The return code indicates that the resource manager made a heuristic decision about the outcome of the transaction which disagrees with the rollback decision of the queue manager. In order to preserve data integrity you will need to perform recovery steps local to this resource manager.

AMQ7612Switch call exception
Severity:
20 : Error

Explanation:
Exception number <insert_1> occurred when calling resource manager switch <insert_3>.

Response:
Check the resource manager switch has not been corrupted.

AMQ7622WebSphere MQ could not load the XA switch load file for resource manager <insert_3>.
Severity:
20 : Error

Explanation:
An error has occurred loading XA switch file <insert_4>. If the error occurred during startup then the queue manager will terminate. At all other times the queue manager will continue without this resource manager meaning that it will no longer be able to participate in global transactions. The queue manager will also retry the load of the switch file at regular intervals so that the resource manager will be able to participate again should the load problem be resolved.

Response:
Look for a previous message outlining the reason for the load failure. Message AMQ6175 is issued if the load failed because of a system error. If this is the case then follow the guidance given in message AMQ6175 to resolve the problem. In the absence of prior messages or FFST information related to this problem check that the name of the switch load file is correct and that it is present in a directory from which it can be dynamically loaded by the queue manager. The easiest method of doing this is to define the switch load file as a fully-qualified name. Note that if the queue manager is still running it will need to be restarted in order that any changes made to its configuration data can be picked up.

AMQ7623WebSphere MQ has not been configured with XA resource manager.
Severity:
10 : Warning

Explanation:
The queue manager has noticed that XA resource manager <insert_3> was removed from the qm.ini file of the queue manager. However, it was logged as being involved in <insert_1> transactions that are still in-doubt. The queue manager cannot resolve these transactions.The queue manager is continuing without this resource manager.

Response:
First check that the qm.ini configuration file of the queue manager concerned hasn't been mistakenly altered resulting in an 'XAResourceManager' stanza being removed, or the 'Name' of any the resource managers being changed. If the qm.ini file was changed by mistake then you will need to reinstate resource manager <insert_3> in the qm.ini file before stopping and then restarting the queue manager in order that the change will be picked up. If you have intentionally removed a resource manager from the qm.ini file, consider the integrity implications of your action since the resource manager concerned may be in an in-doubt state. If you are sure that is not the case then you can use the 'rsvmqtrn' command to deliver an outcome on behalf of the resource manager in order that the queue manager can forget about the transactions concerned. If you cannot be sure that such an action will not cause an integrity problem then you should consider re-instating the resource manager in the qm.ini file so that the queue manager can contact the resource manager and automatically resolve the transactions concerned next time the queue manager is restarted.

AMQ7623 (Windows)WebSphere MQ has not been configured with XA resource manager <insert_3> which may be involved in in-doubt transactions. The queue manager is continuing without this resource manager.
Severity:
10 : Warning

Explanation:
The queue manager has recognized that XA resource manager <insert_3> was removed from the registry entry of the queue manager. However, it was logged as being involved in <insert_1> transactions that are still in-doubt. The queue manager cannot resolve these transactions.

Response:
Check that the configuration data entry of the queue manager concerned has not been altered by mistake, resulting in an 'XAResourceManager' stanza being removed, or the 'Name' of any the resource managers being changed. 
If the configuration data entry was changed by mistake, you need to reinstate resource manager <insert_3> in the configuration data before stopping, and then restarting the queue manager to access the change. 
If you have intentionally removed a resource manager from the configuration data, consider the integrity implications of your action because the resource manager concerned may be in an in-doubt state. 
If you are sure that this is not the case, you can use the 'rsvmqtrn' command to instruct the resource manager to inform the queue manager that it can forget about the transactions concerned. 
If using the 'rsvmqtrn' command could result in an integrity problem, you should consider reinstating the resource manager in the configuration data, so that the queue manager can contact the resource manager and automatically resolve the transactions concerned next time the queue manager is restarted.

AMQ7624An exception occurred during an <insert_4> call to XA resource manager <insert_3>.
Severity:
20 : Error

Explanation:
An exception has been detected during a call to an XA resource manager. The queue manager will continue after assuming a return code of XAER_RMERR from the call.

Response:
An FFST should have been produced which documents the exception. Use this and any further FFSTs to try and determine the reason for the failure. A trace of the problem will be useful to identify the XA flows between the queue manager and the resource manager concerned. MQ has allocated an RMId of <insert_1> to this resource manager. Use this to isolate the flows concerned. First contact the supplier of the resource manager for problem resolution. If however you think that the problem lies within the queue manager then contact your IBM support center. Do not discard any information describing the problem until after it has been resolved.

AMQ7625The XA resource manager <insert_3> has become available again.
Severity:
0 : Information

Explanation:
WebSphere MQ has managed to regain contact with a resource manager that had become unavailable. Any in-doubt transactions involving this resource manager will be resolved. The resource manager will now be able to participate in new transactions.

Response:
None.

AMQ7626XA resource manager initialization failure. Refer to the error log for more information.
Severity:
20 : Error

Explanation:
The queue manager has failed to initialize one or more of the XA resource managers defined in the qm.ini configuration file.

Response:
Correct the error and restart the queue manager.

AMQ7626 (Windows)XA resource manager initialization failure. Refer to the error log for more information.
Severity:
20 : Error

Explanation:
The queue manager has failed to initialize one or more of the XA resource managers defined in the configuration data.

Response:
Correct the error and restart the queue manager.

AMQ7701DMPMQLOG command is starting.
Severity:
0 : Information

Explanation:
You have started the DMPMQLOG command and it is processing your request.

Response:
None.

AMQ7702DMPMQLOG command has finished successfully.
Severity:
0 : Information

Explanation:
The DMPMQLOG command has finished processing your request and no errors were detected.

Response:
None.

AMQ7703DMPMQLOG command has used option <insert_3> with an invalid value <insert_4>.
Severity:
20 : Error

Explanation:
You started the DMPMQLOG command specifying an invalid option value. The <insert_4> value for option <insert_3> is either missing or of an incorrect format.

Response:
Refer to the command syntax, and then try the command again.

AMQ7704DMPMQLOG command has used an invalid option <insert_3>.
Severity:
20 : Error

Explanation:
You started the DMPMQLOG command specifying an invalid option of <insert_3>.

Response:
Refer to the command syntax and then try the command again.

AMQ7705Usage: dmpmqlog [-b | -s StartLSN | -n ExtentNumber] [-e EndLSN] [-f LogFilePath] [-m QMgrName]
Severity:
0 : Information

Response:
None.

AMQ7706DMPMQLOG command has used an incorrect queue manager name <insert_3> or path <insert_4>.
Severity:
20 : Error

Explanation:
The DMPMQLOG command has used <insert_3> as the queue manager name and, if shown, <insert_4> as the directory path for <insert_3>. Either <insert_3> and/or <insert_4> is incorrect; if <insert_4> is not shown then it is <insert_3> which is incorrect. 
Possible reasons for the error include: 
that <insert_3> is not an existing queue manager name; 
the entries for <insert_3> in the MQ system initialization (INI) file are incorrect; 
<insert_4> is not a correct path for <insert_3>. 
If you started the command specifying option -m (queue manager name option) with a value then this value will have been used as the queue manager name, otherwise the default queue manager name will have been used.

Response:
Check that <insert_3> is an existing queue manager name. Check your MQ system's initialization (INI) file to ensure that <insert_3> and its associated entries are correct. If <insert_4> is shown, check that it is a correct MQ system directory path for <insert_3>.

AMQ7706 (iSeries)DMPMQLOG command has used an incorrect queue manager name or path.
Severity:
20 : Error

Explanation:
The DMPMQLOG command has used <insert_3> as the queue manager name and, if shown, <insert_4> as the directory path for <insert_3>. Either <insert_3> and/or <insert_4> is incorrect; if <insert_4> is not shown then it is <insert_3> which is incorrect. 
Possible reasons for the error include: 
that <insert_3> is not an existing queue manager name; 
the entries for <insert_3> in the MQ system initialization (INI) file are incorrect; 
<insert_4> is not a correct path for <insert_3>. 
If you started the command specifying option -m (queue manager name option) with a value then this value will have been used as the queue manager name, otherwise the default queue manager name will have been used.

Response:
Check that <insert_3> is an existing queue manager name. Check your MQ system's initialization (INI) file to ensure that <insert_3> and its associated entries are correct. If <insert_4> is shown, check that it is a correct MQ system directory path for <insert_3>.

AMQ7706 (Windows)DMPMQLOG command has used an incorrect queue manager name <insert_3> or path <insert_4>.
Severity:
20 : Error

Explanation:
The DMPMQLOG command has used <insert_3> as the queue manager name and, if shown, <insert_4> as the directory path for <insert_3>. Either <insert_3> and/or <insert_4> is incorrect; if <insert_4> is not shown then it is <insert_3> which is incorrect. 
Possible reasons for the error include: 
that <insert_3> is not an existing queue manager name; 
the entries for <insert_3> in the MQ configuration data are incorrect; 
<insert_4> is not a correct path for <insert_3>. 
If you started the command specifying option -m (queue manager name option) with a value then this value will have been used as the queue manager name, otherwise the default queue manager name will have been used.

Response:
Check that <insert_3> is an existing queue manager name. Check your MQ configuration data to ensure that <insert_3> and its associated entries are correct. If <insert_4> is shown, check that it is a correct MQ system directory path for <insert_3>.

AMQ7707DMPMQLOG command has failed: CompCode = 0x<insert_1>.
Severity:
20 : Error

Explanation:
The DMPMQLOG command has detected an error and the MQ recording routine has been called. Possible reasons for this include a damaged log file, a problem during initialization for the queue manager or an internal MQ failure.

Response:
Check that the queue manager being used by DMPMQLOG, as specified by you using the -m command option or defaulted, exists and is not currently running. If it does not exist, try the command again specifying an existing queue manager. If it is running, stop the queue manager and then try the command again. Otherwise, use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Do not discard these files until the problem has been resolved. Note the completion code (CompCode) and then contact your IBM support center.

AMQ7708DMPMQLOG command has used an invalid default queue manager name.
Severity:
20 : Error

Explanation:
You started the DMPMQLOG command without specifying option -m (queue manager name option) and so your MQ default queue manager name has been used. However, this default name either could not be found or is invalid.

Response:
Check that the default queue manager name exists and is valid, and then try the command again.

AMQ7709DMPMQLOG command has used an invalid combination of options.
Severity:
20 : Error

Explanation:
You started the DMPMQLOG command specifying an invalid combination of the options -b (base LSN option), -s (start LSN option) and -n (extent number option). Only 1 or none of these options may be specified.

Response:
Refer to the command syntax and then try the command again.

AMQ7710DMPMQLOG command has used option -n which is invalid for circular logging.
Severity:
20 : Error

Explanation:
You started the DMPMQLOG command specifying option -n (extent number option) but this is not valid when your MQ log is defined as circular.

Response:
Use a different option and then try the command again.

AMQ7711DMPMQLOG command has used option -m with a value that is too long.
Severity:
20 : Error

Explanation:
You started the DMPMQLOG command specifying option -m (queue manager name option) with a value that is more than <insert_1> characters.

Response:
Specify a shorter queue manager name and then try the command again.

AMQ7712DMPMQLOG command has used option -f with a value which is too long.
Severity:
20 : Error

Explanation:
You started the DMPMQLOG command specifying option -f (log file path option) with a value which is more than <insert_1> characters.

Response:
Specify a shorter log file path name and then try the command again.

AMQ7713DMPMQLOG command was unable to allocate sufficient storage.
Severity:
20 : Error

Explanation:
The DMPMQLOG command has been unable to allocate some storage.

Response:
Free some storage and then try the command again.

AMQ7714DMPMQLOG command has reached the end of the log.
Severity:
0 : Information

Explanation:
The DMPMQLOG command has processed any log data and has now reached the end of the log.

Response:
None.

AMQ7715DMPMQLOG command cannot open file <insert_3>.
Severity:
20 : Error

Explanation:
The DMPMQLOG command was unable to open file <insert_3> for reading.

Response:
Check that the file exists, can be opened for reading, and that you have authority to access it, and then try the command again.

AMQ7716DMPMQLOG command has finished unsuccessfully.
Severity:
0 : Information

Explanation:
The DMPMQLOG command has finished with your request but an error has been detected. The previous message issued by the command can be used to identify the error.

Response:
Refer to the previous message issued by the command.

AMQ7717DMPMQLOG command has failed to initialize: CompCode = 0x<insert_1>.
Severity:
20 : Error

Explanation:
The DMPMQLOG command has failed during its initialization and the MQ recording routine has been called. Possible reasons for this include that your queue manager is already running. The completion code can be used to identify the error.

Response:
Check that the queue manager being used by DMPMQLOG, as specified by you using the -m command option or defaulted, exists and is not currently running. If it is running, stop the queue manager and then try the command again. Otherwise, use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ7718DMPMQLOG command is using a default of <insert_3> for the queue manager name.
Severity:
0 : Information

Explanation:
You have started the DMPMQLOG command without specifying option -m (queue manager name option) and so a default value of <insert_3> is being used. This value is obtained from your default queue manager name.

Response:
None.

AMQ7718 (iSeries)DMPMQLOG command is using a the default queue manager name.
Severity:
0 : Information

Explanation:
You have started the DMPMQLOG command without specifying option -m (queue manager name option) and so a default value of <insert_3> is being used. This value is obtained from your MQ default queue manager name.

Response:
None.

AMQ7719DMPMQLOG command is using a default of <insert_3> for the starting dump location.
Severity:
0 : Information

Explanation:
You have started the DMPMQLOG command without specifying option -b (base LSN option), option -s (start LSN option) or option -n (extent number option), and so a default value of <insert_3> is being used. This value is the Log Sequence Number (LSN) of the first record in the active part of the log, and will be used as the location from which to start dumping.

Response:
None.

AMQ7719 (iSeries)DMPMQLOG command is using the default starting dump location.
Severity:
0 : Information

Explanation:
You have started the DMPMQLOG command without specifying option -b (base LSN option), option -s (start LSN option) or option -n (extent number option), and so a default value of <insert_3> is being used. This value is the Log Sequence Number (LSN) of the first record in the active part of the log, and will be used as the location from which to start dumping.

Response:
None.

AMQ7720DMPMQLOG command is using extent <insert_1> but the current extent is <insert_2>.
Severity:
20 : Error

Explanation:
You have started the DMPMQLOG command specifying option -n (extent number option) with a value of <insert_1> but this value is greater than <insert_2>, which represents the extent currently being used.

Response:
When using option -n, specify its value as being less than or equal to the extent number currently being used.

AMQ7721DMPMQLOG command has not found any log records in extent number <insert_1>.
Severity:
0 : Information

Explanation:
During its normal processing, the DMPMQLOG command did not find any log records in this extent.

Response:
None.

AMQ7722DMPMQLOG command cannot find the object catalogue for queue manager <insert_3>.
Severity:
20 : Error

Explanation:
The DMPMQLOG command is using the queue manager named <insert_3> but cannot find the manager's object catalogue file. This file should have been created at the time the queue manager was created.

Response:
Refer to the "System Management Guide" for a description of the location and name of the object catalogue file. Check that the file exists and is available for use by this command. If it does not exist then you will need to re-create the queue manager.

AMQ7722 (iSeries)DMPMQLOG command cannot find the object catalogue for the queue manager.
Severity:
20 : Error

Explanation:
The DMPMQLOG command is using the queue manager named <insert_3> but cannot find the manager's object catalogue file. This file should have been created at the time the queue manager was created.

Response:
Refer to the "System Management Guide" for a description of the location and name of the object catalogue file. Check that the file exists and is available for use by this command. If it does not exist then you will need to re-create the queue manager.

AMQ7723DMPMQLOG command cannot find the requested Log Sequence Number (LSN).
Severity:
20 : Error

Explanation:
The DMPMQLOG command has been started with an LSN but it cannot be found in the log.

Response:
Check for an existing LSN and then try the command again.

AMQ7724DMPMQLOG command cannot use the requested extent number.
Severity:
20 : Error

Explanation:
The DMPMQLOG command has been started with an extent number but it is beyond the end of the log.

Response:
Check for an existing extent number and then try the command again.

AMQ7725DMPMQLOG command cannot find an old Log Sequence Number (LSN).
Severity:
20 : Error

Explanation:
The DMPMQLOG command has been started specifying an LSN which is older than the log's base LSN. However, the specified LSN could not be found.

Response:
Check for an existing LSN and then try the command again.

AMQ7726DMPMQLOG command has used option -s with an incorrect value for circular logging.
Severity:
20 : Error

Explanation:
You started the DMPMQLOG command specifying option -s (start LSN option) with a value which is less than the base LSN of a log which is defined as circular. LSN values less than the base LSN can only be specified when using a linear log.

Response:
When using option -s with a circular log, specify an option value which is equal or greater to the log's base LSN, and then try the command again.

AMQ7751 (iSeries)MIGRATEMQM program is starting.
Severity:
0 : Information

Explanation:
You have started the MIGRATEMQM program.

Response:
None.

AMQ7752 (iSeries)MIGRATEMQM has completed successfully.
Severity:
0 : Information

Explanation:
The MIGRATEMQM program has completed migration of your queue manager and no errors were detected.

Response:
None.

AMQ7753 (iSeries)MIGRATEMQM has failed due to errors.
Severity:
20 : Error

Explanation:
See the previously listed messages in the job log. Correct the errors and then restart the MIGRATEMQM program.

Response:
None.

AMQ7754 (iSeries)MIGRATEMQM has detected an error and is unable to continue.
Severity:
20 : Error

Explanation:
See the previously listed messages in this job log, or in associated job logs. Correct the errors and then restart the MIGRATEMQM program.

Response:
None.

AMQ7755 (iSeries)Unable to locate a required journal receiver.
Severity:
20 : Error

Explanation:
The MIGRATEMQM program attempted to locate the journal receivers to use for migration, but the operation required access to a journal or journal receiver that is not currently present on the system.

Response:
Restore the required journal or journal receiver from backup. Then restart the MIGRATEMQM program.

AMQ7756 (iSeries)Unable to locate a required journal entry.
Severity:
20 : Error

Explanation:
The MIGRATEMQM program was unable to retrieve a journal entry required for migration. The operation may have failed because a required journal receiver is not currently present on the system.

Response:
Restore the required journal receiver from backup. Then restart the MIGRATEMQM program.

AMQ7757 (iSeries)Queue manager <insert_3> already exists.
Severity:
20 : Error

Explanation:
The MIGRATEMQM program is unable to create a queue manager with the same name as used in the previous release because a queue manager of this name has already been created.

Response:
Delete the queue manager. Then restart the MIGRATEMQM program.

AMQ7758 (iSeries)Queue manager starting.
Severity:
0 : Information

Explanation:
The queue manager "<insert_3>" is starting.

Response:
None.

AMQ7759 (iSeries)Recreating WebSphere MQ objects.
Severity:
0 : Information

Explanation:
WebSphere MQ objects are being recreated from their media images contained in the log.

Response:
None.

AMQ7760 (iSeries)Recreating WebSphere MQ channels.
Severity:
0 : Information

Explanation:
WebSphere MQ channels are being recreated from the previous channel definition file.

Response:
None.

AMQ7761 (iSeries)Unexpected return code from command <insert_3>.
Severity:
20 : Error

Explanation:
An unexpected return code, <insert_1>, was returned by command <insert_3>.

Response:
See the previously listed messages in this job log, or in associated job logs.

AMQ7762 (iSeries)Unexpected error from channel migration.
Severity:
20 : Error

Explanation:
The migration of channel definitions or channel synchronization data encountered an unexpected error.

Response:
See the previously listed messages in this job log, or in associated job logs.

AMQ7770Sent file <insert_3>
Severity:
40 : Stop Error

Explanation:
The file was successfully sent.

Response:
None.

AMQ7771Received file.
Severity:
40 : Stop Error

Explanation:
The file was successfully received.

Response:
None.

AMQ7772Complete file list
Severity:
40 : Stop Error

Explanation:
Displays a list of complete files.

Response:
None.

AMQ7773Incomplete file list
Severity:
40 : Stop Error

Explanation:
Displays a list of incomplete files.

Response:
None.

AMQ7774Other message list
Severity:
40 : Stop Error

Explanation:
Displays a list of other messages.

Response:
None.

AMQ7775Nothing to list.
Severity:
40 : Stop Error

Explanation:
Nothing to list.

Response:
None.

AMQ7776Deleted.
Severity:
40 : Stop Error

Explanation:
File deleted.

Response:
None.

AMQ7777Nothing to delete.
Severity:
40 : Stop Error

Explanation:
Nothing to delete.

Response:
None.

AMQ7778Syntax error. The correct syntax is:
Severity:
40 : Stop Error

Explanation:
Invalid arguments supplied.

Response:
One or more options were incorrectly specified when issuing the send or receive command. Check the options used and reissue the command.

AMQ7779Cannot connect to default queue manager.
Severity:
40 : Stop Error

Explanation:
Queue manager not available.

Response:
Check that the queue manager exists and that the listener is running.

AMQ7780Cannot connect to queue manager <insert_3>
Severity:
40 : Stop Error

Explanation:
Queue manager not available.

Response:
Check that the queue manager exists and that the listener is running.

AMQ7781Application memory unavailable.
Severity:
40 : Stop Error

Explanation:
There is insufficient memory to perform the requested action.

Response:

1) Check the message size is not excessive 
2) Close other applications and try the command again

AMQ7783Queue name required.
Severity:
40 : Stop Error

Explanation:
A queue name was not specified when issuing a send or receive command.

Response:
Reissue the command with the QueueName option.

AMQ7784Cannot open queue <insert_3>
Severity:
40 : Stop Error

Explanation:
Cannot open queue <insert_3>

Response:
Check that the queue exists.

AMQ7785Cannot open file <insert_3>
Severity:
40 : Stop Error

Explanation:
Cannot open file <insert_3>

Response:
Check that the file exists, that it is in the correct location and has the appropriate file permissions.

AMQ7786Cannot put to queue <insert_3>
Severity:
40 : Stop Error

Explanation:
Cannot put to queue <insert_3>

Response:

1) Check the Queue Manager has sufficient log space for sending large messages 
2) Check the queue does not have put inhibited 
3) Check the queue is not full 
4) Check the message size of the queue is greater than the message size 
5) Check the user has sufficient authority to put messages on the queue

AMQ7787No file name specified.
Severity:
40 : Stop Error

Explanation:
No file name specified.

Response:
A file name was not specified when issuing a send command. Reissue the command with the FileName option.

AMQ7788Message length is too small to send data.
Severity:
40 : Stop Error

Explanation:
Message length is too small to send data.

Response:
Increase the message size and resend with a send command, using the -l MessageSize option to specify a larger message size.

AMQ7789Sending file has changed.
Severity:
40 : Stop Error

Explanation:
The file being sent has been changed before the complete file has been sent.

Response:
Check the file for integrity and reissue the send command.

AMQ7790Cannot get from queue <insert_3>
Severity:
40 : Stop Error

Explanation:
The list, get, delete or extract request has failed.

Response:

1) Check the queue does have get inhibited 
2) Check the user has sufficient WMQ authority to get messages from the queue

AMQ7791Cannot write to file.
Severity:
40 : Stop Error

Explanation:
The get or extract request has failed.

Response:

1) Check that the file is not write-protected. In Windows Explorer, right-click the file name and select Properties. Check the user has sufficient authority to write to the destination file system. 
2) Check the destination file system exists 
3) Check the destination file system is not full

AMQ7792CorrelId is invalid.
Severity:
40 : Stop Error

Explanation:
CorrelId is invalid.

Response:

1) Check that a valid correlation ID has been specified when receiving a file with the -c option. 
2) It must be 48 characters in length. 
3) Use the -v option of the receive command to display the correlation ID.

AMQ7793MsgId is invalid.
Severity:
40 : Stop Error

Explanation:
MsgId is invalid.

Response:

1) Check that a valid message ID has been specified when receiving an 'other' message with the -u option. 
2) It must be 48 characters in length.

AMQ7794No messages to receive.
Severity:
40 : Stop Error

Explanation:
There are no FTA files on the specified queue.

Response:
Check with the sender that the file was actually sent.

AMQ7795Cannot delete the file because it's not unique.
Severity:
40 : Stop Error

Explanation:
Cannot delete the file because it's not unique.

Response:
None.

AMQ7796Cannot replace an existing file.
Severity:
40 : Stop Error

Explanation:
Cannot replace an existing file.

Response:
Reissue the command with the -y option.

AMQ7797Unable to load the WebSphere MQ library.
Severity:
40 : Stop Error

Explanation:
Unable to load the WebSphere MQ library.

Response:
None.

AMQ7798Unable to locate <insert_3>.
Severity:
40 : Stop Error

Explanation:
This application requires <insert_3>.

Response:
Check that <insert_3> is available and installed correctly.

AMQ7799Unable to start <insert_3>.
Severity:
40 : Stop Error

Explanation:
This application cannot start <insert_3>.

Response:
Check that <insert_3> is available and installed correctly.

AMQ7800CorrelId <insert_3>
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ7801Dir <insert_3>
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ7802UserData <insert_3>
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ7803FileName <insert_3>
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ7804Length <insert_3>
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ7805MsgId <insert_3>
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ7806Could not start WebSphere MQ web administration server: <insert_1>.
Severity:
0 : Information

Explanation:
An unsuccessful attempt was made to start the web administration server on port <insert_1>.

Response:
Check the product is installed correctly; the required registry keys and values are correct and the web server port is not already in use. If the problem persists contact your service representative.

AMQ7807WebSphere MQ web administration server running.
Severity:
0 : Information

Explanation:
WebSphere MQ web administration server running. Listening on port <insert_4>, root directory is <insert_5>.

Response:
No action is required.

AMQ7808Internal run-time error in WebSphere MQ web administration: <insert_4>.
Severity:
0 : Information

Explanation:
WebSphere MQ web administration had the following internal run-time error: <insert_4>.

Response:
Check that: the product is installed correctly and that the required registry keys and values are correct. If the problem persists contact your service representative.

AMQ7809WebSphere MQ Publish/Subscribe web administration user limit reached.
Severity:
10 : Warning

Explanation:
The maximum number of concurrent web administration users has been reached (<insert_4>).

Response:
Use the 'Web Administration Server' properties page in the Microsoft Management Console to increase the value of the web administration 'MaxClients' parameter.

AMQ7810 (Windows)Failed to create class, reason code: <insert_1>.
Severity:
20 : Error

Explanation:
While trying to create class <insert_3> on <insert_4> error code <insert_1> was encountered. The associated error message generated by the operating system is: <insert_5>

Response:
Check the system documentation to determine the course of action required to rectify the problem.

AMQ7880 (Windows)Error code <insert_1> starting <insert_4>/<insert_3> WebSphere MQ service.
Severity:
0 : Information

Explanation:
The service was unable to start <insert_4>/<insert_3>. The error message reported was as follows: <insert_5>

Response:
Use WebSphere MQ Explorer to investigate why the service could not begin. If recovery for this service is active, MQ will attempt to recover.

AMQ7881 (Windows)Unable to stop <insert_4>/<insert_3> WebSphere MQ service, return code <insert_1>.
Severity:
10 : Warning

Explanation:
The WebSphere MQ service was unable to stop <insert_4>/<insert_3>. The error message reported was as follows: <insert_5>

Response:
Use WebSphere MQ Explorer to investigate why the service could not be stopped.

AMQ7882 (Windows)Attempting to recover <insert_4>/<insert_3> WebSphere MQ service.
Severity:
0 : Information

Explanation:
The WebSphere MQ service has detected that <insert_4>/<insert_3> has failed, and is attempting to restart it.

Response:
No Action Required.

AMQ7883 (Windows)<insert_4>/<insert_3> WebSphere MQ service started from recovery.
Severity:
0 : Information

Explanation:
The WebSphere MQ service has successfully recovered <insert_4>/<insert_3>.

Response:
No Action Required.

AMQ7884 (Windows)Unable to recover <insert_4>/<insert_3> WebSphere MQ service.
Severity:
10 : Warning

Explanation:
The WebSphere MQ service has attempted to recover <insert_4>/<insert_3>, but all attempts have failed. There will be no more attempts to recover this service.

Response:
Use WebSphere MQ Explorer to investigate why the service failed and could not be restarted.

AMQ7885 (Windows)Unable to delete queue manager <insert_4>, error <insert_1>.
Severity:
10 : Warning

Explanation:
An attempt to delete queue manager <insert_4> failed. WebSphere MQ returned error code <insert_1>: <insert_5>

Response:
Ensure that the queue manager name has been specified correctly, and try again.

AMQ7886 (Windows)Unable to create queue manager <insert_4>.
Severity:
10 : Warning

Explanation:
Queue manager <insert_4> could not be created. WebSphere MQ returned error <insert_1>: <insert_5>

Response:
Check the error and application event logs to investigate the reason for the the returned error and suggested responses to take to rectify the fault. If the problem cannot be resolved then use the standard facilities supplied with your system to record the problem identifier and save the generated output files, and then contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ7890 (Windows)Unable to open mapped file containing WebSphere MQ performance data.
Severity:
20 : Error

Explanation:
The WebSphere MQ extensible counter dll was unable to open a mapped file used to collect queue performance data. Your system may be running short on virtual memory.

Response:
No action required. Performance statistics for MQ queues will not be displayed.

AMQ7891 (Windows)Unable to create a mutex to access WebSphere MQ performance data.
Severity:
20 : Error

Explanation:
The WebSphere MQ extensible counter dll was unable to create a mutex required to synchronise collection of queue performance data

Response:
No action required. Performance statistics for MQ queues will not be displayed.

AMQ7892 (Windows)Unable to map to shared memory file containing WebSphere MQ performance data.
Severity:
20 : Error

Explanation:
The WebSphere MQ extensible counter dll was unable to map the shared memory file required for collection of queue performance data.

Response:
No action required. Performance statistics for MQ queues will not be displayed.

AMQ7893 (Windows)Unable to open "Performance" key for WebSphere MQ services. Status code: <insert_1>.
Severity:
20 : Error

Explanation:
The WebSphere MQ extensible counter dll was unable to obtain performance counter values from the "Performance" key for WebSphere MQ services. Status code is the return value from the Windows registry call RegOpenKeyEx.

Response:
No action required. Performance statistics for MQ queues will not be displayed.

AMQ7894 (Windows)Unable to read the "Performance\First Counter" value for WebSphere MQ services. Status code: <insert_1>.
Severity:
20 : Error

Explanation:
The WebSphere MQ extensible counter dll was unable to obtain performance counter values from the "Performance\First Counter" key for WebSphere MQ services. Status code is the return value from the Windows registry call RegOpenKeyEx.

Response:
No action required. Performance statistics for MQ queues will not be displayed.

AMQ7895 (Windows)Unable to read the "Performance\First Help" value for WebSphere MQ services. Status code: <insert_1>.
Severity:
20 : Error

Explanation:
The WebSphere MQ extensible counter dll was unable to obtain performance counter values from the "Performance\First Help" key for WebSphere MQ services. Status code is the return value from the Windows registry call RegOpenKeyEx.

Response:
No action required. Performance statistics for MQ queues will not be displayed.

AMQ7901The data-conversion exit <insert_3> has not loaded.
Severity:
30 : Severe error

Explanation:
The data-conversion exit program, <insert_3>, failed to load. The internal function gave exception <insert_4>.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ7903The data-conversion exit <insert_3> cannot be found.
Severity:
30 : Severe error

Explanation:
Message data conversion has been requested for a WebSphere MQ message with a user-defined format, but the necessary data-conversion exit program, <insert_3>, cannot be found. The internal function gave exception <insert_4>.

Response:
Check that the necessary data-conversion exit <insert_3> exists.

AMQ7904The data-conversion exit <insert_3> cannot be found, or loaded.
Severity:
30 : Severe error

Explanation:
Message data conversion was requested for a WebSphere MQ message with a user-defined format, but the necessary data conversion exit program, <insert_3>, was not found, or loaded. The <insert_4> function call gave a return code of <insert_1>.

Response:
Check that the necessary data conversion exit routine exists in one of the standard directories for dynamically loaded modules. If necessary, inspect the generated output to examine the message descriptor (MQMD structure) of the MQ message for the conversion which was requested. This may help you to determine where the message originated.

AMQ7905Unexpected exception <insert_4> in data-conversion exit.
Severity:
30 : Severe error

Explanation:
The data-conversion exit program, <insert_3>, ended with an unexpected exception <insert_4>. The message has not been converted.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ7907Unexpected exception in data-conversion exit.
Severity:
30 : Severe error

Explanation:
The data-conversion exit routine, <insert_3>, ended with an unexpected exception. The message has not been converted.

Response:
Correct the error in the data-conversion exit routine.

AMQ7908 (Windows)Display active directory CRL server details.
Severity:
0 : Information

Explanation:
Display active directory CRL server details.

Response:
None.

AMQ7909 (Windows)There are no active directory CRL server details to display.
Severity:
0 : Information

Explanation:
No active directory CRL server definitions could be found.

Response:
None.

AMQ7910 (Windows)Usage: setmqscp [-m QmgrName | * ] [-a] [-d] [-r]
Severity:
0 : Information

AMQ7911 (Windows)The default Active Directory could not be located on your domain.
Severity:
20 : Error

Explanation:
No domain controllers with Active Directories could be found on the domain that your computer is a member of.

Response:
Active Directory support for MQ client connections cannot be used without a default Active Directory available on your domain.

AMQ7912 (Windows)The Active Directory support library failed to initialize.
Severity:
20 : Error

Explanation:
WebSphere MQ support libraries for Active Directory client connections could not be initialized.

Response:
Check that the Active Directory client pre-requisite software has been installed on your machine before attempting to use this feature.

AMQ7913 (Windows)The WebSphere MQ Active Directory container could not be created.
Severity:
20 : Error

Explanation:
WebSphere MQ has failed to create an IBM-MQClientConnections container as a child of your domain's system container in the Active Directory.

Response:
Ensure that you have permission to create sub-containers of the system container, and modify the otherWellKnownObjects property of the system container.

AMQ7914 (Windows)Migration of the client connection table for Queue Manager <insert_3> failed with reason code <insert_1><insert_4>.
Severity:
10 : Warning

Explanation:
The client connection table for this Queue Manager could not be migrated at this time.

Response:
Ensure that the client connection table exists and is not corrupted, and that you have authority to create new objects in the Active Directory on your domain.

AMQ7915 (Windows)Created service connection point for connection <insert_3>.
Severity:
0 : Information

Explanation:
The service connection point was successfully created for this client connection.

Response:
None.

AMQ7916 (Windows)The Active Directory channel definition table could not be opened.
Severity:
20 : Error

Explanation:
The IBM-MQClientConnections Active Directory container could not be located in the Global Catalog.

Response:
Ensure that setmqscp has been used to create the container object and that you have permission to read the container and its child objects.

AMQ7917 (Windows)Display active directory channel details.
Severity:
0 : Information

Explanation:
Display active directory channel details.

Response:
None.

AMQ7918 (Windows)The WebSphere MQ Active Directory container could not be deleted.
Severity:
20 : Error

Explanation:
There was a problem when attempting to delete the MQ Active Directory container. The container must be empty before it can be deleted from the directory.

Response:
None.

AMQ7919 (Windows)There are no active directory client channel details to display.
Severity:
0 : Information

Explanation:
No active directory client channel definitions could be found.

Response:
None.

AMQ7920 (Windows)Usage: setmqcrl [-m QmgrName] [-a] [-d] [-r]
Severity:
0 : Information

AMQ7921An incorrect eye-catcher field in an MQDXP structure has been detected.
Severity:
30 : Severe error

Explanation:
The MQDXP structure passed to the Internal Formats Conversion routine contains an incorrect eye-catcher field.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ7922A PCF message is incomplete.
Severity:
30 : Severe error

Explanation:
Message data conversion cannot convert a message in Programmable Command Format (PCF) because the message is only <insert_1> bytes long and does not contain a PCF header. The message has either been truncated, or it contains data that is not valid.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Do not discard these files until the problem has been resolved. Use the file containing the Message Descriptor of the message to determine the source of the message and to see how data that is not valid became included in the message.

AMQ7923A message had an unrecognized integer encoding - <insert_1>.
Severity:
30 : Severe error

Explanation:
Message data conversion cannot convert a message because the integer encoding value of the message, <insert_1>, was not recognized.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Do not discard these files until the problem has been resolved. Use the file containing the Message Descriptor of the message to determine the source of the message and to see how data that is not valid became included in the message.

AMQ7924Bad length in the PCF header (length = <insert_1>).
Severity:
30 : Severe error

Explanation:
Message data conversion cannot convert a message in Programmable Command Format (PCF) because the PCF header structure contains an incorrect length field. Either the message has been truncated, or it contains data that is not valid.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Do not discard these files until the problem has been resolved. Use the file containing the Message Descriptor of the message to determine the source of the message and to see how data that is not valid became included in the message.

AMQ7925Message version <insert_1> is not supported.
Severity:
30 : Severe error

Explanation:
Message data conversion cannot convert a message because the Version field of the message contains an incorrect value.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Do not discard these files until the problem has been resolved. Use the file containing the Message Descriptor of the message to determine the source of the message and to see how data that is not valid became included in the message.

AMQ7926A PCF message has an incorrect parameter count value <insert_1>.
Severity:
30 : Severe error

Explanation:
Message data conversion cannot convert a message in Programmable Command Format (PCF) because the parameter count field of the PCF header is incorrect.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Do not discard these files until the problem has been resolved. Use the file containing the Message Descriptor of the message to determine the source of the message and to see how data that is not valid became included in the message.

AMQ7927Bad type in PCF structure number <insert_1> (type = <insert_2>).
Severity:
30 : Severe error

Explanation:
A Programmable Command Format (PCF) structure passed to the Internal Formats Converter contained an incorrect type field.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Do not discard these files until the problem has been resolved. Use the file containing the Message Descriptor of the message to determine the source of the message and to see how data that is not valid became included in the message.

AMQ7928Bad length in PCF structure number <insert_1> (length = <insert_2>).
Severity:
30 : Severe error

Explanation:
A Programmable Command Format (PCF) structure passed to the Internal Formats Converter contained an incorrect length field.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Do not discard these files until the problem has been resolved. Use the file containing the Message Descriptor of the message to determine the source of the message and to see how data that is not valid became included in the message.

AMQ7929A PCF structure is incomplete.
Severity:
30 : Severe error

Explanation:
Message data conversion cannot convert a message in Programmable Command Format (PCF) because structure number <insert_1>, of Type value <insert_2>, within the message is incomplete. The message has either been truncated, or it contains data that is not valid.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Do not discard these files until the problem has been resolved. Use the file containing the Message Descriptor of the message to determine the source of the message and to see how data that is not valid became included in the message.

AMQ7930Bad CCSID in PCF structure number <insert_1> (CCSID = <insert_2>).
Severity:
30 : Severe error

Explanation:
A Programmable Command Format (PCF) structure passed to the Internal Formats Converter contains an incorrect CCSID.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Do not discard these files until the problem has been resolved. Use the file containing the Message Descriptor of the message to determine the source of the message and to see how data that is not valid became included in the message.

AMQ7931Bad length in PCF structure number <insert_1> (length = <insert_2>).
Severity:
30 : Severe error

Explanation:
Message data conversion cannot convert a message in Programmable Command Format (PCF) because one of the structures of the message contains an incorrect length field.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Do not discard these files until the problem has been resolved. Use the file containing the Message Descriptor of the message to determine the source of the message and to see how data that is not valid became included in the message.

AMQ7932Bad count in PCF structure number <insert_1> (count = <insert_2>).
Severity:
30 : Severe error

Explanation:
Message data conversion cannot convert a message in Programmable Command Format (PCF) because a StringList structure of the message contains an incorrect count field.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Do not discard these files until the problem has been resolved. Use the file containing the Message Descriptor, the headers of the message, and the incorrect structure to determine the source of the message, and to see how data that is not valid became included in the message.

AMQ7933Bad string length in PCF structure.
Severity:
30 : Severe error

Explanation:
Message data conversion cannot convert a message in Programmable Command Format (PCF) because structure number <insert_1> of the message contains an incorrect string length value <insert_2>.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Do not discard these files until the problem has been resolved. Use the file containing the Message Descriptor, the headers of the message, and the incorrect structure to determine the source of the message and to see how data that is not valid became included in the message.

AMQ7934Wrong combination of MQCCSI_DEFAULT with MQCCSI_EMBEDDED or MQEPH_CCSID_EMBEDDED.
Severity:
30 : Severe error

Explanation:
Message data conversion could not convert a message in Programmable Command Format (PCF) because structure <insert_1> of the message contained a CodedCharSetId field of MQCCSI_DEFAULT while the message itself had a CodedCharSetId of MQCCSI_EMBEDDED, or the Flags field of the MQEPH structure containing the PCF specified flag MQEPH_CCSID_EMBEDDED. These are incorrect combinations.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Do not discard these files until the problem has been resolved. Use the file containing the Message Descriptor, the headers of the message and the incorrect structure to determine the source of the message and to see how data that is not valid became included in the message.

AMQ7935Bad CCSID in message header (CCSID = <insert_1>).
Severity:
30 : Severe error

Explanation:
Message data conversion could not convert a message because the Message Descriptor of the message contained an incorrect CodedCharSetId field.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Do not discard these files until the problem has been resolved. Use the file containing the Message Descriptor of the message to determine the source of the message and to see how data that is not valid became included in the message.

AMQ7936The file <insert_3> already exists.
Severity:
30 : Severe error

Explanation:
The output file already exists, but REPLACE has not been specified.

Response:
Specify REPLACE to over-write the existing file, or select a different output file name.

AMQ7937Structure length <insert_1> in MQFMT_IMS_VAR_STRING format message is not valid.
Severity:
30 : Severe error

Explanation:
This error is detected when attempting data conversion. The valid range for the length is 4 (with no string data) to 32767. The message is returned unconverted with a reason code of MQRC_CONVERTED_STRING_TOO_BIG.

Response:
Check the content of the message before data conversion and correct the message format. When converting data using two or more bytes per character, remember that the number of bytes in each character can change during data conversion. This causes the message lengths to change.

AMQ7943Usage: crtmqcvx SourceFile TargetFile
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ7953One structure has been parsed.
Severity:
0 : Information

Explanation:
The crtmqcvx command has parsed one structure.

Response:
None.

AMQ7954<insert_1> structures have been parsed.
Severity:
0 : Information

Explanation:
The crtmqcvx command has parsed <insert_1> structures.

Response:
None.

AMQ7955Unexpected field: <insert_1>.
Severity:
0 : Information

Explanation:
The field within the structure is of a type that is not recognized.

Response:
Correct the field and retry the command.

AMQ7956Bad array dimension.
Severity:
0 : Information

Explanation:
An array field of the structure has an incorrect dimension value.

Response:
Correct the field and retry the command.

AMQ7957Warning at line <insert_1>.
Severity:
20 : Error

Explanation:
The structure contains another field after a variable length field. A variable length field must be the last field of the structure.

Response:
Correct the structure and retry the command.

AMQ7958Error at line <insert_1> in field <insert_3>.
Severity:
30 : Severe error

Explanation:
Field name <insert_3> is a field of type 'float'. Fields of type float are not supported by this command.

Response:
Either correct the structure to eliminate fields of type float, or write your own routine to support conversion of these fields.

AMQ7959Error at line <insert_1> in field <insert_3>.
Severity:
30 : Severe error

Explanation:
Field name <insert_3> is a field of type 'double'. Fields of type double are not supported by this command.

Response:
Either correct the structure to eliminate fields of type double, or write your own routine to support conversion of these fields.

AMQ7960Error at line <insert_1> in field <insert_3>.
Severity:
30 : Severe error

Explanation:
Field name <insert_3> is a 'pointer' field. Fields of type pointer are not supported by this command.

Response:
Either correct the structure to eliminate fields of type pointer, or write your own routine to support conversion of these fields.

AMQ7961Error at line <insert_1> in field <insert_3>.
Severity:
30 : Severe error

Explanation:
Field name <insert_3> is a 'bit' field. Bit fields are not supported by this command.

Response:
Either correct the structure to eliminate bit fields, or write your own routine to support conversion of these fields.

AMQ7962No input file specified.
Severity:
30 : Severe error

Explanation:
This command requires that an input file is specified.

Response:
Specify the name of the input file and retry the command.

AMQ7963No output file specified.
Severity:
30 : Severe error

Explanation:
This command requires that an output file name is specified.

Response:
Specify the name of the output file and retry the command.

AMQ7964Unexpected option <insert_3>.
Severity:
30 : Severe error

Explanation:
The option specified is not valid for this command.

Response:
Retry the command with a valid option.

AMQ7965Incorrect number of arguments.
Severity:
30 : Severe error

Explanation:
The command was passed an incorrect number of arguments.

Response:
Retry the command, passing it the correct number of arguments.

AMQ7968Cannot open file <insert_3>.
Severity:
30 : Severe error

Explanation:
You cannot open the file <insert_3>.

Response:
Check that you have the correct authorization to the file and retry the command.

AMQ7969Syntax error.
Severity:
30 : Severe error

Explanation:
This line of the input file contains a language syntax error.

Response:
Correct the syntax error and retry the command.

AMQ7970Syntax error on line <insert_1>.
Severity:
30 : Severe error

Explanation:
This message identifies where, in the input file, a previously reported error was detected.

Response:
Correct the error and retry the command.

AMQ7A01 (iSeries)Convert MQ Data Type
AMQ7A02 (iSeries)Display MQ Version
AMQ7A03 (iSeries)Create MQ Listener
AMQ7A04 (iSeries)Listener name
AMQ7A05 (iSeries)Listener control
AMQ7A06 (iSeries)Listener backlog
AMQ7A07 (iSeries)Change MQ Listener
AMQ7A08 (iSeries)Copy MQ Listener
AMQ7A09 (iSeries)From Listener
AMQ7A0A (iSeries)To Listener
AMQ7A0B (iSeries)Display MQ Listener
AMQ7A0C (iSeries)Delete MQ Listener
AMQ7A0D (iSeries)LSRNAME not allowed with PORT
Severity:
40 : Stop Error

Explanation:
A listener object can not be specified with a port.

Response:
Specify either a listener object or a port number.

AMQ7A0E (iSeries)LSRNAME not allowed with IPADDR
Severity:
40 : Stop Error

Explanation:
A listener object can not be specified with an IP address.

Response:
Specify either a listener object or an IP address.

AMQ7A0F (iSeries)Work with MQ Listener object
AMQ7A10 (iSeries)Create MQ Service
AMQ7A11 (iSeries)Change MQ Service
AMQ7A12 (iSeries)Copy MQ Service
AMQ7A13 (iSeries)Service name
AMQ7A14 (iSeries)Start program
AMQ7A15 (iSeries)Start program arguments
AMQ7A16 (iSeries)End program
AMQ7A17 (iSeries)End program arguments
AMQ7A18 (iSeries)Standard output
AMQ7A19 (iSeries)Standard error
AMQ7A1A (iSeries)Service type
AMQ7A1B (iSeries)Service control
AMQ7A1C (iSeries)From Service
AMQ7A1D (iSeries)To Service
AMQ7A1E (iSeries)Display MQ Service
AMQ7A20 (iSeries)Delete MQ Service
AMQ7A21 (iSeries)Work with MQ Service object
AMQ7A23 (iSeries)Start MQ Service
AMQ7A24 (iSeries)End MQ Service
AMQ7A25 (iSeries)Channel initiator control
AMQ7A26 (iSeries)Command server control
AMQ7A27 (iSeries)Display Queue Manager Status
AMQ7A28 (iSeries)Display Listener Status
AMQ7A29 (iSeries)Display Service Status
AMQ7A2A (iSeries)LSRNAME not allowed with OPTION
Severity:
40 : Stop Error

Explanation:
A listener object can not be specified with an end option.

Response:
Specify either a listener object or an end option.

AMQ7A2B (iSeries)Service startup
AMQ7A2C (iSeries)Work with Connection Handles
AMQ7A2D (iSeries)Connection Identifier
AMQ7A2E (iSeries)End Queue Manager Connection
AMQ7A2F (iSeries)Work with MQ Connections
AMQ7A30 (iSeries)Header Compression
AMQ7A31 (iSeries)Message Compression
AMQ7A32 (iSeries)Message compression *ANY not valid for channel type.
Severity:
30 : Severe error

Explanation:
The message compression value *ANY is only valid for *RCVR, *RQSTR and *SVRCN channel types.

Response:
Specify a valid message compression list.

AMQ7A33 (iSeries)Channel Monitoring
AMQ7A34 (iSeries)Channel Statistics
AMQ7A35 (iSeries)Cluster Workload Rank
AMQ7A36 (iSeries)Cluster Workload Priority
AMQ7A37 (iSeries)Cluster Channel Weight
AMQ7A38 (iSeries)Cluster workload channels
AMQ7A39 (iSeries)Cluster workload queue use
AMQ7A3A (iSeries)Queue Monitoring
AMQ7A3B (iSeries)Queue Manager Statistics
AMQ7A3C (iSeries)Cluster Sender Monitoring
AMQ7A3D (iSeries)Queue Statistics
AMQ7A3E (iSeries)Cluster Sender Statistics
AMQ7A3F (iSeries)Statistics Interval
AMQ7A40 (iSeries)Display MQ Route Information
AMQ7A41 (iSeries)Correlation Identifier
AMQ7A42 (iSeries)Message Persistence
AMQ7A43 (iSeries)Message Priority
AMQ7A44 (iSeries)Report Option
AMQ7A45 (iSeries)Reply Queue
AMQ7A46 (iSeries)Reply Queue Manager
AMQ7A47 (iSeries)Message Expiry
AMQ7A48 (iSeries)Expiry Report
AMQ7A49 (iSeries)Route Information
AMQ7A4A (iSeries)Reply Message
AMQ7A4B (iSeries)Deliver Message
AMQ7A4C (iSeries)Forward Message
AMQ7A4D (iSeries)Maximum Activities
AMQ7A4E (iSeries)Route Detail
AMQ7A4F (iSeries)Browse Only
AMQ7A50 (iSeries)Display Message
AMQ7A51 (iSeries)Target Queue Manager
AMQ7A52 (iSeries)Display Information
AMQ7A53 (iSeries)Wait Time
AMQ7A54 (iSeries)RTEINF(*YES) required for RPLYMSG(*YES).
Severity:
30 : Severe error

Explanation:
RPLYMSG(*YES) can not be specified without RTEINF(*YES).

Response:
If RPLYMSG(*YES) is specified then RTEINF(*YES) must also be specified.

AMQ7A55 (iSeries)RPLYQ required for RPLYMQM.
Severity:
30 : Severe error

Explanation:
RPLYMQM can not be specified without RPLYQ.

Response:
If RPLYMQM is specified then RPLYQ must also be specified.

AMQ7A56 (iSeries)CRRLID specified with invalid parameters.
Severity:
30 : Severe error

Explanation:
The CRRLID parameter was specified with one or more of MSGPST, MSGPRTY, OPTION, RPLYQ, RPLYMQM, EXPIRY, EXPRPT, RTEINF RPLYMSG, DLVRMSG, FWDMSG, MAXACTS and DETAIL which are invalid with CRRLID.

Response:
Specify only those parameters which are valid with CRRLID.

AMQ7A57 (iSeries)DSPMSG(*NO) specified with invalid parameters.
Severity:
30 : Severe error

Explanation:
DSPMSG(*NO) was specified with one or more of BROWSE, DSPINF and WAIT which are invalid with DSPMSG(*NO).

Response:
Specify only those parameters which are valid with DSPMSG(*NO).

AMQ7A58 (iSeries)RPLYQ required for DSPMSG(*NO) and RPLYMSG(*YES).
Severity:
30 : Severe error

Explanation:
DSPMSG(*NO) and RPLYMSG(*YES) can not be specified without RPLYQ.

Response:
If DSPMSG(*NO) and RPLYMSG(*YES) are specified than RPLYQ must also be specified.

AMQ7A59 (iSeries)RPLYQ required for DSPMSG(*NO) and OPTION not *NONE.
Severity:
30 : Severe error

Explanation:
DSPMSG(*NO) and OPTION not *NONE can not be specified without RPLYQ.

Response:
If DSPMSG(*NO) and OPTION not *NONE are specified than RPLYQ must also be specified.

AMQ7A5A (iSeries)Run WebSphere MQ Commands
AMQ7A5B (iSeries)Non Persistent Message Class
AMQ7A5C (iSeries)NPMCLASS not allowed with queue type *ALS or *RMT.
Severity:
40 : Stop Error

Explanation:
The NPMCLASS parameter may not be specified for a queue of type *ALS or *RMT.

Response:
Remove the NPMCLASS parameter from the command or, if the command is CRTMQMQ, specify a different value for QTYPE. Then try the command again.

AMQ7A5D (iSeries)MONQ not allowed with queue type *ALS or *RMT.
Severity:
40 : Stop Error

Explanation:
The MONQ parameter may not be specified for a queue of type *ALS or *RMT.

Response:
Remove the MONQ parameter from the command or, if the command is CRTMQMQ, specify a different value for QTYPE. Then try the command again.

AMQ7A5E (iSeries)STATQ not allowed with queue type *ALS or *RMT.
Severity:
40 : Stop Error

Explanation:
The STATQ parameter may not be specified for a queue of type *ALS or *RMT.

Response:
Remove the STATQ parameter from the command or, if the command is CRTMQMQ, specify a different value for QTYPE. Then try the command again.

AMQ7A5F (iSeries)ACCTQ not allowed with queue type *ALS or *RMT.
Severity:
40 : Stop Error

Explanation:
The ACCTQ parameter may not be specified for a queue of type *ALS or *RMT.

Response:
Remove the ACCTQ parameter from the command or, if the command is CRTMQMQ, specify a different value for QTYPE. Then try the command again.

AMQ7A60 (iSeries)All queue managers have been quiesced.
Severity:
0 : Information

Explanation:
All queue managers have been successfully quiesced.

Response:
None.

AMQ7A61 (iSeries)MQMNAME not valid for TRCEARLY(*YES).
Severity:
40 : Stop Error

Explanation:
The MQMNAME parameter may only be specified for TRCEARLY(*NO). TRCEARLY(*YES) applies to all queue managers.

Response:
If TRCEARLY(*YES) is required remove MQMNAME from the command.

AMQ7A62 (iSeries)MQMNAME not valid for SET(*END).
Severity:
40 : Stop Error

Explanation:
The MQMNAME parameter may only be specified for SET(*ON) or SET(*OFF). SET(*END) applies to all queue managers.

Response:
If SET(*END) is required remove MQMNAME from the command.

AMQ7B00 (iSeries)MQI Accounting
AMQ7B01 (iSeries)Input file
AMQ7B02 (iSeries)Queue Accounting
AMQ7B03 (iSeries)Member containing input
AMQ7B04 (iSeries)Accounting Interval
AMQ7B05 (iSeries)Accounting Override
AMQ7B06 (iSeries)Trace data size
AMQ7B07 (iSeries)Perform replay only
AMQ7B08 (iSeries)Activate backup
AMQ7B09 (iSeries)No connection handles to display
AMQ7B0A (iSeries)Trace Route Recording
AMQ7B0B (iSeries)Activity Recording
AMQ7B0C (iSeries)No queue manager connections to display
AMQ7B0D (iSeries)No listener objects to display
AMQ7B0E (iSeries)No service objects to display
AMQ7B0F (iSeries)CLWLRANK not allowed with queue type *MDL.
Severity:
40 : Stop Error

Explanation:
The CLWLRANK parameter may not be specified for a queue of type *MDL.

Response:
Remove the CLWLRANK parameter from the command or, if the command is CRTMQMQ, specify a different value for QTYPE. Then try the command again.

AMQ7B10 (iSeries)CLWLPRTY not allowed with queue type *MDL.
Severity:
40 : Stop Error

Explanation:
The CLWLPRTY parameter may not be specified for a queue of type *MDL.

Response:
Remove the CLWLPRTY parameter from the command or, if the command is CRTMQMQ, specify a different value for QTYPE. Then try the command again.

AMQ7B11 (iSeries)LSRNAME not allowed with BACKLOG
Severity:
40 : Stop Error

Explanation:
A listener object can not be specified with a listener backlog.

Response:
Specify either a listener object or a listener backlog.

AMQ7B12 (iSeries)MONCHL not valid for channel type *CLTCN.
Severity:
40 : Stop Error

Explanation:
The MONCHL parameter may not be specified with channel type *CLTCN.

Response:
Remove the MONCHL parameter from the command or, if the command is CRTMQMCHL, specify a different value for CHLTYPE. Then try the command again.

AMQ7B13 (iSeries)STATCHL not valid for channel type *CLTCN.
Severity:
40 : Stop Error

Explanation:
The STATCHL parameter may not be specified with channel type *CLTCN.

Response:
Remove the STATCHL parameter from the command or, if the command is CRTMQMCHL, specify a different value for CHLTYPE. Then try the command again.

AMQ7B14 (iSeries)CLWLRANK only valid for channel types *CLUSSDR and *CLUSRCVR.
Severity:
40 : Stop Error

Explanation:
The CLWLRANK parameter may only be specified with channel types *CLUSSDR or *CLUSRCVR.

Response:
Remove the CLWLRANK parameter from the command or, if the command is CRTMQMCHL, specify a different value for CHLTYPE. Then try the command again.

AMQ7B15 (iSeries)CLWLPRTY only valid for channel types *CLUSSDR and *CLUSRCVR.
Severity:
40 : Stop Error

Explanation:
The CLWLPRTY parameter may only be specified with channel types *CLUSSDR or *CLUSRCVR.

Response:
Remove the CLWLPRTY parameter from the command or, if the command is CRTMQMCHL, specify a different value for CHLTYPE. Then try the command again.

AMQ7B16 (iSeries)CLWLWGHT only valid for channel types *CLUSSDR and *CLUSRCVR.
Severity:
40 : Stop Error

Explanation:
The CLWLWGHT parameter may only be specified with channel types *CLUSSDR or *CLUSRCVR.

Response:
Remove the CLWLWGHT parameter from the command or, if the command is CRTMQMCHL, specify a different value for CHLTYPE. Then try the command again.

AMQ7B17 (iSeries)CLWLUSEQ only allowed with queue type *LCL.
Severity:
40 : Stop Error

Explanation:
The CLWLUSEQ parameter may only be specified for a queue of type *LCL.

Response:
Remove the CLWLUSEQ parameter from the command or, if the command is CRTMQMQ, specify a value of *LCL for QTYPE. Then try the command again.

AMQ7B18 (iSeries)MCAUSRID not valid for channel type *CLTCN.
Severity:
40 : Stop Error

Explanation:
The MCAUSRID parameter may not be specified with channel type *CLTCN.

Response:
Remove the MCAUSRID parameter from the command or, if the command is CRTMQMCHL, specify a different value for CHLTYPE. Then try the command again.


8000-8999 - Administration
See Reading a message for an explanation of how to interpret these messages.

AMQ8001WebSphere MQ queue manager created.
Severity:
0 : Information

Explanation:
WebSphere MQ queue manager <insert_5> created.

Response:
None.

AMQ8002WebSphere MQ queue manager <insert_5> deleted.
Severity:
0 : Information

Explanation:
WebSphere MQ queue manager <insert_5> deleted.

Response:
None.

AMQ8003WebSphere MQ queue manager <insert_5> started.
Severity:
0 : Information

Explanation:
WebSphere MQ queue manager <insert_5> started.

Response:
None.

AMQ8004WebSphere MQ queue manager <insert_5> ended.
Severity:
0 : Information

Explanation:
WebSphere MQ queue manager <insert_5> ended.

Response:
None.

AMQ8005WebSphere MQ queue manager changed.
Severity:
0 : Information

Explanation:
WebSphere MQ queue manager <insert_5> changed.

Response:
None.

AMQ8006WebSphere MQ queue created.
Severity:
0 : Information

Explanation:
WebSphere MQ queue <insert_5> created.

Response:
None.

AMQ8007WebSphere MQ queue deleted.
Severity:
0 : Information

Explanation:
WebSphere MQ queue <insert_5> deleted.

Response:
None.

AMQ8008WebSphere MQ queue changed.
Severity:
0 : Information

Explanation:
WebSphere MQ queue <insert_5> changed.

Response:
None.

AMQ8009 (iSeries)WebSphere MQ queue created by copying.
Severity:
0 : Information

Explanation:
Queue <insert_5> created in library <insert_3> by copying.

Response:
None.

AMQ8010WebSphere MQ process created.
Severity:
0 : Information

Explanation:
WebSphere MQ process <insert_5> created.

Response:
None.

AMQ8011WebSphere MQ process deleted.
Severity:
0 : Information

Explanation:
WebSphere MQ process <insert_5> deleted.

Response:
None.

AMQ8012WebSphere MQ process changed.
Severity:
0 : Information

Explanation:
WebSphere MQ process <insert_5> changed.

Response:
None.

AMQ8013 (iSeries)WebSphere MQ process copied.
Severity:
0 : Information

Explanation:
Process <insert_5> created in library <insert_3> by copying.

Response:
None.

AMQ8014WebSphere MQ channel created.
Severity:
0 : Information

Explanation:
WebSphere MQ channel <insert_5> created.

Response:
None.

AMQ8015WebSphere MQ channel deleted.
Severity:
0 : Information

Explanation:
WebSphere MQ channel <insert_5> deleted.

Response:
None.

AMQ8016WebSphere MQ channel changed.
Severity:
0 : Information

Explanation:
WebSphere MQ channel <insert_5> changed.

Response:
None.

AMQ8017 (iSeries)WebSphere MQ channel copied.
Severity:
0 : Information

Explanation:
Channel <insert_5> created by copying.

Response:
None.

AMQ8018Start WebSphere MQ channel accepted.
Severity:
0 : Information

Explanation:
The channel <insert_5> is being started. The start channel function has been initiated. This involves a series of operations across the network before the channel is actually started. The channel status displays "BINDING" for a short period while communication protocols are negotiated with the channel with whom communication is being initiated.

Response:
None.

AMQ8019Stop WebSphere MQ channel accepted.
Severity:
0 : Information

Explanation:
The channel <insert_5> has been requested to stop.

Response:
None.

AMQ8020Ping WebSphere MQ channel complete.
Severity:
0 : Information

Explanation:
Ping channel <insert_5> complete.

Response:
None.

AMQ8021Request to start WebSphere MQ Listener accepted.
Severity:
0 : Information

Explanation:
The Request to start the Listener has been accepted and is being processed.

Response:
Should the request to start the listener be unsuccessful then information related to the error will be available in the queue manager error log. Once started the status of the listener may be monitored using the MQSC command 'DISPLAY LSSTATUS'. On iSeries the status of the listener may also be monitored using the 'WRKMQMLSR OPTION(*STATUS)' command.

AMQ8022WebSphere MQ queue cleared.
Severity:
0 : Information

Explanation:
All messages on queue <insert_5> have been deleted.

Response:
None.

AMQ8023WebSphere MQ channel reset.
Severity:
0 : Information

Explanation:
Channel <insert_5> has been reset.

Response:
None.

AMQ8024WebSphere MQ channel initiator started.
Severity:
0 : Information

Explanation:
The channel initiator for queue <insert_5> has been started.

Response:
None.

AMQ8025WebSphere MQ channel resolved.
Severity:
0 : Information

Explanation:
In doubt messages for WebSphere MQ channel <insert_5> have been resolved.

Response:
None.

AMQ8026End WebSphere MQ queue manager accepted.
Severity:
0 : Information

Explanation:
A controlled stop request has been initiated for queue manager <insert_5>.

Response:
None.

AMQ8027WebSphere MQ command server started.
Severity:
0 : Information

Explanation:
The command server has been started.

Response:
None.

AMQ8028WebSphere MQ command server ended.
Severity:
0 : Information

Explanation:
The command server has been stopped.

Response:
None.

AMQ8029WebSphere MQ authority granted.
Severity:
0 : Information

Explanation:
Authority for object <insert_5> granted.

Response:
None.

AMQ8030WebSphere MQ authority revoked.
Severity:
0 : Information

Explanation:
Authority for object <insert_5> revoked.

Response:
None.

AMQ8031 (iSeries)Message Queue Manager connected.
Severity:
0 : Information

Explanation:
The message queue manager has been connected.

Response:
None.

AMQ8032 (iSeries)Message Queue Manager disconnected.
Severity:
0 : Information

Explanation:
The message queue manager has been disconnected.

Response:
None.

AMQ8033WebSphere MQ object recreated.
Severity:
0 : Information

Explanation:
MQ object <insert_5> has been recreated from image.

Response:
None.

AMQ8034WebSphere MQ object image recorded.
Severity:
0 : Information

Explanation:
Image of MQ object <insert_5> has been recorded.

Response:
None.

AMQ8035WebSphere MQ Command Server Status . . : Running
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ8036WebSphere MQ command server status . . : Stopping
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ8037WebSphere MQ command server status . . : Starting
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ8038WebSphere MQ command server status . . : Running with queue disabled
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ8039WebSphere MQ command server status . . : Stopped
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ8040WebSphere MQ command server ending.
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ8041The queue manager cannot be restarted or deleted because processes, that were previously connected, are still running.
Severity:
40 : Stop Error

Explanation:
Processes, that were connected to the queue manager the last time it was running, are still active. The queue manager cannot be restarted.

Response:
Stop the processes and try to start the queue manager.

AMQ8041 (iSeries)The queue manager cannot be restarted or deleted.
Severity:
40 : Stop Error

Explanation:
Jobs that were connected to the queue manager the last time it was running, are still active. The queue manager cannot be restarted or deleted.

Response:
Use option 22 from WRKMQM to identify which jobs are connected to the queue manager. End the connected jobs and then retry the command.

AMQ8042Process <insert_1> is still running.
Severity:
0 : Information

AMQ8043Non runtime application attempted to connect to runtime only queue manager.
Severity:
0 : Information

Explanation:
A non runtime application attempted to connect to a queue manager on a node where support for non runtime applications has not been installed. The connect attempt will be rejected with a reason of MQRC_ENVIRONMENT_ERROR.

Response:
If the node is intended to support only runtime applications, investigate why a non runtime application has attempted to connect to the queue manager. If the node is intended to support non runtime only applications, investigate if the base option has been installed. The base option must be installed if non runtime applications are to run on this node.

AMQ8044 (Windows)An error occurred while removing the queue manager from the Active Directory.
Severity:
0 : Information

Explanation:
The attempt to remove the queue manager from the Windows Active Directory failed. This may be because the appropriate entry could not be opened or modified, or the Service Control Point has already been removed.

Response:
Check that your account has the authority to delete objects from the Active Directory, and that the entry has not already been deleted.

AMQ8045 (Windows)An error occurred while removing the queue manager from the Service Control Manager.
Severity:
0 : Information

Explanation:
The attempt to remove the queue manager from the Windows registry failed. This may be because the appropriate entry could not be opened or modified.

Response:
Check that the registry has sufficient space on your disk to grow and that your account has the authority to modify it.

AMQ8046Migrating objects for <insert_3>.
Severity:
0 : Information

Response:
None.

AMQ8047Channel migration statistics : <insert_1> migrated. <insert_2> failed.
Severity:
0 : Information

Explanation:
Information on the number of channel objects migrated from previous versions of WebSphere MQ channel definitions as well as any failures that occurred.

Response:
None.

AMQ8048Default objects statistics : <insert_1> created. <insert_2> replaced. <insert_3> failed.
Severity:
0 : Information

Explanation:
Information on the number of objects created or replaced successfully as well as any failures that occurred while creating the default objects.

Response:
None.

AMQ8049Object <insert_4>. Unable to create or replace.
Severity:
20 : Error

Explanation:
While creating or replacing the default object <insert_4> for WebSphere MQ queue manager <insert_5> an error occurred. The error was due to improper authorization. The reason code is <insert_1>.

Response:
Check this log for more details of what the problem may be. Make sure there are sufficient resources such as disk space and storage. For damaged or corrupted objects, replace these from backup objects. If all else fails, delete the queue manager <insert_5> using dltmqm and create it again using crtmqm.

AMQ8050Creating or replacing default objects for <insert_3>.
Severity:
0 : Information

Response:
None.

AMQ8051For details of the failures that occurred, please check AMQERR01.LOG.
Severity:
0 : Information

Response:
None.

AMQ8052Completing setup.
Severity:
0 : Information

Response:
None.

AMQ8053Object <insert_4>. Unable to create or replace.
Severity:
20 : Error

Explanation:
While creating or replacing the default object <insert_4> for WebSphere MQ queue manager <insert_5> an error occurred. The error was due to a broken connection. The reason code is <insert_1>.

Response:
Check this log for more details of what the problem may be. Make sure there is sufficient resources such as disk space and storage. For damaged or corrupted objects, replace these from backup objects. If all else fails, delete the queue manager <insert_5> using dltmqm and create it again using crtmqm.

AMQ8054Object <insert_4>. Unable to create or replace.
Severity:
20 : Error

Explanation:
While creating or replacing the default object <insert_4> for WebSphere MQ queue manager <insert_5> an error occurred. The error was due to unavailable storage. The reason code is <insert_1>.

Response:
Check this log for more details of what the problem may be. Make sure there is sufficient resources such as disk space and storage. For damaged or corrupted objects, replace these from backup objects. If all else fails, delete the queue manager <insert_5> using dltmqm and create it again using crtmqm.

AMQ8055Object <insert_4>. Unable to create or replace.
Severity:
20 : Error

Explanation:
While creating or replacing the default object <insert_4> for WebSphere MQ queue manager <insert_5> an error occurred. The error was due to a damaged object. The reason code is <insert_1>.

Response:
Check this log for more details of what the problem may be. Make sure there is sufficient resources such as disk space and storage. For damaged or corrupted objects, replace these from backup objects. If all else fails, delete the queue manager <insert_5> using dltmqm and create it again using crtmqm.

AMQ8056Object <insert_4>. Unable to create or replace.
Severity:
20 : Error

Explanation:
While creating or replacing the default object <insert_4> for WebSphere MQ queue manager <insert_5> an error occurred. The error was due to a channel definition error. The error code is <insert_1> (X<insert_2>).

Response:
Check this log for more details of what the problem may be. Make sure there is sufficient resources such as disk space and storage. For damaged or corrupted objects, replace these from backup objects. If all else fails, delete the queue manager <insert_5> using dltmqm and create it again using crtmqm.

AMQ8057Object <insert_4>. Unable to create or replace.
Severity:
20 : Error

Explanation:
While creating or replacing the default object <insert_4> for WebSphere MQ queue manager <insert_5> an error occurred. The error was due to invalid records in the channel definition file. The error code is <insert_1> (X<insert_2>).

Response:
Check this log for more details of what the problem may be. Make sure there is sufficient resources such as disk space and storage. For damaged or corrupted objects, replace these from backup objects. If all else fails, delete the queue manager <insert_5> using dltmqm and create it again using crtmqm.

AMQ8058Object <insert_4>. Unable to create or replace.
Severity:
20 : Error

Explanation:
While creating or replacing the default object <insert_4> for WebSphere MQ queue manager <insert_5> an error occurred. The error was due to not finding the channel definition file. The error code is <insert_1> (X<insert_2>).

Response:
Check this log for more details of what the problem may be. Make sure there is sufficient resources such as disk space and storage. For damaged or corrupted objects, replace these from backup objects. If all else fails, delete the queue manager <insert_5> using dltmqm and create it again using crtmqm.

AMQ8059Object <insert_4>. Unable to create or replace.
Severity:
20 : Error

Explanation:
While creating or replacing the default object <insert_4> for WebSphere MQ queue manager <insert_5> an error occurred. The error was due to an unexpected error, error code <insert_1> (X<insert_2>).

Response:
Check this log for more details of what the problem may be. Make sure there is sufficient resources such as disk space and storage. For damaged or corrupted objects, replace these from backup objects. If all else fails, delete the queue manager <insert_5> using dltmqm and create it again using crtmqm.

AMQ8061 (Windows)Command <insert_4> is not valid.
Severity:
10 : Warning

Explanation:
The command <insert_4> at line <insert_1> in the WebSphere MQ service command file <insert_3> for queue manager <insert_5> is not valid for use in the service command file. The line is ignored.

Response:
Check the contents of the file and retry the operation.

AMQ8062 (Windows)Unexpected return code, <insert_1>, from command <insert_3>.
Severity:
10 : Warning

Explanation:
An unexpected return code, <insert_1>, was returned by command <insert_3>. This command was issued by the WebSphere MQ service for queue manager <insert_4>.

Response:
Verify that the command and parameters are correct.

AMQ8063 (Windows)Not authorized to issue command <insert_3>.
Severity:
20 : Error

Explanation:
The current user <insert_5> is not authorized to issue the command <insert_3>. The command is ignored.

Response:
Add the user to the local 'mqm' security group and retry the operation.

AMQ8064 (Windows)Not authorized to start trusted application.
Severity:
20 : Error

Explanation:
The user <insert_5> is not authorized to start the trusted application <insert_3>. The application has not started.

Response:
Add the user to the local 'mqm' security group and restart the application.

AMQ8065 (Windows)Local group <insert_3> not found.
Severity:
20 : Error

Explanation:
The local group <insert_3> is unavailable. It is not possible to verify that the user is authorized. The function cannot continue.

Response:
Create the required local group and retry the operation.

AMQ8066 (Windows)Local mqm group not found.
Severity:
20 : Error

Explanation:
The local mqm group is unavailable. It is not possible to verify that the user is authorized. The function cannot continue.

Response:
Create the local mqm group and retry the operation.

AMQ8067WebSphere MQ channel auto-defined.
Severity:
0 : Information

Explanation:
Channel <insert_5> auto-defined.

Response:
None.

AMQ8068Setup completed.
Severity:
0 : Information

Response:
None.

AMQ8069ApplicationGroup for the crtmqm command does not contain the mqm userid.
Severity:
40 : Stop Error

Explanation:
WebSphere MQ queue manager <insert_5> not created. The ApplicationGroup specified for the crtmqm command must contain the mqm userid when the RestrictedMode option (-g) is specified.

Response:
None.

AMQ8070ApplicationGroup for crtmqm command is not defined.
Severity:
40 : Stop Error

Explanation:
WebSphere MQ queue manager <insert_5> not created. RestrictedMode option (-g) specified, but the ApplicationGroup does not exist.

Response:
None.

AMQ8071RestrictedMode option not supported on this platform.
Severity:
40 : Stop Error

Explanation:
WebSphere MQ queue manager <insert_5> not created. The RestrictedMode option was specified but is not supported on this platform.

Response:
None.

AMQ8072 (Windows)Not authorized to administer channels.
Severity:
10 : Warning

Explanation:
The command server for queue manager <insert_3> received an administration command for channels. The user <insert_5> is not authorized to administer WebSphere MQ channels. The command server has not processed the command.

Response:
Add the user to the local 'mqm' security group, and ensure that the security policy is set as required.

AMQ8073 (Windows)Authorization failed because SID: (<insert_3>) could not be resolved.
Severity:
10 : Warning

Explanation:
The Object Authority Manager was unable to resolve the specified SID into entity and domain information.

Response:
Ensure that the application provides a SID that is recognized on this system, that all necessary domain controllers are available, and that the security policy is set as you required.

AMQ8074 (Windows)Authorization failed as the SID <insert_3> does not match the entity <insert_4>.
Severity:
10 : Warning

Explanation:
The Object Authority Manager received inconsistent data - the supplied SID does not match that of the supplied entity information.

Response:
Ensure that the application is supplying valid entity and SID information.

AMQ8075 (Windows)Authorization failed because the SID for entity <insert_3> cannot be obtained.
Severity:
10 : Warning

Explanation:
The Object Authority Manager was unable to obtain a SID for the specified entity.

Response:
Ensure that the entity is valid, and that all necessary domain controllers are available.

AMQ8076 (Windows)Authorization failed because no SID was supplied for entity <insert_3>.
Severity:
10 : Warning

Explanation:
The Object Authority Manager was not supplied with SID information for the specified entity, and the security policy is set to 'NTSIDsRequired'.

Response:
Ensure that the application is supplying a valid SID, and that the security policy is set as you require.

AMQ8077 (Windows)Entity <insert_3> has insufficient authority to access object <insert_4>.
Severity:
10 : Warning

Explanation:
The specified entity is not authorized to access the required object. The following requested permissions are unauthorized: <insert_5>

Response:
Ensure that the correct level of authority has been set for this entity against the required object, or ensure that the entity is a member of a privileged group.

AMQ8078Waiting for queue manager <insert_3> to end.
Severity:
0 : Information

Response:
None.

AMQ8079 (iSeries)WebSphere MQ trigger monitor job started.
Severity:
0 : Information

Explanation:
The message queue manager trigger monitor job has been started for queue manager <insert_5> to process messages on the selected initiation queue. See previously issued messages for job details.'

Response:
None.

AMQ8079 (Windows)Access was denied when attempting to retrieve group membership information for user <insert_3>.
Severity:
10 : Warning

Explanation:
WebSphere MQ, running with the authority of user <insert_4>, was unable to retrieve group membership information for the specified user.

Response:
Ensure Active Directory access permissions allow user <insert_4> to read group memberships for user <insert_3>. To retrieve group membership information for a domain user, MQ must run with the authority of a domain user.

AMQ8080 (iSeries)WebSphere MQ trigger monitor job start failed.
Severity:
40 : Stop Error

Explanation:
Message queue manager trigger job failed to start for manager <insert_5>. Failure reason code is <insert_2>. See previously issued messages for more information.'

Response:
None.

AMQ8081 (Windows)Not authorized to administer queue managers.
Severity:
10 : Warning

Explanation:
The command server for queue manager <insert_3> received an administration command for a queue manager. The user <insert_5> is not authorized to administer WebSphere MQ queue managers. The command server has not processed the command.

Response:
Add the user to the local 'mqm' security group, and ensure that the security policy is set as required.

AMQ8082 (Windows)Not authorized to administer clusters.
Severity:
10 : Warning

Explanation:
The command server for queue manager <insert_3> received an administration command for clusters. The user <insert_5> is not authorized to administer WebSphere MQ clusters. The command server has not processed the command.

Response:
Add the user to the local 'mqm' security group, and ensure that the security policy is set as required.

AMQ8083WebSphere MQ queue manager <insert_3> starting.
Severity:
0 : Information

Explanation:
WebSphere MQ queue manager <insert_3> starting.

Response:
None.

AMQ8084WebSphere MQ connection not found.
Severity:
40 : Stop Error

Explanation:
The connection specified does not exist.

Response:
Correct the connection name and then try the command again.

AMQ8085WebSphere MQ queue manager <insert_3> is being started for replay.
Severity:
0 : Information

Explanation:
WebSphere MQ queue manager <insert_3> is being started for replay. The strmqm command has been issued with the '-r' option. see the WebSphere MQ System Administration documentation for details.

Response:
None.

AMQ8086WebSphere MQ queue manager <insert_3> is being activated.
Severity:
0 : Information

Explanation:
WebSphere MQ queue manager <insert_3> is being activated. The strmqm command has been issued with the '-a' option. see the WebSphere MQ System Administration documentation for details.

Response:
None.

AMQ8086 (iSeries)WebSphere MQ queue manager <insert_3> is being activated.
Severity:
0 : Information

Explanation:
WebSphere MQ queue manager <insert_3> is being activated. The STRMQM command has been issued with the ACTIVATE(*YES) option. see the WebSphere MQ System Administration documentation for further details.

Response:
None.

AMQ8087Attempt to migrate listener <insert_3> to a QM object failed with <insert_1>.
Severity:
20 : Error

Explanation:
Whilst processing legacy services, listener <insert_3> could not be migrated to an MQ object named <insert_4>, the object creation failed with <insert_1>.

Response:
Save the generated output files and contact your IBM support center.

AMQ8088Attempt to migrate trigger monitor <insert_3> to a QM object failed with <insert_1>.
Severity:
20 : Error

Explanation:
Whilst processing legacy services, trigger monitor <insert_3> could not be migrated to an MQ object named <insert_4>, the object creation failed with <insert_1>.

Response:
Save the generated output files and contact your IBM support center.

AMQ8089Attempt to migrate channel service <insert_3> to a QM object failed with <insert_1>.
Severity:
20 : Error

Explanation:
Whilst processing legacy services, channel service <insert_3> could not be migrated to an MQ object named <insert_4>, the object creation failed with <insert_1>.

Response:
Save the generated output files and contact your IBM support center.

AMQ8090Attempt to migrate channel initiator <insert_3> to a QM object failed with <insert_1>.
Severity:
20 : Error

Explanation:
Whilst processing legacy services, channel initiator <insert_3> could not be migrated to an MQ object named <insert_4>, the object creation failed with <insert_1>.

Response:
Save the generated output files and contact your IBM support center.

AMQ8091Attempt to migrate custom service <insert_3> to a QM object failed with <insert_1>.
Severity:
20 : Error

Explanation:
Whilst processing legacy services, custom service <insert_3> could not be migrated to an MQ object named <insert_4>, the object creation failed with <insert_1>.

Response:
Save the generated output files and contact your IBM support center.

AMQ8092Service migration statistics : <insert_1> migrated. <insert_2> failed.
Severity:
0 : Information

Explanation:
Information on the number of service objects migrated from previous versions of WebSphere MQ for Windows services as well as any failures that occurred.

Response:
None.

AMQ8101WebSphere MQ error (<insert_1>) has occurred.
Severity:
40 : Stop Error

Explanation:
An unexpected reason code with hexadecimal value <insert_1> was received from the WebSphere MQ queue manager during command processing. (Note that hexadecimal values in the range X'07D1'-X'0BB7' correspond to MQI reason codes 2001-2999.) More information might be available in the log. If the reason code value indicates that the error was associated with a particular parameter, the parameter concerned is <insert_4>.

Response:
Correct the error and then try the command again.

AMQ8102WebSphere MQ object name specified in <insert_4> not valid.
Severity:
30 : Severe error

Explanation:
The object name <insert_5> specified in <insert_4> is not valid. The length of the name must not exceed 48 characters, or 20 characters if it is a channel name. The name should contain the following characters only: lowercase a-z, uppercase A-Z, numeric 0-9, period (.), forward slash (/), underscore (_) and percent sign (%).

Response:
Change the length of the parameter value or change the parameter value to contain a valid combination of characters, then try the command again.

AMQ8103Insufficient storage available.
Severity:
40 : Stop Error

Explanation:
There was insufficient storage available to perform the requested operation.

Response:
Free some storage and then try the command again.

AMQ8104WebSphere MQ directory <insert_3> not found.
Severity:
40 : Stop Error

Explanation:
Directory <insert_3> was not found. This directory is created when WebSphere MQ is installed successfully. Refer to the log for more information.

Response:
Verify that installation of WebSphere MQ was successful. Correct the error and then try the command again.

AMQ8105Object error.
Severity:
40 : Stop Error

Explanation:
An object error occurred. Refer to the log for more information.

Response:
Correct the error and then try the command again.

AMQ8106WebSphere MQ queue manager being created.
Severity:
0 : Information

Explanation:
The queue manager is being created.

Response:
Wait for the creation process to complete and then try the command again.

AMQ8107WebSphere MQ queue manager running.
Severity:
10 : Warning

Explanation:
The queue manager is running.

Response:
None.

AMQ8108WebSphere MQ queue manager <insert_3> ending.
Severity:
10 : Warning

Explanation:
The queue manager <insert_3> is ending.

Response:
Wait for the queue manager to end and then try the command again.

AMQ8109WebSphere MQ queue manager being deleted.
Severity:
40 : Stop Error

Explanation:
The queue manager is being deleted.

Response:
Wait for the deletion process to complete.

AMQ8110WebSphere MQ queue manager already exists.
Severity:
40 : Stop Error

Explanation:
The queue manager <insert_5> already exists.

Response:
None.

AMQ8111 (iSeries)Message Queue Manager exists under a different name.
Severity:
30 : Severe error

Explanation:
A message queue manager exists with a name different from the value <insert_5> specified in <insert_2>.

Response:
Change the parameter value to the name of the existing message queue manager and then try the command again.

AMQ8112 (iSeries)PRCNAME not allowed with queue type *ALS or *RMT.
Severity:
40 : Stop Error

Explanation:
The PRCNAME parameter may not be specified for a queue of type *ALS or *RMT.

Response:
Remove the PRCNAME parameter from the command or, if the command is CRTMQMQ, specify a different value for QTYPE. Then try the command again.

AMQ8113 (iSeries)TRGENBL not allowed with queue type *ALS or *RMT.
Severity:
40 : Stop Error

Explanation:
The TRGENBL parameter may not be specified for a queue of type *ALS or *RMT.

Response:
Remove the TRGENBL parameter from the command or, if the command is CRTMQMQ, specify a different value for QTYPE. Then try the command again.

AMQ8114 (iSeries)GETENBL not allowed with queue type *RMT.
Severity:
40 : Stop Error

Explanation:
The GETENBL parameter may not be specified for a queue of type *RMT.

Response:
Remove the GETENBL parameter from the command or, if the command is CRTMQMQ, specify a different value for QTYPE. Then try the command again.

AMQ8115 (iSeries)SHARE not allowed with queue type *ALS or *RMT.
Severity:
40 : Stop Error

Explanation:
The SHARE parameter may not be specified for a queue of type *ALS or *RMT.

Response:
Remove the SHARE parameter from the command or, if the command is CRTMQMQ, specify a different value for QTYPE. Then try the command again.

AMQ8116 (iSeries)MSGDLYSEQ not allowed with queue type *ALS or *RMT.
Severity:
40 : Stop Error

Explanation:
The MSGDLYSEQ parameter may not be specified for a queue of type *ALS or *RMT.

Response:
Remove the MSGDLYSEQ parameter from the command or, if the command is CRTMQMQ, specify a different value for QTYPE. Then try the command again.

AMQ8117WebSphere MQ queue manager deletion incomplete.
Severity:
40 : Stop Error

Explanation:
Deletion of queue manager <insert_5> was only partially successful. An object was not found, or could not be deleted. Refer to the log for more information.

Response:
Delete any remaining queue manager objects.

AMQ8118WebSphere MQ queue manager does not exist.
Severity:
40 : Stop Error

Explanation:
The queue manager <insert_5> does not exist.

Response:
Either create the queue manager (crtmqm command) or correct the queue manager name used in the command and then try the command again.

AMQ8119 (iSeries)TRGTYPE not allowed with queue type *ALS or *RMT.
Severity:
40 : Stop Error

Explanation:
The TRGTYPE parameter may not be specified for a queue of type *ALS or *RMT.

Response:
Remove the TRGTYPE parameter from the command or, if the command is CRTMQMQ, specify a different value for QTYPE. Then try the command again.

AMQ8120 (iSeries)TRGDEPTH not allowed with queue type *ALS or *RMT.
Severity:
40 : Stop Error

Explanation:
The TRGDEPTH parameter may not be specified for a queue of type *ALS or *RMT.

Response:
Remove the TRGDEPTH parameter from the command or, if the command is CRTMQMQ, specify a different value for QTYPE. Then try the command again.

AMQ8121 (iSeries)TRGMSGPTY not allowed with queue type *ALS or *RMT.
Severity:
40 : Stop Error

Explanation:
The TRGMSGPTY parameter may not be specified for a queue of type *ALS or *RMT.

Response:
Remove the TRGMSGPTY parameter from the command or, if the command is CRTMQMQ, specify a different value for QTYPE. Then try the command again.

AMQ8122 (iSeries)TRGDATA not allowed with queue type *ALS or *RMT.
Severity:
40 : Stop Error

Explanation:
The TRGDATA parameter may not be specified for a queue of type *ALS or *RMT.

Response:
Remove the TRGDATA parameter from the command or, if the command is CRTMQMQ, specify a different value for QTYPE. Then try the command again.

AMQ8123 (iSeries)RTNITV not allowed with queue type *ALS or *RMT.
Severity:
40 : Stop Error

Explanation:
The RTNITV parameter may not be specified for a queue of type *ALS or *RMT.

Response:
Remove the RTNITV parameter from the command or, if the command is CRTMQMQ, specify a different value for QTYPE. Then try the command again.

AMQ8124 (iSeries)MAXMSGLEN not allowed with queue type *ALS or *RMT.
Severity:
40 : Stop Error

Explanation:
The MAXMSGLEN parameter may not be specified for a queue of type *ALS or *RMT.

Response:
Remove the MAXMSGLEN parameter from the command or, if the command is CRTMQMQ, specify a different value for QTYPE. Then try the command again.

AMQ8125 (iSeries)BKTTHLD not allowed with queue type *ALS or *RMT.
Severity:
40 : Stop Error

Explanation:
The BKTTHLD parameter may not be specified for a queue of type *ALS or *RMT.

Response:
Remove the BKTTHLD parameter from the command or, if the command is CRTMQMQ, specify a different value for QTYPE. Then try the command again.

AMQ8126 (iSeries)BKTQNAME not allowed with queue type *ALS or *RMT.
Severity:
40 : Stop Error

Explanation:
The BKTQNAME parameter may not be specified for a queue of type *ALS or *RMT.

Response:
Remove the BKTQNAME parameter from the command or, if the command is CRTMQMQ, specify a different value for QTYPE. Then try the command again.

AMQ8127 (iSeries)INITQNAME not allowed with queue type *ALS or *RMT.
Severity:
40 : Stop Error

Explanation:
The INITQNAME parameter may not be specified for a queue of type *ALS or *RMT.

Response:
Remove the INITQNAME parameter from the command or, if the command is CRTMQMQ, specify a different value for QTYPE. Then try the command again.

AMQ8128 (iSeries)USAGE not allowed with queue type *ALS or *RMT.
Severity:
40 : Stop Error

Explanation:
The USAGE parameter may not be specified for a queue of type *ALS or *RMT.

Response:
Remove the USAGE parameter from the command or, if the command is CRTMQMQ, specify a different value for QTYPE. Then try the command again.

AMQ8129 (iSeries)DFNTYPE only allowed with queue type *MDL.
Severity:
40 : Stop Error

Explanation:
The DFNTYPE parameter may only be specified for a queue of type *MDL.

Response:
Remove the DFNTYPE parameter from the command or, if the command is CRTMQMQ, specify a different value for QTYPE. Then try the command again.

AMQ8130 (iSeries)TGTQNAME only allowed with queue type *ALS.
Severity:
40 : Stop Error

Explanation:
The TGTQNAME parameter may only be specified for a queue of type *ALS.

Response:
Remove the TGTQNAME parameter from the command or, if the command is CRTMQMQ, specify a different value for QTYPE. Then try the command again.

AMQ8131 (iSeries)RMTQNAME only allowed with queue type *RMT.
Severity:
40 : Stop Error

Explanation:
The RMTQNAME parameter may only be specified for a queue of type *RMT.

Response:
Remove the RMTQNAME parameter from the command or, if the command is CRTMQMQ, specify a different value for QTYPE. Then try the command again.

AMQ8132 (iSeries)RMTMQMNAME only allowed with queue type *RMT.
Severity:
40 : Stop Error

Explanation:
The RMTMQMNAME parameter may only be specified for a queue of type *RMT.

Response:
Remove the RMTMQMNAME parameter from the command or, if the command is CRTMQMQ, specify a different value for QTYPE. Then try the command again.

AMQ8133 (iSeries)TMQNAME only allowed with queue type *RMT.
Severity:
40 : Stop Error

Explanation:
The TMQNAME parameter may only be specified for a queue of type *RMT.

Response:
Remove the TMQNAME parameter from the command or, if the command is CRTMQMQ, specify a different value for QTYPE. Then try the command again.

AMQ8134 (iSeries)HDNBKTCNT not allowed with queue type *ALS or *RMT.
Severity:
40 : Stop Error

Explanation:
The HDNBKTCNT parameter may not be specified for a queue of type *ALS or *RMT.

Response:
Remove the HDNBKTCNT parameter from the command or, if the command is CRTMQMQ, specify a different value for QTYPE. Then try the command again.

AMQ8135Not authorized.
Severity:
40 : Stop Error

Explanation:
You are not authorized to perform the requested operation for the WebSphere MQ object <insert_5> specified in <insert_3>. Either you are not authorized to perform the requested operation, or you are not authorized to the specified MQ object. For a copy command, you may not be authorized to the specified source MQ object, or, for a create command, you may not be authorized to the system default MQ object of the specified type.

Response:
Obtain the necessary authority from your security officer or WebSphere MQ administrator. Then try the command again.

AMQ8136 (iSeries)Error detected by prompt control program.
Severity:
30 : Severe error

Explanation:
A prompt control program detected errors.

Response:
See the previously listed messages in the job log. Correct the errors and then prompt for the command again.

AMQ8137WebSphere MQ queue manager already starting.
Severity:
40 : Stop Error

Explanation:
The strmqm command was unsuccessful because the queue manager <insert_5> is already starting.

Response:
Wait for the strmqm command to complete.

AMQ8138The WebSphere MQ queue has an incorrect type.
Severity:
40 : Stop Error

Explanation:
The operation is not valid with queue <insert_5> because it is not a local queue.

Response:
Change the QNAME parameter to specify a queue of the correct type.

AMQ8139Already connected.
Severity:
10 : Warning

Explanation:
A connection to the WebSphere MQ queue manager already exists.

Response:
None.

AMQ8140Resource timeout error.
Severity:
40 : Stop Error

Explanation:
A timeout occurred in the communication between internal WebSphere MQ queue manager components. This is most likely to occur when the system is heavily loaded.

Response:
Wait until the system is less heavily loaded, then try the command again.

AMQ8141WebSphere MQ queue manager starting.
Severity:
40 : Stop Error

Explanation:
The queue manager <insert_5> is starting.

Response:
Wait for the queue manager startup process to complete and then try the command again.

AMQ8142WebSphere MQ queue manager stopped.
Severity:
40 : Stop Error

Explanation:
The queue manager <insert_5> is stopped.

Response:
Use the strmqm command to start the queue manager, and then try the command again.

AMQ8143WebSphere MQ queue not empty.
Severity:
40 : Stop Error

Explanation:
The queue <insert_5> specified in <insert_2> is not empty or contains uncommitted updates.

Response:
Commit or roll back any uncommitted updates. If the command is DELETE QLOCAL, use the CLEAR QLOCAL command to clear the messages from the queue. Then try the command again.

AMQ8144Log not available.
Severity:
40 : Stop Error

Explanation:
The WebSphere MQ logging resource is not available.

Response:
Use the dltmqm command to delete the queue manager and then the crtmqm command to create the queue manager. Then try the command again.

AMQ8145Connection broken.
Severity:
40 : Stop Error

Explanation:
The connection to the WebSphere MQ queue manager failed during command processing. This may be caused by an endmqm command being issued by another user, or by a queue manager error.

Response:
Use the strmqm command to start the message queue manager, wait until the message queue manager has started, and try the command again.

AMQ8146WebSphere MQ queue manager not available.
Severity:
40 : Stop Error

Explanation:
The queue manager is not available because it has been stopped or has not been created.

Response:
Use the crtmqm command to create the message queue manager, or the strmqm command to start the message queue manager as necessary. Then try the command again.

AMQ8146 (iSeries)WebSphere MQ queue manager not available.
Severity:
40 : Stop Error

Explanation:
The queue manager is not available because it has been stopped or has not been created.

Response:
Use the CRTMQM command to create the message queue manager or the STRMQM command to start the message queue manager as necessary, then retry the command. If a queue manager was not specified, ensure that a default queue manager has been created and is started using the WRKMQM command.

AMQ8147WebSphere MQ object <insert_3> not found.
Severity:
40 : Stop Error

Explanation:
If the command entered was Change or Display, the object <insert_3> specified does not exist. If the command entered was Copy, the source object does not exist. If the command entered was Create, the system default MQ object of the specified type does not exist.

Response:
Correct the object name and then try the command again or, if you are creating a new queue or process object, either specify all parameters explicitly or ensure that the system default object of the required type exists. The system default queue names are SYSTEM.DEFAULT.LOCAL.QUEUE, SYSTEM.DEFAULT.ALIAS.QUEUE and SYSTEM.DEFAULT.REMOTE.QUEUE. The system default process name is SYSTEM.DEFAULT.PROCESS.

AMQ8147 (iSeries)WebSphere MQ object <insert_5> not found.
Severity:
40 : Stop Error

Explanation:
If the command entered was Change or Display, the MQ object <insert_5> specified does not exist. If the command entered was Copy, the source MQ object does not exist. If the command entered was Create, the system default MQ object of the specified type does not exist.

Response:
Correct the MQ object name and then try the command again or, if you are creating a new MQ queue or process object, either specify all parameters explicitly or ensure that the system default object of the required type exists. The system default queue names are SYSTEM.DEFAULT.LOCAL.QUEUE, SYSTEM.DEFAULT.ALIAS.QUEUE and SYSTEM.DEFAULT.REMOTE.QUEUE. The system default process name is SYSTEM.DEFAULT.PROCESS.

AMQ8148WebSphere MQ object in use.
Severity:
40 : Stop Error

Explanation:
The object <insert_5> specified in <insert_3> is in use by an MQ application program.

Response:
Wait until the object is no longer in use and then try the command again. If the command is ALTER or CHANGE, specify FORCE to force the processing of the object regardless of any application program affected by the change. If the object is the dead-letter queue and the open input count is nonzero, it may be in use by an MQ channel. If the object is another queue object with a nonzero open output count, it may be in use by a MQ channel (of type RCVR or RQSTR). In either case, use the STOP CHANNEL and START CHANNEL commands to stop and restart the channel in order to solve the problem. To alter the queue USAGE the FORCE option must be used if the queue is not empty.

AMQ8149WebSphere MQ object damaged.
Severity:
40 : Stop Error

Explanation:
The object <insert_5> specified in <insert_4> is damaged.

Response:
The object contents are not valid. Issue the DISPLAY CHANNEL, DISPLAY QUEUE, or DISPLAY PROCESS command, as required, to determine the name of the damaged object. Issue the DEFINE command, for the appropriate object type, to replace the damaged object, then try the command again.

AMQ8150WebSphere MQ object already exists.
Severity:
40 : Stop Error

Explanation:
The object <insert_5> specified for <insert_3> could not be created because it already exists.

Response:
Check that the name is correct and try the command again specifying REPLACE, or delete the object. Then try the command again.

AMQ8151WebSphere MQ object has different type.
Severity:
40 : Stop Error

Explanation:
The type specified for object <insert_5> is different from the type of the object being altered or defined.

Response:
Use the correct MQ command for the object type, and then try the command again.

AMQ8152Source WebSphere MQ object has different type.
Severity:
40 : Stop Error

Explanation:
The type of the source object is different from that specified.

Response:
Correct the name of the command, or source object name, and then try the command again, or try the command using the REPLACE option.

AMQ8153Insufficient disk space for the specified queue.
Severity:
40 : Stop Error

Explanation:
The command failed because there was insufficient disk space available for the specified queue.

Response:
Release some disk space and then try the command again.

AMQ8154API exit load error.
Severity:
40 : Stop Error

Explanation:
The WebSphere MQ queue manager was unable to load the API crossing exit.

Response:
Ensure that the API crossing exit program is valid, and that its name and directory are correctly specified. Correct any error and then try the command again.

AMQ8155Connection limit exceeded.
Severity:
40 : Stop Error

Explanation:
The queue manager connection limit has been exceeded.

Response:
The maximum limit on the number of WebSphere MQ application programs that may be connected to the queue manager has been exceeded. Try the command later.

AMQ8156WebSphere MQ queue manager quiescing.
Severity:
40 : Stop Error

Explanation:
The queue manager is quiescing.

Response:
The queue manager was stopping with -c specified for endmqm. Wait until the queue manager has been restarted and then try the command again.

AMQ8157Security error.
Severity:
40 : Stop Error

Explanation:
An error was reported by the security manager program.

Response:
Inform your systems administrator, wait until the problem has been corrected, and then try the command again.

AMQ8158 (iSeries)API exit not found.
Severity:
40 : Stop Error

Explanation:
The API crossing exit program was not found.

Response:
Ensure that the API crossing exit program for the MQI exists, and that its name and library are correctly specified. Correct any errors and then try the command again.

AMQ8159 (iSeries)MAXDEPTH not allowed with queue type *ALS or *RMT.
Severity:
40 : Stop Error

Explanation:
The MAXDEPTH parameter may not be specified for a queue of type *ALS or *RMT.

Response:
Remove the MAXDEPTH parameter from the command or, if the command is CRTMQMQ, specify a different value for QTYPE. Then try the command again.

AMQ8160 (iSeries)DFTSHARE not allowed with queue type *ALS or *RMT.
Severity:
40 : Stop Error

Explanation:
The DFTSHARE parameter may not be specified for a queue of type *ALS or *RMT.

Response:
Remove the DFTSHARE parameter from the command or, if the command is CRTMQMQ, specify a different value for QTYPE. Then try the command again.

AMQ8161 (iSeries)AUT(*MQMPASSID) only allowed with OBJTYPE(*ADM).
Severity:
40 : Stop Error

Explanation:
AUT(*MQMPASSID) may only be specified with OBJTYPE(*ADM).

Response:
Change the AUT parameter to specify another value and then try the command again.

AMQ8162 (iSeries)AUT(*MQMPASSALL) only allowed with OBJTYPE(*ADM).
Severity:
40 : Stop Error

Explanation:
AUT(*MQMPASSALL) may only be specified with OBJTYPE(*ADM).

Response:
Change the AUT parameter to specify another value and then try the command again.

AMQ8163 (iSeries)AUT(*MQMSETID) only allowed with OBJTYPE(*ADM).
Severity:
40 : Stop Error

Explanation:
AUT(*MQMSETID) may only be specified with OBJTYPE(*ADM).

Response:
Change the AUT parameter to specify another value and then try the command again.

AMQ8164 (iSeries)AUT(*MQMSETALL) only allowed with OBJTYPE(*ADM).
Severity:
40 : Stop Error

Explanation:
AUT(*MQMSETALL) may only be specified with OBJTYPE(*ADM).

Response:
Change the AUT parameter to specify another value and then try the command again.

AMQ8165 (iSeries)AUT(*MQMALTUSR) only allowed with OBJTYPE(*ADM).
Severity:
40 : Stop Error

Explanation:
AUT(*MQMALTUSR) may only be specified with OBJTYPE(*ADM).

Response:
Change the AUT parameter to specify another value and then try the command again.

AMQ8166 (iSeries)WebSphere MQ reference object not found.
Severity:
40 : Stop Error

Explanation:
The object specified by the REFOBJ and REFOBJTYPE parameters does not exist.

Response:
Correct the reference object name and type, and then try the command again.

AMQ8167 (iSeries)Referenced object name not valid.
Severity:
30 : Severe error

Explanation:
The referenced object name specified in REFOBJ is not valid. The length of the name must not exceed 48 characters and the name should contain the following characters only: lowercase a-z, uppercase A-Z, numeric 0-9, period (.), forward slash (/), underscore (_) and percent sign (%).

Response:
Change the length of the parameter value or change the parameter value to contain a valid combination of characters. Then try the command again.

AMQ8168 (iSeries)User profile name for parameter USER not found.
Severity:
30 : Severe error

Explanation:
The user profile name specified for parameter USER could not be found on the system, and is not the special value *PUBLIC.

Response:
Correct the user profile name, or use the Create User Profile (CRTUSRPRF) command to create the user profile then try the request again.

AMQ8169 (iSeries)Authorization list for parameter AUTL does not exist.
Severity:
30 : Severe error

Explanation:
The authorization list specified for parameter AUTL does not exist. It may have been destroyed.

Response:
Either specify an authorization list that exists, or create the authorization list using the Create Authorization List (CRTAUTL) command. Try the request again.

AMQ8170 (iSeries)REFOBJTYPE(*OBJTYPE) and OBJTYPE(*ALL) cannot be used together.
Severity:
30 : Severe error

Explanation:
REFOBJTYPE(*OBJTYPE) can be specified only with a specific object type.

Response:
Change the REFOBJTYPE or OBJTYPE input value to a specific object type. Then try the Grant Authority (GRTMQMAUT) command again.

AMQ8171 (iSeries)Authority of *AUTL is only allowed with USER(*PUBLIC).
Severity:
30 : Severe error

Explanation:
AUT(*AUTL) was specified on either the Grant Authority (GRTMQMAUT) command or the Revoke Authority (RVKMQMAUT) command with the USER parameter not set to *PUBLIC. Only the authority for *PUBLIC can be deferred to the authorization list.

Response:
Change the AUT parameter to the authorities that are correct for the users or change the USER parameter to *PUBLIC. Then try the command again.

AMQ8172Already disconnected.
Severity:
10 : Warning

Explanation:
The MQI reason code of 2018 was returned from the WebSphere MQ queue manager in response to an MQDISC request issued during command processing.

Response:
None.

AMQ8173No processes to display.
Severity:
0 : Information

Explanation:
There are no matching processes defined on this system.

Response:
Using the DEFINE PROCESS command to create a process.

AMQ8174No queues to display.
Severity:
0 : Information

Explanation:
There are no matching queues defined on this system.

Response:
Use the appropriate command to define a queue of the type that you require, that is, DEFINE QALIAS, DEFINE QLOCAL, DEFINE QMODEL, or DEFINE QREMOTE.

AMQ8175 (iSeries)WebSphere MQ trace has started.
Severity:
0 : Information

Explanation:
The trace has started successfully.

Response:
None.

AMQ8176 (iSeries)WebSphere MQ trace has been written.
Severity:
0 : Information

Explanation:
The trace has been written successfully.

Response:
None.

AMQ8177 (iSeries)WebSphere MQ trace has stopped.
Severity:
0 : Information

Explanation:
The trace has stopped.

Response:
None.

AMQ8178 (iSeries)WebSphere MQ trace did not start.
Severity:
40 : Stop Error

Explanation:
The trace did not start successfully.

Response:
None.

AMQ8179 (iSeries)WebSphere MQ trace output error.
Severity:
40 : Stop Error

Explanation:
The trace was not output successfully.

Response:
None.

AMQ8180 (iSeries)WebSphere MQ trace end request failed.
Severity:
40 : Stop Error

Explanation:
Your request to end the trace was not successful.

Response:
None.

AMQ8181 (iSeries)No jobs to display.
Severity:
10 : Warning

Explanation:
There are no matching jobs running on this system.

Response:
Specify another job name from the STRMQMSRV command.

AMQ8182 (iSeries)WebSphere MQ trace already off.
Severity:
10 : Warning

Explanation:
An attempt was made to set trace off, but the trace is not active.

Response:
None.

AMQ8183 (iSeries)WebSphere MQ trace already running.
Severity:
10 : Warning

Explanation:
An attempt was made to start trace, but trace is already running.

Response:
Either leave trace running as it is, or, if you want to change the trace settings, turn trace off and then turn it on again with appropriate settings.

AMQ8184 (iSeries)Requested job cannot be found
Severity:
10 : Warning

Explanation:
The job specified cannot be found in the table that controls WebSphere MQ for iSeries trace. As a result no trace action can be performed.

Response:
Specify an appropriate job name.

AMQ8185Operating system object already exists.
Severity:
40 : Stop Error

Explanation:
The WebSphere MQ object cannot be created because an object that is not known to MQ already exists in the MQ directory with the name that should be used for the new object. Refer to the log for previous messages.

Response:
Remove the non-MQ object from the MQ library, and try the command again.

AMQ8186Image not available for WebSphere MQ object <insert_5>.
Severity:
40 : Stop Error

Explanation:
The object <insert_5> type <insert_3> cannot be recreated because the image is not fully available in the logs that are currently online. Refer to earlier messages in the error log for information about the logs that need to be brought online for this object to be recreated.

Response:
Bring the relevant logs online, and try the command again.

AMQ8187WebSphere MQ object <insert_5> is currently open.
Severity:
40 : Stop Error

Explanation:
The object <insert_5>, type <insert_3>, is currently in use, so the <insert_1> command cannot be issued against it. If a generic list was presented to the command, the command is still issued against the other objects in the list.

Response:
Wait until the object is no longer in use, and try the command again.

AMQ8188Insufficient authorization to WebSphere MQ object <insert_5>.
Severity:
40 : Stop Error

Explanation:
You are not authorized to issue the <insert_1> command against the object <insert_5> type <insert_3>. If a generic list was presented to the command, the command is still issued against the other objects in the list.

Response:
Obtain sufficient authorization for the object, and retry the command.

AMQ8189WebSphere MQ object <insert_5> is damaged.
Severity:
40 : Stop Error

Explanation:
The object <insert_5> type <insert_4> is damaged and the <insert_3> command cannot be issued against it. If a generic list was presented to the command then the command is still issued against the other objects in the list.

Response:
Issue the appropriate DEFINE command for the object, specifying REPLACE, and then try the command again.

AMQ8190<insert_3> succeeded on <insert_1> objects and failed on <insert_2> objects.
Severity:
40 : Stop Error

Explanation:
An operation performed on a generic list of objects was not completely successful.

Response:
Examine the log for details of the errors encountered, and take appropriate action.

AMQ8191WebSphere MQ command server is starting.
Severity:
40 : Stop Error

Explanation:
The command server is starting.

Response:
Wait for the strmqcsv command to complete and then try the operation again.

AMQ8191 (iSeries)WebSphere MQ command server is starting.
Severity:
40 : Stop Error

Explanation:
The command server is starting.

Response:
Wait for the STRMQMCSVR command to complete and then try the operation again.

AMQ8192WebSphere MQ command server already starting.
Severity:
40 : Stop Error

Explanation:
The request to start the command server was unsuccessful because the command server is already starting.

Response:
Wait for the strmqcsv command to complete.

AMQ8192 (iSeries)WebSphere MQ command server already starting.
Severity:
40 : Stop Error

Explanation:
The request to start the command server was unsuccessful because the command server is already starting.

Response:
Wait for the STRMQMCSVR command to complete.

AMQ8193WebSphere MQ command server is ending.
Severity:
40 : Stop Error

Explanation:
The command server is ending.

Response:
Wait for the endmqcsv command to complete and then try the command again.

AMQ8193 (iSeries)WebSphere MQ command server is ending.
Severity:
40 : Stop Error

Explanation:
The command server is ending.

Response:
Wait for the ENDMQMCSVR command to complete and then try the command again.

AMQ8194WebSphere MQ command server already ending.
Severity:
40 : Stop Error

Explanation:
The end command server request was unsuccessful because the command server is already ending.

Response:
Wait for the endmqcsv command to complete.

AMQ8194 (iSeries)WebSphere MQ command server already ending.
Severity:
40 : Stop Error

Explanation:
The end command server request was unsuccessful because the command server is already ending.

Response:
Wait for the ENDMQMCSVR command to complete.

AMQ8195WebSphere MQ command server already running.
Severity:
40 : Stop Error

Explanation:
The strmqcsv command was unsuccessful because the command server is already running.

Response:
None.

AMQ8195 (iSeries)WebSphere MQ command server already running.
Severity:
40 : Stop Error

Explanation:
The STRMQMCSVR command was unsuccessful because the command server is already running.

Response:
None.

AMQ8196WebSphere MQ command server already stopped.
Severity:
40 : Stop Error

Explanation:
The request to end the command server was unsuccessful because the command server is already stopped.

Response:
None.

AMQ8197Deleted WebSphere MQ queue damaged.
Severity:
20 : Error

Explanation:
The deleted MQ queue <insert_5> was damaged, and any messages it contained have been lost.

Response:
None.

AMQ8198 (iSeries)Program <insert_3> called with incorrect number of parameters.
Severity:
20 : Error

Explanation:
The number of parameters passed in the call to program <insert_3> is not correct.

Response:
Correct the calling program and then retry the operation.

AMQ8199 (iSeries)Error in call identifier parameter passed to program QMQM.
Severity:
20 : Error

Explanation:
The call identifier, the first parameter passed to program QMQM, is not in the required packed decimal format, or its value is not supported. Permitted values of the call identifier are contained in the RPG copy file CMQR.

Response:
Correct the calling program, and retry the call.

AMQ8200 (iSeries)MODENAME only allowed with TRPTYPE(*LU62).
Severity:
40 : Stop Error

Explanation:
The MODENAME parameter may only be specified with TRPTYPE(*LU62).

Response:
Remove the MODENAME parameter from the command or change the TRPTYPE parameter value to specify *LU62 and then try the command again.

AMQ8201 (iSeries)TPGMNAME only allowed with TRPTYPE(*LU62).
Severity:
40 : Stop Error

Explanation:
The TPGMNAME parameter may only be specified with TRPTYPE(*LU62).

Response:
Remove the TPGMNAME parameter from the command or change the TRPTYPE parameter value to specify *LU62. Then try the command again.

AMQ8202TMQNAME only allowed with channel type *SDR or *SVR.
Severity:
40 : Stop Error

Explanation:
The TMQNAME parameter may only be specified with channel type *SDR or *SVR.

Response:
Remove the TMQNAME parameter from the command or, if the command is CRTMQMCHL, change the CHLTYPE parameter value to specify *SDR or *SVR. Then try the command again.

AMQ8203 (iSeries)CONNAME only allowed with channel type *SDR, *SVR, *RQSTR, *CLUSSDR, *CLTCN and *CLUSRCVR
Severity:
40 : Stop Error

Explanation:
The CONNAME parameter may only be specified with channel type *SDR, *SVR, *RQSTR, *CLUSSDR, *CLTCN or *CLUSRCVR.

Response:
Remove the CONNAME parameter from the command or, if the command is CRTMQMCHL, change the CHLTYPE parameter value to specify *SDR, *SVR, *RQSTR, *CLUSSDR, *CLTCN or *CLUSRCVR. Then try the command again.

AMQ8204MCANAME only allowed with channel type *SDR, *SVR, or *RQSTR.
Severity:
40 : Stop Error

Explanation:
The MCANAME parameter may only be specified with channel type *SDR, *SVR, or *RQSTR.

Response:
Remove the MCANAME parameter from the command or, if the command is CRTMQMCHL, change the CHLTYPE parameter value to specify *SDR, *SVR, or *RQSTR. Then try the command again.

AMQ8205DSCITV only allowed with channel type *CLUSSDR, *CLUSRCVR, *SVRCN, *SDR or *SVR.
Severity:
40 : Stop Error

Explanation:
The DSCITV parameter may only be specified with channel type *CLUSSDR, *CLUSRCVR, *SVRCN, *SDR or *SVR.

Response:
Remove the DSCITV parameter from the command or, if the command is CRTMQMCHL, change the CHLTYPE parameter value to specify *CLUSSDR, *CLUSRCVR, *SVRCN, *SDR or *SVR. Then try the command again.

AMQ8206SHORTRTY only allowed with channel type *CLUSSDR, CLUSRCVR, *SDR or *SVR.
Severity:
40 : Stop Error

Explanation:
The SHORTRTY parameter may only be specified with channel type *CLUSSDR, *CLUSRCVR, *SDR or *SVR.

Response:
Remove the SHORTRTY parameter from the command or, if the command is CRTMQMCHL, change the CHLTYPE parameter value to specify *CLUSSDR, *CLUSRCVR, *SDR or *SVR. Then try the command again.

AMQ8207SHORTTMR only allowed with channel type *CLUSSDR, CLUSRCVR, *SDR or *SVR.
Severity:
40 : Stop Error

Explanation:
The SHORTTMR parameter may only be specified with channel type *CLUSSDR, *CLUSRCVR, *SDR or *SVR.

Response:
Remove the SHORTTMR parameter from the command or, if the command is CRTMQMCHL, change the CHLTYPE parameter value to specify *CLUSSDR, CLUSRCVR, *SDR or *SVR. Then try the command again.

AMQ8208LONGRTY only allowed with channel type *CLUSSDR, *CLUSRCVR, *SDR or *SVR.
Severity:
40 : Stop Error

Explanation:
The LONGRTY parameter may only be specified with channel type *CLUSSDR, *CLUSRCVR, *SDR or *SVR.

Response:
Remove the LONGRTY parameter from the command or, if the command is CRTMQMCHL, change the CHLTYPE parameter value to specify *CLUSSDR, CLUSRCVR, *SDR or *SVR. Then try the command again.

AMQ8209LONGTMR only allowed with channel type *CLUSSDR, *CLUSRCVR, *SDR or *SVR.
Severity:
40 : Stop Error

Explanation:
The LONGTMR parameter may only be specified with channel type *CLUSSDR, *CLUSRCVR, *SDR or *SVR.

Response:
Remove the LONGTMR parameter from the command or, if the command is CRTMQMCHL, change the CHLTYPE parameter value to specify *CLUSSDR, *CLUSRCVR, *SDR or *SVR. Then try the command again.

AMQ8210PUTAUT only allowed with channel type *RCVR or RQSTR.
Severity:
40 : Stop Error

Explanation:
The PUTAUT parameter may only be specified with channel type *RCVR or RQSTR.

Response:
Remove the PUTAUT parameter from the command or, if the command is CRTMQMCHL, change the CHLTYPE parameter value to specify *RCVR or RQSTR. Then try the command again.

AMQ8211BATCHINT only allowed with channel type *SDR or *SVR.
Severity:
40 : Stop Error

Explanation:
The BATCHINT parameter may only be specified with channel type *SDR or *SVR.

Response:
Remove the BATCHINT parameter from the command or, if the command is CRTMQMCHL, change the CHLTYPE parameter value to specify *SDR or *SVR. Then try the command again.

AMQ8212 (iSeries)TPGMNAME parameter required with TRPTYPE(*LU62).
Severity:
40 : Stop Error

Explanation:
A required parameter was not specified.

Response:
Enter a value for parameter TPGMNAME.

AMQ8213 (iSeries)TMQNAME parameter required with channel type *SDR or *SVR.
Severity:
40 : Stop Error

Explanation:
The TMQNAME parameter must be specified with channel type *SDR or *SVR.

Response:
Enter a value for parameter TMQNAME.

AMQ8214CONNAME parameter missing.
Severity:
40 : Stop Error

Explanation:
The CONNAME parameter must be specified with channel types SDR, RQSTR, CLNTCONN, and CLUSSDR. It is also required with channel type CLUSRCVR if the TRPTYPE is not TCP.

Response:
Enter a value for parameter CONNAME.

AMQ8214 (iSeries)CONNAME parameter missing.
Severity:
40 : Stop Error

Explanation:
The CONNAME parameter must be specified with channel types *SDR, *RQSTR, and *CLUSSDR. It is also required with channel type *CLUSRCVR if the TRPTYPE is not *TCP.

Response:
Enter a value for parameter CONNAME.

AMQ8215 (iSeries)CVTMSG only allowed with channel type *SDR, *SVR, *CLUSSDR or *CLUSRCVR.
Severity:
40 : Stop Error

Explanation:
The CVTMSG parameter may only be specified with channel type *SDR, *SVR, *CLUSSDR or *CLUSRCVR.

Response:
Remove the CVTMSG parameter from the command or, if the command is CRTMQMCHL, change the CHLTYPE parameter value to specify *SDR, *SVR, *CLUSSDR or CLUSRCVR. Then try the command again.

AMQ8216 (iSeries)MODENAME only allowed with TRPTYPE(*LU62).
Severity:
40 : Stop Error

Explanation:
The MODENAME parameter may only be specified with TRPTYPE(*LU62).

Response:
Remove the MODENAME parameter from the command or change the TRPTYPE parameter value to specify *LU62. Then try the command again.

AMQ8217 (iSeries)CONNAME only allowed with channel type *SDR, *SVR, *RQSTR, *CLUSSDR or CLUSRCVR.
Severity:
40 : Stop Error

Explanation:
The CONNAME parameter may only be specified with channel type *SDR, *SVR, *RQSTR, CLUSSDR or CLUSRCVR.

Response:
Remove the CONNAME parameter from the command or, if the command is CRTMQMCHL, change the CHLTYPE parameter value to specify *SDR, *SVR, *RQSTR, CLUSSDR or CLUSRCVR. Then try the command again.

AMQ8218The system cannot accept the combination of parameters entered.
Severity:
30 : Severe error

AMQ8219Command server queue is open, retry later.
Severity:
30 : Severe error

Response:
Wait and try again later.

AMQ8220 (iSeries)The PNGMQMCHL command has completed.
Severity:
0 : Information

Explanation:
The PNGMQMCHL command sent <insert_1> bytes of data to <insert_3> and received the data back in <insert_4>.<insert_5> seconds. The number of bytes will be less than the amount requested on the command, when the length requested is greater than the allowed maximum, in one communications transmission, for the operating system and communications protocol.

Response:
None.

AMQ8221 (iSeries)Ping data length truncated, specified length <insert_1>, actual length <insert_2>.
Severity:
10 : Warning

Explanation:
The length of the ping data sent was reduced because of constraints in the current configuration.

Response:
None.

AMQ8222 (iSeries)The data sent and received by the PNGMQMCHL command was not identical.
Severity:
40 : Stop Error

Explanation:
Ping data compare failed at offset <insert_1>, data sent <insert_3>, data received <insert_4>.

Response:
This is probably due to a communications failure. Other messages may have been issued.

AMQ8223 (iSeries)No channels to display.
Severity:
0 : Information

Explanation:
There are no channels defined on this system.

Response:
Create a channel using the CRTMQMCHL command.

AMQ8224 (iSeries)From channel <insert_3> not found.
Severity:
30 : Severe error

Explanation:
The source WebSphere MQ channel does not exist.

Response:
Correct the MQ channel name and then try the command again.

AMQ8225 (iSeries)From channel and to channel names are equal.
Severity:
30 : Severe error

Explanation:
The same name has been specified for the from channel name and the to channel name.

Response:
Choose two different names, of which the from channel must exist.

AMQ8226WebSphere MQ channel already exists.
Severity:
40 : Stop Error

Explanation:
The channel <insert_3> cannot be created because it already exists.

Response:
Check that the name is correct and try the command again specifying REPLACE, or delete the channel and then try the command again.

AMQ8227Channel <insert_3> not found.
Severity:
30 : Severe error

Explanation:
The channel could not be found.

Response:
Correct the Channel Name if wrong and then try the command again. For DEFINE CHANNEL check that the Channel Name in error exists.

AMQ8228 (iSeries)Unexpected return code <insert_1>.
Severity:
30 : Severe error

Explanation:
The unexpected return code <insert_1> was returned to a channel command.

Response:
This message is associated with an internal error. Use WRKPRB to record the problem identifier, and to save the QPSRVDMP, QPJOBLOG, and QPDSPJOB files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ8229 (iSeries)No message queue managers to display.
Severity:
0 : Information

Explanation:
There are no message queue managers to administer.

Response:
Add a queue manager using PF6 or the ADMQMNAM command.

AMQ8230 (iSeries)No queue manager objects to display.
Severity:
0 : Information

Explanation:
Either the queue manager has no objects to display (this is unlikely), or the selection criteria resulted in zero objects to display.

Response:
Change or remove the selection criteria.

AMQ8231 (iSeries)No responses to display.
Severity:
0 : Information

Explanation:
There are no commands or command responses to display.

Response:
None.

AMQ8232 (iSeries)No messages to display.
Severity:
0 : Information

Explanation:
The queue is empty, or the queue does not exist.

Response:
None.

AMQ8233 (iSeries)No message data to display.
Severity:
0 : Information

Explanation:
The message contains no data.

Response:
None.

AMQ8234 (iSeries)No response data to display.
Severity:
0 : Information

Explanation:
There is no response data to display for this command. This is probably because the command has not yet completed.

Response:
None.

AMQ8235 (iSeries)No command parameters to display.
Severity:
0 : Information

Explanation:
Some commands have no required parameters.

Response:
None.

AMQ8236 (iSeries)Channel <insert_3> not found.
Severity:
30 : Severe error

Explanation:
CHGMQMCHL was issued for a non-existent channel.

Response:
Correct the WebSphere MQ channel name and then try the command again.

AMQ8237 (iSeries)NPMSPEED only allowed with channel type *SDR, *SVR, *RCVR *RQSTR, CLUSSDR or CLUSRCVR.
Severity:
40 : Stop Error

Explanation:
The NPMSPEED parameter may only be specified with channel type *SDR, *SVR, *RCVR *RQSTR, CLUSSDR or CLUSRCVR.

Response:
Remove the NPMSPEED parameter from the command or, if the command is CRTMQMCHL, change the CHLTYPE parameter value to specify *SDR, *SVR, *RCVR *RQSTR, CLUSSDR or CLUSRCVR. Then try the command again.

AMQ8238 (iSeries)Queue manager connection already open.
Severity:
30 : Severe error

Explanation:
An MQCONN call was issued, but the thread or process is already connected to a different queue manager. The thread or process can connect to only one queue manager at a time.

Response:
Use the MQDISC call to disconnect from the queue manager which is already connected, and then issue the MQCONN call to connect to the new queue manager. Disconnecting from the existing queue manager will close any queues which are currently open, it is recommended that any uncommitted units of work should be committed or backed out before the MQDISC call is used.

AMQ8239 (iSeries)LOCLADDR not valid for channel type *RCVR or *SVRCN.
Severity:
40 : Stop Error

Explanation:
The LOCLADDR parameter may only be specified with channel type *SDR, *SVR, *RQSTR, *CLUSSDR, *CLUSRCVR or *CLTCN.

Response:
Remove the CONNAME parameter from the command or, if the command is CRTMQMCHL, change the CHLTYPE parameter value to specify *SDR, *SVR, *RQSTR, *CLUSSDR, *CLUSRCVR or *CLTCN. Then try the command again.

AMQ8240 (iSeries)Unexpected error <insert_1> in <insert_3>.
Severity:
40 : Stop Error

Explanation:
The unexpected return code <insert_1> was returned during <insert_3> processing.

Response:
This message is associated with an internal error. Use WRKPRB to record the problem identifier, and to save the QPSRVDMP, QPJOBLOG, and QPDSPJOB files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ8241 (iSeries)Unexpected message format <insert_3> received.
Severity:
40 : Stop Error

Explanation:
The unexpected message format <insert_3> was received in message on the internal reply queue.

Response:
This message is probably a message sent erroneously to this queue. The message in error is written to the SYSTEM.ADMIN.EXCEPTION.QUEUE, where it may be viewed using the WRKMQMMSG command.

AMQ8242SSLCIPH definition wrong.
Severity:
40 : Stop Error

Explanation:
The definition of the SSLCIPH parameter was wrong.

Response:
Correct the SSLCIPH definition and try the command again.

AMQ8243SSLPEER definition wrong.
Severity:
40 : Stop Error

Explanation:
The definition of the SSLPEER parameter was wrong. Possible causes may be that the syntax was invalid or that it contained an invalid attribute type.

Response:
Correct the SSLPEER definition and try the command again.

AMQ8266 (iSeries)No objects to display.
Severity:
0 : Information

Explanation:
There are no objects with the specified name and type.

Response:
None.

AMQ8276Display Connection details.
Severity:
0 : Information

Explanation:
The DISPLAY CONN command completed successfully. Details follow this message.

AMQ8277 (iSeries)Object changed.
Severity:
40 : Stop Error

Explanation:
The definition of WebSphere MQ object <insert_5> changed after it had been opened, thereby invalidating the open operation.

Response:
Try the command again.

AMQ8278 (iSeries)Maximum handle limit reached.
Severity:
40 : Stop Error

Explanation:
An attempt was made to exceed the maximum handle limit specified for the message queue manager.

Response:
Increase the maximum handle limit specified for the message queue manager using the CHGMQM command. Then try the command again.

AMQ8279 (iSeries)Option not valid for type.
Severity:
40 : Stop Error

Explanation:
The options specified when opening WebSphere MQ object <insert_5> were not valid for the object type.

Response:
Correct the definition of the failing object. Then try the command again.

AMQ8280 (iSeries)Queue does not exist.
Severity:
30 : Severe error

Explanation:
The queue being displayed does not exist on this queue manager.

Response:
Check the name of the queue and retry the operation. If you are attempting to display a queue of type *ALS, check the queue definition references an existing queue definition.

AMQ8282 (iSeries)Queue manager <insert_3> is not defined on the connected queue manager.
Severity:
30 : Severe error

Explanation:
Either the necessary queue manager name has been entered incorrectly on the add queue manager panel, or the queue manager has not been defined on the connected queue manager.

Response:
Correct the name, or define <insert_3> on the connected queue manager by creating a local queue with name <insert_3> and usage *TMQ (transmission queue), and then creating sender and receiver channels on both the connected queue manager and queue manager <insert_3>.

AMQ8283 (iSeries)The administration application data store program failed to start.
Severity:
40 : Stop Error

Explanation:
The program AMQMCPRA (data store program) was not started because of reason code <insert_1>.

Response:
Check the joblog for AMQMCPRA by issuing a WRKSPLF QMQM. Correct the problem and try to start the Administration application by invoking the STRMQMADM command. If the problem persists contact your systems programmer.

AMQ8284 (iSeries)This user is not authorized to queue <insert_3>.
Severity:
40 : Stop Error

Explanation:
Queue <insert_3> (queue manager <insert_4>) has not been authorized for your use.

Response:
Have queue <insert_3> authorized for your use. If queue manager <insert_4> is not the local queue manager, you might not be authorized to the transmission queue for this queue manager.

AMQ8287No channels with status to display.
Severity:
0 : Information

Explanation:
There are no channels having status information to display. This indicates either, that the channel has not been started previously, or, that the channel has been started but has not yet completed a transmission sequence.

Response:
None.

AMQ8288 (iSeries)Not authorized to command <insert_1>
Severity:
40 : Stop Error

Explanation:
You are not authorized to perform the requested operation for WebSphere MQ command <insert_1>.

Response:
Obtain the necessary authority from your WebSphere MQ administrator. Then try the command again.

AMQ8289 (iSeries)You are not authorized to the WebSphere MQ command.
Severity:
40 : Stop Error

Explanation:
You are not authorized to the WebSphere MQ command because your user profile is not a member of the QMQMADM group.

Response:
Ask your MQ administrator to give your user profile *ALLOBJ authority, or add your user profile to the QMQMADM group (either as a primary or supplemental group)

AMQ8291 (iSeries)WebSphere MQ remote trace already running.
Severity:
10 : Warning

Explanation:
An attempt was made to start remote trace, but it is already running.

Response:
Either leave remote trace running as it is, or, if you want to change the settings, turn remote trace off and then turn it on again with appropriate settings.

AMQ8294 (iSeries)WebSphere MQ remote trace already off.
Severity:
10 : Warning

Explanation:
An attempt was made to end remote trace, but it is already off.

Response:
Leave remote trace off.

AMQ8295 (iSeries)WebSphere MQ object not secured by authorization list.
Severity:
40 : Stop Error

Explanation:
The specified object is not secured by the authorization list to be revoked from it.

Response:
Use the display authority (DSPMQMAUT) command to determine what authorization list is securing the object, if any. Issue the RVKMQMAUT command again with the authorization list that is securing the the object to revoke the authorization list's authority.

AMQ8296<insert_1> MQSC commands completed successfully.
Severity:
0 : Information

Explanation:
The <insert_3> command has completed successfully. The <insert_1> MQ commands from <insert_5> have been processed without error and a report written to the printer spool file.

Response:
None.

AMQ8297<insert_1> MQSC commands verified successfully.
Severity:
0 : Information

Explanation:
The <insert_3> command completed successfully. The <insert_1> MQ commands from <insert_5> have been verified and a report written to the printer spool file.

Response:
None.

AMQ8298Error report generated for MQSC command process.
Severity:
40 : Stop Error

Explanation:
The <insert_3> command attempted to process a sequence of MQ commands and encountered some errors, however, the operation may have partially completed.

Response:
If the STRMQMMQSC command was executed a report has been written to a printer spool file. Examine the spooled printer file for details of the errors encountered and correct the MQSC source <insert_5> and retry the operation.

AMQ8299Cannot open <insert_5> for MQSC process.
Severity:
40 : Stop Error

Explanation:
The <insert_1> command failed to open <insert_5> for MQ command processing.

Response:
Check that the intended file exists, and has been specified correctly. Correct the specification or create the object, and try the operation again.

AMQ8300 (iSeries)Too many exit programs/user data fields defined.
Severity:
30 : Severe error

Explanation:
An attempt was made to create or change a channel which had more than the allowed maximum of a total of six exit programs and/or user data fields defined.

Response:
Re-define the channel so that a total of six exit programs and/or user data fields are defined.

AMQ8301 (iSeries)WebSphere MQ storage monitor job could not be started.
Severity:
50 : System Error

Explanation:
An attempt to start the storage monitor process (job QMQM in subsystem QSYSWRK) was unsuccessful.

Response:
Check the job log for the reason for the failure, and try the command again.

AMQ8302Internal failure initializing WebSphere MQ services.
Severity:
50 : System Error

Explanation:
An error occurred while attempting to initialize WebSphere MQ services.

Response:
A call to xcsInitialize ended with the FAIL, STOP, or STOP_ALL return code. Refer to the log for messages diagnosing this problem.

AMQ8303Insufficient storage available to process request.
Severity:
50 : System Error

AMQ8304Tracing cannot be started. Too many traces are already running.
Severity:
40 : Stop Error

Explanation:
A maximum of 9 traces may be running concurrently. This number is already running.

Response:
Stop one or more of the other traces and try the command again.

AMQ8305Tracing cannot be started. Too many traces are already running.
Severity:
40 : Stop Error

Explanation:
A maximum of 9 traces can be running concurrently, and this number of traces is already running.

Response:
Stop one or more of the other traces and try the command again.

AMQ8306 (iSeries)BATCHSIZE only allowed with channel type *SDR, *SVR, *RCVR, *RQSTR, CLUSSDR or CLUSRCVR.
Severity:
40 : Stop Error

Explanation:
The BATCHSIZE parameter may only be specified with channel type *SDR, *SVR, *RCVR, *RQSTR, CLUSSDR or CLUSRCVR.

Response:
Remove the BATCHSIZE parameter from the command or, if the command is CRTMQMCHL, change the CHLTYPE parameter value to specify *SDR, *SVR, *RCVR *RQSTR, CLUSSDR or CLUSRCVR. Then try the command again.

AMQ8307 (iSeries)SEQNUMWRAP only allowed with channel type *SDR, *SVR, *RCVR , *RQSTR, CLUSSDR or CLUSRCVR.
Severity:
40 : Stop Error

Explanation:
The SEQNUMWRAP parameter may only be specified with channel type *SDR, *SVR, *RCVR, *RQSTR, CLUSSDR or CLUSRCVR.

Response:
Remove the SEQNUMWRAP parameter from the command or, if the command is CRTMQMCHL, change the CHLTYPE parameter value to specify *SDR, *SVR, *RCVR *RQSTR, CLUSSDR or CLUSRCVR. Then try the command again.

AMQ8308 (iSeries)MSGRTYEXIT only allowed with channel type *CLUSRCVR, *RCVR or *RQSTR.
Severity:
40 : Stop Error

Explanation:
The MSGRTYEXIT parameter may only be specified with channel type *CLUSRCVR, *RCVR or *RQSTR.

Response:
Remove the MSGRTYEXIT parameter from the command or, if the command is CRTMQMCHL, change the CHLTYPE parameter value to specify *CLUSRCVR, *RCVR or *RQSTR. Then try the command again.

AMQ8309 (iSeries)MSGRTYDATA only allowed with channel type *CLUSRCVR, *RCVR or *RQSTR.
Severity:
40 : Stop Error

Explanation:
The MSGRTYDATA parameter may only be specified with channel type *CLUSRCVR, *RCVR or *RQSTR.

Response:
Remove the MSGRTYDATA parameter from the command or, if the command is CRTMQMCHL, change the CHLTYPE parameter value to specify *CLUSRCVR, *RCVR or *RQSTR. Then try the command again.

AMQ8310 (iSeries)MSGRTYNBR only allowed with channel type *CLUSRCVR, *RCVR or *RQSTR.
Severity:
40 : Stop Error

Explanation:
The MSGRTYNBR parameter may only be specified with channel type *CLUSRCVR, *RCVR or *RQSTR.

Response:
Remove the MSGRTYNBR parameter from the command or, if the command is CRTMQMCHL, change the CHLTYPE parameter value to specify *CLUSRCVR, *RCVR or *RQSTR. Then try the command again.

AMQ8311 (iSeries)MSGRTYITV only allowed with channel type *CLUSRCVR, *RCVR or *RQSTR.
Severity:
40 : Stop Error

Explanation:
The MSGRTYITV parameter may only be specified with channel type *CLUSRCVR, *RCVR or *RQSTR.

Response:
Remove the MSGRTYITV parameter from the command or, if the command is CRTMQMCHL, change the CHLTYPE parameter value to specify *CLUSRCVR, *RCVR or *RQSTR. Then try the command again.

AMQ8312 (iSeries)CLUSTER only allowed with queue type *ALS, *LCL and *RMT.
Severity:
40 : Stop Error

Explanation:
The CLUSTER parameter may only be specified with queue type *ALS, *LCL and *RMT.

Response:
Remove the CLUSTER parameter from the command or, if the command is CRTMQMQ, change the QTYPE parameter value to specify *ALS, *LCL or *RMT. Then try the command again.

AMQ8313 (iSeries)CLUSNL only allowed with queue type *ALS, *LCL and *RMT.
Severity:
40 : Stop Error

Explanation:
The CLUSNL parameter may only be specified with queue type *ALS, *LCL and *RMT.

Response:
Remove the CLUSNL parameter from the command or, if the command is CRTMQMQ, change the QTYPE parameter value to specify *ALS, *LCL or *RMT. Then try the command again.

AMQ8314 (iSeries)DEFBIND only allowed with queue type *ALS, *LCL and *RMT.
Severity:
40 : Stop Error

Explanation:
The DEFBIND parameter may only be specified with queue type *ALS, *LCL and *RMT.

Response:
Remove the DEFBIND parameter from the command or, if the command is CRTMQMQ, change the QTYPE parameter value to specify *ALS, *LCL or *RMT. Then try the command again.

AMQ8315No namelists to display.
Severity:
0 : Information

Explanation:
There are no matching namelists defined on this system.

Response:
Use the Create Namelist (CRTMQMNL) command to create a namelist.

AMQ8316No cluster queue managers to display.
Severity:
0 : Information

Explanation:
There are no matching cluster queue managers defined on this system.

Response:
None.

AMQ8317 (iSeries)CLUSTER only allowed with channel type *CLUSSDR and *CLUSRCVR.
Severity:
40 : Stop Error

Explanation:
The CLUSTER parameter may only be specified with channel type *CLUSSDR and *CLUSRCVR.

Response:
Remove the CLUSTER parameter from the command or, if the command is CRTMQMCHL, change the CHLTYPE parameter value to specify *CLUSSDR or *CLUSRCVR. Then try the command again.

AMQ8318 (iSeries)CLUSNL only allowed with channel type *CLUSSDR and *CLUSRCVR.
Severity:
40 : Stop Error

Explanation:
The CLUSNL parameter may only be specified with channel type *CLUSSDR and *CLUSRCVR.

Response:
Remove the CLUSNL parameter from the command or, if the command is CRTMQMCHL, change the CHLQTYPE parameter value to specify *CLUSSDR or *CLUSRCVR. Then try the command again.

AMQ8319MSGEXIT only allowed with channel type *SDR, *SVR, *RCVR *RQSTR, *CLUSSDR or *CLUSRCVR.
Severity:
40 : Stop Error

Explanation:
The MSGEXIT parameter may only be specified with channel type *SDR, *SVR, *RCVR, *RQSTR, *CLUSSDR, or *CLUSRCVR.

Response:
Remove the MSGEXIT parameter from the command or, if the command is CRTMQMCHL, change the CHLTYPE parameter value to specify *SDR or *SVR or *RCVR or *RQSTR or *CLUSSDR or *CLUSRCVR. Then try the command again.

AMQ8320 (iSeries)MSGUSRDATA only allowed with channel type *SDR, *SVR, *RCVR *RQSTR, or *CLUSSDR or *CLUSRCVR.
Severity:
40 : Stop Error

Explanation:
The MSGUSRDATA parameter may only be specified with channel type *SDR, *SVR, *RCVR *RQSTR, *CLUSSDR or *CLUSRCVR.

Response:
Remove the MSGUSRDATA parameter from the command or, if the command is CRTMQMCHL, change the CHLTYPE parameter value to specify *SDR or *SVR or *RCVR or *RQSTR or *CLUSSDR or *CLUSRCVR. Then try the command again.

AMQ8321 (iSeries)Process <insert_3> is still running.
Severity:
0 : Information

AMQ8322 (iSeries)TIMEOUT only allowed with ENDCCTJOB(*YES).
Severity:
40 : Stop Error

Explanation:
The TIMEOUT parameter may only be specified when connected jobs are being ended with the ENDCCTJOB option set to *YES.

Response:
Remove the TIMEOUT parameter from the command or, if you want to fully quiesce the queue manager, change the ENDCCTJOB parameter to *YES. Then try the command again.

AMQ8323 (iSeries)OPTION(*PREEMPT) must not be used with ENDCCTJOB(*YES).
Severity:
40 : Stop Error

Explanation:
When performing a pre-emptive shutdown of the queue manager the ENDCCTJOB(*YES) parameter is not allowed.

Response:
Change the ENDCCTJOB(*YES) parameter to ENDCCTJOB(*NO) or, if you want to fully quiesce the queue manager without doing a pre-emptive shutdown, change the OPTION(*PREEMPT) parameter to another value. Then try the command again.

AMQ8324 (iSeries)OPTION(*WAIT) not allowed with MQMNAME(*ALL).
Severity:
40 : Stop Error

Explanation:
The OPTION(*WAIT) parameter is not allowed when performing a shutdown of all queue managers.

Response:
Remove the OPTION(*WAIT) parameter from the command or, specify individual queue manager names to shut down the queue managers one-by-one with the OPTION(*WAIT) parameter. Then try the command again.

AMQ8325 (iSeries)MQMNAME(*ALL) is not allowed with ENDCCTJOB(*NO).
Severity:
40 : Stop Error

Explanation:
The MQMNAME(*ALL) parameter is only allowed when performing a full shutdown of the queue managers.

Response:
Specify individual queue manager names to shut the queue managers down one-by-one or change the ENDCCTJOB parameter to *YES. Then try the command again.

AMQ8330Running
Severity:
0 : Information

AMQ8331Ended normally
Severity:
0 : Information

AMQ8332Ended immediately
Severity:
0 : Information

AMQ8333Ended preemptively
Severity:
0 : Information

AMQ8334Ended unexpectedly
Severity:
0 : Information

AMQ8335Starting
Severity:
0 : Information

AMQ8336Quiescing
Severity:
0 : Information

AMQ8337Ending immediately
Severity:
0 : Information

AMQ8338Ending preemptively
Severity:
0 : Information

AMQ8339Being deleted
Severity:
0 : Information

AMQ8340Not available
Severity:
0 : Information

AMQ8341SUBPOOL(<insert_3>)<insert_4>PID(<insert_1>)
Severity:
0 : Information

AMQ8342No authorities to display.
Severity:
0 : Information

Explanation:
There are no authority records defined on this system, satisfying the input parameters.

Response:
Use the appropriate input to list all the authorities defined on the system, or enter the command again with different input..

AMQ8343 (iSeries)The requested operation is not valid for user QMQMADM.
Severity:
0 : Information

Explanation:
You are not allowed to completely delete the authorities assigned to user QMQMADM, for a valid WebSphere MQ object, with the authority *REMOVE or *NONE.

Response:
Remove QMQMADM from the list of users to this command.

AMQ8344 (iSeries)The delete option is only valid for a generic profile name.
Severity:
0 : Information

Explanation:
The delete option, which will delete this authority profile by removing all the users from this authority profile, is not valid for an object name or the special value &class.

Response:
To delete users from an object, work from the WRKMQMAUTD command.

AMQ8345 (iSeries)BATCHHB not valid for channel type *RCVR, *RQSTR, *SVRCN or *CLTCN.
Severity:
40 : Stop Error

Explanation:
The BATCHHB parameter may only be specified with channel type *SDR, *SVR, *CLUSSDR, or *CLUSRCVR.

Response:
Remove the BATCHHB parameter from the command or, if the command is CRTMQMCHL, change the CHLTYPE parameter value to specify *SDR, *SVR, *CLUSSDR or *CLUSRCVR. Then try the command again.

AMQ8346 (iSeries)Parameter mismatch between QMNAME and QMID.
Severity:
40 : Stop Error

Explanation:
The Queue Manager Name for Removal (QMNAME) parameter is not *QMID and there is a value for the Queue Manager Identifier for Removal (QMID) parameter.

Response:
A value for QMID is not allowed unless QMNAME is *QMID. Change the value specified on the QMNAME parameter or the value of the QMID parameter and then try the request again.

AMQ8347 (iSeries)USERID not valid for channel type *RCVR, *SVRCN or *CLUSRCVR.
Severity:
40 : Stop Error

Explanation:
The USERID parameter may only be specified with channel type *SDR, *SVR, *RQSTR, *CLUSSDR, or *CLTCN.

Response:
Remove the USERID parameter from the command or, if the command is CRTMQMCHL, change the CHLTYPE parameter value to specify *SDR, *SVR, *RQSTR, *CLUSSDR, or *CLTCN. Then try the command again.

AMQ8348 (iSeries)PASSWORD not valid for channel type *RCVR, *SVRCN or *CLUSRCVR.
Severity:
40 : Stop Error

Explanation:
The PASSWORD parameter may only be specified with channel type *SDR, *SVR, *RQSTR, *CLUSSDR, or *CLTCN.

Response:
Remove the PASSWORD parameter from the command or, if the command is CRTMQMCHL, change the CHLTYPE parameter value to specify *SDR, *SVR, *RQSTR, *CLUSSDR, or *CLTCN. Then try the command again.

AMQ8349 (iSeries)Authority changes to <insert_5> failed.
Severity:
40 : Stop Error

Explanation:
Authority changes to an object were requested but could not be made.

Response:
Check the authorities that you are granting are relevant to the object type of <insert_5>.

AMQ8350Usage: dspmqver [-p Components] [-f Fields] [-b] [-v]
Severity:
0 : Information

Explanation:
This shows the correct usage.

Response:
None.

AMQ8359QMNAME(<insert_3>)<insert_4>STATUS(Being deleted)
Severity:
0 : Information

AMQ8370Usage: runmqdnm -q Queue -a Assembly 
[-m QueueManager] [-c ClassName] [-u Text] [-s Syncpoint] 
[-n MaxThreads] [-t Timeout] [-b BackoutThreshold] 
[-r BackoutQueue] [-p Context] [-d]
Severity:
0 : Information

Explanation:
This shows the correct usage.

Response:
None.

AMQ8371<insert_3> is not a valid command line option.
Severity:
40 : Stop Error

Explanation:
The option <insert_3> was specified on the command line to the application however this is not one of the valid set of command line options.

Response:
Check the usage information for the application and then retry.

AMQ8372The required command line option <insert_3> is missing.
Severity:
40 : Stop Error

Explanation:
The application expects several mandatory command line options. One of these, <insert_3>, was not specified.

Response:
Check the usage information for the application and ensure that all required parameters are specified then retry.

AMQ8373Invalid value specified for command line option <insert_3> (<insert_4>).
Severity:
40 : Stop Error

Explanation:
The value specified for command line option <insert_3> (<insert_4>) is invalid.

Response:
Check the usage information for the application and ensure that all options specify values in the valid range then retry.

AMQ8374 WebSphere MQ queue manager <insert_3> does not exist.
Severity:
40 : Stop Error

Explanation:
The WebSphere MQ queue manager <insert_3> does not exist.

Response:
Either create the queue manager (crtmqm command) or correct the queue manager name used in the command and then try the command again.

AMQ8375WebSphere MQ queue manager <insert_3> not available.
Severity:
40 : Stop Error

Explanation:
The WebSphere MQ queue manager <insert_3> is not available because it has been stopped or is otherwise not contactable.

Response:
Use the strmqm command to start the message queue manager as necessary or correct any intermittent problems (eg. network connectivity) then try the command again.

AMQ8376WebSphere MQ queue <insert_3> not found.
Severity:
40 : Stop Error

Explanation:
The queue <insert_3> could not be found, it may not have been created.

Response:
Ensure that the name of the queue specified is correct, queue names are case sensitive. If the queue is not created, use the runmqsc command to create it. Then try the command again.

AMQ8377Unexpected error <insert_1> was received by the application.
Severity:
40 : Stop Error

Explanation:
The error <insert_1> was returned unexpectedly to the application.

Response:
Save the generated output files and contact your IBM support center.

AMQ8378Unexpected exception received from .NET Framework 
<insert_3>
Severity:
40 : Stop Error

Explanation:
The application received an exception from the underlying .NET framework, information about the exception follows: 
<insert_4>

Response:
Examine the information contained within the exception to determine if it is possible to resolve locally. 
If it is not possible to resolve the problem locally, save the generated output files and contact your IBM support center.

AMQ8379Assembly <insert_3> could not be loaded
Severity:
40 : Stop Error

Explanation:
The WebSphere MQ .NET Monitor attempted to load assembly <insert_3> but received an exception from the underlying .NET framework indicating that it could not be found. <insert_4>

Response:
Check that the assembly does exist and is accessible to the user running the application then retry. 
If the assembly should be available, contact your IBM support center.

AMQ8380No classes implementing IMQObjectTrigger found in <insert_3>.
Severity:
40 : Stop Error

Explanation:
The WebSphere MQ .NET monitor was unable to identify any classes in referenced assembly <insert_3> which implement the IMQObjectTrigger interface.

Response:
It is a requirement of the WebSphere MQ .NET monitor that either a single class implementing the IMQObjectTrigger interface exists in the referenced assembly or that a class is identified in that assembly to execute. Either modify the assembly to include a single class implementing IMQObjectTrigger or specify a class name on the command line and retry.

AMQ8381Too many classes implementing IMQObjectTrigger (<insert_1>) found in <insert_3>.
Severity:
40 : Stop Error

Explanation:
The WebSphere MQ .NET monitor found <insert_1> classes in referenced assembly <insert_3> all of which implement the IMQObjectTrigger interface.

Response:
It is a requirement of the WebSphere MQ .NET monitor that either a single class implementing the IMQObjectTrigger interface exists in the referenced assembly or that a class is identified in that assembly to execute. Either modify the assembly to include a single class implementing IMQObjectTrigger or specify a class name on the command line and retry.

AMQ8382A Message breaking the backout threshold (<insert_1>) was moved to <insert_4>
Severity:
10 : Warning

Explanation:
Whilst processing queue <insert_3> a message whose backout count exceeded the specified backout threshold (<insert_1>) was successfully moved to <insert_4>

Response:
The message moved to the backout queue has a backout count greater than the backout threshold specified (or picked up from the input queue BOTHRESH attribute). You should investigate the reason why this message was rolled back onto the input queue and resolve that issue. If backout processing is not required, modify the command line options and or queue definitions to achieve the required behaviour from the .NET monitor.

AMQ8383A Message breaking the backout threshold (<insert_1>) could not be moved.
Severity:
40 : Stop Error

Explanation:
While processing queue <insert_3> a message whose backout count exceeded the specified backout threshold (<insert_1>) was encountered however, it was not possible to move it to either a backout queue or the dead-letter queue.

Response:
Because it was not possible to move the backed out message to another queue, it has been left on the input queue. As a result, the .NET monitor has ended. 
It is possible that the backout queue or dead-letter queue are full or disabled for put - in this case, resolve this problem first. 
If backout processing should have resulted in the message being placed on another queue, check the command line options, input queue definition and queue manager dead-letter queue attribute to ensure that they are correct, then retry.

AMQ8390Usage: endmqdnm -q Queue [-m QueueManager]
Severity:
0 : Information

Explanation:
This shows the correct usage.

Response:
None.

AMQ8391<insert_3> is not a valid command line option.
Severity:
40 : Stop Error

Explanation:
The option <insert_3> was specified on the command line to the application however this is not one of the valid set of command line options.

Response:
Check the usage information for the application and then retry.

AMQ8392The required command line option <insert_3> is missing.
Severity:
40 : Stop Error

Explanation:
The application expects mandatory command line options. One of these, <insert_3>, was not specified.

Response:
Check the usage information for the application and ensure that all required parameters are specified then retry.

AMQ8393Invalid value specified for command line option <insert_3> (<insert_4>).
Severity:
40 : Stop Error

Explanation:
The value specified for command line option <insert_3> (<insert_4>) is invalid.

Response:
Check the usage information for the application and ensure that all options specify values in the valid range then retry.

AMQ8394WebSphere MQ queue manager <insert_3> does not exist.
Severity:
40 : Stop Error

Explanation:
The WebSphere MQ queue manager <insert_3> does not exist.

Response:
Either create the queue manager (crtmqm command) or correct the queue manager name used in the command and then try the command again.

AMQ8395WebSphere MQ queue manager <insert_3> not available.
Severity:
40 : Stop Error

Explanation:
The WebSphere MQ queue manager <insert_3> is not available because it has been stopped or is otherwise not contactable.

Response:
Use the strmqm command to start the message queue manager as necessary or correct any intermittent problems (eg. network connectivity) then try the command again.

AMQ8396WebSphere MQ queue <insert_3> not found.
Severity:
40 : Stop Error

Explanation:
The queue <insert_3> could not be found, it may not have been created.

Response:
Ensure that the name of the queue specified is correct, queue names are case sensitive. If the queue is not created, use the runmqsc command to create it. Then try the command again.

AMQ8397Unexpected error <insert_1> was received by the application.
Severity:
40 : Stop Error

Explanation:
The error <insert_1> was returned unexpectedly to the application.

Response:
Save the generated output files and contact your IBM support center.

AMQ8398Unexpected exception received from .NET Framework 
<insert_3>
Severity:
40 : Stop Error

Explanation:
The application received an exception from the underlying .NET framework, information about the exception follows: 
<insert_4>

Response:
Examine the information contained within the exception to determine if it is possible to resolve locally. 
If it is not possible to resolve the problem locally, save the generated output files and contact your IBM support center.

AMQ8401<insert_1> MQSC commands read.
Severity:
0 : Information

Explanation:
The MQSC script contains <insert_1> commands.

Response:
None.

AMQ8402<insert_1> commands have a syntax error.
Severity:
0 : Information

Explanation:
The MQSC script contains <insert_1> commands having a syntax error.

Response:
None.

AMQ8403<insert_1> valid MQSC commands could not be processed.
Severity:
0 : Information

Explanation:
The MQSC script contains <insert_1> commands that failed to process.

Response:
None.

AMQ8404Command failed.
Severity:
0 : Information

Explanation:
An MQSC command has been recognized, but cannot be processed.

Response:
None.

AMQ8405Syntax error detected at or near end of command segment below:-
Severity:
0 : Information

Explanation:
The MQSC script contains <insert_1> commands having a syntax error.

Response:
None.

AMQ8406Unexpected 'end of input' in MQSC.
Severity:
0 : Information

Explanation:
An MQSC command contains a continuation character, but the 'end of input' has been reached without completing the command.

Response:
None.

AMQ8407Display Process details.
Severity:
0 : Information

Explanation:
The MQSC DISPLAY PROCESS command completed successfully, and details follow this message.

Response:
None.

AMQ8408Display Queue Manager details.
Severity:
0 : Information

Explanation:
The MQSC DISPLAY QMGR command completed successfully, and details follow this message.

Response:
None.

AMQ8409Display Queue details.
Severity:
0 : Information

Explanation:
The MQSC DISPLAY QUEUE command completed successfully, and details follow this message.

Response:
None.

AMQ8410Parser Error.
Severity:
0 : Information

Explanation:
The MQSC Parser has an internal error.

Response:
None.

AMQ8411Duplicate Keyword Error.
Severity:
0 : Information

Explanation:
A command in the MQSC script contains duplicate keywords.

Response:
None.

AMQ8412Numeric Range Error.
Severity:
0 : Information

Explanation:
The value assigned to an MQSC command keyword is out of the permitted range.

Response:
None.

AMQ8413String Length Error.
Severity:
0 : Information

Explanation:
A string assigned to an MQSC keyword is either NULL, or longer than the maximum permitted for that keyword.

Response:
None.

AMQ8414Display Channel details.
Severity:
0 : Information

Explanation:
The MQSC DISPLAY CHL command completed successfully, and details follow this message.

Response:
None.

AMQ8415Ping WebSphere MQ Queue Manager command complete.
Severity:
0 : Information

Explanation:
The MQSC PING QMGR command completed successfully.

Response:
None.

AMQ8416MQSC timed out waiting for a response from the command server.
Severity:
0 : Information

Explanation:
MQSC did not receive a response message from the remote command server in the time specified.

Response:
None.

AMQ8417Display Channel Status details.
Severity:
0 : Information

Explanation:
The MQSC DISPLAY CHANNEL STATUS command completed successfully, and details follow this message.

Response:
None.

AMQ8418<insert_1> command responses received.
Severity:
0 : Information

Explanation:
Running in queued mode, <insert_1> command responses were received from the remote command server.

Response:
None.

AMQ8419The Queue is already in the DCE cell.
Severity:
0 : Information

Explanation:
The Queue is already in the cell, that is, its SCOPE attribute is already CELL.

Response:
None.

AMQ8420Channel Status not found.
Severity:
0 : Information

Explanation:
No status was found for the specified channel(s).

Response:
None.

AMQ8421A required keyword was not specified.
Severity:
0 : Information

Explanation:
A keyword required in this command was not specified.

Response:
None.

AMQ8422MQSC found the following response to a previous command on the reply q :-
Severity:
0 : Information

Explanation:
MQSC found additional command responses on the reply q. They will fill follow this message.

Response:
None.

AMQ8423Cell Directory not available.
Severity:
0 : Information

Explanation:
The DCE cell directory is not available, so the requested operation has failed.

Response:
None.

AMQ8424Error detected in a name keyword.
Severity:
0 : Information

Explanation:
A keyword in an MQSC command contained a name string which was not valid. This may be because it contained characters which are not accepted in MQ names. Typical keywords which can produce this error are QLOCAL (and the other q types), CHANNEL, XMITQ, INITQ, MCANAME etc.

Response:
None.

AMQ8425Attribute value error.
Severity:
0 : Information

Explanation:
A keyword in an MQSC command contained a value that was not valid.

Response:
None.

AMQ8426Valid MQSC commands are:
Severity:
0 : Information

Explanation:
The text shows valid MQSC commands.

Response:
None.

AMQ8427Valid syntax for the MQSC command:
Severity:
0 : Information

Explanation:
The text shown is the valid syntax for the MQSC command.

Response:
None.

AMQ8428TYPE Keyword has already been specified.
Severity:
0 : Information

Explanation:
The TYPE has already been specified after the DISPLAY verb, for example DISPLAY QUEUE(*) type(QLOCAL) type(QALIAS).

Response:
Delete the second TYPE keyword and run the command again.

AMQ8429 (iSeries)Error detected in a exit parameter.
Severity:
0 : Information

Explanation:
A syntax error occurred an the exit parameter. This may be because it contained characters which are not accepted as exit names. Check the parameters in the MSGEXIT, RCVEXIT, SCYEXIT and SENDEXIT definitions.

Response:
None.

AMQ8430Remote queue manager name is unknown.
Severity:
0 : Information

Explanation:
The Remote queue manager name is not known to this queue manager. Check that a transmission queue of the same name as the remote queue manager name exists.

Response:
Create a transmission queue of the same name as the remote queue manager if one does not exist.

AMQ8431Transmission queue does not exist
Severity:
0 : Information

Explanation:
The transmission queue does not exist on this queue manager.

Response:
None.

AMQ8432You are not allowed to set both the REPOS and REPOSNL fields.
Severity:
0 : Information

Explanation:
An attempt to set both the REPOS and REPOSNL fields has been made. Only one of these fields can have a value other than blank. Both of the fields may be blank.

Response:
None.

AMQ8433You are not allowed to set both the CLUSTER and CLUSNL fields.
Severity:
0 : Information

Explanation:
An attempt to set both the CLUSTER and CLUSNL fields has been made. Only one of these fields can have a value other than blank. Both of the fields may be blank.

Response:
None.

AMQ8434The repository is unavailable.
Severity:
0 : Information

Explanation:
The repository is unavailable and the data cannot be accessed. Stop and restart the queue manager.

Response:
None.

AMQ8435All valid MQSC commands were processed.
Severity:
0 : Information

Explanation:
The MQSC script contains no commands that failed to process.

Response:
None.

AMQ8436One valid MQSC command could not be processed.
Severity:
0 : Information

Explanation:
The MQSC script contains one command that failed to process.

Response:
None.

AMQ8437No MQSC commands read.
Severity:
0 : Information

Explanation:
The MQSC script contains no commands.

Response:
None.

AMQ8438One MQSC command read.
Severity:
0 : Information

Explanation:
The MQSC script contains one command.

Response:
None.

AMQ8439No commands have a syntax error.
Severity:
0 : Information

Explanation:
The MQSC script contains no commands having a syntax error.

Response:
None.

AMQ8440One command has a syntax error.
Severity:
0 : Information

Explanation:
The MQSC script contains one command which has a syntax error.

Response:
None.

AMQ8441Display Cluster Queue Manager details.
Severity:
0 : Information

Explanation:
The MQSC DISPLAY CLUSQMG command completed successfully, and details follow this message.

Response:
None.

AMQ8442USAGE can not be set to XMITQ with either the CLUSTER or CLUSNL fields set.
Severity:
0 : Information

Explanation:
An attempt has been made to set USAGE to XMITQ when the CLUSTER or CLUSNL field has a value. Change the value of USAGE, or set the CLUSTER and CLUSNL fields to blank, and try the command again.

Response:
None.

AMQ8442 (iSeries)USAGE can not be set to *TMQ with either the CLUSTER or CLUSNL fields set.
Severity:
0 : Information

Explanation:
An attempt has been made to set USAGE to *TMQ when the CLUSTER or CLUSNL field has a value. Change the value of USAGE, or set the CLUSTER and CLUSNL fields to blank, and try the command again.

Response:
None.

AMQ8443Only the CLUSTER or CLUSNL field may have a value.
Severity:
0 : Information

Explanation:
An attempt has been made to set both CLUSTER and CLUSNL fields. One and only one of the fields may have a value, the other field must be blank. Change the value of one of the fields to blank and try the command again.

Response:
None.

AMQ8444The CLUSTER or CLUSNL fields must have a value.
Severity:
0 : Information

Explanation:
Both the CLUSTER and CLUSNL fields are blank. One and only one of the fields may be blank, the other field must be a value. Change one of the fields from blank to a value and try the command again.

Response:
None.

AMQ8445Program cannot open queue manager object.
Severity:
30 : Severe error

Explanation:
An attempt to open a queue manager object has failed.

Response:
See the previously listed messages in the job log.

AMQ8446Channel is currently active.
Severity:
30 : Severe error

Explanation:
The requested operation failed because the channel is currently active.

Response:
See the previously listed messages in the job log.

AMQ8447Requested operation on channel <insert_5> not valid for this channel type.
Severity:
30 : Severe error

Explanation:
The operation requested cannot be performed because channel <insert_5> is not of a suitable type. For example, only sender, server and cluster-sender channels can be resolved.

Response:
Check that the correct operation was requested. If it was, check that the correct channel name was specified.

AMQ8448Channel <insert_5> is not running.
Severity:
30 : Severe error

Explanation:
A request to end channel <insert_5> has failed because the channel is not running.

Response:
Check that the correct operation was requested. If it was, check that the correct channel name was specified.

AMQ8449Queue <insert_5> inhibited for MQGET.
Severity:
30 : Severe error

Explanation:
An MQGET failed because the queue <insert_5> had been previously inhibited for MQGET.

Response:
None.

AMQ8450Display queue status details.
Severity:
0 : Information

Explanation:
The MQSC DISPLAY QSTATUS command completed successfully. Details follow this message.

AMQ8451 (iSeries)STATUS(*STOPPED) not allowed with CONNAME specified.
Severity:
0 : Information

Explanation:
The STATUS(*STOPPED) parameter is not allowed when specifying CONNAME on the ENDMQMCHL command.

Response:
Remove the CONNAME parameter from the command or, specify STATUS(*INACTIVE) to end the channel instance for the specified connection name.

AMQ8452 (iSeries)STATUS(*STOPPED) not allowed with RQMNAME specified.
Severity:
0 : Information

Explanation:
The STATUS(*STOPPED) parameter is not allowed when specifying RQMNAME on the ENDMQMCHL command.

Response:
Remove the RQMNAME parameter from the command or, specify STATUS(*INACTIVE) to end the channel instance for the specified remote queue manager.

AMQ8453The path <insert_3> is invalid
Severity:
20 : Error

Explanation:
You typed a path which was not syntactically correct for the operating system you are running WebSphere MQ on.

Response:
Determine the correct syntax of a path name for the operating system you are running WebSphere MQ on and use this information to type in a valid path.

AMQ8454Syntax error found in parameter <insert_3>.
Severity:
20 : Error

Explanation:
The data you entered for <insert_3> does not conform to the syntax rules laid down by WebSphere MQ for this parameter.

Response:
Carefully check the data entered for this parameter in conjunction with the WebSphere MQ Command Reference to determine the cause of error.

AMQ8455Password length error
Severity:
20 : Error

Explanation:
The password string length is rounded up by WebSphere MQ to the nearest eight bytes. This rounding causes the total length of the SSLCRYP string to exceed its maximum.

Response:
Decrease the size of the password, or of earlier fields in the SSLCRYP string.

AMQ8456Conflicting parameters in command.
Severity:
20 : Error

Explanation:
The command contains parameters that cannot be used together.

Response:
Refer to the WebSphere MQ Script (MQSC) Command Reference to determine an allowable combination of parameters for this command.

AMQ8457WebSphere MQ connection stopped.
Severity:
0 : Information

Explanation:
The STOP CONN command successfully stopped the connection that was specified.

Response:
None.

AMQ8458WebSphere MQ connection not stopped.
Severity:
0 : Information

Explanation:
The STOP CONN command could not stop the connection that was specified.

Response:
None.

AMQ8459Not Found.
Severity:
0 : Information

Explanation:
You specified an identifier that was not found. Please try the command again and supply a valid identifier.

Response:
None.

AMQ8460Syntax error in connection identifier.
Severity:
0 : Information

Explanation:
You specified an invalid connection identifier. A valid connection identifier contains 16 hex characters, where all of the characters in the connection identifier should lie within the range 0-9, a-z or A-Z.

Response:
Correct the connection identifier so that it conforms to the above specification.

AMQ8461Connection identifier not found.
Severity:
0 : Information

Explanation:
You specified a connection identifier which is not associated with this queue manager.

Response:
Correct the connection identifier so that it describes a connection identifier which is associated with this queue manager. Use the command DISPLAY CONN to identify potential connection identifiers to use with this command.

AMQ8498Starting MQSC for queue manager <insert_3>.
Severity:
0 : Information

Explanation:
The MQSC script contains <insert_1> commands.

Response:
None.

AMQ8499Usage: runmqsc [-e] [-v] [-w WaitTime [-x]] QMgrName
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ8500WebSphere MQ Display MQ Files
Severity:
0 : Information

AMQ8501Common services initialization failed with return code <insert_1>.
Severity:
20 : Error

Explanation:
A request by the command server to initialize common services failed with return code <insert_1>.

Response:
None.

AMQ8502Connect shared memory failed with return code <insert_1>.
Severity:
20 : Error

Explanation:
A request by the command server to connect shared memory failed with return code <insert_1>.

Response:
None.

AMQ8503Post event semaphore failed with return code <insert_1>.
Severity:
20 : Error

Explanation:
A request by the command server to post an event semaphore failed with return code <insert_1>.

Response:
None.

AMQ8504Command server MQINQ failed with reason code <insert_1>.
Severity:
20 : Error

Explanation:
An MQINQ request by the command server, for the WebSphere MQ queue <insert_3>, failed with reason code <insert_1>.

Response:
None.

AMQ8505Reallocate memory failed with return code <insert_1>.
Severity:
20 : Error

Explanation:
A request by the command server to reallocate memory failed with return code <insert_1>.

Response:
None.

AMQ8506Command server MQGET failed with reason code <insert_1>.
Severity:
20 : Error

Explanation:
An MQGET request by the command server, for the WebSphere MQ queue <insert_3>, failed with reason code <insert_1>.

Response:
None.

AMQ8507Command server MQPUT1 request for an undelivered message failed with reason code <insert_1>.
Severity:
20 : Error

Explanation:
An attempt by the command server to put a message to the dead-letter queue, using MQPUT1, failed with reason code <insert_1>. The MQDLH reason code was <insert_2>.

Response:
None.

AMQ8508Queue Manager Delete Object List failed with return code <insert_1>.
Severity:
20 : Error

Explanation:
A request by the command server to delete a queue manager object list failed with return code <insert_1>.

Response:
None.

AMQ8509Command server MQCLOSE reply-to queue failed with reason code <insert_1>.
Severity:
20 : Error

Explanation:
An MQCLOSE request by the command server for the reply-to queue failed with reason code <insert_1>.

Response:
None.

AMQ8510Command server queue is open, try again later.
Severity:
30 : Severe error

AMQ8511Usage: strmqcsv [QMgrName]
Severity:
0 : Information

AMQ8512Usage: endmqcsv [-c | -i] QMgrName
Severity:
0 : Information

AMQ8513Usage: dspmqcsv [QMgrName]
Severity:
0 : Information

AMQ8514No response received after <insert_1> seconds.
Severity:
20 : Error

Explanation:
The command server has not reported the status of running, to the start request, before the timeout of <insert_1> seconds was reached.

Response:
None.

AMQ8549Total string length exceeds the maximum value of 999 characters.
Severity:
0 : Information

Explanation:
The total length of a channel exit string is 999 characters. The string list assigned to an MQSC keyword is longer than the maximum value of 999 characters permitted for that keyword.

Response:
None.

AMQ8550Display namelist details.
Severity:
0 : Information

Explanation:
The MQSC DISPLAY NAMELIST command completed successfully, and details follow this message.

Response:
None.

AMQ8551WebSphere MQ namelist changed.
Severity:
0 : Information

Explanation:
WebSphere MQ namelist <insert_5> changed.

Response:
None.

AMQ8552WebSphere MQ namelist created.
Severity:
0 : Information

Explanation:
WebSphere MQ namelist <insert_5> created.

Response:
None.

AMQ8553WebSphere MQ namelist deleted.
Severity:
0 : Information

Explanation:
WebSphere MQ namelist <insert_5> deleted.

Response:
None.

AMQ8554String List String Count Error.
Severity:
0 : Information

Explanation:
The number of strings within the stringlist is greater than the maximum number allowed for the keyword. Reduce the number of strings within the list and try the command again.

Response:
None.

AMQ8555String List String Length Error.
Severity:
0 : Information

Explanation:
A string in a string list assigned to a keyword is longer than the maximum permitted for that keyword.

Response:
None.

AMQ8556RESUME QUEUE MANAGER accepted.
Severity:
0 : Information

Explanation:
The RESUME QUEUE MANAGER command has been accepted for processing. The command will be sent to the repository which will process the command and notify all other repositories that this queue manager is now back in the cluster.

Response:
None.

AMQ8557SUSPEND QUEUE MANAGER accepted.
Severity:
0 : Information

Explanation:
The SUSPEND QUEUE MANAGER command has been accepted for processing. The command will be sent to the repository which will process the command and notify all other repositories that this queue manager is leaving the cluster.

Response:
None.

AMQ8558REFRESH CLUSTER accepted.
Severity:
0 : Information

Explanation:
The REFRESH CLUSTER command has been accepted for processing. The command will be sent to the Repository which will process the command and notify all other repositories that the Cluster needs refreshing.

Response:
None.

AMQ8559RESET CLUSTER accepted.
Severity:
0 : Information

Explanation:
The RESET CLUSTER command has been accepted for processing. The command will be sent to the Repository which will process the command and notify all other repositories that the Cluster needs resetting.

Response:
None.

AMQ8560WebSphere MQ security cache refreshed.
Severity:
0 : Information

Explanation:
The Object Authority Manager security cache has been refreshed.

Response:
None.

AMQ8561 (Windows)Domain controller unavailable.
Severity:
10 : Warning

Explanation:
WebSphere MQ was unable to contact the domain controller to obtain information for user <insert_3>.

Response:
Ensure that a domain controller for the domain on which user <insert_3> is defined is available. Alternatively, if you are using a computer which is not currently connected to the network and have logged on using a domain user ID, you may wish to log on using a local user ID instead.

AMQ8563WebSphere MQ authentication information object created.
Severity:
0 : Information

Explanation:
WebSphere MQ authentication information object <insert_5> created.

Response:
None.

AMQ8564WebSphere MQ authentication information object deleted.
Severity:
0 : Information

Explanation:
WebSphere MQ authentication information object <insert_5> deleted.

Response:
None.

AMQ8565Queue Status not found.
Severity:
0 : Information

Explanation:
Queue Status for the specified queue could not be found.

Response:
None.

AMQ8566Display authentication information details.
Severity:
0 : Information

Explanation:
The MQSC DISPLAY AUTHINFO command completed successfully. Details follow this message.

Response:
None.

AMQ8567WebSphere MQ authentication information changed.
Severity:
0 : Information

Explanation:
WebSphere MQ authentication information <insert_5> changed.

Response:
None.

AMQ8568 (iSeries)No authinfo objects to display.
Severity:
0 : Information

Explanation:
There are no matching authinfo objects defined on this system.

Response:
Using the DEFINE AUTHINFO command to create an authinfo object.

AMQ8569Error in filter specification
Severity:
0 : Information

Explanation:
You specified an invalid filter. Check the WHERE statement and make sure that the operator is valid for the type of parameter, that the parameter can be filtered on, and that the value that you specified for the filter is valid for the type of attribute you are filtering on.

Response:
None.

AMQ8570Attribute value error in <insert_3>.
Severity:
0 : Information

Explanation:
The keyword <insert_3> contained a value that was not valid for this configuration. Please check the MQSC Command Reference to determine valid values for <insert_3>.

Response:
None.

AMQ8601WebSphere MQ trigger monitor started.
Severity:
0 : Information

Explanation:
The WebSphere MQ trigger monitor has been started.

Response:
None.

AMQ8601 (iSeries)WebSphere MQ trigger monitor started.
Severity:
0 : Information

Explanation:
The trigger monitor has been started with initiation queue <insert_3>.

Response:
None.

AMQ8602WebSphere MQ trigger monitor ended.
Severity:
0 : Information

Explanation:
The WebSphere MQ trigger monitor has ended.

Response:
None.

AMQ8603Usage: runmqtrm [-m QMgrName] [-q InitQ]
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ8604Use of WebSphere MQ trigger monitor not authorized.
Severity:
0 : Information

Explanation:
The trigger monitor cannot be run due to lack of authority to the requested queue manager or initiation queue.

Response:
Obtain the necessary authority from your security officer or WebSphere MQ administrator. Then try the command again.

AMQ8605Queue manager not available to the WebSphere MQ trigger monitor
Severity:
0 : Information

Explanation:
The queue manager specified for the trigger monitor does not exist, or is not active.

Response:
Check that you named the correct queue manager. Ask your systems administrator to start it, if it is not active. Then try the command again.

AMQ8606Insufficient storage available for the WebSphere MQ trigger monitor.
Severity:
0 : Information

Explanation:
There was insufficient storage available for the WebSphere MQ trigger monitor to run.

Response:
Free some storage and then try the command again.

AMQ8607WebSphere MQ trigger monitor connection failed.
Severity:
0 : Information

Explanation:
The trigger monitor's connection to the requested queue manager failed because of MQI reason code <insert_1> from MQCONN.

Response:
Consult your systems administrator about the state of the queue manager.

AMQ8608WebSphere MQ trigger monitor connection broken.
Severity:
0 : Information

Explanation:
The connection to the queue manager failed while the trigger monitor was running. This may be caused by an endmqm command being issued by another user, or by a queue manager error.

Response:
Consult your systems administrator about the state of the queue manager.

AMQ8609Initiation queue missing or wrong type
Severity:
0 : Information

Explanation:
The named initiation queue could not be found; or the queue type is not correct for an initiation queue.

Response:
Check that the named queue exists, and is a local queue, or that the named queue is an alias for a local queue which exists.

AMQ8610Initiation queue in use
Severity:
0 : Information

Explanation:
The WebSphere MQ trigger monitor could not open the initiation queue because the queue is open for exclusive use by another application.

Response:
Wait until the queue is no longer in use, and try the command again.

AMQ8611Initiation queue could not be opened.
Severity:
0 : Information

Explanation:
The WebSphere MQ trigger monitor could not open the initiation queue; reason code <insert_1> was returned from MQOPEN.

Response:
Consult your systems administrator.

AMQ8612Waiting for a trigger message
Severity:
0 : Information

Explanation:
The WebSphere MQ trigger monitor is waiting for a message to arrive on the initiation queue.

Response:
None.

AMQ8613Initiation queue changed or deleted
Severity:
0 : Information

Explanation:
The WebSphere MQ trigger monitor is unable to continue because the initiation queue has been deleted or changed since it was opened.

Response:
Retry the command.

AMQ8614Initiation queue not enabled for input.
Severity:
0 : Information

Explanation:
The WebSphere MQ trigger monitor cannot read from the initiation queue because input is not enabled.

Response:
Ask your systems administrator to enable the queue for input.

AMQ8615WebSphere MQ trigger monitor failed to get message.
Severity:
0 : Information

Explanation:
The WebSphere MQ trigger monitor failed because of MQI reason code <insert_1> from MQGET.

Response:
Consult your systems administrator.

AMQ8616End of application trigger.
Severity:
0 : Information

Explanation:
The action to trigger an application has been completed.

Response:
None.

AMQ8617Not a valid trigger message.
Severity:
0 : Information

Explanation:
The WebSphere MQ trigger monitor received a message that is not recognized as a valid trigger message. It has been written to the undelivered message queue.

Response:
Consult your systems administrator.

AMQ8618Error starting triggered application.
Severity:
0 : Information

Explanation:
An error was detected when trying to start the application identified in a trigger message.

Response:
Check that the application the trigger monitor was trying to start is available.

AMQ8619Application type <insert_1> not supported.
Severity:
0 : Information

Explanation:
A trigger message was received which specifies application type <insert_1>; the trigger monitor does not support this type.

Response:
Use an alternative trigger monitor for this initiation queue.

AMQ8620Trigger message with warning <insert_1>
Severity:
0 : Information

Explanation:
The trigger monitor received a message with a warning. For example, it may have been truncated or it could not be converted to the trigger monitor's data representation. The reason code for the warning is <insert_1>.

Response:
None.

AMQ8621Usage: runmqtmc [-m QMgrName] [-q InitQ]
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ8622Usage: CICS-Transaction-Name [MQTMC2 structure]
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ8623WebSphere MQ listener changed.
Severity:
0 : Information

Explanation:
WebSphere MQ listener <insert_5> changed.

Response:
None.

AMQ8624WebSphere MQ service changed.
Severity:
0 : Information

Explanation:
WebSphere MQ service <insert_5> changed.

Response:
None.

AMQ8625WebSphere MQ service created.
Severity:
0 : Information

Explanation:
WebSphere MQ service <insert_5> created.

Response:
None.

AMQ8626WebSphere MQ listener created.
Severity:
0 : Information

Explanation:
WebSphere MQ listener <insert_5> created.

Response:
None.

AMQ8627WebSphere MQ service object deleted.
Severity:
0 : Information

Explanation:
WebSphere MQ service object <insert_5> deleted.

Response:
None.

AMQ8628WebSphere MQ listener object deleted.
Severity:
0 : Information

Explanation:
WebSphere MQ listener object <insert_5> deleted.

Response:
None.

AMQ8629Display service information details.
Severity:
0 : Information

Explanation:
The MQSC DISPLAY SERVICE command completed successfully. Details follow this message.

Response:
None.

AMQ8630Display listener information details.
Severity:
0 : Information

Explanation:
The MQSC DISPLAY LISTENER command completed successfully. Details follow this message.

Response:
None.

AMQ8631Display listener status details.
Severity:
0 : Information

Explanation:
The MQSC DISPLAY LSSTATUS command completed successfully. Details follow this message.

AMQ8632Display service status details.
Severity:
0 : Information

Explanation:
The MQSC DISPLAY SVSTATUS command completed successfully. Details follow this message.

AMQ8649Reset WebSphere MQ Queue Manager accepted.
Severity:
0 : Information

Explanation:
The MQSC RESET QMGR command completed successfully. Details follow this message.

Response:
None.

AMQ8650Activity information unavailable.
Severity:
0 : Information

Explanation:
The DSPMQRTE command was expecting activity information but it was unavailable. This does not always constitute an error. Reasons why the activity information is unavailable include the following: 
1) One of the queue managers on the route did not support trace-route messaging. 
2) One of the queue managers on the route did not allow route information to be returned to the reply queue. See the documentation on the ActivityRecording and TraceRouteRecording queue manager attributes for more details. 
3) The report could not a find route back to the reply queue.

Response:
Try and determine whether the activity information should have been available. Running the command with the 'outline' verbosity option (used with the -v flag) may be useful in determining where the message was when the activity information was generated.

AMQ8650 (iSeries)Activity information unavailable.
Severity:
0 : Information

Explanation:
The DSPMQMRTE command was expecting activity information but it was unavailable. This does not always constitute an error. Reasons why the activity information is unavailable include the following: 
1) One of the queue managers on the route did not support trace-route messaging. 
2) One of the queue managers on the route did not allow route information to be returned to the reply queue. See the documentation on the ActivityRecording and TraceRouteRecording queue manager attributes for more details. 
3) The report could not a find route back to the reply queue.

Response:
Try and determine whether the activity information should have been available. Running the command with DSPINF(*ALL) may be useful in determining where the message was when the activity information was generated.

AMQ8651DSPMQRTE command has finished with errors.
Severity:
0 : Information

Explanation:
The DSPMQRTE command has finished processing your request but an execution error was detected. Previous messages issued by the command can be used to identify the error.

Response:
Refer to previous messages issued by the command.

AMQ8651 (iSeries)DSPMQMRTE command has finished with errors.
Severity:
0 : Information

Explanation:
The DSPMQMRTE command has finished processing your request but an execution error was detected. Previous messages issued by the command can be used to identify the error.

Response:
Refer to previous messages issued by the command.

AMQ8652DSPMQRTE command has finished.
Severity:
0 : Information

Explanation:
The DSPMQRTE command has finished processing your request and no execution errors were detected.

Response:
None.

AMQ8652 (iSeries)DSPMQMRTE command has finished.
Severity:
0 : Information

Explanation:
The DSPMQMRTE command has finished processing your request and no execution errors were detected.

Response:
None.

AMQ8653DSPMQRTE command started with options <insert_3>.
Severity:
0 : Information

Explanation:
You have started the DSPMQRTE command with command line options <insert_3> and the command is now processing your request.

Response:
Wait for the command to finish processing your request. Any further messages that are issued can be used to determine the outcome of the request.

AMQ8653 (iSeries)DSPMQMRTE command started.
Severity:
0 : Information

Explanation:
You have started the DSPMQMRTE command and the command is now processing your request.

Response:
Wait for the command to finish processing your request. Any further messages that are issued can be used to determine the outcome of the request.

AMQ8654Trace-route message arrived on queue manager <insert_3>.
Severity:
0 : Information

Explanation:
The DSPMQRTE command has received confirmation of the successful arrival of the trace-route message at its destination queue on queue manager <insert_3>.

Response:
None.

AMQ8654 (iSeries)Trace-route message arrived on queue manager <insert_3>.
Severity:
0 : Information

Explanation:
The DSPMQMRTE command has received confirmation of the successful arrival of the trace-route message at its destination queue on queue manager <insert_3>.

Response:
None.

AMQ8655Trace-route message expired.
Severity:
0 : Information

Explanation:
The DSPMQRTE command has received confirmation that the trace-route message has expired.

Response:
The expiry interval of trace-route messages generated by the DSPMQRTE command can be altered using the -xs option if this is required.

AMQ8655 (iSeries)Trace-route message expired.
Severity:
0 : Information

Explanation:
The DSPMQMRTE command has received confirmation that the trace-route message has expired.

Response:
The expiry interval of trace-route messages generated by the DSPMQMRTE command can be altered using the EXPIRY parameter if this is required.

AMQ8656DSPMQRTE command received an exception report from queue manager <insert_4> with feedback <insert_1> <insert_3>.
Severity:
0 : Information

Explanation:
The DSPMQRTE command trace-route message caused an exception on queue manager <insert_4>. The Feedback field in the report was <insert_1> or <insert_3>.

Response:
Use the feedback given to determine why the trace-route message caused the exception.

AMQ8656 (iSeries)DSPMQMRTE command received an exception report from queue manager <insert_4> with feedback <insert_1> <insert_3>.
Severity:
0 : Information

Explanation:
The DSPMQMRTE command trace-route message caused an exception on queue manager <insert_4>. The Feedback field in the report was <insert_1> or <insert_3>.

Response:
Use the feedback given to determine why the trace-route message caused the exception.

AMQ8657DSPMQRTE command used <insert_3> 0x<insert_4>.
Severity:
0 : Information

Explanation:
You started the DSPMQRTE command specifying that it should generate a trace-route message. This took place and the trace-route message had <insert_3> X<insert_4>.

Response:
The <insert_3> can be used to retrieve responses to this trace-route request. Run the DSPMQRTE command again specifying this identifier with the -i flag and with the target queue specified as the queue where the responses are expected to return or where the trace-route message is expected to have arrived. This may be on another queue manager.

AMQ8657 (iSeries)DSPMQMRTE command used <insert_3> 0x<insert_4>.
Severity:
0 : Information

Explanation:
You started the DSPMQMRTE command specifying that it should generate a trace-route message. This took place and the trace-route message had <insert_3> X<insert_4>.

Response:
The <insert_3> can be used to retrieve responses to this trace-route request. Run the DSPMQMRTE command again specifying this identifier for CRLLID and with the target queue specified as the queue where the responses are expected to return or where the trace-route message is expected to have arrived. This may be on another queue manager.

AMQ8658DSPMQRTE command failed to put a message on the target queue.
Severity:
0 : Information

Explanation:
The request for the DSPMQRTE command to put a trace-route message on the target queue was unsuccessful. Previous messages issued by the command can be used to identify why the message could not be put on the target queue.

Response:
Refer to previous messages issued by the command.

AMQ8658 (iSeries)DSPMQMRTE command failed to put a message on the target queue.
Severity:
0 : Information

Explanation:
The request for the DSPMQMRTE command to put a trace-route message on the target queue was unsuccessful. Previous messages issued by the command can be used to identify why the message could not be put on the target queue.

Response:
Refer to previous messages issued by the command.

AMQ8659DSPMQRTE command successfully put a message on queue <insert_3>, queue manager <insert_4>.
Severity:
0 : Information

Explanation:
The request for the DSPMQRTE command to put a message on the target queue was successful. The target queue resolved to <insert_3> on queue manager <insert_4>.

Response:
None.

AMQ8659 (iSeries)DSPMQMRTE command successfully put a message on queue <insert_3>, queue manager <insert_4>.
Severity:
0 : Information

Explanation:
The request for the DSPMQMRTE command to put a message on the target queue was successful. The target queue resolved to <insert_3> on queue manager <insert_4>.

Response:
None.

AMQ8660DSPMQRTE command could not correctly order the following activities:
Severity:
0 : Information

Explanation:
The DSPMQRTE command received the following activities, but they could not be printed in the correct order. This is commonly because an activity report has been received that does not contain a TraceRoute PCF group or is missing the RecordedActivities parameter which would allow it to be ordered correctly.

Response:
Find and correct the application that is generating activity reports without the necessary information for them to be ordered correctly.

AMQ8660 (iSeries)DSPMQMRTE command could not correctly order the following activities:
Severity:
0 : Information

Explanation:
The DSPMQMRTE command received the following activities, but they could not be printed in the correct order. This is commonly because an activity report has been received that does not contain a TraceRoute PCF group or is missing the RecordedActivities parameter which would allow it to be ordered correctly.

Response:
Find and correct the application that is generating activity reports without the necessary information for them to be ordered correctly.

AMQ8661DSPMQRTE command will not put to queue <insert_3>, queue manager <insert_4>.
Severity:
20 : Error

Explanation:
You started the DSPMQRTE command specifying that the trace-route message should not be delivered to a local queue (-d yes was not specified). However, it has been determined that the target queue does not resolve to a transmission queue. Therefore the DSPMQRTE command has chosen not to put the trace-route message to the target queue <insert_3> on queue manager <insert_4>.

Response:
Determine whether it was expected that the target queue would resolve to a local queue.

AMQ8661 (iSeries)DSPMQMRTE command will not put to queue <insert_3>, queue manager <insert_4>.
Severity:
20 : Error

Explanation:
You started the DSPMQMRTE command specifying that the trace-route message should not be delivered to a local queue (DLVRMSG(*NO) was specified). However, it has been determined that the target queue does not resolve to a transmission queue. Therefore the DSPMQMRTE command has chosen not to put the trace-route message to the target queue <insert_3> on queue manager <insert_4>.

Response:
Determine whether it was expected that the target queue would resolve to a local queue.

AMQ8662Trace-route message delivered on queue manager <insert_3>.
Severity:
0 : Information

Explanation:
The DSPMQRTE command has received confirmation of the successful delivery of the trace-route message on queue manager <insert_3> to a requesting application.

Response:
None.

AMQ8662 (iSeries)Trace-route message delivered on queue manager <insert_3>.
Severity:
0 : Information

Explanation:
The DSPMQMRTE command has received confirmation of the successful delivery of the trace-route message on queue manager <insert_3> to a requesting application.

Response:
None.

AMQ8663Client connection not supported in this environment.
Severity:
20 : Error

Explanation:
An attempt was made to connect to a queue manager using a client connection. However, client connections are not supported in your environment.

Response:
Connect to the queue manager using a server connection.

AMQ8664DSPMQRTE command could not connect to queue manager <insert_3>.
Severity:
20 : Error

Explanation:
You started the DSPMQRTE command specifying that it should connect to queue manager <insert_3>. The command could not connect to that queue manager. Previous messages issued by the command can be used to identify the error.

Response:
Refer to previous messages issued by the command.

AMQ8664 (iSeries)DSPMQMRTE command could not connect to queue manager <insert_3>.
Severity:
20 : Error

Explanation:
You started the DSPMQMRTE command specifying that it should connect to queue manager <insert_3>. The command could not connect to that queue manager. Previous messages issued by the command can be used to identify the error.

Response:
Refer to previous messages issued by the command.

AMQ8665DSPMQRTE command was supplied an invalid CorrelId <insert_3>.
Severity:
20 : Error

Explanation:
You started the DSPMQRTE command specifying option -i with a CorrelId <insert_3> that was invalid. The CorrelId was either too long or not in the correct format.

Response:
Refer to the command syntax, and then try the command again.

AMQ8665 (iSeries)DSPMQMRTE command was supplied an invalid CorrelId <insert_3>.
Severity:
20 : Error

Explanation:
You started the DSPMQMRTE command specifying CRLLID with a CorrelId <insert_3> that was invalid.

Response:
Refer to the command syntax, and then try the command again.

AMQ8666Queue <insert_3> on queue manager <insert_4>.
Severity:
0 : Information

Explanation:
The DSPMQRTE command trace-route message has been confirmed as having taken a route involving queue <insert_3> on queue manager <insert_4> in an attempt to reach the destination queue.

Response:
Wait for subsequent messages which may indicate another queue which the message has been routed through.

AMQ8666 (iSeries)Queue <insert_3> on queue manager <insert_4>.
Severity:
0 : Information

Explanation:
The DSPMQMRTE command trace-route message has been confirmed as having taken a route involving queue <insert_3> on queue manager <insert_4> in an attempt to reach the destination queue.

Response:
Wait for subsequent messages which may indicate another queue which the message has been routed through.

AMQ8667DSPMQRTE command could not open reply queue <insert_3>, queue manager <insert_4>.
Severity:
20 : Error

Explanation:
You started the DSPMQRTE command specifying reply queue <insert_3>. However the DSPMQRTE command could not successfully open a queue of that name on queue manager <insert_4>. Previous messages issued by the command can be used to identify the error. If the -rq option was not specified then the reply queue will be a temporary dynamic queue modelled on SYSTEM.DEFAULT.MODEL.QUEUE.

Response:
Refer to previous messages issued by the command. Specify a reply queue that can be opened and then retry the command.

AMQ8667 (iSeries)DSPMQMRTE command could not open reply queue <insert_3>, queue manager <insert_4>.
Severity:
20 : Error

Explanation:
You started the DSPMQMRTE command specifying reply queue <insert_3>. However the DSPMQMRTE command could not successfully open a queue of that name on queue manager <insert_4>. Previous messages issued by the command can be used to identify the error. If the RPLYQ parameter was not specified then the reply queue will be a temporary dynamic queue modelled on SYSTEM.DEFAULT.MODEL.QUEUE.

Response:
Refer to previous messages issued by the command. Specify a reply queue that can be opened and then retry the command.

AMQ8668DSPMQRTE command could not open queue <insert_3>, queue manager <insert_4>.
Severity:
20 : Error

Explanation:
You started the DSPMQRTE command specifying queue <insert_3>, using the -q option. However the DSPMQRTE command could not successfully open a queue of that name on queue manager <insert_4>. Previous messages issued by the command can be used to identify the error.

Response:
Refer to previous messages issued by the command. Specify a queue, using the -q option, that can be opened and then retry the command.

AMQ8668 (iSeries)DSPMQMRTE command could not open queue <insert_3>, queue manager <insert_4>.
Severity:
20 : Error

Explanation:
You started the DSPMQMRTE command specifying queue <insert_3> for the QNAME parameter. However the DSPMQMRTE command could not successfully open a queue of that name on queue manager <insert_4>. Previous messages issued by the command can be used to identify the error.

Response:
Refer to previous messages issued by the command. Specify a queue, using the QNAME parameter, that can be opened and then retry the command.

AMQ8669DSPMQRTE command failed to resolve queue manager <insert_3> on queue manager <insert_4>.
Severity:
20 : Error

Explanation:
The DSPMQRTE command attempted to resolve queue manager <insert_3> (supplied by the -qm option) on queue manager <insert_4> but the attempt failed. The queue specified by the -q option could not be opened.

Response:
Ensure that queue manager <insert_3> can be resolved on queue manager <insert_4> or specify a different queue manager with the -qm option. Retry the command.

AMQ8669 (iSeries)DSPMQMRTE command failed to resolve queue manager <insert_3> on queue manager <insert_4>.
Severity:
20 : Error

Explanation:
The DSPMQMRTE command attempted to resolve queue manager <insert_3> (supplied by the TGTMQM parameter) on queue manager <insert_4> but the attempt failed. The queue specified by the QNAME parameter could not be opened.

Response:
Ensure that queue manager <insert_3> can be resolved on queue manager <insert_4> or specify a different queue manager with the TGTMQM parameter. Retry the command.

AMQ8670Loading of server module <insert_3> failed.
Severity:
20 : Error

Explanation:
An attempt to dynamically load the server module <insert_3> failed. Typically this is because only the client modules are installed.

Response:
Check which modules are installed and retry the command with the -c option specified if applicable.

AMQ8671DSPMQRTE command was not supplied a reply queue when one was required.
Severity:
20 : Error

Explanation:
The DSPMQRTE command was expecting a reply queue specified by the -rq option but no reply queue was specified. Specifying a reply queue is mandatory if both the -n (no display) option and a response generating option (-ar or -ro [activity|coa|cod|exception|expiration]) is specified.

Response:
Specify a reply queue and retry the command.

AMQ8672DSPMQRTE command failed to get a message from queue <insert_3>, queue manager <insert_4>.
Severity:
20 : Error

Explanation:
The DSPMQRTE command attempted to get a message from queue <insert_3>, queue manager <insert_4>, but the attempt failed. Previous messages issued by the command can be used to identify the error.

Response:
Refer to previous messages issued by the command.

AMQ8672 (iSeries)DSPMQMRTE command failed to get a message from queue <insert_3>, queue manager <insert_4>.
Severity:
20 : Error

Explanation:
The DSPMQMRTE command attempted to get a message from queue <insert_3>, queue manager <insert_4>, but the attempt failed. Previous messages issued by the command can be used to identify the error.

Response:
Refer to previous messages issued by the command.

AMQ8673DSPMQRTE command was supplied option <insert_3> with an invalid object name <insert_4>.
Severity:
20 : Error

Explanation:
You started the DSPMQRTE command specifying option <insert_3> with an object name <insert_4> that is invalid. In general, the names of WebSphere MQ objects can have up to 48 characters. An object name can contain the following characters: 
1) Uppercase alphabetic characters (A through Z). 
2) Lowercase alphabetic characters (a through z). 
3) Numeric digits (0 through 9). 
4) Period (.), forward slash (/), underscore (_), percent (%). 
See the WebSphere MQ System Administration documentation for further details and restrictions.

Response:
Specify a valid object name and then try the command again.

AMQ8673 (iSeries)DSPMQMRTE command was supplied with an invalid object name <insert_4>.
Severity:
20 : Error

Explanation:
You started the DSPMQMRTE command specifying an object name <insert_4> that is invalid. In general, the names of WebSphere MQ objects can have up to 48 characters. An object name can contain the following characters: 
1) Uppercase alphabetic characters (A through Z). 
2) Lowercase alphabetic characters (a through z). 
3) Numeric digits (0 through 9). 
4) Period (.), forward slash (/), underscore (_), percent (%). 
See the WebSphere MQ System Administration documentation for further details and restrictions.

Response:
Specify a valid object name and then try the command again.

AMQ8674DSPMQRTE command is now waiting for information to display.
Severity:
0 : Information

Explanation:
The DSPMQRTE command has successfully generated and put the trace-route message and is now waiting for responses to be returned to the reply queue to indicate the route that the trace-route message took to its destination.

Response:
Wait for responses to be returned to the reply queue and for the information about the route to be displayed.

AMQ8674 (iSeries)DSPMQMRTE command is now waiting for information to display.
Severity:
0 : Information

Explanation:
The DSPMQMRTE command has successfully generated and put the trace-route message and is now waiting for responses to be returned to the reply queue to indicate the route that the trace-route message took to its destination.

Response:
Wait for responses to be returned to the reply queue and for the information about the route to be displayed.

AMQ8675DSPMQRTE command was supplied an invalid option <insert_3>.
Severity:
20 : Error

Explanation:
You started the DSPMQRTE command specifying an option of <insert_3> that was not recognized. The command will end.

Response:
Refer to the command syntax and retry the command.

AMQ8676DSPMQRTE command was supplied an invalid combination of options.
Severity:
20 : Error

Explanation:
You started the DSPMQRTE command specifying a combination of the options that is not valid. The -i option cannot be specified with one or more of the following options: -ac, -ar, -d, -f, -l, -n, -o, -p, -qm, -ro, -rq, -rqm, -s, -t, -xs, -xp. The -n option cannot be specified with one or more of the following options: -b, -i, -v, -w. The -ar option can only be specified if the -ac option has also been specified. The -rqm option can only be specified if the -rq option has also been specified.

Response:
Refer to the command documentation and then try the command again.

AMQ8677DSPMQRTE command was supplied an option <insert_3> with conflicting values.
Severity:
20 : Error

Explanation:
You started the DSPMQRTE command specifying values for option <insert_3> that conflict. At least two values were specified for the same option but they conflict with each other. The DSPMQRTE command will end.

Response:
Refer to the command syntax and then try the command again.

AMQ8677 (iSeries)DSPMQMRTE command was supplied a parameter with conflicting values.
Severity:
20 : Error

Explanation:
You started the DSPMQMRTE command specifying values that conflict. At least two values were specified for the same parameter but they conflict with each other. The DSPMQMRTE command will end.

Response:
Refer to the command syntax and then try the command again.

AMQ8678DSPMQRTE command was supplied option <insert_3> with an invalid value <insert_4>.
Severity:
20 : Error

Explanation:
You started the DSPMQRTE command specifying an invalid option value. The <insert_4> value for option <insert_3> is either not recognized or of an incorrect format.

Response:
Refer to the command syntax, and then try the command again.

AMQ8678 (iSeries)DSPMQMRTE command was supplied an invalid value <insert_4>.
Severity:
20 : Error

Explanation:
You started the DSPMQMRTE command specifying an invalid parameter value. Value <insert_4> is either not recognized or of an incorrect format.

Response:
Refer to the command syntax, and then try the command again.

AMQ8679Persistent messages not allowed on reply queue <insert_3>, queue manager <insert_4>.
Severity:
20 : Error

Explanation:
It was specified that the DSPMQRTE command should put a persistent trace-route message on the target queue (see the documentation for the -l option). However, persistent messages are not allowed on the reply queue because it is a temporary dynamic queue and persistent responses were expected to return to it. The trace-route message was not put on the target queue.

Response:
Ensure that the reply queue is not a temporary dynamic queue. Use the -rq option to specify the reply queue.

AMQ8679 (iSeries)Persistent messages not allowed on reply queue <insert_3>, queue manager <insert_4>.
Severity:
20 : Error

Explanation:
It was specified that the DSPMQMRTE command should put a persistent trace-route message on the target queue (see the documentation for the MSGPST parameter). However, persistent messages are not allowed on the reply queue because it is a temporary dynamic queue and persistent responses were expected to return to it. The trace-route message was not put on the target queue.

Response:
Ensure that the reply queue is not a temporary dynamic queue. Use the RPLYQ parameter to specify the reply queue.

AMQ8680DSPMQRTE command failed to open queue manager <insert_3>.
Severity:
20 : Error

Explanation:
The DSPMQRTE command tried to open queue manager <insert_3> for inquire but the open failed. Previous messages issued by the command can be used to identify the error.

Response:
Refer to previous messages issued by the command.

AMQ8680 (iSeries)DSPMQMRTE command failed to open queue manager <insert_3>.
Severity:
20 : Error

Explanation:
The DSPMQMRTE command tried to open queue manager <insert_3> for inquire but the open failed. Previous messages issued by the command can be used to identify the error.

Response:
Refer to previous messages issued by the command.

AMQ8681DSPMQRTE command has detected an error, reason <insert_1> <insert_3>.
Severity:
20 : Error

Explanation:
The DSPMQRTE command has detected an error from an MQI call during the execution of your request. The reason for failure is <insert_1> or <insert_3>.

Response:
See the WebSphere MQ Messages documentation for an explanation of the reason for failure. Follow any correction action and retry the command.

AMQ8681 (iSeries)DSPMQMRTE command has detected an error, reason <insert_1> <insert_3>.
Severity:
20 : Error

Explanation:
The DSPMQMRTE command has detected an error from an MQI call during the execution of your request. The reason for failure is <insert_1> or <insert_3>.

Response:
See the WebSphere MQ Messages documentation for an explanation of the reason for failure. Follow any correction action and retry the command.

AMQ8682Trace-route message processed by application <insert_3> on queue manager <insert_4>.
Severity:
0 : Information

Explanation:
The DSPMQRTE command successfully put a trace-route message on the target queue and it was then delivered by queue manager <insert_4> to application <insert_3> which processed the message.

Response:
Determine if it was expected that this application would process the trace-route message.

AMQ8682 (iSeries)Trace-route message processed by application <insert_3> on queue manager <insert_4>.
Severity:
0 : Information

Explanation:
The DSPMQMRTE command successfully put a trace-route message on the target queue and it was then delivered by queue manager <insert_4> to application <insert_3> which processed the message.

Response:
Determine if it was expected that this application would process the trace-route message.

AMQ8683Trace-route message reached the maximum activities limit of <insert_1>.
Severity:
0 : Information

Explanation:
The DSPMQRTE command trace-route message was rejected after the number of activities of which it was a participant reached the maximum activities limit. The limit was set to <insert_1>. The maximum activities limit is set using the -s option.

Response:
Using the output from the command determine whether it is expected that the trace-route message should have reached the maximum activities limit.

AMQ8683 (iSeries)Trace-route message reached the maximum activities limit of <insert_1>.
Severity:
0 : Information

Explanation:
The DSPMQMRTE command trace-route message was rejected after the number of activities of which it was a participant reached the maximum activities limit. The limit was set to <insert_1>. The maximum activities limit is set using the MAXACTS parameter.

Response:
Using the output from the command determine whether it is expected that the trace-route message should have reached the maximum activities limit.

AMQ8684Trace-route message reached trace-route incapable queue manager <insert_3>.
Severity:
0 : Information

Explanation:
The DSPMQRTE command trace-route message was rejected because it was about to be sent to a queue manager which does not support trace-route messaging. This behaviour was requested because the forwarding options specified on the command only allowed the trace-route message to be forwarded to queue managers which support trace-route messaging. Sending a trace-route message to a queue manager which cannot process it in accordance with its specified options could cause undesirable results, including having the trace-route message be put to a local queue on the remote queue manager. If this is acceptable then the '-f all' option can be specified.

Response:
Retry the command with different forwarding options, if appropriate.

AMQ8684 (iSeries)Trace-route message reached trace-route incapable queue manager <insert_3>.
Severity:
0 : Information

Explanation:
The DSPMQMRTE command trace-route message was rejected because it was about to be sent to a queue manager which does not support trace-route messaging. This behaviour was requested because the forwarding options specified on the command only allowed the trace-route message to be forwarded to queue managers which support trace-route messaging. Sending a trace-route message to a queue manager which cannot process it in accordance with its specified options could cause undesirable results, including having the trace-route message be put to a local queue on the remote queue manager. If this is acceptable then FWDMSG(*ALL) can be specified.

Response:
Retry the command with different forwarding options, if appropriate.

AMQ8685Trace-route message rejected due to invalid forwarding options X<insert_1>.
Severity:
20 : Error

Explanation:
The DSPMQRTE command trace-route message was rejected because one or more of the forwarding options was not recognized and it was in the MQROUTE_FORWARD_REJ_UNSUP_MASK bitmask. The forwarding options, when they were last observed, in hexadecimal were X<insert_1>.

Response:
Change the application that inserted the forwarding options that were not recognized to insert valid and supported forwarding options.

AMQ8685 (iSeries)Trace-route message rejected due to invalid forwarding options X<insert_1>.
Severity:
20 : Error

Explanation:
The DSPMQMRTE command trace-route message was rejected because one or more of the forwarding options was not recognized and it was in the MQROUTE_FORWARD_REJ_UNSUP_MASK bitmask. The forwarding options, when they were last observed, in hexadecimal were X<insert_1>.

Response:
Change the application that inserted the forwarding options that were not recognized to insert valid and supported forwarding options.

AMQ8686Trace-route message rejected due to invalid delivery options X<insert_1>.
Severity:
20 : Error

Explanation:
The DSPMQRTE command trace-route message was rejected because one or more of the delivery options was not recognized and it was in the MQROUTE_DELIVER_REJ_UNSUP_MASK bitmask. The delivery options, when they were last observed, in hexadecimal were X<insert_1>.

Response:
Change the application that inserted the delivery options that were not recognized to insert valid and supported delivery options.

AMQ8686 (iSeries)Trace-route message rejected due to invalid delivery options X<insert_1>.
Severity:
20 : Error

Explanation:
The DSPMQMRTE command trace-route message was rejected because one or more of the delivery options was not recognized and it was in the MQROUTE_DELIVER_REJ_UNSUP_MASK bitmask. The delivery options, when they were last observed, in hexadecimal were X<insert_1>.

Response:
Change the application that inserted the delivery options that were not recognized to insert valid and supported delivery options.

AMQ8687Program ending.
Severity:
0 : Information

Explanation:
The program operation was interrupted by a SIGINT signal on UNIX systems or a CTRL+c/CTRL+BREAK signal on Windows systems. The program is now ending.

Response:
Wait for the program to end.

AMQ8688DSPMQRTE command has detected an unexpected error, reason <insert_1> <insert_3>.
Severity:
20 : Error

Explanation:
The DSPMQRTE command has detected an unexpected error during execution of your request. The reason for failure is <insert_1> or <insert_3>. The WebSphere MQ error recording routine has been called.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ8688 (iSeries)DSPMQMRTE command has detected an unexpected error, reason <insert_1> <insert_3>.
Severity:
20 : Error

Explanation:
The DSPMQMRTE command has detected an unexpected error during execution of your request. The reason for failure is <insert_1> or <insert_3>. The WebSphere MQ error recording routine has been called.

Response:
Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ8689Loading of client module <insert_3> failed.
Severity:
20 : Error

Explanation:
An attempt to dynamically load the client module <insert_3> failed. Typically this is because the client modules are not installed.

Response:
Check which modules are installed and retry the command without the -c option specified, if applicable.

AMQ8701Usage: rcdmqimg [-z] [-l] [-m QMgrName] -t ObjType [GenericObjName]
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ8702Usage: rcrmqobj [-z] [-m QMgrName] -t ObjType [GenericObjName]
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ8703Usage: dspmqfls [-m QMgrName] [-t ObjType] GenericObjName
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ8705Display Queue Manager Status Details.
Severity:
0 : Information

Explanation:
The MQSC DISPLAY QMSTATUS command completed successfully. Details follow this message.

Response:
None.

AMQ8706Request to stop WebSphere MQ Listener accepted.
Severity:
0 : Information

Explanation:
The channel listener program has been requested to stop. This command executes asynchronously so may complete after this message has been displayed.

Response:
Further information on the progress of the request is available in the queue manager error log.

AMQ8707 (iSeries)Start WebSphere MQ DLQ Handler
Severity:
0 : Information

AMQ8708Dead-letter queue handler started to process INPUTQ(<insert_3>).
Severity:
0 : Information

Explanation:
The dead-letter queue handler (runmqdlq) has been started and has parsed the input file without detecting any errors and is about to start processing the queue identified in the message.

Response:
None.

AMQ8708 (iSeries)Dead-letter queue handler started to process INPUTQ(<insert_3>).
Severity:
0 : Information

Explanation:
The dead-letter queue handler (STRMQMDLQ) has been started and has parsed the input file without detecting any errors and is about to start processing the queue identified in the message.

Response:
None.

AMQ8709Dead-letter queue handler ending.
Severity:
0 : Information

Explanation:
The dead-letter queue handler (runmqdlq) is ending because the WAIT interval has expired and there are no messages on the dead-letter queue, or because the queue manager is shutting down, or because the dead-letter queue handler has detected an error. If the dead-letter queue handler has detected an error, an earlier message will have identified the error.

Response:
None.

AMQ8709 (iSeries)Dead-letter queue handler ending.
Severity:
0 : Information

Explanation:
The dead-letter queue handler (STRMQMDLQ) is ending because the WAIT interval has expired and there are no messages on the dead-letter queue, or because the queue manager is shutting down, or because the dead-letter queue handler has detected an error. If the dead-letter queue handler has detected an error, an earlier message will have identified the error.

Response:
None.

AMQ8710Usage: runmqdlq [QName[QMgrName]].
Severity:
0 : Information

Explanation:
Syntax for the usage of runmqdlq.

Response:
None.

AMQ8711 (iSeries)Job <insert_5> has terminated unexpectedly.
Severity:
10 : Warning

Explanation:
Execution of the command <insert_3> caused job <insert_5> to be started, but the job terminated unexpectedly.

Response:
Consult the log for job <insert_5> to determine why it was terminated.

AMQ8721Dead-letter queue message not prefixed by a valid MQDLH.
Severity:
10 : Warning

Explanation:
The dead-letter queue handler (runmqdlq) retrieved a message from the nominated dead-letter queue, but the message was not prefixed by a recognizable MQDLH. This typically occurs because an application is writing directly to the dead-letter queue but is not prefixing messages with a valid MQDLH. The message is left on the dead-letter queue and the dead-letter queue handler continues to process the dead-letter queue. Each time the dead-letter queue handler repositions itself to a position before this message to process messages that could not be processed on a previous scan it will reprocess the failing message and will consequently re-issue this message.

Response:
Remove the invalid message from the dead-letter queue. Do not write messages to the dead-letter queue unless they have been prefixed by a valid MQDLH. If you require a dead-letter queue handler that can process messages not prefixed by a valid MQDLH, you must change the sample program called amqsdlq to cater for your needs.

AMQ8721 (iSeries)Dead-letter queue message not prefixed by a valid MQDLH.
Severity:
10 : Warning

Explanation:
The dead-letter queue handler (STRMQMDLQ) retrieved a message from the nominated dead-letter queue, but the message was not prefixed by a recognizable MQDLH. This typically occurs because an application is writing directly to the dead-letter queue but is not prefixing messages with a valid MQDLH. The message is left on the dead-letter queue and the dead-letter queue handler continues to process the dead-letter queue. Each time the dead-letter queue handler repositions itself to a position before this message to process messages that could not be processed on a previous scan it will reprocess the failing message and will consequently re-issue this message.

Response:
Remove the invalid message from the dead-letter queue. Do not write messages to the dead-letter queue unless they have been prefixed by a valid MQDLH. If you require a dead-letter queue handler that can process messages not prefixed by a valid MQDLH, you must change the sample program called amqsdlq to cater for your needs.

AMQ8722Dead-letter queue handler unable to put message: Rule <insert_1> Reason <insert_2>.
Severity:
10 : Warning

Explanation:
This message is produced by the dead-letter queue handler when it is requested to redirect a message to another queue but is unable to do so. If the reason that the redirect fails is the same as the reason the message was put to the dead-letter queue then it is assumed that no new error has occurred and no message is produced. The retry count for the message will be incremented and the dead-letter queue handler will continue.

Response:
Investigate why the dead-letter queue handler was unable to put the message to the dead-letter queue. The line number of the rule used to determine the action for the message should be used to help identify to which queue the dead-letter queue handler attempted to PUT the message.

AMQ8729The listener could not be stopped at this time.
Severity:
10 : Warning

Explanation:
A request was made to stop a listener, however the listener could not be stopped at this time. Reasons why a listener could not be stopped are: 
The listener has active channels and the communications protocol being used is LU 6.2, SPX or NETBIOS. 
The listener has active channels and the communications protocol being used is TCP/IP and channel threads are restricted to run within the listener process.

Response:
End the channels using the STOP CHANNEL command and reissue the request.

AMQ8730Listener already active.
Severity:
10 : Warning

Explanation:
A request was made to start a listener, however the listener is already running and cannot be started.

Response:
If the listener should not be running then use the STOP LISTENER command to stop the listener before reissuing the command.

AMQ8731Listener not active.
Severity:
10 : Warning

Explanation:
A request was made to stop a listener, however the listener is not running.

Response:
If the listener should be running then use the START LISTENER command to start the listener.

AMQ8732Request to stop Service accepted.
Severity:
0 : Information

Explanation:
The Request to stop the Service has been accepted and is being processed.

Response:
None.

AMQ8733Request to start Service accepted.
Severity:
0 : Information

Explanation:
The Request to start the Service has been accepted and is being processed.

Response:
None.

AMQ8734Command failed - Program could not be started.
Severity:
20 : Error

Explanation:
The command requested was unsuccessful because the program which was defined to be run to complete the action could not be started. 
Reasons why the program could not be started are 
The program does not exist at the specified location. 
The WebSphere MQ user does not have sufficient access to execute the program. 
If STDOUT or STDERR are defined for the program, the WebSphere MQ user does not have sufficient access to the locations specified.

Response:
Check the Queue Manager error logs for further details on the cause of the failure and correct before reissuing the command.

AMQ8735Command failed - Access denied.
Severity:
20 : Error

Explanation:
The command requested was unsuccessful because access was denied attempting to execuete the program defined to run.

Response:
Examine the definition of the object and ensure that the path to program file is correct. If the defined path is correct ensure that the program exists at the location specified and that the WebSphere MQ user has access to execute the program.

AMQ8736Command failed - Program start failed.
Severity:
20 : Error

Explanation:
The command requested was unsuccessful because the attempt to execute the program defined to run was unsuccessful.

Response:
Examine the definition of the object and ensure that the path to program file is correct. If the defined path is correct ensure that the program exists at the location specified and that the WebSphere MQ user has access to execute the program. Further information on the failure may be available in the WebSphere MQ error logs.

AMQ8737Service already active.
Severity:
10 : Warning

Explanation:
A request was made to start a service, however the service is already running and cannot be started.

Response:
If the service should not be running then use the STOP SERVICE command to stop the service before reissuing the command. If the intention is to allow more than one instance of s service to run, then the service definition may be altered to be of SERVTYPE(COMMAND) which allows more than one instance of the service to be executed concurrently, however status of services of type COMMAND is not available from the SVSTAUS command.

AMQ8738Service not active.
Severity:
10 : Warning

Explanation:
A request was made to stop a service, however the service is not running.

Response:
If the service should be running then use the START SERVICE command to start the service.

AMQ8739Stop cannot be executed for service with blank STOPCMD.
Severity:
20 : Error

Explanation:
A request was made to STOP a service, however the service has no Stop Command defined so no action could be taken.

Response:
Examine the definition of the service and if necessary update the definition of the service to include the command to run when STOP is issued. For services of type 'SERVER' the command to run when STOP is executed is stored when the service is started so any alteration to the service definition will have no effect until the service is restarted following the update.

AMQ8740Start cannot be executed for service with blank STARTCMD.
Severity:
20 : Error

Explanation:
A request was made to START a service, however the service has no Start Command defined so no action could be taken.

Response:
Examine the definition of the service and if necessary update the definition of the service to include the command to run when START is issued.

AMQ8741Unable to connect to queue manager.
Severity:
20 : Error

Explanation:
The dead-letter queue handler (runmqdlq) could not connect to queue manager <insert_3>. This message is typically issued when the requested queue manager has not been started or is quiescing, or if the process does not have sufficient authority. The completion code (<insert_1>) and the reason (<insert_2>) can be used to identify the error. The dead-letter queue handler ends.

Response:
Take appropriate action based upon the completion code and reason.

AMQ8741 (iSeries)Unable to connect to queue manager.
Severity:
20 : Error

Explanation:
The dead-letter queue handler (STRMQMDLQ) could not connect to queue manager <insert_3>. This message is typically issued when the requested queue manager has not been started or is quiescing, or if the process does not have sufficient authority. The completion code (<insert_1>) and the reason (<insert_2>) can be used to identify the error. The dead-letter queue handler ends.

Response:
Take appropriate action based upon the completion code and reason.

AMQ8742Unable to open queue manager: CompCode = <insert_1> Reason = <insert_2>.
Severity:
20 : Error

Explanation:
The dead-letter queue handler (runmqdlq) could not open the queue manager object. This message is typically issued because of a resource shortage or because the process does not have sufficient authority. The completion code and the reason can be used to identify the error. The dead-letter queue handler ends.

Response:
Take appropriate action based upon the completion code and reason.

AMQ8742 (iSeries)Unable to open queue manager: CompCode = <insert_1> Reason = <insert_2>.
Severity:
20 : Error

Explanation:
The dead-letter queue handler (STRMQMDLQ) could not open the queue manager object. This message is typically issued because of a resource shortage or because the process does not have sufficient authority. The completion code and the reason can be used to identify the error. The dead-letter queue handler ends.

Response:
Take appropriate action based upon the completion code and reason.

AMQ8743Unable to inquire on queue manager: CompCode = <insert_1> Reason = <insert_2>.
Severity:
20 : Error

Explanation:
The dead-letter queue handler (runmqdlq) could not inquire on the queue manager. This message is typically issued because of a resource shortage or because the queue manager is ending. The completion code and the reason can be used to identify the error. The dead-letter queue handler ends.

Response:
Take appropriate action based upon the completion code and reason.

AMQ8743 (iSeries)Unable to inquire on queue manager: CompCode = <insert_1> Reason = <insert_2>.
Severity:
20 : Error

Explanation:
The dead-letter queue handler (STRMQMDLQ) could not inquire on the queue manager. This message is typically issued because of a resource shortage or because the queue manager is ending. The completion code and the reason can be used to identify the error. The dead-letter queue handler ends.

Response:
Take appropriate action based upon the completion code and reason.

AMQ8744Unable to close queue manager: CompCode = <insert_1> Reason = <insert_2>.
Severity:
20 : Error

Explanation:
The dead-letter queue handler (runmqdlq) could not close the queue manager. This message is typically issued because of a resource shortage or because the queue manager is ending. The completion code and the reason can be used to identify the error. The dead-letter queue handler ends.

Response:
Take appropriate action based upon the completion code and reason.

AMQ8744 (iSeries)Unable to close queue manager: CompCode = <insert_1> Reason = <insert_2>.
Severity:
20 : Error

Explanation:
The dead-letter queue handler (STRMQMDLQ) could not close the queue manager. This message is typically issued because of a resource shortage or because the queue manager is ending. The completion code and the reason can be used to identify the error. The dead-letter queue handler ends.

Response:
Take appropriate action based upon the completion code and reason.

AMQ8745Unable to open dead-letter queue for browse.
Severity:
20 : Error

Explanation:
The dead-letter queue handler (runmqdlq) could not open the dead-letter queue <insert_3> for browsing. This message is typically issued because another process has opened the dead-letter queue for exclusive access, or because an invalid dead-letter queue name was specified. Other possible reasons include resource shortages or insufficient authority. The completion code(<insert_1>) and the reason(<insert_2>) can be used to identify the error. The dead-letter queue handler ends.

Response:
Take appropriate action based upon the completion code and reason.

AMQ8745 (iSeries)Unable to open dead-letter queue for browse.
Severity:
20 : Error

Explanation:
The dead-letter queue handler (STRMQMDLQ) could not open the dead-letter queue <insert_3> for browsing. This message is typically issued because another process has opened the dead-letter queue for exclusive access, or because an invalid dead-letter queue name was specified. Other possible reasons include resource shortages or insufficient authority. The completion code(<insert_1>) and the reason(<insert_2>) can be used to identify the error. The dead-letter queue handler ends.

Response:
Take appropriate action based upon the completion code and reason.

AMQ8746Unable to close dead-letter queue: CompCode = <insert_1> Reason = <insert_2>.
Severity:
20 : Error

Explanation:
The dead-letter queue handler (runmqdlq) could not close the dead-letter queue. This message is typically issued because of a resource shortage or because the queue manager is ending. The completion code and the reason can be used to identify the error. The dead-letter queue handler ends.

Response:
Take appropriate action based upon the completion code and reason.

AMQ8746 (iSeries)Unable to close dead-letter queue: CompCode = <insert_1> Reason = <insert_2>.
Severity:
20 : Error

Explanation:
The dead-letter queue handler (STRMQMDLQ) could not close the dead-letter queue. This message is typically issued because of a resource shortage or because the queue manager is ending. The completion code and the reason can be used to identify the error. The dead-letter queue handler ends.

Response:
Take appropriate action based upon the completion code and reason.

AMQ8747Integer parameter outside permissible range.
Severity:
20 : Error

Explanation:
The integer parameter (<insert_2>) supplied to the dead-letter handler was outside of the valid range for <insert_3> on line <insert_1>.

Response:
Correct the input data and restart the dead-letter queue handler.

AMQ8748Unable to get message from dead-letter queue: CompCode = <insert_1> Reason = <insert_2>.
Severity:
20 : Error

Explanation:
The dead-letter queue handler (runmqdlq) could not get the next message from the dead-letter queue. This message is typically issued because of the queue manager ending, a resource problem, or another process having deleted the dead-letter queue. The completion code and the reason can be used to identify the error. The dead-letter queue handler ends.

Response:
Take appropriate action based upon the completion code and reason.

AMQ8748 (iSeries)Unable to get message from dead-letter queue: CompCode = <insert_1> Reason = <insert_2>.
Severity:
20 : Error

Explanation:
The dead-letter queue handler (STRMQMDLQ) could not get the next message from the dead-letter queue. This message is typically issued because of the queue manager ending, a resource problem, or another process having deleted the dead-letter queue. The completion code and the reason can be used to identify the error. The dead-letter queue handler ends.

Response:
Take appropriate action based upon the completion code and reason.

AMQ8749Unable to commit/backout action on dead-letter queue: CompCode = <insert_1> Reason = <insert_2>.
Severity:
20 : Error

Explanation:
The dead-letter queue handler (runmqdlq) was unable to commit or backout an update to the dead-letter queue. This message is typically issued because of the queue manager ending, or because of a resource shortage. If the queue manager has ended, the update to the dead-letter queue (and any associated updates) will be backed out when the queue manager restarts. If the problem was due to a resource problem then the updates will be backed out when the dead-letter queue handler terminates. The completion code and the reason can be used to identify the error. The dead-letter queue handler ends.

Response:
Take appropriate action based upon the completion code and reason.

AMQ8749 (iSeries)Unable to commit/backout action on dead-letter queue: CompCode = <insert_1> Reason = <insert_2>.
Severity:
20 : Error

Explanation:
The dead-letter queue handler (STRMQMDLQ) was unable to commit or backout an update to the dead-letter queue. This message is typically issued because of the queue manager ending, or because of a resource shortage. If the queue manager has ended, the update to the dead-letter queue (and any associated updates) will be backed out when the queue manager restarts. If the problem was due to a resource problem then the updates will be backed out when the dead-letter queue handler terminates. The completion code and the reason can be used to identify the error. The dead-letter queue handler ends.

Response:
Take appropriate action based upon the completion code and reason.

AMQ8750No valid input provided to runmqdlq.
Severity:
20 : Error

Explanation:
Either no input was provided to runmqdlq, or the input to runmqdlq contained no valid message templates. If input was provided to runmqdlq but was found to be invalid, earlier messages will have been produced explaining the cause of the error. The dead-letter queue handler will ends.

Response:
Correct the input data and restart the dead-letter queue handler.

AMQ8750 (iSeries)No valid input provided to STRMQMDLQ.
Severity:
20 : Error

Explanation:
Either no input was provided to STRMQMDLQ, or the input to STRMQMDLQ contained no valid message templates. If input was provided to STRMQMDLQ but was found to be invalid, earlier messages will have been produced explaining the cause of the error. The dead-letter queue handler will ends.

Response:
Correct the input data and restart the dead-letter queue handler.

AMQ8751Unable to obtain private storage.
Severity:
20 : Error

Explanation:
The dead-letter queue handler (runmqdlq) was unable to obtain private storage. This problem would typically arise as a result of some more global problem. For example if there is a persistent problem that is causing messages to be written to the DLQ and the same problem (for example queue full) is preventing the dead-letter queue handler from taking the requested action with the message, it is necessary for the dead-letter queue handler to maintain a large amount of state data to remember the retry counts associated with each message, or if the dead-letter queue contains a large number of messages and the rules table has directed the dead-letter queue handler to ignore the messages.

Response:
Investigate if some more global problem exists, and if the dead-letter queue contains a large number of messages. If the problem persists contact your support center.

AMQ8751 (iSeries)Unable to obtain private storage.
Severity:
20 : Error

Explanation:
The dead-letter queue handler (STRMQMDLQ) was unable to obtain private storage. This problem would typically arise as a result of some more global problem. For example if there is a persistent problem that is causing messages to be written to the DLQ and the same problem (for example queue full) is preventing the dead-letter queue handler from taking the requested action with the message, it is necessary for the dead-letter queue handler to maintain a large amount of state data to remember the retry counts associated with each message, or if the dead-letter queue contains a large number of messages and the rules table has directed the dead-letter queue handler to ignore the messages.

Response:
Investigate if some more global problem exists, and if the dead-letter queue contains a large number of messages. If the problem persists contact your support center.

AMQ8752Parameter(<insert_3>) exceeds maximum length on line <insert_1>.
Severity:
20 : Error

Explanation:
A parameter supplied as input to the dead-letter handler exceeded the maximum length for parameters of that type.

Response:
Correct the input data and restart the dead-letter queue handler.

AMQ8753Duplicate parameter(<insert_3>) found on line <insert_1>.
Severity:
20 : Error

Explanation:
Two or more parameters of the same type were supplied on a single input line to the dead-letter queue handler.

Response:
Correct the input and restart the dead-letter queue handler.

AMQ8756Error detected releasing private storage.
Severity:
20 : Error

Explanation:
The dead-letter queue handler (runmqdlq) was informed of an error while attempting to release an area of private storage. The dead-letter queue handler ends.

Response:
This message should be preceded by a message or FFST information from the internal routine that detected the error. Take the action associated with the earlier error information.

AMQ8756 (iSeries)Error detected releasing private storage.
Severity:
20 : Error

Explanation:
The dead-letter queue handler (STRMQMDLQ) was informed of an error while attempting to release an area of private storage. The dead-letter queue handler ends.

Response:
This message should be preceded by a message or FFST information from the internal routine that detected the error. Take the action associated with the earlier error information.

AMQ8757Integer parameter(<insert_3>) outside permissible range on line <insert_1>.
Severity:
20 : Error

Explanation:
An integer supplied as input to the dead-letter handler was outside of the valid range of integers supported by the dead-letter queue handler.

Response:
Correct the input data and restart the dead-letter queue handler.

AMQ8758<insert_1> errors detected in input to runmqdlq.
Severity:
20 : Error

Explanation:
One or more errors have been detected in the input to the dead-letter queue handler(runmqdlq). Error messages will have been generated for each of these errors. The dead-letter queue handler ends.

Response:
Correct the input data and restart the dead-letter queue handler.

AMQ8758 (iSeries)<insert_1> errors detected in input to STRMQMDLQ.
Severity:
20 : Error

Explanation:
One or more errors have been detected in the input to the dead-letter queue handler(STRMQMDLQ). Error messages will have been generated for each of these errors. The dead-letter queue handler ends.

Response:
Correct the input data and restart the dead-letter queue handler.

AMQ8759Invalid combination of parameters to dead-letter queue handler on line <insert_1>.
Severity:
20 : Error

Explanation:
An invalid combination of input parameters has been supplied to the dead-letter queue handler. Possible causes are: no ACTION specified, ACTION(FWD) but no FWDQ specified, HEADER(YES|NO) specified without ACTION(FWD).

Response:
Correct the input data and restart the dead-letter queue handler.

AMQ8760Unexpected failure while initializing process: Reason = <insert_1>.
Severity:
30 : Severe error

Explanation:
The dead-letter queue handler (runmqdlq) could not perform basic initialization required to use MQ services because of an unforeseen error. The dead-letter queue handler ends.

Response:
Use the standard facilities supplied with your system to record the problem identifier and to save the generated output files. Contact your support center. Do not discard these files until the problem has been resolved.

AMQ8760 (iSeries)Unexpected failure while initializing process: Reason = <insert_1>.
Severity:
30 : Severe error

Explanation:
The dead-letter queue handler (STRMQMDLQ) could not perform basic initialization required to use MQ services because of an unforeseen error. The dead-letter queue handler ends.

Response:
Use the standard facilities supplied with your system to record the problem identifier and to save the generated output files. Contact your support center. Do not discard these files until the problem has been resolved.

AMQ8761Unexpected failure while connecting to queue manager: CompCode = <insert_1> Reason = <insert_2>.
Severity:
30 : Severe error

Explanation:
The dead-letter queue handler (runmqdlq) could not connect to the requested queue manager because of an unforeseen error. The dead-letter queue handler ends.

Response:
Use the standard facilities supplied with your system to record the problem identifier and to save the generated output files. Contact your support center. Do not discard these files until the problem has been resolved.

AMQ8761 (iSeries)Unexpected failure while connecting to queue manager: CompCode = <insert_1> Reason = <insert_2>.
Severity:
30 : Severe error

Explanation:
The dead-letter queue handler (STRMQMDLQ) could not connect to the requested queue manager because of an unforeseen error. The dead-letter queue handler ends.

Response:
Use the standard facilities supplied with your system to record the problem identifier and to save the generated output files. Contact your support center. Do not discard these files until the problem has been resolved.

AMQ8762Unexpected error while attempting to open queue manager: CompCode = <insert_1> Reason = <insert_2>.
Severity:
30 : Severe error

Explanation:
The dead-letter queue handler (runmqdlq) could not open the queue manager because of an unforeseen error. The completion code and the reason can be used to identify the error. The dead-letter queue handler ends.

Response:
Use the standard facilities supplied with your system to record the problem identifier and to save the generated output files. Contact your support center. Do not discard these files until the problem has been resolved.

AMQ8762 (iSeries)Unexpected error while attempting to open queue manager: CompCode = <insert_1> Reason = <insert_2>.
Severity:
30 : Severe error

Explanation:
The dead-letter queue handler (STRMQMDLQ) could not open the queue manager because of an unforeseen error. The completion code and the reason can be used to identify the error. The dead-letter queue handler ends.

Response:
Use the standard facilities supplied with your system to record the problem identifier and to save the generated output files. Contact your support center. Do not discard these files until the problem has been resolved.

AMQ8763Unexpected error while inquiring on queue manager: CompCode = <insert_1> Reason = <insert_2>.
Severity:
30 : Severe error

Explanation:
The dead letter queue handler (runmqdlq) could not inquire on the queue manager because of an unforeseen error. The completion code and the reason can be used to identify the error. The dead letter queue handler ends.

Response:
Use the standard facilities supplied with your system to record the problem identifier and to save the generated output files. Contact your support center. Do not discard these files until the problem has been resolved.

AMQ8763 (iSeries)Unexpected error while inquiring on queue manager: CompCode = <insert_1> Reason = <insert_2>.
Severity:
30 : Severe error

Explanation:
The dead letter queue handler (STRMQMDLQ) could not inquire on the queue manager because of an unforeseen error. The completion code and the reason can be used to identify the error. The dead letter queue handler ends.

Response:
Use the standard facilities supplied with your system to record the problem identifier and to save the generated output files. Contact your support center. Do not discard these files until the problem has been resolved.

AMQ8764Unexpected error while attempting to close queue manager: CompCode = <insert_1> Reason = <insert_2>.
Severity:
30 : Severe error

Explanation:
The dead-letter queue handler (runmqdlq) could not close the queue manager because of an unforeseen error. The completion code and the reason can be used to identify the error. The dead-letter queue handler ends.

Response:
Use the standard facilities supplied with your system to record the problem identifier and to save the generated output files. Contact your support center. Do not discard these files until the problem has been resolved.

AMQ8764 (iSeries)Unexpected error while attempting to close queue manager: CompCode = <insert_1> Reason = <insert_2>.
Severity:
30 : Severe error

Explanation:
The dead-letter queue handler (STRMQMDLQ) could not close the queue manager because of an unforeseen error. The completion code and the reason can be used to identify the error. The dead-letter queue handler ends.

Response:
Use the standard facilities supplied with your system to record the problem identifier and to save the generated output files. Contact your support center. Do not discard these files until the problem has been resolved.

AMQ8765Unexpected failure while opening dead-letter queue for browse: CompCode = <insert_1> Reason = <insert_2>.
Severity:
30 : Severe error

Explanation:
The dead-letter queue handler (runmqdlq) could not open the dead-letter queue for browsing because of an unforeseen error. The completion code and the reason can be used to identify the error. The dead-letter queue handler ends.

Response:
Use the standard facilities supplied with your system to record the problem identifier and to save the generated output files. Contact your support center. Do not discard these files until the problem has been resolved.

AMQ8765 (iSeries)Unexpected failure while opening dead-letter queue for browse: CompCode = <insert_1> Reason = <insert_2>.
Severity:
30 : Severe error

Explanation:
The dead-letter queue handler (STRMQMDLQ) could not open the dead-letter queue for browsing because of an unforeseen error. The completion code and the reason can be used to identify the error. The dead-letter queue handler ends.

Response:
Use the standard facilities supplied with your system to record the problem identifier and to save the generated output files. Contact your support center. Do not discard these files until the problem has been resolved.

AMQ8766Unexpected error while closing dead-letter queue: CompCode = <insert_1> Reason = <insert_2>.
Severity:
30 : Severe error

Explanation:
The dead-letter queue handler (runmqdlq) could not close the dead-letter queue because of an unforeseen error. The completion code and the reason can be used to identify the error. The dead-letter queue handler ends.

Response:
Use the standard facilities supplied with your system to record the problem identifier and to save the generated output files. Contact your support center. Do not discard these files until the problem has been resolved.

AMQ8766 (iSeries)Unexpected error while closing dead-letter queue: CompCode = <insert_1> Reason = <insert_2>.
Severity:
30 : Severe error

Explanation:
The dead-letter queue handler (STRMQMDLQ) could not close the dead-letter queue because of an unforeseen error. The completion code and the reason can be used to identify the error. The dead-letter queue handler ends.

Response:
Use the standard facilities supplied with your system to record the problem identifier and to save the generated output files. Contact your support center. Do not discard these files until the problem has been resolved.

AMQ8767Unexpected error while getting message from dead-letter queue: CompCode = <insert_1> Reason = <insert_2>.
Severity:
30 : Severe error

Explanation:
The dead-letter queue handler (runmqdlq) could not get the next message from the dead-letter queue because of an unforeseen error. The completion code and the reason can be used to identify the error. The dead-letter queue handler ends.

Response:
Use the standard facilities supplied with your system to record the problem identifier and to save the generated output files. Contact your support center. Do not discard these files until the problem has been resolved.

AMQ8767 (iSeries)Unexpected error while getting message from dead-letter queue: CompCode = <insert_1> Reason = <insert_2>.
Severity:
30 : Severe error

Explanation:
The dead-letter queue handler (STRMQMDLQ) could not get the next message from the dead-letter queue because of an unforeseen error. The completion code and the reason can be used to identify the error. The dead-letter queue handler ends.

Response:
Use the standard facilities supplied with your system to record the problem identifier and to save the generated output files. Contact your support center. Do not discard these files until the problem has been resolved.

AMQ8768Unexpected error committing/backing out action on dead-letter queue: CompCode = <insert_1> Reason = <insert_2>.
Severity:
30 : Severe error

Explanation:
The dead-letter queue handler (runmqdlq) was unable to either commit or backout an update to the dead-letter queue because of an unforeseen error. The completion code and the reason can be used to identify the error. The dead-letter queue handler ends.

Response:
Use the standard facilities supplied with your system to record the problem identifier and to save the generated output files. Contact your support center. Do not discard these files until the problem has been resolved.

AMQ8768 (iSeries)Unexpected error committing/backing out action on dead-letter queue: CompCode = <insert_1> Reason = <insert_2>.
Severity:
30 : Severe error

Explanation:
The dead-letter queue handler (STRMQMDLQ) was unable to either commit or backout an update to the dead-letter queue because of an unforeseen error. The completion code and the reason can be used to identify the error. The dead-letter queue handler ends.

Response:
Use the standard facilities supplied with your system to record the problem identifier and to save the generated output files. Contact your support center. Do not discard these files until the problem has been resolved.

AMQ8769Unable to disconnect from queue manager: CompCode = <insert_1> Reason = <insert_2>.
Severity:
30 : Severe error

Explanation:
The dead-letter queue handler (runmqdlq) was unable to disconnect from the queue manager because of an unexpected error. The completion code and the reason can be used to identify the error. The dead-letter queue handler ends.

Response:
Use the standard facilities supplied with your system to record the problem identifier and to save the generated output files. Contact your support center. Do not discard these files until the problem has been resolved.

AMQ8769 (iSeries)Unable to disconnect from queue manager: CompCode = <insert_1> Reason = <insert_2>.
Severity:
30 : Severe error

Explanation:
The dead-letter queue handler (STRMQMDLQ) was unable to disconnect from the queue manager because of an unexpected error. The completion code and the reason can be used to identify the error. The dead-letter queue handler ends.

Response:
Use the standard facilities supplied with your system to record the problem identifier and to save the generated output files. Contact your support center. Do not discard these files until the problem has been resolved.

AMQ8770 (iSeries)Cannot open <insert_5> for command <insert_3>.
Severity:
40 : Stop Error

Explanation:
The <insert_3> command failed to open <insert_5> for WebSphere MQ processing.

Response:
Check that the intended file or member exists, and was specified correctly. Correct the specification or create the object and try the operation again.

AMQ8822Invalid response, please re-enter (y or n):
Severity:
0 : Information

Response:
None.

AMQ8919There are no matching WebSphere MQ queue manager names.
Severity:
30 : Severe error

AMQ8934 (iSeries)Message . . . . :
Severity:
10 : Warning

AMQ8935 (iSeries)Cause . . . . . :
Severity:
10 : Warning

AMQ8936 (iSeries)Recovery . . . :
Severity:
10 : Warning

AMQ8937 (iSeries)Technical Description . . . . . . . . :
Severity:
10 : Warning

AMQ8A01 (iSeries)Create Message Queue Manager
AMQ8A02 (iSeries)Delete Message Queue Manager
AMQ8A04 (iSeries)Work with MQ Messages
AMQ8A05 (iSeries)Change Message Queue Manager
AMQ8A06 (iSeries)Display Message Queue Manager
AMQ8A07 (iSeries)End Message Queue Manager
AMQ8A08 (iSeries)Start Message Queue Manager
AMQ8A09 (iSeries)Change MQ Queue
AMQ8A0A (iSeries)Clear MQ Queue
AMQ8A0B (iSeries)Copy MQ Queue
AMQ8A0C (iSeries)Create MQ Queue
AMQ8A0D (iSeries)Delete MQ Queue
AMQ8A0E (iSeries)Display MQ Queue
AMQ8A0F (iSeries)Work with MQ Queues
AMQ8A10 (iSeries)Change MQ Process
AMQ8A11 (iSeries)Copy MQ Process
AMQ8A12 (iSeries)Create MQ Process
AMQ8A13 (iSeries)Delete MQ Process
AMQ8A14 (iSeries)Display MQ Process
AMQ8A15 (iSeries)Work with MQ Processes
AMQ8A16 (iSeries)Start MQ Command Server
AMQ8A17 (iSeries)End MQ Command Server
AMQ8A18 (iSeries)Display MQ Command Server
AMQ8A19 (iSeries)Set MQ
AMQ8A20 (iSeries)Quiesce Message Queue Managers
AMQ8A21 (iSeries)Quiesce Retry Delay
AMQ8A23 (iSeries)Work with Queue Status
AMQ8A30 (iSeries)Create MQ Channel
AMQ8A31 (iSeries)Display MQ Channel
AMQ8A32 (iSeries)Start MQ Listener
AMQ8A33 (iSeries)Ping MQ Channel
AMQ8A34 (iSeries)Delete MQ Channel
AMQ8A36 (iSeries)Work with MQ Channels
AMQ8A37 (iSeries)Change MQ Channel
AMQ8A38 (iSeries)Copy MQ Channel
AMQ8A39 (iSeries)Reset MQ Channel
AMQ8A40 (iSeries)End MQ Channel
AMQ8A41 (iSeries)Start MQ Channel
AMQ8A42 (iSeries)Start MQ Channel Initiator
AMQ8A43 (iSeries)Grant MQ Object Authority
AMQ8A44 (iSeries)Revoke MQ Object Authority
AMQ8A45 (iSeries)Display MQ Object Authority
AMQ8A46 (iSeries)Display MQ Object Names
AMQ8A47 (iSeries)Refresh WebSphere MQ Authority
AMQ8A48 (iSeries)Work with MQ Authority
AMQ8A49 (iSeries)Start MQ Service
AMQ8A50 (iSeries)End MQ Service
AMQ8A51 (iSeries)Connect MQ
AMQ8A52 (iSeries)Disconnect MQ
AMQ8A53 (iSeries)Work with MQ Authority Data
AMQ8A54 (iSeries)Resolve MQ Channel
AMQ8A55 (iSeries)Work with MQ Channel Status
AMQ8A56 (iSeries)SSL Client Authentication
AMQ8A57 (iSeries)SSL CipherSpec
AMQ8A58 (iSeries)SSL Peer name
AMQ8A59 (iSeries)Local communication address
AMQ8A5A (iSeries)Batch Heartbeat Interval
AMQ8A5B (iSeries)Remove Queues
AMQ8A5C (iSeries)Refresh Repository
AMQ8A5D (iSeries)IP Address
AMQ8A60 (iSeries)Cluster Name
AMQ8A61 (iSeries)Cluster Name List
AMQ8A62 (iSeries)Mode Name
AMQ8A63 (iSeries)Password
AMQ8A64 (iSeries)Transaction Program Name
AMQ8A65 (iSeries)User Profile
AMQ8A66 (iSeries)Network Connection Priority
AMQ8A67 (iSeries)Batch Interval
AMQ8A68 (iSeries)Batch Interval
AMQ8A69 (iSeries)Cluster Workload Exit Data
AMQ8A6A (iSeries)Cluster Workload Exit
AMQ8A6B (iSeries)Repository Cluster
AMQ8A6C (iSeries)Repository Cluster Namelist
AMQ8A6D (iSeries)Cluster Workload Exit Data Length
AMQ8A6E (iSeries)Maximum Message Length
AMQ8A6F (iSeries)Default Queue Manager
AMQ8A70 (iSeries)Default Binding
AMQ8A71 (iSeries)Channel Table
AMQ8A72 (iSeries)Change MQ Namelist
AMQ8A73 (iSeries)List of Names
AMQ8A74 (iSeries)Namelist
AMQ8A75 (iSeries)Create MQ Namelist
AMQ8A76 (iSeries)Recreate MQ Object
AMQ8A77 (iSeries)Record MQ Object Image
AMQ8A78 (iSeries)Start WebSphere MQ Commands
AMQ8A7A (iSeries)Copy MQ Namelist
AMQ8A7B (iSeries)From Namelist
AMQ8A7C (iSeries)To Namelist
AMQ8A7D (iSeries)Delete MQ Namelist
AMQ8A7E (iSeries)Display MQ Namelist
AMQ8A7F (iSeries)Work with MQ Namelist
AMQ8A80 (iSeries)Group Profile
AMQ8A81 (iSeries)User Profile
AMQ8A82 (iSeries)Service Component
AMQ8A83 (iSeries)Work with MQ Queue Manager
AMQ8A84 (iSeries)Work with MQ Clusters
AMQ8A85 (iSeries)Start MQ Trigger Monitor
AMQ8A86 (iSeries)End MQ Listeners
AMQ8A87 (iSeries)Work with MQ Transactions
AMQ8A88 (iSeries)Resolve MQ Transaction
AMQ8A89 (iSeries)Work with MQ Cluster Queues
AMQ8A8A (iSeries)Display Journal Receiver Data
AMQ8A8B (iSeries)Start MQ Pub/Sub Broker
AMQ8A8C (iSeries)End MQ Pub/Sub Broker
AMQ8A8D (iSeries)Display MQ Pub/Sub Broker
AMQ8A8E (iSeries)Clear MQ Pub/Sub Broker
AMQ8A8F (iSeries)Delete MQ Pub/Sub Broker
AMQ8B01 (iSeries)Message Queue Manager name
AMQ8B02 (iSeries)Text 'description'
AMQ8B03 (iSeries)Trigger interval
AMQ8B04 (iSeries)Undelivered message queue
AMQ8B05 (iSeries)Default transmission queue
AMQ8B06 (iSeries)Maximum handle limit
AMQ8B07 (iSeries)Maximum uncommitted messages
AMQ8B08 (iSeries)Queue name
AMQ8B09 (iSeries)Output
AMQ8B0A (iSeries)Library
AMQ8B0B (iSeries)File to receive output
AMQ8B0C (iSeries)OPTION(*MVS) not valid without specifying a value for WAIT.
Severity:
40 : Stop Error

Explanation:
The OPTION(*MVS) parameter may not be specified without specifying a value for the WAIT parameter.

Response:
Remove the OPTION(*MVS) parameter from the command or, specify a value for the WAIT parameter. Then try the command again.

AMQ8B0D (iSeries)Member to receive output
AMQ8B0E (iSeries)Replace or add records
AMQ8B0F (iSeries)Option
AMQ8B10 (iSeries)Mode
AMQ8B11 (iSeries)Put enabled
AMQ8B12 (iSeries)Default message priority
AMQ8B13 (iSeries)Default message persistence
AMQ8B14 (iSeries)Process name
AMQ8B15 (iSeries)Triggering enabled
AMQ8B16 (iSeries)Get enabled
AMQ8B17 (iSeries)Sharing enabled
AMQ8B18 (iSeries)Default share option
AMQ8B19 (iSeries)Message delivery sequence
AMQ8B1A (iSeries)Harden backout count
AMQ8B1B (iSeries)Trigger type
AMQ8B1C (iSeries)Trigger depth
AMQ8B1D (iSeries)Trigger message priority
AMQ8B1E (iSeries)Trigger data
AMQ8B1F (iSeries)Retention interval
AMQ8B20 (iSeries)Maximum queue depth
AMQ8B21 (iSeries)Maximum message length
AMQ8B22 (iSeries)Backout threshold
AMQ8B23 (iSeries)Backout requeue name
AMQ8B24 (iSeries)Initiation queue
AMQ8B25 (iSeries)Usage
AMQ8B26 (iSeries)Definition type
AMQ8B27 (iSeries)Target queue
AMQ8B28 (iSeries)Remote queue
AMQ8B29 (iSeries)Remote Message Queue Manager
AMQ8B2A (iSeries)Transmission queue
AMQ8B2B (iSeries)From queue name
AMQ8B2C (iSeries)To queue name
AMQ8B2D (iSeries)Replace
AMQ8B2E (iSeries)Queue type
AMQ8B2F (iSeries)Application type
AMQ8B30 (iSeries)Application identifier
AMQ8B31 (iSeries)User data
AMQ8B32 (iSeries)Environment data
AMQ8B33 (iSeries)From process
AMQ8B34 (iSeries)To process
AMQ8B36 (iSeries)Job name
AMQ8B37 (iSeries)Number
AMQ8B3A (iSeries)Convert message
AMQ8B3B (iSeries)Replace to member
AMQ8B3C (iSeries)Heartbeat interval
AMQ8B3D (iSeries)Non Persistent Message Speed
AMQ8B3E (iSeries)Force
AMQ8B3F (iSeries)No Jobs to display
AMQ8B41 (iSeries)Queue definition scope
AMQ8B42 (iSeries)Queue depth high threshold
AMQ8B43 (iSeries)Queue depth low threshold
AMQ8B44 (iSeries)Queue full events enabled
AMQ8B45 (iSeries)Queue high events enabled
AMQ8B46 (iSeries)Queue low events enabled
AMQ8B47 (iSeries)Service interval
AMQ8B48 (iSeries)Service interval events
AMQ8B49 (iSeries)Distribution list support
AMQ8B4A (iSeries)Parent Message Queue Manager
AMQ8B4B (iSeries)Break Parent link
AMQ8B4C (iSeries)Child Message Queue Manager
AMQ8B53 (iSeries)Authorization events enabled
AMQ8B54 (iSeries)Inhibit events enabled
AMQ8B55 (iSeries)Local error events enabled
AMQ8B56 (iSeries)Remote error events enabled
AMQ8B57 (iSeries)Performance events enabled
AMQ8B58 (iSeries)Start and stop events enabled
AMQ8B59 (iSeries)Automatic Channel Definition
AMQ8B5A (iSeries)Auto Chan. Def. events enabled
AMQ8B5B (iSeries)Auto Chan. Def. exit program
AMQ8B5C (iSeries)Redefine system objects
AMQ8B5D (iSeries)Wait time
AMQ8B5E (iSeries)Startup Status Detail
AMQ8B60 (iSeries)Transaction type
AMQ8B61 (iSeries)Log recovery events enabled
AMQ8B62 (iSeries)IP protocol
AMQ8B63 (iSeries)Configuration events enabled
AMQ8B64 (iSeries)Refresh Message Queue Manager
AMQ8B65 (iSeries)Refresh Type
AMQ8B66 (iSeries)Include Interval
AMQ8B67 (iSeries)WebSphere MQ queue manager refreshed.
AMQ8B68 (iSeries)Channel events enabled
AMQ8B69 (iSeries)SSL events enabled
AMQ8B6A (iSeries)Filter command
AMQ8B6B (iSeries)Filter keyword
AMQ8B6C (iSeries)Filter operator
AMQ8B6D (iSeries)Filter value
AMQ8B6E (iSeries)Filter value <insert_5> not valid with keyword <insert_4>.
Severity:
30 : Severe error

Explanation:
The filter value <insert_5> is not valid with the keyword <insert_4>.

Response:
Specify a valid filter value for the keyword <insert_4>.

AMQ8B70 (iSeries)Change MQ AuthInfo object
AMQ8B71 (iSeries)Copy MQ AuthInfo object
AMQ8B72 (iSeries)Create MQ AuthInfo object
AMQ8B73 (iSeries)Delete MQ AuthInfo object
AMQ8B74 (iSeries)Display MQ AuthInfo object
AMQ8B75 (iSeries)From AuthInfo name
AMQ8B76 (iSeries)AuthInfo name
AMQ8B77 (iSeries)AuthInfo type
AMQ8B78 (iSeries)User name
AMQ8B79 (iSeries)User password
AMQ8B7A (iSeries)Work with AuthInfo objects
AMQ8B7B (iSeries)To AuthInfo name
AMQ8B80 (iSeries)Change MQ Processor Allowance
AMQ8B81 (iSeries)Display MQ Processor Allowance
AMQ8B82 (iSeries)Sufficient Licence Units
AMQ8C01 (iSeries)From channel
AMQ8C02 (iSeries)Channel name
AMQ8C03 (iSeries)Channel type
AMQ8C04 (iSeries)SSL key reset count
AMQ8C05 (iSeries)Remote queue manager
AMQ8C07 (iSeries)Transmission queue
AMQ8C08 (iSeries)Connection name
AMQ8C09 (iSeries)Message channel agent
AMQ8C10 (iSeries)Message channel agent user ID
AMQ8C12 (iSeries)Batch size
AMQ8C13 (iSeries)Disconnect interval
AMQ8C14 (iSeries)Short retry count
AMQ8C15 (iSeries)Short retry interval
AMQ8C16 (iSeries)Long retry count
AMQ8C17 (iSeries)Long retry interval
AMQ8C18 (iSeries)Security exit
AMQ8C19 (iSeries)Message exit
AMQ8C20 (iSeries)Send exit
AMQ8C21 (iSeries)Receive exit
AMQ8C22 (iSeries)SSL CRL Namelist
AMQ8C23 (iSeries)SSL Key Repository
AMQ8C24 (iSeries)Put authority
AMQ8C25 (iSeries)Sequence number wrap
AMQ8C27 (iSeries)Transport type
AMQ8C28 (iSeries)Data count
AMQ8C29 (iSeries)Count
AMQ8C30 (iSeries)To channel
AMQ8C31 (iSeries)Message sequence number
AMQ8C32 (iSeries)SSL Cryptographic Hardware
AMQ8C33 (iSeries)Security exit user data
AMQ8C34 (iSeries)Send exit user data
AMQ8C35 (iSeries)Receive exit user data
AMQ8C36 (iSeries)Message exit user data
AMQ8C37 (iSeries)Resolve option
AMQ8C38 (iSeries)Connection name
AMQ8C39 (iSeries)Transmission queue name
AMQ8C40 (iSeries)SSL Repository Password
AMQ8C41 (iSeries)First Message
AMQ8C42 (iSeries)Maximum number of messages
AMQ8C43 (iSeries)Maximum message size
AMQ8C44 (iSeries)Message retry exit
AMQ8C45 (iSeries)Message retry exit data
AMQ8C46 (iSeries)Number of message retries
AMQ8C47 (iSeries)Message retry interval
AMQ8C48 (iSeries)Coded Character Set
AMQ8C49 (iSeries)Max message length
AMQ8C50 (iSeries)Repository name
AMQ8C51 (iSeries)Repository name list
AMQ8C52 (iSeries)Cluster workload exit length
AMQ8C53 (iSeries)Cluster workload exit
AMQ8C54 (iSeries)Cluster workload exit data
AMQ8C55 (iSeries)Suspend Cluster Queue Manager
AMQ8C56 (iSeries)Reset Cluster
AMQ8C57 (iSeries)Refresh MQ Cluster
AMQ8C58 (iSeries)Resume Cluster Queue Manager
AMQ8C59 (iSeries)Action
AMQ8C5A (iSeries)Queue Manager Name for removal
AMQ8C5B (iSeries)Work with MQ Listeners
AMQ8C5C (iSeries)Queue Manager Id for removal
AMQ8C60 (iSeries)Display Cluster Message Queue Manager
AMQ8C61 (iSeries)Cluster Queue Manager name
AMQ8C62 (iSeries)End MQ Listeners
AMQ8C63 (iSeries)Port number
AMQ8C64 (iSeries)Message channel agent Type
AMQ8C65 (iSeries)Task user identifier
AMQ8D01 (iSeries)Trace MQ
AMQ8D02 (iSeries)Trace option setting
AMQ8D03 (iSeries)Trace level
AMQ8D04 (iSeries)Trace types
AMQ8D05 (iSeries)Maximum storage to use
AMQ8D06 (iSeries)Trace early
AMQ8D07 (iSeries)Exclude types
AMQ8D0A (iSeries)Output member options
AMQ8D10 (iSeries)Object name
AMQ8D11 (iSeries)Object type
AMQ8D12 (iSeries)User names
AMQ8D13 (iSeries)Authority
AMQ8D14 (iSeries)Authorization list
AMQ8D15 (iSeries)Reference object name
AMQ8D16 (iSeries)Reference object type
AMQ8D17 (iSeries)Object name
AMQ8D18 (iSeries)Process name
AMQ8D19 (iSeries)Queue name
AMQ8D1A (iSeries)Queue Manager Library
AMQ8D1B (iSeries)ASP Number
AMQ8D1C (iSeries)Journal receiver threshold
AMQ8D20 (iSeries)Channel name
AMQ8D22 (iSeries)Cluster name
AMQ8D23 (iSeries)Cluster namelist name
AMQ8D24 (iSeries)User name
AMQ8D25 (iSeries)Channel status
AMQ8D26 (iSeries)End connected jobs
AMQ8D27 (iSeries)Timeout interval (seconds)
AMQ8D28 (iSeries)Object/Profile name
AMQ8D29 (iSeries)Service Component name
AMQ8D30 (iSeries)Keep Alive Interval


9000-9999 - Remote
See Reading a message for an explanation of how to interpret these messages.

AMQ9001 Channel <insert_3> ended normally.
Severity:
0 : Information

Explanation:
Channel <insert_3> ended normally.

Response:
None.

AMQ9002 Channel <insert_3> is starting.
Severity:
0 : Information

Explanation:
Channel <insert_3> is starting.

Response:
None.

AMQ9003 (iSeries)Channel <insert_3> last message sequence number is <insert_1>.
Severity:
0 : Information

Explanation:
Channel <insert_3> last message sequence number is <insert_1>.

Response:
None.

AMQ9004 (iSeries)Channel <insert_3> status information.
Severity:
0 : Information

Explanation:
Channel <insert_3> status information: Number of Messages in Doubt - <insert_1> In Doubt Sequence Number - <insert_2> In Doubt Logic Unit of Work ID - <insert_4>

Response:
None.

AMQ9181 The response set by the exit is not valid.
Severity:
30 : Severe error

Explanation:
The user exit <insert_3> returned a response code <insert_1> that is not valid in the ExitResponse field of the channel exit parameters (MQCXP). Message AMQ9190 is issued giving more details, and the channel stops.

Response:
Investigate why the user exit program set a response code that is not valid.

AMQ9182 The secondary response set by the exit is not valid.
Severity:
30 : Severe error

Explanation:
The user exit <insert_3> returned a secondary response code <insert_1> in the ExitResponse2 field of the channel exit parameters (MQCXP) that is not valid. Message AMQ9190 is issued giving more details, and the channel stops.

Response:
Investigate why the user exit program set a secondary response code that is not valid.

AMQ9184 The exit buffer address set by the exit is not valid.
Severity:
30 : Severe error

Explanation:
The user exit <insert_3> returned an address <insert_1> for the exit buffer that is not valid, when the secondary response code in the ExitResponse2 field of the channel exit parameters (MQCXP) is set to MQXR2_USE_EXIT_BUFFER. Message AMQ9190 is issued giving more details, and the channel stops.

Response:
Investigate why the user exit program set an exit buffer address that is not valid. The most likely cause is the failure to set a value, so that the value is 0.

AMQ9185 The exit space set by the exit is not valid.
Severity:
30 : Severe error

Explanation:
The user exit <insert_3> returned an exit space value <insert_1> that is not valid in the ExitSpace field of the channel exit parameters (MQCXP). Message AMQ9190 is issued giving more details, and the channel stops.

Response:
Investigate why the user exit program set an exit space value that is not valid. Correct the error.

AMQ9186 Too much exit space reserved by send exits.
Severity:
30 : Severe error

Explanation:
At exit initialization the send exits in the send exit chain for channel <insert_3> returned values in the ExitSpace field of the channel exit parameters (MQCXP). The total of these ExitSpace values is <insert_1>. The maximum number of bytes that can be sent in a single transmission is <insert_2>. Room must be left for at least 1024 bytes of message data in each transmission. So too much exit space has been reserved by the send exits. The channel stops.

Response:
Investigate why the send exit programs set exit space values that are too large. Correct the error.

AMQ9187 The header compression value set by the exit is not valid.
Severity:
30 : Severe error

Explanation:
The user exit <insert_3> returned a header compression value <insert_1> in the CurHdrCompression field of the channel exit parameters (MQCXP) that was not one of the negotiated supported values specified in the HdrCompList field of the channel description (MQCD). Message AMQ9190 is issued giving more details, and the channel stops.

Response:
Investigate why the user exit program specified a header compression value that was not one of the negotiated supported values.

AMQ9188 The message compression value set by the exit is not valid.
Severity:
30 : Severe error

Explanation:
The user exit <insert_3> returned a message compression value <insert_1> in the CurMsgCompression field of the channel exit parameters (MQCXP) that was not one of the negotiated supported values specified in the MsgCompList field of the channel description (MQCD). Message AMQ9190 is issued giving more details, and the channel stops.

Response:
Investigate why the user exit program specified a message compression value that was not one of the negotiated supported values.

AMQ9189 The data length set by the exit is not valid.
Severity:
30 : Severe error

Explanation:
The user exit <insert_3> returned a data length value <insert_1> that was not greater than zero. Message AMQ9190 is issued giving more details, and the channel stops.

Response:
Investigate why the user exit program set a data length that is not valid.

AMQ9190 Channel stopping because of an error in the exit.
Severity:
30 : Severe error

Explanation:
The user exit <insert_3>, invoked for channel <insert_4> with id <insert_1> and reason <insert_2>, returned values that are not valid, as reported in the preceding messages. The channel stops.

Response:
Investigate why the user exit program set values that are not valid.

AMQ9195 Data length larger than maximum segment length.
Severity:
30 : Severe error

Explanation:
The data length <insert_1> set by send exit <insert_3> is larger than the maximum segment length (<insert_2>). The maximum segment length is the maximum number of bytes that can be sent in a single transmission minus the user exit space required by all the send exits subsequent to the current one in the send exit chain. Message AMQ9190 is issued giving more details, and the channel stops.

Response:
Investigate why the user exit program set a data length that is not valid. Correct the error.

AMQ9196 Data length is larger than the agent buffer length.
Severity:
30 : Severe error

Explanation:
The data length <insert_1> set by exit <insert_3> is larger than the agent buffer length. The user exit returned data in the supplied agent buffer, but the length specified is greater than the length of the buffer. Message AMQ9190 is issued giving more details, and the channel stops.

Response:
Investigate why the user exit program set a data length that is not valid. Correct the error.

AMQ9197 Data length is larger than the exit buffer length.
Severity:
30 : Severe error

Explanation:
The data length <insert_1> set by exit <insert_3> is larger than the exit buffer length. The user exit returned data in the supplied exit buffer, but the length specified is greater than the length of the buffer. Message AMQ9190 is issued giving more details, and the channel stops.

Response:
Investigate why the user exit program set a data length that is not valid.

AMQ9201 Allocate failed to host <insert_3>.
Severity:
30 : Severe error

Explanation:
The attempt to allocate a conversation using <insert_4> to host <insert_3> was not successful.

Response:
The error may be due to an incorrect entry in the <insert_4> parameters contained in the channel definition to host <insert_3>. Correct the error and try again. If the error persists, record the error values and contact your systems administrator. The return code from the <insert_4><insert_5> call was <insert_1> (X<insert_2>). It may be possible that the listening program at host <insert_3> is not running. If this is the case, perform the relevant operations to start the listening program for protocol <insert_4> and try again.

AMQ9202 Remote host <insert_3> not available, retry later.
Severity:
30 : Severe error

Explanation:
The attempt to allocate a conversation using <insert_4> to host <insert_3> was not successful. However the error may be a transitory one and it may be possible to successfully allocate a <insert_4> conversation later.

Response:
Try the connection again later. If the failure persists, record the error values and contact your systems administrator. The return code from <insert_4> is <insert_1> (X<insert_2>). The reason for the failure may be that this host cannot reach the destination host. It may also be possible that the listening program at host <insert_3> was not running. If this is the case, perform the relevant operations to start the <insert_4> listening program, and try again.

AMQ9203 A configuration error for <insert_4> occurred.
Severity:
30 : Severe error

Explanation:
Error in configuration for communications to host <insert_3>. Allocation of a <insert_4> conversation to host <insert_3> was not possible.

Response:
The configuration error may be one of the following: 

1.If the communications protocol is LU 6.2, it may be that one of the transmission parameters (Mode, or TP Name) is incorrect. Correct the error and try again. The mode name should be the same as the mode defined on host <insert_3>. The TP name on <insert_3> should be defined. 

2.If the communications protocol is LU 6.2, it may be that an LU 6.2 session has not been established. Contact your systems administrator. 

3.If the communications protocol is TCP/IP, it may be that the host name specified is incorrect. Correct the error and try again. 

4.If the communications protocol is TCP/IP, it may be that the host name specified cannot be resolved to a network address. The host name may not be in the nameserver. 
The return code from the <insert_4><insert_5> call was <insert_1> (X<insert_2>). 
Record the error values and tell the system administrator.

AMQ9204 Connection to host <insert_3> rejected.
Severity:
30 : Severe error

Explanation:
Connection to host <insert_3> over <insert_4> was rejected.

Response:
The remote system may not be configured to allow connections from this host. Check the <insert_4> listener program has been started on host <insert_3>. 
If the conversation uses LU 6.2, it is possible that either the User ID or Password supplied to the remote host is incorrect. 
If the conversation uses TCP/IP, it is possible that the remote host does not recognize the local host as a valid host. 
The return code from the <insert_4><insert_5> call was <insert_1> X(<insert_2>). 
Record the error values and tell the systems administrator.

AMQ9205 The host name supplied is not valid.
Severity:
30 : Severe error

Explanation:
The supplied <insert_4> host name <insert_3> could not be resolved into a network address. Either the name server does not contain the host, or the name server was not available.

Response:
Check the <insert_4> configuration on your host.

AMQ9206 Error sending data to host <insert_3>.
Severity:
30 : Severe error

Explanation:
An error occurred sending data over <insert_4> to <insert_3>. This may be due to a communications failure.

Response:
The return code from the <insert_4><insert_5> call was <insert_1> X(<insert_2>). Record these values and tell your systems administrator.

AMQ9207 The data received from host <insert_3> is not valid.
Severity:
30 : Severe error

Explanation:
Incorrect data format received from host <insert_3> over <insert_4>. It may be that an unknown host is attempting to send data. An FFST file has been generated containing the invalid data received.

Response:
Tell the systems administrator.

AMQ9208 Error on receive from host <insert_3>.
Severity:
30 : Severe error

Explanation:
An error occurred receiving data from <insert_3> over <insert_4>. This may be due to a communications failure.

Response:
The return code from the <insert_4><insert_5> call was <insert_1> (X<insert_2>). Record these values and tell the systems administrator.

AMQ9209 Connection to host <insert_3> closed.
Severity:
30 : Severe error

Explanation:
An error occurred receiving data from <insert_3> over <insert_4>. The connection to the remote host has unexpectedly terminated.

Response:
Tell the systems administrator.

AMQ9210 Remote attachment failed.
Severity:
30 : Severe error

Explanation:
There was an incoming attachment from a remote host, but the local host could not complete the bind.

Response:
The return code from the <insert_4><insert_5> call was <insert_1> (X<insert_2>). Record these values and tell the systems administrator who should check the <insert_4> configuration.

AMQ9211 Error allocating storage.
Severity:
30 : Severe error

Explanation:
The program was unable to obtain enough storage.

Response:
Stop some programs which are using storage and retry the operation. If the problem persists contact your systems administrator.

AMQ9212 A TCP/IP socket could not be allocated.
Severity:
30 : Severe error

Explanation:
A TCP/IP socket could not be created, possibly because of a storage problem.

Response:
The return code from the <insert_4><insert_5> call was <insert_1> (X<insert_2>). Try the program again. If the failure persists, record the error values and tell the systems administrator.

AMQ9213A communications error for <insert_4> occurred.
Severity:
30 : Severe error

Explanation:
An unexpected error occurred in communications.

Response:
The return code from the <insert_4><insert_5> call was <insert_1> (X<insert_2>). Record these values and tell the systems administrator.

AMQ9214 Attempt to use an unsupported communications protocol.
Severity:
30 : Severe error

Explanation:
An attempt was made to use an unsupported communications protocol type <insert_2>.

Response:
Check the channel definition file. It may be that the communications protocol entered is not a currently supported one.

AMQ9215 Communications subsystem unavailable.
Severity:
30 : Severe error

Explanation:
An attempt was made to use the communications subsystem, but it has not been started.

Response:
Start the communications subsystem, and rerun the program.

AMQ9216 Usage: <insert_3> [-m QMgrName] [-n TPName]
Severity:
20 : Error

Explanation:
Values passed to the responder channel program are not valid. The parameters that are not valid are as follows :- 
<insert_4> 
The responder channel program exits.

Response:
Correct the parameters passed to the channel program and retry the operation.

AMQ9216 (AIX)Usage: <insert_3> [-m QMgrName]
Severity:
20 : Error

Explanation:
Values passed to the responder channel program are not valid. The parameters that are not valid are as follows :- 
<insert_4> 
The responder channel program exits.

Response:
Correct the parameters passed to the channel program and retry the operation.

AMQ9216 (HP-UX)Usage: <insert_3> [-m QMgrName]
Severity:
20 : Error

Explanation:
Values passed to the responder channel program are not valid. The parameters that are not valid are as follows :- 
<insert_4> 
The responder channel program exits.

Response:
Correct the parameters passed to the channel program and retry the operation.

AMQ9217 The TCP/IP listener program could not be started.
Severity:
30 : Severe error

Explanation:
An attempt was made to start a new instance of the listener program, but the program was rejected.

Response:
The failure could be because either the subsystem has not been started (in this case you should start the subsystem), or there are too many programs waiting (in this case you should try to start the listener program later).

AMQ9218 The <insert_4> listener program could not bind to port number <insert_1>.
Severity:
30 : Severe error

Explanation:
An attempt to bind the <insert_4> socket to the listener port was unsuccessful.

Response:
The failure could be due to another program using the same port number. The return code from the <insert_3> call for port <insert_5><insert_1> was <insert_2>. Record these values and tell the systems administrator.

AMQ9219 The TCP/IP listener program could not create a new connection for the incoming conversation.
Severity:
30 : Severe error

Explanation:
An attempt was made to create a new socket because an attach request was received, but an error occurred.

Response:
The failure may be transitory, try again later. If the problem persists, record the return code <insert_1> and tell the systems administrator. It may be necessary to free some jobs, or restart the communications system.

AMQ9220 The <insert_4> communications program could not be loaded.
Severity:
30 : Severe error

Explanation:
The attempt to load the <insert_4> library or procedure <insert_3> failed with error code <insert_1>.

Response:
Either the library must be installed on the system or the environment changed to allow the program to locate it.

AMQ9221 Unsupported protocol was specified.
Severity:
30 : Severe error

Explanation:
The specified value of <insert_3> was not recognized as one of the protocols supported.

Response:
Correct the parameter and retry the operation.

AMQ9222 Cannot find the configuration file.
Severity:
10 : Warning

Explanation:
The configuration file <insert_3> cannot be found. This file contains default definitions for communication parameters. Default values will be used.

Response:
None.

AMQ9223 Enter a protocol type.
Severity:
30 : Severe error

Explanation:
The operation you are performing requires that you enter the type of protocol.

Response:
Add the protocol parameter and retry the operation.

AMQ9224 Unexpected token detected.
Severity:
30 : Severe error

Explanation:
On line <insert_1> of the INI file, keyword <insert_3> was read when a keyword was expected.

Response:
Correct the file and retry the operation.

AMQ9224 (Windows)Unexpected token detected.
Severity:
30 : Severe error

Explanation:
Keyword <insert_3> was read when a keyword was expected.

Response:
Correct the configuration data and retry the operation.

AMQ9225 File syntax error.
Severity:
30 : Severe error

Explanation:
A syntax error was detected on line <insert_1> while processing the INI file.

Response:
Correct the problem and retry the operation.

AMQ9225 (Windows)File syntax error.
Severity:
30 : Severe error

Explanation:
A syntax error was detected while processing the configuration data.

Response:
Correct the problem and retry the operation.

AMQ9226 Usage: <insert_3> [-m QMgrName] -t (TCP | LU62 | NETBIOS | SPX) [ProtocolOptions]
Severity:
10 : Warning

Explanation:
Values passed to the listener program were invalid. 
The parameter string passed to this program is as follows: 
[-m QMgrName] ( -t TCP [-p Port] | 
-t LU62 [-n TPName] | 
-t NETBIOS [-l LocalName] [-e Names] [-s Sessions] 
[-o Commands] [-a Adaptor] | 
-t SPX [-x Socket]) 
Default values will be used for parameters not supplied.

Response:
Correct the parameters passed to the listener program and retry the operation.

AMQ9226 (AIX)Usage: <insert_3> [-m QMgrName] -t TCP [ProtocolOptions]
Severity:
10 : Warning

Explanation:
Values passed to the listener program were invalid. 
The parameter string passed to this program is as follows: 
[-m QMgrName] -t TCP [-p Port] 
Default values will be used for parameters not supplied.

Response:
Correct the parameters passed to the listener program and retry the operation.

AMQ9226 (Unix)Usage: <insert_3> [-m QMgrName] -t TCP [ProtocolOptions]
Severity:
10 : Warning

Explanation:
Values passed to the listener program were invalid. 
The parameter string passed to this program is as follows: 
[-m QMgrName] -t TCP [-p Port] 
Default values will be used for parameters not supplied.

Response:
Correct the parameters passed to the listener program and retry the operation.

AMQ9227 <insert_3> local host name not provided.
Severity:
30 : Severe error

Explanation:
A name is required for the <insert_3> process to register with the network.

Response:
Add a local name to the configuration file and retry the operation.

AMQ9228 The <insert_4> responder program could not be started.
Severity:
30 : Severe error

Explanation:
An attempt was made to start an instance of the responder program, but the program was rejected.

Response:
The failure could be because either the subsystem has not been started (in this case you should start the subsystem), or there are too many programs waiting (in this case you should try to start the responder program later). The <insert_5> reason code was <insert_1>.

AMQ9229 The application has been ended.
Severity:
30 : Severe error

Explanation:
You have issued a request to end the application.

Response:
None.

AMQ9230 An unexpected <insert_4> event occurred.
Severity:
30 : Severe error

Explanation:
During the processing of network events, an unexpected event <insert_1> occurred.

Response:
None.

AMQ9231 The supplied parameter is not valid.
Severity:
30 : Severe error

Explanation:
The value of the <insert_4> <insert_5> parameter has the value <insert_3>. This value has either not been specified or has been specified incorrectly.

Response:
Check value of the <insert_5> parameter and correct it if necessary. If the fault persists, record the return code (<insert_1>,<insert_2>) and <insert_4> and tell the systems administrator.

AMQ9232 No <insert_3> specified
Severity:
30 : Severe error

Explanation:
The operation requires the specification of the <insert_3> field.

Response:
Specify the <insert_3> and retry the operation.

AMQ9233 Error creating <insert_3> thread.
Severity:
30 : Severe error

Explanation:
The process attempted to create a new thread. The most likely cause of this problem is a shortage of an operating system resource (for example: memory). Use any previous FFSTs to determine the reason for the failure. The WebSphere MQ internal return code describing the reason for the failure is <insert_1>.

Response:
Contact the systems administrator. If the problem persists contact your IBM support center.

AMQ9235 The supplied local communications address cannot be resolved.
Severity:
30 : Severe error

Explanation:
The local communications address (LOCLADDR) value <insert_3> cannot be resolved into an IP address.

Response:
Enter a local communications address value which can be resolved into an IP address, and try again.

AMQ9236 The supplied Partner LU was invalid.
Severity:
30 : Severe error

Explanation:
The <insert_4> Partner LU name <insert_3> was invalid.

Response:
Either the Partner LU name was entered incorrectly or it was not in the <insert_4> communications configuration. Correct the error and try again.

AMQ9237 A configuration error for <insert_4> occurred.
Severity:
30 : Severe error

Explanation:
Allocation of a <insert_4> conversation to host <insert_3> was not possible. The configuration error may be one of the following: 
1. It may be that one of the transmission parameters (Mode, or TP Name) was incorrect. Correct the error and try again. The mode name should be the same as the mode defined on host <insert_3>. The TP name on <insert_3> should be defined. 

2. It may be that an LU 6.2 session has not been established. Contact your systems administrator. 
The return code from <insert_4> is <insert_1> with associated <insert_5> <insert_2>.

Response:
Record the error values and tell the system administrator.

AMQ9238 A communications error for <insert_4> occurred.
Severity:
30 : Severe error

Explanation:
An unexpected error occurred in communications.

Response:
The return code from the <insert_4><insert_3> call was <insert_1> with associated <insert_5> <insert_2>.

AMQ9239Usage: <insert_3> [-m QMgrName] -n TpName -g Gateway-name
Severity:
10 : Warning

Explanation:
Values passed to the listener program were invalid. The parameter string passed to this program is as follows, default values being used for parameters not supplied: [-m QMgrName] -n TpName -g Gateway-name

Response:
Correct the parameters passed to the listener program and retry the operation.

AMQ9240 An SPX socket was already in use.
Severity:
30 : Severe error

Explanation:
The Listener received return code <insert_1> when attempting to open socket <insert_2>.

Response:
The specified socket is already in use by another process. To use another socket specify another socket on the command line to RUNMQLSR or update the default in the qm.ini file.

AMQ9240 (iSeries)An SPX socket was already in use.
Severity:
30 : Severe error

Explanation:
The Listener received return code <insert_1> when attempting to open socket <insert_2>.

Response:
The specified socket is already in use by another process. To use another socket specify another socket on the command line to STRMQMLSR or update the default in the qm.ini file.

AMQ9240 (Windows)An SPX socket was already in use.
Severity:
30 : Severe error

Explanation:
The listener received return code <insert_1> when attempting to open socket <insert_2>.

Response:
The specified socket is already in use by another process. To use another socket, specify a different socket on the command line to the runmqlsr command, or update the default in the configuration data.

AMQ9241 SPX is not available.
Severity:
30 : Severe error

Explanation:
WebSphere MQ received return code <insert_1> when attempting to start SPX communications.

Response:
Ensure that IPX/SPX support is installed on the machine and that it is started before trying to start a WebSphere MQ SPX channel.

AMQ9242 SPX resource problem.
Severity:
30 : Severe error

Explanation:
WebSphere MQ received return code <insert_1> when attempting to start SPX communications, indicating a resource problem.

Response:
Ensure that sufficient IPX/SPX resources are available before commencing communications over IPX/SPX.

AMQ9243 The queue manager <insert_3> does not exist.
Severity:
30 : Severe error

Explanation:
You tried to perform an action against a queue manager that does not exist. You may have specified the wrong queue manager name.

Response:
If you specified the wrong name, correct the name and submit the command again. If the queue manager does not exist, create the queue manager and submit the command again.

AMQ9244The default queue manager does not exist.
Severity:
30 : Severe error

Explanation:
You tried to perform an action against a queue manager that does not exist.

Response:
Create the default queue manager and submit the command again.

AMQ9245 (Windows)Unable to obtain account details for channel MCA user ID.
Severity:
10 : Warning

Explanation:
WebSphere MQ was unable to obtain the account details for MCA user ID <insert_3>. This user ID was the MCA user ID for channel <insert_4> on queue manager <insert_5> and may have been defined in the channel definition, or supplied either by a channel exit or by a client.

Response:
Ensure that the user ID is correct and that it is defined on the Windows local system, the local domain or on a trusted domain. For a domain user ID, ensure that all necessary domain controllers are available.

AMQ9246The TCP/IP listener on port <insert_1> could not start a new channel.
Severity:
30 : Severe error

Explanation:
An attempt has been made to connect to the queue manager by starting a new channel within the TCP/IP listener which is listening on port <insert_1>. The maximum socket number which can be used by a channel running on this listener is <insert_2>. A socket number beyond this maximum was allocated for the new channel. This connection attempt has been rejected, but the listener continues to listen for further connection requests. The socket number allocated for a new listener channel is related to the number of channels currently running within that listener process. The problem has arisen because too many channels are directed at the port on which this listener is listening.

Response:
An extra listener process should be started to listen on a different port. Some of the channels to the queue manager should be redirected from the port on which the existing listener is listening to the new port.

AMQ9247SSPI Security: bad return from SSPI call.
Severity:
30 : Severe error

Explanation:
Channel <insert_3> has been closed because the SSPI channel exit received a bad return code from SSPI.

Response:
Consult the appropriate SSPI manuals to find out the meaning of status <insert_4> on call <insert_5>, and correct the error.

AMQ9248The program could not bind to a <insert_3> socket.
Severity:
30 : Severe error

Explanation:
The attempt to bind to socket <insert_4> failed with return code <insert_1>. The failing <insert_3> call was <insert_5>. The most likely cause of this problem is incorrect configuration of the <insert_3> local address or incorrect start and end port parameters.

Response:
Contact the system administrator. If the problem persists contact your IBM support center.

AMQ9255Listener already running.
Severity:
30 : Severe error

Explanation:
The request to start the WebSphere MQ listener failed because there is already a listener running against the specified network resources.

Response:
None.

AMQ9259Connection timed out from host <insert_3>.
Severity:
30 : Severe error

Explanation:
A connection from host <insert_3> over <insert_4> timed out.

Response:
Check to see why data was not received in the expected time. Correct the problem. Reconnect the channel, or wait for a retrying channel to reconnect itself.

AMQ9401Channel <insert_3> autodefined.
Severity:
0 : Information

Explanation:
Channel <insert_3> which did not previously exist has been autodefined.

Response:
None.

AMQ9402Autodefinition exit for Channel <insert_3> failed to load.
Severity:
30 : Severe error

Explanation:
Autodefinition of Channel <insert_3> failed because <insert_4> would not load.

Response:
Ensure that the user exit is specified correctly in the queue manager definition, and that the user exit program is correct and available.

AMQ9403Autodefinition of Channel <insert_3> suppressed by user exit.
Severity:
30 : Severe error

Explanation:
Autodefinition exit <insert_4> for Channel <insert_3> returned a failure code.

Response:
None.

AMQ9404REFRESH CLUSTER REPOS(YES) command processed, cluster <insert_4>, <insert_1> objects changed.
Severity:
0 : Information

Explanation:
The queue manager successfully processed a REFRESH CLUSTER command with the REPOS(YES) option for the indicated cluster.

Response:
None.

AMQ9405FORCEREMOVE QUEUES(YES) command processed, cluster <insert_3> target <insert_4>.
Severity:
0 : Information

Explanation:
The repository queue manager successfully processed a RESET ACTION(FORCEREMOVE) command with the QUEUES(YES) option for the indicated cluster and target queue manager.

Response:
None.

AMQ9406REFRESH CLUSTER REPOS(YES) command failed, this queue manager is a full repository for cluster <insert_4>.
Severity:
30 : Severe error

Explanation:
The repository queue manager could not process a REFRESH CLUSTER command with the REPOS(YES) option for the indicated cluster, because the local queue manager provides full repository management services for the cluster. The command is ignored.

Response:
Either 
1) Reissue the command without REPOS(YES), or 
2) Issue the command on a queue manager which is not a full repository, or 
3) Change this queue manager definition so that it is not a full repository.

AMQ9407Cluster queue <insert_3> is defined inconsistently.
Severity:
10 : Warning

Explanation:
The definition of cluster queue <insert_3> on the queue manager with UUID <insert_4> has different DEFPRTY, DEFPSIST and DEFBIND values from the definition of the same cluster queue on the queue manager with UUID <insert_5>. Both definitions now exist in the local repository. All definitions of the same cluster queue should be identical. In particular, problems arise if your applications rely on a queue default value which is defined inconsistently to determine messaging behavior. This applies, for example, if the applications open a cluster queue with option MQOO_BIND_AS_Q_DEF. If different instances of the queue have different DEFBIND values the behavior of the message transfer differs depending on which instance of the queue is selected when it is opened. In general the instance selected varies across opens.

Response:
For each inconsistency decide which of the values is the correct one. Alter the definitions of cluster queue <insert_3> so that all definitions have correct DEFPRTY, DEFPSIST and DEFBIND values.

AMQ9408BIND_ON_OPEN messages for channel <insert_3> to dead-letter queue.
Severity:
0 : Information

Explanation:
The remote CLUSRCVR for channel <insert_3> was deleted while undelivered BIND_ON_OPEN messages associated with that channel existed on the local SYSTEM.CLUSTER.TRANSMIT.QUEUE. These messages could not be allocated to another channel because they were put BIND_ON_OPEN, but were very unlikely to ever flow along the channel with which they were associated as this has now been deleted. An attempt has therefore been made to move them from the transmission queue to the local dead-letter queue. The MQDLH reason is MQFB_BIND_OPEN_CLUSRCVR_DEL. Note that any internal WebSphere MQ Clustering messages for the deleted channel will also have been removed from the SYSTEM.CLUSTER.TRANSMIT.QUEUE (these are discarded) so the current depth of the queue may have decreased by more than the number of user messages moved to the dead-letter queue.

Response:
Examine the contents of the dead-letter queue. Each message is contained in an MQDLH structure that includes the reason why it was written and where it was originally addressed. Also look at previous error messages to see if the attempt to put messages to the dead-letter queue failed.

AMQ9409Repository manager ended abnormally.
Severity:
30 : Severe error

Explanation:
The repository manager ended abnormally.

Response:
Look at previous error messages for the repository manager in the error files to determine the cause of the failure.

AMQ9410Repository manager started
Severity:
0 : Information

Explanation:
The repository manager started successfully.

Response:
None.

AMQ9411Repository manager ended normally.
Severity:
0 : Information

Explanation:
The repository manager ended normally.

Response:
None.

AMQ9412Repository command received for <insert_3>.
Severity:
30 : Severe error

Explanation:
The repository manager received a command intended for some other queue manager, whose identifier is <insert_3>. The command was sent by the queue manager with identifier <insert_4>.

Response:
Check the channel and cluster definitions of the sending queue manager.

AMQ9413Repository command format error, command code <insert_1>
Severity:
30 : Severe error

Explanation:
An internal error has occurred.

Response:
Collect the items listed in the 'Problem determination' section of the System Administration manual and contact your IBM support center.

AMQ9415Repository command unexpected, command code <insert_1>, cluster object <insert_3>, sender <insert_4>
Severity:
30 : Severe error

Explanation:
An internal error has occurred.

Response:
Collect the items listed in the 'Problem determination' section of the System Administration manual and contact your IBM support center.

AMQ9415 (iSeries)An internal error has occurred.
Severity:
30 : Severe error

Explanation:
Repository command unexpected, command code <insert_1>, cluster object <insert_3>, sender <insert_4>

Response:
Collect the items listed in the 'Problem determination' section of the System Administration manual and contact your IBM support center.

AMQ9416Repository command processing error, RC=<insert_2>, command code <insert_1>, cluster object <insert_3>, sender <insert_4>.
Severity:
30 : Severe error

Explanation:
An internal error has occurred.

Response:
Collect the items listed in the 'Problem determination' section of the System Administration manual and contact your IBM support center.

AMQ9416 (iSeries)An internal error has occurred.
Severity:
30 : Severe error

Explanation:
Repository command processing error, RC=<insert_2>, command code <insert_1>, cluster object <insert_3>, sender <insert_4>.

Response:
Collect the items listed in the 'Problem determination' section of the System Administration manual and contact your IBM support center.

AMQ9417Manually defined CLUSSDR channels have been forcibly removed.
Severity:
0 : Information

Explanation:
The administrator has asked for the queue manager <insert_3> to be deleted, or forcibly removed, but has not yet deleted the manually defined CLUSSDR channels to <insert_3>. The auto-defined channels to <insert_3> have been deleted, but <insert_3> continues to receive updates until the manually defined CLUSSDR channels have been deleted.

Response:
Delete the manually defined CLUSSDR channels to <insert_3>.

AMQ9418Only one repository for cluster <insert_3>.
Severity:
0 : Information

Explanation:
The queue manager has received information about a cluster for which it is the only repository.

Response:
Alter the REPOS or REPOSNL attribute of the queue manager, that is to have the second full repository for the cluster, to specify the cluster name.

AMQ9419No cluster-receiver channels for cluster <insert_3>
Severity:
0 : Information

Explanation:
The repository manager has received information about a cluster for which no cluster-receiver channels are known.

Response:
Define cluster-receiver channels for the cluster on the local queue manager.

AMQ9420No repositories for cluster <insert_3>.
Severity:
0 : Information

Explanation:
The queue manager has received information about a cluster for which no repositories are known.

Response:
Alter the REPOS or REPOSNL attribute of the queue manager, that is to have a full repository for the cluster, to specify the cluster name.

AMQ9421Invalid cluster record action code detected
Severity:
30 : Severe error

Explanation:
An invalid record was read from the SYSTEM.CLUSTER.REPOSITORY.QUEUE. An FFST record has been generated containing the invalid record. was

Response:
Collect the items listed in the Problem Determination section of the System Administration manual and contact your IBM support center.

AMQ9422Repository manager error, RC=<insert_1>
Severity:
30 : Severe error

Explanation:
An internal error has occurred.

Response:
Collect the items listed in the 'Problem determination' section of the System Administration manual and contact your IBM support center.

AMQ9425An internal error has occurred.
Severity:
30 : Severe error

Explanation:
Repository command merge error, command code <insert_1>, cluster object <insert_3>, sender <insert_4>

Response:
Collect the items listed in the 'Problem determination' section of the System Administration manual and contact your IBM support center.

AMQ9426Repository command recipient unknown.
Severity:
30 : Severe error

Explanation:
The repository manager tried to send a command to another queue manager using channel <insert_4>. The recipient queue manager, whose identifier is <insert_3>, could not be found. Command code <insert_1>.

Response:
Check the channel and cluster definitions of the sending and receiving queue managers.

AMQ9427CLUSSDR channel does not point to a repository queue manager.
Severity:
30 : Severe error

Explanation:
A CLUSSDR channel must point to a queue manager that hosts repositories for all clusters of which the channel is a member. In addition, the CLUSRCVR for the channel must be a member of all the same clusters as the CLUSSDR channel. The queue manager pointed to by CLUSSDR channel <insert_3> does not meet these criteria for cluster <insert_4>. The remote queue manager has a QMID of <insert_5>.

Response:
Check the definitions on the local and remote queue managers to ensure that the CLUSSDR channel points to a queue manager that hosts a repository for the cluster, and that the CLUSRCVR for the channel is a member of the cluster.

AMQ9428Unexpected publication of a cluster queue object received.
Severity:
30 : Severe error

Explanation:
The local queue manager has received a publication of a cluster queue object from a remote queue manager on cluster <insert_3>. The local queue manager discards the request because it does not host a repository for cluster <insert_3> and has not subscribed to the published object. The remote CLUSSDR channel used to access the local queue manager has a channel name of <insert_4> and the remote queue manager has a QMID of <insert_5>.

Response:
Check the definitions on the local and remote queue managers to ensure that the CLUSSDR channel points to a repository queue manager for the cluster.

AMQ9429Unexpected publication of a cluster queue deletion received.
Severity:
30 : Severe error

Explanation:
The local queue manager has received a publication of a cluster queue deletion from a remote queue manager on cluster <insert_3>. The local queue manager discards the request because it does not host a repository for cluster <insert_3> and has not subscribed to the published object. The remote CLUSSDR channel used to access the local queue manager has a channel name of <insert_4> and the remote queue manager has a QMID of <insert_5>.

Response:
Check the definitions on the local and remote queue managers to ensure that the CLUSSDR channel points to a repository queue manager for the cluster.

AMQ9430Unexpected cluster queue manager publication received.
Severity:
30 : Severe error

Explanation:
The local queue manager has received a cluster queue manager publication on cluster <insert_3>. The local queue manager should not have received the publication because it does not host a repository for cluster <insert_3>, it has not subscribed to information concerning the published object, and the published object does not match any of its CLUSSDRs. The queue manager that sent the publication to the local queue manager has QMID <insert_4> (note that this is not necessarily the queue manager which originated the publication). CLUSSDR channel <insert_5> was used to send the publication.

Response:
Check the CLUSSDR definition on the sending queue manager to ensure that it points to a repository queue manager for the cluster.

AMQ9431Remote queue manager no longer hosts a repository for cluster
Severity:
0 : Information

Explanation:
The local queue manager has received a message from remote queue manager QMID <insert_3> indicating that it no longer hosts a repository for cluster <insert_4>. CLUSSDR channel <insert_5> is altered so that it can no longer be used to access queue manager <insert_3> within cluster <insert_4>. If the local queue manager does not host a repository for cluster <insert_4> the relevant subscriptions and publications are remade if possible.

Response:
None.

AMQ9432Query received by a non-repository queue manager
Severity:
30 : Severe error

Explanation:
The local queue manager has received a query from a remote queue manager on cluster <insert_3>. The local queue manager discards the query because it does not host a repository for cluster <insert_3>. The remote CLUSSDR channel used to access the local queue manager has a channel name of <insert_4> and the remote queue manager has a QMID of <insert_5>.

Response:
Check the definitions on the local and remote queue managers to ensure that the CLUSSDR channel points to a repository queue manager for the cluster.

AMQ9433CLUSRCVR must be in the same cluster as its matching CLUSSDR.
Severity:
30 : Severe error

Explanation:
CLUSRCVR channel <insert_3> is not defined as a member of cluster <insert_4>. The local queue manager has received a command that indicates that CLUSSDR channel <insert_3> on the remote queue manager with QMID <insert_5> is defined as a member of cluster <insert_4>.

Response:
Alter the CLUSRCVR or CLUSSDR definitions for channel <insert_3>, so that they are both members of the same cluster.

AMQ9434Unrecognized message on <insert_3>.
Severity:
30 : Severe error

Explanation:
The repository manager found a message on one of its queues having, either a format that could not be recognized, or that did not come from a queue manager or repository manager. The message was put on the dead-letter queue.

Response:
Examine the message on the dead-letter queue to determine the originator of the message.

AMQ9435Unable to put repository manager message.
Severity:
30 : Severe error

Explanation:
The repository manager tried to send a message to the SYSTEM.CLUSTER.COMMAND.QUEUE on another queue manager whose identifier is <insert_3>, but the MQPUT call was unsuccessful. MQCC=<insert_1>, MQRC=<insert_2>. Processing continues, but the repository information may be out of date.

Response:
Refer to the Application Programming Reference manual for information about MQCC <insert_1> and MQRC <insert_2>. Check the channel and cluster definitions on the local and target queue managers, and ensure that the channels between them are running. When the problem is corrected, the repository information will normally be updated automatically. The REFRESH CLUSTER command can be used to ensure that the repository information is up to date.

AMQ9436Unable to send repository manager message.
Severity:
30 : Severe error

Explanation:
The repository manager tried to send a message to the SYSTEM.CLUSTER.COMMAND.QUEUE on a queue manager that has the full repository for the specified cluster(<insert_3>), but the MQPUT call was unsuccessful. MQCC=<insert_1>, MQRC=<insert_2>. Processing continues, but repository information may be out of date.

Response:
Refer to the Application Programming Reference manual for information about MQCC <insert_1> and MQRC <insert_2>. Check the channel and cluster definitions on the local and target queue managers, and ensure that the channels between them are running. When the problem is corrected, the repository information will normally be updated automatically. The REFRESH CLUSTER command can be used to ensure that the repository information is up to date.

AMQ9437Unable to commit repository changes.
Severity:
30 : Severe error

Explanation:
The repository manager tried to commit some updates to the repository but was unsuccessful. Processing continues, but repository information may be out of date.

Response:
If this occurs when the repository manager is stopping, this message can be ignored, because the repository information will normally be updated automatically when the repository manager is restarted. If there is an isolated occurrence at other times, use the REFRESH CLUSTER command to bring the repository information up to date. If the problem persists, contact your IBM support center for assistance.

AMQ9438CONNAME could not be discovered for CLUSRCVR <insert_3>.
Severity:
30 : Severe error

Explanation:
TCP/IP CLUSRCVR <insert_3> was validly specified with a blank or absent CONNAME parameter. However when the repository process, amqrrmfa, attempted to obtain the CONNAME (IP address) for itself it was unable to. If there is an existing matching CLUSRCVR object in the cache its CONNAME is used. The CONNAME used was <insert_4>.

Response:
Check the error log for a message arising from an associated TCP/IP call (gethostname, gethostbyname or inet_ntoa). Pass all the error information to your systems administrator.

AMQ9439Repository corruption: bad CLQMGR object for channel <insert_3>.
Severity:
30 : Severe error

Explanation:
An internal error has occurred.

Response:
Collect the items listed in the 'Problem determination' chapter in the System Administration guide and contact your IBM support center.

AMQ9440Reset command failed.
Severity:
0 : Information

Explanation:
Reset Cluster(<insert_3>) Qmname(<insert_4>) command failed. To issue this command, queue manager <insert_5> must be a repository for cluster <insert_3>. Alter the queue manager attributes Repos, or Reposnl, to include cluster <insert_3> and retry the command.

Response:
None.

AMQ9441Reset command processed.
Severity:
0 : Information

Explanation:
The reset Cluster(<insert_3>) Qmname(<insert_4>) command has processed on this repository and <insert_1> other queue managers have been sent notification.

Response:
None.

AMQ9442Refresh Cluster command processed.
Severity:
0 : Information

Explanation:
The Refresh Cluster(<insert_4>) command caused <insert_1> objects to be refreshed and <insert_2> objects to be republished.

Response:
None.

AMQ9443Suspend Qmgr Cluster command processed.
Severity:
0 : Information

Explanation:
The Suspend Qmgr Cluster command completed. <insert_1> objects suspended.I n the case of a name list the cluster name is the first name in the list.

Response:
None.

AMQ9444Resume Qmgr Cluster command processed.
Severity:
0 : Information

Explanation:
The Resume Qmgr Cluster(<insert_4>) command completed. <insert_1> objects resumed. In the case of a name list the cluster name is the first name in the list.

Response:
None.

AMQ9445Error creating channel <insert_3>.
Severity:
30 : Severe error

Explanation:
Channel <insert_4> tried to replace itself by creating channel <insert_3>. The attempt to create the channel was unsuccessful for the following reason: "<insert_5>". A previous message may give further information.

Response:
Rectify the problem which prevented successful creation of channel <insert_3>. Restart channel <insert_4>.

AMQ9446Error deleting channel <insert_3>.
Severity:
30 : Severe error

Explanation:
Channel <insert_3> tried to delete itself after creating channel <insert_4> to replace it. The attempt to delete the channel was unsuccessful for the following reason: "<insert_5>".

Response:
If channel <insert_3> still exists rectify the problem which prevented its deletion and then manually delete the channel.

AMQ9447Unable to backout repository changes.
Severity:
30 : Severe error

Explanation:
Following an error, the repository manager tried to backout some updates to the repository, but was unsuccessful. The repository manager terminates.

Response:
If the repository manager subsequently restarts successfully, this message can be ignored. If the repository manager does not restart, contact your IBM support center for assistance.

AMQ9448Repository manager stopping because of errors. Restart in <insert_1> seconds.
Severity:
30 : Severe error

Explanation:
A severe error, as reported in the preceding messages, occurred during repository manager processing. The repository manager was unable to continue and terminates. The repository manager will try to restart after the specified interval.

Response:
Correct the problem reported in the preceding messages.

AMQ9449Repository manager restarted.
Severity:
0 : Information

Explanation:
The repository manager restarted successfully following an error.

Response:
None.

AMQ9450Usage: <insert_3> [-m QMgrName] -f OutputFile [-v OutputFileVersion]
Severity:
10 : Warning

Explanation:
Values passed to the channel table writer program were invalid. 
The parameter string passed to this program is as follows: 
[-m QMgrName] -f OutputFile [-v OutputFileVersion] 

where OutputFileVersion can be either 2 or 5 (5 is the default) 
Default values will be used for parameters not supplied.

Response:
Correct the parameters passed to the channel table writer program and retry the operation.

AMQ9453FORCEREMOVE command failed, cluster <insert_3> target <insert_4> is not unique.
Severity:
0 : Information

Explanation:
The repository queue manager could not process a RESET ACTION(FORCEREMOVE) command for the indicated cluster and target queue manager, because there is more than one queue manager with the specified name in the cluster. The command is ignored.

Response:
Reissue the command specifying the identifier (QMID) of the queue manager to be removed, rather than its name.

AMQ9455FORCEREMOVE command failed, cluster <insert_3>, target <insert_4>, not found.
Severity:
0 : Information

Explanation:
The repository queue manager could not process a RESET ACTION(FORCEREMOVE) command for the indicated cluster and target queue manager, because no information about that queue manager was found in the local repository. The command is ignored.

Response:
Reissue the command, specifying the correct queue manager name or identifier.

AMQ9456Update not received for queue <insert_3>, queue manager <insert_4> from full repository for cluster <insert_5>.
Severity:
0 : Information

Explanation:
The repository manager detected a queue that has been used in the last 30 days for which updated information should have been sent from a full repository. However, this has not occurred. 
The repository manager will keep the information about this queue for a further 60 days.

Response:
If the queue is still required, check that: 
1) The cluster channels to and from the full repository and the queue manager that hosts the queue, are able to run. 
2) The repository managers running on these queue managers have not ended abnormally.

AMQ9456 (iSeries)Update not received from full repository.
Severity:
0 : Information

Explanation:
Update not received for queue <insert_3>, queue manager <insert_4> from full repository for cluster <insert_5>. 
The repository manager detected a queue that has been used in the last 30 days for which updated information should have been sent from a full repository. However, this has not occurred. 
The repository manager will keep the information about this queue for a further 60 days.

Response:
If the queue is still required, check that: 
1) The cluster channels to and from the full repository and the queue manager that hosts the queue, are able to run. 
2) The repository managers running on these queue managers have not ended abnormally.

AMQ9457Repository available, cluster <insert_4>, channel <insert_5>, sender <insert_3>.
Severity:
0 : Information

Explanation:
The repository queue manager received a command from another queue manager, whose identifier is <insert_3>, reporting that it is again a repository for cluster <insert_4>. The cluster-sender channel <insert_5> is changed so that it can be used to access the other queue manager in relation to the the cluster.

Response:
None.

AMQ9491Transmission Queue <insert_3> set to NOSHARE.
Severity:
20 : Error

Explanation:
The channel <insert_4> on queue manager <insert_5> cannot start because this queue manager has a setting for PipeLineLength greater than 1, and so multiple threads will run in this channel's MCA. Only the first thread would be able to open the Transmission Queue <insert_3> because it is set to be non-shareable.

Response:
Check the definition of the Transmission Queue <insert_3> on queue manager <insert_5> and set it to be SHARE instead of NOSHARE. Alternatively, you can set all channels on this queue manager to use only a single thread, by using the PipeLineLength parameter.

AMQ9492The <insert_3> responder program encountered an error.
Severity:
30 : Severe error

Explanation:
The responder program was started but detected an error.

Response:
Look at previous error messages in the error files to determine the error encountered by the responder program.

AMQ9494A protocol error was detected for channel <insert_3>.
Severity:
30 : Severe error

Explanation:
During communications with the remote queue manager, a TCP/IP read and receive call returned EINTR, indicating that it had been interrupted. Immediately after this the channel program detected a protocol error. The failure type was <insert_1> with associated data of <insert_2>.

Response:
If you are running an AIX client you will avoid problems arising from EINTRs on TCP/IP reads, by writing your application so that system calls interrupted by signals are restarted. You must establish the signal handler with sigaction(2) and set the SA_RESTART flag in the sa_flags field of the new action structure. If you are running on a platform other than AIX, an AIX server, or an AIX client with an application that adheres to the restart guidelines provided above, contact the systems administrator who should examine the error logs to determine the cause of the failure.

AMQ9495The CLWL exit <insert_3> is inconsistent with a dynamic cache.
Severity:
30 : Severe error

Explanation:
When the CLWL exit <insert_3> was called for the ExitReason MQXR_INIT, the value <insert_1> was returned in the ExitResponse2 field. This indicates the CLWL exit is incompatible with the Queue Manager cache type which is dynamic. Either change the Queue Manager cache type to static (using the Tuning Parameter, ClusterCacheType=STATIC) or rewrite the CLWL exit to be compatible with a dynamic cache". The CLWL exit has been suppressed.

Response:
None.

AMQ9496Channel ended by a remote exit.
Severity:
30 : Severe error

Explanation:
Channel program <insert_3> was ended because the channel exit at the remote end requested it.

Response:
Examine the error logs at the remote end of the channel to see the reason why the remote exit ended the channel.

AMQ9498The MQCD structure supplied was not valid.
Severity:
30 : Severe error

Explanation:
The value of the <insert_3> field has the value <insert_4>. This value is invalid for the operation requested.

Response:
Change the parameter and retry the operation.

AMQ9499A WebSphere MQ listener will end shortly.
Severity:
0 : Information

Explanation:
One listener detected in the system is scheduled for shutdown.

Response:
None.

AMQ9500No Repository storage
Severity:
10 : Warning

Explanation:
An operation failed because there was no storage available in the repository. An attempt was made to allocate <insert_1> bytes from <insert_3>.

Response:
Reconfigure the Queue Manager to allocate a larger repository.

AMQ9501Usage: <insert_3> [-m QMgrName] -c ChlName.
Severity:
10 : Warning

Explanation:
Values passed to the channel program are not valid. The parameter string passed to this program is as follows :- [-m QMgrName] -c ChlName Default values will be used for parameters not supplied.

Response:
Correct the parameters passed to the Channel program and retry the operation.

AMQ9502Type of channel not suitable for action requested.
Severity:
30 : Severe error

Explanation:
The operation requested cannot be performed on channel <insert_3>. Some operations are only valid for certain channel types. For example, you can only ping a channel from the end sending the message.

Response:
Check whether the channel name is specified correctly. If it is check that the channel has been defined correctly.

AMQ9503Channel negotiation failed.
Severity:
30 : Severe error

Explanation:
Channel <insert_3> between this machine and the remote machine could not be established due to a negotiation failure.

Response:
Tell the systems administrator, who should attempt to identify the cause of the channel failure using problem determination techniques. For example, look for FFST files, and examine the error logs on the local and remote systems where there may be messages explaining the cause of failure. More information may be obtained by repeating the operation with tracing enabled.

AMQ9504A protocol error was detected for channel <insert_3>.
Severity:
30 : Severe error

Explanation:
During communications with the remote queue manager, the channel program detected a protocol error. The failure type was <insert_1> with associated data of <insert_2>.

Response:
Contact the systems administrator who should examine the error logs to determine the cause of the failure.

AMQ9505Channel sequence number wrap values are different.
Severity:
30 : Severe error

Explanation:
The sequence number wrap value for channel <insert_3> is <insert_1>, but the value specified at the remote location is <insert_2>. The two values must be the same before the channel can be started.

Response:
Change either the local or remote channel definitions so that the values specified for the message sequence number wrap values are the same.

AMQ9506Message receipt confirmation failed.
Severity:
30 : Severe error

Explanation:
Channel <insert_3> has ended because the remote queue manager did not accept the last batch of messages.

Response:
The error log for the channel at the remote site will contain an explanation of the failure. Contact the remote Systems Administrator to resolve the problem.

AMQ9507Channel <insert_3> is currently in-doubt.
Severity:
30 : Severe error

Explanation:
The requested operation cannot complete because the channel is in-doubt with host <insert_4>.

Response:
Examine the status of the channel, and either restart a channel to resolve the in-doubt state, or use the RESOLVE CHANNEL command to correct the problem manually.

AMQ9508Program cannot connect to the queue manager.
Severity:
30 : Severe error

Explanation:
The connection attempt to queue manager <insert_4> failed with reason code <insert_1>.

Response:
Ensure that the queue manager is available and operational.

AMQ9509Program cannot open queue manager object.
Severity:
30 : Severe error

Explanation:
The attempt to open either the queue or queue manager object <insert_4> on queue manager <insert_5> failed with reason code <insert_1>.

Response:
Ensure that the queue is available and retry the operation.

AMQ9510Messages cannot be retrieved from a queue.
Severity:
30 : Severe error

Explanation:
The attempt to get messages from queue <insert_4> on queue manager <insert_5> failed with reason code <insert_1>.

Response:
If the reason code indicates a conversion problem, for example MQRC_SOURCE_CCSID_ERROR, remove the message(s) from the queue. Otherwise, ensure that the required queue is available and operational.

AMQ9511Messages cannot be put to a queue.
Severity:
30 : Severe error

Explanation:
The attempt to put messages to queue <insert_4> on queue manager <insert_5> failed with reason code <insert_1>.

Response:
Ensure that the required queue is available and operational.

AMQ9512Ping operation is not valid for channel <insert_3>.
Severity:
30 : Severe error

Explanation:
Ping may only be issued for SENDER, SERVER or CLUSSDR channel types. Also, it may not be issued for an SSL channel on the HP-UX or Linux platforms.

Response:
If the local channel is a receiver channel, you must issue the ping from the remote queue manager.

AMQ9513Maximum number of channels reached.
Severity:
30 : Severe error

Explanation:
The maximum number of channels that can be in use simultaneously has been reached. The number of permitted channels is a configurable parameter in the queue manager configuration file.

Response:
Wait for some of the operating channels to close. Retry the operation when some channels are available.

AMQ9514Channel <insert_3> is in use.
Severity:
30 : Severe error

Explanation:
The requested operation failed because channel <insert_3> is currently active.

Response:
Either end the channel manually, or wait for it to close, and retry the operation.

AMQ9515Channel <insert_3> changed.
Severity:
10 : Warning

Explanation:
The statistics shown are for the channel requested, but it is a new instance of the channel. The previous channel instance has ended.

Response:
None.

AMQ9516File error occurred.
Severity:
30 : Severe error

Explanation:
The filesystem returned error code <insert_1> for file <insert_3>.

Response:
Record the name of the file <insert_3> and tell the systems administrator, who should ensure that file <insert_3> is correct and available.

AMQ9516 (iSeries)File error occurred.
Severity:
30 : Severe error

Explanation:
The filesystem returned error code <insert_4> for file <insert_3>.

Response:
Record the name of the file <insert_3> and tell the systems administrator, who should ensure that file <insert_3> is correct and available.

AMQ9517File damaged.
Severity:
30 : Severe error

Explanation:
The program has detected damage to the contents of file <insert_3>.

Response:
Record the values and tell the systems administrator who must restore a saved version of file <insert_3>. The return code was <insert_1> and the record length returned was <insert_2>.

AMQ9518File <insert_3> not found.
Severity:
30 : Severe error

Explanation:
The program requires that the file <insert_3> is present and available.

Response:
This may be caused by invalid values for the optional environment variables MQCHLLIB, MQCHLTAB or MQDATA. If these variables are valid or not set then record the name of the file and tell the systems administrator who must ensure that file <insert_3> is available to the program.

AMQ9519Channel <insert_3> not found.
Severity:
30 : Severe error

Explanation:
The requested operation failed because the program could not find a definition of channel <insert_3>.

Response:
Check that the name is specified correctly and the channel definition is available.

AMQ9520Channel not defined remotely.
Severity:
30 : Severe error

Explanation:
There is no definition of channel <insert_3> at the remote location.

Response:
Add an appropriate definition to the remote hosts list of defined channels and retry the operation.

AMQ9521Host is not supported by this channel.
Severity:
30 : Severe error

Explanation:
The connection across channel <insert_5> was refused because the remote host <insert_4> did not match the host <insert_3> specified in the channel definition.

Response:
Update the channel definition, or remove the explicit mention of the remote machine connection name.

AMQ9522Error accessing the status table.
Severity:
30 : Severe error

Explanation:
The program could not access the channel status table.

Response:
A value of <insert_1> was returned from the subsystem when an attempt was made to access the Channel status table. Contact the systems administrator, who should examine the log files to determine why the program was unable to access the status table.

AMQ9523Remote host detected a protocol error.
Severity:
30 : Severe error

Explanation:
During communications through channel <insert_3>, the remote queue manager channel program detected a protocol error. The failure type was <insert_1> with associated data of <insert_2>.

Response:
Tell the systems administrator, who should examine the error files to determine the cause of the failure.

AMQ9524Remote queue manager unavailable.
Severity:
30 : Severe error

Explanation:
Channel <insert_3> cannot start because the remote queue manager is not currently available.

Response:
Either start the remote queue manager, or retry the operation later.

AMQ9525Remote queue manager is ending.
Severity:
30 : Severe error

Explanation:
Channel <insert_3> is closing because the remote queue manager is ending.

Response:
None.

AMQ9526Message sequence number error for channel <insert_3>.
Severity:
30 : Severe error

Explanation:
The local and remote queue managers do not agree on the next message sequence number. A message with sequence number <insert_1> has been sent when sequence number <insert_2> was expected.

Response:
Determine the cause of the inconsistency. It could be that the synchronization information has become damaged, or has been backed out to a previous version. If the situation cannot be resolved, the sequence number can be manually reset at the sending end of the channel using the RESET CHANNEL command.

AMQ9527Cannot send message through channel <insert_3>.
Severity:
30 : Severe error

Explanation:
The channel has closed because the remote queue manager cannot receive a message.

Response:
Contact the systems administrator who should examine the error files of the remote queue manager, to determine why the message cannot be received, and then restart the channel.

AMQ9528User requested closure of channel <insert_3>.
Severity:
10 : Warning

Explanation:
The channel is closing because of a request by the user.

Response:
None.

AMQ9529Target queue unknown on remote host.
Severity:
30 : Severe error

Explanation:
Communication using channel <insert_3> has ended because the target queue for a message is unknown at the remote host.

Response:
Ensure that the remote host contains a correctly defined target queue, and restart the channel.

AMQ9530Program could not inquire queue attributes.
Severity:
30 : Severe error

Explanation:
The attempt to inquire the attributes of queue <insert_4> on queue manager <insert_5> failed with reason code <insert_1>.

Response:
Ensure that the queue is available and retry the operation.

AMQ9531Transmission queue specification error.
Severity:
30 : Severe error

Explanation:
Queue <insert_4> identified as a transmission queue in the channel definition <insert_3> is not a transmission queue.

Response:
Ensure that the queue name is specified correctly. If so, alter the queue usage parameter of the queue to that of a transmission queue.

AMQ9532Program cannot set queue attributes.
Severity:
30 : Severe error

Explanation:
The attempt to set the attributes of queue <insert_4> on queue manager <insert_5> failed with reason code <insert_1>.

Response:
Ensure that the queue is available and retry the operation.

AMQ9533Channel <insert_3> is not currently active.
Severity:
10 : Warning

Explanation:
The channel was not stopped because it was not currently active. If attempting to stop a specific instance of a channel by connection name or by remote queue manager name this message indicates that the specified instance of the channel is not running.

Response:
None.

AMQ9534Channel <insert_3> is currently not enabled.
Severity:
30 : Severe error

Explanation:
The channel program ended because the channel is currently not enabled.

Response:
Issue the START CHANNEL command to re-enable the channel.

AMQ9535User exit not valid.
Severity:
30 : Severe error

Explanation:
Channel program <insert_3> ended because user exit <insert_4> is not valid.

Response:
Ensure that the user exit is specified correctly in the channel definition, and that the user exit program is correct and available.

AMQ9536Channel ended by an exit.
Severity:
30 : Severe error

Explanation:
Channel program <insert_3> was ended by exit <insert_4>.

Response:
None.

AMQ9537Usage: <insert_3> [-m QMgrName] [-q InitQ]
Severity:
10 : Warning

Explanation:
Values passed to the Channel Initiator program are not valid. The parameters should be passed as follows: [-m QMgrName] [-q InitQ] Default values are used for parameters that are not supplied.

Response:
Correct the parameters passed to the program and retry the operation.

AMQ9538Commit control error.
Severity:
30 : Severe error

Explanation:
An error occurred when attempting to start commitment control. Either exception <insert_3> was received when querying commitment status, or commitment control could not be started.

Response:
Refer to the error log for other messages pertaining to this problem.

AMQ9539No channels available.
Severity:
30 : Severe error

Explanation:
The channel initiator program received a trigger message to start an MCA program to process queue <insert_3>. The program could not find a defined, available channel to start.

Response:
Ensure that there is a defined channel, which is enabled, to process the transmission queue.

AMQ9540Commit failed.
Severity:
30 : Severe error

Explanation:
The program ended because return code <insert_1> was received when an attempt was made to commit change to the resource managers. The commit ID was <insert_3>.

Response:
Tell the systems administrator.

AMQ9541CCSID supplied for data conversion not supported.
Severity:
30 : Severe error

Explanation:
The program ended because, either the source CCSID <insert_1> or the target CCSID <insert_2> is not valid, or is not currently supported.

Response:
Correct the CCSID that is not valid, or ensure that the requested CCSID can be supported.

AMQ9542Queue manager is ending.
Severity:
10 : Warning

Explanation:
The program will end because the queue manager is quiescing.

Response:
None.

AMQ9543Status table damaged.
Severity:
30 : Severe error

Explanation:
The channel status table has been damaged.

Response:
End all running channels and issue a DISPLAY CHSTATUS command to see the status of the channels. Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ9544Messages not put to destination queue.
Severity:
10 : Warning

Explanation:
During the processing of channel <insert_3> one or more messages could not be put to the destination queue and attempts were made to put them to a dead-letter queue. The location of the queue is <insert_1>, where 1 is the local dead-letter queue and 2 is the remote dead-letter queue.

Response:
Examine the contents of the dead-letter queue. Each message is contained in a structure that describes why the message was put to the queue, and to where it was originally addressed. Also look at previous error messages to see if the attempt to put messages to a dead-letter queue failed. The program identifier (PID) of the processing program was <insert_4>.

AMQ9545Disconnect interval expired.
Severity:
0 : Information

Explanation:
Channel <insert_3> closed because no messages arrived on the transmission queue within the disconnect interval period.

Response:
None.

AMQ9546Error return code received.
Severity:
30 : Severe error

Explanation:
The program has ended because return code <insert_1> was returned from function <insert_3>

Response:
Correct the cause of the failure and retry the operation.

AMQ9547Type of remote channel not suitable for action requested.
Severity:
30 : Severe error

Explanation:
The operation requested cannot be performed because channel <insert_3> on the remote machine is not of a suitable type. For example, if the local channel is defined as a sender the remote machine must define its channel as either a receiver or requester.

Response:
Check that the channel name is specified correctly. If it is, check that the remote channel has been defined correctly.

AMQ9548Message put to the 'dead-letter queue'.
Severity:
10 : Warning

Explanation:
During processing a message has been put to the dead-letter queue.

Response:
Examine the contents of the dead-letter queue. Each message is contained in a structure that describes why the message was put to the queue, and to where it was originally addressed.

AMQ9549Transmission Queue <insert_3> inhibited for MQGET.
Severity:
20 : Error

Explanation:
An MQGET failed because the transmission queue had been previously inhibited for MQGET.

Response:
None.

AMQ9550Channel program <insert_3> cannot be stopped at this time.
Severity:
30 : Severe error

Explanation:
The channel program can not be terminated immediately but should end shortly.

Response:
If the channel does not end in a short time issue the STOP CHANNEL command again.

AMQ9551Protocol not supported by remote host
Severity:
30 : Severe error

Explanation:
The operation you are performing over Channel <insert_3> to the host at <insert_4> is not supported by the target host.

Response:
Check that the connection name parameter is specified correctly and that the levels of the products in use are compatible.

AMQ9552Security flow not received.
Severity:
30 : Severe error

Explanation:
During communications through channel <insert_3> the local security exit requested security data from the remote machine. The security data has not been received so the channel has been closed.

Response:
Tell the systems administrator who should ensure that the security exit on the remote machine is defined correctly.

AMQ9553The function is not supported.
Severity:
30 : Severe error

Explanation:
The <insert_3> function <insert_4> attempted is not currently supported on this platform.

Response:
None.

AMQ9554User not authorized.
Severity:
30 : Severe error

Explanation:
You are not authorized to perform the Channel operation.

Response:
Tell the systems administrator who should ensure that the correct access permissions are available to you, and then retry the operation.

AMQ9555File format error.
Severity:
30 : Severe error

Explanation:
The file <insert_3> does not have the expected format.

Response:
Ensure that the file name is specified correctly.

AMQ9556Channel synchronization file missing or damaged.
Severity:
30 : Severe error

Explanation:
The channel synchronization file <insert_3> is missing or does not correspond to the stored channel information for queue manager <insert_4>.

Response:
Rebuild the synchronization file using the rcrmqmobj command rcrmqmobj -t syncfile (-m q-mgr-name)

AMQ9556 (iSeries)Channel synchronization file missing or damaged.
Severity:
30 : Severe error

Explanation:
The channel synchronization file <insert_3> is missing or does not correspond to the stored channel information for queue manager <insert_4>.

Response:
Rebuild the synchronization file using the RCRMQMOBJ command.

AMQ9557Queue Manager User ID initialization failed.
Severity:
30 : Severe error

Explanation:
The call to initialize the User ID failed with CompCode <insert_1> and Reason <insert_2>.

Response:
Correct the error and try again.

AMQ9558Remote Channel is not currently available.
Severity:
30 : Severe error

Explanation:
The channel program ended because the channel <insert_3> is not currently available on the remote system. This could be because the channel is disabled or that the remote system does not have sufficient resources to run a further channel.

Response:
Check the remote system to ensure that the channel is available to run, and retry the operation.

AMQ9560Rebuild Synchronization File - program started
Severity:
0 : Information

Explanation:
Rebuilding the Synchronization file for Queue Manager <insert_3> .

Response:
None.

AMQ9561Rebuild Synchronization File - program completed normally
Severity:
0 : Information

Explanation:
Rebuild Synchronization File program completed normally.

Response:
None.

AMQ9562Synchronization file in use.
Severity:
30 : Severe error

Explanation:
The Synchronization file <insert_3> is in use and cannot be recreated.

Response:
Stop any channel activity and retry the rcrmqmobj command.

AMQ9562 (iSeries)Synchronization file in use.
Severity:
30 : Severe error

Explanation:
The Synchronization file <insert_3> is in use and cannot be recreated.

Response:
Stop any channel activity and retry the RCRMQMOBJ command.

AMQ9563Synchronization file cannot be deleted
Severity:
30 : Severe error

Explanation:
The filesystem returned error code <insert_1> for file <insert_3>.

Response:
Tell the systems administrator who should ensure that file <insert_3> is available and not in use.

AMQ9564Synchronization File cannot be created
Severity:
30 : Severe error

Explanation:
The filesystem returned error code <insert_1> for file <insert_3>.

Response:
Tell the systems administrator.

AMQ9565No dead-letter queue defined.
Severity:
30 : Severe error

Explanation:
The queue manager <insert_4> does not have a defined dead-letter queue.

Response:
Either correct the problem that caused the program to try and write a message to the dead-letter queue or create a dead-letter queue for the queue manager.

AMQ9566Invalid MQSERVER value
Severity:
30 : Severe error

Explanation:
The value of the MQSERVER environment variable was <insert_3>. The variable should be in the format 'ChannelName/Protocol/ConnectionName'.

Response:
Correct the MQSERVER value and retry the operation.

AMQ9572Message header is not valid.
Severity:
30 : Severe error

Explanation:
Channel <insert_3> is stopping because a message header is not valid. During the processing of the channel, a message was found that has a header that is not valid. The dead-letter queue has been defined as a transmission queue, so a loop would be created if the message had been put there.

Response:
Correct the problem that caused the message to have a header that is not valid.

AMQ9573Maximum number of active channels reached.
Severity:
30 : Severe error

Explanation:
There are too many channels active to start another. The current defined maximum number of active channels is <insert_1>.

Response:
Either wait for some of the operating channels to close or use the stop channel command to close some channels. Retry the operation when some channels are available. The maximum number of active channels is a configurable parameter in the queue manager configuration file.

AMQ9574Channel <insert_3> can now be started.
Severity:
30 : Severe error

Explanation:
Channel <insert_3> has been waiting to start, but there were no channels available because the maximum number of active channels was running. One, or more, of the active channels has now closed so this channel can start.

AMQ9575DCE Security: failed to get the user's login name.
Severity:
30 : Severe error

Explanation:
System call <insert_4> to get the login name of the user running WebSphere MQ client application process <insert_1> failed with error value <insert_2>. This occurred in security exit function create_cred. The exit will now attempt to open channel <insert_3> using the DCE default login context.

Response:
If you wish to run using the DCE default login context take no action. If you wish to run using the user's login name as the DCE security exit principal examine the documentation for the operating system on which you are running MQ clients and reconfigure the operating system as necessary to allow the <insert_4> call to succeed.

AMQ9576DCE Security: an exit could not allocate memory.
Severity:
30 : Severe error

Explanation:
A DCE exit was unsuccessful in obtaining the memory it needed. The failure occurred in exit function <insert_4>. Channel <insert_3> is closed.

Response:
Make more memory available to the WebSphere MQ system and restart the relevant channel.

AMQ9577DCE security exit: no partner name.
Severity:
30 : Severe error

Explanation:
Channel <insert_3> has not been opened because the DCE security exit which initiates the security context was not passed a valid partner name. When the DCE security exit is called to initiate the security context it is essential that the PartnerName field in the MQCXP structure contains a valid partner name. On this call it did not. This can arise as a result of a usage error, for instance only specifying the security exit on one end of the channel. The error was reported from security exit function savePartnerName.

Response:
Check your usage of the DCE security exit for errors, such as only specifying the exit in one of the matching channel definitions. Correct any errors found and retry.

AMQ9578DCE Security: bad return from DCE call.
Severity:
30 : Severe error

Explanation:
Channel <insert_3> has been closed because one of the DCE channel exits received a bad return code from DCE.

Response:
Consult the appropriate DCE manuals to find out the meaning of major_status <insert_1> and minor_status <insert_2> on call <insert_5>. Then rectify the error. The exit function name is <insert_4>.

AMQ9579DCE Security: partner name does not match target.
Severity:
30 : Severe error

Explanation:
The DCE Security exit was requested to perform a trusted channel check: target partner name <insert_4> was specified in the SCYDATA field of channel <insert_3>. The actual partner name associated with channel <insert_3> was <insert_5>, so the security exit suppressed the channel.

Response:
Examine the channel definition of channel <insert_3> and alter it so that the relevant name on the partner system matches that specified in the SCYDATA field.

AMQ9580DCE Security: invalid message received.
Severity:
30 : Severe error

Explanation:
An IBM-supplied DCE exit on channel <insert_3> received a message that was not generated by a matching exit, or was not the expected type of message. The header.mechanism field had value <insert_1>. The header.msgtype field had value <insert_2>. The name of the exit function in which the error was discovered is <insert_4>.

Response:
Make sure that the exits at both ends of the channel generate compatible flows.

AMQ9581DCE Security: wrong exit called.
Severity:
30 : Severe error

Explanation:
Exit <insert_4> on channel <insert_3> was called for use as a WebSphere MQ exit of the wrong type. DCE_SEC_SCY_CHANNELEXIT functions as a security exit; DCE_SEC_SRM_CHANNELEXIT functions as a send, receive or message exit. The ExitId parameter passed to the exit was <insert_1>.

Response:
Alter the exit definitions to ensure that exit <insert_4> is called correctly.

AMQ9582DCE Security: invalid exit function requested.
Severity:
30 : Severe error

Explanation:
Exit <insert_4> on channel <insert_3> was called with an invalid ExitReason (value <insert_1>).

Response:
Check that the exit is being run with a compatible release of WebSphere MQ base code. If not then correct it. If it is, contact your IBM support center for help.

AMQ9583The DCE security exit was not run.
Severity:
30 : Severe error

Explanation:
The DCE_SEC_SRM_CHANNELEXIT exit was called on channel <insert_3>; the value of pContext->mechanism (<insert_1>) passed was not valid.

Response:
This is probably because the DCE_SEC_SRM_CHANNELEXIT exit has been called without first calling the DCE_SEC_SCY_CHANNELEXIT security exit. Alter the system so that either both or neither are run.

AMQ9584DCE Security: message too short.
Severity:
30 : Severe error

Explanation:
The DCE_SEC_SRM_CHANNELEXIT receive or message exit was called on channel <insert_3> to process an incoming message. The pDataLength parameter supplied to the exit indicated that the message received was too short to be a valid message for the relevant exit. The *pDataLength value was <insert_1>.

Response:
Configure the system so that compatible send/receive/message exits are run at both ends of the channel.

AMQ9585Maximum number of channel initiators reached.
Severity:
30 : Severe error

Explanation:
The maximum number of channels initiators that can be in use simultaneously has been reached. The number of permitted channel initiators is a configurable parameter in the queue manager configuration file.

Response:
Wait for one or more channel initiators to close and retry the operation or modify the configuration file to allow more initiators and restart the Queue Manager.

AMQ9586Program cannot create queue manager object.
Severity:
30 : Severe error

Explanation:
The attempt to create object <insert_4> on queue manager <insert_5> failed with reason code <insert_1>.

Response:
Use the standard facilities supplied with your system to record the problem identifier. Contact your IBM support center.

AMQ9587Program cannot open queue manager object.
Severity:
30 : Severe error

Explanation:
The attempt to open object <insert_4> on queue manager <insert_5> failed with reason code <insert_1>.

Response:
Use the standard facilities supplied with your system to record the problem identifier. Contact your IBM support center.

AMQ9588Program cannot update queue manager object.
Severity:
30 : Severe error

Explanation:
The attempt to update object <insert_4> on queue manager <insert_5> failed with reason code <insert_1>.

Response:
Use the standard facilities supplied with your system to record the problem identifier. Contact your IBM support center.

AMQ9589Program cannot query queue manager object.
Severity:
30 : Severe error

Explanation:
The attempt to query object <insert_4> on queue manager <insert_5> failed with reason code <insert_1>.

Response:
Use the standard facilities supplied with your system to record the problem identifier. Contact your IBM support center.

AMQ9590Program cannot close queue manager object.
Severity:
30 : Severe error

Explanation:
The attempt to close object <insert_4> on queue manager <insert_5> failed with reason code <insert_1>.

Response:
Use the standard facilities supplied with your system to record the problem identifier. Contact your IBM support center.

AMQ9591Program cannot prepare queue manager object.
Severity:
30 : Severe error

Explanation:
The attempt to prepare object <insert_4> on queue manager <insert_5> failed with reason code <insert_1>.

Response:
Use the standard facilities supplied with your system to record the problem identifier. Contact your IBM support center.

AMQ9592Program cannot resolve queue manager object.
Severity:
30 : Severe error

Explanation:
The attempt to resolve object <insert_4> on queue manager <insert_5> failed with reason code <insert_1>.

Response:
Use the standard facilities supplied with your system to record the problem identifier. Contact your IBM support center.

AMQ9593Program cannot delete queue manager object.
Severity:
30 : Severe error

Explanation:
The attempt to delete object <insert_4> on queue manager <insert_5> failed with reason code <insert_1>.

Response:
Use the standard facilities supplied with your system to record the problem identifier. Contact your IBM support center.

AMQ9594Usage: runmqfmt [filename].
Severity:
0 : Information

Explanation:
Syntax for the usage of runmqfmt.

Response:
None.

AMQ9595Usage: endmqlsr [-w] [-m QMgrName]
Severity:
10 : Warning

Explanation:
The correct usage is shown.

Response:
Correct the parameters passed to the endmqlsr program and retry the operation.

AMQ9596Queue Manager <insert_3> still running
Severity:
30 : Severe error

Explanation:
The requested operation can not complete because queue manager <insert_3> is still running.

Response:
End the queue manager and retry the operation.

AMQ9597No WebSphere MQ listeners for Queue Manager <insert_3>.
Severity:
0 : Information

Explanation:
No listener processes were found in the system for Queue Manager <insert_3>.

Response:
None.

AMQ9598<insert_1> WebSphere MQ listeners will end shortly.
Severity:
0 : Information

Explanation:
<insert_1> listeners detected in the system are scheduled for shutdown.

Response:
None.

AMQ9599Program could not open queue manager object.
Severity:
30 : Severe error

Explanation:
The attempt to open either the queue or queue manager object <insert_4> on queue manager <insert_5> by user <insert_3> failed with reason code <insert_1>.

Response:
Ensure that the queue is available and retry the operation. If the message is from a remote Queue Manager, check the Message Channel Agent User Identifier has the correct authority.

AMQ9601Program could not inquire on queues on this queue manager.
Severity:
30 : Severe error

Explanation:
The WebSphere MQ clustering repository program was attempting to find out about the queues on queue manager <insert_3>. One of the calls failed with reason code <insert_1>. The repository command was backed out and the repository process went into a timed wait.

Response:
Correct the error. When the repository process restarts it processes the backed out command again and continues.

AMQ9602Maximum number of channel processes reached.
Severity:
30 : Severe error

Explanation:
The channel can not start because the number of channel processes has already reached the maximum allowable value. The maximum number of channels processes is configured as <insert_1>. This value is a configurable parameter in the queue manager configuration file.

Response:
Wait for some of the operating channels to close. Retry the operation when some channels are available.

AMQ9603Error accessing the process pool shared segment.
Severity:
30 : Severe error

Explanation:
The program could not access the process pool shared segment

Response:
A value of <insert_1> was returned from the subsystem when an attempt was made to access the Channel process pool shared memory. Contact the systems administrator, who should examine the log files to determine why the program was unable to access the process pool shared segment.

AMQ9604Channel <insert_3> terminated unexpectedly
Severity:
30 : Severe error

Explanation:
The process or thread executing channel <insert_3> is no longer running. The check process system call returned <insert_1> for process <insert_2>.

Response:
No immediate action is required because the channel entry has been removed from the list of running channels. Inform the system administrator who should examine the operating system procedures to determine why the channel process has terminated.

AMQ9605<insert_1> WebSphere MQ listeners have been ended.
Severity:
0 : Information

Explanation:
<insert_1> listeners detected in the system have been ended.

Response:
None.

AMQ9606A WebSphere MQ listener has ended.
Severity:
0 : Information

Explanation:
One listener detected in the system has been ended.

Response:
None.

AMQ9608Remote resources in recovery
Severity:
30 : Severe error

Explanation:
Channel <insert_3> could not establish a successful connection with the remote Queue Manager because resources are being recovered.

Response:
Restart the channel at a later time. If the problem persists then examine the error logs of the remote Queue Manager to see the full explanation of the cause of the problem.

AMQ9610AMQ<insert_1> messages suppressed
Severity:
0 : Information

Explanation:
<insert_2> messages of type AMQ<insert_1> were suppressed

Response:
Message suppression is controlled by MQ_CHANNEL_SUPPRESS_MSGS and MQ_CHANNEL_SUPPRESS_INTERVAL environment variables.

AMQ9611Rebuild Client Channel Table - program completed normally
Severity:
0 : Information

Explanation:
Rebuild Client Channel Table program completed normally.

Response:
None.

AMQ9612<insert_1> WebSphere MQ listeners could not be ended.
Severity:
0 : Information

Explanation:
The request to the end the WebSphere MQ listeners for specified Queue Manager was completed however <insert_1> listeners could not be stopped. Reasons why listener may not be stopped are: 
The listener process contains channels which are still active.

Response:
Active channels may be stopped using the 'STOP CHANNEL' command or by ending the Queue Manager, and reissuing the end-listener request.

AMQ9614 (iSeries)Certificate is not signed by a trusted Certificate Authority.
Severity:
0 : Information

Explanation:
The attempt to start channel <insert_3> failed because the certificate used in the SSL handshake is not signed by a Certificate Authority (CA) listed in the certificate trust list for this queue manager. This error occurs when the SSL key repository for the queue manager is specified as '*SYSTEM' and the application definition in Digital Certificate Manager has been modified to specify a CA trust list.

Response:
Use Digital Certificate Manager to add the required Certificate Authority (CA) certificates to the application definitions CA trust list.

AMQ9615 (iSeries)Queue Manager is not registered with DCM.
Severity:
0 : Information

Explanation:
The attempt to start channel <insert_3> failed because the queue manager is not registered as a SSL server application with Digital Certificate Manager (DCM). This error occurs when the SSL key repository for the queue manager is specified as '*SYSTEM' but WebSphere MQ cannot register the queue manager as an SSL server application with DCM, or alternatively when the application definition for the queue manager has been manually removed from DCM.

Response:
Attempt to re-register the queue manager with Digital Certificate Manager by issuing CHGMQM SSLKEYR(*SYSTEM). If this is unsuccessful you may need to manually add the application definition through Digital Certificate Manager, see the WebSphere MQ Security manual for more details.

AMQ9616The CipherSpec proposed is not enabled on the SSL server.
Severity:
30 : Severe error

Explanation:
The SSL subsystem at the SSL server end of a channel been configured in such a way that it has rejected the CipherSpec proposed by an SSL client. This rejection occured during the SSL handshake (i.e. it happened before the proposed CipherSpec was compared with the CipherSpec in the SSL server channel definition). This error most commonly occurs when the choice of acceptable CipherSpecs has been limited by setting the SSLFipsRequired attribute on the SSL server queue manager while the SSL client is trying to connect with a CipherSpec which is not FIPS-certified on the SSL server. The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'. The channel did not start.

Response:
Analyse why the proposed CipherSpec was not enabled on the SSL server. Alter the client CipherSpec, or reconfigure the SSL server to accept the original client CipherSpec. Restart the channel.

AMQ9617Parameter requesting FIPS has an invalid value.
Severity:
30 : Severe error

Explanation:
An SSL channel running on an MQ client has failed to start. This is because the value specified for the MQSSLFIPS environment variable, or in the MQSCO FipsRequired field, is invalid. The value specified was "<insert_3>"; valid values are "YES" or "NO".

Response:
Set the value specified for the MQSSLFIPS environment variable, or in the MQSCO FipsRequired field, to "YES" or "NO". Restart the channel.

AMQ9618SSLCRLNL attribute points to a namelist with no names.
Severity:
30 : Severe error

Explanation:
An SSL channel has failed to start because the SSLCRLNL queue manager attribute points to a namelist with an empty list of names.

Response:
If CRL checking is required, set up the namelist referenced by SSLCRLNL with a non-empty list of authentication information object names. If no CRL checking is required, clear the SSLCRLNL queue manager attribute. Restart the failing channel.

AMQ9619SSL cannot be run from an unthreaded HP-UX MQ client.
Severity:
30 : Severe error

Explanation:
On HP-UX, SSL cannot be run from a WebSphere MQ client which was linked with the unthreaded client libraries.

Response:
Either relink your client application with the threaded client libraries, or do not attempt to use SSL from this application.

AMQ9620Internal error on call to SSL function on channel <insert_3>.
Severity:
30 : Severe error

Explanation:
An error indicating a software problem was returned from a function which is used to provide SSL support. The error code returned was <insert_1>. The function call was <insert_4>. The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'. The channel did not start.

Response:
Collect the items listed in the 'Problem determination' section of the System Administration manual and contact your IBM support center.

AMQ9620 (iSeries)Unexpected SSL error on call to <insert_4>.
Severity:
0 : Information

Explanation:
An unexpected SSL error was returned from function <insert_4> for channel <insert_3>. The error code returned was <insert_1>. GSKit error codes are documented in the MQ manuals and also in the GSKSSL member of the H file in library QSYSINC.

Response:
Collect the items listed in the 'Problem determination' section of the System Administration manual and contact your IBM support center.

AMQ9621Error on call to SSL function ignored on channel <insert_3>.
Severity:
10 : Warning

Explanation:
An error indicating a software problem was returned from a function which is used to provide SSL support. The error code returned was <insert_1>. The function call was <insert_4>. The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'. This error is not regarded as sufficiently serious to interrupt channel operation; channel operation was not affected.

Response:
None.

AMQ9622AUTHINFO object <insert_3> does not exist.
Severity:
30 : Severe error

Explanation:
A channel or channel process has failed to start because the namelist of CRL AUTHINFO objects includes the name <insert_3>, but no AUTHINFO object of that name exists.

Response:
Ensure all the names in the namelist specified on the SSLCRLNL queue manager attribute correspond to AUTHINFO objects which are to be used on the SSL channels. Restart the failing channel or channel process.

AMQ9623Error inquiring on AUTHINFO object <insert_3>.
Severity:
30 : Severe error

Explanation:
A channel or channel process has failed to start because reason code <insert_1> was returned when an inquire was performed on AUTHINFO object <insert_3>.

Response:
Look at the MQRC_ values in the WebSphere MQ Application Programming Reference to determine the meaning of reason code <insert_1>, correct the error, and restart the failing channel or channel process.

AMQ9624AUTHINFO object <insert_3> is not of type CRLLDAP.
Severity:
30 : Severe error

Explanation:
A channel or channel process has failed to start because one of the AUTHINFO objects specified in the SSLCRLNL namelist is not of AUTHTYPE CRLLDAP. Instead the type value is <insert_1>.

Response:
Only include CRLLDAP AUTHINFO objects in the namelist specified on the SSLCRLNL queue manager attribute. Restart the channel or channel process.

AMQ9625AUTHINFO object <insert_3> was specified with an invalid CONNAME.
Severity:
30 : Severe error

Explanation:
A channel or channel process has failed to start because one of the AUTHINFO objects specified in the SSLCRLNL namelist has an invalid CONNAME parameter. The invalid value is <insert_4>.

Response:
Correct the invalid parameter. Restart the channel or channel process.

AMQ9626Channel hanging while initializing SSL.
Severity:
30 : Severe error

Explanation:
The current channel cannot start because another channel is hanging while initializing the SSL subsystem.

Response:
Investigate the reason for the hang on the other channel. Once this is rectified, restart this channel.

AMQ9627The path and stem name for the SSL key repository have not been specified.
Severity:
30 : Severe error

Explanation:
The directory path and file stem name for the SSL key repository have not been specified. On a MQ client system there is no default location for this file. SSL connectivity is therefore impossible as this file cannot be accessed.

Response:
Use the MQSSLKEYR environment variable or MQCONNX API call to specify the directory path and file stem name for the SSL key repository.

AMQ9628An LDAP server containing CRLs was specified with an invalid CONNAME.
Severity:
30 : Severe error

Explanation:
The WebSphere MQ client has failed to connect because an invalid CONNAME was found for one of the LDAP servers containing CRLs. The invalid value is <insert_3>.

Response:
Correct the invalid parameter. If the LDAP details were defined on a queue manager system, regenerate the client definitions. Reconnect.

AMQ9629Bad SSL cryptographic hardware parameters.
Severity:
30 : Severe error

Explanation:
The following string was supplied to specify or control use of SSL cryptographic hardware: <insert_4>. This string does not conform to any of the MQ SSL cryptographic parameter formats. The channel is <insert_3>. The channel did not start.

Response:
Correct your SSL cryptographic hardware parameters and restart the channel.

AMQ9630An expired SSL certificate was loaded.
Severity:
30 : Severe error

Explanation:
An SSL certificate that was loaded was not corrupt, but failed validation checks on its date fields. The certificate has either expired, or its date is not valid yet (that is, the from date is later than today), or the validity date range is incorrect (for example, the to date is earlier than the from date).

Response:
Ensure that the specified SSL certificate has a valid expiry date.

AMQ9631The CipherSpecs on the two ends of channel <insert_3> do not match.
Severity:
30 : Severe error

Explanation:
There is a mismatch between the CipherSpecs on the local and remote ends of channel <insert_3>. The channel will not run until this mismatch is resolved.

Response:
Change the channel definitions for <insert_3> so the two ends have matching CipherSpecs and restart the channel.

AMQ9631 (iSeries)The CipherSpecs at the ends of channel <insert_3> do not match.
Severity:
30 : Severe error

Explanation:
There is a mismatch between the CipherSpecs on the local and remote ends of channel <insert_3>. The channel will not run until this mismatch is resolved. The local CipherSpec is <insert_4> and the remote CipherSpec is <insert_5>.

Response:
Change the channel definition for <insert_3> so that both ends have matching CipherSpecs and restart the channel.

AMQ9633Bad SSL certificate for channel <insert_3>.
Severity:
30 : Severe error

Explanation:
A certificate encountered during SSL handshaking is regarded as bad for one of the following reasons: 
(a) it was formatted incorrectly and could not be validated, or 
(b) it was formatted correctly but failed validation against the Certification Authority (CA) root and other certificates held on the local system, or 
(c) it was found in a Certification Revocation List (CRL) on an LDAP server. 
(d) a CRL was specified but the CRL could not be found on the LDAP server. 
The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'. The channel did not start.

Response:
Check which of the three possible causes applies on your system. Correct the error, and restart the channel.

AMQ9634SSL security context expired.
Severity:
30 : Severe error

Explanation:
During an SSL operation to encrypt or decrypt a secured message, the SSL security context, which is used to secure communications and was previously established with the remote party, has expired because the remote party has shut down. The secured message has not been encrypted or decrypted. This failure has closed WebSphere MQ channel name <insert_3>. If the name is '????', the name is unknown. The SSL operation was <insert_4> and its completion code was <insert_5>.

Response:
Determine why the remote party has shut down and if necessary re-start the channel. The shut down might be the result of controlled termination by a system administrator, or the result of an unexpected termination due to an error. The SSL operation is described in the Windows Schannel reference manual.

AMQ9635Channel <insert_3> did not specify a valid CipherSpec.
Severity:
30 : Severe error

Explanation:
Channel <insert_3> did not specify a valid CipherSpec.

Response:
Change channel <insert_3> to specify a valid CipherSpec.

AMQ9635 (iSeries)Channel <insert_3> did not specify a valid CipherSpec.
Severity:
30 : Severe error

Explanation:
Channel <insert_3> did not specify a valid CipherSpec, or it specified a CipherSpec that is not available from the IBM Cryptographic Access Provider product installed on this machine. CipherSpecs that use 128-bit encryption algorithms are only available in 5722-AC3 (128-bit) IBM Cryptographic Access Provider.

Response:
Change channel <insert_3> to specify a valid CipherSpec that is available from the IBM Cryptographic Access Provider product installed on this machine. Check that the CipherSpec you are using is available on this machine in either the 5722-AC2 (56-bit) IBM Cryptographic Access Provider or 5722-AC3 (128-bit) IBM Cryptographic Access Provider licensed program.

AMQ9636SSL distinguished name does not match peer name, channel <insert_3>.
Severity:
30 : Severe error

Explanation:
The distinguished name, <insert_4>, contained in the SSL certificate for the remote end of the channel does not match the local SSL peer name for channel <insert_3>. The distinguished name at the remote end must match the peer name specified (which can be generic) before the channel can be started.

Response:
If this remote system should be allowed to connect, either change the SSL peer name specification for the local channel so that it matches the distinguished name in the SSL certificate for the remote end of the channel, or obtain the correct certificate for the remote end of the channel. Restart the channel.

AMQ9637Channel is lacking a certificate.
Severity:
30 : Severe error

Explanation:
The channel is lacking a certificate to use for the SSL handshake. The channel name is <insert_3> (if '????' it is unknown at this stage in the SSL processing). The channel did not start.

Response:
Make sure the appropriate certificates are correctly configured in the key repositories for both ends of the channel. 
If you have migrated from WebSphere MQ V5.3 to V6, it is possible that the missing certificate is due to a failure during SSL key repository migration. Check the relevant error logs. If these show that an orphan certificate was encountered then you should obtain the relevant missing certification authority (signer) certificates and then import these and the orphan certificate into the WebSphere MQ V6 key repository, and then re-start the channel.

AMQ9638SSL communications error for channel <insert_3>.
Severity:
30 : Severe error

Explanation:
An unexpected SSL communications error occurred for a channel, as reported in the preceding messages. The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'. The channel did not start.

Response:
Investigate the problem reported in the preceding messages. Review the local and remote console logs for reports of network errors. Correct the errors and restart the channel.

AMQ9639Remote channel <insert_3> did not specify a CipherSpec.
Severity:
30 : Severe error

Explanation:
Remote channel <insert_3> did not specify a CipherSpec when the local channel expected one to be specified. The channel did not start.

Response:
Change the remote channel <insert_3> to specify a CipherSpec so that both ends of the channel have matching CipherSpecs.

AMQ9640SSL invalid peer name, channel <insert_3>, attribute <insert_5>.
Severity:
30 : Severe error

Explanation:
The SSL peer name for channel <insert_3> includes a distinguished name attribute key <insert_5> which is invalid or unsupported. The channel did not start.

Response:
Correct the SSL peer name for the channel. Restart the channel.

AMQ9641Remote CipherSpec error for channel <insert_3>.
Severity:
30 : Severe error

Explanation:
The remote end of channel <insert_3> has had a CipherSpec error. The channel did not start.

Response:
Review the error logs on the remote system to discover the problem with the CipherSpec.

AMQ9642No SSL certificate for channel <insert_3>.
Severity:
30 : Severe error

Explanation:
The channel <insert_3> did not supply a certificate to use during SSL handshaking, but a certificate is required by the remote queue manager. The channel did not start.

Response:
Ensure that the key repository of the local queue manager or MQ client contains an SSL certificate which is associated with the queue manager or client. Alternatively, if appropriate, change the remote channel definition so that its SSLCAUTH attribute is set to OPTIONAL and it has no SSLPEER value set. 
If you have migrated from WebSphere MQ V5.3 to V6, it is possible that the missing certificate is due to a failure during SSL key repository migration. Check the relevant error logs. If these show that an orphan certificate was encountered then you should obtain the relevant missing certification authority (signer) certificates and then import these and the orphan certificate into the WebSphere MQ V6 key repository, and then re-start the channel.

AMQ9642 (iSeries)No SSL certificate for channel <insert_3>.
Severity:
0 : Information

Explanation:
The channel <insert_3> did not supply a certificate to use during SSL handshaking, but a certificate is required by the remote queue manager. The channel did not start.

Response:
If the SSL key repository for the queue manager has been specified as '*SYSTEM' ensure that a certificate has been associated with the application description for the queue manager in Digital Certificate Manager. Alternatively, if appropriate, change the remote channel definition so that its SSLCAUTH attribute is set to OPTIONAL and it has no SSLPEER value set.

AMQ9643Remote SSL peer name error for channel <insert_3>.
Severity:
30 : Severe error

Explanation:
The remote end of channel <insert_3> has had an SSL peer name error. The channel did not start.

Response:
Review the error logs on the remote system to discover the problem with the peer name.

AMQ9645Correctly labelled SSL certificate missing on channel <insert_3>.
Severity:
30 : Severe error

Explanation:
The key database file in use has not been set up with a correctly labelled SSL certificate. The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'. The channel did not start.

Response:
Add a correctly labelled SSL certificate to the current key database file. Restart the channel.

AMQ9646Channel <insert_3> could not connect to any LDAP CRL servers.
Severity:
30 : Severe error

Explanation:
LDAP Certification Revocation List (CRL) servers were specified but a connection could not be established to any of them. The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'. The channel did not start.

Response:
Check that the LDAP CRL server specifications are correct. If they are, check that the servers are running and that the networking to access them is working correctly. Fix any errors found and restart the channel.

AMQ9647I/O error on SSL key repository.
Severity:
30 : Severe error

Explanation:
An I/O error was encountered when attempting to read the SSL key repository. The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'. The channel did not start.

Response:
Analyse why there is a I/O problem when reading the key repository. Fix the error if one is found, or it may be a temporary problem. Restart the channel.

AMQ9648The SSL key repository has an invalid internal format.
Severity:
30 : Severe error

Explanation:
The SSL key repository has an invalid internal format. The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'. The channel did not start.

Response:
Recreate the SSL key repository and restart the channel.

AMQ9649The SSL key repository contains duplicate keys.
Severity:
30 : Severe error

Explanation:
The SSL key repository contains two or more entries with the same key. The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'. The channel did not start.

Response:
Use your key management tool to remove the duplicate keys. Restart the channel.

AMQ9650The SSL key repository contains entries with duplicate labels.
Severity:
30 : Severe error

Explanation:
The SSL key repository contains two or more entries with the same label. The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'. The channel did not start.

Response:
Use your key management tool to remove the duplicate entries. Restart the channel.

AMQ9651The SSL key repository is corrupt or has a bad password.
Severity:
30 : Severe error

Explanation:
The SSL key repository has become corrupted or its password id is incorrect. The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'. The channel did not start.

Response:
Use your key management tool to recreate the key repository with a new password. Restart the channel.

AMQ9652The remote SSL certificate has expired.
Severity:
30 : Severe error

Explanation:
The SSL certificate used by MQ on the remote end of the channel has expired. The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'. The channel did not start.

Response:
Use your key management tool to provide MQ with a current SSL certificate on the remote end of the channel. Restart the channel.

AMQ9653An SSL trace file could not be opened.
Severity:
10 : Warning

Explanation:
An SSL trace file could not be opened. The SSL trace files are created in directory /var/mqm/trace and have names AMQ.SSL.TRC and AMQ.SSL.TRC.1. The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'. This error is not regarded as sufficiently serious to interrupt channel operation; channel operation was not affected.

Response:
Check that you have a directory called /var/mqm/trace and that the userid under which WebSphere MQ runs has permissions and space to create and open a file in that directory. Fix the problem and you will get SSL trace output.

AMQ9654An invalid SSL certificate was received from the remote system.
Severity:
30 : Severe error

Explanation:
An SSL certificate received from the remote system was not corrupt but failed validation checks on something other than its ASN fields and date. The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'. The channel did not start.

Response:
Ensure that the remote system has a valid SSL certificate. Restart the channel.

AMQ9655Problem loading GSKit SSL support.
Severity:
30 : Severe error

Explanation:
MQ SSL support is provided on this platform using a component called GSKit which is installed as part of MQ. GSKit had an internal problem loading one if its dynamic link libraries. The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'. The channel did not start.

Response:
Uninstall MQ and reinstall. Restart the channel.

AMQ9656An invalid SSL certificate was received from the remote system.
Severity:
30 : Severe error

Explanation:
An SSL certificate received from the remote system was not corrupt but failed validation checks on its ASN fields. The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'. The channel did not start.

Response:
Ensure that the remote system has a valid SSL certificate. Restart the channel.

AMQ9657The key repository could not be opened (channel <insert_3>).
Severity:
30 : Severe error

Explanation:
The key repository could not be opened. The key repository either does not exist or has incorrect permissions associated with it. The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'. The channel did not start.

Response:
Ensure that the key repository you specify exists and that its permissions are such that the MQ process involved can read from it. Restart the channel.

AMQ9658An invalid SSL certificate has been encountered.
Severity:
30 : Severe error

Explanation:
An SSL certificate has been encountered which was not corrupt but which failed validation checks on its date fields. The certificate has either expired, or its date is not valid yet (i.e. the from date is later than today), or the validity date range is incorrect (e.g. the to date is earlier than the from date). The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'. The channel did not start.

Response:
Ensure that both the local and remote systems have valid, current SSL certificates. Restart the channel.

AMQ9659A failure occurred during SSL handshaking.
Severity:
30 : Severe error

Explanation:
During SSL handshaking, or associated activities, a failure occurred. The failure is <insert_4> and has caused WebSphere MQ channel name <insert_3> to be closed. If the name is '????' then the name is unknown.

Response:
Refer to prior message in the WebSphere MQ error log for information related to this problem.

AMQ9660SSL key repository: password stash file absent or unusable.
Severity:
30 : Severe error

Explanation:
The SSL key repository cannot be used because MQ cannot obtain a password to access it. Reasons giving rise to this error include: 
(a) the key database file and password stash file are not present in the location configured for the key repository, 
(b) the key database file exists in the correct place but that no password stash file has been created for it, 
(c) the files are present in the correct place but the userid under which MQ is running does not have permission to read them, 
(d) one or both of the files are corrupt. 
The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'. The channel did not start.

Response:
Ensure that the key repository variable is set to where the key database file is. Ensure that a password stash file has been associated with the key database file in the same directory, and that the userid under which MQ is running has read access to both files. If both are already present and readable in the correct place, delete and recreate them. Restart the channel.

AMQ9661Bad SSL data from peer on channel <insert_3>.
Severity:
30 : Severe error

Explanation:
An SSL channel has stopped because bad SSL data was received from the remote end of the channel. More detail on the nature of the corruption can be found from the GSKit return value of <insert_1> (the GSKit return values are documented in the MQ manuals). The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'.

Response:
Ensure you are connecting to a version of MQ which supports SSL at the remote end of the channel. Check your network between the two ends of the channel, and consider whether any possible causes of message corruption could be present. Fix any problems which may exist and restart the channel.

AMQ9661 (iSeries)Bad SSL data from peer on channel <insert_3>.
Severity:
0 : Information

Explanation:
An SSL channel has stopped because bad SSL data was received from the remote end of the channel. More detail on the nature of the corruption can be found from the GSKit return value of <insert_1> (the GSKit return values are documented in the MQ manuals and also in the GSKSSL member of the H file in library QSYSINC). The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'.

Response:
Ensure the remote queue manager and channel listener are running and that you are connecting to a version of MQ which supports SSL at the remote end of the channel. Check your network between the two ends of the channel, and consider whether any possible causes of message corruption could be present. Fix any problems which may exist and restart the channel.

AMQ9662SSL has encountered something it does not support.
Severity:
30 : Severe error

Explanation:
This error can arise for a number of reasons: 1) The platform does not support a given type of cryptographic hardware, e.g. Ncipher and Rainbow are/were not supported on the Linux/390 platform. 2) The cryptographic hardware cryptography has returned an error. 3) Unsupported X509 General Name format when checking the remote certificate. The GSKit SSL provider incorporated in MQ only supports formats rfc822, DNSName, directoryname, uniformResourceID, and IPAddress. The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'. The channel did not start.

Response:
Check that your cryptographic hardware is supported on your platform and test it to see that it is working correctly. Check that the remote certificates you are using conform to the X509 General Name formats listed. Fix the problem and restart the channel.

AMQ9663An invalid SSL certificate was received from the remote system.
Severity:
30 : Severe error

Explanation:
An SSL certificate received from the remote system failed validation checks on its signature. The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'. The channel did not start.

Response:
Ensure that the remote system has a valid SSL certificate. Restart the channel.

AMQ9664Bad userid for CRL LDAP server; SSL channel <insert_3>.
Severity:
30 : Severe error

Explanation:
Certification Revocation List (CRL) checking on an LDAP server or servers has been configured on the local MQ system. The userid information configured for the LDAP server or servers is incorrect. The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'. The channel did not start.

Response:
Check the userid information for the CRL LDAP server or servers you have configured locally. Correct any problems found and restart the channel.

AMQ9665SSL connection closed by remote end of channel <insert_3>.
Severity:
30 : Severe error

Explanation:
The SSL connection was closed by the remote end of the channel during the SSL handshake. The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'. The channel did not start.

Response:
Check the remote end of the channel for SSL-related errors. Fix them and restart the channel.

AMQ9666Error accessing CRL LDAP servers; SSL channel <insert_3>.
Severity:
30 : Severe error

Explanation:
CRL checking on LDAP servers has been configured on the local MQ system. An error was found when trying to access the CRL LDAP servers when validating a certificate from the remote system. Possible causes are: 
(a) cannot connect to any of the LDAP servers, or 
(b) invalid login user id or password for an LDAP server, or 
(c) the certificate issuer's Distinguished Name (DN) is not defined in the DIT of an LDAP server. 
The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'. The channel did not start.

Response:
Check access to the CRL LDAP server(s) you have configured locally. Put right any problems found and restart the channel.

AMQ9667Bad password for CRL LDAP server; SSL channel <insert_3>.
Severity:
30 : Severe error

Explanation:
Certification Revocation List (CRL) checking on an LDAP server or servers has been configured on the local MQ system. The password information configured for the LDAP server or servers is incorrect. The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'. The channel did not start.

Response:
Check the password information for the CRL LDAP server or servers you have configured locally. Correct any problems found and restart the channel.

AMQ9668The specified PKCS #11 shared library could not be loaded.
Severity:
30 : Severe error

Explanation:
A failed attempt was made to load the PKCS #11 shared library specified to MQ in the PKCS #11 driver path field of the GSK_PKCS11 SSL CryptoHardware parameter. The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'. The channel did not start.

Response:
Ensure that the PKCS #11 shared library exists and is valid at the location specified. Restart the channel.

AMQ9669The PKCS #11 token could not be found.
Severity:
30 : Severe error

Explanation:
The PKCS #11 driver failed to find the token specified to MQ in the PKCS #11 token label field of the GSK_PKCS11 SSL CryptoHardware parameter. The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'. The channel did not start.

Response:
Ensure that the PKCS #11 token exists with the label specified. Restart the channel.

AMQ9670PKCS #11 card not present.
Severity:
30 : Severe error

Explanation:
A PKCS #11 card is not present in the slot. The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'. The channel did not start.

Response:
Ensure that the correct PKCS #11 card is present in the slot. Restart the channel.

AMQ9671The PKCS #11 token password specified is invalid.
Severity:
30 : Severe error

Explanation:
The password to access the PKCS #11 token is invalid. This is specified to MQ in the PKCS #11 token password field of the GSK_PKCS11 SSL CryptoHardware parameter. The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'. The channel did not start.

Response:
Ensure that the PKCS #11 token password specified on GSK_PKCS11 allows access to the PKCS #11 token specified on GSK_PKCS11. Restart the channel.

AMQ9672An SSL security call failed.
Severity:
30 : Severe error

Explanation:
An SSPI call to the Secure Channel (Schannel) SSL provider failed. The failure has caused WebSphere MQ channel name <insert_3> to be closed. If the name is '????' then the name is unknown.

Response:
Consult the Windows Schannel reference manual to determine the meaning of status <insert_5> for SSPI call <insert_4>. Correct the failure and if necessary re-start the channel.

AMQ9673SSL client handshaking failed.
Severity:
30 : Severe error

Explanation:
During an SSL client's handshaking, an SSPI call to the Secure Channel (Schannel) SSL provider failed. The failure has caused WebSphere MQ channel name <insert_3> to be closed. If the name is '????' then the name is unknown.

Response:
Consult the Windows Schannel reference manual to determine the meaning of status <insert_4> for SSPI call <insert_5>. Correct the failure and if necessary re-start the channel.

AMQ9674An unknown error occurred during an SSL security call.
Severity:
30 : Severe error

Explanation:
An unknown error occurred during an SSPI call to the Secure Channel (Schannel) SSL provider. The error may be due to a Windows SSL problem or to a general Windows problem or to invalid WebSphere MQ data being used in the call. The WebSphere MQ error recording routine has been called. The error has caused WebSphere MQ channel name <insert_3> to be closed. If the name is '????' then the name is unknown.

Response:
Consult the Windows Schannel reference manual to determine the meaning of status <insert_5> for SSPI call <insert_4>. If the problem can be resolved using the manual, correct the failure and if necessary re-start the channel. If the problem cannot be resolved then use the standard facilities supplied with your system to record the problem identifier and save the generated output files, and then contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ9675The requested certificate could not be found.
Severity:
30 : Severe error

Explanation:
A request for a certificate identified as <insert_4> <insert_5> in the store <insert_3> has failed, because the certificate could not be found. The Windows error code has been set to <insert_1>. The WebSphere MQ error recording routine has been called.

Response:
Consult the Windows reference manual to determine the meaning of error <insert_1> if this value is non-zero. Check to see whether the specified certificate has been copied to the correct certificate store and has not been deleted. Use the amqmcert command line utility or the WebSphere MQ Explorer administration application to configure certificate store for use with WebSphere MQ. If the problem cannot be resolved then use the standard facilities supplied with your system to record the problem identifier and save the generated output files, and then contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ9676The Windows cryptographic services library could not be loaded.
Severity:
30 : Severe error

Explanation:
WebSphere MQ requires crypt32.dll to be available in order to carry out cryptographic functionality. The attempt to load this library returned the Windows error code <insert_1>. The WebSphere MQ error recording routine has been called.

Response:
Consult the Windows reference manual to determine the meaning of error code <insert_1>. Check that the crypt32.dll file is available and not corrupt. If the problem cannot be resolved then use the standard facilities supplied with your system to record the problem identifier and save the generated output files, and then contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ9677The Windows security services library could not be loaded.
Severity:
30 : Severe error

Explanation:
WebSphere MQ requires <insert_3> to be available in order to run or configure SSL functionality. The attempt to load this library returned the Windows error code <insert_1>. The WebSphere MQ error recording routine has been called.

Response:
Consult the Windows reference manual to determine the meaning of error code <insert_1>. Check that the <insert_3> file is available and not corrupt. If the problem cannot be resolved then use the standard facilities supplied with your system to record the problem identifier and save the generated output files, and then contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ9678The certificate <insert_4>/<insert_5> already exists in the store <insert_3>.
Severity:
10 : Warning

Explanation:
The certificate store <insert_3> already contains the specified certificate, identified by the issuer name of <insert_4>, serial number <insert_5>. The existing certificate has not been replaced.

AMQ9679The certificate store <insert_3> could not be opened.
Severity:
30 : Severe error

Explanation:
The certificate store <insert_3> could not be opened, and failed with the Windows error code <insert_1>. The WebSphere MQ error recording routine has been called.

Response:
Consult the Windows reference manual to determine the meaning of error <insert_1> if this value is non-zero. Check that either your MQSSLKEYR environment variable (for client connections), or SSLKEYR queue manager attribute (for WebSphere MQ queue managers) has been defined correctly, and that the file path specified is valid. If the problem cannot be resolved then use the standard facilities supplied with your system to record the problem identifier and save the generated output files, and then contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ9680A problem was encountered with the specified certificate file.
Severity:
30 : Severe error

Explanation:
A problem occurred when attempting to read the certificate from the file <insert_3>. The file may be corrupt or incorrectly formatted. The Windows error code reported is <insert_1>. The WebSphere MQ error recording routine has been called.

Response:
Ensure that the certificate file is valid and complete, and in one of the file formats supported by WebSphere MQ. If the problem cannot be resolved then use the standard facilities supplied with your system to record the problem identifier and save the generated output files, and then contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ9681The requested functionality is not supported on this system.
Severity:
30 : Severe error

Explanation:
An SSL function was attempted that is not supported on this system. a) importing pfx format certificate files with private key data is only supported on Windows 2000 or greater. b) a the security library installed on your system is not of the correct level and does not contain the pre-requisite functions. On pre Windows 2000 systems, Internet Explorer 4.1 or greater must be installed. The WebSphere MQ error recording routine has been called.

Response:
If pre-requisite software is missing, please install the necessary levels of software and retry the operation. If the problem cannot be resolved then use the standard facilities supplied with your system to record the problem identifier and save the generated output files, and then contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ9682The WebSphere MQ SSL library has not been initialized.
Severity:
30 : Severe error

Explanation:
The WebSphere MQ SSL library 'amqcssln.dll' has been called without it first being initialized by the calling process.

Response:
Ensure that the initialization function has been called prior to issuing any amqcssln function calls.

AMQ9683The private key data for this certificate is not exportable.
Severity:
30 : Severe error

Explanation:
An attempt has been made to export the private key data from a certificate, but the properties of the certificate will not allow this. WebSphere MQ needs to be able to export private key data when copying personal certificates between certificate stores. The Windows cryptographic API returned the error code <insert_1>.

Response:
When requesting the certificate from the certificate authority, the private key data must be marked as exportable to enable WebSphere MQ to be able to copy the certificate and private key data into a WebSphere MQ store. The certificate file may need to be requested again to resolve this problem. If the problem cannot be resolved then use the standard facilities supplied with your system to record the problem identifier and save the generated output files, and then contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ9684A problem occurred while attempting to access the certificate's properties.
Severity:
30 : Severe error

Explanation:
The certificate issued by <insert_3> with serial number <insert_4>, or it's private key data, appears to be unusable and may be corrupt. The Windows return code <insert_1> was generated when attempting to use this certificate. The WebSphere MQ error recording routine has been called.

Response:
Consult the Windows reference manual to determine the meaning of error <insert_1>. Check that the certificate is valid and has not been corrupted. If it is possible that the certificate or private key data is corrupt, try to remove the certificate from your system and re-import it. If the problem cannot be resolved then use the standard facilities supplied with your system to record the problem identifier and save the generated output files, and then contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ9685A problem occured while accessing the registry.
Severity:
30 : Severe error

Explanation:
An error occured while attempting to load or unload the personal registry hive (HKEY_LOCAL_USER) for the user who launched this process. The WebSphere MQ error recording routine has been called.

Response:
If the problem cannot be resolved then use the standard facilities supplied with your system to record the problem identifier and save the generated output files, and then contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ9686An unexpected error occured while attempting to manage a certificate store.
Severity:
30 : Severe error

Explanation:
The Windows cryptographic API returned error code <insert_1> when calling the function <insert_3> for certificate store <insert_4>. The error may be due to a certificate store problem or to a general Windows problem or to a problem with a certificate in the store. The WebSphere MQ error recording routine has been called.

Response:
Consult the Windows reference manual to determine the meaning of error <insert_1>. Check that the certificate store is valid and not corrupt. If the problem cannot be resolved then use the standard facilities supplied with your system to record the problem identifier and save the generated output files, and then contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ9687The pfx password provided is invalid.
Severity:
30 : Severe error

Explanation:
The password supplied for importing or copying the certificate is incorrect, and the operation could not be completed.

Response:
Make sure the password is correct and try again. If the password has been forgotten or lost, the certificate will need to be regenerated or exported from the original source.

AMQ9688The private key data for this certificate is unavailable.
Severity:
30 : Severe error

Explanation:
The private key data associated with this certificate is reported as being present on the system, but has failed, returning the Windows error code <insert_1>. The WebSphere MQ error recording routine has been called.

Response:
Consult the Windows reference manual to determine the meaning of error code <insert_1>. If the problem can be resolved using the manual, correct the failure and if necessary re-try the operation. If the problem cannot be resolved then use the standard facilities supplied with your system to record the problem identifier and save the generated output files, and then contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ9689An unknown error occurred deleting the store <insert_3>.
Severity:
30 : Severe error

Explanation:
The WebSphere MQ certificate store for queue manager <insert_3> could not be deleted. The filename for the certificate store is <insert_4>. The Windows error code has been set to <insert_1>. The WebSphere MQ error recording routine has been called.

Response:
Consult the Windows reference manual to determine the meaning of error <insert_1>. If the problem can be resolved using the manual, correct the failure and if necessary re-try the operation. Check that the store file exists and that other processes (such as queue managers) that may be accessing the store are not running. If the problem cannot be resolved then use the standard facilities supplied with your system to record the problem identifier and save the generated output files, and then contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ9690The public key in the issuer's certificate has failed to validate the subject certificate.
Severity:
30 : Severe error

Explanation:
The public key in the issuer's certificate (CA or signer certificate), is used to verify the signature on the subject certificate assigned to channel <insert_3>. This verification has failed, and the subject certificate therefore cannot be used. The WebSphere MQ error recording routine has been called.

Response:
Check that the issuer's certificate is valid and available, and that it is up to date. Verify with the certificate's issuer that the subject certificate and issuer certificate should still be valid. If the problem cannot be resolved then use the standard facilities supplied with your system to record the problem identifier and save the generated output files, and then contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ9691The WebSphere MQ MQI library could not be loaded.
Severity:
30 : Severe error

Explanation:
The library file <insert_3> is expected to be available on your system, but attempts to load it have failed with Windows return code <insert_1>. The WebSphere MQ error recording routine has been called.

Response:
Ensure that the WebSphere MQ <insert_3> library file exists and is available on your system. Consult the Windows reference manual to determine the meaning of error code <insert_1>. If the problem cannot be resolved then use the standard facilities supplied with your system to record the problem identifier and save the generated output files, and then contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ9692The SSL library has already been initialized.
Severity:
20 : Error

Explanation:
The SSL library has already been initialized once for this process, any changes to SSL attributes will not take affect, and the original values will remain in force.

Response:
If the problem cannot be resolved then use the standard facilities supplied with your system to record the problem identifier and save the generated output files, and then contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ9693The password provided for the LDAP server is incorrect.
Severity:
30 : Severe error

Explanation:
One or more of the LDAP servers used for providing CRL information to WebSphere MQ has rejected a login attempt because the password provided is incorrect. The WebSphere MQ error recording routine has been called. The error has caused WebSphere MQ channel name <insert_3> to be closed. If the name is '????' then the name is unknown.

Response:
Ensure that the passwords specified in the AuthInfo objects are correct for each server name provided. If the problem cannot be resolved then use the standard facilities supplied with your system to record the problem identifier and save the generated output files, and then contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ9694The DN syntax provided for an LDAP search is invalid.
Severity:
30 : Severe error

Explanation:
The distinguished name provided in one or more AuthInfo object definitions is invalid, and the request to a CRL LDAP server has been rejected. The WebSphere MQ error recording routine has been called. The error has caused WebSphere MQ channel name <insert_3> to be closed. If the name is '????' then the name is unknown.

Response:
Verify that the details supplied in the AuthInfo object definitions for this channel are correct. If the problem cannot be resolved then use the standard facilities supplied with your system to record the problem identifier and save the generated output files, and then contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ9695The username provided for the LDAP server is incorrect.
Severity:
30 : Severe error

Explanation:
One or more of the LDAP servers used for providing CRL information to WebSphere MQ has rejected a login attempt because the username provided does not exist. The WebSphere MQ error recording routine has been called. The error has caused WebSphere MQ channel name <insert_3> to be closed. If the name is '????' then the name is unknown.

Response:
Ensure that the usernamed specified in the AuthInfo objects for this channel are correct for each LDAP server name provided. If the problem cannot be resolved then use the standard facilities supplied with your system to record the problem identifier and save the generated output files, and then contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ9696Usage: amqmcert [SERVERNAME] [-a handle] 
[-k SSLKeyR|CA|ROOT|MY] [-m QueueMgr] 
[-s CertFile] [-p PersonalCertFile] [-z Password] 
[-x handle] [-l] [-d handle] [-r handle] [-u] 
[-h]
Severity:
0 : Information

Explanation:
This shows the correct usage.

Response:
None.

AMQ9697WebSphere MQ Services could not be contacted on the target server.
Severity:
30 : Severe error

Explanation:
An attempt was made to contact the WebSphere MQ Services on the target server <insert_3>. The call failed with return code <insert_1>. The WebSphere MQ error recording routine has been called.

Response:
Ensure that the target server name specified is correct and that you have sufficient access rights on that server to be able to administer WebSphere MQ. If the problem cannot be resolved then use the standard facilities supplied with your system to record the problem identifier and save the generated output files, and then contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ9698An SSL security call failed during SSL handshaking.
Severity:
30 : Severe error

Explanation:
An SSPI call to the Secure Channel (Schannel) SSL provider failed during SSL handshaking. The failure has caused WebSphere MQ channel name <insert_3> to be closed. If the name is '????' then the name is unknown.

Response:
Consult the Windows Schannel reference manual to determine the meaning of status <insert_5> for SSPI call <insert_4>. Correct the failure and if necessary re-start the channel.

AMQ9699An unknown error occurred during an SSL security call during SSL handshaking.
Severity:
30 : Severe error

Explanation:
An unknown error occurred during an SSPI call to the Secure Channel (Schannel) SSL provider during SSL handshaking. The error may be due to a Windows SSL problem or to a general Windows problem or to invalid WebSphere MQ data being used in the call. The WebSphere MQ error recording routine has been called. The error has caused WebSphere MQ channel name <insert_3> to be closed. If the name is '????' then the name is unknown.

Response:
Consult the Windows Schannel reference manual to determine the meaning of status <insert_5> for SSPI call <insert_4>. If the problem can be resolved using the manual, correct the failure and if necessary re-start the channel. If the problem cannot be resolved then use the standard facilities supplied with your system to record the problem identifier and save the generated output files, and then contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ9710SSL security refresh failed.
Severity:
30 : Severe error

Explanation:
The request to refresh SSL security was unsuccessful.

Response:
Look at previous error messages in the error files to determine the cause of the failure.

AMQ9711SSL security refresh succeeded but channel restarts failed.
Severity:
30 : Severe error

Explanation:
The SSL environments for this queue manager have been refreshed so current values and certificates are in use for all SSL channels. However, not all the outbound SSL channels which were running when the security refresh was initiated could be restarted after the refresh had completed.

Response:
Look at previous error messages in the error files to determine which channels could not be restarted. Restart these if necessary.

AMQ9712SSL security refresh timed out waiting for channel <insert_3>.
Severity:
30 : Severe error

Explanation:
The system was performing a security refresh for SSL. This function requests all outbound and inbound SSL channels to stop. It then waits for these channels to actually stop. SSL channel <insert_3> did not stop within the timeout period.

Response:
Investigate why channel <insert_3> is hung. Terminate the hung channel. Rerun the SSL security refresh.

AMQ9713Channel <insert_3> ended: SSL refresh in progress.
Severity:
0 : Information

Explanation:
The SSL support on this queue manager is in the middle of a security refresh. An attempt was made to start outbound SSL channel <insert_3>. It cannot start while the SSL security refresh is in progress. The channel is restarted automatically once the SSL security refresh is complete.

Response:
None.

AMQ9714SSL refresh on receiving queue manager: channel did not start.
Severity:
30 : Severe error

Explanation:
An SSL security refresh is in progress on the queue manager at the receiving end of this SSL channel. The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'. The channel did not start.

Response:
Restart the channel once the SSL refresh is complete. The channel will restart automatically if it is configured to retry the connection.

AMQ9715Unexpected error detected in validating SSL session ID.
Severity:
30 : Severe error

Explanation:
This error can arise when the GSKit SSL provider is missing one or more pre-requisite PTFs on the OS/400 platform. The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'.

Response:
Ensure the GSKit SSL provider is at the latest level of maintenance and restart the channel.

AMQ9719Invalid CipherSpec for FIPS mode.
Severity:
30 : Severe error

Explanation:
The user is attempting to start a channel on a queue manager or MQ client which has been configured to run in FIPS mode. The user has specified a CipherSpec which is not FIPS-compliant. The channel is <insert_3>; in some cases its name cannot be determined and so is shown as '????'.

Response:
Redefine the channel to run with a FIPS-compliant CipherSpec. Alternatively, the channel may be defined with the correct CipherSpec and the queue manager or MQ client should not be running in FIPS mode; if this is the case, ensure that FIPS mode is not configured. Once the error is corrected, restart the channel.

AMQ9720

QUEUE MANAGERS:
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ9721
Queue Manager Name: <insert_3>
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ9722

CLIENTS:
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ9723
Client Certificate Store: <insert_3>
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ9724
Expiry Time: <insert_1> 
Migration Status: To be migrated 
Password: ********
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ9725
Expiry Time: <insert_1> 
Migration Status: Failed 
Password: ********
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ9726A certificate failed to be migrated because it has an invalid date. 
The certificate's details are: 
[Microsoft Certificate Store], [Subject], [Issuer], [Serial Number]: 
<insert_3>.
Severity:
30 : Severe error

Explanation:
During the migration of a certificate, the certificate's date fields have been found to be invalid. The certificate has either expired or its "from" date is later than today's date or its "to" date is earlier than the "from" date. 
The certificate has not been migrated.

Response:
If the certificate is required for migration then obtain a valid replacement before importing it into the GSKit key database <insert_5>.

AMQ9727A certificate failed to be migrated because it has an incomplete certification path. 
The certificate's details are: 
[Microsoft Certificate Store], [Subject], [Issuer], [Serial Number]: 
<insert_3>.
Severity:
30 : Severe error

Explanation:
During the migration of a certificate, the certificate's certification authority (signer) certificate could not be found. The certificate is therefore regarded as an orphan certificate. 
A copy of the certificate has been written to the file name <insert_4>. 
If file name is suffixed ".cer" then the certificate is a certification authority (signer) certificate. If file name is suffixed ".pfx" then the certificate is a personal certificate and it has a password which is the same as that specified for the GSKit key database <insert_5>. The certificate has not been migrated.

Response:
If the certificate is required for migration then ensure that a complete certification path exists in the GSKit key database <insert_5> before importing the certificate.

AMQ9728A certificate failed to be migrated because it could not be imported into the GSKit key database <insert_5>. 
The certificate's details are: 
[Microsoft Certificate Store], [Subject], [Issuer], [Serial Number]: 
<insert_3>.
Severity:
30 : Severe error

Explanation:
A certificate failed to be imported because there was a problem during the migration of the certificate. 
A copy of the certificate has been written to the file name <insert_4>. 
If file name is suffixed ".cer" then the certificate is a certification authority (signer) certificate. If file name is suffixed ".pfx" then the certificate is a personal certificate and it has a password which is the same as that specified for the GSKit key database <insert_5>. The certificate has not been migrated.

Response:
Refer to the previous message in the error log to determine the cause of the failure. If appropriate, refer to the Windows or GSKit reference documentation to determine the cause.

AMQ9729Unable to create certificate file <insert_3>.
Severity:
30 : Severe error

Explanation:
A certificate failed to be imported because there was a problem during the migration of the certificate. In addition to this first problem, a second problem occurred when trying to create a copy of the certificate by writing it to the file <insert_3>. The certificate is located in the Microsoft Certificate Store <insert_4>. The certificate is intended for the GSKit key database <insert_5>. If file name is suffixed ".cer" then the certificate is a certification authority (signer) certificate. If file name is suffixed ".pfx" then the certificate is a personal certificate. The certificate has not been migrated.

Response:
Determine the cause of the 2 problems. Refer to the previous message in the error log to determine the cause of the first failure. If appropriate, refer to the Windows or GSKit reference documentation to determine the cause. The second failure occurred during a call to the Windows 'CreateFile' function with a return code of <insert_1>. For this failure, check that file does not already exist and that you have authority to create this file.

AMQ9730Certificate migration has completed with no failures. The number of certificates migrated was <insert_1>.
Severity:
0 : Information

Explanation:
The migration of certificates from the Microsoft Certificate Store <insert_3> to the GSKit key database <insert_4> has completed and there were no migration failures. The number of certificates migrated was <insert_1>.

Response:
If any certificates were migrated, use the GSKit iKeyman GUI to verify that the GSKit key database contains all the certificates required to support the intended SSL channel. If no certificates were migrated then this is probably because <insert_3> contained only a default set of certification authority (signer) certificates. The default set is not migrated because the newly created GSKit key database will have its own set which will be the same or more up to date. 
Although there were no failures which caused certificates not to be migrated, there may have been other failures and these must be resolved otherwise the SSL channel may subsequently fail to start. Refer to the error log and check for any failures.

AMQ9731The Transfer Certificates (amqtcert) command has completed.
Severity:
0 : Information

Response:
None.

AMQ9732A registry entry already exists for <insert_3>.
Severity:
30 : Severe error

Explanation:
The command has been used to request automatic migration for a queue manager's or a client's Microsoft Certificate Store. However, there is already an entry in the registry for this store. If the request was for a queue manager then <insert_3> is the queue manager name, otherwise it is the name of the client's Microsoft Certificate Store.

Response:
List, and then check, the contents of the registry by running the Transfer Certificates (amqtcert) command with the options "-a -l". If it is necessary to replace the entry then firstly remove it, by using amqtcert with the "-r" option, then use amqtcert to request automatic migration.

AMQ9733The request to automatically migrate certificates has completed successfully.
Severity:
0 : Information

Explanation:
A request was made to automatically migrate SSL certificates. This request may have been made during the installation of WebSphere MQ or by using the Transfer Certificates (amqtcert) command. The request has now been performed and the migration has completed successfully.

Response:
Use the GSKit iKeyman GUI to verify that the GSKit key database contains all the certificates required to support the intended SSL channel. If no certificates were migrated then this is because the Microsoft Certificate Store contained only a default set of certification authority (signer) certificates. The default set is not migrated because the newly created GSKit key database will have its own set which will be the same or more up to date.

AMQ9734There was a failure during the automatic migration of certificates.
Severity:
30 : Severe error

Explanation:
A request was made to automatically migrate SSL certificates. This request may have been made during the installation of WebSphere MQ or by using the Transfer Certificates (amqtcert) command. The request has now been performed but there was a failure during the migration process.

Response:
Refer to previous messages in the error log to determine the cause of the failure. It may be the case that all certificates have successfully migrated and that the failure did not affect this part of the migration process. In this case, use the GSKit iKeyman GUI to verify that the GSKit key database contains all the certificates required to support the intended SSL channel.

AMQ9735Certificate migration has terminated unexpectedly. A failure occured during GSKit initialization.
Severity:
30 : Severe error

Explanation:
The certificate migration process has terminated unexpectedly. The migration requires the GSKit environment to be successfully initialized. This involves the GSKit operations of initialization, creation of the key database and stashing of the key database password. There was a failure during one of these operations. No certificates have been migrated. If the stashing of the password failed then the key database <insert_4> will have been created. The failure occurred during the GSKit operation <insert_3> and the GSKit return code <insert_1> was generated.

Response:
If the key database has been created then, after the cause of the failure has been resolved, delete it, remove the relevant registry state information and then re-try the certificate migration process. Use the standard facilities supplied with your system to record the problem identifier and save the generated output files, and then contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ9736The library <insert_3> was not found.
Severity:
30 : Severe error

Explanation:
An attempt to dynamically load the library <insert_3> failed because the library was not found. If this an WebSphere MQ library, it is only available on WebSphere MQ server installations and is required when the Transfer Certificates (amqtcert) command is used to perform a queue manager operation. If this a GSKit library, it should have been installed during the WebSphere MQ installation.

Response:
Do not use the command to perform a queue manager operation on a WebSphere MQ client-only installation. If the command has been made on a WebSphere MQ server installation, or if it is a GSKit library which is missing, then record the problem identifier, save any generated output files and then contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ9737Unable to allocate memory.
Severity:
30 : Severe error

Explanation:
An attempt to allocate memory failed.

Response:
Make more memory available to the command.

AMQ9738Unable to obtain the MQSSLKEYR environment variable value.
Severity:
30 : Severe error

Explanation:
An attempt to obtain the MQSSLKEYR environment variable value failed. When using the command to specify all clients then the MQSSLKEYR environment variable must be defined with the name of a Microsoft Certificate Store file containing certificates for all clients.

Response:
Ensure that the MQSSLKEYR environment variable is defined with an appropriate value.

AMQ9739The certificate store <insert_3> could not be accessed.
Severity:
30 : Severe error

Explanation:
The certificate store <insert_3> could not be accessed, and failed with Windows error code <insert_1>. If you are using the -c parameter check that the name given to amqtcert is correct. If you are using the -m parameter check the SSLKEYR value on the queue manager specified.

Response:
Consult the Windows reference manual to determine the meaning of error <insert_1> if this value is non-zero. If the problem cannot be resolved then use the standard facilities supplied with your system to record the problem identifier and save the generated output files, and then contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ9740The certificate store <insert_3> could not be opened.
Severity:
30 : Severe error

Explanation:
The certificate store <insert_3> could not be opened, and failed with Windows error code <insert_1>.

Response:
Consult the Windows reference manual to determine the meaning of error <insert_1> if this value is non-zero. If the problem cannot be resolved then use the standard facilities supplied with your system to record the problem identifier and save the generated output files, and then contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ9741A problem occurred during a Windows operation.
Severity:
30 : Severe error

Explanation:
During operation <insert_3>, the Windows return code <insert_1> was generated.

Response:
Consult the Windows reference manual to determine the meaning of return code <insert_1> for operation <insert_3>.

AMQ9742A problem occured during a GSKit operation.
Severity:
30 : Severe error

Explanation:
During operation <insert_3>, the GSKit return code <insert_1> was generated.

Response:
Use the standard facilities supplied with your system to record the problem identifier and save the generated output files, and then contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ9743A certificate failed to be migrated and failed to be logged. 
The certificate's details are: 
[Microsoft Certificate Store], [Subject], [Issuer], [Serial Number]: 
<insert_3>.
Severity:
30 : Severe error

Explanation:
There was a problem trying to migrate a certificate to the GSKit key database <insert_5>.

Response:
Refer to the previous message in the error log to determine why the migration failed.

AMQ9744No matching automatic migration registry entry.
Severity:
10 : Warning

Explanation:
There is no automatic certificate migration entry in the registry which matches the input provided.

Response:
None, if the entry to be removed was correctly specified. Otherwise, input the command again with correct parameters.

AMQ9745amqtcert: insufficient memory to migrate certificates.
Severity:
30 : Severe error

Explanation:
An attempt to allocate memory failed while amqtcert was migrating certificate file <insert_3>.sto'. The migration did not complete successfully.

Response:
Do not delete <insert_3>.sto', but delete all other files called <insert_4>.*' (these were created as a result of the failed migration). Also, if you want to rerun this migration automatically, use the -r flag on amqtcert to remove the automatic migration registry entry for this .sto file. Then use the -a flag on amqtcert to create a new automatic migration registry entry for this .sto file. 
Make more memory available. Rerun the migration.

AMQ9746File <insert_3> not found.
Severity:
30 : Severe error

Explanation:
The file specified as a command argument has not been found. The characters ".sto" have been automatically appended to the file name.

Response:
Check that file exists and that it is specified as the absolute (rather than relative) directory path and file name (excluding the .sto suffix) of the Microsoft Certificate Store.

AMQ9747Usage: amqtcert [-a] [-c [Filename | *]] [-e ExpirationTime] [-g FileName] 
[-i ListNumber] [-l] [-m [QMgrName | *]] [-p Password] 
[-r] [-u ClientLogonID] [-w FileName]
Severity:
0 : Information

Response:
None.

AMQ9748A problem occurred accessing the Windows registry.
Severity:
30 : Severe error

Explanation:
An attempt to access a key or value or data field in the Windows registry key failed. The failure may be due to part of the registry being in an invalid state or may be due to insufficient authority to access that part. The WebSphere MQ error recording routine has been called.

Response:
If <insert_3> includes the name of a Windows call, consult the Windows reference manual to determine the meaning of status <insert_1> for that call. Use the standard facilities supplied with your system to record the problem identifier, and to save the generated output files. Contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ9749Invalid combination of command arguments.
Severity:
30 : Severe error

Explanation:
The command syntax is incorrect because of an invalid combination of arguments.

Response:
Re-try the command using a valid combination of arguments.

AMQ9750File <insert_3> already exists.
Severity:
30 : Severe error

Explanation:
The file <insert_3> cannot be created because it already exists.

Response:
Ensure that the file does not exist in the directory. If necessary, make a copy of the file before renaming or moving or deleting it.

AMQ9751You are not authorized to perform the requested operation.
Severity:
30 : Severe error

Explanation:
You tried to issue a command for which you are not authorized.

Response:
Contact your system administrator to perform the command for you or to request authority to perform the command.

AMQ9752A certificate failed to be migrated because a Windows operation failed. 
The certificate's details are: 
[Microsoft Certificate Store], [Subject], [Issuer], [Serial Number]: 
<insert_4>.
Severity:
30 : Severe error

Explanation:
A personal certificate could not be migrated because there was a failure during the Windows operation <insert_3> with a return code of <insert_1>. A personal certificate is exported, with its private key data, from the Microsoft Certificate Store prior to being imported into the GSKit key database. The failure occurred during the export and is probably due to a problem with accessing or using the private key data assoicated with the personal certificate.

Response:
Check that the private key data is available and that you have authority to access it. Consult the Windows reference manual to determine the meaning of return code <insert_1> for operation <insert_3>.

AMQ9753File <insert_3> is empty.
Severity:
30 : Severe error

Explanation:
The file <insert_3> cannot be used because it is empty.

Response:
Ensure that the correct file has been used and if necessary investigate the reason for it being empty.

AMQ9754A certificate failed to be migrated because a GSKit operation failed. 
The certificate's details are: 
[Microsoft Certificate Store], [Subject], [Issuer], [Serial Number]: 
<insert_4>.
Severity:
30 : Severe error

Explanation:
During operation <insert_3>, the GSKit return code <insert_1> was generated.

Response:
Use the standard facilities supplied with your system to record the problem identifier and save the generated output files, and then contact your IBM support center. Do not discard these files until the problem has been resolved.

AMQ9755Certificate migration has completed with some failures. The number of certificates migrated was <insert_1>.
Severity:
0 : Information

Explanation:
The migration of certificates from the Microsoft Certificate Store <insert_3> to the GSKit key database <insert_4> has completed but there has been one or more failures. The number of certificates migrated was <insert_1>.

Response:
If any certificates were migrated, use the GSKit iKeyman GUI to verify that the GSKit key database contains all the certificates required to support the intended SSL channel. The failures must be resolved otherwise the SSL channel may subsequently fail to start. Refer to previous messages in the error log to determine the cause of such failures.

AMQ9756The number of certificates in the Microsoft Certificate Store <insert_3> is <insert_1>.
Severity:
0 : Information

Explanation:
Provides a count of the number of certificates in the Microsoft Certificate Store <insert_3>.

Response:
None.

AMQ9757
Certificate <insert_1>
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ9758Subject: <insert_3>
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ9759Issuer: <insert_3>
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ9760Valid From: <insert_3> to <insert_4>
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ9761Certificate Usage: <All>
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ9762Certificate Usage: <insert_3>
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ9763Certificate Type: Personal
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ9764Certificate Type: Signer
Severity:
0 : Information

Explanation:
None.

Response:
None.

AMQ9765Personal certificate not found for the command option "-i <insert_1>".
Severity:
30 : Severe error

Explanation:
The Transfer Certificates (amqtcert) command was executed using the "-i ListNumber" option with a value of <insert_1>. However, no personal certificate was found which corresponded to this value. Certificate migration has failed and no certificates were migrated.

Response:
Check that the option value corresponds to a correctly identified personal certificate. If it is not correct then run the command using the "-l List" option to determine the correct number. A GSKit key database, and its associated key database files, was created when the command was run using the "-i ListNumber" option. The database and associated files must be deleted before re-trying the command with the "-i" option.

AMQ9766A failure occurred creating the GSKit key database <insert_4>.
Severity:
30 : Severe error

Explanation:
GSKit was unable to create the key database and its associated files. During the GSKit operation <insert_3>, the return code <insert_1> was generated. This is probably due to insufficient authority or to insufficient disk space being available.

Response:
Check that you have sufficient authority and that there is sufficient disk space available.

AMQ9767Usage: strmqikm [iKeymanWorkingDirectory]
Severity:
0 : Information

Response:
None.

AMQ9768Directory <insert_3> not found.
Severity:
30 : Severe error

Explanation:
The directory specified as a command argument has not been found.

Response:
Check that the directory exists and that it is specified as an absolute (rather than relative) directory path.

AMQ9769Usage: runmqckm 
-keydb -changepw Change the password for a key database 
-convert Convert the format of a key database 
-create Create a key database 
-delete Delete a key database 
-stashpw Stash the password of a key database into a file 
-list Currently supported types of key database. 
-cert -add Add a CA Certificate 
-create Create a self-signed certificate 
-delete Delete a certificate 
-details Show the details of a specific certificate 
-export Export a personal certificate and associated private key into a PKCS12 file or a key database 
-extract Extract a certificate from a key database 
-getdefault Show the default personal certificate 
-import Import a certificate from a key database or a PKCS12 file 
-list List certificates in a key database 
-modify Modify a certificate (NOTE: the only field that my be modified is the trust field) 
-receive Receive a certificate 
-setdefault Set the default personal certificate 
-sign Sign a certificate 
-certreq -create Create a certificate request 
-delete Delete a certificate request from a certificate request database 
-details Show the details of a specific certificate request 
-extract Extract a certificate from a certificate request database 
-list List all certificate requests in a certificate request database 
-recreate Recreate a certificate request 
-version Display ikeycmd version information 
-help Display this help text
Severity:
0 : Information

Response:
None.

AMQ9913The specified local address <insert_3> cannot be resolved to an IP address. The return code is <insert_1>.
Severity:
30 : Severe error

Explanation:
An attempt to resolve the local address hostname to an IP address has failed.

Response:
Check that the local address hostname is correct and has an entry in the DNS database.

AMQ9914The type of local address specified is incompatible with the IP protocol (<insert_3>) used.
Severity:
30 : Severe error

Explanation:
An attempt to use a local address that is incompatible with the IP protocol used.

Response:
Make sure that the local address specified is of the same type (IPv4 or IPV6) as the IP Protocol.

AMQ9915The IP protocol <insert_3> is not present on the system.
Severity:
30 : Severe error

Explanation:
An attempt to use an IP protocol that is not present on the system has been made.

Response:
Install the required IP protocol or use an IP protocol that is available on the system.

AMQ9920A SOAP Exception has been thrown.
Severity:
30 : Severe error

Explanation:
A SOAP method encountered a problem and has thrown an exception. Details of the exception are: 
<insert_3>

Response:
Investigate why the SOAP method threw the exception.

AMQ9921An error was encountered writing to the Dead Letter Queue.
Severity:
30 : Severe error

Explanation:
An error was encountered when an attempt was made to write a message to Dead Letter Queue <insert_3>. The message was <insert_4>.

Response:
Ensure that Dead Letter Queue <insert_3> exists and is put enabled. Ensure that the Queue Manager attribute DEADQ is set up correctly. Resend the SOAP message.

AMQ9922Maximum wait time exceeded on queue <insert_3>.
Severity:
30 : Severe error

Explanation:
The maximum time waiting for a message to arrive on queue <insert_3> has been exceeded.

Response:
Ensure that the queue is not put inhibited. Ensure that messages are being written to the queue.

AMQ9923Insufficient parameters on command.
Severity:
30 : Severe error

Explanation:
The SOAP command has been issued with insufficient paramaters.

Response:
Supply the correct number of parameters and reissue the command.

AMQ9924Usage: amqwSOAPNETListener -u wmqUri 
[-w WebServiceDirectory] [-n MaxThreads] 
[-d StayAlive] [-i IdContext] 
[-x TransactionalControl] [-a Integrity] [-? ThisHelp]
Severity:
0 : Information

Explanation:
This shows the correct usage.

Response:
None.

AMQ9925Cannot connect to queue manager <insert_3>.
Severity:
30 : Severe error

Explanation:
A SOAP application or the SOAP listener cannot connect to the queue manager <insert_3> using <insert_4> bindings.

Response:
Ensure the bindings are set to the correct value and that the queue manager exists. Check any error messages from the Java MQQueueManager class.

AMQ9926Null SOAP action specified in a received SOAP message.
Severity:
30 : Severe error

Explanation:
A NULL soap action has been specified in the SOAP message <insert_3>. The message will not be processed.

Response:
Include the appropriate SOAP action in the SOAP message.

AMQ9927MQ queue backout threshold exceeded.
Severity:
30 : Severe error

Explanation:
The WebSphere MQ backout threshold value has been exceeded for queue <insert_3>, processing message <insert_4>.

Response:
Correct the backout threshold value for queue <insert_3> and resend the SOAP message.

AMQ9928Target service or URI is missing from a SOAP message.
Severity:
30 : Severe error

Explanation:
The target service or the target URI is missing from SOAP message <insert_3>.

Response:
Supply a target service or the target URI in the SOAP message.

AMQ9929Message backout for message (<insert_3>) failed.
Severity:
30 : Severe error

Explanation:
Backout for a message has failed.

Response:
Investigate the reason for the backout failure.

AMQ9930Required Option <insert_3> missing from command.
Severity:
30 : Severe error

Explanation:
The SOAP command was issued with manadatory option <insert_3> missing.

Response:
Reissue the SOAP command supplying the missing option.

AMQ9931Invalid value <insert_3> specified for option <insert_4>.
Severity:
30 : Severe error

Explanation:
THE SOAP command was issued with an invalid value for an option.

Response:
Reissue the SOAP command supplying the correct option value.

AMQ9932Application host class not found
Severity:
30 : Severe error

Explanation:
Application host class <insert_3> has not been found.

Response:
Specify the correct application host class in the SOAP message.

AMQ9933Options <insert_3> and <insert_4> are mutually exclusive
Severity:
30 : Severe error

Explanation:
The SOAP command was issued with incompatible options <insert_3> and <insert_4>.

Response:
Reissue the SOAP command supplying compatible options.

AMQ9934Could not parse URL <insert_3>. MQCC_FAILED(2) MQRC_SOAP_URL_ERROR(2212).
Severity:
30 : Severe error

Explanation:
.Could not parse URL <insert_3>.MQCC_FAILED(2) MQRC_SOAP_URL_ERROR(2212).

Response:
Correct the URL and retry.

AMQ9935Illegal URL <insert_3>. MQCC_FAILED(2) MQRC_SOAP_URL_ERROR(2212).
Severity:
30 : Severe error

Explanation:
.The URL <insert_3> failed validation.. MQCC_FAILED(2) MQRC_SOAP_URL_ERROR(2212).

Response:
Correct the URL and retry.

AMQ9936Cannot get connection using <insert_3> bindings. MQCC_FAILED(2) MQRC_CONNECTION_ERROR(2273).
Severity:
30 : Severe error

Explanation:
.Cannot get connection using <insert_3> bindings. MQCC_FAILED(2) MQRC_CONNECTION_ERROR(2273).

Response:
Check that the queue manager is available and running.

AMQ9937The asyncResult is null. MQCC_FAILED(2) MQRC_SOAP_DOTNET_ERROR.(2210).
Severity:
30 : Severe error

Explanation:
.The asyncResult is null. MQCC_FAILED(2) MQRC_SOAP_DOTNET_ERROR.(2210).

Response:
Check why the SOAP responses are not being received.

AMQ9938SOAP/WMQ Timeout.
Severity:
30 : Severe error

Explanation:
.The MQGET operation timed out. MQCC_FAILED(2) MQRC_SOAP_DOTNET_ERROR.(2210).

Response:
Check why the SOAP responses are not being received. MQCC_FAILED(2) MQRC_SOAP_DOTNET_ERROR.(2210).

AMQ9939SOAP/WMQ Error. MQCC_FAILED(2) MQRC_SOAP_DOTNET_ERROR.(2210).
Severity:
30 : Severe error

Explanation:
.A SOAP error was detected. MQCC_FAILED(2) MQRC_SOAP_DOTNET_ERROR.(2210).

Response:
Check the WMQ logs for the reason of the failure.

AMQ9940Report message returned in MQWebResponse. MQCC_FAILED(2) MQRC_SOAP_DOTNET_ERROR.(2210).
Severity:
30 : Severe error

Explanation:
.Report message returned in MQWebResponse. MQCC_FAILED(2) MQRC_SOAP_DOTNET_ERROR.(2210).

Response:
Check the report message for the reason of the failure.

AMQ9941No RFH2 header recognised. MQCC_FAILED(2) MQRCCF_MD_FORMAT_ERROR(3023).
Severity:
30 : Severe error

Explanation:
.No RFH2 header recognised. MQCC_FAILED(2) MQRCCF_MD_FORMAT_ERROR(3023).

Response:
Check why the message is being sent with no RFH2 header.

AMQ9942Message format is not MQFMT_NONE. MQCC_FAILED(2) MQRC_RFH_FORMAT_ERROR(2421).
Severity:
30 : Severe error

Explanation:
.Message format is not MQFMT_NONE. MQCC_FAILED(2) MQRC_RFH_FORMAT_ERROR(2421).

Response:
Correct the message format and retry.

AMQ9943Unrecognised RFH2 version. MQCC_FAILED(2) MQRC_RFH_FORMAT_ERROR(2421).
Severity:
30 : Severe error

Explanation:
.Unrecognised RFH2 version. MQCC_FAILED(2) MQRC_RFH_FORMAT_ERROR(2421).

Response:
Correct the version in the RFH2 message and retry.

AMQ9944Invalid RFH2 length. MQCC_FAILED(2) MQRC_RFH_FORMAT_ERROR(2421).
Severity:
30 : Severe error

Explanation:
.Invalid RFH2 length. MQCC_FAILED(2) MQRC_RFH_FORMAT_ERROR(2421).

Response:
Correct the RFH2 length and retry.

AMQ9945Illegal RFH2 <insert_3> folder length. MQCC_FAILED(2) MQRC_RFH_FORMAT_ERROR(2421).
Severity:
30 : Severe error

Explanation:
.Illegal RFH2 <insert_3> folder length. MQCC_FAILED(2) MQRC_RFH_FORMAT_ERROR(2421).

Response:
Correct the RFH2 message and retry.

AMQ9946Invalid actual message length. MQCC_FAILED(2) MQRC_RFH_FORMAT_ERROR(2421).
Severity:
30 : Severe error

Explanation:
.Invalid actual message length. MQCC_FAILED(2) MQRC_RFH_FORMAT_ERROR(2421).

Response:
Correct the RFH2 message and retry.

AMQ9947Illegal RFH2 Folder <insert_3> <insert_4>. MQCC_FAILED(2) MQRC_RFH_FORMAT_ERROR(2421).
Severity:
30 : Severe error

Explanation:
.Illegal RFH2 Folder <insert_3> <insert_4>. MQCC_FAILED(2) MQRC_RFH_FORMAT_ERROR(2421).

Response:
Correct the RFH2 folder syntax/format and retry.

AMQ9948Backout Threshold exceeded. MQCC_FAILED(2) MQRC_BACKOUT_THRESHOLD_REACHED(2362).
Severity:
30 : Severe error

Explanation:
.Backout Threshold exceeded. MQCC_FAILED(2) MQRC_BACKOUT_THRESHOLD_REACHED(2362).

Response:
Correct the backout threshold limit and retry.

AMQ9949<insert_3> missing from RFH2. MQCC_FAILED(2) MQRC_RFH_PARM_MISSING(2339).
Severity:
30 : Severe error

Explanation:
.<insert_3> missing from RFH2. MQCC_FAILED(2) MQRC_RFH_PARM_MISSING(2339).

Response:
Correct the RFH2 message and retry.

AMQ9950Target service missing from SOAP URL. MQCC_FAILED(2) MQRC_SOAP_URL_ERROR(2212).
Severity:
30 : Severe error

Explanation:
.Target service missing from SOAP URL. MQCC_FAILED(2) MQRC_SOAP_URL_ERROR(2212).

Response:
Correct the URL and retry.

AMQ9951Asynchronous request queued successfully. MQCC_OK(0).
Severity:
30 : Severe error

Explanation:
.Asynchronous request queued successfully. MQCC_OK(0).

Response:
Wait for response if any is expected.

AMQ9952Unexpected message type received. MQCC_FAILED(2) MQRC_UNEXPECTED_MSG_TYPE.(2215).
Severity:
30 : Severe error

Explanation:
.A message of the wrong type was received; for instance, a report message was received when one had not been requested.

Response:
If you are running WMQ SOAP using the IBM supplied SOAP/WMQ sender, please contact IBM. If you are running WMQ SOAP using a bespoke sender, please check that the SOAP/WMQ request message has the correct options.

AMQ9953Either the ContentType or the TransportVersion in the RFH2 header have the wrong value. MQCC_FAILED(2) MQRC_RFH_HEADER_FIELD_ERROR(2228)
Severity:
30 : Severe error

Explanation:
.Either the ContentType or the TransportVersion in the RFH2 header have the wrong value. MQCC_FAILED(2) MQRC_RFH_HEADER_FIELD_ERROR(2228)

Response:
Correct the message format and retry.

AMQ9954ViaTran.Redirect called out of transaction MQCC_FAILED(2) MQRC_SOAP_DOTNET_ERROR(2410)
Severity:
30 : Severe error

Explanation:
.ViaTran.Redirect called out of transaction MQCC_FAILED(2) MQRC_SOAP_DOTNET_ERROR(2410)

Response:
Make sure ViaTran.Redirect is only called in a transaction.

AMQ9955Usage: amqswsdl [?] Uri inputFile outputFile
Severity:
0 : Information

Explanation:
This shows the correct usage.

Response:
None.

AMQ9990 (iSeries)Keyword <insert_3> not valid for this command or the command is incomplete.
Severity:
40 : Stop Error

Explanation:
The command is incomplete, or an invalid keyword was specified, or the parameter value of the keyword was not specified.

Response:
Complete the command, or correct the keyword, or add the parameter value, and then try the command again.

AMQ9991 (iSeries)The value specified is not allowed by the command.
Severity:
40 : Stop Error

Explanation:
<insert_3> not valid for parameter <insert_4>.

Response:
Enter one of the values that is defined for the parameter, and try the command again. More information on parameters and commands can be found in the CL reference manual or the appropriate licensed program manual.

AMQ9992 (iSeries)A matching parenthesis not found.
Severity:
40 : Stop Error

Explanation:
A matching left or right parenthesis is missing.

Response:
Add the missing parenthesis or remove the extra parenthesis.

AMQ9999Channel program ended abnormally.
Severity:
30 : Severe error

Explanation:
Channel program <insert_3> ended abnormally.

Response:
Look at previous error messages for channel program <insert_3> in the error files to determine the cause of the failure.


Reason code list:
=================


0 (X'0000')MQRC_NONE
Explanation:
The call completed normally. The completion code (CompCode) is MQCC_OK.

Completion Code:
MQCC_OK

Programmer Response:
None.

900 (X'0384')MQRC_APPL_FIRST
Explanation:
This is the lowest value for an application-defined reason code returned by a data-conversion exit. Data-conversion exits can return reason codes in the range MQRC_APPL_FIRST through MQRC_APPL_LAST to indicate particular conditions that the exit has detected.

Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
As defined by the writer of the data-conversion exit.

999 (X'03E7')MQRC_APPL_LAST
Explanation:
This is the highest value for an application-defined reason code returned by a data-conversion exit. Data-conversion exits can return reason codes in the range MQRC_APPL_FIRST through MQRC_APPL_LAST to indicate particular conditions that the exit has detected.

Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
As defined by the writer of the data-conversion exit.

2001 (X'07D1')MQRC_ALIAS_BASE_Q_TYPE_ERROR
Explanation:
An MQOPEN or MQPUT1 call was issued specifying an alias queue as the destination, but the BaseQName in the alias queue definition resolves to a queue that is not a local queue, a local definition of a remote queue, or a cluster queue.

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the queue definitions.

2002 (X'07D2')MQRC_ALREADY_CONNECTED
Explanation:
An MQCONN or MQCONNX call was issued, but the application is already connected to the queue manager. 

On z/OS, this reason code occurs for batch and IMS applications only; it does not occur for CICS applications. 
On AIX, HP-UX, i5/OS, Solaris, Windows, this reason code occurs if the application attempts to create a nonshared handle when a nonshared handle already exists for the thread. A thread can have no more than one nonshared handle. 
On Windows, MTS objects do not receive this reason code, as additional connections to the queue manager are allowed.
Completion Code:
MQCC_WARNING

Programmer Response:
None. The Hconn parameter returned has the same value as was returned for the previous MQCONN or MQCONNX call.

An MQCONN or MQCONNX call that returns this reason code does not mean that an additional MQDISC call must be issued in order to disconnect from the queue manager. If this reason code is returned because the application has been called in a situation where the connect has already been done, a corresponding MQDISC should not be issued, because this will cause the application that issued the original MQCONN or MQCONNX call to be disconnected as well.

2003 (X'07D3')MQRC_BACKED_OUT
Explanation:
The current unit of work encountered a fatal error or was backed out. This occurs in the following cases: 

On an MQCMIT or MQDISC call, when the commit operation has failed and the unit of work has been backed out. All resources that participated in the unit of work have been returned to their state at the start of the unit of work. The MQCMIT or MQDISC call completes with MQCC_WARNING in this case. 
On z/OS, this reason code occurs only for batch applications.
On an MQGET, MQPUT, or MQPUT1 call that is operating within a unit of work, when the unit of work has already encountered an error that prevents the unit of work being committed (for example, when the log space is exhausted). The application must issue the appropriate call to back out the unit of work. (For a unit of work coordinated by the queue manager, this call is the MQBACK call, although the MQCMIT call has the same effect in these circumstances.) The MQGET, MQPUT, or MQPUT1 call completes with MQCC_FAILED in this case. 
On z/OS, this case does not occur.
Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
Check the returns from previous calls to the queue manager. For example, a previous MQPUT call may have failed.

2004 (X'07D4')MQRC_BUFFER_ERROR
Explanation:
The Buffer parameter is not valid for one of the following reasons: 

The parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.) 
The parameter pointer points to storage that cannot be accessed for the entire length specified by BufferLength. 
For calls where Buffer is an output parameter: the parameter pointer points to read-only storage.
Completion Code:
MQCC_FAILED

Programmer Response:
Correct the parameter.

2005 (X'07D5')MQRC_BUFFER_LENGTH_ERROR
Explanation:
The BufferLength parameter is not valid, or the parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

This reason can also be returned to an MQ client program on the MQCONN or MQCONNX call if the negotiated maximum message size for the channel is smaller than the fixed part of any call structure.

This reason should also be returned by the MQZ_ENUMERATE_AUTHORITY_DATA installable service component when the AuthorityBuffer parameter is too small to accommodate the data to be returned to the invoker of the service component.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a value that is zero or greater. For the mqAddString and mqSetString calls, the special value MQBL_NULL_TERMINATED is also valid.

2006 (X'07D6')MQRC_CHAR_ATTR_LENGTH_ERROR
Explanation:
CharAttrLength is negative (for MQINQ or MQSET calls), or is not large enough to hold all selected attributes (MQSET calls only). This reason also occurs if the parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a value large enough to hold the concatenated strings for all selected attributes.

2007 (X'07D7')MQRC_CHAR_ATTRS_ERROR
Explanation:
CharAttrs is not valid. The parameter pointer is not valid, or points to read-only storage for MQINQ calls or to storage that is not as long as implied by CharAttrLength. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the parameter.

2008 (X'07D8')MQRC_CHAR_ATTRS_TOO_SHORT
Explanation:
For MQINQ calls, CharAttrLength is not large enough to contain all of the character attributes for which MQCA_* selectors are specified in the Selectors parameter.

The call still completes, with the CharAttrs parameter string filled in with as many character attributes as there is room for. Only complete attribute strings are returned: if there is insufficient space remaining to accommodate an attribute in its entirety, that attribute and subsequent character attributes are omitted. Any space at the end of the string not used to hold an attribute is unchanged.

An attribute that represents a set of values (for example, the namelist Names attribute) is treated as a single entity--either all of its values are returned, or none.

Completion Code:
MQCC_WARNING

Programmer Response:
Specify a large enough value, unless only a subset of the values is needed.

2009 (X'07D9')MQRC_CONNECTION_BROKEN
Explanation:
Connection to the queue manager has been lost. This can occur because the queue manager has ended. If the call is an MQGET call with the MQGMO_WAIT option, the wait has been canceled. All connection and object handles are now invalid.

For MQ client applications, it is possible that the call did complete successfully, even though this reason code is returned with a CompCode of MQCC_FAILED.

Completion Code:
MQCC_FAILED

Programmer Response:
Applications can attempt to reconnect to the queue manager by issuing the MQCONN or MQCONNX call. It may be necessary to poll until a successful response is received. 

On z/OS for CICS applications, it is not necessary to issue the MQCONN or MQCONNX call, because CICS applications are connected automatically.
Any uncommitted changes in a unit of work should be backed out. A unit of work that is coordinated by the queue manager is backed out automatically.

2010 (X'07DA')MQRC_DATA_LENGTH_ERROR
Explanation:
The DataLength parameter is not valid. Either the parameter pointer is not valid, or it points to read-only storage. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

This reason can also be returned to an MQ client program on the MQGET, MQPUT, or MQPUT1 call, if the BufferLength parameter exceeds the maximum message size that was negotiated for the client channel.

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the parameter.

If the error occurs for an MQ client program, also check that the maximum message size for the channel is big enough to accommodate the message being sent; if it is not big enough, increase the maximum message size for the channel.

2011 (X'07DB')MQRC_DYNAMIC_Q_NAME_ERROR
Explanation:
On the MQOPEN call, a model queue is specified in the ObjectName field of the ObjDesc parameter, but the DynamicQName field is not valid, for one of the following reasons: 

DynamicQName is completely blank (or blank up to the first null character in the field). 
Characters are present that are not valid for a queue name. 
An asterisk is present beyond the 33rd position (and before any null character). 
An asterisk is present followed by characters that are not null and not blank.
This reason code can also sometimes occur when a server application opens the reply queue specified by the ReplyToQ and ReplyToQMgr fields in the MQMD of a message that the server has just received. In this case the reason code indicates that the application that sent the original message placed incorrect values into the ReplyToQ and ReplyToQMgr fields in the MQMD of the original message.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a valid name.

2012 (X'07DC')MQRC_ENVIRONMENT_ERROR
Explanation:
The call is not valid for the current environment. 

On z/OS, one of the following applies: 
An MQCONN or MQCONNX call was issued, but the application has been linked with an adapter that is not supported in the environment in which the application is running. For example, this can arise when the application is linked with the MQ RRS adapter, but the application is running in a DB2 Stored Procedure address space. RRS is not supported in this environment. Stored Procedures wishing to use the MQ RRS adapter must run in a DB2 WLM-managed Stored Procedure address space. 
An MQCMIT or MQBACK call was issued, but the application has been linked with the RRS batch adapter CSQBRSTB. This adapter does not support the MQCMIT and MQBACK calls. 
An MQCMIT or MQBACK call was issued in the CICS or IMS environment. 
The RRS subsystem is not up and running on the z/OS system that ran the application.
On HP OpenVMS, OS/2, i5/OS, Compaq NonStop Kernel, UNIX systems, and Windows, one of the following applies: 
The application is linked to the wrong libraries (threaded or nonthreaded). 
An MQBEGIN, MQCMIT, or MQBACK call was issued, but an external unit-of-work manager is in use. For example, this reason code occurs on Windows when an MTS object is running as a DTC transaction. This reason code also occurs if the queue manager does not support units of work. 
The MQBEGIN call was issued in an MQ client environment. 
An MQXCLWLN call was issued, but the call did not originate from a cluster workload exit.
Completion Code:
MQCC_FAILED

Programmer Response:
Do one of the following (as appropriate): 

On z/OS: 
Link the application with the correct adapter. 
Modify the application to use the SRRCMIT and SRRBACK calls in place of the MQCMIT and MQBACK calls. Alternatively, link the application with the RRS batch adapter CSQBRRSI. This adapter supports MQCMIT and MQBACK in addition to SRRCMIT and SRRBACK. 
For a CICS or IMS application, issue the appropriate CICS or IMS call to commit or backout the unit of work. 
Start the RRS subsystem on the z/OS system that is running the application.
In the other environments: 
Link the application with the correct libraries (threaded or nonthreaded). 
Remove from the application the call that is not supported.
2013 (X'07DD')MQRC_EXPIRY_ERROR
Explanation:
On an MQPUT or MQPUT1 call, the value specified for the Expiry field in the message descriptor MQMD is not valid.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a value that is greater than zero, or the special value MQEI_UNLIMITED.

2014 (X'07DE')MQRC_FEEDBACK_ERROR
Explanation:
On an MQPUT or MQPUT1 call, the value specified for the Feedback field in the message descriptor MQMD is not valid. The value is not MQFB_NONE, and is outside both the range defined for system feedback codes and the range defined for application feedback codes.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify MQFB_NONE, or a value in the range MQFB_SYSTEM_FIRST through MQFB_SYSTEM_LAST, or MQFB_APPL_FIRST through MQFB_APPL_LAST.

2016 (X'07E0')MQRC_GET_INHIBITED
Explanation:
MQGET calls are currently inhibited for the queue, or for the queue to which this queue resolves.

Completion Code:
MQCC_FAILED

Programmer Response:
If the system design allows get requests to be inhibited for short periods, retry the operation later.

2017 (X'07E1')MQRC_HANDLE_NOT_AVAILABLE
Explanation:
An MQOPEN or MQPUT1 call was issued, but the maximum number of open handles allowed for the current task has already been reached. Be aware that when a distribution list is specified on the MQOPEN or MQPUT1 call, each queue in the distribution list uses one handle. 

On z/OS, "task" means a CICS task, a z/OS task, or an IMS-dependent region.
Completion Code:
MQCC_FAILED

Programmer Response:
Check whether the application is issuing MQOPEN calls without corresponding MQCLOSE calls. If it is, modify the application to issue the MQCLOSE call for each open object as soon as that object is no longer needed.

Also check whether the application is specifying a distribution list containing a large number of queues that are consuming all of the available handles. If it is, increase the maximum number of handles that the task can use, or reduce the size of the distribution list. The maximum number of open handles that a task can use is given by the MaxHandles queue manager attribute.

2018 (X'07E2')MQRC_HCONN_ERROR
Explanation:
The connection handle Hconn is not valid, for one of the following reasons: 

The parameter pointer is not valid, or (for the MQCONN or MQCONNX call) points to read-only storage. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.) 
The value specified was not returned by a preceding MQCONN or MQCONNX call. 
The value specified has been made invalid by a preceding MQDISC call. 
The handle is a shared handle that has been made invalid by another thread issuing the MQDISC call. 
The handle is a shared handle that is being used on the MQBEGIN call (only nonshared handles are valid on MQBEGIN). 
The handle is a nonshared handle that is being used a thread that did not create the handle. 
The call was issued in the MTS environment in a situation where the handle is not valid (for example, passing the handle between processes or packages; note that passing the handle between library packages is supported).
Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that a successful MQCONN or MQCONNX call is performed for the queue manager, and that an MQDISC call has not already been performed for it. Ensure that the handle is being used within its valid scope (see the description of MQCONN in the WebSphere MQ Application Programming Guide). 

On z/OS, also check that the application has been linked with the correct stub; this is CSQCSTUB for CICS applications, CSQBSTUB for batch applications, and CSQQSTUB for IMS applications. Also, the stub used must not belong to a release of the queue manager that is more recent than the release on which the application will run.
2019 (X'07E3')MQRC_HOBJ_ERROR
Explanation:
The object handle Hobj is not valid, for one of the following reasons: 

The parameter pointer is not valid, or (for the MQOPEN call) points to read-only storage. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.) 
The value specified was not returned by a preceding MQOPEN call. 
The value specified has been made invalid by a preceding MQCLOSE call. 
The handle is a shared handle that has been made invalid by another thread issuing the MQCLOSE call. 
The handle is a nonshared handle that is being used by a thread that did not create the handle. 
The call is MQGET or MQPUT, but the object represented by the handle is not a queue.
Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that a successful MQOPEN call is performed for this object, and that an MQCLOSE call has not already been performed for it. Ensure that the handle is being used within its valid scope (see the description of MQOPEN in the WebSphere MQ Application Programming Guide).

2020 (X'07E4')MQRC_INHIBIT_VALUE_ERROR
Explanation:
On an MQSET call, the value specified for either the MQIA_INHIBIT_GET attribute or the MQIA_INHIBIT_PUT attribute is not valid.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a valid value for the InhibitGet or InhibitPut queu attribute.

2021 (X'07E5')MQRC_INT_ATTR_COUNT_ERROR
Explanation:
On an MQINQ or MQSET call, the IntAttrCount parameter is negative (MQINQ or MQSET), or smaller than the number of integer attribute selectors (MQIA_*) specified in the Selectors parameter (MQSET only). This reason also occurs if the parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a value large enough for all selected integer attributes.

2022 (X'07E6')MQRC_INT_ATTR_COUNT_TOO_SMALL
Explanation:
On an MQINQ call, the IntAttrCount parameter is smaller than the number of integer attribute selectors (MQIA_*) specified in the Selectors parameter.

The call completes with MQCC_WARNING, with the IntAttrs array filled in with as many integer attributes as there is room for.

Completion Code:
MQCC_WARNING

Programmer Response:
Specify a large enough value, unless only a subset of the values is needed.

2023 (X'07E7')MQRC_INT_ATTRS_ARRAY_ERROR
Explanation:
On an MQINQ or MQSET call, the IntAttrs parameter is not valid. The parameter pointer is not valid (MQINQ and MQSET), or points to read-only storage or to storage that is not as long as indicated by the IntAttrCount parameter (MQINQ only). (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the parameter.

2024 (X'07E8')MQRC_SYNCPOINT_LIMIT_REACHED
Explanation:
An MQGET, MQPUT, or MQPUT1 call failed because it would have caused the number of uncommitted messages in the current unit of work to exceed the limit defined for the queue manager (see the MaxUncommittedMsgs queue-manager attribute). The number of uncommitted messages is the sum of the following since the start of the current unit of work: 

Messages put by the application with the MQPMO_SYNCPOINT option 
Messages retrieved by the application with the MQGMO_SYNCPOINT option 
Trigger messages and COA report messages generated by the queue manager for messages put with the MQPMO_SYNCPOINT option 
COD report messages generated by the queue manager for messages retrieved with the MQGMO_SYNCPOINT option 
On Compaq NonStop Kernel, this reason code occurs when the maximum number of I/O operations in a single TM/MP transaction has been exceeded.
Completion Code:
MQCC_FAILED

Programmer Response:
Check whether the application is looping. If it is not, consider reducing the complexity of the application. Alternatively, increase the queue-manager limit for the maximum number of uncommitted messages within a unit of work. 

On z/OS, the limit for the maximum number of uncommitted messages can be changed by using the ALTER QMGR command. 
On i5/OS, the limit for the maximum number of uncommitted messages can be changed by using the CHGMQM command. 
On Compaq NonStop Kernel, the application should cancel the transaction and retry with a smaller number of operations in the unit of work. See the MQSeries for Tandem NonStop Kernel System Management Guide for more details.
2025 (X'07E9')MQRC_MAX_CONNS_LIMIT_REACHED
Explanation:
The MQCONN or MQCONNX call was rejected because the maximum number of concurrent connections has been exceeded. 

On z/OS, connection limits are applicable only to TSO and batch requests. The limits are determined by the customer using the following parameters of the CSQ6SYSP macro: 
For TSO: IDFORE 
For batch: IDBACK
For more information, see the WebSphere MQ for z/OS System Setup Guide.

On HP OpenVMS, OS/2, i5/OS, Compaq NonStop Kernel, UNIX systems, and Windows, this reason code can also occur on the MQOPEN call. 
When using Java applications, a limit to the number of concurrent connections may be defined by the connection manager.
Completion Code:
MQCC_FAILED

Programmer Response:
Either increase the size of the appropriate parameter value, or reduce the number of concurrent connections.

2026 (X'07EA')MQRC_MD_ERROR
Explanation:
The MQMD structure is not valid, for one of the following reasons: 

The StrucId field is not MQMD_STRUC_ID. 
The Version field specifies a value that is not valid or not supported. 
The parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.) 
The queue manager cannot copy the changed structure to application storage, even though the call is successful. This can occur, for example, if the pointer points to read-only storage.
Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that input fields in the MQMD structure are set correctly.

2027 (X'07EB')MQRC_MISSING_REPLY_TO_Q
Explanation:
On an MQPUT or MQPUT1 call, the ReplyToQ field in the message descriptor MQMD is blank, but one or both of the following is true: 

A reply was requested (that is, MQMT_REQUEST was specified in the MsgType field of the message descriptor). 
A report message was requested in the Report field of the message descriptor.
Completion Code:
MQCC_FAILED

Programmer Response:
Specify the name of the queue to which the reply message or report message is to be sent.

2029 (X'07ED')MQRC_MSG_TYPE_ERROR
Explanation:
Either: 

On an MQPUT or MQPUT1 call, the value specified for the MsgType field in the message descriptor (MQMD) is not valid. 
A message processing program received a message that does not have the expected message type. For example, if the WebSphere MQ command server receives a message which is not a request message (MQMT_REQUEST) then it rejects the request with this reason code.
Completion Code:
MQCC_FAILED

Programmer Response:
Specify a valid value for the MsgType field. In the case where a request is rejected by a message processing program, refer to the documentation for that program for details of the message types that it supports.

2030 (X'07EE')MQRC_MSG_TOO_BIG_FOR_Q
Explanation:
An MQPUT or MQPUT1 call was issued to put a message on a queue, but the message was too long for the queue and MQMF_SEGMENTATION_ALLOWED was not specified in the MsgFlags field in MQMD. If segmentation is not allowed, the length of the message cannot exceed the lesser of the queue MaxMsgLength attribute and queue-manager MaxMsgLength attribute. 

On z/OS, the queue manager does not support the segmentation of messages; if MQMF_SEGMENTATION_ALLOWED is specified, it is accepted but ignored.
This reason code can also occur when MQMF_SEGMENTATION_ALLOWED is specified, but the nature of the data present in the message prevents the queue manager splitting it into segments that are small enough to place on the queue: 

For a user-defined format, the smallest segment that the queue manager can create is 16 bytes. 
For a built-in format, the smallest segment that the queue manager can create depends on the particular format, but is greater than 16 bytes in all cases other than MQFMT_STRING (for MQFMT_STRING the minimum segment size is 16 bytes).
MQRC_MSG_TOO_BIG_FOR_Q can also occur in the Feedback field in the message descriptor of a report message; in this case it indicates that the error was encountered by a message channel agent when it attempted to put the message on a remote queue.

Completion Code:
MQCC_FAILED

Programmer Response:
Check whether the BufferLength parameter is specified correctly; if it is, do one of the following: 

Increase the value of the queue's MaxMsgLength attribute; the queue-manager's MaxMsgLength attribute may also need increasing. 
Break the message into several smaller messages. 
Specify MQMF_SEGMENTATION_ALLOWED in the MsgFlags field in MQMD; this will allow the queue manager to break the message into segments.
2031 (X'07EF')MQRC_MSG_TOO_BIG_FOR_Q_MGR
Explanation:
An MQPUT or MQPUT1 call was issued to put a message on a queue, but the message was too long for the queue manager and MQMF_SEGMENTATION_ALLOWED was not specified in the MsgFlags field in MQMD. If segmentation is not allowed, the length of the message cannot exceed the lesser of the queue-manager MaxMsgLength attribute and queue MaxMsgLength attribute.

This reason code can also occur when MQMF_SEGMENTATION_ALLOWED is specified, but the nature of the data present in the message prevents the queue manager splitting it into segments that are small enough for the queue-manager limit: 

For a user-defined format, the smallest segment that the queue manager can create is 16 bytes. 
For a built-in format, the smallest segment that the queue manager can create depends on the particular format, but is greater than 16 bytes in all cases other than MQFMT_STRING (for MQFMT_STRING the minimum segment size is 16 bytes).
MQRC_MSG_TOO_BIG_FOR_Q_MGR can also occur in the Feedback field in the message descriptor of a report message; in this case it indicates that the error was encountered by a message channel agent when it attempted to put the message on a remote queue.

This reason also occurs if a channel, through which the message is to pass, has restricted the maximum message length to a value that is actually less than that supported by the queue manager, and the message length is greater than this value. 

On z/OS, this return code is issued only if you are using CICS for distributed queuing. Otherwise, MQRC_MSG_TOO_BIG_FOR_CHANNEL is issued.
Completion Code:
MQCC_FAILED

Programmer Response:
Check whether the BufferLength parameter is specified correctly; if it is, do one of the following: 

Increase the value of the queue-manager's MaxMsgLength attribute; the queue's MaxMsgLength attribute may also need increasing. 
Break the message into several smaller messages. 
Specify MQMF_SEGMENTATION_ALLOWED in the MsgFlags field in MQMD; this will allow the queue manager to break the message into segments. 
Check the channel definitions.
2033 (X'07F1')MQRC_NO_MSG_AVAILABLE
Explanation:
An MQGET call was issued, but there is no message on the queue satisfying the selection criteria specified in MQMD (the MsgId and CorrelId fields), and in MQGMO (the Options and MatchOptions fields). Either the MQGMO_WAIT option was not specified, or the time interval specified by the WaitInterval field in MQGMO has expired. This reason is also returned for an MQGET call for browse, when the end of the queue has been reached.

This reason code can also be returned by the mqGetBag and mqExecute calls. mqGetBag is similar to MQGET. For the mqExecute call, the completion code can be either MQCC_WARNING or MQCC_FAILED: 

If the completion code is MQCC_WARNING, some response messages were received during the specified wait interval, but not all. The response bag contains system-generated nested bags for the messages that were received. 
If the completion code is MQCC_FAILED, no response messages were received during the specified wait interval.
Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
If this is an expected condition, no corrective action is required.

If this is an unexpected condition, check that: 

The message was put on the queue successfully. 
The unit of work (if any) used for the MQPUT or MQPUT1 call was committed successfully. 
The options controlling the selection criteria are specified correctly. All of the following can affect the eligibility of a message for return on the MQGET call: 
MQGMO_LOGICAL_ORDER 
MQGMO_ALL_MSGS_AVAILABLE 
MQGMO_ALL_SEGMENTS_AVAILABLE 
MQGMO_COMPLETE_MSG 
MQMO_MATCH_MSG_ID 
MQMO_MATCH_CORREL_ID 
MQMO_MATCH_GROUP_ID 
MQMO_MATCH_MSG_SEQ_NUMBER 
MQMO_MATCH_OFFSET 
Value of MsgId field in MQMD 
Value of CorrelId field in MQMD
Consider waiting longer for the message.

2034 (X'07F2')MQRC_NO_MSG_UNDER_CURSOR
Explanation:
An MQGET call was issued with either the MQGMO_MSG_UNDER_CURSOR or the MQGMO_BROWSE_MSG_UNDER_CURSOR option. However, the browse cursor is not positioned at a retrievable message. This is caused by one of the following: 

The cursor is positioned logically before the first message (as it is before the first MQGET call with a browse option has been successfully performed). 
The message the browse cursor was positioned on has been locked or removed from the queue (probably by some other application) since the browse operation was performed. 
The message the browse cursor was positioned on has expired.
Completion Code:
MQCC_FAILED

Programmer Response:
Check the application logic. This may be an expected reason if the application design allows multiple servers to compete for messages after browsing. Consider also using the MQGMO_LOCK option with the preceding browse MQGET call.

2035 (X'07F3')MQRC_NOT_AUTHORIZED
Explanation:
The user is not authorized to perform the operation attempted: 

On an MQCONN or MQCONNX call, the user is not authorized to connect to the queue manager. 
On z/OS, for CICS applications, MQRC_CONNECTION_NOT_AUTHORIZED is issued instead.
On an MQOPEN or MQPUT1 call, the user is not authorized to open the object for the option(s) specified. 
On z/OS, if the object being opened is a model queue, this reason also arises if the user is not authorized to create a dynamic queue with the required name.
On an MQCLOSE call, the user is not authorized to delete the object, which is a permanent dynamic queue, and the Hobj parameter specified on the MQCLOSE call is not the handle returned by the MQOPEN call that created the queue. 
On a command, the user is not authorized to issue the command, or to access the object it specifies.
This reason code can also occur in the Feedback field in the message descriptor of a report message; in this case it indicates that the error was encountered by a message channel agent when it attempted to put the message on a remote queue.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the correct queue manager or object was specified, and that appropriate authority exists.

2036 (X'07F4')MQRC_NOT_OPEN_FOR_BROWSE
Explanation:
An MQGET call was issued with one of the following options: 

MQGMO_BROWSE_FIRST 
MQGMO_BROWSE_NEXT 
MQGMO_BROWSE_MSG_UNDER_CURSOR 
MQGMO_MSG_UNDER_CURSOR
but the queue had not been opened for browse.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify MQOO_BROWSE when the queue is opened.

2037 (X'07F5')MQRC_NOT_OPEN_FOR_INPUT
Explanation:
An MQGET call was issued to retrieve a message from a queue, but the queue had not been opened for input.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify one of the following when the queue is opened: 

MQOO_INPUT_SHARED 
MQOO_INPUT_EXCLUSIVE 
MQOO_INPUT_AS_Q_DEF
2038 (X'07F6')MQRC_NOT_OPEN_FOR_INQUIRE
Explanation:
An MQINQ call was issued to inquire object attributes, but the object had not been opened for inquire.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify MQOO_INQUIRE when the object is opened.

2039 (X'07F7')MQRC_NOT_OPEN_FOR_OUTPUT
Explanation:
An MQPUT call was issued to put a message on a queue, but the queue had not been opened for output.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify MQOO_OUTPUT when the queue is opened.

2040 (X'07F8')MQRC_NOT_OPEN_FOR_SET
Explanation:
An MQSET call was issued to set queue attributes, but the queue had not been opened for set.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify MQOO_SET when the object is opened.

2041 (X'07F9')MQRC_OBJECT_CHANGED
Explanation:
Object definitions that affect this object have been changed since the Hobj handle used on this call was returned by the MQOPEN call. See the description of MQOPEN in the WebSphere MQ Application Programming Guide for more information.

This reason does not occur if the object handle is specified in the Context field of the PutMsgOpts parameter on the MQPUT or MQPUT1 call.

Completion Code:
MQCC_FAILED

Programmer Response:
Issue an MQCLOSE call to return the handle to the system. It is then usually sufficient to reopen the object and retry the operation. However, if the object definitions are critical to the application logic, an MQINQ call can be used after reopening the object, to obtain the new values of the object attributes.

2042 (X'07FA')MQRC_OBJECT_IN_USE
Explanation:
An MQOPEN call was issued, but the object in question has already been opened by this or another application with options that conflict with those specified in the Options parameter. This arises if the request is for shared input, but the object is already open for exclusive input; it also arises if the request is for exclusive input, but the object is already open for input (of any sort).

MCAs for receiver channels, or the intra-group queuing agent (IGQ agent), may keep the destination queues open even when messages are not being transmitted; this results in the queues appearing to be "in use". Use the MQSC command DISPLAY QSTATUS to find out who is keeping the queue open. 

On z/OS, this reason can also occur for an MQOPEN or MQPUT1 call, if the object to be opened (which can be a queue, or for MQOPEN a namelist or process object) is in the process of being deleted.
Completion Code:
MQCC_FAILED

Programmer Response:
System design should specify whether an application is to wait and retry, or take other action.

2043 (X'07FB')MQRC_OBJECT_TYPE_ERROR
Explanation:
On the MQOPEN or MQPUT1 call, the ObjectType field in the object descriptor MQOD specifies a value that is not valid. For the MQPUT1 call, the object type must be MQOT_Q.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a valid object type.

2044 (X'07FC')MQRC_OD_ERROR
Explanation:
On the MQOPEN or MQPUT1 call, the object descriptor MQOD is not valid, for one of the following reasons: 

The StrucId field is not MQOD_STRUC_ID. 
The Version field specifies a value that is not valid or not supported. 
The parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.) 
The queue manager cannot copy the changed structure to application storage, even though the call is successful. This can occur, for example, if the pointer points to read-only storage.
Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that input fields in the MQOD structure are set correctly.

2045 (X'07FD')MQRC_OPTION_NOT_VALID_FOR_TYPE
Explanation:
On an MQOPEN or MQCLOSE call, an option is specified that is not valid for the type of object or queue being opened or closed.

For the MQOPEN call, this includes the following cases: 

An option that is inappropriate for the object type (for example, MQOO_OUTPUT for an MQOT_PROCESS object). 
An option that is unsupported for the queue type (for example, MQOO_INQUIRE for a remote queue that has no local definition). 
One or more of the following options: 
MQOO_INPUT_AS_Q_DEF 
MQOO_INPUT_SHARED 
MQOO_INPUT_EXCLUSIVE 
MQOO_BROWSE 
MQOO_INQUIRE 
MQOO_SET
when either: 
the queue name is resolved through a cell directory, or 
ObjectQMgrName in the object descriptor specifies the name of a local definition of a remote queue (in order to specify a queue-manager alias), and the queue named in the RemoteQMgrName attribute of the definition is the name of the local queue manager.
For the MQCLOSE call, this includes the following case: 

The MQCO_DELETE or MQCO_DELETE_PURGE option when the queue is not a dynamic queue.
This reason code can also occur on the MQOPEN call when the object being opened is of type MQOT_NAMELIST, MQOT_PROCESS, or MQOT_Q_MGR, but the ObjectQMgrName field in MQOD is neither blank nor the name of the local queue manager.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify the correct option. For the MQOPEN call, ensure that the ObjectQMgrName field is set correctly. For the MQCLOSE call, either correct the option or change the definition type of the model queue that is used to create the new queue.

2046 (X'07FE')MQRC_OPTIONS_ERROR
Explanation:
The Options parameter or field contains options that are not valid, or a combination of options that is not valid. 

For the MQOPEN, MQCLOSE, MQXCNVC, mqBagToBuffer, mqBufferToBag, mqCreateBag, and mqExecute calls, Options is a separate parameter on the call. 
This reason also occurs if the parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

For the MQBEGIN, MQCONNX, MQGET, MQPUT, and MQPUT1 calls, Options is a field in the relevant options structure (MQBO, MQCNO, MQGMO, or MQPMO).
Completion Code:
MQCC_FAILED

Programmer Response:
Specify valid options. Check the description of the Options parameter or field to determine which options and combinations of options are valid. If multiple options are being set by adding the individual options together, ensure that the same option is not added twice.

2047 (X'07FF')MQRC_PERSISTENCE_ERROR
Explanation:
On an MQPUT or MQPUT1 call, the value specified for the Persistence field in the message descriptor MQMD is not valid.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify one of the following values: 

MQPER_PERSISTENT 
MQPER_NOT_PERSISTENT 
MQPER_PERSISTENCE_AS_Q_DEF
2048 (X'0800')MQRC_PERSISTENT_NOT_ALLOWED
Explanation:
On an MQPUT or MQPUT1 call, the value specified for the Persistence field in MQMD (or obtained from the DefPersistence queue attribute) specifies MQPER_PERSISTENT, but the queue on which the message is being placed does not support persistent messages. Persistent messages cannot be placed on temporary dynamic queues.

This reason code can also occur in the Feedback field in the message descriptor of a report message; in this case it indicates that the error was encountered by a message channel agent when it attempted to put the message on a remote queue.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify MQPER_NOT_PERSISTENT if the message is to be placed on a temporary dynamic queue. If persistence is required, use a permanent dynamic queue or predefined queue in place of a temporary dynamic queue.

Be aware that server applications are recommended to send reply messages (message type MQMT_REPLY) with the same persistence as the original request message (message type MQMT_REQUEST). If the request message is persistent, the reply queue specified in the ReplyToQ field in the message descriptor MQMD cannot be a temporary dynamic queue. Use a permanent dynamic queue or predefined queue as the reply queue in this situation.

2049 (X'0801')MQRC_PRIORITY_EXCEEDS_MAXIMUM
Explanation:
An MQPUT or MQPUT1 call was issued, but the value of the Priority field in the message descriptor MQMD exceeds the maximum priority supported by the local queue manager, as shown by the MaxPriority queue-manager attribute. The message is accepted by the queue manager, but is placed on the queue at the queue manager's maximum priority. The Priority field in the message descriptor retains the value specified by the application that put the message.

Completion Code:
MQCC_WARNING

Programmer Response:
None required, unless this reason code was not expected by the application that put the message.

2050 (X'0802')MQRC_PRIORITY_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the value of the Priority field in the message descriptor MQMD is not valid. The maximum priority supported by the queue manager is given by the MaxPriority queue-manager attribute.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a value in the range zero through MaxPriority, or the special value MQPRI_PRIORITY_AS_Q_DEF.

2051 (X'0803')MQRC_PUT_INHIBITED
Explanation:
MQPUT and MQPUT1 calls are currently inhibited for the queue, or for the queue to which this queue resolves.

This reason code can also occur in the Feedback field in the message descriptor of a report message; in this case it indicates that the error was encountered by a message channel agent when it attempted to put the message on a remote queue.

Completion Code:
MQCC_FAILED

Programmer Response:
If the system design allows put requests to be inhibited for short periods, retry the operation later.

2052 (X'0804')MQRC_Q_DELETED
Explanation:
An Hobj queue handle specified on a call refers to a dynamic queue that has been deleted since the queue was opened. (See the description of MQCLOSE in the WebSphere MQ Application Programming Guide for information about the deletion of dynamic queues.) 

On z/OS, this can also occur with the MQOPEN and MQPUT1 calls if a dynamic queue is being opened, but the queue is in a logically-deleted state. See MQCLOSE for more information about this.
Completion Code:
MQCC_FAILED

Programmer Response:
Issue an MQCLOSE call to return the handle and associated resources to the system (the MQCLOSE call will succeed in this case). Check the design of the application that caused the error.

2053 (X'0805')MQRC_Q_FULL
Explanation:
An MQPUT or MQPUT1 call, or a command, failed because the queue is full, that is, it already contains the maximum number of messages possible, as specified by the MaxQDepth queue attribute.

This reason code can also occur in the Feedback field in the message descriptor of a report message; in this case it indicates that the error was encountered by a message channel agent when it attempted to put the message on a remote queue.

Completion Code:
MQCC_FAILED

Programmer Response:
Retry the operation later. Consider increasing the maximum depth for this queue, or arranging for more instances of the application to service the queue.

2055 (X'0807')MQRC_Q_NOT_EMPTY
Explanation:
An MQCLOSE call was issued for a permanent dynamic queue, but the call failed because the queue is not empty or still in use. One of the following applies: 

The MQCO_DELETE option was specified, but there are messages on the queue. 
The MQCO_DELETE or MQCO_DELETE_PURGE option was specified, but there are uncommitted get or put calls outstanding against the queue.
See the usage notes pertaining to dynamic queues for the MQCLOSE call for more information.

This reason code is also returned from a command to clear or delete or move a queue, if the queue contains uncommitted messages (or committed messages in the case of delete queue without the purge option).

Completion Code:
MQCC_FAILED

Programmer Response:
Check why there might be messages on the queue. Be aware that the CurrentQDepth queue attribute might be zero even though there are one or more messages on the queue; this can happen if the messages have been retrieved as part of a unit of work that has not yet been committed. If the messages can be discarded, try using the MQCLOSE call with the MQCO_DELETE_PURGE option. Consider retrying the call later.

2056 (X'0808')MQRC_Q_SPACE_NOT_AVAILABLE
Explanation:
An MQPUT or MQPUT1 call was issued, but there is no space available for the queue on disk or other storage device.

This reason code can also occur in the Feedback field in the message descriptor of a report message; in this case it indicates that the error was encountered by a message channel agent when it attempted to put the message on a remote queue. 

On z/OS, this reason code does not occur.
Completion Code:
MQCC_FAILED

Programmer Response:
Check whether an application is putting messages in an infinite loop. If not, make more disk space available for the queue.

2057 (X'0809')MQRC_Q_TYPE_ERROR
Explanation:
One of the following occurred: 

On an MQOPEN call, the ObjectQMgrName field in the object descriptor MQOD or object record MQOR specifies the name of a local definition of a remote queue (in order to specify a queue-manager alias), and in that local definition the RemoteQMgrName attribute is the name of the local queue manager. However, the ObjectName field in MQOD or MQOR specifies the name of a model queue on the local queue manager; this is not allowed. See the WebSphere MQ Application Programming Guide for more information. 
On an MQPUT1 call, the object descriptor MQOD or object record MQOR specifies the name of a model queue. 
On a previous MQPUT or MQPUT1 call, the ReplyToQ field in the message descriptor specified the name of a model queue, but a model queue cannot be specified as the destination for reply or report messages. Only the name of a predefined queue, or the name of the dynamic queue created from the model queue, can be specified as the destination. In this situation the reason code MQRC_Q_TYPE_ERROR is returned in the Reason field of the MQDLH structure when the reply message or report message is placed on the dead-letter queue.
Completion Code:
MQCC_FAILED

Programmer Response:
Specify a valid queue.

2058 (X'080A')MQRC_Q_MGR_NAME_ERROR
Explanation:
On an MQCONN or MQCONNX call, the value specified for the QMgrName parameter is not valid or not known. This reason also occurs if the parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.) 

On z/OS for CICS applications, this reason can occur on any call if the original connect specified an incorrect or unrecognized name.
This reason code can also occur if an MQ client application attempts to connect to a queue manager within an MQ-client queue-manager group (see the QMgrName parameter of MQCONN), and either: 

Queue-manager groups are not supported. 
There is no queue-manager group with the specified name.
Completion Code:
MQCC_FAILED

Programmer Response:
Use an all-blank name if possible, or verify that the name used is valid.

2059 (X'080B')MQRC_Q_MGR_NOT_AVAILABLE
Explanation:
This occurs: 

On an MQCONN or MQCONNX call, the queue manager identified by the QMgrName parameter is not available for connection. 
On z/OS: 
For batch applications, this reason can be returned to applications running in LPARs that do not have a queue manager installed. 
For CICS applications, this reason can occur on any call if the original connect specified a queue manager whose name was recognized, but which is not available.
On i5/OS, this reason can also be returned by the MQOPEN and MQPUT1 calls, when MQHC_DEF_HCONN is specified for the Hconn parameter by an application running in compatibility mode.
On an MQCONN or MQCONNX call from an MQ client application: 
Attempting to connect to a queue manager within an MQ-client queue-manager group when none of the queue managers in the group is available for connection (see the QMgrName parameter of the MQCONN call). 
If there is an error with the client-connection or the corresponding server-connection channel definitions. 
On z/OS, if the optional OS/390 Client Attachment feature has not been installed.
If a command uses the CommandScope parameter specfying a queue manager that is not active in the queue-sharing group.
Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the queue manager has been started. If the connection is from a client application, check the channel definitions.

2061 (X'080D')MQRC_REPORT_OPTIONS_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the Report field in the message descriptor MQMD contains one or more options that are not recognized by the local queue manager. The options that cause this reason code to be returned depend on the destination of the message; see the description of REPORT in the WebSphere MQ Application Programming Guide for more details.

This reason code can also occur in the Feedback field in the MQMD of a report message, or in the Reason field in the MQDLH structure of a message on the dead-letter queue; in both cases it indicates that the destination queue manager does not support one or more of the report options specified by the sender of the message.

Completion Code:
MQCC_FAILED

Programmer Response:
Do the following: 

Ensure that the Report field in the message descriptor is initialized with a value when the message descriptor is declared, or is assigned a value prior to the MQPUT or MQPUT1 call. Specify MQRO_NONE if no report options are required. 
Ensure that the report options specified are valid; see the Report field described in the description of MQMD in the WebSphere MQ Application Programming Guide for valid report options. 
If multiple report options are being set by adding the individual report options together, ensure that the same report option is not added twice. 
Check that conflicting report options are not specified. For example, do not add both MQRO_EXCEPTION and MQRO_EXCEPTION_WITH_DATA to the Report field; only one of these can be specified.
2062 (X'080E')MQRC_SECOND_MARK_NOT_ALLOWED
Explanation:
An MQGET call was issued specifying the MQGMO_MARK_SKIP_BACKOUT option in the Options field of MQGMO, but a message has already been marked within the current unit of work. Only one marked message is allowed within each unit of work.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Modify the application so that no more than one message is marked within each unit of work.

2063 (X'080F')MQRC_SECURITY_ERROR
Explanation:
An MQCONN, MQCONNX, MQOPEN, MQPUT1, or MQCLOSE call was issued, but it failed because a security error occurred. 

On z/OS, the security error was returned by the External Security Manager.
Completion Code:
MQCC_FAILED

Programmer Response:
Note the error from the security manager, and contact your system programmer or security administrator. 

On i5/OS, the FFST log will contain the error information.
2065 (X'0811')MQRC_SELECTOR_COUNT_ERROR
Explanation:
On an MQINQ or MQSET call, the SelectorCount parameter specifies a value that is not valid. This reason also occurs if the parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a value in the range 0 through 256.

2066 (X'0812')MQRC_SELECTOR_LIMIT_EXCEEDED
Explanation:
On an MQINQ or MQSET call, the SelectorCount parameter specifies a value that is larger than the maximum supported (256).

Completion Code:
MQCC_FAILED

Programmer Response:
Reduce the number of selectors specified on the call; the valid range is 0 through 256.

2067 (X'0813')MQRC_SELECTOR_ERROR
Explanation:
An MQINQ or MQSET call was issued, but the Selectors array contains a selector that is not valid for one of the following reasons: 

The selector is not supported or out of range. 
The selector is not applicable to the type of object whose attributes are being inquired or set. 
The selector is for an attribute that cannot be set.
This reason also occurs if the parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the value specified for the selector is valid for the object type represented by Hobj. For the MQSET call, also ensure that the selector represents an integer attribute that can be set.

2068 (X'0814')MQRC_SELECTOR_NOT_FOR_TYPE
Explanation:
On the MQINQ call, one or more selectors in the Selectors array is not applicable to the type of the queue whose attributes are being inquired.

This reason also occurs when the queue is a cluster queue that resolved to a remote instance of the queue. In this case only a subset of the attributes that are valid for local queues can be inquired. See the usage notes in the description of MQINQ in the WebSphere MQ Application Programming Guide for further details.

The call completes with MQCC_WARNING, with the attribute values for the inapplicable selectors set as follows: 

For integer attributes, the corresponding elements of IntAttrs are set to MQIAV_NOT_APPLICABLE. 
For character attributes, the appropriate parts of the CharAttrs string are set to a character string consisting entirely of asterisks (*).
Completion Code:
MQCC_WARNING

Programmer Response:
Verify that the selector specified is the one that was intended.

If the queue is a cluster queue, specifying one of the MQOO_BROWSE, MQOO_INPUT_*, or MQOO_SET options in addition to MQOO_INQUIRE forces the queue to resolve to the local instance of the queue. However, if there is no local instance of the queue the MQOPEN call fails.

2069 (X'0815')MQRC_SIGNAL_OUTSTANDING
Explanation:
An MQGET call was issued with either the MQGMO_SET_SIGNAL or MQGMO_WAIT option, but there is already a signal outstanding for the queue handle Hobj.

This reason code occurs only in the following environments: z/OS, Windows 95, Windows 98.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the application logic. If it is necessary to set a signal or wait when there is a signal outstanding for the same queue, a different object handle must be used.

2070 (X'0816')MQRC_SIGNAL_REQUEST_ACCEPTED
Explanation:
An MQGET call was issued specifying MQGMO_SET_SIGNAL in the GetMsgOpts parameter, but no suitable message was available; the call returns immediately. The application can now wait for the signal to be delivered. 

On z/OS, the application should wait on the Event Control Block pointed to by the Signal1 field. 
On Windows 95, Windows 98, the application should wait for the signal Windows message to be delivered.
This reason code occurs only in the following environments: z/OS, Windows 95, Windows 98.

Completion Code:
MQCC_WARNING

Programmer Response:
Wait for the signal; when it is delivered, check the signal to ensure that a message is now available. If it is, reissue the MQGET call. 

On z/OS, wait on the ECB pointed to by the Signal1 field and, when it is posted, check it to ensure that a message is now available. 
On Windows 95, Windows 98, the application (thread) should continue executing its message loop.
2071 (X'0817')MQRC_STORAGE_NOT_AVAILABLE
Explanation:
The call failed because there is insufficient main storage available.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that active applications are behaving correctly, for example, that they are not looping unexpectedly. If no problems are found, make more main storage available. 

On z/OS, if no application problems are found, ask your system programmer to increase the size of the region in which the queue manager runs.
2072 (X'0818')MQRC_SYNCPOINT_NOT_AVAILABLE
Explanation:
Either MQGMO_SYNCPOINT was specified on an MQGET call or MQPMO_SYNCPOINT was specified on an MQPUT or MQPUT1 call, but the local queue manager was unable to honor the request. If the queue manager does not support units of work, the SyncPoint queue-manager attribute will have the value MQSP_NOT_AVAILABLE.

This reason code can also occur on the MQGET, MQPUT, and MQPUT1 calls when an external unit-of-work coordinator is being used. If that coordinator requires an explicit call to start the unit of work, but the application has not issued that call prior to the MQGET, MQPUT, or MQPUT1 call, reason code MQRC_SYNCPOINT_NOT_AVAILABLE is returned. 

On i5/OS, this reason codes means that i5/OS Commitment Control is not started, or is unavailable for use by the queue manager. 
On z/OS, this reason code does not occur.
Completion Code:
MQCC_FAILED

Programmer Response:
Remove the specification of MQGMO_SYNCPOINT or MQPMO_SYNCPOINT, as appropriate. 

On i5/OS, ensure that Commitment Control has been started. If this reason code occurs after Commitment Control has been started, contact your system programmer.
2075 (X'081B')MQRC_TRIGGER_CONTROL_ERROR
Explanation:
On an MQSET call, the value specified for the MQIA_TRIGGER_CONTROL attribute selector is not valid.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a valid value.

2076 (X'081C')MQRC_TRIGGER_DEPTH_ERROR
Explanation:
On an MQSET call, the value specified for the MQIA_TRIGGER_DEPTH attribute selector is not valid.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a value that is greater than zero.

2077 (X'081D')MQRC_TRIGGER_MSG_PRIORITY_ERR
Explanation:
On an MQSET call, the value specified for the MQIA_TRIGGER_MSG_PRIORITY attribute selector is not valid.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a value in the range zero through the value of MaxPriority queue-manager attribute.

2078 (X'081E')MQRC_TRIGGER_TYPE_ERROR
Explanation:
On an MQSET call, the value specified for the MQIA_TRIGGER_TYPE attribute selector is not valid.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a valid value.

2079 (X'081F')MQRC_TRUNCATED_MSG_ACCEPTED
Explanation:
On an MQGET call, the message length was too large to fit into the supplied buffer. The MQGMO_ACCEPT_TRUNCATED_MSG option was specified, so the call completes. The message is removed from the queue (subject to unit-of-work considerations), or, if this was a browse operation, the browse cursor is advanced to this message.

The DataLength parameter is set to the length of the message before truncation, the Buffer parameter contains as much of the message as fits, and the MQMD structure is filled in.

Completion Code:
MQCC_WARNING

Programmer Response:
None, because the application expected this situation.

2080 (X'0820')MQRC_TRUNCATED_MSG_FAILED
Explanation:
On an MQGET call, the message length was too large to fit into the supplied buffer. The MQGMO_ACCEPT_TRUNCATED_MSG option was not specified, so the message has not been removed from the queue. If this was a browse operation, the browse cursor remains where it was before this call, but if MQGMO_BROWSE_FIRST was specified, the browse cursor is positioned logically before the highest-priority message on the queue.

The DataLength field is set to the length of the message before truncation, the Buffer parameter contains as much of the message as fits, and the MQMD structure is filled in.

Completion Code:
MQCC_WARNING

Programmer Response:
Supply a buffer that is at least as large as DataLength, or specify MQGMO_ACCEPT_TRUNCATED_MSG if not all of the message data is required.

2082 (X'0822')MQRC_UNKNOWN_ALIAS_BASE_Q
Explanation:
An MQOPEN or MQPUT1 call was issued specifying an alias queue as the target, but the BaseQName in the alias queue attributes is not recognized as a queue name.

This reason code can also occur when BaseQName is the name of a cluster queue that cannot be resolved successfully.

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the queue definitions.

2085 (X'0825')MQRC_UNKNOWN_OBJECT_NAME
Explanation:
An MQOPEN or MQPUT1 call was issued, but the object identified by the ObjectName and ObjectQMgrName fields in the object descriptor MQOD cannot be found. One of the following applies: 

The ObjectQMgrName field is one of the following: 
Blank 
The name of the local queue manager 
The name of a local definition of a remote queue (a queue-manager alias) in which the RemoteQMgrName attribute is the name of the local queue manager
but no object with the specified ObjectName and ObjectType exists on the local queue manager. 
The object being opened is a cluster queue that is hosted on a remote queue manager, but the local queue manager does not have a defined route to the remote queue manager. 
The object being opened is a queue definition that has QSGDISP(GROUP). Such definitions cannot be used with the MQOPEN and MQPUT1 calls.
This can also occur in response to a command that specifies the name of an object or other item that does not exist.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a valid object name. Ensure that the name is padded to the right with blanks if necessary. If this is correct, check the object definitions.

2086 (X'0826')MQRC_UNKNOWN_OBJECT_Q_MGR
Explanation:
On an MQOPEN or MQPUT1 call, the ObjectQMgrName field in the object descriptor MQOD does not satisfy the naming rules for objects. For more information, see the WebSphere MQ Application Programming Guide.

This reason also occurs if the ObjectType field in the object descriptor has the value MQOT_Q_MGR, and the ObjectQMgrName field is not blank, but the name specified is not the name of the local queue manager.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a valid queue manager name. To refer to the local queue manager, a name consisting entirely of blanks or beginning with a null character can be used. Ensure that the name is padded to the right with blanks or terminated with a null character if necessary.

2087 (X'0827')MQRC_UNKNOWN_REMOTE_Q_MGR
Explanation:
On an MQOPEN or MQPUT1 call, an error occurred with the queue-name resolution, for one of the following reasons: 

ObjectQMgrName is blank or the name of the local queue manager, ObjectName is the name of a local definition of a remote queue (or an alias to one), and one of the following is true: 
RemoteQMgrName is blank or the name of the local queue manager. Note that this error occurs even if XmitQName is not blank. 
XmitQName is blank, but there is no transmission queue defined with the name of RemoteQMgrName, and the DefXmitQName queue-manager attribute is blank. 
RemoteQMgrName and RemoteQName specify a cluster queue that cannot be resolved successfully, and the DefXmitQName queue-manager attribute is blank.
ObjectQMgrName is the name of a local definition of a remote queue (containing a queue-manager alias definition), and one of the following is true: 
RemoteQName is not blank. 
XmitQName is blank, but there is no transmission queue defined with the name of RemoteQMgrName, and the DefXmitQName queue-manager attribute is blank.
ObjectQMgrName is not: 
Blank 
The name of the local queue manager 
The name of a transmission queue 
The name of a queue-manager alias definition (that is, a local definition of a remote queue with a blank RemoteQName)
but the DefXmitQName queue-manager attribute is blank and the queue manager is not part of a queue-sharing group with intra-group queuing enabled. 
ObjectQMgrName is the name of a model queue. 
The queue name is resolved through a cell directory. However, there is no queue defined with the same name as the remote queue manager name obtained from the cell directory, and the DefXmitQName queue-manager attribute is blank.
Completion Code:
MQCC_FAILED

Programmer Response:
Check the values specified for ObjectQMgrName and ObjectName. If these are correct, check the queue definitions.

2090 (X'082A')MQRC_WAIT_INTERVAL_ERROR
Explanation:
On the MQGET call, the value specified for the WaitInterval field in the GetMsgOpts parameter is not valid.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a value greater than or equal to zero, or the special value MQWI_UNLIMITED if an indefinite wait is required.

2091 (X'082B')MQRC_XMIT_Q_TYPE_ERROR
Explanation:
On an MQOPEN or MQPUT1 call, a message is to be sent to a remote queue manager. The ObjectName or ObjectQMgrName field in the object descriptor specifies the name of a local definition of a remote queue but one of the following applies to the XmitQName attribute of the definition: 

XmitQName is not blank, but specifies a queue that is not a local queue 
XmitQName is blank, but RemoteQMgrName specifies a queue that is not a local queue
This reason also occurs if the queue name is resolved through a cell directory, and the remote queue manager name obtained from the cell directory is the name of a queue, but this is not a local queue.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the values specified for ObjectName and ObjectQMgrName. If these are correct, check the queue definitions. For more information on transmission queues, see the WebSphere MQ Application Programming Guide.

2092 (X'082C')MQRC_XMIT_Q_USAGE_ERROR
Explanation:
On an MQOPEN or MQPUT1 call, a message is to be sent to a remote queue manager, but one of the following occurred: 

ObjectQMgrName specifies the name of a local queue, but it does not have a Usage attribute of MQUS_TRANSMISSION. 
The ObjectName or ObjectQMgrName field in the object descriptor specifies the name of a local definition of a remote queue but one of the following applies to the XmitQName attribute of the definition: 
XmitQName is not blank, but specifies a queue that does not have a Usage attribute of MQUS_TRANSMISSION 
XmitQName is blank, but RemoteQMgrName specifies a queue that does not have a Usage attribute of MQUS_TRANSMISSION 
XmitQName specifies the queue SYSTEM.QSG.TRANSMIT.QUEUE the IGQ queue manager attribute indicates that IGQ is DISABLED.
The queue name is resolved through a cell directory, and the remote queue manager name obtained from the cell directory is the name of a local queue, but it does not have a Usage attribute of MQUS_TRANSMISSION.
Completion Code:
MQCC_FAILED

Programmer Response:
Check the values specified for ObjectName and ObjectQMgrName. If these are correct, check the queue definitions. For more information on transmission queues, see the WebSphere MQ Application Programming Guide.

2093 (X'082D')MQRC_NOT_OPEN_FOR_PASS_ALL
Explanation:
An MQPUT call was issued with the MQPMO_PASS_ALL_CONTEXT option specified in the PutMsgOpts parameter, but the queue had not been opened with the MQOO_PASS_ALL_CONTEXT option.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify MQOO_PASS_ALL_CONTEXT (or another option that implies it) when the queue is opened.

2094 (X'082E')MQRC_NOT_OPEN_FOR_PASS_IDENT
Explanation:
An MQPUT call was issued with the MQPMO_PASS_IDENTITY_CONTEXT option specified in the PutMsgOpts parameter, but the queue had not been opened with the MQOO_PASS_IDENTITY_CONTEXT option.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify MQOO_PASS_IDENTITY_CONTEXT (or another option that implies it) when the queue is opened.

2095 (X'082F')MQRC_NOT_OPEN_FOR_SET_ALL
Explanation:
An MQPUT call was issued with the MQPMO_SET_ALL_CONTEXT option specified in the PutMsgOpts parameter, but the queue had not been opened with the MQOO_SET_ALL_CONTEXT option.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify MQOO_SET_ALL_CONTEXT when the queue is opened.

2096 (X'0830')MQRC_NOT_OPEN_FOR_SET_IDENT
Explanation:
An MQPUT call was issued with the MQPMO_SET_IDENTITY_CONTEXT option specified in the PutMsgOpts parameter, but the queue had not been opened with the MQOO_SET_IDENTITY_CONTEXT option.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify MQOO_SET_IDENTITY_CONTEXT (or another option that implies it) when the queue is opened.

2097 (X'0831')MQRC_CONTEXT_HANDLE_ERROR
Explanation:
On an MQPUT or MQPUT1 call, MQPMO_PASS_IDENTITY_CONTEXT or MQPMO_PASS_ALL_CONTEXT was specified, but the handle specified in the Context field of the PutMsgOpts parameter is either not a valid queue handle, or it is a valid queue handle but the queue was not opened with MQOO_SAVE_ALL_CONTEXT.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify MQOO_SAVE_ALL_CONTEXT when the queue referred to is opened.

2098 (X'0832')MQRC_CONTEXT_NOT_AVAILABLE
Explanation:
On an MQPUT or MQPUT1 call, MQPMO_PASS_IDENTITY_CONTEXT or MQPMO_PASS_ALL_CONTEXT was specified, but the queue handle specified in the Context field of the PutMsgOpts parameter has no context associated with it. This arises if no message has yet been successfully retrieved with the queue handle referred to, or if the last successful MQGET call was a browse.

This condition does not arise if the message that was last retrieved had no context associated with it. 

On z/OS, if a message is received by a message channel agent that is putting messages with the authority of the user identifier in the message, this code is returned in the Feedback field of an exception report if the message has no context associated with it.
Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that a successful nonbrowse get call has been issued with the queue handle referred to.

2099 (X'0833')MQRC_SIGNAL1_ERROR
Explanation:
An MQGET call was issued, specifying MQGMO_SET_SIGNAL in the GetMsgOpts parameter, but the Signal1 field is not valid. 

On z/OS, the address contained in the Signal1 field is not valid, or points to read-only storage. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.) 
On Windows 95, Windows 98, the window handle in the Signal1 field is not valid.
This reason code occurs only in the following environments: z/OS, Windows 95, Windows 98.

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the setting of the Signal1 field.

2100 (X'0834')MQRC_OBJECT_ALREADY_EXISTS
Explanation:
An MQOPEN call was issued to create a dynamic queue, but a queue with the same name as the dynamic queue already exists. 

On z/OS, a rare "race condition" can also give rise to this reason code; see the description of reason code MQRC_NAME_IN_USE for more details.
Completion Code:
MQCC_FAILED

Programmer Response:
If supplying a dynamic queue name in full, ensure that it obeys the naming conventions for dynamic queues; if it does, either supply a different name, or delete the existing queue if it is no longer required. Alternatively, allow the queue manager to generate the name.

If the queue manager is generating the name (either in part or in full), reissue the MQOPEN call.

2101 (X'0835')MQRC_OBJECT_DAMAGED
Explanation:
The object accessed by the call is damaged and cannot be used. For example, this may be because the definition of the object in main storage is not consistent, or because it differs from the definition of the object on disk, or because the definition on disk cannot be read. The object can be deleted, although it may not be possible to delete the associated user space. 

On z/OS, this reason occurs when the DB2 list header or structure number associated with a shared queue is zero. This situation arises as a result of using the MQSC command DELETE CFSTRUCT to delete the DB2 structure definition. The command resets the list header and structure number to zero for each of the shared queues that references the deleted CF strcture.
Completion Code:
MQCC_FAILED

Programmer Response:
It may be necessary to stop and restart the queue manager, or to restore the queue-manager data from back-up storage. 

On HP OpenVMS, OS/2, i5/OS, Compaq NonStop Kernel, and UNIX systems, consult the FFST(TM) record to obtain more detail about the problem. 
On z/OS, delete the shared queue and redefine it using the MQSC command DEFINE QLOCAL. This will automatically define a CF structure and allocate list headers for it.
2102 (X'0836')MQRC_RESOURCE_PROBLEM
Explanation:
There are insufficient system resources to complete the call successfully.

Completion Code:
MQCC_FAILED

Programmer Response:
Run the application when the machine is less heavily loaded. 

On z/OS, check the operator console for messages that may provide additional information. 
On HP OpenVMS, OS/2, i5/OS, Compaq NonStop Kernel, and UNIX systems, consult the FFST record to obtain more detail about the problem.
2103 (X'0837')MQRC_ANOTHER_Q_MGR_CONNECTED
Explanation:
An MQCONN or MQCONNX call was issued, but the thread or process is already connected to a different queue manager. The thread or process can connect to only one queue manager at a time. 

On z/OS, this reason code does not occur. 
On Windows, MTS objects do not receive this reason code, as connections to other queue managers are allowed.
Completion Code:
MQCC_FAILED

Programmer Response:
Use the MQDISC call to disconnect from the queue manager that is already connected, and then issue the MQCONN or MQCONNX call to connect to the new queue manager.

Disconnecting from the existing queue manager will close any queues that are currently open; it is recommended that any uncommitted units of work should be committed or backed out before the MQDISC call is issued.

2104 (X'0838')MQRC_UNKNOWN_REPORT_OPTION
Explanation:
An MQPUT or MQPUT1 call was issued, but the Report field in the message descriptor MQMD contains one or more options that are not recognized by the local queue manager. The options are accepted.

The options that cause this reason code to be returned depend on the destination of the message; see the description of REPORT in the WebSphere MQ Application Programming Guide for more details.

Completion Code:
MQCC_WARNING

Programmer Response:
If this reason code is expected, no corrective action is required. If this reason code is not expected, do the following: 

Ensure that the Report field in the message descriptor is initialized with a value when the message descriptor is declared, or is assigned a value prior to the MQPUT or MQPUT1 call. 
Ensure that the report options specified are valid; see the Report field described in the description of MQMD in the WebSphere MQ Application Programming Guide for valid report options. 
If multiple report options are being set by adding the individual report options together, ensure that the same report option is not added twice. 
Check that conflicting report options are not specified. For example, do not add both MQRO_EXCEPTION and MQRO_EXCEPTION_WITH_DATA to the Report field; only one of these can be specified.
2105 (X'0839')MQRC_STORAGE_CLASS_ERROR
Explanation:
The MQPUT or MQPUT1 call was issued, but the storage-class object defined for the queue does not exist.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Create the storage-class object required by the queue, or modify the queue definition to use an existing storage class. The name of the storage-class object used by the queue is given by the StorageClass queue attribute.

2106 (X'083A')MQRC_COD_NOT_VALID_FOR_XCF_Q
Explanation:
An MQPUT or MQPUT1 call was issued, but the Report field in the message descriptor MQMD specifies one of the MQRO_COD_* options and the target queue is an XCF queue. MQRO_COD_* options cannot be specified for XCF queues.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Remove the relevant MQRO_COD_* option.

2107 (X'083B')MQRC_XWAIT_CANCELED
Explanation:
An MQXWAIT call was issued, but the call has been canceled because a STOP CHINIT command has been issued (or the queue manager has been stopped, which causes the same effect). Refer to the WebSphere MQ Intercommunication book for details of the MQXWAIT call.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Tidy up and terminate.

2108 (X'083C')MQRC_XWAIT_ERROR
Explanation:
An MQXWAIT call was issued, but the invocation was not valid for one of the following reasons: 

The wait descriptor MQXWD contains data that is not valid. 
The linkage stack level is not valid. 
The addressing mode is not valid. 
There are too many wait events outstanding.
This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Obey the rules for using the MQXWAIT call. Refer to the WebSphere MQ Intercommunication book for details of this call.

2109 (X'083D')MQRC_SUPPRESSED_BY_EXIT
Explanation:
On any call other than MQCONN or MQDISC, the API crossing exit suppressed the call.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Obey the rules for MQI calls that the exit enforces. To find out the rules, see the writer of the exit.

2110 (X'083E')MQRC_FORMAT_ERROR
Explanation:
An MQGET call was issued with the MQGMO_CONVERT option specified in the GetMsgOpts parameter, but the message cannot be converted successfully due to an error associated with the message format. Possible errors include: 

The format name in the message is MQFMT_NONE. 
A user-written exit with the name specified by the Format field in the message cannot be found. 
The message contains data that is not consistent with the format definition.
The message is returned unconverted to the application issuing the MQGET call, the values of the CodedCharSetId and Encoding fields in the MsgDesc parameter are set to those of the message returned, and the call completes with MQCC_WARNING.

If the message consists of several parts, each of which is described by its own CodedCharSetId and Encoding fields (for example, a message with format name MQFMT_DEAD_LETTER_HEADER), some parts may be converted and other parts not converted. However, the values returned in the various CodedCharSetId and Encoding fields always correctly describe the relevant message data.

Completion Code:
MQCC_WARNING

Programmer Response:
Check the format name that was specified when the message was put. If this is not one of the built-in formats, check that a suitable exit with the same name as the format is available for the queue manager to load. Verify that the data in the message corresponds to the format expected by the exit.

2111 (X'083F')MQRC_SOURCE_CCSID_ERROR
Explanation:
The coded character-set identifier from which character data is to be converted is not valid or not supported.

This can occur on the MQGET call when the MQGMO_CONVERT option is included in the GetMsgOpts parameter; the coded character-set identifier in error is the CodedCharSetId field in the message being retrieved. In this case, the message data is returned unconverted, the values of the CodedCharSetId and Encoding fields in the MsgDesc parameter are set to those of the message returned, and the call completes with MQCC_WARNING.

This reason can also occur on the MQGET call when the message contains one or more MQ header structures (MQCIH, MQDLH, MQIIH, MQRMH), and the CodedCharSetId field in the message specifies a character set that does not have SBCS characters for the characters that are valid in queue names. MQ header structures containing such characters are not valid, and so the message is returned unconverted. The Unicode character set UCS-2 is an example of such a character set.

If the message consists of several parts, each of which is described by its own CodedCharSetId and Encoding fields (for example, a message with format name MQFMT_DEAD_LETTER_HEADER), some parts may be converted and other parts not converted. However, the values returned in the various CodedCharSetId and Encoding fields always correctly describe the relevant message data.

This reason can also occur on the MQXCNVC call; the coded character-set identifier in error is the SourceCCSID parameter. Either the SourceCCSID parameter specifies a value that is not valid or not supported, or the SourceCCSID parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
Check the character-set identifier that was specified when the message was put, or that was specified for the SourceCCSID parameter on the MQXCNVC call. If this is correct, check that it is one for which queue-manager conversion is supported. If queue-manager conversion is not supported for the specified character set, conversion must be carried out by the application.

2112 (X'0840')MQRC_SOURCE_INTEGER_ENC_ERROR
Explanation:
On an MQGET call, with the MQGMO_CONVERT option included in the GetMsgOpts parameter, the Encoding value in the message being retrieved specifies an integer encoding that is not recognized. The message data is returned unconverted, the values of the CodedCharSetId and Encoding fields in the MsgDesc parameter are set to those of the message returned, and the call completes with MQCC_WARNING.

If the message consists of several parts, each of which is described by its own CodedCharSetId and Encoding fields (for example, a message with format name MQFMT_DEAD_LETTER_HEADER), some parts may be converted and other parts not converted. However, the values returned in the various CodedCharSetId and Encoding fields always correctly describe the relevant message data.

This reason code can also occur on the MQXCNVC call, when the Options parameter contains an unsupported MQDCC_SOURCE_* value, or when MQDCC_SOURCE_ENC_UNDEFINED is specified for a UCS-2 code page.

Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
Check the integer encoding that was specified when the message was put. If this is correct, check that it is one for which queue-manager conversion is supported. If queue-manager conversion is not supported for the required integer encoding, conversion must be carried out by the application.

2113 (X'0841')MQRC_SOURCE_DECIMAL_ENC_ERROR
Explanation:
On an MQGET call with the MQGMO_CONVERT option included in the GetMsgOpts parameter, the Encoding value in the message being retrieved specifies a decimal encoding that is not recognized. The message data is returned unconverted, the values of the CodedCharSetId and Encoding fields in the MsgDesc parameter are set to those of the message returned, and the call completes with MQCC_WARNING.

If the message consists of several parts, each of which is described by its own CodedCharSetId and Encoding fields (for example, a message with format name MQFMT_DEAD_LETTER_HEADER), some parts may be converted and other parts not converted. However, the values returned in the various CodedCharSetId and Encoding fields always correctly describe the relevant message data.

Completion Code:
MQCC_WARNING

Programmer Response:
Check the decimal encoding that was specified when the message was put. If this is correct, check that it is one for which queue-manager conversion is supported. If queue-manager conversion is not supported for the required decimal encoding, conversion must be carried out by the application.

2114 (X'0842')MQRC_SOURCE_FLOAT_ENC_ERROR
Explanation:
On an MQGET call, with the MQGMO_CONVERT option included in the GetMsgOpts parameter, the Encoding value in the message being retrieved specifies a floating-point encoding that is not recognized. The message data is returned unconverted, the values of the CodedCharSetId and Encoding fields in the MsgDesc parameter are set to those of the message returned, and the call completes with MQCC_WARNING.

If the message consists of several parts, each of which is described by its own CodedCharSetId and Encoding fields (for example, a message with format name MQFMT_DEAD_LETTER_HEADER), some parts may be converted and other parts not converted. However, the values returned in the various CodedCharSetId and Encoding fields always correctly describe the relevant message data.

Completion Code:
MQCC_WARNING

Programmer Response:
Check the floating-point encoding that was specified when the message was put. If this is correct, check that it is one for which queue-manager conversion is supported. If queue-manager conversion is not supported for the required floating-point encoding, conversion must be carried out by the application.

2115 (X'0843')MQRC_TARGET_CCSID_ERROR
Explanation:
The coded character-set identifier to which character data is to be converted is not valid or not supported.

This can occur on the MQGET call when the MQGMO_CONVERT option is included in the GetMsgOpts parameter; the coded character-set identifier in error is the CodedCharSetId field in the MsgDesc parameter. In this case, the message data is returned unconverted, the values of the CodedCharSetId and Encoding fields in the MsgDesc parameter are set to those of the message returned, and the call completes with MQCC_WARNING.

This reason can also occur on the MQGET call when the message contains one or more MQ header structures (MQCIH, MQDLH, MQIIH, MQRMH), and the CodedCharSetId field in the MsgDesc parameter specifies a character set that does not have SBCS characters for the characters that are valid in queue names. The Unicode character set UCS-2 is an example of such a character set.

This reason can also occur on the MQXCNVC call; the coded character-set identifier in error is the TargetCCSID parameter. Either the TargetCCSID parameter specifies a value that is not valid or not supported, or the TargetCCSID parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
Check the character-set identifier that was specified for the CodedCharSetId field in the MsgDesc parameter on the MQGET call, or that was specified for the SourceCCSID parameter on the MQXCNVC call. If this is correct, check that it is one for which queue-manager conversion is supported. If queue-manager conversion is not supported for the specified character set, conversion must be carried out by the application.

2116 (X'0844')MQRC_TARGET_INTEGER_ENC_ERROR
Explanation:
On an MQGET call with the MQGMO_CONVERT option included in the GetMsgOpts parameter, the Encoding value in the MsgDesc parameter specifies an integer encoding that is not recognized. The message data is returned unconverted, the values of the CodedCharSetId and Encoding fields in the MsgDesc parameter are set to those of the message being retrieved, and the call completes with MQCC_WARNING.

This reason code can also occur on the MQXCNVC call, when the Options parameter contains an unsupported MQDCC_TARGET_* value, or when MQDCC_TARGET_ENC_UNDEFINED is specified for a UCS-2 code page.

Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
Check the integer encoding that was specified. If this is correct, check that it is one for which queue-manager conversion is supported. If queue-manager conversion is not supported for the required integer encoding, conversion must be carried out by the application.

2117 (X'0845')MQRC_TARGET_DECIMAL_ENC_ERROR
Explanation:
On an MQGET call with the MQGMO_CONVERT option included in the GetMsgOpts parameter, the Encoding value in the MsgDesc parameter specifies a decimal encoding that is not recognized. The message data is returned unconverted, the values of the CodedCharSetId and Encoding fields in the MsgDesc parameter are set to those of the message returned, and the call completes with MQCC_WARNING.

Completion Code:
MQCC_WARNING

Programmer Response:
Check the decimal encoding that was specified. If this is correct, check that it is one for which queue-manager conversion is supported. If queue-manager conversion is not supported for the required decimal encoding, conversion must be carried out by the application.

2118 (X'0846')MQRC_TARGET_FLOAT_ENC_ERROR
Explanation:
On an MQGET call with the MQGMO_CONVERT option included in the GetMsgOpts parameter, the Encoding value in the MsgDesc parameter specifies a floating-point encoding that is not recognized. The message data is returned unconverted, the values of the CodedCharSetId and Encoding fields in the MsgDesc parameter are set to those of the message returned, and the call completes with MQCC_WARNING.

Completion Code:
MQCC_WARNING

Programmer Response:
Check the floating-point encoding that was specified. If this is correct, check that it is one for which queue-manager conversion is supported. If queue-manager conversion is not supported for the required floating-point encoding, conversion must be carried out by the application.

2119 (X'0847')MQRC_NOT_CONVERTED
Explanation:
An MQGET call was issued with the MQGMO_CONVERT option specified in the GetMsgOpts parameter, but an error occurred during conversion of the data in the message. The message data is returned unconverted, the values of the CodedCharSetId and Encoding fields in the MsgDesc parameter are set to those of the message returned, and the call completes with MQCC_WARNING.

If the message consists of several parts, each of which is described by its own CodedCharSetId and Encoding fields (for example, a message with format name MQFMT_DEAD_LETTER_HEADER), some parts may be converted and other parts not converted. However, the values returned in the various CodedCharSetId and Encoding fields always correctly describe the relevant message data.

This error may also indicate that a parameter to the data-conversion service is not supported.

Completion Code:
MQCC_WARNING

Programmer Response:
Check that the message data is correctly described by the Format, CodedCharSetId and Encoding parameters that were specified when the message was put. Also check that these values, and the CodedCharSetId and Encoding specified in the MsgDesc parameter on the MQGET call, are supported for queue-manager conversion. If the required conversion is not supported, conversion must be carried out by the application.

2120 (X'0848')MQRC_CONVERTED_MSG_TOO_BIG
Explanation:
On an MQGET call with the MQGMO_CONVERT option included in the GetMsgOpts parameter, the message data expanded during data conversion and exceeded the size of the buffer provided by the application. However, the message had already been removed from the queue because prior to conversion the message data could be accommodated in the application buffer without truncation.

The message is returned unconverted, with the CompCode parameter of the MQGET call set to MQCC_WARNING. If the message consists of several parts, each of which is described by its own character-set and encoding fields (for example, a message with format name MQFMT_DEAD_LETTER_HEADER), some parts may be converted and other parts not converted. However, the values returned in the various character-set and encoding fields always correctly describe the relevant message data.

This reason can also occur on the MQXCNVC call, when the TargetBuffer parameter is too small too accommodate the converted string, and the string has been truncated to fit in the buffer. The length of valid data returned is given by the DataLength parameter; in the case of a DBCS string or mixed SBCS/DBCS string, this length may be less than the length of TargetBuffer.

Completion Code:
MQCC_WARNING

Programmer Response:
For the MQGET call, check that the exit is converting the message data correctly and setting the output length DataLength to the appropriate value. If it is, the application issuing the MQGET call must provide a larger buffer for the Buffer parameter.

For the MQXCNVC call, if the string must be converted without truncation, provide a larger output buffer.

2121 (X'0849')MQRC_NO_EXTERNAL_PARTICIPANTS
Explanation:
An MQBEGIN call was issued to start a unit of work coordinated by the queue manager, but no participating resource managers have been registered with the queue manager. As a result, only changes to MQ resources can be coordinated by the queue manager in the unit of work.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows.

Completion Code:
MQCC_WARNING

Programmer Response:
If the application does not require non-MQ resources to participate in the unit of work, this reason code can be ignored or the MQBEGIN call removed. Otherwise consult your system programmer to determine why the required resource managers have not been registered with the queue manager; the queue manager's configuration file may be in error.

2122 (X'084A')MQRC_PARTICIPANT_NOT_AVAILABLE
Explanation:
An MQBEGIN call was issued to start a unit of work coordinated by the queue manager, but one or more of the participating resource managers that had been registered with the queue manager is not available. As a result, changes to those resources cannot be coordinated by the queue manager in the unit of work.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows.

Completion Code:
MQCC_WARNING

Programmer Response:
If the application does not require non-MQ resources to participate in the unit of work, this reason code can be ignored. Otherwise consult your system programmer to determine why the required resource managers are not available. The resource manager may have been halted temporarily, or there may be an error in the queue manager's configuration file.

2123 (X'084B')MQRC_OUTCOME_MIXED
Explanation:
The queue manager is acting as the unit-of-work coordinator for a unit of work that involves other resource managers, but one of the following occurred: 

An MQCMIT or MQDISC call was issued to commit the unit of work, but one or more of the participating resource managers backed-out the unit of work instead of committing it. As a result, the outcome of the unit of work is mixed. 
An MQBACK call was issued to back out a unit of work, but one or more of the participating resource managers had already committed the unit of work.
This reason code occurs in the following environments: AIX, HP-UX, OS/2, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Examine the queue-manager error logs for messages relating to the mixed outcome; these messages identify the resource managers that are affected. Use procedures local to the affected resource managers to resynchronize the resources.

This reason code does not prevent the application initiating further units of work.

2124 (X'084C')MQRC_OUTCOME_PENDING
Explanation:
The queue manager is acting as the unit-of-work coordinator for a unit of work that involves other resource managers, and an MQCMIT or MQDISC call was issued to commit the unit of work, but one or more of the participating resource managers has not confirmed that the unit of work was committed successfully.

The completion of the commit operation will happen at some point in the future, but there remains the possibility that the outcome will be mixed.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, Solaris, Windows.

Completion Code:
MQCC_WARNING

Programmer Response:
Use the normal error-reporting mechanisms to determine whether the outcome was mixed. If it was, take appropriate action to resynchronize the resources.

This reason code does not prevent the application initiating further units of work.

2125 (X'084D')MQRC_BRIDGE_STARTED
Explanation:
The IMS bridge has been started.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2126 (X'084E')MQRC_BRIDGE_STOPPED
Explanation:
The IMS bridge has been stopped.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2127 (X'084F')MQRC_ADAPTER_STORAGE_SHORTAGE
Explanation:
On an MQCONN call, the adapter was unable to acquire storage.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Notify the system programmer. The system programmer should determine why the system is short on storage, and take appropriate action, for example, increase the region size on the step or job card.

2128 (X'0850')MQRC_UOW_IN_PROGRESS
Explanation:
An MQBEGIN call was issued to start a unit of work coordinated by the queue manager, but a unit of work is already in existence for the connection handle specified. This may be a global unit of work started by a previous MQBEGIN call, or a unit of work that is local to the queue manager or one of the cooperating resource managers. No more than one unit of work can exist concurrently for a connection handle.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Review the application logic to determine why there is a unit of work already in existence. Move the MQBEGIN call to the appropriate place in the application.

2129 (X'0851')MQRC_ADAPTER_CONN_LOAD_ERROR
Explanation:
On an MQCONN call, the connection handling module (CSQBCON for batch and CSQQCONN for IMS) could not be loaded, so the adapter could not link to it.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the correct library concatenation has been specified in the batch application program execution JCL, and in the queue-manager startup JCL.

2130 (X'0852')MQRC_ADAPTER_SERV_LOAD_ERROR
Explanation:
On an MQI call, the batch adapter could not load the API service module CSQBSRV, and so could not link to it.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the correct library concatenation has been specified in the batch application program execution JCL, and in the queue-manager startup JCL.

2131 (X'0853')MQRC_ADAPTER_DEFS_ERROR
Explanation:
On an MQCONN call, the subsystem definition module (CSQBDEFV for batch and CSQQDEFV for IMS) does not contain the required control block identifier.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Check your library concatenation. If this is correct, check that the CSQBDEFV or CSQQDEFV module contains the required subsystem ID.

2132 (X'0854')MQRC_ADAPTER_DEFS_LOAD_ERROR
Explanation:
On an MQCONN call, the subsystem definition module (CSQBDEFV for batch and CSQQDEFV for IMS) could not be loaded.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the correct library concatenation has been specified in the application program execution JCL, and in the queue-manager startup JCL.

2133 (X'0855')MQRC_ADAPTER_CONV_LOAD_ERROR
Explanation:
On an MQGET call, the adapter (batch or IMS) could not load the data conversion services modules.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the correct library concatenation has been specified in the batch application program execution JCL, and in the queue-manager startup JCL.

2134 (X'0856')MQRC_BO_ERROR
Explanation:
On an MQBEGIN call, the begin-options structure MQBO is not valid, for one of the following reasons: 

The StrucId field is not MQBO_STRUC_ID. 
The Version field is not MQBO_VERSION_1. 
The parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.) 
The queue manager cannot copy the changed structure to application storage, even though the call is successful. This can occur, for example, if the pointer points to read-only storage.
This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that input fields in the MQBO structure are set correctly.

2135 (X'0857')MQRC_DH_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQDH structure that is not valid. Possible errors include the following: 

The StrucId field is not MQDH_STRUC_ID. 
The Version field is not MQDH_VERSION_1. 
The StrucLength field specifies a value that is too small to include the structure plus the arrays of MQOR and MQPMR records. 
The CodedCharSetId field is zero, or a negative value that is not valid. 
The BufferLength parameter of the call has a value that is too small to accommodate the structure (the structure extends beyond the end of the message).
This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly. Ensure that the application sets the CodedCharSetId field to a valid value (note: MQCCSI_DEFAULT, MQCCSI_EMBEDDED, MQCCSI_Q_MGR, and MQCCSI_UNDEFINED are not valid in this field).

2136 (X'0858')MQRC_MULTIPLE_REASONS
Explanation:
An MQOPEN, MQPUT or MQPUT1 call was issued to open a distribution list or put a message to a distribution list, but the result of the call was not the same for all of the destinations in the list. One of the following applies: 

The call succeeded for some of the destinations but not others. The completion code is MQCC_WARNING in this case. 
The call failed for all of the destinations, but for differing reasons. The completion code is MQCC_FAILED in this case.
This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
Examine the MQRR response records to identify the destinations for which the call failed, and the reason for the failure. Ensure that sufficient response records are provided by the application on the call to enable the error(s) to be determined. For the MQPUT1 call, the response records must be specified using the MQOD structure, and not the MQPMO structure.

2137 (X'0859')MQRC_OPEN_FAILED
Explanation:
A queue or other MQ object could not be opened successfully, for one of the following reasons: 

An MQCONN or MQCONNX call was issued, but the queue manager was unable to open an object that is used internally by the queue manager. As a result, processing cannot continue. The error log will contain the name of the object that could not be opened. 
An MQPUT call was issued to put a message to a distribution list, but the message could not be sent to the destination to which this reason code applies because that destination was not opened successfully by the MQOPEN call. This reason occurs only in the Reason field of the MQRR response record.
This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Do one of the following: 

If the error occurred on the MQCONN or MQCONNX call, ensure that the required objects exist by running the following command and then retrying the application: 
STRMQM -c qmgrwhere qmgr should be replaced by the name of the queue manager. 
If the error occurred on the MQPUT call, examine the MQRR response records specified on the MQOPEN call to determine the reason that the queue failed to open. Ensure that sufficient response records are provided by the application on the call to enable the error(s) to be determined.
2138 (X'085A')MQRC_ADAPTER_DISC_LOAD_ERROR
Explanation:
On an MQDISC call, the disconnect handling module (CSQBDSC for batch and CSQQDISC for IMS) could not be loaded, so the adapter could not link to it.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the correct library concatenation has been specified in the application program execution JCL, and in the queue-manager startup JCL. Any uncommitted changes in a unit of work should be backed out. A unit of work that is coordinated by the queue manager is backed out automatically.

2139 (X'085B')MQRC_CNO_ERROR
Explanation:
On an MQCONNX call, the connect-options structure MQCNO is not valid, for one of the following reasons: 

The StrucId field is not MQCNO_STRUC_ID. 
The Version field specifies a value that is not valid or not supported. 
The parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.) 
The queue manager cannot copy the changed structure to application storage, even though the call is successful. This can occur, for example, if the parameter pointer points to read-only storage.
This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that input fields in the MQCNO structure are set correctly.

2140 (X'085C')MQRC_CICS_WAIT_FAILED
Explanation:
On any MQI call, the CICS adapter issued an EXEC CICS WAIT request, but the request was rejected by CICS.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Examine the CICS trace data for actual response codes. The most likely cause is that the task has been canceled by the operator or by the system.

2141 (X'085D')MQRC_DLH_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQDLH structure that is not valid. Possible errors include the following: 

The StrucId field is not MQDLH_STRUC_ID. 
The Version field is not MQDLH_VERSION_1. 
The CodedCharSetId field is zero, or a negative value that is not valid. 
The BufferLength parameter of the call has a value that is too small to accommodate the structure (the structure extends beyond the end of the message).
This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly. Ensure that the application sets the CodedCharSetId field to a valid value (note: MQCCSI_DEFAULT, MQCCSI_EMBEDDED, MQCCSI_Q_MGR, and MQCCSI_UNDEFINED are not valid in this field).

2142 (X'085E')MQRC_HEADER_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQ header structure that is not valid. Possible errors include the following: 

The StrucId field is not valid. 
The Version field is not valid. 
The StrucLength field specifies a value that is too small. 
The CodedCharSetId field is zero, or a negative value that is not valid. 
The BufferLength parameter of the call has a value that is too small to accommodate the structure (the structure extends beyond the end of the message).
This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly. Ensure that the application sets the CodedCharSetId field to a valid value (note: MQCCSI_DEFAULT, MQCCSI_EMBEDDED, MQCCSI_Q_MGR, and MQCCSI_UNDEFINED are not valid in this field).

2143 (X'085F')MQRC_SOURCE_LENGTH_ERROR
Explanation:
On the MQXCNVC call, the SourceLength parameter specifies a length that is less than zero or not consistent with the string's character set or content (for example, the character set is a double-byte character set, but the length is not a multiple of two). This reason also occurs if the SourceLength parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

This reason code can also occur on the MQGET call when the MQGMO_CONVERT option is specified. In this case it indicates that the MQRC_SOURCE_LENGTH_ERROR reason was returned by an MQXCNVC call issued by the data conversion exit.

Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
Specify a length that is zero or greater. If the reason code occurs on the MQGET call, check that the logic in the data-conversion exit is correct.

2144 (X'0860')MQRC_TARGET_LENGTH_ERROR
Explanation:
On the MQXCNVC call, the TargetLength parameter is not valid for one of the following reasons: 

TargetLength is less than zero. 
The TargetLength parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.) 
The MQDCC_FILL_TARGET_BUFFER option is specified, but the value of TargetLength is such that the target buffer cannot be filled completely with valid characters. This can occur when TargetCCSID is a pure DBCS character set (such as UCS-2), but TargetLength specifies a length that is an odd number of bytes.
This reason code can also occur on the MQGET call when the MQGMO_CONVERT option is specified. In this case it indicates that the MQRC_TARGET_LENGTH_ERROR reason was returned by an MQXCNVC call issued by the data conversion exit.

Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
Specify a length that is zero or greater. If the MQDCC_FILL_TARGET_BUFFER option is specified, and TargetCCSID is a pure DBCS character set, ensure that TargetLength specifies a length that is a multiple of two.

If the reason code occurs on the MQGET call, check that the logic in the data-conversion exit is correct.

2145 (X'0861')MQRC_SOURCE_BUFFER_ERROR
Explanation:
On the MQXCNVC call, the SourceBuffer parameter pointer is not valid, or points to storage that cannot be accessed for the entire length specified by SourceLength. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

This reason code can also occur on the MQGET call when the MQGMO_CONVERT option is specified. In this case it indicates that the MQRC_SOURCE_BUFFER_ERROR reason was returned by an MQXCNVC call issued by the data conversion exit.

Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
Specify a valid buffer. If the reason code occurs on the MQGET call, check that the logic in the data-conversion exit is correct.

2146 (X'0862')MQRC_TARGET_BUFFER_ERROR
Explanation:
On the MQXCNVC call, the TargetBuffer parameter pointer is not valid, or points to read-only storage, or to storage that cannot be accessed for the entire length specified by TargetLength. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

This reason code can also occur on the MQGET call when the MQGMO_CONVERT option is specified. In this case it indicates that the MQRC_TARGET_BUFFER_ERROR reason was returned by an MQXCNVC call issued by the data conversion exit.

Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
Specify a valid buffer. If the reason code occurs on the MQGET call, check that the logic in the data-conversion exit is correct.

2148 (X'0864')MQRC_IIH_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQIIH structure that is not valid. Possible errors include the following: 

The StrucId field is not MQIIH_STRUC_ID. 
The Version field is not MQIIH_VERSION_1. 
The StrucLength field is not MQIIH_LENGTH_1. 
The BufferLength parameter of the call has a value that is too small to accommodate the structure (the structure extends beyond the end of the message).
This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly.

2149 (X'0865')MQRC_PCF_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued to put a message containing PCF data, but the length of the message does not equal the sum of the lengths of the PCF structures present in the message. This can occur for messages with the following format names: 

MQFMT_ADMIN 
MQFMT_EVENT 
MQFMT_PCF
This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the length of the message specified on the MQPUT or MQPUT1 call equals the sum of the lengths of the PCF structures contained within the message data.

2150 (X'0866')MQRC_DBCS_ERROR
Explanation:
An error was encountered attempting to convert a double-byte character set (DBCS) string. This can occur in the following cases: 

On the MQXCNVC call, when the SourceCCSID parameter specifies the coded character-set identifier of a double-byte character set, but the SourceBuffer parameter does not contain a valid DBCS string. This may be because the string contains characters that are not valid DBCS characters, or because the string is a mixed SBCS/DBCS string and the shift-out/shift-in characters are not correctly paired. The completion code is MQCC_FAILED in this case. 
On the MQGET call, when the MQGMO_CONVERT option is specified. In this case it indicates that the MQRC_DBCS_ERROR reason code was returned by an MQXCNVC call issued by the data conversion exit. The completion code is MQCC_WARNING in this case.
Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
Specify a valid string.

If the reason code occurs on the MQGET call, check that the data in the message is valid, and that the logic in the data-conversion exit is correct.

2152 (X'0868')MQRC_OBJECT_NAME_ERROR
Explanation:
An MQOPEN or MQPUT1 call was issued to open a distribution list (that is, the RecsPresent field in MQOD is greater than zero), but the ObjectName field is neither blank nor the null string.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
If it is intended to open a distribution list, set the ObjectName field to blanks or the null string. If it is not intended to open a distribution list, set the RecsPresent field to zero.

2153 (X'0869')MQRC_OBJECT_Q_MGR_NAME_ERROR
Explanation:
An MQOPEN or MQPUT1 call was issued to open a distribution list (that is, the RecsPresent field in MQOD is greater than zero), but the ObjectQMgrName field is neither blank nor the null string.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
If it is intended to open a distribution list, set the ObjectQMgrName field to blanks or the null string. If it is not intended to open a distribution list, set the RecsPresent field to zero.

2154 (X'086A')MQRC_RECS_PRESENT_ERROR
Explanation:
An MQOPEN or MQPUT1 call was issued, but the call failed for one of the following reasons: 

RecsPresent in MQOD is less than zero. 
ObjectType in MQOD is not MQOT_Q, and RecsPresent is not zero. RecsPresent must be zero if the object being opened is not a queue.
This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
If it is intended to open a distribution list, set the ObjectType field to MQOT_Q and RecsPresent to the number of destinations in the list. If it is not intended to open a distribution list, set the RecsPresent field to zero.

2155 (X'086B')MQRC_OBJECT_RECORDS_ERROR
Explanation:
An MQOPEN or MQPUT1 call was issued to open a distribution list (that is, the RecsPresent field in MQOD is greater than zero), but the MQOR object records are not specified correctly. One of the following applies: 

ObjectRecOffset is zero and ObjectRecPtr is zero or the null pointer. 
ObjectRecOffset is not zero and ObjectRecPtr is not zero and not the null pointer. 
ObjectRecPtr is not a valid pointer. 
ObjectRecPtr or ObjectRecOffset points to storage that is not accessible.
This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that one of ObjectRecOffset and ObjectRecPtr is zero and the other nonzero. Ensure that the field used points to accessible storage.

2156 (X'086C')MQRC_RESPONSE_RECORDS_ERROR
Explanation:
An MQOPEN or MQPUT1 call was issued to open a distribution list (that is, the RecsPresent field in MQOD is greater than zero), but the MQRR response records are not specified correctly. One of the following applies: 

ResponseRecOffset is not zero and ResponseRecPtr is not zero and not the null pointer. 
ResponseRecPtr is not a valid pointer. 
ResponseRecPtr or ResponseRecOffset points to storage that is not accessible.
This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that at least one of ResponseRecOffset and ResponseRecPtr is zero. Ensure that the field used points to accessible storage.

2157 (X'086D')MQRC_ASID_MISMATCH
Explanation:
On any MQI call, the caller's primary ASID was found to be different from the home ASID.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the application (MQI calls cannot be issued in cross-memory mode). Any uncommitted changes in a unit of work should be backed out. A unit of work that is coordinated by the queue manager is backed out automatically.

2158 (X'086E')MQRC_PMO_RECORD_FLAGS_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued to put a message, but the PutMsgRecFields field in the MQPMO structure is not valid, for one of the following reasons: 

The field contains flags that are not valid. 
The message is being put to a distribution list, and put message records have been provided (that is, RecsPresent is greater than zero, and one of PutMsgRecOffset or PutMsgRecPtr is nonzero), but PutMsgRecFields has the value MQPMRF_NONE. 
MQPMRF_ACCOUNTING_TOKEN is specified without either MQPMO_SET_IDENTITY_CONTEXT or MQPMO_SET_ALL_CONTEXT.
This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that PutMsgRecFields is set with the appropriate MQPMRF_* flags to indicate which fields are present in the put message records. If MQPMRF_ACCOUNTING_TOKEN is specified, ensure that either MQPMO_SET_IDENTITY_CONTEXT or MQPMO_SET_ALL_CONTEXT is also specified. Alternatively, set both PutMsgRecOffset and PutMsgRecPtr to zero.

2159 (X'086F')MQRC_PUT_MSG_RECORDS_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued to put a message to a distribution list, but the MQPMR put message records are not specified correctly. One of the following applies: 

PutMsgRecOffset is not zero and PutMsgRecPtr is not zero and not the null pointer. 
PutMsgRecPtr is not a valid pointer. 
PutMsgRecPtr or PutMsgRecOffset points to storage that is not accessible.
This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that at least one of PutMsgRecOffset and PutMsgRecPtr is zero. Ensure that the field used points to accessible storage.

2160 (X'0870')MQRC_CONN_ID_IN_USE
Explanation:
On an MQCONN call, the connection identifier assigned by the queue manager to the connection between a CICS or IMS allied address space and the queue manager conflicts with the connection identifier of another connected CICS or IMS system. The connection identifier assigned is as follows: 

For CICS, the applid 
For IMS, the IMSID parameter on the IMSCTRL (sysgen) macro, or the IMSID parameter on the execution parameter (EXEC card in IMS control region JCL) 
For batch, the job name 
For TSO, the user ID
A conflict arises only if there are two CICS systems, two IMS systems, or one each of CICS and IMS, having the same connection identifiers. Batch and TSO connections need not have unique identifiers.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the naming conventions used in different systems that might connect to the queue manager do not conflict.

2161 (X'0871')MQRC_Q_MGR_QUIESCING
Explanation:
An MQI call was issued, but the call failed because the queue manager is quiescing (preparing to shut down).

When the queue manager is quiescing, the MQOPEN, MQPUT, MQPUT1, and MQGET calls can still complete successfully, but the application can request that they fail by specifying the appropriate option on the call: 

MQOO_FAIL_IF_QUIESCING on MQOPEN 
MQPMO_FAIL_IF_QUIESCING on MQPUT or MQPUT1 
MQGMO_FAIL_IF_QUIESCING on MQGET
Specifying these options enables the application to become aware that the queue manager is preparing to shut down. 

On z/OS: 
For batch applications, this reason can be returned to applications running in LPARs that do not have a queue manager installed. 
For CICS applications, this reason can be returned when no connection was established.
On i5/OS for applications running in compatibility mode, this reason can be returned when no connection was established.
Completion Code:
MQCC_FAILED

Programmer Response:
The application should tidy up and end. If the application specified the MQOO_FAIL_IF_QUIESCING, MQPMO_FAIL_IF_QUIESCING, or MQGMO_FAIL_IF_QUIESCING option on the failing call, the relevant option can be removed and the call reissued. By omitting these options, the application can continue working in order to complete and commit the current unit of work, but the application should not start a new unit of work.

2162 (X'0872')MQRC_Q_MGR_STOPPING
Explanation:
An MQI call was issued, but the call failed because the queue manager is shutting down. If the call was an MQGET call with the MQGMO_WAIT option, the wait has been canceled. No more MQI calls can be issued.

For MQ client applications, it is possible that the call did complete successfully, even though this reason code is returned with a CompCode of MQCC_FAILED. 

On z/OS, the MQRC_CONNECTION_BROKEN reason may be returned instead if, as a result of system scheduling factors, the queue manager shuts down before the call completes.
Completion Code:
MQCC_FAILED

Programmer Response:
The application should tidy up and end. If the application is in the middle of a unit of work coordinated by an external unit-of-work coordinator, the application should issue the appropriate call to back out the unit of work. Any unit of work that is coordinated by the queue manager is backed out automatically.

2163 (X'0873')MQRC_DUPLICATE_RECOV_COORD
Explanation:
On an MQCONN or MQCONNX call, a recovery coordinator already exists for the connection name specified on the connection call issued by the adapter.

A conflict arises only if there are two CICS systems, two IMS systems, or one each of CICS and IMS, having the same connection identifiers. Batch and TSO connections need not have unique identifiers.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the naming conventions used in different systems that might connect to the queue manager do not conflict.

2173 (X'087D')MQRC_PMO_ERROR
Explanation:
On an MQPUT or MQPUT1 call, the MQPMO structure is not valid, for one of the following reasons: 

The StrucId field is not MQPMO_STRUC_ID. 
The Version field specifies a value that is not valid or not supported. 
The parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.) 
The queue manager cannot copy the changed structure to application storage, even though the call is successful. This can occur, for example, if the pointer points to read-only storage.
Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that input fields in the MQPMO structure are set correctly.

2183 (X'0887')MQRC_API_EXIT_LOAD_ERROR
Explanation:
The API crossing exit module could not be linked. If this reason is returned when the API crossing exit is invoked after the call has been executed, the call itself may have executed correctly.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the correct library concatenation has been specified, and that the API crossing exit module is executable and correctly named. Any uncommitted changes in a unit of work should be backed out. A unit of work that is coordinated by the queue manager is backed out automatically.

2184 (X'0888')MQRC_REMOTE_Q_NAME_ERROR
Explanation:
On an MQOPEN or MQPUT1 call, one of the following occurred: 

A local definition of a remote queue (or an alias to one) was specified, but the RemoteQName attribute in the remote queue definition is entirely blank. Note that this error occurs even if the XmitQName in the definition is not blank. 
The ObjectQMgrName field in the object descriptor is not blank and not the name of the local queue manager, but the ObjectName field is blank.
Completion Code:
MQCC_FAILED

Programmer Response:
Alter the local definition of the remote queue and supply a valid remote queue name, or supply a nonblank ObjectName in the object descriptor, as appropriate.

2185 (X'0889')MQRC_INCONSISTENT_PERSISTENCE
Explanation:
An MQPUT call was issued to put a message in a group or a segment of a logical message, but the value specified or defaulted for the Persistence field in MQMD is not consistent with the current group and segment information retained by the queue manager for the queue handle. All messages in a group and all segments in a logical message must have the same value for persistence, that is, all must be persistent, or all must be nonpersistent.

If the current call specifies MQPMO_LOGICAL_ORDER, the call fails. If the current call does not specify MQPMO_LOGICAL_ORDER, but the previous MQPUT call for the queue handle did, the call succeeds with completion code MQCC_WARNING.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
Modify the application to ensure that the same value of persistence is used for all messages in the group, or all segments of the logical message.

2186 (X'088A')MQRC_GMO_ERROR
Explanation:
On an MQGET call, the MQGMO structure is not valid, for one of the following reasons: 

The StrucId field is not MQGMO_STRUC_ID. 
The Version field specifies a value that is not valid or not supported. 
The parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.) 
The queue manager cannot copy the changed structure to application storage, even though the call is successful. This can occur, for example, if the pointer points to read-only storage.
Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that input fields in the MQGMO structure are set correctly.

2187 (X'088B')MQRC_CICS_BRIDGE_RESTRICTION
Explanation:
It is not permitted to issue MQI calls from user transactions that are run in an MQ/CICS-bridge environment where the bridge exit also issues MQI calls. The MQI call fails. If this occurs in the bridge exit, it will result in a transaction abend. If it occurs in the user transaction, this may result in a transaction abend.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
The transaction cannot be run using the MQ/CICS bridge. Refer to the appropriate CICS manual for information about restrictions in the MQ/CICS bridge environment.

2188 (X'088C')MQRC_STOPPED_BY_CLUSTER_EXIT
Explanation:
An MQOPEN, MQPUT, or MQPUT1 call was issued to open or put a message on a cluster queue, but the cluster workload exit rejected the call.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the cluster workload exit to ensure that it has been written correctly. Determine why it rejected the call and correct the problem.

2189 (X'088D')MQRC_CLUSTER_RESOLUTION_ERROR
Explanation:
An MQOPEN, MQPUT, or MQPUT1 call was issued to open or put a message on a cluster queue, but the queue definition could not be resolved correctly because a response was required from the repository manager but none was available.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the repository manager is operating and that the queue and channel definitions are correct.

2190 (X'088E')MQRC_CONVERTED_STRING_TOO_BIG
Explanation:
On an MQGET call with the MQGMO_CONVERT option included in the GetMsgOpts parameter, a string in a fixed-length field in the message expanded during data conversion and exceeded the size of the field. When this happens, the queue manager tries discarding trailing blank characters and characters following the first null character in order to make the string fit, but in this case there were insufficient characters that could be discarded.

This reason code can also occur for messages with a format name of MQFMT_IMS_VAR_STRING. When this happens, it indicates that the IMS variable string expanded such that its length exceeded the capacity of the 2-byte binary length field contained within the structure of the IMS variable string. (The queue manager never discards trailing blanks in an IMS variable string.)

The message is returned unconverted, with the CompCode parameter of the MQGET call set to MQCC_WARNING. If the message consists of several parts, each of which is described by its own character-set and encoding fields (for example, a message with format name MQFMT_DEAD_LETTER_HEADER), some parts may be converted and other parts not converted. However, the values returned in the various character-set and encoding fields always correctly describe the relevant message data.

This reason code does not occur if the string could be made to fit by discarding trailing blank characters.

Completion Code:
MQCC_WARNING

Programmer Response:
Check that the fields in the message contain the correct values, and that the character-set identifiers specified by the sender and receiver of the message are correct. If they are, the layout of the data in the message must be modified to increase the lengths of the field(s) so that there is sufficient space to allow the string(s) to expand when converted.

2191 (X'088F')MQRC_TMC_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQTMC2 structure that is not valid. Possible errors include the following: 

The StrucId field is not MQTMC_STRUC_ID. 
The Version field is not MQTMC_VERSION_2. 
The BufferLength parameter of the call has a value that is too small to accommodate the structure (the structure extends beyond the end of the message).
This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly.

2192 (X'0890')MQRC_PAGESET_FULL
Explanation:
Former name for MQRC_STORAGE_MEDIUM_FULL.

2192 (X'0890')MQRC_STORAGE_MEDIUM_FULL
Explanation:
An MQI call or command was issued to operate on an object, but the call failed because the external storage medium is full. One of the following applies: 

A page-set data set is full (nonshared queues only). 
A coupling-facility structure is full (shared queues only).
This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Check which queues contain messages and look for applications that might be filling the queues unintentionally. Be aware that the queue that has caused the page set or coupling-facility structure to become full is not necessarily the queue referenced by the MQI call that returned MQRC_STORAGE_MEDIUM_FULL.

Check that all of the usual server applications are operating correctly and processing the messages on the queues.

If the applications and servers are operating correctly, increase the number of server applications to cope with the message load, or request the system programmer to increase the size of the page-set data sets.

2193 (X'0891')MQRC_PAGESET_ERROR
Explanation:
An error was encountered with the page set while attempting to access it for a locally defined queue. This could be because the queue is on a page set that does not exist. A console message is issued that tells you the number of the page set in error. For example if the error occurred in the TEST job, and your user identifier is ABCDEFG, the message is: 

CSQI041I CSQIALLC JOB TEST USER ABCDEFG HAD ERROR ACCESSING PAGE SET 27If this reason code occurs while attempting to delete a dynamic queue with MQCLOSE, the dynamic queue has not been deleted.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the storage class for the queue maps to a valid page set using the DISPLAY Q(xx) STGCLASS, DISPLAY STGCLASS(xx), and DISPLAY USAGE PSID commands. If you are unable to resolve the problem, notify the system programmer who should: 

Collect the following diagnostic information: 
A description of the actions that led to the error 
A listing of the application program being run at the time of the error 
Details of the page sets defined for use by the queue manager
Attempt to re-create the problem, and take a system dump immediately after the error occurs 
Contact your IBM Support Center
2194 (X'0892')MQRC_NAME_NOT_VALID_FOR_TYPE
Explanation:
An MQOPEN call was issued to open the queue manager definition, but the ObjectName field in the ObjDesc parameter is not blank.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the ObjectName field is set to blanks.

2195 (X'0893')MQRC_UNEXPECTED_ERROR
Explanation:
The call was rejected because an unexpected error occurred.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the application's parameter list to ensure, for example, that the correct number of parameters was passed, and that data pointers and storage keys are valid. If the problem cannot be resolved, contact your system programmer. 

On z/OS, check whether any information has been displayed on the console. If this error occurs on an MQCONN or MQCONNX call, check that the subsystem named is an active MQ subsystem. In particular, check that it is not a DB2(TM) subsystem. If the problem cannot be resolved, rerun the application with a CSQSNAP DD card (if you have not already got a dump) and send the resulting dump to IBM. 
On OS/2 and i5/OS, consult the FFST record to obtain more detail about the problem. 
On HP OpenVMS, Compaq NonStop Kernel, and UNIX systems, consult the FDC file to obtain more detail about the problem.
2196 (X'0894')MQRC_UNKNOWN_XMIT_Q
Explanation:
On an MQOPEN or MQPUT1 call, a message is to be sent to a remote queue manager. The ObjectName or the ObjectQMgrName in the object descriptor specifies the name of a local definition of a remote queue (in the latter case queue-manager aliasing is being used), but the XmitQName attribute of the definition is not blank and not the name of a locally-defined queue.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the values specified for ObjectName and ObjectQMgrName. If these are correct, check the queue definitions. For more information on transmission queues, see the WebSphere MQ Application Programming Guide.

2197 (X'0895')MQRC_UNKNOWN_DEF_XMIT_Q
Explanation:
An MQOPEN or MQPUT1 call was issued specifying a remote queue as the destination. If a local definition of the remote queue was specified, or if a queue-manager alias is being resolved, the XmitQName attribute in the local definition is blank.

Because there is no queue defined with the same name as the destination queue manager, the queue manager has attempted to use the default transmission queue. However, the name defined by the DefXmitQName queue-manager attribute is not the name of a locally-defined queue.

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the queue definitions, or the queue-manager attribute. See the WebSphere MQ Application Programming Guide for more information.

2198 (X'0896')MQRC_DEF_XMIT_Q_TYPE_ERROR
Explanation:
An MQOPEN or MQPUT1 call was issued specifying a remote queue as the destination. Either a local definition of the remote queue was specified, or a queue-manager alias was being resolved, but in either case the XmitQName attribute in the local definition is blank.

Because there is no transmission queue defined with the same name as the destination queue manager, the local queue manager has attempted to use the default transmission queue. However, although there is a queue defined by the DefXmitQName queue-manager attribute, it is not a local queue.

Completion Code:
MQCC_FAILED

Programmer Response:
Do one of the following: 

Specify a local transmission queue as the value of the XmitQName attribute in the local definition of the remote queue. 
Define a local transmission queue with a name that is the same as that of the remote queue manager. 
Specify a local transmission queue as the value of the DefXmitQName queue-manager attribute.
See the WebSphere MQ Application Programming Guide for more information.

2199 (X'0897')MQRC_DEF_XMIT_Q_USAGE_ERROR
Explanation:
An MQOPEN or MQPUT1 call was issued specifying a remote queue as the destination. Either a local definition of the remote queue was specified, or a queue-manager alias was being resolved, but in either case the XmitQName attribute in the local definition is blank.

Because there is no transmission queue defined with the same name as the destination queue manager, the local queue manager has attempted to use the default transmission queue. However, the queue defined by the DefXmitQName queue-manager attribute does not have a Usage attribute of MQUS_TRANSMISSION.

Completion Code:
MQCC_FAILED

Programmer Response:
Do one of the following: 

Specify a local transmission queue as the value of the XmitQName attribute in the local definition of the remote queue. 
Define a local transmission queue with a name that is the same as that of the remote queue manager. 
Specify a different local transmission queue as the value of the DefXmitQName queue-manager attribute. 
Change the Usage attribute of the DefXmitQName queue to MQUS_TRANSMISSION.
See the WebSphere MQ Application Programming Guide for more information.

2201 (X'0899')MQRC_NAME_IN_USE
Explanation:
An MQOPEN call was issued to create a dynamic queue, but a queue with the same name as the dynamic queue already exists. The existing queue is one that is logically deleted, but for which there are still one or more open handles. For more information, see the description of MQCLOSE in the WebSphere MQ Application Programming Guide.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Either ensure that all handles for the previous dynamic queue are closed, or ensure that the name of the new queue is unique; see the description for reason code MQRC_OBJECT_ALREADY_EXISTS.

2202 (X'089A')MQRC_CONNECTION_QUIESCING
Explanation:
This reason code is issued when the connection to the queue manager is in quiescing state, and an application issues one of the following calls: 

MQCONN or MQCONNX 
MQOPEN, with no connection established, or with MQOO_FAIL_IF_QUIESCING included in the Options parameter 
MQGET, with MQGMO_FAIL_IF_QUIESCING included in the Options field of the GetMsgOpts parameter 
MQPUT or MQPUT1, with MQPMO_FAIL_IF_QUIESCING included in the Options field of the PutMsgOpts parameter
MQRC_CONNECTION_QUIESCING is also issued by the message channel agent (MCA) when the queue manager is in quiescing state.

Completion Code:
MQCC_FAILED

Programmer Response:
The application should tidy up and terminate. Any uncommitted changes in a unit of work should be backed out.

2203 (X'089B')MQRC_CONNECTION_STOPPING
Explanation:
This reason code is issued when the connection to the queue manager is shutting down, and the application issues an MQI call. No more message-queuing calls can be issued. For the MQGET call, if the MQGMO_WAIT option was specified, the wait is canceled.

Note that the MQRC_CONNECTION_BROKEN reason may be returned instead if, as a result of system scheduling factors, the queue manager shuts down before the call completes.

MQRC_CONNECTION_STOPPING is also issued by the message channel agent (MCA) when the queue manager is shutting down.

For MQ client applications, it is possible that the call did complete successfully, even though this reason code is returned with a CompCode of MQCC_FAILED.

Completion Code:
MQCC_FAILED

Programmer Response:
The application should tidy up and terminate. Any uncommitted changes in a unit of work should be backed out. A unit of work that is coordinated by the queue manager is backed out automatically.

2204 (X'089C')MQRC_ADAPTER_NOT_AVAILABLE
Explanation:
This is issued only for CICS applications, if any call is issued and the CICS adapter (a Task Related User Exit) has been disabled, or has not been enabled.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
The application should tidy up and terminate. Any uncommitted changes in a unit of work should be backed out. A unit of work that is coordinated by the queue manager is backed out automatically.

2206 (X'089E')MQRC_MSG_ID_ERROR
Explanation:
An MQGET call was issued to retrieve a message using the message identifier as a selection criterion, but the call failed because selection by message identifier is not supported on this queue. 

On z/OS, the queue is a shared queue, but the IndexType queue attribute does not have an appropriate value: 
If selection is by message identifier alone, IndexType must have the value MQIT_MSG_ID. 
If selection is by message identifier and correlation identifier combined, IndexType must have the value MQIT_MSG_ID or MQIT_CORREL_ID.
On Compaq NonStop Kernel, a key file is required but has not been defined.
Completion Code:
MQCC_FAILED

Programmer Response:
Do one of the following: 

Modify the application so that it does not use selection by message identifier: set the MsgId field to MQMI_NONE and do not specify MQMO_MATCH_MSG_ID in MQGMO. 
On z/OS, change the IndexType queue attribute to MQIT_MSG_ID. 
On Compaq NonStop Kernel, define a key file.
2207 (X'089F')MQRC_CORREL_ID_ERROR
Explanation:
An MQGET call was issued to retrieve a message using the correlation identifier as a selection criterion, but the call failed because selection by correlation identifier is not supported on this queue. 

On z/OS, the queue is a shared queue, but the IndexType queue attribute does not have an appropriate value: 
If selection is by correlation identifier alone, IndexType must have the value MQIT_CORREL_ID. 
If selection is by correlation identifier and message identifier combined, IndexType must have the value MQIT_CORREL_ID or MQIT_MSG_ID.
On Compaq NonStop Kernel, a key file is required but has not been defined.
Completion Code:
MQCC_FAILED

Programmer Response:
Do one of the following: 

On z/OS, change the IndexType queue attribute to MQIT_CORREL_ID. 
On Compaq NonStop Kernel, define a key file. 
Modify the application so that it does not use selection by correlation identifier: set the CorrelId field to MQCI_NONE and do not specify MQMO_MATCH_CORREL_ID in MQGMO.
2208 (X'08A0')MQRC_FILE_SYSTEM_ERROR
Explanation:
An unexpected return code was received from the file system, in attempting to perform an operation on a queue.

This reason code occurs only on VSE/ESA.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the file system definition for the queue that was being accessed. For a VSAM file, check that the control interval is large enough for the maximum message length allowed for the queue.

2209 (X'08A1')MQRC_NO_MSG_LOCKED
Explanation:
An MQGET call was issued with the MQGMO_UNLOCK option, but no message was currently locked.

Completion Code:
MQCC_WARNING

Programmer Response:
Check that a message was locked by an earlier MQGET call with the MQGMO_LOCK option for the same handle, and that no intervening call has caused the message to become unlocked.

2210 (X'08A2')MQRC_SOAP_DOTNET_ERROR
Explanation:
An exception from the .NET environment (as opposed to WebSphere MQ .NET) has been received and is included as an inner exception.

Completion Code:
MQCC_FAILED

Programmer Response:
Refer to the .NET documentation for details about the inner exception. Follow the corrective action recommended there.

2211 (X'08A3')MQRC_SOAP_AXIS_ERROR
Explanation:
An exception from the Axis environment has been received and is included as a chained exception.

Completion Code:
MQCC_FAILED

Programmer Response:
Refer to the Axis documentation for details about the chained exception. Follow the corrective action recommended there.

2212 (X'08A4')MQRC_SOAP_URL_ERROR
Explanation:
The SOAP URL has been specified incorrectly.

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the SOAP URL and rerun.

2217 (X'08A9')MQRC_CONNECTION_NOT_AUTHORIZED
Explanation:
This reason code arises only for CICS applications. For these, connection to the queue manager is done by the adapter. If that connection fails because the CICS subsystem is not authorized to connect to the queue manager, this reason code is issued whenever an application running under that subsystem subsequently issues an MQI call.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the subsystem is authorized to connect to the queue manager.

2218 (X'08AA')MQRC_MSG_TOO_BIG_FOR_CHANNEL
Explanation:
A message was put to a remote queue, but the message is larger than the maximum message length allowed by the channel. This reason code is returned in the Feedback field in the message descriptor of a report message. 

On z/OS, this return code is issued only if you are not using CICS for distributed queuing. Otherwise, MQRC_MSG_TOO_BIG_FOR_Q_MGR is issued.
Completion Code:
MQCC_FAILED

Programmer Response:
Check the channel definitions. Increase the maximum message length that the channel can accept, or break the message into several smaller messages.

2219 (X'08AB')MQRC_CALL_IN_PROGRESS
Explanation:
The application issued an MQI call whilst another MQI call was already being processed for that connection. Only one call per application connection can be processed at a time.

Concurrent calls can arise when an application uses multiple threads, or when an exit is invoked as part of the processing of an MQI call. For example, a data-conversion exit invoked as part of the processing of the MQGET call may try to issue an MQI call. 

On z/OS, concurrent calls can arise only with batch or IMS applications; an example is when a subtask ends while an MQI call is in progress (for example, an MQGET that is waiting), and there is an end-of-task exit routine that issues another MQI call. 
On OS/2 and Windows, concurrent calls can also arise if an MQI call is issued in response to a user message while another MQI call is in progress. 
If the application is using multiple threads with shared handles, MQRC_CALL_IN_PROGRESS occurs when the handle specified on the call is already in use by another thread and MQCNO_HANDLE_SHARE_NO_BLOCK was specified on the MQCONNX call.
Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that an MQI call cannot be issued while another one is active. Do not issue MQI calls from within a data-conversion exit. 

On z/OS, if you want to provide a subtask to allow an application that is waiting for a message to arrive to be canceled, wait for the message by using MQGET with MQGMO_SET_SIGNAL, rather than MQGMO_WAIT.
2220 (X'08AC')MQRC_RMH_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQRMH structure that is not valid. Possible errors include the following: 

The StrucId field is not MQRMH_STRUC_ID. 
The Version field is not MQRMH_VERSION_1. 
The StrucLength field specifies a value that is too small to include the structure plus the variable-length data at the end of the structure. 
The CodedCharSetId field is zero, or a negative value that is not valid. 
The BufferLength parameter of the call has a value that is too small to accommodate the structure (the structure extends beyond the end of the message).
This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly. Ensure that the application sets the CodedCharSetId field to a valid value (note: MQCCSI_DEFAULT, MQCCSI_EMBEDDED, MQCCSI_Q_MGR, and MQCCSI_UNDEFINED are not valid in this field).

2222 (X'08AE')MQRC_Q_MGR_ACTIVE
Explanation:
This condition is detected when a queue manager becomes active. 

On z/OS, this event is not generated for the first start of a queue manager, only on subsequent restarts.
Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2223 (X'08AF')MQRC_Q_MGR_NOT_ACTIVE
Explanation:
This condition is detected when a queue manager is requested to stop or quiesce.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2224 (X'08B0')MQRC_Q_DEPTH_HIGH
Explanation:
An MQPUT or MQPUT1 call has caused the queue depth to be incremented to or above the limit specified in the QDepthHighLimit attribute.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2225 (X'08B1')MQRC_Q_DEPTH_LOW
Explanation:
An MQGET call has caused the queue depth to be decremented to or below the limit specified in the QDepthLowLimit attribute.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2226 (X'08B2')MQRC_Q_SERVICE_INTERVAL_HIGH
Explanation:
No successful gets or puts have been detected within an interval that is greater than the limit specified in the QServiceInterval attribute.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2227 (X'08B3')MQRC_Q_SERVICE_INTERVAL_OK
Explanation:
A successful get has been detected within an interval that is less than or equal to the limit specified in the QServiceInterval attribute.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2228 (X'08B4')MQRC_RFH_HEADER_FIELD_ERROR
Explanation:
An expected RFH header field was not found or had an invalid value. If this error occurs in a WebSphere MQ SOAP listener, the missing or erroneous field is either the contentType field or the transportVersion field or both.

Completion Code:
MQCC_FAILED

Programmer Response:
If this error occurs in a WebSphere MQ SOAP listener, and you are using the IBM-supplied sender, contact your IBM Support Center. If you are using a bespoke sender, check the associated error message, and that the RFH2 section of the SOAP/MQ request message contains all the mandatory fields, and that these fields have valid values.

2229 (X'08B5')MQRC_RAS_PROPERTY_ERROR
Explanation:
There is an error related to the RAS property file. The file may be missing, it may be not accessible, or the commands in the file may be incorrect.

Completion Code:
MQCC_FAILED

Programmer Response:
Look at the associated error message, which will explain the error in detail. Correct the error and retry.

2232 (X'08B8')MQRC_UNIT_OF_WORK_NOT_STARTED
Explanation:
An MQGET, MQPUT or MQPUT1 call was issued to get or put a message within a unit of work, but no TM/MP transaction had been started. If MQGMO_NO_SYNCPOINT is not specified on MQGET, or MQPMO_NO_SYNCPOINT is not specified on MQPUT or MQPUT1 (the default), the call requires a unit of work.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure a TM/MP transaction is available, or issue the MQGET call with the MQGMO_NO_SYNCPOINT option, or the MQPUT or MQPUT1 call with the MQPMO_NO_SYNCPOINT option, which will cause a transaction to be started automatically.

2233 (X'08B9')MQRC_CHANNEL_AUTO_DEF_OK
Explanation:
This condition is detected when the automatic definition of a channel is successful. The channel is defined by the MCA.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2234 (X'08BA')MQRC_CHANNEL_AUTO_DEF_ERROR
Explanation:
This condition is detected when the automatic definition of a channel fails; this may be because an error occurred during the definition process, or because the channel automatic-definition exit inhibited the definition. Additional information is returned in the event message indicating the reason for the failure.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_WARNING

Programmer Response:
Examine the additional information returned in the event message to determine the reason for the failure.

2235 (X'08BB')MQRC_CFH_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQCFH structure that is not valid.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly.

2236 (X'08BC')MQRC_CFIL_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQCFIL or MQRCFIL64 structure that is not valid.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly.

2237 (X'08BD')MQRC_CFIN_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQCFIN or MQCFIN64 structure that is not valid.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly.

2238 (X'08BE')MQRC_CFSL_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQCFSL structure that is not valid.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly.

2239 (X'08BF')MQRC_CFST_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQCFST structure that is not valid.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly.

2241 (X'08C1')MQRC_INCOMPLETE_GROUP
Explanation:
An operation was attempted on a queue using a queue handle that had an incomplete message group. This reason code can arise in the following situations: 

On the MQPUT call, when the application specifies MQPMO_LOGICAL_ORDER and attempts to put a message that is not in a group. The completion code is MQCC_FAILED in this case. 
On the MQPUT call, when the application does not specify MQPMO_LOGICAL_ORDER, but the previous MQPUT call for the queue handle did specify MQPMO_LOGICAL_ORDER. The completion code is MQCC_WARNING in this case. 
On the MQGET call, when the application does not specify MQGMO_LOGICAL_ORDER, but the previous MQGET call for the queue handle did specify MQGMO_LOGICAL_ORDER. The completion code is MQCC_WARNING in this case. 
On the MQCLOSE call, when the application attempts to close the queue that has the incomplete message group. The completion code is MQCC_WARNING in this case.
If there is an incomplete logical message as well as an incomplete message group, reason code MQRC_INCOMPLETE_MSG is returned in preference to MQRC_INCOMPLETE_GROUP.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
If this reason code is expected, no corrective action is required. Otherwise, ensure that the MQPUT call for the last message in the group specifies MQMF_LAST_MSG_IN_GROUP.

2242 (X'08C2')MQRC_INCOMPLETE_MSG
Explanation:
An operation was attempted on a queue using a queue handle that had an incomplete logical message. This reason code can arise in the following situations: 

On the MQPUT call, when the application specifies MQPMO_LOGICAL_ORDER and attempts to put a message that is not a segment, or that has a setting for the MQMF_LAST_MSG_IN_GROUP flag that is different from the previous message. The completion code is MQCC_FAILED in this case. 
On the MQPUT call, when the application does not specify MQPMO_LOGICAL_ORDER, but the previous MQPUT call for the queue handle did specify MQPMO_LOGICAL_ORDER. The completion code is MQCC_WARNING in this case. 
On the MQGET call, when the application does not specify MQGMO_LOGICAL_ORDER, but the previous MQGET call for the queue handle did specify MQGMO_LOGICAL_ORDER. The completion code is MQCC_WARNING in this case. 
On the MQCLOSE call, when the application attempts to close the queue that has the incomplete logical message. The completion code is MQCC_WARNING in this case.
This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
If this reason code is expected, no corrective action is required. Otherwise, ensure that the MQPUT call for the last segment specifies MQMF_LAST_SEGMENT.

2243 (X'08C3')MQRC_INCONSISTENT_CCSIDS
Explanation:
An MQGET call was issued specifying the MQGMO_COMPLETE_MSG option, but the message to be retrieved consists of two or more segments that have differing values for the CodedCharSetId field in MQMD. This can arise when the segments take different paths through the network, and some of those paths have MCA sender conversion enabled. The call succeeds with a completion code of MQCC_WARNING, but only the first few segments that have identical character-set identifiers are returned.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_WARNING

Programmer Response:
Remove the MQGMO_COMPLETE_MSG option from the MQGET call and retrieve the remaining message segments one by one.

2244 (X'08C4')MQRC_INCONSISTENT_ENCODINGS
Explanation:
An MQGET call was issued specifying the MQGMO_COMPLETE_MSG option, but the message to be retrieved consists of two or more segments that have differing values for the Encoding field in MQMD. This can arise when the segments take different paths through the network, and some of those paths have MCA sender conversion enabled. The call succeeds with a completion code of MQCC_WARNING, but only the first few segments that have identical encodings are returned.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_WARNING

Programmer Response:
Remove the MQGMO_COMPLETE_MSG option from the MQGET call and retrieve the remaining message segments one by one.

2245 (X'08C5')MQRC_INCONSISTENT_UOW
Explanation:
One of the following applies: 

An MQPUT call was issued to put a message in a group or a segment of a logical message, but the value specified or defaulted for the MQPMO_SYNCPOINT option is not consistent with the current group and segment information retained by the queue manager for the queue handle. 
If the current call specifies MQPMO_LOGICAL_ORDER, the call fails. If the current call does not specify MQPMO_LOGICAL_ORDER, but the previous MQPUT call for the queue handle did, the call succeeds with completion code MQCC_WARNING.

An MQGET call was issued to remove from the queue a message in a group or a segment of a logical message, but the value specified or defaulted for the MQGMO_SYNCPOINT option is not consistent with the current group and segment information retained by the queue manager for the queue handle. 
If the current call specifies MQGMO_LOGICAL_ORDER, the call fails. If the current call does not specify MQGMO_LOGICAL_ORDER, but the previous MQGET call for the queue handle did, the call succeeds with completion code MQCC_WARNING.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
Modify the application to ensure that the same unit-of-work specification is used for all messages in the group, or all segments of the logical message.

2246 (X'08C6')MQRC_INVALID_MSG_UNDER_CURSOR
Explanation:
An MQGET call was issued specifying the MQGMO_COMPLETE_MSG option with either MQGMO_MSG_UNDER_CURSOR or MQGMO_BROWSE_MSG_UNDER_CURSOR, but the message that is under the cursor has an MQMD with an Offset field that is greater than zero. Because MQGMO_COMPLETE_MSG was specified, the message is not valid for retrieval.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Reposition the browse cursor so that it is located on a message whose Offset field in MQMD is zero. Alternatively, remove the MQGMO_COMPLETE_MSG option.

2247 (X'08C7')MQRC_MATCH_OPTIONS_ERROR
Explanation:
An MQGET call was issued, but the value of the MatchOptions field in the GetMsgOpts parameter is not valid, for one of the following reasons: 

An undefined option is specified. 
All of the following are true: 
MQGMO_LOGICAL_ORDER is specified. 
There is a current message group or logical message for the queue handle. 
Neither MQGMO_BROWSE_MSG_UNDER_CURSOR nor MQGMO_MSG_UNDER_CURSOR is specified. 
One or more of the MQMO_* options is specified. 
The values of the fields in the MsgDesc parameter corresponding to the MQMO_* options specified, differ from the values of those fields in the MQMD for the message to be returned next.
On z/OS, one or more of the options specified is not valid for the index type of the queue.
This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that only valid options are specified for the field.

2248 (X'08C8')MQRC_MDE_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQMDE structure that is not valid. Possible errors include the following: 

The StrucId field is not MQMDE_STRUC_ID. 
The Version field is not MQMDE_VERSION_2. 
The StrucLength field is not MQMDE_LENGTH_2. 
The CodedCharSetId field is zero, or a negative value that is not valid. 
The BufferLength parameter of the call has a value that is too small to accommodate the structure (the structure extends beyond the end of the message).
This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly. Ensure that the application sets the CodedCharSetId field to a valid value (note: MQCCSI_DEFAULT, MQCCSI_EMBEDDED, MQCCSI_Q_MGR, and MQCCSI_UNDEFINED are not valid in this field).

2249 (X'08C9')MQRC_MSG_FLAGS_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the MsgFlags field in the message descriptor MQMD contains one or more message flags that are not recognized by the local queue manager. The message flags that cause this reason code to be returned depend on the destination of the message; see the description of REPORT in the WebSphere MQ Application Programming Guide for more details.

This reason code can also occur in the Feedback field in the MQMD of a report message, or in the Reason field in the MQDLH structure of a message on the dead-letter queue; in both cases it indicates that the destination queue manager does not support one or more of the message flags specified by the sender of the message.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Do the following: 

Ensure that the MsgFlags field in the message descriptor is initialized with a value when the message descriptor is declared, or is assigned a value prior to the MQPUT or MQPUT1 call. Specify MQMF_NONE if no message flags are needed. 
Ensure that the message flags specified are valid; see the MsgFlags field described in the description of MQMD in the WebSphere MQ Application Programming Guide for valid message flags. 
If multiple message flags are being set by adding the individual message flags together, ensure that the same message flag is not added twice. 
On z/OS, ensure that the message flags specified are valid for the index type of the queue; see the description of the MsgFlags field in MQMD for further details.
2250 (X'08CA')MQRC_MSG_SEQ_NUMBER_ERROR
Explanation:
An MQGET, MQPUT, or MQPUT1 call was issued, but the value of the MsgSeqNumber field in the MQMD or MQMDE structure is less than one or greater than 999 999 999.

This error can also occur on the MQPUT call if the MsgSeqNumber field would have become greater than 999 999 999 as a result of the call.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a value in the range 1 through 999 999 999. Do not attempt to create a message group containing more than 999 999 999 messages.

2251 (X'08CB')MQRC_OFFSET_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the value of the Offset field in the MQMD or MQMDE structure is less than zero or greater than 999 999 999.

This error can also occur on the MQPUT call if the Offset field would have become greater than 999 999 999 as a result of the call.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a value in the range 0 through 999 999 999. Do not attempt to create a message segment that would extend beyond an offset of 999 999 999.

2252 (X'08CC')MQRC_ORIGINAL_LENGTH_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued to put a report message that is a segment, but the OriginalLength field in the MQMD or MQMDE structure is either: 

Less than the length of data in the message, or 
Less than one (for a segment that is not the last segment), or 
Less than zero (for a segment that is the last segment)
This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a value that is greater than zero. Zero is valid only for the last segment.

2253 (X'08CD')MQRC_SEGMENT_LENGTH_ZERO
Explanation:
An MQPUT or MQPUT1 call was issued to put the first or an intermediate segment of a logical message, but the length of the application message data in the segment (excluding any MQ headers that may be present) is zero. The length must be at least one for the first or intermediate segment.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the application logic to ensure that segments are put with a length of one or greater. Only the last segment of a logical message is permitted to have a length of zero.

2255 (X'08CF')MQRC_UOW_NOT_AVAILABLE
Explanation:
An MQGET, MQPUT, or MQPUT1 call was issued to get or put a message outside a unit of work, but the options specified on the call required the queue manager to process the call within a unit of work. Because there is already a user-defined unit of work in existence, the queue manager was unable to create a temporary unit of work for the duration of the call.

This reason occurs in the following circumstances: 

On an MQGET call, when the MQGMO_COMPLETE_MSG option is specified in MQGMO and the logical message to be retrieved is persistent and consists of two or more segments. 
On an MQPUT or MQPUT1 call, when the MQMF_SEGMENTATION_ALLOWED flag is specified in MQMD and the message requires segmentation.
This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Issue the MQGET, MQPUT, or MQPUT1 call inside the user-defined unit of work. Alternatively, for the MQPUT or MQPUT1 call, reduce the size of the message so that it does not require segmentation by the queue manager.

2256 (X'08D0')MQRC_WRONG_GMO_VERSION
Explanation:
An MQGET call was issued specifying options that required an MQGMO with a version number not less than MQGMO_VERSION_2, but the MQGMO supplied did not satisfy this condition.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Modify the application to pass a version-2 MQGMO. Check the application logic to ensure that the Version field in MQGMO has been set to MQGMO_VERSION_2. Alternatively, remove the option that requires the version-2 MQGMO.

2257 (X'08D1')MQRC_WRONG_MD_VERSION
Explanation:
An MQGET, MQPUT, or MQPUT1 call was issued specifying options that required an MQMD with a version number not less than MQMD_VERSION_2, but the MQMD supplied did not satisfy this condition.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Modify the application to pass a version-2 MQMD. Check the application logic to ensure that the Version field in MQMD has been set to MQMD_VERSION_2. Alternatively, remove the option that requires the version-2 MQMD.

2258 (X'08D2')MQRC_GROUP_ID_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued to put a distribution-list message that is also a message in a group, a message segment, or has segmentation allowed, but an invalid combination of options and values was specified. All of the following are true: 

MQPMO_LOGICAL_ORDER is not specified in the Options field in MQPMO. 
Either there are too few MQPMR records provided by MQPMO, or the GroupId field is not present in the MQPMR records. 
One or more of the following flags is specified in the MsgFlags field in MQMD or MQMDE: 
MQMF_SEGMENTATION_ALLOWED 
MQMF_*_MSG_IN_GROUP 
MQMF_*_SEGMENT
The GroupId field in MQMD or MQMDE is not MQGI_NONE.
This combination of options and values would result in the same group identifier being used for all of the destinations in the distribution list; this is not permitted by the queue manager.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify MQGI_NONE for the GroupId field in MQMD or MQMDE. Alternatively, if the call is MQPUT specify MQPMO_LOGICAL_ORDER in the Options field in MQPMO.

2259 (X'08D3')MQRC_INCONSISTENT_BROWSE
Explanation:
An MQGET call was issued with the MQGMO_BROWSE_NEXT option specified, but the specification of the MQGMO_LOGICAL_ORDER option for the call is different from the specification of that option for the previous call for the queue handle. Either both calls must specify MQGMO_LOGICAL_ORDER, or neither call must specify MQGMO_LOGICAL_ORDER.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Add or remove the MQGMO_LOGICAL_ORDER option as appropriate. Alternatively, to switch between logical order and physical order, specify the MQGMO_BROWSE_FIRST option to restart the scan from the beginning of the queue, omitting or specifying MQGMO_LOGICAL_ORDER as required.

2260 (X'08D4')MQRC_XQH_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQXQH structure that is not valid. Possible errors include the following: 

The StrucId field is not MQXQH_STRUC_ID. 
The Version field is not MQXQH_VERSION_1. 
The BufferLength parameter of the call has a value that is too small to accommodate the structure (the structure extends beyond the end of the message).
This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly.

2261 (X'08D5')MQRC_SRC_ENV_ERROR
Explanation:
This reason occurs when a channel exit that processes reference messages detects an error in the source environment data of a reference message header (MQRMH). One of the following is true: 

SrcEnvLength is less than zero. 
SrcEnvLength is greater than zero, but there is no source environment data. 
SrcEnvLength is greater than zero, but SrcEnvOffset is negative, zero, or less than the length of the fixed part of MQRMH. 
SrcEnvLength is greater than zero, but SrcEnvOffset plus SrcEnvLength is greater than StrucLength.
The exit returns this reason in the Feedback field of the MQCXP structure. If an exception report is requested, it is copied to the Feedback field of the MQMD associated with the report.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify the source environment data correctly.

2262 (X'08D6')MQRC_SRC_NAME_ERROR
Explanation:
This reason occurs when a channel exit that processes reference messages detects an error in the source name data of a reference message header (MQRMH). One of the following is true: 

SrcNameLength is less than zero. 
SrcNameLength is greater than zero, but there is no source name data. 
SrcNameLength is greater than zero, but SrcNameOffset is negative, zero, or less than the length of the fixed part of MQRMH. 
SrcNameLength is greater than zero, but SrcNameOffset plus SrcNameLength is greater than StrucLength.
The exit returns this reason in the Feedback field of the MQCXP structure. If an exception report is requested, it is copied to the Feedback field of the MQMD associated with the report.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify the source name data correctly.

2263 (X'08D7')MQRC_DEST_ENV_ERROR
Explanation:
This reason occurs when a channel exit that processes reference messages detects an error in the destination environment data of a reference message header (MQRMH). One of the following is true: 

DestEnvLength is less than zero. 
DestEnvLength is greater than zero, but there is no destination environment data. 
DestEnvLength is greater than zero, but DestEnvOffset is negative, zero, or less than the length of the fixed part of MQRMH. 
DestEnvLength is greater than zero, but DestEnvOffset plus DestEnvLength is greater than StrucLength.
The exit returns this reason in the Feedback field of the MQCXP structure. If an exception report is requested, it is copied to the Feedback field of the MQMD associated with the report.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify the destination environment data correctly.

2264 (X'08D8')MQRC_DEST_NAME_ERROR
Explanation:
This reason occurs when a channel exit that processes reference messages detects an error in the destination name data of a reference message header (MQRMH). One of the following is true: 

DestNameLength is less than zero. 
DestNameLength is greater than zero, but there is no destination name data. 
DestNameLength is greater than zero, but DestNameOffset is negative, zero, or less than the length of the fixed part of MQRMH. 
DestNameLength is greater than zero, but DestNameOffset plus DestNameLength is greater than StrucLength.
The exit returns this reason in the Feedback field of the MQCXP structure. If an exception report is requested, it is copied to the Feedback field of the MQMD associated with the report.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify the destination name data correctly.

2265 (X'08D9')MQRC_TM_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQTM structure that is not valid. Possible errors include the following: 

The StrucId field is not MQTM_STRUC_ID. 
The Version field is not MQTM_VERSION_1. 
The BufferLength parameter of the call has a value that is too small to accommodate the structure (the structure extends beyond the end of the message).
This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly.

2266 (X'08DA')MQRC_CLUSTER_EXIT_ERROR
Explanation:
An MQOPEN, MQPUT, or MQPUT1 call was issued to open or put a message on a cluster queue, but the cluster workload exit defined by the queue-manager's ClusterWorkloadExit attribute failed unexpectedly or did not respond in time. Subsequent MQOPEN, MQPUT, and MQPUT1 calls for this queue handle are processed as though the ClusterWorkloadExit attribute were blank. 

On z/OS, a message giving more information about the error is written to the system log, for example message CSQV455E or CSQV456E.
This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the cluster workload exit to ensure that it has been written correctly.

2267 (X'08DB')MQRC_CLUSTER_EXIT_LOAD_ERROR
Explanation:
An MQCONN or MQCONNX call was issued to connect to a queue manager, but the queue manager was unable to load the cluster workload exit. Execution continues without the cluster workload exit. 

On z/OS, if the cluster workload exit cannot be loaded, a message is written to the system log, for example message CSQV453I. Processing continues as though the ClusterWorkloadExit attribute had been blank.
This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_WARNING

Programmer Response:
Ensure that the queue-manager's ClusterWorkloadExit attribute has the correct value, and that the exit has been installed into the correct location.

2268 (X'08DC')MQRC_CLUSTER_PUT_INHIBITED
Explanation:
An MQOPEN call with the MQOO_OUTPUT and MQOO_BIND_ON_OPEN options in effect was issued for a cluster queue, but the call failed because all of the following are true: 

All instances of the cluster queue are currently put-inhibited (that is, all of the queue instances have the InhibitPut attribute set to MQQA_PUT_INHIBITED). 
There is no local instance of the queue. (If there is a local instance, the MQOPEN call succeeds, even if the local instance is put-inhibited.) 
There is no cluster workload exit for the queue, or there is a cluster workload exit but it did not choose a queue instance. (If the cluster workload exit does choose a queue instance, the MQOPEN call succeeds, even if that instance is put-inhibited.)
If the MQOO_BIND_NOT_FIXED option is specified on the MQOPEN call, the call can succeed even if all of the queues in the cluster are put-inhibited. However, a subsequent MQPUT call may fail if all of the queues are still put-inhibited at the time of the MQPUT call.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
If the system design allows put requests to be inhibited for short periods, retry the operation later. If the problem persists, determine why all of the queues in the cluster are put-inhibited.

2269 (X'08DD')MQRC_CLUSTER_RESOURCE_ERROR
Explanation:
An MQOPEN, MQPUT, or MQPUT1 call was issued for a cluster queue, but an error occurred whilst trying to use a resource required for clustering.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Do the following: 

Check that the SYSTEM.CLUSTER.* queues are not put inhibited or full. 
Check the event queues for any events relating to the SYSTEM.CLUSTER.* queues, as these may give guidance as to the nature of the failure. 
Check that the repository queue manager is available. 
On z/OS, check the console for signs of the failure, such as full page sets.
2270 (X'08DE')MQRC_NO_DESTINATIONS_AVAILABLE
Explanation:
An MQPUT or MQPUT1 call was issued to put a message on a cluster queue, but at the time of the call there were no longer any instances of the queue in the cluster. The message therefore could not be sent.

This situation can occur when MQOO_BIND_NOT_FIXED is specified on the MQOPEN call that opens the queue, or MQPUT1 is used to put the message.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the queue definition and queue status to determine why all instances of the queue were removed from the cluster. Correct the problem and rerun the application.

2271 (X'08DF')MQRC_CONN_TAG_IN_USE
Explanation:
An MQCONNX call was issued specifying one of the MQCNO_*_CONN_TAG_* options, but the call failed because the connection tag specified by ConnTag in MQCNO is in use by an active process or thread, or there is an unresolved unit of work that references this connection tag.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
The problem is likely to be transitory. The application should wait a short while and then retry the operation.

2272 (X'08E0')MQRC_PARTIALLY_CONVERTED
Explanation:
On an MQGET call with the MQGMO_CONVERT option included in the GetMsgOpts parameter, one or more MQ header structures in the message data could not be converted to the specified target character set or encoding. In this situation, the MQ header structures are converted to the queue-manager's character set and encoding, and the application data in the message is converted to the target character set and encoding. On return from the call, the values returned in the various CodedCharSetId and Encoding fields in the MsgDesc parameter and MQ header structures indicate the character set and encoding that apply to each part of the message. The call completes with MQCC_WARNING.

This reason code usually occurs when the specified target character set is one that causes the character strings in the MQ header structures to expand beyond the lengths of their fields. Unicode character set UCS-2 is an example of a character set that causes this to happen.

Completion Code:
MQCC_FAILED

Programmer Response:
If this is an expected situation, no corrective action is required.

If this is an unexpected situation, check that the MQ header structures contain valid data. If they do, specify as the target character set a character set that does not cause the strings to expand.

2273 (X'08E1')MQRC_CONNECTION_ERROR
Explanation:
An MQCONN or MQCONNX call failed for one of the following reasons: 

The installation and customization options chosen for WebSphere MQ do not allow connection by the type of application being used. 
The system parameter module is not at the same release level as the queue manager. 
The channel initiator is not at the same release level as the queue manager. 
An internal error was detected by the queue manager.
This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
None, if the installation and customization options chosen for WebSphere MQ do not allow all functions to be used.

Otherwise, if this occurs while starting the channel initiator, ensure that the queue manager and the channel initiator are both at the same release level and that their started task JCL procedures both specify the same level of WebSphere MQ program libraries; if this occurs while starting the queue manager, relinkedit the system parameter module (CSQZPARM) to ensure that it is at the correct level. If the problem persists, contact your IBM support center.

2274 (X'08E2')MQRC_OPTION_ENVIRONMENT_ERROR
Explanation:
An MQGET call with the MQGMO_MARK_SKIP_BACKOUT option specified was issued from a DB2 Stored Procedure. The call failed because the MQGMO_MARK_SKIP_BACKOUT option cannot be used from a DB2 Stored Procedure.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Remove the MQGMO_MARK_SKIP_BACKOUT option from the MQGET call.

2277 (X'08E5')MQRC_CD_ERROR
Explanation:
An MQCONNX call was issued to connect to a queue manager, but the MQCD channel definition structure addressed by the ClientConnOffset or ClientConnPtr field in MQCNO contains data that is not valid. Consult the error log for more information about the nature of the error.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that input fields in the MQCD structure are set correctly.

2278 (X'08E6')MQRC_CLIENT_CONN_ERROR
Explanation:
An MQCONNX call was issued to connect to a queue manager, but the MQCD channel definition structure is not specified correctly. One of the following applies: 

ClientConnOffset is not zero and ClientConnPtr is not zero and not the null pointer. 
ClientConnPtr is not a valid pointer. 
ClientConnPtr or ClientConnOffset points to storage that is not accessible.
This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems. It also occurs in Java applications when a client channel definition table is specified to determine the name of the channel, but the table itself cannot be found.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that at least one of ClientConnOffset and ClientConnPtr is zero. Ensure that the field used points to accessible storage. Ensure that the URL of the client channel definition table is correct.

2279 (X'08E7')MQRC_CHANNEL_STOPPED_BY_USER
Explanation:
This condition is detected when the channel has been stopped by an operator. The reason qualifier identifies the reasons for stopping.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2280 (X'08E8')MQRC_HCONFIG_ERROR
Explanation:
The configuration handle Hconfig specified on the MQXEP call or MQZEP call is not valid. The MQXEP call is issued by an API exit function; the MQZEP call is issued by an installable service. 

On z/OS, this reason code does not occur.
Completion Code:
MQCC_FAILED

Programmer Response:
Specify the configuration handle that was provided by the queue manager: 

On the MQXEP call, use the handle passed in the Hconfig field of the MQAXP structure. 
On the MQZEP call, use the handle passed to the installable service's configuration function on the component initialization call. See the WebSphere MQ System Administration Guide book for information about installable services.
2281 (X'08E9')MQRC_FUNCTION_ERROR
Explanation:
An MQXEP or MQZEP call was issued, but the function identifier Function specified on the call is not valid, or not supported by the installable service being configured. 

On z/OS, this reason code does not occur.
Completion Code:
MQCC_FAILED

Programmer Response:
Do the following: 

For the MQXEP call, specify one of the MQXF_* values. 
For the MQZEP call, specify an MQZID_* value that is valid for the installable service being configured. Refer to the description of the MQZEP call in the WebSphere MQ System Administration Guide book to determine which values are valid.
2282 (X'08EA')MQRC_CHANNEL_STARTED
Explanation:
One of the following has occurred: 

An operator has issued a Start Channel command. 
An instance of a channel has been successfully established. This condition is detected when Initial Data negotiation is complete and resynchronization has been performed where necessary such that message transfer can proceed.
Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2283 (X'08EB')MQRC_CHANNEL_STOPPED
Explanation:
This condition is detected when the channel has been stopped. The reason qualifier identifies the reasons for stopping.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2284 (X'08EC')MQRC_CHANNEL_CONV_ERROR
Explanation:
This condition is detected when a channel is unable to do data conversion and the MQGET call to get a message from the transmission queue resulted in a data conversion error. The conversion reason code identifies the reason for the failure.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2285 (X'08ED')MQRC_SERVICE_NOT_AVAILABLE
Explanation:
This reason should be returned by an installable service component when the requested action cannot be performed because the required underlying service is not available. 

On z/OS, this reason code does not occur.
Completion Code:
MQCC_FAILED

Programmer Response:
Make the underlying service available.

2286 (X'08EE')MQRC_INITIALIZATION_FAILED
Explanation:
This reason should be returned by an installable service component when the component is unable to complete initialization successfully. 

On z/OS, this reason code does not occur.
Completion Code:
MQCC_FAILED

Programmer Response:
Correct the error and retry the operation.

2287 (X'08EF')MQRC_TERMINATION_FAILED
Explanation:
This reason should be returned by an installable service component when the component is unable to complete termination successfully. 

On z/OS, this reason code does not occur.
Completion Code:
MQCC_FAILED

Programmer Response:
Correct the error and retry the operation.

2288 (X'08F0')MQRC_UNKNOWN_Q_NAME
Explanation:
This reason should be returned by the MQZ_LOOKUP_NAME installable service component when the name specified for the QName parameter is not recognized. 

On z/OS, this reason code does not occur.
Completion Code:
MQCC_FAILED

Programmer Response:
None. See the WebSphere MQ System Administration Guide book for information about installable services.

2289 (X'08F1')MQRC_SERVICE_ERROR
Explanation:
This reason should be returned by an installable service component when the component encounters an unexpected error. 

On z/OS, this reason code does not occur.
Completion Code:
MQCC_FAILED

Programmer Response:
Correct the error and retry the operation.

2290 (X'08F2')MQRC_Q_ALREADY_EXISTS
Explanation:
This reason should be returned by the MQZ_INSERT_NAME installable service component when the queue specified by the QName parameter is already defined to the name service. 

On z/OS, this reason code does not occur.
Completion Code:
MQCC_FAILED

Programmer Response:
None. See the WebSphere MQ System Administration Guide book for information about installable service.

2291 (X'08F3')MQRC_USER_ID_NOT_AVAILABLE
Explanation:
This reason should be returned by the MQZ_FIND_USERID installable service component when the user ID cannot be determined. 

On z/OS, this reason code does not occur.
Completion Code:
MQCC_FAILED

Programmer Response:
None. See the WebSphere MQ System Administration Guide book for information about installable services.

2292 (X'08F4')MQRC_UNKNOWN_ENTITY
Explanation:
This reason should be returned by the authority installable service component when the name specified by the EntityName parameter is not recognized. 

On z/OS, this reason code does not occur.
Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the entity is defined.

2294 (X'08F6')MQRC_UNKNOWN_REF_OBJECT
Explanation:
This reason should be returned by the MQZ_COPY_ALL_AUTHORITY installable service component when the name specified by the RefObjectName parameter is not recognized. 

On z/OS, this reason code does not occur.
Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the reference object is defined. See the WebSphere MQ System Administration Guide book for information about installable services.

2295 (X'08F7')MQRC_CHANNEL_ACTIVATED
Explanation:
This condition is detected when a channel that has been waiting to become active, and for which a Channel Not Activated event has been generated, is now able to become active because an active slot has been released by another channel.

This event is not generated for a channel that is able to become active without waiting for an active slot to be released.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2296 (X'08F8')MQRC_CHANNEL_NOT_ACTIVATED
Explanation:
This condition is detected when a channel is required to become active, either because it is starting or because it is about to make another attempt to establish connection with its partner. However, it is unable to do so because the limit on the number of active channels has been reached. 

On z/OS, the maximum number of active channels is given by the ACTCHL queue manager attribute. 
In other environments, the maximum number of active channels is given by the MaxActiveChannels parameter in the qm.ini file.
The channel waits until it is able to take over an active slot released when another channel ceases to be active. At that time a Channel Activated event is generated.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2297 (X'08F9')MQRC_UOW_CANCELED
Explanation:
An MQI call was issued, but the unit of work (TM/MP transaction) being used for the MQ operation had been canceled. This may have been done by TM/MP itself (for example, due to the transaction running for too long, or exceeding audit trail sizes), or by the application program issuing an ABORT_TRANSACTION. All updates performed to resources owned by the queue manager are backed out.

Completion Code:
MQCC_FAILED

Programmer Response:
Refer to the operating system's Transaction Management Operations Guide to determine how the Transaction Manager can be tuned to avoid the problem of system limits being exceeded.

2298 (X'08FA')MQRC_FUNCTION_NOT_SUPPORTED
Explanation:
The function requested is not available in the current environment.

Completion Code:
MQCC_FAILED

Programmer Response:
Remove the call from the application.

2299 (X'08FB')MQRC_SELECTOR_TYPE_ERROR
Explanation:
The Selector parameter has the wrong data type; it must be of type Long.

Completion Code:
MQCC_FAILED

Programmer Response:
Declare the Selector parameter as Long.

2300 (X'08FC')MQRC_COMMAND_TYPE_ERROR
Explanation:
The mqExecute call was issued, but the value of the MQIASY_TYPE data item in the administration bag is not MQCFT_COMMAND.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the MQIASY_TYPE data item in the administration bag has the value MQCFT_COMMAND.

2301 (X'08FD')MQRC_MULTIPLE_INSTANCE_ERROR
Explanation:
The Selector parameter specifies a system selector (one of the MQIASY_* values), but the value of the ItemIndex parameter is not MQIND_NONE. Only one instance of each system selector can exist in the bag.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify MQIND_NONE for the ItemIndex parameter.

2302 (X'08FE')MQRC_SYSTEM_ITEM_NOT_ALTERABLE
Explanation:
A call was issued to modify the value of a system data item in a bag (a data item with one of the MQIASY_* selectors), but the call failed because the data item is one that cannot be altered by the application.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify the selector of a user-defined data item, or remove the call.

2303 (X'08FF')MQRC_BAG_CONVERSION_ERROR
Explanation:
The mqBufferToBag or mqGetBag call was issued, but the data in the buffer or message could not be converted into a bag. This occurs when the data to be converted is not valid PCF.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the logic of the application that created the buffer or message to ensure that the buffer or message contains valid PCF.

If the message contains PCF that is not valid, the message cannot be retrieved using the mqGetBag call: 

If one of the MQGMO_BROWSE_* options was specified, the message remains on the queue and can be retrieved using the MQGET call. 
In other cases, the message has already been removed from the queue and discarded. If the message was retrieved within a unit of work, the unit of work can be backed out and the message retrieved using the MQGET call.
2304 (X'0900')MQRC_SELECTOR_OUT_OF_RANGE
Explanation:
The Selector parameter has a value that is outside the valid range for the call. If the bag was created with the MQCBO_CHECK_SELECTORS option: 

For the mqAddInteger call, the value must be within the range MQIA_FIRST through MQIA_LAST. 
For the mqAddString call, the value must be within the range MQCA_FIRST through MQCA_LAST.
If the bag was not created with the MQCBO_CHECK_SELECTORS option: 

The value must be zero or greater.
Completion Code:
MQCC_FAILED

Programmer Response:
Specify a valid value.

2305 (X'0901')MQRC_SELECTOR_NOT_UNIQUE
Explanation:
The ItemIndex parameter has the value MQIND_NONE, but the bag contains more than one data item with the selector value specified by the Selector parameter. MQIND_NONE requires that the bag contain only one occurrence of the specified selector.

This reason code also occurs on the mqExecute call when the administration bag contains two or more occurrences of a selector for a required parameter that permits only one occurrence.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the logic of the application that created the bag. If correct, specify for ItemIndex a value that is zero or greater, and add application logic to process all of the occurrences of the selector in the bag.

Review the description of the administration command being issued, and ensure that all required parameters are defined correctly in the bag.

2306 (X'0902')MQRC_INDEX_NOT_PRESENT
Explanation:
The specified index is not present: 

For a bag, this means that the bag contains one or more data items that have the selector value specified by the Selector parameter, but none of them has the index value specified by the ItemIndex parameter. The data item identified by the Selector and ItemIndex parameters must exist in the bag. 
For a namelist, this means that the index parameter value is too large, and outside the range of valid values.
Completion Code:
MQCC_FAILED

Programmer Response:
Specify the index of a data item that does exist in the bag or namelist. Use the mqCountItems call to determine the number of data items with the specified selector that exist in the bag, or the nameCount method to determine the number of names in the namelist.

2307 (X'0903')MQRC_STRING_ERROR
Explanation:
The String parameter is not valid. Either the parameter pointer is not valid, or it points to read-only storage. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the parameter.

2308 (X'0904')MQRC_ENCODING_NOT_SUPPORTED
Explanation:
The Encoding field in the message descriptor MQMD contains a value that is not supported: 

For the mqPutBag call, the field in error resides in the MsgDesc parameter of the call. 
For the mqGetBag call, the field in error resides in: 
The MsgDesc parameter of the call if the MQGMO_CONVERT option was specified. 
The message descriptor of the message about to be retrieved if MQGMO_CONVERT was not specified.
Completion Code:
MQCC_FAILED

Programmer Response:
The value must be MQENC_NATIVE.

If the value of the Encoding field in the message is not valid, the message cannot be retrieved using the mqGetBag call: 

If one of the MQGMO_BROWSE_* options was specified, the message remains on the queue and can be retrieved using the MQGET call. 
In other cases, the message has already been removed from the queue and discarded. If the message was retrieved within a unit of work, the unit of work can be backed out and the message retrieved using the MQGET call.
2309 (X'0905')MQRC_SELECTOR_NOT_PRESENT
Explanation:
The Selector parameter specifies a selector that does not exist in the bag.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a selector that does exist in the bag.

2310 (X'0906')MQRC_OUT_SELECTOR_ERROR
Explanation:
The OutSelector parameter is not valid. Either the parameter pointer is not valid, or it points to read-only storage. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the parameter.

2311 (X'0907')MQRC_STRING_TRUNCATED
Explanation:
The string returned by the call is too long to fit in the buffer provided. The string has been truncated to fit in the buffer.

Completion Code:
MQCC_FAILED

Programmer Response:
If the entire string is required, provide a larger buffer. On the mqInquireString call, the StringLength parameter is set by the call to indicate the size of the buffer required to accommodate the string without truncation.

2312 (X'0908')MQRC_SELECTOR_WRONG_TYPE
Explanation:
A data item with the specified selector exists in the bag, but has a data type that conflicts with the data type implied by the call being used. For example, the data item might have an integer data type, but the call being used might be mqSetString, which implies a character data type.

This reason code also occurs on the mqBagToBuffer, mqExecute, and mqPutBag calls when mqAddString or mqSetString was used to add the MQIACF_INQUIRY data item to the bag.

Completion Code:
MQCC_FAILED

Programmer Response:
For the mqSetInteger and mqSetString calls, specify MQIND_ALL for the ItemIndex parameter to delete from the bag all existing occurrences of the specified selector before creating the new occurrence with the required data type.

For the mqInquireBag, mqInquireInteger, and mqInquireString calls, use the mqInquireItemInfo call to determine the data type of the item with the specified selector, and then use the appropriate call to determine the value of the data item.

For the mqBagToBuffer, mqExecute, and mqPutBag calls, ensure that the MQIACF_INQUIRY data item is added to the bag using the mqAddInteger or mqSetInteger calls.

2313 (X'0909')MQRC_INCONSISTENT_ITEM_TYPE
Explanation:
The mqAddInteger or mqAddString call was issued to add another occurrence of the specified selector to the bag, but the data type of this occurrence differed from the data type of the first occurrence.

This reason can also occur on the mqBufferToBag and mqGetBag calls, where it indicates that the PCF in the buffer or message contains a selector that occurs more than once but with inconsistent data types.

Completion Code:
MQCC_FAILED

Programmer Response:
For the mqAddInteger and mqAddString calls, use the call appropriate to the data type of the first occurrence of that selector in the bag.

For the mqBufferToBag and mqGetBag calls, check the logic of the application that created the buffer or sent the message to ensure that multiple-occurrence selectors occur with only one data type. A message that contains a mixture of data types for a selector cannot be retrieved using the mqGetBag call: 

If one of the MQGMO_BROWSE_* options was specified, the message remains on the queue and can be retrieved using the MQGET call. 
In other cases, the message has already been removed from the queue and discarded. If the message was retrieved within a unit of work, the unit of work can be backed out and the message retrieved using the MQGET call.
2314 (X'090A')MQRC_INDEX_ERROR
Explanation:
An index parameter to a call or method has a value that is not valid. The value must be zero or greater. For bag calls, certain MQIND_* values can also be specified: 

For the mqDeleteItem, mqSetInteger and mqSetString calls, MQIND_ALL and MQIND_NONE are valid. 
For the mqInquireBag, mqInquireInteger, mqInquireString, and mqInquireItemInfo calls, MQIND_NONE is valid.
Completion Code:
MQCC_FAILED

Programmer Response:
Specify a valid value.

2315 (X'090B')MQRC_SYSTEM_BAG_NOT_ALTERABLE
Explanation:
A call was issued to add a data item to a bag, modify the value of an existing data item in a bag, or retrieve a message into a bag, but the call failed because the bag is one that had been created by the system as a result of a previous mqExecute call. System bags cannot be modified by the application.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify the handle of a bag created by the application, or remove the call.

2316 (X'090C')MQRC_ITEM_COUNT_ERROR
Explanation:
The mqTruncateBag call was issued, but the ItemCount parameter specifies a value that is not valid. The value is either less than zero, or greater than the number of user-defined data items in the bag.

This reason also occurs on the mqCountItems call if the parameter pointer is not valid, or points to read-only storage. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a valid value. Use the mqCountItems call to determine the number of user-defined data items in the bag.

2317 (X'090D')MQRC_FORMAT_NOT_SUPPORTED
Explanation:
The Format field in the message descriptor MQMD contains a value that is not supported: 

In an administration message, the format value must be one of the following: MQFMT_ADMIN, MQFMT_EVENT, MQFMT_PCF. For the mqPutBag call, the field in error resides in the MsgDesc parameter of the call. For the mqGetBag call, the field in error resides in the message descriptor of the message about to be retrieved. 
On z/OS, the message was put to the command input queue with a format value of MQFMT_ADMIN, but the version of MQ being used does not support that format for commands.
Completion Code:
MQCC_FAILED

Programmer Response:
If the error occurred when putting a message, correct the format value.

If the error occurred when getting a message, the message cannot be retrieved using the mqGetBag call: 

If one of the MQGMO_BROWSE_* options was specified, the message remains on the queue and can be retrieved using the MQGET call. 
In other cases, the message has already been removed from the queue and discarded. If the message was retrieved within a unit of work, the unit of work can be backed out and the message retrieved using the MQGET call.
2318 (X'090E')MQRC_SELECTOR_NOT_SUPPORTED
Explanation:
The Selector parameter specifies a value that is a system selector (a value that is negative), but the system selector is not one that is supported by the call.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a selector value that is supported.

2319 (X'090F')MQRC_ITEM_VALUE_ERROR
Explanation:
The mqInquireBag or mqInquireInteger call was issued, but the ItemValue parameter is not valid. Either the parameter pointer is not valid, or it points to read-only storage. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the parameter.

2320 (X'0910')MQRC_HBAG_ERROR
Explanation:
A call was issued that has a parameter that is a bag handle, but the handle is not valid. For output parameters, this reason also occurs if the parameter pointer is not valid, or points to read-only storage. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the parameter.

2321 (X'0911')MQRC_PARAMETER_MISSING
Explanation:
An administration message requires a parameter that is not present in the administration bag. This reason code occurs only for bags created with the MQCBO_ADMIN_BAG or MQCBO_REORDER_AS_REQUIRED options.

Completion Code:
MQCC_FAILED

Programmer Response:
Review the description of the administration command being issued, and ensure that all required parameters are present in the bag.

2322 (X'0912')MQRC_CMD_SERVER_NOT_AVAILABLE
Explanation:
The command server that processes administration commands is not available.

Completion Code:
MQCC_FAILED

Programmer Response:
Start the command server.

2323 (X'0913')MQRC_STRING_LENGTH_ERROR
Explanation:
The StringLength parameter is not valid. Either the parameter pointer is not valid, or it points to read-only storage. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the parameter.

2324 (X'0914')MQRC_INQUIRY_COMMAND_ERROR
Explanation:
The mqAddInquiry call was used previously to add attribute selectors to the bag, but the command code to be used for the mqBagToBuffer, mqExecute, or mqPutBag call is not recognized. As a result, the correct PCF message cannot be generated.

Completion Code:
MQCC_FAILED

Programmer Response:
Remove the mqAddInquiry calls and use instead the mqAddInteger call with the appropriate MQIACF_*_ATTRS or MQIACH_*_ATTRS selectors.

2325 (X'0915')MQRC_NESTED_BAG_NOT_SUPPORTED
Explanation:
A bag that is input to the call contains nested bags. Nested bags are supported only for bags that are output from the call.

Completion Code:
MQCC_FAILED

Programmer Response:
Use a different bag as input to the call.

2326 (X'0916')MQRC_BAG_WRONG_TYPE
Explanation:
The Bag parameter specifies the handle of a bag that has the wrong type for the call. The bag must be an administration bag, that is, it must be created with the MQCBO_ADMIN_BAG option specified on the mqCreateBag call.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify the MQCBO_ADMIN_BAG option when the bag is created.

2327 (X'0917')MQRC_ITEM_TYPE_ERROR
Explanation:
The mqInquireItemInfo call was issued, but the ItemType parameter is not valid. Either the parameter pointer is not valid, or it points to read-only storage. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the parameter.

2328 (X'0918')MQRC_SYSTEM_BAG_NOT_DELETABLE
Explanation:
An mqDeleteBag call was issued to delete a bag, but the call failed because the bag is one that had been created by the system as a result of a previous mqExecute call. System bags cannot be deleted by the application.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify the handle of a bag created by the application, or remove the call.

2329 (X'0919')MQRC_SYSTEM_ITEM_NOT_DELETABLE
Explanation:
A call was issued to delete a system data item from a bag (a data item with one of the MQIASY_* selectors), but the call failed because the data item is one that cannot be deleted by the application.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify the selector of a user-defined data item, or remove the call.

2330 (X'091A')MQRC_CODED_CHAR_SET_ID_ERROR
Explanation:
The CodedCharSetId parameter is not valid. Either the parameter pointer is not valid, or it points to read-only storage. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the parameter.

2331 (X'091B')MQRC_MSG_TOKEN_ERROR
Explanation:
An MQGET call was issued to retrieve a message using the message token as a selection criterion, but the options specified are not valid, because MQMO_MATCH_MSG_TOKEN was specified with either MQGMO_WAIT or MQGMO_SET_SIGNAL.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Remove the MQMO_MATCH_MSG_TOKEN option from the MQGET call.

2332 (X'091C')MQRC_MISSING_WIH
Explanation:
An MQPUT or MQPUT1 call was issued to put a message on a queue whose IndexType attribute had the value MQIT_MSG_TOKEN, but the Format field in the MQMD was not MQFMT_WORK_INFO_HEADER. This error occurs only when the message arrives at the destination queue manager.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Modify the application to ensure that it places an MQWIH structure at the start of the message data, and sets the Format field in the MQMD to MQFMT_WORK_INFO_HEADER. Alternatively, change the ApplType attribute of the process definition used by the destination queue to be MQAT_WLM, and specify the required service name and service step name in its EnvData attribute.

2333 (X'091D')MQRC_WIH_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQWIH structure that is not valid. Possible errors include the following: 

The StrucId field is not MQWIH_STRUC_ID. 
The Version field is not MQWIH_VERSION_1. 
The StrucLength field is not MQWIH_LENGTH_1. 
The CodedCharSetId field is zero, or a negative value that is not valid. 
The BufferLength parameter of the call has a value that is too small to accommodate the structure (the structure extends beyond the end of the message).

On z/OS, this error also occurs when the IndexType attribute of the queue is MQIT_MSG_TOKEN, but the message data does not begin with an MQWIH structure.
Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly. Ensure that the application sets the CodedCharSetId field to a valid value (note: MQCCSI_DEFAULT, MQCCSI_EMBEDDED, MQCCSI_Q_MGR, and MQCCSI_UNDEFINED are not valid in this field). 

On z/OS, if the queue has an IndexType of MQIT_MSG_TOKEN, ensure that the message data begins with an MQWIH structure.
2334 (X'091E')MQRC_RFH_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQRFH or MQRFH2 structure that is not valid. Possible errors include the following: 

The StrucId field is not MQRFH_STRUC_ID. 
The Version field is not MQRFH_VERSION_1 (MQRFH), or MQRFH_VERSION_2 (MQRFH2). 
The StrucLength field specifies a value that is too small to include the structure plus the variable-length data at the end of the structure. 
The CodedCharSetId field is zero, or a negative value that is not valid. 
The BufferLength parameter of the call has a value that is too small to accommodate the structure (the structure extends beyond the end of the message).
Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly. Ensure that the application sets the CodedCharSetId field to a valid value (note: MQCCSI_DEFAULT, MQCCSI_EMBEDDED, MQCCSI_Q_MGR, and MQCCSI_UNDEFINED are not valid in this field).

2335 (X'091F')MQRC_RFH_STRING_ERROR
Explanation:
The contents of the NameValueString field in the MQRFH structure are not valid. NameValueString must adhere to the following rules: 

The string must consist of zero or more name/value pairs separated from each other by one or more blanks; the blanks are not significant. 
If a name or value contains blanks that are significant, the name or value must be enclosed in double-quote characters. 
If a name or value itself contains one or more double-quote characters, the name or value must be enclosed in double-quote characters, and each embedded double-quote character must be doubled. 
A name or value can contain any characters other than the null, which acts as a delimiter. The null and characters following it, up to the defined length of NameValueString, are ignored.
The following is a valid NameValueString: 

Famous_Words "The program displayed ""Hello World"""Completion Code:
MQCC_FAILED

Programmer Response:
Modify the application that generated the message to ensure that it places in the NameValueString field data that adheres to the rules listed above. Check that the StrucLength field is set to the correct value.

2336 (X'0920')MQRC_RFH_COMMAND_ERROR
Explanation:
The message contains an MQRFH structure, but the command name contained in the NameValueString field is not valid.

Completion Code:
MQCC_FAILED

Programmer Response:
Modify the application that generated the message to ensure that it places in the NameValueString field a command name that is valid.

2337 (X'0921')MQRC_RFH_PARM_ERROR
Explanation:
The message contains an MQRFH structure, but a parameter name contained in the NameValueString field is not valid for the command specified.

Completion Code:
MQCC_FAILED

Programmer Response:
Modify the application that generated the message to ensure that it places in the NameValueString field only parameters that are valid for the specified command.

2338 (X'0922')MQRC_RFH_DUPLICATE_PARM
Explanation:
The message contains an MQRFH structure, but a parameter occurs more than once in the NameValueString field when only one occurrence is valid for the specified command.

Completion Code:
MQCC_FAILED

Programmer Response:
Modify the application that generated the message to ensure that it places in the NameValueString field only one occurrence of the parameter.

2339 (X'0923')MQRC_RFH_PARM_MISSING
Explanation:
The message contains an MQRFH structure, but the command specified in the NameValueString field requires a parameter that is not present.

Completion Code:
MQCC_FAILED

Programmer Response:
Modify the application that generated the message to ensure that it places in the NameValueString field all parameters that are required for the specified command.

2340 (X'0924')MQRC_CHAR_CONVERSION_ERROR
Explanation:
This reason code is returned by the Java MQQueueManager constructor when a required character-set conversion is not available. The conversion required is between two nonUnicode character sets.

This reason code occurs in the following environment: MQ Classes for Java on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the National Language Resources component of the OS/390 Language Environment is installed, and that conversion between the IBM-1047 and ISO8859-1 character sets is available.

2341 (X'0925')MQRC_UCS2_CONVERSION_ERROR
Explanation:
This reason code is returned by the Java MQQueueManager constructor when a required character-set conversion is not available. The conversion required is between the UCS-2 Unicode character set and the queue-manager's character set. IBM-500 is used for the queue-manager's character set if no specific value is available.

This reason code occurs in the following environment: MQ Classes for Java on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the relevant Unicode conversion tables are installed, and that they are available to the z/OS Language Environment. The conversion tables should be installed as part of the z/OS C/C++ optional feature. Refer to the z/OS C/C++ Programming Guide for more information about enabling UCS-2 conversions.

2342 (X'0926')MQRC_DB2_NOT_AVAILABLE
Explanation:
An MQOPEN, MQPUT1, or MQSET call, or a command, was issued to access a shared queue, but it failed because the queue manager is not connected to a DB2 subsystem. As a result, the queue manager is unable to access the object definition relating to the shared queue.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Configure the DB2 subsystem so that the queue manager can connect to it.

2343 (X'0927')MQRC_OBJECT_NOT_UNIQUE
Explanation:
An MQOPEN or MQPUT1 call, or a command, was issued to access a queue, but the call failed because the queue specified cannot be resolved unambiguously. There exists a shared queue with the specified name, and a nonshared queue with the same name.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
One of the queues must be deleted. If the queue to be deleted contains messages, use the MQSC command MOVE QLOCAL to move the messages to a different queue, and then use the command DELETE QLOCAL to delete the queue.

2344 (X'0928')MQRC_CONN_TAG_NOT_RELEASED
Explanation:
An MQDISC call was issued when there was a unit of work outstanding for the connection handle. For CICS, IMS, and RRS connections, the MQDISC call does not commit or back out the unit of work. As a result, the connection tag associated with the unit of work is not yet available for reuse. The tag becomes available for reuse only when processing of the unit of work has been completed.

This reason code occurs only on z/OS.

Completion Code:
MQCC_WARNING

Programmer Response:
Do not try to reuse the connection tag immediately. If the MQCONNX call is issued with the same connection tag, and that tag is still in use, the call fails with reason code MQRC_CONN_TAG_IN_USE.

2345 (X'0929')MQRC_CF_NOT_AVAILABLE
Explanation:
An MQOPEN or MQPUT1 call was issued to access a shared queue, but the allocation of the coupling-facility structure specified in the queue definition failed because there is no suitable coupling facility to hold the structure, based on the preference list in the active CFRM policy.

This reason code can also occur when the API call requires a capability that is not supported by the CF level defined in the coupling-facility structure object. For example, this reason code is returned by an attempt to open a shared queue that has a index type of MQIT_GROUP_ID, but the coupling-facility structure for the queue has a CF level lower than three.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Make available a coupling facility with one of the names specified in the CFRM policy, or modify the CFRM policy to specify the names of coupling facilities that are available.

2346 (X'092A')MQRC_CF_STRUC_IN_USE
Explanation:
An MQI call or command was issued to operate on a shared queue, but the call failed because the coupling-facility structure specified in the queue definition is temporarily unavailable. The coupling-facility structure can be unavailable because a structure dump is in progress, or new connectors to the structure are currently inhibited, or an existing connector to the structure failed or disconnected abnormally and clean-up is not yet complete.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
The problem is temporary; wait a short while and then retry the operation.

2347 (X'092B')MQRC_CF_STRUC_LIST_HDR_IN_USE
Explanation:
An MQGET, MQOPEN, MQPUT1, or MQSET call was issued to access a shared queue, but the call failed because the list header associated with the coupling-facility structure specified in the queue definition is temporarily unavailable. The list header is unavailable because it is undergoing recovery processing.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
The problem is temporary; wait a short while and then retry the operation.

2348 (X'092C')MQRC_CF_STRUC_AUTH_FAILED
Explanation:
An MQOPEN or MQPUT1 call was issued to access a shared queue, but the call failed because the user is not authorized to access the coupling-facility structure specified in the queue definition.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Modify the security profile for the user identifier used by the application so that the application can access the coupling-facility structure specified in the queue definition.

2349 (X'092D')MQRC_CF_STRUC_ERROR
Explanation:
An MQOPEN or MQPUT1 call was issued to access a shared queue, but the call failed because the coupling-facility structure name specified in the queue definition is not defined in the CFRM data set, or is not the name of a list structure.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Modify the queue definition to specify the name of a coupling-facility list structure that is defined in the CFRM data set.

2350 (X'092E')MQRC_CONN_TAG_NOT_USABLE
Explanation:
An MQCONNX call was issued specifying one of the MQCNO_*_CONN_TAG_* options, but the call failed because the connection tag specified by ConnTag in MQCNO is being used by the queue manager for recovery processing, and this processing is delayed pending recovery of the coupling facility.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
The problem is likely to persist. Consult the system programmer to ascertain the cause of the problem.

2351 (X'092F')MQRC_GLOBAL_UOW_CONFLICT
Explanation:
An attempt was made to use inside a global unit of work a connection handle that is participating in another global unit of work. This can occur when an application passes connection handles between objects where the objects are involved in different DTC transactions. Because transaction completion is asynchronous, it is possible for this error to occur after the application has finalized the first object and committed its transaction.

This error does not occur for nontransactional MQI calls.

This reason code occurs only on Windows and z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the "MTS Transaction Support" attribute defined for the object's class is set correctly. If necessary, modify the application so that the connection handle is not used by objects participating in different units of work.

2352 (X'0930')MQRC_LOCAL_UOW_CONFLICT
Explanation:
An attempt was made to use inside a global unit of work a connection handle that is participating in a queue-manager coordinated local unit of work. This can occur when an application passes connection handles between objects where one object is involved in a DTC transaction and the other is not.

This error does not occur for nontransactional MQI calls.

This reason code occurs only on Windows and z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the "MTS Transaction Support" attribute defined for the object's class is set correctly. If necessary, modify the application so that the connection handle is not used by objects participating in different units of work.

2353 (X'0931')MQRC_HANDLE_IN_USE_FOR_UOW
Explanation:
An attempt was made to use outside a unit of work a connection handle that is participating in a global unit of work.

This error can occur when an application passes connection handles between objects where one object is involved in a DTC transaction and the other is not. Because transaction completion is asynchronous, it is possible for this error to occur after the application has finalized the first object and committed its transaction.

This error can also occur when a single object that was created and associated with the transaction loses that association whilst the object is running. The association is lost when DTC terminates the transaction independently of MTS. This might be because the transaction timed out, or because DTC shut down.

This error does not occur for nontransactional MQI calls.

This reason code occurs only on Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the "MTS Transaction Support" attribute defined for the object's class is set correctly. If necessary, modify the application so that objects executing within different units of work do not try to use the same connection handle.

2354 (X'0932')MQRC_UOW_ENLISTMENT_ERROR
Explanation:
This reason code can occur for a variety of reasons. The most likely reason is that an object created by a DTC transaction does not issue a transactional MQI call until after the DTC transaction has timed out. (If the DTC transaction times out after a transactional MQI call has been issued, reason code MQRC_HANDLE_IN_USE_FOR_UOW is returned by the failing MQI call.)

Another cause of MQRC_UOW_ENLISTMENT_ERROR is incorrect installation; Windows NT Service pack must be installed after the Windows NT Option pack.

This reason code occurs only on Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the DTC "Transaction timeout" value. If necessary, verify the NT installation order.

2355 (X'0933')MQRC_UOW_MIX_NOT_SUPPORTED
Explanation:
The mixture of calls used by the application to perform operations within a unit of work is not supported. In particular, it is not possible to mix within the same process a local unit of work coordinated by the queue manager with a global unit of work coordinated by DTC (Distributed Transaction Coordinator).

An application may cause this mixture to arise if some objects in a package are coordinated by DTC and others are not. It can also occur if transactional MQI calls from an MTS client are mixed with transactional MQI calls from a library package transactional MTS object.

No problem arises if all transactional MQI calls originate from transactional MTS objects, or all transactional MQI calls originate from nontransactional MTS objects. But when a mixture of styles is used, the first style used fixes the style for the unit of work, and subsequent attempts to use the other style within the process fail with reason code MQRC_UOW_MIX_NOT_SUPPORTED.

When an application is run twice, scheduling factors in the operating system mean that it is possible for the queue-manager-coordinated transactional calls to fail in one run, and for the DTC-coordinated transactional calls to fail in the other run.

This reason code occurs only on Windows when running a version of the queue manager prior to version 5.2.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the "MTS Transaction Support" attribute defined for the object's class is set correctly. If necessary, modify the application so that objects executing within different units of work do not try to use the same connection handle.

2356 (X'0934')MQRC_WXP_ERROR
Explanation:
An MQXCLWLN call was issued from a cluster workload exit to obtain the address of the next record in the chain, but the workload exit parameter structure ExitParms is not valid, for one of the following reasons: 

The parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.) 
The StrucId field is not MQWXP_STRUC_ID. 
The Version field is not MQWXP_VERSION_2. 
The CacheContext field does not contain the value passed to the exit by the queue manager.
Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the parameter specified for ExitParms is the MQWXP structure that was passed to the exit when the exit was invoked.

2357 (X'0935')MQRC_CURRENT_RECORD_ERROR
Explanation:
An MQXCLWLN call was issued from a cluster workload exit to obtain the address of the next record in the chain, but the address specified by the CurrentRecord parameter is not the address of a valid record. CurrentRecord must be the address of a destination record (MQWDR), queue record (MQWQR), or cluster record (MQWCR) residing within the cluster cache.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the cluster workload exit passes the address of a valid record residing in the cluster cache.

2358 (X'0936')MQRC_NEXT_OFFSET_ERROR
Explanation:
An MQXCLWLN call was issued from a cluster workload exit to obtain the address of the next record in the chain, but the offset specified by the NextOffset parameter is not valid. NextOffset must be the value of one of the following fields: 

ChannelDefOffset field in MQWDR 
ClusterRecOffset field in MQWDR 
ClusterRecOffset field in MQWQR 
ClusterRecOffset field in MQWCR
Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the value specified for the NextOffset parameter is the value of one of the fields listed above.

2359 (X'0937')MQRC_NO_RECORD_AVAILABLE
Explanation:
An MQXCLWLN call was issued from a cluster workload exit to obtain the address of the next record in the chain, but the current record is the last record in the chain.

Completion Code:
MQCC_FAILED

Programmer Response:
None.

2360 (X'0938')MQRC_OBJECT_LEVEL_INCOMPATIBLE
Explanation:
An MQOPEN or MQPUT1 call, or a command, was issued, but the definition of the object to be accessed is not compatible with the queue manager to which the application has connected. The object definition was created or modified by a different version of the queue manager.

If the object to be accessed is a queue, the incompatible object definition could be the object specified, or one of the object definitions used to resolve the specified object (for example, the base queue to which an alias queue resolves, or the transmission queue to which a remote queue or queue-manager alias resolves).

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
The application must be run on a queue manager that is compatible with the object definition. Refer to the WebSphere MQ for z/OS Concepts and Planning Guide and the WebSphere MQ for z/OS System Setup Guide for information about compatibility and migration between different versions of the queue manager.

2361 (X'0939')MQRC_NEXT_RECORD_ERROR
Explanation:
An MQXCLWLN call was issued from a cluster workload exit to obtain the address of the next record in the chain, but the address specified for the NextRecord parameter is either null, not valid, or the address of read-only storage. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a valid address for the NextRecord parameter.

2362 (X'093A')MQRC_BACKOUT_THRESHOLD_REACHED
Explanation:
This reason code occurs only in the Reason field in an MQDLH structure, or in the Feedback field in the MQMD of a report message.

A JMS ConnectionConsumer found a message that exceeds the queue's backout threshold. The queue does not have a backout requeue queue defined, so the message was processed as specified by the disposition options in the Report field in the MQMD of the message.

On queue managers that do not support the BackoutThreshold and BackoutRequeueQName queue attributes, JMS ConnectionConsumer uses a value of 20 for the backout threshold. When the BackoutCount of a message reaches this threshold, the message is processed as specified by the disposition options.

If the Report field specifies one of the MQRO_EXCEPTION_* options, this reason code appears in the Feedback field of the report message. If the Report field specifies MQRO_DEAD_LETTER_Q, or the disposition report options are left as default, this reason code appears in the Reason field of the MQDLH.

Completion Code:
None

Programmer Response:
Investigate the cause of the backout count being greater than the threshold. To correct this, define the backout queue for the queue concerned.

2363 (X'093B')MQRC_MSG_NOT_MATCHED
Explanation:
This reason code occurs only in the Reason field in an MQDLH structure, or in the Feedback field in the MQMD of a report message.

While performing Point-to-Point messaging, JMS encountered a message matching none of the selectors of ConnectionConsumers monitoring the queue. To maintain performance, the message was processed as specified by the disposition options in the Report field in the MQMD of the message.

If the Report field specifies one of the MQRO_EXCEPTION_* options, this reason code appears in the Feedback field of the report message. If the Report field specifies MQRO_DEAD_LETTER_Q, or the disposition report options are left as default, this reason code appears in the Reason field of the MQDLH.

Completion Code:
None

Programmer Response:
To correct this, ensure that the ConnectionConsumers monitoring the queue provide a complete set of selectors. Alternatively, set the QueueConnectionFactory to retain messages.

2364 (X'093C')MQRC_JMS_FORMAT_ERROR
Explanation:
This reason code is generated when JMS encounters a message that it is unable to parse. If such a message is encountered by a JMS ConnectionConsumer, the message is processed as specified by the disposition options in the Report field in the MQMD of the message.

If the Report field specifies one of the MQRO_EXCEPTION_* options, this reason code appears in the Feedback field of the report message. If the Report field specifies MQRO_DEAD_LETTER_Q, or the disposition report options are left as default, this reason code appears in the Reason field of the MQDLH.

Completion Code:
None

Programmer Response:
Investigate the origin of the message.

2365 (X'093D')MQRC_SEGMENTS_NOT_SUPPORTED
Explanation:
An MQPUT call was issued to put a segment of a logical message, but the queue on which the message is to be placed has an IndexType of MQIT_GROUP_ID. Message segments cannot be placed on queues with this index type.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Modify the application to put messages that are not segments; ensure that the MQMF_SEGMENT and MQMF_LAST_SEGMENT flags in the MsgFlags field in MQMD are not set, and that the Offset is zero. Alternatively, change the index type of the queue.

2366 (X'093E')MQRC_WRONG_CF_LEVEL
Explanation:
An MQOPEN or MQPUT1 call was issued specifying a shared queue, but the queue requires a coupling-facility structure with a different level of capability.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the coupling-facility structure used for the queue is at the level required to support the capabilities that the queue provides.

2367 (X'093F')MQRC_CONFIG_CREATE_OBJECT
Explanation:
This condition is detected when an object is created.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2368 (X'0940')MQRC_CONFIG_CHANGE_OBJECT
Explanation:
This condition is detected when an object is changed.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2369 (X'0941')MQRC_CONFIG_DELETE_OBJECT
Explanation:
This condition is detected when an object is deleted.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2370 (X'0942')MQRC_CONFIG_REFRESH_OBJECT
Explanation:
This condition is detected when an object is refreshed.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2371 (X'0943')MQRC_CHANNEL_SSL_ERROR
Explanation:
This condition is detected when a connection cannot be established due to an SSL key-exchange or authentication failure.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2373 (X'0945')MQRC_CF_STRUC_FAILED
Explanation:
An MQI call or command was issued to access a shared queue, but the call failed because the coupling-facility structure used for the shared queue had failed.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Report the problem to the operator or administrator, who should use the MQSC command RECOVER CFSTRUCT to initiate recovery of the coupling-facility structure

2374 (X'0946')MQRC_API_EXIT_ERROR
Explanation:
An API exit function returned an invalid response code, or failed in some other way.

This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the exit logic to ensure that the exit is returning valid values in the ExitResponse and ExitResponse2 fields of the MQAXP structure. Consult the FFST record to see if it contains more detail about the problem.

2375 (X'0947')MQRC_API_EXIT_INIT_ERROR
Explanation:
The queue manager encountered an error while attempting to initialize the execution environment for an API exit function.

This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Consult the FFST record to obtain more detail about the problem.

2376 (X'0948')MQRC_API_EXIT_TERM_ERROR
Explanation:
The queue manager encountered an error while attempting to terminate the execution environment for an API exit function.

This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Consult the FFST record to obtain more detail about the problem.

2377 (X'0949')MQRC_EXIT_REASON_ERROR
Explanation:
An MQXEP call was issued by an API exit function, but the value specified for the ExitReason parameter is either not valid, or not supported for the specified function identifier Function.

This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Modify the exit function to specify a value for ExitReason that is valid for the specified value of Function.

2378 (X'094A')MQRC_RESERVED_VALUE_ERROR
Explanation:
An MQXEP call was issued by an API exit function, but the value specified for the Reserved parameter is not valid. The value must be the null pointer.

This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Modify the exit to specify the null pointer as the value of the Reserved parameter.

2379 (X'094B')MQRC_NO_DATA_AVAILABLE
Explanation:
This reason should be returned by the MQZ_ENUMERATE_AUTHORITY_DATA installable service component when there is no more authority data to return to the invoker of the service component. 

On z/OS, this reason code does not occur.
Completion Code:
MQCC_FAILED

Programmer Response:
None.

2380 (X'094C')MQRC_SCO_ERROR
Explanation:
On an MQCONNX call, the MQSCO structure is not valid for one of the following reasons: 

The StrucId field is not MQSCO_STRUC_ID. 
The Version field is not MQSCO_VERSION_1.
This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the definition of the MQSCO structure.

2381 (X'094D')MQRC_KEY_REPOSITORY_ERROR
Explanation:
On an MQCONN or MQCONNX call, the location of the key repository is either not specified, not valid, or results in an error when used to access the key repository. The location of the key repository is specified by one of the following: 

The value of the MQSSLKEYR environment variable (MQCONN or MQCONNX call), or 
The value of the KeyRepository field in the MQSCO structure (MQCONNX call only).
For the MQCONNX call, if both MQSSLKEYR and KeyRepository are specified, the latter is used.

This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a valid location for the key repository.

2382 (X'094E')MQRC_CRYPTO_HARDWARE_ERROR
Explanation:
On an MQCONN or MQCONNX call, the configuration string for the cryptographic hardware is not valid, or results in an error when used to configure the cryptographic hardware. The configuration string is specified by one of the following: 

The value of the MQSSLCRYP environment variable (MQCONN or MQCONNX call), or 
The value of the CryptoHardware field in the MQSCO structure (MQCONNX call only).
For the MQCONNX call, if both MQSSLCRYP and CryptoHardware are specified, the latter is used.

This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a valid configuration string for the cryptographic hardware.

2383 (X'094F')MQRC_AUTH_INFO_REC_COUNT_ERROR
Explanation:
On an MQCONNX call, the AuthInfoRecCount field in the MQSCO structure specifies a value that is less than zero.

This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a value for AuthInfoRecCount that is zero or greater.

2384 (X'0950')MQRC_AUTH_INFO_REC_ERROR
Explanation:
On an MQCONNX call, the MQSCO structure does not specify the address of the MQAIR records correctly. One of the following applies: 

AuthInfoRecCount is greater than zero, but AuthInfoRecOffset is zero and AuthInfoRecPtr is the null pointer. 
AuthInfoRecOffset is not zero and AuthInfoRecPtr is not the null pointer. 
AuthInfoRecPtr is not a valid pointer. 
AuthInfoRecOffset or AuthInfoRecPtr points to storage that is not accessible.
This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that one of AuthInfoRecOffset or AuthInfoRecPtr is zero and the other nonzero. Ensure that the field used points to accessible storage.

2385 (X'0951')MQRC_AIR_ERROR
Explanation:
On an MQCONNX call, an MQAIR record is not valid for one of the following reasons: 

The StrucId field is not MQAIR_STRUC_ID. 
The Version field is not MQAIR_VERSION_1.
This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the definition of the MQAIR record.

2386 (X'0952')MQRC_AUTH_INFO_TYPE_ERROR
Explanation:
On an MQCONNX call, the AuthInfoType field in an MQAIR record specifies a value that is not valid.

This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify MQAIT_CRL_LDAP for AuthInfoType.

2387 (X'0953')MQRC_AUTH_INFO_CONN_NAME_ERROR
Explanation:
On an MQCONNX call, the AuthInfoConnName field in an MQAIR record specifies a value that is not valid.

This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a valid connection name.

2388 (X'0954')MQRC_LDAP_USER_NAME_ERROR
Explanation:
On an MQCONNX call, an LDAP user name in an MQAIR record is not specified correctly. One of the following applies: 

LDAPUserNameLength is greater than zero, but LDAPUserNameOffset is zero and LDAPUserNamePtr is the null pointer. 
LDAPUserNameOffset is nonzero and LDAPUserNamePtr is not the null pointer. 
LDAPUserNamePtr is not a valid pointer. 
LDAPUserNameOffset or LDAPUserNamePtr points to storage that is not accessible.
This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that one of LDAPUserNameOffset or LDAPUserNamePtr is zero and the other nonzero. Ensure that the field used points to accessible storage.

2389 (X'0955')MQRC_LDAP_USER_NAME_LENGTH_ERR
Explanation:
On an MQCONNX call, the LDAPUserNameLength field in an MQAIR record specifies a value that is less than zero.

This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a value for LDAPUserNameLength that is zero or greater.

2390 (X'0956')MQRC_LDAP_PASSWORD_ERROR
Explanation:
On an MQCONNX call, the LDAPPassword field in an MQAIR record specifies a value when no value is allowed.

This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a value that is blank or null.

2391 (X'0957')MQRC_SSL_ALREADY_INITIALIZED
Explanation:
An MQCONN or MQCONNX call was issued with SSL configuration options specified, but the SSL environment had already been initialized. The connection to the queue manager completed successfully, but the SSL configuration options specified on the call were ignored; the existing SSL environment was used instead.

This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_WARNING

Programmer Response:
If the application must be run with the SSL configuration options defined on the MQCONN or MQCONNX call, use the MQDISC call to sever the connection to the queue manager and then terminate the application. Alternatively run the application later when the SSL environment has not been initialized.

2392 (X'0958')MQRC_SSL_CONFIG_ERROR
Explanation:
On an MQCONNX call, the MQCNO structure does not specify the MQSCO structure correctly. One of the following applies: 

SSLConfigOffset is nonzero and SSLConfigPtr is not the null pointer. 
SSLConfigPtr is not a valid pointer. 
SSLConfigOffset or SSLConfigPtr points to storage that is not accessible.
This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that one of SSLConfigOffset or SSLConfigPtr is zero and the other nonzero. Ensure that the field used points to accessible storage.

2393 (X'0959')MQRC_SSL_INITIALIZATION_ERROR
Explanation:
An MQCONN or MQCONNX call was issued with SSL configuration options specified, but an error occurred during the initialization of the SSL environment.

This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the SSL installation is correct.

2394 (X'095A')MQRC_Q_INDEX_TYPE_ERROR
Explanation:
An MQGET call was issued specifying one or more of the following options: 

MQGMO_ALL_MSGS_AVAILABLE 
MQGMO_ALL_SEGMENTS_AVAILABLE 
MQGMO_COMPLETE_MSG 
MQGMO_LOGICAL_ORDER
but the call failed because the queue is not indexed by group identifier. These options require the queue to have an IndexType of MQIT_GROUP_ID.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Redefine the queue to have an IndexType of MQIT_GROUP_ID. Alternatively, modify the application to avoid using the options listed above.

2395 (X'095B')MQRC_CFBS_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQCFBS structure that is not valid.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly.

2396 (X'095C')MQRC_SSL_NOT_ALLOWED
Explanation:
A connection to a queue manager was requested, specifying SSL encryption. However, the connection mode requested is one that does not support SSL (for example, bindings connect).

This reason code occurs only with Java applications.

Completion Code:
MQCC_FAILED

Programmer Response:
Modify the application to request client connection mode, or to disable SSL encryption.

2397 (X'095D')MQRC_JSSE_ERROR
Explanation:
JSSE reported an error (for example, while connecting to a queue manager using SSL encryption). The MQException object containing this reason code references the Exception thrown by JSSE; this can be obtained by using the MQException.getCause() method. From JMS, the MQException is linked to the thrown JMSException.

This reason code occurs only with Java applications.

Completion Code:
MQCC_FAILED

Programmer Response:
Inspect the causal exception to determine the JSSE error.

2398 (X'095E')MQRC_SSL_PEER_NAME_MISMATCH
Explanation:
The application attempted to connect to the queue manager using SSL encryption, but the distinguished name presented by the queue manager does not match the specified pattern.

This reason code occurs only with Java applications.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the certificates used to identify the queue manager. Also check the value of the sslPeerName property specified by the application.

2399 (X'095F')MQRC_SSL_PEER_NAME_ERROR
Explanation:
The application specified a peer name of incorrect format.

This reason code occurs only with Java applications.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the value of the sslPeerName property specified by the application.

2400 (X'0960')MQRC_UNSUPPORTED_CIPHER_SUITE
Explanation:
A connection to a queue manager was requested, specifying SSL encryption. However, JSSE reported that it does not support the CipherSuite specified by the application.

This reason code occurs only with Java applications.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the CipherSuite specified by the application. Note that the names of JSSE CipherSuites differ from their equivalent CipherSpecs used by the queue manager.

Also, check that JSSE is correctly installed.

2401 (X'0961')MQRC_SSL_CERTIFICATE_REVOKED
Explanation:
A connection to a queue manager was requested, specifying SSL encryption. However, the certificate presented by the queue manager was found to be revoked by one of the specified CertStores.

This reason code occurs only with Java applications.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the certificates used to identify the queue manager.

2402 (X'0962')MQRC_SSL_CERT_STORE_ERROR
Explanation:
A connection to a queue manager was requested, specifying SSL encryption. However, none of the CertStore objects provided by the application could be searched for the certificate presented by the queue manager. The MQException object containing this reason code references the Exception encountered when searching the first CertStore; this can be obtained using the MQException.getCause() method. From JMS, the MQException is linked to the thrown JMSException.

This reason code occurs only with Java applications.

Completion Code:
MQCC_FAILED

Programmer Response:
Inspect the causal exception to determine the underlying error. Check the CertStore objects provided by your application. If the causal exception is a java.lang.NoSuchElementException, ensure that your application is not specifying an empty collection of CertStore objects.

2406 (X'0966')MQRC_CLIENT_EXIT_LOAD_ERROR
Explanation:
The external user exit required for a client connection could not be loaded because the shared library specified for it cannot be found, or the entry point specified for it cannot be found.

This reason code occurs only with Java applications.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the correct library has been specified, and that the path variable for the machine environment includes the relevant directory. Ensure also that the entry point has been named properly and that the named library does export it.

2407 (X'0967')MQRC_CLIENT_EXIT_ERROR
Explanation:
A failure occured while executing a non-Java user exit for a client connection.

This reason code occurs only with Java applications.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the non-Java user exit can accept the parameters and message being passed to it and that it can handle error conditions, and that any information that the exit requires, such as user data, is correct and available.

2409 (X'0969')MQRC_SSL_KEY_RESET_ERROR
Explanation:
On an MQCONN or MQCONNX call, the value of the SSL key reset count is not in the valid range of 0 through 999 999 999.

The value of the SSL key reset count is specified by either the value of the MQSSLRESET environment variable (MQCONN or MQCONNX call), or the value of the KeyResetCount field in the MQSCO structure (MQCONNX call only). For the MQCONNX call, if both MQSSLRESET and KeyResetCount are specified, the latter is used. MQCONN or MQCONNX

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure and the MQSSLRESET environment variable are set correctly.

2411 (X'096B')MQRC_LOGGER_STATUS
Explanation:
This condition is detected when a logger event occurs.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2412 (X'096C')MQRC_COMMAND_MQSC
Explanation:
This condition is detected when an MQSC command is executed.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2413 (X'096D')MQRC_COMMAND_PCF
Explanation:
This condition is detected when a PCF command is executed.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2414 (X'096E')MQRC_CFIF_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQCFIF structure that is not valid.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly.

2415 (X'096F')MQRC_CFSF_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQCFSF structure that is not valid.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly.

2416 (X'0970')MQRC_CFGR_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQCFGR structure that is not valid.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly.

2417 (X'0971')MQRC_MSG_NOT_ALLOWED_IN_GROUP
Explanation:
An MQPUT or MQPUT1 call was issued to put a message in a group but it is not valid to put such a message in a group. An example of an invalid message is a PCF message where the Type is MQCFT_TRACE_ROUTE.

Completion Code:
MQCC_FAILED

Programmer Response:
Remove the invalid message from the group.

2418 (X'0972')MQRC_FILTER_OPERATOR_ERROR
Explanation:
The Operator parameter supplied is not valid.

If it is an input variable then the value is not one of the MQCFOP_* constant values. If it is an output variable then the parameter pointer is not valid, or it points to read-only storage. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredicatable results occur.)

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the parameter.

2419 (X'0973')MQRC_NESTED_SELECTOR_ERROR
Explanation:
An mqAddBag call was issued, but the bag to be nested contained a data item with an inconsistent selector. This reason only occurs if the bag into which the nested bag was to be added was created with the MQCBO_CHECK_SELECTORS option.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that all data items within the bag to be nested have selectors that are consistent with the data type implied by the item.

2420 (X'0974')MQRC_EPH_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQEPH structure that is not valid. Possible errors include the following: 

The StrucId field is not MQEPH_STRUC_ID. 
The Version field is not MQEPH_VERSION_1. 
The StrucLength field specifies a value that is too small to include the structure plus the variable-length data at the end of the structure. 
The CodedCharSetId field is zero, or a negative value that is not valid. 
The Flags field contains an invalid combination of MQEPH_* values. 
The BufferLength parameter of the call has a value that is too small to accommodate the structure, so the structure extends beyond the end of the message.
Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly. Ensure that the application sets the CodedCharSetId field to a valid value; note that MQCCSI_DEFAULT, MQCCSI_EMBEDDED, MQCCSI_Q_MGR, and MQCCSI_UNDEFINED are not valid in this field.

2421 (X'0975')MQRC_RFH_FORMAT_ERROR
Explanation:
The message contains an MQRFH structure, but its format is incorrect. If you are using WebSphere MQ SOAP, the error is in an incoming SOAP/MQ request message.

Completion Code:
MQCC_FAILED

Programmer Response:
If you are using WebSphere MQ SOAP with the IBM-supplied sender, contact your IBM support center. If you are using WebSphere MQ SOAP with a bespoke sender, check that the RFH2 section of the SOAP/MQ request message is in valid RFH2 format.

2422 (X'0976')MQRC_CFBF_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQCFBF structure that is not valid.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly.

2423 (X'0977')MQRC_CLIENT_CHANNEL_CONFLICT
Explanation:
A client channel definition table was specified for determining the name of the channel, but the name has already been defined.

This reason code occurs only with Java applications.

Completion Code:
MQCC_FAILED

Programmer Response:
Change the channel name to blank and try again.

6100 (X'17D4')MQRC_REOPEN_EXCL_INPUT_ERROR
Explanation:
An open object does not have the correct ImqObject open options and requires one or more additional options. An implicit reopen is required but closure has been prevented.

Closure has been prevented because the queue is open for exclusive input and closure might result in the queue being accessed by another process or thread, before the queue is reopened by the process or thread that presently has access.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

Programmer Response:
Set the open options explicitly to cover all eventualities so that implicit reopening is not required.

6101 (X'17D5')MQRC_REOPEN_INQUIRE_ERROR
Explanation:
An open object does not have the correct ImqObject open options and requires one or more additional options. An implicit reopen is required but closure has been prevented.

Closure has been prevented because one or more characteristics of the object need to be checked dynamically prior to closure, and the open options do not already include MQOO_INQUIRE.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

Programmer Response:
Set the open options explicitly to include MQOO_INQUIRE.

6102 (X'17D6')MQRC_REOPEN_SAVED_CONTEXT_ERR
Explanation:
An open object does not have the correct ImqObject open options and requires one or more additional options. An implicit reopen is required but closure has been prevented.

Closure has been prevented because the queue is open with MQOO_SAVE_ALL_CONTEXT, and a destructive get has been performed previously. This has caused retained state information to be associated with the open queue and this information would be destroyed by closure.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

Programmer Response:
Set the open options explicitly to cover all eventualities so that implicit reopening is not required.

6103 (X'17D7')MQRC_REOPEN_TEMPORARY_Q_ERROR
Explanation:
An open object does not have the correct ImqObject open options and requires one or more additional options. An implicit reopen is required but closure has been prevented.

Closure has been prevented because the queue is a local queue of the definition type MQQDT_TEMPORARY_DYNAMIC, that would be destroyed by closure.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

Programmer Response:
Set the open options explicitly to cover all eventualities so that implicit reopening is not required.

6104 (X'17D8')MQRC_ATTRIBUTE_LOCKED
Explanation:
An attempt has been made to change the value of an attribute of an object while that object is open, or, for an ImqQueueManager object, while that object is connected. Certain attributes cannot be changed in these circumstances. Close or disconnect the object (as appropriate) before changing the attribute value.

An object may have been connected and/or opened unexpectedly and implicitly in order to perform an MQINQ call. Check the attribute cross-reference table in the WebSphere MQ Using C++ book to determine whether any of your method invocations result in an MQINQ call.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

Programmer Response:
Include MQOO_INQUIRE in the ImqObject open options and set them earlier.

6105 (X'17D9')MQRC_CURSOR_NOT_VALID
Explanation:
The browse cursor for an open queue has been invalidated since it was last used by an implicit reopen.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

Programmer Response:
Set the ImqObject open options explicitly to cover all eventualities so that implicit reopening is not required.

6106 (X'17DA')MQRC_ENCODING_ERROR
Explanation:
The encoding of the (next) message item needs to be MQENC_NATIVE for pasting.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6107 (X'17DB')MQRC_STRUC_ID_ERROR
Explanation:
The structure id for the (next) message item, which is derived from the 4 characters beginning at the data pointer, is either missing or is inconsistent with the class of object into which the item is being pasted.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6108 (X'17DC')MQRC_NULL_POINTER
Explanation:
A null pointer has been supplied where a nonnull pointer is either required or implied.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6109 (X'17DD')MQRC_NO_CONNECTION_REFERENCE
Explanation:
The connection reference is null. A connection to an ImqQueueManager object is required.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6110 (X'17DE')MQRC_NO_BUFFER
Explanation:
No buffer is available. For an ImqCache object, one cannot be allocated, denoting an internal inconsistency in the object state that should not occur.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6111 (X'17DF')MQRC_BINARY_DATA_LENGTH_ERROR
Explanation:
The length of the binary data is inconsistent with the length of the target attribute. Zero is a correct length for all attributes. 

The correct length for an accounting token is MQ_ACCOUNTING_TOKEN_LENGTH. 
The correct length for an alternate security id is MQ_SECURITY_ID_LENGTH. 
The correct length for a correlation id is MQ_CORREL_ID_LENGTH. 
The correct length for a facility token is MQ_FACILITY_LENGTH. 
The correct length for a group id is MQ_GROUP_ID_LENGTH. 
The correct length for a message id is MQ_MSG_ID_LENGTH. 
The correct length for an instance id is MQ_OBJECT_INSTANCE_ID_LENGTH. 
The correct length for a transaction instance id is MQ_TRAN_INSTANCE_ID_LENGTH. 
The correct length for a message token is MQ_MSG_TOKEN_LENGTH.
This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6112 (X'17E0')MQRC_BUFFER_NOT_AUTOMATIC
Explanation:
A user-defined (and managed) buffer cannot be resized. A user-defined buffer can only be replaced or withdrawn. A buffer must be automatic (system-managed) before it can be resized.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

Programmer Response:
6113 (X'17E1')MQRC_INSUFFICIENT_BUFFER
Explanation:
There is insufficient buffer space available after the data pointer to accommodate the request. This might be because the buffer cannot be resized.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6114 (X'17E2')MQRC_INSUFFICIENT_DATA
Explanation:
There is insufficient data after the data pointer to accommodate the request.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6115 (X'17E3')MQRC_DATA_TRUNCATED
Explanation:
Data has been truncated when copying from one buffer to another. This might be because the target buffer cannot be resized, or because there is a problem addressing one or other buffer, or because a buffer is being downsized with a smaller replacement.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6116 (X'17E4')MQRC_ZERO_LENGTH
Explanation:
A zero length has been supplied where a positive length is either required or implied.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6117 (X'17E5')MQRC_NEGATIVE_LENGTH
Explanation:
A negative length has been supplied where a zero or positive length is required.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6118 (X'17E6')MQRC_NEGATIVE_OFFSET
Explanation:
A negative offset has been supplied where a zero or positive offset is required.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6119 (X'17E7')MQRC_INCONSISTENT_FORMAT
Explanation:
The format of the (next) message item is inconsistent with the class of object into which the item is being pasted.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6120 (X'17E8')MQRC_INCONSISTENT_OBJECT_STATE
Explanation:
There is an inconsistency between this object, which is open, and the referenced ImqQueueManager object, which is not connected.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6121 (X'17E9')MQRC_CONTEXT_OBJECT_NOT_VALID
Explanation:
The ImqPutMessageOptions context reference does not reference a valid ImqQueue object. The object has been previously destroyed.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6122 (X'17EA')MQRC_CONTEXT_OPEN_ERROR
Explanation:
The ImqPutMessageOptions context reference references an ImqQueue object that could not be opened to establish a context. This may be because the ImqQueue object has inappropriate open options. Inspect the referenced object reason code to establish the cause.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6123 (X'17EB')MQRC_STRUC_LENGTH_ERROR
Explanation:
The length of a data structure is inconsistent with its content. For an MQRMH, the length is insufficient to contain the fixed fields and all offset data.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6124 (X'17EC')MQRC_NOT_CONNECTED
Explanation:
A method failed because a required connection to a queue manager was not available, and a connection cannot be established implicitly because the IMQ_IMPL_CONN flag of the ImqQueueManager behavior class attribute is FALSE.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

Programmer Response:
Establish a connection to a queue manager and retry.

6125 (X'17ED')MQRC_NOT_OPEN
Explanation:
A method failed because an object was not open, and opening cannot be accomplished implicitly because the IMQ_IMPL_OPEN flag of the ImqObject behavior class attribute is FALSE.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

Programmer Response:
Open the object and retry.

6126 (X'17EE')MQRC_DISTRIBUTION_LIST_EMPTY
Explanation:
An ImqDistributionList failed to open because there are no ImqQueue objects referenced.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

Programmer Response:
Establish at least one ImqQueue object in which the distribution list reference addresses the ImqDistributionList object, and retry.

6127 (X'17EF')MQRC_INCONSISTENT_OPEN_OPTIONS
Explanation:
A method failed because the object is open, and the ImqObject open options are inconsistent with the required operation. The object cannot be reopened implicitly because the IMQ_IMPL_OPEN flag of the ImqObject behavior class attribute is false.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

Programmer Response:
Open the object with appropriate ImqObject open options and retry.

6128 (X'17FO')MQRC_WRONG_VERSION
Explanation:
A method failed because a version number specified or encountered is either incorrect or not supported.

For the ImqCICSBridgeHeader class, the problem is with the version attribute.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

Programmer Response:
If you are specifying a version number, use one that is supported by the class. If you are receiving message data from another program, ensure that both programs are using consistent and supported version numbers.

6129 (X'17F1')MQRC_REFERENCE_ERROR
Explanation:
An object reference is invalid.

There is a problem with the address of a referenced object. At the time of use, the address of the object is nonnull, but is invalid and cannot be used for its intended purpose.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the referenced object is neither deleted nor out of scope, or remove the reference by supplying a null address value.


Appendix A. API completion and reason codes:
============================================


The following is a list of the completion codes (MQCC) returned by WebSphere MQ 

0: Successful completion (MQCC_OK) 
The call completed fully; all output parameters have been set.

The Reason parameter always has the value MQRC_NONE in this case.

1: Warning (partial completion) (MQCC_WARNING) 
The call completed partially. Some output parameters might have been set in addition to the CompCode and Reason output parameters.

The Reason parameter gives additional information.

2: Call failed (MQCC_FAILED) 
The processing of the call did not complete, and the state of the queue manager is normally unchanged; exceptions are specifically noted. Only the CompCode and Reason output parameters have been set; all other parameters are unchanged.

The reason might be a fault in the application program, or it might be a result of some situation external to the program, for example the application's authority might have been revoked. The Reason parameter gives additional information.

Reason codes
The reason code parameter (Reason) is a qualification to the completion code parameter (CompCode). 

If there is no special reason to report, MQRC_NONE is returned. A successful call returns MQCC_OK and MQRC_NONE.

If the completion code is either MQCC_WARNING or MQCC_FAILED, the queue manager always reports a qualifying reason; details are given under each call description.

Where user exit routines set completion codes and reasons, they should adhere to these rules. In addition, any special reason values defined by user exits should be less than zero, to ensure that they do not conflict with values defined by the queue manager. Exits can set reasons already defined by the queue manager, where these are appropriate.

Reason codes also occur in: 

The Reason field of the MQDLH structure 
The Feedback field of the MQMD structure
Reason code list
The following is a list of reason codes, in numeric order, providing detailed information to help you understand them, including: 

An explanation of the circumstances that have caused the code to be raised 
The associated completion code 
Suggested programmer actions in response to the code
See Reason code cross reference for a list of reason codes in alphabetic order.

Codes in the range 3000 - 4999 (X'0BB8' - X'1387') are specific to PCF and are described in Appendix B. PCF reason codes.

0 (X'0000')MQRC_NONE
Explanation:
The call completed normally. The completion code (CompCode) is MQCC_OK.

Completion Code:
MQCC_OK

Programmer Response:
None.

900 (X'0384')MQRC_APPL_FIRST
Explanation:
This is the lowest value for an application-defined reason code returned by a data-conversion exit. Data-conversion exits can return reason codes in the range MQRC_APPL_FIRST through MQRC_APPL_LAST to indicate particular conditions that the exit has detected.

Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
As defined by the writer of the data-conversion exit.

999 (X'03E7')MQRC_APPL_LAST
Explanation:
This is the highest value for an application-defined reason code returned by a data-conversion exit. Data-conversion exits can return reason codes in the range MQRC_APPL_FIRST through MQRC_APPL_LAST to indicate particular conditions that the exit has detected.

Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
As defined by the writer of the data-conversion exit.

2001 (X'07D1')MQRC_ALIAS_BASE_Q_TYPE_ERROR
Explanation:
An MQOPEN or MQPUT1 call was issued specifying an alias queue as the destination, but the BaseQName in the alias queue definition resolves to a queue that is not a local queue, a local definition of a remote queue, or a cluster queue.

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the queue definitions.

2002 (X'07D2')MQRC_ALREADY_CONNECTED
Explanation:
An MQCONN or MQCONNX call was issued, but the application is already connected to the queue manager. 

On z/OS, this reason code occurs for batch and IMS applications only; it does not occur for CICS applications. 
On AIX, HP-UX, i5/OS, Solaris, Windows, this reason code occurs if the application attempts to create a nonshared handle when a nonshared handle already exists for the thread. A thread can have no more than one nonshared handle. 
On Windows, MTS objects do not receive this reason code, as additional connections to the queue manager are allowed.
Completion Code:
MQCC_WARNING

Programmer Response:
None. The Hconn parameter returned has the same value as was returned for the previous MQCONN or MQCONNX call.

An MQCONN or MQCONNX call that returns this reason code does not mean that an additional MQDISC call must be issued in order to disconnect from the queue manager. If this reason code is returned because the application has been called in a situation where the connect has already been done, a corresponding MQDISC should not be issued, because this will cause the application that issued the original MQCONN or MQCONNX call to be disconnected as well.

2003 (X'07D3')MQRC_BACKED_OUT
Explanation:
The current unit of work encountered a fatal error or was backed out. This occurs in the following cases: 

On an MQCMIT or MQDISC call, when the commit operation has failed and the unit of work has been backed out. All resources that participated in the unit of work have been returned to their state at the start of the unit of work. The MQCMIT or MQDISC call completes with MQCC_WARNING in this case. 
On z/OS, this reason code occurs only for batch applications.
On an MQGET, MQPUT, or MQPUT1 call that is operating within a unit of work, when the unit of work has already encountered an error that prevents the unit of work being committed (for example, when the log space is exhausted). The application must issue the appropriate call to back out the unit of work. (For a unit of work coordinated by the queue manager, this call is the MQBACK call, although the MQCMIT call has the same effect in these circumstances.) The MQGET, MQPUT, or MQPUT1 call completes with MQCC_FAILED in this case. 
On z/OS, this case does not occur.
Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
Check the returns from previous calls to the queue manager. For example, a previous MQPUT call may have failed.

2004 (X'07D4')MQRC_BUFFER_ERROR
Explanation:
The Buffer parameter is not valid for one of the following reasons: 

The parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.) 
The parameter pointer points to storage that cannot be accessed for the entire length specified by BufferLength. 
For calls where Buffer is an output parameter: the parameter pointer points to read-only storage.
Completion Code:
MQCC_FAILED

Programmer Response:
Correct the parameter.

2005 (X'07D5')MQRC_BUFFER_LENGTH_ERROR
Explanation:
The BufferLength parameter is not valid, or the parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

This reason can also be returned to an MQ client program on the MQCONN or MQCONNX call if the negotiated maximum message size for the channel is smaller than the fixed part of any call structure.

This reason should also be returned by the MQZ_ENUMERATE_AUTHORITY_DATA installable service component when the AuthorityBuffer parameter is too small to accommodate the data to be returned to the invoker of the service component.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a value that is zero or greater. For the mqAddString and mqSetString calls, the special value MQBL_NULL_TERMINATED is also valid.

2006 (X'07D6')MQRC_CHAR_ATTR_LENGTH_ERROR
Explanation:
CharAttrLength is negative (for MQINQ or MQSET calls), or is not large enough to hold all selected attributes (MQSET calls only). This reason also occurs if the parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a value large enough to hold the concatenated strings for all selected attributes.

2007 (X'07D7')MQRC_CHAR_ATTRS_ERROR
Explanation:
CharAttrs is not valid. The parameter pointer is not valid, or points to read-only storage for MQINQ calls or to storage that is not as long as implied by CharAttrLength. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the parameter.

2008 (X'07D8')MQRC_CHAR_ATTRS_TOO_SHORT
Explanation:
For MQINQ calls, CharAttrLength is not large enough to contain all of the character attributes for which MQCA_* selectors are specified in the Selectors parameter.

The call still completes, with the CharAttrs parameter string filled in with as many character attributes as there is room for. Only complete attribute strings are returned: if there is insufficient space remaining to accommodate an attribute in its entirety, that attribute and subsequent character attributes are omitted. Any space at the end of the string not used to hold an attribute is unchanged.

An attribute that represents a set of values (for example, the namelist Names attribute) is treated as a single entity--either all of its values are returned, or none.

Completion Code:
MQCC_WARNING

Programmer Response:
Specify a large enough value, unless only a subset of the values is needed.

2009 (X'07D9')MQRC_CONNECTION_BROKEN
Explanation:
Connection to the queue manager has been lost. This can occur because the queue manager has ended. If the call is an MQGET call with the MQGMO_WAIT option, the wait has been canceled. All connection and object handles are now invalid.

For MQ client applications, it is possible that the call did complete successfully, even though this reason code is returned with a CompCode of MQCC_FAILED.

Completion Code:
MQCC_FAILED

Programmer Response:
Applications can attempt to reconnect to the queue manager by issuing the MQCONN or MQCONNX call. It may be necessary to poll until a successful response is received. 

On z/OS for CICS applications, it is not necessary to issue the MQCONN or MQCONNX call, because CICS applications are connected automatically.
Any uncommitted changes in a unit of work should be backed out. A unit of work that is coordinated by the queue manager is backed out automatically.

2010 (X'07DA')MQRC_DATA_LENGTH_ERROR
Explanation:
The DataLength parameter is not valid. Either the parameter pointer is not valid, or it points to read-only storage. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

This reason can also be returned to an MQ client program on the MQGET, MQPUT, or MQPUT1 call, if the BufferLength parameter exceeds the maximum message size that was negotiated for the client channel.

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the parameter.

If the error occurs for an MQ client program, also check that the maximum message size for the channel is big enough to accommodate the message being sent; if it is not big enough, increase the maximum message size for the channel.

2011 (X'07DB')MQRC_DYNAMIC_Q_NAME_ERROR
Explanation:
On the MQOPEN call, a model queue is specified in the ObjectName field of the ObjDesc parameter, but the DynamicQName field is not valid, for one of the following reasons: 

DynamicQName is completely blank (or blank up to the first null character in the field). 
Characters are present that are not valid for a queue name. 
An asterisk is present beyond the 33rd position (and before any null character). 
An asterisk is present followed by characters that are not null and not blank.
This reason code can also sometimes occur when a server application opens the reply queue specified by the ReplyToQ and ReplyToQMgr fields in the MQMD of a message that the server has just received. In this case the reason code indicates that the application that sent the original message placed incorrect values into the ReplyToQ and ReplyToQMgr fields in the MQMD of the original message.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a valid name.

2012 (X'07DC')MQRC_ENVIRONMENT_ERROR
Explanation:
The call is not valid for the current environment. 

On z/OS, one of the following applies: 
An MQCONN or MQCONNX call was issued, but the application has been linked with an adapter that is not supported in the environment in which the application is running. For example, this can arise when the application is linked with the MQ RRS adapter, but the application is running in a DB2 Stored Procedure address space. RRS is not supported in this environment. Stored Procedures wishing to use the MQ RRS adapter must run in a DB2 WLM-managed Stored Procedure address space. 
An MQCMIT or MQBACK call was issued, but the application has been linked with the RRS batch adapter CSQBRSTB. This adapter does not support the MQCMIT and MQBACK calls. 
An MQCMIT or MQBACK call was issued in the CICS or IMS environment. 
The RRS subsystem is not up and running on the z/OS system that ran the application.
On HP OpenVMS, OS/2, i5/OS, Compaq NonStop Kernel, UNIX systems, and Windows, one of the following applies: 
The application is linked to the wrong libraries (threaded or nonthreaded). 
An MQBEGIN, MQCMIT, or MQBACK call was issued, but an external unit-of-work manager is in use. For example, this reason code occurs on Windows when an MTS object is running as a DTC transaction. This reason code also occurs if the queue manager does not support units of work. 
The MQBEGIN call was issued in an MQ client environment. 
An MQXCLWLN call was issued, but the call did not originate from a cluster workload exit.
Completion Code:
MQCC_FAILED

Programmer Response:
Do one of the following (as appropriate): 

On z/OS: 
Link the application with the correct adapter. 
Modify the application to use the SRRCMIT and SRRBACK calls in place of the MQCMIT and MQBACK calls. Alternatively, link the application with the RRS batch adapter CSQBRRSI. This adapter supports MQCMIT and MQBACK in addition to SRRCMIT and SRRBACK. 
For a CICS or IMS application, issue the appropriate CICS or IMS call to commit or backout the unit of work. 
Start the RRS subsystem on the z/OS system that is running the application.
In the other environments: 
Link the application with the correct libraries (threaded or nonthreaded). 
Remove from the application the call that is not supported.
2013 (X'07DD')MQRC_EXPIRY_ERROR
Explanation:
On an MQPUT or MQPUT1 call, the value specified for the Expiry field in the message descriptor MQMD is not valid.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a value that is greater than zero, or the special value MQEI_UNLIMITED.

2014 (X'07DE')MQRC_FEEDBACK_ERROR
Explanation:
On an MQPUT or MQPUT1 call, the value specified for the Feedback field in the message descriptor MQMD is not valid. The value is not MQFB_NONE, and is outside both the range defined for system feedback codes and the range defined for application feedback codes.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify MQFB_NONE, or a value in the range MQFB_SYSTEM_FIRST through MQFB_SYSTEM_LAST, or MQFB_APPL_FIRST through MQFB_APPL_LAST.

2016 (X'07E0')MQRC_GET_INHIBITED
Explanation:
MQGET calls are currently inhibited for the queue, or for the queue to which this queue resolves.

Completion Code:
MQCC_FAILED

Programmer Response:
If the system design allows get requests to be inhibited for short periods, retry the operation later.

2017 (X'07E1')MQRC_HANDLE_NOT_AVAILABLE
Explanation:
An MQOPEN or MQPUT1 call was issued, but the maximum number of open handles allowed for the current task has already been reached. Be aware that when a distribution list is specified on the MQOPEN or MQPUT1 call, each queue in the distribution list uses one handle. 

On z/OS, "task" means a CICS task, a z/OS task, or an IMS-dependent region.
Completion Code:
MQCC_FAILED

Programmer Response:
Check whether the application is issuing MQOPEN calls without corresponding MQCLOSE calls. If it is, modify the application to issue the MQCLOSE call for each open object as soon as that object is no longer needed.

Also check whether the application is specifying a distribution list containing a large number of queues that are consuming all of the available handles. If it is, increase the maximum number of handles that the task can use, or reduce the size of the distribution list. The maximum number of open handles that a task can use is given by the MaxHandles queue manager attribute.

2018 (X'07E2')MQRC_HCONN_ERROR
Explanation:
The connection handle Hconn is not valid, for one of the following reasons: 

The parameter pointer is not valid, or (for the MQCONN or MQCONNX call) points to read-only storage. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.) 
The value specified was not returned by a preceding MQCONN or MQCONNX call. 
The value specified has been made invalid by a preceding MQDISC call. 
The handle is a shared handle that has been made invalid by another thread issuing the MQDISC call. 
The handle is a shared handle that is being used on the MQBEGIN call (only nonshared handles are valid on MQBEGIN). 
The handle is a nonshared handle that is being used a thread that did not create the handle. 
The call was issued in the MTS environment in a situation where the handle is not valid (for example, passing the handle between processes or packages; note that passing the handle between library packages is supported).
Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that a successful MQCONN or MQCONNX call is performed for the queue manager, and that an MQDISC call has not already been performed for it. Ensure that the handle is being used within its valid scope (see the description of MQCONN in the WebSphere MQ Application Programming Guide). 

On z/OS, also check that the application has been linked with the correct stub; this is CSQCSTUB for CICS applications, CSQBSTUB for batch applications, and CSQQSTUB for IMS applications. Also, the stub used must not belong to a release of the queue manager that is more recent than the release on which the application will run.
2019 (X'07E3')MQRC_HOBJ_ERROR
Explanation:
The object handle Hobj is not valid, for one of the following reasons: 

The parameter pointer is not valid, or (for the MQOPEN call) points to read-only storage. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.) 
The value specified was not returned by a preceding MQOPEN call. 
The value specified has been made invalid by a preceding MQCLOSE call. 
The handle is a shared handle that has been made invalid by another thread issuing the MQCLOSE call. 
The handle is a nonshared handle that is being used by a thread that did not create the handle. 
The call is MQGET or MQPUT, but the object represented by the handle is not a queue.
Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that a successful MQOPEN call is performed for this object, and that an MQCLOSE call has not already been performed for it. Ensure that the handle is being used within its valid scope (see the description of MQOPEN in the WebSphere MQ Application Programming Guide).

2020 (X'07E4')MQRC_INHIBIT_VALUE_ERROR
Explanation:
On an MQSET call, the value specified for either the MQIA_INHIBIT_GET attribute or the MQIA_INHIBIT_PUT attribute is not valid.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a valid value for the InhibitGet or InhibitPut queu attribute.

2021 (X'07E5')MQRC_INT_ATTR_COUNT_ERROR
Explanation:
On an MQINQ or MQSET call, the IntAttrCount parameter is negative (MQINQ or MQSET), or smaller than the number of integer attribute selectors (MQIA_*) specified in the Selectors parameter (MQSET only). This reason also occurs if the parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a value large enough for all selected integer attributes.

2022 (X'07E6')MQRC_INT_ATTR_COUNT_TOO_SMALL
Explanation:
On an MQINQ call, the IntAttrCount parameter is smaller than the number of integer attribute selectors (MQIA_*) specified in the Selectors parameter.

The call completes with MQCC_WARNING, with the IntAttrs array filled in with as many integer attributes as there is room for.

Completion Code:
MQCC_WARNING

Programmer Response:
Specify a large enough value, unless only a subset of the values is needed.

2023 (X'07E7')MQRC_INT_ATTRS_ARRAY_ERROR
Explanation:
On an MQINQ or MQSET call, the IntAttrs parameter is not valid. The parameter pointer is not valid (MQINQ and MQSET), or points to read-only storage or to storage that is not as long as indicated by the IntAttrCount parameter (MQINQ only). (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the parameter.

2024 (X'07E8')MQRC_SYNCPOINT_LIMIT_REACHED
Explanation:
An MQGET, MQPUT, or MQPUT1 call failed because it would have caused the number of uncommitted messages in the current unit of work to exceed the limit defined for the queue manager (see the MaxUncommittedMsgs queue-manager attribute). The number of uncommitted messages is the sum of the following since the start of the current unit of work: 

Messages put by the application with the MQPMO_SYNCPOINT option 
Messages retrieved by the application with the MQGMO_SYNCPOINT option 
Trigger messages and COA report messages generated by the queue manager for messages put with the MQPMO_SYNCPOINT option 
COD report messages generated by the queue manager for messages retrieved with the MQGMO_SYNCPOINT option 
On Compaq NonStop Kernel, this reason code occurs when the maximum number of I/O operations in a single TM/MP transaction has been exceeded.
Completion Code:
MQCC_FAILED

Programmer Response:
Check whether the application is looping. If it is not, consider reducing the complexity of the application. Alternatively, increase the queue-manager limit for the maximum number of uncommitted messages within a unit of work. 

On z/OS, the limit for the maximum number of uncommitted messages can be changed by using the ALTER QMGR command. 
On i5/OS, the limit for the maximum number of uncommitted messages can be changed by using the CHGMQM command. 
On Compaq NonStop Kernel, the application should cancel the transaction and retry with a smaller number of operations in the unit of work. See the MQSeries for Tandem NonStop Kernel System Management Guide for more details.
2025 (X'07E9')MQRC_MAX_CONNS_LIMIT_REACHED
Explanation:
The MQCONN or MQCONNX call was rejected because the maximum number of concurrent connections has been exceeded. 

On z/OS, connection limits are applicable only to TSO and batch requests. The limits are determined by the customer using the following parameters of the CSQ6SYSP macro: 
For TSO: IDFORE 
For batch: IDBACK
For more information, see the WebSphere MQ for z/OS System Setup Guide.

On HP OpenVMS, OS/2, i5/OS, Compaq NonStop Kernel, UNIX systems, and Windows, this reason code can also occur on the MQOPEN call. 
When using Java applications, a limit to the number of concurrent connections may be defined by the connection manager.
Completion Code:
MQCC_FAILED

Programmer Response:
Either increase the size of the appropriate parameter value, or reduce the number of concurrent connections.

2026 (X'07EA')MQRC_MD_ERROR
Explanation:
The MQMD structure is not valid, for one of the following reasons: 

The StrucId field is not MQMD_STRUC_ID. 
The Version field specifies a value that is not valid or not supported. 
The parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.) 
The queue manager cannot copy the changed structure to application storage, even though the call is successful. This can occur, for example, if the pointer points to read-only storage.
Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that input fields in the MQMD structure are set correctly.

2027 (X'07EB')MQRC_MISSING_REPLY_TO_Q
Explanation:
On an MQPUT or MQPUT1 call, the ReplyToQ field in the message descriptor MQMD is blank, but one or both of the following is true: 

A reply was requested (that is, MQMT_REQUEST was specified in the MsgType field of the message descriptor). 
A report message was requested in the Report field of the message descriptor.
Completion Code:
MQCC_FAILED

Programmer Response:
Specify the name of the queue to which the reply message or report message is to be sent.

2029 (X'07ED')MQRC_MSG_TYPE_ERROR
Explanation:
Either: 

On an MQPUT or MQPUT1 call, the value specified for the MsgType field in the message descriptor (MQMD) is not valid. 
A message processing program received a message that does not have the expected message type. For example, if the WebSphere MQ command server receives a message which is not a request message (MQMT_REQUEST) then it rejects the request with this reason code.
Completion Code:
MQCC_FAILED

Programmer Response:
Specify a valid value for the MsgType field. In the case where a request is rejected by a message processing program, refer to the documentation for that program for details of the message types that it supports.

2030 (X'07EE')MQRC_MSG_TOO_BIG_FOR_Q
Explanation:
An MQPUT or MQPUT1 call was issued to put a message on a queue, but the message was too long for the queue and MQMF_SEGMENTATION_ALLOWED was not specified in the MsgFlags field in MQMD. If segmentation is not allowed, the length of the message cannot exceed the lesser of the queue MaxMsgLength attribute and queue-manager MaxMsgLength attribute. 

On z/OS, the queue manager does not support the segmentation of messages; if MQMF_SEGMENTATION_ALLOWED is specified, it is accepted but ignored.
This reason code can also occur when MQMF_SEGMENTATION_ALLOWED is specified, but the nature of the data present in the message prevents the queue manager splitting it into segments that are small enough to place on the queue: 

For a user-defined format, the smallest segment that the queue manager can create is 16 bytes. 
For a built-in format, the smallest segment that the queue manager can create depends on the particular format, but is greater than 16 bytes in all cases other than MQFMT_STRING (for MQFMT_STRING the minimum segment size is 16 bytes).
MQRC_MSG_TOO_BIG_FOR_Q can also occur in the Feedback field in the message descriptor of a report message; in this case it indicates that the error was encountered by a message channel agent when it attempted to put the message on a remote queue.

Completion Code:
MQCC_FAILED

Programmer Response:
Check whether the BufferLength parameter is specified correctly; if it is, do one of the following: 

Increase the value of the queue's MaxMsgLength attribute; the queue-manager's MaxMsgLength attribute may also need increasing. 
Break the message into several smaller messages. 
Specify MQMF_SEGMENTATION_ALLOWED in the MsgFlags field in MQMD; this will allow the queue manager to break the message into segments.
2031 (X'07EF')MQRC_MSG_TOO_BIG_FOR_Q_MGR
Explanation:
An MQPUT or MQPUT1 call was issued to put a message on a queue, but the message was too long for the queue manager and MQMF_SEGMENTATION_ALLOWED was not specified in the MsgFlags field in MQMD. If segmentation is not allowed, the length of the message cannot exceed the lesser of the queue-manager MaxMsgLength attribute and queue MaxMsgLength attribute.

This reason code can also occur when MQMF_SEGMENTATION_ALLOWED is specified, but the nature of the data present in the message prevents the queue manager splitting it into segments that are small enough for the queue-manager limit: 

For a user-defined format, the smallest segment that the queue manager can create is 16 bytes. 
For a built-in format, the smallest segment that the queue manager can create depends on the particular format, but is greater than 16 bytes in all cases other than MQFMT_STRING (for MQFMT_STRING the minimum segment size is 16 bytes).
MQRC_MSG_TOO_BIG_FOR_Q_MGR can also occur in the Feedback field in the message descriptor of a report message; in this case it indicates that the error was encountered by a message channel agent when it attempted to put the message on a remote queue.

This reason also occurs if a channel, through which the message is to pass, has restricted the maximum message length to a value that is actually less than that supported by the queue manager, and the message length is greater than this value. 

On z/OS, this return code is issued only if you are using CICS for distributed queuing. Otherwise, MQRC_MSG_TOO_BIG_FOR_CHANNEL is issued.
Completion Code:
MQCC_FAILED

Programmer Response:
Check whether the BufferLength parameter is specified correctly; if it is, do one of the following: 

Increase the value of the queue-manager's MaxMsgLength attribute; the queue's MaxMsgLength attribute may also need increasing. 
Break the message into several smaller messages. 
Specify MQMF_SEGMENTATION_ALLOWED in the MsgFlags field in MQMD; this will allow the queue manager to break the message into segments. 
Check the channel definitions.
2033 (X'07F1')MQRC_NO_MSG_AVAILABLE
Explanation:
An MQGET call was issued, but there is no message on the queue satisfying the selection criteria specified in MQMD (the MsgId and CorrelId fields), and in MQGMO (the Options and MatchOptions fields). Either the MQGMO_WAIT option was not specified, or the time interval specified by the WaitInterval field in MQGMO has expired. This reason is also returned for an MQGET call for browse, when the end of the queue has been reached.

This reason code can also be returned by the mqGetBag and mqExecute calls. mqGetBag is similar to MQGET. For the mqExecute call, the completion code can be either MQCC_WARNING or MQCC_FAILED: 

If the completion code is MQCC_WARNING, some response messages were received during the specified wait interval, but not all. The response bag contains system-generated nested bags for the messages that were received. 
If the completion code is MQCC_FAILED, no response messages were received during the specified wait interval.
Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
If this is an expected condition, no corrective action is required.

If this is an unexpected condition, check that: 

The message was put on the queue successfully. 
The unit of work (if any) used for the MQPUT or MQPUT1 call was committed successfully. 
The options controlling the selection criteria are specified correctly. All of the following can affect the eligibility of a message for return on the MQGET call: 
MQGMO_LOGICAL_ORDER 
MQGMO_ALL_MSGS_AVAILABLE 
MQGMO_ALL_SEGMENTS_AVAILABLE 
MQGMO_COMPLETE_MSG 
MQMO_MATCH_MSG_ID 
MQMO_MATCH_CORREL_ID 
MQMO_MATCH_GROUP_ID 
MQMO_MATCH_MSG_SEQ_NUMBER 
MQMO_MATCH_OFFSET 
Value of MsgId field in MQMD 
Value of CorrelId field in MQMD
Consider waiting longer for the message.

2034 (X'07F2')MQRC_NO_MSG_UNDER_CURSOR
Explanation:
An MQGET call was issued with either the MQGMO_MSG_UNDER_CURSOR or the MQGMO_BROWSE_MSG_UNDER_CURSOR option. However, the browse cursor is not positioned at a retrievable message. This is caused by one of the following: 

The cursor is positioned logically before the first message (as it is before the first MQGET call with a browse option has been successfully performed). 
The message the browse cursor was positioned on has been locked or removed from the queue (probably by some other application) since the browse operation was performed. 
The message the browse cursor was positioned on has expired.
Completion Code:
MQCC_FAILED

Programmer Response:
Check the application logic. This may be an expected reason if the application design allows multiple servers to compete for messages after browsing. Consider also using the MQGMO_LOCK option with the preceding browse MQGET call.

2035 (X'07F3')MQRC_NOT_AUTHORIZED
Explanation:
The user is not authorized to perform the operation attempted: 

On an MQCONN or MQCONNX call, the user is not authorized to connect to the queue manager. 
On z/OS, for CICS applications, MQRC_CONNECTION_NOT_AUTHORIZED is issued instead.
On an MQOPEN or MQPUT1 call, the user is not authorized to open the object for the option(s) specified. 
On z/OS, if the object being opened is a model queue, this reason also arises if the user is not authorized to create a dynamic queue with the required name.
On an MQCLOSE call, the user is not authorized to delete the object, which is a permanent dynamic queue, and the Hobj parameter specified on the MQCLOSE call is not the handle returned by the MQOPEN call that created the queue. 
On a command, the user is not authorized to issue the command, or to access the object it specifies.
This reason code can also occur in the Feedback field in the message descriptor of a report message; in this case it indicates that the error was encountered by a message channel agent when it attempted to put the message on a remote queue.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the correct queue manager or object was specified, and that appropriate authority exists.

2036 (X'07F4')MQRC_NOT_OPEN_FOR_BROWSE
Explanation:
An MQGET call was issued with one of the following options: 

MQGMO_BROWSE_FIRST 
MQGMO_BROWSE_NEXT 
MQGMO_BROWSE_MSG_UNDER_CURSOR 
MQGMO_MSG_UNDER_CURSOR
but the queue had not been opened for browse.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify MQOO_BROWSE when the queue is opened.

2037 (X'07F5')MQRC_NOT_OPEN_FOR_INPUT
Explanation:
An MQGET call was issued to retrieve a message from a queue, but the queue had not been opened for input.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify one of the following when the queue is opened: 

MQOO_INPUT_SHARED 
MQOO_INPUT_EXCLUSIVE 
MQOO_INPUT_AS_Q_DEF
2038 (X'07F6')MQRC_NOT_OPEN_FOR_INQUIRE
Explanation:
An MQINQ call was issued to inquire object attributes, but the object had not been opened for inquire.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify MQOO_INQUIRE when the object is opened.

2039 (X'07F7')MQRC_NOT_OPEN_FOR_OUTPUT
Explanation:
An MQPUT call was issued to put a message on a queue, but the queue had not been opened for output.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify MQOO_OUTPUT when the queue is opened.

2040 (X'07F8')MQRC_NOT_OPEN_FOR_SET
Explanation:
An MQSET call was issued to set queue attributes, but the queue had not been opened for set.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify MQOO_SET when the object is opened.

2041 (X'07F9')MQRC_OBJECT_CHANGED
Explanation:
Object definitions that affect this object have been changed since the Hobj handle used on this call was returned by the MQOPEN call. See the description of MQOPEN in the WebSphere MQ Application Programming Guide for more information.

This reason does not occur if the object handle is specified in the Context field of the PutMsgOpts parameter on the MQPUT or MQPUT1 call.

Completion Code:
MQCC_FAILED

Programmer Response:
Issue an MQCLOSE call to return the handle to the system. It is then usually sufficient to reopen the object and retry the operation. However, if the object definitions are critical to the application logic, an MQINQ call can be used after reopening the object, to obtain the new values of the object attributes.

2042 (X'07FA')MQRC_OBJECT_IN_USE
Explanation:
An MQOPEN call was issued, but the object in question has already been opened by this or another application with options that conflict with those specified in the Options parameter. This arises if the request is for shared input, but the object is already open for exclusive input; it also arises if the request is for exclusive input, but the object is already open for input (of any sort).

MCAs for receiver channels, or the intra-group queuing agent (IGQ agent), may keep the destination queues open even when messages are not being transmitted; this results in the queues appearing to be "in use". Use the MQSC command DISPLAY QSTATUS to find out who is keeping the queue open. 

On z/OS, this reason can also occur for an MQOPEN or MQPUT1 call, if the object to be opened (which can be a queue, or for MQOPEN a namelist or process object) is in the process of being deleted.
Completion Code:
MQCC_FAILED

Programmer Response:
System design should specify whether an application is to wait and retry, or take other action.

2043 (X'07FB')MQRC_OBJECT_TYPE_ERROR
Explanation:
On the MQOPEN or MQPUT1 call, the ObjectType field in the object descriptor MQOD specifies a value that is not valid. For the MQPUT1 call, the object type must be MQOT_Q.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a valid object type.

2044 (X'07FC')MQRC_OD_ERROR
Explanation:
On the MQOPEN or MQPUT1 call, the object descriptor MQOD is not valid, for one of the following reasons: 

The StrucId field is not MQOD_STRUC_ID. 
The Version field specifies a value that is not valid or not supported. 
The parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.) 
The queue manager cannot copy the changed structure to application storage, even though the call is successful. This can occur, for example, if the pointer points to read-only storage.
Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that input fields in the MQOD structure are set correctly.

2045 (X'07FD')MQRC_OPTION_NOT_VALID_FOR_TYPE
Explanation:
On an MQOPEN or MQCLOSE call, an option is specified that is not valid for the type of object or queue being opened or closed.

For the MQOPEN call, this includes the following cases: 

An option that is inappropriate for the object type (for example, MQOO_OUTPUT for an MQOT_PROCESS object). 
An option that is unsupported for the queue type (for example, MQOO_INQUIRE for a remote queue that has no local definition). 
One or more of the following options: 
MQOO_INPUT_AS_Q_DEF 
MQOO_INPUT_SHARED 
MQOO_INPUT_EXCLUSIVE 
MQOO_BROWSE 
MQOO_INQUIRE 
MQOO_SET
when either: 
the queue name is resolved through a cell directory, or 
ObjectQMgrName in the object descriptor specifies the name of a local definition of a remote queue (in order to specify a queue-manager alias), and the queue named in the RemoteQMgrName attribute of the definition is the name of the local queue manager.
For the MQCLOSE call, this includes the following case: 

The MQCO_DELETE or MQCO_DELETE_PURGE option when the queue is not a dynamic queue.
This reason code can also occur on the MQOPEN call when the object being opened is of type MQOT_NAMELIST, MQOT_PROCESS, or MQOT_Q_MGR, but the ObjectQMgrName field in MQOD is neither blank nor the name of the local queue manager.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify the correct option. For the MQOPEN call, ensure that the ObjectQMgrName field is set correctly. For the MQCLOSE call, either correct the option or change the definition type of the model queue that is used to create the new queue.

2046 (X'07FE')MQRC_OPTIONS_ERROR
Explanation:
The Options parameter or field contains options that are not valid, or a combination of options that is not valid. 

For the MQOPEN, MQCLOSE, MQXCNVC, mqBagToBuffer, mqBufferToBag, mqCreateBag, and mqExecute calls, Options is a separate parameter on the call. 
This reason also occurs if the parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

For the MQBEGIN, MQCONNX, MQGET, MQPUT, and MQPUT1 calls, Options is a field in the relevant options structure (MQBO, MQCNO, MQGMO, or MQPMO).
Completion Code:
MQCC_FAILED

Programmer Response:
Specify valid options. Check the description of the Options parameter or field to determine which options and combinations of options are valid. If multiple options are being set by adding the individual options together, ensure that the same option is not added twice.

2047 (X'07FF')MQRC_PERSISTENCE_ERROR
Explanation:
On an MQPUT or MQPUT1 call, the value specified for the Persistence field in the message descriptor MQMD is not valid.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify one of the following values: 

MQPER_PERSISTENT 
MQPER_NOT_PERSISTENT 
MQPER_PERSISTENCE_AS_Q_DEF
2048 (X'0800')MQRC_PERSISTENT_NOT_ALLOWED
Explanation:
On an MQPUT or MQPUT1 call, the value specified for the Persistence field in MQMD (or obtained from the DefPersistence queue attribute) specifies MQPER_PERSISTENT, but the queue on which the message is being placed does not support persistent messages. Persistent messages cannot be placed on temporary dynamic queues.

This reason code can also occur in the Feedback field in the message descriptor of a report message; in this case it indicates that the error was encountered by a message channel agent when it attempted to put the message on a remote queue.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify MQPER_NOT_PERSISTENT if the message is to be placed on a temporary dynamic queue. If persistence is required, use a permanent dynamic queue or predefined queue in place of a temporary dynamic queue.

Be aware that server applications are recommended to send reply messages (message type MQMT_REPLY) with the same persistence as the original request message (message type MQMT_REQUEST). If the request message is persistent, the reply queue specified in the ReplyToQ field in the message descriptor MQMD cannot be a temporary dynamic queue. Use a permanent dynamic queue or predefined queue as the reply queue in this situation.

2049 (X'0801')MQRC_PRIORITY_EXCEEDS_MAXIMUM
Explanation:
An MQPUT or MQPUT1 call was issued, but the value of the Priority field in the message descriptor MQMD exceeds the maximum priority supported by the local queue manager, as shown by the MaxPriority queue-manager attribute. The message is accepted by the queue manager, but is placed on the queue at the queue manager's maximum priority. The Priority field in the message descriptor retains the value specified by the application that put the message.

Completion Code:
MQCC_WARNING

Programmer Response:
None required, unless this reason code was not expected by the application that put the message.

2050 (X'0802')MQRC_PRIORITY_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the value of the Priority field in the message descriptor MQMD is not valid. The maximum priority supported by the queue manager is given by the MaxPriority queue-manager attribute.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a value in the range zero through MaxPriority, or the special value MQPRI_PRIORITY_AS_Q_DEF.

2051 (X'0803')MQRC_PUT_INHIBITED
Explanation:
MQPUT and MQPUT1 calls are currently inhibited for the queue, or for the queue to which this queue resolves.

This reason code can also occur in the Feedback field in the message descriptor of a report message; in this case it indicates that the error was encountered by a message channel agent when it attempted to put the message on a remote queue.

Completion Code:
MQCC_FAILED

Programmer Response:
If the system design allows put requests to be inhibited for short periods, retry the operation later.

2052 (X'0804')MQRC_Q_DELETED
Explanation:
An Hobj queue handle specified on a call refers to a dynamic queue that has been deleted since the queue was opened. (See the description of MQCLOSE in the WebSphere MQ Application Programming Guide for information about the deletion of dynamic queues.) 

On z/OS, this can also occur with the MQOPEN and MQPUT1 calls if a dynamic queue is being opened, but the queue is in a logically-deleted state. See MQCLOSE for more information about this.
Completion Code:
MQCC_FAILED

Programmer Response:
Issue an MQCLOSE call to return the handle and associated resources to the system (the MQCLOSE call will succeed in this case). Check the design of the application that caused the error.

2053 (X'0805')MQRC_Q_FULL
Explanation:
An MQPUT or MQPUT1 call, or a command, failed because the queue is full, that is, it already contains the maximum number of messages possible, as specified by the MaxQDepth queue attribute.

This reason code can also occur in the Feedback field in the message descriptor of a report message; in this case it indicates that the error was encountered by a message channel agent when it attempted to put the message on a remote queue.

Completion Code:
MQCC_FAILED

Programmer Response:
Retry the operation later. Consider increasing the maximum depth for this queue, or arranging for more instances of the application to service the queue.

2055 (X'0807')MQRC_Q_NOT_EMPTY
Explanation:
An MQCLOSE call was issued for a permanent dynamic queue, but the call failed because the queue is not empty or still in use. One of the following applies: 

The MQCO_DELETE option was specified, but there are messages on the queue. 
The MQCO_DELETE or MQCO_DELETE_PURGE option was specified, but there are uncommitted get or put calls outstanding against the queue.
See the usage notes pertaining to dynamic queues for the MQCLOSE call for more information.

This reason code is also returned from a command to clear or delete or move a queue, if the queue contains uncommitted messages (or committed messages in the case of delete queue without the purge option).

Completion Code:
MQCC_FAILED

Programmer Response:
Check why there might be messages on the queue. Be aware that the CurrentQDepth queue attribute might be zero even though there are one or more messages on the queue; this can happen if the messages have been retrieved as part of a unit of work that has not yet been committed. If the messages can be discarded, try using the MQCLOSE call with the MQCO_DELETE_PURGE option. Consider retrying the call later.

2056 (X'0808')MQRC_Q_SPACE_NOT_AVAILABLE
Explanation:
An MQPUT or MQPUT1 call was issued, but there is no space available for the queue on disk or other storage device.

This reason code can also occur in the Feedback field in the message descriptor of a report message; in this case it indicates that the error was encountered by a message channel agent when it attempted to put the message on a remote queue. 

On z/OS, this reason code does not occur.
Completion Code:
MQCC_FAILED

Programmer Response:
Check whether an application is putting messages in an infinite loop. If not, make more disk space available for the queue.

2057 (X'0809')MQRC_Q_TYPE_ERROR
Explanation:
One of the following occurred: 

On an MQOPEN call, the ObjectQMgrName field in the object descriptor MQOD or object record MQOR specifies the name of a local definition of a remote queue (in order to specify a queue-manager alias), and in that local definition the RemoteQMgrName attribute is the name of the local queue manager. However, the ObjectName field in MQOD or MQOR specifies the name of a model queue on the local queue manager; this is not allowed. See the WebSphere MQ Application Programming Guide for more information. 
On an MQPUT1 call, the object descriptor MQOD or object record MQOR specifies the name of a model queue. 
On a previous MQPUT or MQPUT1 call, the ReplyToQ field in the message descriptor specified the name of a model queue, but a model queue cannot be specified as the destination for reply or report messages. Only the name of a predefined queue, or the name of the dynamic queue created from the model queue, can be specified as the destination. In this situation the reason code MQRC_Q_TYPE_ERROR is returned in the Reason field of the MQDLH structure when the reply message or report message is placed on the dead-letter queue.
Completion Code:
MQCC_FAILED

Programmer Response:
Specify a valid queue.

2058 (X'080A')MQRC_Q_MGR_NAME_ERROR
Explanation:
On an MQCONN or MQCONNX call, the value specified for the QMgrName parameter is not valid or not known. This reason also occurs if the parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.) 

On z/OS for CICS applications, this reason can occur on any call if the original connect specified an incorrect or unrecognized name.
This reason code can also occur if an MQ client application attempts to connect to a queue manager within an MQ-client queue-manager group (see the QMgrName parameter of MQCONN), and either: 

Queue-manager groups are not supported. 
There is no queue-manager group with the specified name.
Completion Code:
MQCC_FAILED

Programmer Response:
Use an all-blank name if possible, or verify that the name used is valid.

2059 (X'080B')MQRC_Q_MGR_NOT_AVAILABLE
Explanation:
This occurs: 

On an MQCONN or MQCONNX call, the queue manager identified by the QMgrName parameter is not available for connection. 
On z/OS: 
For batch applications, this reason can be returned to applications running in LPARs that do not have a queue manager installed. 
For CICS applications, this reason can occur on any call if the original connect specified a queue manager whose name was recognized, but which is not available.
On i5/OS, this reason can also be returned by the MQOPEN and MQPUT1 calls, when MQHC_DEF_HCONN is specified for the Hconn parameter by an application running in compatibility mode.
On an MQCONN or MQCONNX call from an MQ client application: 
Attempting to connect to a queue manager within an MQ-client queue-manager group when none of the queue managers in the group is available for connection (see the QMgrName parameter of the MQCONN call). 
If there is an error with the client-connection or the corresponding server-connection channel definitions. 
On z/OS, if the optional OS/390 Client Attachment feature has not been installed.
If a command uses the CommandScope parameter specfying a queue manager that is not active in the queue-sharing group.
Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the queue manager has been started. If the connection is from a client application, check the channel definitions.

2061 (X'080D')MQRC_REPORT_OPTIONS_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the Report field in the message descriptor MQMD contains one or more options that are not recognized by the local queue manager. The options that cause this reason code to be returned depend on the destination of the message; see the description of REPORT in the WebSphere MQ Application Programming Guide for more details.

This reason code can also occur in the Feedback field in the MQMD of a report message, or in the Reason field in the MQDLH structure of a message on the dead-letter queue; in both cases it indicates that the destination queue manager does not support one or more of the report options specified by the sender of the message.

Completion Code:
MQCC_FAILED

Programmer Response:
Do the following: 

Ensure that the Report field in the message descriptor is initialized with a value when the message descriptor is declared, or is assigned a value prior to the MQPUT or MQPUT1 call. Specify MQRO_NONE if no report options are required. 
Ensure that the report options specified are valid; see the Report field described in the description of MQMD in the WebSphere MQ Application Programming Guide for valid report options. 
If multiple report options are being set by adding the individual report options together, ensure that the same report option is not added twice. 
Check that conflicting report options are not specified. For example, do not add both MQRO_EXCEPTION and MQRO_EXCEPTION_WITH_DATA to the Report field; only one of these can be specified.
2062 (X'080E')MQRC_SECOND_MARK_NOT_ALLOWED
Explanation:
An MQGET call was issued specifying the MQGMO_MARK_SKIP_BACKOUT option in the Options field of MQGMO, but a message has already been marked within the current unit of work. Only one marked message is allowed within each unit of work.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Modify the application so that no more than one message is marked within each unit of work.

2063 (X'080F')MQRC_SECURITY_ERROR
Explanation:
An MQCONN, MQCONNX, MQOPEN, MQPUT1, or MQCLOSE call was issued, but it failed because a security error occurred. 

On z/OS, the security error was returned by the External Security Manager.
Completion Code:
MQCC_FAILED

Programmer Response:
Note the error from the security manager, and contact your system programmer or security administrator. 

On i5/OS, the FFST log will contain the error information.
2065 (X'0811')MQRC_SELECTOR_COUNT_ERROR
Explanation:
On an MQINQ or MQSET call, the SelectorCount parameter specifies a value that is not valid. This reason also occurs if the parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a value in the range 0 through 256.

2066 (X'0812')MQRC_SELECTOR_LIMIT_EXCEEDED
Explanation:
On an MQINQ or MQSET call, the SelectorCount parameter specifies a value that is larger than the maximum supported (256).

Completion Code:
MQCC_FAILED

Programmer Response:
Reduce the number of selectors specified on the call; the valid range is 0 through 256.

2067 (X'0813')MQRC_SELECTOR_ERROR
Explanation:
An MQINQ or MQSET call was issued, but the Selectors array contains a selector that is not valid for one of the following reasons: 

The selector is not supported or out of range. 
The selector is not applicable to the type of object whose attributes are being inquired or set. 
The selector is for an attribute that cannot be set.
This reason also occurs if the parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the value specified for the selector is valid for the object type represented by Hobj. For the MQSET call, also ensure that the selector represents an integer attribute that can be set.

2068 (X'0814')MQRC_SELECTOR_NOT_FOR_TYPE
Explanation:
On the MQINQ call, one or more selectors in the Selectors array is not applicable to the type of the queue whose attributes are being inquired.

This reason also occurs when the queue is a cluster queue that resolved to a remote instance of the queue. In this case only a subset of the attributes that are valid for local queues can be inquired. See the usage notes in the description of MQINQ in the WebSphere MQ Application Programming Guide for further details.

The call completes with MQCC_WARNING, with the attribute values for the inapplicable selectors set as follows: 

For integer attributes, the corresponding elements of IntAttrs are set to MQIAV_NOT_APPLICABLE. 
For character attributes, the appropriate parts of the CharAttrs string are set to a character string consisting entirely of asterisks (*).
Completion Code:
MQCC_WARNING

Programmer Response:
Verify that the selector specified is the one that was intended.

If the queue is a cluster queue, specifying one of the MQOO_BROWSE, MQOO_INPUT_*, or MQOO_SET options in addition to MQOO_INQUIRE forces the queue to resolve to the local instance of the queue. However, if there is no local instance of the queue the MQOPEN call fails.

2069 (X'0815')MQRC_SIGNAL_OUTSTANDING
Explanation:
An MQGET call was issued with either the MQGMO_SET_SIGNAL or MQGMO_WAIT option, but there is already a signal outstanding for the queue handle Hobj.

This reason code occurs only in the following environments: z/OS, Windows 95, Windows 98.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the application logic. If it is necessary to set a signal or wait when there is a signal outstanding for the same queue, a different object handle must be used.

2070 (X'0816')MQRC_SIGNAL_REQUEST_ACCEPTED
Explanation:
An MQGET call was issued specifying MQGMO_SET_SIGNAL in the GetMsgOpts parameter, but no suitable message was available; the call returns immediately. The application can now wait for the signal to be delivered. 

On z/OS, the application should wait on the Event Control Block pointed to by the Signal1 field. 
On Windows 95, Windows 98, the application should wait for the signal Windows message to be delivered.
This reason code occurs only in the following environments: z/OS, Windows 95, Windows 98.

Completion Code:
MQCC_WARNING

Programmer Response:
Wait for the signal; when it is delivered, check the signal to ensure that a message is now available. If it is, reissue the MQGET call. 

On z/OS, wait on the ECB pointed to by the Signal1 field and, when it is posted, check it to ensure that a message is now available. 
On Windows 95, Windows 98, the application (thread) should continue executing its message loop.
2071 (X'0817')MQRC_STORAGE_NOT_AVAILABLE
Explanation:
The call failed because there is insufficient main storage available.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that active applications are behaving correctly, for example, that they are not looping unexpectedly. If no problems are found, make more main storage available. 

On z/OS, if no application problems are found, ask your system programmer to increase the size of the region in which the queue manager runs.
2072 (X'0818')MQRC_SYNCPOINT_NOT_AVAILABLE
Explanation:
Either MQGMO_SYNCPOINT was specified on an MQGET call or MQPMO_SYNCPOINT was specified on an MQPUT or MQPUT1 call, but the local queue manager was unable to honor the request. If the queue manager does not support units of work, the SyncPoint queue-manager attribute will have the value MQSP_NOT_AVAILABLE.

This reason code can also occur on the MQGET, MQPUT, and MQPUT1 calls when an external unit-of-work coordinator is being used. If that coordinator requires an explicit call to start the unit of work, but the application has not issued that call prior to the MQGET, MQPUT, or MQPUT1 call, reason code MQRC_SYNCPOINT_NOT_AVAILABLE is returned. 

On i5/OS, this reason codes means that i5/OS Commitment Control is not started, or is unavailable for use by the queue manager. 
On z/OS, this reason code does not occur.
Completion Code:
MQCC_FAILED

Programmer Response:
Remove the specification of MQGMO_SYNCPOINT or MQPMO_SYNCPOINT, as appropriate. 

On i5/OS, ensure that Commitment Control has been started. If this reason code occurs after Commitment Control has been started, contact your system programmer.
2075 (X'081B')MQRC_TRIGGER_CONTROL_ERROR
Explanation:
On an MQSET call, the value specified for the MQIA_TRIGGER_CONTROL attribute selector is not valid.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a valid value.

2076 (X'081C')MQRC_TRIGGER_DEPTH_ERROR
Explanation:
On an MQSET call, the value specified for the MQIA_TRIGGER_DEPTH attribute selector is not valid.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a value that is greater than zero.

2077 (X'081D')MQRC_TRIGGER_MSG_PRIORITY_ERR
Explanation:
On an MQSET call, the value specified for the MQIA_TRIGGER_MSG_PRIORITY attribute selector is not valid.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a value in the range zero through the value of MaxPriority queue-manager attribute.

2078 (X'081E')MQRC_TRIGGER_TYPE_ERROR
Explanation:
On an MQSET call, the value specified for the MQIA_TRIGGER_TYPE attribute selector is not valid.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a valid value.

2079 (X'081F')MQRC_TRUNCATED_MSG_ACCEPTED
Explanation:
On an MQGET call, the message length was too large to fit into the supplied buffer. The MQGMO_ACCEPT_TRUNCATED_MSG option was specified, so the call completes. The message is removed from the queue (subject to unit-of-work considerations), or, if this was a browse operation, the browse cursor is advanced to this message.

The DataLength parameter is set to the length of the message before truncation, the Buffer parameter contains as much of the message as fits, and the MQMD structure is filled in.

Completion Code:
MQCC_WARNING

Programmer Response:
None, because the application expected this situation.

2080 (X'0820')MQRC_TRUNCATED_MSG_FAILED
Explanation:
On an MQGET call, the message length was too large to fit into the supplied buffer. The MQGMO_ACCEPT_TRUNCATED_MSG option was not specified, so the message has not been removed from the queue. If this was a browse operation, the browse cursor remains where it was before this call, but if MQGMO_BROWSE_FIRST was specified, the browse cursor is positioned logically before the highest-priority message on the queue.

The DataLength field is set to the length of the message before truncation, the Buffer parameter contains as much of the message as fits, and the MQMD structure is filled in.

Completion Code:
MQCC_WARNING

Programmer Response:
Supply a buffer that is at least as large as DataLength, or specify MQGMO_ACCEPT_TRUNCATED_MSG if not all of the message data is required.

2082 (X'0822')MQRC_UNKNOWN_ALIAS_BASE_Q
Explanation:
An MQOPEN or MQPUT1 call was issued specifying an alias queue as the target, but the BaseQName in the alias queue attributes is not recognized as a queue name.

This reason code can also occur when BaseQName is the name of a cluster queue that cannot be resolved successfully.

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the queue definitions.

2085 (X'0825')MQRC_UNKNOWN_OBJECT_NAME
Explanation:
An MQOPEN or MQPUT1 call was issued, but the object identified by the ObjectName and ObjectQMgrName fields in the object descriptor MQOD cannot be found. One of the following applies: 

The ObjectQMgrName field is one of the following: 
Blank 
The name of the local queue manager 
The name of a local definition of a remote queue (a queue-manager alias) in which the RemoteQMgrName attribute is the name of the local queue manager
but no object with the specified ObjectName and ObjectType exists on the local queue manager. 
The object being opened is a cluster queue that is hosted on a remote queue manager, but the local queue manager does not have a defined route to the remote queue manager. 
The object being opened is a queue definition that has QSGDISP(GROUP). Such definitions cannot be used with the MQOPEN and MQPUT1 calls.
This can also occur in response to a command that specifies the name of an object or other item that does not exist.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a valid object name. Ensure that the name is padded to the right with blanks if necessary. If this is correct, check the object definitions.

2086 (X'0826')MQRC_UNKNOWN_OBJECT_Q_MGR
Explanation:
On an MQOPEN or MQPUT1 call, the ObjectQMgrName field in the object descriptor MQOD does not satisfy the naming rules for objects. For more information, see the WebSphere MQ Application Programming Guide.

This reason also occurs if the ObjectType field in the object descriptor has the value MQOT_Q_MGR, and the ObjectQMgrName field is not blank, but the name specified is not the name of the local queue manager.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a valid queue manager name. To refer to the local queue manager, a name consisting entirely of blanks or beginning with a null character can be used. Ensure that the name is padded to the right with blanks or terminated with a null character if necessary.

2087 (X'0827')MQRC_UNKNOWN_REMOTE_Q_MGR
Explanation:
On an MQOPEN or MQPUT1 call, an error occurred with the queue-name resolution, for one of the following reasons: 

ObjectQMgrName is blank or the name of the local queue manager, ObjectName is the name of a local definition of a remote queue (or an alias to one), and one of the following is true: 
RemoteQMgrName is blank or the name of the local queue manager. Note that this error occurs even if XmitQName is not blank. 
XmitQName is blank, but there is no transmission queue defined with the name of RemoteQMgrName, and the DefXmitQName queue-manager attribute is blank. 
RemoteQMgrName and RemoteQName specify a cluster queue that cannot be resolved successfully, and the DefXmitQName queue-manager attribute is blank.
ObjectQMgrName is the name of a local definition of a remote queue (containing a queue-manager alias definition), and one of the following is true: 
RemoteQName is not blank. 
XmitQName is blank, but there is no transmission queue defined with the name of RemoteQMgrName, and the DefXmitQName queue-manager attribute is blank.
ObjectQMgrName is not: 
Blank 
The name of the local queue manager 
The name of a transmission queue 
The name of a queue-manager alias definition (that is, a local definition of a remote queue with a blank RemoteQName)
but the DefXmitQName queue-manager attribute is blank and the queue manager is not part of a queue-sharing group with intra-group queuing enabled. 
ObjectQMgrName is the name of a model queue. 
The queue name is resolved through a cell directory. However, there is no queue defined with the same name as the remote queue manager name obtained from the cell directory, and the DefXmitQName queue-manager attribute is blank.
Completion Code:
MQCC_FAILED

Programmer Response:
Check the values specified for ObjectQMgrName and ObjectName. If these are correct, check the queue definitions.

2090 (X'082A')MQRC_WAIT_INTERVAL_ERROR
Explanation:
On the MQGET call, the value specified for the WaitInterval field in the GetMsgOpts parameter is not valid.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a value greater than or equal to zero, or the special value MQWI_UNLIMITED if an indefinite wait is required.

2091 (X'082B')MQRC_XMIT_Q_TYPE_ERROR
Explanation:
On an MQOPEN or MQPUT1 call, a message is to be sent to a remote queue manager. The ObjectName or ObjectQMgrName field in the object descriptor specifies the name of a local definition of a remote queue but one of the following applies to the XmitQName attribute of the definition: 

XmitQName is not blank, but specifies a queue that is not a local queue 
XmitQName is blank, but RemoteQMgrName specifies a queue that is not a local queue
This reason also occurs if the queue name is resolved through a cell directory, and the remote queue manager name obtained from the cell directory is the name of a queue, but this is not a local queue.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the values specified for ObjectName and ObjectQMgrName. If these are correct, check the queue definitions. For more information on transmission queues, see the WebSphere MQ Application Programming Guide.

2092 (X'082C')MQRC_XMIT_Q_USAGE_ERROR
Explanation:
On an MQOPEN or MQPUT1 call, a message is to be sent to a remote queue manager, but one of the following occurred: 

ObjectQMgrName specifies the name of a local queue, but it does not have a Usage attribute of MQUS_TRANSMISSION. 
The ObjectName or ObjectQMgrName field in the object descriptor specifies the name of a local definition of a remote queue but one of the following applies to the XmitQName attribute of the definition: 
XmitQName is not blank, but specifies a queue that does not have a Usage attribute of MQUS_TRANSMISSION 
XmitQName is blank, but RemoteQMgrName specifies a queue that does not have a Usage attribute of MQUS_TRANSMISSION 
XmitQName specifies the queue SYSTEM.QSG.TRANSMIT.QUEUE but the IGQ queue manager attribute indicates that IGQ is DISABLED.
The queue name is resolved through a cell directory, and the remote queue manager name obtained from the cell directory is the name of a local queue, but it does not have a Usage attribute of MQUS_TRANSMISSION.
Completion Code:
MQCC_FAILED

Programmer Response:
Check the values specified for ObjectName and ObjectQMgrName. If these are correct, check the queue definitions. For more information on transmission queues, see the WebSphere MQ Application Programming Guide.

2093 (X'082D')MQRC_NOT_OPEN_FOR_PASS_ALL
Explanation:
An MQPUT call was issued with the MQPMO_PASS_ALL_CONTEXT option specified in the PutMsgOpts parameter, but the queue had not been opened with the MQOO_PASS_ALL_CONTEXT option.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify MQOO_PASS_ALL_CONTEXT (or another option that implies it) when the queue is opened.

2094 (X'082E')MQRC_NOT_OPEN_FOR_PASS_IDENT
Explanation:
An MQPUT call was issued with the MQPMO_PASS_IDENTITY_CONTEXT option specified in the PutMsgOpts parameter, but the queue had not been opened with the MQOO_PASS_IDENTITY_CONTEXT option.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify MQOO_PASS_IDENTITY_CONTEXT (or another option that implies it) when the queue is opened.

2095 (X'082F')MQRC_NOT_OPEN_FOR_SET_ALL
Explanation:
An MQPUT call was issued with the MQPMO_SET_ALL_CONTEXT option specified in the PutMsgOpts parameter, but the queue had not been opened with the MQOO_SET_ALL_CONTEXT option.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify MQOO_SET_ALL_CONTEXT when the queue is opened.

2096 (X'0830')MQRC_NOT_OPEN_FOR_SET_IDENT
Explanation:
An MQPUT call was issued with the MQPMO_SET_IDENTITY_CONTEXT option specified in the PutMsgOpts parameter, but the queue had not been opened with the MQOO_SET_IDENTITY_CONTEXT option.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify MQOO_SET_IDENTITY_CONTEXT (or another option that implies it) when the queue is opened.

2097 (X'0831')MQRC_CONTEXT_HANDLE_ERROR
Explanation:
On an MQPUT or MQPUT1 call, MQPMO_PASS_IDENTITY_CONTEXT or MQPMO_PASS_ALL_CONTEXT was specified, but the handle specified in the Context field of the PutMsgOpts parameter is either not a valid queue handle, or it is a valid queue handle but the queue was not opened with MQOO_SAVE_ALL_CONTEXT.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify MQOO_SAVE_ALL_CONTEXT when the queue referred to is opened.

2098 (X'0832')MQRC_CONTEXT_NOT_AVAILABLE
Explanation:
On an MQPUT or MQPUT1 call, MQPMO_PASS_IDENTITY_CONTEXT or MQPMO_PASS_ALL_CONTEXT was specified, but the queue handle specified in the Context field of the PutMsgOpts parameter has no context associated with it. This arises if no message has yet been successfully retrieved with the queue handle referred to, or if the last successful MQGET call was a browse.

This condition does not arise if the message that was last retrieved had no context associated with it. 

On z/OS, if a message is received by a message channel agent that is putting messages with the authority of the user identifier in the message, this code is returned in the Feedback field of an exception report if the message has no context associated with it.
Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that a successful nonbrowse get call has been issued with the queue handle referred to.

2099 (X'0833')MQRC_SIGNAL1_ERROR
Explanation:
An MQGET call was issued, specifying MQGMO_SET_SIGNAL in the GetMsgOpts parameter, but the Signal1 field is not valid. 

On z/OS, the address contained in the Signal1 field is not valid, or points to read-only storage. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.) 
On Windows 95, Windows 98, the window handle in the Signal1 field is not valid.
This reason code occurs only in the following environments: z/OS, Windows 95, Windows 98.

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the setting of the Signal1 field.

2100 (X'0834')MQRC_OBJECT_ALREADY_EXISTS
Explanation:
An MQOPEN call was issued to create a dynamic queue, but a queue with the same name as the dynamic queue already exists. 

On z/OS, a rare "race condition" can also give rise to this reason code; see the description of reason code MQRC_NAME_IN_USE for more details.
Completion Code:
MQCC_FAILED

Programmer Response:
If supplying a dynamic queue name in full, ensure that it obeys the naming conventions for dynamic queues; if it does, either supply a different name, or delete the existing queue if it is no longer required. Alternatively, allow the queue manager to generate the name.

If the queue manager is generating the name (either in part or in full), reissue the MQOPEN call.

2101 (X'0835')MQRC_OBJECT_DAMAGED
Explanation:
The object accessed by the call is damaged and cannot be used. For example, this may be because the definition of the object in main storage is not consistent, or because it differs from the definition of the object on disk, or because the definition on disk cannot be read. The object can be deleted, although it may not be possible to delete the associated user space. 

On z/OS, this reason occurs when the DB2 list header or structure number associated with a shared queue is zero. This situation arises as a result of using the MQSC command DELETE CFSTRUCT to delete the DB2 structure definition. The command resets the list header and structure number to zero for each of the shared queues that references the deleted CF strcture.
Completion Code:
MQCC_FAILED

Programmer Response:
It may be necessary to stop and restart the queue manager, or to restore the queue-manager data from back-up storage. 

On HP OpenVMS, OS/2, i5/OS, Compaq NonStop Kernel, and UNIX systems, consult the FFST(TM) record to obtain more detail about the problem. 
On z/OS, delete the shared queue and redefine it using the MQSC command DEFINE QLOCAL. This will automatically define a CF structure and allocate list headers for it.
2102 (X'0836')MQRC_RESOURCE_PROBLEM
Explanation:
There are insufficient system resources to complete the call successfully.

Completion Code:
MQCC_FAILED

Programmer Response:
Run the application when the machine is less heavily loaded. 

On z/OS, check the operator console for messages that may provide additional information. 
On HP OpenVMS, OS/2, i5/OS, Compaq NonStop Kernel, and UNIX systems, consult the FFST record to obtain more detail about the problem.
2103 (X'0837')MQRC_ANOTHER_Q_MGR_CONNECTED
Explanation:
An MQCONN or MQCONNX call was issued, but the thread or process is already connected to a different queue manager. The thread or process can connect to only one queue manager at a time. 

On z/OS, this reason code does not occur. 
On Windows, MTS objects do not receive this reason code, as connections to other queue managers are allowed.
Completion Code:
MQCC_FAILED

Programmer Response:
Use the MQDISC call to disconnect from the queue manager that is already connected, and then issue the MQCONN or MQCONNX call to connect to the new queue manager.

Disconnecting from the existing queue manager will close any queues that are currently open; it is recommended that any uncommitted units of work should be committed or backed out before the MQDISC call is issued.

2104 (X'0838')MQRC_UNKNOWN_REPORT_OPTION
Explanation:
An MQPUT or MQPUT1 call was issued, but the Report field in the message descriptor MQMD contains one or more options that are not recognized by the local queue manager. The options are accepted.

The options that cause this reason code to be returned depend on the destination of the message; see the description of REPORT in the WebSphere MQ Application Programming Guide for more details.

Completion Code:
MQCC_WARNING

Programmer Response:
If this reason code is expected, no corrective action is required. If this reason code is not expected, do the following: 

Ensure that the Report field in the message descriptor is initialized with a value when the message descriptor is declared, or is assigned a value prior to the MQPUT or MQPUT1 call. 
Ensure that the report options specified are valid; see the Report field described in the description of MQMD in the WebSphere MQ Application Programming Guide for valid report options. 
If multiple report options are being set by adding the individual report options together, ensure that the same report option is not added twice. 
Check that conflicting report options are not specified. For example, do not add both MQRO_EXCEPTION and MQRO_EXCEPTION_WITH_DATA to the Report field; only one of these can be specified.
2105 (X'0839')MQRC_STORAGE_CLASS_ERROR
Explanation:
The MQPUT or MQPUT1 call was issued, but the storage-class object defined for the queue does not exist.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Create the storage-class object required by the queue, or modify the queue definition to use an existing storage class. The name of the storage-class object used by the queue is given by the StorageClass queue attribute.

2106 (X'083A')MQRC_COD_NOT_VALID_FOR_XCF_Q
Explanation:
An MQPUT or MQPUT1 call was issued, but the Report field in the message descriptor MQMD specifies one of the MQRO_COD_* options and the target queue is an XCF queue. MQRO_COD_* options cannot be specified for XCF queues.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Remove the relevant MQRO_COD_* option.

2107 (X'083B')MQRC_XWAIT_CANCELED
Explanation:
An MQXWAIT call was issued, but the call has been canceled because a STOP CHINIT command has been issued (or the queue manager has been stopped, which causes the same effect). Refer to the WebSphere MQ Intercommunication book for details of the MQXWAIT call.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Tidy up and terminate.

2108 (X'083C')MQRC_XWAIT_ERROR
Explanation:
An MQXWAIT call was issued, but the invocation was not valid for one of the following reasons: 

The wait descriptor MQXWD contains data that is not valid. 
The linkage stack level is not valid. 
The addressing mode is not valid. 
There are too many wait events outstanding.
This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Obey the rules for using the MQXWAIT call. Refer to the WebSphere MQ Intercommunication book for details of this call.

2109 (X'083D')MQRC_SUPPRESSED_BY_EXIT
Explanation:
On any call other than MQCONN or MQDISC, the API crossing exit suppressed the call.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Obey the rules for MQI calls that the exit enforces. To find out the rules, see the writer of the exit.

2110 (X'083E')MQRC_FORMAT_ERROR
Explanation:
An MQGET call was issued with the MQGMO_CONVERT option specified in the GetMsgOpts parameter, but the message cannot be converted successfully due to an error associated with the message format. Possible errors include: 

The format name in the message is MQFMT_NONE. 
A user-written exit with the name specified by the Format field in the message cannot be found. 
The message contains data that is not consistent with the format definition.
The message is returned unconverted to the application issuing the MQGET call, the values of the CodedCharSetId and Encoding fields in the MsgDesc parameter are set to those of the message returned, and the call completes with MQCC_WARNING.

If the message consists of several parts, each of which is described by its own CodedCharSetId and Encoding fields (for example, a message with format name MQFMT_DEAD_LETTER_HEADER), some parts may be converted and other parts not converted. However, the values returned in the various CodedCharSetId and Encoding fields always correctly describe the relevant message data.

Completion Code:
MQCC_WARNING

Programmer Response:
Check the format name that was specified when the message was put. If this is not one of the built-in formats, check that a suitable exit with the same name as the format is available for the queue manager to load. Verify that the data in the message corresponds to the format expected by the exit.

2111 (X'083F')MQRC_SOURCE_CCSID_ERROR
Explanation:
The coded character-set identifier from which character data is to be converted is not valid or not supported.

This can occur on the MQGET call when the MQGMO_CONVERT option is included in the GetMsgOpts parameter; the coded character-set identifier in error is the CodedCharSetId field in the message being retrieved. In this case, the message data is returned unconverted, the values of the CodedCharSetId and Encoding fields in the MsgDesc parameter are set to those of the message returned, and the call completes with MQCC_WARNING.

This reason can also occur on the MQGET call when the message contains one or more MQ header structures (MQCIH, MQDLH, MQIIH, MQRMH), and the CodedCharSetId field in the message specifies a character set that does not have SBCS characters for the characters that are valid in queue names. MQ header structures containing such characters are not valid, and so the message is returned unconverted. The Unicode character set UCS-2 is an example of such a character set.

If the message consists of several parts, each of which is described by its own CodedCharSetId and Encoding fields (for example, a message with format name MQFMT_DEAD_LETTER_HEADER), some parts may be converted and other parts not converted. However, the values returned in the various CodedCharSetId and Encoding fields always correctly describe the relevant message data.

This reason can also occur on the MQXCNVC call; the coded character-set identifier in error is the SourceCCSID parameter. Either the SourceCCSID parameter specifies a value that is not valid or not supported, or the SourceCCSID parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
Check the character-set identifier that was specified when the message was put, or that was specified for the SourceCCSID parameter on the MQXCNVC call. If this is correct, check that it is one for which queue-manager conversion is supported. If queue-manager conversion is not supported for the specified character set, conversion must be carried out by the application.

2112 (X'0840')MQRC_SOURCE_INTEGER_ENC_ERROR
Explanation:
On an MQGET call, with the MQGMO_CONVERT option included in the GetMsgOpts parameter, the Encoding value in the message being retrieved specifies an integer encoding that is not recognized. The message data is returned unconverted, the values of the CodedCharSetId and Encoding fields in the MsgDesc parameter are set to those of the message returned, and the call completes with MQCC_WARNING.

If the message consists of several parts, each of which is described by its own CodedCharSetId and Encoding fields (for example, a message with format name MQFMT_DEAD_LETTER_HEADER), some parts may be converted and other parts not converted. However, the values returned in the various CodedCharSetId and Encoding fields always correctly describe the relevant message data.

This reason code can also occur on the MQXCNVC call, when the Options parameter contains an unsupported MQDCC_SOURCE_* value, or when MQDCC_SOURCE_ENC_UNDEFINED is specified for a UCS-2 code page.

Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
Check the integer encoding that was specified when the message was put. If this is correct, check that it is one for which queue-manager conversion is supported. If queue-manager conversion is not supported for the required integer encoding, conversion must be carried out by the application.

2113 (X'0841')MQRC_SOURCE_DECIMAL_ENC_ERROR
Explanation:
On an MQGET call with the MQGMO_CONVERT option included in the GetMsgOpts parameter, the Encoding value in the message being retrieved specifies a decimal encoding that is not recognized. The message data is returned unconverted, the values of the CodedCharSetId and Encoding fields in the MsgDesc parameter are set to those of the message returned, and the call completes with MQCC_WARNING.

If the message consists of several parts, each of which is described by its own CodedCharSetId and Encoding fields (for example, a message with format name MQFMT_DEAD_LETTER_HEADER), some parts may be converted and other parts not converted. However, the values returned in the various CodedCharSetId and Encoding fields always correctly describe the relevant message data.

Completion Code:
MQCC_WARNING

Programmer Response:
Check the decimal encoding that was specified when the message was put. If this is correct, check that it is one for which queue-manager conversion is supported. If queue-manager conversion is not supported for the required decimal encoding, conversion must be carried out by the application.

2114 (X'0842')MQRC_SOURCE_FLOAT_ENC_ERROR
Explanation:
On an MQGET call, with the MQGMO_CONVERT option included in the GetMsgOpts parameter, the Encoding value in the message being retrieved specifies a floating-point encoding that is not recognized. The message data is returned unconverted, the values of the CodedCharSetId and Encoding fields in the MsgDesc parameter are set to those of the message returned, and the call completes with MQCC_WARNING.

If the message consists of several parts, each of which is described by its own CodedCharSetId and Encoding fields (for example, a message with format name MQFMT_DEAD_LETTER_HEADER), some parts may be converted and other parts not converted. However, the values returned in the various CodedCharSetId and Encoding fields always correctly describe the relevant message data.

Completion Code:
MQCC_WARNING

Programmer Response:
Check the floating-point encoding that was specified when the message was put. If this is correct, check that it is one for which queue-manager conversion is supported. If queue-manager conversion is not supported for the required floating-point encoding, conversion must be carried out by the application.

2115 (X'0843')MQRC_TARGET_CCSID_ERROR
Explanation:
The coded character-set identifier to which character data is to be converted is not valid or not supported.

This can occur on the MQGET call when the MQGMO_CONVERT option is included in the GetMsgOpts parameter; the coded character-set identifier in error is the CodedCharSetId field in the MsgDesc parameter. In this case, the message data is returned unconverted, the values of the CodedCharSetId and Encoding fields in the MsgDesc parameter are set to those of the message returned, and the call completes with MQCC_WARNING.

This reason can also occur on the MQGET call when the message contains one or more MQ header structures (MQCIH, MQDLH, MQIIH, MQRMH), and the CodedCharSetId field in the MsgDesc parameter specifies a character set that does not have SBCS characters for the characters that are valid in queue names. The Unicode character set UCS-2 is an example of such a character set.

This reason can also occur on the MQXCNVC call; the coded character-set identifier in error is the TargetCCSID parameter. Either the TargetCCSID parameter specifies a value that is not valid or not supported, or the TargetCCSID parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
Check the character-set identifier that was specified for the CodedCharSetId field in the MsgDesc parameter on the MQGET call, or that was specified for the SourceCCSID parameter on the MQXCNVC call. If this is correct, check that it is one for which queue-manager conversion is supported. If queue-manager conversion is not supported for the specified character set, conversion must be carried out by the application.

2116 (X'0844')MQRC_TARGET_INTEGER_ENC_ERROR
Explanation:
On an MQGET call with the MQGMO_CONVERT option included in the GetMsgOpts parameter, the Encoding value in the MsgDesc parameter specifies an integer encoding that is not recognized. The message data is returned unconverted, the values of the CodedCharSetId and Encoding fields in the MsgDesc parameter are set to those of the message being retrieved, and the call completes with MQCC_WARNING.

This reason code can also occur on the MQXCNVC call, when the Options parameter contains an unsupported MQDCC_TARGET_* value, or when MQDCC_TARGET_ENC_UNDEFINED is specified for a UCS-2 code page.

Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
Check the integer encoding that was specified. If this is correct, check that it is one for which queue-manager conversion is supported. If queue-manager conversion is not supported for the required integer encoding, conversion must be carried out by the application.

2117 (X'0845')MQRC_TARGET_DECIMAL_ENC_ERROR
Explanation:
On an MQGET call with the MQGMO_CONVERT option included in the GetMsgOpts parameter, the Encoding value in the MsgDesc parameter specifies a decimal encoding that is not recognized. The message data is returned unconverted, the values of the CodedCharSetId and Encoding fields in the MsgDesc parameter are set to those of the message returned, and the call completes with MQCC_WARNING.

Completion Code:
MQCC_WARNING

Programmer Response:
Check the decimal encoding that was specified. If this is correct, check that it is one for which queue-manager conversion is supported. If queue-manager conversion is not supported for the required decimal encoding, conversion must be carried out by the application.

2118 (X'0846')MQRC_TARGET_FLOAT_ENC_ERROR
Explanation:
On an MQGET call with the MQGMO_CONVERT option included in the GetMsgOpts parameter, the Encoding value in the MsgDesc parameter specifies a floating-point encoding that is not recognized. The message data is returned unconverted, the values of the CodedCharSetId and Encoding fields in the MsgDesc parameter are set to those of the message returned, and the call completes with MQCC_WARNING.

Completion Code:
MQCC_WARNING

Programmer Response:
Check the floating-point encoding that was specified. If this is correct, check that it is one for which queue-manager conversion is supported. If queue-manager conversion is not supported for the required floating-point encoding, conversion must be carried out by the application.

2119 (X'0847')MQRC_NOT_CONVERTED
Explanation:
An MQGET call was issued with the MQGMO_CONVERT option specified in the GetMsgOpts parameter, but an error occurred during conversion of the data in the message. The message data is returned unconverted, the values of the CodedCharSetId and Encoding fields in the MsgDesc parameter are set to those of the message returned, and the call completes with MQCC_WARNING.

If the message consists of several parts, each of which is described by its own CodedCharSetId and Encoding fields (for example, a message with format name MQFMT_DEAD_LETTER_HEADER), some parts may be converted and other parts not converted. However, the values returned in the various CodedCharSetId and Encoding fields always correctly describe the relevant message data.

This error may also indicate that a parameter to the data-conversion service is not supported.

Completion Code:
MQCC_WARNING

Programmer Response:
Check that the message data is correctly described by the Format, CodedCharSetId and Encoding parameters that were specified when the message was put. Also check that these values, and the CodedCharSetId and Encoding specified in the MsgDesc parameter on the MQGET call, are supported for queue-manager conversion. If the required conversion is not supported, conversion must be carried out by the application.

2120 (X'0848')MQRC_CONVERTED_MSG_TOO_BIG
Explanation:
On an MQGET call with the MQGMO_CONVERT option included in the GetMsgOpts parameter, the message data expanded during data conversion and exceeded the size of the buffer provided by the application. However, the message had already been removed from the queue because prior to conversion the message data could be accommodated in the application buffer without truncation.

The message is returned unconverted, with the CompCode parameter of the MQGET call set to MQCC_WARNING. If the message consists of several parts, each of which is described by its own character-set and encoding fields (for example, a message with format name MQFMT_DEAD_LETTER_HEADER), some parts may be converted and other parts not converted. However, the values returned in the various character-set and encoding fields always correctly describe the relevant message data.

This reason can also occur on the MQXCNVC call, when the TargetBuffer parameter is too small too accommodate the converted string, and the string has been truncated to fit in the buffer. The length of valid data returned is given by the DataLength parameter; in the case of a DBCS string or mixed SBCS/DBCS string, this length may be less than the length of TargetBuffer.

Completion Code:
MQCC_WARNING

Programmer Response:
For the MQGET call, check that the exit is converting the message data correctly and setting the output length DataLength to the appropriate value. If it is, the application issuing the MQGET call must provide a larger buffer for the Buffer parameter.

For the MQXCNVC call, if the string must be converted without truncation, provide a larger output buffer.

2121 (X'0849')MQRC_NO_EXTERNAL_PARTICIPANTS
Explanation:
An MQBEGIN call was issued to start a unit of work coordinated by the queue manager, but no participating resource managers have been registered with the queue manager. As a result, only changes to MQ resources can be coordinated by the queue manager in the unit of work.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows.

Completion Code:
MQCC_WARNING

Programmer Response:
If the application does not require non-MQ resources to participate in the unit of work, this reason code can be ignored or the MQBEGIN call removed. Otherwise consult your system programmer to determine why the required resource managers have not been registered with the queue manager; the queue manager's configuration file may be in error.

2122 (X'084A')MQRC_PARTICIPANT_NOT_AVAILABLE
Explanation:
An MQBEGIN call was issued to start a unit of work coordinated by the queue manager, but one or more of the participating resource managers that had been registered with the queue manager is not available. As a result, changes to those resources cannot be coordinated by the queue manager in the unit of work.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows.

Completion Code:
MQCC_WARNING

Programmer Response:
If the application does not require non-MQ resources to participate in the unit of work, this reason code can be ignored. Otherwise consult your system programmer to determine why the required resource managers are not available. The resource manager may have been halted temporarily, or there may be an error in the queue manager's configuration file.

2123 (X'084B')MQRC_OUTCOME_MIXED
Explanation:
The queue manager is acting as the unit-of-work coordinator for a unit of work that involves other resource managers, but one of the following occurred: 

An MQCMIT or MQDISC call was issued to commit the unit of work, but one or more of the participating resource managers backed-out the unit of work instead of committing it. As a result, the outcome of the unit of work is mixed. 
An MQBACK call was issued to back out a unit of work, but one or more of the participating resource managers had already committed the unit of work.
This reason code occurs in the following environments: AIX, HP-UX, OS/2, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Examine the queue-manager error logs for messages relating to the mixed outcome; these messages identify the resource managers that are affected. Use procedures local to the affected resource managers to resynchronize the resources.

This reason code does not prevent the application initiating further units of work.

2124 (X'084C')MQRC_OUTCOME_PENDING
Explanation:
The queue manager is acting as the unit-of-work coordinator for a unit of work that involves other resource managers, and an MQCMIT or MQDISC call was issued to commit the unit of work, but one or more of the participating resource managers has not confirmed that the unit of work was committed successfully.

The completion of the commit operation will happen at some point in the future, but there remains the possibility that the outcome will be mixed.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, Solaris, Windows.

Completion Code:
MQCC_WARNING

Programmer Response:
Use the normal error-reporting mechanisms to determine whether the outcome was mixed. If it was, take appropriate action to resynchronize the resources.

This reason code does not prevent the application initiating further units of work.

2125 (X'084D')MQRC_BRIDGE_STARTED
Explanation:
The IMS bridge has been started.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2126 (X'084E')MQRC_BRIDGE_STOPPED
Explanation:
The IMS bridge has been stopped.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2127 (X'084F')MQRC_ADAPTER_STORAGE_SHORTAGE
Explanation:
On an MQCONN call, the adapter was unable to acquire storage.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Notify the system programmer. The system programmer should determine why the system is short on storage, and take appropriate action, for example, increase the region size on the step or job card.

2128 (X'0850')MQRC_UOW_IN_PROGRESS
Explanation:
An MQBEGIN call was issued to start a unit of work coordinated by the queue manager, but a unit of work is already in existence for the connection handle specified. This may be a global unit of work started by a previous MQBEGIN call, or a unit of work that is local to the queue manager or one of the cooperating resource managers. No more than one unit of work can exist concurrently for a connection handle.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Review the application logic to determine why there is a unit of work already in existence. Move the MQBEGIN call to the appropriate place in the application.

2129 (X'0851')MQRC_ADAPTER_CONN_LOAD_ERROR
Explanation:
On an MQCONN call, the connection handling module (CSQBCON for batch and CSQQCONN for IMS) could not be loaded, so the adapter could not link to it.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the correct library concatenation has been specified in the batch application program execution JCL, and in the queue-manager startup JCL.

2130 (X'0852')MQRC_ADAPTER_SERV_LOAD_ERROR
Explanation:
On an MQI call, the batch adapter could not load the API service module CSQBSRV, and so could not link to it.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the correct library concatenation has been specified in the batch application program execution JCL, and in the queue-manager startup JCL.

2131 (X'0853')MQRC_ADAPTER_DEFS_ERROR
Explanation:
On an MQCONN call, the subsystem definition module (CSQBDEFV for batch and CSQQDEFV for IMS) does not contain the required control block identifier.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Check your library concatenation. If this is correct, check that the CSQBDEFV or CSQQDEFV module contains the required subsystem ID.

2132 (X'0854')MQRC_ADAPTER_DEFS_LOAD_ERROR
Explanation:
On an MQCONN call, the subsystem definition module (CSQBDEFV for batch and CSQQDEFV for IMS) could not be loaded.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the correct library concatenation has been specified in the application program execution JCL, and in the queue-manager startup JCL.

2133 (X'0855')MQRC_ADAPTER_CONV_LOAD_ERROR
Explanation:
On an MQGET call, the adapter (batch or IMS) could not load the data conversion services modules.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the correct library concatenation has been specified in the batch application program execution JCL, and in the queue-manager startup JCL.

2134 (X'0856')MQRC_BO_ERROR
Explanation:
On an MQBEGIN call, the begin-options structure MQBO is not valid, for one of the following reasons: 

The StrucId field is not MQBO_STRUC_ID. 
The Version field is not MQBO_VERSION_1. 
The parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.) 
The queue manager cannot copy the changed structure to application storage, even though the call is successful. This can occur, for example, if the pointer points to read-only storage.
This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that input fields in the MQBO structure are set correctly.

2135 (X'0857')MQRC_DH_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQDH structure that is not valid. Possible errors include the following: 

The StrucId field is not MQDH_STRUC_ID. 
The Version field is not MQDH_VERSION_1. 
The StrucLength field specifies a value that is too small to include the structure plus the arrays of MQOR and MQPMR records. 
The CodedCharSetId field is zero, or a negative value that is not valid. 
The BufferLength parameter of the call has a value that is too small to accommodate the structure (the structure extends beyond the end of the message).
This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly. Ensure that the application sets the CodedCharSetId field to a valid value (note: MQCCSI_DEFAULT, MQCCSI_EMBEDDED, MQCCSI_Q_MGR, and MQCCSI_UNDEFINED are not valid in this field).

2136 (X'0858')MQRC_MULTIPLE_REASONS
Explanation:
An MQOPEN, MQPUT or MQPUT1 call was issued to open a distribution list or put a message to a distribution list, but the result of the call was not the same for all of the destinations in the list. One of the following applies: 

The call succeeded for some of the destinations but not others. The completion code is MQCC_WARNING in this case. 
The call failed for all of the destinations, but for differing reasons. The completion code is MQCC_FAILED in this case.
This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
Examine the MQRR response records to identify the destinations for which the call failed, and the reason for the failure. Ensure that sufficient response records are provided by the application on the call to enable the error(s) to be determined. For the MQPUT1 call, the response records must be specified using the MQOD structure, and not the MQPMO structure.

2137 (X'0859')MQRC_OPEN_FAILED
Explanation:
A queue or other MQ object could not be opened successfully, for one of the following reasons: 

An MQCONN or MQCONNX call was issued, but the queue manager was unable to open an object that is used internally by the queue manager. As a result, processing cannot continue. The error log will contain the name of the object that could not be opened. 
An MQPUT call was issued to put a message to a distribution list, but the message could not be sent to the destination to which this reason code applies because that destination was not opened successfully by the MQOPEN call. This reason occurs only in the Reason field of the MQRR response record.
This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Do one of the following: 

If the error occurred on the MQCONN or MQCONNX call, ensure that the required objects exist by running the following command and then retrying the application: 
STRMQM -c qmgrwhere qmgr should be replaced by the name of the queue manager. 
If the error occurred on the MQPUT call, examine the MQRR response records specified on the MQOPEN call to determine the reason that the queue failed to open. Ensure that sufficient response records are provided by the application on the call to enable the error(s) to be determined.
2138 (X'085A')MQRC_ADAPTER_DISC_LOAD_ERROR
Explanation:
On an MQDISC call, the disconnect handling module (CSQBDSC for batch and CSQQDISC for IMS) could not be loaded, so the adapter could not link to it.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the correct library concatenation has been specified in the application program execution JCL, and in the queue-manager startup JCL. Any uncommitted changes in a unit of work should be backed out. A unit of work that is coordinated by the queue manager is backed out automatically.

2139 (X'085B')MQRC_CNO_ERROR
Explanation:
On an MQCONNX call, the connect-options structure MQCNO is not valid, for one of the following reasons: 

The StrucId field is not MQCNO_STRUC_ID. 
The Version field specifies a value that is not valid or not supported. 
The parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.) 
The queue manager cannot copy the changed structure to application storage, even though the call is successful. This can occur, for example, if the parameter pointer points to read-only storage.
This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that input fields in the MQCNO structure are set correctly.

2140 (X'085C')MQRC_CICS_WAIT_FAILED
Explanation:
On any MQI call, the CICS adapter issued an EXEC CICS WAIT request, but the request was rejected by CICS.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Examine the CICS trace data for actual response codes. The most likely cause is that the task has been canceled by the operator or by the system.

2141 (X'085D')MQRC_DLH_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQDLH structure that is not valid. Possible errors include the following: 

The StrucId field is not MQDLH_STRUC_ID. 
The Version field is not MQDLH_VERSION_1. 
The CodedCharSetId field is zero, or a negative value that is not valid. 
The BufferLength parameter of the call has a value that is too small to accommodate the structure (the structure extends beyond the end of the message).
This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly. Ensure that the application sets the CodedCharSetId field to a valid value (note: MQCCSI_DEFAULT, MQCCSI_EMBEDDED, MQCCSI_Q_MGR, and MQCCSI_UNDEFINED are not valid in this field).

2142 (X'085E')MQRC_HEADER_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQ header structure that is not valid. Possible errors include the following: 

The StrucId field is not valid. 
The Version field is not valid. 
The StrucLength field specifies a value that is too small. 
The CodedCharSetId field is zero, or a negative value that is not valid. 
The BufferLength parameter of the call has a value that is too small to accommodate the structure (the structure extends beyond the end of the message).
This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly. Ensure that the application sets the CodedCharSetId field to a valid value (note: MQCCSI_DEFAULT, MQCCSI_EMBEDDED, MQCCSI_Q_MGR, and MQCCSI_UNDEFINED are not valid in this field).

2143 (X'085F')MQRC_SOURCE_LENGTH_ERROR
Explanation:
On the MQXCNVC call, the SourceLength parameter specifies a length that is less than zero or not consistent with the string's character set or content (for example, the character set is a double-byte character set, but the length is not a multiple of two). This reason also occurs if the SourceLength parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

This reason code can also occur on the MQGET call when the MQGMO_CONVERT option is specified. In this case it indicates that the MQRC_SOURCE_LENGTH_ERROR reason was returned by an MQXCNVC call issued by the data conversion exit.

Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
Specify a length that is zero or greater. If the reason code occurs on the MQGET call, check that the logic in the data-conversion exit is correct.

2144 (X'0860')MQRC_TARGET_LENGTH_ERROR
Explanation:
On the MQXCNVC call, the TargetLength parameter is not valid for one of the following reasons: 

TargetLength is less than zero. 
The TargetLength parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.) 
The MQDCC_FILL_TARGET_BUFFER option is specified, but the value of TargetLength is such that the target buffer cannot be filled completely with valid characters. This can occur when TargetCCSID is a pure DBCS character set (such as UCS-2), but TargetLength specifies a length that is an odd number of bytes.
This reason code can also occur on the MQGET call when the MQGMO_CONVERT option is specified. In this case it indicates that the MQRC_TARGET_LENGTH_ERROR reason was returned by an MQXCNVC call issued by the data conversion exit.

Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
Specify a length that is zero or greater. If the MQDCC_FILL_TARGET_BUFFER option is specified, and TargetCCSID is a pure DBCS character set, ensure that TargetLength specifies a length that is a multiple of two.

If the reason code occurs on the MQGET call, check that the logic in the data-conversion exit is correct.

2145 (X'0861')MQRC_SOURCE_BUFFER_ERROR
Explanation:
On the MQXCNVC call, the SourceBuffer parameter pointer is not valid, or points to storage that cannot be accessed for the entire length specified by SourceLength. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

This reason code can also occur on the MQGET call when the MQGMO_CONVERT option is specified. In this case it indicates that the MQRC_SOURCE_BUFFER_ERROR reason was returned by an MQXCNVC call issued by the data conversion exit.

Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
Specify a valid buffer. If the reason code occurs on the MQGET call, check that the logic in the data-conversion exit is correct.

2146 (X'0862')MQRC_TARGET_BUFFER_ERROR
Explanation:
On the MQXCNVC call, the TargetBuffer parameter pointer is not valid, or points to read-only storage, or to storage that cannot be accessed for the entire length specified by TargetLength. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

This reason code can also occur on the MQGET call when the MQGMO_CONVERT option is specified. In this case it indicates that the MQRC_TARGET_BUFFER_ERROR reason was returned by an MQXCNVC call issued by the data conversion exit.

Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
Specify a valid buffer. If the reason code occurs on the MQGET call, check that the logic in the data-conversion exit is correct.

2148 (X'0864')MQRC_IIH_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQIIH structure that is not valid. Possible errors include the following: 

The StrucId field is not MQIIH_STRUC_ID. 
The Version field is not MQIIH_VERSION_1. 
The StrucLength field is not MQIIH_LENGTH_1. 
The BufferLength parameter of the call has a value that is too small to accommodate the structure (the structure extends beyond the end of the message).
This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly.

2149 (X'0865')MQRC_PCF_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued to put a message containing PCF data, but the length of the message does not equal the sum of the lengths of the PCF structures present in the message. This can occur for messages with the following format names: 

MQFMT_ADMIN 
MQFMT_EVENT 
MQFMT_PCF
This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the length of the message specified on the MQPUT or MQPUT1 call equals the sum of the lengths of the PCF structures contained within the message data.

2150 (X'0866')MQRC_DBCS_ERROR
Explanation:
An error was encountered attempting to convert a double-byte character set (DBCS) string. This can occur in the following cases: 

On the MQXCNVC call, when the SourceCCSID parameter specifies the coded character-set identifier of a double-byte character set, but the SourceBuffer parameter does not contain a valid DBCS string. This may be because the string contains characters that are not valid DBCS characters, or because the string is a mixed SBCS/DBCS string and the shift-out/shift-in characters are not correctly paired. The completion code is MQCC_FAILED in this case. 
On the MQGET call, when the MQGMO_CONVERT option is specified. In this case it indicates that the MQRC_DBCS_ERROR reason code was returned by an MQXCNVC call issued by the data conversion exit. The completion code is MQCC_WARNING in this case.
Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
Specify a valid string.

If the reason code occurs on the MQGET call, check that the data in the message is valid, and that the logic in the data-conversion exit is correct.

2152 (X'0868')MQRC_OBJECT_NAME_ERROR
Explanation:
An MQOPEN or MQPUT1 call was issued to open a distribution list (that is, the RecsPresent field in MQOD is greater than zero), but the ObjectName field is neither blank nor the null string.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
If it is intended to open a distribution list, set the ObjectName field to blanks or the null string. If it is not intended to open a distribution list, set the RecsPresent field to zero.

2153 (X'0869')MQRC_OBJECT_Q_MGR_NAME_ERROR
Explanation:
An MQOPEN or MQPUT1 call was issued to open a distribution list (that is, the RecsPresent field in MQOD is greater than zero), but the ObjectQMgrName field is neither blank nor the null string.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
If it is intended to open a distribution list, set the ObjectQMgrName field to blanks or the null string. If it is not intended to open a distribution list, set the RecsPresent field to zero.

2154 (X'086A')MQRC_RECS_PRESENT_ERROR
Explanation:
An MQOPEN or MQPUT1 call was issued, but the call failed for one of the following reasons: 

RecsPresent in MQOD is less than zero. 
ObjectType in MQOD is not MQOT_Q, and RecsPresent is not zero. RecsPresent must be zero if the object being opened is not a queue.
This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
If it is intended to open a distribution list, set the ObjectType field to MQOT_Q and RecsPresent to the number of destinations in the list. If it is not intended to open a distribution list, set the RecsPresent field to zero.

2155 (X'086B')MQRC_OBJECT_RECORDS_ERROR
Explanation:
An MQOPEN or MQPUT1 call was issued to open a distribution list (that is, the RecsPresent field in MQOD is greater than zero), but the MQOR object records are not specified correctly. One of the following applies: 

ObjectRecOffset is zero and ObjectRecPtr is zero or the null pointer. 
ObjectRecOffset is not zero and ObjectRecPtr is not zero and not the null pointer. 
ObjectRecPtr is not a valid pointer. 
ObjectRecPtr or ObjectRecOffset points to storage that is not accessible.
This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that one of ObjectRecOffset and ObjectRecPtr is zero and the other nonzero. Ensure that the field used points to accessible storage.

2156 (X'086C')MQRC_RESPONSE_RECORDS_ERROR
Explanation:
An MQOPEN or MQPUT1 call was issued to open a distribution list (that is, the RecsPresent field in MQOD is greater than zero), but the MQRR response records are not specified correctly. One of the following applies: 

ResponseRecOffset is not zero and ResponseRecPtr is not zero and not the null pointer. 
ResponseRecPtr is not a valid pointer. 
ResponseRecPtr or ResponseRecOffset points to storage that is not accessible.
This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that at least one of ResponseRecOffset and ResponseRecPtr is zero. Ensure that the field used points to accessible storage.

2157 (X'086D')MQRC_ASID_MISMATCH
Explanation:
On any MQI call, the caller's primary ASID was found to be different from the home ASID.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the application (MQI calls cannot be issued in cross-memory mode). Any uncommitted changes in a unit of work should be backed out. A unit of work that is coordinated by the queue manager is backed out automatically.

2158 (X'086E')MQRC_PMO_RECORD_FLAGS_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued to put a message, but the PutMsgRecFields field in the MQPMO structure is not valid, for one of the following reasons: 

The field contains flags that are not valid. 
The message is being put to a distribution list, and put message records have been provided (that is, RecsPresent is greater than zero, and one of PutMsgRecOffset or PutMsgRecPtr is nonzero), but PutMsgRecFields has the value MQPMRF_NONE. 
MQPMRF_ACCOUNTING_TOKEN is specified without either MQPMO_SET_IDENTITY_CONTEXT or MQPMO_SET_ALL_CONTEXT.
This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that PutMsgRecFields is set with the appropriate MQPMRF_* flags to indicate which fields are present in the put message records. If MQPMRF_ACCOUNTING_TOKEN is specified, ensure that either MQPMO_SET_IDENTITY_CONTEXT or MQPMO_SET_ALL_CONTEXT is also specified. Alternatively, set both PutMsgRecOffset and PutMsgRecPtr to zero.

2159 (X'086F')MQRC_PUT_MSG_RECORDS_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued to put a message to a distribution list, but the MQPMR put message records are not specified correctly. One of the following applies: 

PutMsgRecOffset is not zero and PutMsgRecPtr is not zero and not the null pointer. 
PutMsgRecPtr is not a valid pointer. 
PutMsgRecPtr or PutMsgRecOffset points to storage that is not accessible.
This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that at least one of PutMsgRecOffset and PutMsgRecPtr is zero. Ensure that the field used points to accessible storage.

2160 (X'0870')MQRC_CONN_ID_IN_USE
Explanation:
On an MQCONN call, the connection identifier assigned by the queue manager to the connection between a CICS or IMS allied address space and the queue manager conflicts with the connection identifier of another connected CICS or IMS system. The connection identifier assigned is as follows: 

For CICS, the applid 
For IMS, the IMSID parameter on the IMSCTRL (sysgen) macro, or the IMSID parameter on the execution parameter (EXEC card in IMS control region JCL) 
For batch, the job name 
For TSO, the user ID
A conflict arises only if there are two CICS systems, two IMS systems, or one each of CICS and IMS, having the same connection identifiers. Batch and TSO connections need not have unique identifiers.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the naming conventions used in different systems that might connect to the queue manager do not conflict.

2161 (X'0871')MQRC_Q_MGR_QUIESCING
Explanation:
An MQI call was issued, but the call failed because the queue manager is quiescing (preparing to shut down).

When the queue manager is quiescing, the MQOPEN, MQPUT, MQPUT1, and MQGET calls can still complete successfully, but the application can request that they fail by specifying the appropriate option on the call: 

MQOO_FAIL_IF_QUIESCING on MQOPEN 
MQPMO_FAIL_IF_QUIESCING on MQPUT or MQPUT1 
MQGMO_FAIL_IF_QUIESCING on MQGET
Specifying these options enables the application to become aware that the queue manager is preparing to shut down. 

On z/OS: 
For batch applications, this reason can be returned to applications running in LPARs that do not have a queue manager installed. 
For CICS applications, this reason can be returned when no connection was established.
On i5/OS for applications running in compatibility mode, this reason can be returned when no connection was established.
Completion Code:
MQCC_FAILED

Programmer Response:
The application should tidy up and end. If the application specified the MQOO_FAIL_IF_QUIESCING, MQPMO_FAIL_IF_QUIESCING, or MQGMO_FAIL_IF_QUIESCING option on the failing call, the relevant option can be removed and the call reissued. By omitting these options, the application can continue working in order to complete and commit the current unit of work, but the application should not start a new unit of work.

2162 (X'0872')MQRC_Q_MGR_STOPPING
Explanation:
An MQI call was issued, but the call failed because the queue manager is shutting down. If the call was an MQGET call with the MQGMO_WAIT option, the wait has been canceled. No more MQI calls can be issued.

For MQ client applications, it is possible that the call did complete successfully, even though this reason code is returned with a CompCode of MQCC_FAILED. 

On z/OS, the MQRC_CONNECTION_BROKEN reason may be returned instead if, as a result of system scheduling factors, the queue manager shuts down before the call completes.
Completion Code:
MQCC_FAILED

Programmer Response:
The application should tidy up and end. If the application is in the middle of a unit of work coordinated by an external unit-of-work coordinator, the application should issue the appropriate call to back out the unit of work. Any unit of work that is coordinated by the queue manager is backed out automatically.

2163 (X'0873')MQRC_DUPLICATE_RECOV_COORD
Explanation:
On an MQCONN or MQCONNX call, a recovery coordinator already exists for the connection name specified on the connection call issued by the adapter.

A conflict arises only if there are two CICS systems, two IMS systems, or one each of CICS and IMS, having the same connection identifiers. Batch and TSO connections need not have unique identifiers.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the naming conventions used in different systems that might connect to the queue manager do not conflict.

2173 (X'087D')MQRC_PMO_ERROR
Explanation:
On an MQPUT or MQPUT1 call, the MQPMO structure is not valid, for one of the following reasons: 

The StrucId field is not MQPMO_STRUC_ID. 
The Version field specifies a value that is not valid or not supported. 
The parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.) 
The queue manager cannot copy the changed structure to application storage, even though the call is successful. This can occur, for example, if the pointer points to read-only storage.
Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that input fields in the MQPMO structure are set correctly.

2183 (X'0887')MQRC_API_EXIT_LOAD_ERROR
Explanation:
The API crossing exit module could not be linked. If this reason is returned when the API crossing exit is invoked after the call has been executed, the call itself may have executed correctly.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the correct library concatenation has been specified, and that the API crossing exit module is executable and correctly named. Any uncommitted changes in a unit of work should be backed out. A unit of work that is coordinated by the queue manager is backed out automatically.

2184 (X'0888')MQRC_REMOTE_Q_NAME_ERROR
Explanation:
On an MQOPEN or MQPUT1 call, one of the following occurred: 

A local definition of a remote queue (or an alias to one) was specified, but the RemoteQName attribute in the remote queue definition is entirely blank. Note that this error occurs even if the XmitQName in the definition is not blank. 
The ObjectQMgrName field in the object descriptor is not blank and not the name of the local queue manager, but the ObjectName field is blank.
Completion Code:
MQCC_FAILED

Programmer Response:
Alter the local definition of the remote queue and supply a valid remote queue name, or supply a nonblank ObjectName in the object descriptor, as appropriate.

2185 (X'0889')MQRC_INCONSISTENT_PERSISTENCE
Explanation:
An MQPUT call was issued to put a message in a group or a segment of a logical message, but the value specified or defaulted for the Persistence field in MQMD is not consistent with the current group and segment information retained by the queue manager for the queue handle. All messages in a group and all segments in a logical message must have the same value for persistence, that is, all must be persistent, or all must be nonpersistent.

If the current call specifies MQPMO_LOGICAL_ORDER, the call fails. If the current call does not specify MQPMO_LOGICAL_ORDER, but the previous MQPUT call for the queue handle did, the call succeeds with completion code MQCC_WARNING.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
Modify the application to ensure that the same value of persistence is used for all messages in the group, or all segments of the logical message.

2186 (X'088A')MQRC_GMO_ERROR
Explanation:
On an MQGET call, the MQGMO structure is not valid, for one of the following reasons: 

The StrucId field is not MQGMO_STRUC_ID. 
The Version field specifies a value that is not valid or not supported. 
The parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.) 
The queue manager cannot copy the changed structure to application storage, even though the call is successful. This can occur, for example, if the pointer points to read-only storage.
Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that input fields in the MQGMO structure are set correctly.

2187 (X'088B')MQRC_CICS_BRIDGE_RESTRICTION
Explanation:
It is not permitted to issue MQI calls from user transactions that are run in an MQ/CICS-bridge environment where the bridge exit also issues MQI calls. The MQI call fails. If this occurs in the bridge exit, it will result in a transaction abend. If it occurs in the user transaction, this may result in a transaction abend.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
The transaction cannot be run using the MQ/CICS bridge. Refer to the appropriate CICS manual for information about restrictions in the MQ/CICS bridge environment.

2188 (X'088C')MQRC_STOPPED_BY_CLUSTER_EXIT
Explanation:
An MQOPEN, MQPUT, or MQPUT1 call was issued to open or put a message on a cluster queue, but the cluster workload exit rejected the call.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the cluster workload exit to ensure that it has been written correctly. Determine why it rejected the call and correct the problem.

2189 (X'088D')MQRC_CLUSTER_RESOLUTION_ERROR
Explanation:
An MQOPEN, MQPUT, or MQPUT1 call was issued to open or put a message on a cluster queue, but the queue definition could not be resolved correctly because a response was required from the repository manager but none was available.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the repository manager is operating and that the queue and channel definitions are correct.

2190 (X'088E')MQRC_CONVERTED_STRING_TOO_BIG
Explanation:
On an MQGET call with the MQGMO_CONVERT option included in the GetMsgOpts parameter, a string in a fixed-length field in the message expanded during data conversion and exceeded the size of the field. When this happens, the queue manager tries discarding trailing blank characters and characters following the first null character in order to make the string fit, but in this case there were insufficient characters that could be discarded.

This reason code can also occur for messages with a format name of MQFMT_IMS_VAR_STRING. When this happens, it indicates that the IMS variable string expanded such that its length exceeded the capacity of the 2-byte binary length field contained within the structure of the IMS variable string. (The queue manager never discards trailing blanks in an IMS variable string.)

The message is returned unconverted, with the CompCode parameter of the MQGET call set to MQCC_WARNING. If the message consists of several parts, each of which is described by its own character-set and encoding fields (for example, a message with format name MQFMT_DEAD_LETTER_HEADER), some parts may be converted and other parts not converted. However, the values returned in the various character-set and encoding fields always correctly describe the relevant message data.

This reason code does not occur if the string could be made to fit by discarding trailing blank characters.

Completion Code:
MQCC_WARNING

Programmer Response:
Check that the fields in the message contain the correct values, and that the character-set identifiers specified by the sender and receiver of the message are correct. If they are, the layout of the data in the message must be modified to increase the lengths of the field(s) so that there is sufficient space to allow the string(s) to expand when converted.

2191 (X'088F')MQRC_TMC_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQTMC2 structure that is not valid. Possible errors include the following: 

The StrucId field is not MQTMC_STRUC_ID. 
The Version field is not MQTMC_VERSION_2. 
The BufferLength parameter of the call has a value that is too small to accommodate the structure (the structure extends beyond the end of the message).
This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly.

2192 (X'0890')MQRC_PAGESET_FULL
Explanation:
Former name for MQRC_STORAGE_MEDIUM_FULL.

2192 (X'0890')MQRC_STORAGE_MEDIUM_FULL
Explanation:
An MQI call or command was issued to operate on an object, but the call failed because the external storage medium is full. One of the following applies: 

A page-set data set is full (nonshared queues only). 
A coupling-facility structure is full (shared queues only).
This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Check which queues contain messages and look for applications that might be filling the queues unintentionally. Be aware that the queue that has caused the page set or coupling-facility structure to become full is not necessarily the queue referenced by the MQI call that returned MQRC_STORAGE_MEDIUM_FULL.

Check that all of the usual server applications are operating correctly and processing the messages on the queues.

If the applications and servers are operating correctly, increase the number of server applications to cope with the message load, or request the system programmer to increase the size of the page-set data sets.

2193 (X'0891')MQRC_PAGESET_ERROR
Explanation:
An error was encountered with the page set while attempting to access it for a locally defined queue. This could be because the queue is on a page set that does not exist. A console message is issued that tells you the number of the page set in error. For example if the error occurred in the TEST job, and your user identifier is ABCDEFG, the message is: 

CSQI041I CSQIALLC JOB TEST USER ABCDEFG HAD ERROR ACCESSING PAGE SET 27If this reason code occurs while attempting to delete a dynamic queue with MQCLOSE, the dynamic queue has not been deleted.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the storage class for the queue maps to a valid page set using the DISPLAY Q(xx) STGCLASS, DISPLAY STGCLASS(xx), and DISPLAY USAGE PSID commands. If you are unable to resolve the problem, notify the system programmer who should: 

Collect the following diagnostic information: 
A description of the actions that led to the error 
A listing of the application program being run at the time of the error 
Details of the page sets defined for use by the queue manager
Attempt to re-create the problem, and take a system dump immediately after the error occurs 
Contact your IBM Support Center
2194 (X'0892')MQRC_NAME_NOT_VALID_FOR_TYPE
Explanation:
An MQOPEN call was issued to open the queue manager definition, but the ObjectName field in the ObjDesc parameter is not blank.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the ObjectName field is set to blanks.

2195 (X'0893')MQRC_UNEXPECTED_ERROR
Explanation:
The call was rejected because an unexpected error occurred.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the application's parameter list to ensure, for example, that the correct number of parameters was passed, and that data pointers and storage keys are valid. If the problem cannot be resolved, contact your system programmer. 

On z/OS, check whether any information has been displayed on the console. If this error occurs on an MQCONN or MQCONNX call, check that the subsystem named is an active MQ subsystem. In particular, check that it is not a DB2(TM) subsystem. If the problem cannot be resolved, rerun the application with a CSQSNAP DD card (if you have not already got a dump) and send the resulting dump to IBM. 
On OS/2 and i5/OS, consult the FFST record to obtain more detail about the problem. 
On HP OpenVMS, Compaq NonStop Kernel, and UNIX systems, consult the FDC file to obtain more detail about the problem.
2196 (X'0894')MQRC_UNKNOWN_XMIT_Q
Explanation:
On an MQOPEN or MQPUT1 call, a message is to be sent to a remote queue manager. The ObjectName or the ObjectQMgrName in the object descriptor specifies the name of a local definition of a remote queue (in the latter case queue-manager aliasing is being used), but the XmitQName attribute of the definition is not blank and not the name of a locally-defined queue.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the values specified for ObjectName and ObjectQMgrName. If these are correct, check the queue definitions. For more information on transmission queues, see the WebSphere MQ Application Programming Guide.

2197 (X'0895')MQRC_UNKNOWN_DEF_XMIT_Q
Explanation:
An MQOPEN or MQPUT1 call was issued specifying a remote queue as the destination. If a local definition of the remote queue was specified, or if a queue-manager alias is being resolved, the XmitQName attribute in the local definition is blank.

Because there is no queue defined with the same name as the destination queue manager, the queue manager has attempted to use the default transmission queue. However, the name defined by the DefXmitQName queue-manager attribute is not the name of a locally-defined queue.

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the queue definitions, or the queue-manager attribute. See the WebSphere MQ Application Programming Guide for more information.

2198 (X'0896')MQRC_DEF_XMIT_Q_TYPE_ERROR
Explanation:
An MQOPEN or MQPUT1 call was issued specifying a remote queue as the destination. Either a local definition of the remote queue was specified, or a queue-manager alias was being resolved, but in either case the XmitQName attribute in the local definition is blank.

Because there is no transmission queue defined with the same name as the destination queue manager, the local queue manager has attempted to use the default transmission queue. However, although there is a queue defined by the DefXmitQName queue-manager attribute, it is not a local queue.

Completion Code:
MQCC_FAILED

Programmer Response:
Do one of the following: 

Specify a local transmission queue as the value of the XmitQName attribute in the local definition of the remote queue. 
Define a local transmission queue with a name that is the same as that of the remote queue manager. 
Specify a local transmission queue as the value of the DefXmitQName queue-manager attribute.
See the WebSphere MQ Application Programming Guide for more information.

2199 (X'0897')MQRC_DEF_XMIT_Q_USAGE_ERROR
Explanation:
An MQOPEN or MQPUT1 call was issued specifying a remote queue as the destination. Either a local definition of the remote queue was specified, or a queue-manager alias was being resolved, but in either case the XmitQName attribute in the local definition is blank.

Because there is no transmission queue defined with the same name as the destination queue manager, the local queue manager has attempted to use the default transmission queue. However, the queue defined by the DefXmitQName queue-manager attribute does not have a Usage attribute of MQUS_TRANSMISSION.

Completion Code:
MQCC_FAILED

Programmer Response:
Do one of the following: 

Specify a local transmission queue as the value of the XmitQName attribute in the local definition of the remote queue. 
Define a local transmission queue with a name that is the same as that of the remote queue manager. 
Specify a different local transmission queue as the value of the DefXmitQName queue-manager attribute. 
Change the Usage attribute of the DefXmitQName queue to MQUS_TRANSMISSION.
See the WebSphere MQ Application Programming Guide for more information.

2201 (X'0899')MQRC_NAME_IN_USE
Explanation:
An MQOPEN call was issued to create a dynamic queue, but a queue with the same name as the dynamic queue already exists. The existing queue is one that is logically deleted, but for which there are still one or more open handles. For more information, see the description of MQCLOSE in the WebSphere MQ Application Programming Guide.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Either ensure that all handles for the previous dynamic queue are closed, or ensure that the name of the new queue is unique; see the description for reason code MQRC_OBJECT_ALREADY_EXISTS.

2202 (X'089A')MQRC_CONNECTION_QUIESCING
Explanation:
This reason code is issued when the connection to the queue manager is in quiescing state, and an application issues one of the following calls: 

MQCONN or MQCONNX 
MQOPEN, with no connection established, or with MQOO_FAIL_IF_QUIESCING included in the Options parameter 
MQGET, with MQGMO_FAIL_IF_QUIESCING included in the Options field of the GetMsgOpts parameter 
MQPUT or MQPUT1, with MQPMO_FAIL_IF_QUIESCING included in the Options field of the PutMsgOpts parameter
MQRC_CONNECTION_QUIESCING is also issued by the message channel agent (MCA) when the queue manager is in quiescing state.

Completion Code:
MQCC_FAILED

Programmer Response:
The application should tidy up and terminate. Any uncommitted changes in a unit of work should be backed out.

2203 (X'089B')MQRC_CONNECTION_STOPPING
Explanation:
This reason code is issued when the connection to the queue manager is shutting down, and the application issues an MQI call. No more message-queuing calls can be issued. For the MQGET call, if the MQGMO_WAIT option was specified, the wait is canceled.

Note that the MQRC_CONNECTION_BROKEN reason may be returned instead if, as a result of system scheduling factors, the queue manager shuts down before the call completes.

MQRC_CONNECTION_STOPPING is also issued by the message channel agent (MCA) when the queue manager is shutting down.

For MQ client applications, it is possible that the call did complete successfully, even though this reason code is returned with a CompCode of MQCC_FAILED.

Completion Code:
MQCC_FAILED

Programmer Response:
The application should tidy up and terminate. Any uncommitted changes in a unit of work should be backed out. A unit of work that is coordinated by the queue manager is backed out automatically.

2204 (X'089C')MQRC_ADAPTER_NOT_AVAILABLE
Explanation:
This is issued only for CICS applications, if any call is issued and the CICS adapter (a Task Related User Exit) has been disabled, or has not been enabled.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
The application should tidy up and terminate. Any uncommitted changes in a unit of work should be backed out. A unit of work that is coordinated by the queue manager is backed out automatically.

2206 (X'089E')MQRC_MSG_ID_ERROR
Explanation:
An MQGET call was issued to retrieve a message using the message identifier as a selection criterion, but the call failed because selection by message identifier is not supported on this queue. 

On z/OS, the queue is a shared queue, but the IndexType queue attribute does not have an appropriate value: 
If selection is by message identifier alone, IndexType must have the value MQIT_MSG_ID. 
If selection is by message identifier and correlation identifier combined, IndexType must have the value MQIT_MSG_ID or MQIT_CORREL_ID.
On Compaq NonStop Kernel, a key file is required but has not been defined.
Completion Code:
MQCC_FAILED

Programmer Response:
Do one of the following: 

Modify the application so that it does not use selection by message identifier: set the MsgId field to MQMI_NONE and do not specify MQMO_MATCH_MSG_ID in MQGMO. 
On z/OS, change the IndexType queue attribute to MQIT_MSG_ID. 
On Compaq NonStop Kernel, define a key file.
2207 (X'089F')MQRC_CORREL_ID_ERROR
Explanation:
An MQGET call was issued to retrieve a message using the correlation identifier as a selection criterion, but the call failed because selection by correlation identifier is not supported on this queue. 

On z/OS, the queue is a shared queue, but the IndexType queue attribute does not have an appropriate value: 
If selection is by correlation identifier alone, IndexType must have the value MQIT_CORREL_ID. 
If selection is by correlation identifier and message identifier combined, IndexType must have the value MQIT_CORREL_ID or MQIT_MSG_ID.
On Compaq NonStop Kernel, a key file is required but has not been defined.
Completion Code:
MQCC_FAILED

Programmer Response:
Do one of the following: 

On z/OS, change the IndexType queue attribute to MQIT_CORREL_ID. 
On Compaq NonStop Kernel, define a key file. 
Modify the application so that it does not use selection by correlation identifier: set the CorrelId field to MQCI_NONE and do not specify MQMO_MATCH_CORREL_ID in MQGMO.
2208 (X'08A0')MQRC_FILE_SYSTEM_ERROR
Explanation:
An unexpected return code was received from the file system, in attempting to perform an operation on a queue.

This reason code occurs only on VSE/ESA.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the file system definition for the queue that was being accessed. For a VSAM file, check that the control interval is large enough for the maximum message length allowed for the queue.

2209 (X'08A1')MQRC_NO_MSG_LOCKED
Explanation:
An MQGET call was issued with the MQGMO_UNLOCK option, but no message was currently locked.

Completion Code:
MQCC_WARNING

Programmer Response:
Check that a message was locked by an earlier MQGET call with the MQGMO_LOCK option for the same handle, and that no intervening call has caused the message to become unlocked.

2210 (X'08A2')MQRC_SOAP_DOTNET_ERROR
Explanation:
An exception from the .NET environment (as opposed to WebSphere MQ .NET) has been received and is included as an inner exception.

Completion Code:
MQCC_FAILED

Programmer Response:
Refer to the .NET documentation for details about the inner exception. Follow the corrective action recommended there.

2211 (X'08A3')MQRC_SOAP_AXIS_ERROR
Explanation:
An exception from the Axis environment has been received and is included as a chained exception.

Completion Code:
MQCC_FAILED

Programmer Response:
Refer to the Axis documentation for details about the chained exception. Follow the corrective action recommended there.

2212 (X'08A4')MQRC_SOAP_URL_ERROR
Explanation:
The SOAP URL has been specified incorrectly.

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the SOAP URL and rerun.

2217 (X'08A9')MQRC_CONNECTION_NOT_AUTHORIZED
Explanation:
This reason code arises only for CICS applications. For these, connection to the queue manager is done by the adapter. If that connection fails because the CICS subsystem is not authorized to connect to the queue manager, this reason code is issued whenever an application running under that subsystem subsequently issues an MQI call.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the subsystem is authorized to connect to the queue manager.

2218 (X'08AA')MQRC_MSG_TOO_BIG_FOR_CHANNEL
Explanation:
A message was put to a remote queue, but the message is larger than the maximum message length allowed by the channel. This reason code is returned in the Feedback field in the message descriptor of a report message. 

On z/OS, this return code is issued only if you are not using CICS for distributed queuing. Otherwise, MQRC_MSG_TOO_BIG_FOR_Q_MGR is issued.
Completion Code:
MQCC_FAILED

Programmer Response:
Check the channel definitions. Increase the maximum message length that the channel can accept, or break the message into several smaller messages.

2219 (X'08AB')MQRC_CALL_IN_PROGRESS
Explanation:
The application issued an MQI call whilst another MQI call was already being processed for that connection. Only one call per application connection can be processed at a time.

Concurrent calls can arise when an application uses multiple threads, or when an exit is invoked as part of the processing of an MQI call. For example, a data-conversion exit invoked as part of the processing of the MQGET call may try to issue an MQI call. 

On z/OS, concurrent calls can arise only with batch or IMS applications; an example is when a subtask ends while an MQI call is in progress (for example, an MQGET that is waiting), and there is an end-of-task exit routine that issues another MQI call. 
On OS/2 and Windows, concurrent calls can also arise if an MQI call is issued in response to a user message while another MQI call is in progress. 
If the application is using multiple threads with shared handles, MQRC_CALL_IN_PROGRESS occurs when the handle specified on the call is already in use by another thread and MQCNO_HANDLE_SHARE_NO_BLOCK was specified on the MQCONNX call.
Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that an MQI call cannot be issued while another one is active. Do not issue MQI calls from within a data-conversion exit. 

On z/OS, if you want to provide a subtask to allow an application that is waiting for a message to arrive to be canceled, wait for the message by using MQGET with MQGMO_SET_SIGNAL, rather than MQGMO_WAIT.
2220 (X'08AC')MQRC_RMH_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQRMH structure that is not valid. Possible errors include the following: 

The StrucId field is not MQRMH_STRUC_ID. 
The Version field is not MQRMH_VERSION_1. 
The StrucLength field specifies a value that is too small to include the structure plus the variable-length data at the end of the structure. 
The CodedCharSetId field is zero, or a negative value that is not valid. 
The BufferLength parameter of the call has a value that is too small to accommodate the structure (the structure extends beyond the end of the message).
This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly. Ensure that the application sets the CodedCharSetId field to a valid value (note: MQCCSI_DEFAULT, MQCCSI_EMBEDDED, MQCCSI_Q_MGR, and MQCCSI_UNDEFINED are not valid in this field).

2222 (X'08AE')MQRC_Q_MGR_ACTIVE
Explanation:
This condition is detected when a queue manager becomes active. 

On z/OS, this event is not generated for the first start of a queue manager, only on subsequent restarts.
Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2223 (X'08AF')MQRC_Q_MGR_NOT_ACTIVE
Explanation:
This condition is detected when a queue manager is requested to stop or quiesce.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2224 (X'08B0')MQRC_Q_DEPTH_HIGH
Explanation:
An MQPUT or MQPUT1 call has caused the queue depth to be incremented to or above the limit specified in the QDepthHighLimit attribute.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2225 (X'08B1')MQRC_Q_DEPTH_LOW
Explanation:
An MQGET call has caused the queue depth to be decremented to or below the limit specified in the QDepthLowLimit attribute.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2226 (X'08B2')MQRC_Q_SERVICE_INTERVAL_HIGH
Explanation:
No successful gets or puts have been detected within an interval that is greater than the limit specified in the QServiceInterval attribute.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2227 (X'08B3')MQRC_Q_SERVICE_INTERVAL_OK
Explanation:
A successful get has been detected within an interval that is less than or equal to the limit specified in the QServiceInterval attribute.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2228 (X'08B4')MQRC_RFH_HEADER_FIELD_ERROR
Explanation:
An expected RFH header field was not found or had an invalid value. If this error occurs in a WebSphere MQ SOAP listener, the missing or erroneous field is either the contentType field or the transportVersion field or both.

Completion Code:
MQCC_FAILED

Programmer Response:
If this error occurs in a WebSphere MQ SOAP listener, and you are using the IBM-supplied sender, contact your IBM Support Center. If you are using a bespoke sender, check the associated error message, and that the RFH2 section of the SOAP/MQ request message contains all the mandatory fields, and that these fields have valid values.

2229 (X'08B5')MQRC_RAS_PROPERTY_ERROR
Explanation:
There is an error related to the RAS property file. The file may be missing, it may be not accessible, or the commands in the file may be incorrect.

Completion Code:
MQCC_FAILED

Programmer Response:
Look at the associated error message, which will explain the error in detail. Correct the error and retry.

2232 (X'08B8')MQRC_UNIT_OF_WORK_NOT_STARTED
Explanation:
An MQGET, MQPUT or MQPUT1 call was issued to get or put a message within a unit of work, but no TM/MP transaction had been started. If MQGMO_NO_SYNCPOINT is not specified on MQGET, or MQPMO_NO_SYNCPOINT is not specified on MQPUT or MQPUT1 (the default), the call requires a unit of work.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure a TM/MP transaction is available, or issue the MQGET call with the MQGMO_NO_SYNCPOINT option, or the MQPUT or MQPUT1 call with the MQPMO_NO_SYNCPOINT option, which will cause a transaction to be started automatically.

2233 (X'08B9')MQRC_CHANNEL_AUTO_DEF_OK
Explanation:
This condition is detected when the automatic definition of a channel is successful. The channel is defined by the MCA.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2234 (X'08BA')MQRC_CHANNEL_AUTO_DEF_ERROR
Explanation:
This condition is detected when the automatic definition of a channel fails; this may be because an error occurred during the definition process, or because the channel automatic-definition exit inhibited the definition. Additional information is returned in the event message indicating the reason for the failure.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_WARNING

Programmer Response:
Examine the additional information returned in the event message to determine the reason for the failure.

2235 (X'08BB')MQRC_CFH_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQCFH structure that is not valid.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly.

2236 (X'08BC')MQRC_CFIL_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQCFIL or MQCFIL64 structure that is not valid.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly.

2237 (X'08BD')MQRC_CFIN_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQCFIN or MQCFIN64 structure that is not valid.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly.

2238 (X'08BE')MQRC_CFSL_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQCFSL structure that is not valid.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly.

2239 (X'08BF')MQRC_CFST_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQCFST structure that is not valid.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly.

2241 (X'08C1')MQRC_INCOMPLETE_GROUP
Explanation:
An operation was attempted on a queue using a queue handle that had an incomplete message group. This reason code can arise in the following situations: 

On the MQPUT call, when the application specifies MQPMO_LOGICAL_ORDER and attempts to put a message that is not in a group. The completion code is MQCC_FAILED in this case. 
On the MQPUT call, when the application does not specify MQPMO_LOGICAL_ORDER, but the previous MQPUT call for the queue handle did specify MQPMO_LOGICAL_ORDER. The completion code is MQCC_WARNING in this case. 
On the MQGET call, when the application does not specify MQGMO_LOGICAL_ORDER, but the previous MQGET call for the queue handle did specify MQGMO_LOGICAL_ORDER. The completion code is MQCC_WARNING in this case. 
On the MQCLOSE call, when the application attempts to close the queue that has the incomplete message group. The completion code is MQCC_WARNING in this case.
If there is an incomplete logical message as well as an incomplete message group, reason code MQRC_INCOMPLETE_MSG is returned in preference to MQRC_INCOMPLETE_GROUP.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
If this reason code is expected, no corrective action is required. Otherwise, ensure that the MQPUT call for the last message in the group specifies MQMF_LAST_MSG_IN_GROUP.

2242 (X'08C2')MQRC_INCOMPLETE_MSG
Explanation:
An operation was attempted on a queue using a queue handle that had an incomplete logical message. This reason code can arise in the following situations: 

On the MQPUT call, when the application specifies MQPMO_LOGICAL_ORDER and attempts to put a message that is not a segment, or that has a setting for the MQMF_LAST_MSG_IN_GROUP flag that is different from the previous message. The completion code is MQCC_FAILED in this case. 
On the MQPUT call, when the application does not specify MQPMO_LOGICAL_ORDER, but the previous MQPUT call for the queue handle did specify MQPMO_LOGICAL_ORDER. The completion code is MQCC_WARNING in this case. 
On the MQGET call, when the application does not specify MQGMO_LOGICAL_ORDER, but the previous MQGET call for the queue handle did specify MQGMO_LOGICAL_ORDER. The completion code is MQCC_WARNING in this case. 
On the MQCLOSE call, when the application attempts to close the queue that has the incomplete logical message. The completion code is MQCC_WARNING in this case.
This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
If this reason code is expected, no corrective action is required. Otherwise, ensure that the MQPUT call for the last segment specifies MQMF_LAST_SEGMENT.

2243 (X'08C3')MQRC_INCONSISTENT_CCSIDS
Explanation:
An MQGET call was issued specifying the MQGMO_COMPLETE_MSG option, but the message to be retrieved consists of two or more segments that have differing values for the CodedCharSetId field in MQMD. This can arise when the segments take different paths through the network, and some of those paths have MCA sender conversion enabled. The call succeeds with a completion code of MQCC_WARNING, but only the first few segments that have identical character-set identifiers are returned.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_WARNING

Programmer Response:
Remove the MQGMO_COMPLETE_MSG option from the MQGET call and retrieve the remaining message segments one by one.

2244 (X'08C4')MQRC_INCONSISTENT_ENCODINGS
Explanation:
An MQGET call was issued specifying the MQGMO_COMPLETE_MSG option, but the message to be retrieved consists of two or more segments that have differing values for the Encoding field in MQMD. This can arise when the segments take different paths through the network, and some of those paths have MCA sender conversion enabled. The call succeeds with a completion code of MQCC_WARNING, but only the first few segments that have identical encodings are returned.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_WARNING

Programmer Response:
Remove the MQGMO_COMPLETE_MSG option from the MQGET call and retrieve the remaining message segments one by one.

2245 (X'08C5')MQRC_INCONSISTENT_UOW
Explanation:
One of the following applies: 

An MQPUT call was issued to put a message in a group or a segment of a logical message, but the value specified or defaulted for the MQPMO_SYNCPOINT option is not consistent with the current group and segment information retained by the queue manager for the queue handle. 
If the current call specifies MQPMO_LOGICAL_ORDER, the call fails. If the current call does not specify MQPMO_LOGICAL_ORDER, but the previous MQPUT call for the queue handle did, the call succeeds with completion code MQCC_WARNING.

An MQGET call was issued to remove from the queue a message in a group or a segment of a logical message, but the value specified or defaulted for the MQGMO_SYNCPOINT option is not consistent with the current group and segment information retained by the queue manager for the queue handle. 
If the current call specifies MQGMO_LOGICAL_ORDER, the call fails. If the current call does not specify MQGMO_LOGICAL_ORDER, but the previous MQGET call for the queue handle did, the call succeeds with completion code MQCC_WARNING.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_WARNING or MQCC_FAILED

Programmer Response:
Modify the application to ensure that the same unit-of-work specification is used for all messages in the group, or all segments of the logical message.

2246 (X'08C6')MQRC_INVALID_MSG_UNDER_CURSOR
Explanation:
An MQGET call was issued specifying the MQGMO_COMPLETE_MSG option with either MQGMO_MSG_UNDER_CURSOR or MQGMO_BROWSE_MSG_UNDER_CURSOR, but the message that is under the cursor has an MQMD with an Offset field that is greater than zero. Because MQGMO_COMPLETE_MSG was specified, the message is not valid for retrieval.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Reposition the browse cursor so that it is located on a message whose Offset field in MQMD is zero. Alternatively, remove the MQGMO_COMPLETE_MSG option.

2247 (X'08C7')MQRC_MATCH_OPTIONS_ERROR
Explanation:
An MQGET call was issued, but the value of the MatchOptions field in the GetMsgOpts parameter is not valid, for one of the following reasons: 

An undefined option is specified. 
All of the following are true: 
MQGMO_LOGICAL_ORDER is specified. 
There is a current message group or logical message for the queue handle. 
Neither MQGMO_BROWSE_MSG_UNDER_CURSOR nor MQGMO_MSG_UNDER_CURSOR is specified. 
One or more of the MQMO_* options is specified. 
The values of the fields in the MsgDesc parameter corresponding to the MQMO_* options specified, differ from the values of those fields in the MQMD for the message to be returned next.
On z/OS, one or more of the options specified is not valid for the index type of the queue.
This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that only valid options are specified for the field.

2248 (X'08C8')MQRC_MDE_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQMDE structure that is not valid. Possible errors include the following: 

The StrucId field is not MQMDE_STRUC_ID. 
The Version field is not MQMDE_VERSION_2. 
The StrucLength field is not MQMDE_LENGTH_2. 
The CodedCharSetId field is zero, or a negative value that is not valid. 
The BufferLength parameter of the call has a value that is too small to accommodate the structure (the structure extends beyond the end of the message).
This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly. Ensure that the application sets the CodedCharSetId field to a valid value (note: MQCCSI_DEFAULT, MQCCSI_EMBEDDED, MQCCSI_Q_MGR, and MQCCSI_UNDEFINED are not valid in this field).

2249 (X'08C9')MQRC_MSG_FLAGS_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the MsgFlags field in the message descriptor MQMD contains one or more message flags that are not recognized by the local queue manager. The message flags that cause this reason code to be returned depend on the destination of the message; see the description of REPORT in the WebSphere MQ Application Programming Guide for more details.

This reason code can also occur in the Feedback field in the MQMD of a report message, or in the Reason field in the MQDLH structure of a message on the dead-letter queue; in both cases it indicates that the destination queue manager does not support one or more of the message flags specified by the sender of the message.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Do the following: 

Ensure that the MsgFlags field in the message descriptor is initialized with a value when the message descriptor is declared, or is assigned a value prior to the MQPUT or MQPUT1 call. Specify MQMF_NONE if no message flags are needed. 
Ensure that the message flags specified are valid; see the MsgFlags field described in the description of MQMD in the WebSphere MQ Application Programming Guide for valid message flags. 
If multiple message flags are being set by adding the individual message flags together, ensure that the same message flag is not added twice. 
On z/OS, ensure that the message flags specified are valid for the index type of the queue; see the description of the MsgFlags field in MQMD for further details.
2250 (X'08CA')MQRC_MSG_SEQ_NUMBER_ERROR
Explanation:
An MQGET, MQPUT, or MQPUT1 call was issued, but the value of the MsgSeqNumber field in the MQMD or MQMDE structure is less than one or greater than 999 999 999.

This error can also occur on the MQPUT call if the MsgSeqNumber field would have become greater than 999 999 999 as a result of the call.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a value in the range 1 through 999 999 999. Do not attempt to create a message group containing more than 999 999 999 messages.

2251 (X'08CB')MQRC_OFFSET_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the value of the Offset field in the MQMD or MQMDE structure is less than zero or greater than 999 999 999.

This error can also occur on the MQPUT call if the Offset field would have become greater than 999 999 999 as a result of the call.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a value in the range 0 through 999 999 999. Do not attempt to create a message segment that would extend beyond an offset of 999 999 999.

2252 (X'08CC')MQRC_ORIGINAL_LENGTH_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued to put a report message that is a segment, but the OriginalLength field in the MQMD or MQMDE structure is either: 

Less than the length of data in the message, or 
Less than one (for a segment that is not the last segment), or 
Less than zero (for a segment that is the last segment)
This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a value that is greater than zero. Zero is valid only for the last segment.

2253 (X'08CD')MQRC_SEGMENT_LENGTH_ZERO
Explanation:
An MQPUT or MQPUT1 call was issued to put the first or an intermediate segment of a logical message, but the length of the application message data in the segment (excluding any MQ headers that may be present) is zero. The length must be at least one for the first or intermediate segment.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the application logic to ensure that segments are put with a length of one or greater. Only the last segment of a logical message is permitted to have a length of zero.

2255 (X'08CF')MQRC_UOW_NOT_AVAILABLE
Explanation:
An MQGET, MQPUT, or MQPUT1 call was issued to get or put a message outside a unit of work, but the options specified on the call required the queue manager to process the call within a unit of work. Because there is already a user-defined unit of work in existence, the queue manager was unable to create a temporary unit of work for the duration of the call.

This reason occurs in the following circumstances: 

On an MQGET call, when the MQGMO_COMPLETE_MSG option is specified in MQGMO and the logical message to be retrieved is persistent and consists of two or more segments. 
On an MQPUT or MQPUT1 call, when the MQMF_SEGMENTATION_ALLOWED flag is specified in MQMD and the message requires segmentation.
This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Issue the MQGET, MQPUT, or MQPUT1 call inside the user-defined unit of work. Alternatively, for the MQPUT or MQPUT1 call, reduce the size of the message so that it does not require segmentation by the queue manager.

2256 (X'08D0')MQRC_WRONG_GMO_VERSION
Explanation:
An MQGET call was issued specifying options that required an MQGMO with a version number not less than MQGMO_VERSION_2, but the MQGMO supplied did not satisfy this condition.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Modify the application to pass a version-2 MQGMO. Check the application logic to ensure that the Version field in MQGMO has been set to MQGMO_VERSION_2. Alternatively, remove the option that requires the version-2 MQGMO.

2257 (X'08D1')MQRC_WRONG_MD_VERSION
Explanation:
An MQGET, MQPUT, or MQPUT1 call was issued specifying options that required an MQMD with a version number not less than MQMD_VERSION_2, but the MQMD supplied did not satisfy this condition.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Modify the application to pass a version-2 MQMD. Check the application logic to ensure that the Version field in MQMD has been set to MQMD_VERSION_2. Alternatively, remove the option that requires the version-2 MQMD.

2258 (X'08D2')MQRC_GROUP_ID_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued to put a distribution-list message that is also a message in a group, a message segment, or has segmentation allowed, but an invalid combination of options and values was specified. All of the following are true: 

MQPMO_LOGICAL_ORDER is not specified in the Options field in MQPMO. 
Either there are too few MQPMR records provided by MQPMO, or the GroupId field is not present in the MQPMR records. 
One or more of the following flags is specified in the MsgFlags field in MQMD or MQMDE: 
MQMF_SEGMENTATION_ALLOWED 
MQMF_*_MSG_IN_GROUP 
MQMF_*_SEGMENT
The GroupId field in MQMD or MQMDE is not MQGI_NONE.
This combination of options and values would result in the same group identifier being used for all of the destinations in the distribution list; this is not permitted by the queue manager.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify MQGI_NONE for the GroupId field in MQMD or MQMDE. Alternatively, if the call is MQPUT specify MQPMO_LOGICAL_ORDER in the Options field in MQPMO.

2259 (X'08D3')MQRC_INCONSISTENT_BROWSE
Explanation:
An MQGET call was issued with the MQGMO_BROWSE_NEXT option specified, but the specification of the MQGMO_LOGICAL_ORDER option for the call is different from the specification of that option for the previous call for the queue handle. Either both calls must specify MQGMO_LOGICAL_ORDER, or neither call must specify MQGMO_LOGICAL_ORDER.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Add or remove the MQGMO_LOGICAL_ORDER option as appropriate. Alternatively, to switch between logical order and physical order, specify the MQGMO_BROWSE_FIRST option to restart the scan from the beginning of the queue, omitting or specifying MQGMO_LOGICAL_ORDER as required.

2260 (X'08D4')MQRC_XQH_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQXQH structure that is not valid. Possible errors include the following: 

The StrucId field is not MQXQH_STRUC_ID. 
The Version field is not MQXQH_VERSION_1. 
The BufferLength parameter of the call has a value that is too small to accommodate the structure (the structure extends beyond the end of the message).
This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly.

2261 (X'08D5')MQRC_SRC_ENV_ERROR
Explanation:
This reason occurs when a channel exit that processes reference messages detects an error in the source environment data of a reference message header (MQRMH). One of the following is true: 

SrcEnvLength is less than zero. 
SrcEnvLength is greater than zero, but there is no source environment data. 
SrcEnvLength is greater than zero, but SrcEnvOffset is negative, zero, or less than the length of the fixed part of MQRMH. 
SrcEnvLength is greater than zero, but SrcEnvOffset plus SrcEnvLength is greater than StrucLength.
The exit returns this reason in the Feedback field of the MQCXP structure. If an exception report is requested, it is copied to the Feedback field of the MQMD associated with the report.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify the source environment data correctly.

2262 (X'08D6')MQRC_SRC_NAME_ERROR
Explanation:
This reason occurs when a channel exit that processes reference messages detects an error in the source name data of a reference message header (MQRMH). One of the following is true: 

SrcNameLength is less than zero. 
SrcNameLength is greater than zero, but there is no source name data. 
SrcNameLength is greater than zero, but SrcNameOffset is negative, zero, or less than the length of the fixed part of MQRMH. 
SrcNameLength is greater than zero, but SrcNameOffset plus SrcNameLength is greater than StrucLength.
The exit returns this reason in the Feedback field of the MQCXP structure. If an exception report is requested, it is copied to the Feedback field of the MQMD associated with the report.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify the source name data correctly.

2263 (X'08D7')MQRC_DEST_ENV_ERROR
Explanation:
This reason occurs when a channel exit that processes reference messages detects an error in the destination environment data of a reference message header (MQRMH). One of the following is true: 

DestEnvLength is less than zero. 
DestEnvLength is greater than zero, but there is no destination environment data. 
DestEnvLength is greater than zero, but DestEnvOffset is negative, zero, or less than the length of the fixed part of MQRMH. 
DestEnvLength is greater than zero, but DestEnvOffset plus DestEnvLength is greater than StrucLength.
The exit returns this reason in the Feedback field of the MQCXP structure. If an exception report is requested, it is copied to the Feedback field of the MQMD associated with the report.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify the destination environment data correctly.

2264 (X'08D8')MQRC_DEST_NAME_ERROR
Explanation:
This reason occurs when a channel exit that processes reference messages detects an error in the destination name data of a reference message header (MQRMH). One of the following is true: 

DestNameLength is less than zero. 
DestNameLength is greater than zero, but there is no destination name data. 
DestNameLength is greater than zero, but DestNameOffset is negative, zero, or less than the length of the fixed part of MQRMH. 
DestNameLength is greater than zero, but DestNameOffset plus DestNameLength is greater than StrucLength.
The exit returns this reason in the Feedback field of the MQCXP structure. If an exception report is requested, it is copied to the Feedback field of the MQMD associated with the report.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify the destination name data correctly.

2265 (X'08D9')MQRC_TM_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQTM structure that is not valid. Possible errors include the following: 

The StrucId field is not MQTM_STRUC_ID. 
The Version field is not MQTM_VERSION_1. 
The BufferLength parameter of the call has a value that is too small to accommodate the structure (the structure extends beyond the end of the message).
This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly.

2266 (X'08DA')MQRC_CLUSTER_EXIT_ERROR
Explanation:
An MQOPEN, MQPUT, or MQPUT1 call was issued to open or put a message on a cluster queue, but the cluster workload exit defined by the queue-manager's ClusterWorkloadExit attribute failed unexpectedly or did not respond in time. Subsequent MQOPEN, MQPUT, and MQPUT1 calls for this queue handle are processed as though the ClusterWorkloadExit attribute were blank. 

On z/OS, a message giving more information about the error is written to the system log, for example message CSQV455E or CSQV456E.
This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the cluster workload exit to ensure that it has been written correctly.

2267 (X'08DB')MQRC_CLUSTER_EXIT_LOAD_ERROR
Explanation:
An MQCONN or MQCONNX call was issued to connect to a queue manager, but the queue manager was unable to load the cluster workload exit. Execution continues without the cluster workload exit. 

On z/OS, if the cluster workload exit cannot be loaded, a message is written to the system log, for example message CSQV453I. Processing continues as though the ClusterWorkloadExit attribute had been blank.
This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_WARNING

Programmer Response:
Ensure that the queue-manager's ClusterWorkloadExit attribute has the correct value, and that the exit has been installed into the correct location.

2268 (X'08DC')MQRC_CLUSTER_PUT_INHIBITED
Explanation:
An MQOPEN call with the MQOO_OUTPUT and MQOO_BIND_ON_OPEN options in effect was issued for a cluster queue, but the call failed because all of the following are true: 

All instances of the cluster queue are currently put-inhibited (that is, all of the queue instances have the InhibitPut attribute set to MQQA_PUT_INHIBITED). 
There is no local instance of the queue. (If there is a local instance, the MQOPEN call succeeds, even if the local instance is put-inhibited.) 
There is no cluster workload exit for the queue, or there is a cluster workload exit but it did not choose a queue instance. (If the cluster workload exit does choose a queue instance, the MQOPEN call succeeds, even if that instance is put-inhibited.)
If the MQOO_BIND_NOT_FIXED option is specified on the MQOPEN call, the call can succeed even if all of the queues in the cluster are put-inhibited. However, a subsequent MQPUT call may fail if all of the queues are still put-inhibited at the time of the MQPUT call.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
If the system design allows put requests to be inhibited for short periods, retry the operation later. If the problem persists, determine why all of the queues in the cluster are put-inhibited.

2269 (X'08DD')MQRC_CLUSTER_RESOURCE_ERROR
Explanation:
An MQOPEN, MQPUT, or MQPUT1 call was issued for a cluster queue, but an error occurred whilst trying to use a resource required for clustering.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Do the following: 

Check that the SYSTEM.CLUSTER.* queues are not put inhibited or full. 
Check the event queues for any events relating to the SYSTEM.CLUSTER.* queues, as these may give guidance as to the nature of the failure. 
Check that the repository queue manager is available. 
On z/OS, check the console for signs of the failure, such as full page sets.
2270 (X'08DE')MQRC_NO_DESTINATIONS_AVAILABLE
Explanation:
An MQPUT or MQPUT1 call was issued to put a message on a cluster queue, but at the time of the call there were no longer any instances of the queue in the cluster. The message therefore could not be sent.

This situation can occur when MQOO_BIND_NOT_FIXED is specified on the MQOPEN call that opens the queue, or MQPUT1 is used to put the message.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the queue definition and queue status to determine why all instances of the queue were removed from the cluster. Correct the problem and rerun the application.

2271 (X'08DF')MQRC_CONN_TAG_IN_USE
Explanation:
An MQCONNX call was issued specifying one of the MQCNO_*_CONN_TAG_* options, but the call failed because the connection tag specified by ConnTag in MQCNO is in use by an active process or thread, or there is an unresolved unit of work that references this connection tag.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
The problem is likely to be transitory. The application should wait a short while and then retry the operation.

2272 (X'08E0')MQRC_PARTIALLY_CONVERTED
Explanation:
On an MQGET call with the MQGMO_CONVERT option included in the GetMsgOpts parameter, one or more MQ header structures in the message data could not be converted to the specified target character set or encoding. In this situation, the MQ header structures are converted to the queue-manager's character set and encoding, and the application data in the message is converted to the target character set and encoding. On return from the call, the values returned in the various CodedCharSetId and Encoding fields in the MsgDesc parameter and MQ header structures indicate the character set and encoding that apply to each part of the message. The call completes with MQCC_WARNING.

This reason code usually occurs when the specified target character set is one that causes the character strings in the MQ header structures to expand beyond the lengths of their fields. Unicode character set UCS-2 is an example of a character set that causes this to happen.

Completion Code:
MQCC_FAILED

Programmer Response:
If this is an expected situation, no corrective action is required.

If this is an unexpected situation, check that the MQ header structures contain valid data. If they do, specify as the target character set a character set that does not cause the strings to expand.

2273 (X'08E1')MQRC_CONNECTION_ERROR
Explanation:
An MQCONN or MQCONNX call failed for one of the following reasons: 

The installation and customization options chosen for WebSphere MQ do not allow connection by the type of application being used. 
The system parameter module is not at the same release level as the queue manager. 
The channel initiator is not at the same release level as the queue manager. 
An internal error was detected by the queue manager.
This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
None, if the installation and customization options chosen for WebSphere MQ do not allow all functions to be used.

Otherwise, if this occurs while starting the channel initiator, ensure that the queue manager and the channel initiator are both at the same release level and that their started task JCL procedures both specify the same level of WebSphere MQ program libraries; if this occurs while starting the queue manager, relinkedit the system parameter module (CSQZPARM) to ensure that it is at the correct level. If the problem persists, contact your IBM support center.

2274 (X'08E2')MQRC_OPTION_ENVIRONMENT_ERROR
Explanation:
An MQGET call with the MQGMO_MARK_SKIP_BACKOUT option specified was issued from a DB2 Stored Procedure. The call failed because the MQGMO_MARK_SKIP_BACKOUT option cannot be used from a DB2 Stored Procedure.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Remove the MQGMO_MARK_SKIP_BACKOUT option from the MQGET call.

2277 (X'08E5')MQRC_CD_ERROR
Explanation:
An MQCONNX call was issued to connect to a queue manager, but the MQCD channel definition structure addressed by the ClientConnOffset or ClientConnPtr field in MQCNO contains data that is not valid. Consult the error log for more information about the nature of the error.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that input fields in the MQCD structure are set correctly.

2278 (X'08E6')MQRC_CLIENT_CONN_ERROR
Explanation:
An MQCONNX call was issued to connect to a queue manager, but the MQCD channel definition structure is not specified correctly. One of the following applies: 

ClientConnOffset is not zero and ClientConnPtr is not zero and not the null pointer. 
ClientConnPtr is not a valid pointer. 
ClientConnPtr or ClientConnOffset points to storage that is not accessible.
This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems. It also occurs in Java applications when a client channel definition table is specified to determine the name of the channel, but the table itself cannot be found.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that at least one of ClientConnOffset and ClientConnPtr is zero. Ensure that the field used points to accessible storage. Ensure that the URL of the client channel definition table is correct.

2279 (X'08E7')MQRC_CHANNEL_STOPPED_BY_USER
Explanation:
This condition is detected when the channel has been stopped by an operator. The reason qualifier identifies the reasons for stopping.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2280 (X'08E8')MQRC_HCONFIG_ERROR
Explanation:
The configuration handle Hconfig specified on the MQXEP call or MQZEP call is not valid. The MQXEP call is issued by an API exit function; the MQZEP call is issued by an installable service. 

On z/OS, this reason code does not occur.
Completion Code:
MQCC_FAILED

Programmer Response:
Specify the configuration handle that was provided by the queue manager: 

On the MQXEP call, use the handle passed in the Hconfig field of the MQAXP structure. 
On the MQZEP call, use the handle passed to the installable service's configuration function on the component initialization call. See the WebSphere MQ System Administration Guide book for information about installable services.
2281 (X'08E9')MQRC_FUNCTION_ERROR
Explanation:
An MQXEP or MQZEP call was issued, but the function identifier Function specified on the call is not valid, or not supported by the installable service being configured. 

On z/OS, this reason code does not occur.
Completion Code:
MQCC_FAILED

Programmer Response:
Do the following: 

For the MQXEP call, specify one of the MQXF_* values. 
For the MQZEP call, specify an MQZID_* value that is valid for the installable service being configured. Refer to the description of the MQZEP call in the WebSphere MQ System Administration Guide book to determine which values are valid.
2282 (X'08EA')MQRC_CHANNEL_STARTED
Explanation:
One of the following has occurred: 

An operator has issued a Start Channel command. 
An instance of a channel has been successfully established. This condition is detected when Initial Data negotiation is complete and resynchronization has been performed where necessary such that message transfer can proceed.
Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2283 (X'08EB')MQRC_CHANNEL_STOPPED
Explanation:
This condition is detected when the channel has been stopped. The reason qualifier identifies the reasons for stopping.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2284 (X'08EC')MQRC_CHANNEL_CONV_ERROR
Explanation:
This condition is detected when a channel is unable to do data conversion and the MQGET call to get a message from the transmission queue resulted in a data conversion error. The conversion reason code identifies the reason for the failure.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2285 (X'08ED')MQRC_SERVICE_NOT_AVAILABLE
Explanation:
This reason should be returned by an installable service component when the requested action cannot be performed because the required underlying service is not available. 

On z/OS, this reason code does not occur.
Completion Code:
MQCC_FAILED

Programmer Response:
Make the underlying service available.

2286 (X'08EE')MQRC_INITIALIZATION_FAILED
Explanation:
This reason should be returned by an installable service component when the component is unable to complete initialization successfully. 

On z/OS, this reason code does not occur.
Completion Code:
MQCC_FAILED

Programmer Response:
Correct the error and retry the operation.

2287 (X'08EF')MQRC_TERMINATION_FAILED
Explanation:
This reason should be returned by an installable service component when the component is unable to complete termination successfully. 

On z/OS, this reason code does not occur.
Completion Code:
MQCC_FAILED

Programmer Response:
Correct the error and retry the operation.

2288 (X'08F0')MQRC_UNKNOWN_Q_NAME
Explanation:
This reason should be returned by the MQZ_LOOKUP_NAME installable service component when the name specified for the QName parameter is not recognized. 

On z/OS, this reason code does not occur.
Completion Code:
MQCC_FAILED

Programmer Response:
None. See the WebSphere MQ System Administration Guide book for information about installable services.

2289 (X'08F1')MQRC_SERVICE_ERROR
Explanation:
This reason should be returned by an installable service component when the component encounters an unexpected error. 

On z/OS, this reason code does not occur.
Completion Code:
MQCC_FAILED

Programmer Response:
Correct the error and retry the operation.

2290 (X'08F2')MQRC_Q_ALREADY_EXISTS
Explanation:
This reason should be returned by the MQZ_INSERT_NAME installable service component when the queue specified by the QName parameter is already defined to the name service. 

On z/OS, this reason code does not occur.
Completion Code:
MQCC_FAILED

Programmer Response:
None. See the WebSphere MQ System Administration Guide book for information about installable service.

2291 (X'08F3')MQRC_USER_ID_NOT_AVAILABLE
Explanation:
This reason should be returned by the MQZ_FIND_USERID installable service component when the user ID cannot be determined. 

On z/OS, this reason code does not occur.
Completion Code:
MQCC_FAILED

Programmer Response:
None. See the WebSphere MQ System Administration Guide book for information about installable services.

2292 (X'08F4')MQRC_UNKNOWN_ENTITY
Explanation:
This reason should be returned by the authority installable service component when the name specified by the EntityName parameter is not recognized. 

On z/OS, this reason code does not occur.
Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the entity is defined.

2294 (X'08F6')MQRC_UNKNOWN_REF_OBJECT
Explanation:
This reason should be returned by the MQZ_COPY_ALL_AUTHORITY installable service component when the name specified by the RefObjectName parameter is not recognized. 

On z/OS, this reason code does not occur.
Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the reference object is defined. See the WebSphere MQ System Administration Guide book for information about installable services.

2295 (X'08F7')MQRC_CHANNEL_ACTIVATED
Explanation:
This condition is detected when a channel that has been waiting to become active, and for which a Channel Not Activated event has been generated, is now able to become active because an active slot has been released by another channel.

This event is not generated for a channel that is able to become active without waiting for an active slot to be released.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2296 (X'08F8')MQRC_CHANNEL_NOT_ACTIVATED
Explanation:
This condition is detected when a channel is required to become active, either because it is starting or because it is about to make another attempt to establish connection with its partner. However, it is unable to do so because the limit on the number of active channels has been reached. 

On z/OS, the maximum number of active channels is given by the ACTCHL queue manager attribute. 
In other environments, the maximum number of active channels is given by the MaxActiveChannels parameter in the qm.ini file.
The channel waits until it is able to take over an active slot released when another channel ceases to be active. At that time a Channel Activated event is generated.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2297 (X'08F9')MQRC_UOW_CANCELED
Explanation:
An MQI call was issued, but the unit of work (TM/MP transaction) being used for the MQ operation had been canceled. This may have been done by TM/MP itself (for example, due to the transaction running for too long, or exceeding audit trail sizes), or by the application program issuing an ABORT_TRANSACTION. All updates performed to resources owned by the queue manager are backed out.

Completion Code:
MQCC_FAILED

Programmer Response:
Refer to the operating system's Transaction Management Operations Guide to determine how the Transaction Manager can be tuned to avoid the problem of system limits being exceeded.

2298 (X'08FA')MQRC_FUNCTION_NOT_SUPPORTED
Explanation:
The function requested is not available in the current environment.

Completion Code:
MQCC_FAILED

Programmer Response:
Remove the call from the application.

2299 (X'08FB')MQRC_SELECTOR_TYPE_ERROR
Explanation:
The Selector parameter has the wrong data type; it must be of type Long.

Completion Code:
MQCC_FAILED

Programmer Response:
Declare the Selector parameter as Long.

2300 (X'08FC')MQRC_COMMAND_TYPE_ERROR
Explanation:
The mqExecute call was issued, but the value of the MQIASY_TYPE data item in the administration bag is not MQCFT_COMMAND.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the MQIASY_TYPE data item in the administration bag has the value MQCFT_COMMAND.

2301 (X'08FD')MQRC_MULTIPLE_INSTANCE_ERROR
Explanation:
The Selector parameter specifies a system selector (one of the MQIASY_* values), but the value of the ItemIndex parameter is not MQIND_NONE. Only one instance of each system selector can exist in the bag.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify MQIND_NONE for the ItemIndex parameter.

2302 (X'08FE')MQRC_SYSTEM_ITEM_NOT_ALTERABLE
Explanation:
A call was issued to modify the value of a system data item in a bag (a data item with one of the MQIASY_* selectors), but the call failed because the data item is one that cannot be altered by the application.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify the selector of a user-defined data item, or remove the call.

2303 (X'08FF')MQRC_BAG_CONVERSION_ERROR
Explanation:
The mqBufferToBag or mqGetBag call was issued, but the data in the buffer or message could not be converted into a bag. This occurs when the data to be converted is not valid PCF.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the logic of the application that created the buffer or message to ensure that the buffer or message contains valid PCF.

If the message contains PCF that is not valid, the message cannot be retrieved using the mqGetBag call: 

If one of the MQGMO_BROWSE_* options was specified, the message remains on the queue and can be retrieved using the MQGET call. 
In other cases, the message has already been removed from the queue and discarded. If the message was retrieved within a unit of work, the unit of work can be backed out and the message retrieved using the MQGET call.
2304 (X'0900')MQRC_SELECTOR_OUT_OF_RANGE
Explanation:
The Selector parameter has a value that is outside the valid range for the call. If the bag was created with the MQCBO_CHECK_SELECTORS option: 

For the mqAddInteger call, the value must be within the range MQIA_FIRST through MQIA_LAST. 
For the mqAddString call, the value must be within the range MQCA_FIRST through MQCA_LAST.
If the bag was not created with the MQCBO_CHECK_SELECTORS option: 

The value must be zero or greater.
Completion Code:
MQCC_FAILED

Programmer Response:
Specify a valid value.

2305 (X'0901')MQRC_SELECTOR_NOT_UNIQUE
Explanation:
The ItemIndex parameter has the value MQIND_NONE, but the bag contains more than one data item with the selector value specified by the Selector parameter. MQIND_NONE requires that the bag contain only one occurrence of the specified selector.

This reason code also occurs on the mqExecute call when the administration bag contains two or more occurrences of a selector for a required parameter that permits only one occurrence.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the logic of the application that created the bag. If correct, specify for ItemIndex a value that is zero or greater, and add application logic to process all of the occurrences of the selector in the bag.

Review the description of the administration command being issued, and ensure that all required parameters are defined correctly in the bag.

2306 (X'0902')MQRC_INDEX_NOT_PRESENT
Explanation:
The specified index is not present: 

For a bag, this means that the bag contains one or more data items that have the selector value specified by the Selector parameter, but none of them has the index value specified by the ItemIndex parameter. The data item identified by the Selector and ItemIndex parameters must exist in the bag. 
For a namelist, this means that the index parameter value is too large, and outside the range of valid values.
Completion Code:
MQCC_FAILED

Programmer Response:
Specify the index of a data item that does exist in the bag or namelist. Use the mqCountItems call to determine the number of data items with the specified selector that exist in the bag, or the nameCount method to determine the number of names in the namelist.

2307 (X'0903')MQRC_STRING_ERROR
Explanation:
The String parameter is not valid. Either the parameter pointer is not valid, or it points to read-only storage. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the parameter.

2308 (X'0904')MQRC_ENCODING_NOT_SUPPORTED
Explanation:
The Encoding field in the message descriptor MQMD contains a value that is not supported: 

For the mqPutBag call, the field in error resides in the MsgDesc parameter of the call. 
For the mqGetBag call, the field in error resides in: 
The MsgDesc parameter of the call if the MQGMO_CONVERT option was specified. 
The message descriptor of the message about to be retrieved if MQGMO_CONVERT was not specified.
Completion Code:
MQCC_FAILED

Programmer Response:
The value must be MQENC_NATIVE.

If the value of the Encoding field in the message is not valid, the message cannot be retrieved using the mqGetBag call: 

If one of the MQGMO_BROWSE_* options was specified, the message remains on the queue and can be retrieved using the MQGET call. 
In other cases, the message has already been removed from the queue and discarded. If the message was retrieved within a unit of work, the unit of work can be backed out and the message retrieved using the MQGET call.
2309 (X'0905')MQRC_SELECTOR_NOT_PRESENT
Explanation:
The Selector parameter specifies a selector that does not exist in the bag.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a selector that does exist in the bag.

2310 (X'0906')MQRC_OUT_SELECTOR_ERROR
Explanation:
The OutSelector parameter is not valid. Either the parameter pointer is not valid, or it points to read-only storage. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the parameter.

2311 (X'0907')MQRC_STRING_TRUNCATED
Explanation:
The string returned by the call is too long to fit in the buffer provided. The string has been truncated to fit in the buffer.

Completion Code:
MQCC_FAILED

Programmer Response:
If the entire string is required, provide a larger buffer. On the mqInquireString call, the StringLength parameter is set by the call to indicate the size of the buffer required to accommodate the string without truncation.

2312 (X'0908')MQRC_SELECTOR_WRONG_TYPE
Explanation:
A data item with the specified selector exists in the bag, but has a data type that conflicts with the data type implied by the call being used. For example, the data item might have an integer data type, but the call being used might be mqSetString, which implies a character data type.

This reason code also occurs on the mqBagToBuffer, mqExecute, and mqPutBag calls when mqAddString or mqSetString was used to add the MQIACF_INQUIRY data item to the bag.

Completion Code:
MQCC_FAILED

Programmer Response:
For the mqSetInteger and mqSetString calls, specify MQIND_ALL for the ItemIndex parameter to delete from the bag all existing occurrences of the specified selector before creating the new occurrence with the required data type.

For the mqInquireBag, mqInquireInteger, and mqInquireString calls, use the mqInquireItemInfo call to determine the data type of the item with the specified selector, and then use the appropriate call to determine the value of the data item.

For the mqBagToBuffer, mqExecute, and mqPutBag calls, ensure that the MQIACF_INQUIRY data item is added to the bag using the mqAddInteger or mqSetInteger calls.

2313 (X'0909')MQRC_INCONSISTENT_ITEM_TYPE
Explanation:
The mqAddInteger or mqAddString call was issued to add another occurrence of the specified selector to the bag, but the data type of this occurrence differed from the data type of the first occurrence.

This reason can also occur on the mqBufferToBag and mqGetBag calls, where it indicates that the PCF in the buffer or message contains a selector that occurs more than once but with inconsistent data types.

Completion Code:
MQCC_FAILED

Programmer Response:
For the mqAddInteger and mqAddString calls, use the call appropriate to the data type of the first occurrence of that selector in the bag.

For the mqBufferToBag and mqGetBag calls, check the logic of the application that created the buffer or sent the message to ensure that multiple-occurrence selectors occur with only one data type. A message that contains a mixture of data types for a selector cannot be retrieved using the mqGetBag call: 

If one of the MQGMO_BROWSE_* options was specified, the message remains on the queue and can be retrieved using the MQGET call. 
In other cases, the message has already been removed from the queue and discarded. If the message was retrieved within a unit of work, the unit of work can be backed out and the message retrieved using the MQGET call.
2314 (X'090A')MQRC_INDEX_ERROR
Explanation:
An index parameter to a call or method has a value that is not valid. The value must be zero or greater. For bag calls, certain MQIND_* values can also be specified: 

For the mqDeleteItem, mqSetInteger and mqSetString calls, MQIND_ALL and MQIND_NONE are valid. 
For the mqInquireBag, mqInquireInteger, mqInquireString, and mqInquireItemInfo calls, MQIND_NONE is valid.
Completion Code:
MQCC_FAILED

Programmer Response:
Specify a valid value.

2315 (X'090B')MQRC_SYSTEM_BAG_NOT_ALTERABLE
Explanation:
A call was issued to add a data item to a bag, modify the value of an existing data item in a bag, or retrieve a message into a bag, but the call failed because the bag is one that had been created by the system as a result of a previous mqExecute call. System bags cannot be modified by the application.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify the handle of a bag created by the application, or remove the call.

2316 (X'090C')MQRC_ITEM_COUNT_ERROR
Explanation:
The mqTruncateBag call was issued, but the ItemCount parameter specifies a value that is not valid. The value is either less than zero, or greater than the number of user-defined data items in the bag.

This reason also occurs on the mqCountItems call if the parameter pointer is not valid, or points to read-only storage. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a valid value. Use the mqCountItems call to determine the number of user-defined data items in the bag.

2317 (X'090D')MQRC_FORMAT_NOT_SUPPORTED
Explanation:
The Format field in the message descriptor MQMD contains a value that is not supported: 

In an administration message, the format value must be one of the following: MQFMT_ADMIN, MQFMT_EVENT, MQFMT_PCF. For the mqPutBag call, the field in error resides in the MsgDesc parameter of the call. For the mqGetBag call, the field in error resides in the message descriptor of the message about to be retrieved. 
On z/OS, the message was put to the command input queue with a format value of MQFMT_ADMIN, but the version of MQ being used does not support that format for commands.
Completion Code:
MQCC_FAILED

Programmer Response:
If the error occurred when putting a message, correct the format value.

If the error occurred when getting a message, the message cannot be retrieved using the mqGetBag call: 

If one of the MQGMO_BROWSE_* options was specified, the message remains on the queue and can be retrieved using the MQGET call. 
In other cases, the message has already been removed from the queue and discarded. If the message was retrieved within a unit of work, the unit of work can be backed out and the message retrieved using the MQGET call.
2318 (X'090E')MQRC_SELECTOR_NOT_SUPPORTED
Explanation:
The Selector parameter specifies a value that is a system selector (a value that is negative), but the system selector is not one that is supported by the call.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a selector value that is supported.

2319 (X'090F')MQRC_ITEM_VALUE_ERROR
Explanation:
The mqInquireBag or mqInquireInteger call was issued, but the ItemValue parameter is not valid. Either the parameter pointer is not valid, or it points to read-only storage. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the parameter.

2320 (X'0910')MQRC_HBAG_ERROR
Explanation:
A call was issued that has a parameter that is a bag handle, but the handle is not valid. For output parameters, this reason also occurs if the parameter pointer is not valid, or points to read-only storage. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the parameter.

2321 (X'0911')MQRC_PARAMETER_MISSING
Explanation:
An administration message requires a parameter that is not present in the administration bag. This reason code occurs only for bags created with the MQCBO_ADMIN_BAG or MQCBO_REORDER_AS_REQUIRED options.

Completion Code:
MQCC_FAILED

Programmer Response:
Review the description of the administration command being issued, and ensure that all required parameters are present in the bag.

2322 (X'0912')MQRC_CMD_SERVER_NOT_AVAILABLE
Explanation:
The command server that processes administration commands is not available.

Completion Code:
MQCC_FAILED

Programmer Response:
Start the command server.

2323 (X'0913')MQRC_STRING_LENGTH_ERROR
Explanation:
The StringLength parameter is not valid. Either the parameter pointer is not valid, or it points to read-only storage. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the parameter.

2324 (X'0914')MQRC_INQUIRY_COMMAND_ERROR
Explanation:
The mqAddInquiry call was used previously to add attribute selectors to the bag, but the command code to be used for the mqBagToBuffer, mqExecute, or mqPutBag call is not recognized. As a result, the correct PCF message cannot be generated.

Completion Code:
MQCC_FAILED

Programmer Response:
Remove the mqAddInquiry calls and use instead the mqAddInteger call with the appropriate MQIACF_*_ATTRS or MQIACH_*_ATTRS selectors.

2325 (X'0915')MQRC_NESTED_BAG_NOT_SUPPORTED
Explanation:
A bag that is input to the call contains nested bags. Nested bags are supported only for bags that are output from the call.

Completion Code:
MQCC_FAILED

Programmer Response:
Use a different bag as input to the call.

2326 (X'0916')MQRC_BAG_WRONG_TYPE
Explanation:
The Bag parameter specifies the handle of a bag that has the wrong type for the call. The bag must be an administration bag, that is, it must be created with the MQCBO_ADMIN_BAG option specified on the mqCreateBag call.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify the MQCBO_ADMIN_BAG option when the bag is created.

2327 (X'0917')MQRC_ITEM_TYPE_ERROR
Explanation:
The mqInquireItemInfo call was issued, but the ItemType parameter is not valid. Either the parameter pointer is not valid, or it points to read-only storage. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the parameter.

2328 (X'0918')MQRC_SYSTEM_BAG_NOT_DELETABLE
Explanation:
An mqDeleteBag call was issued to delete a bag, but the call failed because the bag is one that had been created by the system as a result of a previous mqExecute call. System bags cannot be deleted by the application.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify the handle of a bag created by the application, or remove the call.

2329 (X'0919')MQRC_SYSTEM_ITEM_NOT_DELETABLE
Explanation:
A call was issued to delete a system data item from a bag (a data item with one of the MQIASY_* selectors), but the call failed because the data item is one that cannot be deleted by the application.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify the selector of a user-defined data item, or remove the call.

2330 (X'091A')MQRC_CODED_CHAR_SET_ID_ERROR
Explanation:
The CodedCharSetId parameter is not valid. Either the parameter pointer is not valid, or it points to read-only storage. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the parameter.

2331 (X'091B')MQRC_MSG_TOKEN_ERROR
Explanation:
An MQGET call was issued to retrieve a message using the message token as a selection criterion, but the options specified are not valid, because MQMO_MATCH_MSG_TOKEN was specified with either MQGMO_WAIT or MQGMO_SET_SIGNAL.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Remove the MQMO_MATCH_MSG_TOKEN option from the MQGET call.

2332 (X'091C')MQRC_MISSING_WIH
Explanation:
An MQPUT or MQPUT1 call was issued to put a message on a queue whose IndexType attribute had the value MQIT_MSG_TOKEN, but the Format field in the MQMD was not MQFMT_WORK_INFO_HEADER. This error occurs only when the message arrives at the destination queue manager.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Modify the application to ensure that it places an MQWIH structure at the start of the message data, and sets the Format field in the MQMD to MQFMT_WORK_INFO_HEADER. Alternatively, change the ApplType attribute of the process definition used by the destination queue to be MQAT_WLM, and specify the required service name and service step name in its EnvData attribute.

2333 (X'091D')MQRC_WIH_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQWIH structure that is not valid. Possible errors include the following: 

The StrucId field is not MQWIH_STRUC_ID. 
The Version field is not MQWIH_VERSION_1. 
The StrucLength field is not MQWIH_LENGTH_1. 
The CodedCharSetId field is zero, or a negative value that is not valid. 
The BufferLength parameter of the call has a value that is too small to accommodate the structure (the structure extends beyond the end of the message).

On z/OS, this error also occurs when the IndexType attribute of the queue is MQIT_MSG_TOKEN, but the message data does not begin with an MQWIH structure.
Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly. Ensure that the application sets the CodedCharSetId field to a valid value (note: MQCCSI_DEFAULT, MQCCSI_EMBEDDED, MQCCSI_Q_MGR, and MQCCSI_UNDEFINED are not valid in this field). 

On z/OS, if the queue has an IndexType of MQIT_MSG_TOKEN, ensure that the message data begins with an MQWIH structure.
2334 (X'091E')MQRC_RFH_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQRFH or MQRFH2 structure that is not valid. Possible errors include the following: 

The StrucId field is not MQRFH_STRUC_ID. 
The Version field is not MQRFH_VERSION_1 (MQRFH), or MQRFH_VERSION_2 (MQRFH2). 
The StrucLength field specifies a value that is too small to include the structure plus the variable-length data at the end of the structure. 
The CodedCharSetId field is zero, or a negative value that is not valid. 
The BufferLength parameter of the call has a value that is too small to accommodate the structure (the structure extends beyond the end of the message).
Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly. Ensure that the application sets the CodedCharSetId field to a valid value (note: MQCCSI_DEFAULT, MQCCSI_EMBEDDED, MQCCSI_Q_MGR, and MQCCSI_UNDEFINED are not valid in this field).

2335 (X'091F')MQRC_RFH_STRING_ERROR
Explanation:
The contents of the NameValueString field in the MQRFH structure are not valid. NameValueString must adhere to the following rules: 

The string must consist of zero or more name/value pairs separated from each other by one or more blanks; the blanks are not significant. 
If a name or value contains blanks that are significant, the name or value must be enclosed in double-quote characters. 
If a name or value itself contains one or more double-quote characters, the name or value must be enclosed in double-quote characters, and each embedded double-quote character must be doubled. 
A name or value can contain any characters other than the null, which acts as a delimiter. The null and characters following it, up to the defined length of NameValueString, are ignored.
The following is a valid NameValueString: 

Famous_Words "The program displayed ""Hello World"""Completion Code:
MQCC_FAILED

Programmer Response:
Modify the application that generated the message to ensure that it places in the NameValueString field data that adheres to the rules listed above. Check that the StrucLength field is set to the correct value.

2336 (X'0920')MQRC_RFH_COMMAND_ERROR
Explanation:
The message contains an MQRFH structure, but the command name contained in the NameValueString field is not valid.

Completion Code:
MQCC_FAILED

Programmer Response:
Modify the application that generated the message to ensure that it places in the NameValueString field a command name that is valid.

2337 (X'0921')MQRC_RFH_PARM_ERROR
Explanation:
The message contains an MQRFH structure, but a parameter name contained in the NameValueString field is not valid for the command specified.

Completion Code:
MQCC_FAILED

Programmer Response:
Modify the application that generated the message to ensure that it places in the NameValueString field only parameters that are valid for the specified command.

2338 (X'0922')MQRC_RFH_DUPLICATE_PARM
Explanation:
The message contains an MQRFH structure, but a parameter occurs more than once in the NameValueString field when only one occurrence is valid for the specified command.

Completion Code:
MQCC_FAILED

Programmer Response:
Modify the application that generated the message to ensure that it places in the NameValueString field only one occurrence of the parameter.

2339 (X'0923')MQRC_RFH_PARM_MISSING
Explanation:
The message contains an MQRFH structure, but the command specified in the NameValueString field requires a parameter that is not present.

Completion Code:
MQCC_FAILED

Programmer Response:
Modify the application that generated the message to ensure that it places in the NameValueString field all parameters that are required for the specified command.

2340 (X'0924')MQRC_CHAR_CONVERSION_ERROR
Explanation:
This reason code is returned by the Java MQQueueManager constructor when a required character-set conversion is not available. The conversion required is between two nonUnicode character sets.

This reason code occurs in the following environment: MQ Classes for Java on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the National Language Resources component of the OS/390 Language Environment is installed, and that conversion between the IBM-1047 and ISO8859-1 character sets is available.

2341 (X'0925')MQRC_UCS2_CONVERSION_ERROR
Explanation:
This reason code is returned by the Java MQQueueManager constructor when a required character-set conversion is not available. The conversion required is between the UCS-2 Unicode character set and the queue-manager's character set. IBM-500 is used for the queue-manager's character set if no specific value is available.

This reason code occurs in the following environment: MQ Classes for Java on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the relevant Unicode conversion tables are installed, and that they are available to the z/OS Language Environment. The conversion tables should be installed as part of the z/OS C/C++ optional feature. Refer to the z/OS C/C++ Programming Guide for more information about enabling UCS-2 conversions.

2342 (X'0926')MQRC_DB2_NOT_AVAILABLE
Explanation:
An MQOPEN, MQPUT1, or MQSET call, or a command, was issued to access a shared queue, but it failed because the queue manager is not connected to a DB2 subsystem. As a result, the queue manager is unable to access the object definition relating to the shared queue.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Configure the DB2 subsystem so that the queue manager can connect to it.

2343 (X'0927')MQRC_OBJECT_NOT_UNIQUE
Explanation:
An MQOPEN or MQPUT1 call, or a command, was issued to access a queue, but the call failed because the queue specified cannot be resolved unambiguously. There exists a shared queue with the specified name, and a nonshared queue with the same name.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
One of the queues must be deleted. If the queue to be deleted contains messages, use the MQSC command MOVE QLOCAL to move the messages to a different queue, and then use the command DELETE QLOCAL to delete the queue.

2344 (X'0928')MQRC_CONN_TAG_NOT_RELEASED
Explanation:
An MQDISC call was issued when there was a unit of work outstanding for the connection handle. For CICS, IMS, and RRS connections, the MQDISC call does not commit or back out the unit of work. As a result, the connection tag associated with the unit of work is not yet available for reuse. The tag becomes available for reuse only when processing of the unit of work has been completed.

This reason code occurs only on z/OS.

Completion Code:
MQCC_WARNING

Programmer Response:
Do not try to reuse the connection tag immediately. If the MQCONNX call is issued with the same connection tag, and that tag is still in use, the call fails with reason code MQRC_CONN_TAG_IN_USE.

2345 (X'0929')MQRC_CF_NOT_AVAILABLE
Explanation:
An MQOPEN or MQPUT1 call was issued to access a shared queue, but the allocation of the coupling-facility structure specified in the queue definition failed because there is no suitable coupling facility to hold the structure, based on the preference list in the active CFRM policy.

This reason code can also occur when the API call requires a capability that is not supported by the CF level defined in the coupling-facility structure object. For example, this reason code is returned by an attempt to open a shared queue that has a index type of MQIT_GROUP_ID, but the coupling-facility structure for the queue has a CF level lower than three.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Make available a coupling facility with one of the names specified in the CFRM policy, or modify the CFRM policy to specify the names of coupling facilities that are available.

2346 (X'092A')MQRC_CF_STRUC_IN_USE
Explanation:
An MQI call or command was issued to operate on a shared queue, but the call failed because the coupling-facility structure specified in the queue definition is temporarily unavailable. The coupling-facility structure can be unavailable because a structure dump is in progress, or new connectors to the structure are currently inhibited, or an existing connector to the structure failed or disconnected abnormally and clean-up is not yet complete.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
The problem is temporary; wait a short while and then retry the operation.

2347 (X'092B')MQRC_CF_STRUC_LIST_HDR_IN_USE
Explanation:
An MQGET, MQOPEN, MQPUT1, or MQSET call was issued to access a shared queue, but the call failed because the list header associated with the coupling-facility structure specified in the queue definition is temporarily unavailable. The list header is unavailable because it is undergoing recovery processing.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
The problem is temporary; wait a short while and then retry the operation.

2348 (X'092C')MQRC_CF_STRUC_AUTH_FAILED
Explanation:
An MQOPEN or MQPUT1 call was issued to access a shared queue, but the call failed because the user is not authorized to access the coupling-facility structure specified in the queue definition.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Modify the security profile for the user identifier used by the application so that the application can access the coupling-facility structure specified in the queue definition.

2349 (X'092D')MQRC_CF_STRUC_ERROR
Explanation:
An MQOPEN or MQPUT1 call was issued to access a shared queue, but the call failed because the coupling-facility structure name specified in the queue definition is not defined in the CFRM data set, or is not the name of a list structure.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Modify the queue definition to specify the name of a coupling-facility list structure that is defined in the CFRM data set.

2350 (X'092E')MQRC_CONN_TAG_NOT_USABLE
Explanation:
An MQCONNX call was issued specifying one of the MQCNO_*_CONN_TAG_* options, but the call failed because the connection tag specified by ConnTag in MQCNO is being used by the queue manager for recovery processing, and this processing is delayed pending recovery of the coupling facility.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
The problem is likely to persist. Consult the system programmer to ascertain the cause of the problem.

2351 (X'092F')MQRC_GLOBAL_UOW_CONFLICT
Explanation:
An attempt was made to use inside a global unit of work a connection handle that is participating in another global unit of work. This can occur when an application passes connection handles between objects where the objects are involved in different DTC transactions. Because transaction completion is asynchronous, it is possible for this error to occur after the application has finalized the first object and committed its transaction.

This error does not occur for nontransactional MQI calls.

This reason code occurs only on Windows and z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the "MTS Transaction Support" attribute defined for the object's class is set correctly. If necessary, modify the application so that the connection handle is not used by objects participating in different units of work.

2352 (X'0930')MQRC_LOCAL_UOW_CONFLICT
Explanation:
An attempt was made to use inside a global unit of work a connection handle that is participating in a queue-manager coordinated local unit of work. This can occur when an application passes connection handles between objects where one object is involved in a DTC transaction and the other is not.

This error does not occur for nontransactional MQI calls.

This reason code occurs only on Windows and z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the "MTS Transaction Support" attribute defined for the object's class is set correctly. If necessary, modify the application so that the connection handle is not used by objects participating in different units of work.

2353 (X'0931')MQRC_HANDLE_IN_USE_FOR_UOW
Explanation:
An attempt was made to use outside a unit of work a connection handle that is participating in a global unit of work.

This error can occur when an application passes connection handles between objects where one object is involved in a DTC transaction and the other is not. Because transaction completion is asynchronous, it is possible for this error to occur after the application has finalized the first object and committed its transaction.

This error can also occur when a single object that was created and associated with the transaction loses that association whilst the object is running. The association is lost when DTC terminates the transaction independently of MTS. This might be because the transaction timed out, or because DTC shut down.

This error does not occur for nontransactional MQI calls.

This reason code occurs only on Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the "MTS Transaction Support" attribute defined for the object's class is set correctly. If necessary, modify the application so that objects executing within different units of work do not try to use the same connection handle.

2354 (X'0932')MQRC_UOW_ENLISTMENT_ERROR
Explanation:
This reason code can occur for a variety of reasons. The most likely reason is that an object created by a DTC transaction does not issue a transactional MQI call until after the DTC transaction has timed out. (If the DTC transaction times out after a transactional MQI call has been issued, reason code MQRC_HANDLE_IN_USE_FOR_UOW is returned by the failing MQI call.)

Another cause of MQRC_UOW_ENLISTMENT_ERROR is incorrect installation; Windows NT Service pack must be installed after the Windows NT Option pack.

This reason code occurs only on Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the DTC "Transaction timeout" value. If necessary, verify the NT installation order.

2355 (X'0933')MQRC_UOW_MIX_NOT_SUPPORTED
Explanation:
The mixture of calls used by the application to perform operations within a unit of work is not supported. In particular, it is not possible to mix within the same process a local unit of work coordinated by the queue manager with a global unit of work coordinated by DTC (Distributed Transaction Coordinator).

An application may cause this mixture to arise if some objects in a package are coordinated by DTC and others are not. It can also occur if transactional MQI calls from an MTS client are mixed with transactional MQI calls from a library package transactional MTS object.

No problem arises if all transactional MQI calls originate from transactional MTS objects, or all transactional MQI calls originate from nontransactional MTS objects. But when a mixture of styles is used, the first style used fixes the style for the unit of work, and subsequent attempts to use the other style within the process fail with reason code MQRC_UOW_MIX_NOT_SUPPORTED.

When an application is run twice, scheduling factors in the operating system mean that it is possible for the queue-manager-coordinated transactional calls to fail in one run, and for the DTC-coordinated transactional calls to fail in the other run.

This reason code occurs only on Windows when running a version of the queue manager prior to version 5.2.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the "MTS Transaction Support" attribute defined for the object's class is set correctly. If necessary, modify the application so that objects executing within different units of work do not try to use the same connection handle.

2356 (X'0934')MQRC_WXP_ERROR
Explanation:
An MQXCLWLN call was issued from a cluster workload exit to obtain the address of the next record in the chain, but the workload exit parameter structure ExitParms is not valid, for one of the following reasons: 

The parameter pointer is not valid. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.) 
The StrucId field is not MQWXP_STRUC_ID. 
The Version field is not MQWXP_VERSION_2. 
The CacheContext field does not contain the value passed to the exit by the queue manager.
Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the parameter specified for ExitParms is the MQWXP structure that was passed to the exit when the exit was invoked.

2357 (X'0935')MQRC_CURRENT_RECORD_ERROR
Explanation:
An MQXCLWLN call was issued from a cluster workload exit to obtain the address of the next record in the chain, but the address specified by the CurrentRecord parameter is not the address of a valid record. CurrentRecord must be the address of a destination record (MQWDR), queue record (MQWQR), or cluster record (MQWCR) residing within the cluster cache.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the cluster workload exit passes the address of a valid record residing in the cluster cache.

2358 (X'0936')MQRC_NEXT_OFFSET_ERROR
Explanation:
An MQXCLWLN call was issued from a cluster workload exit to obtain the address of the next record in the chain, but the offset specified by the NextOffset parameter is not valid. NextOffset must be the value of one of the following fields: 

ChannelDefOffset field in MQWDR 
ClusterRecOffset field in MQWDR 
ClusterRecOffset field in MQWQR 
ClusterRecOffset field in MQWCR
Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the value specified for the NextOffset parameter is the value of one of the fields listed above.

2359 (X'0937')MQRC_NO_RECORD_AVAILABLE
Explanation:
An MQXCLWLN call was issued from a cluster workload exit to obtain the address of the next record in the chain, but the current record is the last record in the chain.

Completion Code:
MQCC_FAILED

Programmer Response:
None.

2360 (X'0938')MQRC_OBJECT_LEVEL_INCOMPATIBLE
Explanation:
An MQOPEN or MQPUT1 call, or a command, was issued, but the definition of the object to be accessed is not compatible with the queue manager to which the application has connected. The object definition was created or modified by a different version of the queue manager.

If the object to be accessed is a queue, the incompatible object definition could be the object specified, or one of the object definitions used to resolve the specified object (for example, the base queue to which an alias queue resolves, or the transmission queue to which a remote queue or queue-manager alias resolves).

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
The application must be run on a queue manager that is compatible with the object definition. Refer to the WebSphere MQ for z/OS Concepts and Planning Guide and the WebSphere MQ for z/OS System Setup Guide for information about compatibility and migration between different versions of the queue manager.

2361 (X'0939')MQRC_NEXT_RECORD_ERROR
Explanation:
An MQXCLWLN call was issued from a cluster workload exit to obtain the address of the next record in the chain, but the address specified for the NextRecord parameter is either null, not valid, or the address of read-only storage. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredictable results occur.)

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a valid address for the NextRecord parameter.

2362 (X'093A')MQRC_BACKOUT_THRESHOLD_REACHED
Explanation:
This reason code occurs only in the Reason field in an MQDLH structure, or in the Feedback field in the MQMD of a report message.

A JMS ConnectionConsumer found a message that exceeds the queue's backout threshold. The queue does not have a backout requeue queue defined, so the message was processed as specified by the disposition options in the Report field in the MQMD of the message.

On queue managers that do not support the BackoutThreshold and BackoutRequeueQName queue attributes, JMS ConnectionConsumer uses a value of 20 for the backout threshold. When the BackoutCount of a message reaches this threshold, the message is processed as specified by the disposition options.

If the Report field specifies one of the MQRO_EXCEPTION_* options, this reason code appears in the Feedback field of the report message. If the Report field specifies MQRO_DEAD_LETTER_Q, or the disposition report options are left as default, this reason code appears in the Reason field of the MQDLH.

Completion Code:
None

Programmer Response:
Investigate the cause of the backout count being greater than the threshold. To correct this, define the backout queue for the queue concerned.

2363 (X'093B')MQRC_MSG_NOT_MATCHED
Explanation:
This reason code occurs only in the Reason field in an MQDLH structure, or in the Feedback field in the MQMD of a report message.

While performing Point-to-Point messaging, JMS encountered a message matching none of the selectors of ConnectionConsumers monitoring the queue. To maintain performance, the message was processed as specified by the disposition options in the Report field in the MQMD of the message.

If the Report field specifies one of the MQRO_EXCEPTION_* options, this reason code appears in the Feedback field of the report message. If the Report field specifies MQRO_DEAD_LETTER_Q, or the disposition report options are left as default, this reason code appears in the Reason field of the MQDLH.

Completion Code:
None

Programmer Response:
To correct this, ensure that the ConnectionConsumers monitoring the queue provide a complete set of selectors. Alternatively, set the QueueConnectionFactory to retain messages.

2364 (X'093C')MQRC_JMS_FORMAT_ERROR
Explanation:
This reason code is generated when JMS encounters a message that it is unable to parse. If such a message is encountered by a JMS ConnectionConsumer, the message is processed as specified by the disposition options in the Report field in the MQMD of the message.

If the Report field specifies one of the MQRO_EXCEPTION_* options, this reason code appears in the Feedback field of the report message. If the Report field specifies MQRO_DEAD_LETTER_Q, or the disposition report options are left as default, this reason code appears in the Reason field of the MQDLH.

Completion Code:
None

Programmer Response:
Investigate the origin of the message.

2365 (X'093D')MQRC_SEGMENTS_NOT_SUPPORTED
Explanation:
An MQPUT call was issued to put a segment of a logical message, but the queue on which the message is to be placed has an IndexType of MQIT_GROUP_ID. Message segments cannot be placed on queues with this index type.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Modify the application to put messages that are not segments; ensure that the MQMF_SEGMENT and MQMF_LAST_SEGMENT flags in the MsgFlags field in MQMD are not set, and that the Offset is zero. Alternatively, change the index type of the queue.

2366 (X'093E')MQRC_WRONG_CF_LEVEL
Explanation:
An MQOPEN or MQPUT1 call was issued specifying a shared queue, but the queue requires a coupling-facility structure with a different level of capability.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the coupling-facility structure used for the queue is at the level required to support the capabilities that the queue provides.

2367 (X'093F')MQRC_CONFIG_CREATE_OBJECT
Explanation:
This condition is detected when an object is created.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2368 (X'0940')MQRC_CONFIG_CHANGE_OBJECT
Explanation:
This condition is detected when an object is changed.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2369 (X'0941')MQRC_CONFIG_DELETE_OBJECT
Explanation:
This condition is detected when an object is deleted.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2370 (X'0942')MQRC_CONFIG_REFRESH_OBJECT
Explanation:
This condition is detected when an object is refreshed.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2371 (X'0943')MQRC_CHANNEL_SSL_ERROR
Explanation:
This condition is detected when a connection cannot be established due to an SSL key-exchange or authentication failure.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2373 (X'0945')MQRC_CF_STRUC_FAILED
Explanation:
An MQI call or command was issued to access a shared queue, but the call failed because the coupling-facility structure used for the shared queue had failed.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Report the problem to the operator or administrator, who should use the MQSC command RECOVER CFSTRUCT to initiate recovery of the coupling-facility structure

2374 (X'0946')MQRC_API_EXIT_ERROR
Explanation:
An API exit function returned an invalid response code, or failed in some other way.

This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the exit logic to ensure that the exit is returning valid values in the ExitResponse and ExitResponse2 fields of the MQAXP structure. Consult the FFST record to see if it contains more detail about the problem.

2375 (X'0947')MQRC_API_EXIT_INIT_ERROR
Explanation:
The queue manager encountered an error while attempting to initialize the execution environment for an API exit function.

This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Consult the FFST record to obtain more detail about the problem.

2376 (X'0948')MQRC_API_EXIT_TERM_ERROR
Explanation:
The queue manager encountered an error while attempting to terminate the execution environment for an API exit function.

This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Consult the FFST record to obtain more detail about the problem.

2377 (X'0949')MQRC_EXIT_REASON_ERROR
Explanation:
An MQXEP call was issued by an API exit function, but the value specified for the ExitReason parameter is either not valid, or not supported for the specified function identifier Function.

This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Modify the exit function to specify a value for ExitReason that is valid for the specified value of Function.

2378 (X'094A')MQRC_RESERVED_VALUE_ERROR
Explanation:
An MQXEP call was issued by an API exit function, but the value specified for the Reserved parameter is not valid. The value must be the null pointer.

This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Modify the exit to specify the null pointer as the value of the Reserved parameter.

2379 (X'094B')MQRC_NO_DATA_AVAILABLE
Explanation:
This reason should be returned by the MQZ_ENUMERATE_AUTHORITY_DATA installable service component when there is no more authority data to return to the invoker of the service component. 

On z/OS, this reason code does not occur.
Completion Code:
MQCC_FAILED

Programmer Response:
None.

2380 (X'094C')MQRC_SCO_ERROR
Explanation:
On an MQCONNX call, the MQSCO structure is not valid for one of the following reasons: 

The StrucId field is not MQSCO_STRUC_ID. 
The Version field is not MQSCO_VERSION_1.
This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the definition of the MQSCO structure.

2381 (X'094D')MQRC_KEY_REPOSITORY_ERROR
Explanation:
On an MQCONN or MQCONNX call, the location of the key repository is either not specified, not valid, or results in an error when used to access the key repository. The location of the key repository is specified by one of the following: 

The value of the MQSSLKEYR environment variable (MQCONN or MQCONNX call), or 
The value of the KeyRepository field in the MQSCO structure (MQCONNX call only).
For the MQCONNX call, if both MQSSLKEYR and KeyRepository are specified, the latter is used.

This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a valid location for the key repository.

2382 (X'094E')MQRC_CRYPTO_HARDWARE_ERROR
Explanation:
On an MQCONN or MQCONNX call, the configuration string for the cryptographic hardware is not valid, or results in an error when used to configure the cryptographic hardware. The configuration string is specified by one of the following: 

The value of the MQSSLCRYP environment variable (MQCONN or MQCONNX call), or 
The value of the CryptoHardware field in the MQSCO structure (MQCONNX call only).
For the MQCONNX call, if both MQSSLCRYP and CryptoHardware are specified, the latter is used.

This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a valid configuration string for the cryptographic hardware.

2383 (X'094F')MQRC_AUTH_INFO_REC_COUNT_ERROR
Explanation:
On an MQCONNX call, the AuthInfoRecCount field in the MQSCO structure specifies a value that is less than zero.

This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a value for AuthInfoRecCount that is zero or greater.

2384 (X'0950')MQRC_AUTH_INFO_REC_ERROR
Explanation:
On an MQCONNX call, the MQSCO structure does not specify the address of the MQAIR records correctly. One of the following applies: 

AuthInfoRecCount is greater than zero, but AuthInfoRecOffset is zero and AuthInfoRecPtr is the null pointer. 
AuthInfoRecOffset is not zero and AuthInfoRecPtr is not the null pointer. 
AuthInfoRecPtr is not a valid pointer. 
AuthInfoRecOffset or AuthInfoRecPtr points to storage that is not accessible.
This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that one of AuthInfoRecOffset or AuthInfoRecPtr is zero and the other nonzero. Ensure that the field used points to accessible storage.

2385 (X'0951')MQRC_AIR_ERROR
Explanation:
On an MQCONNX call, an MQAIR record is not valid for one of the following reasons: 

The StrucId field is not MQAIR_STRUC_ID. 
The Version field is not MQAIR_VERSION_1.
This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the definition of the MQAIR record.

2386 (X'0952')MQRC_AUTH_INFO_TYPE_ERROR
Explanation:
On an MQCONNX call, the AuthInfoType field in an MQAIR record specifies a value that is not valid.

This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify MQAIT_CRL_LDAP for AuthInfoType.

2387 (X'0953')MQRC_AUTH_INFO_CONN_NAME_ERROR
Explanation:
On an MQCONNX call, the AuthInfoConnName field in an MQAIR record specifies a value that is not valid.

This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a valid connection name.

2388 (X'0954')MQRC_LDAP_USER_NAME_ERROR
Explanation:
On an MQCONNX call, an LDAP user name in an MQAIR record is not specified correctly. One of the following applies: 

LDAPUserNameLength is greater than zero, but LDAPUserNameOffset is zero and LDAPUserNamePtr is the null pointer. 
LDAPUserNameOffset is nonzero and LDAPUserNamePtr is not the null pointer. 
LDAPUserNamePtr is not a valid pointer. 
LDAPUserNameOffset or LDAPUserNamePtr points to storage that is not accessible.
This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that one of LDAPUserNameOffset or LDAPUserNamePtr is zero and the other nonzero. Ensure that the field used points to accessible storage.

2389 (X'0955')MQRC_LDAP_USER_NAME_LENGTH_ERR
Explanation:
On an MQCONNX call, the LDAPUserNameLength field in an MQAIR record specifies a value that is less than zero.

This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a value for LDAPUserNameLength that is zero or greater.

2390 (X'0956')MQRC_LDAP_PASSWORD_ERROR
Explanation:
On an MQCONNX call, the LDAPPassword field in an MQAIR record specifies a value when no value is allowed.

This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Specify a value that is blank or null.

2391 (X'0957')MQRC_SSL_ALREADY_INITIALIZED
Explanation:
An MQCONN or MQCONNX call was issued with SSL configuration options specified, but the SSL environment had already been initialized. The connection to the queue manager completed successfully, but the SSL configuration options specified on the call were ignored; the existing SSL environment was used instead.

This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_WARNING

Programmer Response:
If the application must be run with the SSL configuration options defined on the MQCONN or MQCONNX call, use the MQDISC call to sever the connection to the queue manager and then terminate the application. Alternatively run the application later when the SSL environment has not been initialized.

2392 (X'0958')MQRC_SSL_CONFIG_ERROR
Explanation:
On an MQCONNX call, the MQCNO structure does not specify the MQSCO structure correctly. One of the following applies: 

SSLConfigOffset is nonzero and SSLConfigPtr is not the null pointer. 
SSLConfigPtr is not a valid pointer. 
SSLConfigOffset or SSLConfigPtr points to storage that is not accessible.
This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that one of SSLConfigOffset or SSLConfigPtr is zero and the other nonzero. Ensure that the field used points to accessible storage.

2393 (X'0959')MQRC_SSL_INITIALIZATION_ERROR
Explanation:
An MQCONN or MQCONNX call was issued with SSL configuration options specified, but an error occurred during the initialization of the SSL environment.

This reason code occurs in the following environments: AIX, HP-UX, Solaris, Windows.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the SSL installation is correct.

2394 (X'095A')MQRC_Q_INDEX_TYPE_ERROR
Explanation:
An MQGET call was issued specifying one or more of the following options: 

MQGMO_ALL_MSGS_AVAILABLE 
MQGMO_ALL_SEGMENTS_AVAILABLE 
MQGMO_COMPLETE_MSG 
MQGMO_LOGICAL_ORDER
but the call failed because the queue is not indexed by group identifier. These options require the queue to have an IndexType of MQIT_GROUP_ID.

This reason code occurs only on z/OS.

Completion Code:
MQCC_FAILED

Programmer Response:
Redefine the queue to have an IndexType of MQIT_GROUP_ID. Alternatively, modify the application to avoid using the options listed above.

2395 (X'095B')MQRC_CFBS_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQCFBS structure that is not valid.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly.

2396 (X'095C')MQRC_SSL_NOT_ALLOWED
Explanation:
A connection to a queue manager was requested, specifying SSL encryption. However, the connection mode requested is one that does not support SSL (for example, bindings connect).

This reason code occurs only with Java applications.

Completion Code:
MQCC_FAILED

Programmer Response:
Modify the application to request client connection mode, or to disable SSL encryption.

2397 (X'095D')MQRC_JSSE_ERROR
Explanation:
JSSE reported an error (for example, while connecting to a queue manager using SSL encryption). The MQException object containing this reason code references the Exception thrown by JSSE; this can be obtained by using the MQException.getCause() method. From JMS, the MQException is linked to the thrown JMSException.

This reason code occurs only with Java applications.

Completion Code:
MQCC_FAILED

Programmer Response:
Inspect the causal exception to determine the JSSE error.

2398 (X'095E')MQRC_SSL_PEER_NAME_MISMATCH
Explanation:
The application attempted to connect to the queue manager using SSL encryption, but the distinguished name presented by the queue manager does not match the specified pattern.

This reason code occurs only with Java applications.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the certificates used to identify the queue manager. Also check the value of the sslPeerName property specified by the application.

2399 (X'095F')MQRC_SSL_PEER_NAME_ERROR
Explanation:
The application specified a peer name of incorrect format.

This reason code occurs only with Java applications.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the value of the sslPeerName property specified by the application.

2400 (X'0960')MQRC_UNSUPPORTED_CIPHER_SUITE
Explanation:
A connection to a queue manager was requested, specifying SSL encryption. However, JSSE reported that it does not support the CipherSuite specified by the application.

This reason code occurs only with Java applications.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the CipherSuite specified by the application. Note that the names of JSSE CipherSuites differ from their equivalent CipherSpecs used by the queue manager.

Also, check that JSSE is correctly installed.

2401 (X'0961')MQRC_SSL_CERTIFICATE_REVOKED
Explanation:
A connection to a queue manager was requested, specifying SSL encryption. However, the certificate presented by the queue manager was found to be revoked by one of the specified CertStores.

This reason code occurs only with Java applications.

Completion Code:
MQCC_FAILED

Programmer Response:
Check the certificates used to identify the queue manager.

2402 (X'0962')MQRC_SSL_CERT_STORE_ERROR
Explanation:
A connection to a queue manager was requested, specifying SSL encryption. However, none of the CertStore objects provided by the application could be searched for the certificate presented by the queue manager. The MQException object containing this reason code references the Exception encountered when searching the first CertStore; this can be obtained using the MQException.getCause() method. From JMS, the MQException is linked to the thrown JMSException.

This reason code occurs only with Java applications.

Completion Code:
MQCC_FAILED

Programmer Response:
Inspect the causal exception to determine the underlying error. Check the CertStore objects provided by your application. If the causal exception is a java.lang.NoSuchElementException, ensure that your application is not specifying an empty collection of CertStore objects.

2406 (X'0966')MQRC_CLIENT_EXIT_LOAD_ERROR
Explanation:
The external user exit required for a client connection could not be loaded because the shared library specified for it cannot be found, or the entry point specified for it cannot be found.

This reason code occurs only with Java applications.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that the correct library has been specified, and that the path variable for the machine environment includes the relevant directory. Ensure also that the entry point has been named properly and that the named library does export it.

2407 (X'0967')MQRC_CLIENT_EXIT_ERROR
Explanation:
A failure occured while executing a non-Java user exit for a client connection.

This reason code occurs only with Java applications.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the non-Java user exit can accept the parameters and message being passed to it and that it can handle error conditions, and that any information that the exit requires, such as user data, is correct and available.

2409 (X'0969')MQRC_SSL_KEY_RESET_ERROR
Explanation:
On an MQCONN or MQCONNX call, the value of the SSL key reset count is not in the valid range of 0 through 999 999 999.

The value of the SSL key reset count is specified by either the value of the MQSSLRESET environment variable (MQCONN or MQCONNX call), or the value of the KeyResetCount field in the MQSCO structure (MQCONNX call only). For the MQCONNX call, if both MQSSLRESET and KeyResetCount are specified, the latter is used. MQCONN or MQCONNX

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure and the MQSSLRESET environment variable are set correctly.

2411 (X'096B')MQRC_LOGGER_STATUS
Explanation:
This condition is detected when a logger event occurs.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2412 (X'096C')MQRC_COMMAND_MQSC
Explanation:
This condition is detected when an MQSC command is executed.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2413 (X'096D')MQRC_COMMAND_PCF
Explanation:
This condition is detected when a PCF command is executed.

Completion Code:
MQCC_WARNING

Programmer Response:
None. This reason code is only used to identify the corresponding event message.

2414 (X'096E')MQRC_CFIF_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQCFIF structure that is not valid.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly.

2415 (X'096F')MQRC_CFSF_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQCFSF structure that is not valid.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly.

2416 (X'0970')MQRC_CFGR_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQCFGR structure that is not valid.

This reason code occurs in the following environments: AIX, HP-UX, z/OS, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly.

2417 (X'0971')MQRC_MSG_NOT_ALLOWED_IN_GROUP
Explanation:
An MQPUT or MQPUT1 call was issued to put a message in a group but it is not valid to put such a message in a group. An example of an invalid message is a PCF message where the Type is MQCFT_TRACE_ROUTE.

Completion Code:
MQCC_FAILED

Programmer Response:
Remove the invalid message from the group.

2418 (X'0972')MQRC_FILTER_OPERATOR_ERROR
Explanation:
The Operator parameter supplied is not valid.

If it is an input variable then the value is not one of the MQCFOP_* constant values. If it is an output variable then the parameter pointer is not valid, or it points to read-only storage. (It is not always possible to detect parameter pointers that are not valid; if not detected, unpredicatable results occur.)

Completion Code:
MQCC_FAILED

Programmer Response:
Correct the parameter.

2419 (X'0973')MQRC_NESTED_SELECTOR_ERROR
Explanation:
An mqAddBag call was issued, but the bag to be nested contained a data item with an inconsistent selector. This reason only occurs if the bag into which the nested bag was to be added was created with the MQCBO_CHECK_SELECTORS option.

Completion Code:
MQCC_FAILED

Programmer Response:
Ensure that all data items within the bag to be nested have selectors that are consistent with the data type implied by the item.

2420 (X'0974')MQRC_EPH_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQEPH structure that is not valid. Possible errors include the following: 

The StrucId field is not MQEPH_STRUC_ID. 
The Version field is not MQEPH_VERSION_1. 
The StrucLength field specifies a value that is too small to include the structure plus the variable-length data at the end of the structure. 
The CodedCharSetId field is zero, or a negative value that is not valid. 
The Flags field contains an invalid combination of MQEPH_* values. 
The BufferLength parameter of the call has a value that is too small to accommodate the structure, so the structure extends beyond the end of the message.
Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly. Ensure that the application sets the CodedCharSetId field to a valid value; note that MQCCSI_DEFAULT, MQCCSI_EMBEDDED, MQCCSI_Q_MGR, and MQCCSI_UNDEFINED are not valid in this field.

2421 (X'0975')MQRC_RFH_FORMAT_ERROR
Explanation:
The message contains an MQRFH structure, but its format is incorrect. If you are using WebSphere MQ SOAP, the error is in an incoming SOAP/MQ request message.

Completion Code:
MQCC_FAILED

Programmer Response:
If you are using WebSphere MQ SOAP with the IBM-supplied sender, contact your IBM support center. If you are using WebSphere MQ SOAP with a bespoke sender, check that the RFH2 section of the SOAP/MQ request message is in valid RFH2 format.

2422 (X'0976')MQRC_CFBF_ERROR
Explanation:
An MQPUT or MQPUT1 call was issued, but the message data contains an MQCFBF structure that is not valid.

This reason code occurs in the following environments: AIX, HP-UX, OS/2, i5/OS, Solaris, Windows, plus WebSphere MQ clients connected to these systems.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the fields in the structure are set correctly.

2423 (X'0977')MQRC_CLIENT_CHANNEL_CONFLICT
Explanation:
A client channel definition table was specified for determining the name of the channel, but the name has already been defined.

This reason code occurs only with Java applications.

Completion Code:
MQCC_FAILED

Programmer Response:
Change the channel name to blank and try again.

6100 (X'17D4')MQRC_REOPEN_EXCL_INPUT_ERROR
Explanation:
An open object does not have the correct ImqObject open options and requires one or more additional options. An implicit reopen is required but closure has been prevented.

Closure has been prevented because the queue is open for exclusive input and closure might result in the queue being accessed by another process or thread, before the queue is reopened by the process or thread that presently has access.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

Programmer Response:
Set the open options explicitly to cover all eventualities so that implicit reopening is not required.

6101 (X'17D5')MQRC_REOPEN_INQUIRE_ERROR
Explanation:
An open object does not have the correct ImqObject open options and requires one or more additional options. An implicit reopen is required but closure has been prevented.

Closure has been prevented because one or more characteristics of the object need to be checked dynamically prior to closure, and the open options do not already include MQOO_INQUIRE.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

Programmer Response:
Set the open options explicitly to include MQOO_INQUIRE.

6102 (X'17D6')MQRC_REOPEN_SAVED_CONTEXT_ERR
Explanation:
An open object does not have the correct ImqObject open options and requires one or more additional options. An implicit reopen is required but closure has been prevented.

Closure has been prevented because the queue is open with MQOO_SAVE_ALL_CONTEXT, and a destructive get has been performed previously. This has caused retained state information to be associated with the open queue and this information would be destroyed by closure.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

Programmer Response:
Set the open options explicitly to cover all eventualities so that implicit reopening is not required.

6103 (X'17D7')MQRC_REOPEN_TEMPORARY_Q_ERROR
Explanation:
An open object does not have the correct ImqObject open options and requires one or more additional options. An implicit reopen is required but closure has been prevented.

Closure has been prevented because the queue is a local queue of the definition type MQQDT_TEMPORARY_DYNAMIC, that would be destroyed by closure.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

Programmer Response:
Set the open options explicitly to cover all eventualities so that implicit reopening is not required.

6104 (X'17D8')MQRC_ATTRIBUTE_LOCKED
Explanation:
An attempt has been made to change the value of an attribute of an object while that object is open, or, for an ImqQueueManager object, while that object is connected. Certain attributes cannot be changed in these circumstances. Close or disconnect the object (as appropriate) before changing the attribute value.

An object may have been connected and/or opened unexpectedly and implicitly in order to perform an MQINQ call. Check the attribute cross-reference table in the WebSphere MQ Using C++ book to determine whether any of your method invocations result in an MQINQ call.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

Programmer Response:
Include MQOO_INQUIRE in the ImqObject open options and set them earlier.

6105 (X'17D9')MQRC_CURSOR_NOT_VALID
Explanation:
The browse cursor for an open queue has been invalidated since it was last used by an implicit reopen.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

Programmer Response:
Set the ImqObject open options explicitly to cover all eventualities so that implicit reopening is not required.

6106 (X'17DA')MQRC_ENCODING_ERROR
Explanation:
The encoding of the (next) message item needs to be MQENC_NATIVE for pasting.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6107 (X'17DB')MQRC_STRUC_ID_ERROR
Explanation:
The structure id for the (next) message item, which is derived from the 4 characters beginning at the data pointer, is either missing or is inconsistent with the class of object into which the item is being pasted.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6108 (X'17DC')MQRC_NULL_POINTER
Explanation:
A null pointer has been supplied where a nonnull pointer is either required or implied.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6109 (X'17DD')MQRC_NO_CONNECTION_REFERENCE
Explanation:
The connection reference is null. A connection to an ImqQueueManager object is required.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6110 (X'17DE')MQRC_NO_BUFFER
Explanation:
No buffer is available. For an ImqCache object, one cannot be allocated, denoting an internal inconsistency in the object state that should not occur.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6111 (X'17DF')MQRC_BINARY_DATA_LENGTH_ERROR
Explanation:
The length of the binary data is inconsistent with the length of the target attribute. Zero is a correct length for all attributes. 

The correct length for an accounting token is MQ_ACCOUNTING_TOKEN_LENGTH. 
The correct length for an alternate security id is MQ_SECURITY_ID_LENGTH. 
The correct length for a correlation id is MQ_CORREL_ID_LENGTH. 
The correct length for a facility token is MQ_FACILITY_LENGTH. 
The correct length for a group id is MQ_GROUP_ID_LENGTH. 
The correct length for a message id is MQ_MSG_ID_LENGTH. 
The correct length for an instance id is MQ_OBJECT_INSTANCE_ID_LENGTH. 
The correct length for a transaction instance id is MQ_TRAN_INSTANCE_ID_LENGTH. 
The correct length for a message token is MQ_MSG_TOKEN_LENGTH.
This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6112 (X'17E0')MQRC_BUFFER_NOT_AUTOMATIC
Explanation:
A user-defined (and managed) buffer cannot be resized. A user-defined buffer can only be replaced or withdrawn. A buffer must be automatic (system-managed) before it can be resized.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

Programmer Response:
6113 (X'17E1')MQRC_INSUFFICIENT_BUFFER
Explanation:
There is insufficient buffer space available after the data pointer to accommodate the request. This might be because the buffer cannot be resized.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6114 (X'17E2')MQRC_INSUFFICIENT_DATA
Explanation:
There is insufficient data after the data pointer to accommodate the request.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6115 (X'17E3')MQRC_DATA_TRUNCATED
Explanation:
Data has been truncated when copying from one buffer to another. This might be because the target buffer cannot be resized, or because there is a problem addressing one or other buffer, or because a buffer is being downsized with a smaller replacement.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6116 (X'17E4')MQRC_ZERO_LENGTH
Explanation:
A zero length has been supplied where a positive length is either required or implied.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6117 (X'17E5')MQRC_NEGATIVE_LENGTH
Explanation:
A negative length has been supplied where a zero or positive length is required.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6118 (X'17E6')MQRC_NEGATIVE_OFFSET
Explanation:
A negative offset has been supplied where a zero or positive offset is required.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6119 (X'17E7')MQRC_INCONSISTENT_FORMAT
Explanation:
The format of the (next) message item is inconsistent with the class of object into which the item is being pasted.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6120 (X'17E8')MQRC_INCONSISTENT_OBJECT_STATE
Explanation:
There is an inconsistency between this object, which is open, and the referenced ImqQueueManager object, which is not connected.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6121 (X'17E9')MQRC_CONTEXT_OBJECT_NOT_VALID
Explanation:
The ImqPutMessageOptions context reference does not reference a valid ImqQueue object. The object has been previously destroyed.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6122 (X'17EA')MQRC_CONTEXT_OPEN_ERROR
Explanation:
The ImqPutMessageOptions context reference references an ImqQueue object that could not be opened to establish a context. This may be because the ImqQueue object has inappropriate open options. Inspect the referenced object reason code to establish the cause.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6123 (X'17EB')MQRC_STRUC_LENGTH_ERROR
Explanation:
The length of a data structure is inconsistent with its content. For an MQRMH, the length is insufficient to contain the fixed fields and all offset data.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

6124 (X'17EC')MQRC_NOT_CONNECTED
Explanation:
A method failed because a required connection to a queue manager was not available, and a connection cannot be established implicitly because the IMQ_IMPL_CONN flag of the ImqQueueManager behavior class attribute is FALSE.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

Programmer Response:
Establish a connection to a queue manager and retry.

6125 (X'17ED')MQRC_NOT_OPEN
Explanation:
A method failed because an object was not open, and opening cannot be accomplished implicitly because the IMQ_IMPL_OPEN flag of the ImqObject behavior class attribute is FALSE.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

Programmer Response:
Open the object and retry.

6126 (X'17EE')MQRC_DISTRIBUTION_LIST_EMPTY
Explanation:
An ImqDistributionList failed to open because there are no ImqQueue objects referenced.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

Programmer Response:
Establish at least one ImqQueue object in which the distribution list reference addresses the ImqDistributionList object, and retry.

6127 (X'17EF')MQRC_INCONSISTENT_OPEN_OPTIONS
Explanation:
A method failed because the object is open, and the ImqObject open options are inconsistent with the required operation. The object cannot be reopened implicitly because the IMQ_IMPL_OPEN flag of the ImqObject behavior class attribute is false.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

Programmer Response:
Open the object with appropriate ImqObject open options and retry.

6128 (X'17FO')MQRC_WRONG_VERSION
Explanation:
A method failed because a version number specified or encountered is either incorrect or not supported.

For the ImqCICSBridgeHeader class, the problem is with the version attribute.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

Programmer Response:
If you are specifying a version number, use one that is supported by the class. If you are receiving message data from another program, ensure that both programs are using consistent and supported version numbers.

6129 (X'17F1')MQRC_REFERENCE_ERROR
Explanation:
An object reference is invalid.

There is a problem with the address of a referenced object. At the time of use, the address of the object is nonnull, but is invalid and cannot be used for its intended purpose.

This reason code occurs in the WebSphere MQ C++ environment.

Completion Code:
MQCC_FAILED

Programmer Response:
Check that the referenced object is neither deleted nor out of scope, or remove the reference by supplying a null address value.

Reason code cross reference
The following is a list of reason codes, in alphabetic order, cross referenced to the full description in numeric order.

2129 (X'0851') 
MQRC_ADAPTER_CONN_LOAD_ERROR 
2133 (X'0855') 
MQRC_ADAPTER_CONV_LOAD_ERROR 
2131 (X'0853') 
MQRC_ADAPTER_DEFS_ERROR 
2132 (X'0854') 
MQRC_ADAPTER_DEFS_LOAD_ERROR 
2138 (X'085A') 
MQRC_ADAPTER_DISC_LOAD_ERROR 
2204 (X'089C') 
MQRC_ADAPTER_NOT_AVAILABLE 
2130 (X'0852') 
MQRC_ADAPTER_SERV_LOAD_ERROR 
2127 (X'084F') 
MQRC_ADAPTER_STORAGE_SHORTAGE 
2385 (X'0951') 
MQRC_AIR_ERROR 
2001 (X'07D1') 
MQRC_ALIAS_BASE_Q_TYPE_ERROR 
2002 (X'07D2') 
MQRC_ALREADY_CONNECTED 
2103 (X'0837') 
MQRC_ANOTHER_Q_MGR_CONNECTED 
2374 (X'0946') 
MQRC_API_EXIT_ERROR 
2375 (X'0947') 
MQRC_API_EXIT_INIT_ERROR 
2183 (X'0887') 
MQRC_API_EXIT_LOAD_ERROR 
2376 (X'0948') 
MQRC_API_EXIT_TERM_ERROR 
900 (X'0384') 
MQRC_APPL_FIRST 
999 (X'03E7') 
MQRC_APPL_LAST 
2157 (X'086D') 
MQRC_ASID_MISMATCH 
6104 (X'17D8') 
MQRC_ATTRIBUTE_LOCKED 
2387 (X'0953') 
MQRC_AUTH_INFO_CONN_NAME_ERROR 
2383 (X'094F') 
MQRC_AUTH_INFO_REC_COUNT_ERROR 
2384 (X'0950') 
MQRC_AUTH_INFO_REC_ERROR 
2386 (X'0952') 
MQRC_AUTH_INFO_TYPE_ERROR 
2003 (X'07D3') 
MQRC_BACKED_OUT 
2362 (X'093A') 
MQRC_BACKOUT_THRESHOLD_REACHED 
2303 (X'08FF') 
MQRC_BAG_CONVERSION_ERROR 
2326 (X'0916') 
MQRC_BAG_WRONG_TYPE 
6111 (X'17DF') 
MQRC_BINARY_DATA_LENGTH_ERROR 
2134 (X'0856') 
MQRC_BO_ERROR 
2125 (X'084D') 
MQRC_BRIDGE_STARTED 
2126 (X'084E') 
MQRC_BRIDGE_STOPPED 
2004 (X'07D4') 
MQRC_BUFFER_ERROR 
2005 (X'07D5') 
MQRC_BUFFER_LENGTH_ERROR 
6112 (X'17E0') 
MQRC_BUFFER_NOT_AUTOMATIC 
2219 (X'08AB') 
MQRC_CALL_IN_PROGRESS 
2277 (X'08E5') 
MQRC_CD_ERROR 
2345 (X'0929') 
MQRC_CF_NOT_AVAILABLE 
2348 (X'092C') 
MQRC_CF_STRUC_AUTH_FAILED 
2349 (X'092D') 
MQRC_CF_STRUC_ERROR 
2373 (X'0945') 
MQRC_CF_STRUC_FAILED 
2346 (X'092A') 
MQRC_CF_STRUC_IN_USE 
2347 (X'092B') 
MQRC_CF_STRUC_LIST_HDR_IN_USE 
2422 (X'0976') 
MQRC_CFBF_ERROR 
2395 (X'095B') 
MQRC_CFBS_ERROR 
2416 (X'0970') 
MQRC_CFGR_ERROR 
2235 (X'08BB') 
MQRC_CFH_ERROR 
2414 (X'096E') 
MQRC_CFIF_ERROR 
2236 (X'08BC') 
MQRC_CFIL_ERROR 
2237 (X'08BD') 
MQRC_CFIN_ERROR 
2415 (X'096F') 
MQRC_CFSF_ERROR 
2238 (X'08BE') 
MQRC_CFSL_ERROR 
2239 (X'08BF') 
MQRC_CFST_ERROR 
2295 (X'08F7') 
MQRC_CHANNEL_ACTIVATED 
2234 (X'08BA') 
MQRC_CHANNEL_AUTO_DEF_ERROR 
2233 (X'08B9') 
MQRC_CHANNEL_AUTO_DEF_OK 
2284 (X'08EC') 
MQRC_CHANNEL_CONV_ERROR 
2296 (X'08F8') 
MQRC_CHANNEL_NOT_ACTIVATED 
2371 (X'0943') 
MQRC_CHANNEL_SSL_ERROR 
2282 (X'08EA') 
MQRC_CHANNEL_STARTED 
2283 (X'08EB') 
MQRC_CHANNEL_STOPPED 
2279 (X'08E7') 
MQRC_CHANNEL_STOPPED_BY_USER 
2006 (X'07D6') 
MQRC_CHAR_ATTR_LENGTH_ERROR 
2007 (X'07D7') 
MQRC_CHAR_ATTRS_ERROR 
2008 (X'07D8') 
MQRC_CHAR_ATTRS_TOO_SHORT 
2340 (X'0924') 
MQRC_CHAR_CONVERSION_ERROR 
2187 (X'088B') 
MQRC_CICS_BRIDGE_RESTRICTION 
2140 (X'085C') 
MQRC_CICS_WAIT_FAILED 
2423 (X'0977') 
MQRC_CLIENT_CHANNEL_CONFLICT 
2278 (X'08E6') 
MQRC_CLIENT_CONN_ERROR 
2407 (X'0967') 
MQRC_CLIENT_EXIT_ERROR 
2406 (X'0966') 
MQRC_CLIENT_EXIT_LOAD_ERROR 
2266 (X'08DA') 
MQRC_CLUSTER_EXIT_ERROR 
2267 (X'08DB') 
MQRC_CLUSTER_EXIT_LOAD_ERROR 
2268 (X'08DC') 
MQRC_CLUSTER_PUT_INHIBITED 
2189 (X'088D') 
MQRC_CLUSTER_RESOLUTION_ERROR 
2269 (X'08DD') 
MQRC_CLUSTER_RESOURCE_ERROR 
2322 (X'0912') 
MQRC_CMD_SERVER_NOT_AVAILABLE 
2139 (X'085B') 
MQRC_CNO_ERROR 
2106 (X'083A') 
MQRC_COD_NOT_VALID_FOR_XCF_Q 
2330 (X'091A') 
MQRC_CODED_CHAR_SET_ID_ERROR 
2412 (X'096C') 
MQRC_COMMAND_MQSC 
2413 (X'096D') 
MQRC_COMMAND_PCF 
2300 (X'08FC') 
MQRC_COMMAND_TYPE_ERROR 
2368 (X'0940') 
MQRC_CONFIG_CHANGE_OBJECT 
2367 (X'093F') 
MQRC_CONFIG_CREATE_OBJECT 
2369 (X'0941') 
MQRC_CONFIG_DELETE_OBJECT 
2370 (X'0942') 
MQRC_CONFIG_REFRESH_OBJECT 
2160 (X'0870') 
MQRC_CONN_ID_IN_USE 
2271 (X'08DF') 
MQRC_CONN_TAG_IN_USE 
2344 (X'0928') 
MQRC_CONN_TAG_NOT_RELEASED 
2350 (X'092E') 
MQRC_CONN_TAG_NOT_USABLE 
2009 (X'07D9') 
MQRC_CONNECTION_BROKEN 
2273 (X'08E1') 
MQRC_CONNECTION_ERROR 
2217 (X'08A9') 
MQRC_CONNECTION_NOT_AUTHORIZED 
2202 (X'089A') 
MQRC_CONNECTION_QUIESCING 
2203 (X'089B') 
MQRC_CONNECTION_STOPPING 
2097 (X'0831') 
MQRC_CONTEXT_HANDLE_ERROR 
2098 (X'0832') 
MQRC_CONTEXT_NOT_AVAILABLE 
6121 (X'17E9') 
MQRC_CONTEXT_OBJECT_NOT_VALID 
6122 (X'17EA') 
MQRC_CONTEXT_OPEN_ERROR 
2120 (X'0848') 
MQRC_CONVERTED_MSG_TOO_BIG 
2190 (X'088E') 
MQRC_CONVERTED_STRING_TOO_BIG 
2207 (X'089F') 
MQRC_CORREL_ID_ERROR 
2382 (X'094E') 
MQRC_CRYPTO_HARDWARE_ERROR 
2357 (X'0935') 
MQRC_CURRENT_RECORD_ERROR 
6105 (X'17D9') 
MQRC_CURSOR_NOT_VALID 
2010 (X'07DA') 
MQRC_DATA_LENGTH_ERROR 
6115 (X'17E3') 
MQRC_DATA_TRUNCATED 
2150 (X'0866') 
MQRC_DBCS_ERROR 
2342 (X'0926') 
MQRC_DB2_NOT_AVAILABLE 
2198 (X'0896') 
MQRC_DEF_XMIT_Q_TYPE_ERROR 
2199 (X'0897') 
MQRC_DEF_XMIT_Q_USAGE_ERROR 
2263 (X'08D7') 
MQRC_DEST_ENV_ERROR 
2264 (X'08D8') 
MQRC_DEST_NAME_ERROR 
2135 (X'0857') 
MQRC_DH_ERROR 
6126 (X'17EE') 
MQRC_DISTRIBUTION_LIST_EMPTY 
2141 (X'085D') 
MQRC_DLH_ERROR 
2163 (X'0873') 
MQRC_DUPLICATE_RECOV_COORD 
2011 (X'07DB') 
MQRC_DYNAMIC_Q_NAME_ERROR 
6106 (X'17DA') 
MQRC_ENCODING_ERROR 
2308 (X'0904') 
MQRC_ENCODING_NOT_SUPPORTED 
2012 (X'07DC') 
MQRC_ENVIRONMENT_ERROR 
2420 (X'0974') 
MQRC_EPH_ERROR 
2377 (X'0949') 
MQRC_EXIT_REASON_ERROR 
2013 (X'07DD') 
MQRC_EXPIRY_ERROR 
2014 (X'07DE') 
MQRC_FEEDBACK_ERROR 
2208 (X'08A0') 
MQRC_FILE_SYSTEM_ERROR 
2418 (X'0972') 
MQRC_FILTER_OPERATOR_ERROR 
2110 (X'083E') 
MQRC_FORMAT_ERROR 
2317 (X'090D') 
MQRC_FORMAT_NOT_SUPPORTED 
2281 (X'08E9') 
MQRC_FUNCTION_ERROR 
2298 (X'08FA') 
MQRC_FUNCTION_NOT_SUPPORTED 
2016 (X'07E0') 
MQRC_GET_INHIBITED 
2351 (X'092F') 
MQRC_GLOBAL_UOW_CONFLICT 
2186 (X'088A') 
MQRC_GMO_ERROR 
2258 (X'08D2') 
MQRC_GROUP_ID_ERROR 
2353 (X'0931') 
MQRC_HANDLE_IN_USE_FOR_UOW 
2017 (X'07E1') 
MQRC_HANDLE_NOT_AVAILABLE 
2320 (X'0910') 
MQRC_HBAG_ERROR 
2280 (X'08E8') 
MQRC_HCONFIG_ERROR 
2018 (X'07E2') 
MQRC_HCONN_ERROR 
2142 (X'085E') 
MQRC_HEADER_ERROR 
2019 (X'07E3') 
MQRC_HOBJ_ERROR 
2148 (X'0864') 
MQRC_IIH_ERROR 
2241 (X'08C1') 
MQRC_INCOMPLETE_GROUP 
2242 (X'08C2') 
MQRC_INCOMPLETE_MSG 
2259 (X'08D3') 
MQRC_INCONSISTENT_BROWSE 
2243 (X'08C3') 
MQRC_INCONSISTENT_CCSIDS 
2244 (X'08C4') 
MQRC_INCONSISTENT_ENCODINGS 
6119 (X'17E7') 
MQRC_INCONSISTENT_FORMAT 
2313 (X'0909') 
MQRC_INCONSISTENT_ITEM_TYPE 
6120 (X'17E8') 
MQRC_INCONSISTENT_OBJECT_STATE 
6127 (X'17EF') 
MQRC_INCONSISTENT_OPEN_OPTIONS 
2185 (X'0889') 
MQRC_INCONSISTENT_PERSISTENCE 
2245 (X'08C5') 
MQRC_INCONSISTENT_UOW 
2314 (X'090A') 
MQRC_INDEX_ERROR 
2306 (X'0902') 
MQRC_INDEX_NOT_PRESENT 
2020 (X'07E4') 
MQRC_INHIBIT_VALUE_ERROR 
2286 (X'08EE') 
MQRC_INITIALIZATION_FAILED 
2324 (X'0914') 
MQRC_INQUIRY_COMMAND_ERROR 
6113 (X'17E1') 
MQRC_INSUFFICIENT_BUFFER 
6114 (X'17E2') 
MQRC_INSUFFICIENT_DATA 
2021 (X'07E5') 
MQRC_INT_ATTR_COUNT_ERROR 
2022 (X'07E6') 
MQRC_INT_ATTR_COUNT_TOO_SMALL 
2023 (X'07E7') 
MQRC_INT_ATTRS_ARRAY_ERROR 
2246 (X'08C6') 
MQRC_INVALID_MSG_UNDER_CURSOR 
2316 (X'090C') 
MQRC_ITEM_COUNT_ERROR 
2327 (X'0917') 
MQRC_ITEM_TYPE_ERROR 
2319 (X'090F') 
MQRC_ITEM_VALUE_ERROR 
2364 (X'093C') 
MQRC_JMS_FORMAT_ERROR 
2397 (X'095D') 
MQRC_JSSE_ERROR 
2381 (X'094D') 
MQRC_KEY_REPOSITORY_ERROR 
2390 (X'0956') 
MQRC_LDAP_PASSWORD_ERROR 
2388 (X'0954') 
MQRC_LDAP_USER_NAME_ERROR 
2389 (X'0955') 
MQRC_LDAP_USER_NAME_LENGTH_ERR 
2352 (X'0930') 
MQRC_LOCAL_UOW_CONFLICT 
2411 (X'096B') 
MQRC_LOGGER_STATUS 
2247 (X'08C7') 
MQRC_MATCH_OPTIONS_ERROR 
2025 (X'07E9') 
MQRC_MAX_CONNS_LIMIT_REACHED 
2026 (X'07EA') 
MQRC_MD_ERROR 
2248 (X'08C8') 
MQRC_MDE_ERROR 
2027 (X'07EB') 
MQRC_MISSING_REPLY_TO_Q 
2332 (X'091C') 
MQRC_MISSING_WIH 
2249 (X'08C9') 
MQRC_MSG_FLAGS_ERROR 
2206 (X'089E') 
MQRC_MSG_ID_ERROR 
2417 (X'0971') 
MQRC_MSG_NOT_ALLOWED_IN_GROUP 
2363 (X'093B') 
MQRC_MSG_NOT_MATCHED 
2250 (X'08CA') 
MQRC_MSG_SEQ_NUMBER_ERROR 
2331 (X'091B') 
MQRC_MSG_TOKEN_ERROR 
2218 (X'08AA') 
MQRC_MSG_TOO_BIG_FOR_CHANNEL 
2030 (X'07EE') 
MQRC_MSG_TOO_BIG_FOR_Q 
2031 (X'07EF') 
MQRC_MSG_TOO_BIG_FOR_Q_MGR 
2029 (X'07ED') 
MQRC_MSG_TYPE_ERROR 
2301 (X'08FD') 
MQRC_MULTIPLE_INSTANCE_ERROR 
2136 (X'0858') 
MQRC_MULTIPLE_REASONS 
2201 (X'0899') 
MQRC_NAME_IN_USE 
2194 (X'0892') 
MQRC_NAME_NOT_VALID_FOR_TYPE 
6117 (X'17E5') 
MQRC_NEGATIVE_LENGTH 
6118 (X'17E6') 
MQRC_NEGATIVE_OFFSET 
2325 (X'0915') 
MQRC_NESTED_BAG_NOT_SUPPORTED 
2419 (X'0973') 
MQRC_NESTED_SELECTOR_ERROR 
2358 (X'0936') 
MQRC_NEXT_OFFSET_ERROR 
2361 (X'0939') 
MQRC_NEXT_RECORD_ERROR 
6110 (X'17DE') 
MQRC_NO_BUFFER 
6109 (X'17DD') 
MQRC_NO_CONNECTION_REFERENCE 
2379 (X'094B') 
MQRC_NO_DATA_AVAILABLE 
2270 (X'08DE') 
MQRC_NO_DESTINATIONS_AVAILABLE 
2121 (X'0849') 
MQRC_NO_EXTERNAL_PARTICIPANTS 
2033 (X'07F1') 
MQRC_NO_MSG_AVAILABLE 
2209 (X'08A1') 
MQRC_NO_MSG_LOCKED 
2034 (X'07F2') 
MQRC_NO_MSG_UNDER_CURSOR 
2359 (X'0937') 
MQRC_NO_RECORD_AVAILABLE 
0 (X'0000') 
MQRC_NONE 
2035 (X'07F3') 
MQRC_NOT_AUTHORIZED 
6124 (X'17EC') 
MQRC_NOT_CONNECTED 
2119 (X'0847') 
MQRC_NOT_CONVERTED 
6125 (X'17ED') 
MQRC_NOT_OPEN 
2036 (X'07F4') 
MQRC_NOT_OPEN_FOR_BROWSE 
2037 (X'07F5') 
MQRC_NOT_OPEN_FOR_INPUT 
2038 (X'07F6') 
MQRC_NOT_OPEN_FOR_INQUIRE 
2039 (X'07F7') 
MQRC_NOT_OPEN_FOR_OUTPUT 
2093 (X'082D') 
MQRC_NOT_OPEN_FOR_PASS_ALL 
2094 (X'082E') 
MQRC_NOT_OPEN_FOR_PASS_IDENT 
2040 (X'07F8') 
MQRC_NOT_OPEN_FOR_SET 
2095 (X'082F') 
MQRC_NOT_OPEN_FOR_SET_ALL 
2096 (X'0830') 
MQRC_NOT_OPEN_FOR_SET_IDENT 
6108 (X'17DC') 
MQRC_NULL_POINTER 
2100 (X'0834') 
MQRC_OBJECT_ALREADY_EXISTS 
2041 (X'07F9') 
MQRC_OBJECT_CHANGED 
2101 (X'0835') 
MQRC_OBJECT_DAMAGED 
2042 (X'07FA') 
MQRC_OBJECT_IN_USE 
2360 (X'0938') 
MQRC_OBJECT_LEVEL_INCOMPATIBLE 
2152 (X'0868') 
MQRC_OBJECT_NAME_ERROR 
2343 (X'0927') 
MQRC_OBJECT_NOT_UNIQUE 
2153 (X'0869') 
MQRC_OBJECT_Q_MGR_NAME_ERROR 
2155 (X'086B') 
MQRC_OBJECT_RECORDS_ERROR 
2043 (X'07FB') 
MQRC_OBJECT_TYPE_ERROR 
2044 (X'07FC') 
MQRC_OD_ERROR 
2251 (X'08CB') 
MQRC_OFFSET_ERROR 
2137 (X'0859') 
MQRC_OPEN_FAILED 
2274 (X'08E2') 
MQRC_OPTION_ENVIRONMENT_ERROR 
2045 (X'07FD') 
MQRC_OPTION_NOT_VALID_FOR_TYPE 
2046 (X'07FE') 
MQRC_OPTIONS_ERROR 
2252 (X'08CC') 
MQRC_ORIGINAL_LENGTH_ERROR 
2310 (X'0906') 
MQRC_OUT_SELECTOR_ERROR 
2123 (X'084B') 
MQRC_OUTCOME_MIXED 
2124 (X'084C') 
MQRC_OUTCOME_PENDING 
2193 (X'0891') 
MQRC_PAGESET_ERROR 
2192 (X'0890') 
MQRC_PAGESET_FULL 
2321 (X'0911') 
MQRC_PARAMETER_MISSING 
2272 (X'08E0') 
MQRC_PARTIALLY_CONVERTED 
2122 (X'084A') 
MQRC_PARTICIPANT_NOT_AVAILABLE 
2149 (X'0865') 
MQRC_PCF_ERROR 
2047 (X'07FF') 
MQRC_PERSISTENCE_ERROR 
2048 (X'0800') 
MQRC_PERSISTENT_NOT_ALLOWED 
2173 (X'087D') 
MQRC_PMO_ERROR 
2158 (X'086E') 
MQRC_PMO_RECORD_FLAGS_ERROR 
2050 (X'0802') 
MQRC_PRIORITY_ERROR 
2049 (X'0801') 
MQRC_PRIORITY_EXCEEDS_MAXIMUM 
2051 (X'0803') 
MQRC_PUT_INHIBITED 
2159 (X'086F') 
MQRC_PUT_MSG_RECORDS_ERROR 
2290 (X'08F2') 
MQRC_Q_ALREADY_EXISTS 
2052 (X'0804') 
MQRC_Q_DELETED 
2224 (X'08B0') 
MQRC_Q_DEPTH_HIGH 
2225 (X'08B1') 
MQRC_Q_DEPTH_LOW 
2053 (X'0805') 
MQRC_Q_FULL 
2394 (X'095A') 
MQRC_Q_INDEX_TYPE_ERROR 
2222 (X'08AE') 
MQRC_Q_MGR_ACTIVE 
2058 (X'080A') 
MQRC_Q_MGR_NAME_ERROR 
2223 (X'08AF') 
MQRC_Q_MGR_NOT_ACTIVE 
2059 (X'080B') 
MQRC_Q_MGR_NOT_AVAILABLE 
2161 (X'0871') 
MQRC_Q_MGR_QUIESCING 
2162 (X'0872') 
MQRC_Q_MGR_STOPPING 
2055 (X'0807') 
MQRC_Q_NOT_EMPTY 
2226 (X'08B2') 
MQRC_Q_SERVICE_INTERVAL_HIGH 
2227 (X'08B3') 
MQRC_Q_SERVICE_INTERVAL_OK 
2056 (X'0808') 
MQRC_Q_SPACE_NOT_AVAILABLE 
2057 (X'0809') 
MQRC_Q_TYPE_ERROR 
2229 (X'08B5') 
MQRC_RAS_PROPERTY_ERROR 
2154 (X'086A') 
MQRC_RECS_PRESENT_ERROR 
6129 (X'17F1') 
MQRC_REFERENCE_ERROR 
2184 (X'0888') 
MQRC_REMOTE_Q_NAME_ERROR 
6100 (X'17D4') 
MQRC_REOPEN_EXCL_INPUT_ERROR 
6101 (X'17D5') 
MQRC_REOPEN_INQUIRE_ERROR 
6102 (X'17D6') 
MQRC_REOPEN_SAVED_CONTEXT_ERR 
6103 (X'17D7') 
MQRC_REOPEN_TEMPORARY_Q_ERROR 
2061 (X'080D') 
MQRC_REPORT_OPTIONS_ERROR 
2378 (X'094A') 
MQRC_RESERVED_VALUE_ERROR 
2102 (X'0836') 
MQRC_RESOURCE_PROBLEM 
2156 (X'086C') 
MQRC_RESPONSE_RECORDS_ERROR 
2336 (X'0920') 
MQRC_RFH_COMMAND_ERROR 
2338 (X'0922') 
MQRC_RFH_DUPLICATE_PARM 
2334 (X'091E') 
MQRC_RFH_ERROR 
2421 (X'0975') 
MQRC_RFH_FORMAT_ERROR 
2228 (X'08B4') 
MQRC_RFH_HEADER_FIELD_ERROR 
2337 (X'0921') 
MQRC_RFH_PARM_ERROR 
2339 (X'0923') 
MQRC_RFH_PARM_MISSING 
2335 (X'091F') 
MQRC_RFH_STRING_ERROR 
2220 (X'08AC') 
MQRC_RMH_ERROR 
2380 (X'094C') 
MQRC_SCO_ERROR 
2062 (X'080E') 
MQRC_SECOND_MARK_NOT_ALLOWED 
2063 (X'080F') 
MQRC_SECURITY_ERROR 
2253 (X'08CD') 
MQRC_SEGMENT_LENGTH_ZERO 
2365 (X'093D') 
MQRC_SEGMENTS_NOT_SUPPORTED 
2065 (X'0811') 
MQRC_SELECTOR_COUNT_ERROR 
2067 (X'0813') 
MQRC_SELECTOR_ERROR 
2066 (X'0812') 
MQRC_SELECTOR_LIMIT_EXCEEDED 
2068 (X'0814') 
MQRC_SELECTOR_NOT_FOR_TYPE 
2309 (X'0905') 
MQRC_SELECTOR_NOT_PRESENT 
2318 (X'090E') 
MQRC_SELECTOR_NOT_SUPPORTED 
2305 (X'0901') 
MQRC_SELECTOR_NOT_UNIQUE 
2304 (X'0900') 
MQRC_SELECTOR_OUT_OF_RANGE 
2299 (X'08FB') 
MQRC_SELECTOR_TYPE_ERROR 
2312 (X'0908') 
MQRC_SELECTOR_WRONG_TYPE 
2289 (X'08F1') 
MQRC_SERVICE_ERROR 
2285 (X'08ED') 
MQRC_SERVICE_NOT_AVAILABLE 
2069 (X'0815') 
MQRC_SIGNAL_OUTSTANDING 
2070 (X'0816') 
MQRC_SIGNAL_REQUEST_ACCEPTED 
2099 (X'0833') 
MQRC_SIGNAL1_ERROR 
2211 (X'08A3') 
MQRC_SOAP_AXIS_ERROR 
2210 (X'08A2') 
MQRC_SOAP_DOTNET_ERROR 
2212 (X'08A4') 
MQRC_SOAP_URL_ERROR 
2145 (X'0861') 
MQRC_SOURCE_BUFFER_ERROR 
2111 (X'083F') 
MQRC_SOURCE_CCSID_ERROR 
2113 (X'0841') 
MQRC_SOURCE_DECIMAL_ENC_ERROR 
2114 (X'0842') 
MQRC_SOURCE_FLOAT_ENC_ERROR 
2112 (X'0840') 
MQRC_SOURCE_INTEGER_ENC_ERROR 
2143 (X'085F') 
MQRC_SOURCE_LENGTH_ERROR 
2261 (X'08D5') 
MQRC_SRC_ENV_ERROR 
2262 (X'08D6') 
MQRC_SRC_NAME_ERROR 
2391 (X'0957') 
MQRC_SSL_ALREADY_INITIALIZED 
2402 (X'0962') 
MQRC_SSL_CERT_STORE_ERROR 
2401 (X'0961') 
MQRC_SSL_CERTIFICATE_REVOKED 
2392 (X'0958') 
MQRC_SSL_CONFIG_ERROR 
2409 (X'0969') 
MQRC_SSL_KEY_RESET_ERROR 
2393 (X'0959') 
MQRC_SSL_INITIALIZATION_ERROR 
2396 (X'095C') 
MQRC_SSL_NOT_ALLOWED 
2399 (X'095F') 
MQRC_SSL_PEER_NAME_ERROR 
2398 (X'095E') 
MQRC_SSL_PEER_NAME_MISMATCH 
2188 (X'088C') 
MQRC_STOPPED_BY_CLUSTER_EXIT 
2105 (X'0839') 
MQRC_STORAGE_CLASS_ERROR 
2192 (X'0890') 
MQRC_STORAGE_MEDIUM_FULL 
2071 (X'0817') 
MQRC_STORAGE_NOT_AVAILABLE 
2307 (X'0903') 
MQRC_STRING_ERROR 
2323 (X'0913') 
MQRC_STRING_LENGTH_ERROR 
2311 (X'0907') 
MQRC_STRING_TRUNCATED 
6107 (X'17DB') 
MQRC_STRUC_ID_ERROR 
6123 (X'17EB') 
MQRC_STRUC_LENGTH_ERROR 
2109 (X'083D') 
MQRC_SUPPRESSED_BY_EXIT 
2024 (X'07E8') 
MQRC_SYNCPOINT_LIMIT_REACHED 
2072 (X'0818') 
MQRC_SYNCPOINT_NOT_AVAILABLE 
2315 (X'090B') 
MQRC_SYSTEM_BAG_NOT_ALTERABLE 
2328 (X'0918') 
MQRC_SYSTEM_BAG_NOT_DELETABLE 
2302 (X'08FE') 
MQRC_SYSTEM_ITEM_NOT_ALTERABLE 
2329 (X'0919') 
MQRC_SYSTEM_ITEM_NOT_DELETABLE 
2146 (X'0862') 
MQRC_TARGET_BUFFER_ERROR 
2115 (X'0843') 
MQRC_TARGET_CCSID_ERROR 
2117 (X'0845') 
MQRC_TARGET_DECIMAL_ENC_ERROR 
2118 (X'0846') 
MQRC_TARGET_FLOAT_ENC_ERROR 
2116 (X'0844') 
MQRC_TARGET_INTEGER_ENC_ERROR 
2144 (X'0860') 
MQRC_TARGET_LENGTH_ERROR 
2287 (X'08EF') 
MQRC_TERMINATION_FAILED 
2265 (X'08D9') 
MQRC_TM_ERROR 
2191 (X'088F') 
MQRC_TMC_ERROR 
2075 (X'081B') 
MQRC_TRIGGER_CONTROL_ERROR 
2076 (X'081C') 
MQRC_TRIGGER_DEPTH_ERROR 
2077 (X'081D') 
MQRC_TRIGGER_MSG_PRIORITY_ERR 
2078 (X'081E') 
MQRC_TRIGGER_TYPE_ERROR 
2079 (X'081F') 
MQRC_TRUNCATED_MSG_ACCEPTED 
2080 (X'0820') 
MQRC_TRUNCATED_MSG_FAILED 
2341 (X'0925') 
MQRC_UCS2_CONVERSION_ERROR 
2195 (X'0893') 
MQRC_UNEXPECTED_ERROR 
2232 (X'08B8') 
MQRC_UNIT_OF_WORK_NOT_STARTED 
2082 (X'0822') 
MQRC_UNKNOWN_ALIAS_BASE_Q 
2197 (X'0895') 
MQRC_UNKNOWN_DEF_XMIT_Q 
2292 (X'08F4') 
MQRC_UNKNOWN_ENTITY 
2085 (X'0825') 
MQRC_UNKNOWN_OBJECT_NAME 
2086 (X'0826') 
MQRC_UNKNOWN_OBJECT_Q_MGR 
2288 (X'08F0') 
MQRC_UNKNOWN_Q_NAME 
2294 (X'08F6') 
MQRC_UNKNOWN_REF_OBJECT 
2087 (X'0827') 
MQRC_UNKNOWN_REMOTE_Q_MGR 
2104 (X'0838') 
MQRC_UNKNOWN_REPORT_OPTION 
2196 (X'0894') 
MQRC_UNKNOWN_XMIT_Q 
2400 (X'0960') 
MQRC_UNSUPPORTED_CIPHER_SUITE 
2297 (X'08F9') 
MQRC_UOW_CANCELED 
2354 (X'0932') 
MQRC_UOW_ENLISTMENT_ERROR 
2128 (X'0850') 
MQRC_UOW_IN_PROGRESS 
2355 (X'0933') 
MQRC_UOW_MIX_NOT_SUPPORTED 
2255 (X'08CF') 
MQRC_UOW_NOT_AVAILABLE 
2291 (X'08F3') 
MQRC_USER_ID_NOT_AVAILABLE 
2090 (X'082A') 
MQRC_WAIT_INTERVAL_ERROR 
2333 (X'091D') 
MQRC_WIH_ERROR 
2366 (X'093E') 
MQRC_WRONG_CF_LEVEL 
2256 (X'08D0') 
MQRC_WRONG_GMO_VERSION 
2257 (X'08D1') 
MQRC_WRONG_MD_VERSION 
6128 (X'17FO') 
MQRC_WRONG_VERSION 
2356 (X'0934') 
MQRC_WXP_ERROR 
2091 (X'082B') 
MQRC_XMIT_Q_TYPE_ERROR 
2092 (X'082C') 
MQRC_XMIT_Q_USAGE_ERROR 
2260 (X'08D4') 
MQRC_XQH_ERROR 
2107 (X'083B') 
MQRC_XWAIT_CANCELED 
2108 (X'083C') 
MQRC_XWAIT_ERROR 
6116 (X'17E4') 
MQRC_ZERO_LENGTH 


MQRC_* (Reason Codes)
=====================


MQRC_NONE 0 X'00000000' 
MQRC_APPL_FIRST 900 X'00000384' 
MQRC_APPL_LAST 999 X'000003E7' 
MQRC_ALIAS_BASE_Q_TYPE_ERROR 2001 X'000007D1' 
MQRC_ALREADY_CONNECTED 2002 X'000007D2' 
MQRC_BACKED_OUT 2003 X'000007D3' 
MQRC_BUFFER_ERROR 2004 X'000007D4' 
MQRC_BUFFER_LENGTH_ERROR 2005 X'000007D5' 
MQRC_CHAR_ATTR_LENGTH_ERROR 2006 X'000007D6' 
MQRC_CHAR_ATTRS_ERROR 2007 X'000007D7' 
MQRC_CHAR_ATTRS_TOO_SHORT 2008 X'000007D8' 
MQRC_CONNECTION_BROKEN 2009 X'000007D9' 
MQRC_DATA_LENGTH_ERROR 2010 X'000007DA' 
MQRC_DYNAMIC_Q_NAME_ERROR 2011 X'000007DB' 
MQRC_ENVIRONMENT_ERROR 2012 X'000007DC' 
MQRC_EXPIRY_ERROR 2013 X'000007DD' 
MQRC_FEEDBACK_ERROR 2014 X'000007DE' 
MQRC_GET_INHIBITED 2016 X'000007E0' 
MQRC_HANDLE_NOT_AVAILABLE 2017 X'000007E1' 
MQRC_HCONN_ERROR 2018 X'000007E2' 
MQRC_HOBJ_ERROR 2019 X'000007E3' 
MQRC_INHIBIT_VALUE_ERROR 2020 X'000007E4' 
MQRC_INT_ATTR_COUNT_ERROR 2021 X'000007E5' 
MQRC_INT_ATTR_COUNT_TOO_SMALL 2022 X'000007E6' 
MQRC_INT_ATTRS_ARRAY_ERROR 2023 X'000007E7' 
MQRC_SYNCPOINT_LIMIT_REACHED 2024 X'000007E8' 
MQRC_MAX_CONNS_LIMIT_REACHED 2025 X'000007E9' 
MQRC_MD_ERROR 2026 X'000007EA' 
MQRC_MISSING_REPLY_TO_Q 2027 X'000007EB' 
MQRC_MSG_TYPE_ERROR 2029 X'000007ED' 
MQRC_MSG_TOO_BIG_FOR_Q 2030 X'000007EE' 
MQRC_MSG_TOO_BIG_FOR_Q_MGR 2031 X'000007EF' 
MQRC_NO_MSG_AVAILABLE 2033 X'000007F1' 
MQRC_NO_MSG_UNDER_CURSOR 2034 X'000007F2' 
MQRC_NOT_AUTHORIZED 2035 X'000007F3' 
MQRC_NOT_OPEN_FOR_BROWSE 2036 X'000007F4' 
MQRC_NOT_OPEN_FOR_INPUT 2037 X'000007F5' 
MQRC_NOT_OPEN_FOR_INQUIRE 2038 X'000007F6' 
MQRC_NOT_OPEN_FOR_OUTPUT 2039 X'000007F7' 
MQRC_NOT_OPEN_FOR_SET 2040 X'000007F8' 
MQRC_OBJECT_CHANGED 2041 X'000007F9' 
MQRC_OBJECT_IN_USE 2042 X'000007FA' 
MQRC_OBJECT_TYPE_ERROR 2043 X'000007FB' 
MQRC_OD_ERROR 2044 X'000007FC' 
MQRC_OPTION_NOT_VALID_FOR_TYPE 2045 X'000007FD' 
MQRC_OPTIONS_ERROR 2046 X'000007FE' 
MQRC_PERSISTENCE_ERROR 2047 X'000007FF' 
MQRC_PERSISTENT_NOT_ALLOWED 2048 X'00000800' 
MQRC_PRIORITY_EXCEEDS_MAXIMUM 2049 X'00000801' 
MQRC_PRIORITY_ERROR 2050 X'00000802' 
MQRC_PUT_INHIBITED 2051 X'00000803' 
MQRC_Q_DELETED 2052 X'00000804' 
MQRC_Q_FULL 2053 X'00000805' 
MQRC_Q_NOT_EMPTY 2055 X'00000807' 
MQRC_Q_SPACE_NOT_AVAILABLE 2056 X'00000808' 
MQRC_Q_TYPE_ERROR 2057 X'00000809' 
MQRC_Q_MGR_NAME_ERROR 2058 X'0000080A' 
MQRC_Q_MGR_NOT_AVAILABLE 2059 X'0000080B' 
MQRC_REPORT_OPTIONS_ERROR 2061 X'0000080D' 
MQRC_SECOND_MARK_NOT_ALLOWED 2062 X'0000080E' 
MQRC_SECURITY_ERROR 2063 X'0000080F' 
MQRC_SELECTOR_COUNT_ERROR 2065 X'00000811' 
MQRC_SELECTOR_LIMIT_EXCEEDED 2066 X'00000812' 
MQRC_SELECTOR_ERROR 2067 X'00000813' 
MQRC_SELECTOR_NOT_FOR_TYPE 2068 X'00000814' 
MQRC_SIGNAL_OUTSTANDING 2069 X'00000815' 
MQRC_SIGNAL_REQUEST_ACCEPTED 2070 X'00000816' 
MQRC_STORAGE_NOT_AVAILABLE 2071 X'00000817' 
MQRC_SYNCPOINT_NOT_AVAILABLE 2072 X'00000818' 
MQRC_TRIGGER_CONTROL_ERROR 2075 X'0000081B' 
MQRC_TRIGGER_DEPTH_ERROR 2076 X'0000081C' 
MQRC_TRIGGER_MSG_PRIORITY_ERR 2077 X'0000081D' 
MQRC_TRIGGER_TYPE_ERROR 2078 X'0000081E' 
MQRC_TRUNCATED_MSG_ACCEPTED 2079 X'0000081F' 
MQRC_TRUNCATED_MSG_FAILED 2080 X'00000820' 
MQRC_UNKNOWN_ALIAS_BASE_Q 2082 X'00000822' 
MQRC_UNKNOWN_OBJECT_NAME 2085 X'00000825' 
MQRC_UNKNOWN_OBJECT_Q_MGR 2086 X'00000826' 
MQRC_UNKNOWN_REMOTE_Q_MGR 2087 X'00000827' 
MQRC_WAIT_INTERVAL_ERROR 2090 X'0000082A' 
MQRC_XMIT_Q_TYPE_ERROR 2091 X'0000082B' 
MQRC_XMIT_Q_USAGE_ERROR 2092 X'0000082C' 
MQRC_NOT_OPEN_FOR_PASS_ALL 2093 X'0000082D' 
MQRC_NOT_OPEN_FOR_PASS_IDENT 2094 X'0000082E' 
MQRC_NOT_OPEN_FOR_SET_ALL 2095 X'0000082F' 
MQRC_NOT_OPEN_FOR_SET_IDENT 2096 X'00000830' 
MQRC_CONTEXT_HANDLE_ERROR 2097 X'00000831' 
MQRC_CONTEXT_NOT_AVAILABLE 2098 X'00000832' 
MQRC_SIGNAL1_ERROR 2099 X'00000833' 
MQRC_OBJECT_ALREADY_EXISTS 2100 X'00000834' 
MQRC_OBJECT_DAMAGED 2101 X'00000835' 
MQRC_RESOURCE_PROBLEM 2102 X'00000836' 
MQRC_ANOTHER_Q_MGR_CONNECTED 2103 X'00000837' 
MQRC_UNKNOWN_REPORT_OPTION 2104 X'00000838' 
MQRC_STORAGE_CLASS_ERROR 2105 X'00000839' 
MQRC_COD_NOT_VALID_FOR_XCF_Q 2106 X'0000083A' 
MQRC_XWAIT_CANCELED 2107 X'0000083B' 
MQRC_XWAIT_ERROR 2108 X'0000083C' 
MQRC_SUPPRESSED_BY_EXIT 2109 X'0000083D' 
MQRC_FORMAT_ERROR 2110 X'0000083E' 
MQRC_SOURCE_CCSID_ERROR 2111 X'0000083F' 
MQRC_SOURCE_INTEGER_ENC_ERROR 2112 X'00000840' 
MQRC_SOURCE_DECIMAL_ENC_ERROR 2113 X'00000841' 
MQRC_SOURCE_FLOAT_ENC_ERROR 2114 X'00000842' 
MQRC_TARGET_CCSID_ERROR 2115 X'00000843' 
MQRC_TARGET_INTEGER_ENC_ERROR 2116 X'00000844' 
MQRC_TARGET_DECIMAL_ENC_ERROR 2117 X'00000845' 
MQRC_TARGET_FLOAT_ENC_ERROR 2118 X'00000846' 
MQRC_NOT_CONVERTED 2119 X'00000847' 
MQRC_CONVERTED_MSG_TOO_BIG 2120 X'00000848' 
MQRC_TRUNCATED 2120 X'00000848' 
MQRC_NO_EXTERNAL_PARTICIPANTS 2121 X'00000849' 
MQRC_PARTICIPANT_NOT_AVAILABLE 2122 X'0000084A' 
MQRC_OUTCOME_MIXED 2123 X'0000084B' 
MQRC_OUTCOME_PENDING 2124 X'0000084C' 
MQRC_BRIDGE_STARTED 2125 X'0000084D' 
MQRC_BRIDGE_STOPPED 2126 X'0000084E' 
MQRC_ADAPTER_STORAGE_SHORTAGE 2127 X'0000084F' 
MQRC_UOW_IN_PROGRESS 2128 X'00000850' 
MQRC_ADAPTER_CONN_LOAD_ERROR 2129 X'00000851' 
MQRC_ADAPTER_SERV_LOAD_ERROR 2130 X'00000852' 
MQRC_ADAPTER_DEFS_ERROR 2131 X'00000853' 
MQRC_ADAPTER_DEFS_LOAD_ERROR 2132 X'00000854' 
MQRC_ADAPTER_CONV_LOAD_ERROR 2133 X'00000855' 
MQRC_BO_ERROR 2134 X'00000856' 
MQRC_DH_ERROR 2135 X'00000857' 
MQRC_MULTIPLE_REASONS 2136 X'00000858' 
MQRC_OPEN_FAILED 2137 X'00000859' 
MQRC_ADAPTER_DISC_LOAD_ERROR 2138 X'0000085A' 
MQRC_CNO_ERROR 2139 X'0000085B' 
MQRC_CICS_WAIT_FAILED 2140 X'0000085C' 
MQRC_DLH_ERROR 2141 X'0000085D' 
MQRC_HEADER_ERROR 2142 X'0000085E' 
MQRC_SOURCE_LENGTH_ERROR 2143 X'0000085F' 
MQRC_TARGET_LENGTH_ERROR 2144 X'00000860' 
MQRC_SOURCE_BUFFER_ERROR 2145 X'00000861' 
MQRC_TARGET_BUFFER_ERROR 2146 X'00000862' 
MQRC_IIH_ERROR 2148 X'00000864' 
MQRC_PCF_ERROR 2149 X'00000865' 
MQRC_DBCS_ERROR 2150 X'00000866' 
MQRC_OBJECT_NAME_ERROR 2152 X'00000868' 
MQRC_OBJECT_Q_MGR_NAME_ERROR 2153 X'00000869' 
MQRC_RECS_PRESENT_ERROR 2154 X'0000086A' 
MQRC_OBJECT_RECORDS_ERROR 2155 X'0000086B' 
MQRC_RESPONSE_RECORDS_ERROR 2156 X'0000086C' 
MQRC_ASID_MISMATCH 2157 X'0000086D' 
MQRC_PMO_RECORD_FLAGS_ERROR 2158 X'0000086E' 
MQRC_PUT_MSG_RECORDS_ERROR 2159 X'0000086F' 
MQRC_CONN_ID_IN_USE 2160 X'00000870' 
MQRC_Q_MGR_QUIESCING 2161 X'00000871' 
MQRC_Q_MGR_STOPPING 2162 X'00000872' 
MQRC_DUPLICATE_RECOV_COORD 2163 X'00000873' 
MQRC_PMO_ERROR 2173 X'0000087D' 
MQRC_API_EXIT_NOT_FOUND 2182 X'00000886' 
MQRC_API_EXIT_LOAD_ERROR 2183 X'00000887' 
MQRC_REMOTE_Q_NAME_ERROR 2184 X'00000888' 
MQRC_INCONSISTENT_PERSISTENCE 2185 X'00000889' 
MQRC_GMO_ERROR 2186 X'0000088A' 
MQRC_CICS_BRIDGE_RESTRICTION 2187 X'0000088B' 
MQRC_STOPPED_BY_CLUSTER_EXIT 2188 X'0000088C' 
MQRC_CLUSTER_RESOLUTION_ERROR 2189 X'0000088D' 
MQRC_CONVERTED_STRING_TOO_BIG 2190 X'0000088E' 
MQRC_TMC_ERROR 2191 X'0000088F' 
MQRC_PAGESET_FULL 2192 X'00000890' 
MQRC_STORAGE_MEDIUM_FULL 2192 X'00000890' 
MQRC_PAGESET_ERROR 2193 X'00000891' 
MQRC_NAME_NOT_VALID_FOR_TYPE 2194 X'00000892' 
MQRC_UNEXPECTED_ERROR 2195 X'00000893' 
MQRC_UNKNOWN_XMIT_Q 2196 X'00000894' 
MQRC_UNKNOWN_DEF_XMIT_Q 2197 X'00000895' 
MQRC_DEF_XMIT_Q_TYPE_ERROR 2198 X'00000896' 
MQRC_DEF_XMIT_Q_USAGE_ERROR 2199 X'00000897' 
MQRC_NAME_IN_USE 2201 X'00000899' 
MQRC_CONNECTION_QUIESCING 2202 X'0000089A' 
MQRC_CONNECTION_STOPPING 2203 X'0000089B' 
MQRC_ADAPTER_NOT_AVAILABLE 2204 X'0000089C' 
MQRC_MSG_ID_ERROR 2206 X'0000089E' 
MQRC_CORREL_ID_ERROR 2207 X'0000089F' 
MQRC_FILE_SYSTEM_ERROR 2208 X'000008A0' 
MQRC_NO_MSG_LOCKED 2209 X'000008A1' 
MQRC_SOAP_DOTNET_ERROR 2210 X'000008A2' 
MQRC_SOAP_AXIS_ERROR 2211 X'000008A3' 
MQRC_SOAP_URL_ERROR 2212 X'000008A4' 
MQRC_FILE_NOT_AUDITED 2216 X'000008A8' 
MQRC_CONNECTION_NOT_AUTHORIZED 2217 X'000008A9' 
MQRC_MSG_TOO_BIG_FOR_CHANNEL 2218 X'000008AA' 
MQRC_CALL_IN_PROGRESS 2219 X'000008AB' 
MQRC_RMH_ERROR 2220 X'000008AC' 
MQRC_Q_MGR_ACTIVE 2222 X'000008AE' 
MQRC_Q_MGR_NOT_ACTIVE 2223 X'000008AF' 
MQRC_Q_DEPTH_HIGH 2224 X'000008B0' 
MQRC_Q_DEPTH_LOW 2225 X'000008B1' 
MQRC_Q_SERVICE_INTERVAL_HIGH 2226 X'000008B2' 
MQRC_Q_SERVICE_INTERVAL_OK 2227 X'000008B3' 
MQRC_RFH_HEADER_FIELD_ERROR 2228 X'000008B4' 
MQRC_RAS_PROPERTY_ERROR 2229 X'000008B5' 
MQRC_UNIT_OF_WORK_NOT_STARTED 2232 X'000008B8' 
MQRC_CHANNEL_AUTO_DEF_OK 2233 X'000008B9' 
MQRC_CHANNEL_AUTO_DEF_ERROR 2234 X'000008BA' 
MQRC_CFH_ERROR 2235 X'000008BB' 
MQRC_CFIL_ERROR 2236 X'000008BC' 
MQRC_CFIN_ERROR 2237 X'000008BD' 
MQRC_CFSL_ERROR 2238 X'000008BE' 
MQRC_CFST_ERROR 2239 X'000008BF' 
MQRC_INCOMPLETE_GROUP 2241 X'000008C1' 
MQRC_INCOMPLETE_MSG 2242 X'000008C2' 
MQRC_INCONSISTENT_CCSIDS 2243 X'000008C3' 
MQRC_INCONSISTENT_ENCODINGS 2244 X'000008C4' 
MQRC_INCONSISTENT_UOW 2245 X'000008C5' 
MQRC_INVALID_MSG_UNDER_CURSOR 2246 X'000008C6' 
MQRC_MATCH_OPTIONS_ERROR 2247 X'000008C7' 
MQRC_MDE_ERROR 2248 X'000008C8' 
MQRC_MSG_FLAGS_ERROR 2249 X'000008C9' 
MQRC_MSG_SEQ_NUMBER_ERROR 2250 X'000008CA' 
MQRC_OFFSET_ERROR 2251 X'000008CB' 
MQRC_ORIGINAL_LENGTH_ERROR 2252 X'000008CC' 
MQRC_SEGMENT_LENGTH_ZERO 2253 X'000008CD' 
MQRC_UOW_NOT_AVAILABLE 2255 X'000008CF' 
MQRC_WRONG_GMO_VERSION 2256 X'000008D0' 
MQRC_WRONG_MD_VERSION 2257 X'000008D1' 
MQRC_GROUP_ID_ERROR 2258 X'000008D2' 
MQRC_INCONSISTENT_BROWSE 2259 X'000008D3' 
MQRC_XQH_ERROR 2260 X'000008D4' 
MQRC_SRC_ENV_ERROR 2261 X'000008D5' 
MQRC_SRC_NAME_ERROR 2262 X'000008D6' 
MQRC_DEST_ENV_ERROR 2263 X'000008D7' 
MQRC_DEST_NAME_ERROR 2264 X'000008D8' 
MQRC_TM_ERROR 2265 X'000008D9' 
MQRC_CLUSTER_EXIT_ERROR 2266 X'000008DA' 
MQRC_CLUSTER_EXIT_LOAD_ERROR 2267 X'000008DB' 
MQRC_CLUSTER_PUT_INHIBITED 2268 X'000008DC' 
MQRC_CLUSTER_RESOURCE_ERROR 2269 X'000008DD' 
MQRC_NO_DESTINATIONS_AVAILABLE 2270 X'000008DE' 
MQRC_CONN_TAG_IN_USE 2271 X'000008DF' 
MQRC_PARTIALLY_CONVERTED 2272 X'000008E0' 
MQRC_CONNECTION_ERROR 2273 X'000008E1' 
MQRC_OPTION_ENVIRONMENT_ERROR 2274 X'000008E2' 
MQRC_CD_ERROR 2277 X'000008E5' 
MQRC_CLIENT_CONN_ERROR 2278 X'000008E6' 
MQRC_CHANNEL_STOPPED_BY_USER 2279 X'000008E7' 
MQRC_HCONFIG_ERROR 2280 X'000008E8' 
MQRC_FUNCTION_ERROR 2281 X'000008E9' 
MQRC_CHANNEL_STARTED 2282 X'000008EA' 
MQRC_CHANNEL_STOPPED 2283 X'000008EB' 
MQRC_CHANNEL_CONV_ERROR 2284 X'000008EC' 
MQRC_SERVICE_NOT_AVAILABLE 2285 X'000008ED' 
MQRC_INITIALIZATION_FAILED 2286 X'000008EE' 
MQRC_TERMINATION_FAILED 2287 X'000008EF' 
MQRC_UNKNOWN_Q_NAME 2288 X'000008F0' 
MQRC_SERVICE_ERROR 2289 X'000008F1' 
MQRC_Q_ALREADY_EXISTS 2290 X'000008F2' 
MQRC_USER_ID_NOT_AVAILABLE 2291 X'000008F3' 
MQRC_UNKNOWN_ENTITY 2292 X'000008F4' 
MQRC_UNKNOWN_AUTH_ENTITY 2293 X'000008F5' 
MQRC_UNKNOWN_REF_OBJECT 2294 X'000008F6' 
MQRC_CHANNEL_ACTIVATED 2295 X'000008F7' 
MQRC_CHANNEL_NOT_ACTIVATED 2296 X'000008F8' 
MQRC_UOW_CANCELED 2297 X'000008F9' 
MQRC_FUNCTION_NOT_SUPPORTED 2298 X'000008FA' 
MQRC_SELECTOR_TYPE_ERROR 2299 X'000008FB' 
MQRC_COMMAND_TYPE_ERROR 2300 X'000008FC' 
MQRC_MULTIPLE_INSTANCE_ERROR 2301 X'000008FD' 
MQRC_SYSTEM_ITEM_NOT_ALTERABLE 2302 X'000008FE' 
MQRC_BAG_CONVERSION_ERROR 2303 X'000008FF' 
MQRC_SELECTOR_OUT_OF_RANGE 2304 X'00000900' 
MQRC_SELECTOR_NOT_UNIQUE 2305 X'00000901' 
MQRC_INDEX_NOT_PRESENT 2306 X'00000902' 
MQRC_STRING_ERROR 2307 X'00000903' 
MQRC_ENCODING_NOT_SUPPORTED 2308 X'00000904' 
MQRC_SELECTOR_NOT_PRESENT 2309 X'00000905' 
MQRC_OUT_SELECTOR_ERROR 2310 X'00000906' 
MQRC_STRING_TRUNCATED 2311 X'00000907' 
MQRC_SELECTOR_WRONG_TYPE 2312 X'00000908' 
MQRC_INCONSISTENT_ITEM_TYPE 2313 X'00000909' 
MQRC_INDEX_ERROR 2314 X'0000090A' 
MQRC_SYSTEM_BAG_NOT_ALTERABLE 2315 X'0000090B' 
MQRC_ITEM_COUNT_ERROR 2316 X'0000090C' 
MQRC_FORMAT_NOT_SUPPORTED 2317 X'0000090D' 
MQRC_SELECTOR_NOT_SUPPORTED 2318 X'0000090E' 
MQRC_ITEM_VALUE_ERROR 2319 X'0000090F' 
MQRC_HBAG_ERROR 2320 X'00000910' 
MQRC_PARAMETER_MISSING 2321 X'00000911' 
MQRC_CMD_SERVER_NOT_AVAILABLE 2322 X'00000912' 
MQRC_STRING_LENGTH_ERROR 2323 X'00000913' 
MQRC_INQUIRY_COMMAND_ERROR 2324 X'00000914' 
MQRC_NESTED_BAG_NOT_SUPPORTED 2325 X'00000915' 
MQRC_BAG_WRONG_TYPE 2326 X'00000916' 
MQRC_ITEM_TYPE_ERROR 2327 X'00000917' 
MQRC_SYSTEM_BAG_NOT_DELETABLE 2328 X'00000918' 
MQRC_SYSTEM_ITEM_NOT_DELETABLE 2329 X'00000919' 
MQRC_CODED_CHAR_SET_ID_ERROR 2330 X'0000091A' 
MQRC_MSG_TOKEN_ERROR 2331 X'0000091B' 
MQRC_MISSING_WIH 2332 X'0000091C' 
MQRC_WIH_ERROR 2333 X'0000091D' 
MQRC_RFH_ERROR 2334 X'0000091E' 
MQRC_RFH_STRING_ERROR 2335 X'0000091F' 
MQRC_RFH_COMMAND_ERROR 2336 X'00000920' 
MQRC_RFH_PARM_ERROR 2337 X'00000921' 
MQRC_RFH_DUPLICATE_PARM 2338 X'00000922' 
MQRC_RFH_PARM_MISSING 2339 X'00000923' 
MQRC_CHAR_CONVERSION_ERROR 2340 X'00000924' 
MQRC_UCS2_CONVERSION_ERROR 2341 X'00000925' 
MQRC_DB2_NOT_AVAILABLE 2342 X'00000926' 
MQRC_OBJECT_NOT_UNIQUE 2343 X'00000927' 
MQRC_CONN_TAG_NOT_RELEASED 2344 X'00000928' 
MQRC_CF_NOT_AVAILABLE 2345 X'00000929' 
MQRC_CF_STRUC_IN_USE 2346 X'0000092A' 
MQRC_CF_STRUC_LIST_HDR_IN_USE 2347 X'0000092B' 
MQRC_CF_STRUC_AUTH_FAILED 2348 X'0000092C' 
MQRC_CF_STRUC_ERROR 2349 X'0000092D' 
MQRC_CONN_TAG_NOT_USABLE 2350 X'0000092E' 
MQRC_GLOBAL_UOW_CONFLICT 2351 X'0000092F' 
MQRC_LOCAL_UOW_CONFLICT 2352 X'00000930' 
MQRC_HANDLE_IN_USE_FOR_UOW 2353 X'00000931' 
MQRC_UOW_ENLISTMENT_ERROR 2354 X'00000932' 
MQRC_UOW_MIX_NOT_SUPPORTED 2355 X'00000933' 
MQRC_WXP_ERROR 2356 X'00000934' 
MQRC_CURRENT_RECORD_ERROR 2357 X'00000935' 
MQRC_NEXT_OFFSET_ERROR 2358 X'00000936' 
MQRC_NO_RECORD_AVAILABLE 2359 X'00000937' 
MQRC_OBJECT_LEVEL_INCOMPATIBLE 2360 X'00000938' 
MQRC_NEXT_RECORD_ERROR 2361 X'00000939' 
MQRC_BACKOUT_THRESHOLD_REACHED 2362 X'0000093A' 
MQRC_MSG_NOT_MATCHED 2363 X'0000093B' 
MQRC_JMS_FORMAT_ERROR 2364 X'0000093C' 
MQRC_SEGMENTS_NOT_SUPPORTED 2365 X'0000093D' 
MQRC_WRONG_CF_LEVEL 2366 X'0000093E' 
MQRC_CONFIG_CREATE_OBJECT 2367 X'0000093F' 
MQRC_CONFIG_CHANGE_OBJECT 2368 X'00000940' 
MQRC_CONFIG_DELETE_OBJECT 2369 X'00000941' 
MQRC_CONFIG_REFRESH_OBJECT 2370 X'00000942' 
MQRC_CHANNEL_SSL_ERROR 2371 X'00000943' 
MQRC_CF_STRUC_FAILED 2373 X'00000945' 
MQRC_API_EXIT_ERROR 2374 X'00000946' 
MQRC_API_EXIT_INIT_ERROR 2375 X'00000947' 
MQRC_API_EXIT_TERM_ERROR 2376 X'00000948' 
MQRC_EXIT_REASON_ERROR 2377 X'00000949' 
MQRC_RESERVED_VALUE_ERROR 2378 X'0000094A' 
MQRC_NO_DATA_AVAILABLE 2379 X'0000094B' 
MQRC_SCO_ERROR 2380 X'0000094C' 
MQRC_KEY_REPOSITORY_ERROR 2381 X'0000094D' 
MQRC_CRYPTO_HARDWARE_ERROR 2382 X'0000094E' 
MQRC_AUTH_INFO_REC_COUNT_ERROR 2383 X'0000094F' 
MQRC_AUTH_INFO_REC_ERROR 2384 X'00000950' 
MQRC_AIR_ERROR 2385 X'00000951' 
MQRC_AUTH_INFO_TYPE_ERROR 2386 X'00000952' 
MQRC_AUTH_INFO_CONN_NAME_ERROR 2387 X'00000953' 
MQRC_LDAP_USER_NAME_ERROR 2388 X'00000954' 
MQRC_LDAP_USER_NAME_LENGTH_ERR 2389 X'00000955' 
MQRC_LDAP_PASSWORD_ERROR 2390 X'00000956' 
MQRC_SSL_ALREADY_INITIALIZED 2391 X'00000957' 
MQRC_SSL_CONFIG_ERROR 2392 X'00000958' 
MQRC_SSL_INITIALIZATION_ERROR 2393 X'00000959' 
MQRC_Q_INDEX_TYPE_ERROR 2394 X'0000095A' 
MQRC_CFBS_ERROR 2395 X'0000095B' 
MQRC_SSL_NOT_ALLOWED 2396 X'0000095C' 
MQRC_JSSE_ERROR 2397 X'0000095D' 
MQRC_SSL_PEER_NAME_MISMATCH 2398 X'0000095E' 
MQRC_SSL_PEER_NAME_ERROR 2399 X'0000095F' 
MQRC_UNSUPPORTED_CIPHER_SUITE 2400 X'00000960' 
MQRC_SSL_CERTIFICATE_REVOKED 2401 X'00000961' 
MQRC_SSL_CERT_STORE_ERROR 2402 X'00000962' 
MQRC_CLIENT_EXIT_LOAD_ERROR 2406 X'00000966' 
MQRC_CLIENT_EXIT_ERROR 2407 X'00000967' 
MQRC_SSL_KEY_RESET_ERROR 2409 X'00000969' 
MQRC_UNKNOWN_COMPONENT_NAME 2410 X'0000096A' 
MQRC_LOGGER_STATUS 2411 X'0000096B' 
MQRC_COMMAND_MQSC 2412 X'0000096C' 
MQRC_COMMAND_PCF 2413 X'0000096D' 
MQRC_CFIF_ERROR 2414 X'0000096E' 
MQRC_CFSF_ERROR 2415 X'0000096F' 
MQRC_CFGR_ERROR 2416 X'00000970' 
MQRC_MSG_NOT_ALLOWED_IN_GROUP 2417 X'00000971' 
MQRC_FILTER_OPERATOR_ERROR 2418 X'00000972' 
MQRC_NESTED_SELECTOR_ERROR 2419 X'00000973' 
MQRC_EPH_ERROR 2420 X'00000974' 
MQRC_RFH_FORMAT_ERROR 2421 X'00000975' 
MQRC_CFBF_ERROR 2422 X'00000976' 
MQRC_CLIENT_CHANNEL_CONFLICT 2423 X'00000977' 
MQRC_REOPEN_EXCL_INPUT_ERROR 6100 X'000017D4' 
MQRC_REOPEN_INQUIRE_ERROR 6101 X'000017D5' 
MQRC_REOPEN_SAVED_CONTEXT_ERR 6102 X'000017D6' 
MQRC_REOPEN_TEMPORARY_Q_ERROR 6103 X'000017D7' 
MQRC_ATTRIBUTE_LOCKED 6104 X'000017D8' 
MQRC_CURSOR_NOT_VALID 6105 X'000017D9' 
MQRC_ENCODING_ERROR 6106 X'000017DA' 
MQRC_STRUC_ID_ERROR 6107 X'000017DB' 
MQRC_NULL_POINTER 6108 X'000017DC' 
MQRC_NO_CONNECTION_REFERENCE 6109 X'000017DD' 
MQRC_NO_BUFFER 6110 X'000017DE' 
MQRC_BINARY_DATA_LENGTH_ERROR 6111 X'000017DF' 
MQRC_BUFFER_NOT_AUTOMATIC 6112 X'000017E0' 
MQRC_INSUFFICIENT_BUFFER 6113 X'000017E1' 
MQRC_INSUFFICIENT_DATA 6114 X'000017E2' 
MQRC_DATA_TRUNCATED 6115 X'000017E3' 
MQRC_ZERO_LENGTH 6116 X'000017E4' 
MQRC_NEGATIVE_LENGTH 6117 X'000017E5' 
MQRC_NEGATIVE_OFFSET 6118 X'000017E6' 
MQRC_INCONSISTENT_FORMAT 6119 X'000017E7' 
MQRC_INCONSISTENT_OBJECT_STATE 6120 X'000017E8' 
MQRC_CONTEXT_OBJECT_NOT_VALID 6121 X'000017E9' 
MQRC_CONTEXT_OPEN_ERROR 6122 X'000017EA' 
MQRC_STRUC_LENGTH_ERROR 6123 X'000017EB' 
MQRC_NOT_CONNECTED 6124 X'000017EC' 
MQRC_NOT_OPEN 6125 X'000017ED' 
MQRC_DISTRIBUTION_LIST_EMPTY 6126 X'000017EE' 
MQRC_INCONSISTENT_OPEN_OPTIONS 6127 X'000017EF' 
MQRC_WRONG_VERSION 6128 X'000017F0' 
MQRC_REFERENCE_ERROR 6129 X'000017F1' 


Reason code cross reference
===========================

3091 (X'0C13') 
MQRCCF_ACTION_VALUE_ERROR 
3166 (X'0C5E') 
MQRCCF_ALLOC_FAST_TIMER_ERROR 
3164 (X'0C5C') 
MQRCCF_ALLOC_RETRY_ERROR 
3165 (X'0C5D') 
MQRCCF_ALLOC_SLOW_TIMER_ERROR 
4009 (X'0FA9') 
MQRCCF_ALLOCATE_FAILED 
3157 (X'0C55') 
MQRCCF_ALREADY_JOINED 
4005 (X'0FA5') 
MQRCCF_ATTR_VALUE_ERROR 
3213 (X'0C8D') 
MQRCCF_ATTR_VALUE_FIXED 
3171 (X'0C63') 
MQRCCF_AUTH_VALUE_ERROR 
3172 (X'0C64') 
MQRCCF_AUTH_VALUE_MISSING 
4086 (X'0FF6') 
MQRCCF_BATCH_INT_ERROR 
4087 (X'0FF7') 
MQRCCF_BATCH_INT_WRONG_TYPE 
3037 (X'0BDD') 
MQRCCF_BATCH_SIZE_ERROR 
4024 (X'0FB8') 
MQRCCF_BIND_FAILED 
3094 (X'0C16') 
MQRCCF_BROKER_COMMAND_FAILED 
3070 (X'0BFE') 
MQRCCF_BROKER_DELETED 
3049 (X'0BE9') 
MQRCCF_CCSID_ERROR 
4068 (X'0FE4') 
MQRCCF_CELL_DIR_NOT_AVAILABLE 
3236 (X'0CA4') 
MQRCCF_CF_STRUC_ERROR 
3266 (X'0CC2') 
MQRCCF_CFBF_FILTER_VAL_LEN_ERR 
3264 (X'0CC0') 
MQRCCF_CFBF_LENGTH_ERROR 
3267 (X'0CC3') 
MQRCCF_CFBF_OPERATOR_ERROR 
3265 (X'0CC1') 
MQRCCF_CFBF_PARM_ID_ERROR 
3254 (X'0CB6') 
MQRCCF_CFBS_DUPLICATE_PARM 
3255 (X'0CB7') 
MQRCCF_CFBS_LENGTH_ERROR 
3256 (X'0CB8') 
MQRCCF_CFBS_PARM_ID_ERROR 
3257 (X'0CB9') 
MQRCCF_CFBS_STRING_LENGTH_ERR 
3258 (X'0CBA') 
MQRCCF_CFGR_LENGTH_ERROR 
3259 (X'0CBB') 
MQRCCF_CFGR_PARM_COUNT_ERROR 
3240 (X'0CA8') 
MQRCCF_CFGR_PARM_ID_ERROR 
3007 (X'0BBF') 
MQRCCF_CFH_COMMAND_ERROR 
3005 (X'0BBD') 
MQRCCF_CFH_CONTROL_ERROR 
3002 (X'0BBA') 
MQRCCF_CFH_LENGTH_ERROR 
3004 (X'0BBC') 
MQRCCF_CFH_MSG_SEQ_NUMBER_ERR 
3006 (X'0BBE') 
MQRCCF_CFH_PARM_COUNT_ERROR 
3001 (X'0BB9') 
MQRCCF_CFH_TYPE_ERROR 
3003 (X'0BBB') 
MQRCCF_CFH_VERSION_ERROR 
3241 (X'0CA9') 
MQRCCF_CFIF_LENGTH_ERROR 
3242 (X'0CAA') 
MQRCCF_CFIF_OPERATOR_ERROR 
3243 (X'0CAB') 
MQRCCF_CFIF_PARM_ID_ERROR 
3027 (X'0BD3') 
MQRCCF_CFIL_COUNT_ERROR 
3026 (X'0BD2') 
MQRCCF_CFIL_DUPLICATE_VALUE 
3028 (X'0BD4') 
MQRCCF_CFIL_LENGTH_ERROR 
3047 (X'0BE7') 
MQRCCF_CFIL_PARM_ID_ERROR 
3017 (X'0BC9') 
MQRCCF_CFIN_DUPLICATE_PARM 
3009 (X'0BC1') 
MQRCCF_CFIN_LENGTH_ERROR 
3014 (X'0BC6') 
MQRCCF_CFIN_PARM_ID_ERROR 
3244 (X'0CAC') 
MQRCCF_CFSF_FILTER_VAL_LEN_ERR 
3245 (X'0CAD') 
MQRCCF_CFSF_LENGTH_ERROR 
3246 (X'0CAE') 
MQRCCF_CFSF_OPERATOR_ERROR 
3247 (X'0CAF') 
MQRCCF_CFSF_PARM_ID_ERROR 
3068 (X'0BFC') 
MQRCCF_CFSL_COUNT_ERROR 
3066 (X'0BFA') 
MQRCCF_CFSL_DUPLICATE_PARM 
3024 (X'0BD0') 
MQRCCF_CFSL_LENGTH_ERROR 
3033 (X'0BD9') 
MQRCCF_CFSL_PARM_ID_ERROR 
3069 (X'0BFD') 
MQRCCF_CFSL_STRING_LENGTH_ERR 
3067 (X'0BFB') 
MQRCCF_CFSL_TOTAL_LENGTH_ERROR 
3095 (X'0C17') 
MQRCCF_CFST_CONFLICTING_PARM 
3018 (X'0BCA') 
MQRCCF_CFST_DUPLICATE_PARM 
3010 (X'0BC2') 
MQRCCF_CFST_LENGTH_ERROR 
3015 (X'0BC7') 
MQRCCF_CFST_PARM_ID_ERROR 
3011 (X'0BC3') 
MQRCCF_CFST_STRING_LENGTH_ERR 
4079 (X'0FEF') 
MQRCCF_CHAD_ERROR 
4081 (X'0FF1') 
MQRCCF_CHAD_EVENT_ERROR 
4082 (X'0FF2') 
MQRCCF_CHAD_EVENT_WRONG_TYPE 
4083 (X'0FF3') 
MQRCCF_CHAD_EXIT_ERROR 
4084 (X'0FF4') 
MQRCCF_CHAD_EXIT_WRONG_TYPE 
4080 (X'0FF0') 
MQRCCF_CHAD_WRONG_TYPE 
4042 (X'0FCA') 
MQRCCF_CHANNEL_ALREADY_EXISTS 
4090 (X'0FFA') 
MQRCCF_CHANNEL_CLOSED 
4038 (X'0FC6') 
MQRCCF_CHANNEL_DISABLED 
3235 (X'0CA3') 
MQRCCF_CHANNEL_ERROR 
4031 (X'0FBF') 
MQRCCF_CHANNEL_IN_USE 
4025 (X'0FB9') 
MQRCCF_CHANNEL_INDOUBT 
3218 (X'0C93') 
MQRCCF_CHANNEL_INITIATOR_ERROR 
4044 (X'0FCC') 
MQRCCF_CHANNEL_NAME_ERROR 
4064 (X'0FE0') 
MQRCCF_CHANNEL_NOT_ACTIVE 
4032 (X'0FC0') 
MQRCCF_CHANNEL_NOT_FOUND 
3062 (X'0BF6') 
MQRCCF_CHANNEL_TABLE_ERROR 
3034 (X'0BDA') 
MQRCCF_CHANNEL_TYPE_ERROR 
3064 (X'0BF8') 
MQRCCF_CHL_INST_TYPE_ERROR 
3065 (X'0BF9') 
MQRCCF_CHL_STATUS_NOT_FOUND 
3168 (X'0C60') 
MQRCCF_CHL_SYSTEM_NOT_ACTIVE 
3088 (X'0C10') 
MQRCCF_CLUSTER_NAME_CONFLICT 
3090 (X'0C12') 
MQRCCF_CLUSTER_Q_USAGE_ERROR 
3008 (X'0BC0') 
MQRCCF_COMMAND_FAILED 
3204 (X'0C84') 
MQRCCF_COMMAND_INHIBITED 
3230 (X'0C9E') 
MQRCCF_COMMAND_LENGTH_ERROR 
3222 (X'0C96') 
MQRCCF_COMMAND_LEVEL_CONFLICT 
3231 (X'0C9F') 
MQRCCF_COMMAND_ORIGIN_ERROR 
3226 (X'0C9A') 
MQRCCF_COMMAND_REPLY_ERROR 
3225 (X'0C99') 
MQRCCF_COMMAND_SCOPE_ERROR 
4040 (X'0FC8') 
MQRCCF_COMMIT_FAILED 
3092 (X'0C14') 
MQRCCF_COMMS_LIBRARY_ERROR 
4011 (X'0FAB') 
MQRCCF_CONFIGURATION_ERROR 
4062 (X'0FDE') 
MQRCCF_CONN_NAME_ERROR 
3260 (X'0CBC') 
MQRCCF_CONN_NOT_STOPPED 
4017 (X'0FB1') 
MQRCCF_CONNECTION_CLOSED 
3174 (X'0C66') 
MQRCCF_CONNECTION_ID_ERROR 
4012 (X'0FAC') 
MQRCCF_CONNECTION_REFUSED 
3080 (X'0C08') 
MQRCCF_CORREL_ID_ERROR 
3052 (X'0BEC') 
MQRCCF_DATA_CONV_VALUE_ERROR 
4043 (X'0FCB') 
MQRCCF_DATA_TOO_LARGE 
3087 (X'0C0F') 
MQRCCF_DEL_OPTIONS_ERROR 
3038 (X'0BDE') 
MQRCCF_DISC_INT_ERROR 
4054 (X'0FD6') 
MQRCCF_DISC_INT_WRONG_TYPE 
3163 (X'0C5B') 
MQRCCF_DISC_RETRY_ERROR 
3211 (X'0C8B') 
MQRCCF_DISPOSITION_CONFLICT 
3078 (X'0C06') 
MQRCCF_DUPLICATE_IDENTITY 
3152 (X'0C50') 
MQRCCF_DUPLICATE_SUBSCRIPTION 
4067 (X'0FE3') 
MQRCCF_DYNAMIC_Q_SCOPE_ERROR 
3050 (X'0BEA') 
MQRCCF_ENCODING_ERROR 
3169 (X'0C61') 
MQRCCF_ENTITY_NAME_MISSING 
4013 (X'0FAD') 
MQRCCF_ENTRY_ERROR 
3054 (X'0BEE') 
MQRCCF_ESCAPE_TYPE_ERROR 
3224 (X'0C98') 
MQRCCF_EVENTS_DISABLED 
3162 (X'0C5A') 
MQRCCF_FILE_NOT_AVAILABLE 
3150 (X'0C4E') 
MQRCCF_FILTER_ERROR 
3012 (X'0BC4') 
MQRCCF_FORCE_VALUE_ERROR 
3227 (X'0C9B') 
MQRCCF_FUNCTION_RESTRICTED 
4077 (X'0FED') 
MQRCCF_HB_INTERVAL_ERROR 
4078 (X'0FEE') 
MQRCCF_HB_INTERVAL_WRONG_TYPE 
4010 (X'0FAA') 
MQRCCF_HOST_NOT_AVAILABLE 
3079 (X'0C07') 
MQRCCF_INCORRECT_Q 
3075 (X'0C03') 
MQRCCF_INCORRECT_STREAM 
3053 (X'0BED') 
MQRCCF_INDOUBT_VALUE_ERROR 
4003 (X'0FA3') 
MQRCCF_LIKE_OBJECT_WRONG_TYPE 
3232 (X'0CA0') 
MQRCCF_LISTENER_CONFLICT 
4020 (X'0FB4') 
MQRCCF_LISTENER_NOT_STARTED 
3249 (X'0CB1') 
MQRCCF_LISTENER_RUNNING 
3233 (X'0CA1') 
MQRCCF_LISTENER_STARTED 
3268 (X'0CC4') 
MQRCCF_LISTENER_STILL_ACTIVE 
3234 (X'0CA2') 
MQRCCF_LISTENER_STOPPED 
3175 (X'0C67') 
MQRCCF_LOG_TYPE_ERROR 
3041 (X'0BE1') 
MQRCCF_LONG_RETRY_ERROR 
4057 (X'0FD9') 
MQRCCF_LONG_RETRY_WRONG_TYPE 
3042 (X'0BE2') 
MQRCCF_LONG_TIMER_ERROR 
4058 (X'0FDA') 
MQRCCF_LONG_TIMER_WRONG_TYPE 
3250 (X'0CB2') 
MQRCCF_LSTR_STATUS_NOT_FOUND 
3044 (X'0BE4') 
MQRCCF_MAX_MSG_LENGTH_ERROR 
4047 (X'0FCF') 
MQRCCF_MCA_NAME_ERROR 
4053 (X'0FD5') 
MQRCCF_MCA_NAME_WRONG_TYPE 
3063 (X'0BF7') 
MQRCCF_MCA_TYPE_ERROR 
3023 (X'0BCF') 
MQRCCF_MD_FORMAT_ERROR 
4061 (X'0FDD') 
MQRCCF_MISSING_CONN_NAME 
3029 (X'0BD5') 
MQRCCF_MODE_VALUE_ERROR 
4026 (X'0FBA') 
MQRCCF_MQCONN_FAILED 
4028 (X'0FBC') 
MQRCCF_MQGET_FAILED 
4036 (X'0FC4') 
MQRCCF_MQINQ_FAILED 
4027 (X'0FBB') 
MQRCCF_MQOPEN_FAILED 
4029 (X'0FBD') 
MQRCCF_MQPUT_FAILED 
4063 (X'0FDF') 
MQRCCF_MQSET_FAILED 
4069 (X'0FE5') 
MQRCCF_MR_COUNT_ERROR 
4070 (X'0FE6') 
MQRCCF_MR_COUNT_WRONG_TYPE 
4071 (X'0FE7') 
MQRCCF_MR_EXIT_NAME_ERROR 
4072 (X'0FE8') 
MQRCCF_MR_EXIT_NAME_WRONG_TYPE 
4073 (X'0FE9') 
MQRCCF_MR_INTERVAL_ERROR 
4074 (X'0FEA') 
MQRCCF_MR_INTERVAL_WRONG_TYPE 
4050 (X'0FD2') 
MQRCCF_MSG_EXIT_NAME_ERROR 
3016 (X'0BC8') 
MQRCCF_MSG_LENGTH_ERROR 
3030 (X'0BD6') 
MQRCCF_MSG_SEQ_NUMBER_ERROR 
3048 (X'0BE8') 
MQRCCF_MSG_TRUNCATED 
3215 (X'0C8F') 
MQRCCF_NAMELIST_ERROR 
4088 (X'0FF8') 
MQRCCF_NET_PRIORITY_ERROR 
4089 (X'0FF9') 
MQRCCF_NET_PRIORITY_WRONG_TYPE 
3093 (X'0C15') 
MQRCCF_NETBIOS_NAME_ERROR 
3217 (X'0C91') 
MQRCCF_NO_CHANNEL_INITIATOR 
4019 (X'0FB3') 
MQRCCF_NO_COMMS_MANAGER 
3077 (X'0C05') 
MQRCCF_NO_RETAINED_MSG 
3262 (X'0CBE') 
MQRCCF_NO_START_CMD 
3263 (X'0CBF') 
MQRCCF_NO_STOP_CMD 
4018 (X'0FB2') 
MQRCCF_NO_STORAGE 
3239 (X'0CA7') 
MQRCCF_NO_XCF_PARTNER 
3200 (X'0C80') 
MQRCCF_NONE_FOUND 
3081 (X'0C09') 
MQRCCF_NOT_AUTHORIZED 
3073 (X'0C01') 
MQRCCF_NOT_REGISTERED 
4037 (X'0FC5') 
MQRCCF_NOT_XMIT_Q 
4075 (X'0FEB') 
MQRCCF_NPM_SPEED_ERROR 
4076 (X'0FEC') 
MQRCCF_NPM_SPEED_WRONG_TYPE 
4001 (X'0FA1') 
MQRCCF_OBJECT_ALREADY_EXISTS 
3205 (X'0C85') 
MQRCCF_OBJECT_BEING_DELETED 
3160 (X'0C58') 
MQRCCF_OBJECT_IN_USE 
3209 (X'0C89') 
MQRCCF_OBJECT_LIMIT_EXCEEDED 
4008 (X'0FA8') 
MQRCCF_OBJECT_NAME_ERROR 
3208 (X'0C88') 
MQRCCF_OBJECT_NAME_RESTRICTED 
4004 (X'0FA4') 
MQRCCF_OBJECT_OPEN 
3210 (X'0C8A') 
MQRCCF_OBJECT_OPEN_FORCE 
3173 (X'0C65') 
MQRCCF_OBJECT_TYPE_MISSING 
4002 (X'0FA2') 
MQRCCF_OBJECT_WRONG_TYPE 
3203 (X'0C83') 
MQRCCF_PARM_CONFLICT 
3020 (X'0BCC') 
MQRCCF_PARM_COUNT_TOO_BIG 
3019 (X'0BCB') 
MQRCCF_PARM_COUNT_TOO_SMALL 
3228 (X'0C9C') 
MQRCCF_PARM_MISSING 
3035 (X'0BDB') 
MQRCCF_PARM_SEQUENCE_ERROR 
3097 (X'0C19') 
MQRCCF_PARM_SYNTAX_ERROR 
3229 (X'0C9D') 
MQRCCF_PARM_VALUE_ERROR 
3096 (X'0C18') 
MQRCCF_PATH_NOT_VALID 
3032 (X'0BD8') 
MQRCCF_PING_DATA_COMPARE_ERROR 
3031 (X'0BD7') 
MQRCCF_PING_DATA_COUNT_ERROR 
4030 (X'0FBE') 
MQRCCF_PING_ERROR 
3167 (X'0C5F') 
MQRCCF_PORT_NUMBER_ERROR 
3170 (X'0C62') 
MQRCCF_PROFILE_NAME_ERROR 
3177 (X'0C69') 
MQRCCF_PROGRAM_AUTH_FAILED 
3176 (X'0C68') 
MQRCCF_PROGRAM_NOT_AVAILABLE 
3084 (X'0C0C') 
MQRCCF_PUB_OPTIONS_ERROR 
3046 (X'0BE6') 
MQRCCF_PURGE_VALUE_ERROR 
3045 (X'0BE5') 
MQRCCF_PUT_AUTH_ERROR 
4059 (X'0FDB') 
MQRCCF_PUT_AUTH_WRONG_TYPE 
3098 (X'0C1A') 
MQRCCF_PWD_LENGTH_ERROR 
3021 (X'0BCD') 
MQRCCF_Q_ALREADY_IN_CELL 
3223 (X'0C97') 
MQRCCF_Q_ATTR_CONFLICT 
3086 (X'0C0E') 
MQRCCF_Q_MGR_CCSID_ERROR 
3074 (X'0C02') 
MQRCCF_Q_MGR_NAME_ERROR 
3212 (X'0C8C') 
MQRCCF_Q_MGR_NOT_IN_QSG 
3076 (X'0C04') 
MQRCCF_Q_NAME_ERROR 
3022 (X'0BCE') 
MQRCCF_Q_TYPE_ERROR 
4007 (X'0FA7') 
MQRCCF_Q_WRONG_TYPE 
3029 (X'0BD5') 
MQRCCF_QUIESCE_VALUE_ERROR 
4051 (X'0FD3') 
MQRCCF_RCV_EXIT_NAME_ERROR 
4016 (X'0FB0') 
MQRCCF_RECEIVE_FAILED 
4015 (X'0FAF') 
MQRCCF_RECEIVED_DATA_ERROR 
3083 (X'0C0B') 
MQRCCF_REG_OPTIONS_ERROR 
4035 (X'0FC3') 
MQRCCF_REMOTE_QM_TERMINATING 
4034 (X'0FC2') 
MQRCCF_REMOTE_QM_UNAVAILABLE 
3025 (X'0BD1') 
MQRCCF_REPLACE_VALUE_ERROR 
3089 (X'0C11') 
MQRCCF_REPOS_NAME_CONFLICT 
4095 (X'0FFF') 
MQRCCF_RETAINED_NOT_SUPPORTED 
4049 (X'0FD1') 
MQRCCF_SEC_EXIT_NAME_ERROR 
3202 (X'0C82') 
MQRCCF_SECURITY_REFRESH_FAILED 
3201 (X'0C81') 
MQRCCF_SECURITY_SWITCH_OFF 
4048 (X'0FD0') 
MQRCCF_SEND_EXIT_NAME_ERROR 
4014 (X'0FAE') 
MQRCCF_SEND_FAILED 
3043 (X'0BE3') 
MQRCCF_SEQ_NUMBER_WRAP_ERROR 
3252 (X'0CB4') 
MQRCCF_SERV_STATUS_NOT_FOUND 
3261 (X'0CBD') 
MQRCCF_SERVICE_REQUEST_PENDING 
3251 (X'0CB3') 
MQRCCF_SERVICE_RUNNING 
3253 (X'0CB5') 
MQRCCF_SERVICE_STOPPED 
3039 (X'0BDF') 
MQRCCF_SHORT_RETRY_ERROR 
4055 (X'0FD7') 
MQRCCF_SHORT_RETRY_WRONG_TYPE 
3040 (X'0BE0') 
MQRCCF_SHORT_TIMER_ERROR 
4056 (X'0FD8') 
MQRCCF_SHORT_TIMER_WRONG_TYPE 
4092 (X'0FFC') 
MQRCCF_SSL_CIPHER_SPEC_ERROR 
4094 (X'0FFE') 
MQRCCF_SSL_CLIENT_AUTH_ERROR 
4093 (X'0FFD') 
MQRCCF_SSL_PEER_NAME_ERROR 
3207 (X'0C87') 
MQRCCF_STORAGE_CLASS_IN_USE 
3071 (X'0BFF') 
MQRCCF_STREAM_ERROR 
3013 (X'0BC5') 
MQRCCF_STRUCTURE_TYPE_ERROR 
3154 (X'0C52') 
MQRCCF_SUB_IDENTITY_ERROR 
3153 (X'0C51') 
MQRCCF_SUB_NAME_ERROR 
3155 (X'0C53') 
MQRCCF_SUBSCRIPTION_IN_USE 
3156 (X'0C54') 
MQRCCF_SUBSCRIPTION_LOCKED 
4085 (X'0FF5') 
MQRCCF_SUPPRESSED_BY_EXIT 
4065 (X'0FE1') 
MQRCCF_TERMINATED_BY_SEC_EXIT 
3248 (X'0CB0') 
MQRCCF_TOO_MANY_FILTERS 
3072 (X'0C00') 
MQRCCF_TOPIC_ERROR 
3238 (X'0CA6') 
MQRCCF_UNEXPECTED_ERROR 
3085 (X'0C0D') 
MQRCCF_UNKNOWN_BROKER 
3161 (X'0C59') 
MQRCCF_UNKNOWN_FILE_NAME 
4006 (X'0FA6') 
MQRCCF_UNKNOWN_Q_MGR 
4033 (X'0FC1') 
MQRCCF_UNKNOWN_REMOTE_CHANNEL 
3082 (X'0C0A') 
MQRCCF_UNKNOWN_STREAM 
3237 (X'0CA5') 
MQRCCF_UNKNOWN_USER_ID 
4039 (X'0FC7') 
MQRCCF_USER_EXIT_NOT_AVAILABLE 
4041 (X'0FC9') 
MQRCCF_WRONG_CHANNEL_TYPE 
3151 (X'0C4F') 
MQRCCF_WRONG_USER 
3036 (X'0BDC') 
MQRCCF_XMIT_PROTOCOL_TYPE_ERR 
4045 (X'0FCD') 
MQRCCF_XMIT_Q_NAME_ERROR 
4052 (X'0FD4') 
MQRCCF_XMIT_Q_NAME_WRONG_TYPE 
 

Secure Sockets Layer (SSL) return codes

1 Handle is not valid. 
3 An internal error has occured. 
4 Insufficient storage is available 
5 Handle is in the incorrect state. 
6 Key label is not found. 
7 No certificates available. 
8 Certificate validation error. 
9 Cryptographic processing error. 
10 ASN processing error. 
11 LDAP processing error. 
12 An unexpected error has occurred. 
102 Error detected while reading key database or SAF key ring. 
103 Incorrect key database record format. 
106 Incorrect key database password. 
109 No certification authority certificates. 
201 No key database password supplied. 
202 Error detected while opening the key database. 
203 Unable to generate temporary key pair 
204 Key database password is expired. 
302 Connection is active. 
401 Certificate is expired or is not valid yet. 
402 No SSL cipher specifications. 
403 No certificate received from partner. 
405 Certificate format is not supported. 
406 Error while reading or writing data. 
407 Key label does not exist. 
408 Key database password is not correct. 
410 SSL message format is incorrect. 
411 Message authentication code is incorrect. 
412 SSL protocol or certificate type is not supported. 
413 Certificate signature is incorrect. 
414 Certificate is not valid. 
415 SSL protocol violation. 
416 Permission denied. 
417 Self-signed certificate cannot be validated. 
420 Socket closed by remote partner. 
421 SSL V2 cipher is not valid. 
422 SSL V3 cipher is not valid. 
427 LDAP is not available. 
428 Key entry does not contain a private key. 
429 SSL V2 header is not valid. 
431 Certificate is revoked. 
432 Session renegotiation is not allowed. 
433 Key exceeds allowable export size. 
434 Certificate key is not compatible with cipher suite. 
435 Certification authority is unknown. 
436 Certificate revocation list cannot be processed. 
437 Connection closed. 
438 Internal error reported by remote partner. 
439 Unknown alert received from remote partner. 
501 Buffer size is not valid. 
502 Socket request would block. 
503 Socket read request would block. 
504 Socket write request would block. 
505 Record overflow. 
601 Protocol is not SSL V3 or TLS V1. 
602 Function identifier is not valid. 
701 Attribute identifier is not valid. 


z/OS and MQ related messages:
=============================


Other notes:
============


-------
Note
-------

Location of WebSphere MQ error logs:
------------------------------------
  
 Technote (troubleshooting) 
  
Problem(Abstract) 
Directions to find WebSpherer MQ and MQSeriesr error logs.

Note: The MQ error logs are by default located in the following directories, however it may have been changed at install time:  
  
 
Resolving the problem 

The messages that are recorded in the error logs and job logs are the most important information that you can provide when reporting an MQ problem.
. 
Select one of the following platforms to find the location of the MQ error logs: 

HP NSS
i5/OSr
OpenVMS
UNIXr and Linuxr
VSE
Windowsr
z/OSr 


HP NSS 
The WebSphere MQ for HP NSS error logs are located in the following directories: 

/var/mqm/errors
/var/mqm/qmgrs/<qmname>/errors

The error log files are named; AMQERR01.LOG, AMQERR02.LOG and AMQERR03.LOG.

Notes: 

If the queue manager name is not known then the error message is written to an error log file in the errors subdirectory. 
For example, if the default prefix is /usr/ibm/wmq/GA/var/mqm, the error message is written to an error log file 
in the directory /usr/ibm/wmq/GA/var/mqm/errors 
If the queue manager name is known, then the error message is written to an error log file in the queue manager's errors directory. 


i5/OS 
The WebSphere MQ for i5/OS error logs are located in the following directories: 

/QIBM/UserData/mqm/qmgrs/<qmname>/errors
/QIBM/UserData/mqm/qmgrs/&SYSTEM 

The error log files are named; AMQERR01.LOG, AMQERR02.LOG and AMQERR03.LOG. 


OpenVMS 
The WebSphere MQ HP OpenVMS error logs are located in the following directories: 

MQS_ROOT:[MQM.ERRORS]AMQERR01.LOG 
MQS_ROOT:[MQM.QMGRS.QMgrName.ERRORS]AMQERR01.LOG
MQS_ROOT:[MQM.QMGRS.$SYSTEM.ERRORS]AMQERR01.LOG 

The error log files are named; AMQERR01.LOG, AMQERR02.LOG and AMQERR03.LOG.

Notes: 

If an error has occurred with a client application: MQS_ROOT:[MQM.ERRORS]AMQERR01.LOG 
If the queue manager name is known and the queue manager is available: MQS_ROOT:[MQM.QMGRS.QMgrName.ERRORS]AMQERR01.LOG 
If the queue manager is not available: MQS_ROOT:[MQM.QMGRS.$SYSTEM.ERRORS]AMQERR01.LOG 


UNIX and Linux 
The WebSphere MQ for UNIX error logs are located in the following directories: 

/var/mqm/errors
/var/mqm/qmgrs/<qmname>/errors
/var/mqm/qmgrs/@SYSTEM/errors (not used at V6 and higher)

The error log files are named; AMQERR01.LOG, AMQERR02.LOG and AMQERR03.LOG.


VSE 
MQSeriesr uses the SYSTEM.LOG queue defined in the global system definition as its primary message log and additional 
informational messages are output to the VSE console. Typically, these detail starting, stopping, and initializing MQSeries for VSE 
If the SYSTEM.LOG queue is unavailable, the messages are directed to the CICS CSMT log. 

These messages should always be reviewed carefully for any error messages. The type of messages included in the SYSTEM.LOG queue 
can now be controlled by using the 'Log and Trace Settings'. 

Refer to the MQSeries for VSE System Management Guide, "Queue Manager Log and Trace Settings" on page 74 for details. 
Note that the types of messages put to the SYSTEM.LOG queue can be controlled 
using the LOG SETTINGS. When trying to resolve problems, ensure that all messages of all severity are
selected to be logged. You can view the contents of the system log using the Master Terminal transaction (MQMT) option 4 (Browse Queue Records).


Windows 
The WebSphere MQ for Windows error logs are located in the following directories. This is the default directory path, however it may 
have been changed at install time:

c:\Program Files\IBM\WebSphere MQ\errors
c:\Program Files\IBM\WebSphere MQ\qmgrs\<qmname>\errors
c:\Program Files\IBM\WebSphere MQ\qmgrs\@SYSTEM\errors (not used at V6 and higher)

The error log files are named; AMQERR01.LOG, AMQERR02.LOG and AMQERR03.LOG.


z/OS 
The WebSphere MQ for OS/390 and z/OS job logs are located in the following: 

Syslog
MSTR job log
CHIN job log

The job logs are named; xxxxMSTR, and xxxxCHIN. Where xxxx is the WMQ subsystem identifier (ssid).


-------
Note:
-------

Article:

AMQ9209 and AMQ9228 messages flooding the error log
  
 Technote (troubleshooting) 
  
Problem(Abstract) 
Your WebSphere MQ queue manager records many AMQ9209 and AMQ9228 messages despite the fact 
all of the channels and clients were running normally.  
  
Symptom 
The system error log messages were: 

AMQ9209: Connection to host 'jtf (9.27.32.32)' closed.
AMQ9228: The TCP/IP responder program could not be started. 
 
 
Cause 
The error messages were caused by a port scanning tool, that runs on the user's network. Each time the scanner 
connected to the MQ port, the listener would assume a new MQ connection was being established. 

When the scanner disconnected without sending any data, then MQ wrote the AMQ9209 message to record the error. 
Because the listener did not know what channel the connection was for, it followed up with a generic AMQ9228 message 
rather than the more usual AMQ9999 message ("Channel ended abnormally").  
  
 
Resolving the problem 
Prevent the port scanner from accessing the WebSphere MQ ports.  
  
 
Historical Number 
88709999  
  
Product Alias/Synonym 
WebSphere MQ
WMQ
MQSeries  
 
 
-------
Note:
-------


-------
Note:
-------

thread:

Q:

Why do I get this error and why do my MDBs quit processing when I get this error?


2008-09-28 00:51:21,859 WARN  [org.jboss.mq.server.BasicQueue]

        Caught unusual exception sending message to receiver.

        org.jboss.util.threadpool.ThreadPoolFullException:

        java.lang.InterruptedException

               at

        org.jboss.util.threadpool.BasicThreadPool.execute(BasicThreadPool.java:417)

               at

        org.jboss.util.threadpool.BasicThreadPool.runTaskWrapper(BasicThreadPool.java:192)

               at

        org.jboss.util.threadpool.BasicThreadPool.run(BasicThreadPool.java:212)

               at

        org.jboss.util.threadpool.BasicThreadPool.run(BasicThreadPool.java:206)

               at

        org.jboss.mq.server.ClientConsumer.queueMessageForSending(ClientConsumer.java:125)

               at

        org.jboss.mq.server.BasicQueue.queueMessageForSending(BasicQueue.java:1161)

               at

        org.jboss.mq.server.BasicQueue.internalAddMessage(BasicQueue.java:1132)

               at org.jboss.mq.server.BasicQueue.access

        $000(BasicQueue.java:76)

               at org.jboss.mq.server.BasicQueue

        $AddMessagePostCommitTask.run(BasicQueue.java:1399)

               at org.jboss.mq.pm.Tx.commit(Tx.java:217)

               at

        org.jboss.mq.pm.TxManager.commitTx(TxManager.java:113)

               at

        org.jboss.mq.server.JMSDestinationManager.transact(JMSDestinationManager.java:468)

               at

        org.jboss.mq.server.ClientMonitorInterceptor.transact(ClientMonitorInterceptor.java:168)

               at

        org.jboss.mq.server.JMSServerInterceptorSupport.transact(JMSServerInterceptorSupport.java:126)

               at

        org.jboss.mq.security.ServerSecurityInterceptor.transact(ServerSecurityInterceptor.java:197)

               at

        org.jboss.mq.server.TracingInterceptor.transact(TracingInterceptor.java:352)

               at

        org.jboss.mq.server.JMSServerInvoker.transact(JMSServerInvoker.java:132)

               at

        org.jboss.mq.il.uil2.ServerSocketManagerHandler.handleMsg(ServerSocketManagerHandler.java:194)

               at org.jboss.mq.il.uil2.SocketManager

        $ReadTask.handleMsg(SocketManager.java:417)

               at

        org.jboss.mq.il.uil2.msgs.BaseMsg.run(BaseMsg.java:398)

               at EDU.oswego.cs.dl.util.concurrent.PooledExecutor

        $Worker.run(PooledExecutor.java:761)

               at java.lang.Thread.run()V(Unknown Source)
 
A:

You get this error becuase of a bug in the Oswego concurrent libraries.  This is an issue that we have worked around in JBoss mq.
 
It is fixed in the community 4.2.3 and it will be fixed in EAP 4.2 CP06.


-------
Note:
-------

thread:

Q:

Hi,
When I run a java program that connects to MQSeries through a shell
script from
qsh I get an MQSeries invalid environment error (reason code 2012).

1) Here is the call and results from the shell script:

> send.sh

MQJE001: Completion Code 2, Reason 2012

JS_LoadAccounts: MQ exception occurred - Completeion code: 2 Reason
code:
2012

com.ibm.mq.MQException: MQJE001: Completion Code 2, Reason 2012


2) When I copy the contents of the shell script (send.sh) and make the
call
outside of the shell script (still in qsh), it works fine:

===> java -classpath
".:$CLASSPATH:/QIBM/ProdData/mqm/java/lib/com.ibm.mq.jar:/QIBM/P
rodData/mqm/java/lib" com.services.JS_LoadAccounts testmsg POD1 POD_TEST

�

3) When I call the class from a CL it also works fine.

QMQM and QMQMJAVA are in the library list.

Any ideas?

Thanks in advance


A:

What version of operating system are you on?

I had this problem on some V5R1 systems, but not all. On a very fast
system it would do exactly this, run in QSH as individual commands, but
in the .sh it would fail, both in QSH and submitted as a QSH. Frank K.
at IBM advised me that it might be a timing level error on the system.
When the system was upgraded to V5R2, the problem disappeared.


-------
Note:
-------

thread:

Q:

How disable the MQ error 2033 to the stc_lh and stc_is log file

on 2/6/2009 7:48 AM
Hi all,
With Ican 505, we have multiple log MQ error 2033 to the stc_lh log file and to the stc_is log files.
Is it possible to disable this error or have you got any ideas to don't log this error, because it's not really an error ?
Thanks for your help.

A:


hi,
You can try this code:
MQException.logExclude(new Integer(MQException.MQRC_NO_MSG_AVAILABLE))
found here:
http://mqseries.net/phpBB2/viewtopic.php?t&H33479&hi ghlight=stderr&sid=8a7f7ccfa56d954da55e3447330eecd 2
Regards


A:

Hi ,
this is a faulty logging statement. As far as i know this has never been addressed in any of the update releases.
If Eric's suggestion does not work, then there's no way around it..


-------
Note:
-------

thread:

Q:

 am accessing MQ via .NET and I get 2059 errors using the conncetion code: 
---- 
MQEnvironment.Hostname = txtMQHost.Text; 
MQEnvironment.Channel = txtMQChan.Text; 
MQEnvironment.Port = Convert.ToInt32("1413"); 
MQEnvironment.UserId = "rhorn"; 

mqQMgr = new MQQueueManager(txtMQMgr.Text); 
---- 
2059 error occurs when I try to connect above. Checked all inputs and they are correct.

A:

Setting UserId may not do what you think it does. 

Specifying a qmgr name in the MQQueueManager constructor may not do what you think it does. 

Specifying a qmgr name in the wrong cAsE may not do what you think it does. 
 
A:

Thanks for the response. But after persisting to the admin., he found that I had the wrong IP address. 
Knew something wasn't right because I never got a 2059 before. 
 

-------
Note:
-------


thread:

Q:

19-Jul-07 10:32:04
 
I'm trying to use the MQSC or MQ Series client adapter and am getting the
following error when trying to send a message.  Any ideas?

Thanks,


Event Type:	Warning
Event Source:	BizTalk Server 2006
Event Category:	BizTalk Server 2006
Event ID:	5740
Date:		07/19/2007
Time:		9:13:16 AM
User:		N/A
Computer:	13694L
Description:
The adapter "MQSC" raised an error message. Details "The specified module
could not be found. (Exception from HRESULT: 0x8007007E)".

A:


-------
Note:
-------

thread:

Q:

As a very green installer of WebSphere, I am customizing WebSphere MQ 5.3.1
with z/OS 1.6s.  I've done very little to change the basic installation.
QMGR comes up without any errors.  When I start the channel initiator, it
fails immediately with the following series of messages on the console:


+CSQX090I %CSQ1 CSQXGIP CHINIT parameters ...
+CSQX091I %CSQ1 CSQXGIP TRAXSTR=YES, TRAXTBL=2, ADAPS=8, DISPS=5 
+CSQX092I %CSQ1 CSQXGIP CURRCHL=200, ACTCHL=200, LSTRTMR=60 CSQX093I
+%CSQ1 CSQXGIP TCPCHL=200, TCPKEEP=NO, TCPNAME=TCPIP, 280
TCPTYPE=OESOCKET, OPORTMIN=0, OPORTMAX=0
+CSQX094I %CSQ1 CSQXGIP LU62CHL=200, LUNAME= , LU62ARM= CSQX095I %CSQ1 
+CSQXGIP ADOPTMCA=NO, ADOPTCHK=ALL, RCVTIME=X0, RCVTMIN=0 CSQX096I %CSQ1 
+CSQXGIP DNSWLM=NO, DNSGROUP= , LUGROUP= CSQX099I %CSQ1 CSQXGIP Client 
+attachment feature available CSQX007E %CSQ1 CSQXADPI Unable to connect 
+to queue manager CSQ1, 285
MQCC=2 MQRC=2009
+CSQX140E %CSQ1 CSQXADPI Adapter failed to start CSQX005E %CSQ1 CSQXJST 
+Channel initiator failed to start
IEF402I CSQ1CHIN FAILED IN ADDRESS SPACE 0044 288
   SYSTEM ABEND S6C6 - REASON CODE F30905 $HASP310 CSQ1CHIN TERMINATED AT
END OF MEMORY


I've searched IBMLINK, the MQ manuals, the ques, the logs, the internet, the
list...  nothing or at least nothing I understand well enough to resolve
whatever is wrong.  Nothing else is using the port.  Security is turned off.
All the necessary libraries are APF auth'd and linklisted.  It has been a
long search and a long day; any ideas would be greatly appreciated.


A:

Your abend seems to indicate that you have run out of memory as others have pointed out.
 If it's having problems with ESTAE, the memory in question is likely LSQA memory and probably 
"below the line" memory. Message "$HASP310 CSQ1CHIN TERMINATED AT END OF MEMORY" hints at this. 
Go talk with your MVS systems programmer and have him set a slip on the abend to get 
an SVC dump (if you didn't already get one), and ask him/her to take a look at the dump 
for/with you. What's likely needed, is a modification to an MVS user exit called IEFUSI. 
Again, your MVS systems programmer can help. He can code the exit in such a way 
that slack space "below the line" is left for LSQA. IEFUSI can also limit the amout of storage available 
"above the line" and that may be the problem as well. Good luck! 


-------
Note:
-------

thread:

Q:

Hi,
I have just installed the Websphere MQSeries 5.3 on Solaris 8. I am trying to verify whether the installation is proper.
To do that, I created a created a queue manager using crtmqm Q1.q.manager.
After successfully executing that step, that I followed these steps.

bash-2.03$ strmqm Q1.QUEUE.MANAGER
WebSphere MQ was unable to display an error message 893.

bash-2.03$ runmqsc Q1.QUEUE.MANAGER
5724-B41 (C) Copyright IBM Corp. 1994, 2002. ALL RIGHTS RESERVED.
AMQ8146: WebSphere MQ queue manager not available.

No MQSC commands read.
No commands have a syntax error.

All valid MQSC commands were processed.
bash-2.03$

Can anyone please suggest the reason for getting AMQ8146 ERROR.


A:

AMQ6119: An internal WebSphere MQ error has occurred (Failed to attach shared
memory segment: shmat(ShmId 0x00000e80) [rc=-1 errno=24] Too many open files)
EXPLANATION:
MQ detected an unexpected error when calling the operating system. The MQ error
recording routine has been called.
ACTION:
Use the standard facilities supplied with your system to record the problem
identifier, and to save the generated output files. Contact your IBM support
center. Do not discard these files until the problem has been resolved.
----- amqxfdcx.c : 671 --------------------------------------------------------
08/30/06 07:23:58 AM
AMQ6184: An internal WebSphere MQ error has occurred on queue manager
Q1.QUEUE.MANAGER.

EXPLANATION:
An error has been detected, and the WebSphere MQ error recording routine has
been called. The failing process is process 26050.
ACTION:
Use the standard facilities supplied with your system to record the problem
identifier, and to save the generated output files. Contact your IBM support
center. Do not discard these files until the problem has been resolved.
----- amqxfdcx.c : 705 --------------------------------------------------------
08/30/06 07:23:59 AM
AMQ6119: An internal WebSphere MQ error has occurred ()

EXPLANATION:
MQ detected an unexpected error when calling the operating system. The MQ error
recording routine has been called.
ACTION:
Use the standard facilities supplied with your system to record the problem
identifier, and to save the generated output files. Contact your IBM support
center. Do not discard these files until the problem has been resolved.
----- amqxfdcx.c : 671 --------------------------------------------------------
08/30/06 07:23:59 AM
AMQ6184: An internal WebSphere MQ error has occurred on queue manager
Q1.QUEUE.MANAGER.
EXPLANATION:
An error has been detected, and the WebSphere MQ error recording routine has
been called. The failing process is process 26050.
ACTION:
Use the standard facilities supplied with your system to record the problem
identifier, and to save the generated output files. Contact your IBM support
center. Do not discard these files until the problem has been resolved.

May be this error log help you in resolution of problem.

Thanks,


-------
Note:
-------


thread:

Q:


> My system send messages to a queue. However, the size of the message
> over the default max message size limit(64K). I want to increase the
> max message size limit, but I am wondering the impact to the system.
> Is there any one have try to increase the max message size of a
> queue?

A:

The default maximum message size is 4 MiB in recent versions of MQ, and 
I routinely use messages up to 2 MiB without problems.

Your MQ administrators have probably set a lower size limit to help them 
with their capacity planning; I suggest you ask them whether they're 
happy for you to change it.

Regards,


-------
Note:
-------

thread:

Q:

Dear All,

facing following error :Our IBM WebSphere Work Flow client whenever communicates with AIX server which has the QMGR gives following errors:-

MQ Error Log:

AMQ6109: An internal WebSphere MQ error has occurred.
EXPLANATION:
An error has been detected, and the WebSphere MQ error recording routine has been
called.
ACTION:
Use the standard facilities supplied with your system to record the problem
identifier, and to save the generated output files. Contact your IBM support
center. Do not discard these files until the problem has been resolved.

----- amqxfdcx.c : 688 --------------------------------------------------------
01/05/07 15:54:18
AMQ6183: An internal WebSphere MQ error has occurred.

EXPLANATION:
An error has been detected, and the WebSphere MQ error recording routine has been
called. The failing process is process 975078.
ACTION:
Use the standard facilities supplied with your system to record the problem
identifier, and to save the generated output files. Contact your IBM support
center. Do not discard these files until the problem has been resolved.

QMGR LOG:

AMQ9208: Error on receive from host 10.16.17.194.
EXPLANATION:
An error occurred receiving data from 10.16.17.194 over TCP/IP. This may be due
to a communications failure.
ACTION:
The return code from the TCP/IP (read) call was 73 (X'49'). Record these values
and tell the systems administrator.

According to above AMQ**** errors we have done the changes in AIX server
But we are still getting the above errors

Step:1
In /etc/services:
add lines:
mqseries_1 1460/tcp # MQSeries QM1
mqseries_2 14002/tcp # MQSeries QM2

Step2:
In /etc/inetd.conf
add lines:
mqseries_1 stream tcp nowait mqm /usr/mqm/bin/amcrsta amqcrsta
mqseries_2 stream tcp nowait mqm /usr/mqm/bin/amcrsta amqcrsta

Step3:
Run the inetd super-daemon to recognize the update:
inetimp
refresh -s inetd


-------
Note:
-------

article:


Recommended Fixes for WebSphere MQ
  
http://www-01.ibm.com/support/docview.wss?rs=171&uid=swg27006037
 

-------
Note:
-------

Display

MQ installed, which version
dspmqver (pre V6: mqver)
Display QueueManagers on a machine
dspmq
Display QueueManager settings
   MQSC: DIS QMGR
Display Queues, name and type or ALL attributes [optional]
   MQSC: DIS Q(*) [ALL]
Display Queues, specific attribute(s)
   MQSC: DIS Q(*) CURDEPTH
often used attributes: CURDEPTH IPPROCS OPPROCS GET PUT MAXDEPTH MAXMSGL
Display non empty Queues
   MQSC: DIS Q(*) CURDEPTH WHERE(CURDEPTH GT 0)
Display Channels, name and type or ALL attributes [optional]
   MQSC: DIS CHL(*) [ALL]
Display Channels, specific attribute(s)
   MQSC: DIS CHL(*) CONNAME
Display Services, name and type or ALL attributes [optional]
   MQSC: DIS SERVICE(*) [ALL]
Display Listener, name and type or ALL attributes [optional]
   MQSC: DIS LSTR(*) [ALL]


Display Log Settings (Windows)
amqmdain reg QmgrName -c display -s Log -v *
Display Queue Filenames for Queues
dspmqfls -m QmgrName -t qlocal *
Start / Stop
Starting QueueManager
strmqm QmgrName
Stopping QueueManager
endmqm QmgrName
return control after end: endmqm -w QmgrName
end immediately: endmqm -i QmgrName
if all else fails (use with caution!): endmqm -p QmgrName
Starting Channel
   MQSC: START CHL(ChannelName)
Stopping Channel
   MQSC: STOP CHL(ChannelName)
set inactive: STOP CHL(ChannelName) mode(quiesce) status(inactive)
Starting Service
   MQSC: START SERVICE(ServiceName)
Stopping Service
   MQSC: STOP SERVICE(ServiceName)
Starting Listener
   MQSC: START LSTR(ListenerName)
Stopping Listener
   MQSC: STOP LSTR(ListenerName)
Status
Display Channel status or ALL status information [optional]
   MQSC: DIS CHS(*) [SAVED] [ALL]
Display Queue status or ALL status information [optional]
   MQSC: DIS QS(*) [TYPE(HANDLE)] [ALL]
Display Service status or ALL status information [optional]
   MQSC: DIS SVSTATUS(*) [SAVED] [ALL]
Display Listener status or ALL status information [optional]
   MQSC: DIS LSTATUS(*) [SAVED] [ALL]


-------
Note:
-------

A
access control list (ACL) 
In computer security, a list associated with an object that identifies all the subjects that can access the object that it is associated with. The list also defines their access rights. Subjects are principals that have explicit permissions (to publish, to subscribe to, and to request persistent delivery of, a publication message) against a topic in the topic tree. The ACLs define the implementation of topic-based security. 
ACL 
See access control list. 
AMI 
See Application Messaging Interface. 
Application Messaging Interface (AMI) 
The programming interface, provided by WebSphere MQ, that defines a high level interface to message queuing services. See also Message Queue Interface (MQI) and Java Message Service (JMS). Applications that use the AMI connect to the broker using WebSphere MQ Enterprise Transport. 
TOP

B
bar file 
See broker archive file. 
bend point 
A point that is introduced in a connection between two message flow nodes at which the line that represents the connection changes direction. A bend point can be used to make node alignment and processing logic clearer and more effectively displayed. 
binary large object (BLOB) 
A block of bytes of data (for example, the body of a message) that has no discernible meaning, but is treated as one solid entity that cannot be interpreted. 
BLOB 
See binary large object. 
broker 
A set of execution processes that host one or more message flows. Also known as message broker. 
broker archive file 
The unit of deployment to the broker; also known as a bar file. It contains any number of different files, including compiled message flows (.cmf). It can also contain any additional files that you might need, provided that the extension does not overlap the .cmf extensions. 
broker domain 
A collection of brokers that share a common configuration, together with the Configuration Manager that controls them. 
broker schema 
A symbol space that defines the scope of uniqueness of the names of resources (message flows) that are defined within it. 
built-in node 
A message flow node that is supplied by the product. Some of the supplied nodes provide basic processing such as input and output. 
TOP

C
cmf 
See compiled message flow. 
collective 
A set of brokers that are fully interconnected and form part of a multi-broker network for publish/subscribe applications. 
compiled message flow (cmf) 
A message flow that has been compiled to prepare it for deployment to the broker. A cmf file is sent to the broker within a bar file. 
component 
A set of runtime processes that perform a specific set of functions. A component is a broker, a Configuration Manager, a Database Instance Manager, or a User Name Server. 
component directory 
In z/OS, the root directory of the component's runtime environment. 
component name 
The external name of a component. Each component requires a name, which is used, for example, in the workbench and in commands. 
component PDSE 
In a z/OS environment, a PDSE that contains jobs to define resources to DB2, WebSphere MQ, and the WebSphere Event Broker started task. See partitioned data set. 
configuration 
In a broker domain, the brokers, execution groups, deployed message flows, and defined topics and access control lists. 
Configuration Manager 
The component that provides an interface between the workbench and a set of runtime brokers. It provides brokers with their initial configuration, and updates them with any subsequent changes. It maintains the broker domain configuration. 
Configuration Manager Proxy 
An application programming interface that your applications can use to control broker domains through a remote interface to the Configuration Manager. 
connection 
See message flow node connection. For broker-to-broker connections, see publish/subscribe topology. 
content-based filter 
In publish/subscribe, an expression that is included as part of a subscription to determine whether a publication message is received based on its content. The expression can include wild cards. 
TOP

D
Database Instance Manager 
On Windows, a network server that supports the creation, maintenance, and deletion of databases used by brokers in all installations on a single computer. Database support is limited to Derby and DB2. The Database Instance Manager is associated with a Windows service. 
DataFlowEngine (DFE) 
See execution group. 
datagram 
A form of asynchronous messaging in which an application sends a message, but does not want a response. Also known as send-and-forget. Contrast with request/reply. 
deploy 
The process of transferring data to an execution group on a broker so that it can take effect in the broker domain. For deploying message flows and associated resources, the data is packaged in a broker archive (bar) file before being sent to the Configuration Manager, from where it is unpackaged and distributed appropriately. 
Derby 
The database based on the Apache Derby open source project from Apache Software Foundation. Derby database support is embedded in the broker component on Windows only. 
distribution list 
A list of WebSphere MQ queues to which a message can be put with a single statement. 
TOP

E
editor area 
The area in the workbench window where files are opened for editing. 
ESM 
See external security manager. 
execution group 
A named grouping of message flows that have been assigned to a broker. The broker enforces a degree of isolation between message flows in distinct execution groups by ensuring that they execute in separate address spaces, or as unique processes. 
An execution group process is also known as a DataFlowEngine (DFE); this term is typically used in problem determination scenarios (trace contents, diagnostic messages, and so on). A DFE is created as an operating system process, and has a one-to-one relationship with the named execution group. If more than one message flow runs within an execution group, multiple threads are created within the DFE process.

Extensible Markup Language (XML) 
A standard metalanguage for defining markup languages that is based on Standard Generalized Markup Language (SGML). 
External Security Manager (ESM) 
In a z/OSr environment, a security product that performs security checking on users and resources. RACF is an example of an ESM. 
TOP

G
graphical user interface (GUI) 
A type of computer interface that presents a visual metaphor of a real-world scene, often of a desktop, by combining high-resolution graphics, pointing devices, menu bars and other menus, overlapping windows, icons, and the object-action relationship. 
GUI 
See graphical user interface. 
TOP

I
IBM Runtime Environment for Java 
A subset of the IBM Developer Kit for the Java Platform that contains the core executable files and other files that constitute the standard Java platform. The IBM Runtime Environment includes the Java virtual machine (JVM), core classes, and supporting files. 
IBM Software Developer Kit for Java 
A software package that can be used to write, compile, debug, and run Java applets and applications. 
input node 
A message flow node that represents a source of messages for a message flow or subflow. See also output node. 
install_dir 
The location in the local file system in which product components have been installed. For example, the default location for runtime components on Windows is C:\Program Files\IBM\6.0. 
installation directory 
In a z/OS environment, a file system into which all product data is installed, and from which it is referenced and retrieved during the customization phase. 
TOP

J
Java Database Connectivity (JDBC) 
An industry standard for database-independent connectivity between the Java platform and a wide range of databases. The JDBC interface provides a call-level API for SQL-based and XQuery-based database access. See also Open Database Connectivity. 
Java Message Service (JMS) 
An application programming interface that provides Java language functions for handling messages. See also Application Messaging Interface (AMI) and Message Queue Interface (MQI). Applications using JMS connect to the broker using either WebSphere MQ Real-time Transport or WebSphere MQ Multicast Transport. 
JCL 
See Job Control Language 
JDBC 
See Java Database Connectivity. 
JMS 
See Java Message Service. 
Job Control Language 
Job Control Language (JCL) comprises a set of Job Control Statements that are used to define work requests called jobs. JCL tells the operating system what program to run, and defines its inputs and outputs. 
TOP

L
local error log 
A generic term that refers to the logs to which WebSphere Event Broker writes records on the local system. Also known as the system log. 
TOP

M
message 
A communication that is sent from a person or program to another person or program. In WebSphere Event Broker, messages must have a structure and format which is agreed by the sending and receiving applications. 
message broker 
See broker. 
Message Brokers Toolkit 
The WebSphere Event Broker development environment that integrates with Rational Application Developer which is based on the IBM WebSphere Eclipse Platform. Also known as the workbench. 
message flow 
A sequence of processing steps that run in the broker when an input message is received. A message flow is created in the workbench by including a number of message flow nodes that each represents a set of actions that define a processing step. The connections in the flow determine which processing steps are carried out, in which order, and under which conditions. A message flow must include an input node that provides the source of the messages that are processed. Message flows are then ready to deploy to a broker for execution. See also subflow. 
message flow node 
A processing step in a message flow, also called a message processing node. A message flow node can be a built-in node, a user-defined node, or a subflow node. 
message flow node connection 
An entity that connects an output terminal of one message flow node to an input terminal of another. A message flow node connection represents the flow of control and data between two message flow nodes. 
message parser 
A program that interprets an incoming message and creates an internal representation of the message in a tree structure, and that regenerates a bit stream for an outgoing message from the internal representation. 
message processing node 
See message flow node. 
Message Queue Interface (MQI) 
The programming interface that is provided by WebSphere MQ queue managers. The programming interface allows application programs to access message queuing services. See also Application Messaging Interface (AMI) and Java Message Service (JMS). Applications that use the MQI, connect to the broker using WebSphere MQ Enterprise Transport. 
metadata 
The data that describes the characteristic of stored data. 
MQI 
See Message Queue Interface. 
MQIsdp 
See SCADA device protocol. 
MQRFH 
An architected message header that is used to provide metadata for the processing of a message. This header is supported by the WebSphere MQ (MQSeriesr) Publish/Subscribe SupportPac. 
MQRFH2 
An extended version of MQRFH, providing enhanced function in message processing. 
multilevel wild card 
A wild card that can be specified in subscriptions to match any number of levels in a topic. 
TOP

N
node 
An endpoint or junction used in a message flow. See message flow node. 
TOP

O
ODBC 
See Open Database Connectivity. 
Open Database Connectivity (ODBC) 
A standard application programming interface (API) for accessing data in both relational and non-relational database management systems. Using this API, database applications can access data stored in database management systems on a variety of computers even if each database management system uses a different data storage format and programming interface. 
output node 
A message flow node that represents a point at which messages leave the message flow or subflow. See also input node. 
TOP

P
parser 
See message parser. 
partitioned data set (PDS, PDSE) 
In a z/OS environment, a data set in direct-access storage that is divided into partitions, which are called members. A partitioned data set (extended) (PDSE) is an extension to a PDS that contains an indexed directory in addition to the members. 
PDS, PDSE 
See partitioned data set. 
perspective 
A group of views that show various aspects of the resources in the workbench. See also view. 
point-to-point 
A style of messaging application in which the sending application knows the destination of the message. Contrast with publish/subscribe. 
principal 
An individual user ID (for example, a login ID) or a group. A group can contain individual user IDs and other groups, to the level of nesting that is supported by the underlying facility. 
property 
A characteristic that, as one of a set of characteristics, defines the values and behaviors of objects in the workbench. For example, message flow nodes and deployed message flows have properties. 
publication 
A piece of information about a specified topic that is available to a broker in a publish/subscribe system. 
publication node 
An end point of a specific path through a message flow to which a client application subscribes, identified to the client by its subscription point. 
publisher 
An application that makes information about a specified topic available to a broker in a publish/subscribe system. 
publish/subscribe 
A style of messaging application in which the providers of information (publishers) are de-coupled from the consumers of that information (subscribers) using a broker. See also topic. Contrast with point-to-point messaging. 
publish/subscribe topology 
The brokers, the collectives, and the connections between them, that support publish/subscribe applications in the broker domain. 
TOP

Q
queue 
A WebSphere MQ object to which message queuing applications can put messages, and from which message queuing applications can get messages. 
queue manager 
A system program that provides queuing services to applications. A queue manager provides an application programming interface (the MQI) that enables programs to access messages on the queues that the queue manager owns. 
TOP

R
request/reply 
A type of messaging application in which a request message is used to request a reply from another application. Contrast with datagram. 
resource 
A file of any type that exists in the workbench. You can view and edit a resource in the Broker Development view (previously called the Resource Navigator view) in the workbench. 
Resource Recovery Services (RRS) 
A z/OS facility that provides two-phase sync point support across participating resource managers. 
retained publication 
A published message that is kept at the broker for propagation to clients that subscribe in the future. 
RRS 
See Resource Recovery Services. 
TOP

S
SCADA 
See Supervisory, Control, And Data Acquisition. 
SCADA device protocol (MQIsdp) 
A protocol that implements the WebSphere MQ Telemetry Transport to connect SCADA devices to the broker. 
send-and-forget 
See datagram. 
single-level wild card 
A wild card that can be specified in subscriptions to match a single level in a topic. 
stream 
A method of topic partitioning that is used by applications that connect to WebSphere MQ Publish/Subscribe brokers. 
subflow 
A sequence of processing steps, implemented by message flow nodes, that is designed to be embedded in a message flow or in another subflow. A subflow must include at least one Input or Output node. A subflow can be started by a broker only as part of the message flow in which it is embedded, and therefore cannot be deployed. 
subflow node 
A message flow node that represents a subflow. 
subscriber 
An application that requests information about a specified topic from a publish/subscribe broker. 
subscription 
A record that contains the information that a subscriber passes to its local broker to describe the publications that it wants to receive. 
subscription filter 
A predicate that specifies the subset of messages that are to be delivered to a particular subscriber. 
subscription point 
The name that a subscriber uses to request publications from a particular set of publication nodes. It is the property of a publication node that differentiates that publication node from other publication nodes in the same message flow. 
Supervisory, Control, And Data Acquisition (SCADA) 
A term used to describe any form of remote telemetry system that is used to gather data from remote sensor devices (for example, flow rate meters on an oil pipeline) and for the near real time control of remote equipment (for example, pipeline valves). These devices communicate with the broker using the SCADA device protocol (MQIsdp). 
system log 
See local error log. 
TOP

T
terminal 
The point at which one node in a message flow is connected to another node. You can connect terminals to control the route that a message takes, dependent on the outcome of the operation that is performed on that message by the node. 
topic 
A character string that describes the nature of the data that is published in a publish/subscribe system. 
topic based subscription 
A subscription specified by a subscribing application that includes a topic that filters publications. 
topic security 
The application of ACLs to one or more topics to control subscriber access to published messages. 
topology 
See publish/subscribe topology. 
TOP

U
Unicode Transformation Format, 8-bit encoding form (UTF-8) 
A transformation format that is designed for ease of use with existing ASCII-based systems. UTF-8 is an encoding of Unicode character strings that optimizes the encoding of ASCII characters in support of text-based communication. 
uniform resource identifier (URI) 
An encoded address that represents any resource, such as an HTML document, image, video clip, or program, on the Web; a URI is an abstract superclass compared with a Uniform resource locator or a Uniform resource name, which are concrete entities. 
uniform resource locator (URL) 
A sequence of characters that represent information resources on a computer or in a network such as the Internet. This sequence of characters includes:
The abbreviated name of the protocol that is used to access the information resource 
The information that is used by the protocol to locate the information resource 
A Web server typically maps the request portion of the URL to a path and file name. Also known as universal resource locator. 
uniform resource name (URN) 
A name that uniquely identifies a Web service to a client. 
URI 
See uniform resource identifier. 
URL 
See uniform resource locator. 
URN 
See uniform resource name. 
user-defined node 
An extension to the broker that provides a new message flow node in addition to those that are supplied with the product. A user-defined node cannot be developed in WebSphere Event Broker, but can be imported and deployed. 
User Name Server 
A component that interfaces with operating system facilities to determine valid users and groups. 
UTF-8 
See Unicode Transformation Format. 
TOP

V
view 
In Eclipse-based user interfaces, a pane that is outside the editor area, which can be used to look at or work with the resources in the workbench. For example, you can view and edit your project files in the Broker Development view (previously called the Resource Navigator view). See also perspective. 
TOP

W
WebSphere MQ Enterprise Transport 
A transport protocol supported by WebSphere Event Broker that enables WebSphere MQ application clients to connect to brokers. 
WebSphere MQ Everyplace 
A generally available WebSphere MQ product that provides proven WebSphere MQ reliability and security for mobile and wireless devices. WebSphere MQ Everyplacer applications connect to the broker using WebSphere MQ Mobile Transport. 
WebSphere MQ Mobile Transport 
A transport protocol supported by WebSphere Event Broker that enables WebSphere MQ Everyplace application clients to connect to brokers. 
WebSphere MQ Multicast Transport 
A transport protocol supported by WebSphere Event Broker that enables dedicated JMS application clients to connect to brokers. This protocol is optimized for high volume, one-to-many publish/subscribe topologies. 
WebSphere MQ Real-time Transport 
A transport protocol supported by WebSphere Event Broker that enables dedicated JMS application clients to connect to brokers. 
WebSphere MQ Telemetry Transport 
A transport protocol supported by WebSphere Event Broker that enables SCADA devices to connect to brokers. This protocol is a lightweight publish/subscribe protocol that flows over TCP/IP that uses a subset of UTF-8. 
wild card 
A character that can be specified in subscriptions to match a range of topics. See also multilevel wild card and single-level wild card. 
workbench 
See Message Brokers Toolkit. 
work_path 
The location in the local file system in which the component stores internal and working data. For example, the default location on Windows systems is C:\Documents and Settings\All Users\Application Data\IBM\MQSI\. 
World Wide Web Consortium (W3C) 
An international industry consortium set up to develop common protocols to promote the evolution and interoperability of the World Wide Web. 
W3C 
See World Wide Web Consortium. 
TOP

X
XML 
See Extensible Markup Language. 


-------
Note:
-------

thread:

Q:

Hi all
 
I'm resending this as I haven't seen it arrive on the list so if you get it twice, please accept my apologies.
 
We're having a problem here connecting an app written for MQ V6.0 on Sun to a V5.3 QMGR running on Z/OS that hopefully someone can give us some advice on.  The developers have used "the MQ Java plug-in", by which I assume they mean JMS.  
 
The problem is that the command server has to be running as the plug-in apparently uses PCF commands.  We're considering implementing a policy in which no app will be allowed to issue PCF commands unless they can give us a very good idea of why.  
My question is what kind of PCF commands would the Java plug-in be issuing?  
As they are trying to connect to a remote queue pointing to a Z/OS queue, obviously it would be rejected by the V5.3 QMGR on that side but I can't figure out what PCF command they are even trying to issue or why.
When they connect just to a local queue on Sun, whether MQ is on V5.3 or V6.0, the PUT happens successfully.  However, when the attempt to connect to a remote queue, once again, on either V.3 or V6.0, they get the following error, assumedly because Z/OS doesn't support PCF commands yet.
 
[lcs-server.log] 2005-10-14 15:37:01,497 INFO [LCS] SocketHandler:No socket listener started.
[lcs-server.log] 15:37:03.0098 main.Server(Exception): Object CISACA.MVS.INPUT0 not found on queue manager ZANC000.
GODZILLA
    [trace]
BUILD FAILED
/main/software/Documentum/aca_interfaces/jlib/apps/lcs/build.xml:42: LCS Startup Failed: Exception occured during s
tartup

The MQ error we get is:
 
"10/14/05 03:37:03 PM - Process(10684.8) User(mqm) Program(amqrmppa)
AMQ9209: Connection to host 'godzilla (10.58.1.40)' closed.
 
EXPLANATION:
An error occurred receiving data from 'godzilla (10.58.1.40)' over TCP/IP.  The
connection to the remote host has unexpectedly terminated.
ACTION:
Tell the systems administrator."

Godzilla is the name of the server both the V6.0 QMGR and the app are running.
 
Also, we don't have the upgrade of Z/OS to V6.0 on the cards until 1Q next year, so it looks like we're going to have to find a workaround until then. 

A:


-------
Note:
-------


thread:

Q:

Hi! 

I have some interessting errormessage occuring in my application-eventlog. 
1.) AMQ9519 'The requested operation failed because the program could not find a definition of channel.' 
2.) AMQ9999 'Channel program ended abnormally.' 
Both entries refert to 'Sys1.Sys2' and occur pairwise every 1 to 5 minutes. 

The error itself in this case is absolutely correct as Sys1 has been shut down, Channel-definitions were deleted. 
New channel was created. - So current productive channel is 'Sys3.Sys2'. Everything worked fine without errors. 

After applying a windows-security update (some time ago, no-one really can remember) the errormessage about old channel 'sys1.sys2' started. 
System is still working fine - expect from the 'annoying entry' in applicationlog. 

Has anyone some idea how to get this error stopped? 


Regards 


A:

 What type of channel is Sys1.Sys2?

both systems are 'MQ servers' - so Sys3 is sending data to Sys2, Sys2 is processing the data and sends some result-data back to Sys3. 
If the terminology is correct, i would name it as 'bidirectional'. 

Sys1 is the predecessor of Sys3. 


If the terminology is correct, i would name it as 'bidirectional'. 


The terminology is not correct. On one machine, you are receiving this error. On that machine, there will be 
a uniquely named MQ object called 'Sys1.Sys2'. That object will be a channel object, and it will have a 
specific channel type - a sender, a receiver, etc. 

MQ channels are never bi-directional. They are always uni-directional. Bi-directional communication is accomplished 
with two sets of uni-directional channels (each of which has a channel object on each side - so *four* channel objects).

 OK. 

I'm sitting at 'QM.Sys2'. 
Sys3.Sys2 - Receiver 
Sys2.Sys3 - Server 

So if I understand correctly, somehow the 'old Sys1-MQserver' tries to send something to this 'Sys2-MQserver' 
old settings, and this machine fails, as the according settings are deleted? 
 
go into runmqsc on both QM.Sys2 and QM.Sys3 and enter: 
Code: 
display channel('Sys1.Sys2') 
and tell us what you see.... 
and you're sure 'QM.Sys1' qmgr has been deleted?

System2 is telling me: 

Code: 

display channel ('Sys1.Sys2') 
3: display channel ('Sys1.Sys2') 
AMQ8147: WebSphere MA object Sys1.Sys2 not found. 
 

I look for someone with access to System3... 
 
did you also delete the xmitq or qremote to sys2, it should'nt really matter but just in case. 
 
what i am guessing is probably chad is enabled for your queue manager, which i think auto defines channels,try disabling that 
and see if the error goes off. 
you can see that in the Q.M properties 
 
what i am guessing is probably chad is enabled for your queue manager, which i think auto defines channels,
try disabling that and see if the error goes off. 
you can see that in the Q.M properties 
except simply setting "chad(disabled)" doesn't make any autodefined channels go away.....

 OK, problem solved: 

The channel was still active on remote site (at least it tried to be). 

Thanks for the idea. :) 
 

-------
Note:
-------


IC31952: THE RECOREY OF THE CHANNEL FAILED DUE TO AMQ9999.
  

 A fix is available 
MQSeries for HP NonStop Kernel, V5.1 - Fix Pack 03 (CSD03)

 
APAR status
Closed as program error.

Error description 
When this customer performed takeover the queue server process
to backup process from primary process during sending
the message to remote queue, the recovery of the channel
failed due to AMQ9999 although takeover of the process
was completed.
So that, sending of the message was also terminated.
This problem occurred about 4 times per 5 operations.
Moreover,though trying to stop/start the channel after the above

problem, ever could not recover the channel due to AMQ9999 too.

Local fix 

Problem summary 
primary process failed (cpu failure or TACL stop) and the
backup process took over. The most serious error was that
messages could no longer be dequeued as noted with a FFST with
component qslReadQRec.

Problem conclusion 
of the Compaq NSK Open TMF features in the backup process. This
allows the reporting of TM/MP transaction completion status to
the backup process. Prior to this implementation the backup
process had to check the status of every TM/MP transaction that
was stored in its internal list. However there was still a
window of transactions that where started but the outcome was
unknown.
A routine was added to check for persistent messages there
physical presenseon disk. If found approriate action was taken
to adjust the Qserver. In the case of a non-persistent message
we don't know if it was commited or aborted and there is no
physical disk record to check, NPM are memory based within the
QServer, so we leave as is.
Other problems with checkpointing were addressed that caused
errors. The major one being the opener context was not using
the correct process. Others include the sync of the new backup
process was not storing reply data, get checkpointing had some
windows, improve reliability od SET_SIGNAL get, order of
checkpoint for no syncpoint operations
Many other changes to handle failed TM/MP transactions and MQI
operations for large messages using multiple IPCs where
implemented. Reason codes turned to applications were corrected.
No syncpoint failure due to local TM/MP transaction abort will
return UOW_CANCELED. Syncpoint failure due to global transaction
abort BACKED_OUT is returned.

Temporary fix 

Comments 
APAR information 
APAR number IC31952 
Reported component name MQSERIES COMPAQ 
Reported component ID 5724A3900 
Reported release 510 
Status CLOSED PER 
PE NoPE 
HIPER NoHIPER 
Special Attention NoSpecatt 
Submitted date 2001-10-19 
Closed date 2002-06-21 
Last modified date 2002-06-21 

APAR is sysrouted FROM one or more of the following:

APAR is sysrouted TO one or more of the following:

Modules/Macros
MQQSSVR MQSRLLIB         

Publications Referenced


Fix information 
Fixed component name MQSERIES COMPAQ 
Fixed component ID 5724A3900 

 
-------
Note:
-------

Fix list for WebSphere MQ Version 6.0
  
 Product readme 
  
Abstract 
WebSpherer MQ provides periodic fixes for release 6.0. The following is a complete listing of available and scheduled fixes 
for Version 6.0 with the most recent fix at the top, for WebSphere MQ 6.0 on iSeriesr, UNIXr and Windowsr.  
 
 
Content 
 Back to all versions 


--------------------------------------------------------------------------------

 Fix Pack 6.0.2.6 (V6.0.2.6)  
 Fix Pack 6.0.2.5 (V6.0.2.5)  
 Fix Pack 6.0.2.4 (V6.0.2.4)  
 Fix Pack 6.0.2.3 (V6.0.2.3)  
 Fix Pack 6.0.2.2 (V6.0.2.2)  
 Fix Pack 6.0.2.1 (V6.0.2.1)  
 Refresh Pack (V6.0.2.0)  
 Fix Pack 6.0.1.1 (V6.0.1.1)  
 Refresh Pack 6.0.1.0 (V6.0.1.0)  
 Recent and planned Fix Pack content summary  

Glossary of Terms 

--------------------------------------------------------------------------------

Note: To download WebSphere MQ Fix and Refresh Packs follow this link. http://www-1.ibm.com/support/docview.wss?rs=171&uid=swg27006037


Fix Pack 6.0.2.6 (V6.0.2.6)  
Fix release date: 1Q 2009
Last modified: 01 October 2008
Status: Scheduled
    
Fix Pack (V6.0.2.5)  
iSeries Fix release date: 30 October 2008
Last modified: 30 October 2008
Status: Available
    
Windows and UNIX fix release date: 06 October 2008
Last modified: 01 October 2008
Status: Available

Download information     

APAR Description 
IC54772  Mqjms publish subscribe applications MQ channel and tcpip resources are not freed under some circumstances  
IC54941  Usage of ccdt to connect to queue managers from .NET interface  
IC55048  Application using SOAP transport for WMQ fail to connect to target services in client binding mode  
IC55141  Mqconnx does not check mqcno and mqcd versions and lengths under com+  
IC55154  Correction in WMQ system admin guide for availability of runmqtmc command on Windows and UNIX clients.  
IC55175  Domain nested groups documentation is ambiguous and misleading.  
IC55218  Customer unable to start a channel. error message AMQ9587 found in log. may also get FDC with probe AD004001 on QM termination.  
IC55259 MQJAVA: Java native memory leak when using MQINQ or MQSET in a bindings mode application with Java J9VM5.x 
IC55390  Sequence number on queue stats msg is always 1  
IC55482  WMQ explorer does not end the svrconn channel instance if browsing a remote queue fails.  
IC55548  Wmqjms: JMS applications unexpectedly disconnected from message broker  
IC55611  Hang in com+ client applications when connection to queue manager has been lost or the queue manager is restarted  
IC56006  Wmqjava: incorrect put time and date is displayed when looking at messages on a queue  
IC56009  Comphdr and compmsg channel attributes can't have spaces around their values when being accessed by runmqsc  
IC56068  WebSphere MQ queue statistics data field browsefailcount is incorrectly incremented  
IC56133  Mqrc_api_exit_not_found reason code not found when compiling WebSphere MQ V6 Java applications  
IC56151  Error code 2354 missing in return code translation table in ActiveX code.  
IC56170  Implement setclonesupport in resource adapter  
IC56264  A sta .NET application (eg. vb.net) suffers memory leaks due to a hung finalizer thread.  
IC56279  Explorer writes incorrect put/get authority to setmqaut dump  
IC56327  A WebSphere MQ .NET application fails to get or put a message to a z/OS queue manager  
IC56352  Mqrc_uow_enlistment_error (2354) when using extended transactional client  
IC56408  AMQ9245 message generated without authority event message being put onto the system.admin.qmgr.event queue  
IC56432  Misleading statements in WMQ V6 info-center regarding support formessage grouping & segmentation when using pub/sub  
IC56461  When using xms.net to connect to an LDAP server, connection factory lookups reports that 'tcm' has already been added  
IC56649  Installing the MQ client causes the javawebstart registration information to be overridden.  
IC56662  Hang in AMQZMUC0 process following the 'disk not ready' error  
IC56709  Maximum logfilepages displayed in WebSphere MQ explorer GUI is 16384 instead of the V6 maximum of 65535  
IC56754  WebSphere MQ API exerciser doubles the number of bytes when using MQPut in DBCS locales  
IC56858  Mqexception message method is missing from the .NET libraries INV6+  
IC57918  Delay in delivering large messages to message-driven beans when using WebSphere MQ with WebSphere application server  
IC57931  WebSphere MQ version 6 SOAP ivt fails when using Java 5  
IZ05857  WebSphere MQ responds with incorrect XA error code in a global XA transaction.  
IZ13144  Cluster workload is affected due to change in netprty behaviour  
IZ13921  GSK7CAPICMD crash when incorrect command syntax is used  
IZ14005  Probe XC130003 FDC (sigsegv) in runmqlsr process with function rppconnectpool on the mqm stack.  
IZ15456  Very slow initial access to deep queues holding large numbers of grouped messages.  
IZ15677  Jms: application thread hangs when exception thrown by mqthread.  
IZ16357  Conversion table 819 to 284 missing for WebSphere MQ installation on AIX  
IZ16620  Conversion table entries for ccsid 284 missing for WebSphere MQ installation on AIX  
IZ16645  Probe RM554010 rrce_file_corrupt. client program mqconn fails, reason code 2058. clntconn defns missing from runmqsc output.  
IZ17062  Jvm userid is not passed while creating an xaconnection if no userid is specified by the JMS application  
IZ17156  Strmqm incorrectly displays error code during failure  
IZ17158  Setting errorlogsize does not take effect for queue managers whose name contain a "." and/or "/"  
IZ17303  Cluster channel ping fails with segmentation violation and core dump  
IZ17313  System.cluster.repository.queue damaged, strmqm failed with an exception in AMQZXMA0 from rfxqueryclqmgr.  
IZ17341  WMQ may start a user-defined service with all signals blocked, so a stop service that sends e.g. sigterm would be ineffective  
IZ18103  Problem during the truncation of a "ghost" queue file which is marked for reuse.  
IZ18142  Endmqm can take many minutes to end the queue manager if there are active svrconn channels  
IZ18716  Errors may not be propagated from early stanzas during processing of qm.ini and mqs.ini files.  
IZ18954  Delay in sending messages when pipelining is used with SSL  
IZ19009  Arce_object_damaged and probe ID AQ143011 after migrating from MQ 5.3 to MQ 6  
IZ19168  Probe RM400001 FFST using pipelined channels at 6.0.2.3  
IZ19249  WMQ quick beginnings guides should be updated to clarify queue manager shutdown by root user when applying maintenance  
IZ19340  Mqjms: deadlock condition may occur when an application thread attempts to close session used by asynchronous messageconsumer.  
IZ19555  Probes XY179010 and XY180010 from AMQZMGR0.  
IZ20546  High cpu usage in the amqrrmfa process on an hourly basis for several minutes. applications unable to issue MQ API calls.  
IZ20672  WMQ trace generates incorrect thread ID when pipelining is used with SSL  
IZ20758  XC034002 AMQZLAA0 unexpected response to a pthread_cond_timedwait() request causes waiter chain corruption  
IZ20974  Broken symbolic link can cause MQ queue manager data loss  
IZ21318  WebSphere MQ failure to reserve log space in a queue manager restart scenario.  
IZ21552  SSL ping channel fails on hp Itanium  
IZ21977  MQRC_OBJECT_CHANGED(2041), AMQ9511 SYSTEM.CLUSTER.TRANSMIT.QUEUE,AMQ9448, repository manager ends  
IZ22019  Data conversion performance problems after multiple unsupported or invalid conversions have been requested  
IZ22198  AMQ4048 error when browsing messages using the MQ explorer. an FDC with an access violation is also produced.  
IZ22272  AMQ9526 and 'scratchpad in use' errors are seen when two identically named qmgrs are connecting to same receiving QM  
IZ22725  Incorrect statement in manual: queue manager configuration maxchannels attribute  
IZ22727  Correct reason code when cluster alias sub is not acknowledged  
IZ23058  Workload balancing does not round robin the messages as expected in a clustered environment.  
IZ23230  Mqjava: unable to use WebSphere MQ Java client to connect to a tpf queue manager with gmo matching options.  
IZ23438  Cluster cache look ups are taking too long causing clustered object resolution errors.  
IZ23756  Function abstime does not always send a value less than 1000000000 as an argument to cond_timedwait().  
IZ23780  Channel status stopped not migrated to V6  
IZ23789  Queue manager terminated abruptly and an FDC with probe ID AT004007 is generated.  
IZ23839  Address alignment exception with the log formatter when formatting an MQGet log record  
IZ23943  Hang in duow (pipelined) channel process following forced termination of a channel xppthreadmutex  
IZ24069  PCF mqcmd_inquire_auth_recs response messages all have msgseqnumber 1 in the mqcfh  
IZ24178  Fdc's produced by synchronous signal handler incomplete on hp/ipf (hp itanium)  
IZ24186  Cluster workload exit sample amqswlm does not load in MQ6.0.  
IZ24193  WebSphere MQ does not generate AIX trace on AIX V6.  
IZ24362  Core dump and truncated FDC in WebSphere MQ whilst writing an FDC during the very early stages of a process startup.  
IZ24944  Wmqjms: JMS pub/sub cleanup thread (pub/sub) fails to utilize the username/password specified by the application.  
IZ25171  During channel termination, an FDC with probe ID RM409000 followed by many with probe ID RM409001 is produced.  
IZ25614  WebSphere MQ channels terminate with AMQ9604 and AMQ9208 and get FDCs with XC130003 and RM031101  
IZ25622  Error 2185 putting group msgs to cluster queue with persistence as queue definition  
IZ28942  Incorrect warnings during install of 6.0.2.4 mqm-javasdk and mqm-keyman on HP-UX  
IZ31327  Defects fixed in WebSphere MQ fix pack 6.0.2.5 on AIX V6.0  
SE28827  MQM400 SDR(1256) to RCVR(420) fails to start with AMQ9520  
SE28896  MQM400 testfix for APAR SE28113 on refreshpack 6.0.2.1  
SE29574  MQM400 testfix for APAR SE28838 on WMQ6.0.2.1  
SE30937  MQM400 mdv V6.0.2.2 (mqrc) testfix for IZ08829  
SE31046  MQM400 mdv V6.0.2.2 (mqcs) testfix for IZ07206  
SE31087  MQM400 mdv V6.0.2.2 (mqqm) testfix for IC53974  
SE31088  MQM400 mdv V6.0.2.2 (mqrc) testfix for SE30754  
SE31252  MQM400 mdv V6.0.2.2 (mqjb) testfix for SE28838  
SE31779  MQM400 validateauth=no does not maintain error log permissions  
SE31994  MQM400 mqoa crtmqmprc/chgmqmprc fails to save values with *  
SE32112  Osp-perfm MQM400-DELAY when using SSL with MQ V6  
SE32282  MQM400 cannot successfully remove a queue manager from a clusterwith the spdmqmclqm followed by chgmqmchl .. chltype(*clusrcvr)  
SE32853  MQM400 it is not possible to create teraspace enabled c++ program so we see CPD5CCF similar to SA96347  
SE33397  MQM400 *public authority change for content in directory /qibm/proddata/mqm/inq in WMQV6 from WMQV5.3  
SE33722  MQM400-INCORROUT MQ 6.0.2.3 may fail to start AMQ7432  
SE34580  MQM400 56-BIT export cipherspecs do not work in WMQ 6.0 and I5/OS  
SE34587  MQM400 strmqm fails with probe AL008000 on OS400 with OS version V6R1M0  
SE34588 MQM400 dspmqmsts and AMQ7460, AMQ7462 in the queue manager error log shows as ******** on V6R1M0 OS 

Fix Pack (V6.0.2.4)  
iSeries fix release date: 26 June 2008
Last modified: 26 June 2008
Status: Available

iSeries download information 

Windows and UNIX fix release date: 30 May 2008
Last modified: 30 May 2008
Status: Available

UNIX and Windows download information     

APAR Description 
IC53540 Queue manager fails to start up and FDC with probe ID XY338011 is generated after amqzmgr.exe is terminated.  
IC53578  WMQ explorer displays invalid date/time for a message having blank putdate or puttime  
IC53819  "mq java/jms application gives a peer name mismatch error when the exit-list length is greater than 8"  
IC53960  Channel shortretry count will not fall after moving into retry state even though many attempts were made to restart  
IC54088 Automatic startup settings for command server are lost when migrating from MQ V5.3 to MQ V6 in an MSCS configuration 
IC54095  FDC with probe ID XC368002 and error 'winnt error 5 from duplicatehandle' is generated when a queue manager is ended.  
IC54182  Access denied error on an openprocess call during an mqconn by a customer application.  
IC54292  WebSphere MQ queue manager in MSCS cluster ends unexpectedly  
IC54346  Closing a jca session does not close its producers and consumers  
IC54459  Channel stays in binding state for a long time when it contains an invalid conname value  
IC54584  PCF message write method does not set MQMD fields to default values for PCF messages  
IC54585  Jca: wmq.jmsra.ivt.ear contains an invalid <method-name> value in wmq.jmsra.ivt.ear file ejb-jar.xml  
IC54608  Channel statistics messages have queue manager name padded with zeros instead of blanks  
IC54678  Display conn command in runmqsc failing to find any matches whenfiltering by conname  
IC54711  PCF escape command with dis lsstatus for a listener of 48 chars returns additional chars in listener name error message  
IC54888  Queue manager restart fails with HL080077 following outage when system has been under severe resource constraints  
IC56162  "dspmqras.exe folder" file is incorrect. should be dspmqras.exe  
IZ04683  JMS client connections not closed in asynchronous mode when queue is manager stopped.  
IZ05023  FDC with probe ID XC348015 from xlsrecoverthread on hpux  
IZ05045  FDC generated with probeid ZC004063 when using single threaded agent  
IZ06097  Mqconnx fails with reason code 2409 when version 1 mqsco is usedwith WMQ 6  
IZ06131  Setmqaut fails when authority records are not yet created or are missing  
IZ06425  The WebSphere MQ signal handler is exiting when a sigsegv occurs, preventing a WMQ application from producing a core file.  
IZ07198  Queue manager unresponsive, with a probe XY441020 FDC from function xstaddconnectedthreads.  
IZ07206  An application connect during queue manager startup may cause startup failure with likely FDCs ZF095010 and RM185002  
IZ07297  Improved error handling during refresh cluster command when clusnl() attribute points to a namelist that doesn't exist.  
IZ07778  The runmqsc changes to amqclchl.tab may not be inserted in the correct ascii-order. also, amqclchl.tab will never shrink.  
IZ07803  Following the installation of 6.0.2.2 the customer may experience java.lang.classnotfoundexception problems  
IZ07905  Delays in getting messages when using the WebSphere MQ JMS client in bindings mode.  
IZ08014  Illegalstateexception calling createconsumer  
IZ08018  Strmqm fails for single threaded MQ queue manager on Linux  
IZ08180  Qmgr not reporting the appropriate warning if amqrsyna.dat file is corrupted.  
IZ08596  XC034002 in xcswaiteventsem shortly after zlaperformhealthcheck returns xecp_e_invalid_pid.  
IZ08748  Deleting queues while PCF inquirequeuenames command is executing, causes an FDC with probe PC024010  
IZ08754  Failing publish not retried correctly, stream repeatedly restarts without republishing.  
IZ08783  Probe AD028004 FDC from adiopenfile (RC=24 from open) in AMQZXMA0, when migrating to V6. this is a file descriptor leak.  
IZ08829  Pcf: mqcmd_inquire_channel_names fails if more than 628 channels. can cause probe XC006001 FDC or hang command server.  
IZ09144  WMQ 6.0.2.2 FFST RM056000 from riitriggermessage with error rrcw_already_started  
IZ09338  In the case of queue manager recycled infrequently, the number of accumulated channel stats records becomes very large.  
IZ09419  Improve AMQ9565 error message  
IZ09519  Improving queue manager's robustness when application is using non-unique groupid/msgseqnumber/offset/report-type combinations  
IZ09591  Endmqlsr fails with 'no WebSphere MQ listeners for queue manager' if listener launched using runmqlsr  
IZ09658  AMQZMGR0 does not clean up shared memory in a timely fashion when it is the last process to reference that memory  
IZ10060  Inconsistent treatment of cluster qmgr alias 2087 mqrc_unknown_remote_q_mgr  
IZ10757  Amqrrmfa terminates with error rrci_clus_no_clusrcvr_defined  
IZ10800  Rare command server sigsegv in function pcmbuildmsgparms when executing PCF command mqcmd_inquire_q_status.  
IZ10832  Jca: a nosuchmethod exception is thrown when using WebSphere jca with sun application ee  
IZ10869  Occasional deadlock when issuing stop conn runmqsc command (or PCF equivalent).  
IZ11153  WebSphere V6 may not always consider the mqmaxerrorlogsize environment variable when writing error log messages  
IZ11458  Dis qs uncom is not a boolean value  
IZ11718  Sigsegv on Linux inquire listener status  
IZ12149  Cleanup code mistakes socket files as directories and does not delete them appropriately.  
IZ12159  Receiver channel might not decompress message when message is sent in multiple segments  
IZ12208  MQOpen fails with 2042, if the MQ application had previously failed with unexpected error, holding the object handle open.  
IZ12274  Unable to determine the exit status of internal processes started as queue manager services.  
IZ12283  Command server returning 2017 ( mqrc_handle_not_available )  
IZ12497  Probe KN272002 FDC reported from function kqiputaccountingqueue when a connection has a large number of open objects  
IZ12536  Incorrect output can be given by PCF mqcmd_inquire_auth_recs command. bad output can also be given by dmpmqaut and amqoamd.  
IZ12700  Very large FDC produced including an extremely large dump area entitled "active shared memory heap".  
IZ12739  Readme for WebSphere MQ maintenance did not clearly state product version and migration information  
IZ12795  Inquirechannel PCF command with channeltype parameter does not return complete channel details, only the channel description.  
IZ12796  Jms: a non thread safe JDK class is declared as a static  
IZ12827  FDC written by WebSphere MQ explorer if mqs.ini file has stanzasthat refer to unavailable locations in filesystem.  
IZ13135  The JMS message.setjmsreplyto() method does not work correctly when using WebSphere MQ resource adapter (ra).  
IZ13247  Imqqueuemanager::connect() fails when mqcsp authentication is used even when IY92447 is applied.  
IZ13512  WebSphere MQ loops when queue manager ini file is corrupted  
IZ14279  Queue manager time calculations must use thread-safe system calls  
IZ14399  Queue managers rejoining clusters with IY99051 have incorrect sequence numbers, changes may not be published in the clusters  
IZ14732  WebSphere MQ slow performance on HP-UX 11.31  
IZ14812  Sigsegv in xcsreleasemutexsem called from rrxreporterror when using mq_channel_suppress_msgs.  
IZ14977  Missing cluster information when namelists are used to add and remove queue managers from multiple clusters at once.  
IZ15279  Jmsreplyto field in the MQRFH2 header does not specify replytoqmgr name unless it has been manually set  
IZ18598  Client mqconn can cause program to end if already connected or in a variety of error conditions  
IZ23320  Defects fixed in WebSphere MQ Fix Pack 6.0.2.4 
SE29894  MQM400-RC545284153-MSGAMQ5011 WMQ V6 strmqm fails to start  
SE30113  MQM400 - WMQ listener jobs started before queue manager start are not terminated when queue manager is quiesced  
SE30379  MQM400 inconsistencies noticed for the sslpeer attribute of chgmqmchl  
SE30398  MQM400 unable to use clrmqmq command without *altusr authority  
SE30649  OSP-MSGAMQ9507 MQM400-AMQ9507 for *sdr channel after V6  
SE30754  MQM400- MSGMCH3402 amqrrmfa abend during queue manager startup  
SE30965  MQM400 WMQ channel problems with hrtbtint > 999999  
SE31023  MQM400 - V6 installation grants ownership of objects to  
SE31209 MQM400-INCORROUT wrkmqmq - filtering the type of queue - F11  
SE31214  MQM400 ile RPG copy book contains duplicate entries  
SE31933  MQM400 runmqchl joblog do not contain any useful error message.  
SE31958 MQM400 dspmqmlsr output(*print) does not output spoolfile  


Fix Pack6.0.2.3 (V6.0.2.3)  
iSeries fix release date: 11 March 2008
Last modified: 11 March 2008
Status: Available

iSeries download information     

Windows and UNIX fix release date: 1 February 2008
Last modified: 31 January 2008
Status: Available


UNIX and Windows download information     


APAR Description 
IC52223  Access violation FDC XC130031 in the amqfcxba process.  
IC52320  FDC with probe XY470022 is incorrectly cut if connectnamedpipe returns zero and the getlasterror returns error_pipe_connected.  
IC52378  MSCS times out in lookalive checks leading to failover of cluster resources through mqmterminate.  
IC52523  Resolve channel action(commit) fails to resolve a manually defined clussdr channel.  
IC52603  Amqrmppa thread calls mqback instead of xa_rollback in an XA environment  
IC52619  Executing stop connection leads to an access violation in the execution controller and the queue manager crashes.  
IC52632  Inquire queue status PCF command incomplete response  
IC52674  Error AMQ7017 occurs when trying to start a queue manager and anfdc is generated with probe ID ZX000001  
IC52684  Channel compression from z/OS to Windows results in message data with 4 extra bytes  
IC52709  Qmgr is reporting the following probes: HL008001 hlgsetlogrestartlsn and AL020000 almsetoldesttranlsn  
IC52770  New schemes defined in WebSphere MQ explorer are lost when the WebSphere MQ explorer restarts.  
IC52821  Client side exits cannot be loaded through MQExplorer using a CCDT 
IC52823  "the handle is invalid" error appears in the file specified as stderr in WebSphere MQ custom service definitions.  
IC52862  WebSphere MQ explorer fails to import scheme settings that have been previously exported.  
IC52873  Automatic refresh of the queue manager cannot be disabled in explorer.  
IC52954  SSL channels defined with sslpeername do not start after migration from MQV5.3  
IC52989  Clussdr channel can be deleted even though channel is in-doubt  
IC53034  Japanese characters are corrupted in fix pack 6.0.2.1 installation dialogs  
IC53065  WebSphere MQ queue manager fails with probe XC037003 when started  
IC53107  Migration failure if incorrect backup folder is selected during fix pack installation.  
IC53158  Gskit fails to open key database correctly when invoked from explorer using a non administrative user ID.  
IC53189  The accumulated authority records displayed in WebSphere MQ explorer through find authority sub-menu are incorrect  
IC53192  Mqwin-wmq services performance counter fails to restore with AMQH0001.H not found  
IC53204  WebSphere MQ V6: repeated queue damaged instances  
IC53255  MQ SSL migration from V5.3 to V6 results in 'chain incomplete' during 'check WMQ store cert wizard'  
IC53266  Listener process does not end on queue manager termination despite using a listener object configured with control(qmgr)  
IC53286 Unable to compute putdate and puttime when any one of the fields is set to blank 
IC53291  Runmqdnm process fails with mqrc_options_error when monitoring a queue on tpf  
IC53318  Queue manager fails to start on MSCS with FDC MC033007  
IC53368  WMQ explorer fails to display queue manager objects during it's startup due to the error in creating the navigator view  
IC53379  Memory access violation errors seen in mqconn  
IC53384  Channel status data displayed incorrectly in WMQ explorer  
IC53403  Performance slow on mqput/mqgets and AMQZLAA0 cpu usage is high when running 100'S of application threads/processes.  
IC53429 Amqsmon shows 0 in putbytes field despite having put messages with MQPUT1 call  
IC53457  Changes to preofflinecommand and postonlinecommand parameters for an MSCS resource result in 'the data is invalid'  
IC53508  Environment variable name containing special symbol is not added through service.env file  
IC53514  XA client hangs while connecting to the queue manager  
IC53533  MQ explorer fails to browse the queues of a queue manager which is connected through SSL client channel definition table  
IC53545  The recommended method for removing oam entries  
IC53651 Handle leak when exclude or suppress messages is set  
IC53676  Error messages AMQ7315 or AMQ7316 are logged when any accountingor statistics message is put to the accounting/statistics queue.  
IC53716  Amqoamd command displays inconsistent results for '+none' authority.  
IC53764  Nullpointerexception error received when tracing level set to 5 and messageid is null 
IC53782  Amqrrmfa terminates when the queue manager object configuration contains an invalid value  
IC53817  Setmqprd silently fails to convert a trial version of MQ into a production version.  
IC53842  Queue manager terminates unexpectedly with FDC AD031001 and error_invalid_user_buffer from adiwritefile.  
IC53936  FDC with probe ID RM400001 and error code rrce_bad_parameter generated upon termination of a client channel.  
IC53962  Application successfully opens cluster queue object on remote queue manager despite specifying an invalid user ID as altusr  
IC53967  Poor performance with large messages on Windows clients using non-blocking tcp model  
IC53974  MQSeries service fails to start with error 1053. strmqm hangs. FDC with probe ID ZX005025 generated.  
IC54121  Cluster channel in retrying state would not start after command: stop channel(<name>) mode(quiesce) status(inactive)  
IC54142  Thread handle leak from MQ com+ (mts) layer  
IY92377  Destseqfactor incremented for clusrcvr on remote queue manager causes unexpected results in workload balancing  
IY93129  MQ is not honoring the value specified for errorlogsize in the qm.ini file.  
IY95538  2080 is issued when MCA agent is trying to get messages from system.cluster.transmit.queue. no messages are written to the log.  
IY96630  A deleted damaged object prevents advancement of WMQ logs. it further causes dltmqm to loop consuming log space.  
IY96836  Environment variable for changing the order during a 2 phase commit  
IY96879  Messages can arrive in incorrect order via cluster channel  
IY97159  Repository manager process tries to access the cache while restoring the cache, resulting in a hang.  
IY97173  Amqiclen failure when there is a non-mq file in qmgrs directory  
IY97428  SSL peer name in client channel table can fail to be matched correctly against SSL peer name in server-side certificate.  
IY97558  Agent process does not return from a MQGet call for a period of time significantly larger than the time-out specified  
IY97580  FFST probe XC130003 in ccxsend on server side of SSL channel  
IY97736  Channel fails to start with error AMQ9519 (rrce_channel_not_found)  
IY98258  Not setting user name in MQEnvironment gives 2035 error for WMQ v6 
IY98550 Customer user exit gives security exception when Java2 security is enabled 
IY98585 Lower case userid being passed to z/OS v1.6 with Websphere MQ 6 
IY98620  Queue manager hang during define object when an API exit is installed.  
IY98777  WebSphere MQ hangs after rc=stop from a call to xlllistenselectacceptandclose  
IY98973  Memory deallocation failure due to thread mismatch when pipelining is used  
IY99050  Rcdmqimg fails with AMQ7084 for a queue containing segmented messages  
IY99051  Queue manager unable to re-establish cluster membership  
IY99057  MQ error handling in XA protocol violations where xa_end() called for transaction not associated with current thread  
IY99181  Amqrfdm at WMQ 6.0 does not work when using isolated bindings  
IY99200  If the execution controller fails to create a new agent process, it will sigsegv (from WMQ 5.3 CSD13 or 6.0.1.1).  
IY99415  Extra validation for data received from tcp/ip to handle case where tsh header overwritten during transit  
IY99425  32-BIT applications reserving large amount of heap memory and/or shared memory segments fail to connect to WMQ V6.X qmgr  
IY99591  Highly intermittent probe XC271004 FDC, reported from very short-running processes such as dspmq.  
IY99598 An extremely intermittent probe ZI032002 FDC from function ziistophealththread is reported.  
IY99683 Corrupted message header in dead letter messages produced by JMS Client 
IY99847  WMQ error log file rollover is incorrect for some non-mqm applications on UNIX  
IY99912  MQ adds too many log extents to the log file header at MQ shutdown resulting in subsequent qmgr restart failure.  
IZ00315  Mqmessage class in .NET API does not allow the offset and originallength fields to be set.  
IZ00349  Locking problem where we try to write an accounting record whilst already holding a lock on a non-queue object.  
IZ00380  Correct parsing of MQ exception handler override commands.  
IZ00609  XC034071 in xcswaiteventsem with error code einval (22)  
IZ00896  WebSphere MQ messages occasionally delivered out of sequence under stress on MQ V6  
IZ00993  SYSTEM.BROKER.IQ.1.4 been referred to as a stream queue by a user application.  
IZ01058  XC130004 sigsegv when restoring cluster objects during queue manager start up  
IZ01151  Probe XC015010 raised by checkpoint task during endmqm processing  
IZ01402  Broker fails to start or publish or subscribe.  
IZ01512  A failure to start a timer thread when waiting for a critical mutex causes queue manager failure. 
IZ01580  A timing window results in two threads using the same semaphore to suspend/resume leading to hangs and unpredictable results.  
IZ01599  Sigsegv in kqiwakeupwaiter when waiter state reset during stop channel command  
IZ01794  WebSphere MQ queue manager fails to start when transaction's first lsn is not in active log after disk space issue.  
IZ01835  WebSphere MQ performance is poor when querying xqmsgsa at a time when system.cluster.transmit.queue is deep.  
IZ02442  Child broker fails to restart after breaking parent relationship  
IZ02497 Receive() with unified factories fails to find messages 
IZ02512  Migration to WMQ 6.0 may corrupt clustering information  
IZ02573  WMQ 6.0.2.1 and 6.0.2.2 Solaris install sets erroneous 777 file permissions in /var/sadm/pkg  
IZ02777  Queue manager fails to start. FDC with probe AT013011  
IZ02915  XC130004 FFST in apiunlockexclusive, when attempting to record a media image of a damaged queue.  
IZ03090  Dis qs does not show userid after appltype(system)  
IZ03209  When message is gotten by application with RC=2079, lgettime/lgetdate fields do not get updated.  
IZ03678  Mqcmit for a long running transaction generating an mqrc_unexpected_error return code.  
IZ04163  WMQ channel connection fails with AMQ9213 error 22 over IPV6  
IZ04394  Queue file corrupted crossing 4MB boundary and no space avail.  
IZ04523  Amqiclen does not clear trace control shared memory  
IZ04767  Tcf_last with no tcf_first should be detected by WebSphere MQ  
IZ04821  Crtmqm fails on WMQ 6.0.2.2 when mqsprefix is set or if mqs.ini defaultprefix is set to a location other than /var/mqm  
IZ04971  Missing attributes in the queue manager log stanza causes queue manager restart to fail.  
IZ05005  AT004018 possible when simultaneous activity from separate threads attempt to complete an XA transaction concurrently  
IZ05013  Using data compression on a channel, channel ends with FDC with probe ID CO052100 generated on receiving side qmgr  
IZ05057  Qdphiev event messages sent to system.admin.perfm.event queue incorrectly  
IZ05176 Websphere MQ resource adapter IVT ear file not shipped on UNIX 
IZ05307  Dspmqtrn formats XA xids incorrectly  
IZ05527  When browsing a queue with mqgmo_browse_next the first message ever put to the queue can be skipped.  
IZ05653  AMQZMUR0 fails with XC006001 and xecs_i_private_memory_error  
IZ05792 MQRC_NO_CONTEXT_AVAILABLE returned from attempted MQPUT to dead letter queue after queue backout threshold reached 
IZ05950  Generating non-unique groupid's when using message segmentation,can cause a queue manager to crash.  
IZ06614  No authority event generated after mqrc_not_authorized returned to mqconn, despite enabling authority events.  
IZ06672  WMQ queue manager error log locking can be briefly compromised, but with typically no impact.  
IZ07210  Abstract 2035 if client user name > 12 chars and valid mcauser  
IZ07794  Defbind not honoured when alias queue resolves to a cluster queue  
IZ09339  Averagequeuetime (mqiamo_avg_q_time) displays negative values  
IZ09383  Occasional fdc's with probe KN072085 when applications specify mqgmo_msg_under_cursor with mqgmo_wait.  
IZ10294  Incorrect authorization check when putting directly to an MQ transmit queue.  
IZ13990  Defects fixed in WebSphere MQ fix pack 6.0.2.3  
IZ14160  Defects fixed in WebSphere MQ fix pack 6.0.2.3  
SE28711  MQM400 - MSGMCH3601 when attempting to display message AMQ6106 for xcsdisplaymessage  
SE28838 MQM400-THREADS-UNPRED MULTITHREADED JAVA APPLICATIONINTERMITTENT FAILURE 
SE28867  MQM400 wrkmqm option 22 receives msg CPF3C53.  
SE28943  MQM400: improve diagnostics when strmqm hangs with X'00000D49'  
SE28955  MQM400 queue becomes damaged at least 2 to 3 times a week.  
SE29272  MQM400 AMQ6903 received on install of fp 6.0.2.1  
SE29323  MQM400 chgmqmprc or crtmqmprc fails to record values of usrdata starting with asterisk ( * )  
SE29410  MQM400 strmqm fails with AMQ7432 arce_log_recd_not_found  
SE29414  MQM400 - P0000* and/or S0000* files in qmqm library not removed when queue manager quiesced  
SE29524  MQM400 amqcrsta job does not run under user-defined subsystem  
SE29638  MQM400 - command server for clustered queue manager failing with MCH0601 on starting up MQ explorer  
SE29710  MQM400: rcdmqmimg does not log messages AMQ7460 and AMQ7462 in the queue manager message queue (qmqmmsg)  
SE29770  MQM400 : queue manager migration failed with MCH0601 and channels are missing.  
SE29795  MQM400 mdv V6.0.2.2 testfix for IY99050 and SE28955  
SE29796  MQM400 mdv V6.0.2.2 testfix for IY99050 and SE28955  
SE29844  MQM400 lodptf of fixpack 6.0.2.2 fails with AMQ6903  
SE30262  MQM400-QUEUE manager with pending transaction fails to start  
SE32195 MQM400 MCAUSER *PUBLIC NOT RECOGNIZED IN CRTMQMCHL or CHGMQMCHL - FOLLOW ON FROM THE APAR SE27576 

Fix Pack 6.0.2.2 (V6.0.2.2)  
iSeries fix release date: 14 September 2007
Last modified: 14 September 2007
Status: Available    

iSeries download information 


Windows and UNIX fix release date: 24 August 2007
Last modified: 22 August 2007
Status: Available

Windows and UNIX download information 


APAR Description 
IC50588  FDC AQ109001 from aqhlogicalmsglock during browse,lock of a segmented message. 
IC51002  WebSphere MQ explorer tests plug-in displays incorrect attribute and refers to a non-existent channel object.  
IC51005  Mqrc_security_error returned on a PCF inquire authority records command submitted with invalid parameters.  
IC51054  "mqrc_storage_not_available error due to the invalid data length returned from the MQGet of the message.  
IC51126  MQ explorer does not accept symbols @,#,$ when trying to connect queue-sharing group. 
IC51315  WebSphere MQ rolls back prepared transactions if msdtc goes down. 
IC51322  Code changes for optimization of default conversion. 
IC51324  WMQ invokes enlistwithdtc to enlist with the transaction leading to a number of calls in msdtcprx.dll causing a deadlock.  
IC51350  Warning AMQ8075 logged when a user with an ID of more than 12 characters issues the change channel command. 
IC51397  Server file transfer application does not display remote queues of the queue manager.  
IC51404  Windows MQ fails to shutdown if shared memory server dies. 
IC51408  FDC with probe ID JP707000 generated when running test plug-in of V6 MQ explorer.  
IC51434  WMQ V6 explorer fails to refresh after queue manager is stopped and restarted from command line. 
IC51439  Log write operations do not retry when an error_lock_violation return code is returned on Windows.  
IC51451  Access violation occurs when printing out the object descriptor and associated object records in trace of MQOpen  
IC51472  XC090001 and xecf_e_invalid_parameter  
IC51497  Get complete msg fails with mqrc_match_options_error. 
IC51589  Runmqdlq suffers XC130031 access violation when it encounters a null message on the dead letter queue.  
IC51598  Damaged object following log errors caused by a sharing violation on the log itself. 
IC51667  XA_PROTO errors occur when WebSphere MQ is used as a JMS provider with WebSphere application server.  
IC51721  Unable to rebuild a syncfile using rcrmqobj if using circular logging  
IC51889  Soap/WMQ client C0000005 access violation. 
IC51904  An FDC with probeid XC130031 is generated in amqrmppa process. 
IC51952  Mqrc_truncated_msg_failed error from MQGET call even though enough buffer is supplied.  
IC52019  Optimize code where MQ reads the sync file to get the saved channel status. 
IC52177  Channel process reports incomplete error messages if it encounters a TCP error during connect, send and receive. 
IC52193  AMQ9519 when starting auto-defined clussdr channel. 
IC52257  Access violation FDC XC130031 in the command server (amqpcsea.exe)  
IC52322  Queuemanager service does not start a program with extension .cmd or .bat if the stdout parameter is not set. 
IC52411  6.0.2.1 install fails on 64 bit Windows under terminal services. 
IC52445  MQ .NET client applications cannot put or get messages greater than 4MB.  
IC52687  SSL certificate import or migration fails on 6.0.2.1 Windows. 
IC52757  WMQ V6.0.2.1 client installation fails with error code AMQ4739. 
IY87626  Resolve channel on an auto cluster sender channel ( in indoubt state ) fails with xecl_e_invalid_param.  
IY88024  The API exit structure ID is blank and version is 0.  
IY89755  Very small timing window whereby pipelined channels ("dual unit of work", duow) can fail with probe CO000002 FDCs  
IY90059  Clwlprty value changes if namelist is changed.  
IY90521  WMQ listener on hpux stops when an accept call fails with return code 233 enobufs.  
IY90873  FFST with probeid KN111000 continually generated reports mqrc_stopped_by_cluster_exit  
IY91269  FFST with probe XC006001 when using channel exits  
IY91348  Reset non-existent channel in runmqsc results in memory fault and SIGSEGV.  
IY91385  Man pages for crtmqm don't reflect new V6 values for logfilesizeand number of logfiles. 
IY91510  MQJMS2002 jmsexception thrown when using multiple consumers with selectors on a transacted session.  
IY92011  Runmqsc "dis q(*)" doesn't display all queues if one or more of the queues are damaged.  
IY92016  PCF inquirequeuenames command does not handle damaged objects gracefully.  
IY92051  Amqoamd command generates an erroneous attribute for setmqaut. 
IY92141  Load exit fails with error 'cannot open shared object file: no such file or directory'. 
IY92192  .NET client argumentoutofrangeexception when compression is used. 
IY92194  Dis chs fails on clusrcvr channels when using conname limiter. 
IY92196  Xstdisconnectextent causes FFST for applications using fork without exec. 
IY92390  The malformed or unexpected message handling fails with 2098 error in asynchronous message delivery scenario.  
IY92441  Bothresh ignored in JMS applications when using alias queues.  
IY92447  Userid and password not passed across channel.  
IY92471  Nullpointerexception when using MQ JMS client in mqjms_tp_direct_tcpip mode with SSL  
IY92929  Logger process amqhasmx (AMQZMUC0 in V6) could eventually run out of file descriptors some time after a disk full condition. 
IY92963  SIGSEGV in preparedumpareas during crtmqm and resulting core file. 
IY93155  Channel triggering fails due to object_already_exists error. 
IY93324  MQJMS2003 gets generated with a nullpointerexception.  
IY93408  Mqbegin fails with SIGSEGV or SIGBUS when an Oracle session is killed. FDC file contains probe ZM008001 and XC130003  
IY93415  MQ explorer could hang when selecting services and the reply queue fill up if multiple messages are returned from AMQZMGR0  
IY93506  SIGBUS from dmpmqlog commands. 
IY93655  Performance problems related to resolving high lock contention. 
IY93707  Wmq/cics application returns 2035 (mqrc_not_authorized) error from mqconn call in isolated bindings mode.  
IY93752  Excessive DNS lookups. 
IY93881  Potential SIGSEGV if AIX is very slow at scheduling.  
IY94013  Poor performance loading deep queues containing both persistent and non-persistent messages.  
IY94250  Setting a zero interval via the mq_channel_suppress_interval environment variable causes SIGFPE FDCs in channel processes.  
IY94267  JVM crash when using MQ JMS XA in bindings mode in was 6.1  
IY94451  Queue manager alias definitions with alteration date more than 1 month old not successfully deleted from a cluster. 
IY94625  Amqfcxba suffering a SIGSEGV in function fkirestoresubscription or in function fkxderegistersubscriber. 
IY94674  Client applications continuously cycling thread connections may re-read mqs.ini many times.  
IY94700  Hang in channel processes such as runmqlsr, amqrmppa, runmqchl.  
IY94811  Nullpointer exception occurs if WebSphere application server trace is disabled.  
IY94832  Duplicate subscriptions with different subname's cause SIGSEGV in amqfcxba process.  
IY94876  UNIX file descriptors being left open while generating FDC for asynchronous signals.  
IY94977  Queue manager fails to start throwing log not available error following an abrupt failure.  
IY95005  Windows client channel compression settings ignored. 
IY95078  Long delays in multi-threaded puts in WMQ c++ client. 
IY95181  JMS null valued causes exception in MQ JMS application. 
IY95255  Probe: XC130004, SIGSEGV: address not mapped, function: aqhaddmsg. problem writing temporary dynamic queue object.  
IY95370  Synchronization issue with the jmsconnection hangs shutdown of application server. 
IY95485  Queue manager is ignoring requests for more than 253 secondary log files on UNIX.  
IY95508  Channel statistics getting collected at the end of the configured statint after having turned off the statchl attribute. 
IY95513  Probe XC130004 FDC (SIGBUS or SIGSEGV) in xcsendgrent function.  
IY95544  SSL enabled channel will not run because MQ code is not able to access the channel status table.  
IY95555  WebSphere MQ authority event message incorrectly shows mqm as the user ID. 
IY95566  Runmqsc ctrl/ctrlx auth not honoured for non-mqm users. 
IY95706  Sigsegv immediately following return from zfudoesobjectexist XC130004 from amqzlaa0, amqrmppa, or amqfcxba.  
IY96055  Messages due to expire do not expire after a queue manager crash/restart or an HA failover.  
IY96066  MQ JMS publish/subscribe clean up does not process all applicable messages.  
IY96150  AMQ9509 and RC=2009 errors during endmqm -i, causing channels to fail abnormally.  
IY96282  Two threads have been concurrently allocated the same semaphore. 
IY96442  Message expiry report not resolved to cluster queue.  
IY96689  FDC's produced by asynchronous signal handler incomplete on HP/IPF (HP Itanium). 
IY96924  JMS applications hang when multiple threads try to connect to a queue manager which is not running. 
IY96959  Exitbuffer parameters not passed to mqxr_sec_parms. 
IY97755 SIGSEGV in function rfxaddclqmgr during queue manager startup 
IY98002  MDB listener port gives error with custom property CCDT URL set. 
IZ01272  Potential security exposure in MQ client channels. 
IZ03429 Defects fixed in Websphere MQ Fix Pack 6.0.2.2 
IZ13291 WMQ SSL Channel hangs in binding state. Stackit shows a hang in META_GENERATERANDOMSEED. 
SE26233  MQM400 PNGMQMCHL fails with MSGMCH3601, if user profile locale attribute is /qsys.lib/de_de.locale. (germany)  
SE26739  MQM400:PROBE CO052000 gets reported on server side for SSL enabled channels. 
SE27177  MQM400 dead letter queue handler fails with CPF0001 AMQ8750. 
SE27576 MQM400 MCAUSER *PUBLIC not recognised in CHGMQMCHL or CRTMQMCHL 
SE27700  MQM400 STRMQM fails to start queue manager with probe ZF089070. 
SE27911  MQM400 - MSGAMQ5522 and MSGCPF1151 generated on STRMQM for queuemanager when library QMQM in system portion of library list. 
SE28113  MQM400 jobs not ended after shutdown of MQ V6 queue manager. 
SE28167  MQM400 ENDMQM *ALL generates FDC with probe ID XY043007. 
SE28535  MQM400 - runmqbrk may suffer MCH3601 and reports probe XY353001. 
SE29192  MQM400 RSTLICPGM halts with AMQ6233. 


Fix Pack (V6.0.2.1)  
iSeries fix release date: 20 April 2007
Last modified: 20 April 2007
Status: Available

iSeries download information     


Windows and UNIX fix release date: 30 March 2007
Last modified: 28 March 2007
Status: Available Windows and UNIX download information   
APAR Description 
IC49453  Alert monitor task bar icon not getting hidden when "alert monitor icon added to task bar" is set to "no"  
IC49520  When running with WebSphere application server, a complete list of the indoubts are not passed on a resynchronization  
IC49616  FDCs with AT040010 and AT003001 while reusing the agent connection  
IC49717  MQ external (native) exits called by the Java client incorrectly  
IC49767  Sigsegv in amqrrmfa when processing rrmreallocmsgs, causing the amqrrmfa (repository manager) to terminate.  
IC49826  Compile error "byref argument type mismatch" in WMQV6 VB sample program amqscnxb  
IC49857  MQ rc = 2354 mqrc_uow_enlistment_error when using msdtc in MSCS environment  
IC49914  IP address not shown properly in the error logs when 'localhost' is used in conname of the channel definition.  
IC49977  WebSphere MQ V6 explorer unable to show queues for remote z/OS queue managers connected via an intermediate queue manager  
IC50201  MQExplorer security failure connecting to z/OS  
IC50215 Access violation in dllhost with VB or STA application component using WMQ  
IC50309  MSCS hang when restart threshold has been set.  
IC50327  FDC with probe KN101001 from kqicloseit during mqdisc  
IC50415  WebSphere MQV6 explorer crashes while updating repository information under queue manager properties  
IC50431  WebSphere MQ XA client exposes security hole in MTS and com+ environments  
IC50448  Unable to deserialize object (JMS1061)  
IC50453  AMQSTRG0 comments say it is using MQTMC2 but the sample says memcpy(&trig.version, " 1", 4): AMQSTRG0.C  
IC50499  Amqidnet.exe unable to locate component when installing only the WMQ Java client on Windows  
IC50536  Queue handle leak when accessing alias or remote queues on z/OS  
IC50734  Access violation in the amqmtsxatm.dll when msdtc.exe is interacting with MQ.  
IC50882  Runmqsc or PCF cannot match channel status entries to a supplied conname for inbound channels after 6.0.1.1.  
IC50901  Amqxssvn process prevents queue manager restarting under MSCS control.  
IC50958  Amqrspin.dll and amqsspin.c sample source do not handle exit reason mqxr_sec_parms leading to channel termination by exit.  
IC50992  Deadlock during queue manager shutdown if a service stopcmd is launched.  
IC51090  AMQ6119 access violation with FDC probe ID XC130031 when MQ client is connecting to queue manager using SSL channels.  
IC51145  Regression in 6.0.2.0 of apars IC48024, IC48213 and IC4746  
IC51658  Application fails to launch under Visual studio with 'the application failed to initialize PROPERLY(0XC0000008)'  
IY84659  Unusual sequence of requests on a thread currently associated with another transaction causes SIGSEGV.  
IY84998  Agent looping inside zlahealththread producing repeated probe XC330005  
IY85622  Sigsegv FDC with xpprundestructors on function stack.  
IY85632  Netprty ignored in workload balancing within MQ clustering.  
IY85679  Unending series of xecl_w_long_lock_wait FDCs from function xllosspinlockwaitlock, e.g. probe XY086003  
IY86287  Sigsegv in atxassociationcheckidle called (indirectly) from atxassociationremoveall.  
IY86322  Failure in "strmqm -r" during log replay  
IY86343  Connectionname passed to exit or conname displayed in runmqsc is 0.0.0.0  
IY86361  WebSphere MQ V6.0 accounting messages display incorrect info.  
IY86365  XC130003 SIGSEGV in WebSphere MQ conversion routine xcsconvertstring  
IY86395  WMQ messages written to AIX error log have a resource name that is not quite right  
IY86541  WebSphere MQ message returned even though correlid does not match  
IY86600  Accounting messages are not generated after exceeding account count interval.  
IY86701  MQJE082 exception after installing WebSphere MQ 6.0 extended transactional client  
IY86822  Timeonqavg, timeonqmin and timeonqmax accounting messages values are always set to zero (0).  
IY86827  Queue manager is not able to write to queue manager log files.  
IY86828  Delivery problem with two listeners on same-name MQ destinations  
IY86994  WebSphere MQ clussdr channel does not start for messages committed via xa_commit  
IY87162  Problems with stop channel mode(force|terminate) calls.  
IY87173  Memory leak within WebSphere MQ cluster repository process amqrrmfa  
IY87192  Mqrc_connection_broken and recursive imq_impl_disc_backout  
IY87310  FFST by xcssimplepipecleanup with probe ID XY490002. 
IY87523  Corrections to ensure proper mapping between euro-enabled CCSIDs and respective codesets  
IY87638  Queue handle leak when using producer.send(destination, message).  
IY87702  Sigsegv e.g. in amqicdir, due to a getpwnam failure (e.g. if LDAP fails). 
IY87749  Hang in malloc inside WMQ signal handler while handling exception in malloc.  
IY87797  MQ runmqsc creates FDC with probe ID XC267011 showing SIGPIPE.  
IY87802  Output of the amqoamd utility may have spurious trailing characters for any 48-character queue names  
IY87804  WMQ acting as an XA resource manager calls ax_unreg in some exceptional cases disallowed by the XA spec  
IY87834  A base Java application works fine on MQ 5.3, but when run on MQ6.0 the application returns mqrc_iih_error.  
IY87844  Avoid caching password entry lookup for the UID of a process, until a connect successfully obtains it.  
IY88140  MQ JMS map/stream/text messages publish with blank rfh.format  
IY88151  Problem supplying the userid information via the mqenvironment when using client channel tables.  
IY88246  A WMQ semaphore set with a semid of 0 may be created with wrong ownership/permissions, which can hang WMQ processes  
IY88283  Channels would not start after upgrading to WMQ V6  
IY88509  Channel with SSL enabled never goes to retry even when remote end is not reachable  
IY88514  Connections left open when an XA create session fails  
IY88551  Queue manager can fail to start without producing FFST failure report  
IY88573  Third quadrant enabled applications on HP-UX fail with FDC XY079022  
IY88873  Memory leak in queue session when creating message consumers.  
IY88948  WMQ channel fails to start with AMQ9202 on Windows  
IY88954  Poolscavenger is never started and so connections are not closed in a running JVM.  
IY89259  Nullpointerexception in getconnectionccsid() or spiget()  
IY89374  JMS clients should not assume the default persistence (defpsist)on the system.broker.control.queue is yes.  
IY89484  ZL000128 zlamain during endmqm after probe XC037008 xcsexecprogram from AMQZMGR0 AMQ6026  
IY89548  Java / JMS client native user supplied send/receive exits core dump on close channel, and also fail when resizing buffer.  
IY89674  Mqrc_dbcs_error returned to client when clustering used.  
IY89729  Passing a blank as a selector causes the listener to fail reporting invalid parameter.  
IY90046  Messageconsumer.close() may block indefinitely if there are many messages on the queue that do not meet selection criteria.  
IY90227  Client channel definition file created from WMQ 5.3 gives error when read from a WMQ V6.0 Java client when CRL is configured  
IY90244  Heap corruption can cause MQ exception handler to hang.  
IY90460  Amqrmppa process will not release threads causing high resource utilization.  
IY90548  Not being able to modify the 5000MS timeout period used while retrieving messages (internal chunktime value)  
IY90566  Cluster workload algorithm excludes local instances of cluster queues when putting via alias queues  
IY90707  Get(disabled) on an alias queue does not wake up "waiting" gets  
IY90712  Sigsegv in function rfxaddclqmgr during queue manager startup when migrating to WebSphere MQ V6.0  
IY90995  An FDC reporting a SIGSEGV occurs if the queue manager attempts to obtain the group entry for a group ID that is undefined  
IY91959  If the clusrcvr channel is stopped, when it is re-started the repository cache is not properly updated causing workload issues  
IY93095  Performance hit when user generated msgIDs are used rather than MQ generated msgIDs.  
IY93381  Incorrect DST information in Java SDK 1.4.2SR5 included in MQ 6.0.2.0 refresh pack for HP-UX on PA-RISC and HP-UX on Itanium  
IY94369  A SIGSEGV can occur in function xtrestablishtracestatus when an application makes MQ C API calls using JNI  
SE25512  MQM400 -service program MQJBDF02 and MQJBND05 are built with terespace attribute set to *none  
SE25969  OSP-MSGAMQ8135-PAR AMQ8135 not authorized when using *allobj profiles with hash ( # ) character  
SE26291  MQM400 duplicate SSWC entries in cmqcfg rpgle copy file. the customer cannot compile their PCF/ILE RPG applications.  
SE26751  MQM400 wrkmqm option 22 results in CPF3C53  
SE26780  MQM400 - strmqmlsr with IP address specified fails with AMQ9248 message. 
SE26805  MQM400 ship new PTF exit program amqiptfx  
SE26834  MQM400-OWNERSHIP of error logs incorrect after the batch submission (sbmjob) of ENDMQMLSR.  
SE26835  MQM400 Checking 'WMQ is active' during un-install deos not happen.  
SE27019 MQM400 STRMQM fails when the SYNCQ is damaged. 
SE28140 MQM400 STRMQM of V6 queue manager fails with CPF706D in AMQZXMAX 
SE28286 MQM400-WRKMQM remains inputminhibited when enduser has primary group profile (QPGMR) > 4000 endusers 
SE28327 MQM400 : RCRMQMOBJ fails with unexpected error on WMQ V6.0.2.0 
SE28421 MQM400-ENDMQM MQMNAME(*ALL) fails with error number 3027 (operation not permitted) after WMQ V6 installation/migration. 
IY93389  Defects fixed in WebSphere MQ fix pack 6.0.2.1.  


Refresh Pack (V6.0.2.0) 
iSeries fix release date: 07 November 2006
Last modified: 07 November 2006
Status: Available

iSeries download information     


Windows and UNIX fix release date: 16 October 2006
Last modified: 27 October 2006
Status: Available

Windows and UNIX download information   
APAR Description 
IC48241  Messages received using messagelistener with durable subscribers are getting backed out once the application ends  
IC48295  Mqwindows-incorrect output for user-defined filter based on the overall channel status attribute for channels  
IC48397  Exits are deleted when migrating from WebSphere MQ V5.3 to V6.0 on Windows platforms.  
IC48478  MQJMS3023 in pub sub application after quiescing and restarting the queue manager.  
IC48512  FDC probe KY322000 indicates incorrect configuration of DCOM object  
IC48555  JMS pub/sub cleanup utility fails with 2009 in SSL environment  
IC48576  Program exceptions in Visual BASIC (vb) applications using the administration API.  
IC48662  Convert chained header to UCS-2 returns wrong reason code  
IC48678  Listener status inquire fails if the listener name length is 48  
IC48680  Stopping the first instance of a receiver channel which is connected to multiple sender channel throws AMQ9533 error.  
IC48696  Java .lang . nullpointerexception is received when the path specified for explorer Java trace is incorrect  
IC48705  WebSphere MQ creates FDC with probe ZF165008 from wasreceivedata  
IC48711  Registry key wrong for explorer excludemessages  
IC48721  WebSphere MQ explorer message browser tool does not display some Japanese character sets  
IC48727  WebSphere MQ message AMQ7227 does not clearly define possible causes.  
IC48775  MQ Java classes when run as a client fail to negotiate to an MQ server running pre-fap 4 level, and incorrectly use hbint value.  
IC48795  Message browser included in WebSphere MQ explorer (V6) limits browsing to first 500 messages on queue  
IC48803  AMQ9207 invalid data received on a channel following a timeout on the Windows platform. FDC with probe ID CO052000 is created.  
IC48880  Custom services fail to migrate during migration to MQ 6.0.  
IC48908  Channel name changed by exit is not picked up when using the MQ explorer.  
IC48913  Performance problem getting message from a large queue  
IC48914  Conversion from CCSID 819 to 912 fails on get/convert  
IC48919  Locking errors when attempting to delete an in-doubt channel  
IC48920  Repeated resrcmon.exe FDC's following MSCS related errors  
IC49003  Multi threaded C++ application hangs  
IC49005  Mqtcpsdrport environment variable not working with WebSphere MQ version 6.0  
IC49024  Explorer cannot connect to queue manager on HP-UX with CCSID 923  
IC49051  WebSphere MQ explorer fails to display queues when run as a non-mqm user  
IC49084  .NET sample code (for cs & cpp files) supplied with WMQ V5.3/V6 is missing close and disconnect methods.  
IC49093  WMQ Java client change in the response expected from a negotiation call which occurred around fap 4  
IC49150  DCOM security corruption on Windows 2003 SP1, XP SP2 and later  
IC49167  Data conversion error causes segmented messages to be dead letter queued (or return mqrc_format_error from mqget)  
IC49197  Performance problem on machines where MQ fails to calculate a required accuracy from the performance counters.  
IC49409  Clusrcvr disappearing at queue manager startup time.  
IC49431  Z or z in the mqmd.useridentifier field is incorrectly transcoded to j, causing authentication failures.  
IC49533  Mqrc_not_authorized received on mqconnx for a user ID passed in a mqcsp structure.  
IC49569  The extended transactional client receives FDCs with probe ID ZSL33001 and subsequently with probe ID ZS129001.  
IC49782  Mqwindows-incorrout memory leak when using a managed MQ dotnet client  
IC50156  Need one certificate to authenticate multiple clients  
IC50265  Java.lang.nosuchmethoderror: append" when processing JMS stream messages with WMQ 6.0.1.1 and was 5.0.2.  
IY80410  MQJMS2013 error when connecting to WebSphere MQ in bindings mode  
IY80806  MQ looping during startup when invalid attribute logbufferpager is located in the log stanza of the qm.ini file.  
IY80952  Memory leaks in clustered channels, and (V6) PCF filtering.  
IY81353  AMQ8135 mqrc_not_authorized errors when manually starting or stopping auto-defined clussdr channels.  
IY81358  Changes to MCAuseridentifier made by security exits not reflected in channel definitions  
IY81533  Connectionname is often not included in "channel SSL error" event messages.  
IY81628  MQJMS2005 reason code 2102 when 2035 mqrc_not_authorized occurred  
IY81661  Channel created using auto-definition exit has incorrect fields  
IY81671  XC308090 when xa_start and xa_rollback or xa_commit are issued concurrently for same xid.  
IY81714  MQMD userid put in upper case by V6 rcvr channel  
IY81774  WebSphere MQ as JMS provider not adhering to JMS specifications as per section "4.3.8 exceptionlistener".  
IY81875  Trace improvements for 'stop channels' functionality during queue manager ending  
IY81906  Migration to WMQ V6 sets new clwluseq queue attribute incorrectly  
IY81941  Very frequent channel starting and ending may cause excessive CPU usage in amqrmppa (channel pooling) processes  
IY81945  Hang in function kqiwakeupwaiter e.g. during quiesce endmqm  
IY82078  Sigsegv in aqpcopydatabuffers during MQPut to a dynamic queue.  
IY82241  If a correlid has been set as text, retrieving it as bytes returns null.  
IY82297  Rcdmqimg records image of damaged object leading to subsequent rcrmqobj failure.  
IY82419  Probe id's XC332070 and XC034071 from xlswaitevent.  
IY82629  Reset_iconv_table not found when installing client on HP-UX  
IY82779  Rcrmqobj of syncfile can result in misleading AMQ7047 message; also, references to rcrmqmobj should be to rcrmqobj.  
IY82806  Npmclass on local queue is not inherited from 'like' object.  
IY82834  When one or more /tmp/mqseries.[pid] files are deleted then endmqm can hang.  
IY82889  When using cross domain connection factory definitions in was V6 the TCPIP connection is not released during the cleanup.  
IY83093  Workload balancing temporarily unbalanced after a lot balancing  
IY83272  Amqsstop does not work on big endian machines.  
IY83321  Error messages logged by an application not in mqm group, may cause the system error log file to exceed the specified maximum  
IY83372  Xcsfreequickcell reports xecs_e_block_already_free in error.  
IY83535  MQ V6 can attempt to create more than logprimary + logsecondary log extents.  
IY83588  Extremely long running channel exits cause channels to end due to hbint not taking the exit time into account.  
IY83698  WebSphere MQ clussdr channels do not start if messages are put in a XA transaction, and xa_commit is called after mqdisc  
IY83704  Java.lang.unsatisfiedlinkerror trying to load LIBMQJEXITSTUB01.SO for MQ client only installations.  
IY83775  C++ client disconnect causes mqrc_hconn_error in non-C++ applications  
IY83778  Messages do not get rolled back in point to point domain when exceptions occurs in MDB  
IY84127  When QM runs out of resource and error logic is being exercised,mq calls xcsfreequickcellblock() instead of xcsfreequickcell()  
IY84356  Possible loss of queue manager data area if strmqm detects a problem with files or directories during startup  
IY84410  FDC with probe AT002001 is generated when atm.rmidgenerator reaches 2147483647  
IY84479  Code change for function "strptr" at line 541 of AMQSAXE0.C  
IY84777  A probe ZX033006 FDC with errorcode xecs_e_seg_in_use may be dumped from function zxccleanupwlmserver during endmqm  
IY84934  FDC showing SIGSEGV in function rrxconvertchannelfromdiskver or zxcrestoreobject when migrating from WMQ V5.2 to V6.0  
IY85202  2017 mqrc_handle_not_available is reported in WMQ at service levels 6.0.1.0 and 6.0.1.1  
IY85203  WebSphere MQ script reset_iconv_table may cause blank file name or incorrect permissions on conversion tables  
IY85541  AMQ9426 queue manager unable to rejoin cluster  
IY85542  Reset cluster does not remove deleted repository entry  
IY85562  A tcp send failure may retry indefinitely as of MQ5.3 CSD11.  
IY85620  Sigsegv in zxcrestoreobject on starting a WMQ 5.2 queue manager under WMQ V6.  
IY86606  Cluster subscriptions made for non-cluster queues  
SE24179  MQM400-AMQ8059 RC2292 (Unknown entity) on CRTMQM  
SE24271  MQM400 abnormal end of MQ application program may hang queue manager  
SE24477  MQM400 MCH0601 after multiple strmqmchl  
SE25214  MQM400 rfrmqmaut removes group members authorities  
SE27124 MQM400-AMQ8059 RC2292 (unknown entity) on crtmqm 
IY89836  Defects fixed in WebSphere MQ refresh pack 6.0.2.0.  


Fix Pack 6.0.1.1 (V6.0.1.1)  
Fix release date: 8 May 2006
Last modified: 13 October 2006
Status: Available

iSeries download information     


Windows and UNIX download information   

APAR Description 
IC45004  Performance slow downs and timeouts whilst using a queue status monitor  
IC46861  V5.3 to V6: cannot update API exit definitions for migrated queue manager using the eclipse explorer GUI.  
IC47255  When message selector is used with durable subscriber and application terminates abruptly, the messages are lost.  
IC47289  Deadlock on object catalog caused by performance events on system.auth.data.queue  
IC47462  Probe PC082099 from pcminquireclusterqueuemanager  
IC47466  WMQ version 6 explorer client SSL key stores remote administration  
IC47481  Multi-threaded client return mqrc_already_connected  
IC47528  Rcdmqimg taking long time to complete  
IC47557  Unable to start/stop auto-defined cluster sender channels using MQ explorer  
IC47595  Duplicate listeners in output of runmqsc dis lsstatus command  
IC47771  Channel terminated FFST RM487001 with memory access violation XC130031.  
IC47804  MQ MSCS resource fails to apply local mqm group permissions to the directories containing the queue manager data even after IC43947  
IC47879  When the MQ V6 explorer is used to manage a z/OS queue manager any lowercase userids are rejected.  
IC47881  XY324190 (getsubpoolslock) winnt error 6 from createmutex when debugging application under Windows terminal services  
IC47974  Trap due to a stack underflow in a COM+/MTS environment whilst continually connecting to a non-running queue manager  
IC47978  Timing problem may cause WebSphere MQ for Windows queue manager not to start in auto mode.  
IC48024  Error messages AMQ6125 & AMQ6183 are logged when displaying context menu of a remote queue manager.  
IC48031  .NET WMQ transport for SOAP client application receives no response when connected as MQ client to z/OS queue manager.  
IC48046  Mqmessageconsumer.receive(timeout) does not honour the timeout value when message selectors are used  
IC48069  Install of 6.0 client fails on XP without service pack 1 installed. Error message incorrectly reports AMQ4366.  
IC48143  Queue list incomplete from mqcmd_inquire_q if include mqiacf_cluster_info and cluster queues 
IC48156  AMQ9210: remote attachment failed  
IC48213  Max active channels displays misleading value  
IC48217  EOFexception occurs when calling mqmessage.readstring()  
IC48243  Unable to add remote queue manager where host-name contains under-bar character.  
IC48309  Com+ application hangs when connecting to queue manager  
IC48310  Xcsrefreshmtime cuts an FDC with probe ID XC457010  
IC48596  WebSphere MQ using incorrect user ID, and receiving mqrc_not_authorised (2035) errors, in a com+ environment  
IC48699  Deadlock within the MQ com+ layer following a connection broken return code in the extended transactional client  
IC49065  Using PCF, a non-mqm user can start/stop channels without having +ctrl authority  
IC49148  WMQ V6 hashtable port number property returns exception 'system.invalidcastexception' in amqmdnet.dll  
IY73649  Mqcmd_inquire_q_names, mqcmd_inquire_channel_names, AMQ2035 mqrc_not_authorized  
IY76063  Channels in stopping/binding state which cannot be stopped using stop chl(chlname) mode(force)  
IY76845  Failure to perform data conversion of an MQRFH2 which contains namevaluedata which is not aligned on a 4 byte boundary.  
IY77059  Probe id's MQ000010 and XY180010 following application of IY74420.  
IY77246  Mqchllib and mqchltab not honored by WebSphere MQ server.  
IY77448  Rare deadlock possibilities involving FDC reporting  
IY77769  Messages remain on the system.cluster.transmit.queue after the channel is suppressed by a chadexit.  
IY78390  Incorrectly built GCC 2.95.2 libraries shipped with WMQ V6.0 for Linux X86 platform  
IY78429  Channel process (amqrmppa) may fail due to bad data from pre- CSD10 Java client  
IY78438  Clustering: large numbers of subscriptions cause slowdown of repository cache creation (during strmqm); or recreation  
IY78634  MQExplorer does not retain 'automatic refresh' disablement.  
IY78636  In MQ explorer date and time are sorted as string  
IY78788  Sigsegv in aqsreleasebclist during queue manager shutdown.  
IY78836  Jmsadmin cannot find its configuration file  
IY79100  Clustering uneven message distribution during workload balancing  
IY79142  Channels fail with AMQ9631 when using a global server certificate  
IY79158  Rare timing condition in mqconn causes hang  
IY79226  Some processes (e.g. amqrmppa) may not recreate a trace file (.trc) if it has been deleted and trace restarted.  
IY79234  Xaer_rmerr returned by extended transaction client XA calls, following mqrc_another_q_mgr_connected from mqconn or mqconnx  
IY79288  FDC probe XC130003 reporting sigbus or sigsegv in broker function fkirestoresubscription  
IY79301  Incorrect installation of refresh pack 6.0.1.0 on HP-UX if /var/adm/sw/save does not exist  
IY79414  SSL distinguished name does not match peer name  
IY79457  WMQ V6.0 client channel fails when a mqcsp structure is specified on the mqcno passed into a mqconnx call.  
IY79616  Linux: small timing window can result in spurious semaphore unlock, giving FDC probes such as XC346012.  
IY79663  XC130004 sigsegv out of kpisyncpoint during an xa_commit call  
IY79668  AMQ9661: bad SSL data from peer on channel  
IY79906  After FDC with probe ID AD031001, component adiwritefile with comment RC=0 from write, the queue manager can fail to restart.  
IY79915  Write FFST if deleting a cluster object with a live subscription  
IY80064  Applications fail to connect to a queue manager. FFSTs with probe ID XY029001 dumped.  
IY80142  Amqoamd -s fails with mqrc_hconn_error on WMQ V6.0  
IY80247  Subscription to full repository manager object is deleted if there is a problem when the subscription is renewed  
IY80428  Amqrrmfa can end abruptly when two or more cluster receivers are defined with the same name within the same cluster.  
IY80596  RM550000 + XC130003 FFSTs in mqconn from 64-bit WMQ V6.0 client application using a ccdt containing old channel definitions.  
IY80863  WMQ V6 HP-UX (itanium) performance problems related to high workload.  
IY81580  Cluster workload exit reason not set for cluster PCF message  
IY81696  WebSphere MQ V6 on Solaris X86-64 connect or other MQ APIs hang when some MQ of the applications are 32 bit.  
IY81993  SNA channels fail at WMQ V6.0 due to missing library LIB64/AMQCC62A_R  
IY82062  Deadlock in COM+ when one thread issues an mqconn which fails, at the same time as another issues mqdisc  
IY82071  Wrong MQMD passed to the pre data conversion MQGet API crossing exit  
IY82794  Unable to set mapnamestyle (mnst) when defining connectionfactory using the jmsadmin tool.  
SE21231  MQM400 agent jobs are not being ended after FDC probe ZL000028 has been logged  
SE21437  MQM400 unable to restart queue manager. AMQ8041 on strmqm  
SE22928  MQM400 V6.0 - FDC probe XY353002 doing external commit - job AMQZLAA0 - MCH3601 *escape in libmqml_r zsqverifypcd seq #7  
SE23096  MQM400-MSGMCH3601 exception occurs in LoadExit when MQCD structure supplied by CHAD exit is not valid. 
SE23098  OSP qmname can't be defined in MQ clntconn channel with CL commands  
SE23241  MQM400 strmqmmqsc fails with -MCH3601 and CEE9901 exception for *DFT queue manager. 
SE23550  MQM400 crtmqm or strmqm receives MSGAMQ7155 and MSGAMQ7128  
SE24862  Strmqmmqsc under a profile authorized with qmqmadm group authority on iSeries fails  
IY89815  Defects fixed in WebSphere MQ fix pack 6.0.1.1. (part 1 of 3)  
IY90003  Defects fixed in WebSphere MQ fix pack 6.0.1.1. (part 2 of 3)  
IY90004  Defects fixed in WebSphere MQ fix pack 6.0.1.1. (part 3 of 3)  


Refresh Pack 6.0.1.0 (V6.0.1.0)  
Fix release date: 18 October 2005
Last modified: 13 October 2006
Status: Available

iSeries download information     

Windows and UNIX download information   
APAR Description 
IC45799  Probe XY051025 reporting duplicate AMQXCS2.DLL found. no duplicate exists. the path to the duplicate copy is empty.  
IC45816  FDC files with probe ids XC130031 and HL081010 but no message when the logpath is set incorrectly.  
IC45869  MQ service fails to stop, and trap occurs if a queue manager is deleted.  
IC45894  Cluster administrator or MQ MMCs hangs when an MSCS cluster contains more than queue manager resource or custom service.  
IC46074  Client channel to z/OS never times out after a connection drop  
IC46145  Mqrc_no_msg_available MQRC2033 when getting locked segments  
IC46192  RFH2 errors when attempting to connect WebSphere MQ V6 SOAP to a CICS SOAP client or a CICS SOAP service.  
IC46301  MC011057 when stopping Windows while MSCS controlled queue manager is still running.  
IC46407  Incorrect truncation of queue file during log full caused a damaged queue  
IC46433  MQ .net classes need +inq authority to get dynamic queue name  
IC46530  AMQ2018 .net mqbegin  
IC46539  .net dotnet dynamic queue name mqrc_dynamic_q_name_error 2011 accessqueue  
IC46548  Mqrc_options_error is returned if qpmo_alternate_user_authority is specified with mqqueuemanager.put() call.  
IC46653  Mqqueuemanager constructor does a connect to the QM, but it does not check to see if it is connected already before reconnecting.  
IC46666  FDCs while deleting a stopped channel  
IC46698  Local cluster queue not listed in display qcluster, or in WMQ explorer, after refresh cluster(clname) repos(yes)  
IC46766  Putdatetime property in 'mqmessage' MQ .net class is read only and cannot be altered or set.  
IC46774  MMC shows incorrect status information, overlapping/multiple amqmsrvn processes  
IC46920  Windows, information center fails to start.  
IC46955  Various setmqscp problems  
IC46965  Was V6 connectionfactories not bound with WMQ V6  
IC46987  Coinitialize failure rpc_e_changed_mode (-2147417850) with MSCS  
IC47013  Traps or XC130031 with any MQ call stack but O/S stack dump shows xcssynchronizecountertime.  
IC47032  JMS client support for the pgm multicast protocol  
IC47044  SSL authentication with JMS realtime node fails from JMS client when JDK1.4.2 is used at client side with legalargumentexception  
IC47181  Problem in linking COBOL programs with Visual Age COBOL compiler on Windows with MQ 6.0  
IC47224  Multiple poolscavenger threads created when using either WebSphere application server version 5.X or WebSphere MQ.  
IC47236  Mqrc_context_handle_error (2097 error)- when pass_all_context option is used with Java distribution list.  
IC47275  Message browser fails to show list of MQ messages for a queue.  
IC47332  MQ client AMQ9691 error when trying to add a certificate using amqmcert -a when the certificate is already present in the store  
IC47343  Mqerrorpath variable no longer sets path for error logs in WebSphere MQ V6.0  
IC47443  JMS messageproducer memory leak.  
IC47447  Messages not acknowledged when using connectionbrowsers with auto_ack or dups_ok sessions  
IY60843  Message selectors that contain the <> (not equal) operator do not work.  
IY66331  When mqpmo.recspresent is set in a client application MQPUT1 fails with 2154 (mqrc_recs_present_error).  
IY69753  Hang in xihquerythreadentry on Solaris  
IY70366  FDC ZD008040 from zdmopendeferredq when starting queue manager  
IY70415  Incorrect message may be returned to MQGet with mqgmo_wait and mqgmo_msg_under_cursor  
IY71004  MQ logs fill up and the queue manager will not restart after many AO084010 FDCs.  
IY71204  Damaged temporary dynamic queue inadvertently added to pool of reusable queues at startup  
IY71335  Channel remains in stopping status after stop channel mode (terminate) command has been issued  
IY72218  Mqdisc fails with mqrc_hconn_error (2018 0X7E2) and the CICS application returns abnormal termination U8035.  
IY72519  AMQ9652 error message generated incorrectly when the cryptographic store / key repository password has expired.  
IY72714  Failure of LDAP server providing O/S user identification data to WMQ through the getgrent interface observed on Solaris  
IY72844  Queue manager cannot restart: FDC with probe ID HL083114.  
IY72981  Truncated was FDCs (probes ZF178* to ZF216*)  
IY73045  Set logbufferpages default to 128, as documented (was 64)  
IY73062  Selector ignored on connectionconsumer  
IY73149  Japanese message catalog contains invalid DBCS characters also Traditional Chinese - AIX only  
IY73202  Software may get permissions failure reading amqcap.inf file  
IY73543  Probe XC307004 FDC from xlsrequestmutex (Linux only)  
IY73548  MQ may write a XC130003 FDC under zcpqueryterminus when using XA  
IY73907  FDC with probe XC006001 from xcsfreemem from the repository manager process.  
IY74045  RM409000 FFST from rriwaitsecondary  
IY74094  User-level object type name inconsistencies  
IY74339  Sigbus/sigsegv in kqiinquirequeuehandlestatus  
IY74420  MQ hang following pthread_cancel on Solaris. xcskillthread, xlslockmutexfn mcatype(thread)  
IY74705  Queue managers hang when using event messages and may see probe XY337080 from xlllonglockrequest.  
IY74818  WMQ not rolling back a transaction after XA calls return xaer_nota  
IY74915  Performance impact on AIX when an API exit is invoked  
IY75237  Migrating queue manager to WMQV6 sets MCA type of all the channels to process.  
IY75252  MQ commands fail if the MQ files path is specified at the end of the path string and is not terminated with a colon (:).  
IY75467  WMQ broker dies with probe XC130003 FDC in function faiadderrortag  
IY75589  MQJMS1061: unable to deserialize object message due to java.lang.classnotfoundexception when using WebSphere MQ  
IY75854  Publishing applications are delayed by up to 60 seconds when publishing. Error 2033 msg_not_available return code is seen.  
IY76101  Sigsegv in amqfcxba after MQRFH2 message sent to system.broker.control.queue  
IY76118  MQ explorer V6.0, broker V6.0, empty object tables, Linux.  
IY76314  XA client ending abruptly leaves outstanding units of work locked until they span the active log, when they are backed out.  
IY76712  Unpredictable results when the topic associated with a durable subscription changes when using the MQ broker.  
IY76799  Failed call to getpeername leaks file descriptor in amqrmppa  
IY77233  Object catalog corruption during resource exhaustion  
IY77282  Version 6 channels do not trigger start  
IY79428  Probe XC307010 when attempting to raise a COD report message in response to a transactional MQGet on MQ V6.  
SE19791  MQM400 channel pair remains in retrying/binding status after a network change and does not recover. 
SE20571  MQM400- AMQ6993 message incorrectly generated when endmqm submitted in batch  
SE21176  MQM400 wrkmqm generates MSGMCH6902 and MSGCZM1212 when more than nine queue managers to be displayed  
SE21259  MQM400 security audit journal entries produced when using OS V5R3 with user *USER with no special authorities 
SE21565  MQM400 - verifying MQ V6 using chkprdopt returns error CPF0C20 with programs AMQI0XRL, AMQI0XVL, AMQI0X1L.  
IY89814  Defects fix in WebSphere MQ refresh pack 6.0.1.0. (part 1 of 5)  
IY89954  Defects fixed in WebSphere MQ refresh pack 6.0.1.0. (part 2 of 5)  
IY89970  Defects fixed in WebSphere MQ refresh pack 6.0.1.0. (part 3 of 5)  
IY90006  Defects fixed in WebSphere MQ refresh pack 6.0.1.0. (part 4 of 5)  
IY90007  Defects fixed in WebSphere MQ refresh pack 6.0.1.0. (part 5 of 5)  


Recent and planned Fix Pack content summary  
    

To show the APARs flagged for the combination of fix pack and platform select the appropriate link. 

 Fix Pack 6.0.2.1 Fix pack 6.0.2.2 Fix pack 6.0.2.3 All for this platform 
Windows List List List List 
AIX List List List List 
HP-UX (PA-RISC) List List List List 
HP-UX (Itanium) List List List List 
Solaris (SPARC) List List List List 
Solaris (x86-64) List List List List 
Linux (x86) List List List List 
Linux (x86-64) List List List List 
Linux (zSeries) List List List List 
Linux (s390x) List List List List 
Linux (Power) List List List List 
iSeries List List List List 
     
All in Fix or Refresh Pack List List List  
     
* Note this Fix Pack has not been delivered. The APARs listed are planned content but this could change before the Fix Pack is made available.

Change history

Last modified: 30 October 2008 
30 October 2008: Added information on availability of WMQ v6.0.2.5 for iSeries. 
06 October 2008: Added information on availability and content for WMQ v6.0.2.5. 
26 June 2008: Added information on availability of WMQ v6.0.2.4 for iSeries. 
30 May 2008: Added information on availability and content for WMQ v6.0.2.4 
11 March 2008: Added information on the availability of WMQ v6.0.2.3 for iSeries. 
01 February 2008: Added information of availability and content for WMQ v6.0.2.3 
14 September 2007: Added information of availability of WMQ v6.0.2.2 for iSeries. 
22 August 2007: Added 6.0.2.2 availability and content. The next fix pack is planned to be 6.0.2.3. 
19 July 2007: Add target date for fix pack 6.0.2.3. Add cross reference table by fix pack and platform to the end of the document. 
20 April 2007: Updated availability date of 6.0.2.1 for iSeries 
28 March 2007: Add Fix pack 6.0.2.1 is available and content. The next fix pack is planned to be 6.0.2.2. 
27 February 2007: Modify fix pack 6.0.2.1 release date to 1Q2007 
07 November 2006: Add refresh pack 6.0.2.0 for iSeries 
13 October 2006: Add refresh pack 6.0.2.0. 
08 May 2006: Created fix list page. 
 
  
-------
Note:
-------


Recommended Fixes for WebSphere MQ
  
  
Abstract 
This page provides links to the latest available maintenance for the WebSphere MQ and MQSeries products. 
 
Content 
IBM WebSphere MQ Version 7.0
IBM WebSphere MQ Version 6.0
IBM WebSphere MQ Version 5.3 and 5.3.1
IBM MQSeries Link for R/3 V1.2
IBM MQSeries Version 5.1
IBM MQSeries for VSE
IBM WebSphere MQ ESE v6.0

A list of the planned release dates for future maintenance can be found at the following link.


Product & Version Latest Maintenance Pack Info. All Platform Downloads 
IBM WebSphere MQ Version 7.0 
AIX Fix Pack 7.0.0.1 Platform downloads 
HP-UX Itanium Fix Pack 7.0.0.1 Platform downloads 
HP-UX PA-RISC Fix Pack 7.0.0.1 Platform downloads 
i5/OS Fix Pack 7.0.0.1 - 
Linux on POWER Fix Pack 7.0.0.1 Platform downloads 
Linux on x86 Fix Pack 7.0.0.1 Platform downloads 
Linux on zSeries s390x Fix Pack 7.0.0.1 Platform downloads 
Linux x86-64 Fix Pack 7.0.0.1 Platform downloads 
Solaris SPARC Fix Pack 7.0.0.1 Platform downloads 
Solaris x86-64 Fix Pack 7.0.0.1 Platform downloads 
Windows Fix Pack 7.0.0.1 Platform downloads 
z/OS - - 

Note: The "Platform downloads" contain all Refresh Packs, Fix Packs and Interim Fixes for each platform. 
 

Product & Version Latest Maintenance Pack Info. All Platform Downloads 
IBM WebSphere MQ Version 6.0 
AIX Fix Pack 6.0.2.5 Platform downloads 
HP-UX Itanium Fix Pack 6.0.2.5 Platform downloads 
HP-UX PA-RISC Fix Pack 6.0.2.5 Platform downloads 
i5/OS Fix Pack 6.0.2.5 - 
Linux on POWER Fix Pack 6.0.2.5 Platform downloads 
Linux on x86 Fix Pack 6.0.2.5 Platform downloads 
Linux on zSeries Fix Pack 6.0.2.5 Platform downloads 
Linux on zSeries s390x Fix Pack 6.0.2.5 Platform downloads 
Linux x86-64 Fix Pack 6.0.2.5 Platform downloads 
Solaris SPARC Fix Pack 6.0.2.5 Platform downloads 
Solaris x86-64 Fix Pack 6.0.2.5 Platform downloads 
Windows Fix Pack 6.0.2.5 Platform downloads 
z/OS - - 

Note: The "Platform downloads" contain all Refresh Packs, Fix Packs and Interim Fixes for each platform. More information about V6 maintenance can be found in the Maintenance Strategy for V6.0 document.
 

Product & Version Latest Fix Pack Release Date Comments 
IBM WebSphere MQ Version 5.3
IBM WebSphere MQ Version 5.3.1
IBM WebSphere MQ Express Version 5.3 Download to all Fix Packs 
for Multiplatforms Fix Pack 14 Dec. 2007 
for i5/OS Fix Pack 14 Feb. 2008 
for z/OS -  -  
for OpenVMS - Alpha Fix Pack 14 Sep 2008 
for OpenVMS - Itanium Fix Pack 14 Sep 2008 
for HP NonStop Server V5.3.1.4 Oct 2008  
 

Product & Version Latest Fix Pack Release Date Comments 
IBM MQSeries Link for R/3 V1.2 CSD03 March 2005 Only available from your IBM Service Representative 
 

Product & Version Latest Fix Pack Release Date Comments 
IBM MQSeries Version 5.1  
MQSeries for Compaq NSK CSD03 Oct 2004 
MQSeries for Compaq OVMS Alpha CSD05 Feb 2006 
MQSeries for Compaq Tru64 UNIX CSD09 Mar 2003 
MQSeries for Sun Solaris on Intel CSD09 Aug 2002 
 

Product & Version Latest Fix Pack Release Date Comments 
IBM MQSeries for VSE Only available from your IBM Service Representative 
Version 2.1.0 - EOS 30 Sep 2005 - - 
Version 2.1.2 - - 
 

Change History
Last updated: 30 January 2009

11 November 2005: Added WebSphere MQ v6 Refresh Pack 1. 
16 November 2005: Added WebSphere MQ v6 for z/OS link. 
17 November 2005: Released WebSphere MQ v5.3 for OpenVMS Fix Pack 8. 
5 December 2005: Minor modifications, included V6 maintenance strategy link. 
25 January 2006: Released WebSphere MQ v5.3 Fix Pack 12. 
27 January 2006: Changed releases dates for WebSphere MQ 5.3 Fix Pack 12 to Jan 2006. 
16 February 2006: Released MQSeries for Compaq OVMS Alpha 5.1 Fix Pack 5. 
4 April 2006: Released MQSeries for Compaq OVMS Alpha/Itanium 5.3 Fix Pack 9. 
7 April 2006: Changed the recommended download for WebSphere MQ 5.3 for i5/OS from Fix Pack 12 to Fix Pack 11. 
13 April 2006: Changed the recommended download for WebSphere MQ 5.3 for i5/OS from Fix Pack 11 to Fix Pack 12. 
17 May 2006: Released 6.0.1.1. 
13 October 2006: Released 6.0.2.0 for Windows and UNIX 
17 October 2006: Released MQSeries for Compaq OVMS Alpha/Itanium 5.3 Fix Pack 10. 
27 October 2006: Released 6.0.2.0 for i5/OS. 
26 December 2006: Released 5.3.0.13. 
9 February 2007: Released 5.3.0.13 for i5/OS 
30 March 2007: Released WebSphere MQ v6.0.2.1 for AIX, Linux, Solaris and Windows 
30 March 2007: Released WebSphere MQ v5.3 FP 11 for Open VMS 
13 April 2007: Released WebSphere MQ v6.0.2.1 for HP-UX PA-RISC and HP-UX for Itanium 
20 April 2007: Released WebSphere MQ v6.0.2.1 for i5/OS 
8 May 2007: Removed reference to V2.2.1 products as these are no longer in service. 
23 August 2007: Released WebSphere MQ v6.0.2.2 for UNIX and Windows. 
24 August 2007: Released WebSphere MQ v5.3 FP12 for Open VMS 
14 September 2007: Released WebSphere MQ v6.0.2.2 for i5/OS 
8 October 2007: Released WebSphere MQ v5.3.1.1 for NSS 
28 December 2007: Released WebSphere MQ v5.3 FP14 for UNIX and Windows 
01 February 2008: Released WebSphere MQ v6.0.2.3 for AIX, Linux and Windows. 
07 February 2008: Released WebSphere MQ v6.0.2.3 for HPUX and Solaris. 
28 February 2008: Released WebSphere MQ v5.3. FP14 for i5/OS. 
06 March 2008: Released WebSphere MQ v5.3 FP13 for Open VMS. 
11 March 2008: Released WebSphere MQ v6.0.2.3 for i5/OS. 
03 April 2008: Released WebSphere MQ v5.3.1.2 for HP NonStop Server. 
30 May 2008: Released WebSphere MQ v6.0.2.4 for UNIX and Windows. 
20 June 2008: Released WebSphere MQ v6.0.2.4 for i5/OS. 
09 July 2008: Removed WebSphere MQ v5.2 & v5.2.1 details. 
10 July 2008: Released WebSphere MQ v5.3.1.3 for HP NonStop Server. 
11 September 2008: Released WebSphere MQ v5.3 FP14 for Open VMS. 
03 October 2008: Released WebSphere MQ v5.3.1.4 for HP NonStop Server. 
06 October 2008: Released Websphere MQ v6.0.2.5 for UNIX and Windows. 
30 October 2008: Released Websphere MQ v6.0.2.5 for i5/OS. 
20 January 2009: Released WebSphere MQ v7.0.0.1 for UNIX and Windows 
30 January 2009: Released Websphere MQ v7.0.0.1 for i5/OS. 
 
  
-------
Note:
-------

thread:

Q:

Hi, 

Presently one of my production server is having the MQ 6.0.2.0, We are planning to install latest FIX Pack MQ 6.2.0.5 

For installing the latest fix pack do i need to un-install and install the whole MQ or it's ok if i install directly MQ Fix pack 
 
A:


Short answer: NO, you do not have to uninstall - it's a FixPack - logic says it's there to fix something, 
which it can't do if that 'something' has been uninstalled.

 
-------
Note:
-------

Title: IBM WebSphere MQ Security Bypass Vulnerability
Severity: MODERATE
Description:
IBM WebSphere MQ is a commercially available messaging engine for enterprises.

IBM WebSphere MQ is prone to a security-bypass vulnerability because the application fails to properly restrict 
access to certain functionality. Specifically, attackers can access a queue manager through a SVRCONN (MQ client) channel 
even if it is secured with a security exit or 'mcauser'.

Attackers can exploit this issue to bypass certain security restrictions, connect to a queue manager in an unauthorized manner, 
and obtain potentially sensitive information; other attacks are also possible.

This issue affects versions prior to:

5.3 Fix Pack 14
6.0 Fix Pack 6.0.2.2


Affected Products:
IBM WebSphere MQ 5.3.0 
IBM WebSphere MQ 6 
References:


-------
Note:
-------


IBM WebSphere MQ Commands Local Privilege Escalation Issues  


Title : IBM WebSphere MQ Commands Local Privilege Escalation Issues
VUPEN ID : VUPEN/ADV-2009-0511
CVE ID : CVE-2009-0439
Rated as : Moderate Risk 
Remotely Exploitable : No
Locally Exploitable : Yes
Release Date : 2009-02-24

 
Technical Description            

Multiple vulnerabilities have been identified in IBM WebSphere MQ, which could be exploited by local attackers 
to gain elevated privileges. These issues are caused by unspecified errors in the command line tools "setmqaut", "dmpmqaut" and "dspmqaut", 
which could allow malicious users to execute arbitrary code with elevated privileges on a vulnerable UNIX system. 
No further details have been disclosed.

Affected Products

IBM WebSphere MQ 5.x
IBM WebSphere MQ 6.x
IBM WebSphere MQ 7.x 

Solution

IBM WebSphere MQ 6.0 - Apply the latest Fix Pack (6.0.2.6 or later) or APAR IZ40824 :
http://www-01.ibm.com/support/docview.wss?uid=swg24022268

IBM WebSphere MQ 7.0 - Apply the latest Fix Pack (7.0.0.2 or later) or APAR IZ40824 :
http://www-01.ibm.com/support/docview.wss?uid=swg27006037

References

http://www.vupen.com/english/advisories/2009/0511 
http://www-01.ibm.com/support/docview.wss?uid=swg21376107
http://xforce.iss.net/xforce/xfdb/48529

Credits

Vulnerability reported by the vendor.

ChangeLog

2009-02-24 : Initial release

 
-------
Note:
-------

thread:

Q:

Hello,

I work on EDI and do not have much experience in handling Java and MQ Errors.
I was working to configure JMS messaging and encountered the following error while running JMSAdmin.

$ ./JMSAdmin -v

5724-H72, 5655-L82, 5724-L26 (c) Copyright IBM Corp. 2002,2005. All Rights Reserved.
Starting Websphere MQ classes for Java(tm) Message Service Administration

Initializing JNDI Context...
INITIAL_CONTEXT_FACTORY: com.ibm.ejs.ns.jndi.CNInitialContextFactory
PROVIDER_URL: iiop://localhost:900/
Class specified by INITIAL_CONTEXT_FACTORY not found in CLASSPATH

Here is a snapshot of the JMSAdmin.config file
#INITIAL_CONTEXT_FACTORY=com.sun.jndi.ldap.LdapCtxFactory
#INITIAL_CONTEXT_FACTORY=com.sun.jndi.fscontext.RefFSContextFactory
INITIAL_CONTEXT_FACTORY=com.ibm.ejs.ns.jndi.CNInitialContextFactory
#INITIAL_CONTEXT_FACTORY=com.ibm.websphere.naming.WsnInitialContextFactory

#PROVIDER_URL=ldap://polaris/o=ibm,c=us
#PROVIDER_URL=file://localhost/
PROVIDER_URL=iiop://localhost:900/

Here is the content of the JMSAdmin file :
java -classpath /usr/mqm/java/lib/com.ibm.mqjms.jar -Djava.ext.dirs="$libpaths" -DMQJMS_LOG_DIR=$MQ_JAVA_INSTALL_PATH/log -DM
QJMS_TRACE_DIR=$MQ_JAVA_INSTALL_PATH/trace -DMQJMS_INSTALL_PATH=$MQ_JAVA_INSTALL_PATH com.ibm.mq.jms.admin.JMSAdmin $*


Can you please let me know if I am doing anything wrong, or what is needed to run the JMSAdmin utility in AIX ..


-------
Note:
-------

thread:

Q:

when i compiling my api exit, i got the error: 

ld: 0711-317 ERROR: Undefined symbol: .MQXEP 
ld: 0711-345 Use the -bloadmap or -bnoquiet option to obtain more information. 
collect2: ld returned 8 exit status 
make: 1254-004 The error code from the last command is 1. 

anybody can help me?


A:

You are missing the "-lmqmzf" 
in the line 

xlc -q64 -e MQStart -bE:mirrorq.exp -bM:SRE 
-o mirrorq mirrorq.c 
-I/usr/mqm/inc -L/usr/mqm/lib64 -lmqm -lmqmzf 


-------
Note:
-------

thread:

Q:

Hi, 
Can someone please confirm what is the deafult maximum disk queue size on AIX (MQ is v6.0.2.5) - I cannot find it documented anywhere. 
Also there seems to be little documentation on the impacts of increasing it via theDefaultQFileSize setting in qm.ini 
to a large value (eg. 2Gb) - can it be set to this size ? 

I ask because we had a Production problem today where messages were being written to the Dead Letter queue with a reason code 2056 
(Queue Space Not Available). All applications connected to the QMgr were reporting problems. The /var/mqm/ directory was only using 
18% disk space. I discovered that the actual queue refered to in the 2056 Dead Letter messages was holding 75,000 
for a system that had only reads the messages 3 times a day. The actual queue file in /var/mqm/qmgrs/QMGR/queues/queue_name was 1.07Gb, 
after clearing 20,000 messages from the queue the QMgr was OK and no more messages were going to the Dead Letter Queue. 

We're looking at getting the application to process the messages more often but it would be good to have a bigger buffer on the queue size if I can.


A:

AFAIK it's the same as the maximum size of a file on your OS/hardware combination. 

http://publib.boulder.ibm.com/infocenter/wmqv6/v6r0/topic/com.ibm.mq.amqaac.doc/aq10270_.htm 
Also there seems to be little documentation on the impacts of increasing it via theDefaultQFileSize setting in qm.ini 
to a large value (eg. 2Gb) - can it be set to this size ?[/quote] 

Yes - see here for details. IIRC you have to enable large file support in AIX as well; not entirely certain I'm right on that....  

Don't forget that no matter how much space you supply, the queue will consider itself full when it hits MaxDepth. 
If the queue's only read 3 times a day ensure this value is high enough. 

But of course you knew that. 

A:

for your problem, focus on the queue's Max Message Size and Max Depth parameters, 
as well as enabling the largefiles option on your server. 

DefaultQFileSize is only for performance tuning and trying to keep messages in memory versus spilling over to disk, 
for queues that are contantly being drained of new messages. That is not your problem. Going to disk is OK for you.

A:

Thanks for your help. I'll document the real problem here for reference. 

The operating system file limit was 1Gb (1048575 blocks * 1024 bytes)... 

[mqm@xxxxxx:/data/home/mqm]> ulimit -a 
core file size (blocks, -c) 1048575 
data seg size (kbytes, -d) 131072 
file size (blocks, -f) 1048575         <---
max memory size (kbytes, -m) 32768 
open files (-n) 2000 
pipe size (512 bytes, -p) 64 
stack size (kbytes, -s) 32768 
cpu time (seconds, -t) unlimited 
max user processes (-u) 128 
virtual memory (kbytes, -v) unlimited


-------
Note:
-------

arcticle:

In the AIX error log, there are errors with label AMQFFSTx, such as AMQFFST1 AMQFFST2 AMQFFST3 AMQFFST4
  
 Technote (troubleshooting) 
  
This document applies only to the following language version(s):
English 
  
Problem(Abstract) 
You see in the AIXr error log, errors with a label format of AMQFFSTx, such as AMQFFST1, AMQFFST3, AMQFFST4.  
  
Symptom 
An example is shown below:

LABEL: AMQFFST3
IDENTIFIER: 8xxxxxxx
Date/Time: Sun Dec 31 11:31:18 EST
Sequence Number: 3770
Machine Id: 00FFFFFF
Node Id: pppppdc
Class: S
Type: UNKN
Resource Name: MQSeries NONE
Description
SOFTWARE PROGRAM ABNORMALLY TERMINATED
Probable Causes
UNDETERMINED
Failure Causes
SOFTWARE PROGRAM
Recommended Actions
REVIEW DETAILED DATA
CONTACT APPROPRIATE SERVICE REPRESENTATIVE
Detail Data
DETECTING MODULE
xcsCloseEventSem
SOFTWARE ERROR CODE
0000 0001
FILE NAME
/var/mqm/errors/AMQ30656.0.FDC  
  
Cause 
Sometimes, WebSpherer MQ errors will cause the AIX operating system to put entries into the AIX errort log 
that have the label of AMQFFSTx, such as AMQFFST1, AMQFFST3, AMQFFST4.  
  
Resolving the problem 
For more details on the MQ problem, review the file mentioned at the end of the error entry in the AIX errpt, such as: 
FILE NAME
/var/mqm/errors/AMQ30656.0.FDC 


When an abnormal situation happens, MQ creates an FDC (First Data Capture) file using FFST (First Failure Support Technology) 
in /var/mqm/errors and it is important to view this FDC file to find out more information on the abnormal situation.

For more information on FDC files, see:

http://www-1.ibm.com/support/docview.wss?rs=171&uid=swg21176953
MustGather: Documentation required by the WebSphere MQ support team for an ABEND or FFST

http://www-1.ibm.com/support/docview.wss?rs=171&uid=swg21173468
MustGather: WebSphere MQ Dumps and FFSTs are written to the following locations

http://publib.boulder.ibm.com/infocenter/wmqv6/v6r0/topic/com.ibm.mq.amqzag.doc/fa15410_.htm
WebSphere MQ 6 Information Center
System Administration Guide
First-failure support technology (FFST) 
 

-------
Note:
-------

thread:

Q:

[MQ Series Error Code : 2059 AIX]
Asked by abrobit.roy
on 1/8/2007 6:04 AM
Hi All,

I am getting the shared memory issues on AIX Box. We come across the
below errors ::

Unable to connect to MQSeries Queue Manager 'QM_sdcb80a023' MQSeries
Error Code: 2059

Please check your MQSeries configuration and verify that the queue
manager is running. (SBL-EAI-04233)

We have tried the below workaround mentioned in the supportweb.

1. Shut down the queue manager(s) ./endmqm QM_sdcb80a023

2. Edit the file /var/mqm/mqs.ini. In the `QueueManager' stanza, for ea
ch
queue manager of interest, add an additional line for IPCCBaseAddress
explicitly specifying the shared memory segment to use. For example:

QueueManager:
Name=myQueueManager
Prefix=/var/mqm
Directory=myQueueManager
IPCCBaseAddress&H12

(We have tried with all the relevant shared memory segment)

3. Restart the queue manager(s).

4. To configure the AIX environment to run Siebel Server with less memory

setenv LDR_CNTRL MAXDATA=0x30000000

5. Restart Siebel Server

The error is still reproducible.

Could anyone assist me to resolve the issue.

Note :: QM is Running and we are able to get/put the messages.

Regards,


A:


-------
Note:
-------

Article:

HPUX HP-UX

Kernel parameters HP-UX for MQ:

Kernel configuration
WebSpherer MQ uses semaphores and shared memory. It is possible, therefore, that the default kernel configuration is not adequate.

Before installation, review the machine's configuration and increase the values if necessary. The minimum recommended values 
of the tunable kernel parameters are given in Figure 1. These values might need to be increased if you obtain 
any First Failure Support TechnologyT (FFSTT) records.

Note: 
On platforms earlier than HP-UX 11i v1.6 (11.22), if you intended to run a high number of concurrent connections 
to WebSphere MQ, you were required to configure the number of kernel timers (CALLOUTS) by altering the NCALLOUT kernel parameter. 
On HP-UX 11i v1.6 (11.22) platforms or later, the NCALLOUT parameter is obsolete as the kernel automatically adjusts the data structures. 
Semaphore and swap usage does not vary significantly with message rate or message persistence. 
WebSphere MQ queue managers are generally independent of each other. Therefore system tunable kernel parameters, 
for example shmmni, semmni, semmns, and semmnu need to allow for the number of queue managers in the system. 
See the HP-UX documentation for information about changing these values. 

Figure 1. Minimum recommended tunable kernel parameters values

   shmmax           536870912
   shmseg           1024
   shmmni           1024
   semaem           16384
   semvmx           32767
   semmns           16384
   semmni           1024 (semmni < semmns)
   semmnu           16384
   semume           256
   max_thread_proc  66
   maxfiles         10000
   maxfiles_lim     10000
   nfile            10000 

Note: For HP-UX 11.23 (11i V2) and later operating systems, the tunable kernel parameters: shmem, sema, semmap, and maxusers, are obsolete. 
This applies to the Itanium and PA-RISC platforms.
You must restart the system once you have made any changes to the tunable kernel parameters. 


System resource limits
You can set global limits for the size of process data segments and the size of process stack segments for the whole system 
by altering the tunable kernel parameters.
The tunable kernel parameters are: 
Parameter What it controls Recommended minimum value 
maxdsiz Maximum size of the data segment for 32-bit processes 1073741824 
maxdsiz_64bit Maximum size of the data segment for 64-bit processes 1073741824 
maxssiz Maximum size of the stack segment for 32-bit processes 8388608 
maxssiz_64bit Maximum size of the stack segment for 64-bit processes 8388608 

If other software on the same machine recommends higher values, then the operation of WebSphere MQ will not 
be adversely affected if those higher values are used.
For the full documentation for these parameters see the HP-UX product documentation.

To apply the settings to an HP-UX 11i system which has the System Administration Manager (SAM) utility, 
you can use SAM to achieve the following steps:
Select and alter the parameters 
Process the new kernel 
Apply the changes and restart the system 
It is possible that other releases of HP-UX provide different facilities to set the tunable kernel parameters. If so, then please 
consult your HP-UX product documentation for the relevant information.

The ulimit shell command
On a per-shell basis the available limits can be tuned down from the values stored for the System resource limits parameters above. 
Use the ulimit shell command to tune the values of the parameters with a combination of the following switches:
Switch Meaning 
-H The hard limit 
-S The soft limit 
-d The data segments size 
-s The stack segment size 

Verifying that the kernel settings are applied
To verify that the resource limits have not been lowered by a ulimit command and that the queue manager will experience 
the correct limits, go to the shell from which the queue manager will be started and enter:

ulimit -Ha
ulimit -SaAmongst the console output you should see:
data(kbytes)   1048576
stack(kbytes)  8192If lower numbers are returned, then a ulimit command has been issued in the current shell to lower the limits. 
You should consult with your system administrator to resolve the issue.


-------
Note:
-------

Article:

Kernel parameters Solaris and MQ (1):


Resource limit configuration
Configure Solaris systems with the resource limits required by WebSpherer MQ.

WebSphere MQ uses semaphores, shared memory, and file descriptors, and it is probable that the default resource limits 
are not adequate.

The configuration required by WebSphere MQ depends on the version of Solaris you are using.

>> If you are using Solaris 10:

You must change the default resource limits for each zone WebSphere MQ will be installed in. 
To set new default limits for all users in the mqm group, set up a project for the mqm group in each zone.

To find out if you already have a project for the mqm group, log in as root and enter the following command:
projects -lIf you do not already have a group.mqm project defined, enter the following command:

projadd -c "WebSphere MQ default settings" 
        -K "process.max-file-descriptor=(basic,10000,deny)" 
        -K "project.max-shm-memory=(priv,4GB,deny)"
        -K "project.max-shm-ids=(priv,1024,deny)" 
        -K "project.max-sem-ids=(priv,1024,deny)" group.mqm

If a project called group.mqm is listed, review the attributes for that project. 
The attributes must include the following minimum values:


process.max-file-descriptor=(basic,10000,deny)
project.max-sem-ids=(priv,1024,deny)
project.max-shm-ids=(priv,1024,deny)"
project.max-shm-memory=(priv,4294967296,deny)

If you need to change any of these values, enter the following command:

projmod -s -K "process.max-file-descriptor=(basic,10000,deny)" 
           -K "project.max-shm-memory=(priv,4GB,deny)" 
           -K "project.max-shm-ids=(priv,1024,deny)"
           -K "project.max-sem-ids=(priv,1024,deny)" group.mqm

Note that you can omit any attributes from this command that are already correct.
For example, to change only the number of file descriptors, enter the following command: 
projmod -s -K "process.max-file-descriptor=(basic,10000,deny)" group.mqm
(To set only the limits for starting the queue manager under the mqm user, 
login as mqm and enter the command projects. The first listed project is likely to be default, 
and so you can use default instead of group.mqm, with the projmod command.) 

You can find out what the file descriptor limits for the current project are, by compiling and running the following program:
#include <sys/types.h>
#include <sys/stat.h>
#include <fcntl.h>
#include <stdio.h>

int main () {
  int fd;
  for (;;) {
    fd = open ("./tryfd", O_RDONLY);
    printf ("fd is %d\n", fd);
    if (fd == -1)  break;
  }
}

To ensure that the attributes for the project group.mqm are used by a user session when running Websphere MQ, 
make sure that the primary group of that user ID is mqm. In the above examples, the group.mqm project ID will be used. 
For further information on how projects are associated with user sessions, see Sun's System Administration Guide: 
Solaris Containers-Resource Management and Solaris Zones for your release of Solaris.

>>> If you are using Solaris 8 or Solaris 9:

Review the system's current resource limit configuration.

As the root user, load the relevant kernel modules into the running system by typing the following commands:
modload -p sys/msgsys
modload -p sys/shmsys
modload -p sys/semsysThen display your current settings by typing the following command:

sysdef

Check that the following parameters are set to the minimum values required by WebSphere MQ, or higher. 
The minimum values required by WebSphere MQ are documented in the tables below.

Table 1. Minimum values for semaphores required by WebSphere MQ Parameter Minimum value 
SEMMNI 1024 
SEMAEM 16384 
SEMVMX 32767 
SEMMNS 16384 
SEMMSL 100 
SEMOPM 100 
SEMMNU 16384 
SEMUME 256 

Table 2. Minimum values for shared memory required by WebSphere MQ Parameter Minimum value 
SHMMAX 4294967295 
SHMMNI 1024 
SHMSEG (Solaris 8 only) 1024 

Table 3. Minimum values for file descriptors required by WebSphere MQ Parameter Minimum value 
rlim_fd_cur 10000 
rlim_fd_max 10000 

To change any parameters that are lower than the minimum value required by WebSphere MQ, edit your 
/etc/system file to include the relevant lines from the following list:

set shmsys:shminfo_shmmax=4294967295
set shmsys:shminfo_shmmni=1024
set semsys:seminfo_semmni=1024
set semsys:seminfo_semaem=16384
set semsys:seminfo_semvmx=32767
set semsys:seminfo_semmns=16384
set semsys:seminfo_semmsl=100
set semsys:seminfo_semopm=100
set semsys:seminfo_semmnu=16384
set semsys:seminfo_semume=256
set shmsys:shminfo_shmseg=1024
set rlim_fd_cur=10000
set rlim_fd_max=10000Note: 

These values are suitable for running WebSphere MQ, other products on the system might require higher values. 
Do not change the value of shmmin from the system default value. 
Semaphore and swap usage does not vary significantly with message rate or persistence. 
WebSphere MQ queue managers are generally independent of each other. Therefore system kernel parameters, 
for example shmmni, semmni, semmns, and semmnu need to allow for the number of queue managers in the system. 
After saving the /etc/system file, you must reboot your system


-------
Note:
-------

Article:

Kernel parameters Solaris and MQ (2):


A lot of people in the past have run into this problem, namely, the kernel configuration on solaris is not sufficient 
to run MQSeries. Add the following values to the /etc/system file and be sure to reboot. 
Make sure you do all of this before creating a queue manager on the box. 

/etc/system 
set shmsys:shminfo_shmmax = 4294967295 
set shmsys:shminfo_shmseg = 1024 
set shmsys:shminfo_shmmni = 1024 
set semsys:seminfo_semaem = 16384 
set shmsys:shminfo_semmni = 1024 
set semsys:seminfo_semmap = 1026 
set semsys:seminfo_semmns = 16384 
set semsys:seminfo_semmsl = 100 
set semsys:seminfo_semopm = 100 
set semsys:seminfo_semmnu = 2048 
set semsys:seminfo_semume = 256 
set msgsys:msginfo_msgmap = 1026 
set msgsys:msginfo_msgmax = 4096 

_________________

 The recommendation on /etc/system is in the Quick Start guide, but is critical, and worth reiterating. 
However, the values given are suitable for one queue manager which is not too busy. In environments with multiple 
queue managers, or where queue managers are busy, you may need to increase these values. IBM has not been 
able to provide good guidelines (to my knowledge) on how to tune these values. The manual also completely 
leaves out any mention of two importamt parameters. 

set rlim_fd_max = 1024 
set rlim_fd_cur = 1024 

Prior to solaris 2.7, the maximum values for these was 1024. It is now maxint at 2.7 and above. 
IBM has records on IBMLink referencing these parameters as fixes to various reported problems, 
but they don't seem to have made it into the manuals yet. 

The other thing to note is that the values listed allow for MQSeries usage of shared memory and semaphores, 
but if you have other users of these resources (such as BMC Patrol) then these values may not be sufficient. 


thanks for the additional info. The point you made about other services using these same resources is important, 
because some products, require even more of these resources than MQSeries. For instance, on one sun ultra we had, 
we were running MQSeries without a problem (after updating /etc/system according to the quick beginnings guide) 
and then installed Oracle and everything broke. Turns out that Oracle needs even more of certain resources than MQSeries, 
so we had to go back and increase values in /etc/system beyond what we required for MQSeries... 


This might help - it's from IBM: 

Solaris Kernel Parameters for MQ 
MQSeries makes extensive use of IPC (Inter-Process Communication) resources, 
including shared memory, semaphores, and message queues (the IPC kind). Many 
Solaris systems will require some adjustment of the kernel parameters which 
govern these resources in order to able to run MQSeries comfortably, or to 
support heavily-used MQSeries installations. Indications that MQSeries lacks 
enough IPC resources may be an inability to start MQSeries, or difficulty in 
running many MQSeries programs concurrently. Furthermore, MQSeries may 
generate FDC files in /var/mqm/errors which contain error messages from 
IPC-related functions like semget, shmget, or shmat. 
In order to make more IPC resources available to MQSeries, it is necessary 
to modify the kernel parameters on your machine using facilities like 
configure and idtune. Use the values given in this note in preference over 
those listed in the MQSeries Quick Beginnings for Solaris book. In cases 
where this note mentions new parameters, or overlooks some listed in the 
Quick Beginnings book, again give preference to this note. For more 
information on modifying your kernel, refer to your Solaris documentation or 
contact Solaris support. 
We strongly urge you to save your current kernel configuration before trying 
to make any changes. When you make changes, realize that other programs 
(databases, for example) which make much use of IPC resources may force you 
to modify these parameters so that both MQSeries and those programs will 
run. The values msgmax, msgmnb, msgssz, semaem, semume, semvmx, shmmax, and 
shmseg should not in general require augmentation if you are running 
databases or other IPC-intensive programs. The values msgmap, msgmni, 
msgseg, msgtql, semmap, semmni, semmns, semmnu, and shmmni may require 
augmentation depending on the other programs running on the system. Refer to 
the meaning of each parameter listed below and other vendors' instructions 
to help you with that determination. 
In general, the values that follow are only policing values. In other words, 
they can usually be over-allocated without causing harm to your system. This 
means that if your existing programs are not already running up against the 
limits you have specified, they will not use more kernel resources after 
modifying your kernel parameters. 
Note: The parameters for Shared Memory and Semaphores tend to be more 
important than the parameters for Message Queues. 
Note: The correct way of getting kernel parameters on a Solaris box is with 
the 
following command... 
/usr/sbin/sysdef -i > kernel.txt 
Please do not ask for the /etc/system file from customers to get kernel 
parameters. This file is used to set/tune kernel parameters and the 
parameters in this file will not be picked up until the box is rebooted. So 
getting the /etc/system file from the customer box might not give you the 
correct information as to what kernel parameters the operating system is 
actually running with. 
These are the recommended MINIMUM parameters... 
________________________________________________________ 

IPC Message Queue Parameters 
_____________________________________________ 

mesg 1 This should not be changed, and is generally hard-coded. 

msgmap 1026 This is the number of entries in the kernel's 
message map table. This value should equal msgtql+2, and is should always be 
less than msgseg. A value roughly half of msgseg should be good. 
msgmax 4096 This is the maximum size of a single message in bytes. 

msgmnb 4096 This is the maximum number of bytes that all the 
messages on a single message queue can occupy. 
msgmni 50 This is the maximum number of message queues 
allowed on the system at any time. 

msgseg 2048 This is the number of memory segments allocated 
by the kernel at system startup to hold messages. Each system will have a 
limit on the total memory allocated (msgseg*msgssz), often 128KB. 
msgssz 8 This is the size in bytes of the memory segments 
used for storing messages. Valid values must 
be multiples of 4. 
msgtql 1024 This is the number of system messages headers 
which the kernel can store, which is effectively 
the maximum number of unread messages at any time. 

_____________________________________________ 
IPC Semaphore Parameters 
_____________________________________________ 

sema 1 This should not be changed, and is generally hard-coded. 

semaem 16384 This is the maximum adjust-on-exit value for a 
semaphore. It must be less than or equal to semvmx. 
semmap 1026 This is the size of the kernel's map of 
semaphore sets. This value should equal but never exceed semmni+2. 
semmni 1024 This is the maximum number of semaphore sets 
that can exist on the system at any time. 
HP-UX hard-codes the number of semaphores per 
set (semmsl on Solaris systems) to 500, so if 
only "full" semaphore sets are going to be 
allocated, this value should be roughly 
semmns/500. MQSeries generally allocates 64 
semaphores per set, so if most of the 
semaphore usage on the system is due to 
MQSeries, a more ideal number would be semmns/64. 

semmns 32767 This is the maximum number of semaphores in 
the system. A value of 16384 will generally work for a small MQSeries 
installation, but setting it to 32767 is advisable for larger systems. 
semmnu 2048 This is the number of semaphore undo 
structures allocated by the system. It must be less than or equal to 
nproc-4. 
semmsl 128 This is the maximum number of semaphores per 
semaphore set 
semopm 128 This is the maximum number of semaphore 
operations that can be done by one semop() call. If this is set to semmsl, 
one semop() call can operate on every semaphore in a semaphore set, although 
MQSeries does not require this. 
semume 256 This is the number of semaphore undo entries 
for each process. 
semvmx 32767 This is the maximum value that a semaphore can have. 

_____________________________________________ 
IPC Shared Memory Parameters 
_____________________________________________ 

shmem 1 This should not be changed, and is generally 
hard-coded. 

shmmax 4194304 This is the maximum size in bytes of a shared 
memory segment. 
shmmni 1024 This is the maximum number of shared memory 
segments that can exist on the system at any time. 1024 is the maximum on 
many systems. 
shmseg 1024 This is the maximum number of shared memory 
segments that a single process can have at any time. It should always be 
less than or equal to shmmni. 

_____________________________________________ 
Miscellaneous Parameters 
____________________________________________ 

maxusers 32 This controls the number of users which can 
log in to the system. More importantly, it 
controls other system values which limit the 
number of processes that can run at once. 
... 
Rather than changing maxusers, we would recommend that you alter the nproc 
and maxuprc values as follows: 
nproc: The maximum number of processes on the system. 
1 for each non-MQSeries process on the system PLUS 
3 for each MQSeries queue manager (strmqm) PLUS 
2 for each MQSeries receiver or svrconn channel PLUS 1 for each MQSeries 
sender channel PLUS 1 for each other MQSeries process (runmqtrm, etc.) 

maxuprc: The maximum number of processes for a single user. 
1 for each non-MQSeries process run by 'mqm' PLUS 3 for each MQSeries queue 
manager (strmqm) PLUS 2 for each MQSeries receiver or svrconn channel 
PLUS 1 for each MQSeries sender channel PLUS 1 for each other MQSeries 
process (runmqtrm, etc.) 

Users of Sun Solaris 2.5.1 or better may wish to verify that they are not in 
fact using more than 25% of their kernel resources for semaphore structures. 
In order to calculate this in bytes, use the formula given below. Also, if 
you are letting the kernel determine nproc for you, you can find this value 
by typing 'sysdef | grep v_proc': 
kernel_memory = semmns * 16 + 
nproc * 16 + 
semmni * 92 + 
semmnu * ((semume + 1) * 16) * 4 
Solaris 2.5.1 users must also be certain that they are not using more than 
25% of their kernel resources for shared memory structures. In order to 
calculate this in bytes, use the formula given below: 
kernel_memory = shmmni * 120 
Of course, simply calculating the bytes needed for shared memory and 
semaphore structures is not terribly useful if you don't know what the 
overall kernel resources are. Kernel memory is limited by your kernel 
architecture as well as by your available RAM. Type 'uname -m' to see what 
your kernel architecture is. The maximum kernel memory that common Sun 
architectures can use today is given below: 
Kernel Resources Machines 

Sun4m 256 MB ------ 

Sun4d 576 MB SS1000, SC2000 

Sun4u 4 GB UltraSPARC 

 
Just a minor thing - BMC Patrol does not use IPCS stuff DIRECTLY - it consumes IPC resources via. 
the MMA clients it uses to communicate with MQ. 
 
Please find the following link for docs on "Solaris Tunable Parameters Ref." Man." 

http://docs.sun.com/ab2/coll.707.1/SOLTUNEPARAMREF/@Ab2TocView?Ab2Lang=C&Ab2Enc=iso-8859-1 
 
excuse my ignorance, but I presume the recommended kernel settings are only for MQSeries Server, 
and not applicable to Client only installation? 
 
Excuse my ignorance as well but I'd love to have "andystone"'s previous point clarified if possible. 
It's not stated clearly in any of the IBM documentation whether it's necessary to alter kernel parameters 
when performing a Client-only installation on Solaris ... in fact the documentation gives the impression that 
these kernel changes are associated with the Server installation only. However when installing the 
Client product on Sloaris I received the following : 
=============================================== 
Checking kernel configuration... 

33554432 max shared memory segment size (SHMMAX) 
24 max attached shm segments per process (SHMSEG) 
100 shared memory identifiers (SHMMNI) 
300 semaphore identifiers (SEMMNI) 
300 semaphores in system (SEMMNS) 
25 max semaphores per id (SEMMSL) 
10 max operations per semop call (SEMOPM) 
600 undo structures in system (SEMMNU) 
10 max undo entries per process (SEMUME) 
2048 max message size (MSGMAX) 

ADVISORY WARNING - You may need to alter the kernel parameters listed above to run WebSphere MQ. 
See the Quick Beginnings manual for more information. 
=============================================== 

Does anyone have a definite yes or no response to this query? 
In addition, this is my first posting. I'd like to say that I've found this site to be of invaluable assistance 
over the past 3 months as I've begun to explore the world of MQ. 

I could n't see these values in my sysdef file.
Where I could find these values?. 
set rlim_fd_max = 1024 
set rlim_fd_cur = 1024 


Sysdef is the utility to display the system settings. You should put the rlim_fd parameters in the file /etc/system . 

For some reason sysdef doesn't display these params, but if use ulimit -a you should see: 

nofiles(descriptors) 1024 


Just an update regarding my previously posted question. 
The answer is, that you must change the kernel parameters only if you 
install a MQ Server. If you install a MQ Client you can ignore the message 
that you should change the kernel configuration. 
Many thanks to Carsten Scheunemann for emailing me this information which he received from IBM support. 
 
For some recent information on MQ Solaris kernel changes, in particular msgmap, msgseg, msgssz and msgtql see: 

http://www-1.ibm.com/support/docview.wss?rs=172&context=SW900&q1=msgseg&uid=swg21116351&loc=en_US&cs=utf-8&lang=en 

- you said about 'Solaris Kernel Parameters for MQ ': 
"it's from IBM: " 
- is it an official answer from IBM support or some information documented somewhere? 
thanks in advance 
 
 
-------
Note:
-------

Article:

Kernel parameters Linux and MQ:


Kernel configuration
WebSpherer MQ makes use of System V IPC resources, in particular shared memory and semaphores. The default configuration 
of these resources, supplied with your installation, is probably adequate for WebSphere MQ but if you have 
a large number of queues or connected applications, you might need to increase this configuration.

You can determine the amount of System V IPC resources available by looking at the contents of the following files: 

  /proc/sys/kernel/shmmax - The maximum size of a shared memory segment.
  /proc/sys/kernel/shmmni - The maximum number of shared memory segments.
  /proc/sys/kernel/shmall - The maximum amount of shared memory that can be allocated.
  /proc/sys/kernel/sem    - The maximum number and size of semaphore sets  that can be allocated.

For example, to view the maximum size of a shared memory segment that can be created enter: 

  cat /proc/sys/kernel/shmmax

To change the maximum size of a shared memory segment to 256 MB enter: 

  echo 268435456 > /proc/sys/kernel/shmmax

To view the maximum number of semaphores and semaphore sets which can be created enter: 

cat /proc/sys/kernel/sem

This returns 4 numbers indicating:
 SEMMSL - The maximum number of semaphores in a sempahore set
 SEMMNS - The maximum number of sempahores in the system
 SEMOPM - The maximum number of operations in a single semop call
 SEMMNI - The maximum number of sempahore sets   

For WebSphere MQ: 
the SEMMSL value must be 128 or greater 
the SEMOPM value must be 5 or greater 
the SEMMNS value must be 16384 or greater 
the SEMMNI value must be 1024 or greater 

To increase the maximum number of semaphores available to WebSphere MQ, you should update the SEMMNS and SEMMNI values.
To configure these values every time the machine is restarted you are recommended to add these commands to a startup script in /etc/rc.d/...


-------
Note:
-------

Article:

AIX requirements and MQ:


Checking the operating environment
Before you install WebSpherer MQ Version 6.0, you must check that your system meets the hardware and 
operating system software requirements set for this product and the particular components you intend to install on it. 

Note: WebSphere MQ does not support host names that contain spaces. If you install WebSphere MQ on a computer 
with a host name that contains spaces, you will be unable to create any queue managers.

Hardware
WebSphere MQ for AIXr, Version 6.0 runs on any machine that supports the AIX5L V5.2 or AIX5L V5.3 operating systems 
capable of running 64-bit programs whether from IBMr or other vendors.

Operating System
The operating systems supported by WebSphere MQ for AIX, Version 6.0 are:
AIX5L V5.2 (plus maintenance Level 3) 
AIX5L V5.3 

Use the oslevel -r command to determine the level of the operating system you are running, including the maintenance level.

Connectivity Requirements
Check that the system has 64-bit compatible communications hardware that supports at least one of the following:
TCP/IP 
SNA LU6.2: If you want to use the SNA LU6.2 support on WebSphere MQ you need the IBM Communications Server for AIX Version 6.1. 
UDP is no longer supported, existing channels should either be deleted or migrated to one of the supported protocols listed above. 
To migrate UDP channels to an alternative protocol alter the channel TRPTYPE attribute. 
For information about this channel attribute see the Intercommunication book.

Storage Requirements
The storage requirements for the WebSphere MQ for AIX, Version 6.0 depend on which components you install, 
and how much working space you need. This, in turn, depends on the number of queues that you use, 
the number and size of the messages on the queues, and whether the messages are persistent. You also require 
archiving capacity on disk, tape or other media. The approximate amount of storage space required for a server installation 
is detailed in the table below. 

Table 1. Storage requirements for a WebSphere MQ server Storage Requirements Storage Requirement in MB in /opt 
WebSphere MQ Server installation 325 

You can use the df command to determine the amount of free space on your system.
Disk storage is also required for
Prerequisite software 
Optional software 
Your application programs 

File descriptors
When running a multi-threaded process such as the agent process, you might reach the soft limit for file descriptors. 
This gives you the WebSphere MQ reason code MQRC_UNEXPECTED_ERROR (2195) and, if there are enough file descriptors, 
a WebSphere MQ FFSTT file.
To avoid this problem, you can increase the process limit for the number of file descriptors. To do this, alter the nofiles attribute 
in /etc/security/limits to 10,000 for the mqm user id or in the default stanza. For information about the mqm user id see, 
Setting up the user ID and group ID.

System Resource Limits
Set the system resource limit for data segment and stack segment to unlimited using the following commands in a command prompt:
ulimit -d unlimited
ulimit -s unlimited


-------
Note:
-------

article:

IY92929: Logger process amqhasmx (amqzmuc0 in V6) could eventually run out of file descriptors 
some time after a disk full condition
  

 Fixes are available 
WebSphere MQ V5.3 and WebSphere MQ Express V5.3 - Fix Pack 14 (CSD14)
WebSphere MQ V5.3 for iSeries - Fix Pack 14 (CSD14)
WebSphere MQ V6.0 Fix Pack 6.0.2.2
WebSphere MQ V6.0 for iSeries Fix Pack 6.0.2.2
WebSphere MQ 5.3 for HP OpenVMS Alpha and Itanium - Fix Pack 14


APAR status
Closed as program error.

Error description 
If the disk holding the log files for a queue manager becomes
full, then the log formatting process amqharmx (amqzmuc0 in V6)
may (correctly) report probe HL062054, HL049110 and HL062054
FDCs (erorr code hrcE_MQLO_DISK), with FDCs also from the
logger process amqhasmx (amqzmuc0 in V6).

This condition may introduce a permanent state within the
logger whereby log files need to be renamed. Each rename leaks
a file descriptor, leading to eventual file descriptor
exhaustion within the logger, which will get reported via a
large variety of FDCs, depending on where it is encountered.

Note that log files do not normally need to be renamed.

A workaround is to ensure that the queue manager is recycled if
the log formatter ever reports disk full (hrcE_MQLO_DISK) FDCs.
Local fix 

Recycled asap if the logger ever reports lack of disk space (the
hrcE_MQLO_DISK error code).

Problem summary 
****************************************************************
USERS AFFECTED:
Users running out of disk space during log formatting leading
to hrcE_MQLO_DISK FDCs from the amqharmx (amqzmuc0 in V6)
process. But only if the queue manager is not recycled after
such an event.

Platforms affected:
All Unix

****************************************************************
PROBLEM SUMMARY:
File descriptor leak in the little-used log file renaming
function.
Problem conclusion 
Ensured that the log file renaming function does not leak any
file descriptors.

---------------------------------------------------------------
The fix is targeted for delivery in the following PTFs:

                   v5.3
Platform           Fix Pack 14
--------           --------------------
AIX                U808477
HP-UX (PA-RISC)    U808478
Solaris (SPARC)    U808480
Linux (x86)        U808481
Linux (zSeries)    U808483

                   v6.0
Platform           Fix Pack 6.0.2.2
--------           --------------------
AIX                U809895
HP-UX (PA-RISC)    U809898
HP-UX (Itanium)    U810084
Solaris (SPARC)    U809913
Solaris (x86-64)   U810362
Linux (x86)        U809950
Linux (x86-64)     U810178
Linux (zSeries)    U810081
Linux (Power)      U810083
Linux (s390x)      U810110

The latest available maintenance can be obtained from
'WebSphere MQ Recommended Fixes'
http://www-1.ibm.com/support/docview.wss?rs=171&uid=swg27006037

If the maintenance level is not yet available, information on
its planned availability can be found in 'WebSphere MQ
Planned Maintenance Release Dates'
http://www-1.ibm.com/support/docview.wss?rs=171&uid=swg27006309
---------------------------------------------------------------
Temporary fix 
Comments 
APAR information 
APAR number IY92929 
Reported component name WEBS MQ FOR SUN 
Reported component ID 5724B4103 
Reported release 530 
Status CLOSED PER 
PE NoPE 
HIPER NoHIPER 
Special Attention NoSpecatt 
Submitted date 2006-12-18 
Closed date 2007-01-03 
Last modified date 2007-07-27 

APAR is sysrouted FROM one or more of the following:

APAR is sysrouted TO one or more of the following:

Modules/Macros

Publications Referenced


Fix information 
Fixed component name WEBS MQ FOR SUN 
Fixed component ID 5724B4103 

Applicable component levels 
R530 PSY    UP 
 

-------
Note:
-------

article:

IC56662: HANG IN AMQZMUC0 PROCESS FOLLOWING THE 'DISK NOT READY' ERROR
  

 A fix is available 
WebSphere MQ V6.0 Fix Pack 6.0.2.5

 
APAR status
Closed as program error.

Error description:
 
When customer attempts to initiate a failover from node1 to
node2 in an MSCS environment by failing the shared disk
resource, the resource manager issues an 'endmqm' to terminate
the queue manager resource. As part of queue manager
termination the utility manager process amqzmuc0 attempts to
write the buffered log records to the disk. However, this fails
with DISK_NOT_READY error as the disk is unavailable for
performing the write operation. Following this the amqzmuc0
process goes in a loop for a while consuming significant amount
of CPU until it is terminated by the queue manager.

The FDC with probe HL166091 is cut by the component
WriteLogPages2 when the buffer write fails with hrcE_MQLO_DNRD
error.
Local fix 

Problem summary 
****************************************************************
USERS AFFECTED:
WMQ users who has a potentially failing or offline disk during
a write operation.

Platforms affected:
All Distributed (iSeries, all Unix and Windows)
****************************************************************

PROBLEM SUMMARY:
When the amqzmuc0 process encounters the error hrcE_MQLO_DNRD
while attempting to write data to the log file, it runs in to a
infinite loop in the function mqlpgasn causing the delay in the
termination of the queue manager.
Problem conclusion 
The code is altered to handle this error condition.

---------------------------------------------------------------
The fix is targeted for delivery in the following PTFs:

                   v6.0
Platform           Fix Pack 6.0.2.5
--------           --------------------
Windows            U200292
AIX                U815929
HP-UX (PA-RISC)    U815636
HP-UX (Itanium)    U815818
Solaris (SPARC)    U815659
Solaris (x86-64)   U815928
iSeries            tbc
Linux (x86)        U815767
Linux (x86-64)     U815808
Linux (zSeries)    U815805
Linux (Power)      U815806
Linux (s390x)      U815807

The latest available maintenance can be obtained from
'Websphere MQ Recommended Fixes'
http://www-1.ibm.com/support/docview.wss?rs=171&uid=swg27006037

If the maintenance level is not yet available, information on
its planned availability can be found in 'Websphere MQ
Planned Maintenance Release Dates'
http://www-1.ibm.com/support/docview.wss?rs=171&uid=swg27006309
---------------------------------------------------------------
Temporary fix 
Comments 
APAR information 
APAR number IC56662 
Reported component name WMQ WINDOWS V6 
Reported component ID 5724H7200 
Reported release 602 
Status CLOSED PER 
PE NoPE 
HIPER NoHIPER 
Special Attention NoSpecatt 
Submitted date 2008-05-27 
Closed date 2008-06-16 
Last modified date 2008-06-16 


-------
Note:
-------

thread:

Q:

Hi,

I've got a problem with WebSphere MQ Demo 6.0.1.0 on a Windows 2003 Server running in a Virtual PC. 
Allmost at all times amqzlaa0 is consuming all the CPU cycles. With sysinternals process explorer 
we traced it down to a thread which is calling from AMQXCS2.dll!xcsSynchronizeCounterTime+0xf2 into Windows kernel.

Somtimes it is amqzmuc0 which calls xcsSynchronizeCounterTime as well. I found some hints that this routine has to do 
with logging/tracing. So: Does anyone know of a patch or how to disable logging so that routine doesn't get called?

Thanks,

A:

Trace is disabled by running:

endmqtrc

You should also try making sure that event monitoring is not configured 
on your queue manager -- run DIS QMGR in runmqsc and check that 
everything that ends EV is (DISABLED).

Also make sure you aren't using the Windows Performance Monitor to 
monitor your queues, as that implicitly enables some of the logging 
functionality.

The underlying problem is probably that your emulator isn't emulating 
the function that the Windows QueryPerformanceCounter() and/or 
GetSystemTimeAsFileTime() APIs require, or at least is not emulating the 
hardware correctly and efficiently.

Regards,


-------
Note:
-------

article:

Recently I have been trying to determine what the impact of using MQ message persistence is on disk subsystem performance. There is alot of literature from IBM recommending ideal configurations to support MQ persistence, so I won't turn this into a post that recommends ideal settings. What I did want to achieve though, was to get a better understanding of how MQ interacts with the disk subsystem when using message persistence, so that I could plan for capacity and make better recommendations for tuning on a production SAN environment.


The first behaviour I wanted to describe, was whether the use of MQ persistence creates random or sequential disk activity, on the assumption that random disk access to block locations on the disk would incur a greater time penalty than sequential access.

In order to setup a test environment, I installed MQ version 6 on winXP platform using the default settings. This test was setup using a custom java load harness to put/get messages from a default queue which had persistence enabled with DEFPSIST(YES). File and disk statistics were collected on a Win32 platform using filemon and diskmon respectively. 

The following graphs depict the block size written over time (x axis) relative to the sector position (y axis) of the disk drive being monitored. I was trying to achieve a similar effect to the Solaris internals TAZ disk trace utility, albeit in a Windows environment. File stats such as number and size of write operations were collected using filemon. Transaction throughput was measured by dividing the total amount of transactions (1,000) by the total processing time. A number of different configurations were tested, but I chose to present a common setup which uses circular logging with single write, and a file page size of 16,384 x 4KB = 64MB per primary log file. The two variations I have presented are writes (MQPUT) only as well as writes (MQPUT) and reads (MQGET) concurrently. The latter being a more familiar situation for me in the workplace.


It can be determined from the previous test results that MQ accesses the disk in a sequential behaviour. On windows, depending on the concurrent load, it will tend to allocate 16 - 32 x 512B blocks ranging from 8 - 16KB in size, although smaller less frequent writes will favour smaller 8 x 512B blocks 4KB in size. As a result, a high number of disk write IOs are observed. 

The environment used to test was virtual (parallels), so limited disk cacacity was available, however the change in amplitude for trans/sec is considered relative. As a result, over an 80% reduction in throughput was observed when reading (MQGET) and writing (MQPUT) to the same queue. This will have significant impact on disk subsystem performance in a production environment. 

I was unable to perform this test on a Solaris environment due to lack of privileged access, so would be keen to see what the results are there. I believe Solaris uses the same 512B block size, so am expecting similar results. Although analysing an MQ server's impact on disk performance using sar, produces the following information:
sar -d 1 10 | egrep 'md72'              
           md72             60     0.7    1867   16749     0.0     0.4
           md72             58     0.6    1955   17272     0.0     0.3
           md72             52     0.6    1768   15619     0.0     0.3
           md72             52     0.6    1764   15577     0.0     0.3
           md72             54     0.6    1817   16045     0.0     0.3
           md72             56     0.6    1861   16447     0.0     0.3
From that information I can then infer that the total reads + writes/sec (r+w/s) divided by the number of 512B blocks per second (blk/s) would give me an average size of around 8-9KB/sec being written to the filesystem, which correlates my assumptions on block/file sizes. Using sar and other tools like iostat though is thought to be fraught with danger, especially when monitoring a rather complex SAN subsystem; but without access to any other tools, I'm stuck with the basics. To date I've been relying on perceived throughput (Mr+Mw/s) reported by iostat -xM

Worth noting, is that when MQ performs an MQGET on a persistent queue, it also causes a write IO to update the active log file, so the number of total write IOs effectively doubles.

Out of all this I'm working on the following assumptions:
1. MQ uses sequential disk access for persistence logging.
2. The OS is typically allocating between 16 - 32 x 512B blocks ranging between 8 - 16KB in size.
3. MQGETs are a contributor to overall writes (I'm assuming o remove the transactions from the log files)
4. Transaction throughput is significantly reduced when an application is reading (MQGET) and writing (MQPUT) to the same queue.

Attached below is a copy of the qm.ini I am working with, which is based on recommended performance tuning considerations from IBM
#*******************************************************************#
#* Module Name: qm.ini                                             *#
#*******************************************************************#
#ExitPath:
   ExitsDefaultPath=/var/mqm/exits/
   ExitsDefaultPath64=/var/mqm/exits64/
#Log:
   LogPrimaryFiles=15
   LogSecondaryFiles=15
   LogFilePages=16384
   LogType=CIRCULAR
   LogBufferPages=512
   LogPath=/var/mqm/log/QUEUEMANGERNAME/
   LogWriteIntegrity=SingleWrite
#Service:
   Name=AuthorizationService
   EntryPoints=13
#ServiceComponent:
   Service=AuthorizationService
   Name=MQSeries.UNIX.auth.service
   Module=/opt/mqm/lib64/amqzfu
   ComponentDataSize=0
#Channels: 
   MaxChannels=400 
   MaxActiveChannels=400
#TCP: 
   KeepAlive=Yes 
#TuningParameters:
   DefaultQBufferSize=1048576
   DefaultPQBufferSize=1048576

LogFilePages is set to the maximum 16384 x 4KB making each S0000[n].log file about 64MB in size. You must set this parameter as part of your Queue Manager creation i.e. crtmqm -lf 16384 Hopefully that decreases the frequency MQ needs to loop around the ring of log files. LogPrimaryFiles and LogSecondary files are set to 15, and I haven't run out of space in my circular log files yet. It is worth reading up on the MQ system administration manual for this, in terms of planning how much space you need. As a planning figure I add the size of MQPUT messages with DEFPSIST + 750B overhead as IBM state, then multiply that amount by my expected throughput. Checkpoint behaviour as part of circular logging activity should hopefully release space as it checkpoints (once every 10,000 transactions I believe), so if you plan properly, you shouldn't hit the tail end of your ring too soon
 

The DefaultQBufferSize is 64KB which I increased to 1MB (1048576). Apparently this increases the size of the buffer before writing to the file system, but I haven't seen the evidence of that with the tools I'm using to monitor disk writes. This area is a little cloudy for me.

I have also modified the LogBufferPages to 512 x 4KB giving me a 2MB buffer for similar reasons.

And finally, I've been using a combination of either MQ, a custom perl script, or EMC iorate to benchmark message persistence on different mounts/filesystems etc which has been useful. Especially when you don't have one or the other available, and you need to determine if the space you've been allocated is up to the task of supporting MQ persistence.

In order to truss the process on a Solaris 9 environment, we first had to establish which pid was writing to the log mount. Our sysadmin was able to do this for me with the following:

# fuser -cu /var/mqm/log
/var/mqm/log:    27306c(root)   27197c(root)   19735o(sr123456)

In this case the sr user was the account under which mqm was running, so we were then able to determine the corresponding process name:

# top -s sr123456
2756  sr123456   11  59    0    0K    0K sleep 313:05 12.08% amqzlaa0_nd
20162 sr123456   10  59    0    0K    0K cpu45 406:47  9.49% amqrmppa
19735 sr123456    6  59    0    0K    0K sleep 718:45  3.47% amqzmuc0
19742 sr123456   14  59    0    0K    0K sleep  22:22  0.00% amqzlaa0_nd
19744 sr123456    1  59    0    0K    0K sleep   1:36  0.00% amqpcsea
19752 sr123456    3  59    0    0K    0K sleep   0:16  0.00% runmqlsr_nd
19734 sr123456    2  59    0    0K    0K sleep   0:10  0.00% amqzfuma
19733 sr123456   14  59    0    0K    0K sleep   0:06  0.00% amqzxma0_nd
19751 sr123456    1  59    0    0K    0K sleep   0:00  0.00% runmqtrm
19749 sr123456    1  59    0    0K    0K sleep   0:00  0.00% runmqtrm
19739 sr123456    4  59    0    0K    0K sleep   0:00  0.00% amqzmgr0
19746 sr123456    1  59    0    0K    0K sleep   0:00  0.00% runmqtrm
19747 sr123456    1  59    0    0K    0K sleep   0:00  0.00% runmqtrm
19737 sr123456    2  59    0    0K    0K sleep   0:00  0.00% amqrrmfa
19738 sr123456    2  59    0    0K    0K sleep   0:00  0.00% amqzdmaa

So in this case, the process named amqzmuc0 was the process performing the logging.

For reference, here is a list of processes and what they typically control for MQ:

1.  RUNMQLSR - MQ TCP listener (multi-threaded)
2.  AMQCLMAA - MQ TCP listener (single-threaded)
3.  AMQRMPPA - Channel process pooling job
4.  RUNMQCHI - MQ channel initiator
5.  AMQCRSTA - MQ receiving MCA jobs
6.  RUNMQCHL - MQ sending MCA jobs
7.  AMQCRS6B - LU62 receiver channel
8.  AMQPCSEA - MQ command server
9.  RUNMQTRM - Application trigger monitor
10. RUNMQDLQ - Dead letter queue handler
11. AMQFCXBA - MQ Broker Worker Job
12. RUNMQBRK - MQ Broker Control Job
13. AMQZMUC0 - MQ Utility Manager
14. AMQZMUR0 - MQ Utility Manager
15. AMQZMGR0 - MQ Process Controller
16. AMQRRMFA - MQ cluster repository manager
17. AMQZDMAA - MQ deferred message manager
18. AMQALMPX - MQ Log Manager
19. AMQZFUMA - MQ Object Authority Manager
20. AMQZLAS0 - MQ Local Queue Manager agents
21. AMQZLAA0 - MQ Local Queue Manager agents
22. AMQZXMA0 - MQ Execution Controller


With this info at hand we could then truss the process as per the following:

# truss -D -p 19735


Which produces the following output:

12677624.0012   lseek(18, 2543616, SEEK_SET)                    = 2543616
12677624.0017   write(18, "0FEE0315\001\0\0\0D6A4 f".., 4096)   = 4096
12677624.0018   lwp_cond_broadcast(0xFFFFFFFF7B46CAC0)          = 0
12677624.0018   lwp_mutex_wakeup(0xFFFFFFFF7B46CAA8)            = 0
12677624.0019   lseek(18, 2547712, SEEK_SET)                    = 2547712
12677624.0023   write(18, "0FEE02FD\001\0\0\0D6A4 f".., 4096)   = 4096
12677624.0024   lwp_cond_broadcast(0xFFFFFFFF7B46C3D8)          = 0
12677624.0024   lwp_mutex_wakeup(0xFFFFFFFF7B46C3C0)            = 0
12677624.0024   lwp_cond_broadcast(0xFFFFFFFF7B46CCE0)          = 0
12677624.0025   lwp_mutex_wakeup(0xFFFFFFFF7B46CCC8)            = 0
12677624.0025   lwp_cond_broadcast(0xFFFFFFFF7B46BE88)          = 0
12677624.0026   lwp_mutex_wakeup(0xFFFFFFFF7B46BE70)            = 0
12677624.0026   lseek(18, 2551808, SEEK_SET)                    = 2551808
12677624.0030   write(18, "\tD10206\001\0\0\0D6A4 f".., 4096)   = 4096


What we can see from this test case is that Solaris is writing 4KB sized blocks of data to the file system 
in support of MQ persistent logging. I would prefer a higher size (say 8 - 16KB) as the SAN under test would be more efficient 
and capable of higher throughput, but I guess that is a limitation of the size of messages 
I am writing (185B per message) and the way Solaris breaks up the IO.

There is a comment from an IBM performance tuning guide for message persistence that states:It is unlikely 
that poor persistent message throughput will be attributed to the 2MB limit of the queue manager log. 
It is possible to fill and empty the log buffer several times each second and reach a CPU limit writing data 
into the log buffer, before a log disk bandwidth limit is reached.

In this, they are referring to the LogBufferPages parameter which I have increased to its maximum configurable size 
of 512 x 4K pages = 2MB. At this point in time I am still working with the sysadmin in an effort to prove this statement 
provided by IBM. For the time being we are investigating the seemingly high number of s
yscalls and context switching which is evident from vmstat:

# vmstat 1
kthr      memory            page            disk          faults      cpu
r b w   swap  free  re  mf pi po fr de sr m0 m1 m3 m4   in   sy   cs us sy id
0 1 0 27901544 13821744 0 6 8  0  0  0  0  0  0  0  0 11369 173717 83335 16 24 60
0 1 0 27901544 13821688 0 0 8  0  0  0  0  0  0  0  0 11201 175563 84711 14 20 66
0 1 0 27901536 13821648 635 4573 0 0 0 0 0 0  0  0  0 11841 179265 84000 14 26 59
0 0 0 27900496 13821472 847 6045 8 8 8 0 0 0  0  0  0 13003 178012 82634 17 25 58
2 0 0 27901552 13821528 0 0 8  0  0  0  0  0  0  0  0 10994 174223 83744 15 20 64
0 1 0 27901544 13821480 0 0 0  0  0  0  0  0  0  0  0 11210 175509 83986 15 18 66
0 1 0 27901544 13821440 0 0 8  0  0  0  0  0  0  0  0 11178 174778 83952 17 19 63


Another aspect I am investigating is the use of threads my the amqzmuc0 process. It would seem that 
for a given load amqzmuc0 is spawns around XX threads for that process. Looking at the prstat for that process confirms the following:

# prstat -mL -p 29314
PID  USERNAME USR  SYS TRP TFL DFL LCK SLP LAT VCX ICX SCL SIG PROCESS/LWPID
29314 sr53186  0.4 0.7  -   -   -   -  3.2  -  564 157  1K   0 amqzmuc0/3
29314 sr53186  0.0 0.0  -   -   -   -  0.4  -    1   0   1   0 amqzmuc0/4
29314 sr53186  0.0 0.0  -   -   -   -  0.0  -    0   0   0   0 amqzmuc0/6
29314 sr53186  0.0 0.0  -   -   -   -  0.0  -    0   0   0   0 amqzmuc0/5
29314 sr53186  0.0 0.0  -   -   -   -  100  -    0   0   0   0 amqzmuc0/2
29314 sr53186  0.0 0.0  -   -   -   -  0.0  -    0   0   0   0 amqzmuc0/1


-------
Note:
-------

IY47735: QUEUE MANAGER HAD A SEGV IN AN AMQZLLP0 PROCESS
  

 Fixes are available 
WebSphere MQ V5.3 and WebSphere MQ Express V5.3 - Fix Pack 14 (CSD14)
WebSphere MQ v5.3 and WebSphere MQ Express v5.3 - Fix Pack 6 (CSD06)
WebSphere MQ V5.3 and WebSphere MQ Express V5.3 - Fix Pack 9 (CSD09)
WebSphere MQ V5.3 and WebSphere MQ Express V5.3 - Fix Pack 12 (CSD12)
WebSphere MQ v5.3 for iSeries - Fix Pack 10 (CSD10)
WebSphere MQ V5.3 for iSeries - Fix Pack 12 (CSD12)
WebSphere MQ V5.3 for iSeries - Fix Pack 11 (CSD11)
WebSphere MQ V5.3 & WebSphere MQ Express V5.3 - Fix Pack 7 (CSD07)
WebSphere MQ v5.3 for iSeries - Fix Pack 8 (CSD08)
WebSphere MQ V5.3 and WebSphere MQ Express V5.3 - Fix Pack 11 (CSD11)
WebSphere MQ V5.3 and WebSphere MQ Express V5.3 - Fix Pack 13 (CSD13)
WebSphere MQ V5.3 & WebSphere MQ Express V5.3 - Fix Pack 10 (CSD10)
WebSphere MQ v5.3 for iSeries - Fix Pack 6 (CSD06)
WebSphere MQ V5.3 & WebSphere MQ Express V5.3 - Fix Pack 8 (CSD08)

 
APAR status
Closed as program error.

Error description 
amqzllp0 received SIGSEGV caused by a queue having buffers with
non-zero write counts on buffers that are NULL.
Local fix 
Zero write counts at the time the buffers are freed.
Problem summary 
amqzllp0 received SIGSEGV caused by a queue having buffers with
non-zero write counts on buffers that are NULL.
It's not clear what the full set of conditions are that are
necessary for this problem to be encountered. It is certainly a
rare problem, but potentially any MQ5.3 user is affected.
The problem is caused by dirty write counts on queue
buffers.This causes a subsequent checkpoint to fail with a
SIGSEGV in MQ function aqhIdxToPtrFn called from aqpFlushCache
whilst checkpointing.
Problem conclusion 
The problem was fixed by ensuring that queue buffer write counts
are zeroed at the time that the queue buffers are freed.
It will be shiped into:
WebSphere MQ V5.3 CSD06
.
Windows           U200202
AIX               U489863
HP-UX             U489864
Linux on Intel    U489967
Linux on zSeries  U489972
Sun Solaris       U489865
Temporary fix 
Comments 
APAR information 
APAR number IY47735 
Reported component name WEBS MQ FOR LIN 
Reported component ID 5724B4104 
Reported release 530 
Status CLOSED PER 
PE NoPE 
HIPER NoHIPER 
Special Attention NoSpecatt 
Submitted date 2003-08-20 
Closed date 2003-08-26 
Last modified date 2004-03-10 

APAR is sysrouted FROM one or more of the following:

APAR is sysrouted TO one or more of the following:
 

-------
Note:
-------

Queue Manager fails unexpectedly but is able to restart
  
 Technote (FAQ) 
  
Problem 
Queue manager fails with several FDCs containing probes: XC130004 AQ051000, AO084010, AL047011. 
These FDCs indicate that the queue manager failed while doing a checkpoint..

See details bellow.  
  
Cause 
The checkpoint processor (amqzllp0) received SIGSEGV caused by a queue having buffers with non-zero write counts 
on buffers that are NULL. It is not clear what conditions are necessary for this problem to be encountered. 
It is certainly a rare problem, but any MQ v5.3 user may be affected.
 
  
Solution 
APAR IY47735 addresses this problem. Apply CSD06 or higher. 

Additional information


Probe Id :- XC130004 
Component :- xehExceptionHandler 
Program Name :- amqzllp0 
Major Errorcode :- STOP 
Probe Type :- HALT6109 
Arith1 :- 11 b 

MQM Function Stack 
zllpMain 
alsCheckPointLoop 
aocPerformCheckpoint 
aqmCheckPointQueue 
aqpCheckPointQ 
aqpFlushCache 
aqhIdxToPtrFn 
xcsFFST 
***************************** 
Probe Id :- AQ051000 
Component :- aqsStartQOp 
Program Name :- amqzllp0 
Major Errorcode :- STOP_ALL 
Probe Type :- HALT6110 

MQM Function Stack 
zllpMain 
alsCheckPointLoop 
aocPerformCheckpoint 
aqmCheckPointQueue 
xcsFFST 
**************************** 
Probe Id :- AO084010 
Component :- aocPerformCheckpoint 
Program Name :- amqzllp0 

MQM Function Stack 
zllpMain 
alsCheckPointLoop 
aocPerformCheckpoint 
xcsFFST 
*************************** 
Probe Id :- AL047011 
Component :- alsCheckPointLoop 
Program Name :- amqzllp0 
Major Errorcode :- STOP_ALL 

MQM Function Stack 
zllpMain 
alsCheckPointLoop 
xcsFFST 

We reviewed the FDCs closely and see the first FDC to be fired is the XC130004 FDC from the checkpoint process. 
This is a segmentation violation that has occurred in aqhIdxToPtrFn(). 

This function is called to dereference an index and return a pointer. The pointer seems to be corrupted resulting 
in this FDC. The other FDCs are the after effect of this FDC. 

In one case the qm error log showed - 

08/15/03 03:30:16 
AMQ7472: Object QS.CALC.REPLY.3F19889F21A08101, type queue damaged. 

08/15/03 03:30:23 
AMQ9542: Queue manager is ending. 

In another case there was no corrupt queue reported but the fdc's and the problem were the same. 
 
 
Related information 
APAR IY47735
 
   
Historical Number 
22199
7td
000  
  
Product Alias/Synonym 
MQ WMQ  
 

-------
Note:
-------

SE16730: MQM400 QUEUE MANAGER AGENT JOB (AMQZLAA0) CONSUMES HIGH CPU IN RFIALLOCCACHE/RFXLINK WHEN MQOPEN FAILS CONTINUALLY
 
Fixes are available 
WebSphere MQ 5.3 for HP OpenVMS Alpha and Itanium - Fix Pack 13
WebSphere MQ v5.3 for iSeries - Fix Pack 9 (CSD09)


APAR status
Closed as program error.

Error description 
Using WebSphere MQ V5.3 with CSD 06 under OS/400 5.1, the job
AMQZLAA0 has a very high CPU usage.
Local fix 
Problem summary 
****************************************************************
USERS AFFECTED:
All users of WebSphere MQ v5.3

Platforms affected:
All Distributed
****************************************************************
PROBLEM SUMMARY:
API calls such as MQOPEN will search the cluster repository
cache when the MQ object does not exist on the local Queue
Manager. Each search will attempt to allocate, and subsequently
free, a registration entry (of 320 bytes) in the cluster cache
memory block. A problem with pointer/offset arithmetic made
each freed entry appear to have size zero, causing each new
allocate to request a new entry from the memory block. After a
few hours, the number of freed entries had grown so large that
rfiAllocateCache was consuming high CPU scanning the chain of
freed cache entries.
Problem conclusion 
The problem has been fixed; rfiAllocateCache will re-use a
freed registration entry, instead of continually allocating new
registration entries.
Temporary fix 
TEST FIX
Comments 
The problem has been fixed; rfiAllocateCache will re-use a
freed registration entry, instead of continually allocating new
registration entries.
APAR information 
APAR number SE16730 
Reported component name WEB MQ FOR ISER 
Reported component ID 5724B4106 
Reported release 530 
Status CLOSED PER 
PE NoPE 
HIPER NoHIPER 
Special Attention NoSpecatt 
Submitted date 2004-07-23 
Closed date 2004-08-24 
Last modified date 2007-05-14 

APAR is sysrouted FROM one or more of the following:

APAR is sysrouted TO one or more of the following:

Modules/Macros
ROCSMGR           

Publications Referenced


Fix information 
Fixed component name WEB MQ FOR ISER 
Fixed component ID 5724B4106 

Applicable component levels 
R530 PSY SI16678    UP07/05/14 P 7121 
 

-------
Note:
-------


IC45414: AMQZLAA0 USES 100% CPU WHILE LOADING A QUEUE WITH PERSISTENT AND NON-PERSISTENT MESSAGES
  

 Fixes are available 
WebSphere MQ V5.3 and WebSphere MQ Express V5.3 - Fix Pack 14 (CSD14)
WebSphere MQ V5.3 and WebSphere MQ Express V5.3 - Fix Pack 12 (CSD12)
WebSphere MQ V5.3 for iSeries - Fix Pack 12 (CSD12)
WebSphere MQ V5.3 for iSeries - Fix Pack 11 (CSD11)
WebSphere MQ V5.3 and WebSphere MQ Express V5.3 - Fix Pack 11 (CSD11)
WebSphere MQ V5.3 and WebSphere MQ Express V5.3 - Fix Pack 13 (CSD13)
WebSphere MQ 5.3 for HP OpenVMS Alpha and Itanium - Fix Pack 12


APAR status
Closed as program error.

Error description 
Customer reporting 100% CPU usage each morning in amqzlaa0.exe.
Slow queue loading - persistent and non-persistent messages.
Applications hang while load completes.
Local fix 
Speed up loading queue with mixture of persistent and
non-persistent messages.
Problem summary 
****************************************************************
USERS AFFECTED:
All users of WebSphere MQ who mix persistent and non persistent
messages on the same queue

Platforms affected:
 All Distributed (iSeries, all Unix and Windows)
****************************************************************
PROBLEM SUMMARY:
When a queue is not used for a period of time, it may get
unloaded. If it does, it will be reloaded on the next access,
and the load will restore all the persistent messages first and
then add in the non persistent messages. The algorithm for
reloading the non-persistent messages was not optimal, and
resulted in frequent walks through the complete message chain,
which consumed disk i/o and CPU resource.
Problem conclusion 
The algorithm for reloading a queue has been modified to
optimize the reloading when the queue contains a mixture of
persistent and non persistent messages.

---------------------------------------------------------------
The fix is targeted for delivery in the following PTFs:

                   v5.3
Platform           Fix Pack 11
--------           --------------------
Windows            U200236
AIX                U802047
HP-UX (PA-RISC)    U802131
Solaris (SPARC)    U802142
iSeries            SI18375
Linux (x86)        U802143
Linux (zSeries)    U802146
Linux (Power)      Not applicable

The latest available maintenance can be obtained from
'WebSphere MQ Recommended Fixes'
http://www-1.ibm.com/support/docview.wss?rs=171&uid=swg27006037

If the maintenance level is not yet available, information on
its planned availability can be found in 'WebSphere MQ
Planned Maintenance Release Dates'
http://www-1.ibm.com/support/docview.wss?rs=171&uid=swg27006309
---------------------------------------------------------------
Temporary fix 
Comments 
APAR information 
APAR number IC45414 
Reported component name WEB MQ FOR WINS 
Reported component ID 5724B4100 
Reported release 530 
Status CLOSED PER 
PE NoPE 
HIPER NoHIPER 
Special Attention NoSpecatt 
Submitted date 2005-03-16 
Closed date 2005-03-24 
Last modified date 2007-08-02 

APAR is sysrouted FROM one or more of the following:

APAR is sysrouted TO one or more of the following:

Modules/Macros

Publications Referenced


Fix information 

Applicable component levels 
R530 PSY    UP 
 

-------
Note:
-------

SE10932: MQM400 XY324192 GETSUBPOOLSLOCK IN JOB AMQZXMA0 XECF_E_UNEXPECTE
  

APAR status
Closed as program error.

Error description 
=========================================
Date/Time         :- Thursday May 22 08:23:00  2003
Host Name         :- mvalxxxx.xxx.xxxxxxxxx.xxx.au
PIDS              :- 5724B4106
LVLS              :- 530.3  CSD03
Product Long Name :- WebSphere MQ for iSeries
Vendor            :- IBM
Probe Id          :- XY043007
Application Name  :- MQM
Component         :- xllSemGetVal
Build Date        :- Mar 19 2003
UserID            :- 00001186 (QMQM)
Job Name          :- 086731/QMQM/AMQZXMA0
Job Description   :- QMQM/AMQZXMA0
Submitted By      :- 086730/MQADMIN/STRMQM_R
Activation Group  :- 99 (QMQM) (QMQM/AMQZXMA0)
Process           :- 00002416
Thread            :- 00000003
QueueManager      :- MQaaaannnnx
Major Errorcode   :- xecF_E_UNEXPECTED_SYSTEM_RC
Minor Errorcode   :- OK
Probe Type        :- MSGAMQ6119
Probe Severity    :- 2
Probe Description :- AMQ6119: An internal WebSphere MQ error has
 occurred ('3021 - The value specified for the argument is not
  correc' from semctl.)
MQM Function Stack
xstSubpoolSubtask
xcsWaitEventSem
xcsRecoverSubpoolsLockForThread
xllSemGetVal
xcsFFST
.
..simialr another FDC  in the same file:-
.
================================================================
in amqerr01.log
see similar messages:-
AMQ6184: An internal WebSphere MQ error has occurred.The failing
         process is process 2416.
AMQ6119: An internal WebSphere MQ error has occurred ('3021 -
The value
        specified for the argument is not correc' from semctl.)
AMQ6184: An internal WebSphere MQ error has occurred.
.
============================================================
May 2003
-Date/Time- --- Filename - -Probe-  ---Tid-- --- Component
21 22:45:43 AMQ2416.0.FDC  XY324192 00000001 GetSubpoolsLock
           'AMQ6119: An internal WebSphere MQ error has
occurred',''3021
          - The value specified for the argument is not', '3021
bcd'
22 08:23:00 AMQ2416.0.FDC  XY043007 00000003 xllSemGetVal
           'AMQ6119: An internal WebSphere MQ error has
occurred',''3021
          - The value specified for the argument is not', '3021
bcd'
22 08:23:01 AMQ2416.0.FDC  XY043007 00000001 xllSemGetVal
           'AMQ6119: An internal WebSphere MQ error has
occurred',''3021
          - The value specified for the argument is not', '3021
bcd'
:-
this is defect 73655
Local fix 
Problem summary 
 IPC semaphore set being blown away by either ipcrm or two
 asynchronous ENDMQM MQMNAME(*ALL) ENDCCTJOB(*YES) in
 two simultaneous sessions.
Problem conclusion 
The failure to handle semaphore reporting EINVAL  has been
 added. The handler will try access the semaphore again to
 recreate it. If still unsuccessful after 5 retry attempts then
 a new FDC with probe XY324195 will be logged.
Temporary fix 
Comments 
APAR information 
APAR number SE10932 
Reported component name WEB MQ FOR ISER 
Reported component ID 5724B4106 
Reported release 530 
Status CLOSED PER 
PE NoPE 
HIPER NoHIPER 
Special Attention NoSpecatt 
Submitted date 2003-07-03 
Closed date 2003-09-24 
Last modified date 2007-05-14 

APAR is sysrouted FROM one or more of the following:

APAR is sysrouted TO one or more of the following:

Modules/Macros

Publications Referenced


Fix information 
Fixed component name WEB MQ FOR ISER 
Fixed component ID 5724B4106 

Applicable component levels 
R530 PSY SI10092    UP04/06/06 P 4083 
 

-------
Note:
-------


Abstract 

MQM400-MSGAMQ5615 STRMQM FAILS WITH AMQ5615 AFTER AMQZXMA0


Pre/Co-Requisite PTF / Fix List 


REQ  LICENSED      PTF/FIX  LEVEL 
TYPE PROGRAM  REL  NUMBER   MIN/MAX  OPTION 
---- -------- ---  -------  -------  ------ 
PRE  5724B41  530  SI13613   NONE     0000 
PRE  5724B41  530  SI13881   NONE     0000 
CO   5724B41  530  SI13906   NONE     0000 
CO   5724B41  530  SI13612   NONE     0000 
DIST 5724B41  530  SI13926   NONE     0001 
DIST 5724B41  530  SI11234   NONE     0001 
DIST 5724B41  530  SI10120   NONE     0001 


NOTICE: 
------- 
   Application of this PTF may disable or render ineffective programs that 
   use system memory addresses not generated by the IBM translator, 
   including programs that circumvent control technology designed to limit 
   interactive capacity to purchased levels.  This PTF may be a prerequisite 
   for future PTFs.  By applying this PTF you authorize and agree to the 
   foregoing. 

APAR Error Description / Circumvention 

DESCRIPTION OF PROBLEM FIXED FOR APAR SE10763 : 
----------------------------------------------- 
   The Queue Manager fails to start because the automatic 
   migration of cluster channel objects from MQ R520 to MQ R530 
   has corrupted the repository - due to an error in computing the 
   size of the MQCD data area which needs to be migrated. 

CORRECTION FOR APAR SE10763 : 
----------------------------- 
   The problem of startup after migration has been corrected. The 
   Queue Manager will be successfully started by ignoring 
   corrupted entries in the cluster repository, and logging a new 
   FDC record with Probe Id ZX054040 or ZX054060 in function 
   zxcRestoreObject. The FDC will identify those cluster objects 
   which need to be deleted and recreated after the Queue Manager 
   has become active. 

CIRCUMVENTION FOR APAR SE10763 : 
-------------------------------- 
   None. 

DESCRIPTION OF PROBLEM FIXED FOR APAR SE13524 : 
----------------------------------------------- 
   When rflOpen detects error rflRC_INCORRECT_FORMAT, the hFile 
   handle is being freed and changed to NULL before closing the 
   fildes in (hFile->fildes) - causing a pointer exception. 

CORRECTION FOR APAR SE13524 : 
----------------------------- 
   The condition causing the pointer exception whilst processing 
   data in the Channel file and/or the Sync file has been fixed. 

CIRCUMVENTION FOR APAR SE13524 : 
-------------------------------- 
   None. 

DESCRIPTION OF PROBLEM FIXED FOR APAR SE14146 : 
----------------------------------------------- 
   After applying CSD06, the CHGMQMQ command (or option 2 of the 
   WRKMQMQ panel) does not work. When a change is attempted, 
   the AMQ8008 message is displayed stating that the change has 
   occured, but a display of the queue shows the attributes have 
   not been changed. This problem is not reported when using the 
   ALTER MQSC command to change the queue attributes. 
   A similar problem happens with commands CPYMQMQ and CHGMQM. 

CORRECTION FOR APAR SE14146 : 
----------------------------- 
   The problem has been fixed in CSD07. The MQSeries command 
   processor will correctly parse the parameters specified in the 
   CHGMQMQ, CPYMQMQ and CHGMQM commands. 

CIRCUMVENTION FOR APAR SE14146 : 
-------------------------------- 
   As a workaround :- Attributes of the queue can be 
   changed using alter on mqsc command. 

DESCRIPTION OF PROBLEM FIXED FOR APAR SE13837 : 
----------------------------------------------- 
   This APAR describes two problems: 
   i) a small timing window in ENDMQM with ENDCCTJOB(*YES) which 
   prevents the tidy-up of shared memory files 
   in /QIBM/UserData/mqm/qmgrs/<QM_NAME>/&qmpersist 
   after all Listener jobs have been ended. 
   ii) none of the shared memory files are tidied up by ENDMQM 
   with ENDCCTJOB(*YES) when the user of the ENDMQM command does 
   not have sufficient authority to access the directory 
   /QIBM/UserData/mqm/qmgrs/<QM_NAME> 

CORRECTION FOR APAR SE13837 : 
----------------------------- 
   Both problems have been fixed. The program AMQICLEN, will 
   operate with the authority of the QMQM profile, when it is used 
   by ENDMQM with ENDCCTJOB(*YES). There is an extra delay of 2 
   seconds before starting the shared memory tidy-up. 
   In addition, ENDCCTJOB(*YES) will log a QPRINT history file 
   (for user QMQM) which will list all the Queue Manager 
   directories which have been processed, and when MQMNAME(*ALL) 
   is specified the process has been optimised to invoke AMQICLEN 
   once only. 
   This problem has been fixed in WMQ v5.3 CSD07. 

CIRCUMVENTION FOR APAR SE13837 : 
-------------------------------- 
   None 

DESCRIPTION OF PROBLEM FIXED FOR APAR SE14672 : 
----------------------------------------------- 
   Tracing functions in the Command Server is controlled by flags 
   YTRC_DATA_ADMIN and YTRC_FLOWS_ADMIN, but there is no 
   corresponding TRCTYPE value which allows the user to set these 
   flags. 

CORRECTION FOR APAR SE14672 : 
----------------------------- 
   Tracing functions in the Command Server will be controlled by 
   flags YTRC_DATA_OTHER and YTRC_FLOWS_OTHER. The user can set 
   these flags using TRCTYPE values *OTHDATA and *OTHFLOW 
   respectively. 
   This problem has been fixed in WMQ v5.3 CSD07 

CIRCUMVENTION FOR APAR SE14672 : 
-------------------------------- 
   Use TRCTYPE value *ALL. 

DESCRIPTION OF PROBLEM FIXED FOR APAR SE15656 : 
----------------------------------------------- 
   Collective Service Delivery for CSD07 

CORRECTION FOR APAR SE15656 : 
----------------------------- 
   Collective service delivery for the set of CSD07 PTFs          . 
   - which contains fixes for all the problems listed below:      . 
   .                                                              . 
   SE10763 - MQM400-MSGAMQ5615 STRMQM FAILS WITH AMQ5615 AFTER    . 
             AMQZXMA0 FAILS MCH0601 IN FUNCTION RFXADDCLQMGR      . 
             STRMQM fails after quiescing the QMANAGER with an    . 
             MSGAMQ5615. >> Default objects cannot be created:    . 
             CompCode = 0 Reason = 0. It was found that procedure . 
             rfxAddCLQMGR had been called many times resulting in . 
             the reported failure. The Queue Manager fails to start 
             because the automatic migration of cluster channel   . 
             objects from MQ R520 to MQ R530 has corrupted the    . 
             repository - due to an error in computing the size of 
             the MQCD data area which needs to be migrated.       . 
   .                                                              . 
   SE12660 - MQM400-MSGMCH3601 AMQZXMAX DURING AN FPRINT STATEMENT 
             4 RECEIVES ON THE STRMQM. - AMQAPICA NEEDS TO CHECK  . 
             FOR NULL POINTER.                                    . 
             A failure occurs in AMQZXMAX which receives MCH3601 on 
             STRMQM: To module . . . . . . . . :  AMQAPICA_N To   . 
             procedure  . . . . . . :  lclPrintLogStats Statement . 
             . . . . . . . :  4 Message . . . :  Pointer not set  . 
             for location referenced. referenced. referenced.     . 
   SE13288 - MQM400-MSGAMQ9592 PROGRAM CANNOT RESOLVE QUEUE MANAGER 
             OBJECT - THE ATTEMPT TO RESOLVE OBJECT '%CHLBATCH.18' 
             FAILS RC 2195                                        . 
             After migrating from MQ 5.2 to 5.3, one of our       . 
             channels will not start. Start fails, and Resolve    . 
             fails with: AMQ9592: Program can not resolve         . 
             queuemanager object. The attempt to resolve object   . 
             '%CHLBATCH.18' failed with reason code 2195.         . 
   SE13524 - MQM400  USING MQ CLIENT TO VIEW CHANNELS ON A SERVER . 
             CAUSES THE CMD SERVER JOB (AMQPCSEA) TO FAIL WITH    . 
             MSGAMQ9604 AND MSGMCH3601                            . 
             Customers have a problem viewing channels via the    . 
             client using a svrconn (ILSMQP1.PRCMQP1) channel.  On 
             the client side message AMQ4032 is being generated and 
             on the server (iSeries) side the AMQPCSEA job is     . 
             failing with a AMQ9604.                              . 
   SE13837 - MQM400 - SHARED MEMORY IS BEING LEFT BEHIND IN &SYSTEM 
             AND &QMPERSIST AFTER ENDMQMLSR IN OUR SHUTDOWN PROGRAM 
             Shared memory is being left behind in &system and    . 
             &qmpersist if the user does not have *ALLOBJ         . 
             authority.                                           . 
   SE14146 - MQM400-MSGAMQ8008 CHGMQMQ COMMAND OR OPTION 2 OF THE . 
             WRKMQM PANEL DOES NOT WORK AS EXPECTED (same as      . 
             SA96283 at R520)                                     . 
             The CHGMQMQ command or option 2 of the WRKMQM panel  . 
             does not work. When a change is attempted, message   . 
             AMQ8008 is displayed stating that the change has     . 
             occured, but a display of the queue shows the        . 
             attributes have not been changed.                    . 
   SE14601 - MQM400-CHANNEL REMAINS IN STOPPED STATUS AFTER QUEUE . 
             MANAGER                                              . 
             Enduser has migrated a V5.2 queue manager to V5.3.   . 
             The sender channels where placed in STOPPED status.  . 
             Now, when the migrated queue manager is started the  . 
             channels remain in STOPPED status, forcing the enduser 
             to manually start the channels. (see also IY53917)   . 
   SE14672 - MQM400-MSGAMQ9554 ALLOW TRCTYPE(*OTHDATA *OTHFLOW) TO 
             APPLY TO THE COMMAND SERVER COMPONENT                . 
             With the current TRCMQM, the only way to trace the   . 
             Command Server component is to use TRCTYPE(*ALL). We . 
             can take an APAR which will allow us to deliver a    . 
             circumvention at R530, namely to allow               . 
             TRCTYPE(*OTHDATA *OTHFLOW) to trace the Command      . 
             Server.                                              . 
   SE15311 - MQM400 - TEST FIX FOR IY51907 AT CSD06               . 
   SE15657 - MQM400 QUEUE MANAGER ENDS WHEN DAMAGED OBJECT IS     . 
             DETECTED AFTER FDC WITH PROBE ID AQ066010 IN         . 
             AQHALLOCATESPACE                                     . 
             Queue manager ends when damaged object is encountered, 
             and object cannot be recovered using RCRMQMOBJ after . 
             Queue Manager restart.                               . 
   SA96202 - MQM400 USER MANAGED JOURNAL RECEIVER DOES NOT SWITCH . 
             A User Managed Journal Reciever doesn't switch when a 
             journal receiver size has exceeded its THRESHOLD value 
             and if the RCDMQMIMG *ALL *ALL *YES command is run,  . 
             before the checkpoint has run on its own.            . 
   SA96224 - MQM400 RPG APPLICATION FAILS WITH ERROR CODE RC2023  . 
             In RPG programs, MQINQ returns RC  2023              . 
             (MQRC_INT_ATTRS_ARRAY_ERROR) when IACNT (IntAttrCount) 
             is 0.                                                . 
   SA96333 - MQM400 - AMQALMP4 DOES NOT LOG NEW AMQ7460 OR AMQ7462 
             AND THE AMQERR01.LOG DOES NOT UPDATE THE JOURNAL     . 
             RECEIVER NAME                                        . 
             RCDMQMIMG with DSPJRNDTA(*YES), AMQ7460 or AMQ7462   . 
             does not update the journal receiver name.           . 
   IC38202 - FDC file indicating a sigsegv exception in component . 
             rrxReportError.                                      . 
   IC38311 - An API Exit puts a message within the BEFORE CMIT    . 
             entry using MQPUT with SYNCPOINT which fails when    . 
             runmqchl is a SENDER or SERVER channel type.  Then it 
             gets a 2195, thereby failing the channel.  An FDC file 
             is created with a Probe Id of AT032010, from component 
             atmStartOp.                                          . 
   IC38907 - If the queue manager name passed to amqiclen contains 
             a .(dot), for example MY.QMGR, amqiclen does not     . 
             produce any output (see also IY52444).               . 
   IC39552 - SDR & CLUSSDR CHANNELS CAN BECOME STUCK IN           . 
             INITIALIZING STATUS                                  . 
   IC39916 - When RCDMQMIMG reaches a temporary dynamic queue while 
             recording the objects, it creates an AMQ7087 msg:    . 
             Object AMQ.xxxxxx , type queue is a temporary object . 
             and fails to record the rest of the objects.         . 
   IY29028 - SDR channels stay in INITIALIZING state during       . 
             start-up.                                            . 
   IY35297 - Clustering: MQSeries did not return an error when an . 
             MQCOMMIT was not successful in a clustering          . 
             environment.                                         . 
   IY47766 - After enabling pipelining on a channel that sends    . 
             messages larger then 4MB, regular AMQ9514 'Channel   . 
             <name> is in use' and AMQ9558 'Remote Channel is not . 
             currently available' messages are reported.          . 
   IY49438 - MQBEGIN fails with RC= 2128 ( MQRC_UOW_IN_PROGRESS ). 
             Two applications connect to the same queue manager and 
             they run transactions using two phase commit at the  . 
             same time. If one of them executes a transaction (not 
             using two phase commit) without an MQBEGIN after an  . 
             MQBACK, the next MQBEGIN of the other application    . 
             fails with 2128.                                     . 
   IY50293 - An FDC file was produced for a read error - disk full 
             on a model queue with enough disk space: The probes in 
             the FDC are: AD030001 adiReadFile                    . 
             xecF_E_UNEXPECTED_SYSTEM_RC and AQ168001 aqpReadData . 
             arcE_PAST_EOF                                        . 
   IY50439 - An FDC file is created with probe CO052000, claiming . 
             to have received invalid data.  This is seen when the 
             sending end violates the TSH protocol.  A new FDC    . 
             (probe CO000044) is created to directly report the   . 
             protocol violation and to dump the received data.    . 
   IY50795 - An MQ application takes a long time to do its        . 
             processing and the transaction monitor software      . 
             assumes the transaction has timed out.  The queue    . 
             manager fails to complete the xa_rollback call it    . 
             receives from the transaction monitor and program    . 
             amqzlaa0 creates FDC files with probe IDs of XC304020 
             from xlsDestroyMutex and XY398007 from               . 
             xcsFreeMemBlock.                                     . 
   IY51152 - An FDC file with probe RM409000 from rriWaitSecondary 
             is followed by XC130003 from rriSendThread2 from the . 
             same process (runmqchi), causing the channel initiator 
             to die. This stops all channels running as threads in 
             runmqchi, and stops channels from restarting, and  the 
             channel state stays as RETRYING.                     . 
   IY51386 - Pubsub keeps sending messages to queue after         . 
             MQRC_Q_FULL, rc2053, is returned.  Messages are put to 
             the dead letter queue, until it reaches its maximum  . 
             queue depth.                                         . 
   IY51907 - Checkpointing for users with large queues of         . 
             persistent messages takes along time and locks out   . 
             other operations. RUNMQSC is not able to execute.    . 
   IY51992 - When creating a queue manager using the application  . 
             group option, for example crtmqm -g groupname qmgrname 
             the permissions are not created properly for         . 
             /var/mqm/qmgrs/<qmgrname>/zsocketapp. The zsocketapp . 
             directory is being created with universal access     . 
             (drwxrwxrwx) which is incorrect.                     . 
   IY52011 - Channels create an FDC file with a channel in use    . 
             message.                                             . 
   IY52182 - Channel name is missing from message AMQ9514 when    . 
             displayed by runmqsc.                                . 
   IY52344 - amqiclen -p qmgr prefix does not work; it fails to   . 
             find mqs.ini.  Create a queue manager with MQSPREFIX . 
             set so that the queue manager data is in a separate  . 
             directory, for example: # export                     . 
             MQSPREFIX=/var/mqm/QMGRDIR # crtmqm NEWQMGR amqiclen . 
             -c -m NEWQMGR -v -p QMGRDIR</b> gives the error      . 
             message: Unable to get subpool lock. mqs.ini does not 
             exist.                                               . 
   IY52444 - If the queue manager name passed to amqiclen contains 
             a .(dot), for example MY.QMGR, amqiclen does not     . 
             produce any output (see also IC38907).               . 
   IY52569 - dis q(*) and dis ql(*) runmqsc commands hang         . 
             unexpectedly while a large queue is being loaded.    . 
   IY52572 - runmqsc is connected to a remote qmgr and the DIS QS . 
             command  is issued with a wild card specified in the . 
             queue name.  After the end of the list, runmqsc always 
             displays AMQ8416: MQSC timed out waiting for a       . 
             response from the command server.                    . 
   IY52575 - Improvements to amqiclen utility: 1. SEM: -1 -1 is   . 
             output if the semaphore set associated with the queue 
             manager QMGR directory is not found. 2. Add -h flag to 
             print headings in the amqiclen output. 3.  Add -t flag 
             to work with the -x flag to remove the trace control . 
             SEM and SHM IPC resources associated with            . 
             /var/mqm/errors. 4. Add the description of the errno . 
             (strerror) to error output.                          . 
   IY52676 - SIGSEGV in zfu_as_SearchPrincipalList, while         . 
             authenticating the user ID passed from the client.   . 
   IY52951 - When strmqcsv is started repeatedly without stopping . 
             after approximately 8182 times the strmqcsv command  . 
             creates an FDC file (probe-id XC307010 from          . 
             xlsRequestMutex).  Also with each restart of strmqcsv 
             (after 8182 times) strmqcsv returns with AMQ8101 error 
             code.                                                . 
   IY53065 - An FDC file (Probe ID:ZL043050) is generated with a  . 
             segmentation violation(SIGSEGV).  The customer uses  . 
             group messages.  This problem is generated only when . 
             the receiving process is completed without getting a . 
             part of the messages.                                . 
   IY53173 - Improve parameters traced on API exits.              . 
   IY53481 - Return value from rcrmqobj is incorrectly set to 71  . 
             (Unexpected error) when -z option is used.           . 
   IY53668 - High CPU use in a threaded MCA process, for exmaple: . 
             amqrmppa, runmqlsr, or runmqchi, when a channel runs . 
             an exit which creates a thread which persists after  . 
             the exit is called for MQXR_TERM.                    . 
   IY53700 - Clustering: There is an alias queue defined on QMGR_A 
             whose target queue (targq) is a queue on QMGR_B.  The 
             queue on QMGR_B is a CLUSTER queue.  The alias queue . 
             on QMGR_A is not in the cluster.  When putting a     . 
             message to the alias queue, the message goes to the  . 
             Dead Letter Queue with a reason of 2082 0x00000822   . 
             MQRC_UNKNOWN_ALIAS_BASE_Q.                           . 
   IY53907 - Clustering: During MQSET a user who has all          . 
             permissions for the cluster queue, but does not have a 
             put permissions to the   SYSTEM.CLUSTER.COMMAND.QUEUE. 
             The agent fails with a 2035 error and creates an FDC . 
             file with probe KN204020 from function               . 
             KqiDoPendingChangeCLQ.                               . 
   IY53917 - Sender or server channels went into STOPPED state    . 
             after migrating from WMQ 5.2 to WMQ 5.3 (see also    . 
             SE14601)                                             . 
     70541 - If the user issues a SysReq#2 to cancel an MQSC      . 
             session, subsequent MQSC sessions end imediately     . 
             with the message '0 MQSC commands completed          . 
             successfully'.                                       . 
   75119.1 - Add trace into atmReplayComplete                     . 
     78045 - CURDEPTH value does not reset after restart of       . 
             queue manager                                        . 
     78248 - Add trace into aqhLoadMsgChain                       . 
     78424 - alsReleaseGlobalHeap uses wrong FDC component name   . 
     79209 - RUNMQCHI joblog does not record INITQ name.          . 
     79805 - DSPMQMCHL with *CLTCN and *SVRCN channels shows wrong 
             field data                                           . 
     79958 - Add new #defines into copyfile CMQC                  . 
             #define MQCIH_PASS_EXPIRATION      0x00000001        . 
             #define MQCIH_UNLIMITED_EXPIRATION 0x00000000        . 
             #define MQCIH_REPLY_WITHOUT_NULLS  0x00000002        . 
             #define MQCIH_REPLY_WITH_NULLS     0x00000000        . 
             #define MQCIH_SYNC_ON_RETURN       0x00000004        . 
             #define MQCIH_NO_SYNC_ON_RETURN    0x00000000        . 
             #define MQIIH_PASS_EXPIRATION      0x00000001        . 
             #define MQIIH_UNLIMITED_EXPIRATION 0x00000000        . 
             #define MQIIH_REPLY_FORMAT_NONE    0x00000008        . 
     80382 - CSD07J: Large message put/retrieve failed with FDC   . 
             probe AO107010 in aomRecordMediaImage.               . 

CIRCUMVENTION FOR APAR SE15656 : 
-------------------------------- 
   None. 

DESCRIPTION OF PROBLEM FIXED FOR APAR SE12660 : 
----------------------------------------------- 
   The fatal failure occurs in the AMQZXMAX job during an fprint 
   statement because there was no checking for a NULL pointer being 
   passed in as an argument. 

CORRECTION FOR APAR SE12660 : 
----------------------------- 
   The problem is caused by the unexpected exception when trying to 
   open file 
   /QIBM/UserData/mqm/Qmgrs/<QMGR_NAME>/startprm/ZXMAXSTAT 
   has been fixed.  If the 'fopen' fails then a new FDC will be 
   logged with probe 0 in function apiStartup (Probe Id AO000000), 
   and the STRMQM task will continue to completion without logging 
   any start-up statistics. 

CIRCUMVENTION FOR APAR SE12660 : 
-------------------------------- 
   None. 

DESCRIPTION OF PROBLEM FIXED FOR APAR SE13288 : 
----------------------------------------------- 
   The receiver in-doubt problem is caused by a very small timing 
   window which is allowing simultaneous updates of the channel 
   status data. 

CORRECTION FOR APAR SE13288 : 
----------------------------- 
   Improvements have been made which maintain the integrity of the 
   status data channel name, and a new FDC with probe RM351000 in 
   rriAdoptMCA will be logged if an inconsistency is detected. 

CIRCUMVENTION FOR APAR SE13288 : 
-------------------------------- 
   None 

DESCRIPTION OF PROBLEM FIXED FOR APAR SE14601 : 
----------------------------------------------- 
   EThe SYNC record structure is changed in WebSphereMQ R530. 
   During the first start-up after migration from an earlier 
   release, the Queue Manager will automatically convert all the 
   messages in SYSTEM.CHANNEL.SYNCQ and save them back to the same 
   queue. The problem is caused by this conversion process saving 
   duplicate records for the same channel. 
   One correct record, with channel status saved by channel 
   process, and one erroneous record with corrupted message id 
   and wrong channel status. During Queue Manager restart, the 
   erroneous record will always be read(because it is at the 
   beginning of the queue) and the wrong status restored. 

CORRECTION FOR APAR SE14601 : 
----------------------------- 
   The problem has been fixed. After Queue Manager restart, 
   channels migrated from an earlier release will show status 
   INACTIVE instead of status STOPPED. 

CIRCUMVENTION FOR APAR SE14601 : 
-------------------------------- 
   None. 

DESCRIPTION OF PROBLEM FIXED FOR APAR SE15311 : 
----------------------------------------------- 
   When very stressed, MQ dumps long lock wait FDCs as a 
   warning. MQ could sometimes deadlock dumping one of these FDCs. 

CORRECTION FOR APAR SE15311 : 
----------------------------- 
   The possible deadlock when dumping a long lock wait FFST has 
   been fixed. 

CIRCUMVENTION FOR APAR SE15311 : 
-------------------------------- 
   none 

DESCRIPTION OF PROBLEM FIXED FOR APAR SE15657 : 
----------------------------------------------- 
   When a damaged queue object was encountered, the 
   aqhAllocateSpace routine generated an FDC with probe id AQ066010 
   and retcode = STOP_ALL. This caused the queue manager to end. 
   The retcode should be set to lrcE_OBJECT_DAMAGED to makr the 
   object as damaged and prevent the queue manager from ending 
   aburptly. 

CORRECTION FOR APAR SE15657 : 
----------------------------- 
   Changes has been done for aqhAllocateSpace probe  AQ066010 
   to set retcode =  lrcE_OBJECT_DAMAGED and not STOP_ALL. 
   The object would be marked as object damaged. The Object could 
   be recovered from a media image using the RCRMQMOBJ provided 
   RCDMQMIMG of the object has been successful, and that the 
   journal containing the media image of the object is available 
   The problem has been fixed in WMQ vresion 5.3 CSD07. 

CIRCUMVENTION FOR APAR SE15657 : 
-------------------------------- 
   None. 


Activation Instructions 


   None. 


Special Instructions 

   Whether or not you have experienced this problem, it is recommended that 
   you apply this PTF, following the instructions given below. 

   This PTF, for WebSphere MQ for iSeries (5724B4106) Version 5 Release 3, 
   is of type *IMMED. 

   Before applying this PTF, you MUST stop ALL Queue Manager activity and 
   FULLY quiesce WebSphere MQ :- 

   1. Use F12 (Cancel) to return to your initial MENU. 

      Note: If you have WebSphere MQ Commands (CMDMQM) as your initial 
            menu, change the initial menu in your user profile, sign off 
            and sign on again. 

   2. Warn all users that you are going to shut down WebSphere MQ, 
      and that they should not restart their Queue Managers or their MQI 
      applications until all PTFs have been loaded and applied. 

   3. Quiesce all queue managers in a controlled manner: 
      ... ENDMQM MQMNAME(*ALL) OPTION(*CNTRLD) ENDCCTJOB(*YES) TIMEOUT(15) 

   4. If step 3 does NOT log AMQ6154 ("Queue manager '*ALL' has been 
      quiesced") then shut down WebSphere MQ using the *IMMED option: 
      ... ENDMQM MQMNAME(*ALL) OPTION(*IMMED) ENDCCTJOB(*YES) TIMEOUT(15) 

   5. Shut down the default WebSphere MQ subsystem (QMQM) and also any 
      user-defined WebSphere MQ subsystems: 
      ... ENDSBS SBS(QMQM) OPTION(*IMMED) 

   6. Sign off, then sign on again. 

   7. Load and apply this PTF - using the OS/400 menu of PTF commands. 

   8. Load and apply other WebSphere MQ PTFs which are requisites for 
      this PTF. 

   9. If steps 3 or 4 did NOT log AMQ6154 ("Queue manager '*ALL' has been 
      quiesced") then you MUST clear WebSphere shared memory by repeating: 
      ... ENDMQM MQMNAME(*ALL) OPTION(*IMMED) ENDCCTJOB(*YES) TIMEOUT(15) 

   10. Restart the WebSphere MQ subsystem: 
       ... STRSBS SBSD(QMQM/QMQM) 

   11. Restart one or more Queue Managers, using either the STRMQM command 
       or option 14 from the WRKMQM panel. 

   All users who were actively using WebSphere MQ before it was quiesced 
   should sign off and sign on again before restarting their Queue 
   Managers or restarting their MQI applications. 


Default Instructions 

   THIS PTF CAN BE APPLIED IMMEDIATE OR DELAYED. 


Supersedes 

PTF/FIX NO(S).  APAR TITLE LINE 
--------------  ------------------------------------------------------------ 
   SI13026      MQM400 Additional Service on CSD6 
   SI11236      MQM400 XSTCONNECTEXTENT FAILS WITH MSGMCH3601 
   SI11236      WHEN USING STRMQM, IF HE SYSTEM.CHANNEL.SYNCQ IS DAMAGED, 
   SI11236      MQM400 Test Fix for SE12052 
   SI11236      MQM400 CSD06 CUMULATIVE SERVICE (WMQ 5.3 COMMANDS & OPS) 
   SI11236      MQM400 XSTCONNECTEXTENT FAILS WITH MSGMCH3601 
   SI11236      MQM400  INTERMITTENT AMQ6125 PROBE ID AL029008 FOLLOWING 
   SI10092      MQM400 CSD05 CUMULATIVE SERVICE (WMQ 5.3 COMMANDS & OPS) 
   SI10092      MQM400 WAMQZDMAA CONSUMES HIGH CUP WHILE OPENING DOING 
   SI10092      MQM400 QUEUE MANAGER DOES NOT END *IMMED WHEN RCDMQMIMG 
   SI10092      MQM400 XY324192 GETSUBPOOLSLOCK XECF_E_UNEXPECTED_SYSTEM_RC 
   SI10092      MQM400 MSGMCH3601 RECEIVED IN AQHRESIZESPACEMAP 
   SI10092      MQM400 - STRMQM FAILS WITH AMQ7432 AFTER A SYSTEM FAILURE 
   SI09428      MQM400 CSD04 CUMULATIVE SERVICE (MQSERIES OPS & CONTROLS) 
   SI08397      MQM400 WMQ53 CSD03 CODE CHANGES FOR COMMANDS & OPERATIONS 
   SI07469      MQM400 WMQ53 QMQM used when NO user id is specified for the 

Summary Information 

System..............................................  iSeries  
Models..............................................   
Release............................................  V5R3M0  
Recompile........................................  N  
Library................................................  QMQM  
MRI Feature......................................  NONE  
Cum Level......................................  C4209530  


-------
Note:
-------


IY36646: AN AMQPCSEA PROCESS DUMPED AN FDC DESCRIBING A CHANNEL THAT HAD UNEXPECTEDLY TERMINATED. EVEN THOUGH THE CHANNEL IS RUNNING FINE
  

 Fixes are available 
WebSphere MQ V5.3 Fix Pack 3 (CSD03)
WebSphere MQ V5.3 and WebSphere MQ Express V5.3 - Fix Pack 14 (CSD14)
WebSphere MQ v5.3 and WebSphere MQ Express v5.3 - Fix Pack 6 (CSD06)
WebSphere MQ V5.3 and WebSphere MQ Express V5.3 - Fix Pack 9 (CSD09)
WebSphere MQ V5.3 and WebSphere MQ Express V5.3 - Fix Pack 12 (CSD12)
WebSphere MQ v5.3 for iSeries - Fix Pack 10 (CSD10)
WebSphere MQ V5.3 for iSeries - Fix Pack 12 (CSD12)
WebSphere MQ V5.3 for iSeries - Fix Pack 11 (CSD11)
WebSphere MQ V5.3 & WebSphere MQ Express V5.3 - Fix Pack 7 (CSD07)
WebSphere MQ v5.3 for iSeries - Fix Pack 8 (CSD08)
WebSphere MQ V5.3 and WebSphere MQ Express V5.3 - Fix Pack 11 (CSD11)
WebSphere MQ V5.3 and WebSphere MQ Express V5.3 - Fix Pack 13 (CSD13)
WebSphere MQ V5.3 & WebSphere MQ Express V5.3 - Fix Pack 10 (CSD10)
WebSphere MQ v5.3 and WebSphere MQ Express v5.3 - Fix Pack 05 (CSD05)
WebSphere MQ V5.3 & WebSphere MQ Express V5.3 - Fix Pack 8 (CSD08)

 
APAR status
Closed as program error.

Error description 
An amqpcsea process dumped an FDC describing a channel that had
unexpectedly terminated. Even though the channel is running fine
.
It's possible that the pid info has got overwritten very briefly
the rriAccessSync function.
Local fix 
Eliminate the possibility by avoiding the temporary overwriting
of the PID.
Problem summary 
The problem has been fixed.
Problem conclusion 
This problem has been fixed and the fix will be shipped in the
following PTFs:
.
    A) WebSphere MQ V5.3    CSD03
.
          Windows           U200187
          AIX               U485561
          HP-UX             U485562
          Linux on Intel    U485563
          Linux on zSeries  U485646
          Sun Solaris       U485560
.
Temporary fix 
Comments 
APAR information 
APAR number IY36646 
Reported component name WEBS MQ FOR SUN 
Reported component ID 5724B4103 
Reported release 530 
Status CLOSED PER 
PE NoPE 
HIPER NoHIPER 
Special Attention NoSpecatt 
Submitted date 2002-11-04 
Closed date 2002-11-20 
Last modified date 2006-01-10 

APAR is sysrouted FROM one or more of the following:

APAR is sysrouted TO one or more of the following:

Modules/Macros

Publications Referenced


Fix information 
Fixed component name WEBS MQ FOR SUN 
Fixed component ID 5724B4103 

Applicable component levels 
R530 PSY    UP 
 

-------
Note:
-------


IY47486: WEBSPHERE MQ PROCESSES RUNMQSC AND AMQPCSEA HANG ACCESSING CLUSTER OBJECTS
  

 Fixes are available 
WebSphere MQ V5.3 and WebSphere MQ Express V5.3 - Fix Pack 14 (CSD14)
WebSphere MQ v5.3 and WebSphere MQ Express v5.3 - Fix Pack 6 (CSD06)
WebSphere MQ V5.3 and WebSphere MQ Express V5.3 - Fix Pack 9 (CSD09)
WebSphere MQ V5.3 and WebSphere MQ Express V5.3 - Fix Pack 12 (CSD12)
WebSphere MQ v5.3 for iSeries - Fix Pack 10 (CSD10)
WebSphere MQ V5.3 for iSeries - Fix Pack 12 (CSD12)
WebSphere MQ V5.3 for iSeries - Fix Pack 11 (CSD11)
WebSphere MQ V5.3 & WebSphere MQ Express V5.3 - Fix Pack 7 (CSD07)
WebSphere MQ v5.3 for iSeries - Fix Pack 8 (CSD08)
WebSphere MQ V5.3 and WebSphere MQ Express V5.3 - Fix Pack 11 (CSD11)
WebSphere MQ V5.3 and WebSphere MQ Express V5.3 - Fix Pack 13 (CSD13)
WebSphere MQ V5.3 & WebSphere MQ Express V5.3 - Fix Pack 10 (CSD10)
WebSphere MQ v5.3 for iSeries - Fix Pack 6 (CSD06)
WebSphere MQ v5.3 for iSeries - Fix Pack 9 (CSD09)
WebSphere MQ V5.3 & WebSphere MQ Express V5.3 - Fix Pack 8 (CSD08)

 
APAR status
Closed as program error.

Error description 
The problem is caused by a deadlock between amqrrmfa and
amqzlaa0, and was introduced in CSD03 by defect 72223.
Internal Reference Only:see prb 1441
Local fix 
Problem summary 
The problem is caused by a deadlock between amqrrmfa and
amqzlaa0, and was introduced in CSD03 by defect 72223. Internal
Reference Only:see prb 1441
Problem conclusion 
The problem has been fixed and will be inclu
ded in:
WebSphere MQ V5.3 CSD06

Windows           U200202
AIX               U489863
HP-UX             U489864
Linux on Intel    U489967
Linux on zSeries  U489972
Sun Solaris       U489865
Temporary fix 
Comments 
APAR information 
APAR number IY47486 
Reported component name WEBS MQ FOR SUN 
Reported component ID 5724B4103 
Reported release 530 
Status CLOSED PER 
PE NoPE 
HIPER NoHIPER 
Special Attention NoSpecatt 
Submitted date 2003-08-12 
Closed date 2003-08-13 
Last modified date 2004-02-12 

APAR is sysrouted FROM one or more of the following:

APAR is sysrouted TO one or more of the following:

Modules/Macros

Publications Referenced


Fix information 
Fixed component name WEBS MQ FOR SUN 
Fixed component ID 5724B4103 

Applicable component levels 
R530 PSY    UP 
 

-------
Note:
-------


AMQ9213 AMQCRSTA fails
  
 Technote (troubleshooting) 
  
Problem(Abstract) 
You start your sender channel and receive the following error:

AMQ9213 from ioctl for TCP/IP giving rc=22(x'16'). This is EINVAL, which means that one of the parameters for ioctl is invalid.  
  
 
Cause 
The inetd configuration file did not have the correct information or syntax.  
  
 
Resolving the problem 
Receive on TCP using either of the following:

Use the runmqlsr command
  See Using the WebSphere MQ listener

Use the inet daemon
  See Using the inet daemon (INETD)

Additional information
The inetd configuration settings are detailed in the following manual starting with Chapter 12. 
Refer to the manual for the details regarding inetd configuration on your platform. 

WebSphere MQ Intercommunication (SC34-6587-00)


Keywords: listener inetd fails RRCE_COMMUNICATIONS_ERROR COMMUNICATIONS ERROR AMQ9213 
  
 
Cross Reference information 
Segment Product Component Platform Version Edition 
Business Integration WebSphere MQ Express Channel Linux, Windows 5.3  
  Product Alias/Synonym 
WebSphere MQ WMQ MQ  
 
 
-------
Note:
-------

Using the inet daemon (INETD)

To establish a TCP connection, follow these steps. 
Edit the file /etc/services. If you do not have the following line in the file, add it as shown: 
MQSeries       1414/tcp      # MQSeries channel listenerNote: To edit this file, you must be logged in as a superuser or root.
Edit the file /etc/inetd.conf. If you do not have the following line in that file, add it as shown: 
MQSeries stream tcp nowait mqm /opt/mqm/bin/amqcrsta amqcrsta
[-m queue.manager.name]Find the process ID of the inetd with the command: 
ps -ef | grep inetdRun the command: 
kill -1 inetd processidIf you have more than one queue manager on your system, and therefore require more 
than one service, you must add a line for each additional queue manager to both /etc/services and inetd.conf.

For example: 
MQSeries1     1414/tcp
MQSeries2     1822/tcpMQSeries1 stream tcp nowait mqm /mqmtop/bin/amqcrsta amqcrsta -m QM1
MQSeries2 stream tcp nowait mqm /mqmtop/bin/amqcrsta amqcrsta -m QM2This avoids error messages being generated 
if there is a limitation on the number of outstanding connection requests queued at a single TCP port. 
For information about the number of outstanding connection requests, see Using the TCP listener backlog option.

The inetd process on Linuxr can limit the rate of inbound connections on a TCP port. 
The default is 40 connections in a 60 second interval. If you need a higher rate, specify a new limit 
on the number of inbound connections in a 60 second interval by appending a period (.) followed by the new limit 
to the nowait parameter of the appropriate service in inetd.conf. For example, for a limit of 500 connections 
in a 60 second interval use: 
MQSeries stream tcp nowait.500 mqm /mqmtop/bin/amqcrsta amqcrsta -m QM1


-------
Note:
-------

Using the extended inet daemon (XINETD)

The following instructions describe how the extended inet daemon is implemented on Red Hat Linuxr. 
If you are using a different Linux distribution, you might have to adapt these instructions.

To establish a TCP connection, follow these steps. 
Edit the file /etc/services. If you do not have the following line in the file, add it as shown: 
MQSeries       1414/tcp      # MQSeries channel listenerNote: To edit this file, you must be logged in as a superuser or root.
Create a file called MQSeriesr in the XINETD configuration directory, /etc/xinetd.d. Add the following stanza to the file: 
# WebSphere MQ service for XINETD
service MQSeries
{
  disable         = no
  flags           = REUSE
  socket_type     = stream
  wait            = no
  user            = mqm
  server          = /opt/mqm/bin/amqcrsta
  server_args     = -m queue.manager.name
  log_on_failure += USERID
}

Restart the extended inet daemon by issuing the following command: 
/etc/rc.d/init.d/xinetd restart

If you have more than one queue manager on your system, and therefore require more than one service, 
you must add a line to /etc/services for each additional queue manager. You can create a file in the 
/etc/xinetd.d directory for each service, or you can add additional stanzas to the MQSeries file 
you created previously.

The xinetd process on Linux can limit the rate of inbound connections on a TCP port. 
The default is 50 connections in a 10 second interval. If you need a higher rate, specify a new limit 
on the rate of inbound connections by specifying the 'cps' attribute in the xinetd configuration file. 
For example, for a limit of 500 connections in a 60 second interval use: 
cps = 500 60


-------
Note:
-------


SE31933 - MQM400 RUNMQCHL JOBLOG DO NOT CONTAIN ANY USEFUL ERROR MESSAGE.
  
 APAR (Authorized Program Analysis Report) 
 

Abstract 

MQM400 RUNMQCHL JOBLOG DO NOT CONTAIN ANY USEFUL ERROR MESSAGE. 

Error Description 

The RUNMQCHL joblog contents are different from v5 to v6.       
For example, at v5.3 the RUNMQCHL joblog contains:               
CPF1124 CPI1125 AMQ7163 AMQ9002 AMQ9558 AMQ9999 AMQ6993 CPF1164 
.                                                               
And at v6.0 the RUNMQCHL joblog contains:                       
CPF1124 CPI1125 AMQ7163 CPF1164                                 
.                                                               
At v6 the messages are being written to the AMQZMUR0 process     
joblog instead of the RUNMQCHL joblog.                           

Problem Summary 

**************************************************************** 
USERS AFFECTED:                                                 
Users having channels on iSeries                                 
                                                                
Platforms affected:                                             
iSeries                                                         
                                                                
**************************************************************** 
PROBLEM SUMMARY:                                                 
In WMQv6, the task of writing messages to AMQERR01.LOG is       
handed off outside the RUNMQCHL process by the error log daemon 
and messages are also written to the joblog of AMQZMUR0.         

Problem Conclusion 

Changes has been carried out for logging the message locally to 
the RUNMQCHL joblog instead of logging in AMQZMUR0 joblog.       
                                                                
--------------------------------------------------------------- 
The fix is targeted for delivery in the following PTFs:         
                                                                
                   v6.0                                         
Platform           Fix Pack 6.0.2.4                             
--------           --------------------                         
iSeries            SI31813                                       
                                                                
The latest available maintenance can be obtained from           
'Websphere MQ Recommended Fixes'                                 
http://www-1.ibm.com/support/docview.wss?rs=171&uid=swg27006037 
                                                                
If the maintenance level is not yet available, information on   
its planned availability can be found in 'Websphere MQ           
Planned Maintenance Release Dates'                               
http://www-1.ibm.com/support/docview.wss?rs=171&uid=swg27006309 
--------------------------------------------------------------- 

Temporary Fix 


Comments 


Circumvention 


Refer Queue manager error log for channel related errors.       
PTFs Available 

R600 SI33280    1000 

Affected Modules 

          
Affected Publications 


Summary Information 


Status............................................  CLOSED PER  
HIPER...........................................  No  
Component..................................  5724H7206  
Failing Module..........................  RCHMGR  
Reported Release...................  R600  
Duplicate Of..............................  
 

-------
Note:
-------


IY94700: Hang in channel processes such as runmqlsr, amqrmppa, runmqchl.
  

Fixes are available 
WebSphere MQ V5.3 and WebSphere MQ Express V5.3 - Fix Pack 14 (CSD14)
WebSphere MQ V5.3 for iSeries - Fix Pack 14 (CSD14)
WebSphere MQ V6.0 Fix Pack 6.0.2.2
WebSphere MQ V6.0 for iSeries Fix Pack 6.0.2.2


APAR status
Closed as program error.

Error description 
A channel process hangs. This can be diagnosed by obtaining
SIGUSR2 FDCs from channel processes (or all processes if you
aren't sure which are channel processes). You obtain a SIGUSR2
FDC from a process by sending it the SIGUSR2 signal, e.g. as
root: "kill -s SIGUSR2 PID" or "kill -s USR2 PID", where PID is
the pid of a channel process.

If the following sequence is seen towards the end of the
traceback info in the FDC, then you've most likely encountered
this problem.

  --{ cccProcessReceive
  ---{ recv
  ---} recv rc=Unknown(FFFF)

Local fix 
Problem summary 
****************************************************************
USERS AFFECTED:
Users of channels. The likelihood of a hang is very small. The
queue manager needs to be recycled if this occurs.

Platforms affected:
All Unix

****************************************************************
PROBLEM SUMMARY:
This problem shows up when a recvmsg function is interrupted
and returns with an errno of EINTR. This causes the WMQ
internal inter-process communications link to stall.
Problem conclusion 
Corrected the handling of EINTR from the recvmsg function.

---------------------------------------------------------------
The fix is targeted for delivery in the following PTFs:

                   v5.3
Platform           Fix Pack 14
--------           --------------------
AIX                U808477
HP-UX (PA-RISC)    U808478
Solaris (SPARC)    U808480
Linux (x86)        U808481
Linux (zSeries)    U808483

                   v6.0
Platform           Fix Pack 6.0.2.2
--------           --------------------
AIX                U809895
HP-UX (PA-RISC)    U809898
HP-UX (Itanium)    U810084
Solaris (SPARC)    U809913
Solaris (x86-64)   U810362
Linux (x86)        U809950
Linux (x86-64)     U810178
Linux (zSeries)    U810081
Linux (Power)      U810083
Linux (s390x)      U810110

The latest available maintenance can be obtained from
'WebSphere MQ Recommended Fixes'
http://www-1.ibm.com/support/docview.wss?rs=171&uid=swg27006037

If the maintenance level is not yet available, information on
its planned availability can be found in 'WebSphere MQ
Planned Maintenance Release Dates'
http://www-1.ibm.com/support/docview.wss?rs=171&uid=swg27006309
---------------------------------------------------------------
Temporary fix 
Comments 
APAR information 
APAR number IY94700 
Reported component name WMQ AIX V6 
Reported component ID 5724H7201 
Reported release 600 
Status CLOSED PER 
PE NoPE 
HIPER NoHIPER 
Special Attention NoSpecatt 
Submitted date 2007-02-09 
Closed date 2007-02-14 
Last modified date 2007-07-27 

APAR is sysrouted FROM one or more of the following:

APAR is sysrouted TO one or more of the following:

Modules/Macros

Publications Referenced


Fix information 
Fixed component name WMQ AIX V6 
Fixed component ID 5724H7201 
 

-------
Note:
-------

thread:

Q:


I follow WebSphere MQ Client v6.0 manual to verify my MQ installation. AFter define QM, QLOCAL and CHANNEL 
then start server connection by enter below runmqlsr command

runmqlsr -t tcp -m queue.manager.1 -p 1414 -i 192.168.1.2

Besides '5274-H72 (C) Copyright IBM corp. 1994, 2004. ALL RIGHT RESERVED.' then nothing else is continue to display.

I have to press CTRL+C to break it.

Can anyone tell me what did I miss?


A:
 
You are doing it right at first. By pressing Ctrl-C you are actually killing your listener. 
You can start your listener in the background so that your command window is available for use. 

On any Unix flavour, use
runmqlsr -t tcp -m queue.manager.1 -p 1414 -i 192.168.1.2 & 

And on Windows 
start /b runmqlsr -t tcp -m queue.manager.1 -p 1414 -i 192.168.1.2

 
-------
Note:
-------


runmqlsr (run listener)

Purpose
Use the runmqlsr command to start a listener process.

This command is run synchronously and will wait until the listener process has finished before returning to the caller.

Syntax

>>-runmqlsr-- -t ----------------------------------------------->

>--+- tcp --+------------+--+--------------+--+---------------+-+-->
   |        '- -p --Port-'  '- -i --IPAddr-'  '- -b --Backlog-' |   
   +- lu62 -- -n --TpName---------------------------------------+   
   |            .---------------------.                         |   
   |            V                     |                         |   
   +- netbios ----+-----------------+-+-------------------------+   
   |              +- -a --Adapter---+                           |   
   |              +- -l --LocalName-+                           |   
   |              +- -e --Names-----+                           |   
   |              +- -s --Sessions--+                           |   
   |              '- -o --Commands--'                           |   
   |        .-------------------.                               |   
   |        V                   |                               |   
   '- spx ----+---------------+-+-------------------------------'   
              +- -x --Socket--+                                     
              '- -b --Backlog-'                                     

>--+----------------+------------------------------------------><
   '- -m --QMgrName-'   

Required parameters
-t 
The transmission protocol to be used: 
tcp Transmission Control Protocol / Internet Protocol (TCP/IP) 
lu62 SNA LU 6.2 (Windowsr only) 
netbios NetBIOS (Windows only) 
spx SPX (Windows only) 

Optional parameters

-p Port 
The port number for TCP/IP. This flag is valid for TCP only. If you omit the port number, it is taken from the queue manager configuration information, or from defaults in the program. The default value is 1414. 
-i IPAddr 
The IP address for the listener, specified in one of the following formats: 
IPv4 dotted decimal 
IPv6 hexadecimal notation 
Alphanumeric format 
This flag is valid for TCP/IP only. 
On systems that are both IPv4 and IPv6 capable you can split the traffic by running two separate listeners, one listening on all IPv4 addresses and one listening on all IPv6 addresses. If you omit this parameter, the listener listens on all configured IPv4 and IPv6 addresses.

-n TpName 
The LU 6.2 transaction program name. This flag is valid only for the LU 6.2 transmission protocol. If you omit the name, it is taken from the queue manager configuration information. 
-a Adapter 
The adapter number on which NetBIOS listens. By default the listener uses adapter 0. 
-l LocalName 
The NetBIOS local name that the listener uses. The default is specified in the queue manager configuration information. 
-e Names 
The number of names that the listener can use. The default value is specified in the queue manager configuration information. 
-s Sessions 
The number of sessions that the listener can use. The default value is specified in the queue manager configuration information. 
-o Commands 
The number of commands that the listener can use. The default value is specified in the queue manager configuration information. 
-x Socket 
The SPX socket on which SPX listens. The default value is hexadecimal 5E86. 
-m QMgrName 
The name of the queue manager. By default the command operates on the default queue manager. 
-b Backlog 
The number of concurrent connection requests that the listener supports. See LU62, NETBIOS, TCP, and SPX for a list of default values and further information. 
Return codes
0 Command completed normally 
10 Command completed with unexpected results 
20 An error occurred during processing 

Examples
The following command runs a listener on the default queue manager using the NetBIOS protocol. 
The listener can use a maximum of five names, five commands, and five sessions. These resources 
must be within the limits set in the queue manager configuration information. 

runmqlsr -t netbios -e 5 -s 5 -o 5


-------
Note:
-------


IC35473: APAR TO DESCRIBE AND DOCUMENT DEFECT 54255.1REGARDING RUNMQCHI IS NOT HANDLING SIGCHLD RESULTINGIN DEFUNCT PROCESSES
  

APAR status
Closed as program error.

Error description 
Runmqchi is not correctly handling the SIGCHLD signal from
channelprocesses which results inthe the DEFUNCT processes.
this has been fixed via internal defect 54255.1
Local fix 
fix exists via defect 54255.1
Problem summary 
The problem has been fixed.
Problem conclusion 
This problem has been fixed and the fix will be shipped in PTFs
U200155 and U200156.
Temporary fix 
Comments 
APAR information 
APAR number IC35473 
Reported component name MQSERIES FOR CO 
Reported component ID 5765E3800 
Reported release 510 
Status CLOSED PER 
PE NoPE 
HIPER NoHIPER 
Special Attention NoSpecatt 
Submitted date 2003-01-20 
Closed date 2003-01-28 
Last modified date 2003-01-28 

APAR is sysrouted FROM one or more of the following:

APAR is sysrouted TO one or more of the following:

Modules/Macros

Publications Referenced


Fix information 
Fixed component name MQSERIES FOR CO 
Fixed component ID 5765E3800 

Applicable component levels 
R510 PSY    UP 
 
 
-------
Note:
-------


SE29898: MQM400 - Unexpected job descriptions for AMQPCSEA and
  

APAR status
Closed as documentation error.

Error description 
Enduser has created a job description for the AMQPCSEA.
RUNMCHL, RUNMQMLSR jobs in the queue manager library,
not referenced when WMQ tasked started.  In enduser
environment the QMQMJOBD job description in the QMQM
library.
.
Steps to recreate:
.
1. CRTMQM MQMNAME(COMMON)
.
2. Create job descriptions for queue manager jobs:
.
CRTDUPOBJ OBJ(QMQMJOBD) FROMLIB(QMQM) OBJTYPE(*JOBD)
          TOLIB(QMCOMMON) NEWOBJ(AMQALMPX)
CRTDUPOBJ OBJ(QMQMJOBD) FROMLIB(QMQM) OBJTYPE(*JOBD)
          TOLIB(QMCOMMON) NEWOBJ(AMQRRMFA)
CRTDUPOBJ OBJ(QMQMJOBD) FROMLIB(QMQM) OBJTYPE(*JOBD)
          TOLIB(QMCOMMON) NEWOBJ(AMQZDMAA)
CRTDUPOBJ OBJ(QMQMJOBD) FROMLIB(QMQM) OBJTYPE(*JOBD)
          TOLIB(QMCOMMON) NEWOBJ(AMQZFUMA)
CRTDUPOBJ OBJ(QMQMJOBD) FROMLIB(QMQM) OBJTYPE(*JOBD)
          TOLIB(QMCOMMON) NEWOBJ(AMQZLAA0)
CRTDUPOBJ OBJ(QMQMJOBD) FROMLIB(QMQM) OBJTYPE(*JOBD)
          TOLIB(QMCOMMON) NEWOBJ(AMQZMGR0)
CRTDUPOBJ OBJ(QMQMJOBD) FROMLIB(QMQM) OBJTYPE(*JOBD)
          TOLIB(QMCOMMON) NEWOBJ(AMQZMUC0)
CRTDUPOBJ OBJ(QMQMJOBD) FROMLIB(QMQM) OBJTYPE(*JOBD)
          TOLIB(QMCOMMON) NEWOBJ(AMQZMUR0)
CRTDUPOBJ OBJ(QMQMJOBD) FROMLIB(QMQM) OBJTYPE(*JOBD)
          OLIB(QMCOMMON) NEWOBJ(AMQZXMA0)
CRTDUPOBJ OBJ(QMQMJOBD) FROMLIB(QMQM) OBJTYPE(*JOBD)
          TOLIB(QMCOMMON) NEWOBJ(RUNMQCHL)
CRTDUPOBJ OBJ(QMQMJOBD) FROMLIB(QMQM) OBJTYPE(*JOBD)
          TOLIB(QMCOMMON) NEWOBJ(RUNMQCHI)
CRTDUPOBJ OBJ(QMQMJOBD) FROMLIB(QMQM) OBJTYPE(*JOBD)
          TOLIB(QMCOMMON) NEWOBJ(RUNMQLSR)
CRTDUPOBJ OBJ(QMQMJOBD) FROMLIB(QMQM) OBJTYPE(*JOBD)
          TOLIB(QMCOMMON) NEWOBJ(QMQMJOBD)
.
3. STRMQM MQMNAME(COMMON)
4. CRTMQMLSR LSRNAME(LISTENER1516) MQMNAME(COMMON)
       CONTROL(*QMGR) PORT(1516)
   NOTE: This will create the *NEW* WMQ V6 Listener object,
   which will automatically starts and terminates with
   the queue manager.
5. ENDMQM MQMNAME(COMMON) OPTION(*IMMED) ENDCCTJOB(*YES)
     RCDMQMIMG(*YES) TIMEOUT(15)
6. STRMQM MQMNAME(COMMON)
7. STRMQMLSR PORT(1818) MQMNAME(COMMON)
   NOTE: This is the pre-WMQ V6 way of starting the queue
         manager.
.
.
NOTE: Command server and channel initiator configured to
start with the queue manager.
.
With queue manager active, use option 22 from the WRKMQM
panel to display list of queue manager jobs.
.
Name                    Application
069981/QMQM/AMQALMPX    Checkpoint Job
069988/QMQM/AMQPCSEA    Command Server
069983/QMQM/AMQRRMFA    Repository Manager
069984/QMQM/AMQZDMAA    Deferred Message Handler
069978/QMQM/AMQZFUMA    Object Authority Manager
069986/QMQM/AMQZLAA0    Queue Manager Agent
069985/QMQM/AMQZMGR0    Process Controller
069979/QMQM/AMQZMUC0    Utility Manager
069982/QMQM/AMQZMUR0    Utility Manager
069977/QMQM/AMQZXMA0    Execution Controller
069997/QMQM/PUTXMSGSX2  PUTXMSGS
069987/QMQM/RUNMQCHI    Channel Initiator
069989/QMQM/RUNMQLSR    Threaded Listener  <-- STRMQMLSR 1515
069990/QMQM/RUNMQLSR    Threaded Listener  <-- STTMQMLSR 1516
                                               Listener object
.
Use option 5 (Display), then
    option 2 (Display job definition attributes)
    for job AMQPCSEA, RUNMQCHI and RUNMQLSR.
.
Job:   AMQPCSEA   User:   QMQM Number:   068777
Job description . . . . . . . . . . :   AMQZMGR0
  Library . . . . . . . . . . . . . :     QMCOMMON
Job queue . . . . . . . . . . . . . :
  Library . . . . . . . . . . . . . :
.
Job:   RUNMQCHI   User:   QMQM Number:   068776
Job description . . . . . . . . . . :   AMQZMGR0
  Library . . . . . . . . . . . . . :     QMCOMMON
Job queue . . . . . . . . . . . . . :
  Library . . . . . . . . . . . . . :
.
Job:   RUNMQLSR   User:   QMQM Number:   069990
Job description . . . . . . . . . . :   AMQZMGR0
  Library . . . . . . . . . . . . . :     QMCOMMON
Job queue . . . . . . . . . . . . . :
  Library . . . . . . . . . . . . . :
  NOTE: This job represents the listener object
.
Job:   PUTXMSGSX2 User:   QMQM Number:   070000
Job description  . . . . . . . . . . :   AMQZMGR0
  Library  . . . . . . . . . . . . . :     QMCOMMON
Job queue  . . . . . . . . . . . . . :
  Library  . . . . . . . . . . . . . :
  NOTE: This job represents the user-defined application
.
These jobs are spawned via AMQZMGR0, which is the name of
job description being referenced.
Local fix 
Enduser will create a job description AMQZMGR0 in the queue
manager library. Informed the customer with WebSphere MQ V6
these jobs are now spawned from AMQZMGR0 and they can control
the job attributes via this job description.
Problem summary 
When the jobs AMQPCSEA and RUNMQCHI are started, they do not
pick up the job descriptions created by the same name in queue
manager library or QMQM library, but instead takes the job
description used by AMQZMGR0 job.

Users affected: WMQ users on iSeries platform defining the job
descriptions AMQPCSEA and RUNMQCHI.

Platforms affected: iSeries

The jobs AMQPCSEA and RUNMQCHI are spawned by the parent job
AMQZMGR0 rather than started as a batch job. While spawning,
the parent job passes all the attributes to the child job.
Hence AMQPCSEA and RUNMQCHI job adopts the job description of
AMQZMGR0.
Problem conclusion 
In the "WebSphere MQ for iSeries System Administration Guide"
Version 6.0 manual, in chapter "Chapter 4. Work management" on
section "How WebSphere MQ uses the work management objects" pg
46, the following text needs to be corrected.

"Note: If WebSphere MQ jobs do not appear to be starting, make
sure that the subsystem is running and the job queue is not
held,"

The corrected text is
"Note:
1. If WebSphere MQ jobs do not appear to be starting, make sure
that the subsystem is running and the job queue is not held.
2. The AMQPCSEA and RUNMQCHI jobs are spawned from the parent
job AMQZMGR0 and hence they inherit the job attributes of
AMQZMGR0, including the job descriptions."
Temporary fix 
Comments 
APAR information 
APAR number SE29898 
Reported component name WMQ ISERIES V6 
Reported component ID 5724H7206 
Reported release 600 
Status CLOSED DOC 
PE NoPE 
HIPER NoHIPER 
Special Attention NoSpecatt 
Submitted date 2007-10-03 
Closed date 2007-10-30 
Last modified date 2007-10-30 

APAR is sysrouted FROM one or more of the following:

APAR is sysrouted TO one or more of the following:

Modules/Macros

Publications Referenced
SC34658600         


Fix information 

Applicable component levels 
R600 PSY    UP 
 

-------
Note:
-------


-------
Note:
-------


-------
Note:
-------


-------
Note:
-------


-------
Note:
-------


-------
Note:
-------


-------
Note:
-------


-------
Note:
-------


-------
Note:
-------


-------
Note:
-------


-------
Note:
-------


-------
Note:
-------


-------
Note:
-------


-------
Note:
-------


-------
Note:
-------


-------
Note:
-------

Installing MQ on AIX:


1. Some Definitions first:
==========================


1.1:
Websphere MQ, formerly known as MQ (message queue) series, is an IBM standard for program-to-program messaging 
across multiple platforms. Websphere MQ is sometimes referred to as message-oriented middleware (MOM). 

1.2:
MQSeries is an IBM software family whose components are used to tie together other software applications 
so that they can work together. This type of application is often known as business integration software or middleware. 


2. Some Important characteristics:
==================================

MQ is asynchronous messaging, which means that the sending process doesn't have to wait until the receiving process 
handles the data before it continues processing. Additionally, the content of the data doesn't have to be 
defined up front (although obviously the receiver needs to know what to do with it when it arrives).


3. Sample installation on unix (AIX):
=====================================


On AIX, you can use smitty, or use the setup program on the media.

WebSpherer MQ is supplied as a set of filesets that are installed using AIX's standard installation tools. 
The procedure below uses the smit tool, but you may chose to use installp, geninstall or 
the Web-based System Manager. You may select which components you want to install. The components and filesets are listed (partially) below; 
you must install at least the Runtime, and Server components.

WebSphere MQ for AIXr can be installed as a server or a client.
A WebSphere MQ server is an installation of one or more queue managers that provide queueing services to one or more clients. 
All the WebSphere MQ objects, for example queues, exist only on the queue manager machine (the WebSphere MQ server machine), 
and not the client. A WebSphere MQ server can also support local WebSphere MQ applications.

A WebSphere MQ client is a component that allows an application running on one system to communicate with a queue manager 
running on another system. The output from the call is sent back to the client, which passes it back to the application. 
To install a WebSphere MQ client see, Installing a WebSphere MQ client.

It is possible to have both a server and a client installation on the same machine, for instructions on how to do this see, 
Installing a client on the same machine as a server.

File descriptors
When running a multi-threaded process such as the agent process, you might reach the soft limit for file descriptors. 
This gives you the WebSphere MQ reason code MQRC_UNEXPECTED_ERROR (2195) and, if there are enough file descriptors, a WebSphere MQ FFSTT file.
To avoid this problem, you can increase the process limit for the number of file descriptors. To do this, alter the nofiles attribute in 
/etc/security/limits to 10,000 for the mqm user id or in the default stanza.

System Resource Limits
Set the system resource limit for data segment and stack segment to unlimited using the following commands in a command prompt:
ulimit -d unlimited
ulimit -s unlimited

Create the filesystems and userid and group before installation.


Filesystems:
-----------

The installation directory for the WebSpherer MQ product code is /usr/mqm. 
Working data is stored in /var/mqm. 
You cannot change these locations. The GSKit must also be installed into its default location.

You can also create separate file systems for your log data (/var/mqm/log) and error files (/var/mqm/errors). 
If possible, store log files on a different physical volume from the WebSpherer MQ queues (/var/mqm).

If you create separate file systems: 
The /var/mqm and /var/mqm/log directories must be on a local file system. 
The /var/mqm/errors directory can be NFS mounted. However, if you choose to NFS-mount /var/mqm/errors, 
the error logs might be lost if the network fails. 

WebSphere MQ libraries are in the following locations: /usr/mqm/lib (32-bit libraries) and /usr/mqm/lib64 (64-bit libraries).


User account:
-------------

WebSpherer MQ requires a user ID of the name mqm, with a primary group of mqm. The mqm user ID owns the directories and files 
that contain the resources associated with the product.

If you want to run administration commands, for example crtmqm (create queue manager) or strmqm (start queue manager), 
your user ID must be a member of the mqm group.

Users do not need mqm group authority to run applications that use the queue manager; 
it is needed only for the administration commands.

You can use smitty to add an existing user ID to the mqm group.

---------
Language:
---------

To select messages in a different language, use the following command with the identifier for the language you want to install: 
export LANG=message identifier
The message identifiers for the message catalogs are as follows:

de_DE (German) 
es_ES (Spanish) 
etc..

--------------------------
Most important components:
--------------------------

Component:	Description:										Fileset:
Runtime 	Mandatory component. Needed for application development and provides support for	mqm.base.runtime 
		external applications.  

SDK		Required for compiling applications.							mqm.base.sdk 

Server 		The server feature allows you to run queue managers on your computer and connect	mqm.server.rte 
		to other computers over a network. Provides messaging and queuing services 
		to applications, and support for WebSphere MQ client connections.  

Client		The WebSphere MQ client is a small subset of WebSphere MQ, without a queue manager.	mqm.client.rte 
		Provides remote access to WebSphere MQ. Must be connected to a server. 
		To install a client on the same machine as a server, use the Server CD-ROM; 
		otherwise use the Clients CD-ROM.  

Sample programs Sample application programs. Needed if you want to check your WebSphere MQ installation	mqm.base.samples  
		using the verification procedures described in Verifying the installation using the 
		JMS Postcard application. 

JavaT messaging The files needed for messaging using Java (includes Java Messaging Service).		mqm.java.rte 

Man pages 	UNIXr man pages, in U.S. English, for the following:					mqm.man.en_US.data 
		Control commands 
		Message Queue Interface (MQI) commands 
		MQSC commands 

And a number of "message catalogs", such as the French Message catalogs  mqm.msg.fr_FR,mqm.msg.Fr_FR 
 
 
----------------
Filesets on AIX:
----------------

Fileset             Component  
mqm.base.runtime    Runtime  
mqm.base.samples    Sample programs  
mqm.base.sdk        Base Kit  
mqm.Client.Bnd      Client Bundle (for Easy Installation)  
mqm.client.rte      Client for AIX  
mqm.dce.samples     DCE samples  
mqm.dce.server      DCE support  
mqm.java.rte        Java and JMS support  
mqm.keyman.rte      Support for SSL key management
mqm.server.rte      Server  
mqm.Server.Bnd      Server Bundle (for Easy Installation)  
gskak.rte  IBM      Global Security Kit V6   
mqm.man.en_US.data  Man pages (US English)  
mqm.msg.de_DE       Message catalog (German)  
mqm.msg.De_DE       Message catalog (German)  
mqm.msg.en_US       Message catalog (US English)  
mqm.msg.es_ES       Message catalog (Spanish)  
etc..

-------------
Installation:
-------------

Log in as root. 
Insert the WebSphere MQ Server CD-ROM into the CD-ROM drive. 
Enter the following command to mount the CD-ROM: 
mount /cdrom

Select the required smit window using the following sequence: 
Software Installation and Maintenance
	Install and Update Software
		Install and Update from ALL Available Software

Alternatively you can use a fastpath command (smitty install_latest), however this does not give you the opportunity 
to install the language filesets. 
Click List to display the input device or directory for the software, select the location that contains the installation images. 
Use the SOFTWARE to install field to obtain a list of available filesets, and select the filesets you want to install. 
Ensure that you include the appropriate message catalog if you require a messages in a language different than that specified by 
the locale selected on your machine. 
Make sure that "Include corresponding LANGUAGE filesets?" is set to yes. 
Change "Preview new LICENSE agreements?" to yes and press Enter to view the license agreements. 
Change "ACCEPT new license agreements?" to yes and press Enter to accept the license agreements 
and install WebSphere MQ. 

-----------------------------
Testing a Local Installation:
-----------------------------

>>> SETUP:
----------

To verify your installation you must first perform this task. From a shell window, use these steps 
to create a queue manager and a queue: 

-Log in as a user in the mqm group 
-Create a default queue manager called venus.queue.manager by entering the following command: 

$ crtmqm -q venus.queue.manager

You will see messages telling you that the queue manager has been created, and that the default WebSpherer MQ objects 
have been created. 

To start the queue manager, type: 

$ strmqm

A message tells you when the queue manager has started. 

- Enable MQSC commands by typing: 

$ runmqsc

A message tells you that an MQSC session has started.  

- Define a local queue called ORANGE.QUEUE by entering the following command: 

define qlocal (orange.queue)

A message tells you when the queue has been created. 

- Stop MQSC by typing: 

end

You will see some messages, followed by the command prompt. 

You have now defined: 
-- A default queue manager called venus.queue.manager 
-- A queue called ORANGE.QUEUE 

Note: 

Use the runmqsc command to issue MQSC "Message Queue Script Command" commands to a queue manager. 
MQSC commands enable you to perform administration tasks, 
for example defining, altering, or deleting a local queue object.

>> TEST:
--------

Before completing this task you must have created a queue manager called venus.queue.manager 
and a local queue called ORANGE.QUEUE. For instructions on how to do this see Setting up the installation.

To test the queue manager and queue, use the "amqsput" sample program to put a message on the queue, and the "amqsget" 
sample program to get the message back from the queue: 

- Log on as a user in group mqm, if you are not already. 
- Change into the /usr/mqm/samp/bin directory, which contains the sample programs. 
- Put a message on the queue using the following command: 

./amqsput ORANGE.QUEUE

The following messages are displayed: 
Sample AMQSPUT0 start
target queue is ORANGE.QUEUE

- Type some message text, on one or more lines, followed by a blank line. The following message is displayed: 

Sample AMQSPUT0 end 

Your message is now on the queue and the command prompt is displayed again. 

- To get the message from the queue, use the following command: 

./amqsget ORANGE.QUEUE

The sample program starts, and your message is displayed. After a pause, the sample ends and the command prompt is displayed again. 
You have now successfully verified your local installation.

----------------------------------------
Testing a Server to Server Installation:
----------------------------------------

To verify a server-to-server installation using two servers, one as a sender and one as a receiver, complete the following tasks.

-Setting up the sender server 
-Setting up the receiver server 
-Testing communication between the servers 

>>> Setting up the sender server

In order to verify a server-to-server installation you must first set up a sender server. 
From a shell window, follow these steps to set up the sender server.
Log in as a user in the mqm group. 
Create a default queue manager called saturn.queue.manager with the following command: 

$ crtmqm -q saturn.queue.manager

Messages tell you that the queue manager has been created, and that the default WebSpherer MQ objects have been created. 
To start the queue manager, type: 

$ strmqm

A message tells you when the queue manager has started. 

Start MQSC commands by typing: 

$ runmqsc

A message tells you that an MQSC session has started. 
Define a local queue called TRANSMIT1.QUEUE (to be used as a transmission queue) by entering the following command: 

define qlocal (transmit1.queue) usage (xmitq)

A message tells you when the queue has been created. 
Define a local definition of the remote queue with the following command: 

define qremote (local.def.of.remote.queue) rname (orange.queue) 
rqmname ('venus.queue.manager') xmitq (transmit1.queue)

The name specified by the rname parameter must be the same as the name of the queue to which you are sending 
the message (ORANGE.QUEUE on the receiver workstation). 

Define a sender channel with the following command: 

define channel (first.channel) chltype (sdr) 
conname ('con-name(port)') xmitq (transmit1.queue) trptype (tcp)

The value con-name is the TCP address of the receiver workstation, and port is the port number, port 1414 is the default port number. 
End MQSC by typing: 

end

Some messages are displayed, followed by the shell prompt. 
You have now defined the following objects:
A default queue manager called saturn.queue.manager 
A transmission queue called TRANSMIT1.QUEUE 
A local definition of a remote queue called LOCAL.DEF.OF.REMOTE.QUEUE 
A sender channel called FIRST.CHANNEL 

Now to set up the receiver server so that you can verify your server-to-server installation, see Setting up the receiver server.


>>> Setting up the receiver server:

After you have completed the task, Setting up the sender server, follow these steps to set up the receiver server:

Log in as a user in the mqm group. 
Create a default queue manager called venus.queue.manager by entering the following command: 

$ crtmqm -q venus.queue.manager

Messages tell you that the queue manager has been created, and that the default WebSpherer MQ objects have been created. 

To start the queue manager, type: 

$ strmqm

A message tells you when the queue manager has started. 

Enable MQSC commands by typing: 

$ runmqsc

A message tells you that an MQSC session has started. 
Define a local queue called ORANGE.QUEUE by entering the following command: 

define qlocal (orange.queue)

A message tells you when the queue has been created. 
Define a listener by entering the following command: 
Note: If you do not specify the port that the listener should listen on, the default of 1414 is used. 
If you specified a port other than 1414 in step 7 of Setting up the sender server, you must include the port parameter 
in the command, as shown below.

define listener (listener1) trptype (tcp) control (qmgr) port (port_number)

Where port_number is the name of the port the listener should run on. This must be the same as the number used 
when defining your sender channel. 
Start the listener by entering the following command: 

start listener (listener1)

Note: It is not recommended to start the listener in the background from any shell that automatically 
lowers the priority of background processes.

Define a receiver channel with the following command: 

define channel (first.channel) chltype (rcvr) trptype (tcp)

A message tells you when the channel has been created. 
End MQSC by typing: 

end

Some messages are displayed, followed by the shellprompt. 
You have now defined the following objects: 
A default queue manager called venus.queue.manager 
A queue called ORANGE.QUEUE 
A receiver channel called FIRST.CHANNEL 
Now to test communications between your sender and receiver workstations, see Testing communication between the servers.


>>> Testing communication between the servers

After completing, Setting up the sender server, and Setting up the receiver server, use this topic to test communications 
between sender and receiver workstations using sample programs. Use the amqsput sample program to put a message 
from the sender server to a queue at the receiver server, and the amqsget sample program on the receiver server 
to get the message from the queue:

Log in to both servers as a user in the mqm group. 
If the queue managers on the two servers have stopped, restart them now by typing the following on both servers: 

$ strmqm

On the sender server, start the sender channel using the MQSC START CHANNEL command and specify the channel name:  

START CHANNEL(FIRST.CHANNEL) 

The receiver channel on the receiver server starts automatically when the sender channel starts. 
On the sender server, change into the /usr/mqm/samp/bin directory, which contains the sample programs. 
To put a message on the local definition of the remote queue (which in turn specifies the name of the remote queue), 
use the following command: 

 ./amqsput LOCAL.DEF.OF.REMOTE.QUEUE

You will see the following messages:

Sample amqsput0 start
target queue is LOCAL.DEF.OF.REMOTE.QUEUE

Type some message text on one or more lines, followed by a blank line. You will see the following message: 
 Sample amqsput0 end

Your message is now on the queue and the command prompt is displayed again. 

On the receiver server, change into the /usr/mqm/samp/bin directory, which contains the sample programs. 
To get the message from the queue at the receiver, enter the following command: 

./amqsget ORANGE.QUEUE

The sample program starts, and your message is displayed. After a pause, the sample ends and the command prompt is displayed again. 

You have now successfully verified the server-to-server installation.


-------
Note:
-------


runmqsc (run MQSC commands)

Purpose
Use the runmqsc command to issue MQSC commands to a queue manager. MQSC commands enable you to perform administration tasks, 
for example defining, altering, or deleting a local queue object. MQSC commands and their syntax are described 
in the WebSphere MQ Script (MQSC) Command Reference.

Syntax

            .------------------------------.                 
            V                              |                 
>>-runmqsc----+--------------------------+-+--+----------+-----><
              +- -e ---------------------+    '-QMgrName-'   
              +- -v ---------------------+                   
              '- -w --WaitTime--+------+-'                   
                                '- -x -'                     

Description
You can invoke the runmqsc command in three ways: 
Verify command 
Verify MQSC commands but do not run them. An output report is generated indicating the success or failure of each command. This mode is available on a local queue manager only. 
Run command directly 
Send MQSC commands directly to a local queue manager. 
Run command indirectly 
Run MQSC commands on a remote queue manager. These commands are put on the command queue on a remote queue manager and run in the order in which they were queued. Reports from the commands are returned to the local queue manager. 
Indirect mode operation is performed through the default queue manager.

The runmqsc command takes its input from stdin. When the commands are processed, the results and a summary are put into a report that is sent to stdout.

By taking stdin from the keyboard, you can enter MQSC commands interactively.

By redirecting the input from a file, you can run a sequence of frequently-used commands contained in the file. You can also redirect the output report to a file.

Optional parameters
-e 
Prevents source text for the MQSC commands from being copied into a report. This is useful when you enter commands interactively. 
-v 
Verifies the specified commands without performing the actions. This mode is only available locally. The -w and -x flags are ignored if they are specified at the same time. 
-w WaitTime 
Run the MQSC commands on another queue manager. You must have the required channel and transmission queues set up for this. See Preparing channels and transmission queues for remote administration for more information. 
WaitTime 
The time, in seconds, that runmqsc waits for replies. Any replies received after this are discarded, but the MQSC commands still run. Specify a time between 1 and 999 999 seconds. 
Each command is sent as an Escape PCF to the command queue (SYSTEM.ADMIN.COMMAND.QUEUE) of the target queue manager.

The replies are received on queue SYSTEM.MQSC.REPLY.QUEUE and the outcome is added to the report. This can be defined as either a local queue or a model queue.

Indirect mode operation is performed through the default queue manager.

This flag is ignored if the -v flag is specified.

-x 
The target queue manager is running under z/OSr. This flag applies only in indirect mode. The -w flag must also be specified. In indirect mode, the MQSC commands are written in a form suitable for the WebSpherer MQ for z/OS command queue. 
QMgrName 
The name of the target queue manager on which to run the MQSC commands, by default, the default queue manager. 
Return codes
00 MQSC command file processed successfully 
10 MQSC command file processed with errors; report contains reasons for failing commands 
20 Error; MQSC command file not run 

Examples
Enter this command at the command prompt: 
runmqscNow you can enter MQSC commands directly at the command prompt. No queue manager name is specified, so the MQSC commands are processed on the default queue manager. 
Use one of these commands, as appropriate in your environment, to specify that MQSC commands are to be verified only: 
runmqsc -v BANK < "/u/users/commfile.in"
 
runmqsc -v BANK < "c:\users\commfile.in"This command verifies the MQSC commands in file commfile.in. The queue manager name is BANK. The output is displayed in the current window. 
These commands run the MQSC command file mqscfile.in against the default queue manager. 
runmqsc < "/var/mqm/mqsc/mqscfile.in" > "/var/mqm/mqsc/mqscfile.out"
 
runmqsc < "c:\Program Files\IBM\WebSphere MQ\mqsc\mqscfile.in" > 
	"c:\Program Files\IBM\WebSphere MQ\mqsc\mqscfile.out"In this example, the output is directed to file mqscfile.out. 


#############################################################################################
#############################################################################################
#############################################################################################


=====================================================================================
Secton 24. A collection of Unix error codes and messages.
=====================================================================================

 
##############################################################

SECTION 1: AIX IPL progress codes:

##############################################################


>>>> PART 1: POWER 5 and above<<<<:
===================================


-- AIX configuration program indicators

The numbers in this list display on the operator panel as the system loads the AIX operating system 
and prepares the hardware by loading software drivers.

Some systems may produce 4-digit codes. If the leftmost digit of a 4-digit code is 0, 
use the three rightmost digits.


Progress code Description/Action:
--------------------------------- 

2E6 The PCI Differential Ultra SCSI adapter or the Universal PCI Differential Ultra SCSI adapter being configured. 
2E7 Configuration method unable to determine if the SCSI adapter type is SE or DE type. 
440 9.1GB Ultra SCSI Disk Drive being identified or configured. 
441 18.2 GB Ultra SCSI Disk Drive being identified or configured. 
444 2-Port Multiprotocol PCI Adapter (ASIC) being identified or configured. 
447 PCI 64-bit Fibre Channel Arbitrated Loop Adapter being configured. 
458 36 GB DAT72 Tape Drive 
459 36 GB DAT72 Tape Drive 
45D 200 GB HH LTO2 Tape drive 
500 Querying Standard I/O slot. 
501 Querying card in Slot 1. 
502 Querying card in Slot 2. 
503 Querying card in Slot 3. 
504 Querying card in Slot 4. 
505 Querying card in Slot 5. 
506 Querying card in Slot 6. 
507 Querying card in Slot 7. 
508 Querying card in Slot 8. 
510 Starting device configuration. 
511 Device configuration completed. 
512 Restoring device configuration files from media. 
513 Restoring basic operating system installation files from media. 
516 Contacting server during network boot. 
517 Mounting client remote file system during network IPL. 
518 Remote mount of the root (/) and /usr file systems failed during network boot. 
520 Bus configuration running. 
521 /etc/init invoked cfgmgr with invalid options; /etc/init has been corrupted or incorrectly modified (irrecoverable error). 
522 The configuration manager has been invoked with conflicting options (irrecoverable error). 
523 The configuration manager is unable to access the ODM database (irrecoverable error). 
524 The configuration manager is unable to access the config.rules object in the ODM database (irrecoverable error). 
525 The configuration manager is unable to get data from a customized device object in the ODM database (irrecoverable error). 
526 The configuration manager is unable to get data from a customized device driver object in the ODM database (irrecoverable error). 
527 The configuration manager was invoked with the phase 1 flag; running phase 1 at this point is not permitted (irrecoverable error). 
528 The configuration manager cannot find sequence rule, or no program name was specified in the ODM database (irrecoverable error). 
529 The configuration manager is unable to update ODM data (irrecoverable error). 
530 The savebase program returned an error. 
531 The configuration manager is unable to access the PdAt object class (irrecoverable error). 
532 There is not enough memory to continue (malloc failure); irrecoverable error. 
533 The configuration manager could not find a configuration method for a device. 
534 The configuration manager is unable to acquire database lock (irrecoverable error). 
535 HIPPI diagnostics interface driver being configured. 
536 The configuration manager encountered more than one sequence rule specified in the same phase (irrecoverable error). 
537 The configuration manager encountered an error when invoking the program in the sequence rule. 
538 The configuration manager is going to invoke a configuration method. 
539 The configuration method has terminated, and control has returned to the configuration manager. 
541 A DLT tape device is being configured. 
542 7208-345 60 GB tape drive
7334-410 60 GB tape drive
 
549 Console could not be configured for the Copy a System Dump Menu. 
551 IPL vary-on is running. 
552 IPL vary-on failed. 
553 IPL phase 1 is complete. 
554 The boot device could not be opened or read, or unable to define NFS swap device during network boot. 
555 An ODM error occurred when trying to vary-on the rootvg, or unable to create an NFS swap device during network boot. 
556 Logical Volume Manager encountered error during IPL vary-on. 
557 The root file system does not mount. 
558 There is not enough memory to continue the system IPL. 
559 Less than 2 MB of good memory are available to load the AIX kernel. 
569 FCS SCSI protocol device is being configured (32 bits). 
570 Virtual SCSI devices being configured. 
571 HIPPI common function device driver being configured. 
572 HIPPI IPI-3 master transport driver being configured. 
573 HIPPI IPI-3 slave transport driver being configured. 
574 HIPPI IPI-3 transport services user interface device driver being configured. 
575 A 9570 disk-array driver being configured. 
576 Generic async device driver being configured. 
577 Generic SCSI device driver being configured. 
578 Generic commo device driver being configured. 
579 Device driver being configured for a generic device. 
580 HIPPI TCP/IP network interface driver being configured. 
581 Configuring TCP/IP. 
582 Configuring Token-Ring data link control. 
583 Configuring an Ethernet data link control. 
584 Configuring an IEEE Ethernet data link control. 
585 Configuring an SDLC MPQP data link control. 
586 Configuring a QLLC X.25 data link control. 
587 Configuring a NETBIOS. 
588 Configuring a Bisync Read-Write (BSCRW). 
589 SCSI target mode device being configured. 
590 Diskless remote paging device being configured. 
591 Configuring an LVM device driver. 
592 Configuring an HFT device driver. 
593 Configuring SNA device drivers. 
594 Asynchronous I/O being defined or configured. 
595 X.31 pseudo-device being configured. 
596 SNA DLC/LAPE pseudo-device being configured. 
597 OCS software being configured. 
598 OCS hosts being configured during system reboot. 
599 Configuring FDDI data link control. 
59B FCS SCSI protocol device being configured (64 bits). 
5C0 Streams-based hardware drive being configured. 
5C1 Streams-based X.25 protocol being configured. 
5C2 Streams-based X.25 COMIO emulator driver being configured 
5C3 Streams-based X.25 TCP/IP interface driver being configured. 
5C4 FCS adapter device driver being configured. 
5C5 SCB network device driver for FCS being configured. 
5C6 AIX SNA channel being configured. 
600 Starting network boot portion of /sbin/rc.boot. 
602 Configuring network parent devices. 
603 /usr/lib/methods/defsys, /usr/lib/methods/cfgsys, or /usr/lib/methods/cfgbus failed. 
604 Configuring physical network boot device. 
605 Configuration of physical network boot device failed. 
606 Running /usr/sbin/ifconfig on logical network boot device. 
607 /usr/sbin/ifconfig failed. 
608 Attempting to retrieve the client.info file with tftp. 

Note:
Note that a flashing 608 indicates multiple attempt(s) to retrieve the client_info file are occurring. 

609 The client.info file does not exist or it is zero length. 
60B 18.2 GB 68-pin LVD SCSI Disk Drive being configured. 
610 Attempting remote mount of NFS file system. 
611 Remote mount of the NFS file system failed. 
612 Accessing remote files; unconfiguring network boot device. 
613 8 mm 80 GB VXA-2 tape device 
614 Configuring local paging devices. 
615 Configuration of a local paging device failed. 
616 Converting from diskless to dataless configuration. 
617 Diskless to dataless configuration failed. 
618 Configuring remote (NFS) paging devices. 
619 Configuration of a remote (NFS) paging device failed. 
61B 36.4 GB 80-pin LVD SCSI Disk Drive being configured. 
61D 36.4 GB 80-pin LVD SCSI Disk Drive being configured. 
61E 18.2 GB 68-pin LVD SCSI Disk Drive being configured. 
620 Updating special device files and ODM in permanent file system with data from boot RAM file system. 
621 9.1 GB LVD 80-pin SCSI Drive being configured. 
622 Boot process configuring for operating system installation. 
62D 9.1 GB 68-pin LVD SCSI Disk Drive being configured. 
62E 9.1GB 68-pin LVD SCSI Disk Drive being configured. 
636 TURBOWAYS� 622 Mbps PCI MMF ATM Adapter. 
637 Dual Channel PCI-2 Ultra2 SCSI Adapter being configured. 
638 4.5 GB Ultra SCSI Single Ended Disk Drive being configured. 
639 9.1 GB 10K RPM Ultra SCSI Disk Drive (68-pin). 
643 18.2 GB LVD 80-pin SCA-2 connector SCSI Disk Drive being configured. 
63A See 62D. 
63B 9.1 GB 80-pin LVD SCSI Disk Drive being configured. 
63C See 60B. 
63D 18.2 GB 80-pin LVD SCSI Disk Drive being configured. 
63E 36.4 GB 68-pin LVD SCSI Disk Drive being configured. 
63F See 61B. 
640 9.1 GB 10K RPM Ultra SCSI Disk Drive (80-pin). 
646 High-Speed Token-Ring PCI Adapter being configured. 
64A See 62E. 
64B 9.1 GB 80-pin LVD SCSI Disk Drive being configured. 
64C See 61E. 
64D 18.2 GB LVD 80-pin Drive/Carrier being configured. 
64E 36.4 GB 68-pin LVD SCSI Disk Drive being configured. 
64F See 61D. 
650 SCSD disk drive being configured. 
653 18.2 GB Ultra-SCSI 16-bit Disk Drive being configured. 
655 GXT130P Graphics adapter being configured. 
657 GXT2000P graphics adapter being configured. 
658 PCI Fibre Channel Disk Subsystem Controller being identified or configured. 
659 2102 Fibre Channel Disk Subsystem Controller Drawer being identified or configured. 
660 2102 Fibre Channel Disk Array being identified or configured. 
662 Ultra2 Integrated SCSI controller. 
663 The ARTIC960RxD Digital Trunk Quad PCI Adapter or the ARTIC960RxF Digital Trunk Resource Adapter being configured. 
664 32x (MAX) SCSI-2 CD-ROM drive being configured. 
667 PCI 3-Channel Ultra2 SCSI RAID Adapter being configured. 
669 PCI Gigabit Ethernet Adapter being configured. 
66A Keyboard/Mouse Attachment Card-PCI being configured. 
66C 10/100/1000 Base-T Ethernet PCI Adapter. 
66D PCI 4-Channel Ultra-3 SCSI RAID Adapter. 
66E 4.7 GB DVD-RAM drive. 
674 ESCON� Channel PCI Adapter being configured. 
677 PCI 32-bit Fibre Channel Arbitrated Loop Adapter being configured. 
678 12 GB 4 mm SCSI tape drive 
67B PCI Cryptographic Coprocessor being configured. 
682 20x (MAX) SCSI-2 CD-ROM Drive being configured. 
689 4.5 GB Ultra SCSI Single Ended Disk Drive being configured. 
68C 20 GB 4-mm Tape Drive being configured. 
68E POWER GXT6000P PCI Graphics Adapter. 
690 9.1 GB Ultra SCSI Single Ended Disk Drive being configured. 
69b 64-bit/66 MHz PCI ATM 155 MMF PCI adapter being configured. 
69d 64-bit/66 MHz PCI ATM 155 UTP PCI adapter being configured. 
6CC SSA disk drive being configured. 
700 A 1.1 GB 8-bit SCSI disk drive being identified or configured. 
701 A 1.1 GB 16-bit SCSI disk drive being identified or configured. 
702 A 1.1 GB 16-bit differential SCSI disk drive being identified or configured. 
703 A 2.2 GB 8-bit SCSI disk drive being identified or configured. 
704 A 2.2 GB 16-bit SCSI disk drive being identified or configured. 
705 The configuration method for the 2.2 GB 16-bit differential SCSI disk drive is being run. If an irrecoverable error occurs, the system halts. 
706 A 4.5 GB 16-bit SCSI disk drive being identified or configured. 
707 A 4.5 GB 16-bit differential SCSI disk drive being identified or configured. 
708 An L2 cache being identified or configured. 
709 128 port ISA adapter being configured 
710 POWER GXT150M graphics adapter being identified or configured. 
711 Unknown adapter being identified or configured. 
712 Graphics slot bus configuration is executing. 
713 The IBM ARTIC960 device being configured. 
714 A video capture adapter being configured. 
715 The Ultramedia Services audio adapter being configured. This number displays briefly on the panel. 
717 TP Ethernet Adapter being configured. 
718 GXT500 Graphics Adapter being configured. 
720 Unknown read/write optical drive type being configured. 
721 Unknown disk or SCSI device being identified or configured. 
722 Unknown disk being identified or configured. 
723 Unknown CD-ROM being identified or configured. 
724 Unknown tape drive being identified or configured. 
725 Unknown display adapter being identified or configured. 
726 Unknown input device being identified or configured. 
727 Unknown async device being identified or configured. 
728 Parallel printer being identified or configured. 
729 Unknown parallel device being identified or configured. 
730 Unknown diskette drive being identified or configured. 
731 PTY being identified or configured. 
732 Unknown SCSI initiator type being configured. 
733 7 GB 8-mm tape drive being configured. 
734 4x SCSI-2 640 MB CD-ROM Drive being configured. 
736 Quiet Touch keyboard and speaker cable being configured. 
741 1080 MB SCSI Disk Drive being configured. 
745 16 GB 4-mm Tape Auto Loader being configured. 
746 SCSI-2 Fast/Wide PCI Adapter being configured. 
747 SCSI-2 Differential Fast/Wide PCI Adapter being configured. 
749 7331 Model 205 Tape Library being configured. 
751 SCSI 32-bit SE F/W RAID Adapter being configured. 
754 1.1 GB 16-bit SCSI disk drive being configured. 
755 2.2 GB 16-bit SCSI disk drive being configured. 
756 4.5 GB 16-bit SCSI disk drive being configured. 
757 External 13 GB 1.5M/s 1/4-inch tape being configured. 
763 SP Switch MX Adapter being configured. 
764 SP System Attachment Adapter being configured. 
772 4.5 GB SCSI F/W Disk Drive being configured. 
773 9.1 GB SCSI F/W Disk Drive being configured. 
774 9.1 GB External SCSI Disk Drive being configured. 
776 PCI Token-Ring Adapter being identified or configured. 
777 10/100 Ethernet Tx PCI Adapter being identified or configured. 
778 POWER GXT3000P 3D PCI Graphics adapter being configured. 
77B 4-Port 10/100 Ethernet Tx PCI Adapter being identified or configured. 
77c A 1.0 GB 16-bit SCSI disk drive being identified or configured. 
783 4-mm DDS-2 Tape Autoloader being configured. 
789 2.6 GB External Optical Drive being configured. 
78B POWER GXT4000P PCI Graphics Adapter. 
78D GXT300P 2D Graphics adapter being configured. 
790 Multi-bus Integrated Ethernet Adapter being identified or configured. 
797 TURBOWAYS� 155 UTP/STP ATM Adapter being identified or configured. 
798 Video streamer adapter being identified or configured. 
799 2-Port Multiprotocol PCI adapter being identified or configured. 
79c ISA bus configuration executing. 
7C0 CPU/System Interface being configured. 
7C1 Business Audio Subsystem being identified or configured. 
7cc PCMCIA bus configuration executing. 
800 TURBOWAYS� 155 MMF ATM Adapter being identified or configured. 
803 7336 Tape Library robotics being configured. 
804 8x Speed SCSI-2 CD-ROM Drive being configured. 
806 POWER GXT800 PCI Graphics adapter being configured. 
807 SCSI Device Enclosure being configured. 
80c SSA 4-Port Adapter being identified or configured. 
811 Processor complex being identified or configured. 
812 Memory being identified or configured. 
813 Battery for time-of-day, NVRAM, and so on being identified or configured, or system I/O control logic being identified or configured. 
814 NVRAM being identified or configured. 
815 Floating-point processor test. 
816 Operator panel logic being identified or configured. 
817 Time-of-day logic being identified or configured. 
819 Graphics input device adapter being identified or configured. 
821 Standard keyboard adapter being identified or configured. 
823 Standard mouse adapter being identified or configured. 
824 Standard tablet adapter being identified or configured. 
825 Standard speaker adapter being identified or configured. 
826 Serial Port 1 adapter being identified or configured. 
827 Parallel port adapter being identified or configured. 
828 Standard diskette adapter being identified or configured. 
831 3151 adapter being identified or configured, or Serial Port 2 being identified or configured. 
834 64-port async controller being identified or configured. 
835 16-port async concentrator being identified or configured. 
836 128-port async controller being identified or configured. 
837 16-port remote async node being identified or configured. 
838 Network Terminal Accelerator Adapter being identified or configured. 
839 7318 Serial Communications Server being configured. 
840 PCI Single-Ended Ultra SCSI Adapter being configured. 
841 8-port async adapter (EIA-232) being identified or configured. 
842 8-port async adapter (EIA-422A) being identified or configured. 
843 8-port async adapter (MIL-STD-188) being identified or configured. 
844 7135 RAIDiant Array disk drive subsystem controller being identified or configured. 
845 7135 RAIDiant Array disk drive subsystem drawer being identified or configured. 
846 RAIDiant Array SCSI 1.3 GB Disk Drive being configured. 
847 16-port serial adapter (EIA-232) being identified or configured. 
848 16-port serial adapter (EIA-422) being identified or configured. 
849 X.25 Interface Coprocessor/2 adapter being identified or configured. 
850 Token-Ring network adapter being identified or configured. 
851 T1/J1 Portmaster� adapter being identified or configured. 
852 Ethernet adapter being identified or configured. 
854 3270 Host Connection Program/6000 connection being identified or configured. 
855 Portmaster Adapter/A being identified or configured. 
857 FSLA adapter being identified or configured. 
858 5085/5086/5088 adapter being identified or configured. 
859 FDDI adapter being identified or configured. 
85c Token-Ring High-Performance LAN adapter being identified or configured. 
861 Optical adapter being identified or configured. 
862 Block Multiplexer Channel Adapter being identified or configured. 
865 ESCON Channel Adapter or emulator being identified or configured. 
866 SCSI adapter being identified or configured. 
867 Async expansion adapter being identified or configured. 
868 SCSI adapter being identified or configured. 
869 SCSI adapter being identified or configured. 
870 Serial disk drive adapter being identified or configured. 
871 Graphics subsystem adapter being identified or configured. 
872 Grayscale graphics adapter being identified or configured. 
874 Color graphics adapter being identified or configured. 
875 Vendor generic communication adapter being configured. 
876 8-bit color graphics processor being identified or configured. 
877 POWER Gt3�/POWER Gt4� being identified or configured. 
878 POWER Gt4� graphics processor card being configured. 
879 24-bit color graphics card, MEV2 being configured. 
880 POWER Gt1� adapter being identified or configured. 
887 Integrated Ethernet adapter being identified or configured. 
889 SCSI adapter being identified or configured. 
890 SCSI-2 Differential Fast/Wide and Single-Ended Fast/Wide Adapter/A being configured. 
891 Vendor SCSI adapter being identified or configured. 
892 Vendor display adapter being identified or configured. 
893 Vendor LAN adapter being identified or configured. 
894 Vendor async/communications adapter being identified or configured. 
895 Vendor IEEE 488 adapter being identified or configured. 
896 Vendor VME bus adapter being identified or configured. 
897 S/370� Channel Emulator adapter being identified or configured. 
898 POWER Gt1x� graphics adapter being identified or configured. 
899 3490 attached tape drive being identified or configured. 
89c A multimedia SCSI CD-ROM being identified or configured. 
900 GXT110P Graphics Adapter being identified or configured. 
901 Vendor SCSI device being identified or configured. 
902 Vendor display device being identified or configured. 
903 Vendor async device being identified or configured. 
904 Vendor parallel device being identified or configured. 
905 Vendor other device being identified or configured. 
908 POWER GXT1000 Graphics subsystem being identified or configured. 
910 1/4 GB Fiber Channel/266 Standard Adapter being identified or configured. 
911 Fiber Channel/1063 Adapter Short Wave being configured. 
912 2.0 GB SCSI-2 differential disk drive being identified or configured. 
913 1.0 GB differential disk drive being identified or configured. 
914 5 GB 8-mm differential tape drive being identified or configured. 
915 4 GB 4-mm tape drive being identified or configured. 
916 Non-SCSI vendor tape adapter being identified or configured. 
917 A 2.0 GB 16-bit differential SCSI disk drive being identified or configured. 
918 A 2.0 GB 16-bit single-ended SCSI disk drive being identified or configured. 
920 Bridge Box being identified or configured. 
921 101 keyboard being identified or configured. 
922 102 keyboard being identified or configured. 
923 Kanji keyboard being identified or configured. 
924 Two-button mouse being identified or configured. 
925 Three-button mouse being identified or configured. 
926 5083 tablet being identified or configured. 
927 5083 tablet being identified or configured. 
928 Standard speaker being identified or configured. 
929 Dials being identified or configured. 
930 Lighted program function keys (LPFK) being identified or configured. 
931 IP router being identified or configured. 
933 Async planar being identified or configured. 
934 Async expansion drawer being identified or configured. 
935 3.5-inch diskette drive being identified or configured. 
936 5.25-inch diskette drive being identified or configured. 
937 An HIPPI adapter being configured. 
938 Serial HIPPI PCI adapter being configured. 
942 POWER GXT 100 graphics adapter being identified or configured. 
943 A 3480 or 3490 control unit attached to a System/370 Channel Emulator/A adapter are being identified or configured. 
944 100 MB ATM adapter being identified or configured. 
945 1.0 GB SCSI differential disk drive being identified or configured. 
946 Serial port 3 adapter being identified or configured. 
947 A 730 MB SCSI disk drive being configured. 
948 Portable disk drive being identified or configured. 
949 Unknown direct bus-attach device being identified or configured. 
950 Missing SCSI device being identified or configured. 
951 670 MB SCSI disk drive being identified or configured. 
952 355 MB SCSI disk drive being identified or configured. 
953 320 MB SCSI disk drive being identified or configured. 
954 400 MB SCSI disk drive being identified or configured. 
955 857 MB SCSI disk drive being identified or configured. 
956 670 MB SCSI disk drive electronics card being identified or configured. 
957 120 MB DBA disk drive being identified or configured. 
958 160 MB DBA disk drive being identified or configured. 
959 160 MB SCSI disk drive being identified or configured. 
960 1.37 GB SCSI disk drive being identified or configured. 
964 Internal 20 GB 8-mm tape drive identified or configured. 
968 1.0 GB SCSI disk drive being identified or configured. 
970 Half-inch, 9-track tape drive being identified or configured. 
971 150 MB 1/4-inch tape drive being identified or configured. 
972 2.3 GB 8-mm SCSI tape drive being identified or configured. 
973 Other SCSI tape drive being identified or configured. 
974 CD-ROM drive being identified or configured. 
975 An optical disk drive being identified or configured. 
977 M-Audio Capture and Playback Adapter being identified or configured. 
981 540 MB SCSI-2 single-ended disk drive being identified or configured. 
984 1 GB 8-bit disk drive being identified or configured. 
985 M-Video Capture Adapter being identified or configured. 
986 2.4 GB SCSI disk drive being identified or configured. 
987 An Enhanced SCSI CD-ROM drive being identified or configured. 
989 200 MB SCSI disk drive being identified or configured. 
990 2.0 GB SCSI-2 single-ended disk drive being identified or configured. 
991 525 MB 1/4-inch cartridge tape drive being identified or configured. 
994 5 GB 8-mm tape drive being identified or configured. 
995 1.2GB 1/4-inch cartridge tape drive being identified or configured. 
996 A single-port, multiprotocol communications adapter being identified or configured. 
997 FDDI adapter being identified or configured. 
998 2.0 GB 4-mm tape drive being identified or configured. 
999 7137 or 3514 Disk Array Subsystem being configured. 
D46 Token-Ring cable. 
D81 T2 Ethernet Adapter being configured. 
2000 Dynamic LPAR CPU Addition 
2001 Dynamic LPAR CPU Removal 
2002 Dynamic LPAR Memory Addition 
2003 Dynamic LPAR Memory Removal 
2004 DLPAR Maximum Memory size too large 
2010 HTX miscompare 
2011 Configuring device model 2107 fcp 
2012 Configuring device model 2107 iscsi 
2013 Configuring MR-1750 (device model 1750) fcp 
2014 Configuring MR-1750 (device model 1750) iscsi 
2015 Configuring SVC (device model 2145) fcp 
2016 Configuring SVCCISCO (device model 2062) fcp 
2017 Configuring SVCCISCO (device model 2062) iscsi 
2018 Configuring Virtual Management Channel driver 
2019 Configuring vty server 
201b Configuring Virtual SCSI Optical 
2020 Configuring Infiniband ICM kernel component 
2021 Configuring TCP Infiniband Interface kernel component 
2502 Configuring PCI-X266 Planar 3 GB integrated SAS adapter 
2503 Configuring PCI-X266 Planar 3 GB integrated SAS RAID adapter 
2512 Configuring PCI-X DDR quad channel Ultra320 SCSI RAID adapter 
2513 Configuring PCI-X DDR quad channel Ultra320 SCSI RAID adapter 
2514 Configuring PCI-X DDR quad channel Ultra320 SCSI RAID adapter 
2520 PCI Dual-Channel Ultra-3 SCSI adapter being identified or configured. 
2522 PCI-X Dual Channel Ultra320 SCSI Adapter 
2523 PCI-X Ultra320 SCSI RAID Adapter 
2526 PCI-X Ultra320 SCSI RAID Battery Pack 
2527 PCI-X Quad Channel U320 SCSI RAID Adapter 
2528 PCI-X Dual Channel Ultra320 SCSI adapter 
2529 PCI-X Dual Channel Ultra320 SCSI RAID adapter 
252B PCI-X DDR Dual Channel Ultra320 SCSI RAID adapter 
252D PCI-X DDR Dual Channel Ultra320 SCSI RAID adapter 
252E PCI-X DDR Auxiliary Cache adapter 
2530 10/100 Mbps Ethernet PCI Adapter II being configured. 
2533 10 GB Ethernet -SR PCI-X 2.0 DDR adapter being configured 
2534 10 GB Ethernet -LR PCI-X 2.0 DDR adapter being configured 
2535 4-Port 10/100/1000 Base-TX Ethernet PCI-X Adapter being configured. 
2547 Generic 522 bites per sector SCSI JBOD (not osdisk) Disk Drive 
254E Fibre Channel Expansion Card 
2562 Keyboard/Mouse Attachment Card-PCI being configured. 
2564 Keyboard/Mouse Attachment Card-PCI being configured. 
2566 USB 3.5 inch Micro Diskette Drive 
2568 USB CD-ROM, Generic 
2571 2-Port PCI Asynchronous EIA-232 Adapter 
2581 1 GB iSCSI TOE PCI-X adapter is being configured (copper connector) 
2582 iSCSI protocol device associated with an iSCSI adapter is being configured 
2583 1 GB iSCSI TOE PCI-X adapter being configured (copper connector) 
2584 IDE DVD-RAM drive being configured 
2585 IDE DVD-ROM drive being configured 
2586  
2587 Slimline DVD-ROM drive 
2588 4.7 GB slimline DVD-RAM drive 
2590 IDE CD-ROM drive being configured 
2591 IDE DVD-ROM drive being configured. 
2592 IDE DVD-ROM drive being configured. 
2593 IDE DVD-RAM drive being configured. 
2594 4.7 GB IDE slimline DVD-RAM drive 
2595 IDE slimline DVD-ROM drive 
25A0 I/O Planar Control Logic for IDE devices 
25B9 Ethernet Adapter (Fiber) 
25C0 Gigabit Ethernet-SX PCI-X adapter 
25C1 10/100/1000 base-TX Ethernet PCI-X adapter 
25C2 Dual Port Gigabit SX Ethernet PCI-X Adapter 
25C3 10/100/1000 Base-TX Dual Port PCI-Adapter 
25C4 Broadcom Dual-Port Gpbs Ethernet PCI-X Adapter 
25D2 LSI SAS adapter 
2600 PCI 64-bit Fibre Channel Arbitrated Loop Adapter being configured. 
2601 PCI 64-bit Fibre Channel Arbitrated Loop Adapter being configured. 
2602 PCI 64-Bit 4 GB fibre channel adapter 
2611 36/72 GB 4 mm internal tape drive 
2612 80/160 GB internal tape drive with VXA2 technology 
2613 200/400 GB LTO2 Tape drive 
2614 VXA3 160/320 GB Tape Drive 
2615 Configuring DAT160 80 GB Tape drive 
2617 Configuring LTO3 400 GB Tape drive 
2621 PCI-X Dual-port 4x HCA Adapter being configured 
2631 Integrated IDE controller 
2640 IDE Disk Drive, 2.5 inch 
2641 73 GB SCSI disk drive 68 pin 10K rpm being identified or configured. 
2642 73 GB SCSI disk drive 80 pin 10K rpm with u3 carrier being identified or configured. 
2643 73 GB SCSI disk drive 80 pin 10K rpm with u3 carrier being identified or configured. (For OpenPower systems) 
2644 146 GB SCSI disk drive 68 pin 10K rpm being identified or configured. 
2645 146 GB SCSI disk drive 80 pin 10K rpm with u3 carrier being identified or configured. 
2646 146 GB SCSI disk drive 80 pin 10K rpm with u3 carrier being identified or configured. (For OpenPower systems) 
2647 300 GB SCSI disk drive 68 pin 10K rpm being identified or configured. 
2648 300 GB SCSI disk drive 80 pin 10K rpm with u3 carrier being identified or configured. 
2649 300 GB SCSI disk drive 80 pin 10K rpm with u3 carrier being identified or configured. (For OpenPower systems) 
264b 36 GB SCSI disk drive 80 pin 15K rpm with u3 carrier being identified or configured. 
264d 36 GB SCSI disk drive 80 pin 15K rpm with u3 carrier being identified or configured. (For OpenPower systems) 
264e 73 GB SCSI disk drive 80 pin 15K rpm with u3 carrier being identified or configured. 
2650 ESS iSCSI devices being identified or configured. 
2651 SVC being identified or configured. 
2652 SVCCISCOi being identified or configured. 
2653 73 GB SCSI disk drive 80 pin 15K rpm with u3 carrier being identified or configured. (For OpenPower systems) 
2654 146 GB SCSI disk drive 80 pin 15K rpm with u3 carrier being identified or configured. 
2655 146 GB SCSI disk drive 80 pin 15K rpm with u3 carrier being identified or configured. (For OpenPower systems) 
2656 73 GB SCSI disk drive 80 pin 15K rpm being identified or configured. 
2657 146 GB SCSI disk drive 80 pin 15K rpm being identified or configured. 
2658 73 GB SCSI disk drive 80 pin 10K rpm being identified or configured. 
2659 146 GB SCSI disk drive 80 pin 10K rpm being identified or configured. 
265b 300 GB SCSI disk drive 80 pin 10K rpm being identified or configured. 
2D01 PCI-X Quad Channel U320 SCSI RAID battery pack 
2D05 PCI-X266 Planar 3 GB SAS RAID adapter battery pack 
2D07 PCI-X DDR Auxiliary Cache adapter 


-- AIX diagnostics load-progress indicators 

Note:
Some systems might produce 4-digit codes. If the leftmost digit of a 4-digit code is 0, 
use the three rightmost digits.

Progress code Description/Action
--------------------------------
 
c00 AIX Install/Maintenance loaded successfully. 
c01 Insert the first diagnostic diskette. 
c02 Diskettes inserted out of sequence. 
c03 The wrong diskette is in diskette drive. 
c04 The loading stopped with an irrecoverable error. 
c05 A diskette error occurred. 
c06 The rc.boot configuration shell script is unable to determine type of boot. 
c07 Insert the next diagnostic diskette. 
c08 RAM file system started incorrectly. 
c09 The diskette drive is reading or writing a diskette. 
c20 An unexpected halt occurred, and the system is configured to enter the kernel debug program instead of entering a system dump. 
c21 The ifconfig command was unable to configure the network for the client network host. 
c22 The tftp command was unable to read client's ClientHostName. info file during a client network boot. 
c24 Unable to read client's ClientHostName.info file during a client network boot. 
c25 Client did not mount remote miniroot during network install. 
c26 Client did not mount the /usr file system during the network boot. 
c29 The system was unable to configure the network device. 
c31 Select the console display for the diagnostics. To select No console display, set the key mode switch to Normal, then to Service. The diagnostic programs then load and run the diagnostics automatically. If you continue to get the message, check the cables and make sure you are using the serial port. 
c32 A directly attached display (HFT) was selected. 
c33 A TTY terminal attached to serial ports S1 or S2 was selected. 
c34 A file was selected. The console messages store in a file. 
c35 No console found. 
c40 Configuration files are being restored. 
c41 Could not determine the boot type or device. 
c42 Extracting data files from diskette. 
c43 Cannot access the boot/install tape. 
c44 Initializing installation database with target disk information. 
c45 Cannot configure the console. 
c46 Normal installation processing. 
c47 Could not create a physical volume identifier (PVID) on disk. 
c48 Prompting you for input. 
c49 Could not create or form the JFS log. 
c50 Creating root volume group on target disks. 
c51 No paging devices were found. 
c52 Changing from RAM environment to disk environment. 
c53 Not enough space in the /tmp directory to do a preservation installation. 
c54 Installing either BOS or additional packages. 
c55 Could not remove the specified logical volume in a preservation installation. 
c56 Running user-defined customization. 
c57 Failure to restore BOS. 
c58 Displaying message to turn the key. 
c59 Could not copy either device special files, device ODM, or volume group information from RAM to disk. 
c61 Failed to create the boot image. 
c62 Loading platform dependent debug files. 
c63 Loading platform dependent data files. 
c64 Failed to load platform dependent data files. 
c70 Problem Mounting diagnostic CD-ROM disc. 
c99 Diagnostics have completed. This code is only used when there is no console. 
Fxx (xx is any number) Refer to Firmware chapter of the service manual. 


-- Dump progress indicators (dump status codes)

The following dump progress indicators, or dump status codes, are part of a Type 102 message. 

Note:

When a lowercase c is listed, it displays in the lower half of the character position. 
Some systems produce 4-digit codes, the two leftmost positions can have blanks or zeros. Use the two rightmost digits.

Progress code Description/Action 
--------------------------------
0c0 The dump completed successfully. 
0c1 The dump failed due to an I/O error. 
0c2 A dump, requested by the user, is started. 
0c3 The dump is inhibited. 
0c4 The dump device is not large enough. 
0c5 The dump did not start, or the dump crashed. 
0c6 Dumping to a secondary dump device. 
0c7 Reserved. 
0c8 The dump function is disabled. 
0c9 A dump is in progress. 
0cc Unknown dump failure. 


-- Crash codes

Note:
Some systems may produce 4-digit codes. If the leftmost digit of a 4-digit code is 0, use the three rightmost digits.
The crash codes that follow are part of a Type 102 message. These crash codes are grouped into three categories: 

Category 1 
Dump analysis is the appropriate first action in Problem Determination. Begin the Problem Determination process with software support. 
Category 2 
Dump analysis most likely will not aid in Problem Determination. Begin the Problem Determination process with hardware support. 
Category 3 
Both software and hardware support may be needed in Problem Determination, go to 888 sequence in operator panel display to assist in problem isolation. 
Category 1 crash progress code

Progress code Description/Action 
300 Data storage interrupt from the processor. 
32x Data storage interrupt because of an I/O exception from IOCC. 
38x Data storage interrupt because of an I/O exception from SLA. 
400 Instruction storage interrupt. 
700 Program interrupt. 

Category 2 crash progress code

Progress code Description/Action 
200 Machine check because of a memory bus error. 
201 Machine check because of a memory timeout. 
202 Machine check because of a memory card failure. 
203 Machine check because of an out of range address. 
204 Machine check because of an attempt to write to ROS. 
205 Machine check because of an uncorrectable address parity. 
206 Machine check because of an uncorrectable ECC error. 
207 Machine check because of an unidentified error. 
208 Machine check due to an L2 uncorrectable ECC. 
500 External interrupt because of a scrub memory bus error. 
501 External interrupt because of an unidentified error. 
51x External interrupt because of a DMA memory bus error. 
52x External interrupt because of an IOCC channel check. 
53x External interrupt from an IOCC bus timeout; x represents the IOCC number. 
54x External interrupt because of an IOCC keyboard check. 
800 Floating point is not available. 

Category 3 crash progress code

Progress code Description/Action 
000 Unexpected system interrupt. 
558 There is not enough memory to continue the IPL. 
600 AIX 4.3.3.3 and above: Alignment Interrupt. If pre-AIX 4.3.3.3: AIX has crashed because the Portability Assist Layer (PAL) for this machine type has detected a problem. 
605 AIX 4.3.3.3 and above: AIX has crashed because the Portability Assist Layer (PAL) for this machine type has detected a problem. 


>>>>> PART 2: POWER AND RS IPL CODES <<<<<
==========================================


MCA LED codes:
--------------

Booting BIST phase: leds 100-195, defining hardware status
Booting POST phase: leds 200-2E7, during finding BLV
LED 200: key in secure position
LED 299: BLV will be loaded

PCI systems an pSeries LED codes:
---------------------------------

reduced ODM from BLV copied into RAMFS: OK=510, NOT OK=LED 548: 
LED 511: bootinfo -b is called to determine the last bootdevice
ipl_varyon of rootvg: OK=517,ELSE 551,552,554,556: 
LED 555,557: mount /dev/hd4 on temporary mountpoint /mnt
LED 518: mount /usr, /var
LED 553: syncvg rootvg, or inittab problem
LED 549
LED 581: tcp/ip is being configured, and there is some problem

Last phases in the boot is where cfgcon is called, to configure the console.
cfgcon LED codes include:
C31: Console not yet configured.
C32: Console is an LFT terminal
C33: Console is a TTY
C34: Console is a file on disk
C99: Could not detect a console device

LED 551: ipl_varyon of rootvg

201           : Damaged boot image
223-229       : Invalid boot list
551,555,557   : Corrupted filesystem, corrupted JFS log
552,554,556   : Superblock corrupted, corrupted customized ODM database
553           : Corrupted /etc/inittab file

Firmware that leads to LED code:
--------------------------------

LED Code 888 right after boot: software problem 102, OR, hardware or software problem 103

rc.boot LED codes:
------------------

rc.boot1

  init          success=F05 error=c06                             
  restbase      copies bootimage ODM -> RAM fs ODM:  success=510  error=548
  cfgmgr -f     configuration all base devices needed to access rootvg
  bootinfo -b

end rc.boot 1   LED=511


Built-In Self-Test (BIST) Indicators
------------------------------------

100 BIST completed successfully; control was passed to IPL ROS.
101 BIST started following reset.
102 BIST started, following the system unit's power-on reset.
103 BIST could not determine the system model number.
104 Equipment conflict; BIST could not find the CBA.
105 BIST could not read from the OCS EPROM.
106 BIST failed: CBA not found
111 OCS stopped; BIST detected a module error.
112 A checkstop occurred during BIST; checkstop results could not be logged out.
113 Three checkstops have occurred.
120 BIST starting a CRC check on the 8752 EPROM.
121 BIST detected a bad CRC in the first 32K bytes of the OCS EPROM.
122 BIST started a CRC check on the first 32K bytes of the OCS EPROM.
123 BIST detected a bad CRC on the OCS area of NVRAM.
124 BIST started a CRC check on the OCS area of NVRAM.
125 BIST detected a bad CRC on the time-of-day area of NVRAM.
126 BIST started a CRC check on the time-of-day area of NVRAM.
127 BIST detected a bad CRC on the 8752 EPROM.
130 BIST presence test started.
140 Running BIST. (Box Manufacturing Mode Only)
142 Box manufacturing mode operation.
143 Invalid memory configuration.
144 Manufacturing test failure.
151 BIST started AIPGM test code.
152 BIST started DCLST test code.
153 BIST started ACLST test code.
154 BIST started AST test code.
160 Bad EPOW Signal/Power status signal.
161 BIST being conducted on BUMP I/O.
162 BIST being conducted on JTAG.
163 BIST being conducted on Direct I/O.
164 BIST being conducted on CPU.
165 BIST being conducted on DCB and Memory.
166 BIST being conducted on Interrupts.
170 BIST being conducted on Multi-Processors.
180 Logout in progress.
182 BIST COP bus not responding.
185 A checkstop condition occurred during the BIST.
186 System logic-generated checkstop (Model 250 only).
187 Graphics-generated checkstop (Model 250).
195 Checkstop logout complete
199 Generic SCSI backplane
888 BIST did not start.

Power-On Self-Test (POST) Indicators
------------------------------------ 

200 IPL attempted with keylock in the Secure position.
201 IPL ROM test failed or checkstop occurred (irrecoverable).
202 Unexpected machine check interrupt.
203 Unexpected data storage interrupt.
204 Unexpected instruction storage interrupt.
205 Unexpected external interrupt.
206 Unexpected alignment interrupt.
207 Unexpected program interrupt.
208 Unexpected floating point unavailable interrupt.
209 Unexpected SVC interrupt.
20c L2 cache POST error. (The display shows a solid 20c for 5 seconds.)
210 Unexpected SVC interrupt.
211 IPL ROM CRC comparison error (irrecoverable).
212 RAM POST memory configuration error or no memory found (irrecoverable).
213 RAM POST failure (irrecoverable).
214 Power status register failed (irrecoverable).
215 A low voltage condition is present (irrecoverable).
216 IPL ROM code being uncompressed into memory.
217 End of boot list encountered.
218 RAM POST is looking for good memory.
219 RAM POST bit map is being generated.
21c L2 cache is not detected. (The display shows a solid 21c for 2 seconds.)
220 IPL control block is being initialized.
221 NVRAM CRC comparison error during AIX IPL(key mode switch in Normal mode).
Reset NVRAM by reaccomplishing IPL in Service mode. For systems with an
internal, direct-bus-attached (DBA) disk, IPL ROM attempted to perform an IPL from
that disk before halting with this operator panel display value.
222 Attempting a Normal mode IPL from Standard I/O planar-attached devices specified 
in NVRAM IPL Devices List.
223 Attempting a Normal mode IPL from SCSI-attached devices specified in NVRAM IPL 
Devices List.
224 Attempting a Normal mode IPL from 9333 subsystem device specified in NVRAM IPL 
Devices List.
225 Attempting a Normal mode IPL from 7012 DBA disk-attached devices specified in 
NVRAM IPL Devices List.
226 Attempting a Normal mode IPL from Ethernet specified in NVRAM IPL Devices List.
227 Attempting a Normal mode IPL from Token-Ring specified in NVRAM IPL Devices List.
228 Attempting a Normal mode IPL from NVRAM expansion code.
229 Attempting a Normal mode IPL from NVRAM IPL Devices List; cannot IPL from any
of the listed devices, or there are no valid entries in the Devices List.
22c Attempting a normal mode IPL from FDDI specified in NVRAM IPL device list.
230 Attempting a Normal mode IPL from adapter feature ROM specified in IPL ROM
Device List.
231 Attempting a Normal mode IPL from Ethernet specified in IPL ROM Device List.
232 Attempting a Normal mode IPL from Standard I/O planar-attached devices specified
in ROM Default Device List.
233 Attempting a Normal mode IPL from SCSI-attached devices specified in IPL ROM
Default Device List.
234 Attempting a Normal mode IPL from 9333 subsystem device specified in IPL ROM
Device List.
235 Attempting a Normal mode IPL from 7012 DBA disk-attached devices specified in
IPL ROM Default Device List.
236 Attempting a Normal mode IPL from Ethernet specified in IPL ROM Default Device
List.
237 Attempting a Normal mode IPL from Token-Ring specified in IPL ROM Default
Device List.
238 Attempting a Normal mode IPL from Token-Ring specified by the operator.
239 System failed to IPL from the device chosen by the operator.
23c Attempting a normal mode IPL from FDDI specified in IPL ROM device list.
240 Attempting a Service mode IPL from adapter feature ROM.
241 Attempting a normal boot from devices specified in the NVRAM boot list.
242 Attempting a Service mode IPL from Standard I/O planar-attached devices specified
in the NVRAM IPL Devices List.
243 Attempting a Service mode IPL from SCSI-attached devices specified in the
NVRAM IPL Devices List.
244 Attempting a Service mode IPL from 9333 subsystem device specified in the
NVRAM IPL Devices List.
245 Attempting a Service mode IPL from 7012 DBA disk-attached devices specified in
the NVRAM IPL Devices List.
246 Attempting a Service mode IPL from Ethernet specified in the NVRAM IPL Devices
List.
247 Attempting a Service mode IPL from Token-Ring specified in the NVRAM Device
List.
248 Attempting a Service mode IPL from NVRAM expansion code.
249 Attempting a Service mode IPL from the NVRAM IPL Devices List; cannot IPL from
any of the listed devices, or there are no valid entries in the Devices List.
24c Attempting a service mode IPL from FDDI specified in NVRAM IPL device list.
250 Attempting a Service mode IPL from adapter feature ROM specified in the IPL ROM
Device List.
251 Attempting a Service mode IPL from Ethernet specified in the IPL ROM Default
Device List.
252 Attempting a Service mode IPL from Standard I/O planar-attached devices specified
in the ROM Default Device List.
253 Attempting a Service mode IPL from SCSI-attached devices specified in the IPL
ROM Default Device List.
254 Attempting a Service mode IPL from 9333 subsystem device specified in the IPL
ROM Devices List.
255 Attempting a Service mode IPL from 7012 DBA disk-attached devices specified in
IPL ROM Default Device List.
256 Attempting a Service mode IPL from Ethernet specified in the IPL ROM Devices
List.
257 Attempting a Service mode IPL from Token-Ring specified in the IPL ROM Devices
List.
258 Attempting a Service mode IPL from Token-Ring specified by the operator.
259 Attempting a Service mode IPL from FDDI specified by the operator.
25c Attempting a service mode IPL from FDDI specified in IPL ROM device list.
260 Information is being displayed on the display console.
261 No supported local system display adapter was found.
262 Keyboard not detected as being connected to the system's keyboard port.
263 Attempting a Normal mode IPL from adapter feature ROM specified in the NVRAM
Device List.
269 Stalled state - the system is unable to IPL.
270 Low Cost Ethernet Adapter (LCE) POST executing
271 Mouse and Mouse port POST.
272 Tablet Port POST.
276 10/100Mbps MCA Ethernet Adapter POST executing
277 Auto Token-Ring LANstreamer MC 32 Adapter.
278 Video ROM scan POST.
279 FDDI POST.
280 3com Ethernet POST.
281 Keyboard POST executing.
282 Parallel port POST executing.
283 Serial port POST executing.
284 POWER Gt1 graphics adapter POST executing.
285 POWER Gt3 graphics adapter POST executing.
286 Token-Ring adapter POST executing.
287 Ethernet adapter POST executing.
288 Adapter card slots being queried.
289 POWER GT0 Display Adapter POST.
290 IOCC POST error (irrecoverable).
291 Standard I/O POST running.
292 SCSI POST running.
293 7012 DBA disk POST running.
294 IOCC bad TCW memory module in slot location J being tested.
295 Graphics Display adapter POST, color or grayscale.
296 ROM scan POST.
297 System model number does not compare between OCS and ROS (irrecoverable).
298 Attempting a software IPL.
299 IPL ROM passed control to the loaded program code.
301 Flash Utility ROM test failed or checkstop occurred (irrecoverable
302 Flash Utility ROM: User prompt, move the key to the service position in order to
perform an optional Flash Update. LED 3d2 will only appear if the key switch is in
the secure position. This signals the user that a Flash Update may be initiated by
moving the key switch to the service position. If the key is moved to the service
position then LED 3d3 will be displayed, this signals the user to press the Reset
button and select optional Flash Update.
303 Flash Utility ROM: User prompt, press the Reset button in order to perform an
optional Flash Update. LED 3d2 will only appear if the key switch is the secure
position. This signals the user that a Flash Update may be initiated by moving the
key switch to the service position. If the key is moved to the service position LED
3d3 will be displayed, this signals the user to press the Reset button and select
optional Flash Update.
304 Flash Utility ROM IOCC POST error (irrecoverable).
305 Flash Utility ROM standard I/O POST running.
306 Flash Utility ROM is attempting IPL from Flash Update media device.
307 Flash Utility ROM system model number does not compare between OCS and
ROM (irrecoverable).
308 Flash Utility ROM: IOCC TCW memory is being tested.
309 Flash Utility ROM passed control to a Flash Update Boot Image.
311 Flash Utility ROM CRC comparison error (irrecoverable).
312 Flash Utility ROM RAM POST memory configuration error or no memory found
(irrecoverable).
313 Flash Utility ROM RAM POST failure (irrecoverable).
314 Flash Utility ROM Power status register failed (irrecoverable).
315 Flash Utility ROM detected a low voltage condition.
318 Flash Utility ROM RAM POST is looking for good memory.
319 Flash Utility ROM RAM POST bit map is being generated.
322 CRC error on media Flash Image. No Flash Update performed.
323 Current Flash Image is being erased.
324 CRC error on new Flash Image after Update was performed. (Flash Image is cor-rupted.)
325 Flash Update successful and complete.

Configuration Program Indicators
--------------------------------

500 Querying Standard I/O slot.
501 Querying card in Slot 1.
502 Querying card in Slot 2.
503 Querying card in Slot 3.
504 Querying card in Slot 4.
505 Querying card in Slot 5.
506 Querying card in Slot 6.
507 Querying card in Slot 7.
508 Querying card in Slot 8.
510 Starting device configuration.
511 Device configuration completed.
512 Restoring device configuration files from media.
513 Restoring basic operating system installation files from media.
516 Contacting server during network boot.
517 Mounting client remote file system during network IPL.
518 Remote mount of the root and /usr file systems failed during network boot.
520 Bus configuration running.
521 /etc/init invoked cfgmgr with invalid options; /etc/init has been corrupted or incor-rectly 
modified (irrecoverable error).
522 The configuration manager has been invoked with conflicting options (irrecoverable
error).
523 The configuration manager is unable to access the ODM database (irrecoverable
error).
524 The configuration manager is unable to access the config.rules object in the ODM
database (irrecoverable error).
525 The configuration manager is unable to get data from a customized device object in
the ODM database (irrecoverable error).
526 The configuration manager is unable to get data from a customized device driver
object in the ODM database ( irrecoverable error).
527 The configuration manager was invoked with the phase 1 flag; running phase 1 at
this point is not permitted (irrecoverable error).
528 The configuration manager cannot find sequence rule, or no program name was
specified in the ODM database (irrecoverable error).
529 The configuration manager is unable to update ODM data (irrecoverable error).
530 The program savebase returned an error.
531 The configuration manager is unable to access the PdAt object class (irrecoverable
error).
532 There is not enough memory to continue (malloc failure); irrecoverable error.
533 The configuration manager could not find a configure method for a device.
534 The configuration manager is unable to acquire database lock (irrecoverable error).
535 HIPPI diagnostics interface driver being configured.
536 The configuration manager encountered more than one sequence rule specified in
the same phase (irrecoverable error).
537 The configuration manager encountered an error when invoking the program in the
sequence rule.
538 The configuration manager is going to invoke a configuration method.
539 The configuration method has terminated, and control has returned to the configura-tion 
manager.
551 IPL vary-on is running.
552 IPL varyon failed.
553 IPL phase 1 is complete.
554 The boot device could not be opened or read, or unable to define NFS swap device
during network boot.
555 An ODM error occurred when trying to varyon the rootvg, or unable to create an
NFS swap device during network boot.
556 Logical Volume Manager encountered error during IPL vary-on.
557 The root filesystem will not mount.
558 There is not enough memory to continue the system IPL.
559 Less than 2 M bytes of good memory are available to load the AIX kernel.
570 Virtual SCSI devices being configured.
571 HIPPI common function device driver being configured.
572 HIPPI IPI-3 master transport driver being configured.
573 HIPPI IPI-3 slave transport driver being configured.
574 HIPPI IPI-3 transport services user interface device driver being configured.
575 A 9570 disk-array driver is being configured.
576 Generic async device driver being configured.
577 Generic SCSI device driver being configured.
578 Generic commo device driver being configured.
579 Device driver being configured for a generic device.
580 HIPPI TCPIP network interface driver being configured.
581 Configuring TCP/IP.
582 Configuring Token-Ring data link control.
583 Configuring an Ethernet data link control.
584 Configuring an IEEE Ethernet data link control.
585 Configuring an SDLC MPQP data link control.
586 Configuring a QLLC X.25 data link control.
587 Configuring a NETBIOS.
588 Configuring a Bisync Read-Write (BSCRW).
589 SCSI target mode device being configured.
590 Diskless remote paging device being configured.
591 Configuring an LVM device driver.
592 Configuring an HFT device driver.
593 Configuring SNA device drivers.
594 Asynchronous I/O being defined or configured.
595 X.31 pseudo-device being configured.
596 SNA DLC/LAPE pseudo-device being configured.
597 OCS software being configured.
598 OCS hosts being configured during system reboot.
599 Configuring FDDI data link control.
5c0 Streams-based hardware drive being configured.
5c1 Streams-based X.25 protocol being configured.
5c2 Streams-based X.25 COMIO emulator driver being configured.
5c3 Streams-based X.25 TCP/IP interface driver being configured.
5c4 FCS adapter device driver being configured.
5c5 SCB network device driver for FCS is being configured.
5c6 AIX SNA channel being configured.
600 Starting network boot portion of /sbin/rc.boot
602 Configuring network parent devices.
603 /usr/lib/methods/defsys, /usr/lib/methods/cfgsys, or /usr/lib/methods/cfgbus
failed.
604 Configuring physical network boot device.
605 Configuration of physical network boot device failed.
606 Running /usr/sbin/ifconfig on logical network boot device.
607 /usr/sbin/ifconfig failed.
608 Attempting to retrieve the client.info file with tftp.Note that a flashing 608 indicates
multiple attempt(s) to retrieve the client_info file are occurring.
609 The client.info file does not exist or it is zero length.
610 Attempting remote mount of NFS file system.
611 Remote mount of the NFS file system failed.
612 Accessing remote files; unconfiguring network boot device.
614 Configuring local paging devices.
615 Configuration of a local paging device failed.
616 Converting from diskless to dataless configuration.
617 Diskless to dataless configuration failed.
618 Configuring remote (NFS) paging devices.
619 Configuration of a remote (NFS) paging device failed.
620 Updating special device files and ODM in permanent filesystem with data from boot
RAM filesystem.
622 Boot process configuring for operating system installation.
650 IBM SCSD disk drive being configured
668 25MB ATM MCA Adapter being configured
680 POWER GXT800M Graphics Adapter
689 4.5GB Ultra SCSI Single Ended Disk Drive being configured
690 9.1GB Ultra SCSI Single Ended Disk Drive being configured
694 Eicon ISDN DIVA MCA Adapter for PowerPC Systems
700 Progress indicator. A 1.1 GB 8-bit SCSI disk drive being identified or configured.
701 Progress indicator. A 1.1 GB 16-bit SCSI disk drive is being identified or configured.
702 Progress indicator. A 1.1 GB 16-bit differential SCSI disk drive is being identified or
configured.
703 Progress indicator. A 2.2 GB 8-bit SCSI disk drive is being identified or configured.
704 Progress indicator. A 2.2 GB 16-bit SCSI disk drive is being identified or configured.
705 The configuration method for the 2.2 GB 16-bit differential SCSI disk drive is being
run. If an irrecoverable error occurs, the system halts.
706 Progress indicator. A 4.5 GB 16-bit SCSI disk drive is being identified or configured.
707 Progress indicator. A 4.5 GB 16-bit differential SCSI disk drive is being identified or
configured.
708 Progress indicator. A L2 cache is being identified or configured.
710 POWER GXT150M graphics adapter being identified or configured.
711 Unknown adapter being identified or configured.
712 Graphics slot bus configuration is executing.
713 The IBM ARTIC960 device is being configured.
714 A video capture adapter is being configured.
715 The Ultimedia Services audio adapter is being configured. This LED displays briefly
on the panel.
717 TP Ethernet Adapter being configured.
718 GXT500 Graphics Adapter being configured.
720 Unknown read/write optical drive type being configured.
721 Unknown disk or SCSI device being identified or configured.
722 Unknown disk being identified or configured.
723 Unknown CD-ROM being identified or configured.
724 Unknown tape drive being identified or configured.
725 Unknown display adapter being identified or configured.
726 Unknown input device being identified or configured.
727 Unknown async device being identified or configured.
728 Parallel printer being identified or configured.
729 Unknown parallel device being identified or configured.
730 Unknown diskette drive being identified or configured.
731 PTY being identified or configured.
732 Unknown SCSI initiator type being configured.
733 7GB 8mm tape drive being configured.
734 4x SCSI-2 640MB CD-ROM Drive
741 1080MB SCSI Disk Drive
745 16GB 4mm Tape Auto Loader
748 MCA keyboard/mouse adapter being configured.
749 7331 Model 205 Tape Library
754 1.1GB 16-bit SCSI disk drive being configured.
755 2.2GB 16-bit SCSI disk drive being configured.
756 4.5GB 16-bit SCSI disk drive being configured.
757 External 13GB 1.5M/s 1/4 inch tape being configured.
772 4.5GB SCSI F/W Disk Drive
773 9.1GB SCSI F/W Disk Drive
774 9.1GB External SCSI Disk Drive
77c Progress indicator. A 1.0 GB 16-bit SCSI disk drive being identified or configured.
783 4mm DDS-2 Tape Autoloader
789 2.6GB External Optical Drive
794 10/100MB Ethernet PX MC Adapter
797 Turboways 155 UTP/STP ATM Adapter being identified or configured.
798 Video streamer adapter being identified or configured.
800 Turboways 155 MMF ATM Adapter being identified or configured.
803 7336 Tape Library Robotics being configured
804 8x Speed SCSI-2 CD ROM drive being configured
807 SCSI Device Enclosure being configured
808 System Interface Full (SIF) configuration process
80c SSA 4-Port Adapter being identified or configured.
811 Processor complex being identified or configured.
812 Memory being identified or configured.
813 Battery for time-of-day, NVRAM, and so on being identified or configured, or system
I/O control logic being identified or configured.
814 NVRAM being identified or configured.
815 Floating-point processor test
816 Operator panel logic being identified or configured.
817 Time-of-day logic being identified or configured.
819 Graphics input device adapter being identified or configured.
821 Standard keyboard adapter being identified or configured.
823 Standard mouse adapter being identified or configured.
824 Standard tablet adapter being identified or configured.
825 Standard speaker adapter being identified or configured.
826 Serial Port 1 adapter being identified or configured.
827 Parallel port adapter being identified or configured.
828 Standard diskette adapter being identified or configured.
831 3151 adapter being identified or configured, or Serial Port 2 being identified or con-figured.
834 64-port async controller being identified or configured.
835 16-port async concentrator being identified or configured.
836 128-port async controller being identified or configured.
837 16-port remote async node being identified or configured.
838 Network Terminal Accelerator Adapter being identified or configured.
839 7318 Serial Communications Server being configured.
841 8-port async adapter (EIA-232) being identified or configured.
842 8-port async adapter (EIA-422A) being identified or configured.
843 8-port async adapter (MIL-STD 188) being identified or configured.
844 7135 RAIDiant Array disk drive subsystem controller being identified or configured.
845 7135 RAIDiant Array disk drive subsystem drawer being identified or configured.
846 RAIDiant Array SCSI 1.3GB Disk Drive
847 16-port serial adapter (EIA-232) being identified or configured.
848 16-port serial adapter (EIA-422) being identified or configured.
849 X.25 Interface Co-Processor/2 adapter being identified or configured.
850 Token-Ring network adapter being identified or configured.
851 T1/J1 Portmaster adapter being identified or configured.
852 Ethernet adapter being identified or configured.
854 3270 Host Connection Program/6000 connection being identified or configured.
855 Portmaster Adapter/A being identified or configured.
857 FSLA adapter being identified or configured.
858 5085/5086/5088 adapter being identified or configured.
859 FDDI adapter being identified or configured.
85c Progress indicator. Token-Ring High-Performance LAN adapter is being identified or
configured.
861 Optical adapter being identified or configured.
862 Block Multiplexer Channel Adapter being identified or configured.
865 ESCON Channel Adapter or emulator being identified or configured.
866 SCSI adapter being identified or configured.
867 Async expansion adapter being identified or configured.
868 SCSI adapter being identified or configured.
869 SCSI adapter being identified or configured.
870 Serial disk drive adapter being identified or configured.
871 Graphics subsystem adapter being identified or configured.
872 Grayscale graphics adapter being identified or configured.
874 Color graphics adapter being identified or configured.
875 Vendor generic communication adapter being configured.
876 8-bit color graphics processor being identified or configured.
877 POWER Gt3/POWER Gt4 being identified or configured.
878 POWER Gt4 graphics processor card being configured.
879 24-bit color graphics card, MEV2
880 POWER Gt1 adapter being identified or configured.
887 Integrated Ethernet adapter being identified or configured.
889 SCSI adapter being identified or configured.
890 SCSI-2 Differential Fast/Wide and Single-Ended Fast/Wide Adapter/A.
891 Vendor SCSI adapter being identified or configured.
892 Vendor display adapter being identified or configured.
893 Vendor LAN adapter being identified or configured.
894 Vendor async/communications adapter being identified or configured.
895 Vendor IEEE 488 adapter being identified or configured.
896 Vendor VME bus adapter being identified or configured.
897 S/370 Channel Emulator adapter being identified or configured.
898 POWER Gt1x graphics adapter being identified or configured.
899 3490 attached tape drive being identified or configured.
89c Progress indicator. A multimedia SCSI CD-ROM is being identified or configured.
901 Vendor SCSI device being identified or configured.
902 Vendor display device being identified or configured.
903 Vendor async device being identified or configured.
904 Vendor parallel device being identified or configured.
905 Vendor other device being identified or configured.
908 POWER GXT1000 Graphics subsystem being identified or configured.
910 1/4GB Fibre Channel/266 Standard Adapter being identified or configured.
911 Fibre Channel/1063 Adapter Short Wave
912 2.0GB SCSI-2 differential disk drive being identified or configured.
913 1.0GB differential disk drive being identified or configured.
914 5GB 8 mm differential tape drive being identified or configured.
915 4GB 4 mm tape drive being identified or configured.
916 Non-SCSI vendor tape adapter being identified or configured.
917 Progress indicator. 2.0GB 16-bit differential SCSI disk drive is being identified or
configured.
918 Progress indicator. 2GB 16-bit single-ended SCSI disk drive is being identified or
configured.
920 Bridge Box being identified or configured.
921 101 keyboard being identified or configured.
922 102 keyboard being identified or configured.
923 Kanji keyboard being identified or configured.
924 Two-button mouse being identified or configured.
925 Three-button mouse being identified or configured.
926 5083 tablet being identified or configured.
927 5083 tablet being identified or configured.
928 Standard speaker being identified or configured.
929 Dials being identified or configured.
930 Lighted program function keys (LPFK) being identified or configured.
931 IP router being identified or configured.
933 Async planar being identified or configured.
934 Async expansion drawer being identified or configured.
935 3.5-inch diskette drive being identified or configured.
936 5.25-inch diskette drive being identified or configured.
937 An HIPPI adapter is being configured.
942 POWER GXT 100 graphics adapter being identified or configured.
943 Progress indicator. 3480 and 3490 control units attached to a System/370 Channel
Emulator/A adapter are being identified or configured.
944 100MB ATM adapter being identified or configured
945 1.0GB SCSI differential disk drive being identified or configured.
946 Serial port 3 adapter is being identified or configured.
947 Progress indicator. A 730MB SCSI disk drive is being configured.
948 Portable disk drive being identified or configured.
949 Unknown direct bus-attach device being identified or configured.
950 Missing SCSI device being identified or configured.
951 670MB SCSI disk drive being identified or configured.
952 355MB SCSI disk drive being identified or configured.
953 320MB SCSI disk drive being identified or configured.
954 400MB SCSI disk drive being identified or configured.
955 857MB SCSI disk drive being identified or configured.
956 670MB SCSI disk drive electronics card being identified or configured.
957 120MB DBA disk drive being identified or configured.
958 160 MB DBA disk drive being identified or configured.
959 160MB SCSI disk drive being identified or configured.
960 1.37GB SCSI disk drive being identified or configured.
964 Internal 20GB 8mm tape drive identified or configured.
968 1.0GB SCSI disk drive being identified or configured.
970 Half-inch, 9-track tape drive being identified or configured.
971 150MB 1/4-inch tape drive being identified or configured.
972 2.3GB 8 mm SCSI tape drive being identified or configured.
973 Other SCSI tape drive being identified or configured.
974 CD-ROM drive being identified or configured.
975 Progress indicator. An optical disk drive is being identified or configured.
977 M-Audio Capture and Playback Adapter being identified or configured.
981 540MB SCSI-2 single-ended disk drive being identified or configured.
984 1GB 8-bit disk drive being identified or configured.
985 M-Video Capture Adapter being identified or configured.
986 2.4GB SCSI disk drive being identified or configured.
987 Progress indicator. Enhanced SCSI CD-ROM drive is being identified or configured.
989 200MB SCSI disk drive being identified or configured.
990 2.0GB SCSI-2 single-ended disk drive being identified or configured.
991 525MB 1/4-inch cartridge tape drive being identified or configured.
994 5GB 8 mm tape drive being identified or configured.
995 1.2GB 1/4 inch cartridge tape drive being identified or configured.
996 Progress indicator. Single-port, multi-protocol communications adapter is being
identified or configured.
997 FDDI adapter being identified or configured.
998 2.0GB4 mm tape drive being identified or configured.
999 7137 or 3514 Disk Array Subsystem being configured.
D81 T2 Ethernet Adapter being configured.

Diagnostic Load Progress Indicators
-----------------------------------

Note: When a lowercase c is listed, it displays in the lower half of the seven-segment
character position.

c00 AIX Install/Maintenance loaded successfully.
c01 Insert the first diagnostic diskette.
c02 Diskettes inserted out of sequence.
c03 The wrong diskette is in diskette drive.
c04 The loading stopped with a nonrecoverable error.
c05 A diskette error occurred.
c06 The rc.boot configuration shell script is unable to determine type of boot.
c07 Insert the next diagnostic diskette.
c08 RAM file system started incorrectly.
c09 The diskette drive is reading or writing a diskette.
c20 An unexpected halt occurred, and the system is configured to enter the kernel
debug program instead of entering a system dump.
c21 The ifconfig command was unable to configure the network for the client network
host.
c22 The tftp command was unable to read client's ClientHostName info file during a
client network boot.
c24 Unable to read client's ClientHostName.info file during a client network boot.
c25 Client did not mount remote miniroot during network install.
c26 Client did not mount the /usr file system during the network boot.
c29 The system was unable to configure the network device.
c31 Select the console display for the diagnostics. To select No console display, set the 
key mode switch to Normal then to Service. The diagnostic programs will then load
and run the diagnostics automatically.
c32 A direct-attached display (HFT) was selected.
c33 A tty terminal attached to serial ports S1 or S2 was selected.
c34 A file was selected. The console messages store in a file.
c40 Configuration files are being restored.
c41 Could not determine the boot type or device.
c42 Extracting data files from diskette.
c43 Cannot access the boot/install tape.
c44 Initializing installation database with target disk information.
c45 Cannot configure the console.
c46 Normal installation processing.
c47 Could not create a physical volume identifier (PVID) on disk.
c48 Prompting you for input.
c49 Could not create or form the JFS log.
c50 Creating root volume group on target disks.
c51 No paging devices were found.
c52 Changing from RAM environment to disk environment.
c53 Not enough space in the /tmp directory to do a preservation installation.
c54 Installing either BOS or additional packages.
c55 Could not remove the specified logical volume in a preservation installation.
c56 Running user-defined customization.
c57 Failure to restore BOS.
c58 Displaying message to turn the key.
c59 Could not copy either device special files, device ODM, or volume group information
from RAM to disk.
c61 Failed to create the boot image.
c62 Loading platform dependent debug files
c63 Loading platform dependent data files
c64 Failed to load platform dependent data files
c70 Problem Mounting diagnostic CDROM disc
c99 Diagnostics have completed. This code is only used when there is no console.


0c0 The dump completed successfully 
0c1 The dump failed due to an I/O error. 
0c2 A user-requested dump has started. You requested a dump using the SYSDUMPSTART command, a dump key sequence, 
or the Reset button. 

0c3 The dump is inhibit 
0c4 The dump did not complete. A partial dump was written to the dump device. There is not enough space on the dump device
to contain the entire dump. To prevent this problem from occuring again, you must increase the size of your dumpmedia. 


0c5 The dump failed to start. An unecpected error occured while the system was attempting to write to the dump media. 
0c6 A dump to the secondary dump device was requested. Make the secondary dump device ready, then press CTRL-ALT-NUMPAD2. 
0c7 Reserved. 
0c8 The dump function is disabled. No primary dump device is configured. 
0c9 A dump is in progress. 
0cc Unknown dump failure 


---------- Diagnostics Load Progress Indicators ----------- 

c00 AIX Install/Maintenance loaded successfully. 
c01 Insert the first diagnostic diskette. 
c02 Diskettes inserted out of sequence. 
c03 The wrong diskette is in the drive. 
c04 The loading stopped with an irrecoverable error. 
c05 A diskette error occurred. 
c08 RAM filesystem started incorrectly. 
c07 Insert the next diagnostic diskette. 
c09 The diskette drive is reading or writing a diskette. 
c20 An unexpected halt occured, and the system is configured to enter the kernel debug program instead of entering asystem dump. 

c21 The 'ifconfig' command was unable to configure the network for the client network host. 
c22 The 'tftp' command was unable to read client's ClientHostName.info file during a client network boot. 
c24 Unable to read client's ClientHostName.info file during a client network boot. 
c25 Client did not mount remote miniroot during network install. 
c26 Client did not mount the /usr filesystem during the network boot. 
c29 System was unable to configure the network device. 
c31 Select the console display for the diagnostics. To select "No console display", set the key mode switch to normal then
to Service. The diagnostic program will then load and run the diagnostics automatically. 

c32 A direct-attached display (HFT) was selected. 
c33 a TTY terminal attached to serial ports S1 or S2 was selected. 
c34 A file was selected. The console messages store in a file 
c40 Configuration files are been restored. 
c41 Could not determine the boot type or device. 
c42 Extracting data files from diskette. 
c43 Diagboot cannot be accessed. 
c44 Initialyzing installation database with target disk information. 
c45 Cannot configure the console. 
c46 Normal installation processing. 
c47 Could not create a physical volume identifier (PVID) on disk. 
c48 Prompting you for input. 
c49 Could not create or form the JFS log. 
c50 Creating rootvg volume group on target disk 
c51 No paging space were found. 
c52 Changing from RAM environment to disk environment. 
c53 Not enough space in the /tmp directory to do a preservation installation. 
c54 Installing either BOS or additionnal packages. 
c55 Could not remove the specified logical volume in a preservation installation. 
c56 Running user-defined customization. 
c57 Failure to restore BOS. 
c58 Display message to turn the key. 
c59 Could not copy either device special files, device ODM, or volume group information from RAM to disk. 
c61 Failed to create the boot image. 
c70 Problem Mounting diagnostics CDROM disc. 
c99 Diagnostics have completed. This code is only used when there is no console. 


--------Debugger Progress Indicators ---------- 

c20 Kernel debug program activated. An unexpected system halt has occured, and you have configured the system 
to enter the kernel debug program instead of performing a dump. 


---------Built-In Self Test (Bist) Indicators--------- 

100 BIST completed successfully. Control was passed to IPL ROS. 
101 BIST started following RESET 
102 BIST started following Power-on Reset 
103 BIST could not determine the system model number. 
104 Equipment conflict. BIST could not find the CBA. 
105 BIST could not read the OCS EPROM. 
106 BIST detected a module error. 
111 OCS stopped. BIST detected a module error. 
112 A checkstop occured during BIST. 
113 BIST checkstop count is greater than 1. 
120 BIST starting a CRC check on the 8752 EPROM. 
121 BIST detected a bad CRC in the first 32K of the OCS EPROM. 
122 BIST started a CRC check on the first 32K of the OCS EPROM. 
123 BIST detected a bad CRC on the OCS area of NVRAM. 
124 BIST started a CRC check on the OCS area of NVRAM. 
125 BIST detected a bad CRC on the time-of-day area of NVRAM. 
126 BIST started a CRC check on the time-of-day area of the NVRAM. 
127 BIST detected a bad CRC on the 8752 EPROM. 
130 BIST presence test started. 
140 BIST failed: procedure error 
142 BIST failed: procedure error 
143 Invalid memory configuration. 
144 BIST failed; procedure error. 
151 BIST started AIPGM test code. 
152 BIST started DCLST test code. 
153 BIST started ACLST test code. 
154 BIST started AST test code. 
160 Bad EPOW Signal/Power status signal 
161 BIST being conducted on BUMP I/O 
162 BIST being conducted on JTAG 
163 BIST being conducted on Direct I/O 
164 BIST being conducted on CPU 
165 BIST being conducted on DCB and Memory 
166 BIST being conducted on interrupts 
170 BIST being conducted on 'Multi-Processor 
180 BIST logout failed. 
182 BIST COP bus not responding 
185 A checkstop condition occured during the BIST 
186 System logic-generated checkstop (Model 250 only) 
187 Graphics-generated checkstop (Model 250) 
195 BIST logout completed. 
888 BIST did not start 


------- Power-On Self Test ------- 

200 IPL attempted with keylock in the SECURE position. 
201 IPL ROM test failed or checkstop occured (irrecoverable) 
202 IPL ROM test failed or checkstop occured (irrecoverable) 
203 Unexpected data storage interrupt. 
204 Unexpected instruction storage interrupt. 
205 Unexpected external interrupt. 
206 Unexpected alignment interrupt. 
207 Unexpected program interrupt. 
208 Unexpected floating point unavailable interrupt. 
209 Unexpected SVC interrupt. 
20c L2 cache POST error. (The display shows a solid 20c for 5 seconds 
210 Unexpected SVC interrupt. 
211 IPL ROM CRC comparison error (irrecoverable). 
212 RAM POST memory configuration error or no memory found (irrecoverable). 
213 RAM POST failure (irrecoverable). 
214 Power status register failed (irrecoverable). 
215 A low voltage condition is present (irrecoverable). 
216 IPL ROM code being uncompressed into memory. 
217 End of bootlist encountered. 
218 RAM POST is looking for 1M bytes of good memory. 
219 RAM POST bit map is being generated. 
21c L2 cache is not detected. (The display shows a solid 21c for 5 sec) 
220 IPL control block is being initialized. 
221 NVRAM CRC comparison error during AIX. 
IPL(Key Mode Switch in Normal mode). 
Reset NVRAM by reaccomplishing IPL in Service mode. For systems with an internal, direct-bus-attached(DBA)disk,IPL 
ROM attempted to perform an IPL from that disk before halting with this three-digit display value. 
222 Attempting a Normal mode IPL from Standard I/O planar attached devices specified in NVRAM IPL Devices List. 
223 Attempting a Normal mode IPL from SCSI attached devices specified in NVRAM IPL Devices List. 
Note: May be caused by incorrect jumper setting for external SCSI devices or by incorrect SCSI terminator. 
REFER FFC B88 
224 Attempting a Normal mode restart from 9333 subsystem device specified in NVRAM device list. 
225 Attempting a Normal mode IPL from IBM 7012 DBA disk attached devices specified in NVRAM IPL Devices List. 
226 Attempting a Normal mode restart from Ethernet specified in NVRAM device list. 
227 Attempting a Normal mode restart from Token Ring specified in NVRAM device list. 
228 Attempting a Normal mode IPL from NVRAM expansion code. 
229 Attempting a Normal mode IPL from NVRAM IPL Devices List; cannot IPL from any of the listed devices, or there are 
no valid entry in the Devices List. 
22c Attempting a normal mode IPL from FDDI specified in NVRAM IPL device list. 
230 Attempting a Normal mode restart from adapter feature ROM specified in IPL ROM devices list. 
231 Attempting a Normal mode restart from Ethernet specified in IPL ROM devices list. 
232 Attempting a Normal mode IPL from Standard I/O planar attached devices specified in Rom Default Device List. 
233 Attempting a Normal mode IPL from SCSI attached devices specified in IPL ROM Default Device List. 
234 Attempting a Normal mode restart from 9333 subsystem device specified in IPL ROM device list. 
235 Attempting a Normal mode IPL from IBM 7012 DBA disk attached devices specified in IPL ROM Default Device List. 
236 Attempting a Normal mode restart from Ethernet specified in IPL ROM default devices list. 
237 Attempting a Normal mode restart from Token Ring specified in IPL ROM default device list. 
238 Attempting a Normal mode restart from Token Ring specified by the operator. 
239 System failed to restart from the device chosen by the operator. 
23c Attempting a normal mode IPL from FDDI specified in IPL ROM device list. 
240 Attempting a Service mode restart from adapter feature ROM. 
241 Attempting a Normal mode IPL from devices specified in the NVRAM IPL Devices List. 
242 Attempting a Service mode IPL from Standard I/O planar attached devices specified in NVRAM IPL Devices List. 
243 Attempting a Service mode IPL from SCSI attached devices specified in NVRAM IPL Devices List. 
244 Attempting a Service mode restart from 9333 subsystem device specified in NVRAM device list. 
245 Attempting a Service mode IPL from IBM 7012 DBA disk attached devices specified in NVRAM IPL Devices List. 
246 Attempting a Service mode restart from Ethernet specified in NVRAM device list. 
247 Attempting a Service mode restart from Token Ring specified in NVRAM device list. 
248 Attempting a Service mode IPL from NVRAM expansion code. 
249 Attempting a Service mode IPL from NVRAM IPL Devices List; cannot IPL from any of the listed devices, 
or there areno valid entries in the Devices List. 

24c Attempting a service mode IPL from FDDI specified in NVRAM IPL device list. 
250 Attempting a Service mode restart from adapter feature ROM specified in IPL ROM device list. 
251 Attempting a Service mode restart from Ethernet specified in IPL ROM device list. 
252 Attempting a Service mode IPL from standard I/O planar attached devicesspecified in ROM Default Device List. 
253 Attempting a Service mode IPL from SCSI attached devices specified in IPL ROM Default Device List. 
254 Attempting a Service mode restart from 9333 subsystem device specified in IPL ROM device list. 
255 Attempting a Service mode IPL from IBM 7012 DBA disk'attached devices specified in IPL ROM Default Devices List. 
256 Attempting a Service mode restart from Ethernet specified in IPL ROM default device list. 
257 Attempting a Service mode restart from Token Ring specified in IPL ROM default device list. 
258 Attempting a Service mode restart from Token Ring specified by the operator. 
259 Attempting a Service mode restart from FDDI specified by the operator. 

25c Attempting a normal mode IPL from FDDI specified in IPL ROM device list. 
260 Information is being displayed on the display console. 
261 Information will be displayed on the tty terminal when the "1" key is pressed on the tty terminal keyboard. 
262 A keyboard was not detected as being connected to the system's 
NOTE: Check for blown planar fuses or for a corrupted boot on disk drive 
263 Attempting a Normal mode restart from adapter feature ROM specified in NVRAM device list. 
269 Stalled state - the system is unable to IPL 
271 Mouse port POST. 
272 Tablet port POST. 
277 Auto Token-Ring LANstreamer MC 32 Adapter 
278 Video ROM Scan POST. 
279 FDDI adapter POST. 
280 3COM Ethernet POST. 
281 Keyboard POST executing. 
282 Parallel port POST executing 
283 Serial port POST executing 
284 POWER Gt1 graphadapte POST executing 
285 POWER Gt3 graphadapte POST executing 
286 Token Ring adapter POST executing. 
287 Ethernet adapter POST executing. 
288 Adapter card slots being queried. 
289 GTO POST. 
290 IOCC POST error (irrecoverable). 
291 Standard I/O POST running. 
292 SCSI POST running. 
293 IBM 7012 DBA disk POST running. 
294 IOCC bad TCW SIMM in slot location J being tested. 
295 Graphics Display adapter POST, color or grayscale. 
296 ROM scan POST. 
297 System model number does not compare between OCS and ROS 
(irrecoverable). Attempting a software IPL. 
298 Attempting a software IPL (warm boot). 
299 IPL ROM passed control to the loaded program code. 
301 Flash Utility ROM failed or checkstop occured (irrecoverable) 
302 Flash Utility ROM failed or checkstop occured (irrecoverable) 
302 Flash Utility ROM: User prompt, move the key to the service in order to perform an optional Flash Update. LED 
will only appear if the key switch is in the SECURE position. This signals the user that a Flash Update may be 
initiated by moving the key switch to the SERVICE position. If the key is moved to the SERVICE position, 
LED 303 will be displayed. This signals the user to press the reset button and select optional Flash Update. 
303 Flash Utility ROM: User prompt, press the reset button in order to perform an optional Flash Update. LED 
only appear if the key switch is in the SECURE position. This signals the user that a Flash Update may be initiated 
by moving the key switch to the SERVICE position. If the key is moved to the SERVICE position, LED 303 will be 
displayed. This signals the user to press the reset button and select optional Flash Update. 
304 Flash Utility ROM IOCC POST error (irrecoverable) 
305 Flash Utility ROM standard I/O POST running. 
306 Flash Utility ROM is attempting IPL from Flash Update Boot Image. 
307 Flash Utility ROM system model number does not compare between OCS and ROM (irrecoverable). 
308 Flash Utility ROM: IOCC TCW memory is being tested. 
309 Flash Utility ROM passed control to a Flash Update Boot Image. 
311 Flash Utility ROM CRC comparison error (irrecoverable). 
312 Flash Utility ROM RAM POST memory configuration error or no memory found ( iirecoverable). 
313 Flash Utility ROM RAM POST failure( irrecoverable). 
314 Flash Utility ROM Power status register failed (irrecoverable). 
315 Flash Utility ROM detected a low voltage condition. 
318 Flash Utility ROM RAM POST is looking for good memory. 
319 Flash Utility ROM RAM POST bit map is being generated. 
322 CRC error on media Flash Image. No Flash Update performed. 
323 Current Flash Image is being erased. 
324 CRC error on new Flash Image after Update was performed. (Flash Image is corrupted). 
325 Flash Image successful and complete. 

500 Querying Native I/O slot. 
501 Querying card in Slot 1 
502 Querying card in Slot 2 
503 Querying card in Slot 3 
504 Querying card in Slot 4 
505 Querying card in Slot 5 
506 Querying card in Slot 6 
507 Querying card in Slot 7 
508 Querying card in Slot 8 
510 Starting device configuration. 
511 Device configuration completed. 
512 Restoring device configuration files from media. 
513 Restoring basic operating system installation files from media. 
516 Contacting server during network boot 
517 Mounting client remote file system during network IPL. 
518 Remote mount of the root and /usr filesystems failed during network boot. 
520 Bus configuration running. 
521 /etc/init invoked cfgmgr with invalid options; /etc/init has been corrupted or incorrectly modified 
(irrecoverable error). 
522 The configuration manager has been invoked with conflicting options (irrecoverable error). 
523 The configuration manager is unable to access the ODM database (irrecoverable error). 
524 The configuration manager is unable to access the config rules object in the ODM database (irrecoverable error). 
525 The configuration manager is unable to get data from a customized device object in the ODM database 
(irrecoverable error). 
526 The configuration manager is unable to get data from a customized device driver objet in the ODM database 
(irrecoverable error). 
527 The configuration manager was invoked with the phase 1 flag; running phase 1 flag; running phase 1 at this point 
is not permitted (irrecoverable error). 
528 The configuration manager cannot find sequence rule, or no program was specified in the ODM database 
(irrecoverable error). 
529 The configuration manager is unable to update ODM data 
(irrecoverable error). 
530 The program "savebase" returned an error. 
531 The configuration manager is unable to access PdAt object class 
(irrecoverable eroor) 
532 There is not enough memory to continue (malloc failure); 
irrecoverable error. 
533 The configuration manager could not find a configure method for a device. 
534 The configuration manager is unable to aquire database lock. irrecoverable error. 
536 The configuration manager encountered more than one sequence rule specified in the same phase. (irrecoverable error). 
537 The configuration manager encountered an error when invoking the program in the sequence rule. 
538 The configuration manager is going to invoke a configuration 
539 The configuration method has terminated, and control has returned to the configuration manager. 
551 IPL Varyon is running 

552 IPL Varyon failed. 
553 IPL phase 1 is complete. 
554 Unable to define NFS swap device during network boot 
555 Unable to define NFS swap device during network boot 
556 Logical Volume Manager encountered error during IPL varyon. 
557 The root filesystem will not mount. 
558 There is not enough memory to continue the IPL. 
559 Less than 2MB of good memory are available to load the AIX kernel. 
570 Virtual SCSI devices being configured. 
571 HIPPI common function device driver being configured. 
572 HIPPI IPI-3 master transport driver being configured. 
573 HIPPI IPI-3 slave transport driver being configured. 
574 HIPPI IPI-3 transport services user interface device driver being configured. 
576 Generic async device driver being configured. 
577 Generic SCSI device driver being configured. 
578 Generic commo device driver being configured. 
579 Device driver being configured for a generic device. 
580 HIPPI TCPIP network interface driver being configured. 
581 Configuring TCP/IP. 
582 Configuring token ring data link control. 
583 Configuring an Ethernet data link control. 
584 Configuring an IEEE ethernet data link control. 
585 Configuring an SDLC MPQP data link control. 
586 Configuring a QLLC X.25 data link control. 
587 Configuring NETBIOS. 
588 Configuring a Bisync Read-Write (BSCRW). 
589 SCSI target mode device being configured. 
590 Diskless remote paging device being configured. 
591 Configuring an LVM device driver 
592 Configuring an HFT device driver 
593 Configuring SNA device drivers. 
594 Asynchronous I/O being defined or configured. 
595 X.31 pseudo device being configured. 
596 SNA DLC/LAPE pseudo device being configured. 
597 OCS software being configured. 
598 OCS hosts being configured during system reboot. 
599 Configuring FDDI data link control. 
5c0 Streams-based hardware drive being configured. 
5c1 Streams-based X.25 protocol being configured. 
5c2 Streams-based X.25 COMIO emulator driver being configured. 
5c3 Streams-based X.25 TCP/IP interface driver being configured. 
5c4 FCS adapter device driver being configured. 
5c5 SCB network device driver for FCS is being configured. 
5c6 AIX SNA channel being configured. 
600 Starting network boot portion of /sbin/rs.boot 
602 Configuring network parent devices. 
603 /usr/lib/methods/defsys 
/usr/lib/methods/cggsys, or 
/usr/lib/methods/cggbus failed. 
604 Configuring physical network boot device. 
605 Configuring physical network boot device failed. 
606 Running /usr/sbin/ifconfig on logical network boot device. 
607 /usr/sbin/ifconfig failed. 
608 Attempting to retrieve the client.info file with tftp. Note that a flashing 608 indicates multiple attempts 
to retrieve the client_info file are occuring. 
609 The client.info file does not exist or it is zero length. 
610 Attempting remote mount of NFS file system 
611 Remote mount of the NFS filesystem failed. 
612 Accessing remote files; unconfiguring network boot device. 
614 Configuring local paging devices. 
615 Configuring of a local paging device failed. 
616 Converting from diskette to dataless configuration. 
617 Diskless to dataless configuration failed. 
618 Configuring remote (NFS) paging devices. 
619 Configuration of a remote (NFS) paging device failed. 
620 Updating special device files and ODM in permanent filesystem with data from boot RAM filesystem. 
622 Boot process configuring for operating system installation. 

650 IBM SCSD disk drive drive being configured 
700 Progress indicator. A 1.1GB 8-bit SCSI disk drive being identified or configured. 
701 Progress indicator. A 1.1GB 16-bit SCSI SE disk drive being identified or configured. 
702 Progress indicator. A 1.1GB 16-bit SCSI differential disk drive being identified or configured. 
703 Progress indicator. A 2.2GB 8-bit SCSI disk drive being identified or configured. 
704 Progress indicator. A 2.2GB 16-bit SCSI SE disk drive being identified or configured. 
705 The configuration method for the 2.2GB 16-bit differential SCSI disk drive is being run. If a irrecoverable
error occurs, the system halts. identified or configured. 

706 Progress indicator. A 4.5GB 16-bit SE SCSI disk drive is being identified or configured. 
707 Progress indicator. A 4.5GB 16-bit differential SCSI drive is being identified or configured. 
708 Progress indicator: A L2 cache is being identified or configured. 
710 POWER GXT150M graphics adapterbeing ientifyied or configured. 
711 Unknown adapter being identified or configured. 
712 Graphics slot bus configuration is executing. 
713 The IBM ARTIC960 device is being configured. 
714 A video capture adapter is being configured. 
715 The Ultimedia Services audio adapter is being configured. This LED displays briefly on the panel. 
720 Unknown read/write optical drive type being configured. 
721 Unknown disk or SCSI device being identified or configured. 
722 Unknown disk being identified or configured. 
723 Unknown CDROM being identified or configured. 
724 Unknown tape drive being identified or configured. 
725 Unknown display being identified or configured. 
726 Unknown input device being idenor configured 
727 Unknown adync device being idenor configured


+++++ pSeries:

Display codes (LEDs)

This page provides descriptions for the numbers and characters that
display on the operator panel and descriptions of the location codes
used to identify a particular item. Information is available about the
following codes:


*Note:*
    AIX logical location codes can still be seen and supported under
    various AIX commands and functions. However, the Diagnostic screens
    and menus display physical location codes for resources when running
    versions 5.2.0 and later. For these systems, refer to Physical
    Location Codes
    <http://web.archive.org/web/20041112035526/http://publib16.boulder.ibm.com/pseries/en_US/infocenter/base/ledsearch.htm#led_physical_location>.


The basic formats of the AIX location codes are as follows:

    * For non-SCSI devices/drives:

AB-CD-EF-GH

    * For SCSI devices/drives:

AB-CD-EF-G,H

For planars, cards, and non-SCSI devices, the location code is defined
as follows:

AB-CD-EF-GH
 |  |  |  |
 |  |  |  Device/FRU/Port ID
 |  |  Connector ID
 |  devfunc Number, Adapter Number or Physical Location
 Bus Type or PCI Parent Bus
 

    * The AB value identifies a bus type or PCI parent bus as assigned
      by the firmware.
    * The CD value identifies adapter number, adapter's devfunc number,
      or physical location. The devfunc number is defined as the PCI
      device number times 8, plus the function number.
    * The EF value identifies a connector.
    * The GH value identifies a port, address, device, or FRU. 

Adapters and cards are identified only with AB-CD. The possible values
for AB are:
00 	Processor bus
01 	ISA bus
02 	EISA bus
03 	MCA bus
04 	PCI bus used in the case where the PCI bus cannot be identified
05 	PCMCIA buses
xy 	For PCI adapters where x is equal to or greater than 1. The x and y
are characters in the range of 0-9, A-H, J-N, P-Z (O, I, and lower case
are omitted) and are equal to the parent bus's ibm, aix-loc Open
Firmware Property.

The possible values for CD depend on the adapter or card are as follows:

    * For pluggable PCI adapters/cards, CD is the device's *devfunc*
      number (PCI device number times 8, plus the function number). The
      C and D are characters in the range of 0-9, and A-F (hex numbers).
      This allows the location code to uniquely identify multiple
      adapters on individual PCI cards.

      For pluggable ISA adapters, CD is equal to the order in which the
      ISA cards defined or configured, either by SMIT or the ISA Adapter
      Configuration Service Aid.

      For integrated ISA adapters, CD is equal to a unique code
      identifying the ISA adapter. In most cases, this is equal to the
      adapter's physical location code. In cases where a physical
      location code is not available, CD is FF.

    * EF is the connector ID. It is used to identify a connector on the
      adapter to which a resource is attached.
    * GH is used to identify a port, device, or FRU. For example:
          o For async devices, GH defines the port on the fanout box.
            The values are 00 to 15.
          o For a diskette drive, H defines either diskette drive 1 or
            2. G is always 0.
          o For all other devices, GH is equal to 00. 

For the integrated adapters, EF-GH is the same as the definition for the
pluggable adapters. For example, the location code for a diskette drive
is 01-D1-00-00. A second diskette drive is 01-D1-00-01.

For SCSI devices, the location code is defined as:

AB-CD-EF-G,H
 |  |  | | |
 |  |  | | Logical Unit address of the SCSI Device
 |  |  | Control Unit Address of the SCSI Device
 |  |  Connector ID
 |  devfunc Number, Adapter Number or Physical Location
 Bus Type or PCI Parent Bus
 

Where:

    * AB-CD-EF are the same as non-SCSI devices.
    * G defines the control unit address of the device. Values of 0 to
      15 are valid.
    * H defines the logical unit address of the device. Values of 0 to
      255 are valid. 

There is also a bus location code that is generated as '00-xxxxxxxx'
where xxxxxxxx is equivalent to the node's unit address. Refer to the
system unit service guide for additional information.

 Location Codes for CHRP Model Architecture System Units 	Top of page 
<http://web.archive.org/web/20041112035526/http://publib16.boulder.ibm.com/pseries/en_US/infocenter/base/ledsearch.htm#top>


*Note:*
    You need to know which system architecture the system unit on which
    you are working uses. If you are working with a RSPC model use the
    Location Codes for RSPC Model Architecture System Units
    <http://web.archive.org/web/20041112035526/http://publib16.boulder.ibm.com/pseries/en_US/infocenter/base/ledsearch.htm#led_rspc>.
    If you do not know which model you have, refer to Determining System
    Architecture
    <http://web.archive.org/web/20041112035526/http://publib16.boulder.ibm.com/pseries/en_US/infocenter/base/hardware_docs/pdf/380509.pdf>
    in /Diagnostic Information for Multiple Bus Systems/ before proceeding. 

The (CHRP) system unit uses Physical Location Codes in conjunction with
AIX Location Codes to provide mapping of the failing field replaceable
units. The location codes are produced by the system unit's firmware and
the AIX operating system.

 Diagnostic Load Progress Indicators 	Top of page 
<http://web.archive.org/web/20041112035526/http://publib16.boulder.ibm.com/pseries/en_US/infocenter/base/ledsearch.htm#top>


*Note:*
    Some systems might produce 4-digit codes. If the leftmost digit of a
    4-digit code is 0, use the three rightmost digits. 

*c00 *
    AIX Install/Maintenance loaded successfully. 

*c01 *
    Insert the first diagnostic diskette. 

*c02 *
    Diskettes inserted out of sequence. 

*c03 *
    The wrong diskette is in diskette drive. 

*c04 *
    The loading stopped with an irrecoverable error. 

*c05 *
    A diskette error occurred. 

*c06 *
    The *rc.boot* configuration shell script is unable to determine type
    of boot. 

*c07 *
    Insert the next diagnostic diskette. 

*c08 *
    RAM file system started incorrectly. 

*c09 *
    The diskette drive is reading or writing a diskette. 

*c20 *
    An unexpected halt occurred, and the system is configured to enter
    the kernel debug program instead of entering a system dump. 

*c21 *
    The *ifconfig* command was unable to configure the network for the
    client network host. 

*c22 *
    The *tftp* command was unable to read client's /ClientHostName/
    *info* file during a client network boot. 

*c24 *
    Unable to read client's /ClientHostName/.*info* file during a client
    network boot. 

*c25 *
    Client did not mount remote miniroot during network install. 

*c26 *
    Client did not mount the /usr file system during the network boot. 

*c29 *
    The system was unable to configure the network device. 

*c31 *
    Select the console display for the diagnostics. To select No console
    display, set the key mode switch to Normal then to Service. The
    diagnostic programs then load and run the diagnostics automatically.
    If you continue to get the message, check the cables and make sure
    you are using the serial port. 

*c32 *
    A directly attached display (HFT) was selected. 

*c33 *
    A TTY terminal attached to serial ports S1 or S2 was selected. 

*c34 *
    A file was selected. The console messages store in a file. 

*c35 *
    No console found. 

*c40 *
    Configuration files are being restored. 

*c41 *
    Could not determine the boot type or device. 

*c42 *
    Extracting data files from diskette. 

*c43 *
    Cannot access the boot/install tape. 

*c44 *
    Initializing installation database with target disk information. 

*c45 *
    Cannot configure the console. 

*c46 *
    Normal installation processing. 

*c47 *
    Could not create a physical volume identifier (PVID) on disk. 

*c48 *
    Prompting you for input. 

*c49 *
    Could not create or form the JFS log. 

*c50 *
    Creating root volume group on target disks. 

*c51 *
    No paging devices were found. 

*c52 *
    Changing from RAM environment to disk environment. 

*c53 *
    Not enough space in the */tmp* directory to do a preservation
    installation. 

*c54 *
    Installing either BOS or additional packages. 

*c55 *
    Could not remove the specified logical volume in a preservation
    installation. 

*c56 *
    Running user-defined customization. 

*c57 *
    Failure to restore BOS. 

*c58 *
    Displaying message to turn the key. 

*c59 *
    Could not copy either device special files, device ODM, or volume
    group information from RAM to disk. 

*c61 *
    Failed to create the boot image. 

*c62 *
    Loading platform dependent debug files. 

*c63 *
    Loading platform dependent data files. 

*c64 *
    Failed to load platform dependent data files. 

*c70 *
    Problem Mounting diagnostic CD-ROM disc. 

*c99 *
    Diagnostics have completed. This code is only used when there is no
    console. 

*Fxx *
    (xx is any number) Refer to Firmware chapter of the service manual. 


      Dump Progress Indicators (Dump Status Codes)

The following dump progress indicators, or dump status codes, are part
of a Type 102 message.

*Note:*
    When a lowercase c is listed, it displays in the lower half of the
    character position. Some systems produce 4-digit codes, the two
    leftmost positions can have a blanks or zeros. Use the two rightmost
    digits. 

*0c0 *
    The dump completed successfully. 

*0c1 *
    The dump failed due to an I/O error. 

*0c2 *
    A dump, requested by the user, is started. 

*0c3 *
    The dump is inhibited. 

*0c4 *
    The dump device is not large enough. 

*0c5 *
    The dump did not start, or the dump crashed. 

*0c6 *
    Dumping to a secondary dump device. 

*0c7 *
    Reserved. 

*0c8 *
    The dump function is disabled. 

*0c9 *
    A dump is in progress. 

*0cc *
    Unknown dump failure 


      Crash Codes

*Note:*
    Some systems may produce 4-digit codes. If the leftmost digit of a
    4-digit code is 0, use the three rightmost digits. 

The crash codes that follow are part of a Type 102 message. These crash
codes are grouped into three categories:

*Category 1 *
    Dump analysis is the appropriate first action in Problem
    Determination, begin the Problem Determination process with software
    support. 

*Category 2 *
    Dump analysis most likely will not aid in Problem Determination,
    begin the Problem Determination process with hardware support. 

*Category 3 *
    Both software and hardware support may be needed in Problem
    Determination, go to MAP 0070: 888 Sequence in Operator Panel
    Display
    <http://web.archive.org/web/20041112035526/http://publib16.boulder.ibm.com/pseries/en_US/infocenter/base/hardware_docs/pdf/380509.pdf>
    in /Diagnostic Information for Multiple Bus Systems/to assist in
    problem isolation. 


        Category 1

*300 *
    Data storage interrupt from the processor. 

*32x *
    Data storage interrupt because of an I/O exception from IOCC. 

*38x *
    Data storage interrupt because of an I/O exception from SLA. 

*400 *
    Instruction storage interrupt. 

*700 *
    Program interrupt. 


        Category 2

*200 *
    Machine check because of a memory bus error. 

*201 *
    Machine check because of a memory timeout. 

*202 *
    Machine check because of a memory card failure. 

*203 *
    Machine check because of a out of range address. 

*204 *
    Machine check because of an attempt to write to ROS. 

*205 *
    Machine check because of an uncorrectable address parity. 

*206 *
    Machine check because of an uncorrectable ECC error. 

*207 *
    Machine check because of an unidentified error. 

*208 *
    Machine check due to an L2 uncorrectable ECC. 

*500 *
    External interrupt because of a scrub memory bus error. 

*501 *
    External interrupt because of an unidentified error. 

*51x *
    External interrupt because of a DMA memory bus error. 

*52x *
    External interrupt because of an IOCC channel check. 

*53x *
    External interrupt from an IOCC bus timeout; x represents the IOCC
    number. 

*54x *
    External interrupt because of an IOCC keyboard check. 

*800 *
    Floating point is not available. 


        Category 3

*000 *
    Unexpected system interrupt. 

*558 *
    There is not enough memory to continue the IPL. 

*600 *
    AIX 4.3.3.3 and above: Alignment Interrupt. If pre-AIX 4.3.3.3: AIX
    has crashed because the Portability Assist Layer (PAL) for this
    machine type has detected a problem. 

*605 *
    AIX has crashed because the Portability Assist Layer (PAL) for this
    machine type has detected a problem (AIX 4.3.3.3 and above). 

 Operator Panel Display Numbers 	Top of page 
<http://web.archive.org/web/20041112035526/http://publib16.boulder.ibm.com/pseries/en_US/infocenter/base/ledsearch.htm#top>


This page contains a list of the various numbers and characters that
display in the operator panel display. There are three categories of
numbers and characters. The first group tracks the progress of the
configuration program. The second group tracks the progress of the
diagnostics. The third group provides information about messages that
follow an 888 sequence.


      Configuration Program Indicators

The numbers in this list display on the operator panel as the system
loads the operating system and prepares the hardware by loading software
drivers.

*Note:*
    Some systems may produce 4-digit codes. If the leftmost digit of a
    4-digit code is 0, use the three rightmost digits. 

*2E6 *
    The PCI Differential Ultra SCSI adapter or the Universal PCI
    Differential Ultra SCSI adapter being configured. 

*2E7 *
    Configuration method unable to determine if the SCSI adapter type is
    SE or DE type. 

*440 *
    9.1GB Ultra SCSI Disk Drive being identified or configured. 

*441 *
    18.2GB Ultra SCSI Disk Drive being identified or configured. 

*444 *
    2-Port Multiprotocol PCI Adapter (ASIC) being identified or configured. 

*447 *
    PCI 64-bit Fibre Channel Arbitrated Loop Adapter being configured. 

*500 *
    Querying Standard I/O slot. 

*501 *
    Querying card in Slot 1. 

*502 *
    Querying card in Slot 2. 

*503 *
    Querying card in Slot 3. 

*504 *
    Querying card in Slot 4. 

*505 *
    Querying card in Slot 5. 

*506 *
    Querying card in Slot 6. 

*507 *
    Querying card in Slot 7. 

*508 *
    Querying card in Slot 8. 

*510 *
    Starting device configuration. 

*511 *
    Device configuration completed. 

*512 *
    Restoring device configuration files from media. 

*513 *
    Restoring basic operating system installation files from media. 

*516 *
    Contacting server during network boot. 

*517 *
    Mounting client remote file system during network IPL. 

*518 *
    Remote mount of the *root (/)* and */usr* file systems failed during
    network boot. 

*520 *
    Bus configuration running. 

*521 *
    */etc/init* invoked *cfgmgr* with invalid options; */etc/init*has
    been corrupted or incorrectly modified (irrecoverable error). 

*522 *
    The configuration manager has been invoked with conflicting options
    (irrecoverable error). 

*523 *
    The configuration manager is unable to access the ODM database
    (irrecoverable error). 

*524 *
    The configuration manager is unable to access the config.rules
    object in the ODM database (irrecoverable error). 

*525 *
    The configuration manager is unable to get data from a customized
    device object in the ODM database (irrecoverable error). 

*526 *
    The configuration manager is unable to get data from a customized
    device driver object in the ODM database ( irrecoverable error). 

*527 *
    The configuration manager was invoked with the phase 1 flag; running
    phase 1 at this point is not permitted (irrecoverable error). 

*528 *
    The configuration manager cannot find sequence rule, or no program
    name was specified in the ODM database (irrecoverable error). 

*529 *
    The configuration manager is unable to update ODM data
    (irrecoverable error). 

*530 *
    The program *savebase* returned an error. 

*531 *
    The configuration manager is unable to access the *PdAt* object
    class (irrecoverable error). 

*532 *
    There is not enough memory to continue (malloc failure);
    irrecoverable error. 

*533 *
    The configuration manager could not find a configuration method for
    a device. 

*534 *
    The configuration manager is unable to acquire database lock
    (irrecoverable error). 

*535 *
    HIPPI diagnostics interface driver being configured. 

*536 *
    The configuration manager encountered more than one sequence rule
    specified in the same phase (irrecoverable error). 

*537 *
    The configuration manager encountered an error when invoking the
    program in the sequence rule. 

*538 *
    The configuration manager is going to invoke a configuration method. 

*539 *
    The configuration method has terminated, and control has returned to
    the configuration manager. 

*541 *
    A DLT tape device is being configured. 

*549 *
    Console could not be configured for the Copy a System Dump Menu. 

*551 *
    IPL vary-on is running. 

*552 *
    IPL vary-on failed. 

*553 *
    IPL phase 1 is complete. 

*554 *
    The boot device could not be opened or read, or unable to define NFS
    swap device during network boot. 

*555 *
    An ODM error occurred when trying to vary-on the rootvg, or unable
    to create an NFS swap device during network boot. 

*556 *
    Logical Volume Manager encountered error during IPL vary-on. 

*557 *
    The root filesystem does not mount. 

*558 *
    There is not enough memory to continue the system IPL. 

*559 *
    Less than 2 M bytes of good memory are available to load the AIX
    kernel. 

*569 *
    FCS SCSI protocol device is being configured (32 bits). 

*570 *
    Virtual SCSI devices being configured. 

*571 *
    HIPPI common function device driver being configured. 

*572 *
    HIPPI IPI-3 master transport driver being configured. 

*573 *
    HIPPI IPI-3 slave transport driver being configured. 

*574 *
    HIPPI IPI-3 transport services user interface device driver being
    configured. 

*575 *
    A 9570 disk-array driver being configured. 

*576 *
    Generic async device driver being configured. 

*577 *
    Generic SCSI device driver being configured. 

*578 *
    Generic commo device driver being configured. 

*579 *
    Device driver being configured for a generic device. 

*580 *
    HIPPI TCPIP network interface driver being configured. 

*581 *
    Configuring TCP/IP. 

*582 *
    Configuring Token-Ring data link control. 

*583 *
    Configuring an Ethernet data link control. 

*584 *
    Configuring an IEEE Ethernet data link control. 

*585 *
    Configuring an SDLC MPQP data link control. 

*586 *
    Configuring a QLLC X.25 data link control. 

*587 *
    Configuring a NETBIOS. 

*588 *
    Configuring a Bisync Read-Write (BSCRW). 

*589 *
    SCSI target mode device being configured. 

*590 *
    Diskless remote paging device being configured. 

*591 *
    Configuring an LVM device driver. 

*592 *
    Configuring an HFT device driver. 

*593 *
    Configuring SNA device drivers. 

*594 *
    Asynchronous I/O being defined or configured. 

*595 *
    X.31 pseudo-device being configured. 

*596 *
    SNA DLC/LAPE pseudo-device being configured. 

*597 *
    OCS software being configured. 

*598 *
    OCS hosts being configured during system reboot. 

*599 *
    Configuring FDDI data link control. 

*59B *
    FCS SCSI protocol device being configured (64 bits). 

*5C0 *
    Streams-based hardware drive being configured. 

*5C1 *
    Streams-based X.25 protocol being configured. 

*5C2 *
    Streams-based X.25 COMIO emulator driver being configured 

*5C3 *
    Streams-based X.25 TCP/IP interface driver being configured. 

*5C4 *
    FCS adapter device driver being configured. 

*5C5 *
    SCB network device driver for FCS being configured. 

*5C6 *
    AIX SNA channel being configured. 

*600 *
    Starting network boot portion of */sbin/rc.boot*. 

*602 *
    Configuring network parent devices. 

*603 *
    */usr/lib/methods/defsys, /usr/lib/methods/cfgsys,* or
    */usr/lib/methods/cfgbus* failed. 

*604 *
    Configuring physical network boot device. 

*605 *
    Configuration of physical network boot device failed. 

*606 *
    Running */usr/sbin/ifconfig* on logical network boot device. 

*607 *
    */usr/sbin/ifconfig* failed. 

*608 *
    Attempting to retrieve the *client.info* file with *tftp.*Note that
    a flashing 608 indicates multiple attempt(s) to retrieve the
    *client_info* file are occurring. 

*609 *
    The *client.info* file does not exist or it is zero length. 

*60B *
    18.2GB 68-pin LVD SCSI Disk Drive being configured. 

*610 *
    Attempting remote mount of NFS file system. 

*611 *
    Remote mount of the NFS file system failed. 

*612 *
    Accessing remote files; unconfiguring network boot device. 

*614 *
    Configuring local paging devices. 

*615 *
    Configuration of a local paging device failed. 

*616 *
    Converting from diskless to dataless configuration. 

*617 *
    Diskless to dataless configuration failed. 

*618 *
    Configuring remote (NFS) paging devices. 

*619 *
    Configuration of a remote (NFS) paging device failed. 

*61B *
    36.4GB 80-pin LVD SCSI Disk Drive being configured. 

*61D *
    36.4GB 80-pin LVD SCSI Disk Drive being configured. 

*61E *
    18.2GB 68-pin LVD SCSI Disk Drive being configured. 

*620 *
    Updating special device files and ODM in permanent filesystem with
    data from boot RAM filesystem. 

*621 *
    9.1 GB LVD 80-pin SCSI Drive being configured. 

*622 *
    Boot process configuring for operating system installation. 

*62D *
    9.1GB 68-pin LVD SCSI Disk Drive being configured. 

*62E *
    9.1GB 68-pin LVD SCSI Disk Drive being configured. 

*636 *
    TURBROWAYS 622 Mbps PCI MMF ATM Adapter. 

*637 *
    Dual Channel PCI-2 Ultra2 SCSI Adapter being configured. 

*638 *
    4.5GB Ultra SCSI Single Ended Disk Drive being configured. 

*639 *
    9.1GB 10K RPM Ultra SCSI Disk Drive (68-pin). 

*63A *
    See 62D. 

*63B *
    9.1GB 80-pin LVD SCSI Disk Drive being configured. 

*63C *
    See 60B. 

*63D *
    18.2GB 80-pin LVD SCSI Disk Drive being configured. 

*63E *
    36.4GB 68-pin LVD SCSI Disk Drive being configured. 

*63F *
    See 61B. 

*640 *
    9.1GB 10K RPM Ultra SCSI Disk Drive (80-pin). 

*646 *
    High-Speed Token-Ring PCI Adapter being configured. 

*64A *
    See 62E. 

*64B *
    9.1GB 80-pin LVD SCSI Disk Drive being configured. 

*64C *
    See 61E. 

*64D *
    18.2 GB LVD 80-pin Drive/Carrier being configured. 

*64E *
    36.4GB 68-pin LVD SCSI Disk Drive being configured. 

*64F *
    See 61D. 

*650 *
    IBM SCSD disk drive being configured. 

*653 *
    18.2GB Ultra-SCSI 16-bit Disk Drive being configured. 

*655 *
    GXT130P Graphics adapter being configured. 

*657 *
    GXT2000P graphics adapter being configured. 

*658 *
    PCI Fibre Channel Disk Subsystem Controller being identified or
    configured. 

*659 *
    2102 Fibre Channel Disk Subsystem Controller Drawer being identified
    or configured. 

*660 *
    2102 Fibre Channel Disk Array being identified or configured. 

*662 *
    Ultra2 Integrated SCSI controller. 

*663 *
    The ARTIC960RxD Digital Trunk Quad PCI Adapter or the ARTIC960RxF
    Digital Trunk Resource Adapter being configured. 

*664 *
    32x (MAX) SCSI-2 CD-ROM drive being configured. 

*667 *
    PCI 3-Channel Ultra2 SCSI RAID Adapter being configured. 

*669 *
    PCI Gigabit Ethernet Adapter being configured. 

*66C *
    10/100/1000 Base-T EthernetPCI Adapter. 

*66D *
    PCI 4-Channel Ultra-3 SCSI RAID Adapter. 

*66E *
    4.7 GB DVD-RAM drive. 

*674 *
    ESCON^(R) Channel PCI Adapter being configured. 

*677 *
    PCI 32-bit Fibre Channel Arbitrated Loop Adapter being configured. 

*67B *
    PCI Cryptographic Coprocessor being configured. 

*682 *
    20x (MAX) SCSI-2 CD-ROM Drive being configured. 

*689 *
    4.5GB Ultra SCSI Single Ended Disk Drive being configured. 

*68C *
    20 GB 4-mm Tape Drive being configured. 

*68E *
    POWER GXT6000P PCI Graphics Adapter. 

*690 *
    9.1GB Ultra SCSI Single Ended Disk Drive being configured. 

*69b *
    64-bit/66MHz PCI ATM 155 MMF PCI adapter being configured. 

*69d *
    64-bit/66MHz PCI ATM 155 UTP PCI adapter being configured. 

*6CC *
    SSA disk drive being configured. 

*700 *
    A 1.1 GB 8-bit SCSI disk drive being identified or configured. 

*701 *
    A 1.1 GB 16-bit SCSI disk drive being identified or configured. 

*702 *
    A 1.1 GB 16-bit differential SCSI disk drive being identified or
    configured. 

*703 *
    A 2.2 GB 8-bit SCSI disk drive being identified or configured. 

*704 *
    A 2.2 GB 16-bit SCSI disk drive being identified or configured. 

*705 *
    The configuration method for the 2.2 GB 16-bit differential SCSI
    disk drive is being run. If an irrecoverable error occurs, the
    system halts. 

*706 *
    A 4.5 GB 16-bit SCSI disk drive being identified or configured. 

*707 *
    A 4.5 GB 16-bit differential SCSI disk drive being identified or
    configured. 

*708 *
    A L2 cache being identified or configured. 

*710 *
    POWER GXT150M graphics adapter being identified or configured. 

*711 *
    Unknown adapter being identified or configured. 

*712 *
    Graphics slot bus configuration is executing. 

*713 *
    The IBM ARTIC960 device being configured. 

*714 *
    A video capture adapter being configured. 

*715 *
    The Ultramedia Services audio adapter being configured. (this number
    displays briefly on the panel). 

*717 *
    TP Ethernet Adapter being configured. 

*718 *
    GXT500 Graphics Adapter being configured. 

*720 *
    Unknown read/write optical drive type being configured. 

*721 *
    Unknown disk or SCSI device being identified or configured. 

*722 *
    Unknown disk being identified or configured. 

*723 *
    Unknown CD-ROM being identified or configured. 

*724 *
    Unknown tape drive being identified or configured. 

*725 *
    Unknown display adapter being identified or configured. 

*726 *
    Unknown input device being identified or configured. 

*727 *
    Unknown async device being identified or configured. 

*728 *
    Parallel printer being identified or configured. 

*729 *
    Unknown parallel device being identified or configured. 

*730 *
    Unknown diskette drive being identified or configured. 

*731 *
    PTY being identified or configured. 

*732 *
    Unknown SCSI initiator type being configured. 

*733 *
    7GB 8 mm tape drive being configured. 

*734 *
    4x SCSI-2 640 MB CD-ROM Drive being configured. 

*736 *
    Quiet Touch keyboard and speaker cable being configured. 

*741 *
    1080 MB SCSI Disk Drive being configured. 

*745 *
    16GB 4 mm Tape Auto Loader being configured. 

*746 *
    SCSI-2 Fast/Wide PCI Adapter being configured. 

*747 *
    SCSI-2 Differential Fast/Wide PCI Adapter being configured. 

*749 *
    7331 Model 205 Tape Library being configured. 

*751 *
    SCSI 32-bit SE F/W RAID Adapter being configured. 

*754 *
    1.1GB 16-bit SCSI disk drive being configured. 

*755 *
    2.2GB 16-bit SCSI disk drive being configured. 

*756 *
    4.5GB 16-bit SCSI disk drive being configured. 

*757 *
    External 13GB 1.5M/s 1/4 inch tape being configured. 

*763 *
    SP Switch MX Adapter being configured. 

*764 *
    SP System Attachment Adapter being configured. 

*772 *
    4.5GB SCSI F/W Disk Drive being configured. 

*773 *
    9.1GB SCSI F/W Disk Drive being configured. 

*774 *
    9.1GB External SCSI Disk Drive being configured. 

*776 *
    PCI Token-Ring Adapter being identified or configured. 

*777 *
    10/100 Ethernet Tx PCI Adapter being identified or configured. 

*778 *
    POWER GXT3000P 3D PCI Graphics adapter being configured. 

*77B *
    4-Port 10/100 Ethernet Tx PCI Adapter being identified or configured. 

*77c *
    A 1.0 GB 16-bit SCSI disk drive being identified or configured. 

*783 *
    4 mm DDS-2 Tape Autoloader being configured. 

*789 *
    2.6 GB External Optical Drive being configured. 

*78B *
    POWER GXT4000P PCI Graphics Adapter. 

*78C *
    PCI bus configuration executing. 

*78D *
    GXT300P 2D Graphics adapter being configured. 

*790 *
    Multi-bus Integrated Ethernet Adapter being identified or configured. 

*797 *
    TURBOWAYS^(R) 155 UTP/STP ATM Adapter being identified or configured. 

*798 *
    Video streamer adapter being identified or configured. 

*799 *
    2-Port Multiprotocol PCI adapter being identified or configured. 

*79c *
    ISA bus configuration executing. 

*7C0 *
    CPU/System Interface being configured. 

*7C1 *
    Business Audio Subsystem being identified or configured. 

*7cc *
    PCMCIA bus configuration executing. 

*800 *
    TURBOWAYS 155 MMF ATM Adapter being identified or configured. 

*803 *
    7336 Tape Library robotics being configured. 

*804 *
    8x Speed SCSI-2 CD-ROM Drive being configured. 

*806 *
    POWER GXT800 PCI Graphics adapter being configured. 

*807 *
    SCSI Device Enclosure being configured. 

*80c *
    SSA 4-Port Adapter being identified or configured. 

*811 *
    Processor complex being identified or configured. 

*812 *
    Memory being identified or configured. 

*813 *
    Battery for time-of-day, NVRAM, and so on being identified or
    configured, or system I/O control logic being identified or configured. 

*814 *
    NVRAM being identified or configured. 

*815 *
    Floating-point processor test. 

*816 *
    Operator panel logic being identified or configured. 

*817 *
    Time-of-day logic being identified or configured. 

*819 *
    Graphics input device adapter being identified or configured. 

*821 *
    Standard keyboard adapter being identified or configured. 

*823 *
    Standard mouse adapter being identified or configured. 

*824 *
    Standard tablet adapter being identified or configured. 

*825 *
    Standard speaker adapter being identified or configured. 

*826 *
    Serial Port 1 adapter being identified or configured. 

*827 *
    Parallel port adapter being identified or configured. 

*828 *
    Standard diskette adapter being identified or configured. 

*831 *
    3151 adapter being identified or configured, or Serial Port 2 being
    identified or configured. 

*834 *
    64-port async controller being identified or configured. 

*835 *
    16-port async concentrator being identified or configured. 

*836 *
    128-port async controller being identified or configured. 

*837 *
    16-port remote async node being identified or configured. 

*838 *
    Network Terminal Accelerator Adapter being identified or configured. 

*839 *
    7318 Serial Communications Server being configured. 

*840 *
    PCI Single-Ended Ultra SCSI Adapter being configured. 

*841 *
    8-port async adapter (EIA-232) being identified or configured. 

*842 *
    8-port async adapter (EIA-422A) being identified or configured. 

*843 *
    8-port async adapter (MIL-STD 188) being identified or configured. 

*844 *
    7135 RAIDiant Array disk drive subsystem controller being identified
    or configured. 

*845 *
    7135 RAIDiant Array disk drive subsystem drawer being identified or
    configured. 

*846 *
    RAIDiant Array SCSI 1.3GB Disk Drive being configured. 

*847 *
    16-port serial adapter (EIA-232) being identified or configured. 

*848 *
    16-port serial adapter (EIA-422) being identified or configured. 

*849 *
    X.25 Interface Coprocessor/2 adapter being identified or configured. 

*850 *
    Token-Ring network adapter being identified or configured. 

*851 *
    T1/J1 Portmaster^(R) adapter being identified or configured. 

*852 *
    Ethernet adapter being identified or configured. 

*854 *
    3270 Host Connection Program/6000 connection being identified or
    configured. 

*855 *
    Portmaster Adapter/A being identified or configured. 

*857 *
    FSLA adapter being identified or configured. 

*858 *
    5085/5086/5088 adapter being identified or configured. 

*859 *
    FDDI adapter being identified or configured. 

*85c *
    Token-Ring High-Performance LAN adapter being identified or configured. 

*861 *
    Optical adapter being identified or configured. 

*862 *
    Block Multiplexer Channel Adapter being identified or configured. 

*865 *
    ESCON Channel Adapter or emulator being identified or configured. 

*866 *
    SCSI adapter being identified or configured. 

*867 *
    Async expansion adapter being identified or configured. 

*868 *
    SCSI adapter being identified or configured. 

*869 *
    SCSI adapter being identified or configured. 

*870 *
    Serial disk drive adapter being identified or configured. 

*871 *
    Graphics subsystem adapter being identified or configured. 

*872 *
    Grayscale graphics adapter being identified or configured. 

*874 *
    Color graphics adapter being identified or configured. 

*875 *
    Vendor generic communication adapter being configured. 

*876 *
    8-bit color graphics processor being identified or configured. 

*877 *
    POWER Gt3^(TM) /POWER Gt4^(TM) being identified or configured. 

*878 *
    POWER Gt4 graphics processor card being configured. 

*879 *
    24-bit color graphics card, MEV2 being configured. 

*880 *
    POWER Gt1^(TM) adapter being identified or configured. 

*887 *
    Integrated Ethernet adapter being identified or configured. 

*889 *
    SCSI adapter being identified or configured. 

*890 *
    SCSI-2 Differential Fast/Wide and Single-Ended Fast/Wide Adapter/A
    being configured. 

*891 *
    Vendor SCSI adapter being identified or configured. 

*892 *
    Vendor display adapter being identified or configured. 

*893 *
    Vendor LAN adapter being identified or configured. 

*894 *
    Vendor async/communications adapter being identified or configured. 

*895 *
    Vendor IEEE 488 adapter being identified or configured. 

*896 *
    Vendor VME bus adapter being identified or configured. 

*897 *
    S/370^(TM) Channel Emulator adapter being identified or configured. 

*898 *
    POWER Gt1x^(TM) graphics adapter being identified or configured. 

*899 *
    3490 attached tape drive being identified or configured. 

*89c *
    A multimedia SCSI CD-ROM being identified or configured. 

*900 *
    GXT110P Graphics Adapter being identified or configured. 

*901 *
    Vendor SCSI device being identified or configured. 

*902 *
    Vendor display device being identified or configured. 

*903 *
    Vendor async device being identified or configured. 

*904 *
    Vendor parallel device being identified or configured. 

*905 *
    Vendor other device being identified or configured. 

*908 *
    POWER GXT1000 Graphics subsystem being identified or configured. 

*910 *
    1/4GB Fiber Channel/266 Standard Adapter being identified or
    configured. 

*911 *
    Fiber Channel/1063 Adapter Short Wave being configured. 

*912 *
    2.0GB SCSI-2 differential disk drive being identified or configured. 

*913 *
    1.0GB differential disk drive being identified or configured. 

*914 *
    5GB 8 mm differential tape drive being identified or configured. 

*915 *
    4GB 4 mm tape drive being identified or configured. 

*916 *
    Non-SCSI vendor tape adapter being identified or configured. 

*917 *
    A 2.0 GB 16-bit differential SCSI disk drive being identified or
    configured. 

*918 *
    A 2 GB 16-bit single-ended SCSI disk drive being identified or
    configured. 

*920 *
    Bridge Box being identified or configured. 

*921 *
    101 keyboard being identified or configured. 

*922 *
    102 keyboard being identified or configured. 

*923 *
    Kanji keyboard being identified or configured. 

*924 *
    Two-button mouse being identified or configured. 

*925 *
    Three-button mouse being identified or configured. 

*926 *
    5083 tablet being identified or configured. 

*927 *
    5083 tablet being identified or configured. 

*928 *
    Standard speaker being identified or configured. 

*929 *
    Dials being identified or configured. 

*930 *
    Lighted program function keys (LPFK) being identified or configured. 

*931 *
    IP router being identified or configured. 

*933 *
    Async planar being identified or configured. 

*934 *
    Async expansion drawer being identified or configured. 

*935 *
    3.5-inch diskette drive being identified or configured. 

*936 *
    5.25-inch diskette drive being identified or configured. 

*937 *
    An HIPPI adapter being configured. 

*938 *
    Serial HIPPI PCI adapter being configured. 

*942 *
    POWER GXT 100 graphics adapter being identified or configured. 

*943 *
    A 3480 or 3490 control unit attached to a System/370 Channel
    Emulator/A adapter are being identified or configured. 

*944 *
    100MB ATM adapter being identified or configured. 

*945 *
    1.0GB SCSI differential disk drive being identified or configured. 

*946 *
    Serial port 3 adapter being identified or configured. 

*947 *
    A 730MB SCSI disk drive being configured. 

*948 *
    Portable disk drive being identified or configured. 

*949 *
    Unknown direct bus-attach device being identified or configured. 

*950 *
    Missing SCSI device being identified or configured. 

*951 *
    670MB SCSI disk drive being identified or configured. 

*952 *
    355MB SCSI disk drive being identified or configured. 

*953 *
    320MB SCSI disk drive being identified or configured. 

*954 *
    400MB SCSI disk drive being identified or configured. 

*955 *
    857MB SCSI disk drive being identified or configured. 

*956 *
    670MB SCSI disk drive electronics card being identified or configured. 

*957 *
    120 MB DBA disk drive being identified or configured. 

*958 *
    160 MB DBA disk drive being identified or configured. 

*959 *
    160 MB SCSI disk drive being identified or configured. 

*960 *
    1.37GB SCSI disk drive being identified or configured. 

*964 *
    Internal 20 GB 8 mm tape drive identified or configured. 

*968 *
    1.0 GB SCSI disk drive being identified or configured. 

*970 *
    Half-inch, 9-track tape drive being identified or configured. 

*971 *
    150 MB 1/4-inch tape drive being identified or configured. 

*972 *
    2.3 GB 8 mm SCSI tape drive being identified or configured. 

*973 *
    Other SCSI tape drive being identified or configured. 

*974 *
    CD-ROM drive being identified or configured. 

*975 *
    An optical disk drive being identified or configured. 

*977 *
    M-Audio Capture and Playback Adapter being identified or configured. 

*981 *
    540MB SCSI-2 single-ended disk drive being identified or configured. 

*984 *
    1GB 8-bit disk drive being identified or configured. 

*985 *
    M-Video Capture Adapter being identified or configured. 

*986 *
    2.4GB SCSI disk drive being identified or configured. 

*987 *
    An Enhanced SCSI CD-ROM drive being identified or configured. 

*989 *
    200MB SCSI disk drive being identified or configured. 

*990 *
    2.0GB SCSI-2 single-ended disk drive being identified or configured. 

*991 *
    525MB 1/4-inch cartridge tape drive being identified or configured. 

*994 *
    5 GB 8 mm tape drive being identified or configured. 

*995 *
    1.2GB 1/4 inch cartridge tape drive being identified or configured. 

*996 *
    A single-port, multiprotocol communications adapter being identified
    or configured. 

*997 *
    FDDI adapter being identified or configured. 

*998 *
    2.0 GB 4 mm tape drive being identified or configured. 

*999 *
    7137 or 3514 Disk Array Subsystem being configured. 

*D46 *
    Token-Ring cable 

*D81 *
    T2 Ethernet Adapter being configured. 

*2530 *
    10/100 Mbps Ethernet PCI Adapter II being configured. 

 Physical Location Codes 	Top of page 
<http://web.archive.org/web/20041112035526/http://publib16.boulder.ibm.com/pseries/en_US/infocenter/base/ledsearch.htm#top>


*Note:*
    Diagnostic Versions 5.2.0 and later display physical location codes
    for all resources. Diagnostic versions earlier than 5.2.0 show a
    mixture of physical location codes and AIX location codes.

    As an example, under diagnostics version 5.2.0 might display a
    resource as:

ent0            P2/E1    IBM 10/100  Mbps Ethernet PCI adapter    

    The P2/E1 is the physical location code indicating an Ethernet port
    built into the P2 planar.

    whereas, in versions prior to 5.2.0, the resource might be shown as:

 ent0            10-60    IBM 10/100  Mbps Ethernet PCI adapter  

    The 10-60 is an AIX location code indicating a PCI parent bus of 10,
    and a devfunc number of 60 (for more information , see AIX Location
    Codes
    <http://web.archive.org/web/20041112035526/http://publib16.boulder.ibm.com/pseries/en_US/infocenter/base/ledsearch.htm#led_aix_loc_codes>).


    These physical location codes can appear in many places while
    running diagnostics; for instance, within resource menus, SRNs, or
    specific service aids.

Physical location codes provide a mapping of logical functions in a
platform (or expansion sites for logical functions, such as connectors
or ports) to their specific locations within the physical structure of
the platform.


      Location Code Format

The format for the location code is a string of alphanumeric characters
separated by a dash (-), slash (/), pound sign (#), or period (.). The
base location is all of the information before the slash (/) or pound
sign (#). It identifies a device that is connected or plugged into the
parent. Extended location information follows the slash (/). It
identifies a device that is part of the parent, a connector, or a cable.
Cable information follows the pound sign (#). It identifies a cable that
is connected to the parent. The following are examples:

    * P1 identifies system planar P1.
    * U1-P1 also identifies system planar P1 in a rack or drawer unit.
    * P2 identifies an I/O planar (including all integrated I/O devices).
    * P1-C1 identifies a CPU card C1 plugged into planar P1.
    * P1-M2 identifies a memory card or SIMM M2 plugged into planar P1.
    * P2/K1 identifies a keyboard port controller (with connector)
      connected to planar P2.
    * P1-K1 identifies a keyboard attached to connector K1 on planar P1.
    * P1/S1 identifies serial port 1 controller on planar P1, the
      connector for serial port 1, or the cable attached to connector S1.
    * P1-I2/E3 identifies; Ethernet controller 3 on the card plugged
      into slot 2 (I2) on planar P1, the connector for Ethernet
      controller 3, or the cable attached to Ethernet controller 3.
    * P1-I2#E3 identifies; the cable attached to Ethernet controller 3
      plugged into slot 2 (I2) on planar P1. 

The period (.) is used to identify sub-locations such as memory DIMMs on
a base memory card or a specific SCSI address. The following are examples:

    * P1-M1.4 identifies DIMM 4 on memory card 1 on planar 1.
    * U1-P1-M2.12 identifies DIMM 12 on memory card in slot 2 on the
      system planar.
    * P1-C1.1 identifies CPU 1 on CPU card 1 on planar 1.
    * P2/Z1-A3.1 identifies a SCSI device with a SCSI address of LUN 1
      at SCSI ID 3 attached to SCSI bus 1 from planar 2.
    * P1-I2#E3.2 identifies the second cable in a series of cables
      attached to Ethernet controller 3 in slot 2 (I2) on planar 1. 

Depending on the AIX and firmware levels, AIX Diagnostics may include
extended location information when identifying a planar or card. The
extended location information or cable information is always included
when identifying a cable or connector. Location codes with extended
location information that display without a description identifying the
devices, always identify the cable attached to the port.


      Physical Location Code Standard Prefixes

The following table lists the assigned values for the location type
prefixes. In most cases, the prefix value assignments were chosen to
provide some mnemonic characteristic, so that they would be easier to
remember. The underlined characters in the description field are
intended to illustrate this mnemonic relationship.

Description 	Prefix Value (n=instance #)
Rack or drawer _u_nit 	Un
Drawer _u_nit mounted in a rack 	Un.n (U0.n if rack cannot be sensed by
firmware)
Single enclosure platform 	(No enclosure location code)
_P_lanar (backplane, system, I/O) 	Pn
_P_lanar riser card, extender 	Pn.n
Power/_v_oltage supply, _v_oltage regulator, backup battery 	Vn
_F_an/sensor 	Fn
_L_ED/_L_CD operator panel
or
Logical device address n relative to adapter port
	Ln
_C_PU/cache card (or pluggable module if on planar) 	Cn
_C_PU/cache module on CPU card (if pluggable) 	Cn.n
_M_emory card or SIMM/DIMM on planar 	Mn
_M_emory SIMM/DIMM on memory card 	Mn.n
Other _e_xtra-function base system cards (for example, service
processor) 	Xn
_I_/O adapter 	In
Pluggable modules or daughter cards on _I_/O adapter 	In.n
_D_evice in Bay n 	Dn
Ports/Connectors: 	
_G_raphics/video connector 	Gn
_K_eyboard/keyboard connector 	Kn
M_o_use/mouse connector 	On
_S_erial port 	Sn
Parallel port 	Rn
_E_thernet connector 	En
_T_oken Ring connector 	Tn
SCSI (pronounced scu_z_zy) connector 	Zn
Other I/O ports or connectors 	Qn
SCSI device addresses (including SSA (Serial Storage Architecture)) 	
Primary _a_ddress (SCSI control unit ID) 	An
Primary and secondary _a_ddress (SCSI ID and LUN (Logical Unit
Number)) 	An.n
SCSI device location in SCSI Enclosure Services (SES) 	
SCSI bank 	Bn
SCSI bank and bay 	Bn.n
Undefined prefixes (reserved) 	H, J, N, Y
Unique device address, this address remains constant independent of
which port the device is attached to. 	Wn

 Location Codes for RSPC Model Architecture System Units 	Top of page 
<http://web.archive.org/web/20041112035526/http://publib16.boulder.ibm.com/pseries/en_US/infocenter/base/ledsearch.htm#top>


*Notes: *

   1. RSPC systems are only supported with AIX or Diagnostic versions
      below 5.2.0
   2. You need to know which system architecture the system unit on
      which you are working uses. If you are working with a CHRP model,
      use the Location Codes for CHRP Model Architecture System Units
      <http://web.archive.org/web/20041112035526/http://publib16.boulder.ibm.com/pseries/en_US/infocenter/base/ledsearch.htm#led_chrp>.
      If you do not know which model you have, refer to Determining
      System Architecture
      <http://web.archive.org/web/20041112035526/http://publib16.boulder.ibm.com/pseries/en_US/infocenter/base/hardware_docs/pdf/380509.pdf>
      in /Diagnostic Information for Multiple Bus Systems/ before
      proceeding.

Because the same diagnostic programs are used on all system units, a
location code is used to physically locate a failing device or unit. The
location code is displayed along with the service request number (SRN)
when the diagnostic programs isolate a failure. If the location code is
not known, you can run the Display Previous Diagnostic Results service
aid to display the results of the last time the diagnostic programs were
run.

The basic format of the system unit's location code is:

AB-CD-EF-GH  non-SCSI
AB-CD-EF-G,H  SCSI

For planars, cards, and non-SCSI devices, the location code is defined
as follows:

AB-CD-EF-GH
 |  |  |  |
 |  |  |  Device/FRU/Port ID
 |  |  Connector ID
 |  Slot or Adapter Number
 Bus Type

    * AB identifies a bus type.
    * CD identifies a slot or adapter number.
    * EF is the connector identifier, used to identify the adapter
      connector to which a resource is attached.
    * GH identifies a port, address, memory module, device, or FRU. GH
      has several meanings depending upon the resource type, they are as
      follows:
          o For memory cards, GH defines a memory module. Values for GH
            are 1 through 16.

            For systems that have memory modules that plug directly into
            the system planar, the location code is 00-00-00-GH where GH
            is the memory module slot. For systems that have memory
            cards with memory modules, the location code is 00-CD-EF-GH,
            where CD is the card slot and GH is the memory module slot.

          o For L2 caches, GH defines the cache. Values for GH are 1
            through 16.
          o For PCMCIA devices, GH defines the PCMCIA. Values for GH are
            1 through 16.
          o For async devices, GH defines the port on the fanout box.
            Values are 00 to 15.
          o For a diskette drive, H defines which diskette drive 1 or 2.
            G is always 0.
          o For all other devices, GH is equal to 00. 

For integrated adapters, EF-GH is the same as the definition for a
pluggable adapter. For example, the location code for a diskette drive
is 01-A0-00-00. A second diskette drive is 01-A0-00-01.

For SCSI, the location code is defined as follows:

AB-CD-EF-G,H
 |  |  | | |
 |  |  | | Logical Unit Address of SCSI Device
 |  |  | Control Unit Address of SCSI Device
 |  |  Connector ID
 |  Slot or Adapter Number
 Bus Type

Where:

    * AB-CD-EF are the same as non-SCSI devices.
    * G defines the control unit address of the device. Values of 0 to
      15 are valid.
    * H defines the logical unit address of the device. Values of 0 to
      255 are valid.

Adapters and cards are identified with only AB-CD. The possible values
for AB are as follows:

 00   for processor bus
 01   for ISA buses
 04   for PCI buses
 05   for PCMCIA buses (not supported on 7024)

The possible values for CD depend on the adapter or card.

For pluggable adapters or cards, this is a two-digit slot number in the
range from 01 to 99. However, in the case of ISA cards these numbers do
not actually correspond to the physical slot numbers. They simply are
based on the order in which the ISA cards are defined or configured,
either by SMIT or the ISA Adapter Configuration Service Aid.

For integrated adapters, the first character (C) is a letter in the
range from A to Z. This letter is based on the order in which the
integrated adapters are defined in residual data. This ensures unique
location codes for the integrated adapters. The second character (D) is
set to 0.

Refer to the following RSPC location code examples:

Processor-PCI bus
 00-00       PCI bus
Memory module in system planar
 00-00-00-01
Memory module in card
 00-0A-00-01
Integrated PCI adapters
 04-A0 ISA bus (Integrated PCI-ISA bridge)
 04-B0 Secondary PCI bus (Integrated PCI-PCI bridge)
 04-C0 Integrated PCI SCSI controller
Non-integrated PCI adapters
 04-01 Any PCI card in slot 1
 04-02 Any PCI card in slot 2
Integrated ISA adapters
 01-A0 Diskette adapter
 01-B0 Parallel port adapter
 01-C0 Serial port 1 adapter
 01-D0 Serial port 2 adapter
 01-E0 Keyboard adapter
 01-F0 Mouse adapter
Non-integrated ISA adapters
 01-01 First ISA card defined/configured
 01-02 Second ISA card defined/configured
 01-03 Third ISA card defined/configured
 01-04 Fourth ISA card defined/configured
Device attached to SCSI controller
 04-C0-01-4,0 Device attached to Integrated PCI SCSI controller


+++++ RS/6000

RS/6000 Diagnostic LED's

------------------------------------------------------------------------
TITLE : Diagnostic LED numbers and codes.
OS LEVEL : AIX
DATE : 07/04/99
VERSION : 1.0
------------------------------------------------------------------------

Built-In Self-Test (BIST) Indicators
------------------------------------

100 BIST completed successfully; control was passed to IPL ROS.
101 BIST started following reset.
102 BIST started, following the system unit's power-on reset.
103 BIST could not determine the system model number.
104 Equipment conflict; BIST could not find the CBA.
105 BIST could not read from the OCS EPROM.
106 BIST failed: CBA not found
111 OCS stopped; BIST detected a module error.
112 A checkstop occurred during BIST; checkstop results could
    not be logged out.
113 Three checkstops have occurred.
120 BIST starting a CRC check on the 8752 EPROM.
121 BIST detected a bad CRC in the first 32K bytes of the OCS EPROM.
122 BIST started a CRC check on the first 32K bytes of the OCS
    EPROM.
123 BIST detected a bad CRC on the OCS area of NVRAM.
124 BIST started a CRC check on the OCS area of NVRAM.
125 BIST detected a bad CRC on the time-of-day area of NVRAM.
126 BIST started a CRC check on the time-of-day area of NVRAM.
127 BIST detected a bad CRC on the 8752 EPROM.
130 BIST presence test started.
140 Running BIST. (Box Manufacturing Mode Only)
142 Box manufacturing mode operation.
143 Invalid memory configuration.
144 Manufacturing test failure.
151 BIST started AIPGM test code.
152 BIST started DCLST test code.
153 BIST started ACLST test code.
154 BIST started AST test code.
160 Bad EPOW Signal/Power status signal.
161 BIST being conducted on BUMP I/O.
162 BIST being conducted on JTAG.
163 BIST being conducted on Direct I/O.
164 BIST being conducted on CPU.
165 BIST being conducted on DCB and Memory.
166 BIST being conducted on Interrupts.
170 BIST being conducted on Multi-Processors.
180 Logout in progress.
182 BIST COP bus not responding.
185 A checkstop condition occurred during the BIST.
186 System logic-generated checkstop (Model 250 only).
187 Graphics-generated checkstop (Model 250).
195 Checkstop logout complete
199 Generic SCSI backplane
888 BIST did not start.

Power-On Self-Test (POST) Indicators
------------------------------------

200 IPL attempted with keylock in the Secure position.
201 IPL ROM test failed or checkstop occurred (irrecoverable).
202 Unexpected machine check interrupt.
203 Unexpected data storage interrupt.
204 Unexpected instruction storage interrupt.
205 Unexpected external interrupt.
206 Unexpected alignment interrupt.
207 Unexpected program interrupt.
208 Unexpected floating point unavailable interrupt.
209 Unexpected SVC interrupt.
20c L2 cache POST error. (The display shows a solid 20c for 5
    seconds.)
210 Unexpected SVC interrupt.
211 IPL ROM CRC comparison error (irrecoverable).
212 RAM POST memory configuration error or no memory found
    (irrecoverable).
213 RAM POST failure (irrecoverable).
214 Power status register failed (irrecoverable).
215 A low voltage condition is present (irrecoverable).
216 IPL ROM code being uncompressed into memory.
217 End of boot list encountered.
218 RAM POST is looking for good memory.
219 RAM POST bit map is being generated.
21c L2 cache is not detected. (The display shows a solid 21c for
    2 seconds.)
220 IPL control block is being initialized.
221 NVRAM CRC comparison error during AIX IPL(key mode switch in
    Normal mode). Reset NVRAM by reaccomplishing IPL in Service mode.
    For systems with an internal, direct-bus-attached (DBA) disk, 
    IPL ROM attempted to perform an IPL from that disk before halting 
    with this operator panel display value.
222 Attempting a Normal mode IPL from Standard I/O
    planar-attached devices specified in NVRAM IPL Devices List.
223 Attempting a Normal mode IPL from SCSI-attached devices
    specified in NVRAM IPL Devices List.
224 Attempting a Normal mode IPL from 9333 subsystem device
    specified in NVRAM IPL Devices List.
225 Attempting a Normal mode IPL from 7012 DBA disk-attached
    devices specified in NVRAM IPL Devices List.
226 Attempting a Normal mode IPL from Ethernet specified in
    NVRAM IPL Devices List.
227 Attempting a Normal mode IPL from Token-Ring specified in
    NVRAM IPL Devices List.
228 Attempting a Normal mode IPL from NVRAM expansion code.
229 Attempting a Normal mode IPL from NVRAM IPL Devices List;
    cannot IPL from any of the listed devices, or there are no 
    valid entries in the Devices List.
22c Attempting a normal mode IPL from FDDI specified in NVRAM
    IPL device list.
230 Attempting a Normal mode IPL from adapter feature ROM
    specified in IPL ROM Device List.
231 Attempting a Normal mode IPL from Ethernet specified in IPL
    ROM Device List.
232 Attempting a Normal mode IPL from Standard I/O
    planar-attached devices specified in ROM Default Device List.
233 Attempting a Normal mode IPL from SCSI-attached devices
    specified in IPL ROM Default Device List.
234 Attempting a Normal mode IPL from 9333 subsystem device
    specified in IPL ROM Device List.
235 Attempting a Normal mode IPL from 7012 DBA disk-attached
    devices specified in IPL ROM Default Device List.
236 Attempting a Normal mode IPL from Ethernet specified in IPL
    ROM Default Device List.
237 Attempting a Normal mode IPL from Token-Ring specified in
    IPL ROM Default Device List.
238 Attempting a Normal mode IPL from Token-Ring specified by
    the operator.
239 System failed to IPL from the device chosen by the operator.
23c Attempting a normal mode IPL from FDDI specified in IPL ROM
    device list.
240 Attempting a Service mode IPL from adapter feature ROM.
241 Attempting a normal boot from devices specified in the NVRAM
    boot list.
242 Attempting a Service mode IPL from Standard I/O
    planar-attached devices specified in the NVRAM IPL Devices List.
243 Attempting a Service mode IPL from SCSI-attached devices
    specified in the NVRAM IPL Devices List.
244 Attempting a Service mode IPL from 9333 subsystem device
    specified in the NVRAM IPL Devices List.
245 Attempting a Service mode IPL from 7012 DBA disk-attached
    devices specified in the NVRAM IPL Devices List.
246 Attempting a Service mode IPL from Ethernet specified in the
    NVRAM IPL Devices List.
247 Attempting a Service mode IPL from Token-Ring specified in
    the NVRAM Device List.
248 Attempting a Service mode IPL from NVRAM expansion code.
249 Attempting a Service mode IPL from the NVRAM IPL Devices
    List; cannot IPL from any of the listed devices, or there 
    are no valid entries in the Devices List.
24c Attempting a service mode IPL from FDDI specified in NVRAM
    IPL device list.
250 Attempting a Service mode IPL from adapter feature ROM
    specified in the IPL ROM Device List.
251 Attempting a Service mode IPL from Ethernet specified in the
    IPL ROM Default Device List.
252 Attempting a Service mode IPL from Standard I/O
    planar-attached devices specified in the ROM Default Device List.
253 Attempting a Service mode IPL from SCSI-attached devices
    specified in the IPL ROM Default Device List.
254 Attempting a Service mode IPL from 9333 subsystem device
    specified in the IPL ROM Devices List.
255 Attempting a Service mode IPL from 7012 DBA disk-attached
    devices specified in IPL ROM Default Device List.
256 Attempting a Service mode IPL from Ethernet specified in the
    IPL ROM Devices List.
257 Attempting a Service mode IPL from Token-Ring specified in
    the IPL ROM Devices List.
258 Attempting a Service mode IPL from Token-Ring specified by
    the operator.
259 Attempting a Service mode IPL from FDDI specified by the
    operator.
25c Attempting a service mode IPL from FDDI specified in IPL ROM
    device list.
260 Information is being displayed on the display console.
261 No supported local system display adapter was found.
262 Keyboard not detected as being connected to the system's
    keyboard port.
263 Attempting a Normal mode IPL from adapter feature ROM
    specified in the NVRAM Device List.
269 Stalled state - the system is unable to IPL.
270 Low Cost Ethernet Adapter (LCE) POST executing
271 Mouse and Mouse port POST.
272 Tablet Port POST.
276 10/100Mbps MCA Ethernet Adapter POST executing
277 Auto Token-Ring LANstreamer MC 32 Adapter.
278 Video ROM scan POST.
279 FDDI POST.
280 3com Ethernet POST.
281 Keyboard POST executing.
282 Parallel port POST executing.
283 Serial port POST executing.
284 POWER Gt1 graphics adapter POST executing.
285 POWER Gt3 graphics adapter POST executing.
286 Token-Ring adapter POST executing.
287 Ethernet adapter POST executing.
288 Adapter card slots being queried.
289 POWER GT0 Display Adapter POST.
290 IOCC POST error (irrecoverable).
291 Standard I/O POST running.
292 SCSI POST running.
293 7012 DBA disk POST running.
294 IOCC bad TCW memory module in slot location J being tested.
295 Graphics Display adapter POST, color or grayscale.
296 ROM scan POST.
297 System model number does not compare between OCS and ROS
    (irrecoverable).
298 Attempting a software IPL.
299 IPL ROM passed control to the loaded program code.
301 Flash Utility ROM test failed or checkstop occurred
    (irrecoverable
302 Flash Utility ROM: User prompt, move the key to the service
    position in order to perform an optional Flash Update. LED 302
    will only appear if the key switch is in the secure position. 
    This signals the user that a Flash Update may be initiated by
    moving the key switch to the service position. If the key is
    moved to the service position then LED 303 will be displayed, 
    this signals the user to press the Reset button and select 
    optional Flash Update.
303 Flash Utility ROM: User prompt, press the Reset button in
    order to perform an optional Flash Update. LED 3�2 will only 
    appear if the key switch is the secure position. This signals 
    the user that a Flash Update may be initiated by moving the
    key switch to the service position. If the key is moved to the
    service position LED 303 will be displayed, this signals the 
    user to press the Reset button and select optional Flash Update.
304 Flash Utility ROM IOCC POST error (irrecoverable).
305 Flash Utility ROM standard I/O POST running.
306 Flash Utility ROM is attempting IPL from Flash Update media
    device.
307 Flash Utility ROM system model number does not compare
    between OCS and ROM (irrecoverable).
308 Flash Utility ROM: IOCC TCW memory is being tested.
309 Flash Utility ROM passed control to a Flash Update Boot Image.
311 Flash Utility ROM CRC comparison error (irrecoverable).
312 Flash Utility ROM RAM POST memory configuration error or no
    memory found (irrecoverable).
313 Flash Utility ROM RAM POST failure (irrecoverable).
314 Flash Utility ROM Power status register failed (irrecoverable).
315 Flash Utility ROM detected a low voltage condition.
318 Flash Utility ROM RAM POST is looking for good memory.
319 Flash Utility ROM RAM POST bit map is being generated.
322 CRC error on media Flash Image. No Flash Update performed.
323 Current Flash Image is being erased.
324 CRC error on new Flash Image after Update was performed.
    (Flash Image is cor-rupted.)
325 Flash Update successful and complete.

Configuration Program Indicators
--------------------------------

500 Querying Standard I/O slot.
501 Querying card in Slot 1.
502 Querying card in Slot 2.
503 Querying card in Slot 3.
504 Querying card in Slot 4.
505 Querying card in Slot 5.
506 Querying card in Slot 6.
507 Querying card in Slot 7.
508 Querying card in Slot 8.
510 Starting device configuration.
511 Device configuration completed.
512 Restoring device configuration files from media.
513 Restoring basic operating system installation files from media.
516 Contacting server during network boot.
517 Mounting client remote file system during network IPL.
518 Remote mount of the root and /usr file systems failed during
    network boot.
520 Bus configuration running.
521 /etc/init invoked cfgmgr with invalid options; /etc/init has
    been corrupted or incor-rectly modified (irrecoverable error).
522 The configuration manager has been invoked with conflicting
    options (irrecoverable error).
523 The configuration manager is unable to access the ODM
    database (irrecoverable error).
524 The configuration manager is unable to access the
    config.rules object in the ODM database (irrecoverable error).
525 The configuration manager is unable to get data from a
    customized device object in the ODM database (irrecoverable error).
526 The configuration manager is unable to get data from a
    customized device driver object in the ODM database ( irrecoverable 
    error).
527 The configuration manager was invoked with the phase 1 flag;
    running phase 1 at this point is not permitted (irrecoverable error).
528 The configuration manager cannot find sequence rule, or no
    program name was specified in the ODM database (irrecoverable error).
529 The configuration manager is unable to update ODM data
    (irrecoverable error).
530 The program savebase returned an error.
531 The configuration manager is unable to access the PdAt
    object class (irrecoverable error).
532 There is not enough memory to continue (malloc failure);
    irrecoverable error.
533 The configuration manager could not find a configure method
    for a device.
534 The configuration manager is unable to acquire database lock
    (irrecoverable error).
535 HIPPI diagnostics interface driver being configured.
536 The configuration manager encountered more than one sequence
    rule specified in the same phase (irrecoverable error).
537 The configuration manager encountered an error when invoking
    the program in the sequence rule.
538 The configuration manager is going to invoke a configuration
    method.
539 The configuration method has terminated, and control has
    returned to the configura-tion manager.
551 IPL vary-on is running.
552 IPL varyon failed.
553 IPL phase 1 is complete.
554 The boot device could not be opened or read, or unable to
    define NFS swap device during network boot.
555 An ODM error occurred when trying to varyon the rootvg, or
    unable to create an NFS swap device during network boot.
556 Logical Volume Manager encountered error during IPL vary-on.
557 The root filesystem will not mount.
558 There is not enough memory to continue the system IPL.
559 Less than 2 M bytes of good memory are available to load the
    AIX kernel.
570 Virtual SCSI devices being configured.
571 HIPPI common function device driver being configured.
572 HIPPI IPI-3 master transport driver being configured.
573 HIPPI IPI-3 slave transport driver being configured.
574 HIPPI IPI-3 transport services user interface device driver
    being configured.
575 A 9570 disk-array driver is being configured.
576 Generic async device driver being configured.
577 Generic SCSI device driver being configured.
578 Generic commo device driver being configured.
579 Device driver being configured for a generic device.
580 HIPPI TCPIP network interface driver being configured.
581 Configuring TCP/IP.
582 Configuring Token-Ring data link control.
583 Configuring an Ethernet data link control.
584 Configuring an IEEE Ethernet data link control.
585 Configuring an SDLC MPQP data link control.
586 Configuring a QLLC X.25 data link control.
587 Configuring a NETBIOS.
588 Configuring a Bisync Read-Write (BSCRW).
589 SCSI target mode device being configured.
590 Diskless remote paging device being configured.
591 Configuring an LVM device driver.
592 Configuring an HFT device driver.
593 Configuring SNA device drivers.
594 Asynchronous I/O being defined or configured.
595 X.31 pseudo-device being configured.
596 SNA DLC/LAPE pseudo-device being configured.
597 OCS software being configured.
598 OCS hosts being configured during system reboot.
599 Configuring FDDI data link control.
5c0 Streams-based hardware drive being configured.
5c1 Streams-based X.25 protocol being configured.
5c2 Streams-based X.25 COMIO emulator driver being configured.
5c3 Streams-based X.25 TCP/IP interface driver being configured.
5c4 FCS adapter device driver being configured.
5c5 SCB network device driver for FCS is being configured.
5c6 AIX SNA channel being configured.
600 Starting network boot portion of /sbin/rc.boot
602 Configuring network parent devices.
603 /usr/lib/methods/defsys, /usr/lib/methods/cfgsys, or
    /usr/lib/methods/cfgbus failed.
604 Configuring physical network boot device.
605 Configuration of physical network boot device failed.
606 Running /usr/sbin/ifconfig on logical network boot device.
607 /usr/sbin/ifconfig failed.
608 Attempting to retrieve the client.info file with tftp.Note
    that a flashing 608 indicates multiple attempt(s) to retrieve 
    the client_info file are occurring.
609 The client.info file does not exist or it is zero length.
610 Attempting remote mount of NFS file system.
611 Remote mount of the NFS file system failed.
612 Accessing remote files; unconfiguring network boot device.
614 Configuring local paging devices.
615 Configuration of a local paging device failed.
616 Converting from diskless to dataless configuration.
617 Diskless to dataless configuration failed.
618 Configuring remote (NFS) paging devices.
619 Configuration of a remote (NFS) paging device failed.
620 Updating special device files and ODM in permanent
    filesystem with data from boot RAM filesystem.
622 Boot process configuring for operating system installation.
650 IBM SCSD disk drive being configured
668 25MB ATM MCA Adapter being configured
680 POWER GXT800M Graphics Adapter
689 4.5GB Ultra SCSI Single Ended Disk Drive being configured
690 9.1GB Ultra SCSI Single Ended Disk Drive being configured
694 Eicon ISDN DIVA MCA Adapter for PowerPC Systems
700 Progress indicator. A 1.1 GB 8-bit SCSI disk drive being
    identified or configured.
701 Progress indicator. A 1.1 GB 16-bit SCSI disk drive is being
    identified or configured.
702 Progress indicator. A 1.1 GB 16-bit differential SCSI disk
    drive is being identified or configured.
703 Progress indicator. A 2.2 GB 8-bit SCSI disk drive is being
    identified or configured.
704 Progress indicator. A 2.2 GB 16-bit SCSI disk drive is being
    identified or configured.
705 The configuration method for the 2.2 GB 16-bit differential
    SCSI disk drive is being run. If an irrecoverable error occurs, 
    the system halts.
706 Progress indicator. A 4.5 GB 16-bit SCSI disk drive is being
    identified or configured.
707 Progress indicator. A 4.5 GB 16-bit differential SCSI disk
    drive is being identified or configured.
708 Progress indicator. A L2 cache is being identified or
    configured.
710 POWER GXT150M graphics adapter being identified or configured.
711 Unknown adapter being identified or configured.
712 Graphics slot bus configuration is executing.
713 The IBM ARTIC960 device is being configured.
714 A video capture adapter is being configured.
715 The Ultimedia Services audio adapter is being configured.
    This LED displays briefly on the panel.
717 TP Ethernet Adapter being configured.
718 GXT500 Graphics Adapter being configured.
720 Unknown read/write optical drive type being configured.
721 Unknown disk or SCSI device being identified or configured.
722 Unknown disk being identified or configured.
723 Unknown CD-ROM being identified or configured.
724 Unknown tape drive being identified or configured.
725 Unknown display adapter being identified or configured.
726 Unknown input device being identified or configured.
727 Unknown async device being identified or configured.
728 Parallel printer being identified or configured.
729 Unknown parallel device being identified or configured.
730 Unknown diskette drive being identified or configured.
731 PTY being identified or configured.
732 Unknown SCSI initiator type being configured.
733 7GB 8mm tape drive being configured.
734 4x SCSI-2 640MB CD-ROM Drive
741 1080MB SCSI Disk Drive
745 16GB 4mm Tape Auto Loader
748 MCA keyboard/mouse adapter being configured.
749 7331 Model 205 Tape Library
754 1.1GB 16-bit SCSI disk drive being configured.
755 2.2GB 16-bit SCSI disk drive being configured.
756 4.5GB 16-bit SCSI disk drive being configured.
757 External 13GB 1.5M/s 1/4 inch tape being configured.
772 4.5GB SCSI F/W Disk Drive
773 9.1GB SCSI F/W Disk Drive
774 9.1GB External SCSI Disk Drive
77c Progress indicator. A 1.0 GB 16-bit SCSI disk drive being
    identified or configured.
783 4mm DDS-2 Tape Autoloader
789 2.6GB External Optical Drive
794 10/100MB Ethernet PX MC Adapter
797 Turboways 155 UTP/STP ATM Adapter being identified or
    configured.
798 Video streamer adapter being identified or configured.
800 Turboways 155 MMF ATM Adapter being identified or configured.
803 7336 Tape Library Robotics being configured
804 8x Speed SCSI-2 CD ROM drive being configured
807 SCSI Device Enclosure being configured
808 System Interface Full (SIF) configuration process
80c SSA 4-Port Adapter being identified or configured.
811 Processor complex being identified or configured.
812 Memory being identified or configured.
813 Battery for time-of-day, NVRAM, and so on being identified
    or configured, or system I/O control logic being identified or 
    configured.
814 NVRAM being identified or configured.
815 Floating-point processor test
816 Operator panel logic being identified or configured.
817 Time-of-day logic being identified or configured.
819 Graphics input device adapter being identified or configured.
821 Standard keyboard adapter being identified or configured.
823 Standard mouse adapter being identified or configured.
824 Standard tablet adapter being identified or configured.
825 Standard speaker adapter being identified or configured.
826 Serial Port 1 adapter being identified or configured.
827 Parallel port adapter being identified or configured.
828 Standard diskette adapter being identified or configured.
831 3151 adapter being identified or configured, or Serial Port
    2 being identified or con-figured.
834 64-port async controller being identified or configured.
835 16-port async concentrator being identified or configured.
836 128-port async controller being identified or configured.
837 16-port remote async node being identified or configured.
838 Network Terminal Accelerator Adapter being identified or
    configured.
839 7318 Serial Communications Server being configured.
841 8-port async adapter (EIA-232) being identified or configured.
842 8-port async adapter (EIA-422A) being identified or configured.
843 8-port async adapter (MIL-STD 188) being identified or
    configured.
844 7135 RAIDiant Array disk drive subsystem controller being
    identified or configured.
845 7135 RAIDiant Array disk drive subsystem drawer being
    identified or configured.
846 RAIDiant Array SCSI 1.3GB Disk Drive
847 16-port serial adapter (EIA-232) being identified or configured.
848 16-port serial adapter (EIA-422) being identified or configured.
849 X.25 Interface Co-Processor/2 adapter being identified or
    configured.
850 Token-Ring network adapter being identified or configured.
851 T1/J1 Portmaster adapter being identified or configured.
852 Ethernet adapter being identified or configured.
854 3270 Host Connection Program/6000 connection being
    identified or configured.
855 Portmaster Adapter/A being identified or configured.
857 FSLA adapter being identified or configured.
858 5085/5086/5088 adapter being identified or configured.
859 FDDI adapter being identified or configured.
85c Progress indicator. Token-Ring High-Performance LAN adapter
    is being identified or configured.
861 Optical adapter being identified or configured.
862 Block Multiplexer Channel Adapter being identified or
    configured.
865 ESCON Channel Adapter or emulator being identified or
    configured.
866 SCSI adapter being identified or configured.
867 Async expansion adapter being identified or configured.
868 SCSI adapter being identified or configured.
869 SCSI adapter being identified or configured.
870 Serial disk drive adapter being identified or configured.
871 Graphics subsystem adapter being identified or configured.
872 Grayscale graphics adapter being identified or configured.
874 Color graphics adapter being identified or configured.
875 Vendor generic communication adapter being configured.
876 8-bit color graphics processor being identified or configured.
877 POWER Gt3/POWER Gt4 being identified or configured.
878 POWER Gt4 graphics processor card being configured.
879 24-bit color graphics card, MEV2
880 POWER Gt1 adapter being identified or configured.
887 Integrated Ethernet adapter being identified or configured.
889 SCSI adapter being identified or configured.
890 SCSI-2 Differential Fast/Wide and Single-Ended Fast/Wide
    Adapter/A.
891 Vendor SCSI adapter being identified or configured.
892 Vendor display adapter being identified or configured.
893 Vendor LAN adapter being identified or configured.
894 Vendor async/communications adapter being identified or
    configured.
895 Vendor IEEE 488 adapter being identified or configured.
896 Vendor VME bus adapter being identified or configured.
897 S/370 Channel Emulator adapter being identified or configured.
898 POWER Gt1x graphics adapter being identified or configured.
899 3490 attached tape drive being identified or configured.
89c Progress indicator. A multimedia SCSI CD-ROM is being
    identified or configured.
901 Vendor SCSI device being identified or configured.
902 Vendor display device being identified or configured.
903 Vendor async device being identified or configured.
904 Vendor parallel device being identified or configured.
905 Vendor other device being identified or configured.
908 POWER GXT1000 Graphics subsystem being identified or configured.
910 1/4GB Fibre Channel/266 Standard Adapter being identified or
    configured.
911 Fibre Channel/1063 Adapter Short Wave
912 2.0GB SCSI-2 differential disk drive being identified or
    configured.
913 1.0GB differential disk drive being identified or configured.
914 5GB 8 mm differential tape drive being identified or configured.
915 4GB 4 mm tape drive being identified or configured.
916 Non-SCSI vendor tape adapter being identified or configured.
917 Progress indicator. 2.0GB 16-bit differential SCSI disk
    drive is being identified or configured.
918 Progress indicator. 2GB 16-bit single-ended SCSI disk drive
    is being identified or configured.
920 Bridge Box being identified or configured.
921 101 keyboard being identified or configured.
922 102 keyboard being identified or configured.
923 Kanji keyboard being identified or configured.
924 Two-button mouse being identified or configured.
925 Three-button mouse being identified or configured.
926 5083 tablet being identified or configured.
927 5083 tablet being identified or configured.
928 Standard speaker being identified or configured.
929 Dials being identified or configured.
930 Lighted program function keys (LPFK) being identified or
    configured.
931 IP router being identified or configured.
933 Async planar being identified or configured.
934 Async expansion drawer being identified or configured.
935 3.5-inch diskette drive being identified or configured.
936 5.25-inch diskette drive being identified or configured.
937 An HIPPI adapter is being configured.
942 POWER GXT 100 graphics adapter being identified or configured.
943 Progress indicator. 3480 and 3490 control units attached to
    a System/370 Channel Emulator/A adapter are being identified or 
    configured.
944 100MB ATM adapter being identified or configured
945 1.0GB SCSI differential disk drive being identified or
    configured.
946 Serial port 3 adapter is being identified or configured.
947 Progress indicator. A 730MB SCSI disk drive is being configured.
948 Portable disk drive being identified or configured.
949 Unknown direct bus-attach device being identified or configured.
950 Missing SCSI device being identified or configured.
951 670MB SCSI disk drive being identified or configured.
952 355MB SCSI disk drive being identified or configured.
953 320MB SCSI disk drive being identified or configured.
954 400MB SCSI disk drive being identified or configured.
955 857MB SCSI disk drive being identified or configured.
956 670MB SCSI disk drive electronics card being identified or
    configured.
957 120MB DBA disk drive being identified or configured.
958 160 MB DBA disk drive being identified or configured.
959 160MB SCSI disk drive being identified or configured.
960 1.37GB SCSI disk drive being identified or configured.
964 Internal 20GB 8mm tape drive identified or configured.
968 1.0GB SCSI disk drive being identified or configured.
970 Half-inch, 9-track tape drive being identified or configured.
971 150MB 1/4-inch tape drive being identified or configured.
972 2.3GB 8 mm SCSI tape drive being identified or configured.
973 Other SCSI tape drive being identified or configured.
974 CD-ROM drive being identified or configured.
975 Progress indicator. An optical disk drive is being
    identified or configured.
977 M-Audio Capture and Playback Adapter being identified or
    configured.
981 540MB SCSI-2 single-ended disk drive being identified or
    configured.
984 1GB 8-bit disk drive being identified or configured.
985 M-Video Capture Adapter being identified or configured.
986 2.4GB SCSI disk drive being identified or configured.
987 Progress indicator. Enhanced SCSI CD-ROM drive is being
    identified or configured.
989 200MB SCSI disk drive being identified or configured.
990 2.0GB SCSI-2 single-ended disk drive being identified or
    configured.
991 525MB 1/4-inch cartridge tape drive being identified or
    configured.
994 5GB 8 mm tape drive being identified or configured.
995 1.2GB 1/4 inch cartridge tape drive being identified or
    configured.
996 Progress indicator. Single-port, multi-protocol
    communications adapter is being identified or configured.
997 FDDI adapter being identified or configured.
998 2.0GB4 mm tape drive being identified or configured.
999 7137 or 3514 Disk Array Subsystem being configured.
D81 T2 Ethernet Adapter being configured.

Diagnostic Load Progress Indicators
-----------------------------------

Note: When a lowercase c is listed, it displays in the lower
half of the seven-segment
character position.

c00 AIX Install/Maintenance loaded successfully.
c01 Insert the first diagnostic diskette.
c02 Diskettes inserted out of sequence.
c03 The wrong diskette is in diskette drive.
c04 The loading stopped with a nonrecoverable error.
c05 A diskette error occurred.
c06 The rc.boot configuration shell script is unable to
    determine type of boot.
c07 Insert the next diagnostic diskette.
c08 RAM file system started incorrectly.
c09 The diskette drive is reading or writing a diskette.
c20 An unexpected halt occurred, and the system is configured to
    enter the kernel debug program instead of entering a system dump.
c21 The ifconfig command was unable to configure the network for
    the client network host.
c22 The tftp command was unable to read client's ClientHostName
    info file during a client network boot.
c24 Unable to read client's ClientHostName.info file during a
    client network boot.
c25 Client did not mount remote miniroot during network install.
c26 Client did not mount the /usr file system during the network
    boot.
c29 The system was unable to configure the network device.
c31 Select the console display for the diagnostics. To select No
    console display, set the key mode switch to Normal then to Service. 
    The diagnostic programs will then load and run the diagnostics 
    automatically.
c32 A direct-attached display (HFT) was selected.
c33 A tty terminal attached to serial ports S1 or S2 was selected.
c34 A file was selected. The console messages store in a file.
c40 Configuration files are being restored.
c41 Could not determine the boot type or device.
c42 Extracting data files from diskette.
c43 Cannot access the boot/install tape.
c44 Initializing installation database with target disk information.
c45 Cannot configure the console.
c46 Normal installation processing.
c47 Could not create a physical volume identifier (PVID) on disk.
c48 Prompting you for input.
c49 Could not create or form the JFS log.
c50 Creating root volume group on target disks.
c51 No paging devices were found.
c52 Changing from RAM environment to disk environment.
c53 Not enough space in the /tmp directory to do a preservation
    installation.
c54 Installing either BOS or additional packages.
c55 Could not remove the specified logical volume in a
    preservation installation.
c56 Running user-defined customization.
c57 Failure to restore BOS.
c58 Displaying message to turn the key.
c59 Could not copy either device special files, device ODM, or
    volume group information from RAM to disk.
c61 Failed to create the boot image.
c62 Loading platform dependent debug files
c63 Loading platform dependent data files
c64 Failed to load platform dependent data files
c70 Problem Mounting diagnostic CDROM disc
c99 Diagnostics have completed. This code is only used when
    there is no console.


TITLE    : Common Boot Time LEDs and Their Solution
OS LEVEL : AIX
DATE     : 17/11/99
VERSION  : 1.0
----------------------------------------------------------------------------

Common Boot Time LEDs and Their Solution

LED 201 - Damaged Boot Image
----------------------------
1. Access your rootvg using a maintenance shell.
2. Check / and /tmp filesystems. If they are almost full create more space.
3. Determine the boot disk by using the command lslv -m hd5
4. Re-create boot image using bosboot -a -d /dev/hdiskn
5. Check for CHECKSTOP errors in the error log. If such errors are
   found, it is probably failing hardware.
6. Shutdown and restart the system.

LED 223-229 - Invalid Boot List
-------------------------------
1. Set the key mode switch to service (F5 for systems without keylock)
   and power up the machine.
2. If display continues normally, change the key mode switch to Normal
   and continue with step 3. If you do not get the prompt, go to step 4.
3. When you get the login prompt, login and change the bootlist.
   Continue with step 7.
4. Access your rootvg using a maintenance shell and continue with step 5.
5. Determine the boot disk by using the command lslv -m hd5.
6. Change the bootlist. 
7. Shutdown and restart your system.

LED 551, 555, and 557 - Corrupted File System, Corrupted JFS log, and so on.
----------------------------------------------------------------------------
1. Access your rootvg using a maintenance shell, access the rootvg before 
   mounting any file systems (Option 2 on the Maintenance screen).
2. Verify and correct the file systems as follows:
   fsck -y /dev/hd1
   fsck -y /dev/hd2
   fsck -y /dev/hd3
   fsck -y /dev/hd4
   fsck -y /dev/hd9var
3. Format the JFS log again by using the command:
   /usr/sbin/logform /dev/hd8
4. Use lslv -m hd5 to find out the boot disk.
5. Recreate boot image by using the command:
   bosboot -a -d /dev/hdiskn
   Where n is the disk number of the disk containing boot logical volume.

LED 552, 554, and 556 - Super Block Corrupted or Corrupted Customized ODM Database
----------------------------------------------------------------------------------
1. Repeat steps 1 through 2 for LEDs 551, 555, and 557.
2. If fsck indicates that block 8 is corrupted, the super block for the file
   system is corrupted and needs to be repaired. Enter the command:
   dd count=1 bs=4k skip=31 seek=1 if=/dev/hdn of=/dev/hdn
   where n is the number of the file system.
3. Rebuild your JFS log by using the command:
   /usr/sbin/logform /dev/hd8
4. If this solves the problem, stop here otherwise continue with step 5.
5. Your ODM database is corrupted. Restart your system and Access your 
   rootvg using a maintenance shell, access the rootvg before 
   mounting any file systems (Option 2 on the Maintenance screen).
6. Mount the root and usr file system as follows:
   mount /dev/hd4 /mnt
   mount /usr
7. Copy system configuration to a back up directory:
   mkdir /mnt/etc/objrepors/backup
   cp /mnt/etc/objrepors/Cu* /mnt/etc/objrepos
8. Copy configuration from RAM file system as follows:
   cp /etc/objrepos/Cu* /mnt/etc/objrepos
9. Unmount all file systems by using the umount all command.
10. Determine bootdisk by using the lslv -m hd5 command.
11. Save the clean ODM to the boot logical volume by using the command:
    savebase -d/dev/hdiskn
12. Reboot, if system does not come up, reinstall BOS.

LED 553 - Corrupted /etc/inittab file
-------------------------------------
1. Access the rootvg with all file systems mounted.
2. Check for free space in /, /var and /tmp by using df command.
3. Check the /etc/inittab file and correct the inittab problems if there is
   one empty inittab file, missing inittab file or wrong entry in inittab file.
4. Check problems with:
   /etc/environment file
   /bin/sh
   /bin/bsh
   /etc/fsck
   /etc/profile
   /.profile
5. Shutdown the system and reboot.


##############################################################

SECTION 2: IBM lpar reference codes:

##############################################################


(A2xx, B2xx) Logical partition reference codes

When the server posts these SRCs, you can find them in the Serviceable Event View or the view that you use 
to see informational logs (such as the Product Activity Log or ASM).

Characters 3 and 4 of word 1 are the partition ID of the logical partition with the problem. 
If the SRC begins with A2xx, no service action is required. If the SRC begins with B2xx, find 
the next 4 characters of the SRC (called the unit reference code) in the following table.
Table 1. (A2xx, B2xx) Logical partition reference codes

Reference Code Description/Action Perform all actions before exchanging Failing Items Failing Item 
1150 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format).

This is a partitioning configuration problem. The LPARCFG Symbolic FRU will help correct the problem.

If the problem persists, call your next level of support.
 LPARCFG
 
1225 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format).

The partition attempted to IPL prior to the platform fully initializing. Retry the partition IPL after the platform IPL 
has fully completed and the platform is not in standby mode. If that IPL fails, call your next level of support.
 SVCDOCS
 

1230 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format).

This is a partitioning configuration problem. The partition is lacking the necessary resources to IPL.

This error might occur when you shut down a partition that is set to automatically IPL and then turn the managed system off 
and back on. When the partition automatically IPLs, it uses the resources specified in PHYP NVRAM, and this error 
occurs when the server does not find the exact resources specified in NVRAM. The solution is to activate the partition 
by using the partition profile on the HMC. The HMC applies the values in the profile to NVRAM. When the partition IPLs, 
it uses the resources specified in the profile.
 LPARCFG
LICCODE
 
1260 A problem occurred during the IPL of a partition. 
The partition could not IPL at the Timed Power On setting because the IPL setting of the partition was not set to Normal. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format).
 SVCDOCS
 
1265 A problem occurred during the IPL of a partition. 
The partition could not IPL. The partition ID is characters 3 and 4 of the B2xx reference code in 
word 1 of the SRC (in hexadecimal format). If characters 3 and 4 are both zero, then the partition ID is 
in extended word one as LP=xxx (in decimal format).

An operating system MSD IPL was attempted with the IPL side on D-mode. This is not a valid operating system IPL scenario, 
and the IPL will be halted. This SRC is usually seen when a D-mode SLIC install fails and attempts an MSD.
 SVCDOCS
 
1266 A problem occurred during the IPL of a partition. 
The partition could not IPL. The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC 
(in hexadecimal format). If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format).

You are attempting to IPL an operating system that is not supported.
 SVCDOCS
 
1280 A problem occurred during a partition Main Storage Dump. 
A mainstore dump IPL did not complete due to configuration mismatch. Contact your next level of support.
 NEXTLVL
 
1281 A partition memory error occurred 
An attempt to perform a partition dump failed. A partition memory error occurred. The failed memory will 
no longer be used. The partition dump was terminated. The partition ID is in extended word one as 
LP=xxx (in decimal format). Re-IPL the partition.
 SVCDOCS
 
1310 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
No alternate (D-mode) IPL IOP was selected. The IPL will attempt to continue, but there may not be enough 
information to find the correct D-mode load source.

Have the customer configure an alternate IPL IOP for the partition. Then retry the partition IPL.
 SVCDOCS
 
1320 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
No default load source IOP was selected for an A/B-mode IPL. The IPL will attempt to continue, but there may 
not be enough information to find the correct load source.

Have the customer configure a load source IOP for the partition. Then retry the partition IPL.
 SVCDOCS
 
1321 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
The IOA for the load source device needed an IOP, and none was detected. Check your LPAR configuration and make sure 
the correct slot is specified for the IPL load source. Then retry the partition IPL.
 SVCDOCS
 
1322 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
During the partition IPL, code tried to determine if the device in a slot was an I/O Processor or an I/O Adapter. 
That check failed. Check your LPAR configuration and make sure that the correct slot is specified for the IPL load source. 
Then retry the partition IPL. If this does not resolve the problem, perform LICIP15.
 SVCDOCS
 
2048 A problem occurred during a partition Main Storage Dump. 
A mainstore dump IPL did not complete due to a copy error. Contact your next level of support.
 NEXTLVL
 
2058 A problem occurred during a partition Main Storage Dump. 
A mainstore dump IPL did not complete due to a copy error. Contact your next level of support.
 NEXTLVL
 
2250, 2300 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
A slot that was needed for the partition was unavailable. See the Symbolic FRU SLOTUSE for more information on the cause of this error.
 SLOTUSE
 
2310, 2320, 2425 to 2426 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
The platform LIC for this partition attempted an operation. There was a failure. Contact your next level of support.
 NEXTLVL
 
2475 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
A slot that was needed for the partition was either empty or the device in the slot has failed. See the Symbolic 
FRU SLOTUSE for more information on the cause of this error.

If you have a RAID enablement card (CCIN 5709) on your system, it will disable an embedded SCSI adapter. If that embedded 
slot is called out in the error, you can safely ignore this error.
 SLOTUSE
 
2485 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
The platform LIC for this partition attempted an operation. There was a failure. Contact your next level of support.
 NEXTLVL
 
3000 System log entry only, no service action required 
A user requested an immediate termination and main store dump of a partition. The partition ID is in extended word 
one as LP=xxx in decimal format.
 
 
3081 A problem occurred during the IPL of a partition. 
IPL did not complete due to a copy error. Contact your next level of support.
 LICCODE
 
3110 A problem occurred during the IPL of a partition. 
The search for a valid load source device was exhausted. The partition ID is characters 3 and 4 of the B2xx reference code 
in word 1 of the SRC (in hexadecimal format). If characters 3 and 4 are both zero, then the partition ID is in extended word 
one as LP=xxx (in decimal format). Perform LICIP15.
 SVCDOCS
 
3113 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
A problem occurred on the path to the load source for the partition.

If present, look in the Serviceable Event View for a B7xx xxxx during the partition's IPL. Correct that error and retry the partition IPL.
 SVCDOCS
 
3114 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format).

The B2xx xxxx SRC Format is Word 1: B2xx3114, Word 3: Bus, Word 4: Board, Word 5: Card.
 NEXTLVL
 
3120 System log entry only, no service action required 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
Retry count exceeded. This is logged for each unsuccessful attempt to IPL with a loadsource candidate. 
If the IPL fails, look for other serviceable errors.
 
 
3123 System log entry only, no service action required 
 
3125 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format).

This is a platform LIC main store utilization problem. The platform LIC could not obtain a segment of main storage 
within the platform's main store to use for managing the creation of a partition.
 LICCODE
 
3128 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
An unexpected failure return code was returned when attempting to query the IOA slots that are assigned to an IOP.

Look for B700 69xx errors in the Serviceable Event View and work those errors.
 NEXTLVL
 
3130 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
If word 3 is zero, then this SRC is informational and can be ignored.

Otherwise there is a problem in the platform LIC. A nonzero bus number has no associated bus object.

Look for B700 69xx errors in the Serviceable Event View and work those errors.

If there are no serviceable B700 69xx errors, or if correcting the errors did not correct this problem, contact your next level of support.
 NEXTLVL
 
3135 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
An unknown bus type was detected.
 NEXTLVL
 
3140 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
The load source IOP is not owned by the partition. This is a configuration problem in the partition. 
Have the customer reconfigure the partition to have the intended load source IOP.

If there is not a configuration problem then contact your next level of support.
 SVCDOCS
 
3141 System log entry only, no service action required 
The IOP in the slot used for the last successful IPL of the operating system was replaced with an I/O Adapter. 
The IPL will continue by searching for a valid load source device.

Check the LPAR configuration if required, and ensure that the tagged I/O for the partition is correct.
 
 
3142 System log entry only, no service action required 
The I/O Adapter in the slot used for the last successful IPL of the operating system was replaced with an I/O Processor. 
The IPL will continue by searching for a valid load source device.

Check the LPAR configuration if required, and ensure that the tagged I/O for the partition is correct.
 
 
3143 System log entry only, no service action required 
The I/O Adapter in the slot used for the last successful IPL of the operating system was removed. 
The IPL will continue by searching for a valid load source device.

Check the LPAR configuration if required, and ensure that the tagged I/O for the partition is correct.
 
 
3144 System log entry only, no service action required 
The I/O Processor in the slot used for the last successful IPL of the operating system was removed. 
The IPL will continue by searching for a valid load source device.

Check the LPAR configuration if required, and ensure that the tagged I/O for the partition is correct.
 
 
3200 System log entry only, no service action required 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format).

Look for a SRC in the Serviceable Event View logged at the time the partition was performing an IPL.

This error indicates a failure during a search for the load source. There may be a number of these failures 
prior to finding a good load source. This is normal. If a B2xx3110 error is logged, a B2xx3200 may be posted to the control panel. 
Work the B2xx3110 error in the Serviceable Event View. If the system IPL hangs at B2xx3200 and you cannot check the SRC history, 
perform the actions indicated for the B2xx3110 SRC.
 
 
4158 System log entry only, no service action required 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format).

Look for a SRC in the Serviceable Event View logged at the time the partition was performing an IPL.

This error indicates a failure during a search for the load source. It is usual for a number of these failures 
to occur prior to finding a valid load source. This is normal. If a B2xx3110 error is logged, a B2xx3200 may be 
posted to the control panel. Work the B2xx3110 error in the Serviceable Event View. If the system IPL hangs at B2xx3200 
and you cannot check the SRC history, perform the actions indicated for the B2xx3110 SRC.
 
 
5106 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
There is not enough space to contain the partition main storage dump.

Contact your next level of support.
 NEXTLVL
 
5109 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
There was a partition main storage dump problem. Contact your next level of support.
 NEXTLVL
 
5114 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
There is not enough space to contain the partition main storage dump. Contact your next level of support.
 NEXTLVL
 
5115 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
There was an error reading the partition's main storage dump from the partition's load source into main storage.
 NEXTLVL
 
5117 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
A partition main storage dump has occurred but cannot be written to the load source device because a valid dump already exists.

Use the Main Storage Dump Manager to rename or copy the current main storage dump.
 SVCDOCS
 
5121 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
There was an error writing the partition's main storage dump to the partition's load source.
 NEXTLVL
 
5122 to 5123 System log entry only, no service action required 
A problem occurred during the IPL of a partition.

The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format).

An error occurred when writing the partition's main storage dump to the partition's load source. No service action required.
 
 
5135 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
There was an error writing the partition's main storage dump to the partition's load source.
 NEXTLVL
 
5137 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
There was an error writing the partition's main storage dump to the partition's load source.
 NEXTLVL
 
5145 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
There was an error writing the partition's main storage dump to the partition's load source.
 NEXTLVL
 
5148 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
An error occurred while doing a main storage dump that would have caused another main storage dump.

Contact your next level of support.
 NEXTLVL
 
6006 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
A platform LIC error occurred when the partition's memory initialized. The IPL will not continue.

Contact your next level of support.
 NEXTLVL
 
6012 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
The partition's LID failed to completely load into the partition's mainstore area.

Contact your next level of support.
 NEXTLVL
 
6015 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format).

The load source media is corrupted or not valid.
 LSERROR
 
6025 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format).

This is a problem with the load source media being corrupt or not valid.
 LSERROR
 
6027 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format).

A failure occurred when allocating memory for an internal object used for LID load operations. Ensure the partition 
was allocated enough main storage, verify that no memory leaks are present, and then retry the operation.
 NEXTLVL
 
6110 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
Error on load source device.
 LSERROR
 
690A A problem occurred during the IPL of a partition. 
An error occurred while copying Open Firmware into the partition load area. Contact your next level of support.
 NEXTLVL
 
7200 System log entry only, no service action required 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
An error condition was encountered when communicating with the load source I/O Processor for the partition 
identified in the xx field of the B2xx SRC.

This informational error indicates a failure resetting the I/O Processor in the preceding B2xx3200 error. 
This may be normal. If there is a hardware failure there will be a different serviceable event. If the system IPL hangs 
at B2xx7200 and you cannot check the SRC history, perform the actions indicated for the B2xx3110 SRC.
 
 
8080 System log entry only, no service action required 
 
8081 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
An internal LIC timeout has occurred. The partition may continue to IPL but it may experience problems while running.
 LICCODE
 
8105 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
There was a failure loading the VPD areas of the partition. Possible causes are: 

Corrupted/unsupported load source media 
Insufficient resources allocated to the partition 
Unsupported partition configuration by the operating system
If the problem is due to media, replace the load source media. If the problem is due to insufficient resources, 
allocate enough resources to the partition. If the problem is due to unsupported partition configuration, 
correct the partition configuration.
 SVCDOCS
 
8107 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
There was a problem getting a segment of main storage in the platform's main store.
 LICCODE
 
8109 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
A failure occurred. The IPL is terminated. Ensure that there is enough memory to IPL the partition.
 LICCODE
 
8112 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
A failure occurred. The IPL is terminated.
 LICCODE
 
8113 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
A problem occurred on the path to the load source for the partition.

There was an error mapping memory for the partition's IPL. Call your next level of support.
 LICCODE
 
8114 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
A problem occurred on the path to the load source for the partition.

There was a failure verifying VPD for the partition's resources during IPL. Call your next level of support.
 LICCODE
 
8115 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
There was a low level partition to partition communication failure.
 LICCODE
 
8117, 8121, 8123, 8125, 8127, 8129 A problem occurred during the IPL of a partition. 
Partition did not IPL due to platform Licensed Internal Code error.

Contact your next level of support.
 NEXTLVL
 
813A A problem occurred during the IPL of a partition. 
Ensure that the console device cables are connected properly. If the cables are already connected properly, replace the cables. 
Re-IPL the partition. If the problem reoccurs, contact your next level of support.
 SVCDOCS
 
A100 to A101 A problem occurred after a partition ended abnormally. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
This partition could not stay running and shut itself down.

Work any error logs in the Serviceable Event View. If there are no errors, contact your next level of support.
 SVCDOCS
 
B07B System log entry only, no service action required 
 
B215 A problem occurred after a partition ended abnormally. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
There was a communications problem between this partition's service processor and the platform's service processor.

The platform will need to be re-IPLed before that partition can be used. Call your next level of support.
 NEXTLVL
 
C1F0 A problem occurred during a power off a partition 
Internal platform Licensed Internal Code error occurred during partition shutdown or re-IPL.

Contact your next level of support.
 NEXTLVL
 
D150 A problem occurred after a partition ended abnormally. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
There was a communications problem between this partition and code that handles resource allocation. Call your next level of support.
 LICCODE
 
E0AA A problem occurred during the IPL of a partition. 
Ensure that the console device cables are connected properly. If the cables are already connected properly, replace the cables. 
Re-IPL the partition. If the problem reoccurs, contact your next level of support.
 SVCDOCS
 
F001 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). An operation has timed out.

Ignore this error if there are other serviceable errors. Work those error logs for this partition and for the platform 
from the Serviceable Event View. If there are no errors, contact your next level of support.
 SVCDOCS
 
F003 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
Partition processors did not start LIC within the timeout window.

Capture a Partition Dump and call your next level of support.
 NEXTLVL
 
F004 A system request to power off a partition failed 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
The partition did not respond to a system request to power off the partition. This partition had a communications problem.

If the partition is an i5/OS partition, capture a Partition Dump. Contact your next level of support.
 NEXTLVL
 
F005 A system request to power off a partition failed 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
The partition did not respond to a system request to power off the partition. This partition had a communications problem.

If the partition is an i5/OS partition, perform a Partition Dump and contact your next level of support.

For all other partition types, a Partition Dump is not supported. If the system is Hardware Management Console (HMC) 
or Integrated Virtualization Manager (IVM) controlled, do an immediate partition power off. If the system is not HMC 
or IVM controlled, perform a Function 8 on the control panel. After the partition has powered off, re-IPL the partition, 
collect error logs and contact your next level of support.
 NEXTLVL
 
F006 A problem occurred during the IPL of a partition. 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
The code load operation for the partition's IPL timed out.

Work any error logs for this partition in the Serviceable Event View. If there are no errors, contact your next level of support.
 SVCDOCS
 
F007 A problem occurred during a power off a partition 
The partition ID is characters 3 and 4 of the B2xx reference code in word 1 of the SRC (in hexadecimal format). 
If characters 3 and 4 are both zero, then the partition ID is in extended word one as LP=xxx (in decimal format). 
A problem occurred on the path to the load source for the partition.

A timeout occurred during the process of trying to stop a partition from running. Contact your next level of support.
 LICCODE
 
F008 A problem occurred during the IPL of a partition. 
During an IPL, a timeout occurred while waiting for a ready message from the partition.

Look for other errors. Re-IPL the partition to recover.
 
 
F009 System log entry only, no service action required 
During an IPL, a timeout occurred while waiting for a response to a message.

Look for other errors. Re-IPL the partition to recover.
 
 
F00A to F00B System log entry only, no service action required 
During an IPL, a timeout occurred while waiting for a response to a message.

Look for other errors. If this SRC is displayed in the operator panel, then panel function 34 might be used to retry 
the current IPL while the partition is still in the failed state.
 
 
F00C System log entry only, no service action required 
During an IPL, a timeout occurred while waiting for a response to a message.

Look for other errors. If this SRC is displayed in the operator panel, then panel function 34 might be used to retry 
the current IPL while the partition is still in the failed state.
 
 
F00D Timeout occurred during a main store dump IPL 
If this SRC is displayed in the operator panel during a main store dump IPL, then panel function 34 might be used to retry 
the current main store dump IPL while the partition is still in the failed state.


##############################################################

SECTION 3: IBM Partition firmware reference (error) codes:

##############################################################


(BAxx) Partition firmware reference (error) codes
The partition firmware detected a failure. The first eight characters in the display represent the SRC. 
Any additional characters represent the associated location code. Record the location code as well as the reference code, 
then find the SRC in the following table.

Table 1. (BAxx) Partition firmware reference (error) codes

Reference Code Description/Action Perform all actions before exchanging Failing Items Failing Item 
BA000010 The device data structure is corrupted FWFLASH 
BA000020 Incompatible firmware levels were found 
Reflash the platform firmware.
  
BA000030 An lpevent communication failure occurred FWFLASH 
BA000032 The firmware failed to register the lpevent queues FWFLASH 
BA000034 The firmware failed to exchange capacity and allocate lpevents FWFLASH 
BA000038 The firmware failed to exchange virtual continuation events FWFLASH 
BA000040 The firmware was unable to obtain the RTAS code lid details FWFLASH 
BA000050 The firmware was unable to load the RTAS code lid FWFLASH 
BA000060 The firmware was unable to obtain the open firmware code lid details FWFLASH 
BA000070 The firmware was unable to load the open firmware code lid FWFLASH 
BA000080 The user did not accept the license agreement 
There is no further action required. If the user did not accept the license agreement, the system will not function.
  
BA000081 Failed to get the firmware license policy FWFLASH 
BA000082 Failed to set the firmware license policy FWFLASH 
BA010000 There is insufficient information to boot the partition FWIPIPL 
BA010001 The client IP address is already in use by another network device FWIPIPL 
BA010002 Cannot get gateway IP address 
If the system is a model 185 or A50, refer to partition firmware progress code E174. 
For all other systems, refer to partition firmware progress code CA00E174.
 FWHOST 
BA010003 Cannot get server hardware address 
If the system is a model 185 or A50, refer to partition firmware progress code E174. 
For all other systems, refer to partition firmware progress code CA00E174.
 FWHOST 
BA010004 Bootp failed  
BA010005 File transmission (TFTP) failed 
Refer to partition firmware progress code CA00E174
 FWHOST
FWADIPL
 
BA010006 The boot image is too large FWADIPL 
BA020001 Partition firmware password entry error 
Reenter the password.
  
BA020009 Invalid password entered - system locked 
A password was entered incorrectly three times. Deactivate the partition using the HMC, 
then reactivate it. When asked for the password, enter the correct password.
  
BA030011 RTAS attempt to allocate memory failed FWFWPBL 
BA04000F Self test failed on device; no error or location code information available 
If there was a location code reported with the error, replace the device specified by the location code.
 NEXTLVL 
BA040010 Self test failed on device; cannot locate package NEXTLVL 
BA040020 The machine type and model are not recognized by the server firmware 
Check for server firmware updates, and apply them, if available.
 NEXTLVL 
BA040030 The firmware was not able to build the UID properly for this system. As a result, 
problems may occur with the licensing of the AIX� operating system. 
Using the Advanced System Management Interface (ASMI) menus, ensure that the machine type, 
model, and serial number in the VPD for this system are correct. 
If this is a new system, check for server firmware updates and apply them, if available.
  
BA040035 The firmware was unable to find the "plant of manufacture" in the VPD. This may cause problems 
with the licensing of the AIX operating system. 
Verify the that machine type, model, and serial number are correct for this system. If this is a new system, 
check for server firmware updates and apply them, if available.
  
BA040040 Setting the machine type, model, and serial number failed. FWFWPBL 
BA040050 The h-call to switch off the boot watchdog timer failed. FWFWPBL 
BA040060 Setting the firmware boot side for the next boot failed. FWFWPBL 
BA050001 Rebooting a partition in logical partition mode failed. FWFWPBL 
BA050004 Locating a service processor device tree node failed. FWFWPBL 
BA05000A Failed to send boot failed message to the service processor FWFWPBL 
BA060003 IP parameter requires 3 period (.) characters 
Enter a valid IP parameter. Example: 000.000.000.000
  
BA060004 Invalid IP parameter 
Enter a valid IP parameter. Example: 000.000.000.000
  
BA060005 Invalid IP parameter (>255) 
Enter a valid IP parameter. Example: 000.000.000.000
  
BA060007 A keyboard was not found 
Make sure that a keyboard is attached to the USB port that is assigned to the partition. 
Replace the USB card to which the keyboard is attached.
  
BA060008 No configurable adapters found by the remote IPL menu in the System Management Services (SMS) utilities 
This error occurs when the remote IPL menu in the SMS utilities cannot locate any LAN adapters that 
are supported by the remote IPL function.
 FWRIPL 
BA06000B The system was not able to find an operating system on the devices in the boot list. 
See Problems with loading and starting the operating system (AIX and Linux�)
  
BA06000C A pointer to the operating system was found in non-volatile storage. FWPTR 
BA060020 The boot-device environment variable exceeded the allowed character limit. FWNIM 
BA060021 The boot-device environment variable contained more than five entries. FWNIM 
BA060022 The boot-device environment variable contained an entry that exceeded 255 characters in length FWNIM 
BA060030 Logical partitioning with shared processors is enabled and the operating system does not support it. 
Install or boot a level of the operating system that supports shared processors. 
Disable logical partitioning with shared processors in the operating system.
  
BA060040 The system or partition is configured to use huge pages, but the operating system image does not support huge pages. 
Do one of the following:

Install a newer version of the operating system that supports huge pages. 
Use the ASMI to remove the huge pages.
 
BA060050 The Hypervisor supports dynamic partitioning of the huge page-type of memory allocation, 
but dynamic partitioning of huge pages is not supported. 
Use the ASMI to disable dynamic partitioning of huge pages.
  
BA060060 The operating system expects an IOSP partition, but the operating system failed to make the transition to alpha mode. 
Ensure that the alpha-mode operating system image is intended for this partition. 
Ensure that the configuration of the partition supports an alpha-mode operating system.
  
BA060061 The operating system expects a non-IOSP partition, but the operating system failed to make the transition to MGC mode. 
Ensure that the nonalpha-mode operating system image is intended for this partition. 
Ensure that the configuration of the partition supports a nonalpha-mode operating system.
  
BA07xxxx SCSI controller failure FWSCSI1 
BA080001 An IDE device remained busy for a longer period than the time out period FWFWPBL 
BA080002 The IDE controller senses IDE devices but with errors. 
Verify that the IDE devices are properly seated and cabled correctly 
Replace the IDE controller (model-dependent)
  
BA080010 An IDE device is busy longer than specified time-out period. 
Retry the operation.
 FWIDE1 
BA080011 An IDE command timed out; command is exceeding the period allowed to complete. 
Retry the operation.
 FWIDE1 
BA080012 The ATA command failed FWIDE2 
BA080013 The media is not present in the tray 
Retry the operation.
 FWIDE1 
BA080014 The media has been changed 
Retry the operation.
 FWIDE1 
BA080015 The packet command failed; the media might not be readable. 
Retry the operation.
 FWIDE1 
BA09xxxx SCSI controller failure. 
This checkpoint might remain in the control panel for up to 15 minutes If the checkpoint persists 
longer than 15 minutes, do the following:

Power off the server and reboot from the permanent side. Reject the firmware image on the temporary side. 
If the problem persists, before replacing any components, refer to the actions for BA090001.
  
BA090001 SCSI disk unit: test unit ready failed; hardware error FWSCSI1 
BA090002 SCSI disk unit: test unit ready failed; sense data available FWSCSI2 
BA090003 SCSI disk unit: send diagnostic failed; sense data available FWSCSI3 
BA090004 SCSI disk unit: send diagnostic failed: devofl command FWSCSI3 
BA100001 SCSI tape: test unit ready failed; hardware error FWSCSI1 
BA100002 SCSI tape: test unit ready failed; sense data available FWSCSI4 
BA100003 SCSI tape: send diagnostic failed; sense data available FWSCSI3 
BA100004 SCSI tape: send diagnostic failed: devofl command FWSCSI3 
BA110001 SCSI changer: test unit ready failed; hardware error FWSCSI1 
BA110002 SCSI changer: test unit ready failed; sense data available FWSCSI4 
BA110003 SCSI changer: send diagnostic failed; sense data available FWSCSI3 
BA110004 SCSI changer: send diagnostic failed: devofl command FWSCSI3 
BA120001 On an undetermined SCSI device, test unit ready failed; hardware error FWSCSI5 
BA120002 On an undetermined SCSI device, test unit ready failed; sense data available FWSCSI4 
BA120003 On an undetermined SCSI device, send diagnostic failed; sense data available FWSCSI4 
BA120004 On an undetermined SCSI device, send diagnostic failed; devofl command FWSCSI4 
BA130001 SCSI CD-ROM: test unit ready failed; hardware error FWSCSI1 
BA130002 SCSI CD-ROM: test unit ready failed; sense data available FWSCSI3 
BA130003 SCSI CD-ROM: send diagnostic failed; sense data available FWSCSI3 
BA130004 SCSI CD-ROM: send diagnostic failed: devofl command FWSCSI3 
BA130010 USB CD-ROM: device remained busy longer than the time-out period 
Retry the operation.
 FWFWPBL 
BA130011 USB CD-ROM: execution of ATA/ATAPI command was not completed within the allowed time. 
Retry the operation.
 FWCD1 
BA130012 USB CD-ROM: execution of ATA/ATAPI command failed. 
Verify that the power and signal cables going to the USB CD-ROM are properly connected and are not damaged. 
If any problems are found, correct them, then retry the operation. 
If the problem persists, the CD in the USB CD-ROM drive might not be readable. Remove the CD and insert another CD.
 NEXTLVL 
BA130013 USB CD-ROM: bootable media is missing from the drive 
Insert a bootable CD-ROM in the USB CD-ROM drive, then retry the operation.
 FWCD1 
BA130014 USB CD-ROM: the media in the USB CD-ROM drive has been changed. 
Retry the operation.
 FWCD2 
BA130015 USB CD-ROM: ATA/ATAPI packet command execution failed. 
If the problem persists, the CD in the USB CD-ROM drive might not be readable. Remove the CD and insert another CD.
 FWCD2 
BA131010 The USB keyboard was removed. 
Plug in the USB keyboard and reboot the partition. 
Check for system firmware updates and apply them, if available.
  
BA140001 SCSI read/write optical: test unit ready failed; hardware error FWSCSI1 
BA140002 SCSI read/write optical: test unit ready failed; sense data available FWSCSI1 
BA140003 SCSI read/write optical: send diagnostic failed; sense data available FWSCSI3 
BA140004 SCSI read/write optical: send diagnostic failed; devofl command FWSCSI3 
BA150001 PCI Ethernet BNC/RJ-45 or PCI Ethernet AUI/RJ-45 adapter: internal wrap test failure 
Replace the adapter specified by the location code.
  
BA151001 10/100 MBPS Ethernet PCI adapter: internal wrap test failure 
Replace the adapter specified by the location code.
  
BA151002 10/100 MBPS Ethernet card FWENET 
BA153002 Gigabit Ethernet adapter failure 
Verify that the MAC address programmed in the FLASH/EEPROM is correct.
  
BA153003 Gigabit Ethernet adapter failure 
Check for adapter firmware updates; apply if available. 
Remove other cards from the PHB in which the gigabit Ethernet adapter is plugged and retry the operation. 
If the operation is successful, plug the cards in again, one at a time, until the failing card is isolated. 
After you identify the failing card, replace it. 
Replace the adapter.
  
BA160001 PCI auto LANstreamer� token ring adapter: failed to complete hardware initialization. 
Replace the adapter specified by the location code.
  
BA161001 PCI token ring adapter: failed to complete hardware initialization. 
Replace the adapter specified by the location code.
  
BA170xxx NVRAM problems FWNVR1 
BA170000 NVRAMRC initialization failed; device test failed FWNVR2 
BA170100 NVRAM data validation check failed 
Turn off, then turn on the system.
 FWNVR2 
BA170201 The firmware was unable to expand target partition - saving configuration variable FWNVR1 
BA170202 The firmware was unable to expand target partition - writing error log entry FWNVR1 
BA170203 The firmware was unable to expand target partition - writing VPD data FWNVR1 
BA170210 Setenv/$Setenv parameter error - name contains a null character FWNVR1 
BA170211 Setenv/$Setenv parameter error - value contains a null character FWNVR1 
BA170220 The firmware was not able to write a variable value into NVRAM because not enough space exists in NVRAM. 
Do the following:

Reduce the number of partitions, if possible, so that each of the remaining partitions has more NVRMA allocated to it. 
Contact your next level of support.
  
BA170221 The setenv/$setenv function had to delete network boot information to free space in NVRAM. 
You might need to use the SMS menus to reenter the parameters for network installation or boot.
  
BA170998 NVRAMRC script evaluation error - command line execution error. FWNVR3 
BA170999 NVRAMRC script evaluation error - stack unbalanced on completion. 
This is a firmware debug environment error. There is no user action or FRU replacement for this error.
 NEXTLVL 
BA180008 PCI device Fcode evaluation error. FWPCI1 
BA180009 The Fcode on a PCI adapter left a data stack imbalance 
You should load the new adapter Fcode before you use the adapter (specified by the location code 
associated with this error) for booting.
 FWPCI1 
BA180010 PCI probe error, bridge in freeze state FWPCI2 
BA180011 PCI bridge probe error, bridge is not usable FWPCI3 
BA180012 PCI device runtime error, bridge in freeze state FWPCI3 
BA180013 A PCI adapter was found that this machine type and model does not support. 
Is the system an IBM� Intellistation model?

Yes: Complete the following steps. 
Check for and apply any available server firmware udpates. 
Replace the adapter at the location code that was reported with the error.
No: Remove the PCI adapter specified by the location code.
  
BA180014 MSI software error FWFLASH 
BA180100 FDDI adapter Fcode driver is not supported on this system. 
This server does not support the Fcode driver of this adapter. Service support might have additional information.
  
BA180101 Stack underflow from fibre-channel adapter FWFWPBL 
BA188000 An unsupported adapter was found in a PCI slot 
Remove the unsupported adapter in the slot identified by the location code.
  
BA188001 EEH recovered a failing I/O adapter 
This is an informational code only, and no action is required. Since it is informational, no location code will be reported.
  
BA188002 EEH could not recover the failed I/O adapter 
Replace the adapter in the slot identified by the location code.
  
BA190001 Firmware function to get/set time-of-day reported an error FWFWPBL 
BA191001 The server firmware function to turn on the speaker reported an error FWFWPBL 
BA201001 The serial interface dropped data packets FWFWPBL 
BA201002 The serial interface failed to open 
Note:
Check console settings to ensure the console is defined to the correct port. Ensure the console cables 
are connected to the port that is defined as the console. FWFWPBL 
BA201003 The firmware failed to handshake properly with the serial interface FWFWPBL 
BA210000 Partition firmware reports a default catch FWFWPBL 
BA210001 Partition firmware reports a stack underflow was caught FWFWPBL 
BA210002 Partition firmware was ready before standout was ready FWFWPBL 
BA210010 The transfer of control to the SLIC loader failed FWFWPBL 
BA210020 The I/O configuration exceeds the maximum size allowed by partition firmware. 
Increase the logical memory block size to 256 megabytes (MB) and reboot the managed system.

Note:
If the logical memory block size is already 256 MB, contact your next level of support.  
BA210100 The partition firmware was unable to log an error with the server firmware. No reply was received 
from the server firmware to an error log that was sent previously NEXTLVL 
BA210101 The partition firmware error log queue is full NEXTLVL 
BA250010 dlpar error in open firmware FWLPAR 
BA250020 dlpar error in open firmware due to an invalid dlpar entity. This error may have been caused 
by an errant or hung operating system process. 
Check for operating system updates that resolve problems with dynamic logical partitioning (dlpar) and apply them, if available. 
Check for server firmware updates and apply them, if available.
  
BA250030 A hotplug operation in dynamic logical partitioning (dlpar) was terminated for concurrent firmware update. 
Retry the hotplug operation after the concurrent firmware update is complete.  
BA250040 The firmware was unable to generate a device tree node 
After you perform the FRU indicated in the Failing Items column, check for operating system updates 
and apply them, if available.
 FWFLASH 
BA278001 Failed to flash firmware: invalid image file 
Obtain a valid firmware update (flash) image for this system.
  
BA278002 Flash file is not designed for this eServer� platform 
Obtain a valid firmware update (flash) image for this system.
  
BA278003 Unable to lock the firmware update lid manager 
Reboot the system. 
Make sure that the operating system is authorized to update the firmware. If the system is running 
multiple partitions, verify that this partition has service authority.
  
BA278004 An invalid firmware update lid was requested 
Obtain a valid firmware update (flash) image for this system.
  
BA278005 Failed to flash a firmware update lid 
Obtain a valid firmware update (flash) image for this system.
  
BA278006 Unable to unlock the firmware update lid manager 
Reboot the system.
  
BA278007 Failed to reboot the system after a firmware flash update 
Reboot the system.
  
BA278008 A server firmware update was attempted from the operating system. You must perform the update 
by using the Hardware Management Console (HMC). 
Perform the server firmware update by using the HMC.
  
BA278009 The server firmware update management tools for the version of Linux that you are running are 
incompatible with this system. 
Go to Service and productivity tools for Linux on POWER� and download the latest service aids and productivity tools 
for the version of Linux that you are running.
  
BA280000 RTAS discovered an invalid operation that may cause a hardware error NEXTLVL 
BA290000 RTAS discovered an internal stack overflow FWFWPBL 
BA300010 The partition exceeded the maximum number of logical memory blocks allowed under the new memory allocation scheme. 
Reduce the total logical memory block limit in the partition profile, then reactivate the partition.

Note:
The maximum number of logical memory blocks per partition is 128 kilobytes (K) under the new memory allocation scheme.  
BA300020 Function call to isolate a logical memory block failed under the standard memory allocation scheme. 
Do the following:

Upgrade the firmware of the managed system to the latest level, if a newer level is available. 
Upgrade the operating system to a level that supports the new memory representation, or edit the profile to have fewer 
logical memory blocks than the 8K maximum. 
Reboot the partition.
  
BA300030 Function call to make a logical memory block unusable failed under the standard memory allocation scheme. 
Do the following:

Upgrade the firmware of the managed system to the latest level, if a newer level is available. 
Upgrade the operating system to a level that supports the new memory representation, or edit the profile to have 
fewer logical memory blocks than the 8K maximum. 
Reboot the partition.
  
BA300040 The partition, which is running the traditional memory representation, exceeded the limit of 8192 logical 
memory blocks allowed by the standard memory allocation scheme. 
Do the following:

Upgrade the operating system to one that supports the new memory representation, or edit the profile 
to have fewer than 8192 logical memory blocks. 
Reboot the partition.
  
BA310010 The firmware could not obtain the SRC history FWFLASH 
BA310020 The firmware received an invalid SRC history FWFLASH 
BA310030 The firmware operation to write the MAC address to vital product data (VPD) failed FWFLASH 


##############################################################

SECTION 4: IBM: Using system reference codes:

##############################################################


Using system reference codes
System reference codes (SRCs) indicate a server hardware or software problem that can originate in hardware, 
in Licensed Internal Code, or in the operating system.

A server component generates an error code when it detects a problem. An SRC identifies the component that detected 
the error code and describes the error condition. Use the SRC information to identify a list of possible failing items 
and to find information about any additional isolation procedures.

SRC formats

SRCs are strings of either six or eight alphanumeric characters. The characters in the SRC typically represent the reference 
code type and the unit reference code (URC):

For SRCs displayed on the control panel, the first four characters designate the reference code type and the second four 
characters designate the URC. 
For SRCs displayed on software displays, characters 1 through 4 of word 1 designate the reference code type and characters 
5 through 8 of word 1 designate the URC.
Note:
For partition firmware SRCs (AAxx, BAxx, and DAxx) and service processor SRCs (A1xx and B1xx), only the first two characters 
of the SRC indicate the necessary action. For partition firmware SRCs that begin with 2xxx, only the first character indicates 
the necessary action. In these cases, the term URC does not apply.
A reference code that is 6 or 8 characters long and appears in either of the following formats (xxxxxx or xxxxxxxx) is an SRC, 
unless it fits one of the following conditions:

An 8-character code that begins with a C (except CB) or D (except DA) is a progress code 
An 8-character code that begins with an H is a Hardware Management Console (HMC) error code or message 
A 6-character code that begins with a zero (0) and does not include a hyphen is an HMC error code 
A code that begins with a number sign character (#) represents an AIX� diagnostics message.
Using the list of reference codes

The list of system reference codes is organized in hexadecimal sequence, with numeric characters listed before 
alphabetic characters. Each entry in the list represents the first four characters (the reference code type) of the SRC. 
The entries link to more information, typically a table that lists the URCs that are associated with that reference code type.

Unless specified otherwise on a particular SRC page, the SRC tables contain the following columns:

The Reference Code column contains numbers that represent the unit reference code (URC). 
The Description/Action column offers a brief description of the failure that this SRC represents. It may also contain 
instructions for continuing the problem analysis. 
The Failing Item column represents functional areas of the system unit. When available, the failing function code links 
to the FRU that contains this function for each specific system unit.
To use the list of system reference codes, complete the following steps:

Click the item in the list of system reference codes that matches the reference code type that you want to find. 
Note:
The SRC tables support only 8-character reference code formats. If the reference code provided contains only 4 or 6 characters, 
contact your next level of support for assistance.
When the SRC table appears, select the appropriate URC from the first column of the table. The tables list URCs 
in hexadecimal sequence, with numeric characters listed before alphabetic characters. 
Perform the action indicated for the URC in the Description/Action column of the table. 
If the table entry does not indicate an action or if performing the action does not correct the problem, exchange 
the failing items or parts listed in the Failing Item column in the order that they are listed. Use the following 
instructions to exchange failing items: 
Note:
Some failing items are required to be exchanged in groups until the problem is solved. Other failing items are flagged 
as mandatory exchange and must be exchanged before the service action is complete, even if the problem appears 
to have been repaired. For more information, see Block replacement of FRUs.

Exchange the failing item listed first. 
If exchanging the first failing item does not correct the problem, reinstall the original item 
and exchange the next failing item listed. 
Continue to exchange and reinstall the failing items, one at a time, until the problem is corrected. 
If exchanging the failing items does not correct the problem, ask your next level of support for assistance.


##############################################################

SECTION 5: GENERAL: UNIX ERROR CODES errno.h :

##############################################################


Generic errormaps/links from the errno.h file:


>>>> Errcodes, for example, Linux (generic):


#define EPERM            1      /* Operation not permitted */
#define ENOENT           2      /* No such file or directory */
#define ESRCH            3      /* No such process */
#define EINTR            4      /* Interrupted system call */
#define EIO              5      /* I/O error */
#define ENXIO            6      /* No such device or address */
#define E2BIG            7      /* Arg list too long */
#define ENOEXEC          8      /* Exec format error */
#define EBADF            9      /* Bad file number */
#define ECHILD          10      /* No child processes */
#define EAGAIN          11      /* Try again */
#define ENOMEM          12      /* Out of memory */
#define EACCES          13      /* Permission denied */
#define EFAULT          14      /* Bad address */
#define ENOTBLK         15      /* Block device required */
#define EBUSY           16      /* Device or resource busy */
#define EEXIST          17      /* File exists */
#define EXDEV           18      /* Cross-device link */
#define ENODEV          19      /* No such device */
#define ENOTDIR         20      /* Not a directory */
#define EISDIR          21      /* Is a directory */
#define EINVAL          22      /* Invalid argument */
#define ENFILE          23      /* File table overflow */
#define EMFILE          24      /* Too many open files */
#define ENOTTY          25      /* Not a typewriter */
#define ETXTBSY         26      /* Text file busy */
#define EFBIG           27      /* File too large */
#define ENOSPC          28      /* No space left on device */
#define ESPIPE          29      /* Illegal seek */
#define EROFS           30      /* Read-only file system */
#define EMLINK          31      /* Too many links */
#define EPIPE           32      /* Broken pipe */
#define EDOM            33      /* Math argument out of domain of func */
#define ERANGE          34      /* Math result not representable */
#define EDEADLK         35      /* Resource deadlock would occur */
#define ENAMETOOLONG    36      /* File name too long */
#define ENOLCK          37      /* No record locks available */
#define ENOSYS          38      /* Function not implemented */
#define ENOTEMPTY       39      /* Directory not empty */
#define ELOOP           40      /* Too many symbolic links encountered */
#define EWOULDBLOCK     EAGAIN  /* Operation would block */
#define ENOMSG          42      /* No message of desired type */
#define EIDRM           43      /* Identifier removed */
#define ECHRNG          44      /* Channel number out of range */
#define EL2NSYNC        45      /* Level 2 not synchronized */
#define EL3HLT          46      /* Level 3 halted */
#define EL3RST          47      /* Level 3 reset */
#define ELNRNG          48      /* Link number out of range */
#define EUNATCH         49      /* Protocol driver not attached */
#define ENOCSI          50      /* No CSI structure available */
#define EL2HLT          51      /* Level 2 halted */
#define EBADE           52      /* Invalid exchange */
#define EBADR           53      /* Invalid request descriptor */
#define EXFULL          54      /* Exchange full */
#define ENOANO          55      /* No anode */
#define EBADRQC         56      /* Invalid request code */
#define EBADSLT         57      /* Invalid slot */
#define EDEADLOCK       EDEADLK
#define EBFONT          59      /* Bad font file format */
#define ENOSTR          60      /* Device not a stream */
#define ENODATA         61      /* No data available */
#define ETIME           62      /* Timer expired */
#define ENOSR           63      /* Out of streams resources */
#define ENONET          64      /* Machine is not on the network */
#define ENOPKG          65      /* Package not installed */
#define EREMOTE         66      /* Object is remote */
#define ENOLINK         67      /* Link has been severed */
#define EADV            68      /* Advertise error */
#define ESRMNT          69      /* Srmount error */
#define ECOMM           70      /* Communication error on send */
#define EPROTO          71      /* Protocol error */
#define EMULTIHOP       72      /* Multihop attempted */
#define EDOTDOT         73      /* RFS specific error */
#define EBADMSG         74      /* Not a data message */
#define EOVERFLOW       75      /* Value too large for defined data type */
#define ENOTUNIQ        76      /* Name not unique on network */
#define EBADFD          77      /* File descriptor in bad state */
#define EREMCHG         78      /* Remote address changed */
#define ELIBACC         79      /* Can not access a needed shared library */
#define ELIBBAD         80      /* Accessing a corrupted shared library */
#define ELIBSCN         81      /* .lib section in a.out corrupted */
#define ELIBMAX         82      /* Attempting to link in too many shared libraries */
#define ELIBEXEC        83      /* Cannot exec a shared library directly */
#define EILSEQ          84      /* Illegal byte sequence */
#define ERESTART        85      /* Interrupted system call should be restarted */
#define ESTRPIPE        86      /* Streams pipe error */
#define EUSERS          87      /* Too many users */
#define ENOTSOCK        88      /* Socket operation on non-socket */
#define EDESTADDRREQ    89      /* Destination address required */
#define EMSGSIZE        90      /* Message too long */
#define EPROTOTYPE      91      /* Protocol wrong type for socket */
#define ENOPROTOOPT     92      /* Protocol not available */
#define EPROTONOSUPPORT 93      /* Protocol not supported */
#define ESOCKTNOSUPPORT 94      /* Socket type not supported */
#define EOPNOTSUPP      95      /* Operation not supported on transport endpoint */
#define EPFNOSUPPORT    96      /* Protocol family not supported */
#define EAFNOSUPPORT    97      /* Address family not supported by protocol */
#define EADDRINUSE      98      /* Address already in use */
#define EADDRNOTAVAIL   99      /* Cannot assign requested address */
#define ENETDOWN        100     /* Network is down */
#define ENETUNREACH     101     /* Network is unreachable */
#define ENETRESET       102     /* Network dropped connection because of reset */
#define ECONNABORTED    103     /* Software caused connection abort */
#define ECONNRESET      104     /* Connection reset by peer */
#define ENOBUFS         105     /* No buffer space available */
#define EISCONN         106     /* Transport endpoint is already connected */
#define ENOTCONN        107     /* Transport endpoint is not connected */
#define ESHUTDOWN       108     /* Cannot send after transport endpoint shutdown */
#define ETOOMANYREFS    109     /* Too many references: cannot splice */
#define ETIMEDOUT       110     /* Connection timed out */
#define ECONNREFUSED    111     /* Connection refused */
#define EHOSTDOWN       112     /* Host is down */
#define EHOSTUNREACH    113     /* No route to host */
#define EALREADY        114     /* Operation already in progress */
#define EINPROGRESS     115     /* Operation now in progress */
#define ESTALE          116     /* Stale NFS file handle */
#define EUCLEAN         117     /* Structure needs cleaning */
#define ENOTNAM         118     /* Not a XENIX named type file */
#define ENAVAIL         119     /* No XENIX semaphores available */
#define EISNAM          120     /* Is a named type file */
#define EREMOTEIO       121     /* Remote I/O error */
#define EDQUOT          122     /* Quota exceeded */
#define ENOMEDIUM       123     /* No medium found */
#define EMEDIUMTYPE     124     /* Wrong medium type */


The list above should actually be enough, but we shall list the same for AIX:


>>>> errcodes AIX:


#define EPERM   1       /* Operation not permitted              */
#define ENOENT  2       /* No such file or directory            */
#define ESRCH   3       /* No such process                      */
#define EINTR   4       /* interrupted system call              */
#define EIO     5       /* I/O error                            */
#define ENXIO   6       /* No such device or address            */
#define E2BIG   7       /* Arg list too long                    */
#define ENOEXEC 8       /* Exec format error                    */
#define EBADF   9       /* Bad file descriptor                  */
#define ECHILD  10      /* No child processes                   */
#define EAGAIN  11      /* Resource temporarily unavailable     */
#define ENOMEM  12      /* Not enough space                     */
#define EACCES  13      /* Permission denied                    */
#define EFAULT  14      /* Bad address                          */
#define ENOTBLK 15      /* Block device required                */
#define EBUSY   16      /* Resource busy                        */
#define EEXIST  17      /* File exists                          */
#define EXDEV   18      /* Improper link                        */
#define ENODEV  19      /* No such device                       */
#define ENOTDIR 20      /* Not a directory                      */
#define EISDIR  21      /* Is a directory                       */
#define EINVAL  22      /* Invalid argument                     */
#define ENFILE  23      /* Too many open files in system        */
#define EMFILE  24      /* Too many open files                  */
#define ENOTTY  25      /* Inappropriate I/O control operation  */
#define ETXTBSY 26      /* Text file busy                       */
#define EFBIG   27      /* File too large                       */
#define ENOSPC  28      /* No space left on device              */
#define ESPIPE  29      /* Invalid seek                         */
#define EROFS   30      /* Read only file system                */
#define EMLINK  31      /* Too many links                       */
#define EPIPE   32      /* Broken pipe                          */
#define EDOM    33      /* Domain error within math function    */
#define ERANGE  34      /* Result too large                     */
#define ENOMSG  35      /* No message of desired type           */
#define EIDRM   36      /* Identifier removed                   */
#define ECHRNG  37      /* Channel number out of range          */
#define EL2NSYNC 38     /* Level 2 not synchronized             */
#define EL3HLT  39      /* Level 3 halted                       */
#define EL3RST  40      /* Level 3 reset                        */
#define ELNRNG  41      /* Link number out of range             */
#define EUNATCH 42      /* Protocol driver not attached         */
#define ENOCSI  43      /* No CSI structure available           */
#define EL2HLT  44      /* Level 2 halted                       */
#define EDEADLK 45      /* Resource deadlock avoided            */
#define ENOTREADY       46      /* Device not ready             */
#define EWRPROTECT      47      /* Write-protected media        */
#define EFORMAT         48      /* Unformatted media            */
#define ENOLCK          49      /* No locks available           */
#define ENOCONNECT      50      /* no connection                */
#define ESTALE          52      /* no filesystem                */
#define EDIST           53      /* old, currently unused AIX errno*/
#define EINPROGRESS     55      /* Operation now in progress */
#define EALREADY        56      /* Operation already in progress */
#define ENOTSOCK        57      /* Socket operation on non-socket */
#define EDESTADDRREQ    58      /* Destination address required */
#define EDESTADDREQ     EDESTADDRREQ /* Destination address required */
#define EMSGSIZE        59      /* Message too long */
#define EPROTOTYPE      60      /* Protocol wrong type for socket */
#define ENOPROTOOPT     61      /* Protocol not available */
#define EPROTONOSUPPORT 62      /* Protocol not supported */
#define ESOCKTNOSUPPORT 63      /* Socket type not supported */
#define EOPNOTSUPP      64      /* Operation not supported on socket */
#define EPFNOSUPPORT    65      /* Protocol family not supported */
#define EAFNOSUPPORT    66      /* Address family not supported by protocol family */
#define EADDRINUSE      67      /* Address already in use */
#define EADDRNOTAVAIL   68      /* Can't assign requested address */
#define ENETDOWN        69      /* Network is down */
#define ENETUNREACH     70      /* Network is unreachable */
#define ENETRESET       71      /* Network dropped connection on reset */
#define ECONNABORTED    72      /* Software caused connection abort */
#define ECONNRESET      73      /* Connection reset by peer */
#define ENOBUFS         74      /* No buffer space available */
#define EISCONN         75      /* Socket is already connected */
#define ENOTCONN        76      /* Socket is not connected */
#define ESHUTDOWN       77      /* Can't send after socket shutdown */
#define ETIMEDOUT       78      /* Connection timed out */
#define ECONNREFUSED    79      /* Connection refused */
#define EHOSTDOWN       80      /* Host is down */
#define EHOSTUNREACH    81      /* No route to host */
#define ERESTART        82      /* restart the system call */
#define EPROCLIM        83      /* Too many processes */
#define EUSERS          84      /* Too many users */
#define ELOOP           85      /* Too many levels of symbolic links      */
#define ENAMETOOLONG    86      /* File name too long                     */
#define EDQUOT          88      /* Disc quota exceeded */
#define ECORRUPT        89      /* Invalid file system control data */
#define EREMOTE         93      /* Item is not local to host */
#define ENOSYS          109     /* Function not implemented  POSIX */
#define EMEDIA          110     /* media surface error */
#define ESOFT           111     /* I/O completed, but needs relocation */
#define ENOATTR         112     /* no attribute found */
#define ESAD            113     /* security authentication denied */
#define ENOTRUST        114     /* not a trusted program */
#define ETOOMANYREFS    115     /* Too many references: can't splice */
#define EILSEQ          116     /* Invalid wide character */
#define ECANCELED       117     /* asynchronous i/o cancelled */
#define ENOSR           118     /* temp out of streams resources */
#define ETIME           119     /* I_STR ioctl timed out */
#define EBADMSG         120     /* wrong message type at stream head */
#define EPROTO          121     /* STREAMS protocol error */
#define ENODATA         122     /* no message ready at stream head */
#define ENOSTR          123     /* fd is not a stream */
#define ECLONEME        ERESTART /* this is the way we clone a stream ... */
#define ENOTSUP         124     /* POSIX threads unsupported value */
#define EMULTIHOP       125     /* multihop is not allowed */
#define ENOLINK         126     /* the link has been severed */
#define EOVERFLOW       127     /* value too large to be stored in data type */


Base AIX error codes:
=====================


Appendix A. Base Operating System Error Codes for Services That Require Path-Name Resolution
The following errors apply to any service that requires path name resolution:

EACCES	 Search permission is denied on a component of the path prefix. 
EFAULT	 The Path parameter points outside of the allocated address space of the process. 
EIO	 An I/O error occurred during the operation. 
ELOOP	 Too many symbolic links were encountered in translating the Path parameter. 
ENAMETOOLONG A component of a path name exceeded 255 characters and the process has the DisallowTruncation 
attribute (see the ulimit subroutine) or an entire path name exceeded 1023 characters. 
ENOENT	 A component of the path prefix does not exist. 
ENOENT	 A symbolic link was named, but the file to which it refers does not exist. 
ENOENT	 The path name is null. 
ENOTDIR	 A component of the path prefix is not a directory. 
ESTALE	 The root or current directory of the process is located in a virtual file system that is unmounted. 


albert@starboss:/usr/include $ cat errlog.h
/* IBM_PROLOG_BEGIN_TAG                                                   */
/* This is an automatically generated prolog.                             */
/*                                                                        */
/* bos53D src/bos/usr/ccs/lib/liberrlog/errlog.h 1.7                      */
/*                                                                        */
/* Licensed Materials - Property of IBM                                   */
/*                                                                        */
/* Restricted Materials of IBM                                            */
/*                                                                        */
/* (C) COPYRIGHT International Business Machines Corp. 2000,2005          */
/* All Rights Reserved                                                    */
/*                                                                        */
/* US Government Users Restricted Rights - Use, duplication or            */
/* disclosure restricted by GSA ADP Schedule Contract with IBM Corp.      */
/*                                                                        */
/* IBM_PROLOG_END_TAG                                                     */
#ifndef H_errlog
#define H_errlog
/* @(#)74        1.7  src/bos/usr/ccs/lib/liberrlog/errlog.h, cmderrlg, bos53D, d2005_09B1 2/24/05 15:34:58 */

/*
 * COMPONENT_NAME: CMDERRLG   system error logging and reporting facility
 *
 * External definitions and declarations for liberrlog.a
 *
 */


#include <sys/types.h>
#include <sys/err_rec.h>

typedef void *errlog_handle_t;

/*
 *  These magic numbers will indicate which version of errlog
 *  entry is being returned.
 *  All users of errlog_entry_t should use only LE_MAGIC.
 */
#define LE_MAGIC_41 0x0C3DF420
/* LE_MAGIC434_INTERUM is an interum 43T magic, before le_errdiag was added. */
#define LE_MAGIC434_INTERUM 0x0C3DF434
#define LE_MAGIC434 0x0C4DF434
#define LE_MAGIC52F 0x0C4DF52F
#define LE_MAGIC53D 0x0C4DF53D
#define LE_MAGIC   LE_MAGIC53D          /* current errlog_open magic # */
/* VALID_LE_MAGIC gives valid magic numbers for an error log record. */
#define VALID_LE_MAGIC(m) (((m) == LE_MAGIC_41) || \
                ((m) == LE_MAGIC434_INTERUM) || ((m) == LE_MAGIC434))
/* VALID_LENTRY_MAGIC gives valid magic numbers for errlog_open(). */
#define VALID_LENTRY_MAGIC(m) (((m) == LE_MAGIC) || ((m) == LE_MAGIC434) ||\
                               ((m) == LE_MAGIC52F))

/*
 * Optional duplicate information.
 */
struct errdup {
    unsigned int        ed_dupcount;
    time32_t            ed_time1;
    time32_t            ed_time2;
};

/* Lengths of the various fields in the structure. */
#define LE_LABEL_MAX            20
#define LE_MACHINE_ID_MAX       32
#define LE_NODE_ID_MAX          32
#define LE_CLASS_MAX            2
#define LE_TYPE_MAX             5
#define LE_RESOURCE_MAX         16
#define LE_RCLASS_MAX           16
#define LE_RTYPE_MAX            16
#define LE_VPD_MAX              512
#define LE_IN_MAX               256
#define LE_CONN_MAX             20
#define LE_DETAIL_MAX           ERR_REC_MAX
#define LE_SYMPTOM_MAX          312
#define LE_ERRDUP_MAX           sizeof(struct errdup)

/* The data structure that contains an errlog entry */
typedef struct errlog_entry {
    unsigned int        el_magic;
    unsigned int        el_sequence;
    char                el_label[LE_LABEL_MAX];
    unsigned int        el_timestamp;
    unsigned int        el_crcid;
    unsigned int        el_errdiag;
    char                el_machineid[LE_MACHINE_ID_MAX];
    char                el_nodeid[LE_NODE_ID_MAX];
    char                el_class[LE_CLASS_MAX];
    char                el_type[LE_TYPE_MAX];
    char                el_resource[LE_RESOURCE_MAX];
    char                el_rclass[LE_RCLASS_MAX];
    char                el_rtype[LE_RTYPE_MAX];
    char                el_vpd_ibm[LE_VPD_MAX];
    char                el_vpd_user[LE_VPD_MAX];
    char                el_in[LE_IN_MAX];
    char                el_connwhere[LE_CONN_MAX];
    unsigned short      el_flags;
    unsigned short      el_detail_length;
    char                el_detail_data[LE_DETAIL_MAX];
    unsigned int        el_symptom_length;
    char                el_symptom_data[LE_SYMPTOM_MAX];
    struct errdup       el_errdup;
} errlog_entry_t;


/* Values for the el_flags element. */
#define LE_FLAG_ERR64           0x01
#define LE_FLAG_ERRDUP          0x100

/*
 *  This structure is used to pass search criteria to errlog_find_first.

 *  To use it an operation is put in em_op.  If it is a leaf operation,
 *  the field in errlog_entry_t to apply the op to is put in em_field and
 *  the value to compare against is put in em_strvalue or em_intvalue.
 *  Boolean values are put in em_intvalue.
 *
 *  To connect operations, a unary or binary operator is put in em_op.
 *  The operation(s) to apply the operator to are put in em_left and,
 *  if it's a binary operator, em_right.
 */

typedef struct errlog_match {
    unsigned int                em_op;
    union {
        struct errlog_match     *emu_left;
        unsigned int            emu_field;
    } emu1;
    union {
        struct errlog_match     *emu_right;
        unsigned int            emu_intvalue;
        unsigned char           *emu_strvalue;
    } emu2;
} errlog_match_t;

#define em_left         emu1.emu_left
#define em_field        emu1.emu_field
#define em_right        emu2.emu_right
#define em_intvalue     emu2.emu_intvalue
#define em_strvalue     emu2.emu_strvalue

/* Operators to use in the match structures for the find functions */
#define LE_OP_EQUAL             0x01
#define LE_OP_NE                0x02
#define LE_OP_SUBSTR            0x03
#define LE_OP_LT                0x04
#define LE_OP_LE                0x05
#define LE_OP_GT                0x06
#define LE_OP_GE                0x07
#define LE_OP_LEAF              0x100
#define LE_OP_NOT               0x101
#define LE_OP_AND               0x201
#define LE_OP_OR                0x202
#define LE_OP_XOR               0x203

/* Flags to combine with the field id to indicate the data type of the field */
#define LE_TYPE                 0xff00
#define LE_TYPE_INT             0x0100
#define LE_TYPE_STRING          0x0200
#define LE_TYPE_BOOLEAN         0x0300

/* Flags to indicate which field to match in the find functions. */
#define LE_MATCH_FIELD          0xff
#define LE_MATCH_SEQUENCE       (0x01|LE_TYPE_INT)
#define LE_MATCH_LABEL          (0x02|LE_TYPE_STRING)
#define LE_MATCH_TIMESTAMP      (0x03|LE_TYPE_INT)
#define LE_MATCH_CRCID          (0x04|LE_TYPE_INT)
#define LE_MATCH_MACHINEID      (0x05|LE_TYPE_STRING)
#define LE_MATCH_NODEID         (0x06|LE_TYPE_STRING)
#define LE_MATCH_CLASS          (0x07|LE_TYPE_STRING)
#define LE_MATCH_TYPE           (0x08|LE_TYPE_STRING)
#define LE_MATCH_RESOURCE       (0x09|LE_TYPE_STRING)
#define LE_MATCH_RCLASS         (0x0a|LE_TYPE_STRING)
#define LE_MATCH_RTYPE          (0x0b|LE_TYPE_STRING)
#define LE_MATCH_VPD_IBM        (0x0c|LE_TYPE_STRING)
#define LE_MATCH_VPD_USER       (0x0d|LE_TYPE_STRING)
#define LE_MATCH_IN             (0x0e|LE_TYPE_STRING)
#define LE_MATCH_CONNWHERE      (0x0f|LE_TYPE_STRING)
#define LE_MATCH_FLAG_ERR64     (0x10|LE_TYPE_BOOLEAN)
#define LE_MATCH_FLAG_ERRDUP    (0x11|LE_TYPE_BOOLEAN)
#define LE_MATCH_DETAIL_DATA    (0x12|LE_TYPE_STRING)
#define LE_MATCH_SYMPTOM_DATA   (0x13|LE_TYPE_STRING)
#define LE_MATCH_ERRDIAG        (0x14|LE_TYPE_INT)

/*
 *  Define the directions find can walk through the errlog file.
 */

#define LE_FORWARD              0x01
#define LE_REVERSE              0x02

/*
 * Define the errors that the functions can return.
 */

#define LE_ERR_INVARG   0x01            /* Invalid input argument */
#define LE_ERR_NOFILE   0x02            /* The errlog file can't be opened */
#define LE_ERR_INVFILE  0x03            /* The errlog file isn't valid */
#define LE_ERR_NOMEM    0x04            /* We're out of memory */
#define LE_ERR_NOWRITE  0x05            /* Can't write entry back */
#define LE_ERR_IO       0x06            /* IO error in the errlog file */
#define LE_ERR_DONE     0x07            /* The find function reached the end */

/*
 * These are the functions that comprise the API
 */
extern int errlog_open(char             *path,
                       int              mode,
                       unsigned int     magic,
                       errlog_handle_t  *handle);

extern int errlog_close(errlog_handle_t handle);

extern int errlog_find_first(errlog_handle_t    handle,
                             errlog_match_t     *filter,
                             errlog_entry_t     *result);

extern int errlog_find_next(errlog_handle_t     handle,
                            errlog_entry_t      *result);

extern int errlog_find_sequence(errlog_handle_t handle,
                                int             sequence,
                                errlog_entry_t  *result);

extern int errlog_set_direction(errlog_handle_t handle,
                                int             direction);

extern int errlog_write(errlog_handle_t         handle,
                        errlog_entry_t          *data);

#endif
albert@starboss:/usr/include $


albert@starboss:/usr/include/sys $ cat errno.h
/* IBM_PROLOG_BEGIN_TAG                                                   */
/* This is an automatically generated prolog.                             */
/*                                                                        */
/* bos530 src/bos/kernel/sys/errno.h 1.27.1.23                            */
/*                                                                        */
/* Licensed Materials - Property of IBM                                   */
/*                                                                        */
/* (C) COPYRIGHT International Business Machines Corp. 1985,1995          */
/* All Rights Reserved                                                    */
/*                                                                        */
/* US Government Users Restricted Rights - Use, duplication or            */
/* disclosure restricted by GSA ADP Schedule Contract with IBM Corp.      */
/*                                                                        */
/* IBM_PROLOG_END_TAG                                                     */
/* @(#)49       1.27.1.23  src/bos/kernel/sys/errno.h, incstd, bos530 1/25/01 16:31:11 */
/*
 * COMPONENT_NAME: (INCSTD) Standard Include Files
 *
 * FUNCTIONS:
 *
 * ORIGINS: 27,71
 *
 * (C) COPYRIGHT International Business Machines Corp. 1985, 1996
 * All Rights Reserved
 * Licensed Materials - Property of IBM
 *
 * US Government Users Restricted Rights - Use, duplication or
 * disclosure restricted by GSA ADP Schedule Contract with IBM Corp.
 */
/*
 * (c) Copyright 1990, 1991, 1992 OPEN SOFTWARE FOUNDATION, INC.
 * ALL RIGHTS RESERVED
 */

#ifndef _H_ERRNO
#define _H_ERRNO
#include <standards.h>

/*
 *      Error codes
 *
 *      The ANSI, POSIX, and XOPEN standards require that certain values be
 *      in errno.h.  The standards allow additional macro definitions,
 *      beginning with an E and an uppercase letter.
 *
 */

#ifdef _ANSI_C_SOURCE

#ifndef _KERNEL

#if defined(_THREAD_SAFE) || defined(_THREAD_SAFE_ERRNO)
/*
 * Per thread errno is provided by the threads provider. Both the extern int
 * and the per thread value must be maintained by the threads library.
 */
extern  int     *_Errno( void );
#define errno   (*_Errno())

#else

extern int errno;

#endif  /* _THREAD_SAFE || _THREAD_SAFE_ERRNO */

#endif  /* _KERNEL */

#ifdef _ALL_SOURCE

extern  char    *sys_errlist[];
extern  int     sys_nerr;

#endif /* _ALL_SOURCE */

#define EPERM   1       /* Operation not permitted              */
#define ENOENT  2       /* No such file or directory            */
#define ESRCH   3       /* No such process                      */
#define EINTR   4       /* interrupted system call              */
#define EIO     5       /* I/O error                            */
#define ENXIO   6       /* No such device or address            */
#define E2BIG   7       /* Arg list too long                    */
#define ENOEXEC 8       /* Exec format error                    */
#define EBADF   9       /* Bad file descriptor                  */
#define ECHILD  10      /* No child processes                   */
#define EAGAIN  11      /* Resource temporarily unavailable     */
#define ENOMEM  12      /* Not enough space                     */
#define EACCES  13      /* Permission denied                    */
#define EFAULT  14      /* Bad address                          */
#define ENOTBLK 15      /* Block device required                */
#define EBUSY   16      /* Resource busy                        */
#define EEXIST  17      /* File exists                          */
#define EXDEV   18      /* Improper link                        */
#define ENODEV  19      /* No such device                       */
#define ENOTDIR 20      /* Not a directory                      */
#define EISDIR  21      /* Is a directory                       */
#define EINVAL  22      /* Invalid argument                     */
#define ENFILE  23      /* Too many open files in system        */
#define EMFILE  24      /* Too many open files                  */
#define ENOTTY  25      /* Inappropriate I/O control operation  */
#define ETXTBSY 26      /* Text file busy                       */
#define EFBIG   27      /* File too large                       */
#define ENOSPC  28      /* No space left on device              */
#define ESPIPE  29      /* Invalid seek                         */
#define EROFS   30      /* Read only file system                */
#define EMLINK  31      /* Too many links                       */
#define EPIPE   32      /* Broken pipe                          */
#define EDOM    33      /* Domain error within math function    */
#define ERANGE  34      /* Result too large                     */
#define ENOMSG  35      /* No message of desired type           */
#define EIDRM   36      /* Identifier removed                   */
#define ECHRNG  37      /* Channel number out of range          */
#define EL2NSYNC 38     /* Level 2 not synchronized             */
#define EL3HLT  39      /* Level 3 halted                       */
#define EL3RST  40      /* Level 3 reset                        */
#define ELNRNG  41      /* Link number out of range             */
#define EUNATCH 42      /* Protocol driver not attached         */
#define ENOCSI  43      /* No CSI structure available           */
#define EL2HLT  44      /* Level 2 halted                       */
#define EDEADLK 45      /* Resource deadlock avoided            */

#define ENOTREADY       46      /* Device not ready             */
#define EWRPROTECT      47      /* Write-protected media        */
#define EFORMAT         48      /* Unformatted media            */

#define ENOLCK          49      /* No locks available           */

#define ENOCONNECT      50      /* no connection                */
#define ESTALE          52      /* no filesystem                */
#define EDIST           53      /* old, currently unused AIX errno*/

/* non-blocking and interrupt i/o */
/*
 * AIX returns EAGAIN where 4.3BSD used EWOULDBLOCK;
 * but, the standards insist on unique errno values for each errno.
 * A unique value is reserved for users that want to code case
 * statements for systems that return either EAGAIN or EWOULDBLOCK.
 */
#if _XOPEN_SOURCE_EXTENDED==1
#define EWOULDBLOCK     EAGAIN   /* Operation would block       */
#else /* _XOPEN_SOURCE_EXTENDED */
#define EWOULDBLOCK     54
#endif /* _XOPEN_SOURCE_EXTENDED */

#define EINPROGRESS     55      /* Operation now in progress */
#define EALREADY        56      /* Operation already in progress */

/* ipc/network software */

        /* argument errors */
#define ENOTSOCK        57      /* Socket operation on non-socket */
#define EDESTADDRREQ    58      /* Destination address required */
#define EDESTADDREQ     EDESTADDRREQ /* Destination address required */
#define EMSGSIZE        59      /* Message too long */
#define EPROTOTYPE      60      /* Protocol wrong type for socket */
#define ENOPROTOOPT     61      /* Protocol not available */
#define EPROTONOSUPPORT 62      /* Protocol not supported */
#define ESOCKTNOSUPPORT 63      /* Socket type not supported */
#define EOPNOTSUPP      64      /* Operation not supported on socket */
#define EPFNOSUPPORT    65      /* Protocol family not supported */
#define EAFNOSUPPORT    66      /* Address family not supported by protocol family */
#define EADDRINUSE      67      /* Address already in use */
#define EADDRNOTAVAIL   68      /* Can't assign requested address */

        /* operational errors */
#define ENETDOWN        69      /* Network is down */
#define ENETUNREACH     70      /* Network is unreachable */
#define ENETRESET       71      /* Network dropped connection on reset */
#define ECONNABORTED    72      /* Software caused connection abort */
#define ECONNRESET      73      /* Connection reset by peer */
#define ENOBUFS         74      /* No buffer space available */
#define EISCONN         75      /* Socket is already connected */
#define ENOTCONN        76      /* Socket is not connected */
#define ESHUTDOWN       77      /* Can't send after socket shutdown */

#define ETIMEDOUT       78      /* Connection timed out */
#define ECONNREFUSED    79      /* Connection refused */

#define EHOSTDOWN       80      /* Host is down */
#define EHOSTUNREACH    81      /* No route to host */

/* ERESTART is used to determine if the system call is restartable */
#define ERESTART        82      /* restart the system call */

/* quotas and limits */
#define EPROCLIM        83      /* Too many processes */
#define EUSERS          84      /* Too many users */
#define ELOOP           85      /* Too many levels of symbolic links      */
#define ENAMETOOLONG    86      /* File name too long                     */

/*
 * AIX returns EEXIST where 4.3BSD used ENOTEMPTY;
 * but, the standards insist on unique errno values for each errno.
 * A unique value is reserved for users that want to code case
 * statements for systems that return either EEXIST or ENOTEMPTY.
 */
#if defined(_ALL_SOURCE) && !defined(_LINUX_SOURCE_COMPAT)
#define ENOTEMPTY       EEXIST  /* Directory not empty */
#else   /* not _ALL_SOURCE */
#define ENOTEMPTY       87
#endif  /* _ALL_SOURCE */

/* disk quotas */
#define EDQUOT          88      /* Disc quota exceeded */

#define ECORRUPT        89      /* Invalid file system control data */

/* errnos 90-92 reserved for future use compatible with AIX PS/2 */

/* network file system */
#define EREMOTE         93      /* Item is not local to host */

/* errnos 94-108 reserved for future use compatible with AIX PS/2 */

#define ENOSYS          109     /* Function not implemented  POSIX */

/* disk device driver */
#define EMEDIA          110     /* media surface error */
#define ESOFT           111     /* I/O completed, but needs relocation */

/* security */
#define ENOATTR         112     /* no attribute found */
#define ESAD            113     /* security authentication denied */
#define ENOTRUST        114     /* not a trusted program */

/* BSD 4.3 RENO */
#define ETOOMANYREFS    115     /* Too many references: can't splice */

#define EILSEQ          116     /* Invalid wide character */
#define ECANCELED       117     /* asynchronous i/o cancelled */

/* SVR4 STREAMS */
#define ENOSR           118     /* temp out of streams resources */
#define ETIME           119     /* I_STR ioctl timed out */
#define EBADMSG         120     /* wrong message type at stream head */
#define EPROTO          121     /* STREAMS protocol error */
#define ENODATA         122     /* no message ready at stream head */
#define ENOSTR          123     /* fd is not a stream */

#define ECLONEME        ERESTART /* this is the way we clone a stream ... */

#define ENOTSUP         124     /* POSIX threads unsupported value */

#define EMULTIHOP       125     /* multihop is not allowed */
#define ENOLINK         126     /* the link has been severed */
#define EOVERFLOW       127     /* value too large to be stored in data type */

#endif /* _ANSI_C_SOURCE */

#endif /* _H_ERRNO */
albert@starboss:/usr/include/sys $


albert@starboss:/usr/include $ file sysexits.h
sysexits.h: ascii text
albert@starboss:/usr/include $ cat sysexits.h
/* IBM_PROLOG_BEGIN_TAG                                                   */
/* This is an automatically generated prolog.                             */
/*                                                                        */
/* bos530 src/bos/usr/include/sysexits.h 1.6                              */
/*                                                                        */
/* Licensed Materials - Property of IBM                                   */
/*                                                                        */
/* (C) COPYRIGHT International Business Machines Corp. 1989,1991          */
/* All Rights Reserved                                                    */
/*                                                                        */
/* US Government Users Restricted Rights - Use, duplication or            */
/* disclosure restricted by GSA ADP Schedule Contract with IBM Corp.      */
/*                                                                        */
/* IBM_PROLOG_END_TAG                                                     */
/* @(#)30       1.6  src/bos/usr/include/sysexits.h, incstd, bos530 6/16/90 00:14:57 */
#ifndef _H_SYSEXITS
#define _H_SYSEXITS
/*
 * COMPONENT_NAME: (INCSTD) Standard Include Files
 *
 * FUNCTIONS:
 *
 * ORIGINS: 27
 *
 * (C) COPYRIGHT International Business Machines Corp. 1989
 * All Rights Reserved
 * Licensed Materials - Property of IBM
 *
 * US Government Users Restricted Rights - Use, duplication or
 * disclosure restricted by GSA ADP Schedule Contract with IBM Corp.
 */

/*
**  SYSEXITS.H -- Exit status codes for system programs.
**
**      This include file attempts to categorize possible error
**      exit statuses for system programs, notably delivermail
**      and the Berkeley network.
**
**      Error numbers begin at EX__BASE to reduce the possibility of
**      clashing with other exit statuses that random programs may
**      already return.  The meaning of the codes is approximately
**      as follows:
**
**      EX_USAGE -- The command was used incorrectly, e.g., with
**              the wrong number of arguments, a bad flag, a bad
**              syntax in a parameter, or whatever.
**      EX_DATAERR -- The input data was incorrect in some way.
**              This should only be used for user's data & not
**              system files.
**      EX_NOINPUT -- An input file (not a system file) did not
**              exist or was not readable.  This could also include
**              errors like "No message" to a mailer (if it cared
**              to catch it).
**      EX_NOUSER -- The user specified did not exist.  This might
**              be used for mail addresses or remote logins.
**      EX_NOHOST -- The host specified did not exist.  This is used
**              in mail addresses or network requests.
**      EX_UNAVAILABLE -- A service is unavailable.  This can occur
**              if a support program or file does not exist.  This
**              can also be used as a catchall message when something
**              you wanted to do doesn't work, but you don't know
**              why.
**      EX_SOFTWARE -- An internal software error has been detected.
**              This should be limited to non-operating system related
**              errors as possible.
**      EX_OSERR -- An operating system error has been detected.
**              This is intended to be used for such things as "cannot
**              fork", "cannot create pipe", or the like.  It includes
**              things like getuid returning a user that does not
**              exist in the passwd file.
**      EX_OSFILE -- Some system file (e.g., /etc/passwd, /etc/utmp,
**              etc.) does not exist, cannot be opened, or has some
**              sort of error (e.g., syntax error).
**      EX_CANTCREAT -- A (user specified) output file cannot be
**              created.
**      EX_IOERR -- An error occurred while doing I/O on some file.
**      EX_TEMPFAIL -- temporary failure, indicating something that
**              is not really an error.  In sendmail, this means
**              that a mailer (e.g.) could not create a connection,
**              and the request should be reattempted later.
**      EX_PROTOCOL -- the remote system returned something that
**              was "not possible" during a protocol exchange.
**      EX_NOPERM -- You did not have sufficient permission to
**              perform the operation.  This is not intended for
**              file system problems, which should use NOINPUT or
**              CANTCREAT, but rather for higher level permissions.
**              For example, kre uses this to restrict who students
**              can send mail to.
**
*/

# define EX_OK          0       /* successful termination */

# define EX__BASE       64      /* base value for error messages */

# define EX_USAGE       64      /* command line usage error */
# define EX_DATAERR     65      /* data format error */
# define EX_NOINPUT     66      /* cannot open input */
# define EX_NOUSER      67      /* addressee unknown */
# define EX_NOHOST      68      /* host name unknown */
# define EX_UNAVAILABLE 69      /* service unavailable */
# define EX_SOFTWARE    70      /* internal software error */
# define EX_OSERR       71      /* system error (e.g., can't fork) */
# define EX_OSFILE      72      /* critical OS file missing */
# define EX_CANTCREAT   73      /* can't create (user) output file */
# define EX_IOERR       74      /* input/output error */
# define EX_TEMPFAIL    75      /* temp failure; user is invited to retry */
# define EX_PROTOCOL    76      /* remote error in protocol */
# define EX_NOPERM      77      /* permission denied */
# define EX_CONFIG      78      /* configuration error */
# define EX_DB          79      /* database access error */

#endif /* _H_SYSEXITS */


>>>> For Solaris:


1 EPERM Not superuser
Typically this error indicates an attempt to modify a file in some way forbidden except to its owner or the super-user. It is also returned for attempts by ordinary users to do things allowed only to the super-user.

2 ENOENT No such file or directory
A file name is specified and the file should exist but doesn't, or one of the directories in a path name does not exist.

3 ESRCH No such process, LWP, or thread
No process can be found in the system that corresponds to the specified PID, LWPID_t, or thread_t.

4 EINTR Interrupted system call
An asynchronous signal (such as interrupt or quit), which the user has elected to catch, occurred during a system service routine. If execution is resumed after processing the signal, it will appear as if the interrupted routine call returned this error condition. In a multi-threaded application, EINTR may be returned whenever another thread or LWP calls fork(2).

5 EIO I/O error
Some physical I/O error has occurred. This error may in some cases occur on a call following the one to which it actually applies.

6 ENXIO No such device or address
I/O on a special file refers to a subdevice which does not exist, or exists beyond the limit of the device. It may also occur when, for example, a tape drive is not on-line or no disk pack is loaded on a drive.

7 E2BIG Arg list too long
An argument list longer than ARG_MAX bytes is presented to a member of the exec family of routines. The argument list limit is the sum of the size of the argument list plus the size of the environment's exported shell variables.

8 ENOEXEC Exec format error
A request is made to execute a file which, although it has the appropriate permissions, does not start with a valid format (see a.out(4)).

9 EBADF Bad file number
Either a file descriptor refers to no open file, or a read (respectively, write) request is made to a file that is open only for writing (respectively, reading).

10 ECHILD No child processes
A wait routine was executed by a process that had no existing or unwaited-for child processes.

11 EAGAIN No more processes, or no more LWPs
For example, the fork routine failed because the system's process table is full or the user is not allowed to create any more processes, or a system call failed because of insufficient memory or swap space.

12 ENOMEM Not enough space
During execution of an exec, brk, or sbrk routine, a program asks for more space than the system is able to supply. This is not a temporary condition; the maximum size is a system parameter. On some architectures, the error may also occur if the arrangement of text, data, and stack segments requires too many segmentation registers, or if there is not enough swap space during the fork routine. If this error occurs on a resource associated with Remote File Sharing (RFS), it indicates a memory depletion which may be temporary, dependent on system activity at the time the call was invoked.

13 EACCES Permission denied
An attempt was made to access a file in a way forbidden by the protection system.

14 EFAULT Bad address
The system encountered a hardware fault in attempting to use an argument of a routine. For example, errno potentially may be set to EFAULT any time a routine that takes a pointer argument is passed an invalid address, if the system can detect the condition. Because systems will differ in their ability to reliably detect a bad address, on some implementations passing a bad address to a routine will result in undefined behavior.

15 ENOTBLK Block device required
A non-block device or file was mentioned where a block device was required (for example, in a call to the mount routine).

16 EBUSY Device busy
An attempt was made to mount a device that was already mounted or an attempt was made to unmount a device on which there is an active file (open file, current directory, mounted-on file, active text segment). It will also occur if an attempt is made to enable accounting when it is already enabled. The device or resource is currently unavailable. EBUSY is also used by mutexes, semaphores, condition variables, and r/w locks, to indicate that a lock is held. And, EBUSY is also used by the processor control function P_ONLINE.

17 EEXIST File exists
An existing file was mentioned in an inappropriate context (for example, call to the link routine).

18 EXDEV Cross-device link
A hard link to a file on another device was attempted.

19 ENODEV No such device
An attempt was made to apply an inappropriate operation to a device (for example, read a write-only device).

20 ENOTDIR Not a directory
A non-directory was specified where a directory is required (for example, in a path prefix or as an argument to the chdir routine).

21 EISDIR Is a directory
An attempt was made to write on a directory.

22 EINVAL Invalid argument
An invalid argument was specified (for example, unmounting a non-mounted device), mentioning an undefined signal in a call to the signal or kill routine.

23 ENFILE File table overflow
The system file table is full (that is, SYS_OPEN files are open, and temporarily no more files can be opened).

24 EMFILE Too many open files
No process may have more than OPEN_MAX file descriptors open at a time.

25 ENOTTY Inappropriate ioctl for device
A call was made to the ioctl routine specifying a file that is not a special character device.

26 ETXTBSY Text file busy (obsolete)
An attempt was made to execute a pure-procedure program that is currently open for writing. Also an attempt to open for writing or to remove a pure-procedure program that is being executed. (This message is obsolete.)

27 EFBIG File too large
The size of the file exceeded the limit specified by resource RLIMIT_FSIZE; the file size exceeds the maximum supported by the file system; or the file size exceeds the offset maximum of the file descriptor. See the File Descriptor subsection of the DEFINITIONS section below.

28 ENOSPC No space left on device
While writing an ordinary file or creating a directory entry, there is no free space left on the device. In the fcntl routine, the setting or removing of record locks on a file cannot be accomplished because there are no more record entries left on the system.

29 ESPIPE Illegal seek
A call to the lseek routine was issued to a pipe.

30 EROFS Read-only file system
An attempt to modify a file or directory was made on a device mounted read-only.

31 EMLINK Too many links
An attempt to make more than the maximum number of links, LINK_MAX, to a file.

32 EPIPE Broken pipe
A write on a pipe for which there is no process to read the data. This condition normally generates a signal; the error is returned if the signal is ignored.

33 EDOM Math argument out of domain of func
The argument of a function in the math package (3M) is out of the domain of the function.

34 ERANGE Math result not representable
The value of a function in the math package (3M) is not representable within machine precision.

35 ENOMSG No message of desired type
An attempt was made to receive a message of a type that does not exist on the specified message queue (see msgrcv(2)).

36 EIDRM Identifier removed
This error is returned to processes that resume execution due to the removal of an identifier from the file system's name space (see msgctl(2), semctl(2), and shmctl(2)).

37 ECHRNG Channel number out of range

38 EL2NSYNC Level 2 not synchronized

39 EL3HLT Level 3 halted

40 EL3RST Level 3 reset

41 ELNRNG Link number out of range

42 EUNATCH Protocol driver not attached

43 ENOCSI No CSI structure available

44 EL2HLT Level 2 halted

45 EDEADLK Deadlock condition
A deadlock situation was detected and avoided. This error pertains to file and record locking, and also applies to mutexes, semaphores, condition variables, and r/w locks.

46 ENOLCK No record locks available
There are no more locks available. The system lock table is full (see fcntl(2)).

47 ECANCELED Operation canceled
The associated asynchronous operation was canceled before completion.

48 ENOTSUP Not supported
This version of the system does not support this feature. Future versions of the system may provide support.

49 EDQUOT Disc quota exceeded
A write() to an ordinary file, the creation of a directory or symbolic link, or the creation of a directory entry failed because the user's quota of disk blocks was exhausted, or the allocation of an inode for a newly created file failed because the user's quota of inodes was exhausted.

58-59 Reserved

60 ENOSTR Device not a stream
A putmsg or getmsg system call was attempted on a file descriptor that is not a STREAMS device.

61 ENODATA No data available

62 ETIME Timer expired
The timer set for a STREAMS ioctl call has expired. The cause of this error is device-specific and could indicate either a hardware or software failure, or perhaps a timeout value that is too short for the specific operation. The status of the ioctl operation is indeterminate. This is also returned in the case of _lwp_cond_timedwait() or cond_timedwait().

63 ENOSR Out of stream resources
During a STREAMS open, either no STREAMS queues or no STREAMS head data structures were available. This is a temporary condition; one may recover from it if other processes release resources.

64 ENONET Machine is not on the network
This error is Remote File Sharing (RFS) specific. It occurs when users try to advertise, unadvertise, mount, or unmount remote resources while the machine has not done the proper startup to connect to the network.

65 ENOPKG Package not installed
This error occurs when users attempt to use a system call from a package which has not been installed.

66 EREMOTE Object is remote
This error is RFS-specific. It occurs when users try to advertise a resource which is not on the local machine, or try to mount/unmount a device (or pathname) that is on a remote machine.

67 ENOLINK Link has been severed
This error is RFS-specific. It occurs when the link (virtual circuit) connecting to a remote machine is gone.

68 EADV Advertise error
This error is RFS-specific. It occurs when users try to advertise a resource which has been advertised already, or try to stop RFS while there are resources still advertised, or try to force unmount a resource when it is still advertised.

69 ESRMNT Srmount error
This error is RFS-specific. It occurs when an attempt is made to stop RFS while resources are still mounted by remote machines, or when a resource is readvertised with a client list that does not include a remote machine that currently has the resource mounted.

70 ECOMM Communication error on send
This error is RFS-specific. It occurs when the current process is waiting for a message from a remote machine, and the virtual circuit fails.

71 EPROTO Protocol error
Some protocol error occurred. This error is device-specific, but is generally not related to a hardware failure.

74 EMULTIHOP Multihop attempted
This error is RFS-specific. It occurs when users try to access remote resources which are not directly accessible.

76 EDOTDOT Error 76
This error is RFS-specific. A way for the server to tell the client that a process has transferred back from mount point.

77 EBADMSG Not a data message
During a read, getmsg, or ioctl I_RECVFD system call to a STREAMS device, something has come to the head of the queue that can not be processed. That something depends on the system call:
read: control information or passed file descriptor.
getmsg: passed file descriptor.
ioctl: control or data information.

78 ENAMETOOLONG File name too long
The length of the path argument exceeds PATH_MAX, or the length of a path component exceeds NAME_MAX while _POSIX_NO_TRUNC is in effect; see limits(4).

79 EOVERFLOW
Value too large for defined data type.

80 ENOTUNIQ Name not unique on network
Given log name not unique.

81 EBADFD File descriptor in bad state
Either a file descriptor refers to no open file or a read request was made to a file that is open only for writing.

82 EREMCHG Remote address changed

83 ELIBACC Cannot access a needed shared library
Trying to exec an a.out that requires a static shared library and the static shared library does not exist or the user does not have permission to use it.

84 ELIBBAD Accessing a corrupted shared library
Trying to exec an a.out that requires a static shared library (to be linked in) and exec could not load the static shared library. The static shared library is probably corrupted.

85 ELIBSCN .lib section in a.out corrupted
Trying to exec an a.out that requires a static shared library (to be linked in) and there was erroneous data in the .lib section of the a.out. The .lib section tells exec what static shared libraries are needed. The a.out is probably corrupted.

86 ELIBMAX Attempting to link in more shared libraries than system limit
Trying to exec an a.out that requires more static shared libraries than is allowed on the current configuration of the system. See NFS AdministrationGuide.

87 ELIBEXEC Cannot exec a shared library directly
Attempting to exec a shared library directly.

88 EILSEQ Error 88
Illegal byte sequence. Handle multiple characters as a single character.

89 ENOSYS Operation not applicable

90 ELOOP Number of symbolic links encountered during path name traversal exceeds MAXSYMLINKS

91 ESTART Restartable system call
Interrupted system call should be restarted.

92 ESTRPIPE If pipe/FIFO, don't sleep in stream head
Streams pipe error (not externally visible).

93 ENOTEMPTY Directory not empty

94 EUSERS Too many users

95 ENOTSOCK Socket operation on non-socket

96 EDESTADDRREQ Destination address required
A required address was omitted from an operation on a transport endpoint. Destination address required.

97 EMSGSIZE Message too long
A message sent on a transport provider was larger than the internal message buffer or some other network limit.

98 EPROTOTYPE Protocol wrong type for socket
A protocol was specified that does not support the semantics of the socket type requested.

99 ENOPROTOOPT Protocol not available
A bad option or level was specified when getting or setting options for a protocol.

120 EPROTONOSUPPORT Protocol not supported
The protocol has not been configured into the system or no implementation for it exists.

121 ESOCKTNOSUPPORT Socket type not supported
The support for the socket type has not been configured into the system or no implementation for it exists.

122 EOPNOTSUPP Operation not supported on transport endpoint
For example, trying to accept a connection on a datagram transport endpoint.

123 EPFNOSUPPORT Protocol family not supported
The protocol family has not been configured into the system or no implementation for it exists. Used for the Internet protocols.

124 EAFNOSUPPORT Address family not supported by protocol family
An address incompatible with the requested protocol was used.

125 EADDRINUSE Address already in use
User attempted to use an address already in use, and the protocol does not allow this.

126 EADDRNOTAVAIL Cannot assign requested address
Results from an attempt to create a transport endpoint with an address not on the current machine.

127 ENETDOWN Network is down
Operation encountered a dead network.

128 ENETUNREACH Network is unreachable
Operation was attempted to an unreachable network.

129 ENETRESET Network dropped connection because of reset
The host you were connected to crashed and rebooted.

130 ECONNABORTED Software caused connection abort
A connection abort was caused internal to your host machine.

131 ECONNRESET Connection reset by peer
A connection was forcibly closed by a peer. This normally results from a loss of the connection on the remote host due to a timeout or a reboot.

132 ENOBUFS No buffer space available
An operation on a transport endpoint or pipe was not performed because the system lacked sufficient buffer space or because a queue was full.

133 EISCONN Transport endpoint is already connected
A connect request was made on an already connected transport endpoint; or, a sendto or sendmsg request on a connected transport endpoint specified a destination when already connected.

134 ENOTCONN Transport endpoint is not connected
A request to send or receive data was disallowed because the transport endpoint is not connected and (when sending a datagram) no address was supplied.

143 ESHUTDOWN Cannot send after transport endpoint shutdown
A request to send data was disallowed because the transport endpoint has already been shut down.

144 ETOOMANYREFS Too many references: cannot splice

145 ETIMEDOUT Connection timed out
A connect or send request failed because the connected party did not properly respond after a period of time; or a write or fsync request failed because a file is on an NFS file system mounted with the __s__o__f__t option.

146 ECONNREFUSED Connection refused
No connection could be made because the target machine actively refused it. This usually results from trying to connect to a service that is inactive on the remote host.

147 EHOSTDOWN Host is down
A transport provider operation failed because the destination host was down.

148 EHOSTUNREACH No route to host
A transport provider operation was attempted to an unreachable host.

149 EALREADY Operation already in progress
An operation was attempted on a non-blocking object that already had an operation in progress.

150 EINPROGRESS Operation now in progress
An operation that takes a long time to complete (such as a connect) was attempted on a non-blocking object.

151 ESTALE Stale NFS file handle


##############################################################

SECTION 6: SOLARIS (and GENERIC) Errors:

##############################################################


Section is devided in PARTS 1,2,3  


>>>> PART 1 <<<<
================


A command window has exited because its child exited. 
=====================================================

The argument to a cmdtool(1) or a shelltool(1) window looks like
it is supposed to be a command, but the system cannot find the
command.

To run this command inside a cmdtool or a shelltool, make sure
the command is spelled correctly and is in your search path (if
necessary, use a full path name). If you intended this argument
as an option setting, use a minus sign (-) at the beginning of
the option.

Both the cmdtool and the shelltool are OpenWindows terminal
emulators.

admintool: Received communication service error 4 
=================================================

AdminTool could not start a display method because a remote
procedure call timed out, so it can't send the request. This
error results when admintool tries to access the NIS or NIS+
tables when networking is not enabled.

Verify the system network status with ifconfig -a to make sure
the system is connected to the network. Make sure the ethernet
cable is connected and the system is configured to run NIS or
NIS+.

answerbook: XView error: NULL pointer passed to xv_set 
======================================================

The AnswerBook navigator window comes up, but the document viewer
window does not. This message appears on the console, and the
message "Could not start new viewer" appears in the navigator
window. This situation indicates that you have an unknown client
or a problem with the network naming service.

Run the ypmatch(1) or nismatch(1) command o determine if the
client hostname is in the hosts map. If it isn't, add it to to
NIS hosts map on the NIS master server. Then make sure the
/etc/hosts file on the client contains an IP address and entry
for that hostname followed by loghost (reboot if you changed the
/etc/hosts file). Check that the ypmatch or nismatch client hosts
command returns the same IP host address as in the /etc/hosts
file. Finally, quit all existing AnswerBooks and restart.

For more information on the NIS hosts map, see the section on the
default search criteria in the NIS+ and FNS Administration Guide.
If you are using the AnswerBook, "NIS hosts map" is a good search
string.

Arg list too long 
=================

The system could not handle the number of arguments given to a
command or program when it combined those arguments with the
environment's exported shell variables. The argument list limit
is the size of the argument list plus the size of the
environment's exported shell variables.

The easiest solution is to reduce the size of the parent process
environment by unsetting extraneous environment variables. (See
the man page for the shell you're using to find out how to list
and change your environment variables.) Then run the program
again.

An argument list longer than ARG_MAX bytes was presented to a
member of the exec() family of system calls.

The symbolic name for this error is E2BIG, errno=7.

Argument out of domain 
======================

This is a programming error or a data input error.

Ask the program's author to fix this condition,or supply data in
a different format.

This indicates an attempt to evaluate a mathematical programming
function at a point where its value is not defined. The argument
of a programming function in the math package (3M) is out of the
domain of the function. This could happen when taking the square
root, power, or log of a negative number, when computing a power
to a non-integer, or when passing an out-of-range argument to a
hyperbolic programming function.

To help pinpoint a program's math errors, use the matherr(3M)
facility.

The symbolic name for this error is EDOM, errno=33.

Arguments too long 
==================

This C shell error message indicates that there are too many
arguments after a command. For example, this can happen by
invoking rm * in a huge directory. The C shell cannot handle more
than 1706 arguments.

Temporarily start a Bourne shell with sh and run the command
again. The Bourne shell dynamically allocates command line
arguments. Return to your original shell by typing exit.

assertion failed: variable, file variable, line N 
=================================================

A condition in the program that was never expected to happen has
happened.

Contact the vendor or author of the program to ask why it failed.
If you have the source code for the program, you can look at the
file and line number where the assertion failed. This might give
you an idea of how to run the program differently.

This message results from a diagnostic macro called assert() that
a programmer inserted into the specified line of a source file.
The expression that evaluated untrue precedes the file name and
line number.

automountd[N]: No network locking on variable:  
contact admin to install server change 
======================================= 

See "WARNING: No network locking on variable: contact admin to
install server" message for details. If the server is not
changed, data loss is possible in applications that depend on
locking.

automountd[N]: server variable not responding 
=============================================

This automounter message indicates that the system tried to mount
a filesystem from an NFS server that is either down or extremely
slow to respond. In some cases this message indicates that the
network link to the NFS server is broken, although that condition
produces other error messages as well.

If you are the system administrator responsible for the non-
responding NFS server, check it out to see whether the machine
needs repair or rebooting. Encourage your user community to
report such problems quickly but only once. When the NFS server
is back in operation, the automounter will be able to access the
requested file system.

For more information on NFS failures, seethe section on NFS
troubleshooting in the NFS Administration Guide. If you are using
the AnswerBook, a good search string is "NFS Service."

automount[N]: variable: Not a directory 
=======================================

The file specified after the first colon is not a valid mount
point because it is not a directory.

Ensure that the mount point is a directory, and not a regular
file or a symbolic link.

Bad address 
===========

The system encountered a hardware fault in attempting to access a
parameter of a programming function.

Check if the bad address resulted from supplying the wrong device
or option to a command. If that is not the problem, contact the
vendor or author of the program for an update.

This error could occur any time a function that takes a pointer
argument is passed an invalid address. Because processors differ
in their ability to detect bad addresses, on some architectures
passing bad addresses can result in undefined behaviors.

The symbolic name for this error is EFAULT,errno=14.

BAD/DUP FILE I=i OWNER=o MODE=m SIZE=s MIME ==== CLEAR? 

While checking anode link counts during phase 4, fsck(1M) found a
file (or directory) that either does not exist or exists
somewhere else.

To clear the anode of its reference to this file or directory,
answer yes. With the -p (preen) option, fsck automatically clears
bad or duplicate file references, so answering yes to this
question seldom causes a problem.

Bad file number 
===============

Generally this is a program error, not a usage error.

Contact the vendor or author of the program for an update.

Either a file descriptor refers to no open file, or a read (or
write) request is made to a file that is open only for writing
(or reading).

The symbolic name for this error is EBADF, errno=9.

N BAD I=N 
=========

Upon detecting an out-of-range block, fsck(1M) prints the bad
block number and its containing inode (after I=).

In fsck phases 2 and 4, you will decide whether ornot to clear
these bad blocks.  Before committing to repair with fsck, you
could determine which file contains this inode by passing the
inode number to the ncheck(1M) command: by passing the inode
number to the ncheck(1M) command:

# ncheck -iinum file system

For more information, see the chapter on checking file system
integrity in the System Administration Guide, Volume I.

bad module/chip at: variable 
============================

This message from the memory management system often appears with
parity errors, and indicates a bad memory module or chip at the
position listed. Data loss is possible if the problem occurs
other than at boot time.

Replace the memory module or chip at the indicated position.
Refer to the vendor's hardware manual for help finding this
location.

BAD SUPER BLOCK: variable 
=========================

This message from fsck(1M) indicates that a filesystem's super-
block is damaged beyond repair and must be replaced. At boot time
(with the -p option) this message is prefaced by the file system's
device name. After this message comes the actual damage
recognized (see Action). Unfortunately fsck does not print the
number of the damaged super-block.

The most common cause of this error is overlapping disk
partitions. Donot immediately rerun fsck as suggested by the
lines that display after the error message.  First make sure that
you have a recent backup of the file system involved; if not, try
to back up the file system now using ufsdump(1M). Then run the
format(1M) command, select the disk involved, and print out the
partition information.

# format : N > partition > print

Note whether the overlap occurs at the beginning or end of the
file system involved.  Then run newfs(1M) with the -N option to
print out the file system parameters, including the location of
backup super-blocks.

# newfs -N /dev/dsk/device

Select a super-block from a non-overlapping area of the disk, but
note that in most cases you have only one chance to select the
proper replacement super-block, which fsck soon propagates to all
the cylinders. If you select the wrong replacement super-block,
data corruption will probably occur, and you will have to restore
from backup tapes.  After you select a new super-block, provide
fsck with the new master super-block number:

# fsck -o b=NNNN /dev/dsk/device

Specific reasons for a damaged super-block include: a wrong magic
number, out of range NCG (number of cylinder groups) or CPG
(cylinders per group), the wrong number of cylinders, a
preposterously large super-block size, and trashed values in
super-block. These reasons are generally not meaningful because a
corrupt super-block is usually extremely corrupt.

For more information on bad super blocks, see the sections on
restoring bad super blocks in the System Administration Guide,
Volume I. If you are using the AnswerBook, "superblock" is a good
search string.

BAD TRAP 
========

A bad trap can indicate faulty hardware or a mismatch between
hardware and its configuration information. Data loss is possible
if the problem occurs other than at boot time.

If you recently installed new hardware, verify that the software
was correctly configured. Check the kernel trace back displayed on
the console to see which device generated the trap. If the
configuration files are correct, you will probably have to
replace the device.

In some cases, the bad trap message indicates a bad or down-rev
CPU.

A hardware processor trap occurred, and the kernel trap handler
was unable to restore system state. This is a fatal error that
usually precedes a panic, after which the system performs a sync,
dump, and reboot. The following conditions can cause a bad trap:
a system text or data access fault, a system data alignment
error, or certain kinds of user software traps.

bad trap = N 
============

See the message "BAD TRAP" for details.

/bin/sh: variable: too big 
==========================

This Bourne shell message indicates a classic "no memory" error.
While trying to load the program specified after thefirstcolon,
the shell noticed that the system ran out of virtual memory (swap
space).

See the message "Not enough space" for information on
reconfiguring your system to add more swap space.

Block device required 
=====================

A raw (character special) device was specified where a block
device was required, such as during a call to the mount(1M)
command.

To see which block devices are available, use ls -l to look in
/devices. Then specify a block device instead of a character
device. Block device modes start with a b, whereas raw character
device modes start with a c.

The symbolic name for this error is ENOTBLK, errno=15.

Boot device: /iommu/sbus/variable/variable/sd@3,0 
=================================================

This message alwaysappears at the beginning of rebooting. If
there is a problem, the system hangs, and no other messages
appear. This condition is caused by conflicting SCSI targets for
the boot device, which is almost always target 3.

The boot device is usually the machine's internal disk drive,
target 3. Make sure that external and secondary disk drives are
targeted to 1, 2, or 0, and do not conflict with each other. Also
make sure that tape drives are targeted to 4 or 5, and CD drives
to 6, avoiding any conflict with each other or with the disk
drives. You can set a device's target number using pushbutton
switches or a dial on the back near the SCSI cables. If the
targeting of the internal disk drive is in question, check it by
powering off the machine, removing all external drives, turning
the power on, and running the probe-scsi-all or probe-scsi
command from the PROM monitor.

Broadcast Message from root (pts/N) on server [date] 
====================================================

This message from the wall(1M) command gets transmitted to all
users logged into a system. You could see it during a rlogin or
telnet session, or on terminals connected to a timesharing
system.

Carefully read the broadcast message. Often this broadcast is
followed by a shutdown warning.

See the message "The system will be shut down in N minutes" for
details about system shutdown.

For more information on bringing down the system, see the section
on halting the system in the System Administration Guide, Volume
I. If you are using the AnswerBook, "halting the system" is a
good search string.

Broken pipe 
===========

This condition is often normal, and the message is merely
informational (as when piping many lines to the head program).
The condition occurs when a write on a pipe does not find a
reading process. This usually generates a signal to the executing
program, but this message displays when the program ignores the
signal.

Check the process at the end of the pipe to see why it exited.

The symbolic name for this error is EPIPE, errno=32.

Bus Error 
=========

A process has received a signal indicating that it attempted to
perform I/O to a device that is restricted or that does not
exist. This message is usually accompanied by a core dump, except
on read-only filesystems.

Use a debugger to examine the core file and determine what
program fault or system problem led to the bus error. If
possible, check the program's output files for data corruption
that might have occurred before the bus error.

Bus errors can result from either programming error or device
corruption on your system. Some common causes of bus errors are:
invalid file descriptors, unreasonable I/O requests, bad memory
allocation, misaligned data structures, compiler bugs, and
corrupt boot blocks.

Cannot allocate color map entry for "variable" 
=============================================

This message from libXt (X Intrinsics library) indicates that the
system color map was full even before the color name specified in
quotes was requested. Some applications can continue after this
message. Other applications, such as Workspace Properties Color,
fail to come up when the color map is full.

Exit the programs that make heavy use of the color map, then
restart the failed application and try again.

Can't create public message device (Device busy) 
================================================

This message comes from the lp print scheduler, indicating that
it is either extremely busy or hung.

If print jobs are coming out of the printer in question, wait
until they are finished and then resubmit this print job. If you
see this message again, the lp system is probably hung.

See the message "lp hang" for a procedure to clear the queue.

If lp is unable to create a device for printer messages, the
message FIFO could be already in use, or locked by another print
job.

For more information on the print scheduler, see the section on
administrating printers in the System Administration Guide Volume
II.

Can't invoke /etc/init, error N 
===============================

This message can appear while a system is booting, indicating
that the init program is missing or corrupted. Note that
/etc/init is a symbolic link to /sbin/init.

Boot the miniroot so you can replace init. Halt the machine by
typing Stop-A or by pressing the reset button. Reboot single-user
from CDROM, the net, or diskette. For example, type boot cdrom -s
at the ok prompt to boot from CDROM. After the system comes up
and gives you a # prompt, mount the device corresponding to the
original / partition somewhere, with a command similar to the
mount command below. Then copy the init program from the miniroot
to the original / partition, and reboot the system.

# mount /dev/dsk/c0t3d0s0 /mnt # cp /sbin/init /mnt/sbin/init #
reboot

If this doesn't work, other files might be corrupted, and you
might need to reinstall the entire system.

The error number is 2 if /sbin/init is missing, or 8 if
/sbin/init has an incorrect executable format. This is usually
followed by a "panic:icode" message. The system tries to reboot
itself, but goes into a loop, because rebooting is impossible
without init.

For more information on booting the system, see the section on
halting and booting the system in the System Administration
Guide, Volume I.

can't synchronize with hayes 
============================

This message sometimes appears when using a modem that the system
regards as a "Hayes" type modem, which includes most modems
manufactured today. The message can be caused by incorrect switch
settings, by poor cable connections, or by not turning the modem
on.

Check that the modem is on and that the cables between the modem
and your system are securely connected. Check the internal and
external modem switch settings. Turn the modem off and then on
again, if necessary.

cd: Too many arguments 
======================

The C shell's cd(1) command takes only one argument. Either more
than one directory was specified, or a directory name containing
a space was specified.  Directory names with spaces are easy to
create with File Manager.

Use only one directory name. To change to a directory whose name
contains spaces, enclose the directory name in double (") or
single (') quotes, or use File Manager.

Channel number out of range 
===========================

The system has run out of stream devices. This error results when
a stream head attempts to open a minor device that does not exist
or that is currently in use.

Check that the stream device in question exists and was created
with an appropriate number of minor devices. Make sure that the
hardware corresponds to this configuration. If the stream device
configuration is correct, try again later when more system
resources might be available.

The symbolic name for this error is ECHRNG, errno=37.

chmod: ERROR: invalid mode 
==========================

This message from the chmod(1) command indicates a problem in the
first non-option argument.

If you are specifying a numeric file mode, you can provide any
number of digits (although only the final one to four are
considered), but all digits must be between 0 and 7. If you are
specifying a symbolic file mode, use the syntax provided in the
chmod usage message to avoid the "invalid mode" error message:

Usage: chmod [ugoa][+-=][rwxlstugo] file ...

Note that some combinations of symbolic keyletters produce no
error message but fail to have any effect. The first group,
[ugoa], is truly optional. The second group, [+-=], is mandatory
for chmod to have an effect. The third group,[rwxlstugo], is
also mandatory for effect, and can be used in combination when
that combination does not conflict.

Command not found 
=================

The C shell could not find the program you gave as a command.

Check the form and spelling of the command line. If that looks
correct, echo $path to see if the user's search path is correct.
When communications are garbled, it is possible to unset a search
path to such an extent that only built-in shell commands are
available. Here is a command to reset a basic search path:

 % set path = (/usr/bin /usr/ccs/bin /usr/openwin/bin .)

If the search path looks correct, check the directory contents
along the search path to see if programs are missing or if
directories are not mounted.

For more information about the C shell, see csh(1).

Connection closed. 
==================

This message can appear when using rlogin(1) to another system if
the remote host cannot create a process for this user, if the
user takes too long to type the correct password, if the user
interrupts the network connection, or if the remote host goes
down. Data loss is possible if files were modified and not saved
before the connection closed.

Just try again. If the other system has gone down, wait for it to
reboot first.

Connection closed by foreign host. 
==================================

When a user telnets to another system, this message can appear if
the user takes too long to type the correct password, if the
remote host cannot create a login for this user,or if the remote
host goes down or terminates the connection. Data loss is
possible if files were modified and not saved before the
connection closed.

Just try again. If the other system has gone down, wait for it to
reboot first.

[Connection closed. Exiting] 
============================

After using the talk(1) command to communicate with another user,
the other person enters an interrupt (usually Control-c), and
this message appears on your screen.

Sending an interrupt like this is the usual way of exiting the
talk program. The talk session is over and you can return to your
work.

Connection refused 
==================

No connection could be made because the target machine actively
refused it. This happens either when trying to connect to an
inactive service or when a service process is not present at the
requested address.

Activate the service on the target machine, or start it up again
if it has disappeared. If for security reasons you do not intend
to provide this service, inform the user community, possibly
suggesting an alternative.

The symbolic name for this error is ECONNREFUSED, errno=146.

Connection timed out 
====================

This occurs either when the destination host is down or when
problems in the network cause lost transmission.

First check the operation of the host system, for example by
using ping(1M) and ftp (1), then repair or reboot as necessary.
If that doesn't solve the problem, check the network cabling and
connections.

No connection was established in a specified time. A connect or
send request failed because the destination host did not properly
respond after a reasonable interval. (The timeout period is
dependent on the communication protocol.)

The symbolic name for this error is ETIMEDOUT, errno=145.

console login: ^J^M^Q^K^K^P 
===========================

This usually occurs because OpenWindows exited abnormally,
leaving the system's keyboard in the wrong mode. The characters
that appear when someone attempts to login are garbage
transliterations of what someone types.

Find another machine and remote login to this system, then run
this command:

$ /usr/openwin/bin/kbd_mode -a

This puts the console back into ASCII mode. Note that kbd_mode is
not a windows program, it just fixes the console mode.

The usual reason for this problem occurring is an automated
script run from cron that clears out the /tmp directory every so
often. Ensure that any such scripts do not remove the /tmp/.X11-
pipe or /tmp/.X11-unix directories, or any files therein.

core dumped 
===========

A core file contains an image of memory at the point of software
failure, and is used by programmers to find the reason for the
failure.

To see which program produced a core file, run either the file(1)
command or the adb (1) command. The following examples show the
output of the file and adb commands on a core file from the
dtmail program.

$ file core core: ELF 32-bit MSB core file SPARC Version 1, from
`dtmail'

$ adb core core file = core -- program `dtmail' SIGSEGV 11:
segmentation violation ^D      (use Control-d to quit the
program)

Ask the vendor or author of this program for a debugged version.

Some signals, such as SIGQUIT, SIGBUS, and SIGSEGV, produce a
core dump. See the signal(5) man page for a complete list.

If youhave the source code for the program, you can try
compiling it with cc -g, and debugging it yourself using dbx or a
similar debugger. The where directive of dbx provides a stack
trace.

On mixed networks, it can be difficult to discern which machine
architecture produced a particular core dump, since adb on one
type of system generally cannot read a core file from another
type of system, and will produce an "unrecognized file" message.
Run adb on various machine architectures until you find the right
one.

The term "core" is archaic-- ferrite core memory was supplanted
by silicon RAM in the 1970s, although spaceships still employ
core memory for its imperviousness to radiation.

For information on saving and viewing crash information see the
System Administration Guide, Volume II. If you are using the
AnswerBook, "system crash" is a good search string.

Could not initialize tooltalk (tt_open): TT_ERR_NOMP 
====================================================

Various desktop tools display or print this message when the
ttsession(1) process is not available. The TookTalk service
generally tries to restart ttsession if it is not running. So
this error indicates that the ToolTalk service is either not
installed or is not installed correctly.

Verify that the ttsession command exists in /usr/openwin/bin or
/usr/dt/bin. If this command is not present, ToolTalk is not
installed correctly. The packages constituting ToolTalk are the
runtime SUNWtltk, developer support SUNWtltkd, and themanual
pages SUNWtltkm. CDE ToolTalk packages have the same names with
".2" appended.

The full TT_ERR_NOMP message string reads as follows: "No
ttsession is running, probably because tt_open() has not been
called yet. If this is returned from tt_open() it means ttsession
could not be started, which generally means ToolTalk is not
installed on the system."

Could not start new viewer 
==========================

This message appears in the AnswerBook navigator window, along
with an XView error messageon the console.

See the message "answerbook: XView error: NULL pointer passed to
xv_set" for details.

cpio: Bad magic number/header. 
==============================

A cpio(1) archive has either become corrupted or was written out
with an incompatible version of cpio.

Use the -k option to cpio to skip I/O errors and corrupted file
headers. This might permit you to extract other files from the
cpio archive. To extract files with corrupted headers, try
editing the archive with a binary editor such as emacs. Each cpio
file header contains a filename as a string.

For more information on magic numbers, see magic(4).

Cross-device link 
=================

An attempt was made to make a hard link to a file on another
device, such as on another file system.

Establish a symbolic link using ln -s instead. Symbolic links are
permitted across file system boundaries.

The symbolic name for this error is EXDEV, errno=18.

data access exception 
=====================

This message can result from running an old version of the
operating system that does not support new hardware, or by
running an operating system that is not configured for new
hardware. It can also result from incorrectly installed DSIMMs or
from a disk problem.

Upgrade your operating system to a version that supports the new
hardware or machine architecture. For example, upgrading a
SPARCstation 2 (with sun4c kernel architecture) to a SPARCstation
20 (with sun4m kernel architecture) requires an operating system
upgrade or reconfiguration.

For more information onupgrades, see the section describing
system and device configuration in the Solaris 1.x to Solaris 2.x
Transition Guide.

Data fault 
==========

This is a kind of bad trap that usually causes a system panic.
When this message appears after a bad trap message, a system text
or data access fault probably occurred.� In the absence of a bad
trap message, this message might indicate a user text or data
access fault. Data loss is possible if the problem occurs other
than at boot time.

Make sure the machine can reboot, then check the log file
/var/adm/messages for hints about what went wrong.

� See the message "BAD TRAP" for more information.

Deadlock situation detected/avoided 
===================================

A programming deadlock situation was detected and avoided.

If the system had not detected and avoided a deadlock, a piece of
software would have hung. Run the program again. The deadlock
might not reoccur.

This error usually relates to file and record locking, but can
also apply to mutexes, semaphores, condition variables, and
read/write locks.

The symbolic name for this error is EDEADLK, errno=45.

See the section on deadlock handling in the System Interface
Guide. See the section on avoiding deadlock in the Multithreaded
Programming Guide.

Device busy 
===========

An attempt was made to mount a device that was already mounted or
to unmount a device containing an active file (such as an open
file, a current directory, a mount point, or a running program).
This message also occurs when trying to enable accounting that is
already enabled.

To unmount a device containing active processes, close all the
files under that mount point, quit any programs started from
there, and change directories out of that hierarchy. Then try to
unmount again.

Mutexes, semaphores, condition variables, and read/write locks
set this error condition to indicate that a lock is held.

The symbolic name for this error is EBUSY, errno=16.

/dev/rdsk/variable: CAN'T CHECK FILE SYSTEM. 
============================================

The system cannot automatically clean (preen) this file system
because it appears to be set up incorrectly or is having hard
disk problems. This message asks that you run fsck(1M) manually,
since data corruption might already have occurred.

Run fsck to clean the file system in question. See the message
"/dev/rdsk/N:  UNEXPECTED INCONSISTENCY; RUN fsck MANUALLY" for
proper procedures.

/dev/rdsk/variable: UNEXPECTED INCONSISTENCY; RUN fsck MANUALLY. 
================================================================

At boot time the /etc/rcS script runs the fsck(1M) command to
check the integrity of file systems marked "fsck" in /etc/vfstab.
If fsck cannot repair a file system automatically, it interrupts
the boot procedure and produces this message. When fsck gets into
this state, it cannot repair a file system without losing one or
more files, so it wants to defer this responsibility to you, the
administrator. Data corruption has probably already occurred.

First run sack -n on the file system, to see how many and what
type of problems exist.  Then run fsck again to repair the
file system. If you have a recent backup of the file system, you
can generally answer "y" to all the fsck questions. It's a good
idea to keep a record of all problematic files and inode numbers
for later reference. To run fsck yourself, specify options as
recommended by the boot script. For example:

# fsck /dev/rdsk/c0t4d0s0

Usually the files lost during fsck repair are these that were
created just before a crash or power outage, and they cannot be
recovered. If you lose important files, you can recover them from
backup tapes.

If you don't have a backup, ask an expert to run fsck for you.

For more information on file checking, see the section on
checking file system integrity in the System Administration Guide,
Volume I.

Directory not empty 
===================

The directory operation that was attempted, such as directory
removal with rmdir, can be performed only on an empty directory.

To remove the directory, first remove all the files that it
contains. A quick way to remove a non-empty directory hierarchy
is with the rm -r command.

The symbolic name for this error is ENOTEMPTY, errno=93.

Disc quota exceeded 
===================

The user'sdisk limit has been exceeded on a user filesystem,
usually because a file was just created or enlarged beyond the
limit. This almost always refers to a magnetic disk, and not to
an optical disc. Any data created after this condition occurs
will be lost.

The user can delete files to bring disk usage under the limit, or
the server administrator can use the edquota(1M) command to
increase the user's disk limit.

The symbolic name for this error is EDQUOT, errno=49.

dumptm: Cannot open `/dev/rmt/variable': Device busy 
====================================================

During file system backup, the dump program cannot open the tape
drive because some other process is holding it open.

Find the process that has the tape drive open, and either kill(1)
the process or wait for it to finish.

# ps -ef | grep /dev/rmt # kill -9 processID

DUP/BAD I=i OWNER=o MODE=m SIZE=s MTIME=t FILE=f REMOVE? 
=========================================================

During phase 1, fsck(1M) found duplicate blocks or bad blocks
associated with the file or directory specified after FILE= whose
inode number appears after I= (with other information).

To remove this file or directory, answer yes. If you end up
removing more than a few files in this manner, data loss will
result, so it might be preferable to restore the filesystem from
backup tapes.

For more information on checking filesystems, see the section on
checking filesystem integrity in the System Administration Guide,
Volume I.

N DUP I=N 
=========

Upon detecting a block that is already claimed by another inode,
fsck(1M) prints the duplicate block number and its containing
inode (after I=).

In fsck phases 2 and 4, you will decide whether or not to clear
these bad blocks.  Before committing to repair with fsck, you
could determine which file contains this inode by passing the
inode number to the ncheck(1M) command:

# ncheck -iinum filesystem

For more information, see the chapter on checking filesystem
integrity in the System Administration Guide,Volume I. 
 
 
error: DPS has not initialized or server connection failed 
==========================================================

This message appears when trying to run AnswerBook with a generic
X11 window server or on a generic X terminal.

Running AnswerBook requires Display PostScript (DPS), or a NeWS
server, or the Adobe DPS NS remote display software. In addition,
a complete LaserWriterII Type-1 font set (including Palatino)
should be installed on the X server. To find out if your X server
has DPS, run xdpyinfo(1) to verify the presence of an "Adobe-
DPS-Extension" line. X servers without this line don't know about
DPS.

ERROR: missing file arg (cm3) 
=============================

An attempt was madd to run some sccs(1) operation that requires a
filename, such as create, edit, delget, or prt.

Supply the appropriate filename after the SCCS operation.

ERROR [SCCS/s.variable]: `SCCS/p.variable' nonexistent (ut4) 
============================================================

An attempt was made to sccs edit or sccs get a file that is not
yet under SCCS control.

Run sccs create on that file to place it under SCCS control.

ERROR [SCCS/s.variable]: writable `variable' exists (ge4) 
=========================================================

An attempt was made to sccs edit a file that is writable,
probably because it is already checked out.

Run sccs info to see who has the file checked out. If it is you,
go ahead and edit it. If it is somebody else, ask that personto
check in the file.

esp0: data transfer overrun 
===========================

When a user tries to mount a CDROM on a third-party CD drive,
mount(1M) fails with the above error, followed by the "sr0: SCSI
transport failed" message. The CD drive probably comes from a
vendor unknown to the system.

Third-party CD drives generally have an 8192 block size, as
opposed to the 512 block size on supported Sun drives. Check with
the vendor to see if any special configuration is possible to
allow the drive to operate on a Sun workstation.

Event not found 
===============

This C shell message indicates that a user tried to repeat a
command from the history list, but that command or number does
not exist in the list.

Run the C shell history command to display recent events in the
history list. If a user often tries to run commands that have
disappeared from the history list, make the list longer by
setting history to a higher value.

For more information about the C shell, see csh(1).

EXCESSIVE BAD BLKSI=N CONTINUE? 
==================================

During phase 1, fsck(1M) found more than 10 bad (out-of-range)
blocks associated with the specified inode number.

With this many bad blocks, it might be preferable to restore the
filesystem from backup tapes.

For more information on bad blocks, see the section on checking
filesystem integrity in the System Administration Guide, Volume
I. If you are using the AnswerBook, "bad blocks" is a good search
string.

EXCESSIVE DUP BLKS I=N CONTINUE? 
==================================

During phase 1, fsck(1M) found more than 10 duplicate (previously
claimed) blocks associated with the specified inode number.

With this many duplicate blocks, it might be preferable to
restore the filesystem from backup tapes.

For more informationon blocks, see the section on checking
filesystem integrity in the System Administration Guide, Volume
I. If you are using the AnswerBook, "bad blocks" is a good search
string.

Exec format error 
=================

This often happens when trying to runsoftware compiled for
different systems or architectures, such as when executing
Solaris 2.x programs on a SunOS 4.1.x system, or when trying to
execute SPARC-specific programs on an x86 machine. On a Solaris
2.x system, it can also occur if the BinaryCompatibility Package
was not installed.

Make sure that the software matches the architecture and system
you're using. The file(1) command can help you determine the
target architecture. If you're using SunOS 4.1.x softwareon a
Solaris 2.x system, make sure that the Binary Compatibility
Package is installed. You can check for it using this command:

$ pkginfo | grep SUNWbcp

A request was made to execute a file that, although it has the
appropriate permissions, does not start with a valid format.

The symbolic name for this error is ENOEXEC, errno=8.

See the a.out(4) man page for a description of executable files.

fd0: unformatted diskette or no diskette in the drive 
=====================================================

This message appears on the system console to indicate that the
floppy driver fd(7) could not read the label on a diskette.
Usually this is either because a new diskette has not yet been
formatted, or a formatted diskette has become corrupted. This
message often appears along with "read failed" and "bad format"
messages after volcheck(1) is run.

If you are certain that the diskette contains no data, run
fdformat -d to format the diskette in DOS format. (You can also
format a diskette in UFS format if you like, although then it is
not transportable to most other systems.) When the diskette is
formatted, you can write on it, if it was not corrupted beyond
repair.

File exists 
===========

The name of an existing file was mentioned in an inappropriate
context. For example,it is not allowed to establish a link to an
existing file, or to overwrite an existing file when the csh(1)
noclobber option is set.

Look at the names of files in the directory, then try again with
a different name or after renaming or removing the existing file.

The symbolic name for this error is EEXIST, errno=17.

File locking deadlock 
=====================

This is a programming problem, in some cases unavoidable.

All a user can do is restart the program and hope deadlock does
not reoccur.

Inthe file locking subsystem, two processes tried to modify some
lock at the same time. In the multithreading subsystem, two
threads became deadlocked and could not continue. When a program
using the threads library encounters this error, it should
restart the deadlocked threads.

The symbolic name for this error is EDEADLOCK, errno=56.

filemgr: mknod: Permission denied 
=================================

File Manager issues this message and fails to come up whenever
the /tmp/.removable directory is owned by another user and is not
1777 mode. This can happen, for example, when multiple users
share a workstation.

Have the original owner change the mode ((chmod(1)) of this file
back to 1777, its default creation mode. Rebooting the
workstation also resolves this problem.

This is a known problem that was fixed in Solaris 2.4.

File name too long 
==================

The specified file name has too many characters.

If a file name or path name component is too long, devise a
shorter name. If the totalpath name is longer than PATH_MAX
characters, first change to an intermediate directory, then
specify a shorter path name. Newly-created data will be lost
unless written to another file with a shorter name.

In a UFS or NFS-mounted UFS filesystem, the length of a path name
component exceeds MAXNAMLEN (255) characters, or the total length
of the path name exceeds PATH_MAX (1024) characters. In a System
V filesystem, the length of a path name component exceeds
NAME_MAX (14) characters while no-truncation mode is in effect.
These values are defined in the /usr/include/limits.h(4) file.

The symbolic name for this error is ENAMETOOLONG, errno=78.

FILE SYSTEM STATE IN SUPERBLOCK IS WRONG; FIX? 
==============================================

The fsck(1M) command has just checked a filesystem, and has
determined that the filesystem is clean. The filesystem's
superblock, however, still thinks the filesystem is "dirty" in
some way.

If you believe that the filesystem is adequately repaired, answer
yes to mark the filesystem as clean.

Different "dirty" filesystem types are listed in
/usr/include/sys/fs/ufs_fs.h, and include FSACTIVE, FSBAD, FSFIX,
FSLOG, and FSSUSPEND.

For more information on superblocks, see the section onchecking
filesystem integrity in the System Administration Guide, Volume
I. If you are using the AnswerBook, "bad superblock" is a good
search string.

File table overflow 
===================

The kernel file table is full because too many files are open on
the system.  Temporarily, no more files can be opened. New data
created under this condition will probably be lost.

Simply waiting often gives the system time to close files.
However, if this message occurs often, reconfigure the kernel to
allow more open files. To increasethe size of the file table in
Solaris 2.x, increase the value of maxusers in the /etc/system
file.  The default maxusers value is the amount of main memory in
MB, minus 2.

The symbolic name for this error is ENFILE, errno=23.

File too large 
==============

The file size exceeded the limit specified by ulimit(1), or the
file size exceeds the maximum supported by the file system. New
data created under this condition will probably be lost.

In the C shell, use the limit command to see or set the default
file size. In the Bourne or Korn shells, use the ulimit -a
command. Even when the shells claim that the file size is
unlimited, in fact the system limit is FCHR_MAX (usually 1
gigabyte).

The symbolic name for this error is EFBIG, errno=27.

FREE BLK COUNT(S) WRONG IN SUPERBLK SALVAGE? 
=============================================

During phase 5, fsck(1M) detected that the actual number of free
blocks in the filesystem did not match the superblock's free
block count.The df(1M) command accesses this free block count
when measuring filesystem capacity.

Generally you can answer yes to this question without harming the
filesystem.

For more information on superblocks, see the section on checking
filesystem integrity in the System Administration Guide, Volume
I. If you are using the AnswerBook, "bad superblock" is a good
search string.

fsck: Can't open /dev/dsk/variable 
==================================

The fsck(1M) command cannot open the disk device, because
although a similar filesystem exists, the partition specified
does not.

Run the mount(1M) or the format(1M) command to see what
filesystems are configured on the machine. Then run fsck again on
an existing partition.

fsck: Can't stat /dev/dsk/variable 
==================================

The fsck(1M) command cannot open the disk device, because the
specified filesystem does not exist.

Run the mount(1M) or the format(1M) command to see what
filesystems are configured on the machine. Then run fsck again on
an existing filesystem.

giving up 
=========

This message appears in the SCSI log to indicate that a read or
write operation has been retried until it timed out. With SCSI
disk the timeout period is usually 30 seconds; with tape the
period is usually 20 attempts. Timeout periods are generally
coded into the drivers.

Check that all SCSI devices are connected and powered on. Make
sure that SCSI target numbers are correct and not in conflict.
Verify that all cables are no longer than six meters, total, and
that all SCSI connections are properly terminated.

The scsi_log(9F) routine usually displays messages on the system
console and in the /var/adm/messages file. Run the dmesg(1M)
command to see the most recent message buffer.

Graphics Adapterdevice /dev/fb is of unknown type 
==================================================

The /dev/fb driver is either missing or corrupted.

See "InitOutput: Error loading module for /dev/fb" for details.

group.org_dir: NIS+ servers unreachable 
=======================================

This is the second of three messages that an NIS+ client prints
when it cannot locate an NIS+ server on the network.

See the message "hosts.org_dir: NIS+ servers unreachable" for
details.

/home/variable: No such file ordirectory 
=========================================

An attempt was made to change to a user's home directory, but
either that user does not exist or the user's fileserver has not
shared (exported) that filesystem.

To check on the existence of a particular user, run the
ypmatch(1) or nismatch(1) command, specifying the user name and
then the passwd map.

To export filesystems from the remote fileserver, become
superuser on that system and run the share(1M) command with the
appropriate options. If that system is sharing (exporting)
filesystems for the first time, also invoke
/etc/init.d/nfs.server start to begin NFS service.

For more information on sharing filesystems, see the
share_nfs(1M) man page.

Host is down 
============

A transport connection failed because the destination host was
down. For example, mail delivery was attempted over several days,
but the destination machine was not available during any of these
attempts.

Report this error to the system administrator for the host. If
you are the person responsible for this system, check to see if
the machine needs repair or rebooting.

This error results from status information delivered by the
underlying communication interface. If there is no known
connection to the host, a different message usually results. See
"No route to host" for details.

The symbolic name for this error is EHOSTDOWN, errno=147.

host name configuration error 
=============================

This is an old sendmail message, which replaced "I refuse to talk
tomyself" and is now replaced by the "Local configuration error"
message.

See the message "554 variable... Local configuration error" for
details.

hosts.org_dir: NIS+ servers unreachable 
=======================================

This is the third of three messages that an NIS+ client prints
when it cannot locate an NIS+ server on the network.

If other NIS+ clients are behaving normally, check the Ethernet
cabling on the workstation showing this message. On SPARC
machines, disconnected network cablingalso produces a series of
"no carrier" messages. On x86 machines, the NIS+ messages might
be your only indication that network cabling is disconnected.

If many NIS+ clients on the network are giving this message, go
to the NIS+ server in question and reboot or repair it, as
necessary. When the server machine is back in operation, NIS+
clients will give an "NIS server for domain OK" message.

I can't read your attachments. What mailer are you using? 
=========================================================

The SunView mailtool andpre-3.3 OpenWindows mailtool produce
this message when they cannot cope with an attachment. The
attachment is probably in MIME (Multipurpose Internet Mail
Extensions) format, using base64 encoding.

To read a mail message containing MIME attachments, use
mailtool(1) from Solaris 2.3 or later. If you are running an
earlier version of Solaris, rlogin(1) to a later version of
Solaris, set the DISPLAY environment variable back to the first
system, and run mailtool remotely. If those options prove
impossible, ask the originator to send the message again using
mailtool, or using the CDE dtmail compose File->SendAs-
>SunMailTool option.

Standard MIME attachments with base64 encoding, for example,
produce this message and fail to display in older mailtools.

Look into using metamail, available on the Internet, which allows
you to send and receive MIME attachments.

ie0: Ethernet jammed 
====================

This message can appear on SPARCservers or x86 machines with an
Intel 82586 Ethernet chip. It indicates that 16 successive
transmission attempts failed, causing the driver to give up on
the current packet.

If this error occurs sporadically or at busy times, it probably
means that the network is saturated. Wait for network traffic to
clear. If bottlenecks arise frequently, think about reconfiguring
the network or adding subnets.

Another possible cause of this message is a noise source
somewhere in the network, such as a loose transceiver connection.
Use snoop(1M)or a similar program to isolate the problem area,
then check and tighten network connectors as necessary.

ie0: no carrier 
===============

This message can appear on SPARCservers or x86 machines with an
Intel 82586 Ethernet chip. It indicates that thechip has lost
input to its carrierdetect pin while trying to transmit a
packet, causing the packet to be dropped.

Check that the Ethernet connector is not loose or disconnected.
Other possible causes include an open circuit somewhere in the
network and noise on the carrier detect linefrom the
transceiver. Use snoop(1M) or a similar program to isolate the
problem area, then check the network connectors and transceivers,
as needed.

Illegal Instruction 
===================

A process has received a signal indicating that it attempted to
execute an instruction that is not allowed by the kernel. This
usually results from running programs compiled for a slightly
different machine architecture. This message is usually
accompanied by a core dump, excepton read-only filesystems.

If you are booting from CDROM or from the net, check README files
to make sure you are using an image appropriate for your machine
architecture. Run df to make sure there is enough swap space on
the system; too little swap space can cause this error. If you
recently upgraded your CPU to a new architecture, replace your
operating system with one that supports the new architecture (an
operating system upgrade might be required).

Sometimes this condition results from programming error, such as
when a program attempts to execute data as instructions. This
condition can also indicate device file corruption on your
system.

Illegal instruction "0xN" was encountered at PC 0xN 
===================================================

The machine is trying to boot from a non-boot device, or from a
boot device for a different hardware architecture.

If you are booting from the net, check README files to make sure
you are using a boot image for that architecture. If you are
booting from disk, make sure the system is looking at the right
disk, which is usually SCSI target 3. Failing these solutions,
connect a CD drive to the system and boot from CDROM.

Illegal seek 
============

Using a pipe ("|") on the command line doesn't work here.

Rather than using a pipe on the command line, redirect the output
of the first program into a file and then run the second program
on that file.

A call to lseek(2) was issued to a pipe. This error condition can
also be fixed by altering the program to avoid using lseek().

The symbolic name for this error is ESPIPE, errno=29.

Image Tool: Unable to open XIL Library. 
=======================================

This message follows multiple multi-line "XilDefaultErrorFunc"
errors, indicating that ImageTool could not locate the X Imaging
Library. Many OpenWindows and CDE deskset programs require XIL.

Run pkginfo(1) to determine what packages are installed on the
system. If the following packages are not present, install them
from CDROM or over thenet:  SUNWxildg, SUNWxiler, SUNWxilow, and
SUNWxilrt.

Inappropriate ioctl for device 
==============================

This is a programming error.

Ask the program's author to fix this condition. The program needs
to be changed so it employs a device driver that can accept
special character device controls.

The ioctl() system call was given as an argument for a file that
is not a special character device. This message replaces the
traditional but puzzling "Not a typewriter" message.

The symbolic name for this error is ENOTTY, errno=25.

INCORRECT BLOCK COUNT I=N (should be N) CORRECT? 
=================================================

During phase 1, fsck(1M) determined that the specified inode
pointed to a number of bad or duplicate blocks, sothe block
count should be corrected to the actual number shown.

Generally you can answer yes to this question without harming the
filesystem.

For more information on bad blocks, see the section on checking
filesystem integrity in the System Administration Guide, Volume
I.

inetd[N]: execv /usr/sbin/in.uucpd: No such file or directory 
=============================================================

This message indicates that the Internet services daemon
inetd(1M) tried to start up the UUCP service without the UUCP
daemon existing on the system.

The SUNWbnuu package must be installed before the machine can run
UUCP. Run pkgadd(1M) to install this package from the
distribution CDROM or over the network.

inetd[N]: variable/tcp: unknown service 
=======================================

This message indicates that the Internet services daemon
inetd(1M) could not locate the TCP service specified after the
first colon.

Check the current machine's /etc/services file, and the NIS
services map, to see if the service is described. To start this
service, add an appropriate entry into the /etc/services file and
possibly the services map as well. Note that NIS+ does not
consult the local /etc/services file unless you put "files" right
after "nisplus" on the services line of the system's
/etc/nsswitch.conf file.

If you do not want to start this service, edit the system's
/etc/inetd.conf file and delete the entry that tries to start it
up.

For more information about NIS+, see the NIS+ and FNS
Administration Guide.

inetd[N]: variable/udp:unknown service 
=======================================

This message indicates that the Internet services daemon
inetd(1M) could not locate the UDP service specified after the
first colon.

See the message "inetd[N]: variable/tcp: unknown service" fora
solution.

inetd: Too many open files 
==========================

This message can appear when someone runs a command from the
shell or uses a third-party application. The sar(1M) command does
not indicate that the system-wide open file limit has been
exceeded.

The probable cause for this is that the shell limit has been
exceeded. The default open file limit is 64, but can be raised to
256.

See the message "Too many open files" for a solution.

INIT: Cannot create /var/adm/utmp or /var/adm/utmpx 
===================================================

This console message indicates that init(1M) cannot write in the
/var directory, which is usually part of the / (root) filesystem.
Some other messages follow, andthe system usually comes up
single-user. The problem is often that / or /var is mounted
read-only. Sometimes a brief power outage leaves the system
believing that many filesystems are still mounted.

If /var is a separate filesystem on the machine, andis not yet
not mounted, mount it now. If the filesystem containing /var is
mounted read-only, remount it read-write with a command similar
to this:

# mount -o rw,remount /

Then type Control-d and try to bring up the system multi-user. If
that fails, the root filesystem is probably corrupted.  Run
fsck(1M) on the root filesystem, halt the machine, power cycle
the CPU, and wait for the system to reboot. Should this problem
still occur, restore the root filesystem from backup tapes, or
re-install the system from net or CDROM to replace the root
filesystem.

InitOutput: Error loading module for /dev/fb 
============================================

This fatal X server error message indicates that /dev/fb, the
"dumb frame buffer," is either missing or corrupted. It is
usually followed by a "giving up" message and a few xinit errors.

If other devices on the system are working correctly, the most
likely reason for this error is that the SUNWdfb package was
removed or never installed. Insert the installation CD-ROM,
change to the Solaris_2.xdirectory, and run the following
command to install the packages SUNWdfbh and SUNWdfb (for your
machine architecture):

pkgadd -d .

If other devices on the system are not working correctly, the
system might havea corrupt /devices directory. Halt the system
and boot using the -r (reconfigure) option.  The system will run
fsck(1M) if the /devices filesystem is corrupted, most likely
fixing the problem.

Interrupted system call 
=======================

The user issued an interrupt signal (usually Control-c) while the
system was in the middle of executing a system call. When network
service is slow, interrupting cd(1) to a remote-mounted directory
can produce this message.

Proceed with your work, this message is purely informational.

An asynchronoussignal (such as interrupt or quit), which a
program was set up to catch, occurred during an internal system
call. If execution is resumed after processing the signal, it
will appear as if the interrupted programming function returned
this error condition, so the program might exit with an incorrect
error message.

The symbolic name for this error is EINTR, errno=4.

Invalid argument 
================

An invalid parameter was specified that the system cannot
interpret. For example, trying to mount an uncreated filesystem,
printing without sufficient system support, or providing an
undefined signal to a signal(3c) library function, can all
produce this message.

If you see this message when you are trying to mount a
filesystem, make sure that you have run newfs(1M) to create the
filesystem. If you see this message when you are trying to read a
diskette, make sure that the diskette was properly formatted with
fdformat(1), either in DOS format (pcfs) or as a UFS filesystem.
If you see this message while you are trying to print, make sure
that the print service is configured correctly.

The symbolic name for this error is EINVAL, errno=22.

Invalid null command 
====================

This C shell message results from a command line with two pipes
(|)in a row or from a pipe without a command afterwards.

Change the command line so that each pipe is followed by a
command.

I/O error 
=========

Some physical Input/Output error has occurred. If the process was
writing a file, data corruption is possible.

First find out which device is experiencing the I/O error. If the
device is a tape drive, make sure a tape is inserted into the
drive. When this error occurs with a tape in the drive, it is
likely that the tape contains an unrecoverable bad spot.

If the device is a floppy drive, an unformatted or defective
diskette could be at fault.  Format the diskette, or obtain a
replacement.

If the device is a hard disk drive, you might need to run
fsck(1M) and possibly even reformat the disk.

In some cases this error might occur on a call following the one
to which it actually applies.

The symbolic name for this error is EIO, errno=5.

Is a directory 
==============

An attempt was made to read or write a directory as if it were a
file.

Look at a listing of all the files in the current directory and
try again, specifying a file instead of a directory.

The symbolic name for this error is EISDIR, errno=21.

kernel read error 
=================

This message appears when savecore(1M), if activated, tries to
copy a debugging image of kernel memory to disk but cannot read
various kernel data structures correctly. Generally this occurs
after a system panic has corrupted main memory.  Data corruption
on the systemis possible.

Look at the kernel error messages that preceded this one to try
to determine the cause of the problem. Error messages such as
"BAD TRAP" usually indicate faulty hardware. Until the problem
that caused the kernel panic is resolved, a kernel core image
cannot be saved for debugging.

Killed 
======

This message is purely informational. If the killed process was
writing a file, some data might be lost.

Continue with your work.

This message from the signal handler or various shells indicates
that a process has been terminated with a SIGKILL. However, if
you don't see this message and cannot terminate a process with a
SIGKILL, you might have to reboot the machine to get rid of that
process.

kmem_free block already free 
============================

This is a programming error,probably from a device driver.

Determine which driver is giving this message and contact the
vendor for a software update, as this message indicates a bug in
the driver.

This message is from the DDI programming function kmem_free(9F),
which releases a block of memory at address addr of size siz that
was previously allocated by the DDI function kmem_alloc(9F). Both
addr and siz must correspond to the original allocation. If you
have source code for the driver, follow kmem_alloc() and
kmem_free() in the code to make sure they allocate and free the
same chunk of memory.

  
last message repeated N times 
=============================

This message comes from syslog(1M), the facility that prints
messages on the console and records them in /var/adm/messages. To
reduce the log size and minimize buffer usage, syslog collapses
any identical messages it sees during a 20 second period, then
prints this message with the number of repetitions.

Look above this message to see which message was repeated so
often. Then consider the repeated message and take action
accordingly. If repeated log entries such as "su ...  failed"
appear, consider the possibility of a security breach.

ld.so.1: variable: fatal: relocation error: symbol not found: 
 variable 
This message from the run-time linker ld.so.1 indicates that in
trying to execute the application given after the first colon,
the specified symbol could not be found for relocation. The
message goes on to say in what file the symbol was referenced.
Since this is a fatal error, the application terminates with this
message.

Run the ldd -d command on the application to show its shared
object dependencies and symbols that aren't found. Probably your
system contains an old version of the shared object that should
contain this symbol. Contact the library vendor or author for an
update.

This error does not necessarily occur when you first bring up an
application. It could take months to develop, if ordinary use of
the application seldom references the undefined symbol.

ld.so.1: variable: fatal: variable: can't open file: errno=2 
============================================================

This message indicates that the run-time linker, ld.so.1, while
running the program specified after the first colon, could not
find the shared object specified after the third colon. (A shared
object is sometimes called a dynamically linked library.) Error
number 2 translates to "No such file or directory" (ENOENT).

As a workaround, set the environment variable LD_LIBRARY_PATH to
include the location of the shared object in question, for
example:

/usr/dt/lib:/usr/openwin/lib

Better yet, if you have accessto source code, recompile the
program using the -Rpath loader option. Using LD_LIBRARY_PATH is
discouraged because it slows down performance.

le0: Memory error! 
==================

This message indicates that the network interface encountered an
access time-out from the CPU's main memory. There is probably
nothing wrong except system overload.

If the system is busy with other processes, this error can occur
frequently. If possible, try to reduce the system load by
quitting applications or killing some processes.

The Lance Ethernet chip timed out while trying to acquire the bus
for a DVMA transfer. Most network applications wait for a
transfer to occur, so generally no data gets lost. However, data
transfer might fail after too many time-outs.

For more information about the Lance Ethernet chip, see the
le(7D) man page.

le0: No carrier-- cable disconnected or hub link test disabled? 
===============================================================

Standalone machines with no Ethernet port connection get this
error when the system triesto access the network. If the
Ethernet cable is disconnected, SPARC machines with the sun4m
architecture usually display this message, whereas machines with
the sun4c architecture usually display the "le0: No carrier--
transceiver cable problem" message instead. If the Ethernet cable
is connected, this message could result from a mismatch between
the machine's NVRAM settings and the Ethernet hub settings.

If this message is continuous, try to save any workto local
disk.

When a machine is configured as a networked system, it must be
plugged into the Ethernet with a twisted pair J45 connector.

If the Ethernet cable is plugged in, find out whether or not the
Ethernet hub does a Link Integrity Test. Then become superuser to
check and possibly set the machine's NVRAM. If the hub's Link
Integrity Test is disabled, set this variable to false.

# eeprom | grep tpe tpe-link-test?=true # eeprom 'tpe-link-
test?=false'

The default setting is true. If for some reason tpe-link-test?
was set to false,and the hub's Link Integrity Test is enabled,
set this variable to true.

le0: No carrier-- transceiver cable problem? 
============================================

Standalone machines with no Ethernet port connection get this
error when the system tries to access the network.

If this message is continuous, try to save any work to local
disk.

When a machine is configured as a networked system, it must be
plugged into the Ethernet with either a twisted pair J45
connector or thicknet 10Base-T connector (depending on the
building's Ethernet cable type).

Older workstations have a thicknet connection on the back instead
of a twisted pair Ethernet connection, so they require a thicknet
to twisted pair transceiver to translate between cabling types.

LINK COUNT FILE I=i OWNER=o MODE=m SIZE=s MTIME=t COUNT... ADJUST? 
===================================================================

During phase 4, fsck(1M) determined that the inode's link count
for the specified file is wrong, and asks if you want to adjust
it to the value given.

Generally you can answer yes to this question without harming the
filesystem.

For more information on fsck, see the section on checking
filesystem integrity in the SystemAdministration Guide, Volume
I.

LL105W: Protocol error detected. 
================================

This error message comes from Lifeline Mail, an unbundled PC
compatibility application.

The likeliest cause for this problem is that someone set up a
user account without a password. Assign the user a password to
solve this problem.

ln: cannot create /dev/fb: Read-only file system 
================================================

During device reconfiguration at boot time, the system cannot
link to the frame buffer because /dev is on a read-only
filesystem.

Check that /dev/fb is a symbolic link to the hardware frame
buffer, such as cgsix or tcx. Ensure that the filesystem
containing /dev is mounted read-write.

lockd[N]: create_client: no name forinet address 0xN 
=====================================================

This lock daemon message usually indicates that the NIS
hosts.byname and hosts.byaddr maps are not coordinated.

Wait a short time for the maps to synchronize. If they don't,
takesteps to coordinate them.

For information on updating NIS data, see the section on NIS maps
in the NIS+ and FNS Administration Guide. If you are using the
AnswerBook, "hosts.byaddr" is a good search string.

Login incorrect 
===============

This message from the login(1) program indicates an incorrect
combination of login name and password. There is no way to tell
whether what's wrong is the login name, the password, or both.
Other programs such as ftp(1), rexecd(1M), sulogin(1M), and
uucp(1C) alsogive this error under similar conditions.

Check the /etc/passwd file and the NIS or NIS+ passwd map on the
local system to see if an entry exists for this user. If a user
has simply forgotten the password, su and set a new one with the
passwd usernamecommand. This command automatically updates the
NIS+ passwd map, but with NIS you'll need to coordinate the
update with the passwd map.

The "Login incorrect" problem can also occur with older versions
of NIS when the user name has more than eight characters. If this
is the case, edit the NIS password file, change the user name to
have eight or fewer characters, and then remake the NIS passwd
map.

If you cannot log in to the system as root, despite knowing the
proper password, it is possible that the /etc/passwd file is
corrupted. Try to log in as a regular user and su to root.

If that doesn't work, see the message "su: No shell" and follow
most of the instructions given there. Instead of changing the
default shell however, make the password field blank in
/etc/shadow.

lp hang 
=======

On a print server, the queue continues to grow but nothing comes
out of the printer.  The printer daemon is hung.

Here is a simple procedure for flushing a hung printing queue:

 1. Login or switch user to root.
 2. Issue the reject printername command to make sure no one
sends any job to the
   printer.
 3. Turn off power to the printer.
 4. If the active job appears to be causing the hang, remove it
from the print queue
   with the cancel jobnumber command, and ask the owner to
requeue that print
   job.
 5. Shut down the print queue with the /usr/lib/lpshut command.
 6. Remove the lock file /var/spool/lp/SCHEDLOCK and the
temporary files
   /var/spool/lp/tmp/*/*.
 7. Turn the printer back on.
 8. Restart the print queue with the /usr/lib/lpsched command.

For more information on print queuing, see the System
Administration Guide, Volume II. If you are using the AnswerBook,
"print server" is a good search string.

mailtool: Can't create dead letter: Permission denied 
=====================================================

An attempt was made to send a message with mailtool(1) from a
directory where the user does not have write permission, and the
user's home directory is currently unavailable.

Change to another directory and start mailtool again, or use
chmod(1) to change permissions for the directory (if possible).

mailtool: Could not initialize the Classing Engine 
==================================================

When a user runs mailtool(1) on a remote machine, setting the
DISPLAY environment back to the local machine, this message might
appear inside a dialog box window. The dialog box goes on to say
that the Classing Engine must be installed to use Attachments.
This problem occurs because rlogin(1) does not propagate the
user's environment.

Exit mailtool and set your OPENWINHOME environment variable to
/usr/openwin.  Then run mailtool again. The error message will
not appear, and you will be able to use Attachments.

Classing Engine is a new name for Tool Talk. Earlier versions of
mailtool said "Tool Talk: TT_ERR_NOMP" instead of Classing
Engine.

Mail Tool is confused about the state of your Mail File. 
========================================================

This message appears in a pop-up dialog box whenever you ask
mailtool(1) to access messages after another mail reader has
modified your inbox. A request follows:  "Please Quit this Mail
Tool."

Click "Continue" to close the dialog box, then exit mailtool. If
you continue trying to read mail, messages deleted by the other
mail reader will never appear, and mailtool will fail to see any
new messages.

mail: Your mailfile was found to be corrupted (Content-length mismatch). 
=======================================================================

This message comes from mail(1) or mailx(1) whenever it detects
messages with a different content length than advertised. The
mail program tells you which message might be truncated or might
have another message concatenated to it.

Two common causes of content length mismatches are the
simultaneous use of different mail readers (such as mail and
mailtool), or using a mail reading program (or an editor) that
does not update the Content-Length field after altering a
message.

The mailx program can usually recover from this error and
delineate mail message boundaries correctly. Pay close attention
to the message that might be truncated or combined with another
message, and to all messages after that one. If a mail file
becomes hopelessly corrupted, run it through a text editor to
eliminate all Content-Length lines, and ensure that each message
has a From (no colon) line for each message, preceded by a blank
line.

To avoid mailfile corruption, exit from mailtool without saving
changes when you are currently running mail or mailx.

Memory address alignment 
========================

This message can occur when printing large files on a
SPARCprinter attached to a SPARCstation 2.

Replace the SPARCstation 2 CPU with one that isat the most
recent dash level.

memory leaks 
============

An application uses up more and more memory, until all swap space
is exhausted.

Many developers have found that third party software (such as
Purify) can help identify memory leaks in their applications. If
you suspect that you have a memory leak, you can use sar(1) to
check on the Kernel Memory Allocation (KMA). Any driver or module
that uses KMA resources, but does not specifically return the
resources before it exits, can create a memory leak.

For more information on memory leaks, see the section on
monitoring system activity in the System Administration Guide,
Volume II. If you are using the AnswerBook, "displaying disk
usage" is a good search string. Also, see the section on system
resource problems in the NIS+ and FNS Administration Guide.

mount: /dev/dsk/variable is already mounted, /variable is busy, or... 
=====================================================================

While trying to mount a filesystem, the mount(1M) command
received a "Device busy" (EBUSY) error code.There are several
possible reasons: this /dev/dsk filesystem is already mounted on
a different directory, the busy path name is the working
directory of an active process, or the system has exceeded its
maximum number of mount points (unlikely).

Run /etc/mount to see if the filesystem is already mounted. If
not, check to see if any shells are active in the busy directory
(did the user cd into the directory?), or if any processes in the
ps(1) listing are active in that directory. If the reason for the
error message isn't obvious, try using a different directory for
the mount point.

mount: giving up on: /variable 
==============================

An existing server did not respond to an NFS mount request, so
after retrying a number of times (default1000), the mount(1M)
command has given up. Nonexistent servers or bad mount points
produce different messages.

If the "RPC: Program not registered" message precedes this one,
the requested mount serverprobably did not share (export) any
filesystems, so it has no NFS daemons running. Have the superuser
on the mount server share(1M) the filesystem, then run
/etc/init.d/nfs.server start to begin NFS service.

If the requested mount server is down or slow to respond, check
to see whether the machine needs repair or rebooting.

mount: mount-point /variable does not exist. 
============================================

Someone tried to mount a filesystem onto the specified directory,
but there is no suchdirectory.

If this is the directory name you want,run mkdir(1) to create
this directory as a mount point.

mount: the state of /dev/dsk/variable is not okay 
=================================================

The system was unable to mount the filesystem that was specified
because the super-block indicates that the filesystem might be
corrupted. This is not an impediment for read-only mounts.

If you don't need to write on this filesystem, mount(1M) it using
the -o ro option.  Otherwise, do as one of the message
continuation lines suggests and run fsck(1M) to correct the
filesystem state and update the super-block.

For more information on using fsck, see the section on checking
filesystem integrity in the System Administration Guide, Volume
I.

/net/variable: No such file or directory 
========================================

A user tried to change directory (for example with cd) to a
network partition on the system specified after /net/, but this
host either does not exist or has not shared (exported) any
filesystem.

To gain access to files on this system, try rlogin(1).

To export filesystems from the remote system, become superuser on
that system and run the share(1M) command with the appropriate
options. If that system is sharing filesystems for the first
time, also run /etc/init.d/nfs.server start to begin NFS service.

Network is down 
===============

A transport connection failed because it encountered a dead
network.

Report this error to the system administrator for the network. If
you are the person responsible for this network, check to see why
the network is dead and what repairs are necessary.

This error results from status information delivered by the
underlying communication interface.

The symbolic name for this error is ENETDOWN, errno=127.

Network is unreachable 
======================

An operational error occurred either because there was no route
to the network or because negative status information was
returned by intermediate gateways or switching nodes.

The returned status is not always sufficient to distinguish
between a network that is down and a host that is down. See the
"No route to host" message.

Check the network routers and switches to see if they are
disallowing these packet transfers. If they areallowing all
packet transfers, check network cablingand connections.

The symbolic name for this error is ENETUNREACH, errno=128.

NFS getattr failed for server variable: RPC: Timed out 
======================================================

This message appears on an NFS client that requested a service
from an NFS server whose hardware is failing. Often the message
"NFS read failed" appears along with this message. If the server
were merely down or slow to respond, the "NFS server not
responding" message would appear instead. Data corruption on the
server system is possible.

Because this message usually indicates server hardware failure,
initiate repair procedures as soon as possible. Check the memory
modules, disk controllers, and CPU board.

For more information on NFS tuning, see chapter on monitoring
network performance in the System Administration Guide, Volume
II.

nfs mount: Couldn't bind to reserved port 
=========================================

This message appears when a client attempts to NFS mount a
filesystem from a server that has more than one Ethernet
interface configured on the same physical subnet.

Always connect multiple Ethernet interfaces on one router system
to different physical subnetworks.

nfs mount: mount: variable: Device busy 
=======================================

This message appears when the superuser attempts to NFS mount on
top of an active directory. The busy device is actually the
working directory of a process.

Determine which shell on the workstation is currently located
below the mount point, and change out of that directory. Be wary
of subshells (such as su shells) that could be in different
working directories while the parents remain below the mount
point.

NFS mount: /variable mounted OK 
===============================

While booting, the system failed to mount the directory specified
after the first colon, probably because the NFS server involved
was down or slow to respond. The mount ran in the background and
successfully contacted the NFS server.

This is a purely informative message to let you know that the
mount process has completed.

NFS read failed for server variable 
===================================

This is generally a permissions problem. Perhaps a directory or
file permission was changed while the client held the file open.
Perhaps the filesystem's share or netgroup permissions changed.
If the server were down or the network saturated, the "NFS server
not responding" message would appear instead.

Log in to the NFS server and check the permissions of directories
leading to the file.  Make certain that the filesystem is shared
with (exported to) the client experiencing an NFS read failure.

For more information, see the chapter on NFS troubleshooting in
the NFS Administration Guide.

nfs_server: bad getargs for N/N 
===============================

This message comes from the NFS server when it gets a request
with unrecognized or incorrect arguments. Typically, it means the
request could not be XDR decoded properly. This can result from
corruption of the packet over the network, or from an
implementation bug causing the NFS client to improperly encode
its arguments.

If this message originates from a single client, investigate that
machine for NFS client software bugs. If this message appears all
over a network, especially accompanied by other networking
errors, investigate the network cabling and connectors.

NFS server variable not responding still trying 
===============================================

In mostcases this very common message indicates that the system
has requested a service from an NFS server that is either down or
extremely slow to respond. In some cases this message indicates
that the network link to this NFS server is broken, although
usually that condition generates other error messages as well. In
a few cases this message indicates NFS client set-up problems.

Check the non-responding NFS server to see whether the machine
needs repair or rebooting. Encourage your user community to
report such problems quickly but only once.

Should this message appear when booting a diskless client, make
sure that the client's /etc/hosts file and the network naming
service (NIS, NIS+, or other /etc/hosts files on the network)
have been updated.

Formore information, see the chapter on NFS troubleshooting in
the NFS Administration Guide.

NFS server variable ok 
======================

This message is the follow-up to the "NFS server not responding"
error. It indicates that the NFS server is back in operation.

When an NFS server first comes up, it will be busy fulfilling
client requests for a while. Be patient and wait for your client
system to respond. Making many extraneous requests only further
slows the NFS server response time.

nfs umount:variable: is busy 
=============================

This message appears when the superuser attempts to unmount an
active NFS filesystem. The busy point is the working directory of
a process.

Determine which shell (or process) on the workstation is
currently located in the remotely mounted filesystem, and change
(cd) out of that directory. Be wary of subshells (such as su
shells) that could be in different directories while the parent
shells remain in the NFS filesystem.

NFS write error on host variable: No space left on device. 
==========================================================

This console message indicates that an NFS-mounted partition has
filled up and cannot accept writing of new data. Unfortunately,
software that attempts to overwriteexisting files will usually
zero out all data in these files. This is particularly
destructive on NFS-mounted /home partitions.

Find the user or process that is filling up the filesystem, and
get the out-of-control process stopped as soon as you can. Then
delete files as necessary to create more space on the filesystem
(large core files are good candidates for deletion). Have users
write any modified files to local disk if possible. If this error
occurs often, redistribute directories to ease demandon this
partition.

For more information on disk usage, see the System Administration
Guide, Volume II.  If you are using the AnswerBook, "managing
disk use" is a good search string.

NFS write failed for server variable: RPC: Timed out 
====================================================

This error can occur when a file system is soft-mounted, and
server or network response time lags. Any data written to the
server during this period could be corrupted.

If you intend to write on a filesystem, never specify the soft
mount option. Use the default hard mount for all the filesystems
that are mounted read-write.

For more information, see the chapter on NFS troubleshooting in
the NFS Administration Guide.

NIS+ authentication failure 
===========================

This is a Federated Naming Service message. The operation could
not be completed because the principal making the request could
not be authenticated with the name service involved.

Run the nisdefaults(1) command to verify that you are identified
as the correct NIS+ principal. Also check that the system has
specified the correct public key source.

For more information, see the authentication and authorization
overview in the NIS+ and FNS Administration Guide.

No buffer space available 
=========================

An operation on a transport endpoint or pipe was not performed
because the system lacked sufficient buffer space or because a
queue was full. The target system probably ran out of memory or
swap space. Any data written during this condition will probably
be lost.

To add more swap area, use the swap -a command on the target
system.  Alternatively, reconfigure the target system to have
more swap space. As a general rule, wwap space should be two to
three times as large as physical memory.

The symbolic name for this error is ENOBUFS,errno=132.

No child processes 
==================

This message can appear when an application tries to communicate
with cooperating process that do not exist.

Restart the parent process so it can create the child processes
again. If that doesn't help, this could be the result of
programming error; contact the vendor or author of the program
for an update.

A wait(2) system call was executed by a process that had no
existing or unwaited-for child processes. The child processes
could have exited prematurely, or might never have been created.

The symbolic name for this error is ECHILD, errno=10.

No default media available 
==========================

The volume manager issues this message if a user makes an
eject(1) request when the drives containno diskette or CDROM to
eject.

Insert a diskette or CDROM. If the volume manager is confused and
there actually is a diskette or CDROM in a drive, run volcheck to
update the volume manager. If the system remains confused, try
booting with the -r option to reconfigure devices.

No directory! Logging in with home=/ 
====================================

The login(1) program could not find the home directory listed in
the password file or NIS passwd map, so it deposited the user in
the root directory.

Check that the user's home directory is mounted and is owned by
and accessible to that user. Perhaps the automounter tried to
mount the home directory, but the NFS server did not respond
quicklyenough. Try listing the files in /home/username. If the
NFS server responds to this request, have the user log out and
log in again.

It is possible that the automounter daemon is not running. Run
the ps command to see if automountd is present. If not,run the
second command; if it appears to be wedged, run both these
commands:

# /etc/init.d/autofs stop # /etc/init.d/autofs start

When the automounter daemon is running, verify that the
/etc/auto_master file has a line like this:

/home  auto_home

Verify that the /etc/auto_home file has a line like this:

+auto_home

These entries depend on the NIS auto_home map.

It is also possible that the NFS server has not shared (exported)
this /home directory, or that the NFS daemons on the server have
disappeared.

For more information on NFS, see the NFS Administration Guide.

No message of desired type 
==========================

An attempt was made to receive a message of a type that does not
exist on the specified message queue. See the msgop(2) man page
for details.

This indicates an error in the System V IPC message facility.
Generally the message queue is empty or devoid of the desired
message type, while IPC_NOWAIT is set.

The symbolic name for this error is ENOMSG, errno=35.

No recipients specified 
=======================

This message comes from the mailx(1) command whenever a user
doesn't provide an address in the To: field.

See the message "Recipient names must be specified" for details.

No record locks available 
=========================

No more record locks are available. The system lock table is
full.

The symbolic name for this error is ENOLCK, errno=46.

Perhaps a process called fcntl(2) with the F_SETLK or F_SETLKW
option, and the system maximum was exceeded. The system contains
several different locking subsystems, including fcntl,the NFS
lock daemon, and mail locking, all of which can produce this
error.

Try again later, when more locks might be available.

No route to host 
================

An operational error occurred because there was no route to the
destination host, or because of status information returned by
intermediate gateways or switching nodes.

The returned status is not always sufficient to distinguish
between a host that is down and a network that isdown. See the
"Network is unreachable" message.

Check the network routers and switches to see if they are
disallowing these packet transfers. If they are allowing all
packet transfers, check network cabling and connections.

The symbolic name for thiserror is EHOSTUNREACH, errno=148.

No shell Connection closed 
===========================

A user has attempted to remote login to the system, and has a
valid account name and password, but the shell specified for
their account is not available on that system. For example, the
seventh field could request the GNUBourne-again shell /bin/bash,
which does not exist on standard Solaris distributions.

If you have a copy of the requested shell, become superuser and
install the missing shell on that system. Otherwise, change the
user's password file entry (perhaps only in the NIS+ or NIS
passwd map) to specify an available shell such as /bin/csh or
/bin/ksh.

No space left on device 
=======================

While writing an ordinary file or creating a directory entry,
there was no free space left on the device. The disk, tape, or
diskette is full of data. Any data written to that device during
this condition will be lost.

Remove unneeded files from the hard disk or diskette until there
is space for all the data you are writing. It might be advisable
to move some directories onto another filesystem and create
symbolic links accordingly. When a tape is full, continue on
another one, use a higher density setting, or obtain a higher-
capacity tape.

To create multi-volume tapes or diskettes, use the pax(1) or
cpio(1) command; tar(1) is still limited to a single volume.

The symbolic name for this error is ENOSPC, errno=28.

No such device 
==============

An attempt was made to apply an operation to an inappropriate
device, such as writing to a nonexistent device.

Look in the /devices directory to see why this device does not
exist, or why the program expects it to exist. The similar "No
such device or address" message tends to indicate I/O problems
with an existing device, whereas this message tends to indicate a
device that does not exist at all.

The symbolic name for this error is ENODEV, errno=19.

No such device or address 
=========================

This can occur when a tape drive is off-line or when a device has
been powered off or removed from thesystem.

For tape drives, make sure the device is connected, powered on,
and toggled on-line (if applicable). For disk and CDROM drives,
check that the device is connected and powered on.

With all SCSI devices, ensure that the target switch or dial is
set to the number where the system originally mounted it. To
inform the system of a change to the target device number, reboot
using the -r (reconfigure) option.

This message results from I/O to a special file's subdevice that
either does not exist or that exists beyond the limit of the
device.

The symbolic name for this error is ENXIO, errno=6.

No such file or directory 
=========================

The specified file or directory does not exist. Either the file
name or path name was entered incorrectly.

Check the file name and path name for correctness and try again.
If the specified file or directory is a symbolic link, it
probably points to a nonexistent file or directory.

The symbolic name for this error is ENOENT, errno=2.

no such map in server's domain 
==============================

A user or an application tried to look up something using Network
Information Services (NIS), but NIS has no corresponding database
for this request.

Make sure the NIS map name is spelled correctly. To see a list of
nicknames for the various NIS maps, run the ypcat -x command. To
see a full list of the various NIS maps (databases), run the
ypwhich -m command. If the NIS service were not running on the
current machine, these commands would result in a "can't
communicate with ypbind" message.

No such process 
===============

This process cannot be found. The process could have finished
execution and disappeared, or it might still be in thesystem
under a different numeric ID.

Use the ps(1) command tocheck that the process ID you're
supplying is correct.

No process corresponds to the specified process ID (PID), light-
weight process ID, or thread_t.

The symbolic name for this error is ESRCH, errno=3.

No such user as variable-- cron entries not created 
===================================================

A file exists in /var/spool/cron/crontabs for the specified user,
but this user is not in /etc/passwd or the NIS passwd map. The
system cannot create cron entries for nonexistent users.

To eliminate this message at boot time, remove the cron file for
the nonexistent user, or rename it if the user's login name has
changed. If this is a valid user, create an appropriate password
entry for this name.

Not a directory 
===============

A non-directory was specified where a directory is required, such
as in a path prefix or as an argument to the chdir(2) system
call.

Look at a listing of all the files in the current directory and
try again, specifying a directory instead of a file.

The symbolic name for this error is ENOTDIR, errno=20.

Not enough space 
================

This message indicates that the system is running many large
applications simultaneously, and has run out ofswap space
(virtual memory). It could also indicate that applications failed
without freeing pages from the swap area. Swap space is an area
of disk set aside to store portions of applications and data not
immediately required in memory. Any data written during this
condition will probably be lost.

Reinstall or reconfigure the system to have more swap space. A
general rule of thumb is that swap space should be two to three
times as large as physical memory.  Alternatively, use mkfile(1M)
and swap(1M) to add more swap area. This example shows how to add
16 MB of virtual memory in the /usr/swap file (any filesystem
with enough free space would work):

# mkfile 16m /usr/swap # swap -a /usr/swap

To make this automatic at boot time, add the following line to
the /etc/vfstab file:

/usr/swap   -   -   swap   -   no  -

In calling the fork(2), exec(2), sbrk(2), or malloc(3C) routine,
a program asked for more memory than the system could supply.
This is not a temporary condition; swap space is a system
parameter.

The symbolic name for this error is ENOMEM, errno=12.

not found 
=========

This message indicates that the Bourne shell could not find the
program name given as a command.

Check the form and spelling of the command line. If that looks
correct, echo $PATH to see if the user's search path is correct.
When communications are garbled, it is possible to unset a search
path to such an extent that only built-in shell commands are
available. Here is a command to reset a basic search path:

$ PATH=/usr/bin:/usr/ccs/bin:/usr/openwin/bin:.

If the search path looks correct, check the directory contents
along the search path to see if programs are missing or if
directories are not mounted.

NOTICE: /variable: out of inodes 
================================

The filesystem specified after the first colon probably contains
many small files, exceeding the per-filesystem limit for inodes
(file information nodes).

If many small files were created unintentionally, removing them
will resolve the problem.

Otherwise, follow these steps to increase filesystem capacity for
small files. Make several backup copies of the filesystem on
different tapes (for safety), then bring the machine down to
single-user mode. Use the newfs(1M) command with the -i option to
increase inode density for this filesystem. Here is an example:

# newfs -i 1024 /dev/rdsk/partition

Finally, restore the filesystem from a backup tape. Note that
increasing the inode density slightly reduces total filesystem
capacity.

Not login shell 
===============

This message results when a user triesto logout(1) from a shell
other than the one started at login time.

To quit a non-login shell, use the exit(1) command. Continue
doing so until you have logged out.

For more general information on the login shell, see the section
on customizing your work environment in the Solaris Advanced
User's Guide.

Not on system console 
=====================

A user tried to login(1) to a system as the superuser (uid=0,
which is not necessarily root) from a terminal other than the
console.

Login to that system as a normal user, then run su(1M) to become
superuser. To allow superuser logins from any terminal, comment
out the CONSOLE line in /etc/default/login (this is not
recommended for security reasons).

Not owner 
=========

Either an ordinary user tried to do something reserved for the
superuser, or the user tried to modify a file in a way restricted
to the file's owner or to the superuser.

Switch user to root and try again.

The symbolic name for this error is EPERM, errno=1.

Not supported 
=============

This version of the system does not support the feature
requested, although future versions of the system might provide
support.

This is generally not a system message from the kernel, but an
error returned by an application. Contact the vendor or author of
the application for an update.

The symbolic name for this error is ENOTSUP, errno=48. 
 

operation failed [error 185], unknown group error 0, variable 
=============================================================

When you use admintool to add a user to a newly-created group,
admintool issues this error.

Apply patch 101384-05 to fix bug ID 1151837 and to provide a
workaround for bug ID 1153087.

Operation not applicable 
========================

This error indicates that no system support exists for some
function that the application requested.

Ask the system vendor for an upgrade, or contact the vendoror
author of the application for an update.

This message indicates that no system support exists for an
operation. Many modules set this error when a programming
function is not yet implemented. If you are writing a program
that produces this message while calling a system library, try to
find and use an alternative library function. Future versions of
the system might support this operation; check system release
notes for further information.

The symbolic name for this error is ENOSYS, errno=89.

out of memory 
=============

Hundreds of different programs can produce this message when the
system is running many large applications simultaneously. This
message usually means that the system has run out of swap space
(virtual memory).

See the message "Not enough space" for details. Any data written
during this condition will probably be lost.

PARTIALLY ALLOCATED INODE I=N CLEAR? 
=====================================

During phase 1, fsck(1M) found that the specified inode was
neither allocated nor unallocated. The reason is probably that
the system crashed in the middle of a sync(2) or write(2)
operation.

Should you answer yes to this question, "UNALLOCATED" messages
might result during phase 2, if any directory entries point to
this inode. If you are being careful, exit fsck(1M) and run
ncheck(1M) (specifying the inode number after the -i option) to
determine which file or directory is involved here. You might be
able restore this file or directory from another system. It is
also possiblethat fsck will copy this file to the lost+found
directory in a later phase.

For more information, see the chapter on checking filesystem
integrity in the System Administration Guide, Volume I.

passwd.org_dir: NIS+ servers unreachable 
========================================

This is the first of three messages thatan NIS+ client prints
when it cannot locate an NIS+ server on the network.

See the message "hosts.org_dir: NIS+ servers unreachable" for
details.

Password does not decrypt secret key for unix.uid@variable 
==========================================================

This message appears at login time when a user's password is not
identical to the user's keylogin network password. When a system
is running NIS+, the login program firstperforms UNIX
authentication, and then attempts a keylogin(1) for secure RPC
authentication.

To gain credentials for secure RPC, users can run keylogin (after
login) and type in their secret key. To stop this message from
appearing at login time, users can run the chkey -p command and
set their network password to bethe same as their NIS+ password.
If a user doesn't remember the network password, the system
administrator should delete and re-create the user's credentials
table entry so the user can establish a new network password with
chkey.

Permission denied 
=================

An attempt was made to access a file in a way forbidden by the
protection system.

Check the ownership and protection mode of the file (with a long
listing from the ls-l command) to see who is allowed to access
the file. Then change the file or directory permissions as
needed.

The symbolic name for this error is EACCES, errno=13.

Please specify a recipient. 
===========================

With mailtool, this message comes up in a dialog box whenever a
user tries to deliver a message with no address in the To: field.

See the message "Recipient names must be specified" for details.

Protocol not supported 
======================

The requested networking protocol hasnot been configured into
the system, or no implementation for it exists. (A protocol is a
formal description of the messages to be exchanged and the rules
to be followed when systems exchange information.)

Verify that the protocol is in the /etc/inet/protocols file and
in the NIS protocols map, if applicable. If the protocol is not
listed, and you want to permit its use, configure the protocol as
documented or as required.

The symbolic name for this error is EPROTONOSUPPORT, errno=120.

Protocol wrong type for socket 
==============================

This message indicates either application programming error, or
badly configured protocols.

Make sure that the /etc/protocols file corresponds number-for-
number with the NIS protocols map. It it does, ask the vendor or
author of the application for an update.

A protocol was specified that does not support the semantics of
the socket type requested. This amounts to a request for an
unsupported type of socket. Look at the source code that made
this socket request and check that it requested one of the types
specifiedin /usr/include/sys/socket.h.

The symbolic name for this error is EPROTOTYPE, errno=98.

Read error from network: Connection reset by peer 
=================================================

This message appears when a user is remotely logged into a
machine that crashes or gets rebooted during the rlogin(1) or
rsh(1) session. Any data changes that were not saved are probably
lost. Sometimes this message appears only when the user types
something, even though the system went down hours before.

Try torlogin again, perhaps after waiting a few minutes for the
system to reboot.

Read-only file system 
=====================

Files and directories on filesystems that are mounted read-only
cannot be changed.

If you only modify these files and directoriesoccasionally,
rlogin(1) to the servers from which the filesystems are mounted
and change the files or directories there. If you change these
files and directories frequently, mount(1M) the filesystems
read/write.

The symbolic name for this error is EROFS, errno=30.

rebooting... 
============

This message appears on the console to indicate that the machine
is booting, either after the superuser issued a reboot command,
or after a system panic if the EEPROM's watchdog-reboot? variable
is set to true.

Allow the machine to boot itself. In case of a system panic, look
above this message for other indications of what went wrong.

Recipient names must be specified 
=================================

Somebody sent mail without a valid recipient in the To: field, so
sendmail could not deliver the mail message. Using mail(1), the
recipient's address might have been specified using spaces or
non-alphanumeric characters. The mailtool(1)and mailx(1)
commands try to prevent this by issuing "Please specify a
recipient" or "No recipients specified" messages instead. If
there is at least one valid recipient, each invalid recipient
address will generate a "User unknown" message.

Look in the sender's dead.letter file for the automatically saved
message, andhave the originator send it again, this time
specifying a recipient.

For more information about sendmail, see the Mail Administration
Guide.

Reset tty pgrp from N to N 
==========================

The C shell sometimes issues this message when it clears away the
window process group after the user exits the window system. This
can happen when the window system doesn't clean up after itself.

Proceed with your work. This message is purely informational.

Resource temporarily unavailable 
================================

This indicates that the fork(2) system call failed because the
system's process table is full, or that a system call failed
because of insufficient memory or swap space. It is also possible
that a user is not allowed to create anymore processes.

Simply waiting often gives the system time to free resources.
However if this message occurs often on a system, reconfigure the
kernel and allow more processes.  To increase the size of the
process table in Solaris 2.x, increase the value of maxusers in
the /etc/system file. The default maxusers value is the amount of
main memory in MB, minus 2.

If one user is not allowed to create any more processes, that
user has probably exceeded the memorysize limit; see the limit(1)
man page for details.

The symbolic name for this error is EAGAIN, errno=11.

Result too large 
================

This is a programming error or a data input error.

Ask the program's author to fix this condition.

This indicates an attempt to evaluate a mathematical programming
function at a point where its value would overflow or underflow.
The value of a programming function in the math package (3M) is
not representable within machine precision. This could occur
after floating point overflow or underflow (either single or
double precision), or after total loss of numeric significance in
Bessel functions.

Note that this message can indicate "Result too small" in the
case of floating pointunderflow.

To help pinpoint a program's math errors, use the matherr(3M)
facility.

The symbolic name for this error is ERANGE, errno=34.

rmdir: variable: Directory not empty 
====================================

The rmdir(1) command can remove empty directories, only. The
directory whose name appears after the first colon in the message
still contains some files or directories.

Use rm(1) instead of rmdir. To remove this directory and
everything underneath it, use the rm -ir command to recursively
descend the directory, being asked if you want to delete each
element. To remove the directory and all its contents without
being asked for approval, use the rm -r command.

ROOT LOGIN /dev/console 
=======================

This syslog message indicates that someone has logged in as root
on the system console.

If you have just logged in as root, don't worry. If this is not
you, consider the possibility of a security breach. The best
site-wide policy is for all system administrators to su instead
oflogging in as root.

ROOT LOGIN /dev/pts/N FROM variable 
===================================

This syslog message indicates that someone has remote logged in
as root on a pseudo-terminal from the system specified after the
FROM keyword.

For security reasons, it is a bad idea to allow root logins from
anywhere besides the console. To restrict superuser logins to the
console, remove the comment from the CONSOLE line in
/etc/default/login.

rx framing error 
================

Usually this error indicates a hardware problem.

Check the Ethernet cabling and connectors to locate a problem.

A framing error occurs when the Ethernet I/O driver receives a
non-integral unit of octets, such as 63 bytes and then 3 bits.
(Ethernet specifies the use of octets.) Framing errors are caused
by corruption of the starting or ending frame delimiters. These
can be corrupted by some violation of the encoding scheme.

Framing errors are a subset of CRC errors, which are usually
caused by anomalies on the physical media.An "alignment/framing
error" is a type of CRC error where octet boundaries do not line
up.

SCSI bus DATA IN phase parity error 
===================================

The most common cause of this problem is unapproved hardware.
Some SCSI devices for thePC market do not meet the high I/O
speed requirements for the UNIX market.  Other possible causes of
this problem are improper cabling or termination, and power
fluctuations. Data corruption is possible but unlikely to occur,
because this parity error prevents data transfer.

Check that all SCSI devices on the bus are Sun approved hardware.
Then verify that all cables are no longer than six meters, total,
and that all SCSI connections are properly terminated. If power
fluctuations are occuring, invest in an uninterruptible power
supply.

SCSI transport failed: reason 'reset' 
=====================================

This message indicates that the system sent data over the SCSI
bus, but the data never reached its destination because of a SCSI
bus reset. The most common cause of this condition is conflicting
SCSI targets.�Data corruption is possible but unlikely to
occur, because this failure prevents data transfer.

Verify that all cables are no longer than six meters, total, and
that all SCSI connections are properly terminated. If power
surges are a problem, acquire a surge suppressor or
uninterruptible power supply.

A machine's internal disk drive is usually SCSI target 3. Make
sure that external and secondary disk drives are targeted to 1,
2, or 0, and do not conflict with each other.  Also make sure
that tape drives are targeted to 4 or 5, and CD drives to 6,
avoiding any conflict with each other or with disk drives. If the
targeting of the internal disk drive is in question, power off
the machine, remove all external drives, turn the power on, and
from the PROM monitor run the probe-scsi-all or probe-scsi
command.

If SCSI device targeting is acceptable, memory configuration
could be the problem, especially for machines with the sun4c
architecture. Ensure that high-capacity memory chips (such as 4MB
SIMMs) are in lower banks, while lower-capacity memory chips
(such as 1MB SIMMs) are in the upper banks.

Note that SPARC systems do not always support third party CDROM
drives, and might generate a similar "unknown vendor" error
message. Check with the CDROM vendor for specific configuration
requirements.

Some third party disk drives have a read-ahead cache that
interferes with Solaris device drivers. Make sure that any
existing read-ahead cache facility is turned off.

� For more information on SCSI targets, see the section on
device naming conventions in the Solaris 1.x to Solaris 2.x
Transition Guide. If you are using the AnswerBook, "scsi targets"
is a good search string.

Segmentation Fault 
==================

Segmentation faults usually result from programming error. This
message is usually accompanied by a core dump, except on read-
only filesystems.

To see which program produced a core file, run either the file(1)
command or the adb (1) command. The following examples show the
output of the file and adb commands on a core file from the
dtmail program.

$ file core core: ELF 32-bit MSB core file SPARC Version 1, from
`dtmail'

$ adb core core file = core -- program `dtmail' SIGSEGV  11:
segmentation violation ^D      (use Control-d to quit the adb
rogram)

Ask the vendor or author of this program for a debugged version.

A process has received a signal indicating that it attempted to
access an area of memory that is protected or that does not
exist. The two most common causes of segmentation faults are
attempting to dereference a null pointer or indexing past the
bounds of an array.

sendmail[N]: NOQUEUE: SYSERR: net hang reading from variable 
============================================================

This is a sendmail message that appears on the console and in the
log file /var/adm/messages. If this message occurs once for a
particular user, it is possible that a mail message from this
user ends with a partial line (having no terminating newline
character). If this message appears frequently or at busy times,
especially along with other networking errors, it could indicate
network problems.

Check the user's mail spool file to see if a message ends without
a newline character.  If so, talk with the user and determine how
to prevent the problem from occurring again. If these messages
are the result of network problems, you could try moving the mail
spool directory to another machine with a faster network
interface.

During the SMTP receipt of DATA phase, a message-terminating
period on a line of its own never arrived, so sendmail timed out
and produced this error.

setmnt: Cannot open /etc/mnttab for writing 
===========================================

The system is having problems writing to /etc/mnttab. It is
possible that the filesystem containing /etc is mounted read-
only, or is not mounted at all.

Check that this file exists and is writable by root. If so,
ensure that the /etc filesystem has been mounted, and is mounted
read-write rather than read-only.

share_nfs: /home: Operation not applicable 
==========================================

This message usually indicates that the system has a local
filesystem mounted on /home, which is where the automounter
usually mounts users' home directories.

When a systemis running the automounter, do not mount local
filesystems on the /home directory. Mount them on another
directory, such as /disk2, which on most systems you will have to
create.You could also change the automounter auto_home entry,
but that is a more difficult solution.

Soft error rate (N%) during writing was too high 
================================================

This message from the SCSI tape drive appears when Exabyteor DAT
tapes generate too many soft (recoverable) errors. It is followed
bythe advisory "Please, replace tape cartridge" message. Soft
errors are an indication that hard errors could soon occur,
causing data corruption.

First clean the tape head witha cleaning tape as recommended by
the manufacturer. If that doesn't work, replace the tape
cartridge. You might need to replace the tape drive if the
problem still occurs with new tape cartridges.

Soft error rate (retries = N) during writing was too high 
=========================================================

This message from the SCSI tape drive appears when Archive tapes
generate too many soft (recoverable) errors. It is followed by
the advisory "Periodic head cleaning required and/or replace tape
cartridge" message. Soft errors are an indication that hard
errors couldsoon occur, causing data corruption.

First clean the tape head with a cleaning tape as recommended by
the manufacturer. If that doesn't work, replace the tape
cartridge. Youmight need to replace the tape drive if the
problem still occurs with new tape cartridges.

Stale NFS file handle 
=====================

A file or directory that was opened by an NFS client was either
removed or replaced on the server.

If you were editing this file, write it to a local filesystem
instead. Try remounting the filesystem on top of itself or
shutting down any client processes that refer to stale file
handles. If neither of these solutions works, reboot the system.

The original vnode isno longer valid. The only way to get rid of
this error is to force the NFS server and client to renegotiate
file handles.

The symbolic name for this error is ESTALE, errno=151.

statd: cannot talk to statd at variable 
=======================================

This message comes from the NFS status monitor daemon statd,
which provides crash recovery services for the NFS lock daemon
lockd. The message indicates that statd has left old references
in the /var/statmon/sm and /var/statmon/sm.bak directories. After
a user has removed or modified a host in the hosts database,
statd might not properly purge files in these directories, which
results in its trying to communicate with a nonexistent host.

Remove the file named variable (where variable is the hostname)
from both the /var/statmon/sm and /var/statmon/sm.bak
directories. Then kill the statd daemon and restart it. If that
doesn't get rid of the message, kill and restart lockd as well.
If that doesn't work, reboot the machine at your convenience.

stty: TCGETS: Operation not supported on socket 
===============================================

This message results when a user tries to remote copy with rcp(1)
or remote shell with rsh(1) from one machine to another, but has
an stty(1) command in the remote

The solution is to move the stty command to the user's .login (or
equivalent) file.  Alternatively, execute the stty command in
.cshrc only when the shell is interactive.  Here is a test to do
just that:

if ($?prompt) stty ...

The rcp andrsh commands make a connection using sockets, which
do not support stty's TCGETS ioctl.

su: No shell 
============

This message indicates that someone changed the default login
shell for root to a program missing from the system. For example,
the final colon-separated field in /etc/passwd could have been
changed from /sbin/sh to/usr/bin/bash, which does not exist in
that location. Possibly an extra space was appended at the end of
line. The outcome is that you cannot login as root or switch user
to root, and so cannot directly fix this problem.

The only solution is to reboot the system from another source,
then edit the password file to correct this problem. Invoke
sync(1M) several times, then halt the machine by typing Stop-A or
by pressing the reset button. Reboot single-user from CDROM, the
net, or diskette, such as by typing boot cdrom -s at the ok
prompt.

After the system comes up and gives you a # prompt, mount the
device corresponding to the original / partition somewhere, such
as with a mount(1M) command similar to the one below. Then run an
editor on the newly-mounted system password file (use ed(1) if
terminal support is lacking):

# mount /dev/dsk/c0t3d0s0 /mnt # ed /mnt/etc/passwd

Use the editor to change the password file's root entry to call
an existing shell, such as /usr/bin/csh or /usr/bin/ksh.

To keep the "No shell" problem from happening, habitually use
admintool or /usr/ucb/vipw to edit the password file. These tools
make it difficult to change password entries in ways that make
the system unusable.

su: 'su root' failed for variable on /dev/pts/N 
===============================================

The user specified after "for" tried to become superuser, but
typed the wrong password.

If the user is supposed to know the root password, wait to see if
the correct password is supplied. If the user is not supposed to
know the root password, ask why he or she is attempting to become
superuser.

su: 'su root' succeeded for variable on /dev/pts/N 
==================================================

The user specified after "for" just became superuser by typing
the root password.

If the user is supposed to know the root password, this message
is purely informational. If the user is not supposed to know the
root password, change this password immediately and ask how the
user learned it.

syncing file systems... 
=======================

This indicates that the kernel is updating the super-blocks
before taking the system down, to ensure filesystem integrity.
This message appears after a halt(1M) or reboot (1M) command. It
can also appear after a system panic, in which case the system
might contain corrupted data.

If you just halted or rebooted the machine, don't worry-- this
message is normal. In case of a system panic, look up the panic
messages that appear above this one. Your system vendor might be
able to help diagnose the problem. So that you can describe the
panic to the vendor, either leave your system in its panicked
state or be sure that you can reproduce the problem.

Numbers that sometimes display after the three dots in the
message show the count of dirty pages that are being written out.
Numbers in brackets show an estimate of the number of busy
buffers in the system.

syslog service starting. 
========================

During system reboot, this message might appear and theboot
seems to hang. After starting syslogd(1M) service, the system
runs /etc/rc2.d/S75cron, which in turn calls ps(1). Sometimes
after an abrupt system crash /dev/bd.off becomes a link to
nowhere, causing the ps command to hang indefinitely.

Reboot single user (for example with boot -s) and run ls -l
/dev/bd* to see if this is the problem. If so, remove
/dev/bd.off, then run bdconfig off or reboot with the -r
(reconfigure) option.

This is the most commonly reported situation that causes ps to
hang.


tar: /dev/rmt/0: No such file or directory 
==========================================

The default tape device /dev/rmt/0, or possibly the device
specified by the TAPE environment variable, is not currently
connected to the system, is not configured, or its hardware
symbolic link is broken.

List the files in the /dev/rmt directory to see which tape
devices are currently configured. If none are configured, 
 ensure
that a tape device is correctly attached to the system, and
reboot with the -r option to reconfigure devices.

If tape devices other than /dev/rmt/0 are configured, you 

could
specify one of them after the -f option of tar(1).

tar: directory checksum error 
=============================

This error message from tar(1) indicates that the checksum of the
directory and the files it has read from tape does not match the
checksum advertised in the header block. Usually this indicates
the wrong blocking factor, although it could indicate corrupt
data on tape.

To resolve this problem, make certain that the blocking factor
you specify on the command line (after -b) matches the blocking
factor originally specified. If in doubt, leave out the block
size and let tar determine it automatically. If that doesn't
help, tape data could be corrupted.

tar: tape write error 
=====================

A physical write error has occurred on the tar(1) output file,
which is usually a tape, although it could be a diskette or disk
file. Look on the system console, where the device driver should
provide the actual error condition. This might be a write-
protected tape, a physical I/O error, an end-of-tape condition,
or a File too large limitation.

In the case of write-protectedtapes, enable the write switch.
For physical I/O errors, the best course of action is to replace
the tape with a new one. For end-of-tape conditions, try using a
higher density if the device supports one, or use cpio(1) or pax
(1) for their multi-volume support., When encountering File too
large limitations, use the parent shell'slimit(1) or ulimit
facility to increase the maximum file size.

For more information on tar tapes, see the section on copying UFS
files in the System Administration Guide,Volume I.

Text is lost because the maximum edit log size has been exceeded. 
=================================================================

This message appears at the beginning of a cmdtool(1) session
after 100,000 characters have gone by in the scrolling window.
Clicking on the top rectangle of the scrollbar might display this
message. No data were lost, but the user cannot scroll back
before this wraparound point.

To increase the maximum size of the Command Tool log file, use
cmdtool with the-M option, specifying more than 100,000 bytes.

THE FOLLOWING FILE SYSTEM(S) HAD AN UNEXPECTED INCONSISTENCY: 
============================================================

At boot time the /etc/rcS script runs the fsck(1M) command to
check the integrity of filesystems marked "fsck" in /etc/vfstab.
If fsck cannot repair a filesystem automatically, it interrupts
the boot procedure and produces this message. When fsck gets into
this state, it cannot repair filesystems without losing one or
more files, so it wants to defer this responsibility to you, the
administrator. Data corruption has probably already occurred.

First run fsck -n on the filesystem, to see how many and what
type of problems exist.  Then run fsck again to repair the
filesystem. If you have a backup of the filesystem, you can
generally answer "y" to all the fsck questions. It's a good idea
to keep a record of all problematic files and inode numbers for
later reference. To run fsck yourself, specify options as
recommended by the boot script. For example:

# fsck /dev/rdsk/c0t4d0s0

Usually, files lost during fsck repair were created just before a
crash or power outage, and cannot be recovered. If important
files are lost, you can recover them from backup tapes.

If you don't havea backup, ask an expert to run fsck for you.

For more information, see the sectionon checking filesystem
integrity in the System Administration Guide, Volume I.

The SCSI bus is hung. Perhaps an external device is turned off. 
===============================================================

This message appears near the beginning of rebooting, immediately
after a "Boot device: ..." message, and then the system hangs.
The problem is conflicting SCSI targets for a non-boot device.
Having an external device turned off is unlikely to cause this
problem.

See the message "Boot device:
/iommu/sbus/variable/variable/sd@3,0" for a solution.

For more information, see the section on halting and booting in
the System Administration Guide, Volume I.

THE SYSTEM IS BEING SHUT DOWN NOW !!! 
=====================================

This message means the system is going down immediately and it's
too late to save any changes.

This message is often preceded by messages telling you that the
system is going down in 15 minutes, 10 minutes, and so on. When
you see these initial broadcast shutdown messages, save all your
work, send any e-mail you're working on, and close your files.
Fortunately vi sessions are automatically saved for later
recovery, but many otherapplications have no crash protection
mechanism. Data loss is likely.

For more information on shutting down the system, see the System
Administration Guide, Volume I. If you are using the AnswerBook,
"halting the system" is a good search string.

The system will be shut down in N minutes 
=========================================

Thismessage from the system shutdown(1M) script informs you that
the superuser is taking down the system.

Save all changes now or your work will be lost. Write out any
files you were changing, send any e-mail messages you were
composing, and close your files.

For more information on shutting down the system, see the System
Administration Guide, Volume I. If you are using the AnswerBook,
"halting the system" is a good search string.

This mail file has been changed by another mail reader. 
=======================================================

This message appears in a pop-up dialog box whenever you start
mailtool(1) while another mail reader has the inbox locked. A
question follows: "Do you wish to ask that mail reader to save
the changes?" You are given three choices.

If you choose "Save Changes" mailtool will request the other mail
reader to relinquish its lock and write out any changes it has
made to your inbox. If you choose "Ignore" mailtool will read
your inbox without locking it. If you choose "Cancel" mailtool
will exit.

Timeout waiting for ARP/RARP packet 
===================================

This problem can occur while booting from the net, and indicates
a network connection problem.

Make sure the Ethernet cable is connected to the network. Check
that this system has an entry in the NIS ethers map or locally on
the boot server. Then check the IP address of the server and the
client to make sure they are on the same subnet. Local /etc/hosts
files must agree with each other and withthe NIS hosts map.

If those are not causing the problem, go to the system's PROM
monitor ok prompt and run test net to test the network
connection. (On older PROM monitors, use test-net instead.) If
the network test fails, check the Ethernet port, card, fuse, and
cable, replacing them if necessary. Also check the twisted pair
port to make sure it is patched to the correct subnet.

For more information on packets, see SPARC: Installing Solaris
Software. If you are using the AnswerBook, "ARP/RARP" isa good
search string.

Too many links 
==============

An attempt was made to create more than the maximum number of
hard links (LINK_MAX, by default 32767) to a file. Because each
subdirectory is a link to its parent directory, the same error
results from trying to create too many subdirectories.

Check to see why there are so many links to the same file. To get
more than the maximum number of hard links, use symbolic links
instead.

The symbolic name for this error is EMLINK, errno=31.

Too many open files 
===================

A process has too many files open at once. The system imposes a
per-process soft limit on open files, OPEN_MAX (usually 64),
which can be increased, and a per-process hard limit (usually
1024), which cannot be increased.

You can control the soft limit from the shell. In the C shell,
use the limit command to increase the number of descriptors. In
the Bourne or Korn shells, use the ulimit command with the -n
option to increase the number of file descriptors.

If the window system refuses to start new applications because of
this error, increase the open file limit in your login shell
before starting the window system.

The symbolic name for this error is EMFILE, errno=24.

umount: warning: /variable not in mnttab 
========================================

This message results when the superuser attempts to unmount a
filesystem that is not mounted. Note that subdirectories of
filesystems,such as /var, cannot be unmounted.

Run the mount(1M) or df(1M) command to see what filesystems are
mounted. If you really want to unmount one of them, specify the
existing mount point.

Unable to install/attach driver 'variable' 
==========================================

These messages appear in /var/adm/messages at boot time, when the
system tries to load drivers for devices the machine does not
have.

Despite the alarmist tone, this message is intended as purely
informational. You probably don't want all these device drivers,
because they make your system kernel larger, requiring more
memory.

undefined control 
=================

This message, prefaced by the file name and line number involved,
is from the C preprocessor /usr/ccs/lib/cpp, and indicates a line
starting with a sharp (#) but not followed by a valid keyword
such as define or include.

A piece of software might be running the C preprocessor on an
initialization file that you thought was interpreted by a shell.
In most shells, the sharp (#) indicates a comment. The C
preprocessor considers comments to be anythingbetween /* and */
delimiters.

Unmatched ` 
===========

This message from the C shell csh(1) indicates that a user typed
a command containing a backquote symbol (`) without a closeing
backquote. Similar messages result from an unmatched single quote
(') or an unmatched double quote ("). Other shells generally give
a continuation prompt when a command line contains an unmatched
quote symbol.

Correct the command line and try again. To continue typing on
another line, give the C shell a backslash right before the
newline.

UNREF FILE I=i OWNER=o MODE=m SIZE=s MTIME=t
============================================= CLEAR? 
======

During phase 4, fsck(1M) discovered that the specified file was
orphaned because the inode had no record of its pathname. In
other words, the file was not connected into any directory.

Answer yes to reconnect the file into the lost+found directory.
Then contact the file's owner to ask whether they want it back,
and where they want you to place it.

For more information, see the chapter on checking filesystem
integrity in the System Administration Guide, Volume I.

Use "logout" to logout. 
=======================

This C shell message might come as a surprise to Bourne or Korn
shell users accustomed to logging out with a Control-d.

When ignoreeof is set, the C shell requires users to logout by
typing logout or exit.  Write any modified files to disk before
exiting.

/usr/openwin/bin/xinit: connection to X server lost 
===================================================

This means that the xinit(1) program, which sets up X11 resources
and starts a window manager, failed to locate the X server
process. Perhaps the user interrupted window system startup, or
exited abnormally from OpenWindows (for example, by killing
processes or by rebooting). It is possible that the X server
crashed. Data loss is possible in some cases. Depending on
process timing, this message might be normal when OpenWindows
exits during a system reboot.

The only solution is to exit and restart OpenWindows. You do not
need to reboot the system unless it hangs and fails to give you a
console prompt. To exit OpenWindows, select Workspace->Exit. To
restart OpenWindows, type openwin at the system prompt.

Value too large for defined data type 
=====================================

The user ID or group ID of an IPC object or file system object
was too large to be stored in an appropriate member of the
caller-provided structure.

Run the application on a newer system, or ask the program's
author to fix this condition.

This error occurs only on systems that support a larger range of
user or group ID values than a declared member structure can
support. This condition usually occurs because the IPC or file
system object resides on a remote machine with a larger value of
type uid_t, off_t, or gid_t than that of the local system.

The symbolic name for this error is EOVERFLOW, errno=79.

WARNING: Clock gained N days-- CHECK AND RESET THE DATE! 
========================================================

Each workstation contains an internal clock powered by a
rechargeable battery. After the system is halted and turned off,
the internal clock continues to keep time. When the system is
powered on and reboots, the system notices that the internal
clock has gained time since the workstation was halted.

In most cases, especially if the power has been off for less than
a month, the internal clock keeps the correct time, and you do
not have to reset the date. Use the date(1) command to check the
date andtime on your system. If the date or time is wrong,
become superuser and use the date(1) command to reset them.

WARNING: No network locking on variable: 
 contact adminto install server change 
=====================================

The Solaris 2.x mount(1M) command issues this message whenever it
mounts a filesystem that doesn't have NFS locking, such as a
standard SunOS 4.1.x exported filesytem. Data loss is possible in
applications that depend on locking.

On the remote SunOS 4.1.x system, install the appropriate
rpc.lockd jumbo patch to implement NFS locking. For SunOS 4.1.4,
install patch #102264; for SunOS 4.1.3, install patch #100075;
for earlier 4.1 releases, install patch #101817.

WARNING: processorlevel 4 interrupt not serviced 
=================================================

This message is basically a diagnostic from the SCSI driver.
Especially on machineswith the sun4c architecture, it can appear
on the console every 10 minutes or so.

To reduce the frequency of this message, add this line near the
bottom of the /etc/system file and reboot:

set esp:esp_use_poll_loop=0

You might also see this message repeatedly after manually
removing a CD when it was busy. Don't do this! To get the system
back to normal, reboot the system with the -r (reconfigure)
option.

WARNING: /tmp: File system full, swap space limit exceeded 
==========================================================

The system swap area (virtual memory) has filled up. You needto
reduce swap space consumption by killing some processes or
possibly by rebooting the system.

See the message "Not enough space" for information about
increasingswap space.

WARNING: TOD clock not initialized-- CHECK AND RESET THE DATE! 
========================================================-=====

This message indicates that the Time Of Day (TOD) clock reads
zero, so its time is the beginning of the UNIX epoch: midnight 31
December 1969. On a brand-new system, the manufacturer might have
neglected to initialize the system clock. On older systems it is
more likely that the rechargeable battery has run out and
requires replacement.

First replace the batteryaccording to the manufacturer's
instructions. Then become superuser and use the date(1) command
to set the time and date. On SPARC systems the clock is powered
by the same battery as the NVRAM, so a dead battery also causes
loss of the machine's Ethernet address and host ID, which are
more serious problems for networked systems.

WARNING:Unable to repair the / filesystem. Run fsck 
====================================================

This message comes at boot time from the /etc/rcS script whenever
it gets a bad return code from fsck(1) after checking a
filesystem. The message recommends an fsck command line, and
instructs you to exit the shell when done to continue booting.
Then the script places the system in single-user mode so fsck can
be run effectively.

See "/dev/rdsk/variable: UNEXPECTED INCONSISTENCY" for
information about repairing UFS filesystems.

See "THE FOLLOWING FILE SYSTEM(S) HAD AN UNEXPECTED
INCONSISTENCY" for information about repairing non-UFS
filesystems.

Watchdog Reset 
==============

This fatal error usually indicates some kind of hardware problem.
Data corruption on the system is possible.

Look for some other message that might help diagnose the problem.
By itself, a watchdog reset doesn't provide enough information;
because traps are disabled, all information has been lost. If all
that appears on the console is an ok prompt, issue the PROM
command below to view the final messages that occurred just
before system failure:

ok f8002010 wector p

Yes, that word iswector, not vector.

The result is a display of messages similar to those produced by
the dmesg(1M) command. These messages can be useful in finding
the cause of system failure.

This message doesn't come from the kernel, but from the OpenBoot
PROM monitor, a piece of Forth software that gives you the ok
prompt before you boot UNIX. If the CPU detects a trap when traps
are disabled (an unrecoverable error), it signals a watchdog. The
OpenBoot PROM monitor detects the watchdog, issues this message,
and brings down the system.

Watchdog Reset, Rebooting. 
==========================

See the message "Watchdog Reset" for details. This rebooting
message occurs under the same conditions, but when the EEPROM's
watchdog-reboot? variable is set to true, causing the machine to
automatically reboot itself. Data corruption on the system is
possible.

Who are you? 
============

Many networking programs can print this message, including
from(1B), lpr(1B), lprm(1B), mailx(1), rdist(1), sendmail(1M),
talk(1), and rsh(1). The command prints this message when it
cannot locate a password file entry for the current user.  This
might occur if a user logged in just before the superuser deleted
that user's password entry, or if the network naming service
fails for a user who has no entry in the local password file.

If a user's password file entry was accidentally deleted, restore
it from backups or from another password file. If a user's login
name or user ID was changed, ask that user to logout and login
again. If the network naming service failed, check the NIS
server(s) and repair or reboot as necessary.

There is a known problem (bug 1138025) with starting hundreds of
rsh processes on another machine. This message appears because
rsh hangs while binding to a reserved port, and responds too
slowly to interact with the network naming service.

Window Underflow 
================

This message often occurs at boot time, sometimes along with a
"Watchdog Reset" error. It comes from the OpenBoot PROM monitor,
which was passed a processor trap from the hardware. This error
indicates that some programtried to access a SPARC register
window that wasn't accessible from the processor.

On some system architectures, specifically sun4c, the problem
could be that different capacity memory chips are mixed together.
Someone might have placed 1MB SIMMs in the same bank with 4MB
SIMMs. If this is so, rearrange the memory chips. Make sure to
put higher-capacity SIMMs in the first bank(s), and lower-
capacity SIMMs inthe remaining bank(s); never mix different
capacity SIMMs in the same bank.

The problem could also be that cache memory on the motherboard
has gone bad and needs replacement. If main memory is installed
correctly, try swapping the motherboard.

The best way to isolate the problem is to look at the %pc
register to see where it got its arguments from, and why the
arguments were bad. If you can reproduce the condition causing
this message, your system vendor might be able to help diagnose
the problem.

X connection to variable:0.0 broken (explicit kill or 
 server shutdown). 
=================

This means that the client has lost its connection to the X
server. The "0.0" represents the display device, which is usually
the console. This message can appear when a user is running an X
application on a remote system with the DISPLAY set back to the
original system and the remote system's X server disappears,
perhaps because someone exited X windows orrebooted the machine.
It sometimes appears locally when a user exits the window system.
Dataloss is possible if applications were killed before saving
files.

Try to run the application again in a few minutes after the
system has rebooted and the window system is running.

xinit: not found 
================

OpenWindows was probably not installed properly, and the
openwin(1) program could not find xinit(1) to start up the X
windows system. If the user is running another version of X
windows, such as the MIT X11 distribution, the startx program
serves the same function as xinit.

Check the PATH environment variable to make sure it contains the
appropriate X windows install directory. Verify that xinit is in
this directory as an executable program.

XIO: fatal IO error 32 (Broken pipe) on X server "variable:0.0" 
===============================================================

This means that I/O with the X server has been broken. The "0.0"
represents the display device, which is usually the console. This
message can appear when a user is running Display PostScript
applications and the X server disappears or the client is shut
down. Data loss is possible if applications disappeared before
saving files.

Try to run the application again in a few minutes after the
system has rebooted and the window system is running.

Xlib: Client is not authorized to connect to Server 
===================================================

See the message "Xlib: connection to ... refused by server" for
details.

Xlib: connection to "variable:0.0" refused by server 
====================================================

This message is immediately followed by the "Xlib: Client is not
authorized to connect to Server" message. These messages indicate
that an X windows application tried to run on the X server
specified inside double quotes, which did not allow the request.
The "0.0" represents the display device, which is usually the
console. If no server name appears, the superuser probably tried
to run an X application on the current machine in an X session
that was owned by somebody else.

To allow this client to connect to the X server, run xhost
+clientname on the X server system. Only the owner of the current
X session (who is not necessarily the superuser) isallowed to
run the xhost command. If somebody else is running X windows on
the server, ask them to log out and then start your own X session
on that server; remote X connections are usually allowed for the
same user ID.

xterm: fatal IO error 32 (Broken Pipe) or KillClient on X server
variable:0.0" 
=============

This means that xterm(1) has lost its connection to the X server.
The "0.0" represents the display device, which is usually the
console. This message can appear when a user is running xterm and
the X server disappears or the client gets shut down. Data loss
is possible if applications were killed before saving files.

Try to run the terminal emulator again in a few minutes after the
system has rebooted and the window system is running.

XView warning: Cannot load font set 'variable' (Font Package) 
=============================================================

This message from the XView library warns that a requested font
is not installed on the X server. Often multiple warnings appear
about the same font. The set of available fonts can vary from
release to release.

To see which fonts are available on the X server, run the
xlsfonts(1) program. Then specify another font name that you see
in the output of xlsfonts. Sometimes it is possible to locate a
similar font from a different vendor.

There are two packages of X windowsfonts: the common but not
required fonts (SUNWxwcft), and the optional fonts (SUNWxwoft).
Run pkginfo(1) to see if both these packages are installed, and
add them to the system as you wish.

ypbind[N]: NIS server for domain "variable" OK 
==============================================

This message appears after an "NIS server not responding" message
to indicate that ypbind(1M is able to communicate with an NIS
server again.

Proceed with your work. This message is purely informational.

ypbind[N]: NIS server not responding for domain 
 "variable"; still trying 
=========================

This means that the NIS client daemon ypbind(1M) cannot
communicate with an NIS server for the specified domain. This
message appears when a workstation running the NIS naming service
has become disconnected from the network, or when NIS servers are
down or extremely slow to respond.

If other NIS clients are behaving normally, check the Ethernet
cabling on the workstation that is getting this message. On SPARC
machines, disconnected network cabling also produces a series of
"no carrier" messages. On x86 machines, the above message might
be your only indication that network cabling is disconnected.

If many NIS clients on the network are giving this message, go to
the NIS server in question and reboot or repair as necessary. To
locate the NIS server for a domain, run the ypwhich(1) command.
When the server machine comes back in operation, NIS clients give
an "NIS server for domain OK" message.

For more information about ypbind, see the section on
administering secure NFS in the NFS Administration Guide.

ypwhich: can't communicate with ypbind 
======================================

This message from the ypwhich(1) command indicates that the NIS
binder process ypbind(1M) is not running on the local machine.

If the system is not configured to use NIS, this message is
normal and expected.  Configure the system to use NIS if
necessary.

If the system is configured to use NIS, but the ypbind process is
not running, invoke the following command to start it up:

# /usr/lib/netsvc/yp/ypbind -broadcast

zsN: silo overflow 
==================

This message means that the Zilog 8530 character input silo (or
serial portFIFO) overflowed before it could be serviced. The
zs(4S) driver, which talks to a Zilog Z8530 chip, is reporting
that the FIFO (holding about two characters) has been overrun.
The number after zs shows which serial port experienced an
overflow:

zs0 - tty serial port 0 (/dev/ttya) zs1 - tty serial port 1
(/dev/ttyb) zs2 - keyboard port (/dev/kbd) zs3 - mouse port
(/dev/mouse)

Silo overflows indicate that data in the respective serial port
FIFO has been lost.  However, consequences of silo overflows
might be negligible if the overflows occur infrequently, if data
loss is not catastrophic, or if data can be recovered or
reproduced.  For example, although a silo overflow on the mouse
driver (zs3) indicates that the system could not process mouse
events quickly enough, the user can perform mouse motions again.
Similarly, lost data from a silo overflow on a serial port with a
modem connection transferring data using uucp(1C) will be
recovered when uucp discovers the loss of data and requests
retransmission of the corrupted packet.

Frequent silo overflow messages can indicate a zs hardware FIFO
problem, a serial driver software problem, or abnormal data or
system activity. For example, the system ignores interrupts
during system panics, so mouse and keyboard activity result in
silo overflows.

If the serial ports experiencing silo overflows are not being
used, a silo overflow could indicate the onset of a hardware
problem.

Another type of silo overflow is one that occurs during reboot
when an HDLC line is connected to any of the terminal ports. For
example, an X.25 network could be sending frames before the
kernel has been told to expect them. Such overflow messages can
be ignored.


>>>> PART II <<<<:
==================

Error Message interpretation
See below for a list of common error messages. 

Traps and interrupts can be blocked by a kernel thread's signal mask, or they can trigger an exception handling routine. In the absence of such a routine or mask, the process is terminated. 


Traps
Traps are syncronous messages generated by the process or its underlying kernel thread. Examples include SIGSEGV, SIGPIPE and SIGSYS. They are delivered to the process that caused the signal. 

Trap messages can be discovered in a number of places, including error logs, adb output, and console messages. Sun provides a couple of files that can help determine the type of trap encountered:

/usr/include/sys/trap.h (software traps) 
/usr/include/v7/sys/machtrap.h (hardware traps, 32 bit) 
/usr/include/v9/sys/machtrap.h (hardware traps, 64 bit) 

ECC (Error Checking and Correcting) interrupts are reported as traps when a bit error is corrected. These, while they do not crash the system, are usually a signal that the memory chip in question needs to be replaced. 

Critical errors include things like fan/temperature warnings or power loss that require immediate attention and shutdown. 

Fatal errors are hardware errors where proper system function cannot be guaranteed. These result in a watchdog reset. 


Bus Errors
A bus error is issued to the processor when it references a location that cannot be accessed. 
Illegal address: (usually a software failure) 
Instruction fetch/Data load: (device driver bug) 
DVMA: (on an Sbus system) 
Synchronous/asynchronous data store 
MMU: (Memory Management Unit: can be hardware or software, but frequently are system board problems.) 


Interrupts
These notify the CPU of external device conditions that are asynchronous with normal operation. They can be delivered to the responsible process or kernel thread. 

In Solaris, interrupts are handled by dedicated interrupt-handling kernel threads, which use mutex locks and semaphores. The kernel will block interrupts in a few exceptional circumstances, such as during the process of acquiring a mutex lock protecting a sleep queue. 

Device done or ready. 
Error detected. 
Power on/off. 


Watchdog Reset
Watchdog resets can be caused by hardware or software issues. See the watchdog reset page for information on how to troubleshoot watchdog resets. 

Error Message List
A complete (or even reasonably complete) listing of error messages on Solaris is beyond the scope of this site. For that matter, the nature of an evolving operating system may put it beyond the scope of any reasonably sized page. Maybe a wiki? If someone has such a resource, let me know and I will link to it. 

Having said that, this page contains a list of several of the most common error messages. Where I have been able to identify a usual cause for an error message, I have included that. 

There are several sources that contain listings of error messages that are useful for debugging purposes. 

One of the best resources is the Solaris Common Messages and Troubleshooting Guide released by Sun with Solaris 8. Since this is a better resource than I could provide for Solaris up through 8, I have focused on Solaris 10. (There is obviously a lot of overlap.) 

The SunSolve web site is available to anyone with a Sun service contract. Its search feature can be used to look up key words in an error message to look for current bug reports and patches that may resolve them. This page does not provide a listing of bug reports or patches to apply for given error messages in certain conditions. This page is intended as a supplement to Sunsolve, not a replacement. 

The Intro(2) man page contains an introduction to system calls and error numbers. The information comes from the errno.h include file. Several include files contain at least basic information about different kinds of error messages: 

/usr/include/sys/errno.h (error messages, including abbreviations and numbers seen in truss output.) 
/usr/include/sys/trap.h (software traps) 
/usr/include/v7/sys/machtrap.h (hardware traps, 32 bit) 
/usr/include/v9/sys/machtrap.h (hardware traps, 64 bit) 

These messages are alphabetized by the first non-variable portion of the message. Wording may vary slightly between Solaris versions or even patch levels. If you run across common messages not on this list, feel free to make a comment to the Solaris Troubleshooting blog. 

Accessing a corrupted shared library (ELIBBAD): exec(2) was unable to load a required static shared library. The most common cause for this is a corrupted library. 
Address already in use (EADDRINUSE): The protocol does not permit using an address that is already in use. This error indicates a software programming bug. 
Address family not supported by protocol family (EAFNOSUPPORT): The protocol does not support the requested address. This indicates a software programming bug. 
Arg list too long (E2BIG): The argument list includes both the argument list and the environment variable settings. The most common cause for this problem is that so many environment variables are set that it exceeds the size of the argument buffer used by exec(2). The easiest solution may be to unset some environment variables in the calling shell. 
Argument out of domain (EDOM): This error appears when an improper argument is submitted to a math package programming function. (For example, an attempt to take a square root of a negative number would probably yield this error.) It may be helpful to use matherr(3M) to diagnose the problem, or the programmer may need to implement argument-checking before the function is called. 
Arguments too long: This is a C shell message indicating that more than 1706 arguments follow a command. This may happen if globbing is applied to a large number of objects (eg rm * in a directory of more than 1706 objects). Temporarily switching to Bourne shell may resolve the problem, since Bourne shells dynamically allocate space for arguments. 
Assertion failed: This is a result of an assert(3C) debugging command that the programmer inserted into the program. The output will include an expression, a source file number and a code line number. The information may be useful in examining the source code. 
Attachment point not found: Use cfgadm to list available attachment points. Check the physical connection to the desired device. 
Attempting to link in more shared libraries than system limit (ELIBMAX): The executable requires more static libraries than the current system limit. 
authentication receive failed: Initiator unable to receive authentication information. Verify network connectivity to storage device and authentication server. 
authentication transmit failed: Initiator unable to transmit authentication information. Verify network connectivity to storage device and authentication server. 
Bad address (EFAULT): A function taking pointer argument has been passed an invalid address. This may result from supplying the wrong device or option to a command, or it may be the result of a programming bug. 
Bad file number (EBADF): The file descriptor references a file that is either not open or is open for a conflicting purpose. (eg, a read(2) is specified against a file that is open for write(2) or vice-versa.) This is a programming bug. 
Bad module/chip: This error message usually indicates a memory module or chip that is associated with parity errors. This is a hardware fault. 
BAD SUPER BLOCK: Check the Trap 3E entry below to see if there are possible hardware or SCSI configuration causes for this problem. It may be possible to boot from alternate super blocks. If there is no current backup, boot from a CD and back up the raw partition with ufsdump or another similar utility. Solaris 10's 6/06 release includes enhancements to fsck to automatically find and repair bad superblocks. This option should only be used to repair filesystems that were created with mkfs or newfs. For older systems, an alternate superblock can frequently be found with a
newfs -N /dev/rdsk/c#t#d#s# 
command while booted from a CD. (Note the -N option. Running this command without this option may mess things up beyond repair.) fsck can be run against an alternate superblock with
fsck -o b=superblock /dev/rdsk/c#t#d#s#
If there is a lot of output, it may be necessary to choose the -y option to avoid having to answer a ton of prompts. We may need to try several alternate superblocks before finding a working one. Once we are done, we need to re-install the bootblock:
cd /usr/platform/`arch -k`/lib/fs/ufs
/usr/sbin/installboot ./bootblk /dev/rdsk/c#t#d#s# 
BAD TRAP: The causes for bad traps include system text errors, data access faults, data alignment errors or some types of user software traps. These can indicate either a hardware fault or a mismatch between the hardware and its software configuration. They may also indicate a CPU with an obsolete firmware. Bad traps usually result in a panic, sync, dump, reboot cycle. The kernel traceback message on the console will frequently indicate the hardware component that generated the bad trap. If the configuration for this component is correct, it will need to be replaced (or at least reseated). 
/bin/sh: ... too big: This Bourne shell message is a variant of Not enough space. Check that message for steps to take. 
Block device required (ENOTBLK): A raw device was specified where a block device is required. 
Broken pipe (EPIPE): No reading process was available to accept a write on the other end of a pipe. This can happen when the reading process (the process after the pipe) exits suddenly. 
Bus Error: I/O was attempted to a device that is unavailable or does not exist. See Bus Error above. 
Cannot access a needed shared library (ELIBACC): Either the library does not exist, the LD_LIBRARY_PATH variable does not include the library, or the user is not permissioned to use it. The library in question can usually be pinned down with truss. 
Cannot assign requested address (EADDRNOTAVAIL): The requested address is not on the current machine. 
Cannot exec a shared library directly (ELIBEXEC): You can't execute shared libraries directly. This error indicates a software bug. 
Cannot install bootblock: On an x86 system, this error typically appears when a newfs and restore operation was carried out without performing a installboot before installing the OS. It may be possible to install the bootblock from the CD drive in single-user mode (note that Sun does not guarantee this procedure): 
cd /usr/platform/`arch -k`/lib/fs/ufs 
installboot ./pboot ./bootblk /dev/rdsk/c#t#d#s#

Cannot send after transport endpoint shutdown (ESHUTDOWN): The transport endpoint has been shut down, so data was unable to be sent. The solution is usually to restore the endpoint and re-run the transfer. (We may need to troubleshoot why the remote endpoint became unavailable.) 
can't accept: Initiator does not accept the specified data of the given format. Consult storage device documentation to look for compatibility information for the server hardware and OS. 
can't accept ... in security stage: Device responded with unsupported login information during login security phase. Verify storage device authentication settings. Consult storage device documentation to look for compatibility information for the server hardware and OS. 
can't find environment variable: The specified environment variable has not been set. Check for a typo and/or verify that the variable has been set. 
Can't invoke /etc/init: The init binary is missing or corrupted during a reboot. We may be able to complete the boot by copying init from a CDROM during a CDROM reboot. 
capacity of this LUN is too large: SCSI partitions must be less than 2TB. 
Channel number out of range (ECHRNG): A stream head attempted to open a minor device that is in use or does not exist. We need to make sure that the stream device exists, along with an appropriate number of minor devices, and that it matches the hardware configuration. It may be necessary to schedule jobs differently to allow for limited system resources. 
check boot archive content: If SMF does not start up on its own, this message in response to svcs -x may indicate a failure of svc:/system/boot-archive:default To resolve this problem, select the Solaris failsafe archive option in the GRUB boot menu during the next reboot. The failsafe boot option provides instructions for rebuilding the boot archive. Once that is complete, the boot can be continued by clearing the SMF boot archive with the svcadm clear boot-archive command. 
Command not found: This is a C shell error message that means exactly what it says. It typically means that the command was misspelled or does not live on the PATH. 
Communication error on send (ECOMM): The link between machines breaks after data is sent, but before the confirmation is received. 
Component system is busy, try again: failed to offline: cfgadm attempted to remove or replace a device with a mounted file system, swap area or configured dump device. Unmount the file system, remove the swap and/or disable the dump device, then retry the cfgadm command. See the cfgadm(1M) man page. 
Configuration operation invalid: invalid transition: The incorrect device may have been specified, or there may be a problem with the device or its seating. Use cfgadm to check the receptacle and its state. The card may need to be reseated. 
Connection refused (ECONNREFUSED): The target machine actively refused the connection. The service may not be active, or there may be restrictions on connections (such as the hosts.allow and hosts.deny in TCP wrappers). 
Connection reset (ECONNRESET): The target system forcibly closed an existing connection. This typically happens as a result of a reboot or a timeout. 
Connection timed out (ETIMEOUT): The target host is unreachable due to network problems or the system being down. 
Core dumped: A core file (image of software memory at the time of failure) has been taken. See Core File Management. 
Corrupt label: This happens if cylinder 0 has been overwritten, usually by a database using a raw partition including cylinder 0. The best solution is to back everything up and repartition the disk with cylinder 0 either not in any partition or at least in a partition with a filesystem (such as UFS) that respects cylinder 0. 
cpio: Bad magic number/header: The cpio archive has become corrupted. We can try to recover whatever we can by using the cpio -k command. 
Cross-device link (EXDEV): Hard links are not permitted across different filesystems. Use a soft link instead. 
Data access exception: Mismatch between the operating system and disk storage hardware. This can be due to mis-seated DIMMs or disk problems, so it makes sense to try to identify any hardware problems. Usually, the operating system (and perhaps filesystem) will need to be upgraded to deal with the newer hardware. 
DataDigest=... is required, can't accept: Device returned an improperly processed DataDigest. Verify that storage device digest settings are compatible with the initiator. 
Data Fault: This is a particular type of bad trap that indicates a configuration text or data access fault. See BAD TRAP above. 
Deadlock situation detected/avoided (EDEADLK): A potential deadlock over a system resource (usually a lock) was detected and avoided. The software should be examined to see if it can be made more resilient. 
Destination address required (EDESTADDRREQ): An address was omitted from an operation that requires one. 
/dev/fd/#: cannot open: Indicates that the file descriptor file system (fdfs) is not mounted correctly. In most cases, the problem is that it is mounted either nosuid or not at all. The file descriptor file system should have the following options in the vfstab: 
fd - /dev/fd fd - no - 
Device busy (EBUSY): A hard drive or removable media failed to unmount or eject due to an active process using them. The fuser command allows us to see what processes are using the filesystem or even kill them with a command like: 
fuser -ck /mountpoint
(Make sure that you know what processes are running on a filesystem before killing them.) 
DIMMs Manufacturer Mismatch: DIMMs in the system are not on the hardware compatibility list. 
Directory not empty: This is an error from rmdir which means exactly what it says. Non-empty directories cannot be removed. (If a process is holding a file open, it is possible to track down the culprit by looking for the inode of the file in question (ls -i filename) in pfiles output.) 
Disc quota exceeded (EDQUOT): A user's disk quota has been exceeded. Some of the user's files can be removed or the quotas can be increased with edquota. 
Disk# not unique: This error is displayed if there are multiple EEPROM devalias entries for a disk. At the ok> prompt, the values of the aliases can be shown with 
ok> printenv
the aliases can be reset with 
ok> nvunalias disk#
ok> nvalias disk# device-path 
dquot table full: The UFS quota table needs to be increased in size. This is done by increasing ndquot in /etc/system and rebooting. ndquot defaults to (maxusers x 40)/4 + max_nprocs 
dr in progress: This error may occur if a SCSI unconfigure operation fails while only partially completed. The controller may need to be reconfigured with cfgadm 
driver not attached: No driver currently attached to the specified device because no device exists at the node or the device is not in use. This may or may not mean that a proper driver is not installed. Make sure that the driver is installed and properly configured. 
empty RADIUS shared secret: The RADIUS shared secret needs to be set. 
Error 88 (EILSEQ): This is an illegal byte sequence error. Multiple characters have been provided where only one is expected. 
Error code 2: access violation: This error is due to a permissioning or pathing error on a tftp get. 
Error: missing file arg (cm3): A filename was not included in an sccs command that requires one. 
error opening dir: The specified path may not be a directory. 
error writing name when booting: /etc/nodename must contain exactly one line with the name of the system and no blanks or returns. 
esp0: data transfer overrun: This error appears when we attempt to mount a CD drive with an 8192 block size as opposed to the Sun-standard 512 block size. Check with the drive manufacturer to see if the block size can be switched. 
ether_hostton: Bad file number/Resource temporarily unavailable: These messages may be a result of a mis-matched nodename file. Make sure that the /etc/nodename entry matches the corresponding /etc/hostname.interface and /etc/inet/hosts files. 
Event not found: The shell reports that a command matching the request cannot be found in the history buffer for the shell session. The history command shows the current contents of the history buffer. 
Exec format error (ENOEXEC): This error usually means that the software was compiled for an architecture other than the one on which it finds itself. This may also happen if an expected binary compatibility package is not installed. The file command displays the expected architecture for the binary. 
Failed to initialize adapter: If the adapter has been correctly identified, this means that the configuration of the adapter is incorrect. In particular, make sure to check the DMA settings. 
Failed to receive login response: Initiator failed to receive a login Payload Data Unit (PDU) across the network. Verify that the network connection is working. 
Failed to transfer login: Initiator failed to transfer a login Payload Data Unit (PDU) across the network. Verify that the network connection is working. 
Fast access mmu miss: This is usually due to a hardware problem. Memory is a possible culprit, as are the system board and CPU. Check PROM Monitor Diagnostics for hardware diagnostics on OBP/Sparc systems. 
File descriptor in bad state (EBADFD): The requested file descriptor does not refer to an open file or it refers to a file descriptor that is restricted to another purpose. (For example, a read request is made to a file descriptor that is open for writing only.) 
File exists (EEXISTS): An existing file was targeted for a command that would have overwritten it improperly. For example, there may have been a request to overwrite a file while the csh noclobber option is set, or there may have been a request to set a link to the name of an existing file. 
File locking deadlock (EDEADLOCK): Two processes deadlocked over a resource, such as a lock. This is a software programming bug. 
File name too long (ENAMETOOLONG): The referenced file name is longer than the limit specified in /usr/include/limits.h. 
File system full: The file system is full. (Error messages sometimes mean what they say.) If the message occurs during a login, the problem is likely the filesystem that includes the utmpx file (usually /var). 
File too large (EFBIG): The file size has grown past what is allowed by the protocol or filesystem in question, or exceeds the resource limit (rlimit) for file size. The resource limit can be checked by running ulimit -a in Bourne or Korn shells or limit in C shell. Check the Resource Management page for additional information on managing resource limits. 
Giving up: In the context of a SCSI command, this means that the timeout has been exceeded. This is usually due to a hardware or connection problem, but it can be caused by contention on the SCSI channel, or even a mis-match in timeout settings between the OS and the device in question. 
Hardware address trying to be our address: Either we have two systems on our network with the same IP address, or we have snooping enabled on a device on the network. 
Host is down (EHOSTDOWN): A connection attempt failed because the target system was unavailable. 
HeaderDigest=... is required, can't accept: Device returned an improperly processed HeaderDigest. Verify that storage device digest settings are compatible with the initiator. 
Host name local configuration error: sendmail wants to have a fully qualified domain name for the local host. It is good practice to include a fully qualified domain name in the hosts file entry for the local server. 
Hypertransport Sync Flood occurred on last boot: Uncorrectable ECC error caused the last reboot. For x64 systems, check the service processor's System Event Log and BIOS log to identify the culprit. 
Identifier removed (EIDRM): There is a problem accessing a file associated with messaging, semaphores or shared memory. Check the msgctl(2), semctl(2) or shmctl(2) man page for more details. 
ieN Ethernet jammed: The number of successive failed transmission attempts has exceeded the threshold. Check whether the network is saturated or check for other network problems. 
ieN no carrier: The carrier detect pin died during a packet transmission, resulting in a dropped packet. Check for loose connections and otherwise check the network. 
If pipe/FIFO, don't sleep in stream head (ESTRPIPE): There is a problem with the STEAMS connection. 
ifconfig: bad address: Check /etc/hostname.* to make sure that the entries match the hosts file. When this error occurs early in the boot process, make sure that the filesystem containing hostname.* and hosts is online at that stage of the boot process. If �files� is not the first entry in the �hosts� line of /etc/nsswitch.conf, the hostname lookup will not be possible until the interface comes online. 
ifconfig: no such interface: Make sure that the /etc/hostname.interface file exists. 
Illegal instruction: This error message means exactly what it says. This may come about because the binary is not compiled for this architecture (see �Exec format error� above), or it may come as a result of trying to run a data file as a program. If this appears during a boot, it means that the system is trying to boot from a non-boot device, that the boot information has become corrupted, or that the boot information is meant for a different architecture. 
Illegal seek (ESPIPE): There is a problem with a pipe in the statement. A workaround suggested by Sun is to redirect the output of the source command to a scratch file, then process the file. 
Initiator could not be successfully authenticated: Verify CHAP and/or RADIUS settings, as appropriate. 
Initiator is not allowed access to the given target: Verify initiator name, masking and provisioning. 
initiator name is required: The initiator name is improperly configured. 
Interrupted system call (EINTR): An signal (like an interrupt or quit) was received before the system call had completed. (If we try to resume, we may error out as a result of this condition.) 
Invalid argument (EINVAL): System cannot interpret a supplied parameter. Depending on the context, this may be an indication that the object named by the parameter is not set up properly. 
Invalid null command: This may indicate that there were two pipes in a row (�||�) in the referenced command. 
I/O error (EIO): This references a physical I/O fault. Depending on the context, it makes sense to replace the removable media, check all connections, run diagnostics on the referenced hardware or fsck the filesystem. If this error occurs during a write, we must assume that the data is corrupt. 
Is a directory (EISDIR): We tried to treat a directory like a file. 
iSCSI service or target is not currently operational: Run diagnostics on the storage device hardware; check storage device software configuration. 
Kernel read error: savecore is unable to read the kernel data structures to produce a crash dump. This may indicate a hardware problem, especially a memory problem. This problem may accompany a BAD TRAP error. 
Killed: This may happen as a result of a memory allocation attempt where either there is insufficient swap space or the stack and data segment size are in conflict. A �Killed� message may also appear when a program is sent a SIGKILL by other means, such as a kill command. 
kmem_free block already free: This is a software programming bug, probably in a device driver. 
ld.so.1 fatal: can't set protection on segment: Sun reports a case where this error occurred due to a lack of swap space. ld.so.1 complained because there was no segment on which to set protections. 
ld.so.1 fatal: open failed: No such file or directory: The linker was unable to find the shared library in question. Make sure that LD_LIBRARY_PATH is set properly. 
ld.so.1 fatal: relocation error: referenced symbol not found: The symbol referenced by the specified application was not found. This error most frequently occurs after installations or upgrades of shared libraries. ldd -d on the application will show its dependencies. Depending on the nature of the conflict, it may be resolvable by changing the LD_LIBRARY_PATH or installing an appropriate version of the shared library. 
Link has been severed (ENOLINK): The connection to a remote machine has been severed, either by the remote process dying or a network problem. 
Login incorrect: This error means that an appropriate username and password pair was not entered. This may be due to a problem with the passwd and shadow file, the naming service, or the user forgetting login credentials. 
login redirection failed: Storage device attempted to redirect initiator to an invalid destination. Verify storage device redirection settings. 
Memory Configuration Mismatch: Can be caused by damaged or unsupported DIMMs, or by running non-identical DIMMs within the same bank. 
Message too long (EMSGSIZE): A message was sent that was larger than the internal message buffer. 
Miscellaneous iSCSI initiator errors: Check the initiator. 
Missing parameters (e.g, iSCSI initiator and/or target name): Verify that the initiator and target name are properly specified. 
mount: ...already mounted... (EBUSY): Either the filesystem is mounted elsewhere, an active process has its working directory inside the mount point or the maximum number of mounts has been exceeded. 
mount: giving up on...: The remote mount request was unsuccessful for more than the threshold number of retries. Check the network connection and make sure that the NFS server is sharing the directory to the client as expected. 
mount: mount-point...does not exist: The directory specified as the mount point does not exist. 
mount: the state of /dev/dsk/... is not okay: The filesystem should either be mounted read-only or fsck-ed. 
Network dropped connection because of reset (ENETRESET): The remote host crashed or rebooted. 
Network is down (ENETDOWN): A transport connection failed due to a dead network. 
Network is unreachable (ENETUNREACH): Either there is no route to the network, or negative status information was received from intermediate network devices. 
NFS getattr failed for server...RPC: Timed out: The NFS server has failing hardware. (For a server that is slow to respond, the NFS server not responding message would appear instead.) 
nfs mount: Couldn't bind to reserved port: The NFS server has multiple network cards bound to the same subnet. 
nfs mount: mount:...Device busy: An active process has a working directory inside the mount point. 
NFS mount:...mounted OK: A backgrounded mount completed successfully. This may be an indication that the server response is poor, since otherwise the mount would have completed immediately and not required backgrounding. 
NFS read failed for server: This is a permissions problem error message. In addition to checking the permissions on the NFS server, make sure that the permissions underneath the mount are acceptable. (Mount points should have 755 permissions to avoid odd permissioning behavior on mounted filesystems.) 
nfs_server: bad getargs: The arguments are unrecognized or incorrect. This may be an indication of a network problem, or it may indicate a software configuration problem on the client. 
NFS server ... not responding: The network connection to the NFS server is either slow or broken. 
NFS server ... ok: The network connection to the NFS server has been restored. This is a followup to NFS server ... not responding. 
nfs umount: ... is busy: An active process has a working directory inside the specified NFS mount. See the Device busy error message. 
NFS write error on host ... No space left on device: If an NFS mount runs out of space, attempts to write to files on the share may corrupt or zero out those files. 
NFS write failed for server ... RPC: Timed out: The filesystem is soft mounted, and response time is inadequate. Sun recommends that writable filesystems not be soft-mounted, as it can lead to data corruption. 
No carrier-cable disconnected or hub disabled?: This error may manifest due to a physical networking problem or a configuration issue. 
No child processes (ECHILD): An application attempted to communicate with a cooperating process that does not exist. Either the child exited improperly or failed to start. 
No default media available: Drives contain no floppy or CD media to eject. 
No directory! Logging in with home=/: The home directory either does not exist or is not permissioned such that the user can use it. If home directories are automounted, it may be necessary to troubleshoot the automounter. 
no driver found for device: A driver has been disabled while the device is still attached. Depending on the type of device, cfgadm, drvconfig, devfsadm or a reconfiguration reboot (boot -r) may be required. Check the System Administration Guide: Devices and File Systems document. 
No message of desired type (ENOMSG): Something attempted to receive a message of a type that does not exist on the message queue. See the msgsnd(2) and msgrcv(2) man pages. 
No record locks available (ENOLCK): Any of several different locking subsystems, including fcntl(2), NFS lockd and mail, may yield this message when no more locks are available. 
No route to host (EHOSTUNREACH): In practice, this message is not distinguishable from Network is unreachable. 
No shell Connection closed: The shell specified for the user is either unavailable or illegal. Make sure it is listed in /etc/shells and that it exists. It may be necessary to change the passwd entry for this user to assign a valid shell. 
No space left on device (ENOSPC): The disk, tape or diskette is full. 
No such device (ENODEV): An operation was attempted on an inappropriate or nonexistent device. Make sure that it exists in /devices and /dev. The drvconfig or boot -r commands can be used to regenerate many /devices entries. 
No such device or address (ENXIO): I/O has been attempted to a device that does not exist or that exists beyond the limits of the device. Make sure that the device in question is powered up and connected properly, including the correct SCSI ID. 
No such file or directory (ENOENT): The file or path name does not exist on the system. Make sure that the relevant filesystems are mounted and that the expected files and/or directories exist. 
No such process (ESRCH): The process does not exist on the system. It may have finished prior to the attempt to reference it. 
No such user ... cron entries not created: Even though a file exists in /var/spool/cron/crontabs for this username, the username is not present in the passwd database. 
No utmpx entry: The filesystem containing the utmpx file is full. This may need to be resolved in single-user mode, since logins will not be permitted. 
Not a data message (EBADMSG): Data has come to the head of a STREAMS queue that cannot be processed. See the man pages for read(2), getmsg(2) and ioctl(2). 
Not a directory (ENOTDIR): A non-directory was specified as an argument where a directory is required. 
Not a stream device (ENOTSTR): The file descriptor used as a target for the putmsg(2) or getmsg(2) is not a STREAMS device. 
Not a UFS filesystem: The boot device is improperly defined. For x86, boot the system with the Configuration Assistant/boot CD and identify the disk from which to boot. For PROM-based systems, set the boot-device properly in the PROM environment variables. 
Not enough space (ENOMEM): Insufficient swap space available. 
Not found: The specified command could not be found. Check the spelling and the PATH. 
Not login shell: Use exit to get out of non-login shells. (The logout command can only be used from login shells.) 
Not on system console: Direct root logins are only permitted on the system console unless otherwise specified in /etc/default/login. 
Not owner (EPERM): Action attempted that can only be performed by object owner or the superuser. 
Not supported (ENOTSUP): A requested application feature is not available in the current version, though it may be expected in a future release. 
Object is remote (EREMOTE): We tried to share a resource not on the local machine. 
Operation already in progress (EALREADY): An operation was already in progress on a non-blocking object. 
Operation canceled (ECANCELED): The asynchronous operation was canceled before completion. 
Operation not applicable (ENOSYS): No system support exists for this operation. 
Operation not supported on transport endpoint (EOPNOTSUPP): Tried to accept a connection on a datagram transport endpoint. 
Operation now in progress (EINPROGRESS): Operation in progress on a non-blocking object. 
Option not supported by protocol (ENOPROTOOPT): A bad option or level was specified. 
Out of memory: System is running out of virtual memory (including swap space). See �Not enough space� as well. 
Out of stream resources (ENOSR): No STEAMS queues or no STREAMS head data structures available during a STREAMS open. 
Overlapping swap volume: Make sure that the additional swap volumes have unique names. 
Package not installed (ENOPKG): The attempted system call belongs to a package that is not installed on this system. 
Paired DIMMs Mismatch: Checksum mismatch between two DIMMs in a pair. Can be caused by damaged or non-identical DIMMs. 
Panic � boot: Could not mount filesystem: (During a Jumpstart) The Jumpstart boot process is unable to get to the install image. Make sure that the Jumpstart configurations and file shares are correct. 
Panic ... valloc'd past tmpptes: May occur if maxusers is set to an absurdly high number. It should not be set past the number of MB of RAM or 4096, whichever is smaller. 
Permission denied (EACCES): The attempted file access is forbidden due to filesystem permissions. 
Protocol family not supported (EPFNOSUPPORT): The protocol has not been implemented on this system. 
Protocol not supported (EPROTONOSUPPORT): The protocol has not been configured for this system. Check the protocols database (/etc/inet/protocols by default). 
Protocol wrong type for socket (EPROTOTYPE): Application programming error or misconfigured protocols. The requested protocol does not support the requested socket type. Make sure that the protocols database matches with the corresponding entries in /usr/include/sys/socket.h. 
quotactl: open Is a directory: A directory named �quota� can cause edquota to fail. Such directories should be renamed. 
RADIUS packet authentication failed: Re-set the RADIUS shared secret. 
Read error from network: Connection reset by peer: The remote system crashed or rebooted during an rsh or rlogin session. 
Read-only file system (EROFS): We can't change stuff on filesystems that are mounted read-only. 
received invalid login response: Storage device response was unexpected. Verify initiator authentication settings. 
Requested iSCSI version range is not supported by the target: The initiator's iSCSI version is not supported by the target storage device. Check the compatibility lists. See if firmware or driver upgrades would be sufficient. 
Requested ITN does not exist at this address: The iSCSI target name (ITN) is not accessible. Verify the initiator discovery information and storage device configuration. 
Requested ITN has been removed and no forwarding address is provided: The requested iSCSI target name is no longer accessible. Verify the initiator discovery information and storage device configuration. 
Resource temporarily unavailable (EAGAIN): fork(2) cannot create a new process due to a lack of resources. These resources may include limits on active processes (see the Resource Management page) or a lack of swap space. 
Restartable system call (ESTART): The system call has been interrupted in a restartable state. 
Result too large (ERANGE): This is a programming or data input error. The result of a calculation is not representable in the defined data type. The matherr(3M) facility may be helpful in debugging the problem. 
ROOT LOGIN ...: Someone has just logged in as root or su-ed to root. 
RPC: Program not registered: Make sure that the requested service is available. 
rx framing error: This error usually indicates a problem with the network hardware. Framing errors are types of CRC errors, which are usually caused by physical media problems. 
SCSI bus DATA IN phase parity error: This is a problem related to SCSI hardware or connections. It may have to do with hardware that is not qualified for attachment to Sun servers, connections with cables that are flaky or too long (total length more than 6 meters), bad terminators or flaky power supplies. See the SCSI transport failed: reason 'reset' message as well. 
SCSI transport failed: reason 'reset': The system sent data that was never received due to a SCSI reset. This may occur due to conflicting SCSI IDs, hardware that is not qualified for attachment to Sun servers, connections with cables that are flaky or too long (total length more than 6 meters), bad terminators or flaky power supplies. These issues have also been observed on systems where the highest capacity DIMMs are not in the lowest numbered slots. Disk arrays wth read-ahead caches can sometimes also cause this problem; turn off the caching to see if the problem goes away. Non-obvious SCSI ID conflicts may be diagnosed using the PROM monitor probe-scsi-all command. (See OBP Command Line Diagnostics for more details.) These errors may also happen when the SCSI device and the server are set to different SCSI timeout thresholds. 
Segmentation Fault: These can be produced as a result of programming errors or improperly set rlimit resource settings. (See Resource Management for how to check and adjust resource settings.) Segmentation faults are an indication that the program has attempted to access an area of memory that is protected or does not exist. Programming causes for segmentation faults include dereferencing a null pointer and indexing past the bounds of an array. 
setmnt: Cannot open /etc/mnttab for writing: The system is unable to write to /etc/mnttab. This may be caused by the /etc directory being mounted read-only (which can happen during certain types of boot problems). 
share_nfs: /home: Operation not applicable: A local filesystem is mounted on /home, which is usually reserved for use by the automounter. 
skipping LIST command � no active base: A LIST command is present without an associated BASE command. (cachefspack) 
Socket type not supported (ESOCKTNOSUPPORT): The socket type's support has not been configured for this system. 
Soft error rate ... during writing was too high: The number of soft errors on a tape device have exceeded the threshold. It may be due to a dirty head, bad media or a faulty tape drive. 
Software caused connection abort (ECONNABORTED): The connection was aborted within the local host machine. 
Stale NFS file handle (ESTALE): The file or directory on the NFS server is no longer available. It may have been removed or replaced. A remount may be needed to force a renegotiation of file handles. 
statd: cannot talk to statd: statd has left remnants in the /var/statmon/sm and /var/statmon/sm.bak directories. Files named after inactive hosts should be removed, and statd and lockd should be restarted. 
su: No shell: The default shell for root is improper. It may have been set to a nonexistent program or an illegal shell. This problem has been known to occur when an extra space is appended to the �root� line of the passwd file. The passwd file will need to be repaired while booted from CDROM or network. 
syncing file system: The kernel is updating the superblocks before taking the system down or in the wake of a panic. 
System booting after fatal error FATAL: This can be caused by UPA address parity errors, Master queue overflows or DTAG parity errors. This is going to be due to a bad CPU or possibly a bad system board. 
tar: ...: No such file or directory: The specified target (which defaults to TAPE) is not available. This may be due to a hardware problem with the tape drive or connections, or to a misspecified target. 
tar: directory checksum error: The checksum of the files read from tape do not match the checksum in the header block. This may be due to an incorrectly specified block size or a bad piece of tape media. 
tar: tape write error: A physical write error has occurred on the tar target. 
Target hardware or software error: Run diagnostics on the storage device hardware; check storage device software configuration. 
Target has insufficient session, connection or other resources: Check storage device settings. Check with storage device vendor to see if resource settings can be increased or capacity can be otherwise increased. 
target protocol group tag mismatch: Initiator and target had a Target Portal Group Tag (TPGT) mismatch. Verify TPGT discovery settings on initiator and storage device. 
Text file busy (ETXTBSY): An attempt was made to execute a file that was open for writing. 
The SCSI bus is hung: The likely cause is a conflict in SCSI target numbers. See the SCSI transport failed: reason 'reset' message as well. 
Timeout waiting for ARP/RARP packet: Indicates a network connection problem while booting from the network. This problem can sometimes be observed on subnets containing multiple servers willing to answer a RARP request, which can result in a server without a bootparams file receiving a request. (We have had good luck moving Jumpstart targets to an isolated subnet for initial installations.) 
Timer expired: The timer for a STREAMS ioctl has expired. The cause is device specific, and may be related to a flaky hardware, driver failure or an inappropriately short timeout threshold. 
Too many links (EMLINK): A file has too many hard links associated with it. Use soft links instead. 
Too many open files (EMFILE): A process has exceeded the limit on the number of open files per process. (See the Resource Management page for methods to monitor and manage these limits.) 
Transport endpoint is already connected (EISCONN): Connection request made on an already connected transport endpoint. 
Transport endpoint is not connected (ENOTCONN): The endpoint is not connected and/or an address was not specified. 
Trap 3E: These are caused by a bad boot disk superblock. This may have been caused by a failing disk, faulty disk connections, software misconfiguration or duplicate SCSI addresses. Check the possible hardware and SCSI configuration issues before attempting to recover the superblock using the methods listed under BAD SUPER BLOCK above. 
Too Many Arguments: This is a variant of the C shell's Arguments too long message, except that this time the problem may be the number rather than the length of arguments. 
unable to connect to target: Initiator unable to establish a network connection. This message typically accompanied by an error number from /usr/include/sys/errno.h. 
unable to get shared objects: The executable may be corrupt or in an unrecognized format. 
unable to initialize authentication: Verify that initiator authentication settings are properly configured. 
unable to make login pdu: Initiator could not make a login Payload Data Unit (PDU) based on the initiator and storage device settings. Reset target login parameters and other settings as required. 
unable to schedule enumeration: Initiator unable to enumerate the LUNs on the target. LUN enumeration can be forced via the devfsadm -i iscsi command. 
unable to set [authentication|ipsec|password|remote authentication|username]: Verify that initiator authentication settings are properly configured. 
uname: error writing name when booting: /etc/nodename must contain exactly one line with the name of the system and no blanks or returns. 
Unknown service: Either the service is not listed in the services database (/etc/services by default), or the permissions for the services database are set so that the user cannot read it. 
Value too large for defined data type (EOVERFLOW): Argument improperly formatted for the structure allocated to it. 
WARNING: /tmp: File system full, swap space limit exceeded: Virtual memory has filled up. A reboot is recommended after we have figured out which process is hogging all the memory and/or swap, since the system may be in an unstable state. 
WARNING: TOD clock not initialized: It is likely that the system clock's battery is dead. 
Watchdog Reset: This usually indicates a hardware problem. (See the Watchdog Resets page for a complete discussion.) 
Window Underflow: These errors sometimes accompany a trap, especially at boot time. Some program attempted access of a register window that was not accessible from that processor. These errors may occur when differently sized DIMMs are improperly used together, or when cache memory has gone bad. If mismatched memory is not the problem, the CPU or system board will need to be replaced. 
wrong magic number: See �Corrupt label� above. 
you are not authorized to use: A configuration file (eg at.deny or cron.deny) forbids access to this service. 


>>>> PART 3 <<<<:
=================


Booting problems poses serious challenge to the system administrators as system is down and no one can use it . This article tries to cover some of the general booting problems and their possible solutions to enable understand the problem cause and bring the system up very quickly.

Following are some of the booting issues ,error messages their meaning and possible solutions

1) Booting in single user mode and mounting root disk . 
2) Making boot device alias 
3) "Timeout waiting for ARP/RARP packet"?  error message. 
4) "The file just loaded does not appear to be executable" error message. 
5) "bootblk: can't find the boot program" error message. 
6) "boot: cannot open kernel/unix" error message . 
7) "Error reading ELF header"? error message . 
8) "Cannot open '/etc/path_to_inst'" error message. 
9) "Can't stat /dev/rdsk/c0t3d0s0" error message . 
10) Next Steps 
  
1.Booting in single user mode and mounting root hard disk. 
Most important step in  diagnosing the booting problems is booting the system in single user mode and examining the hard disk for possible errors & work out the corrective measure. Single user mode can be achieved by any of the following methods :- 
ok> boot -s           ;from root disk 
ok> boot net -s       ;from network 
  
ok>boot cdrom -s      ;from cdrom 
Rebooting with command: cdrom -s  
Configuring the /devices directory 
Configuring the /dev directory | 
INIT: SINGLE USER MODE 
# 
# fsck /dev/rdsk/c0t3d0s0 
# mount /dev/dsk/c0t3d0s0 /mnt 
  
Perform the required operation on mounted disk , now accessible through /mnt ,& unmount the hard disk after you are done ; 
# umount /mnt 
# reboot   
 

2.Making boot device alias 
In case system can not boot from primary disk  and it is needed to make another boot disk to access the data , nvalias command is used . 
nvalias command makes the device alias  and assigns an alternate name to a physical disk. Physical address of target disk is required  which can be had by show-disk command on ok>. 
  
ok> nvalias disk7 /iommu@f,e0000000/sbus@f,e0001000/dma@3,81000/esp@3,80000/sd2,0 
The new aliased disk can be named as boot disk or can be used for booting by refering its name . 
ok> setenv boot-device disk7 
ok>reset 
or 
ok> boot disk7 
  
3."Timeout waiting for ARP/RARP packet"? 
At ok> type printenv and look for these parameters . 
  boot-device           disk
  mfg-switch?           false
  diag-switch?          false 
if you see "boot-device net " or true value for the other two parameter change it to the values above. 
In case you wants to boot from network make sure your client is properly configured in boot server and network connections & configuration are proper. 
  
4."The file just loaded does not appear to be executable." 
Boot block on the hard disk is corrupted .Boot the system in single user mode with cdrom and reinstall boot block . 
#installboot /usr/platform/`uname -i`/lib/fs/ufs/bootblk /dev/rdsk/c0t3d0s0

5."bootblk: can't find the boot program" 
boot block can not find the boot programe - ufsboot in Solaris .Either ufsboot is missing or corrupted . In such cases it can be restored from the cdrom after booting from cdrom & mounting the hard disk  
# cp /platform/`uname -i`/ufsboot /mnt/platform/`uname -i` 
  
6."boot: cannot open kernel/unix" 
Kernel directory or unix kernel file in this directory is not found .Probably deleted during fsck or deleted by mistake .Copy it from the cdrom or restore from the backup tape. 
# cp /platform/`uname -i`/kernel/unix /mnt/platform/`uname -i`/kernel 
  
7."Error reading ELF header."? 
Kernel directory or unix kernel file in this directory is corrupted.Copy it from the cdrom or restore from the backup tape. 
# cp /platform/`uname -i`/kernel/unix /mnt/platform/`uname -i`/kernel 
  
8."Cannot open '/etc/path_to_inst'" 
System can not find the /etc/path_to_install file .It might be missing or corrupted and needs to be rebuild. 
To rebuild this file  boot the system with  -ar option : 
ok>boot -ar 
Press enter to select default values for the questions  asked during booting and select yes to rebuild /etc/path_to_install 
The /etc/path_to_inst on your system does not exist or is empty. Do you want to rebuild this file [n]? y 
system will continue booting after rebuilding the file. 
  
9."Can't stat /dev/rdsk/c0t3d0s0" 
When booted from cdrom  and done fsck the root partition comes out to be fine but on booting from root disk this error occurs. The device name for / is missing from /dev/dsk directory and to resolve the issue /dev & /devices directories has to be restored from root backup tapes . 
 

>>>> PART 4 <<<<
================


-- Numbers and Symbols


***** FILE SYSTEM WAS MODIFIED *****
Cause
This comment from the fsck(1M) command tells you that it changed the file system it was checking.

Action
If fsck(1M) was checking the root file system, reboot the system immediately to avoid corrupting the / partition. If fsck(1M) was checking a mounted file system, unmount that file system and run fsck(1M) again, so that work done by fsck(1M) is not undone when in-memory file tables are written out to disk.

** Phase 1-- Check Blocks and Sizes
Cause
The fsck(1M) command is checking the file system shown in the messages that are displayed before this one. The first phase checks the inode list, finds bad or duplicate blocks, and verifies the inode size and format.

Action
If more than a dozen errors occur during this important phase, you might want to restore the file system from backup tapes. Otherwise, it is fine to proceed with fsck(1M).

See Also
For more information, see the chapter on checking file system integrity in the System Administration Guide, Volume 1.

** Phase 1b-- Rescan For More DUPS
Cause
The fsck(1M) command detected duplicate blocks while checking a file system, so fsck(1M) is rescanning the file system to find the inode that originally claimed that block.

Action
If fsck(1M) executes this optional phase, you will see additional DUP/BAD messages in phases 2 and 4.

See Also
For more information, see the chapter on checking file system integrity in the System Administration Guide, Volume 1.

** Phase 2-- Check Pathnames
Cause
The fsck(1M) command is checking a file system, and fsck(1M) is now removing directory entries pointing to bad inodes that were discovered in phases 1 and 1b. This phase might ask you to remove files, salvage directories, fix inodes, reallocate blocks, and so on.

Action
If more than a dozen errors occur during this important phase, you might want to restore the file system from backup tapes. Otherwise it is fine to proceed with fsck(1M).

See Also
For more information, see the chapter on checking file system integrity in the System Administration Guide, Volume 1.

** Phase 3-- Check Connectivity
Cause
The fsck(1M) command is checking a file system, and fsck(1M) is now verifying the integrity of directories. You might be asked to adjust, create, expand, reallocate, or reconnect directories.

Action
You can usually answer "yes" to all these questions without harming the file system.

See Also
For more information, see the chapter on checking file system integrity in the System Administration Guide, Volume 1.

** Phase 4-- Check Reference Counts
Cause
The fsck(1M) command is checking a file system, and fsck(1M) is now checking link count information obtained in phases 2 and 3. You might be asked to clear or adjust link counts.

Action
You can usually answer "yes" to all these questions without harming the file system.

See Also
For more information, see the chapter on checking file system integrity in the System Administration Guide, Volume 1.

** Phase 5-- Check Cyl groups
Cause
The fsck(1M) command is checking a file system, and fsck(1M) is now checking the free-block and used-inode maps. You might be asked to salvage free blocks or summary information.

Action
You can usually answer "yes" to all these questions without harming the file system.

See Also
For more information, see the chapter on checking file system integrity in the System Administration Guide, Volume 1.

@@ 
Cause
This message is about how to fix the common @@token sendmail errors. There are instances when you receive email bounce messages because of syntax errors complaining that it does not know how to send email to @@token. Probably a site is NOT running NIS and is generating these errors or is talking to another site that is generating the errors and then passing the email on to your site. This happens because a single token is changed into a null ("") token. As a result, ruleset 3 (S3) changes null tokens into @@token. There are two key issues here. First, you do not want to be the host responsible for generating these errors, and, second, you do not want to pass along any errors that were generated by other hosts.

Action
To fix this problem, modify rules S3 and S22. (You'll only have S22, if using main.cf.) First, so you do not cause these errors, comment out the invert aliases rule in S22: 

S22
R$*<@LOCAL>$*      $:$1
#R$-<@$->          $:$>3${Z$1@$2$}   invert aliases
R$*<@$+.$*>$*      $@$1<@$2.$3>$4    already ok
R$+<@$+>$*         $@$1<@$2.$m>$3    tack on our domain
R$+                $@$1<@$w.$m>      tack on our full name 
 

Next, so you do not pass on errors caused by other hosts, modify ruleset S3 from:


S3
# handle "from:<>"   special case
R$*<>$*		$@@        turn into magic token
 

To: 


S3
# handle "from:<>"   special case
R$*<>$*		$@$n       turn into magic token
 

29a00 illegal instruction
Cause
When trying to boot a client from a boot/jumpstart server to install or upgrade a workstation, it fails with the following message: 

boot net - install
Rebooting with command: net - install
Boot device: /iommu/sbus/ledma@f, 400010/le@f, 8c0000 File and args: -
install
29a00  Illegal Instruction
(0) ok 


Action
The problem lies in the /tftpboot directory of the boot server. Confirm that the HOSTID and HOSTID.ARCH files are linked to the correct inetboot.* file for your architecture. The following is an example of how a symbolic link should look: 

# cd /tftpboot
# ls -l 81971904*
81971904 -> inetboot.sun4m.Solaris_2.4
81971904.SUN4M -> inetboot.sun4m.Solaris_2.4 
If the entries are not correct, remove the entry for the particular client in this directory, using rm_install_client or rm_client commands, and re-add the client with the add_install_client(1M) or add_client command or through Solstice giving the correct architecture.

451 timeout waiting for input during source 
Cause
When sendmail(1M) reads from anything that might time out, such as an SMTP connection, it sets a timer to the value of the r processing option before reading begins. If the read does not complete before the timer expires, this message appears and reading stops. (Usually this happens during RCPT.) The mail message is then queued for later delivery.

Action
If you see this message often, increase the value of the r processing option in the /etc/mail/sendmail.cf file. If the timer is already set to a large number, look for hardware problems, such as poor network cabling or connections.

See Also
For more information about setting the timer, see the section describing the sendmail(1M) configuration options in the System Administration Guide, Volume 3. If you are using AnswerBook online documentation, the term "timeouts" is a good search string.

501 MAIL FROM: unrecognized address: @@hostname 
Cause
A Sun machine running Sendmail 8.6 is used as a mailhost to send mail to the Internet in an environment that has MS Mailexchanger or a cc:Mail gateway. Mail from the MS exchange/cc:Mail gateway for the Internet is relayed to the mailhost, which actually delivers the mail. The mail from the Internet is accepted on the mailhost and forwarded to the MS exchanger/cc:mail gateway. The postmaster on the mailhost sees bounced messages with error messages, such as the following: 

The original message was received at Thu, 29 May 1997 12:30:41 -0700 
from artemis [206.189.46.3]
     
   ----- The following addresses had delivery problems -----
<Joe_Smith@cc.test.com>  (unrecoverable error)
     
   ----- Transcript of session follows -----
... while talking to cc:
>>> MAIL From:<hermes>
>>> 501 MAIL FROM: unrecognized address: <hermes> 
554 <Joe_Smith@cc.test.com> Remote protocol error 
When analyzed, this mail turns out to be mail that has bounced from the Internet (for any reason) and was on its way back to the MS Exchange/cc:Mail gateway by the mailhost. The MS Exchange/cc:Mail gateway does not want to accept the mail because the "MAIL FROM:" address does not stick to the standards. @@hostname is an illegal SMTP address. Sendmail does not have a restriction on sender's address; however, other SMTP gateways, which need to translate the address to their native address formats, are rather strict in adhering to the SMTP address format and would not accept the address in the @@hostname format. 

Another situation: The user with cc:Mail sends mail to the Internet, and, due to one of many possible errors (user not found, host not found, and so forth), the message is sent back to the sender (bounces back). When a message is sent back, its recipient`s address is replaced by the sender's address and the sender's address is erased (contains only "<>"). When the bounced sender's address goes through ruleset 3 and then 11 on the user's mail gateway (as it has to return it to the cc:Mail gateway, which is in the local domain => mailer=ether), it is transformed to @@mail-gateway-name.

Action
Insert the following line in the S11 ruleset after the line starting with R$=D&: 

R@       $@mailer_daemon<@$w>         for @@hostname problem
 
After the insertion, S11 looks like this: 

S11
R$*<@$+>$*     $1<@$2>$3                    already ok
R$=D           $@$1<@$w>                    tack on my hostname
R@             $@mailer_daemon<@$w>         for @@hostname problem
R$+            $@$1<@$m>                    tack on my mbox hostname
 

550 hostname... Host unknown
Cause
This sendmail(1M) message indicates that the destination host machine, specified by the portion of the address after the at-sign (@), was not found during domain naming system (DNS) lookup.

Action
Use the nslookup(1M) command to verify that the destination host exists in that or other domains, perhaps with a slightly different spelling. Failing that, contact the intended recipient and ask for a proper address.

Sometimes this return message indicates that the intended host is inoperable, rather than unknown. If a DNS record contains an unknown alternate host, and the primary host is inoperable, sendmail(1M) returns a "Host unknown" message from the alternate host. [This is a known sendmail(1M) version 8.6.7 bug.] 

For uucp(1C) mail addresses, the "Host unknown" message probably means that the destination host name is not listed in the /etc/uucp/Systems file.

See Also
For information on how sendmail(1M) works, see the System Administration Guide, Volume 3 

550 Security server failed to perform requested command
Cause
While using the 3.x FW-1 FTP Security Server, the user sees the following error message when trying to use FTP get or put commands: 

550 Security server failed to perform requested command 


Action
FW-1's FTP Security Server sends a pwd command prior to any data connection command (such as get, put, ls), since it needs to know the current directory for purposes such as logging, virus inspection, and resources. FW-1 assumes that these commands are blocked whenever the pwd command is blocked. Therefore, do not disable pwd on your FTP server.

550 username... User unknown
Cause
This sendmail(1M) message indicates that the intended recipient, specified by the portion of the address before the at-sign (@), could not be located on the destination host machine.

Action
Check the email address and try again, perhaps with a slightly different spelling. If this does not work, contact the intended recipient and ask for a proper address.

See Also
For information on how sendmail(1M) works, see the System Administration Guide, Volume 3.

554 hostname... Local configuration error
Cause
This sendmail(1M) message usually indicates that the local host is trying to send mail to itself.

Action
Check the value of the $j macro in the /etc/mail/sendmail.cf file to ensure that this value is a fully qualified domain name.

Technical Notes
When the sending system provides its host name to the receiving system (in the SMTP HELO command), the receiving system compares its name to the sender's name. If these are the same, the receiving system issues this error message and closes the connection. The name provided in the HELO command is the value of the $j macro.

See Also
For information on how sendmail(1M) works, see the System Administration Guide, Volume 3.


"A"
A command window has exited because its child exited.
Cause
The argument to a cmdtool(1) or a shelltool(1) window looks like it is supposed to be a command, but the system cannot find the command.

Action
To run this command inside a cmdtool(1) or a shelltool(1), make sure the command is spelled correctly and is in your search path. If necessary, use a full path name. If you intended this argument as an option setting, use a minus sign (-) at the beginning of the option.

Technical Notes
Both the cmdtool(1) and the shelltool(1) are OpenWindows terminal emulators.

access violation unknown host IP address
Cause
Solstice backup utility fails and displays the following error: access violation unknown host IP address on Networker 4.2.2. This error is usually caused by a corrupted host name in the host NIS/NIS+ map/table.

Action
Check the Networker client configuration for an incorrect host name. If all else fails, as a workaround, add the entry to /etc/hosts.

Accessing a corrupted shared library
Cause
The system is trying to exec(2) an a.out that requires that it be linked in a static shared library, and exec(2) could not load the static shared library. The static shared library is probably corrupted.

Technical Notes
The symbolic name for this error is ELIBBAD, errno=84.

Address already in use
Cause
The user attempted to use an address already in use, and the protocol does not allow this.

Technical Notes
The symbolic name for this error is EADDRINUSE, errno=125.

Address family not supported by protocol family
Cause
An address incompatible with the requested protocol was used.

Technical Notes
The symbolic name for this error is EAFNOSUPPORT, errno=124.

admintool: Received communication service error 4
Cause
AdminTool could not start a display method, because a remote procedure, which had been called, timed out; therefore, it could not send the request. You receive this error when admintool(1M) tries to access the NIS or NIS+ tables and networking is not enabled.

Action
Verify the system network status with ifconfig -a to make sure the system is connected to the network. Make sure the Ethernet cable is connected and the system is configured to run NIS or NIS+.

Advertise error
Cause
This error is RFS specific. It occurs when users try to advertise a resource already advertised, try to stop RFS while there are resources still advertised, or try to forceably unmount a resource that is still advertised.

Technical Notes
The symbolic name for this error is EADV, errno=68.

answerbook: XView error: NULL pointer passed to xv_set
Cause
The AnswerBook navigator window comes up, but the document viewer window does not. This message appears on the console, and the message Could not start new viewer appears in the navigator window. This situation indicates that you have an unknown client or a problem with the network naming service.

Action
Run the ypmatch(1) or nismatch(1) command to determine if the client host name is in the host's map. If not, add it to the NIS hosts map on the NIS master server. Then, make sure the /etc/hosts file on the client contains an IP address and entry for that host name, which is followed by loghost. 


--------------------------------------------------------------------------------
Note - 
Reboot, if you changed the /etc/hosts file.


--------------------------------------------------------------------------------

Check that the ypmatch(1) or nismatch(1) client hosts command returns the same IP host address as in the /etc/hosts file. Finally, quit all existing AnswerBooks and restart.

See Also
For more information on the NIS hosts map, see the section on the default search criteria in the NIS+ and FNS Administration Guide. If you are using AnswerBook online documentation, "NIS hosts map" is a good search string.

apdb: Resource temporarily unavailable
Cause
This error can occur when attempting to add or remove AP databases with the apdb command.

Action
From /var/adm/messages you find the reason for the apdb command failure, as shown below: 

Jan 15 14:00:51 Starfire2 apd[683]: /etc/system: could not find:                                          																			    																																						* End AP database info (do not edit)
Jan 15 14:00:52 Starfire2 apd[683]: failed to patch the system file! 
Unfortunately, this error from the netcon session does not get an echo to the console; therefore, it can easily be missed. To correct it, simply edit the /etc/system file so that it has the correct comments before and after setting ap:apdb_dblist. See below: 

* Begin AP database info (do not edit) 
set ap:apdb_dblist="sd:5 sd:8" 
* End AP database info (do not edit) 


Arg list too long
Cause
The system could not handle the number of arguments given to a command or program when it combined those arguments with the environment's exported shell variables. The argument list limit is the size of the argument list plus the size of the environment's exported shell variables.

Action
The easiest solution is to reduce the size of the parent process environment by unsetting extraneous environment variables. (See the man page for the shell you are using to find out how to list and change your environment variables.) Then run the program again.

Technical Notes
An argument list longer than ARG_MAX bytes was presented to a member of the exec(2) family of system calls.

The symbolic name for this error is E2BIG, errno=7.

Argument out of domain
Cause
This message is a programming error or a data input error.

Action
Ask the program's author to fix this condition or to supply data in a different format.

Technical Notes
This indicates an attempt to evaluate a mathematical programming function at a point where its value is not defined. The argument of a programming function in the math package is out of the domain of the function. This could happen when taking the square root, power, or log of a negative number, when computing a power to a non-integer, or when passing an out-of-range argument to a hyperbolic programming function.

To help pinpoint a program's math errors, use the matherr(3M) facility.

The symbolic name for this error is EDOM, errno=33.

Arguments too long
Cause
This C shell error message indicates that too many arguments follow a command. For example, this can happen by invoking rm * in a huge directory. The C shell cannot handle more than 1706 arguments.

Action
Temporarily start a Bourne shell with sh(1) and run the command again. The Bourne shell dynamically allocates command line arguments. Return to your original shell by typing exit.

assertion failed: string, file name, line int 
Cause
An unexpected condition in the program has occurred.

Action
Contact the vendor or author of the program to ask why it failed. If you have the source code for the program, you can look at the file and line number where the assertion failed. This might give you an idea of how to run the program differently.

Technical Notes
This message is the result of a diagnostic macro called assert(3C) that a programmer inserted into the specified line of a source file. The untrue expression precedes the file name and line number.

Attempting to link in more shared libraries than system limit
Cause
The system is trying to exec(2) an a.out that requires more static shared libraries than is allowed on the current configuration of the system.

Technical Notes
The symbolic name for this error is ELIBMAX, errno=86.

automount[int]: name: Not a directory
Cause
The file specified after the first colon is not a valid mount point, because it is not a directory.

Action
Ensure that the mount point is a directory and not a regular file or a symbolic link.

automountd[int]: server hostname responding
Cause
This automounter message indicates that the system tried to mount a file system from an NFSTM server that is either down or extremely slow to respond. In some cases, this message indicates that the network link to the NFS server is broken, although that condition produces other error messages as well.

Action
If you are the system administrator responsible for the non-responding NFS server, check to see whether the machine needs repair or rebooting. Encourage your user community to report such problems quickly, but only once. When the NFS server is back in operation, the automounter can access the requested file system.

See Also
For more information on NFS failures, see the section on NFS troubleshooting in the System Administration Guide, Volume 3. If you are using AnswerBook online documentation, a good search string is "NFS Service."


"B"
Bad address
Cause
The system encountered a hardware fault in attempting to access a parameter of a programming function.

Action
Check the address to see if it resulted from supplying the wrong device or option to a command. If that is not the problem, contact the vendor or author of the program for an update.

Technical Notes
This error could occur any time a function that takes a pointer argument is passed an invalid address. Because processors differ in their ability to detect bad addresses, on some architectures, passing bad addresses can result in undefined behaviors.

The symbolic name for this error is EFAULT, errno=14.

BAD/DUP FILE I=i OWNER=o MODE=m SIZE=s MTIME=t CLEAR? 
Cause
While checking inode link counts during phase 4, fsck(1M) found a file (or directory) that either does not exist or exists somewhere else.

Action
To clear the inode of its reference to this file or directory, answer "yes." With the -p (preen) option, fsck(1M) automatically clears bad or duplicate file references. Answering "yes" to this question seldom causes a problem.

Bad file number
Cause
Generally this message is a program error, not a usage error.

Action
Contact the vendor or author of the program for an update.

Technical Notes
Either a file descriptor refers to no open file, or a read(2)--or a write(2)--request is made to a file that is open only for writing or reading.

The symbolic name for this error is EBADF, errno=9.

block no. BAD I=inode no. 
Cause
Upon detecting an out-of-range block, fsck(1M) prints the bad block number and its containing inode (after I=).

Action
In fsck(1M) phases 2 and 4, you decide whether or not to clear these bad blocks. Before committing to repair with fsck(1M), you could determine which file contains this inode by passing the inode number to the ncheck(1M) command: 

# ncheck -i inum filesystem
 

See Also
For more information, see the chapter on checking file system integrity in the System Administration Guide, Volume 1.

BAD_MESSAGE (error code 100) from X.400
Cause
In this situation, X.400 software had been working without problems. Suddenly, the message exchanges failed in ma_start_delivery(). It was returning an error code of 100 (BAD_MESSAGE).

The ma_start_delivery() call fails when trying to exchange a file of more than 900 bytes.

Action
X.400 was restarted with the wrong umask. To fix, set the umask to 0022 and restart the software.

bad module/chip at: position 
Cause
This message from the memory management system often appears with parity errors and indicates a bad memory module or chip at the position listed. Data loss is possible, if the problem occurs other than at boot time.

Action
Replace the memory module or chip at the indicated position. Refer to the vendor's hardware manual for help finding this location.

Bad request descriptor
Cause
This message is apparently only used in NIS+ to indicate corrupted or missing tables.

Technical Notes
The symbolic name for this error is EBADR, errno=51.

BAD SUPER BLOCK: string 
Cause
This message from fsck(1M) indicates that a file system's super block is damaged beyond repair and must be replaced. At boot time (with the -p option) this message is prefaced by the file system's device name. After this message comes the actual damage recognized (see Action). Unfortunately, fsck(1M) does not print the number of the damaged super block.

Action
The most common cause of this error is overlapping disk partitions. Do not immediately rerun fsck(1M) as suggested by the lines that display after the error message. First, make sure that you have a recent backup of the file system involved; if not, try to back up the file system now using ufsdump(1M). Then, run the format(1M) command, select the disk involved, and print out the partition information. 

# format
: N
> partition
> print 
Note whether the overlap occurs at the beginning or end of the file system involved. Then, run newfs(1M) with the -N option to print out the file system parameters, including the location of backup super blocks. 

# newfs -N /dev/dsk/device
 
Select a super block from a non-overlapping area of the disk, but note that in most cases you have only one chance to select the proper replacement super block, which fsck(1M) soon propagates to all the cylinders. If you select the wrong replacement super block, data corruption will probably occur, and you will have to restore from backup tapes. After you select a new super block, provide fsck(1M) with the new master super block number: 

# fsck -o b=NNNN /dev/dsk/device
 

Technical Notes
Specific reasons for a damaged super block include: a wrong magic number, an out-of-range number of cylinder groups (NCG) or cylinders per group (CPG), the wrong number of cylinders, a preposterously large super block size, and trashed values in super block. These reasons are generally not meaningful, because a corrupt super block is usually extremely corrupt.

See Also
For more information on bad super blocks, see the sections on restoring bad super blocks in the System Administration Guide, Volume 1. If you are using AnswerBook online documentation, "super block" is a good search string.

BAD TRAP
Cause
A bad trap can indicate faulty hardware or a mismatch between hardware and its configuration information. Data loss is possible if the problem occurs other than at boot time.

Action
If you recently installed new hardware, verify that the software was correctly configured. Check the kernel traceback displayed on the console to see which device generated the trap. If the configuration files are correct, you probably have to replace the device.

In some cases, the bad trap message indicates a bad or down-rev CPU.

Technical Notes
A hardware processor trap occurred, and the kernel trap handler was unable to restore the system state. This message is a fatal error that usually precedes a panic, after which the system performs a sync, dump, and reboot. The following conditions can cause a bad trap: a system text or data access fault, a system data alignment error, or certain kinds of user software traps.

/bin/sh: file: too big
Cause
This Bourne shell message indicates a classic "no memory" error. While trying to load the program specified after the first colon, the shell noticed that the system ran out of virtual memory (swap space).

Action
For information on reconfiguring your system to add more swap space, refer to "Not enough space".

Block device required
Cause
A raw (character special) device was specified where a block device was required, such as during a call to the mount(1M) command.

Action
To see which block devices are available, use ls -l to look in /devices. Then specify a block device instead of a character device. Block device modes start with a b, whereas raw character device modes start with a c.

Technical Notes
The symbolic name of this error is ENOTBLK, errno=15.

Boot device: /iommu/sbus/directory/directory/sd@3,0
Cause
This message always appears at the beginning of rebooting. If there is a problem, the system hangs, and no other messages appear. This condition is caused by conflicting SCSI targets for the boot device, which is almost always target 3.

Action
The boot device is usually the machine's internal disk drive, target 3. Make sure that external and secondary disk drives are targeted to 1, 2, or 0, and do not conflict with each other. Also make sure that the tape drives are targeted to 4 or 5, and CD drives to 6, avoiding any conflict with each other or with the disk drives. You can set a device's target number using push-button switches or a dial on the back near the SCSI cables. If the targeting of the internal disk drive is in question, check it by powering off the machine, removing all external drives, turning the power on, and running the probe-scsi-all or probe-scsi command from the PROM monitor.

Broadcast Message from root (pts/int) on server [date]
Cause
This message from the wall(1M) command is transmitted to all users logged into a system. You could see it during a rlogin(1) or telnet(1) session, or on terminals connected to a timesharing system.

Action
Carefully read the broadcast message. Often this broadcast is followed by a shutdown warning.

For details about system shutdown, refer to "The system will be shut down in int minutes". 

See Also
For more information on bringing down the system, see the section on halting the system in the System Administration Guide, Volume 1. If you are using AnswerBook online documentation, "halting the system" is a good search string.

Broken pipe
Cause
This condition is often normal, and the message is merely informational (as when piping many lines to the head(1) program). The condition occurs when a write on a pipe does not find a reading process. This usually generates a signal to the executing program, but this message displays when the program ignores the signal.

Action
Check the process at the end of the pipe to see why it exited.

Technical Notes
The symbolic name of this error is EPIPE, errno=32.

Bus Error
Cause
A process has received a signal indicating that it attempted to perform I/O to a device that is restricted or that does not exist. This message is usually accompanied by a core dump, except on read-only file systems.

Action
Use a debugger to examine the core file and determine what program fault or system problem led to the bus error. If possible, check the program's output files for data corruption that might have occurred before the bus error.

Technical Notes
Bus errors can result from either a programming error or device corruption on your system. Some common causes of bus errors are: invalid file descriptors, unreasonable I/O requests, bad memory allocation, misaligned data structures, compiler bugs, and corrupt boot blocks.


"C"
Cannot access a needed shared library
Cause
The system is trying to exec(2) an a.out that requires a static shared library, and the static shared library does not exist or the user does not have permission to use it.

Technical Notes
The symbolic name for this error is ELIBACC, errno=83.

Cannot allocate colormap entry for "string"
Cause
This message from libXt (X Intrinsics library) indicates that the system color map was full, even before the color name specified in quotes was requested. Some applications can continue after this message. Other applications, such as workspace properties color, fail to come up when the color map is full.

Action
Exit the programs that make heavy use of the color map, then restart the failed application and try again.

Cannot assign requested address
Cause
An attempt was made to create a transport endpoint with an address not on the current machine.

Technical Notes
The symbolic name for this error is EADDRNOTAVAIL, errno=126.

Cannot bind to domain domainname: can't communicate with ypbind
Cause
While running the ypinit -m script for the setup of an NIS Master Server, you get this error message.

Action
You could be using the wrong nsswitch template for /etc/nsswitch.conf. During setup, you should be using /etc/nsswitch.files as the name services switch template. After setup is complete, you would then want to use /etc/nsswitch.nis. Do the following to verify that you are using nsswitch.files: 

# head /etc/nsswitch.conf 	
#   -->	
# /etc/nsswitch.files: 
If you are not using the nsswitch.files, copy it over as shown below: 

# cp /etc/nsswitch.files /etc/nsswitch.conf 
Run the ypinit -m script, again.

Cannot boot after install, error that points to an .rc file
Cause
The user completes the installation of the Solaris 2.6 IA software. Upon reboot, the user gets an error referencing an .rc file (example: 11045.rc). This file has probably been deleted or placed in a different directory. As the Solaris software looks for this file during the bootup sequence and cannot find it, the system hangs, because it cannot complete the boot process.

Action
During the installation process, there is an option to save the configuration assistant choices to a file. The error is pointing to the saved configuration file. The user was never supposed to have the option to save these choices to a file. Users should exit the setup after making their choices. If the users do save these choices to a file and if this file gets deleted or moved, the system hangs during the boot process. To solve this problem, the user boots in single user mode. From the # prompt, the user should do the following: 

cd /platform/i86pc/boot/solaris/machines 

Delete all files in this directory.

Reboot the system.

This corrects the problem and allows the Solaris software to complete loading.

cannot change passwd, not correct passwd
Cause
While running yppasswd(1) and trying to change a user's password, the system responded with this message: cannot change passwd, not correct passwd. 

Also, the user was getting yppasswd user string does not exist on the server console, but by running ypcat passwd | grep user it returns the user name. It was verified that yppasswdd(1M) was running.

Action
Check the passwd(4) file with pwck(1M) and verify that yppasswdd(1M) is running on the right server. Then verify where the passwd(4) file is located and, if changed, check that yppasswdd(1M) has the location in the process line. The password located in /etc/yp should read /usr/lib/yp/rpc.yppasswdd -D /etc/yp. The -D option with the passwd files directory location tells yppasswdd(1M) where to update and verify password changes.

cannot establish nfs service over /dev/tcp: transport setup problem
Cause
During boot strap of a SunOS 2.5.2 system, nfsd(1M) displays the following: 

netdir_getbyname (transport tcp, host/serv \1/nfs), No such file or directory
Cannot establish NFS service over /dev/tcp: transport setup problem. 
The problem: The NIS maps have been populated from older systems, and the nfs/tcp entry of the services map is missing. (The user is running NIS+, but this problem can also occur with NIS.) 

Action
Either put a files entry before the nis or nisplus in the services line of the /etc/nsswitch.conf file, or, better, merge the changes to the services file into the services map. 

It is a good idea to always merge in the new entries to /etc/services, /etc/inet/protocols, and /etc/rpc into their respective maps whenever a new OS is installed.

Cannot exec a shared library directly
Cause
The system is attempting to exec(2) a shared library, directly.

Technical Notes
The symbolic name for this error is ELIBEXEC, errno=87.

Cannot find SERVER hostname in network database
Cause
A brief description: the user is on a different subnet and is running permanent licenses: 

ultra1(50)% cc -o hello hello.c
License Error : Cannot find the license server (fry)
in the network database for product(Sun WorkShop Compiler C)
Cannot find SERVER hostname in network database (-14,7)
cc: acomp failed for hello.c
ultra1(51)% 


Action
Check the following:

Make sure that the server is up and running.

Make sure that the server is in the /etc/hosts file of the client system by typing: ping servername.

Make sure the license daemon on the server is running.

Make sure there is an elementary license file on the client: 

cd /etc/opt/licenses
more sunpro.loc 


Make sure there are only text license files, such as sunpro.lic.1 in the sunpro,loc directory.

For the client check, see below: 

 % cd /etc
 % more nsswitch.conf | grep hosts
 hosts:      nis [NOTFOUND=return] files 
This means that it is using the NIS server to look up the IP address. If it is set first for nis and the /etc/hosts file has the server listed by name, change the line to 

hosts:      files nis  
Then, see if it can be found. If not, try truss and snoop to see what is happening.

cannot install bootblock
Cause
In this case, the user installs the Solaris IA software on the Intel platform and the install seems fine. When the system is rebooted after the installation, the user receives the above error message at startup. At this point, the user cannot gain access to the system.

Action
This error occurs when you use the fdisk utility in the Solaris operating environment, do a newfs, and then do a restore, but forget to do the install for the boot block. When you do a newfs and then a restore operation, you need to perform an installboot before installing the OS. Otherwise, you get the above error. There is no guarantee, but the installboot procedure might or might not work after booting into single user mode from the CD-ROM. 

To install the UFS boot block and partition the boot program on slice 2 of target 0 on controller 1 of the platform, where the command is being run, use the following: 

# installboot /usr/platform/uname -i/lib/fs/ufs/pboot \           
/usr/platform/uname -i/lib/fs/ufs/bootblk /dev/rdsk/c1t0d0s2 


Cannot open FCC file
Cause
When trying to send mail by Netscape, this message is displayed. Netscape is trying to save the outbound message to a file that has been specified by the user, but does not exist.

Action
To correct this problem do the following: go to options Mail and News Preferences, then go to Compose. A template pops up. There is a section that specifies where to save outgoing mail and news files. Make sure that these files exist or remove them from the template, if you do not care about logging which messages are sent through Netscape.

Cannot send after transport endpoint shutdown
Cause
A request to send data was disallowed, because the transport endpoint has already been shut down.

Technical Notes
The symbolic name for this error is ESHUTDOWN, errno=143.

can't communicate with ypbind
Cause
ypcat passwd returns with the error message, can't communicate with ypbind, but ypbind is running. 

ls -l /var/yp/binding/ypbind.pid   
-r--------   1 root     root           3 Dec  1 07:40 ypbind.pid   
umask for root is set to 077.

Action
Set umask for root back to 022. /var/yp/binding/ypbind.pid must be readable by all groups. 

Refer to the following example: 

ls -l /var/yp/binding/ypbind.pid   
-r--r--r--   1 root     root           3 Dec  1 07:40 ypbind.pid 


Can't create public message device (Device busy)
Cause
This message comes from the lp(1) print scheduler, indicating that it is either extremely busy or hanging.

Action
If print jobs are coming out of the printer in question, wait until they are finished and then resubmit this print job. If you see this message again, the lp(1) system is probably hanging.

For a procedure to clear the queue, refer to "lp hang". 

Technical Notes
If lp(1) is unable to create a device for printer messages, the message FIFO could already be in use or could be locked by another print job.

See Also
For more information on the print scheduler, see the section on administrating printers in the System Administration Guide, Volume 2.

Can't invoke /etc/init, error int 
Cause
This message can appear while a system is booting, indicating that the init(1M) program is missing or corrupted. Note that /etc/init is a symbolic link to /sbin/init.

Action
Do the following: 

Boot the mini-root so you can replace init(1M). 

Halt the machine by typing Stop-A or by pressing the reset button.

Reboot as a single user from the CD-ROM, the net, or a diskette. For example, type boot cdrom -s at the ok prompt to boot from a CD-ROM.

After the system comes up and gives you a # prompt, mount the device corresponding to the original root (/) partition somewhere, with a command similar to the mount(1M) command, as shown below: 

# mount /dev/dsk/c0t3d0s0 /mnt
# cp /sbin/init /mnt/sbin/init
# reboot 


Then copy the init(1M) program from the mini-root to the original root (/) partition.

Reboot the system.

If this does not work, other files might be corrupted, and you might need to reinstall the entire system.

Technical Notes
The error number is 2 if /sbin/init is missing, or 8 if /sbin/init has an incorrect executable format. This message is usually followed by a panic: icode message. The system tries to reboot itself, but goes into a loop, because rebooting is impossible without init(1M).

See Also
For more information on booting the system, see the section on halting and booting the system in the System Administration Guide, Volume 1.

can't open /dev/rdsk/string: (null): UNEXPECTED INCONSISTENCY
Cause
In the SunOSTM 4.1.x release, this message indicated that the device containing the /dev file system has become disconnected. 

A particular response from the Solaris operating environment has not been defined.

can't synchronize with hayes
Cause
This message sometimes appears when using a modem that the system regards as a "Hayes" type modem, which includes most modems manufactured today. The message can be caused by incorrect switch settings, by poor cable connections, or by not turning the modem on.

Action
Check that the modem is on and that the cables between the modem and your system are securely connected. Check the internal and external modem switch settings. If necessary, turn the modem off and then on again.

cd: Too many arguments
Cause
The C shell's cd(1) command takes only one argument. Either more than one directory was specified, or a directory name containing a space was specified. Directory names with spaces are easy to create with File Manager.

Action
Use only one directory name. To change to a directory whose name contains spaces, enclose the directory name in double (") or single (') quotes, or use File Manager.

Channel number out of range
Cause
The system has run out of stream devices. This error results when a stream head attempts to open a minor device that does not exist or is currently in use.

Action
Check that the stream device in question exists and was created with an appropriate number of minor devices. Make sure that the hardware corresponds to this configuration. If the stream device configuration is correct, try again later when more system resources might be available.

Technical Notes
The symbolic name for this error is ECHRNG, errno=37.

chmod: ERROR: invalid mode
Cause
This message from the chmod(1) command indicates a problem in the first non-option argument.

Action
If you are specifying a numeric file mode, you can provide any number of digits (although only the final one-to-four are considered), but all digits must be between 0 and 7. If you are specifying a symbolic file mode, use the syntax provided in the chmod(1) usage message to avoid the "invalid mode" error message: Usage: chmod [ugoa][+-=][rwxlstugo] file ... 

Some combinations of symbolic key letters produce no error message, but fail to have any effect. The first group, [ugoa], is truly optional. The second group, [+-=], is mandatory for chmod(1) to have an effect. The third group, [rwxlstugo], is also mandatory for effect and can be used in combination when that combination does not conflict.

Command not found
Cause
The C shell could not find the program you gave as a command.

Action
Check the form and spelling of the command line. If that looks correct, use echo $path to see if the user's search path is correct. When communications are garbled, it is possible to unset a search path to such an extent that only built-in shell commands are available. Below is a command to reset a basic search path: 

 % set path = (/usr/bin /usr/ccs/bin /usr/openwin/bin .) 
If the search path looks correct, check the directory contents along the search path to see if programs are missing or if directories are not mounted.

See Also
For more information about the C shell, see csh(1).

Communication error on send
Cause
This error occurs when the current process is waiting for a message from a remote machine, but the link connecting the machines breaks.

Technical Notes
The symbolic name for this error is ECOMM, errno=70.

config error: mail loops back to myself.
Cause
User sees this message when sending mail: 

# dle@g3... Connecting to g3.xyz.edu. (ether)... 
220 xyz.edu Sendmail SMI-8.6/SMI-SVR4 ready at Wed, 7 Jan 1998 14:28:20 -0600 
>>> HELO xyz.edu 
250 xyz.edu Hello g1.xyz.edu [129.106.16.1], pleased to meet you 
xyz.edu config error: mail loops back to myself 
>>> QUIT 
221 g1.xyz.edu closing connection 
dle@g3... Local configuration error 
Saving message in /dead.letter 
/dead.letter... Sent 
The sending system (see line 220) and the receiving system (see the HELO line) both think they are known as "xyz.edu."

Action
Edit the sendmail.cf file as follows: 

Type the official host name.

For the domain, you have choices: If you want the gateway machine to identify itself as the domain, use Dj$m; if you want the gateway machine to appear to be inside the domain, use Dj$w.$m; and if you are using sendmail.mx (or have a fully-qualified host name), use Dj$w.

Uncomment Dj$w.$m and comment Dj$m. This gives each system a unique name. $w is the system host name, and $m is the domain.

connect from hostIP to callit(ypserv): request from non-local host
Refer to "connect from hostIP to callit(ypserv): request from unauthorized host".

connect from hostIP to callit(ypserv): request from unauthorized host
Cause
An example of a message from SunOS: 

Jan  5 14:45:37 host1 portmap[86]: connect from 158.175.36.135 to 
callit(ypserv): request from unauthorized host 
Other possiblities for the end portion of the error message include: 

request from unprivileged port 

request from non-local host 

request not forwarded 


In the Solaris operating environment, the error might look similar to the following: 

Jan  5 14:45:37 host1 rpcbind[86]: refused connect from 158.175.36.135 
to callit(ypserv) 


In all cases, the ypserv part of the message might actually be any RPC service, such as mount or nfs or status.

Action
The user has a replacement portmap or rpcbind. The version is enhanced to add access controls, and the error in question is reporting an access violation. The replacements are third-party and are not supported by Sun. The user must locate the access control configuration files and change them to the desired access controls.

connect from hostIP to callit(ypserv): request from unprivileged port
Refer to "connect from hostIP to callit(ypserv): request from unauthorized host".

connect from hostIP to callit(ypserv): request not forwarded
Refer to "connect from hostIP to callit(ypserv): request from unauthorized host".

Connection closed.
Cause
When using rlogin(1), this message can appear under the following circumstances: 

If the remote host cannot create a process for this user

If the user takes too long to type the correct password

If the user interrupts the network connection

If the remote host goes down

Data loss is possible if files were modified and not saved before the connection closed.

Action
Try again. If the other system has gone down, wait for it to reboot first.

Connection closed by foreign host.
Cause
When a user applies telnet(1) to another system, this message can appear under the following circumstances: 

If the user takes too long to type the correct password

If the remote host cannot create a login for this user

If the remote host goes down or terminates the connection

Data loss is possible if files were modified and not saved before the connection closed.

Action
Try again. If the other system has gone down, wait for it to reboot first.

[Connection closed. Exiting]
Cause
After using the talk(1) command to communicate with another user, the other person enters an interrupt (usually Control-C), and this message appears on your screen.

Action
Sending an interrupt is the usual way of exiting the talk program. The talk(1) session is over, and you can return to your work.

Connection refused
Cause
No connection could be made because the target machine actively refused it. This happens either when trying to connect to an inactive service or when a service process is not present at the requested address.

Action
Activate the service on the target machine, or start it up again if it has disappeared. If, for security reasons, you do not intend to provide this service, inform the user community, possibly suggesting an alternative.

Technical Notes
The symbolic name for this error is ECONNREFUSED, errno=146.

Connection reset by peer
Cause
A connection was forcibly closed by a peer. This is normally due to a remote host connection loss from a timeout or a reboot.

Technical Notes
The symbolic name for this error is ECONNRESET, errno=131.

Connection timed out
Cause
This error occurs either when the destination host is down or when problems in the network cause a loss in transmission.

Action
Do the following: 

Check the operation of the host system, for example by using ping(1M) and ftp(1).

Repair or reboot as necessary.

If the above does not solve the problem, check the network cabling and connections.


Technical Notes
No connection was established in a specified time. A connect or send request failed because the destination host did not properly respond after a reasonable interval. (The time-out period is dependent on the communication protocol.)

The symbolic name for this error is ETIMEDOUT, errno=145.

console login: ^J^M^Q^K^K^P
Cause
This error usually occurs because OpenWindows exited abnormally, leaving the system's keyboard in the wrong mode. The characters that appear when someone attempts to login are garbage transliterations of what someone typed.

Action
If you are on a SPARCTM system, do the following: 

Find another machine and remote log in to this system

Run the following command: 

$ /usr/openwin/bin/kbd_mode -a 


This puts the console back into ASCII mode. 


--------------------------------------------------------------------------------
Note - 
kbd_mode is not a windows program; it fixes the console mode.


--------------------------------------------------------------------------------

If you are on an IA system, do the following: 

Log in remotely and start 

kill the X server or reboot the system


Technical Notes
The usual reason for this problem occurring is an automated script run from cron(1M) that clears the /tmp directory periodically. Ensure that any such scripts do not remove the /tmp/.X11-pipe or /tmp/.X11-unix directories, or any files in them.

core dumped
Cause
A core(4) file contains an image of memory at the time of software failure and is used by programmers to find the reason for the failure.

Action
To see which program produced a core(4) file, run either the file(1) command or the adb(1) command. The following examples show the output of the file(1) and adb(1) commands on a core file from the dtmail program. 

$ file core
core: ELF 32-bit MSB core file SPARC Version 1, from `dtmail' 


$ adb core
core file = core -- program `dtmail'
SIGSEGV  11: segmentation violation
^D      (use Control-d to quit the program)
 
Ask the vendor or author of this program for a debugged version.

Technical Notes
Some signals, such as SIGQUIT, SIGBUS, and SIGSEGV, produce a core dump. See the signal(5) man page for a complete list.

If you have the source code for the program, you can try compiling it with cc -g, and debugging it yourself using dbx or a similar debugger. The where directive of dbx provides a stack trace.

On mixed networks, it can be difficult to discern which machine architecture produced a particular core dump, since adb(1) on one type of system generally cannot read a core(4) file from another type of system and can produce an unrecognized file message. Run adb(1) on various machine architectures until you find the right one.

See Also
For information on saving and viewing crash information, see the System Administration Guide, Volume 2. If you are using AnswerBook online documentation, "system crash" is a good search string.

corrupt label - wrong magic number or corrupt label or corrupt label - label checksum failed
Cause
After a power cycle, the machine displays either of the following error messages: 

corrupt label - label checksum failed 

corrupt label - wrong magic number 

format(1M) displayed the following: 


  0 unassigned    wm       0               0         (0/0/0)          0
  1 unassigned    wm       0               0         (0/0/0)          0
  2     backup    wm       0 - 5460        4.2G    (5460/0/0)   4154160
  3 unassigned    wm       0               0         (0/0/0)          0
  4 unassigned    wm       0               0         (0/0/0)          0
  5 unassigned    wm       0               0         (0/0/0)          0
  6 unassigned    wm       0 - 2730       2.1G       (0/0/0)          0
  7 unassigned 	  wm       2730-5460      2.1G       (0/0/0)          0 

The disks were using raw partitions beginning at block 0 (cylinder 0). The disk label (VTOC) is kept on the block 0 of cylinder 0. The label eventually gets overwritten by database programs using raw partitions, if the raw partition begins at cylinder 0. (UNIX� file systems avoid this area of the partition.)

Action
As a workaround, do the following: 

Go into format(1M) and get the backup label using the backup command.

Relabel the disk using this backup label. You should then be able to access the disk.

Backup the data on this disk.

Go back to the disk and relabel it, starting the raw partition at cylinder 1. (This loses one cylinder, but prevents corrupting the VTOC.)

Label again.

Restore the data from your backup.


could not grant slave pty
Cause
User gets the error message could not grant slave pty when attempting a telnet(1), rlogin(1), or rsh(1) session (anything that requires a shell) or when trying to bring up an x-term.

Action
The user's file permissions were set wrong on /usr/lib/pt_chmod. The user had: 

# ls -la /usr/lib/pt_chmod
---s--x--x   1 bin     bin         3120 May  3  1996 
The permissions should be: 

# ls -la /usr/lib/pt_chmod
---s--x--x   1 root     bin         3120 May  3  1996 


--------------------------------------------------------------------------------
Note - 
The owner should be root; the user had bin as the owner. Also, the setuid bit must be set. 


--------------------------------------------------------------------------------

By using chown root pt_chmod, the problem was corrected.

Could not initialize tooltalk (tt_open): TT_ERR_NOMP
Cause
Various desktop tools display or print this message when the ttsession(1) process is not available. The ToolTalk service generally tries to restart ttsession(1), if it is not running. Thus, this error indicates that the ToolTalk service is either not installed or is not installed correctly.

Action
Verify that the ttsession(1) command exists in /usr/openwin/bin or /usr/dt/bin. If this command is not present, ToolTalk is not installed correctly. The packages constituting ToolTalk are the runtime SUNWtltk, developer support SUNWtltkd, and the manual pages SUNWtltkm. CDE ToolTalk packages have the same names with ".2" appended.

Technical Notes
The full TT_ERR_NOMP message string reads as follows: "No ttsession(1) is running, probably because tt_open(3) has not been called yet. If this is returned from tt_open(3), it means ttsession(1) could not be started, which generally means ToolTalk is not installed on the system."

Could not open ToolTalk Channel
Cause
This error message is displayed while attempting to remotely run workshop. 

Action
Do the following: 

Make sure workshop is no longer running.

In the telnet/rlogin session window, type: /bin/ps -ef | grep ttsession. If one is running in the system that belongs to the telnet user, type kill pid_of_ttsession.

In the telnet rlogin session, type /usr/dt/bin/ttsession -s -d machine_telnetting_from:0.0.

Start workshop.


Could not start new viewer
Cause
This message appears in the AnswerBook navigator window, along with an XView error message on the console.

Action
For details, refer to "answerbook: XView error: NULL pointer passed to xv_set".

Could not start NFS service for any protocol. Exiting
Cause
The following errors occur at boot time: 

/usr/lib/nfs/nfsd[478]: t_bind to wrong address
/usr/lib/nfs/nfsd[478]: t_bind to wrong address
/usr/lib/nfs/nfsd[478]: Cannot establish NFS service over /dev/udp: transport setup problem.
/usr/lib/nfs/nfsd[478]: Cannot establish NFS service over /dev/udp: transport setup problem.
/usr/lib/nfs/nfsd[478]: t_bind to wrong address
/usr/lib/nfs/nfsd[478]: t_bind to wrong address
/usr/lib/nfs/nfsd[478]: Cannot establish NFS service over /dev/tcp: transport setup problem.
/usr/lib/nfs/nfsd[478]: Cannot establish NFS service over /dev/tcp: transport setup problem.
/usr/lib/nfs/nfsd[478]: Could not start NFS service for any protocol. Exiting.
/usr/lib/nfs/nfsd[478]: Could not start NFS service for any protocol. Exiting. 


In this situation, a backup copy of the S15nfs.server script in /etc/rc3.d was made. However, the backup copy was renamed to S15nfs.server.BAK. Since the backup copy starts with a upper case "S," it was also executed at boot time. The errors occurred when a second NFSD was attempted. 

Action
If a backup copy of any startup script is made, it should be renamed with a lower case "s," so as not to be executed at boot.

cpio: Bad magic number/header.
Cause
A cpio(1) archive has either become corrupted or was written out with an incompatible version of cpio(1).

Action
Use the -k option to cpio(1) to skip I/O errors and corrupted file headers. This might permit you to extract other files from the cpio(1) archive. To extract files with corrupted headers, try editing the archive with a binary editor such as emacs(1). Each cpio(1) file header contains a filename as a string.

See Also
For more information on magic numbers, see magic(4).

cpio : can't read input : end of file encountered prior to expected end of archive.
Cause
This message appears when trying to read a multi-volume floppy in bar format using the following command: 

  # cpio -id -H bar -I /dev/diskette0 


Action
kill /usr/sbin/vold by running /etc/init.d/volmgt stop and use the device name /dev/rfd0.

Cross-device link
Cause
An attempt was made to make a hard link to a file on another device, such as on another file system.

Action
Establish a symbolic link using ln -s instead. Symbolic links are permitted across file system boundaries.

Technical Notes
The symbolic name for this error is EXDEV, errno=18.


"D"
data access exception
Cause
This message appears when running an old version of the operating system that does not support new hardware or when running an operating system that is not configured for new hardware. It can also be the result of an incorrectly installed DSIMMs or a disk problem.

Action
Upgrade your operating system to a version that supports the new hardware or machine architecture. 

See Also
For more information on upgrades, see the section describing system and device configuration in the Solaris Transition Guide.

Data fault
Cause
This error is a kind of BAD TRAP that usually causes a system panic. When this message appears after a BAD TRAP message, a system text or data access fault probably occurred. [See the message BAD TRAP for more information.] In the absence of a BAD TRAP message, this message might indicate a user text or data access fault. Data loss is possible, if the problem occurs other than at boot time.

Action
Make sure the machine can reboot, then check the log file /var/adm/messages for hints about what went wrong.

Deadlock situation detected/avoided
Cause
A programming deadlock situation was detected and avoided.

Action
If the system had not detected and avoided a deadlock, a piece of software would have hung. Run the program again. The deadlock might not reoccur.

Technical Notes
This error usually relates to file and record locking, but can also apply to mutexes, semaphores, condition variables, and read/write locks.

The symbolic name for this error is EDEADLK, errno=45.

See Also
See the section on deadlock handling in the System Interface Guide. See also the section on avoiding deadlock in the Multithreaded Programming Guide.

Destination address required
Cause
A required address was omitted from an operation on a transport endpoint. Destination address required.

Technical Notes
The symbolic name for this error is EDESTADDRREQ, errno=96.

destination component full
Cause
Solstice backup is reporting destination component full.

This message appears when a manual operation is performed on the jukebox/autochanger (for example, physically unloading the tape drive by means of the buttons on the autochanger, rather than using SBU to unmount the volume). This operation causes SBU to lose track of the status of the media in the autochanger.

Action
The following command should resolve the problem: /usr/sbin/nsr/nsrjb -H.

/dev/fd/int: /dev/fd/int: cannot open
Cause
setuid and setgid shell scripts refuse to run. They return an error message similar to /dev/fd/3: /dev/fd/3: cannot open. (The number following /dev/fd/ is not necessarily 3.) The first line of the script properly starts a shell, and the file system containing the script is not mounted with the nosuid option.

Running truss on the shell script reveals that a call to open(2) is failing with error number 6 (ENXIO): 

open("/dev/fd/3", O_RDONLY)                     Err#6 ENXIO 


Action
setuid and setgid shell scripts use the file descriptors in /dev/fd. The contents of /dev/fd are a file descriptor file system (FDFS) and have no connection with floppy disks!

Ensure that the fdfs is mounted as /dev/fd. Before the machine is next rebooted, the following line should appear in /etc/vfstab, exactly like this (with no initial comment symbol): 

fd		-		/dev/fd		fd	-	no	- 


It might be possible to remount /dev/fd without rebooting by running the following as root: 

# mount fd /dev/fd 
Otherwise, to make setuid/setgid shell scripts available, the machine must be rebooted after editing /etc/vfstab as detailed above.

Some administrators, unaware of what /dev/fd is for, comment out the entry in /etc/vfstab that mounts the FDFS (file descriptor file system). This can go unnoticed until an attempt is made to run a setuid or setgid shell script.

/dev/rdsk/c0t6d0s2: No such file or directory
Cause
When attempting to eject a CD-ROM on a Ultra 450 system, the eject cdrom command fails, displaying the error message.

This happens when the CD-ROM is on controller 1, not 0. When using the eject(1) command, the CD-ROM "nickname" equates to /dev/rdsk/c0t6d0s2. On an Ultra 450, the CD-ROM equates to /dev/rdsk/c1t6d0s2. Therefore, using cdrom does not work.

Action
Use the following command instead: 

# eject cdrom0 
If volume manager /usr/sbin/vold is not running, you can use the following: 

# eject /dev/rdsk/c1t6d0s2 


--------------------------------------------------------------------------------
Note - 
Make sure that the front panel of the system is unobstructed so the CD-ROM tray is not blocked. Otherwise, the eject(1) command appears to hang since the tray is trying to open, but is physically blocked.


--------------------------------------------------------------------------------

Device busy
Cause
An attempt was made to mount a device that was already mounted or to unmount a device containing an active file (such as an open file, a current directory, a mount point, or a running program). This message also occurs when trying to enable accounting that is already enabled.

Action
To unmount a device containing active processes, close all the files under that mount point, quit any programs started from there, and change directories out of that hierarchy. Then try to unmount again.

Technical Notes
Mutexes, semaphores, condition variables, and read/write locks set this error condition to indicate that a lock is held.

The symbolic name for this error is EBUSY, errno=16.

device busy
Cause
If you perform an eject cdrom and then receive the above message, it could be due to a number of problems. Below is a list of things that you can check and do to permit ejection of the CD from the device.

Action
Step A: Ensure that the current directory is not somewhere in the CD: 

 % cd
 %eject cdrom 


Step B: As root: 

# cd /etc/init.d
# ./volmgt stop
# eject cdrom  
If this works, then try:

# ./volmgt start  
If this does not work, go to step C.

Step C: As root: 

# fuser /cdrom   
Kill any processes you feel you have already terminated. A note of caution: If this is an NFS-mounted CD-ROM and there are other users who access this drive, make sure you know what process you are killing and why. 

# ./volmgt stop
#  ps -ef | grep vold  
If vold still is running, kill the process. 

#  eject cdrom  
If this does not work, then: 

#  cd /vol  
Make sure that dev, dsk, rdsk, rmt are in the directory. If not, probably your /vol directory is corrupt and a reboot might be needed for proper rebuild.

Step D: The last three options are: 

Reboot.

If the CD drive is external to the system, try power cycling the drive and pressing the eject button.

If all else fails and the CD-ROM is external, on the right hand side of the eject button is a small hole into which you can insert a small straight device which forces manual ejection of the caddy.


/dev/rdsk/string: CAN'T CHECK FILE SYSTEM.
Cause
The system cannot automatically clean (preen) this file system because it appears to be set up incorrectly or is having hard-disk problems. This message asks that you run fsck(1M) manually, since data corruption might already have occurred. 

Action
Run fsck to clean the file system in question. For proper procedures, refer to "/dev/rdsk/string: UNEXPECTED INCONSISTENCY; RUN fsck MANUALLY.".

/dev/rdsk/string: UNEXPECTED INCONSISTENCY; RUN fsck MANUALLY.
Cause
During a boot, the /etc/rcS script runs the fsck(1M) command to check the integrity of file systems marked "fsck" in /etc/vfstab. If fsck(1M) cannot repair a file system automatically, it interrupts the boot procedure and produces this message. When fsck(1M) gets into this state, it cannot repair a file system without losing one or more files, so it wants to defer this responsibility to you, the administrator. Data corruption has probably already occurred. 

Action
First run fsck -n on the file system, to see how many and what type of problems exist. Then run fsck(1M) again to repair the file system. If you have a recent backup of the file system, you can generally answer "y" to all the fsck(1M) questions. It is a good idea to keep a record of all problematic files and inode numbers for later reference. To run fsck(1M) yourself, specify options as recommended by the boot script. For example: 

# fsck /dev/rdsk/c0t4d0s0 
Usually the files lost during fsck(1M) repair are those that were created just before a crash or power outage, and they cannot be recovered. If you lose important files, you can recover them from backup tapes.

If you do not have a backup, ask an expert to run fsck(1M) for you.

See Also
For more information on file checking, see the section on checking file system integrity in the System Administration Guide, Volume 1.

Directory not empty
Cause
The directory operation that was attempted, such as directory removal with rmdir(1), can be performed only on an empty directory.

Action
To remove the directory, first remove all the files that it contains. A quick way to remove a non-empty directory hierarchy is with the rm -r command.

Technical Notes
The symbolic name for this error is ENOTEMPTY, errno=93.

Disc quota exceeded
Cause
The user's disk limit has been exceeded on a user file system, usually because a file was just created or enlarged beyond the limit. This almost always refers to a magnetic disk, and not to an optical disc. Any data created after this condition occurs can be lost.

Action
The user can delete files to bring disk usage under the limit, or the server administrator can use the edquota(1M) command to increase the user's disk limit.

Technical Notes
The symbolic name for this error is EDQUOT, errno=49.

disk does not appear to be prepared for encapsulation
Cause
When attempting to encapsulate the root disk during vxinstall, the user gets this error message.

The disk was sliced properly for encapsulation; however, the prtvtoc command was non-executable, because the permissions had been changed.

diskN not unique
Cause
During boot, the system displays disk0 not unique. The error happens before the kernel loads.

Action
There are more than one devalias entries for disk0. Use devalias at the OK prompt to see the entries.

To remove the duplicate, run the following command at the OK prompt: 

nvunalias disk0 
and reset the system.

dlopen (libxfn.so) failed
Cause
The SUNWfns package was left out of the End User Cluster. If only this cluster is installed and automounter is used, it fails with the above message. libxfn.so is the shared library for the Federated Naming System.

Action
Install package SUNWfns from of the distribution CD.

driver is already installed
Cause
The SunPCTM 4.1 package and then necessary patches (102924) were added. When trying to run sunpc_install, the user got the above error message. prtconf(1M) shows that the driver is not attached, and modinfo(1M) displays 4 modules.

After removing the package, backing out the patch, and reinstalling, the user still received the same error message.

Action
SunPC had previously been installed on the system. When removing the package with the pkgrm(1M) command, not all components were removed, because pkgrm(1M) is not aware of changes made by the sunpc_install script.

To resolve this problem it is necessary to remove sections in the files pertaining to SunPC: /etc/devlink.tab, /etc/driver_aliases, and /etc/rc2.d/S10storekernname, and then reinstall the package.

dtmail: cannot open mailfile on 2.5.1 /var/mail server
Cause
/var/mail is mounted onto client machine A, which is running CDE 1.2 (the Solaris 2.6 release), from machine B, a server running the Solaris 2.5.1 release.

OpenWindow's mailtool can read/write mailfiles on the server without any problems. However, CDE's dtmail does not open the mailbox.

Action
The bug's permissions and ownership have to be checked. The mail directory should have the following permissions: 

skywalker$ ls -lad /var/mail
drwxrwsrwt   3 root     mail         512 Feb 10 14:40 /var/mail/ 
while the mailbox itself should look something like this:

-rw-------   1 zvinakis mail     3206838 Feb 19 11:51 /var/mail/zvinakis 
If the directory's permissions are not set properly, issue these commands on the mail server: 

chmod a+t /var/mail
chmod g+s /var/mail 


If the permissions (or group) are not correct on the mailbox itself, using "joe" as an example mailbox, type: 

chgrp mail /var/mail/joe 
To change the permissions, type: 

chmod 600 /var/mail/joe 


DUMP: Cannot open dump device `/dev/rdsk/c2t0d0s1': Permission denied
Cause
When using ufsdump(1M) as user sys (UID 3) on a disk drive in an SSA, the ufsdump(1M) command fails with this message.

Action
Six-hundred (600) permissions were created on the SSD "instance path" for a disk drive in an SSA. For a non-root user to read them, there should have been 0640. For example, if you see this: 

# ls -lL /dev/rdsk/c2t0d0s1
crw-------   1 root     sys      192,241 Jul 10  1996 /dev/rdsk/c2t0d0s1 
Change it to this: 

crw-r-----   1 root     sys      192,241 Jul 10  1996 /dev/rdsk/c2t0d0s1 
You might also want to add the following line: 

ssd:* 0640 root sys    
to the /etc/minor_perm file, so subsequently added arrays do not have the same problem.

dumptm: Cannot open `/dev/rmt/string': Device busy
Cause
During file system backup, the dump program cannot open the tape drive, because some other process is holding it open.

Action
Find the process that has the tape drive open, and either kill(1) the process or wait for it to finish. 

# ps -ef | grep /dev/rmt
# kill -9 processID
 

DUP/BAD I=i OWNER=o MODE=m SIZE=s MTIME=t FILE=f REMOVE?
Cause
During phase 1, fsck(1M) found duplicate blocks or bad blocks associated with the file or directory specified after FILE= whose inode number appears after I= (with other information).

Action
To remove this file or directory, answer "yes." If you have to remove more than a few files in this manner, data can be lost. Therefore, it might be preferable to restore the file system from backup tapes.

See Also
For more information on checking file systems, see the section on checking file system integrity in the System Administration Guide, Volume 1.

int DUP I=int 
Cause
Upon detecting a block that is already claimed by another inode, fsck(1M) prints the duplicate block number and its containing inode (after I=).

Action
In fsck(1M) phases 2 and 4, you decide whether or not to clear these bad blocks. Before committing to repair with fsck(1M), you could determine which file contains this inode by passing the inode number to the ncheck(1M) command: 

# ncheck -iinum filesystem
 

See Also
For more information, see the chapter on checking file system integrity in the System Administration Guide, Volume 1.


"E"
Encapsulation of root disk is not supported on systems with old PROM versions
Cause
When encapsulating the root disk with Volume Manager, this error message is printed.

Action
This error message probably has nothing at all to do with the user's system PROM version. It most likely is related to the fact that the file /dev/vx/config (and the pseudo device that it is linked to) does not exist on the system. A few things could contribute to this file not being created: 

Make sure these lines are in the /etc/system file: 

forceload: drv/vxdmp    (only needed for SEVM 2.5 and above) 
forceload: drv/vxio 
forceload: drv/vxspec 


Make sure the vx entries are in the /etc/name_to_major file. 

grep vx /etc/name_to_major 
This should come back with 2 or 3 lines (vxio and vxspec, also vxdmp, if running SEVM 2.5 and above). The major number values might be different from machine to machine; however, if the entries are in there, that should be sufficient.

If you have not performed a boot -r since doing a pkgadd to the Volume Manager software, this might have contributed to the problem.


ENOMEM The available data space is not large enough to accommodate the shared memory segment
Cause
ENOMEM errors occur after 80 segments have been allocated by Lotus Notes.

Action
The design and implementation of the Solaris ISM (intimate shared memory)--which limits the number of shared memory segments that can be attached to a particular process--caused the ENOMEM failures to the Lotus Notes application.

There is a limit because all shared memory segments are attached in the intimate shared memory (ISM) mode by a system variable that is set in the shmsys:share_page_table system file.

When a shared memory segment is attached in ISM mode, the OS locks that segment into physical memory and arranges the virtual/physical address mappings such that only one copy of the mapping information is shared among all attaching processes. To accomplish this, the OS requires that the virtual starting address of the segment be aligned on a 16 Mbytes (hex 0x1000000) = 16777216-bytes address boundary.

The NULL address lets the system decide what virtual address the segment should be attached to. The system also assigns addresses at 0x3000000 apart, unless forced to attach addresses at 0x1000000 apart. 

A sun4d could create and attach up to 220 1-Mbyte ISM segments, and a sun4m could create and attach up to 235 1-Mbyte ISM segments, providing the segments were 0x1000000 apart.

Having established that ISM is the cause of the limit, below are some options:

First, the limit only gives Lotus Notes the ability to attach a total of 80 Mbytes of shared memory. By increasing the segment size to 10 Mbyte, as Lotus has already recommended, 8 ISM segments can handle the load previously needing 80 1-Mbyte segments. The load could conceivably grow to 800 Mbytes now without running into the ISM addressing limit.

Second, the share_page_table (ISM) flag could be turned off. This would give a sun4m the ability to create in excess of 3000 1-Mbyte segments. The problem here is that ISM does improve the performance of shared memory accesses, and, if the user intends to move up to 2.5.1, ISM is required to get around another set of problems related to shared memory loads of this kind.

Third, Lotus could change the Notes server so that it kept track of the attach addresses and always attached at 0x1000000 boundary addresses, instead of having the system default to the 0x3000000 address boundary. This would allow a Notes server to grow to 235 segments on a sun4m.

error 15 initializing
Cause
It is caused by a bad /boot or 4.1 on ss2 - level 15 interrupt.

Error 76
Cause
This error is RFS-specific. The server is telling the client that a process has transferred back from mount point.

Technical Notes
The symbolic name for this error is EDOTDOT, errno=76.

Error 88
Cause
This error is caused by an illegal byte sequence. 

Action
You need to handle multiple characters as a single character.

Technical Notes
The symbolic name for this error is EILSEQ, errno=88.

error code 2: access violation
Cause
The user receives this message when trying to do a tftp get.

Action
Do not use a relative path when using tftp. For example: 

tftp> get /tftpboot/testfile 
fails, and 

tftp> get testfile 
succeeds.

error: DPS has not initialized or server connection failed
Cause
This message appears when trying to run AnswerBook on a generic X11 window server or on a generic X terminal.

Action
Running AnswerBook requires Display PostScript (DPS), or a NeWS server, or the Adobe DPS NS remote display software. Additionally, a complete LaserWriter II Type-1 font set (including Palatino) should be installed on the X server. To find out if the X server has DPS, run xdpyinfo(1) to verify the presence of an "Adobe-DPS-Extension" line. X servers without this line do not know about DPS.

Error: Error adding OS service Solaris 2.6 sparc sun4u:
Cause
While trying to add OS services to a newly installed Solaris 2.6 environment and using Solstice Adminsuite 2.3, the process fails with the following error message: 

Error: Error adding OS service Solaris 2.6 sparc sun4u:
inconsistent revision, installed package SUNWpppk revision 3.0.1
does not match revision 11.6.0,REV=1997.07.15.21.46 for sparc
architecture. 
This error is caused by the optional Solstice PPP 3.0.1 packages from the "Solaris Server Intranet Extension" CD-ROM installed on the system.

Action
As a workaround, remove the PPP 3.0.1 packages and replace them with the PPP packages from the Solaris 2.6 release CD-ROM. For example: 

# pkgrm SUNWlicsw SUNWlit SUNWpppk SUNWpppm SUNWpppr SUNWppps SUNWpppu
:
:  {package remove info}
:
# cd /cdrom/cdrom0/s0/Solaris_2.6/Product
# pkgadd -d . SUNWapppr SUNWapppu SUNWpppk
:
:  {package add info}
: 
Then, use Adminsuite to add the OS services, which should then work without error.


--------------------------------------------------------------------------------
Note - 
If the Solstice PPP 3.0.1 package is configured and currently in use on the system, the user should save any of the previously entered PPP configuration information for restoration after the OS services have been installed. (pkgrm(1M) the 3 PPP packages installed from the 2.6 CD release, and againpkgadd(1M) all of the PPP packages from the Intranet Extension CD-ROM, then redo the configuration.) If the Solstice PPP 3.0.1 package was not used on the system, there is no reason to reinstall it. Use /usr/bin/pkginfo to check the installed packages.


--------------------------------------------------------------------------------

This is documented in Chapter 9 of the Solaris Server Intranet Extension Installation and Release Notes Solaris 2.6 manual.

Error Host Unknown:
Cause
In this case, the user is on Windows 95, running PC-NFS pro2.0. The user uses ping(1M) to reach another computer on the network. ping(1M) returns Host Unknown.

This happens when name services are not set up correctly.

Action
Click the Windows 95 Start button, click Programs, click PC-NFSpro, then click Configuration. 

Click TCP/IP and make sure all settings are entered correctly. 

If NIS is enabled, click Configure NIS and make sure the NIS domain and server names are correct. 

If DNS is enabled, click Configure DNS and make sure the DNS domain and server names are correct. 

Click edit hosts and add the name and IP address of the machine you are trying to ping(1M), along with the authentication server.

If you make any changes, click OK, then click Save and Exit on the Configuration dialog box. Shut down and restart Windows 95.

ERROR: missing file arg (cm3)
Cause
An attempt was made to run some sccs(1) operation that requires a file name, such as create, edit, delget, or prt.

Action
Supply the appropriate file name after the SCCS operation.

ERROR [SCCS/s.string]: `SCCS/p.string' nonexistent (ut4)
Cause
An attempt was made to sccs(1) edit or sccs get a file that was not yet under SCCS control.

Action
Run sccs(1) create on that file to place it under SCCS control.

ERROR [SCCS/s.string]: writable `string' exists (ge4)
Cause
An attempt was made to sccs(1) edit a file that is writable, probably because it was already checked out.

Action
Run sccs(1) info to see who has the file checked out. If it is you, go ahead and edit it. If it is somebody else, ask that person to check-in the file.

Error: you don't have a license to run this program
Cause
The user tries to mount the /export file system with Volume Manager 2.1.1 and gets this message.

Action
Run vxserial -p to print the available Volume Manager licenses in the system. 

Also, check the /etc/vfstab file to make sure that the file system is not a vxfs file system.

esp0: data transfer overrun
Cause
When a user tries to mount a CD-ROM on a third-party CD drive, mount(1M) fails with the above error, followed by the sr0: SCSI transport failed message. The CD drive probably comes from a vendor unknown to the system.

Action
Third-party CD drives generally have an 8192 block size, as opposed to the 512 block size on supported Sun drives. Check with the vendor to see if any special configuration is possible to allow the drive to operate on a Sun workstation.

ether_hostton errors from cb_reset
Cause
You issue cb_reset on an SSP and get the following: 

cb_reset
Resetting host snax-cb0... 
warning: ether_hostton(SrcHost:beer): Bad file number 
warning: ether_hostton(SrcHost:beer): Bad file number 
warning: ether_hostton(SrcHost:beer): Bad file number 
Resetting host snax-cb1... 
warning: ether_hostton(SrcHost:beer): Resource temporarily unavailable 
warning: ether_hostton(SrcHost:beer): Resource temporarily unavailable 
warning: ether_hostton(SrcHost:beer): Resource temporarily unavailable 
snax-cb0 is ready... 
snax-cb1 is ready... 
The cb_reset actually completes, but the error messages are annoying.

Action
/etc/nodename is probably incorrect. The following details are from a machine getting this error message. Note that /etc/nodename contains an alias to the real name of the SSP. To correct the problem, edit /etc/nodename to match the true name and reboot. 

# cat /etc/nodename  
beer  

# cat /etc/hostname.qfe0  
snax-ssp  

# cat /etc/hosts  127.0.0.1	localhost  
129.153.49.179	 snax-ssp beer	loghost  

# cat /etc/ethers  
8:0:20:87:58:a5         snax-ssp beer 


Event not found
Cause
This C shell message indicates that a user tried to repeat a command from the history list, but that command or number does not exist in the list.

Action
Run the C shell history(1) command to display recent events in the history list. If a user often tries to run commands that have disappeared from the history list, make the list longer by setting history(1) to a higher value.

See Also
For more information about the C shell, see csh(1).

EXCESSIVE BAD BLKS I=int CONTINUE?
Cause
During phase 1, fsck(1M) found more than 10 bad (out-of-range) blocks associated with the specified inode number.

Action
With this many bad blocks, it might be preferable to restore the file system from backup tapes.

See Also
For more information on bad blocks, see the section on checking file system integrity in the System Administration Guide, Volume 1. If you are using AnswerBook online documentation, "bad blocks" is a good search string.

EXCESSIVE DUP BLKS I=int CONTINUE? 
Cause
During phase 1, fsck(1M) found more than 10 duplicate (previously claimed) blocks associated with the specified inode number.

Action
With this many duplicate blocks, it might be preferable to restore the file system from backup tapes.

See Also
For more information on blocks, see the section on checking file system integrity in the System Administration Guide, Volume 1. If you are using AnswerBook online documentation, "bad blocks" is a good search string.

Exec format error
Cause
This often happens when trying to run software compiled for different systems or architectures, such as when executing the programs on a SunOS 4.1 system, or when trying to execute SPARC-specific programs on an IA machine. This error can also occur if the Binary Compatibility Package was not installed.

Action
Make sure that the software matches the architecture and system you are using. The file(1) command can help you determine the target architecture. If you are using SunOS 4.1 software on a later release, make sure that the Binary Compatibility Package is installed. You can check for it using this command: 

$ pkginfo | grep SUNWbcp 


Technical Notes
A request was made to execute a file that, although it has the appropriate permissions, does not start with a valid format.

The symbolic name for this error is ENOEXEC, errno=8.

See Also
See the a.out(4) man page for a description of executable files.


"F"
failed to initialize adapter
Cause
When using an Adaptec AHA-154x Cx SCSI HBA during installation of the IA release, you might see a message during the MDB device probe that says failed to initialize adapter after the probe has correctly identified the card. There are a variety of reasons for this error, but in all cases the error is because of misconfiguring the card.

Action
To correct the problem, press Ctrl-A during boot to enter the 154x BIOS configuration utility. Choose the Configure/View Host Adapter Settings option, then press the F6 key to return the adapter to its factory default settings. 

After doing this, reconfigure the adapter using the instructions contained in the IA Device Configuration Guide or Driver Update Guide, if applicable. It is especially important that the adapter be configured to use DMA 6. Note that it must be changed from the default of DMA 5.

Failed to Load Security Policy: Invalid argument
Cause
While installing a policy from the GUI (or the command line) the following error message is displayed: 


default.W: Security Policy Script generated into default.pf
default:
Compiled OK.

Installing Security Policy default on all.all@lab-netra
Failed to Load Security Policy: Invalid argument  <-------------- !!
Installing Security Policy on localhost(localhost) failed 

If you truss the policy load, you receive the following: 

truss -o /tmp/truss -f -vall -rall -wall /etc/fw/bin/fw 
                   /etc/fw/conf/default.W 


The following is near the end of the truss: 


1226:   open("/dev/fw0", O_RDWR|O_NONBLOCK)             = 7
1226:   ioctl(7, 0xC0C07A18, 0xEFFFBCA0)                Err#22 EINVAL 

This problem is caused by someone "plumbing" or configuring a new Ethernet interface after Firewall-1 has already started (that is, plumbing an interface by hand after the system has been booted). 

Action
This error can be resolved by configuring the interface to configure automatically at boot time (for example, by creating a /etc/hostname.qe0 file) and rebooting the system. 

The following is another solution: 


/etc/fw/bin/fwstop                       # Stop firewall 
modinfo | grep fw                        # Get kernel module ID

85 f5e19000  3cc0c  51   1  fw (fw)  

modunload -i 85                          # Unload kernel module

/etc/fw/bin/fwstart                      # Restart firewall 

The policy installs correctly now with the following: 


# ./fw load ../conf/default.W
default.W: Security Policy Script generated into default.pf
default:
Compiled OK. 

fast access mmu error
Cause
The user receives this message while trying to boot the Ultra over the network by using the FDDI 5.0 card. 

Action
Do the following: 

Setenv auto-boot? to false.

Reset the system.

Boot the FDDI card.


fbconsole: ioctl SRIOCSREDIR: Device Busy.
Cause
When starting OpenWindows from the command line, the following error message is echoed on the Solaris "Welcome" screen: fbconsole: ioctl SRIOCSREDIR: Device Busy 

Once inside OpenWindows, the following message is displayed in the background windows and when starting cmdtool -C:

SYSTEM WARNING: Object 0x340f8, Device busy, ioctl SRIOCSREDIR returned -1, attempt to make tty the console failed (Tty package) 

Action
OpenWindows was probably started in the background (using the "&"). Exit OpenWindows, and run the command in foreground: /usr/openwin/bin/openwin 

If this does not help, then perhaps some daemon or process is "holding" the console. Type the command: fuser /dev/console.

A list of process IDs is returned. Examine these processes to determine if an application has hold of the console (using the ps(1) command helps).

fd0: unformatted diskette or no diskette in the drive
Cause
This message appears on the system console to indicate that the floppy driver fd(4) could not read the label on a diskette. Usually this is either because a new diskette has not yet been formatted, or a formatted diskette has become corrupted. This message often appears along with read failed and bad format messages after volcheck(1) has been run.

Action
If you are certain that the diskette contains no data, run fdformat -d to format the diskette in DOS format. (You can also format a diskette in UFS format if you like, although then it cannot be transported to most other systems.) When the diskette is formatted, you can write on it, if it has not been corrupted beyond repair.

File descriptor in bad state
Cause
Either a file descriptor refers to no open file or a read request was made to a file that is open only for writing.

Technical Notes
The symbolic name for this error is EBADFD, errno=81.

File exists
Cause
The name of an existing file was mentioned in an inappropriate context. For example, establishing a link to an existing file, or overwriting an existing file are not allowed when the csh(1) noclobber option is set.

Action
Look at the names of files in the directory, then try again with a different name or after renaming or removing the existing file.

Technical Notes
The symbolic name for this error is EEXIST, errno=17.

File locking deadlock
Cause
This is a programming problem and, in some cases, is unavoidable.

Action
All a user can do is restart the program and hope deadlock does not reoccur.

Technical Notes
In the file locking subsystem, two processes tried to modify some lock at the same time. In the multi-threading subsystem, two threads became deadlocked and could not continue. When a program using the threads library encounters this error, it should restart the deadlocked threads.

The symbolic name for this error is EDEADLOCK, errno=56.

File name too long
Cause
The specified file name has too many characters.

Action
If a file name or path name component is too long, devise a shorter name. If the total path name is longer than PATH_MAX characters, first change to an intermediate directory, then specify a shorter path name. Newly created data will be lost unless written to another file with a shorter name.

Technical Notes
In a UFS or NFS-mounted UFS file system, the length of a path name component exceeds MAXNAMLEN (255) characters, or the total length of the path name exceeds PATH_MAX (1024) characters. In a System V file system, the length of a path name component exceeds NAME_MAX (14) characters while no-truncation mode is in effect. These values are defined in the /usr/include/limits.h file.

The symbolic name for this error is ENAMETOOLONG, errno=78.

file system full
Cause
This error message is seen during a login. The login fails with the message No utmpx entry.

See Also
Refer to "No utmpx entry".

FILE SYSTEM STATE IN SUPERBLOCK IS WRONG; FIX?
Cause
The fsck(1M) command has just checked a file system, and has determined that the file system is clean. The file system's super block, however, still thinks the file system is "dirty" in some way.

Action
If you believe that the file system is adequately repaired, answer "yes" to mark the file system as clean.

Technical Notes
Different "dirty" file system types are listed in /usr/include/sys/fs/ufs_fs.h, and include FSACTIVE, FSBAD, FSFIX, FSLOG, and FSSUSPEND.

See Also
For more information on super blocks, see the section on checking file system integrity in the System Administration Guide, Volume 1. If you are using AnswerBook online documentation, "bad super block" is a good search string.

File table overflow
Cause
The kernel file table is full, because too many files are open on the system. Temporarily, no more files can be opened. New data created under this condition will probably be lost.

Action
Simply waiting often gives the system time to close files. However, if this message occurs often, reconfigure the kernel to allow more open files. To increase the size of the file table, increase the value of MAXUSERS in the /etc/system file. The default MAXUSERS value is the amount of main memory in Mbytes, minus 2.

Technical Notes
The symbolic name for this error is ENFILE, errno=23.

File too large
Cause
The file size exceeded the limit specified by ulimit(1), or the file size exceeds the maximum supported by the file system. New data created under this condition can probably be lost.

Action
In the C shell, use the limit(1) command to see or set the default file size. In the Bourne or Korn shells, use the ulimit -a command. Even when the shells claim that the file size is unlimited, in fact the system limit is FCHR_MAX (usually 1 Gbyte).

Technical Notes
The symbolic name for this error is EFBIG, errno=27.

filemgr: mknod: Permission denied
Cause
File Manager issues this message and fails to come up whenever the /tmp/.removable directory is owned by another user and is not in 1777 mode. This can happen, for example, when multiple users share a workstation.

Action
Have the original owner use chmod(1) to change the mode of this file back to 1777, its default creation mode. Rebooting the workstation also resolves this problem.

Technical Notes
This is a known problem that was fixed in the Solaris 2.4 release.

FREE BLK COUNT(S) WRONG IN SUPERBLK SALVAGE?
Cause
During phase 5, fsck(1M) detected that the actual number of free blocks in the file system did not match the super block's free block count. The df(1M) command accesses this free block count when measuring file system capacity.

Action
Generally you can answer "yes" to this question without harming the file system.

See Also
For more information on super blocks, see the section on checking file system integrity in the System Administration Guide, Volume 1. If you are using AnswerBook online documentation, "bad super block" is a good search string.

fsck & ufsdump - cannot read block/sector errors
Cause
If you have received the following messages from fsck(1M): 

CANNOT READ: BLK 196896         
CONTINUE? y
THE FOLLOWING SECTORS COULD NOT BE READ: 196896 196897 196898 196899 
Or the following warnings from ufsdump(1M): 

DUMP: Warning - cannot read sector 164016 of /dev/vx/rdsk/newdg/vol02 
DUMP: Warning - cannot read sector 164017 of /dev/vx/rdsk/newdg/vol02 
DUMP: Warning - cannot read sector 164018 of /dev/vx/rdsk/newdg/vol02 
It could be that the size of this file system in this volume does not match the size of the regular file system.

Action
To check this, follow the example below: 

Run the command: 

# fstyp -v /dev/vx/rdsk/newdg/vol02 | head -30 | grep ncg 
to print the following line (disregard any error or warning messages you might get): 

ncg     25      size    102400  blocks  95983 
Disregard everything but the number after the word size. This number tells you the file system is 102,400 Kbytes in length. 

Next, find out the size of the volume. Run the command: 

# vxprint -g newdg -vt vol02 
which prints: 

V NAME  USETYPE  KSTATE  STATE  LENGTH READPOL PREFPLEX  
v vol02 fsgen    ENABLED ACTIVE 163840 SELECT 
From this, you can see the volume is 163,840 sectors (divide this number by 2 to get it into Kbytes) or 81,920 Kbytes. As you can see from this example, the volume (80 Mbytes) is much smaller than the file system (100 Mbytes). This should be rectified immediately to avoid or minimize data loss. 

To resolve this problem, back up the data as best you can, then either create a new volume or newfs this one and restore the data. 

This problem can also occur on a DiskSuite metadevice. The difference is that you need to check the size of the metadevice using the metastat command. The metastat command shows the size of the metadevice in sectors, just like the vxprint does.

fsck: Can't open /dev/dsk/string 
Cause
The fsck(1M) command cannot open the disk device, because although a similar file system exists, the partition specified does not. 

Action
Run the mount(1M) or the format(1M) command to see what file systems are configured on the machine. Then run fsck(1M) again on an existing partition.

fsck: Can't stat /dev/dsk/string 
Cause
The fsck(1M) command cannot open the disk device, because the specified file system does not exist.

Action
Run the mount(1M) or the format(1M) command to see what file systems are configured on the machine. Then run fsck(1M) again on an existing file system.

ftp: ftp/tcp: unknown service
Cause
The user received this error while using no naming service. The services file looked fine. The user could FTP as root, but not as a normal user.

Action
The permissions on the /etc/services file were wrong. To correct the problem, the user changed them to read access for everyone (644).

fw_ipinput: q fc5fddc0:illegal interface
Cause
The FW-1 kernel module displays this error message when a new network interface has been added to the FW-1 system while fwd is running.

Action
To resolve this problem, run the following to reinstall the FW kernel and the security policy: 

# fw ctl uninstall
# fw ctl install
# fw fetch localhost 


FW1: log message queue is full
Cause
The console reports FW1: Log message queue is full.

The message log is a queue that keeps all the firewall's event logs until FW-1 finishes processing them. If too many logs arrive, the buffer is full and the message FW-1: log message queue is full appears. It usually happens on loaded systems or firewalls that handle many network connections.

Action
Below are some suggestions to stop this warning message: 

Reduce the amount of logging in the security policy. 


--------------------------------------------------------------------------------
Note - 
ACCOUNTING logging is very heavy. Reducing logging from LONG to SHORT also helps.


--------------------------------------------------------------------------------

Increase the internal memory allocated to the FW kernel module. The default amount of memory is 524K. To increase to 1Mbyte, add the statement below to /etc/system and reboot: 

set fw:fwhmem=0x100000 


Set the Excessive Log Grace Period to 0. This is set through Properties -> Logging and Alerting. You must then reinstall the security policy for the change to take affect. The drawback for setting the Excessive Log Grace Period to 0: Your log now includes similar packets received at approximately the same time. When it was not zero, they were hidden (see Managing FireWall-1 Using the OpenLook GUI, p. 104). Thus, no packet disappears from the log, so your log might be a little bit bigger, but apart from that, no problem.

Use Renice fwd for a higher priority. The default priority of the FW daemon is 0 (like most processes). To raise the priority you must give a negative priority, depending on the load on your system. See the man page on nice(1) for more information.


fwm: no license
Cause
Firewall-1 version 2.1 produces this message when the fwstart command is issued or when fwm is started from the command line.

There are two possible reasons for this: 

When a firewall module is installed without a control station on the same machine, the messages are displayed on the console (under UNIX) or in the event log (under WinNT).

The messages might be legitimate. You might find that fwm has not started and you cannot do some crucial tasks. One possible problem: The license might be issued for the wrong host ID.

Action
Make sure the license daemon is running on the server. Then, consider the following cases:

Case one: As a workaround, ignore the present messages and get an upgrade to 2.1c or above.

Case two: To check for a misassigned license, run the command hostid(1). Your hostid is displayed.

Next, run the command fw printlic to see output similar to the following: 

This is FireWall-1 Version 2.1
Type             Expiration Features
id-649f152b	 never	    stdlight 
The first field should list the correct hostid. Also check the expiration date and the features. A list of what is included with the features is provided in INFODOC 13215. If you find any inconsistencies, call the Sun License and Password Center and get a license reissued. Have you host ID and serial number ready.

fwskip_parse_headers: invalid peer n
Cause
In Firewall-1, the connections encrypted with SKIP are dropped at certain times, specifically near the top of the hour. For example, connections will be dropped from 10:55 to 11:15, then continue working normally until 11:55. These error messages appear on the console in pairs: 

fwskip_parse_headers: invalid peer n 
fw_skip_decrypt: cannot parse headers 


These error messages are referring to the n counter. The n counter is the absolute number of hours in GMT time. It is included in the SKIP calculations as a safeguard against a playback attack. If the 2 hosts or firewalls exchanging encrypted packets are not in sync with respect to GMT time, they have different n counters and these errors appear.

Action
Keep the clocks on the encrypting hosts within one hour of each other, GMT time.

"G"
giving up
Cause
This message appears in the SCSI log to indicate that a read or write operation has been retried until it timed out. With SCSI disk the time-out period is usually 30 seconds; with tape, the period is usually 20 attempts. Time-out periods are generally coded into the drivers.

Action
Check that all SCSI devices are connected and powered on. Make sure that SCSI target numbers are correct and not in conflict. Verify that all cables are no longer than a total of six meters, and that all SCSI connections are properly terminated.

Technical Notes
The scsi_log(9F) routine usually displays messages on the system console and in the /var/adm/messages file. Run the dmesg(1M) command to see the most recent message buffer.

Graphics Adapter device /dev/fb is of unknown type
Cause
The /dev/fb driver is either missing or corrupted.

Action
For details, refer to "InitOutput: Error loading module for /dev/fb".

group.org_dir: NIS+ servers unreachable
Cause
This is the second of three messages that an NIS+ client prints when it cannot locate an NIS+ server on the network.

Action
For details, refer to "hosts.org_dir: NIS+ servers unreachable".

"H"
hang console
Cause
Console hangs, but all other operations are working, including rlogin(1) and telnet(1). Rebooting the system (by way of a remote shell) clears the problem.

This problem occurs if another window is opened with the -C option, causing the console to hang. The other window could be another cmdtool window, shelltool window, or even an xterm window. Only one console window can be active at a time. 

Action
The window/process that is causing the problem can be located by using the ps(1) command (auxw options might be necessary). The process can then be killed. Eliminate the console window running with -C, and control returns to the real console.

Machine hung in reboot process: when the user is booting the machine, it hangs at checking file systems.

As a possible workaround, do the following: 

Boot miniroot from tape or CD-ROM.

Type: mkdir mnt.

Mount the root partition to some mount point (/mnt).

Change the directory to /mnt/dev.

Make sure the console is located in the mnt/dev directory.

If not, make the device std (MAKEDEV std).

Halt the system and reboot.

/home/string: No such file or directory
Cause
An attempt was made to change to a user's home directory, but either that user does not exist or the user's file server has not shared (exported) that file system.

Action
To check on the existence of a particular user, run the ypmatch(1) or nismatch(1) command, specifying the user name and then the passwd(4) map.

To export file systems from the remote file server, become superuser on that system and run the share(1M) command with the appropriate options. If that system is sharing (exporting) file systems for the first time, also invoke /etc/init.d/nfs.server start to begin NFS service.

See Also
For more information on sharing file systems, see the share_nfs(1M) man page.

Host is down
Cause
A transport connection failed because the destination host was down. For example, mail delivery was attempted over several days, but the destination machine was not available during any of these attempts.

Action
Report this error to the system administrator for the host. If you are the person responsible for this system, check to see if the machine needs repair or rebooting.

Technical Notes
This error results from status information delivered by the underlying communication interface. If there is no known connection to the host, a different message usually results. For details, refer to "No route to host".

The symbolic name for this error is EHOSTDOWN, errno=147.

host name configuration error
Cause
This is an old sendmail(1M) message, which replaced I refuse to talk to myself and is now replaced by the Local configuration error message.

Action
For details, refer to "554 hostname... Local configuration error".

hosts.org_dir: NIS+ servers unreachable
Cause
This is the third of three messages that an NIS+ client prints when it cannot locate an NIS+ server on the network.

Action
If other NIS+ clients are behaving normally, check the Ethernet cabling on the workstation showing this message. Note the following differences between architectures: 

On SPARC machines, disconnected network cabling also produces a series of no carrier messages.

On IA machines, the NIS+ messages might be the only indication that network cabling is disconnected.


If many NIS+ clients on the network are giving this message, go to the NIS+ server in question and reboot or repair it, as necessary. When the server machine is back in operation, NIS+ clients give an NIS server for domain OK message.

"I"
I can't read your attachments. What mailer are you using?
Cause
The SunView mailtool(1) and prior 3.3 OpenWindows mailtool(1) produce this message when they cannot cope with an attachment. The attachment is probably in MIME (multipurpose internet mail extensions) format, using base64 encoding.

Action
To read a mail message containing MIME attachments, use mailtool(1) from a system running at least the Solaris 2.3 release. If you are running an earlier version of the Solaris environment, rlogin(1) to a system running a later version, set the DISPLAY environment variable back to the first system, and run mailtool remotely. If those options prove impossible, ask the originator to send the message again using mailtool(1), or using the CDE dtmail compose File->SendAs->SunMailTool option.

Technical Notes
Standard MIME attachments with base64 encoding, for example, produce this message and fail to display in older mailtool(1)s.

See Also
Look into using metamail, available on the Internet, which allows you to send and receive MIME attachments.

Identifier removed
Cause
This message indicates an error in a System V IPC facility. Most likely a file associated with messaging, semaphores, or shared memory was deleted from the file system where it had been created.

Technical Notes
This error is returned to processes that resume execution after the removal of an identifier from the file system's name space. See msgctl(2), semctl(2), and shmctl(2) for details.

The symbolic name for this error is EIDRM, errno=36.

ie0: Ethernet jammed
Cause
This message can appear on SPARCservers or IA machines with an Intel 82586 Ethernet chip. It indicates that 16 successive transmission attempts failed, causing the driver to give up on the current packet.

Action
If this error occurs sporadically or at busy times, it probably means that the network is saturated. Wait for network traffic to clear. If bottlenecks arise frequently, think about reconfiguring the network or adding subnets.

Another possible cause of this message is a noise source somewhere in the network, such as a loose transceiver connection. Use snoop(1M) or a similar program to isolate the problem area, then check and tighten network connectors as necessary.

ie0: no carrier
Cause
This message can appear on SPARCservers or IA machines with an Intel 82586 Ethernet chip. It indicates that the chip has lost input to its carrier detect pin while trying to transmit a packet, causing the packet to be dropped.

Action
Check that the Ethernet connector is not loose or disconnected. Other possible causes include an open circuit somewhere in the network and noise on the carrier detect line from the transceiver. Use snoop(1M) or a similar program to isolate the problem area, then check the network connectors and transceivers, as needed.

If pipe/FIFO, don't sleep in stream head
Cause
This is a streams pipe error (not externally visible).

Technical Notes
The symbolic name for this error is ESTRPIPE, errno=92.

ifconfig: bad address
Cause
System fails to boot with this error message: ifconfig: bad address. When coming up to multi-user ifconfig -a, it indicates the following: 

le0: flags=863<UP,BROADCAST,NOTRAILERS,RUNNING,MULTICAST> mtu 1500
	inet 0.0.0.0 netmask 0  
Once up, if this command succeeds, then all is well: 

# ifconfig le0 inet hostname 


Action
Check /etc/hostname.* for a possible bad entry.

/etc/hosts was linked to /var/named/hosts and /var was a separate file system. Until system comes up in multi-user to mount /var, host name could not be resolved to proper IP address.

ifconfig bad address le0
Cause
The user installed the recommended 2.5.1 patches. When booting, rootuser.sh presented the following errors: 

ifconfig bad address le0
le0 arp - revarp failed no rarp replies
bad address hme0
hme0 auto-revarp failed: no rarp replies received. 
The IP address of interface is set to 0.0.0.0.

System fails to resolve host IP address from /etc/host and no other RARP servers responded to the system's request for its IP address.

Action
If dns [NOTFOUND=return] appears before files in /etc/nsswitch.conf, ifconfig complains at boot-time about bad address. In some cases this can cause the boot to fail.

ifconfig: host name bad space address
Cause
When the system is booted, this error message is displayed. The /etc/nsswitch.conf file had the following entries for the hosts line: 

hosts: dns nis [NOTFOUND=return] files 


Action
Move files to the first entry in the list. Now, when the system boots, it resolves the interface names from the /etc/hosts file.

ifconfig: SIOCGIFFLAGS: hme0: no such interface
Cause
If you just installed hme interface and are now manually configuring it, you could receive this error message when running the following: 


ifconfig hme0 inet ipaddr netmask + broadcast + -trailers up
 

Action
If there is no hostname.hme0 file, then the startup scripts do not execute the ifconfig hme0 plumb command. The user can either create the hostname.hme0 file or issue the ifconfig hme0 plumb command manually before attempting to configure the interface.

Illegal Instruction
Cause
A process has received a signal indicating that it attempted to execute an instruction that is not allowed by the kernel. This usually results from running programs compiled for a slightly different machine architecture. This message is usually accompanied by a core dump, except on read-only file systems.

Action
If you are booting from a CD-ROM or from the net, check Readme files to make sure you are using an image appropriate for your machine architecture. Run df(1M) to make sure there is enough swap space on the system; too little swap space can cause this error. If you recently upgraded your CPU to a new architecture, replace your operating system with one that supports the new architecture (an operating system upgrade might be required).

Technical Notes
Sometimes this condition results from a programming error, such as when a program attempts to execute data as instructions. This condition can also indicate device file corruption on your system.

Illegal instruction "0xhex" was encountered at PC 0xhex 
Cause
The machine is trying to boot from a non-boot device, or from a boot device for a different hardware architecture.

Action
If you are booting from the net, check Readme files to make sure you are using a boot image for that architecture. If you are booting from disk, make sure the system is looking at the right disk, which is usually SCSI target 3. If these solutions fail, connect a CD drive to the system and boot from CD-ROM.

Illegal seek
Cause
In this instance, using a pipe (|) on the command line does not work.

Action
Rather than using a pipe on the command line, redirect the output of the first program into a file and run the second program on that file.

Technical Notes
A call to lseek(2) was issued to a pipe. This error condition can also be fixed by altering the program to avoid using lseek(2).

The symbolic name for this error is ESPIPE, errno=29.

Image Tool: Unable to open XIL Library.
Cause
This message follows multiple multi-line XilDefaultErrorFunc errors, indicating that ImageTool could not locate the X Imaging Library. Many OpenWindows and CDE deskset programs require XIL.

Action
Run pkginfo(1) to determine what packages are installed on the system. If the following packages are not present, install them from the CD-ROM or over the net: SUNWxildg, SUNWxiler, SUNWxilow, and SUNWxilrt.

Inappropriate ioctl for device
Cause
This is a programming error.

Action
Ask the program's author to fix this condition. The program needs to be changed so it employs a device driver that can accept special character device controls.

Technical Notes
The ioctl(2) system call was given as an argument for a file that is not a special character device. This message replaces the traditional, but puzzling Not a typewriter message.

The symbolic name for this error is ENOTTY, errno=25.

INCORRECT BLOCK COUNT I=int (should be int) CORRECT?
Cause
During phase 1, fsck(1M) determined that the specified inode pointed to a number of bad or duplicate blocks. The block count should be corrected to the actual number shown.

Action
Generally you can answer "yes" to this question without harming the file system.

See Also
For more information on bad blocks, see the section on checking file system integrity in the System Administration Guide, Volume 1.

index failed:full:index preceded by saveset name
Cause
This is a server that has several clients. It seems that when the backup kicks off, many of the savesets fail with the message listed below:

godzilla                              index failed:full:index 
* godzilla:index 2 retries attempted
* godzilla:index sh: save: not found 


Action
Edit the /etc/init.d/networker file and change the nsrexecd startup line to include a -p option to specify this command search path: 

(/usr/sbin/nsr/nsrexecd -s masters -p /usr/sbin/nsr )     > /dev/console 


inetd[int]: execv /usr/sbin/in.uucpd: No such file or directory
Cause
This message indicates that the Internet services daemon, inetd(1M), tried to start up the UUCP service without the UUCP daemon existing on the system.

Action
The SUNWbnuu package must be installed before the machine can run UUCP. Run pkgadd(1M) to install this package from the distribution CD-ROM or over the network.

inetd[int]: string/tcp: unknown service
Cause
This message indicates that the Internet services daemon, inetd(1M), could not locate the TCP service specified after the first colon.

Action
Check the current machine's /etc/services file, and the NIS services map, to see if the service is described. To start this service, add an appropriate entry into the /etc/services file and possibly the services map as well. Note that NIS+ does not consult the local /etc/services file unless you put files right after nisplus on the services line of the system's /etc/nsswitch.conf file.

If you do not want to start this service, edit the system's /etc/inetd.conf file and delete the entry that tries to start it up.

See Also
For more information about NIS+, see the NIS+ and FNS Administration Guide.

inetd[int]: string/udp: unknown service
Cause
This message indicates that the Internet services daemon, inetd(1M), could not locate the UDP service specified after the first colon.

Action
For a solution, refer to "inetd[int]: string/tcp: unknown service".

inetd: Too many open files
Cause
This message can appear when someone runs a command from the shell or uses a third-party application. The sar(1) command does not indicate that the system-wide open file limit has been exceeded.

The probable cause of this message is that the shell limit has been exceeded. The default open file limit is 64, but it can be raised to 256.

Action
For a solution, refer to "Too many open files".

INIT: Cannot create /var/adm/utmpx
Cause
This console message indicates that init(1M) cannot write in the /var directory, which is usually part of the / (root) file system. Some other messages follow, and the system usually comes up single-user. The problem is often that / or /var is mounted read-only. Sometimes a brief power outage leaves the system believing that many file systems are still mounted.

Action
If /var is a separate file system on the machine and is not yet mounted, mount it now. If the file system containing /var is mounted read-only, remount it read-write with a command similar to this: 

# mount -o rw,remount / 
Then type Control-D and try to bring up the system multi-user. If that fails, the root file system is probably corrupted. Run fsck(1M) on the root file system, halt the machine, power cycle the CPU, and wait for the system to reboot. Should this problem still occur, restore the root file system from backup tapes, or re-install the system from net or CD-ROM to replace the root file system.

InitOutput: Error loading module for /dev/fb
Cause
This fatal X server error message indicates that /dev/fb, the "dumb frame buffer," is either missing or corrupted. It is usually followed by a giving up message and a few xinit(1) errors.

Action
If other devices on the system are working correctly, the most likely reason for this error is that the SUNWdfb package was removed or never installed. Insert the installation CD-ROM, change to the Solaris_2.* directory, and run the following command to install the packages SUNWdfbh and SUNWdfb (for your machine architecture): 

pkgadd -d . 


If other devices on the system are not working correctly, the system might have a corrupt /devices directory. Halt the system and boot using the -r (reconfigure) option. The system will run fsck(1M) if the /devices file system is corrupted, most likely fixing the problem. 

insertion failed: a problem with the filesystem has been detected: filesystem is probably full
Cause
With the use of automounter, ls -l of an automounted directory is giving the above error. This is a pop-up error message that forces you to press continue. However, the ls -l does not work properly.

Action
Do a df -k to see if the /var directory is completely full. Since the /var/statmon directory contains the locks for NFS, the automount fails if the /var is completely full. After the /var directory is reduced to less than 100% of the automount point, ls -l should work properly.

Interrupted system call
Cause
The user issued an interrupt signal (usually Control-C) while the system was in the middle of executing a system call. When network service is slow, interrupting cd(1) to a remote-mounted directory can produce this message.

Action
Proceed with your work; this message is strictly informational.

Technical Notes
An asynchronous signal (such as interrupt or quit), which a program was set up to catch, occurred during an internal system call. If execution is resumed after processing the signal, it will appear as if the interrupted programming function returned this error condition, so the program might exit with an incorrect error message.

The symbolic name for this error is EINTR, errno=4.

Invalid argument
Cause
An invalid parameter was specified that the system cannot interpret. For example, trying to mount an uncreated file system, printing without sufficient system support, or providing an undefined signal to a signal(3C) library function can all produce this message.

Action
If you see this message when you are trying to mount a file system, make sure that you have run newfs(1M) to create the file system. 

If you see this message when you are trying to read a diskette, make sure that the diskette was properly formatted with fdformat(1), either in DOS format, pcfs(7FS), or as a UFS file system. 

If you see this message while you are trying to print, make sure that the print service is configured correctly.

Technical Notes
The symbolic name for this error is EINVAL, errno=22.

Invalid null command
Cause
This C shell message results from a command line with two pipes (|) in a row or from a pipe without a command afterwards.

Action
Change the command line so that each pipe is followed by a command.

Invalid_SS_JWS_HOME:no C:\\lib\basicframe.properties
Cause
The user was running WinNT 4.0 and received this error message when trying to launch Java WorkshopTM.

Action
Loaded software from marimba company was removed from the user's system. The product was castanet. Afterwards, the JWS worked without problems. Apparently, the product SunTM Tuner came loaded with JDKTM, and this conflicted with JWS.

See www.marimba.com for more details on marimba products.

Another possible solution:

Double-click jws.exe within the C:\Java-WorkShop\jws\intel-win32\bin\ folder.

I/O error
Cause
Some physical Input/Output error has occurred. If the process was writing a file at the time, data corruption is possible. 

Action
First, find out which device is experiencing the I/O error. If the device is a tape drive, make sure a tape is inserted into the drive. When this error occurs with a tape in the drive, it is likely that the tape contains an unrecoverable bad spot.

If the device is a floppy drive, an unformatted or defective diskette could be at fault. Format the diskette, or obtain a replacement.

If the device is a hard disk drive, you might need to run fsck(1M) and possibly even reformat the disk.

Technical Notes
In some cases this error might occur on a call following the one to which it actually applies.

The symbolic name for this error is EIO, errno=5.

IP: Hardware address '08:00:20:xx:xx:xx' trying to be our address xxx.xxx.xxx.xxx!
Cause
The above message appears in /var/adm/messages. 

This can happen, for example, when the ATM lane device is set to promiscuous mode by running snoop -d lane0.

Action
Do not let the ATM lane device run in promiscuous mode and do not ignore the warning about it.

Technical Notes
A broadcast over ATM LAN Emulation is emulated by the broadcast and the unknown server (BUS) for the emulated LAN. If the Sun command transmits its ARP request, some switch implementations for LANE repeat the ARP request over the bus_forward channel, so that it can be seen on the local interface, again: 

----- ATM AAL5 Header -----
Packet 1 arrived at 12:12:30.42
Packet size=66 bytes
TRANSMIT : VC=75
LANE Data Frame  Type=0x0806 (ARP)
ARP:  ----- ARP/RARP Frame -----
ARP:  
ARP:  Hardware type = 1
ARP:  Protocol type = 0800 (IP)
ARP:  Length of hardware address = 6 bytes
ARP:  Length of protocol address = 4 bytes
ARP:  Opcode 1 (ARP Request)
ARP:  Sender's hardware address = 8:0:20:82:8f:91
ARP:  Sender's protocol address = 192.168.31.54, lab054-lane0
ARP:  Target hardware address = ?
ARP:  Target protocol address = 192.168.31.50, lab050-lane0
ARP:  

----- ATM AAL5 Header -----
Packet 2 arrived at 12:12:30.42
Packet size=66 bytes
RECEIVE : VC=76
LANE Data Frame  Type=0x0806 (ARP)
ARP:  ----- ARP/RARP Frame -----
ARP:  
ARP:  Hardware type = 1
ARP:  Protocol type = 0800 (IP)
ARP:  Length of hardware address = 6 bytes
ARP:  Length of protocol address = 4 bytes
ARP:  Opcode 1 (ARP Request)
ARP:  Sender's hardware address = 8:0:20:82:8f:91
ARP:  Sender's protocol address = 192.168.31.54, lab054-lane0
ARP:  Target hardware address = ?
ARP:  Target protocol address = 192.168.31.50, lab050-lane0
ARP:   
Now the request is answered: 

----- ATM AAL5 Header -----
Packet 3 arrived at 12:12:30.42
Packet size=66 bytes
RECEIVE : VC=84
LANE Data Frame  Type=0x0806 (ARP)
ARP:  ----- ARP/RARP Frame -----
ARP:  
ARP:  Hardware type = 1
ARP:  Protocol type = 0800 (IP)
ARP:  Length of hardware address = 6 bytes
ARP:  Length of protocol address = 4 bytes
ARP:  Opcode 2 (ARP Reply)
ARP:  Sender's hardware address = 8:0:20:8c:4e:f0
ARP:  Sender's protocol address = 192.168.31.50, lab050-lane0
ARP:  Target hardware address = 8:0:20:82:8f:91
ARP:  Target protocol address = 192.168.31.54, lab054-lane0
ARP:   


Normally, the reflected ARP Request is suppressed. If the lane device is set to promiscuous mode, all packets are passed to upper layers, and so the upper instances receive Sun's own packet and raise this message:


Feb 10 12:12:30 sissi unix: IP: Hardware address '08:00:20:82:8f:91' 
trying to be our address 192.168.031.054! 

Is a directory
Cause
An attempt was made to read or write a directory as if it were a file.

Action
Look at a listing of all the files in the current directory and try again, specifying a file instead of a directory.

Technical Notes
The symbolic name for this error is EISDIR, errno=21.

"J"
java.lang.UnsatisfiedLinkError:
Cause
When trying to start Java Workshop 2.0 (or some other Java applications), the following error is displayed: 

java.lang.UnsatisfiedLinkError: setCursor
        at sun.awt.motif.MComponentPeer.initialize(Compiled Code)
        at sun.awt.motif.MTextAreaPeer.initialize(Compiled Code)
        at sun.awt.motif.MComponentPeer.<init>(Compiled Code)
        at sun.awt.motif.MTextAreaPeer.<init>(Compiled Code)
        at sun.awt.motif.MToolkit.createTextArea(Compiled Code) 


Action
The LD_LIBRARY_PATH is probably set up to include a Java lib directory that does not quite match the java bin command used. For example, in the Solaris 2.6 release LD_LIBRARY_PATH = /usr/openwin/lib results in Java Workshop running properly. But setting LD_LIBRARY_PATH = /usr/java/lib:/usr/openwin/lib results in the error being displayed, since Java Workshop uses its own version of JDK and the startup process picks up a mixture of versions.

To resolve, include /usr/java/lib in your LD_LIBRARY_PATH, since it is needed only in rare circumstances (like when you are using the Java Invocation API).

"K"
kernel read error
Cause
This message appears when savecore(1M), if activated, tries to copy a debugging image of kernel memory to disk, but cannot read various kernel data structures correctly. Generally, this occurs after a system panic has corrupted the main memory. Data corruption on the system is possible. 

Action
Look at the kernel error messages that preceded this one to try to determine the cause of the problem. Error messages such as BAD TRAP usually indicate faulty hardware. Until the problem that caused the kernel panic is resolved, a kernel core image cannot be saved for debugging.

killed
Cause
A process, which attempts to allocate large amounts of memory either as an array or by using malloc, fails when launched by the shell. This problem has been seen while allocating 240,000,000 elements as either an array of doubles or using malloc to allocate the 1,920,000,000 bytes of space.

Action
This can have one of two causes. Resolve it accordingly.

1. Lack of swap space

Try running the program as root on the console; if it runs, this is not the problem. 

2. Stack size and data segment size are in conflict 

If the stack size is set too large, this can conflict with the data segment, and the process cannot be started. Setting the stack size to the default value of 8192 resolves this problem and allows the programs to start. 

Killed
Cause
This message is strictly informational. If the killed process was writing a file, some data might be lost.

Action
Continue with your work.

Technical Notes
This message from the signal handler or various shells indicates that a process has been terminated with a SIGKILL. However, if you do not see this message and cannot terminate a process with a SIGKILL, you might have to reboot the machine to remove that process.

kmem_free block already free
Cause
This is a programming error, probably from a device driver.

Action
Determine which driver is giving this message and contact the vendor for a software update, as this message indicates a bug in the driver.

Technical Notes
This message is from the DDI programming function kmem_free(9F), which releases a block of memory at address addr of size siz that was previously allocated by the DDI function kmem_alloc(9F). Both addr and siz must correspond to the original allocation. If you have source code for the driver, follow kmem_alloc(9F) and kmem_free(9F) in the code to make sure they allocate and free the same chunk of memory.

"L"
last message repeated int times
Cause
This message comes from syslogd(1M), the facility that prints messages on the console and records them in /var/adm/messages. To reduce the log size and minimize buffer usage, syslog collapses any identical messages it sees during a 20 second period, then prints this message with the number of repetitions.

Action
Look above this message to see which message was repeated so often. Then consider the repeated message and take action accordingly. If repeated log entries such as su ... failed appear, consider the possibility of a security breach.

late initialization error
Cause
Netscape enterprise server 2.0 receives these error messages from the daemon: 

late initialization error 

start up failure no such file or directory 

system will not connect to port 80 


Action
This is a file permission problem caused by someone changing the UID for the httpd user in /etc/passwd.

Change UID in /etc/passwd to the correct UID.

ld.so.1 fatal: can't set protection on segment
Cause
Applications have recently begun to fail with this error, ld.so.1 fatal: can't set protection on segment. The failures are random.

Action
This was happening because of the recent introduction of a rogue application that consumed most of the swap space on the system. The other applications, which failed randomly, were doing so because of having insufficient swap space to run. The error from ld.so.1 occurred because there was no segment on which to set the protections.

ld.so.1: string: fatal: string: can't open file: errno=2
Cause
This message is produced in releases earlier than Solaris 2.5.1. It is not produced in releases after Solaris 2.5.1.

For more information about the cause, refer to "ld.so.1: string: fatal: string: open failed: No such file or directory". It has the same cause.

Action
For the resolution, refer to "ld.so.1: string: fatal: string: open failed: No such file or directory". Their resolutions are the same.

See Also
For more information about the Linker, see the Linker and Libraries Guide.

ld.so.1: string: fatal: string: open failed: No such file or directory
Cause
This message is produced in releases after Solaris 2.5.1. It is not produced in releases before Solaris 2.5.1.

This message indicates that the runtime linker, ld.so.1(1), while running the program specified after the first colon, could not find the shared object specified after the third colon. (A shared object is sometimes called a dynamically linked library.)

Action
As a workaround, set the environment variable LD_LIBRARY_PATH to include the location of the shared object in question. For example: 

/usr/dt/lib:/usr/openwin/lib 
Better yet, if you have access to source code, recompile the program using the -Rpath loader option. Using LD_LIBRARY_PATH slows down performance.

See Also
For more information about the Linker, see the Linker and Libraries Guide.

ld.so.1: string: fatal: relocation error: string: string: referenced symbol not found
Cause
This message is produced in releases after the Solaris 2.5.1. It is not produced in the Solaris 2.5.1 or earlier releases.

The message from the runtime linker ld.so.1(1) indicates that in trying to execute the application given after the first colon, the specified symbol could not be found for relocation. The message goes on to say in what file the symbol was referenced. Because this is a fatal error, the application terminates with this message.

Action
Run the ldd -d command on the application to show its shared object dependencies and symbols that are not found. Probably your system contains an old version of the shared object that should contain this symbol. Contact the library vendor or author for an update.

Technical Notes
This error does not necessarily occur when you first bring up an application. It could take months to develop, if ordinary use of the application seldom references the undefined symbol.

See Also
For more information about the Linker, see the Linker and Libraries Guide.

ld.so.1: string: fatal: relocation error: symbol not found: string 
Cause
This message is produced in the Solaris 2.5.1 release and earlier. It is not produced in releases after the Solaris 2.5.1.

Refer to "ld.so.1: string: fatal: relocation error: string: string: referenced symbol not found". It has the same cause.

Action
For a resolution, refer to "ld.so.1: string: fatal: relocation error: string: string: referenced symbol not found". Their resolutions are the same.

Technical Notes
This error does not necessarily occur when you first bring up an application. It could take months to develop, if ordinary use of the application seldom references the undefined symbol.

See Also
For more information about the Linker, see the Linker and Libraries Guide.

le0: Memory error!
Cause
This message indicates that the network interface encountered an access time-out from the CPU's main memory. There is probably nothing wrong except system overload.

Action
If the system is busy with other processes, this error can occur frequently. If possible, try to reduce the system load by quitting applications or killing some processes.

Technical Notes
The Lance Ethernet chip timed out while trying to acquire the bus for a DVMA transfer. Most network applications wait for a transfer to occur, so generally no data gets lost. However, data transfer might fail after too many time-outs.

See Also
For more information about the Lance Ethernet chip, see the le(7D) man page. 

le0: No carrier-- cable disconnected or hub link test disabled?
Cause
Stand-alone machines with no Ethernet port connection get this error when the system tries to access the network. If the Ethernet cable is connected, this message could result from a mismatch between the machine's NVRAM settings and the Ethernet hub settings.

Action
If this message is continuous, try to save any work to a local disk.

When a machine is configured as a networked system, it must be plugged into the Ethernet with a twisted pair J45 connector.

If the Ethernet cable is plugged in, find out whether or not the Ethernet hub does a link integrity test. Then become superuser to check and possibly set the machine's NVRAM. If the hub's link integrity test is disabled, set this variable to false. 

# eeprom | grep tpe
tpe-link-test?=true
# eeprom 'tpe-link-test?=false' 
The default setting is true. If for some reason tpe-link-test? was set to false, and the hub's link integrity test is enabled, reset this variable to true.

le0: No carrier-- transceiver cable problem?
Cause
Stand-alone machines with no Ethernet port connection get this error when the system tries to access the network.

Action
If this message is continuous, try to save any work to a local disk.

When a machine is configured as a networked system, it must be plugged into the Ethernet with either a twisted pair J45 connector or thicknet 10Base-T connector (depending on the building's Ethernet cable type).

Technical Notes
Older workstations have a thicknet connection on the back, instead of a twisted pair Ethernet connection; therefore, they require a thicknet to the twisted pair transceiver to translate between cabling types.

level 15 interrupt
Cause
This error occurred on an SS20.

.lib section in a.out corrupted
Cause
This occurred while trying to exec(2) an a.out(4), which requires that a static shared library be linked in. Also, there was erroneous data in the .lib section of the a.out(4). The .lib section tells exec(2) which static shared libraries are needed. The a.out(4) is probably corrupted.

Technical Notes
The symbolic name for this error is ELIBSCN, errno=85.

LINK COUNT FILE I=i OWNER=o MODE=m SIZE=s MTIME=t COUNT... ADJUST?
Cause
During phase 4, fsck(1M) determined that the inode's link count for the specified file is wrong and asks if you want to adjust it to the value given.

Action
Generally you can answer "yes" to this question without harming the file system.

See Also
For more information on fsck(1M), see the section on checking file system integrity in the System Administration Guide, Volume 1.

Link has been severed
Cause
This error occurs when the connection to a remote machine is gone, for example after a remote procedure call is interrupted.

Technical Notes
The symbolic name for this error is ENOLINK, errno=67.

LL105W: Protocol error detected.
Cause
This error message comes from LifelineTM Mail, an unbundled PC compatibility application.

Most likely, someone set up a user account without a password.

Action
To solve this problem, assign the user a password.

ln: cannot create /dev/fb: Read-only filesystem
Cause
During device reconfiguration at boot time, the system cannot link to the frame buffer because /dev is on a read-only file system.

Action
Check that /dev/fb is a symbolic link to the hardware frame buffer, such as cgsix(7D) or tcx(7D). Ensure that the file system containing /dev is mounted read-write.

lockd[int]: create_client: no name for inet address 0xhex 
Cause
This lock daemon message usually indicates that the NIS hosts.byname and hosts.byaddr maps are not coordinated.

Action
Wait a short time for the maps to synchronize. If they do not, take steps to coordinate them.

See Also
For information on updating NIS data, see the section on NIS maps in the NIS+ and FNS Administration Guide. If you are using AnswerBook online documentation, "hosts.byaddr" is a good search string.

log_get: len is not a multiple of 4 from FW-1
Cause
The Firewall-1TM log contains this message. It is logged when one of the log files is somehow damaged, usually after a power outage or violent reboot of the system.

Action
Try the following workaround: 

 
# fwstop
# rename fw.log, fw.alog, fw.vlog
# fwstart 


Login incorrect
Cause
This message from the login(1) program indicates an incorrect combination of login name and password. There is no way to tell whether the problem comes from the login name, the password, or both. Other programs such as ftp(1), rexecd(1M), sulogin(1M), and uucp(1C) also give this error under similar conditions.

Action
Check the /etc/passwd file and the NIS or NIS+ passwd map on the local system to see if an entry exists for this user. If a user has simply forgotten the password, su(1M) and set a new one with the passwd(1) username command. This command automatically updates the NIS+ passwd map, but with NIS you will need to coordinate the update with the passwd map.

The Login incorrect problem can also occur with older versions of NIS when the user name has more than eight characters. If this is the case, edit the NIS password file, change the user name to have eight or fewer characters, and then remake the NIS passwd map.

If you cannot log in to the system as root, despite knowing the proper password, it is possible that the /etc/passwd file is corrupted. Try to log in as a regular user and su(1M) to root.

If that does not work, see the message su: No shell and follow most of the instructions given there. Instead of changing the default shell, make the password field blank in /etc/shadow.

lp hang
Cause
On a print server, the queue continues to grow but nothing comes out of the printer. The printer daemon is hung.

Action
Below is a simple procedure for flushing a hung printing queue: 

Login or switch user to root.

Issue the reject(1M) printername command to make sure no one sends any job to the printer.

Turn the power off to the printer.

If the active job appears to be causing the hang, remove it from the print queue with the cancel(1) jobnumber command and ask the owner to requeue that print job.

Shut down the print queue with the /usr/lib/lpshut command.

Remove the lock file /var/spool/lp/SCHEDLOCK and the temporary files /var/spool/lp/tmp/*/*.

Turn the printer back on.

Restart the print queue with the /usr/lib/lpsched command.

See Also
For more information on print queuing, see the System Administration Guide, Volume 2. If you are using AnswerBook online documentation, "print server" is a good search string.

"M"
Machine is not on the network
Cause
This error is remote file sharing (RFS) specific. It occurs when users try to advertise, unadvertise, mount, or unmount remote resources while the machine has not properly started a network connect.

Technical Notes
The symbolic name for this error is ENONET, errno=64.

Mail Tool is confused about the state of your Mail File.
Cause
This message appears in a pop-up dialog box whenever you ask mailtool(1) to access messages after another mail reader has modified your inbox. A request follows: Please Quit this Mail Tool.

Action
Click continue to close the dialog box, then exit mailtool(1). If you continue trying to read mail, messages deleted by the other mail reader will never appear, and mailtool(1) will fail to see any new messages.

mail: Your mailfile was found to be corrupted (Content-length mismatch).
Cause
This message comes from mail(1) or mailx(1) whenever it detects messages with a different content length than advertised. The mail(1) program tells you which message might be truncated or might have another message concatenated to it.

Two common causes of content length mismatches are the simultaneous use of different mail readers (such as mail(1) and mailtool(1)), or the use of a mail reading program (or an editor) that does not update the content-length field after altering a message.

Action
The mailx(1) program can usually recover from this error and delineate mail message boundaries correctly. Pay close attention to the message that might be truncated or combined with another message, and to all messages after that one. If a mail file becomes hopelessly corrupted, run it through a text editor to eliminate all Content-Length lines, and ensure that each message has a From (no colon) line for each message, preceded by a blank line.

To avoid mail file corruption, exit from mailtool(1) without saving changes when you are currently running mail(1) or mailx(1).

mailtool: Can't create dead letter: Permission denied
Cause
An attempt was made to send a message with mailtool(1) from a directory where the user does not have write permission, and the user's home directory is currently unavailable.

Action
Change to another directory and start mailtool(1) again, or use chmod(1) to change permissions for the directory (if possible).

mailtool: Could not initialize the Classing Engine
Cause
When a user runs mailtool(1) on a remote machine, setting the DISPLAY environment back to the local machine, this message might appear inside a dialog box window. The message also indicates that the Classing Engine must be installed to use Attachments. This problem occurs because rlogin(1) does not propagate the user's environment.

Action
Exit mailtool(1) and set your OPENWINHOME environment variable to /usr/openwin. Then run mailtool(1) again. The error message does not appear, and you can now use Attachments.

Technical Notes
Classing Engine is a new name for Tool Talk. Earlier versions of mailtool(1) said Tool Talk: TT_ERR_NOMP instead of Classing Engine.

Management Server is VPN while client is NON-VPN
Cause
When the Windows GUI (fwpolicy) is started in Firewall-1 3.0 and the login process is initiated, the error message window pops up displaying this message.

Action
The Firewall-1 GUI packages SUNWfwgui and SUNWfweui were installed in the incorrect order. First, remove the packages using pkgrm(1M). Next, install the SUNWfwgui and, then, the SUNWfweui in that order to resolve the error message.

file name may contain holes - can't swap on it.
Cause
A swap file was created with the following command: 

# mkfile -nv 50m /ab/swap_50mb 
When the user tried to add the file with 

# swap -a /ab/swap_50mb 
it failed with this message: 

/ab/swap_50mb may contain holes - can't swap on it.
/ab/swap_50mb: Error 0 


Action
Starting with the Solaris 2.0 release, -n works only when the file is to be used by the NFS system. Local swap files cannot be created with the -n option.

mbuf map full
Cause
This error has to do with mbuf allocation.

Memory address alignment
Cause
This message can occur when printing large files on a SPARCprinterTM attached to a SPARCstation 2.

Action
Replace the SPARCstation 2 CPU with one that is at the most recent dash level.

memory leaks
Cause
An application uses up more and more memory, until all swap space is exhausted.

Action
Third-party software can help identify memory leaks in their applications. If you suspect that you have a memory leak, you can use sar(1) to check on the Kernel Memory Allocation (KMA). Any driver or module that uses KMA resources, but does not specifically return the resources before it exits, can create a memory leak.

See Also
For more information on memory leaks, see the section on monitoring system activity in the System Administration Guide, Volume 2. If you are using AnswerBook online documentation, "displaying disk usage" is a good search string. Also, see the section on system resource problems in the NIS+ and FNS Administration Guide.

Message too long
Cause
A message sent on a transport provider was larger than the internal message buffer or some other network limit.

Technical Notes
The symbolic name for this error is EMSGSIZE, errno=97.

mount: /dev/dsk/string is already mounted, /string is busy, or...
Cause
While trying to mount a file system, the mount(1M) command received a "Device busy" (EBUSY) error code. Several possible reasons are: this /dev/dsk file system is already mounted on a different directory, the busy path name is the working directory of an active process, or the system has exceeded its maximum number of mount points (unlikely).

Action
Run /etc/mount to see if the file system is already mounted. If not, check to see if any shells are active in the busy directory (did the user switch to the directory by using cd(1)?), or if any processes in the ps(1) listing are active in that directory. If the reason for the error message is not obvious, try using a different directory for the mount point.

mount: giving up on: /string 
Cause
An existing server did not respond to an NFS mount request, so after retrying a number of times (default 1000), the mount(1M) command has ceased. Nonexistent servers or bad mount points produce different messages.

Action
If the RPC: Program not registered message precedes this one, the requested mount server probably did not share (export) any file systems, so it has no NFS daemons running. Have the superuser on the mount server run share(1M) on the file system, then run /etc/init.d/nfs.server start to begin NFS service.

If the requested mount server is down or slow to respond, check whether the machine needs repair or rebooting.

mount: mount-point /string does not exist.
Cause
Someone tried to mount a file system onto the specified directory, but there is no such directory.

Action
If this is the directory name you want, run mkdir(1) to create this directory as a mount point.

mount: the state of /dev/dsk/string is not okay
Cause
The system was unable to mount the file system that was specified because the super block indicates that the file system might be corrupted. This is not an impediment for read-only mounts.

Action
If you do not need to write on this file system, run mount(1M) on it using the -o ro option. Otherwise, do as one of the message continuation lines suggests and run fsck(1M) to correct the file system state and update the super-block.

See Also
For more information on using fsck(1M), see the section on checking file system integrity in the System Administration Guide, Volume 1.

Multihop attempted
Cause
This error occurs when users try to access remote resources that are not directly accessible.

Technical Notes
The symbolic name for this error is EMULTIHOP, errno=74.

"N"
Name not unique on network
Cause
The given log name is not unique.

Technical Notes
The symbolic name for this error is ENOTUNIQ, errno=80.

named [pid]: hostname.domainname has CNAME and other data (illegal)
Cause
This error message is displayed on the DNS server.

Action
This error indicates that an alias (CNAME) is associated with another type of DNS record. 

The DNS system allows you to set up an alias to a system using the CNAME record. See the following example: 

alias1		IN CNAME	host1.domain1. 


The alias alias1 cannot appear in any other type of record. Only the actual name of the host can be used. So, if you wanted to use this host as a mail exchanger, the record 

alias1		IN MX  10  host2.domain1. 
would be illegal and would produce the error.

Instead, you should use: 

host1		IN MX  10  host2.domain1. 
This remedy applies to all types of records, including HINFO and A records.

Also, this error might occur without explicitly setting the left side of a record. The DNS system defaults the left side to the last given left side. So you might have the following in a named database file: 

host1	IN A	 123.124.125.126
        IN HINFO Sun Solaris
alias1  IN CNAME host1.domain1.
        IN MX 10 host2.domain1. 
In this fragment, an implied alias1 is in the left side of the MX record. If the alias was added after the database was in use for a while, the error would suddenly occur. The MX record was legal until the CNAME was added in front of it. This example could be fixed either by reversing the order of the MX and CNAME records, or explicitly giving the host1 in the left side of the MX record.

/net/string: No such file or directory
Cause
A user tried to change directory--for example with cd(1)--to a network partition on the system specified after /net/, but this host either does not exist or has not shared (exported) any file system.

Action
To gain access to files on this system, try rlogin(1).

To export file systems from the remote system, become superuser on that system and run the share(1M) command with the appropriate options. If that system is sharing file systems for the first time, also run /etc/init.d/nfs.server start to begin the NFS service.

Network dropped connection because of reset
Cause
The host you were connected to crashed and rebooted.

Technical Notes
The symbolic name for this error is ENETRESET, errno=129.

Network is down
Cause
A transport connection failed because it encountered a dead network.

Action
Report this error to the system administrator for the network. If you are the person responsible for this network, check why the network is dead and what repairs are necessary.

Technical Notes
This error results from status information delivered by the underlying communication interface.

The symbolic name for this error is ENETDOWN, errno=127.

Network is unreachable
Cause
An operational error occurred either because there was no route to the network or because negative status information was returned by intermediate gateways or switching nodes.

The returned status is not always sufficient to distinguish between a network that is down and a host that is down. See the No route to host message.

Action
Check the network routers and switches to see if they are disallowing these packet transfers. If they are allowing all packet transfers, check network cabling and connections.

Technical Notes
The symbolic name for this error is ENETUNREACH, errno=128.

NFS getattr failed for server string: RPC: Timed out
Cause
This message appears on an NFS client that requested a service from an NFS server that has failing hardware. Often the message NFS read failed appears along with this message. If the server were merely down or slow to respond, the NFS server not responding message would appear instead. Data corruption on the server system is possible. 

Action
Because this message usually indicates server hardware failure, initiate repair procedures as soon as possible. Check the memory modules, disk controllers, and CPU board.

See Also
For more information on NFS tuning, see the chapter on monitoring network performance in the System Administration Guide, Volume 2.

nfs mount: Couldn't bind to reserved port
Cause
This message appears when a client attempts to use NFS to mount a file system from a server that has more than one Ethernet interface configured on the same physical subnet.

Action
Always connect multiple Ethernet interfaces on one router system to different physical subnetworks.

nfs mount: mount: string: Device busy
Cause
This message appears when the superuser attempts to NFS mount on top of an active directory. The busy device is actually the working directory of a process.

Action
Determine which shell on the workstation is currently located below the mount point, and change that directory. Be wary of subshells (such as su(1M) shells) that could be in different working directories while the parents remain below the mount point.

NFS mount: /string mounted OK
Cause
While booting, the system failed to mount the directory specified after the first colon, probably because the NFS server involved was down or slow to respond. The mount ran in the background and successfully contacted the NFS server.

Action
This is strictly an informative message to notify you that the mount process has completed.

NFS mounted callog file Unsupported.
Cause
After installing the Solaris 2.6 software on a system, when users try to start their calendars either with CDE's calendar manager (/usr/dt/bin/dtcm) or OpenWindows' calendar (/usr/openwin/bin/cm), they see this dialog box: 

Calendar :Informational - NFS mounted callog file Unsupported.
Your default startup Calendar file appears to be NFS mounted or
a symlink to the same.  This is Not Supported.
			Continue 
The following error is displayed in the console window when the Continue button is clicked: 

date time host rpc.cmsd[pid]: rpc.cmsd : 
	NFS mounted callog file Not Supported - user@host
date time host rpc.cmsd[pid]: rpc.cmsd : 
	NFS mounted callog file Not Supported - user@host
 
The calendars would have worked under the Solaris software versions including and prior to 2.5.1, however.

Action
It has long been known that NFS-mounted calendars are not supported. The calendar can be corrupted when more than one person uses the calendar at the same time. If two rpc.cmsd daemons write to the callog file at the same time, the file becomes corrupt. However, two rpc.cmsd daemons could be run simultaneously on the Solaris 2.5.1 release, even though this is not a supported configuration.

With the Solaris 2.6 release, this concurrency is no longer an option. rpc.cmsd does not allow the user to start a calendar that is NFS-mounted and produces the previous error message.

NFS read failed for server string 
Cause
This message generally indicates a permissions problem. Perhaps a directory or file permission was changed while the client kept the file open. Perhaps the file system's share or netgroup permissions changed. If the server were down or the network overloaded, the NFS server not responding message would appear instead.

Action
Log in to the NFS server and check the permissions of directories leading to the file. Make certain that the file system is shared with (exported to) the client experiencing an NFS read failure.

See Also
For more information, see the chapter on NFS troubleshooting in the System Administration Guide, Volume 3.

nfs_server: bad getargs for int/int 
Cause
This message comes from the NFS server when it receives a request with unrecognized or incorrect arguments. Typically, it means the request could not be XDR decoded properly. This error can result from corruption of the packet over the network, or from an implementation bug causing the NFS client to encode its arguments improperly.

Action
If this message originates from a single client, investigate that machine for NFS client software bugs. If this message appears throughout a network, especially accompanied by other networking errors, investigate the network cabling and connectors.

NFS server string not responding still trying
Cause
In most cases this common message indicates that the system has requested a service from an NFS server that is either down or extremely slow to respond. In some cases this message indicates that the network link to this NFS server is broken, although usually that condition generates other error messages as well. In a few cases this message indicates NFS client setup problems.

Action
Check the non-responding NFS server for the need for machine repair or rebooting. Encourage your user community to report such problems quickly but only once.

See Also
For more information, see the chapter on NFS troubleshooting in the System Administration Guide, Volume 3.

NFS server string ok
Cause
This message is the follow-up to the NFS server not responding error. It indicates that the NFS server is again operating.

Action
When an NFS server first starts, it is busy fulfilling client requests for a while. Be patient and wait for your client system to respond. Making many extraneous requests only further slows the NFS server response time.

NFS string failed for server string: error int (string)
Cause
The failed NFS operation could be any one of the following: getattr, setattr, lookup, access, readlink, read, write, create, mkdir, symlink, mknod, remove, rmdir, rename, link, readdir, readdir+, fsstat, fsinfo, pathconf, or commit.

See Also
For more information on NFS, see the System Administration Guide, Volume 3.

nfs umount: string: is busy
Cause
This message appears when the superuser attempts to unmount an active NFS file system. The busy point is the working directory of a process.

Action
Determine which shell (or process) on the workstation is currently located in the remotely mounted file system, and change--cd(1)--out of that directory. Be wary of subshells (such as su(1M) shells) that could be in different directories while the parent shells remain in the NFS file system.

NFS write error on host string: No space left on device.
Cause
This console message indicates that an NFS-mounted partition has filled up and cannot accept writing of new data. Unfortunately, software that attempts to overwrite existing files will usually zero-out all data in these files. This is particularly destructive on NFS-mounted /home partitions. 

Action
Find the user or process that is filling up the file system, and stop the out-of-control process as soon as you can. Then delete files as necessary to create more space on the file system (large core(4) files are good candidates for deletion). Have users write any modified files to local disk if possible. If this error occurs often, redistribute directories to ease the demand on this partition.

See Also
For more information on disk usage, see the System Administration Guide, Volume 2. If you are using AnswerBook online documentation, "managing disk use" is a good search string.

NFS write failed for server string: RPC: Timed out
Cause
This error can occur when a file system is soft mounted, and server or network response time lags. Any data written to the server during this period could be corrupted. 

Action
If you intend to write on a file system, never specify the soft-mount option. Use the default hard mount for all the file systems that are mounted read-write.

See Also
For more information, see the chapter on NFS troubleshooting in the System Administration Guide, Volume 3.

NIS+ authentication failure
Cause
This is a Federated Naming Service message. The operation could not be completed because the principal making the request could not be authenticated with the name service involved. 

Action
Run the nisdefaults(1) command to verify that you are identified as the correct NIS+ principal. Also check that the system has specified the correct public key source.

See Also
For more information, see the authentication and authorization overview in the NIS+ and FNS Administration Guide.

nis_cachemgr: Error in reading NIS cold start file : '/var/nis/NIS_COLD_START'
Cause
After installing patches 104331-04 and 103612-33, nis_cachemgr(1M) failed to start. The symptoms are as follows during the reboot: 

Sep 11 16:34:00 nis_cachemgr: Error in reading NIS cold start file : 
          '/var/nis/NIS_COLD_START' 
Additionally, nis_cachemgr(1M) is not running after login. Trussing nis_cachemgr(1M) showed that it is reading /var/nis/NIS_COLD_START and immediately reporting an error. Neither reinitializing the client nor copying NIS_COLD_START helps.

Action
This error is a timing problem. Put a sleep(1) before the NIS+ initialization in /etc/init.d/rpc, after rpc.bind has been started. rpc.bind is slow initializing and needs a few extra seconds before nis_cachemgr(1M) takes effect.

No buffer space available
Cause
An operation on a transport endpoint or pipe was not performed because the system lacked sufficient buffer space or because a queue was full. The target system probably ran out of memory or swap space. Any data written during this condition is probably lost.

Action
To add more swap area, use the swap -a command on the target system. Alternatively, reconfigure the target system to have more swap space. As a general rule, swap space should be two to three times as large as physical memory.

Technical Notes
The symbolic name for this error is ENOBUFS, errno=132.

No child processes
Cause
This message can appear when an application tries to communicate with a cooperating process that does not exist.

Action
Restart the parent process so it can create the child processes again. If that does not help, this error could be the result of a programming error; contact the vendor or author of the program for an update.

Technical Notes
A wait(2) system call was executed by a process that had no existing or unwaited-for child processes. The child processes could have exited prematurely, or might never have been created.

The symbolic name for this error is ECHILD, errno=10.

No default media available
Cause
The volume manager issues this message if a user makes an eject(1) request when the drives contain no diskette or CD-ROM to eject.

Action
Insert a diskette or CD-ROM. If the volume manager is confused and a diskette or CD-ROM is actually in a drive, run volcheck(1) to update the volume manager. If the system remains confused, try booting with the -r option to reconfigure devices.

No directory! Logging in with home=/
Cause
The login(1) program could not find the home directory listed in the password file or NIS passwd(4) map, so it deposited the user in the root directory.

Action
Check that the user's home directory is mounted and is owned by and accessible to that user. Perhaps the automounter tried to mount the home directory, but the NFS server did not respond quickly enough. Try listing the files in /home/username. If the NFS server responds to this request, have the user log out and log in again.

The automounter daemon might not be running. Run the ps(1) command to see if automountd(1M) is present. If not, run the second command; if it appears to be wedged, run both these commands: 

# /etc/init.d/autofs stop
# /etc/init.d/autofs start 
When the automounter daemon is running, verify that the /etc/auto_master file has a line like this: 

/home  auto_home 
Verify that the /etc/auto_home file has a line like this: 

+auto_home 
These entries depend on the NIS auto_home map.

Also, the NFS server might not have shared (exported) this /home directory, or the NFS daemons on the server might have disappeared.

See Also
For more information on NFS, see the System Administration Guide, Volume 3.

No message of desired type
Cause
An attempt was made to receive a message of a type that does not exist on the specified message queue. See the msgsnd(2) and msgrcv(2) man pages for details.

Action
This message indicates an error in the System V IPC message facility. Generally the message queue is empty or devoid of the desired message type while IPC_NOWAIT is set.

Technical Notes
The symbolic name for this error is ENOMSG, errno=35.

No recipients specified
Cause
This message comes from the mailx(1) command whenever a user does not provide an address in the To: field.

Action
For details, refer to "Recipient names must be specified".

No record locks available
Cause
No more record locks are available. The system lock table is full.

Action
Try again later, when more locks might be available.

Technical Notes
The symbolic name for this error is ENOLCK, errno=46.

Perhaps a process called fcntl(2) with the F_SETLK or F_SETLKW option, and the system maximum was exceeded. The system contains several different locking subsystems, including fcntl(2), the NFS lock daemon, and mail locking. All subsystems can produce this error.

No route to host
Cause
An operational error occurred because there was no route to the destination host, or because of status information returned by intermediate gateways or switching nodes.

The returned status is not always sufficient to distinguish between a host that is down and a network that is down. Refer to "Network is unreachable".

Action
Check that the network routers and switches are not disallowing these packet transfers. If they are allowing all packet transfers, check network cabling and connections.

Technical Notes
The symbolic name for this error is EHOSTUNREACH, errno=148.

No shell Connection closed
Cause
A user has attempted a remote login to the system, and has a valid account name and password, but the shell specified for the account is not available on that system. 

Action
If you have a copy of the requested shell, become superuser and install the missing shell on that system. Otherwise, change the user's password file entry--perhaps only in the NIS+ or NIS passwd(4) map--to specify an available shell such as /bin/csh or /bin/ksh.

No space left on device
Cause
While writing an ordinary file or creating a directory entry, there was no free space left on the device. The disk, tape, or diskette is full of data. Any data written to that device during this condition can be lost.

Action
Remove unneeded files from the hard disk or diskette until there is space for all the data you are writing. You also might move some directories onto another file system and create symbolic links accordingly. When a tape is full, continue on another one, use a higher-density setting, or obtain a higher-capacity tape.

To create multi-volume tapes or diskettes, use the pax(1) or cpio(1) command; tar(1) is still limited to a single volume.

Technical Notes
The symbolic name for this error is ENOSPC, errno=28.

No such device
Cause
An attempt was made to apply an operation to an inappropriate device, such as writing to a nonexistent device.

Action
Check the /devices directory to find out why this device does not exist, or why the program expects it to exist. The similar No such device or address message tends to indicate I/O problems with an existing device, whereas this message tends to indicate a device that does not exist at all.

Technical Notes
The symbolic name for this error is ENODEV, errno=19.

No such device or address
Cause
This error can occur when a tape drive is offline or when a device has been powered off or removed from the system.

Action
For tape drives, make sure the device is connected, powered on, and toggled online (if applicable). For disk and CD-ROM drives, check that the device is connected and powered on.

With all SCSI devices, ensure that the target switch or dial is set to the number where the system originally mounted it. To inform the system of a change to the target device number, reboot using the -r (reconfigure) option.

Technical Notes
This message results from I/O to a special file's subdevice that either does not exist or that exists beyond the limit of the device.

The symbolic name for this error is ENXIO, errno=6.

No such file or directory
Cause
The specified file or directory does not exist. Either the file name or path name was entered incorrectly.

Action
Check the file name and path name for correctness and try again. If the specified file or directory is a symbolic link, it probably points to a nonexistent file or directory.

Technical Notes
The symbolic name for this error is ENOENT, errno=2.

no such map in server's domain
Cause
A user or an application tried to look up something using Network Information Services (NIS), but NIS has no corresponding database for this request.

Action
Check the following: 

Make sure the NIS map name is spelled correctly. To see a list of nicknames for the various NIS maps, run the ypcat -x command.

To see a full list of the various NIS maps (databases), run the ypwhich -m command. 

If the NIS service was not running on the current machine, these commands would result in this message: "can't communicate with ypbind".


No such process
Cause
This process cannot be found. The process could have finished execution and disappeared, or it might still be in the system under a different numeric ID.

Action
Use the ps(1) command to check that the process ID you are supplying is correct.

Technical Notes
No process corresponds to the specified process ID (PID), lightweight process ID, or thread_t.

The symbolic name for this error is ESRCH, errno=3.

No such user as string-- cron entries not created
Cause
A file exists in /var/spool/cron/crontabs for the specified user, but this user is not in /etc/passwd or the NIS passwd(4) map. The system cannot create cron(1M) entries for nonexistent users.

Action
To eliminate this message at boot time, remove the cron file for the nonexistent user, or rename it if the user's login name has changed. If this is a valid user, create an appropriate password entry for this name.

No utmpx entry
Cause
During login, file system full errors are seen and the login fails with the message No utmpx entry.

This error is caused by a full file system. The system has no space to write its utmpx (login information) entry.

Action
To correct this condition the system must be booted into single user mode. Then clear (do not delete) these files: /var/adm/utmp and /var/adm/utmpx. This can be done by typing: 

#cat /dev/null > /var/adm/utmp
#cat /dev/null > /var/adm/utmpx 
These commands zero-out the files but keep them with the correct permissions. 

In some cases, after clearing these files, the /var file system might still be full. In this case type: 

du -askd /var |sort -nr |more 
This command gives you a listing of the files from largest to smallest in the /var file system. To create space you can zero these files: /var/cron/log, /var/spool/lp/logs, and /var/adm/messages. You can also check /.wastebasket for large files to delete.

no valid fm license
Cause
The firewall gives you this error when the proper module is not updated.

Action
When you run the VPN version, you need to use the module fwmodvpn 5.x.o. To make the update, you can follow these steps: 

# fwstop
# cd $FWDIR/modules
# mv fwmod.5.x.o old.fwmod.5.x.o
# ln -s fwmodvpn.5.x.o fwmod.5.x.o
# fw putlic 0 0-0-0 0       # For Firewall-1 2.x)
# fw putlic -K              # 3.x Firewall)
# fwstart 


no VTOC
Cause
In this case, the user installs the Solaris 2.6 IA software and receives this error when rebooting the system. Other error messages refer to not having a default boot device configured, but this is the usual error message. This error leaves the system unusable; the user cannot boot.

Action
The user needs to do the following: 

Insert the Solaris 2.6 software CD in the drive.

Boot with the Device Configuration Assistant diskette.

Select the CD-ROM to boot when presented with the available devices.

Type b -s when asked to select either Interactive or Jumpstart to boot as a single user.

At the # prompt, type the following: 

# mount /dev/dsk/cxdxpx /a   (where "x" is information from your system)
 

# TERM=at386; export TERM 
# cd /a/platform/i86pc/boot/solaris/devicedb 


In this directory is a file called master. BEFORE EDITING this file, make a backup copy. After it is backed up, view the master file in vi. Look for the term ata.bef and replace it with the word none.

Run touch /reconfigure and then reboot the system. (The command boot -r, reboot -- -r also works.)

Not a data message
Cause
During a read(2), getmsg(2), or ioctl(2) I_RECVFD call to a STREAMS device, some data has come to the head of the queue that cannot be processed. That data depends on the call: 

read(2) -- Controls information or passes a file descriptor

getmsg(2) -- Passes a file descriptor

ioctl(2) -- Controls data information

Technical Notes
The symbolic name for this error is EBADMSG, errno=77.

Not a directory
Cause
A non-directory was specified where a directory is required, such as a path prefix or an argument to the chdir(2) call.

Action
Look at a listing of all the files in the current directory and try again, specifying a directory instead of a file.

Technical Notes
The symbolic name for this error is ENOTDIR, errno=20.

Not a stream device
Cause
A putmsg(2) or getmsg(2) system call was attempted on a file descriptor that is not a STREAMS device.

Technical Notes
The symbolic name for this error is ENOSTR, errno=60.

Not enough space
Cause
This message indicates that the system is running many large applications simultaneously and has run out of swap space (virtual memory). It could also indicate that applications failed without freeing pages from the swap area. Swap space is an area of disk set aside to store portions of applications and data not immediately required in memory. Any data written during this condition is probably lost.

Action
Reinstall or reconfigure the system to have more swap space. A general rule is that swap space should be two to three times as large as physical memory. Alternatively, use mkfile(1M) and swap(1M) to add more swap area. This example shows how to add 16 Mbytes of virtual memory in the /usr/swap file (any file system with enough free space would work): 

# mkfile 16m /usr/swap
# swap -a /usr/swap 
To make this reconfiguration automatic at boot time, add the following line to the /etc/vfstab file: 

/usr/swap   -   -   swap   -   no   - 


Technical Notes
When calling the fork(2), exec(2), sbrk(2), or malloc(3C) routine, a program asked for more memory than the system could supply. This is not a temporary condition; swap space is a system parameter.

The symbolic name for this error is ENOMEM, errno=12.

not found
Cause
This message indicates that the Bourne shell could not find the program name given as a command.

Action
Check the form and spelling of the command line. If that data looks correct, do a echo $PATH to see if the user's search path is correct. When communications are garbled, it is possible to unset a search path to such an extent that only built-in shell commands are available. Below is a command to reset a basic search path: 

$ PATH=/usr/bin:/usr/ccs/bin:/usr/openwin/bin:. 
If the search path looks correct, check the directory contents along the search path for missing programs or directories that are not mounted.

Not login shell
Cause
This message results when a user tries to use the logout(1) command from a shell other than the one started at login time.

Action
To quit a non-login shell, use the exit(1) command. Continue doing so until you have logged out.

See Also
For more general information on the login shell, see the section on customizing your work environment in the Solaris Advanced User's Guide.

Not on system console
Cause
A user tried to use the login(1) command to a system as the superuser (uid=0, which is not necessarily root) from a terminal other than the console.

Action
Log in to that system as a normal user, then run su(1M) to become superuser. To allow superuser logins from any terminal, comment out the CONSOLE line in /etc/default/login (this is not recommended for security reasons).

Not owner
Cause
Either an ordinary user tried to do something reserved for the superuser, or the user tried to modify a file in a way restricted to the file's owner or to the superuser.

Action
Switch user to root and try again.

Technical Notes
The symbolic name for this error is EPERM, errno=1.

Not supported
Cause
This version of the system does not support the feature requested, although future versions of the system might provide support.

Action
This is generally not a system message from the kernel, but an error returned by an application. Contact the vendor or author of the application for an update.

Technical Notes
The symbolic name for this error is ENOTSUP, errno=48.

NOTICE: /string: out of inodes
Cause
The file system specified after the first colon probably contains many small files, exceeding the per-file system limit for inodes (file information nodes).

Action
If many small files were created unintentionally, remove them to resolve the problem. Otherwise, follow these steps to increase file system capacity for small files: 

Make several backup copies of the file system on different tapes (for safety).

Change the machine to single-user mode.

Use the newfs(1M) command with the -i option to increase inode density for this file system. The following is an example: 

# newfs -i 1024 /dev/rdsk/partition
 

Restore the file system from a backup tape. 


--------------------------------------------------------------------------------
Note - 
Increasing the inode density slightly reduces the total file system capacity.


--------------------------------------------------------------------------------

NOTICE: vxvm: unexpected status on close
Cause
Every time the system boots (or is shut down), the message is displayed on the console. Sometimes the following message is also displayed on the console and in the /var/adm/messages file: 

	WARNING:
/iommu@0,10000000/sbus@0,10001000/SUNW,soc@2,0/SUNW,pln@a0000000,74127a/ssd@4,2
(ssd22):
	Error for Command: <undecoded cmd 0x35>       Error Level: Fatal
	Requested Block: 0      Error Block: 0
	Vendor: CONNER                  Serial Number: 93081LPT
	Sense Key: Aborted Command
	ASC: 0xb3 (<vendor unique code 0xb3>), ASCQ: 0x0, FRU: 0x0
	WARNING:
/iommu@0,10000000/sbus@0,10001000/SUNW,soc@2,0/SUNW,pln@a0000000,74127a/ssd@4,2
(ssd22): ssd_synchronize_cache failed (5)

Action
In a High Availability system with NVRAM, this error would be caused by unprocessed data in a NVRAM cache of the active logical host that has been down and started again later. Because of the possibility of error, NVRAM should not be used in an HA system. The problem can be solved in this case by removing the NVRAM on the HA system.

In a non-HA system, this error can also be caused by stale data in the NVRAM cache. (The example commands that follow assume the controller for the array is c1.) To fix for a non-HA system: 

Turn off all fast writes on this array and sync any remaining pending writes: 

# ssaadm fast_write -d c1
# ssaadm sync_cache c1 


When you sync the fast writes to the array, all pending writes are physically made to the disks. Anything that is left in the cache is stale; thus, it is safe to purge it. Run this command: 

# ssaadm purge c1  


Turn on the fast writes for the disks. This command might be different on your system, depending on the disks where you want fast writes enabled and the types of fast writes you want: 

# ssaadm fast_write -s -e c1 


nsrck: SYSTEM error, more space needed to compress [client] index, 8.1 MB required
Cause
In networker, you cannot use the Remove Oldest Cycle feature because the /nsr file system is too full to perform a remove. An error message appears in the console window indicating that the file system is full.

Action
Stop the networker daemons so that some of the indexes can be moved. In the SunOS 5 system, use /etc/init.d/networker stop. In the SunOS 4 system, use ps -ef | grep nsr and kill(1) the processes.

Find a file system with enough space to move one of the client's indexes. Only one of the client's indexes should be moved, not the networker server's index. To find the size of a client's index, go to /nsr/index/clientname/db and list the contents using ls -l. The database file can be large (possibly over 500 Mbytes).

Move the contents of a client's index to the other file system and check that /nsr has freed the space to use. You might need to unmount and remount /nsr, or even to reboot to designate the space freed by the move, as available.

After the space is available, restart the daemons.

Open nwadmin. Under Clients--Indexes, select a client and use Remove Oldest Cycle to free more space.

Use Reclaim Space to reclaim the space from the removed cycles. After a few of the old cycles have been removed, enough space should be in the file system to move the removed client's index back.

Stop the daemons, and move the client's index back to /nsr/index/clientname.

Restart the daemons. Remove the oldest cycles for the client that was just moved.

Tweaking of the browse policy and retention policy might be necessary to prevent this situation from happening in the future.

Otherwise, as long-term solutions, add more hard disk and run growfs, or move /nsr to a drive with more space on it.

"O"
Object is remote
Cause
This error occurs when users try to share a resource that is not on the local machine, or try to mount/unmount a device or path name that is on a remote machine.

Technical Notes
The symbolic name for this error is EREMOTE, errno=66.

ok
Cause
This is the OpenBoot PROM monitor prompt. From this prompt, you can boot the system (from disk, CD-ROM, or net), or you can use the go command to continue where you left off.

Action
If you suddenly see this prompt, look at the messages above it to see if the system crashed. If no other messages appear, and you just typed Stop-A or plugged in a new keyboard, type go to continue. You might need to Refresh the window system from its Workspace Menu.

Technical Notes
Never invoke sync from the prompt without first running the fsck(1M) command, especially if the file system has changed.

open: no such device or address from FW-1
Cause
The FW-1 has been installed on a disk other than the default root disk. If the Default Filter option is set (allowing a default filter to be automatically installed during boot), FW-1 tries to load the default security policy from $FWDIR, but the partition that contains $FWDIR is not yet mounted. This mismatch causes this error.

Action
To work around this problem, follow these steps: 

# cp /$FWDIR/modules/fwmod.5.x.0 /etc/fw.boot/
# cp /$FWDIR/modules/fw.mkdev /etc/fw.boot/
# cp /$FWDIR/modules/fw.conf /etc/fw.boot/ 
Go to /usr/kernel/drv and change the links as follows: 

fw -> /etc/fw.boot/fwmod.5.x.0
fw.conf -> /etc/fw.boot/fw.conf 


Operation already in progress
Cause
An operation was attempted on a non-blocking object that already had an operation in progress.

Technical Notes
The symbolic name for this error is EALREADY, errno=149.

Operation canceled
Cause
The associated asynchronous operation was canceled before completion.

Technical Notes
The symbolic name for this error is ECANCELED, errno=47.

operation failed [error 185], unknown group error 0, string 
Cause
When you use admintool to add a user to a newly created group, admintool issues this error.

Action
Apply patch 101384-05 to fix bug ID 1151837 and to provide a workaround for bug ID 1153087.

Operation not applicable
Cause
This error indicates that no system support exists for a function that the application requested.

Action
Ask the system vendor for an upgrade, or contact the vendor or author of the application for an update.

Technical Notes
This message indicates that no system support exists for an operation. Many modules set this error when a programming function is not yet implemented. If you are writing a program that produces this message, while calling a system library, find and use an alternative library function. Future versions of the system might support this operation; check system release notes for further information.

The symbolic name for this error is ENOSYS, errno=89.

Operation not supported on transport endpoint
Cause
As an example, this error could occur when trying to accept a connection on a datagram transport endpoint.

Technical Notes
The symbolic name for this error is EOPNOTSUPP, errno=122.

Operation now in progress
Cause
An operation that takes a long time to complete (such as a connect) was attempted on a non-blocking object.

Technical Notes
The symbolic name for this error is EINPROGRESS, errno=150.

/opt/bin/jws: /solaris/bin/locate_dirs: not found
Cause
This error message occurs if you try to start Java Workshop by linking from /opt/bin/jws to /opt/SUNWjws/JWS/sparc-S2/bin/jws. Typing the full path name works, but typing jws gives this error.

Action
This error occurs because /opt/bin/jws is not /opt/SUNWjws/JWS/sparc-S2/bin/jws, which is a script that runs another script: $_SS_JWS_HOME/solaris/bin/locate_dirs.

/opt/bin/jws is not setting $_SS_JWS_HOME correctly. Remove it from the path and replace it with /opt/SUNWjws/JWS/sparc-S2/bin/jws. Then, which jws can return /opt/SUNWjws/JWS/sparc-S2/bin/jws.

Option not supported by protocol
Cause
A bad option or level was specified when getting or setting options for a protocol.

Technical Notes
The symbolic name for this error is ENOPROTOOPT, errno=99.

out of memory
Cause
Hundreds of different programs can produce this message when the system is running many large applications simultaneously. This message usually means that the system has run out of swap space (virtual memory).

Action
For details, refer to "Not enough space". Any data written during this condition is probably lost.

Out of stream resources
Cause
During a STREAMS open, either no STREAMS queues or no STREAMS head data structures were available. This is a temporary condition; you might recover from it if other processes release resources.

Technical Notes
The symbolic name for this error is ENOSR, errno=63.

overlapping swap volume
Cause
After creating volumes in rootdg to be used as additional swap and adding these to the /etc/vfstab file, an error message is displayed at boot time that indicates overlapping swap volumes. 

Action
Change the names of these volumes to read swap1, swap2, and so forth.

If you still get this message after making the previous change, edit the /sbin/swapadd script. Find the line: 

c=`$SWAP -l | grep -c '\\<'${special}'\>'` 
and change it to: 

c=`$SWAP -l | grep -c ''${special}''` 

"P"
Package not installed
Cause
This error occurs when a user attempts to use a system call from a package that has not been installed.

Technical Notes
The symbolic name for this error is ENOPKG, errno=65.

page_create: invalid flag
Cause
This error occurs after a vxvm upgrade. In this case, the user had the drivers (vxio and vxspec) for the Solaris 2.5.1 software and not for the Solaris 2.6 software. This condition was verified by using ls -l /kernel/drv/*vx*. 

Action
Execute a pkgrm or re-install VXVM 2.4 and re-encapsulate the root.

Panic
Cause
A system panics and crashes when a program exercises an operating system bug. Although the crash might seem unfriendly to a user, the sudden stop actually safeguards the system and its data from further corruption.

In addition to stopping the operating system, the panic routine copies the memory contents in use to a dump device, recording critical information about the current state of the CPU from which the panic routine was called.

Because the primary swap device is usually the default dump device, the primary swap device should be large enough to hold a complete image of memory. The system tries to reboot after the memory image is saved.

If the system does not reboot successfully, consider these possibilities:

Catastrophic hardware failure, such as faulty memory or a crashed disk

Major kernel configuration faults, such as an unstable device driver

Major kernel-tuning errors, such as a too-large value for MAXUSERS

Data corruption, including corruption of the operating system files

Manual intervention needed, as when fsck(1M) expects answers to its queries

Action
To find out why a system crashed, you can look in the /var/adm/message* log files.

Of these methods, using savecore(1M) is the most informative. The savecore(1M) command transfers the system crash dump image generated by the panic routine from the dump device to a file system. The image can then be analyzed with a debugger, such as adb(1).

See Also
Correctly setting up savecore(1M) and interpreting its results can be difficult. For more information about debugging system panics, refer to Panic! UNIX System Crash Dump Analysis by Chris Drake and Kimberley Brown (ISBN 0-13-149386-8).

panic -boot: Could not mount filesystem
Cause
The first problem comes from the following jumpstart error: 

2ec00 RPC: Can't decode result.
whoami RPC call failed with rpc status: 2
panic - boot: Could not mount filesystem.
program terminated
ok 
Normally, this error occurs when the boot process is unable to get to the install image. 

Additionally, other users have the same error message, with an additional message: 

'Timeout waiting for ARP/RARP packet...' 


Action
To solve the first problem: 

Check how the dfstab(4) (/etc/dfs/dfstab on the install image NFS server) looks: 

share -F nfs -o ro,anon=o /jumpstart-dir 


Run share(1M) command on the installed image NFS server, to make sure it is shared properly.

Check /etc/bootparams file on the net install server. Look for entries with incorrect boot path.

Make sure that /usr/sbin/rpc.bootparamd is running on the boot server. If necessary, kill and restart it.

Check /etc/ethers on the boot server for duplicate or conflicting entries.

At the prompt, run test net /test-net and/or watch net /watch-net to test the network connectivity.

As a workaround for the second problem, check the nsswitch.conf(4) file. If some of the entries point to NIS, such as: 

rpc		nis	files
hosts		nis	files
ethers		nis	files
bootparams	files   nis 
change all of these entries to files first: 

rpc		files 	nis
hosts		files 	nis
ethers		files	nis
bootparams	files	nis 


--------------------------------------------------------------------------------
Note - 
You might have to update these files manually if they do not contain information on the client machine you are trying to jumpstart. 


--------------------------------------------------------------------------------

Then, remove the client with rm_install_client(1M), remove the contents of tftpboot, and again add the client: 

add_install_client -c /jumpstart-dir/profiles  'client name'  'arch' 


Panic on cpu 0: valloc'd past tmpptes
Cause
The machine is an SS20 with 256 Mbytes of RAM, an FDDI interface, and a single CPU. It is running Online Disksuite for mirroring and striping. The following recommended kernel patches were installed: 

102517-03 
102436-02 
102394-02 
102516-06 
After their installation, the machine was rebuilt to allow for the new patches to be implemented. However, the machine panicked immediately after loading the kernel with this error message.

Action
The kernel was rebuilt with a new MAXUSERS value of 96, and this kernel enabled the machine to boot properly.

Technical Notes
Information directly related to this situation was not available; however, there was a description of another type of panic that was related to seg_u. In that description, the MAXUSERS value was set too large, causing the kernel to overrun table space. Furthermore, the value of MAXUSERS varies among the different architectures and the different revisions of the OS and is directly related to the amount of physical RAM in the system in an inverse proportion. Further investigation revealed that the value of MAXUSERS was set to 128. Based on the related information, it seems that the panic was due to valloc attempting to define memory space in excess of the value of tmpptes.

PARTIALLY ALLOCATED INODE I=int CLEAR?
Cause
Probably the system crashed in the middle of a sync(2) or write(2) operation, and during phase 1, fsck(1M) found that the specified inode was neither allocated nor unallocated. 

Action
If any directory entries point to this inode and you answer "yes" to this question, phase 2 might get UNALLOCATED messages. Carefully exit fsck(1M) and run ncheck(1M)--specifying the inode number after the -i option--to determine which file or directory is involved. You might be able to restore this file or directory from another system. fsck(1M) also might copy this file to the lost+found directory in a later phase.

See Also
For more information, see the chapter on checking file system integrity in the System Administration Guide, Volume 1.

passwd: Changing password for string 
Cause
The following lines are put into /etc/nsswitch.conf: 

passwd:     compat
passwd_compat:     nis   
Then, when passwd is run, it fails as follows: 

server1% passwd
passwd:  Changing password for khh
server1% 


--------------------------------------------------------------------------------
Note - 
passwd exits before a password is entered.


--------------------------------------------------------------------------------

Action
In the man page for passwd, you see the following:

If all requirements are met, by default, the passwd(1) command consults /etc/nsswitch.conf to determine which repositories need a password update. It searches the passwd(4) and passwd_compat entries. The sources (repositories) associated with these entries are updated. However, the supported password update configurations are limited to the following five cases. Failure to comply with the configurations prevents users from logging in to the system. 

passwd: files
passwd: files nis
passwd: files nisplus
passwd: compat (==> files nis)
passwd: compat (==> files nisplus)
passwd_compat: nisplus 


--------------------------------------------------------------------------------
Note - 
The passwd(1) man page does NOT say that you can use the line: passwd_compat: nis. passwd(1) works exactly as described in the man page.


--------------------------------------------------------------------------------


passwd (SYSTEM): System error: repository out of range
Cause
When trying to lock a user account and using nispasswd with the -l option in the Solaris 2.6 release, you get this error: passwd (SYSTEM): System error: repository out of range. 

Action
Use passwd -r nisplus -l username instead.

passwd.org_dir: NIS+ servers unreachable
Cause
This is the first of three messages that an NIS+ client prints when it cannot locate an NIS+ server on the network.

Action
For details, refer to "hosts.org_dir: NIS+ servers unreachable".

Password does not decrypt secret key for unix.uid@string 
Cause
This message appears at login when a user's password is not identical to the user's keylogin(1) network password. When a system is running NIS+, the login program first performs UNIX authentication, and then attempts a keylogin(1) for secure RPC authentication.

Action
To gain credentials for secure RPC, users can run keylogin(1) (after login) and type their secret key. To stop this message from appearing at time of login, users can run the chkey -p command and set their network password to be the same as their NIS+ password. If a user does not remember the network password, the system administrator should delete and re-create the user's credentials table entry so the user can establish a new network password with chkey(1).

password file busy - try again later.
Cause
On a SunOS system running NIS (YP), the user runs yppasswdd(1M)and the system reports this error. On the NIS Master server, this error is in the messages file from rpc.yppasswdd: password file busy - try again. This error is caused superficially by the existence of a lock file, /var/yp/passwd.ptmp. Removing this file allows yppasswdd to run to completion, but subsequent invocations still fail with the same error message. The root cause is that yppasswdd has the-m option, which says to run make to push the maps out to the slave servers. In this situation, a problem occurred in pushing the maps to a slave server; the push would hang. Thus, the push was never completed, and the lock file was never removed. This was tested by doing the following: 

#cd /var/yp 
#make passwd 
passwd is up to date 
#touch passwd 
#make passwd
 
From here, the make remakes the map, but then hangs on the push to the slave.

Action
To fix the root cause, find out why the map does not push. In this situation, it was a routing issue; however, the remedy could lie elsewhere.

pdbadmin start node fails cluster_establish join not allowed
Cause
The user created a disk group, but forgot to make it shared. After it was made a shared disk group, the user attempted to start the second node (which had not been rebooted). pdbadmin start node on second pdb node failed with this repeated message until it finally timed out: 

return from cluster_establish is join not allowed now  
retrying cluster_establish 


Action
You can either reboot the second node or run vxdctl enable. 

pdbadmin start node now works.

Permission denied
Cause
An attempt was made to access a file in a way forbidden by the protection system.

Action
Check the ownership and protection mode of the file (with a long listing from the ls -l command) to see who is allowed access to the file. Then change the file or directory permissions, as needed.

Technical Notes
The symbolic name for this error is EACCES, errno=13.

Please specify a recipient.
Cause
With mailtool(1), this message comes up in a dialog box whenever a user tries to deliver a message with no address in the To: field.

Action
For details, refer to "Recipient names must be specified".

Protocol error
Cause
A protocol error occurred. This error is device specific, but is generally not related to a hardware failure.

Technical Notes
The symbolic name for this error is EPROTO, errno=71.

protocol error, string closed connection
Cause
rlogin(1) fails on a machine with the SunOS system installed.

Action
Check the permissions in in.rlogind on the machine you are trying to connect to. The permissions should look like this: 

-rwxr-xr-x  1 root     staff       16384 Jan 20  1994 /usr/sbin/in.rlogind 


Check the login line in the /etc/inetd.conf file. It should look like the following: 

login	stream	tcp	nowait	root	/usr/sbin/in.rlogind	in.rlogind 


Check /etc/passwd to see if an invalid login shell has been substituted in the entry for the login ID.

Protocol family not supported
Cause
The protocol family has not been configured into the system or no implementation for it exists. This is used for the Internet protocols.

Technical Notes
The symbolic name for this error is EPFNOSUPPORT, errno=123.

Protocol not supported
Cause
The requested networking protocol has not been configured into the system, or no implementation for it exists. (A protocol is a formal description of the messages to be exchanged and the rules to be followed when systems exchange information.)

Action
Verify that the protocol is in the /etc/inet/protocols file and in the NIS protocols map, if applicable. If the protocol is not listed, and you want to permit its use, configure the protocol as documented or as required.

Technical Notes
The symbolic name for this error is EPROTONOSUPPORT, errno=120.

Protocol wrong type for socket
Cause
This message indicates either an application programming error, or badly configured protocols.

Action
Make sure that the /etc/protocols file corresponds number-for-number with the NIS protocols(4) map. If it does, ask the vendor or author of the application for an update.

Technical Notes
A protocol was specified that does not support the semantics of the socket type requested. This protocol amounts to a request for an unsupported type of socket. Look at the source code that made this socket request and check that it requested one of the types specified in /usr/include/sys/socket.h.

The symbolic name for this error is EPROTOTYPE, errno=98.

"Q"
quotactl: open Is a directory
Cause
When using edquota to set user limits, the command displays this error. edquota updates all quota files that are on a mounted file system. A directory named quotas causes it to fail.

Action
In one of the mounted file systems is a directory named quotas. To fix the problem, move the directory from the mounted file system and rename or delete it.

For example: If you have /usr/quotas/old_info, the directory /usr/quotas will cause edquota to fail. Either move /usr/quotas to /usr/old_quotas or delete the directory. 

"R"
Read error from network: Connection reset by peer
Cause
This message appears when a user logs in remotely to a machine that crashes or is rebooted during the rlogin(1) or rsh(1) session. Any data changes that were not saved are probably lost. Sometimes this message appears only when the user types some data, even though the system failed hours before.

Action
Try to rlogin(1) again, perhaps after waiting a few minutes for the system to reboot.

Reading configuration data
Cause
In this situation, the user loaded SunPC 4.1 on a SPARCstation 5 machine. The Solaris 2.5 operating environment is patched to the Solaris 2.5.1. The user also has a SunPC accelerator card installed. When starting SunPC, the user gets this error message on the SunPC splash screen. If the user clicks anywhere in the screen, the whole console locks. The user has to move to another machine and use rlogin and then kill the SunPC process. In an effort to resolve the problem, the user had installed and removed SunPC and the 102924-25 patch with the same results. The user also removed the accelerator card, performed a boot -r and still SunPC 4.1 hung at the splash screen. The following error was found in the /var/adm file: 

modrput() sdos_mbsigolint failed -1 


Action
In this situation, the user had wiped the operating system off the SPARCstation 5 machine and, at that point, was not sure which patches had been applied. The user installed a copy of the Solaris 2.5.1 software and, then, performed the SunPC installation. That solved the problem. SunPC worked without the Accelerator card. The user added the Accelerator card, performed a boot -r, and ran SunPC with no problems.

Read-only file system
Cause
Files and directories on file systems that are mounted read-only cannot be changed.

Action
If you only modify these files and directories occasionally, use rlogin(1) to log in to the servers of the mounted file systems and change the files or directories from there. 

If you change these files and directories frequently, use mount(1M) to make the file systems read-write.

Technical Notes
The symbolic name for this error is EROFS, errno=30.

rebooting...
Cause
This message appears on the console to indicate that the machine is booting, either after the superuser issued a reboot(1M) command, or after a system panic, if the EEPROM's watchdog-reboot? variable is set to true.

Action
Allow the machine to boot itself. In case of a system panic, look above this message for other indications of what went wrong.

Recipient names must be specified
Cause
Someone sent mail without a valid recipient in the To: field. Thus, sendmail(1M) could not deliver the mail message. Using mail(1), the recipient's address might have been specified using spaces or non-alphanumeric characters. The mailtool(1) and mailx(1) commands try to prevent such problems by issuing Please specify a recipient or No recipients specified messages instead. If at least one valid recipient exists, each invalid recipient address will generate a User unknown message.

Action
Look in the sender's dead.letter file for the automatically saved message, and have the originator send it again; this time the sender specifies a recipient.

See Also
For more information about sendmail(1M), see the System Administration Guide, Volume 3.

refused connect from hostIP to callit(ypserv)
Refer to "connect from hostIP to callit(ypserv): request from unauthorized host".

Reset tty pgrp from int to int 
Cause
The C shell sometimes issues this message when it clears away the window process group after the user exits the window system. This clearing can happen when the window system does not clean up after itself.

Action
Proceed with your work. This message is only informational.

Resource temporarily unavailable
Cause
This error indicates that the fork(2) system call failed because the system's process table is full, or that a system call failed because of insufficient memory or swap space. Also, a user might not be allowed to create more processes.

Action
Simply waiting often gives the system time to free resources. However, if this message occurs often on a system, reconfigure the kernel and allow more processes. To increase the size of the process table, increase the value of MAXUSERS in the /etc/system file. The default MAXUSERS value is the amount of main memory in Mbytes, minus 2.

If one user is not allowed to create any more processes, that user has probably exceeded the memory size limit; see the limit(1) man page for details.

Technical Notes
The symbolic name for this error is EAGAIN, errno=11.

Restartable system call
Action
Restart the interrupted system call.

Technical Notes
The symbolic name for this error is ESTART, errno=91.

Result too large
Cause
This is a programming error or a data input error.

Action
Ask the program's author to fix this condition.

Technical Notes
This error indicates an attempt to evaluate a mathematical programming function at a point where its value would overflow or underflow. The value of a programming function in the math package (3M) is not representable within machine precision. This error could occur after floating point overflow or underflow (either single or double precision), or after total loss of numeric significance in Bessel functions.

This message can indicate Result too small in the case of floating point underflow.

To help pinpoint a program's math errors, use the matherr(3M) facility.

The symbolic name for this error is ERANGE, errno=34.

rlogin: no directory! connection closed
Cause
When a user tries to remotely log in to a machine, the user gets this error.

The machine that the user was trying to access with rlogin(1) had permissions of 700 on its root directory. The permissions on root should be 755. 

After the permissions on the root file system were changed to 755, the user was able to proceed farther when attempting to execute an rlogin, but it still failed with the following: 

Last login: Fri Aug 29 10:24:43 from machinename
no shell
connection closed 


Action
The machine that the user was trying to access with rlogin had the permissions set to 700 on both the root and /usr/bin directories. For both directories, the permissions should be 775. Once the user changed the permissions to 775, rlogin(1) was successful.

Also, check the user's passwd(1) entry in the NIS/NIS+ map. A login shell such as /usr/dist/exe/tcsh or /net/lab/.../csh could cause the failure because of NFS mount permission.

rmdir: string: Directory not empty
Cause
The rmdir(1) command can only remove empty directories. The directory with the name appearing after the first colon in the message still contains some files or directories.

Action
Use rm(1) instead of rmdir(1). To remove this directory and everything underneath it, use the rm -ir command to descend the directory recursively, and respond to requests to delete each element. To remove the directory and all its contents without prompts for approval, use the rm -r command.

ROOT LOGIN /dev/console
Cause
This syslog message indicates that someone has logged in as root on the system console.

Action
If you have just logged in as root, take no action. If you are not root, consider the possibility of a security breach. The best site-wide policy is for all system administrators to use su(1M) instead of logging in as root.

ROOT LOGIN /dev/pts/int FROM string 
Cause
This syslog message indicates that someone has logged in remotely as root on a pseudo-terminal from the system specified after the FROM keyword.

Action
For security reasons, it is a bad practice to allow root logins from anywhere other than the console. To restrict superuser logins to the console, remove the comment from the CONSOLE line in /etc/default/login.

route: socket: Protocol not supported
Cause
During a boot, this error is displayed and the multicast is not configured.

Action
An inittab(4) from a previous release of the operating environment was used. Thus, the following entry, which is required for the route command in the Solaris 2.6 release, was missing from /etc/inittab. 

ap::sysinit:/sbin/soconfig -f /etc/sock2path 
By default, this is the second entry in the file. After this entry was added, the multicast configured at boot time without error.

RPC: Program not registered
Cause
Check the rpc.bynumber NIS map.

rx framing error
Cause
Usually this error indicates a hardware problem.

Action
Check the Ethernet cabling and connectors to locate a problem.

Technical Notes
A framing error occurs when the Ethernet I/O driver receives a non-integral unit of octets, such as 63 bytes and then 3 bits. (Ethernet specifies the use of octets.) Framing errors are caused by corruption of the starting or ending frame delimiters. These delimiters can be corrupted by some violation of the encoding scheme.

Framing errors are a subset of CRC errors, which are usually caused by anomalies on the physical media. An alignment/framing error is a type of CRC error where octet boundaries do not align.

"S"
save: SYSTEM error, Arg list too long
Cause
The save fails with this error because the database (index) file for the client is greater than 2 Gbytes. With the Solaris 2.6 release and SBU 5.0.1 this is no longer a problem.

Action
However, with earlier versions of the Solaris software you need to open nwadmin -> indexes -> select appropriate client -> select appropriate fs -> remove oldes cycle -> reclaim space. 

You might have to repeat a few times to reclaim enough space. The indexes can be re-created later, if necessary, by using a scanner.

SCSI bus DATA IN phase parity error
Cause
The most common cause of this problem is unapproved hardware. Some SCSI devices for the PC market do not meet the high I/O speed requirements for the UNIX market. Other possible causes of this problem are improper cabling or termination, and power fluctuations. Data corruption is possible, but unlikely to occur, because this parity error prevents data transfer.

Action
Check that all SCSI devices on the bus are Sun-approved hardware. Then verify that all cables measure no longer than six meters total and that all SCSI connections are properly terminated. If power fluctuations are occurring, invest in an uninterruptible power supply.

SCSI transport failed: reason 'reset'
Cause
This message indicates that the system sent data over the SCSI bus, but the data never reached its destination because of a SCSI bus reset. The most common cause of this condition is conflicting SCSI targets. Data corruption is possible, but unlikely to occur, because this failure prevents data transfer.

Action
Verify that all cables measure no longer than six meters total and that all SCSI connections are properly terminated. If power surges are a problem, acquire a surge suppressor or an uninterruptible power supply.

A machine's internal disk drive is usually SCSI target 3. Make sure that external and secondary disk drives are targeted to 1, 2, or 0, and do not conflict with each other. Also, make sure that tape drives are targeted to 4 or 5, and CD drives to 6, avoiding any conflict with each other or with disk drives. If the targeting of the internal disk drive is in question, power off the machine, remove all external drives, turn on the power, and from the PROM monitor run the probe-scsi-all or probe-scsi command.

If SCSI device targeting is acceptable, memory configuration could be the problem. Ensure that high-capacity memory chips (such as 4-Mbyte SIMMs) are in lower banks, while lower-capacity memory chips (such as 1-Mbyte SIMMs) are in the upper banks.

SPARC systems do not always support third-party CD-ROM drives, and can generate a similar unknown vendor error message. Check with the CD-ROM vendor for specific configuration requirements.

Some third-party disk drives have a read-ahead cache that interferes with the Solaris device drivers. Make sure that any existing read-ahead cache facility is turned off.

See Also
For more information on SCSI targets, see the section on device naming conventions in the Solaris Transition Guide. If you are using AnswerBook online documentation, "SCSI targets" is a good search string.

Security exception on host string. USER ACCESS DENIED.
Cause
When trying to create a user with Adminsuite by placing the home directory on a system remote from the NIS+ server, the user gets this error message: 

Security exception on host hostname. USER ACCESS DENIED.
The user identity (555)username was received, but that user
is not authorized to execute the requested functionality
on this system. Is this user a member of an appropriate
security group on this system ?
(Function: class directory method create_dir) 
The user can use rsh(1) to access the remote machine and create a home directory on the system.

Action
The user was not in the system administration group NIS+ tables. 

# niscat group.org_dir | grep sysadmin 
 sysadmin::14: 
Add the user name to the system administration group.

Segmentation Fault
Cause
Segmentation faults usually come from a programming error. This message is usually accompanied by a core dump, except on read-only file systems.

Action
To see which program produced a core(4) file, run either the file(1) command or the adb(1) command. The following examples show the output of the file(1) and adb(1) commands on a core file from the dtmail program. 

$ file core
core: ELF 32-bit MSB core file SPARC Version 1, from `dtmail' 


$ adb core
core file = core -- program `dtmail'
SIGSEGV  11: segmentation violation
^D      (use Control-d to quit the adb rogram) 
Ask the vendor or author of this program for a debugged version.

Technical Notes
A process has received a signal indicating that it attempted to access an area of memory that is protected or that does not exist. The two most common causes of segmentation faults are attempting to dereference a null pointer or indexing past the bounds of an array.

sendmail[]: can't lookup data via name server "dns" or sendmail[]: can't lookup data via name server "nis"
Cause
The following entry in the /etc/nsswitch.conf file, sendmailvars: dns nis files, causes the messages to appear in the console window.

Action
The sendmailvars database can be used only with local files and/or NIS+. If you do not have this database setup, the default sendmailvars entry should look as follows in the /etc/nsswitch.conf file: 

sendmailvars: files 


sendmail[init]: NOQUEUE: SYSERR(root): Cannot bind to domain <domain>: no such map in server's domain: Bad file number
Cause
The user is running NIS and receives this error on several NIS machines.

Action
Check the following:

For the system(s) not working, make sure there is a /var/yp/nicknames file. Also, make sure that this file contains this entry: aliases mail.aliases 

On one of the systems not working, execute the following: 

ypcat aliases 


You will probably get this message: no such map in servers domain. Do a ypwhich to see which NIS server the system is bound to. Next, go to that server and verify that the mail.aliases map is missing from /var/yp/domainname. This map must either be created or copied over from one of the NIS servers that contains the map.

sendmail[int]: NOQUEUE: SYSERR: net hang reading from string 
Cause
This is a sendmail(1M) message that appears on the console and in the log file /var/adm/messages. If this message occurs once for a particular user, a mail message from this user might end with a partial line (having no terminating newline character). If this message appears frequently or at busy times, especially along with other networking errors, it could indicate network problems.

Action
Check the user's mail spool file to see if a message ends without a newline character. If so, talk with the user and determine how to prevent the problem from occurring again. If these messages are the result of network problems, you could try moving the mail spool directory to another machine with a faster network interface.

Technical Notes
During the SMTP receipt of DATA phase, a message-terminating period on a line of its own never arrived. sendmail(1M) timed out and produced this error.

Service wouldn't let us acquire selection
Cause
This message indicates that the OpenWindows selection service failed to seize the requested selection from /tmp/winselection. 

Consider the following diagnostics: the requested selection could be 0 for unknown, 1 for caret, 2 for primary, 3 for secondary, or 4 for clipboard. The result could be 0 for failure, 2 for nonexistent, 3 for did not have, 4 for wrong rank, 5 for continued, 6 for cancelled, or 7 for unrecognized.

setmnt: Cannot open /etc/mnttab for writing
Cause
The system is having problems writing to /etc/mnttab. The file system containing /etc might be mounted read-only, or not mounted at all.

Action
Check that this file exists and is writable by root. If so, ensure that the /etc file system has been mounted, and is mounted read-write, rather than read-only.

share_nfs: /home: Operation not applicable
Cause
This message usually indicates that the system has a local file system mounted on /home, which is where the automounter usually mounts users' home directories.

Action
When a system is running the automounter, do not mount local file systems on the /home directory. Mount them on another directory, such as /disk2, which on most systems you have to create. You could also change the automounter auto_home entry, but that is a more difficult solution.

Signal 8 error
Cause
In this case, the user gets a Signal 8 error during installation--right after starting Openwindows--and installation stops.

Action
Shut down the system "gracefully," and, as it is rebooting, place a ZIP drive cartridge (blank or used) in the ZIP drive. Begin the normal installation of the Solaris IA software. It is not possible to continue the existing installation of the Solaris software by putting a cartridge in the ZIP drive after receiving this error. When the Solaris software checks all of your hardware, it thinks the ZIP drive is just another hard drive and attempts to read from it. If there is no cartridge in the drive, then you receive the signal 8 error. If the Solaris software installation "sees" a cartridge in the ZIP drive, it reads from it, even if there is no data on the cartridge, and then continues.

SIMS license error: licenses invalid
Cause
This is a license internet mail server problem. The user is installing a departmental version of SIMS 3.1 on a Pentium 2 PC that is running the Solaris 2.6 IA release. The system is using a JavaTM interface and keeps getting the above error. The two license files from the license center are:

SERVER server 
DAEMON lic.SUNW /etc/opt/licenses/lic.SUNW 
INCREMENT SLAPD.1 lic.SUNW 1.000 08-Mar-1998 1  

SERVER nwlab4 727a2b6a 7588 
DAEMON suntechd /etc/opt/licenses/suntechd /etc/opt/licenses/daemon_options 
INCREMENT sun.mail.mbox suntechd 3.100 08-Mar-1998 100 


Action
Merge the two license files together and delete the extra SERVER line.

Slice c0t1d0s0 is too small to contain 1 replicas
Cause
When trying to add a state replica using metatool to cylinder 0 of a disk, the following error message appears: 

	Your attempt to attach metastate database
	replicas on slice "c?t?d?s?" failed for the
	following reason: Slice c?t?d?s? is too small
	to contain 1 replicas. 


This is because metatool masks out the very first cylinder to protect the disk label. On disksuite v4.1, metatool does allow adding the databases to cylinder 0 on 2.1Gbyte disks or larger.

Action
As a workaround, start at cylinder 1 (not cylinder 0) or use the command line (metadb -a).

snmpdx: bind() failed on udp on 161 [errno: address already in use] 125 snmpdx dmid: unable to connect to snmpdx
Cause
The user is running the Solaris 2.6 release with a Cisco FDDI card and is receiving the above error.

Action
In the Solaris 2.6 software a startup script is included in /etc/rc3.d that starts snmpdx (which uses port 161). You receive the error message because the FDDI SNMP agent is running, and it has already claimed port 161. Two solutions are: 

Move the snmpdx start-up script 

mv /etc/rc3.d/S76snmpdx    /etc/rc3.d/s76snmpdx 
so that snmpdx does not start.

Check if the FDDI can use a different port, other than 161.

Socket type not supported
Cause
The support for the socket type has not been configured into the system or no implementation for it exists.

Technical Notes
The symbolic name for this error is ESOCKTNOSUPPORT, errno=121.

Soft error rate (int%) during writing was too high
Cause
This message from the SCSI tape drive appears when Exabyte or DAT tapes generate too many soft (recoverable) errors. It is followed by the advisory Please, replace tape cartridge message. Soft errors are an indication that hard errors could soon occur, causing data corruption.

Action
First, clean the tape head with a cleaning tape, as recommended by the manufacturer. If that remedy does not work, replace the tape cartridge. If the problem persists, you might need to replace the tape drive with new tape cartridges.

Software caused connection abort
Cause
A connection abort occurred internally to your host machine.

Technical Notes
The symbolic name for this error is ECONNABORTED, errno=130.

Srmount error
Cause
This error is RFS specific. It occurs when an attempt is made to stop RFS while resources are still mounted by remote machines, or when a resource is readvertised with a client list that does not include a remote machine with the resource currently mounted.

Technical Notes
The symbolic name for this error is ESRMNT, errno=69.

Stale NFS file handle
Cause
A file or directory that was opened by an NFS client was either removed or replaced on the server.

Action
If you were editing this file, write it to a local file system instead. Try remounting the file system on top of itself or shutting down any client processes that refer to stale file handles. If neither of these solutions works, reboot the system.

Technical Notes
The original vnode is no longer valid. The only way to remove this error is to force the NFS server and client to renegotiate file handles.

The symbolic name for this error is ESTALE, errno=151.

start up failure no such file or directory
Refer to "late initialization error".

statd: cannot talk to statd at string 
Cause
This message comes from the NFS status monitor daemon statd(1M), which provides crash recovery services for the NFS lock daemon lockd(1M). The message indicates that statd(1M) has left old references in the /var/statmon/sm and /var/statmon/sm.bak directories. After a user has removed or modified a host in the hosts database, statd(1M) might not properly purge files in these directories, which results in its trying to communicate with a nonexistent host.

Action
Remove the file named variable (where variable is the host name) from both the /var/statmon/sm and /var/statmon/sm.bak directories. Then kill the statd(1M) daemon and restart it. If that does not get rid of the message, kill and restart lockd(1M) as well. If that remedy does not work, reboot the machine at your convenience.

stty: TCGETS: Operation not supported on socket
Cause
This message occurs when a user tries to use remote copy with rcp(1) or remote shell with rsh(1) from one machine to another, but has an stty(1) command in the remote .cshrc file. This error creates failure for the rcp(1) or rsh(1) command.

Action
The solution is to move the invocation of the stty(1) command to the user's .login (or equivalent) file. Alternatively, execute the stty(1) command in .cshrc only when the shell is interactive. You could perform the following test: 

if ($?prompt) stty ... 


Technical Notes
The rcp(1) and rsh(1) commands make a connection using sockets, which do not support stty(1)'s TCGETS ioctl.

su: No shell
Cause
This message indicates that someone changed the default login shell for root to a program that is missing from the system. For example, the final colon-separated field in /etc/passwd could have been changed from /sbin/sh to /usr/bin/bash, which does not exist in that location. Possibly an extra space was appended at the end of the line. The outcome is that you cannot login as root or switch user to root, and, thus, cannot directly fix this problem.

Action
The only solution is to reboot the system from another source, then edit the password file to correct this problem. Invoke sync(1M) several times, then halt the machine by typing Stop-A or by pressing the reset button. Reboot as single-user from CD-ROM, the net, or diskette, such as by typing boot cdrom -s at the prompt.

After the system starts and gives you a # prompt, mount the device corresponding to the original root partition somewhere, such as with a mount(1M) command similar to the one that follows. Then run an editor on the newly mounted system password file (use ed(1) if terminal support is lacking): 

# mount /dev/dsk/c0t3d0s0 /mnt
# ed /mnt/etc/passwd 
Use the editor to change the password file's root entry to call an existing shell, such as /usr/bin/csh or /usr/bin/ksh.

Technical Notes
To keep the No shell problem from happening, habitually use admintool or /usr/ucb/vipw to edit the password file. These tools make it difficult to change password entries in ways that make the system unusable.

su: 'su root' failed for login on /dev/pts/int 
Cause
The user specified by login tried to become superuser, but typed the wrong password.

Action
If the user is supposed to know the root password, wait to see if the correct password is supplied. If the user is not supposed to know the root password, ask why he or she is attempting to become superuser.

su: 'su root' succeeded for login on /dev/pts/int 
Cause
The user specified by login just became superuser by typing the root password.

Action
If the user is supposed to know the root password, this message is only informational. If the user is not supposed to know the root password, change this password immediately and ask how the user learned it.

SunPC may NOT run correctly as root
Cause
With SunPC 4.1 and the 102924 jumbo patch installed, a user (who is not root) attempts to run SunPC and receives the following error message: 

SunPC may NOT run correctly as root.
Please run in user mode.
SunPC script is exiting 


The user's primary group ID is probably root. For example: 

$ /usr/bin/id
uid=33650(gruff) gid=0(root) 


Action
Change the user's primary group to another group, such as 10, and, because the user still needs to be in the root group, add the root group to the user's secondary group list.

syncing file systems...
Cause
This message indicates that the kernel is updating the super-blocks before taking the system down to ensure file system integrity. This message appears after a halt(1M) or reboot(1M) command. It can also appear after a system panic, in which case the system might contain corrupted data.

Action
If you just halted or rebooted the machine, take no action. This message is normal. In case of a system panic, look up the panic messages. Your system vendor might be able to help diagnose the problem. So that you can describe the panic to the vendor, either leave your system in its panicked state or be sure that you can reproduce the problem.

Technical Notes
Numbers that sometimes display after the three dots in the message show the count of dirty pages that are being written out. Numbers in brackets show an estimate of the number of busy buffers in the system.

syslog service starting.
Cause
During system reboot, this message might appear and the boot seemingly hangs. After starting syslogd(1M) service, the system runs /etc/rc2.d/S75cron, which in turn calls ps(1). Sometimes after an abrupt system crash /dev/bd.off becomes a link to nowhere, causing the ps(1) command to hang indefinitely.

Action
Reboot as a single user (for example with boot -s) and run ls -l /dev/bd* to see if this is the problem. If so, remove /dev/bd.off, then run bdconfig off or reboot with the -r (reconfigure) option.

This is the most commonly reported situation that causes ps(1) to hang.

System booting after fatal error FATAL
Cause
The system reboots automatically. Afterward, the messages file contains System booting after fatal error FATAL.

The message is issued during a reboot after the system detects a hardware error. The following can cause this response: UPA address parity error, Master queue overflows, DTAG parity errors, E-Cache tag parity errors, and Coherence errors.

Action
Use prtdiag(1M) to help identify failed hardware components. The errors indicate that you either have a bad CPU module or a bad system board.

SYSTEM error, Arg list too long
Cause
When trying to back up a client with networker, the following error occurs: 

* heaven.com:/export/heaven2 save: SYSTEM error, Arg list too long 
* heaven.com:/export/heaven2 save: Cannot open save session with heaven.com 
* heaven.com:/export/heaven3 1 retry attempted 
* heaven.com:/export/heaven3 save: SYSTEM error, Arg list too long 
* heaven.com:/export/heaven3 save: Cannot open save session with heaven.com 


Action
An error like this is due to an index file (/nsr/index/clientname) that is greater than 2 Gbytes in Solstice backup revisions less than 5.0.1. In 5.0.1 the indexes are segmented so this error should no longer be a problem. In any revision of Solstice backup this error can also be due to a corrupt client index. If so, running the following command might resolve the problem: 

# nsrck -F clientname
 
If this remedy does not fix the problem, shut down the networker daemons, remove the client index, and restart the daemons. The backup should then run fine.

system hang
Cause
4.1.3C Sbus cards suffered a system freeze.

SYSTEM HANGS DURING BOOT
Cause
When the user boots a system, it hangs after the following boot messages: root on, swap on, and dump on. After the system displays these messages, the LEDs flash and the system hangs.

This is due to an earlier fsck that deleted devices under the /dev directory. Check for the /dev/console device and, if it is missing, create one.

system will not connect to port 80
Refer to "late initialization error".

"T"
tar: /dev/rmt/0: No such file or directory
Cause
The default tape device /dev/rmt/0 or possibly the device specified by the TAPE environment variable is not currently connected to the system, is not configured, or its hardware symbolic link is broken.

Action
List the files in the /dev/rmt directory to see which tape devices are currently configured. If none are configured, ensure that a tape device is correctly attached to the system, and reboot with the -r option to reconfigure devices.

If tape devices other than /dev/rmt/0 are configured, you could specify one of them after the -f option of tar(1).

tar: directory checksum error
Cause
This error message from tar(1) indicates that the checksum of the directory and the files it has read from tape does not match the checksum advertised in the header block. Usually this message indicates the wrong blocking factor, although it could indicate corrupt data on tape.

Action
To resolve this problem, make certain that the blocking factor you specify on the command line (after -b) matches the blocking factor originally specified. If in doubt, leave out the block size and let tar(1) determine it automatically. If that remedy does not help, the tape data could be corrupted.

tar: tape write error
Cause
A physical write error has occurred on the tar(1) output file, which is usually a tape, although it could be a diskette or disk file. Look on the system console, where the device driver should provide the actual error condition. The condition might be a write-protected tape, a physical I/O error, an end-of-tape condition, or a file-too-large limitation.

Action
In the case of write-protected tapes, enable the write switch. For physical I/O errors, replace the tape with a new one. For end-of-tape conditions, try using a higher density, if the device supports one, or use cpio(1) or pax(1) for their multi-volume support. When encountering the file-too-large limitations, use the parent shell's limit(1) or ulimit(1) facility to increase the maximum file size.

See Also
For more information on tar tapes, see the section on copying UFS files in the System Administration Guide, Volume 1.

Text file busy
Cause
This error can occur when an attempt was made to execute a pure-procedure program that is currently open for writing. It also occurs when attempting to open for writing or to remove a pure-procedure program being executed. (This message is obsolete.)

Technical Notes
The symbolic name for this error is ETXTBSY, errno=26.

Text is lost because the maximum edit log size has been exceeded.
Cause
This message appears at the beginning of a cmdtool(1) session after 100,000 characters have scrolled by. Clicking the top rectangle of the scrollbar might display this message. No data were lost, but the user cannot scroll back before this wraparound point.

Action
To increase the maximum size of the Command Tool log file, use cmdtool -M, specifying more than 100,000 bytes.

tftpd: nak: Transport endpoint is already connected
Cause
After configuring an Autoclient (Autoclient 2.1 - Solstice Adminsuite 2.3), particularly on a Solaris 2.6 environment, you might get a similar error message on your Server from /dev/console and/or from /var/adm/messages: 

tftpd: nak: Transport endpoint is already connected 


A subsequent boot net by the Autoclient hangs. For example: 

Boot Device:... 
File and Args... 


--------------------------------------------------------------------------------
Note - 
This error message is difficult to decipher. Also, at this early point in the autoclient's boot, there is a minimum record of the event. To troubleshoot this problem, a snoop of the client, run from another system on the client's subnet, is necessary.


--------------------------------------------------------------------------------

Action
A change was made in the Solaris 2.6 in.tftpd to use sendto(), instead of send(). Because the Solaris 2.5.1 environment uses send() as opposed to sendto(), one workaround would be to copy in.tftpd from a Solaris 2.5.1 to the Solaris 2.6 environment. Another workaround would be to troubleshoot from the server the nonexistent file that it is trying to receive by doing a snoop of the client. 

For example (assuming you are using an onboard Ethernet interface): 

# snoop autoclient_name
 
or 

# snoop ethernet_address_of_autoclient_name
 
In this case, you might get a Trivial File Transfer Protocol (TFTP) read similar to the following: 

81911ED4.SUN4C 
TFTP Error: access violation 
The error tells you that something is wrong within your /tftpboot directory. 

For an AUTOCLIENT: The problem lies in the /tftpboot directory of the boot server. Confirm that the HOSTID and HOSTID.ARCH files are linked to the correct inetboot file for your architecture. This is a correct entry for a sun4m system: 

81971904 -> inetboot.sun4m.Solaris_2.4 
81971904.SUN4M -> inetboot.sun4m.Solaris_2.4 
This is an incorrect entry for a sun4m system: 

C753002F -> inetboot.axil4m.Solaris_2.5.1 
C753002F.AXIL4M -> inetboot.axil4m.Solaris_2.5.1 
If they are not correct, remove the entry for that particular client in this directory and again add the client with the add_install_client script or through the Solstice tool. 

For a JUMPSTART client: The Error: access violation from the server to the client might be an indication that the wrong kernel architecture has been specified in the add_install_client command line. On the server, type these commands: 

# cd /cdrom/cdrom0/s0         
# ./add_install_client host_name correct_architecture
 
The add_install_client script cleans out the incorrect architecture and sets up the install server with the correct architecture to boot the client. If a problem arises using add_install_client, use ./rm_install_client and ./add_install_client with the correct architecture. 

All other follow the same path of checking the /tftpboot directory.

THE FOLLOWING FILE SYSTEM(S) HAD AN UNEXPECTED INCONSISTENCY:
Cause
At boot time the /etc/rcS script runs the fsck(1M) command to check the integrity of file systems marked fsck in /etc/vfstab. If fsck(1M) cannot repair a file system automatically, it interrupts the boot procedure and produces this message. When fsck(1M) gets into this state, it cannot repair file systems without losing one or more files, so it defers this responsibility to you, the administrator. Data corruption has probably already occurred. 

Action
First run fsck -n on the file system to see how many and what type of problems exist. Then run fsck(1M) again to repair the file system. If you have a backup of the file system, you can generally answer "y" to all the fsck(1M) questions. It is a good practice to keep a record of all problematic files and inode numbers for later reference. To run fsck(1M) yourself, specify options as recommended by the boot script. For example: 

# fsck /dev/rdsk/c0t4d0s0 
Usually, files lost during fsck(1M) repair were created just before a crash or power outage, and cannot be recovered. If important files are lost, you can recover them from backup tapes.

If you do not have a backup, ask an expert to run fsck(1M) for you.

See Also
For more information, see the section on checking file system integrity in the System Administration Guide, Volume 1.

The SCSI bus is hung. Perhaps an external device is turned off.
Cause
This message appears near the beginning of rebooting, immediately after a Boot device: ... message. Then, the system hangs. The problem is conflicting SCSI targets for a non-boot device. Having an external device turned off is unlikely to cause this problem.

Action
For a solution, refer to "Boot device: /iommu/sbus/directory/directory/sd@3,0".

See Also
For more information, see the section on halting and booting in the System Administration Guide, Volume 1.

THE SYSTEM IS BEING SHUT DOWN NOW !!!
Cause
This message means the system is going down immediately, and it is too late to save any changes.

Action
This message is often preceded by messages telling you that the system is going down in 15 minutes, 10 minutes, and so on. When you see these initial broadcast shutdown messages, save all your work, send any email you are working on, and close your files. Fortunately, vi(1) sessions are automatically saved for later recovery, but many other applications have no crash protection mechanism. Data loss is likely.

See Also
For more information on shutting down the system, see the System Administration Guide, Volume 1. If you are using AnswerBook online documentation, "halting the system" is a good search string.

The system will be shut down in int minutes
Cause
This message from the system shutdown(1M) script informs you that the superuser is taking down the system.

Action
Save all changes now or your work will be lost. Write out any files you were changing, send any email messages you were composing, and close your files.

See Also
For more information on shutting down the system, see the System Administration Guide, Volume 1. If you are using AnswerBook online documentation, "halting the system" is a good search string.

This gateway does not support Unix Password.
Cause
While using Firewall v2.0, the following sequence happens: 

# telnet firewall-machine
Trying 192.29.174.60 ...
Connected to firewall-machine
Escape character is '^]'.
CheckPoint FireWall-1 authenticated Telnet server running on
firewall-machine
Login: testuser
This gateway does not support Unix Password. 


Action
Under Network Objects, edit the Gateway object Host Properties Auth Schemes and select UNIX Password. UNIX Password is not checked by default as it is considered an unsecure method of authentication.

This mail file has been changed by another mail reader.
Cause
This message appears in a pop-up dialog box whenever you start mailtool(1) while another mail reader has the inbox locked. A question follows: Do you wish to ask that mail reader to save the changes? You are given three choices.

Action
If you choose Save Changes, mailtool(1) requests the other mail reader to relinquish its lock and write out any changes it has made to your inbox. If you choose Ignore, mailtool(1) reads your inbox without locking it. If you choose Cancel, mailtool(1) exits.

Timeout waiting for ARP/RARP packet
Cause
This problem can occur while booting from the net, and indicates a network connection problem.

Action
Make sure the Ethernet cable is connected to the network. Check that this system has an entry in the NIS ethers(4) map or locally on the boot server. Then check the IP address of the server and the client to make sure they are on the same subnet. Local /etc/hosts files must agree with one another and with the NIS hosts(4) map.

If those conditions are not causing the problem, go to the system's PROM monitor ok prompt and run test net to test the network connection. (On older PROM monitors, use test-net instead.) If the network test fails, check the Ethernet port, card, fuse, and cable, replacing them if necessary. Also check the twisted pair port to make sure it is patched to the correct subnet.

See Also
For more information on packets, see SPARC: Installing Solaris Software. If you are using AnswerBook online documentation, "ARP/RARP" is a good search string.

Timer expired
Cause
The timer set for a STREAMS ioctl call has expired. The cause of this error is device specific and could indicate either a hardware or software failure, or perhaps a time-out value that is too short for the specific operation. The status of the ioctl(2) operation is indeterminate. This is also returned in the case of _lwp_cond_timedwait(2) or cond_timedwait(3THR).

Technical Notes
The symbolic name for this error is ETIME, errno=62.

token ring hangs
Cause
4.1.3C Sbus cards suffered a system freeze.

Too many links
Cause
An attempt was made to create more than the maximum number of hard links (LINK_MAX, by default 32767) to a file. Because each subdirectory is a link to its parent directory, the same error results from trying to create too many subdirectories.

Action
Check why the file has so many links to it. To get more than the maximum number of hard links, use symbolic links instead.

Technical Notes
The symbolic name for this error is EMLINK, errno=31.

Too many open files
Cause
A process has too many files open at once. The system imposes a per-process soft limit on open files, OPEN_MAX (usually 64), which can be increased, and a per-process hard limit (usually 1024), which cannot be increased.

Action
You can control the soft limit from the shell. In the C shell, use the limit(1) command to increase the number of descriptors. In the Bourne or Korn shells, use the ulimit -n command to increase the number of file descriptors.

If the window system refuses to start new applications because of this error, increase the open-file limit in your login shell before starting the window system.

Technical Notes
The symbolic name for this error is EMFILE, errno=24.

Transport endpoint is already connected
Cause
A connect request was made on an already connected transport endpoint; or, a sendto(3XNET) or sendmsg(3XNET) transport endpoint specified a destination when already connected.

Technical Notes
The symbolic name for this error is EISCONN, errno=133.

Transport endpoint is not connected
Cause
A request to send or receive data was disallowed because the transport endpoint is not connected and (when sending a datagram) no address was supplied.

Technical Notes
The symbolic name for this error is ENOTCONN, errno=134.

TRAP 3E
Cause
The Ultra system fails to boot with TRAP 3E. The system sometimes also displays bad magic number errors.

This error is caused by a bad super block on the boot disk. Which, in turn, could have been caused by a SCSI configuration problem. 

Action
To fix: 

Check the SCSI bus for illegal configuration, bad cables, and duplicate SCSI addresses.

Boot from CD-ROM as single user. 

OK boot cdrom -sw
 

Attempt to fsck(1M) boot disk. This could fail with a super block error. 

# fsck /dev/rdsk/device
 

Find the locations of alternate super blocks. BE SURE TO USE AN UPPERCASE -N. For example: 

# newfs -N /dev/rdsk/c0t0d0s0
/dev/rdsk/c0t0d0s0:     2048960 sectors in 1348 cylinders of 19 tracks, 
80 sectors 1000.5MB in 85 cyl groups (16 c/g, 11.88MB/g, 5696 i/g)
super-block backups (for fsck -F ufs -o b=#) at:
32, 24432, 48832, 73232, 97632, 122032, 146432, 170832, 195232, 219632,
244032, 268432, 292832, 317232, 341632, 366032, 390432, 414832, 439232,
463632, 488032, 512432, 536832, 561232, 585632, 610032, 634432, 658832,
683232, 707632, 732032, 756432, 778272, 802672, 827072, 851472, 875872,
900272, 924672, 949072, 973472, 997872, 1022272, 1290672, ... 


Using an alternate super block, run fsck(1M) on the disk. You might have to try more than one alternate super block to make this to work. Pick a couple from the beginning, the middle, and the end. 

# fsck -o b=<altblk> /dev/rdsk/c0t0d0s0 


The boot block is probably bad too. Restore it while you are booted from the CD-ROM. 

# /usr/sbin/installboot /usr/platform/architecture/lib/fs/ufs/bootblk 
/dev/rdsk/c0t0d0s0 


Reboot the operating environment. 

# reboot 

"U"
ufsdump 4mm commands
Cause
Dump syntax was used with autoloader.

umount: warning: /string not in mnttab
Cause
This message occurs when the superuser attempts to unmount a file system that is not mounted. Subdirectories of file systems, such as /var, cannot be unmounted.

Action
Run the mount(1M) or df(1M) command to see which file systems are mounted. If you really want to unmount one of them, specify the existing mount point.

Unable to connect to license server. Inconsistent encryption code.
Cause
The user receives this error message, and only the IP address of the machine has changed.

Action
The IP address defined with ifconfig(1M) must match that in /etc/hosts. That is, if you change the machine's IP address with ifconfig(1M), you must also change the machine's entry in the /etc/hosts file.

For machines with multiple interfaces, you must check and possibly update /etc/hostname.*.

unable to get pty!
Cause
When trying to open a Terminal window (dtterm) in CDE, a pop-up window appears stating, Unable to get pty! 

dtterm is not able to open /dev/pts/int (where int is an integer). The user cannot open this file because grantpt(3C) failed to change the permissions on the file. grantpt(3C) failed because the binary /usr/lib/pt_chmod is not setuid root. The permissions on /usr/lib/pt_chmod must be 4111.

Action
To restore the correct permissions to pt_chmod, use the following command (as root): 

# chmod 4111 /usr/lib/pt_chmod 


Unable to install/attach driver 'string'
Cause
These messages appear in /var/adm/messages at boot time, when the system tries to load drivers for devices the machine does not have.

Action
This message is strictly informational. You probably do not want all these device drivers because they make your system kernel larger, requiring more memory.

Unable to open nwrecover, Error: nwrecover: NSR: please start a server on client_name 
Cause
While trying to open the graphical recovery interface by running nwrecover from the client, this error was displayed.

Action
In this case, multiple networker servers existed and nwrecover could not determine which network server to use for the client. 

The server can be specified to the nwrecover command with the -s option. 

nwrecover -c client_name -s server_name
 
-s server_name sets the NetWorker server, and -c client_name sets the NetWorker client index.

uname: error writing name when booting
Cause
The system cannot bootstrap.

Action
Boot from the CD-ROM and check /etc/nodename. The file must contain exactly one line with the name of the system. No blank or other lines are allowed.

undefined control
Cause
This message, prefaced by the file name and line number involved, is from the C preprocessor /usr/ccs/lib/cpp and indicates a line starting with a pound-sign (#) but not followed by a valid keyword (such as define or include).

Action
A piece of software might be running the C preprocessor on an initialization file that you thought was interpreted by a shell. In most shells, the sharp (#) indicates a comment. The C preprocessor considers comments to be anything between /* and */ delimiters.

unknown host exception: unknown host
Cause
The user tries to install Sun Directory Services 1.0 using the Java front end. During the installation, an error occurs: unknown host exception: unknown host. Then the Services displays the host name with domain name appended twice.

Action
The user had the following line in /etc/nsswitch.conf: hosts: dns files. 

By changing the line to point first to files and then to DNS, hosts: files dns, the problem was resolved. 

Other considerations: This error could also happen if you are using a fully qualified host name. Make sure your host name does not have the domain appended. If you use a fully qualified host name, the domain is appended twice. Also, verify that the domain name specified in /etc/resolv.conf is a reachable domain.

Unmatched `
Cause
This message from the C shell csh(1) indicates that a user typed a command containing a backquote symbol (`) without a closing backquote. Similar messages occur from an unmatched single quote (') or an unmatched double quote ("). Other shells generally give a continuation prompt when a command line contains an unmatched quote symbol.

Action
Correct the command line and try again. To continue typing on another line, give the C shell a backslash right before the newline.

UNREF FILE I=i OWNER=o MODE=m SIZE=s MTIME=t CLEAR?
Cause
During phase 4, fsck(1M) discovered that the specified file was orphaned because the inode had no record of its path name. In other words, the file was not connected with any directory.

Action
Answer "yes" to reconnect the file into the lost+found directory. Then contact the file's owner to ask if you should send it back, and where to place it.

See Also
For more information, see the chapter on checking file system integrity in the System Administration Guide, Volume 1.

UnsatisfiedLinkError
Cause
A user was able to use a demo version only when dialed-in to an Internet provider. The user further noted that this Java error message occurred when trying to load library pages without a connection.

Action
The Java WorkShop package relies on the Java Development Kit to provide networking services. There could be two possible problems: 

The JDK/VM tries to load net.dll, which then loads wsock32.dll as its socket services. The winsocket program might have done something with the system socket DLLs and might have broken the JDK net.dll, which could explain the UnsatisfiedLinkError.

When JDK creates a ServerSocket or Socket object, it tries to resolve the local host name by calling gethostbyaddr(), which eventually queries the DNS on the Win95/NT, if the user has a DNS entry configured for the TCP/IP. (This normally results in a "Dialup dialog" coming up.)

For the first problem: If the winsocket program renames/moves the wsock32.dll or winsock.dll, the resolution includes modifying the JDK. 

For the second problem: To avoid the DNS query, add an entry to your %WinDir%\HOSTS file. Refer to the Java WorkShop release notes for more details.

Use "logout" to logout.
Cause
This C shell message might come as a surprise to Bourne or Korn shell users accustomed to logging out with a Control-D.

Action
When ignoreeof is set, the C shell requires users to log out by typing logout(1) or exit(1). Write any modified files to disk before exiting.

user unknown
Cause
When trying to mail to a user, the error Username... User unknown is displayed. The user is on the same system.

Action
Check for a typographical error in the entered email address. Otherwise, the user could be aliased to a nonexistent email address in /etc/mail/aliases or the user's .mailrc file.

You cannot mail to a user that has capital letters in its name. sendmail(1M) converts all the capital letters to lowercase before attempting to find the user. Because UNIX is case sensitive, it finds no user name on the system with all lowercase letters, so it displays the User unknown message.

As a workaround, make sure all user names are composed of only lowercase letters. 

/usr/dt/bin/rpc.ttdbserverd:Child Status' changed
Cause
While running CDE, the error in the console or /var/adm/messages file was as follows: 

Oct 19 04:41:00 darkcastle last message repeated 393 times
Oct 19 04:41:01 darkcastle inetd[120]: /usr/dt/bin/rpc.ttdbserverd:Child Status Changed 


Action
Create the following soft links: 

ln -s /usr/openwin/bin/rpc.ttdbserver /usr/dt/bin/rpc.ttdbserver
ln -s /usr/openwin/bin/rpc.ttdbserverd /usr/dt/bin/rpc.ttdbserverd 


/usr/openwin/bin/xinit: connection to X server lost
Cause
This error means that the xinit(1) program, which sets up X11 resources and starts a window manager, failed to locate the X server process. Perhaps the user interrupted window system startup, or exited abnormally from OpenWindows (for example, by killing processes or by rebooting). The X server might have crashed. Data loss is possible in some cases. Depending on the process timing, this message might be normal when the OpenWindows environment exits during a system reboot.

Action
The only solution is to exit and restart the OpenWindows environment. You do not need to reboot the system unless it hangs and fails to give you a console prompt. To exit the OpenWindows environment, select Workspace->Exit. To restart the OpenWindows environment, type openwin(1) at the system prompt. 

/usr/ucb/cc: language optional software package not installed
Cause
While compiling some code for BSD compatibility, the error occurred after invoking usr/ucb/cc. The unbundled compiler SPARCworks Professional C product was installed in /opt.

/usr/ucb/cc is a script that checks for the file /usr/ccs/bin/ucbcc and, if found, invokes it with appropriate library flags for BSD-compatibility compilation. 

/usr/ucb/cc is part of the package SUNWscpu. /usr/ccs/bin/ucbcc is supposed to be a symbolic link to /opt/SUNWspro/bin/acc, which is created during the installation of the unbundled C compiler, SPROcc.

Action
Verify that you have the essential OS-bundled Developer packages, SUNWscpu, SUNWbtool, and the unbundled C compiler, SPROcc. However, in this case, /usr/ccs/bin/ucbcc was missing on the user's system. Evidently, somehow this link was removed. 

Solve the problem by creating a new symbolic link: 

# ln -s /opt/SUNWspro/bin/acc /usr/ccs/bin/ucbcc 
Invoke usr/ucb/cc to verify this remedy worked. 

The following commands are used to identify which packages contain the particular components involved: 

craterlake% grep ucb/cc /var/sadm/install/contents
/usr/ucb/cc f none 0555 bin bin 3084 50323 814621113 *SUNWscpu
craterlake% ls -l /usr/ucb/cc
-r-xr-xr-x   1 bin      bin         3084 Oct 25  1995 /usr/ucb/cc
craterlake% file !$
file /usr/ucb/cc
/usr/ucb/cc:    executable /usr/bin/sh script
craterlake% grep ucbcc /var/sadm/install/contents
/usr/ccs/bin/ucbcc=/opt1/40/SUNWspro/SC4.0/bin/acc s none SPROcc SPROcc.2 SPROcc.5
craterlake% file /usr/ccs/bin/ucbcc
/usr/ccs/bin/ucbcc:  ELF 32-bit MSB executable SPARC Version 1, dynamically linked, stripped
craterlake% ls -l /usr/ccs/bin/ucbcc
lrwxrwxrwx   1 root     other         31 Aug 23  1996 /usr/ccs/bin/ucbcc 
                    -> /opt1/40/SUNWspro/SC4.0/bin/acc 


UX: userdel: error: Cannot update system files login cannot be deleted
Cause
This error is displayed when using userdel to delete a user, 

userdel -r userid
 
and the root (/) file system is full. 

Action
Free up some space on the root (/) file system.

"V"
Value too large for defined data type
Cause
The user ID or group ID of an IPC object or file system object was too large to be stored in an appropriate member of the caller-provided structure.

Action
Run the application on a newer system, or ask the program's author to fix this condition.

Technical Notes
This error occurs only on systems that support a larger range of user or group ID values than a declared member structure can support. This condition usually occurs because the IPC or file system object resides on a remote machine with a larger value of type uid_t, off_t, or gid_t than that of the local system.

The symbolic name for this error is EOVERFLOW, errno=79.

Volume Manager reports error: Configuration daemon can't speak protocol version
Cause
While attempting to run vxva (the volume manager GUI) with an upgrade from VXVM 2.0 or 2.1 to VXVM 2.3, you receive this message: 

Volume Manager reports error:
Configuration daemon can't speak protocol version 


This message indicates that there is a version mismatch between the version of the volume manager daemon, vxconfigd, and the GUI, vxva, that you are trying to run. For example, you are running the 2.3 version of vxconfigd, and trying to run an old (2.1) version of vxva.

Most likely you are using the wrong path for vxva. For versions 2.1 and below of vxva, the binary can be found in /opt/vxva/bin; but starting with 2.1.1, the location was changed to /opt/SUNWvxva/bin.

If you did not remove the old SUNWvxva package before installing the new 2.3 version (which is normal, since you do not NEED to remove the old package), you probably still have the old /opt/vxva/bin in your $PATH, and, thus, you are attempting to run the older version of vxva.

Action
Run the newer vxva program: /opt/SUNWvxva/bin/vxva. If that remedy does work and you do not get the error message, remove /opt/vxva/bin/vxva from your path statement or remove the old version of vxva and create a symbolic link to the new version with the following two commands: 

# rm /opt/vxva/bin/vxva 
# ln -s /opt/SUNWvxva/bin/vxva /opt/vxva/bin/vxva  


Volume too large for defined data type
Cause
This error occurred when trying to open a database file that was greater than 2 Gbytes in size. You should be able to do this, because the Solaris 2.6 release supports file sizes greater than 2 Gbytes.

Action
It is true that the Solaris 2.6 software supports file sizes greater than 2 Gbytes, but to open a file of that size, you must use a new version of the standard calls. There are 64-bit versions of most system calls and libc functions. For example: open64 instead of open. 

See Also
Refer to the lf64(5) man page.

vxconfigd error: segmentation fault
Cause
When the system boots, the vxconfigd fails to start. It fails with a segmentation fault (core dump). 

vxconfigd error: segmentation fault
	[ vxvm warning: _illegal vminor encountered ] 


Action
Check the date on the system using date(1) (/bin/date or /usr/bin/date). If the date on the system is old (like 1970) or far out in the future (like 2010), vxconfigd core dumps.

Change the date on the system using /bin/date or /usr/bin/date and vxconfigd starts without problems.

vxfs filesystems not mounting
Cause
In this case, the user was unable to mount and was getting uncorrectable error messages from mountall. Below is the individual mount report: 

mount: You don't have a license to run this program 
However, vxserial -p showed the following: 

Feature name: CURRSET [95]     
Number of licenses: 1 (non-floating)     
Expiration date: Sun Jan 18 03:00:00 1998 (22.8 days from now)     
Release Level: 20     
Machine Class: All      

Feature name: RAID [96]     
Number of licenses: 1 (non-floating)     
Expiration date: Sun Jan 18 03:00:00 1998 (22.8 days from now)     
Release Level: 20     
Machine Class: All 


Action
Use vxfsserial -p to see the state of the vxfs license. In this case, it had expired. Unexpired vxfsserial -p output looks similiar to the following: 

Feature name: VXFS [80]     
Number of licenses: 1 (non-floating)     
Expiration date: No expiration date     
Release Level: 22     
Machine Class: 934986342 


vxvm:vxslicer:ERROR unsupported disk layout
Cause
When trying to encapsulate a disk you receive this error.

Action
You must meet the minimum requirements to encapsulate a disk:

You must have two free, zero-length, slices on the disk (no cylinders should be assigned to these slices).

You must have two free cylinders on the disk. These two cylinders must not be in use by any slice other than slice two.

The two free cylinders must be located at the beginning or end of the drive.


"W"
WARNING: add_spec: No major number for sf
Cause
The system prints the following warning message while booting: 

SunOS Release 5.5.1 Version Generic_103640-03 [UNIX(R)
System V Release 4.0]
Copyright (c) 1983-1996, Sun Microsystems, Inc.
WARNING: add_spec: No major number for sf 
The sf(7D) driver is specific for a Sun Enterprise Network Array (SENA), also known as a "photon."

Action
If no SENA is attached to the system, the message can be safely ignored. To stop seeing the message, comment out the last line in /kernel/drv/ssd.conf that references sf(7D).

If you do this, and then later attach a SENA to your system, remember to uncomment this line again.

warning:cachefs:invalid cache version
Cause
While running the Solaris 2.5.1 release and using Adminsuite2.3/Autoclient2.1, the user added 5 autoclients. During startup of the clients, the user received this error message.

Action
The /kernel/fs/cachefs files between server and client are different versions. Cachefs versions on the server and the client should be the same as shown in the following: 

On the server: 

# cd /kernel/fs 
# ls -al cachefs 
-rwxr-xr-x   1 root     sys       229396 Jul 15  1997 cachefs* 
On the client: 

# cd /export/root/clientname/kernel/fs  
# ls -al cachefs 
-rwxr-xr-x   1 root     sys       229396 Jul 15  1997 cachefs*  
solution: load patch 104849-02 or higher 


To solve the problem, load patch 104849-02 or higher.

WARNING: Clock gained int days-- CHECK AND RESET THE DATE!
Cause
Each workstation contains an internal clock powered by a rechargeable battery. After the system is halted and turned off, the internal clock continues to keep time. When the system is powered on and reboots, the system notices that the internal clock has gained time since the workstation was halted.

Action
In most cases, especially if the power has been off for less than a month, the internal clock keeps the correct time, and you do not have to reset the date. Use the date(1) command to check the date and time on your system. If the date or time is wrong, become superuser and use the date(1) command to reset them.

Warning: Could not find matching rule in rules.ok
Cause
After an upgrade to the Solaris 2.5.1 release, jumpstart fails with this message: 

Checking rules.ok file... 
Warning: Could not find matching rule in rules.ok 
This message can occur even if the rules file is known to work, or, after review, it appears to be fine, and the check script has been run.

Action
Remove the rule keyword, network, from the rule file and run the check, again. Jumpstart should run without error.

WARNING: FAN FAILURE check if fans are still spinning
Cause
A SPARCcenterTM 2000/2000E might get one of these error messages, WARNING: FAN FAILURE check if fans are still spinning or WARNING: FAN FAILURE still sensed, displayed on the console screen at any time, with a record of the event in /var/adm/messages.

Action
The error itself is descriptive and self-explanatory, and you might suspect that a hardware problem occurred with the system's blower or fan assembly located at the top-most rear of the system cabinet. 

Upon further investigation you note that the blower is indeed spinning at a good rate. Given that, you should then check to see if the "AC Dist to Blower to Filter to Keyswitch Harness" plug/adapter is plugged in correctly. Two cable assemblies connect the blower assembly to the unit's power supply. One is the "power supply" cable and the other is the "AC Dist to Blower to Filter to Keyswitch Harness." 

Once the harness is securely connected, you see another message, NOTICE: FAN RECOVERED, logged on the system's console screen, or, if missed, it is in /var/adm/messages. 

WARNING: FAN FAILURE still sensed
Refer to "WARNING: FAN FAILURE check if fans are still spinning".

WARNING: No network locking on string: contact admin to install server change
Cause
The mount(1M) command issues this message whenever it mounts a file system that does not have NFS locking, such as a standard SunOS 4.1 exported file sytem. Data loss is possible in applications that depend on locking.

Action
On the remote SunOS 4.1 system, install the appropriate rpc.lockd jumbo patch to implement NFS locking. For the SunOS 4.1.4 system, install patch #102264; for the SunOS 4.1.3 system, install patch #100075; for earlier 4.1 releases, install patch #101817.

WARNING: processor level 4 interrupt not serviced
Cause
This message is basically a diagnostic from the SCSI driver. It can appear on the console every 10 minutes or so.

Action
To reduce the frequency of this message, add this line near the bottom of the /etc/system file and reboot: 

set esp:esp_use_poll_loop=0 


Technical Notes
You might also see this message repeatedly after manually removing a CD when it was busy. Do not do this! To return the system to normal, reboot the system with the -r (reconfigure) option.

WARNING: /tmp: File system full, swap space limit exceeded
Cause
The system swap area (virtual memory) has filled up. You need to reduce swap space consumption by killing some processes or possibly by rebooting the system.

Action
For information about increasing swap space, refer to "Not enough space".

WARNING: TOD clock not initialized-- CHECK AND RESET THE DATE!
Cause
This message indicates that the Time Of Day (TOD) clock reads zero, so its time is the beginning of the UNIX epoch: midnight, 31 December 1969. On a brand-new system, the manufacturer might have neglected to initialize the system clock. On older systems it is more likely that the rechargeable battery has run out and requires replacement.

Action
First replace the battery according to the manufacturer's instructions. Then become superuser and use the date(1) command to set the time and date. On some systems the clock is powered by the same battery as the NVRAM, so a dead battery also causes loss of the machine's Ethernet address and host ID, which are more serious problems for networked systems.

WARNING: Unable to repair the / filesystem. Run fsck
Cause
This message comes at boot time from the /etc/rcS script whenever it gets a bad return code from fsck(1M) after checking a file system. The message recommends an fsck(1M) command line, and instructs you to exit the shell when done to continue booting. Then the script places the system in single-user mode so fsck(1M) can be run effectively.

Action
For information about repairing UFS file systems, refer to "/dev/rdsk/string: UNEXPECTED INCONSISTENCY; RUN fsck MANUALLY.".

For information about repairing non-UFS file systems, refer to "THE FOLLOWING FILE SYSTEM(S) HAD AN UNEXPECTED INCONSISTENCY:".

WARNING: vxvm:vxio: Illegal vminor encountered
Cause
In this case, the message occurred during booting. The system was sharing an SSA1XX with an identical system. The user was also getting an error in disk group configuration copies during booting. The identical system was booting up fine--without error messages. vxconfigd died. A vxprivutil scan of one of the disks indicated the following: 

diskid:  880409237.1043.system_that_comes_up 
hostid: none 


Action
The user quickly applied a vxinstall on both systems: first, on the system that did not successfully boot, and then on the system that did. The user had to run a custom vxinstall, selecting only the disks desired for each system.

Technical Notes

--------------------------------------------------------------------------------
Note - 
The following attempt to resolve the problem failed. 

vxiod set 10 
vxconfigd -m disable 
vxdctl init hostname 
vxdctl enable 


--------------------------------------------------------------------------------

Watchdog Reset
Cause
This fatal error usually indicates some kind of hardware problem. Data corruption on the system is possible. 

Action
Look for some other message that might help diagnose the problem. By itself, a watchdog reset does not provide enough information; because traps are disabled, all information has been lost. If all that appears on the console is an ok prompt, issue the following PROM command to view the final messages that occurred just before system failure: 

ok f8002010 wector p 
Yes, that word is wector, not vector.

The result is a display of messages similar to those produced by the dmesg(1M) command. These messages can be useful in finding the cause of system failure.

Technical Notes
This message does not come from the kernel, but from the OpenBoot PROM monitor, a piece of Forth software that gives you the ok prompt before you boot UNIX. If the CPU detects a trap when traps are disabled (an unrecoverable error), it signals a watchdog. The OpenBoot PROM monitor detects the watchdog, issues this message, and shuts down the system.

Who are you?
Cause
Many networking programs can print this message, including from(1B), lpr(1B), lprm(1B), mailx(1), rdist(1), sendmail(1M), talk(1), and rsh(1). The command prints this message when it cannot locate a password file entry for the current user. This error might occur if a user logged in just before the superuser deleted that user's password entry, or if the network naming service fails for a user who has no entry in the local password file.

Action
If a user's password file entry was accidentally deleted, restore it from backups or from another password file. If a user's login name or user ID was changed, ask that user to log out and log in again. If the network naming service failed, check the NIS server(s) and repair or reboot as necessary.

Technical Notes
A known problem exists with starting hundreds of rsh(1) processes on another machine. This message appears because rsh(1) hangs while binding to a reserved port and responds too slowly to interact with the network naming service.

Window Underflow
Cause
This message often occurs at boot time, sometimes along with a Watchdog Reset error. It comes from the OpenBoot PROM monitor, which was passed a processor trap from the hardware. This error indicates that some program tried to access a register window that was not accessible from the processor.

Action
On some system architectures the problem could be that different capacity memory chips are mixed together. Someone might have placed 1-Mbyte SIMMs in the same bank with 4-Mbyte SIMMs. If this is so, rearrange the memory chips. Make sure to put higher-capacity SIMMs in the first bank(s), and lower-capacity SIMMs in the remaining bank(s); never mix different capacity SIMMs in the same bank.

The problem could also be that cache memory on the motherboard has gone bad and needs replacement. If main memory is installed correctly, try swapping the motherboard.

Technical Notes
The best way to isolate the problem is to look at the %pc register to see where it got its arguments, and why the arguments were bad. If you can reproduce the condition causing this message, your system vendor might be able to help diagnose the problem.

"X"
X connection to string:0.0 broken (explicit kill or server shutdown).
Cause
This error means that the client has lost its connection to the X server. The "0.0" represents the display device, which is usually the console. This message can appear when a user is running an X application on a remote system with the DISPLAY set back to the original system and the remote system's X server disappears, perhaps because someone exited X windows or rebooted the machine. It sometimes appears locally when a user exits the window system. Data loss is possible if applications were killed before saving files.

Action
Try to run the application again in a few minutes after the system has rebooted and the window system is running.

xinit: not found
Cause
The OpenWindows environment was probably not installed properly, and the openwin(1) program could not find xinit(1) to start the X windows system. If the user is running another version of X windows, such as the MIT X11 distribution, the startx program serves the same function as xinit(1).

Action
Check the PATH environment variable to make sure it contains the appropriate X windows install directory. Verify that xinit(1) is in this directory as an executable program.

XIO: fatal IO error 32 (Broken pipe) on X server "string:0.0"
Cause
This error means that I/O with the X server has been broken. The 0.0 represents the display device, which is usually the console. This message can appear when a user is running Display PostScript applications and the X server disappears or the client is shut down. Data loss is possible, if applications disappeared before saving files.

Action
Try to run the application again in a few minutes after the system has rebooted and the window system is running.

Xlib: connection to "string:0.0" refused by server
Cause
This message is immediately followed by the Xlib: Client is not authorized to connect to Server message. These messages indicate that an X windows application tried to run on the X server specified inside double quotes, which did not allow the request. The 0.0 represents the display device, which is usually the console. If no servername appears, the superuser probably tried to run an X application on the current machine in an X session that was owned by somebody else.

Action
To allow this client to connect to the X server, run xhost(1) +clientname on the X server system. Only the owner of the current X session (who is not necessarily the superuser) is allowed to run the xhost(1) command. If somebody else is running X windows on the server, ask them to log out and then start your own X session on that server; remote X connections are usually allowed for the same user ID.

Xlib: extension "GLX" missing on display "0.0"
Cause
Install the OpenGL� 1.0 software and test the configuration by running /usr/openwin/demo/GL/ogl_install_check, which provides the following results: 

# ./ogl_install_check
    Xlib:  extension "GLX" missing on display "0.0".
    Xlib:  extension "GLX" missing on display "0.0".
    Xlib:  extension "GLX" missing on display "0.0".
    can't find visual 


Action
First check that the installation has worked correctly by running the package check utility on the runtime package: pkgchk SUNWglrt. This should result in an error message like this: 

ERROR: /usr/openwin/server/etc/OWconfig
file size <187> expected <5423> actual
file cksum <14394> expected <27045> actual 
(The numbers might be different, but there should be only one file.) If other errors occur, re-install OpenGL, especially the SUNWglrt package.

Assuming that is fine, look at the process owner for the Xsun process using the following: 

# ps -aef | grep Xsun | grep -v grep
nobody 20022   225  0 11:36:22 ?        0:34 /usr/openwin/bin/Xsun :0 -nobanner  
If the owner is not root, that is most likely the problem. There is a permissions issue loading the graphic pipelines.

If you are using CDE, ensure that the Xservers file has this form: 

:0 Local local_uid@console root /usr/openwin/bin/Xsun :0 -nobanner  
The Xservers file can be found in /usr/dt/config, if you have not done any customization. Otherwise, it can more than likely be found in /etc/dt/config/. Additional arguments after the -nobanner option are acceptable. 

Another way of proving this is to run the OpenWindows environment from the command line as root. It ensures that the Xsun process is owned by root.

Another possibility is that the system is NOT a Creator 3D. You can only run OpenGL 1.0 on an Ultra machine with a Creator 3D graphics card. If you install this application on an Ultra machine with a Creator framebuffer and NOT a Creator 3D, you see these same error messages.

xntpd: clnt_dg_create: out of memory
Cause
At boot time, the error occurs after configuring NTP. Except for the error, everything seems to be working properly.

Action
As a workaround, move the script for xntpd from S74xntpd to S77xntpd, so it starts after S76nscd.

xterm: fatal IO error 32 (Broken Pipe) or KillClient on X server "string:0.0"
Cause
This error means that xterm(1) has lost its connection to the X server. The 0.0 represents the display device, which is usually the console. This message can appear when a user is running xterm and the X server disappears or the client is shut down. Data loss is possible if applications were killed before saving files.

Action
Try to run the terminal emulator again in a few minutes after the system has rebooted and the window system is running.

XView warning: Cannot load font set 'string' (Font Package)
Cause
This message from the XView library warns that a requested font is not installed on the X server. Often multiple warnings are displayed for the same font. The set of available fonts can vary from release to release.

Action
To see which fonts are available on the X server, run the xlsfonts(1) program. Then specify another font name that you see in the output of xlsfonts(1). Sometimes you can locate a similar font from a different vendor.

Technical Notes
Two packages of X windows fonts are: the common but not required fonts (SUNWxwcft), and the optional fonts (SUNWxwoft). Run pkginfo(1) to see if both packages are installed, and add them to the system as you desire.

"Y"
yp_all RPC clnt_call (transport level) failure
Cause
At random times, a slave NIS server has a problem that causes ypbind(1M) to report ypserver not responding, and the machine must be rebooted. The syslog file contains the following: 

Dec 14 07:11:03 rahab syslog: yp_all - 
RPC clnt_call (transport level) failure:
RPC: Unable to receive; An event requires attention 


Action
As a workaround, increase the file descriptor limit in the yp startup script, /etc/rc2.d/S71rpc. Add this command to the script before ypserv is started: 

ulimit -n 256 


ypbind[int]: NIS server for domain "string" OK
Cause
This message appears after an NIS server not responding message to indicate that ypbind(1M) is able to communicate with an NIS server again.

Action
Proceed with your work. This message is strictly informational.

ypbind[int]: NIS server not responding for domain "string"; still trying
Cause
This means that the NIS client daemon ypbind(1M) cannot communicate with an NIS server for the specified domain. This message appears when a workstation running the NIS naming service has become disconnected from the network, or when NIS servers are down or extremely slow to respond.

Action
If other NIS clients are behaving normally, check the Ethernet cabling on the workstation that is getting this message. Note the following differences between architectures: 

On SPARC machines, disconnected network cabling also produces a series of no carrier messages.

On IA machines, the NIS+ messages might be the only indication that network cabling is disconnected.


If many NIS clients on the network are giving this message, go to the NIS server in question and reboot or repair as necessary. To locate the NIS server for a domain, run the ypwhich(1) command. When the server machine returns to operation, NIS clients give an NIS server for domain OK message.

See Also
For more information about ypbind(1M), see the section on administering secure NFS in the System Administration Guide, Volume 3.

ypserv[int]: restarting resolv server. old one not responding
Cause
In this instance, the NIS Server, which had been upgraded from version 2.5.1 to version 2.6, was repeating this error message every ten minutes. Also, the Server was less frequently repeating the following message: 

rpc.nisd_resolv[7472]: svc_getreqset: no transport handle for fd2 
The SUNWypu and SUNWypr packages had been installed.

Action
Install Patch-ID# 105552-01. Also, set B= in the Makefile. Run make again to recreate the maps on the following: 

#B=-b 
B=You might also need to remove the -d option from the ypserv command in the /usr/lib/netsvc/yp/ypstart script. Then, you must reboot the machine.

ypwhich: can't communicate with ypbind
Cause
This message from the ypwhich(1) command indicates that the NIS binder process ypbind(1M) is not running on the local machine.

Action
If the system is not configured to use NIS, this message is normal and expected. Configure the system to use NIS if necessary.

If the system is configured to use NIS, but the ypbind(1M) process is not running, invoke the following command to start it up: 

# /usr/lib/netsvc/yp/ypbind -broadcast 

"Z"
zsint: silo overflow
Cause
This message means that the Zilog 8530 character input silo (or serial port FIFO) overflowed before it could be serviced. The zs(7D) driver, which talks to a Zilog Z8530 chip, is reporting that the FIFO (holding about two characters) has been overrun. The number after zs(7D) shows which serial port experienced an overflow: 

zs0 - tty serial port 0 (/dev/ttya)
zs1 - tty serial port 1 (/dev/ttyb)
zs2 - keyboard port (/dev/kbd)
zs3 - mouse port (/dev/mouse) 


Action
Silo overflows indicate that data in the respective serial port FIFO have been lost. However, the consequences of silo overflows might be negligible if the overflows occur infrequently, if data loss is not catastrophic, or if data can be recovered or reproduced. For example, although a silo overflow on the mouse driver (zs3) indicates that the system could not process mouse events quickly enough, the user can perform mouse motions again. Similarly, lost data from a silo overflow on a serial port with a modem connection transferring data using uucp(1C) is recovered when uucp(1C) discovers the loss of data and requests retransmission of the corrupted packet.

Frequent silo overflow messages can indicate a zs(7D) hardware FIFO problem, a serial driver software problem, or abnormal data or system activity. For example, the system ignores interrupts during system panics, so mouse and keyboard activity result in silo overflows.

If the serial ports experiencing silo overflows are not being used, a silo overflow could indicate the onset of a hardware problem. 

Technical Notes
Another type of silo overflow is one that occurs during reboot, when an HDLC line is connected to any of the terminal ports. For example, an X.25 network could be sending frames before the kernel has been told to expect them. Such overflow messages can be ignored.


##############################################################

SECTION 7: Some Solaris threads dealing on errors:

##############################################################


-------
Note:
-------

thread:


TNS 12546, 12560, 00516, Solaris Error 13 

$ lsnrctl start erp11i


LSNRCTL for Solaris: Version 10.2.0.3.0 - Production on 30-DEC-2008 17:16:23


Copyright (c) 1991, 2006, Oracle.  All rights reserved.


Starting /erp11i/oracle/10.2.0/bin/tnslsnr: please wait...


TNSLSNR for Solaris: Version 10.2.0.3.0 - Production System parameter file is /erp11i/oracle/10.2.0/network/admin/erp11i_erp11i/listener.ora
Log messages written to /erp11i/oracle/10.2.0/network/admin/erp11i.log
Error listening on: (ADDRESS=(PROTOCOL=IPC)(KEY=EXTPROCerp11i))
TNS-12546: TNS:permission denied
 TNS-12560: TNS:protocol adapter error
  TNS-00516: Permission denied
   Solaris Error: 13: Permission denied


Listener failed to start. See the error message(s) above...


I did a truss lsnrctl start erp11i and found this in the result:

uname(0xFFFFFFFF7F2DACC8)                       = 1
access("/var/tmp/.oracle", F_OK)                = 0
chmod("/var/tmp/.oracle", 01777)                Err#1 EPERM [ALL]
so_socket(PF_UNIX, SOCK_STREAM, 0, "", SOV_DEFAULT) = 4 access("/var/tmp/.oracle/sEXTPROCerp11i", F_OK) = 0 connect(4, 0xFFFFFFFF7FFF7AE0, 110, SOV_DEFAULT) Err#146 ECONNREFUSED access("/var/tmp/.oracle/sEXTPROCerp11i", F_OK) = 0 pollsys(0x00000000, 0, 0xFFFFFFFF7FFF7910, 0x00000000) = 0
close(4)                                        = 0
so_socket(PF_UNIX, SOCK_STREAM, 0, "", SOV_DEFAULT) = 4 connect(4, 0xFFFFFFFF7FFF7AE0, 110, SOV_DEFAULT) Err#146 ECONNREFUSED access("/var/tmp/.oracle/sEXTPROCerp11i", F_OK) = 0 pollsys(0x00000000, 0, 0xFFFFFFFF7FFF7910, 0x00000000) = 0
close(4)                                        = 0
so_socket(PF_UNIX, SOCK_STREAM, 0, "", SOV_DEFAULT) = 4 connect(4, 0xFFFFFFFF7FFF7AE0, 110, SOV_DEFAULT) Err#146 ECONNREFUSED access("/var/tmp/.oracle/sEXTPROCerp11i", F_OK) = 0


I checked the ownership of /var/tmp/.oracle/sEXTPROCerp11i :


$ ls -ld /var/tmp/.oracle/sEXTPROCerp11i
srwxrwxrwx   1 oraprod    dbaprod          0 Oct 25 20:05 /var/tmp/.oracle/sEXTPROCerp11i


The correct owner for this instance was oraerp:dbaerp


$ file /var/tmp/.oracle/sEXTPROCerp11i
/var/tmp/.oracle/sEXTPROCerp11i:      socket


Because this socket is owned by oraprod:dbaprod, the socket can't be accessed by oraerp. 


The simple solution is to login as oraerp:


sudo -u oraerp -i
rm /var/tmp/.oracle/sEXTPROCerp11i
exit
sudo -u oraerp


$ lsnrctl start erp11i


LSNRCTL for Solaris: Version 10.2.0.3.0 - Production on 30-DEC-2008 18:10:00


Copyright (c) 1991, 2006, Oracle.  All rights reserved.


Starting /erp11i/oracle/10.2.0/bin/tnslsnr: please wait...


TNSLSNR for Solaris: Version 10.2.0.3.0 - Production System parameter file is /erp11i/oracle/10.2.0/network/admin/erp11i_erp11i/listener.ora
Log messages written to /erp11i/oracle/10.2.0/network/admin/erp11i.log
Listening on: (DESCRIPTION=(ADDRESS=(PROTOCOL=ipc)(KEY=EXTPROCerp11i)))
Listening on: (DESCRIPTION=(ADDRESS=(PROTOCOL=tcp)(HOST=tsgsd1003.energy.ge.com)(PORT=1589)))


Connecting to (ADDRESS=(PROTOCOL=IPC)(KEY=EXTPROCerp11i))
STATUS of the LISTENER
------------------------
Alias                     erp11i
Version                   TNSLSNR for Solaris: Version 10.2.0.3.0 - Production
Start Date                30-DEC-2008 18:10:01
Uptime                    0 days 0 hr. 0 min. 0 sec
Trace Level               off
Security                  ON: Local OS Authentication
SNMP                      OFF
Listener Parameter File   /erp11i/oracle/10.2.0/network/admin/erp11i_erp11i/listener.ora
Listener Log File         /erp11i/oracle/10.2.0/network/admin/erp11i.log
Listening Endpoints Summary...
  (DESCRIPTION=(ADDRESS=(PROTOCOL=ipc)(KEY=EXTPROCerp11i)))
  (DESCRIPTION=(ADDRESS=(PROTOCOL=tcp)(HOST=erp11i.justanexample.com)(PORT=1589)))
Services Summary...
Service "PLSExtProc" has 1 instance(s).
  Instance "PLSExtProc", status UNKNOWN, has 1 handler(s) for this service...
Service "erp11i" has 1 instance(s).
  Instance "erp11i", status UNKNOWN, has 1 handler(s) for this service...
The command completed successfully


The issue stands resolved after this.


-------
Note:
-------

thread:

Q:

solaris error BAD SUPER BLOCK 

--------------------------------------------------------------------------------

I want mount a disk. I have this error. I'm trying to correct with the superblock but i have the same error. Look my procedure.

bash-2.03# fsck -F ufs /dev/rdsk/c0t1d0s0
Alternate super block location: 9423392.
** /dev/rdsk/c0t1d0s0
BAD SUPER BLOCK: MAGIC NUMBER WRONG
USE AN ALTERNATE SUPER-BLOCK TO SUPPLY NEEDED INFORMATION;


bash-2.03# newfs -N /dev/rdsk/c0t1d0s0
/dev/rdsk/c0t1d0s0: 17682084 sectors in 4924 cylinders of 27 tracks, 133 sectors
8633.8MB in 308 cyl groups (16 c/g, 28.05MB/g, 3392 i/g)
super-block backups (for fsck -F ufs -o b=#) at:
32, 57632, 115232, 172832, 230432, 288032, 345632, 403232, 460832, 518432,
576032, 633632, 691232, 748832, 806432, 864032, 921632, 979232, 1036832,

bash-2.03# fsck -F ufs -o b=32 /dev/rdsk/c0t1d0s0
Alternate super block location: 32.
** /dev/rdsk/c0t1d0s0
BAD SUPER BLOCK: NUMBER OF DIRECTORIES OUT OF RANGE
USE AN ALTERNATE SUPER-BLOCK TO SUPPLY NEEDED INFORMATION;
eg. fsck [-F ufs] -o b=# [special ...]
where # is the alternate super block. SEE fsck_ufs(1M).
bash-2.03# fsck -F ufs -o b=57632 /dev/rdsk/c0t1d0s0
Alternate super block location: 57632.
** /dev/rdsk/c0t1d0s0
BAD SUPER BLOCK: MAGIC NUMBER WRONG
USE AN ALTERNATE SUPER-BLOCK TO SUPPLY NEEDED INFORMATION;
eg. fsck [-F ufs] -o b=# [special ...]
where # is the alternate super block. SEE fsck_ufs(1M).
bash-2.03# fsck -F ufs -o b=17468960 /dev/rdsk/c0t1d0s0
Alternate super block location: 17468960.
** /dev/rdsk/c0t1d0s0
BAD SUPER BLOCK: MAGIC NUMBER WRONG
USE AN ALTERNATE SUPER-BLOCK TO SUPPLY NEEDED INFORMATION;
eg. fsck [-F ufs] -o b=# [special ...]
where # is the alternate super block. SEE fsck_ufs(1M).

A:


-------
Note:
-------

thread:

Bug ID  6401066  
Synopsis  "IRQ 10 is shared with different levels" is misleading or incorrect.  
State  11-Closed:Will Not Fix (Closed)  
Category:Subcategory  kernel:ddi  
Keywords  opensolaris  
Responsible Engineer  Surya Prakki  
Reported Against  snv_28  
Duplicate Of   
Introduced In   
Commit to Fix   
Fixed In   
Release Fixed   
Related Bugs   
Submit Date  20-MAR-2006  
Last Update Date  24-MAR-2009  
Description  Category
   driver
Sub-Category
   aac
Description
   I get a message during booting:
 Warning: IRQ 10 is shared by different drivers with different
levels ...
The message suggests that the system may not perform
efficiently. So I started to investigate.
Maybe mixture of edge-triggered or level-triggered.
I originally thought this is a good warning, 
but meaning not clear enough.
As I checked the PC using a different installaion of linux, I found that
the IRQ is shared with three devices.
Intell EtherExpress 100.
Adaptec AHA29160
and a USB interface card (that uses VIA.)
All of them, however, seem to use level-triggered interrupt (low-level trigger).
So the message doesn't seem to be quite correct.
(Or is it the case, that solaris's driver for these
cards try to impose different polarity for interrupt detection !?)
Frequency
   Always
Regression
   No
Steps to Reproduce
   As above. Always the message appears during booting.
Expected Result
   Correct Interrupt configuration detection.
Actual Result
   It seems that Solaris 10 nv detects the interrupt configuration 
in a slightly incorrect manner?!
Error Message(s)
   
Test Case
   
Workaround
   
Submitter wants to work on bug
   No
Additional configuration information 


-------
Note:
-------

Bug ID  6271471  
Synopsis  libaio hang because of missing AIONOTIFY to kernel thread  
State  10-Fix Delivered (Fix available in build)  
Category:Subcategory  library:libaio  
Keywords  AIONOTIFY | aio_suspend | no-s9+  
Responsible Engineer  Surya Prakki  
Reported Against   
Duplicate Of   
Introduced In   
Commit to Fix  solaris_8  
Fixed In  s8patch  
Release Fixed  solaris_8(s8patch)  
Related Bugs  6310825  
Submit Date  17-MAY-2005  
Last Update Date  24-MAR-2009  
Description  See comments.
 xxxxx@xxxxx  2005-05-17 09:07:07 GMT 
Work Around  N/A 


-------
Note:
-------

This is a list of the 25 bugs with the most votes. This list is compiled on a daily basis, so there may be discrepancies 
between the vote counts in the list below and the vote counts shown in the individual bug detail pages.

Votes Bug ID  Synopsis 
 
647 4244499 ZipEntry() does not convert filenames from Unicode to platform 
358 4109888 Semantics of external process is not defined in JLS 
198 4265778 Java2D incorrectly renders objects with large coordinates 
158 4071957 (reflect) Method.invoke access control does not understand inner class scoping 
151 4717969 (process) Control-C does not end forked Java process (w2k, wnt) 
144 4957990 PermHeap bloat in and only in server VM 
138 6434149 (cl) ClassLoader.loadClass() throws java.lang.ClassNotFoundException: [Ljava.lang.String; in JDK 6.0 
127 4330950 Lost newly entered data in the cell when resizing column width 
119 6245070 JMStudio doesn't playback any .avi file correctly, video always fail 
90 6635462 D3D: REGRESSION: XOR rendering is extremly slow 
80 6203567 NameService resolving in Applet causes AccessControlException and deadlock 
80 6372808 JFileChooser takes a long time to instantiate, at least the first time 
77 4171239 File.deleteOnExit() does not work on open files (win32) 
76 4816922 No way to set drag icon: TransferHandler.getVisualRepresentation() is not used 
76 4723383 Incomplete RTF support in javax.swing.text.rtf.RTFEditorKit 
75 4787931 System property "user.home" does not correspond to "USERPROFILE" (win) 
75 4743225 Size of JComboBox list is wrong when list is populated via PopupMenuListener 
72 6262392 Problem with dismissing dialogs on CDE with XToolkit 
69 6429812 NPE after calling JTable.updateUI() when using a header renderer + XP L&F 
67 4770092 (process) Process.destroy does not kill multiple child processes 
62 4290274 (timer) java.util.Timer.scheduleAtFixedRate() fails if the system time is changed 
54 4677493 REGRESSION: java.sql.Timestamp.getTime() returns wrong value with GMT 
50 6506617 Keyboard-lock in swing program on Linux box 
50 6476706 Error AGENT_ERROR_NO_JNI_ENV printed sometimes to console when JVM finishes 
48 4267450 (cal) API: Need public API to calculate, format and parse "year of week" 


-------
Note:
-------

Bad magic number

Bad magic number error indicates the system is mostly likely having trouble accessing VTOC 
(Volumne Table of Contents) This just indicates that the partition is in a strange state.

What to do when you saw bad magic number errors in Solaris? If you know the alternate superblocks, 
then you can fsck the partition. If you don�t know the alternate superblocks and created the partition 
using newfs command, then you can use the �-N� option to print out the superblocks without actually 
recreate the file system. For example:

newfs -N /dev/rdsk/c0t0d0s0

�format� tool might be able to correctly label the disk with valid VTOC. 
Relabel your disks essentially repartition the disks and existing data will be lost. 
A recommended approach is to run analyze feature in format utility. Analyze will try 
to verify or repair a bad sector on a disk. Please note that bad magic number errors doesn�t 
always translate to bad sectors. The physical disk might be working perfectly without errors.


------
Note:
------

03/04/2009:


Solaris Releases
The following Solaris releases are currently shipping or no longer shipping but still supported by Sun. 


--------------------------------------------------------------------------------

Most Current Release
Solaris 10 Operating System (OS) 
As with earlier versions, several Solaris 10 update releases are planned to come out prior to the next Solaris version. 
The latest update release is Solaris 10 10/08. 

Note 1: Previous versions of Solaris 10 are no longer necessary as the latest release 
includes support for all Solaris 10 supported platforms and includes important enhancements 
not found in earlier releases. 

Note 2: Certain systems or configurations may still require or recommend use of the previous 
Solaris 10 updates which are still available for download as DVD (SPARC, x64/x86) or CD (SPARC, x64/x86) images:

Solaris 10 3/05

SPARC DVD    SPARC CD     x64/x86 DVD    x64/x86 CD 

Solaris 10 1/06

SPARC DVD    SPARC CD    x64/x86 DVD    x64/x86 CD 

Solaris 10 6/06

SPARC DVD    SPARC CD    x64/x86 DVD    x64/x86 CD 

Solaris 10 11/06

SPARC DVD    SPARC CD    x64/x86 DVD    x64/x86 CD 

Solaris 10 8/07

SPARC DVD    SPARC CD     x64/x86 DVD    x64/x86 CD 

Solaris 10 5/08

SPARC DVD    SPARC CD     x64/x86 DVD    x64/x86 CD 


--------------------------------------------------------------------------------

Other Shipping Releases
Solaris 9 Operating Environment 
The latest update is Solaris 9 9/05. 
A special version, Solaris 9 9/05 HW is available in both DVD or CD downloads to support new SPARC systems: 
Sun Ultra 25 and 45 Workstations and Sun Fire V215, V245 and V445 servers. 

To purchase applicable licenses, please contact Sun.


--------------------------------------------------------------------------------

Non-Shipping Releases that are Supported

Solaris 8 Operating Environment

The latest update is Solaris 8 2/04. 
The Solaris 8 Operating Environment stopped shipping on February 16, 2007 and is currently at the 
Vintage Phase I support level. 
Transition to Vintage Phase II support level will be on March 31, 2009 
End of service life will be March 31, 2012 
For further information, please see: http://www.sun.com/software/solaris/support/sol8.xml 
Solaris 7 Operating Environment

The latest update is Solaris 7 11/99. 
The Solaris 7 Operating Environment stopped shipping on August 15, 2003 and is currently at the 
Vintage Phase II support level. 
End-of-support will be August 15, 2008. 


------
Note:
------


------
Note:
------


------
Note:
------


##############################################################

SECTION 8: GENERIC: FILESYSTEM ERRORS:

##############################################################


----------------------------------------------------------------------------------------
Note 1.1         : Possible way how to save files from A corrupt directory
Works on OS      : all unix
probable message : ksh: Invalid file system control data detected:
----------------------------------------------------------------------------------------


>>>> Question:

Anybody recognize this? This directory seems to be missing the ".", I can't 
umount, can't remove the directory, can't copy a good directory over it, 
etc. 

spiderman# cd probes 
spiderman# pwd 
/opt/diagnostics/probes 
spiderman# ls -la 
ls: 0653-341 The file . does not exist. 
spiderman# cd .. 
spiderman# ls -la probes 
ls: probes: Invalid file system control data detected. 
total 0 
spiderman# 

spiderman# fuser /opt 
/opt: 
spiderman# umount /opt 
umount: 0506-349 Cannot unmount /dev/hd10opt: The requested resource is 
busy. 
spiderman# umount /dev/hd10opt 
umount: 0506-349 Cannot unmount /dev/hd10opt: The requested resource is 
busy. 

spiderman# fsck /opt 

** Checking /dev/hd10opt (/opt) MOUNTED FILE SYSTEM; WRITING SUPPRESSED; 
Checking a mounted filesystem does not produce dependable results. 
** Phase 1 - Check Blocks and Sizes 
** Phase 2 - Check Pathnames 
DIRECTORY CORRUPTED (NOT FIXED) 
DIRECTORY CORRUPTED (NOT FIXED) 
Directory /diagnostics/probes, '.' entry is missing. (NOT FIXED) 
Directory /diagnostics/probes, '..' entry is missing. (NOT FIXED) 
** Phase 3 - Check Connectivity 
** Phase 4 - Check Reference Counts 
link count directory I@98 owner=bin mode$0755 
sizeQ2 mtime=May 13 14:54 2005 
count 3 should be 2 (NOT ADJUSTED) 
link count directory I@99 owner=bin mode$0755 
size24 mtime=Jan 10 13:45 2005 
count 2 should be 1 (NOT ADJUSTED) 
Unreferenced file IA06 owner=bin mode0555 
sizee56 mtime=Jul 07 14:25 2004 (NOT RECONNECTED) 
Unreferenced file IA06 (NOT CLEARED) 
Unreferenced file IA07 owner=bin mode0555 
size)12 mtime=Jul 07 14:25 2004 (NOT RECONNECTED) 
etc....


>>>> Answer:

Some good news here. Yes, your directory is hosed, but the important 
things is that all a directory is a repository for storing inode numbers 
and associated (human readable) file names. Since fsck is so nicely 
generating all of those now currently inaccessible inode numbers, a find 
command can be used to move them into a new directory. Once the old 
directory is empty, you can (hopefully) rm -r it. 

Here's what you need to do. 

a) Get all the inode numbers generated from your fsck 
b) put them into a variable (e.g. lost_inodes="4099 4106....etc." 
c) Make a target directory for the lost inodes to be moved into: 
mkdir /tmp/recovery 
d) cd into your problem File System: 
cd /opt 
d) Run a loop using find: 

for i in ${lost_inodes} 
do 
find . -inum ${i} mv * /tmp/recovery \; 
echo "Moved and recovered inode # ${i}" 
done 

That should do it. Let me know if it works ok! BTW, the new "file 
name" should be the inode number of the file. You will have to rename 
the files as needed. 


Note that this mehod saved the files from the corrupt directory.


----------------------------------------------------------------------------------------
Note 1.2         : A superblock issue
Works on OS      : all unix
probable message : probably fsck gives you a message
disks            : local disks, most likely not SAN
----------------------------------------------------------------------------------------


>>>> Method 1:

Use this command in case the superblock is corrupted. This will restore the BACKUP COPY of the superblock 
to the CURRENT copy.

# dd count=1 bs=4k skip=31 seek=1 if=/dev/hd4 of=/dev/hd4    (hd4 is an example)

# fsck /dev/hd4 2>&1 | tee /tmp/fsck.errors

OR

>>>>> Method 2:

If you have a dirty superblock you might try to do �fsck�. If this does not work try the following (This procedure does not promise 100% success).
(The following example relats to a bad filesystem in slv4.0)

1. Copy the original Superblock into a file (calld sd0 in /tmp - places can be changed):
dd if=/dev/rslv4.0 of=/tmp/sb0 bs=4k count=1 skip=1

Note: if=Input File, of=Output file, bs=Block Size.

2. Copy the backup Superblock into a file (calld sd1 in /tmp - places can be changed):
dd if=/dev/rslv4.0 of=/tmp/sb1 bs=4k count=1 skip=31

3. Copy the Backup Superblock file over the original Superblock:
dd if=/tmp/sb1 of=/dev/rslv4.0 bs=4k seek=1

4. Do �fsck� again on this filesystem

Note:
If you want to restore the original Superblock, do:
dd if=/tmp/sb0 of=/dev/rslv4.0 bs=4k seek=1


----------------------------------------------------------------------------------------
Note 1.3         : A superblock issue
Works on OS      : AIX
probable message : probably fsck gives you a message
disks            : local disks, most likely not SAN
----------------------------------------------------------------------------------------


>>>> Method 1:

-- Fixing a corrupted magic number in the file system superblock.

If the superblock of a file system is damaged, the file system cannot be accessed. You can fix a 
corrupted magic number in the file system superblock.

Most damage to the superblock cannot be repaired. The following procedure describes how to repair a superblock 
in a JFS file system when the problem is caused by a corrupted magic number. If the primary superblock is corrupted 
in a JFS2 file system, use the fsck command to automatically copy the secondary superblock and repair the primary 
superblock.

In the following scenario, assume /home/myfs is a JFS file system on the physical volume /dev/lv02.

The information in this how-to was tested using AIX� 5.2. If you are using a different version or level of AIX, 
the results you obtain might vary significantly. 

1. Unmount the /home/myfs file system, which you suspect might be damaged, using the following command: 

# umount /home/myfs

2. To confirm damage to the file system, run the fsck command against the file system. For example: 

# fsck -p /dev/lv02

If the problem is damage to the superblock, the fsck command returns one of the following messages: 

fsck: Not an AIXV5 file system
OR 
Not a recognized filesystem type

3. With root authority, use the od command to display the superblock for the file system, 
as shown in the following example: 

# od -x -N 64 /dev/lv02 +0x1000

Where the -x flag displays output in hexadecimal format and the -N flag instructs the system to format 
no more than 64 input bytes from the offset parameter (+), which specifies the point in the file where 
the file output begins. The following is an example output: 

0001000  1234 0234 0000 0000 0000 4000 0000 000a
0001010  0001 8000 1000 0000 2f6c 7633 0000 6c76
0001020  3300 0000 000a 0003 0100 0000 2f28 0383
0001030  0000 0001 0000 0200 0000 2000 0000 0000
0001040

In the preceding output, note the corrupted magic value at 0x1000 (1234 0234). If all defaults were taken 
when the file system was created, the magic number should be 0x43218765. If any defaults were overridden, 
the magic number should be 0x65872143. 

4. Use the od command to check the secondary superblock for a correct magic number. An example command 
and its output follows: 

# od -x -N 64 /dev/lv02 +0x1f000

001f000  6587 2143 0000 0000 0000 4000 0000 000a
001f010  0001 8000 1000 0000 2f6c 7633 0000 6c76
001f020  3300 0000 000a 0003 0100 0000 2f28 0383
001f030  0000 0001 0000 0200 0000 2000 0000 0000
001f040

Note the correct magic value at 0x1f000. 

5. Copy the secondary superblock to the primary superblock. An example command and output follows: 

# dd count=1 bs=4k skip=31 seek=1 if=/dev/lv02 of=/dev/lv02

dd: 1+0 records in.
dd: 1+0 records out.

Use the fsck command to clean up inconsistent files caused by using the secondary superblock. For example: 

# fsck /dev/lv02 2>&1 | tee /tmp/fsck.errs

For more information

The fsck and od command descriptions in AIX 5L Version 5.3 Commands Reference, Volume 4 
AIX Logical Volume Manager from A to Z: Introduction and Concepts, an IBM Redbook 
AIX Logical Volume Manager from A to Z: Troubleshooting and Commands, an IBM Redbook 
"Boot Problems" in Problem Solving and Troubleshooting in AIX 5L, an IBM Redbook 


OR

>>>>> Method 2:

If you experience a dirty superblock, which causes a filesystem to be 
not mountable, you can use backup copy of superblock to copy it over the 
corrupted one. 


With little unix experience it can be a tough task, because the steps 
required are as follows: 


- boot from bootable media (install cd/tape, mksysb tape) 
- access rootvg before mounting fs 
- fsck -y on corrupted fs's 
- logform on logdevice 
- dd count=1 bs=4k skip=31 seek=1 if=/dev/<corrupted_lv> of=/dev/<corrupted_lv> 


----------------------------------------------------------------------------------------
Note 1.3         : A superblock issue
Works on OS      : Solaris
probable message : probably fsck gives you a message
disks            : local disks, most likely not SAN
----------------------------------------------------------------------------------------

>>>> Method 1:

Boot from OK prompt to single user mode, for example from CDROM

OK boot cdrom -sw
 

Attempt to fsck(1M) boot disk. This could fail with a super block error. 

# fsck /dev/rdsk/device

Find the locations of alternate super blocks. BE SURE TO USE AN UPPERCASE -N. For example: 

# newfs -N /dev/rdsk/c0t0d0s0
/dev/rdsk/c0t0d0s0:     2048960 sectors in 1348 cylinders of 19 tracks, 
80 sectors 1000.5MB in 85 cyl groups (16 c/g, 11.88MB/g, 5696 i/g)
super-block backups (for fsck -F ufs -o b=#) at:
32, 24432, 48832, 73232, 97632, 122032, 146432, 170832, 195232, 219632,
244032, 268432, 292832, 317232, 341632, 366032, 390432, 414832, 439232,
463632, 488032, 512432, 536832, 561232, 585632, 610032, 634432, 658832,
683232, 707632, 732032, 756432, 778272, 802672, 827072, 851472, 875872,
900272, 924672, 949072, 973472, 997872, 1022272, 1290672, ... 


Using an alternate super block, run fsck(1M) on the disk. You might have to try more than one alternate super block 
to make this to work. Pick a couple from the beginning, the middle, and the end. 

# fsck -o b=<altblk> /dev/rdsk/c0t0d0s0 


The boot block is probably bad too. Restore it while you are booted from the CD-ROM. 

# /usr/sbin/installboot /usr/platform/architecture/lib/fs/ufs/bootblk 
/dev/rdsk/c0t0d0s0 


Reboot the operating environment. 

# reboot 

OR:

>>>>> Method 2:

#newfs -N /dev/rdsk/<device>  (like c0t0d0s7)

it will generate the identical superblock.

then run.......

#fsck -o b=535952 /dev/rdsk/<device> (like c0t0d0s7)


OR:

>>>>>>> Method 3:

Restore a Bad Superblock (Solaris 8,9 and 10)
February 25, 2008 by sun4u 


Become superuser or assume an equivalent role. 
Determine whether the bad superblock is in the root (/), /usr, or /var file system and select one of
the following:

If the bad superblock is in either the root (/), /usr, or /var file system, 

then boot from the network or a locally connected CD.

From a locally-connected CD, use the following command:
ok boot cdrom -s

From the network where a boot or install server is already setup, use the following command:
ok boot net -s

If the bad superblock is not in either the root (/), /usr, /var file system, change to a directory
outside the damaged file system and unmount the file system.

# umount /mount-point

Caution � Be sure to use the newfs -N in the next step. If you omit the -N option, you will destroy
all of the data in the file system and replace it with an empty file system.

Display the superblock values by using the newfs -N command. 
# newfs -N /dev/rdsk/device-name

Provide an alternate superblock by using the fsck command. 
# fsck-F ufs -o b=block-number /dev/rdsk/device-name

The fsck command uses the alternate superblock you specify to restore the primary superblock. You
can always try 32 as an alternate block. Or, use any of the alternate blocks shown by the newfs -N
command.

 
Restoring a Bad Superblock (Solaris 8, 9, and 10 Releases)
The following example shows how to restore the superblock copy 5264.

# newfs -N /dev/rdsk/c0t3d0s7
/dev/rdsk/c0t3d0s7: 163944 sectors in 506 cylinders of 9 tracks, 36 sectors
83.9MB in 32 cyl groups (16 c/g, 2.65MB/g, 1216 i/g)
super-block backups (for fsck -b #) at:
32, 5264, 10496, 15728, 20960, 26192, 31424, 36656, 41888,
47120, 52352, 57584, 62816, 68048, 73280, 78512, 82976, 88208,
93440, 98672, 103904, 109136, 114368, 119600, 124832, 130064, 135296,
140528, 145760, 150992, 156224, 161456,

# fsck-F ufs -o b=5264 /dev/rdsk/c0t3d0s7
Alternate superblock location: 5264.
** /dev/rdsk/c0t3d0s7
** Last Mounted on
** Phase 1- Check Blocks and Sizes
** Phase 2 - Check Pathnames
** Phase 3 - Check Connectivity
** Phase 4 - Check Reference Counts
** Phase 5 - Check Cyl groups
36 files, 867 used, 75712 free (16 frags, 9462 blocks, 0.0% fragmentation)
***** FILE SYSTEM WAS MODIFIED *****
#


----------------------------------------------------------------------------------------
Note 1.4         : A superblock issue
Works on OS      : Linux ext2 filesystem
probable message : probably fsck gives you a message
disks            : local disks, most likely not SAN
----------------------------------------------------------------------------------------


DAMAGED SUPERBLOCK


If a filesystem check fails and returns the error message �Damaged Superblock� you're lost . . . . . . . 
or not ?
Well, not really, the damaged �superblock� can be restored from a backup. There are several backups stored 
on the harddisk. But let me first have a go at explaining what a �superblock�is.

A superblock is located at position 0 of every partition, contains vital information about the filesystem 
and is needed at a filesystem check.

The information stored in the superblock are about what sort of fiesystem is used, the I-Node counts, 
block counts, free blocks and I-Nodes, the numer of times the filesystem was mounted, date of the 
last filesystem check and the first I-Node where / is located.

Thus, a damaged superblock means that the filesystem check will fail. 

Our luck is that there are backups of the superblock located on several positions and we can restore 
them with a simple command.

The usual ( and only ) positions are: 8193, 32768, 98304, 163840, 229376 and 294912. ( 8193 in many cases 
only on older systems, 32768 is the most current position for the first backup )
You can check this out and have a lot more info about a particular partition you have on your HD by:

  
# dumpe2fs /dev/hda5 

You will see that the primary superblock is located at position 0, and the first backup on position 32768.
O.K. let�s get serious now, suppose you get a �Damaged Superblock� error message at filesystem check 
( after a power failure ) and you get a root-prompt in a recovery console, then you give the command:


# e2fsck -b 32768 /dev/hda5 


don�t try this on a mounted filesystem

It will then check the filesystem with the information stored in that backup superblock and if the check 
was successful it will restore the backup to position 0.
Now imagine the backup at position 32768 was damaged too . . . then you just try again with the backup 
stored at position 98304, and 163840, and 229376 etc. etc. until you find an undamaged backup  
( there are five backups so if at least one of those five is okay it�s bingo ! )

So next time don�t panic . . just get the paper where you printed out this Tip and give the magic command
 
# e2fsck -b 32768 /dev/hda5  


----------------------------------------------------------------------------------------
Note 1.5         : Root filesystem full or nearly full
Works on OS      : most unixes
----------------------------------------------------------------------------------------


Always take care that the "/" root filesystem does not get near 100% full.

Potential problems

1. Some systems will not boot anymore in the normal multi-user way
2. On many systems new logons are not possible anymore
3. Some apps write or create unamed pipes "somewhere" in the root fs: they may stall or even crash
   
Remarks on 2:

This is caused by a full file system and the system has no space
to write its utmpx (login info) entry.

To get around this condition the system must be booted up
into single user mode, or you may need to boot from CDROM or from network etc..
Then you might be able to clear logfiles under /var/..
Or just increase the / filesystem with some additional space.


##############################################################

SECTION 9: GENERIC: HOW TO REMOVE A WEIRD FILE:

##############################################################

----------------------------------------------------------------------------------------
Note 2.1         : You cannot rm a file in the "normal" way, or
                   How to Delete or Remove Files With Inode Number
Works on OS      : all unix
----------------------------------------------------------------------------------------

>>>>>> Question: 

How can I remove a bizarre, irremovable file from a directory? I've tried every way of using 
/bin/rm and nothing works." 


>>>>>> Answer: 

In some rare cases a strangely-named file will show itself in your directory and appear to be 
un-removable with the rm command. Here is will the use of ls -li and find with its -inum [inode] 
primary does the job. 
Let's say that ls -l shows your irremovable as 

-rw-------  1 smith  smith  0 Feb  1 09:22 ?*?*P

Type: 

ls -li

to get the index node, or inode. 

153805 -rw-------  1 smith  smith  0 Feb  1 09:22 ?*?^P

The inode for this file is 153805. Use find -inum [inode] to make sure that the file is correctly identified. 


%  find -inum 153805 -print
./?*?*P

Here, we see that it is. Then used the -exec functionality to do the remove. . 
  
% find . -inum 153805 -print -exec /bin/rm {} \;

Note that if this strangely named file were not of zero-length, it might contain accidentally misplaced 
and wanted data. Then you might want to determine what kind of data the file contains and move the file 
to some temporary directory for further investigation, for example: 

% find . -inum 153805 -print -exec /bin/mv {} unknown.file \;

Will rename the file to unknown.file, so you can easily inspect it. 

Another way to remove strangely-named files is to use "ls -q" or "cat -v" to show the special characters, 
and then use shell's globbing mechanism to delete the file. 

$ ls
-????*'?
$ ls | cat -v
-^B^C?^?*'

$ rm ./-'^B'*           -- achieved by typing control-V control-B
$ ls


the argument given to rm is a judicious selection of glob wildcards (*'s) and sufficient control characters 
to uniquely identify the file. The leading "./" is useful when the file begins with a hyphen. 
These binary name files are caused by: 

* accidental cut-and-pastes to shell prompts - especially when you paste something of the form: "junk > garbage" 
because the shell creates the file "garbage" before trying to execute the command "junk" 

* filesystem corruption (in which case touching the filesystem any more can really stuff things up) 
If you discover that you have two files of the same name, one of the files probably has a bizarre 
(and unprintable) character in its name. Most probably, this unprintable character is a backspace. 

For example: 


    $ ls
    filename filename
    $ ls -q
    filename fl?ilename
    $ ls | cat -v
    filename
    fl^Hilename


----------------------------------------------------------------------------------------
Note 2.2         : You cannot rm a file in the "normal" way, or
                   How to Delete or Remove Files With Inode Number
Works on OS      : all unix
Same problem as noted in note 2.1.
----------------------------------------------------------------------------------------


An inode identifies the file and its attributes such as file size, owner, and so on. A unique inode number 
within the file system identifies each inode. But, why to delete file by an inode number? 
Sure, you can use rm command to delete file. Sometime accidentally you creates filename with control characters 
or characters which are unable to be input on a keyboard or special character such as ?, * ^ etc. 
Removing such special character filenames can be problem. Use following method to delete a file with strange characters in its name:

Please note that the procedure outlined below works with Solaris, FreeBSD, Linux, or any other Unixish oses out there:


Find out file inode 
First find out file inode number with any one of the following command:

stat {file-name}

OR 

ls -il {file-name}

Use find command to remove file:
Use find command as follows to find and remove a file:

find . -inum [inode-number] -exec rm -i {} \;

When prompted for confirmation, press Y to confirm removal of the file.

Let us try to delete file using inode number.

(a) Create a hard to delete file name:
$ cd /tmp
$ touch "\+Xy \+\8"
$ ls 
(b) Try to remove this file with rm command:
$ rm \+Xy \+\8

(c) Remove file by an inode number, but first find out the file inode number:
$ ls -ilOutput: 

781956 drwx------  3 viv viv 4096 2006-01-27 15:05 gconfd-viv
781964 drwx------  2 viv viv 4096 2006-01-27 15:05 keyring-pKracm
782049 srwxr-xr-x  1 viv viv    0 2006-01-27 15:05 mapping-viv
781939 drwx------  2 viv viv 4096 2006-01-27 15:31 orbit-viv
781922 drwx------  2 viv viv 4096 2006-01-27 15:05 ssh-cnaOtj4013
781882 drwx------  2 viv viv 4096 2006-01-27 15:05 ssh-SsCkUW4013
782263 -rw-r--r--  1 viv viv    0 2006-01-27 15:49 \+Xy \+\8Note: 782263 is inode number.

(d) Use find command to delete file by inode:
Find and remove file using find command, type the command as follows:
$ find . -inum 782263 -exec rm -i {} \;
Note you can also use add \ character before special character in filename to remove it directly so the command would be:

$ rm "\+Xy \+\8"

If you have file like name like name "2005/12/31" then no UNIX or Linux command can delete this file by name. 
Only method to delete such file is delete file by an inode number. Linux or UNIX never allows creating filename like 2005/12/31 
but if you are using NFS from MAC OS or Windows then it is possible to create a such file.

OR

read this thead:


Become superuser or assume an equivalent role. 
Determine whether the bad superblock is in the root (/), /usr, or /var file system and select one of
the following:

If the bad superblock is in either the root (/), /usr, or /var file system, 

then boot from the network or a locally connected CD.

From a locally-connected CD, use the following command:
ok boot cdrom -s

From the network where a boot or install server is already setup, use the following command:
ok boot net -s

If the bad superblock is not in either the root (/), /usr, /var file system, change to a directory
outside the damaged file system and unmount the file system.

# umount /mount-point

Caution � Be sure to use the newfs -N in the next step. If you omit the -N option, you will destroy
all of the data in the file system and replace it with an empty file system.

Display the superblock values by using the newfs -N command. 
# newfs -N /dev/rdsk/device-name

Provide an alternate superblock by using the fsck command. 
# fsck-F ufs -o b=block-number /dev/rdsk/device-name

The fsck command uses the alternate superblock you specify to restore the primary superblock. You
can always try 32 as an alternate block. Or, use any of the alternate blocks shown by the newfs -N
command.

 
Restoring a Bad Superblock (Solaris 8, 9, and 10 Releases)
The following example shows how to restore the superblock copy 5264.

# newfs -N /dev/rdsk/c0t3d0s7
/dev/rdsk/c0t3d0s7: 163944 sectors in 506 cylinders of 9 tracks, 36 sectors
83.9MB in 32 cyl groups (16 c/g, 2.65MB/g, 1216 i/g)
super-block backups (for fsck -b #) at:
32, 5264, 10496, 15728, 20960, 26192, 31424, 36656, 41888,
47120, 52352, 57584, 62816, 68048, 73280, 78512, 82976, 88208,
93440, 98672, 103904, 109136, 114368, 119600, 124832, 130064, 135296,
140528, 145760, 150992, 156224, 161456,

# fsck-F ufs -o b=5264 /dev/rdsk/c0t3d0s7
Alternate superblock location: 5264.
** /dev/rdsk/c0t3d0s7
** Last Mounted on
** Phase 1- Check Blocks and Sizes
** Phase 2 - Check Pathnames
** Phase 3 - Check Connectivity
** Phase 4 - Check Reference Counts
** Phase 5 - Check Cyl groups
36 files, 867 used, 75712 free (16 frags, 9462 blocks, 0.0% fragmentation)
***** FILE SYSTEM WAS MODIFIED *****
#


##############################################################

SECTION 10: GENERIC: HINTS ON UNDELETE A FILE:

##############################################################


---------------------------------------------------------------------------------------
Note 1:
----------------------------------------------------------------------------------------

http://www.cyberciti.biz/tips/linuxunix-recover-deleted-files.html

Using grep (traditional UNIX way) to recover files
Use following grep syntax:

grep -b 'search-text' /dev/partition > file.txt
OR

grep -a -B[size before] -A[size after] 'text' /dev/[your_partition] > file.txt

Where,

-i : Ignore case distinctions in both the PATTERN and the input files i.e. match both uppercase and lowercase character. 
-a : Process a binary file as if it were text 
-B Print number lines/size of leading context before matching lines. 
-A: Print number lines/size of trailing context after matching lines. 

To recover text file starting with "nixCraft" word on /dev/sda1 you can try following command:

# grep -i -a -B10 -A100 'nixCraft' /dev/sda1 > file.txt

Next use vi to see file.txt. This method is ONLY useful if deleted file is text file. 
If you are using ext2 file system, try out recover command. .


----------------------------------------------------------------------------------------
Note 2:
----------------------------------------------------------------------------------------

Bring back deleted files with lsof
By Michael Stutz on November 16, 2006 (8:00:00 AM) 

Briefly, a file as it appears somewhere on a Linux filesystem is actually just a link to an inode, 
which contains all of the file's properties, such as permissions and ownership, as well as the addresses 
of the data blocks where the file's content is stored on disk. When you rm a file, you're removing the link 
that points to its inode, but not the inode itself; other processes (such as your audio player) might still 
have it open. It's only after they're through and all links are removed that an inode and the data blocks 
it pointed to are made available for writing.

This delay is your key to a quick and happy recovery: if a process still has the file open, the data's there 
somewhere, even though according to the directory listing the file already appears to be gone.

This is where the Linux process pseudo-filesystem, the /proc directory, comes into play. Every process on 
the system has a directory here with its name on it, inside of which lies many things -- 
including an fd ("file descriptor") subdirectory containing links to all files that the process has open. 
Even if a file has been removed from the filesystem, a copy of the data will be right here:

/proc/process id/fd/file descriptor 

To know where to go, you need to get the id of the process that has the file open, and the file descriptor. 
These you get with lsof, whose name means "list open files." (It actually does a whole lot more than this 
and is so useful that almost every system has it installed. If yours isn't one of them, you can grab the latest 
version straight from its author.)

Once you get that information from lsof, you can just copy the data out of /proc and call it a day.

This whole thing is best demonstrated with a live example. First, create a text file that you can delete 
and then bring back:

$ man lsof | col -b > myfile 

Then have a look at the contents of the file that you just created:

$ less myfile 

You should see a plaintext version of lsof's huge man page looking out at you, courtesy of less.

Now press Ctrl-Z to suspend less. Back at a shell prompt make sure your file is still there:

$ ls -l myfile
-rw-r--r--  1 jimbo jimbo 114383 Oct 31 16:14 myfile
$ stat myfile
  File: `myfile'
  Size: 114383          Blocks: 232        IO Block: 4096   regular file
Device: 341h/833d       Inode: 1276722     Links: 1
Access: (0644/-rw-r--r--)  Uid: ( 1010/    jimbo)   Gid: ( 1010/    jimbo)
Access: 2006-10-31 16:15:08.423715488 -0400
Modify: 2006-10-31 16:14:52.684417746 -0400
Change: 2006-10-31 16:14:52.684417746 -0400
Yup, it's there all right. OK, go ahead and oops it:

$ rm myfile
$ ls -l myfile
ls: myfile: No such file or directory
$ stat myfile
stat: cannot stat `myfile': No such file or directory
$
It's gone.

At this point, you must not allow the process still using the file to exit, because once that happens, 
the file will really be gone and your troubles will intensify. Your background less process in this walkthrough 
isn't going anywhere (unless you kill the process or exit the shell), but if this were a video or sound file that 
you were playing, the first thing to do at the point where you realize you deleted the file would be to 
immediately pause the application playback, or otherwise freeze the process, so that it doesn't eventually 
stop playing the file and exit. 

Now to bring the file back. First see what lsof has to say about it:

$ lsof | grep myfile
less      4158    jimbo    4r      REG       3,65   114383   1276722 /home/jimbo/myfile (deleted)
The first column gives you the name of the command associated with the process, the second column is the 
process id, and the number in the fourth column is the file descriptor (the "r" means that it's a regular file). 
Now you know that process 4158 still has the file open, and you know the file descriptor, 4. That's everything 
you have to know to copy it out of /proc.

You might think that using the -a flag with cp is the right thing to do here, since you're restoring the file -- 
but it's actually important that you don't do that. Otherwise, instead of copying the literal data contained 
in the file, you'll be copying a now-broken symbolic link to the file as it once was listed in its original directory:

$ ls -l /proc/4158/fd/4
lr-x------  1 jimbo jimbo 64 Oct 31 16:18 /proc/4158/fd/4 -> /home/jimbo/myfile (deleted)
$ cp -a /proc/4158/fd/4 myfile.wrong
$ ls -l myfile.wrong
lrwxr-xr-x  1 jimbo jimbo 24 Oct 31 16:22 myfile.wrong -> /home/jimbo/myfile (deleted)
$ file myfile.wrong
myfile.wrong: broken symbolic link to `/home/jimbo/myfile (deleted)'
$ file /proc/4158/fd/4
/proc/4158/fd/4: broken symbolic link to `/home/jimbo/myfile (deleted)'
So instead of all that, just a plain old cp will do the trick:

$ cp /proc/4158/fd/4 myfile.saved 

And finally, verify that you've done good:

$ ls -l myfile.saved
-rw-r--r--  1 jimbo jimbo 114383 Oct 31 16:25 myfile.saved
$ man lsof | col -b > myfile.new
$ cmp myfile.saved myfile.new
No complaints from cmp -- your restoration is the real deal.

Incidentally, there are a lot of useful things you can do with lsof in addition to rescuing lost files.


----------------------------------------------------------------------------------------
Note 3:
----------------------------------------------------------------------------------------

Recover Deleted Files
Files on Unix may be deleted, but still held open by another process. While most Unix would require a utility to read a file 
by the filesystem and inode(5) number, the special /proc filesystem on Linux allows the recovery of deleted but held open files:

Use lsof(1) to discover the deleted file, and record the Process ID (PID) and File Descriptor (FD) open to this file. 
Recover the file: 

cp /proc/$PID/fd/$FD /var/tmp/recovered 

The deleted file should appear as a broken symbolic link under the /proc/$PID/fd directory. 
Despite this, /proc still allows the file to be copied elsewhere. For related information, see how to debug Unix systems.


----------------------------------------------------------------------------------------
Note 4:
----------------------------------------------------------------------------------------

HOWTO recover deleted files on an Linux ext3 file system

Please see:

http://www.xs4all.nl/~carlo17/howto/undelete_ext3.html

Or see

Tom Pycke, Recovering Files in Linux, available at www.recover.source.net/linux


For Linux ext2 file system:

1. R-Linux undelete utility: 
Take a look here:
http://3d2f.com/tags/undelete/recover/unix/

2. The ext2 file system has an addon program called e2undel[1] which allows file undeletion, although the similar ext3 file system 
does not support that kind of undeletion.

3. Also, mabe the following "unrm" can be of help on Linux:
http://freshmeat.net/projects/unrm/


Another "unrm" pointer:
http://staff.washington.edu/dittrich/talks/blackhat/tct/man/man1/unrm.1.html


----------------------------------------------------------------------------------------
Note 5:
----------------------------------------------------------------------------------------

Possible AIX undelete tool:

http://www.compunix.com/products.html
http://www.compunix.com/prod/analyse.html
http://www.compunix.com/eval/list.html


For AIX and JFS:

http://www.phase2.net/2008/03/04/aix-recovering-a-deleted-file-undelete/

When you are really good with the fsdb tool (included in AIX), you might be able
to recover files yourself. See another note in this document for an example of using fsdb.

See man page for fsdb or 
http://publib.boulder.ibm.com/infocenter/pseries/v5r3/index.jsp?topic=/com.ibm.aix.cmds/doc/aixcmds2/fsdb.htm


----------------------------------------------------------------------------------------
Note 6:
----------------------------------------------------------------------------------------

1. Solaris Recovery:

-- Kernel Recovery for Solaris Sparc
Kernel Recovery for Solaris Sparc is a do-it-yourself data recovery software. Software performs read-only scan, 
which helps you to recover your important data in minutes. File System supported for recovery is UFS File system.

http://www.download.com/Kernel-Recovery-for-Solaris-Sparc/3000-2248_4-10578170.html
http://www.download3k.com/Press-Launch-of-Kernel-Recovery-for-Solaris-SPARC.html
http://www.tucows.com/preview/505583
http://www.programurl.com/kernel-recovery-for-solaris-sparc.htm

Nucleus Technologies.com: http://www.nucleustechnologies.com 

-- Other Solaris Data Recovery Software:

http://solaris-data-recovery-software.qarchive.org/


2. R-Tools technology: Undelete tool for Linux and Solaris:

http://www.data-recovery-software.net/


----------------------------------------------------------------------------------------
Note 7:
----------------------------------------------------------------------------------------

For AIX and JFS filesystem: an undelete program
Not tested by writer of this document:


/*****************************************************************************
 * rsb.c - Read Super Block. Allows a jfs superblock to be dumped, inode
 * table to be listed or specific inodes data pointers to be chased and
 * dumped to standard out (undelete).
 *
 * Phil Gibbs - Trinem Consulting (pgibbs@trinem.co.uk)
 ****************************************************************************/
#include <stdio.h>
#include <jfs/filsys.h>
#include <jfs/ino.h>
#include <sys/types.h>
#include <pwd.h>
#include <grp.h>
#include <unistd.h>
#include <time.h>

#define FOUR_MB		(1024*1024*4)
#define THIRTY_TWO_KB	(1024*32)

extern int optind;
extern int Optopt;
extern int Opterr;
extern char *optarg;

void PrintSep()
{
	int k=80;

	while (k)
	{
		putchar('-');
		k--;
	}
	putchar('\n');
}

char *UserName(uid_t uid)
{
char replystr[10];
struct passwd *res;

res=getpwuid(uid);
if (res->pw_name[0])
{
	return res->pw_name;
}
else
{
	sprintf(replystr,"%d",uid);
	return replystr;
}
}

char *GroupName(gid_t gid)
{
struct group *res;
res=getgrgid(gid);
return res->gr_name;
}


ulong NumberOfInodes(struct superblock *sb)
{
	ulong MaxInodes;
	ulong TotalFrags;

	if (sb->s_version==fsv3pvers)
	{
		TotalFrags=(sb->s_fsize*512)/sb->s_fragsize;
		MaxInodes=(TotalFrags/sb->s_agsize)*sb->s_iagsize;
	}
	else
	{
		MaxInodes=(sb->s_fsize*512)/sb->s_bsize;
	}
	return MaxInodes;
}


void AnalyseSuperBlock(struct superblock *sb)
{
	ulong TotalFrags;

	PrintSep();
	printf("SuperBlock Details:\n-------------------\n");
	printf("File system size:  %ld x 512 bytes (%ld Mb)\n",
				sb->s_fsize,
				(sb->s_fsize*512)/(1024*1024));
	printf("Block size:        %d bytes\n",sb->s_bsize);
	printf("Flags:             ");
	switch (sb->s_fmod)
	{
		case (char)FM_CLEAN:
			break;
		case (char)FM_MOUNT:
			printf("mounted ");
			break;
		case (char)FM_MDIRTY:
			printf("mounted dirty ");
			break;
		case (char)FM_LOGREDO:
			printf("log redo failed ");
			break;
		default:
			printf("Unknown flag ");
			break;
	}
	if (sb->s_ronly) printf("(read-only)");
	printf("\n");
	printf("Last SB update at: %s",ctime(&(sb->s_time)));
	printf("Version:           %s\n",
	sb->s_version?"1 - fsv3pvers":"0 - fsv3vers");
	printf("\n");
	if (sb->s_version==fsv3pvers)
	{
		TotalFrags=(sb->s_fsize*512)/sb->s_fragsize;
		printf("Fragment size:     %5d         ",sb->s_fragsize);
		printf("inodes per alloc:  %8d\n",sb->s_iagsize);
		printf("Frags per alloc:   %5d         ",sb->s_agsize);
		printf("Total Fragments:   %8d\n",TotalFrags);
		printf("Total Alloc Grps:  %5d         ",
						TotalFrags/sb->s_agsize);
		printf("Max inodes:        %8ld\n",NumberOfInodes(sb));
	}
	else
	{
		printf("Total Alloc Grps:  %5d         ",
				(sb->s_fsize*512)/sb->s_agsize);
		printf("inodes per alloc:  %8d\n",sb->s_agsize);
		printf("Max inodes:      %8ld\n",NumberOfInodes(sb));
	}
	PrintSep();
}

void ReadInode(	FILE *in,
		ulong StartInum,
		struct dinode *inode,
		ulong InodesPerAllocBlock,
		ulong AllocBlockSize)
{
	off_t			SeekPoint;
	long			BlockNumber;
	int			OffsetInBlock;
	static struct dinode	I_NODES[PAGESIZE/DILENGTH];
	ulong			AllocBlock;
	ulong			inum;
	static off_t		LastSeekPoint=-1;

	AllocBlock=(StartInum/InodesPerAllocBlock);
	BlockNumber=(StartInum-(AllocBlock*InodesPerAllocBlock))/
			(PAGESIZE/DILENGTH);
	OffsetInBlock=(StartInum-(AllocBlock*InodesPerAllocBlock))-
			(BlockNumber*(PAGESIZE/DILENGTH));
	SeekPoint=(AllocBlock)?
		(BlockNumber*PAGESIZE)+(AllocBlock*AllocBlockSize):
		(BlockNumber*PAGESIZE)+(INODES_B*PAGESIZE);
	if (SeekPoint!=LastSeekPoint)
	{
		sync();
		fseek(in,SeekPoint,SEEK_SET);
		fread(I_NODES,PAGESIZE,1,in);
		LastSeekPoint=SeekPoint;
	}
	*inode=I_NODES[OffsetInBlock];
}

void DumpInodeContents(	long	inode,
			FILE	*in,
			ulong	InodesPerAllocBlock,
			ulong	AllocBlockSize,
			ulong	Mask,
			ulong	Multiplier)
{
	struct dinode		DiskInode;
	ulong			SeekPoint;
	char			Buffer[4096];
	ulong			FileSize;
	int			k;
	int			BytesToRead;
	ulong			*DiskPointers;
	int			NumPtrs;

	ReadInode(	in,
			inode,
			&DiskInode,
			InodesPerAllocBlock,
			AllocBlockSize);
	FileSize=DiskInode.di_size;

	if (FileSize>FOUR_MB)
	{
		/* Double indirect mapping */
	}
	else
	if (FileSize>THIRTY_TWO_KB)
	{
		/* Indirect mapping */
		SeekPoint=DiskInode.di_rindirect & Mask;
		SeekPoint=SeekPoint*Multiplier;
		DiskPointers=(ulong *)malloc(1024*sizeof(ulong));
		fseek(in,SeekPoint,SEEK_SET);
		fread(DiskPointers,1024*sizeof(ulong),1,in);
		NumPtrs=1024;
	}
	else
	{
		/* Direct Mapping */
		DiskPointers=&(DiskInode.di_rdaddr[0]);
		NumPtrs=8;
	}

	for (k=0;k<=NumPtrs && FileSize;k++)
	{
		SeekPoint=(DiskPointers[k] & Mask);
		SeekPoint=SeekPoint*Multiplier;

		BytesToRead=(FileSize>sizeof(Buffer))?sizeof(Buffer):FileSize;
		fseek(in,SeekPoint,SEEK_SET);
		fread(Buffer,BytesToRead,1,in);
		FileSize=FileSize-BytesToRead;
		write(1,Buffer,BytesToRead);
	}
}

void DumpInodeList(	FILE	*in,
			ulong	MaxInodes,
			ulong	InodesPerAllocBlock,
			ulong	AllocBlockSize)
{
	long			inode;
	struct dinode		DiskInode;
	struct tm		*TimeStruct;

	printf("   Inode Links     User    Group     Size    ModDate\n");
	printf("-------- ----- -------- -------- --------    -------\n");
	for (inode=0;inode<=MaxInodes;inode++)
	{
		ReadInode(	in,
				inode,
				&DiskInode,
				InodesPerAllocBlock,
				AllocBlockSize);
		if (DiskInode.di_mtime)
		{
			TimeStruct=localtime((long *)&DiskInode.di_mtime);
			printf("%8d %5d %8s %8s %8d %02d/%02d/%4d\n",
				inode,
				DiskInode.di_nlink,
				UserName(DiskInode.di_uid),
				GroupName(DiskInode.di_gid),
				DiskInode.di_size,
				TimeStruct->tm_mday,
				TimeStruct->tm_mon,
				TimeStruct->tm_year+1900);
		}
	}
}

void ExitWithUsageMessage()
{
	fprintf(stderr,"USAGE: rsb [-i inode] [-d] [-s] <block_device>\n");
	exit(1);
}

main(int argc,char **argv)
{
	FILE			*in;
	struct superblock	SuperBlock;
	short			Valid;
	long			inode=0;
	struct dinode		DiskInode;
	ulong			AllocBlockSize;
	ulong			InodesPerAllocBlock;
	ulong			MaxInodes;
	ulong			Mask;
	ulong			Multiplier;
	int			option;
	int			DumpSuperBlockFlag=0;
	int			DumpFlag=0;

	while ((option=getopt(argc,argv,"i:ds")) != EOF)
	{
		switch(option)
		{
			case 'i':
				/* Inode specified */
				inode=atol(optarg);
				break;
			case 'd':
				/* Dump flag */
				DumpFlag=1;
				break;
			case 's':
				/* List Superblock flag */
				DumpSuperBlockFlag=1;
				break;
			default:
				break;
		}
	}

	if (strlen(argv[optind])) in=fopen(argv[optind],"r");
	else ExitWithUsageMessage();

	if (in)
	{
		fseek(in,SUPER_B*PAGESIZE,SEEK_SET);
		fread(&SuperBlock,sizeof(SuperBlock),1,in);
		switch (SuperBlock.s_version)
		{
			case fsv3pvers:
				Valid=!strncmp(SuperBlock.s_magic,fsv3pmagic,4);
				InodesPerAllocBlock=SuperBlock.s_iagsize;
				AllocBlockSize=
				SuperBlock.s_fragsize*SuperBlock.s_agsize;
				Multiplier=SuperBlock.s_fragsize;
				Mask=0x3ffffff;
				break;
			case fsv3vers:
				Valid=!strncmp(SuperBlock.s_magic,fsv3magic,4);
				InodesPerAllocBlock=SuperBlock.s_agsize;
				AllocBlockSize=SuperBlock.s_agsize*PAGESIZE;
				Multiplier=SuperBlock.s_bsize;
				Mask=0xfffffff;
				break;
			default:
				Valid=0;
				break;
		}
		if (Valid)
		{
			if (DumpSuperBlockFlag==1)
			{
				AnalyseSuperBlock(&SuperBlock);
			}
			MaxInodes=NumberOfInodes(&SuperBlock);
			if (DumpFlag==1)
			{
				if (inode)
				DumpInodeContents(inode,in,InodesPerAllocBlock,AllocBlockSize,Mask,Multiplier);
				else
				DumpInodeList(in,MaxInodes,InodesPerAllocBlock,AllocBlockSize);
			}
		}
		else
		{
			fprintf(stderr,"Superblock - bad magic number\n");
			exit(1);
		}
	}
	else
	{
		fprintf(stderr,"couldn't open ");
		perror(argv[optind]);
		exit(1);
	}
}


----------------------------------------------------------------------------------------
Note 8:
----------------------------------------------------------------------------------------

http://wiki.yak.net/592


HOWTO rescue deleted Linux files | undelete | unremove | unrm | rm -v
Here's how we rescued a LaTeX *.tex file that was accidentally removed on a Linux box. 


Stop doing anything else on the system. The idea is to use the disk as little as possible. (We stopped short of killing idle daemons, 
because we didn't want them scribbling stuff in log files. ) 

Know the first few bytes of the file you want. Hopefully they are fairly unique. The LaTeX document we wanted began with the characters 
"\document", so we used that pattern. 

Write a program that will read each sector from the raw partition (you must be root) (assuming 512 byte sectors is safest) 
and see if it begins with the pattern. If not, it loops and reads the next 512 bytes... If it finds it, it saves that sector and some 
fixed amount of following sectors (we did 600 more sectors, which is 300 KBytes) in a rescue file. Save probably twice as long a file as you think 
you're looking for. Save them to an extra partition -- or invoke "scp" or something to save them on another machine. 
(Usually ext2 & ext3 store files contiguously on disk -- especially if they are not too big & are written all at once.) 

The following TCL script did the job. Make it open the exact partition you want to scan. It needs another partition to write the rescue files to. 
grope.tcl 

 #
 #  This is in the language Tcl.
 #  Usage:
 #      tclsh scriptname < /dev/hda1   (the partition with the deleted file)
 #
 #  Notice:  change the MOUNT below to a different partition!
 #
 #  Also fix the "string match" pattern -- we used \document for a LaTeX document.
 #
 #  Occasinally sector numbers are written out, to indicate progress.
 #       ( 1 sector == 512 bytes == 0.5KBytes )


 set i 0
 set n 0
 fconfigure stdin -translation binary -encoding binary
 while true {
 	set x [read stdin 512 ]
 	if {$x==""} break
 
 	if {[string match {\\document*} $x ]} {
 		incr i
 		puts stderr "SAVING $i"
 		set f [open /WRITABLE_MOUNT_TO_SAVE_FILES_IN_GOES_HERE/rescue.$i w]
 		fconfigure $f -translation binary -encoding binary
 		puts -nonewline $f $x 
 		puts -nonewline $f [read stdin [expr 600*512] ]
 		close $f
 	}
 
 	incr n
 	if { ($n % 200000)==0 } { puts -nonewline stderr $n. }
 } 


Use "less" to examine the rescue files to see if you can find your data. Also the "strings" command is very good about 
extracting ASCII text portions. 

Even better, if you have physical access to the machine, shut down the system IMMEDIATELY and physically install its disk 
as an extra drive in another unix box. Do your scanning of the raw disk from there. (In our recent case, we didn't have access to this box.) 
Or boot a KNOPPIX CD (which will not write to any partitions unless you specifically mount them writeable from a root shell.) 

I've also used this kind of technique to rescue JPEG files from a digital camera's Compact Flash with a corrupted FAT file system. 
We wrote a program that started a new rescue file every time it found "JFIF" as the first 4 bytes of a sector, even if it was still 
saving the previous rescue file. We completely rescued about 3/4 of the images this way, and fragments of more. 

Obviously the data you are rescuing must be important enough to warrent this much trouble with no guarentee of successfull results. 

Your file could always have been overwritten, or it could be fragmented so you don't find the pieces. But the couple of times I've had 
to do this (for someone else's data!) we've had pretty good success. 


----------------------------------------------------------------------------------------
Note 9: special case: text file edited with vi
----------------------------------------------------------------------------------------

If the file that was deleted, was a text file, and recently edited by vi, then there still might be a version 
available on your system.

On most unix systems, vi keep tracks of former versions.
Check

/var/preserve/username (or similar directory: vi -r )

or a similar directory, depending on the unix version, where there still might exist a recent
version of your text file.


----------------------------------------------------------------------------------------
Note 10:
----------------------------------------------------------------------------------------

Subject: Undelete of a file on AIX, using fsdb.

Remark   : Quite an elaborate procedure but it seems to work for small files.
Important: Be carefull in using fsdb.


Document:

http://www.phase2.net/2008/03/04/aix-recovering-a-deleted-file-undelete/


-- Contents repeated here:

This is a document I wrote a while back for work that I thought I would release in hopes that some people out there would find it useful.

Preferably, you have a backup of the file system that you can use. If not, the filesystem you are about to try to to recover a file on 
must meet these requirements:

No new files have been created on the filesystem. 
No files have been extended. 
The filesystem is able to be unmounted. 
It is a JFS filesystem, not JFS2 
If so, then please, drink a few more beers and continue, but before you do�

BACKUP THE CURRENT FILESYSTEM!

Also, note that if you are dealing with a directory that has been deleted and would like to recover both the directory 
and the files under that directory, you should try Recovering a Deleted Directory ( a document I have yet to post.. ). 
It follows many of the same steps, but has some very important differences. Do not try and use this procedure to recover 
deleted directories and the files that were contained within them. You will mess up.

Before we begin, I need to note a few things. I take no responsibility if this screws up your system. Use this at your own risk. 
Also, the example presented here is an actual representation of me recovering a deleted file, this is not just made up numbers. 
Also, this only works on jfs filesystems, not jfs2. The jfs2 fsdb is much different and I haven�t had a chance to play with it 
to determine the proper way of doing this.

Now that I�ve said that, we can begin. We�ll use an example directory with some example files. Our directory is called 
/test and our filesystem is testlv, otherwise known as /dev/testlv. In our example, our Junior System Admin, Myron, 
has accidentally deleted a perl script called testfile.pl and needs to recover it.

Note: If you are performing this operation on a filesystem while in maintenance mode, do NOT use option 1 when asked on how to mount 
the filesystems. ALWAYS use option 2, which specifies to start a shell before mounting the filesystems. Otherwise, the system will force 
a fsck -y on the filesystem and delete your files.

Step 1.
First, run this command:

ls -id /testOutput:

[test:/]# ls -id /test
    2 /test/

This informs us that the inode for the directory /test is 2. Record this for future use.

Step 2.
Unmount /test

umount /test

Output: None

We must unmount the directory. We don�t want anyone to try and use it while we are attempting to restore the file.

Step 3
Now we�ll start up the filesystem debugger.

fsdb /dev/testlv

Output:

[test:/]# fsdb /dev/testlv

File System:                           /dev/testlv

File System Size:                         193200128  (512 byte blocks)
Disk Map Size:                                 1660  (4K blocks)
Inode Map Size:                                 831  (4K blocks)
Fragment Size:                                 4096  (bytes)
Allocation Group Size:                        16384  (fragments)
Inodes per Allocation Group:                   8192
Total Inodes:                              12075008
Total Fragments:                           24150016

This starts the filesystem debugger on our testlv filesystem.

Step 4
Now we look at our inode number.

2i

Output:

2i
i#:      2  md: d-g-rwxr-xr-x  ln:    4  uid:    3  gid:    3
szh:        0  szl:      512  (actual size:      512)
a0: 0x25d       a1: 0x00        a2: 0x00        a3: 0x00
a4: 0x00        a5: 0x00        a6: 0x00        a7: 0x00
at: Mon Jan 10 11:19:17 2005
mt: Mon Jan 10 11:11:26 2005
ct: Mon Jan 10 11:11:26 2005

The INODE in the command is the inode number we recorded in step #1. This will display the inode information for the directory. 
The field a0 contains the block number of the directory. The following steps assume only field a0 is used. If a value appears in a1, etc, 
it may be necessary to repeat steps #5 and #6 for each block until the file to be recovered is found.

Step 5
Move to the block

a0b

Output:

a0b
0x000025d000  :  0x00000000 (0)

This moves to the block pointed to by field �a0? of this inode.

Step 6
Now we need to print out some data.

p256c

Output:

p256c

0x000025d000:   \0 \0 \0 \? \0 \? \0 \? .  \0 \0 \0 \0 \0 \0 \?
0x000025d010:   \0 \? \0 \? .  .  \0 \0 \0 \0 \0 \? \0 \? \0 \n
0x000025d020:   l  o  s  t  +  f  o  u  n  d  \0 \0 \0 \0 \0 \?
0x000025d030:   \0 $  \0 \? m  e  m  _  r  e  p  o  r  t  _  2
0x000025d040:   0  0  4  1  1  0  1  .  d  m  p  .  g  z  \0 \0
0x000025d050:   \0 \0 \0 \? \0 \s \0 \? o  r  a  s  c  r  a  t
0x000025d060:   c  h  .  c  p  i  o  .  g  z  \0 \0 \0 \0 \0 \?
0x000025d070:   \0 (  \0 \s u  s  e  r  _  a  c  t  i  v  i  t
0x000025d080:   y  _  2  0  0  4  1  1  0  1  .  d  m  p  .  g
0x000025d090:   z  \0 \0 \0 \0 \0 \0 \? \0 ,  \0 !  u  s  e  r
0x000025d0a0:   _  a  c  t  i  v  i  t  y  _  d  e  t  _  2  0
0x000025d0b0:   0  4  1  1  0  1  .  d  m  p  .  g  z  \0 \0 \0
0x000025d0c0:   \0 \? `  \0 \? @  \0 \? E  C  R  1  X  \0 \0 \0
0x000025d0d0:   \0 \0 \0 \? \? 0  \0 \? t  e  s  t  f  i  l  e
0x000025d0e0:   .  p  l  \0 \?    \0 \a t  e  s  t  d  i  r  \0
0x000025d0f0:   j  d  u  c  k  o  .  t  x  t  \0 \0 \0 \0 \0 \?

The command p256c stands for �print 256 bytes in character mode�. You could type �p128c� and it would print 128 bytes in character mode 
and so on. The beginning left column is the address of the first character in that row. The important thing in this output is 
to find which line the file to be recovered is on. Our file ( testfile.pl ) is located on line 0�000025d0d0. Next, we have to find 
the address of the first character of our filename. To do this, starting at 0, count in hexidecimal until you reach the first character 
of the filename. In our example, the �t� of testfile.pl is at address 0�000025d0d8. Record this address.

If you cannot find your filename here, issue the command again. It will print the next 256 bytes in character mode. 
Do this until you find your filename.

Here�s a layout to help you in figuring out how we got the address:

Address:        0  1  2  3  4  5  6  7  8  9  A  B  C  D  E  F
0�000025d0d0:   \0 \0 \0 \? \? 0  \0 \? t  e  s  t  f  i  l  eStep 7

Reset our position.

a0b

Output:

a0b
0x000025d000  :  0x00000000 (0)

This resets our position back to the beginning of the a0 block. This is necessary whenever you want to reprint out 
the byte data. Remember, however, that if you had to use the �p� command many times to find your filename, you will probably 
have to use it many times each time you reset back to the beginning.

Step 8
Print our data in decimal

p256e

Output:

p256e

0x000025d000:         0       2      12       1   11776       0       0       2
0x000025d010:        12       2   11822       0       0      16      20      10
0x000025d020:     27759   29556   11110   28533   28260       0       0      17
0x000025d030:        36      26   28005   27999   29285   28783   29300   24370
0x000025d040:     12336   13361   12592   12590   25709   28718   26490       0
0x000025d050:         0      18      28      18   28530   24947   25458   24948
0x000025d060:     25448   11875   28777   28462   26490       0       0      19
0x000025d070:        40      29   30067   25970   24417   25460   26998   26996
0x000025d080:     31071   12848   12340   12593   12337   11876   28016   11879
0x000025d090:     31232       0       0      20      44      33   30067   25970
0x000025d0a0:     24417   25460   26998   26996   31071   25701   29791   12848
0x000025d0b0:     12340   12593   12337   11876   28016   11879   31232       0
0x000025d0c0:        18   24576     320       5   17731   21041   22528       0
0x000025d0d0:         0      21     304      11   29797   29556   26217   27749
0x000025d0e0:     11888   27648     288       7   29797   29556   25705   29184
0x000025d0f0:     27236   30051   27503   11892   30836       0       0      23
0x000025d100:       260      16   27233   28005   29549   24947   29537   29281
0x000025d110:     11892   30836       0       0       0       0       0       0
0x000025d120:         0       0       0       0       0       0       0       0
0x000025d130:         0       0       0       0       0       0       0       0
0x000025d140:         0       0       0       0       0       0       0       0

The command �p256e� stands for �print 256 bytes in decimal word format�. This output can be helpful and confusing at the same time. 
First, find the beginning address that our file name is on. In our example, this was 0�000025d0d0. The line in decimal format reads:

0x000025d0d0:         0      21     304      11   29797   29556   26217   27749

For each file, assume the following:

   {ADDRESS}:  x    x    x    x    x    x    x    x    x
               |    |    |    |    |---- filename -----|
     inode # --+----+    |    |
                         |    +-- filename length
         record LENGTH --+

Note that the inode # may begin on any part of the line. The reason we print the data in decimal format is to help us 
determine where in the line the inode number is. There are several ways to help you do this, here are some:

Count the number of characters in your filename, then try and find that number in our address line. 
( eg: There are 11 characters in the filename �testfile.pl�. ) You can see on our line there is a matching number 11. 
Recount to the address 0�000025d0d8, assuming each column represents two numbers. The first column is 0 and 1. The second column is 2 and 3, 
then 4 and 5, etc. When you reach the column that matches your address, go back one column. The number in this column should match up 
with your filename length. Unless, of course, your filename is over 255 characters. 
Once you are sure you have the the correct column for your filename length, you are going to count back three more columns. 
This should put at the first column of the inode number. We�ll use our example decimal line to explain this more:

0x000025d0d0:         0      21     304      11   29797   29556   26217   27749

Like we mentioned before, testfile.pl is 11 characters. We find a matching number 11 in the 4th column. That means that the column 
with �304' is our record length field and the 0 and 21 columns make up our inode. Now, that we know which columns our inode is in ( columns 1 and 2 ), 
we must translate this number into our real inode number.

Step 9
Reset our position again.

a0b

Output:

a0b
0x000025d000  :  0x00000000 (0)

Again, we have to reset our position back to the beginning because this time, we�re going to print the information in hex.

Step 10
Print our data in hex.

p256x

Output:

p256x

0x000025d000:    0000  0002  000C  0001  2E00  0000  0000  0002
0x000025d010:    000C  0002  2E2E  0000  0000  0010  0014  000A
0x000025d020:    6C6F  7374  2B66  6F75  6E64  0000  0000  0011
0x000025d030:    0024  001A  6D65  6D5F  7265  706F  7274  5F32
0x000025d040:    3030  3431  3130  312E  646D  702E  677A  0000
0x000025d050:    0000  0012  001C  0012  6F72  6173  6372  6174
0x000025d060:    6368  2E63  7069  6F2E  677A  0000  0000  0013
0x000025d070:    0028  001D  7573  6572  5F61  6374  6976  6974
0x000025d080:    795F  3230  3034  3131  3031  2E64  6D70  2E67
0x000025d090:    7A00  0000  0000  0014  002C  0021  7573  6572
0x000025d0a0:    5F61  6374  6976  6974  795F  6465  745F  3230
0x000025d0b0:    3034  3131  3031  2E64  6D70  2E67  7A00  0000
0x000025d0c0:    0012  6000  0140  0005  4543  5231  5800  0000
0x000025d0d0:    0000  0015  0130  000B  7465  7374  6669  6C65
0x000025d0e0:    2E70  6C00  0120  0007  7465  7374  6469  7200
0x000025d0f0:    6A64  7563  6B6F  2E74  7874  0000  0000  0017
0x000025d100:    0104  0010  6A61  6D65  736D  6173  7361  7261
0x000025d110:    2E74  7874  0000  0000  0000  0000  0000  0000
0x000025d120:    0000  0000  0000  0000  0000  0000  0000  0000
0x000025d130:    0000  0000  0000  0000  0000  0000  0000  0000
0x000025d140:    0000  0000  0000  0000  0000  0000  0000  0000

First, we find the line that begins with our address 0�000025d0d0. There it is!

0x000025d0d0:    0000  0015  0130  000B  7465  7374  6669  6C65

Next, find the two columns that we know our inode is in. For us, that�s column 1 and 2. Column 1 is all 0�s, so we can disregard it. 
Column 2, however, is 0015. Open up a calculator and translate 15 from hexidecimal to decimal. As you can see, this number turns into 21, 
which is our real inode number.

Some of you may be asking why we just didn�t use the inode number from the decimal output in step 8. The reason is because it always isn�t 
always this easy. Take, for example, the address above ours. The directory ECR1X is on this address. Its inode number, like ours, is in 
columns 1 and 2. However, if you compare the lines between hexidecimal and decimal, you can immediately see the difference.

Decimal:
0x000025d0c0:      18  24576
Hex:
0x000025d0c0:    0012  6000

If you translate 12600 from hexidecimal to decimal, the output is 1204224, which is the correct inode number for the ECR1X directory. 
If you can figure out how to translate 18 24576 into 1204224, please let me know and I�ll update this document.

In any case, we now know the inode number of the missing file. We�re close to recovery!

Step 11
We go to our new inode number

21i

Output:

21i
i#:     21  md: f---rw-r--r--  ln:    0  uid:    0  gid:    3
szh:        0  szl:       45  (actual size:       45)
a0: 0xeff       a1: 0x00        a2: 0x00        a3: 0x00
a4: 0x00        a5: 0x00        a6: 0x00        a7: 0x00
at: Mon Jan 10 14:16:40 2005
mt: Mon Jan 10 14:16:48 2005
ct: Mon Jan 10 14:16:53 2005

From this output, you can see that we have a file.

Step 12
21i.ln=1

Output:

21i.ln=1
0x0000020a88  :  0x00000001 (1)

This sets the link count of the file back to 1. You can verify this by reissuing the command from step #11 and noticing that the �ln� field has incremented.

21i
i#:     21  md: f---rw-r--r--  ln:    1  uid:    0  gid:    3
szh:        0  szl:       45  (actual size:       45)
a0: 0xeff       a1: 0x00        a2: 0x00        a3: 0x00
a4: 0x00        a5: 0x00        a6: 0x00        a7: 0x00
at: Mon Jan 10 14:16:40 2005
mt: Mon Jan 10 14:16:48 2005
ct: Mon Jan 10 14:16:53 2005

We have now told the filesystem that the link count for inode 21 should be 1. This means that there should be a filename pointing 
at this inode. This basically reverses what the OS actually does when deleting files. It doesn�t actually erase the file data, 
instead, it unlinks the filename from its inode number, effectively preventing you from seeing the data.

Step 13
Quit.

q

Output:

q
[test:/]#

This quits out of the fsdb.

Step 14
Fsck our volume

fsck /dev/testlv

Output:

[test:/]# fsck /dev/testlv

** Checking /dev/rtestlv (/test)
** Phase 1 - Check Blocks and Sizes
** Phase 2 - Check Pathnames
** Phase 3 - Check Connectivity
** Phase 4 - Check Reference Counts
Unreferenced file  I=21  owner=root mode=100644
size=45 mtime=Jan 10 14:16 2005 ; RECONNECT? y
** Phase 5 - Check Inode Map
Bad Inode Map; SALVAGE? y
** Phase 5b - Salvage Inode Map
** Phase 6 - Check Block Map
Bad Block Map; SALVAGE? y
** Phase 6b - Salvage Block Map
18 files 21893872 blocks 171306256 free
***** Filesystem was modified *****

This does a filesystem check on /dev/testlv. As you can see, it finds an inode claiming it is linked to, but no file that links to it. 
We answer �y� to tell it to reconnect the inode to a filename, effectively giving us our file back!

Step 15
Remount our directory.

mount /test

Output: None

We must remount our filesystem to get back at our file.

Step 16
Go into lost and found. It�s where all lost little kiddies go. Duh.

cd /test/lost+found

Output: None

Our file is now located in lost+found. If you do an �ls� in this directory, you will see something like the following:

[test:/test/lost+found]# ls -l
total 8
-rw-r�r�   1 root     sys              45 Jan 10 14:16 21

And if we cat the file 21, we get the following:

[test:/test/lost+found]# cat 21
#!/usr/bin/perl

print �this is a test\n�;

Ta-da! It�s Myron�s missing perl script!

As a final aside, I will say that there may be different and much better ways of recovering files on AIX, however, this is the way 
I constructed from notes I found on various mailing lists and a few days of fooling around with it. So if you see some mistakes in this document 
or have some suggestions for better ways of doing this, please, let me know! I will happily update this document with better information as it is provided.

I hope this helps some of you who have to deal with certain people who accidentally delete files on your systems. Nothing beats a good backup 
but when you don�t have one of those, this can always be used as a fallback.


----------------------------------------------------------------------------------------
Note 11:
----------------------------------------------------------------------------------------

Subject: Undelete of a file on AIX, using fsdb.

http://faqs.cs.uu.nl/na-dir/aix-faq/part1.html


Contents repeated here:


RECOVERING REMOVED FILES AND DIRECTORIES IN A FILESYSTEM

If a file is Deleted from the system, the filesytem blocks composing 
that file still exist, but are no longer allocated. As long as no new
files are created or existing files extended within the same filesystem, 
the blocks will remain untouched. It is possible to reallocate the 
blocks to the previous file using the "fsdb" command (filesystem debugger).


 MAKE A BACKUP OF THE ENTIRE FILESYSTEM BEFORE PERFORMING THESE STEPS!!!
 ELSE ( BANG !!!!! ).

 It is possible to send a mail for have some informations ...

                   Bernard.Kozyra@bull.net


Steps to recover a deleted file
-------------------------------

1) "ls -id {dir}" 
   (where dir is directory where file resided)
   Record INODE number for next step.

2) Unmount the filesystem.

3) "fsdb /{Mountpoint}" or "fsdb /dev/{LVname}"
   (where Mountpoint is the filesystem mount point, and LVname is 
   the logical volume name of the filesystem)

4) "{INODE}i"
   (where INODE is the inode number recorded in step 1)
   This will display the inode information for the directory. The
   field a0 contains the block number of the directory.
   The following steps assume only field a0 is used. If a value 
   appears in a1, etc, it may be necessary to repeat steps #5 and 
   #6 for each block until the file to be recovered is found.

5) "a0b"
   (moves to block pointed to by field "a0" of this inode)

6) "p128c"
   (prints 128 bytes of directory in character format)
   Look for missing filename. If not seen, repeat this step until
   filename is found. Record address where filename begins. Also
   record address where PRIOR filename begins. If filename does 
   not appear, return to step #5, and selecting a1b, a2b, etc.

   Note that the address of the first field is shown to the far left.
   Increment the address by one for each position to the right,
   counting in octal.

7) "a0b"
   (moves to block pointed to by field "a0" of this inode)
   If the filename was found in block 1, use a1b instead, etc.

8) "p128e"
   (prints first 128 bytes in decimal word format)
   Find the address of the file to recover (as recorded in step 6) 
   in the far left column. If address is not shown, repeat until found.

9) Record the address of the file which appeared immediately PRIOR to 
   the file you want to recover.

10) Find the ADDRESS of the record LENGTH field for the file in step 
   #9 assuming the following format:

   {ADDRESS}:  x    x    x    x    x    x    x    x    x    x  ...
               |    |    |    |    |-------- filename ------|
     inode # --+----+    |    |
                         |    +-- filename length
         record LENGTH --+

   Note that the inode number may begin at any position on the line.
   Note also that each number represents two bytes, so the address
   of the LENGTH field will be `{ADDRESS} + (#hops * 2) + 1'

11) Starting with the first word of the inode number, count in OCTAL
    until you reach the inode number of the file to be restored, 
    assuming each word is 2 bytes.

12) "0{ADDRESS}B={BYTES}"
    (where ADDRESS is the address of the record LENGTH field found
    in step #10, and BYTES is the number of bytes [octal] counted 
    in step #11)

13) If the value found in the LENGTH field in step #10 is greater than
    255, also type the following:

    "0{ADDRESS-1}B=0"
    (where ADDRESS-1 is one less than the ADDRESS recorded in step #10)
    This is necessary to clear out the first byte of the word.

14) "q"
    (quit fsdb)

15) "fsck {Mountpoint}" or "fsck /dev/{LVname}"
    This command will return errors for each recovered file asking if
    you wish to REMOVE the file. Answer "n" to all questions.
    For each file that is listed, record the associated INODE number.

16) "fsdb /{Mountpoint}" or "fsdb /dev/{LVname}"

17) {BLOCK}i.ln=1
    (where BLOCK is the block number recoded in step #15)
    This will change the link count for the inode associated with
    the recovered file. Repeat this step for each file listed in
    step #15.

18) "q"
    (quit fsdb)

19) "fsck {Mountpoint}" or "fsck /dev/{LVname}"
    The REMOVE prompts should no longer appear. Answer "y" to
    all questions pertaining to fixing the block map, inode map,
    and/or superblock.

20) If the desired directory or file returns, send money to the author
    of this document.


----------------------------------------------------------------------------------------
Note 12:
----------------------------------------------------------------------------------------

This note has some interresting feautures. You can't use it for all types of un-delete,
but maybe you want to take a look.

Original:

http://lde.sourceforge.net/UNERASE.txt

Here the contents is repeated:


	I imagine that most of the people initially using this package
will be the ones who have recently deleted something.  After all,
that's what finally inspired me to learn enough about the different
file systems to write this package.  Undelete under unix really isn't
that hard, it really only suffers the same problems that DOS undelete
does which is -- you can't recover data that someone else has just
overwritten.

	If you are quick and have very few users on your system there
is a good chance that the data will be intact and you can go ahead
with a successful undelete.  I don't recommend using this package to
undelete your /usr/bin directory or really any directory, but if you
have trashed a piece of irreplaceable code or data, undelete is where
it's at.  If you can reinstall or have recent backups I'd recommend
you try them.  But it's up to you, besides, sometimes playing with
lde/undelete for a while is a lot more fun than going back and
recoding a few hours worth of lost work.

	Before I tell you how to undelete stuff, have a look at
doc/minix.tex (or the ps or dvi version).  Even if you aren't using a
minix file system, read it carefully, it will get you used to the
terms and the general idea behind things here.

These are the steps for a successful undelete:

#########################  STEP ONE  ##################################

	Unmount the partition which has the erased file on it.  If you
want to, you can remount it read-only, but it isn't necessary.  

NOTE: lde does some checks to see if the file system is mounted, but
it does not check if it was mounted read-only.  Some functions will be
deactivated for any (read-only or read/write) mounted partition.

#########################  STEP TWO  ##################################

	Figure out what you want to undelete.  If you know what kind
of file you are looking for (tar file, compressed file, C file),
finding it will be a lot easier.  There are a few ways to look for
file data.

	lde supports a type search and a string search for data at the
beginning of a file.  Currently, the supported types include gz
(gzip), tgz (tarred gzip file), and script (those beginning with
"#!/").

---- EXAMPLE ----
String search (search for a PKzip file - starts with PK, -O 0 not required):
	lde -S PK -O 0 /dev/hda1 

String search (search for JPEG files - JIFF starts at byte 6):
	lde -S JIFF -O 6 /dev/hda1

Type search (search for a gzipped tar file):
	lde -T tgz /dev/hda1
-------------------

	When searching by type, you can also include the filename;
the desired pattern will be extracted from the file.  You should
specify an offest (-O) and length (-L) when using this option.  This
option was included to make generalized searches easier.  You can
find pattern, length, and offset information in /etc/magic which you
can use to generate your own template files, or specify lengths and
offsets so that existing files may be used as templates.

---- EXAMPLE ----
Type search (search for core file - see /etc/magic to determine -O/-L):
	lde -T /proc/kcore -O 216 -L 4 /dev/hda1
-----------------

If you add --recoverable to the command line, it will check to see if
another active inode uses any blocks in this inode.  If no blocks are
marked used by another inode, "recovery possible" will be printed.  If
blocks are used by another file "recovery NOT possible" will be
printed to the screen.  You may still be able to get some data back
even when it reports that recovery is not possible.  To get an idea of
how many blocks are in use, you will have to check its recoverablilty
from lde via its curses interface.

---- EXAMPLE ----
./lde --paranoid -T script --ilookup --recoverable /dev/hda5
---- OUTPUT  ----
Paranoid flag set.  Opening device "/dev/hda5" read-only.
User requested autodetect filesystem. Checking device . . .
Found ext2fs on device.
Match at block 0x107, check inode 0xB, recovery possible.
Match at block 0x421E7, no unused inode found.
-----------------

	When you run lde in these mode, it will report a block (and
inode if you are lucky and used the --ilookup flag) where a match was
found. Take this inode number and go to step (3).

	If lde doesn't report anything on its own, or the search
detailed above does not suit your needs, you can use grep to search
the partition for data and pipe it through lde which will attempt to
find a block and inode again.  The recommended procedure (all this can
go on one line, the '\' indicates continuation) is:

   grep -b SEARCH DEVICE | awk '{FS = ":" } ; {print $1 }' | \
	 lde ${LDE_OPT} --grep DEVICE

A shell script (crash_recovery/grep-inode) is included that will do
this for you.

   grep-inode [grep_options] search_string device

---- EXAMPLE ----
   grep-inode -i MyDevelopment.h /dev/hda1
-----------------

	If none of these search methods are productive, you can page
through the disk with an editor (emacs /dev/hda2) or the preferred
choice might be to page through it with lde.  Fire up lde and go into
block mode (hit 'b') then use PG_UP/PG_DN to flip through all the
blocks until you find one you like.  Hitting '^R' while displaying the
block will attempt to find an inode which references the block.

########################  STEP THREE  #################################

	If you have an inode number, things are looking good.  Go into
inode mode and display this inode.  Then hit 'R' (use capital 'R') to
copy the inode information to the recovery block list and enter
recovery mode.  Now hit 'R' again and lde will prompt you for a file
name (you can include a full path).  Make sure you write it to a FILE
SYSTEM OTHER THAN THE ONE WHICH THE DELETED FILE RESIDES ON or you
will probably overwrite it as you go.  One day, when lde supports disk
writes, it will be able to undelete the file to its original location,
but for now this is safer.

	The recovered file will be a little larger than the original
as the last block will be padded with zeroes (or whatever was on the
disk at the end of the last block).  If you did find an inode for the
deleted file, you can copy its old size to the new inode by using lde
to edit the two inodes (don't use lde's copy/paste as it will copy the
entire inode and undo all the work you just did to restore the file).

######################  OTHER OPTIONS  ################################

	If you were unable to find an intact inode, things are going
to be tough.  You will have to find all the blocks in the file in
order.  If your disk is relatively unfragmented, you can hopefully
find everything in order or close by at least.  Currently, you have to
tag all the direct blocks, then find the indirect blocks and tag them.

	If the indirect block was wiped or you are unable to find it,
you've got a lot of work to do.  You can copy individual blocks one at
a time to the recovery file by using 'w' in block mode.  Display the
next block in the file, hit 'w', then enter the filename (if you hit
enter, the last filename will be reused and the block will be appended
to the file).  lde will always ask if you want to append, overwrite,
or cancel when a file exists.  You can override this by setting the
append flag from the flags menu ('f' from most modes).

	If you find any type of indirect block, you can copy it to the
recovery inode in its corresponding position and recover a whole bunch
of blocks at once.  Leave the direct blocks filled with zeros.

	Another option is to use dd.  Real programmers still probably
use emacs and dd to hack a fs. ;) If you know there are a bunch (one
or more) of contiguous blocks on the disk, you can use the unix
command dd to copy them from the device to a file.

---- EXAMPLE ----
To copy blocks 200-299 from the device /dev/hda1 to /home/recover/file1:

   dd if=/dev/hda1 of=/home/recover/file1 bs=1024 count=100 skip=200

	if    input file or device
	of    output file or device
	bs    blocksize (will be 1024 for most linux fs's)
	count number of blocks to copy
	skip  number of blocks to skip from the start of the device 
-----------------

Read the dd man page for more info.

####################  ABOUT INDIRECT BLOCKS  ##########################

[ Mail from to an lde user ]

> 1 - install a routine that lets you read what the indirect blocks
> are pointing to in the chain, I mean, I know that file X has 2
> indirect blocks but what blocks do these point to and how do I find
> out?

        This is hard to describe, but if you have figured out how to
use inode mode any you are looking at the blocklist contained in that
inode (it should list all the direct blocks and the 1x, 2x, and 3x
indirect blocks), when you hit 'B' when the cursor is sitting on the
1x indirect block, it will take you to that block in block mode, then
each entry in that block (most likely each entry is 4 bits -- as in
the ext2 fs) points to another block in the chain.

I.E.

        INDIRECT BLOCK:   0x000200

   Now look at block 0x000200

       0000:   01 00 00 00 02 00 00 00 : 04 04 04 00 10 01 00 00

   This would indicate the the next 4 blocks in the file are

        0x00000001, 0x00000002, 0x00040404, 0x00000110

The same is true for double indirect blocks, but the double indirect
blocks contains pointers to more indirect block which you must then
look up as above.

That was a pretty lousy explaination, someday I do plan to add a
feature where you may view all the blocks in a file without doing the
indirect indexing yourself.  For now, lde is mostly a crutch for last
ditch efforts at file recovery, but I'm glad if people find other uses
for it.


#################  RECOVERING WITHOUT INODES  #######################

[ This is mail to a person who was unable to find an inode, it gives
  some last ditch suggestions before giving up. ]

        In a perfect world, or on a virgin disk, everything would
be sequential.  But with things like unix and (network) file sharing,
many people can write to the disk at the same time, so the blocks
can get interleaved.  Also depending on the free space situation of
the disk, the two free blocks may not exist sequentially on the disk.
Also, there are file "holes" in ext2 where there are block pointers of
zero on the disk.  Normally an indirect block would point to 256
direct blocks, but with zero entries it may be less than this.

        If things are perfect, here is how I imagine your disk is
layed out:

        Direct blocks 1-9: you already know where these are and they
                           are in that tiny recovery file (9k).  These
                           were not sequential, so it makes me wonder
                           if the rest of the bytes will be layed out
			   in order.

        Indirect block:    This takes up one block and ideally your
                           data would start right after it.
        256 blocks of data:
        2x indirect block: Should only have one entry, pointing to the
                           next block on the disk
        indirect block:    pointed to by the 2xindirect block
        88 blocks of data:

So my last ditch recommendation is to use dd to copy the blocks off
the disk and then cat all the dd'ed files together.

        0x5e65e - 0x5e660  |
        0x61a72            |
        0x5e661            +--  These are the direct blocks, you could
        0x61ad4            |    use the lde recovered file instead of
        0x5e662 - 0x5e664  |    dd + cat.

        0x5e665 - 0x5e764  - 256 blocks of data
        0x5e750 - 0x5e7a8  - 88 blocks of data

Things look bad becuse the numbers are out of sequence (those 256
blocks of data should end right before the 2x indirect block at 0x5e74
there's 0x10 blocks unaccounted for (maybe this is just some of the
ext2 file system data which is dispersed about the disk -- it could
fall anywhere in that data range if it's there).

        So try:

---- EXAMPLE ----
	lde (recover direct blocks to /home/recover/block1)
        dd if=/dev/sdb1 of=/home/recover/block2 bs=1024 count=256 skip=386661
        dd if=/dev/sdb1 of=/home/recover/block3 bs=1024 count=88  skip=386896
        cat block.1 block2 block3 > access_file.dos
-----------------

####################  TRIPLE INDIRECT BLOCKS  #########################

[ This is a response to one persons request for immediate help
  recovering a very large file -- the stuff about the triple block
  having _three_ entries was specific to this persons problem.  In general
  though, the triple indirect block will not have very many entries, so
  this method might be viable until I get things together and write in
  the triple indirect block support. ]

        lde allows you to append a single block to the recover file
(use 'w' from block mode) -- you can page through the triple indirect
blocks to figure out the block order and then write each block to the
recover file.  I.e. after piecing things together from the triple
indirect block, you should have a list of all the blocks in the file,
now display the first block on the screen, write it to the file,
display the second block, write it to the file . . . I really don't
think it's worth it for 145,000 blocks though.

        The semi-automated way to do this is to make some fake inodes.
The triple indirect inode should be pretty empty - maybe 3 entires.
Each of these entries points to a double indirect block.  Solution:

        1) Recover any direct/indirect/double indirect blocks in 
           the original inode to a file.  Do this with lde.

        2) Look at the triple indirect block.  It should have 3
           entries.  Write down the 3 double indirect blocks listed here.

        3) Use the recover mode fake inode, fill in all entires with
           zeroes.  Now fill in the 1st double indirect block that
           you wrote down in step 2 in the slot for the 2x indirect 
           block.

        4) Execute a recover, dump it to a file, say "file1".  Repeat
           step 3 with the other two double indirect inodes from step 2.

        5) Now you should have 4 files, catenate them all together and
           with any luck, it will un-tar.
 

----------------------------------------------------------------------------------------
Note 13:
----------------------------------------------------------------------------------------

>>> Some tools or info that might be usefull:


1. Midnight Commander 
  is GNU (free) software that runs on UNIX based operating systems. 
  At the time of writing, the undelete feature only works on ext2 filesystems.
  Midnight Commander can be obtained at http://www.ibiblio.org/mc/

2. Opensource forensic:
  http://www.opensourceforensics.org/tools/unix.html

3. R-Linux, recovery and undelete tool for Ext2 fs
   http://3d2f.com/tags/undelete/recover/unix/

4. http://foremost.sourceforge.net/
   Also take a look at
   Tom Pycke, Recovering Files in Linux, available at www.recover.source.net/linux

5. R-Linux 1.0
   Data Recovery and Undelete Tool for Ext2FS (Linux) file system. 
   http://www.supershareware.com/info/r-linux.html

6. Compunix AIX undelete tool:
   http://www.compunix.com/prod/analyse.html
   http://www.compunix.com/eval/list.html


7. Check out a tool called "Lazarus" which can work in combination with unrm

8. For Linux (ext2, ext3 fs) and Solaris (ufs fs)
   R-Tools technology: Undelete tool for Linux and Solaris:
   http://www.data-recovery-software.net/

9. Solaris undelete tools:

   -- Kernel Recovery for Solaris Sparc
   http://www.download.com/Kernel-Recovery-for-Solaris-Sparc/3000-2248_4-10578170.html
   http://www.download3k.com/Press-Launch-of-Kernel-Recovery-for-Solaris-SPARC.html
   http://www.tucows.com/preview/505583
   http://www.programurl.com/kernel-recovery-for-solaris-sparc.htm

   Nucleus Technologies.com: http://www.nucleustechnologies.com 

   -- Other Solaris Data Recovery Software:
   http://solaris-data-recovery-software.qarchive.org/


   R-Tools technology: Undelete tool for Linux and Solaris:
   http://www.data-recovery-software.net/

10. General info on undelete intentions on ext2 fs:
    http://amadeus.uprm.edu/~undelete/Presentacion.html

11. Patents on undelete feature in Unix (requires a change in how inodes are freed)
    http://www.patentstorm.us/patents/6615224.html
    http://www.freepatentsonline.com/6615224.html


##############################################################

SECTION 11: GENERIC: SIMPLE EXAMPLES ON USING dd, od:

##############################################################


----------------------------------------------------------------------------------------
Note 1:
----------------------------------------------------------------------------------------

You already have seen some examples of using the dd and od commands. These commands are available on almost
all unix versions. They are extremely powerfull, and could be very dangerous also, if not used properly.
Because you can dump any diskblock, or blocks from tape, to any output, with possible conversion of data,
you might even recover data which would otherwise be considered as lost.

The following article is very instructive on how to use the dd command.


http://www.codecoffee.com/tipsforlinux/articles/036.html

>> How and when to use the dd command?  
 

In this article, Sam Chessman explains the use of the dd command with a lot of useful examples. This article is not aimed at absolute beginners. 
Once you are familiar with the basics of Linux, you would be in a better position to use the dd command. 

The ' dd ' command is one of the original Unix utilities and should be in everyone's tool box. It can strip headers, extract parts of 
binary files and write into the middle of floppy disks; it is used by the Linux kernel Makefiles to make boot images. 
It can be used to copy and convert magnetic tape formats, convert between ASCII and EBCDIC, swap bytes, and force to upper and lowercase. 


For blocked I/O, the dd command has no competition in the standard tool set. One could write a custom utility to do specific I/O or 
formatting but, as dd is already available almost everywhere, it makes sense to use it. 

Like most well-behaved commands, dd reads from its standard input and writes to its standard output, unless a command line specification 
has been given. This allows dd to be used in pipes, and remotely with the rsh remote shell command. 

Unlike most commands, dd uses a keyword=value format for its parameters. This was reputedly modeled after IBM System/360 JCL, 
which had an elaborate DD 'Dataset Definition' specification for I/O devices. A complete listing of all keywords is available from GNU dd with 

$ dd --help

Some people believe dd means ``Destroy Disk'' or ``Delete Data'' because if it is misused, a partition or output file can be trashed very quickly. 
Since dd is the tool used to write disk headers, boot records, and similar system data areas, misuse of dd has probably trashed 
many hard disks and file systems. 

In essence, dd copies and optionally converts data. It uses an input buffer, conversion buffer if conversion is specified, and an output buffer. 
Reads are issued to the input file or device for the size of the input buffer, optional conversions are applied, and writes are issued 
for the size of the output buffer. This allows I/O requests to be tailored to the requirements of a task. Output to standard error reports 
the number of full and short blocks read and written. 


Example 1


A typical task for dd is copying a floppy disk. As the common geometry of a 3.5" floppy is 18 sectors per track, two heads and 80 cylinders, 
an optimized dd command to read a floppy is: 

Example 1-a : Copying from a 3.5" floppy

dd bs=2x80x18b if=/dev/fd0 of=/tmp/floppy.image 
1+0 records in
1+0 records out 

The 18b specifies 18 sectors of 512 bytes, the 2x multiplies the sector size by the number of heads, and the 80x is for the cylinders--
a total of 1474560 bytes. This issues a single 1474560-byte read request to /dev/fd0 and a single 1474560 write request to 
/tmp/floppy.image, whereas a corresponding cp command 

cp /dev/fd0 /tmp/floppy.image


issues 360 reads and writes of 4096 bytes. While this may seem insignificant on a 1.44MB file, when larger amounts of data are involved, 
reducing the number of system calls and improving performance can be significant. 


This example also shows the factor capability in the GNU dd number specification. This has been around since before the Programmers Work Bench and, 
while not documented in the GNU dd man page, is present in the source and works just fine, thank you. 


To finish copying a floppy, the original needs to be ejected, a new diskette inserted, and another dd command issued to write to the diskette: 

Example 1-b : Copying to a 3.5" floppy
dd bs=2x80x18b < /tmp/floppy.image > /dev/fd0 
1+0 records in 
1+0 records out 

Here is shown the stdin/stdout usage, in which respect dd is like most other utilities. 


Example 2


The original need for dd came with the 1/2" tapes used to exchange data with other systems and boot and install Unix on the PDP/11. 
Those days are gone, but the 9-track format lives. To access the venerable 9-track, 1/2" tape, dd is superior. With modern SCSI tape devices, 
blocking and unblocking are no longer a necessity, as the hardware reads and writes 512-byte data blocks. 

However, the 9-track 1/2" tape format allows for variable length blocking and can be impossible to read with the cp command. The dd command allows 
for the exact specification of input and output block sizes, and can even read variable length block sizes, by specifying an input buffer size larger 
than any of the blocks on the tape. Short blocks are read, and dd happily copies those to the output file without complaint, simply reporting on the 
number of complete and short blocks encountered. 


Then there are the EBCDIC datasets transferred from such systems as MVS, which are almost always 80-character blank-padded Hollerith Card Images! 
No problem for dd, which will convert these to newline-terminated variable record length ASCII. Making the format is just as easy and dd again 
is the right tool for the job. 

Example 2 : Converting EBCDIC 80-character fixed-length record to ASCII variable-length newline-terminated record 
dd bs=10240 cbs=80 conv=ascii,unblock if=/dev/st0 of=ascii.out
40+0 records in
38+1 records out 

The fixed record length is specified by the cbs=80 parameter, and the input and output block sizes are set with bs=10240. 
The EBCDIC-to-ASCII conversion and fixed-to-variable record length conversion are enabled with the conv=ascii,noblock parameter. 


Notice the output record count is smaller than the input record count. This is due to the padding spaces eliminated from the output file and 
replaced with newline characters. 


Example 3


Sometimes data arrives from sources in unusual formats. For example, every time I read a tape made on an SGI machine, the bytes are swapped. 
The dd command takes this in stride, swapping the bytes as required. The ability to use dd in a pipe with rsh means that the tape device 
on any *nix system is accessible, given the proper rlogin setup. 

Example 3 : Byte Swapping with Remote Access of Magnet Tape
rsh sgi.with.tape dd bs=256b if=/dev/rmt0 conv=swab | tar xvf -


The dd runs on the SGI and swaps the bytes before writing to the tar command running on the local host. 


Example 4

Murphy's Law was postulated long before digital computers, but it seems it was specifically targeted for them. 
When you need to read a floppy or tape, it is the only copy in the universe and you have a deadline past due, that is when you will have a bad spot 
on the magnetic media, and your data will be unreadable. To the rescue comes dd, which can read all the good data around the bad spot and continue 
after the error is encountered. Sometimes this is all that is needed to recover the important data. 

Example 4 : Error Handling
dd bs=265b conv=noerror if=/dev/st0 of=/tmp/bad.tape.image 


Example 5


The Linux kernel Makefiles use dd to build the boot image. In the Alpha Makefile /usr/src/linux/arch/alpha/boot/Makefile, 
the srmboot target issues the command: 

Example 5 : Kernel Image Makefile
dd if=bootimage of=$(BOOTDEV) bs=512 seek=1 skip=1 

This skips the first 512 bytes of the input bootimage file (skip=1) and writes starting at the second sector of the $(BOOTDEV) device (seek=1). 
A typical use of dd is to skip executable headers and begin writing in the middle of a device, skipping volume and partition data. 
As this can cause your disk to lose file system data, please test and use these applications with care.

 
----------------------------------------------------------------------------------------
Note 2:
----------------------------------------------------------------------------------------


od Command


Purpose
Displays files in a specified format. 
dump files in octal and other formats


Syntax

To Display Files Using a Type-String to Format the Output
od [  -v ] [  -A AddressBase ] [  -N Count ] [  -j Skip ] [  -t TypeString ... ] [ File ... ] 

type is a string of one or more of the below type indicator characters. If you include more than one type indicator character 
in a single type string or use this option more than once, od writes one copy of each output line using each of the data types 
that you specified, in the order that you specified. 

a named character 
c ASCII character or backslash escape 
d signed decimal 
f floating point 
o octal 
u unsigned decimal 
x hexadecimal 
C char 
S short 
I int 
L long 
For floating point (f): 
F float 
D double 
L long double 


Examples:

>> To display a file in octal, a page at a time, enter: 

od a.out | pg

This command displays the a.out file in octal format and pipes the output through the pg command. 

>> To translate a file into several formats at once, enter: 

od -t cx a.out > a.xcd

This command writes the contents of the a.out file, in hexadecimal format ( x) and character format ( c), into the a.xcd file. 

>> To start displaying a file in the middle (using the first syntax format), enter: 

od -t acx -j 100 a.out

This command displays the a.out file in named character ( a), character ( c), and hexadecimal ( x) formats, starting from the 100th byte. 

>> To start in the middle of a file (using the second syntax format), enter: 

od -bcx a.out +100.

This displays the a.out file in octal-byte ( -b), character ( -c), and hexadecimal ( -x) formats, starting from the 100th byte. 
The . (period) after the offset makes it a decimal number. Without the period, the output would start from the 64th (100 octal) byte. 

% dir | od -c | more
% cat my_file | od -c |more
% od my_file |more
Comparison of different outputs:

>> Show 16 first characters from a binary file (/bin/sh) as ASCII characters or backslash escapes (octal):

% od -N 16 -c /bin/sh
output: 
0000000 177 E L F 001 001 001 \0 \0 \0 \0 \0 \0 \0 \0 \0

>> Show the same binary as named ASCII characters:

% od -N 16 -a /bin/sh
output:

0000000 del E L F soh soh soh nul nul nul nul nul nul nul nul nul

>> Show the same binary as short hexcadecimals:

% od -N 16 -t x1 /bin/sh
output:

0000000 7f 45 4c 46 01 01 01 00 00 00 00 00 00 00 00 00


>> Show the same binary as octal numbers:

% od -N 16 /bin/sh
output:

% 0000000 042577 043114 000401 000001 000000 000000 000000 000000


##############################################################

SECTION 12: MOUNTING A CD DEVICE:

##############################################################


AIX:
----
# mount -r -v cdrfs /dev/cd0 /cdrom


Solaris:
--------
# mount -r -F hsfs /dev/dsk/c0t6d0s2 /cdrom


HPUX:
-----

mount -F cdfs -o rr /dev/dsk/c1t2d0 /cdrom


SuSE Linux:
-----------
# mount -t iso9660 /dev/cdrom /cdrom
# mount -t iso9660 /dev/cdrom /media/cdrom


Redhat Linux:
-------------
# mount -t iso9660 /dev/cdrom /media/cdrom

Other commands on Linux:
------------------------

Sometimes on some Linux, and some scsi CDROM devices, you might try

# mount /dev/sr0 /mount_point
# mount -t iso9660 /dev/sr0 /mount_point


##############################################################

SECTION 13:COMMANDS TO RETREIVE SYSTEM INFO:

##############################################################


Memory:
-------
AIX:     bootinfo -r
         lsattr -E -l mem0
         lsattr -E -l sys0 -a realmem
         svmon -G
         vmstat -v
         vmo -L
         or use a tool as "topas" or "nmon" (these are utilities)

Linux:   cat /proc/meminfo
         /usr/sbin/dmesg | grep "Physical"
         free   (the free command)
HP:      /usr/sam/lbin/getmem
         grep MemTotal /proc/meminfo
         /etc/dmesg | grep -i phys      
         wc -c /dev/mem
         or us a tool as "glance", like entering "glance -m" from prompt (is a utility)
Solaris: /usr/sbin/prtconf | grep "Memory size"
Tru64:   /bin/vmstat -P | grep "Total Physical Memory"


Swap:
-----

AIX:           lsps -a  (or lsps -s)
               pstat -s
               
HP:            /usr/sbin/swapinfo -a
Solaris:       /usr/sbin/swap -l
Linux:         /sbin/swapon -s
               cat /proc/swaps
               cat /proc/meminfo


cpu:
----

HP:       ioscan -kfnC processor		
	  getconf CPU_VERSION		
	  getconf CPU_CHIP_TYPE		
	  model	

AIX:      lparstat (-i)       
          prtconf | grep proc
          pmcycles -m
          lsattr -El procx (x is 0,2, etc..)
          lscfg | grep proc
          pstat -S

Linux:    cat /proc/cpuinfo

Solaris:  psrinfo -v
          prtconf
          psrset -p 
          prtdiag


OS version:
-----------

HP:      uname -a

Linux:   cat /proc/version 
      
Solaris: uname -a
         cat /etc/release   (or other way to view that file, like "more /etc/release")
Tru64:   /usr/sbin/sizer -v

AIX:     oslevel -r
         lslpp -h bos.rte

AIX firmware:
lsmcode -c               display the system firmware level and service processor
lsmcode -r -d scraid0    display the adapter microcode levels for a RAID adapter scraid0
lsmcode -A               display the microcode level for all supported devices
prtconf                  shows many setting including memory, firmware, serial# etc..


  Notes about Power 4 or 5 lpars: 
  -------------------------------

  For AIX: The uname -L command identifies a partition on a system with multiple LPARS. The LPAR id  
  can be useful for writing shell scripts that customize system settings such as IP address or hostname. 

  The output of the command looks like: 

  # uname -L
  1 lpar01 

  The output of uname -L varies by maintenance level. For consistent output across maintenance levels,  
  add a -s flag. For illustrate, the following command assigns the partition number to the variable 
  "lpar_number" and partiton name to "lpar_name". 

  For HP-UX:
  Use commands like "parstatus" or "getconf PARTITION_IDENT" to get npar information.


patches:
--------

AIX:     Is a certain fix (APAR) installed?
         instfix -ik APAR_number
         instfix -a -ivk APAR_number
         
         To determine your platform firmware level, at the command prompt, type:

         lscfg -vp | grep -p Platform

         The last six digits of the ROM level represent the platform firmware date in the format, YYMMDD.


HP:      /usr/sbin/swlist -l patch
         swlist | grep patch
Linux:   rpm -qa
Solaris: showrev -p
         pkginfo -i package_name
Tru64:   /usr/sbin/dupatch -track -type kit


Netcards:
---------

AIX:	 lsdev -Cc adapter
         lsdev -Cc adapter | grep ent
	 lsdev -Cc if
         lsattr -E -l ent1
         ifconfig -a
Solaris: prtconf -D    /    prtconf -pv   /     prtconf | grep "card"
         prtdiag | grep "card"
         svcs -x
         ifconfig -a (up plumb)


Network sniffing:
-----------------

Here are a few short descriptions, and examples, of usefull network trace / dump commands.


-- Solaris: 

snoop command examples:

For example, if we want to observe traffic between systems alpha and beta  we can use the following command: 
# snoop alpha,beta
To enable data captures from the snoop output without losing packets while writing to the screen, send the snoop output to a file. For example:
# snoop -o /tmp/snooper -V 128.50.1.250
To snoop a specific port:
# snoop -o port xxx 


-- AIX:

tcpdump command examples:

# tcpdump port 23
# tcpdump -i en0 
A good way to use tcpdump is to save the network trace to a file with the -w flag and then analyze the trace by using different
filtering options together with the -r flag. The following example show how to run a basic tcpdump network trace, 
saving the output in a file with the -w flag (on a Ethernet network interface):
# tcpdump -w /tmp/tcpdump.en0 -i en0

To limit the number of traced packets, use the -c flag and specify the number, such as in the following example
that traces the first 128 packets (on a token-ring network interface):
# tcpdump -c 128 -w /tmp/tcpdump.tr0 -i tr0

iptrace command examples:

To start the iptrace daemon with the System Resource Controller (SRC),
# startsrc -s iptrace -a "/tmp/nettrace"

To stop the iptrace daemon with SRC enter the following:
# stopsrc -s iptrace

To record packets coming in and going out to any host on every interface, enter the command in the following format:
# iptrace /tmp/nettrace

The recorded packets are received on and sent from the local host. All
packet flow between the local host and all other hosts on any interface is
recorded. The trace information is placed into the /tmp/nettrace file.

To record packets received on an interface from a specific remote host,
enter the command in the following format:
# iptrace - i en0 -p telnet -s airmail /tmp/telnet.trace

The packets to be recorded are received on the en0 interface, from remote
hostairmail, over the telnet port. The trace information is placed into the
/tmp/telnet.trace file.

To record packets coming in and going out from a specific remote host,
enter the command in the following format:
# iptrace -i en0 -s airmail -b /tmp/telnet.trace

The packets to be recorded are received on the en0 interface, from remote
host airmail. The trace information is placed into the /tmp/telnet.trace file.


-- HPUX:

nettl command:

Initialize the tracing/logging facility:
# nettl -start
Logging is enabled for all subsystems as determined by the /etc/nettlgen.conf file. Log messages are sent 
to a log file whose name is determined by adding the suffix .LOG000 to the log file name specified
in the /etc/nettlgen.conf configuration file. 

To stop the tracing facility:
# nettl -stop

Turn on inbound and outbound PDU tracing for the transport and session (OTS/9000) subsystems
and send binary trace messages to file /var/adm/trace.TRC000. 
# nettl -traceon pduin pduout -entity transport session \ 
     -file /var/adm/trace 

Session using nettl and the formatter netfmt:
1. Capture packets
nettl -tn all -e ns_ls_ip -tm 99999 -size 1024 -f some-raw-capture-file

2. Reproduce problem.

3. Turn off trace: nettl -tf -e all

4. Create formatter filter file. Example:
filter tcp_sport 6699
filter tcp_dport 6699

5. Filter the packets:
5.1 "Long" display
netfmt -Nlnc filter-file -f some-raw.capture > formatted.out
5.2 "One-liner" display
netfmt -Nln1Tc filter-file -f some-raw.capture > one-liner.out


-- Restart inetd, nfs:
-- -------------------

Starting and stopping NFS:			
--------------------------
			
On all unixes, a number of daemons should be running in order for NFS to be functional, like for example			
the rpc.* processes, biod, nfsd and others.			
			
Once nfs is running, and in order to actually "share" or "export" your filesystem on your server, so remote clients 			
are able to mount the nfs mount, in most cases you should edit the "/etc/exports" file.			
			
-- AIX:			
The following subsystems are part of the nfs group: nfsd, biod, rpc.lockd, rpc.statd, and rpc.mountd. 			
The nfs subsystem (group) is under control of the "resource controller", so starting and stopping nfs			
is actually easy			
			
# startsrc -g nfs			
# stopsrc -g nfs			
			
Or use smitty.			
			
-- Redhat Linux:			
# /sbin/service nfs restart			
# /sbin/service nfs start			
# /sbin/service nfs stop			
			
-- On some other Linux distros			
# /etc/init.d/nfs start 			
# /etc/init.d/nfs stop			
# /etc/init.d/nfs restart			
			
-- Solaris:			
If the nfs daemons aren't running, then you will need to run:			
# /etc/init.d/nfs.server start 			
			
-- HP-UX:			
Issue the following command on the NFS server to start all the necessary NFS processes (HP): 			
# /sbin/init.d/nfs.server start 			
 			
Or if your machine is only a client:			
# cd /sbin/init.d			
# ./nfs.client start			
			
			
Restart or refresh inetd after you have edited "inetd.conf":			
------------------------------------------------------------
			
After you have edited "/etc/inetd.conf", for example, to enable or disable some service,			
you need to restart, or refresh inetd, to read the new configuration information.			
To let inetd to reread the configfile:			
			
-- AIX:			
# refresh -s inetd			
			
-- HPUX:			
# /usr/sbin/inetd -c 			
			
-- Solaris:			
# /etc/init.d/inetd stop			
# /etc/init.d/inetd start			
# pkill -HUP inetd		# The command will restart the inetd and reread the configuration.	
			
-- RedHat / Linux			
# service xinetd restart			
or			
# /etc/init.d/inetd restart			


---------------------------------------------------------------------------------
Note: How to get a "reaonable" view on memory consumption of a process in UNIX:
---------------------------------------------------------------------------------

With using just the command line, or some free utils.


In general not so easy to answer, because of the "sub components" you might distinguish
in memory occupation. For example, do you mean RSS, real, shared, virtual, paging, including all libraries loaded, etc..?

-- Some people like to use the ps command with some special flags, like
   ps -vg
   ps auxw   # or  ps auxw | sort -r +3 |head -10 (top users)

   But those commands seems not so very satisfactory, and not "complete" in their output.

-- There are some great common utilities like topas, nmon, top etc.., or tools specific to a certain Unix, like SMC for Solaris.
   No bad word on those tools, because they are great. But some people think that they are not satisfactory 
   on the subject of memory consumption of a process (although they show a lot of other interesting information).

-- Some other ways might be:

# procmap pid      (in e.g. AIX)
# pmap -x pid      (in e.g. Solaris)

Those tools also show a "total" memory usage, which is a good indicator.

For example:
   
# pmap -x $$

492328: -ksh
 Address  Kbytes     RSS    Anon  Locked Mode   Mapped File
00010000     192     192       -       - r-x--  ksh
00040000       8       8       8       - rwx--  ksh
00042000      40      40       8       - rwx--    [ heap ]
FF180000     680     680       -       - r-x--  libc.so.1
FF23A000      24      24       -       - rwx--  libc.so.1
FF240000       8       8       8       - rwx--  libc.so.1
FF280000     576     576       -       - r-x--  libnsl.so.1
FF310000      40      40       -       - rwx--  libnsl.so.1
FF31A000      24      16       -       - rwx--  libnsl.so.1
FF350000      16      16       -       - r-x--  libmp.so.2
FF364000       8       8       -       - rwx--  libmp.so.2
FF380000      40      40       -       - r-x--  libsocket.so.1
FF39A000       8       8       -       - rwx--  libsocket.so.1
FF3A0000       8       8       -       - r-x--  libdl.so.1
FF3B0000       8       8       8       - rwx--    [ anon ]
FF3C0000     152     152       -       - r-x--  ld.so.1
FF3F6000       8       8       8       - rwx--  ld.so.1
FFBFC000      16      16       8       - rw---    [ stack ]
-------- ------- ------- ------- -------
total Kb    1856    1848      48       -

This gives you a reasonable idea on memory consumption of a pid.

You can also try:

# svmon -G
# svmon -U
# svmon -P -t 10     (top 10 users)
# svmon -U steve -l  (memory stats for user steve)

But svmon is not available on all unixes.

The following might also be helpfull (not on all unixes):

# ls -l /proc/{pid}/as
# prstat -a -s rss

And ps can give some info as well

# ps -ef | egrep -v "STIME|$LOGNAME" | sort +3 -r | head -n 15
# ps au


------------------------------
Note: Show aioservers in AIX:
------------------------------

# lsattr -El aio0
autoconfig available STATE to be configured at system restart True
fastpath   enable    State of fast path                       True
kprocprio  39        Server PRIORITY                          True
maxreqs    4096      Maximum number of REQUESTS               True
maxservers 10        MAXIMUM number of servers per cpu        True
minservers 1         MINIMUM number of servers                True

# pstat -a | grep -c aios
20

# ps -k | grep aioserver
  331962      -  0:15 aioserver
  352478      -  0:14 aioserver
  450644      -  0:12 aioserver
  454908      -  0:10 aioserver
  565292      -  0:11 aioserver
  569378      -  0:10 aioserver
  581660      -  0:11 aioserver
  585758      -  0:17 aioserver
  589856      -  0:12 aioserver
  593954      -  0:15 aioserver
  598052      -  0:17 aioserver
  602150      -  0:12 aioserver
  606248      -  0:13 aioserver
  827642      -  0:14 aioserver
  991288      -  0:14 aioserver
  995388      -  0:11 aioserver
 1007616      -  0:12 aioserver
 1011766      -  0:13 aioserver
 1028096      -  0:13 aioserver
 1032212      -  0:13 aioserver

What are aioservers in AIX5?:

With IO on filesystems, for example if a database is involved, you may try to tune the number
of aioservers (asynchronous IO)

AIX 5L supports asynchronous I/O (AIO) for database files created both on file system partitions and on raw devices. 
AIO on raw devices is implemented fully into the AIX kernel, and does not require database processes 
to service the AIO requests. When using AIO on file systems, the kernel database processes (aioserver) 
control each request from the time a request is taken off the queue until it completes. The kernel database 
processes are also used with I/O with virtual shared disks (VSDs) and HSDs with FastPath disabled. By default, 
FastPath is enabled. The number of aioserver servers determines the number of AIO requests that can be executed 
in the system concurrently, so it is important to tune the number of aioserver processes when using file systems 
to store Oracle Database data files. 

- Use one of the following commands to set the number of servers. This applies only when using asynchronous I/O 
on file systems rather than raw devices: 

# smit aio 

# chdev -P -l aio0 -a maxservers='128' -a minservers='20' 

- To set asynchronous IO to `Available':
# chdev -l aio0 -P -a autoconfig=available

You need to restart the Server:
# shutdown -Fr


aio on Linux distro's:

On some Linux distro's, Oracle 9i/10g supports asynchronous I/O but it is disabled by default because 
some Linux distributions do not have libaio by default. For Solaris, the following configuration is not required 
- skip down to the section on enabling asynchronous I/O.

On Linux, the Oracle binary needs to be relinked to enable asynchronous I/O. The first thing to do is shutdown 
the Oracle server. After Oracle has shutdown, do the following steps to relink the binary:

su - oracle
cd $ORACLE_HOME/rdbms/lib
make -f ins_rdbms.mk async_on
make -f ins_rdbms.mk ioracle


----------------------------------
Note: The ipcs and ipcrm commands:
----------------------------------

The "ipcs" command is really a "listing" command. But if you need to intervene
in memory structures, like for example if you need to "clear" or remove a shared memory segment, 
because a faulty or crashed
application left semaphores, memory identifiers, or queues in place,
you can use to "ipcrm" command to remove those structures.

Example ipcrm command usage:
----------------------------

Suppose an application crashed, but it cannot be started again. The following might help,
if you happened to know which IPC identifier it used.
Suppose the app used 47500 as the IPC key. Calcultate this decimal number to hex
which is, in this example, B98C.

No do the following:

# ipcs -bm | grep B89C

This might give you, for example, the shared memory identifier "50855977".
Now clear the segment: 

# ipcrm -m 50855977

It might also be, that still a semaphore and/or queue is still "left over".
In that case you might also try commands like the following example:

ipcs -q
ipcs -s

# ipcrm -s 2228248    (remove semaphore)
# ipcrm -q 5111883    (remove queue)


Note: in some cases the "slibclean" command can be used to clear unused modules in kernel and library memory.
Just give as root the command:

# slibclean

Other Example:
--------------

If you run the following command to remove a shared memory segment and you get this error:

# ipcrm -m 65537
ipcrm: 0515-020 shmid(65537) was not found.

However, if you run the ipcs command, you still see the segment there:

# ipcs | grep 65537
m 65537 0x00000000 DCrw------- root system

If you look carefully, you will notice the "D" in the forth column. The "D" means:

D If the associated shared memory segment has been removed. It disappears when the last process attached 
to the segment detaches it.

So, to clear the shared memory segment, find the process which is still associated with the segment:

# ps -ef | grep process_owner

where process_owner is the name of the owner using the shared segment 

Now kill the process found from the ps command above

# kill -9 pid

Running another ipcs command will show the shared memory segment no longer exists:

# ipcs | grep 65537 
Example

ipcrm -m 65537 


-----------------------------------------
Note : Show patches, version, systeminfo:
-----------------------------------------

Solaris:
========

showrev:
--------

#showrev
Displays system summary information.

#showrev -p
Reports which patches are installed 

sysdef and dmesg:
-----------------

The follwing commands also displays configuration information
# sysdef
# dmesg


versions:
---------

==> To check your Solaris version:
# uname -a or uname -m
# cat /etc/release 
# isainfo -v

==> To check your AIX version:

# oslevel
# oslevel -r    tells you which maintenance level you have.

>> To find the known recommended maintenance levels:
# oslevel -rq

>> To find all filesets lower than a certain maintenance level:
# oslevel -rl 5200-06

>> To find all filesets higher than a certain maintenance level:
# oslevel -rg 5200-05

>> To list all known recommended maintenance and technology levels on the system, type:

# oslevel -q -s
Known Service Packs
-------------------
5300-05-04
5300-05-03
5300-05-02
5300-05-01
5300-05-00
5300-04-CSP
5300-04-03
5300-04-02
5300-04-01
5300-03-CSP

>> Example:
5300-02 is TL 02
5300-02-04 is TL 02 and SP 04
5300-02-CSP is TL 02 and CSP for TL 02 
(and there won't be anymore SPs because when you see a CSP it is because the next TL has been released.  
In this case it would be TL 03).

>> How can I determine which fileset updates are missing from a particular AIX level?
To determine which fileset updates are missing from 5300-04, for example, run the following command:

# oslevel -rl 5300-04 

>> What SP (Service Pack) is installed on my system?
To see which SP is currently installed on the system, run the oslevel -s command. Sample output for an 
AIX 5L Version 5.3 system, with TL4, and SP2 installed would be:

# oslevel -s
5300-04-02
			 
>> Is a CSP (Concluding Service Pack) installed on my system?
To see if a CSP is currently installed on the system, run the oslevel -s command. 
Sample output for an AIX 5L Version 5.3 system, with TL3, and CSP installed would be:

# oslevel -s
5300-03-CSP
 

==> To check your HP machine:

# model
9000/800/rp7410


: machine info on AIX

How do I find out the Chip type, System name, Node name, Model Number etc.? 

The uname command provides details about your system. uname -p  Displays the chip type of the system. 
For example, powerpc. 

uname -r  Displays the release number of the operating system. 
uname -s  Displays the system name. For example, AIX. 
uname -n  Displays the name of the node.  
uname -a  Displays the system name, nodename,Version, Machine id. 
uname -M  Displays the system model name. For example, IBM, 7046-B50. 
uname -v  Displays the operating system version 
uname -m  Displays the machine ID number of the hardware running the system. 
uname -u  Displays the system ID number.  

Architecture:
-------------

To see if you have a CHRP machine, log into the machine as the root user, and run the following command:

# lscfg | grep Architecture               or use:
# lscfg -pl sysplanar0 | more

The bootinfo -p command also shows the architecture of the pSeries, RS/6000

# bootinfo -p
chrp


------------------------------------------------------------
Note: some usefull commands on Linux and AIX (and other OS):
------------------------------------------------------------


-- Linux:
=========


-- Show your OS version:

# cat /proc/version 
# uname -a

-- Show the open files that a process uses:

# pfiles pid

-- Show the jobs that are scheduled (in the account you use) from cron:

# crontab -l

-- What are the standard mounted filesystems: That's defined in "/etc/fstab"

# cat /etc/fstab

-- Which processes are using a certain filesystem?

# fuser -c /filesystem     # We mean the "mountpoint", like for example "/apps/oracle"

-- Show memory usage of a process:

# pmap -d pid                       # (Most important options: -x  Show the extended format; -d Show the device format.)
                                    # (And pid is the process-id, as visible in the command "ps -ef".)

-- Show system memory:

# cat /proc/meminfo
# /usr/sbin/dmesg | grep "Physical"
# free                              # (the free command)   

-- Swap usage:

# cat /proc/swaps                   # Above 60%-70% it's getting scary
# cat /proc/meminfo

-- cpu info:

# cat /proc/cpuinfo

-- user and process limits:

Sometimes, when a process runs under some account, and it fails for no immediate reason, it might be
worth checking the "ulimit" of that account (like max filesize, max open files, number of files etc..)
use it under that account as:

# ulimit (-a)

-- Show processtree of parent and children:

# pstree pid                       # on some distros ptree is implemented


-- Show the system error report / error log:

# cat /var/log/messages | more    (# more will ensure that not all contents scroll at your screen "at once", until the end is reached)


-- Determine the type of a file (e.g. is it ascii, or another type of file?)

# file file_name                  # (the command is really named "file")


-- Show free/used space of the filesystems:

# df -m                           # m in MB; k in KB

If there are many filesystems, you might want to see just the top 5 that are the lowest on free space:

# df -k |awk '{print $4,$7}' |grep -v "Filesystem" | sort -n | tail -5

-- How to become another user, or possibly root:

# su - accountname       # (switch to that accountname like "su - albert")
# su -                   # (switch to root)
                         # if the sudo utility is implemented, you might try the command "sudo -l" to see what you might execute.

-- Carefull!! How to kill a process "the hard way"?

# kill -9 PID              # carefull, don't kill the wrong one; not recommended unless you don't have a choice.

-- Carefull!! How to kill all your processes "the hard way", all at once?

# kill -9 -1               # very carefull; not recommended unless you don't have a choice.
# killall                  # implemented on some distros. very carefull; not recommended unless you don't have a choice.

-- Show your uid (userid) and gid (groupid):

# id

-- refreshing (restarting) inetd after modifying "/etc/inetd.conf"

# service xinetd restart	    # depending on the distro, like RedHat					
# /etc/init.d/inetd restart	

-- To show the init runlevel:

# who -r 

-- Show uptime of system plus average load (15 minutes)

# uptime

-- Show the last logged on users: account name & pts & date (history since last restart)

# last | more


-- AIX:
=======

-- Show your AIX version:

# oslevel -r

-- Show the jobs that are scheduled (in the account you use) from cron:

# crontab -l

-- What are the standard mounted filesystems?: That's defined in "/etc/filesystems"

# cat /etc/filesystems | more

-- Which processes are using a certain filesystem?

# fuser -c /filesystem     # We mean the "mountpoint", like for example /appl/oracle

-- Show memory usage of a process:

# procmap pid              # pid is the process-id, as visible in the command "ps -ef"   

-- Show the open files that a process uses:

# pfiles pid               # also take a look at the "lsof" command: man lsof            

-- Show system memory:

# bootinfo -r
# lsattr -E -l mem0
# lsattr -E -l sys0 -a realmem
# svmon -G
# vmstat -v
# vmo -L                # ( lots of output )
# svmon -U -g -t 10     # ( top 10 users paging space)

-- Swap usage:

# lsps -s                 # more than 60%-70% used? It get's really scary. More than 75% used? Oh boy!
# pstat -s

-- cpu info:

# lparstat (-i)       
# prtconf | grep proc
# pmcycles -m
# lscfg | grep proc
# pstat -S

-- ulimit:

Sometimes, when a process runs under some ones credentials, and it fails for no immediate reason, it might be
worth checking the "ulimit" of that account (like max filesize, max open files, number of files etc..)
use it under that account as:

# ulimit -a

-- Show process tree of parent and children:

# proctree pid        # Tip: take a look at the "proc tools" on AIX               


-- Show the system error report / error log:

# errpt                           # or "errpt | more" 
# errpt -aj <ERRID> | more        # view details of an error record. ERRID is the 1st identifier in such a record.

-- Determine the type of a file (e.g. is it ascii, or another type of file?)

# file file_name          # (yes..., the command is really "file")

-- Show free/used space of the filesystems:

# df -m         # m in MB; k in KB; g in GB

If there are many filesystems, you might want to see just the top 5 that have the lowest on free space:

# df -k |awk '{print $4,$7}' |grep -v "Filesystem" | sort -n | tail -5


-- How to become another user, or possibly root:

# su - accountname       # (switch to that accountname like "su - albert")
# su -                   # (switch to root)
                         # if the sudo utility is implemented, you might try the command "sudo -l" to see what you might execute.

-- Carefull!! How to kill a process "the hard way"?

# kill -9 PID              # carefull, don't kill the wrong one; not recommended unless you don't have a choice.

-- Carefull!! How to kill all your processes "the hard way", all at once?

# kill -9 -1               # be very carefull; not recommended unless you don't have a choice.
# killall                  # be very carefull; not recommended unless you don't have a choice.


-- Show your uid (userid) and gid (groupid):

# id

-- refresh inetd after modifying "/etc/inetd.conf":

# refresh -s inetd

-- Show the last logged on users + date (history since last restart):

# last | more

-- To show the init runlevel:

# who -r 

-- Show uptime of system plus average load (15 minutes):

# uptime

-- Clean memory with ipcrm (be carefull):

# ipcrm -m 50855977      # (clear memory segment, identfied by example id 50855977; Be carefull)
# ipcrm -s 2228248       # (remove semaphore, identfied by example id 2228248; Be carefull) 
# ipcrm -q 5111883       # (remove queue, identfied by example id 5111883; Be carefull) )
                         # (see man pages ipcrm)

-- To clear out unused system modules (currently unused modules in kernel and library memory):

# slibclean


##############################################################

SECTION 14: Various errors:

##############################################################


------
Note:
------

JVM problems and AIX Environment Variables in relation to Java:
===============================================================

Default Behavior of Java on AIX

This section describes the settings as they are right now. These settings may, and in most cases will, 
change over time. The README or SDK Guide accompanying the SDK are always the most up-to-date references 
for such settings.

Java uses the following environment settings:

AIXTHREAD_SCOPE=S 
This setting is used to ensure that each Java thread maps 1x1 to a kernel thread. The advantage of this approach 
is seen in several places; a notable example is how Java exploits Dynamic Logical Partitioning (DLPAR); 
when a new CPU is added to the partition, a Java thread can be scheduled on it. This setting should not be 
changed under normal circumstances. 

AIXTHREAD_COND_DEBUG, AIXTHREAD_MUTEX_DEBUG and AIXTHREAD_RWLOCK_DEBUG 
These flags are used for kernel debugging purposes. These may sometimes be set to OFF. If not, switching 
them off can provide a good performance boost.

LDR_CNTRL=MAXDATA=0x80000000 
This is the default setting on Java 1.3.1, and controls how large the Java heap can be allowed to grow. 
Java 1.4 decides the LDR_CNTRL setting based on requested heap. See Getting more memory in AIX for your 
Java applications for details on how to manipulate this variable.

JAVA_COMPILER 
This decides what the Just-In-Time compiler will be. The default is jitc, which points to the IBM JIT compiler. 
It can be changed to jitcg for the debug version of JIT compiler, or to NONE for switching the JIT compiler off 
(which in most cases is the absolute worst thing you can do for performance).

IBM_MIXED_MODE_THRESHOLD 
This decides the number of invocations after which the JVM JIT-compiles a method. This setting varies 
by platform and version; for example, it is 600 for Java 1.3.1 on AIX. 


Note 1:
-------

About o_maxdata and LDR_CNTRL:

... space for the native heap. Moving the fence down allows the native heap to grow, while reducing shared memory. 
For a setting of o_maxdata = N, the fence is placed at 0x30000000+N. For several good reasons, 
it is recommended to set o_maxdata to a value that is the start of a particular segment, 
such as 0xn0000000. In this case, the fence sits between segments 2+n and 3+n, which translates 
to n segments for the native heap, and 10-n segments for shared memory.

o_maxdata=8: 8 seg for native, 2 seg for shared
o_maxdata=7: 7 seg for native, 3 seg for shared
o_maxdata=6: 6 seg for native, 4 seg for shared
o_maxdata=5: 5 seg for native, 5 seg for shared
o_maxdata=4: 4 seg for native, 6 seg for shared
o_maxdata=3: 3 seg for native, 7 seg for shared *
o_maxdata=2: 2 seg for native, 8 seg for shared


By default, o_maxdata is set to 0x80000000, leaving 2 GB for native heap and 512 MB for shared memory. 
If you attempt to allocate a Java heap larger than 1 GB, it fails because Java tries to use shared memory 
for heap, and there is only 512 MB of shared memory available. If you set IBM_JAVA_MMAP_JAVA_HEAP 
in the environment and try to allocate a heap larger than 512 MB, JVM will be unable to allocate the heap. 
The solution is to adjust o_maxdata in such a way that the size of shared memory grows large enough 
to accommodate the Java heap. The next section shows you how to do this. 


So how do you go to a larger Java heap? You need to change o_maxdata to increase the amount of 
shared memory address space. You can use the following calculations to come up with the appropriate value 
for o_maxdata. Supposing you need a maximum heap size of J bytes, you would invoke Java as 

java -mxJ <other arguments> 

If J is less than 1 GB, and IBM_JAVA_MMAP_JAVA_HEAP is not set, the default setup will suffice. 
If J is > 1 GB, or if IBM_JAVA_MMAP_JAVA_HEAP is set, use o_maxdata = 0xn0000000 

where  n = (10 - ceil(J/256M)) or 8 

whichever is smaller. The function ceil rounds up the argument to the next integer. 

For example, if you need to allocate 1500 MB of heap, we have 

n = (10 - ceil(1500M/256M)) = (10 - 6) = 4. If you set o_maxdata = 0x40000000, 

you will be able to allocate the needed size of heap. To change o_maxdata, set the following 
environment variable: LDR_CNTRL=MAXDATA=<new o_maxdata value> 

The above example would set the following environment variable: LDR_CNTRL=MAXDATA=0x40000000
 

To verify that your calculation is accurate, you can try the following commands: 
$ export LDR_CNTRL=MAXDATA=0x40000000 
$ java -mx1500m -version
 
Setting the IBM_JAVA_MMAP_JAVA_HEAP variable

# export IBM_JAVA_MMAP_JAVA_HEAP=true


So, if you need to enhance memory for Websphere 5.x 32 bits, put the following lines
into the startServer.sh script, or in /prj/was/omgeving.rc:

export LDR_CNTRL=MAXDATA=0xn0000000
export IBM_JAVA_MMAP_JAVA_HEAP=true

try:

export AIXTHREAD_SCOPE=S
export AIXTHREAD_MUTEX_DEBUG=OFF
export AIXTHREAD_RWLOCK_DEBUG=OFF
export AIXTHREAD_COND_DEBUG=OFF
export LDR_CNTRL=MAXDATA=0x40000000 
export IBM_JAVA_MMAP_JAVA_HEAP=TRUE

or

export IBM_JAVA_MMAP_JAVA_HEAP=true
export LDR_CNTRL=MAXDATA=0x80000000

or

export IBM_JAVA_MMAP_JAVA_HEAP=true
export LDR_CNTRL=MAXDATA=0x80000000 


------
Note:
------


Hi

I need help because, i have rp7410 with two npar, (first npar only 11.23)(second npar (vpar1 an vpar2, but on vpar1 
i have ignite server)a also i have assign DVD drive to second npar,now i have reinstall first npar and i 
looking information how can i reinstall my first npar from Ignite server,if it is possible and how 
can I do it because when i try do it i have information:
//Main Menu: Enter command or menu > bo lan 9.156.xx.yy INSTALL

BCH Directed Boot Path: 0/0/8/0/0/4/0.


Do you wish to stop at the ISL prompt prior to booting? (y/n) >> y

Initializing boot Device.


Boot IO Dependent Code (IODC) Revision 4


IODC ENTRY_INIT failed. Error Status: -4

The IODC for this boot device was unable to provide text describing the failure.
IODC ENTRY_INIT[Return Messages] failed. Error Status: -2

0x0000 0000000000000000 0000000000000000 0000000000000000 0000000000000000
0x0004 0000000000000000 0000000000000000 0000000000000000 0000000000000000
//

Today and also instaled vpar (vparboot -p vpar1n1 -I 9.156.xx,yy,/opt/ignite/boot/Rel_B.11.23/WINSTALL) 
from Ignite (golden_image) and i havn,t any problem


Please help me,

Slawek  
 
Note: If you are the author of this question and wish to assign points to any of the answers, 
please login first.For more information on assigning points ,click here  

 
Sort Answers By: Date or Points
 
 
Eric SAUBIGNAC   Feb 26, 2008 09:05:25 GMT  6 pts   

--------------------------------------------------------------------------------
Bonjour Slawek,

As usually with ignite you must check some basic things :

- are both npar on the same subnet
- on the vpar ignite server, how is configured the file /etc/opt/ignite/instl_boottab
- do you find those lines in /etc/inetd.conf :

tftp dgram udp wait root /usr/lbin/tftpd tftpd\
/opt/ignite\
/var/opt/ignite

instl_boots dgram udp wait root /opt/ignite/lbin/instl_bootd instl_bootd

Eric 
 
Slawek Ksiazek  Feb 26, 2008 09:23:09 GMT    N/A: Question Author   

--------------------------------------------------------------------------------
Thanks for answer
If I boot from BCH how can I check lan settings ?Thanks

Slawek 
 
Daniel Parkes   Feb 26, 2008 10:30:01 GMT  7 pts   

--------------------------------------------------------------------------------
If you don't have any system installed on that npar you will have to boot from dvd and then check your parameters.

If you have so installed on that npar, boot it up and you can check out network, after testing the network 
if you have the 1st npar on a different subnet,try the bootsys command from the ignite server,
check it out:

man bootsys
bootsys reboot and install clients using Ignite 


------
Note:
------


Hi,

I am using server rx7620 [itanium]for nPar and vPar creation.

For deploying image on vPars from ignite server, I give an entry in bootptab for network configuration. 
Is there any way I can specify the image to be used from ignite, what hostname to be given, how 
the file system should be in advance so that I need not intervene the process of booting after I issue 
a "vparboot -p vparname -I" from an existing vPar. 

Is there any way to specify all the above information already somewhere in Ignite server from where 
it can pick it up automatically, something like "AUTO" file for default OS image.

Any help will be appreciated.

Thanks in advance.  
 
Note: If you are the author of this question and wish to assign points to any of the answers, 
please login first.For more information on assigning points ,click here  

 
Sort Answers By: Date or Points
 
 
Steven E. Protter    Aug 7, 2008 18:31:48 GMT    Unassigned   

--------------------------------------------------------------------------------
Shalom,

Standard Ignite rules apply.

1) The vpar needs network a disk and must be on the same subnet as the ignite server or connect through a boot helper.

2)I suggest booting the vpar via the console and then issuing the standard Ignite client command.

boot lan.192.168.10.20 install

SEP 
 
Torsten.    Aug 7, 2008 19:02:41 GMT    Unassigned   

--------------------------------------------------------------------------------
Correction for SEP's step 2:

# vparboot -p target_partition -I

You can either use the client or server interface to configure the values or modify the server config files - see

http://docs.hp.com/en/IUX 
 
willsfrazer  Sep 8, 2008 11:06:33 GMT    N/A: Question Author   

--------------------------------------------------------------------------------
For me it's kind of a first time deployment of OS on subsequent vpar so i do not have any OE on target vPar. 
It's just a kind of bare metal from subsequent vPar's point of view. My problem is how do I drive expect interface 
of Ignite UI as using those tabs and delete and back spaces for file system configuration are not getting through using expect. 
Any way to specify all that already and get Ignite read it to proceed without requiring any input at UI and target being 
without any OE prior to this installation which means I have a vpar2 to be deployed with HP UX which is just a set 
of some hardware resources without having any OE and all what I have is an up and running vPar1 on the 
same nPar as vPar2 is + ignite server and access to MP console. Now how do I automate it [Ignite UI is not being 
driven for file system configuration] I can still use expect for root user specification and keep the hostname 
modification for post installation. But how abt defalut os image to be picked up from Ignite and file system specification.

Any suggestions... 


-------
Note:
-------

0301-150 bosboot: Invalid or no boot device specified!
--------------------------------------------------------------


== Technote:

APAR status
Closed as program error.

Error description 

On a system, that does not have tape support
installed, running mkszfile will show the
following error:
0301-150 bosboot: Invalid or no boot device
specified.

Local fix 
Install device support for scsi tape devices.

Problem summary 
Error message when creating backup if devices.scsi.tape.rte
not installed even if the system does not have a tape drive.

Problem conclusion 
Redirect message to /dev/null.

Temporary fix 
Ignore message.

Comments 
APAR information 
APAR number IY52551 IY95261
Reported component name AIX 5L POWER V5 
Reported component ID 5765E6200 
Reported release 520 
Status CLOSED PER 
PE NoPE 
HIPER NoHIPER 
Submitted date 2004-01-12 
Closed date 2004-01-12 
Last modified date 2004-02-27 


== Technote:

APAR status
Closed as program error.

Error description 
If /dev/ipldevice is missing, mksfile will show the
bosboot usage statement.

  0301-150 bosboot: Invalid or no boot device
           specified!
Local fix 
Problem summary 
If /dev/ipldevice is missing, mksfile will show the
bosboot usage statement.

  0301-150 bosboot: Invalid or no boot device
           specified!

Problem conclusion 
Do not run bosboot against /dev/ipldevice.

Temporary fix 
Comments 

APAR information 
APAR number IY95261 
Reported component name AIX 5.3 
Reported component ID 5765G0300 
Reported release 530 
Status CLOSED PER 
PE NoPE 
HIPER NoHIPER 
Submitted date 2007-02-22 
Closed date 2007-02-22 
Last modified date 2007-06-06 

APAR is sysrouted FROM one or more of the following:

APAR is sysrouted TO one or more of the following:

Publications Referenced


Fix information 
Fixed component name AIX 5.3 
Fixed component ID 5765G0300 


== thread:

Q:

> 
> Someone out there knows the fix for this one; if you get a moment, would you 
> mind giving me the fix? 
> 
> 
> # mksysb -i /dev/rmt0 
> 
> /dev/ipldevice not found 
> 

A:

The ipldevice file is probably deleted from your /dev directory, or 
point to wrong 
entry. The '/dev/ipldevice' file is (re)created in boot time 2nd 
phase. For additional 
information look into /sbin/rc.boot script... The ipldevice entry 
type is hardlink. Usually point to /dev/rhdiskN, assuming that boot 
device is hdiskN. 
Check your system and you should got similar ... 
find /dev -links 2 -ls 
.... 
8305 0 crw------- 2 root system 14, 1 Feb 20 2005 /dev/rhdisk0 
8305 0 crw------- 2 root system 14, 1 Feb 20 2005 /dev/ipldevice 
... 
(The first cloumn of the output is the inode number) 

So, you can recreate the wrong, or missing ipdevice file. 
'bootinfo -b' says the physical boot device name. 
For exapmle: 
ln -f /dev/rhdisk0 /dev/ipldevice 

I hope this will solve your bosboot problem. 


Q:

I was installing Atape driver and noticed bosboot failure when installp 
calls bosboot with /dev/ipldevice. Messages below: 

0503-409 installp: bosboot verification starting... 
0503-497 installp: An error occurred during bosboot verification 
processing. 

Inspection of /dev showed no ipldevice file 

I was able to easily recreate the /dev/ipldevice using 

ln /dev/rhdisk0 /dev/ipldevice 

then successfully install the Atape driver software. 

After reboot /dev/ipldevice is missing again???. 

Environment is p5 520 AIX 5.3 ML1 
mirrored internal drives hdisk0 and hdisk1 in rootvg 

I have 5.3 ML2 (but have not applied yet) 
I don't see any APAR's in ML2 regarding /dev/ipldevice problems.

A:

Are you using EMC disk? There is a known problem with the later 
Powerpath versions where the powerpath startup script removes the 
/dev/ipldevice file if there is more than one device listed in the 
bootlist. 

A:

Yes, running EMC PowerPath 4.3 for AIX, with EMC Clariion CX600 Fibre 
disks attached to SAN. I always boot from, and mirror the OS on IBM 
internal disks. We order 4 internal IBM drives. Two for primary OS and 
mirror, the other two for alt_disk and mirrors. 

Thanks for the tip. I will investigate at EMC Powerlink site for fix. I 
know PowerPath 4.4 for AIX is out, but still pretty new.


A:

ipldevice is a link to the rawdevice (rhdisk0 , not hdisk0) 


-----Original Message----- 
From: IBM AIX Discussion List [mailto:aix-l@Princeton.EDU] On Behalf Of 
Robert Miller 
Sent: Wednesday, April 07, 2004 6:13 PM 
To: aix-l@Princeton.EDU 
Subject: Re: 64 Bit Kernel 


It may be one of those odd IBMisms where they want to call something a 
certain name so they put it in as a link to the actual critter... 

Looking on my box, the /dev/ipldevice has the same device major and 
minor numbers as hdisk0 - tho it is interesting that ipldevice is a 
character device, where a drive is usually a block device: 


mybox:rmiller$ ls -l /dev/ipl* 
crw------- 2 root system 23, 0 Jan 15 2002 /dev/ipldevice 
mybox:rmiller$ ls -l /dev/hdisk0 
brw------- 1 root system 23, 0 Sep 13 2002 /dev/hdisk0 


A:

> Hi, 

> AIX 5.3 
> I have a machine where /dev/ipldevice doesn't exit 
> I can reboot it safely ? 
> How I can I re-create it ? 

> Thanks in advance 

I did this today, and there is probably a more accepted way. 
I made a hard link from my rhdiskX device to /dev/ipldevice. 

If your boot device is /dev/hdisk0, then the command line would be as 
follows: 

ln /dev/rhdisk0 /dev/ipldevice 

Again, there is probably a more acceptable way to achieve this, but it 
worked for me. 


== thread:

how to recover from an invalid or no boot device error in AIX 
Description

When running the command "bosboot -ad /dev/ipldevice" in IBM AIX, you get the following error:

0301-150 bosboot: Invalid or no boot device specified!

A device specified with the bosboot -d command is not valid. The bosboot command was unable to finish processing 
because it could not locate the required boot device. The installp command calls the bosboot command 
with /dev/ipldevice. If this error does occur, it is probably because /dev/ipldevice does not exist. 
/dev/ipldevice is a link to the boot disk. 

To determine if the link to the boot device is missing or incorrect :

1) Verify the link exists:

# ls -l /dev/ipldevice
ls: 0653-341 The file /dev/ipldevice does not exist.

2) In this case, it does not exist. To identify the boot disk, enter "lslv -m hd5". The boot disk name displays. 

# lslv -m hd5
hd5:N/A
LP PP1 PV1 PP2 PV2 PP3 PV3
0001 0001 hdisk4 0001 hdisk1 

In this example the boot disk name is hdisk4 and hdisk1.

3) Create a link between the boot device indicated and the /dev/ipldevice file. Enter: 

# ln /dev/boot_device_name /dev/ipldevice
(An example of boot_device_name is rhdisk0.)

In my case, I ran:

# ln /dev/rhdisk4 /dev/ipldevice

4) Now run the bosboot command again:

# bosboot -ad /dev/ipldevice 
Example

lslv -m hd5; ln /dev/rhdisk4 /dev/ipldevice; bosboot -ad /dev/ipldevice 

Other mksysb errors on AIX 5.3:
---------------------------------------

It turns out, that on AIX 5.3, on certain ML/TL levels (below TL 6), an mksysb error turns up,
if you have other volume groups defined other than rootvg, while there is NO filesystem created on
those Volume groups.

Solution: create a filesystem, even only a "test" or "dummy" filesystem, on those VG's.


>> thread 1:

Q:

Hi 

can't find any information about "backup structure of volume group, vios". included service: 
"savevgstruct vgname" working with errors: 
# lsvg 
rootvg 
vg_dev 
datavg_dbs 
# /usr/ios/cli/ioscli savevgstruct vg_dev 

Creating information file for volume group vg_dev.. 

Some error messages may contain invalid information 
for the Virtual I/O Server environment. 

cat: 0652-050 Cannot open /tmp/vgdata/vg_dev/fs_data_tmp. 

# ls -al /tmp/vgdata/vg_dev/ 
total 16 
drwxr-xr-x 2 root staff 256 Apr 02 08:38 . 
drwxrwxr-x 5 root system 256 Apr 02 08:20 .. 
-rw-r--r-- 1 root staff 2002 Apr 02 08:35 filesystems 
-rw-r--r-- 1 root staff 1537 Apr 02 08:35 vg_dev.data 
# oslevel -r 
5300-05 
# df -k | grep tmp 
/dev/hd3 1310720 1309000 1% 42 1% /tmp 


A:

I had this issue as well with VIO 1.3. I called IBM support 
about it and it is a known issue. The APAR is IY87935. The fix 
will not be released until AIX 5.3 TL 6, which is due out in 
June. It occurs when you run savevgstruct on a user defined 
volume group that contains volumes where at least one does not 
have a filesystem defined on it. The workaround is to define a 
filesystem on every volume in the user defined volume group.


>> thread 2:

IBM APAR Note:

http://www-1.ibm.com/support/docview.wss?uid=isg1IY87935

IY87935: MKVGDATA/SAVEVG CAN FAIL


APAR status
Closed as program error.

Error description 
The mkvgdata command when executed on a volume group that does
not have any mounted filesystems:

  # savevg -f /home/vgbackup -i vg00

  Creating information file for volume group vg00..cat:
  0652-050 Cannot open /tmp/vgdata/vg00/fs_data_tmp.

  /usr/bin/savevg 33 :  BACKUPSHRINKSIZE = 16 + FSSHRINKSIZE :
  0403-009 The specified number is not valid for this command.

Local fix 

Problem summary 
The mkvgdata command when executed on a volume group that does
not have any mounted filesystems:

  # savevg -f /home/vgbackup -i vg00

  Creating information file for volume group vg00..cat:
  0652-050 Cannot open /tmp/vgdata/vg00/fs_data_tmp.

  /usr/bin/savevg 33 :  BACKUPSHRINKSIZE = 16 + FSSHRINKSIZE :
  0403-009 The specified number is not valid for this command.

Problem conclusion 
Check variable.

Temporary fix 

Comments 

APAR information 
APAR number IY87935 
Reported component name AIX 5.3 
Reported component ID 5765G0300 
Reported release 530 
Status CLOSED PER 
PE NoPE 
HIPER NoHIPER 
Submitted date 2006-08-09 
Closed date 2006-08-09 


-------
Note:
-------


Recovery of the root filesystem on Solaris:
===========================================


Restoring the root (/) File System

-- To restore the / (root) file system, boot from the Solaris CD-ROM and then run ufsrestore.

If / (root), /usr, or the /var file system is unusable because of some type of corruption the system will not boot.

The following procedure demonstrates how to restore the / (root) file system which is assumed to be on boot disk c0t0d0s0.

1. Insert the Solaris 8 Software CD 1, and boot the CD-ROM with the single-user mode option. 

ok boot cdrom -s

2. Create the new file system structure.

# newfs /dev/rdsk/c0t0d0s0

3. Mount the file system to an empty mount point directory, /a and change to that directory.

# mount /dev/dsk/c0t0d0s0 /a
# cd /a

4. Restore the / (root) file system from its backup tape.

# ufsrestore rf /dev/rmt/0

Note - Remember to always restore a file system starting with the level 0 backup tape and continuing with the next lowest level 
tape up through the highest level tape.

5. Remove the restoresymtable file.

# rm restoresymtable

6. Install the bootblk in sectors 1-15 of the boot disk. Change to the directory containing the bootblk, and run the installboot command.

# cd /usr/platform/`uname -m`/lib/fs/ufs
# installboot bootblk /dev/rdsk/c0t0d0s0 


7. Unmount the new file system.

# cd /
# umount /a

8. Use the fsck command to check the restored file system.

# fsck /dev/rdsk/c0t0d0s0

9. Reboot the system.

# init 6

10. Perform a full backup of the file system. For example:

# ufsdump 0uf /dev/rmt/0 /dev/rdsk/c0t0d0s0

Note - Always back up the newly created file system, as ufsrestore repositions the files and changes the inode allocation. 

Restoring the /usr and /var File Systems 


-- To restore the /usr and /var file systems repeat the steps described above, except step 6. 
This step is required only when restoring the (/) root file system.

To restore a regular file system, (for example, /export/home, or /opt) back to disk, repeat the steps described above, except steps 1, 6, and 9.

Example

# newfs /dev/rdsk/c#t#d#s#
# mount /dev/dsk/c#t#d#s# /mnt
# cd /mnt
# ufsrestore rf /dev/rmt/#
# rm restoresymtable
# cd /
# umount /mnt
# fsck /dev/rdsk/c#t#d#s#
# ufsdump 0uf /dev/rmt/# /dev/rdsk/c#t#d#s#


-------
Note:
-------

thread 1:

Q:

Has anyone seen these errors before? We're running 6239 fc cards on a 
CX600. AIX level is 52-03 with the latest patches for devices.pci.df1000f7 
as well. 


I didn't know that these adapters still used devices.pci.df1000f7 as part 
of their device driver set, but aparently they do. We're mostly seeing 
ERR4s on bootup and occassionaly throughout the day. They're TEMP but 
should I be concerned about this? Any help would be greatly appreciated! 

LABEL: SC_DISK_ERR4 
IDENTIFIER: DCB47997 

A:

DISK_ERR_4 are simply bad-block relocation errors. They are quite normal. 
However, I heard that if you get more than 8 in an 8-hour period, you 
should get the disk replaced as it is showing signs of impending failure. 


thread 2:

Q:

> Has anyone corrected this issue? SC_DISK_ERR2 with EMC Powerpath = 
> filesets listed below? I am using a CX-500.=20 
> 


A:

 got those errors before using a CX700 and it turned out to be a 
firmware problem on the fibre adapter, model 6259. EMC recommended the 
92X1 firmware and to find out IBM found problems with timeouts to the 
drives and recommended going back a level to 81X1. 

A:

We have the same problem as well. EMC say its a firmware error on the 
FC adapters

A:

This is how to fix these errors, downgrading firware is not recommended. 

Correcting SCSI_DISK_ERR2's in the AIX Errpt Log - Navisphere Failover 
Wizard 

1. In the Navisphere main screen, select tools and then click the 
Failover Setup Wizard. Click next to continue. 

2. From the drop-down list select the host server you wish to 
modify and click next 

3. Highlight the CX-500 and click next 

4. Under the specify settings box be sure to select 1 for the 
failover setting and disable for array commpath. Click next to process. 
5. The next screen is the opportunity to review your selections 
(host, failover mode and array commpath); click next to commit 
6. The following screen displays a warning message to alert you are 
committing these changes. Click yes to process. 

7. Next login to the AIX command prompt as root and perform the 
following commands to complete stopping the SCSI_DISK_ERR2. 
a. lsdev -Cc disk | grep LUNZ 

(Filter for disks with LUNZ in the description) 
b. rmdev -dl hdisk(#)'s 

(Note the disks and remove them from the ODM) 
c. errclear 0 
(Clear the AIX system error log) 
d. cfgmgr -v 
(Attempt to re-add the LUNZ disks) 
e. lsdev -Cc disk | grep LUNZ 
(Double check to make sure the LUNZ disk does not add itself back to the 
system after the cfgmgr command) 
f. errpt -a 

(Monitor the AIX error log to insure the SCSI_DISK_ERR2's are gone) 
Task Complete... 


E87EF1BE   0512150008 P O dumpcheck      The largest dump device is too small.
------------------------------------------------------------------------------


Problems with errpt:
--------------------

Invalid log, or other problems

thread 1:

Q:

Hello ...

the 'errpt' Command tells me:

0315-180 logread: UNEXPECTED EOF 0315-171 Unable to process the error log file
/var/adm/ras/errlog. 0315-132 The supplied error log is not valid:
/var/adm/ras/errlog.

# ls -l /var/adm/ras/errlog
-rw-r--r-- 1 root system 0 Jun 14 17:31 /var/adm/ras/errlog

How can I fix this problem?

A:

/usr/lib/errstop           # stop logging

rm /var/adm/ras/errlog     # get rid of that log.

/usr/lib/errdemon          # restart the daemon, creating a new error log.


Some err identifiers that can sometimes be hard to trace to their true sources:
===============================================================================

Take a look at those errpt entries:


--------------------------------------------------------------------------


ERRPT ENTRY 1:
--------------

LABEL:          CORE_DUMP 
IDENTIFIER:     C69F5C9B 

Date/Time:       Thu Jan 15 02:00:45 MET 2009 
Sequence Number: 999 
Machine Id:      00CC94EE4C00 
Node Id:         srv1 
Class:           S 
Type:            PERM 
Resource Name:   SYSPROC 

Description 
SOFTWARE PROGRAM ABNORMALLY TERMINATED 

Probable Causes 
SOFTWARE PROGRAM 

User Causes 
USER GENERATED SIGNAL 

        Recommended Actions 
        CORRECT THEN RETRY 

Failure Causes 
SOFTWARE PROGRAM 

        Recommended Actions 
        RERUN THE APPLICATION PROGRAM 
        IF PROBLEM PERSISTS THEN DO THE FOLLOWING 
        CONTACT APPROPRIATE SERVICE REPRESENTATIVE 

Detail Data 
SIGNAL NUMBER 
          11 
USER'S PROCESS ID: 
               1298680 
FILE SYSTEM SERIAL NUMBER 
          57 
INODE NUMBER 
       37134 
CORE FILE NAME 
/var/core/core.1298680.15010044 
PROGRAM NAME 
BS_sear 
STACK EXECUTION DISABLED 
           0 
COME FROM ADDRESS REGISTER 

PROCESSOR ID 
  hw_fru_id: 1 
  hw_cpu_id: 9 

ADDITIONAL INFORMATION 
?? 
?? 
Unable to generate symptom string. 


  (or as another example of the last lines, where you can see the "program name")

  PROGRAM NAME 
  opmn 
  STACK EXECUTION DISABLED 
           0 
  COME FROM ADDRESS REGISTER 

  PROCESSOR ID 
    hw_fru_id: 0 
    hw_cpu_id: 2 

  ADDITIONAL INFORMATION 
  strlen 0 
  pmStrdup 14 

  Symptom Data 
  REPORTABLE 
  1 
  INTERNAL ERROR   
  0 
  SYMPTOM CODE 
  PCSS/SPI2 FLDS/opmn SIG/11 FLDS/strlen VALU/0 FLDS/pmStrdup 
  

--------------------------------------------------------------------------

POSSIBLE EXPLANATION:
=====================

http://publib.boulder.ibm.com/infocenter/systems/index.jsp?topic=/com.ibm.aix.security/doc/security/stack_exec_disable.htm

AIXr has enabled the stack execution disable (SED) mechanism to disable the execution of code on a stack 
and select data areas of a process.

By disabling the execution and then terminating, an infringing program, the attacker is prevented 
from gaining root user privileges through a buffer overflow attack. While this feature does not stop 
buffer overflows, it provides protection by disabling the execution of attacks on buffers that have been overflowed.

Beginning with the POWER4T family of processors, you can use a page-level execution enable and/or disable feature 
for the memory. The AIX SED mechanism uses this underlying hardware support for implementing a 
no-execution feature on select memory areas. Once this feature is enabled, the operating system checks 
and flags various files during the executable programs. It then alerts the operating system memory manager 
and the process managers that the SED is enabled for the process being created. The select memory areas 
are marked for no-execution. If any execution occurs on these marked areas, the hardware raises 
an exception flag and the operating system stops the corresponding process. The exception and application 
termination details are captured through the AIX error log events.

SED is implemented mainly through the sedmgr command. The sedmgr command permits control 
of the systemwide SED mode of operation as well as setting the executable file based SED flags.

SED modes and monitoring
The stack execution disable (SED) mechanism in AIXr is implemented through systemwide mode flags, 
as well as individual executable file-based header flags.

While systemwide flags control the systemwide operation of the SED, file level flags indicate 
how files should be treated in SED. The buffer overflow protection (BOP) mechanism provides 
for four systemwide modes of operation:

-- off 
The SED mechanism is turned off and no process is marked for SED protection. 
--select 
Only a select set of files are enabled and monitored for SED protection. The select set of files 
are chosen by reviewing the SED related flags in the executable program binary headers. 
The executable program header enables SED related flags to request to be included in the select mode. 
-- setidfiles 
Permits you to enable SED, not only for the files requesting such a mechanism, but all the important 
setuid and setgid system files. In this mode, the operating system not only provides SED for the files 
with the request SED flag set, but also enables SED for the executable files with the following 
characteristics (except the files marked for exempt in their file headers):
 .SETUID files owned by root 
 .SETGID files with primary group as system or security 
-- all 
All executable programs loaded on the system are SED protected except for the files requesting 
an exemption from SED mode. Exemption related flags are part of the executable program headers. 
The SED feature on AIX also provides the ability to monitor instead of stopping the process when 
an exception happens. This systemwide control permits a system administrator to check for breakdowns 
and issues in the system environment by monitoring it before the SED is deployed in the production systems. 

The sedmgr command provides an option that permits you to enable SED to monitor files instead 
of stopping the processes when exceptions occur. The system administrator can evaluate whether 
an executable program is doing any legitimate stack execution. This setting works in conjunction 
with the systemwide mode set using the -c option. When the monitor mode is turned on, the system permits 
the process to continue operating even if an SED-related exception occurs. Instead of stopping the process, 
the operating system logs the exception in the AIX error log. If SED monitoring is off, 
the operating system stops any process that violates and raises an exception per SED facility.

Any changes to the SED mode systemwide flags requires that you restart the system for the changes 
to take effect. All of these types of events are audited.


--------------------------------------------------------------------------

ERRPT ENTRY 2:
--------------

LABEL:          SRC 
IDENTIFIER:     E18E984F 

Date/Time:       Fri Jan 16 09:31:33 MET 2009 
Sequence Number: 1513 
Machine Id:      00C503AC4C00 
Node Id:         heilbot 
Class:           S 
Type:            PERM 
Resource Name:   SRC 

Description 
SOFTWARE PROGRAM ERROR 

Probable Causes 
APPLICATION PROGRAM 

Failure Causes 
SOFTWARE PROGRAM 

        Recommended Actions 
        PERFORM PROBLEM RECOVERY PROCEDURES 

Detail Data 
SYMPTOM CODE 
           0 
SOFTWARE ERROR CODE 
       -9053 
ERROR CODE 
           2 
DETECTING MODULE 
'tellsrc.c'@line:'87' 
FAILING MODULE 

Duplicates 
Number of duplicates 
           3 
Time of first duplicate 
Fri Jan 16 09:31:18 MET 2009 
Time of last duplicate 
Fri Jan 16 09:31:33 MET 2009 


POSSIBLE EXPLANATIONS:
======================

In entry 2, we see the identifier E18E984F, and "SOFTWARE ERROR CODE -9053", and "Detecting module tellsrc.c@line:87".
tellsrc.c'@line:'87'


http://www-01.ibm.com/support/docview.wss?uid=isg1IZ03064

IZ03064: VARYONVG -C FAILS WITH "GSCHILD:CANNOT REGISTER WITH DRIVER APPLIES TO AIX 5300-07


APAR status
Closed as program error.

Error description 
"varyonvg -c" fails to varyon concurrent volume group and
reports the following error message:

tellclvmd: request failed rc = -9014 [UNKNOWN rc]
0516-1334 varyonvg: The command /usr/sbin/tellclvmd
   returned an error.


errpt logs following entry:

LABEL:          SRC
IDENTIFIER:     E18E984F
Class:           S
Type:            PERM
Resource Name:   SRC

Description
SOFTWARE PROGRAM ERROR

Probable Causes
APPLICATION PROGRAM

Failure Causes
SOFTWARE PROGRAM

        Recommended Actions
        PERFORM PROBLEM RECOVERY PROCEDURES

Detail Data
SYMPTOM CODE
           0
SOFTWARE ERROR CODE
       -9053
ERROR CODE
          74
DETECTING MODULE
'srcmstr.c'@line:'529'
FAILING MODULE
Local fix 
This problem occurs when multiple "varyonvg -nc"
commands are performed together. By serializing
these commands, this can be avoided.
Problem summary 
Multiple varyonvg -c processes will all create threads in
the gsclvmd daemon.  With certain timing, these threads can
interfere with eachothers global variables and possibly cause
varyonvg to fail.
Problem conclusion 
Privatize variables so mutliple vgs coming online can't
interfere with eachother.
Temporary fix 
Comments 
5200-10 - use AIX APAR IZ05735
5300-06 - use AIX APAR IZ02334
5300-07 - use AIX APAR IZ03064
APAR information 
APAR number IZ03064 
Reported component name AIX 5.3 
Reported component ID 5765G0300 
Reported release 530 
Status CLOSED PER 
PE NoPE 
HIPER NoHIPER 
Submitted date 2007-08-14 
Closed date 2007-09-04 
Last modified date 2007-12-06 

APAR is sysrouted FROM one or more of the following:

APAR is sysrouted TO one or more of the following:

Publications Referenced


Fix information 
Fixed component name AIX 5.3 
Fixed component ID 5765G0300 


error INTRPPC_ERR:
------------------

LABEL:          INTRPPC_ERR
IDENTIFIER:     853015D6

Date/Time:       Sun Mar 22 00:27:49 MET 2009
Sequence Number: 1515
Machine Id:      00C503AC4C00
Node Id:         starboss
Class:           H
Type:            UNKN
Resource Name:   sysplanar0
Resource Class:  planar
Resource Type:   sysplanar_rspc
Location:

Description
UNDETERMINED ERROR

Probable Causes
SYSTEM I/O BUS
SOFTWARE PROGRAM
ADAPTER
DEVICE

        Recommended Actions
        PERFORM PROBLEM DETERMINATION PROCEDURES

Detail Data
BUS NUMBER
9001 00C0
INTERRUPT LEVEL
0009 0001
Number of Occurrences
0000 0001


Possible explanations:
----------------------

thread 1:


IY58847: INTRPPC_ERR ERRORS IN ERROR LOGS
  

 A fix is available 
Download fix packs
 

APAR status
Closed as program error.

Error description 
INTRPPC_ERR errors were observed in the error log while
customer ran a testcase as mentioned in the defect.
Local fix 
Problem summary 
INTRPPC_ERR errors were observed in the error log while
customer ran a testcase, which brings up and down the phxentdd
interface in a infinite loop. A ping is executed using the ip
address associated with this interface.
Problem conclusion 
A simple code change to ignore the interrupts
while driver is in closing state.
Temporary fix 
Comments 
APAR information 
APAR number IY58847 
Reported component name AIX 5L FOR POWE 
Reported component ID 5765E6100 
Reported release 510 
Status CLOSED PER 
PE NoPE 
HIPER NoHIPER 
Submitted date 2004-07-13 
Closed date 2004-07-13 
Last modified date 2004-10-29 

APAR is sysrouted FROM one or more of the following:

APAR is sysrouted TO one or more of the following:

Publications Referenced


Fix information 
Fixed component name AIX 5L FOR POWE 
Fixed component ID 5765E6100 

Applicable component levels 
R510 PSY U477721    UP04/10/29 I 1000 
 

thread 2:


> > I've recently started getting INTRPCC_ERR's on an old (but important!)
> > aix 4.3 box. They dont seem to correspond to anything else and the
> > box seems to be working normally. I found a way to lookup the BUS
> > NUMBER via odmget -q value= CuAt, but that didn't return anything
> > for me. Also looking for the interrupt number via lsresource didn't
> > give any matches either. And diag/Advanced Diagnostics/Problem
> > Determination didn't find any trouble.
>
> > Any other suggestions on how to track this down?
>
> > Thanks,
>
> > LABEL: INTRPPC_ERR
> > IDENTIFIER: DADF69E4
>
> > Date/Time: Wed Jul 11 08:51:41
> > Sequence Number: 735309
> > Machine Id: 000247824C00
> > Node Id: scully
> > Class: H
> > Type: UNKN
> > Resource Name: SYSINTR
> > Resource Class: NONE
> > Resource Type: NONE
> > Location: NONE
>
> > Description
> > UNDETERMINED ERROR
>
> > Probable Causes
> > SYSTEM I/O BUS
> > SOFTWARE PROGRAM
> > ADAPTER
> > DEVICE
>
> > Recommended Actions
> > PERFORM PROBLEM DETERMINATION PROCEDURES
>
> > Detail Data
> > BUS NUMBER
> > 0000 00C0
> > INTERRUPT LEVEL
> > 0000 0005
>
> convert the bus number from hex, then look for that value in `ls -l /
> dev`


.... but it is more than likely a device driver issue rather than the
device itself.


thread 3:


Here's how to map the error information to a specific adapter. Let's
do that first.


>Detail Data
>BUS NUMBER
>0000 00C0
>INTERRUPT LEVEL
>0000 0005

Example:

Detail Data
BUS NUMBER
0000 00C0
INTERRUPT LEVEL
0000 0003

lsresource -al pci0 | grep 0x000000C0
--> O pci0 0x8d5c_5 0x000000c0 - 0x000000df

lsresource -al pci0 | grep 3 | grep bus_intr_lvl
--> N sa1 bus_intr_lvl 3


Note: lsresource command example:
---------------------------------


selalbe@starboss:/home/beab_krn/selalbe $ lsresource -al pci0
TYPE DEVICE ATTRIBUTE S G CURRENT
B    pci0   0xda40_1      0x0000000080080000 - 0x00000000800bffff
B    pci0   0xdfa8_1      0x0000000080000000 - 0x000000008003ffff
B    ent0   busmem        0x0000000080120000 - 0x000000008013ffff
B    ent0   rom_mem       0x00000000800c0000 - 0x00000000800fffff
B    ent1   busmem        0x0000000080100000 - 0x000000008011ffff
B    ent1   rom_mem       0x0000000080040000 - 0x000000008007ffff
O    pci0   0xda40_0      0x00000000000df800 - 0x00000000000df83f
O    pci0   0xdfa8_0      0x00000000000dfc00 - 0x00000000000dfc3f
I    ent0   busintr                249    (A1)
I    ent1   busintr                250    (A1)


-------
Note:
-------

diag command:
-------------

Whenever a hardware problem occurs in AIX, use the diag command to diagnose the problem.

The diag command is the starting point to run a wide choice of tasks and service aids. 
Most of the tasks/service aids are platform specific. 

To run diagnostics on the scdisk0 device, without questions, enter:

# diag -d scdisk0 -c


-------
Note:
-------


System dumps:
-------------

A system dump is created when the system has an unexpected system halt or system failure.
In AIX 5L the default dump device is /dev/hd6, which is also the default paging device.
You can use the sysdumpdev command to manage system crash dumps.

The sysdumpdev command changes the primary or secondary dump device designation in a system that is running. 
The primary and secondary dump devices are designated in a system configuration object. 
The new device designations are in effect until the sysdumpdev command is run again, or the system is restarted.

If no flags are used with the sysdumpdev command, the dump devices defined in the SWservAt 
ODM object class are used. The default primary dump device is /dev/hd6. The default secondary dump device is 
/dev/sysdumpnull.


Examples
To display current dump device settings, enter: 
sysdumpdev  -l

To designate logical volume hd7 as the primary dump device, enter: 
sysdumpdev  -p /dev/hd7

To designate tape device rmt0 as the secondary dump device, enter: 
sysdumpdev  -s /dev/rmt0

To display information from the previous dump invocation, enter: 
sysdumpdev  -L

To permanently change the database object for the primary dump device to /dev/newdisk1, enter: 
sysdumpdev  -P  -p /dev/newdisk1

To determine if a new system dump exists, enter: 
sysdumpdev  -z

If a system dump has occurred recently, output similar to the following will appear: 

4537344 /dev/hd7
To designate remote dump file /var/adm/ras/systemdump on host mercury for a primary dump device, enter: 
sysdumpdev  -p mercury:/var/adm/ras/systemdump

A : (colon) must be inserted between the host name and the file name. 
To specify the directory that a dump is copied to after a system crash, if the dump device is /dev/hd6, enter: 
sysdumpdev  -d /tmp/dump

This attempts to copy the dump from /dev/hd6 to /tmp/dump after a system crash. If there is an error during the copy, 
the system continues to boot and the dump is lost. 
To specify the directory that a dump is copied to after a system crash, if the dump device is /dev/hd6, enter: 
sysdumpdev  -D /tmp/dump

This attempts to copy the dump from /dev/hd6 to the /tmp/dump directory after a crash. If the copy fails, 
you are prompted with a menu that allows you to copy the dump manually to some external media.


Starting a system dump:
-----------------------

If you have the Software Service Aids Package installed, you have access to the sysdumpstart command.
You can start the system dump by entering:
# sysdumpstart -p

You can also use:
# smit dump

Notes regarding system dumps:
-----------------------------


The_Nail <tomapam@gmail.com> wrote: 
> I handle several AIX 5.1 servers and some of them warns me (via errpt) 
> about a lack of disk space for the dumpcheck ressource. 
> Here is a copy of the message : 

> 
> Description 
> The copy directory is too small. 
> 
> Recommended Actions 
> Increase the size of that file system. 
> 
> Detail Data 
> File system name 
> /var/adm/ras 
> 
> Current free space in kb 
> 7636 
> Current estimated dump size in kb 
> 207872 


> I guess /dev/hd6 is not big enough to contain a system dump. So how 
> can i change that? 


The error message tells you something else. 
Read it, and you will understand! 


> How can i configure a secondary susdump space in case the primary 
> would be unavailable? 


sysdumpdev -s /dev/whatever 


> What does "copy directory /var/adm/ras" mean? 


That's where the crash dump will be put when you reboot after the crash. 
/dev/hd6 will be needed for other purposes (paging space), so you cannot 
keep your system dump there. 


And that file system is too small to contain the dump, that's the meaning 
of the error message. 


You have two options: 


- increase the /var file system (it should have ample free space anyway). 
- change the dump directory to something where you have more space: 
  sysdumpdev -D /something/in/rootvg/with/free/space 


Yours, 
Laurenz Albe 


Suppose you find the following error:

$ errpt
IDENTIFIER TIMESTAMP  T C RESOURCE_NAME  DESCRIPTION
F89FB899   0822150005 P O dumpcheck      The copy directory is too small

This message is the result of a dump device check. You can fix this by 
increasing the size of your dump device. If you are using the default 
dump device (/dev/hd6) then increase your paging size or go to smit dump 
and "select System Dump Compression". Myself, I don't like to use the 
default dump device so I create a sysdumplv and make sure I have enough 
space. To check space needed go to smit dump and select "Show Estimated 
Dump Size" this will give you an idea about the size needed.

The copy directory is whatever sysdumpdev says it is.
Run sysdumpdev and you will get something like
#sysdumpdev
primary              /dev/hd6
secondary            /dev/sysdumpnull
copy directory       /var/adm/ras
forced copy flag     TRUE
always allow dump    FALSE
dump compression     ON

# sysdumpdev -e
0453-041 Estimated dump size in bytes: 57881395
Divide this number by 1024.  This is the free space that is needed in 
your copy directory.  Compare it to a df -k or divide this number by 
512.  This is the free space that is needed in your copy directory.  
Compare it to a df


Suppose you find the following error:

selalbe@wijting:/home/beab_krn/selalbe $ errpt
IDENTIFIER TIMESTAMP  T C RESOURCE_NAME  DESCRIPTION
E87EF1BE   0309150009 P O dumpcheck      The largest dump device is too small.


thread:

do sysdumpdev -l you should see both primary and secondary dump devices from
this you need to ensure that these are big enough to hold a system dump so
type sysdumpdev -e to get an estimate on the dump size and resize your dump
devices accordingly. 

Try to increase these above the value you have if it is a new system allow
for growth of the system and give it plenty of space if possible

thread:


-------
Note:
-------

Notes on SDD and SDDPCM:
========================

Note 1:
-------

thread

Q +A:

> I've been reading IBM web sites and PDF manuals and still can't decide
> on exactly how to upgrade my AIX 4.3.3 machine to AIX 5.2 and have my
> ESS SDD vpath disks visible and working when I'm done.
>
> Has someone done this? Can you comment on my proposed method here?

Yes, I've done this.


> What I think I need to do is this:
>
> 1. Do the migration installation from 4.3.3 to 5. Question: Do I need to
> do anything to my ESS disks BEFORE migrating? Unmount? Vary off volume
> groups? Export volume groups?

Yes to all of the above, prior to upgrade. Uninstall SDD software.


> 2. After the migration, and reboot, I understand that the ESS disks will
> not "be there", since the migration does not upgrade the SDD (subsystem
> device driver) does NOT get upgraded. Question: Is this true?

Yes, the datapath devices will be gone because you deleted the SDD
software; IIRC, that is part of the un-install process. After your
upgrade, install SDD just like the first time. This will get you your
hdisks and vpaths back, though not necessarily with the same numbers; have
a 'lsvpcfg' from before your upgrade to cross-reference your new setup to.
'importvg' the VG(s) one at a time, using one of the hdisk's which
constitute the vpath, then run 'hd2vp' on the VG. That will convert the
VG back to using the vpath's.

Note: IIRC, If I Recall/Remember Correctly

>
> 3. Vary off all ESS volume groups, if I shouldn't have done this back in
> step 1.
>
> 4. Remove all the "datapath devices", via: rmdev -dl dpo -R
>
> 5. Uninstall the 4.3 version of the SDD.
>
> 6. Install the 5.2 version of the SDD.
>
> 7. Install the latest PTF of the 5.2 SDD, that they call version
> 1.5.1.3.
>
> 8. Reboot.
>
>
> If you can tell me how to make this procedure more nearly correct, I'd
> greatly appreciate it.


Note 2:
-------

thread

Q + A:

>
> I need a quick refresher here. I've got a HACMP (4.4) cluster with SAN- attached
> ESS storage. SDD is installed. Can I add volumes to one of these volume groups on
> the fly, or does HA need to be down? It's been awhile since I have done this and I
> can't quite remember if I have to jump through any hoops. Thanks for the help.

Should be relatively easy with no downtime required.
1) acquire the new disks on primary node (where the VG is in service) with: 

cfgmgr -Svl fcs0 
- repeat this for all fcs adapters in system
2) convert hdisks to vpaths, note use the smit screens for this because the commands
have changed from version to version.
3) add vpaths to VG with: extendvg4vp vgname vpath#
4) create LVs/filesystems on the vpaths.
5) break VG/scsi locks so that other systems can see the disks with: varyonvg
-b -u vgname
6) perform steps 1 & 2 for all failover nodes in the cluster.
7) refresh the VG definitions on all the failover nodes with: importvg -L
vgname vpath#
8) reestablish disk locks on service node with: varyonvg vgname
9) add new filesystems to HA configuration.
10) synchronise HA resources to the cluster.


Note 3:
-------

From IBM Doc SC30-4131-00:


hd2vp and vp2hd 

SDD provides two conversion scripts, hd2vp and vp2hd. 

The hd2vp script converts a volume group from supported storage device
hdisks to SDD vpath devices, and the vp2hd script converts a volume
group from SDD vpath devices to supported storage device hdisks. 

Use the vp2hd program when you want to configure your applications back
to original supported storage device hdisks, or when you want to remove
SDD from your AIX host system. 

The syntax for these conversion scripts is as follows:
hd2vp vgname 
vp2hd vgname 

vgname Specifies the volume group name to be converted.


Note 4:
-------

thread

Q:

Hi There, 
I want to add a vpath to running hacmp cluster with HACMP 5.1 on AIX 5.2 with Rotating Resource Group. 
If anyone has done it before then can provide a step by step procedure for this. Do i need to stop and start 
HACMP for this? 


A:

On Vg active node : 
#extendvg4vp vg00 vpath10 vpath11 
#smitty chfs ( Increase the f/s as required ) 
#varyonvg -bu vg00 ( this is to un-lock the vg) 

On Secondary node where vg is not active : 
# cfgmgr -vl fscsi0 ( fscsi1 and fcs0 and fcs1 ) 
Found new vpaths 
# chdev -l vpath10 -a pv=yes ( for vpath11 also ) 
# lsvg vg00|grep path ( just note down any one vpath which is from this o/p-for e.g vpath0 ) 
# importvg vg00 vpath0 

Once its fine...go to Primary Node 

# varyonvg vg00 ( Locking the VG ) 

Regards

Note 5:
-------

> HI,

> Is there a way to know dependencies between devices.
> For example,
> hdisk2 is attached to fscsi0 which in turn is attached to fcs0

> I have found nothing in lsdev's man
> Do I have to look in the odm directly

> I need this in order to improve a script

This is a good question and the lsdev man
page should be burned in front of the building
where they develop and document AIX in
Austin, TX, for not answering it for you.
After all, you bothered to read the damn
thing; why didn't it tell you?

$ /usr/sbin/lsdev -Cc adapter -F 'name parent'
ppa0 isa0
sa0 isa0
sa1 isa0
sa2 isa0
siokma0 isa0
fda0 isa0
scsi0 pci0
ent0 pci0
cxpa0 pci0
ent1 pci0
mga0 pci1
ent2 pci1
scsi1 pci2
sioka0 siokma0
sioma0 siokma0
ent3 pci0

There's also the lsparent command.

Regards,

Actually, I have the same question as Frederic and you have not
quite answered it. Sure, lsdev can tell you that "hdisk5" is
matched to "fcs0" . . . but what tells you that "fcs0" in turn
matches to "fscsi0"? And if "hdisk126" matches to adapter "fchan1",
how do I determine what that matches to? I've checked all of the
various lsxxxx commands but can't find this bit of info.

ONCE AGAIN the answer pops up just moments after announcing
to the world that "there's no way to do that" and "I've looked
everywhere and tried everything". Herewith the output from the
necessary commands, with extraneous lines removed:

# lsdev -C -c disk -F 'name location'
hdisk0 11-08-00-2,0
hdisk1 11-08-00-4,0
hdisk2 3A-08-01
hdisk3 3A-08-01
hdisk4 27-08-01
hdisk5 27-08-01


# lsdev -C -c driver -F 'name location'
fscsi0 27-08-01
fscsi1 3A-08-01

# lsdev -C -c adapter -F 'name location'
scsi0 11-08
scsi1 11-09
fcs0 27-08
mg20 2D-08
fcs1 3A-08
#

Obviously it is a simply matter to match disk to adapter to driver
by the location of each object. After that I can easily

sprintf(pathname, "/dev/%s", driver);
fp = open(pathname, O_RDONLY | O_NDELAY);
ioctl(fp, SCIOINQU, &info);

to get the scsi inquiry buffer.


Note 6:
-------

thread

Q:

where to fidnd a guide for the adapter (described  all its states, LED blinkging/lighting)

Adapter is cabled by SAN guys, they double checked it and when I run:

rmdev -Rl fcs0
cfgmgr -l fcs0
lsattr -El fscsi0 -l attach

I don't see "switch" but "none".


thx in advance.

A:

Did you check SAN Switch Zoning?

Regards,

Do something like:

rmdev -Rdl fscsi0
rmdev -dl fcnet0
rmdev -l fcs0
cfgmgr -l fcs0

rmdev -Rdl fscsi0

rmdev -Rdl fscsi1
rmdev -l fcs1

This way, the FC adapter re-negociates an FC fabric logon.

HTH,

I had already done something similiar but it didn't helped:

# lsslot -c slot|grep fcs0
U787B.001.DNWFFM5-P1-C4   Logical I/O Slot  pci4 fcs0
# rmdev -dl pci4 -R
fcnet0 deleted
fscsi0 deleted
fcs0 deleted
pci4 deleted
# cfgmgr
Method error (/usr/lib/methods/cfgefscsi -l fscsi0 ):
        0514-061 Cannot find a child device.
# lsattr -El fscsi0 -a attach
attach none How this adapter is CONNECTED False

the second FC is connected ok:
# lsattr -El fscsi1 -a attach
attach switch How this adapter is CONNECTED False
#

thx anyway,
I will ask my SAN team to check cables once more.
 

Note 7:
-------

thread

hdisk and vpath correspondance for IBM SAN (shark) 
Description

Correspondance between phsical disks:

4 hdisk = 1 vpath = 1 physical disk

To remove all vpaths run the command:

# rmdev -dl dpo -R

To remove all fibre channel disks (2 cards in this example):

# rmdev -dl fscsi0 -R
# rmdev -dl fscsi1 -R

To recreate the hdisks run the command:
# cfgmgr -vl fcs0
# cfgmgr -vl fcs1

To recreate the vpaths run the command:

# cfallvpath

To delete a device run this command:

# rmdev -l fcs1 -d 
Example

rmdev -dl dpo -R ; rmdev -dl fscsi0 -R ; cfgmgr -vl fcs0 ; cfallvpath 


Note 8:
-------

Technote (FAQ) 
  
Problem 
When non-root AIX users issue SDD datapath commands, the "No device file found" message results.  
  
Cause 
AIX SDD does not distinguish between file not found and invalid permissions.  
  
Solution 
Login as the root user or "su" to root user and re-execute command in order to obtain the desired SDD datapath 
command output.  


Note 9: 
-------

(thread ibm site)

Question:

Hi,

I have an AIX 5.3 server running with 2 FCs. One on a DS8300 and one on a DS4300.
On the server, i have a filesystems that is mounted and active (hdisks are from the DS8300). 
I can access it fine, write, delete etc...

Yet, when i do a "datapath query adapter" i get the following :

# datapath query adapter
Active Adapters :1
Adpt# Name State Mode Select Errors Paths Active
0 fscsi0 NORMAL ACTIVE 4111177 0 32 0

I would expect to see my 32 paths Active. I checked another server that has a similar configuration 
(though it only has 1 FC) and i can see 32 Paths, 32 Active...

Is it because of the other FC being connected to a DS4300?

Answer:

Hi.

The reason is that the vpaths are not part of a varied on volume group.
If you do a 'datapath query device' you should find all the paths will be 
state=closed.
If the vpaths are being used by a volume group, do a varyonvg xxxx.
Then display the datapath and the paths should be active.

Question:

Hi.

THanks, but as i mentionned in my original post, the VG is varied on and the FS is mounted. I ran the 
datapath command after i i varyonvg bkpvg and mount /backup. 
I then dumped a DB within the FS, deleted and everything else works...yet datapath query adapter shows 
no Active paths...weird...

Question:

Hi.

What version of SDD?
What does 'datapath query device' say?

Answer:

Version of SDD is 1.6.0.5
And a datapath query device shows :

...

DEV#: 14 DEVICE NAME: vpath14 TYPE: 2107900 POLICY: Optimized
SERIAL: 75AYYV111B7
===========================================================================
Path# Adapter/Hard Disk State Mode Select Errors
0 fscsi0/hdisk40 CLOSE NORMAL 147989 0
1 fscsi0/hdisk23 CLOSE NORMAL 0 0

DEV#: 15 DEVICE NAME: vpath15 TYPE: 2107900 POLICY: Optimized
SERIAL: 75AYYV111B8
===========================================================================
Path# Adapter/Hard Disk State Mode Select Errors
0 fscsi0/hdisk41 CLOSE NORMAL 155256 0
1 fscsi0/hdisk24 CLOSE NORMAL 0 0


yet, as i mentionned, my FS /backup is mounted and accessible... 


Note 10:
--------

thread

Q:

Hi All, 

I am having problems on a p570 on which there are 3 HBA cards. 
2 of the HBAs are connected via a SAN switch to an ESS 800. 
It appears only one of the "paths" to the ESS 800 is working 
As I only have one set of view of the disks on the ESS. 

Running cfgmgr on the adapter gives the following error. 

I have tried removing fscsi0 then unconfiguring fcs0, 
Then reconfiguring fcs0 but I still get the same error. 
Any ideas? Is there some command/utility I can run to verify 
The state of ths HBA? Thank you. 

bash-3.00# cfgmgr -l fcs0 
Method error (/usr/lib/methods/cfgefscsi -l fscsi0 ): 
0514-061 Cannot find a child device. 
bash-3.00# 

0514-061 Cannot find a child device 

A:

HI 

I have had the same problem using HDS SAN devices. 

AT that time I did not have the corect version off the device driver for the fiber cards in P570. 

For aix 5.2 
devices.pci.df1000fa >= 5.2.0.40 
For aix 5.3 
devices.pci.df1000f7 >= 5.3.0.10 

/HGA


Note 11:
--------

Greetings: 

The "0514-061 Cannot find a child device" is common when the FC card is either 
not attached to a FC device, or if it is attached, then I would look at the 
polarity of the cable 
ie. (tx -> rx and rx -> tx) NOT (tx -> tx and rx -> rx) 

cfgmgr is attempting to configure the FC device it is connected to (child 
device) but is unable to see it. 

In this context, device would be some sort of FC endpoint, not just a switch or 
director. 

I would make sure the FC card has connectivity to a FC device, not just the 
fabric and re-run cfgmgr. 


-=Patrick=- 


"Vincent D'Antonio, III" <dantoniov@COMCAST.NET> on 02/19/2003 01:51:24 PM 

Please respond to IBM AIX Discussion List <aix-l@Princeton.EDU> 

  To: aix-l@Princeton.EDU 
  cc: (bcc: Patrick Bigelbach/DSS) 
  Subject Re: Cannot cfgmgr on a new FC 

Put in your OS cd in the cdrom drive and run: 

cfgmgr -vi /dev/cd0 

this should load any filesets you need for the adapter if they are not 
already there. You should the adapter in lsdev -Cc adapter | grep fs. 

HTH 
Vince 

-----Original Message----- 
From: IBM AIX Discussion List [mailto:aix-l@Princeton.EDU] On Behalf Of 
Calderon, Linda 
Sent: Wednesday, February 19, 2003 10:12 AM 
To: aix-l@Princeton.EDU 
Subject: Cannot cfgmgr on a new FC 

I am trying to connect a new HBA on a P660 to a switch for a SAN. This HBA 
has not been used previously, newly cabled etc. I issued the following 
commands and receive the following errors: 

* rmdev -Rdl fsc1 

0514-519 The following device was not found in the customized device 
configuration database: name 'fcs1' 

* cfgmgr 

0514-061 Cannot find a child device 

Looking for ideas as to root cause. 


Note 12:
--------

thread

Q:

Hi All AIXers,
I am trying to add some vpath to Current Volume Group (which is on vpath)and i
am getting this error


Method Error (/usr/lib/methods/chgvpath):
0514-047 Cannot access a device

0516-1182 extendvg open failure on vpath3

0516-792 extendvg: Unable to estend a Volume Group

Do anybody have any idea about this error. I never seen this error before.
Thanks


A:

James,

If you're adding a vpath to a volume group that has other vpaths, you
will need to use extendvg4vp instead of extendvg.

Hope this helps!


Note 13:
--------

On Vg active node : 
#extendvg4vp vg00 vpath10 vpath11 
#smitty chfs ( Increase the f/s as required ) 
#varyonvg -bu vg00 ( this is to un-lock the vg) 

On Secondary node where vg is not active : 
# cfgmgr -vl fscsi0 ( fscsi1 and fcs0 and fcs1 ) 
Found new vpaths 
# chdev -l vpath10 -a pv=yes ( for vpath11 also ) 
# lsvg vg00|grep path ( just note down any one vpath which is from this o/p-for e.g vpath0 ) 
# importvg vg00 vpath0 

Once its fine...go to Primary Node 

# varyonvg vg00 ( Locking the VG ) 

Regards


Note 14:
--------

thread

How to add a a new PV into an existing concurrent mounted VG.

The PMR action plan suggests:

- stop of the resource group
- varyoffvg dummyvg
- varyonvg -nc dummyvg
- extendvg4vp dummyvg vpath0
- start of the resource group

as a backup action

- restart of the cluster
- extendvg4vp dummyvg vpath0
- start of the resource group

After a spech with the Country IBM referent we modify the action plan
in:

- stop of the cluster
- varyoffvg dummyvg
- varyonvg dummyvg
dummyvg should remain Enhanced Concurrent Capable, but I mount
it in normal mode to do the extentions
- extendvg4vp dummyvg vpath0
- importvg -L dummyvg disk on the other node of the cluster
- varyoffvg dummyvg
- cluster verification & syncro
- start of the cluster

Anyway before applying the modified action plan I try to follow the
original one, but with unpredictable return codes. With some vpaths
works, with someothers halfworks (update the VGDA, but not the odm),
with others return the original error.

In my opinion there is an high probability that the cause is in
gsclvmd...

So, a bit disappointed, I applied the modified plan.
All works and the extendvg4vp enlarged the dummyvg...
My machines are too downlevel and very full of lacks :-(

After that my curiosity pulls me to try the next step:

mirrorvg -s -c 2 dummyvg vpath0 vpath1
0516-1509 : VGDA corruption: physical partition info for this LV
is invalid.
0516-842 : Unable to make logical partition copies for logical
volume.
0516-1199 mirrorvg: Failed to create logical partition copies for
logical volume dummylv.
0516-1200 mirrorvg: Failed to mirror the volume group

Now, IBM support is working for analyze this new issue......

Regards.


Note 15: cfgmgr method errors:
------------------------------

1:
==

APAR status
Closed as program error.

Error description 
Users of the 64bit kernel may observe an error when cfgmgr is
invoked at runtime in the cfgsisscsi or cfgsisioa config
methods. Following is an example:
# cfgmgr
Method error (/usr/lib/methods/cfgsisscsi -l sisscsia0 ):
        0514-061 Cannot find a child device.

The error occurs in the cfgsisscsi or cfgsisioa routines
which automatically update the microcode on the adapter if
it is found to be at a level lower than the minimum supported
microcode level.

If the adapter was previously unconfigured, the adapter will
remain in the Defined state. A system reboot should make it
Available.

APAR information 
APAR number IY48873 
Reported component name AIX 5L POWER V5 
Reported component ID 5765E6200 
Reported release 520 
Status CLOSED PER 
PE NoPE 
HIPER NoHIPER 
Submitted date 2003-09-19 
Closed date 2003-09-19 
Last modified date 2003-10-24 


Note 16: cfgmgr method errors:
------------------------------

Q:

cfgmgr error-- devices are reported twice
Asked by kuntal_acharyy... on 11/28/2005 6:15:00 AM  

I have an IBM DS4400 with two EXP 700s expansion units connected to a pSeries 650 with AIX 5.1.I have 
created two logical drives in the storage unit.When i run "cfgmgr" to recognise the new raw physical volume 
each disk is reported twice. 

hdisk4 Available 1n-08-01 1742 (700) Disk Array Device 
hdisk5 Available 1n-08-01 1742 (700) Disk Array Device 
hdisk6 Available 11-08-01 1742 (700) Disk Array Device 
hdisk7 Available 11-08-01 1742 (700) Disk Array Device 

There is an error message while running cfgmgr: 

Method error (/etc/methods/cfgfdar -l dar0 ): 
0514-002 Cannot initialize the ODM. 
cfgmgr: 0514-621 WARNING: The following device packages are required for 
device support but are not currently installed. 
devices.scsi 

What may have cause the problem ? 
How ca I solve this problem? 
Any advice is truly welcome. 

A:

hi, I had met the same problem just as 
yours. 3 LPARs(AIX 5300-02) on a p570 
connect FastT600(Ds4300) with 2 HBA cards each, using SAN fibre switch. 2 of the 
LPARs reported hdisk twice, and 1 of them 
reported normally. And I found that the HBA cards on the normal one are in the PCI 
Slots belong to different BUSs, and the HBA cards on unnormal ones are in the same 
BUSs. Then I changed HBA cards to different BUSs' slots, deleted all the dar 
dac and HBA cards in the system, and cfgmgr at last. The problem got solved. I guess there must be some thing wrong with 
the BUS design. Some one told me that he solved the problem by install the last 
patch (AIX 5300-03). So my advice is that 
you should chang the HBA cards to differet 
slots, clear the system and cfgmgr. Or 
maybe update your AIX with the last patch. 
Just try and tell me the result. Good luck!


Note 17: cfgmgr method errors:
------------------------------

ed.malina@uvm.edu (Ed) wrote in message news:<bb30127.0311120759.171bdc46@posting.google.com>... 
> I deleted a scsi device from my 4.3.3 configuration with the following 
> command: 
> rmdev -l scsi2 -dR 
> 
> The device is a dual channel ultra scsi 3 card. I deleted it to try 
> to resolve some performance problems with a drawer connected to the 
> device. Incidentally, scsi3 which is the other side of the dual 
> channel card, is working fine. 
> 
> When I try to reconfigure the device with: 
> cfgmgr -v -lscsi2 
> 
> I get the following error: 
> 
> Method error (/usr/lib/methods/cfgncr_scsi -l scsi2 ): 
> 0514-034 The following attributes do not have valid values: 
> 
> Any thoughts on how to fix it? For the timebeing I can't reboot the 
> machine. Would a reboot be able to resolve the problem if there is no 
> other solution? 
> 
> Thanks! 
> -- Ed 

#>> Ed, 


what you probably should do is run the cfgmgr comand without the 
device name behind it. Because you deleted the scsi device with the 
options -dR you also removed any child devices. 


try this: cfgmgr -v 


Note 18: cfgmgr method errors:
------------------------------

Q:

Hi... 

Does someone know what to do with an SDD driver which can't detect vpaths 
from an ESS F20 but hdisks are already available on AIX? 

showvpath, cfgvpath, datapath query commands don't display or found anything 

By the way, rebooting the system didn't help 

I accept any suggestions. 

Regards 

Luis A. Rojas

A:

Thank you all for your suggestions 

I solve the problem using the hd2vp command which converts the logical 
hdisk 
to its related vpath. And Wal? !.. vpaths suddenly were recognized by 
cfgvpath command. 

I don't know why this happened, but, everything is OK now. 

To those people with similar problems, please check these following 
commands: dpovgfix, hd2vp, vp2hd 

Best Regards 


Note 19: fget_config:
---------------------

how to show the current state and volume (hdisk) ownership in a IBM DS4000 
Description

The fget_config command shows the current state and volume (hdisk) ownership.

To display controllers and hdisks that are associated with a specified DS4000 (dar):

# fget_config

To display the state of each controller in a DS4000 array, and the current path that is being used 
for I/O for each hdisk:

# fget_config -A 
Example

fget_config -A 


Note 20:
--------

Q:

dpovgfix, hd2vp, vp2hd
Asked by RandallGoff on 1/23/2007 9:38:00 AM  

What filesets do dpovgfix, hd2vp and vp2hd belong to. I installed my sdd 
driver and can see everything but can't find these commands. 

A:

They are part of your SDD drivers. You probably installed the devices.xxx filesets. Did you also 
install the host attachment script... the ibm2105 filesets?


Note 21:
--------

thread

Q:

Hi 

I have several AIX LPARS running on SVC controlled disks. Right now i have SDD SW 1.6.1.2. After configuration 
i have some vpath devices that can be managed using the datapath command. 
Now in a recent training of SVC i was asked to install the new SDDPCM driver in order to get some of the benefits 
of this SW driver. 

SDDPCM does not use the concept of vpath anymore, instead a hdisk device object is created. 
This object has definitions and attributes in ODM files. 

Recently i had to change a faulty HBA under SDD drivers. I was able to: 

1- datapath query device: in order to check hdisk devices belonging to the faulty adaptr. 
2- datapath query adapter: in order to check the faulty adapter. 
3- datapath set adapter XX offline: in order to put the faulty HAB offline. 
4- datapath remove adapter XX 
5- Used the diag Hot Plug option to remove the PCI-x HBA and install a new one. 
   Configured the system and modified the corresponden zone. 

How to do the same with SDDPCM even when there's no concept of vpath anymore. 

Thanks in advanced

A:

Hello , 
You can do the same with sddpcm , either using the MPIO commands or smitty screens , smitty devices ---> MPIO devices 
there you can list paths , remove paths , adapters. 
IN the SDD user guide there is a complete section describing what you can do , but same functions you use 
for the vpath , you can use for sddpcm. 
Here is the link for the latest user guide 
http://www-1.ibm.com/support/docview.wss?rsP3&con text=ST52G7&dc=DA490&dc=DA4A30&dc=DA480&dc=D700&dc =DA410&dc=DA4A20&dc=DA460&dc=DA470&dc=DA400&uid=ss g1 S7000303&loc=en_US&cs=utf-8&lang=en


Note 22:
--------

thread

Q:

Greetings: 

Has anyone encountered the 0516-1182 ( mkvg: Open Failure on vpath ) or 
0516-826 ( mkvg: Unable to create volume group ) 
errors while trying to create a new volume group ? 

I attempted to create a new volume group using a couple of newly added 
vpath devices and received 
those errors. 

Any help will be greatly appreciated. 

Thanks in advance. 

Jay. 

A:

Hi 

If using vpath devices then you can confirm that you can open any given device by running: 

datapath query device 

and confirm there's no error in the HBA communications. 

Also you can review the errpt reports in order to look for VPATH OPEN messages. You can also use 
the lquerypr command in order to check for SCSI reservations in the SAN box previously set 
by another host (in case of a cluster). 

Hope this helps


Example lquerypr output

# lquerypr -Vh /dev/hdisk12
connection type: fscsi1
open dev: /dev/hdisk12

Attempt to read reservation key...

Attempt to read registration keys...
Read Keys parameter
        Generation :  52
        Additional Length:  32
        Key0 :  c8ca9d09
        Key1 :  c8ca9d09
        Key2 :  c8cabd09
        Key3 :  c8cabd09
Reserve Key provided by current host = c8cabd09
Not reserved.


Note 23:
--------

thread

Q:

All, 

I'm in the process of preparing for our upcoming disaster recovery exercise 
which is happening in a few weeks. Our plan is to create one big volume 
group, instead of a bunch of little ones like we have in our production 
environment, to try and save some time. 

My question is, is there a way to script using a for/next loop to assign 
each hdisk/vpath when creating a new volume group instead of going into smit 
and assigning them one by one by hand? The hdisks will be sequential and 
will probably be over a hundred in number so you can imagine how tedious 
this will be. Also, this will need to be bigvg enabled. 

Any of you scripters out there have any suggestions? Thanks for your help in 
advance!


A:

Create the VG 
>mkvg -B -y datavg vpathN 

Extend it 
for i in `lspv | grep vpath | grep None | awk '{print #1}'` 
do 
extendvg datavg $i 
done 

That would assign all unused vpaths to the VG. BTW Use the vpath and 
not the hdisk. You could add a count into it to limit the number of 
disks you assign.


Note 24:
--------

thread

Q:

Is anyone aware of a problem if i do a

cfgmgr -vl dp0
and once the vpaths are made
it shows as
vpathxx none None

and then i add the vpath to VG

#extendvg VGname vpathxx

Does this create a problem ?

A:

it sound like the vpath is showing correctly after cfgmgr so thats OK.
But you need to use extendvg4vp and not just extendvg
Do a 'smitty vg' and choose
'Add a Data Path Volume to a Volume Group'

Once its added to a VG then it will show more info in lspv


Note 25: cfgmgr Method error (/usr/sbin/fcppcmmap > /etc/essmap.out):
---------------------------------------------------------------------

Method error (/usr/sbin/fcppcmmap > /etc/essmap.out):
        0514-001 System error:


Note 26: mkpath, lspath commands:
---------------------------------

Examples mkpath:

--To define and configure an already defined path between scsi0 and the hdisk1 device at SCSI ID 5 
and LUN 0 (i.e., connection 5,0), enter: 
# mkpath -l hdisk1 -p scsi0 -w 5,0

The system displays a message similar to the following: 
path available

--To configure an already defined path from 'fscsi0' to fiber channel disk 'hdisk1', the command would be: 
# mkpath -l hdisk1 -p fscsi0

The message would look similar to: 
path available

--To only add to the Customized Paths object class a path definition between scsi0 and the hdisk1 disk device 
at SCSI ID 5 and LUN 0, enter: 
# mkpath -d -l hdisk1 -p scsi0 -w 5,0

The system displays a message similar to the following: 
path defined


Examples lspath:

lspath displays information about paths to an MultiPath I/O (MPIO) capable device.

Examples of displaying path status:

-- To display the status of all paths to hdisk1 with column headers, enter: 
# lspath -H -l hdisk1

The system will display a message similar to the following: 
status    device   parent
enabled   hdisk1   scsi0
disabled  hdisk1   scsi1
missing   hdisk1   scsi2

-- To display, without column headers, the set of paths whose operational status is disabled, enter: 
# lspath -s disabled

The system will display a message similar to the following: 
disabled  hdisk1   scsi1
disabled  hdisk2   scsi1
disabled  hdisk23  scsi8
disabled  hdisk25  scsi8

--To display the set of paths whose operational status is failed, enter: 
# lspath -s failed

The system will display a message similar to the following: 
failed  hdisk1   scsi1
failed  hdisk2   scsi1
failed  hdisk23  scsi8
failed  hdisk25  scsi8

-- To display in a user-specified format, without column headers, the set of paths to hdisk1 whose path status 
is available enter: 
# lspath -l hdisk1 -s available -F"connection:parent:path_status:status"

The system will display a message similar to the following: 
5,0:scsi0:available:enabled
6,0:scsi1:available:disabled

Note that this output shows both the path status and the operational status of the device. 
The path status simply indicates whether the path is configured or not. The operational status indicates 
how the path is being used with respect to path selection processing in the device driver. 
Only paths with a path status of available also have an operational status. If a path is not currently configured 
into the device driver, it does not have an operational status.
Examples of displaying path attributes:

--If the target device is a SCSI disk, to display all attributes for the path to parent scsi0 at connection 5,0, 
use the command: 
# lspath -AHE -l hdisk10 -p scsi0 -w "5,0"
The system will display a message similar to the following: 
attribute  value  description                       user_settable
weight     1      Order of path failover selection  true


Note 26: About FastT and DS Storage:
------------------------------------

IBM TotalStorager FAStT has been renamed IBM TotalStorage DS4000 series 

DS4100 formerly FAStT100

DS4300 formerly FAStT600

DS4300 Turbo formerly FAStT600 Turbo

DS4400 formerly FAStT700

DS4500 formerly FAStT900


Note 27: from GPFS FAQ: 
-----------------------

Q20:

What's the difference between using an ESS with or without SDD or SDDPCM installed on the host? 

A20: 
The use of SDD or SDDPCM gives the AIX host the ability to access multiple paths to a single LUN 
within an ESS. This ability to access a single LUN on multiple paths allows for a higher degree of 
data availability in the event of a path failure. Data can continue to be accessed within the ESS 
as long as there is at least one available path. Without one of these installed, you will lose access 
to the LUN in the event of a path failure. 
However, your choice of whether to use SDD or SDDPCM impacts your ability to use single-node quourm:

Single-node quorum is not supported if SDD is installed. 
Single-node quorum is support if SDDPCM is installed.
To determine the GPFS disk support guidelines for SDD and SDDPCM for your cluster type, see

Q3: What disk support guidelines must be followed when running GPFS in an sp cluster type? 
Q6: What disk support guidelines must be followed when running GPFS in an rpd cluster type? 
Q9:What are the disk support guidelines that must be followed when running GPFS in an hacmp cluster type


Note 28: changing attributes of a fcs0 device:
----------------------------------------------

Examples:

# chdev -l fscsi0 -a fc_err_recov=fast_fail
# chdev -l fscsi0 -a dyntrk=yes

Display attributes:

# lsattr -El fscsi0

attach       switch       How this adapter is CONNECTED         False
dyntrk       no           Dynamic Tracking of FC Devices        True
fc_err_recov fast_fail    FC Fabric Event Error RECOVERY Policy True
scsi_id      0x741113     Adapter SCSI ID                       False
sw_fc_class  3            FC Class for Fabric                   True


Note 29: Flash alerts:
----------------------


IBM Flash Alert on AIX migration with vpaths:
---------------------------------------------

http://www-1.ibm.com/support/docview.wss?rs=540&context=ST52G7&uid=ssg1S1002295&loc=en_US&cs=utf-8&lang=en

All hdisks and vpath devices must be removed from host system before upgrading to SDD host attachment script 
32.6.100.21 and above. All MPIO hdisks must be removed from host system before upgrading to SDDPCM host attachment 
script 33.6.100.9. 
 Flash (Alert) 
  
Abstract 
When upgrading from SDDPCM host attachment script devices.fcp.disk.ibm2105.mpio.rte version 33.6.100.8 or below 
to 33.6.100.9, all SDDPCM MPIO hdisks must be removed from the AIX host system before the upgrade. 
When upgrading from SDD host attachment script ibm2105.rte version 32.6.100.18 or below to 32.6.100.21 or later, 
all AIX hdisks and SDD vpath devices must be removed from the AIX host system before the upgrade.  
  
Content 
Please note that this document contains the following sections:


Problem description, symptoms, and information 
SDD/host attachment upgrade procedures 
Recovery procedures should the ODM become corrupted 
Recovery procedures should the associations become corrupted 
Procedures for upgrading if rootvg is on an ESS disk

- Problem description, symptoms, and information:

Starting with SDDPCM host attachment script devices.fcp.disk.ibm2105.mpio.rte version 33.6.100.9 and 
SDD host attachment script ibm2105.rte version 32.6.100.21, ESS FCP devices are configured as "IBM MPIO FC 2105" 
for MPIO devices, and "IBM FC 2105" for ESS devices. This information can be seen in the "lsdev -Cc disk" output. 
Prior to these host attachment script versions, ESS FCP devices were configured as "IBM MPIO FC 2105XXX" for 
MPIO devices and "IBM FC 2105XXX" for ESS devices, where 'XXX' is the ESS device module, such as F20 or 800. 

If a host system is upgraded without removing all of the hdisks first, then the AIX host system ODM will 
be corrupted. Additionally, if all he hdisks are removed without removing all SDD vpath devices, 
then the associations between an SDD vpath device and its hdisks may be corrupted because the hdisk's device 
minor number may change after reconfiguration. The ODM corruption may look something like the following in the 
"lsdev -Cc disk" output:

# lsdev -Cc disk
lsdev: 0514-521 Cannot find information in the predefined device
configuration database for the customized device hdisk1.
lsdev: 0514-521 Cannot find information in the predefined device
configuration database for the customized device hdisk2.
lsdev: 0514-521 Cannot find information in the predefined device
configuration database for the customized device hdisk3.
lsdev: 0514-521 Cannot find information in the predefined device
configuration database for the customized device hdisk4.
lsdev: 0514-521 Cannot find information in the predefined device
configuration database for the customized device hdisk5.
lsdev: 0514-521 Cannot find information in the predefined device
configuration database for the customized device hdisk6.
lsdev: 0514-521 Cannot find information in the predefined device
configuration database for the customized device hdisk7.
lsdev: 0514-521 Cannot find information in the predefined device
configuration database for the customized device hdisk8.
hdisk0 Available 10-60-00-8,0 16 Bit SCSI Disk Drive
hdisk1 Available 20-60-01 N/A
hdisk2 Available 20-60-01 N/A
hdisk3 Available 20-60-01 N/A
hdisk4 Available 20-60-01 N/A
hdisk5 Available 20-60-01 N/A
hdisk6 Available 20-60-01 N/A
hdisk7 Available 20-60-01 N/A
hdisk8 Available 20-60-01 N/A

- SDD/host attachment upgrade procedures:

In order to prevent ODM corruption and vpath/hdisk association corruption, all hdisks and SDD vpath devices 
must be removed prior to the upgrade. The following procedure should be used when you want to upgrade:

- AIX OS only*
- Host attachment + AIX OS*
- SDD + AIX OS*
- Host attachment + SDD
- Host attachment only
- SDD + Host attachment + AIX OS*

* Upgrading the AIX OS will always require you to install the SDD which corresponds to the new AIX OS level.

To upgrade SDD only, follow the procedure in the SDD User's Guide.

1. Ensure rootvg is on local scsi disks. If this is not possible, see "Procedures for upgrading if rootvg is on 
   an ESS disk" below.
2. Stop all applications running on SDD Volume Groups/File Systems.
3. Unmount all File Systems of SDD volume group.
4. Varyoff all SDD volume groups.
5. If upgrading OS, save output of lspv command to remember pvids of VGs.
6. If upgrading OS, export volume groups with exportvg.
7. Remove SDD vpath devices with rmdev command.
8. Remove 2105 hdisk devices with rmdev command.
9. If upgrading OS, run 'stopsrc -s sddsrv' to stop sddsrv daemon.
10. If upgrading OS, uninstall SDD.
11. If required, upgrade ibm2105.rte. The recommended version is 32.6.100.18 if support for ESS model 750 is 
    not needed. Version 32.6.100.21 is required to support ESS model 750.
12. If upgrading OS, migrate AIX OS level.
13. If OS upgraded, boot to new AIX level with no disk groups online except rootvg, which is on local scsi disks. 
    /* reboot will automatically start at the end of migration */
14. If OS upgraded, install SDD for the new OS level. Otherwise, if required, upgrade SDD.
15. If OS not upgraded, configure hdisks with the 'cfgmgr -vl fcsX' command.
16. Configure SDD vpath devices by running 'cfallvpath'.
17. If OS upgraded, use lspv command to find out one physical volume which has a pvid matching the previous 
    SDD VG's pv.

Example:
===================================================
Previous lspv output (from step 4):
hdisk0 000bc67da3945d3c None 
hdisk1 000bc67d531c699f rootvg active
hdisk2 none None 
hdisk3 none None 
hdisk4 none None 
hdisk5 none None 
hdisk6 none None 
hdisk7 none None 
hdisk8 none None 
hdisk9 none None 
hdisk10 none None 
hdisk11 none None 
hdisk12 none None 
hdisk13 none None 
hdisk14 none None 
hdisk15 none None 
hdisk16 none None 
hdisk17 none None 
hdisk18 none None 
hdisk19 none None 
hdisk20 none None 
hdisk21 none None 
vpath0 000bc67d318fb8ea SDDVG0 
vpath1 000bc67d318fde50 SDDVG1 
vpath2 000bc67d318ffbb0 SDDVG2 
vpath3 000bc67d319018f3 SDDVG3 
vpath4 000bc67d319035b2 SDDVG4
Current lspv output (from this step):
hdisk0 000bc67da3945d3c None 
hdisk1 000bc67d531c699f rootvg active
hdisk2 000bc67d318fb8ea None 
hdisk3 000bc67d318fde50 None 
hdisk4 000bc67d318ffbb0 None 
hdisk5 000bc67d319018f3 None 
hdisk6 000bc67d319035b2 None 
hdisk7 000bc67d318fb8ea None 
hdisk8 000bc67d318fde50 None 
hdisk9 000bc67d318ffbb0 None 
hdisk10 000bc67d319018f3 None 
hdisk11 000bc67d319035b2 None 
hdisk12 000bc67d318fb8ea None 
hdisk13 000bc67d318fde50 None 
hdisk14 000bc67d318ffbb0 None 
hdisk15 000bc67d319018f3 None 
hdisk16 000bc67d319035b2 None 
hdisk17 000bc67d318fb8ea None 
hdisk18 000bc67d318fde50 None 
hdisk19 000bc67d318ffbb0 None 
hdisk20 000bc67d319018f3 None 
hdisk21 000bc67d319035b2 None 
vpath0 none None 
vpath1 none None 
vpath2 none None 
vpath3 none None 
vpath4 none None 

In this case, hdisk2, hdisk7, hdisk12, and hdisk17 from the current lspv output
has the pvid which matches the pvid of SDDVG0 from the previous lspv output. 
So, use either hdisk2, hdisk7, hdisk12, or hdisk17 to import the volume group 
with the name SDDVG0

18. Run hd2vp on all SDD volume groups.
19. Vary on all SDD volume groups.
20. Mount all file system back.

- Recovery procedures should the ODM become corrupted:

If the host system's ODM is already corrupted as a result of upgrading without removing the hdisks, 
please contact IBM Customer Support at 1-800-IBM-SERV to request a script to fix the corrupted ODM. 

- Recovery procedures should the associations become corrupted:

If vpath/hdisk association corruption has occurred because hdisks were removed without removing SDD vpath devices, 
all SDD vpath devices must be removed and reconfigured in order to correct this corrupted association.

- Procedures for upgrading if rootvg is on an ESS disk:

If rootvg is on an ESS device and cannot be moved to local scsi disks, all hdisks cannot be removed prior 
to the upgrade. In this case, the following procedure should be used to upgrade the SDD host attachment script 
to version 32.6.100.21 or later:

. Contact IBM Customer Support at 1-800-IBM-SERV to request a script to fix the corrupted ODM referenced above. 
. Without removing ESS hdisks, use smitty to upgrade the SDD host attachment script on the host system. 
. Immediately run the script to fix the corrupted ODM on the host system. 
. Run bosboot on the host system. 
. Reboot the host system so that the hdisks can be configured with the new ODM attributes. 
. Return to the "SDD/host attachment upgrade procedures" above and follow the appropriate upgrade steps now that 
  the SDD host attachment script upgrade is complete. 

This issue only occurs when upgrading to devices.fcp.disk.ibm2105.mpio.rte version 33.6.100.9 and SDD host 
attachment script ibm2105.rte version 32.6.100.21 and above.  
  
 
IBM Flash Alert: SDD 1.6.2.0 requires minimum AIX code levels; possible 0514-035 error:
---------------------------------------------------------------------------------------
 Flash (Alert) 
  
Abstract 
SDD 1.6.2.0 requires minimum AIX code levels. Not upgrading to correct AIX version and level can result in 
0514-035 error when attempting removal of dpo or vpath device  
  
Content 
Starting from SDD version 1.6.2.0, a unique ID attribute is added to SDD vpath devices, in order to 
support AIX5.3 VIO future features. AIX device configure methods have been changed in both AIX52 TL8 and 
AIX53 TL4 for this support.

Following are the requirements for this version of SDD with:

AIX5.2 and AIX5.3:  
AIX52 TL8 & above with PTF U804193 (IY76991)
AIX53 TL4 & above with PTF U804397 (IY76997)

Please view 1.6.2.0 readme for further details

If upgraded to SDD 1.6.2.0 and above without first upgrading AIX to the levels listed above the following error 
will be experienced when attempting to remove any vpath devices using the:

# rmdev -dl dpo -R

or the 

# rmdev -dl vpathX command.
                                                   
Method error (/usr/lib/methods/ucfgdevice):                           
0514-035 Cannot perform the requested function because of missing predefined information in the device 
configuration database. 

Solution:
1) Upgrade AIX to correct level and ptf, or
2) Contact SDD support at 1-800-IBM-SERV for steps to clean up ODM to allow for downgrading the SDD level 
   from 1.6.2.0, if unable to upgrade AIX to a newer technology level.  
 

Note 30:
--------

Suppose the following happens:

# rmdev -dRl fcs0

fcnet0 deleted
fscsi0 deleted
fcs0 deleted

# cfgmgr

Method error (/usr/lib/methods/cfgefscsi -l fscsi0 ):
        0514-061 Cannot find a child device.

root@n5114l02:/root#
adapter checked with several commands
connection with san seems impossible.
root@n5114l02:/root#lsattr -El fscsi0
attach       none         How this adapter is CONNECTED         False
dyntrk       no           Dynamic Tracking of FC Devices        True
fc_err_recov delayed_fail FC Fabric Event Error RECOVERY Policy True
scsi_id                   Adapter SCSI ID                       False
sw_fc_class  3            FC Class for Fabric                   True


Note 31:
--------

IY83872: AFTER CHVG -T, VG IS IN INCONSISTENT STATE 

 A fix is available 
Obtain fix for this APAR
 

APAR status
Closed as program error.

Error description 
#---------------------------------------------------
chvg -t renumber pvs that have pv numbers greater than
maxpvs with the new factor. chvg -t is only updating the
new pv_num in lvmrec and not updating the VGDA.
chvg -t leaves the vg is inconsistent state and any changes to
vg may get unpredictable results like a system crash.
Local fix 
Problem summary 
#---------------------------------------------------
chvg -t renumber pvs that have pv numbers greater than
maxpvs with the new factor. chvg -t is only updating the
new pv_num in lvmrec and not updating the VGDA.
chvg -t leaves the vg is inconsistent state and any changes to
vg may get unpredictable results like a system crash.
Problem conclusion 
Fix chvg -t to update the VGDA with the new pv number.
Add a check in hd_kextendlv to make sure that the pvol we
are trying to access is not null.
Temporary fix 
Comments 
APAR information 
APAR number IY83872 
Reported component name AIX 5.3 
Reported component ID 5765G0300 
Reported release 530 
Status CLOSED PER 
PE NoPE 
HIPER NoHIPER 
Submitted date 2006-04-11 
Closed date 2006-04-11 
Last modified date 2006-05-03 

APAR is sysrouted FROM one or more of the following:

APAR is sysrouted TO one or more of the following:

Publications Referenced


Fix information 
Fixed component name AIX 5.3 
Fixed component ID 5765G0300 

Applicable component levels 
R530 PSY U805071    UP06/05/03 I 1000 
 

Note 32:
========


ESB-2008.0267 -- [AIX] -- AIX Logical Volume Manager buffer overflow 

--------------------------------------------------------------------------------
 
Date: 14 March 2008 
AusCERT Reference #: ESB-2008.0267

Click here for printable version 
Click here for PGP verifiable version

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

===========================================================================
             AUSCERT External Security Bulletin Redistribution

                          ESB-2008.0267 -- [AIX]
                AIX Logical Volume Manager buffer overflow
                               14 March 2008

===========================================================================

        AusCERT Security Bulletin Summary
        ---------------------------------

Product:              AIX 5.2
                      AIX 5.3
Publisher:            IBM
Operating System:     AIX
Impact:               Root Compromise
Access:               Existing Account

Original Bulletin:    
http://www14.software.ibm.com/webapp/set2/subscriptions/pqvcmjd?mode=18&ID=4169

- --------------------------BEGIN INCLUDED TEXT--------------------

- -----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

IBM SECURITY ADVISORY

First Issued: Tue Jan 22 14:02:18 CST 2008
| Updated: Tue Mar 11 12:55:14 CDT 2008
| IZ10828 availablity updated
===============================================================================
                           VULNERABILITY SUMMARY

VULNERABILITY:   AIX Logical Volume Manager buffer overflow

PLATFORMS:       AIX 5.2, 5.3

SOLUTION:        Apply the fix or workaround as described below.

THREAT:          A local attacker may execute arbitrary code with root
                 privileges.

CERT VU Number:  n/a
CVE Number:      n/a
===============================================================================
                           DETAILED INFORMATION

I. OVERVIEW

    The AIX Logical Volume Manager provides a suite of utilities for
    AIX logical volume management features and functions. The primary
    fileset for the AIX Logical Volume Manager is 'bos.rte.lvm'. In
    addition, AIX provides another suite of utilities for concurrent
    logical volume management across multiple hosts.  The primary
    fileset for the AIX Concurrent Logical Volume Manager is
    'bos.clvm.enh'. Several imporant commands provided by these
    filesets for performing various logical volume management tasks
    have been identified as containing buffer overflow
    vulnerabilities.

II. DESCRIPTION

    Buffer overflow vulnerabilities exist in the 'bos.rte.lvm' and
    'bos.clvm.enh' fileset commands listed below.  A local attacker
    may execute arbitrary code with root privileges because the
    commands are setuid root.  The local attacker must be a member of
    the 'system' group to execute these commands.

    The following 'bos.rte.lvm' commands are vulnerable:

        /usr/sbin/lchangevg
        /usr/sbin/ldeletepv
        /usr/sbin/putlvodm
        /usr/sbin/lvaryoffvg
        /usr/sbin/lvgenminor

    The following 'bos.clvm.enh' command is vulnerable:

        /usr/sbin/tellclvmd

III. IMPACT

    The successful exploitation of this vulnerability allows a
    non-privileged user to execute code with root privileges.

IV. PLATFORM VULNERABILITY ASSESSMENT

    To determine if your system is vulnerable, execute the following
    command:

    lslpp -L bos.rte.lvm bos.clvm.enh

    The following fileset levels are vulnerable:

    AIX Fileset        Lower Level       Upper Level
    ------------------------------------------------
    bos.rte.lvm        5.2.0.0           5.2.0.107
    bos.rte.lvm        5.3.0.0           5.3.0.61
    bos.clvm.enh       5.2.0.0           5.2.0.105
    bos.clvm.enh       5.3.0.0           5.3.0.60

V. SOLUTIONS

    A. APARS

        IBM provides the following fixes:

        AIX Level           APAR number        Availability
        -----------------------------------------------------
        5.2.0               IZ00559            (available now)
|       5.2.0               IZ10828            05/07/2008
        5.3.0               IY98331            (available now)
        5.3.0               IY98340            (available now)
        5.3.0               IY99537            (available now)

        Subscribe to the APARs here:

        http://www.ibm.com/support/docview.wss?uid=isg1IZ00559
        http://www.ibm.com/support/docview.wss?uid=isg1IZ10828
        http://www.ibm.com/support/docview.wss?uid=isg1IY98331
        http://www.ibm.com/support/docview.wss?uid=isg1IY98340
        http://www.ibm.com/support/docview.wss?uid=isg1IY99537

        By subscribing, you will receive periodic email alerting you
        to the status of the APAR, and a link to download the fix once
        it becomes available.

    B. FIXES

        Fixes are available.  The fixes can be downloaded via ftp
        from:

        ftp://aix.software.ibm.com/aix/efixes/security/lvm_ifix.tar

        The link above is to a tar file containing this signed
        advisory, fix packages, and PGP signatures for each package.
        The fixes below include prerequisite checking. This will
        enforce the correct mapping between the fixes and AIX
        Technology Levels.

        AIX Fileset         AIX Level            Fix and Interim Fix
        -----------------------------------------------------------------
        bos.lvm.rte         5200-08              IZ10828_08.071212.epkg.Z
        bos.lvm.rte         5200-08              IZ00559_8a.071212.epkg.Z
        bos.clvm.enh        5200-08              IZ00559_8b.071212.epkg.Z

        bos.lvm.rte         5200-09              IZ10828_09.071212.epkg.Z
        bos.lvm.rte         5200-09              IZ00559_9a.071211.epkg.Z
        bos.clvm.enh        5200-09              IZ00559_9b.071211.epkg.Z

        bos.lvm.rte         5200-10              IZ10828_10.071212.epkg.Z
        bos.lvm.rte         5200-10              bos.rte.lvm.5.2.0.107.U
        bos.clvm.enh        5200-10              bos.clvm.enh.5.2.0.107.U

        bos.lvm.rte         5300-05              IY98331_05.071212.epkg.Z
        bos.lvm.rte         5300-05              IY99537_05.071212.epkg.Z
        bos.lvm.rte         5300-05              IY98340_5a.071211.epkg.Z
        bos.clvm.enh        5300-05              IY98340_5b.071211.epkg.Z

        bos.lvm.rte         5300-06              bos.rte.lvm.5.3.0.63.U
        bos.clvm.enh        5300-06              bos.clvm.enh.5.3.0.61.U

        To extract the fixes from the tar file:

        tar xvf lvm_ifix.tar
        cd lvm_ifix

        Verify you have retrieved the fixes intact:

        The checksums below were generated using the "sum", "cksum",
        "csum -h MD5" (md5sum), and "csum -h SHA1" (sha1sum) commands
        and are as follows:

        sum         filename
        ------------------------------------
        14660    17 IY98331_05.071212.epkg.Z
        26095     9 IY98340_5a.071211.epkg.Z
        40761     8 IY98340_5b.071211.epkg.Z
        10885    16 IY99537_05.071212.epkg.Z
        24909    10 IZ00559_8a.071212.epkg.Z
        64769     9 IZ00559_8b.071212.epkg.Z
        65110    10 IZ00559_9a.071211.epkg.Z
        25389     9 IZ00559_9b.071211.epkg.Z
        26812    26 IZ10828_08.071212.epkg.Z
        55064    26 IZ10828_09.071212.epkg.Z
        55484    26 IZ10828_10.071212.epkg.Z
        03885   157 bos.clvm.enh.5.2.0.107.U
        30581   128 bos.clvm.enh.5.3.0.61.U
        48971  1989 bos.rte.lvm.5.2.0.107.U
        64179  2603 bos.rte.lvm.5.3.0.63.U

        cksum              filename
        -------------------------------------------
        3121912357 16875   IY98331_05.071212.epkg.Z
        107751313  9190    IY98340_5a.071211.epkg.Z
        1129637178 7735    IY98340_5b.071211.epkg.Z
        4019303479 16201   IY99537_05.071212.epkg.Z
        1791374386 9289    IZ00559_8a.071212.epkg.Z
        3287090389 8299    IZ00559_8b.071212.epkg.Z
        565672617  9294    IZ00559_9a.071211.epkg.Z
        257555679  8302    IZ00559_9b.071211.epkg.Z
        3930477686 26525   IZ10828_08.071212.epkg.Z
        1199269029 26533   IZ10828_09.071212.epkg.Z
        358657844  26480   IZ10828_10.071212.epkg.Z
        3753492719 160768  bos.clvm.enh.5.2.0.107.U
        4180839749 131072  bos.clvm.enh.5.3.0.61.U
        3765659627 2036736 bos.rte.lvm.5.2.0.107.U
        3338925192 2665472 bos.rte.lvm.5.3.0.63.U

        csum -h MD5 (md5sum)              filename
        ----------------------------------------------------------
        73bcf7604dd13f26a7500e45468ff5f7  IY98331_05.071212.epkg.Z
        5f32179fc2156bb6e29e775aa7bff623  IY98340_5a.071211.epkg.Z
        7c47e56cadabcba0a105ffa7fc1d40fc  IY98340_5b.071211.epkg.Z
        ef3e4512c3b55091893ce733c707e1a2  IY99537_05.071212.epkg.Z
        db04be33e56169b6a8e8fd747e6948da  IZ00559_8a.071212.epkg.Z
        553f31ccf6a265333938d81eeae6dabc  IZ00559_8b.071212.epkg.Z
        2921b9d2a3dbd84591d60fddf0663798  IZ00559_9a.071211.epkg.Z
        93ce34dec8f4fa9681a2c7c86be065fc  IZ00559_9b.071211.epkg.Z
        e6b0a4a91ba197de0005bd800f06ba4e  IZ10828_08.071212.epkg.Z
        602a8c777cc27e51c3d3dbfa8ebd69be  IZ10828_09.071212.epkg.Z
        b84a5cae03921d30675e522da29da1aa  IZ10828_10.071212.epkg.Z
        2aa4b9b43ca55f74b0fac6be7bc48b66  bos.clvm.enh.5.2.0.107.U
        844e1f2ef9d388d2ddd8cf3ef6251f06  bos.clvm.enh.5.3.0.61.U
        0c73aa8f0211c400455feaa6fb8a95c4  bos.rte.lvm.5.2.0.107.U
        1b5a08eabe984d957db9a145e2a4fd06  bos.rte.lvm.5.3.0.63.U

        csum -h SHA1 (sha1sum)                    filename
        ------------------------------------------------------------------
        d9929214a4d85b986fb2e06c9b265c768c7178a9  IY98331_05.071212.epkg.Z
        0f5fbcdfbbbf505366dad160c8dec1c1ce75285e  IY98340_5a.071211.epkg.Z
        cf2cda3b8d19b73d06b69eeec7e4bae192bec689  IY98340_5b.071211.epkg.Z
        9d8727b5733bc34b8daba267b82864ef17b7156f  IY99537_05.071212.epkg.Z
        e7a366956ae7a08deb93cbd52bbbbf451d0f5565  IZ00559_8a.071212.epkg.Z
        1898733cdf6098e4f54ec36132a03ebbe0682a7e  IZ00559_8b.071212.epkg.Z
        f68c458c817f99730b193ecbd02ae24b9e51cc67  IZ00559_9a.071211.epkg.Z
        185954838c439a3c7f8e5b769aa6cc7d31123b59  IZ00559_9b.071211.epkg.Z
        6244138dc98f3fd16928b2bbcba3c5b4734e9942  IZ10828_08.071212.epkg.Z
        98bfaf44ba4bc6eba452ea074e276b8e87b41c9d  IZ10828_09.071212.epkg.Z
        2a9c0dd75bc79eba153d0a4e966d930151121d45  IZ10828_10.071212.epkg.Z
        96706ec5afd792852350d433d1bf8d8981b67336  bos.clvm.enh.5.2.0.107.U
        91f6d3a4d9ffd15d258f4bda51594dbce7011d8a  bos.clvm.enh.5.3.0.61.U
        4589a5bca998f437aac5c3bc2c222eaa51490dab  bos.rte.lvm.5.2.0.107.U
        3449afd795c24594c7a0c496f225c7148b4071ab  bos.rte.lvm.5.3.0.63.U

        To verify the sums, use the text of this advisory as input to
        csum, md5sum, or sha1sum. For example:

        csum -h SHA1 -i Advisory.asc
        md5sum -c Advisory.asc
        sha1sum -c Advisory.asc

        These sums should match exactly. The PGP signatures in the tar
        file and on this advisory can also be used to verify the
        integrity of the fixes.  If the sums or signatures cannot be
        confirmed, contact IBM AIX Security at
        security-alert@austin.ibm.com and describe the discrepancy.

     C. FIX AND INTERIM FIX INSTALLATION

        IMPORTANT: If possible, it is recommended that a mksysb backup
        of the system be created.  Verify it is both bootable and
        readable before proceeding.

        To preview a fix installation:

        installp -a -d . -p all

        To install a fix package:

        installp -a -d . -X all

        Interim fixes have had limited functional and regression
        testing but not the full regression testing that takes place
        for Service Packs; thus, IBM does not warrant the fully
        correct functionality of an interim fix.

        Interim fix management documentation can be found at:

        http://www14.software.ibm.com/webapp/set2/sas/f/aix.efixmgmt/home.html

        To preview an interim fix installation:

        emgr -e ipkg_name -p         # where ipkg_name is the name of the
                                     # interim fix package being previewed.

        To install an interim fix package:

        emgr -e ipkg_name -X         # where ipkg_name is the name of the
                                     # interim fix package being installed.

VI. WORKAROUNDS

    There are two workarounds available.

    A. OPTION 1

        Change the permissions of these commands to remove the setuid
        bit using the following commands:

        chmod 500 /usr/sbin/lchangevg
        chmod 500 /usr/sbin/ldeletepv
        chmod 500 /usr/sbin/putlvodm
        chmod 500 /usr/sbin/lvaryoffvg
        chmod 500 /usr/sbin/lvgenminor
        chmod 500 /usr/sbin/tellclvmd

        NOTE: chmod will disable functionality of these commands for
        all users except root.

    B. OPTION 2 (AIX 6.1, AIX 5.3 TL6 and TL7)

        Use the File Permissions Manager (fpm) command to manage
        setuid and setgid programs.

        fpm documentation can be found in the AIX 6 Security Redbook
        at:

        http://www.redbooks.ibm.com/abstracts/sg247430.html

        An fpm level of high will remove the setuid bit from the
        affected commands.  For example:

        fpm -l high -p    # to preview changes
        fpm -l high       # to execute changes

        NOTE: Please review the documentation before execution.  fpm
        will disable functionality of multiple commands for all users
        except root.

VII. OBTAINING FIXES

    AIX security related fixes can be downloaded from:

        ftp://aix.software.ibm.com/aix/efixes/security

    AIX fixes can be downloaded from:

        http://www.ibm.com/eserver/support/fixes/fixcentral/main/pseries/aix

    NOTE: Affected customers are urged to upgrade to the latest
    applicable Technology Level and Service Pack.

VIII. CONTACT INFORMATION

    If you would like to receive AIX Security Advisories via email,
    please visit:

        http://www14.software.ibm.com/webapp/set2/subscriptions/pqvcmjd
 
    Comments regarding the content of this announcement can be
    directed to:

        security-alert@austin.ibm.com

    To request the PGP public key that can be used to communicate
    securely with the AIX Security Team you can either:

        A. Send an email with "get key" in the subject line to:

            security-alert@austin.ibm.com

        B. Download the key from a PGP Public Key Server. The key ID is:

            0xA6A36CCC

    Please contact your local IBM AIX support center for any
    assistance.

    eServer is a trademark of International Business Machines
    Corporation.  IBM, AIX and pSeries are registered trademarks of
    International Business Machines Corporation.  All other trademarks
    are property of their respective holders.

IX. ACKNOWLEDGMENTS

    IBM discovered and fixed this vulnerability as part of its
    commitment to secure the AIX operating system.


-------
Note:
-------

AIX:


DESCRIPTOR AREA'S:
------------------

- 1. VOLUME GROUP DESCRIPTOR AREA, VGDA 

Global to the VG:
The VGDA, located at the beginning of each physical volume, contains information that describes all
the LV's and all the PV's that belong to the VG of which that PV is a member.
The VGDA makes a VG selfdescribing. An AIX System can read the VGDA on a disk, and from that, can
determine what PV's and LV's are part of this VG.
There are one or two copies per disk.

- 2. VOLUME GROUP STATUS AREA, VGSA

Tracks the state of mirrorred copies.
The VGSA contains state information about physical partitions and physical volumes.
For example, the VGSA knows if one PV in a VG is unavailable.

Each PV has at least one VGDA/VGSA. The number of VGDA's contained on a single disk
varies according to the number of disks in the VG.

- 3. LOGICAL VOLUME CONTROL BLOCK, LVCB

Contains LV attributes (policies, number of copies).
The LVCB is located at the start of every LV. It contains information about the logical volume. 
You can however, use the mklv command with the -T option, to request that the LVCB will not
be stored in the beginning of the LV. 

With Scalable VG's, LVCM info is no longer stored in the first user block of any LV.
All relevant LVCM info is kept in the VGDA.


The lqueryvg command:
---------------------

The lqueryvg command reads the VGDA from a specified disk in a VG.

Example:

# lqueryvg -p hdisk1 -At
# lqueryvg -Atp hdisk0

-p: which PV
-A: show all available information
-t: show descriptive tags

Example:

#lqueryvg -Atp hdisk0
Max LVs:        256
PP Size:        25
Free PPs:       468
LV count:       20
PV count:       2
Total VGDAs:    3
Conc Allowed:   0
MAX PPs per PV  1016
MAX PVs:        32
Conc Autovaryo  0
Varied on Conc  0
Logical:        00c665ed00004c0000000112b7408848.1   hd5 1
                00c665ed00004c0000000112b7408848.2   hd6 1
                00c665ed00004c0000000112b7408848.3   hd8 1
                00c665ed00004c0000000112b7408848.4   hd4 1
                00c665ed00004c0000000112b7408848.5   hd2 1
                00c665ed00004c0000000112b7408848.6   hd9var 1
                00c665ed00004c0000000112b7408848.7   hd3 1
                00c665ed00004c0000000112b7408848.8   hd1 1
                00c665ed00004c0000000112b7408848.9   hd10opt 1
                00c665ed00004c0000000112b7408848.10  hd7 1
                00c665ed00004c0000000112b7408848.11  hd7x 1
                00c665ed00004c0000000112b7408848.12  beheerlv 1
                00c665ed00004c0000000112b7408848.13  varperflv 1
                00c665ed00004c0000000112b7408848.14  loglv00 1
                00c665ed00004c0000000112b7408848.15  db2_server_v8 1
                00c665ed00004c0000000112b7408848.16  db2_var_v8 1
                00c665ed00004c0000000112b7408848.17  db2_admin_v8 1
                00c665ed00004c0000000112b7408848.18  db2_adminlog_v8 1
                00c665ed00004c0000000112b7408848.19  db2_dasscr_v8 1
                00c665ed00004c0000000112b7408848.20  db2_Fixpak10 1
Physical:       00c665edb74079bc                2   0
                00c665edb7f2987a                1   0
Total PPs:      1022
LTG size:       128
HOT SPARE:      0
AUTO SYNC:      0
VG PERMISSION:  0
SNAPSHOT VG:    0
IS_PRIMARY VG:  0
PSNFSTPP:       4352
VARYON MODE:    0
VG Type:        0
Max PPs:        32512


The lquerypv command:
---------------------

-------
How do I find out what the maximum supported logical track group (LTG) size of my hard disk? 

You can use the lquerypv command with the -M flag. The output gives the LTG size in KB. For instance, 
the LTG size for hdisk0 in the following example is 256 KB.

/usr/sbin/lquerypv -M hdisk0
256
------ 

run 

lquerypv -h core 6b0 

to find the executable (probably man, but man may have called 
something else in the background) 

then run 

dbx path_/to_/executable core 

and run the subcommand 


dbx> where 

and paste the stack output, should be able to find it from there. also 
paste the level of fileset you are on for the executable 


lslpp -w /path_/to_/executable -> this will give fileset_name 
lslpp -l fileset_name 

-------

Wie l,sst sich ein Storage Lock auf einer SAN-Disk brechen?
Endlich die ersehnte SAN-Disk bekommen und dann das, es l,sst sich keine Volume Group darauf anlegen. 

# mkvg -f vpath100 

gibt einen I/O Error. Was tun? 
H"chstwahrscheinlich befindet sich noch ein Lock auf der SAN-Disk. Dies l,sst sich mit dem Befehl 

# lquerypv -ch /dev/vpath100

aufbrechen und die Volume Group kann angelegt werden. 


-------

# lquerypv -h /dev/hdisk9 80 10
  00000080   00001155 583CD4B0 00000000 00000000  |...UX<..........|


# lquerypv -h /dev/hdisk1
00000000   C9C2D4C1 00000000 00000000 00000000  |................|
00000010   00000000 00000000 00000000 00000000  |................|
00000020   00000000 00000000 00000000 00000000  |................|
00000030   00000000 00000000 00000000 00000000  |................|
00000040   00000000 00000000 00000000 00000000  |................|
00000050   00000000 00000000 00000000 00000000  |................|
00000060   00000000 00000000 00000000 00000000  |................|
00000070   00000000 00000000 00000000 00000000  |................|
00000080   00C665ED B7F2987A 00000000 00000000  |..e....z........|
00000090   00000000 00000000 00000000 00000000  |................|
000000A0   00000000 00000000 00000000 00000000  |................|
000000B0   00000000 00000000 00000000 00000000  |................|
000000C0   00000000 00000000 00000000 00000000  |................|
000000D0   00000000 00000000 00000000 00000000  |................|
000000E0   00000000 00000000 00000000 00000000  |................|
000000F0   00000000 00000000 00000000 00000000  |................|

# lquerypv -h /dev/hdisk0 80 10

root@zd93l12:/root#lquerypv -h /dev/hdisk0 80 10
00000080   00C665ED B74079BC 00000000 00000000  |..e..@y.........|


The getlvcb command:
--------------------

The LVCB stores attributes of a LV. The getlvcb command reads the LVCB of a specified LV.
Displays a formatted output of the data in the LVCB of a LV.

Example:

# getlvcb -At hd2

# getlvcb -TA hd3 
Displays the information held in the LVCB of LV hd3. 


The putlvcb command:
--------------------

Writes the control block information (only the specified fields) into block 0 of a logical volume (LVCB).


# putlvcb -t jfs lvdata
writes the LV type jfs to the LVCB of LV lvdata. 


-------
Note:
-------

AIX:


Fixing ODM problems on a VG which is not the rootvg:
====================================================

In the following examle, the VG is called "myvg" consisting of the Physical Volume hdisk3.

1. Unmount all filesystems in that VG first, otherwise you cannot varyoff the VG.
Then varyoff the VG.

# varyoffvg myvg

2. Now remove the complete information of that VG from ODM. The VGDA and LVCB
on the actual disks are NOT touched by the exportvg command.

# exportvg myvg

3. Now import the VG and create new ODM objects associated with that VG:

# importvg -y myvg hdisk3

You only need to specify one intact PV of the VG in the above command. Any disk in the VG
will have a VGDA which contains all neccessary information.
The importvg command reads the VGDA and LVCB on that disk and creates completely new ODM entries.


Fixing ODM problems on the rootvg:
==================================

rvgrecover:
-----------

You can try to use the "rvgrecover" shell script.
The rootvg cannot be varied off, like an ordinary VG, so the solution from the
former section cannot be used.
But the script "rvgrecover" issues a series of odmdelete statements, just like exportvg does.
At the end of the script, an importvg is done.
The importvg command, reads the VGDA and LVCB from the boot disk, resulting in new ODM entries.

The rvgrecover script has the following contents:

Reinitializing the rootvg Volume Group 
To reinitialize the rootvg volume group, copy the shell script to /bin/rvgrecover and run 
the following to make that file executable: 

chmod +x /bin/rvgrecover 
Then run: 

/bin/rvgrecover
Use the following shell script to reinitialize the ODM entries for the rootvg volume group: 

PV=/dev/ipldevice  # PV=hdisk0
VG=rootvg
    cp /etc/objrepos/CuAt /etc/objrepos/CuAt.$$
    cp /etc/objrepos/CuDep /etc/objrepos/CuDep.$$
    cp /etc/objrepos/CuDv /etc/objrepos/CuDv.$$
    cp /etc/objrepos/CuDvDr /etc/objrepos/CuDvDr.$$
    lqueryvg -Lp $PV | awk '{ print $2 }' | while read LVname; do
        odmdelete -q "name = $LVname" -o CuAt
        odmdelete -q "name = $LVname" -o CuDv
        odmdelete -q "value3 = $LVname" -o CuDvDr
    done
    odmdelete -q "name = $VG" -o CuAt
    odmdelete -q "parent = $VG" -o CuDv
    odmdelete -q "name = $VG" -o CuDv
    odmdelete -q "name = $VG" -o CuDep
    odmdelete -q "dependency = $VG" -o CuDep
    odmdelete -q "value1 = 10" -o CuDvDr
    odmdelete -q "value3 = $VG" -o CuDvDr
    importvg -y $VG $PV      # ignore lvaryoffvg errors
    varyonvg $VG


redefinevg:
-----------

redefinevg Command

Purpose
Redefines the set of physical volumes of the given volume group in the device configuration database. 

Syntax
redefinevg { -d Device | -i Vgid } VolumeGroup

Description
During normal operations the device configuration database remains consistent with the 
Logical Volume Manager (LVM) information in the reserved area on the physical volumes. 
If inconsistencies occur between the device configuration database and the LVM, the redefinevg command 
determines which physical volumes belong to the specified volume group and re-enters this information 
in the device configuration database. The redefinevg command checks for inconsistencies by reading 
the reserved areas of all the configured physical volumes attached to the system.


Note: To use this command, you must either have root user authority or be a member of the system group.

Flags

-d Device The volume group ID, Vgid, is read from the specified physical volume device. 
   You can specify the Vgid of any physical volume belonging to the volume group that you are redefining. 
-i Vgid The volume group identification number of the volume group to be redefined. 

Example

To redefine rootvg physical volumes in the Device Configuration Database, enter a command similar to the following:

# redefinevg -d hdisk0 rootvg


synclvodm:
----------

synclvodm Command 
Purpose
Synchronizes or rebuilds the logical volume control block, the device configuration database, 
and the volume group descriptor areas on the physical volumes. 

Syntax
synclvodm [ -v ] VolumeGroup [ LogicalVolume ... ] 


Description
During normal operations, the device configuration database remains consistent with the 
logical volume manager information in the logical volume control blocks and the volume group descriptor 
areas on the physical volumes. If for some reason the device configuration database is not consistent 
with Logical Volume Manager information, the synclvodm command can be used to resynchronize the database. 
The volume group must be active for the resynchronization to occur (see varyonvg). 
If logical volume names are specified, only the information related to those logical volumes is updated. 

Attention: Do not remove the /dev entries for volume groups or logical volumes. Do not change the 
device configuration database entries for volume groups or logical volumes using the object data manager. 
Note: To use this command, you must either have root user authority or be a member of the system group.
Flags
-v verbose 

Example

To synchronize the device configuration database with the logical volume manager information for rootvg, 
enter the following: 

synclvodm rootvg


How to Replace a Disk?: 
=======================

1. Short version for normal VG (not rootvg) and the disk is working:
--------------------------------------------------------------------

extendvg VolumeGroupName hdiskY
migratepv hdiskX hdiskY
reducevg -d VolumeGroupName hdiskX


2. More Detail:
---------------

2.1 The disk is mirrored:
-------------------------

1. Remove all copies from the disk:
   # unmirrorvg vg_name hdiskX

2. Remove disk from VG:
   # reducevg vg_name hdiskX

3. Remove disk from ODM:
   # rmdev -l hdiskX -d

4. Add new disk to the system.

5. Add the new disk to the VG:
   # extendvg vg_name hdiskY

6. Create new copies:
   # mirrorvg vg_name 
   # syncvg vg_name


2.2 The disk was not mirrored, or you want to replace a working disk:
---------------------------------------------------------------------

1. Add the new disk to the system.

2. Add the disk to the VG:
   # extendvg vg_name hdiskY

3. Migrate old disk to new disk:
   # migratepv hdiskX hdiskY

4. Remove old disk from VG:
   # reducevg vg_name hdiskX

5. Remove old disk from ODM:
   # rmdev -l hdiskX -d


2.3 Replace the disk in the rootvg:
-----------------------------------

1. Add the new disk to the system.

2. Add the disk to the VG:
   # extendvg rootvg hdiskY

3. The diskX contains hd5? If so:

   # migratepv -l hd5 hdiskX hdiskY
   # bosboot -ad /dev/hdiskY
   # chpv -c hdiskX
   # bootlist -m normal hdiskY

   If hdiskX contains the primary dump device, you must deactivate it:
   # sysdumpdev -p /dev/sysdumpnull

4. Migrate old disk to new disk:
   # migratepv hdiskX hdiskY

   If the primary dump device has been deactivated, activate it again
   # sysdumpdev -p /dev/hdX

5. Remove old disk from VG:
   # reducevg rootvg hdiskX

6. Remove old disk from ODM:
   # rmdev -l hdiskX -d


-------
Note:
-------

IY94101: J2_DMAP_CORRUPT ERROR REPORT AFTER SHRINKING JFS2 FILESYSTEM

APAR status
Closed as program error.

Error description 
After shrinking a filesystem, J2_DMAP_CORRUPT reports
appear in the error report and some file creates/writes
fail with "Invalid file system control data detected".
Local fix 
Problem summary 
Problem conclusion 
Temporary fix 
Comments 
APAR information 
APAR number IY94101 
Reported component name AIX 5.3 
Reported component ID 5765G0300 
Reported release 530 
Status CLOSED PER 
PE NoPE 
HIPER NoHIPER 
Submitted date 2007-01-26 
Closed date 2007-01-29 
Last modified date 2007-05-25 

APAR is sysrouted FROM one or more of the following:

APAR is sysrouted TO one or more of the following:

Publications Referenced


Fix information 
Fixed component name AIX 5.3 
Fixed component ID 5765G0300 


-------
Note:
-------

Q:

Since applying ML7 for AIX 5.1 I have been getting file corruption error 
messages on a particular filesystem and the only way to fix it is to umount 
the filesystem and fsck it. I thought it might be a hardware problem but 
now it is also happening on another machine I put the ML7 on and it is 
happening to the same filesystem (one machine is a test server of the 
other). The only unique thing about the filesystem is that it is not in 
rootvg and it is large -1281228 1024-blocks. Has anyone heard of this? 
Below is the error I am getting: 
LABEL: JFS_META_CORRUPTION 
IDENTIFIER: 684A365B 


Date/Time: Tue Apr 26 13:45:26 EDT 
Sequence Number: 2023 
Machine Id: 0000F11F4C00 
Node Id: XX00 
Class: U 
Type: UNKN 
Resource Name: SYSPFS 
Resource Class: NONE 
Resource Type: NONE 
Location: NONE 
VPD: 


Description 
FILE SYSTEM CORRUPTION 


Probable Causes 
INVALID FILE SYSTEM CONTROL DATA 


        Recommended Actions 
        PERFORM FULL FILE SYSTEM RECOVERY USING FSCK UTILITY OBTAIN 
DUMP 
        CHECK ERROR LOG FOR ADDITIONAL RELATED ENTRIES 


Failure Causes 
ADAPTER HARDWARE OR MICROCODE 
DISK DRIVE HARDWARE OR MICROCODE 
SOFTWARE PROGRAM 
STORAGE CABLE LOOSE, DEFECTIVE, OR UNTERMINATED 


        Recommended Actions 
        CHECK CABLES AND THEIR CONNECTIONS 
        INSTALL LATEST ADAPTER AND DRIVE MICROCODE 
        INSTALL LATEST STORAGE DEVICE DRIVERS 
        IF PROBLEM PERSISTS, CONTACT APPROPRIATE SERVICE REPRESENTATIVE 


Detail Data 
FILE NAME 
xix_lookup.c 
LINE NO. 
         300 
MAJOR/MINOR DEVICE NUMBER 
0026 0006 
ADDITIONAL INFORMATION 
4A46 5345 426E 8C46 0000 000E 0000 001D 0003 0610 0000 0000 0000 0000 0000 
0002 
164D A330 0001 86D3 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 
0000 
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 
0000 
--------------------------------------------------------------------------- 
LABEL: JFS_FSCK_REQUIRED 
IDENTIFIER: CD546B25 


Date/Time: Tue Apr 26 13:45:26 EDT 
Sequence Number: 2022 
Machine Id: 0000F11F4C00 
Node Id: XX00 
Class: O 
Type: INFO 
Resource Name: SYSPFS 


Description 
FILE SYSTEM RECOVERY REQUIRED 


        Recommended Actions 
        PERFORM FULL FILE SYSTEM RECOVERY USING FSCK UTILITY 


Detail Data 
MAJOR/MINOR DEVICE NUMBER 
0026 0006 
FILE SYSTEM DEVICE AND MOUNT POINT 
/dev/lv04, /opt/egate 


Q: 

How can I remove a bizarre, irremovable file from a directory? I've tried every way of using 
/bin/rm and nothing works." 

A: 

In some rare cases a strangely-named file will show itself in your directory and appear to be 
un-removable with the rm command. Here is will the use of ls -li and find with its -inum [inode] 
primary does the job. 
Let's say that ls -l shows your irremovable as 

-rw-------  1 smith  smith  0 Feb  1 09:22 ?*?*P

Type: 

ls -li

to get the index node, or inode. 

153805 -rw-------  1 smith  smith  0 Feb  1 09:22 ?*?^P

The inode for this file is 153805. Use find -inum [inode] to make sure that the file is correctly identified. 


%  find -inum 153805 -print
./?*?*P

Here, we see that it is. Then used the -exec functionality to do the remove. . 
  
% find . -inum 153805 -print -exec /bin/rm {} \;

Note that if this strangely named file were not of zero-length, it might contain accidentally misplaced 
and wanted data. Then you might want to determine what kind of data the file contains and move the file 
to some temporary directory for further investigation, for example: 

% find . -inum 153805 -print -exec /bin/mv {} unknown.file \;

Will rename the file to unknown.file, so you can easily inspect it. 

Another way to remove strangely-named files is to use "ls -q" or "cat -v" to show the special characters, 
and then use shell's globbing mechanism to delete the file. 

$ ls
-????*'?
$ ls | cat -v
-^B^C?^?*'

$ rm ./-'^B'*           -- achieved by typing control-V control-B
$ ls


the argument given to rm is a judicious selection of glob wildcards (*'s) and sufficient control characters 
to uniquely identify the file. The leading "./" is useful when the file begins with a hyphen. 
These binary name files are caused by: 

* accidental cut-and-pastes to shell prompts - especially when you paste something of the form: "junk > garbage" 
because the shell creates the file "garbage" before trying to execute the command "junk" 

* filesystem corruption (in which case touching the filesystem any more can really stuff things up) 
If you discover that you have two files of the same name, one of the files probably has a bizarre 
(and unprintable) character in its name. Most probably, this unprintable character is a backspace. 

For example: 


    $ ls
    filename filename
    $ ls -q
    filename fl?ilename
    $ ls | cat -v
    filename
    fl^Hilename


More on Filesystem errors (1):
------------------------------


Q:

Hi all, 

I have a error message complaining about filesystem being full. 
but df does not sure any filesystem being full. 
The error report gives me the major/minor number: 0027/0004 
I went to /dev dir, and searched for the numbers, but it turns out to be ptyp4. 
Why is that? What does this mean? 

Any suggestion? 

A:

Those numbers are reported in hex, the actual major/minor #'s 
are 39 and 4

A:

Convert the errpt #'s to hex. The use ls -l to find them. 


Q:

Hi, 
I get a error concerning a filesystem. 
Now I have 2 questions: 


- What is the way to find out which filesystems is concerned? 
- What can I do? Because all fs have unused space. I cannot find any fs 
with 100% in use. 

LABEL:            J2_FS_FULL
IDENTIFIER: CED6B4B5
Date/Time:       Mon Dec 27 12:49:35 NFT
Sequence Number: 3420
Machine Id:      00599DDD4C00
Node Id:         srvdms0
Class:           O
Type:            INFO
Resource Name:   SYSJ2
Description
UNABLE TO ALLOCATE SPACE IN FILE SYSTEM
Probable Causes
FILE SYSTEM FULL
 Recommended Actions
 INCREASE THE SIZE OF THE ASSOCIATED FILE SYSTEM  REMOVE UNNECESSARY
DATA FROM FILE SYSTEM  USE FUSER UTILITY TO LOCATE UNLINKED FILES STILL
REFERENCED
Detail Data
JFS2 MAJOR/MINOR DEVICE NUMBER
 002B 000B
 

A:

002b is 2*16+11 -->43 
ls -l /dev|grep 43, 
000b is 11 --> look for 43, 11 

Date:         Wed, 29 Dec 2004 11:06:27 +0000
To: aix-l@Princeton.EDU

Q:

Subject 
Re: error concerning filesystem [Virus checked] 

Hi Holger, 

A small query...how did you arrive at this figure of 43 from the error 
code. 
The decimal value of B is 11 but I could not understand the 2*16.. 

can you please exp this.... 

A:

The major/minor numbers (002B 000B) are in hex: hex abcd = 
a*16^3+b*16^2+c*16^1+d therefore hex 002B=0*16^3+0*16^2+2*16^1+11=2*16+11 


AIX superblock issues:
----------------------

-- Hint 1 for AIX:
-- ---------------

thread:

Use this command in case the superblock is corrupted. This will restore the BACKUP COPY of the superblock 
to the CURRENT copy.

# dd count=1 bs=4k skip=31 seek=1 if=/dev/hd4 of=/dev/hd4

# fsck /dev/hd4 2>&1 | tee /tmp/fsck.errors


Note:

fuser
Identifies processes using a file or file system

# fuser -u /dev/hd3
Sample output: /dev/hd3: 2964(root) 6615c(root) 8465(casado) 11290(bonner)


-- Hint 2 for AIX:
-- ---------------

http://publib.boulder.ibm.com/infocenter/pseries/v5r3/index.jsp?topic=/com.ibm.aix.howtos/doc/howto/HT_baseadmn_badmagnumber.htm


Fixing a corrupted magic number in the file system superblock
If the superblock of a file system is damaged, the file system cannot be accessed. You can fix a 
corrupted magic number in the file system superblock.

Most damage to the superblock cannot be repaired. The following procedure describes how to repair a superblock 
in a JFS file system when the problem is caused by a corrupted magic number. If the primary superblock is corrupted 
in a JFS2 file system, use the fsck command to automatically copy the secondary superblock and repair the primary 
superblock.

In the following scenario, assume /home/myfs is a JFS file system on the physical volume /dev/lv02.

The information in this how-to was tested using AIXr 5.2. If you are using a different version or level of AIX, 
the results you obtain might vary significantly. 

1. Unmount the /home/myfs file system, which you suspect might be damaged, using the following command: 

# umount /home/myfs

2. To confirm damage to the file system, run the fsck command against the file system. For example: 

# fsck -p /dev/lv02

If the problem is damage to the superblock, the fsck command returns one of the following messages: 

fsck: Not an AIXV5 file system
OR 
Not a recognized filesystem type

3. With root authority, use the od command to display the superblock for the file system, 
as shown in the following example: 

# od -x -N 64 /dev/lv02 +0x1000

Where the -x flag displays output in hexadecimal format and the -N flag instructs the system to format 
no more than 64 input bytes from the offset parameter (+), which specifies the point in the file where 
the file output begins. The following is an example output: 

0001000  1234 0234 0000 0000 0000 4000 0000 000a
0001010  0001 8000 1000 0000 2f6c 7633 0000 6c76
0001020  3300 0000 000a 0003 0100 0000 2f28 0383
0001030  0000 0001 0000 0200 0000 2000 0000 0000
0001040

In the preceding output, note the corrupted magic value at 0x1000 (1234 0234). If all defaults were taken 
when the file system was created, the magic number should be 0x43218765. If any defaults were overridden, 
the magic number should be 0x65872143. 

4. Use the od command to check the secondary superblock for a correct magic number. An example command 
and its output follows: 

# od -x -N 64 /dev/lv02 +0x1f000

001f000  6587 2143 0000 0000 0000 4000 0000 000a
001f010  0001 8000 1000 0000 2f6c 7633 0000 6c76
001f020  3300 0000 000a 0003 0100 0000 2f28 0383
001f030  0000 0001 0000 0200 0000 2000 0000 0000
001f040

Note the correct magic value at 0x1f000. 

5. Copy the secondary superblock to the primary superblock. An example command and output follows: 

# dd count=1 bs=4k skip=31 seek=1 if=/dev/lv02 of=/dev/lv02

dd: 1+0 records in.
dd: 1+0 records out.

Use the fsck command to clean up inconsistent files caused by using the secondary superblock. For example: 

# fsck /dev/lv02 2>&1 | tee /tmp/fsck.errs

For more information

The fsck and od command descriptions in AIX 5L Version 5.3 Commands Reference, Volume 4 
AIX Logical Volume Manager from A to Z: Introduction and Concepts, an IBM Redbook 
AIX Logical Volume Manager from A to Z: Troubleshooting and Commands, an IBM Redbook 
"Boot Problems" in Problem Solving and Troubleshooting in AIX 5L, an IBM Redbook 


Linux superblock issues:
------------------------

1.

DAMAGED SUPERBLOCK


If a filesystem check fails and returns the error message "Damaged Superblock" you're lost . . . . . . . 
or not ?
Well, not really, the damaged "superblock" can be restored from a backup. There are several backups stored 
on the harddisk. But let me first have a go at explaining what a "superblock"is.

A superblock is located at position 0 of every partition, contains vital information about the filesystem 
and is needed at a fielsystem check.

The information stored in the superblock are about what sort of fiesystem is used, the I-Node counts, 
block counts, free blocks and I-Nodes, the numer of times the filesystem was mounted, date of the 
last filesystem check and the first I-Node where / is located.

Thus, a damaged superblock means that the filesystem check will fail. 

Our luck is that there are backups of the superblock located on several positions and we can restore 
them with a simple command.

The usual ( and only ) positions are: 8193, 32768, 98304, 163840, 229376 and 294912. ( 8193 in many cases 
only on older systems, 32768 is the most current position for the first backup )
You can check this out and have a lot more info about a particular partition you have on your HD by:


CODE  
# dumpe2fs /dev/hda5 

You will see that the primary superblock is located at position 0, and the first backup on position 32768.
O.K. let's get serious now, suppose you get a "Damaged Superblock" error message at filesystem check 
( after a power failure ) and you get a root-prompt in a recovery console, then you give the command:

CODE  
# e2fsck -b 32768 /dev/hda5 


don't try this on a mounted filesystem

It will then check the filesystem with the information stored in that backup superblock and if the check 
was successful it will restore the backup to position 0.
Now imagine the backup at position 32768 was damaged too . . . then you just try again with the backup 
stored at position 98304, and 163840, and 229376 etc. etc. until you find an undamaged backup  
( there are five backups so if at least one of those five is okay it's bingo ! )

So next time don't panic . . just get the paper where you printed out this Tip and give the magic command
 
CODE  
# e2fsck -b 32768 /dev/hda5  


-------
Note:
-------

XXX


LABEL:          LVM_SA_PVMISS
IDENTIFIER:     F7DDA124

Date/Time:       Fri Jan 30 17:25:44 MET 2009
Sequence Number: 1079
Machine Id:      00CC94EE4C00
Node Id:         srv1
Class:           H
Type:            UNKN
Resource Name:   LVDD
Resource Class:  NONE
Resource Type:   NONE
Location:

Description
PHYSICAL VOLUME DECLARED MISSING

Probable Causes
POWER, DRIVE, ADAPTER, OR CABLE FAILURE

Detail Data
MAJOR/MINOR DEVICE NUMBER
8000 0011 0000 0001
SENSE DATA
00CC 94EE 0000 4C00 0000 011D B976 A0BF 00CC 94EE DAE4 754C 0000 0000 0000 0000


-------
Note:
-------

case 1:

LABEL:          SCAN_ERROR_CHRP
IDENTIFIER:     BFE4C025

Date/Time:       Thu Apr  2 10:28:37 ZOM 2009
Sequence Number: 1083
Machine Id:      00CDA84C4C00
Node Id:         srv1
Class:           H
Type:            PERM
Resource Name:   sysplanar0
Resource Class:  planar
Resource Type:   sysplanar_rspc
Location:

Description
UNDETERMINED ERROR

Failure Causes
UNDETERMINED

        Recommended Actions
        RUN SYSTEM DIAGNOSTICS.

Detail Data
PROBLEM DATA
0644 00E0 0000 0698 9E00 8E00 0000 0000 0000 0000 4942 4D00 5048 0030 0100 DD00
2009 0402 0824 5933 2009 0402 0824 5976 4500 010A 0000 0000 0000 0000 0000 0000
5046 70BE 5046 70BE 5548 0018 0100 DD00 2303 2000 0000 E500 0000 A800 0000 0000


case 2:

LABEL:          INTRPPC_ERR
IDENTIFIER:     853015D6

Date/Time:       Sun Mar 22 00:27:49 MET 2009
Sequence Number: 1515
Machine Id:      00C503AC4C00
Node Id:         heilbot
Class:           H
Type:            UNKN
Resource Name:   sysplanar0
Resource Class:  planar
Resource Type:   sysplanar_rspc
Location:

Description
UNDETERMINED ERROR

Probable Causes
SYSTEM I/O BUS
SOFTWARE PROGRAM
ADAPTER
DEVICE

        Recommended Actions
        PERFORM PROBLEM DETERMINATION PROCEDURES

Detail Data
BUS NUMBER
9001 00C0
INTERRUPT LEVEL
0009 0001
Number of Occurrences
0000 0001


-------
Note:
-------

LABEL:          FSCSI_ERR4
IDENTIFIER:     3074FEB7

Date/Time:       Sat Mar 28 21:01:31 MET 2009
Sequence Number: 1597429
Machine Id:      0005A21CD700
Node Id:         goofy
Class:           H
Type:            TEMP
Resource Name:   fscsi0
Resource Class:  driver
Resource Type:   efscsi
Location:        U787F.001.DPM2D71-P1-C5-T1

Description
ADAPTER ERROR

Probable Causes
ADAPTER HARDWARE OR CABLE
ADAPTER MICROCODE
FIBRE CHANNEL SWITCH OR FC-AL HUB

Failure Causes
ADAPTER
CABLES AND CONNECTIONS
DEVICE

        Recommended Actions
        PERFORM PROBLEM DETERMINATION PROCEDURES
        CHECK CABLES AND THEIR CONNECTIONS
        VERIFY DEVICE CONFIGURATION

Detail Data
SENSE DATA
0000 0000 0000 00AF 0000 0902 0200 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 003D 001B 0000 0000
003F 001B 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 402F 0000 0032 0002 0000 0000 0000 0000 0203 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0003 5006 0484
52A5 BB6C 5006 0484 52A5 BB6C 0200 0000 0000 0000 0000 0000 0000 0000 0000 0000
2D6B 3000


-------
Note:
-------


LABEL:          NONE_DUMP
IDENTIFIER:     A63BEB70

Date/Time:       Mon Nov 10 15:27:23 MET 2008
Sequence Number: 4598
Machine Id:      00C503AC4C00
Node Id:         goofy
Class:           U
Type:            NONE
Resource Name:   SYSPROC
Resource Class:  NONE
Resource Type:   NONE
Location:


Detail Data

0000 000B 0000 0000 000C 60D6 0000 001A 0000 00A0 0000 0000 2F76 6172 2F63 6F72
652F 636F 7265 2E38 3131 3232 322E 3130 3134 3237 3233 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
6B63 7000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 6663 6C6F 7365 2039 380A 6361 7467 6574 7320 3144 300A 6361 7467 6574 7320
3144 300A 5F58 4465 6661 756C 7420 3438 0A5F 5849 4F45 7272 6F72 2032 430A 785F
7265 6164 5F35 3820 4230 0A5F 5852 6561 6420 3143 0A5F 5845 7665 6E74 7351 2033
4130 0A58 4576 656E 7473 5175 2035 380A 4669 6E64 496E 7075 7420 3838 0A5F 5874
5761 6974 466F 2033 4543 0A58 7441 7070 4E65 7874 2031 3734 0A58 7441 7070 4D61
696E 2034 380A 3F3F 0A3F 3F0A 3F3F 0A3F 3F0A 3F3F 0A5F 5F73 7461 7274 2038 430A
Symptom Data
REPORTABLE
1
INTERNAL ERROR
0
SYMPTOM CODE
PCSS/SPI2 FLDS/kcp SIG/11 FLDS/fclose VALU/98 FLDS/__start


-------
Note:
-------


LABEL:          CONSOLE
IDENTIFIER:     7F88E76D

Date/Time:       Thu Mar 26 07:36:12 MET 2009
Sequence Number: 545
Machine Id:      00CC696E4C00
Node Id:         fint
Class:           S
Type:            PERM
Resource Name:   console

Description
SOFTWARE PROGRAM ERROR

Probable Causes
SOFTWARE PROGRAM

Failure Causes
SOFTWARE PROGRAM

        Recommended Actions
        REVIEW DETAILED DATA

Detail Data
USER'S PROCESS ID:
                192716
DETECTING MODULE
conwrite
FAILING MODULE
UIO_WRITE
RETURN CODE
           5
ERROR CODE
           0


-------
Note:
-------


LABEL:          LVM_MWCWFAIL
IDENTIFIER:     41BF2110

Date/Time:       Fri Jan 30 14:58:11 MET 2009
Sequence Number: 476
Machine Id:      00CC696E4C00
Node Id:         goofy
Class:           H
Type:            UNKN
Resource Name:   LVDD
Resource Class:  NONE
Resource Type:   NONE
Location:

Description
MIRROR WRITE CACHE WRITE FAILED

Detail Data
MAJOR/MINOR DEVICE NUMBER
8000 0011 0000 0001
BLOCK NUMBER
                     2
ERROR CODE AS DEFINED IN sys/errno.h
           5
SENSE DATA
00CC 696E 0000 4C00 0000 011D AE6F 59DF 00CC 696E 082E 051E 0000 0000 0000 0000


-------
Note:
-------

LABEL:          CORE_DUMP
IDENTIFIER:     C69F5C9B

Date/Time:       Tue Mar 31 02:00:41 ZOM 2009
Sequence Number: 203
Machine Id:      00CC696E4C00
Node Id:         goofy
Class:           S
Type:            PERM
Resource Name:   SYSPROC

Description
SOFTWARE PROGRAM ABNORMALLY TERMINATED

Probable Causes
SOFTWARE PROGRAM

User Causes
USER GENERATED SIGNAL

        Recommended Actions
        CORRECT THEN RETRY

Failure Causes
SOFTWARE PROGRAM

        Recommended Actions
        RERUN THE APPLICATION PROGRAM
        IF PROBLEM PERSISTS THEN DO THE FOLLOWING
        CONTACT APPROPRIATE SERVICE REPRESENTATIVE

Detail Data
SIGNAL NUMBER
          11
USER'S PROCESS ID:
                360584
FILE SYSTEM SERIAL NUMBER
          27
INODE NUMBER
       28967
CORE FILE NAME
/var/core/core.360584.31000041
PROGRAM NAME
BS_sear
STACK EXECUTION DISABLED
           0
COME FROM ADDRESS REGISTER

PROCESSOR ID
  hw_fru_id: 1
  hw_cpu_id: 8

ADDITIONAL INFORMATION
??
??
Unable to generate symptom string.


-------
Note:
-------


Q:

LABEL:          TS_NIM_ERROR_STUCK_
IDENTIFIER:     3D32B80D

Date/Time:       Sat Mar 28 16:30:19 MET 2009
Sequence Number: 1157
Machine Id:      00CC94EE4C00
Node Id:         vleet
Class:           S
Type:            PERM
Resource Name:   topsvcs

Description
NIM thread blocked

Probable Causes
A thread in a Topology Services Network Interface Module (NIM) process
was blocked
Topology Services NIM process cannot get timely access to CPU

User Causes
Excessive memory consumption is causing high memory contention
Excessive disk I/O is causing high memory contention

        Recommended Actions
        Examine I/O and memory activity on the system
        Reduce load on the system
        Tune virtual memory parameters
        Call IBM Service if problem persists

Failure Causes
Excessive virtual memory activity prevents NIM from making progress
Excessive disk I/O traffic is interfering with paging I/O

        Recommended Actions
        Examine I/O and memory activity on the system
        Reduce load on the system
        Tune virtual memory parameters
        Call IBM Service if problem persists

Detail Data
DETECTING MODULE
rsct,nim_control.C,1.39.1.22,5947
ERROR ID
6BUfAx.98Yn7/79N1Nr9GF0...................
REFERENCE CODE

Thread which was blocked
receive thread
Interval in seconds during which process was blocked
          86
Interface name
rhdisk4


A:

IZ02759: TOPOLOGY SERVICES NIM COMMAND RECEIVE THREAD TOO SENSITIVE TO NIM_ERROR_STUCK CAUSED BY A CLOCK CHANGE.
  

 A fix is available 
Obtain fix for this APAR
 
APAR status
Closed as program error.

Error description 
The Topology Services NIM threads have always complained of
being blocked if the clock is moved forward by 7-10 seconds or
more (depending on the thread).
   Error Label: TS_NIM_ERROR_STUCK_ER
      Error ID: 3D32B80D

Normally the length of time stuck reported is about the same as
the amount of time the clock was moved forward.  However lately
the Command Receive thread has been seen in some cases to report
10-12 seconds of blockage from a very small (1-2 second) change
in time.  This was first reported as a result of an xntpd clock
adjustment, but can be recreated with the date command.

   Thread which was blocked
   command receive thread
   Interval in seconds during which process was blocked
             11
   Interface name
   en4
Local fix 
ignore error.
Problem summary 
A recent change to certain internal timers in the command
receive thread of the Network Interface Modules makes that
thread complain of being stuck (for 10 seconds or more) when
the clock jumps for any interval of a second or more.
Problem conclusion 
The blockage threshold has been adjusted for the command
receive thread to eliminate this problem.

Note that all NIM threads will complain of being stuck when
the clock jumps for a significant time (at least 6 seconds,
greater than that in most cases), and that will not change
with this fix.  This only eliminates one of the threads
being much more sensitive to a time change than the rest.
Temporary fix 
Comments 
APAR information 
APAR number IZ02759 
Reported component name RSCT/RMC 
Reported component ID 5765F07AP 
Reported release 247 
Status CLOSED PER 
PE NoPE 
HIPER NoHIPER 
Submitted date 2007-08-10 
Closed date 2007-08-24 
Last modified date 2007-09-26 

APAR is sysrouted FROM one or more of the following:

APAR is sysrouted TO one or more of the following:
IZ03716
 

A:

I've seen something more on this topic.
I've seen that xnptd is running on our servers, but they're not syncronized.
So on one server I've stopped the xntpd service and executed manually ntpdate against the time server (a windows server).
The server has syncronised the date/hour, and some new "NIM thread blocked" errors have just appeared in errpt.


Q:

the problem is that there are so many "sometimes" in this situation

sometimes only a disk-heartbeat is blocked
sometimes only a network-heartbeat is blocked
sometimes both
sometimes there is one entry in errlog... i ignore it
sometimes there are 3-4 errors

2 days ago 2 nodes of the same cluster were starting to log those errors
every node reported nim_threads blocked for 40 seconds

finally they didnt see each other anymore... standby took over but primary didnt notice... 
when that situation went away a dms was triggered

this cluster is a 64 cpu p595
how can 4 (nim-)threads be blocked for 40 seconds on a system having 64 (physical) cpus???

it is NOT a ntp problem. time is in sync and is syncronized 2 times a day with ntpdate/cron

(you can easily trigger this error in errpt by giving a kill -17 to the hats-proc, waiting 30 seconds and give a kill -19 to it. so ntp could be a problem... but isnt)

once when this error came I was logged in one the node and hat a vmstat running

root@sbpsgava01:/root > errpt|head
IDENTIFIER TIMESTAMP T C RESOURCE_NAME DESCRIPTION
3D32B80D 1030182208 P S topsvcs NIM thread blocked
3D32B80D 1030182208 P S topsvcs NIM thread blocked

--> error at 18:22

now look at vmstat-output:

System configuration: lcpu=12 mem=28672MB ent=6.00
kthr memory page faults cpu time
----------- --------------------- --------------------------------- --- ------------------ ----------------------- --------
r b p avm fre fi fo pi po fr sr in sy cs us sy id wa pc ec hr mi se
1 1 0 4387395 2332220 53 35 0 0 68 821 69 10613 445 9 1 89 2 0.59 9.9 18:18:03
2 4 0 4387482 2332188 16 139 0 0 96 1035 205 6263 659 4 1 94 1 0.33 5.6 18:19:03
2 1 0 4389805 2330376 38 70 0 0 63 723 117 6636 513 10 1 87 1 0.71 11.9 18:20:03
3 1 0 4392562 2327118 12 47 0 0 28 314 79 7237 450 3 1 95 0 0.27 4.5 18:21:03
6 1 0 4388433 2331413 23 53 0 0 52 634 90 6105 499 3 1 95 1 0.29 4.8 18:22:03
5 1 0 4377480 2342374 46 46 0 0 74 1055 102 3373 551 4 1 94 1 0.32 5.3 18:23:03
2 1 0 4388646 2330929 132 56 0 0 156 2203 122 16122 596 12 1 82 4 0.84 13.9 18:24:03
2 1 0 4391073 2328497 81 44 0 0 104 1632 139 22069 647 13 2 82 3 0.94 15.7 18:25:03
1 1 0 4395142 2324464 108 30 0 0 119 1667 102 18325 564 13 1 82 4 0.87 14.4 18:26:03
2 1 0 4362831 2356799 88 34 0 0 104 1655 103 10019 503 7 1 88 3 0.51 8.5 18:27:03

6 physical cpu. 12 logical. only 6 running and 1 blocked process at 18:22.

94 % idle!!

so the often heard response from ibm to this problem (that goes "there were too much load on the system") 
cannot convince me


-----Original Message-----
From: IBM AIX Discussion List [mailto:aix-l@xxxxxxxxxxxxx] On Behalf Of Stefan.Gocke@xxxxxxxxxxx
Sent: Tuesday, February 24, 2009 2:38 PM
To: aix-l@xxxxxxxxxxxxx
Subject: Re: NIM thread blocked

Hello Holger,

this most ofthenly comes from the disk-heartbeat when backup runs.

Is the NIM-THREAD blocked from the disk-heartbeat or the LAN heartbeat?
Does it occur on all interfaces? then it's time to really do something.

And there is an old error in some releases/ptfs of HACMP that had a problem when the automatic 
cluster verify runs. I've seen cluster where 400 hdisks had this error. It happend when the automatic 
verification from the node NOT haveing the the disks ran verification. That was an error in programming, 
not a real error. If you ran manual verification it didn't happen.

When this occurs, the system did not give the CPU to the heartbeat process to write it's heartbeat, 
because a higher priority thread was blocking access to that device. If it happens often - 
I normally suggest to add another fiberchannel adapter (if disk heartbeat).

As long as all other hearbeats are working normally and if it's just sporadic .... 
ignore for now and monitor that it doesn't happen too often.

Regards. Stefan

--

-----Original Message-----
Date: Tue, 24 Feb 2009 13:35:24 +0100
Subject: NIM thread blocked
From: Holger van Koll <Holger.vanKoll@xxxxxxxxxxxx>
To: aix-l@xxxxxxxxxxxxx

Hello, on about 60 systems (proably all that have hacmp
running) I get entries in errpt like these: 3D32B80D 16-02-09
00:05 P S topsvcs NIM thread blocked Details in errlog tell that those 
nim-threads (one per heartbeat) have been blocked for a certain amount of time, 
can be 5 seconds, can be 50. When I look at performance-logging tools 
(like patrol or even simple vmstat commands that were running) I see that 
those commands have been blocked for approximately the same amount of time. 
So, something on some of my nodes prevents tasks to be executed. The nodes vary from 
64 cpu p595 to partitions with 0.5 cpu. The errors come without and regularity. 
One night 5 come. Then its quiet for days or weeks. Does anybody have an idea or 
at least a similar situation? Regs, 


A:


-------
Note:
-------


-------
Note:
-------


-------
Note:
-------


-------
Note:
-------


-------
Note:
-------


-------
Note:
-------


-------
Note:
-------


##############################################################

SECTION 15: Filesystems and Logical Volume Management:

##############################################################


Traditional filesystems in Solaris:
===================================


>>>> A few traditional filesystem commands:
===========================================

The UFS filesystem has always been the most popular fs on Solaris.
Ofcourse, when the newer ZFS filesystem became available, it has been rapidly adopted.

We will frst take a look at a few classical commands, that you would typically use on a UFS filesystem.
Ofcourse, many "listing commands" like for example, df (to show what's used and what is free space), 
can be used on ZFS as well. But creating an fs on ZFS goes absolutly different from what you can find in section 29.1


Checks on the filesystems in Solaris:
-------------------------------------

1. used space etc.. 
#  df -k, df -h etc..

# du -ks /home/fred 

Shows only a summary of the disk usage of the /home/fred subdirectory (measured in kilobytes).

# du -ks /home/fred/* 

Shows a summary of the disk usage of each subdirectory of /home/fred (measured in kilobytes).

# du -s /home/fred

Shows a total summary of /home/fred

# du -sg /data

Shows a total summary of /data in GB


This command shows the diskusage of /dirname in GB
# du -g /dirname

2. examining the disklabel
#  prtvtoc /dev/rdisk/c0t3d0s2

3. format just by itself shows the disks
#  format

#  format -> specify disk -> choose partition -> choose print to get the partition table

4. Display information about SCSI devices

# cfgadm -al

or, from the PROM, commands like probe-scsi


What is the CDROM device in Solaris:
------------------------------------

-- pointer 1.

If you have a CD put in the drive, and it was automounted, simply use the "df" command to view your filesystems:

# df -k    or df -h

-- pointer 2.

From the output of the command

# iostat -En

you could figure out what logical device name your CDROM has.

-- pointer 3.

Solaris uses the same naming conventions as used with hardisks, for example the CDROM in the following command

# mount -r -F hsfs /dev/dsk/c0t6d0s2 /cdrom

means that in this case, the CDROM device is "/dev/dsk/c0t6d0s2"
Normally, a CD is automounted on "/cdrom" or "/cdrom/cdrom0"

The simplest way to mount CDROM on Solaris is use vold daemon.  The vold daemon in Solaris manages the CD-ROM device 
and automatically performs the mounting similar to how Windows manages CDROMs (but not as transparent or reliable). 
If CD is detected in drive its should be  automatically mounted to the /cdrom/cdrom0 directory. 


Recovering disk partition information in Solaris:
-------------------------------------------------

Use the fmthard command to write the backup VTOC information back to the disk.
The following example uses the fmthard command to recover a corrupt label on a disk
named /dev/rdisk/c0t3d0s1. The backup VTOC information is in a file named c0t3d0
in the /vtoc directory.

# fmthard -s /vtoc/c0t3d0s0 /dev/rdsk/c0t3d0s2

Remember that the format of /dev/(r)dsk/cWtXdYsZ means:

W is the controller number,
X is the SCSI target number,
Y is the logical unit number (LUN, almost always 0),
Z is the slice or partition number

Make a new filesystem in Solaris:
---------------------------------

To create a UFS filesystem on a formatted disk that already has been divided into slices
you need to know the raw device filename of the slice that will contain the filesystem.
Example:

# newfs /dev/rdsk/c0t3d0s7

defaults on UFS on Solaris: 
blocksize 8192
fragmentsize 1024
one inode for each 2K of diskspace

FSCK in Solaris:
----------------

If you just want to determine the state of a filesystem, whether it needs checking, 
you can use the fsck command while the fs is mounted.
Example:

# fsck -m /dev/rdsk/c0t0d0s6

The state flag in the superblock of the filesystem you specify is checked to see
whether the filesystem is clean or requires checking.

If you ommit the device argument, all the filesystems listed in /etc/vfstab  with a fsck 
pass value greater than 0 are checked.


Adding a disk in Solaris 2.6, 2.7, 8, 9:
----------------------------------------

In case you have just build in a new disk,
its probably best, to first use the probe-scsi command from the OK prompt:

ok probe-scsi
..
Target 3
 Unit 0  Disk   Seagate ST446452W   0001
..

Next, do a reconfiguration reboot, with the "boot -r" command:

ok boot -r

Specifying the -r flag when booting, tells Solaris to reconfigure itself by scanning
for new hardware.
Once the system is up, check the output for "dmesg" to find kernel messages relating
to the new disk.
You probably find complaints telling you stuff as "corrupt label - wrong magic number" etc..
That's good, because we now know that the kernel is aware of this new disk.

In this example, our disk is SCSI target 3, so we can refer to the whole disks as
/dev/rdsk/c0t3d0s2           # slice 2, or partition 2, s2 refers to the whole disk


Remember that the format of /dev/(r)dsk/cWtXdYsZ means:

W is the controller number,
X is the SCSI target number,
Y is the logical unit number (LUN, almost always 0),
Z is the slice or partition number


We now use the format program to partition the disk, and afterwards create filesystems.

# format /dev/rdsk/c0t3d0s2
(.. output..)
FORMAT MENU:

format>label
Ready to label disk, continue? y

format>partition 
PARTITION MENU:

partition>

Once you have created and sized the partitions, you can get a list with the "partition>print" command.

Now, for example, you can create a filesystem like in the following command:

# newfs /dev/rdsk/c0t3d0s0


devfsadm:
---------

As from Solaris 8:

devfsadm(1M) maintains the /dev and /devices namespaces. It replaces the previous suite of devfs administration tools 
including drvconfig(1M) , disks(1M) , tapes(1M) , ports(1M) , audlinks(1M) , and devlinks(1M) .

The default operation is to attempt to load every driver in the system and attach to all possible device instances. devfsadm then creates 
device special files in /devices and logical links in /dev .

In other words, the devfsadm command is used to dynamically reconfigure system device tables
without having to reboot the system.

Examples:

# devfsadm -i sd
# devfsadm -c tape

In the first example, devfsadm configures only those devices supported by the
sd driver. In the second example, devfsadm configures only tape devices.


>>>> Other notes on filesystems on Solaris:
===========================================

There are at least 4 different types of filesystems you can use with Solaris 10 (except for zfs, 
for the older Solaris 8 and 9 versions).
These are:

-- UFS
The traditional filesystem for Solaris systems. UFS is old technology but it is a stable and fast filesystem. 
Sun has continuously tuned and improved the code over the years.
Solaris 10 (and older ofcouse) can only boot from a UFS root filesystem. In the future, 
ZFS boot will be available, as it already is in OpenSolaris. But for now, every Solaris system must have 
at least one UFS filesystem.
Note: This "boot-statement" was true at the time of writing. Maybe you read this way after that time, and maybe
Solaris can now boot from zfs or other filesystem.

-- ZFS
We will talk a bit on ZFS in section 29.3

-- VxFS
The Veritas filesystem and volume manager have their roots in a fault-tolerant proprietary minicomputer 
built by Veritas in the 1980s. They have been available for Solaris since at least 1993 and have been 
ported to AIX and Linux. They are integrated into HP-UX and SCO UNIX, and Veritas Volume Manager code 
has been used (and extensively modified) in Tru64 UNIX and even in Windows. 
VxFS has never been part of Solaris but, when UFS was the only option, it was a popular addition. 
VxVM and VxFS are tightly integrated. Through vxassist, one may shrink and grow filesystems and their 
underlying volumes with minimal trouble. 

VxFS can run in single instance mode or in a parallel access/cluster file system mode. 
This latter mode allows for multiple servers (also known as cluster nodes) to simultaneously access 
the same file system. When run in this mode, VxFS is referred to as VERITAS Cluster File System. 
Cluster File System provides cache coherency and POSIX compliance across nodes, so that data changes 
are atomically seen by all cluster nodes simultaneously. Because Cluster File System shares the same 
binaries and same on-disk layout as single instance VxFS, moving between cluster and single instance mode 
is straightforward.


-- SAM and QFS
QFS is Sun's cluster filesystem, meaning that the same filesystem may be simultaneously mounted 
by multiple systems. SAM is a hierarchical storage manager; it allows a set of disks to be used 
as a cache for a tape library. SAM and QFS are designed to work together, but each may be used separately. 

-- PCFS
It's even possible to use the DOS FAT filesystem.

-- HSFS
Ofcourse, the CDROM HSFS can be used.

Maybe the following list will show you what can be used in Solaris:

Filesystem 	Type 	Device 		Description 
UFS 		Regular Disk 		Unix Fast filesystem; default in Solaris
ZFS 		Regular	Disk		The new Regular FS in Solaris 10 
VxFS 		Regular Disk 		Veritas filesystem 
QFS 		Regular Disk 		QFS filesystem from LSC Inc. 
pcfs 		Regular Disk 		MSDOS FAT and FAT32 filesystem 
hsfs 		Regular Disk 		High Sierra filesystem (CDROM) 
tmpfs 		Regular Memory 		Uses memory and swap 
nfs 		Pseudo 	Network 	Network filesystem 
cachefs 	Pseudo 	filesystem 	Uses a local disk as cache for another NFS filesystem 
autofs 		Pseudo 	filesystem 	Uses a dynamic layout to mount other filesystems 
specfs 		Pseudo 	Device drivers 	filesystem for the /dev devices 
procfs 		Pseudo 	Kernel 		/proc filesystem representing processes 
sockfs 		Pseudo 	Network		Filesystem of socket connections 
fifofs 		Pseudo 	Files 		FIFO filesystem 

If we look at the regular disk based filesystems, the following can be said on the "allocation format":

Filesystem 	Allocation format 
UFS 		Block, allocator tries to allocate sequential blocks 
VxFS 		Extent based 
QFS 		Extent based 
ZFS		Extent based


>>>> Some notes on the ZFS filesystem. Solaris 10 
==================================================


>>> ZFS Pooled Storage:
-----------------------

ZFS uses the concept of storage pools to manage physical storage. Historically, file systems were constructed on top of a single physical device. 
To address multiple devices and provide for data redundancy, the concept of a "logical volume manager", LVM, was introduced to provide for Volume Groups,
and Logical Volumes (which could span multiple disks), and then add a filesystem on such a Logical Volume. This design added another layer 
of complexity and ultimately prevented certain file system advances, because the file system had no control over the physical placement 
of data on the virtualized volumes. 

ZFS eliminates the volume management altogether. Instead of forcing you to create virtualized volumes, ZFS aggregates devices into a storage pool. 
The storage pool describes the physical characteristics of the storage (device layout, data redundancy, and so on,) and acts as an arbitrary data store 
from which file systems can be created. File systems are no longer constrained to individual devices, allowing them to share space with all file systems 
in the pool. You no longer need to predetermine the size of a file system, as file systems grow automatically within the space allocated to the storage pool. 
When new storage is added, all file systems within the pool can immediately use the additional space without additional work. In many ways, 
the storage pool acts as a virtual memory system. When a memory DIMM is added to a system, the operating system doesn't force you to invoke some commands 
to configure the memory and assign it to individual processes. All processes on the system automatically use the additional memory.

Everything you hate about managing file systems and volumes is gone: you don't have to use format, and create slices/partitions, use newfs, mount, edit /etc/vfstab, 
fsck, growfs, metadb, metainit, etc.

Meet your new best friends: zpool and zfs.

ZFS is easy, so let's get on with it! It's time to create your first pool: 

# zpool create tank c1t2d0

You now have a single-disk storage pool named tank, with a single file system mounted at /tank. There is nothing else to do.
Yes, its really true: 
The new ZFS file system, tank, can use as much of the disk space as needed, and is automatically mounted at /tank.

You can determine if your pool was successfully created by using the zpool list command. 

# zpool list
NAME                    SIZE    USED   AVAIL    CAP  HEALTH     ALTROOT
tank                     80G    137K     80G     0%  ONLINE     - 


Suppose we create a file in /tank and want to see how things looks like:
# mkfile 100m /tank/foo
# df -h /tank
Filesystem             size   used  avail capacity  Mounted on
tank                   80G   100M    80G     1%    /tank


If you want mirrored storage for mail and home directories, that's easy too:

Create the pool:

# zpool create tank mirror c1t2d0 c2t2d0

Now lets try to create the "/var/mail" file system:

# zfs create tank/mail
# zfs set mountpoint=/var/mail tank/mail

Create home directories, and mount them all in /export/home/<username>:

# zfs create tank/home
# zfs set mountpoint=/export/home tank/home


At this point, we have "/export/home" present.
Now you could even do this:

# zfs create tank/home/ahrens

ZFS file systems are hierarchical: each one inherits properties from above. In this example, the mountpoint property is inherited 
as a pathname prefix. That is, tank/home/ahrens is automatically mounted at /export/home/ahrens because tank/home is mounted at /export/home. 
You don't have to specify the mountpoint for each individual user - you just tell ZFS the pattern.


>>> Commit and Rollback semantics:
----------------------------------

ZFS uses a commit and rollback mechanism, to ensure that all data is written completely, and if not, everything is rolled back.
You probably know that with former filesystems, that you could choose 
- for a filesystem without journaling (logging)
- or indeed use journaling (or logging).

Now you have a third option: using a transactional filesystem, like zfs.

ZFS is a transactional file system, which means that the file system state is always consistent on disk. Traditional file systems (with no logging) 
overwrite data in place, which means that if the machine loses power, for example, between the time a data block is allocated and 
when it is linked into a directory, the file system will be left in an inconsistent state. Historically, this problem was solved through the use 
of the fsck command. This command was responsible for going through and verifying file system state, making an attempt to repair any inconsistencies 
in the process. This problem sometimes caused great pain to administrators and was never guaranteed to fix all possible problems. 

More recently, file systems have introduced the concept of journaling. The journaling process records action in a separate journal, 
which can then be replayed safely if a system crash occurs. This process introduces unnecessary overhead, because the data needs 
to be written twice, and often results in a new set of problems, such as when the journal can't be replayed properly. 

With a transactional file system, data is managed using copy on write semantics. Data is never overwritten, and any sequence of operations 
is either entirely committed or entirely ignored. This mechanism means that the file system can never be corrupted through accidental 
loss of power or a system crash. So, no need for a fsck equivalent exists. While the most recently written pieces of data might be lost, 
the file system itself will always be consistent. In addition, synchronous data (written using the O_DSYNC flag) is always guaranteed 
to be written before returning, so it is never lost.


>>> Unparalleled Scalability:
-----------------------------

ZFS has been designed from the ground up to be a very scalable file system. The file system itself is 128-bit, allowing for 256 quadrillion zettabytes 
of storage. All metadata is allocated dynamically, so no need exists to pre-allocate inodes or otherwise limit the scalability 
of the file system when it is first created. All the algorithms have been written with scalability in mind. 
Directories can have up to 248 (256 trillion) entries, and no limit exists on the number of file systems or number of files 
that can be contained within a file system.


>>> Some more examples:
-----------------------

-- To give user ahrens a 10G quota:

# zfs set quota=10g tank/home/ahrens

-- To give user bonwick a 100G reservation (membership has its privileges):

# zfs set reservation=100g tank/home/bonwick

-- To automatically NFS-export all home directories read/write:

# zfs set sharenfs=rw tank/home

-- To scrub all disks and verify the integrity of all data in the pool:

# zpool scrub tank

-- To replace a flaky disk:

# zpool replace tank c2t2d0 c4t1d0

-- To add more space:

# zpool add tank mirror c5t1d0 c6t1d0

-- To move your pool from SPARC machine 'sparky' to AMD machine 'amdy':

[on sparky]
    # zpool export tank

Physically move your disks from sparky to amdy.

[on amdy]
    # zpool import tank


-- Determining if Problems Exist in a ZFS Storage Pool

The easiest way to determine if any known problems exist on the system is to use the "zpool status x" command. 
This command describes only pools exhibiting problems. If no bad pools exist on the system, 
then the command displays a simple message, as follows:

# zpool status -x

all pools are healthy

Without the x flag, the command displays the complete status for all pools (or the requested pool, if specified on the command line), 
even if the pools are otherwise healthy. 


-- Understanding zpool status Output
The complete zpool status output looks similar to the following:

# zpool status tank
  pool: tank
 state: DEGRADED
status: One or more devices has been taken offline by the administrator.
        Sufficient replicas exist for the pool to continue functioning in a
        degraded state.
action: Online the device using 'zpool online' or replace the device with
        'zpool replace'.
 scrub: none requested
 config:

        NAME         STATE     READ WRITE CKSUM
        tank         DEGRADED     0     0     0
          mirror     DEGRADED     0     0     0
            c1t0d0   ONLINE       0     0     0
            c1t1d0   OFFLINE      0     0     0

errors: No known data errors


>>>> Some examples on VxFS:
===========================


Example 1:
----------

# mkfs -F vxfs /dev/vx/rdsk/testdg/msvol1 200m
version 4 layout
409600 sectors, 204800 blocks of size 1024, log size 1024 blocks
unlimited inodes, largefiles not supported
204800 data blocks, 203656 free data blocks
7 allocation units of 32768 blocks, 32768 data blocks
last allocation unit has 8192 data blocks

Example 2:
----------

We are going to show how to create a mirroring volume and a stripping volume on Veritas Storage Foundation.
on Solaris 10.

The first step is to check quantity of disks you have available on the server. 
A simple way to check this on solaris is using format utility:

bash-3.00# format

Searching for disks.done

AVAILABLE DISK SELECTIONS:

0. c1t0d0 <DEFAULT cyl 4092 alt 2 hd 128 sec 32>
/pci@0,0/pci15ad,1976@10/sd@0,0

1. c1t1d0 <DEFAULT cyl 7 alt 2 hd 64 sec 32>
/pci@0,0/pci15ad,1976@10/sd@1,0

2. c1t2d0 <DEFAULT cyl 7 alt 2 hd 64 sec 32>
/pci@0,0/pci15ad,1976@10/sd@2,0

3. c1t3d0 <DEFAULT cyl 2 alt 2 hd 64 sec 32>
/pci@0,0/pci15ad,1976@10/sd@3,0

Also, you can check disks available to Veritas Storage Foundation using vxdisk command:

bash-3.00# vxdisk -o alldgs list

DEVICE TYPE DISK GROUP STATUS

c1t0d0s2 auto:none - - online invalid
c1t1d0s2 auto:none - - online invalid
c1t2d0s2 auto:none - - online invalid
c1t3d0s2 auto:none - - online invalid

You can see above that there are 4 disks on the server that are available to Veritas but they have not yet 
been initialized by Veritas (invalid status). To use a disk on Veritas SF you need to initialize this 
using Veritas utilities.

NOTE: If you are going to use a disk on Veritas, pay attention that you should give this whole disk to Veritas. 
Disk will be formatted and you will lose all data in the disk when you are allocating a disk to Veritas Storage.

In this example the only disk that is in use for O.S Solaris is the first one. (c1t0d0s2).

We can use those 3 others disks to add on Veritas Storage.

Caution: If for a mistake we add the first disk (c1t0d0s2) to Veritas Storage, it will format 
the disk and erase Solaris info. We need to pay attention to get the right disks.

Let's start allocating (initializing) those 3 disks to solaris:

# vxdisksetup -i c1t1d0
#
# vxdisksetup -i c1t2d0

# vxdisksetup -i c1t3d0

We have those 3 disks initialized on Veritas, then the next step is to create a Disk Group.

>>> Disk Group

Disk Group is a collection of disks. Disk Group is very useful for management and isolation purpose.
Lets create a DG using only the fist disk initialized on Veritas (c1t1d0). 
We are using DG1 for the name of Disk Group.

# vxdg init DG1 c1t1d0

Check if  DG1 was created successfully:

# vxdg list

NAME STATE ID

DG1 enabled,cds 1218633322.13.vrt2

Also, check if the disk is properly assigned to DG1:

# vxdisk -o alldgs list

DEVICE TYPE DISK GROUP STATUS

c1t0d0s2 auto:none - - online invalid
c1t1d0s2 auto:cdsdisk c1t1d0 DG1 online
c1t2d0s2 auto:cdsdisk - - online
c1t3d0s2 auto:cdsdisk - - online

Let's add more 2 disks to DG1:

# vxdg -g DG1 adddisk c1t2d0s2 c1t3d0s2

Check if the disks are properly assigned to DG1:

# vxdisk -o alldgs list

DEVICE TYPE DISK GROUP STATUS

c1t0d0s2 auto:none - - online invalid
c1t1d0s2 auto:cdsdisk c1t1d0 DG1 online
c1t2d0s2 auto:cdsdisk c1t2d0 DG1 online
c1t3d0s2 auto:cdsdisk c1t3d0 DG1 online

At this point we have added 3 disks into Disk Group DG1. 

Next step we will create 2 different volumes in the DG1.

>>> Volumes

A volume is a virtual storage that is used as an physical disk. Volume can be composed by many disks 
and have many layouts.

In this example, we are going to create two Volumes:

Volume VolS - Stripping layout using c1t1d0 and c1t2d0 disks (RAID 0).
Volume VolM - Mirroring layout using c1t2d0 and c1t3d0 (RAID 1).

-- To create a Stripping Volume VolS (Size=10m):

# vxassist -g DG1 make VolS 10m layout=stripe c1t1d0s2 c1t2d0s2

To check if volume VolS was created successfully:

# vxprint -g DG1

TY NAME ASSOC KSTATE LENGTH PLOFFS STATE TUTIL0 PUTIL0

dg DG1 DG1 - - - - - -

dm c1t1d0 c1t1d0s2 - 159488 - - - -
dm c1t2d0s2 c1t2d0s2 - 159488 - - - -
dm c1t3d0s2 c1t3d0s2 - 159488 - - - -


v VolS fsgen ENABLED 20480 - ACTIVE - -
pl VolS-01 VolS ENABLED 20480 - ACTIVE - -
sd c1t1d0-01 VolS-01 ENABLED 10240 0 - - -
sd c1t2d0s2-01 VolS-01 ENABLED 10240 0 - - -


-- To create a Mirroring Volume VolM (Size=10m):

# vxassist -g DG1 make VolM 10m layout=mirror c1t2d0s2 c1t3d0s2

To check if Volume VolM was created successfully:

# vxprint -g DG1

TY NAME ASSOC KSTATE LENGTH PLOFFS STATE TUTIL0 PUTIL0

dg DG1 DG1 - - - - - -

dm c1t1d0 c1t1d0s2 - 159488 - - - -
dm c1t2d0s2 c1t2d0s2 - 159488 - - - -
dm c1t3d0s2 c1t3d0s2 - 159488 - - - -

v VolM fsgen ENABLED 20480 - ACTIVE - -
pl VolM-01 VolM ENABLED 20480 - ACTIVE - -
sd c1t3d0s2-01 VolM-01 ENABLED 20480 0 - - -

pl VolM-02 VolM ENABLED 20480 - ACTIVE - -
sd c1t2d0s2-02 VolM-02 ENABLED 20480 0 - - -

v VolS fsgen ENABLED 20480 - ACTIVE - -
pl VolS-01 VolS ENABLED 20480 - ACTIVE - -
sd c1t1d0-01 VolS-01 ENABLED 10240 0 - - -
sd c1t2d0s2-01 VolS-01 ENABLED 10240 0 - - -

Note: You can see above that both Volumes were created successfully. Also, you can note the difference 
between stripping and mirroring volume layouts. 

VolM is using two different Plex in differente disks. This means that if you lose one disk (Plex) 
you still have the data in the other disk (other Plex). It is the main configuration of Mirroring Volumes.

VolS is using only one Plex divided in 2 disks. This means that the data will be split in those 2 disks. 
If you lose one disk you would lose the whole Plex, therefore you would lose the data. 
This is the main configuration of Stripping Volumes. It does not provide data protection but it is very useful 
for performance for purpose.

Also, you can add those 2 layouts in only one layout that provide data protection and better performance. 
It is the case of RAID 0 + 1 or RAID 1 + 0.

In the next step we will create 2 different Filesystem using those 2 Volumes.

>>> Filesystem

In this example we will create two filesystem:

- Filesystem fsS will use VolS. It will be mounted at /stripe mount point.
- Filesystem fsM will use VolM. It will be mounted at /mirror mount point.

To create a VxFS filesystem:

# mkfs -F vxfs /dev/vx/rdsk/DG1/VolS

version 7 layout

20480 sectors, 10240 blocks of size 1024, log size 1024 blocks
largefiles supported

# mkfs -F vxfs /dev/vx/rdsk/DG1/VolM

version 7 layout

20480 sectors, 10240 blocks of size 1024, log size 1024 blocks

largefiles supported

To mount a VxFS filesystem:

# mount -F vxfs /dev/vx/dsk/DG1/VolS /stripe/
# mount -F vxfs /dev/vx/dsk/DG1/VolM /mirror/

Now there are 2 filesystems configured and you can use it at Solaris Mount Point level.

Any data written in /stripe directory will be written in the stripping VolS volume.
Any data written in /mirror directory will be written in the mirroring VolM volume.


Example 3:
----------

Rather than mess with vxmake  you can employ vxassist to do all the dirty work. If you have any amount of experience with vxassist 
you'll know that the more information you can supply to vxassist the better the end product will be. 

I'm going to use vxassist to build a stripe-pro volume from four disks and I want the volume to be 1G in size:

# vxassist -g testdg make stripeprovol 1g  layout=stripe-mirror \
			testdg01 testdg02 testdg03 testdg04


Pretty kool, huh? Quick, efficient, and poorly named; everything you love about vxassist. I can then go a bit further 
and explore my sizing options to see how much I can grow my new volume if I need to:

# vxassist -g testdg maxgrow stripeprovol

Volume stripeprovol can be extended by 282050560 to 284147712 (138744Mb)

See? Just like a normal volume. Now comes the beauty part. When you look at that seemingly unmanageable mess of objects above 
does it really make you want to tear it apart and work on it like you might other "normal" volumes? Probably not. And you'd be wise 
to feel that way, there are just too many places to get confused or make a mistake when real data is involved. What if you could get back 
to a more normal point of view? Luckily you can, check this out:

# vxassist -g testdg convert stripeprovol layout=mirror-stripe


Veritas terminology:

In a "typical" RAID0+1 volume configuration, we take several disks and then create a stripe across thoughs disks (the RAID0 part). 
Then once complete we do this again on a separate set of disks, and then attach that new stripe to the first creating a mirror (the +1 part). 
We then have a RAID0+1 volume thats ready to have a filesystem put on it. The point of interest with this setup is that we're actually 
mirroring a complete stripe (and therefore ALL the disks in that stripe) to another stripe (and therefore ALL of it's disks). 
The problem here is that if for some reason we need to re-sync the volume we'd need to re-sync a full stripe to a full stripe (very timely) 
which is a nearly tragic proposition if your talking about 50G+. A far more efficient setup would be to mirror each disk to each disk... 
in other words, to mirror a bunch of disks on a one-to-one basis, and then build a stripe on top of these mirrors. In this case if we need 
to re-sync due to a disk failure we can simply sync the failed disk to its mirror, instead of the full stripe. This is the power of RAID1+0; 
the difference between mirroring the stripes (0+1) and stripping the mirrors (1+0).

If the terms seem to confuse you, try this for size:

RAID0	Striping (VxVM says: stripe)
RAID1	Mirroring (VxVM says: mirror)
RAID0+1 Striping plus Mirroring (VxVM says: mirror-stripe)
	Think this: Striped disks, then mirror the stripes
RAID1+0 Mirroring plus Striping (VxVM says: stripe-mirror) 
	(Veritas Marketing Dept says: StripePro
	Think this: Mirrored disks, then stripe on top of the mirrors
Concat+Mirror	Concatenation plus Mirroring (VxVM says: mirror)
		Same as RAID1
Mirror+Concat	Mirroring plus Concatenation (VxVM says: concat-mirror)
		(Veritas Marketing Dept says: ConcatPro)
		Think this: Concatenation on top of mirrored disks.


Veritas Default diskgroup: rootdg

Default rootdg disk group. 
 Block Device Node /dev/vx/dsk/volume_name 
 Raw Device Node /dev/vx/rdsk/volume_name 
Other DiskGroups 
 Block Device Node /dev/vx/dsk/diskgroup_name/volume_name 
 Raw Device Node /dev/vx/rdsk/diskgroup_name/volume_name 
 

Example 4:
----------

Some more examples:

Create Veritas layout on a disk:
	vxdisksetup -i c1t10d0

Create a disk group on a new disk:
	vxdg init <dg name> <media name>=c1t10d0

Add disk to an existing disk group:
	vxdg -g <dg name> adddisk <media name>=c2t0d0
 	replace addisk with rmdisk to remove a disk

Set up a preferred reading plex, this can be useful if we have a sparse plex (plex in RAM):
	vxvol -g <group> rdpol prefer <volname> <plexname>
	instead of prefer we can have round or sdeet

View configuration:
	vxprint -th
List disks:
	vxdisk list
	vxdisk -o alldgs list (shows deported disks)

Adding disks while solaris is running:
	drvconfig	(This probes scsi - Solaris)
	disks		(Creates links in /dev - Solaris)
	prtvtoc		(View the vtoc - Solaris)
	vxdctl enable	(Rescan for disks - Veritas)
	vxdisk list	(Shows the disk in error as they are not initalized jet)
	vxdisksetup  	(init the disks)

To encapsulate use:
 	vxencap -g <discgroup> <devicename>

Export a disk group:
	vxdg deport <dg name>
	vxdg -h <hostame> deport <dgname> to export to another host

Import a disk group:
	vxdg import <dg name>
	vxdg -C to clear hostid of old host (When failing over in DR situation)
	vxdg -fC to clear hostid of old host and forcing diskgroup online

Destroy a disk group:
	vxdg destroy <disk group>

Evacuate data from a disk:
	vxevac -g <dg name> <from disk> <to disks>

Create a volume on a diskgroup:
	vxassist -g <dg name> make <volname> <size> layou=stripe
	ncols=number of colums stripeunit=size

Create a veritas filesystem on this volume:
        mkfs -F vxfs /dev/vx/rdsk/<disk group>/<volume> <size>

Delete a volume	same as creatiuon but replace make with remove

Resize a filesystem:
        vxresize -g <disk group> -F <fstype> <volume> <size>

If Veritas is ever causing you problems, do the following:
	Touch /etc/vx/reconfig.d/state.d/install-db
	edit /etc/system and modify /etc/vfstab 
	to disable VRTS to start up and access the old root
	partitions


vxassist make martin 100m
makes a volume called martin using any disk

vxassist make martin 100m disk10
makes a volume called martin using disk10

vxassist make martin 100m layout=stripe disk07 disk08
creates a 100mb striped volume called martin using disks7 and 8

vxassist mirror martin disk05 disk06
uses disks5 and 6 ro make a mirror on volume called martin

vxassist make martin 50m layout=mirror
makes a 50Mb mirror using any 2 disks

vxassist make martin 50m layout=mirror disk05 disk06
makes a 50mb mirror using disks 5 and 6

vxassist make martin 50m layout=mirror,stripe disk05 disk06 disk07 
disk08
makes a 50Mb stripe using disks5 and 6 mirrored across 7 and 8

vxassist make martin 50m layout=mirror,stripe,log disk05 disk06 disk07 
disk08
makes a 50Mb stripe using disks5 and 6 mirrored across 7 and 8 and uses 
a 
log subdisk

vxassist make martin 100m layout=raid5
makes a 100m raid5 volume

/usr/sbin/vxedit -g rootdg rename disk12 disk09 
to rename disk12 to disk09 in the rootdg

vxedit rm disk10 
to remove a greyed out or obsolete disk in this case disk10
or to remove a disk from a diskgroup

vxdisk list - to list all disks under vmcontrol 

vxdisk clearimport c#t#d#s#
to allow a disk to be imported after a server crash

vxdg -g razadg rmdisk test
to remove a disk called test from a dg called razadg

vxdg -g razadg adddisk test=c1t3d3  
to add disk c1t3d3 to a dg called razadg calling the disk test, use 
vxdisk list
to determine what disks are free :)

vxedit -g rootdg set spare=on disk09
sets disk09 in the rootdg as a hotspare.


vxmirror rootdisk disk01
mirrors all the volumes on the root disk to disk01

vxassist -g rootdg mirror vol01 disk03
mirrors vol01 (in rootdg) to disk03


vxassist mirror martin

will mirror the volume martin


to make a mirror manually try

 /usr/sbin/vxmake -g rootdg sd disk03-01 dm_name=disk03 dm_offset=0 
 len=81920 
 to create a subdisk on disk03 callin the subdisk disk03-01 the len 
 81920 is
 81920sectors x 512bytes =40M 

 vxmake plex martin-02 sd=disk03-01
 creates a plex called martin-02 using subdisk disk03-01

 vxplex att martin martin-02
 attaches the plex martin-02 to volume martin

 to list all volumes on your primary boot disk enter
 vxprint -t -v -e 'aslist.aslist.sd_disk="boot_disk_name"'


 vxsd mv disk03-01 disk05-01
 moves the contents of subdisk disk03-01 to disk05-01
 then moves  subdisk disk05-01 into the plex where subdisk disk03-01
 once lived, leaving disk03-01 to your mercy :)


 to make a subdisk

 vxmake sd disk02-02 disk02,0,8000
 this would create a subdisk called disk02-02 at the start of disk02
 and would be 8000blocks (4000k) long.
 if you wanted to create another subdisk on this disk the offset would 
 be
 8000 as this is where the next free space would be onthe disk so...
 vxmake sd disk02-02 disk02,8000,8000 would create another 8000block
 subdisk.


 vxdisk rm c#t#d#s2
 to remove a disk so it's out of vm control

 vxdiskadd c#t#d#
 to add bring a new disk under vm control

 or you can try...
 vxdisksetup -i c#t#d#  

 vxvol -g dg volname stop
 this stops a volume

 vxedit -rf rm martin
 removes a volume called martin and plex(es) and subdisks though

 vxprint -ht volume


>>>> AIX devices:
=================

In AIX 5.x, the device configuration information is stored in the ODM repository. The corresponding files
are in 

/etc/objrepos
/usr/lib/objrepos
/usr/share/lib/objrepos


There are 2 sections in ODM:
- predefined: all of the devices in principle supported by the OS
- customized: all devices already configured in the system

Every device in ODM has a unique definition that is provided by 3 attributes:

1. Type
2. Class
3. Subclass


Information thats stored in the ODM:

- PdDv,PdAt, PdCn   :  Predefined device information
- CuDv, CuAt, CuDep :  Customized device information
- lpp, inventory    :  Software vital product data
- smit menu's
- Error log, alog, and dump information
- System Resource Controller: SRCsubsys, SRCsubsrv
- NIM: nim_attr, nim_object, nim_pdattr


There are commands, representing an interface to ODM, so you can add, retrieve, drop and change objects.
The following commands can be used with ODM:

odmadd, 
odmdrop, 
odmshow, 
odmdelete, 
odmcreate, 
odmchange

Examples:

# odmget -q "type LIKE lv*" PdDv
# odmget -q name=hdisk0 CuAt


Logical devices and physical devices:
-------------------------------------

AIX includes both logical devices and physical devices in the ODM device configuration database.
Logical devices include Volume Groups, Logical Volumes, network interfaces and so on.
Physical devices are adapters, modems etc..


Most devices are selfconfiguring devices, only serial devices (modems, printers) are not selfconfigurable.

The command that configures devices is "cfgmgr", the "configuration manager".
When run, it compares the information from the device with the predefined section in ODM.
If it finds a match, then it creates the entries in the customized section in ODM.

The configuration manager runs every time the system is restarted.

If you have installed an adapter for example, and you have put the software in a directory
like /usr/sys/inst.images, you can call cfgmgr to install device drivers as well with

# cfgmgr -i /usr/sys/inst.images

$$
09-08-00-1,0
u5971-t1-l1-l0


Device information:
-------------------

The most important AIX command to show device info is "lsdev". This command queries the ODM, so we can use
it to locate the customized or the predifined devices.

The main commands in AIX to get device information are:
- lsdev  : queries ODM
- lsattr : gets specific configuration attributes of a device
- lscfg  : gets vendor name, serial number, type, model etc.. of the device

lsdev also shows the status of a device as Available (that is configured) or as Defined (that is predefined).


lsdev examples:
---------------

If you need to see disk or other devices, defined or available, you can use the lsdev command
as in the following examples:

# lsdev -Cc tape
rmt0  Available  10-60-00-5,0  SCSI 8mm Tape Drive

# lsdev -Cc disk
hdisk0 Available 20-60-00-8,0    16 Bit LVD SCSI Disk Drive
hdisk1 Available 20-60-00-9,0    16 Bit LVD SCSI Disk Drive
hdisk2 Available 20-60-00-10,0   16 Bit LVD SCSI Disk Drive
hdisk3 Available 20-60-00-11,0   16 Bit LVD SCSI Disk Drive
hdisk4 Available 20-60-00-13,0   16 Bit LVD SCSI Disk Drive

Note: -C queries the Customized section of ODM, -P queries the Predefined section of ODM.

Example if some of the disks are on a SAN (through FC adapters):

# lsdev -Cc disk
hdisk0 Available          Virtual SCSI Disk Drive
hdisk1 Available          Virtual SCSI Disk Drive
hdisk2 Available 02-08-02 SAN Volume Controller MPIO Device  (through FC adapter)
hdisk3 Available 02-08-02 SAN Volume Controller MPIO Device  (through FC adapter)

# lsattr -El hdisk2
PCM             PCM/friend/sddpcm                                   PCM                                     True
PR_key_value    none                                                Reserve Key                             True
algorithm       load_balance                                        Algorithm                               True
dist_err_pcnt   0                                                   Distributed Error Percentage            True
dist_tw_width   50                                                  Distributed Error Sample Time           True
hcheck_interval 20                                                  Health Check Interval                   True
hcheck_mode     nonactive                                           Health Check Mode                       True
location                                                            Location Label                          True
lun_id          0x0                                                 Logical Unit Number ID                  False
lun_reset_spt   yes                                                 Support SCSI LUN reset                  True
max_transfer    0x40000                                             Maximum TRANSFER Size                   True
node_name       0x50050768010029c8                                  FC Node Name                            False
pvid            00cb5b9e66cc16470000000000000000                    Physical volume identifier              False
q_type          simple                                              Queuing TYPE                            True
qfull_dly       20                                                  delay in seconds for SCSI TASK SET FULL True
queue_depth     20                                                  Queue DEPTH                             True
reserve_policy  no_reserve                                          Reserve Policy                          True
rw_timeout      60                                                  READ/WRITE time out value               True
scbsy_dly       20                                                  delay in seconds for SCSI BUSY          True
scsi_id         0x611013                                            SCSI ID                                 False
start_timeout   180                                                 START unit time out value               True
unique_id       33213600507680190014E30000000000001E204214503IBMfcp Device Unique Identification            False
ww_name         0x50050768014029c8                                  FC World Wide Name                      False


lsdev [ -C ][ -c Class ] [ -s Subclass ] [ -t Type ] [ -f File ] [ -F Format |
-r ColumnName ] [ -h ] [ -H ] [ -l { Name | - } ] [ -p Parent ] [ -S State ]

lsdev -P [ -c Class ] [ -s Subclass ] [ -t Type ] [ -f File ] [ -F Format | -r
ColumnName ] [ -h ] [ -H ]

Remark:

For local attached SCSI devices, the general format of the LOCATION code "AB-CD-EF-GH" is actually "AB-CD-EF-G,H" , 
the first three sections are the same and for the GH section, the G is de SCSI ID and the H is the LUN. 
For adapters, only the AB-CD is mentioned in the location code.

A location code is a representation of the path to the device, from drawer, slot, connector and port.

- For an adapter it is sufficient to have the codes of the drawer and slot to identify
  the adapter. The location code of an adapter takes the form of AB-CD.

- Other devices needs more specification, like a specific disk on a specific SCSI bus.
  For other devices the format is AB-CD-EF-GH. 
  The AB-CD part then indicates the adapter the device is connected on.

- For SCSI devices we have a location code like AB-CD-EF-S,L where the S,L fields identifies
  the SCSI ID and LUN of the device.


To lists all devices in the Predefined object class with column headers, use
# lsdev -P -H

To list the adapters that are in the Available state in the Customized Devices object class, use
# lsdev -C -c adapter -S 


lsattr examples:
----------------

This command gets the current attributes (-E flag) for a tape drive: 

# lsattr -El rmt0
mode           yes     Use DEVICE BUFFERS during writes    True
block_size     1024    Block size (0=variable length)      True
extfm          no      Use EXTENDED file marks             True
ret            no      RETENSION on tape change or reset   True
..
..

(Ofcourse, the equivalent for the above command is for example # lsattr -l rmt0 -E )

To list the default values for that tape device (-D flag), use
# lsattr -l -D rmt0


This command gets the attributes for a network adapter:

# lsattr -E -l ent1
busmem     0x3cfec00     Bus memory address     False
busintr    7             Bus interrupt level    False
..
..

To list only a certain attribute (-a flag), use the command as in the following example:

# lsattr -l -E scsi0 -a bus_intr_lvl 
bus_intr_lvl 14 Bus interrupt level False

# lsattr -El tty0 -a speed
speed 9600 BAUD rate true


You must specify one of the following flags with the lsattr command: 
-D  Displays default values.  
-E  Displays effective values (valid only for customized devices specified with the -l flag).  
-F  Format  Specifies the user-defined format.  
-R  Displays the range of legal values.  
-a  Displays for that attribute


lscfg examples:
---------------

Example 1:

This command gets the Vital Product Data for the tape drive rmt0:

# lscfg -vl rmt0
Manufacturer...............EXABYTE
Machine Type and Model.....IBM-20GB
Device Specific(Z1)........38zA
Serial Number..............60089837
..
..

-l Name Displays device information for the named device.

-p Displays the platform-specific device information. This flag only applies to
   AIX 4.2.1 or later.

-v Displays the VPD found in the Customized VPD object class. Also, on AIX 4.2.1
   or later, displays platform specific VPD when used with the -p flag.

-s Displays the device description on a separate line from the name and
   location.


# lscfg -vp | grep -p 'Platform Firmware:'

# lscfg -vp | grep -p Platform

sample output:

Platform Firmware:
ROM Level.(alterable).......3R040602
Version.....................RS6K
System Info Specific.(YL)...U1.18-P1-H2/Y2
Physical Location: U1.18-P1-H2/Y2
The ROM Level denotes the firmware/microcode level
Platform Firmware:
ROM Level ............. RH020930
Version ................RS6K
.. 


Example 2:

The following command shows details about the Fiber Channel cards:

# lscfg -vl fcs*          (fcs0 for example, is the parent of fsci0)


Adding a device:
----------------

Adding a device with cfmgr:
---------------------------

To add a device you can run cfgmgr, or shutdown the system, attach the new device and boot the system.
There are also many smitty screens to accomplish the task of adding a new device.


Adding a device with mkdev:
---------------------------

Also the mkdev command can be used as in the following example:

# mkdev -c tape -s scsi -t scsd -p scsi0 -w 5,0

where

-c    Class of the device
-s    Subclass of the device
-t    Type of the device. This is a specific attribute for the device 
-p    The parent adapter of the device. You have to specify the logical name.
-w    You have to know the SCSI ID that you are goiing to assign to the new device.
      If it's non SCSI, you have to know the port number on the adapter.
-a    Specifies the device attribute-value pair


The mkdev command also creates the ODM entries for the device and loads the device driver.

The following command configures a new disk and ensures that it is available as a physical volume.
This example adds a 2.2GB disk with a scsi ID of 6 and a LUN of 0 to the scsi3 SCSI bus.

# mkdev -c disk -s scsi -t 2200mb -p scsi3 -w 6,0 -a pv=yes

This example adds a terminal:

# mkdev -c tty -t tty -s rd232 -p sa1 -w 0 -a login=enable -a term=ibm3151
tty0 Available


Changing a device with chdev:
-----------------------------

Suppose you have just added a new disk. Suppose the cfgmgr has run and detected the disk.

Now you run
# lspv
hdisk1    none                 none
OR
hdisk1    0005264d2            none

The first field identifies the system-assigned name of the disk. The second field displays the
"physical volume id" PVID. If that is not shown, you can use chdev:

# chdev -l hdisk2 -a pv=yes


Removing a device with rmdev:
-----------------------------

Examples:

# lsdev -Cc tape
rmt0  Available  10-60-00-5,0  SCSI 8mm Tape Drive

# rmdev -l rmt0               # -l indicates using the logical device name
rmt0 Defined

The status have shifted from Available to Defined.

# lsdev -Cc tape
rmt0  Defined  10-60-00-5,0  SCSI 8mm Tape Drive

If you really want to remove it from the system, use the -d flag as well

# rmdev -l rmt0 -d

To unconfigure the childeren of PCI bus pci1 and all devices under them, while retaining their
device definition in the Customized Devices Object Class. 

# rmdev -p pci1
rmt0 Defined
hdisk1 Defined
scsi1 Defined
ent0 Defined


The special device sys0:
------------------------

In AIX 5.x we have a special device named sys0 that is used to manage some kernel parameters.
The way to change these values is by using smitty, the chdev command or WSM.

Example.

To change the maxusersprocesses parameter, you can for example use the Web-based System Manager.
You can also use the chdev command:

#chdev -l sys0 -a maxuproc=50
sys0 changed

Note: In Solaris, to change kernel parameters, you have to edit /etc/system.

Device drivers:
---------------

Device drivers are located in /usr/lib/drivers directory.


>>> filesystem commands AIX:
============================


The Logical Volume Manager LVM:
===============================

In AIX, it's common to use a Logical Volume Manager LVM to cross the boundaries posed by
traditional disk management.
Traditionally, a filesystem was on a single disk or on a single partition.
Changing a partionion size was a difficult task. With a LVM, we can create logical volumes
which can span several disks.

The LVM has been a feature of the AIX operating system since version 3, and it is installed 
automatically with the Operating System.

LVM commands in AIX:
--------------------

mkvg  (or the mkvg4vp command in case of SAN vpath disks. See section 31.3)
cplv
rmlv
mklvcopy
extendvg
reducevg
getlvcb
lspv
lslv
lsvg
mirrorvg
chpv
migratepv
exportvg, importvg
varyonvg, varyoffvg

And related commands:
mkdev
chdev
rmdev
lsdev

Volume group:
-------------

What a physical disk is, or a physical volume is, is evident. When you add a physical volume to a volume group,
the physical volume is partitioned into contiguous equal-sized units of space called "physical partitions".
A physical partition is the smallest unit of storage space allocation and is a contiguous space
on a physical volume.
The physical volume must now become part of a volume group. The disk must be in a available state
and must have a "physical volume id" assigned to it.

A volume group (VG) is an entity consisting of 1 to 32 physical volumes (of varying sizes and types). 
A "Big volume group" kan scale up to 128 devices.

You create a volume group with the "mkvg" command. You add a physical volume to an existing volume group with
the "extendvg" command, you make use of the changed size of a physical volume with the "chvg" command,
and remove a physical volume from a volume group with the "reducevg" command.
Some of the other commands that you use on volume groups include:
list (lsvg), remove (exportvg), install (importvg), reorganize (reorgvg), synchronize (syncvg),
make available for use (varyonvg), and make unavailable for use (varyoffvg).

To create a VG, using local disks, use the "mkvg" command:

mkvg -y <name_of_volume_group> -s <partition_size> <list_of_hard_disks>

Typical example:

mkvg -y oravg -s 64 hdisk3 hdisk4

mkvg -y appsvg -s 32 hdisk2
mkvg -y datavg -s 64 hdisk3

mkvg -y appsvg -s 32 hdisk3
mkvg -y datavg -s 32 hdisk2
mkvg -y vge1corrap01 -s 64 hdisk2


In case you use the socalled SDD subsystem with vpath SAN storage, you should use the "mkvg4vp" command,
which works similar (same flags) as the mkvg command.


Types of VG's:
==============

There are 3 kinds of VG's:

- Normal VG (AIX 5L)
- Big VG (AIX 5L)
- Scalable VG (as from AIX 5.3)

Normal VG:
----------

Number of disks		Max number of partitions/disk
1			32512
2			16256
4			8128
8			4064
16			2032
32			1016

Big VG:
-------
Number of disks		Max number of partitions/disk
1			130048
2			65024
4			32512
8			16256
16			8128
32			4064
64			2032
128			1016


VG Type		Max PV's	Max LV's	Max PP's per VG
---------------------------------------------------------------
Normal		32		256		32512
Big		128		512		130048
Scalable	1024		4096		2097152


Physical Partition:
===================

You can change the NUMBER of PPs in a VG, but you cannot change the SIZE of PPs afterwards.
Defaults:
- 4 MB partition size. It can be a multiple of that amount. The Max size is 1024 MB
- The default is 1016 PPs per disk. You can increase the number of PPs in powers of 2 per PV, but the number
  of maximum disks per VG is decreased. 

#disks   max # of PPs / disk
32       1016
16       2032
8        4064
4        8128
2       16256
1       32512


In the case of a set of "normal" internal disks of, for example, 30G or 70G or so,
common partition sizes are 64M or 128M.


Logical Partition:
------------------

A LP maps to (at least) one PP, and is actually the smallest unit of allocatable space.


Logical Volume:
---------------

Consists of LPs in a VG. A LV consists of LPs from actual PPs from one or more disks.


   |-----|               | ----|
   |LP1  |      --->     | PP1 | 
   |-----|               | ----|
   |LP2  |      --->     | PP2 |
   |-----|               | ----|
   |..   |                hdisk 1 (Physical Volume 1)
   |..   |
   |..   |
   |-----|               |---- |
   |LPn  |      --->     |PPn  |
   |-----|               |---- |
   |LPn+1|      --->     |PPn+1|
   |-----|               |---- |
   Logical Volume      hdisk2 (Physical Volume 2)


So, a VG is a collection of related PVs, but you know that actually LVs are created in the VG.
For the applications, the LVs are the entities they work with.
In AIX, a filesystem like "/data", corresponds to a LV.


lspv Command
------------

Purpose: Displays information about a physical volume within a volume group.

lspv [ -L ] [ -l | -p | -M ] [ -n DescriptorPhysicalVolume] [ -v VolumeGroupID] PhysicalVolume

-p: lists range, state, region, LV names, type and mount points


# lspv
# lspv hdisk3
# lspv -p hdisk3


# lspv
hdisk0   00453267554   rootvg
hdisk1   00465249766   rootvg

# lspv hdisk23
PHYSICAL VOLUME:    hdisk23                  VOLUME GROUP:     oravg
PV IDENTIFIER:      00ccf45d564cfec0 VG IDENTIFIER     00ccf45d00004c0000000104564d2386
PV STATE:           active
STALE PARTITIONS:   0                        ALLOCATABLE:      yes
PP SIZE:            256 megabyte(s)          LOGICAL VOLUMES:  3
TOTAL PPs:          947 (242432 megabytes)   VG DESCRIPTORS:   1
FREE PPs:           247 (63232 megabytes)    HOT SPARE:        no
USED PPs:           700 (179200 megabytes)
FREE DISTRIBUTION:  00..00..00..57..190
USED DISTRIBUTION:  190..189..189..132..00


# lspv -p hdisk23
hdisk23:
PP RANGE  STATE   REGION        LV NAME             TYPE       MOUNT POINT
  1-22    used    outer edge    u01                 jfs2       /u01
 23-190   used    outer edge    u02                 jfs2       /u02
191-379   used    outer middle  u01                 jfs2       /u01
380-568   used    center        u01                 jfs2       /u01
569-600   used    inner middle  u02                 jfs2       /u02
601-700   used    inner middle  u03                 jfs2       /u03
701-757   free    inner middle
758-947   free    inner edge

# lspv -p hdisk0
hdisk0:
PP RANGE  STATE   REGION        LV NAME             TYPE       MOUNT POINT
1-1       used    outer edge    hd5                 boot       N/A
2-48      free    outer edge       
49-51     used    outer edge    hd9var              jfs        /var
52-52     used    outer edge    hd2                 jfs        /usr
53-108    used    outer edge    hd6                 paging     N/A
109-116   used    outer middle  hd6                 paging     N/A
117-215   used    outer middel  hd2                 jfs        /usr
216-216   used    center        hd8                 jfslog     N/A
217-217   used    center        hd4                 jfs        /
218-222   used    center        hd2                 jfs        /usr
223-320   used    center        hd4                 jfs        /
..
..

Note that in this example the Logical Volumes corresponds to the filesystems in the
following way: 
hd4= /, hd5=boot, hd6=paging, hd2=/usr, hd3=/tmp, hd9var=/var


lslv Command
------------
Purpose: Displays information about a logical volume.


To Display Logical Volume Information
lslv [ -L ] [ -l| -m ] [ -nPhysicalVolume ] LogicalVolume

To Display Logical Volume Allocation Map
lslv [ -L ] [ -nPhysicalVolume ] -pPhysicalVolume [ LogicalVolume ]


# lslv -l lv06
lv06:/backups
PV                COPIES        IN BAND       DISTRIBUTION
hdisk3            512:000:000   100%          000:218:218:076:000


# lslv lv06
LOGICAL VOLUME:     lv06                   VOLUME GROUP:   backupvg
LV IDENTIFIER:      00c8132e00004c0000000106ef70cec2.2 PERMISSION:     read/write
VG STATE:           active/complete        LV STATE:       opened/syncd
TYPE:               jfs                    WRITE VERIFY:   off
MAX LPs:            512                    PP SIZE:        64 megabyte(s)
COPIES:             1                      SCHED POLICY:   parallel
LPs:                512                    PPs:            512
STALE PPs:          0                      BB POLICY:      relocatable
INTER-POLICY:       minimum                RELOCATABLE:    yes
INTRA-POLICY:       middle                 UPPER BOUND:    32
MOUNT POINT:        /backups               LABEL:          /backups
MIRROR WRITE CONSISTENCY: on/ACTIVE
EACH LP COPY ON A SEPARATE PV ?: yes
Serialize IO ?:     NO

# lslv -p hdisk3
FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE       1-10
FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE      11-20
FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE      21-30
FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE      31-40
FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE      41-50
FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE      51-60
FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE      61-70
FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE      71-80
FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE   FREE      81-90
..
..


Also, you can list LVs per VG by running, for example:

# lsvg -l backupvg
backupvg:
LV NAME             TYPE       LPs   PPs   PVs  LV STATE      MOUNT POINT
loglv02             jfslog     1     1     1    open/syncd    N/A
lv06                jfs        512   512   1    open/syncd    /backups

# lsvg -l splvg
splvg:
LV NAME             TYPE       LPs   PPs   PVs  LV STATE      MOUNT POINT
loglv01             jfslog     1     1     1    open/syncd    N/A
lv04                jfs        240   240   1    open/syncd    /data
lv00                jfs        384   384   1    open/syncd    /spl
lv07                jfs        256   256   1    open/syncd    /apps

For a complete storage system, this could yield in for example:

-redovg:
LV NAME             TYPE       LPs   PPs   PVs  LV STATE      MOUNT POINT
redo1lv             jfs2       42    42    3    open/syncd    /u05
redo2lv             jfs2       1401  1401  3    open/syncd    /u04
loglv03             jfs2log    1     1     1    open/syncd    N/A
-db2vg:
LV NAME             TYPE       LPs   PPs   PVs  LV STATE      MOUNT POINT
db2lv               jfs2       600   600   2    open/syncd    /db2_database
loglv00             jfs2log    1     1     1    open/syncd    N/A
-oravg:
LV NAME             TYPE       LPs   PPs   PVs  LV STATE      MOUNT POINT
u01                 jfs2       800   800   2    open/syncd    /u01
u02                 jfs2       400   400   2    open/syncd    /u02
u03                 jfs2       200   200   2    open/syncd    /u03
logfs               jfs2log    2     2     1    open/syncd    N/A
-rootvg:
LV NAME             TYPE       LPs   PPs   PVs  LV STATE      MOUNT POINT
hd5                 boot       1     2     2    closed/syncd  N/A
hd6                 paging     36    72    2    open/syncd    N/A
hd8                 jfs2log    1     2     2    open/syncd    N/A
hd4                 jfs2       8     16    3    open/syncd    /
hd2                 jfs2       24    48    2    open/syncd    /usr
hd9var              jfs2       9     18    3    open/syncd    /var
hd3                 jfs2       11    22    3    open/syncd    /tmp
hd1                 jfs2       10    20    2    open/syncd    /home
hd10opt             jfs2       2     4     2    open/syncd    /opt
fslv00              jfs2       1     2     2    open/syncd    /XmRec
fslv01              jfs2       2     4     3    open/syncd    /tmp/m2
paging00            paging     32    32    1    open/syncd    N/A
sysdump1            sysdump    80    80    1    open/syncd    N/A
oralv               jfs2       100   100   1    open/syncd    /opt/app/oracle
fslv03              jfs2       63    63    2    open/syncd    /bmc_home


And you can list the LVs by PV by running
# lspv -l hdiskn


lsvg Command:
-------------

-o          Shows only the active volume groups.
-p VG_name  Shows all the PVs that belong to the vg_name
-l VG_name  Shows all the LVs that belong to the vg_name


Examples:

# lsvg
rootvg
informixvg
oravg

# lsvg -o
rootvg
oravg

# lsvg oravg
VOLUME GROUP:   oravg                    VG IDENTIFIER:  00ccf45d00004c0000000104564d2386
VG STATE:       active                   PP SIZE:        256 megabyte(s)
VG PERMISSION:  read/write               TOTAL PPs:      1894 (484864 megabytes)
MAX LVs:        256                      FREE PPs:       492 (125952 megabytes)
LVs:            4                        USED PPs:       1402 (358912 megabytes)
OPEN LVs:       4                        QUORUM:         2
TOTAL PVs:      2                        VG DESCRIPTORS: 3
STALE PVs:      0                        STALE PPs:      0
ACTIVE PVs:     2                        AUTO ON:        yes
MAX PPs per PV: 1016                     MAX PVs:        32
LTG size:       128 kilobyte(s)          AUTO SYNC:      no
HOT SPARE:      no                       BB POLICY:      relocatable

# lsvg -p informixvg
informixvg
PV_NAME       PV STATE     TOTAL PPs     FREE PPs     FREE DISTRIBUTION
hdisk3        active       542           462          109..28..108..108..109
hdisk4        active       542           447          109..13..108..108..109

# lsvg -l rootvg
LV NAME       TYPE         LPs    PPs    PVs     LV STATE      MOUNT POINT
hd5           boot         1      1      1       closed/syncd  N/A
hd6           paging       24     24     1       open/syncd    N/A
hd8           jfslog       1      1      1       open/syncd    N/A
hd4           jfs          4      4      1       open/synced   /
hd2           jfs          76     76     1       open/synced   /usr
hd9var        jfs          4      4      1       open/synced   /var
hd3           jfs          6      6      1       open/synced   /tmp
paging00      paging       20     20     1       open/synced   N/A
..
..

Suppose we have 70GB disk=70000MB
1016 partitions=> 63 MB per PP


extendvg command:
-----------------

extendvg VGName hdiskNumber

# extendvg newvg hdisk23

How to Add a Disk to a Volume Group? 

extendvg   VolumeGroupName   hdisk0 hdisk1 ... hdiskn 


reducevg command:
-----------------

To remove a PV from a VG:

# reducevg myvg hdisk23

To remove a VG:

Suppose we have a VG informixvg with 2 PV, hdisk3 and hdisk4:

# reducevg -d informixvg hdisk4

When you delete the last disk from the VG, the VG is also removed.

# reducevg -d informix hdisk3


varyonvg and varyoffvg commands:
--------------------------------

When you activate a VG for use, all its resident filesystems are mounted by default if they have
the flag mount=true in the /etc/filesystems file.

# varyonvg apachevg

# varyoffvg apachevg

To use this command, you must be sure that none of the logical volumes are opened, that is, in use.


mkvg command:
-------------

You can create a new VG by using "smitty mkvg" or by using the mkvg command.

Use the following command, where s "partition_size" sets the number of megabytes in each physical partition 
where the partition_size is expressed in units of megabytes from 1 through 1024. The size variable must 
be equal to a power of 2 (for example 1, 2, 4, 8). The default value is 4.

mkvg -y <name_of_volume_group> -s <partition_size> <list_of_hard_disks>

As with physical volumes, volume groups can be created and removed and their characteristics
can be modified.

Before a new volume group can be added to the system, one or more physical volumes not used
in other volume groups, and in an available state, must exist on the system.

The following example shows the use of the mkvg command to create a volume group myvg
using the physical volumes hdisk1 and hdisk5.

# mkvg -y myvg -d 10 -s 8 hdisk1 hdisk5

# mkvg -y oravg -d 10 -s 64 hdisk1


mklv command:
-------------

To create a LV, you can use the smitty command "smitty mklv" or just use the mklv command
by itself.

The mklv command creates a new logical volume within the VolumeGroup. For example, all file systems 
must be on separate logical volumes. The mklv command allocates the number of logical partitions 
to the new logical volume. If you specify one or more physical volumes with the PhysicalVolume parameter, 
only those physical volumes are available for allocating physical partitions; otherwise, all the 
physical volumes within the volume group are available. 

The default settings provide the most commonly used characteristics, but use flags to tailor the logical volume 
to the requirements of your system. Once a logical volume is created, its characteristics can be changed 
with the chlv command. 

When you create a LV, you also specify the number of LP's, and how a LP maps to PP's. 
Later, you can create one filesystem per LV.

Examples

The following example creates a LV "lv05" on the VG "splvg", with two copies (2 PPs) of each LP.
In this case, we are mirroring a LP to two PP's.
Also, 200 PP's are specified. If a PP is 128 MB is size, the total amount of space of one "mirror" is 25600 MB.

# mklv -y lv05 -c 2 splvg 200

The following example shows the use of mklv command to create a new LV newlv in the rootvg
and it will have 10 LP's and each LP consists of 2 physical partitions.

# mklv -y newlv -c 2 rootvg 10

To make a logical volume in volume group vg02 with one logical partition and a total of two copies of the data, enter: 

# mklv -c 2 vg02 1

To make a logical volume in volume group vg03 with nine logical partitions and a total of three copies 
spread across a maximum of two physical volumes, and whose allocation policy is not strict, enter: 

# mklv -c 3 -u 2 -s n vg03 9

To make a logical volume in vg04 with five logical partitions allocated across the center sections of the 
physical volumes when possible, with no bad-block relocation, and whose type is paging, enter: 

# mklv -a c -t paging -b n vg04 5

To make a logical volume in vg03 with 15 logical partitions chosen from physical volumes hdisk5, hdisk6, and hdisk9, 
enter: 

# mklv vg03 15 hdisk5 hdisk6 hdisk9

To make a striped logical volume in vg05 with a stripe size of 64K across 3 physical volumes and 12 
logical partitions, enter: 

# mklv -u 3 -S 64K vg05 12

To make a striped logical volume in vg05 with a stripe size of 8K across hdisk1, hdisk2, and hdisk3 and 
12 logical partitions, enter: 

# mklv -S 8K vg05 12 hdisk1 hdisk2 hdisk3

The following example uses a "map file /tmp/mymap1" which list which PPs are to be used in creating a LV:

# mklv -t jfs -y lv06 -m /tmp/mymap1 rootvg 10


The setting Strict=y means that each copy of the LP is placed on a different PV. The setting Strict=n means
that copies are not restricted to different PVs. 
The default is strict.


# mklv -y lv13 -c 2 failovervg 150
# crfs -v jfs -d lv13 -m /backups2 -a bf=true

Another simple example using local disks:

# mkvg -y appsvg -s 32 hdisk2
# mkvg -y datavg -s 32 hdisk3

# mklv -y testlv -c 1 appsvg 10
# mklv -y backuplv -c 1 datavg 10

# crfs -v jfs -d testlv -m /test -a bf=true
# crfs -v jfs -d backuplv -m /backup -a bf=true

mklv -y testlv1 -c 1 appsvg 10
mklv -y testlv2 -c 1 datavg 10
crfs -v jfs -d testlv1 -m /test1 -a bf=true
crfs -v jfs -d testlv2 -m /test2 -a bf=true


mklv -y testlv1 -c 1 vgp0corddap01 10
mklv -y testlv2 -c 1 vgp0corddad01 10
crfs -v jfs -d testlv1 -m /test1 -a bf=true
crfs -v jfs -d testlv2 -m /test2 -a bf=true

rmlv command:
-------------

# rmlv newlv
Warning, all data on logical volume newlv will be destroyed.
rmlv: Do you wish to continue? y(es) n(o) y
#

extendlv command:
-----------------

The following example shows the use of the extentlv command to add 3 more LP's to the LP newlv:

# extendlv newlv 3

cplv command:
-------------

The following command copies the contents of LV oldlv to a new LV called newlv:
# cplv -v myvg -y newlv oldlv

To copy to an existing LV:
# cplv -e existinglv oldlv

Purpose
Copies the contents of a logical volume to a new logical volume.

Syntax
To Copy to a New Logical Volume

cplv [ -vg VolumeGroup ] [ -lv NewLogicalVolume | -prefix Prefix ] SourceLogicalVolume

To Copy to an Existing Logical Volume

cplv [ -f ] SourceLogicalVolume DestinationLogicalVolume

cplv -e DestinationLogicalVolume [-f] SourceLogicalVolume

-e: specifies that the DestinationLogicalVolume already exists.
-f: no user confirmation
-y: specifies the name to use for the NewLogicalVolume, instead of a system generated name.

Description
Attention: Do not copy from a larger logical volume containing data to a smaller one. Doing so results 
in a corrupted file system because some data is not copied.
The cplv command copies the contents of SourceLogicalVolume to a new or existing logical volume. 
The SourceLogicalVolume parameter can be a logical volume name or a logical volume ID. 
The cplv command creates a new logical volume with a system-generated name by using the default syntax. 
The system-generated name is displayed. 

Note:
The cplv command can not copy logical volumes which are in the open state, 
including logical volumes 
that are being used as backing devices for virtual storage.
Flags
-f Copies to an existing logical volume without requesting user confirmation. 
-lv NewLogicalVolume Specifies the name to use, in place of a system-generated name, 
 for the new logical volume. Logical volume names must be unique systemwide names, and can range 
 from 1 to 15 characters. 
-prefix Prefix Specifies a prefix to use in building a system-generated name for the new logical volume. 
 The prefix must be less than or equal to 13 characters. A name cannot be a name already used by another device. 
-vg VolumeGroup Specifies the volume group where the new logical volume resides. If this is not specified, 
 the new logical volume resides in the same volume group as the SourceLogicalVolume. 

Examples
To copy the contents of logical volume fslv03 to a new logical volume, type: 

# cplv fslv03
The new logical volume is created, placed in the same volume group as fslv03, 
and named by the system. 

To copy the contents of logical volume fslv03 to a new logical volume in volume group vg02, 
type: 
#cplv  -vg vg02 fslv03
The new logical volume is created, named, and added to volume group vg02. 

#To copy the contents of logical volume lv02 to a smaller, existing logical volume, 
lvtest, without requiring user confirmation, type: 
cplv -f lv02 lvtest


Errors:
-------

0516-746 cplv: Destination logical volume must have 
         type set to copy 

chlv -t copy lvprj


==========================================================================
CASES of usage of cplv command:

CASE 1:
-------

TITLE    : Procedure for moving a filesystem between disks that are in
           different volume groups using the cplv command.
OS LEVEL : AIX 4.x
DATE     : 25/11/99
VERSION  : 1.0

----------------------------------------------------------------------------

In the following example, an RS6000 has 1 one disk with rootvg on, and has
just had a second disk installed. The second disk needs a volume group
creating on it and a data filesystem transferring to the new disk. Ensure
that you have a full system backup befor you start.


lspv

hdisk0         00009922faf79f0d    rootvg         
hdisk1         None                None           

df -k

Filesystem    1024-blocks      Free %Used    Iused %Iused Mounted on
/dev/hd4             8192      1228   86%     1647    41% /
/dev/hd2           380928     40984   90%    11014    12% /usr
/dev/hd9var         32768     20952   37%      236     3% /var
/dev/hd3            28672      1644   95%      166     3% /tmp
/dev/hd1            53248     51284    4%       95     1% /home
/dev/lv00          200704    110324   46%     1869     4% /home/john
/dev/ftplv         102400     94528    8%       32     1% /home/ftp
/dev/lv01          114688     58240   50%       59     1% /usr2

In this example the /usr2 filesystem needs to be moved to the new disk 
drive, freeing up space in the root volume group. 


1, Create a data volume group on the new disk (hdisk1), the command below
   will create a volume group called datavg on hdisk1 with a PP size of 
   32 Meg:-

   mkvg -s 32 -y datavg hdisk1

2, Create a jfslog logical volume on the new volume group :-

   mklv -y datalog -t jfslog datavg 1

3, Initialise the jfslog :-

   logform /dev/datalog

   logform: destroy /dev/datalog (y)?y

4, Umount the filesystem that is being copied :-

   umount /usr2

5, Copy the /usr2 logical volume (lv01) to a new logical volume (lv11) on 
   the new volume group :-

   cplv -y lv11 -v datavg lv01

   cplv: Logical volume lv01 successfully copied to lv11 .

6, Change the /usr2 filesystem to use the new (/dev/lv11) logical volume 
   and not the old (/dev/lv01) logical volume :-

   chfs -a dev=/dev/lv11 /usr2

7, Change the /usr2 filesystem to use the jfslog on the new volume group 
   (/dev/datalog) :- 

   chfs -a log=/dev/datalog /usr2

8, Mount the filesystem :-

   mount /usr2

   df -k

   Filesystem    1024-blocks      Free %Used    Iused %Iused Mounted on
   /dev/hd4             8192      1220   86%     1649    41% /
   /dev/hd2           380928     40984   90%    11014    12% /usr
   /dev/hd9var         32768     20952   37%      236     3% /var
   /dev/hd3            28672      1644   95%      166     3% /tmp
   /dev/hd1            53248     51284    4%       95     1% /home
   /dev/lv00          200704    110324   46%     1869     4% /home/john
   /dev/ftplv         102400     94528    8%       32     1% /home/ftp
   /dev/lv11          114688     58240   50%       59     1% /usr2

9, Once the filesystem has been checked out, the old logical volume can
   be removed :-

   rmfs /dev/lv01

   Warning, all data contained on logical volume lv01 will be destroyed.
   rmlv: Do you wish to continue? y(es) n(o)? y
   rmlv: Logical volume lv01 is removed. 


If you wish to copy further filesystems repeat parts 4 to 9.

==========================================================================

CASE 2:
-------

Doel:
-----

Een "move" van het /prj filesystem (met Websphere in /prj/was) op rootvg,
naar een nieuw (groter en beter) volume group "wasvg".
Het huidige /prj op rootvg, correspondeerd met de LV "prjlv".
De nieuw te maken /prj op wasvg, correspondeerd met de LV "lvprj".

  ROOTVG                     WASVG
  --------------            --------------
  |/usr  (hd2) |            |             |
  |..          |            |             |
  |/prj (prjlv)|----------->|/prj (lvprj) | 
  |..          |            |             |
  --------------             -------------
  hdisk0,hdisk1              hdisk12,hdisk13

opm: /prj bevat "/prj/was", en dat is Websphere.

Hier maken we geen gebruik van een backup tape.

Gebruik het cplv command

  umount /prj
  chfs -m /prj_old /prj

 + mkvg -y wasvg -d 10 -s 128 hdisk12 hdisk13   -- maak VG aan

 + mklv -y lvprj -c 2 wasvg 400                 -- maak LV aan

 + mklv -y waslog -t jfslog wasvg 1             -- maak een jfslog

 + logform /dev/waslog                          -- init de log


  cplv -e lvprj prjlv

  chfs -a dev=/dev/lvprj /prj_old                   --
 
  chfs -a log=/dev/waslog /prj_old

  chfs -m /prj /prj_old
 
  mount /prj

==========================================================================


migratepv command:
------------------

Use the following command to move PPs from hdisk1 to hdisk6 and hdisk7 (all PVs must be in 1 VG)
# migratepv hdisk1 hdisk6 hdisk7

Use the following command to move PPs in LV lv02 from hdisk1 to hdisk6 
# migratepv -l lv02 hdisk1 hdisk6


chvg command:
-------------

This example multiplies by 2 the number of PPs:
# chvg -t2 datavg
 

chpv command:
-------------

The chpv command changes the state of the physical volume in a volume group by setting allocation 
permission to either allow or not allow allocation and by setting the availability to either 
available or removed. This command can also be used to clear the boot record for the given physical volume. 
Characteristics for a physical volume remain in effect unless explicitly changed with the corresponding flag.

Examples

To close physical volume hdisk03, enter: 
# chpv -v r hdisk03

The physical volume is closed to logical input and output until the -v a flag is used. 

To open physical volume hdisk03, enter: 
# chpv -v a hdisk03

The physical volume is now open for logical input and output. 

To stop the allocation of physical partitions to physical volume hdisk03, enter: 
# chpv -a n hdisk03

No physical partitions can be allocated until the -a y flag is used. 

To clear the boot record of a physical volume hdisk3, enter: 
# chpv -c hdisk3


How to synchronize stale partitions in a VG?:
---------------------------------------------

the syncvg command:

syncvg Command

Purpose
Synchronizes logical volume copies that are not current.

Syntax
syncvg [ -f ] [ -i ] [ -H ] [ -P NumParallelLps ] { -l | -p | -v } Name ...

Description
The syncvg command synchronizes the physical partitions, which are copies of the original physical partition, 
that are not current. The syncvg command can be used with logical volumes, physical volumes, 
or volume groups, with the Name parameter representing the logical volume name, physical volume name, 
or volume group name. The synchronization process can be time consuming, depending on the 
hardware characteristics and the amount of data.

When the -f flag is used, a good physical copy is chosen and propagated to all other copies 
of the logical partition, whether or not they are stale. Using this flag is necessary 
in cases where the logical volume does not have the mirror write consistency recovery.

Unless disabled, the copies within a volume group are synchronized automatically when the volume group is 
activated by the varyonvg command. 

Note:
For the sycnvg command to be successful, at least one good copy of the logical volume should 
be accessible, and the physical volumes that contains this copy should be in ACTIVE state. 
If the -f option is used, the above condition applies to all mirror copies.
If the -P option is not specified, syncvg will check for the NUM_PARALLEL_LPS environment variable. 
The value of NUM_PARALLEL_LPS will be used to set the number of logical partitions to be synchronized in parallel.

Examples
To synchronize the copies on physical volumes hdisk04 and hdisk05, enter: 
# syncvg  -p hdisk04 hdisk05

To synchronize the copies on volume groups vg04 and vg05, enter: 
# syncvg  -v vg04 vg05


How to Mirror a Logical Volume? :
--------------------------------

mklvcopy LogicalVolumeName Numberofcopies 
syncvg VolumeGroupName 

To add a copy for LV lv01 on disk hdisk7:

# mklvcopy lv01 2 hdisk7


Identifying hotspots: lvmstat command:
--------------------------------------

The lvmstat command display statistics values since the previous lvmstat command.
# lvmstat -v rootvg -e
# lvmstat -v rootvg -C
# lvmstat -v rootvg

Logical Volume       iocnt    KB_read   KB_wrtn   Kbps
hd8                   4        0        0         0.00
paging01              0        0        0         0.00
..
..


Mirroring a VG:
===============

LVM provide a disk mirroring facility at the LV level. 
Mirroring is the association of 2 or 3 PP's with each LP in a LV.

Use the "mklv", or the "mklvcopy", or the "mirrorvg" command.

The mklv command allows you to select one or two additional copies for each logical volume.

example:

To make a logical volume in volume group vg03 with nine logical partitions and a total of three copies 
spread across a maximum of two physical volumes, and whose allocation policy is not strict, enter: 

mklv -c 3 -u 2 -s n vg03 9

Mirroring can also be added to an existing LV using the mklvcopy command.

The mirrorvg command mirrors all the LV's on a given VG.
Examples:

- To triply mirror a VG, run
# mirrorvg -c 3 myvg

- To get default mirroring of the rootvg, run
# mirrorvg rootvg

- To replace a failed disk in a mirrored VG, run
# unmirrorvg workvg hdisk7
# reducevg workvg hdisk7
# rmdev -l hdisk7 -d

Now replace the failed disk with a new one and name it hdisk7
# extendvg workvg hdisk7
# mirrorvg workvg


mirrorvg command:
-----------------

mirrorvg Command


Purpose
Mirrors all the logical volumes that exist on a given volume group. 
This command only applies to AIX 4.2.1 or later. 


Syntax
mirrorvg [ -S | -s ] [ -Q ] [ -c Copies] [ -m ] VolumeGroup [ PhysicalVolume ... ] 


Description
The mirrorvg command takes all the logical volumes on a given volume group and mirrors 
those logical volumes. This same functionality may also be accomplished manually if you execute 
the mklvcopy command for each individual logical volume in a volume group. As with mklvcopy, 
the target physical drives to be mirrored with data must already be members of the volume group. 
To add disks to a volume group, run the extendvg command. 

By default, mirrorvg attempts to mirror the logical volumes onto any of the disks in a volume group. 
If you wish to control which drives are used for mirroring, you must include the list of disks in the 
input parameters, PhysicalVolume. Mirror strictness is enforced. Additionally, mirrorvg mirrors 
the logical volumes, using the default settings of the logical volume being mirrored. 
If you wish to violate mirror strictness or affect the policy by which the mirror is created, 
you must execute the mirroring of all logical volumes manually with the mklvcopy command. 

When mirrorvg is executed, the default behavior of the command requires that the synchronization 
of the mirrors must complete before the command returns to the user. If you wish to avoid the delay, 
use the -S or -s option. Additionally, the default value of 2 copies is always used. To specify a value 
other than 2, use the -c option. 


Note: To use this command, you must either have root user authority or be a member of the system group. 

Attention: The mirrorvg command may take a significant amount of time before completing because 
of complex error checking, the amount of logical volumes to mirror in a volume group, and the time 
is takes to synchronize the new mirrored logical volumes. 
You can use the Volumes application in Web-based System Manager (wsm) to change volume characteristics. 
You could also use the System Management Interface Tool (SMIT) smit mirrorvg fast path to run this command. 


Flags

-c Copies  Specifies the minimum number of copies that each logical volume must have after 
   the mirrorvg command has finished executing. It may be possible, through the independent use 
   of mklvcopy, that some logical volumes may have more than the minimum number specified after 
   the mirrorvg command has executed. Minimum value is 2 and 3 is the maximum value. 
   A value of 1 is ignored.  
-m exact map  Allows mirroring of logical volumes in the exact physical partition order that 
   the original copy is ordered. This option requires you to specify a PhysicalVolume(s) where the exact map 
   copy should be placed. If the space is insufficient for an exact mapping, then the command will fail. 
   You should add new drives or pick a different set of drives that will satisfy an exact 
   logical volume mapping of the entire volume group. The designated disks must be equal to or exceed 
   the size of the drives which are to be exactly mirrored, regardless of if the entire disk is used. 
   Also, if any logical volume to be mirrored is already mirrored, this command will fail.  
-Q Quorum Keep  By default in mirrorvg, when a volume group's contents becomes mirrored, volume group 
   quorum is disabled. If the user wishes to keep the volume group quorum requirement after mirroring 
   is complete, this option should be used in the command. For later quorum changes, refer to the chvg command.  
-S Background Sync  Returns the mirrorvg command immediately and starts a background syncvg of the volume group. 
   With this option, it is not obvious when the mirrors have completely finished their synchronization. 
   However, as portions of the mirrors become synchronized, they are immediately used by the operating system 
   in mirror usage.  
-s Disable Sync  Returns the mirrorvg command immediately without performing any type of 
   mirror synchronization. If this option is used, the mirror may exist for a logical volume but 
   is not used by the operating system until it has been synchronized with the syncvg command.  


The following is a description of rootvg: 

- rootvg mirroring  When the rootvg mirroring has completed, you must perform three additional tasks: 
bosboot, bootlist, and reboot. 
The bosboot command is required to customize the bootrec of the newly mirrored drive. 
The bootlist command needs to be performed to instruct the system which disk and order you prefer 
the mirrored boot process to start. 

Finally, the default of this command is for Quorum to be turned off. For this to take effect 
on a rootvg volume group, the system must be rebooted. 
 
- non-rootvg mirroring  When this volume group has been mirrored, the default command causes Quorum 
to deactivated. The user must close all open logical volumes, execute varyoffvg and then varyonvg on 
the volume group for the system to understand that quorum is or is not needed for the volume group. 
If you do not revaryon the volume group, mirror will still work correctly. However, any quorum changes 
will not have taken effect.  
rootvg and non-rootvg mirroring  The system dump devices, primary and secondary, should not be mirrored. 
In some systems, the paging device and the dump device are the same device. However, most users want 
the paging device mirrored. When mirrorvg detects that a dump device and the paging device are the same, 
the logical volume will be mirrored automatically. 
If mirrorvg detects that the dump and paging device are different logical volumes, the paging device 
is automatically mirrored, but the dump logical volume is not. The dump device can be queried and modified 
with the sysdumpdev command. 

 
Remark:
-------
Run bosboot to initialize all boot records and devices by executing the 
following command:
bosboot -a -d /dev/hdisk?
hdisk? is the first hdisk listed under the PV heading after the command 
lslv -l hd5 has executed.

Secondary, you need to understant that the mirroring under AIX it's at 
the logical volume level. The mirrorvg command is a hight level command 
that use "mklvcopy" command.
So, all LV created before runing the mirrorvg command are keep 
synchronised, but if you add a new LV after runing mirrorvg, you need to 
mirror it manualy using "mklvcopy" .

Remark:
-------

lresynclv


Mirroring the rootvg:
---------------------

Method 1:
---------

Howto mirror an AIX rootvg
The following steps will guide you trough the mirroring of an AIX rootvg.
This info is valid for AIX 4.3.3, AIX 5.1, AIX 5.2 and AIX 5.3.

Make sure you have an empty disk, in this example its hdisk1 
Add the disk to the vg via 

# extendvg rootvg hdisk1 

Mirror the vg via: 

# mirrorvg -s rootvg

Now synchronize the new copies you created:

# syncvg -v rootvg

As we want to be able to boot from different disks, we need to use bosboot:

# bosboot -a

As hd5 is mirrored there is no need to do it for each disk.

Now, update the bootlist:

# bootlist -m normal hdisk1 hdisk0
# bootlist -m service hdisk1 hdisk0


When mirrorvg is executed, the default behavior of the command requires that the synchronization of the mirrors 
must complete before the command returns to the user. If you wish to avoid the delay, use the -S or -s option. 
Additionally, the default value of 2 copies is always used. To specify a value other than 2, use the -c option.


Method 2:
---------

-------------------------------------------------------------------------------
# Add the new disk, say its hdisk5, to rootvg

extendvg rootvg hdisk5

# If you use one mirror disk, be sure that a quorum is not required for varyon:

chvg -Qn rootvg

# Add the mirrors for all rootvg LV's:

mklvcopy hd1 2 hdisk5
mklvcopy hd2 2 hdisk5
mklvcopy hd3 2 hdisk5
mklvcopy hd4 2 hdisk5
mklvcopy hd5 2 hdisk5
mklvcopy hd6 2 hdisk5
mklvcopy hd8 2 hdisk5
mklvcopy hd9var 2 hdisk5
mklvcopy hd10opt 2 hdisk5
mklvcopy prjlv 2 hdisk5

#If you have other LV's in your rootvg, be sure to create copies for them as well !!
------------------------------------------------------------------------------

# lspv -l hdisk0
hd5                   1     1     01..00..00..00..00    N/A
prjlv                 256   256   108..44..38..50..16   /prj
hd6                   59    59    00..59..00..00..00    N/A
fwdump                5     5     00..05..00..00..00    /var/adm/ras/platform
hd8                   1     1     00..00..01..00..00    N/A
hd4                   26    26    00..00..02..24..00    /
hd2                   45    45    00..00..37..08..00    /usr
hd9var                10    10    00..00..02..08..00    /var
hd3                   22    22    00..00..04..10..08    /tmp
hd1                   8     8     00..00..08..00..00    /home
hd10opt               24    24    00..00..16..08..00    /opt


Method 3:
---------

In the following example, an RS6000 has 3 disks, 2 of which have the AIX
filesystems mirrored on. The boolist contains both hdisk0 and hdisk1. 
There are no other logical volumes in rootvg other than the AIX system 
logical volumes. hdisk0 has failed and need replacing, both hdisk0 and hdisk1
are in "Hot Swap" carriers and therefore the machine does not need shutting 
down. 

lspv

hdisk0         00522d5f22e3b29d    rootvg
hdisk1         00522d5f90e66fd2    rootvg 
hdisk2         00522df586d454c3    datavg                                     

lsvg -l rootvg

rootvg:
LV NAME             TYPE       LPs   PPs   PVs  LV STATE      MOUNT POINT
hd6                 paging     4     8     2    open/syncd    N/A
hd5                 boot       1     2     2    closed/syncd  N/A
hd8                 jfslog     1     2     2    open/syncd    N/A
hd4                 jfs        1     2     2    open/syncd    /
hd2                 jfs        12    24    2    open/syncd    /usr
hd9var              jfs        1     2     2    open/syncd    /var
hd3                 jfs        2     4     2    open/syncd    /tmp
hd1                 jfs        1     2     2    open/syncd    /home


1, Reduce the logical volume copies from both disks to hdisk1 only :-

   rmlvcopy hd6 1 hdisk0
   rmlvcopy hd5 1 hdisk0
   rmlvcopy hd8 1 hdisk0
   rmlvcopy hd4 1 hdisk0
   rmlvcopy hd2 1 hdisk0
   rmlvcopy hd9var 1 hdisk0
   rmlvcopy hd3 1 hdisk0
   rmlvcopy hd1 1 hdisk0
   
2, Check that no logical volumes are left on hdisk0 :-

   lspv -p hdisk0

   hdisk0:
   PP RANGE  STATE   REGION        LV ID          TYPE       MOUNT POINT
     1-101   free    outer edge
   102-201   free    outer middle
   202-301   free    center
   302-401   free    inner middle
   402-501   free    inner edge     

3, Remove the volume group from hdisk0

   reducevg -df rootvg hdisk0

4, Recreate the boot logical volume on hdisk1, and reset bootlist:-

   bosboot -a -d /dev/hdisk1
   bootlist -m normal rmt0 cd0 hdisk1

5, Check that everything has been removed from hdisk0 :-

   lspv

   hdisk0         00522d5f22e3b29d    None
   hdisk1         00522d5f90e66fd2    rootvg
   hdisk2         00522df586d454c3    datavg          

6, Delete hdisk0 :-

   rmdev -l hdisk0 -d

7, Remove the failed hard drive and replace with a new hard drive.

8, Configure the new disk drive :-

   cfgmgr

9, Check new hard drive is present :-

   lspv

10, Include the new hdisk in root volume group :-

    extendvg rootvg hdisk?  (where hdisk? is the new hard disk)

11, Re-create the mirror :-

    mirrorvg rootvg hdisk?  (where hdisk? is the new hard disk)

12, Syncronise the mirror :-

    syncvg -v rootvg

13, Reset the bootlist :-

    bootlist -m normal rmt0 cd0 hdisk0 hdisk1

14, Turn off Quorum checking on rootvg :-

    chvg -Q n rootvg


Method 4:
---------

Howto mirror an AIX rootvg
The following steps will guide you trough the mirroring of an AIX rootvg.
This info is valid for AIX 4.3.3, AIX 5.1, AIX 5.2 and AIX 5.3.

Make sure you have an empty disk, in this example its hdisk1 
Add the disk to the vg via "extendvg rootvg hdisk1 
Mirror the vg via: "mirrorvg rootvg" 
Adapt the bootlist to add the current disk, the system will then fail to hdisk1 is hdisk0 fails during startup 
do bootlist -o -m normal 
this will list currently 1 disk, in this exmaple hdisk0 
do bootlist -m normal hdisk0 hdisk1 
Run a bosboot on both new disks, this will install all software needed for boot on the disk 
bosboot -ad hdisk0 
bosboot -ad hdisk1 


Method 5:
---------

Although the steps to mirror volume groups between HP and AIX are incredibly similar, 
there are enough differences to send me through hoops if/when I ever have to do that. 
Therefore, the following checklist: 

1. Mirror the logical volumes: 
If you don't care what disks the lvs get mirrored to, execute

mirrorvg rootvg


Otherwise: 

for lv in $(lsvg -l rootvg | grep -i open/syncd | \
	grep -v dumplv | awk '{print $1}')
do
	mklvcopy ${lv} 1 ${disk}
done

2. Change the quorum checking if you did not use mirrorvg:

chvg -Q n rootvg


3. Run bosboot on the new drive to copy boot files to it:

bosboot ${disk}


4. Update the bootlist with the new drive:

bootlist -m normal hdisk0 hdisk1


5. Reboot the system to enable the new quorum checking parameter 


Method 6:
---------

Audience: System Administrators 
Date: September 25, 2002 


Mirroring "rootvg" protects the operating system from a disk failure. Mirroring "rootvg" 
requires a couple extra steps compared to other volume groups. The mirrored rootvg disk must be bootable 
*and* in the bootlist. Otherwise, if the primary disk fails, you'll continue to run, 
but you won't be able to reboot. 

In brief, the procedure to mirror rootvg on hdisk0 to hdisk1 is 

1. Add hdisk1 to rootvg:
extendvg rootvg hdisk1 

2. Mirror rootvg to hdisk1:
mirrorvg rootvg hdisk1 (or smitty mirrorvg) 

3. Create boot images on hdisk1:
bosboot -ad /dev/hdisk1 

4. Add hdisk1 to the bootlist:
bootlist -m normal hdisk0 hdisk1 

5. Reboot to disable quorum checking on rootvg. The mirrorvg turns off quorum by default, 
but the system needs to be rebooted for it to take effect. 

For more information, and a comprehensive procedure see the man page for mirrorvg and 


Example using mklvcopy:
-----------------------

mklvcopy [ -a Position ] [ -e Range ] [ -k ] [ -m MapFile ] [ -s Strict ] [ -u UpperBound ] LogicalVolume 
         Copies [ PhysicalVolume... ] 


Add a copy of LV "lv01" on disk hdisk7:

# mklvcopy lv01 2 hdisk7

The mklvcopy command increases the number of copies in each logical partition in LogicalVolume. 
This is accomplished by increasing the total number of physical partitions for each logical partition 
to the number represented by Copies. The LogicalVolume parameter can be a logical volume name or 
logical volume ID. You can request that the physical partitions for the new copies be allocated 
on specific physical volumes (within the volume group) with the PhysicalVolume parameter; 
otherwise, all the physical volumes within the volume group are available for allocation.

The logical volume modified with this command uses the Copies parameter as its new copy characteristic. 
The data in the new copies are not synchronized until one of the following occurs: 
the -k option is used, the volume group is activated by the varyonvg command, or the volume group 
or logical volume is synchronized explicitly by the syncvg command. Individual logical partitions 
are always updated as they are written to.

The default allocation policy is to use minimum numbering of physical volumes per logical volume copy, 
to place the physical partitions belong to a copy as contiguously as possible, and then to place 
the physical partitions in the desired region specified by the -a flag. Also, by default, each copy 
of a logical partition is placed on a separate physical volume.


Using smitty:
-------------

# smit mklv 

or 

# smit mklvcopy

Using "smit mklv" you can create a new LV and at the same time tell the system to create a mirror
(2 or 3 copies) of each LP and which PV's are involved.

Using "smit mklvcopy" you can add mirrors to an existing LV.


Filesystems in AIX:
===================

After a VG is created, you can create filesystems. You can use smitty or the crfs and mkfs command.
File systems are confined to a single logical volume.

The journaled file system (JFS) and the enhanced journaled file system (JFS2) are built into the 
base operating system. Both file system types link their file and directory data to the structure 
used by the AIX Logical Volume Manager for storage and retrieval. A difference is that JFS2 is designed to accommodate 
a 64-bit kernel and larger files.

Run lsfs -v jfs2 to determine if your system uses JFS2 file systems. 
This command returns no output if it finds only standard file systems. 


crfs:
-----

crfs -v VfsType { -g VolumeGroup | -d Device } [ -l LogPartitions ]
     -m MountPoint [ -n NodeName ] [ -u MountGroup ] [ -A { yes | no } ] [ -p {ro | rw } ] 
     [ -a Attribute= Value ... ] [ -t { yes | no } ]


The crfs command creates a file system on a logical volume within a previously created volume group. 
A new logical volume is created for the file system unless the name of an existing logical volume is 
specified using the -d. An entry for the file system is put into the /etc/filesystems file.

crfs -v jfs -g(vg) -m(mount point) -a size=(size of fs) -A yes 
Will create a logical volume on the volume group and create the file system on 
the logical volume. All at the size stated. Will add entry into 
/etc/filesystems and will create the mount point directory if it does not exist. 

- To make a JFS on the rootvg volume group with nondefault fragment size and nondefault nbpi, enter:
# crfs  -v jfs  -g  rootvg  -m /test -a size=32768 -a frag=512 -a nbpi=1024

This command creates the /test file system on the rootvg volume group with a fragment size of 512 bytes, 
a number of bytes per i-node (nbpi) ratio of 1024, and an initial size of 16MB (512 * 32768).

- To make a JFS on the rootvg volume group with nondefault fragment size and nondefault nbpi, enter: 
# crfs -v jfs -g rootvg -m /test -a size=16M -a frag=512 -a nbpi=1024

This command creates the /test file system on the rootvg volume group with a fragment size of 512 bytes, 
a number of bytes per i-node (nbpi) ratio of 1024, and an initial size of 16MB. 

- To create a JFS2 file system which can support NFS4 ACLs, type: 
# crfs -v jfs2 -g rootvg -m /test -a size=1G -a ea=v2

- This command creates the /test JFS2 file system on the rootvg volume group with an initial size of 1 gigabyte. 
The file system will store extended attributes using the v2 format.
# crfs -v jfs -g backupvg -m /backups -a size=32G -a bf=true

# crfs -v jfs -g oravg -m /filetransfer -a size=4G -a bf=true


Extended example:
-----------------

The following command creates a JFS filesystem on a previously created LV "lv05".
In this example, suppose the LV was created in the following way:

# mklv -y lv05 -c 2 splvg 200

In this case, it is clear that we mirror each LP to 2 PP's (because of the -c 2).

Now to create a filesystem on lv05, we can use the command
# crfs -v jfs -d lv05 -m /spl -a bf=true

Note that we did not mentioned the size of the filesystem. This is because we use a previously defined LV
with a known size. 
 

Notes:

1. The option -a bf=true allows large files [ > 2Gb]; 

2. Specifying -m /<name> (like for example "/data") will create the entry in /etc/filesystems for you


Some more examples:
-------------------

Commands to create VG's:
mkvg oravg -d 10 -s 128 hdisk2 hdisk4
mkvg splvg -d 10 -s 128 hdisk3 hdisk5
mkvg softwvg -d 10 -s 128 hdisk6
mkvg backupvg -d 10 -s 128 hdisk7

Set of Create Logical Volume and Filesystem commands:	

# crfs -v jfs -g <Vgname> -m <Mountpoint> -a size=xG -a bf=true
or
# mklv -y <LV_name> -c 2 <VG_name> No_Of_PPs
# crfs -v jfs -d <LV_name> -m <MountPoint> -a bf=true

		
# mklv -y lv05 -c 2 splvg 300			
# crfs -v jfs -d lv05 -m /spl -a bf=true			
# mklv -y lv06 -c 2 splvg 100			
# crfs -v jfs -d lv06 -m /u04 -a bf=true			
			
# mklv -y lv02 -c 2 oravg 200			
# mklv -y lv03 -c 2 oravg 200			
# mklv -y lv04 -c 2 oravg 200			
# crfs -v jfs -d lv02 -m /u01 -a bf=true			
# crfs -v jfs -d lv03 -m /u02 -a bf=true			
# crfs -v jfs -d lv04 -m /u03 -a bf=true			
			
# crfs -v jfs -g backupvg -m /backups -a size=33G -a bf=true			
# crfs -v jfs -g backupvg -m /data -a size=33G -a bf=true			
# crfs -v jfs -g softwvg -m /apps -a size=16G -a bf=true			
# crfs -v jfs -g softwvg -m /software -a size=33G -a bf=true			
# crfs -v jfs -g softwvg -m /u05 -a size=12G -a bf=true			


mkfs:
-----

The mkfs command makes a new file system on a specified device. The mkfs command initializes the volume label, 
file system label, and startup block.

The Device parameter specifies a block device name, raw device name, or file system name. If the parameter 
specifies a file system name, the mkfs command uses this name to obtain the following parameters from the 
applicable stanza in the /etc/filesystems file, unless these parameters are entered with the mkfs command.

- To specify the volume and file system name for a new file system, type: 
# mkfs  -lworks  -vvol001 /dev/hd3

This command creates an empty file system on the /dev/hd3 device, giving it the volume serial number vol001 
and file system name works. The new file system occupies the entire device. 
The file system has a default fragment size (4096 bytes) and a default nbpi ratio (4096). 

- To create a file system with nondefault attributes, type: 
# mkfs  -s 8192  -o nbpi=2048,frag=512 /dev/lv01

This command creates an empty 4 MB file system on the /dev/lv01 device with 512-byte fragments and 
1 i-node for each 2048 bytes. 

-To create a large file enabled file system, type: 
# mkfs -V jfs -o nbpi=131072,bf=true,ag=64 /dev/lv01

This creates a large file enabled JFS file system with an allocation group size of 64 megabytes and 1 inode 
for every 131072 bytes of disk. The size of the file system will be the size of the logical volume lv01.

- To create a file system with nondefault attributes, type: 
# mkfs -s 4M -o nbpi=2048, frag=512 /dev/lv01

This command creates an empty 4 MB file system on the /dev/lv01 device with 512-byte fragments and one i-node 
for each 2048 bytes. 

- To create a JFS2 file system which can support NFS4 ACLs, type: 
# mkfs -V jfs2 -o ea=v2 /dev/lv01

This command creates an empty file system on the /dev/lv01 device with v2 format for extended attributes.


chfs command:
-------------

- Example 1:

How do I change the size of a filesystem? 

To increase /usr filesystem size by 1000000 512-byte blocks, type:
# chfs -a size=+1000000 /usr
- Example 2:

To split off a copy of a mirrored file system and mount it read-only for use as an online backup, enter: 
# chfs -a splitcopy=/backup -a copy=2 /testfs
This mount a read-only copy of /testfs at /backup.

- Example 3:

To change the mount point of a file system, enter: 
# chfs  -m /test2 /test
This command changes the mount point of a file system from /test to /test2. 

- Eaxample 4:

# chfs -a size=+20G /data/udb/eidwha2/eddwha2/DATA03

- Example 5:

chfs -a size=+5M /opt


 would do it this way:

1) chfs -m old_filename new_filename

2) umount old_filename

3) mount new_filename

To stop or kill access to a fs, use:
fuser -xuc /scratch


lsfs command:
-------------

Displays the characteristics of file systems.

Syntax
lsfs [ -q ] [ -c | -l ] [ -a | -v VfsType | -u MountGroup| [FileSystem...] ]

Description
The lsfs command displays characteristics of file systems, such as mount points, automatic mounts, permissions, 
and file system size. The FileSystem parameter reports on a specific file system. 
The following subsets can be queried for a listing of characteristics:

All file systems 
All file systems of a certain mount group 
All file systems of a certain virtual file system type 
One or more individual file systems

The lsfs command displays additional Journaled File System (JFS) or Enhanced Journaled File System (JFS2) 
characteristics if the -q flag is specified.

To show all file systems in the /etc/filesystems file, enter: 
#lsfs

To show all file systems of vfs type jfs, enter: 
#lsfs  -v jfs

To show the file system size, the fragment size, the compression algorithm (if any), and the 
number of bytes per i-node as recorded in the superblock of the root file system, enter: 
#lsfs  -q /


SAN connection via SDD, and related commands:
=============================================

If you use advanced storage on AIX, the workings on disks and volume groups are a bit different
from the traditional ways, using local disks, as described above. 

You can use SDD or SDDPCM Multipath IO. This section describes SDD. See section 31.5 for SDDPCM.


Overview of the Subsystem device driver:
----------------------------------------

The IBM System Storage Multipath Device Driver SDD provides multipath configuration environment support
for a host system that is attached to storage devices. It provides:

-Enhanced data availability 
-Automatic path failover and recovery to an alternate path 
-Dynamic load balancing of multiple paths 
-Concurrent microcode upgrade.

The IBM System Storage Multipath Subsystem Device Driver Path Control Module SDDPCM provides
AIX MPIO support. Its a loadable module. During the configuration of supported devices, SDDPCM is loaded
and becomes part of the AIX MPIO Fibre Channel protocol device driver. The AIX MPIO-capable device driver
with the SDDPCM module provides the same functions that SDD provides.

Note that before attempting to exploit the Virtual shared disk support for the Subsystem device driver, 
you must read IBM Subsystem Device Driver Installation and User's Guide.

An SDD implementation is available for AIX, Solaris, HP-UX, some Linux distro's, Windows 200x.

An impression about the architecture on AIX can be seen in the following figure:


               -------------------------------
               | Host System                 |
               | -------             ------- |
               | |FC 0 |             | FC 1| |
               | -------             ------- |
               -------------------------------
                    |                   |
                    |                   |
              ----------------------------------
          ESS |  --------         --------    |
              |  |port 0|         |port 1|    |
              |  -------- \      /--------    |
              |      |      \   /      |      | 
              |      |        \/       |      |
              |      |        / \      |      |
              |   -----------/    \---------- |
              |   |Cluster 1|      |Cluster 2||
              |   -----------      -----------|
              |    |  |  |  |       | | |  |  |
              |    |  |  |  |       | | |  |  |
              |    O--|--|--|-------| | |  |  |           
              |   lun0|  |  |         | |  |  |
              |       O--|--|---------| |  |  |
              |      lun1|  |           |  |  |
              |          O--|-----------|  |  |
              |         lun2|              |  |
              |             O--------------|  |
              |            lun3               |
              ---------------------------------


DPO (Data Path Optimizer) was renamed by IBM a couple years ago- and became SDD (Subsystem Device Driver). 
When redundant paths are configured to ESS logical units, and the SDD is installed and configured, 
the AIX(R) lspv command shows multiple hdisks as well as a new construct called a vpath. The hdisks and vpaths 
represent the same logical unit. You will need to use the lsvpcfg command to get more information. 

Each SDD vpath device represents a unique physical device on the storage server.
Each physical device is presented to the operating system as an operating system disk device.
So, essentially, a vpath device acts like a disk.

You will see later on that a hdisk is actually a "path" to a LUN, that can be reached either by fscsi0 or fscsi1.
Also you will see that a vpath represents the LUN.

SDD does not support multipathing to a bootdevice.

Support for VIO:
----------------

Starting from SDD version 1.6.2.0, a unique ID attribute is added to SDD vpath devices, in order to 
support AIX5.3 VIO future features. AIX device configure methods have been changed in both AIX52 TL8 and 
AIX53 TL4 for this support.


Examples:
---------

For example, after issuing lspv, you see output similar to this:

# lspv
hdisk0          000047690001d59d      rootvg
hdisk1          000047694d8ce8b6      None
hdisk18         000047694caaba22      None
hdisk19         000047694caadf9a      None
hdisk20         none                  None
hdisk21         none                  None
hdisk22         000047694cab2963      None
hdisk23         none                  None
hdisk24         none                  None
vpath0          none                  None
vpath1          none                  None
vpath2          000047694cab0b35      gpfs1scsivg
vpath3          000047694cab1d27      gpfs1scsivg


After issuing lsvpcfg, you see output similar to this:

# lsvpcfg
vpath0 (Avail ) 502FCA01 = hdisk18 (Avail pv )
vpath1 (Avail ) 503FCA01 = hdisk19 (Avail pv )
vpath2 (Avail pv gpfs1scsivg) 407FCA01 = hdisk20 (Avail ) hdisk24 (Avail )


The examples above illustrate some important points:

- vpath0 consists of a single path (hdisk18) and therefore will not provide failover protection. 
Also, hdisk18 is defined to AIX as a physical volume (pv flag) and has a PVID, as you can see from the output 
of the lspv command. Likewise for vpath1.

- vpath2 has two paths (hdisk20 and hdisk24) and has a volume group defined on it. Notice that with the 
lspv command, hdisk20 and hdisk24 look like newly installed disks with no PVIDs. The lsvpcfg command had 
to be used to determine that hdisk20 and hdisk24 make up vpath2, which has a PVID.

Warning: so be very carefull not to use a hdisk for a "local" VG, if its already used for a vpath.


Other Example:
--------------

# lspv
 hdisk0          00c49e8c8053fe86                    rootvg          active
 hdisk1          00c49e8c841a74d5                    rootvg          active
-hdisk2          none                                None
-hdisk3          none                                None
 vpath0          00c49e8c94c02c15                    datavg          active
 vpath1          00c49e8c94c050d4                    appsvg          active
-hdisk4          none                                None
 vpath2          00c49e8c2806dc22                    appsvg          active
-hdisk5          none                                None
-hdisk6          none                                None
-hdisk7          none                                None


# lsvpcfg

vpath0 (Avail pv datavg) 75BAFX1006C = hdisk2 (Avail ) hdisk5 (Avail )
vpath1 (Avail pv appsvg) 75BAFX1017B = hdisk3 (Avail ) hdisk6 (Avail )
vpath2 (Avail pv appsvg) 75BAFX10329 = hdisk4 (Avail ) hdisk7 (Avail )


# datapath query adapter

Active Adapters :2

Adpt#     Name   State     Mode             Select     Errors  Paths  Active
    0   fscsi0  NORMAL   ACTIVE           12611291          0      3       3
    1   fscsi1  NORMAL   ACTIVE           13375287          0      3       3


# datapath query device

Total Devices : 3


DEV#:   0  DEVICE NAME: vpath0  TYPE: 2107900         POLICY:    Optimized  # this is vpath0
SERIAL: 75BAFX1006C
==========================================================================
Path#      Adapter/Hard Disk          State     Mode     Select     Errors
    0          fscsi0/hdisk2           OPEN   NORMAL   12561763          0
    1          fscsi1/hdisk5           OPEN   NORMAL   13324883          0

DEV#:   1  DEVICE NAME: vpath1  TYPE: 2107900         POLICY:    Optimized
SERIAL: 75BAFX1017B
==========================================================================
Path#      Adapter/Hard Disk          State     Mode     Select     Errors
    0          fscsi0/hdisk3           OPEN   NORMAL      28024          0
    1          fscsi1/hdisk6           OPEN   NORMAL      28847          0

DEV#:   2  DEVICE NAME: vpath2  TYPE: 2107900         POLICY:    Optimized
SERIAL: 75BAFX10329
==========================================================================
Path#      Adapter/Hard Disk          State     Mode     Select     Errors
    0          fscsi0/hdisk4           OPEN   NORMAL      21672          0
    1          fscsi1/hdisk7           OPEN   NORMAL      21712          0


# lsattr -El vpath0
active_hdisk  hdisk2/75BAFX1006C/fscsi0        Active hdisk               False
active_hdisk  hdisk5/75BAFX1006C/fscsi1        Active hdisk               False
policy        df                               Scheduling Policy          True
pvid          00c49e8c94c02c150000000000000000 Physical volume identifier False
serial_number 75BAFX1006C                      LUN serial number          False


# lsdev -Cc adapter
ent0      Available 04-08 10/100/1000 Base-TX PCI-X Adapter (14106902)
ent1      Available 06-08 10/100/1000 Base-TX PCI-X Adapter (14106902)
fcs0      Available 05-08 FC Adapter
fcs1      Available 07-08 FC Adapter
sa0       Available       LPAR Virtual Serial Adapter
sisscsia0 Available 03-08 PCI-X Ultra320 SCSI Adapter


# lsattr -El fcs0
bus_intr_lvl  131193     Bus interrupt level                                False
bus_io_addr   0xcfc00    Bus I/O address                                    False
bus_mem_addr  0xc0040000 Bus memory address                                 False
init_link     al         INIT Link flags                                    True
intr_priority 3          Interrupt priority                                 False
lg_term_dma   0x800000   Long term DMA                                      True
max_xfer_size 0x100000   Maximum Transfer Size                              True
num_cmd_elems 200        Maximum number of COMMANDS to queue to the adapter True
pref_alpa     0x1        Preferred AL_PA                                    True
sw_fc_class   2          FC Class for Fabric                                True


# lscfg -lv fcs0
  fcs0             U7879.001.DQDKCPR-P1-C2-T1  FC Adapter

        Part Number.................03N6441
        EC Level....................A
        Serial Number...............1D54508045
        Manufacturer................001D
        Feature Code................280B
        FRU Number.................. 03N6441
        Device Specific.(ZM)........3
        Network Address.............10000000C94F91CD
        ROS Level and ID............0288193D
        Device Specific.(Z0)........1001206D
        Device Specific.(Z1)........00000000
        Device Specific.(Z2)........00000000
        Device Specific.(Z3)........03000909
        Device Specific.(Z4)........FF801412
        Device Specific.(Z5)........0288193D
        Device Specific.(Z6)........0683193D
        Device Specific.(Z7)........0783193D
        Device Specific.(Z8)........20000000C94F91CD
        Device Specific.(Z9)........TS1.90X13
        Device Specific.(ZA)........T1D1.90X13
        Device Specific.(ZB)........T2D1.90X13
        Device Specific.(YL)........U7879.001.DQDKCPR-P1-C2-T1


# lsdev -Cc adapter -F 'name parent'
ent0      pci4
ent1      pci6
fcs0      pci5
fcs1      pci7
sa0
sisscsia0 pci3


# lsdev -Cc disk -F 'name location'
hdisk0 03-08-00-3,0
hdisk1 03-08-00-5,0
hdisk2 05-08-01 ------------------------>|
hdisk3 05-08-01 ------------------------>|
hdisk4 05-08-01 ------------------------>|
hdisk5 07-08-01                          |
hdisk6 07-08-01                          |
hdisk7 07-08-01                          |
vpath0                                   |
vpath1                                   |
vpath2                                   |
                                         |
                                         |
# lsdev -Cc driver -F 'name location'    |
dpo                                      |
fcnet0 05-08-02                          |
fcnet1 07-08-02                          |
fscsi0 05-08-01 <-------------------------
fscsi1 07-08-01
iscsi0
scsi0  03-08-00

Please note that, for example, from the above output, that fsci0 can be "linked" to hdisk2, hdisk3 and hdisk4,
due to the location code.
You can compare that to the output of "datapath query device".
Also interesting can be the following:

# lsdev -C | grep fc
fcnet0      Defined   05-08-02      Fibre Channel Network Protocol Device
fcnet1      Defined   07-08-02      Fibre Channel Network Protocol Device
fcs0        Available 05-08         FC Adapter
fcs1        Available 07-08         FC Adapter

# lsdev -C | grep fsc
fscsi0      Available 05-08-01      FC SCSI I/O Controller Protocol Device
fscsi1      Available 07-08-01      FC SCSI I/O Controller Protocol Device

From this, you can see that fcs0 is the "parent" of the child "fsci0".


# lsattr -D -l fscsi0
attach       none         How this adapter is CONNECTED         False
dyntrk       no           Dynamic Tracking of FC Devices        True
fc_err_recov delayed_fail FC Fabric Event Error RECOVERY Policy True
scsi_id                   Adapter SCSI ID                       False
sw_fc_class  3            FC Class for Fabric                   True

# lsattr -D -l fcs0
bus_intr_lvl             Bus interrupt level                                Fals                                              e
bus_io_addr   0x00010000 Bus I/O address                                    Fals                                              e
bus_mem_addr  0x01000000 Bus memory address                                 Fals                                              e
init_link     al         INIT Link flags                                    True
intr_priority 3          Interrupt priority                                 Fals                                              e
lg_term_dma   0x800000   Long term DMA                                      True
max_xfer_size 0x100000   Maximum Transfer Size                              True
num_cmd_elems 200        Maximum number of COMMANDS to queue to the adapter True
pref_alpa     0x1        Preferred AL_PA                                    True
sw_fc_class   2          FC Class for Fabric                                True


# datapath query essmap
 Disk          Path  P     Location   adapter    LUN SN       Type           Size   LSS     Vol  Rank  C/A   S   Connection  port RaidMode
-------       -----  -   -----------  ------   -----------  ------------     ----   ----    ---  ----- ----  -   ----------- ---- --------
vpath0        hdisk2     05-08-01[FC] fscsi0   75BAFX1006C  IBM 2107-900  107.5GB     0    108   fff2   02   Y   R1-B3-H3-ZC  232 RAID5
vpath0        hdisk5     07-08-01[FC] fscsi1   75BAFX1006C  IBM 2107-900  107.5GB     0    108   fff2   02   Y   R1-B3-H3-ZA  230 RAID5
vpath1        hdisk3     05-08-01[FC] fscsi0   75BAFX1017B  IBM 2107-900   14.3GB     1    123   fff1   0b   Y   R1-B3-H3-ZC  232 RAID5
vpath1        hdisk6     07-08-01[FC] fscsi1   75BAFX1017B  IBM 2107-900   14.3GB     1    123   fff1   0b   Y   R1-B3-H3-ZA  230 RAID5
vpath2        hdisk4     05-08-01[FC] fscsi0   75BAFX10329  IBM 2107-900   14.3GB     3     41   ffe1   08   Y   R1-B3-H3-ZC  232 RAID5
vpath2        hdisk7     07-08-01[FC] fscsi1   75BAFX10329  IBM 2107-900   14.3GB     3     41   ffe1   08   Y   R1-B3-H3-ZA  230 RAID5

From this you can see that a hdisk is actually a "path" to a LUN, that can be reached either by fscsi0 or fscsi1.
Also you can see that a vpath represents the LUN.
 

# datapath query adaptstats

Adapter #:  0
=============
                Total Read  Total Write  Active Read  Active Write   Maximum
I/O:               9595892      4371836            0             0        23
SECTOR:          176489389    138699019            0             0      5128

Adapter #:  1
=============
                Total Read  Total Write  Active Read  Active Write   Maximum
I/O:              10238891      4523508            0             0        24
SECTOR:          188677891    143739157            0             0      5128


# datapath query portmap
                          BAY-1(B1)                BAY-2(B2)                BAY-3(B3)                BAY-4(B4)
   ESSID    DISK      H1   H2   H3   H4        H1   H2   H3   H4        H1   H2   H3   H4        H1   H2   H3   H4
                     ABCD ABCD ABCD ABCD      ABCD ABCD ABCD ABCD      ABCD ABCD ABCD ABCD      ABCD ABCD ABCD ABCD
                          BAY-5(B5)                BAY-6(B6)                BAY-7(B7)                BAY-8(B8)
                      H1   H2   H3   H4        H1   H2   H3   H4        H1   H2   H3   H4        H1   H2   H3   H4
                     ABCD ABCD ABCD ABCD      ABCD ABCD ABCD ABCD      ABCD ABCD ABCD ABCD      ABCD ABCD ABCD ABCD
 75BAFX1    vpath0   ---- ---- ---- ----      ---- ---- ---- ----      ---- ---- Y-Y- ----      ---- ---- ---- ----
 75BAFX1    vpath1   ---- ---- ---- ----      ---- ---- ---- ----      ---- ---- Y-Y- ----      ---- ---- ---- ----
 75BAFX1    vpath2   ---- ---- ---- ----      ---- ---- ---- ----      ---- ---- Y-Y- ----      ---- ---- ---- ----

Y  =  online/open               y = (alternate path) online/open
O  =  online/closed             o = (alternate path) online/closed
N  =  offline                   n = (alternate path) offline
-  =  path not configured
PD =  path down

Note: 2105 devices' essid has 5 digits, while 1750/2107 device's essid has 7 digits.


# datapath query wwpn
Adapter Name    PortWWN
fscsi0          10000000C94F91CD
fscsi1          10000000C94F9923


If you need to force the Subsystem Device Driver (SDD), or equivalent driver, to rescan and map the new devices,
use the following command at the system prompt: 

# /usr/sbin/cfgvpath

Procedure to make a new lun available to AIX:
---------------------------------------------

-Allocate the new lun on the SAN 
-Run "cfgmgr" 
-Verify the new vpath/hdisk by running "lsvpcfg" 

There should be a new vpath and it should be available with no volume group - if not, rerun cfgmgr


Create Volume groups with vpaths:
---------------------------------

You should use the mkvg4vp command to create Volume Groups.

Example:

# mkvg4vp -B -t 32 -s 4 -y DB01_RECOV_VG1 vpath4 vpath10

By default, VG's can accommodate up to 255 LV's and 32 PV's. If the -B flag is used on the mkvg or mkvg4vp
command, the resulting VG will support up to 512 LV's and 128 PV's.
The -s flag, as usual, designates the Partition size.


SDD software on AIX:
--------------------

Starting with SDD 1.6.1.0, the SDD package for AIX53 is devices.sdd.53.rte and requires AIX53E 
with APAR IY76997.

Starting with SDD 1.6.2.0, the SDD package for AIX52 is devices.sdd.52.rte and requires AIX52M
with APAR IY76997.

See also in this document:
IBM Flash Alert: SDD 1.6.2.0 requires minimum AIX code levels; possible 0514-035 error

The SDD installation package installs a number of new commands, like datapath, chgvpath, lsvpcfg etc..

Before installing SDD, you should check firmware levels, and AIX APAR requirements. See the following sites: 

-- scsi and ESS, and Fiber:
www-1.ibm.com/servers/storage/support/
www-1.ibm.com/servers/eserver/support/unixservers/index.html 

-- AIX APAR:
www-03.ibm.com/servers/eserver/support/unixservers/aixfixes.html            or,
www.ibm.com/servers/eserver/support/pseries/aixfixes.html                   or,
www14.software.ibm.com/webapp/set2/sas/f/genunix3/aixfixes.html


SAN connections with SDDPCM MPIO:
=================================


This section covers some of the SDDPCM MPIO SAN connections. 
There are some different commands with this type
of connections to SAN storage.

The use of SDD or SDDPCM gives the AIX host the ability to access multiple paths to a single LUN 
within an ESS or SAN. This ability to access a single LUN on multiple paths allows for a higher degree of 
data availability in the event of a path failure. Data can continue to be accessed within the ESS 
as long as there is at least one available path. Without one of these installed, you will lose access 
to the LUN in the event of a path failure. 

If you have "sdd" installed use the datapath command, and with sddpcm use the pcmpath command.

Just as the commands shown in section 31.4, just replace datapath with pcmpath, like


# pcmpath query device

DEV#:   2  DEVICE NAME: hdisk2  TYPE: 2107900  ALGORITHM:  Load Balance
SERIAL: 75065711100
==========================================================================
Path#      Adapter/Path Name          State     Mode     Select     Errors
    0           fscsi0/path0           OPEN   NORMAL       1240          0
    1           fscsi0/path1           OPEN   NORMAL       1313          0
    2           fscsi0/path2           OPEN   NORMAL       1297          0
    3           fscsi0/path3           OPEN   NORMAL       1294          0

DEV#:   3  DEVICE NAME: hdisk3  TYPE: 2107900  ALGORITHM:  Load Balance
SERIAL: 75065711101
==========================================================================
Path#      Adapter/Path Name          State     Mode     Select     Errors
    0           fscsi0/path0          CLOSE   NORMAL          0          0
    1           fscsi0/path1          CLOSE   NORMAL          0          0
    2           fscsi0/path2          CLOSE   NORMAL          0          0
    3           fscsi0/path3          CLOSE   NORMAL          0          0

DEV#:   4  DEVICE NAME: hdisk4  TYPE: 1750500  ALGORITHM:  Load Balance
SERIAL: 13AAGXA1101
==========================================================================
Path#      Adapter/Path Name          State     Mode     Select     Errors
    0*          fscsi0/path0           OPEN   NORMAL         12          0
    1           fscsi0/path1           OPEN   NORMAL       3787          0
    2*          fscsi1/path2           OPEN   NORMAL         17          0
    3           fscsi1/path3           OPEN   NORMAL       3822          0


# pcmpath query essmap


Some possible errors with pcmpath:

root@zd110l04:/root#pcmpath query device

Kernel extension sdduserke was not loaded. Errno=8.
Please verify SDDPCM device configuration.


On a system with SDDPCM, you will see the SDDPCM server daemon, "pcmsrv", running. 
This process checks available paths and does other checks and monitoring.

The process is under control of the resource controller, like for example starting and stopping it goes with

# stopsrc -s pcmsrv
# startsrc -s pcmsrv

The process is started on boot from inittab:

# cat /etc/inittab | grep pcmsrv
srv:2:wait:/usr/bin/startsrc -s pcmsrv > /dev/null 2>&1


>>>> Filesystems in Linux:
==========================

Disks:
======

Linux on x86 systems, have the following (storage) devices:

-- Entire harddisks are listed as devices without numbers, such as "/dev/hda" or "/dev/sda".

- IDE:

/dev/hda    is the primary IDE master drive,
/dev/hdb    is the primary IDE slave drive,
/dev/hdc    is the secondary IDE master,
/dev/hdd    is the secondary IDE slave,

- SCSI:
/dev/sda   is the first SCSI interface and 1st device id number
etc..

-- Partitions on a disk are referred to with a number such as

/dev/hda1


Floppydrive:

/dev/fd0
# mount -t auto /dev/fd0 /mnt/floppy
# mount -t vfat /dev/fd0 /mnt/floppy
# mount /dev/fd0 /mnt/floppy

Zipdrive:

# insmod ppa       # load the module
# mount -t vfat /dev/sda /mnt/zip


Filesystems:
============

Linux supports a huge number of filesystems, including FAT, JFS, NTFS etc.. But the most common are ext2 and ext3.
For the "native" filesystems, we take a look at the following FS's:

- ReiserFS   
A journaled filesystem

- Ext2
The most popular filesystem for years. But it does not use a log/jounal,
so gradually it becomes less important.

- Ext3
Very related to Ext2, but this one supports journaling.
An Ext2 filesystem can easily be upgraded to Ext3.


Adding a disk in Linux (traditional way, No LVM):
=================================================

Suppose you have SCSI card on with a disk is attached.  
The disk as a whole would be refferred to as "/dev/sda" and the
first partition would be referred to as "/dev/sda1".

But we have a new disk here.
If you cannot find the device files /dev/sda in /dev, you might
create it with the /dev/MAKEDEV script:

# cd /dev
# ./MAKEDEV sda

The disk is now ready to be partitioned. In this example, we plan
to create 3 partitions, including a swap partition.

# fdisk /dev/sda
The number of cylinders for this disk is set to ..
(.. more output..)
Command:

The fdisk program is interactive; pressing m displays a list of all its commands.

Command: new
Command action
  e extended
  p primary partition (1-4): 1
(.. more output..)

Command: print

Device           Boot    Start   End   Blocks   Id   System
/dev/sda1                1       255   2048256  83   Linux

So we have created our first partition.
We now create the swap partition:

Command: new
Command action
  e extended
  p primary partition (1-4): 2
(.. more output..)

Command: type
Partition number (1-4): 2
Hex code: 82              # which is a Linix swap partition
Changed system type of partition 2 to 82 (Linux swap)

The third partition can be created in a similar way.
We now would like to see a listing of our partitions

Command: print

Device           Boot    Start   End   Blocks   Id   System
/dev/sda1                1       255   2048256  83   Linux
/dev/sda2                256     511   2056320  82   Swap
/dev/sda3                512    5721  41849325  83   Linux


Now, save the label to the disk:

Command: write
(.. more output..)

Ofcourse, we now would like to create the filesystems and the swap.

If you want to use the Ext2 filesystem on partition one, use the following command:

# mke2fs /dev/sda1 2048256       ( or # mkfs -t ext2 -b 4096 /dev/sda1 )

Lets check the filesystem with fsck:
# fsck -f /dev/sda1

A new filesystem can be mounted as soon as the mount point is created.

# mkdir /bkroot
# mount /dev/sda1 /bkroot

Lets now create the swap space:
# mkswap -c /dev/sda2 2056320

and activate it using the command:

# swapon /dev/sda2

See also section 34.3 for administering swap space on Linux.


>>>> Notes about Linux and LVM:
===============================


Note 1:
=======


-What is RAID and LVM 
-Initial setup of a RAID-5 array 
-Initial setup of LVM on top of RAID 
-Handling a Drive Failure 
-Common Glitches 
-Other Useful Resources 
-Expanding an Array/Filesytem 

--------------------------------------------------------------------------------

-What is RAID and LVM
RAID is usually defined as Redundant Array of Inexpensive disks. It is normally used to spread data among several 
physical hard drives with enough redundancy that should any drive fail the data will still be intact. 
Once created a RAID array appears to be one device which can be used pretty much like a regular partition. 
There are several kinds of RAID but I will only refer to the two most common here. 
The first is RAID-1 which is also known as mirroring. With RAID-1 it's basically done with two essentially 
identical drives, each with a complete set of data. The second, the one I will mostly refer to in this guide 
is RAID-5 which is set up using three or more drives with the data spread in a way that any one drive failing 
will not result in data loss. The Red Hat website has a great overview of the RAID Levels. 

There is one limitation with Linux Software RAID that a /boot parition can only reside on a RAID-1 array. 

Linux supports both several hardware RAID devices but also software RAID which allows you to use any IDE or 
SCSI drives as the physical devices. In all cases I'll refer to software RAID. 

LVM stands for Logical Volume Manager and is a way of grouping drives and/or partition in a way where instead 
of dealing with hard and fast physical partitions the data is managed in a virtual basis where the virtual 
partitions can be resized. The Red Hat website has a great overview of the Logical Volume Manager. 

There is one limitation that a LVM cannot be used for the /boot. 


--------------------------------------------------------------------------------

Initial set of a RAID-5 array
I recommend you experiment with setting up and managing RAID and LVM systems before using it on an 
important filesystem. One way I was able to do it was to take old hard drive and create a bunch of 
partitions on it (8 or so should be enough) and try combining them into RAID arrays. 
In my testing I created two RAID-5 arrays each with 3 partitions. You can then manually fail and hot remove 
the partitions from the array and then add them back to see how the recovery process works. You'll get a warning 
about the partitions sharing a physical disc but you can ignore that since it's only for experimentation. 
In my case I have two systems with RAID arrays, one with two 73G SCSI drives running RAID-1 (mirroring) and my other 
test system is configured with three 120G IDE drives running RAID-5. In most cases I will refer to my RAID-5 
configuration as that will be more typical. 

I have an extra IDE controller in my system to allow me to support the use of more than 4 IDE devices which caused a very odd drive assignment. 
The order doesn't seem to bother the Linux kernel so it doesn't bother me. My basic configuration is as follows: 

hda 120G drive
hdb 120G drive
hde 60G boot drive not on RAID array
hdf 120G drive
hdg CD-ROM drive

The first step is to create the physical partitions on each drive that will be part of the RAID array. 
In my case I want to use each 120G drive in the array in it's entirety. All the drives are partitioned identically 
so for example, this is how hda is partitioned: 

Disk /dev/hda: 120.0 GB, 120034123776 bytes
16 heads, 63 sectors/track, 232581 cylinders
Units = cylinders of 1008 * 512 = 516096 bytes

   Device Boot      Start         End      Blocks   Id  System
/dev/hda1   *           1      232581   117220792+  fd  Linux raid autodetect

So now with all three drives with a partitioned with id fd Linux raid autodetect you can go ahead and combine 
the paritions into a RAID array: 

# /sbin/mdadm --create --verbose /dev/md0 --level=5 --raid-devices=3 \
	/dev/hdb1 /dev/hda1 /dev/hdf1

Wow, that was easy. That created a special device /dev/md0 which can be used instead of a physical parition. 
You can check on the status of that RAID array with the mdadm command: 

# /sbin/mdadm --detail /dev/md0
        Version : 00.90.01
  Creation Time : Wed May 11 20:00:18 2005
     Raid Level : raid5
     Array Size : 234436352 (223.58 GiB 240.06 GB)
    Device Size : 117218176 (111.79 GiB 120.03 GB)
   Raid Devices : 3
  Total Devices : 3
Preferred Minor : 0
    Persistence : Superblock is persistent

    Update Time : Fri Jun 10 04:13:11 2005
          State : clean
 Active Devices : 3
Working Devices : 3
 Failed Devices : 0
  Spare Devices : 0

         Layout : left-symmetric
     Chunk Size : 64K

           UUID : 36161bdd:a9018a79:60e0757a:e27bb7ca
         Events : 0.10670

    Number   Major   Minor   RaidDevice State
       0       3        1        0      active sync   /dev/hda1
       1       3       65        1      active sync   /dev/hdb1
       2      33       65        2      active sync   /dev/hdf1

The important lines to see are the State line which should say clean otherwise there might be a problem. 
At the bottom you should make sure that the State column always says active sync which says each device 
is actively in the array. You could potentially have a spare device that's on-hand should any drive should fail. 
If you have a spare you'll see it listed as such here. 
One thing you'll see above if you're paying attention is the fact that the size of the array is 240G but I 
have three 120G drives as part of the array. That's because the extra space is used as extra parity data that is 
needed to survive the failure of one of the drives. 


--------------------------------------------------------------------------------

- Initial set of LVM on top of RAID
Now that we have /dev/md0 device you can create a Logical Volume on top of it. Why would you want to do that? 
If I were to build an ext3 filesystem on top of the RAID device and someday wanted to increase it's capacity 
I wouldn't be able to do that without backing up the data, building a new RAID array and restoring my data. 
Using LVM allows me to expand (or contract) the size of the filesystem without disturbing the existing data. 
Anyway, here are the steps to then add this RAID array to the LVM system. The first command pvcreate will 
"initialize a disk or parition for use by LVM". The second command vgcreate will then create the Volume Group, 
in my case I called it lvm-raid: 

# pvcreate /dev/md0
# vgcreate lvm-raid /dev/md0

The default value for the physical extent size can be too low for a large RAID array. In those cases you'll need 
to specify the -s option with a larger than default physical extent size. The default is only 4MB as of the 
version in Fedora Core 5. For example, to successfully create a 550G RAID array a size of 2G works well: 

# vgcreate -s 2G <volume group name>

Ok, you've created a blank receptacle but now you have to tell how many Physical Extents from the 
physical device (/dev/md0 in this case) will be allocated to this Volume Group. In my case I wanted all the data 
from /dev/md0 to be allocated to this Volume Group. If later I wanted to add additional space I would create 
a new RAID array and add that physical device to this Volume Group. 
To find out how many PEs are available to me use the vgdisplay command to find out how many are available 
and now I can create a Logical Volume using all (or some) of the space in the Volume Group. 
In my case I call the Logical Volume lvm0. 

# vgdisplay lvm-raid
	.
	.
   Free  PE / Size       57235 / 223.57 GB

# lvcreate -l 57235 lvm-raid -n lvm0

In the end you will have a device you can use very much like a plain 'ol parition called /dev/lvm-raid/lvm0. 
You can now check on the status of the Logical Volume with the lvdisplay command. The device can then be used to to create a filesystem on. 

# lvdisplay /dev/lvm-raid/lvm0 
  --- Logical volume ---
  LV Name                /dev/lvm-raid/lvm0
  VG Name                lvm-raid
  LV UUID                FFX673-dGlX-tsEL-6UXl-1hLs-6b3Y-rkO9O2
  LV Write Access        read/write
  LV Status              available
  # open                 1
  LV Size                223.57 GB
  Current LE             57235
  Segments               1
  Allocation             inherit
  Read ahead sectors     0
  Block device           253:2

# mkfs.ext3 /dev/lvm-raid/lvm0
	.
	.
# mount /dev/lvm-raid/lvm0 /mnt

# df -h /mnt
Filesystem            Size  Used Avail Use% Mounted on
/dev/mapper/lvm--raid-lvm0
                       224G   93M  224G   1% /mnt


--------------------------------------------------------------------------------

- Handling a Drive Failure
As everything eventually does break (some sooner than others) a drive in the array will fail. It is a very good idea 
to run smartd on all drives in your array (and probably ALL drives period) to be notified of a failure 
or a pending failure as soon as possible. You can also manually fail a partition, meaning to take it out 
of the RAID array, with the following command: 

# /sbin/mdadm /dev/md0 -f /dev/hdb1
mdadm: set /dev/hdb1 faulty in /dev/md0

Once the system has determined a drive has failed or is otherwise missing (you can shut down and pull out a drive 
and reboot to similate a drive failure or use the command to manually fail a drive above it will show something 
like this in mdadm: 

# /sbin/mdadm --detail /dev/md0
     Update Time : Wed Jun 15 11:30:59 2005
           State : clean, degraded
  Active Devices : 2
 Working Devices : 2
  Failed Devices : 1
   Spare Devices : 0
	.
	.
     Number   Major   Minor   RaidDevice State
        0       3        1        0      active sync   /dev/hda1
        1       0        0        -      removed
        2      33       65        2      active sync   /dev/hdf1

You'll notice in this case I had /dev/hdb fail. I replaced it with a new drive with the same capacity and was able 
to add it back to the array. The first step is to partition the new drive just like when first creating the array. 
Then you can simply add the partition back to the array and watch the status as the data is rebuilt onto the newly replace drive. 

# /sbin/mdadm /dev/md0 -a /dev/hdb1
# /sbin/mdadm --detail /dev/md0
     Update Time : Wed Jun 15 12:11:23 2005
           State : clean, degraded, recovering
  Active Devices : 2
 Working Devices : 3
  Failed Devices : 0
   Spare Devices : 1

          Layout : left-symmetric
      Chunk Size : 64K

  Rebuild Status : 2% complete
	.
	.

During the rebuild process the system performance may be somewhat impacted but the data should remain in-tact. 
--------------------------------------------------------------------------------

- Expanding an Array/Filesytem
The answer to how to expand a RAID-5 array is very simple: You can't. 
I'm used to working with a NetApp Filer where you plug in a drive, type a simple command and that drive was added 
to the existing RAID array, no muss, no fuss. While you can't add space to a RAID-5 array directly in Linux you CAN 
add space to an existing Logical Volume and then expand the ext3  filesytem on top of it. That's the main reason you 
want to run LVM on top of RAID. 

Before you start it's probably a good idea to back up your data just in case something goes wrong. 

Assuming you want your data to be protected from a drive failing you'll need to create another RAID array 
per the instructions above. In my case I called it /dev/md1  so after partitioning I can create the array: 

# /sbin/mdadm --create --verbose /dev/md1 --level=5 --raid-devices=3 \
	/dev/hde1 /dev/hdg1 /dev/hdh1
# /sbin/mdadm --detail /dev/md1

The next couple steps will add the space from the new RAID array to the space available to be used by Logical Volumes. 
You then check to see how many Physical Extents you have and add them to the Logical Volume you're using. 
Remember that since you can have multiple Logical Volumes on top of a physical RAID array you need to do this extra step. 

# vgextend lvm-raid /dev/md1
# vgdisplay lvm-raid
	.
	.
	.
  Alloc PE / Size       57235 / 223.57 GB
  Free  PE / Size       57235 / 223.57 GB
# lvextend -l 57235 lvm-raid -n lvm0

There, you now have a much larger Logical Volume which is using space on two separate RAID arrays. 
You're not done yet, you now have to extend your filesystem to make use of all that new space. Fortunately this 
is easy on FC4 and RHEL4 since there is a command to expand a ext3  filesytem without even unmounting it! 
Be patient, expanding the file system takes a while. 

# lvdisplay /dev/lvm-raid/lvm0
	.
	.
  LV Size                447.14 GB
	.
# df /raid-array
Filesystem           1K-blocks      Used Available Use% Mounted on
/dev/mapper/lvm--raid-lvm0
                     230755476  40901348 178132400  19% /raid-array
# ext2online /dev/lvm-raid1/lvm0 447g
Get yourself a sandwich
# df /raid-array
Filesystem           1K-blocks      Used Available Use% Mounted on
/dev/mapper/lvm--raid-lvm0
                     461510952  40901348 40887876   9% /raid-array

Congrats, you now have more space. Now go fill it with something. 


Note 2:
=======

Creating a LVM in Linux
 

I am sure anybody who have used windows (2000 and above) have come across the term dynamic disks. 
Linux/Unix also have its own dynamic disk management called LVM.

What is an LVM ?

LVM stands for Logical Disk Manager which is the fundamental way to manage UNIX/Linux storage systems 
in a scalable manner. An LVM abstracts disk devices into pools of storage space called Volume Groups. 
These volume groups are in turn subdivided into virtual disks called Logical Volumes. The logical volumes 
may be used just like regular disks with filesystem created on them and mounted in the Unix/Linux 
filesystem tree. The logical volumes can span multiple disks. Even though a lot of companies have implemented 
their own LVM's for *nixes, the one created by Open Software Foundation (OSF) was integrated into many 
Unix systems which serves as a base for the Linux implementation of LVM.

Note: Sun Solaris ships with LVM from Veritas which is substantially different from the OSF implementation.

Benefits of Logical Volume Management

LVM created in conjunction with RAID can provide fault tolerance coupled with scalability and easy disk management. 
Create a logical volume and filesystem which spans multiple disks.

By creating virtual pools of space, an administrator can create dozens of small filesystems for different projects 
and add space to them as needed without (much) disruption. When a project ends, he can remove the space a
nd put it back into the pool of free space.

Note : Before you move to implement LVM's in linux, make sure your kernel is 2.4 and above. Or else you will have 
to recompile your kernel from source to include support for LVM.

LVM Creation
To create a LVM, we follow a three step process.

Step One : We need to select the physical storage resources that are going to be used for LVM. Typically, these 
are standard partitions but can also be Linux software RAID volumes that we've created. In LVM terminology, 
these storage resources are called "physical volumes" (eg: /dev/hda1, /dev/hda2 ... etc).

Our first step in setting up LVM involves properly initializing these partitions so that they can be recognized 
by the LVM system. This involves setting the correct partition type (usually using the fdisk command, and entering 
the type of partition as 'Linux LVM' - 0x8e ) if we're adding a physical partition; and then running 
the pvcreate command.

# pvcreate /dev/hda1 /dev/hda2 /dev/hda3
# pvscan

The above step creates a physical volume from 3 partitions which I want to initialize for inclusion 
in a volume group.

Step Two : Creating a volume group. You can think of a volume group as a pool of storage that consists of one 
or more physical volumes. While LVM is running, we can add physical volumes to the volume group or even remove them.

First initialize the /etc/lvmtab and /etc/lvmtab.d files by running the following command:

# vgscan

Now you can create a volume group and assign one or more physical volumes to the volume group.

# vgcreate my_vol_grp /dev/hda1 /dev/hda2

Behind the scenes, the LVM system allocates storage in equal-sized "chunks", called extents. 
We can specify the particular extent size to use at volume group creation time. The size of an extent 
defaults to 4Mb, which is perfect for most uses.You can use the -s flag to change the size of the extent. 
The extent affects the minimum size of changes which can be made to a logical volume in the volume group, 
and the maximum size of logical and physical volumes in the volume group. A logical volume can contain at most 
65534 extents, so the default extent size (4 MB) limits the volume to about 256 GB; a size of 1 TB would require 
extents of atleast 16 MB. So to accomodate a 1 TB size, the above command can be rewriten as :

# vgcreate -s 16M my_vol_grp /dev/hda1 /dev/hda2

You can check the result of your work at this stage by entering the command:

# vgdisplay

This command displays the total physical extends in a volume group, size of each extent, 
the allocated size and so on.

Step Three : This step involves the creation of one or more "logical volumes" using our volume group storage pool. 
The logical volumes are created from volume groups, and may have arbitary names. The size of the new volume 
may be requested in either extents (-l switch) or in KB, MB, GB or TB ( -L switch) rounding up to whole extents.

# lvcreate -l 50 -n my_logical_vol my_vol_grp

The above command allocates 50 extents of space in my_vol_grp to the newly created my_logical_vol. 
The -n switch specifies the name of the logical volume we are creating.

Now you can check if you got the desired results by using the command :

# lvdisplay

which shows the information of your newly created logical volume.

Once a logical volume is created, we can go ahead and put a filesystem on it, mount it, and start using 
the volume to store our files. For creating a filesystem, we do the following:

# mke2fs -j /dev/my_vol_grp/my_logical_vol

The -j signifies journaling support for the ext3 filesystem we are creating.
Mount the newly created file system :

# mount /dev/my_vol_grp/my_logical_vol /data
Also do not forget to append the corresponding line in the /etc/fstab file:

#File: /etc/fstab
/dev/my_vol_grp/my_logical_vol /data ext3 defaults 0 0
Now you can start using the newly created logical volume accessable at /data mount point.
Next : Resizing Logical Volumes


Some more on Linux LVM commands:


Linux vgcreate command:
=======================

Linux / Unix Command: vgcreate 
 
 Command Library  

NAME
vgcreate - create a volume group   
SYNOPSIS
vgcreate [-A|--autobackup {y|n}] [-d|--debug] [-h|--help] [-l|--maxlogicalvolumes MaxLogicalVolumes] 
[-p|--maxphysicalvolumes MaxPhysicalVolumes] [-s|--physicalextentsize PhysicalExtentSize[kKmMgGtT]] 
[-v|--verbose] [--version] VolumeGroupName PhysicalVolumePath [PhysicalVolumePath...]   

DESCRIPTION
vgcreate creates a new volume group called VolumeGroupName using the block special device 
PhysicalVolumePath previously configured for LVM with pvcreate(8).   

OPTIONS
-A, --autobackup {y|n} 
      Controls automatic backup of VG metadata after the change (see vgcfgbackup(8)). Default is yes. 
-d, --debug 
      Enables additional debugging output (if compiled with DEBUG). 
-h, --help 
      Print a usage message on standard output and exit successfully. 
-l, --maxlogicalvolumes MaxLogicalVolumes 
      Sets the maximum possible logical volume count. More logical volumes can't be created in this volume group. 
      Absolute maximum is 256. 
-p, --maxphysicalvolumes MaxPhysicalVolumes 
      Sets the maximum possible physical volume count. More physical volumes can't be included in this volume group. Absolute maximum is 256. 
-s, --physicalextentsize PhysicalExtentSize[kKmMgGtT] 
      Sets the physical extent size on physical volumes of this volume group. A size suffix 
      (k for kilobytes up to t for terabytes) is optional, megabytes is the default if no suffix is present. 
      Values can be from 8 KB to 16 GB in powers of 2. The default of 4 MB causes maximum LV sizes of ~256GB 
      because as many as ~64k extents are supported per LV. In case larger maximum LV sizes are needed (later), 
      you need to set the PE size to a larger value as well. Later changes of the PE size in an existing VG are 
      not supported. 
-v, --verbose 
      Display verbose runtime information about vgcreate's activities. 
--version 
      Display tool and IOP version and exit successfully. 
  
EXAMPLES
To create a volume group named test_vg using physical volumes /dev/hdk1, /dev/hdl1, and /dev/hdm1 
with default physical extent size of 4MB: 

# vgcreate test_vg /dev/sd[k-m]1

To create a volume group named test_vg using physical volumes /dev/hdk1, and /dev/hdl1 with default 
physical extent size of 4MB:

# vgcreate test_vg /dev/sdk1 /dev/sdl1

NOTE: If you are using devfs it is essential to use the full devfs name of the device rather than the 
symlinked name in /dev. so: the above could be 

# vgcreate test_vg /dev/scsi/host1/bus0/target[1-3]/lun0/part1


Linux vgextend command:
=======================


Linux / Unix Command: vgextend 
 
 Command Library  

NAME
vgextend - add physical volumes to a volume group   

SYNOPSIS
vgextend [-A|--autobackup{y|n}] [-d|--debug] [-h|--help] [-v|--verbose] VolumeGroupName 
         PhysicalVolumePath [PhysicalVolumePath...]   

DESCRIPTION
vgextend allows you to add one or more initialized physical volumes ( see pvcreate(8) ) to an existing 
volume group to extend it in size.   

OPTIONS
-A, --autobackup y/n 
Controls automatic backup of VG metadata after the change ( see vgcfgbackup(8) ). Default is yes. 
-d, --debug 
Enables additional debugging output (if compiled with DEBUG). 
-h, --help 
Print a usage message on standard output and exit successfully. 
-v, --verbose 
Gives verbose runtime information about lvextend's activities. 
  
Examples

# vgextend vg00 /dev/sda4 /dev/sdn1

tries to extend the existing volume group "vg00" by the new physical volumes (see pvcreate(8) ) 
"/dev/sdn1" and /dev/sda4".   


Linux pvcreate command:
=======================

Linux / Unix Command: pvcreate 
 
 Command Library  

NAME
pvcreate - initialize a disk or partition for use by LVM   

SYNOPSIS
pvcreate [-d|--debug] [-f[f]|--force [--force]] [-y|--yes] [-h|--help] [-v|--verbose] [-V|--version] 
         PhysicalVolume [PhysicalVolume...]   

DESCRIPTION
pvcreate initializes PhysicalVolume for later use by the Logical Volume Manager (LVM). Each PhysicalVolume 
can be a disk partition, whole disk, meta device, or loopback file. For DOS disk partitions, 
the partition id must be set to 0x8e using fdisk(8), cfdisk(8), or a equivalent. For whole disk devices 
only the partition table must be erased, which will effectively destroy all data on that disk. This can be done 
by zeroing the first sector with: 

# dd if=/dev/zero of=PhysicalVolume bs=512 count=1 

Continue with vgcreate(8) to create a new volume group on PhysicalVolume, or vgextend(8) to add PhysicalVolume 
to an existing volume group.   

OPTIONS
-d, --debug 
      Enables additional debugging output (if compiled with DEBUG). 
-f, --force 
      Force the creation without any confirmation. You can not recreate (reinitialize) a physical volume belonging 
      to an existing volume group. In an emergency you can override this behaviour with -ff. In no case case can you 
      initialize an active physical volume with this command. 
-s, --size 
      Overrides the size of the physical volume which is normally retrieved. Useful in rare case where this value 
      is wrong. More useful to fake large physical volumes of up to 2 Terabyes - 1 Kilobyte on smaller devices 
      for testing purposes only where no real access to data in created logical volumes is needed. If you wish 
      to create the supported maximum, use "pvcreate -s 2147483647k PhysicalVolume [PhysicalVolume ...]". 
      All other LVM tools will use this size with the exception of lvmdiskscan(8) 
-y, --yes 
      Answer yes to all questions. 
-h, --help 
      Print a usage message on standard output and exit successfully. 
-v, --verbose 
      Gives verbose runtime information about pvcreate's activities. 
-V, --version 
      Print the version number on standard output and exit successfully. 
  
Example

Initialize partition #4 on the third SCSI disk and the entire fifth SCSI disk for later use by LVM: 

# pvcreate /dev/sdc4 /dev/sde 


>>>> Installing a Cluster filesystem on Linux:
==============================================

Suppose, in this example, we have 2 Linux nodes, and we want to create a scsi attached shared disksystem.
We plan to use OCFS2 as the Clustered FileSystem.

First, we partition the disks to raw volumes.

This example uses /dev/sdb (an empty SCSI disk with no existing partitions) to create a single partition for the entire disk (36 GB). 
We will do this for all disks.


Ex:
# fdisk /dev/sdb
Device contains neither a valid DOS partition table, nor Sun, SGI or OSF disklabel
Building a new DOS disklabel. Changes will remain in memory only,
until you decide to write them. After that, of course, the previous
content won't be recoverable.


The number of cylinders for this disk is set to 4427.
There is nothing wrong with that, but this is larger than 1024,
and could in certain setups cause problems with:
1) software that runs at boot time (e.g., old versions of LILO)
2) booting and partitioning software from other OSs
 (e.g., DOS FDISK, OS/2 FDISK)

Command (m for help): p

Disk /dev/sdb: 255 heads, 63 sectors, 4427 cylinders
Units = cylinders of 16065 * 512 bytes

 Device Boot Start End Blocks Id System

Command (m for help): n
Command action
 e extended
 p primary partition (1-4)
p
Partition number (1-4): 1
First cylinder (1-4427, default 1):
Using default value 1
Last cylinder or +size or +sizeM or +sizeK (1-4427, default 4427):
Using default value 4427

Command (m for help): w
The partition table has been altered!

Calling ioctl() to re-read partition table.

WARNING: If you have created or modified any DOS 6.x
partitions, please see the fdisk manual page for additional
information.
Syncing disks.


Now verify the new partition: 
Ex:
# fdisk -l /dev/sdb

Disk /dev/sdb: 36.4 GB, 36420075008 bytes
255 heads, 63 sectors/track, 4427 cylinders
Units = cylinders of 16065 * 512 = 8225280 bytes

   Device Boot      Start         End      Blocks   Id  System
/dev/sdb1   *           1        4427    35559846   83  Linux

Repeat the above steps for each disk to be partitioned.   Disk partitioning should be done from one node only.  
When finished partitioning, run the 'partprobe' command as root on each of the remaining cluster nodes in order to assure 
that the new partitions are configured.

Ex:
# partprobe

  
Oracle Cluster File System (OCFS) Release 2
-------------------------------------------

OCFS2 is a general-purpose cluster file system that can be used to store Oracle Clusterware files, Oracle RAC database files,
 Oracle software, or any other types of files normally stored on a standard filesystem such as ext3.  
This is a significant change from OCFS Release 1, which only supported Oracle Clusterware files and Oracle RAC database files.   

Obtain OCFS2

OCFS2 is available free of charge from Oracle as a set of three RPMs:  a kernel module, support tools, and a console.  
There are different kernel module RPMs for each supported Linux kernel so be sure to get the OCFS2 kernel module for your Linux kernel.  
OCFS2 kernel modules may be downloaded from http://oss.oracle.com/projects/ocfs2/files/ and the tools and console may be downloaded from 
http://oss.oracle.com/projects/ocfs2-tools/files/.  

To determine the kernel-specific module that you need, use uname -r. 

# uname -r
2.6.9-22.ELsmp

For this example I downloaded:
ocfs2console-1.0.3-1.i386.rpm
ocfs2-tools-1.0.3-1.i386.rpm
ocfs2-2.6.9-22.ELsmp-1.0.7-1.i686.rpm 

>>> Install OCFS2 as root on each cluster node 

# rpm -ivh ocfs2console-1.0.3-1.i386.rpm \
ocfs2-tools-1.0.3-1.i386.rpm \
ocfs2-2.6.9-22.ELsmp-1.0.7-1.i686.rpm

Preparing...                ########################################### [100%]
   1:ocfs2-tools            ########################################### [ 33%]
   2:ocfs2console           ########################################### [ 67%]
   3:ocfs2-2.6.9-22.ELsmp   ########################################### [100%]
Configure OCFS2 

Run ocfs2console as root: 
# ocfs2console


Now a Graphical interface will appear:

Select Cluster ? Configure Nodes
Click on Add and enter the Name and IP Address of each node in the cluster

Once all of the nodes have been added, click on Cluster --> Propagate Configuration.  This will copy the OCFS2 configuration file 
to each node in the cluster.  You may be prompted for root passwords as ocfs2console uses ssh to propagate the configuration file.  
Leave the OCFS2 console by clicking on File --> Quit.  It is possible to format and mount the OCFS2 partitions using the ocfs2console GUI; however, 
this guide will use the command line utilities. 


>>> Enable OCFS2 to start at system boot: 

As root, execute the following command on each cluster node to allow the OCFS2 cluster stack to load at boot time:
/etc/init.d/o2cb enable
Ex:
# /etc/init.d/o2cb enable


Writing O2CB configuration: OK
Loading module "configfs": OK
Mounting configfs filesystem at /config: OK
Loading module "ocfs2_nodemanager": OK
Loading module "ocfs2_dlm": OK
Loading module "ocfs2_dlmfs": OK
Mounting ocfs2_dlmfs filesystem at /dlm: OK


 Starting cluster ocfs2: OK


>>> Create a mount point for the OCFS filesystem 

As root on each of the cluster nodes, create the mount point directory for the OCFS2 filesystem
Ex:
# mkdir /u03


>>> Create the OCFS2 filesystem on the unused disk partition:

The example below creates an OCFS2 filesystem on the unused /dev/sdc1 partition with a volume label of "/u03" (-L /u03), a block size of 4K (-b 4K) 
and a cluster size of 32K (-C 32K) with 4 node slots (-N 4).  See the OCFS2 Users Guide for more information on mkfs.ocfs2 command line options.

Ex:
# mkfs.ocfs2 -b 4K -C 32K -N 4 -L /u03 /dev/sdc1

mkfs.ocfs2 1.0.3
Filesystem label=/u03
Block size=4096 (bits=12)
Cluster size=32768 (bits=15)
Volume size=36413280256 (1111245 clusters) (8889960 blocks)
35 cluster groups (tail covers 14541 clusters, rest cover 32256 clusters)
Journal size=33554432
Initial number of node slots: 4
Creating bitmaps: done
Initializing superblock: done
Writing system files: done
Writing superblock: done
Writing lost+found: done
mkfs.ocfs2 successful


>>> Mount the OCFS2 filesystem:

Since this filesystem will contain the Oracle Clusterware files and Oracle RAC database files, we must ensure that all I/O 
to these files uses direct I/O (O_DIRECT).  Use the "datavolume" option whenever mounting the OCFS2 filesystem to enable direct I/O.  
Failure to do this can lead to data loss in the event of system failure.

Ex:
# mount -t ocfs2 -L /u03 -o datavolume /u03

Notice that the mount command uses the filesystem label (-L  u03) used during the creation of the filesystem. This is a handy way to refer 
to the filesystem without having to remember the device name. 

To verify that the OCFS2 filesystem is mounted, issue the mount command or run df: 

# mount -t ocfs2
/dev/sdc1 on /u03 type ocfs2 (rw,_netdev,datavolume)

# df /u03
Filesystem           1K-blocks      Used Available Use% Mounted on
/dev/sdc1             35559840    138432  35421408   1% /u03

The OCFS2 filesystem can now be mounted on the other cluster nodes. 

To automatically mount the OCFS2 filesystem at system boot, add a line similar to the one below to /etc/fstab on each cluster node: 
LABEL=/u03   /u03    ocfs2   _netdev,datavolume,nointr 0 0


Create the directories for shared files 
CRS files
mkdir /u03/oracrs
chown oracle:oinstall /u03/oracrs
chmod 775 /u03/oracrs

Database files
mkdir /u03/oradata
chown oracle:oinstall /u03/oradata
chmod 775 /u03/oradata


>>>>SWAP space:
===============

>>> Solaris:
============

-- View swap space:
-- ----------------

The /usr/sbib/swap utility provides a method of adding, deleting, and monitoring the system swap areas
used by the memory manager.

# swap -l

The -l option can be used to list swap space. The system displays information like:
swapfile           dev      swaplo    blocks    free
/dev/dsk/c0t0d0s3  136,3        16    302384    302384

path  : the pathname for the swaparea. In this example the pathname is swapfile.
dev   : the major/minor device number is in decimal if it's a block special device; zeroes otherwise
swaplo: the offset in 512 byte blocks where usable swapspace begins
blocks: size in 512 byte blocks. The swaplen value can be adjusted as a kernel parameter.
free  : free 512 byte blocks.
The swap -l command does not include physical memory in it's calculation of swap space.

# swap -s

The -s option can be used to list a summary of the system's virtual swap space.
total: 31760k bytes allocated + 5952k reserved = 37712k used, 202928k available

These numbers are in 1024 byte blocks.

-- Add swap area's:
-- ----------------

There are 2 methods available for adding more swap to your system.

(1) create a secondary swap partition:
(2) create a swapfile in an existing UFS file system

(1) Creating a secondary swap partition requires additional unused diskspace. You must use the format coommand
to create a new partition and filesystem on a disk.
Suppose we have the /data directory currently on slice 5 and is 200MB in size.
- free up the /data directory (save the contents to another location )
- unmount /dev/dsk/c0t0d0s5
- use format:
  Enter partition id tag (unassigned): swap
  Enter partition permission flags (wm): wu
  Enter new starting cil(3400): return
  Enter partition size: return
  Then label the disk as follows
  Partition> la
  Ready to label disk? y

- Run the newfs command on that partition to create a fresh filesystem on slice 5
  newfs /dev/rdsk/c0t0d0s5
- Make an entry to the /etc/vfstab file
- Run the swapadd script to add the swap to your system as follows:
  /sbin/swapadd
- verify that the swap has been added with swap -l


(2) The other method to add more swap space is to use the mkfile and swap commands
to designate a part of an existing UFS filesystem as a supplementary swap area.
You can use it as a temporary solution, or as a solution for longer duration as well,
but a swap file is just another file in the filesystem, so you cannot unmount that
filesystem while the swapfile is in use.
The following steps enable you to add more swap space without repartitioning a disk.
- As root, use df -k to locate a suitable filesystem. Suppose /data looks allright
  for this purpose
- Use the mkfile command to add a 50MB swapfile named swapfile in the /data partition.

  mkfile 50m /data/swapfile

- use ls -l /data to verify that the file has been created.
  Notice that the sticky bit has automatically been set.
- Activate the swaparea with the swap command as follows:

  /usr/sbin/swap -a /data/swapfile

- verify that the swap has been added with swap -l
  The system responds something like this:
  
swapfile           dev      swaplo    blocks    free
/dev/dsk/c0t0d0s3  136,3        16    302384    302384
/data/swapfile       -          16    102384    102384

If this will be a permanent swaparea, add an entry for the swapfile in the vfstab file.
/data/swapfile - - swap - no -

-- Removing a swapfile:
-- --------------------

As root use the swap -d command to remove a swaparea is follows

swap -d /dev/dsk/c0t0d0s5  for a swap partition
swap -d /data/swapfile     for a swapfile

Use the swap -l command to verify that the swaparea is gone.
Edit the /etc/vfstab file and delete the entry for the swapfile if neccessary.

In case of a swapfile, just remove the file with rm /data/swapfile

-- Creating a Temporary File System:
-- ---------------------------------

Create a directory which will serve as the mount point for the TMPFS file system.
There is no command such as newfs to create a TMPFS file system before mounting it.
The TMPFS file system actually gets created in RAM when you execute the mount command
and specify a filesystem type of TMPFS. The following example creates a new directory
/export/data and mounts a TMPFS filesystem, limiting it to 25MB.

mount -F tmpfs -o size=25m swap /export/data 


>>>> AIX:
=========

The installation creates a default paging logical volume, hd6, on drive hdisk0,
also referred as primary paging space.

The reports from the "vmstat" and "topas" commands indicate the amount of paging space I/O that is
taking place. 

Showing paging space:
---------------------

The lsps -a command provides a snapshot of the current utilization of each of the paging spaces
on the system, while the lsps -s command provides a summary of the total active paging space
and its current utilization.

# lsps -a
Page Space    Physical Volume    Volume Group     Size    %Used    Active    Auto  Type
paging00      hdisk1             rootvg           80MB    1        yes       yes   lv
hd6           hdisk1             rootvg          256MB    1        yes       yes   lv

The /etc/swapspaces file specifies the paging-space devices that are activated by the swapon -a command.
A pagingspace is added to this file when its created by the mkps -a command, and removed from
the file when rmps is used. 

You can also try:

# pstat -s

Managing Paging space:
----------------------

The following commands are used to manage paging space:

chps      : changes the attributes of a paging space
lsps      : displays the characteristics of a paging space
pstat -s  : displays the characteristics of a paging space
mkps      : creates an additional paging space
rmps      : removes an inactive paging space
swapon    : activates a paging space
swapoff   : deactivates one or more paging spaces


Managing Paging behaviour:
--------------------------

Note 1:
-------


There are several page space allocation policies available in AIXr.

- Deferred Page Space Allocation (DPSA) 
- Late Page Space Allocation (LPSA) 
- Early Page Space Allocation (EPSA) 
- Deferred page space allocation

The deferred page space allocation policy is the default policy in AIX. 

Late page space allocation LPSA
The AIX operating system provides a way to enable the late page space allocation policy, which means that the disk block 
for a paging space page is only allocated when the corresponding in-memory page is touched. 

Early page space allocation EPSA
If you want to ensure that a process will not be killed due to low paging conditions, this process can 
preallocate paging space by using the early page space allocation policy. 

Choosing between LPSA and DPSA with the vmo command:

Using the "vmo -o defps" command enables turning the deferred page space allocation, or DPSA, 
on or off in order to preserve the late page space allocation policy, or LPSA. 

Paging space and virtual memory
The vmstat command (avm column), ps command (SIZE, SZ), and other utilities report the amount 
of virtual memory actually accessed because with DPSA, the paging space might not get touched. 


Note 2:
-------

High paging space during online backup on AIX
  
 Technote (FAQ) 
  
Question 
During an online backup, you might see a high paging space usage, which will not be released 
even after online backup completion in DB2� Universal Database� (DB2 UDB) Version 8. 
This problem does not occur during an offline backup.  
  
Cause 
Paging space usage increases during online database backups on AIX� 5.2 and 5.3. 
This is an expected behavior from ML4 of AIX 5L� 5.2 and ML1 of AIX 5L 5.3 onwards.
During an online database backup operation, file pages are loaded into memory by AIX 
in order for the backup processes to read them. If DB2 UDB runs out of memory, 
AIX has to free memory to fit additional file pages into RAM. It does this by writing 
DB2 UDB shared memory segments out to paging space. When the backup completes, 
these pages in paging space are not released because they are still in use by the other 
DB2 UDB processes. They will only be freed when the database is deactivated.  
  
 
Answer 
To free up paging space without stopping the database, use the AIX tuning parameter lru_file_repage. 
It affects Virtual Memory Manager (VMM) page replacement. By setting this parameter to 0, 
you force the system to only free file pages when you run out of memory and to not write 
working pages out to paging space. This will stop paging use from increasing. 
To set this parameter to zero, use vmo command. For example:

vmo -o lru_file_repage=0

This parameter was introduced in ML4 of AIX5L 5.2 and ML1 of AIX5L 5.3. The default value is 1.  
 

Note 3:
-------

Warning: this is a trick. 

trick I have found to "reset" paging is to
increase page space by 1 PP and then decrease it by 1 PP. 
Decreasing the size of paging space causes the S to create a
new page space, copy everything to the new space, delete the old
recreate it at the new size. You'll need enough free disk space
to create a new page space.
 
Note 4:
-------


The VM kernel parameters minperm% and maxperm% affect the use of physical memory that can be used 
for file system caching and
govern when computational pages of memory get paged (swapped) to paging space. 
If these values have been changed recently, that could explain the results that you describe. 

When the (dynamic) value of numperm% drops below minperm%, it will cause the paging of computational pages to page space.

It would be interesting to know if the minperm% and maxperm% values were changed and, if so, what the former and current values are.


Note 5:
-------


Show paging space usage:

# lsps -a
# lsps -s

Increase paging space:

# chps -s 32 hd6   32x32MB

where we increased the size of hd6 with 30 LP's.

Reducing paging space:

# chps -d 1 hd6


where we decreased the size of hd6 with 1 LP.


mkps:
-----

To Add a Logical Volume for Additional Paging Space
mkps [ -a ] [ -n ] [ -t lv ] -s LogicalPartitions VolumeGroup [ PhysicalVolume ]

To create a paging space in volume group myvg that has four logical partitions and is activated immediately 
and at all subsequent system restarts, enter: 

# mkps  -a  -n  -s 4 myvg

To create a paging space in rootvg on hdisk0

# mkps -a -n -s 30 rootvg hdisk0

rmps:
-----

Before AIX 5L:
Active paging spaces cannot be removed. It must first be made inactive.
Use the chps command so the paging space is not used on the next restart.
After reboot, the paging space is inactive and can be removed with the rmps command.

AIX 51 or later:
Use the swapoff command to dynamically deactive the paging space, then use the rmps command.
# swapoff /dev/paging03
# rmps paging03

chps:
-----

As from AIX 5L you can use the chps -d command, to decrease the size of a paging space, 
without having to deactive it, then reboot, then remove, and then recreate it with a smaller size.
Decrease it with a number of LP's like:
# chps -d 2 paging03

chps -a {y|n} paging00 : specifies that the paging space paging00 is active (y) or inactive (n) at subsequent system restarts.
chps -s 10 paging02 : adds ten LPs to paging02 without rebooting.
chps -d 5 paging01 : removes five LPs from paging01 without rebooting.
chps -d 50 hd6 : removes fifty LPs from hd6 without rebooting.


List the active paging spaces:
------------------------------

# lsps -a     or lsps -s

# pg /etc/swapspaces
hd6:
         dev=/dev/hd6

paging00
         dev=/dev/paging00


Note on paging on AIX:
----------------------

If the amount of paging space is less than the amount of real memory in the system, it's possible the system 
will run out of paging space before real memory. This is because AIX performs early allocation of page space. 
When a page is referenced, real memory and paging space blocks are allocated. If there are less paging space blocks 
then real memory pages, paging space will be exhaused before all of real memory is consumed.

Early allocation algorithm
The second operating system's paging-space-slot-allocation method is intended for use in installations 
where this situation is likely, or where the cost of failure to complete is intolerably high. Aptly called early allocation, 
this algorithm causes the appropriate number of paging-space slots to be allocated at the time the 
virtual-memory address range is allocated, for example, with the malloc() subroutine. If there are not 
enough paging-space slots to support the malloc() subroutine, an error code is set. 
The early-allocation algorithm is invoked as follows:

# export PSALLOC=early
This example causes all future programs to be executed in the environment to use early allocation. 
The currently executing shell is not affected.

Early allocation is of interest to the performance analyst mainly because of its paging-space size implications. 
If early allocation is turned on for those programs, paging-space requirements can increase many times. 
Whereas the normal recommendation for paging-space size is at least twice the size of the system's real memory, 
the recommendation for systems that use PSALLOC=early is at least four times the real memory size. 
Actually, this is just a starting point. Analyze the virtual storage requirements of your workload and 
allocate paging spaces to accommodate them. As an example, at one time, the AIXwindows server required 250 MB of paging space 
when run with early allocation.

When using PSALLOC=early, the user should set a handler for the following SIGSEGV signal by pre-allocating and setting 
the memory as a stack using the sigaltstack function. Even though PSALLOC=early is specified, when there 
is not enough paging space and a program attempts to expand the stack, the program may receive the SIGSEGV signal.

Deferred allocation algorithm
The third operating system's paging-space-slot-allocation method is the default beginning with AIX 4.3.2 
Deferred Page Space Allocation (DPSA) policy delays allocation of paging space until it is necessary to page out the page, 
which results in no wasted paging space allocation. This method can save huge amounts of paging space, which means disk space.
Best to use Deffered.

On some systems, paging space might not ever be needed even if all the pages accessed have been touched. 
This situation is most common on systems with very large amount of RAM. However, this may result in overcommitment 
of paging space in cases where more virtual memory than available RAM is accessed.

To disable DPSA and preserve the Late Page Space Allocation policy, run the following command:

# vmo -o defps=0

To activate DPSA, run the following command:

# vmo -o defps=1

In general, system performance can be improved by DPSA, because the overhead of allocating page space after 
page faults is avoided the. Paging space devices need less disk space if DPSA is used


>>>> Linux:
===========

-- Check the swapspace:

# cat /proc/meminfo 
# cat /proc/swaps
# /sbin/swapon -s

-- Creating swap space using a partition

Create a partition of the proper size using fdisk.
Format the partition, for example

# mkswap -c /dev/hda4

Enable the swap, for example

# swapon /dev/hd4

If you want the swap space enabled after boot, include the appropriate entry into /etc/fstab, for example
/dev/hda4  swap swap defaults 0 0

If you need to disable the swap, you can do it with
# swapoff /dev/hda4


-- Creating swap space using a swapfile

Create a file with the size of your swapfile
# dd if=/dev/zero of=/swapfile bs=1024 count=8192

Setup the file with the command
# mkswap /swapfile 8192

Enable the swap with the command
# swapon /swapfile

When you are done using the swapfile, you can turn it off and remove with
# swapoff /swapfile
# rm /swapfile


>>>> Volume group, logical volumes, and filesystem commands in HPUX:
====================================================================


>>>> Filesystems in HPUX:
-------------------------

HFS : used at HP-UX < v. 10
VxFS: used at HP-UX >= v. 10

Ofcourse, CDFS (cdroms), and other filesystem types, are supported.

HP-UX's implementation of a journaled file system, also known as JFS, is based on the version from 
VERITAS Software Inc. called VxFS.

Up through the 10.0 release of HP-UX, HFS has been the only available locally mounted read/write file system. 
Beginning at 10.01, you also have the option of using VxFS. (Note, however, that VxFS cannot be used 
as the root file system.)

As compared to HFS, VxFS allows much shorter recovery times in the event of system failure. 
It is also particularly useful in environments that require high performance or deal with large 
volumes of data. This is because the unit of file storage, called an extent, can be multiple blocks, 
allowing considerably faster I/O than with HFS. It also provides for minimal downtime by allowing 
online backup and administration - that is, unmounting the file system will not be necessary for 
certain tasks. You may not want to configure VxFS, though, on a system with limited memory 
because VxFS memory requirements are considerably larger than that for HFS.

Basic VxFS functionality is included with the HP-UX operating system software. Additional enhancements 
to VxFS are available as a separately orderable product called HP "OnlineJFS", product number B5117AA (Series 700) 
and B3928AA (Series 800). 


>>>> How to create a filesystem in HP-UX: an outline:
-----------------------------------------------------


-- Task 1. Estimate the Size Required for the Logical Volume  
 
-- Task 2. Determine If Sufficient Disk Space Is Available for the Logical Volume within Its Volume Group  
 
Use the vgdisplay command to calculate this information. vgdisplay will output data on one or more volume groups, 
including the physical extent size (under PE Size (Mbytes)) and the number of available physical extents 
(under Free PE). By multiplying these two figures together, you will get the number of megabytes available 
within the volume group. See vgdisplay(1M) for more information.

-- Task 3. Add a Disk to a Volume Group If Necessary 
 
If there is not enough space within a volume group, you will need to add a disk to a volume group.
To add a disk to an existing volume group, use pvcreate(1M) and vgextend(1M). You can also add a disk 
by creating a new volume group with pvcreate(1M) and vgcreate(1M).

-- Task 4. Create the Logical Volume  
 
Use lvcreate to create a logical volume of a certain size in the above volume group. See lvcreate(1M) for details.
Use lvcreate as in the following example:

Create a logical volume of size 100 MB in volume group /dev/vg03:
# lvcreate -L 100 /dev/vg03

-- Task 5. Create the New File System  
 
Create a file system using the newfs command. Note the use of the character device file. For example:
 
# newfs -F hfs /dev/vg02/rlvol1 
 
If you do not use the -F FStype option, by default, newfs creates a file system based on the content 
of your /etc/fstab file. If there is no entry for the file system in /etc/fstab, then the file system type 
is determined from the file /etc/default/fs. For information on additional options, see newfs(1M).

$ cat /etc/default/fs
LOCAL=vxfs


For HFS, you can explicitly specify that newfs create a file system that allows short file names or long file names 
by using either the -S or -L option. By default, these names will as short or long as those allowed 
by the root file system. Short file names are 14 characters maximum. Long file names allow up to 255 characters. 
Generally, you use long file names to gain flexibility in naming files. Also, files created on other systems 
that use long file names can be moved to your system without being renamed.

When creating a VxFS file system, file names will automatically be long.

After creating a filesystem, you need to mount it to make it accesible, for example like:


-- Task 6. mount the new local file system:

Choose an empty directory to serve as the mount point for the file system. Use the mkdir command to 
create the directory if it does not currently exist. For example, enter:
 
# mkdir /test 
 
Mount the file system using the mount command. Use the block device file name that contains the file system. 
You will need to enter this name as an argument to the mount command.

For example, enter
 
# mount /dev/vg01/lvol1 /test 


Note: 
The newfs command is a "friendly" front-end to the mkfs command (see mkfs(1M)). The newfs command 
calculates the appropriate parameters and then builds the file system by invoking the mkfs command.


>>>> HP-UX LVM commands:
========================

-- vgdisplay:
-- ----------

Displays information about volume groups.

Examples:

# vgdisplay
# vgdisplay -v vgdatadir


-- pvdisplay:
-- ----------

Display information about physical volumes within LVM volume group. 

EXAMPLES

Display the status and characteristics of a physical volume: 
# pvdisplay /dev/dsk/c1t0d0 

Display the status, characteristics, and allocation map of a physical volume: 
# pvdisplay -v /dev/dsk/c2t0d0 

# pvdisplay /dev/dsk/c102t9d3

--- Physical volumes ---
PV Name                     /dev/dsk/c43t9d3
PV Name                     /dev/dsk/c102t9d3   Alternate Link
VG Name                     /dev/vgora_e1atlas_data
PV Status                   available
Allocatable                 yes
VGDA                        2
Cur LV                      2
PE Size (Mbytes)            4
Total PE                    1668
Free PE                     102
Allocated PE                1566
Stale PE                    0
IO Timeout (Seconds)        default
Autoswitch                  On


-- lvdisplay:
-- ----------

Displays information about logical volumes.

Examples:

# lvdisplay lvora_p0gencfg_apps
# lvdisplay -v lvora_p0gencfg_apps
# lvdisplay -v /dev/vg00/lvol2

# lvdisplay /dev/vgora_e0etea_data/lvora_e0etea_data
--- Logical volumes ---
LV Name                     /dev/vgora_e0etea_data/lvora_e0etea_data
VG Name                     /dev/vgora_e0etea_data
LV Permission               read/write
LV Status                   available/syncd
Mirror copies               1
Consistency Recovery        MWC
Schedule                    parallel
LV Size (Mbytes)            17020
Current LE                  4255
Allocated PE                8510
Stripes                     0
Stripe Size (Kbytes)        0
Bad block                   on
Allocation                  strict
IO Timeout (Seconds)        default


-- vgchange:
-- ---------

Set volume group availability. This command activates or deactivates one or more volume groups as specified
by the -a option, namely y or n.

Activate a volume group:
# vgchange -a y /dev/vg03

Deactivate a volume group:
# vgchange -a n /dev/vg03


-- vgcreate:
-- ---------


/usr/sbin/vgcreate [-f] [-A autobackup] [-x extensibility] [-e max_pe] [-l max_lv] [-p max_pv] 
                   [-s pe_size] [-g pvg_name] vg_name pv_path ...

The vgcreate command creates a new volume group. vg_name is a symbolic name for the volume group and must be used 
in all references to it. vg_name is the path to a directory entry under /dev that must contain a character 
special file named group. Except for the group entry, the vg_name directory should be empty. 
The vg_name directory and the group file have to be created by the user (see lvm(7)).

vgcreate leaves the volume group in an active state.


EXAMPLES

1. Create a volume group named /dev/vg00 containing two physical volumes
with extent size set to 2 Mbytes.  If directory /dev/vg00 exists with
the character special file group, the volume group is created:

# vgcreate -s 2 /dev/vg00 /dev/dsk/c1d0s2 /dev/dskc2d0s2

2. Create a volume group named /dev/vg01 that can contain a maximum of
three logical volumes, with extent size set to 8 Mbytes:

# vgcreate -l 3 -s 8 /dev/vg01 /dev/dsk/c4d0s2

3. Create a volume group named /dev/vg00 and a physical volume group
named PVG0 with two physical volumes:

# vgcreate -g PVG0 /dev/vg00 /dev/dsk/c1d0s2 /dev/dsk/c2d0s2

3. Create a volume group named /dev/vg00 containing two physical volumes with extent size 
set to 2 MB, from scratch. 

First, create the directory /dev/vg00 with the character special file called group. 

mkdir /dev/vg00 
mknod /dev/vg00/group c 64 0x030000 

The minor number for the group file should be unique among all the volume groups on the system. 
It has the format 0xNN0000, where NN runs from 00 to ff. The maximum value of NN is controlled by the kernel 
tunable parameter maxvgs.

Initialize the disks using pvcreate(1M). 

pvcreate /dev/rdsk/c1t0d0 
pvcreate /dev/rdsk/c1t2d0 

Create the volume group. 

vgcreate -s 2 /dev/vg00 /dev/dsk/c1t0d0 /dev/dsk/c1t2d0 


Note About the "dsk" and "rdsk" notation:
-----------------------------------------

Physical volumes are identified by their device file names, for example

/dev/dsk/cntndn

/dev/rdsk/cntndn

Note that each disk has a block device file and a character or raw device file, the latter identified by the r. 
Which name you use depends on what task you are doing with the disk. In the notation above, the first name 
represents the block device file while the second is the raw device file.

-- Use a physical volume's raw device file for these two tasks only:

-> When creating a physical volume. Here, you use the device file for the disk. For example, 
this might be /dev/rdsk/c3t2d0 if the disk were at card instance 3, target address 2, and device number 0. 
(The absence of a section number beginning with s indicates you are referring to the entire disk.)

-> When restoring your volume group configuration.

For all other tasks, use the block device file. For example, when you add a physical volume to a volume group, 
you use the disk's block device file for the disk, such as /dev/dsk/c5t3d0.


-- vgextend:
-- ---------

Extends a volume group by adding physical volumes to it.

Examples:

Add physical volumes /dev/dsk/c1d0s2 and /dev/dsk/c2d0s2 to volume group /dev/vg03:
# vgextend /dev/vg03 /dev/dsk/c1d0s2 /dev/dsk/c2d0s2

# vgextend vg01 /dev/dsk/c0t4d0


-- pvcreate:
-- ---------

Creates physical volume for use in a volume group.

Examples:

# pvcreate -f /dev/rdsk/c1d0s2

# ioscan -fnC disk
# pvcreate -f /dev/rdsk/c0t1d0


-- lvcreate:
-- ---------

Create logical volume in LVM volume group 

The lvcreate command creates a new logical volume within the volume group specified by vg_name. 
Up to 255 logical volumes can be created in one volume group

SYNOPSIS
      /etc/lvcreate [-d schedule] {-l logical_extents_number | -L
      logical_volume_size} [-m mirror_copies] [-n lv_path] [-p permission]
      [-r relocate] [-s strict] [-C contiguous] [-M mirror_write_cache] [-c
      vol_group_name


Examples:

Create a logical volume in volume group /dev/vg02: 

# lvcreate /dev/vg02 

Create a logical volume in volume group /dev/vg03 with nonstrict allocation policy: 

# lvcreate -s n /dev/vg03 

Create a logical volume of size 100 MB in volume group /dev/vg03: 

# lvcreate -L 100 /dev/vg03 

Create a logical volume of size 90 MB striped across 3 disks with a stripe size of 64 KB: 

# lvcreate -L 90 -i 3 -I 64 /dev/vg03 


-- fstyp:
-- ------

Determines file system type.

SYNOPSIS
/usr/sbin/fstyp [-v] special

The fstyp command allows the user to determine the file system type of a mounted or unmounted file system. 
special represents a device special file (for example: /dev/dsk/c1t6d0).

The file system type is determined by reading the superblock of the supplied special file. If the superblock 
is read successfully, the command prints the file system type identifier on the standard output and exits 
with an exit status of 0. If the type of the file system cannot be identified, the error message 
unknown_fstyp (no matches) is printed and the exit status is 1. Exit status 2 is not currently returned, 
but is reserved for the situation where the file system matches more than one file system type. 
Any other error will cause exit status 3 to be returned.

The file system type is determined by reading the superblock of the supplied special file.

Examples:

Find the type of the file system on a disk, /dev/dsk/c1t6d0: 

# fstyp /dev/dsk/c1t6d0 

Find the type of the file system on a logical volume, /dev/vg00/lvol6: 

# fstyp /dev/vg00/lvol6 

Find the file system type for a particular device file and also information about its super block: 

# fstyp -v /dev/dsk/c1t6d0 


-- mkboot:
-- -------

mkboot is used to install or update boot programs on the specified device file.

The position on device at which boot programs are installed depends on the disk layout of the device. 
mkboot examines device to discover the current layout and uses this as the default. If the disk is uninitialized, 
the default is LVM layout on PA-RISC and Whole Disk on Itanium(R)-based systems. 
The default can be overridden by the -l, -H, or -W options.

Boot programs are stored in the boot area in Logical Interchange Format (LIF), which is similar to a file system. 
For a device to be bootable, the LIF volume on that device must contain at least the ISL 
(the initial system loader) and HPUX (the HP-UX bootstrap utility) LIF files. If, in addition, the device 
is an LVM physical volume, the LABEL file must be present (see lvlnboot(1M) ).

For the VERITAS Volume Manager (VxVM) layout on the Itanium-based system architecture, the only relevant 
LIF file is the LABEL file. All other LIF files are ignored. VxVM uses the LABEL file when the system boots 
to determine the location of the root, stand, swap, and dump volumes.

EXAMPLES

Install default boot programs on the specified disk, treating it as an LVM disk: 

# mkboot -l /dev/dsk/c0t5d0 

Use the existing layout, and install only SYSLIB and ODE files and preserve the EST file on the disk: 

# mkboot -i SYSLIB -i ODE -p EST /dev/rdsk/c0t5d0 

Install only the SYSLIB file and retain the ODE file on the disk. Use the Whole Disk layout. Use the file 
/tmp/bootlf to get the boot programs rather than the default. (The -i ODE option will be ignored): 

# mkboot -b /tmp/bootlf -i SYSLIB -i ODE -p ODE -W /dev/rdsk/c0t5d0 

Install EFI utilities to the EFI partition on an Itanium-based system, treating it as an LVM or VxVM disk: 

# mkboot -e -l /dev/dsk/c3t1d0 

Create AUTO file with the string autofile command on a device. If the device is on an Itanium-based system, 
the file is created as /EFI/HPUX/AUTO in the EFI partition. If the device is on a PA-RISC system, the file 
is created as a LIF file in the boot area. 

# mkboot -a "autofile command" /dev/dsk/c2t0d0 


-- bdf:
-- ----

Report number of free disk blocks.

bdf prints out the amount of free disk space available on the specified filesystem (/dev/dsk/c0d0s0, for example) 
or on the file system in which the specified file ($HOME, for example) is contained.
If no file system is specified, the free space on all of the normally mounted file systems is printed.  
The reported numbers are in kilobytes.
 
Examples:

# bdf

oranh300:/home/se1223>bdf | more
Filesystem          kbytes    used   avail %used Mounted on
/dev/vg00/lvol3     434176  165632  266504   38% /
/dev/vg00/lvol1     298928   52272  216760   19% /stand
/dev/vg00/lvol8    2097152 1584488  508928   76% /var
/dev/vg00/lvol11    524288    2440  490421    0% /var/tmp
/dev/vg00/lvucmd     81920    1208   75671    2% /var/opt/universal
/dev/vg00/lvol9    1048576  791925  240664   77% /var/adm
/dev/vg00/lvol10   2064384   47386 1890941    2% /var/adm/crash
/dev/vg00/lvol7    1548288 1262792  283320   82% /usr
/dev/vg00/vsaunixlv
                    311296  185096  118339   61% /usr/local/vsaunix
/dev/vg00/lvol4    1867776    5264 1849784    0% /tmp
/dev/vg00/lvol6    1187840  757456  427064   64% /opt
/dev/vg00/lvol5     262144   34784  225632   13% /home
/dev/vg00/lvbeheer  131072   79046   48833   62% /beheer
/dev/vg00/lvbeheertmp
                    655360   65296  553190   11% /beheer/tmp
/dev/vg00/lvbeheerlog
                    524288   99374  398407   20% /beheer/log
/dev/vg00/lvbeheerhistlog
..
..


# bdf /tmp
Filesystem          kbytes    used   avail %used Mounted on
/dev/vg00/lvol4    1867776    5264 1849784    0% /tmp


-- lvextend:
-- ---------

Increase number of physical extents allocated to a logical volume.

/etc/lvextend {-l logical_extents_number | -L logical_volume_size | -m
              mirror_copies} lv_path [physical_volume_path ...  |
              physical_vol_group_name...]

lvextend increases the number of mirrored copies or the size of the lv_path parameter.  
The change is determined according to which command options are specified.

WARNINGS
      The -m option cannot be used on HP-IB devices.

EXAMPLES
- Increase the number of the logical extents of a logical volume to one hundred:

# lvextend -l 100 /dev/vg01/lvol3

- Increase the logical volume size to 400 Mbytes:

# lvextend -L 400 /dev/vg01/lvol4

Allocate two mirrors (that is, three copies) for each logical extent of a logical volume:

# lvextend -m 2 /dev/vg01/lvol5


-- extendfs:
-- ---------

Extend file system size.

/etc/extendfs [-q] [-v] [-s size] special

If the original hfs filesystem image created on special does not make use of all of the available space, 
extendfs can be used to increase the capacity of an hfs filesystem by updating the filesystem structure
to include the extra space.
The command-line parameter special specifies the character device special file of either a logical volume 
or a disk partition. If special refers to a mounted filesystem, special must be un-mounted
before extendfs can be run (see mount(1M)).

The root filesystem cannot be extended using the extendfs command
because the root filesystem is always mounted, and extendfs only works
on unmounted filesystems.


EXAMPLES
To increase the capacity of a filesystem created on a logical volume, enter:

# umount /dev/vg00/lvol1

# lvextend -L larger_size /dev/vg00/lvol1

# extendfs /dev/vg00/rlvol1


-- fsadm:
-- ------

 
EXAMPLES
Convert a HFS file system from a nolargefiles file system to a largefiles file system: 

# fsadm -F hfs -o largefiles /dev/vg02/lvol1 

Display HFS relevant file system statistics: 

# fsadm -F hfs /dev/vg02/lvol1 


-- diskinfo:
-- ---------

diskinfo - describe characteristics of a disk device

SYNOPSIS
     /etc/diskinfo [-b|-v] character_devicefile

DESCRIPTION
      diskinfo determines whether the character special file named by
      character_devicefile is associated with a SCSI, CS/80, or Subset/80
      disk drive; if so, diskinfo summarizes the disk's characteristics.

Example:

# diskinfo /dev/rdsk/c31t1d3
SCSI describe of /dev/rdsk/c31t1d3:
             vendor: IBM
         product id: 2105800
               type: direct access
               size: 13671904 Kbytes
   bytes per sector: 512


Notes and further examples on HPUX:
===================================


Examples: More on how to create a filesystem on HP-UX:
------------------------------------------------------


Example 1: 
----------

Here we repeat the essentials of section 35.2:

Task 1. Estimate the Size Required for the Logical Volume  
Task 2. Determine If Sufficient Disk Space Is Available for the Logical Volume within Its Volume Group  
Task 3. Add a Disk to a Volume Group If Necessary 
 
Task 4. Create the Logical Volume  
 
Use lvcreate to create a logical volume of a certain size in the above volume group. See lvcreate(1M) for details.
Use lvcreate as in the following example:

Create a logical volume of size 100 MB in volume group /dev/vg03:

# lvcreate -L 100 /dev/vg03

-- Task 5. Create the New File System  
 
Create a file system using the newfs command. Note the use of the character device file. For example:
 
# newfs -F hfs /dev/vg02/rlvol1 
 
-- Task 6. mount the new local file system:

Choose an empty directory to serve as the mount point for the file system. Use the mkdir command to 
create the directory if it does not currently exist. For example, enter:
 
# mkdir /test 
 
Mount the file system using the mount command. Use the block device file name that contains the file system. 
You will need to enter this name as an argument to the mount command.

For example, enter
 
# mount /dev/vg01/lvol1 /test 


Example 2:
----------

This is an example of creating volume group vg01 & logical 
volume/partion data. 

Prepare for logical volume creation: 

root:/> mkdir /dev/vg01 
root:/> mknod /dev/vg01/group c 64 0x010000 
root:/> pvcreate -f /dev/rdsk/c0t5d0 
Physical volume "/dev/rdsk/c0t5d0" has been successfully created. 

root:/> vgcreate vg01 /dev/dsk/c0t5d0 
Volume group "/dev/vg01" has been successfully created. 
Volume Group configuration for /dev/vg01 has been saved in 
/etc/lvmconf/vg01.conf 

root:/> vgdisplay -v vg01 
root:/> lvcreate -L 100 -n data vg01 
Logical volume "/dev/vg01/data" has been successfully created with 
character device "/dev/vg01/rdata". 

Create HFS file system 

root:/> newfs -F hfs /dev/vg01/rdata 

Create Journal or Veritas file system 

root:/> newfs -F vxfs /dev/vg02/rdata 


Example 3:
----------

To create a VxFS file system 12288 sectors in size on VxVM volume, enter: 

# mkfs -F vxfs /dev/vx/rdsk/diskgroup/volume 12288

To use mkfs to create a VxFS file system on /dev/rdsk/c0t6d0: 

# mkfs -F vxfs /dev/rdsk/c0t6d0 1024 

To use mkfs to determine the command that was used to create the VxFS file system on /dev/rdsk/c0t6d0: 

# mkfs -F vxfs -m /dev/rdsk/c0t6d0 

To create a VxFS file system on /dev/vgqa/lvol1, with a Version 4 disk layout and largefiles capability: 

# mkfs -F vxfs -o version=4,largefiles /dev/vgqa/lvol1 


http://www.docs.hp.com/en/B2355-90672/index.html


Example 4:
----------

Example: Creating a Logical Volume Using HP-UX Commands

To create a logical volume:

Select one or more disks. ioscan(1M) shows the disks attached to the system and their device file names.
Initialize each disk as an LVM disk by using the pvcreate command. For example, enter
 
# pvcreate /dev/rdsk/c0t0d0 
 
Note that using pvcreate will result in the loss of any existing data currently on the physical volume.
You use the character device file for the disk.
Once a disk is initialized, it is called a physical volume.

- Pool the physical volumes into a volume group. To complete this step:

Create a directory for the volume group. For example:
 
# mkdir /dev/vgnn 
 
Create a device file named group in the above directory with the mknod command.
 
# mknod /dev/vgnn/group c 64 0xNN0000 
 
The c following the device file name specifies that group is a character device file.
The 64 is the major number for the group device file; it will always be 64.
The 0xNN0000 is the minor number for the group file in hexadecimal. Note that each particular NN must be a 
unique number across all volume groups.

For more information on mknod, see mknod(1M); for more information on major numbers and minor numbers, 
see Configuring HP-UX for Peripherals.

Create the volume group specifying each physical volume to be included using vgcreate. For example:
 
# vgcreate /dev/vgnn /dev/dsk/c0t0d0 
 
Use the block device file to include each disk in your volume group. You can assign all the physical volumes 
to the volume group with one command. No physical volume can already be part of an existing volume group.

Once you have created a volume group, you can now create a logical volume using lvcreate. For example:

# lvcreate /dev/vgnn 
 
Using the above command creates the logical volume /dev/vgnn/lvoln with LVM automatically assigning 
the n in lvoln.

When LVM creates the logical volume, it creates the block and character device files and places them in the directory 
/dev/vgnn.


VxFS can, theoretically, support files up to two terabytes in size because file system structures 
are no longer in fixed locations (see Chapter 2 "Disk Layout"). The maximum size tested and supported 
on HP-UX 11.x systems is one terabyte. Large files are files larger than two gigabytes in size.

 NOTE: Be careful when enabling large file capability. Applications and utilities such as backup may experience 
 problems if they are not aware of large files. 
 
 
Creating a File System with Large Files 

You can create a file system with large file capability by entering the following command:

# mkfs -F vxfs -o largefiles special_device size 
 
Specifying largefiles sets the largefiles flag, which allows the file system to hold files 
up to one terabyte in size. Conversely, the default nolargefiles option clears the flag and limits 
files being created to a size of two gigabytes or less:

# mkfs -F vxfs -o nolargefiles special_device size 


Notes:
------

Note 1: Create a System Mirror Disk:
------------------------------------

This note describes how to configure LVM mirroring of a system disk. In this example the HP server is STSRV1,
the primary boot device is SCSI=6 (/dev/dsk/c2t6d0) and the alternative mirrored bootdevice is 
SCSI=5 (/dev/dsk/c2t5d0). The following commands will do the trick:

# ioscan -fnC disk
# pvcreate -Bf /dev/rdsk/c2t5d0
# mkboot -l /dev/rdsk/c2t5d0
# mkboot -a "hpux -lq (;0)/stand/vmunix" /dev/rdsk/c2t5d0
# vgextend /dev/vg00 /dev/dsk/c2t5d0

# for P in 1 2 3 4 5 6 7 8 9 10
> do
> lvextend -m 1 /dev/vg00/lvol$P /dev/dsk/c2t5d0
> sleep 1
> done


Note 2: Create a System Mirror Disk:
------------------------------------

# ioscan -fnC disk 
Class I H/W Path Driver S/W State H/W Type Description 
===================================================================== 
disk 0 0/0/1/1.2.0 sdisk CLAIMED DEVICE HP 73.4GMAN3735MC 
                         /dev/dsk/c1t2d0 /dev/rdsk/c1t2d0 
disk 1 0/0/2/0.2.0 sdisk CLAIMED DEVICE HP 73.4GATLAS10K3_73_SCA 
                         /dev/dsk/c2t2d0 /dev/rdsk/c2t2d0 
  
Note: c1t2d0 is the boot disk and c2t2d0 is the mirrored disk. 
       
1) Initialize the disk and make it bootable 
        pvcreate -B /dev/rdsk/c2t2d0 
            Note: the -B parameter tells pvcreate that this will be a bootable disk. 
       
2) Add the physical volume to the volume group 
            vgextend /dev/vg00 /dev/dsk/c2t2d0 
       
3) Use mkboot to place the boot utilities in the boot area and add the AUTO file. 
            mkboot /dev/dsk/c2t2d0 
            mkboot -a "hpux -lq" /dev/rdsk/c2t2d0 
       
4) Use mkboot to update the AUTO file on the primary boot disk. 
            mkboot -a "hpux -lq" /dev/rdsk/c1t2d0 
       
5) Mirror the stand, root and swap logical volumes 
            lvextend -m 1 /dev/vg00/lvol1 
            lvextend -m 1 /dev/vg00/lvol2 
            lvextend -m 1 /dev/vg00/lvol3 


Note: LVM will resynchronize the new mirror copies. 


Repeat the lvextend for all other logical volumes on the boot mirror. 
            lvextend -m 1 /dev/vg00/lvol4 
            lvextend -m 1 /dev/vg00/lvol5 
            lvextend -m 1 /dev/vg00/lvol6 
            lvextend -m 1 /dev/vg00/lvol7 
            lvextend -m 1 /dev/vg00/lvol8 


6) Modify your alternate boot path to point to the mirror copy of the boot disk. 
Note: Use the Hardware path for your new boot disk. 
            setboot -a 0/0/2/0.2.0 


Note 3: Increase a filesystem in HP-UX:
---------------------------------------

Example 1:
----------

In this example, you would need to increase the file system size of /var by 10 MB, which actually needs 
to be rounded up to 12 MB.

Increase /var
Follow these steps to increase the size limit of /var.

- Determine if any space is available for the /dev/vg00:

# /sbin/vgdisplay /dev/vg00 

 
The Free PE indicates the number of 4 MB extents available, in this case 79 (equivalent to 316 MB).

- Change to single user state:

/sbin/shutdown

This allows /var to be unmounted.

- View mounted volumes:

# /sbin/mount

You see a display similar to the following:

/ on /dev/vg00/lvol1 defaults on Sat Mar 8 23:19:19 1997
/var on /dev/vg00/lvol7 defaults on Sat Mar 8 23:19:28 1997 


# Determine which logical volume maps to /var. In this example, it is /dev/vg00/lvol7

- Unmount /var:

# /sbin/umount /var

This is required for the next step, because extendfs can only work on unmounted volumes. If you get a 
"device busy" error at this point, reboot the system and log on in single-user mode before continuing.

- Extend the size of the logical volume:

# /sbin/lvextend -L new_size_in_MB /dev/vg00/lvol7

For example, to make this volume 332 MB:

# /sbin/lvextend -L 332 /dev/vg00/lvol7

To extend the file system size to the logical volume size:

# /sbin/extendfs /dev/vg00/rlvol7

Mount /var:

# /sbin/mount /var

Go back to the regular init state: init 3 or init 4, or reboot.


Example 2:
----------

To increase the capacity of a file system created on a logical volume, enter:

# umount /dev/vg00/lvol1
# lvextend -L larger_size /dev/vg00/lvol1
# extendfs -F hfs /dev/vg00/rlvol1          -- For operation like mkfs or extendfs, you should use raw device interface. 
# mount /dev/vg00/lvol1 mount_directory


Example 3:
----------

> 
> Date: 12/14/99 
> Document description: Extending /var, /usr, /tmp without Online JFS 
> Document id: KBRC00000204 
> 
> 
> You may provide feedback on this document 
> 
> 
> Extending /var, /usr, /tmp without Online JFS DocId: KBRC00000204 Updated: 
> 12/14/99 1:14:29 PM 
> 
> PROBLEM 
> Since /var, /usr, /tmp (and sometimes /opt) are always in use by the 
> operating system, they cannot be unmounted with the umount command. In order 
> to extend these filesystems, the system must be in single user mode. 
> 
> RESOLUTION 
> This example will show how to extend /usr to 400MB without Online JFS 
> 
> 
> 1.. Backup the filesystem before extending 
> 
> 
> 2.. Display disk information on the logical volume 
> 
> lvdisplay -v /dev/vg00/lvol4 | more 
> 
> 
> a.. Make sure this is enough Free PE's to increase this filesystem. 
> b.. Make sure that allocation is NOT strict/contiguous. 
> 
> 
> 3.. Reboot the machine 
> 
> shutdown -r now 
> 
> 
> 4.. When prompted, press "ESC" to interrupt the boot. 
> 
> 
> 5.. Boot from the primary device and invoke ISL interaction. 
> 
> bo pri isl 
> 
> NOTE: If prompted to interact with ISL, respond "y" 
> 
> 
> 6.. Boot into single user mode 
> 
> hpux -is 
> 
> NOTE:Nothing will be mounted. 
> 
> 
> 7.. Extend the logical volume that holds the filesystem. 
> 
> /sbin/lvextend -L 400 /dev/vg00/lvol4 
> 
> 
> 8.. Extend the file system. 
> 
> /sbin/extendfs -F hfs /dev/vg00/rlvol4 
> 
> NOTE: The use of the character device. 
> 
> 
> 9.. Ensure the filesystem now reports to be the new size 
> 
> bdf 
> 
> 
> 10.. Reboot the system to its normal running state. 
> 
> shutdown -r now 
> 
> 
> 
The only thing is that you have to have contiguous lvols to do that. The 
best way is to do an Ignite make_tape_recovery -i for vg00 and then 
resize it when you recreate it. If you have vg00 on a seperate disk then 
it is real easy, the backup can run in the background, and the restore 
interactive will take about 2.5 hours for a 9GB root disk, you can make 
the lvols any size you want and it also puts it back in place in order 
so you save space. 


Example 4:
----------

The right way to extend a file system with "OnLine jfs" is using the command "fsadm".
For example, if you want to extend the fs /mk2/toto in the
/dev/vgmk2/lvtoto in from 50Mbytes to 60 you must extend de logical volume

# lvextend -L 60 /dev/vgmk2/lvtoto

Now use fsadm ( I supose you have vxfs, if you are using hfs is not
possible to increase on-line, or at least I don't know how ).

# fsadm -F vxfs -b 61440 /mk2/toto

You will have your fs increased on line ... be carefull if your fs is 100% occupied the comand fsadm will fail, you
need some free space on the file system ( it depends on the fs type, size etc ..).

In general, Online jfs should be increased in the following way:

lvextend -L ???? /dev/vg??/lvol??

fsadm -F vxfs -b ????? /<filesystem name>

oranh300:/home/se1223>cat /etc/inittab | grep enab
vxen::bootwait:/sbin/fs/vxfs/vxenablef -a


Note 4:
-------

Extend OnlineJFS licenses on next D&ST servers:
aavnh400
oranh503
oranh603
orazh500
orazh601
orazh602

commands are:
swagentd -r
swinstall -x mount_all_filesystems=false -x enforce_dependencies=true -s hpdepot.ao.nl.abnamro.com:/beheer/depot/OnlineJFS_License OnlineJFS
swagentd -k


HP-UX errors: Error 23 filetable overflow:
------------------------------------------

Error: 23 is a infamous error, as shown in this thread:

thread:

Doc ID: Note:1018306.102 
Problem Description:
====================
You are backing up your database and are getting the following errors:

HP-UX Error 23: file table overflow

RMAN-569 file not found
LEM-00031 file not found
LEM-00033 lempgfm couldn't open message file
RMAN indicates that Recovery Manager is complete, however the database
and the catalog are not resync'd.
Problem Explanation:
====================
Recovery Manager cannot find or open the message file.
Search Words:
=============
Recovery Manager, LEM-33, LEM-31, RMAN-00569, message file, lempgfm,
error 23, HPUX error 23, HP-UX error 23
Solution Description:
=====================
You may need to increase the value of the unix kernel parameter 'nfile'.
Solution Explanation:
=====================
'nfile' needs to have a value in the thousands for a database server. 
If this parameter is < 1000, increase it to something like 5000 or 
greater. If there is enough memory on your system, this parameter can
be set to values > 30000.


Some HP-UX troubleshooting tips:
--------------------------------


Where to get information about problems:

dmesg  --> provides a finite list of diagnostic messages 
/var/adm/syslog/syslog.log  -->  system log 
/opt/resmon/log/error.log  --> 
/etc/shutdownlog  --> shutdown information 
/etc/rc.log  -->  system startup log 
/var/tombstones/ts99  --> crash analysis file 
cstm  -  command line support tool manager

mstm  -  menu based support tool manager 
<alt><underlined letter of command> 
<tab>  -->  to move to another portion of the screen, such as the drop down menu area 
Service Processor 
<ctrl> <b> from a serial console 
he  - help 
co  - return to console mode (exits the program) 
sl  - show log


Panic Reboots

Check these files for clues:

/var/tombstones/ts99 
/etc/shutdownlog 

Bad disk

1.  Check the syslog (/var/adm/syslog/syslog.log) looking for disk errors.
2.  Check the ioscan (ioscan -fnC disk), looking for NO_HW rather than Claimed.
3.  If diaglogd is running then check STM logs (/var/opt/resmon/log/event.log)
4.  Check the volume group to see if the disk is listed and whether there is any problem  with it's status (vgdisplay -v | more)
5.  Check lvmtab to see if the disk is supposed to be in a volume group (strings /etc/lvmtab | more)

Filesystem do not mount after a reboot

1.  Reactivate the Volume Group  -->  vgchange -a y /dev/<volume group>
2.  Remount the filesystems  -->  mount -a
3.  If still no success then perform a filesystem check  -->  fsck /dev/<volumegroup>/<logicalvolume>
4.  Remount the filesystems  -->  mount -a
5.  Check to see if all the filesystems are there:
          a)  bdf
          b)  compare with /etc/fstab

Filesystem full

du -kx / | sort -rn | more 
du -akx | sort -nr | more 

Shows directories on the local filesystem and how much space they are taking up

NFS mount - Permission Denied

1.  Check to see if the format of the /etc/exports file is correct on the server that is the nfs server.

2.  exportfs -av to export the filesystem

3.  Check the /etc/fstab file on the client to make sure that it is correct

4.  /usr/sbin/showmount -e <server>  on the client to show what is being exported

5.  To bypass the /etc/exports file execute the following on the nfs server:     exportfs -i -o rw <filesystem>. 


NFS Server

/etc/rc.config.d/nfsconf  --> NFS_SERVER=1

Verify the proper processes are running:

/sbin/init.d/nfs.server stop

The processes should NOT be running:

# ps -ef|grep nfsd
# ps -ef|grep rpc.mountd
# ps -ef|grep rpc.lockd
# ps -ef|grep rpc.statd

/sbin/init.d/nfs.server start

These processes should be running:

# ps -ef|grep nfsd
    root  3444     1  0 10:39:12 ?         0:00 /usr/sbin/nfsd 4
    root  3451  3444  0 10:39:12 ?         0:00 /usr/sbin/nfsd 4
    root  3449  3444  0 10:39:12 ?         0:00 /usr/sbin/nfsd 4
    root  3445  3444  0 10:39:12 ?         0:00 /usr/sbin/nfsd 4
# ps -ef|grep rpc.mountd
    root  3485     1  0 10:42:09 ?         0:00 rpc.mountd
# ps -ef|grep rpc.lockd
    root  3459     1  0 10:39:12 ?         0:00 /usr/sbin/rpc.lockd
# ps -ef|grep rpc.statd
    root  3453     1  0 10:39:12 ?         0:00 /usr/sbin/rpc.statd


To start a process if it is not running:

# ps -ef|grep rpc.mountd
# rpc.mountd   or  /usr/sbin/rpc.mountd
# ps -ef|grep rpc.mountd
    root  3485     1  0 10:42:09 ?         0:00 rpc.mountd


/etc/inetd.conf needs to have the proper services active (not commented out)

##
# WARNING: The rpc.mountd should now be started from a startup script.
#          Please enable the mountd startup script to start rpc.mountd.
##
#rpc  stream tcp  nowait  root  /usr/sbin/rpc.rexd     100017  1    rpc.rexd
rpc  dgram  udp  wait    root  /usr/lib/netsvc/rstat/rpc.rstatd   100001  2-4  rpc.rstatd
rpc  dgram  udp  wait    root  /usr/lib/netsvc/rusers/rpc.rusersd  100002  1-2  rpc.rusersd
rpc  dgram  udp  wait    root  /usr/etc/rpc.mountd  100005  1  rpc.mountd -e
rpc  dgram  udp  wait    root  /usr/lib/netsvc/rwall/rpc.rwalld   100008  1    rpc.rwalld
#rpc  dgram  udp  wait    root  /usr/sbin/rpc.rquotad  100011  1    rpc.rquotad
rpc  dgram  udp  wait    root  /usr/lib/netsvc/spray/rpc.sprayd   100012  1    rpc.sprayd


NIC problems:

The lanadmin utility provides NIC statistics
The nettladmin utility provides packet trace information


Replacing a Mirrored Root Disk:

Replace the disk 
Hot swap can be performed while system is up 
Not hot swappable means the system must be brought down 
Reboot the system into single user mode 
shutdown -r 0, unless the system is powered off already, then power it back on

interrupt the boot 
bo pri (or bo alt if the disk that was replaced was the primary boot disk) 
IPL>hpux -is -lq (;0)/stand/vmunix 
vgcfgrestore -n /dev/vg00 /dev/rdsk/c?t?d? 
vgsync /dev/vg00 
mkboot /dev/rdsk/c?t?d? 
mkboot -a "hpux -lq (;0)/stand/vmunix" /dev/rdsk/c?t?d? 
shutdown -r 0 
lvlnboot -v /dev/vg00  to verify that the disk is seen as bootable


Software Installation (swinstall, sd, etc)
    ERROR:   "server::/tmp/omni_tmp/packet":  You do not have the
         required permissions to perform this operation.  Check
         permissions using the "swacl" command or see your system
         administrator for assistance.  Or, to manage applications
         designed and packaged for nonprivileged mode, see the
         "run_as_superuser" option in the "sd" man page.
WARNING: More information may be found in the daemon logfile on this
         target (default location is
         server:/var/adm/sw/swagentd.log).

Bounce swagentd daemon:  /usr/sbin/swagentd -r


##############################################################

SECTION 16: Kernel Parameters:

##############################################################


=========================
>>>> Simplified Overview:
=========================


Simplified overview Kernel parameters Solaris, AIX, Linux:
==========================================================

Solaris:
--------

The "/etc/system" file:

Available for Solaris Operating Environment, the /etc/system file contains definitions for kernel configuration limits 
such as the maximum number of users allowed on the system at a time, the maximum number of processes per user, 
and the inter-process communication (IPC) limits on size and number of resources. These limits are important because 
they affect, for example, DB2, Oracle performance on a Solaris Operating Environment machine. 

Some examples:

set shmsys:shminfo_shmmax=4294967295
set shmsys:shminfo_shmmin=1
set shmsys:shminfo_shmmni=100
set shmsys:shminfo_shmseg=10
set semsys:seminfo_semmni=100
set semsys:seminfo_semmsl=100
set semsys:seminfo_semmns=2500
set semsys:seminfo_semopm=100
set semsys:seminfo_semvmx=32767
..
..

You can use, among others, the "ipcs" command and "adb" command to retrieve kernel parameters and mem info.

Some remarks on Shared Memory and Semaphores:

- Shared Memory
Shared memory provides the fastest way for processes to pass large amounts of data to one another. 
As the name implies, shared memory refers to physical pages of memory that are shared by more than one process. 

Of particular interest is the "Intimate Shared Memory" facility, where the translation tables are shared 
as well as the memory. This enhances the effectiveness of the TLB (Translation Lookaside Buffer), 
which is a CPU-based cache of translation table information. Since the same information is used for 
several processes, available buffer space can be used much more efficiently. In addition, ISM-designated memory 
cannot be paged out, which can be used to keep frequently-used data and binaries in memory. 

Database applications are the heaviest users of shared memory. Vendor recommendations should be consulted 
when tuning the shared memory parameters. 

Solaris 10 only uses the shmmax and shmmni parameters. (Other parameters are set dynamically within the 
Solaris 10 IPC model.) 

shmmax (max-shm-memory in Solaris 10+): This is the maximum size of a shared memory segment 
(ie the largest value that can be used by shmget). Its theoretical maximum value is 4294967295 (4GB), 
but practical considerations usually limit it to less than this. There is no reason not to tune this value 
as high as possible, since no kernel resources are allocated based on this parameter. Solaris 10 sets shmmax 
to 1/4 physical memory by default, vs 512k for previous versions. 
shmmin: This is the smallest possible shared memory segment size. The default is 1 byte; this parameter 
should probably not be tuned. 
shmmni (max-shm-ids in Solaris 10+): Maximum number of shared memory identifiers at any given time. 
This parameter is used by kernel memory allocation to determine how much size to put aside for shmid_ds structures. 
Each of these is 112 bytes and requires an additional 8 bytes for a mutex lock; if it is set too high, memory useage 
can be a problem. The maximum setting for this variable in Solaris 2.5.1 and 2.6 is 2147483648 (2GB), and the 
default is 100. For Solaris 10, the default is 128 and the maximum is MAXINT. 
shmseg: Maximum number of segments per process. It is usually set to shmmni, but it should always be less 
than 65535. Sun documentations suggests a maximum for this parameter of 32767 and a default of 8 for 
Solaris 2.5.1 and 2.6. 

- Semaphores
Semaphores are a shareable resource that take on a non-negative integer value. They are manipulted 

by the P (wait) and V (signal) functions, which decrement and increment the semaphore, respectively. When a 
process needs a resource, a "wait" is issued and the semaphore is decremented. When the semaphore contains 
a value of zero, the resources are not available and the calling process spins or blocks (as appropriate) 
until resources are available. When a process releases a resource controlled by a semaphore, it increments 
the semaphore and the waiting processes are notified. 

Solaris 10 only uses the semmni, semmsl and semopm parameters. (Other parameters are dynamic within 
the Solaris 10 IPC model.) 

semmap: This sets the number of entries in the semaphore map. This should never be greater than semmni. If the number 
of semaphores per semaphore set used by the application is "n" then set semmap = ((semmni + n - 1)/n)+1
or more. Alternatively, we can set semmap to semmni x semmsl. An undersized semmap leads to "WARNING: 
rmfree map overflow" errors. The default setting is 10; the maximum for Solaris 2.6 is 2GB. The default for 
Solaris 9 was 25; Solaris 10 increased the default to 512. The limit is SHRT_MAX. 
semmni (max-sem-ids in Solaris 10+): Maximum number of systemwide semaphore sets. Each control structure consumes 
84 bytes. For Solaris 2.5.1-9, the default setting is 10; for Solaris 10, the default setting is 128. 
The maximum is 65535 
semmns: Maximum number of semaphores in the system. Each structure uses 16 bytes. This parameter should be set 
to semmni x semmsl. The default is 60; the maximum is 2GB. 
semmnu: Maximum number of undo structures in the system. This should be set to semmni so that each control structure 
has an undo structure. The default is 30, the maximum is 2 GB. 
semmsl (max-sem-nsems in Solaris 10+): Maximum number of semaphores per semaphore set. The default is 25, 
the maximum is 65535. 
semopm (max-sem-ops in Solaris 10+): Maximum number of semaphore operations that can be performed in each 
semop call. The default in Solaris 2.5.1-9 is 10, the maximum is 2 GB. Solaris 10 increased the default to 512. 
semume: Maximum number of undo structures per process. This should be set to semopm times the number of processes 
that will be using semaphores at any one time. The default is 10; the maximum is 2 GB. 
semusz: Number of bytes required for semume undo structures. This should not be tuned; it is set to 
semume x (1 + sizeof(undo)). The default is 96; the maximum is 2 GB. 
semvmx: Maximum value of a semaphore. This should never exceed 32767 (default value) unless SEM_UNDO 
is never used. The default is 32767; the maximum is 65535. 
semaem: Maximum adjust-on-exit value. This should almost always be left alone. The default is 16384; 
the maximum is 32767. 


Linux:
------

Kernel parameters used for system configuration are found in "/etc/sysctl.conf" and on a running system also in "/proc/sys/kernel", where you 
will find an individual file for each configuration parameter. Because these parameters have a direct effect on system 
performance and viability, you must have root access in order to modify them.

Occasionally, a prerequisite to a package installation requires the modification of kernel parameters. 
Since each parameter file contains a single line of data consisting of either a text 
string or numeric values, it is often easy to modify a parameter by simply using the echo command:

# echo 2048 > /proc/sys/kernel/msgmax

The aforementioned command will set the value of the msgmax parameter to 2048.

-- More on the proc File System:

The Linux kernel has two primary functions: to control access to physical devices on the computer 
and to schedule when and how processes interact with these devices. The /proc/ directory contains 
a hierarchy of special files which represent the current state of the kernel - allowing applications 
and users to peer into the kernel's view of the system. 

Within the /proc/ directory, one can find a wealth of information about the system hardware and any processes 
currently running. In addition, some of the files within the /proc/ directory tree can be manipulated by users 
and applications to communicate configuration changes to the kernel. 

Under Linux, all data are stored as files. Most users are familiar with the two primary types of files: 
text and binary. But the /proc/ directory contains another type of file called a virtual file. 
It is for this reason that /proc/ is often referred to as a virtual file system. 
These virtual files have unique qualities. Most of them are listed as zero bytes in size and yet when one 
is viewed, it can contain a large amount of information. In addition, most of the time and date settings 
on virtual files reflect the current time and date, indicative of the fact they constantly changing. 

Virtual files such as interrupts, /proc/meminfo, /proc/mounts, and /proc/partitions provide an 
up-to-the-moment glimpse of the system's hardware. Others, like /proc/filesystems and the /proc/sys/ 
directory provide system configuration information and interfaces. 

For organizational purposes, files containing information on a similar topic are grouped into virtual 
directories and sub-directories. For instance, /proc/ide/ contains information for all physical IDE devices. 
Likewise, process directories contain information about each running process on the system. 

By using the cat, more, or less commands on files within the /proc/ directory, you can immediately access 
an enormous amount of information about the system. For example, if you want to see what sort of CPU 
your computer has, type "cat /proc/cpuinfo" and you will see something similar to the following: 

processor	: 0
vendor_id	: AuthenticAMD
cpu family	: 5
model		: 9
model name	: AMD-K6(tm) 3D+ Processor
stepping	: 1
cpu MHz		: 400.919
cache size	: 256 KB
fdiv_bug	: no
hlt_bug		: no
f00f_bug	: no
coma_bug	: no
fpu		: yes
fpu_exception	: yes
cpuid level	: 1
wp		: yes
flags		: fpu vme de pse tsc msr mce cx8 pge mmx syscall 3dnow k6_mtrr
bogomips	: 799.53
 

When viewing different virtual files in the /proc/ file system, you will notice some of the information is 
easily understandable while some is not human-readable. This is in part why utilities exist to pull data 
from virtual files and display it in a useful way. Some examples of such applications are 
lspci, apm, free, and top. 

As a general rule, most virtual files within the /proc/ directory are read only. However, some can be used 
to adjust settings in the kernel. This is especially true for files in the /proc/sys/ subdirectory. 

To change the value of a virtual file, use the echo command and a > symbol to redirect the new value to the file. 
For instance, to change your hostname on the fly, you can type: 

echo bob.subgenius.com > /proc/sys/kernel/hostname 
 
Other files act as binary or boolean switches. For instance, if you type cat /proc/sys/net/ipv4/ip_forward, 
you will see either a 0 or a 1. A 0 indicates the kernel is not forwarding network packets. By using the 
echo command to change the value of the ip_forward file to 1, you can immediately turn packet forwarding on. 

Another command used to alter settings in the /proc/sys/ subdirectory is /sbin/sysctl.


-- sysctl:

Linux also provides the sysctl command to modify kernel parameters at runtime. 
Sysctl uses parameter information stored in a file called /etc/sysctl.conf. If, for example, we wanted to 
change the value of the msgmax parameter as we did above, but this time using sysctl, the command would 
look like this:

# sysctl -w kernel.msgmax=2048


- About the kernel:

Finding the Kernel
Locate the kernel image on your hard disk. It should be in the file /vmlinuz, or /vmlinux, or /boot/vmlinux
In some installations, /vmlinuz is a soft link to the actual kernel, so you may need to track down 
the kernel by following the links. On Redhat 6.1 it is in "/boot/vmlinuz". To find the kernel being used 
look in "/etc/lilo.conf".

You can also type "uname -a" to see the kernel version. 

/proc/cmdline

This file shows the parameters passed to the kernel at the time it is started. A sample /proc/cmdline file 
looks like this: 

ro root=/dev/hda2

This tell us the kernel is mounted read-only - signified by (ro) - off of the second partition 
on the first IDE device (/dev/hda2). 


- Kernel, memory tuning:

Most about tuning memory en kernel params seem to do with the "/etc/sysctl.conf" file:

In most distributions, the "/etc/sysctl.conf" determines the limits and/or behaviour of the kernel 
and memory.

If you type "sysctl -a |more" you will see a long list of kernel parameters. 
You can use this sysctl program to modify these parameters, for example:

# sysctl -w kernel.shmmax=100000000
# sysctl -w fs.file-max=65536
# echo "kernel.shmmax = 100000000" >> /etc/sysctl.conf


Example configuration: setting kernel parameters before installing Oracle 10g:
------------------------------------------------------------------------------

Most out of the box kernel parameters (of RHELS 3,4,5) are set correctly for Oracle
except a few.

You should have the following minimal configuration:

net.ipv4.ip_local_port_range	1024  65000
kernel.sem			250  32000  100  128
kernel.shmmni			4096
kernel.shmall			2097152
kernel.shmmax			2147483648
fs.file-max			65536


You can check the most important parameters using the following command:

# /sbin/sysctl -a | egrep 'sem|shm|file-max|ip_local'

net.ipv4.ip_local_port_range = 1024  65000
kernel.sem = 250  32000  100  128
kernel.shmmni = 4096
kernel.shmall = 2097152
kernel.shmmax = 2147483648
fs.file-max = 65536

If some value should be changed, you can change the "/etc/sysctl.conf" file and run the "/sbin/sysctl -p" command
to change the value immediately.
Every time the system boots, the init program runs the /etc/rc.d/rc.sysinit script. This script contains 
a command to execute sysctl using /etc/sysctl.conf to dictate the values passed to the kernel. 
Any values added to /etc/sysctl.conf will take effect each time the system boots. 


Example configuration: from: Installing Oracle 91 on Linux
-----------------------------------------------------------

For Linux, use the ipcs command to obtain a list of the system's current shared memory segments and 
semaphore sets, and their identification numbers and owner. 

Perform the following steps to modify the kernel parameters by using the /proc file system. 

Log in as the root user. 

Change to the /proc/sys/kernel directory. 

Review the current semaphore parameter values in the sem file by using the cat or more utility. 
For example, using the cat utility, enter the following command: 

# cat sem

The output lists, in order, the values for the SEMMSL, SEMMNS, SEMOPM, and SEMMNI parameters. 
The following example shows how the output appears: 

250 32000 32 128

In the preceding output example, 250 is the value of the SEMMSL parameter, 32000 is the value of the 
SEMMNS parameter, 32 is the value of the SEMOPM parameter, and 128 is the value of the SEMMNI parameter. 

Modify the parameter values by using the following command syntax: 

# echo SEMMSL_value SEMMNS_value SEMOPM_value SEMMNI_value > sem

Replace the parameter variables with the values for your system in the order that they are entered 
in the preceding example. For example: 

# echo 100 32000 100 100 > sem

Review the current shared memory parameters by using the cat or more utility. For example, using the cat utility, 
enter the following command: 

# cat shared_memory_parameter

In the preceding example, the variable shared_memory_parameter is either the SHMMAX or SHMMNI parameter. 
The parameter name must be entered in lowercase letters. 

Modify the shared memory parameter by using the echo utility. For example, to modify the SHMMAX parameter, 
enter the following command: 

# echo 2147483648 > shmmax

Modify the shared memory parameter by using the echo utility. For example, to modify the SHMMNI parameter, 
enter the following command: 

# echo 4096 > shmmni

Modify the shared memory parameter by using the echo utility. For example, to modify the SHMALL parameter, 
enter the following command: 

# echo 2097152 > shmall

Write a script to initialize these values during system startup, and include the script in your system init files. 

See Also: 
Your system vendor's documentation for more information on script files and init files.  

Set the File Handles by using ulimit -n and /proc/sys/fs/file-max. 

# echo 65536 > /proc/sys/fs/file-max
ulimit -n 65536

Set the Sockets to /proc/sys/net/ipv4/ip_local_port_range 

# echo 1024 65000 > /proc/sys/net/ipv4/ip_local_port_change

Set the Process limit by using ulimit -u. This will give you the number of processes per user. 

ulimit -u 16384


Linux modules:
--------------


Modules on Linux (1):
---------------------

- insmod, rmmod, lsmod

lsmod:
------

lsmod - list loaded modules.   

SYNOPSIS
lsmod [-hV]   
DESCRIPTION
lsmod shows information about all loaded modules. 
The format is name, size, use count, list of referring modules. The information displayed is identical 
to that available from "/proc/modules". 

If the module controls its own unloading via a can_unload routine then the user count displayed by lsmod 
is always -1, irrespective of the real use count.   

insmod:
-------

insmod - install loadable kernel module 

SYNOPSIS
insmod [-fhkLmnpqrsSvVxXyYN] [-e persist_name] [-o module_name] [-O blob_name] [-P prefix] module [ symbol=value ... ] 
DESCRIPTION
insmod installs a loadable module in the running kernel. 
insmod tries to link a module into the running kernel by resolving all symbols from the kernel's 
exported symbol table. 

If the module file name is given without directories or extension, insmod will search for the module 
in some common default directories. The environment variable MODPATH can be used to override this default. 
If a module configuration file such as /etc/modules.conf exists, it will override the paths defined in MODPATH. 

The environment variable MODULECONF can also be used to select a different configuration file from the 
default /etc/modules.conf (or /etc/conf.modules (deprecated)). This environment variable will override 
all the definitions above. 

When environment variable UNAME_MACHINE is set, modutils will use its value instead of the machine field 
from the uname() syscall. This is mainly of use when you are compiling 64 bit modules in 32 bit user space 
or vice versa, set UNAME_MACHINE to the type of the modules. Current modutils does not support full 
cross build mode for modules, it is limited to choosing between 32 and 64 bit versions of the host architecture. 

rmmod:
------

rmmod - unload loadable modules   
SYNOPSIS
rmmod [ -aehrsvV ] module ...   
DESCRIPTION
rmmod unloads loadable modules from the running kernel. 
rmmod tries to unload a set of modules from the kernel, with the restriction that they are not in use 
and that they are not referred to by other modules. 

If more than one module is named on the command line, the modules will be removed in the given order. 
This supports unloading of stacked modules. 

With the option '-r', a recursive removal of modules will be attempted. This means that if a top module 
in a stack is named on the command line, all modules that are used by this module will be removed as well, 
if possible. 


More info about the mod commands:
---------------------------------

- Hardware Detection with the Help of hwinfo
hwinfo can detect the hardware of your system and select the drivers needed to run this hardware. 
Get a small introduction to this command with hwinfo --help. If you, for example, need information about 
your SCSI devices, use the command hwinfo --scsi.

All this information is also available in YaST in the hardware information module. 

- Handling Modules
The following commands are available:

insmod
insmod loads the requested module after searching for it in a subdirectory of /lib/modules/<version>. 
It is better, however, to use modprobe rather than insmod. 

rmmod
Unloads the requested module. This is only possible if this module is no longer needed. For example, 
the isofs module cannot be unloaded while a CD is still mounted. 

depmod
Creates the file modules.dep in /lib/modules/<version> that defines the dependencies of all the modules. 
This is necessary to ensure that all dependent modules are loaded with the selected ones. 
This file will be built after the system is started if it does not exist.

modprobe
Loads or unloads a given module while taking into account dependencies of this module. This command 
is extremely powerful and can be used for a lot of things (e.g., probing all modules of a given type 
until one is successfully loaded). In contrast to insmod, modprobe checks /etc/modprobe.conf and therefore 
is the preferred method of loading modules. For detailed information about this topic, refer to the 
corresponding man page. 

lsmod
Shows which modules are currently loaded as well as how many other modules are using them. Modules started 
by the kernel daemon are tagged with autoclean. This label denotes that these modules will automatically 
be removed once they reach their idle time limit. 

modinfo
Shows module information.

/etc/modprobe.conf
The loading of modules is affected by the files /etc/modprobe.conf and /etc/modprobe.conf.local 
and the directory /etc/modprobe.d. See man modprobe.conf. Parameters for modules that access hardware directly
must be entered in this file. Such modules may need system-specific options (e.g., CD-ROM driver or network driver). 
The parameters used here are described in the kernel sources. Install the package kernel-source and read the 
documentation in the directory /usr/src/linux/Documentation. 

Kmod - the Kernel Module Loader
The kernel module loader is the most elegant way to use modules. Kmod performs background monitoring 
and makes sure the required modules are loaded by modprobe as soon as the respective functionality is needed 
in the kernel. 

To use Kmod, activate the option `Kernel module loader' (CONFIG_KMOD) in the kernel configuration. 
Kmod is not designed to unload modules automatically; in view of today's RAM capacities, the potential memory savings 
would be marginal. For reasons of performance, monolithic kernels may be more suitable for servers 
that are used for special tasks and need only a few drivers. 


modprobe.conf:
--------------

Example 1:

# This file is autogenerated from /etc/modules.conf using generate-modprobe.conf command

alias eth1 sk98lin
alias eth0 ipw2200
alias sound-slot-0 snd-hda-intel
install scsi_hostadapter /sbin/modprobe ahci; /bin/true
remove snd-hda-intel /sbin/modprobe -r snd-pcm-oss; /sbin/modprobe --first-time -r --ignore-remove snd-hda-intel
install snd-hda-intel /sbin/modprobe --first-time --ignore-install snd-hda-intel && { /sbin/modprobe snd-pcm-oss; /bin/true; }
install usb-interface /sbin/modprobe uhci-hcd; /sbin/modprobe ehci-hcd; /bin/true
#alias eth1 eth1394
alias ieee1394-controller ohci1394
alias net-pf-10 off

#irda
alias tty-ldisc-11 irtty
alias char-major-161-* ircomm-tty

# Para nsc 383 SIO:
alias char-major-160-* nsc-ircc
alias irda0 nsc-ircc
options nsc-irc io=0x2f8 irq=3 dma=0
install nsc-ircc { /bin/setserial /dev/ttyS1 uart none; } ; /sbin/modprobe --first-time --ignore-install nsc-ircc

#irda: 0x2f8, irq 3, dma 0
#lpt: 0x3f8, irq 7, dma 1

options parport_pc io=0x378 irq=7 dma=1

Example 2:

alias ieee1394-controller ohci1394
alias eth0 eepro100
alias sound-slot-0 emu10k1
alias net-pf-10 off
install snd-emu10k1 /sbin/modprobe --first-time --ignore-install snd-emu10k1 
&& { /sbin/modprobe snd-pcm-oss; /bin/true; }
install usb-interface /sbin/modprobe usb-uhci; /sbin/modprobe ehci-hcd; /bin/true
remove snd-emu10k1 { /sbin/modprobe -r snd-pcm-oss; } ; /sbin/modprobe -r --first-time --ignore-remove snd-emu10k1 


/etc/sysconfig:
---------------

Note 1:
-------

SuSEconfig and /etc/sysconfig
The main configuration of SUSE LINUX can be made with the configuration files in /etc/sysconfig. 
Former versions of SUSE LINUX relied on /etc/rc.config for system configuration, but it became obsolete 
in previous versions. /etc/rc.config is not created at installation time, as all system configuration 
is controlled by /etc/sysconfig. However, if /etc/rc.config exists at the time of a system update, 
it remains intact.

The individual files in /etc/sysconfig are only read by the scripts to which they are relevant. This ensures 
that network settings, for instance, need to be parsed only by network-related scripts. Apart from that, 
there are many other system configuration files that are generated according to the settings in /etc/sysconfig. 
This task is performed by SuSEconfig. For example, if you change the network configuration, SuSEconfig is likely 
to make changes to the file /etc/host.conf as well, as this is one of the files relevant for the 
network configuration. 

If you change anything in these files manually, run SuSEconfig afterwards to make sure all the necessary 
changes are made in all the relevant places. If you change the configuration using the YaST sysconfig editor, 
all changes are applied automatically - YaST automatically starts SuSEconfig to update the configuration 
files as needed.

This concept enables you to make basic changes to your configuration without needing to reboot the system. 
Because some changes are rather complex, some programs must be restarted for the changes to take effect. 
For instance, changes to the network configuration may require a restart of the network programs concerned. 
This can be achieved by entering the commands rcnetwork stop and rcnetwork start.

Note 2:
-------

The Linux sysconfig directory
The /etc/sysconfig directory is where many of the files that control the system configuration are stored. 
This section lists these files and many of the optional values in the files used to make system changes. 
To get complete information on these files read the file /usr/doc/initscripts-4.48/sysconfig.txt. 

/etc/sysconfig/clock
Used to configure the system clock to Universal or local time and set some other clock parameters. An example file: 
UTC=false
ARC=false

Options: 
UTC - true means the clock is set to UTC time otherwise it is at local time 
ARC - Set true on alpha stations only. It indicates the ARC console's 42-year time offset is in effect. If not set to true, the normal Unix epoch is assumed. 
ZONE="filename" - indicates the zonefile under the directory /usr/share/zoneinfo that the /etc/localtime file is a copy of. This may be set to: 
ZONE="US/Eastern" 

/etc/sysconfig/init
This file is used to set some terminal characteristics and environment variables. A sample listing: 
# color => new RH6.0 bootup
# verbose => old-style bootup
# anything else => new style bootup without ANSI colors or positioning
BOOTUP=color
# column to start "[  OK  ]" label in 
RES_COL=60
# terminal sequence to move to that column. You could change this
# to something like "tput hpa ${RES_COL}" if your terminal supports it
MOVE_TO_COL="echo -en \\033[${RES_COL}G"
# terminal sequence to set color to a 'success' color (currently: green)
SETCOLOR_SUCCESS="echo -en \\033[1;32m"
# terminal sequence to set color to a 'failure' color (currently: red)
SETCOLOR_FAILURE="echo -en \\033[1;31m"
# terminal sequence to set color to a 'warning' color (currently: yellow)
SETCOLOR_WARNING="echo -en \\033[1;33m"
# terminal sequence to reset to the default color.
SETCOLOR_NORMAL="echo -en \\033[0;39m"
# default kernel loglevel on boot (syslog will reset this)
LOGLEVEL=1
# Set to something other than 'no' to turn on magic sysrq keys...
MAGIC_SYSRQ=no
# Set to anything other than 'no' to allow hotkey interactive startup...
PROMPT=yes

Options: 
BOOTUP=bootupmode - Choices are color, or verbose. The choice color sets new boot display. The choice verbose sets old style display. 
Anything else sets a new display without ANSI formatting. 
LOGLEVEL=number - Sets the initial console logging level for the kernel. The default is 7. The values are: 
emergency, panic - System is unusable 
alert - Action must be taken immediately 
crit - Critical conditions 
err, error (depreciated) - Error conditions 
warning, warn (depreciated) - Warning conditions 
notice - Normal but significant conditions 
info - Informational message 
debug - Debug level message 
RES_COL=number - Screen column to start status labels at. The Default is 60. 
MOVE_TO_COL=command - A command to move the cursor to $RES_COL. 
SETCOLOR_SUCCESS=command - Set the color used to indicate success. 
SETCOLOR_FAILURE=command - Set the color used to indicate failure. 
SETCOLOR_WARNING=command - Set the color used to indicate warning. 
SETCOLOR_NORMAL=command - Set the color used tor normal color 
MAGIC_SYSRQ=yes|no - Set to 'no' to disable the magic sysrq key. 
PROMPT=yes|no - Set to 'no' to disable the key check for interactive mode. 


/etc/sysconfig/keyboard
Used to configure the keyboard. Used by the startup script /etc/rc.d/rc.sysinit. An example file: 
KEYTABLE="us"

Options: 
KEYTABLE="keytable file" - The line [ KEYTABLE="/usr/lib/kbd/keytables/us.map" ] tells the system to use the file shown for keymapping. 
KEYBOARDTYPE=sun|pc - The selection, "sun", indicates attached on /dev/kbd is a sun keyboard. The selection "pc" indicates a PS/2 keyboard is on the ps/2 port. 


/etc/sysconfig/mouse
This file is used to configure the mouse. An example file: 
FULLNAME="Generic - 2 Button Mouse (PS/2)"
MOUSETYPE="ps/2"
XEMU3="yes"
XMOUSETYPE="PS/2"

Options: 
MOUSETYPE=type - Choices are microsoft, mouseman, mousesystems, ps/2, msbm, logibm, atibm, logitech, mmseries, or mmhittab. 
XEMU3=yes|no - If yes, emulate three buttons, otherwise not. 


/etc/sysconfig/network
Used to configure networking options. All IPX options default to off. An example file: 
NETWORKING=yes
FORWARD_IPV4="yes"
HOSTNAME="mdct-dev3"
GATEWAY="10.1.0.25"
GATEWAYDEV="eth0"

Options: 
NETWORKING=yes|no - Sets network capabilities on or off. 
HOSTNAME="hostname". To work with old software, the /etc/HOSTNAME file should contain the same hostname. 
FORWARD_IPV4=yes|no - Turns the ability to perform IP forwarding on or off. Turn it on if you want to use the machine as a router. 
Turn it off to use it as a firewall or IP masquerading. 
DEFRAG_IPV4=yes|no - Set this to automatically defragment IPv4 packets. This is good for masquerading, and a bad idea otherwise. It defaults to 'no'. 
GATEWAY="gateway IP" 
GATEWAYDEV="gateway device" Possible values include eth0, eth1, or ppp0. 
NISDOMAIN="nis domain name" 
IPX=yes|no - Turn IPX ability on or off. 
IPXAUTOPRIMARY=on|off - Must not be yes or no. 
IPXAUTOFRAME=on|off 
IPXINTERNALNETNUM="netnum" 
IPXINTERNALNODENUM="nodenum" 


/etc/sysconfig/static-routes
Configures static routes on a network. Used to set up static routing. An example file: 
eth1 net 192.168.199.0 netmask 255.255.255.0 gw 192.168.199.1
eth0 net 10.1.0.0 netmask 255.255.0.0 gw 10.1.0.153
eth1 net 255.255.255.255 netmask 255.255.255.255

The syntax is: 
device net network netmask netmask gw gateway 

The device may be a device name such as eth0 which is used to have the route brought up and down as the device is brought up or down. 
The value can also be "any" to let the system calculate the correct devices at run time. 


/etc/sysconfig/routed 
Sets up dynamic routing policies. An example file: 
EXPORT_GATEWAY="no"
SILENT="yes"

Options: 
SILENT=yes|no 
EXPORT_GATEWAY=yes|no 


/etc/sysconfig/pcmcia
Used to configure pcmcia network cards. An example file: 
PCMCIA=no
PCIC=
PCIC_OPTS=
CORE_OPTS=

Options: 
PCMCIA=yes|no 
PCIC=i82365|tcic 
PCIC_OPTS=socket driver (i82365 or tcic) timing parameters 
CORE_OPTS=pcmcia_core options 
CARDMGR_OPTS=cardmgr options 


/etc/sysconfig/amd
Used to configure the auto mount daemon. An example file: 
ADIR=/.automount
MOUNTPTS='/net /etc/amd.conf'
AMDOPTS=

Options: 
ADIR=/.automount (normally never changed) 
MOUNTPTS='/net /etc/amd.conf' (standard automount stuff) 
AMDOPTS= (extra options for AMD) 


/etc/sysconfig/tape
Used for backup tape device configuration. Options: 
DEV=/dev/nst0 - The tape device. Use the non-rewinding tape for these scripts. For SCSI tapes the device is /dev/nst#, 
where # is the number of the tape drive you want to use. If you only have one then use nst0. For IDE tapes the device is 
/dev/ht#. For floppy tape drives the device is /dev/ftape. 
ADMIN=root - The person to mail to if the backup fails for any reason 
SLEEP=5 - The time to sleep between tape operations. 
BLOCKSIZE=32768 - This worked fine for 8mm, then 4mm, and now DLT. An optimal setting is probably the amount of data your drive writes at one time. 
SHORTDATE=$(date +%y:%m:%d:%H:%M) - A short date string, used in backup log filenames. 
DAY=$(date +log-%y:%m:%d) - Used for the log file directory. 
DATE=$(date) - Date string, used in log files. 
LOGROOT=/var/log/backup - Root of the logging directory 
LIST=$LOGROOT/incremental-list - This is the file name the incremental backup will use to store the incremental list. It will be $LIST-{some number}. 
DOTCOUNT=$LOGROOT/.count - For counting as you go to know which incremental list to use. 
COUNTER=$LOGROOT/counter-file - For rewinding when done...might not use. 
BACKUPTAB=/etc/backuptab - The file in which we keep our list of backup(s) we want to make. 


/etc/sysconfig/sendmail
An example file: 
DAEMON=yes
QUEUE=1h

Options: 
DAEMON=yes|no - yes implies -bd 
QUEUE=1h - Given to sendmail as -q$QUEUE. The -q option is not given to sendmail if /etc/sysconfig/sendmail exists and QUEUE is empty or undefined. 


/etc/sysconfig/i18n
Controls the system font settings. The language variables are used in /etc/profile.d/lang.sh. An example i18n file: 
LANG="en_US"
LC_ALL="en_US"
LINGUAS="en_US"

Options: 
LANG= set locale for all categories, can be any two letter ISO language code. 
LC_CTYPE= localedata configuration for classification and conversion of characters. 
LC_COLLATE= localedata configuration for collation (sort order) of strings. 
LC_MESSAGES= localedata configuration for translation of yes and no messages. 
LC_NUMERIC= localedata configuration for non-monetary numeric data. 
LC_MONETARY= localedata configuration for monetary data. 
LC_TIME= localedata configuration for date and time. 
LC_ALL= localedata configuration overriding all of the above. 
LANGUAGE= can be a : separated list of ISO language codes. 
LINGUAS= can be a ' ' separated list of ISO language codes. 
SYSFONT= any font that is legal when used as /usr/bin/consolechars -f $SYSFONT ... (See console-tools package for consolechars command) 
UNIMAP= any SFM (screen font map, formerly called Unicode mapping table - see consolechars(8)) 
/usr/bin/consolechars -f $SYSFONT --sfm $UNIMAP 

SYSFONTACM= any ACM (application charset map - see consolechars(8)) 
/usr/bin/consolechars -f $SYSFONT --acm $SYSFONTACM 

The above is used by the /sbin/setsysfont command (which is run by rc.sysinit at boot time.) 


/etc/sysconfig/network-scripts/ifup:
/etc/sysconfig/network-scripts/ifdown:
These are symbolic links to /sbin/ifup and /sbin/ifdown, respectively. These symlinks are here for legacy purposes only. 
They will probably be removed in future versions. These scripts take one argument normally: the name of the device (e.g. eth0). 
They are called with a second argument of "boot" during the boot sequence so that devices that are not meant to be brought up 
on boot (ONBOOT=no, see below) can be ignored at that time. 


/etc/sysconfig/network-scripts/network-functions
This is not really a public file. Contains functions which the scripts use for bringing interfaces up and down. In particular, 
it contains most of the code for handling alternative interface configurations and interface change notification through netreport. 


/etc/sysconfig/network-scripts/ifcfg-interface
/etc/sysconfig/network-scripts/ifcfg-interface-clone
Defines an interface. An example file called ifcfg-eth0: 
DEVICE="eth0"
IPADDR="10.1.0.153"
NETMASK="255.255.0.0"
ONBOOT="yes"
BOOTPROTO="none"
IPXNETNUM_802_2=""
IPXPRIMARY_802_2="no"
IPXACTIVE_802_2="no"
IPXNETNUM_802_3=""
IPXPRIMARY_802_3="no"
IPXACTIVE_802_3="no"
IPXNETNUM_ETHERII=""
IPXPRIMARY_ETHERII="no"
IPXACTIVE_ETHERII="no"
IPXNETNUM_SNAP=""
IPXPRIMARY_SNAP="no"
IPXACTIVE_SNAP="no"

The /etc/sysconfig/network-scripts/ifcfg-interface-clone file only contains the parts of the definition that are different in a "clone" 
(or alternative) interface. For example, the network numbers might be different, but everything else might be the same, 
so only the network numbers would be in the clone file, but all the device information would be in the base ifcfg file.

Base items in the above two files: 

NAME="friendly name for users to see" - Most important for PPP. Only used in front ends. 
DEVICE="name of physical device" 
IPADDR= 
NETMASK= 
GATEWAY= 
ONBOOT=yes|no 
USERCTL=yes|no 
BOOTPROTO=none|bootp|dhcp - If BOOTPROTO is not "none", then the only other item that must be set is the DEVICE item; 
all the rest will be determined by the boot protocol. No "dummy" entries need to be created. 
Base items being deprecated: 
NETWORK="will be calculated automatically with ifcalc" 
BROADCAST="will be calculated automatically with ifcalc" 
Ethernet-only items: 
{IPXNETNUM,IPXPRIMARY,IPXACTIVE}_{802_2,802_3,ETHERII,SNAP} configuration matrix for IPX. Only used if IPX is active. 
Managed from /etc/sysconfig/network-scripts/ifup-ipx 
PPP/SLIP items: 
PERSIST=yes|no 
MODEMPORT=device - An example device is /dev/modem. 
LINESPEED=speed - An example speed is 115200. 
DEFABORT=yes|no - Tells netcfg whether or not to put default abort strings in when creating/editing the chat script and/or dip script for this interface. 
PPP-specific items 
WVDIALSECT="list of sections from wvdial.conf to use" - If this variable is set, then the chat script (if it exists) is ignored, 
and wvdial is used to open the PPP connection. 
PEERDNS=yes|no - Modify /etc/resolv.conf if peer uses msdns extension. 
DEFROUTE=yes|no - Set this interface as default route? 
ESCAPECHARS=yes|no -Simplified interface here doesn't let people specify which characters to escape; almost everyone can use 
asyncmap 00000000 anyway, and they can set PPPOPTIONS to asyncmap foobar if they want to set options perfectly). 
HARDFLOWCTL=yes|no - Yes implies "modem crtscts" options. 
PPPOPTIONS="arbitrary option string" - It is placed last on the command line, so it can override other options like asyncmap that were specified differently. 
PAPNAME="name $PAPNAME" - On pppd command line. Note that the "remotename" option is always specified as the logical ppp device name, 
like "ppp0" (which might perhaps be the physical device ppp1 if some other ppp device was brought up earlier...), which makes it easy 
to manage pap/chap files -- name/password pairs are associated with the logical ppp device name so that they can be managed together. 
REMIP="remote ip address" - Normally unspecified. 
MTU= 
MRU= 
DISCONNECTTIMEOUT="number of seconds" The current default is 5. This is the time to wait before re-establishing the connection after 
a successfully-connected session terminates before attempting to establish a new connection. 
RETRYTIMEOUT="number of seconds" - The current default is 60. This is the time to wait before re-attempting to establish 
a connection after a previous attempt fails. 
/etc/sysconfig/network-scripts/chat-interface - This is the chat script for PPP or SLIP connection intended to establish 
the connection. For SLIP devices, a DIP script is written from the chat script; for PPP devices, the chat script is used directly.


/etc/sysconfig/network-scripts/dip-interface
A write-only script created from the chat script by netcfg. Do not modify this. In the future, this file may disappear by default 
and created on-the-fly from the chat script if it does not exist.


/etc/sysconfig/network-scripts/ifup-post
Called when any network device EXCEPT a SLIP device comes up. Calls /etc/sysconfig/network-scripts/ifup-routes to bring up static routes 
that depend on that device. Calls /etc/sysconfig/network-scripts/ifup-aliases to bring up aliases for that device. Sets the hostname 
if it is not already set and a hostname can be found for the IP for that device. Sends SIGIO to any programs that have requested 
notification of network events. It could be extended to fix up nameservice configuration, call arbitrary scripts, etc, as needed.


/etc/sysconfig/network-scripts/ifup-routes
Set up static routes for a device. An example file: 
#!/bin/sh

# adds static routes which go through device $1

if [ "$1" = "" ]; then
	echo "usage: $0 <net-device>"
	exit 1
fi

if [ ! -f /etc/sysconfig/static-routes ]; then
	exit 0
fi

#note the trailing space in the grep gets rid of aliases
grep "^$1 " /etc/sysconfig/static-routes | while read device args; do
	/sbin/route add -$args $device
done


/etc/sysconfig/network-scripts/ifup-aliases
Bring up aliases for a device.


/etc/sysconfig/network-scripts/ifdhcpc-done
Called by dhcpcd once dhcp configuration is complete; sets up /etc/resolv.conf from the version dhcpcd dropped in /etc/dhcpc/resolv.conf 


Note 3:
-------

Red Hat Linux 8.0: The Official Red Hat Linux Reference Guide 
Prev Chapter 3. Boot Process, Init, and Shutdown Next 

--------------------------------------------------------------------------------

The /etc/sysconfig/ Directory
The following information outlines some of the files found in the /etc/sysconfig/ directory, their function, 
and their contents. This information is not intended to be complete, as many of these files have a variety 
of options that are only used in very specific or rare circumstances.

The /usr/share/doc/initscripts-<version-number>/sysconfig.txt file contains a more authoritative listing 
of the files found in the /etc/sysconfig directory and the configuration options available.

Files in the /etc/sysconfig/ Directory
The following files are normally found in the /etc/sysconfig/ directory:

amd
apmd
arpwatch
authconfig
cipe
clock
desktop
dhcpd
firstboot
gpm
harddisks
hwconf
i18n
identd
init
ipchains
iptables
irda
keyboard
kudzu
mouse
named
netdump
network
ntpd
pcmcia
radvd
rawdevices
redhat-config-users
redhat-logviewer
samba
sendmail
soundcard
squid
tux
ups
vncservers
xinetd

It is possible that your system may be missing a few of them if the corresponding program that would need 
that file is not installed.

Next, we will take a look at each one.

/etc/sysconfig/amd
The /etc/sysconfig/amd file contains various parameters used by amd allowing for the automounting and 
automatic unmounting of file systems.

/etc/sysconfig/apmd
The /etc/sysconfig/apmd file is used by apmd as a configuration for what things to start/stop/change 
on suspend or resume. It is set up to turn on or off apmd during startup, depending on whether your hardware 
supports Advanced Power Management (APM) or if you choose not to use it. apm is a monitoring daemon that works 
with power management code within the Linux kernel. It can alert you to a low battery if you are using 
Red Hat Linux on a laptop, among other things.

/etc/sysconfig/arpwatch
The /etc/sysconfig/arpwatch file is used to pass arguments to the arpwatch daemon at boot time. 
The arpwatch daemon maintains a table of Ethernet MAC addresses and their IP address pairings. 
For more information about what parameters you can use in this file, type man arpwatch. By default, 
this file sets the owner of the arpwatch process to the user pcap.

/etc/sysconfig/authconfig
The /etc/sysconfig/authconfig file sets the kind of authorization to be used on the host. 
It contains one or more of the following lines:

USEMD5=<value>, where <value> is one of the following:

yes - MD5 is used for authentication.
no - MD5 is not used for authentication.

USEKERBEROS=<value>, where <value> is one of the following:

yes - Kerberos is used for authentication.
no - Kerberos is not used for authentication.

USELDAPAUTH=<value>, where <value> is one of the following:

yes - LDAP is used for authentication.
no - LDAP is not used for authentication.

/etc/sysconfig/clock
The /etc/sysconfig/clock file controls the interpretation of values read from the system hardware clock.

The correct values are:

UTC=<value>, where <value> is one of the following boolean values:

true or yes - Indicates that the hardware clock is set to Universal Time.
false or no - Indicates that the hardware clock is set to local time.

ARC=<value>, where <value> is the following:

true or yes - Indicates the ARC console's 42-year time offset is in effect. This setting is only 
for ARC- or AlphaBIOS-based Alpha systems. Any other value indicates that the normal UNIX epoch is in use.

SRM=<value>, where <value> is the following:

true or yes - Indicates the SRM console's 1900 epoch is in effect. This setting is only for SRM-based 
Alpha systems. Any other value indicates that the normal UNIX epoch is in use.

ZONE=<filename> - Indicates the timezone file under /usr/share/zoneinfo that /etc/localtime is a copy of, such as:

ZONE="America/New York"


Earlier releases of Red Hat Linux used the following values (which are deprecated):

CLOCKMODE=<value>, where <value> is one of the following:

GMT - Indicates that the clock is set to Universal Time (Greenwich Mean Time).

ARC - Indicates the ARC console's 42-year time offset is in effect (for Alpha-based systems only).

/etc/sysconfig/desktop
The /etc/sysconfig/desktop file specifies the desktop manager to be run, such as:

DESKTOP="GNOME"

/etc/sysconfig/dhcpd
The /etc/sysconfig/dhcpd file is used to pass arguments to the dhcpd daemon at boot time. 
The dhcpd daemon implements the Dynamic Host Configuration Protocol (DHCP) and the Internet Bootstrap 
Protocol (BOOTP). DHCP and BOOTP assign hostnames to machines on the network. For more information 
about what parameters you can use in this file, type man dhcpd.

/etc/sysconfig/firstboot
Beginning with Red Hat Linux 8.0, the first time you boot the system, the /sbin/init program calls 
the etc/rc.d/init.d/firstboot script. This allows the user to install additional applications 
and documentation before the boot process completes.

The /etc/sysconfig/firstboot file tells the firstboot command not to run on subsequent reboots. 
If you want firstboot to run the next time you boot the system, simply remove /etc/sysconfig/firstboot 
and execute chkconfig --level 5 firstboot on.

/etc/sysconfig/gpm
The /etc/sysconfig/gpm file is used to pass arguments to the gpm daemon at boot time. The gpm daemon is the 
mouse server which allows mouse acceleration and middle-click pasting. For more information about what 
parameters you can use in this file, type man gpm. By default, it sets the mouse device to /dev/mouse.

/etc/sysconfig/harddisks
The /etc/sysconfig/harddisks file allows you to tune your hard drive(s). You can also use /
etc/sysconfig/hardiskhd[a-h], to configure parameters for specific drives.

 Warning 
  Do not make changes to this file lightly. If you change the default values stored here, you could 
  corrupt all of the data on your hard drive(s).
 
The /etc/sysconfig/harddisks file may contain the following:

USE_DMA=1, where setting this to 1 enables DMA. However, with some chipsets and hard drive combinations, 
DMA can cause data corruption. Check with your hard drive documentation or manufacturer before enabling this.

Multiple_IO=16, where a setting of 16 allows for multiple sectors per I/O interrupt. When enabled, 
this feature reduces operating system overhead by 30-50%. Use with caution.

EIDE_32BIT=3 enables (E)IDE 32-bit I/O support to an interface card.

LOOKAHEAD=1 enables drive read-lookahead.

EXTRA_PARAMS= specifies where extra parameters can be added.

/etc/sysconfig/hwconf
The /etc/sysconfig/hwconf file lists all the hardware that kudzu detected on your system, as well as 
the drivers used, vendor ID and device ID information. The kudzu program detects and configures new and/or 
changed hardware on a system. The /etc/sysconfig/hwconf file is not meant to be manually edited. 
If you do edit it, devices could suddenly show up as being added or removed.

/etc/sysconfig/i18n
The /etc/sysconfig/i18n file sets the default language, such as:

LANG="en_US"

/etc/sysconfig/identd
The /etc/sysconfig/identd file is used to pass arguments to the identd daemon at boot time. 
The identd daemon returns the username of processes with open TCP/IP connections. Some services on 
the network, such as FTP and IRC servers, will complain and cause slow responses if identd is not running. 
But in general, identd is not a required service, so if security is a concern, you should not run it. 
For more information about what parameters you can use in this file, type man identd. By default, 
the file contains no parameters.

/etc/sysconfig/init
The /etc/sysconfig/init file controls how the system will appear and function during the boot process.

The following values may be used:

BOOTUP=<value>, where <value> is one of the following:

BOOTUP=color means the standard color boot display, where the success or failure of devices and services starting up is shown in different colors.

BOOTUP=verbose means an old style display, which provides more information than purely a message of success or failure.

Anything else means a new display, but without ANSI-formatting.

RES_COL=<value>, where <value> is the number of the column of the screen to start status labels. Defaults to 60.

MOVE_TO_COL=<value>, where <value> moves the cursor to the value in the RES_COL line. Defaults to ANSI sequences output by echo -e.

SETCOLOR_SUCCESS=<value>, where <value> sets the color to a color indicating success. Defaults to ANSI sequences output by echo -e, setting the color to green.

SETCOLOR_FAILURE=<value>, where <value> sets the color to a color indicating failure. Defaults to ANSI sequences output by echo -e, setting the color to red.

SETCOLOR_WARNING=<value>, where <value> sets the color to a color indicating warning. Defaults to ANSI sequences output by echo -e, setting the color to yellow.

SETCOLOR_NORMAL=<value>, where <value> sets the color to 'normal'. Defaults to ANSI sequences output by echo -e.

LOGLEVEL=<value>, where <value> sets the initial console logging level for the kernel. The default is 7; 8 means everything 
(including debugging); 1 means nothing except kernel panics. syslogd will override this once it starts.

PROMPT=<value>, where <value> is one of the following boolean values:

yes - Enables the key check for interactive mode.

no - Disables the key check for interactive mode.

/etc/sysconfig/ipchains
The /etc/sysconfig/ipchains file contains information used by the kernel to set up ipchains packet filtering rules 
at boot time or whenever the service is started.

This file is modified by typing the command /sbin/service ipchains save when valid ipchains rules are in place. 
You should not manually edit this file. Instead, use the /sbin/ipchains command to configure the necessary packet filtering rules 
and then save the rules to this file using /sbin/service ipchains save.

Use of ipchains to set up firewall rules is not recommended as it is deprecated and may disappear from future releases of Red Hat Linux. 
If you need a firewall, you should use iptables instead.

/etc/sysconfig/iptables
Like /etc/sysconfig/ipchains, the /etc/sysconfig/iptables file stores information used by the kernel to set up packet 
filtering services at boot time or whenever the service is started.

You should not modify this file by hand unless you are familiar with how to construct iptables rules. The simplest way to add rules
is to use the /usr/sbin/lokkit command or the gnome-lokkit graphical application to create your firewall. Using these applications 
will automatically edit this file at the end of the process.

If you wish, you can manually create rules using /sbin/iptables and then type /sbin/service iptables save to add the rules to the /etc/sysconfig/iptables file.

Once this file exists, any firewall rules saved there will persist through a system reboot or a service restart.

For more information on iptables see Chapter 13.

/etc/sysconfig/irda
The /etc/sysconfig/irda file controls how infrared devices on your system are configured at startup.

The following values may be used:

IRDA=<value>, where <value> is one of the following boolean values:

yes - irattach will be run, which periodically checks to see if anything is trying to connect to the infrared port, 
such as another notebook computer trying to make a network connection. For infrared devices to work on your system, this line must be set to yes.

no - irattach will not be run, preventing infrared device communication.

DEVICE=<value>, where <value> is the device (usually a serial port) that handles infrared connections.

DONGLE=<value>, where <value> specifies the type of dongle being used for infrared communication. This setting exists for people 
who use serial dongles rather than real infrared ports. A dongle is a device that is attached to a traditional serial port 
to communicate via infrared. This line is commented out by default because notebooks with real infrared ports are far more 
common than computers with add-on dongles.

DISCOVERY=<value>, where <value> is one of the following boolean values:d

yes - Starts irattach in discovery mode, meaning it actively checks for other infrared devices. This needs to be turned on 
for the machine to be actively looking for an infrared connection (meaning the peer that does not initiate the connection).

no - Does not start irattach in discovery mode.

/etc/sysconfig/keyboard
The /etc/sysconfig/keyboard file controls the behavior of the keyboard. The following values may be used:

KEYBOARDTYPE=sun|pc, which is used on SPARCs only. sun means a Sun keyboard is attached on /dev/kbd, and pc means a PS/2 keyboard connected to a PS/2 port.

KEYTABLE=<file>, where <file> is the name of a keytable file.

For example: KEYTABLE="us". The files that can be used as keytables start in /lib/kbd/keymaps/i386 and branch into different keyboard 
layouts from there, all labeled <file>.kmap.gz. The first file found beneath /lib/kbd/keymaps/i386that matches the KEYTABLE setting is used.

/etc/sysconfig/kudzu
The /etc/sysconfig/kuzdu allows you to specify a safe probe of your system's hardware by kudzu at boot time. A safe probe is one 
that disables serial port probing.

SAFE=<value>, where <value> is one of the following:

yes - kuzdu does a safe probe.

no - kuzdu does a normal probe.

/etc/sysconfig/mouse
The /etc/sysconfig/mouse file is used to specify information about the available mouse. The following values may be used:

FULLNAME=<value>, where <value> refers to the full name of the kind of mouse being used.

MOUSETYPE=<value>, where <value> is one of the following:

microsoft - A MicrosoftT mouse.

mouseman - A MouseManT mouse.

mousesystems - A Mouse SystemsT mouse.

ps/2 - A PS/2 mouse.

msbm - A MicrosoftT bus mouse.

logibm - A LogitechT bus mouse.

atibm - An ATIT bus mouse.

logitech - A LogitechT mouse.

mmseries - An older MouseManT mouse.

mmhittab - An mmhittab mouse.

XEMU3=<value>, where <value> is one of the following boolean values:

yes - The mouse only has two buttons, but three mouse buttons should be emulated.

no - The mouse already has three buttons.

XMOUSETYPE=<value>, where <value> refers to the kind of mouse used when X is running. 
The options here are the same as the MOUSETYPE setting in this same file.

DEVICE=<value>, where <value> is the mouse device.

In addition, /dev/mouse is a symbolic link that points to the actual mouse device.

/etc/sysconfig/named
The /etc/sysconfig/named file is used to pass arguments to the named daemon at boot time. The named daemon is a 
Domain Name System (DNS) server which implements the Berkeley Internet Name Domain (BIND) version 9 distribution. 
This server maintains a table of which hostnames are associated with IP addresses on the network.

Currently, only the following values may be used:

ROOTDIR="</some/where>", where </some/where> refers to the full directory path of a configured chroot environment 
under which named will run. This chroot environment must first be configured. Type info chroot for more information on how to do this.

OPTIONS="<value>", where <value> any option listed in the man page for named except -t. In place of -t, use the 
ROOTDIR line above instead.

For more information about what parameters you can use in this file, type man named. For detailed information on how 
to configure a BIND DNS server, see Chapter 16. By default, the file contains no parameters.

/etc/sysconfig/netdump
The /etc/sysconfig/netdump file is the configuration file for the /etc/init.d/netdump service. The netdump service sends 
both oops data and memory dumps over the network. In general, netdump is not a required service, so you should only run it 
if you absolutely need to. For more information about what parameters you can use in this file, type man netdump.

/etc/sysconfig/network
The /etc/sysconfig/network file is used to specify information about the desired network configuration. 
The following values may be used:

NETWORKING=<value>, where <value> is one of the following boolean values:

yes - Networking should be configured.

no - Networking should not be configured.

HOSTNAME=<value>, where <value> should be the Fully Qualified Domain Name (FQDN), such as hostname.domain.com, 
but can be whatever hostname you want.

 Note 
  For compatibility with older software that people might install (such as trn), the /etc/HOSTNAME file should contain the same value as here.
 

GATEWAY=<value>, where <value> is the IP address of the network's gateway.

GATEWAYDEV=<value>, where <value> is the gateway device, such as eth0.

NISDOMAIN=<value>, where <value> is the NIS domain name.

/etc/sysconfig/ntpd
The /etc/sysconfig/ntpd file is used to pass arguments to the ntpd daemon at boot time. The ntpd daemon sets and 
maintains the system clock to synchronize with an Internet standard time server. It implements version 4 of the Network Time Protocol (NTP). 
For more information about what parameters you can use in this file, point a browser at the following file: 
/usr/share/doc/ntp-<version>/ntpd.htm (where <version> is the version number of ntpd). By default, this file sets 
the owner of the ntpd process to the user ntp.

/etc/sysconfig/pcmcia
The /etc/sysconfig/pcmcia file is used to specify PCMCIA configuration information. The following values may be used:

PCMCIA=<value>, where <value> is one of the following:

yes - PCMCIA support should be enabled.

no - PCMCIA support should not be enabled.

PCIC=<value>, where <value> is one of the following:

i82365 - The computer has an i82365-style PCMCIA socket chipset.

tcic - The computer has a tcic-style PCMCIA socket chipset.

PCIC_OPTS=<value>, where <value> is the socket driver (i82365 or tcic) timing parameters.

CORE_OPTS=<value>, where <value> is the list of pcmcia_core options.

CARDMGR_OPTS=<value>, where <value> is the list of options for the PCMCIA cardmgr (such as -q for quiet mode; -m to look 
for loadable kernel modules in the specified directory, and so on). Read the cardmgr man page for more information.

/etc/sysconfig/radvd
The /etc/sysconfig/radvd file is used to pass arguments to the radvd daemon at boot time. The radvd daemon listens to for router 
requests and sends router advertisements for the IP version 6 protocol. This service allows hosts on a network to dynamically 
change their default routers based on these router advertisements. For more information about what parameters you can use in this file, 
type man radvd. By default, this file sets the owner of the radvd process to the user radvd.

/etc/sysconfig/rawdevices
The /etc/sysconfig/rawdevices file is used to configure raw device bindings, such as:

/dev/raw/raw1 /dev/sda1
/dev/raw/raw2 8 5

 
/etc/sysconfig/redhat-config-users
The /etc/sysconfig/redhat-config-users file is the configuration file for the graphical application, User Manager. Under Red Hat Linux 8.0 
this file is used to filter out system users such as root, daemon, or lp. This file is edited by the Preferences => Filter system 
users and groups pull-down menu in the User Manager application and should not be edited by hand. For more information on using this 
application, see the chapter called User and Group Configuration in the Official Red Hat Linux Customization Guide.

/etc/sysconfig/redhat-logviewer
The /etc/sysconfig/redhat-logviewer file is the configuration file for the graphical, interactive log viewing application, 
Log Viewer. This file is edited by the Edit => Preferences pull-down menu in the Log Viewer application and should not be edited 
by hand. For more information on using this application, see the chapter called Log Files in the Official Red Hat Linux Customization Guide.

/etc/sysconfig/samba
The /etc/sysconfig/samba file is used to pass arguments to the smbd and the nmbd daemons at boot time. The smbd daemon offers 
file sharing connectivity for Windows clients on the network. The nmbd daemon offers NetBIOS over IP naming services. 
For more information about what parameters you can use in this file, type man smbd. By default, this file sets smbd and nmbd to run in daemon mode.

/etc/sysconfig/sendmail
The /etc/sysconfig/sendmail file allows messages to be sent to one or more recipients, routing the message over whatever 
networks are necessary. The file sets the default values for the Sendmail application to run. Its default values are to run 
as a background daemon, and to check its queue once an hour in case something has backed up.

The following values may be used:

DAEMON=<value>, where <value> is one of the following boolean values:

yes - Sendmail should be configured to listen to port 25 for incoming mail. yes implies the use of Sendmail's -bd options.

no - Sendmail should not be configured to listen to port 25 for incoming mail.

QUEUE=1h which is given to Sendmail as -q$QUEUE. The -q option is not given to Sendmail if /etc/sysconfig/sendmail exists 
and QUEUE is empty or undefined.

/etc/sysconfig/soundcard
The /etc/sysconfig/soundcard file is generated by sndconfig and should not be modified. The sole use of this file is to 
determine what card entry in the menu to pop up by default the next time sndconfig is run. Sound card configuration information 
is located in the /etc/modules.conf file.

It may contain the following:

CARDTYPE=<value>, where <value> is set to, for example, SB16 for a Soundblaster 16 sound card.

/etc/sysconfig/squid
The /etc/sysconfig/squid file is used to pass arguments to the squid daemon at boot time. The squid daemon is a proxy caching server 
for Web client applications. For more information on configuring a squid proxy server, use a Web browser to open the 
/usr/share/doc/squid-<version>/ directory (replace <version> with the squid version number installed on your system). By default, 
this file sets squid top start in daemon mode and sets the amount of time before it shuts itself down.

/etc/sysconfig/tux
The /etc/sysconfig/tux file is the configuration file for the Red Hat Content Accelerator (formerly known as TUX), 
the kernel-based web server. For more information on configuring the Red Hat Content Accelerator, use a Web browser to open 
the /usr/share/doc/tux-<version>/tux/index.html (replace <version> with the version number of TUX installed on your system). 
The parameters available for this file are listed in /usr/share/doc/tux-<version>/tux/parameters.html.

/etc/sysconfig/ups
The /etc/sysconfig/ups file is used to specify information about any Uninterruptible Power Supplies (UPS) connected to your system. 
A UPS can be very valuable for a Red Hat Linux system because it gives you time to correctly shut down the system in the case 
of power interruption. The following values may be used:

SERVER=<value>, where <value> is one of the following:

yes - A UPS device is connected to your system.

no - A UPS device is not connected to your system.

MODEL=<value>, where <value> must be one of the following or set to NONE if no UPS is connected to the system:

apcsmart - For a APC SmartUPST or similar device.

fentonups - For a Fenton UPST.

optiups - For an OPTI-UPST device.

bestups - For a Best PowerT UPS.

genericups - For a generic brand UPS.

ups-trust425+625 - For a TrustT UPS.

DEVICE=<value>, where <value> specifies where the UPS is connected, such as /dev/ttyS0.

OPTIONS=<value>, where <value> is a special command that needs to be passed to the UPS.

/etc/sysconfig/vncservers
The /etc/sysconfig/vncservers file configures the way the Virtual Network Computing (VNC) server starts up.

VNC is a remote display system which allows you to view a desktop environment not only on the machine where it is running 
but across different networks on a variety of architectures.

It may contain the following:

VNCSERVERS=<value>, where <value> is set to something like "1:fred", to indicate that a VNC server should be started for user fred 
on display :1. User fred must have set a VNC password using vncpasswd before attempting to connect to the remote VNC server.

Note that when you use a VNC server, your communication with it is unencrypted, and so it should not be used on an untrusted network. 
For specific instructions concerning the use of SSH to secure the VNC communication, please read the information found at 
http://www.uk.research.att.com/vnc/sshvnc.html. To find out more about SSH, see Chapter 9 or Official Red Hat Linux Customization Guide.

/etc/sysconfig/xinetd
The /etc/sysconfig/xinetd file is used to pass arguments to the xinetd daemon at boot time. 
The xinetd daemon starts programs that provide Internet services when a request to the port for that service 
is received. For more information about what parameters you can use in this file, type man xinetd. 
For more information on the xinetd service, see the Section called Access Control Using xinetd in Chapter 8.

Directories in the /etc/sysconfig/ Directory
The following directories are normally found in /etc/sysconfig/ and a basic description of what they contain:

apm-scripts - This contains the Red Hat APM suspend/resume script. You should not edit this file directly. If you need customization, 
simple create a file called /etc/sysconfig/apm-scripts/apmcontinue and it will be called at the end of the script. Also, you can control 
the script by editing /etc/sysconfig/apmd.

cbq - This directory contains the configuration files needed to do Class Based Queuing for bandwidth management on network interfaces.

networking - This directory is used by the Network Administration Tool (redhat-config-network) and its contents should not be edited manually. 
For more information about configuring network interfaces using the Network Administration Tool, see the chapter called 
Network Configuration in the Official Red Hat Linux Customization Guide.

network-scripts - This directory contains the following network-related configuration files:

Network configuration files for each configured network interface, such as ifcfg-eth0 for the eth0 Ethernet interface.

Scripts used to bring up and down network interfaces, such as ifup and ifdown.

Scripts used to bring up and down ISDN interfaces, such as ifup-isdn and ifdown-isdn

Various shared network function scripts which should not be edited directly.

For more information on the network-scripts directory, see Chapter 12

rhn - This directory contains the configuration files and GPG keys for the Red Hat Network. No files in this directory should be edited 
by hand. For more information on the Red Hat Network, see the Red Hat Network website at the following URL: https://rhn.redhat.com.


AIX kernel parameters:
---------------------

Througout this document, you can find many AIX kernel parameter statements.
Most commands are related to retrieving or changing attributes on the sys0 object.


For example, take a look at the following example:

  maxuproc:    Specifies the maximum number of processes per user ID. 
  Values:      Default: 40; Range: 1 to 131072 
  Display:     lsattr -E -l sys0 -a maxuproc 
  Change:      chdev -l sys0 -a maxuproc=NewValue 
               Change takes effect immediately and is preserved over boot. If value is reduced, 
               then it goes into effect only after a system boot. 
  Diagnosis:   Users cannot fork any additional processes. 
  Tuning:      This is a safeguard to prevent users from creating too many processes. 


Kernel Tunable Parameters
Following are kernel parameters, grouped into the following sections:

-Scheduler and Memory Load Control Tunable Parameters 
-Virtual Memory Manager Tunable Parameters 
-Synchronous I/O Tunable Parameters 
-Asynchronous I/O Tunable Parameters 
-Disk and Disk Adapter Tunable Parameters 
-Interprocess Communication Tunable Parameters
-Scheduler and Memory Load Control Tunable Parameters
-Most of the scheduler and memory load control tunable parameters are fully described in the schedo man page. 
-The following are a few other related parameters:


==========================================================
>>>> Some important HPUX filesystem related kernel params:
==========================================================


nfile:
------

nfile defines the maximum number of files that can be open simultaneously, system-wide, at any given time.

Acceptable Values:
Minimum 
14 
Maximum 
Memory limited 
Default 
((16*(Nproc+16+MaxUsers)/10)+32+2*(Npty+Nstrpty) 

Specify integer value or use integer formula expression. For more information, see Specifying Parameter Values.

Description
nfile defines the maximum number files that can be open at any one time, system-wide.
It is the number of slots in the file descriptor table. Be generous with this number because the required memory 
is minimal, and not having enough slots restricts system processing capacity.

Related Parameters and System Factors
The value used for nfile must be sufficient to service the number of users and processes allowed by the combination 
of nproc, maxusers, npty , and nstrpty.

Every process uses at least three file descriptors per process (standard input, standard output, 
and standard error).

Every process has two pipes per process (one per side), each of which requires a pty. Stream pipes also use s
treams ptys which are limited by nstrpty.


Other HP-UX kernel parameters:
==============================

Take especially notice of the parameters nfile, nflocks, ninodes, nprocs.
They determine how many open files, open locks, simultaneous processes are possible *system-wide*.
Too low values may result in HP-UX errors when dealing with larger databases, huge App Servers
and the like.

Entering Values: 
 
Use the kcweb web interface or the kmtune command to view and change values. kcweb is described 
in the kcweb(1M) manpage and in the program's help topics. You can run kcweb from the command line 
or from the System Administration Manager (SAM); see sam(1M). You run kmtune from the command line; 
see kmtune(1M) for details.


Accounting
 acctresume Resume accounting when free space on the file system where accounting log files reside rises above acctresume plus minfree percent of total usable file system size. Manpage: acctsuspend(5).
 
Accounting
 acctsuspend
 Suspend accounting when free space on the file system where accounting log files reside drops below acctsuspend plus minfree percent of total usable file system size. Manpage: acctsuspend(5).
 
Asynchronous I/O
 aio_listio_max
 Maximum number of POSIX asynchronous I/O operations allowed in a single lio_listio() call. Manpage: aio_listio_max(5).
 
Asynchronous I/O
 aio_max_ops
 System-wide maximum number of POSIX asynchronous I/O operations allowed at one time. Manpage: aio_max_ops(5).
 
Asynchronous I/O
 aio_physmem_pct
 Maximum percentage of total system memory that can be locked for use in POSIX asynchronous I/O operations. Manpage: aio_physmem_pct(5).
 
Asynchronous I/O
 aio_prio_delta_max
 Maximum priority offset (slowdown factor) allowed in a POSIX asynchronous I/O control block (aiocb). Manpage: aio_prio_delta_max(5).
 
Memory Paging
 allocate_fs_swapmap
 Enable or disable preallocation of file system swap space when swapon() is called as opposed to allocating swap space when malloc() is called. Enabling allocation reduces risk of insufficient swap space and is used primarily where high availability is important. Manpage: allocate_fs_swapmap(5).
 
Kernel Crash Dump
 alwaysdump
 Select which classes of system memory pages are to be dumped if a kernel panic occurs. Manpage: alwaysdump(5).
 
Spinlock Pool
 bufcache_hash_locks
 Buffer-cache spinlock pool. NO MANPAGE. 
 
File System: Buffer
 bufpages
 Number of 4 KB pages in file system static buffer cache. Manpage: bufpages(5).
 
Spinlock Pool
 chanq_hash_locks
 Channel queue spinlock pool. Manpage: chanq_hash_locks(5).
 
IPC: Share
 core_addshmem_read
 Flag to include readable shared memory in a process core dump. Manpage: core_addshmem_read(5).
 
IPC: Share
 core_addshmem_write
 Flag to include read/write shared memory in a process core dump. Manpage: core_addshmem_write(5).
 
Miscellaneous: Links
 create_fastlinks
 Create fast symbolic links using a newer, more efficient format to improve access speed by reducing disk block accesses during path name look-up sequences. Manpage: create_fastlinks(5).
 
File System: Buffer
 dbc_max_pct
 Maximum percentage of memory for dynamic buffer cache. Manpage: dbc_max_pct(5).
 
File System: Buffer
 dbc_min_pct
 Minimum percentage of memory for dynamic buffer cache. Manpage: dbc_min_pct(5).
 
Miscellaneous: Disk I/O
 default_disk_ir
 Immediate reporting for disk writes; whether a write() returns immediately after the data is placed in the disk's write buffer or waits until the data is physically stored on the disk media. Manpage: default_disk_ir(5).
 
File System: Buffer
 disksort_seconds
 Maximum wait time for disk requests. NO MANPAGE.
 
Miscellaneous: Disk I/O
 dma32_pool_size
 Amount of memory to set aside for 32-bit DMA (bytes). Manpage: dma32_pool_size(5).
 
Spinlock Pool
 dnlc_hash_locks
 Number of locks for directory cache synchronization. NO MANPAGE.
 
Kernel Crash Dump
 dontdump
 Select which classes of system memory pages are not to be dumped if a kernel panic occurs. Manpage: dontdump(5).
 
Miscellaneous: Clock
 dst
 Enable/disable daylight savings time. Manpage: timezone(5).
 
Miscellaneous: IDS
 enable_idds
 Flag to enable the IDDS daemon, which gathers data for IDS/9000. Manpage: enable_idds(5).
 
Miscellaneous: Memory
 eqmemsize
 Number of pages of memory to be reserved for equivalently mapped memory, used mostly for DMA transfers. Manpage: eqmemsize(5).
 
ProcessMgmt: Process
 executable_stack
 Allows or denies program execution on the stack. Manpage: executable_stack(5).
 
File System: Write
 fs_async
 Enable/disable asynchronous writes of file system data structures to disk. Manpage: fs_async(5).
 
Spinlock Pool
 ftable_hash_locks
 File table spinlock pool. NO MANPAGE. 
 
Spinlock Pool
 hdlpreg_hash_locks
 Set the size of the pregion spinlock pool. Manpage: hdlpreg_hash_locks(5).
 
File System: Read
 hfs_max_ra_blocks
 The maximum number of read-ahead blocks that the kernel may have outstanding for a single HFS file system. Manpage: hfs_max_ra_blocks(5).
 
File System: Read
 hfs_max_revra_blocks
 The maximum number of reverse read-ahead blocks that the kernel may have outstanding for a single HFS file system. Manpage: hfs_max_revra_blocks(5).
 
File System: Read
 hfs_ra_per_disk
 The amount of HFS file system read-ahead per disk drive, in KB. Manpage: hfs_ra_per_disk(5).
 
File System: Read
 hfs_revra_per_disk
 The amount of memory (in KB) for HFS reverse read-ahead operations, per disk drive. Manpage: hfs_revra_per_disk(5).
 
File System: Read
 hp_hfs_mtra_enabled
 Enable or disable HFS multithreaded read-ahead. NO MANPAGE.
 
Kernel Crash Dump
 initmodmax
 Maximum size of the dump table of dynamically loaded kernel modules. Manpage: initmodmax(5).
 
Spinlock Pool
 io_ports_hash_locks I/O port spinlock pool. NO MANPAGE.  
Miscellaneous: Queue
 ksi_alloc_max
 Maximum number of system-wide queued signals that can be allocated. Manpage: ksi_alloc_max(5).
 
Miscellaneous: Queue
 ksi_send_max
 Maximum number of queued signals that a process can send and have pending at one or more receivers. Manpage: ksi_send_max(5).
 
ProcessMgmt: Memory
 maxdsiz
 Maximum process data storage segment space that can be used for statics and strings, as well as dynamic data space allocated by sbrk() and malloc() (32-bit processes). Manpage: maxdsiz(5).
 
ProcessMgmt: Memory
 maxdsiz_64bit
 Maximum process data storage segment space that can be used for statics and strings, as well as dynamic data space allocated by sbrk() and malloc() (64-bit processes). Manpage: maxdsiz(5).
 
File System: Open/Lock
 maxfiles
 Soft limit on how many files a single process can have opened or locked at any given time. Manpage: maxfiles(5).
 
File System: Open/Lock
 maxfiles_lim
 Hard limit on how many files a single process can have opened or locked at any given time. Manpage: maxfiles_lim(5).
 
ProcessMgmt: Memory
 maxrsessiz
 Maximum size (in bytes) of the RSE stack for any user process on the IPF platform. Manpage: maxrsessiz(5).
 
ProcessMgmt: Memory
 maxrsessiz_64bit
 Maximum size (in bytes) of the RSE stack for any user process on the IPF platform. Manpage: maxrsessiz(5).
 
ProcessMgmt: Memory
 maxssiz
 Maximum dynamic storage segment (DSS) space used for stack space (32-bit processes). Manpage: maxssiz(5).
 
ProcessMgmt: Memory
 maxssiz_64bit
 Maximum dynamic storage segment (DSS) space used for stack space (64-bit processes). Manpage: maxssiz(5).
 
ProcessMgmt: Memory
 maxtsiz
 Maximum allowable process text segment size, used by unchanging executable-code (32-bit processes). Manpage: maxtsiz(5).
 
ProcessMgmt: Memory
 maxtsiz_64bit
 Maximum allowable process text segment size, used by unchanging executable-code (64-bit processes). Manpage: maxtsiz(5).
 
ProcessMgmt: Process
 maxuprc
 Maximum number of processes that any single user can have running at the same time, including login shells, user interface processes, running programs and child processes, I/O processes, etc. If a user is using multiple, simultaneous logins under the same login name (user ID) as is common in X Window, CDE, or Motif environments, all processes are combined, even though they may belong to separate process groups. Processes that detach from their parent process group, where that is possible, are not counted after they detach (line printer spooler jobs, certain specialized applications, etc.). Manpage: maxuprc(5).
 
Miscellaneous: Users
 maxusers
 Maximum number of users expected to be logged in on the system at one time; used by other system parameters to allocate system resources. Manpage: maxusers(5).
 
File System: LVM
 maxvgs
 Maximum number of volume groups configured by the Logical Volume Manager on the system. Manpage: maxvgs(5).
 
Accounting
 max_acct_file_size
 Maximum size of the accounting file. Manpage: max_acct_file_size(5).
 
Asynchronous I/O
 max_async_ports
 System-wide maximum number of ports to the asynchronous disk I/O driver that processes can have open at any given time. Manpage: max_async_ports(5).
 
Memory Paging
 max_mem_window
 Maximum number of group-private 32-bit shared memory windows. Manpage: max_mem_window(5).
 
ProcessMgmt: Threads
 max_thread_proc
 Maximum number of threads that any single process can create and have running at the same time. Manpage: max_thread_proc(5).
 
IPC: Message
 mesg
 Enable or disable IPC messages at system boot time. Manpage: mesg(5).
 
Kernel Crash Dump
 modstrmax
 Maximum size, in bytes, of the savecrash kernel module table that contains module names and their locations in the file system. Manpage: modstrmax(5).
 
IPC: Message
 msgmap
 Size of free-space resource map for allocating shared memory space for messages. Manpage: msgmap(5).
 
IPC: Message
 msgmax
 System-wide maximum size (in bytes) for individual messages. Manpage: msgmax(5).
 
IPC: Message
 msgmnb
 Maximum combined size (in bytes) of all messages that can be queued simultaneously in a message queue. Manpage: msgmnb(5).
 
IPC: Message
 msgmni
 Maximum number of message queues allowed on the system at any given time. Manpage: msgmni(5).
 
IPC: Message
 msgseg
 Maximum number of message segments that can exist on the system. Manpage: msgseg(5).
 
IPC: Message
 msgssz
 Message segment size in bytes. Manpage: msgssz(5).
 
IPC: Message
 msgtql
 Maximum number of messages that can exist on the system at any given time. Manpage: msgtql(5).
 
File System: Buffer
 nbuf
 System-wide number of static file system buffer and cache buffer headers. Manpage: nbuf(5).
 
Miscellaneous: CD
 ncdnode
 Maximum number of entries in the vnode table and therefore the maximum number of open CD-ROM file system nodes that can be in memory. Manpage: ncdnode(5).
 
Miscellaneous: Terminal
 nclist
 Maximum number of cblocks available for data transfers through tty and pty devices. Manpage: nclist(5).
 
File System: Open/Lock
 ncsize
 Inode space needed for directory name lookup cache (DNLC). NO MANPAGE.
 
File System: Open/Lock
 nfile
 Maximum number of files that can be open simultaneously on the system at any given time. Manpage: nfile(5).
 
File System: Open/Lock
 nflocks
 Maximum combined number of file locks that are available system-wide to all processes at one time. Manpage: nflocks(5).
 
File System: Open/Lock
 ninode
 Maximum number of open inodes that can be in memory. Manpage: ninode(5).
 
ProcessMgmt: Threads
 nkthread
 Maximum number of kernel threads allowed on the system at the same time. Manpage: nkthread(5).
 
ProcessMgmt: Process
 nproc
 Defines the maximum number of processes that can be running simultaneously on the entire system, including remote execution processes initiated by other systems via remsh or other networking commands. Manpage: nproc(5).
 
Miscellaneous: Terminal
 npty
 Maximum number of pseudo-tty entries allowed on the system at any one time. Manpage: npty(5).
 
Streams
 NSTREVENT
 Maximum number of outstanding streams bufcalls that are allowed to exist at any given time on the system. This number should be equal to or greater than the maximum bufcalls that can be generated by the combined total modules pushed onto any given stream, and serves to limit run-away bufcalls. Manpage: nstrevent(5).
 
Miscellaneous: Terminal
 nstrpty
 System-wide maximum number of streams-based pseudo-ttys that are allowed on the system. Manpage: nstrpty(5).
 
Streams
 nstrpty
 System-wide maximum number of streams-based pseudo-ttys that are allowed on the system. Manpage: nstrpty(5).
 
Streams
 NSTRPUSH
 Maximum number of streams modules that are allowed to exist in any single stream at any one time on the system. This provides a mechanism for preventing a software defect from attempting to push too many modules onto a stream, but it is not intended as adequate protection against malicious use of streams. Manpage: nstrpush(5).
 
Streams
 NSTRSCHED
 Maximum number of streams scheduler daemons that are allowed to run at any given time on the system. This value is related to the number of processors installed in the system. Manpage: nstrsched(5).
 
Miscellaneous: Terminal
 nstrtel
 Number of telnet session device files that are available on the system. Manpage: nstrtel(5).
 
Memory Paging
 nswapdev
 Maximum number of devices, system-wide, that can be used for device swap. Set to match actual system configuration. Manpage: nswapdev(5).
 
Memory Paging
 nswapfs
 Maximum number of mounted file systems, system-wide, that can be used for file system swap. Set to match actual system configuration. Manpage: nswapfs(5).
 
Miscellaneous: Memory
 nsysmap
 Number of entries in the kernel dynamic memory virtual address space resource map (32-bit processes). Manpage: nsysmap(5).
 
Miscellaneous: Memory
 nsysmap64
 Number of entries in the kernel dynamic memory virtual address space resource map (64-bit processes). Manpage: nsysmap(5).
 
Miscellaneous: Disk I/O
 o_sync_is_o_dsync
 Specifies whether an open() or fcntl() with the O_SYNC flag set can be converted to the same call with the O_DSYNC flag instead. This controls whether the function can return before updating the file access. NO MANPAGE.
 
ProcessMgmt: Memory
 pa_maxssiz_32bit
 Maximum size (in bytes) of the stack for a user process running under the PA-RISC emulator on IPF. Manpage: pa_maxssiz(5).
 
ProcessMgmt: Memory
 pa_maxssiz_64bit
 Maximum size (in bytes) of the stack for a user process running under the PA-RISC emulator on IPF. Manpage: pa_maxssiz(5).
 
Spinlock Pool
 pfdat_hash_locks
 Pfdat spinlock pool. Manpage: pfdat_hash_locks(5).
 
Miscellaneous: Disk I/O
 physical_io_buffers
 Total buffers for physical I/O operations. Manpage: physical_io_buffers(5).
 
Spinlock Pool
 region_hash_locks
 Process-region spinlock pool. Manpage: region_hash_locks(5).
 
Memory Paging
 remote_nfs_swap
 Enable or disable swap to mounted remote NFS file system. Used on cluster clients for swapping to NFS-mounted server file systems. Manpage: remote_nfs_swap(5).
 
Miscellaneous: Schedule
 rtsched_numpri
 Number of distinct real-time interrupt scheduling priority levels are available on the system. Manpage: rtsched_numpri(5).
 
Miscellaneous: Terminal
 scroll_lines
 Defines the number of lines that can be scrolled on the internal terminal emulator (ITE) system console. Manpage: scroll_lines(5).
 
File System: SCSI
 scsi_maxphys
 Maximum record size for the SCSI I/O subsystem, in bytes. Manpage: scsi_maxphys(5).
 
File System: SCSI
 scsi_max_qdepth
 Maximum number of SCSI commands queued up for SCSI devices. Manpage: scsi_max_qdepth(5).
 
ProcessMgmt: Process
 secure_sid_scripts
 Controls whether setuid and setgid bits on scripts are honored. Manpage: secure_sid_scripts(5).
 
IPC: Semaphore
 sema
 Enable or disable IPC semaphores at system boot time. Manpage: sema(5).
 
IPC: Semaphore
 semaem
 Maximum value by which a semaphore can be changed in a semaphore "undo" operation. Manpage: semaem(5).
 
IPC: Semaphore
 semmni
 Maximum number of sets of IPC semaphores allowed on the system at any one time. Manpage: semmni(5).
 
IPC: Semaphore
 semmns
 Maximum number of individual IPC semaphores available to system users, system-wide. Manpage: semmns(5).
 
IPC: Semaphore
 semmnu
 Maximum number of processes that can have undo operations pending on any given IPC semaphore on the system. Manpage: semmnu(5).
 
IPC: Semaphore
 semmsl
 Maximum number of individual System V IPC semaphores per semaphore identifier. Manpage: semmsl(5).
 
IPC: Semaphore
 semume
 Maximum number of IPC semaphores that a given process can have undo operations pending on. Manpage: semume(5).
 
IPC: Semaphore
 semvmx
 Maximum value any given IPC semaphore is allowed to reach (prevents undetected overflow conditions). Manpage: semvmx(5).
 
Miscellaneous: Web
 sendfile_max
 The amount of buffer cache that can be used by the sendfile() system call on HP-UX web servers. Manpage: sendfile_max(5).
 
IPC: Share
 shmem
 Enable or disable shared memory at system boot time. Manpage: shmem(5).
 
IPC: Share
 shmmax
 Maximum allowable shared memory segment size (in bytes). Manpage: shmmax(5).
 
IPC: Share
 shmmni
 Maximum number of shared memory segments allowed on the system at any given time. Manpage: shmmni(5).
 
IPC: Share
 shmseg
 Maximum number of shared memory segments that can be attached simultaneously to any given process. Manpage: shmseg(5).
 
Streams
 STRCTLSZ
 Maximum number of control bytes allowed in the control portion of any streams message on the system. Manpage: strctlsz(5).
 
Streams
 streampipes
 Force all pipes to be streams-based. Manpage: streampipes(5).
 
Streams
 STRMSGSZ
 Maximum number of bytes that can be placed in the data portion of any streams message on the system. Manpage: strmsgsz(5).
 
File System: SCSI
 st_ats_enabled
 Flag whether to reserve a tape device on open. Manpage: st_ats_enabled(5).
 
File System: SCSI
 st_fail_overruns
 SCSI tape read resulting in data overrun causes failure. Manpage: st_fail_overruns(5).
 
File System: SCSI
 st_large_recs
 Enable large record support for SCSI tape. Manpage: st_large_recs(5).
 
Memory Paging
 swapmem_on
 Enable or disable pseudo-swap allocation. This allows systems with large installed memory to allocate memory space as well as disk swap space for virtual memory use instead of restricting availability to defined disk swap area. Manpage: swapmem_on(5).
 
Memory Paging
 swchunk
 Amount of space allocated for each chunk of swap area. Chunks are allocated from device to device by the kernel. Changing this parameter requires extensive knowledge of system internals. Without such knowledge, do not change this parameter from the normal default value. Manpage: swchunk(5).
 
Spinlock Pool
 sysv_hash_locks
 System V interprocess communication spinlock pool. Manpage: sysv_hash_locks(5).
 
Miscellaneous: Network
 tcphashsz
 TCP hash table size, in bytes. Manpage: tcphashsz(5).
 
ProcessMgmt: CPU
 timeslice
 Maximum time a process can use the CPU until it is made available to the next process having the same process execution priority. This feature also prevents runaway processes from causing system lock-up. Manpage: timeslice(5).
 
Miscellaneous: Clock
 timezone
 The offset between the local time zone and Coordinated Universal Time (UTC), often called Greenwich Mean Time or GMT. Manpage: timezone(5).
 
Miscellaneous: Memory
 unlockable_mem
 Amount of system memory to be reserved for system overhead and virtual memory management, that cannot be locked by user processes. Manpage: unlockable_mem(5).
 
Spinlock Pool
 vnode_cd_hash_locks
 Vnode clean/dirty spinlock pool. NO MANPAGE. 
 
Spinlock Pool
 vnode_hash_locks
 Vnode spinlock pool. NO MANPAGE. 
 
Memory Paging: Size
 vps_ceiling
 Maximum system-selected page size (in KB) if the user does not specify a page size. Manpage: vps_ceiling(5).
 
Memory Paging: Size
 vps_chatr_ceiling
 Maximum page size a user can specify with the chatr command in a program. Manpage: vps_chatr_ceiling(5).
 
Memory Paging: Size
 vps_pagesize
 Minimum user page size (in KB) if no page size is specified using chatr. Manpage: vps_pagesize(5).
 
File System: Journaled
 vxfs_max_ra_kbytes
 Maximum amount of read-ahead data, in KB, that the kernel may have outstanding for a single VxFS file system. Manpage: vxfs_max_ra_kbytes(5).
 
File System: Read
 vxfs_max_ra_kbytes
 Maximum amount of read-ahead data, in KB, that the kernel may have outstanding for a single VxFS file system. Manpage: vxfs_max_ra_kbytes(5).
 
File System: Journaled
 vxfs_ra_per_disk
 Maximum amount of VxFS file system read-ahead per disk, in KB. Manpage: vxfs_ra_per_disk(5).
 
File System: Read
 vxfs_ra_per_disk
 Maximum amount of VxFS file system read-ahead per disk, in KB. Manpage: vxfs_ra_per_disk(5).
 
File System: Journaled
 vx_fancyra_enable
 Enable or disable VxFS file system read-ahead. NO MANPAGE.
 
File System: Journaled
 vx_maxlink
 Number of subdirectories created within a directory. NO MANPAGE.
 
File System: Journaled
 vx_ncsize
 Memory space reserved for VxFS directory path name cache. Manpage: vx_ncsize(5).
 
File System: Journaled
 vx_ninode
 Number of entries in the VxFS inode table. NO MANPAGE
 
 
==========================================================
>>>> Some important Solaris kernel params:
==========================================================


/etc/system


-- Process Sizing Tunables
Several variables are used to control the number of processes that are available on the system and the number of processes 
that an individual user can create. The foundation variable is maxusers, which drives the values assigned to max_nprocs and maxuprc.

-- maxusers
Description Originally, maxusers defined the number of logged in users the system could support. Various tables 
were sized based on this setting when a kernel was generated. Now, the Solaris release does much of its sizing based 
on the amount of memory on the system, so much of the past use of maxusers has changed. 
There are still a number of subsystems that are derived from maxusers: 

The maximum number of processes on the system

The number of quota structures held in the system

The size of the directory name lookup cache (DNLC)
 
Data Type Signed integer
 
Default Lesser of the amount of memory in Mbytes and 2048
 
Range 1 to 2048, based on physical memory if not set in the /etc/system file.

1 to 4096, if set in the /etc/system file.
 
Units Users
 
Dynamic? No. After computation of dependent variables is done, maxusers is never referenced again.
 
Validation None
 
When to Change When the default number of user processes derived by the system is too low. 
This situation is seen by the following message that displays on the system console:out of processes

When the default number of processes is too high:

Database servers that have a lot of memory and relatively few running processes, can save system memory by reducing the default value of maxusers.

File servers that have a lot of memory and few running processes can reduce this value, but should explicitly set the size of the DNLC. (See ncsize.)

Compute servers that have a lot of memory and few running processes can reduce this value.
 
Commitment Level Unstable
 

General Parameters
This section describes general kernel parameters relating to physical memory and stack size.

physmem 
Description 
Modifies the system's idea of the number of physical pages of memory after the OS and firmware are accounted for. 

Data Type 
Unsigned long

Default 
Number of usable pages of physical memory available on the system�not counting the memory where the core kernel and data are stored.

Range 
1 to amount of physical memory on system

Units 
Pages

Dynamic? 
No

Validation 
None

When to Change 
Whenever you want to test the effect of running with less physical memory. Note that because this parameter does not take 
into account the memory used by the core kernel and data as well as various other data structures allocated early in the 
startup process, the value of physmem should be less than the actual number of pages that represent the smaller amount of memory.

Commitment Level 
Unstable

lwp_default_stksize 
Description 
Default value of size of stack to be used when a kernel thread is created, and the calling routine does not provide an explicit size to be used. 

Data Type 
Integer

Default 
8192 for all 32-bit SPARC and IA based platforms

16,384 for 64-bit sun4u platforms

Range 
0 to 262,144

Units 
Bytes in multiples of the value returned by getpagesize(3C).

Dynamic? 
Yes. Affects threads created after the variable is changed.

Validation 
Must be greater than or equal to 8192 and less than or equal to 262,144 (256 x 1024) and must be a multiple of the system page size. 
If these conditions are not met, the following message is displayed:


Illegal stack size, Using N 

The value of N is the default described above.

When to Change 
When the system panics because it has run out of stack space. The best solution for this problem is to determine why the system 
is running out of space and make a correction. Increasing the default stack size means that almost every kernel thread 
will have a larger stack, resulting in increased kernel memory consumption for no good reason, because that space will generally be unused. 
The increased consumption means that other resources competing for the same pool of memory will have the amount of space available 
to them reduced, possibly decreasing the system's ability to perform work. Among the side effects will be a reduction in the number 
of threads which the kernel can create. This solution should be treated as no more than an interim workaround until the root cause is remedied.

Commitment Level 
Unstable

logevent_max_q_sz 
Description 
Maximum number of system events allowed to be queued waiting for delivery to the syseventd daemon. Once the size of the system 
event queue reaches this limit, no other system events will be allowed on the queue. 

Data Type 
Integer

Default 
2000

Range 
0 to MAXINT

Units 
System events

Dynamic? 
Yes

Validation 
The sysevent framework checks this value every time a system event is generated by ddi_log_sysevent(9F) and sysevent_post_event(3SYSEVENT).

When to Change 
When error log messages indicate that a system event failed to be logged, generated, or posted.


==========================================================
>>>> Some important AIX kernel params:
==========================================================


- Kernel Tunable Parameters
Following are kernel parameters, grouped into the following sections:

- Scheduler and Memory Load Control Tunable Parameters:

Virtual Memory Manager Tunable Parameters 
Synchronous I/O Tunable Parameters 
Asynchronous I/O Tunable Parameters 
Disk and Disk Adapter Tunable Parameters 
Interprocess Communication Tunable Parameters
Scheduler and Memory Load Control Tunable Parameters
Most of the scheduler and memory load control tunable parameters are fully described in the schedo man page. 
The following are a few other related parameters:

- maxuproc 
Purpose: Specifies the maximum number of processes per user ID. 
Values: Default: 40; Range: 1 to 131072 
Display: lsattr -E -l sys0 -a maxuproc 
Change: chdev -l sys0 -a maxuproc=NewValue 
Change takes effect immediately and is preserved over boot. If value is reduced, then it goes into effect 
only after a system boot. 
Diagnosis: Users cannot fork any additional processes. 
Tuning: This is a safeguard to prevent users from creating too many processes. 

- ncargs 
Purpose: Specifies the maximum allowable size of the ARG/ENV list (in 4KB blocks) when running exec() subroutines. 
Values: Default: 6; Range: 6 to 1024 
Display: lsattr -E -l sys0 -a ncargs 
Change: chdev -l sys0 -a ncargs=NewValue 
Change takes effect immediately and is preserved over boot. 
Diagnosis: Users cannot execute any additional processes because the argument list passed to the exec() 
system call is too long. A low default value might cause some programs to fail with the arg list too long 
error message, in which case you might try increasing the ncargs value with the chdev command above and then 
rerunning the program. 
Tuning: This is a mechanism to prevent the exec() subroutines from failing if the argument list 
is too long. Please note that tuning to a higher ncargs value puts additional constraints on system memory resources. 
 

- Virtual Memory Manager Tunable Parameters:

The complete listing of the virtual memory manager tunable parameters is located in the vmo man page.

- Synchronous I/O Tunable Parameters:

Most of the synchronous I/O tunable parameters are fully described in the ioo man page. 
The following are a few other related parameters:

maxbuf Purpose: Number of (4 KB) pages in the block-I/O buffer cache. 
Values: Default: 20; Range: 20 to 1000 
Display: lsattr -E -l sys0 -a maxbuf 
Change: chdev -l sys0 -a maxbuf=NewValue 
Change is effective immediately and is permanent. If the -T flag is used, the change is immediate and lasts until 
the next boot. If the -P flag is used, the change is deferred until the next boot and is permanent. 
Diagnosis: If the sar -b command shows breads or bwrites with %rcache and %wcache being low, you might want to 
tune this parameter. 
Tuning: This parameter normally has little performance effect on systems, where ordinary I/O does not use the 
block-I/O buffer cache. 
Refer to: Tuning Asynchronous Disk I/O 

maxpout Purpose: Specifies the maximum number of pending I/Os to a file. 
Values: Default: 0 (no checking); Range: 0 to n (n should be a multiple of 4, plus 1) 
Display: lsattr -E -l sys0 -a maxpout 
Change: chdev -l sys0 -a maxpout=NewValue 
Change is effective immediately and is permanent. If the -T flag is used, the change is immediate and lasts 
until the next boot. If the -P flag is used, the change is deferred until the next boot and is permanent. 
Diagnosis: If the foreground response time sometimes deteriorates when programs with large amounts 
of sequential disk output are running, sequential output may need to be paced. 
Tuning: Set maxpout to 33 and minpout to 16. If sequential performance deteriorates unacceptably, 
increase one or both. If foreground performance is still unacceptable, decrease both. 

minpout Purpose: Specifies the point at which programs that have reached maxpout can resume writing to the file. 
Values: Default: 0 (no checking); Range: 0 to n (n should be a multiple of 4 and should be at least 4 less than maxpout) 
Display: lsattr -E -l sys0 -a minpout 
Change: chdev -l sys0 -a minpout=NewValue 
Change is effective immediately and is permanent. If the -T flag is used, the change is immediate and lasts until 
the next boot. If the -P flag is used, the change is deferred until the next boot and is permanent. 
Diagnosis: If the foreground response time sometimes deteriorates when programs with large amounts of sequential 
disk output are running, sequential output may need to be paced. 
Tuning: Set maxpout to 33 and minpout to 16. If sequential performance deteriorates unacceptably, 
increase one or both. If foreground performance is still unacceptable, decrease both. 

mount -o nointegrity Purpose: A new mount option (nointegrity) may enhance local file system performance for 
certain write-intensive applications. This optimization basically eliminates writes to the JFS log. 
Note that the enhanced performance is achieved at the expense of metadata integrity. Therefore, use this 
option with extreme caution because a system crash can make a file system mounted with this option unrecoverable. 
Nevertheless, certain classes of applications do not require file data to remain consistent after a system crash, 
and these may benefit from using the nointegrity option. Two examples in which a nointegrity file system may be 
beneficial is for compiler temporary files, and for doing a nonmigration or mksysb installation. 

Paging Space Size Purpose: The amount of disk space required to hold pages of working storage. 
Values: Default: configuration-dependent; Range: 32 MB to n MB for hd6, 16 MB to n MB for non-hd6 
Display: lsps -a mkps or chps or smitty pgsp 
Change: Change is effective immediately and is permanent. Paging space is not necessarily put into use immediately, however. 
Diagnosis: Run: lsps -a. If processes have been killed for lack of paging space, monitor the situation with the psdanger() subroutine. 
Tuning: If it appears that there is not enough paging space to handle the normal workload, add a new paging space on another physical volume or make the existing paging spaces larger. 

syncd Interval Purpose: The time between sync() calls by syncd. 
Values: Default: 60; Range: 1 to any positive integer 
Display: grep syncd /sbin/rc.boot vi /sbin/rc.boot or 
Change: Change is effective at next boot and is permanent. An alternate method is to use the kill command to terminate the syncd daemon and restart it from the command line with the command /usr/sbin/syncd interval. 
Diagnosis: I/O to a file is blocked when syncd is running. 
Tuning: At its default level, this parameter has little performance cost. No change is recommended. Significant 
reductions in the syncd interval in the interests of data integrity (as for HACMPT) could have adverse performance 
consequences. 

Asynchronous I/O Tunable Parameters
maxreqs Purpose: Specifies the maximum number of asynchronous I/O requests that can be outstanding at any one time. 
Values: Default: 4096; Range: 1 to AIO_MAX (/usr/include/sys/limits.h) 
Display: lsattr -E -l aio0 -a maxreqs 
Change: chdev -l aio0 -a maxreqs=NewValue 
Change is effective after reboot and is permanent. 
Diagnosis: N/A 
Tuning: This includes requests that are in progress, as well as those that are waiting to be started. The maximum number of asynchronous I/O requests cannot be less than the value of AIO_MAX, as defined in the /usr/include/sys/limits.h file, but can be greater. It would be appropriate for a system with a high volume of asynchronous I/O to have a maximum number of asynchronous I/O requests larger than AIO_MAX. 
Refer to: Tuning Asynchronous Disk I/O 

maxservers Purpose: Specifies the maximum number of AIO kprocs per processor. 
Values: Default: 10 per processor 
Display: lsattr -E -l aio0 -a maxservers 
Change: chdev -l aio0 -a maxservers=NewValue 
Change is effective after reboot and is permanent. 
Diagnosis: N/A 
Tuning: This value limits the number of concurrent asynchronous I/O requests. The value should be about the same as the expected number of concurrent AIO requests. This tunable parameter only affects AIO on JFS file systems (or Virtual Shared Disks (VSD) before AIX 4.3.2). 
Refer to: Tuning Asynchronous Disk I/O 

minservers Purpose: Specifies the number of AIO kprocs that will be created when the AIO kernel extension is loaded. 
Values: Default: 1 
Display: lsattr -E -l aio0 -a maxservers 
Change: chdev -l aio0 -a minservers=NewValue 
Change is effective after reboot and is permanent. 
Diagnosis: N/A 
Tuning: Making this a large number is not recommended, because each process takes up some memory. Leaving this number small is acceptable in most cases because AIO will create additional kprocs up to maxservers as needed. This tunable is only effective for AIO on JFS file systems (or VSDs before AIX 4.3.2). 
Refer to: Tuning Asynchronous Disk I/O 

Disk and Disk Adapter Tunable Parameters
Disk Adapter Outstanding-Requests Limit Purpose: Maximum number of requests that can be outstanding on a SCSI bus. (Applies only to the SCSI-2 Fast/Wide Adapter.) 
Values: Default: 40; Range: 40 to 128 
Display: lsattr -E -l scsin -a num_cmd_elems 
Change: chdev -l scsin -a num_cmd_elems=NewValue 
Change is effective immediately and is permanent. If the -T flag is used, the change is immediate and lasts until the next boot. If the -P flag is used, the change is deferred until the next boot and is permanent. 
Diagnosis: Applications performing large writes to striped raw logical volumes are not obtaining the desired throughput rate. 
Tuning: Value should equal the number of physical drives (including those in disk arrays) on the SCSI bus, times the queue depth of the individual drives. 

Disk Drive Queue Depth Purpose: Maximum number of requests the disk device can hold in its queue. 
Values: Default: IBMr disks=3; Non-IBM disks=0; Range: specified by manufacturer 
Display: lsattr -E -l hdiskn 
Change: chdev -l hdiskn -a q_type=simple -a queue_depth=NewValue 
Change is effective immediately and is permanent. If the -T flag is used, the change is immediate and lasts until the next boot. If the -P flag is used, the change is deferred until the next boot and is permanent. 
Diagnosis: N/A 
Tuning: If the non-IBM disk drive is capable of request-queuing, make this change to ensure that the operating system takes advantage of the capability. 
Refer to: Setting SCSI-Adapter and Disk-Device Queue Limits 

Interprocess Communication Tunable Parameters
msgmax Purpose: Specifies maximum message size. 
Values: Dynamic with maximum value of 4 MB 
Display: N/A 
Change: N/A 
Diagnosis: N/A 
Tuning: Does not require tuning because it is dynamically adjusted as needed by the kernel. 

msgmnb Purpose: Specifies maximum number of bytes on queue. 
Values: Dynamic with maximum value of 4 MB 
Display: N/A 
Change: N/A 
Diagnosis: N/A 
Tuning: Does not require tuning because it is dynamically adjusted as needed by the kernel. 

msgmni Purpose: Specifies maximum number of message queue IDs. 
Values: Dynamic with maximum value of 131072 
Display: N/A 
Change: N/A 
Diagnosis: N/A 
Tuning: Does not require tuning because it is dynamically adjusted as needed by the kernel. 

msgmnm Purpose: Specifies maximum number of messages per queue. 
Values: Dynamic with maximum value of 524288 
Display: N/A 
Change: N/A 
Diagnosis: N/A 
Tuning: Does not require tuning because it is dynamically adjusted as needed by the kernel. 

semaem Purpose: Specifies maximum value for adjustment on exit. 
Values: Dynamic with maximum value of 16384 
Display: N/A 
Change: N/A 
Diagnosis: N/A 
Tuning: Does not require tuning because it is dynamically adjusted as needed by the kernel. 

semmni Purpose: Specifies maximum number of semaphore IDs. 
Values: Dynamic with maximum value of 131072 
Display: N/A 
Change: N/A 
Diagnosis: N/A 
Tuning: Does not require tuning because it is dynamically adjusted as needed by the kernel. 

semmsl Purpose: Specifies maximum number of semaphores per ID. 
Values: Dynamic with maximum value of 65535 
Display: N/A 
Change: N/A 
Diagnosis: N/A 
Tuning: Does not require tuning because it is dynamically adjusted as needed by the kernel. 

semopm Purpose: Specifies maximum number of operations per semop() call. 
Values: Dynamic with maximum value of 1024 
Display: N/A 
Change: N/A 
Diagnosis: N/A 
Tuning: Does not require tuning because it is dynamically adjusted as needed by the kernel. 

semume Purpose: Specifies maximum number of undo entries per process. 
Values: Dynamic with maximum value of 1024 
Display: N/A 
Change: N/A 
Diagnosis: N/A 
Tuning: Does not require tuning because it is dynamically adjusted as needed by the kernel. 

semvmx Purpose: Specifies maximum value of a semaphore. 
Values: Dynamic with maximum value of 32767 
Display: N/A 
Change: N/A 
Diagnosis: N/A 
Tuning: Does not require tuning because it is dynamically adjusted as needed by the kernel. 

shmmax Purpose: Specifies maximum shared memory segment size. 
Values: Dynamic with maximum value of 256 MB for 32-bit processes and 0x80000000u for 64-bit 
Display: N/A 
Change: N/A 
Diagnosis: N/A 
Tuning: Does not require tuning because it is dynamically adjusted as needed by the kernel. 

shmmin Purpose: Specifies minimum shared-memory-segment size. 
Values: Dynamic with minimum value of 1 
Display: N/A 
Change: N/A 
Diagnosis: N/A 
Tuning: Does not require tuning because it is dynamically adjusted as needed by the kernel. 

shmmni Purpose: Specifies maximum number of shared memory IDs. 
Values: Dynamic with maximum value of 131072 
Display: N/A 
Change: N/A 
Diagnosis: N/A 
Tuning: Does not require tuning because it is dynamically adjusted as needed by the 


AIX TUNABLE ENVIRONMENT PARAMETERS:
=======================================

Thread Support Tunable Parameters
Following is a list of thread support parameters that can be tuned:

AIXTHREAD_COND_DEBUG (AIX 4.3.3 and subsequent versions) Purpose: Maintains a list of condition variables for use by the debugger. 
Values: Default: ON 
Range: ON, OFF 
Display: echo $AIXTHREAD_COND_DEBUG (this is turned on internally, so the initial default value will not be seen with the echo command) 
Change: AIXTHREAD_COND_DEBUG={ON|OFF} 
export AIXTHREAD_COND_DEBUG 
Change takes effect immediately in this shell. Change is effective until logging out of this shell. 
Permanent change is made by adding AIXTHREAD_COND_DEBUG={ON|OFF} command to the /etc/environment file. 
Diagnosis: Leaving it on makes debugging threaded applications easier, but may impose some overhead. 
Tuning: If the program contains a large number of active condition variables and frequently creates and destroys condition variables, this may create higher overhead for maintaining the list of condition variables. Setting the variable to OFF will disable the list. 
Refer to Thread Debug Options. 

AIXTHREAD_ENRUSG Purpose: Enable or disable pthread resource collection. 
Values: Default: OFF 
Range: ON, OFF 
Display: echo $AIXTHREAD_ENRUSG (this is turned on internally, so the initial default value will not be seen with the echo command) 
Change: AIXTHREAD_ENRUSG={ON|OFF} 
export AIXTHREAD_ENRUSG 
Change takes effect immediately in this shell. Change is effective until logging out of this shell. Permanent change is made by adding AIXTHREAD_ENRUSG={ON|OFF} command to the /etc/environment file. 
Diagnosis: Turning it on allows for resource collection of all pthreads in a process, but will impose some overhead. 
Tuning:  
Refer to Thread Environment Variables. 

AIXTHREAD_GUARDPAGES (AIX 4.3 and later) Purpose: Controls the number of guard pages to add to the end of the pthread stack. 
Values: Default: 0Range: A positive integer 
Display: echo $AIXTHREAD_GUARDPAGES (This is turned on internally, so the initial default value will not be seen with the echo command) 
Change: AIXTHREAD_GUARDPAGES=nexport AIXTHREAD_GUARDPAGESChange takes effect immediately in this shell. 
Change is effective until logging out of this shell. Permanent change is made by adding AIXTHREAD_GUARDPAGES=n 
command to the /etc/environment file. 
Diagnosis: N/A 
Tuning: N/A 
Refer to Thread Environment Variables. 

AIXTHREAD_MINKTHREADS (AIX 4.3 and later) Purpose Controls the the minimum number of kernel threads that should be used. 
Values: Default: 8 
Range: A positive integer value 
Display: echo $AIXTHREAD_MINKTHREADS (this is turned on internally, so the initial default value will not be seen with the echo command) 
Change: AIXTHREAD_MINKTHREADS=nexport AIXTHREAD_MINKTHREADSChange takes effect immediately in this shell. Change is effective until logging out of this shell. Permanent change is made by adding AIXTHREAD_MINKTHREADS =n command to the /etc/environment file. 
Diagnosis: N/A 
Tuning: The library scheduler will not reclaim kernel threads below this figure. A kernel thread may be reclaimed at virtually any point. Generally, a kernel thread is targeted as a result of a pthread terminating. 
Refer to: Variables for Process-Wide Contention Scope 

AIXTHREAD_MNRATIO (AIX 4.3 and later) Purpose: Controls the scaling factor of the library. This ratio is used when creating and terminating pthreads. 
Values: Default: 8:1 
Range: Two positive values (p:k), where k is the number of kernel threads that should be employed to handle p runnable pthreads 
Display: echo $AIXTHREAD_MNRATIO (this is turned on internally, so the initial default value will not be seen with the echo command) 
Change: AIXTHREAD_MNRATIO=p:kexport AIXTHREAD_MNRATIOChange takes effect immediately in this shell. Change is effective until logging out of this shell. Permanent change is made by adding AIXTHREAD_MNRATIO=p:k command to the /etc/environment file. 
Diagnosis: N/A 
Tuning: May be useful for applications with a very large number of threads. However, always test a ratio of 1:1 because it may provide for better performance. 
Refer to: Variables for Process-Wide Contention Scope 

AIXTHREAD_MUTEX_DEBUG (AIX 4.3.3 and later) Purpose: Maintains a list of active mutexes for use by the debugger. 
Values: Default: OFF 
Range: ON, OFF 
Display: echo $AIXTHREAD_MUTEX_DEBUG (this is turned on internally, so the initial default value will not be seen with the echo command) 
Change: AIXTHREAD_MUTEX_DEBUG={ON|OFF}export AIXTHREAD_MUTEX_DEBUGChange takes effect immediately in this shell. Change is effective until logging out of this shell. Permanent change is made by adding AIXTHREAD_MUTEX_DEBUG={ON|OFF} command to the /etc/environment file. 
Diagnosis: Setting the variable to ON makes debugging threaded applications easier, but may impose some overhead. 
Tuning: If the program contains a large number of active mutexes and frequently creates and destroys mutexes, this may create higher overhead for maintaining the list of mutexes. Leaving the variable off disables the list. 
Refer to: Thread Debug Options 

AIXTHREAD_RWLOCK_DEBUG (AIX 4.3.3 and later) Purpose: Maintains a list of read-write locks for use by the debugger. 
Values: Default: ON 
Range: ON, OFF 
Display: echo $AIXTHREAD_RWLOCK_DEBUG (this is turned on internally, so the initial default value will not be seen with the echo command) 
Change: AIXTHREAD_RWLOCK_DEBUG={ON|OFF}export AIXTHREAD_RWLOCK_DEBUGChange takes effect immediately in this shell. Change is effective until logging out of this shell. Permanent change is made by adding AIXTHREAD_RWLOCK_DEBUG={ON|OFF} command to the /etc/environment file. 
Diagnosis: Leaving it on makes debugging threaded applications easier, but may impose some overhead. 
Tuning: If the program contains a large number of active read-write locks and frequently creates and destroys read-write locks, this may create higher overhead for maintaining the list of read-write locks. Setting the variable to OFF will disable the list. 
Refer to: Thread Debug Options 

AIXTHREAD_SCOPE (AIX 4.3.1 and later) Purpose: Controls contention scope. P signifies process-based 
contention scope (M:N). S signifies system-based contention scope (1:1). 
Values: Default: P 
Possible Values: P or S 
Display: echo $AIXTHREAD_SCOPE (this is turned on internally, so the initial default value will not be seen 
with the echo command) 
Change: AIXTHREAD_SCOPE={P|S}export AIXTHREAD_SCOPE Change takes effect immediately in this shell. 
Change is effective until logging out of this shell. Permanent change is made by adding AIXTHREAD_SCOPE={P|S} 
command to the /etc/environment file. 

Diagnosis: If fewer threads are being dispatched than expected, then system scope should be tried. 
Tuning: Tests on AIX 4.3.2 have shown that certain applications can perform much better with system based 
contention scope (S). The use of this environment variable impacts only those threads created with the 
default attribute. The default attribute is employed when the attr parameter to pthread_create is NULL. 
Refer to: Thread Environment Variables 


AIXTHREAD_SLPRATIO (AIX 4.3 and later) Purpose: Controls the number of kernel threads that should be held in reserve for sleeping threads. 
Values: Default: 1:12 
Range: Two positive values (k:p), where k is the number of kernel threads that should be held in reserve for p sleeping pthreads 
Display: echo $AIXTHREAD_SLPRATIO (this is turned on internally, so the initial default value will not be seen with the echo command) 
Change: AIXTHREAD_SLPRATIO=k:pexport AIXTHREAD_SLPRATIOChange takes effect immediately in this shell. Change is effective until logging out of this shell. Permanent change is made by adding AIXTHREAD_SLPRATIO=k:p command to the /etc/environment file. 
Diagnosis: N/A 
Tuning: In general, fewer kernel threads are required to support sleeping pthreads, because they are generally woken one at a time. This conserves kernel resources. 
Refer to: Variables for Process-Wide Contention Scope 

AIXTHREAD_STK=n (AIX 4.3.3 ML 09 and later) Purpose: The decimal number of bytes that should be allocated for each pthread. This value may be overridden by pthread_attr_setstacksize. 
Values: Default: 98,304 bytes for 32bit applications, 196,608 bytes for 64bit applications. 
Range: Decimal integer values from 0 to 268,435,455 which will be rounded up to the nearest page (currently 4,096). 
Display: echo $AIXTHREAD_STK (this is turned on internally, so the initial default value will not be seen with the echo command) 
Change: AIXTHREAD_STK=size export AIXTHREAD_STK Change takes effect immediately in this shell. Change is effective until logging out of this shell. Permanent change is made by adding AIXTHREAD_STK=size to the /etc/environment file. 
Diagnosis: If analysis of a failing program indicates stack overflow, the default stack size can be increased. 
Tuning: If trying to reach the 32,000 thread limit on a 32 bit application, it may be necessary to decrease the default stack size. 

MALLOCBUCKETS (Version 4.3.3.25 and later) Purpose: Enables buckets-based extension in the default memory allocator which may enhance performance of applications that issue large numbers of small allocation requests. 
Values: MALLOCTYPE=buckets 
 

MALLOCBUCKETS=[[ number_of_buckets:n | bucket_sizing_factor:n | blocks_per_bucket:n | bucket_statistics:[stdout|stderr|pathname]],...] 
The following table displays default values of MALLOCBUCKETS. MALLOCBUCKETS Default Values

MALLOCBUCKETS Options 
Default Value 
number_of_buckets1 
16 
bucket_sizing_factor (32-bit)2 
32 
bucket_sizing_factor (64-bit)3 
64 
blocks_per_bucket 
10244  
Notes:

1. The minimum value allowed is 1. The maximum value allowed is 128.

2. For 32-bit implementations, the value specified for bucket_sizing_factor must be a multiple of 8.

3. For 64-bit implementations, the value specified for bucket_sizing_factor must be a multiple of 16.

4. The bucket_statistics option is disabled by default.
 
Display: echo $MALLOCBUCKETS; echo $MALLOCTYPE 
Change: Use the shell specific method of exporting the environment variables. 
Diagnosis: If malloc performance is slow and many small malloc requests are issued, this feature may enhance performance. 
Tuning: To enable malloc buckets, the MALLOCTYPE environment variable has to be set to the value "buckets". 
 

The MALLOCBUCKETS environment variable may be used to change the default configuration of the malloc buckets, although the default values should be sufficient for most applications. 
 

The number_of_buckets:n option can be used to specify the number of buckets available per heap, where n is the number of buckets. The value specified for n will apply to all available heaps. 
 

The bucket_sizing_factor:n option can be used to specify the bucket sizing factor, where n is the bucket sizing factor in bytes. 
 

The blocks_per_bucket:n option can be used to specify the number of blocks initially contained in each bucket, where n is the number of blocks. This value is applied to all of the buckets. The value of n is also used to determine how many blocks to add when a bucket is automatically enlarged because all of its blocks have been allocated. 
 

The bucket_statistics option will cause the malloc subsystem to output a statistical summary for malloc buckets upon typical termination of each process that calls the malloc subsystem while malloc buckets is enabled. This summary will show buckets configuration information and the number of allocation requests processed for each bucket. If multiple heaps have been enabled by way of malloc multiheap, the number of allocation requests shown for each bucket will be the sum of all allocation requests processed for that bucket for all heaps. 
 

The buckets statistical summary will be written to one of the following output destinations, as specified with the bucket_statistics option. 
stdout 
Standard output 
stderr 
Standard error 
pathname 
A user-specified pathname 
 

If a user-specified pathname is provided, statistical output will be appended to the existing contents of the file (if any). Avoid using standard output as the output destination for a process whose output is piped as input into another process. 
Refer to: Malloc Buckets 

MALLOCMULTIHEAP (AIX 4.3.1 and later) Purpose: Controls the number of heaps within the process private segment. 
Values: Default: 16 for 4.3.1 and 4.3.2, 32 for 4.3.3 and later 
Range: A positive number between 1 and 32) 
Display: echo $MALLOCMULTIHEAP (this is turned on internally, so the initial default value will not be seen with the echo command) 
Change: MALLOCMULTIHEAP=[[heaps:n | considersize],...] export MALLOCMULTIHEAPChange takes effect immediately in this shell. Change is effective until logging out of this shell. Permanent change is made by adding MALLOCMULTIHEAP=[[heaps:n | considersize],...] command to the /etc/environment file. 
Diagnosis: Look for lock contention on the malloc lock (located in segment F) or fewer than expected runnable threads. 
Tuning: Smaller number of heaps can help reduce size of the process. Certain multithreaded user processes which use the malloc subsystem heavily may obtain better performance by exporting the environment variable MALLOCMULTIHEAP=1 before starting the application. 
 

The potential performance enhancement is particularly likely for multithreaded C++ programs, because these may make use of the malloc subsystem whenever a constructor or destructor is called. 
 

Any available performance enhancement will be most evident when the multithreaded user process is running on an SMP system, and particularly when system scope threads are used (M:N ratio of 1:1). However, in some cases, enhancement may also be evident under other conditions, and on uniprocessors. 
 

If the considersize option is specified, an alternate heap selection algorithm is used that tries to select an available heap that has enough free space to handle the request. This may minimize the working set size of the process by reducing the number of sbrk() calls. However, there is a bit more processing time required for this algorithm. 
Refer to: Thread Environment Variables 

SPINLOOPTIME Purpose: Controls the number of times to retry a busy lock before yielding to another processor (only for libpthreads). 
Values: Default: 1 on uniprocessors, 40 on multiprocessors 
Range: A positive integer 
Display: echo $SPINLOOPTIME (this is turned on internally, so the initial default value will not be seen with the echo command) 
Change: SPINLOOPTIME=nexport SPINLOOPTIMEChange takes effect immediately in this shell. Change is effective until logging out of this shell. Permanent change is made by adding SPINLOOPTIME=n command to the /etc/environment file. 
Diagnosis: If threads are going to sleep often (lot of idle time), then the SPINLOOPTIME may not be high enough. 
Tuning: Increasing the value from default of 40 on multiprocessor systems might be of benefit if there is pthread mutex contention. 
Refer to: Thread Environment Variables 

YIELDLOOPTIME Purpose: Controls the number of times to yield the processor before blocking on a busy lock (only for libpthreads). The processor is yielded to another kernel thread, assuming there is another runnable kernel thread with sufficient priority. 
Values: Default: 0 
Range: A positive value 
Display: echo $YIELDLOOPTIME (this is turned on internally, so the initial default value will not be seen with the echo command) 
Change: YIELDLOOPTIME=nexport YIELDLOOPTIMEChange takes effect immediately in this shell. Change is effective until logging out of this shell. Permanent change is made by adding YIELDLOOPTIME=n command to the /etc/environment file. 
Diagnosis: If threads are going to sleep often (lot of idle time), then the YIELDLOOPTIME may not be high enough. 
Tuning: Increasing the value from default value of 0 may benefit if you do not want the threads to go to sleep when waiting for locks. 
Refer to: Thread Environment Variables 

Miscellaneous Tunable Parameters
Following is a list of miscellaneous parameters that can be tuned:

EXTSHM (AIX 4.2.1 and later) Purpose: Turns on the extended shared memory facility. 
Values: Default: Not set 
Possible Value: ON 
Display: echo $EXTSHM 
Change: EXTSHM=ON export EXTSHMChange takes effect immediately in this shell. Change is effective until logging out of this shell. Permanent change is made by adding EXTSHM=ON command to the /etc/environment file. 
Diagnosis: N/A 
Tuning: Setting value to ON will allow a process to allocate shared memory segments as small as 1 byte (though this will be rounded up to the nearest page); this effectively removes the limitation of 11 user shared memory segments. Maximum size of all segments together can still only be 2.75 GB worth of memory for 32-bit processes. 64-bit processes do not need to set this variable since a very large number of segments is available. Some restrictions apply for processes that set this variable, and these restrictions are the same as with processes that use mmap buffers. 
Refer to: Extended Shared Memory (EXTSHM) 

LDR_CNTRL Purpose: Allows tuning of the kernel loader. 
Values: Default: Not set Possible Values: PREREAD_SHLIB, LOADPUBLIC, IGNOREUNLOAD, USERREGS, MAXDATA, DSA, PRIVSEG_LOADS 
Display: echo $LDR_CNTRL 
Change: LDR_CNTRL={PREREAD_SHLIB | LOADPUBLIC| ...} export LDR_CNTRLChange takes effect immediately in this shell. Change is effective until logging out of this shell. Permanent change is made by adding the following line to the /etc/environment file: LDR_CNTRL={PREREAD_SHLIB | LOADPUBLIC| ...} 
Diagnosis: N/A 
Tuning: The LDR_CNTRL environment variable can be used to control one or more aspects of the system loader behavior. You can specify multiple options with the LDR_CNTRL variable. When doing this, separate the options using an @ character (that is, LDR_CNTRL=PREREAD_SHLIB@LOADPUBLIC). Specifying the PREREAD_SHLIB option will cause entire libraries to be read as soon as they are accessed. With VMM readahead tuned, a library can be read in from disk and be cached in memory by the time the program starts to access its pages. While this method can use more memory, it can enhance performance of programs that use many shared library pages providing the access pattern is non-sequential. (for example, Catia). Specifying the LOADPUBLIC option directs the system loader to load all modules requested by an application into the global shared library segment. If a module cannot be loaded publicly into the global shared library segment then it is loaded privately for the application. Specifying the IGNOREUNLOAD option will cause modules that are marked to be unloaded and used again (if the module has not been unloaded already). As a side effect of this option, you can end up with two different data instances for the module. Specifying the USERREGS option will tell the system to save all general-purpose user registers across system calls made by an application. This can be helpful in applications doing garbage collection. Specifying the MAXDATA option sets the maximum heap size for a process, including overriding any MAXDATA value specified in an executable. If you want to use Large Program Support with a data heap size of 0x30000000, then specify LDR_CNTRL=MAXDATA=0x30000000. To turn off Large Program Support, specify LDR_CNTRL=MAXDATA=0. Specifying the DSA (Dynamic Segment Allocation) option tells the system loader to run applications using Very Large Program Support. The DSA option is only valid for 32-bit applications. Specifying the PRIVSEG_LOADS option directs the system loader to put dynamically loaded private modules into the process private segment. This might improve the availability of memory in large memory model applications that perform private dynamic loads and tend to run out of memory in the process heap. If the process private segment lacks sufficient space, the PRIVSEG_LOADS option has no effect. The PRIVSEG_LOADS option is only valid for 32-bit applications with a non-zero MAXDATA value. 

NODISCLAIM Purpose: Controls how calls to free() are being handled. When PSALLOC is set to early, all free() calls result in a disclaim() system call. When NODISCLAIM is set to True, this does not occur. 
Values: Default: Not set 
Possible Value: True 
Display: echo $NODISCLAIM 
Change: NODISCLAIM=true export NODISCLAIMChange takes effect immediately in this shell. Change is effective until logging out of this shell. Permanent change is made by adding NODISCLAIM=true command to the /etc/environment file. 
Diagnosis: If number of disclaim() system calls is very high, you may want to set this variable. 
Tuning: Setting this variable will eliminate calls to disclaim() from free() if PSALLOC is set to early. 
Refer to: Early Page Space Allocation 

NSORDER Purpose: Overwrites the set name resolution search order. 
Values: Default: bind, nis, local 
Possible Values: bind, local, nis, bind4, bind6, local4, local6, nis4, or nis6 
Display: echo $NSORDER (this is turned on internally, so the initial default value will not be seen with the echo command) 
Change: NSORDER=value, value, ... export NSORDERChange takes effect immediately in this shell. Change is effective until logging out of this shell. Permanent change is made by adding NSORDER=value command to the /etc/environment file. 
Diagnosis: N/A 
Tuning: NSORDER overrides the /etc/netsvc.conf file. 
Refer to: Tuning Name Resolution 

PSALLOC Purpose: Sets the PSALLOC environment variable to determine the paging-space allocation policy. 
Values: Default: Not set 
Possible Value: early 
Display: echo $PSALLOC 
Change: PSALLOC=early export PSALLOCChange takes effect immediately in this shell. Change is effective until logging out of this shell. 
Diagnosis: N/A 
Tuning: To ensure that a process is not killed due to low paging conditions, this process can preallocate paging space by using the Early Page Space Allocation policy. However, this may result in wasted paging space. You may also want to set the NODISCLAIM environment variable. 
Refer to: Allocation and Reclamation of Paging Space Slots and Early Page Space Allocation 

RT_GRQ (AIX 4.3.3.1 and later) Purpose: Causes thread to be put on a global run queue rather than on a per-CPU run queue. 
Values: Default: Not set; Range: ON, OFF 
Display: echo $RT_GRQ 
Change: RT_GRQ={OFF/ONexport RT_GRQChange takes effect immediately. Change is effective until next boot. Permanent change is made by adding RT_GRQ={ON|OFF} command to the /etc/environment file. 
Diagnosis: N/A 
Tuning: May be tuned on multiprocessor systems. Set to ON, will cause the thread to be put on a global run queue. In that case, the global run queue is searched to see which thread has the best priority. This might allow to get the thread dispatched sooner and can improve performance for threads that are running SCHED_OTHER, and are interrupt driven. 
Refer to: Scheduler Run Queue 

RT_MPC (AIX 4.3.3 and later) Purpose: When running the kernel in real-time mode (see bosdebug command), an MPC can be sent to a different CPU to interrupt it if a better priority thread is runnable so that this thread can be dispatched immediately. 
Values: Default: Not set; Range: ON 
Display: echo $RT_MPC 
Change: RT_MPC=ON 
export RT_MPC 
Change takes effect immediately. Change is effective until next boot. Permanent change is made by adding RT_MPC=ON command to the /etc/environment file. 
Diagnosis: N/A 


Note on LDR_CNTRL:
------------------


Setting the maximum number of AIX data segments that a process can use (LDR_CNTRL)
In AIX, Version 4.3.3 and later, the number of segments that a process can use for data is controlled 
by the LDR_CNTRL environment variable. It is defined in the parent process of the process that 
is to be affected. For example, the following defines one additional data segment: 

export LDR_CNTRL =MAXDATA=0x10000000
start_process
unset LDR_CNTRL

It is a good idea to unset the LDR_CNTRL environment variable, so that it does not unintentionally 
affect other processes. 

Unlike other environment variables for the IBM SecureWay Directory server process (slapd), 
the LDR_CNTRLenvironment variable cannot be set as a front-end variable in the slapd32.conf file. 
It must be set as an environment variable. 

The following table shows the LDR_CNTRL setting and memory increase for various numbers of data segments: 

LDP_CNTRL Setting  	Number of Additional Segments  Process Memory Limit Increase  
Unset  				0 (default)  		256 MB  
LDR_CNTRL=MAXDATA=0x1000000  	1  			512 MB  
LDR_CNTRL=MAXDATA=0x2000000  	2  			768 MB  
LDR_CNTRL=MAXDATA=0x3000000  	3  			1 GB  
LDR_CNTRL=MAXDATA=0x4000000  	4  			1.25 GB  
LDR_CNTRL=MAXDATA=0x5000000  	5 			1.5 GB  
LDR_CNTRL=MAXDATA=0x6000000  	6  			1.75 GB  
LDR_CNTRL=MAXDATA=0x7000000  	7  			2 GB  
LDR_CNTRL=MAXDATA=0x8000000  	8  			2.25 GB


##############################################################

SECTION 17: Unix, Java, Websphere:

##############################################################


-----
Note:
-----

technote:

AIX
  

AIX Java News and FAQ 

What's New?

March 2009 - AIX Java 6 32-bit update released (APAR IZ45341) and redistributional SDK refreshed, both using pap3260sr4-20090219_01 build (SR4). 
March 2009 - AIX Java 6 64-bit update released (APAR IZ45342) and redistributional SDK refreshed, both using pap6460sr4-20090219_01 build (SR4). 
December 2008 - AIX Java 5 32-bit update released (APAR IZ39401) and redistributional SDK refreshed, both using pap32dev-20081129 build (SR9). 
December 2008 - AIX Java 5 64-bit update released (APAR IZ39402) and redistributional SDK refreshed, both using pap64dev-20081129 build (SR9). 
March 2009 - SDK 1.4.2 32-bit update released (APAR IZ47402) and JRE 1.4.2 32-bit refreshed, both using pj9ap32142-20090310 build (SR13). 
March 2009 - SDK 1.4.2 64-bit update released (APAR IZ47404) and JRE 1.4.2 64-bit refreshed, both using pj9ap64142-20090310 build (SR13). 
September 2007 - SDK 1.3.1 32-bit PTF (APAR IZ05458) released and JRE 1.3.1 32-bit refreshed, both using ca131-20070913 build (SR11). 
September 2007 - SDK 1.3.1 64-bit PTF (APAR IZ05460) released and JRE 1.3.1 64-bit refreshed, both using caix64131-20070913 build (SR11). 
June 2006 - SDK 1.3.0 32-bit PTF (APAR IY85292) released and JRE 1.3.0 32-bit refreshed, both using ca131-20060605 build. 
September 2004 - Java security packages refreshed for 1.3.0, 1.3.1, and 1.3.1 64-bit. 
February 2003 - JDK 1.2.2 PTF (APAR IY40034) released and JRE 1.2.2 refreshed, both using the ca122-20030115 build. 
May 2002 - JDK 1.1.8 PTF 13 (APAR IY30886) released and JRE 1.1.8 refreshed, both using the a118-20020509 build. 

Frequently Asked Questions
-What AIX levels are required for Java releases? 
-What should I download - JDK/SDK or JRE, base images or update images? Can you explain SDK 1.4.x installp images? 
-How do I extract from a tar or tar.gz file downloaded?  How do I install the base and update images? 
-Can these releases co-exist on a machine? In which directories are these releases installed? 
-How to find out what versions of Java I have installed? Why does "java -fullversion" still show 1.1.8 on my system after 
-I have installed another release? How can 1.3.0 "java -fullversion" show 1.3.1? 
-What paths do I need to set to use a specific Java release on my system? 
-How do I know if an image file has been downloaded successfully? 
-How do I to get support information on Java for AIX? 
-Are there any Newsgroups for Java on AIX? 

1. Question: What AIX levels are required for Java releases?
    Answer: 
To take advantage of latest AIX fixes it is recommended/required that latest AIX Recommended Maintenance Level be used. 
The following is the minimum AIX level required at the time when a Java release was first released: 
Java 1.1.8 requires AIX 4.2.1 
Java 1.2.2 requires AIX 4.3.3 PLUS fixes 
Java 1.3.0 requires AIX 4.3.3.10 PLUS fixes 
Java 1.3.1 64-bit requires AIX 5.1.0.10 
Java 1.3.1 requires AIX 4.3.3.75 
Java 1.4 requires at least AIX 5.1.0.75 or AIX 5.2.0.10
Java 5 requires at least AIX 5.2.0.75 or AIX 5.3.0.30

2. Question: What should I download - JDK/SDK or JRE, base images or update images? Can you explain SDK 1.4.x installp images? 
    Answer: 

For each release you can download either the Developer Kit (JDK or SDK) or the Runtime Environment (JRE). 
The Runtime Environment is meant only for distributing Java with your applications.

The Developer Kit is in AIX's installp format. There are install images (base images) and update images (also called PTFs or fixes). 
Update images are to be installed on top of the base images; that is, update images can only be installed after the base images are installed.

All installp images have VRMF numbers associated with them. VRMF stands for Version, Release, Modification, and Fix. 
If you run command "lslpp -l | grep Java" on your machine you will see installed Java filesets and their VRMF numbers. 

All SDK 1.4.x installp images (VRMF numbers 1.4.0.*, 1.4.1.*, and 1.4.2.*) install in the same directories, 
/usr/java14 for 32-bit code and /usr/java14_64 for 64-bit code. This is because AIX Java 1.4 is the product 
and 1.4.1 and 1.4.2 are maintenance levels (1.4.2 being the latest and last maintenance level) of the product. 

Now what installp images does one need to install? Remember update images are to be installed on top of base images. 
These 1.4.x base images are available: the oldest 1.4.0 base images, the latest 1.4.1 base images, and the latest 1.4.2 
base images. If Java 1.4 is not installed and you want to install it on your system, you will need to start with one of these base images, 
and then you can update using update images. 

Base images can be downloaded directly from the individual release download pages. To download update images, 
please follow the "How and where to get fixes" link. 
 

3. Question: How do I extract from a tar or tar.gz file downloaded?  How do I install the JDK/SDK base and update images?

    Answer: 
For base images after you downloaded either packagename.tar or the packagename.tar.gz file (the latter is recommended 
if you have gunzip utility available), you need to extract packagename from the downloaded file: 
    tar -xvf packagename.tar  (example: tar -xvf Java14.sdk.tar), or
    gunzip -c packagename.tar.gz | tar -xvf -  (example: gunzip -c Java14.sdk.tar.gz | tar -xvf - )

For update images the .bff files are ready to be installed. Before installing, remove the old .toc file 
(if it exist) in the directory containing the .bff images.

You can use the smitty command to install (both base and update images):

        Run "smitty install"
        Select "Install and Update Software"
        Select "Install Software"
        Specify directory containing the images
        ...

4. Question: Can these releases co-exist on a machine? In which directories are these releases installed?
    Answer: 
Yes, releases can co-exist. 
Java 1.1.8 installs in /usr/jdk_base
Java 1.2.2 installs in /usr/java_dev2
Java 1.3.0 installs in /usr/java130
Java 1.3.1 64-bit installs in /usr/java13_64
Java 1.3.1 installs in /usr/java131
Java 1.4 64-bit installs in /usr/java14_64
Java 1.4 installs in /usr/java14 
Java 5 64-bit installs in /usr/java5_64
Java 5 32-bit installs in /usr/java5 
Java 6 64-bit installs in /usr/java6_64
Java 6 32-bit installs in /usr/java6 

5. Question: How to find out what versions of Java I have installed? Why does "java -fullversion" still show 1.1.8 on my system 
after I have installed another release? How can 1.3.0 "java -fullversion" show 1.3.1?

    Answer: 
The most accurate information is returned by command java -fullversion.
Example: java full version "J2RE 1.4.1 IBM AIX build ca141-20030522"
The date (20030522) tells when the code was built. 
Java 1.1.8 - the command is "/usr/jdk_base/bin/java -fullversion"
Java 1.2.2 - the command is "/usr/java_dev2/jre/sh/java -fullversion"
Java 1.3.0 - the command is "/usr/java130/jre/bin/java -fullversion"
Java 1.3.1 64-bit - the command is "/usr/java13_64/jre/bin/java -fullversion"
Java 1.3.1 - the command is "/usr/java131/jre/bin/java -fullversion"
Java 1.4 64-bit - the command is "/usr/java14_64/jre/bin/java -fullversion"
Java 1.4 - the command is "/usr/java14/jre/bin/java -fullversion"
Java 5 32-bit - the command is "/usr/java5/jre/bin/java -fullversion"
Java 5 64-bit - the command is "/usr/java5_64/jre/bin/java -fullversion" 
Java 6 32-bit - the command is "/usr/java6/jre/bin/java -fullversion"
Java 6 64-bit - the command is "/usr/java6_64/jre/bin/java -fullversion" 

Installp installation for 1.1.8 creates a symbolic links in /usr/bin for commands such as java and javac. 
Installation of other releases does not create similar links. So, by default (when the PATH is not set for other releases), 
"java -fullversion" returns 1.1.8. 

The command "/usr/java130/jre/bin/java -fullversion" can show 1.3.1, because the support for 1.3.0 ended 12/31/2002. 
All 1.3.0 PTFs shipped after that date use 1.3.1 code.

6. Question: What paths do I need to set to use a specific Java release on my system?
    Answer: 

Java 1.1.8:
PATH=/usr/jdk_base/bin:$PATH 
Java 1.2.2:
PATH=/usr/java_dev2/jre/sh:/usr/java_dev2/sh:$PATH 

Java 1.3.0
PATH=/usr/java130/jre/bin:/usr/java130/bin:$PATH 

Java 1.3.1 64-bit:
PATH=/usr/java13_64/jre/bin:/usr/java13_64/bin:$PATH 

Java 1.3.1
PATH=/usr/java131/jre/bin:/usr/java131/bin:$PATH 

Java 1.4 64-bit:
PATH=/usr/java14_64/jre/bin:/usr/java14_64/bin:$PATH 

Java 1.4
PATH=/usr/java14/jre/bin:/usr/java14/bin:$PATH 

Java 5 32-bit
PATH=/usr/java5/jre/bin:/usr/java5/bin:$PATH 

Java 5 64-bit
PATH=/usr/java5_64/jre/bin:/usr/java5_64/bin:$PATH 

Java 6 32-bit
PATH=/usr/java6/jre/bin:/usr/java6/bin:$PATH 

Java 6 64-bit
PATH=/usr/java6_64/jre/bin:/usr/java6_64/bin:$PATH 

7. Question: How do I know if an image file has been downloaded successfully?
    Answer: 
After you downloaded a package in a tar or tar.gz file, if you can extract from the file successfully (see Question 3) 
the downloading must be successful. 
If "smitty install" does not complain about an extracted package, the downloading must be successful. 

For JDK/SDK base images, the download pages have numbers of bytes for all tar and tar.gz files.  

8. Question: How do I to get support information on Java for AIX?
    Answer: 
Please go to https://techsupport.services.ibm.com/server/support?view=pSeries. If you are a customer in the U.S.A. 
and have an AIX Support Line contract, you can call 1-800-CALL-AIX for support.

9. Question: Are there any Newsgroups for Java on AIX?
    Answer: 
Newsgroup for Java on AIX:  ibmpub.java.aix
Newsgroup for AIX:  comp.unix.aix 
 

-----
Note:
-----

thread:

Q:

Hi,


Problem is in Solaris environment. Application works fine in Windows environment with connection pool size of 10. 
But in Solaris even pool size set to 100, this error occurs.


WebSphere Version: 6.1.0.15

OS : Solaris 10

Database: Oracle 10g


J2CA0045E: Connection not available while invoking method createOrWaitForConnection for resource


com.ibm.websphere.ce.j2c.ConnectionWaitTimeoutExce ption: Connection not available, Timed out waiting for 180004

at com.ibm.ejs.j2c.FreePool.createOrWaitForConnection (FreePool.java:1499)

at com.ibm.ejs.j2c.PoolManager.reserve(PoolManager.ja va:2287)

at com.ibm.ejs.j2c.ConnectionManager.allocateMCWrappe r(ConnectionManager.java:896)

at com.ibm.ejs.j2c.ConnectionManager.allocateConnecti on(ConnectionManager.java:596 )

at com.ibm.ws.rsadapter.jdbc.WSJdbcDataSource.getConn ection(WSJdbcDataSource.java: 439)

at com.ibm.ws.rsadapter.jdbc.WSJdbcDataSource.getConn ection(WSJdbcDataSource.java: 408)

....................................

Anybody else have this problem? Any solutions? Suggestions?

A:

Using Connection information in WebSphere trace files to troubleshoot J2CA0045E and J2CA0020E or connection wait time-out problems.
  
 Technote (troubleshooting) 
  
Problem(Abstract) 
J2CA0045E and J2CA0020E errors can be caused by many problems. They are showing a time-out condition where a resource 
or a managed connection is not available to fulfill a connection request.

In this technote we will use connection information in WebSphere� trace files to troubleshoot 
J2CA0045E and J2CA0020E or connection wait time-out problems.  
 
 
Cause 
In a trace file with WAS.j2c=all=enabled:RRA=all=enabled:WAS.database=all=enabled, you will get dumps of the 
connection pooling that look like the following:

[7/20/05 23:40:01:450 EDT] 6a8068fa d UOW= source=com.ibm.ejs.j2c.poolmanager.PoolManager org=IBM prod=WebSphere 
component=Application Server reserve(), Pool contents ==> PoolManager name:jdbc/xxxxxx
PoolManager object:1424371533
Total number of connections: 45 (max/min 45/20)
Pool Bucket configuration
Shared Buckets: 200
UnShared Buckets: 45
The waiter count is 38
The mcWrappers in waiter queue []
Shared Connection information 
Unshare Connection information 


Resolving the problem 
1) See if you have Waiters:
Waiters are connection requests waiting on a managed connection. If a connection request is made and a managed connection is 
not allocated for that request , the request will timeout and throw an error combination like below in your 
SystemOut.log and or your trace file: 

J2CA0045E: Connection not available while invoking method queueRequest for resource jdbc/xxxxxx.
J2CA0020E: The Connection Pool Manager could not allocate a Managed Connection: com.ibm.websphere.ce.j2c.ConnectionWaitTimeoutException: 
Connection not available, Timed out waiting for 10988

One way to use the connection pool section in the trace file to troubleshoot these kinds of errors is to enter the 
trace file and search for the word "waiter". As you move down through the file, you will expect to see the waiter 
count to increase and decrease if in fact you do have waiters. If you do not have any waiters, the waiter count entry 
will not appear in the trace for a particular connection object listing and you can be assured that your connection pooling 
is sized properly and you do not have contention for resources. If you do see waiters, 
you have connection requests waiting on managed connections. 

2) Discover if the waiters you have is on one or more PoolManager Object:
Another important point to make at this point is that there can be more than one PoolManager Object. 
There is one pool manager object per data source that is configured for the application server and the number 
assigned to that PoolManager object will remain the same for the duration of the application server run from 
startup to stop of the same. 

Once you discover you have waiters, you must find out if they are for one or more PoolManager Objects. You can determine 
how many pool manager objects you have by searching on the keyword PoolManager object then note the numbers that follow. 
In the case of figure 1, the number is: 1424371533.

Once you have determined how many PoolManager Objects you have, you must determine if the waiters are for one Object 
or many. You do this by going through each of the PoolManager Objects to see if in fact that Object has the waiters. 
You can usually tell after going through a few of them or try the first one in the file and the last one in the file. 

3) Assess the damage: 
Look in the area called Shared Connection information that is below all the PoolManger Object information. 
Pull that information out into a separate file and then sort it by thread ID or manually move the same thread ID 
lines together. You can see problems as such in some cases:

A) J2CA0086W errors can be found to look as follows in the Shared Connection information section:

com.ibm.ws.LocalTransaction.LocalTranCoordImpl@4423ef2d MCWrapper id 3705a8c8 Managed connection 
com.ibm.ws.rsadapter.spi.WSRdbManagedConnectionImpl@2612ef7a State:STATE_TRAN_WRAPPER_INUSE Thread Id: 2a1368fa 
Thread Name: Servlet.Engine.Transports : 39 Handle count Start time inuse Wed Jul 20 23:24:32 EDT 2005 Time inuse 929 (seconds)

com.ibm.ws.LocalTransaction.LocalTranCoordImpl@4423ef2d MCWrapper id 67cc68ce Managed connection 
com.ibm.ws.rsadapter.spi.WSRdbManagedConnectionImpl@6f4128ce State:STATE_TRAN_WRAPPER_INUSE Thread Id: 2a1368fa 
Thread Name: Servlet.Engine.Transports : 39 Handle count Start time inuse Wed Jul 20 23:24:32 EDT 2005 Time inuse 929 (seconds)

What do we see here? We see that the transaction type is local for both entries. We see the thread ID is
 indicating the same thread. We also see that the managed connection is a different connection. 
Keep in mind that we are looking in the shared connection area. So, what we have here is two different connections 
that are held by the same thread and the same local transaction. The result of this is that these two separate connections 
will be held until the local transaction is complete. If our max connections for the data source is 10, 
we have just decreased it to 8 for the duration of the local transaction above. 

B) Nested Local transactions:
In the information below that is taken from the Shared Connection Information section as well, you may see the following: 

com.ibm.ws.LocalTransaction.LocalTranCoordImpl@1a3fef31 MCWrapper id 1fea8bc Managed connection com.ibm.ws.rsadapter.spi.
WSRdbManagedConnectionImpl@195ee8e4 State:STATE_TRAN_WRAPPER_INUSE Thread Id: 2d9228fa Thread Name: 
Servlet.Engine.Transports : 40 Handle count Start time inuse Wed Jul 20 23:26:30 EDT 2005 Time inuse 811 (seconds)

com.ibm.ws.LocalTransaction.LocalTranCoordImpl@5f08288c MCWrapper id 21d428c1 Managed connection com.ibm.ws.rsadapter.
spi.WSRdbManagedConnectionImpl@3a40e8d2 State:STATE_TRAN_WRAPPER_INUSE Thread Id: 2d9228fa Thread Name: Servlet.Engine.Transports : 
40 Handle count Start time inuse Wed Jul 20 23:26:28 EDT 2005 Time inuse 812 (seconds)

What do we see here? We see the same thread ID with two shared connections that are of different connection IDs and two 
separate transaction IDs. In this case, these are nested local transactions in the same thread. 

Because shareable connections are used, even if the app called close() on the first connection it will remain allocated, 
and when the thread enters a nested LTC and gets a connection, it cannot re-use the first connection because of the differing 
LTC contexts. Each connection is released as the associated LTC comes to an end.

These kind of situations are cause when you pass control of processing in the thread from something like a servlet to another 
servlet or EJB. Could be several things. Again, we have taken up two connections out of our pool. 

Note! If the customer�s code must have this type of nested architecture, they should use unshared connections 
to fix the problem. Yes, they will see more connections but, the connections will release as soon as the work with them is complete. 

Nested Global and Local Transactions:

com.ibm.ws.Transaction.JTA.TransactionImpl@7ebdafd2#tid=510632 MCWrapper id 421fa8c1 Managed connection com.ibm.ws.rsadapter.
spi.WSRdbManagedConnectionImpl@1a5d68c8 State:STATE_TRAN_WRAPPER_INUSE Thread Id: 32c4e8fa Thread Name: Servlet.Engine.Transports : 
381 Handle count Start time inuse Wed Jul 20 23:36:09 EDT 2005 Time inuse 231 (seconds)

com.ibm.ws.LocalTransaction.LocalTranCoordImpl@69132f2d MCWrapper id 16df28df Managed connection 
com.ibm.ws.rsadapter.spi.WSRdbManagedConnectionImpl@d5fe8e4 State:STATE_TRAN_WRAPPER_INUSE Thread Id: 32c4e8fa Thread Name: 
Servlet.Engine.Transports : 381 Handle count Start time inuse Wed Jul 20 23:36:05 EDT 2005 Time inuse 235 (seconds)

What do we see here? We have a local transaction that spawned a global transaction. The same thread number is 
identified for both. We can tell which one did the spawning by the Start time inuse entry in the entries above. 
They both have different managed connection numbers so that means is that each of them has a shared connection but not the same one. 
There is two more connections that are basically dependant on each other out of our max number of connections. 


Long Running Transactions:
com.ibm.ws.LocalTransaction.LocalTranCoordImpl@31f36fda MCWrapper id 1e64a8af Managed connection 
com.ibm.ws.rsadapter.spi.WSRdbManagedConnectionImpl@3d5328a8 State:STATE_TRAN_WRAPPER_INUSE Thread Id: 7b76e8fa Thread Name: 
Servlet.Engine.Transports : 35 Handle count Start time inuse Wed Jul 20 23:23:17 EDT 2005 Time inuse 1003 (seconds)

What do we see here? We see this transaction in the Shared Connections section is showing us that it has been in use for 1003 seconds. 
That is a bit much. If you see entries such as these and the customer�s transaction time out is much less than this, 
you can expect that there is something holding up this transaction like the database or JDBC. Also, note the Handle count 
is null so it is waiting on something to complete. 

Note: Transaction timeout does not kill a transaction when it times out. The transaction timeout will not be enforced 
until the transaction comes back from the database or what ever it is working into. Therefore, if the transaction time-out 
is reached, all that means is that when the transaction does come back, it will be rolled back or handled in a non-positive 
way depending on how the customer has his transactions set up in the deployment descriptor but, the default is to roll the transaction back. 
Further, if a database, JMS or something else does not perform the tasks requested of it in a transaction before the end of 
the transaction timeout setting, the transaction timeout. WebSphere will rollback the transaction. If the transaction does not 
come back or is not released by the database, JMS or whatever, it will sit there and hold the resources. 

Follow on:
If you look in the Shared Connection information or the Unshared Connection information and you see an entry no 
shared connections or no unshared connections respectively, this does not mean that there are no connections left. This means there are none in use. 

Both shared and unshared connections count toward the maximum number of connections set in your data source. 

If you see the free connection count, you have to remember the minimum connections. If connections time-out, 
they will go away down to the minimum connections number. 

Other possible causes of the ConnectionWaitTimeout:
The web container thread pool settings. 
1. Setting the isGrowable flag on in the WebContainer thread pool can cause this. If you have too many consumers 
   for the size of the connection pool.
2. Similarly, a web container thread pool whose maximum is too great for the connection pool, then it can cause this too.  
 

A:

Check with your DBA...you may be running into licensing issues w/ Oracle
that limits the number of connections the db server allows.
 

-----
Note:
-----

technote:


Application hangs with prior J2CA0086W warning messages
  
 Technote (troubleshooting) 
  
Problem(Abstract) 
An application seems to be in a hang state while trying to connect to database. The logs show several ConnectionWaitTimeoutExceptions 
as well as other errors referring to problems with obtaining free connections from the pool. 

Before occurrences of a hang, there are repeated J2CA0086W warning messages and the connection pool is at a maximum.  
  

Cause 
A warning message similar to the one below is observed in the logs before the hang occurs:

[8/19/03 21:37:53:366 CST] 1c86bdf0 SharedPool I J2CA0086W: Shareable connection MCWrapper id 686bbdf9 Managed connection 
com.ibm.ws.rsadapter.spi.WSRdbManagedConnectionImpl@6156bdf9
State:STATE_TRAN_WRAPPER_INUSE

The message reference contains the following explanation of this message:

J2CA0086W: Shareable connection {0} from resource {1} was used within a local transaction containment boundary. 

Explanation: Shareable connections are not allowed within a local transaction containment boundary.

User Response: Connection was made non-shareable. 

The above scenario suggests a possible problem in the application logic and how it is using the connection code. 
The J2CA0086 message is telling you that your application is using a shared connection in a LocalTransaction. 
Since the connection is enlisted in a Local Transaction, and not a Global Transaction, different rules are followed 
when it comes to connection sharing. The connection usage has to follow the pattern shown below: 

get connection, 
use connection, 
close connection, 
commit transaction 

Then the connection can be used again. If this logic is not followed, a second (or third) connection can be allocated. 

For example, if the application calls the getConnection() method it gets connection1, uses connection1, and then, 
if it calls getConnection() method again and connection1 is not ready to be reused, connection2 is obtained. 
Both connections remain in the shared pool and both are associated with the Local Transaction until the Local Transaction ends 
(is committed or rolled back, or the method ends). 

This can result in more connections being created than is expected, which is why the application is reaching 
the maximum connections, and getting ConnectionWaitTimeoutExceptions among others. This can cause a hang if the 
pool is at a maximum, and none of the threads that have connections can complete because they are waiting to get 
another connection. Another consequence could be that while these connections are persisting in the used pool, 
that other threads making connections to the same tables get timed out by the backend database while waiting 
for a lock to be returned, thus causing the exception: 

SQL0911N: The current transaction has been rolled back because of a deadlock or timeout. Reason code 68  
  
 
Resolving the problem 
There are two solutions to this problem: 
1. The application must be modified to use serial reuse 

or 

2. The connection pool can be changed to unshareable. 
 
 
-----
Note:
-----

67.3 Websphere and AIX Memory:
------------------------------

67.3.1 Errors you may find in Websphere logs

1. java.lang.OutOfMemory
2. javax.naming.NameNotFoundException
3. javax.servlet.ServletException
4. java.lang.StringIndexOutOfBoundsException
5. java.net.SocketException
6. java.io.IOException
7. java.io.FileNotFoundException
8. java.util.MissingResourceException
9. java.lang.ClassNotFoundException
10.java.lang.StringIndexOutOfBoundsException
11.java.io.InterruptedIOException
12.com.splwg.cis.common.NestedRuntimeException


The number that is associated with action determines the type of garbage
collection that is being done:

action=1 means a preemptive garbage collection cycle.
action=2 means a full allocation failure.
action=3 means that a heap expansion takes place.
action=4 means that all known soft references are cleared.
action=5 means that stealing from the transient heap is done.
action=6 means that free space is very low.


Note 1 on java.lang.OutOfMemory
-------------------------------

The Java process has two memory areas: the Java heap, and the "native heap", 
which combine total the memory usage of the process. 
The Java heap is controlled via the -Xms and -Xmx setting, and the space 
available to the native heap is that which isn't used by the Java heap. 


The act of reducing the maximum Java heap size has made the "native heap" 
bigger, and this is the area that was memory constrained. 
We know this because the OutOfMemoryError was generated the message informed 
you that the JVM was unable to allocate a new native stack, this is 
allocated onto the native heap (there is also a Java thread object which is 
created and allocated onto the Java heap). 


It is entirely possible that the amount of "native heap" available to the 
JVM was insufficient to allocate the underlying resources to run the Java 
process under the load that was being driven through it. The native heap is 
now 500MB bigger, and unless there is a memory leak or the load is 
significantly increased, this change should prevent any OutOfMemoryErrors 
based on the native heap. 

Note 2 on java.lang.OutOfMemory
-------------------------------

Hi,

I'm experiment with Tomcat with simple "Hello World" servlet.
When I send 50 concurrent requests, I got java.lang.outOfMemory error.
Tomcat works fine upto 40 concurrent requests for the same servlet.
I'm using Tomcat 3.1M1 with Java 1.2 on Solaris 2.7.

We try to add -mx swith to the Java invocation in tomcat.sh
(line 102)
    $JAVACMD -mx96m org.apache.tomcat.shell.Startup "$@" &
And it still out of memory.

Any suggestion?

Lishin

Hi Lishin

This could be to do with exceeding max file-descriptors - this gave us the
error below (45 connections)

We are running tomcat on Solaris 2.6.  Each new connection uses at least one
socket connection, which is treated as a file-descriptor.  There is a
default limit (user) of 64 file descriptors 

To check this try: 

ulimit -n

To increase this try

ulimit -n <num>

There will be a system limit - for Solaris this is default 1024:

system limit:

ulimit -Hn


I hope this helps - I had a very frustrating time solving this one!

Joe.

Note 3 on java.lang.OutOfMemory
-------------------------------

AIX

LDR_CNTRL Purpose: Allows tuning of the kernel loader. 

Values: Default: Not set Possible Values: PREREAD_SHLIB, LOADPUBLIC, IGNOREUNLOAD, USERREGS, MAXDATA, 
DSA, PRIVSEG_LOADS 
Display: echo $LDR_CNTRL 
Change: LDR_CNTRL={PREREAD_SHLIB | LOADPUBLIC| ...} export LDR_CNTRLChange takes effect immediately in this shell. 
Change is effective until logging out of this shell. Permanent change is made by adding the following line to 
the /etc/environment file: LDR_CNTRL={PREREAD_SHLIB | LOADPUBLIC| ...} 
Diagnosis: N/A 

Tuning: The LDR_CNTRL environment variable can be used to control one or more aspects of the system loader behavior. 
You can specify multiple options with the LDR_CNTRL variable. When doing this, separate the options using 
an @ character (that is, LDR_CNTRL=PREREAD_SHLIB@LOADPUBLIC). Specifying the PREREAD_SHLIB option will cause 
entire libraries to be read as soon as they are accessed. With VMM readahead tuned, a library can be read in from disk 
and be cached in memory by the time the program starts to access its pages. While this method can use more memory, 
it can enhance performance of programs that use many shared library pages providing the access pattern 
is non-sequential. (for example, Catia). Specifying the LOADPUBLIC option directs the system loader to load 
all modules requested by an application into the global shared library segment. If a module cannot be loaded 
publicly into the global shared library segment then it is loaded privately for the application. Specifying 
the IGNOREUNLOAD option will cause modules that are marked to be unloaded and used again 
(if the module has not been unloaded already). As a side effect of this option, you can end up with 
two different data instances for the module. Specifying the USERREGS option will tell the system to save 
all general-purpose user registers across system calls made by an application. This can be helpful in 
applications doing garbage collection. Specifying the MAXDATA option sets the maximum heap size for a process, 
including overriding any MAXDATA value specified in an executable. If you want to use Large Program Support 
with a data heap size of 0x30000000, then specify LDR_CNTRL=MAXDATA=0x30000000. To turn off Large Program Support, 
specify LDR_CNTRL=MAXDATA=0. Specifying the DSA (Dynamic Segment Allocation) option tells the system loader 
to run applications using Very Large Program Support. The DSA option is only valid for 32-bit applications. 
Specifying the PRIVSEG_LOADS option directs the system loader to put dynamically loaded private modules into 
the process private segment. This might improve the availability of memory in large memory model applications 
that perform private dynamic loads and tend to run out of memory in the process heap. If the process private segment 
lacks sufficient space, the PRIVSEG_LOADS option has no effect. The PRIVSEG_LOADS option is only valid for 
32-bit applications with a non-zero MAXDATA value. 


-----
Note:
-----

BBB


MustGather: Out of Memory errors on AIX, Part 2a - native leak
Technote (FAQ) 
 
Problem 
MustGather for native leak problems with the WebSphere? Application Server Out of Memory component on AIX? platforms. 
Gathering this information before calling IBM? support will help familiarize you with the troubleshooting process and save you time. 
 
 
Solution 
The following steps outline how to troubleshoot java.lang.OutOfMemoryError errors on an AIX platform when you suspect that 
there is a native memory leak. This suspicion is based upon your analysis of the data collected from technote: 
MustGather: Out of Memory errors on AIX, Part 1.

If you already contacted support, continue to the component-specific MustGather information. Otherwise, click: MustGather: 
Read first for all WebSphere Application Server products.

Out of Memory (native leak) specific MustGather information:


Set up, before problem occurs:

Enabling MMAP for Java? heap:

For V6.0 release:
To enable the MMAP Java heap, add the following line to the top of the startServer.sh file 
in the install_root/profiles/profile_name/bin directory:

export IBM_JAVA_MMAP_JAVA_HEAP=true


For V5.0 and V5.1 releases:

To enable the MMAP Java heap, add the following line to the top of the startServer.sh file in the install_root/bin directory:

export IBM_JAVA_MMAP_JAVA_HEAP=true

For V3.5 and V4.0 releases:

To enable the MMAP Java heap, add the following line to the top of the startupServer.sh file in the install_root/bin directory:

export IBM_JAVA_MMAP_JAVA_HEAP=true


Changing the Native Heap:
For releases of V6.0:

Verifiy the Java Virtual Machine (JVM?) is 32-bit, since that is what WebSphere Application Server supports.
Issue bootinfo -K from the AIX command line.
Add the following line to the top of the startServer.sh file in the install_root/profiles/profile_name/bin directory:

export LDR_CNTRL=MAXDATA=0xn0000000

where n= (10 - round(Xmx/256)

For releases of V5.0 and V5.1: 

Verifiy the JVM is 32-bit, since that is what WebSphere Application Server supports.

Issue bootinfo -K from the AIX command line.
Add the following line to the top of the startServer.sh file in the install_root/bin directory:

export LDR_CNTRL=MAXDATA=0xn0000000

where n= (10 - round(Xmx/256)


For releases of V3.5 and V4.0:
Verifiy the JVM is 32-bit, since that is what WebSphere Application Server supports.
Issue bootinfo -k from the AIX command line.
Add the following line to the top of the startupServer.sh file in the install_root/bin directory:

export LDR_CNTRL=MAXDATA=0xn0000000

where n= (10 - round(Xmx/256))


Examples for all releases
Xmx = 1024
n = (10 - round(1024/256))
n = (10 - round(4))
n = (10 - 4)
n = 6


Xmx = 640
n = (10 - round(640/256))
n = (10 - round(2.5))
n = (10 - 3)
n = 7


Follow instructions for Enabling verbosegc in WebSphere Application Server.


Clear all application server log files before starting the Application Server to capture data.


After starting the Application Server, run the AIX_433_memory_leak.sh or AIX_51_memory_leak.sh shell script 
(see attached ;script for your version of AIX) while trying to reproduce the problem. 

AIX_memory_leak.sh -p pid-f base_filename-i interval

Where: 
pid is the process id to be profiled
base_filename is a path and identifier for the output
interval is the time in seconds between each iteration

Example:
AIX_memory_leak.sh -p 123456 -f /logs/leak -i 120

After problem re-occurs:

Stop the Application Server and the script.

Collect the following:
For V6.0 release:
All files in install_root/profiles/profile_name/logs/server_name directory. 
All files in install_root/profiles/profile_name/logs/ffdc directory.
A copy of server.xml located in install_root/profiles/profile_name/config/cells/cell_name/nodes/node_name/servers/server_namedirectory.


For V5.0 and V5.1 releases:
Include all of the files from the install_root/logs/server_namedirectory. 
All files in install_root/logs/ffdc directory.
A copy of server.xml located at install_root/config/cells/cell_name/nodes/node_name/servers/server_name directory


For V3.5 and V4.0 releases:
Include all of the files from the install_root/logs directory.
A copy of XMLExport for the server configuration


For all releases:
If you have configured the application server to write logs into a different location, send them accordingly.
The log file output from memory leak script.
List of all other software and version information on the system. For example, databases, WebSphere MQ, and so forth.


Follow instructions to send diagnostic information to IBM support.


For a listing of all technotes, downloads, and educational materials specific to the Out of Memory component, 
search the WebSphere Application Server support site. 


-----
Note:
-----

Java� virtual machine (JVM�) crash on Solaris� throws java.lang.OutOfMemoryError exception 
 
 Technote (FAQ) 
  
Problem 
The Application Server crashes with a SIGSEGV and the pstack output is corrupted. 
The pmap shows that the [heap] has exceeded 2GB. 

The hs_err_pid*.log file shows the following: 

An unexpected exception has been detected in native code outside the VM. 
Unexpected Signal : 11 occurred at PC=0xff330924 
Function name=memcpy 
Library=/usr/platform/SUNW,Ultra-80/lib/libc_psr.so.1  
  
Cause 
This is a bug in the HotSpot server mode and there is a Sun defect opened, reference Sun Bug Id 4724509: 
There can be other reasons that the core is so large, like a native memory leak, possibly with the DB drivers.

The Sun JDK� 1.3.1 doesn't always produce a Java� thread dump when a crash occurs, especially when the crash is inside the HotSpot library. 

Below, the pmap command output shows that the native heap has grown to over 2 GB. Such 
a large heap size is unusual for a Java application and can lead to memory problems, as in a crash in the C runtime library (libc) function memory: 


core 'core1' of 16996: /opt/WebSphere/AppServer/java/jre/bin/../bin/sparc/native_threads/java 
00010000 24K read/exec /opt/WebSphere/AppServer/java/jre/bin/sparc/native_threads/java 
00024000 8K read/write/exec /opt/WebSphere/AppServer/java/jre/bin/sparc/native_threads/java 
00026000 2088888K read/write/exec [ heap ]

A known bug in Java HotSpot Server mode might cause this large heap. In rare circumstances, the compiler gets into 
an expansion loop and asks for larger and larger amounts of heap memory on malloc (memory allocation) calls within 
the loop that is expanding the heap size. When this happens, a java.lang.OutOfMemoryError with a large "CompileThread0" 
request is logged in an hs_err_pid log and a pmap of the core file generated shows a heap size that is unusually large, 
frequently 2000 MB or more. 
  
Solution 
There are several possible ways to resolve this problem. 

Switch to the Java HotSpot Client VM. Refer to the technote, Setting up a HotSpot server or client mode on a Java 2 SDK 

Note: Application Servers in V4.0 releases run in HotSpot server mode by default. Application Servers in V5.x releases run in Hotspot client mode by default. 


If it is not an option to switch to the client HotSpot compiler, identify the methods causing the crash and skip from HotSpot comp. 

Refer to technote Debugging HotSpot failures on Solaris 


Upgrade to JDK 1.3.1_08 or later. The fix for SunBug 4724509 is in JDK 1.3.1_08. http://developer.java.sun.com/developer/bugParade/bugs/4724509.html 

Note: Sometimes this upgrade does not resolve this problem. 


If a java.lang.OutOfMemoryError is issued and this upgrade does not solve the problem, refer to this technote: JAVA.LANG.OUTOFMEMORYERRORs (Solaris).


Document Information 


Product categories: Software > Application Servers > Distributed Application & Web Servers > WebSphere Application Server > Java SDK 
Operating system(s): Solaris 
Software version: 5.1 
Software edition: 
Reference #: 1159828 
IBM Group: Software Group 
Modified date: Aug 9, 2004 


-----
Note:
-----


-----
Note:
-----


-----
Note:
-----


-----
Note:
-----


-----
Note:
-----


-----
Note:
-----


-----
Note:
-----


-----
Note:
-----


-----
Note:
-----


-----
Note:
-----


-----
Note:
-----


-----
Note:
-----


-----
Note:
-----


-----
Note:
-----


-----
Note:
-----


-----
Note:
-----


-----
Note:
-----


-----
Note:
-----


-----
Note:
-----


-----
Note:
-----


-----
Note:
-----


-----
Note:
-----


-----
Note:
-----


EOF