Friday, April 3, 2009

SUN Solaris

Solaris Boot Sequence :-

Solaris 10 :-

Solaris 10 boot process is divided in 5 phases :-

1)Boot PROM phase:
  • PROM runs POST;
  • boot with local boot-device;
  • boot reads boot block;
  • boot load boot block

2)Boot Program phase:

  • boot block loads secondary boot program(ufsboot);
  • ufsboot loads kernel

3)Kernel Initialization phase:

  • kernel reads /etc/system;
  • kernel load modules

4)Init phase:

  • kernel starts /etc/init;
  • init starts svc.startd process

5)svc.startd phase:

  • svc.startd starts system processes



Solaris SPARC Boot Sequence :-


The following represents a summary of the boot process for a Solaris x.x system on Sparc hardware.

Power On: Depending on the system involved, you may see some output on a serial terminal immediately after power on. This may take the form of a Hardware Power ON message on a large Enterprise server, or a "'" or "," in the case of an older Ultra system. These indications will not be present on a monitor connected directly to the server.

POST: If the PROM diag-switch? parameter is set to true, output from the POST (Power On Self Test) will be viewable on a serial terminal. The PROM diag-level parameter determines the extent of the POST tests. If a serial terminal is not connected, a prtdiag -v will show the results of the POST once the system has booted. If a keyboard is connected, it will beep and the keyboard lights will flash during POST. If the POST fails, an error indication may be displayed following the failure.

Init System: The "Init System" process can be broken down into several discrete parts:

  • OBP: If diag-switch? is set, an Entering OBP message will be seen on a serial terminal. The MMU (memory management unit) is enabled.
  • NVRAM: If use-nvramrc? is set to true, read the NVRAMRC. This may contain information about boot devices, especially where the boot disk has been encapsulated with VxVM or DiskSuite.
  • Probe All: This includes checking for SCSI or other disk drives and devices.
  • Install Console: At this point, a directly connected monitor and keyboard will become active, or the serial port will become the system console access. If a keyboard is connected to the system, the lights will flash again during this step.
  • Banner: The PROM banner will be displayed. This banner includes a logo, system type, PROM revision level, the ethernet address, and the hostid.
  • Create Device Tree: The hardware device tree will be built. This device tree can be explored using PROM monitor commands at the ok> prompt, or by using prtconf once the system has been booted.

Extended Diagnostics: If diag-switch? and diag-level are set, additional diagnostics will appear on the system console.
auto-boot?: If the auto-boot? PROM parameter is set, the boot process will begin. Otherwise, the system will drop to the ok> PROM monitor prompt, or (if sunmon-compat? and security-mode are set) the > security prompt.


The boot process will use the boot-device and boot-file PROM parameters unless diag-switch? is set. In this case, the boot process will use the diag-device and diag-file.


bootblk: The OBP (Open Boot PROM) program loads the bootblk primary boot program from the boot-device (or diag-device, if diag-switch? is set). If the bootblk is not present or needs to be regenerated, it can be installed by running the installboot command after booting from a CDROM or the network. A copy of the bootblk is available at /usr/platform/`arch -k`/lib/fs/ufs/bootblk
ufsboot: The secondary boot program, /platform/`arch -k`/ufsboot is run. This program loads the kernel core image files. If this file is corrupted or missing, a bootblk: can't find the boot program or similar error message will be returned.

kernel: The kernel is loaded and run.

For 32-bit Solaris systems, the relevant files are:
/platform/`arch -k`/kernel/unix
/kernel/genunix

For 64-bit Solaris systems, the files are:
/platform/`arch -k`/kernel/sparcV9/unix
/kernel/genunix

  • As part of the kernel loading process, the kernel banner is displayed to the screen.
  • The kernel initializes itself and begins loading modules, reading the files with the ufsboot program until it has loaded enough modules to mount the root filesystem itself.
  • At that point, ufsboot is unmapped and the kernel uses its own drivers. If the system complains about not being able to write to the root filesystem, it is stuck in this part of the boot process.

The boot -a command singlesteps through this portion of the boot process. This can be a useful diagnostic procedure if the kernel is not loading properly.


/etc/system: The /etc/system file is read by the kernel, and the system parameters are set.
The following types of customization are available in the /etc/system file:

  • moddir: Changes path of kernel modules.
  • forceload: Forces loading of a kernel module.
  • exclude: Excludes a particular kernel module.
  • rootfs: Specify the system type for the root file system. (ufs is the default.)
  • rootdev: Specify the physical device path for root.
  • set: Set the value of a tuneable system parameter.


If the /etc/system file is edited, it is strongly recommended that a copy of the working file be made to a well-known location. In the event that the new /etc/system file renders the system unbootable, it might be possible to bring the system up with a boot -a command that specifies the old file. If this has not been done, the system may need to be booted from CD or network so that the file can be mounted and edited.


kernel initialized: The kernel creates PID 0 ( sched). The sched process is sometimes called the "swapper."
init: The kernel starts PID 1 (init).
init: The init process reads the /etc/inittab and /etc/default/init and follows the instructions in those files. Some of the entries in the /etc/inittab are:

  • fs: sysinit (usually /etc/rcS)
  • is: default init level (usually 3, sometimes 2)
  • s#: script associated with a run level (usually /sbin/rc#)


rc scripts: The rc scripts execute the files in the /etc/rc#.d directories. They are run by the /sbin/rc# scripts, each of which corresponds to a run level.

This Completes the BootUp process.

SUN Clusters

Thursday, April 2, 2009

HP-UX

HP-UX Boot Sequence


Boot Sequence: Quick Reference :-

On a server without vPars, a simplified boot sequence is:

1. ISL (Initial System Loader)
2. hpux (secondary system loader)
3. /stand/vmunix (kernel)

Adding vPars adds the monitor layer, so now hpux loads the monitor and then the monitor boots the kernels of the virtual partitions. The boot sequence becomes

1. ISL
2. hpux
3. /stand/vpmon (vPars monitor and partition database)
4. /stand/vmunix (kernels of the virtual partitions)


Boot Sequence: The Details :-

With or without vPars, the firmware loads and launches ISL.

ISL>
In a server without vPars, at the ISL prompt, the secondary system loader hpux loads the kernel /stand/vmunix:

ISL> hpux /stand/vmunix

However, in a server with vPars, at the ISL prompt, the secondary system loader hpux loads the vPars monitor /stand/vpmon:

ISL> hpux /stand/vpmon

The monitor loads the partition database (the default is /stand/vpdb) from the same disk that /stand/vpmon was booted. The monitor internally creates (but does not boot) each virtual partition according to the resource assignments in the partition database.

Next, the vPars monitor runs in interactive mode (when no options to /stand/vpmon are given) with a command line interface.

MON>

To boot a kernel in a virtual partition (that is, to launch a virtual partition), use the monitor command vparload. For example, to launch the virtual partition named ABCxyz:

MON> vparload -p ABCxyz

In this example, the vPars monitor would load the virtual partition ABCxyz and launch the kernel from the boot device specified for ABCxyz.

(The boot device is assigned when the virtual partition is created and is recorded in the monitor database.)

HP-UX is now booted on the virtual partition ABCxyz.

Once a virtual partition is running, you will be at the virtual console of a virtual partition. Subsequent virtual partitions can be booted using the vPars command vparboot at the UNIX shell prompt of ABCxyz.


Boot Sequence Difference between PA-RISC and Integrity Architectures :-


On a server without vPars, a simplified boot sequence is as follows.

PA-RISC Architecture :-

1. ISL(Initial System Loader)
2. hpux(secondary system loader)
3. /stand/vmunix(kernel)

Integrity Architecture (IA64) :-

1. EFI(Extensible Firmware Interface)
2. hpux.efi (HP-UX boot loader)
3. /stand/vmunix

Adding vPars adds the Monitor layer, so now hpux(for Integrity, hpux.efi) loads the Monitor. Then the Monitor boots the kernels of the virtual partitions. The boot sequence becomes the following.

1. ISL or EFI (firmware)
2. hpux or hpux.efi
3. /stand/vpmon (vPars Monitor and partition database)
4. /stand/vmunix (kernels of the virtual partitions)

M/C Service Guard

Veritas Cluster

Overview

Veritas Cluster enables one system to failover to the other system. All related software processes are simply moved from one system to the other system with minimal downtime.

Cluster Startup :-

Here is what the cluster does at startup:-

-Node checks if other node is already started, if so -- stays OFFLINE
-If no other machine is running, checks communication (gabconfig). May need system admin intervention if cluster requires both nodes to be available. (/sbin/gabconfig -c -x)
-Once communication between machines is open -- or gabconfig has been started, it sets up network (nic & ip adddress) (starts cluster server) .

If also brings up volume manager, file system, and then Applications.

File Locations (Logs, Conf, Executables):-

Log location: /var/VRTSvcs/log

There are several logs in this directory:-

engine.log_A: primary log, usually what you will be reading for debugging.

Conf files:-

Llt conf: /etc/llttab [should NOT need to access this]
Network conf: /etc/gabtab
Cluster conf: /etc/VRTSvcs/conf/config/main.cf (Has exact details on what the cluster contains. )

Most executables are in: /opt/VRTSvcs/bin or /sbin

Changing Configurations :-

ALWAYS be very careful when changing the cluster configurations.

There are two ways of changing the configurations.
The method one uses if the system is up (cluster is running on at least one node, preferably on both):
  1. haconf -makerw
  2. run needed commands (ie. hasys ....)
  3. haconf -dump -makero


If both systems are down: -

hastop -all (shouldn't need this as cluster is down)

cp main.cf main.cf.%date%

vi main.cf

hacf -verify /etc/VRTSvcs/conf/config

hacf -generate /etc/VRTSvcs/conf/config

hastart

Veritas Cluster Debugging Tips :-


The normal debugging of steps includes: checking on status, restarting if no faults, checking licenses, clearing faults if needed, and checking logs.


To find out Current Status:-
/opt/VRTSvcs/bin/hastatus -summary This will give the general status of each machine and processes
/opt/VRTSvcs/bin/hares -display This gives much more detail - down to the resource level.
If hastatus fails on both machines (it returns that the cluster is not up or returns nothing), try to start the cluster
/opt/VRTSvcs/bin/hastart
/opt/VRTSvcs/bin/hastatus -summary will tell you if processes started properly. It will NOT start processes on a FAULTED system.

To check licenses:

vxlicense -p

Make sure all licenses are current - and NOT expired! If they are expired, that is your problem. Call VERITAS to get temporary licenses.

There is a BUG with veritas licences. Veritas will not run if there are ANY expired licenses -- even if you have the valid ones you need. To get veritas to run, you will need to MOVE the expired licenses.

vxlicense -p

Note the NUMBER after the license (ie: Feature name: DATABASE_EDITION [100])


cd /etc/vx/elm

mkdir oldmv lic.number old [do this for all expired licenses]

vxlicense -p [Make sure there are no expired licenses AND your good licenses are there]

hastart
If still fails, call veritas for temp licenses. Otherwise, be certain to do the same on your second machine.


To clear FAULTS:


hares -display
For each resource that is faulted run:

hares -clear resource-name -sys faulted-system

If all of these clear, then run hastatus -summary and make sure that these are clear. If some don't clear you MAY be able to clear them on the group level. Only do this as last resort:

hagrp -disableresources groupname
hagrp -flush group -sys sysname
hagrp -enableresources groupname

To get a group to go online:

hagrp -online group -sys desired-system

Unix Flavours :-

HP unveils HP-UX 11i v3 server OS


New architecture supports 100 million zetta bytes of storage.

HP has announced the launch of HP-UX 11i v3, the latest version of its Unix-based server operating system.

Despite the increase in Linux servers, HP is committed to the HP-UX operating system, and plans releases every two to three years in line with its publicly available roadmap.

The vendor is placing a heavy focus on total cost of ownership, virtualisation and dynamic resource management.

HP claims an average 30 per cent performance increase in applications running HP-UX 11i v3 simply by upgrading and without any recompiling.

This is down to tweaks in threading and a complete rewrite of the storage stack. The biggest performance gains can be seen in Java applications.

HP director Nick van der Zweep said at a press conference in San Francisco: "There really is no need to recompile your applications to experience an improvement in operational performance."

"The new version is completely code-compatible with anything that currently runs properly on version 2."

The complete overall of the mass storage stack means that the architecture now supports the addressing of up to 100 million zettabytes of highly available, secure storage.

One zettabyte equals one billion terabytes, so the capacity represents effectively limitless data storage.

HP-UX 11i v3 also promises many new virtualisation features, allowing administrators to dynamically move memory and resources among distributed virtual partitions on the fly with no disruption to users.

Similarly, in the event of planned or unplanned downtime, instances can be moved to another available server or cluster to help maximise uptime and server utilisation.
Uptime and stability is also enhanced with the ability to hot-swap memory, processors and I/O cards.

A new Software Assistant simplifies patch and security bulletin management and patch deployment, while a Dynamic Root Disk system administration toolset enables online patching by running an image of the system while patches are installed.

Followers