Total Pageviews

Thursday, 20 October 2016

Using ASR (Automatic System Recovery) to enable/disable hardware !!!

Generally coming to OBP, we know many regular commands like probe-scsi, probe-scsi-all, reset-all, devalias, show-devs. We also have some commands which are used to troubleshoot hardware issues. In this post we are going to discuss about "asr" command usage.

We can disable or enable cpu/motherboards by using " ASR " command.

ASR stands for Automatic System Recovery. Using this we can disable hardware directly from the OBP.

In my scenario, one of our server is continuously rebooting (panic), then I took login from console to start troubleshooting...

rsc>
rsc>
rsc> console
rsc>
rsc>
rsc>               THIS IS THE SCENARIO SERVER IS BOOTING.....
rsc>
rsc>
rsc>
SUNW-MSG-ID: SUNOS-8000-0G, TYPE: Error, VER: 1, SEVERITY: Major
EVENT-TIME: 0x54d12ddd.0xde0bcc8 (0x963e9d9a84)
PLATFORM: SUNW,Sun-Fire-V490, CSN: -, HOSTNAME: SRVR
SOURCE: SunOS, REV: 5.10 Generic_147440-27
DESC: Errors have been detected that require a reboot to ensure system
integrity.  See http://www.sun.com/msg/SUNOS-8000-0G for more information.
AUTO-RESPONSE: Solaris will attempt to save and diagnose the error telemetry
IMPACT: The system will sync files, save a crash dump if needed, and reboot
REC-ACTION: Save the error summary below in case telemetry cannot be saved

panic[cpu2]/thread=3000ccd7200: UE WDU Error(s)

!!!!!!!!!!!!!!!!!!!

000002a100f778d0 unix:pagefault+ac (1036e6000, 0, 2, 0, 60035099778, 2)
  %l0-3: 0000000000000004 0000000000000000 0000030003c36000 0000000000000000
  %l4-7: 000000000183e400 000000000183b400 0000000000000000 000006003b4d2cd0
000002a100f77990 unix:trap+d50 (2a100f77b90, 1036e64b9, 0, 2, ffffffff7826ff1c, 0)
  %l0-3: 0000000000000000 000006003b4d2cd0 0000000000010031 00000600353d2fa8
  %l4-7: 0000000000000000 0000000000010034 0000000000010000 000006003b4d2eb0

syncing file systems... [1] 112 [1] 74 [1] 4 [1] 4 [1] 4 [1] 4 [1] ...... 4 [1] 4 done (not all i/o completed)
ereport.cpu.ultraSPARC-IVplus.edu-st ena=963cb72b2c00801 detector=[ version=1
 scheme="cpu" cpuid=2 cpumask=22 serial="80001A58E75C3807" ] afsr=
 10000a00000003 afsr-ext=0 afar-status=1 afar=a3c9ea0250 pc=12a3870 tl=0 tt=63
 privileged=1 multiple=0 syndrome-status=1 syndrome=3 l3-cache-ways=4
 l3-cache-data=[...] l2-cache-ways=1 l2-cache-data=[...] dcache-ways=0
 icache-ways=0 resource=[ version=1 scheme="cpu" cpuid=2 cpumask=22 serial=
 "80001A58E75C3807" ]

...... OUTPUT TRUNCATED .........

ereport.cpu.ultraSPARC-IVplus.ce ena=963e9ce73804001 detector=[ version=1
 scheme="cpu" cpuid=10 cpumask=22 serial="80001A58C75C3807" ] afsr=
 100002000000b0 afsr-ext=0 afar-status=1 afar=a3e7122f40 pc=0 tl=0 tt=0
 privileged=1 multiple=0 syndrome-status=1 syndrome=b0 error-type="U"
 error-disposition=0 l3-cache-ways=0 l2-cache-ways=0 dcache-ways=0 icache-ways=
 0 resource=[ version=0 scheme="mem" unum="Slot A: J8001" ]

dumping to /dev/md/dsk/d20, offset 6873219072, content: kernel
 0:11 100% done
100% done: 169129 pages dumped, dump succeeded
rebooting...

Resetting ...

!!!!!!!!!!!!!!!!!!

RSC Alert: Host System has Reset

<*>
Software Reset

@(#)OBP 4.30.4.c 2010/09/29 09:42 Sun Fire 4XX
Online:  CMP0 UltraSPARC IV+ (v2.2) 10:1 1500MHz 32MB 5:1 ECache
Online:  CMP1 UltraSPARC IV+ (v2.2) 10:1 1500MHz 32MB 5:1 ECache
Online:  CMP2 UltraSPARC IV+ (v2.2) 10:1 1500MHz 32MB 5:1 ECache
Online: *CMP3 UltraSPARC IV+ (v2.2) 10:1 1500MHz 32MB 5:1 ECache
Skipping POST.
Enabling system bus....... CMP0 CMP1 CMP2 CMP3 Done
Init ICache/etc........... CMP0 CMP1 CMP2 CMP3 Done
Init ECache Tags.......... CMP0 CMP1 CMP2 CMP3 Done
Clearing TLBs............. CMP0 CMP1 CMP2 CMP3 Done
Setup I/DTLBs............. CMP0 CMP1 CMP2 CMP3 Done
Enabling Cache/MMUs....... CMP0 CMP1 CMP2 CMP3 Done
Init ECache Data.......... CMP0 CMP1 CMP2 CMP3 Done
Zeroing memory...Done
Copying FLASHRAM to memory...Verifying base 128KB...Done
Jumping into RAM (leaving slave CPUs in ROM)
RAM CRC = 0000.0000.b81b.5f23;  ROM CRC = 0000.0000.b81b.5f23
Dropping in...
Find dropin, Decompressing Done, Size 0000.0000.0007.fd30 (512KB)
Slave CPUs starting Forth at 0000.0000.f000.00e0
Boot  CPU3 starting Forth at 0000.0000.f000.00e0
Diagnostic console initialized
Configure root name: SUNW,Sun-Fire-V490
Probing system devices
(1500 MHz @ 10:1, 16 MB) /: gptwo at 0,0 cmp cpu cpu memory-controller
(1500 MHz @ 10:1, 16 MB) /: gptwo at 1,0 cmp cpu cpu memory-controller
(1500 MHz @ 10:1, 16 MB) /: gptwo at 2,0 cmp cpu cpu memory-controller
(1500 MHz @ 10:1, 16 MB) /: gptwo at 3,0 cmp cpu cpu memory-controller
/: gptwo at 4,0 Nothing there
/: gptwo at 5,0 Nothing there
/: gptwo at 6,0 Nothing there
/: gptwo at 7,0 Nothing there
/: gptwo at 8,0 pci pci
/: gptwo at 9,0 pci pci
Loading Support Packages: obp-tftp kbd-translator SUNW,i2c-ram-device SUNW,fru-device
Loading onboard drivers: ebus
/pci@9,700000/ebus@1: flashprom bbc power i2c i2c rtc gpio pmc rsc-control rsc-console serial
/pci@9,700000/ebus@1/i2c@1,2e: fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru nvram idprom fru fru
/pci@9,700000/ebus@1/i2c@1,30: temperature temperature temperature ioexp ioexp ioexp temperature ioexp ioexp ioexp ioexp temperature-sensor fru fru fru fru fru rscrtc
/memory: CMP0 Bank0  512 +  512 +  512 +  512 :    2 GB @  a000000000  8-way #0
/memory: CMP0 Bank1  512 +  512 +  512 +  512 :    2 GB @  a000000000  8-way #2
/memory: CMP0 Bank2  512 +  512 +  512 +  512 :    2 GB @  a000000000  8-way #4
/memory: CMP0 Bank3  512 +  512 +  512 +  512 :    2 GB @  a000000000  8-way #6
/memory: CMP1 Bank0  512 +  512 +  512 +  512 :    2 GB @  b000000000  8-way #0
/memory: CMP1 Bank1  512 +  512 +  512 +  512 :    2 GB @  b000000000  8-way #2
/memory: CMP1 Bank2  512 +  512 +  512 +  512 :    2 GB @  b000000000  8-way #4
/memory: CMP1 Bank3  512 +  512 +  512 +  512 :    2 GB @  b000000000  8-way #6
/memory: CMP2 Bank0  512 +  512 +  512 +  512 :    2 GB @  a000000000  8-way #1
/memory: CMP2 Bank1  512 +  512 +  512 +  512 :    2 GB @  a000000000  8-way #3
/memory: CMP2 Bank2  512 +  512 +  512 +  512 :    2 GB @  a000000000  8-way #5
/memory: CMP2 Bank3  512 +  512 +  512 +  512 :    2 GB @  a000000000  8-way #7
/memory: CMP3 Bank0  512 +  512 +  512 +  512 :    2 GB @  b000000000  8-way #1
/memory: CMP3 Bank1  512 +  512 +  512 +  512 :    2 GB @  b000000000  8-way #3
/memory: CMP3 Bank2  512 +  512 +  512 +  512 :    2 GB @  b000000000  8-way #5
/memory: CMP3 Bank3  512 +  512 +  512 +  512 :    2 GB @  b000000000  8-way #7
ChassisSerialNumber 0412KM4616
Probing I/O buses
/pci@8,600000: Device 1 Nothing there
/pci@8,600000: Device 2 Nothing there
/pci@8,700000: Device 2 Nothing there
/pci@8,700000: Device 3 QLGC,qlc fp disk QLGC,qlc fp disk
/pci@8,700000: Device 4 Nothing there
/pci@8,700000: Device 5 QLGC,qlc fp disk QLGC,qlc fp disk
/pci@8,700000: Device 6 ide disk cdrom
/pci@9,600000: Device 1 network
/pci@9,600000: Device 2 SUNW,qlc fp disk
/pci@9,700000: Device 1 usb
/pci@9,700000: Device 2 network
Configure root name: SUNW,Sun-Fire-V490           

UNABLE TO TRUNCATE OUTPUT AS IT IS NECESSARY TO UNDERSTAND.....

Probing system devices
(1500 MHz @ 10:1, 16 MB) /: gptwo at 0,0 cmp cpu cpu memory-controller
(1500 MHz @ 10:1, 16 MB) /: gptwo at 1,0 cmp cpu cpu memory-controller
(1500 MHz @ 10:1, 16 MB) /: gptwo at 2,0 cmp cpu cpu memory-controller
(1500 MHz @ 10:1, 16 MB) /: gptwo at 3,0 cmp cpu cpu memory-controller
/: gptwo at 4,0 Nothing there
/: gptwo at 5,0 Nothing there
/: gptwo at 6,0 Nothing there
/: gptwo at 7,0 Nothing there
/: gptwo at 8,0 pci pci
/: gptwo at 9,0 pci pci
Loading Support Packages: obp-tftp kbd-translator SUNW,i2c-ram-device SUNW,fru-device
Loading onboard drivers: ebus
/pci@9,700000/ebus@1: flashprom bbc power i2c i2c rtc gpio pmc rsc-control rsc-console serial
/pci@9,700000/ebus@1/i2c@1,2e: fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru fru nvram idprom fru fru
/pci@9,700000/ebus@1/i2c@1,30: temperature temperature temperature ioexp ioexp ioexp temperature ioexp ioexp ioexp ioexp temperature-sensor fru fru fru fru fru rscrtc
/memory: CMP0 Bank0  512 +  512 +  512 +  512 :    2 GB @  a000000000  8-way #0
/memory: CMP0 Bank1  512 +  512 +  512 +  512 :    2 GB @  a000000000  8-way #2
/memory: CMP0 Bank2  512 +  512 +  512 +  512 :    2 GB @  a000000000  8-way #4
/memory: CMP0 Bank3  512 +  512 +  512 +  512 :    2 GB @  a000000000  8-way #6
/memory: CMP1 Bank0  512 +  512 +  512 +  512 :    2 GB @  b000000000  8-way #0
/memory: CMP1 Bank1  512 +  512 +  512 +  512 :    2 GB @  b000000000  8-way #2
/memory: CMP1 Bank2  512 +  512 +  512 +  512 :    2 GB @  b000000000  8-way #4
/memory: CMP1 Bank3  512 +  512 +  512 +  512 :    2 GB @  b000000000  8-way #6
/memory: CMP2 Bank0  512 +  512 +  512 +  512 :    2 GB @  a000000000  8-way #1
/memory: CMP2 Bank1  512 +  512 +  512 +  512 :    2 GB @  a000000000  8-way #3
/memory: CMP2 Bank2  512 +  512 +  512 +  512 :    2 GB @  a000000000  8-way #5
/memory: CMP2 Bank3  512 +  512 +  512 +  512 :    2 GB @  a000000000  8-way #7
/memory: CMP3 Bank0  512 +  512 +  512 +  512 :    2 GB @  b000000000  8-way #1
/memory: CMP3 Bank1  512 +  512 +  512 +  512 :    2 GB @  b000000000  8-way #3
/memory: CMP3 Bank2  512 +  512 +  512 +  512 :    2 GB @  b000000000  8-way #5
/memory: CMP3 Bank3  512 +  512 +  512 +  512 :    2 GB @  b000000000  8-way #7
ChassisSerialNumber 0412KM4616
Probing I/O buses
/pci@8,600000: Device 1 Nothing there
/pci@8,600000: Device 2 Nothing there
/pci@8,700000: Device 2 Nothing there
/pci@8,700000: Device 3 QLGC,qlc fp disk QLGC,qlc fp disk
/pci@8,700000: Device 4 Nothing there
/pci@8,700000: Device 5 QLGC,qlc fp disk QLGC,qlc fp disk
/pci@8,700000: Device 6 ide disk cdrom
/pci@9,600000: Device 1 network
/pci@9,600000: Device 2 SUNW,qlc fp disk
/pci@9,700000: Device 1 usb
/pci@9,700000: Device 2 network

Sun Fire V490, No Keyboard
Copyright (c) 1998, 2010, Oracle and/or its affiliates. All rights reserved.
OpenBoot 4.30.4.c, 32768 MB memory installed, Serial #70989125.
Ethernet address 0:14:4f:3b:35:45, Host ID: 843b3545.

Creating CMP memory layout properties.

Reading temperature limits from FRUPROMs: CMP0/2 CMP1/3 BACKPLANE

Environmental monitor is ON
Rebooting with command: boot
Boot device: rootdisk  File and args:
SunOS Release 5.10 Version Generic_147440-27 64-bit
Copyright (c) 1983, 2012, Oracle and/or its affiliates. All rights reserved.
Hostname: SRVR

........... OUTPUT TRUNCATED ............

SRVR console login: root        ---------- AS YOU CAN SEE BEFORE TAKING LOGIN , IT STARTED BOOTING AGAIN ....

SUNW-MSG-ID: SUNOS-8000-0G, TYPE: Error, VER: 1, SEVERITY: Major
EVENT-TIME: 0x54d12fbb.0x237c8904 (0x5af9a9771c)
PLATFORM: SUNW,Sun-Fire-V490, CSN: -, HOSTNAME: SRVR
SOURCE: SunOS, REV: 5.10 Generic_147440-27
DESC: Errors have been detected that require a reboot to ensure system
integrity.  See http://www.sun.com/msg/SUNOS-8000-0G for more information.
AUTO-RESPONSE: Solaris will attempt to save and diagnose the error telemetry
IMPACT: The system will sync files, save a crash dump if needed, and reboot
REC-ACTION: Save the error summary below in case telemetry cannot be saved

...... OUTPUT TRUNCATED .........

panic[cpu19]/thread=2a100a33c80: Fatal PCI UE Error

000002a100a5bd40 pcisch:ecc_intr+84 (6003340b920, 4, 6003340b910, fffffffffffffff8, 6003340b910, 3000010de38)
  %l0-3: 0000000000000000 0000000000004000 0000000000000000 0000000000000004
  %l4-7: 000000007be12800 00000000018e8c00 ffffffffffffffff 000003000010de98
000002a100a5bf50 unix:current_thread+164 (2, 180c000, 180c000, 0, ffffffffffffffff, 0)
  %l0-3: 000000000100777c 000002a100a32fe1 000000000000000e 0000000070008100
  %l4-7: 0000000000000000 0000000000000000 0000000000000000 000002a100a33890
000002a100a33930 0 (30003c5c000, 184c9d0, 0, ffffffffffffffff, 420d, 1814800)
  %l0-3: 0000000000000000 0000000000000001 0000000000000001 0000000000000000
  %l4-7: 0000000001000000 0000000000000002 0000030003c5c178 000000000000e193
000002a100a339e0 unix:idle+d4 (1814800, 0, 30003c5c000, ffffffffffffffff, 8, 1813000)
  %l0-3: 000006003492eff8 000000000000001b 0000000000000000 ffffffffffffffff
  %l4-7: 000006003492eff8 ffffffffffffffff 000000000184c9d0 0000000001063a88

syncing file systems... [2] 205 [2] 164 [2] 127 [2] 127 [2] 127 [2] 127rsc>
rsc>
rsc>

THIS TIME I AM LEFT WITH ONLY OPTION IS TO BREAK THE SYSTEM, USUALLY WHICH WE SHOULD NOT PREFER.

rsc> break
rsc>
rsc>
rsc> break
rsc>
rsc>
rsc> console

{13} ok
{13} ok
{13} ok
{13} ok
panic[cpu19]/thread=2a100a33c80: panic sync timeout
ereport.io.pci.sta ena=5af9a4175404c01 detector=[ version=0 scheme="dev"
 device-path="/pci@9,600000" ] pci-status=aa0 pci-command=146 pci-pa=4000c0

!!!!!!!!!!!!!!!!

ereport.cpu.ultraSPARC-IVplus.ce ena=5af9a8d11800401 detector=[ version=1
 scheme="cpu" cpuid=1 cpumask=22 serial="80010220E95CB6CF" ] afsr=
 100002000000b0 afsr-ext=0 afar-status=1 afar=a3fd0d3940 pc=0 tl=0 tt=0
 privileged=1 multiple=0 syndrome-status=1 syndrome=b0 error-type="U"
 error-disposition=0 l3-cache-ways=0 l2-cache-ways=0 dcache-ways=0 icache-ways=
 0 resource=[ version=0 scheme="mem" unum="Slot A: J8001" ]

dumping to /dev/md/dsk/d20, offset 6873219072, content: kernel

panic[cpu19]/thread=2a100a33c80: BAD TRAP: type=31 rp=fff53cd0 addr=d61c6500d00c650 mmu_fsr=0
dump aborted: please record the above information!
rebooting...
No space left in device
Resetting ...

...... OUTPUT TRUNCATED .........

RSC Alert: Host System has Reset

<*>
Software Reset

@(#)OBP 4.30.4.c 2010/09/29 09:42 Sun Fire 4XX

........... OUTPUT TRUNCATED ............

!!!!!!!!!!!!!!!!!!!!!!

Sun Fire V490, No Keyboard
Copyright (c) 1998, 2010, Oracle and/or its affiliates. All rights reserved.
OpenBoot 4.30.4.c, 32768 MB memory installed, Serial #70989125.
Ethernet address 0:14:4f:3b:35:45, Host ID: 843b3545.

Creating CMP memory layout properties.

Reading temperature limits from FRUPROMs: CMP0/2 CMP1/3 BACKPLANE

Environmental monitor is ON
{3} ok
{3} ok
{3} ok    ------ FINALLY TO OBP ....
{3} ok
{3} ok
{3} ok
{3} ok .asr       ----- command to check status...

ASR Disablement Status
Component:     Status

CMP/Memory:    Enabled
IO-Bridge8:    Enabled
IO-Bridge9:    Enabled
GPTwo Slots:   Enabled
Onboard FCAL:  Enabled
Onboard Net1:  Enabled
Onboard Net0:  Enabled
Onboard IDE:   Enabled
PCI Slots:     Enabled

{3} ok
{3} ok asr-disable  ----- "disable" command with options...

Usage: asr-disable <dev-id>
Where <dev-id> is an absolute device path, a device alias, or a device label.
Valid device labels include:
    cmp3-bank3      cmp3-bank2      cmp3-bank1      cmp3-bank0
    cmp2-bank3      cmp2-bank2      cmp2-bank1      cmp2-bank0
    cmp1-bank3      cmp1-bank2      cmp1-bank1      cmp1-bank0
    cmp0-bank3      cmp0-bank2      cmp0-bank1      cmp0-bank0
    pci-slot5       pci-slot4       pci-slot3       pci-slot2
    pci-slot1       pci-slot0       gptwo-slotc     gptwo-slotb
    gptwo-slota     ob-ide          ob-net0         ob-net1
    ob-fcal         io-bridge9      io-bridge8      cmp3
    cmp2            cmp1            cmp0

{3} ok
{3} ok
{3} ok asr-enable ----- "enable" command with options...

Usage: asr-enable <dev-id>
Where <dev-id> is an absolute device path, a device alias, or a device label.
Valid device labels include:
    cmp3-bank3      cmp3-bank2      cmp3-bank1      cmp3-bank0
    cmp2-bank3      cmp2-bank2      cmp2-bank1      cmp2-bank0
    cmp1-bank3      cmp1-bank2      cmp1-bank1      cmp1-bank0
    cmp0-bank3      cmp0-bank2      cmp0-bank1      cmp0-bank0
    pci-slot5       pci-slot4       pci-slot3       pci-slot2
    pci-slot1       pci-slot0       gptwo-slotc     gptwo-slotb
    gptwo-slota     ob-ide          ob-net0         ob-net1
    ob-fcal         io-bridge9      io-bridge8      cmp3
    cmp2            cmp1            cmp0
    *               cmp3-bank*      cmp2-bank*      cmp1-bank*
    cmp0-bank*      pci*            pci-slot*       gptwo-slot*
    io-bridge*      cmp*

{3} ok
{3} ok
{3} ok asr-disable cmp0      ----Finally disabling the faulted...
{3} ok
{3} ok asr-disable cmp2
{3} ok
{3} ok
{3} ok .asr                ----- can check the status now...
ASR Disablement Status
Component:     Status

CMP0:          Disabled
Memory Bank0:  Enabled
Memory Bank1:  Enabled
Memory Bank2:  Enabled
Memory Bank3:  Enabled
CMP1/Memory:   Enabled
CMP2:          Disabled
Memory Bank0:  Enabled
Memory Bank1:  Enabled
Memory Bank2:  Enabled
Memory Bank3:  Enabled
CMP3/Memory:   Enabled
IO-Bridge8:    Enabled
IO-Bridge9:    Enabled
GPTwo Slots:   Enabled
Onboard FCAL:  Enabled
Onboard Net1:  Enabled
Onboard Net0:  Enabled
Onboard IDE:   Enabled
PCI Slots:     Enabled

{3} ok devalias
rootmirror               /pci@9,600000/SUNW,qlc@2/fp@0,0/disk@1,0
rootdisk                 /pci@9,600000/SUNW,qlc@2/fp@0,0/disk@0,0
disk1                    /pci@9,600000/SUNW,qlc@2/fp@0,0/disk@1,0
disk0                    /pci@9,600000/SUNW,qlc@2/fp@0,0/disk@0,0
disk                     /pci@9,600000/SUNW,qlc@2/fp@0,0/disk@0,0
ide                      /pci@8,700000/ide@6
scsi                     /pci@9,600000/SUNW,qlc@2
cdrom                    /pci@8,700000/ide@6/cdrom@0,0:f
net                      /pci@9,700000/network@2
net1                     /pci@9,600000/network@1
net0                     /pci@9,700000/network@2
flash                    /pci@9,700000/ebus@1/flashprom@0,0
idprom                   /pci@9,700000/ebus@1/i2c@1,2e/idprom@4,a4
nvram                    /pci@9,700000/ebus@1/i2c@1,2e/nvram@4,a4
i2c1                     /pci@9,700000/ebus@1/i2c@1,30
i2c0                     /pci@9,700000/ebus@1/i2c@1,2e
bbc                      /pci@9,700000/ebus@1/bbc@1,0
rsc-console              /pci@9,700000/ebus@1/rsc-console@1,3083f8
rsc-control              /pci@9,700000/ebus@1/rsc-control@1,3062f8
ttya                     /pci@9,700000/ebus@1/serial@1,400000:a
pci9b                    /pci@9,700000
pci9a                    /pci@9,600000
pci8b
                         /pci@8,700000
pci8a                    /pci@8,600000
ebus                     /pci@9,700000/ebus@1
name                     aliases
{3} ok
{3} ok

Finally boot the system,

{3} ok boot rootdisk
Boot device: /pci@9,600000/SUNW,qlc@2/fp@0,0/disk@0,0  File and args:
SunOS Release 5.10 Version Generic_147440-27 64-bit
Copyright (c) 1983, 2012, Oracle and/or its affiliates. All rights reserved.
Hostname: SRVR

VxVM sysboot INFO V-5-2-3409 starting in boot mode...
NOTICE: VxVM vxdmp V-5-0-34 added disk array DISKS, datype = Disk

NOTICE: VxVM vxdmp V-5-3-1700 dmpnode 341/0x0 has migrated from enclosure FAKE_ENCLR_SNO to enclosure DISKS

VxVM sysboot INFO V-5-2-3390 Starting restore daemon...
Feb  4 02:35:36 svc.startd[408]: restarting after interruption
WARNING: /pci@8,700000/ide@6/sd@0,0 (sd1):
        transport rejected bad packet

WARNING: /pci@8,700000/ide@6/sd@0,0 (sd1):
        transport rejected bad packet

!!!!!!!!!!!!!!!!!!!!!!

panic[cpu3]/thread=30003e6aa60: UE WDU Error(s)

000002a1013124e0 SUNW,UltraSPARC-IV+:cpu_deferred_error+56c (1, 3, 10000400000154, 400000000, 0, 0)
  %l0-3: 000000a3e4eb5c40 0000000000000001 0000000001851000 0000080c00000040
  %l4-7: 0000080c00000000 0000000000203000 0000000000000001 0000000000000000


syncing file systems... [1] 177 [1] 132 [1] 127 [1] 127 [1] 127 [1] 127 [1] 127 [1] 127 [1] 127 done (not all i/o completed)

ereport.cpu.ultraSPARC-IVplus.ce ena=1f6adb3c02c04801 detector=[ version=1
 scheme="cpu" cpuid=12 cpumask=22 serial="80001A58E75C3807" ] afsr=
 100002000001e4 afsr-ext=0 afar-status=1 afar=a3e5bdd040 pc=0 tl=0 tt=0
 privileged=1 multiple=0 syndrome-status=1 syndrome=1e4 error-type="U"
 error-disposition=2000000 l3-cache-ways=0 l2-cache-ways=0 dcache-ways=0
 icache-ways=0 resource=[ version=0 scheme="mem" unum="Slot A: J8001" ]

dumping to /dev/md/dsk/d20, offset 6873219072, content: kernel
 0:10 100% done
100% done: 147791 pages dumped, dump succeeded
rebooting...

Resetting ...

!!!!!!!!!!!!!!!!!!!!!!

RSC Alert: Host System has Reset

<*>
Software Reset

Skipping POST.
WARNING: Offlining/Disabling CMP0...and CMP2...FRU bus access...Done.
Enabling system bus....... CMP1 CMP3 Done
Mungeing Memory...........Done
HiMem: 0000.00b0.0000.0000, size: 0000.0004.0000.0000
Configuring Memory........ CMP1 CMP3 Done
Init ICache/etc........... CMP1 CMP3 Done
Init ECache Tags.......... CMP1 CMP3 Done
Clearing TLBs............. CMP1 CMP3 Done
Setup I/DTLBs............. CMP1 CMP3 Done
Enabling Cache/MMUs....... CMP1 CMP3 Done
Init ECache Data.......... CMP1 CMP3 Done
Zeroing memory...Done
Copying FLASHRAM to memory...Verifying base 128KB...Done
Jumping into RAM (leaving slave CPUs in ROM)
RAM CRC = 0000.0000.b81b.5f23;  ROM CRC = 0000.0000.b81b.5f23
Dropping in...
Find dropin, Decompressing Done, Size 0000.0000.0007.fd30 (512KB)
Slave CPUs starting Forth at 0000.0000.f000.00e0
Boot  CPU3 starting Forth at 0000.0000.f000.00e0
Diagnostic console initialized

Creating CMP memory layout properties.

Reading temperature limits from FRUPROMs: CMP1/3 BACKPLANE

Environmental monitor is ON
Rebooting with command: boot
Boot device: rootdisk  File and args:
SunOS Release 5.10 Version Generic_147440-27 64-bit
Copyright (c) 1983, 2012, Oracle and/or its affiliates. All rights reserved.
Hostname: SRVR

................. OUTPUT TRUNCATED...........

SRVR console login: root
Password:
Last login: Wed Feb  4 01:47:48 on console
Feb  4 02:45:06 SRVR login: ROOT LOGIN /dev/console
You have new mail.
Sourcing /root/.profile-EIS.....
root@SRVR #
root@SRVR #
root@SRVR #

Using fmadm command we can check which the faulted SB/CPU's .....

root@SRVR # fmadm faulty
--------------- ------------------------------------  -------------- ---------
TIME            EVENT-ID                              MSG-ID         SEVERITY
--------------- ------------------------------------  -------------- ---------
Feb 03 19:42:56 5fa23e99-815a-e9bd-c8b1-929231753f98  SUN4U-8007-L3  Critical

Host        : SRVR
Platform    : SUNW,Sun-Fire-V490        Chassis_id  :
Product_sn  :

Fault class : fault.memory.dimm-ue-imminent 95%
Affects     : mem:///unum=Slot,A:J8001
                  faulted but still in service
FRU         : mem:///unum=Slot,A:J8001 95%
                  faulty
Serial ID.  : 887214

Description : A pattern of correctable errors has been observed suggesting the
              potential exists that an uncorrectable error may occur.
              Refer to http://sun.com/msg/SUN4U-8007-L3 for more information.

Response    : None at this time.

Impact      : None at this time. However, the potential uncorrectable error
              warrants proactive service action to avoid any unplanned system
              outages.

Action      : Schedule a repair procedure to replace the DIMM. Use fmadm faulty
              to identify the DIMM to replace.

--------------- ------------------------------------  -------------- ---------
TIME            EVENT-ID                              MSG-ID         SEVERITY
--------------- ------------------------------------  -------------- ---------
Feb 03 19:34:02 499d428d-84ad-cacd-bae8-e0b6258b0bc0  SUN4U-8000-35  Critical

Host        : SRVR
Platform    : SUNW,Sun-Fire-V490        Chassis_id  :
Product_sn  :

Fault class : fault.memory.bank 95%
Affects     : mem:///unum=Slot,A:J7900,J7901,J8001,J8000
                  faulted but still in service
FRU         : mem:///unum=Slot,A:J7900,J7901,J8001,J8000 95%
                  faulty
Serial ID.  : 887430
              887318
              887214
              688813

Description : The number of errors associated with this memory module has
              exceeded acceptable levels.  Refer to
              http://sun.com/msg/SUN4U-8000-35 for more information.

Response    : Pages of memory associated with this memory module are being
              removed from service as errors are reported.

Impact      : Total system memory capacity will be reduced as pages are
              retired.

Action      : Schedule a repair procedure to replace the affected memory
              module. Use fmdump -v -u <EVENT_ID> to identify the module.

--------------- ------------------------------------  -------------- ---------
TIME            EVENT-ID                              MSG-ID         SEVERITY
--------------- ------------------------------------  -------------- ---------
Feb 03 20:02:00 1536fecc-7ee3-6e49-fb81-c2da0262733a  PCI-8000-AP    Major

Host        : SRVR
Platform    : SUNW,Sun-Fire-V490        Chassis_id  :
Product_sn  :

Fault class : fault.io.pci.device-invreq 50%
              fault.io.pci.device-interr 50%
Affects     : dev:////pci@9,600000/SUNW,qlc@2
              dev:////pci@9,600000
                  faulted but still in service
FRU         : "MB" (hc://:product-id=SUNW,Sun-Fire-V490:server-id=SRVR/motherboard=0)
                  faulty

Description : Either the transmitting device sent an invalid request or the
              receiving device is reporting an internal fault.
              Refer to http://sun.com/msg/PCI-8000-AP for more information.

Response    : One or more device instances may be disabled

Impact      : Possible loss of services provided by the device instances
              associated with this fault

Action      : Ensure that the latest drivers and patches are installed.
              Otherwise schedule a repair procedure to replace the affected
              device(s).  Use fmadm faulty to identify the devices or contact
              Sun for support.

--------------- ------------------------------------  -------------- ---------
TIME            EVENT-ID                              MSG-ID         SEVERITY
--------------- ------------------------------------  -------------- ---------
Feb 02 17:36:45 50af5cf4-571a-cee2-b76b-85b4d0c0430b  SUN4U-8007-KY  Major

Host        : SRVR
Platform    : SUNW,Sun-Fire-V490        Chassis_id  :
Product_sn  :

Fault class : fault.memory.dimm-page-retires-excessive 95%
Affects     : mem:///unum=Slot,A:J8001
                  faulted but still in service
FRU         : mem:///unum=Slot,A:J8001 95%
                  faulty
Serial ID.  : 887214

Description : The number of correctable errors associated with this memory
              module has exceeded acceptable levels.
              Refer to http://sun.com/msg/SUN4U-8007-KY for more information.

Response    : Pages of memory associated with this memory module have been
              removed from service, up to a limit which has now been reached.

Impact      : Total system memory capacity has been reduced.

Action      : Schedule a repair procedure to replace the DIMM. Use fmadm faulty
              to identify the DIMM to replace.

root@SRVR #
root@SRVR #

Now we can check using "prtdiag" command whether SLOT-A is disabled or not...

root@SRVR # prtdiag -v | grep -i mem
Memory size: 16384 Megabytes
========================= Memory Configuration ===============================
root@SRVR #
root@SRVR # prtdiag -v
System Configuration:  Oracle Corporation  sun4u Sun Fire V490
System clock frequency: 150 MHz
Memory size: 16384 Megabytes

========================= CPUs ===============================================

          Run   E$  CPU     CPU
Brd  CPU  MHz   MB  Impl.   Mask
--- ----- ---- ---- ------- ----
 B  1, 17 1500 32.0 US-IV+   2.2                       BOARD "A" is missing.....
 B  3, 19 1500 32.0 US-IV+   2.2

========================= Memory Configuration ===============================

          Logical  Logical  Logical
     MC   Bank     Bank     Bank         DIMM    Interleave  Interleaved
Brd  ID   num      size     Status       Size    Factor      with
---  ---  ----     ------   -----------  ------  ----------  -----------
 B    1     0      2048MB   no_status    1024MB     8-way        0
 B    1     1      2048MB   no_status    1024MB     8-way        0
 B    1     2      2048MB   no_status    1024MB     8-way        0
 B    1     3      2048MB   no_status    1024MB     8-way        0
 B    3     0      2048MB   no_status    1024MB     8-way        0
 B    3     1      2048MB   no_status    1024MB     8-way        0
 B    3     2      2048MB   no_status    1024MB     8-way        0
 B    3     3      2048MB   no_status    1024MB     8-way        0

========================= IO Cards =========================

                    Bus  Max
 IO  Port Bus       Freq Bus  Dev,
Type  ID  Side Slot MHz  Freq Func State Name                              Model
---- ---- ---- ---- ---- ---- ---- ----- --------------------------------  ----------------------
PCI   8    B    3    33   33  3,0  ok    QLGC,qlc-pci1077,2312.1077.101.2+
PCI   8    B    3    33   33  3,1  ok    QLGC,qlc-pci1077,2312.1077.101.2+
PCI   8    B    5    33   33  5,0  ok    QLGC,qlc-pci1077,2312.1077.101.2+
PCI   8    B    5    33   33  5,1  ok    QLGC,qlc-pci1077,2312.1077.101.2+
.......

So we need to schedule downtime for Memory replacement....

##########################################################################################




Monday, 2 February 2015

Making Luns Visible at Host Side !!!

Whenever we assign luns for host, they won't be visible to host. To make luns visible at host side we need to perform few things. 

We can assume all these steps together as " host side scanning ".
Once host side scanning is performed, luns will be visible at OS level.

In one of our servers, we alloted 2 new luns. I want to add these luns to my DATA DG to increase it's size.

Now let us see what things need to be followed to make them visible at OS level.

root@mydbsrvr # bash
root@mydbsrvr #
root@mydbsrvr # df -kh /oracle
Filesystem             size   used  avail capacity  Mounted on
/dev/vx/dsk/DATA/oracle
                       408G   386G    22G    95%    /oracle
root@mydbsrvr #
root@mydbsrvr # vxdisk -oalldgs list

DEVICE       TYPE            DISK         GROUP        STATUS
c1t0d0s2     auto:none       -            -            online invalid
c1t1d0s2     auto:none       -            -            online invalid
emcpower0s2  auto:sliced     DATA0     DATA      online
emcpower1s2  auto:sliced     DATA1     DATA      online
emcpower2s2  auto:sliced     DATA2     DATA      online
emcpower3s2  auto:sliced     DATA3     DATA      online
emcpower4s2  auto:sliced     DATA4     DATA      online
emcpower5s2  auto:sliced     DATA5     DATA      online
emcpower6s2  auto:sliced     -            (DATA_old)      online
emcpower7s2  auto:sliced     DATA_new0     DATA_new      online
root@mydbsrvr #

Till now luns are not visible at veritas level, let us check in powerpath.

root@mydbsrvr # powermt display
Symmetrix logical device count=8
CLARiiON logical device count=0
Hitachi logical device count=0
Invista logical device count=0
HP xp logical device count=0
Ess logical device count=0
HP HSx logical device count=0
==============================================================================
----- Host Bus Adapters ---------  ------ I/O Paths -----  ------ Stats ------
###  HW Path                       Summary   Total   Dead  IO/Sec Q-IOs Errors
==============================================================================
3072 pci@8,700000/QLGC,qlc@3/fp@0,0  optimal      8      0       -     0      0
3076 pci@8,700000/QLGC,qlc@2/fp@0,0  optimal      8      0       -     0      8
root@mydbsrvr #

We can see device count is 8, new luns not yet visible...
So now to make luns visible, we need to scan our host, which includes following steps :

1. Configure the controllers related to fibre channel.

root@mydbsrvr # cfgadm -al
Ap_Id                          Type         Receptacle   Occupant     Condition
c0                             scsi-bus     connected    configured   unknown
c0::dsk/c0t0d0                 CD-ROM       connected    configured   unknown
c1                             fc-private   connected    configured   unknown
c1::500000e01821d261           disk         connected    configured   unknown
c1::500000e01821f6c1           disk         connected    configured   unknown
c2                             fc-fabric    connected    configured   unknown
c2::5006048452a85012           disk         connected    configured   unknown
c3                             fc           connected    unconfigured unknown
c6                             fc-fabric    connected    configured   unknown
c6::5006048452a8501d           disk         connected    configured   unknown
c7                             fc           connected    unconfigured unknown
usb0/1                         unknown      empty        unconfigured ok
usb0/2                         unknown      empty        unconfigured ok
usb0/3                         unknown      empty        unconfigured ok
usb0/4                         unknown      empty        unconfigured ok
root@mydbsrvr #

root@mydbsrvr # cfgadm -c configure c2
root@mydbsrvr #
root@mydbsrvr # cfgadm -c configure c6
root@mydbsrvr #

2. Scanning at multipathing level.

root@mydbsrvr # powermt config
root@mydbsrvr #
root@mydbsrvr # powermt save
root@mydbsrvr #
root@mydbsrvr # powermt display
Symmetrix logical device count=10
CLARiiON logical device count=0
Hitachi logical device count=0
Invista logical device count=0
HP xp logical device count=0
Ess logical device count=0
HP HSx logical device count=0
==============================================================================
----- Host Bus Adapters ---------  ------ I/O Paths -----  ------ Stats ------
###  HW Path                       Summary   Total   Dead  IO/Sec Q-IOs Errors
==============================================================================
3072 pci@8,700000/QLGC,qlc@3/fp@0,0  optimal      10      0       -     0      0
3076 pci@8,700000/QLGC,qlc@2/fp@0,0  optimal      10      0       -     0      8
root@mydbsrvr #

Sometimes If at this step luns are not visible go for step 3. 

3. vxscandisks and vxdctl enable which helps to scan host at Veritas level.

Let us check now,

root@mydbsrvr # vxdisk -oalldgs list
DEVICE       TYPE            DISK         GROUP        STATUS
c1t0d0s2     auto:none       -            -            online invalid
c1t1d0s2     auto:none       -            -            online invalid
emcpower0s2  auto:sliced     DATA0     DATA      online
emcpower1s2  auto:sliced     DATA1     DATA      online
emcpower2s2  auto:sliced     DATA2     DATA      online
emcpower3s2  auto:sliced     DATA3     DATA      online
emcpower4s2  auto:sliced     DATA4     DATA      online
emcpower5s2  auto:sliced     DATA5     DATA      online
emcpower6s2  auto:sliced     -            (DATA_old)     online
emcpower7s2  auto:sliced     DATA_new0     DATA_new      online
emcpower8s2  auto:none       -            -            online invalid
emcpower9s2  auto:none       -            -            online invalid
root@mydbsrvr #

Now we have to proceed initializing the disk to bring them under Veritas Control.

root@mydbsrvr #  vxdisksetup -i emcpower8 format=sliced
root@mydbsrvr #
root@mydbsrvr #  vxdisksetup -i emcpower9 format=sliced
root@mydbsrvr #

root@mydbsrvr # vxdisk -oalldgs list
DEVICE       TYPE            DISK         GROUP        STATUS
c1t0d0s2     auto:none       -            -            online invalid
c1t1d0s2     auto:none       -            -            online invalid
emcpower0s2  auto:sliced     DATA0     DATA      online
emcpower1s2  auto:sliced     DATA1     DATA      online
emcpower2s2  auto:sliced     DATA2     DATA      online
emcpower3s2  auto:sliced     DATA3     DATA      online
emcpower4s2  auto:sliced     DATA4     DATA      online
emcpower5s2  auto:sliced     DATA5     DATA      online
emcpower6s2  auto:sliced     -            (DATA_old)     online
emcpower7s2  auto:sliced     DATA_new0     DATA_new      online
emcpower8s2  auto:sliced     -            -            online
emcpower9s2  auto:sliced     -            -            online
root@mydbsrvr #

Now luns are ready to add to DG.

root@mydbsrvr # vxdg list
NAME         STATE           ID
DATA      enabled              1330684297.33.DATA
DATA_new  enabled              1330684501.41.DATA_new
root@mydbsrvr #
root@mydbsrvr #
root@mydbsrvr # vxdg -g DATA adddisk DATA6=emcpower8
root@mydbsrvr #
root@mydbsrvr # vxdg -g DATA adddisk DATA7=emcpower9
root@mydbsrvr #
root@mydbsrvr # vxprint -htg DATA

dg DATA      default      default  18000    1330684297.33.DATA_old

dm DATA0     emcpower0s2  auto     68863    142690560 -
dm DATA1     emcpower1s2  auto     68863    142690560 -
dm DATA2     emcpower2s2  auto     68863    142690560 -
dm DATA3     emcpower3s2  auto     68863    142690560 -
dm DATA4     emcpower4s2  auto     68863    142690560 -
dm DATA5     emcpower5s2  auto     68863    142690560 -
dm DATA6     emcpower8s2  auto     68863    142690560 -
dm DATA7     emcpower9s2  auto     68863    142690560 -

v  oracle       -            ENABLED  ACTIVE   855638016 SELECT   oracle-01 fsgen
pl oracle-01    oracle       ENABLED  ACTIVE   855638016 STRIPE   2/1024   RW
sd DATA0-01  oracle-01    DATA0 0        142690560 0/0      emcpower0 ENA
sd DATA2-01  oracle-01    DATA2 0        142690560 0/142690560 emcpower2 ENA
sd DATA4-01  oracle-01    DATA4 0        142437888 0/285381120 emcpower4 ENA
sd DATA1-01  oracle-01    DATA1 0        142690560 1/0      emcpower1 ENA
sd DATA3-01  oracle-01    DATA3 0        142690560 1/142690560 emcpower3 ENA
sd DATA5-01  oracle-01    DATA5 0        142437888 1/285381120 emcpower5 ENA
root@mydbsrvr #

root@mydbsrvr # vxdisk -oalldgs list
DEVICE       TYPE            DISK         GROUP        STATUS
c1t0d0s2     auto:none       -            -            online invalid
c1t1d0s2     auto:none       -            -            online invalid
emcpower0s2  auto:sliced     DATA0     DATA      online
emcpower1s2  auto:sliced     DATA1     DATA      online
emcpower2s2  auto:sliced     DATA2     DATA      online
emcpower3s2  auto:sliced     DATA3     DATA      online
emcpower4s2  auto:sliced     DATA4     DATA      online
emcpower5s2  auto:sliced     DATA5     DATA      online
emcpower6s2  auto:sliced     -            (DATA_old)      online
emcpower7s2  auto:sliced     DATA_new0     DATA_new      online
emcpower8s2  auto:sliced     DATA6     DATA      online
emcpower9s2  auto:sliced     DATA7     DATA      online
root@mydbsrvr #

root@mydbsrvr # df -kh /oracle
Filesystem             size   used  avail capacity  Mounted on
/dev/vx/dsk/DATA/oracle
                       408G   386G    22G    95%    /oracle
root@mydbsrvr #

Now we can resize the DG using these newly assigned luns,

root@mydbsrvr # vxassist -g DATA maxsize
Maximum volume size: 285886464 (139593Mb)
root@mydbsrvr #
root@mydbsrvr #
root@mydbsrvr # vxresize -g DATA oracle +130g
root@mydbsrvr #
root@mydbsrvr #
root@mydbsrvr # df -kh /oracle
Filesystem             size   used  avail capacity  Mounted on
/dev/vx/dsk/DATA/oracle
                       538G   386G   151G    72%    /oracle
root@mydbsrvr #

After performing Host Scan luns are visibled to server and thus we increased the size of our DG.

################################################################################

Renaming Guest LDOM !!!

If we want to rename our guest ldom, we need to do following things :

We can't straight away change our ldom name with a single command, once a ldom is bind and started all its resources are linked with respect to ldom's name. 

So to rename a ldom , firstly we need to stop, unbind it and then we need to remove it.
Then using it's constraints file we can add the domain with its new name.

Following are the steps :

1. Take a fresh backup of constraints for your ldom. (It should be a xml file)

ldm ls-constraints -x testldm >> /tmp/testldm.xml (where testldm is old LDom)

2. Replace the ldom name with it's new name in the file.

vi /tmp/testldm.xml (Replace the old Ldom with new Ldom)

3. Stop,unbind and remove the domain.

ldm stop testldm

ldm unbind testldm

ldm remove-domain testldm

4. Remove the disks assigned to this particular ldom.

ldm list-bindings | grep -i testldm| more (Will list all vdisk names)

ldm remove-vdsdev testldm_disk_1EF3@primary-vds0

ldm remove-vdsdev testldm-1323@primary-vds0

ldm remove-vdsdev testldm-1264@primary-vds0

5. Now there is no info related to old ldom, so proceed by adding ldom with new name using our bkp of constraints file.

ldm add-domain -i /tmp/testldm.xml

ldm bind myprod (where myprod is the new LDom)

ldm start myprod

root@RRLT5DVQA:~# ldm ls | grep myprod
myprod          active     -t----  5003    16    32G      1.2%  1.6%  6s

Thus we can rename a ldom by following above steps.

################################################################################

Hostname Change in Solaris 10 & 11 !!!

In Solaris 10, we all know how to change hostname of server without reboot.
It is so simple, just to edit entry of hostname in few files.

/etc/nodename 
/etc/hostname.*interface 
/etc/inet/hosts or /etc/hosts
/etc/inet/ipnodes 

Then we need to take care regarding crashdump location. Usually we use default location for crash dump location as /var/crash/"hostname". So rename the directory in this location:

# cd /var/crash
# mv oldname newname

Coming to Solaris 11, modifications need to be done in system configuration can be achieved through single command " svccfg ". By Centralizing such management simplifies Configuration and Administration.

Few example like,

# svccfg -s svc:/system/environment:init --- we can modify environmental values like Timezone.

then >setprop TZ=Asia/Calcutta.

Similarly to change hostname, we have svccfg -s svc:/system/identity:node rather than editing individual files like /etc/nodename for hostname and /etc/default/init for Timezone.

To change hostname in Solaris 11 follow these steps :

svccfg -s system/identity:node setprop config/nodename="MyProdServer"

svccfg -s system/identity:node setprop config/loopback="MyProdServer"

svccfg -s system/identity:node refresh

svcadm restart system/identity:node

As shown above, in identity:node service need to change 2 properties " nodename and loopback ". After setting properties just take a restart of the service to reflect modifications.

Take a new session and observe the change in hostname.

################################################################################

Friday, 19 December 2014

Issue faced while unzipping 10_Recommended.zip !!!

This month, I just want to post some small small issues I faced, so that it will be bit helpful for those who face them...

Recently small Issue faced while performing KJP Activiity, it was regarding unzipping the patch cluster. I have to upgrade my solaris V490 server to latest patch, then I faced this.

I copied my patch cluster zip file to /var/tmp directory of my server.
Then straight away i tried to unzip it as follows :

root@mytest1 # cd /var/tmp
root@mytest1 #
root@mytest1 # ls -lrth
total 4847856
-rw-r--r--   1 root     root        2.3G Dec 18 03:03 10_Recommended.zip
root@mytest1 #
root@mytest1 #
root@mytest1 # unzip 10_Recommended.zip
Archive:  10_Recommended.zip
  End-of-central-directory signature not found.  Either this file is not
  a zipfile, or it constitutes one disk of a multi-part archive.  In the
  latter case the central directory and zipfile comment will be found on
  the last disk(s) of this archive.
unzip:  cannot find zipfile directory in one of 10_Recommended.zip or
        10_Recommended.zip.zip, and cannot find 10_Recommended.zip.ZIP, period.
root@mytest1 #

Then I googled and tried in many ways. Finally I got "jar" command.

By using jar we can unzip any zip file....

root@mytest1 #             Takes longtime like unzip only, but I felt jar is bit faster than unzip.
root@mytest1 # jar -xf 10_Recommended.zip

root@mytest1 #
root@mytest1 #
root@mytest1 # cd /var/tmp
root@mytest1 #
root@mytest1 # ls -lrth
total 4847858
-rw-r--r--   1 root     root        2.3G Dec 18 03:03 10_Recommended.zip
drwxr-xr-x   3 root     root         512 Dec 18 03:15 10_Recommended
root@mytest1 #
root@mytest1 # cd 10_Recommended
root@mytest1 #
root@mytest1 # ls -lrth
total 670
-rw-r--r--   1 root     root          15 Dec 18 03:04 installcluster
-rw-r--r--   1 root     root         18K Dec 18 03:04 LEGAL_LICENSE.TXT
-rw-r--r--   1 root     root        7.4K Dec 18 03:04 Copyright
-rw-r--r--   1 root     root        107K Dec 18 03:04 10_Recommended.html
-rw-r--r--   1 root     root        4.8K Dec 18 03:04 patchset.conf
drwxr-xr-x 377 root     root        8.0K Dec 18 03:15 patches
-rw-r--r--   1 root     root        100K Dec 18 03:15 installpatchset
-rw-r--r--   1 root     root        3.7K Dec 18 03:15 patch_order
-rw-r--r--   1 root     root         57K Dec 18 03:15 10_Recommended.README
root@mytest1 #
root@mytest1 #

Then I took my console login and started activity,

RSC version 2.2.3 (D4XTESTDV)
Please login: admin
Please Enter password:
rsc> console
mytest1 console login: root
Password:
Last login: Thu Dec 18 02:53:34 from 10.66.37.14
Dec 18 03:18:17 mytest1 login: ROOT LOGIN /dev/console
Sun Microsystems Inc.   SunOS 5.10      Generic January 2005
root@mytest1 # 
root@mytest1 # df -kh
Filesystem             size   used  avail capacity  Mounted on
/dev/dsk/c1t0d0s0       30G   3.5G    26G    12%    /
/devices                 0K     0K     0K     0%    /devices
ctfs                     0K     0K     0K     0%    /system/contract
proc                     0K     0K     0K     0%    /proc
mnttab                   0K     0K     0K     0%    /etc/mnttab
swap                    47G   568K    47G     1%    /etc/svc/volatile
objfs                    0K     0K     0K     0%    /system/object
/platform/sun4u-us3/lib/libc_psr/libc_psr_hwcap2.so.1
                        30G   3.5G    26G    12%    /platform/sun4u-us3/lib/libc_psr.so.1
/platform/sun4u-us3/lib/sparcv9/libc_psr/libc_psr_hwcap2.so.1
                        30G   3.5G    26G    12%    /platform/sun4u-us3/lib/sparcv9/libc_psr.so.1
fd                       0K     0K     0K     0%    /dev/fd
swap                    47G    32K    47G     1%    /tmp
swap                    47G    40K    47G     1%    /var/run
root@mytest1 #
root@mytest1 # uname -a
SunOS mytest1 5.10 Generic_118833-33 sun4u sparc SUNW,Sun-Fire-V490

Above is my older kernel version...

root@mytest1 #
root@mytest1 # cd /var/tmp
root@mytest1 #
root@mytest1 # ls -lrth
total 4847858
-rw-r--r--   1 root     root        2.3G Dec 18 03:03 10_Recommended.zip
drwxr-xr-x   3 root     root         512 Dec 18 03:15 10_Recommended
root@mytest1 #
root@mytest1 # cd 10_Recommended
root@mytest1 #
root@mytest1 # ls -lrth
total 670
-rw-r--r--   1 root     root          15 Dec 18 03:04 installcluster
-rw-r--r--   1 root     root         18K Dec 18 03:04 LEGAL_LICENSE.TXT
-rw-r--r--   1 root     root        7.4K Dec 18 03:04 Copyright
-rw-r--r--   1 root     root        107K Dec 18 03:04 10_Recommended.html
-rw-r--r--   1 root     root        4.8K Dec 18 03:04 patchset.conf
drwxr-xr-x 377 root     root        8.0K Dec 18 03:15 patches
-rw-r--r--   1 root     root        100K Dec 18 03:15 installpatchset
-rw-r--r--   1 root     root        3.7K Dec 18 03:15 patch_order
-rw-r--r--   1 root     root         57K Dec 18 03:15 10_Recommended.README
root@mytest1 #
root@mytest1 #
root@mytest1 # ./installpatchset --s10patchset
./installpatchset: cannot execute
root@mytest1 #

I checked and provided permissions for " installpatchset "

root@mytest1 # pwd
/var/tmp/10_Recommended
root@mytest1 #
root@mytest1 # ls -lrth
total 670
-rw-r--r--   1 root     root          15 Dec 18 03:04 installcluster
-rw-r--r--   1 root     root         18K Dec 18 03:04 LEGAL_LICENSE.TXT
-rw-r--r--   1 root     root        7.4K Dec 18 03:04 Copyright
-rw-r--r--   1 root     root        107K Dec 18 03:04 10_Recommended.html
-rw-r--r--   1 root     root        4.8K Dec 18 03:04 patchset.conf
drwxr-xr-x 377 root     root        8.0K Dec 18 03:15 patches
-rw-r--r--   1 root     root        100K Dec 18 03:15 installpatchset
-rw-r--r--   1 root     root        3.7K Dec 18 03:15 patch_order
-rw-r--r--   1 root     root         57K Dec 18 03:15 10_Recommended.README
root@mytest1 #
root@mytest1 # chmod 755 installpatchset        --- Now let's try again....
root@mytest1 #
root@mytest1 # ./installpatchset --s10patchset
Setup .....

Recommended OS Patchset Solaris 10 SPARC (2014.05.08)
Patch 118833-36 will be applied shortly. Installation of the patch set will
halt after this patch is applied, and it will be necessary to reboot before
installation of the patch set can continue.
Application of patches started : 2014.12.18 03:22:12
Applying 120900-04 (  1 of 375) ... skipped
Applying 121133-02 (  2 of 375) ... skipped
Applying 119254-89 (  3 of 375) ... failed
Application of patches finished : 2014.12.18 03:22:16

Following patches were skipped :
 Patches already applied
 120900-04     121133-02
Following patch failed to apply :
 119254-89
Aborting due to failure while applying patch 119254-89.
Application of this patch should have succeeded - this failure is unexpected.
Please assess cause of failure and verify system integrity before proceeding.
Install log files written :
  /var/sadm/install_data/s10s_rec_patchset_short_2014.12.18_03.22.12.log
  /var/sadm/install_data/s10s_rec_patchset_verbose_2014.12.18_03.22.12.log
  /var/sadm/install_data/s10s_rec_patchset_failed_2014.12.18_03.22.12.log
root@mytest1 #
root@mytest1 # pwd
/var/tmp/10_Recommended
root@mytest1 # 
root@mytest1 # ls -lrth
total 670
-rw-r--r--   1 root     root          15 Dec 18 03:04 installcluster
-rw-r--r--   1 root     root         18K Dec 18 03:04 LEGAL_LICENSE.TXT
-rw-r--r--   1 root     root        7.4K Dec 18 03:04 Copyright
-rw-r--r--   1 root     root        107K Dec 18 03:04 10_Recommended.html
-rw-r--r--   1 root     root        4.8K Dec 18 03:04 patchset.conf
drwxr-xr-x 377 root     root        8.0K Dec 18 03:15 patches
-rw-r--r--   1 root     root        100K Dec 18 03:15 installpatchset
-rw-r--r--   1 root     root        3.7K Dec 18 03:15 patch_order
-rw-r--r--   1 root     root         57K Dec 18 03:15 10_Recommended.README
root@mytest1 # 
root@mytest1 # cd patches
root@mytest1 #
root@mytest1 # ls -lrth | more
total 800
drwxr-xr-x   4 root     root         512 Dec 18 03:04 120812-32
drwxr-xr-x   5 root     root         512 Dec 18 03:04 119282-01
drwxr-xr-x   7 root     root         512 Dec 18 03:04 118712-23
drwxr-xr-x   3 root     root         512 Dec 18 03:04 119317-01
drwxr-xr-x   5 root     root         512 Dec 18 03:04 122259-06
drwxr-xr-x   6 root     root         512 Dec 18 03:04 124204-04
drwxr-xr-x   7 root     root         512 Dec 18 03:04 139520-02
drwxr-xr-x   6 root     root         512 Dec 18 03:04 148027-03
drwxr-xr-x   3 root     root         512 Dec 18 03:04 149646-02
drwxr-xr-x   7 root     root         512 Dec 18 03:04 118777-16   OUTPUT TRUNCATED

Then I wanted to install that single patch manually,

root@mytest1 # patchadd 119254-89
Validating patches...
Loading patches installed on the system...
Done!
Loading patches requested to install.
Done!
Checking patches that you specified for installation.
Done!

Approved patches will be installed in this order:
119254-89

ERROR: The /var/tmp/10_Recommended/patches/119254-89/prepatch script contains invalid permissions. Please make the script executable and reinstall 119254-89.
WARNING: Skipping patch
root@mytest1 #

This time I got to know that, this is due to file permissions...So I gave permissions as below,

root@mytest1 #
root@mytest1 # pwd
/var/tmp/10_Recommended
root@mytest1 #
root@mytest1 #  ls -lrth patches/119254-89/postpatch
-rw-r--r--   1 root     root        2.9K Dec 18 03:06 patches/119254-89/postpatch
root@mytest1 #
root@mytest1 #
root@mytest1 # chmod 755 patches/119254-89/postpatch
root@mytest1 #
root@mytest1 # ls -lrth patches/119254-89/prepatch
-rw-r--r--   1 root     root         539 Dec 18 03:06 patches/119254-89/prepatch
root@mytest1 #
root@mytest1 # chmod 755 patches/119254-89/prepatch
root@mytest1 #
root@mytest1 #  ls -lrth patches/119254-89/prepatch
-rwxr-xr-x   1 root     root         539 Dec 18 03:06 patches/119254-89/prepatch
root@mytest1 #
root@mytest1 #
root@mytest1 # pwd
/var/tmp/10_Recommended
root@mytest1 #

Now let's try again,

root@mytest1 # ./installpatchset --s10patchset
Setup ...

Recommended OS Patchset Solaris 10 SPARC (2014.05.08)
Patch 118833-36 will be applied shortly. Installation of the patch set will
halt after this patch is applied, and it will be necessary to reboot before
installation of the patch set can continue.
Application of patches started : 2014.12.18 03:26:36
Applying 120900-04 (  1 of 375) ... skipped
Applying 121133-02 (  2 of 375) ... skipped
Applying 119254-89 (  3 of 375) ... success
Applying 119317-01 (  4 of 375) ... skipped
Applying 121296-01 (  5 of 375) ... skipped
Applying 138215-01 (  6 of 375) ... success
Applying 148336-01 (  7 of 375) ... failed
Application of patches finished : 2014.12.18 03:26:56

Following patches were applied :
 119254-89     138215-01
Following patches were skipped :
 Patches already applied
 120900-04     121133-02     119317-01     121296-01
Following patch failed to apply :
 148336-01
Aborting due to failure while applying patch 148336-01.
Application of this patch should have succeeded - this failure is unexpected.
Please assess cause of failure and verify system integrity before proceeding.
Install log files written :
  /var/sadm/install_data/s10s_rec_patchset_short_2014.12.18_03.26.36.log
  /var/sadm/install_data/s10s_rec_patchset_verbose_2014.12.18_03.26.36.log
  /var/sadm/install_data/s10s_rec_patchset_failed_2014.12.18_03.26.36.log
root@mytest1 #

Ohho again I faced the same, this time I felt like due to "jar" command after unzipping there were no perfect permissions for files, so I wanted to give 755 permission and try.

root@mytest1 #
root@mytest1 # pwd
/var/tmp/10_Recommended
root@mytest1 #
root@mytest1 #
root@mytest1 # cd ../
root@mytest1 #
root@mytest1 # chmod -R 755 10_Recommended
root@mytest1 #
root@mytest1 #
root@mytest1 # cd 10_Recommended
root@mytest1 #
root@mytest1 # ls -lrth
total 670
-rwxr-xr-x   1 root     root          15 Dec 18 03:04 installcluster
-rwxr-xr-x   1 root     root         18K Dec 18 03:04 LEGAL_LICENSE.TXT
-rwxr-xr-x   1 root     root        7.4K Dec 18 03:04 Copyright
-rwxr-xr-x   1 root     root        107K Dec 18 03:04 10_Recommended.html
-rwxr-xr-x   1 root     root        4.8K Dec 18 03:04 patchset.conf
drwxr-xr-x 377 root     root        8.0K Dec 18 03:15 patches
-rwxr-xr-x   1 root     root        100K Dec 18 03:15 installpatchset
-rwxr-xr-x   1 root     root        3.7K Dec 18 03:15 patch_order
-rwxr-xr-x   1 root     root         57K Dec 18 03:15 10_Recommended.README
root@mytest1 #

Hoping that the issue will be resolved....

root@mytest1 #
root@mytest1 # ./installpatchset --s10patchset
Setup ...

Recommended OS Patchset Solaris 10 SPARC (2014.05.08)
Patch 118833-36 will be applied shortly. Installation of the patch set will
halt after this patch is applied, and it will be necessary to reboot before
installation of the patch set can continue.
Application of patches started : 2014.12.18 03:30:37
Applying 120900-04 (  1 of 375) ... skipped
Applying 121133-02 (  2 of 375) ... skipped
Applying 119254-89 (  3 of 375) ... skipped
Applying 119317-01 (  4 of 375) ... skipped
Applying 121296-01 (  5 of 375) ... skipped
Applying 138215-01 (  6 of 375) ... skipped
Applying 148336-01 (  7 of 375) ... success
Applying 146054-07 (  8 of 375) ... success
Applying 142251-02 (  9 of 375) ... success
Applying 125555-15 ( 10 of 375) ... success
Applying 118367-04 ( 11 of 375) ... skipped
Applying 118666-65 ( 12 of 375) ...

Finally I got succeeded and patch was successfully installed...

###############################################################