Escolar Documentos
Profissional Documentos
Cultura Documentos
README FIRST
README FIRST
This AP has been updated to include commands for systems running "Cluster-Mode" (C-Mode) ONTAP.
The login name for C-Mode systems is "admin", not "root".
The ONTAP version and mode is listed in your dispatch!
C-Mode: Has two console command shells, clustershell and nodeshell. The default shell is clustershell.
IF clustershell, the console prompt includes a double colon ( :: ). Ex(1): cluster ::> Ex(2): cluster ::storage>
To switch from clustershell to nodeshell, enter 'run local' at the ::> prompt, then the double colons (::) are
removed. To exit nodeshell, enter 'exit' or Ctrl-D.
From clustershell, nodeshell commands can be entered by prefacing the 7-Mode command with run local".
Ex: cluster::> run local sysconfig -v Note, all 7-Mode commands are not supported in C-Mode.
No "Failed" Disks can exist in the target node in a HA config or the disk reassign will not execute. The AP covers this.
If this is a V-Series system with a 3rd party storage array or has SAN attached Tape drives, confirm a storage admin
is available to update the LUN Masking and/or switch zoning if the internal FC adapters are connected to the array.
Pre-ONTAP 8.2 Known Bugs/Issues - Skip IF ONTAP 8.2, otherwise refer to the Bug Table and Notes Below
Description
First Fixed Release
Bug
See Note 1
In disruptive MB w/NVMEM replacements, a TO/GB from the repaired node is req'd. See Note 2
NDMP, Qtree-SnapMirror, Vol-SnapMirror or SnapVault processes can hang TO/GB See Note 3
Diags report a false error on the NVRAM test in a HA Config.
Diag version 5.6.1 and >
Bug Notes:
1 In some versions of ONTAP when the 'disk reassign' command is executed from the partner, ONTAP may print out a
warning that states 2 things.
(i) The giveback must be done right way - IF a GB will not be immediately performed, the disk reassign needs to be
post-poned.
(ii) A second TO/GB should be performed from the repaired node. This is covered in the AP. (TSB-1110-04)
disk reassign: A giveback must be done immediately following a reassign of partner
disks. After the partner node becomes operational, do a takeover and giveback of
this node to complete the disk reassign process.
Do you want to continue (y/n)?
2 IF this system has a partner AND the partner did NOT takeover this controller, it is still necessary to sync the new system-ids
by executing a TO/GB from the repaired node although no console message is displayed. This is covered in the AP.
3 The AP will cover asking the customer if they are running these processes. If so, there is a link how to disable them.
4 Bug- ERROR DNH0500: The AP will cover checking for the diag version and how to work around it.
AP doc rev is at top of page - If using hard-copy for secure site, be sure to print all the linked documents in this AP.
XIII.
XIV.
XV.
XVI.
XVII.
XVIII.
XIX.
XX.
The FAS3100 Appliance has either one or two Controller Modules, A, B in a single chassis.
FAS3100 Model
Fig 1
PS-1 AC
Switch
6u
Fig 2
PS-2 AC
Switch
B
AC Power
HA Configurations
1 Red Thumbscrew to
extract each controller
Cam Handle
Fig 3
4 PCI slots
I.
XI.
XII.
PCI - 1
PCI - 2
PCI - 4
0a, 0b
IOIOI
(Console)
Port
Ethernet Ports:
e0a, e0b
Controller
Rear View
PCI - 3
0c, 0d
RLM
Port
Status LED
The NVMEM D87 LED will start flashing through the grill, reference Fig 3, when power is removed
STOP !! from the controller if the system is "waiting for giveback", or the system was not shutdown properly
(uncommitted data). Follow the steps in Section V carefully.
2
I.
Action Description
Fig 4
Notes:
1. This Action Plan covers FAS and V-Series Controller running ONTAP 7-Mode
or Cluster-Mode .
2. This procedure will take 90-180 minutes.
3. Note the Caution on NVMEM LEDs in Section V.
4. This Action Plan needs to be followed in step order
5. FC port configuration, disk list and the system date are captured prior to
removing the original Controller.
6. Many parts need to be moved from the Original Controller to the Replacement
Controller Module.
7. System variables; date-time, disk reassignment and FC port configuration
must be verified before rebooting the system.
8. If a HA configuration and ONTAP 8, the console may report you "must
perform a final ' cf takeover' and 'cf giveback' from the 'partner node", the node
that was repaired to complete the 'disk reassign' process. Follow the new steps
in 'Disk Reassign' and 'Boot the OS' sections carefully.
Fig 5b
AC Power
"B" Controller
Fault LED is "ON"
"A" Top is OFF
NOTE Chassis Check: To see if two controllers are installed reference HA (active-active) figures here > HA Configs
2 Check the state of the node by viewing the console port responses from (each) controller if an HA (Active-Active) configuration.
A HA config is two controller assemblies installed in the same physical chassis except if a MetroCluster (MC) configuration.
Appliance Check
A MC will have a controller in the top slot which is connect to its partner through cables or switches.
The "LOADER" prompt will include -A if attached to the top controller or -B if attached to the bottom controller.
NOTE
HA-config Status Command: After logging in, "cf status" will display the state of the HA . Example of >> cf status cmd
WARNING for HA (Active-Active) configurations:
STOP! If the failure has caused a controller failover you may have been dispatched on the surviving controller's serial number, not
the failed one.
3 HA Controller Configuration
a) If the 'target' and 'partner' controllers are UP, the end-user will have to issue a cf takeover from the partner node.
Work with NGS if you have questions.
b) If the 'target' controller's console response is: "Waiting for giveback" proceed with step 6.
4 For non-HA Configuration Only: If the console response is "login" or "password" or the <system prompt>, the end-user will
have issue a halt on the system for proper shutdown. Work with NGS if you have questions.
5
6
Step 6: Hitting
Enter displays
Information on
Partner Status
will show:
auto-giveback
------------true
b) Disable the auto-giveback option if enabled from the partner node. (copy-n-paste)
Cluster-Mode
Cluster-Mode (Run
(Run in
in clustershell)
clustershell)
cluster::>
cluster::> sto
sto fa
fa modify
modify -node
-node local
local -auto-giveback
-auto-giveback false
false
2
The date and time is stored in the system PROM in Greenwich Mean Time, (GMT) also known as Universal Time Clock, (UTC).
At the LOADER> prompt, enter: show date Record on paper the system's GMT time and the local time to determine the
number of hours (and minutes) the local time is ahead or behind GMT.
LOADER-A>
LOADER-A> show
show date
date
Current
Current date
date && time
time is:
is: 06/12/2011
06/12/2011 15:59:10
15:59:10
Step
Step 2):
2): Enter
Enter:'show
show date'
date
Enter: printenv This command displays (and captures) all boot environmental variables.
LOADER-A>
LOADER-A> printenv
printenv
STEP
STEP 3):
3): Enter
Enter:printenv
printenv
LOADER> autoboot
Loading
X86_64/freebsd/image1/kernel:0x100000/3375736
0x538280/3221872
Step 6: Enter: autoboot
.....
Copyright (C) 1992-2010 NetApp.
All rights reserved.
*******************************
Step 6a): Wait for
*
*
this message, then
* Press Ctrl-C for Boot Menu. *
hit ^C (CTRL-C)
*
*
*******************************
^CBoot Menu will be available.
LOADER> autoboot
Loading
x86_64/freebsd/image2/kernel:....0x100000/3386664
0x53b000/3222096 0x84da50/1190096
Step 6: Enter: autoboot
.....
NetApp Data ONTAP 8.0.1 Cluster-Mode
Copyright (C) 1992-2010 NetApp.
All rights reserved.
*******************************
Step 6a): Wait for
*
*
this message, then
* Press Ctrl-C for Boot Menu. *
hit ^C (CTRL-C)
*
*
*******************************
^CBoot Menu will be available.
*>
(normal)
Normally
(install)
Install new software first
(password [<user>]) Change user password
(setup)
Run setup first
(init)
Initialize disks and create
flexvol
(maint)
Boot into maintenance mode
(syncflash)
Update flash from backup
config
(reboot)
Reboot node
Step 6b):
Please make a selection: maint
Enter: maint
.....
.....
In a High Availablity configuration, you MUST
ensure that the
partner node is (and remains) down, or that
takeover is manually
disabled on the partner node, because High
Availability
software is not started or fully enabled in
Maintenance mode.
FAILURE TO DO SO CAN RESULT IN YOUR FILESYSTEMS
BEING DESTROYED
NOTE: It is okay to use 'show/status' sub-commands
such as
'disk show or aggr status' in Maintenance mode
while the partner is up
.....
*>
From the > *> prompt enter fcadmin config to log the configuration of the integrated FC host adapters.
a) Note the "0a-0d" Adapter ports to see if configured as a "target" adapter. If so, it will need to be configured later.
*> fcadmin config
Example Only
Step 7: Enter: fcadmin config
Local
Adapter Type
State
Status
--------------------------------------------------Step 7a): Log all the adapters
0a
initiator CONFIGURED.
online
listed as "target" adapters. In
0b
target
CONFIGURED
offline
0c
initiator CONFIGURED.
online
our example, adapters 0b and
0d
target
CONFIGURED
offline
0d are targets
8
Go to Section V, "Remove the cables, Cable Management Tray and extract the Controller Module" on next page.
V. V-FAS3100 Family: Remove the cables, Cable Management Tray and extract the Controller Module
Step Action Description
If TWO controllers modules are installed, DO NOT shut off the power supplies to replace the controller card, BUT DO
NOTE
shut off both power supplies if only ONE controller card is installed .
1 On the controller to be serviced, loosen the red thumbscrew, ref Fig 2 & 3; pull down on the cam lever and slide the controller
module towards you a few inches or until it stops.
STOP!
and
READ
HA (Active-Active) Configuration : If the red NVMEM Status D87 LED starts flashing ref Page-1, Fig 3,
when the controller is extracted from the chassis:
(i) Confirm from end-user or NGS that the partner controller had a clean takeover, or if this controller was "waiting for
giveback", the flashing LED can be ignored.
(ii) If a non-successful takeover, the flashing LED indicates uncommitted customer data - Contact NGS
Non-HA Configuration : If the red NVMEM Status D87 LED is flashing, the system was not 'halted' properly:
(i) Ask end-user if controller was properly "halted". If not, re-insert controller and if the system does not autoboot, enter
'bye ' at the LOADER-A|B> prompt . If the system boots to the login prompt, login and then enter 'halt' to properly
shutdown. Engage NGS if questions.
this
CAUTION
* The node configuration should have been determined by following Section III.
FAS3100-NVRAM-LEDs
NOTE For detail on the locations of the two NVRAM LEDs click here >>
2 Before proceeding further the state of the NVMEM LED should be resolved if it's valid by reading caution above.
3 Label each cable connector with its port number and then unplug the cabling from the connector.
NOTE If possible keep the cables in the cable clips on the cable tray to keep them in the correct position for reconnection.
4 Remove the cable management tray, Fig 6a-b, by pushing in the sides of the tray at the arrows and lifting it up.
Fig 6a
Fig 6b
Optional Cable
Management Trays
Cable Management
Tray Mounting Hooks
Push in on the blue release latch on the left side of the tray as shown in Fig 7 and firmly grip the tray on each side as you extract
it.
Fig 7
Go to Section VI, "Move onboard SFPs - Remove PCI Cards and the Riser" on next page.
VI. V-FAS3100 Family: Move onboard SFPs - Remove PCI Cards and the Riser
Step Action Description
1 Remove each SFP/GBIC one at a time inserted in the original Controller Module's on-board Ethernet and FC ports and install into
the same port location in the replacement Module. (Do not mix them up!)
2 Loosen the thumbscrew on the controller module side panel and swing the side panel open until it comes off the controller
module. Ref Fig 8a.
3 Label each card with it's PCI slot number and slide out of PCI Riser, Fig 8b.
4 Loosen the PCI card riser thumbscrew and pull the riser up and out of the socket. Ref Fig 8c.
5 Attach and close the side panel of the System Tray.
Fig 8b)
Fig 8a)
Fig 8c)
Fig 9a
Fig 9b
Insert the RLM into the replacement controller module by pressing it completely into the socket .
Go to Section VIII, "Exchange the CompactFlash Cards" on next page.
Fig 10b
CompactFlash Card
showing proper
orientation
CompactFlash Slot
Fig 11a
Fig 13a
NVRAM DIMM
Fig 13b
XI. V-FAS3100 Family: Install PCI Riser and Cards (if any)
Step Action Description
1 Loosen the thumbscrew on the new controller module side panel and swing the side panel open until it comes off the controller
module.
2 Align the PCI riser removed from the original controller module with the guide slots on the replacement controller module, and
then push down to seat it completely in the socket and tighten the riser thumbscrew.
3 Install the PCI cards removed from the original controller module into correct slots on the replacement controller module.
XII. V-FAS3100 Family: Partially Reinsert the Replacement Controller and Reconnect the cables
Step Action Description
1 Partially insert the controller into the slot so that the cables can be attached- DO NOT engage the backplane yet.
2 Re-attach the Cable Management Tray if removed. (Reference pictures in Section V)
Do not re-connect the FC cables for ports 0a-0d as the "maintenance mode" boot or the Fibre Channel Diags may fail. A "step"
STOP
has been added after the FC ports are configured in Section XV to reconnect these cable(s).
3 Fully insert all other cables that were removed to their proper port until each clicks in. Test by pulling on them.
4 Go to Section XIII, "Set date and time" on next page.
Action Description
Re-attach laptop to the console port and capture the display output even if using the end user's computer.
Fully Insert the Controller Module into the slot and raise the cam lever and secure it with Red thumbscrew.
IMMEDIATELY after the console message "Starting AUTOBOOT press Ctrl-C to abort" is displayed, press Ctrl-C
(^C) key a couple times to abort the autoboot. See Console output example below.
Phoenix TrustedCore(tm) Server
Copyright 1985-2006 Phoenix Technologies Ltd.
.......
"." = Deleted lines to save space
.......
Portions Copyright (C) 2002-2008 NetApp
CPU Type: Intel(R) Xeon(R) CPU
L5410
@ 2.33GHz
IF you miss the window to abort the autoboot, look for this message: "Press CTRL-C for boot menu" and complete steps
4a-4c, otherwise if at the "LOADER" prompt, skip to step 5 .
a. Immediately Press ^C (CTRL-C) to access the "Boot menu".
b. If a 'System ID mismatch' warning message below is displayed, answer : y
.......
.......
*******************************
*
*
* Press Ctrl-C for Boot Menu. *
*
*
STEP 4a:
*******************************
Press CTRL-C
^C
Boot Menu will be available.
Restoring /var from /cfcard/x86/freebsd/varfs.tgz
WARNING: System id mismatch. This usually occurs when replacing CF or NVRAM cards!
Override system id? {y|n} [n] y
STEP 4b: Enter: y
N
O
T
E
If the replacement MB fails to boot to the Maintenance menu, confirm the memory DIMMS and all PCI cards are
properly seated. Also was the original Boot Device (CF Card) moved from the original MB to the replacement?
Engage NGS for assistance.
If the system reports the NVRAM battery is not detected, re-check the battery cable connection. If the system
reports the battery voltage is too low or a critical failure, do NOT proceed - Do NOT bypass the system stop.
STOP Engage NGS and ask if this motherboard is being replaced due to a battery issue. If so, a new battery needs to
be installed before continuing.
Do NOT bypass the system halt on a NVRAM battery voltage issue. Controller giveback will fail.
c. Next, drop to the LOADER prompt from the Boot Menu following the linked proceshere
Continue with Section XIII on next page.
LOADER-A> update_flash
Step 11: At the LOADER-A|B> prompt, enter:
New BIOS Version: 4.4.0
update_flash to update the flash PROM
New Loader Version: 1.8
Saving Primary Image to Secondary
Updating Secondary Boot Flash
Programming .+.+.+.+.+.+.+.+.+.+.+.+.+.+.+.+.+.+.+.+.+.+.+.+.+.+.+.+.+.+.+.+ done. 2097152
bytes written
Updating Primary Boot Flash
Programming .+.+.+.+.+.+.+.+.+.+.+.+.+.+ done. 917504 bytes written
LOADER-A>
12
XIV. FAS3100 Family: Run Diagnostics (20-45 minutes depending on model and expansion options)
Step Action Description
1 Test the Replacement Tray with diagnostics by entering boot_diags at the "LOADER-A|B>" prompt.
STOP There is a bug in older versions of the diags. The diag version is highlighted below - Read the NOTE Text box.
2 Check the Diagnostic version in the Menu and enter the proper test sequence below:
IF Diag version is 5.6.1 or higher enter: run mb mem cf-card
IF Diag version is lower than 5.6.1 enter: run mem cf-card
These diagnostics tests are basic confidence tests for the new motherboard, memory and CompactFlash. IF any single test
FAILs, then the next diag test will not be started. Contact NGS for test failure. IF error can be skipped, run remaining test(s)
individually, ex: run mem or run cf-card, etc
LOADER-A> boot_diags
Loading x86_64/diag/diag.krn:...0x200000/12601344 0xe04800/4664888 0x1277638/8 Entry at
0x00202018
STEP 1: Enter: boot_diags
Starting program at 0x00202018
Commands:
Config (print a list of configured PCI devices)
Default (restore all options to default settings)
Exit
(exit diagnostics)
Help
(print this commands list)
Options (print current option settings)
Version (print the diagnostic version)
Run
<diag ... diag> (run selected diagnostics)
Options:
Count
<number> (loop selected diagnostic(s) (number) of passes)
Loop
<yes|no> (loop selected diagnostic(s)) IF any Comprehensive test FAILs, then the next
diag test will not be started
Status <yes|no> (print status messages)
Stop
<yes|no> (stop-on-error / keep running)
Xtnd
<yes|no> (extended tests / regular tests)
Mchk
<auto|off|on|halt> (machine check control)
Cpu
<0|1|2|3> (default cpu)
Seed
<number> (random seed (0:use machine generated number))
CompactFlash Diagnostic
-----------------------.....
****** Comprehensive CompactFlash test .......... PASSED
STEP 6:
6:
STEP
Note:
PLEASE CONFIRM
CONFIRM
Note: PLEASE
V-FAS3140
should
total~4GB
~4GB
V-FAS3140 should total
V-FAS3160
should
total
~8GB
V-FAS3160 should total ~8GB
V-FAS3170 should
shouldtotal
total~16GB
~16GB
V-FAS3170
Outputisisfrom
froma aFAS3170
FAS3170
Output
Note: That the Comprehensive
Memory test, Comprehensive
RLM test & Comprehensive
CompactFlash test "PASSED"
NOTE: If the NVRAM FW is down rev, an autoupdate will start and the controller will reboot.
NOTE
Example Only
Local
Adapter Type
State
Status
--------------------------------------------------0a
initiator CONFIGURED.
online
0b
initiator CONFIGURED
offline
0c
initiator CONFIGURED.
online
0d
initiator CONFIGURED
offline
STEP 5: Enter:
fcadmin config -t target <HA> for
each port to be configured as a target
7
8
Firmly re-connect any FC cables that were left disconnected to adapters '0a, 0b, 0c or 0d' now. They must completely click in.
Continue with Section XV on next page.
In this example, the local System ID for the new Controller is 1943753293.
The old MB System ID was 1573753606 (disk show -v from Section IV).
Example Only
The disks need to be reassigned to the local System ID.
DISK
OWNER
POOL
SERIAL NUMBER HOME
------------ --------------------------- ------------1b.02.4
fas3170cl2-ams(1573753632)
Pool0
9QJ7VRRF
fas3170cl2-ams(1573753632)
1b.02.3
fas3170cl2-ams(1573753632)
Pool0
9QJ7WMNQ
fas3170cl2-ams(1573753632)
.....
0d.41
fas3170cl2-ams(1573753632)
Pool0
JLVT29GC
fas3170cl2-ams(1573753632)
0d.43
fas3170cl2-ams(1573753632)
Pool0
JLVT7BUC
fas3170cl2-ams(1573753632)
.....
0c.21
fas3170cl1-ams(1573753606)
Pool0
JLVT0KDC
fas3170cl1-ams(1573753606)
0c.18
fas3170cl1-ams(1573753606)
Pool0
JLVT2HZC
fas3170cl1-ams(1573753606)
.....
1d.01.13
fas3170cl1-ams(1573753606)
Pool0
9QJ7W3XZ
fas3170cl1-ams(1573753606)
1d.01.21
fas3170cl1-ams(1573753606)
Pool0
9QJ7WSX8
fas3170cl1-ams(1573753606)
1d.01.16
fas3170cl1-ams(1573753606)
Pool0
9QJ7W3YT
fas3170cl1-ams(1573753606)
1d.01.12
fas3170cl1-ams(1573753606)
Pool0
9QJ7WS0R
fas3170cl1-ams(1573753606)
4
S
T
O
P
Procedure-A
Bug
537799
A1
A2
A3
A4
A5
A6
Procedure to be followed
IF ONTAP version is 8.2.x or higher , follow the linked process here
IF ONTAP version is less than 8.2, follow Procedure-A below.
Follow Procedure-B on next page.
Partner has taken over target controller running ONTAP less than version 8.2
Execute all of these steps on the PARTNER node.
ONTAP < 8.0.3 will give an error message if more than 500 disks are attempted to be reassigned from the partner.
If system is exposed, a system outage is required to do the disk reassign - Read this link for options >here
Connect to the PARTNER node and login: (7-Mode=root , C-Mode=admin ). Engage end-user for password.
C-Mode only: Enter: run local to enter the nodeshell.
Partner "takeover" Verification Check: Check the console prompt as follows:
Case1: IF the prompt has the word (takeover) in it (Example: nodeB (takeover)> ) , continue with step A4.
Case2: IF the prompt has a "/" in it (Example: nodeA / nodeB> ) , enter: partner and then press enter key twice to
exit the partner shell. IF the prompt has "(takeover)" in it - continue with step A4, otherwise NGS.
Case3: IF either case1 or case2 does not match your console prompt, verify with cf status.
IF no "takeover", follow Procedure-B on next page. Questions or if partial takeover, NGS.
Enter: partner aggr status -f IF any FAILED disk exists, inform customer and NGS about that.
FAILED disk(s) must be physically dis-engaged before the disk reassignment and giveback (Leave the disk in
the slot until replacement is received). If the system reports " partner: Not in takeover mode" or "partner
not found" you are entering the command from the wrong controller!! - Restart at step 1 above.
Enter: priv set advanced at the prompt for the following command to work. Prompt will include " * ".
Reconfirm the partner console prompt has the word "takeover" in it (see Command Example below) and then
enter: disk reassign -s <old_system_ID> -d <new_system_ID>
Cut and paste the old and new System IDs from the console Log. Read "CAUTION" steps 1 and 2.
Example Only
1) IF the system reports: "Partner node must not be in Takeover mode during disk reassignment
C
A
U
T
I
O
N
Disk ownership will be updated on all disks previously belonging to Filer with
sysid 1573753606.
7-Mode only: A console message will be displayed for
Would you like to continue (y/n)? y
Enter: y
each disk changing ownership (System ID)
STOP
A9
If the console messages stated the giveback must be completed immediately, do not enter any other commands on
the partner node until "after" the disk ownership on the down node is verified and the giveback is completed.
Continue with step 2 on next page.
Procedure-B
B1
Use for Node that does not have a partner (non-HA) OR the partner did not takeover
Execute all of these steps from Maintenance mode on the repaired node.
At the maintenance mode " * > " prompt enter:
disk reassign -s <old_system_ID> -d <new_system_ID> Read "CAUTION" below.
Cut and paste the old and new System IDs from the console Log.
Enter: y
B3
Enter: y
2
From the console port on "target" controller on which you replaced the MB (in maintenance mode):
a) Enter: disk show -s <old-sysID> No disks or V-Series LUNs should be listed as shown in console window below.
(The "-s old-sysID" was specified in the disk reassign step1)
IF any disks/V-LUNs are listed, a reservation may not have released, continue with step 2(b). IF no output, skip to step 3.
*> disk show -s 1573753606
Local System ID: 1943753293
Example Only
DISK
OWNER
POOL
SERIAL NUMBER
HOME
---------- ----------------- ------------------------*>
IF all disks properly reassigned, there should be no disks/V-LUNs listed in the output.
b) Not all disks re-assigned: Re-issue the disk reassign, from the node it was entered on, to see if the reservation releases.
Then repeat the disk show -s command in step 2(a). IF disks/V-LUNs are still listed in the output, call Support.
Continue with Section XVII on next page.
Example Only
DISK
OWNER
------------ ------------1b.02.4
fas3170cl2-ams(1573753632)
1b.02.3
fas3170cl2-ams(1573753632)
.....
0d.41
fas3170cl2-ams(1573753632)
0d.43
fas3170cl2-ams(1573753632)
.....
0c.21
-------------------------0c.18
-------------------------.....
1d.01.13
-------------------------1d.01.21
-------------------------1d.01.16
-------------------------1d.01.12
-------------------------.....
.....
POOL
----Pool0
Pool0
Pool0
Pool0
JLVT29GC
JLVT7BUC
fas3170cl2-ams(1573753632)
fas3170cl2-ams(1573753632)
Pool0
Pool0
JLVT0KDC
JLVT2HZC
(1943753293)
(1943753293)
Pool0
Pool0
Pool0
Pool0
9QJ7W3XZ
9QJ7WSX8
9QJ7W3YT
9QJ7WS0R
(1943753293)
(1943753293)
(1943753293)
(1943753293)
If this is a V-Series (V3200), perform the below Additional Steps. If not , skip to step 6.
The "disk show -v" command displays the connectivity to the third party array and any (optional) NetApp disks.
At the maintenance mode prompt: " * >", enter: halt to exit to LOADER-A|B.
Go to Section XVIII, "Boot PROM Variable Checks" on next page.
At the LOADER-A|B> prompt, enter: printenv to list "all" the boot PROM variables. Search the output to confirm the required
variables are properly set based on your system type and configuration . Follow steps A-C in Table-1 below.
1) If you accidently issued a "set-defaults", all the configuration variables are unset - You have to examine the printenv capture
STOP
of the original MB to determine which ones will need to be manually entered in step 3 for your System Type & Configuration.
Table-1
Step System Type & Configuration
A) IF HA:
Verify this variable is set with correct partner sys-id value.
B) IF C-Mode:
Verify this variable is set to true.
C) IF V-Series:
fc-non-array-adapter-list is only required if the V-Series is also
hosting NetApp multipath shelves.
One of these two variables "may" be set if the storage
switches are McData.
3
If any of the variables are missing or values are not set correctly, set them now one by one by following steps a-d below.
a) Open the console log file that was previously captured from the old MB and scroll to the "printenv" output listing.
b) Use the "Find" function and search for the variable required and its value. Sample "printenv" output ishere
c) Once identified, copy them using "Ctrl-C", then at the LOADER prompt enter:
setenv <(CTRL-V)to paste the variable and value>
d) Enter: savenv to save the variable and value.
LOADER> setenv variable-name value
LOADER> savenv
PROM will accept improper variable names. Recheck they were entered properly by issuing printenv again.
STOP
Go to Section XIX, "Boot the Operating System - 'giveback' if applicable" on next page.
4
5
NOTE 3.1:
If you see this message, this node is part
of a HA configuration and the partner
node took over for it.
Login into the PARTNER node (7-Mode=root , C-Mode=admin ). Engage end-user for password.
Check takeover status by entering the appropriate command shown for the specified ONTAP Mode. If "partner not ready" may
have to wait 2-4 minutes for the NVRAMs to synchronize.
7-Mode
partner(takeover)> cf status
Cluster-Mode
cluster::> run local cf status
IF Pre-ONTAP 8.2 and 7-Mode: Ask the customer if there are any heavy NDMP, SnapMirror or SnapVault processes running. If
Yes, they should be disabled due to bug 489060.
The procedure to disable the processes is here.
7 Enter the proper controller giveback command(s) based on the mode running as follows:
A giveback cannot be completed due to: "a failed disk" or "Open CIFS sessions" or "partner not ready" .
IF FAILED disk: Physically dis-engage the failed disk (Leave the disk in the slot till replacement is received).
IF Open CIFS sessions: Check with customer how to close out CIFS sessions. Terminating CIFS can cause loss of data.
NOTE
IF partner "not ready": Wait 5 minutes for the NVMEMs to snychronize.
Giveback fails due to any other reason? contact NGS.
7-Mode
partner(takeover)> cf giveback
8
Cluster-Mode
cluster::> storage failover giveback -fromnode local
XIX. V-FAS3100 Family: Boot the Operating System - 'giveback' if applicable (cont.)
Step Action Description
9 Wait! 90 seconds for 7-Mode or 3 minutes for Cluster-Mode after giveback reported complete.
Check controller failover status by entering the appropriate command shown for the specified ONTAP Mode.
7-Mode
partner> cf status
Controller Failover enabled,
XYZ is up.
Look for failover enabled
10
Cluster-Mode
cluster::> storage failover show
Follow step (a)
IF this system has a partner controller, but the partner did not takeover, (Disks were assigned in maintenance mode) continue
STOP with step 10a (Ref Internal TSB-1209-02). Note, the console message above is not displayed when disks are reassigned in
maintenance mode.
a) Login to the "repaired node" (target) and re-enable "controller failover" using proper command syntax below (copy-andpaste).
Cluster-Mode (run from clustershell )
7-Mode
(1st cmd is for 2-node clusters ONLY, 2nd cmd is for 3 or more node clusters)
cluster::> cluster ha modify -configured true
OR
cluster::> storage failover modify -node local -enabled true
target> cf enable
11
From the "repaired node", execute a takeover using the proper command below to sync the sys-IDs.
7-Mode
target> cf takeover
a) Wait! 60 seconds for 7-Mode or 90 seconds for Cluster-Mode after takeover reports complete- Then check takeover
status by entering the appropriate command shown for the specified ONTAP Mode.
7-Mode
target(takeover)> cf status
Cluster-Mode
cluster::> run local cf status
b) After the appropriate Wait period in step 11a) and the cf status reports: "Ready for giveback" , enter the proper
"giveback" command below. This is the final synchronization of the system-Ids across the HA pair.
7-Mode
target(takeover)> cf giveback
c) Continue with Section XIX on next page.
Cluster-Mode
cluster::> storage failover giveback -fromnode local
XIX. V-FAS3100 Family: Boot the Operating System - 'giveback' if applicable (cont.)
Step Action Description
d) Wait Again! This time 90 seconds for 7-Mode or 3 miniutes for Cluster-Mode and then check giveback status by entering
the appropriate command below for the proper ONTAP mode. For 7-Mode look for failiover enabled , for Cluster-Mode
follow step (i).
7-Mode
target> cf status
Controller Failover enabled,
XYZ is up.
12
Cluster-Mode
cluster::> storage failover show
Follow step (i).
(i) For ONTAP Cluster Mode, storage failover show should not show any "partial" givebacks. If there are, wait
another 60 seconds and recheck. Some large systems may take up to 10 minutes to complete.
Click > ONTAP 8 failover show
to see examples of output. Issues? Call NGS.
IF Cluster-Mode: Follow steps (a-b) below, otherwise skip to Step 13.
a) From the clustershell on each node, enter the command below to list the logical interfaces that are not on their home
server and port.
Cluster-Mode
cluster::> net int show -is-home false
Example of output here> net int show
b) If any interfaces are listed as "false" in the above command, enter the command below to revert them back to their home
port. Issues? Call NGS.
Cluster-Mode
cluster::> net int revert *
13
XX. V-FAS3100 Family: Controller registration, Enable options, Submit logs and Part Return
Step Action Description
NOTE Service entitlements break when the MB is swapped because the new motherboard changes the system serial number.
1 IF NDMP, SnapMirror or SnapVault options were disabled, enable them now. Refer to page 2 of doc > >
here
2 C-Mode Only: Re-enable "auto-giveback" options if they were disabled on either node. C-Mode command
here
3 Ask customer if using Operations Manager? If so, can they still access the controllers?
Bug
583160
4
SNMP v3 access may fail despite the user existing with the correct permissions after a takeover or giveback.
WORKAROUND: Change the password or re-add the user. Engage Customer to step through the Work-around.
Ask end-user if using "AutoSupport"? If YES, perform step 4(a). If NO, perform step 4(b).
a) ASUP Enabled System: Request end-user to send NetApp an ASUP Message from the target node so the configuration
setup can be verified and the new system serial number can be registered by NGS. If the target system is not UP, send
ASUP from its partner. Use the corresponding command for the mode of ONTAP running. Enter your dispatch's 7-digit
FSO number (begins with 5).
7-Mode
filer> options autosupport.doit 5xxxxxx
5
6
7
8
9
Cluster-Mode
cluster::> invoke * -type all -message 5xxxxxx
b) If ASUP is disabled: Call NGS CSR and provide the new MB serial number so they can register it as the new system s/n.
xdl-tpm-console-logs@netapp.com
Email the console log with the NetApp Reference Number in the Subject Line to
Place the defective part in the antistatic bag and seal the box.
Follow the return shipping instructions on the box to ship the part(s) back to NetApps RMA processing center. If the
shipping label is missing see process to obtain a shipping label here >
Missing Shipping Label?
Verify with customer that the system is OK and if working with NGS ask them if it is OK to be released.
Close dispatch per Rules of Engagement.