RAC root.sh fix +ASM instance number order

The last step of a RAC installation is to run root.sh on each node. I did not wait for it to complete on the first node before starting it on the second node, and that messed up the cluster configuration. I needed to clean it up.

I used this Oracle support document as a reference:
https://support.oracle.com/epmos/faces/DocumentDisplay?_afrLoop=411427040358360&id=942166.1&_afrWindowMode=0&_adf.ctrl-state=c8ddpyf94_4

Step 1: As root, run "$GRID_HOME/crs/install/rootcrs.pl -verbose -deconfig -force" on all nodes, except the last one.

Step 2: As root, run "$GRID_HOME/crs/install/rootcrs.pl -verbose -deconfig -force -lastnode" on the last node. This command zeroes out the OCR, the voting disk, and the ASM diskgroup that holds the OCR and voting disk.


Step 2 may not run successfully. In that case, I can zero out the OCR/voting disk with the dd command:
 
# dd if=/dev/zero of=/dev/sdb1 bs=1024 count=100

Step 3: As root, run $GRID_HOME/root.sh on first node

Step 4: As root, run $GRID_HOME/root.sh on all other node(s), except last one.

Step 5: As root, run $GRID_HOME/root.sh on last node.
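
The five steps above can be laid out as a dry run before touching anything. This is only a sketch: it prints the commands in order and executes nothing. The GRID_HOME default is an assumption taken from the ORACLE_HOME value in my crsconfig_params, and print_plan is a helper name I made up for illustration.

```shell
#!/bin/sh
# Dry run only: prints the commands in the order they should be run.
# Assumption: GRID_HOME defaults to the ORACLE_HOME value from crsconfig_params.
GRID_HOME=${GRID_HOME:-/u01/11.2.0/grid}

# print_plan is a hypothetical helper, not an Oracle tool.
# Arguments: node names in the intended order; the final argument is the last node.
print_plan() {
    total=$#
    i=0
    # Steps 1 and 2: deconfig every node, adding -lastnode only on the final one.
    for node in "$@"; do
        i=$((i + 1))
        if [ "$i" -lt "$total" ]; then
            echo "deconfig $node: $GRID_HOME/crs/install/rootcrs.pl -verbose -deconfig -force"
        else
            echo "deconfig $node: $GRID_HOME/crs/install/rootcrs.pl -verbose -deconfig -force -lastnode"
        fi
    done
    # Steps 3 to 5: root.sh on one node at a time, in node order.
    for node in "$@"; do
        echo "configure $node: $GRID_HOME/root.sh (wait for it to finish before the next node)"
    done
}

print_plan xxxxracstg01 xxxxracstg02 xxxxracstg03
```

Each printed command still has to be run as root on the named node, and each root.sh has to finish before the next one starts.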



root.sh uses the $GRID_HOME/crs/install/crsconfig_params configuration file to configure the cluster.

Below is a section of the crsconfig_params file:
##################################
SILENT=false
ORACLE_OWNER=grid
ORA_DBA_GROUP=oinstall
ORA_ASM_GROUP=asmadmin
LANGUAGE_ID=AMERICAN_AMERICA.AL32UTF8
TZ=America/Vancouver
ISROLLING=true
REUSEDG=false
ASM_AU_SIZE=4
USER_IGNORED_PREREQ=true


ORACLE_HOME=/u01/11.2.0/grid
ORACLE_BASE=/u01/app/oracle
OLD_CRS_HOME=

JREDIR=/u01/11.2.0/grid/jdk/jre/
JLIBDIR=/u01/11.2.0/grid/jlib

VNDR_CLUSTER=false
OCR_LOCATIONS=NO_VAL
CLUSTER_NAME=xxxxracstg01
HOST_NAME_LIST=xxxxracstg01,xxxxracstg02,xxxxracstg03
NODE_NAME_LIST=xxxxracstg01,xxxxracstg02,xxxxracstg03
PRIVATE_NAME_LIST=
VOTING_DISKS=NO_VAL
#VF_DISCOVERY_STRING=%s_vfdiscoverystring%
ASM_UPGRADE=false
ASM_SPFILE=
ASM_DISK_GROUP=ocr_vote
ASM_DISCOVERY_STRING=/dev/emcpower*
ASM_DISKS=/dev/emcpowerd1,/dev/emcpowere1,/dev/emcpowerf1
ASM_REDUNDANCY=NORMAL
CRS_STORAGE_OPTION=1
CSS_LEASEDURATION=400
CRS_NODEVIPS='xxxxracstg01-vip/255.255.254.0/bond0.401,xxxxracstg02-vip/255.255.254.0/bond0.401,xxxxracstg03-vip/255.255.254.0/bond0.401'
NODELIST=xxxxracstg01,xxxxracstg02,xxxxracstg03
NETWORKS="bond0.401"/xx.250.4.0:public,"bond1.34"/xx.10.18.0:cluster_interconnect
SCAN_NAME=xxxxracstg00-scan.company.corp
SCAN_PORT=1521
GPNP_PA=
OCFS_CONFIG=

# GNS consts
GNS_CONF=false
GNS_ADDR_LIST=
GNS_DOMAIN_LIST=
GNS_ALLOW_NET_LIST=
GNS_DENY_NET_LIST=
GNS_DENY_ITF_LIST=

#### Required by OUI add node
NEW_HOST_NAME_LIST=
NEW_NODE_NAME_LIST=
NEW_PRIVATE_NAME_LIST=
NEW_NODEVIPS='xxxxracstg01-vip/255.255.254.0/bond0.401,xxxxracstg02-vip/255.255.254.0/bond0.401,xxxxracstg03-vip/255.255.254.0/bond0.401'

############### OCR constants
# GPNPCONFIGDIR is handled differently in dev (T_HAS_WORK for all)
# GPNPGCONFIGDIR in dev expands to T_HAS_WORK_GLOBAL
GPNPCONFIGDIR=$ORACLE_HOME
GPNPGCONFIGDIR=$ORACLE_HOME
OCRLOC=
OLRLOC=
OCRID=
CLUSTER_GUID=

CLSCFG_MISSCOUNT=

#### IPD/OS
CRFHOME="/u01/11.2.0/grid"
##################################
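
Before rerunning root.sh it can help to check what the file says about node order and the OCR/voting diskgroup. A minimal sketch, assuming the file is in plain KEY=value form as shown above; get_param is a hypothetical helper name and the default path is the one from my environment.

```shell
#!/bin/sh
# Assumption: path built from the ORACLE_HOME value shown in the listing above.
PARAMS=${PARAMS:-/u01/11.2.0/grid/crs/install/crsconfig_params}

# get_param is a hypothetical helper: prints the value of one KEY=value line.
# $1 = parameter name, $2 = path to the params file
get_param() {
    sed -n "s/^$1=//p" "$2" | head -n 1
}

if [ -r "$PARAMS" ]; then
    echo "Nodes, in configured order:"
    get_param NODE_NAME_LIST "$PARAMS" | tr ',' '\n'
    echo "OCR/voting diskgroup: $(get_param ASM_DISK_GROUP "$PARAMS")"
    echo "ASM disks: $(get_param ASM_DISKS "$PARAMS")"
fi
```

The node order in NODE_NAME_LIST is the order I wanted the +ASM instance numbers to follow.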

Because I ran root.sh on node 3 before node 2, node 2 took the +ASM3 instance and node 3 took the +ASM2 instance. I used the method above to fix the +ASM instance numbering: after I ran the procedure again in the correct order, node 2 took +ASM2 and the problem was fixed.
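
One way to confirm which +ASM instance a node ended up with is to look at the ASM pmon process name on that node. This is a rough sketch, not part of the Oracle note; asm_instance_from_ps is a hypothetical helper name, and it assumes the usual asm_pmon_+ASMn process naming.

```shell
#!/bin/sh
# asm_instance_from_ps is a hypothetical helper: reads `ps -ef` style output on
# stdin and prints the ASM instance name found in the pmon process, e.g. +ASM2.
asm_instance_from_ps() {
    sed -n 's/.*asm_pmon_\(+ASM[0-9]*\).*/\1/p' | head -n 1
}

# Run on each node; prints nothing if no ASM instance is running there.
if command -v ps >/dev/null 2>&1; then
    echo "$(hostname): $(ps -ef | asm_instance_from_ps)"
fi
```

Running this on every node and comparing the result with the node numbers from "olsnodes -n" shows whether the instance numbers line up with the node order.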
