由于丢失OLR导致的节点无法启动

环境：RHEL6.5+11.2.0.4 RAC，两节点

问题描述：故意把OLR删掉，重启后发现GI无法启动

分析过程：

1.确认GI启动到了哪一个阶段

[[email protected] ~]$ crsctl status resource -t -init
CRS-4639: Could not contact Oracle High Availability Services
CRS-4000: Command Status failed, or completed with errors.

解析：发现连OHASD都没有启动，两种可能：1是init.ohasd脚本没有被调用 2是ohasd.bin守护进程没有启动成功，那么：
[[email protected] ~]$ ps -ef | grep ohas |grep -v grep
root       960     1  0 09:23 ?        00:00:00 /bin/sh /etc/init.d/init.ohasd run

发现，脚本被调用了，但是守护进程没有成功启动。

2.查看ohasd的日志

2016-04-18 12:26:25.918: [ default][1661986592] OHASD Daemon Starting. Command string :restart
2016-04-18 12:26:25.919: [ default][1661986592] Initializing OLR
2016-04-18 12:26:25.919: [  OCROSD][1661986592]utopen:6m': failed in stat OCR file/disk /u01/app/11.2.0.1/grid/cdata/rac1.olr, errno=2, os err string=No such file or directory
2016-04-18 12:26:25.919: [  OCROSD][1661986592]utopen:7: failed to open any OCR file/disk, errno=2, os err string=No such file or directory
2016-04-18 12:26:25.919: [  OCRRAW][1661986592]proprinit: Could not open raw device
2016-04-18 12:26:25.919: [  OCRAPI][1661986592]a_init:16!: Backend init unsuccessful : [26]
2016-04-18 12:26:25.920: [  CRSOCR][1661986592] OCR context init failure.  Error: PROCL-26: Error while accessing the physical storage Operating System error [No such file or directory] [2]
2016-04-18 12:26:25.920: [ default][1661986592] Created alert : (:OHAS00106:) :  OLR initialization failed, error: PROCL-26: Error while accessing the physical storage Operating System error [No such file or directory] [2]
2016-04-18 12:26:25.920: [ default][1661986592][PANIC] OHASD exiting; Could not init OLR
2016-04-18 12:26:25.920: [ default][1661986592] Done.

解析：看报错是OLR打不开，那就过去看看存在不（手动删的，怎么可能存在）

[[email protected] cdata]$ ll
total 12
drwxrwxr-x 2 grid oinstall 4096 Apr 18 07:51 liming-cluster
drwxr-xr-x 2 grid oinstall 4096 Apr 18 07:49 localhost
drwxr-xr-x 2 grid oinstall 4096 Apr 18 08:11 rac1

OLR不存在了。

3.查看OLR的备份是否存在

[[email protected] rac1]$ ll
total 6644
-rw------- 1 root root 6803456 Apr 18 08:11 backup_20160418_081108.olr

可以的。

4.恢复OLR

<span style="font-size:18px;">[[email protected] bin]# ./ocrconfig -local -restore /u01/app/11.2.0.1/grid/cdata/rac1/backup_20160418_081108.olr
PROTL-35: The configured OLR location is not accessible.

书中没写的步骤来了！
[[email protected] cdata]$ touch rac1.olr
[[email protected] bin]# ./ocrconfig -local -restore /u01/app/11.2.0.1/grid/cdata/rac1/backup_20160418_081108.olr
[[email protected] bin]#
[[email protected] cdata]$ ll
total 6660
drwxrwxr-x 2 grid oinstall      4096 Apr 18 07:51 liming-cluster
drwxr-xr-x 2 grid oinstall      4096 Apr 18 07:49 localhost
drwxr-xr-x 2 grid oinstall      4096 Apr 18 08:11 rac1
-rw-r--r-- 1 grid oinstall 272756736 Apr 18 13:02 rac1.olr

</span>

5.启动GI，恢复正常

<span style="font-size:18px;">[[email protected] bin]# ./crsctl start crs </span>

时间： 2024-10-09 03:20:38

由于丢失OLR导致的节点无法启动的相关文章

系统突然断电重启导致rac节点无法启动，crs-4000错误

公司rac集群为双节点oracle11g的rac,操作系统为AIX6.1,突然断电重启了,再次查看集群状态,发现其中一个节点起不来. 经过系统工程师检查,发现重启后存储的光纤网络有十几秒左右的延时,于是手动启动crs,结果保crs-4000问题.以root用户执行./crsctl start crs仍然不行. 怀疑是asm有问题,在grid用户下asmcmd,结果发现连接到空实例,真是 ASM没有启动,于是直接在asmcmd里startup没有启动.但是半天也没有反应,于是进入asm实例: sq

一次Windows 注册表中注册表项目丢失导致的Oracle 数据库启动问题。

一次Windows 注册表中注册表项目丢失导致的Oracle 数据库启动问题. 环境说明: 1.windows 2008操作系统 x64bit 2.Oracle database 11.2.0.1 32bit 3.sqlplus / as sysdba进不去,报错: c:\user\administrator>sqlplus / as sysdba Error 6 initializing SQL*Plus Message file sp1<lang>.msb not found SP

时间不对导致vSAN服务无法启动

今天在做vSAN实验的时候发现一个问题,如果ESXi主机的时间不对(与当前时间相差太远)会导致ESXi主机的vSAN服务无法配置和启动.[说明]在安装ESXi 6.x版本时,为ESXi主机配置了默认的证书,证书的有效期为5年(以安装时主机的时间为基准).在大多数的情况下,为ESXi主机调整为正确的时候即可启动vSAN服务.如果调整为正确的时间后仍然不能启动,此时要查看ESXi主机"证书"时间,如果为主机颁发的证书的截止时间已经早于当前时间,需要重新为ESXi主机重新申请证书才能正常使用

rac 中节点的vip在该节点启动不了，在其他节点正常启动。

本实验是在虚拟机中! 与实验相同的问题:http://blog.csdn.net/wanghui5767260/article/details/38850335 问题描述: 节点1.节点2和节点3同时我做了断电处理,结果节点1 和节点2 都正常启动了,而节点3的vip在自己的节点上启动报错而在其他节点上启动正常. 注:节点3只装入了集群. 查看状态: [[email protected] bin]# ./crs_stat -t Name Type Targe

Hadoop Datanode节点无法启动（All directories in dfs.data.dir are invalid）

Hadoop Datanode节点无法启动(All directories in dfs.data.dir are invalid) java.io.IOException: All directories in dfs.datanode.data.dir are invalid: "/usr/local/hadoop-2.4.0/dfs/data" at org.apache.hadoop.hdfs.server.datanode.DataNode.checkStorageLocat

解决导致kdm无法正常启动的原因

因为 ~/.bash_profile 和 ~/.xinitrc中加入了 export XAUTHORITY="/tmp/${USER}.Xauthority" 所以每次启动桌面都无法成功注释掉后即可正常进入桌面使用登录管理器后,会调用 ~/.xprofile的变量,因此需要注意 case ${INPUT_APP} in fcitx) # 使用小企鹅输入法 export XIM=fcitx export XMODIFIERS="@im=fcitx" export

VMware Centos中创建克隆虚拟机导致网卡设备无法启动的问题

一.前言在创建克隆虚拟机的时候,如果样板虚拟机没有事先配置好,则会导致网卡设备无法启动的问题(报错如下). Bringing up interface eth0: Device eth0 does not seem to be present, delaying initialization 二.解决方法 1.修改网卡配置文件/etc/sysconfig/network-scripts/ifcfg-eth0 ?删除UUID这一行,因为每张网卡的mac地址是不一样的,所以UUID也是不一样的.

解决ROS系统中节点无法启动的问题

在调试中出现ROS节点无法启动的问题报错 Usage: rosrun [--prefix cmd] [--debug] PACKAGE EXECUTABLE [ARGS] rosrun will locate PACKAGE and try to find an executable named EXECUTABLE in the PACKAGE tree. If it finds it, it will run it with ARGS. 处理方法就是把节点文件直接移到PACKAGE的下一级文

ORA-00119: invalid specification for system parameter LOCAL_LISTENER ORA-00132: syntax error or unresolved network name 'LISTENER_ORCL' 导致Oracle数据库无法启动

前言最近安装Oracle11g,重启电脑后,无法登陆数据库,到sqlplus登陆时就报错,没有例程...等等(忘记截图了),去看了下服务,监听器和实例服务都是开着的,就是无法启动数据库,到网上找了一些教程,通过 sqlplus / as sysdba 这种方式登录数据库,然后输入 startup 命令启动数据库就报了这两个错误. 个人分析,应该是监听器配置出现了问题,如果有知道的园友,麻烦告知一二. 解决步骤 1.先以 sqlplus / as sysdba 方式启动sqlplus sqlpl