-oracle数据库恢复-Raid5数据恢复-RAID0数据恢复ORACLE优化与疑难问题讨论区 → RAC双节点,数据库莫名死机,求原因


  共有2212人关注过本帖树形打印

主题:RAC双节点,数据库莫名死机,求原因

oracle数据恢复-RAID5数据恢复-raid数据恢复
cityvigil
  1楼 个性首页 | 信息 | 搜索 | 邮箱 | 主页 | UC


数据恢复 恢复数据
等级:新手上路 帖子:18 积分:198 威望:0 精华:0 注册:2007-2-2 16:25:43
RAC双节点,数据库莫名死机,求原因  发帖心情 Post By:2007-4-13 9:54:25

linux as3


oracle9204


64位,两节点做RAC,用交换机做连接


实例1 的alter.log
Thread 1 advanced to log sequence 3718
Current log# 2 seq# 3718 mem# 0: /opt/oracle/oradata/vam/redo12.log
Fri Apr 13 03:55:37 2007
Undo Segment 31 Onlined
Fri Apr 13 03:57:52 2007
Undo Segment 32 Onlined
Fri Apr 13 04:00:36 2007
Trace dumping is performing id=[cdmp_20070413040056]  --这个文件夹里有很多文件
Fri Apr 13 04:02:22 2007
Waiting for clusterware split-brain resolution --->这一句是什么意思
Evicting instance 2 from cluster
Fri Apr 13 04:12:40 2007
Reconfiguration started
List of nodes: 0,
Fri Apr 13 04:12:40 2007
Reconfiguration started
List of nodes: 0,
LMON: terminating instance due to error 481
Fri Apr 13 04:24:45 2007
Errors in file /opt/oracle/admin/vam/bdump/vam1_lgwr_27230.trc:
ORA-00481: LMON process terminated with error
Fri Apr 13 04:24:46 2007
Errors in file /opt/oracle/admin/vam/bdump/vam1_lck0_27278.trc:
ORA-00481: LMON process terminated with error
Fri Apr 13 04:24:49 2007
Errors in file /opt/oracle/admin/vam/bdump/vam1_pmon_27195.trc:
ORA-00481: LMON process terminated with error
Fri Apr 13 04:24:49 2007
Errors in file /opt/oracle/admin/vam/bdump/vam1_lmd0_27203.trc:
ORA-00481: LMON process terminated with error
Fri Apr 13 04:24:50 2007
System state dump is made for local instance
Fri Apr 13 04:24:55 2007
Instance terminated by LMON, pid = 27201
------
/opt/oracle/admin/vam/bdump/vam1_lmd0_27203.trc:


Oracle9i Enterprise Edition Release 9.2.0.4.0 - 64bit Production
With the Partitioning, Real Application Clusters, OLAP and Oracle Data Mining options
JServer Release 9.2.0.4.0 - Production
ORACLE_HOME = /opt/oracle/product/9.2.0
System name: Linux
Node name: vamdb1
Release: 2.4.21-37.EL
Version: #1 SMP Wed Sep 7 13:32:18 EDT 2005
Machine: x86_64
Instance name: vam1
Redo thread mounted by this instance: 0 <none>
Oracle process number: 5
Unix process pid: 27203, image: oracle@vamdb1 (LMD0)


*** SESSION ID:(4.1) 2007-03-05 21:04:18.730
open lock on RM 0 0
*** 2007-03-05 21:05:41.090
open lock on RM 0 0
*** 2007-03-07 14:17:58.390
Global Wait-For-Graph(WFG) at ddTS[0.40] :
BLOCKED 0x1ac994a58 5 [0xa002d][0x6995b],[TX] [2031617,30030] 0
BLOCKER 0x1b7741408 5 [0xa002d][0x6995b],[TX] [7405570,2847] 1
BLOCKED 0x1b7741408 5 [0xa002d][0x6995b],[TX] [7405570,2847] 1
BLOCKER 0x1b7753d48 5 [0xa002d][0x6995b],[TX] [1310721,18872] 0
BLOCKED 0x1b7753d48 5 [0xa002d][0x6995b],[TX] [1310721,18872] 0
BLOCKER 0x1b7648530 5 [0xa002d][0x6995b],[TX] [2686977,8817] 0
BLOCKED 0x1b7757a18 5 [0x60028][0x6fbb4],[TX] [2686977,8817] 0
BLOCKER 0x1b7759308 5 [0x60028][0x6fbb4],[TX] [2031617,30030] 0
*** 2007-03-23 08:29:30.920
stale cvak fr 1:0x1b7dc2108([0x9][0x2],[CI])[h=KJUSERNL,n=KJUSEREX,b=KJUSERPR,ls=KJUSERSTAT_NOVALUE]:0x149014b < 0x0
*** 2007-04-12 08:03:20.390
stale cvak fr 1:0x1b7dac6e8([0x9][0x2],[CI])[h=KJUSERNL,n=KJUSEREX,b=KJUSERPR,ls=KJUSERSTAT_NOVALUE]:0x3400342 < 0x0
*** 2007-04-13 04:24:49.210
error 481 detected in background process
ORA-00481: LMON process terminated with error

~



实例2 的alter.log
Thread 2 advanced to log sequence 4569
Current log# 8 seq# 4569 mem# 0: /opt/oracle/oradata/vam/redo24.log
Fri Apr 13 04:00:56 2007
Communications reconfiguration: instance 0
Fri Apr 13 04:00:56 2007
Trace dumping is performing id=[cdmp_20070413040056]
Fri Apr 13 04:02:43 2007
Waiting for clusterware split-brain resolution
Fri Apr 13 04:12:43 2007
Errors in file /opt/oracle/admin/vam/bdump/vam2_lmon_26957.trc:
ORA-29740: evicted by member 1, group incarnation 3
LMON: terminating instance due to error 29740
Instance terminated by LMON, pid = 26957


----
/opt/oracle/admin/vam/bdump/vam2_lmon_26957.trc:


*** 2007-04-13 04:00:56.170
kjxgrcomerr: Communications reconfig: instance 0 (2,2)
kjxgrrcfgchk: Initiating reconfig, reason 3
*** 2007-04-13 04:01:02.270
kjxgmrcfg: Reconfiguration started, reason 3
kjxgmcs: Setting state to 2 0.
*** 2007-04-13 04:01:02.450
Name Service frozen
kjxgmcs: Setting state to 2 1.
*** 2007-04-13 04:01:02.570
Obtained RR update lock for sequence 2, RR seq 2
*** 2007-04-13 04:02:43.340
kjxgrrecp2: Waiting for split-brain resolution, upd 1, seq 3
*** 2007-04-13 04:12:43.430
Voting results, upd 1, seq 3, bitmap: 0
*** 2007-04-13 04:12:43.450
kjxgrdtrt: Evicted by 1, seq (3, 3)
error 29740 detected in background process
ORA-29740: evicted by member 1, group incarnation 3
ksuitm: waiting for [5] seconds before killing


个人认为是LMON(Global Enqueue SEervice Monitor)进程死了,oracle提示是重启实例,


我是 srvctl start database -d database_name 把数据库启动起来。


现是想数据库自动关闭的原因。请大侠们指点下



桃花坞里桃花庵,

桃花庵下桃花仙。

桃花仙人种桃树,

又摘桃花换酒钱。

别人笑我太疯癫,

我笑他人看不穿。

不见五陵豪杰墓,

无花无酒锄作田。

支持(0中立(0反对(0单帖管理 | 引用 | 回复 回到顶部
oracle数据恢复-RAID5数据恢复-raid数据恢复
admin
  2楼 个性首页 | 信息 | 搜索 | 邮箱 | 主页 | UC


数据恢复 恢复数据
等级:管理员 帖子:412 积分:5738 威望:0 精华:0 注册:2003-12-30 16:34:32
  发帖心情 Post By:2007-4-13 10:09:52

oracle9204有个BUG 3149370 ,可能引起这个原因,在9205里修正了!如果是生产环境如果再出现建议升级道9205以上
[此贴子已经被作者于2007-4-13 10:11:42编辑过]



http://www.sosdb.com

qq:9417901

msn:glkgdj@hotmail.com

支持(0中立(0反对(0单帖管理 | 引用 | 回复 回到顶部
oracle数据恢复-RAID5数据恢复-raid数据恢复
cityvigil
  3楼 个性首页 | 信息 | 搜索 | 邮箱 | 主页 | UC


数据恢复 恢复数据
等级:新手上路 帖子:18 积分:198 威望:0 精华:0 注册:2007-2-2 16:25:43
  发帖心情 Post By:2007-4-13 11:24:15


谢谢管理员!

图片点击可在新窗口打开查看


桃花坞里桃花庵,

桃花庵下桃花仙。

桃花仙人种桃树,

又摘桃花换酒钱。

别人笑我太疯癫,

我笑他人看不穿。

不见五陵豪杰墓,

无花无酒锄作田。

支持(0中立(0反对(0单帖管理 | 引用 | 回复 回到顶部

返回版面帖子列表

RAC双节点,数据库莫名死机,求原因








签名