我正在尝试理解一个挂起的Java进程存在的问题。这个过程已经在生产中运行了大约4个月,本周早些时候它开始挂起。当我查看进程的线程转储时,所有相关线程(3)都有如下堆栈:
"TxnParser_1" prio=6 tid=0x69bd3400 nid=0x2534 runnable [0x6aa2f000]
java.lang.Thread.State: RUNNABLE
at java.net.SocketInputStream.socketRead0(Native Method)
at java.net.SocketInputStream.read(SocketInputStream.java:129)
at oracle.net.ns.Packet.receive(Unknown Source)
at oracle.net.ns.DataPacket.receive(Unknown Source)
at oracle.net.ns.NetInputStream.getNextPacket(Unknown Source)
at oracle.net.ns.NetInputStream.read(Unknown Source)
at oracle.net.ns.NetInputStream.read(Unknown Source)
at oracle.net.ns.NetInputStream.read(Unknown Source)
at oracle.jdbc.driver.T4CMAREngine.unmarshalUB1(T4CMAREngine.java:1099)
at oracle.jdbc.driver.T4CMAREngine.unmarshalSB1(T4CMAREngine.java:1070)
at oracle.jdbc.driver.T4C8Oall.receive(T4C8Oall.java:478)
at oracle.jdbc.driver.T4CStatement.doOall8(T4CStatement.java:207)
at oracle.jdbc.driver.T4CStatement.executeForDescribe(T4CStatement.java:790)
at oracle.jdbc.driver.OracleStatement.executeMaybeDescribe(OracleStatement.java:1039)
at oracle.jdbc.driver.T4CStatement.executeMaybeDescribe(T4CStatement.java:830)
at oracle.jdbc.driver.OracleStatement.doExecuteWithTimeout(OracleStatement.java:1132)
at oracle.jdbc.driver.OracleStatement.executeInternal(OracleStatement.java:1687)
at oracle.jdbc.driver.OracleStatement.execute(OracleStatement.java:1653)
- locked <0x40e22f88> (a oracle.jdbc.driver.T4CStatement)
- locked <0x28f8d398> (a oracle.jdbc.driver.T4CConnection)
at com.gcg.data.LogParsingInfo.initFromDB(LogParsingInfo.java:262)
at com.gcg.om.OmQueueEntry.initParseInfoFromDB(OmQueueEntry.java:104)
at com.gcg.om.GenericQueueEntry.run(GenericQueueEntry.java:237)
at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
at java.lang.Thread.run(Thread.java:619)
没有线程在等待锁,所以进程不会死锁。这3个正在做工作的线程只是被阻塞,等待Oracle的响应,至少在我看来是这样的。
看一下Oracle,当我查询v$session时,它看起来像是与这些线程相关联的一个连接当前正在执行一个查询,尽管我看不到sql。
select ... from v$session where ...;
SQL_ADDRESS SQL_HASH_VALUE SQL_ID SQL_CHILD_NUMBER SQL_EXEC_START SQL_EXEC_ID PREV_SQL_ADDR PREV_HASH_VALUE PREV_SQL_ID PREV_CHILD_NUMBER PREV_EXEC_START PREV_EXEC_ID
---------------- -------------- ------------- ---------------- -------------- ----------- ---------------- --------------- ------------- ----------------- --------------- ------------
00 0 0000000239F59EE8 1483377872 fqr8pndc6p36h 5 26-JUL-12 32080545
00 0 0000000239F59EE8 1483377872 fqr8pndc6p36h 5 26-JUL-12 32080546
0000000148CABD88 1784444892 a16hxxtp5sxyw 0000000239F59EE8 1483377872 fqr8pndc6p36h 5 26-JUL-12 32080544
select * from v$sql where sql_id = 'a16hxxtp5sxyw';
no rows selected
我的问题是:
更新:
基于关于在DBA_WAITERS和DBA_LOCKS中查找的评论
select * from dba_waiters;
no rows selected
select * from dba_locks where BLOCKING_OTHERS <> 'Not Blocking';
no rows selected
dba_locks中有98行,但由于所有行都“没有阻塞”,我不认为这是一个锁定问题?有问题的进程已经处于这种状态超过3个小时了,所以现在应该已经检测到了任何死锁。
我认为Oracle实例不是“健康的”,但我不知道该看什么。我有一个重启Oracle服务器的请求,但还没有完成。
后续问题:v$会话包含v$sql中不存在的sql_id正常吗?如果是,在什么情况下?
发布于 2012-07-27 22:42:54
问题解决了,答案在v$session表中是正确的。显然,Oracle会话可能会因为锁定以外的其他原因而阻塞。请注意列FINAL_BLOCKING_SESSION -它标识了导致阻塞的会话。我们调查了会话845,发现客户端进程(由机器和端口标识)不再存在。DBA终止了会话845,所有会话都恢复正常。
SID SERIAL# STATUS PROGRAM TYPE SQL_ID PREV_SQL_ID BLOCKING_SESSION_STATUS BLOCKING_INSTANCE BLOCKING_SESSION FINAL_BLOCKING_SESSION_STATUS FINAL_BLOCKING_INSTANCE FINAL_BLOCKING_SESSION EVENT
------- ------- --------- ---------------- ---- ------------- -------------- ----------------------- ----------------- ---------------- ----------------------------- ----------------------- ---------------------- ----------------------------
108 22447 ACTIVE Gcg log parser 1 USER fqr8pndc6p36h VALID 1 1581 VALID 1 845 library cache: mutex X
639 40147 ACTIVE Gcg log parser 3 USER fqr8pndc6p36h VALID 1 1581 VALID 1 845 library cache: mutex X
742 34683 ACTIVE Gcg log parser 2 USER a16hxxtp5sxyw fqr8pndc6p36h VALID 1 1581 VALID 1 845 library cache: mutex X
发布于 2016-04-11 12:50:04
我最近也遇到了这个问题,并使用以下查询在Oracle中查找锁定/锁定会话:
select
inst_id||' '||sid||','||serial# inst_sid_s#,
username,
row_wait_obj#||','||row_wait_block#||','||row_wait_row# obj_lck,
blocking_session_Status||' '||blocking_instance||','||blocking_session blk_info,
final_blocking_session_Status||' '||final_blocking_instance||','||final_blocking_session f_blk_info,
event,
seconds_in_wait
from
gv$session
where
lockwait is not null
order by
inst_id;
来源:http://www.dba-oracle.com/t_final_blocking_session_final_blocking_instance.htm
https://stackoverflow.com/questions/11673947
复制相似问题