添加链接
link管理
链接快照平台
  • 输入网页链接,自动生成快照
  • 标签化管理网页链接

1 背景

线上Hive任务偶尔出现hang住的现象.
经排查确认是触发了Hive的bug, 该bug在Hive-10569(
https://issues.apache.org/jira/browse/HIVE-10569 中已修复.

2 现象

用户beeline任务运行至Ended Job之后,一直未结束,查看yarn日志,该job已经运行成功.

1
2
3
4
5
6
7
INFO  : 2020-03-20 20:25:21,401 Stage-36 map = 100%,  reduce = 0%, Cumulative CPU 3.96 sec
INFO : 2020-03-20 20:25:30,224 Stage-36 map = 100%, reduce = 100%, Cumulative CPU 9.44 sec
INFO : MapReduce Total cumulative CPU time: 9 seconds 440 msec
INFO : Ended Job = job_1582793079899_13676666INFO : 2020-03-20 20:25:21,401 Stage-36 map = 100%, reduce = 0%, Cumulative CPU 3.96 sec
INFO : 2020-03-20 20:25:30,224 Stage-36 map = 100%, reduce = 100%, Cumulative CPU 9.44 sec
INFO : MapReduce Total cumulative CPU time: 9 seconds 440 msec
INFO : Ended Job = job_1582793079899_13676666

3 排查步骤

3.1 查看hiveserver2日志

找到该job在hive server端的日志,没有发现任何异常.

3.2 抓取现场

分别通过 jstack -l <pid> jmap -dump:[live,]format=b,file=<filename> <pid> 获取server端的堆栈信息.

3.3 定位排查