rtkit-daemon 데몬으로 인한 부팅 fail 이슈 해결방법
페이지 정보
본문
RHEL 6.2 서버의 PM작업을 위해 재부팅 하였으며, rtkit-daemon의 Call Trace 관련 부팅 fail 발생
sosreport 분석 결과 해당 커널 버전에서 Intel CPU에 대한 버그 확인
RHEL 6.5 - kernel-2.6.32-431.el6 혹은 그 이상의 커널 업데이트를 통해 이슈 해결
[관련 문서]
[1] Servers with Intel® Xeon® Processor E5, Intel® Xeon® Processor E5 v2, or Intel® Xeon® Processor E7 v2 and certain versions of Red Hat Enterprise Linux 6
kernels become unresponsive/hung or incur a kernel panic
https://access.redhat.com/solutions/433883
분석 결과
-----------------------------------------------------
DMIDECODE
BIOS:
Vend: HP Vers: P70 Date: 02/10/2014 BIOS Rev: FW Rev: 2.0
Prod: ProLiant DL380p Gen8
CPU:
1 of 2 CPU sockets populated, 6 cores/12 threads per CPU
6 total cores, 12 total threads
Vers: Intel(R) Xeon(R) CPU E5-2620 0 @ 2.00GHz
Memory:
Total: 16384 MiB (16 GiB)
OS
Hostname: GDCBSTAT1
Kernel:
Booted kernel: 2.6.32-358.el6.x86_64
Booted kernel cmdline:
ro root=/dev/mapper/VG01-root rd_NO_LUKS LANG=en_US.UTF-8 rd_NO_MD quiet rd_LVM_LV=VG01/root
SYSFONT=latarcyrheb-sun16 rhgb KEYBOARDTYPE=pc KEYTABLE=us rd_NO_DM crashkernel=130M@0M
Taint-check: 1 (see https://access.redhat.com/solutions/40594)
0 PROPRIETARY_MODULE: Proprietary module has been loaded
- - - - - - - - - - - - - - - - - - -
Sys time: Thu Aug 27 17:48:37 KST 2020
Boot time: Thu Aug 27 08:32:39 UTC 2020 (epoch: 1598517159)
Time Zone: America/New York
Uptime: 15 min, 5 users
LoadAvg: [12 CPU] 2.36 (20%), 1.69 (14%), 0.91 (8%)
/proc/stat:
procs_running: 5 procs_blocked: 0 processes [Since boot]: 74068
cpu [Utilization since boot]:
us 10%, ni 0%, sys 3%, idle 87%, iowait 1%, irq 0%, sftirq 0%, steal 0%
Aug 27 16:57:40 GDCBSTAT1 kernel: INFO: task dbus-daemon:3732 blocked for more than 120 seconds.
Aug 27 16:57:40 GDCBSTAT1 kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 27 16:57:40 GDCBSTAT1 kernel: dbus-daemon D 0000000000000009 0 3732 1 0x00000084
Aug 27 16:57:40 GDCBSTAT1 kernel: ffff88040ac09ce8 0000000000000086 0000000000000000 ffff880434f5aae0
Aug 27 16:57:40 GDCBSTAT1 kernel: ffff880434f5ab18 0000000000000000 ffff88040ac09ca8 ffffffff81064a00
Aug 27 16:57:40 GDCBSTAT1 kernel: ffff880434f5b098 ffff88040ac09fd8 000000000000fb88 ffff880434f5b098
Aug 27 16:57:40 GDCBSTAT1 kernel: Call Trace:
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff81064a00>] ? pick_next_task_fair+0xd0/0x130
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff8150d7ff>] ? thread_return+0x16d/0x76e
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff8150e555>] schedule_timeout+0x215/0x2e0
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff810669eb>] ? enqueue_rt_entity+0x6b/0x80
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff8150e1d3>] wait_for_common+0x123/0x180
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff81063310>] ? default_wake_function+0x0/0x20
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff8150e2ed>] wait_for_completion+0x1d/0x20
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff8106513c>] sched_exec+0xdc/0xe0
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff81189fc0>] do_execve+0xe0/0x2c0
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff810095ea>] sys_execve+0x4a/0x80
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff8100b4ca>] stub_execve+0x6a/0xc0
Aug 27 16:57:40 GDCBSTAT1 kernel: INFO: task SA-linux-64:3733 blocked for more than 120 seconds.
Aug 27 16:57:40 GDCBSTAT1 kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 27 16:57:40 GDCBSTAT1 kernel: SA-linux-64 D 0000000000000009 0 3733 2732 0x00000080
Aug 27 16:57:40 GDCBSTAT1 kernel: ffff88040aee9ce8 0000000000000086 ffff880434f5b540 ffff880434f5b540
Aug 27 16:57:40 GDCBSTAT1 kernel: ffff88040aee9c58 ffff880433e98460 ffff88040aee9c98 ffffffff810d9513
Aug 27 16:57:40 GDCBSTAT1 kernel: ffff880434f5baf8 ffff88040aee9fd8 000000000000fb88 ffff880434f5baf8
Aug 27 16:57:40 GDCBSTAT1 kernel: Call Trace:
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff810d9513>] ? audit_copy_inode+0x83/0xc0
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffffa009fbf0>] ? ext4_file_open+0x0/0x130 [ext4]
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff8150e555>] schedule_timeout+0x215/0x2e0
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff8117e434>] ? nameidata_to_filp+0x54/0x70
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff812771e9>] ? cpumask_next_and+0x29/0x50
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff8150e1d3>] wait_for_common+0x123/0x180
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff81063310>] ? default_wake_function+0x0/0x20
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff8150e2ed>] wait_for_completion+0x1d/0x20
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff8106513c>] sched_exec+0xdc/0xe0
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff81189fc0>] do_execve+0xe0/0x2c0
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff810095ea>] sys_execve+0x4a/0x80
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff8100b4ca>] stub_execve+0x6a/0xc0
sosreport 분석 결과 해당 커널 버전에서 Intel CPU에 대한 버그 확인
RHEL 6.5 - kernel-2.6.32-431.el6 혹은 그 이상의 커널 업데이트를 통해 이슈 해결
[관련 문서]
[1] Servers with Intel® Xeon® Processor E5, Intel® Xeon® Processor E5 v2, or Intel® Xeon® Processor E7 v2 and certain versions of Red Hat Enterprise Linux 6
kernels become unresponsive/hung or incur a kernel panic
https://access.redhat.com/solutions/433883
분석 결과
-----------------------------------------------------
DMIDECODE
BIOS:
Vend: HP Vers: P70 Date: 02/10/2014 BIOS Rev: FW Rev: 2.0
Prod: ProLiant DL380p Gen8
CPU:
1 of 2 CPU sockets populated, 6 cores/12 threads per CPU
6 total cores, 12 total threads
Vers: Intel(R) Xeon(R) CPU E5-2620 0 @ 2.00GHz
Memory:
Total: 16384 MiB (16 GiB)
OS
Hostname: GDCBSTAT1
Kernel:
Booted kernel: 2.6.32-358.el6.x86_64
Booted kernel cmdline:
ro root=/dev/mapper/VG01-root rd_NO_LUKS LANG=en_US.UTF-8 rd_NO_MD quiet rd_LVM_LV=VG01/root
SYSFONT=latarcyrheb-sun16 rhgb KEYBOARDTYPE=pc KEYTABLE=us rd_NO_DM crashkernel=130M@0M
Taint-check: 1 (see https://access.redhat.com/solutions/40594)
0 PROPRIETARY_MODULE: Proprietary module has been loaded
- - - - - - - - - - - - - - - - - - -
Sys time: Thu Aug 27 17:48:37 KST 2020
Boot time: Thu Aug 27 08:32:39 UTC 2020 (epoch: 1598517159)
Time Zone: America/New York
Uptime: 15 min, 5 users
LoadAvg: [12 CPU] 2.36 (20%), 1.69 (14%), 0.91 (8%)
/proc/stat:
procs_running: 5 procs_blocked: 0 processes [Since boot]: 74068
cpu [Utilization since boot]:
us 10%, ni 0%, sys 3%, idle 87%, iowait 1%, irq 0%, sftirq 0%, steal 0%
Aug 27 16:57:40 GDCBSTAT1 kernel: INFO: task dbus-daemon:3732 blocked for more than 120 seconds.
Aug 27 16:57:40 GDCBSTAT1 kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 27 16:57:40 GDCBSTAT1 kernel: dbus-daemon D 0000000000000009 0 3732 1 0x00000084
Aug 27 16:57:40 GDCBSTAT1 kernel: ffff88040ac09ce8 0000000000000086 0000000000000000 ffff880434f5aae0
Aug 27 16:57:40 GDCBSTAT1 kernel: ffff880434f5ab18 0000000000000000 ffff88040ac09ca8 ffffffff81064a00
Aug 27 16:57:40 GDCBSTAT1 kernel: ffff880434f5b098 ffff88040ac09fd8 000000000000fb88 ffff880434f5b098
Aug 27 16:57:40 GDCBSTAT1 kernel: Call Trace:
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff81064a00>] ? pick_next_task_fair+0xd0/0x130
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff8150d7ff>] ? thread_return+0x16d/0x76e
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff8150e555>] schedule_timeout+0x215/0x2e0
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff810669eb>] ? enqueue_rt_entity+0x6b/0x80
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff8150e1d3>] wait_for_common+0x123/0x180
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff81063310>] ? default_wake_function+0x0/0x20
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff8150e2ed>] wait_for_completion+0x1d/0x20
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff8106513c>] sched_exec+0xdc/0xe0
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff81189fc0>] do_execve+0xe0/0x2c0
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff810095ea>] sys_execve+0x4a/0x80
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff8100b4ca>] stub_execve+0x6a/0xc0
Aug 27 16:57:40 GDCBSTAT1 kernel: INFO: task SA-linux-64:3733 blocked for more than 120 seconds.
Aug 27 16:57:40 GDCBSTAT1 kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 27 16:57:40 GDCBSTAT1 kernel: SA-linux-64 D 0000000000000009 0 3733 2732 0x00000080
Aug 27 16:57:40 GDCBSTAT1 kernel: ffff88040aee9ce8 0000000000000086 ffff880434f5b540 ffff880434f5b540
Aug 27 16:57:40 GDCBSTAT1 kernel: ffff88040aee9c58 ffff880433e98460 ffff88040aee9c98 ffffffff810d9513
Aug 27 16:57:40 GDCBSTAT1 kernel: ffff880434f5baf8 ffff88040aee9fd8 000000000000fb88 ffff880434f5baf8
Aug 27 16:57:40 GDCBSTAT1 kernel: Call Trace:
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff810d9513>] ? audit_copy_inode+0x83/0xc0
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffffa009fbf0>] ? ext4_file_open+0x0/0x130 [ext4]
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff8150e555>] schedule_timeout+0x215/0x2e0
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff8117e434>] ? nameidata_to_filp+0x54/0x70
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff812771e9>] ? cpumask_next_and+0x29/0x50
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff8150e1d3>] wait_for_common+0x123/0x180
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff81063310>] ? default_wake_function+0x0/0x20
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff8150e2ed>] wait_for_completion+0x1d/0x20
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff8106513c>] sched_exec+0xdc/0xe0
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff81189fc0>] do_execve+0xe0/0x2c0
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff810095ea>] sys_execve+0x4a/0x80
Aug 27 16:57:40 GDCBSTAT1 kernel: [<ffffffff8100b4ca>] stub_execve+0x6a/0xc0
- 이전글context switching 관련 CPU 사용율 문의 답변 20.10.20
댓글목록
등록된 댓글이 없습니다.