Nmi iock error

The following message is seen on the console NMI: IOCK error (debug interrupt?) CPU 0 Modules linked in: ipt_MASQUERADE iptable_nat ip_nat xt_state ip_conntrack nfnetlink ipt_REJECT xt_tcpudp iptable_filter ip_tables x_tables bridge mptctl mptbase bonding be2iscsi ib_iser rdma_cm ib_cm iw_cm ib_sa ib_mad ib_core ib_addr iscsi_tcp bnx2i cnic ipv6 xfrm_nalgo crypto_api uio cxgb3i cxgb3 8021q libiscsi_tcp libiscsi2 scsi_transport_iscsi2 scsi_transport_iscsi dm_round_robin dm_multipath scsi_dh video backlight sbs power_meter hwmon i2c_ec i2c_core dell_wmi wmi button battery asus_acpi acpi_memhotplug ac parport_pc lp parport joydev sr_mod cdrom hpilo bnx2 serio_raw shpchp pcspkr sg dm_raid45 dm_message dm_region_hash dm_mem_cache dm_snapshot dm_zero dm_mirror dm_log dm_mod usb_storage qla2xxx scsi_transport_fc ata_piix libata cciss sd_mod scsi_mod ext3 jbd uhci_hcd ohci_hcd ehci_hcd Pid: 0, comm: swapper Not tainted 2.6.18-194.17.4.el5 #1 RIP: 0010:[] [] acpi_processor_idle_simple+0x14c/0x30e RSP: 0018:ffffffff803fbf58 EFLAGS: 00000046 RAX: 0000000000d4d87e RBX: ffff81061e10a160 RCX: 0000000000000908 RDX: 0000000000000915 RSI: 0000000000000003 RDI: 0000000000000000 RBP: 0000000000d4d87e R08: ffffffff803fa000 R09: 0000000000000039 R10: ffff810001005710 R11: 0000000000000000 R12: 0000000000000000 R13: ffff81061e10a000 R14: 0000000000000000 R15: 0000000000000000 FS: 0000000000000000(0000) GS:ffffffff803ca000(0000) knlGS:0000000000000000 CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b CR2: 0000000009013954 CR3: 000000060799d000 CR4: 00000000000006e0 Process swapper (pid: 0, threadinfo ffffffff803fa000, task ffffffff80308b60) Stack: ffff81061e10a000 ffffffff8019d404 0000000000000000 ffffffff8019d404 0000000000090000 0000000000000000 0000000000000000 ffffffff8004923a 0000000000200800 ffffffff80405807 0000000000090000 0000000000000000 Call Trace: [] acpi_processor_idle_simple+0x0/0x30e [] acpi_processor_idle_simple+0x0/0x30e [] cpu_idle+0x95/0xb8 [] start_kernel+0x220/0x225 [] _sinittext+0x22f/0x236 Code: 89 ca ed ed 41 89 c4 41 8a 45 1c 83 e0 30 3c 30 75 15 f0 ff

Environment

  • Red Hat Enterprise Linux 6
  • Red Hat Enterprise Linux 5

Issue

  • The following message is seen on the console

NMI: IOCK error (debug interrupt?) CPU 0 Modules linked in: ipt_MASQUERADE iptable_nat ip_nat xt_state ip_conntrack nfnetlink ipt_REJECT xt_tcpudp iptable_filter ip_tables x_tables bridge mptctl mptbase bonding be2iscsi ib_iser rdma_cm ib_cm iw_cm ib_sa ib_mad ib_core ib_addr iscsi_tcp bnx2i cnic ipv6 xfrm_nalgo crypto_api uio cxgb3i cxgb3 8021q libiscsi_tcp libiscsi2 scsi_transport_iscsi2 scsi_transport_iscsi dm_round_robin dm_multipath scsi_dh video backlight sbs power_meter hwmon i2c_ec i2c_core dell_wmi wmi button battery asus_acpi acpi_memhotplug ac parport_pc lp parport joydev sr_mod cdrom hpilo bnx2 serio_raw shpchp pcspkr sg dm_raid45 dm_message dm_region_hash dm_mem_cache dm_snapshot dm_zero dm_mirror dm_log dm_mod usb_storage qla2xxx scsi_transport_fc ata_piix libata cciss sd_mod scsi_mod ext3 jbd uhci_hcd ohci_hcd ehci_hcd Pid: 0, comm: swapper Not tainted 2.6.18-194.17.4.el5 #1 RIP: 0010:[<ffffffff8019d550>] [<ffffffff8019d550>] acpi_processor_idle_simple+0x14c/0x30e RSP: 0018:ffffffff803fbf58 EFLAGS: 00000046 RAX: 0000000000d4d87e RBX: ffff81061e10a160 RCX: 0000000000000908 RDX: 0000000000000915 RSI: 0000000000000003 RDI: 0000000000000000 RBP: 0000000000d4d87e R08: ffffffff803fa000 R09: 0000000000000039 R10: ffff810001005710 R11: 0000000000000000 R12: 0000000000000000 R13: ffff81061e10a000 R14: 0000000000000000 R15: 0000000000000000 FS: 0000000000000000(0000) GS:ffffffff803ca000(0000) knlGS:0000000000000000 CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b CR2: 0000000009013954 CR3: 000000060799d000 CR4: 00000000000006e0 Process swapper (pid: 0, threadinfo ffffffff803fa000, task ffffffff80308b60) Stack: ffff81061e10a000 ffffffff8019d404 0000000000000000 ffffffff8019d404 0000000000090000 0000000000000000 0000000000000000 ffffffff8004923a 0000000000200800 ffffffff80405807 0000000000090000 0000000000000000 Call Trace: [<ffffffff8019d404>] acpi_processor_idle_simple+0x0/0x30e [<ffffffff8019d404>] acpi_processor_idle_simple+0x0/0x30e [<ffffffff8004923a>] cpu_idle+0x95/0xb8 [<ffffffff80405807>] start_kernel+0x220/0x225 [<ffffffff8040522f>] _sinittext+0x22f/0x236 Code: 89 ca ed ed 41 89 c4 41 8a 45 1c 83 e0 30 3c 30 75 15 f0 ff

Resolution

  • Use vendor hardware diagnostics software to analyse system health.

  • Contact the hardware manufacturer for further assistance.

  • Under RHEL6, the kernel.panic_on_io_nmi = 1 sysctl can be set to have the system panic when an I/O NMI is received. Refer to How to set sysctl variables on Red Hat Enterprise Linux? for details.

Root Cause

  • Under x86_64, I/O port 0x61 (System Control Port B) holds the reason of an NMI (NonMaskable Interrupt).
  • For a detailed description of NMIs, refer e.g. to the Intel® 64 and IA-32 Architectures Software Developer’s Manual Volume 3A: System Programming Guide, Part 1.

  • An NMI is a hardware-driven interrupt and it is usually reserved to report serious hardware errors, to a CPU.

File: include/asm-x86_64/nmi.h

    #define get_nmi_reason() inb(0x61)
  • In the event of an NMI it is possible to consult the aforementioned port to obtain an indication of what caused the error.

  • In the default_do_nmi() function we see why io_check_error() is called and consequently why the NMI: IOCK error (debug interrupt?) message is displayed on the console:

File: arch/x86_64/kernel/traps.c

    asmlinkage __kprobes void default_do_nmi(struct pt_regs *regs)
    {
        unsigned char reason = 0; 
        int cpu; 

        cpu = smp_processor_id();
        ...

        if (!cpu)
            reason = get_nmi_reason();

        ...
        if (reason & 0x40)
            io_check_error(reason, regs);
    }

    static __kprobes void
    io_check_error(unsigned char reason, struct pt_regs * regs)
    {
        printk("NMI: IOCK error (debug interrupt?)n");
        show_registers(regs);
        ...
  • As illustrated above, when the 6-th bit (1 << 6 == 0x40) is set, it reports that an I/O Channel Check error (IOCHK) has occurred.
    • Parity and uncorrectable hardware errors are examples of why an IOCHK error could be raised.
    • Most hardware errors should however be reported through the MCE (Machine Check Exception) mechanism. An MCE indicates that the CPU detected an internal machine error or a bus error, or that an external agent detected a bus error. Normally the hardware manufacturer will be able to provide further details.

Diagnostic Steps

  • Check dmesg and lspci output to figure out the cause of the NMI


  • Product(s)

  • Red Hat Enterprise Linux


  • Component

  • kernel


  • Category

  • Troubleshoot


  • Tags

  • hardware
  • kernel
  • kexec
  • rhel_5
  • rhel_6

This solution is part of Red Hat’s fast-track publication program, providing a huge library of solutions that Red Hat engineers have created while supporting our customers. To give you the knowledge you need the instant it becomes available, these articles may be presented in a raw and unedited form.

NMI IOCK error for reason 60 on CPU 0 (i3 2100/ASUS H77M)


  • Thread starter

    jiewmeng


  • Start date

    Apr 10, 2013



May 7, 2011



147



0



18,690

1


  • #1

My PC suddenly feels very laggy. Sometimes when I type, I suddenly get many duplicate characters. When I go into console (Ctrl+Alt+F2) I see afew messages abt NMI: IOCK error (debug interrupt?) for reason 60 on CPU 0 popping out. The reason number might be different. I am getting: dmesg output. This was on Arch Linux. But on Windows 7, I gotten 3 BSOD’s by now at different times about Hardware Malfunction, NMI: Channel Check / IOCHK

[  835.524732] NMI: IOCK error (debug interrupt?) for reason 60 on CPU 0.
[  835.524737] CPU 0 
[  835.524742] Pid: 0, comm: swapper/0 Tainted: G           O 3.8.6-1-ARCH #1 System manufacturer System Product Name/P8H77-M
[  835.524745] RIP: 0010:[<ffffffff812ce36c>]  [<ffffffff812ce36c>] intel_idle+0xac/0x100
[  835.524754] RSP: 0018:ffffffff81801e38  EFLAGS: 00000046
[  835.524756] RAX: 0000000000000020 RBX: 0000000000000008 RCX: 0000000000000001
[  835.524758] RDX: 0000000000000000 RSI: ffffffff818687c0 RDI: 000000000180d000
[  835.524760] RBP: ffffffff81801e60 R08: 0000000000000003 R09: 000000000000233f
[  835.524762] R10: 0000000000000000 R11: 00000000000005ee R12: 0000000000000003
[  835.524763] R13: 0000000000000020 R14: 0000000000000003 R15: ffffffff81387f30
[  835.524766] FS:  0000000000000000(0000) GS:ffff88021ec00000(0000) knlGS:0000000000000000
[  835.524768] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  835.524770] CR2: 00007fe8f403df00 CR3: 000000000180d000 CR4: 00000000000407f0
[  835.524772] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[  835.524774] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[  835.524777] Process swapper/0 (pid: 0, threadinfo ffffffff81800000, task ffffffff81815440)
[  835.524778] Stack:
[  835.524779]  0000000081801e48 ffff88021ec19f70 ffffffff818687c0 000000c2baefba76
[  835.524784]  0000000000000003 ffffffff81801e70 ffffffff81387f49 ffffffff81801ea8
[  835.524787]  ffffffff81388ae2 ffff88021ec19f70 0000000000000003 0000000000000000
[  835.524798] Call Trace:
[  835.524802]  [<ffffffff81387f49>] cpuidle_enter+0x19/0x20
[  835.524804]  [<ffffffff81388ae2>] cpuidle_wrap_enter+0x32/0x80
[  835.524805]  [<ffffffff81387f60>] cpuidle_enter_tk+0x10/0x20
[  835.524807]  [<ffffffff813886f6>] cpuidle_idle_call+0xb6/0x390
[  835.524810]  [<ffffffff8101edac>] cpu_idle+0xec/0x150
[  835.524813]  [<ffffffff814a2e3f>] rest_init+0x7f/0x90
[  835.524816]  [<ffffffff818d0c38>] start_kernel+0x3db/0x3e7
[  835.524818]  [<ffffffff818d0666>] ? repair_env_string+0x5c/0x5c
[  835.524820]  [<ffffffff818d0120>] ? early_idt_handlers+0x120/0x120
[  835.524822]  [<ffffffff818d0355>] x86_64_start_reservations+0x130/0x133
[  835.524824]  [<ffffffff818d045a>] x86_64_start_kernel+0x102/0x10f
[  835.524825] Code: 31 d2 48 83 c0 10 48 89 d1 0f 01 c8 0f ae f0 65 48 8b 04 25 f0 c6 00 00 48 8b 80 38 e0 ff ff a8 08 75 08 b1 01 4c 89 e8 0f 01 c9 <85> 1d 2e a4 59 00 75 0e 48 8d 75 dc bf 05 00 00 00 e8 9e 13 de

Its a Intel Core i3 2100 and ASUS H77M MB. I originally thought it was a SSD fault (prev post) … Does this likely confirm CPU is faulty? What can I do? Will trying to resit the CPU in the MB help?

I think its useful to understand the cause too. since its a self built system. So I can avoid making same mistakes again? The CPU was in use for probably >1 years? The MB was replaced probably just ~6mths (guessing) due to mistake in flashing BIOS. No other major changes in hardware apart from that. CPU temps looks fine typically <40 degrees from BIOS. I noted that BIOS POST works, does this eliminate problems with CPU/MB? Its after POST, it gets stuck at blinking cursor. I tried booting from a bootable USB too. If the HDD is stuck, then USB is likely going to encounter the sme problem too.

UPDATE

I noticed that if I can get to Syslinux bootloader, after selecting an entry, I usually get Parity Error. I tried removing one RAM at a time, results are inconsistent, sometimes, I will only get a blinking cursor after POST (thus no syslinux nor partity error at all). I am thinking I might need to change some components soon. But I was hoping to isolate the problem so I dont replace working parts if possible.

das_stig



Jul 24, 2009



8,213



5



41,965

1,342




May 7, 2011



147



0



18,690

1


  • #3

Btw, I also noticed that if I ever get into bootloader (syslinux) but fails to load as OS, I get «Parity Error» on screen. I guess the problem is still likely MB/CPU?

I tried once after being able to boot an bootable ISO to load memtest+ but it seems to hang doing nothing …



Sep 11, 2012



6,966



0



18,460

375


  • #4

My first guess would be a flaky motherboard. I have gotten that warning on an OLD 486 with an EGA video card with an attitude.
It generated NMIs whenever it was sent data to its registers for whatever reason.

It is a general error/warning for whenever an NMI is generated by flaky/buggy hardware.

Thread starter Similar threads Forum Replies Date

idk12345121

Question CPU causing beep error on startup CPUs 4 Jan 12, 2023

N

Question CPU Fan Error CPUs 2 Jan 5, 2023

excitedlybored

Question please help — CPU Over Voltage Error CPUs 1 Dec 4, 2022

haydvn

Question how do i fix my cpu boot problem error code ndis.sys CPUs 1 Nov 25, 2022

C

Question Processor core fatal error causing constant system crashes CPUs 6 Nov 21, 2022

E

Question Upgraded CPU from Ryzen 5 2600 to a Ryzen 7 5700x and launching Modern Warfare 2 instantly reboots my computer with no error message. CPUs 4 Nov 15, 2022

M

Question OCCT errors when running at default (XMP off) but not when XMP is enabled CPUs 7 Oct 24, 2022

Funky Pig

[SOLVED] CPU fan error AND cannot go into BIOS CPUs 3 Aug 23, 2022

N

Question Hardware error, CPU temp is 84C in BIOS ? CPUs 7 Aug 6, 2022

S

[SOLVED] Cannot identify why CPU or MOBO died — Complete mystery CPUs 7 Aug 6, 2022

  • Advertising
  • Cookies Policies
  • Privacy
  • Term & Conditions
  • Topics

  • Index
  • » Kernel & Hardware
  • » CPU Faulty? NMI: IOCK error (debug interrupt?) for reason 60 on CPU 0

Pages: 1

#1 2013-04-10 01:09:03

jiewmeng
Member
Registered: 2012-02-09
Posts: 118

CPU Faulty? NMI: IOCK error (debug interrupt?) for reason 60 on CPU 0

My PC suddenly feels very laggy. Sometimes when I type, I suddenly get many duplicate characters. When I go into console (Ctrl+Alt+F2) I see afew messages abt NMI: IOCK error (debug interrupt?) for reason 60 on CPU 0 popping out. The reason number might be different. I am getting: dmesg output.

[  835.524732] NMI: IOCK error (debug interrupt?) for reason 60 on CPU 0.
[  835.524737] CPU 0 
[  835.524742] Pid: 0, comm: swapper/0 Tainted: G           O 3.8.6-1-ARCH #1 System manufacturer System Product Name/P8H77-M
[  835.524745] RIP: 0010:[<ffffffff812ce36c>]  [<ffffffff812ce36c>] intel_idle+0xac/0x100
[  835.524754] RSP: 0018:ffffffff81801e38  EFLAGS: 00000046
[  835.524756] RAX: 0000000000000020 RBX: 0000000000000008 RCX: 0000000000000001
[  835.524758] RDX: 0000000000000000 RSI: ffffffff818687c0 RDI: 000000000180d000
[  835.524760] RBP: ffffffff81801e60 R08: 0000000000000003 R09: 000000000000233f
[  835.524762] R10: 0000000000000000 R11: 00000000000005ee R12: 0000000000000003
[  835.524763] R13: 0000000000000020 R14: 0000000000000003 R15: ffffffff81387f30
[  835.524766] FS:  0000000000000000(0000) GS:ffff88021ec00000(0000) knlGS:0000000000000000
[  835.524768] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  835.524770] CR2: 00007fe8f403df00 CR3: 000000000180d000 CR4: 00000000000407f0
[  835.524772] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[  835.524774] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[  835.524777] Process swapper/0 (pid: 0, threadinfo ffffffff81800000, task ffffffff81815440)
[  835.524778] Stack:
[  835.524779]  0000000081801e48 ffff88021ec19f70 ffffffff818687c0 000000c2baefba76
[  835.524784]  0000000000000003 ffffffff81801e70 ffffffff81387f49 ffffffff81801ea8
[  835.524787]  ffffffff81388ae2 ffff88021ec19f70 0000000000000003 0000000000000000
[  835.524798] Call Trace:
[  835.524802]  [<ffffffff81387f49>] cpuidle_enter+0x19/0x20
[  835.524804]  [<ffffffff81388ae2>] cpuidle_wrap_enter+0x32/0x80
[  835.524805]  [<ffffffff81387f60>] cpuidle_enter_tk+0x10/0x20
[  835.524807]  [<ffffffff813886f6>] cpuidle_idle_call+0xb6/0x390
[  835.524810]  [<ffffffff8101edac>] cpu_idle+0xec/0x150
[  835.524813]  [<ffffffff814a2e3f>] rest_init+0x7f/0x90
[  835.524816]  [<ffffffff818d0c38>] start_kernel+0x3db/0x3e7
[  835.524818]  [<ffffffff818d0666>] ? repair_env_string+0x5c/0x5c
[  835.524820]  [<ffffffff818d0120>] ? early_idt_handlers+0x120/0x120
[  835.524822]  [<ffffffff818d0355>] x86_64_start_reservations+0x130/0x133
[  835.524824]  [<ffffffff818d045a>] x86_64_start_kernel+0x102/0x10f
[  835.524825] Code: 31 d2 48 83 c0 10 48 89 d1 0f 01 c8 0f ae f0 65 48 8b 04 25 f0 c6 00 00 48 8b 80 38 e0 ff ff a8 08 75 08 b1 01 4c 89 e8 0f 01 c9 <85> 1d 2e a4 59 00 75 0e 48 8d 75 dc bf 05 00 00 00 e8 9e 13 de 

Its a Intel Core i3 2100 and ASUS H77M MB. I originally thought it was a SSD fault (prev post) … Does this likely confirm CPU is faulty? What can I do? Will trying to resit the CPU in the MB help?


Computer Science Student, Web Developer

#2 2013-04-10 01:22:58

cfr
Member
From: Cymru
Registered: 2011-11-27
Posts: 7,127

Re: CPU Faulty? NMI: IOCK error (debug interrupt?) for reason 60 on CPU 0


CLI Paste | How To Ask Questions

Arch Linux | x86_64 | GPT | EFI boot | refind | stub loader | systemd | LVM2 on LUKS
Lenovo x270 | Intel(R) Core(TM) i5-7200U CPU @ 2.50GHz | Intel Wireless 8265/8275 | US keyboard w/ Euro | 512G NVMe INTEL SSDPEKKF512G7L

#3 2013-04-10 04:45:52

jiewmeng
Member
Registered: 2012-02-09
Posts: 118

Re: CPU Faulty? NMI: IOCK error (debug interrupt?) for reason 60 on CPU 0

I saw that link before. Its a self built system. So I think I need to roughly determine whats faulty. I guess I can conclude its the CPU? Also I noticed the reason number can be different, but its always from CPU 0. So its most likely a CPU thing? Not sure if I actually registered for any warranty sad … Also, its not a very good time to upgrade, since Haswell is coiming out soon …


Computer Science Student, Web Developer

#4 2013-04-11 00:40:39

jiewmeng
Member
Registered: 2012-02-09
Posts: 118

Re: CPU Faulty? NMI: IOCK error (debug interrupt?) for reason 60 on CPU 0

Does that imply a likely CPU or motherboard error? Someone replied at toms hardware thread its likely motherboard?http://www.tomshardware.com/answers/id-1648567/nmi-iock-error-reason-cpu-2100-asus-h77m.html


Computer Science Student, Web Developer

This document (7009715) is provided subject to the disclaimer at the end of this document.

Environment

SUSE Linux Enterprise Server 10 Service Pack 4

Situation

kdump on SLES10SP4 fails with the following message:

Resolution

Affected by this issue are certain HP SmartArray controllers supported by the cciss or hpsa driver (a comprehensive list and more details can be found in the HP support advisory listed below). All tools and configuration files which are necessary to create the dump on a system running SLES10 require access to the root partition regardless of the dump device. As long as root is located on the affected SCSI controller, the dump will not complete.

SLES 11 is not affected by this issue. If possible, please consider an upgrade to SLES 11 SP1 on those machines this behavior is experienced on.

HP support advisory:

Advisory c02911740 — HP Smart Array Controllers — Kernel Core Dump Using Kdump May Not Complete in Linux With Certain HP Smart Array Controllers

For details on setting up kdump on SLES10 or SLES11 please see TID 000016171 Configure crashkernel memory for kernel core dump analysis

Disclaimer

This Support Knowledgebase provides a valuable tool for SUSE customers and parties interested in our products and solutions to acquire information, ideas and learn from one another. Materials are provided for informational, personal or non-commercial use within your organization and are presented «AS IS» WITHOUT WARRANTY OF ANY KIND.

  • Document ID:7009715
  • Creation Date:
    09-Nov-2011
  • Modified Date:24-Feb-2021
    • SUSE Linux Enterprise Server

< Back to Support Search

For questions or concerns with the SUSE Knowledgebase please contact: tidfeedback[at]suse.com

Понравилась статья? Поделить с друзьями:
  • Nginx ошибка 304
  • Nmesh2 could not load geometry city car driving ошибка
  • Nginx как изменить порт
  • Nmake fatal error u1077 cl return code 0x1
  • Nls error detected