| View Issue Details [ Jump to Notes ] | [ Issue History ] [ Print ] | ||||||||||||
| ID | Project | Category | View Status | Date Submitted | Last Update | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0008135 | CentOS-7 | kernel | public | 2015-01-26 13:38 | 2016-10-11 09:00 | ||||||||
| Reporter | DaltonJ | ||||||||||||
| Priority | urgent | Severity | crash | Reproducibility | have not tried | ||||||||
| Status | new | Resolution | open | ||||||||||
| Product Version | 7.0-1406 | ||||||||||||
| Target Version | Fixed in Version | ||||||||||||
| Summary | 0008135: Constantly TCP Crashes / Link Reset in NIC (Tigon 3) | ||||||||||||
| Description | Hi, I'm using the tigon 3 kmod-driver from ELRepo 7 (tg3), because I had the same results with all the kernel versions of CentOS from the first to the last, which I'm using right now : 3.10.0-123.13.2.el7.x86_64 I've didn't found any way to fix this, here is the log it generates in /var/log/messages : Jan 26 03:23:15 HFX0027 kernel: ------------[ cut here ]------------ Jan 26 03:23:15 HFX0027 kernel: WARNING: at net/ipv4/tcp_output.c:1058 tcp_fragment+0x2e6/0x2f0() Jan 26 03:23:15 HFX0027 kernel: Modules linked in: tg3(OF) ptp pps_core sg acpi_cpufreq coretemp kvm_intel kvm crct10dif_pclmul crc32_pclmul crc32c_intel hpilo ghash_clmulni_intel aesni_intel iTCO_wdt lrw hpwdt gf128mul iTCO_vendor_supp$ Jan 26 03:23:15 HFX0027 kernel: CPU: 0 PID: 0 Comm: swapper/0 Tainted: GF W O-------------- 3.10.0-123.13.2.el7.x86_64 #1 Jan 26 03:23:15 HFX0027 kernel: Hardware name: HP ProLiant DL320e Gen8 v2, BIOS P80 03/28/2014 Jan 26 03:23:15 HFX0027 kernel: 0000000000000000 dec78833ec6c41b8 ffff88040b4038d8 ffffffff815e232c Jan 26 03:23:15 HFX0027 kernel: ffff88040b403910 ffffffff8105dee1 ffff8803da69cc00 ffff8803f2fe0000 Jan 26 03:23:15 HFX0027 kernel: 0000000000000001 0000000000000030 00000000363bedb0 ffff88040b403920 Jan 26 03:23:15 HFX0027 kernel: Call Trace: Jan 26 03:23:15 HFX0027 kernel: <IRQ> [<ffffffff815e232c>] dump_stack+0x19/0x1b Jan 26 03:23:15 HFX0027 kernel: [<ffffffff8105dee1>] warn_slowpath_common+0x61/0x80 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff8105e00a>] warn_slowpath_null+0x1a/0x20 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff81525d56>] tcp_fragment+0x2e6/0x2f0 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff8151c5ef>] tcp_mark_head_lost+0x1cf/0x270 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff8151db33>] tcp_update_scoreboard+0x43/0x1d0 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff815216b3>] tcp_fastretrans_alert+0x683/0xca0 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff815227f8>] tcp_ack+0xa28/0x1070 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff81523322>] tcp_rcv_established+0x1a2/0x8d0 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff8152d3f5>] tcp_v4_do_rcv+0x1b5/0x470 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff8150f478>] ? ip_output+0x58/0x90 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff8152f917>] tcp_v4_rcv+0x667/0x7c0 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff8108ed38>] ? __wake_up_common+0x58/0x90 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff815098c4>] ip_local_deliver_finish+0xb4/0x1f0 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff81509b98>] ip_local_deliver+0x48/0x80 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff8150953d>] ip_rcv_finish+0x7d/0x350 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff81509e04>] ip_rcv+0x234/0x380 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff814d0046>] __netif_receive_skb_core+0x676/0x870 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff8101a0d9>] ? read_tsc+0x9/0x20 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff814d0258>] __netif_receive_skb+0x18/0x60 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff814d02e0>] netif_receive_skb+0x40/0xd0 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff814d0d40>] napi_gro_receive+0x80/0xb0 Jan 26 03:23:15 HFX0027 kernel: [<ffffffffa03cf5ad>] tg3_poll_work+0x7ad/0xfe0 [tg3] Jan 26 03:23:15 HFX0027 kernel: [<ffffffffa03cfe1b>] tg3_poll_msix+0x3b/0x140 [tg3] Jan 26 03:23:15 HFX0027 kernel: [<ffffffff8109506d>] ? ttwu_do_activate.constprop.85+0x5d/0x70 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff814d06aa>] net_rx_action+0x15a/0x250 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff81067047>] __do_softirq+0xf7/0x290 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff815f435c>] call_softirq+0x1c/0x30 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff81014cf5>] do_softirq+0x55/0x90 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff810673e5>] irq_exit+0x115/0x120 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff815f4c58>] do_IRQ+0x58/0xf0 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff815e9dad>] common_interrupt+0x6d/0x6d Jan 26 03:23:15 HFX0027 kernel: <EOI> [<ffffffff814835a2>] ? cpuidle_enter_state+0x52/0xc0 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff814836d5>] cpuidle_idle_call+0xc5/0x200 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff8101bc7e>] arch_cpu_idle+0xe/0x30 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff810b47e5>] cpu_startup_entry+0xf5/0x290 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff815c3d57>] rest_init+0x77/0x80 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff81a07fa7>] start_kernel+0x429/0x44a Jan 26 03:23:15 HFX0027 kernel: [<ffffffff81a07987>] ? repair_env_string+0x5c/0x5c Jan 26 03:23:15 HFX0027 kernel: [<ffffffff81a07120>] ? early_idt_handlers+0x120/0x120 Jan 26 03:23:15 HFX0027 kernel: [<ffffffff81a075ee>] x86_64_start_reservations+0x2a/0x2c Jan 26 03:23:15 HFX0027 kernel: [<ffffffff81a07742>] x86_64_start_kernel+0x152/0x175 Jan 26 03:23:15 HFX0027 kernel: ---[ end trace 739cf5c28cc1a83e ]--- Jan 26 03:23:15 HFX0027 kernel: ------------[ cut here ]------------ I don't have any clue how this can be fixed, but all the time, when this happens, there is a 4 seconds until the PHY Link is restablished in the NIC. I've checked the wires and the switch port for errors, even tried with another machine to make sure it was not bad and indeed it is not. It happens only when using CentOS 7 + (tg3) driver. Sincerely, Dalton J. | ||||||||||||
| Tags | No tags attached. | ||||||||||||
| abrt_hash | |||||||||||||
| URL | |||||||||||||
| Attached Files |
| ||||||||||||
Notes |
|
|
DaltonJ (reporter) 2015-04-24 15:42 |
I have updated to the newest kernel + firmware and tg3 driver, but the issue persists :( 3.10.0-229.1.2.el7.x86_64 driver: tg3 version: 3.137h firmware-version: 5720-v2.3.15 NCSI v1.2.46.0 bus-info: 0000:03:00.0 supports-statistics: yes supports-test: yes supports-eeprom-access: yes supports-register-dump: yes supports-priv-flags: no |
|
DaltonJ (reporter) 2015-04-24 15:42 |
Apr 24 17:40:45 HFX0027 kernel: ------------[ cut here ]------------ Apr 24 17:40:45 HFX0027 kernel: WARNING: at net/ipv4/tcp_output.c:1059 tcp_fragment+0x2e6/0x2f0() Apr 24 17:40:45 HFX0027 kernel: Modules linked in: binfmt_misc tg3(OF) ptp pps_core nls_utf8 isofs sr_mod cdrom usb_storage ipt_SYNPROXY nf_synproxy_core xt_CT xt_set xt_NFLOG nfnetlink_log xt_limit xt_conntrack ip_set_hash_ip iptable_raw iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat nf_conntrack iptable_filter ip_tables ip_set nfnetlink intel_powerclamp coretemp ipmi_si ipmi_msghandler hpwdt intel_rapl acpi_power_meter hpilo shpchp iTCO_wdt iTCO_vendor_support lpc_ich mfd_core serio_raw crct10dif_pclmul pcc_cpufreq pcspkr crc32_pclmul crc32c_intel ghash_clmulni_intel aesni_intel lrw gf128mul glue_helper ablk_helper cryptd tcp_htcp ext4 mbcache jbd2 sd_mod crc_t10dif crct10dif_common mgag200 syscopyarea sysfillrect sysimgblt i2c_algo_bit drm_kms_helper ttm drm ahci libahci libata i2c_core hpsa dm_mirror Apr 24 17:40:45 HFX0027 kernel: dm_region_hash dm_log dm_mod [last unloaded: kvm] Apr 24 17:40:45 HFX0027 kernel: CPU: 2 PID: 4820 Comm: haproxy1510 Tainted: GF W O-------------- 3.10.0-229.1.2.el7.x86_64 #1 Apr 24 17:40:45 HFX0027 kernel: Hardware name: HP ProLiant DL320e Gen8 v2, BIOS P80 03/28/2014 Apr 24 17:40:45 HFX0027 kernel: 0000000000000000 00000000bef4127e ffff88040b4438d8 ffffffff81604afa Apr 24 17:40:45 HFX0027 kernel: ffff88040b443910 ffffffff8106e34b ffff8803d3937a00 ffff8803c73a3480 Apr 24 17:40:45 HFX0027 kernel: 0000000000000001 0000000000000030 0000000094649cd3 ffff88040b443920 Apr 24 17:40:45 HFX0027 kernel: Call Trace: Apr 24 17:40:45 HFX0027 kernel: <IRQ> [<ffffffff81604afa>] dump_stack+0x19/0x1b Apr 24 17:40:45 HFX0027 kernel: [<ffffffff8106e34b>] warn_slowpath_common+0x6b/0xb0 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff8106e49a>] warn_slowpath_null+0x1a/0x20 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff81552d66>] tcp_fragment+0x2e6/0x2f0 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff815495ff>] tcp_mark_head_lost+0x1cf/0x270 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff8154a3ac>] tcp_update_scoreboard+0x8c/0x1d0 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff8154e8c3>] tcp_fastretrans_alert+0x683/0xca0 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff8154fa08>] tcp_ack+0xa28/0x1070 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff815504a8>] tcp_rcv_established+0x118/0x6c0 Apr 24 17:40:45 HFX0027 kernel: [<ffffffffa0325d61>] ? nf_ct_seq_adjust+0x1c1/0x3a0 [nf_conntrack] Apr 24 17:40:45 HFX0027 kernel: [<ffffffff8155a6d5>] tcp_v4_do_rcv+0x1b5/0x470 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff8155ca26>] tcp_v4_rcv+0x6e6/0x7d0 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff81536b00>] ? ip_rcv_finish+0x350/0x350 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff8152d7f4>] ? nf_hook_slow+0x84/0x140 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff81536b00>] ? ip_rcv_finish+0x350/0x350 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff81536bb4>] ip_local_deliver_finish+0xb4/0x1f0 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff81536e88>] ip_local_deliver+0x48/0x80 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff8153682d>] ip_rcv_finish+0x7d/0x350 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff815370f4>] ip_rcv+0x234/0x380 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff814fc506>] __netif_receive_skb_core+0x676/0x870 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff8101b589>] ? read_tsc+0x9/0x10 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff814fc718>] __netif_receive_skb+0x18/0x60 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff814fc7a0>] netif_receive_skb+0x40/0xd0 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff814fd2b0>] napi_gro_receive+0x80/0xb0 Apr 24 17:40:45 HFX0027 kernel: [<ffffffffa042779d>] tg3_poll_work+0x7ad/0xfe0 [tg3] Apr 24 17:40:45 HFX0027 kernel: [<ffffffffa042800b>] tg3_poll_msix+0x3b/0x1b0 [tg3] Apr 24 17:40:45 HFX0027 kernel: [<ffffffff814fcb62>] net_rx_action+0x152/0x240 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff81077bf7>] __do_softirq+0xf7/0x290 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff8161635c>] call_softirq+0x1c/0x30 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff81015de5>] do_softirq+0x55/0x90 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff81077f95>] irq_exit+0x115/0x120 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff81616ef8>] do_IRQ+0x58/0xf0 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff8160c0ed>] common_interrupt+0x6d/0x6d Apr 24 17:40:45 HFX0027 kernel: <EOI> [<ffffffff815450f6>] ? tcp_recvmsg+0x1a6/0xbb0 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff81544fe5>] ? tcp_recvmsg+0x95/0xbb0 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff815706cb>] inet_recvmsg+0x7b/0xa0 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff814e441f>] sock_recvmsg+0xbf/0x100 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff814e3d50>] ? sock_sendmsg+0xb0/0xf0 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff812ddb94>] ? timerqueue_del+0x24/0x70 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff8109acd6>] ? __remove_hrtimer+0x46/0xa0 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff814e4598>] SYSC_recvfrom+0xe8/0x160 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff810a9610>] ? wake_up_state+0x20/0x20 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff814e4d7e>] SyS_recvfrom+0xe/0x10 Apr 24 17:40:45 HFX0027 kernel: [<ffffffff81614a29>] system_call_fastpath+0x16/0x1b Apr 24 17:40:45 HFX0027 kernel: ---[ end trace c533013a178a635f ]--- |
|
wolfy (developer) 2015-04-25 07:32 |
Could you please refile this bug in bugzilla.redhat.com / RHEL 7/ kernel component? Once RH fixes it, CentOS will pick the fix as well. Bonus points for cross-linking the two bugs :) |
|
DaltonJ (reporter) 2015-04-25 07:34 |
Hi, wolfy. What do you mean with two bugs ? :o |
|
wolfy (developer) 2015-04-25 07:36 |
I meant bug reports, sorry |
|
DaltonJ (reporter) 2015-04-25 07:36 |
Ok, lets try, I'm since january on this :( |
|
dreger (reporter) 2015-10-26 12:17 |
Same bug here for me. Was this reported in RH yet? Has anyone found a fix? Oct 25 04:00:10 wdc05 kernel: ------------[ cut here ]------------ Oct 25 04:00:10 wdc05 kernel: WARNING: at net/ipv4/tcp_output.c:1059 tcp_fragment+0x2e6/0x2f0() Oct 25 04:00:11 wdc05 kernel: Modules linked in: binfmt_misc nf_conntrack_netbios_ns nf_conntrack_broadcast bonding ip6t_rpfilter ip6t_REJECT ipt_REJECT xt_conntrack ebtable_nat ebtable_broute bridge stp llc ebtable_filter ebtables ip6table_nat nf_conntrack_ipv6 nf_defrag_ipv6 nf_nat_ipv6 ip6table_mangle ip6table_security ip6table_raw ip6table_filter ip6_tables iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat nf_conntrack iptable_mangle iptable_security iptable_raw iptable_filter ip_tables intel_powerclamp coretemp kvm_intel kvm crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel iTCO_wdt iTCO_vendor_support aesni_intel lrw gf128mul ses glue_helper ablk_helper serio_raw enclosure cryptd pcspkr i2c_i801 mei_me lpc_ich mei mfd_core ioatdma shpchp tcp_htcp xfs libcrc32c sd_mod crc_t10dif crct10dif_common Oct 25 04:00:11 wdc05 kernel: syscopyarea ahci sysfillrect sysimgblt drm_kms_helper libahci ttm igb aacraid libata drm ptp pps_core dca i2c_algo_bit i2c_core Oct 25 04:00:11 wdc05 kernel: CPU: 2 PID: 0 Comm: swapper/2 Tainted: G W -------------- 3.10.0-229.7.2.el7.x86_64 #1 Oct 25 04:00:11 wdc05 kernel: Hardware name: Supermicro PIO-617R-TLN4F+-ST031/X9DRi-LN4+/X9DR3-LN4+, BIOS 3.0b 05/27/2014 Oct 25 04:00:11 wdc05 kernel: 0000000000000000 83946332548d507b ffff88085f0838d8 ffffffff81604386 Oct 25 04:00:11 wdc05 kernel: ffff88085f083910 ffffffff8106e28b ffff880594e43400 ffff8806e98f0000 Oct 25 04:00:11 wdc05 kernel: 0000000000000004 0000000000000100 00000000030c7963 ffff88085f083920 Oct 25 04:00:11 wdc05 kernel: Call Trace: Oct 25 04:00:11 wdc05 kernel: <IRQ> [<ffffffff81604386>] dump_stack+0x19/0x1b Oct 25 04:00:11 wdc05 kernel: [<ffffffff8106e28b>] warn_slowpath_common+0x6b/0xb0 Oct 25 04:00:11 wdc05 kernel: [<ffffffff8106e3da>] warn_slowpath_null+0x1a/0x20 Oct 25 04:00:12 wdc05 kernel: [<ffffffff815526e6>] tcp_fragment+0x2e6/0x2f0 Oct 25 04:00:12 wdc05 kernel: [<ffffffff81548f7f>] tcp_mark_head_lost+0x1cf/0x270 Oct 25 04:00:12 wdc05 kernel: [<ffffffff81549ce3>] tcp_update_scoreboard+0x43/0x1d0 Oct 25 04:00:12 wdc05 kernel: [<ffffffff8154e243>] tcp_fastretrans_alert+0x683/0xca0 Oct 25 04:00:12 wdc05 kernel: [<ffffffff8154f388>] tcp_ack+0xa28/0x1070 Oct 25 04:00:12 wdc05 kernel: [<ffffffff8154fed8>] tcp_rcv_established+0x1c8/0x6c0 Oct 25 04:00:12 wdc05 kernel: [<ffffffff8155a095>] tcp_v4_do_rcv+0x1b5/0x470 Oct 25 04:00:12 wdc05 kernel: [<ffffffff8155c3e6>] tcp_v4_rcv+0x6e6/0x7d0 Oct 25 04:00:12 wdc05 kernel: [<ffffffff81536470>] ? ip_rcv_finish+0x350/0x350 Oct 25 04:00:13 wdc05 kernel: [<ffffffff8152d164>] ? nf_hook_slow+0x84/0x140 Oct 25 04:00:13 wdc05 kernel: [<ffffffff81536470>] ? ip_rcv_finish+0x350/0x350 Oct 25 04:00:13 wdc05 kernel: [<ffffffff81536524>] ip_local_deliver_finish+0xb4/0x1f0 Oct 25 04:00:13 wdc05 kernel: [<ffffffff815367f8>] ip_local_deliver+0x48/0x80 Oct 25 04:00:13 wdc05 kernel: [<ffffffff8153619d>] ip_rcv_finish+0x7d/0x350 Oct 25 04:00:13 wdc05 kernel: [<ffffffff81536a64>] ip_rcv+0x234/0x380 Oct 25 04:00:13 wdc05 kernel: [<ffffffff814fbe86>] __netif_receive_skb_core+0x676/0x870 Oct 25 04:00:13 wdc05 kernel: [<ffffffff8101b4b9>] ? read_tsc+0x9/0x10 Oct 25 04:00:13 wdc05 kernel: [<ffffffff814fc098>] __netif_receive_skb+0x18/0x60 Oct 25 04:00:13 wdc05 kernel: [<ffffffff814fc120>] netif_receive_skb+0x40/0xd0 Oct 25 04:00:14 wdc05 kernel: [<ffffffff814fcc30>] napi_gro_receive+0x80/0xb0 Oct 25 04:00:14 wdc05 kernel: [<ffffffffa013a533>] igb_clean_rx_irq+0x3b3/0x8b0 [igb] Oct 25 04:00:14 wdc05 kernel: [<ffffffff812ed82d>] ? list_del+0xd/0x30 Oct 25 04:00:14 wdc05 kernel: [<ffffffffa013adb7>] igb_poll+0x387/0x770 [igb] Oct 25 04:00:14 wdc05 kernel: [<ffffffff8101b9f9>] ? sched_clock+0x9/0x10 Oct 25 04:00:14 wdc05 kernel: [<ffffffff810abe95>] ? sched_clock_cpu+0xb5/0x100 Oct 25 04:00:14 wdc05 kernel: [<ffffffff810b6a1d>] ? trigger_load_balance+0x5d/0x200 Oct 25 04:00:14 wdc05 kernel: [<ffffffff814fc4e2>] net_rx_action+0x152/0x240 Oct 25 04:00:14 wdc05 kernel: [<ffffffff81077b2f>] __do_softirq+0xef/0x280 Oct 25 04:00:15 wdc05 kernel: [<ffffffff81615b1c>] call_softirq+0x1c/0x30 Oct 25 04:00:15 wdc05 kernel: [<ffffffff81015d95>] do_softirq+0x65/0xa0 Oct 25 04:00:15 wdc05 kernel: [<ffffffff81077ec5>] irq_exit+0x115/0x120 Oct 25 04:00:15 wdc05 kernel: [<ffffffff816166b8>] do_IRQ+0x58/0xf0 Oct 25 04:00:15 wdc05 kernel: [<ffffffff8160b96d>] common_interrupt+0x6d/0x6d Oct 25 04:00:15 wdc05 kernel: <EOI> [<ffffffff814aa002>] ? cpuidle_enter_state+0x52/0xc0 Oct 25 04:00:15 wdc05 kernel: [<ffffffff814a9ff8>] ? cpuidle_enter_state+0x48/0xc0 Oct 25 04:00:15 wdc05 kernel: [<ffffffff814aa135>] cpuidle_idle_call+0xc5/0x200 Oct 25 04:00:15 wdc05 kernel: [<ffffffff8101d14e>] arch_cpu_idle+0xe/0x30 Oct 25 04:00:16 wdc05 kernel: [<ffffffff810c6801>] cpu_startup_entry+0xf1/0x290 Oct 25 04:00:16 wdc05 kernel: [<ffffffff8104228a>] start_secondary+0x1ba/0x230 Oct 25 04:00:16 wdc05 kernel: ---[ end trace 60db6c6329879860 ]--- |
|
Omar (reporter) 2016-01-17 04:46 |
Hi there, I am also getting the same problem with tg3 driver, system crashes and reboots. Is there any way to remedy this problem until a fix is issued? like can we downgrade our kernel to a specific version? Here is the crash dump [76980.302163] Hardware name: Dell Inc. PowerEdge R320/0KM5PX, BIOS 2.4.2 01/29/2015 [76980.302235] task: ffff8817ba57dc00 ti: ffff8817ba5bc000 task.ti: ffff8817ba5bc000 [76980.302295] RIP: 0010:[<ffffffff81176e85>] [<ffffffff81176e85>] put_page+0x5/0x40 [76980.302361] RSP: 0018:ffff8817df043958 EFLAGS: 00010206 [76980.302385] RAX: 0000000000000030 RBX: ffff8814d306b0c0 RCX: 00000000d306b000 [76980.302416] RDX: 0000000000000060 RSI: 0000000000000000 RDI: 0000000100000fd8 [76980.302448] RBP: ffff8817df043978 R08: 0000000000000246 R09: ffff8816830bdd72 [76980.302490] R10: ffffffff81517ca7 R11: ffffea005d884d00 R12: 0000000000000001 [76980.302518] R13: ffff88169bdcbb00 R14: ffff881762135700 R15: ffff8814d306a410 [76980.302562] FS: 0000000000000000(0000) GS:ffff8817df040000(0000) knlGS:0000000000000000 [76980.302610] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [76980.302636] CR2: 0000000100000fd8 CR3: 000000000194a000 CR4: 00000000001407e0 [76980.302665] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 [76980.302694] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 [76980.302722] Stack: [76980.302731] ffffffff8151a148 ffff88169bdcbb00 ffff88169bdcbb00 0000000000000000 [76980.302766] ffff8817df043990 ffffffff8151a1f4 0000000000003800 ffff8817df0439b0 [76980.302826] ffffffff8151a43c 0000000000003800 ffff88169bdcbb00 ffff8817df043a30 [76980.302894] Call Trace: [76980.302915] <IRQ> [76980.302934] [76980.302963] [<ffffffff8151a148>] ? skb_release_data+0x88/0x110 [76980.302986] [<ffffffff8151a1f4>] skb_release_all+0x24/0x30 [76980.303012] [<ffffffff8151a43c>] consume_skb+0x2c/0x80 [76980.303037] [<ffffffff8156a1c8>] ip_fragment+0x1e8/0x8b0 [76980.303062] [<ffffffff81569290>] ? ip_reply_glue_bits+0x50/0x50 [76980.303091] [<ffffffff8156abac>] ip_finish_output+0x31c/0x7d0 [76980.303120] [<ffffffff8156bd7f>] ip_output+0x6f/0xe0 [76980.303145] [<ffffffff8156a890>] ? ip_fragment+0x8b0/0x8b0 [76980.303171] [<ffffffff815677b6>] ip_forward_finish+0x66/0x80 [76980.303198] [<ffffffff81567b47>] ip_forward+0x377/0x490 [76980.303222] [<ffffffff81567750>] ? ip_frag_mem+0x40/0x40 [76980.304434] [<ffffffff8156587d>] ip_rcv_finish+0x7d/0x350 [76980.305676] [<ffffffff81566216>] ip_rcv+0x2b6/0x410 [76980.306894] [<ffffffff81565800>] ? inet_del_offload+0x40/0x40 [76980.308071] [<ffffffff8152a832>] __netif_receive_skb_core+0x582/0x7d0 [76980.309012] [<ffffffff8101c829>] ? read_tsc+0x9/0x10 [76980.309800] [<ffffffff8152aa98>] __netif_receive_skb+0x18/0x60 [76980.310585] [<ffffffff8152ab20>] netif_receive_skb+0x40/0xc0 [76980.311374] [<ffffffff8152ac3d>] napi_gro_complete+0x7d/0x100 [76980.312142] [<ffffffff8152b293>] dev_gro_receive+0x253/0x350 [76980.312886] [<ffffffff8152b635>] napi_gro_receive+0x25/0xb0 [76980.313613] [<ffffffffa00a868b>] tg3_poll_work+0x95b/0xf80 [tg3] [76980.314499] [<ffffffffa00a8cf1>] tg3_poll_msix+0x41/0x150 [tg3] [76980.315529] [<ffffffff810aa209>] ? hrtimer_get_next_event+0x49/0x70 [76980.316485] [<ffffffff8152af52>] net_rx_action+0x152/0x240 [76980.317369] [<ffffffff81084b0f>] __do_softirq+0xef/0x280 [76980.318116] [<ffffffff816471dc>] call_softirq+0x1c/0x30 [76980.318804] [<ffffffff81016fc5>] do_softirq+0x65/0xa0 [76980.319477] [<ffffffff81084ea5>] irq_exit+0x115/0x120 [76980.320209] [<ffffffff81647d78>] do_IRQ+0x58/0xf0 [76980.320864] [<ffffffff8163d06d>] common_interrupt+0x6d/0x6d [76980.321420] <EOI> [76980.321428] [76980.321975] [<ffffffff8108e6fc>] ? get_next_timer_interrupt+0xec/0x270 [76980.322535] [<ffffffff814d4502>] ? cpuidle_enter_state+0x52/0xc0 [76980.323096] [<ffffffff814d4649>] cpuidle_idle_call+0xd9/0x210 [76980.323650] [<ffffffff8101e4be>] arch_cpu_idle+0xe/0x30 [76980.324203] [<ffffffff810d6305>] cpu_startup_entry+0x245/0x290 [76980.324746] [<ffffffff810475fa>] start_secondary+0x1ba/0x230 [76980.325278] Code: f3 90 49 8b 04 24 a9 00 00 00 01 75 f3 eb 8f 89 c2 e9 6c ff ff ff e8 6b fe ff ff e9 fa fe ff ff 66 0f 1f 44 00 00 0f 1f 44 00 00 <48> f7 07 00 c0 00 00 55 48 89 e5 75 15 f0 ff 4f 1c 0f 94 c0 84 [76980.326473] RIP [<ffffffff81176e85>] put_page+0x5/0x40 [76980.327058] RSP <ffff8817df043958> [76980.327684] CR2: 0000000100000fd8 |
|
vhf (reporter) 2016-10-11 09:00 |
Hi there, Get same crash with the igb driver. After uninstall pptpd from system - problem is gone. [1802043.415428] BUG: unable to handle kernel paging request at 0000000100000500 [1802043.418386] IP: [<ffffffff81176e95>] put_page+0x5/0x40 [1802043.421337] PGD 100f6dd067 PUD 0 [1802043.424275] Oops: 0000 [#1] SMP [1802043.427318] Modules linked in: iptable_filter iptable_nat nf_nat_ipv4 ip_tables ipt_REJECT xt_conntrack ipt_MASQUERADE nf_nat_masquerade_ipv4 nf_conntrack_ipv4 nf_ defrag_ipv4 nf_nat_pptp nf_nat_proto_gre nf_conntrack_pptp nf_conntrack_proto_gre nf_nat nf_conntrack ip_gre ip_tunnel gre arc4 ppp_mppe ppp_async crc_ccitt ppp_generic slhc binfmt_misc kvm_amd kvm crc32_pclmul ghash_clmulni_intel aesni_intel lrw gf128mul glue_helper ablk_helper cryptd pcspkr sg fam15h_power k10temp amd64_edac_mod eda c_mce_amd edac_core sp5100_tco shpchp i2c_piix4 acpi_cpufreq xfs libcrc32c sr_mod cdrom ata_generic pata_acpi sd_mod crc_t10dif crct10dif_generic mgag200 syscopyarea sy sfillrect sysimgblt drm_kms_helper ttm igb drm ahci crct10dif_pclmul crct10dif_common ptp crc32c_intel megaraid_sas libahci pata_atiixp pps_core [1802043.447645] serio_raw dca libata i2c_algo_bit i2c_core [last unloaded: ip_tables] [1802043.451220] CPU: 0 PID: 0 Comm: swapper/0 Not tainted 3.10.0-327.el7.x86_64 #1 [1802043.454870] Hardware name: Supermicro H8DG6/H8DGi/H8DG6/H8DGi, BIOS 3.0 09/10/2012 [1802043.458566] task: ffffffff81951440 ti: ffffffff8193c000 task.ti: ffffffff8193c000 [1802043.462296] RIP: 0010:[<ffffffff81176e95>] [<ffffffff81176e95>] put_page+0x5/0x40 [1802043.466078] RSP: 0018:ffff880417c03960 EFLAGS: 00010216 [1802043.469859] RAX: 0000000000000030 RBX: ffff880c7288e2c0 RCX: 000000007288e200 [1802043.473689] RDX: 0000000000000060 RSI: 0000000000000000 RDI: 0000000100000500 [1802043.477645] RBP: ffff880417c03980 R08: 0000000000000246 R09: ffff880468fdcd78 [1802043.481536] R10: ffffffff81517cf7 R11: ffffea0010449280 R12: 0000000000000001 [1802043.485494] R13: ffff88025b7ae700 R14: ffff88041124b000 R15: ffff880c7288ec10 [1802043.489397] FS: 00007fd57bde9840(0000) GS:ffff880417c00000(0000) knlGS:0000000000000000 [1802043.493348] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b [1802043.497496] CR2: 0000000100000500 CR3: 000000100f6dc000 CR4: 00000000000407f0 [1802043.501509] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 [1802043.505500] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 [1802043.509496] Stack: [1802043.513386] ffffffff8151a198 ffff88025b7ae700 ffff88025b7ae700 0000000000000000 [1802043.517599] ffff880417c03998 ffffffff8151a244 0000000000006800 ffff880417c039b8 [1802043.521682] ffffffff8151a48c 0000000000006800 ffff88025b7ae700 ffff880417c03a38 [1802043.525629] Call Trace: [1802043.529546] <IRQ> [1802043.529587] [1802043.533453] [<ffffffff8151a198>] ? skb_release_data+0x88/0x110 [1802043.537400] [<ffffffff8151a244>] skb_release_all+0x24/0x30 [1802043.541076] [<ffffffff8151a48c>] consume_skb+0x2c/0x80 [1802043.544798] [<ffffffff8156a218>] ip_fragment+0x1e8/0x8b0 [1802043.548373] [<ffffffff815692e0>] ? ip_reply_glue_bits+0x50/0x50 [1802043.552089] [<ffffffff8156abfc>] ip_finish_output+0x31c/0x7d0 [1802043.555558] [<ffffffff8156bdcf>] ip_output+0x6f/0xe0 [1802043.558943] [<ffffffff8156a8e0>] ? ip_fragment+0x8b0/0x8b0 [1802043.562272] [<ffffffff81567806>] ip_forward_finish+0x66/0x80 [1802043.565653] [<ffffffff81567b97>] ip_forward+0x377/0x490 [1802043.568843] [<ffffffff815677a0>] ? ip_frag_mem+0x40/0x40 [1802043.572043] [<ffffffff815658cd>] ip_rcv_finish+0x7d/0x350 [1802043.575197] [<ffffffff81566266>] ip_rcv+0x2b6/0x410 [1802043.578304] [<ffffffff81565850>] ? inet_del_offload+0x40/0x40 [1802043.581451] [<ffffffff8152a882>] __netif_receive_skb_core+0x582/0x7d0 [1802043.584632] [<ffffffff8101c829>] ? read_tsc+0x9/0x10 [1802043.587767] [<ffffffff8152aae8>] __netif_receive_skb+0x18/0x60 [1802043.590951] [<ffffffff8152ab70>] netif_receive_skb+0x40/0xc0 [1802043.594037] [<ffffffff8152ac8d>] napi_gro_complete+0x7d/0x100 [1802043.597120] [<ffffffff8152b2e3>] dev_gro_receive+0x253/0x350 [1802043.600133] [<ffffffff8152b685>] napi_gro_receive+0x25/0xb0 [1802043.603193] [<ffffffffa00dbc84>] igb_clean_rx_irq+0x384/0x7a0 [igb] [1802043.606164] [<ffffffffa00dc41f>] igb_poll+0x37f/0x770 [igb] [1802043.608934] [<ffffffff810b5d85>] ? check_preempt_curr+0x75/0xa0 [1802043.611605] [<ffffffff8152afa2>] net_rx_action+0x152/0x240 [1802043.614193] [<ffffffff81084b0f>] __do_softirq+0xef/0x280 [1802043.616691] [<ffffffff8164721c>] call_softirq+0x1c/0x30 [1802043.619094] [<ffffffff81016fc5>] do_softirq+0x65/0xa0 [1802043.621444] [<ffffffff81084ea5>] irq_exit+0x115/0x120 [1802043.623740] [<ffffffff81647db8>] do_IRQ+0x58/0xf0 [1802043.625993] [<ffffffff8163d0ed>] common_interrupt+0x6d/0x6d [1802043.628217] <EOI> [1802043.628253] [1802043.630423] [<ffffffff814d454f>] ? cpuidle_enter_state+0x4f/0xc0 [1802043.632648] [<ffffffff814d4548>] ? cpuidle_enter_state+0x48/0xc0 [1802043.635009] [<ffffffff814d4699>] cpuidle_idle_call+0xd9/0x210 [1802043.637268] [<ffffffff8101e4be>] arch_cpu_idle+0xe/0x30 [1802043.639438] [<ffffffff810d6305>] cpu_startup_entry+0x245/0x290 [1802043.641684] [<ffffffff81624e07>] rest_init+0x77/0x80 [1802043.643796] [<ffffffff81a8d057>] start_kernel+0x429/0x44a [1802043.645866] [<ffffffff81a8ca37>] ? repair_env_string+0x5c/0x5c [1802043.648042] [<ffffffff81a8c120>] ? early_idt_handlers+0x120/0x120 [1802043.650177] [<ffffffff81a8c5ee>] x86_64_start_reservations+0x2a/0x2c [1802043.652384] [<ffffffff81a8c742>] x86_64_start_kernel+0x152/0x175 [1802043.654548] Code: f3 90 49 8b 04 24 a9 00 00 00 01 75 f3 eb 8f 89 c2 e9 6c ff ff ff e8 6b fe ff ff e9 fa fe ff ff 66 0f 1f 44 00 00 66 66 66 66 90 <48> f7 07 00 c0 00 00 55 48 89 e5 75 15 f0 ff 4f 1c 0f 94 c0 84 [1802043.659099] RIP [<ffffffff81176e95>] put_page+0x5/0x40 [1802043.661259] RSP <ffff880417c03960> [1802043.663415] CR2: 0000000100000500 |
Issue History |
|||
| Date Modified | Username | Field | Change |
|---|---|---|---|
| 2015-01-26 13:38 | DaltonJ | New Issue | |
| 2015-04-24 15:42 | DaltonJ | Note Added: 0022907 | |
| 2015-04-24 15:42 | DaltonJ | Note Added: 0022908 | |
| 2015-04-25 07:32 | wolfy | Note Added: 0022914 | |
| 2015-04-25 07:34 | DaltonJ | Note Added: 0022915 | |
| 2015-04-25 07:36 | wolfy | Note Added: 0022916 | |
| 2015-04-25 07:36 | DaltonJ | Note Added: 0022917 | |
| 2015-04-25 07:39 | wolfy | Category | tcp_wrappers => kernel |
| 2015-10-26 12:17 | dreger | Note Added: 0024695 | |
| 2016-01-17 04:46 | Omar | Note Added: 0025383 | |
| 2016-10-11 09:00 | vhf | Note Added: 0027691 | |


