From: kernel test robot <oliver.sang@intel.com>
To: Yunsheng Lin <linyunsheng@huawei.com>
Cc: <oe-lkp@lists.linux.dev>, <lkp@intel.com>, <oliver.sang@intel.com>
Subject: [linyunsheng:page_frag_cache_prototype_v3] [mm] 9db64d371d: WARNING:at_net/core/skbuff.c:#skb_splice_from_iter
Date: Thu, 25 Apr 2024 16:33:18 +0800
Message-ID: <202404251628.454c0dc9-oliver.sang@intel.com>
Hello,
kernel test robot noticed "WARNING:at_net/core/skbuff.c:#skb_splice_from_iter" on:
commit: 9db64d371dbb2fa5a4897fb905dd3a25d31f15f9 ("mm: page_frag: reuse existing space for pagecnt_bias and pfmemalloc")
https://github.com/gestionlin/linux.git page_frag_cache_prototype_v3
in testcase: filebench
version: filebench-x86_64-22620e6-1_20240224
with the following parameters:

	disk: 1HDD
	fs: xfs
	fs2: nfsv4
	test: singlestreamreaddirect.f
	cpufreq_governor: performance

compiler: gcc-13
test machine: 128 threads 2 sockets Intel(R) Xeon(R) Platinum 8358 CPU @ 2.60GHz (Ice Lake) with 128G memory
(please refer to attached dmesg/kmsg for entire log/backtrace)
If you fix the issue in a separate patch/commit (i.e. not just a new version of
the same patch/commit), kindly add the following tags
| Reported-by: kernel test robot <oliver.sang@intel.com>
| Closes: https://lore.kernel.org/oe-lkp/202404251628.454c0dc9-oliver.sang@intel.com
[ 91.421456][ T2564] ------------[ cut here ]------------
[ 91.427045][ T2564] WARNING: CPU: 103 PID: 2564 at net/core/skbuff.c:7104 skb_splice_from_iter (net/core/skbuff.c:7104 (discriminator 1))
[ 91.436926][ T2564] Modules linked in: kmem rpcsec_gss_krb5 nfsv4 dns_resolver nfsd auth_rpcgss xfs device_dax nd_pmem nd_btt dax_pmem btrfs blake2b_generic xor raid6_pq libcrc32c sd_mod t10_pi crc64_rocksoft_generic crc64_rocksoft crc64 sg intel_rapl_msr intel_rapl_common x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel sha512_ssse3 rapl ahci ast libahci ipmi_ssif intel_cstate drm_shmem_helper acpi_ipmi mei_me ioatdma i2c_i801 intel_uncore ipmi_si dax_hmem libata drm_kms_helper mei intel_pch_thermal dca i2c_smbus wmi ipmi_devintf nfit ipmi_msghandler libnvdimm acpi_pad acpi_power_meter joydev binfmt_misc drm fuse loop dm_mod ip_tables
[ 91.500845][ T2564] CPU: 103 PID: 2564 Comm: nfsd Tainted: G S 6.9.0-rc4-00224-g9db64d371dbb #1
[ 91.511138][ T2564] Hardware name: Intel Corporation M50CYP2SB1U/M50CYP2SB1U, BIOS SE5C620.86B.01.01.0003.2104260124 04/26/2021
[ 91.523110][ T2564] RIP: 0010:skb_splice_from_iter (net/core/skbuff.c:7104 (discriminator 1))
[ 91.529165][ T2564] Code: 8b 75 00 48 29 c1 4c 39 f9 49 8b 56 08 49 0f 47 cf 49 89 cc f6 c2 01 0f 85 98 01 00 00 66 90 4c 89 f2 48 8b 12 80 e6 08 74 79 <0f> 0b 49 c7 c7 fb ff ff ff 48 8b 44 24 10 41 01 45 70 41 01 45 74
All code
========
0: 8b 75 00 mov 0x0(%rbp),%esi
3: 48 29 c1 sub %rax,%rcx
6: 4c 39 f9 cmp %r15,%rcx
9: 49 8b 56 08 mov 0x8(%r14),%rdx
d: 49 0f 47 cf cmova %r15,%rcx
11: 49 89 cc mov %rcx,%r12
14: f6 c2 01 test $0x1,%dl
17: 0f 85 98 01 00 00 jne 0x1b5
1d: 66 90 xchg %ax,%ax
1f: 4c 89 f2 mov %r14,%rdx
22: 48 8b 12 mov (%rdx),%rdx
25: 80 e6 08 and $0x8,%dh
28: 74 79 je 0xa3
2a:* 0f 0b ud2 <-- trapping instruction
2c: 49 c7 c7 fb ff ff ff mov $0xfffffffffffffffb,%r15
33: 48 8b 44 24 10 mov 0x10(%rsp),%rax
38: 41 01 45 70 add %eax,0x70(%r13)
3c: 41 01 45 74 add %eax,0x74(%r13)
Code starting with the faulting instruction
===========================================
0: 0f 0b ud2
2: 49 c7 c7 fb ff ff ff mov $0xfffffffffffffffb,%r15
9: 48 8b 44 24 10 mov 0x10(%rsp),%rax
e: 41 01 45 70 add %eax,0x70(%r13)
12: 41 01 45 74 add %eax,0x74(%r13)
[ 91.549120][ T2564] RSP: 0018:ffa000000b7a7b58 EFLAGS: 00010202
[ 91.555257][ T2564] RAX: 0000000000000000 RBX: ff11002071e63e00 RCX: 0000000000000004
[ 91.563312][ T2564] RDX: 0057ffffe0000840 RSI: 0000000000000001 RDI: ff1100012aba5c48
[ 91.571362][ T2564] RBP: ffa000000b7a7b88 R08: ff1100012aba6c68 R09: 0000000000000078
[ 91.579413][ T2564] R10: 0000000000000000 R11: ffd4000008c2ffc0 R12: 0000000000000004
[ 91.587464][ T2564] R13: ff11002071e63e00 R14: ffd4000081c66a00 R15: 0000000000000004
[ 91.595515][ T2564] FS: 0000000000000000(0000) GS:ff110020017c0000(0000) knlGS:0000000000000000
[ 91.604526][ T2564] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 91.611186][ T2564] CR2: 00007ffff7b453d8 CR3: 000000207de1c001 CR4: 0000000000771ef0
[ 91.619236][ T2564] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 91.627288][ T2564] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[ 91.635338][ T2564] PKRU: 55555554
[ 91.638966][ T2564] Call Trace:
[ 91.642329][ T2564] <TASK>
[ 91.645343][ T2564] ? __warn (kernel/panic.c:694)
[ 91.649493][ T2564] ? skb_splice_from_iter (net/core/skbuff.c:7104 (discriminator 1))
[ 91.654935][ T2564] ? report_bug (lib/bug.c:180 lib/bug.c:219)
[ 91.659508][ T2564] ? handle_bug (arch/x86/kernel/traps.c:239)
[ 91.663910][ T2564] ? exc_invalid_op (arch/x86/kernel/traps.c:260 (discriminator 1))
[ 91.668665][ T2564] ? asm_exc_invalid_op (arch/x86/include/asm/idtentry.h:621)
[ 91.673780][ T2564] ? skb_splice_from_iter (net/core/skbuff.c:7104 (discriminator 1))
[ 91.679223][ T2564] ? skb_splice_from_iter (net/core/skbuff.c:7092)
[ 91.684577][ T2564] tcp_sendmsg_locked (net/ipv4/tcp.c:1251 (discriminator 1))
[ 91.689684][ T2564] ? nfsd_splice_read (fs/nfsd/vfs.c:1065) nfsd
[ 91.695345][ T2564] ? __cond_resched (kernel/sched/core.c:8608)
[ 91.700080][ T2564] tcp_sendmsg (net/ipv4/tcp.c:1343)
[ 91.704374][ T2564] sock_sendmsg (net/socket.c:733 (discriminator 1) net/socket.c:745 (discriminator 1) net/socket.c:768 (discriminator 1))
[ 91.708846][ T2564] svc_tcp_sendmsg (net/sunrpc/svcsock.c:1238)
[ 91.713671][ T2564] svc_tcp_sendto (net/sunrpc/svcsock.c:1270)
[ 91.718305][ T2564] svc_send (net/sunrpc/svc_xprt.c:921)
[ 91.722331][ T2564] svc_handle_xprt (net/sunrpc/svc_xprt.c:815)
[ 91.727142][ T2564] svc_recv (include/linux/sunrpc/bc_xprt.h:40 net/sunrpc/svc_xprt.c:876)
[ 91.731345][ T2564] ? __pfx_nfsd (fs/nfsd/nfssvc.c:912) nfsd
[ 91.736358][ T2564] nfsd (fs/nfsd/nfssvc.c:941) nfsd
[ 91.740682][ T2564] kthread (kernel/kthread.c:388)
[ 91.744617][ T2564] ? __pfx_kthread (kernel/kthread.c:341)
[ 91.749243][ T2564] ret_from_fork (arch/x86/kernel/process.c:147)
[ 91.753704][ T2564] ? __pfx_kthread (kernel/kthread.c:341)
[ 91.758324][ T2564] ret_from_fork_asm (arch/x86/entry/entry_64.S:257)
[ 91.763112][ T2564] </TASK>
[ 91.766156][ T2564] ---[ end trace 0000000000000000 ]---
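The decoded instructions and register dump above can be cross-checked with a little arithmetic. The following is only a sketch for reading the trace, not part of the robot's output: it takes the RDX value, the `and $0x8,%dh` mask, and the immediate loaded into %r15 straight from the log, and works out which bit the check tested and what value follows the `ud2`. Because the `and` had already executed when the trap fired, RDX holds the masked result in bits 8..15, so only the tested bit can be recovered from that byte. Which page flag that bit corresponds to depends on the tested tree's include/linux/page-flags.h and is deliberately left unnamed here; the variable and function names are illustrative.

```python
# Values copied from the report above; no page-flag names are assumed.
RDX_AT_TRAP = 0x0057FFFFE0000840      # RDX in the register dump
R15_IMMEDIATE = 0xFFFFFFFFFFFFFFFB    # operand of `mov $0xfffffffffffffffb,%r15`
DH_MASK = 0x08                        # operand of `and $0x8,%dh`


def dh(reg: int) -> int:
    """Return %dh, i.e. bits 8..15 of the 64-bit register value."""
    return (reg >> 8) & 0xFF


def to_signed64(value: int) -> int:
    """Reinterpret an unsigned 64-bit value as two's-complement signed."""
    return value - (1 << 64) if value & (1 << 63) else value


# `and` already ran before the trap, so %dh holds (original %dh & 0x08).
tested_bit = 8 + DH_MASK.bit_length() - 1   # bit position within the full word
branch_taken = dh(RDX_AT_TRAP) == 0         # `je` is taken only on a zero result
error_code = to_signed64(R15_IMMEDIATE)

print(f"tested bit: {tested_bit}")
print(f"je taken: {branch_taken}")
print(f"value loaded into r15: {error_code}")
```

Under these inputs the tested bit is bit 11 of the loaded word, it was set, and so the `je` that would have skipped the `ud2` was not taken. The value loaded into %r15 afterwards is -5, which is -EIO on Linux and matches the "got error -5" messages that nfsd logs after the warning.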
[ 91.771669][ T2564] rpc-srv/tcp: nfsd: got error -5 when sending 1048696 bytes - shutting down socket
[ 94.152299][ T2559] rpc-srv/tcp: nfsd: got error -5 when sending 1048696 bytes - shutting down socket
[ 96.522359][ T2559] rpc-srv/tcp: nfsd: got error -5 when sending 1048696 bytes - shutting down socket
[ 97.757006][ T1468] Events enabled
[ 97.757010][ T1468]
[ 97.781707][ C0] perf: interrupt took too long (2585 > 2500), lowering kernel.perf_event_max_sample_rate to 77000
[ 97.814727][ C0] perf: interrupt took too long (3346 > 3231), lowering kernel.perf_event_max_sample_rate to 59000
[ 97.871706][ C0] perf: interrupt took too long (4206 > 4182), lowering kernel.perf_event_max_sample_rate to 47000
[ 97.948725][ C0] perf: interrupt took too long (5292 > 5257), lowering kernel.perf_event_max_sample_rate to 37000
[ 98.034694][ C0] perf: interrupt took too long (6671 > 6615), lowering kernel.perf_event_max_sample_rate to 29000
[ 98.125726][ C0] perf: interrupt took too long (8412 > 8338), lowering kernel.perf_event_max_sample_rate to 23000
[ 98.392731][ C0] perf: interrupt took too long (10525 > 10515), lowering kernel.perf_event_max_sample_rate to 19000
[ 99.042094][ T2564] rpc-srv/tcp: nfsd: got error -5 when sending 1048696 bytes - shutting down socket
[ 100.765375][ T1468] [ perf record: Woken up 59 times to write data ]
[ 100.765378][ T1468]
[ 100.908563][ T1468] Warning:
[ 100.908567][ T1468]
[ 100.915314][ T1468] Processed 192218 events and lost 2 chunks!
[ 100.915316][ T1468]
[ 100.923772][ T1468]
[ 100.923773][ T1468]
[ 100.928828][ T1468] Check IO/CPU overload!
[ 100.928829][ T1468]
[ 100.935484][ T1468]
[ 100.935485][ T1468]
[ 100.941436][ T1468] [ perf record: Captured and wrote 29.751 MB /tmp/lkp/perf_c2c.data (73428 samples) ]
[ 100.941437][ T1468]
[ 101.606690][ T2564] rpc-srv/tcp: nfsd: got error -5 when sending 1048696 bytes - shutting down socket
[ 103.999638][ T2559] rpc-srv/tcp: nfsd: got error -5 when sending 1048696 bytes - shutting down socket
[ 106.397587][ T2564] rpc-srv/tcp: nfsd: got error -5 when sending 1048696 bytes - shutting down socket
[ 108.730699][ T2564] rpc-srv/tcp: nfsd: got error -5 when sending 1048696 bytes - shutting down socket
[ 111.147193][ T2564] rpc-srv/tcp: nfsd: got error -5 when sending 1048696 bytes - shutting down socket
[ 111.578643][ T2564] stack segment: 0000 [#1] SMP NOPTI
[ 111.584040][ T2564] CPU: 38 PID: 2564 Comm: nfsd Tainted: G S W 6.9.0-rc4-00224-g9db64d371dbb #1
[ 111.594238][ T2564] Hardware name: Intel Corporation M50CYP2SB1U/M50CYP2SB1U, BIOS SE5C620.86B.01.01.0003.2104260124 04/26/2021
[ 111.606075][ T2564] RIP: 0010:find_openstateowner_str_locked (fs/nfsd/nfs4state.c:551) nfsd
[ 111.613452][ T2564] Code: cc cc cc cc 0f 1f 00 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 0f 1f 44 00 00 41 54 55 89 fd 53 48 c1 e5 04 48 03 6a 28 <48> 8b 5d 00 48 39 dd 74 27 4c 8d a6 b0 00 00 00 80 bb 70 01 00 00
All code
========
0: cc int3
1: cc int3
2: cc int3
3: cc int3
4: 0f 1f 00 nopl (%rax)
7: 90 nop
8: 90 nop
9: 90 nop
a: 90 nop
b: 90 nop
c: 90 nop
d: 90 nop
e: 90 nop
f: 90 nop
10: 90 nop
11: 90 nop
12: 90 nop
13: 90 nop
14: 90 nop
15: 90 nop
16: 90 nop
17: 0f 1f 44 00 00 nopl 0x0(%rax,%rax,1)
1c: 41 54 push %r12
1e: 55 push %rbp
1f: 89 fd mov %edi,%ebp
21: 53 push %rbx
22: 48 c1 e5 04 shl $0x4,%rbp
26: 48 03 6a 28 add 0x28(%rdx),%rbp
2a:* 48 8b 5d 00 mov 0x0(%rbp),%rbx <-- trapping instruction
2e: 48 39 dd cmp %rbx,%rbp
31: 74 27 je 0x5a
33: 4c 8d a6 b0 00 00 00 lea 0xb0(%rsi),%r12
3a: 80 .byte 0x80
3b: bb 70 01 00 00 mov $0x170,%ebx
Code starting with the faulting instruction
===========================================
0: 48 8b 5d 00 mov 0x0(%rbp),%rbx
4: 48 39 dd cmp %rbx,%rbp
7: 74 27 je 0x30
9: 4c 8d a6 b0 00 00 00 lea 0xb0(%rsi),%r12
10: 80 .byte 0x80
11: bb 70 01 00 00 mov $0x170,%ebx
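The second fault has no register dump in this excerpt, so the faulting address itself is not recoverable here. The instructions before the trap do show how it was formed: a 32-bit index is shifted left by 4 and added to a pointer loaded from offset 0x28 of the object in %rdx, which is consistent with indexing a table of 16-byte entries. On x86-64, a non-canonical address reached through an %rbp-based access is reported as a stack-segment fault rather than a general-protection fault, so one plausible reading (an assumption, not something the log proves) is that the table pointer was invalid. The sketch below mirrors that address arithmetic and a canonical-address check; the helper names and the poison-like base value are hypothetical, and the 57-bit default is inferred from the `ff11...` and `ffd4...` pointers seen earlier in the log.

```python
MASK64 = (1 << 64) - 1


def is_canonical(addr: int, va_bits: int = 57) -> bool:
    """True if bits 63..(va_bits - 1) of addr are all zeros or all ones."""
    top = (addr & MASK64) >> (va_bits - 1)
    all_ones = (1 << (64 - va_bits + 1)) - 1
    return top == 0 or top == all_ones


def bucket_address(table_base: int, index: int) -> int:
    """Mirror of: mov %edi,%ebp; shl $0x4,%rbp; add 0x28(%rdx),%rbp."""
    return (((index & 0xFFFFFFFF) << 4) + table_base) & MASK64


# A pointer that does appear in the report (RBX in the first register dump).
seen_in_log = 0xFF11002071E63E00
print(is_canonical(seen_in_log))          # canonical with 57-bit addresses
print(is_canonical(seen_in_log, 48))      # but not with 48-bit addresses

# Hypothetical bad table pointer, chosen only to illustrate the check.
bad_base = 0x6B6B6B6B6B6B6B6B
addr = bucket_address(bad_base, 3)
print(hex(addr), is_canonical(addr))
```

Comparing the real faulting RBP from the attached dmesg against `is_canonical` would be the way to confirm or rule out this reading.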
The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20240425/202404251628.454c0dc9-oliver.sang@intel.com
--
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki