oe-lkp.lists.linux.dev archive mirror
 help / color / mirror / Atom feed
From: kernel test robot <oliver.sang@intel.com>
To: Yunsheng Lin <linyunsheng@huawei.com>
Cc: <oe-lkp@lists.linux.dev>, <lkp@intel.com>, <oliver.sang@intel.com>
Subject: [linyunsheng:page_frag_cache_prototype_v3] [mm] 9db64d371d: WARNING:at_net/core/skbuff.c:#skb_splice_from_iter
Date: Thu, 25 Apr 2024 16:33:18 +0800	[thread overview]
Message-ID: <202404251628.454c0dc9-oliver.sang@intel.com> (raw)



Hello,

kernel test robot noticed "WARNING:at_net/core/skbuff.c:#skb_splice_from_iter" on:

commit: 9db64d371dbb2fa5a4897fb905dd3a25d31f15f9 ("mm: page_frag: reuse existing space for pagecnt_bias and pfmemalloc")
https://github.com/gestionlin/linux.git page_frag_cache_prototype_v3

in testcase: filebench
version: filebench-x86_64-22620e6-1_20240224
with following parameters:

	disk: 1HDD
	fs: xfs
	fs2: nfsv4
	test: singlestreamreaddirect.f
	cpufreq_governor: performance



compiler: gcc-13
test machine: 128 threads 2 sockets Intel(R) Xeon(R) Platinum 8358 CPU @ 2.60GHz (Ice Lake) with 128G memory

(please refer to attached dmesg/kmsg for entire log/backtrace)



If you fix the issue in a separate patch/commit (i.e. not just a new version of
the same patch/commit), kindly add following tags
| Reported-by: kernel test robot <oliver.sang@intel.com>
| Closes: https://lore.kernel.org/oe-lkp/202404251628.454c0dc9-oliver.sang@intel.com


[   91.421456][ T2564] ------------[ cut here ]------------
[ 91.427045][ T2564] WARNING: CPU: 103 PID: 2564 at net/core/skbuff.c:7104 skb_splice_from_iter (net/core/skbuff.c:7104 (discriminator 1)) 
[   91.436926][ T2564] Modules linked in: kmem rpcsec_gss_krb5 nfsv4 dns_resolver nfsd auth_rpcgss xfs device_dax nd_pmem nd_btt dax_pmem btrfs blake2b_generic xor raid6_pq libcrc32c sd_mod t10_pi crc64_rocksoft_generic crc64_rocksoft crc64 sg intel_rapl_msr intel_rapl_common x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel sha512_ssse3 rapl ahci ast libahci ipmi_ssif intel_cstate drm_shmem_helper acpi_ipmi mei_me ioatdma i2c_i801 intel_uncore ipmi_si dax_hmem libata drm_kms_helper mei intel_pch_thermal dca i2c_smbus wmi ipmi_devintf nfit ipmi_msghandler libnvdimm acpi_pad acpi_power_meter joydev binfmt_misc drm fuse loop dm_mod ip_tables
[   91.500845][ T2564] CPU: 103 PID: 2564 Comm: nfsd Tainted: G S                 6.9.0-rc4-00224-g9db64d371dbb #1
[   91.511138][ T2564] Hardware name: Intel Corporation M50CYP2SB1U/M50CYP2SB1U, BIOS SE5C620.86B.01.01.0003.2104260124 04/26/2021
[ 91.523110][ T2564] RIP: 0010:skb_splice_from_iter (net/core/skbuff.c:7104 (discriminator 1)) 
[ 91.529165][ T2564] Code: 8b 75 00 48 29 c1 4c 39 f9 49 8b 56 08 49 0f 47 cf 49 89 cc f6 c2 01 0f 85 98 01 00 00 66 90 4c 89 f2 48 8b 12 80 e6 08 74 79 <0f> 0b 49 c7 c7 fb ff ff ff 48 8b 44 24 10 41 01 45 70 41 01 45 74
All code
========
   0:	8b 75 00             	mov    0x0(%rbp),%esi
   3:	48 29 c1             	sub    %rax,%rcx
   6:	4c 39 f9             	cmp    %r15,%rcx
   9:	49 8b 56 08          	mov    0x8(%r14),%rdx
   d:	49 0f 47 cf          	cmova  %r15,%rcx
  11:	49 89 cc             	mov    %rcx,%r12
  14:	f6 c2 01             	test   $0x1,%dl
  17:	0f 85 98 01 00 00    	jne    0x1b5
  1d:	66 90                	xchg   %ax,%ax
  1f:	4c 89 f2             	mov    %r14,%rdx
  22:	48 8b 12             	mov    (%rdx),%rdx
  25:	80 e6 08             	and    $0x8,%dh
  28:	74 79                	je     0xa3
  2a:*	0f 0b                	ud2    		<-- trapping instruction
  2c:	49 c7 c7 fb ff ff ff 	mov    $0xfffffffffffffffb,%r15
  33:	48 8b 44 24 10       	mov    0x10(%rsp),%rax
  38:	41 01 45 70          	add    %eax,0x70(%r13)
  3c:	41 01 45 74          	add    %eax,0x74(%r13)

Code starting with the faulting instruction
===========================================
   0:	0f 0b                	ud2    
   2:	49 c7 c7 fb ff ff ff 	mov    $0xfffffffffffffffb,%r15
   9:	48 8b 44 24 10       	mov    0x10(%rsp),%rax
   e:	41 01 45 70          	add    %eax,0x70(%r13)
  12:	41 01 45 74          	add    %eax,0x74(%r13)
[   91.549120][ T2564] RSP: 0018:ffa000000b7a7b58 EFLAGS: 00010202
[   91.555257][ T2564] RAX: 0000000000000000 RBX: ff11002071e63e00 RCX: 0000000000000004
[   91.563312][ T2564] RDX: 0057ffffe0000840 RSI: 0000000000000001 RDI: ff1100012aba5c48
[   91.571362][ T2564] RBP: ffa000000b7a7b88 R08: ff1100012aba6c68 R09: 0000000000000078
[   91.579413][ T2564] R10: 0000000000000000 R11: ffd4000008c2ffc0 R12: 0000000000000004
[   91.587464][ T2564] R13: ff11002071e63e00 R14: ffd4000081c66a00 R15: 0000000000000004
[   91.595515][ T2564] FS:  0000000000000000(0000) GS:ff110020017c0000(0000) knlGS:0000000000000000
[   91.604526][ T2564] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   91.611186][ T2564] CR2: 00007ffff7b453d8 CR3: 000000207de1c001 CR4: 0000000000771ef0
[   91.619236][ T2564] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[   91.627288][ T2564] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[   91.635338][ T2564] PKRU: 55555554
[   91.638966][ T2564] Call Trace:
[   91.642329][ T2564]  <TASK>
[ 91.645343][ T2564] ? __warn (kernel/panic.c:694) 
[ 91.649493][ T2564] ? skb_splice_from_iter (net/core/skbuff.c:7104 (discriminator 1)) 
[ 91.654935][ T2564] ? report_bug (lib/bug.c:180 lib/bug.c:219) 
[ 91.659508][ T2564] ? handle_bug (arch/x86/kernel/traps.c:239) 
[ 91.663910][ T2564] ? exc_invalid_op (arch/x86/kernel/traps.c:260 (discriminator 1)) 
[ 91.668665][ T2564] ? asm_exc_invalid_op (arch/x86/include/asm/idtentry.h:621) 
[ 91.673780][ T2564] ? skb_splice_from_iter (net/core/skbuff.c:7104 (discriminator 1)) 
[ 91.679223][ T2564] ? skb_splice_from_iter (net/core/skbuff.c:7092) 
[ 91.684577][ T2564] tcp_sendmsg_locked (net/ipv4/tcp.c:1251 (discriminator 1)) 
[ 91.689684][ T2564] ? nfsd_splice_read (fs/nfsd/vfs.c:1065) nfsd
[ 91.695345][ T2564] ? __cond_resched (kernel/sched/core.c:8608) 
[ 91.700080][ T2564] tcp_sendmsg (net/ipv4/tcp.c:1343) 
[ 91.704374][ T2564] sock_sendmsg (net/socket.c:733 (discriminator 1) net/socket.c:745 (discriminator 1) net/socket.c:768 (discriminator 1)) 
[ 91.708846][ T2564] svc_tcp_sendmsg (net/sunrpc/svcsock.c:1238) 
[ 91.713671][ T2564] svc_tcp_sendto (net/sunrpc/svcsock.c:1270) 
[ 91.718305][ T2564] svc_send (net/sunrpc/svc_xprt.c:921) 
[ 91.722331][ T2564] svc_handle_xprt (net/sunrpc/svc_xprt.c:815) 
[ 91.727142][ T2564] svc_recv (include/linux/sunrpc/bc_xprt.h:40 net/sunrpc/svc_xprt.c:876) 
[ 91.731345][ T2564] ? __pfx_nfsd (fs/nfsd/nfssvc.c:912) nfsd
[ 91.736358][ T2564] nfsd (fs/nfsd/nfssvc.c:941) nfsd
[ 91.740682][ T2564] kthread (kernel/kthread.c:388) 
[ 91.744617][ T2564] ? __pfx_kthread (kernel/kthread.c:341) 
[ 91.749243][ T2564] ret_from_fork (arch/x86/kernel/process.c:147) 
[ 91.753704][ T2564] ? __pfx_kthread (kernel/kthread.c:341) 
[ 91.758324][ T2564] ret_from_fork_asm (arch/x86/entry/entry_64.S:257) 
[   91.763112][ T2564]  </TASK>
[   91.766156][ T2564] ---[ end trace 0000000000000000 ]---
[   91.771669][ T2564] rpc-srv/tcp: nfsd: got error -5 when sending 1048696 bytes - shutting down socket
[   94.152299][ T2559] rpc-srv/tcp: nfsd: got error -5 when sending 1048696 bytes - shutting down socket
[   96.522359][ T2559] rpc-srv/tcp: nfsd: got error -5 when sending 1048696 bytes - shutting down socket
[   97.757006][ T1468] Events enabled
[   97.757010][ T1468]
[   97.781707][    C0] perf: interrupt took too long (2585 > 2500), lowering kernel.perf_event_max_sample_rate to 77000
[   97.814727][    C0] perf: interrupt took too long (3346 > 3231), lowering kernel.perf_event_max_sample_rate to 59000
[   97.871706][    C0] perf: interrupt took too long (4206 > 4182), lowering kernel.perf_event_max_sample_rate to 47000
[   97.948725][    C0] perf: interrupt took too long (5292 > 5257), lowering kernel.perf_event_max_sample_rate to 37000
[   98.034694][    C0] perf: interrupt took too long (6671 > 6615), lowering kernel.perf_event_max_sample_rate to 29000
[   98.125726][    C0] perf: interrupt took too long (8412 > 8338), lowering kernel.perf_event_max_sample_rate to 23000
[   98.392731][    C0] perf: interrupt took too long (10525 > 10515), lowering kernel.perf_event_max_sample_rate to 19000
[   99.042094][ T2564] rpc-srv/tcp: nfsd: got error -5 when sending 1048696 bytes - shutting down socket
[  100.765375][ T1468] [ perf record: Woken up 59 times to write data ]
[  100.765378][ T1468]
[  100.908563][ T1468] Warning:
[  100.908567][ T1468]
[  100.915314][ T1468] Processed 192218 events and lost 2 chunks!
[  100.915316][ T1468]
[  100.923772][ T1468]
[  100.923773][ T1468]
[  100.928828][ T1468] Check IO/CPU overload!
[  100.928829][ T1468]
[  100.935484][ T1468]
[  100.935485][ T1468]
[  100.941436][ T1468] [ perf record: Captured and wrote 29.751 MB /tmp/lkp/perf_c2c.data (73428 samples) ]
[  100.941437][ T1468]
[  101.606690][ T2564] rpc-srv/tcp: nfsd: got error -5 when sending 1048696 bytes - shutting down socket
[  103.999638][ T2559] rpc-srv/tcp: nfsd: got error -5 when sending 1048696 bytes - shutting down socket
[  106.397587][ T2564] rpc-srv/tcp: nfsd: got error -5 when sending 1048696 bytes - shutting down socket
[  108.730699][ T2564] rpc-srv/tcp: nfsd: got error -5 when sending 1048696 bytes - shutting down socket
[  111.147193][ T2564] rpc-srv/tcp: nfsd: got error -5 when sending 1048696 bytes - shutting down socket
[  111.578643][ T2564] stack segment: 0000 [#1] SMP NOPTI
[  111.584040][ T2564] CPU: 38 PID: 2564 Comm: nfsd Tainted: G S      W          6.9.0-rc4-00224-g9db64d371dbb #1
[  111.594238][ T2564] Hardware name: Intel Corporation M50CYP2SB1U/M50CYP2SB1U, BIOS SE5C620.86B.01.01.0003.2104260124 04/26/2021
[ 111.606075][ T2564] RIP: 0010:find_openstateowner_str_locked (fs/nfsd/nfs4state.c:551) nfsd
[ 111.613452][ T2564] Code: cc cc cc cc 0f 1f 00 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 0f 1f 44 00 00 41 54 55 89 fd 53 48 c1 e5 04 48 03 6a 28 <48> 8b 5d 00 48 39 dd 74 27 4c 8d a6 b0 00 00 00 80 bb 70 01 00 00
All code
========
   0:	cc                   	int3   
   1:	cc                   	int3   
   2:	cc                   	int3   
   3:	cc                   	int3   
   4:	0f 1f 00             	nopl   (%rax)
   7:	90                   	nop
   8:	90                   	nop
   9:	90                   	nop
   a:	90                   	nop
   b:	90                   	nop
   c:	90                   	nop
   d:	90                   	nop
   e:	90                   	nop
   f:	90                   	nop
  10:	90                   	nop
  11:	90                   	nop
  12:	90                   	nop
  13:	90                   	nop
  14:	90                   	nop
  15:	90                   	nop
  16:	90                   	nop
  17:	0f 1f 44 00 00       	nopl   0x0(%rax,%rax,1)
  1c:	41 54                	push   %r12
  1e:	55                   	push   %rbp
  1f:	89 fd                	mov    %edi,%ebp
  21:	53                   	push   %rbx
  22:	48 c1 e5 04          	shl    $0x4,%rbp
  26:	48 03 6a 28          	add    0x28(%rdx),%rbp
  2a:*	48 8b 5d 00          	mov    0x0(%rbp),%rbx		<-- trapping instruction
  2e:	48 39 dd             	cmp    %rbx,%rbp
  31:	74 27                	je     0x5a
  33:	4c 8d a6 b0 00 00 00 	lea    0xb0(%rsi),%r12
  3a:	80                   	.byte 0x80
  3b:	bb 70 01 00 00       	mov    $0x170,%ebx

Code starting with the faulting instruction
===========================================
   0:	48 8b 5d 00          	mov    0x0(%rbp),%rbx
   4:	48 39 dd             	cmp    %rbx,%rbp
   7:	74 27                	je     0x30
   9:	4c 8d a6 b0 00 00 00 	lea    0xb0(%rsi),%r12
  10:	80                   	.byte 0x80
  11:	bb 70 01 00 00       	mov    $0x170,%ebx


The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20240425/202404251628.454c0dc9-oliver.sang@intel.com



-- 
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki


                 reply	other threads:[~2024-04-25  8:33 UTC|newest]

Thread overview: [no followups] expand[flat|nested]  mbox.gz  Atom feed

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=202404251628.454c0dc9-oliver.sang@intel.com \
    --to=oliver.sang@intel.com \
    --cc=linyunsheng@huawei.com \
    --cc=lkp@intel.com \
    --cc=oe-lkp@lists.linux.dev \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).