commits

When an rport event (RPORT_EV_READY) is updated without work being queued,
avoid taking an additional reference.

This issue was leading to memory leak. Trace from KMEMLEAK tool:

unreferenced object 0xffff8888259e8780 (size 512):
comm "kworker/2:1", jiffies 4433237386 (age 113021.971s)
hex dump (first 32 bytes):
58 0a ec cf 83 88 ff ff 00 00 00 00 00 00 00 00
01 00 00 00 08 00 00 00 13 7d f0 1e 0e 00 00 10
backtrace:
[<000000006b25760f>] fc_rport_recv_req+0x3c6/0x18f0 [libfc]
[<00000000f208d994>] fc_lport_recv_els_req+0x120/0x8a0 [libfc]
[<00000000a9c437b8>] fc_lport_recv+0xb9/0x130 [libfc]
[<00000000a9c437b8>] fc_lport_recv+0xb9/0x130 [libfc]
[<00000000ad5be37b>] qedf_ll2_process_skb+0x73d/0xad0 [qedf]
[<00000000e0eb6893>] process_one_work+0x382/0x6c0
[<000000002dfd9e21>] worker_thread+0x57/0x5c0
[<00000000b648204f>] kthread+0x1a0/0x1c0
[<0000000072f5ab20>] ret_from_fork+0x35/0x40
[<000000001d5c05d8>] 0xffffffffffffffff

Below is the log sequence which leads to memory leak. Here we get the
RPORT_EV_READY and RPORT_EV_STOP back to back, which lead to overwrite the
event RPORT_EV_READY by event RPORT_EV_STOP. Because of this, kref_count
gets incremented by 1.

kernel: host0: rport fffce5: Received PLOGI request
kernel: host0: rport fffce5: Received PLOGI in INIT state
kernel: host0: rport fffce5: Port is Ready
kernel: host0: rport fffce5: Received PRLI request while in state Ready
kernel: host0: rport fffce5: PRLI rspp type 8 active 1 passive 0
kernel: host0: rport fffce5: Received LOGO request while in state Ready
kernel: host0: rport fffce5: Delete port
kernel: host0: rport fffce5: Received PLOGI request
kernel: host0: rport fffce5: Received PLOGI in state Delete - send busy
kernel: host0: rport fffce5: work event 3
kernel: host0: rport fffce5: lld callback ev 3
kernel: host0: rport fffce5: work delete

Link: https://lore.kernel.org/r/20200626094959.32151-1-jhasan@marvell.com
Reviewed-by: Girish Basrur <gbasrur@marvell.com>
Reviewed-by: Saurav Kashyap <skashyap@marvell.com>
Reviewed-by: Shyam Sundar <ssundar@marvell.com>
Signed-off-by: Javed Hasan <jhasan@marvell.com>
Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com>

5y ago

Masahiro Yamada

b816b3db

kbuild: fix CONFIG_CC_CAN_LINK(_STATIC) for cross-compilation with Clang

5y ago

Linus Torvalds

0a319ef7

Merge tag 'x86-fpu-2020-06-01' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

5y ago

Steve Wahl

33649bf4

x86/apic/uv: Remove code for unused distributed GRU mode

5y ago

Dmitry Safonov

833e55bb

x86/vdso/vdso2c: Convert iterators to unsigned

5y ago

Thomas Gleixner

8ae0ae67

rcu: Provide rcu_irq_exit_preempt()

5y ago

Linus Torvalds

743f0573

Merge tag 'pm-5.7-rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/rafael/linux-pm

5y ago

Tang Bin

b52649ae

iommu/qcom: Fix local_base status check

5y ago

Arnd Bergmann

9c6c723f

btrfs: fix gcc-4.8 build warning for struct initializer

5y ago

Jiaxun Yang

a23df9a4

irqchip/loongson-pci-msi: Fix a typo in Kconfig

5y ago

Andy Lutomirski

cced0b24

selftests/x86: Consolidate and fix get/set_eflags() helpers

5y ago

Linus Torvalds

77834854

Merge branch 'i2c/for-current' of git://git.kernel.org/pub/scm/linux/kernel/git/wsa/linux

5y ago

Jens Axboe

b7db41c9

io_uring: fix regression with always ignoring signals in io_cqring_wait()

5y ago

Jens Axboe

b3c58fcd

Merge branch 'nvme-5.8' of git://git.infradead.org/nvme into block-5.8

5y ago

Javed Hasan

71f2bf85

scsi: libfc: Handling of extra kref

5y ago

Mauro Carvalho Chehab

8f8499a9

kconfig: qconf: parse newer types at debug info

5y ago

Linus Torvalds

eff5ddad

Merge tag 'x86-cpu-2020-06-01' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

5y ago

Yu-cheng Yu

55e00fb6

x86/fpu/xstate: Restore supervisor states for signal return

5y ago

Christoph Hellwig

2981cf83

x86/platform/uv: Remove the unused _uv_cpu_blade_processor_id() macro

5y ago

Dmitry Safonov

089ef557

x86/vdso/vdso2c: Correct error messages on file open

5y ago

Paul E. McKenney

9ea366f6

rcu: Make RCU IRQ enter/exit functions rely on in_nmi()

5y ago

Linus Torvalds

f66ed1eb

Merge tag 'iomap-5.7-fixes-1' of git://git.kernel.org/pub/scm/fs/xfs/xfs-linux

5y ago

Rafael J. Wysocki

a5383996

Merge branches 'pm-cpufreq' and 'pm-sleep'

5y ago

Greg Kroah-Hartman

ae74c19f

iommu: Properly export iommu_group_get_for_dev()

5y ago

Qu Wenruo

fcc99734

btrfs: transaction: Avoid deadlock due to bad initialization timing of fs_info::journal_info

[BUG]
One run of btrfs/063 triggered the following lockdep warning:
============================================
WARNING: possible recursive locking detected
5.6.0-rc7-custom+ #48 Not tainted
--------------------------------------------
kworker/u24:0/7 is trying to acquire lock:
ffff88817d3a46e0 (sb_internal#2){.+.+}, at: start_transaction+0x66c/0x890 [btrfs]

but task is already holding lock:
ffff88817d3a46e0 (sb_internal#2){.+.+}, at: start_transaction+0x66c/0x890 [btrfs]

other info that might help us debug this:
Possible unsafe locking scenario:

CPU0
----
lock(sb_internal#2);
lock(sb_internal#2);

*** DEADLOCK ***

May be due to missing lock nesting notation

4 locks held by kworker/u24:0/7:
#0: ffff88817b495948 ((wq_completion)btrfs-endio-write){+.+.}, at: process_one_work+0x557/0xb80
#1: ffff888189ea7db8 ((work_completion)(&work->normal_work)){+.+.}, at: process_one_work+0x557/0xb80
#2: ffff88817d3a46e0 (sb_internal#2){.+.+}, at: start_transaction+0x66c/0x890 [btrfs]
#3: ffff888174ca4da8 (&fs_info->reloc_mutex){+.+.}, at: btrfs_record_root_in_trans+0x83/0xd0 [btrfs]

stack backtrace:
CPU: 0 PID: 7 Comm: kworker/u24:0 Not tainted 5.6.0-rc7-custom+ #48
Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS 0.0.0 02/06/2015
Workqueue: btrfs-endio-write btrfs_work_helper [btrfs]
Call Trace:
dump_stack+0xc2/0x11a
__lock_acquire.cold+0xce/0x214
lock_acquire+0xe6/0x210
__sb_start_write+0x14e/0x290
start_transaction+0x66c/0x890 [btrfs]
btrfs_join_transaction+0x1d/0x20 [btrfs]
find_free_extent+0x1504/0x1a50 [btrfs]
btrfs_reserve_extent+0xd5/0x1f0 [btrfs]
btrfs_alloc_tree_block+0x1ac/0x570 [btrfs]
btrfs_copy_root+0x213/0x580 [btrfs]
create_reloc_root+0x3bd/0x470 [btrfs]
btrfs_init_reloc_root+0x2d2/0x310 [btrfs]
record_root_in_trans+0x191/0x1d0 [btrfs]
btrfs_record_root_in_trans+0x90/0xd0 [btrfs]
start_transaction+0x16e/0x890 [btrfs]
btrfs_join_transaction+0x1d/0x20 [btrfs]
btrfs_finish_ordered_io+0x55d/0xcd0 [btrfs]
finish_ordered_fn+0x15/0x20 [btrfs]
btrfs_work_helper+0x116/0x9a0 [btrfs]
process_one_work+0x632/0xb80
worker_thread+0x80/0x690
kthread+0x1a3/0x1f0
ret_from_fork+0x27/0x50

It's pretty hard to reproduce, only one hit so far.

[CAUSE]
This is because we're calling btrfs_join_transaction() without re-using
the current running one:

btrfs_finish_ordered_io()
|- btrfs_join_transaction() <<< Call #1
|- btrfs_record_root_in_trans()
|- btrfs_reserve_extent()
|- btrfs_join_transaction() <<< Call #2

Normally such btrfs_join_transaction() call should re-use the existing
one, without trying to re-start a transaction.

But the problem is, in btrfs_join_transaction() call #1, we call
btrfs_record_root_in_trans() before initializing current::journal_info.

And in btrfs_join_transaction() call #2, we're relying on
current::journal_info to avoid such deadlock.

[FIX]
Call btrfs_record_root_in_trans() after we have initialized
current::journal_info.

CC: stable@vger.kernel.org # 4.4+
Signed-off-by: Qu Wenruo <wqu@suse.com>
Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: David Sterba <dsterba@suse.com>

5y ago

Linus Torvalds

b3a9e3b9

Linux 5.8-rc1 v5.8-rc1

5y ago

Andy Lutomirski

a61fa279

selftests/x86/syscall_nt: Clear weird flags after each test

5y ago

Linus Torvalds

45a5ac7a

Merge tag 'mips_fixes_5.8_1' of git://git.kernel.org/pub/scm/linux/kernel/git/mips/linux

5y ago

Linux 5.8-rc4 v5.8-rc4

dcb7fd82

Linus Torvalds

x86/ldt: use "pr_info_once()" instead of open-coding it badly

bb5a93aa

Linus Torvalds

Merge tag 'x86-urgent-2020-07-05' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

72674d48

Linus Torvalds

Merge tag 'irq-urgent-2020-07-05' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

f23dbe18

Linus Torvalds

x86/ldt: Disable 16-bit segments on Xen PV

cc801833

Andy Lutomirski

Merge tag 'core-urgent-2020-07-05' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

5465a324

Linus Torvalds

Merge tag 'irqchip-fixes-5.8-1' of git://git.kernel.org/pub/scm/linux/kernel/git/maz/arm-platforms into irq/urgent

98817a84

Thomas Gleixner

x86/entry/32: Fix #MC and #DB wiring on x86_32

13cbc0cd

Andy Lutomirski

Merge tag 'kbuild-fixes-v5.8-2' of git://git.kernel.org/pub/scm/linux/kernel/git/masahiroy/linux-kbuild

4bc92736

Linus Torvalds

Merge branch 'urgent-for-mingo' of git://git.kernel.org/pub/scm/linux/kernel/git/paulmck/linux-rcu into core/urgent

5fdeefa0

Ingo Molnar

Linux 5.7-rc4 v5.7-rc4

0e698dfa

Linus Torvalds

irqchip/gic: Atomically update affinity

005c34ae

Marc Zyngier

x86/entry/xen: Route #DB correctly on Xen PV

f41f0824

Andy Lutomirski

Merge tag 'scsi-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/jejb/scsi

19a61a75

Linus Torvalds

.gitignore: Do not track `defconfig` from `make savedefconfig`

ba77dca5

Paul Menzel

Merge tag 'x86-vdso-2020-06-01' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

4e909124

Linus Torvalds

rcuperf: Fix printk format warning

b3e2d209

Kefeng Wang

Merge tag 'for-5.7-rc3-tag' of git://git.kernel.org/pub/scm/linux/kernel/git/kdave/linux

262f7a6b

Linus Torvalds

irqchip/riscv-intc: Fix a typo in a pr_warn()

559fe74b

Palmer Dabbelt

x86/entry, selftests: Further improve user entry sanity checks

3c73b81a

Andy Lutomirski

Merge tag 'block-5.8-2020-07-05' of git://git.kernel.dk/linux-block

29206c63

Linus Torvalds

scsi: mptfusion: Don't use GFP_ATOMIC for larger DMA allocations

311950f8

Christoph Hellwig

kbuild: make Clang build userprogs for target architecture

7f58b487

Masahiro Yamada

Merge tag 'x86-platform-2020-06-01' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

88bc1de1

Linus Torvalds

x86/vdso/Makefile: Add vobjs32

cd2f45b7

Dmitry Safonov

rcu: Provide __rcu_is_watching()

b1fcf9b8

Thomas Gleixner

Merge tag 'iommu-fixes-v5.7-rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/joro/iommu

ea915933

Linus Torvalds

MAINTAINERS: btrfs: fix git repo URL

eb91db63

Eric Biggers

irqchip/gic-v4.1: Use readx_poll_timeout_atomic() to fix sleep in atomic

31dbb6b1

Zenghui Yu

x86/entry/compat: Clear RAX high bits on Xen PV SYSENTER

db5b2c5a

Andy Lutomirski

Merge tag 'io_uring-5.8-2020-07-05' of git://git.kernel.dk/linux-block

9fbe565c

Linus Torvalds

block: make function __bio_integrity_free() static

3197d48a

Wei Yongjun

scsi: libfc: Skip additional kref updating work event

823a6540

Javed Hasan

kbuild: fix CONFIG_CC_CAN_LINK(_STATIC) for cross-compilation with Clang

b816b3db

Masahiro Yamada

Merge tag 'x86-fpu-2020-06-01' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

0a319ef7

Linus Torvalds

x86/apic/uv: Remove code for unused distributed GRU mode

33649bf4

Steve Wahl

x86/vdso/vdso2c: Convert iterators to unsigned

833e55bb

Dmitry Safonov

rcu: Provide rcu_irq_exit_preempt()

8ae0ae67

Thomas Gleixner

Merge tag 'pm-5.7-rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/rafael/linux-pm

743f0573

Linus Torvalds

iommu/qcom: Fix local_base status check

b52649ae

Tang Bin

btrfs: fix gcc-4.8 build warning for struct initializer

9c6c723f

Arnd Bergmann

irqchip/loongson-pci-msi: Fix a typo in Kconfig

a23df9a4

Jiaxun Yang

selftests/x86: Consolidate and fix get/set_eflags() helpers

cced0b24

Andy Lutomirski

Merge branch 'i2c/for-current' of git://git.kernel.org/pub/scm/linux/kernel/git/wsa/linux

77834854

Linus Torvalds

io_uring: fix regression with always ignoring signals in io_cqring_wait()

b7db41c9

Jens Axboe

Merge branch 'nvme-5.8' of git://git.infradead.org/nvme into block-5.8

b3c58fcd

Jens Axboe

scsi: libfc: Handling of extra kref

Handling of extra kref which is done by lookup table in case rdata is
already present in list.

This issue was leading to memory leak. Trace from KMEMLEAK tool:

unreferenced object 0xffff8888259e8780 (size 512):
comm "kworker/2:1", pid 182614, jiffies 4433237386 (age 113021.971s)
hex dump (first 32 bytes):
58 0a ec cf 83 88 ff ff 00 00 00 00 00 00 00 00
01 00 00 00 08 00 00 00 13 7d f0 1e 0e 00 00 10
backtrace:
[<000000006b25760f>] fc_rport_recv_req+0x3c6/0x18f0 [libfc]
[<00000000f208d994>] fc_lport_recv_els_req+0x120/0x8a0 [libfc]
[<00000000a9c437b8>] fc_lport_recv+0xb9/0x130 [libfc]
[<00000000ad5be37b>] qedf_ll2_process_skb+0x73d/0xad0 [qedf]
[<00000000e0eb6893>] process_one_work+0x382/0x6c0
[<000000002dfd9e21>] worker_thread+0x57/0x5c0
[<00000000b648204f>] kthread+0x1a0/0x1c0
[<0000000072f5ab20>] ret_from_fork+0x35/0x40
[<000000001d5c05d8>] 0xffffffffffffffff

Below is the log sequence which leads to memory leak. Here we get the
nested "Received PLOGI request" for same port and this request leads to
call the fc_rport_create() twice for the same rport.

kernel: host1: rport fffce5: Received PLOGI request
kernel: host1: rport fffce5: Received PLOGI in INIT state
kernel: host1: rport fffce5: Port is Ready
kernel: host1: rport fffce5: Received PRLI request while in state Ready
kernel: host1: rport fffce5: PRLI rspp type 8 active 1 passive 0
kernel: host1: rport fffce5: Received LOGO request while in state Ready
kernel: host1: rport fffce5: Delete port
kernel: host1: rport fffce5: Received PLOGI request
kernel: host1: rport fffce5: Received PLOGI in state Delete - send busy

Link: https://lore.kernel.org/r/20200622101212.3922-2-jhasan@marvell.com
Reviewed-by: Girish Basrur <gbasrur@marvell.com>
Reviewed-by: Saurav Kashyap <skashyap@marvell.com>
Reviewed-by: Shyam Sundar <ssundar@marvell.com>
Signed-off-by: Javed Hasan <jhasan@marvell.com>
Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com>