commits

Pull compute express link (cxl) fixes from Dan Williams:
"Several fixes for driver startup regressions that landed during the
merge window as well as some older bugs.

The regressions were due to a lack of testing with what the CXL
specification calls Restricted CXL Host (RCH) topologies compared to
the testing with Virtual Host (VH) CXL topologies. A VH topology is
typical PCIe while RCH topologies map CXL endpoints as Root Complex
Integrated endpoints. The impact is some driver crashes on startup.

This merge window also added compatibility for range registers (the
mechanism that CXL 1.1 defined for mapping memory) to treat them like
HDM decoders (the mechanism that CXL 2.0 defined for mapping
Host-managed Device Memory). That work collided with the new region
enumeration code that was tested with CXL 2.0 setups, and fails with
crashes at startup.

Lastly, the DOE (Data Object Exchange) implementation for retrieving
an ACPI-like data table from CXL devices is being reworked for v6.4.
Several fixes fell out of that work that are suitable for v6.3.

All of this has been in linux-next for a while, and all reported
issues [1] have been addressed.

Summary:

- Fix several issues with region enumeration in RCH topologies that
can trigger crashes on driver startup or shutdown.

- Fix CXL DVSEC range register compatibility versus region
enumeration that leads to startup crashes

- Fix CDAT endiannes handling

- Fix multiple buffer handling boundary conditions

- Fix Data Object Exchange (DOE) workqueue usage vs
CONFIG_DEBUG_OBJECTS warn splats"

Link: http://lore.kernel.org/r/20230405075704.33de8121@canb.auug.org.au [1]

* tag 'cxl-fixes-6.3-rc6' of git://git.kernel.org/pub/scm/linux/kernel/git/cxl/cxl:
cxl/hdm: Extend DVSEC range register emulation for region enumeration
cxl/hdm: Limit emulation to the number of range registers
cxl/region: Move coherence tracking into cxl_region_attach()
cxl/region: Fix region setup/teardown for RCDs
cxl/port: Fix find_cxl_root() for RCDs and simplify it
cxl/hdm: Skip emulation when driver manages mem_enable
cxl/hdm: Fix double allocation of @cxlhdm
PCI/DOE: Fix memory leak with CONFIG_DEBUG_OBJECTS=y
PCI/DOE: Silence WARN splat with CONFIG_DEBUG_OBJECTS=y
cxl/pci: Handle excessive CDAT length
cxl/pci: Handle truncated CDAT entries
cxl/pci: Handle truncated CDAT header
cxl/pci: Fix CDAT retrieval on big endian

2y ago

Tony Luck

81515ecf

x86/cpu: Add model number for Intel Arrow Lake processor

2y ago

Peter Zijlstra

b1680989

perf: Optimize perf_pmu_migrate_context()

2y ago

Linus Torvalds

cdc9718d

Merge tag '6.3-rc5-smb3-cifs-client-fixes' of git://git.samba.org/sfrench/cifs-2.6

2y ago

Dan Williams

ca712e47

Merge branch 'for-6.3/cxl-doe-fixes' into for-6.3/cxl

2y ago

Eric DeVolder

fed8d877

x86/acpi/boot: Correct acpi_is_processor_usable() check

2y ago

Linus Torvalds

197b6b60

Linux 6.3-rc4 v6.3-rc4

2y ago

Linus Torvalds

68047c48

Merge tag 'char-misc-6.3-rc6' of git://git.kernel.org/pub/scm/linux/kernel/git/gregkh/char-misc

2y ago

Dan Carpenter

4f5d5b33

cifs: double lock in cifs_reconnect_tcon()

2y ago

Dan Williams

24b18197

cxl/hdm: Extend DVSEC range register emulation for region enumeration

2y ago

Lukas Wunner

abf04be0

PCI/DOE: Fix memory leak with CONFIG_DEBUG_OBJECTS=y

2y ago

Mario Limonciello

a74fabfb

x86/ACPI/boot: Use FADT version to check support for online capable

2y ago

Linus Torvalds

0ec57cfa

Merge tag 'usb-6.3-rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/gregkh/usb

2y ago

Linus Torvalds

aa46fe36

Merge tag 'tty-6.3-rc6' of git://git.kernel.org/pub/scm/linux/kernel/git/gregkh/tty

2y ago

Greg Kroah-Hartman

4bffd2c7

Merge tag 'iio-fixes-for-6.3a' of https://git.kernel.org/pub/scm/linux/kernel/git/jic23/iio into char-misc-linus

2y ago

Thiago Rafael Becker

d19342c6

cifs: sanitize paths in cifs_update_super_prepath.

2y ago

Dan Williams

52cc48ad

cxl/hdm: Limit emulation to the number of range registers

2y ago

Lukas Wunner

92dc899c

PCI/DOE: Silence WARN splat with CONFIG_DEBUG_OBJECTS=y

2y ago

Linus Torvalds

18940c88

Merge tag 'sched_urgent_for_v6.3_rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

2y ago

Fabrice Gasnier

50213832

usb: dwc2: fix a race, don't power off/on phy for dual-role mode

When in dual role mode (dr_mode == USB_DR_MODE_OTG), platform probe
successively basically calls:
- dwc2_gadget_init()
- dwc2_hcd_init()
- dwc2_lowlevel_hw_disable() since recent change [1]
- usb_add_gadget_udc()

The PHYs (and so the clocks it may provide) shouldn't be disabled for all
SoCs, in OTG mode, as the HCD part has been initialized.

On STM32 this creates some weird race condition upon boot, when:
- initially attached as a device, to a HOST
- and there is a gadget script invoked to setup the device part.
Below issue becomes systematic, as long as the gadget script isn't
started by userland: the hardware PHYs (and so the clocks provided by the
PHYs) remains disabled.
It ends up in having an endless interrupt storm, before the watchdog
resets the platform.

[ 16.924163] dwc2 49000000.usb-otg: EPs: 9, dedicated fifos, 952 entries in SPRAM
[ 16.962704] dwc2 49000000.usb-otg: DWC OTG Controller
[ 16.966488] dwc2 49000000.usb-otg: new USB bus registered, assigned bus number 2
[ 16.974051] dwc2 49000000.usb-otg: irq 77, io mem 0x49000000
[ 17.032170] hub 2-0:1.0: USB hub found
[ 17.042299] hub 2-0:1.0: 1 port detected
[ 17.175408] dwc2 49000000.usb-otg: Mode Mismatch Interrupt: currently in Host mode
[ 17.181741] dwc2 49000000.usb-otg: Mode Mismatch Interrupt: currently in Host mode
[ 17.189303] dwc2 49000000.usb-otg: Mode Mismatch Interrupt: currently in Host mode
...

The host part is also not functional, until the gadget part is configured.

The HW may only be disabled for peripheral mode (original init), e.g.
dr_mode == USB_DR_MODE_PERIPHERAL, until the gadget driver initializes.

But when in USB_DR_MODE_OTG, the HW should remain enabled, as the HCD part
is able to run, while the gadget part isn't necessarily configured.

I don't fully get the of purpose the original change, that claims disabling
the hardware is missing. It creates conditions on SOCs using the PHY
initialization to be completely non working in OTG mode. Original
change [1] should be reworked to be platform specific.

[1] https://lore.kernel.org/r/20221206-dwc2-gadget-dual-role-v1-2-36515e1092cd@theobroma-systems.com

Fixes: ade23d7b7ec5 ("usb: dwc2: power on/off phy for peripheral mode in dual-role mode")
Cc: stable <stable@kernel.org>
Signed-off-by: Fabrice Gasnier <fabrice.gasnier@foss.st.com>
Reviewed-by: Quentin Schulz <quentin.schulz@theobroma-systems.com>
Tested-by: Quentin Schulz <quentin.schulz@theobroma-systems.com>
Link: https://lore.kernel.org/r/20230315144433.3095859-1-fabrice.gasnier@foss.st.com
Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

2y ago

Linus Torvalds

a211b1c0

Merge tag 'usb-6.3-rc6' of git://git.kernel.org/pub/scm/linux/kernel/git/gregkh/usb

2y ago

Biju Das

f92ed0cd

tty: serial: sh-sci: Fix Rx on RZ/G2L SCI

2y ago

Greg Kroah-Hartman

4dd52392

Merge tag 'coresight-fixes-v6.3' of git://git.kernel.org/pub/scm/linux/kernel/git/coresight/linux into char-misc-linus

2y ago

Lars-Peter Clausen

363c7dc7

iio: adc: ti-ads7950: Set `can_sleep` flag for GPIO chip

2y ago

Linus Torvalds

7e364e56

Linux 6.3-rc5 v6.3-rc5

2y ago

Dan Williams

9ff3eec9

cxl/region: Move coherence tracking into cxl_region_attach()

2y ago

Lukas Wunner

4fe2c13d

cxl/pci: Handle excessive CDAT length

2y ago

Linus Torvalds

974fc943

Merge tag 'perf_urgent_for_v6.3_rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

2y ago

Vincent Guittot

a53ce18c

sched/fair: Sanitize vruntime of entity being migrated

2y ago

Fabrice Gasnier

f7473132

usb: dwc2: fix a devres leak in hw_enable upon suspend resume

2y ago

Linus Torvalds

a79d5c76

Merge tag 'scsi-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/jejb/scsi

2y ago

Pawel Laszczak

1edf4899

usb: cdnsp: Fixes error: uninitialized symbol 'len'

2y ago

Sherry Sun

178e00f3

tty: serial: fsl_lpuart: fix crash in lpuart_uport_is_active

2y ago

Greg Kroah-Hartman

84052541

Merge tag 'counter-fixes-6.3a' of git://git.kernel.org/pub/scm/linux/kernel/git/wbg/counter into char-misc-linus

2y ago

Suzuki K Poulose

735e7b30

coresight: etm4x: Do not access TRCIDR1 for identification

2y ago

Patrik Dahlström

49f76c49

iio: adc: palmas_gpadc: fix NULL dereference on rmmod

2y ago

Linus Torvalds

6ab608fe

Merge tag 'for-6.3-rc4-tag' of git://git.kernel.org/pub/scm/linux/kernel/git/kdave/linux

2y ago

Dan Williams

030f8803

cxl/region: Fix region setup/teardown for RCDs

2y ago

Lukas Wunner

b56faef2

cxl/pci: Handle truncated CDAT entries

2y ago

Linus Torvalds

f6cdaeb0

Merge tag 'core_urgent_for_v6.3_rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

2y ago

Breno Leitao

263f5eca

perf/x86/amd/core: Always clear status for idx

2y ago

Linus Torvalds

e8d018dd

Linux 6.3-rc3 v6.3-rc3

2y ago

Xu Yang

451b15ed

usb: chipidea: core: fix possible concurrent when switch role

2y ago

Linus Torvalds

da0af3c5

Merge tag 'block-6.3-2023-04-06' of git://git.kernel.dk/linux

2y ago

Zhong Jinghua

48b19b79

scsi: iscsi_tcp: Check that sock is valid before iscsi_set_param()

2y ago

Sandeep Dhavale

e07fec47

usb: gadgetfs: Fix ep_read_iter to handle ITER_UBUF

2y ago

Sherry Sun

9425914f

tty: serial: fsl_lpuart: avoid checking for transfer complete when UARTCTRL_SBK is asserted in lpuart32_tx_empty

2y ago

William Breathitt Gray

00f4bc51

counter: 104-quad-8: Fix Synapse action reported for Index signals

2y ago

Steve Clevenger

bf84937e

coresight-etm4: Fix for() loop drvdata->nr_addr_cmp range bug

2y ago

Nuno Sá

7b3825e9

iio: adc: max11410: fix read_poll_timeout() usage

2y ago

Javier Martinez Canillas

f95b8ea7

Revert "venus: firmware: Correct non-pix start and end addresses"

2y ago

Filipe Manana

2280d425

btrfs: ignore fiemap path cache when there are multiple paths for a node

2y ago

Dan Williams

d35b495d

cxl/port: Fix find_cxl_root() for RCDs and simplify it

2y ago

Lukas Wunner

34bafc74

cxl/pci: Handle truncated CDAT header

2y ago

Linus Torvalds

986c6374

Merge tag 'x86_urgent_for_v6.3_rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

2y ago

Frederic Weisbecker

b4165140

entry/rcu: Check TIF_RESCHED _after_ delayed RCU wake-up

2y ago

Linux 6.3-rc6 v6.3-rc6

09a9639e

Linus Torvalds

Merge tag 'perf_urgent_for_v6.3_rc6' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

faf8f418

Linus Torvalds

Merge tag 'x86_urgent_for_v6.3_rc6' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

4ba115e2

Linus Torvalds

perf/core: Fix the same task check in perf_event_set_output

The same task check in perf_event_set_output has some potential issues
for some usages.

For the current perf code, there is a problem if using of
perf_event_open() to have multiple samples getting into the same mmap’d
memory when they are both attached to the same process.
https://lore.kernel.org/all/92645262-D319-4068-9C44-2409EF44888E@gmail.com/
Because the event->ctx is not ready when the perf_event_set_output() is
invoked in the perf_event_open().

Besides the above issue, before the commit bd2756811766 ("perf: Rewrite
core context handling"), perf record can errors out when sampling with
a hardware event and a software event as below.
$ perf record -e cycles,dummy --per-thread ls
failed to mmap with 22 (Invalid argument)
That's because that prior to the commit a hardware event and a software
event are from different task context.

The problem should be a long time issue since commit c3f00c70276d
("perk: Separate find_get_context() from event initialization").

The task struct is stored in the event->hw.target for each per-thread
event. It is a more reliable way to determine whether two events are
attached to the same task.

The event->hw.target was also introduced several years ago by the
commit 50f16a8bf9d7 ("perf: Remove type specific target pointers"). It
can not only be used to fix the issue with the current code, but also
back port to fix the issues with an older kernel.

Note: The event->hw.target was introduced later than commit
c3f00c70276d. The patch may cannot be applied between the commit
c3f00c70276d and commit 50f16a8bf9d7. Anybody that wants to back-port
this at that period may have to find other solutions.

Fixes: c3f00c70276d ("perf: Separate find_get_context() from event initialization")
Signed-off-by: Kan Liang <kan.liang@linux.intel.com>
Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org>
Reviewed-by: Zhengjun Xing <zhengjun.xing@linux.intel.com>
Link: https://lkml.kernel.org/r/20230322202449.512091-1-kan.liang@linux.intel.com

24d3ae2f

Kan Liang

Merge tag 'cxl-fixes-6.3-rc6' of git://git.kernel.org/pub/scm/linux/kernel/git/cxl/cxl

c08cfd67

Linus Torvalds

x86/cpu: Add model number for Intel Arrow Lake processor

81515ecf

Tony Luck

perf: Optimize perf_pmu_migrate_context()

b1680989

Peter Zijlstra

Merge tag '6.3-rc5-smb3-cifs-client-fixes' of git://git.samba.org/sfrench/cifs-2.6

cdc9718d

Linus Torvalds

Merge branch 'for-6.3/cxl-doe-fixes' into for-6.3/cxl

ca712e47

Dan Williams

x86/acpi/boot: Correct acpi_is_processor_usable() check

fed8d877

Eric DeVolder

Linux 6.3-rc4 v6.3-rc4

197b6b60

Linus Torvalds

Merge tag 'char-misc-6.3-rc6' of git://git.kernel.org/pub/scm/linux/kernel/git/gregkh/char-misc

68047c48

Linus Torvalds

cifs: double lock in cifs_reconnect_tcon()

4f5d5b33

Dan Carpenter

cxl/hdm: Extend DVSEC range register emulation for region enumeration

24b18197

Dan Williams

PCI/DOE: Fix memory leak with CONFIG_DEBUG_OBJECTS=y

abf04be0

Lukas Wunner

x86/ACPI/boot: Use FADT version to check support for online capable

a74fabfb

Mario Limonciello

Merge tag 'usb-6.3-rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/gregkh/usb

0ec57cfa

Linus Torvalds

Merge tag 'tty-6.3-rc6' of git://git.kernel.org/pub/scm/linux/kernel/git/gregkh/tty

aa46fe36

Linus Torvalds

Merge tag 'iio-fixes-for-6.3a' of https://git.kernel.org/pub/scm/linux/kernel/git/jic23/iio into char-misc-linus

4bffd2c7

Greg Kroah-Hartman

cifs: sanitize paths in cifs_update_super_prepath.

d19342c6

Thiago Rafael Becker

cxl/hdm: Limit emulation to the number of range registers

52cc48ad

Dan Williams

PCI/DOE: Silence WARN splat with CONFIG_DEBUG_OBJECTS=y

Gregory Price reports a WARN splat with CONFIG_DEBUG_OBJECTS=y upon CXL
probing because pci_doe_submit_task() invokes INIT_WORK() instead of
INIT_WORK_ONSTACK() for a work_struct that was allocated on the stack.

All callers of pci_doe_submit_task() allocate the work_struct on the
stack, so replace INIT_WORK() with INIT_WORK_ONSTACK() as a backportable
short-term fix.

The long-term fix implemented by a subsequent commit is to move to a
synchronous API which allocates the work_struct internally in the DOE
library.

Stacktrace for posterity:

WARNING: CPU: 0 PID: 23 at lib/debugobjects.c:545 __debug_object_init.cold+0x18/0x183
CPU: 0 PID: 23 Comm: kworker/u2:1 Not tainted 6.1.0-0.rc1.20221019gitaae703b02f92.17.fc38.x86_64 #1
Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS rel-1.16.0-0-gd239552ce722-prebuilt.qemu.org 04/01/2014
Call Trace:
pci_doe_submit_task+0x5d/0xd0
pci_doe_discovery+0xb4/0x100
pcim_doe_create_mb+0x219/0x290
cxl_pci_probe+0x192/0x430
local_pci_probe+0x41/0x80
pci_device_probe+0xb3/0x220
really_probe+0xde/0x380
__driver_probe_device+0x78/0x170
driver_probe_device+0x1f/0x90
__driver_attach_async_helper+0x5c/0xe0
async_run_entry_fn+0x30/0x130
process_one_work+0x294/0x5b0

Fixes: 9d24322e887b ("PCI/DOE: Add DOE mailbox support functions")
Link: https://lore.kernel.org/linux-cxl/Y1bOniJliOFszvIK@memverge.com/
Reported-by: Gregory Price <gregory.price@memverge.com>
Tested-by: Ira Weiny <ira.weiny@intel.com>
Tested-by: Gregory Price <gregory.price@memverge.com>
Signed-off-by: Lukas Wunner <lukas@wunner.de>
Reviewed-by: Ira Weiny <ira.weiny@intel.com>
Reviewed-by: Dan Williams <dan.j.williams@intel.com>
Reviewed-by: Gregory Price <gregory.price@memverge.com>
Cc: stable@vger.kernel.org # v6.0+
Reviewed-by: Jonathan Cameron <Jonathan.Cameron@huawei.com>
Acked-by: Bjorn Helgaas <bhelgaas@google.com>
Link: https://lore.kernel.org/r/67a9117f463ecdb38a2dbca6a20391ce2f1e7a06.1678543498.git.lukas@wunner.de
Signed-off-by: Dan Williams <dan.j.williams@intel.com>