commits

Kyle has reported occasional crashes when booting a kernel in 5-level
paging mode with KASLR enabled:

WARNING: CPU: 0 PID: 0 at arch/x86/mm/init_64.c:87 phys_p4d_init+0x1d4/0x1ea
RIP: 0010:phys_p4d_init+0x1d4/0x1ea
Call Trace:
__kernel_physical_mapping_init+0x10a/0x35c
kernel_physical_mapping_init+0xe/0x10
init_memory_mapping+0x1aa/0x3b0
init_range_memory_mapping+0xc8/0x116
init_mem_mapping+0x225/0x2eb
setup_arch+0x6ff/0xcf5
start_kernel+0x64/0x53b
? copy_bootdata+0x1f/0xce
x86_64_start_reservations+0x24/0x26
x86_64_start_kernel+0x8a/0x8d
secondary_startup_64+0xb6/0xc0

which causes later:

BUG: unable to handle page fault for address: ff484d019580eff8
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page
BAD
Oops: 0000 [#1] SMP NOPTI
RIP: 0010:fill_pud+0x13/0x130
Call Trace:
set_pte_vaddr_p4d+0x2e/0x50
set_pte_vaddr+0x6f/0xb0
__native_set_fixmap+0x28/0x40
native_set_fixmap+0x39/0x70
register_lapic_address+0x49/0xb6
early_acpi_boot_init+0xa5/0xde
setup_arch+0x944/0xcf5
start_kernel+0x64/0x53b

Kyle bisected the issue to commit b569c1843498 ("x86/mm/KASLR: Reduce
randomization granularity for 5-level paging to 1GB").

Before this commit, PAGE_OFFSET was always aligned to P4D_SIZE when booting
in 5-level paging mode. Now only PUD_SIZE alignment is guaranteed.

In the case I was able to reproduce the following vaddr/paddr values were
observed in phys_p4d_init():

Iteration  vaddr               paddr
1          0xff4228027fe00000  0x033fe00000
2          0xff42287f40000000  0x8000000000

'vaddr' in both cases belongs to the same p4d entry.

But due to the original assumption that PAGE_OFFSET is aligned to P4D_SIZE
this overlap cannot be handled correctly. The code assumes strictly aligned
entries and unconditionally increments the index into the P4D table, which
creates false duplicate entries. Once the index reaches the end, the last
entry in the page table is missing.

Aside from that, the 'paddr >= paddr_end' condition can yield the wrong
result, which causes a P4D entry to be cleared incorrectly.

Change the loop in phys_p4d_init() to walk purely based on virtual
addresses like __kernel_physical_mapping_init() does. This makes it work
correctly with unaligned virtual addresses.
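
The address arithmetic can be checked outside the kernel. What follows is a
minimal user-space sketch in Python, not the kernel code. It assumes the
usual x86-64 5-level layout (P4D_SHIFT = 39, PUD_SHIFT = 30, 512 entries per
P4D table); the helper names p4d_index() and walk_p4d() and the clamping with
min() are illustrative choices. It shows that both observed vaddr values
select the same P4D slot while the paddr values do not, and it models a walk
that steps to the next P4D boundary of the virtual address.

```python
# Assumed constants for x86-64 with 5-level paging.
P4D_SHIFT = 39
PUD_SHIFT = 30
PTRS_PER_P4D = 512
P4D_SIZE = 1 << P4D_SHIFT
PUD_SIZE = 1 << PUD_SHIFT
P4D_MASK = ~(P4D_SIZE - 1)


def p4d_index(addr):
    """Return the slot in the P4D table that an address selects."""
    return (addr >> P4D_SHIFT) & (PTRS_PER_P4D - 1)


def walk_p4d(vaddr, vaddr_end):
    """Yield (slot, start, end) per P4D entry, stepping by virtual address."""
    while vaddr < vaddr_end:
        # Advance to the next P4D boundary, clamped to the end of the range.
        vaddr_next = min((vaddr & P4D_MASK) + P4D_SIZE, vaddr_end)
        yield p4d_index(vaddr), vaddr, vaddr_next
        vaddr = vaddr_next


# (vaddr, paddr) pairs quoted in the commit message.
observed = [
    (0xff4228027fe00000, 0x033fe00000),
    (0xff42287f40000000, 0x8000000000),
]

# With a linear direct mapping, PAGE_OFFSET is vaddr - paddr.
page_offset = observed[0][0] - observed[0][1]

for vaddr, paddr in observed:
    print(hex(vaddr), "-> slot", p4d_index(vaddr),
          "| paddr-based slot", p4d_index(paddr))
print("PAGE_OFFSET", hex(page_offset),
      "PUD aligned:", page_offset % PUD_SIZE == 0,
      "P4D aligned:", page_offset % P4D_SIZE == 0)
```

Indexing by vaddr keeps both iterations on one table entry, whereas an index
derived from paddr, or one that is simply incremented per iteration, moves
on to the next entry.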

Fixes: b569c1843498 ("x86/mm/KASLR: Reduce randomization granularity for 5-level paging to 1GB")
Reported-by: Kyle Pelton <kyle.d.pelton@intel.com>
Signed-off-by: Kirill A. Shutemov <kirill.shutemov@linux.intel.com>
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
Tested-by: Kyle Pelton <kyle.d.pelton@intel.com>
Acked-by: Baoquan He <bhe@redhat.com>
Cc: Borislav Petkov <bp@alien8.de>
Cc: "H. Peter Anvin" <hpa@zytor.com>
Cc: Dave Hansen <dave.hansen@linux.intel.com>
Cc: Andy Lutomirski <luto@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Link: https://lkml.kernel.org/r/20190624123150.920-1-kirill.shutemov@linux.intel.com

6y ago

Linus Torvalds

b253d5f3

Merge tag 'pci-v5.2-fixes-1' of git://git.kernel.org/pub/scm/linux/kernel/git/helgaas/pci

6y ago

Peter Xu

0aafc8ae

Revert "iommu/vt-d: Fix lock inversion between iommu->lock and device_domain_lock"

This reverts commit 7560cc3ca7d9d11555f80c830544e463fcdb28b8.

With 5.2.0-rc5 I can easily trigger this with lockdep and iommu=pt:

======================================================
WARNING: possible circular locking dependency detected
5.2.0-rc5 #78 Not tainted
------------------------------------------------------
swapper/0/1 is trying to acquire lock:
00000000ea2b3beb (&(&iommu->lock)->rlock){+.+.}, at: domain_context_mapping_one+0xa5/0x4e0
but task is already holding lock:
00000000a681907b (device_domain_lock){....}, at: domain_context_mapping_one+0x8d/0x4e0
which lock already depends on the new lock.
the existing dependency chain (in reverse order) is:
-> #1 (device_domain_lock){....}:
_raw_spin_lock_irqsave+0x3c/0x50
dmar_insert_one_dev_info+0xbb/0x510
domain_add_dev_info+0x50/0x90
dev_prepare_static_identity_mapping+0x30/0x68
intel_iommu_init+0xddd/0x1422
pci_iommu_init+0x16/0x3f
do_one_initcall+0x5d/0x2b4
kernel_init_freeable+0x218/0x2c1
kernel_init+0xa/0x100
ret_from_fork+0x3a/0x50
-> #0 (&(&iommu->lock)->rlock){+.+.}:
lock_acquire+0x9e/0x170
_raw_spin_lock+0x25/0x30
domain_context_mapping_one+0xa5/0x4e0
pci_for_each_dma_alias+0x30/0x140
dmar_insert_one_dev_info+0x3b2/0x510
domain_add_dev_info+0x50/0x90
dev_prepare_static_identity_mapping+0x30/0x68
intel_iommu_init+0xddd/0x1422
pci_iommu_init+0x16/0x3f
do_one_initcall+0x5d/0x2b4
kernel_init_freeable+0x218/0x2c1
kernel_init+0xa/0x100
ret_from_fork+0x3a/0x50

other info that might help us debug this:
Possible unsafe locking scenario:
       CPU0                    CPU1
       ----                    ----
  lock(device_domain_lock);
                               lock(&(&iommu->lock)->rlock);
                               lock(device_domain_lock);
  lock(&(&iommu->lock)->rlock);

*** DEADLOCK ***
2 locks held by swapper/0/1:
#0: 00000000033eb13d (dmar_global_lock){++++}, at: intel_iommu_init+0x1e0/0x1422
#1: 00000000a681907b (device_domain_lock){....}, at: domain_context_mapping_one+0x8d/0x4e0

stack backtrace:
CPU: 2 PID: 1 Comm: swapper/0 Not tainted 5.2.0-rc5 #78
Hardware name: LENOVO 20KGS35G01/20KGS35G01, BIOS N23ET50W (1.25 ) 06/25/2018
Call Trace:
dump_stack+0x85/0xc0
print_circular_bug.cold.57+0x15c/0x195
__lock_acquire+0x152a/0x1710
lock_acquire+0x9e/0x170
? domain_context_mapping_one+0xa5/0x4e0
_raw_spin_lock+0x25/0x30
? domain_context_mapping_one+0xa5/0x4e0
domain_context_mapping_one+0xa5/0x4e0
? domain_context_mapping_one+0x4e0/0x4e0
pci_for_each_dma_alias+0x30/0x140
dmar_insert_one_dev_info+0x3b2/0x510
domain_add_dev_info+0x50/0x90
dev_prepare_static_identity_mapping+0x30/0x68
intel_iommu_init+0xddd/0x1422
? printk+0x58/0x6f
? lockdep_hardirqs_on+0xf0/0x180
? do_early_param+0x8e/0x8e
? e820__memblock_setup+0x63/0x63
pci_iommu_init+0x16/0x3f
do_one_initcall+0x5d/0x2b4
? do_early_param+0x8e/0x8e
? rcu_read_lock_sched_held+0x55/0x60
? do_early_param+0x8e/0x8e
kernel_init_freeable+0x218/0x2c1
? rest_init+0x230/0x230
kernel_init+0xa/0x100
ret_from_fork+0x3a/0x50

domain_context_mapping_one() is taking device_domain_lock first then
iommu lock, while dmar_insert_one_dev_info() is doing the reverse.

That was most likely introduced by commit:

7560cc3ca7d9 ("iommu/vt-d: Fix lock inversion between iommu->lock and
device_domain_lock", 2019-05-27)

So far I still cannot figure out how the previous deadlock was
triggered (I cannot find the iommu lock being taken before the call to
iommu_flush_dev_iotlb()). However, I'm pretty sure that change is
incomplete at least, because it does not fix all the places, so we're
still taking the locks in different orders. Reverting that commit, on
the other hand, looks clean to me so far: we then always take
device_domain_lock first and the iommu lock second.

We can continue to try to find the real culprit mentioned in
7560cc3ca7d9, but for now I think we should revert it to fix current
breakage.
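
The inversion described above can be modelled in a few lines. This is a
minimal sketch in Python of an order checker, far simpler than lockdep and
not derived from its implementation; the class LockOrderChecker and its
methods are hypothetical names. It records "held -> acquired" edges and
reports when a new acquisition closes a cycle, which is what happens when
one path takes the iommu lock before device_domain_lock and another path
does the reverse.

```python
from collections import defaultdict


class LockOrderChecker:
    """Record 'held -> acquired' edges and flag an acquisition that inverts them."""

    def __init__(self):
        # Maps a lock name to the locks that were taken while it was held.
        self.after = defaultdict(set)

    def _reachable(self, src, dst):
        """Depth-first search: is dst reachable from src via recorded edges?"""
        stack, seen = [src], set()
        while stack:
            node = stack.pop()
            if node == dst:
                return True
            if node not in seen:
                seen.add(node)
                stack.extend(self.after[node])
        return False

    def acquire(self, held, new):
        """Return True if taking `new` while holding `held` closes a cycle."""
        inversion = self._reachable(new, held)
        self.after[held].add(new)
        return inversion


checker = LockOrderChecker()

# Order attributed to dmar_insert_one_dev_info() in the message above.
first = checker.acquire(held="iommu->lock", new="device_domain_lock")

# Order attributed to domain_context_mapping_one(): the reverse.
second = checker.acquire(held="device_domain_lock", new="iommu->lock")

print("first acquisition inverts order:", first)
print("second acquisition inverts order:", second)
```

Once every path takes device_domain_lock first, no acquisition closes a
cycle and the checker stays quiet.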

CC: Joerg Roedel <joro@8bytes.org>
CC: Lu Baolu <baolu.lu@linux.intel.com>
CC: dave.jiang@intel.com
Signed-off-by: Peter Xu <peterx@redhat.com>
Tested-by: Chris Wilson <chris@chris-wilson.co.uk>
Signed-off-by: Joerg Roedel <jroedel@suse.de>

6y ago

Christophe Leroy

82f6e266

powerpc/32: fix build failure on book3e with KVM

6y ago

Linus Torvalds

01305db8

Merge tag 'xarray-5.2-rc6' of git://git.infradead.org/users/willy/linux-dax

6y ago

Rafael J. Wysocki

471a739a

PCI: PM: Avoid skipping bus-level PM on platforms without ACPI

6y ago

Rob Bradford

88447c5b

efi: Allow the number of EFI configuration tables entries to be zero

6y ago

Tian Baofeng

975a6166

efibc: Replace variable set function in notifier call

6y ago

Nicholas Piggin

471ba0e6

irq_work: Do not raise an IPI when queueing work on the local CPU

6y ago

Julien Grall

16e32c3c

iommu/dma-iommu: Remove iommu_dma_map_msi_msg()

6y ago

Paul Burton

6d4d367d

irqchip/mips-gic: Use the correct local interrupt map registers

6y ago

Kan Liang

90d42491

perf/x86/regs: Check reserved bits

6y ago

Kirill A. Shutemov

c1887159

x86/boot/64: Add missing fixup_pointer() for next_early_pgt access

6y ago

Linus Torvalds

f4102766

Merge tag 'scsi-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/jejb/scsi

6y ago

Logan Gunthorpe

6dbbd053

PCI/P2PDMA: Ignore root complex whitelist when an IOMMU is present

6y ago

Linus Torvalds

9e0babf2

Linux 5.2-rc5 v5.2-rc5

6y ago

Christophe Leroy

e8732ffa

powerpc/booke: fix fast syscall entry on SMP

6y ago

Linus Torvalds

0839c537

Merge branch 'akpm' (patches from Andrew)

6y ago

Matthew Wilcox

12fd2aee

XArray tests: Add check_insert

6y ago

Gen Zhang

4e78921b

efi/x86/Add missing error handling to old_memmap 1:1 mapping code

6y ago

Qian Cai

919aef44

x86/efi: fix a -Wtype-limits compilation warning

6y ago

Gustavo A. R. Silva

2d65c42b

genirq/devres: Use struct_size() in devm_kzalloc()

6y ago

Julien Grall

73103975

irqchip/gic-v3-mbi: Don't map the MSI page in mbi_compose_m{b, s}i_msg()

6y ago

Peter Ujfalusi

eb737b8f

irqchip/ti-sci-inta: Fix kernel crash if irq_create_fwspec_mapping fail

6y ago

Kan Liang

e321d02d

perf/x86: Disable extended registers for non-supported PMUs

The perf fuzzer caused a Skylake machine to crash:

[ 9680.085831] Call Trace:
[ 9680.088301] <IRQ>
[ 9680.090363] perf_output_sample_regs+0x43/0xa0
[ 9680.094928] perf_output_sample+0x3aa/0x7a0
[ 9680.099181] perf_event_output_forward+0x53/0x80
[ 9680.103917] __perf_event_overflow+0x52/0xf0
[ 9680.108266] ? perf_trace_run_bpf_submit+0xc0/0xc0
[ 9680.113108] perf_swevent_hrtimer+0xe2/0x150
[ 9680.117475] ? check_preempt_wakeup+0x181/0x230
[ 9680.122091] ? check_preempt_curr+0x62/0x90
[ 9680.126361] ? ttwu_do_wakeup+0x19/0x140
[ 9680.130355] ? try_to_wake_up+0x54/0x460
[ 9680.134366] ? reweight_entity+0x15b/0x1a0
[ 9680.138559] ? __queue_work+0x103/0x3f0
[ 9680.142472] ? update_dl_rq_load_avg+0x1cd/0x270
[ 9680.147194] ? timerqueue_del+0x1e/0x40
[ 9680.151092] ? __remove_hrtimer+0x35/0x70
[ 9680.155191] __hrtimer_run_queues+0x100/0x280
[ 9680.159658] hrtimer_interrupt+0x100/0x220
[ 9680.163835] smp_apic_timer_interrupt+0x6a/0x140
[ 9680.168555] apic_timer_interrupt+0xf/0x20
[ 9680.172756] </IRQ>

The XMM registers can only be collected by PEBS hardware events on
platforms with PEBS baseline support (e.g. Icelake), not by
software/probe events.

Add the capability flag PERF_PMU_CAP_EXTENDED_REGS to indicate which
PMUs support extended registers. For x86, the extended registers are
the XMM registers.

Add has_extended_regs() to check whether extended registers are requested.

The generic code defines the mask of extended registers as 0 if the arch
headers haven't overridden it.
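
The check can be pictured with a small model. This is a sketch in Python
under stated assumptions, not the kernel implementation: the bit value of
PERF_PMU_CAP_EXTENDED_REGS, the mask covering register numbers 32 and up,
the EOPNOTSUPP return value, and the try_init_event() helper are all
illustrative. Only the names PERF_PMU_CAP_EXTENDED_REGS and
has_extended_regs() come from the text above.

```python
# Illustrative values; the real ones live in the kernel headers.
PERF_PMU_CAP_EXTENDED_REGS = 0x100
PERF_REG_EXTENDED_MASK = ~((1 << 32) - 1) & ((1 << 64) - 1)  # assumed: regs 32+
EOPNOTSUPP = 95


def has_extended_regs(sample_regs_user, sample_regs_intr,
                      mask=PERF_REG_EXTENDED_MASK):
    """True if either register bitmap of an event asks for an extended register."""
    return bool((sample_regs_user | sample_regs_intr) & mask)


def try_init_event(pmu_capabilities, sample_regs_user=0, sample_regs_intr=0):
    """Hypothetical init step: refuse extended registers on PMUs without the flag."""
    wants_extended = has_extended_regs(sample_regs_user, sample_regs_intr)
    if wants_extended and not (pmu_capabilities & PERF_PMU_CAP_EXTENDED_REGS):
        return -EOPNOTSUPP
    return 0


xmm0_only = 1 << 32  # bitmap requesting the first assumed extended register

# A PMU without the capability, such as a software PMU in this model.
print(try_init_event(0, sample_regs_intr=xmm0_only))

# A PMU that advertises extended register support.
print(try_init_event(PERF_PMU_CAP_EXTENDED_REGS, sample_regs_intr=xmm0_only))
```

With a mask of 0, as in the generic fallback, has_extended_regs() can never
return true, so architectures without extended registers are unaffected.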

Originally-by: Peter Zijlstra (Intel) <peterz@infradead.org>
Reported-by: Vince Weaver <vincent.weaver@maine.edu>
Signed-off-by: Kan Liang <kan.liang@linux.intel.com>
Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: Jiri Olsa <jolsa@redhat.com>
Cc: Linus Torvalds <torvalds@linux-foundation.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Stephane Eranian <eranian@google.com>
Cc: Thomas Gleixner <tglx@linutronix.de>
Fixes: 878068ea270e ("perf/x86: Support outputting XMM registers")
Link: https://lkml.kernel.org/r/1559081314-9714-1-git-send-email-kan.liang@linux.intel.com
Signed-off-by: Ingo Molnar <mingo@kernel.org>

6y ago

Kirill A. Shutemov

81c7ed29

x86/boot/64: Fix crash if kernel image crosses page table boundary

6y ago

Linus Torvalds

a8282bf0

Merge tag 'powerpc-5.2-5' of git://git.kernel.org/pub/scm/linux/kernel/git/powerpc/linux

6y ago

Arun Easi

5589b08e

scsi: qla2xxx: Fix hardlockup in abort command during driver remove

6y ago

Linus Torvalds

a188339c

Linux 5.2-rc1 v5.2-rc1

6y ago

Linus Torvalds

963172d9

Merge branch 'x86-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

Pull x86 fixes from Thomas Gleixner:
"The accumulated fixes from this and last week:

- Fix vmalloc TLB flush and map range calculations which lead to
stale TLBs, spurious faults and other hard to diagnose issues.

- Use fault_in_pages_writeable() for prefaulting the user stack in the
FPU code as it's less fragile than the current solution.

- Use the PF_KTHREAD flag when checking for a kernel thread instead
of current->mm as the latter can give the wrong answer due to
use_mm()

- Compute the vmemmap size correctly for KASLR and 5-Level paging.
Otherwise this can end up with a way too small vmemmap area.

- Make KASAN and 5-level paging work again by making sure that all
invalid bits are masked out when computing the P4D offset. This
worked before but got broken recently when the LDT remap area was
moved.

- Prevent a NULL pointer dereference in the resource control code
which can be triggered with certain mount options when the
requested resource is not available.

- Enforce ordering of microcode loading vs. perf initialization on
secondary CPUs. Otherwise perf tries to access a non-existing MSR
as the boot CPU marked it as available.

- Don't stop the resource control group walk early otherwise the
control bitmaps are not updated correctly and become inconsistent.

- Unbreak kgdb by returning 0 on success from
kgdb_arch_set_breakpoint() instead of an error code.

- Add more Icelake CPU model defines so depending changes can be
queued in other trees"

* 'x86-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
x86/microcode, cpuhotplug: Add a microcode loader CPU hotplug callback
x86/kasan: Fix boot with 5-level paging and KASAN
x86/fpu: Don't use current->mm to check for a kthread
x86/kgdb: Return 0 from kgdb_arch_set_breakpoint()
x86/resctrl: Prevent NULL pointer dereference when local MBM is disabled
x86/resctrl: Don't stop walking closids when a locksetup group is found
x86/fpu: Update kernel's FPU state before using for the fsave header
x86/mm/KASLR: Compute the size of the vmemmap section properly
x86/fpu: Use fault_in_pages_writeable() for pre-faulting
x86/CPU: Add more Icelake model numbers
mm/vmalloc: Avoid rare case of flushing TLB with weird arguments
mm/vmalloc: Fix calculation of direct map addr range

6y ago

Christophe Leroy

b7f8b440

powerpc/32s: fix initial setup of segment registers on secondary CPU

6y ago

Linus Torvalds

f8b5c722

Merge tag 'arc-5.2-rc7' of git://git.kernel.org/pub/scm/linux/kernel/git/vgupta/arc

6y ago

Vinod Koul

8f9fab48

linux/kernel.h: fix overflow for DIV_ROUND_UP_ULL

6y ago

Linux 5.2-rc7 v5.2-rc7

6fbc7275

Linus Torvalds

Merge tag 'powerpc-5.2-7' of git://git.kernel.org/pub/scm/linux/kernel/git/powerpc/linux

39132f74

Linus Torvalds

Merge branch 'smp-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

7c15f41e

Linus Torvalds

powerpc/64s/exception: Fix machine check early corrupting AMR

e13e7cd4

Nicholas Piggin

Merge branch 'x86-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

72825454

Linus Torvalds

cpu/hotplug: Fix out-of-bounds read when setting fail state

33d4a5a7

Eiichi Tsukata

KVM: PPC: Book3S HV: Invalidate ERAT when flushing guest TLB entries

50087112

Suraj Jitindar Singh

Merge branch 'perf-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

57103eb7

Linus Torvalds

x86/unwind/orc: Fall back to using frame pointers for generated code

ae6a45a0

Josh Poimboeuf

cpu/speculation: Warn on unsupported mitigations= parameter

1bf72720

Geert Uytterhoeven

powerpc: enable a 30-bit ZONE_DMA for 32-bit pmac

9739ab7e

Christoph Hellwig

Merge branch 'irq-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

eed7d30e

Linus Torvalds

perf/x86/regs: Use PERF_REG_EXTENDED_MASK

8b12b812

Kan Liang

perf/x86: Always store regs->ip in perf_callchain_kernel()

83f44ae0

Song Liu

Linux 5.2-rc6 v5.2-rc6

4b972a01

Linus Torvalds

KVM: PPC: Book3S HV: Only write DAWR[X] when handling h_set_dawr in real mode

84b02824

Suraj Jitindar Singh

Merge branch 'efi-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

a7211bc9

Linus Torvalds

Merge tag 'irqchip-5.2-2' of git://git.kernel.org/pub/scm/linux/kernel/git/maz/arm-platforms into irq/urgent

a52548dd

Thomas Gleixner

perf/x86: Remove pmu->pebs_no_xmm_regs

cd6b984f

Kan Liang

x86/speculation: Allow guests to use SSBD even if host does not

c1f7fec1

Alejandro Jimenez

Merge tag 'iommu-fix-v5.2-rc5' of git://git.kernel.org/pub/scm/linux/kernel/git/joro/iommu

6698a71a

Linus Torvalds

KVM: PPC: Book3S HV: Fix r3 corruption in h_set_dabr()

fabb2efc

Michael Neuling

Merge tag 'pm-5.2-rc7' of git://git.kernel.org/pub/scm/linux/kernel/git/rafael/linux-pm

2407e486

Linus Torvalds

Merge tag 'efi-urgent' of git://git.kernel.org/pub/scm/linux/kernel/git/efi/efi into efi/urgent

48c7d73b

Thomas Gleixner

Merge tag 'irqchip-5.2' of git://git.kernel.org/pub/scm/linux/kernel/git/maz/arm-platforms into irq/core

fb4e0592

Thomas Gleixner

irqchip/gic-v3-its: Fix command queue pointer comparison bug

a050fa54

Heyi Guo

perf/x86: Clean up PEBS_XMM_REGS

dce86ac7

Kan Liang

x86/mm: Handle physical-virtual alignment mismatch in phys_p4d_init()

432c8332

Kirill A. Shutemov

Merge tag 'pci-v5.2-fixes-1' of git://git.kernel.org/pub/scm/linux/kernel/git/helgaas/pci

b253d5f3

Linus Torvalds

Revert "iommu/vt-d: Fix lock inversion between iommu->lock and device_domain_lock"

0aafc8ae

Peter Xu

powerpc/32: fix build failure on book3e with KVM

82f6e266

Christophe Leroy

Merge tag 'xarray-5.2-rc6' of git://git.infradead.org/users/willy/linux-dax

01305db8

Linus Torvalds

PCI: PM: Avoid skipping bus-level PM on platforms without ACPI

471a739a

Rafael J. Wysocki

efi: Allow the number of EFI configuration tables entries to be zero

88447c5b

Rob Bradford

efibc: Replace variable set function in notifier call

975a6166

Tian Baofeng

irq_work: Do not raise an IPI when queueing work on the local CPU

471ba0e6

Nicholas Piggin

iommu/dma-iommu: Remove iommu_dma_map_msi_msg()

16e32c3c

Julien Grall

irqchip/mips-gic: Use the correct local interrupt map registers

6d4d367d

Paul Burton

perf/x86/regs: Check reserved bits

90d42491

Kan Liang

x86/boot/64: Add missing fixup_pointer() for next_early_pgt access

c1887159

Kirill A. Shutemov

Merge tag 'scsi-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/jejb/scsi

f4102766

Linus Torvalds

PCI/P2PDMA: Ignore root complex whitelist when an IOMMU is present

6dbbd053

Logan Gunthorpe

Linux 5.2-rc5 v5.2-rc5

9e0babf2

Linus Torvalds

powerpc/booke: fix fast syscall entry on SMP

e8732ffa

Christophe Leroy

Merge branch 'akpm' (patches from Andrew)

0839c537

Linus Torvalds

XArray tests: Add check_insert

12fd2aee

Matthew Wilcox

efi/x86/Add missing error handling to old_memmap 1:1 mapping code

4e78921b

Gen Zhang

x86/efi: fix a -Wtype-limits compilation warning

919aef44

Qian Cai

genirq/devres: Use struct_size() in devm_kzalloc()

2d65c42b

Gustavo A. R. Silva

irqchip/gic-v3-mbi: Don't map the MSI page in mbi_compose_m{b, s}i_msg()

73103975

Julien Grall

irqchip/ti-sci-inta: Fix kernel crash if irq_create_fwspec_mapping fail

eb737b8f

Peter Ujfalusi

perf/x86: Disable extended registers for non-supported PMUs

e321d02d

Kan Liang

x86/boot/64: Fix crash if kernel image crosses page table boundary

81c7ed29

Kirill A. Shutemov

Merge tag 'powerpc-5.2-5' of git://git.kernel.org/pub/scm/linux/kernel/git/powerpc/linux

Pull powerpc fixes from Michael Ellerman:
"This is a frustratingly large batch at rc5. Some of these were sent
earlier but were missed by me due to being distracted by other things,
and some took a while to track down due to needing manual bisection on
old hardware. But still we clearly need to improve our testing of KVM,
and of 32-bit, so that we catch these earlier.

Summary: seven fixes, all for bugs introduced this cycle.

- The commit to add KASAN support broke booting on 32-bit SMP
machines, due to a refactoring that moved some setup out of the
secondary CPU path.

- A fix for another 32-bit SMP bug introduced by the fast syscall
entry implementation for 32-bit BOOKE. And a build fix for the same
commit.

- Our change to allow the DAWR to be force enabled on Power9
introduced a bug in KVM, where we clobber r3 leading to a host
crash.

- The same commit also exposed a previously unreachable bug in the
nested KVM handling of DAWR, which could lead to an oops in a
nested host.

- One of the DMA reworks broke the b43legacy WiFi driver on some
people's powermacs, fix it by enabling a 30-bit ZONE_DMA on 32-bit.

- A fix for TLB flushing in KVM introduced a new bug, as it neglected
to also flush the ERAT, this could lead to memory corruption in the
guest.

Thanks to: Aaro Koskinen, Christoph Hellwig, Christophe Leroy, Larry
Finger, Michael Neuling, Suraj Jitindar Singh"

* tag 'powerpc-5.2-5' of git://git.kernel.org/pub/scm/linux/kernel/git/powerpc/linux:
KVM: PPC: Book3S HV: Invalidate ERAT when flushing guest TLB entries
powerpc: enable a 30-bit ZONE_DMA for 32-bit pmac
KVM: PPC: Book3S HV: Only write DAWR[X] when handling h_set_dawr in real mode
KVM: PPC: Book3S HV: Fix r3 corruption in h_set_dabr()
powerpc/32: fix build failure on book3e with KVM
powerpc/booke: fix fast syscall entry on SMP
powerpc/32s: fix initial setup of segment registers on secondary CPU