commits

When the system runs out of enclave memory, SGX can reclaim EPC pages
by swapping to normal RAM. These backing pages are allocated via a
per-enclave shared memory area. Since SGX allows unlimited
overcommit on EPC memory, the reclaimer thread can allocate a large
number of backing RAM pages in response to EPC memory pressure.

When the shared memory backing RAM allocation occurs in
the reclaimer thread context, the shared memory is charged to
the root memory control group, and the shmem usage of the enclave
is not properly accounted for, making cgroups ineffective at
limiting the amount of RAM an enclave can consume.

For example, when using a cgroup to launch a set of test
enclaves, the kernel does not properly account for 50% - 75% of
shmem page allocations on average. In the worst case, when
nearly all allocations occur in the reclaimer thread, the
kernel charges less than one percent of the shmem used by the
enclave to the correct cgroup.

SGX stores a list of mm_structs that are associated with
an enclave. Pick one of them during reclaim and charge that
mm's memcg with the shmem allocation. The one that gets picked
is arbitrary, but this list almost always only has one mm. The
cases where there is more than one mm with different memcgs
are not worth considering.

Create a new function - sgx_encl_alloc_backing(). This function
is used whenever a new backing storage page needs to be
allocated. Previously the same function was used for page
allocation as well as retrieving a previously allocated page.
Prior to backing page allocation, if there is a mm_struct associated
with the enclave that is requesting the allocation, it is set
as the active memory control group.
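
The charging scheme described above can be modeled in user space.
This is only a sketch under stated assumptions: every type and
function name below (memcg, mm, enclave, set_active(),
alloc_backing_page(), encl_alloc_backing()) is a hypothetical
stand-in and not the kernel's API. It shows the pattern of picking
an mm from the enclave's list, making its memcg the active one for
the duration of the allocation, and restoring the previous value
afterwards.

```c
#include <assert.h>
#include <stddef.h>

#define MAX_MMS 4

/* Hypothetical stand-ins for the kernel objects; illustration only. */
struct memcg { const char *name; unsigned long charged_pages; };
struct mm { struct memcg *memcg; };
struct enclave { struct mm *mm_list[MAX_MMS]; size_t nr_mm; };

struct memcg root_memcg = { "root", 0 };
struct memcg *active_memcg = NULL;	/* NULL: charges fall to the root */

/* Install a new charge target and return the previous one. */
struct memcg *set_active(struct memcg *cg)
{
	struct memcg *old = active_memcg;

	active_memcg = cg;
	return old;
}

/* Model of a backing page allocation: charge the active memcg or root. */
void alloc_backing_page(void)
{
	struct memcg *cg = active_memcg ? active_memcg : &root_memcg;

	cg->charged_pages++;
}

/* Pick the first mm on the list that has a memcg; the choice is arbitrary. */
struct memcg *enclave_memcg(const struct enclave *encl)
{
	assert(encl->nr_mm <= MAX_MMS);
	for (size_t i = 0; i < encl->nr_mm; i++)
		if (encl->mm_list[i] && encl->mm_list[i]->memcg)
			return encl->mm_list[i]->memcg;
	return NULL;
}

/* Allocate one backing page charged to the enclave's memcg, if any. */
void encl_alloc_backing(const struct enclave *encl)
{
	struct memcg *old = set_active(enclave_memcg(encl));

	alloc_backing_page();
	set_active(old);	/* always restore the caller's target */
}
```

In this model, an enclave with no associated mm leaves the active
target unset, so the page is still charged to the root, which
matches the behavior the text describes for the unfixed case.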

[ dhansen: - fix merge conflict with ELDU fixes
- check against actual ksgxd_tsk, not ->mm ]

Cc: stable@vger.kernel.org
Signed-off-by: Kristen Carlson Accardi <kristen@linux.intel.com>
Signed-off-by: Dave Hansen <dave.hansen@linux.intel.com>
Reviewed-by: Shakeel Butt <shakeelb@google.com>
Acked-by: Roman Gushchin <roman.gushchin@linux.dev>
Link: https://lkml.kernel.org/r/20220520174248.4918-1-kristen@linux.intel.com

3y ago

Linus Torvalds

55fe9217

Merge tag 'i3c/for-5.19' of git://git.kernel.org/pub/scm/linux/kernel/git/i3c/linux

3y ago

Fabio Estevam

f78e3d40

rtc: mxc: Silence a clang warning

3y ago

Zi Yan

547be963

mm: page_isolation: use compound_nr() correctly in isolate_single_pageblock()

3y ago

Linus Torvalds

31231092

Linux 5.18-rc1 v5.18-rc1

3y ago

Linus Torvalds

9784edd7

Merge tag 'x86-microcode-2022-06-05' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

3y ago

Fanjun Kong

e19d1126

x86/mm: Use PAGE_ALIGNED(x) instead of IS_ALIGNED(x, PAGE_SIZE)

3y ago

Linus Torvalds

17d8e3d9

Merge tag 'ceph-for-5.19-rc1' of https://github.com/ceph/ceph-client

3y ago

Linus Torvalds

fa78526a

Merge tag 'for-5.19/dm-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/device-mapper/linux-dm

3y ago

Lukas Bulwahn

66ed42ca

MAINTAINERS: rectify entries for some i3c drivers after dt conversion

3y ago

Miquel Raynal

3f348924

rtc: rzn1: Fix a variable type

3y ago

Muchun Song

0111def9

mm: hugetlb_vmemmap: fix CONFIG_HUGETLB_PAGE_FREE_VMEMMAP_DEFAULT_ON

3y ago

Linus Torvalds

09bb8856

Merge tag 'trace-v5.18-2' of git://git.kernel.org/pub/scm/linux/kernel/git/rostedt/linux-trace

3y ago

Linus Torvalds

a9251280

Merge tag 'x86-cleanups-2022-06-05' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

3y ago

Borislav Petkov

0c0fe08c

x86/microcode: Remove unnecessary perf callback

3y ago

Linus Torvalds

6f3f04c1

Merge tag 'sched-core-2022-05-23' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

3y ago

Linus Torvalds

7c9e960c

Merge tag 'livepatching-for-5.19' of git://git.kernel.org/pub/scm/linux/kernel/git/livepatching/livepatching

3y ago

Jeff Layton

af7dc8e5

MAINTAINERS: move myself from ceph "Maintainer" to "Reviewer"

3y ago

Linus Torvalds

c7993147

Merge tag 'for-v5.19' of git://git.kernel.org/pub/scm/linux/kernel/git/sre/linux-power-supply

3y ago

Sarthak Kukreti

4caae584

dm verity: set DM_TARGET_IMMUTABLE feature flag

3y ago

Guo Zhengkui

227fab1e

i3c: master: svc: fix returnvar.cocci warning

3y ago

Dan Carpenter

0b6da785

rtc: rzn1: Fix error code in probe

3y ago

Miaohe Lin

273aea95

MAINTAINERS: add maintainer information for z3fold

3y ago

Linus Torvalds

34a53ff9

Merge tag 'clk-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/clk/linux

3y ago

Steven Rostedt (Google)

5cfff569

tracing: Move user_events.h temporarily out of include/uapi

3y ago

Linus Torvalds

1fd9f4ce

Merge tag 'x86-boot-2022-06-05' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

3y ago

Bo Liu

f7081834

x86: Fix all occurences of the "the the" typo

3y ago

Borislav Petkov

d23d33ea

x86/microcode: Taint and warn on late loading

3y ago

Linus Torvalds

cfeb2522

Merge tag 'perf-core-2022-05-23' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

Pull perf events updates from Ingo Molnar:
"Platform PMU changes:

- x86/intel:
- Add new Intel Alder Lake and Raptor Lake support

- x86/amd:
- AMD Zen4 IBS extensions support
- Add AMD PerfMonV2 support
- Add AMD Fam19h Branch Sampling support

Generic changes:

- signal: Deliver SIGTRAP on perf event asynchronously if blocked

Perf instrumentation can be driven via SIGTRAP, but this causes a
problem when SIGTRAP is blocked by a task: forcing the signal
terminates the task.

Allow user-space to request these signals asynchronously (after
they get unblocked) & also give the information to the signal
handler when this happens:

"To give user space the ability to clearly distinguish
synchronous from asynchronous signals, introduce
siginfo_t::si_perf_flags and TRAP_PERF_FLAG_ASYNC (opted for
flags in case more binary information is required in future).

The resolution to the problem is then to (a) no longer force the
signal (avoiding the terminations), but (b) tell user space via
si_perf_flags if the signal was synchronous or not, so that such
signals can be handled differently (e.g. let user space decide
to ignore or consider the data imprecise). "

- Unify/standardize the /sys/devices/cpu/events/* output format.

- Misc fixes & cleanups"

* tag 'perf-core-2022-05-23' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip: (32 commits)
perf/x86/amd/core: Fix reloading events for SVM
perf/x86/amd: Run AMD BRS code only on supported hw
perf/x86/amd: Fix AMD BRS period adjustment
perf/x86/amd: Remove unused variable 'hwc'
perf/ibs: Fix comment
perf/amd/ibs: Advertise zen4_ibs_extensions as pmu capability attribute
perf/amd/ibs: Add support for L3 miss filtering
perf/amd/ibs: Use ->is_visible callback for dynamic attributes
perf/amd/ibs: Cascade pmu init functions' return value
perf/x86/uncore: Add new Alder Lake and Raptor Lake support
perf/x86/uncore: Clean up uncore_pci_ids[]
perf/x86/cstate: Add new Alder Lake and Raptor Lake support
perf/x86/msr: Add new Alder Lake and Raptor Lake support
perf/x86: Add new Alder Lake and Raptor Lake support
perf/amd/ibs: Use interrupt regs ip for stack unwinding
perf/x86/amd/core: Add PerfMonV2 overflow handling
perf/x86/amd/core: Add PerfMonV2 counter control
perf/x86/amd/core: Detect available counters
perf/x86/amd/core: Detect PerfMonV2 support
x86/msr: Add PerfCntrGlobal* registers
...

3y ago

Dietmar Eggemann

991d8d81

topology: Remove unused cpu_cluster_mask()

3y ago

Linus Torvalds

12831f64

Merge tag 'printk-for-5.19-fixup' of git://git.kernel.org/pub/scm/linux/kernel/git/printk/linux

3y ago

Christophe Leroy

5d7c8545

livepatch: Remove klp_arch_set_pc() and asm/livepatch.h

3y ago

Luís Henriques

ea16567f

ceph: fix decoding of client session messages flags

3y ago

Linus Torvalds

96752be4

Merge tag 'linux-watchdog-5.19-rc1' of git://www.linux-watchdog.org/linux-watchdog

3y ago

Sebastian Reichel

da50aad6

Merge power-supply 'fixes' branch

3y ago

Mike Snitzer

9571f829

dm table: fix dm_table_supports_poll to return false if no data devices

3y ago

Minghao Chi

c157a606

i3c/master: simplify the return expression of i3c_hci_remove()

3y ago

Miquel Raynal

64d69b5d

rtc: rzn1: Avoid mixing variables

3y ago

Josh Poimboeuf

1ff810c1

mailmap: update Josh Poimboeuf's email

3y ago

Linus Torvalds

8b5656bc

Merge tag 'x86-urgent-2022-04-03' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

3y ago

Stephen Boyd

859c2c7b

Revert "clk: Drop the rate range on clk_put()"

3y ago

Christophe Leroy

18bfee32

ftrace: Make ftrace_graph_is_dead() a static branch

3y ago

Linus Torvalds

c049ecc5

Merge tag 'timers-core-2022-06-05' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

3y ago

XueBing Chen

8a33d96b

x86/setup: Use strscpy() to replace deprecated strlcpy()

3y ago

Linux 5.19-rc1 v5.19-rc1

f2906aa8

Linus Torvalds

Merge tag 'pull-work.fd-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs

6684cf42

Linus Torvalds

Merge tag 'mm-hotfixes-stable-2022-06-05' of git://git.kernel.org/pub/scm/linux/kernel/git/akpm/mm

815b196c

Linus Torvalds

fix the breakage in close_fd_get_file() calling conventions change

40a19260

Al Viro

Merge tag 'mm-nonmm-stable-2022-06-05' of git://git.kernel.org/pub/scm/linux/kernel/git/akpm/mm

e17fee89

Linus Torvalds

mm/oom_kill.c: fix vm_oom_kill_table[] ifdeffery

a19cad06

Andrew Morton

Unify the primitives for file descriptor closing

6319194e

Al Viro

bluetooth: don't use bitmaps for random flag accesses

The bluetooth code uses our bitmap infrastructure for the two bits (!)
of connection setup flags, and in the process causes odd problems when
it converts between a bitmap and just the regular values of said bits.

It's completely pointless to do things like bitmap_to_arr32() to convert
a bitmap into a u32. It shouldn't have been a bitmap in the first
place. The reason to use bitmaps is if you have arbitrary number of
bits you want to manage (not two!), or if you rely on the atomicity
guarantees of the bitmap setting and clearing.

The code could use an "atomic_t" and use "atomic_or/andnot()" to set and
clear the bit values, but considering that it then copies the bitmaps
around with "bitmap_to_arr32()" and friends, there clearly cannot be a
lot of atomicity requirements.

So just use a regular integer.
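
As a rough user-space illustration of the "regular integer" approach,
the sketch below keeps two flags in a plain 32-bit value and
manipulates them with ordinary bitwise operators. The flag names and
helper functions are hypothetical and chosen for this example; they
are not the identifiers used in the bluetooth code.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical flag masks, one bit each; names are for illustration. */
enum {
	FLAG_WAKEUP  = 1u << 0,
	FLAG_PRIVACY = 1u << 1,
	FLAGS_ALL    = FLAG_WAKEUP | FLAG_PRIVACY,
};

/* Return flags with the bits in mask set. */
uint32_t flags_set(uint32_t flags, uint32_t mask)
{
	assert((mask & ~(uint32_t)FLAGS_ALL) == 0);	/* reject unknown bits */
	return flags | mask;
}

/* Return flags with the bits in mask cleared. */
uint32_t flags_clear(uint32_t flags, uint32_t mask)
{
	assert((mask & ~(uint32_t)FLAGS_ALL) == 0);
	return flags & ~mask;
}

/* True if any bit in mask is set in flags. */
bool flags_test(uint32_t flags, uint32_t mask)
{
	return (flags & mask) != 0;
}
```

Because the whole flag set is a single u32, copying it or passing it
across an interface is a plain assignment and needs no conversion
helper.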

In the process, this avoids the warnings about erroneous use of
bitmap_from_u64() which were triggered on 32-bit architectures when
conversion from a u64 would access two words (and, surprise, surprise,
only one word is needed - and indeed overkill - for a 2-bit bitmap).
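
The word-count arithmetic behind that warning can be checked with a
small helper. This is a sketch, and words_for_bits() is a
hypothetical name: it rounds a bit count up to whole storage words,
so a 2-bit bitmap occupies one 32-bit word while a 64-bit source
value spans two.

```c
#include <assert.h>
#include <stddef.h>

/* Number of storage words needed to hold nbits, rounding up. */
size_t words_for_bits(size_t nbits, size_t bits_per_word)
{
	assert(bits_per_word > 0);
	return (nbits + bits_per_word - 1) / bits_per_word;
}
```

With 32-bit words, words_for_bits(2, 32) is 1 and
words_for_bits(64, 32) is 2, so storing a full u64 into a 2-bit
bitmap would touch a second word that the bitmap does not own. With
64-bit words both counts are 1, which is consistent with the warning
showing up only on 32-bit architectures.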

That was always problematic, but the compiler seems to notice it and
warn about the invalid pattern only after commit 0a97953fd221 ("lib: add
bitmap_{from,to}_arr64") changed the exact implementation details of
'bitmap_from_u64()', as reported by Sudip Mukherjee and Stephen Rothwell.

Fixes: fe92ee6425a2 ("Bluetooth: hci_core: Rework hci_conn_params flags")
Link: https://lore.kernel.org/all/YpyJ9qTNHJzz0FHY@debian/
Link: https://lore.kernel.org/all/20220606080631.0c3014f2@canb.auug.org.au/
Link: https://lore.kernel.org/all/20220605162537.1604762-1-yury.norov@gmail.com/
Reported-by: Stephen Rothwell <sfr@canb.auug.org.au>
Reported-by: Sudip Mukherjee <sudipm.mukherjee@gmail.com>
Reviewed-by: Yury Norov <yury.norov@gmail.com>
Cc: Luiz Augusto von Dentz <luiz.von.dentz@intel.com>
Cc: Marcel Holtmann <marcel@holtmann.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>