x86/uaccess: Zero the 8-byte get_range case on failure on 32-bit

While zeroing the upper 32 bits of an 8-byte getuser on 32-bit x86 was
fixed by commit 8c860ed825cb ("x86/uaccess: Fix missed zeroing of ia32 u64
get_user() range checking") it was broken again in commit 8a2462df1547
("x86/uaccess: Improve the 8-byte getuser() case").

This is because the register which holds the upper 32 bits (%ecx) is being
cleared _after_ the check_range, so if the range check fails, %ecx is never
cleared.

This can be reproduced with:
./tools/testing/kunit/kunit.py run --arch i386 usercopy

Instead, clear %ecx _before_ check_range in the 8-byte case. This
reintroduces a bit of the ugliness we were trying to avoid by adding
another #ifndef CONFIG_X86_64, but at least keeps check_range from needing
a separate bad_get_user_8 jump.

Fixes: 8a2462df1547 ("x86/uaccess: Improve the 8-byte getuser() case")
Signed-off-by: David Gow <davidgow@google.com>
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
Acked-by: Linus Torvalds <torvalds@linux-foundation.org>
Link: https://lore.kernel.org/all/20240731073031.4045579-1-davidgow@google.com

authored by David Gow and committed by Thomas Gleixner dd35a093 3db03fb4

Changed files
+3 -1
arch
x86
lib
+3 -1
arch/x86/lib/getuser.S
··· 88 88 EXPORT_SYMBOL(__get_user_4) 89 89 90 90 SYM_FUNC_START(__get_user_8) 91 + #ifndef CONFIG_X86_64 92 + xor %ecx,%ecx 93 + #endif 91 94 check_range size=8 92 95 ASM_STAC 93 96 #ifdef CONFIG_X86_64 94 97 UACCESS movq (%_ASM_AX),%rdx 95 98 #else 96 - xor %ecx,%ecx 97 99 UACCESS movl (%_ASM_AX),%edx 98 100 UACCESS movl 4(%_ASM_AX),%ecx 99 101 #endif