Linux kernel mirror (for testing) git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git
kernel os linux

ipv6: fix memory leak in fib6_rule_suppress

The kernel leaks memory when a `fib` rule is present in IPv6 nftables
firewall rules and a suppress_prefix rule is present in the IPv6 routing
rules (used by certain tools such as wg-quick). In such scenarios, every
incoming packet will leak an allocation in `ip6_dst_cache` slab cache.

After some hours of `bpftrace`-ing and source code reading, I tracked
down the issue to ca7a03c41753 ("ipv6: do not free rt if
FIB_LOOKUP_NOREF is set on suppress rule").

The problem with that change is that the generic `args->flags` always have
`FIB_LOOKUP_NOREF` set[1][2] but the IPv6-specific flag
`RT6_LOOKUP_F_DST_NOREF` might not be, leading to `fib6_rule_suppress` not
decreasing the refcount when needed.

How to reproduce:
- Add the following nftables rule to a prerouting chain:
meta nfproto ipv6 fib saddr . mark . iif oif missing drop
This can be done with:
sudo nft create table inet test
sudo nft create chain inet test test_chain '{ type filter hook prerouting priority filter + 10; policy accept; }'
sudo nft add rule inet test test_chain meta nfproto ipv6 fib saddr . mark . iif oif missing drop
- Run:
sudo ip -6 rule add table main suppress_prefixlength 0
- Watch `sudo slabtop -o | grep ip6_dst_cache` to see memory usage increase
with every incoming ipv6 packet.

This patch exposes the protocol-specific flags to the protocol
specific `suppress` function, and check the protocol-specific `flags`
argument for RT6_LOOKUP_F_DST_NOREF instead of the generic
FIB_LOOKUP_NOREF when decreasing the refcount, like this.

[1]: https://github.com/torvalds/linux/blob/ca7a03c4175366a92cee0ccc4fec0038c3266e26/net/ipv6/fib6_rules.c#L71
[2]: https://github.com/torvalds/linux/blob/ca7a03c4175366a92cee0ccc4fec0038c3266e26/net/ipv6/fib6_rules.c#L99

Link: https://bugzilla.kernel.org/show_bug.cgi?id=215105
Fixes: ca7a03c41753 ("ipv6: do not free rt if FIB_LOOKUP_NOREF is set on suppress rule")
Cc: stable@vger.kernel.org
Signed-off-by: Jason A. Donenfeld <Jason@zx2c4.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

authored by

msizanoen1 and committed by
David S. Miller
cdef4852 d00a50cf

+7 -4
+3 -1
include/net/fib_rules.h
··· 69 69 int (*action)(struct fib_rule *, 70 70 struct flowi *, int, 71 71 struct fib_lookup_arg *); 72 - bool (*suppress)(struct fib_rule *, 72 + bool (*suppress)(struct fib_rule *, int, 73 73 struct fib_lookup_arg *); 74 74 int (*match)(struct fib_rule *, 75 75 struct flowi *, int); ··· 218 218 struct fib_lookup_arg *arg)); 219 219 220 220 INDIRECT_CALLABLE_DECLARE(bool fib6_rule_suppress(struct fib_rule *rule, 221 + int flags, 221 222 struct fib_lookup_arg *arg)); 222 223 INDIRECT_CALLABLE_DECLARE(bool fib4_rule_suppress(struct fib_rule *rule, 224 + int flags, 223 225 struct fib_lookup_arg *arg)); 224 226 #endif
+1 -1
net/core/fib_rules.c
··· 323 323 if (!err && ops->suppress && INDIRECT_CALL_MT(ops->suppress, 324 324 fib6_rule_suppress, 325 325 fib4_rule_suppress, 326 - rule, arg)) 326 + rule, flags, arg)) 327 327 continue; 328 328 329 329 if (err != -EAGAIN) {
+1
net/ipv4/fib_rules.c
··· 141 141 } 142 142 143 143 INDIRECT_CALLABLE_SCOPE bool fib4_rule_suppress(struct fib_rule *rule, 144 + int flags, 144 145 struct fib_lookup_arg *arg) 145 146 { 146 147 struct fib_result *result = (struct fib_result *) arg->result;
+2 -2
net/ipv6/fib6_rules.c
··· 267 267 } 268 268 269 269 INDIRECT_CALLABLE_SCOPE bool fib6_rule_suppress(struct fib_rule *rule, 270 + int flags, 270 271 struct fib_lookup_arg *arg) 271 272 { 272 273 struct fib6_result *res = arg->result; ··· 295 294 return false; 296 295 297 296 suppress_route: 298 - if (!(arg->flags & FIB_LOOKUP_NOREF)) 299 - ip6_rt_put(rt); 297 + ip6_rt_put_flags(rt, flags); 300 298 return true; 301 299 } 302 300