Linux kernel mirror (for testing) git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git
kernel os linux

perf tools: Replace _SC_NPROCESSORS_CONF with max_present_cpu in cpu_topology_map

There are 2 problems wrt. cpu_topology_map on systems with sparse CPUs:

1. offline/absent CPUs will have their socket_id and core_id set to -1
which triggers:
"socket_id number is too big.You may need to upgrade the perf tool."

2. size of cpu_topology_map (perf_env.cpu[]) is allocated based on
_SC_NPROCESSORS_CONF, but can be indexed with CPU ids going above.
Users of perf_env.cpu[] are using CPU id as index. This can lead
to read beyond what was allocated:
==19991== Invalid read of size 4
==19991== at 0x490CEB: check_cpu_topology (topology.c:69)
==19991== by 0x490CEB: test_session_topology (topology.c:106)
...

For example:
_SC_NPROCESSORS_CONF == 16
available: 2 nodes (0-1)
node 0 cpus: 0 6 8 10 16 22 24 26
node 0 size: 12004 MB
node 0 free: 9470 MB
node 1 cpus: 1 7 9 11 23 25 27
node 1 size: 12093 MB
node 1 free: 9406 MB
node distances:
node 0 1
0: 10 20
1: 20 10

This patch changes HEADER_NRCPUS.nr_cpus_available from _SC_NPROCESSORS_CONF
to max_present_cpu and updates any user of cpu_topology_map to iterate
with nr_cpus_avail.

As a consequence HEADER_CPU_TOPOLOGY core_id and socket_id lists get longer,
but maintain compatibility with pre-patch state - index to cpu_topology_map is
CPU id.

perf test 36 -v
36: Session topology :
--- start ---
test child forked, pid 22211
templ file: /tmp/perf-test-gmdX5i
CPU 0, core 0, socket 0
CPU 1, core 0, socket 1
CPU 6, core 10, socket 0
CPU 7, core 10, socket 1
CPU 8, core 1, socket 0
CPU 9, core 1, socket 1
CPU 10, core 9, socket 0
CPU 11, core 9, socket 1
CPU 16, core 0, socket 0
CPU 22, core 10, socket 0
CPU 23, core 10, socket 1
CPU 24, core 1, socket 0
CPU 25, core 1, socket 1
CPU 26, core 9, socket 0
CPU 27, core 9, socket 1
test child finished with 0
---- end ----
Session topology: Ok

Signed-off-by: Jan Stancek <jstancek@redhat.com>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: Masami Hiramatsu <mhiramat@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Link: http://lkml.kernel.org/r/d7c05c6445fca74a8442c2c73cfffd349c52c44f.1487146877.git.jstancek@redhat.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>

authored by

Jan Stancek and committed by
Arnaldo Carvalho de Melo
da8a58b5 43db2843

+10 -14
+1 -1
tools/perf/builtin-stat.c
··· 1765 1765 1766 1766 cpu = map->map[idx]; 1767 1767 1768 - if (cpu >= env->nr_cpus_online) 1768 + if (cpu >= env->nr_cpus_avail) 1769 1769 return -1; 1770 1770 1771 1771 return cpu;
+3 -1
tools/perf/tests/topology.c
··· 65 65 session = perf_session__new(&file, false, NULL); 66 66 TEST_ASSERT_VAL("can't get session", session); 67 67 68 - for (i = 0; i < session->header.env.nr_cpus_online; i++) { 68 + for (i = 0; i < session->header.env.nr_cpus_avail; i++) { 69 + if (!cpu_map__has(map, i)) 70 + continue; 69 71 pr_debug("CPU %d, core %d, socket %d\n", i, 70 72 session->header.env.cpu[i].core_id, 71 73 session->header.env.cpu[i].socket_id);
+1 -1
tools/perf/util/env.c
··· 66 66 return 0; 67 67 68 68 if (env->nr_cpus_avail == 0) 69 - env->nr_cpus_avail = sysconf(_SC_NPROCESSORS_CONF); 69 + env->nr_cpus_avail = cpu__max_present_cpu(); 70 70 71 71 nr_cpus = env->nr_cpus_avail; 72 72 if (nr_cpus == -1)
+5 -11
tools/perf/util/header.c
··· 295 295 u32 nrc, nra; 296 296 int ret; 297 297 298 - nr = sysconf(_SC_NPROCESSORS_CONF); 299 - if (nr < 0) 300 - return -1; 301 - 302 - nrc = (u32)(nr & UINT_MAX); 298 + nrc = cpu__max_present_cpu(); 303 299 304 300 nr = sysconf(_SC_NPROCESSORS_ONLN); 305 301 if (nr < 0) ··· 509 513 int ret = -1; 510 514 struct cpu_map *map; 511 515 512 - ncpus = sysconf(_SC_NPROCESSORS_CONF); 513 - if (ncpus < 0) 514 - return NULL; 516 + ncpus = cpu__max_present_cpu(); 515 517 516 518 /* build online CPU map */ 517 519 map = cpu_map__new(NULL); ··· 1133 1139 { 1134 1140 int nr, i; 1135 1141 char *str; 1136 - int cpu_nr = ph->env.nr_cpus_online; 1142 + int cpu_nr = ph->env.nr_cpus_avail; 1137 1143 1138 1144 nr = ph->env.nr_sibling_cores; 1139 1145 str = ph->env.sibling_cores; ··· 1788 1794 u32 nr, i; 1789 1795 char *str; 1790 1796 struct strbuf sb; 1791 - int cpu_nr = ph->env.nr_cpus_online; 1797 + int cpu_nr = ph->env.nr_cpus_avail; 1792 1798 u64 size = 0; 1793 1799 1794 1800 ph->env.cpu = calloc(cpu_nr, sizeof(*ph->env.cpu)); ··· 1869 1875 if (ph->needs_swap) 1870 1876 nr = bswap_32(nr); 1871 1877 1872 - if (nr > (u32)cpu_nr) { 1878 + if (nr != (u32)-1 && nr > (u32)cpu_nr) { 1873 1879 pr_debug("socket_id number is too big." 1874 1880 "You may need to upgrade the perf tool.\n"); 1875 1881 goto free_cpu;