Linux kernel mirror (for testing) git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git

Documentation/x86: Rename resctrl_ui.rst and add two errata to the file

Intel Memory Bandwidth Monitoring (MBM) counters may report system
memory bandwidth incorrectly on some Intel processors. This is
documented in erratum SKX99, erratum BDF102 and in the RDT reference
manual, see Documentation/x86/index.rst.

To work around the errata, MBM total and local readings are corrected
using a correction factor table.

Since the correction factor table is not publicly documented anywhere,
document the table and the errata in Documentation/x86/resctrl.rst for
future reference.

[ bp: Move web links to the doc, massage. ]

Suggested-by: Borislav Petkov <bp@alien8.de>
Signed-off-by: Fenghua Yu <fenghua.yu@intel.com>
Signed-off-by: Borislav Petkov <bp@suse.de>
Reviewed-by: Tony Luck <tony.luck@intel.com>
Link: https://lkml.kernel.org/r/20201014004927.1839452-2-fenghua.yu@intel.com

Authored by Fenghua Yu and committed by Borislav Petkov
d1b22e36 3650b228

+94 -1
+1 -1
Documentation/x86/index.rst
···
 pti
 mds
 microcode
-resctrl_ui
+resctrl
 tsx_async_abort
 usb-legacy-support
 i386/index
+93
Documentation/x86/resctrl_ui.rst Documentation/x86/resctrl.rst
···

 # cat /sys/fs/resctrl/p1/mon_data/mon_L3_00/llc_occupancy
 11234000
+
+Intel RDT Errata
+================
+
+Intel MBM Counters May Report System Memory Bandwidth Incorrectly
+-----------------------------------------------------------------
+
+Errata SKX99 for Skylake server and BDF102 for Broadwell server.
+
+Problem: Intel Memory Bandwidth Monitoring (MBM) counters track metrics
+according to the assigned Resource Monitor ID (RMID) for that logical
+core. The IA32_QM_CTR register (MSR 0xC8E), used to report these
+metrics, may report incorrect system bandwidth for certain RMID values.
+
+Implication: Due to the errata, system memory bandwidth may not match
+what is reported.
+
+Workaround: MBM total and local readings are corrected according to the
+following correction factor table:
+
++---------------+---------------+---------------+-----------------+
+|core count     |rmid count     |rmid threshold |correction factor|
++---------------+---------------+---------------+-----------------+
+|1              |8              |0              |1.000000         |
++---------------+---------------+---------------+-----------------+
+|2              |16             |0              |1.000000         |
++---------------+---------------+---------------+-----------------+
+|3              |24             |15             |0.969650         |
++---------------+---------------+---------------+-----------------+
+|4              |32             |0              |1.000000         |
++---------------+---------------+---------------+-----------------+
+|6              |48             |31             |0.969650         |
++---------------+---------------+---------------+-----------------+
+|7              |56             |47             |1.142857         |
++---------------+---------------+---------------+-----------------+
+|8              |64             |0              |1.000000         |
++---------------+---------------+---------------+-----------------+
+|9              |72             |63             |1.185115         |
++---------------+---------------+---------------+-----------------+
+|10             |80             |63             |1.066553         |
++---------------+---------------+---------------+-----------------+
+|11             |88             |79             |1.454545         |
++---------------+---------------+---------------+-----------------+
+|12             |96             |0              |1.000000         |
++---------------+---------------+---------------+-----------------+
+|13             |104            |95             |1.230769         |
++---------------+---------------+---------------+-----------------+
+|14             |112            |95             |1.142857         |
++---------------+---------------+---------------+-----------------+
+|15             |120            |95             |1.066667         |
++---------------+---------------+---------------+-----------------+
+|16             |128            |0              |1.000000         |
++---------------+---------------+---------------+-----------------+
+|17             |136            |127            |1.254863         |
++---------------+---------------+---------------+-----------------+
+|18             |144            |127            |1.185255         |
++---------------+---------------+---------------+-----------------+
+|19             |152            |0              |1.000000         |
++---------------+---------------+---------------+-----------------+
+|20             |160            |127            |1.066667         |
++---------------+---------------+---------------+-----------------+
+|21             |168            |0              |1.000000         |
++---------------+---------------+---------------+-----------------+
+|22             |176            |159            |1.454334         |
++---------------+---------------+---------------+-----------------+
+|23             |184            |0              |1.000000         |
++---------------+---------------+---------------+-----------------+
+|24             |192            |127            |0.969744         |
++---------------+---------------+---------------+-----------------+
+|25             |200            |191            |1.280246         |
++---------------+---------------+---------------+-----------------+
+|26             |208            |191            |1.230921         |
++---------------+---------------+---------------+-----------------+
+|27             |216            |0              |1.000000         |
++---------------+---------------+---------------+-----------------+
+|28             |224            |191            |1.143118         |
++---------------+---------------+---------------+-----------------+
+
+If rmid > rmid threshold, MBM total and local values should be multiplied
+by the correction factor.
+
+See:
+
+1. Erratum SKX99 in Intel Xeon Processor Scalable Family Specification Update:
+   http://web.archive.org/web/20200716124958/https://www.intel.com/content/www/us/en/processors/xeon/scalable/xeon-scalable-spec-update.html
+
+2. Erratum BDF102 in Intel Xeon E5-2600 v4 Processor Product Family Specification Update:
+   http://web.archive.org/web/20191125200531/https://www.intel.com/content/dam/www/public/us/en/documents/specification-updates/xeon-e5-v4-spec-update.pdf
+
+3. The errata in Intel Resource Director Technology (Intel RDT) on 2nd Generation Intel Xeon Scalable Processors Reference Manual:
+   https://software.intel.com/content/www/us/en/develop/articles/intel-resource-director-technology-rdt-reference-manual.html
+
+for further information.
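
The workaround rule documented in this change (multiply the MBM total and local readings by the correction factor when the RMID exceeds the threshold) can be sketched as follows. This is a minimal illustration in Python, not the kernel's implementation: the names `MBM_CORRECTION` and `corrected_mbm` are hypothetical, floating point is used purely for readability, and only the table values are taken from the documentation added here.

```python
# Hypothetical sketch of the documented workaround; names are illustrative.
# Maps core count -> (rmid threshold, correction factor), copied from the
# correction factor table in Documentation/x86/resctrl.rst.
MBM_CORRECTION = {
    1: (0, 1.000000),
    2: (0, 1.000000),
    3: (15, 0.969650),
    4: (0, 1.000000),
    6: (31, 0.969650),
    7: (47, 1.142857),
    8: (0, 1.000000),
    9: (63, 1.185115),
    10: (63, 1.066553),
    11: (79, 1.454545),
    12: (0, 1.000000),
    13: (95, 1.230769),
    14: (95, 1.142857),
    15: (95, 1.066667),
    16: (0, 1.000000),
    17: (127, 1.254863),
    18: (127, 1.185255),
    19: (0, 1.000000),
    20: (127, 1.066667),
    21: (0, 1.000000),
    22: (159, 1.454334),
    23: (0, 1.000000),
    24: (127, 0.969744),
    25: (191, 1.280246),
    26: (191, 1.230921),
    27: (0, 1.000000),
    28: (191, 1.143118),
}


def corrected_mbm(core_count, rmid, raw_value):
    """Return an MBM total or local reading with the correction applied.

    The reading is scaled only when rmid is strictly greater than the
    threshold for the given core count; otherwise it is returned as is.
    """
    threshold, factor = MBM_CORRECTION[core_count]
    if rmid > threshold:
        return raw_value * factor
    return raw_value
```

Rows whose factor is 1.000000 leave the reading unchanged for every RMID. The table has no row for a core count of 5, so in this sketch such a lookup raises `KeyError` rather than guessing a factor.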