Linux kernel mirror (for testing) git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git
kernel os linux

uprobes/core: Decrement uprobe count before the pages are unmapped

Uprobes has a callback (uprobe_munmap()) in the unmap path to
maintain the uprobes count.

In the exit path this callback gets called in unlink_file_vma().
However by the time unlink_file_vma() is called, the pages would
have been unmapped (in unmap_vmas()) and the task->rss_stat counts
accounted (in zap_pte_range()).

If the exiting process has probepoints, uprobe_munmap() checks if
the breakpoint instruction was around before decrementing the probe
count.

This results in a file backed page being reread by uprobe_munmap()
and hence it does not find the breakpoint.

This patch fixes this problem by moving the callback to
unmap_single_vma(). Since unmap_single_vma() may not unmap the
complete vma, add start and end parameters to uprobe_munmap().

This bug became apparent courtesy of commit c3f0327f8e9d
("mm: add rss counters consistency check").

Signed-off-by: Srikar Dronamraju <srikar@linux.vnet.ibm.com>
Cc: Linus Torvalds <torvalds@linux-foundation.org>
Cc: Ananth N Mavinakayanahalli <ananth@in.ibm.com>
Cc: Jim Keniston <jkenisto@linux.vnet.ibm.com>
Cc: Linux-mm <linux-mm@kvack.org>
Cc: Oleg Nesterov <oleg@redhat.com>
Cc: Andi Kleen <andi@firstfloor.org>
Cc: Christoph Hellwig <hch@infradead.org>
Cc: Steven Rostedt <rostedt@goodmis.org>
Cc: Arnaldo Carvalho de Melo <acme@infradead.org>
Cc: Masami Hiramatsu <masami.hiramatsu.pt@hitachi.com>
Cc: Anton Arapov <anton@redhat.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Link: http://lkml.kernel.org/r/20120411103527.23245.9835.sendpatchset@srdronam.in.ibm.com
Signed-off-by: Ingo Molnar <mingo@kernel.org>

authored by

Srikar Dronamraju and committed by
Ingo Molnar
cbc91f71 7396fa81

+12 -8
+3 -2
include/linux/uprobes.h
··· 107 107 extern int uprobe_register(struct inode *inode, loff_t offset, struct uprobe_consumer *uc); 108 108 extern void uprobe_unregister(struct inode *inode, loff_t offset, struct uprobe_consumer *uc); 109 109 extern int uprobe_mmap(struct vm_area_struct *vma); 110 - extern void uprobe_munmap(struct vm_area_struct *vma); 110 + extern void uprobe_munmap(struct vm_area_struct *vma, unsigned long start, unsigned long end); 111 111 extern void uprobe_free_utask(struct task_struct *t); 112 112 extern void uprobe_copy_process(struct task_struct *t); 113 113 extern unsigned long __weak uprobe_get_swbp_addr(struct pt_regs *regs); ··· 134 134 { 135 135 return 0; 136 136 } 137 - static inline void uprobe_munmap(struct vm_area_struct *vma) 137 + static inline void 138 + uprobe_munmap(struct vm_area_struct *vma, unsigned long start, unsigned long end) 138 139 { 139 140 } 140 141 static inline void uprobe_notify_resume(struct pt_regs *regs)
+2 -2
kernel/events/uprobes.c
··· 1112 1112 /* 1113 1113 * Called in context of a munmap of a vma. 1114 1114 */ 1115 - void uprobe_munmap(struct vm_area_struct *vma) 1115 + void uprobe_munmap(struct vm_area_struct *vma, unsigned long start, unsigned long end) 1116 1116 { 1117 1117 struct list_head tmp_list; 1118 1118 struct uprobe *uprobe, *u; ··· 1138 1138 list_del(&uprobe->pending_list); 1139 1139 vaddr = vma_address(vma, uprobe->offset); 1140 1140 1141 - if (vaddr >= vma->vm_start && vaddr < vma->vm_end) { 1141 + if (vaddr >= start && vaddr < end) { 1142 1142 /* 1143 1143 * An unregister could have removed the probe before 1144 1144 * unmap. So check before we decrement the count.
+3
mm/memory.c
··· 1307 1307 if (end <= vma->vm_start) 1308 1308 return; 1309 1309 1310 + if (vma->vm_file) 1311 + uprobe_munmap(vma, start, end); 1312 + 1310 1313 if (vma->vm_flags & VM_ACCOUNT) 1311 1314 *nr_accounted += (end - start) >> PAGE_SHIFT; 1312 1315
+4 -4
mm/mmap.c
··· 218 218 mutex_lock(&mapping->i_mmap_mutex); 219 219 __remove_shared_vm_struct(vma, file, mapping); 220 220 mutex_unlock(&mapping->i_mmap_mutex); 221 - uprobe_munmap(vma); 222 221 } 223 222 } 224 223 ··· 547 548 mapping = file->f_mapping; 548 549 if (!(vma->vm_flags & VM_NONLINEAR)) { 549 550 root = &mapping->i_mmap; 550 - uprobe_munmap(vma); 551 + uprobe_munmap(vma, vma->vm_start, vma->vm_end); 551 552 552 553 if (adjust_next) 553 - uprobe_munmap(next); 554 + uprobe_munmap(next, next->vm_start, 555 + next->vm_end); 554 556 } 555 557 556 558 mutex_lock(&mapping->i_mmap_mutex); ··· 632 632 633 633 if (remove_next) { 634 634 if (file) { 635 - uprobe_munmap(next); 635 + uprobe_munmap(next, next->vm_start, next->vm_end); 636 636 fput(file); 637 637 if (next->vm_flags & VM_EXECUTABLE) 638 638 removed_exe_file_vma(mm);