Linux kernel mirror (for testing) git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git
kernel os linux

KVM: IOMMU: hva align mapping page size

When determining the page size we could use to map with the IOMMU, the
page size should also be aligned with the hva, not just the gfn. The
gfn may not reflect the real alignment within the hugetlbfs file.

Most of the time, this works fine. However, if the hugetlbfs file is
backed by non-contiguous huge pages, a multi-huge page memslot starts at
an unaligned offset within the hugetlbfs file, and the gfn is aligned
with respect to the huge page size, kvm_host_page_size() will return the
huge page size and we will use that to map with the IOMMU.

When we later unpin that same memslot, the IOMMU returns the unmap size
as the huge page size, and we happily unpin that many pfns in
monotonically increasing order, not realizing we are spanning
non-contiguous huge pages and partially unpin the wrong huge page.

Ensure the IOMMU mapping page size is aligned with the hva corresponding
to the gfn, which does reflect the alignment within the hugetlbfs file.

Reviewed-by: Marcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: Greg Edwards <gedwards@ddn.com>
Cc: stable@vger.kernel.org
Signed-off-by: Gleb Natapov <gleb@redhat.com>

authored by

Greg Edwards and committed by
Gleb Natapov
27ef63c7 a9d4e439

+4
+4
virt/kvm/iommu.c
··· 103 103 while ((gfn << PAGE_SHIFT) & (page_size - 1)) 104 104 page_size >>= 1; 105 105 106 + /* Make sure hva is aligned to the page size we want to map */ 107 + while (__gfn_to_hva_memslot(slot, gfn) & (page_size - 1)) 108 + page_size >>= 1; 109 + 106 110 /* 107 111 * Pin all pages we are about to map in memory. This is 108 112 * important because we unmap and unpin in 4kb steps later.