| HugeTLBpage on ARM64 |
| ==================== |
| |
| Hugepage relies on making efficient use of TLBs to improve performance of |
| address translations. The benefit depends on both - |
| |
| - the size of hugepages |
| - size of entries supported by the TLBs |
| |
| The ARM64 port supports two flavours of hugepages. |
| |
| 1) Block mappings at the pud/pmd level |
| -------------------------------------- |
| |
| These are regular hugepages where a pmd or a pud page table entry points to a |
| block of memory. Regardless of the supported size of entries in TLB, block |
| mappings reduce the depth of page table walk needed to translate hugepage |
| addresses. |
| |
| 2) Using the Contiguous bit |
| --------------------------- |
| |
| The architecture provides a contiguous bit in the translation table entries |
| (D4.5.3, ARM DDI 0487C.a) that hints to the MMU to indicate that it is one of a |
| contiguous set of entries that can be cached in a single TLB entry. |
| |
| The contiguous bit is used in Linux to increase the mapping size at the pmd and |
| pte (last) level. The number of supported contiguous entries varies by page size |
| and level of the page table. |
| |
| |
| The following hugepage sizes are supported - |
| |
| CONT PTE PMD CONT PMD PUD |
| -------- --- -------- --- |
| 4K: 64K 2M 32M 1G |
| 16K: 2M 32M 1G |
| 64K: 2M 512M 16G |