Message ID | 4d3cc24f06ae75d99449f1d505cbff9ba53819a9.1638381245.git.reinette.chatre@intel.com (mailing list archive) |
---|---|
State | New, archived |
Headers | show |
Series | x86/sgx and selftests/sgx: Support SGX2 | expand |
On Wed, Dec 01, 2021 at 11:23:22AM -0800, Reinette Chatre wrote: > The page reclaimer ensures availability of EPC pages across all > enclaves. In support of this it runs independently from the individual > enclaves in order to take locks from the different enclaves as it writes > pages to swap. > > When needing to load a page from swap an EPC page needs to be available for > its contents to be loaded into. Loading an existing enclave page from swap > does not reclaim EPC pages directly if none are available, instead the > reclaimer is woken when the available EPC pages are found to be below a > watermark. > > When iterating over a large number of pages in an oversubscribed > environment there is a race between the reclaimer woken up and EPC pages > reclaimed fast enough for the page operations to proceed. > > Instead of tuning the race between the page operations and the reclaimer > the page operations instead makes sure that there are EPC pages available. > > Signed-off-by: Reinette Chatre <reinette.chatre@intel.com> Why this needs to be part of this patch set? /Jarkko
Hi Jarkko, On 12/4/2021 3:47 PM, Jarkko Sakkinen wrote: > On Wed, Dec 01, 2021 at 11:23:22AM -0800, Reinette Chatre wrote: >> The page reclaimer ensures availability of EPC pages across all >> enclaves. In support of this it runs independently from the individual >> enclaves in order to take locks from the different enclaves as it writes >> pages to swap. >> >> When needing to load a page from swap an EPC page needs to be available for >> its contents to be loaded into. Loading an existing enclave page from swap >> does not reclaim EPC pages directly if none are available, instead the >> reclaimer is woken when the available EPC pages are found to be below a >> watermark. >> >> When iterating over a large number of pages in an oversubscribed >> environment there is a race between the reclaimer woken up and EPC pages >> reclaimed fast enough for the page operations to proceed. >> >> Instead of tuning the race between the page operations and the reclaimer >> the page operations instead makes sure that there are EPC pages available. >> >> Signed-off-by: Reinette Chatre <reinette.chatre@intel.com> > > Why this needs to be part of this patch set? When pages are modified they are required to be in the EPC and thus potentially need to be loaded from swap. When needing to modify a large number of pages in an oversubscribed environment there is a problem with the reclaimer providing free EPC pages fast enough for all the page modification operations to proceed. What that means is that if a user attempts to modify a large range of pages in an oversubscribed environment it is likely that the operation will fail to complete but instead it would result in partial success of as many pages as was on the free list. This is because the reclaimer may not run fast enough to free up sufficient EPC pages in a dynamic way. This becomes complicated for user space. It could increase the priority of the reclaimer but that has been found to be insufficient*. There would still not be a guarantee that after one page modification call fails enough pages would have been freed up in support of a second page modification call. With this change it would be ensured that when pages are being modified that there are sufficient EPC pages available to support the modifications. Reinette * The test that follows this patch was used to explore this scenario.
diff --git a/arch/x86/kernel/cpu/sgx/ioctl.c b/arch/x86/kernel/cpu/sgx/ioctl.c index d11da6c53b26..fc2737b3c7cc 100644 --- a/arch/x86/kernel/cpu/sgx/ioctl.c +++ b/arch/x86/kernel/cpu/sgx/ioctl.c @@ -739,6 +739,8 @@ static long sgx_page_modp(struct sgx_encl *encl, struct sgx_page_modp *modp) for (c = 0 ; c < modp->length; c += PAGE_SIZE) { addr = encl->base + modp->offset + c; + sgx_direct_reclaim(); + mutex_lock(&encl->lock); entry = sgx_encl_load_page(encl, addr); @@ -962,6 +964,8 @@ static long sgx_page_modt(struct sgx_encl *encl, struct sgx_page_modt *modt) for (c = 0 ; c < modt->length; c += PAGE_SIZE) { addr = encl->base + modt->offset + c; + sgx_direct_reclaim(); + mutex_lock(&encl->lock); entry = sgx_encl_load_page(encl, addr); @@ -1187,6 +1191,8 @@ static long sgx_page_remove(struct sgx_encl *encl, for (c = 0 ; c < params->length; c += PAGE_SIZE) { addr = encl->base + params->offset + c; + sgx_direct_reclaim(); + mutex_lock(&encl->lock); entry = sgx_encl_load_page(encl, addr); diff --git a/arch/x86/kernel/cpu/sgx/main.c b/arch/x86/kernel/cpu/sgx/main.c index 887648ce6084..bc3fc57f5f08 100644 --- a/arch/x86/kernel/cpu/sgx/main.c +++ b/arch/x86/kernel/cpu/sgx/main.c @@ -376,6 +376,12 @@ static bool sgx_should_reclaim(unsigned long watermark) !list_empty(&sgx_active_page_list); } +void sgx_direct_reclaim(void) +{ + if (sgx_should_reclaim(SGX_NR_LOW_PAGES)) + sgx_reclaim_pages(); +} + static int ksgxd(void *p) { set_freezable(); diff --git a/arch/x86/kernel/cpu/sgx/sgx.h b/arch/x86/kernel/cpu/sgx/sgx.h index ca89d625aa74..02af24acaacc 100644 --- a/arch/x86/kernel/cpu/sgx/sgx.h +++ b/arch/x86/kernel/cpu/sgx/sgx.h @@ -85,6 +85,7 @@ static inline void *sgx_get_epc_virt_addr(struct sgx_epc_page *page) struct sgx_epc_page *__sgx_alloc_epc_page(void); void sgx_free_epc_page(struct sgx_epc_page *page); +void sgx_direct_reclaim(void); void sgx_mark_page_reclaimable(struct sgx_epc_page *page); int sgx_unmark_page_reclaimable(struct sgx_epc_page *page); struct sgx_epc_page *sgx_alloc_epc_page(void *owner, bool reclaim);
The page reclaimer ensures availability of EPC pages across all enclaves. In support of this it runs independently from the individual enclaves in order to take locks from the different enclaves as it writes pages to swap. When needing to load a page from swap an EPC page needs to be available for its contents to be loaded into. Loading an existing enclave page from swap does not reclaim EPC pages directly if none are available, instead the reclaimer is woken when the available EPC pages are found to be below a watermark. When iterating over a large number of pages in an oversubscribed environment there is a race between the reclaimer woken up and EPC pages reclaimed fast enough for the page operations to proceed. Instead of tuning the race between the page operations and the reclaimer the page operations instead makes sure that there are EPC pages available. Signed-off-by: Reinette Chatre <reinette.chatre@intel.com> --- arch/x86/kernel/cpu/sgx/ioctl.c | 6 ++++++ arch/x86/kernel/cpu/sgx/main.c | 6 ++++++ arch/x86/kernel/cpu/sgx/sgx.h | 1 + 3 files changed, 13 insertions(+)