[03/25] x86/sgx: Support VMA permissions exceeding enclave permissions

=== Summary ===

An SGX VMA can only be created if its permissions are the same or
weaker than the Enclave Page Cache Map (EPCM) permissions. After VMA
creation this rule continues to be enforced by the page fault handler.

With SGX2 the EPCM permissions of a page can change after VMA
creation resulting in the VMA exceeding the EPCM permissions and the
page fault handler incorrectly blocking access.

Enable the VMA's pages to remain accessible while ensuring that
the page table entries are installed to match the EPCM permissions
without exceeding the VMA permissions.

=== Full Changelog ===

An SGX enclave is an area of memory where parts of an application
can reside. First an enclave is created and loaded (from
non-enclave memory) with the code and data of an application,
then user space can map (mmap()) the enclave memory to
be able to enter the enclave at its defined entry points for
execution within it.

The hardware maintains a secure structure, the Enclave Page Cache Map
(EPCM), that tracks the contents of the enclave. Of interest here is
its tracking of the enclave page permissions. When a page is loaded
into the enclave its permissions are specified and recorded in the
EPCM. In parallel the OS maintains the page table permissions and
the rule is that page table permissions are never allowed to exceed
EPCM permissions.

A new mapping (mmap()) of enclave memory can only succeed if the
mapping has the same or weaker permissions than the permissions that
were vetted during enclave creation. This is enforced by
sgx_encl_may_map() that is called on the mmap() as well as mprotect()
paths. This permission verification remains.

One feature of SGX2 is to support the modification of enclave page
permissions after enclave creation. Enclave pages may thus already be
mapped at the time their enclave permissions are changed resulting
in the VMA's permissions potentially exceeding the enclave page
permissions.

Enable permissions of existing VMAs to exceed enclave page permissions
in preparation for dynamic enclave page permission changes.
New VMAs that attempt to exceed enclave page permissions continue to be
unsupported.

Reasons why permissions of existing VMAs are allowed to exceed enclave
page permissions instead of dynamically changing VMA permissions when
enclave page permissions change are:
1) Changing VMA permissions involve splitting VMAs which is an operation
   that can fail. Additionally the actual changing of page permissions
   of a range of pages could also fail on any of the pages involved.
   Handling these error cases causes problems. For example, if an
   enclave page permission change fails and the VMA has already been
   split then it is not possible to undo the VMA split nor possible to
   undo the enclave page permission changes that did succeed before the
   failure.
2) The OS has little insight into the user space where EPCM permissions
   are controlled from. For example, a RW page may be made RO just
   before it is made RX and splitting the VMAs while the VMAs may change
   soon is unnecessary.

Remove the extra permission check called on a page fault
(vm_operations_struct->fault) or during debugging
(vm_operations_struct->access) when loading the enclave page from swap
that ensures that the VMA permissions do not exceed the enclave
permissions. Since a VMA could only exist if it passed the original
permission checks during mmap() and a VMA may indeed exceed the page
permissions this extra permission check is no longer appropriate.

With the permission check removed, ensure that page table entries do
not blindly inherit the VMA permissions but instead the permissions
that the VMA and enclave agree on. PTEs for writable pages (from VMA and
enclave perspective) are installed with the writable bit set, reducing
the need for this additional flow to the permission mismatch cases
handled next.

Signed-off-by: Reinette Chatre <reinette.chatre@intel.com>
---
 arch/x86/kernel/cpu/sgx/encl.c | 38 ++++++++++++++++++----------------
 1 file changed, 20 insertions(+), 18 deletions(-)

Message ID	7e622156315c9c22c3ef84a7c0aeb01b5c001ff9.1638381245.git.reinette.chatre@intel.com (mailing list archive)
State	New, archived
Headers	show Return-Path: <linux-sgx-owner@kernel.org> X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id 00EFAC4332F for <linux-sgx@archiver.kernel.org>; Wed, 1 Dec 2021 19:23:57 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1352696AbhLAT1Q (ORCPT <rfc822;linux-sgx@archiver.kernel.org>); Wed, 1 Dec 2021 14:27:16 -0500 Received: from mga04.intel.com ([192.55.52.120]:46904 "EHLO mga04.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S245195AbhLAT1I (ORCPT <rfc822;linux-sgx@vger.kernel.org>); Wed, 1 Dec 2021 14:27:08 -0500 X-IronPort-AV: E=McAfee;i="6200,9189,10185"; a="235267922" X-IronPort-AV: E=Sophos;i="5.87,279,1631602800"; d="scan'208";a="235267922" Received: from orsmga007.jf.intel.com ([10.7.209.58]) by fmsmga104.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 01 Dec 2021 11:23:42 -0800 X-IronPort-AV: E=Sophos;i="5.87,279,1631602800"; d="scan'208";a="500380443" Received: from rchatre-ws.ostc.intel.com ([10.54.69.144]) by orsmga007-auth.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 01 Dec 2021 11:23:42 -0800 From: Reinette Chatre <reinette.chatre@intel.com> To: dave.hansen@linux.intel.com, jarkko@kernel.org, tglx@linutronix.de, bp@alien8.de, luto@kernel.org, mingo@redhat.com, linux-sgx@vger.kernel.org, x86@kernel.org Cc: seanjc@google.com, kai.huang@intel.com, cathy.zhang@intel.com, cedric.xing@intel.com, haitao.huang@intel.com, mark.shanahan@intel.com, hpa@zytor.com, linux-kernel@vger.kernel.org Subject: [PATCH 03/25] x86/sgx: Support VMA permissions exceeding enclave permissions Date: Wed, 1 Dec 2021 11:23:01 -0800 Message-Id: <7e622156315c9c22c3ef84a7c0aeb01b5c001ff9.1638381245.git.reinette.chatre@intel.com> X-Mailer: git-send-email 2.25.1 In-Reply-To: <cover.1638381245.git.reinette.chatre@intel.com> References: <cover.1638381245.git.reinette.chatre@intel.com> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Precedence: bulk List-ID: <linux-sgx.vger.kernel.org> X-Mailing-List: linux-sgx@vger.kernel.org
Series	x86/sgx and selftests/sgx: Support SGX2 \| expand [00/25] x86/sgx and selftests/sgx: Support SGX2 [01/25] x86/sgx: Add shortlog descriptions to ENCLS wrappers [02/25] x86/sgx: Add wrappers for SGX2 functions [03/25] x86/sgx: Support VMA permissions exceeding enclave permissions [04/25] x86/sgx: Add pfn_mkwrite() handler for present PTEs [05/25] x86/sgx: Introduce runtime protection bits [06/25] x86/sgx: Use more generic name for enclave cpumask function [07/25] x86/sgx: Move PTE zap code to separate function [08/25] x86/sgx: Make SGX IPI callback available internally [09/25] x86/sgx: Keep record of SGX page type [10/25] x86/sgx: Support enclave page permission changes [11/25] selftests/sgx: Add test for EPCM permission changes [12/25] selftests/sgx: Add test for TCS page permission changes [13/25] x86/sgx: Support adding of pages to initialized enclave [14/25] x86/sgx: Tighten accessible memory range after enclave initialization [15/25] selftests/sgx: Test two different SGX2 EAUG flows [16/25] x86/sgx: Support modifying SGX page type [17/25] x86/sgx: Support complete page removal [18/25] selftests/sgx: Introduce dynamic entry point [19/25] selftests/sgx: Introduce TCS initialization enclave operation [20/25] selftests/sgx: Test complete changing of page type flow [21/25] selftests/sgx: Test faulty enclave behavior [22/25] selftests/sgx: Test invalid access to removed enclave page [23/25] selftests/sgx: Test reclaiming of untouched page [24/25] x86/sgx: Free up EPC pages directly to support large page ranges [25/25] selftests/sgx: Page removal stress test

[03/25] x86/sgx: Support VMA permissions exceeding enclave permissions

Commit Message

Comments

Patch