From patchwork Tue Jan 9 19:38:01 2024 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Elliot Berman X-Patchwork-Id: 13515290 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id 4D841C4725D for ; Tue, 9 Jan 2024 19:39:10 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id 71AC78D0013; Tue, 9 Jan 2024 14:38:16 -0500 (EST) Received: by kanga.kvack.org (Postfix, from userid 40) id 6C47D8D0010; Tue, 9 Jan 2024 14:38:16 -0500 (EST) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 4C9028D0013; Tue, 9 Jan 2024 14:38:16 -0500 (EST) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0013.hostedemail.com [216.40.44.13]) by kanga.kvack.org (Postfix) with ESMTP id 352158D0010 for ; Tue, 9 Jan 2024 14:38:16 -0500 (EST) Received: from smtpin15.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay10.hostedemail.com (Postfix) with ESMTP id 0EF1DC09B4 for ; Tue, 9 Jan 2024 19:38:16 +0000 (UTC) X-FDA: 81660783792.15.ED91F73 Received: from mx0b-0031df01.pphosted.com (mx0b-0031df01.pphosted.com [205.220.180.131]) by imf22.hostedemail.com (Postfix) with ESMTP id E30D0C0009 for ; Tue, 9 Jan 2024 19:38:13 +0000 (UTC) Authentication-Results: imf22.hostedemail.com; dkim=pass header.d=quicinc.com header.s=qcppdkim1 header.b=iM1PMyMP; spf=pass (imf22.hostedemail.com: domain of quic_eberman@quicinc.com designates 205.220.180.131 as permitted sender) smtp.mailfrom=quic_eberman@quicinc.com; dmarc=pass (policy=none) header.from=quicinc.com ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1704829094; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references:dkim-signature; bh=TnTPpzUDBWIYZLcHE8dxPtRIMqaLPUN2IK1SZ81q1vU=; b=bNr1zOI+DWfLUvZMmN5aAaOsmePAejk3JxfHZUu/YXqpiQBK047PwBSOOFEs/q8AsWT+GV DcyFT7hitsS27nTwBoNyaDt1cqR+fd6/4s1Bor9fsJv9tZ78Kb17+5teQMtMkWqQwhCS+y 4tdHm9csQmA+pXaoD8Mj4IJuaKB/55c= ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1704829094; a=rsa-sha256; cv=none; b=GGVwQqZ05BlEcTNG1T0OjckUH/cgzgX7j6jP5DUJs0/isdX+Dv2mgz6mPLrX1Zhf3P/Sdj 7u+orIH+giNNlGNb+IZ1CifJBkP0NLEjvxVv+8Ihr3bL+Qlg+yc/tzbf1ir5IXNIt+o0M7 jAdlpzK/jsVknz8mQI5dye7Sebu+L10= ARC-Authentication-Results: i=1; imf22.hostedemail.com; dkim=pass header.d=quicinc.com header.s=qcppdkim1 header.b=iM1PMyMP; spf=pass (imf22.hostedemail.com: domain of quic_eberman@quicinc.com designates 205.220.180.131 as permitted sender) smtp.mailfrom=quic_eberman@quicinc.com; dmarc=pass (policy=none) header.from=quicinc.com Received: from pps.filterd (m0279873.ppops.net [127.0.0.1]) by mx0a-0031df01.pphosted.com (8.17.1.24/8.17.1.24) with ESMTP id 409DRiAN016723; Tue, 9 Jan 2024 19:38:05 GMT DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=quicinc.com; h= from:date:subject:mime-version:content-type :content-transfer-encoding:message-id:references:in-reply-to:to :cc; s=qcppdkim1; bh=TnTPpzUDBWIYZLcHE8dxPtRIMqaLPUN2IK1SZ81q1vU =; b=iM1PMyMPrfgGq7LwkLipapCuBEO88AtOhN+uO8TjnDt5kr8v4PX+sZKPacv S2FVlPAu6WMphynffHZFe02BbDtNyxhIBkvvQBNYEqgLNLsF/BnkY04RI8+WE+MX IIFN1DOGAbDnI6QZgnJWZ5YKScN+GWB6j3g21qiiZaDm/hos6+62F7qXZmj9mE4u HSlyLdD/pgNXvmzZEa1H4kfgKfyXo7dwnPaLk6OWKRRtVoskH8vcoNavWCCO0y7K bYDcWzG5qjPECJDsCSSte8ubav2S3PLDkL4S2NhKU4QkHIwOJNpLrJymr8NL58fg ipODtktStLltwSX6v4wB+2nptdw== Received: from nasanppmta04.qualcomm.com (i-global254.qualcomm.com [199.106.103.254]) by mx0a-0031df01.pphosted.com (PPS) with ESMTPS id 3vh3g699py-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT); Tue, 09 Jan 2024 19:38:05 +0000 (GMT) Received: from nasanex01b.na.qualcomm.com (nasanex01b.na.qualcomm.com [10.46.141.250]) by NASANPPMTA04.qualcomm.com (8.17.1.5/8.17.1.5) with ESMTPS id 409Jc4qw030540 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT); Tue, 9 Jan 2024 19:38:04 GMT Received: from hu-eberman-lv.qualcomm.com (10.49.16.6) by nasanex01b.na.qualcomm.com (10.46.141.250) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.2.1118.40; Tue, 9 Jan 2024 11:38:03 -0800 From: Elliot Berman Date: Tue, 9 Jan 2024 11:38:01 -0800 Subject: [PATCH v16 23/34] virt: gunyah: guestmem: Initialize RM mem parcels from guestmem MIME-Version: 1.0 Message-ID: <20240109-gunyah-v16-23-634904bf4ce9@quicinc.com> References: <20240109-gunyah-v16-0-634904bf4ce9@quicinc.com> In-Reply-To: <20240109-gunyah-v16-0-634904bf4ce9@quicinc.com> To: Alex Elder , Srinivas Kandagatla , Murali Nalajal , Trilok Soni , Srivatsa Vaddagiri , Carl van Schaik , Philip Derrin , Prakruthi Deepak Heragu , Jonathan Corbet , Rob Herring , Krzysztof Kozlowski , Conor Dooley , Catalin Marinas , Will Deacon , Konrad Dybcio , Bjorn Andersson , Dmitry Baryshkov , "Fuad Tabba" , Sean Christopherson , "Andrew Morton" CC: , , , , , , Elliot Berman X-Mailer: b4 0.13-dev X-Originating-IP: [10.49.16.6] X-ClientProxiedBy: nalasex01c.na.qualcomm.com (10.47.97.35) To nasanex01b.na.qualcomm.com (10.46.141.250) X-QCInternal: smtphost X-Proofpoint-Virus-Version: vendor=nai engine=6200 definitions=5800 signatures=585085 X-Proofpoint-GUID: UAaQhQuWqi5aWtBsqKdQLmvnLI__Nw8A X-Proofpoint-ORIG-GUID: UAaQhQuWqi5aWtBsqKdQLmvnLI__Nw8A X-Proofpoint-Virus-Version: vendor=baseguard engine=ICAP:2.0.272,Aquarius:18.0.997,Hydra:6.0.619,FMLib:17.11.176.26 definitions=2023-12-09_02,2023-12-07_01,2023-05-22_02 X-Proofpoint-Spam-Details: rule=outbound_notspam policy=outbound score=0 priorityscore=1501 mlxscore=0 bulkscore=0 spamscore=0 malwarescore=0 phishscore=0 clxscore=1015 impostorscore=0 suspectscore=0 lowpriorityscore=0 mlxlogscore=999 adultscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.19.0-2311290000 definitions=main-2401090158 X-Rspamd-Queue-Id: E30D0C0009 X-Rspam-User: X-Rspamd-Server: rspam11 X-Stat-Signature: zn8qab4hef3jq5p6wrchz3zy8caqn8zh X-HE-Tag: 1704829093-702011 X-HE-Meta: U2FsdGVkX188Cii9fBhHN3v08trPsmnONXph3aLWbbcR2sBrD2A0AoaxDpeqC9GDwSkHa91x60rSmSay526gzi9mUDR78tWYcQ4fFAGWNnE1i1dZLjyXIpwEoj1fF8O5QB2mNk7hm9WABsaXvy8LG6rF91W30GWd3Yxp9xgdU7bEpBwiVOy77QftM/VWCWLvO8VdcJuL4UV0aATHrlU/zId9JmAwn4YyVV6lPsS6Ca53P+eQQWOR43zRFFuJE9mzV5b7+2KT5IPb3/JcHDNqkAGeGpEG5YQxfh6/NlOckM5Lah2TDpi3tXq1F4txb8Ax4Brxt1/w7xqwDkj9HloUIRRj5gBgns15iwhURq7qgZhyismIqwleo2zXSsMsoE/Xp5A72oRny/eSkYoA3vzjmGjTgVzO4t9wQdDNe5rOmIZinSy1fVMBGGMXPdHsnPjrlvzsF9OymJZjrtJt5JbCzPveU3YH+LWyBTiFmWTd56FfpfenuQ1RoLy8L35BTJi9V0zbdcIg7jobZMuDTqO6I+kFdgYZLEgOF2VyGoLcoBnJMaez2l8XgqgVz72pyg5AlzclA1Rq3pQpkp8wIpjjYuwlsdz5A28piRmQ94c6jUHIajWCa3vCrKiw9jdVw7a7sUlrh6w0t3iTQ/8PWMVIye8vdXDzavwOk1H4VwTH0ppnpHnPXgxfRyxzKL58uY82wN7fG48KNRRAEE2ZxEw6Y2YdAoY3p2La1LPULYcwsZXxLdhgS+Uih/MMkKQrKUcNYQ1Dw76MQtNeKPMoeMZoCVbUn2hoZ4U4OfyT7EpzidNoqNFmx3rPH4foVSEvlZcLA/mRV8bNB1cgYb6M3RqjE8p1bx2T47f50PQt96NEOrMhz95UnnB0/NGARoGpvF/kh8GYYomlXyqgscboxp5uMnNqgJkUv0MQUw+aeRH2mMWTPKxxnsKoZw2TX08hl29DTFNTUnY96pqNwatdMPo DSMgAGBS xvpz7CR40JPKg1a8gcFah1C9PuH+gQbNAcJv1B9xwYW1XFTgWJBUYyoJ4dqs3rtdQ/4u7I5GDL4hMENfC6UshURriHBBPb4VksgVBcmkC2gwZn/gYr4C8+XG9el7SUNbbcMLdWhiQjkzVI5Xs4mZC0hKzUI3wJJnOg7yHFzY/9s0h96RHo3tKJ1QG1Poo6wUGRY5uj9QkdxX21bqlQScwq66YrIG5zpR2SKDnDTS87QaeVLuQPyLEPMciBxaTw7TqvmbzEaP37UqLMoZjCkztkRAcQhwZZ1dv1DmFJv8ZAXcf9E9KMxkWlgHynCXNiw0dRNPMd2wxAEmg6v1YnmQ17xBxixrfPXfxSUsuMdsvg41c9rCSLS2kSsoCf6dDr7lqJLyNs1MC/7js9l3X+P//H1OA/5Eyq6ZdXqxB X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: List-Subscribe: List-Unsubscribe: Gunyah Resource Manager sets up a virtual machine based on a device tree which lives in guest memory. Resource manager requires this memory to be provided as a memory parcel for it to read and manipulate. Implement a function to construct a memory parcel from a guestmem binding. Signed-off-by: Elliot Berman --- drivers/virt/gunyah/guest_memfd.c | 190 ++++++++++++++++++++++++++++++++++++++ drivers/virt/gunyah/vm_mgr.h | 6 ++ 2 files changed, 196 insertions(+) diff --git a/drivers/virt/gunyah/guest_memfd.c b/drivers/virt/gunyah/guest_memfd.c index 71686f1946da..5eeac6ac451e 100644 --- a/drivers/virt/gunyah/guest_memfd.c +++ b/drivers/virt/gunyah/guest_memfd.c @@ -653,3 +653,193 @@ int gunyah_gmem_modify_mapping(struct gunyah_vm *ghvm, fput(file); return ret; } + +int gunyah_gmem_share_parcel(struct gunyah_vm *ghvm, struct gunyah_rm_mem_parcel *parcel, + u64 *gfn, u64 *nr) +{ + struct folio *folio, *prev_folio; + unsigned long nr_entries, i, j, start, end; + struct gunyah_gmem_binding *b; + bool lend; + int ret; + + parcel->mem_handle = GUNYAH_MEM_HANDLE_INVAL; + + if (!*nr) + return -EINVAL; + + + down_read(&ghvm->bindings_lock); + b = mtree_load(&ghvm->bindings, *gfn); + if (!b || *gfn > b->gfn + b->nr || *gfn < b->gfn) { + ret = -ENOENT; + goto unlock; + } + + /** + * Generally, indices can be based on gfn, guest_memfd offset, or + * offset into binding. start and end are based on offset into binding. + */ + start = *gfn - b->gfn; + + if (start + *nr > b->nr) { + ret = -ENOENT; + goto unlock; + } + + end = start + *nr; + lend = parcel->n_acl_entries == 1 || gunyah_guest_mem_is_lend(ghvm, b->flags); + + /** + * First, calculate the number of physically discontiguous regions + * the parcel covers. Each memory entry corresponds to one folio. + * In future, each memory entry could correspond to contiguous + * folios that are also adjacent in guest_memfd, but parcels + * are only being used for small amounts of memory for now, so + * this optimization is premature. + */ + nr_entries = 0; + prev_folio = NULL; + for (i = start + b->i_off; i < end + b->i_off;) { + folio = gunyah_gmem_get_folio(file_inode(b->file), i); /* A */ + if (!folio) { + ret = -ENOMEM; + goto out; + } + + nr_entries++; + i = folio_index(folio) + folio_nr_pages(folio); + } + end = i - b->i_off; + + parcel->mem_entries = + kcalloc(nr_entries, sizeof(*parcel->mem_entries), GFP_KERNEL); + if (!parcel->mem_entries) { + ret = -ENOMEM; + goto out; + } + + /** + * Walk through all the folios again, now filling the mem_entries array. + */ + j = 0; + prev_folio = NULL; + for (i = start + b->i_off; i < end + b->i_off; j++) { + folio = filemap_get_folio(file_inode(b->file)->i_mapping, i); /* B */ + if (WARN_ON(IS_ERR(folio))) { + ret = PTR_ERR(folio); + i = end + b->i_off; + goto out; + } + + if (lend) + folio_set_private(folio); + + parcel->mem_entries[j].size = cpu_to_le64(folio_size(folio)); + parcel->mem_entries[j].phys_addr = cpu_to_le64(PFN_PHYS(folio_pfn(folio))); + i = folio_index(folio) + folio_nr_pages(folio); + folio_put(folio); /* B */ + } + BUG_ON(j != nr_entries); + parcel->n_mem_entries = nr_entries; + + if (lend) + parcel->n_acl_entries = 1; + + parcel->acl_entries = kcalloc(parcel->n_acl_entries, + sizeof(*parcel->acl_entries), GFP_KERNEL); + if (!parcel->n_acl_entries) { + ret = -ENOMEM; + goto free_entries; + } + + parcel->acl_entries[0].vmid = cpu_to_le16(ghvm->vmid); + if (b->flags & GUNYAH_MEM_ALLOW_READ) + parcel->acl_entries[0].perms |= GUNYAH_RM_ACL_R; + if (b->flags & GUNYAH_MEM_ALLOW_WRITE) + parcel->acl_entries[0].perms |= GUNYAH_RM_ACL_W; + if (b->flags & GUNYAH_MEM_ALLOW_EXEC) + parcel->acl_entries[0].perms |= GUNYAH_RM_ACL_X; + + if (!lend) { + u16 host_vmid; + + ret = gunyah_rm_get_vmid(ghvm->rm, &host_vmid); + if (ret) + goto free_acl; + + parcel->acl_entries[1].vmid = cpu_to_le16(host_vmid); + parcel->acl_entries[1].perms = GUNYAH_RM_ACL_R | GUNYAH_RM_ACL_W | GUNYAH_RM_ACL_X; + } + + parcel->mem_handle = GUNYAH_MEM_HANDLE_INVAL; + folio = filemap_get_folio(file_inode(b->file)->i_mapping, start); /* C */ + *gfn = folio_index(folio) - b->i_off + b->gfn; + *nr = end - (folio_index(folio) - b->i_off); + folio_put(folio); /* C */ + + ret = gunyah_rm_mem_share(ghvm->rm, parcel); + goto out; +free_acl: + kfree(parcel->acl_entries); + parcel->acl_entries = NULL; +free_entries: + kfree(parcel->mem_entries); + parcel->mem_entries = NULL; + parcel->n_mem_entries = 0; +out: + /* unlock the folios */ + for (j = start + b->i_off; j < i;) { + folio = filemap_get_folio(file_inode(b->file)->i_mapping, j); /* D */ + if (IS_ERR(folio)) + continue; + j = folio_index(folio) + folio_nr_pages(folio); + folio_unlock(folio); /* A */ + folio_put(folio); /* D */ + if (ret) + folio_put(folio); /* A */ + /* matching folio_put for A is done at + * (1) gunyah_gmem_reclaim_parcel or + * (2) after gunyah_gmem_parcel_to_paged, gunyah_vm_reclaim_folio + */ + } +unlock: + up_read(&ghvm->bindings_lock); + return ret; +} + +int gunyah_gmem_reclaim_parcel(struct gunyah_vm *ghvm, + struct gunyah_rm_mem_parcel *parcel, u64 gfn, + u64 nr) +{ + struct gunyah_rm_mem_entry *entry; + struct folio *folio; + pgoff_t i; + int ret; + + if (parcel->mem_handle != GUNYAH_MEM_HANDLE_INVAL) { + ret = gunyah_rm_mem_reclaim(ghvm->rm, parcel); + if (ret) { + dev_err(ghvm->parent, "Failed to reclaim parcel: %d\n", + ret); + /* We can't reclaim the pages -- hold onto the pages + * forever because we don't know what state the memory + * is in + */ + return ret; + } + parcel->mem_handle = GUNYAH_MEM_HANDLE_INVAL; + + for (i = 0; i < parcel->n_mem_entries; i++) { + entry = &parcel->mem_entries[i]; + + folio = pfn_folio(PHYS_PFN(le64_to_cpu(entry->phys_addr))); + folio_put(folio); /* A */ + } + + kfree(parcel->mem_entries); + kfree(parcel->acl_entries); + } + + return 0; +} diff --git a/drivers/virt/gunyah/vm_mgr.h b/drivers/virt/gunyah/vm_mgr.h index 518d05eeb642..a79c11f1c3a5 100644 --- a/drivers/virt/gunyah/vm_mgr.h +++ b/drivers/virt/gunyah/vm_mgr.h @@ -119,5 +119,11 @@ int gunyah_gmem_modify_mapping(struct gunyah_vm *ghvm, struct gunyah_map_mem_args *args); struct gunyah_gmem_binding; void gunyah_gmem_remove_binding(struct gunyah_gmem_binding *binding); +int gunyah_gmem_share_parcel(struct gunyah_vm *ghvm, + struct gunyah_rm_mem_parcel *parcel, u64 *gfn, + u64 *nr); +int gunyah_gmem_reclaim_parcel(struct gunyah_vm *ghvm, + struct gunyah_rm_mem_parcel *parcel, u64 gfn, + u64 nr); #endif